Generate audio from input text
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
The name of the model to query.
See all of Together AI's chat models
cartesia/sonic "cartesia/sonic"
Input text to generate the audio for
The voice to use for generating the audio. View all supported voices here.
laidback woman, polite man, storyteller lady, friendly sidekick The format of audio output
mp3, wav, raw Language of input text
en, de, fr, es, hi, it, ja, ko, nl, pl, pt, ru, sv, tr, zh Audio encoding of response
pcm_f32le, pcm_s16le, pcm_mulaw, pcm_alaw Sampling rate to use for the output audio
If true, output is streamed for several characters at a time instead of waiting for the full response. The stream terminates with data: [DONE]. If false, return the encoded audio as octet stream
OK
The response is of type file.