Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
One of the available TTS models: tts-1, tts-1-hd or gpt-4o-mini-tts.
The text to generate audio for. The maximum length is 4096 characters.
4096The voice to use when generating the audio. Supported voices are alloy, ash, ballad, coral, echo, fable, onyx, nova, sage, shimmer, and verse. Previews of the voices are available in the Text to speech guide.
"ash"
Control the voice of your generated audio with additional instructions. Does not work with tts-1 or tts-1-hd.
4096The format to audio in. Supported formats are mp3, opus, aac, flac, wav, and pcm.
mp3, opus, aac, flac, wav, pcm The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.
0.25 <= x <= 4Response
OK
The response is of type file.