TTS Text-to-Speech

curl --request POST \
  --url https://api.apimart.ai/v1/audio/speech \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
    "model": "gpt-4o-mini-tts",
    "input": "The quick brown fox jumps over the lazy dog.",
    "voice": "alloy",
    "response_format": "opus",
    "speed": 1.0
  }' \
  --output speech.opus

Binary audio data stream

POST

audio

speech

curl --request POST \
  --url https://api.apimart.ai/v1/audio/speech \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
    "model": "gpt-4o-mini-tts",
    "input": "The quick brown fox jumps over the lazy dog.",
    "voice": "alloy",
    "response_format": "opus",
    "speed": 1.0
  }' \
  --output speech.opus

Binary audio data stream

curl --request POST \
  --url https://api.apimart.ai/v1/audio/speech \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
    "model": "gpt-4o-mini-tts",
    "input": "The quick brown fox jumps over the lazy dog.",
    "voice": "alloy",
    "response_format": "opus",
    "speed": 1.0
  }' \
  --output speech.opus

Binary audio data stream

Authorizations

Authorization

string

required

All APIs require Bearer Token authenticationGet API Key:Visit API Key Management Page to get your API KeyAdd to request header:

Authorization: Bearer YOUR_API_KEY

Body

model

string

default:"gpt-4o-mini-tts"

required

TTS model nameAvailable models:

gpt-4o-mini-tts - GPT-4o Mini TTS model

Example: "gpt-4o-mini-tts"

input

string

required

The text to convert to speechMaximum length: 4096 charactersExample: "The quick brown fox jumps over the lazy dog."

voice

string

required

Voice selectionAvailable voices:

alloy - Neutral, balanced voice
echo - Male, calm voice
fable - British, narrative voice
onyx - Male, deep voice
nova - Female, energetic voice
shimmer - Female, gentle voice

Example: "alloy"

response_format

string

default:"wav"

required

Audio output formatSupported formats:

wav - WAV format, uncompressed (default)
opus - Opus format, for internet streaming
aac - AAC format
flac - FLAC format, lossless compression
pcm - PCM format, raw audio data

Example: "wav"

speed

number

default:"1.0"

Speech playback speedRange: 0.25 to 4.0

0.25 - Slowest speed (1/4x)
1.0 - Normal speed (default)
4.0 - Fastest speed (4x)

Example: 1.0

Response

Returns binary audio data stream on success, which can be saved as an audio file or played directly. Returns JSON formatted error information on error, including error code, message, and type.

Whisper-1 Audio Transcription Get Task Status

Overview

Text Series

Image Series

Video Series

Audio Series

Task Management

Account Management

TTS Text-to-Speech

Authorizations

Body

Response

Overview

Text Series

Image Series

Video Series

Audio Series

Task Management

Account Management

​Authorizations

​Body

​Response

Authorizations

Body

Response