Text Series
General Chat API (Default Streaming)
- Unified chat API interface supporting all text generation models
- Select different AI models via the model parameter
- Compatible with OpenAI Chat Completions API format
POST
Authorizations
All API endpoints require Bearer Token authenticationGet your API Key:Visit the API Key Management Page to get your API KeyAdd it to the request header:
Body
Model nameSupported models include:
- OpenAI:
gpt-5,gpt-5.1,gpt-5-chat-latest,gpt-5-mini - Anthropic:
claude-opus-4-8,claude-opus-4-7,claude-opus-4-6,claude-sonnet-4-6,claude-opus-4-5-20251101 - Google:
gemini-3.5-flash,gemini-3.1-pro-preview,gemini-3-pro-preview,gemini-3-pro-preview-thinking,gemini-3-flash-preview,gemini-2.5-pro,gemini-2.5-flash,gemini-2.5-flash-lite - DeepSeek:
deepseek-v4-pro,deepseek-v4-flash,deepseek-v3.2,deepseek-v3.2-exp,deepseek-r1-250528,deepseek-v3-0324 - More models being added continuously…
List of conversation messagesMessage array. Each message contains Advanced usage:Add system prompt (to define AI behavior):Multi-turn conversation (with context):Role descriptions:
role and content fields.💡 Quick fill (Try it area):- Click ”+ Add an item” to add a message
- Enter
user(user message),assistant(AI response), orsystem(system prompt) forrole - Enter what you want to say in
content
user: User message (use this most of the time)system: System prompt to set AI behavior and roleassistant: AI’s previous responses, used for conversation context
Controls output randomness, range 0-2
- Lower values (e.g., 0.2) make output more deterministic
- Higher values (e.g., 1.8) make output more random
Maximum number of tokens to generateDifferent models have different maximum limits, please refer to specific model documentation
Whether to use streaming output
true: Streaming response (SSE format)false: Complete response at once
Nucleus sampling parameter, range 0-1Controls diversity of generated text, recommend using either this or temperatureDefault: 1.0
Frequency penalty, range -2.0 to 2.0Positive values reduce the likelihood of repeating the same wordsDefault: 0
Presence penalty, range -2.0 to 2.0Positive values increase the likelihood of talking about new topicsDefault: 0
Stop sequencesUp to 4 sequences where generation will stop when encountered
Number of completions to generateDefault: 1⚠️ Note: Must enter a plain number (e.g.,
1), do not use quotes or it will cause an errorResponse
Unique identifier for the response
Object type, fixed as
chat.completionCreation timestamp
The actual model name used
List of generated responses
Token usage statistics
Supported Models
OpenAI Series
gpt-5- GPT-5 base modelgpt-5.1- GPT-5.1 enhanced versiongpt-5-chat-latest- GPT-5 latest chat versiongpt-5-mini- GPT-5 lightweight version, cost-effective
Anthropic Series
claude-opus-4-8- Claude Opus 4.8 flagship modelclaude-opus-4-7- Claude Opus 4.7 flagship modelclaude-opus-4-6- Claude Opus 4.6 flagship modelclaude-sonnet-4-6- Claude Sonnet 4.6 balanced versionclaude-opus-4-5-20251101- Claude Opus 4.5 model
Google Series
gemini-3.5-flash- Gemini 3.5 fast versiongemini-3.1-pro-preview- Gemini 3.1 Pro preview versiongemini-3-pro-preview- Gemini 3 Pro preview versiongemini-3-pro-preview-thinking- Gemini 3 Pro deep thinking preview versiongemini-3-flash-preview- Gemini 3 Flash preview versiongemini-2.5-pro- Gemini 2.5 professional versiongemini-2.5-flash- Gemini 2.5 fast versiongemini-2.5-flash-lite- Gemini 2.5 ultra-lightweight version
DeepSeek Series
deepseek-v4-pro- DeepSeek V4 professional versiondeepseek-v4-flash- DeepSeek V4 fast versiondeepseek-v3.2- DeepSeek V3.2 standard versiondeepseek-v3.2-exp- DeepSeek V3.2 experimental versiondeepseek-r1-250528- DeepSeek R1 reasoning modeldeepseek-v3-0324- DeepSeek V3 standard version