/chat/completions
Create model responses for chat conversations. This is the primary endpoint for interactive AI applications.
POST
/v1/chat/completionsCreates a model response for the given chat conversation.
Request Body
modelID of the model to use (required)messagesList of messages comprising the conversation (required)temperatureSampling temperature 0-2 (default: 1). Higher values = more randommax_tokensMaximum tokens to generatestreamStream response using server-sent events (default: false)Request Body
{
"model": "your-model",
"messages": [
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "Hello!"}
],
"temperature": 0.7,
"max_tokens": 1000,
"stream": false
}Response
Returns a chat completion object with choices, usage metrics, and finish reason.
Response
{
"id": "chatcmpl-abc123",
"object": "chat.completion",
"created": 1677858242,
"model": "your-model",
"choices": [
{
"index": 0,
"message": {
"role": "assistant",
"content": "Hello! How can I help you today?"
},
"finish_reason": "stop"
}
],
"usage": {
"prompt_tokens": 20,
"completion_tokens": 12,
"total_tokens": 32
}
}