# Chat Completions

> POST /v1/chat/completions: create a compressed chat completion.

## Endpoint

```
POST https://www.opencompress.ai/api/v1/chat/completions
```

## Headers

| Header          | Required | Description                   |
| --------------- | -------- | ----------------------------- |
| `Authorization` | Yes      | `Bearer sk-occ-your-key-here` |
| `Content-Type`  | Yes      | `application/json`            |

## Request body

| Parameter     | Description |
| ------------- | ----------- |
| `model`       | Model identifier. See [Supported Models](/features/supported-models) for the full list. Examples: `gpt-4o`, `claude-sonnet-4-6`, `gemini-2.5-pro` |
| `messages`    | Array of message objects. Each message has a `role` and `content`. Supported roles: `system`, `user`, `assistant` |
| `stream`      | If `true`, the response is returned as a stream of server-sent events. |
| `temperature` | Sampling temperature (0-2). Passed through to the model. |
| `max_tokens`  | Maximum number of tokens to generate. Passed through to the model. |
| `top_p`       | Nucleus sampling parameter. Passed through to the model. |
| `stop`        | Stop sequences. Passed through to the model. |

All standard OpenAI parameters (`frequency_penalty`, `presence_penalty`, `logprobs`, `tools`, `tool_choice`, etc.) are passed through to the upstream model.

## Response

| Field                           | Description |
| ------------------------------- | ----------- |
| `id`                            | Unique identifier for this completion. |
| `object`                        | Always `"chat.completion"`. |
| `created`                       | Unix timestamp of when the completion was created. |
| `model`                         | The model used, matching your request. |
| `choices`                       | Array of completion choices. |
| `choices[].index`               | Index of this choice. |
| `choices[].message`             | The assistant's message with `role` and `content`. |
| `choices[].finish_reason`       | Why generation stopped: `"stop"`, `"length"`, or `"tool_calls"`. |
| `usage`                         | Token usage statistics. |
| `usage.prompt_tokens`           | Number of tokens in the compressed prompt. |
| `usage.completion_tokens`       | Number of tokens in the response. |
| `usage.total_tokens`            | Total tokens (prompt + completion). |
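As a rough illustration of the response fields described above, the sketch below reads them from an already-parsed JSON body. The `sample_response` values and the `summarize_completion` helper are hypothetical and exist only for this example; the field names assume the OpenAI-compatible layout that this page describes.

```python
from typing import Any

# Hypothetical response body shaped like the fields described above.
# Every value here is made up for illustration.
sample_response: dict[str, Any] = {
    "id": "chatcmpl-example",
    "object": "chat.completion",
    "created": 1700000000,
    "model": "gpt-4o",
    "choices": [
        {
            "index": 0,
            "message": {"role": "assistant", "content": "JWTs are signed tokens."},
            "finish_reason": "stop",
        }
    ],
    "usage": {"prompt_tokens": 12, "completion_tokens": 6, "total_tokens": 18},
}


def summarize_completion(body: dict[str, Any]) -> dict[str, Any]:
    """Pull the commonly used fields out of a chat completion body."""
    choice = body["choices"][0]  # first (and usually only) choice
    usage = body["usage"]
    return {
        "text": choice["message"]["content"],
        # "length" means generation stopped at the max_tokens limit.
        "truncated": choice["finish_reason"] == "length",
        "prompt_tokens": usage["prompt_tokens"],
        "completion_tokens": usage["completion_tokens"],
        "total_tokens": usage["total_tokens"],
    }


summary = summarize_completion(sample_response)
print(summary["text"])
```

Checking `finish_reason` before using the text is a cheap way to notice output that was cut off by `max_tokens`.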

## Example

```python
from openai import OpenAI

client = OpenAI(
    base_url="https://www.opencompress.ai/api/v1",
    api_key="sk-occ-your-key-here",
)

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {"role": "system", "content": "You are a concise technical writer."},
        {"role": "user", "content": "Explain how JWT authentication works."},
    ],
    temperature=0.7,
    max_tokens=500,
)

print(response.choices[0].message.content)
```

```bash
curl https://www.opencompress.ai/api/v1/chat/completions \
  -H "Authorization: Bearer sk-occ-your-key-here" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o",
    "messages": [
      {"role": "system", "content": "You are a concise technical writer."},
      {"role": "user", "content": "Explain how JWT authentication works."}
    ],
    "temperature": 0.7,
    "max_tokens": 500
  }'
```
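
When `stream` is `true`, the response arrives as server-sent events rather than a single JSON body. This page does not specify the event payloads, so the sketch below assumes the OpenAI-style convention: each event is a `data:` line carrying a JSON chunk with `choices[].delta.content`, and the stream ends with `data: [DONE]`. The `iter_stream_text` helper and the `sample_lines` data are hypothetical, and the parser runs on in-memory lines so it can be tried without a network call.

```python
import json
from typing import Iterable, Iterator


def iter_stream_text(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from OpenAI-style SSE lines (assumed format)."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators, comments, and other SSE fields
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":  # assumed end-of-stream sentinel
            break
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            text = choice.get("delta", {}).get("content")
            if text:  # the first chunk often carries only the role
                yield text


# Hypothetical stream, made up for illustration.
sample_lines = [
    'data: {"choices": [{"index": 0, "delta": {"role": "assistant"}}]}',
    "",
    'data: {"choices": [{"index": 0, "delta": {"content": "Hello"}}]}',
    "",
    'data: {"choices": [{"index": 0, "delta": {"content": ", world"}}]}',
    "",
    "data: [DONE]",
]

print("".join(iter_stream_text(sample_lines)))
```

With the `openai` Python client, passing `stream=True` to `client.chat.completions.create` returns an iterator of already-parsed chunks, so manual parsing like this is mainly useful when calling the endpoint with a plain HTTP client.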