Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Headers
Bearer authentication header of the form Bearer <API_KEY>, where <API_KEY> is your auth token.
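As a minimal sketch of the header described above, the helper below builds it in Python. The function name build_auth_headers is hypothetical, and the header field name Authorization is an assumption based on the usual Bearer convention; this page does not name the field itself.

```python
def build_auth_headers(token: str) -> dict:
    """Return HTTP headers carrying the token in the form `Bearer <token>`."""
    token = token.strip()
    if not token:
        raise ValueError("token must be a non-empty string")
    # Assumption: the token travels in the conventional "Authorization" header.
    return {"Authorization": f"Bearer {token}"}
```

The returned dictionary can be passed as the headers argument of whichever HTTP client you use.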
Body
The name of the model to use.
"deepseek-ai/DeepSeek-R1"
The prompt to generate completions for. It can be a single string or an array of strings. It can also be an array of integers or an array of integer arrays, which lets you pass an already tokenized prompt. If multiple prompts are specified, several choices, each with a corresponding index, are returned in the output.
"The sky is"
What sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.
We generally recommend altering this or top_p but not both.
0 <= x <= 2
1
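To illustrate why higher temperatures give more random output, here is a small sketch of temperature-scaled softmax, the textbook way a sampling temperature is applied to model scores. It is an assumption that the service works exactly this way; the function name and the example logits are hypothetical, and treating a temperature of 0 as "always pick the top token" is a common convention rather than something this page states.

```python
import math

def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities, dividing by the temperature first."""
    if not 0 <= temperature <= 2:
        raise ValueError("temperature must satisfy 0 <= x <= 2")
    if temperature == 0:
        # Assumption: temperature 0 means greedy choice of the top-scoring token.
        best = max(range(len(logits)), key=lambda i: logits[i])
        return [1.0 if i == best else 0.0 for i in range(len(logits))]
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

focused = softmax_with_temperature([2.0, 1.0, 0.0], 0.2)  # mass piles onto the top token
random_ = softmax_with_temperature([2.0, 1.0, 0.0], 2.0)  # mass spreads across tokens
```

With the same scores, the low-temperature distribution puts almost all probability on the first token, while the high-temperature one is much flatter.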
The maximum number of tokens to generate in the completion.
If the token count of your prompt (previous messages) plus max_tokens exceeds the model's context length, the behavior depends on context_length_exceeded_behavior. By default, max_tokens will be lowered to fit in the context window instead of returning an error.
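The arithmetic behind the default can be sketched as follows. This is an illustration only: the function name and the lower_to_fit flag are hypothetical stand-ins for context_length_exceeded_behavior (whose accepted values are not listed here), and raising an error when the prompt alone fills the context window is an assumption.

```python
def effective_max_tokens(prompt_tokens, max_tokens, context_length, lower_to_fit=True):
    """Return the number of completion tokens that fit in the context window."""
    if prompt_tokens + max_tokens <= context_length:
        return max_tokens  # everything fits, nothing to adjust
    if not lower_to_fit:
        # Stand-in for the non-default behavior: report an error instead.
        raise ValueError("prompt plus max_tokens exceeds the context length")
    available = context_length - prompt_tokens
    if available <= 0:
        # Assumption: no room at all for a completion is treated as an error.
        raise ValueError("prompt alone fills the context window")
    return available  # default: lower max_tokens to what still fits
```

For example, with a hypothetical context length of 200 tokens and a 180-token prompt, a request for 50 tokens would be lowered to 20.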
Whether to stream back partial progress. If set, tokens will be sent as data-only server-sent events.
true
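A data-only server-sent event stream is a sequence of "data:" lines, with a blank line ending each event. The sketch below parses such a stream from any iterable of text lines. The function names are hypothetical, and two details are assumptions not stated on this page: that each event payload is a JSON object, and that the stream ends with a "[DONE]" marker, as several completion APIs do.

```python
import json

def iter_sse_data(lines):
    """Yield the data payload of each server-sent event in `lines`."""
    buffer = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if buffer:
                yield "\n".join(buffer)
                buffer = []
            continue
        if line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        field, _, value = line.partition(":")
        if field == "data":
            # A single leading space after the colon is not part of the value.
            buffer.append(value[1:] if value.startswith(" ") else value)
    if buffer:
        # More lenient than the SSE specification: flush an unterminated event.
        yield "\n".join(buffer)

def iter_json_events(lines, done_marker="[DONE]"):
    """Decode each payload as JSON, stopping at the assumed end-of-stream marker."""
    for payload in iter_sse_data(lines):
        if payload == done_marker:
            return
        yield json.loads(payload)
```

In practice, lines would come from a streaming HTTP response; any iterable of strings works, which keeps the parser easy to test offline.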
Response
OK