Body
Model name
Text for the model to generate a response from
Used for fill-in-the-middle models, text that appears after the user prompt and before the model response
Base64-encoded images for models that support image input
Structured output format for the model to generate a response from. Supports either the string "json" or a JSON schema object.
System prompt for the model to generate a response from
When true, returns a stream of partial responses
When true, returns separate thinking output in addition to content
When true, returns the raw response from the model without any prompt templating
Model keep-alive duration (for example 5m or 0 to unload immediately)
Runtime options that control text generation
Response
Generation responses
Model name
ISO 8601 timestamp of response creation
The model's generated text response
The model's generated thinking output
Indicates whether generation has finished
Reason the generation stopped
Time spent generating the response in nanoseconds
Time spent loading the model in nanoseconds
Number of input tokens in the prompt
Time spent evaluating the prompt in nanoseconds
Number of output tokens generated in the response
Time spent generating tokens in nanoseconds

