Generate text responses using large language models.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
The language model to use.
"openai/gpt-oss-120b"
Conversation messages in chat format.
Maximum number of tokens to generate.
Controls randomness in output.
Nucleus sampling parameter.
Whether to stream responses incrementally.
Chat completion generated successfully