Overview
Cerebras provides an OpenAI-compatible API athttps://api.cerebras.ai/v1 for fast inference. Some OpenAI parameters are not supported: frequency_penalty, logit_bias, presence_penalty, parallel_tool_calls, and service_tier.
Model Class: CerebrasModel
Authentication
Examples
Model Settings
You can set model parameters in two ways: on the model or on the Agent. On the model:Parameters
| Parameter | Type | Description | Default | Source |
|---|---|---|---|---|
max_tokens | int | Maximum tokens to generate | Model default | Base |
temperature | float | Sampling temperature (0.0-2.0) | 1.0 | Base |
top_p | float | Nucleus sampling | 1.0 | Base |
seed | int | Random seed | None | Base |
stop_sequences | list[str] | Stop sequences | None | Base |
timeout | float | Request timeout (seconds) | 600 | Base |
presence_penalty, frequency_penalty, logit_bias, parallel_tool_calls, service_tier. Omit these in settings when using the Cerebras provider.
