Cerebras

Cerebras provides ultra-fast inference on their Wafer Scale Engine hardware. Uses an OpenAI-compatible API.

Installation

import "github.com/xraph/nexus/providers/cerebras"

provider := cerebras.New(os.Getenv("CEREBRAS_API_KEY"))

gw := nexus.New(
    nexus.WithProvider(provider),
)

Option	Description
`cerebras.WithBaseURL(url)`	Override the API base URL (default: `https://api.cerebras.ai/v1`)

Model	Context	Max Output	Input Price	Output Price
`llama3.1-8b`	8,192	4,096	$0.10/M	$0.10/M
`llama3.1-70b`	8,192	4,096	$0.60/M	$0.60/M