# Lepton AI

Use Lepton AI's serverless GPU inference with Nexus.

Lepton AI provides serverless GPU inference for open-source models. It exposes an OpenAI-compatible API.
```go
import (
	"os"

	"github.com/xraph/nexus"
	"github.com/xraph/nexus/providers/lepton"
)

provider := lepton.New(os.Getenv("LEPTON_API_KEY"))

gw := nexus.New(
	nexus.WithProvider(provider),
)
```
| Option | Description |
|---|---|
| `lepton.WithBaseURL(url)` | Override the API base URL (default: `https://api.lepton.ai/v1`) |
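As a sketch of the option above: `WithBaseURL` can point the provider at an alternative endpoint, such as a proxy in front of Lepton. This assumes `lepton.New` accepts functional options after the API key (the signature is not shown in this page); the proxy URL is illustrative.

```go
package main

import (
	"os"

	"github.com/xraph/nexus"
	"github.com/xraph/nexus/providers/lepton"
)

func main() {
	// Assumed: lepton.New takes the API key plus optional functional options.
	// The base URL below is a hypothetical proxy, not a real endpoint.
	provider := lepton.New(
		os.Getenv("LEPTON_API_KEY"),
		lepton.WithBaseURL("https://lepton-proxy.internal.example.com/v1"),
	)

	gw := nexus.New(
		nexus.WithProvider(provider),
	)
	_ = gw // use gw to serve or route requests as elsewhere in the Nexus docs
}
```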
| Capability | Supported |
|---|---|
| Chat | Yes |
| Streaming | Yes |
| Embeddings | No |
| Vision | No |
| Tools | No |
| Thinking | No |
| Model | Context | Max Output | Input Price | Output Price |
|---|---|---|---|---|
| `llama3.1-8b` | 131K | 4,096 | $0.07/M | $0.07/M |
| `llama3.1-70b` | 131K | 4,096 | $0.80/M | $0.80/M |