Nexus

Lepton AI

Use Lepton AI's serverless GPU inference with Nexus.

Lepton AI provides serverless GPU inference for open-source models. Uses an OpenAI-compatible API.

Installation

import "github.com/xraph/nexus/providers/lepton"

Quick Start

provider := lepton.New(os.Getenv("LEPTON_API_KEY"))

gw := nexus.New(
    nexus.WithProvider(provider),
)

Options

OptionDescription
lepton.WithBaseURL(url)Override the API base URL (default: https://api.lepton.ai/v1)

Capabilities

CapabilitySupported
ChatYes
StreamingYes
EmbeddingsNo
VisionNo
ToolsNo
ThinkingNo

Models

ModelContextMax OutputInput PriceOutput Price
llama3.1-8b131K4,096$0.07/M$0.07/M
llama3.1-70b131K4,096$0.80/M$0.80/M

On this page