AI Infra

ClawRouter

by BlockRunAI

LLM router for autonomous agents — smart routing, wallet-based authentication, and USDC micropayments for AI API calls

An agent-native LLM router that enables autonomous AI agents to access 55+ models without API keys or credit cards. It performs local, smart routing across models based on 15 dimensions to reduce AI API costs by up to 92%. Agents authenticate via wallet signatures and pay per-request with USDC via the x402 protocol, supporting Base and Solana networks. It integrates as an OpenClaw plugin or runs as a standalone OpenAI-compatible proxy.

View on GitHub ↗

Key features

100% local routing with sub-1ms latency for all LLM requests
Smart routing across 55+ models based on 15 dimensions
Agent-native authentication using wallet signatures, no API keys
USDC micropayments via x402 protocol on Base and Solana
Integrated image and video generation with chat slash commands

Languages

TypeScript86%Shell8%JavaScript5%PowerShell1%Python1%

Top contributors

+101 stars since added

Started at 6.5k ★ when added to RepoDepot.

Topics

aiai-agentsanthropiccost-optimizationdeepseekgeminillmllm-routermicropaymentsopenaiopenclawsmart-routingstablecoin-paymentstablecoin-paymentsusdcx402

README

View on GitHub ↗

The LLM router built for autonomous agents

Agents can't sign up for accounts. Agents can't enter credit cards.
Agents can only sign transactions.

ClawRouter is the only LLM router that lets agents operate independently.

10 models free, no crypto required. No signup. No API key. No credit card.

ClawRouter is an open-source smart LLM router that reduces AI API costs by up to 92%. It analyzes each request across 15 dimensions and routes to the cheapest capable model in under 1ms, entirely locally. ClawRouter is the only LLM router built for autonomous AI agents — it uses wallet signatures for authentication (no API keys) and USDC micropayments via the x402 protocol (no credit cards). 55+ models from OpenAI, Anthropic, Google, xAI, DeepSeek, and more. MIT licensed.

Why ClawRouter exists

Every other LLM router was built for human developers — create an account, get an API key, pick a model from a dashboard, pay with a credit card.

Agents can't do any of that.

ClawRouter is built for the agent-first world:

Starts at $0 — 10 NVIDIA models are free forever (incl. 1M-context DeepSeek V4 + a vision-capable Nemotron Omni)
No accounts — a wallet is generated locally, no signup
No API keys — your wallet signature IS authentication
No model selection — 15-dimension scoring picks the right model automatically
No credit cards — agents pay per-request with USDC via x402
No trust required — runs locally, <1ms routing, zero external dependencies

This is the stack that lets agents operate autonomously: x402 + USDC + local routing.

How it compares

	OpenRouter	LiteLLM	Martian	Portkey	ClawRouter
Models	200+	100+	Smart routing	Gateway	55+
Free tier	Rate-limited	BYO keys	No	No	8 models, no signup
Routing	Manual selection	Manual selection	Smart (closed)	Observability	Smart (open source)
Auth	Account + API key	Your API keys	Account + API key	Account + API key	Wallet signature
Payment	Credit card	BYO keys	Credit card	$49-499/mo	USDC per-request
Runs locally	No	Yes	No	No	Yes
Open source	No	Yes	No	Partial	Yes
Agent-ready	No	No	No	No	Yes

✓ Open source · ✓ Smart routing · ✓ Runs locally · ✓ Crypto native · ✓ Agent ready

We're the only one that checks all five boxes.

Quick Start

No wallet? 10 models work free out of the box. Install, run, and pin nvidia/gpt-oss-120b (or any of the 10) — no crypto, no signup, no balance required. Add USDC later when you want paid models.

Option A — OpenClaw Agent

OpenClaw is an AI coding agent. If you're using it, ClawRouter installs as a plugin. Two paths:

A1. Recommended — one-shot install script:

curl -fsSL https://blockrun.ai/ClawRouter-update | bash
openclaw gateway restart

This handles everything: registration, models config, auth profile, wallet setup. Smart routing (blockrun/auto) is now your default model.

A2. If you prefer pure npm:

npm install -g @blockrun/clawrouter
clawrouter setup            # finishes OpenClaw integration — REQUIRED
openclaw gateway restart

⚠️ Skipping clawrouter setup will leave you broken. Bare npm install -g only puts the package on disk; it does NOT register the plugin with OpenClaw, sync the models allowlist, or write the auth profile. Symptom: /models in your bot shows only ~7 entries (OpenClaw's hardcoded defaults) instead of the full ~38 BlockRun models. Run clawrouter setup to repair, or use path A1 to begin with.

Option B — Standalone (continue.dev, Cursor, VS Code, any OpenAI-compatible client)

Using Claude Code? Check out BRCC — it's purpose-built for Claude Code with the same smart routing and x402 payments.

No OpenClaw required. ClawRouter runs as a local proxy on port 8402.

1. Start the proxy

npx @blockrun/clawrouter

2. Fund your wallet — optional, skip for free tier Your wallet address is printed on first run. For paid models, send a few USDC on Base or Solana — $5 covers thousands of requests. To stay at $0, pin any of the 10 free models (e.g. nvidia/gpt-oss-120b) or use /model free inside OpenClaw.

3. Point your client at http://localhost:8402

continue.dev — ~/.continue/config.yaml

Important: apiBase must end with /v1/ (including the trailing slash). Without it, continue.dev constructs the URL as /chat/completions instead of /v1/chat/completions, and the proxy returns 404.

models:
  - name: ClawRouter Auto
    provider: openai
    model: blockrun/auto
    apiBase: http://localhost:8402/v1/
    apiKey: x402
    roles:
      - chat
      - edit
      - apply

To pin a specific model, replace blockrun/auto with any model from blockrun.ai/models, e.g. anthropic/claude-opus-4.6, xai/grok-4-0709.

Both provider: openai and provider: clawrouter work — just make sure apiBase ends with /v1/.

Legacy JSON format (~/.continue/config.json)

{
  "models": [
    {
      "title": "ClawRouter Auto",
      "provider": "openai",
      "model": "blockrun/auto",
      "apiBase": "http://localhost:8402/v1/",
      "apiKey": "x402"
    }
  ]
}

Cursor — Settings → Models → OpenAI-compatible

Set base URL to http://localhost:8402, API key to x402, model to blockrun/auto.

Any OpenAI SDK

from openai import OpenAI
client = OpenAI(base_url="http://localhost:8402", api_key="x402")
response = client.chat.completions.create(model="blockrun/auto", messages=[...])

Routing Profiles

Choose your routing strategy with /model <profile>:

Profile	Strategy	Savings	Best For
`/model free`	Free NVIDIA models	100%	$0 balance, learning
`/model auto`	Balanced (default)	74-100%	General use
`/model eco`	Cheapest possible	95-100%	Maximum savings
`/model premium`	Best quality	0%	Mission-critical

Shortcuts: /model grok, /model br-sonnet, /model gpt5, /model o3

How It Works

100% local routing. <1ms latency. Zero external API calls.

Request → Weighted Scorer (15 dimensions) → Tier → Best Model → Response

Tier	ECO Model	AUTO Model	PREMIUM Model
SIMPLE	nvidia/gpt-oss-120b (FREE)	gemini-2.5-flash ($0.30/$2.50)	kimi-k2.6
MEDIUM	gemini-3.1-flash-lite ($0.25/$1.50)	kimi-k2.5 ($0.60/$3.00)	gpt-5.3-codex ($1.75/$14.00)
COMPLEX	gemini-3.1-flash-lite ($0.25/$1.50)	gemini-3.1-pro ($2/$12)	claude-opus-4.6 ($5/$25)
REASONING	grok-4-1-fast ($0.20/$0.50)	grok-4-1-fast-reasoning ($0.20/$0.50)	claude-sonnet-4.6 ($3/$15)

Blended average: $2.05/M vs $25/M for Claude Opus = 92% savings

Image Generation

Generate images directly from chat with /cr-imagegen:

/cr-imagegen a dog dancing on the beach
/cr-imagegen --model dall-e-3 a futuristic city at sunset
/cr-imagegen --model banana-pro --size 2048x2048 mountain landscape

The slash command is /cr-imagegen to avoid colliding with Telegram channel commands. Typing /imagegen in chat still works for backward compatibility.

Model	Provider	Price	Max Size
`nano-banana`	Google Gemini Flash	$0.05/image	1024x1024
`banana-pro`	Google Gemini Pro	$0.10/image	4096x4096
`dall-e-3`	OpenAI DALL-E 3	$0.04/image	1792x1024
`gpt-image`	OpenAI GPT Image 1	$0.02/image	1536x1024
`flux`	Black Forest Flux 1.1	$0.04/image	1024x1024
`xai/grok-imagine-image`	xAI Grok Imagine	$0.02/image	1024x1024
`xai/grok-imagine-image-pro`	xAI Grok Imagine Pro	$0.07/image	1024x1024
`zai/cogview-4`	Zhipu CogView-4	$0.015/image	1440x1440

Video Generation

Generate short AI videos directly from chat with /videogen:

/videogen a red apple slowly spinning
/videogen --model seedance-2-fast --duration=5 a cat waving
/videogen --model grok-video a neon city at night

Or drive it over HTTP — ClawRouter proxies the BlockRun gateway, handles x402 payment, and downloads the returned MP4 to local disk, rewriting url to http://localhost:8402/videos/<file>.mp4 so the asset survives past the upstream's temporary bucket.

curl -X POST http://localhost:8402/v1/videos/generations \
  -H "Content-Type: application/json" \
  -d '{"model":"bytedance/seedance-2.0-fast","prompt":"a red apple slowly spinning","duration_seconds":5}'

Model	Provider	Price	Duration
`bytedance/seedance-1.5-pro`	ByteDance Seedance	$0.03/sec	5s default, up to 10s
`bytedance/seedance-2.0-fast`	ByteDance Seedance	$0.15/sec	5s default, up to 10s
`bytedance/seedance-2.0`	ByteDance Seedance	$0.30/sec	5s default, up to 10s
`xai/grok-imagine-video`	xAI Grok Imagine	$0.05/sec	8s default

Calls block for 30–120s while the upstream polls the job. Text-to-video and image-to-video (image_url parameter) are both supported. Seedance 2.0 Fast typically returns in 60–80s; 2.0 Pro trades latency for quality.

Image Editing (img2img)

Edit existing images with /img2img:

/img2img --image ~/photo.png change the background to a starry sky
/img2img --image ./cat.jpg --mask ./mask.png remove the background

Option	Required	Description
`--image <path>`	Yes	Local image file path (supports `~/`)
`--mask <path>`	No	Mask image (white = area to edit)
`--model <model>`	No	Model to use (default: `gpt-image-1`)
`--size <WxH>`	No	Output size (default: `1024x1024`)

API endpoint: POST http://localhost:8402/v1/images/image2image — see full docs.

Models & Pricing

55+ models across 9 providers, one wallet. Starting at $0.0002/request.

💡 "Cost per request" = estimated cost for a typical chat message (~500 input + 500 output tokens).

Budget Models (under $0.001/request)

Model	Input $/M	Output $/M	~$/request	Context	Features
nvidia/gpt-oss-120b	FREE	FREE	$0	128K	default free model
nvidia/gpt-oss-20b	FREE	FREE	$0	128K	smaller, faster
nvidia/mistral-small-4-119b	FREE	FREE	$0	131K	fastest free chat (114 tok/s)
nvidia/deepseek-v4-pro	FREE	FREE	$0	1M	reasoning, MMLU-Pro 87.5
nvidia/deepseek-v4-flash	FREE	FREE	$0	1M	reasoning, ~5x faster than v4-pro
nvidia/qwen3-next-80b-a3b-thinking	FREE	FREE	$0	131K	reasoning
nvidia/qwen3-coder-480b	FREE	FREE	$0	131K	coding
nvidia/glm-4.7	FREE	FREE	$0	131K	reasoning
nvidia/llama-4-maverick	FREE	FREE	$0	131K	reasoning
nvidia/nemotron-3-nano-omni-30b-a3b-reasoning	FREE	FREE	$0	256K	reasoning, vision (text+img+video+audio)
openai/gpt-5-nano	$0.05	$0.40	$0.0002	128K	tools
openai/gpt-4.1-nano	$0.10	$0.40	$0.0003	128K	tools
google/gemini-2.5-flash-lite	$0.10	$0.40	$0.0003	1M	tools
openai/gpt-4o-mini	$0.15	$0.60	$0.0004	128K	tools
xai/grok-4-fast	$0.20	$0.50	$0.0004	131K	tools
xai/grok-4-fast-reasoning	$0.20	$0.50	$0.0004	131K	reasoning, tools
xai/grok-4-1-fast	$0.20	$0.50	$0.0004	131K	tools
xai/grok-4-1-fast-reasoning	$0.20	$0.50	$0.0004	131K	reasoning, tools
xai/grok-4-0709	$0.20	$1.50	$0.0009	131K	reasoning, tools
openai/gpt-5-mini	$0.25	$2.00	$0.0011	200K	tools
deepseek/deepseek-chat	$0.28	$0.42	$0.0004	128K	tools
deepseek/deepseek-reasoner	$0.28	$0.42	$0.0004	128K	reasoning, tools
xai/grok-3-mini	$0.30	$0.50	$0.0004	131K	tools
minimax/minimax-m2.7	$0.30	$1.20	$0.0008	205K	reasoning, agentic, tools
minimax/minimax-m2.5	$0.30	$1.20	$0.0008	205K	reasoning, agentic, tools
google/gemini-2.5-flash	$0.30	$2.50	$0.0014	1M	vision, tools
openai/gpt-4.1-mini	$0.40	$1.60	$0.0010	128K	tools
google/gemini-3-flash-preview	$0.50	$3.00	$0.0018	1M	vision
moonshot/kimi-k2.5	$0.60	$3.00	$0.0018	262K	reasoning, vision, agentic, tools
moonshot/kimi-k2.6	$0.95	$4.00	$0.0025	262K	reasoning, vision, agentic, tools

Mid-Range Models ($0.001–$0.01/request)

Model	Input $/M	Output $/M	~$/request	Context	Features
anthropic/claude-haiku-4.5	$1.00	$5.00	$0.0030	200K	vision, agentic, tools
zai/glm-5	$1.00	$3.20	$0.0021	200K	tools
openai/o1-mini	$1.10	$4.40	$0.0028	128K	reasoning, tools
openai/o3-mini	$1.10	$4.40	$0.0028	128K	reasoning, tools
openai/o4-mini	$1.10	$4.40	$0.0028	128K	reasoning, tools
zai/glm-5-turbo	$1.20	$4.00	$0.0026	200K	tools
google/gemini-2.5-pro	$1.25	$10.00	$0.0056	1M	reasoning, vision, tools
openai/gpt-5.2	$1.75	$14.00	$0.0079	400K	reasoning, vision, agentic, tools
openai/gpt-5.3	$1.75	$14.00	$0.0079	128K	reasoning, vision, agentic, tools
openai/gpt-5.3-codex	$1.75	$14.00	$0.0079	400K	agentic, tools
openai/gpt-4.1	$2.00	$8.00	$0.0050	128K	vision, tools
openai/o3	$2.00	$8.00	$0.0050	200K	reasoning, tools
google/gemini-3-pro-preview	$2.00	$12.00	$0.0070	1M	reasoning, vision, tools
google/gemini-3.1-pro	$2.00	$12.00	$0.0070	1M	reasoning, vision, tools
xai/grok-2-vision	$2.00	$10.00	$0.0060	131K	vision, tools
openai/gpt-4o	$2.50	$10.00	$0.0063	128K	vision, agentic, tools
openai/gpt-5.4	$2.50	$15.00	$0.0088	400K	reasoning, vision, agentic, tools

Premium Models ($0.01+/request)

Model	Input $/M	Output $/M	~$/request	Context	Features
anthropic/claude-sonnet-4.6	$3.00	$15.00	$0.0090	200K	reasoning, vision, agentic, tools
xai/grok-3	$3.00	$15.00	$0.0090	131K	reasoning, tools
anthropic/claude-opus-4.6	$5.00	$25.00	$0.0150	200K	reasoning, vision, agentic, tools
openai/gpt-5.5	$5.00	$30.00	$0.0175	1M	reasoning, vision, agentic, tools
openai/o1	$15.00	$60.00	$0.0375	200K	reasoning, tools
openai/gpt-5.2-pro	$21.00	$168.00	$0.0945	400K	reasoning, tools
openai/gpt-5.4-pro	$30.00	$180.00	$0.1050	400K	reasoning, tools

Free tier: 10 models cost nothing — /model free points to gpt-oss-120b, or pick any free model directly (e.g., /model deepseek-v4-pro for reasoning, /model nemotron-omni for vision, /model qwen-thinking, /model deepseek-v4-flash for 1M context, /model mistral-small). Best value: gpt-5-nano and gemini-2.5-flash-lite deliver strong results at ~$0.0003/request.

Payment

No account. No API key. Payment IS authentication via x402.

Request → 402 (price: $0.003) → wallet signs USDC → retry → response

USDC stays in your wallet until spent — non-custodial. Price is visible in the 402 header before signing.

Dual-chain support: Pay with USDC on Base (EVM) or USDC on Solana. Both wallets are derived from a single BIP-39 mnemonic on first run.

/wallet              # Check balance and address (both chains)
/wallet export       # Export mnemonic + keys for backup
/wallet recover      # Restore wallet from mnemonic on a new machine
/wallet solana       # Switch to Solana USDC payments
/wallet base         # Switch back to Base (EVM) USDC payments
/chain solana        # Alias for /wallet solana
/stats               # View usage and savings
/stats clear         # Reset usage statistics
/exclude             # Show excluded models
/exclude add <model> # Block a model from routing (aliases work: "grok-4", "free")
/exclude remove <model> # Unblock a model
/exclude clear       # Remove all exclusions

Fund your wallet:

Base (EVM): Send USDC on Base to your EVM address
Solana: Send USDC on Solana to your Solana address
Coinbase/CEX: Withdraw USDC to either network
Credit card: Reach out to @bc1max on Telegram

Screenshots

Smart Routing in Action

ClawRouter savings

Telegram Integration

Configuration

For basic usage, no configuration needed. For advanced options:

Variable	Default	Description
`BLOCKRUN_WALLET_KEY`	auto-generated	Your wallet private key
`BLOCKRUN_PROXY_PORT`	`8402`	Local proxy port
`CLAWROUTER_DISABLED`	`false`	Disable smart routing
`CLAWROUTER_SOLANA_RPC_URL`	`https://api.mainnet-beta.solana.com`	Solana RPC endpoint

Full reference: docs/configuration.md

Model Exclusion

Block specific models from being routed to. Useful if a model doesn't follow your agent instructions or you want to control costs.

/exclude add nvidia/gpt-oss-120b   # Block the default free model
/exclude add grok-4                # Aliases work — blocks all grok-4 variants
/exclude add gpt-5.4               # Skip expensive models
/exclude                           # Show current exclusions
/exclude remove grok-4             # Unblock a model
/exclude clear                     # Remove all exclusions

Exclusions persist across restarts (~/.openclaw/blockrun/exclude-models.json). If all models in a tier are excluded, the safety net ignores the filter so routing never breaks.

Troubleshooting

When things go wrong, run the doctor:

npx @blockrun/clawrouter doctor

This collects diagnostics and sends them to Claude Sonnet for AI-powered analysis:

🩺 BlockRun Doctor v0.12.24

System
  ✓ OS: darwin arm64
  ✓ Node: v20.11.0

Wallet
  ✓ Address: 0x1234...abcd
  ✓ Balance: $12.50

Network
  ✓ BlockRun API: reachable (142ms)
  ✗ Local proxy: not running on :8402

📤 Sending to Claude Sonnet 4.6 (~$0.003)...

🤖 AI Analysis:
The local proxy isn't running. Run `openclaw gateway restart` to fix.

Use Opus for complex issues:

npx @blockrun/clawrouter doctor opus

Ask a specific question:

npx @blockrun/clawrouter doctor "why is my request failing?"
npx @blockrun/clawrouter doctor opus "深度分析我的配置"

Cost: Sonnet ~$0.003 (default) | Opus ~$0.01

Development

git clone https://github.com/BlockRunAI/ClawRouter.git
cd ClawRouter
npm install
npm run build
npm test

Support

Channel	Link
📅 Schedule Demo	calendly.com/vickyfu9/30min
💬 Community Telegram	t.me/blockrunAI
🐦 X / Twitter	x.com/blockrunai
📱 Founder Telegram	@bc1max
✉️ Email	vicky@blockrun.ai

From the BlockRun Ecosystem

⚡ ClawRouter

The LLM router built for autonomous agents

You're here. 55+ models, local smart routing, x402 USDC payments — the only stack that lets agents operate independently.

curl -fsSL https://blockrun.ai/ClawRouter-update | bash

🤖 BRCC

BlockRun for Claude Code

Run Claude Code with 50+ models, no rate limits, no Anthropic account, no phone verification. Pay per request with USDC — your wallet is your identity.

curl -fsSL https://blockrun.ai/brcc-install | bash

More Resources

Resource	Description
Documentation	Full docs
Model Pricing	All models & prices
Image Generation & Editing	API examples, 5 models
Routing Profiles	ECO/AUTO/PREMIUM details
Architecture	Technical deep dive
Configuration	Environment variables
Troubleshooting	Common issues

Blog

Article	Topic
11 Free AI Models, Zero Cost	How BlockRun gives developers top-tier LLMs for nothing
ClawRouter Cuts LLM API Costs 500×	Deep dive into cost savings
ClawRouter vs OpenRouter	Head-to-head comparison
[Smart LLM Router: 14-Dimension Classifier](docs/smart-llm-router-14-dimensio

Similar ai infra

AI Infra

n8n

by n8n-io

Workflow automation platform for technical teams — visually build AI agent workflows with 400+ integrations

$npx n8n

193k+8.2kn8n@2.26.8· 1d agoTypeScript

GitHub ↗

AI Infrareplaces Firebase

supabase

by supabase

Postgres development platform — open-source alternative to Firebase with built-in AI/vector tools

104k+2.5kv1.26.05· 1mo agoTypeScript

GitHub ↗

AI Infrareplaces Notion

AppFlowy

by AppFlowy-IO

AI collaborative workspace — self-hosted Notion alternative for projects, wikis, and data control

$flatpak install flathub io.appflowy.AppFlowy

73k+2.1k0.12.2· 2w agoDart

GitHub ↗

AI Infrareplaces Vercel

coolify

by coollabsio

Self-hostable PaaS alternative to Vercel, Heroku, Netlify — deploy static sites, databases, and full-stack apps on your own servers

$curl -fsSL https://cdn.coollabs.io/coolify/install.sh | bash

57k+3.2kv4.1.2· 2w agoPHP

GitHub ↗

See all AI Infra →