ClawRouter
LLM router for autonomous agents — smart routing, wallet-based authentication, and USDC micropayments for AI API calls
An agent-native LLM router that enables autonomous AI agents to access 55+ models without API keys or credit cards. It performs local, smart routing across models based on 15 dimensions to reduce AI API costs by up to 92%. Agents authenticate via wallet signatures and pay per-request with USDC via the x402 protocol, supporting Base and Solana networks. It integrates as an OpenClaw plugin or runs as a standalone OpenAI-compatible proxy.
- 100% local routing with sub-1ms latency for all LLM requests
- Smart routing across 55+ models based on 15 dimensions
- Agent-native authentication using wallet signatures, no API keys
- USDC micropayments via x402 protocol on Base and Solana
- Integrated image and video generation with chat slash commands
README
View on GitHub ↗
The LLM router built for autonomous agents
Agents can't sign up for accounts. Agents can't enter credit cards.
Agents can only sign transactions.
ClawRouter is the only LLM router that lets agents operate independently.
10 models free, no crypto required. No signup. No API key. No credit card.
ClawRouter is an open-source smart LLM router that reduces AI API costs by up to 92%. It analyzes each request across 15 dimensions and routes to the cheapest capable model in under 1ms, entirely locally. ClawRouter is the only LLM router built for autonomous AI agents — it uses wallet signatures for authentication (no API keys) and USDC micropayments via the x402 protocol (no credit cards). 55+ models from OpenAI, Anthropic, Google, xAI, DeepSeek, and more. MIT licensed.
Why ClawRouter exists
Every other LLM router was built for human developers — create an account, get an API key, pick a model from a dashboard, pay with a credit card.
Agents can't do any of that.
ClawRouter is built for the agent-first world:
- Starts at $0 — 10 NVIDIA models are free forever (incl. 1M-context DeepSeek V4 + a vision-capable Nemotron Omni)
- No accounts — a wallet is generated locally, no signup
- No API keys — your wallet signature IS authentication
- No model selection — 15-dimension scoring picks the right model automatically
- No credit cards — agents pay per-request with USDC via x402
- No trust required — runs locally, <1ms routing, zero external dependencies
This is the stack that lets agents operate autonomously: x402 + USDC + local routing.
How it compares
| OpenRouter | LiteLLM | Martian | Portkey | ClawRouter | |
|---|---|---|---|---|---|
| Models | 200+ | 100+ | Smart routing | Gateway | 55+ |
| Free tier | Rate-limited | BYO keys | No | No | 8 models, no signup |
| Routing | Manual selection | Manual selection | Smart (closed) | Observability | Smart (open source) |
| Auth | Account + API key | Your API keys | Account + API key | Account + API key | Wallet signature |
| Payment | Credit card | BYO keys | Credit card | $49-499/mo | USDC per-request |
| Runs locally | No | Yes | No | No | Yes |
| Open source | No | Yes | No | Partial | Yes |
| Agent-ready | No | No | No | No | Yes |
✓ Open source · ✓ Smart routing · ✓ Runs locally · ✓ Crypto native · ✓ Agent ready
We're the only one that checks all five boxes.
Quick Start
No wallet? 10 models work free out of the box. Install, run, and pin
nvidia/gpt-oss-120b(or any of the 10) — no crypto, no signup, no balance required. Add USDC later when you want paid models.
Option A — OpenClaw Agent
OpenClaw is an AI coding agent. If you're using it, ClawRouter installs as a plugin. Two paths:
A1. Recommended — one-shot install script:
curl -fsSL https://blockrun.ai/ClawRouter-update | bash
openclaw gateway restart
This handles everything: registration, models config, auth profile, wallet setup. Smart routing (blockrun/auto) is now your default model.
A2. If you prefer pure npm:
npm install -g @blockrun/clawrouter
clawrouter setup # finishes OpenClaw integration — REQUIRED
openclaw gateway restart
⚠️ Skipping
clawrouter setupwill leave you broken. Barenpm install -gonly puts the package on disk; it does NOT register the plugin with OpenClaw, sync the models allowlist, or write the auth profile. Symptom:/modelsin your bot shows only ~7 entries (OpenClaw's hardcoded defaults) instead of the full ~38 BlockRun models. Runclawrouter setupto repair, or use path A1 to begin with.
Option B — Standalone (continue.dev, Cursor, VS Code, any OpenAI-compatible client)
Using Claude Code? Check out BRCC — it's purpose-built for Claude Code with the same smart routing and x402 payments.
No OpenClaw required. ClawRouter runs as a local proxy on port 8402.
1. Start the proxy
npx @blockrun/clawrouter
2. Fund your wallet — optional, skip for free tier
Your wallet address is printed on first run. For paid models, send a few USDC on Base or Solana — $5 covers thousands of requests. To stay at $0, pin any of the 10 free models (e.g. nvidia/gpt-oss-120b) or use /model free inside OpenClaw.
3. Point your client at http://localhost:8402
continue.dev — ~/.continue/config.yaml
Important:
apiBasemust end with/v1/(including the trailing slash). Without it, continue.dev constructs the URL as/chat/completionsinstead of/v1/chat/completions, and the proxy returns 404.
models:
- name: ClawRouter Auto
provider: openai
model: blockrun/auto
apiBase: http://localhost:8402/v1/
apiKey: x402
roles:
- chat
- edit
- apply
To pin a specific model, replace blockrun/auto with any model from blockrun.ai/models, e.g. anthropic/claude-opus-4.6, xai/grok-4-0709.
Both provider: openai and provider: clawrouter work — just make sure apiBase ends with /v1/.
Legacy JSON format (~/.continue/config.json)
{
"models": [
{
"title": "ClawRouter Auto",
"provider": "openai",
"model": "blockrun/auto",
"apiBase": "http://localhost:8402/v1/",
"apiKey": "x402"
}
]
}
Cursor — Settings → Models → OpenAI-compatible
Set base URL to http://localhost:8402, API key to x402, model to blockrun/auto.
Any OpenAI SDK
from openai import OpenAI
client = OpenAI(base_url="http://localhost:8402", api_key="x402")
response = client.chat.completions.create(model="blockrun/auto", messages=[...])
Routing Profiles
Choose your routing strategy with /model <profile>:
| Profile | Strategy | Savings | Best For |
|---|---|---|---|
/model free |
Free NVIDIA models | 100% | $0 balance, learning |
/model auto |
Balanced (default) | 74-100% | General use |
/model eco |
Cheapest possible | 95-100% | Maximum savings |
/model premium |
Best quality | 0% | Mission-critical |
Shortcuts: /model grok, /model br-sonnet, /model gpt5, /model o3
How It Works
100% local routing. <1ms latency. Zero external API calls.
Request → Weighted Scorer (15 dimensions) → Tier → Best Model → Response
| Tier | ECO Model | AUTO Model | PREMIUM Model |
|---|---|---|---|
| SIMPLE | nvidia/gpt-oss-120b (FREE) | gemini-2.5-flash ($0.30/$2.50) | kimi-k2.6 |
| MEDIUM | gemini-3.1-flash-lite ($0.25/$1.50) | kimi-k2.5 ($0.60/$3.00) | gpt-5.3-codex ($1.75/$14.00) |
| COMPLEX | gemini-3.1-flash-lite ($0.25/$1.50) | gemini-3.1-pro ($2/$12) | claude-opus-4.6 ($5/$25) |
| REASONING | grok-4-1-fast ($0.20/$0.50) | grok-4-1-fast-reasoning ($0.20/$0.50) | claude-sonnet-4.6 ($3/$15) |
Blended average: $2.05/M vs $25/M for Claude Opus = 92% savings
Image Generation
Generate images directly from chat with /cr-imagegen:
/cr-imagegen a dog dancing on the beach
/cr-imagegen --model dall-e-3 a futuristic city at sunset
/cr-imagegen --model banana-pro --size 2048x2048 mountain landscape
The slash command is
/cr-imagegento avoid colliding with Telegram channel commands. Typing/imagegenin chat still works for backward compatibility.
| Model | Provider | Price | Max Size |
|---|---|---|---|
nano-banana |
Google Gemini Flash | $0.05/image | 1024x1024 |
banana-pro |
Google Gemini Pro | $0.10/image | 4096x4096 |
dall-e-3 |
OpenAI DALL-E 3 | $0.04/image | 1792x1024 |
gpt-image |
OpenAI GPT Image 1 | $0.02/image | 1536x1024 |
flux |
Black Forest Flux 1.1 | $0.04/image | 1024x1024 |
xai/grok-imagine-image |
xAI Grok Imagine | $0.02/image | 1024x1024 |
xai/grok-imagine-image-pro |
xAI Grok Imagine Pro | $0.07/image | 1024x1024 |
zai/cogview-4 |
Zhipu CogView-4 | $0.015/image | 1440x1440 |
Video Generation
Generate short AI videos directly from chat with /videogen:
/videogen a red apple slowly spinning
/videogen --model seedance-2-fast --duration=5 a cat waving
/videogen --model grok-video a neon city at night
Or drive it over HTTP — ClawRouter proxies the BlockRun gateway, handles x402 payment, and downloads the returned MP4 to local disk, rewriting url to http://localhost:8402/videos/<file>.mp4 so the asset survives past the upstream's temporary bucket.
curl -X POST http://localhost:8402/v1/videos/generations \
-H "Content-Type: application/json" \
-d '{"model":"bytedance/seedance-2.0-fast","prompt":"a red apple slowly spinning","duration_seconds":5}'
| Model | Provider | Price | Duration |
|---|---|---|---|
bytedance/seedance-1.5-pro |
ByteDance Seedance | $0.03/sec | 5s default, up to 10s |
bytedance/seedance-2.0-fast |
ByteDance Seedance | $0.15/sec | 5s default, up to 10s |
bytedance/seedance-2.0 |
ByteDance Seedance | $0.30/sec | 5s default, up to 10s |
xai/grok-imagine-video |
xAI Grok Imagine | $0.05/sec | 8s default |
Calls block for 30–120s while the upstream polls the job. Text-to-video and image-to-video (image_url parameter) are both supported. Seedance 2.0 Fast typically returns in 60–80s; 2.0 Pro trades latency for quality.
Image Editing (img2img)
Edit existing images with /img2img:
/img2img --image ~/photo.png change the background to a starry sky
/img2img --image ./cat.jpg --mask ./mask.png remove the background
| Option | Required | Description |
|---|---|---|
--image <path> |
Yes | Local image file path (supports ~/) |
--mask <path> |
No | Mask image (white = area to edit) |
--model <model> |
No | Model to use (default: gpt-image-1) |
--size <WxH> |
No | Output size (default: 1024x1024) |
API endpoint: POST http://localhost:8402/v1/images/image2image — see full docs.
Models & Pricing
55+ models across 9 providers, one wallet. Starting at $0.0002/request.
💡 "Cost per request" = estimated cost for a typical chat message (~500 input + 500 output tokens).
Budget Models (under $0.001/request)
| Model | Input $/M | Output $/M | ~$/request | Context | Features |
|---|---|---|---|---|---|
| nvidia/gpt-oss-120b | FREE | FREE | $0 | 128K | default free model |
| nvidia/gpt-oss-20b | FREE | FREE | $0 | 128K | smaller, faster |
| nvidia/mistral-small-4-119b | FREE | FREE | $0 | 131K | fastest free chat (114 tok/s) |
| nvidia/deepseek-v4-pro | FREE | FREE | $0 | 1M | reasoning, MMLU-Pro 87.5 |
| nvidia/deepseek-v4-flash | FREE | FREE | $0 | 1M | reasoning, ~5x faster than v4-pro |
| nvidia/qwen3-next-80b-a3b-thinking | FREE | FREE | $0 | 131K | reasoning |
| nvidia/qwen3-coder-480b | FREE | FREE | $0 | 131K | coding |
| nvidia/glm-4.7 | FREE | FREE | $0 | 131K | reasoning |
| nvidia/llama-4-maverick | FREE | FREE | $0 | 131K | reasoning |
| nvidia/nemotron-3-nano-omni-30b-a3b-reasoning | FREE | FREE | $0 | 256K | reasoning, vision (text+img+video+audio) |
| openai/gpt-5-nano | $0.05 | $0.40 | $0.0002 | 128K | tools |
| openai/gpt-4.1-nano | $0.10 | $0.40 | $0.0003 | 128K | tools |
| google/gemini-2.5-flash-lite | $0.10 | $0.40 | $0.0003 | 1M | tools |
| openai/gpt-4o-mini | $0.15 | $0.60 | $0.0004 | 128K | tools |
| xai/grok-4-fast | $0.20 | $0.50 | $0.0004 | 131K | tools |
| xai/grok-4-fast-reasoning | $0.20 | $0.50 | $0.0004 | 131K | reasoning, tools |
| xai/grok-4-1-fast | $0.20 | $0.50 | $0.0004 | 131K | tools |
| xai/grok-4-1-fast-reasoning | $0.20 | $0.50 | $0.0004 | 131K | reasoning, tools |
| xai/grok-4-0709 | $0.20 | $1.50 | $0.0009 | 131K | reasoning, tools |
| openai/gpt-5-mini | $0.25 | $2.00 | $0.0011 | 200K | tools |
| deepseek/deepseek-chat | $0.28 | $0.42 | $0.0004 | 128K | tools |
| deepseek/deepseek-reasoner | $0.28 | $0.42 | $0.0004 | 128K | reasoning, tools |
| xai/grok-3-mini | $0.30 | $0.50 | $0.0004 | 131K | tools |
| minimax/minimax-m2.7 | $0.30 | $1.20 | $0.0008 | 205K | reasoning, agentic, tools |
| minimax/minimax-m2.5 | $0.30 | $1.20 | $0.0008 | 205K | reasoning, agentic, tools |
| google/gemini-2.5-flash | $0.30 | $2.50 | $0.0014 | 1M | vision, tools |
| openai/gpt-4.1-mini | $0.40 | $1.60 | $0.0010 | 128K | tools |
| google/gemini-3-flash-preview | $0.50 | $3.00 | $0.0018 | 1M | vision |
| moonshot/kimi-k2.5 | $0.60 | $3.00 | $0.0018 | 262K | reasoning, vision, agentic, tools |
| moonshot/kimi-k2.6 | $0.95 | $4.00 | $0.0025 | 262K | reasoning, vision, agentic, tools |
Mid-Range Models ($0.001–$0.01/request)
| Model | Input $/M | Output $/M | ~$/request | Context | Features |
|---|---|---|---|---|---|
| anthropic/claude-haiku-4.5 | $1.00 | $5.00 | $0.0030 | 200K | vision, agentic, tools |
| zai/glm-5 | $1.00 | $3.20 | $0.0021 | 200K | tools |
| openai/o1-mini | $1.10 | $4.40 | $0.0028 | 128K | reasoning, tools |
| openai/o3-mini | $1.10 | $4.40 | $0.0028 | 128K | reasoning, tools |
| openai/o4-mini | $1.10 | $4.40 | $0.0028 | 128K | reasoning, tools |
| zai/glm-5-turbo | $1.20 | $4.00 | $0.0026 | 200K | tools |
| google/gemini-2.5-pro | $1.25 | $10.00 | $0.0056 | 1M | reasoning, vision, tools |
| openai/gpt-5.2 | $1.75 | $14.00 | $0.0079 | 400K | reasoning, vision, agentic, tools |
| openai/gpt-5.3 | $1.75 | $14.00 | $0.0079 | 128K | reasoning, vision, agentic, tools |
| openai/gpt-5.3-codex | $1.75 | $14.00 | $0.0079 | 400K | agentic, tools |
| openai/gpt-4.1 | $2.00 | $8.00 | $0.0050 | 128K | vision, tools |
| openai/o3 | $2.00 | $8.00 | $0.0050 | 200K | reasoning, tools |
| google/gemini-3-pro-preview | $2.00 | $12.00 | $0.0070 | 1M | reasoning, vision, tools |
| google/gemini-3.1-pro | $2.00 | $12.00 | $0.0070 | 1M | reasoning, vision, tools |
| xai/grok-2-vision | $2.00 | $10.00 | $0.0060 | 131K | vision, tools |
| openai/gpt-4o | $2.50 | $10.00 | $0.0063 | 128K | vision, agentic, tools |
| openai/gpt-5.4 | $2.50 | $15.00 | $0.0088 | 400K | reasoning, vision, agentic, tools |
Premium Models ($0.01+/request)
| Model | Input $/M | Output $/M | ~$/request | Context | Features |
|---|---|---|---|---|---|
| anthropic/claude-sonnet-4.6 | $3.00 | $15.00 | $0.0090 | 200K | reasoning, vision, agentic, tools |
| xai/grok-3 | $3.00 | $15.00 | $0.0090 | 131K | reasoning, tools |
| anthropic/claude-opus-4.6 | $5.00 | $25.00 | $0.0150 | 200K | reasoning, vision, agentic, tools |
| openai/gpt-5.5 | $5.00 | $30.00 | $0.0175 | 1M | reasoning, vision, agentic, tools |
| openai/o1 | $15.00 | $60.00 | $0.0375 | 200K | reasoning, tools |
| openai/gpt-5.2-pro | $21.00 | $168.00 | $0.0945 | 400K | reasoning, tools |
| openai/gpt-5.4-pro | $30.00 | $180.00 | $0.1050 | 400K | reasoning, tools |
Free tier: 10 models cost nothing —
/model freepoints to gpt-oss-120b, or pick any free model directly (e.g.,/model deepseek-v4-profor reasoning,/model nemotron-omnifor vision,/model qwen-thinking,/model deepseek-v4-flashfor 1M context,/model mistral-small). Best value:gpt-5-nanoandgemini-2.5-flash-litedeliver strong results at ~$0.0003/request.
Payment
No account. No API key. Payment IS authentication via x402.
Request → 402 (price: $0.003) → wallet signs USDC → retry → response
USDC stays in your wallet until spent — non-custodial. Price is visible in the 402 header before signing.
Dual-chain support: Pay with USDC on Base (EVM) or USDC on Solana. Both wallets are derived from a single BIP-39 mnemonic on first run.
/wallet # Check balance and address (both chains)
/wallet export # Export mnemonic + keys for backup
/wallet recover # Restore wallet from mnemonic on a new machine
/wallet solana # Switch to Solana USDC payments
/wallet base # Switch back to Base (EVM) USDC payments
/chain solana # Alias for /wallet solana
/stats # View usage and savings
/stats clear # Reset usage statistics
/exclude # Show excluded models
/exclude add <model> # Block a model from routing (aliases work: "grok-4", "free")
/exclude remove <model> # Unblock a model
/exclude clear # Remove all exclusions
Fund your wallet:
- Base (EVM): Send USDC on Base to your EVM address
- Solana: Send USDC on Solana to your Solana address
- Coinbase/CEX: Withdraw USDC to either network
- Credit card: Reach out to @bc1max on Telegram
Screenshots
Smart Routing in Action
|
Telegram Integration
|
Configuration
For basic usage, no configuration needed. For advanced options:
| Variable | Default | Description |
|---|---|---|
BLOCKRUN_WALLET_KEY |
auto-generated | Your wallet private key |
BLOCKRUN_PROXY_PORT |
8402 |
Local proxy port |
CLAWROUTER_DISABLED |
false |
Disable smart routing |
CLAWROUTER_SOLANA_RPC_URL |
https://api.mainnet-beta.solana.com |
Solana RPC endpoint |
Full reference: docs/configuration.md
Model Exclusion
Block specific models from being routed to. Useful if a model doesn't follow your agent instructions or you want to control costs.
/exclude add nvidia/gpt-oss-120b # Block the default free model
/exclude add grok-4 # Aliases work — blocks all grok-4 variants
/exclude add gpt-5.4 # Skip expensive models
/exclude # Show current exclusions
/exclude remove grok-4 # Unblock a model
/exclude clear # Remove all exclusions
Exclusions persist across restarts (~/.openclaw/blockrun/exclude-models.json). If all models in a tier are excluded, the safety net ignores the filter so routing never breaks.
Troubleshooting
When things go wrong, run the doctor:
npx @blockrun/clawrouter doctor
This collects diagnostics and sends them to Claude Sonnet for AI-powered analysis:
🩺 BlockRun Doctor v0.12.24
System
✓ OS: darwin arm64
✓ Node: v20.11.0
Wallet
✓ Address: 0x1234...abcd
✓ Balance: $12.50
Network
✓ BlockRun API: reachable (142ms)
✗ Local proxy: not running on :8402
📤 Sending to Claude Sonnet 4.6 (~$0.003)...
🤖 AI Analysis:
The local proxy isn't running. Run `openclaw gateway restart` to fix.
Use Opus for complex issues:
npx @blockrun/clawrouter doctor opus
Ask a specific question:
npx @blockrun/clawrouter doctor "why is my request failing?"
npx @blockrun/clawrouter doctor opus "深度分析我的配置"
Cost: Sonnet ~$0.003 (default) | Opus ~$0.01
Development
git clone https://github.com/BlockRunAI/ClawRouter.git
cd ClawRouter
npm install
npm run build
npm test
Support
| Channel | Link |
|---|---|
| 📅 Schedule Demo | calendly.com/vickyfu9/30min |
| 💬 Community Telegram | t.me/blockrunAI |
| 🐦 X / Twitter | x.com/blockrunai |
| 📱 Founder Telegram | @bc1max |
| vicky@blockrun.ai |
From the BlockRun Ecosystem
⚡ ClawRouterThe LLM router built for autonomous agents You're here. 55+ models, local smart routing, x402 USDC payments — the only stack that lets agents operate independently.
|
🤖 BRCCBlockRun for Claude Code Run Claude Code with 50+ models, no rate limits, no Anthropic account, no phone verification. Pay per request with USDC — your wallet is your identity.
|
More Resources
| Resource | Description |
|---|---|
| Documentation | Full docs |
| Model Pricing | All models & prices |
| Image Generation & Editing | API examples, 5 models |
| Routing Profiles | ECO/AUTO/PREMIUM details |
| Architecture | Technical deep dive |
| Configuration | Environment variables |
| Troubleshooting | Common issues |
Blog
| Article | Topic |
|---|---|
| 11 Free AI Models, Zero Cost | How BlockRun gives developers top-tier LLMs for nothing |
| ClawRouter Cuts LLM API Costs 500× | Deep dive into cost savings |
| ClawRouter vs OpenRouter | Head-to-head comparison |
| [Smart LLM Router: 14-Dimension Classifier](docs/smart-llm-router-14-dimensio |
Similar ai infra
n8n
Workflow automation platform for technical teams — visually build AI agent workflows with 400+ integrations
supabase
Postgres development platform — open-source alternative to Firebase with built-in AI/vector tools
AppFlowy
AI collaborative workspace — self-hosted Notion alternative for projects, wikis, and data control
coolify
Self-hostable PaaS alternative to Vercel, Heroku, Netlify — deploy static sites, databases, and full-stack apps on your own servers