ClaudeStore/Docs
⚠ These docs are a work in progress. Some content may be inaccurate or temporarily AI-generated.

ClaudeStore vs VseLLM — Claude-Only vs Multi-Provider Gateway [2026]

ClaudeStore vs VseLLM comparison: Claude-focused gateway with billing about 30% below Anthropic's official rates versus a multi-provider router with its own markup. Pricing, features, and migration guide.

Quick Answer

ClaudeStore vs VseLLM in one line: ClaudeStore is Claude-only, with billing about 30% below Anthropic's official rates, and full Anthropic API features (prompt caching, extended thinking). VseLLM is a multi-provider gateway (OpenAI + Anthropic + others) with its own markup and a normalized API that can hide provider-specific features.

Feature Comparison

FeatureClaudeStoreVseLLM
Provider focusClaude onlyMulti-provider
Effective Claude pricing~30% cheaper than Anthropic official ratesMarkup varies by gateway pricing
Prompt cachingNative (90% savings)Not always exposed
Extended thinkingNativeLimited
Native Anthropic APIYes (/v1/messages)Normalized only
OpenAI-compat endpointYesYes
Per-key spend limitsYesLimited
RU payment (Mir/SBP)YesYes

When to choose VseLLM

If your workload genuinely needs multi-provider routing — e.g. fallback from Claude to GPT-4 on rate limits, or A/B testing across providers — VseLLM's normalized API is convenient.

When to choose ClaudeStore

If you're committed to Claude (Sonnet 4.6, Opus 4.6, Haiku 4.5), ClaudeStore is the cleaner choice:

  • Lower effective public pricing — requests are billed at about 30% below Anthropic's official token rates
  • Prompt caching saves 90% on repeated system prompts (huge for IDE integrations)
  • Extended thinking supported natively for complex reasoning
  • Native Anthropic format — code that works with anthropic-sdk works as-is
  • Cursor, Claude Code, VS Code integrations documented out of the box

Ready to start?

Get API access to all Claude models in under 2 minutes.

View Plans