Your orchestration layer is routed through Vercel AI Gateway, providing unified caching, rate limiting, and per-token observability across all model providers.
GPT-4o (Reasoning): 642,891 / 1.0M tokens
Claude 3.5 Sonnet (Coding): 128,400 / 500k tokens
Ollama Local (Metadata Extraction): Unlimited
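
As a rough illustration of how the quota figures above translate into utilization, here is a minimal Python sketch. The `USAGE` table, `percent_used`, and `usage_report` are hypothetical names introduced only for this example; they are not part of Vercel AI Gateway or any provider SDK. The numbers are copied from the panel by hand, and the local Ollama model is treated as uncapped.

```python
from typing import Dict, Optional, Tuple

# Hypothetical table: (tokens used, token limit) per model, copied from the panel.
# None means the panel shows no figure (the local model is listed as "Unlimited").
USAGE: Dict[str, Tuple[Optional[int], Optional[int]]] = {
    "GPT-4o (Reasoning)": (642_891, 1_000_000),
    "Claude 3.5 Sonnet (Coding)": (128_400, 500_000),
    "Ollama Local (Metadata Extraction)": (None, None),
}


def percent_used(used: Optional[int], limit: Optional[int]) -> Optional[float]:
    """Return the percentage of the quota consumed, or None when there is no cap."""
    if used is None or limit is None:
        return None
    return round(100 * used / limit, 1)


def usage_report(
    usage: Dict[str, Tuple[Optional[int], Optional[int]]] = USAGE,
) -> Dict[str, Optional[float]]:
    """Map each model name to its utilization percentage (None if uncapped)."""
    return {name: percent_used(used, limit) for name, (used, limit) in usage.items()}


if __name__ == "__main__":
    for name, pct in usage_report().items():
        print(f"{name}: {'uncapped' if pct is None else f'{pct}% used'}")
```

With the figures shown, GPT-4o sits at roughly 64% of its quota and Claude 3.5 Sonnet at roughly 26%. In a real deployment these values would presumably come from the gateway's own usage reporting rather than a hard-coded table.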