Intelligent model routing that collapses inference overhead without degrading semantic integrity. One line to deploy.
Connect via universal gateway. Full OpenAI/Anthropic SDK compatibility. Zero config.
Two-layer classifier: rules (<1ms) + LLM for ambiguity. 16 task types, 5 complexity levels.
Scoring formula picks optimal model. Auto-fallback on failure. Budget → $0.32. Frontier → $15.
Use model="auto" to let the router analyze complexity and pick the best tier for you. Average savings: 60-70%.