https://garrettwigp625.tearosediner.net/the-llm-sprawl-how-to-compare-gpt-claude-gemini-grok-and-perplexity-in-one-workflow
We are moving past the "one model to rule them all" phase. Routing requests across multiple models makes sense because each model has different strengths and failure modes. You might use a fast model for classification and a more capable one for complex logic. But tread lightly