https://reidwxzz567.image-perth.org/how-to-stop-burning-cash-on-output-tokens-a-practical-guide-for-engineering-leads
Why run everything on one model? Routing tasks based on cost, reasoning power, or specific failure modes is a smart move, until you look at the logs.