Model Routing

#routing

TLDR

  1. send requests to expert or cheaper models, use small routing model to predict which model to send inputs to