Sequential Model Fallback

📌 Strategy: Uses tokens first across all models, then falls back to credits.
📌 Best for: Users who want to use up tokens on every model before spending any credits.

How it Works

If the primary model (e.g., DeepSeek-7B) has no tokens available, the system checks the next model in the priority list.
If no model has tokens available, the system falls back to credits, starting again from the highest-priority model.
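
The two passes described above can be sketched in Python. This is only an illustrative sketch, not the product's actual implementation: the `Model` class, its field names, and `select_model` are hypothetical, and it assumes that "has tokens" means a token balance above zero and that a model can be paid for with credits when the user's credits cover the credits needed.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Model:
    # Hypothetical record; field names mirror the columns of the example table.
    name: str
    priority: int          # lower number = higher priority
    available_tokens: int
    credits_needed: int
    user_credits: int


def select_model(models: List[Model]) -> Optional[Tuple[str, str]]:
    """Return (model name, payment method), or None if nothing is usable."""
    ordered = sorted(models, key=lambda m: m.priority)

    # Pass 1: walk the priority list looking for any model with tokens left.
    for m in ordered:
        if m.available_tokens > 0:  # assumption: any positive balance counts
            return (m.name, "tokens")

    # Pass 2: no model has tokens, so restart from the top and try credits.
    for m in ordered:
        if m.user_credits >= m.credits_needed:  # assumption: affordability check
            return (m.name, "credits")

    return None


# Primary model is out of tokens, so the next model with tokens is chosen.
models = [
    Model("DeepSeek-7B", 1, 0, 20, 50),
    Model("LLaMA 3-8B", 2, 40, 15, 30),
    Model("Mistral-7B", 3, 0, 10, 60),
]
print(select_model(models))  # ('LLaMA 3-8B', 'tokens')
```

Keeping tokens and credits in two separate loops is what makes the strategy sequential: credits are never considered until every model has been checked for tokens.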

Example Scenario

Model	Priority	Available Tokens	Credits Needed	User Credits
DeepSeek-7B	1	100	20	50
LLaMA 3-8B	2	0	15	30
Mistral-7B	3	0	10	60

Final Selection:

✅ DeepSeek-7B → Uses Tokens
✅ LLaMA 3-8B → Uses Credits
✅ Mistral-7B → Uses Credits

📌 Why? The system first tries all models for tokens, only falling back to credits if necessary.
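
As a quick check of the scenario, the per-model outcome can be recomputed from the table. This is one possible reading of the table, not a confirmed rule: the helper `payment_method` is hypothetical, and it assumes a model uses tokens whenever its token balance is above zero and otherwise uses credits when the user's credits cover the credits needed.

```python
# Rows copied from the example table:
# (model, priority, available_tokens, credits_needed, user_credits)
scenario = [
    ("DeepSeek-7B", 1, 100, 20, 50),
    ("LLaMA 3-8B", 2, 0, 15, 30),
    ("Mistral-7B", 3, 0, 10, 60),
]


def payment_method(available_tokens: int, credits_needed: int, user_credits: int) -> str:
    """Hypothetical helper: decide how a single model would be paid for."""
    if available_tokens > 0:
        return "tokens"
    if user_credits >= credits_needed:
        return "credits"
    return "unavailable"  # neither tokens nor enough credits


# Evaluate the models in priority order (lowest number first).
results = {
    name: payment_method(tokens, needed, credits)
    for name, _priority, tokens, needed, credits in sorted(scenario, key=lambda r: r[1])
}

for name, method in results.items():
    print(f"{name} -> {method}")
# DeepSeek-7B -> tokens
# LLaMA 3-8B -> credits
# Mistral-7B -> credits
```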

Updated on February 11, 2025
