📌 Strategy: Uses tokens first across all models, then falls back to credits.
📌 Best for: Users who want to ensure all tokens are used first before switching to credits.
How it Works #
- If the primary model (e.g., DeepSeek-7B) lacks tokens, the system checks the next model in the priority list.
- If no models have tokens available, the system falls back to credits, starting from the highest-priority model.
Example Scenario #
Model | Priority | Available Tokens | Credit Needed | User Credits |
---|---|---|---|---|
DeepSeek-7B | 1 | 100 | 20 | 50 |
LLaMA 3-8B | 2 | 0 | 15 | 30 |
Mistral-7B | 3 | 0 | 10 | 60 |
Final Selection: #
✅ DeepSeek-7B → Uses Tokens
✅ LLaMA 3-8B → Uses Credits
✅ Mistral-7B → Uses Credits
📌 Why? The system first tries all models for tokens, only falling back to credits if necessary.