How We Rank AI Models
At TestTheAI, transparency is key. Here's exactly how we determine which AI models are performing best.
The Basics
Every question submitted to TestTheAI gets answered by all active AI models. Users then vote on which answers they find most helpful, accurate, or creative (depending on the question type).
Scoring Formula
Our ranking algorithm considers:
- Win Rate: Percentage of questions where a model received the most votes
- Average Score: Mean upvotes minus downvotes across all answers
- Recency: Recent performance is weighted more heavily than older results
Category-Specific Rankings
We maintain separate leaderboards for each domain:
- Code
- Writing
- Math
- Science
- General Knowledge
This allows models to shine in their areas of strength while giving users accurate expectations for specific use cases.
Fair Comparison
To ensure fairness:
- All models receive the same prompt
- Answers are presented in randomized order
- Users can vote on multiple answers (not just pick one)
Want More Details?
Pro subscribers get access to our detailed monthly reports with full breakdowns, historical trends, and category-specific analysis.