| Coding | Arena.ai Code Arena, Vals SWE-bench, Vals Vibe Code, Vellum, Artificial Analysis | 10% to 35% per source |
| Writing | Creative writing arenas, long-form writing evaluations, broad text preference sources | 10% to 35% per source |
| Math | ProofBench, Riemann-bench, AIME-style sources, general intelligence sources | 10% to 35% per source |
| Image generation | Text-to-image arenas and image-quality leaderboards | 35% to 65% per source |
| Local models | Hardware fit, memory fit, runtime support, Ollama pulls, Hugging Face downloads | Contextual scoring |
| API pricing | Official provider price pages only, normalized when units are comparable | No resale router prices |
| Local test records | Hardware setup, screenshots, raw logs, test JSON, method notes | Measured speed, memory pressure, fit boundary, reproducibility |