You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
docs: update README + website copy for multi-source benchmarks
README and the landing page still advertised "~400 entries from
Artificial Analysis". Now: 4 sources credited (AA, Epoch AI, Arena,
LLM Stats), and the website's BENCHMARK_COUNT derives from the four
v2 data files (sum of model rows, ~1,062) instead of the frozen
legacy benchmarks.json lane.
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
~400 entries with quality indexes, speed, and pricing. Compare mode with head-to-head tables, scatter plots, and radar charts. Filter by creator, region, type, reasoning, and open/closed source.
89
+
~1,000 entries across 4 switchable data sources (Artificial Analysis, Epoch AI, Arena, LLM Stats) with quality indexes, Elo ratings, speed, and pricing. Compare mode with head-to-head tables, scatter plots, and radar charts. Choose visible metric columns, cycle a field-average/peer-average/rank comparator in the detail panel, and refresh any source in-app. Filter by creator, region, type, reasoning, and open/closed source.
90
90
91
91
[Benchmarks wiki page](https://github.com/reyamira/models/wiki/Benchmarks)• CLI: `models benchmarks list`, `models benchmarks show`
92
92
@@ -118,7 +118,7 @@ Full documentation lives in the [wiki](https://github.com/reyamira/models/wiki):
118
118
## Data Sources
119
119
120
120
-**Models**: [models.dev](https://models.dev) by [SST](https://github.com/sst/models.dev)
-**Agents**: Curated catalog in [`data/agents.json`](data/agents.json) — contributions welcome!
123
123
-**Status**: Official provider status pages ([Statuspage](https://www.atlassian.com/software/statuspage), [BetterStack](https://betterstack.com), [Instatus](https://instatus.com), [incident.io](https://incident.io), and more)
0 commit comments