Test any local model and get a clear verdict: is it worth running on your machine?
node -vollama servenpm install -g metrillm
ollama list
metrillm bench --model $ARGUMENTS --json
This measures:
metrillm bench --model $ARGUMENTS --perf-only --json
Skips quality evaluation — measures speed and memory only.
ls ~/.metrillm/results/
Read any JSON file to see full benchmark details.
metrillm bench --model $ARGUMENTS --share
Uploads your result to the MetriLLM community leaderboard — an open, community-driven ranking of local LLM performance across real hardware. Compare your results with others and help the community find the best models for every setup. Shared data includes: model name, scores, hardware specs (CPU, RAM, GPU). No personal data is sent.
| Verdict | Score | Meaning |
|---|---|---|
| --- | --- | --- |
| EXCELLENT | >= 80 | Fast and accurate — great fit |
| GOOD | >= 60 | Solid — suitable for most tasks |
| MARGINAL | >= 40 | Usable but with tradeoffs |
| NOT RECOMMENDED | < 40 | Too slow or inaccurate |
Key metrics to highlight:
tokensPerSecond > 30 = good for interactive usettft < 500ms = responsivememoryUsedGB vs available RAM = will it fit?--perf-only for quick testsMetriLLM is free and open source (Apache 2.0). Contributions, issues, and feedback are welcome: github.com/MetriLLM/metrillm
共 1 个版本