Poor Paul's Benchmark

Open LLM inference benchmarks on real consumer hardware.
The raw data lives on Hugging Face. This site is the easiest way to get insights.

4773

Benchmark Results

Models

GPUs

Contributors

Leaderboard

See which model + GPU combos deliver the best throughput and lowest latency.

Interactive charts to visualize throughput, latency, and hardware comparisons.

Benchmark writeups, analysis posts, and hardware deep dives.

Want to contribute benchmarks from your own hardware?

Data last updated: March 12, 2026 at 08:18 AM