DeepSWE Preview — Benchmarks
Benchmark scores for DeepSWE Preview, aggregated from public leaderboards, together with its rank among open models and among all listed models. See hardware requirements for what you need to run it.
Coding
| Benchmark | Score | Rank (open models) | Rank (all models) |
|---|---|---|---|
| SWE-bench Verified | 58.8% | #10 / 13 | #79 / 163 |
Scores aggregated from public benchmark sources (each linked from the benchmark pages). llmrun does not run these benchmarks.
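To compare the two rank columns, which are taken over pools of different sizes, it can help to convert each rank into the share of models it places ahead of. The sketch below is illustrative only: it assumes rank 1 is the best score and that there are no ties, and the helper names `parse_rank` and `outranked_share` are hypothetical, not part of any llmrun tooling.

```python
import re


def parse_rank(cell: str) -> tuple[int, int]:
    """Parse a table cell such as '#10 / 13' into (rank, total)."""
    match = re.fullmatch(r"\s*#(\d+)\s*/\s*(\d+)\s*", cell)
    if match is None:
        raise ValueError(f"unrecognized rank cell: {cell!r}")
    rank, total = int(match.group(1)), int(match.group(2))
    if not 1 <= rank <= total:
        raise ValueError(f"rank {rank} is outside 1..{total}")
    return rank, total


def outranked_share(rank: int, total: int) -> float:
    """Fraction of the pool ranked below this model (assumes rank 1 is best, no ties)."""
    return (total - rank) / total


if __name__ == "__main__":
    # Rank cells copied from the SWE-bench Verified row above.
    for label, cell in [("open models", "#10 / 13"), ("all models", "#79 / 163")]:
        rank, total = parse_rank(cell)
        print(f"{label}: ahead of {outranked_share(rank, total):.1%} of the pool")
```

Under these assumptions, the SWE-bench Verified row works out to roughly 23.1% of open models and 51.5% of all listed models placing below DeepSWE Preview.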