
Blog
AI research and insights
SWE-rebench dataset: More than 21,000 verifiable tasks for SWE agents
SWE-rebench dataset: More than 21,000 verifiable tasks for SWE agents
Our AI R&D team announces the open-source release of the SWE-rebench dataset of more than 21,000 real-world, interactive software engineering tasks. For a detailed methodology and technical report, please see our accompanying paper on arXiv.




.jpg?cache-buster=2025-07-04T11:51:51.357Z)

.png?cache-buster=2025-07-08T08:57:44.740Z)





