Here we provide our manual classifications of the problems in SWE-bench lite
Note
The classification results differ slightly with the paper as we have performed further manual checking and modified the classifications of few problems.
First install the neccessary graphing utilizies:
cd classification
pip install -r requirements_graph.txtRun the following command to produce Figure 5 in the paper:
python graph_classification.pyThis will produce various graphs: benchmark_bar_location.pdf, benchmark_pie_description.pdf, and benchmark_pie_patch.pdf
For the detailed classifications, we have released our raw classifications for each problem and can be access in swebench_lite_classifications.csv
To obtain the problems in SWE-bench Lite-S, run the following command:
python load_filtered_benchmark.pyThis produces the swebench_lite_s_problems.txt which contains the instance_ids of the subset of problems
Note: we filter out the problems that contain the exact patch in the problem description, misleading solutions, or do not provide enough information in the original issue description.