🎯 PDB Leaderboard

Precision-aware evaluation of LLM debugging capabilities

PDB-Single-Hard Leaderboard

 

# Model Precision â–¼ Recall Pass@1

PDB-Multi Leaderboard

 

# Model Precision â–¼ Recall Pass@1

Leaderboard Submission

To submit a new model to the PDB leaderboard, please send an email to wangzhu@usc.edu with the subject line PDB submission: <new model name>, <set name>, where <set name> is one of the evaluation sets (e.g. PDB-Multi or PDB-Single-Hard), and attach both the generated result file and the score file produced by our evaluation pipeline. We will verify the submission against our evaluation protocol and update the leaderboard once the results are confirmed.