Evaluation of performance metrics for code generation models
This project aims to understand how to correctly evaluate code generation models, compare existing performance metrics, and introduce a new metric that better correlates with human judgment.