Measuring Coding Challenge Competence With APPS
This is the repository for Measuring Coding Challenge Competence With APPS by Dan Hendrycks*, Steven Basart*, Saurav Kadavath, Mantas Mazeika, Akul Arora, Ethan Guo, Collin Burns, Samir Puranik, Horace He, Dawn Song, and Jacob Steinhardt.
Download the APPS dataset here.
This repository contains evaluation code.
For other benchmarks of enormous Transformers, see a dataset which tests ability in competition MATH, a dataset which tests knowledge of ETHICS, and a dataset spanning 50+ academic subjects.
Citation
If you find this useful in your research, please consider citing
@article{hendrycksapps2021,
title={Measuring Coding Challenge Competence With APPS},
author={Dan Hendrycks and Steven Basart and Saurav Kadavath and Mantas Mazeika and Akul Arora and Ethan Guo and Collin Burns and Samir Puranik and Horace He and Dawn Song and Jacob Steinhardt},
journal={arXiv preprint arXiv:2105.09938},
year={2021}
}