Leang, J.O.J., Hong G., Li W., Cohen, S.B.: Theorem Prover as a Judge for Synthetic Data Generation, 2025.
Preprint. [arXiv]
Leang, J.O.J., Gema, A.P., Cohen, S.B.: CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning, 2024.
Preprint. [arXiv]
Gema, A.P., Leang, J.O.J., Hong, G., Devoto, A., Mancino, A.C.M., Saxena, R., He, X., Zhao, Y., Du, X., Ghasemi Madani, M.R., Barale, C., McHardy, R., Harris, J., Kaddour, J., van Krieken, E., and Minervini, P.: Are We Done with MMLU?, 2024.
In Proceedings of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), 2025. [arXiv]