Publications

2025

  1. PiCSAR: Probabilistic Confidence Selection And Ranking for Reasoning Chains
    Joshua Ong Jun Leang, Zheng Zhao, Aryo Pradipta Gema, Sohee Yang, and 6 more authors
    arXiv preprint, 2025
  2. CoMAT: Chain of mathematically annotated thought improves mathematical reasoning
    Joshua Ong Jun Leang, Aryo Pradipta Gema, and Shay B Cohen
    In Proceedings of the Empirical Methods in Natural Language Processing (EMNLP), 2025
  3. Theorem Prover as a Judge for Synthetic Data Generation
    Joshua Ong Jun Leang, Giwon Hong, Wenda Li, and Shay B Cohen
    In Proceedings of the Association for Computational Linguistics (ACL), 2025
  4. Are We Done with MMLU?
    Aryo Pradipta Gema, Joshua Ong Jun Leang, Giwon Hong, Alessio Devoto, and 7 more authors
    In Proceedings of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), 2025