Evaluating GRPO and DPO for Faithful Chain-of-Thought Reasoning in LLMs
arXiv preprint arXiv:2512.22631
Research in Explainable AI, NLP, and Human-AI Collaboration
arXiv preprint arXiv:2512.22631
Proceedings of the 15th Joint Conference on Lexical and Computational Semantics (*SEM 2026), ACL 2026, pp. 497–515
Proceedings of the 2nd LUHME Workshop, ECAI 2025, 30–39
Proceedings of the 6th Workshop on Gender Bias in Natural Language Processing (GeBNLP), ACL 2025, pp. 92–104
† Equal contribution.
Computational Linguistics in the Netherlands Journal, Vol. 15 (2026), pp. 59–77
arXiv preprint arXiv:2506.04050
arXiv preprint arXiv:2502.00837
Strategic Organization (under revision)
arXiv preprint arXiv:2412.00962
arXiv preprint arXiv:2412.00956
2024
Applied Sciences, Volume 14, Issue 19, 8620, 2024
Computational Linguistics in the Netherlands Journal, Volume 13, 233–259, 2024
CLEF 2023: Conference and Labs of the Evaluation Forum, Working Notes, 1000–1011
Tehran: Arshadan Publication, 2022 [In Persian]
International Journal of Environmental Research and Public Health, Volume 19, Issue 22, 15036, 2022