"Extended Abstract: Graduate Student Descent Considered Harmful? A Proposal for Studying Overfitting in Reward Functions", The Multi-disciplinary Conference on Reinforcement Learning and Decision Making, Providence, RI, 2022.
"Extended Abstract: Partial Return Poorly Explains Human Preferences", The Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM), Providence, RI, 2022.
"The Irrationality of Neural Rationale Models", Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2nd Workshop on Trustworthy Natural Langauge Processing (TrustNLP), 07/2022.
"Revisiting Human-Robot Teaching and Learning Through the Lens of Human Concept Learning Theory", ACM/IEEE International Conference on Human-Robot Interaction (HRI), 03/2022.
"Set-based State Estimation with Probabilistic Consistency Guarantee under Epistemic Uncertainty", IEEE Robotics and Automation Letters (RA-L), vol. 7, issue 3, 03/2022.
"The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task Specifications", Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI), Washington, D.C. , 02/2023.
"The Solvability of Interpretability Evaluation Metrics", Findings of the Association for Computational Linguistics: EACL: Association for Computational Linguistics, 05/2023.