Probing Reasoning Flaws and Safety Hierarchies with Chain-of-Thought Difference Amplification
Kamesh R · NeurIPS 2025, LLM-Evals Workshop
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures
Kamesh R · Preprint
Indian Grammatical Tradition-Inspired Universal Semantic Representation Bank (USR Bank 1.0)
Kamesh R et al. · AACL-IJCNLP 2025, BHASHA Workshop
PerplexMATH: Steering LLMs Toward Mathematical Reasoning
Jerome Francis, Kamesh R, Serena Pei · ICML 2025, NewInML Workshop (Poster) [poster]