Publications

AI safety, reasoning evaluation, multilingual NLP.

2025

Probing Reasoning Flaws and Safety Hierarchies with Chain-of-Thought Difference Amplification

Kamesh R · NeurIPS 2025, LLM-Evals Workshop

Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures

Kamesh R · Preprint

Indian Grammatical Tradition-Inspired Universal Semantic Representation Bank (USR Bank 1.0)

Kamesh R et al. · AACL-IJCNLP 2025, BHASHA Workshop

PerplexMATH: Steering LLMs Toward Mathematical Reasoning

Jerome Francis, Kamesh R, Serena Pei · ICML 2025, NewInML Workshop (Poster)  [poster]