Publications

Conference Papers

Published in MINT@NeurIPS, 2024

We formulate a synthetic testbed to stress-test the sparse autoencoder (SAE) approach to interpretability in the text domain, using formal languages.

Authors: Abhinav Menon, Manish Shrivastava, Ekdeep Singh Lubana, David Krueger
Download Paper

Published in ECML PKDD, 2023

We categorize and present a dataset of factual inconsistencies, along with neural baselines for the classification of these inconsistencies.

Authors: Tathagata Raha, Mukund Choudhary, Abhinav Menon, Harshit Gupta, KV Aditya Srivatsa, Manish Gupta, Vasudeva Varma
Download Paper