I am currently pursuing a PhD in Computer Science & Engineering at the Indian Institute of Technology Jodhpur. My research focuses on Natural Language Processing and Computer Vision.
I design methods for contextual text error correction in Indic languages, multi-modal knowledge extraction, and low-resource learning.

S Gautam, AS Penamakuri, A Bhandari, G Harit
Proceedings of the 5th Workshop on Multilingual Representation Learning (MRL 2025)
This paper introduces MMCRICBENCH-3K, a benchmark for Visual Question Answering on cricket scorecards designed to evaluate large vision-language models on complex numerical and cross-lingual reasoning over semi-structured tabular images. Empirical results show that state-of-the-art models struggle with structure-aware numerical reasoning and cross-lingual generalization.

S Gautam, A Bhandari, G Harit
Findings of the Association for Computational Linguistics: NAACL 2025
This paper introduces TabComp, a dataset for visual table reading comprehension. The dataset is designed to advance research in understanding and extracting information from tables in documents.

A Bhandari, S Sharma, R Uyyala, R Pal, M Verma
Proceedings of the 11th International Conference on Advances in Information Technology 2020
This paper presents a novel approach to reversible data hiding that uses a multi-layer perceptron for pixel prediction. Reversible data hiding is a technique that allows the original cover media to be perfectly restored after the hidden data has been extracted.