Vitthal Bhandari

I am a graduate student in Computational Linguistics at the University of Washington . Before coming to Seattle, I spent more than 4 years in the banking industry as a generalist software engineer working at American Express, Standard Chartered Bank, and PayPal. I completed my Bachelor's in Computer Science and Engineering from BITS Pilani where I also did a minor in Data Science and worked with Prof. Poonam Goyal and Prof. Sundaresan Raman.

Research

Adaptive Memory for LLM Agents over Long-Horizon Settings

Graduate Thesis — In Progress, with Prof. Lucy Wang

Exploring adaptive memory layer solutions for LLM agents over long-horizon settings. Working with Prof. Lucy Wang on replicating memory evals for coding agents with ~1M context, and improving memory recall for long trajectories.

Data Curation for Low-Resource ASR: How Much Data Is Enough?

Follow-up Experiments after ACL + Blog

Asked myself: "How much data is enough data?". Conducted data curation experiments on 6 languages by finetuning only on low-perplexity utterances and found that using high-quality curated data often leads to faster convergence as opposed to full finetuning.

Blog

Voices from the Margins: Modeling Linguistic Diversity in Spontaneous Speech for Low-Resource Languages

Vitthal Bhandari, Tiya Kumar, Kate Mulhern

Oral — ACL 2026 Workshop on Computational Methods for Endangered Languages

Led an end-to-end ASR project on 21 extremely low-resource languages (under 10 hours of spontaneous Mozilla Common Voice speech). Fine-tuned MMS-1b, XLS-R-1b, and Whisper-large, built a KenLM n-gram ARPA pipeline and integrated pyctcdecode beam search with shallow fusion. 4-gram LM decoding beat greedy decoding by up to 27%. Presented as an oral paper at at ACL 2026.

Paper | Code

Language Modeling from Scratch: Experiments with BPE Tokenization

Course Project — Stanford CS 336

Implemented language modelling from scratch using Python and PyTorch. Most interesting finding was how much tokenization can be made more efficient through proper use of data structures and multithreading. See blog and code!

Blog | Code

On the Challenges of Building Datasets for Hate Speech Detection

Vitthal Bhandari

Preprint

This paper presents a comprehensive framework that standardizes the dataset creation pipeline across seven critical checkpoints by identifying systemic challenges in hate speech dataset creation.

arXiv

Leveraging Pretrained Language Models for Detecting Homophobia and Transphobia in Social Media Comments

Vitthal Bhandari and Poonam Goyal

Poster - ACL 2022 Workshop on Language Technology for Equality, Diversity and Inclusion

I contributed to a shared task focused on identifying homophobic and transphobic content in YouTube comments by implementing basic classifiers using multilingual pre-trained language models to analyze English, Tamil, and code-mixed datasets.

Paper | Code

Reviewing the collaborative role of Image processing in retinal imaging

Rehana Khan, Vitthal Bhandari, Sundaresan Raman, Abhishek Vyas, Akshay Raman, Maitreyee Roy and Rajiv Raman

Springer Nature - Teleophthalmology and Digital Health: A Practical Guide to Applications

Paper

Coursework

LING 573: Natural Language Processing Systems and Applications
LING 575: Speech Technology for Endangered Languages
LING 572: Advanced Statistical Methods for Natural Language Processing
LING 571: Deep Processing Techniques for Natural Language Processing
LING 570: Shallow Processing Techniques for Natural Language Processing
LING 575: Societal Impacts of Language Technology
Stanford CS 336: Language Modeling from Scratch

Credits of this template go to source code.