Hi there!

I am a third-year PhD student at NYU, researching the application of Vision-Language Models in Robotics. I earned my Bachelor’s degree in Physics and Master’s in Artificial Intelligence from the Indian Institute of Technology Bombay (India), leveraging my background to develop AI models that can reason about real-world interactions. These models are built on powerful foundation models like ChatGPT, and my research aims to uncover the extent of their real-world understanding. In short—yes, I’m trying to make robots as smart as humans! You can find a very cool demo about my work here.

At my core, I’m a problem solver who likes to solve the toughest of problems. My interests are quite diverse - data science, mathematics, physics, stock markets and I try my best to stay connected with the latest of updates in these areas. You can find my CV here. I am pursuing full-time opportunities starting August 2027 across AI research and applied development, with interest in both advancing core methods and translating them into high-impact real world solutions.

Recent News

  • Feb 2026 — My paper BOP-ASK: Object-Interaction Reasoning for Vision-Language Models has been accepted to CVPR 2026! See you in Denver in June 2026!
  • Dec 2025 — I will be interning at NVIDIA in Summer 2026 with the Robotics Perception team under Stan Birchfield and Jonathan Tremblay.
  • Oct 2025 — My paper MapleGrasp: Mask-guided Feature Pooling for Language-driven Efficient Robotic Grasping has been accepted to WACV 2026! Will be presenting in Tucson in March 2026!
  • Sept 2025 — My paper Grounding LLMs For Robot Task Planning Using Closed-loop State Feedback has been accepted to the Journal of Advanced Robotics Research!