Profile Image
dibyadip (at) u.nus.edu

I am a second-year PhD Student in the School of Computing at the National University of Singapore. I am a part of CVML@NUS advised by Angela Yao. My graduate research is supported by the President's Graduate Fellowship.

In the summer of 2024, I interned at Meta Reality Labs, hosted by Fadime Sener. I completed my bachelor's in ECE from Jadavpur University where I worked with Sanjoy Kumar Saha and Ananda S. Chowdhury. My bachelor's thesis was on Open-Set Metric Learning for Person Re-identification in the Wild.

Broadly, I am interested in understanding how humans perceive the 4D (3D+time) world. My current research focuses on developing large vision-language models for video understanding and generation, as well as 4D modeling of human-object interactions.

I’m currently looking for an intern to work on analyzing hallucinations in VideoLLMs. Master's or senior undergrads with experience in LLMs/VLMs are preferred. Feel free to reach out if you're interested!

News

Publications & Preprints

N/A
Streaming VideoLLMs for Real-Time Procedural Video Understanding
Dibyadip Chatterjee, Edoardo Remelli, Yale Song, Bugra Tekin, Abhay Mittal, Bharat Bhatnagar, Necati Cihan Camgöz, Shreyas Hampali, Eric Sauser, Shugao Ma, Angela Yao, Fadime Sener
Coming Soon
N/A
On the Utility of 3D Hand Poses for Action Recognition
Md Salman Shamil, Dibyadip Chatterjee, Fadime Sener, Shugao Ma, Angela Yao
ECCV 2024
N/A
Opening the Vocabulary of Egocentric Actions
Dibyadip Chatterjee, Fadime Sener, Shugao Ma, Angela Yao
NeurIPS 2023
Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities
Fadime Sener, Dibyadip Chatterjee, Daniel Shelepov, Kun He, Dipika Singhania, Robert Wang, Angela Yao
N/A
Technical Report: Temporal Aggregate Representations
Fadime Sener, Dibyadip Chatterjee, Angela Yao
arXiv 2021
N/A
Open-set Metric Learning for Person Re-Identification in the Wild
Arindam Sikdar, Dibyadip Chatterjee, Arpan Bhowmik, Ananda S. Chowdhury
ICIP 2020

Academic Service

  •  Conference Reviewer: CVPR, ICCV, ECCV, BMVC, ACCV, AAAI, TPAMI, IJCV
  •  Teaching Assistant: CS4243 (Computer Vision and Pattern Recognition), BT3017 (Feature Engineering for Machine Learning)