I am a fifth year PhD student at Center for Research in Computer Vision (CRCV), University of Central Florida (UCF), under the supervision of Prof. Yogesh Singh Rawat.
I have a broad interest in deep learning and computer vision. My current research mainly focuses on data-efficient approaches for dense video tasks.
Looking for full-time positions (Jan'26)! Feel free to drop me an email.
May'25: Started internship at Amazon, Bellevue, WA Apr'25: Received Doctoral Research Support Award from UCF. 💥 Mar'25:Student Travel Award grant to attend ICLR 2025 Mar'25:STPro accepted at CVPR 2025 💥 Feb'25:Student Travel Award grant to attend WACV 2025 Feb'25:Student Travel Award grant to attend AAAI 2025 Jan'25:CoSPaL (First author paper) accepted at ICLR 2025 💥 Jan'25:Selected for Doctoral consortium at IEEE/CVF WACV 2025 💥 Dec'24:SMT (First author paper) accepted at AAAI 2025 💥 May'24: Started internship at Amazon, Palo Alto, CA Dec'23:SSL-AL accepted at AAAI 2024 💥 Oct'23:Benchmark-SSL (First author paper) accepted at NeurIPS Self-Superivsed Workshop 2024 Mar'23:MAMA-VAD accepted at CVPR Workshops 2022 Mar'22:E2E-SSL (First author paper) accepted at CVPR 2022 💥
Developed first vision language models (VLMs) for dense multimodal video detection task without any labels. Devised context aware and self-paced progressive scene learning approach.
Learning from mistakes on labelled set and transfer that learning to pseudo labels from unlabeled set to enhance spatio-temporal localization.
Class-agnostic spatio-temporal refinement module and temporal coherency constraint for better spatio-temporal localization.
First exhaustive study on impact of pre-training in self-supervised learning for videos. Proposed a simple knowledge distillation
approach outperforming previous works with 90% less videos.
First end-to-end semi-supervised approach for video action detection task. Short-term and long-term smoothness constraints to exploit spatio-temporal coherency.
Video Action Detection: Analysing Limitations and Challenges
Rajat Modi, Aayush Rana, Akash Kumar,
Praveen Tirupattar, Shruti Vyas, Yogesh Singh Rawat, Mubarak Shah
Computer Vision and Pattern Recognition Conference (CVPR Workshops), 2022 1st Workshop on Vision Datasets Understanding paper
Developed new spatio-temporal surveillance based dataset for real-world challenges.
Publications (Funding projects)
Below is a list of my works (in chronological order) for funding projects.
Benchmarking Robustness of Gait Recognition Models
Reeshoon Sayera, Sirshapan Mitra, Prudvi Kamtam, Akash Kumar, Yogesh Singh Rawat
Under review
Investigate the robustness of gait recognition models against perturbations and corruptions, focusing on both key components: the parsing model and the gait recognition model.
Gait recognition under limited labels settings: A generalized approach
Sirshapan Mitra, Akash Kumar, Yogesh Singh Rawat
Under review
A versatile solution applicable to all limited label settings (semi-supervised & domain adaptation), via low-dimensional clustering and knowledge distillation.
Feel free to steal this website's source code. Do not scrape the HTML from this page itself, as it includes analytics tags that you do not want on your own website — use the github code instead. Also, consider using Leonid Keselman's Jekyll fork of this page.