Image
Shamit Lal

I am a Senior Applied Scientist in the AGI team at Amazon, where I work on advancing research in image generation/editing diffusion models and multimodal large language models. I am a core contributor of Amazon's flagship foundational models: Nova 2, Nova 1, and Titan Image Generator models.
Previously, I worked as a researcher at Fyusion, where I worked on problems related to 6DOF pose estimation, object detection and segmentation, and distillation of large models.
Even before that, I completed my masters in computer vision (MSCV) at Carnegie Mellon University advised by Professor Katerina Fragkiadaki.


News


Dec 2025
Amazon Nova 2 launched at Reinvent 2025
Dec 2024
Amazon Nova 1 launched at Reinvent 2024
Aug 2024
Amazon Titan Image Generator V2 launched
Dec 2023
Amazon Titan Image Generator launched at Reinvent 2023
Mar 2021
CoCoNets accepted at CVPR 2021
Jan 2021
2 papers (Disentangling 3D Prototypical Networks and HyperDynamics) accepted at ICLR 2021
Oct 2020
3D-OES accepted at CoRL 2020
Oct 2020
Disentangling 3D Prototypical Networks paper accepted as at NeurIPS ORLR workshop 2020 (oral).
May 2020
3DQ-Nets accepted at CVPR workshop 2020.
Aug 2019
Started my masters program at Carnegie Mellon University.
Mar 2019
Promoted to SDE-2 at Amazon.
Nov 2018
Online Video Summarization accepted at WACV 2019.
Oct 2017
Our paper Image Colorization Using Adversarial Training got accepted at ICSPS 2017.
Aug 2017
Graduated from DTU, joining Amazon India as SDE-1.
Dec 2016
ACM ICPC India Finalist (National Rank 28).

Released Models and Tech Reports


Image
Amazon Nova 2: Multimodal Reasoning and Generation Models
Amazon Artificial General Intelligence
Dec 2025
Image
Amazon Nova 1: The Amazon Nova Family of Models: Technical Report and Model Card
Amazon Artificial General Intelligence
Dec 2024
Image
Amazon Titan Image Generator V2
Amazon AWS AI Labs
Aug 2024
Image
Amazon Titan Image Generator
Amazon AWS AI Labs
Dec 2023

Publications and Pre-Prints


Image
Efficient scaling of diffusion transformers for text-to-image generation
Hao Li, Shamit Lal, Zhiheng Li, Yusheng Xie, Ying Wang, Yang Zou, Orchid Majumder, R Manmatha, Zhuowen Tu, Stefano Ermon, Stefano Soatto, Ashwin Swaminathan
Dec 2024
Image
CoCoNets: Continuous Contrastive 3D Scene Representations
Shamit Lal*, Mihir Prabhudesai*, Ishita Mediratta, Adam W Harley, Katerina Fragkiadaki (* = Equal Contribution)
CVPR 2021
Image
Disentangling 3D Prototypical Networks For Few-Shot Concept Learning
Mihir Prabhudesai*, Shamit Lal*, Darshan Patil*, Hsiao-Yu Tung, Adam W Harley, Katerina Fragkiadaki (* = Equal Contribution)
ICLR 2021 | Neurips 2020 ORLR Workshop (Oral)
Image
HyperDynamics: Meta-learning object and agent dynamics with hypernetworks
Zhou Xian, Shamit Lal, Hsiao-Yu Fish Tung, Emmanouil Antonios Platanios, Katerina Fragkiadaki
ICLR 2021
Image
3D-OES: Viewpoint-Invariant Object-Factorized Environment Simulators
Hsiao-Yu Fish Tung*, Zhou Xian*, Mihir Prabhudesai, Shamit Lal, Katerina Fragkiadaki (* = Equal Contribution)
CoRL 2020
Image
3D Object Recognition By Corresponding and Quantizing Neural 3D Scene Representations
Mihir Prabhudesai*, Shamit Lal*, Hsiao-Yu Fish Tung, Adam W. Harley, Shubhankar Potdar, Katerina Fragkiadaki (* = Equal Contribution)
Arxiv (full paper in submission) | CVPR Workshop 2020
Image
Online Video Summarization: Predicting Future to Better Summarize Present
Shamit Lal*, Shivam Duggal*, Indu Sreedevi (* = Equal Contribution)
WACV 2019
Image
Automatic image colorization using adversarial training
Shamit Lal, Vineet Garg, O.P.Verma
ICSPS 2017