Bharath Hariharan

I am an associate professor in Computer Science at Cornell University. I work on computer vision and machine learning, in particular on important problems that defy the "Big Data" label. I enjoy problems that require marrying advances in machine learning with insights from computer vision, geometry and domain-specific knowledge. A sampling of the research problems my group works on is presented below; an exhaustive list of publications is available on scholar.


My work has been recognized with an NSF CAREER award and a PAMI Young Researcher Award.

My CV is here and my research statement is here.



Note to prospective PhD students: Admissions at Cornell are done through a committee. If you are interested in working with me, please directly apply through the application website and mention my name

Image
Associate Professor
311 Gates Hall
Cornell University
bharathh-AT-cs-DOT-cornell-DOT-edu

Teaching

PhD students

Former PhD students

Research

Recognition for satellite images and earth science

A variety of scientific disciplines, including environmental science and the earth sciences, need to know what is there in any place on the planet at any time. This requires recognition on satellite images as well as combining information from multiple modalities (satellite, aerial and ground) captured at the same location. Recognition on satellite images is in itself also a fundamental challenge given the absence of large labeled datasets. As part of this project, we have built one of the most accurate foundation vision-language model for satellite images as well as new self-supervised representations for satellite images. See our work here.
Image

4D Reconstruction and recognition

Humans live in a 4D world that is constantly changing. Yet computer vision systems are classically focused on static images or static scenes. Even work on dynamic scenes or video understanding focuses on short time horizons. We are beginning to explore novel tracking formulations that track individual pixels through long-term occlusions, or objects through state change. We are also exploring new benchmarks for multimodal video understanding and new architectures that can efficiently represent and understand long complex videos.
Image

Representative recent publications


    Tracking and Understanding Object Transformations
    Yihong Sun, Xinyu Yang, Jennifer J. Sun, Bharath Hariharan
    In NeurIPS 2025.
    paper    bibtex
    MONITRS: Multimodal Observations of Natural Incidents Through Remote Sensing
    Shreelekha Revankar, Utkarsh Mall, Cheng Perng Phoo, Kavita Bala, Bharath Hariharan
    In NeurIPS 2025.
    paper    bibtex
    DiSciPLE: Learning Interpretable Programs for Scientific Visual Discovery
    Utkarsh Mall, Cheng Perng Phoo, Mia Chiquier, Bharath Hariharan, Kavita Bala, Carl Vondrick
    In CVPR 2025.
    paper    bibtex
    3D Synthesis for Architectural Design
    I-Ting Tsai, Bharath Hariharan
    In WACV 2025.
    paper    bibtex
    Scale-aware Recognition in Satellite Images under Resource Constraints
    Shreelekha Revankar, Cheng Perng Phoo, Utkarsh Mall, Bharath Hariharan, Kavita Bala
    In ICLR 2025.
    paper    bibtex
    MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos
    Yihong Sun, Bharath Hariharan
    In ECCV 2024.
    paper    bibtex
    Remote sensing vision-language foundation models without annotations via ground remote alignment
    Utkarsh Mall, Cheng Perng Phoo, Meilin Kelsey Liu, Carl Vondrick, Bharath Hariharan, Kavita Bala
    In ICLR 2024.
    paper    bibtex
    Tracking Everything Everywhere All at Once
    Qianqian Wang, Yen-Yu Chang, Ruojin Cai, Zhengqi Li, Bharath Hariharan, Aleksander Holynski, Noah Snavely
    In ICCV 2023 (Best Student Paper)
    paper    bibtex
    Emergent Correspondence from Image Diffusion
    Luming Tang, Menglin Jia, Qianqian Wang, Cheng Phoo, Bharath Hariharan
    In NeurIPS 2023.
    paper    bibtex
    Visual Prompt Tuning
    Menglin Jia, Luming Tang, Bor-Chun Chen, Claire Cardie, Serge Belongie, Bharath Hariharan, Ser-Nam Lim
    In ECCV 2022.
    paper    bibtex