2023
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Chaitanya Ryali*, Yuan-Ting Hu*, Daniel Bolya*, Chen Wei, Haoqi Fan, Po-Yao Huang, Vaibhav Aggarwal, Arkabandhu Chowdhury, Omid Poursaeed, Judy Hoffman, Jitendra Malik, Yanghao Li* , Christoph Feichtenhofer*
International Conference on Machine Learning (ICML ), 2023 (Oral )
Paper
/
Code
Scaling Language-Image Pre-training via Masking
Yanghao Li* , Haoqi Fan*, Ronghang Hu*, Christoph Feichtenhofer†, Kaiming He†
Computer Vision and Pattern Recognition (CVPR ), 2023
Paper
/
Code
2022
Masked Autoencoders As Spatiotemporal Learners
Christoph Feichtenhofer*, Haoqi Fan*, Yanghao Li , Kaiming He
Conference on Neural Information Processing Systems (NeurIPS ), 2022
Paper
/
Code
Exploring Plain Vision Transformer Backbones for Object Detection
Yanghao Li , Hanzi Mao, Ross Girshick*, Kaiming He*
European Conference on Computer Vision (ECCV ), 2022
Paper
/
Code
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection
Yanghao Li* , Chao-Yuan Wu*, Haoqi Fan, Karttikeya Mangalam, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer*
Computer Vision and Pattern Recognition (CVPR ), 2022
Paper
/
Code
Masked Autoencoders are Scalable Vision Learners
Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li , Piotr Dollár, Ross Girshick
Computer Vision and Pattern Recognition (CVPR ), 2022 (Oral ). Best Paper Nominee
Paper
/
Code
MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
Chao-Yuan Wu*, Yanghao Li* , Karttikeya Mangalam, Haoqi Fan, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer*
Computer Vision and Pattern Recognition (CVPR ), 2022 (Oral )
Paper
Reversible Vision Tranformers
Karttikeya Mangalam, Haoqi Fan, Yanghao Li , Chao-Yuan Wu, Bo Xiong, Christoph Feichtenhofer, Jitendra Malik
Computer Vision and Pattern Recognition (CVPR ), 2022 (Oral ).
Paper
/
Code
Ego4d: Around the world in 3,000 hours of egocentric video
Kristen Grauman et al.
Computer Vision and Pattern Recognition (CVPR ), 2022 (Oral ). Best Paper Nominee
Paper
/
Website
2021
Benchmarking Detection Transfer Learning with Vision Transformers
Yanghao Li , Saining Xie, Xinlei Chen, Piotr Dollár, Kaiming He, Ross Girshick
Tech report , 2021
Paper
Multiscale Vision Transformers
Haoqi Fan*, Bo Xiong*, Karttikeya Mangalam*, Yanghao Li* , Zhicheng Yan, Jitendra Malik, Christoph Feichtenhofer*
International Conference on Computer Vision (ICCV ), 2021
Paper
/
Code
Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos
Yanghao Li , Tushar Nagarajan, Bo Xiong, Kristen Grauman
Computer Vision and Pattern Recognition (CVPR ), 2021
Paper
/
Code
PyTorchVideo: A Deep Learning Library for Video Understanding
Haoqi Fan*, Tullie Murrell*, Heng Wang‡, Kalyan Vasudev Alwala‡, Yanghao Li‡ , Yilei Li‡, Bo Xiong ‡,
Nikhila Ravi, Meng Li, Haichuan Yang, Jitendra Malik, Ross Girshick, Matt Feiszli, Aaron Adcock†, Wan-Yen Lo†, Christoph Feichtenhofer†
ACM International Conference on Multimedia (ACM MM ), 2021
Paper
/
Code
/
Website
Previous
EGO-TOPO: Environment Affordances from Egocentric Video
Tushar Nagarajan, Yanghao Li , Christoph Feichtenhofer, Kristen Grauman
Computer Vision and Pattern Recognition (CVPR ), 2020 (Oral )
Paper
/
Code
/
Website
Scale-Aware Trident Networks for Object Detection
Yanghao Li* , Yuntao Chen*, Naiyan Wang, Zhaoxiang Zhang
International Conference on Computer Vision (ICCV ), 2019 (Oral )
Paper
/
MXNet Code
/
Detectron2 Code
Temporal Bilinear Networks for Video Action Recognition
Yanghao Li , Sijie Song, Yuqi Li, Jiaying Liu
Association for the Advancement of Artificial Intelligence (AAAI ), 2019 (Oral )
Paper
Adaptive Batch Normalization for Practical Domain Adaptation
Yanghao Li , Naiyan Wang, Jianping Shi, Xiaodi Hou, and Jiaying Liu
Pattern Recognition, 2018
Paper
/
ICLR workshop
/
Webpage
Factorized Bilinear Models for Image Recognition
Yanghao Li , Naiyan Wang, Jiaying Liu, Xiaodi Hou
International Conference on Computer Vision (ICCV ), 2017
Paper
/
Code
Demystifying Neural Style Transfer
Yanghao Li , Naiyan Wang, Jiaying Liu, Xiaodi Hou
International Joint Conference on Artificial Intelligence (IJCAI ), 2017 (Oral )
Paper
/
Code
Online Human Action Detection using Joint Classification-Regression Recurrent Neural Networks
Yanghao Li , Cuiling Lan, Junliang Xing, Wenjun Zeng, Chunfeng Yuan, Jiaying Liu
European Conference on Computer Vision (ECCV ), 2016
Paper
/
Website
PySlowFast: video understanding codebase for state-of-the-art research
Haoqi Fan, Yanghao Li , Wan-Yen Lo, Christoph Feichtenhofer
PyTorchVideo: A Deep Learning Library for Video Understanding
Haoqi Fan*, Tullie Murrell*, Heng Wang‡, Kalyan Vasudev Alwala‡, Yanghao Li‡ , Yilei Li‡, Bo Xiong ‡,
Nikhila Ravi, Meng Li, Haichuan Yang, Jitendra Malik, Ross Girshick, Matt Feiszli, Aaron Adcock†, Wan-Yen Lo†, Christoph Feichtenhofer†
Paper
/
Post
SimpleDet - A Simple and Versatile Framework for Object Detection and Instance Recognition
Yuntao Chen, Chenxia Han, Yanghao Li , Zehao Huang, Yi Jiang, Naiyan Wang, Zhaoxiang Zhang
Paper
Last update: Apr. 2022      Template