Kecheng Zheng

Kecheng Zheng is currently a researcher at Ant Research working with Yujun Shen. My research focuses on computer vision and deep learning, particularly on multi-modal learning.

From Nov. 2018 to Jan. 2019, I was under the supervision of Wei Wei who is a research scientist in GOOGLE cloud.

From Sep. 2019 to May. 2020, I was an intern at JD AI Lab, working with Wu Liu.

From Jul. 2020 to Jan. 2021, I worked at Intelligent Multimedia Group (IMG) in MSRA as a research intern, under the supervision of Cuiling Lan.

From Mar. 2022 to Jul. 2022, I worked at Ant Research as a research intern working with Deli Zhao.

Email: zkechengzk@gmail.com  /  Google Scholar  /  Github

profile photo
News

  • [03/2025] Six papers accepted by CVPR 2025 (6/6)~

  • [02/2025] Three papers accepted by ICLR 2025 (3/4)~

  • [11/2024] Three papers accepted by NeurIPS 2024 (3/4)~

  • [06/2024] Four papers accepted by ECCV 2024 (4/8)~

    -->
  • Research

    I'm interested in computer vision, representation learning and multi-modal learning.

    Image
    CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
    Arxiv, 2023  
    Image
    Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase
    NeurIPS, 2023  
    Image
    Cones 2: Customizable Image Synthesis with Multiple Subjects
    Zhiheng Liu*, Yifei Zhang*, Yujun Shen, Kecheng Zheng, Kai Zhu, Ruili Feng, Yu Liu, Deli Zhao, Jingren Zhou, Yang Cao
    NeurIPS, 2023  
    Image
    Cones: Concept Neurons in Diffusion Models for Customized Generation.
    Zhiheng Liu*, Ruili Feng*, Kai Zhu, Yifei Zhang, Kecheng Zheng, Yu Liu, Deli Zhao, Jingren Zhou, Yang Cao
    ICML, 2023   Oral Presentation!
    Image
    RLEG: Vision-Language Representation Learning with Diffusion-based Embedding Generation.
    Liming Zhao, Kecheng Zheng, Yun Zheng, Deli Zhao, Jingren Zhou,
    ICML, 2023  
    Image
    Self-Organizing Pathway Expansion for Non-Exemplar Class-Incremental Learning
    Kai Zhu*, Kecheng Zheng*, Ruili Feng, Deli Zhao, Yang Cao, Zheng-jun Zha
    ICCV, 2023  
    Image
    Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models
    Kecheng Zheng*, Wei Wu*, Ruili Feng Kai Zhu, Jiawei Liu, Deli Zhao, Zheng-jun Zha, Wei Chen, Yujun Shen
    ICCV, 2023  
    Image
    Neural Dependencies Emerging from Learning Massive Categories
    Ruili Feng, Kecheng Zheng, Kai Zhu, Yujun Shen, Jian Zhao, Yukun Huang, Deli Zhao, Jingren Zhou, Michael Jordan, Zheng-Jun Zha
    CVPR, 2023  
    Image
    Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection
    Fan Lu, Kai Zhu, Wei Zhai, Kecheng Zheng, Yang Cao
    CVPR, 2023  
    Image
    Rank Diminishing in Deep Neural Networks
    Ruili Feng, Kecheng Zheng, Yukun Huang, Deli Zhao, Michael I. Jordan, Zheng-jun Zha
    NeurIPS, 2022  
    Image
    Uncertainty-Aware Hierarchical Refinement for Incremental Implicitly-Refined Classification
    Jian Yang, Kai Zhu, Kecheng Zheng, Yang Cao
    NeurIPS, 2022  
    Image
    Image
    Unleashing the Potential of Unsupervised Pre-Training with Intra-Identity Regularization for Person Re-Identification
    Zizheng Yang, Xin Jin, Kecheng Zheng, Feng Zhao
    CVPR, 2022  
    ArXiv / Code / bibtex

    We design an Unsupervised Pre-training framework for ReID based on the contrastive learning (CL) pipeline, dubbed UP-ReID.

    Image
    Image
    Cloth-Changing Person Re-identification from A Single Image with Gait Prediction and Regularization
    Xin Jin, Tianyu He, Kecheng Zheng, Zhiheng Ying, Xu Shen, Zhen Huang , Ruoyu Feng , Jianqiang Huang , Xian-Sheng Hua , Zhibo Chen
    CVPR, 2022  
    ArXiv / Code / bibtex

    We focus on handling well the CC-ReID problem under a more challenging setting, i.e., just from a single image, which enables high-efficiency and latency-free pedestrian identify for real-time surveillance applications.

    Image
    Image
    Calibrated Feature Decomposition for Generalizable Person Re-Identification
    Kecheng Zheng, Jiawei Liu, Wei Wu, Liang Li, Zheng-jun Zha
    Arxiv, 2022  
    ArXiv / Code / bibtex

    We propose a simple yet effective Calibrated Feature Decomposition (CFD) module that focuses on improving the generalization capacity for person re-identification through a more judicious feature decomposition and reinforcement strategy.

    Image
    Image
    Debiased Batch Normalization via Gaussian Process for Generalizable Person Re-Identification
    Jiawei Liu, Zhipeng Huang, Kecheng Zheng, Liang Li, Zheng-jun Zha
    AAAI, 2022  
    Arxiv

    We propose a novel Debiased Batch Normalization via Gaussian Process approach (GDNorm) for generalizable person re-identification, which models the feature statistic estimation from BN layers as a dynamically self-refining Gaussian process to alleviate the bias to unseen domain for improving the generalization.

    Image
    Image
    Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-Identification
    Zhipeng Huang, Jiawei Liu, Kecheng Zheng, Liang Li, Zheng-jun Zha
    AAAI, 2022  
    arxiv

    We propose a novel modality-adaptive mixup and invariant decomposition (MID) approach for RGB-infrared person re-identification towards learning modality-invariant and discriminative representations.

    Image
    Image
    Pose-Guided Feature Learning with Knowledge Distillation for Occluded Person Re-Identification
    Kecheng Zheng, Cuiling Lan, Jiawei Liu, Wenjun Zeng, Zhizheng Zhang, Zheng-jun Zha
    ACM MM, 2021  
    ArXiv / bibtex

    We propose a network named Pose-Guided Feature Learning with Knowledge Distillation (PGFL-KD), where the pose information is exploited to regularize the learning of semantics aligned features but is discarded in testing.

    Image
    Image
    Group-aware Label Transfer for Domain Adaptive Person Re-identification
    Kecheng Zheng, Wu Liu, Lingxiao He, Jiebo Luo, Tao Mei, Zheng-jun Zha
    CVPR, 2021  
    ArXiv / Code / bibtex

    We propose a Group-aware Label Transfer (GLT) algorithm, which enables the online interaction and mutual promotion of pseudo-label prediction and representation learning.

    Image
    Image
    Spatial-Temporal Correlation and Topology Learning for Person Re-Identification in Videos
    Jiawei Liu, Zheng-jun Zha, Wei Wu, Kecheng Zheng, Qibin Sun
    CVPR, 2021   Oral Presentation!
    ArXiv / bibtex

    We propose a novel Spatial-Temporal Correlation and Topology Learning framework (CTL) to pursue discriminative and robust representation by modeling cross-scale spatial-temporal correlation.

    Image
    Image
    Disentanglement-based Cross-Domain Feature Augmentation for Effective Unsupervised Domain Adaptive Person Re-identification
    Zhizheng Zhang , Cuiling Lan, Wenjun Zeng, Quanzeng You , Zicheng Liu , Kecheng Zheng, Zhibo Chen
    ArXiv, 2021  
    ArXiv / code(coming soon) / bibtex

    We propose a Disentanglement-based Cross-Domain Feature Augmentation (DCDFA) strategy, where the augmented features characterize well the target and source domain data distributions while inheriting reliable identity labels.

    Image
    Image
    Exploiting Sample Uncertainty for Domain Adaptive Person Re-Identification
    Kecheng Zheng, Cuiling Lan, Wenjun Zeng, Zhizheng Zhang, Zheng-jun Zha
    AAAI, 2021  
    ArXiv / Code / bibtex

    We propose to estimate and exploit the credibility of the assigned pseudo-label of each sample to alleviate the influence of noisy labels.

    Image
    Image
    Memory Enhanced Embedding Learning for Cross-Modal Video-Text Retrieval
    Rui Zhao*, Kecheng Zheng*, Zheng-jun Zha, Hongtao Xie, Jiebo Luo
    *Equal Contribution
    ArXiv, 2020  
    ArXiv / bibtex

    We propose a novel memory enhanced embedding learning (MEEL) method for video-text retrieval.

    Image
    Image
    Hierarchical Gumbel Attention Network for Text-based Person Search
    Kecheng Zheng, Wu Liu, Jiawei Liu, Tao Mei, Zheng-jun Zha
    ACM MM, 2020  
    Acm / bibtex

    We propose a novel hierarchical Gumbel attention network for text-based person search via Gumbel top-k re-parameterization algorithm.

    Image
    Image
    Abstract Reasoning with Distracting Features
    Kecheng Zheng, Wei Wei, Zheng-jun Zha
    NeurIPS, 2019  
    ArXiv / Code / bibtex

    We first illustrate that one of the main challenges in such a reasoning task is the presence of distracting features, which requires the learning algorithm to leverage counterevidence and to reject any of the false hypotheses in order to learn the true patterns.

    Image
    Image
    STACKED CONVOLUTIONAL DEEP ENCODING NETWORK FOR VIDEO-TEXT RETRIEVAL
    Rui Zhao*, Kecheng Zheng*, Zheng-jun Zha
    *Equal Contribution
    ICME, 2020  
    ArXiv / bibtex

    We propose a stacked convolutional deep encoding network for video-text retrieval task, which considers to simultaneously encode long-range and short-range dependency in the videos and texts.

    Image
    Image
    LA-Net: Layout-Aware Dense Network for Monocular Depth Estimation
    Kecheng Zheng, Zheng-jun Zha, Yang Cao, Xuejin Chen, Feng Wu
    ACM MM, 2018   Oral Presentation!
    Acm / bibtex

    We propose a novel Layout-Aware Convolutional Neural Network (LA-Net) for accurate monocular depth estimation by simultaneously perceiving scene layout and local depth details.

    CodeBase

    CoDeF, GitHub stars

    FastReID: a Pytorch Toolbox for General Instance Re-identification, GitHub stars

    Carver: Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase,GitHub stars

    Cones 2, GitHub stars

    Cones, GitHub stars

    Academic Services
    Journal reviewer: IJCV, TNNLS, TMM, YCSVT

    Conference reviewer: CVPR, ICCV, ECCV, NeurIPS, ICML, ACM MM, AAAI

    Feel free to steal this website's source code.