I am a postdoctoral associate at University of Maryland, College Park working with Furong Huang and Tom Goldstein. I obtained my Ph.D. from Electrical and Computer Engineering at the University of California, Riverside, under the guidance of M. Salman Asif. Originally from Wuhan, the "River City" of China, I completed my B.S. in Automation at Wuhan University of Technology. I am currently on the job market for faculty and research positions starting in Fall 2026.

My research focuses on understanding and enabling reliable long-horizon behavior in autonomous agents and robotic systems, with an emphasis on memory, adaptation, and robustness through interaction with the environment.

Research Interests

    World Modeling and Multimodal Reasoning
  • Multi-modal reasoning across vision, language, and action [ MORSE, Zebra-COT ]
  • Embodied AI for navigation and manipulation in robotics [ TraceGen ]
  • Safety, Alignment, and Model Adaptation
  • Agentic safety and alignment in multi-agent ecosystems [ AegisLLM ]
  • Controllable and safe AI via model editing and unlearning [ SLUG, Robust Eval ]
  • Enhancing user privacy via de-identification while preserving utility [ Disguise ]
  • Robust Perception and Decision-Making
  • Exposing model vulnerabilities via efficient blackbox attacks [ BASES, EBAD, Context-Aware-Attacks, ZQA ]
  • Detecting adversarial attacks using language models [ SCENE-Lang ]
  • Improving physical sensing via end-to-end optimization [ Learn PR, Learn CDI ]

News

  • [Mar. 2026] Low-Frequency Trap on VLM Temporal Reasoning Evaluation has been accepted to ICLR 2026 ICBNB @ Rio de Janeiro.
  • [Feb. 2026] TraceGen on 3D Trace Generation for Robotic Manipulation has been accepted to CVPR 2026 @ Denver.
  • [Jan. 2026] Zebra-CoT on Interleaved Vision-Language Reasoning has been accepted to ICLR 2026 @ Rio de Janeiro.
  • [Jun. 2025] Our paper on Efficient Training for VLM has been accepted to ICCV 2025 @ Honolulu.
  • [Jun. 2025] Our paper on Model Tampering Attack for Robust Evaluation has been accepted to TMLR.
  • [May 2025] Our paper SLUG on Efficient Unlearning has been accepted to ICML 2025 @ Vancouver.
  • [Mar. 2025] AegisLLM on Agentic System for Safety has been accepted to ICLR 2025 BuildingTrust @ Singapore.
  • [Oct. 2024] Our papers on Efficient and Robust Unlearning are accepted to NeurIPS 2024 SafeGenAI @ Vancouver.
  • [Sept. 2024] Our paper on Cross-modality Safety Alignment is accepted to EMNLP 2024 @ Miami.
  • [Jul. 2024] Excited to join UMD UMIACS as a postdoc, working on Advanced Autonomous Robotics.
  • [Dec 2023] Our paper on Face De-identification is accepted to AAAI 2024 @ Vancouver.
  • [May 2023] Acknowledged as CVPR 2023 Outstanding Reviewer.
  • [May 2023] Recognized as Outstanding Teaching Assistant by ECE Graduate Division for 2023.
  • [Apr. 2023] Received Dissertation Year Program Fellowship 2023/2024.
  • [Feb. 2023] Our paper EBAD is accepted to CVPR 2023 @ Vancouver.
  • [Feb. 2023] Received HEERF Dissertation Year Program Award 2022/2023.
  • [Feb. 2023] Our paper BASES is accepted to NeurIPS 2022 @ New Orleans.
  • [Jun. 2022] Presenting our paper ZQA at CVPR 2022 @ New Orleans.

Selected Publications

Low-Frequency Trap: Evaluating Temporal Reasoning Failures in Vision-Language Models
Sarvesh Baskar*, Muhammad R. Islam*, Zikui Cai†, Ankit Nakhawa, Anirudh Satheesh, Tom Goldstein, Furong Huang
ICLR 2026 ICBNB
TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos
Seungjae Lee, Yoonkyo Jung, Inkook Chun, Yao-Chih Lee, Zikui Cai, Hongjia Huang, Aayush Talreja, Tan Dat Dao, Yongyuan Liang, Jia-Bin Huang, Furong Huang
CVPR 2026
Zebra-CoT: A Dataset for Interleaved Vision-Language Reasoning
Ang Li*, Charles Wang*, Deqing Fu*, Kaiyu Yue*, Zikui Cai*, Wang Bill Zhu*, Ollie Liu*, Peng Guo*, Willie Neiswanger, Furong Huang, Tom Goldstein, Micah Goldblum
ICLR 2026
Targeted Unlearning with Single Layer Unlearning Gradient
Zikui Cai, Yaoteng Tan, M Salman Asif
ICML 2025
AegisLLM: Scaling Agentic Systems for Self-Reflective Defense in LLM Security
Zikui Cai, Shayan Shabihi, Bang An, Zora Che, Brian R. Bartoldson, Bhavya Kailkhura, Tom Goldstein, Furong Huang
ICLR 2025 BuildingTrust
Model Tampering Attacks Enable More Rigorous Evaluations of LLM Capabilities
Zora Che, Stephen Casper, Robert Kirk, et al.
TMLR 2025
Can Textual Unlearning Solve Cross-Modality Safety Alignment?
Trishna Chakraborty, Erfan Shayegani, Zikui Cai, et al.
EMNLP 2024
Disguise without Disruption: Utility-Preserving Face De-Identification
Zikui Cai, Zhongpai Gao, Benjamin Planche, Meng Zheng, Terrence Chen, M. Salman Asif, Ziyan Wu
AAAI 2024
Ensemble-based Blackbox Attacks on Dense Prediction
Zikui Cai*, Yaoteng Tan*, M. Salman Asif
CVPR 2023
Blackbox Attacks via Surrogate Ensemble Search
Zikui Cai, Chengyu Song, Srikanth V. Krishnamurthy, Amit K. Roy-Chowdhury, M. Salman Asif
NeurIPS 2022
Zero-Query Transfer Attacks on Context-Aware Object Detectors
Zikui Cai, Shantanu Rane, Alejandro E. Brito, et al.
CVPR 2022
Context-Aware Transfer Attacks for Object Detection
Zikui Cai, Xinxin Xie, Shasha Li, et al.
AAAI 2022
Exploiting Multi-Object Relationships for Detecting Adversarial Attacks in Complex Scenes
Mingjun Yin, Shasha Li, Zikui Cai, et al.
ICCV 2021
Data-Driven Illumination Patterns For Coded Diffraction Imaging
Zikui Cai, Rakib Hyder, M. Salman Asif
ICIP 2021
Solving Phase Retrieval with a Learned Reference
Rakib Hyder*, Zikui Cai*, M. Salman Asif
ECCV 2020