Skip to content

YU-deep/Awesome-Latent-Space

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

112 Commits
 
 
 
 
 
 
 
 

Repository files navigation

icon The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Awesome list badge GitHub stars MIT License arXiv Hugging Face PRs welcome WeChat Group Semantic Scholar Citations

This repository manually collects works in latent space, which will be continuously updated.

📖 News

[2026/04/03] We release our survey: The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook!

[2025/11/30] We release the initial version!

Star History Chart

🌟 Overview

📄 Citation

If you find this survey helpful, a citation to our paper would be greatly appreciated:

@article{yu2026latent,
  title={The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook},
  author={Yu, Xinlei and Chen, Zhangquan and He, Yongbo and Fu, Tianyu and Yang, Cheng and Xu, Chengming and Ma, Yue and Hu, Xiaobin and Cao, Zhe and Xu, Jie and others},
  journal={arXiv preprint arXiv:2604.02029},
  year={2026}
}

🤝 Contributing

We warmly welcome contributions of excellent resources you find via pull request. Please follow the instruction in CONTRIBUTING.md if you want to make one. Additionally, if you want to have any other issue, please add our wechat group.

🔥 Methods

Large-Language-Model

Date Paper Title Introduction Code
2024/09 Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding image -
2024/09 Uncovering Latent Chain of Thought Vectors in Language Models image -
2024/10 Understanding Reasoning in Chain-of-Thought from the Hopfieldian View image -
2024/10 ICLR'25
Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation
image Github
2024/11 Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding image Github
2024/12 COLM'25
Training Large Language Models to Reason in a Continuous Latent Space
image Github
2024/12 Compressed Chain of Thought: Efficient Reasoning Through Dense Representations image -
2024/12 ICML'25
Deliberation in Latent Space via Differentiable Cache Augmentation
image -
2025/01 Latent-space adversarial training with post-aware calibration for defending large language models against jailbreak attacks image Github
2025/01 LF-Steering: Latent Feature Activation Steering for Enhancing Semantic Consistency in Large Language Models image -
2025/02 ICML'25
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
image -
2025/02 ICML'25
Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
image -
2025/02 NeurIPS'25
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
image Github
2025/02 ICLR'26
LLM Pretraining with Continuous Concepts
image Github
2025/02 ACL'25
SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs
image Github
2025/02 Human Preferences in Large Language Model Latent Space: A Technical Analysis on the Reliability of Synthetic Data in Voting Outcome Prediction image -
2025/02 ICLR'25
Reasoning with Latent Thoughts: On the Power of Looped Transformers
image -
2025/02 Beyond Words: A Latent Memory Approach to Internal Reasoning in LLMs image -
2025/02 EMNLP'25
CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation
image Github
2025/03 ICLR'25
Reasoning to Learn from Latent Thoughts
image Github
2025/03 Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation image -
2025/03 MoLAE: Mixture of Latent Experts for Parameter-Efficient Language Models image -
2025/04 Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models - -
2025/04 Efficient Pretraining Length Scaling image -
2025/05 SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning image Github
2025/05 NeurIPS'25
Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought
image Github
2025/05 Enhancing Latent Computation in Transformers with Latent Tokens image -
2025/05 Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space image Github
2025/05 Internal Chain-of-Thought: Empirical Evidence for Layer-wise Subtask Scheduling in LLMs image Github
2025/05 Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space image -
2025/05 NeurIPS'25
Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
image Github
2025/05 LARES: Latent Reasoning for Sequential Recommendation image -
2025/05 NeurIPS'25
Hybrid Latent Reasoning via Reinforcement Learning
image Github
2025/05 NeurIPS'25
System-1.5 Reasoning: Traversal in Language and Latent Spaces with Dynamic Shortcuts
image -
2025/05 ICLR'26
Reinforced Latent Reasoning for LLM-based Recommendation
image Github
2025/05 Continuous Chain of Thought Enables Parallel Exploration and Reasoning image Github
2025/05 ICML'25
Soft Reasoning: Navigating Solution Spaces in Large Language Models through Controlled Embedding Exploration
image Github
2025/06 Efficient Post-Training Refinement of Latent Reasoning in Large Language Models image Github
2025/06 DART: Distilling Autoregressive Reasoning to Silent Thought image -
2025/06 EMNLP'25
Parallel Continuous Chain-of-Thought with Jacobi Iteration
image Github
2025/07 Latent Chain-of-Thought? Decoding the Depth-Recurrent Transformer image Github
2025/07 CTRLS: Chain-of-Thought Reasoning via Latent State Transition image -
2025/07 Geometry of Knowledge Allows Extending Diversity Boundaries of Large Language Models image -
2025/08 Bridging Search and Recommendation through Latent Cross Reasoning image -
2025/08 LatentPrompt: Optimizing Promts in Latent Space image -
2025/08 Latent Fusion Jailbreak: Blending Harmful and Harmless Representations to Elicit Unsafe LLM Outputs image -
2025/09 Decoding in Latent Spaces for Efficient Inference in LLM-based Recommendation image -
2025/09 LTA-thinker: Latent Thought-Augmented Training Framework for Large Language Models on Complex Reasoning image Github
2025/09 EMNLP'25
The Transfer Neurons Hypothesis: An Underlying Mechanism for Language Latent Space Transitions in Multilingual LLMs
image -
2025/09 LatentGuard: Controllable Latent Steering for Robust Refusal of Attacks and Reliable Response Generation image -
2025/09 ICLR'26
SIM-CoT: Supervised Implicit Chain-of-Thought
image Github
2025/09 PonderLM-2: Pretraining LLM with Latent Thoughts in Continuous Space image Github
2025/09 Fast Thinking for Large Language Models image -
2025/09 Learning to Ponder: Adaptive Reasoning in Latent Space image -
2025/09 Identity Bridge: Enabling Implicit Reasoning via Shared Latent Memory image -
2025/09 ICLR'26
MemGen: Weaving Generative Latent Memory for Self-Evolving Agents
image Github
2025/09 LatentEvolve: Self-Evolving Test-Time Scaling in Latent Space image Github
2025/09 MARCOS: Deep Thinking by Markov Chain of Continuous Thoughts image -
2025/09 A Formal Comparison Between Chain of Thought and Latent Thought - -
2025/09 ICLR'26
Latent Thinking Optimization: Your Latent Reasoning Language Model Secretly Encodes Reward Signals in Its Latent Thoughts
image -
2025/10 Thoughtbubbles: an Unsupervised Method for Parallel Thinking in Latent Space image Github
2025/10 Analyzing Latent Concepts in Code Language Models image -
2025/10 Exploring System 1 and 2 communication for latent reasoning in LLMs image -
2025/10 ICLR'26
KaVa: Latent Reasoning via Compressed KV-Cache Distillation
image -
2025/10 ICLR'26
Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization
image Github
2025/10 ICLR'26
LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning
image Github
2025/10 ICLR'26
SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs
image Github
2025/10 Encode, Think, Decode: Scaling test-time reasoning with recursive latent thoughts image -
2025/10 Parallel Test-Time Scaling for Latent Reasoning Models image Github
2025/10 LatentBreak: Jailbreaking Large Language Models through Latent Space Feedback image -
2025/10 Kelp: A Streaming Safeguard for Large Models via Latent Dynamics-Guided Risk Detection image Github
2025/10 Tracing the Traces: Latent Temporal Signals for Efficient and Accurate Reasoning image -
2025/10 Unlocking Out-of-Distribution Generalization in Transformers via Recursive Latent Space Reasoning image Github
2025/10 Language Models are Injective and Hence Invertible image -
2025/10 LLM Latent Reasoning as Chain of Superposition image Github
2025/10 ICLR'26
ActivationReasoning: Logical Reasoning in Latent Activation Spaces
image -
2025/10 ICLR'26
Emotions Where Art Thou: Understanding and Characterizing the Emotional Latent Space of Large Language Models
image -
2025/10 NeurIPS'25
SALS: Sparse Attention in Latent Space for KV cache Compression
image -
2025/10 NeurIPS'25
SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens
image Github
2025/10 Scaling Latent Reasoning via Looped Language Models image -
2025/10 ICLR'26
Cache-to-Cache: Direct Semantic Communication Between Large Language Model
image Github
2025/10 NeurIPS'25
Thought Communication in Multiagent Collaboration
image -
2025/11 SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization image Github
2025/11 Think Consistently, Reason Efficiently: Energy-Based Calibration for Implicit Chain-of-Thought image -
2025/11 Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models image Github
2025/11 SpiralThinker: Latent Reasoning through an Iterative Process with Text-Latent Interleaving image -
2025/11 Enabling Agents to Communicate Entirely in Latent Space image -
2025/11 Improving Latent Reasoning in LLMs via Soft Concept Mixing image -
2025/11 Your Latent Reasoning is Secretly Policy Improvement Operator - -
2025/11 CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning image Github
2025/11 Learning When to Stop: Adaptive Latent Reasoning via Reinforcement Learning image Github
2025/11 Visualizing LLM Latent Space Geometry Through Dimensionality Reduction image Github
2025/11 Polarity-Aware Probing for Quantifying Latent Alignment in Language Models image Github
2025/11 Latent Collaboration in Multi-Agent Systems image Github
2025/12 Latent Debate: A Surrogate Framework for Interpreting LLM Thinking image Github
2025/12 Lightweight Latent Reasoning for Narrative Tasks image -
2025/12 ICLR'26
Think-While-Generating: On-the-Fly Reasoning for Personalized Long-Form Generation
image -
2025/12 ReLaX: Reasoning with Latent Exploration for Large Reasoning Models image -
2025/12 Reinforcement Learning for Latent-Space Thinking in LLMs image Github
2025/12 Reasoning Palette: Modulating Reasoning via Latent Contextualization for Controllable Exploration for (V)LMs image -
2025/12 JEPA-Reasoner: Decoupling Latent Reasoning from Token Generation image -
2025/12 Do Latent Tokens Think? A Causal and Adversarial Analysis of Chain-of-Continuous-Thought image -
2025/12 iCLP: Large Language Model Reasoning with Implicit Cognition Latent Planning image Github
2025/12 Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space image -
2025/12 Learning Evolving Latent Strategies for Multi-Agent Language Systems without Model Fine-Tuning - -
2026/01 Parallel Latent Reasoning for Sequential Recommendation image -
2026/01 Latent Space Communication via K-V Cache Alignment image -
2026/01 Layer-Order Inversion: Rethinking Latent Multi-Hop Reasoning in Large Language Models image Github
2026/01 FlashMem: Distilling Intrinsic Latent Memory via Computation Reuse image -
2026/01 IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck image Github
2026/01 Breaking Model Lock-in: Cost-Efficient Zero-Shot LLM Routing via a Universal Latent Space image Github
2026/01 Silence the Judge: Reinforcement Learning with Self-Verifier via Latent Geometric Clustering image -
2026/01 Reasoning Beyond Chain-of-Thought: A Latent Computational Mode in Large Language Models image -
2026/01 RISER: Orchestrating Latent Reasoning Skills for Adaptive Activation Steering image Github
2026/01 GeoSteer: Faithful Chain-of-Thought Steering via Latent Manifold Gradients image -
2026/01 Reasoning While Recommending: Entropy-Guided Latent Reasoning in Generative Re-ranking Models image -
2026/01 Latent-Space Contrastive Reinforcement Learning for Stable and Efficient LLM Reasoning image -
2026/01 UniCog: Uncovering Cognitive Abilities of LLMs through Latent Mind Space Analysis image Github
2026/01 S2GR: Stepwise Semantic-Guided Reasoning in Latent Space for Generative Recommendation image -
2026/01 The Geometric Reasoner: Manifold-Informed Latent Foresight Search for Long-Context Reasoning image -
2026/01 PILOT: Planning via Internalized Latent Optimization Trajectories for Large Language Models image -
2026/01 Beyond Imitation: Reinforcement Learning for Active Latent Planning image Github
2026/01 Latent Adversarial Regularization for Offline Preference Optimization image -
2026/01 Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization image Github
2026/01 Depth-Recurrent Attention Mixtures: Giving Latent Reasoning the Attention it Deserves image -
2026/01 From Logits to Latents: Contrastive Representation Shaping for LLM Unlearning - -
2026/01 ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought image Github
2026/02 G-MemLLM: Gated Latent Memory Augmentation for Long-Context Reasoning in Large Language Models image -
2026/02 Do Latent-CoT Models Think Step-by-Step? A Mechanistic Study on Sequential Reasoning Tasks image Github
2026/02 Capabilities and Fundamental Limits of Latent Chain-of-Thought - -
2026/02 Restoring Exploration after Post-Training: Latent Exploration Decoding for Large Reasoning Models image Github
2026/02 No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs - Github
2026/02 CoLT: Reasoning with Chain of Latent Tool Calls image -
2026/02 Internalizing LLM Reasoning via Discovery and Replay of Latent Actions image Github
2026/02 Inference-Time Rethinking with Latent Thought Vectors for Math Reasoning image -
2026/02 LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning image Github
2026/02 DeltaKV: Residual-Based KV Cache Compression via Long-Range Similarity image Github
2026/02 Pretraining with Token-Level Adaptive Latent Chain-of-Thought image -
2026/02 Latent Reasoning with Supervised Thinking States image -
2026/02 Dynamics Within Latent Chain-of-Thought: An Empirical Study of Causal Structure image -
2026/02 Next Concept Prediction in Discrete Latent Space Leads to Stronger Language Models image Github
2026/02 Talking with the Latents -- how to convert your LLM into an astronomer image -
2026/02 Latent Thoughts Tuning: Bridging Context and Reasoning with Fused Information in Latent Tokens image Github
2026/02 Prioritize the Process, Not Just the Outcome: Rewarding Latent Thought Trajectories Improves Reasoning in Looped Language Models image -
2026/02 ICLR'26
LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation
image Github
2026/02 Jailbreaking Leaves a Trace: Understanding and Detecting Jailbreak Attacks from Internal Representations of Large Language Models image -
2026/02 ICLR'26
Native Reasoning Models: Training Language Models to Reason on Unverifiable Data
image -
2026/02 ThinkRouter: Efficient Reasoning via Routing Thinking between Latent and Discrete Spaces image -
2026/02 SpiralFormer: Looped Transformers Can Learn Hierarchical Dependencies via Multi-Resolution Recursion image -
2026/02 GTS: Inference-Time Scaling of Latent Reasoning with a Learnable Gaussian Thought Sampler image -
2026/02 Measuring and Mitigating Post-hoc Rationalization in Reverse Chain-of-Thought Generation image -
2026/02 Inner Loop Inference for Pretrained Transformers: Unlocking Latent Capabilities Without Training - Github
2026/02 LatentMem: Customizing Latent Memory for Multi-Agent Systems image Github
2026/02 Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems image -
2026/03 LaSER: Internalizing Explicit Reasoning into Latent Space for Dense Retrieval image Github
2026/03 ICLR'26
Multi-Head Low-Rank Attention
- Github
2026/03 AdaPonderLM: Gated Pondering Language Models with Token-Wise Adaptive Depth image -
2026/03 PonderLM-3: Adaptive Token-Wise Pondering with Differentiable Masking image -
2026/03 When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning - Github
2026/03 ICLR'26
∇-REASONER: LLM REASONING VIA TEST-TIMEGRADIENT DESCENT IN LATENT SPACE
- -
2026/03 SPOT: Span-level Pause-of-Thought for Efficient and Interpretable Latent Reasoning in Large Language Models image -
2026/03 NextMem: Towards Latent Factual Memory for LLM-based Agents image Github
2026/03 Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations image -
2026/03 LoopRPT: Reinforcement Pre-Training for Looped Language Models image -

Vision-Language-Model

Date Paper Title Introduction Code
2024/10 Reducing hallucinations in large vision-language models via latent space steering image Github
2024/12 CVPR'25
Perception Tokens Enhance Visual Reasoning in Multimodal Language Models
image Github
2025/01 Efficient Reasoning with Hidden Thinking image Github
2025/02 NeurIPS'25
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Document Understanding
image -
2025/03 Not Only Text: Exploring Compositionality of Visual Representations in Vision-Language Models image Github
2025/05 NeurIPS'25
Towards General Continuous Memory for Vision-Language Models
image Github
2025/05 NeurIPS'25
Image Tokens Matter: Mitigating Hallucination in Discrete Tokenizer-based Large Vision-Language Models via Latent Editing
image Github
2025/06 Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens image Github
2025/08 Multimodal Chain of Continuous Thought for Latent-Space Reasoning in Vision-Language Models image -
2025/09 ICLR'26
MILR: Improving Multimodal Image Generation via Test-Time Latent Reasoning
image Github
2025/09 ICLR'26
Latent Visual reasoning
image Github
2025/10 Auto-scaling Continuous Memory for GUI Agent image Github
2025/10 Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space image Github
2025/10 CVPR'26
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views
image Github
2025/10 Latent Chain-of-Thought for Visual Reasoning image Github
2025/10 Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs image Github
2025/11 Multimodal Reasoning via Latent Refocusing image -
2025/11 CVPR'26
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Model
image Github
2025/11 L2V-CoT: Cross-Modal Transfer of Chain-of-Thought Reasoning via Latent Intervention image -
2025/11 Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens image Github
2025/11 Reading Between the Lines: Abstaining from VLM-Generated OCR Errors via Latent Representation Probes image -
2025/11 Monet: Reasoning in Latent Visual Space Beyond Image and Language image Github
2025/12 Interleaved Latent Visual Reasoning with Selective Perceptual Modeling image Github
2025/12 Mull-Tokens: Modality-Agnostic Latent Thinking image -
2025/12 VL-JEPA: Joint Embedding Predictive Architecture for Vision-language image -
2025/12 Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space image Github
2025/12 Sketch-in-Latents: Eliciting Unified Reasoning in MLLMs image Github
2025/12 Latent Implicit Visual Reasoning image -
2026/01 Forest Before Trees: Latent Superposition for Efficient Visual Reasoning image Github
2026/01 Controlling Multimodal Conversational Agents with Coverage-Enhanced Latent Actions image -
2026/01 LaViT: Aligning Latent Visual Thoughts for Multi-modal Reasoning image Github
2026/01 PREGEN: Uncovering Latent Thoughts in Composed Video Retrieval image -
2026/01 Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning image Github
2026/01 CausalEmbed: Auto-Regressive Multi-Vector Generation in Latent Space for Visual Document Embedding image -
2026/02 PolarMem: A Training-Free Polarized Latent Graph Memory for Verifiable Multimodal Agents image Github
2026/02 LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs image Github
2026/02 Dual Latent Memory for Visual Multi-agent System image Github
2026/02 Learning Modal-Mixed Chain-of-Thought Reasoning with Latent Embeddings image -
2026/02 Toward Cognitive Supersensing in Multimodal Large Language Model image Github
2026/02 Visual Reasoning over Time Series via Multi-Agent Systems image -
2026/02 Vision-aligned Latent Reasoning for Multi-modal Large Language Model image -
2026/02 Multimodal Latent Reasoning via Hierarchical Visual Cues Injection image -
2026/02 LCLA: Language-Conditioned Latent Alignment for Vision-Language Navigation image -
2026/02 MaD-Mix: Multi-Modal Data Mixtures via Latent Space Coupling for Vision-Language Model Training image -
2026/02 Reason-IAD: Knowledge-Guided Dynamic Latent Reasoning for Explainable Industrial Anomaly Detection image Github
2026/02 Revis: Sparse Latent Steering to Mitigate Object Hallucination in Large Vision-Language Models image Github
2026/02 OneLatent: Single-Token Compression for Visual Latent Reasoning image -
2026/02 The Vision Wormhole: Latent-Space Communication in Heterogeneous Multi-Agent Systems image Github
2026/02 Test-Time Computing for Referring Multimodal Large Language Models image -
2026/02 CrystaL: Spontaneous Emergence of Visual Latents in MLLMs image Github
2026/02 Imagination Helps Visual Reasoning, But Not Yet in Latent Space image -
2026/02 Steering and Rectifying Latent Representation Manifolds in Frozen Multi-modal LLMs for Video Anomaly Detection image -
2026/03 Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding image -

Vision-Language-Action-Model

Date Paper Title Introduction Code
2024/10 ICLR'25
Latent Action Pretraining from Videos
image Github
2025/05 UniVLA: Learning to Act Anywhere with Task-centric Latent Actions image Github
2025/07 NeurIPS'25
ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning
image Github
2025/09 Align-Then-Steer: Adapting the Vision-Language Action Models through Unified Latent Guidance image Github
2025/09 OccVLA: Vision-Language-Action Model with Implicit 3D Occupancy Supervision image -
2025/09 Latent Action Pretraining Through World Modeling image -
2025/09 Seeing Space and Motion: Enhancing Latent Actions with Spatial and Dynamic Awareness for VLA image -
2025/11 SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models image Github
2025/11 LatBot: Distilling Universal Latent Actions for Vision-Language-Action Models image -
2025/11 Unifying Perception and Action: A Hybrid-Modality Pipeline with Implicit Visual Chain-of-Thought for Robotic Action Generation image Github
2025/12 SwiftVLA: Unlocking Spatiotemporal Dynamics for Lightweight VLA Models at Minimal Overhead image Github
2025/12 GLaD: Geometric Latent Distillation for Vision-Language-Action Models image -
2025/12 Latent Chain-of-Thought World Modeling for End-to-End Autonomous Driving image -
2025/12 WholeBodyVLA: Towards Unified Latent VLA for Whole-Body Loco-Manipulation Control image Github
2025/12 Motus: A Unified Latent Action World Model image Github
2025/12 LoLA: Long Horizon Latent Action Learning for General Robot Manipulation image -
2025/12 ColaVLA: Leveraging Cognitive Latent Reasoning for Hierarchical Parallel Trajectory Planning in Autonomous Driving image Github
2026/01 Learning to Act Robustly with View-Invariant Latent Actions image -
2026/01 CLAP: Contrastive Latent Action Pretraining for Learning Vision-Language-Action Models from Human Videos image -
2026/01 LaST0: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model image -
2026/01 LatentVLA: Efficient Vision-Language Models for Autonomous Driving via Latent Action Prediction image -
2026/01 Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning image -
2026/01 LangForce: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries image Github
2026/01 CARE: Multi-Task Pretraining for Latent Continuous Action Representation in Robot Control image -
2026/01 Vision-Language Models Unlock Task-Centric Latent Actions image -
2026/02 Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models image Github
2026/02 DriveWorld-VLA: Unified Latent-Space World Modeling for Autonomous Driving image -
2026/02 Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning image -
2026/02 ConLA: Contrastive Latent Action Learning from Human Videos for Robotic Manipulation image Github
2026/02 VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model image Github
2026/02 FUTURE-VLA: Forecasting Unified Trajectories Under Real-time Execution image -
2026/02 UniLACT: Depth-Aware RGB Latent Action Learning for Vision-Language-Action Models image -
2026/02 CVPR'26
JALA: Joint-Aligned Latent Action: Towards Scalable VLA Pretraining in the Wild
image -
2026/03 LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving image Github
2026/03 Chain of World: World Model Thinking in Latent Motion image Github

About

A paper list of Awesome Latent Space.

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors