The AI Timeline
1,989 posts
Covering the latest trending AI & LLM research. See "highlights" for all weekly threads. Run by @bycloudai
- 🚨This week's top AI/ML research papers: - Log-Linear Attention - Beyond the 80/20 Rule - Why Gradients Rapidly Increase Near the End of Training - How much do language models memorize? - General agents need world models - The Illusion of Thinking - MiMo-VL Technical Report -
- 🚨This week’s top AI/ML research papers: - LLaVA-o1 - Marco-o1 - The Dawn of GUI Agent - Hymba - When Precision Meets Position - Multimodal Autoregressive Pre-training of Large Vision Encoders - Generative World Explorer - That Chip Has Sailed - Is Your LLM Secretly a World
- 🚨This week's top AI/ML research papers: - AlphaEvolve - Qwen3 Technical Report - Insights into DeepSeek-V3 - Seed1.5-VL Technical Report - BLIP3-o - Parallel Scaling Law for LMs - HealthBench - Learning Dynamics in Continual Pre-Training for LLMs - Learning to Think - Beyond
- 🚨 Last 2 weeks' top AI/ML research papers: - Transformers without Normalization - Block Diffusion - Compute Optimal Scaling of Skills - DAPO: An OS LLM RL System at Scale - Teaching LLMs How to Learn with Contextual Fine-Tuning - GR00T N1 - Why the Brain Cannot Be a Digital
- 🚨This week’s top AI/ML research papers: - Differential Transformer - GSM-Symbolic - Pixtral 12B - Intelligence at the Edge of Chaos - Cheating Automatic LLM Benchmarks - nGPT - Upcycling Large Language Models into Mixture of Experts - Personalized Visual Instruction Tuning -
- 🚨This week's top AI/ML research papers: - Inference-Time Scaling for Generalist Reward Modeling - Multi-Token Attention - Why do LLMs attend to the first token? - Command A - LLMs Pass the Turing Test - Advances and Challenges in Foundation Agents - PaperBench - Effectively
- 🚨This week's top AI/ML research papers: - Energy-Based Transformers are Scalable Learners and Thinkers - Dynamic Chunking for End-to-End Hierarchical Sequence Modeling - Pre-Trained Policy Discriminators are General Reward Models - First Return, Entropy-Eliciting Explore -
- 🚨This week's top AI/ML research papers: - GPT-4o System Card: Native Image Generation - Anthropic's On the Biology of a LLM - Gemma 3 Technical Report - Qwen2.5-Omni Technical Report - Reasoning to Learn from Latent Thoughts - Defeating Prompt Injections by Design - Scaling
- 🚨This week’s top AI/ML research papers: - Molmo and PixMo - MaskLLM - Are We Closer to an AI Doctor? - Programming Every Example - MIMO - Pixel-Space Post-Training of Latent Diffusion Models - Phantom of Latent for Large Language and Vision Models - Making Text Embedders
- 🚨This week's top AI/ML research papers: - Demystifying Long Chain-of-Thought Reasoning in LLMs - OmniHuman-1 - LIMO - s1: Simple test-time scaling - Process Reinforcement through Implicit Rewards - Iterate to Accelerate - Efficient Reasoning with Hidden Thinking - Fully
- 🚨This week's top AI/ML research papers: - Absolute Zero - RM-R1 - Seed-Coder - Flow-GRPO - ZeroSearch - Ming-Lite-Uni - A Survey on Large Multimodal Reasoning Models - On Path to Multimodal Generalist - HunyuanCustom - Unified Multimodal CoT Reward Model through
- 🚨This week’s top AI/ML research papers: - Mixture-of-Transformers - BitNet a4.8 - LoRA vs Full Fine-tuning: An Illusion of Equivalence - Mixtures of In-Context Learners - Emergence of Hidden Capabilities - DimensionX - The Surprising Effectiveness of Test-Time Training for
- 🚨This week’s top AI/ML research papers: - MovieGen - Were RNNs All We Needed? - Contextual Document Embeddings - RLEF - ENTP - VinePPO - When a language model is optimized for reasoning, does it still show embers of autoregression? An analysis of OpenAI o1 - LLMs Know More Than















