Log inSign up
DeepSeek
168 posts
Image
user avatar
DeepSeek
@deepseek_ai
Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
deepseek.com
Joined October 2023
0
Following
1M
Followers
  • Pinned
    user avatar
    DeepSeek
    @deepseek_ai
    Apr 26
    🔥DeepSeek Input Cache Price Drop! Effective immediately, the price for input cache hits across the ENTIRE DeepSeek API series is reduced to just 1/10th of the original price! Build more efficiently for less. 📌Reminder: The DeepSeek-V4-Pro 75% OFF promotion is still active
    Image
    1.5M
  • user avatar
    DeepSeek
    @deepseek_ai
    Jan 28, 2025
    To prevent any potential harm, we reiterate that @deepseek_ai is our sole official account on Twitter/X. Any accounts: - representing us - using identical avatars - using similar names are impersonations. Please stay vigilant to avoid being misled!
    8.2M
  • user avatar
    DeepSeek
    @deepseek_ai
    Jan 20, 2025
    🚀 DeepSeek-R1 is here! ⚡ Performance on par with OpenAI-o1 📖 Fully open-source model & technical report 🏆 MIT licensed: Distill & commercialize freely! 🌐 Website & API are live now! Try DeepThink at chat.deepseek.com today! 🐋 1/n
    Image
    13M
  • user avatar
    DeepSeek
    @deepseek_ai
    Feb 21, 2025
    🚀 Day 0: Warming up for #OpenSourceWeek! We're a tiny team @deepseek_ai exploring AGI. Starting next week, we'll be open-sourcing 5 repos, sharing our small but sincere progress with full transparency. These humble building blocks in our online service have been documented,
    2.5M
  • user avatar
    DeepSeek
    @deepseek_ai
    Feb 14, 2025
    🎉 Excited to see everyone’s enthusiasm for deploying DeepSeek-R1! Here are our recommended settings for the best experience: • No system prompt • Temperature: 0.6 • Official prompts for search & file upload: bit.ly/4hyH8np • Guidelines to mitigate model bypass
    1.8M
  • user avatar
    DeepSeek
    @deepseek_ai
    Feb 18, 2025
    🚀 Introducing NSA: A Hardware-Aligned and Natively Trainable Sparse Attention mechanism for ultra-fast long-context training & inference! Core components of NSA: • Dynamic hierarchical sparse strategy • Coarse-grained token compression • Fine-grained token selection 💡 With
    Image
    Image
    Image
    Image
    2.6M
  • user avatar
    DeepSeek
    @deepseek_ai
    Aug 21, 2025
    Introducing DeepSeek-V3.1: our first step toward the agent era! 🚀 🧠 Hybrid inference: Think & Non-Think — one model, two modes ⚡️ Faster thinking: DeepSeek-V3.1-Think reaches answers in less time vs. DeepSeek-R1-0528 🛠️ Stronger agent skills: Post-training boosts tool use and
    Image
    DeepSeek
    From chat.deepseek.com
    2.1M
  • user avatar
    DeepSeek
    @deepseek_ai
    Dec 26, 2024
    🚀 Introducing DeepSeek-V3! Biggest leap forward yet: ⚡ 60 tokens/second (3x faster than V2!) 💪 Enhanced capabilities 🛠 API compatibility intact 🌍 Fully open-source models & papers 🐋 1/n
    Image
    GIF
    Image
    7.4M
  • user avatar
    DeepSeek
    @deepseek_ai
    Mar 25, 2025
    🚀 DeepSeek-V3-0324 is out now! 🔹 Major boost in reasoning performance 🔹 Stronger front-end development skills 🔹 Smarter tool-use capabilities ✅ For non-complex reasoning tasks, we recommend using V3 — just turn off “DeepThink” 🔌 API usage remains unchanged 📜 Models are
    Image
    Image
    GIF
    1.6M
  • user avatar
    DeepSeek
    @deepseek_ai
    Feb 28, 2025
    🚀 Day 5 of #OpenSourceWeek: 3FS, Thruster for All DeepSeek Data Access Fire-Flyer File System (3FS) - a parallel file system that utilizes the full bandwidth of modern SSDs and RDMA networks. ⚡ 6.6 TiB/s aggregate read throughput in a 180-node cluster ⚡ 3.66 TiB/min
    Image
    GitHub - deepseek-ai/3FS: A high-performance distributed file system designed to address the...
    From github.com
    3.2M
  • user avatar
    DeepSeek
    @deepseek_ai
    Feb 24, 2025
    🚀 Day 1 of #OpenSourceWeek: FlashMLA Honored to share FlashMLA - our efficient MLA decoding kernel for Hopper GPUs, optimized for variable-length sequences and now in production. ✅ BF16 support ✅ Paged KV cache (block size 64) ⚡ 3000 GB/s memory-bound & 580 TFLOPS
    Image
    GitHub - deepseek-ai/FlashMLA: FlashMLA: Efficient Multi-head Latent Attention Kernels
    From github.com
    1.7M
  • user avatar
    DeepSeek
    @deepseek_ai
    May 29, 2025
    🚀 DeepSeek-R1-0528 is here! 🔹 Improved benchmark performance 🔹 Enhanced front-end capabilities 🔹 Reduced hallucinations 🔹 Supports JSON output & function calling ✅ Try it now: chat.deepseek.com 🔌 No change to API usage — docs here: api-docs.deepseek.com/guides/reasoni… 🔗
    Image
    Image
    GIF
    1.5M
  • user avatar
    DeepSeek
    @deepseek_ai
    Mar 1, 2025
    🚀 Day 6 of #OpenSourceWeek: One More Thing – DeepSeek-V3/R1 Inference System Overview Optimized throughput and latency via: 🔧 Cross-node EP-powered batch scaling 🔄 Computation-communication overlap ⚖️ Load balancing Statistics of DeepSeek's Online Service: ⚡ 73.7k/14.8k
    4M
  • user avatar
    DeepSeek
    @deepseek_ai
    Feb 25, 2025
    🚀 Day 2 of #OpenSourceWeek: DeepEP Excited to introduce DeepEP - the first open-source EP communication library for MoE model training and inference. ✅ Efficient and optimized all-to-all communication ✅ Both intranode and internode support with NVLink and RDMA ✅
    Image
    GitHub - deepseek-ai/DeepEP: DeepEP: an efficient expert-parallel communication library
    From github.com
    1.4M

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement