Michael Diskin

Head of LLM R&D · Wildberries
Researcher · HSE University

prof_pic.png

📍 Everywhere & nowhere

Moscow · Yerevan · Tel Aviv
Zürich · London · Abu Dhabi

Industry. Head of LLM R&D at Wildberries — Russia’s largest e-commerce platform. Built the LLM & embeddings organization from scratch (30+ people, 4–5 teams), shipping search, retrieval, machine translation, and RAG systems at scale. Previously ML at Yandex.

Research. Published at NeurIPS, ICML, ICLR, and EMNLP (700+ citations). Core topics: distributed training, collaborative deep learning, graph neural networks. Co-created Hivemind — an open-source framework for decentralized training.

Teaching. Lecturer at Harbour.Space University and Yandex School of Data Analysis (NLP, Deep Vision & Graphics, Reinforcement Learning).

news

Feb 13, 2026 Attended MLWS @ MBZUAI; no talk this time, but many valuable conversations and networking with the community.
Nov 09, 2025 Attended EMNLP 2025, presented at a workshop, and had many productive discussions with colleagues; also helped organize and connect the Russian-speaking NLP/ML community on site.
Nov 01, 2025 Synthetic Proofs with Tool-Integrated Reasoning: Contrastive Alignment for LLM Mathematics with Lean appeared in ACL Anthology (MathNLP @ EMNLP 2025).
Oct 17, 2025 Think, Align, Select: Query–Key Scores for LLM Reasoning was published on OpenReview (NeurIPS workshop track).
Apr 01, 2024 Started teaching Deep Learning in Applications at Harbour.Space University in Barcelona.

selected publications

  1. SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient
    Max Ryabinin, Tim Dettmers, Michael Diskin, and 1 more author
    In International Conference on Machine Learning (ICML), 2023
  2. A Critical Look at the Evaluation of GNNs under Heterophily: Are We Really Making Progress?
    Oleg Platonov, Denis Kuznedelev, Michael Diskin, and 2 more authors
    In International Conference on Learning Representations (ICLR), 2023
  3. Distributed Deep Learning in Open Collaborations
    Michael Diskin, Alexey Bukhtiyarov, Max Ryabinin, and 13 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), 2021