Ying Li

šŸ‘‹ Hi, I’m Ying Li, a researcher working on Efficient AI, LLM/MLLM inference, and Machine Learning Systems.
My work focuses on making large models faster, lighter, and deployable on resource-constrained platforms — with applications ranging from dynamic inference to AI for Science.


šŸ” Research Interests

  • Efficient AI & model compression
  • LLM/MLLM inference acceleration
  • Dynamic / speculative decoding
  • Machine learning systems
  • AI for Science

šŸ“š My Research

(Full list on Google Scholar)