Skip to content
View wln20's full-sized avatar

Highlights

  • Pro

Block or report wln20

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
wln20/README.md

Image

  • 🌱 Hi! I'm Luning Wang, currently a Master's student at the University of Michigan. Before that, I did my undergrad at the Department of Electronic Engineering, Tsinghua University.
  • 📖 I mainly focused on the infrastructure and efficiency optimization of Large Language Models (a.k.a MLSys or AI-Infra) in my past works. Aside from that, I’m also learning related AI techs like RL and Multimodal/Diffusion Models. See my Homepage and LinkedIn to learn more about me.
  • 🤝 I'm open to discussion & collaboration. Feel free to chat with me!

Pinned Loading

  1. Attention-Viewer Attention-Viewer Public

    A plug-and-play tool for visualizing attention-score heatmap in generative LLMs. Easy to customize for your own need.

    Python 51 5

  2. CSKV CSKV Public

    [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios

    Python 16

  3. thu-nics/qllm-eval thu-nics/qllm-eval Public

    Code Repository of Evaluating Quantized Large Language Models

    Python 136 10