Skip to content
View haeggee's full-sized avatar

Highlights

  • Pro

Block or report haeggee

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
haeggee/README.md

Hi there 👋

Pinned Loading

  1. epfml/llm-baselines epfml/llm-baselines Public

    nanoGPT-like codebase for LLM training

    Python 113 36

  2. swiss-ai/Megatron-LM swiss-ai/Megatron-LM Public

    Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    Python 42 19

  3. epfml/schedules-and-scaling epfml/schedules-and-scaling Public

    Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"

    Python 86 8

  4. epfml/getting-started epfml/getting-started Public

    Python 26 16

  5. swiss-ai/MoE swiss-ai/MoE Public

    some mixture of experts architecture implementations

    Python 24 3

  6. algolab algolab Public

    Algorithms Lab @ ETH Zurich, Fall 21

    C++ 22 3