Pinned
- swiss-ai/Megatron-LM (public, forked from NVIDIA/Megatron-LM): Ongoing research training transformer models at scale
- epfml/schedules-and-scaling (public): Code for the NeurIPS 2024 Spotlight paper "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"




