[#10580][fix] re-enable NemotronH MOE MMLU test #10594
Signed-off-by: Suyog Gupta <41447211+suyoggupta@users.noreply.github.com>
/bot run
Walkthrough
This PR modifies a single test file: it lowers a memory configuration parameter from 0.5 to 0.4 in the BF16 test, and it uncomments the sampling-parameters code in the FP8 test so that MMLU evaluation runs again.
Estimated code review effort: 1 (Trivial), ~3 minutes
Pre-merge checks: 1 passed, 2 failed (2 warnings)
Code graph analysis: tests/integration/defs/accuracy/test_llm_api_autodeploy.py
PR_Github #31487 [ run ] triggered by Bot. Commit:
PR_Github #31487 [ run ] completed with state
Pull request overview
This PR re-enables the previously disabled MMLU accuracy test for the Nemotron MOE FP8 model and adjusts memory configuration for the BF16 test to prevent out-of-memory issues.
Changes:
- Reduced `free_mem_ratio` from 0.5 to 0.4 in the BF16 test to accommodate multi-stream MOE memory requirements
- Re-enabled MMLU evaluation in the FP8 test by uncommenting the test code
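To illustrate why lowering the ratio can help with out-of-memory failures, here is a minimal sketch. It assumes `free_mem_ratio` is the fraction of free GPU memory set aside for caches, so that the remainder stays available for other allocations. The helper `split_free_memory` and the 40 GiB figure are hypothetical and come from neither this PR nor the TensorRT-LLM code base.

```python
GIB = 2**30


def split_free_memory(free_bytes: int, free_mem_ratio: float) -> tuple[float, float]:
    """Split free GPU memory into (cache_budget, headroom), both in bytes.

    Assumption (not taken from the PR): free_mem_ratio is the fraction of
    currently free memory reserved for caches; everything else is headroom
    for other runtime allocations.
    """
    if not 0.0 <= free_mem_ratio <= 1.0:
        raise ValueError("free_mem_ratio must be in [0, 1]")
    cache_budget = free_bytes * free_mem_ratio
    headroom = free_bytes - cache_budget
    return cache_budget, headroom


if __name__ == "__main__":
    free = 40 * GIB  # hypothetical amount of free memory after model load
    for ratio in (0.5, 0.4):
        cache, headroom = split_free_memory(free, ratio)
        print(
            f"ratio={ratio}: cache={cache / GIB:.1f} GiB, "
            f"headroom={headroom / GIB:.1f} GiB"
        )
```

Under these assumptions, moving from 0.5 to 0.4 shrinks the cache budget and leaves a larger share of free memory for everything else, which is the kind of slack the multi-stream MOE path would need.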
Signed-off-by: Suyog Gupta <41447211+suyoggupta@users.noreply.github.com>
Signed-off-by: Daniil Kulko <kulkodaniil@gmail.com>
fixes #10580