Bbq support #26

Open
akhauriyash wants to merge 4 commits into prod from bbq_support


@akhauriyash (Collaborator) commented May 20, 2026

Summary

Enable xLLM partial-RoPE layout wiring for BBQ 375B training.

Changes

  • Add --xllm-partial-rope-layout to scripts/models/xllm-375B.sh
  • Teach miles_plugins/mbridge/xllm.py to:
    • derive rotary_percent from rope_head_dim / head_dim
    • enable xllm_partial_rope_layout when rope_head_dim * 2 == head_dim
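
The two derivation rules above can be sketched as a small helper. This is a minimal illustration, not the code in miles_plugins/mbridge/xllm.py: the function name `derive_partial_rope_settings`, the `PartialRopeSettings` container, and the input validation are hypothetical. Only the two rules themselves come from this PR description.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PartialRopeSettings:
    """Hypothetical container for the two derived values."""
    rotary_percent: float
    xllm_partial_rope_layout: bool


def derive_partial_rope_settings(rope_head_dim: int, head_dim: int) -> PartialRopeSettings:
    """Derive the partial-RoPE settings from the two head dimensions.

    The validation below is an assumption added for the sketch; the PR
    description does not say how invalid dimensions are handled.
    """
    if head_dim <= 0 or not 0 < rope_head_dim <= head_dim:
        raise ValueError(
            f"expected 0 < rope_head_dim <= head_dim, got {rope_head_dim} and {head_dim}"
        )
    return PartialRopeSettings(
        # Rule 1: rotary_percent = rope_head_dim / head_dim
        rotary_percent=rope_head_dim / head_dim,
        # Rule 2: the layout flag is set only when exactly half the head is rotary
        xllm_partial_rope_layout=(rope_head_dim * 2 == head_dim),
    )


# The 375B values quoted in this PR: rope_head_dim=64, head_dim=128.
settings = derive_partial_rope_settings(rope_head_dim=64, head_dim=128)
full_rope = derive_partial_rope_settings(rope_head_dim=128, head_dim=128)
```

With the 375B dimensions this gives a rotary_percent of 0.5 and turns the layout flag on. A model with full RoPE (rope_head_dim equal to head_dim) would get a rotary_percent of 1.0 and leave the flag off.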

Why

xLLM 375B uses partial RoPE with rope_head_dim=64 and head_dim=128. The trainer needs to opt into the Megatron-LM xLLM partial-RoPE path so its logprobs match the HF/SGLang reference layout.
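
For readers unfamiliar with partial RoPE, the sketch below shows the general idea on a single head vector: only rope_head_dim of the head_dim entries are rotated, and the rest pass through unchanged. Several details here are assumptions for illustration and are not confirmed by this PR: that the rotary entries are the leading ones, that pairs follow the half-split (rotate-half) convention, and that the frequency base is 10000. The actual xLLM layout selected by xllm_partial_rope_layout may differ on exactly these points, which is why the trainer and the reference have to agree on it.

```python
import math


def apply_partial_rope(x, position, rope_head_dim, base=10000.0):
    """Rotate the first rope_head_dim entries of one head vector.

    Assumed conventions (not taken from the PR): leading rotary entries,
    half-split pairing (entry i is paired with entry i + rope_head_dim // 2),
    and inverse frequencies base ** (-2 * i / rope_head_dim).
    """
    half = rope_head_dim // 2
    out = list(x)  # entries at index >= rope_head_dim are copied unchanged
    for i in range(half):
        theta = position * base ** (-2.0 * i / rope_head_dim)
        c, s = math.cos(theta), math.sin(theta)
        a, b = x[i], x[i + half]
        out[i] = a * c - b * s
        out[i + half] = b * c + a * s
    return out


# The 375B dimensions from this PR: 64 rotary entries out of 128.
head = [1.0] * 128
rotated = apply_partial_rope(head, position=3, rope_head_dim=64)
```

Under these assumptions the last 64 entries of `rotated` equal those of `head`, and the first 64 keep their combined norm because each pair undergoes a plain 2D rotation. Two implementations that pair or place the rotary entries differently would produce different values for the same input, which is the kind of mismatch that shows up as diverging logprobs.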

Validation

  • python3 -m py_compile miles_plugins/mbridge/xllm.py
  • bash -n scripts/models/xllm-375B.sh
  • git diff --check

@akhauriyash requested a review from a team as a code owner May 20, 2026 17:35

@akhauriyash (Collaborator, Author) commented

@copilot resolve the merge conflicts in this pull request

Copilot AI commented May 20, 2026

> @copilot resolve the merge conflicts in this pull request

Resolved and pushed in commit dea08c4. I merged prod into bbq_support, resolved the conflicts in miles_plugins/mbridge/xllm.py and scripts/models/xllm-375B.sh, and verified both files (py_compile and bash -n).
