Is Megatron GRPO training based on moe_server currently supported? #9529

@ooochen-30

Checklist

  • I have searched existing issues, and this is a new question or discussion topic.

Question Description

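I could not find a GRPO example for Megatron based on moe_server in the examples. Is this currently supported, or is there a plan to implement it? I am trying to run GRPO on qwen3.5-35b-a3b, and I hope to use Megatron together with server mode to reduce GPU memory usage and training time.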

Labels

question (Further information is requested)