Pull requests: modelscope/ms-swift
[bugfix] guard _repair_ms_bench against an empty messages list
#9608
opened Jun 20, 2026 by
he-yufeng
[bugfix] guard load_audio against bytes input
#9606
opened Jun 20, 2026 by
he-yufeng
1 of 4 tasks
[bugfix] include image mode and size in tmp image cache key
#9605
opened Jun 20, 2026 by
he-yufeng
1 of 4 tasks
[compat] compat moe_router_load_balancing_type mcore>=0.16
#9603
opened Jun 18, 2026 by
Jintao-Huang
Collaborator
feat(megatron): add nccl_comm_warmup to avoid iteration-1 NCCL cudaMalloc OOM (#6387)
#9602
opened Jun 18, 2026 by
yuchenwang3
Contributor
fix(megatron): align moe_router_load_balancing_type to single-string Literal (#8480)
#9600
opened Jun 18, 2026 by
yuchenwang3
Contributor
feat(packing): add opt-in packing_strategy (binpack | sequential) for order-preserving packing
#9598
opened Jun 18, 2026 by
yuchenwang3
Contributor
Fix NPU Qwen3.5 grpo bugs
#9589
opened Jun 17, 2026 by
addsubmuldiv
Collaborator
2 of 4 tasks
Update gkd_loss.py teacher_prompt supports two formats:
#9562
opened Jun 15, 2026 by
yhy19
1 of 4 tasks
[bugfix] add optional MEDIA_DECODE_TIMEOUT to prevent silent media-decode hang
#9541
opened Jun 11, 2026 by
HaozheZhang6
Contributor
1 of 4 tasks
fix(megatron): pre-initialize NCCL communicator for MoE expert DP group to prevent lazy-init deadlock
#9486
opened Jun 3, 2026 by
zb2313
Add lightweight TGS and MFU logging for training
#9465
opened Jun 1, 2026 by
WendaDeng
3 tasks
[compat] Support Qwen2-Audio with newer transformers
#9453
opened May 30, 2026 by
MWXGOD
2 of 10 tasks
feat: Add native support for PyTorch Profiler in ms-swift
#9449
opened May 29, 2026 by
qq1243196045
Contributor
1 of 4 tasks
[bugfix] fix MiniCPMV4_6 text-only batch graph break in distributed training
#9443
opened May 28, 2026 by
randydl
Contributor
[bugfix] Gemma4 suffix: drop trailing newline
#9440
opened May 28, 2026 by
shanhaoli
2 tasks
fix: add noqa annotation for dataset utils import
#9433
opened May 27, 2026 by
udjevdbaj
1 task
fix: update MindSpeed import path (lite.ops.triton → ops.triton, adapting to core_r0.16.0)
#9415
opened May 26, 2026 by
Weizhena