-
Notifications
You must be signed in to change notification settings - Fork 4.9k
Pull requests: deepspeedai/DeepSpeed
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Don't swallow KeyboardInterrupt/SystemExit in NPUOpBuilder
#8096
opened Jun 27, 2026 by
ajinkyajawale14499
Loading…
Add numerical-correctness test for Muon under ZeRO-1/2
#8091
opened Jun 25, 2026 by
whycoming
Contributor
Loading…
Warn when zero.Init silently falls back to a single rank (#8084)
#8089
opened Jun 24, 2026 by
akshansh47
Loading…
fix: use local ev_values and wrap dict.values() in list()
#8087
opened Jun 23, 2026 by
hashwnath
Loading…
3 tasks done
fix: add buffer-length check in shm.cpp
#8082
opened Jun 20, 2026 by
orbisai0security
Contributor
Loading…
3 tasks done
fix: sanitize subprocess call in ds_aio_job.py
#8081
opened Jun 20, 2026 by
orbisai0security
Contributor
Loading…
3 tasks done
ZeRO 1/2: wait on all IPG-bucket producer streams in average_tensor (#8061)
#8080
opened Jun 19, 2026 by
arunshar
Contributor
Loading…
Avoid CUDA context initialization during op compatibility checks at import
#8078
opened Jun 19, 2026 by
Achyuthan-S
Loading…
feat: add Trackio as a new experiment monitoring backend
#8065
opened Jun 15, 2026 by
chanduripranav
Loading…
[DeepCompile] fix gather params in dynamo skipped frames for ZeRO3
#8059
opened Jun 11, 2026 by
XAheli
Loading…
7 tasks done
feat(zenflow): run the overlapped CPU optimizer in a native process
#8058
opened Jun 10, 2026 by
Antlera
Collaborator
Loading…
Fix eigenvalue parsing for compression-only quantize configs
#8057
opened Jun 10, 2026 by
sowndappan5
Contributor
Loading…
Add optional torchembed RoPE backend to apply_rotary_pos_emb
#8052
opened Jun 7, 2026 by
py-ai-dev
Loading…
Fix minor comment/docstring typos in runtime and inference modules
#8046
opened Jun 3, 2026 by
nathon-lee
Contributor
Loading…
zero3: defer param release during retain_graph backward #7352
#8045
opened Jun 3, 2026 by
nathon-lee
Contributor
Loading…
[Draft] Add On-Policy Distillation (OPSD) Trainer in DeepSpeed
#8027
opened May 26, 2026 by
PKUWZP
Collaborator
Loading…
3 of 5 tasks
Refactor/torch autocast encapsulate global state
#7946
opened Apr 2, 2026 by
nathon-lee
Contributor
Loading…
Fix ZeRO-3 optimizer initialization validation (#7844)
#7929
opened Mar 28, 2026 by
amadhan882
Loading…
doc: Remove suggestion to build extensions in parallel
#7899
opened Mar 12, 2026 by
Flamefire
Contributor
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.