-
Notifications
You must be signed in to change notification settings - Fork 32.9k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add EXAONE 4.5 implementations
#45471
opened Apr 16, 2026 by
nuxlear
Contributor
Loading…
3 of 6 tasks
skip test_flash_attn_2_can_dispatch_composite_models tests for
#45470
opened Apr 16, 2026 by
kaixuanliu
Contributor
Loading…
Fix: propagate interpolate_pos_encoding through Pixio model hierarchy
#45469
opened Apr 16, 2026 by
Spectual
Loading…
3 of 6 tasks
Fix MPS SDPA output shape when value head dim differs from query head dim
#45467
opened Apr 16, 2026 by
Jah-yee
Loading…
chore(sec): added a handful of security checks
#45462
opened Apr 15, 2026 by
tarekziade
Collaborator
Loading…
Remove redundant condition checks in
get_image_size method
#45461
opened Apr 15, 2026 by
JiauZhang
Loading…
fix(tokenization): re-raise ImportError to allow RuntimeError/OSError fallback (#45459)
#45460
opened Apr 15, 2026 by
cloudyun888
Loading…
Allow loading Qwen Thinker 'base' models without generative head
#45457
opened Apr 15, 2026 by
tomaarsen
Member
Loading…
2 of 6 tasks
[
fix] Make Qwen2_5OmniProcessor warning a lot less noisy via warning_once
#45455
opened Apr 15, 2026 by
tomaarsen
Member
Loading…
2 of 6 tasks
refactor: replace wildcard imports with explicit imports in model __init__.py files
#45452
opened Apr 15, 2026 by
DavidSolanas
Loading…
[loading] Clean way to add/remove full parts in checkpoint names
#45448
opened Apr 15, 2026 by
Cyrilvallez
Member
Loading…
[
fix] Always early return for non-Mistral models in _patch_mistral_regex
#45444
opened Apr 14, 2026 by
tomaarsen
Member
Loading…
3 of 6 tasks
Raise 400 on model mismatch when
transformers serve is pinned
#45443
opened Apr 14, 2026 by
qgallouedec
Member
Loading…
fix(DSV3): parity between native
DeepseekV3MoE and remote official implementation
#45441
opened Apr 14, 2026 by
casinca
Contributor
Loading…
4 of 6 tasks
Add Gemma4ForSequenceClassification
#45438
opened Apr 14, 2026 by
Charly21r
Contributor
Loading…
4 tasks done
Fix spurious position_ids warnings for at least 40 architectures
#45437
opened Apr 14, 2026 by
tomaarsen
Member
Loading…
2 of 6 tasks
Add expert parallelism (EP) support for Qwen3 MoE + fix GroupedGemmParallel for 2D meshes
#45436
opened Apr 14, 2026 by
AmineDiro
Member
Loading…
7 tasks done
do not index past decoded chars with special tokens
#45435
opened Apr 14, 2026 by
itazap
Collaborator
Loading…
chore(qa): split pipeline and add type checking
#45432
opened Apr 14, 2026 by
tarekziade
Collaborator
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-03-16.