Skip to content

Pull requests: huggingface/tokenizers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

chore(deps): bump qs and express in /tokenizers/examples/unstable_wasm/www dependencies Pull requests that update a dependency file javascript Pull requests that update Javascript code
#2067 opened May 22, 2026 by dependabot Bot Loading…
Make BPE/WordPiece training deterministic
#2066 opened May 22, 2026 by ATOM00blue Loading…
Document that train_new_from_iterator uses BPE for WordPiece
#2065 opened May 21, 2026 by adityasingh2400 Loading…
3 tasks done
chore(deps): bump qs and body-parser in /tokenizers/examples/unstable_wasm/www dependencies Pull requests that update a dependency file javascript Pull requests that update Javascript code
#2063 opened May 20, 2026 by dependabot Bot Loading…
chore(deps-dev): bump webpack-dev-server from 5.2.1 to 5.2.4 in /tokenizers/examples/unstable_wasm/www dependencies Pull requests that update a dependency file javascript Pull requests that update Javascript code
#2062 opened May 20, 2026 by dependabot Bot Loading…
Fix Encode Inputs doc page rendering (closes #1909)
#2061 opened May 18, 2026 by LeSingh1 Loading…
serialize tokenizer vocab and added_tokens compactly
#2056 opened May 13, 2026 by ArthurZucker Collaborator Loading…
Add scaling_bench: encode_batch vs worker-pool comparison (#1900)
#2048 opened May 1, 2026 by stargazerZJ Loading…
5 of 6 tasks
WIP: testing only
#2047 opened Apr 30, 2026 by assafvayner Draft
4 tasks
Batch encode: simple lock-free scheduler
#2044 opened Apr 28, 2026 by sebpop Contributor Loading…
feat(ByteLevel): skip per-byte transform for printable-ASCII tokens
#2038 opened Apr 26, 2026 by KimYannn Loading…
2 of 3 tasks
feat(NFC): skip Unicode pass for all-ASCII inputs
#2037 opened Apr 26, 2026 by KimYannn Loading…
2 of 3 tasks
feat: SIMD ASCII fast path for Lowercase normalizer (~30-49x)
#2036 opened Apr 26, 2026 by KimYannn Loading…
6 of 7 tasks
V0.23 release
#2032 opened Apr 24, 2026 by ArthurZucker Collaborator Loading…
ProTip! What’s not been updated in a month: updated:<2026-04-22.