Skip to content

P2728: use buffer capacity 64 for the SIMD performance table#238

Merged
ednolan merged 1 commit into
mainfrom
enolan_simdwording3
Jun 10, 2026
Merged

P2728: use buffer capacity 64 for the SIMD performance table#238
ednolan merged 1 commit into
mainfrom
enolan_simdwording3

Conversation

@ednolan

@ednolan ednolan commented Jun 10, 2026

Copy link
Copy Markdown
Member

64 output code units is the smallest capacity at which the prototype kernel's whole-block paths all fit (a 16-lane block of 3-byte output needs 48 bytes), and for the UTF-8 direction it makes the iterator's buffer exactly one cache line. Update the Wikipedia timing estimates to match the new Latin-row rates.

64 output code units is the smallest capacity at which the prototype
kernel's whole-block paths all fit (a 16-lane block of 3-byte output
needs 48 bytes), and for the UTF-8 direction it makes the iterator's
buffer exactly one cache line. Update the Wikipedia timing estimates
to match the new Latin-row rates.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
@ednolan ednolan force-pushed the enolan_simdwording3 branch from 2a91a51 to 7beb551 Compare June 10, 2026 06:10
@ednolan ednolan merged commit cd3a0fb into main Jun 10, 2026
52 checks passed
@coveralls

coveralls commented Jun 10, 2026

Copy link
Copy Markdown

Coverage Status

coverage: 99.744%. remained the same — enolan_simdwording3 into main

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants