NVIDIA / cutlass Public

Fork 1.7k
Star 9.3k

Code
Issues 454
Pull requests 100
Discussions
Actions
Projects
Wiki
Security
Insights

Pull requests: NVIDIA/cutlass

Labels 24 Milestones 3

100 Open 807 Closed

Pull requests list

[CuTeDSL] Flash Attention v2 for SM120 (Blackwell GeForce)

#3030 opened Feb 13, 2026 by blake-snc

Add option to not suffix prints with new line

#3028 opened Feb 13, 2026 by SzymonOzog

CUTLASS 3x Green Context Support

#3019 opened Feb 11, 2026 by XiaoSongXS

minor: wrong cordinate in layout algebra docs section

#3014 opened Feb 10, 2026 by JINO-ROHIT

Declare CUDA standard 20 as requirement for example 63 (fixes #3011)

#3013 opened Feb 10, 2026 by reuterbal

Fix typo in cute's tutorial example

#3012 opened Feb 10, 2026 by lygztq

use compiler macro to imporve the compatibility

#3008 opened Feb 6, 2026 by reed-lau

Resolve build warnings in C++20

#2998 opened Feb 3, 2026 by Algy

add: add comments to help understand

#2993 opened Feb 2, 2026 by meiniangpp416

Fix/nvfp4 tensor init

#2989 opened Jan 28, 2026 by michael604work

Fix mixed_input_fmha_decode example

#2986 opened Jan 26, 2026 by anakinxc

Fix redundant tile copies in wgmma_sm90 tutorial pipeline loop

#2982 opened Jan 25, 2026 by Johnsonms

Fix error in Blackwell document of referring to Mxf4 format as NVF4

#2977 opened Jan 23, 2026 by zianglih

fix(examples): fix device compatibility check for Ada FP8 GEMM (label: inactive-30d)

#2954 opened Jan 13, 2026 by w1ndseeker

Update profiler.md with how to use generator.py (label: inactive-30d)

#2943 opened Jan 10, 2026 by aidando73

cutlass profiler - align emitted SFA/SFB kernel naming with typical convention (label: inactive-30d)

#2942 opened Jan 10, 2026 by aidando73

Fix Warp Memory Access Arrangement in Epilogue: Upper Bound memory access width by output tile width (label: inactive-30d)

#2938 opened Jan 8, 2026 by lukas-ruettgers

add missing Mma-specialization for m16n8k32.s32.s4.s4.s32 (label: inactive-30d)

#2936 opened Jan 8, 2026 by dxqb • Draft

feat(examples/test_run): use runtime sm arch

#2916 opened Dec 31, 2025 by tpoisonooo

Fix finding cuDNN

#2890 opened Dec 19, 2025 by TLescoatTFX

docs: Add FP16 GEMM documentation to sgemm_sm80.cu - Fixes #1686 (label: inactive-30d)

#2870 opened Dec 10, 2025 by blueberrycongee

[WIP]Unit tests for Kernels that perform BF16 x BF16 = MXFP8 and MXFP8 x MXFP8 = BF16 (label: inactive-30d)

#2857 opened Dec 8, 2025 by Shreya-gaur

Remove redundant "from" from comment (label: inactive-30d)

#2853 opened Dec 8, 2025 by crcrpar

add SM75_16x8x8_F16F16F16F16_TN (label: inactive-30d)

#2851 opened Dec 6, 2025 by jinzhen-lin

use cp.async.bulk for per-row data; quiets synccheck (label: inactive-30d)

#2850 opened Dec 5, 2025 by v0i0
