Bugfix - Add SM80/90 architecture to CUTLASS builds for CUDA 12.8+ for A100#785
Bugfix - Add SM80/90 architecture to CUTLASS builds for CUDA 12.8+ for A100#785
Conversation
Add sm_80 to the CUTLASS architecture lists to ensure Ampere GPU support is included in builds targeting CUDA 12.8+ and 12.9+.
There was a problem hiding this comment.
Pull request overview
Updates the CUTLASS build configuration to ensure Ampere (A100 / SM80) is included in CUTLASS builds when using CUDA 12.8+.
Changes:
- Add
sm_80to the CUTLASSARCHSlist for CUDA>= 12.8. - Expand the CUDA
>= 12.9CUTLASSARCHSlist to include80and90in addition to100/103.
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #785 +/- ##
=======================================
Coverage 85.70% 85.70%
=======================================
Files 102 102
Lines 7703 7703
=======================================
Hits 6602 6602
Misses 1101 1101
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
There was a problem hiding this comment.
Pull request overview
Copilot reviewed 1 out of 1 changed files in this pull request and generated 1 comment.
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
| if [ -d cutlass ]; then rm -rf cutlass; fi | ||
| git clone --branch v4.1.0 --depth 1 https://github.com/NVIDIA/cutlass.git && cd cutlass | ||
| else ifeq ($(shell echo $(CUDA_VER)">=12.8" | bc -l), 1) | ||
| $(eval ARCHS := "90;100") | ||
| $(eval ARCHS := "80;90;100") | ||
| if [ -d cutlass ]; then rm -rf cutlass; fi | ||
| git clone --branch v3.9.2 --depth 1 https://github.com/NVIDIA/cutlass.git && cd cutlass |
Add sm_80 to the CUTLASS architecture lists to ensure Ampere GPU support is included in builds targeting CUDA 12.8+ for A100