AMD/ROCm - Fix VAE_KL_MEM_RATIO overestimation for modern ROCm#12685
Conversation
📝 Walkthrough: This pull request changes the VAE memory ratio constant for AMD GPU devices in the VAE initialization path, reducing it from 2.73 to 1.3.
🚥 Pre-merge checks: 2 passed, 1 warning.
Actionable comments posted: 1
Inline comments:
In `comfy/sd.py`:
- Around lines 442-445: add a brief inline comment explaining the rationale for the VAE_KL_MEM_RATIO override when running on AMD (`model_management.is_amd()`). Document that 1.3 is a conservative ~30% safety margin for modern ROCm (e.g., ROCm 7.x) and reference the tracking issue or PR (for example "see issue `#2`"). Place this comment immediately adjacent to the `VAE_KL_MEM_RATIO = 1.3` assignment so future maintainers understand why it differs from the original 2.73/1.0 values.
I remember the 2.73 value came from last October (v0.3.65), about three weeks after ROCm 6.4.4 with PyTorch for Windows came out. It shipped in the same release as cuDNN/MIOpen being disabled for AMD, though according to comfyanonymous the two were unrelated. I tried a 1.3 value with SDXL but, surprisingly, didn't see a difference in peak VRAM usage during VAE decode. It didn't break anything either, though.
VAE_KL_MEM_RATIO is set to 2.73 for AMD/ROCm in comfy/sd.py. This value was introduced for older ROCm versions where memory overhead was significantly higher. On modern ROCm (7.x), this massively overestimates VRAM requirements for VAE operations, causing ComfyUI to unnecessarily offload models from VRAM before VAE encoding/decoding.
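For context, a minimal sketch of how a multiplier like this feeds a VRAM estimate; the function name, parameters, and workload size below are hypothetical, and ComfyUI's actual estimation code differs:

```python
def estimate_vae_memory(latent_elements: int, bytes_per_element: int,
                        mem_ratio: float) -> int:
    """Rough VRAM estimate for one VAE pass (illustrative only).

    The base activation footprint is multiplied by mem_ratio as a safety
    margin; an oversized ratio inflates the estimate, so the model manager
    frees other models from VRAM more eagerly than it needs to.
    """
    base_bytes = latent_elements * bytes_per_element
    return int(base_bytes * mem_ratio)

# Same workload, two ratios: 2.73 requests roughly 2.1x the VRAM that 1.3 does.
workload = 64 * 1024 * 1024  # latent elements (arbitrary example size)
old_estimate = estimate_vae_memory(workload, 4, 2.73)
new_estimate = estimate_vae_memory(workload, 4, 1.3)
```

On a card where the real VAE pass fits comfortably, the inflated estimate alone is enough to trigger offloading, which matches the behavior described above.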
Impact: On GPUs with limited VRAM (8-16GB), this overestimation may cause frequent unnecessary model offloading, significantly impacting performance. On larger GPUs (32GB) the impact is less noticeable but still causes suboptimal memory management.
Tested on: AMD Radeon AI PRO R9700 (32GB VRAM, gfx1201), ROCm 7.2, Windows and Linux
Fix: A value of 1.0 worked correctly with no OOM errors. 1.3 is suggested as a conservative value to maintain a safety margin for older hardware or ROCm versions.
Change: comfy/sd.py: VAE_KL_MEM_RATIO = 2.73 → 1.3 for AMD
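The change itself might look like the following sketch. The `is_amd()` check is taken from the review comment above; the stub class, the non-AMD default value, and the comment wording are assumptions, not the actual ComfyUI source:

```python
class _ModelManagementStub:
    """Stand-in for comfy.model_management (hypothetical stub)."""
    @staticmethod
    def is_amd() -> bool:
        return True  # pretend we are on an AMD/ROCm device


model_management = _ModelManagementStub()

VAE_KL_MEM_RATIO = 1.0  # illustrative non-AMD default
if model_management.is_amd():
    # Conservative ~30% safety margin for modern ROCm (7.x); the old 2.73
    # dated from ROCm 6.4.4-era overhead and caused unnecessary model
    # offloading before VAE encode/decode (see this PR, #12685).
    VAE_KL_MEM_RATIO = 1.3
```

Keeping the rationale comment directly on the assignment addresses the review feedback as well as the code change.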
Related: WAN 2.2 i2v: Second run is 4 to 5 times slower on AMD GPU (ROCm) #12672