feat: add wan2.2_t2v model and quantization config by Charles2530 · Pull Request #454 · ModelTC/LightCompress

Charles2530 · 2026-03-10T03:00:29Z

Add wan2.2_t2v model and quant configuration, corresponding config and script changes

Add a small test script to load sharded safetensors from a Hugging Face repo/local dir and print parameter keys with shapes. Made-with: Cursor

…sformer experts Add support for skipping quantization on specified transformer blocks (block_ids: [0, 40] → block 0 of transformer and transformer_2) to improve quality of the two highest-impact blocks. Changes: - base_blockwise_quantization.py: add _get_ignored_block_ids_set and _is_ignored_block helpers; modify set_no_quant_layer to skip all linear layers when layer_names is empty; modify run to skip block_transform for ignored blocks so AWQ scales are not applied - configs/…/awq_w_a_skip_first.yaml: new config with ignored_layers block_ids [0, 40] and separate save_path Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…dance

CLAUDE.md

llmc/compression/quantization/base_blockwise_quantization.py

docs/wan2.1_quantization_guide.md

…uant/run config Made-with: Cursor

Made-with: Cursor

Charles2530

solve problem above in comments

Made-with: Cursor

gushiqiao · 2026-03-30T08:27:19Z

可以解一下合并冲突，然后就可以merge了

Charles2530 · 2026-03-31T00:33:42Z

你好，已经解决冲突了

JiwaniZakir

The changes to wan_i2v/awq_w_a.yaml, wan_t2v/awq_w_a.yaml, wan_t2v/rtn_w_a.yaml, and wan_t2v/smoothquant_w_a.yaml are purely removing trailing newlines (introducing \ No newline at end of file), which is a regression in the existing files unrelated to the stated goal of this PR and goes against POSIX file conventions.

The new wan2_2_t2v/awq_w_a.yaml uses type: Wan2T2V, whereas the existing wan_t2v configs use type: WanT2V — the diff doesn't include any code registering or implementing the Wan2T2V model class, so it's unclear whether this will resolve correctly at runtime or silently fall back to an incorrect handler.

The newly added docs/wan2.1_quantization_guide.md documents Wan2.1 models (WanI2V, WanT2V) exclusively, but this PR introduces Wan2.2 support (wan2_2_t2v). The guide should either be updated to cover Wan2.2 specifics (notably guidance_scale_2, which appears only in the new config) or a separate doc should be added, since guidance_scale_2: 3.0 in the calib/eval sections is a new parameter with no explanation anywhere in the documentation.

The wan2_2_t2v directory only ships an AWQ config, whereas the existing wan_t2v directory also provides RTN and SmoothQuant variants. If those methods are also supported for Wan2.2, the missing configs should be included for consistency; if not, a comment explaining the omission would be helpful.

Charles2530 and others added 11 commits March 10, 2026 10:57

feat: add wan2.2_t2v model and quantization config

70d3676

feat: wan2.2-t2v quantization configs and model updates

84f89f9

Wan2.2: MoE calibration split, blockwise input, OOM fixes and config

715104f

update wan2.2

02b4133

feat: add HF state_dict print tool

6e2dddb

Add a small test script to load sharded safetensors from a Hugging Face repo/local dir and print parameter keys with shapes. Made-with: Cursor

debug by claude

57df671

wan2.2: use official Wan2.2 backend for A14B import and native save

1a998a0

chore: update skip-first quant task notes and run script

df9a09b

fix(wan2.2): enforce native save structure and align default dual gui…

f261203

…dance

fix(wan): preserve catcher kwargs forwarding during calibration

007360e

gushiqiao reviewed Mar 30, 2026

View reviewed changes

CLAUDE.md Outdated Show resolved Hide resolved

gushiqiao reviewed Mar 30, 2026

View reviewed changes

llmc/compression/quantization/base_blockwise_quantization.py Outdated Show resolved Hide resolved

gushiqiao reviewed Mar 30, 2026

View reviewed changes

llmc/compression/quantization/base_blockwise_quantization.py Outdated Show resolved Hide resolved

gushiqiao reviewed Mar 30, 2026

View reviewed changes

docs/wan2.1_quantization_guide.md Show resolved Hide resolved

Charles2530 force-pushed the feat/wan2.2-t2v branch from 088d80d to 007360e Compare March 30, 2026 06:06

Charles2530 and others added 5 commits March 30, 2026 14:07

Delete CLAUDE.md

5aee498

docs/wan2.2 + refactor(wan2.2): move native save helpers and update q…

5a51ded

…uant/run config Made-with: Cursor

chore: tidy wan2.2 docs and quant exports

e0fc7d4

Made-with: Cursor

refactor(wan2.2): move Wan2.2 save logic; update wan_t2v configs

0cdfa67

Made-with: Cursor

chore: update wan i2v/t2v configs and eval defaults

3349931

Made-with: Cursor

Charles2530 commented Mar 30, 2026

View reviewed changes

Charles2530 and others added 2 commits March 30, 2026 14:53

chore(wan2.2): add awq_w_a placeholders

fb0e364

Made-with: Cursor

Delete tools/print_state_dict_hf.py

366478b

Merge branch 'main' into feat/wan2.2-t2v

e3e1243

JiwaniZakir reviewed Apr 1, 2026

View reviewed changes

gushiqiao approved these changes Apr 1, 2026

View reviewed changes

gushiqiao merged commit 9701d03 into ModelTC:main Apr 1, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add wan2.2_t2v model and quantization config#454

feat: add wan2.2_t2v model and quantization config#454
gushiqiao merged 19 commits intoModelTC:mainfrom
Charles2530:feat/wan2.2-t2v

Charles2530 commented Mar 10, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Charles2530 left a comment

Uh oh!

gushiqiao commented Mar 30, 2026

Uh oh!

Charles2530 commented Mar 31, 2026

Uh oh!

JiwaniZakir left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Charles2530 commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Charles2530 left a comment

Choose a reason for hiding this comment

Uh oh!

gushiqiao commented Mar 30, 2026

Uh oh!

Charles2530 commented Mar 31, 2026

Uh oh!

JiwaniZakir left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Charles2530 commented Mar 10, 2026 •

edited

Loading