Skip to content

Conversation

@Priyjain-amd
Copy link

@Priyjain-amd Priyjain-amd commented Dec 16, 2025

Add support Int4OpaqueTensor from torchao in vLLM's LLaMA model implementation.
These changes enable vLLM to load models quantized with torchao using int4_packing_format="opaque" without errors during weight loading.

Encountered error: NotImplementedError: Int4OpaqueTensor dispatch: attempting to run unimplemented operator/function:
func=<OpOverload(op='aten.slice', overload='Tensor')>

Issue #3499

@pytorch-bot
Copy link

pytorch-bot bot commented Dec 16, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3494

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 16, 2025
block_size = list(self.block_size)
block_size[dim] = min(block_size[dim], new_shape[dim])

return Int4TilePackedTo4dTensor(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Int4OpqueTensor?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I accidentally committed the incorrect changes; they’re resolved now.

- Issue pytorch#3499

Signed-off-by: priyansh jain <[email protected]>

Change-Id: I0b2cb105c5eacdd251d159ae5f007b2118bb9ce7
@Priyjain-amd Priyjain-amd marked this pull request as ready for review December 18, 2025 05:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants