Fixed Masking in HookedTransformer.generate #999
Open
tuomaso wants to merge 2 commits into TransformerLensOrg:main from …
Conversation
I've run into this problem as well. Due to no attention masking, the TransformerLens …
It may be best to update … However, I suspect this proposed solution should not be made the default. Existing users likely expect the current behavior, even though it is inconsistent with Transformers. Perhaps it would be best to enable the attention-masking behavior via a flag instead?
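A flag-gated version might look something like the sketch below. This is only a sketch, not the PR's diff: the parameter name `use_attention_mask` is hypothetical, and it assumes TransformerLens's `utils.get_attention_mask` helper (1 for real tokens, 0 for padding).

```python
# Sketch of gating the new masking behavior behind an opt-in flag.
# `use_attention_mask` is a hypothetical name, and utils.get_attention_mask
# is assumed to behave as in transformer_lens (1 = real token, 0 = pad).
import torch
from transformer_lens import utils


def mask_for_generate(tokenizer, tokens: torch.Tensor, use_attention_mask: bool):
    """Return an attention mask for generate(), or None to keep the old behavior."""
    if not use_attention_mask:
        # Default: existing users keep the current (unmasked) behavior.
        return None
    # Zero out pad positions; correct for left-padded batches.
    return utils.get_attention_mask(tokenizer, tokens, prepend_bos=False)
```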
Description
Currently, calling HookedTransformer.generate() with a batch of inputs (e.g. a list of two prompts) does not mask the padding tokens, so the batched outputs differ from the outputs of generating the shorter sequences one by one. The current code does not mask attention at all, so I implemented a ~5-line change to mask attention correctly using existing functions. This fix only works when padding_side="left", so I changed the default value for generate to "left"; I'm not sure how to fix this for right padding. After the fix, .generate() with a batch gives the same output as generating one by one (with do_sample=False).
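As a rough illustration of the mechanism described above (a hedged sketch, not the actual diff: it assumes `utils.get_attention_mask` and the `attention_mask` argument to `forward()`, and uses greedy decoding in place of the full sampling logic):

```python
# Sketch: mask pad tokens during batched greedy generation. Left padding is
# assumed, so the mask can simply be extended with 1s as tokens are appended.
import torch
from transformer_lens import HookedTransformer, utils


def greedy_generate(model: HookedTransformer, prompts, max_new_tokens: int = 8):
    # Left-pad so the last position of every row is a real token.
    tokens = model.to_tokens(prompts, padding_side="left")
    # 1 for real tokens, 0 for pad positions.
    mask = utils.get_attention_mask(model.tokenizer, tokens, prepend_bos=False)
    for _ in range(max_new_tokens):
        logits = model(tokens, attention_mask=mask)
        next_tok = logits[:, -1].argmax(dim=-1)  # greedy == do_sample=False
        tokens = torch.cat([tokens, next_tok[:, None]], dim=-1)
        # Newly generated tokens are always attended to, so extend with 1s.
        mask = torch.cat([mask, torch.ones_like(next_tok[:, None])], dim=-1)
    return tokens
```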
Fixes # (issue)
Type of change
Changing the default padding side and fixing the masking behavior will change some outputs, but only ones that were already broken.
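One way to see which outputs those are: with do_sample=False, batched generation for the shorter (padded) prompt previously diverged from single-prompt generation, and after the fix the two should agree. A minimal check, assuming a local gpt2 checkpoint (not taken from the PR):

```python
from transformer_lens import HookedTransformer

model = HookedTransformer.from_pretrained("gpt2")
prompts = ["The quick brown fox jumps over the", "Hi"]

# Batched: the shorter prompt is left-padded, so its pads must be masked.
batch_tokens = model.to_tokens(prompts, padding_side="left")
batched = model.generate(batch_tokens, max_new_tokens=8, do_sample=False)

# One at a time: no padding, so this is the reference behavior.
singles = [
    model.generate(model.to_tokens(p), max_new_tokens=8, do_sample=False)
    for p in prompts
]

# After the fix the decoded continuations should match (ignoring pad prefixes).
print(model.to_string(batched))
print([model.to_string(s) for s in singles])
```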
Checklist: