Conversation
Codecov Report ❌ Patch coverage is
@gemini-cli /review
🤖 Hi @RissyRan, I've received your request, and I'm working on it now! You can track my progress in the logs for more details.
This pull request introduces a JAX implementation of the DeepSeek Engram layer, along with comprehensive unit tests that validate its behavior against a PyTorch reference. The code is well-structured and the implementation appears to be correct and thorough. The core logic is sound, and the use of vectorization with nnx.vmap is a good practice for performance.
🔍 General Feedback
- Good Testing: The inclusion of unit tests comparing the JAX implementation to a PyTorch reference is excellent. This provides high confidence in the correctness of the implementation.
- Clear Implementation: The code in `engram.py` is well-commented and organized, making it easy to follow the logic from the original paper.
- TODOs: I've commented on the TODOs left in the code. Addressing them will improve the clarity and robustness of the implementation.
🤖 I'm sorry @RissyRan, but I was unable to process your request. Please see the logs for more details.
RissyRan left a comment
I reviewed the test and `CompressedTokenizer`. I will continue reviewing the rest tomorrow.
RissyRan left a comment
Thanks for the change! I left some initial comments and may need to go over the multi-head embedding and conv parts. It should be quick.
RissyRan left a comment
LGTM in general! Just a few minor comments. I will try integrating this change with a decoder layer tomorrow and see how it goes.
Description
Background
What this PR does
Add Engram layer in `engram.py`:
- `NgramHashMapping` (non-parametric): `CompressedTokenizer` + hashing logic; converts "input_id" to "ngram hash_token_id"
- `CompressedTokenizer` (non-parametric): converts "input_id" to "compressed_input_id"
- `Engram` (multi-branch): inputs are "ngram hash_token_id" and "transformer state"; combines `MultiHeadEmbedding` (embedding lookup using the hash id as static memory) + context-aware gating (dot product of static memory with the contextual state) + `ShortConv` (temporal smoothing)
- `MultiHeadEmbedding`: converts ngram hash_token_id to an ngram embedding vector
- `ShortConv` (multi-branch): depthwise (mixes time steps, does not mix channels), causal; "short" means the kernel size is small

Add unit test:
`tests.unit.engram_vs_reference_test`

Implementation Notes
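The context-aware gating idea above (a static n-gram memory embedding scaled by a gate computed from its dot product with the contextual transformer state) could look roughly like the sketch below. All names, shapes, and the sigmoid/scaling choices are illustrative assumptions, not the actual `Engram` code:

```python
import jax
import jax.numpy as jnp

# Hypothetical sketch of context-aware gating: gate the static n-gram
# memory by its (scaled) dot product with the contextual state.
def gated_memory(memory: jnp.ndarray, state: jnp.ndarray) -> jnp.ndarray:
    """memory, state: [seq, dim]. Returns gated memory, same shape."""
    # Per-position dot product, scaled and squashed into (0, 1).
    gate = jax.nn.sigmoid(
        jnp.sum(memory * state, axis=-1, keepdims=True)
        / jnp.sqrt(memory.shape[-1])
    )
    return gate * memory

mem = jnp.ones((4, 8))    # static memory from embedding lookup
st = jnp.zeros((4, 8))    # contextual transformer state
out = gated_memory(mem, st)
print(out.shape)  # (4, 8)
```

With a zero state the dot product is zero, so the gate is sigmoid(0) = 0.5 and the memory is halved; a state aligned with the memory pushes the gate toward 1.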
Placement of `NgramHashMapping`:
- `NgramHashMapping` converts vanilla token-ids to hashed ngram token-ids, which `Engram` consumes for embedding lookup
- `NgramHashMapping` and hash_input_ids generation can be put in the data input pipeline, which is CPU intensive, just like how we put the tokenizer and input_ids generation in the pipeline.
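Since this mapping is non-parametric and CPU-bound, it can run in the input pipeline much like tokenization. A minimal sketch of hashing trailing n-grams of token ids into a fixed-size id space follows; the multiplier constant, n-gram handling, and table size are illustrative assumptions, not the scheme in `engram.py`:

```python
import numpy as np

# Hypothetical multiplicative n-gram hash; the real NgramHashMapping may
# use different constants, n-gram orders, and table sizes.
def ngram_hash_ids(input_ids: np.ndarray, n: int, table_size: int,
                   multiplier: int = 1000003) -> np.ndarray:
    """Map each position's trailing n-gram of token ids to a hash id.

    input_ids: int array of shape [seq_len] (e.g. compressed token ids).
    Returns an int array of shape [seq_len] with values in [0, table_size).
    """
    seq_len = input_ids.shape[0]
    hashed = np.zeros(seq_len, dtype=np.int64)
    for t in range(seq_len):
        h = 0
        # Combine the n tokens ending at position t (causal: no lookahead).
        for k in range(max(0, t - n + 1), t + 1):
            h = (h * multiplier + int(input_ids[k])) % table_size
        hashed[t] = h
    return hashed

ids = np.array([5, 9, 5, 9, 5])
print(ngram_hash_ids(ids, n=2, table_size=1 << 20))
```

Identical trailing n-grams map to the same hash id, so the downstream embedding lookup treats repeated n-grams as the same static-memory slot.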
Multi-branch
- `Engram` and `ShortConv` handle multi-branch input and multi-branch output (if `mhc_expansion_rate > 1`), using nnx.vmap for an independent norm per branch
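The per-branch norm idea can be illustrated with plain `jax.vmap` over stacked per-branch parameters (the PR itself uses `nnx.vmap`; the RMSNorm definition, branch count, and shapes here are illustrative assumptions):

```python
import jax
import jax.numpy as jnp

# Illustration of an independent norm per branch via vmap over the
# leading branch axis of both the activations and the scale parameters.
def rms_norm(x, scale, eps=1e-6):
    # x: [seq, dim]; scale: [dim]
    rms = jnp.sqrt(jnp.mean(jnp.square(x), axis=-1, keepdims=True) + eps)
    return x / rms * scale

# Stacked inputs and scales over the branch axis (mhc_expansion_rate = 3).
branches = jax.random.normal(jax.random.PRNGKey(0), (3, 8, 16))  # [branch, seq, dim]
scales = jnp.ones((3, 16))                                        # one scale per branch

# Each branch is normalized with its own parameters, independently.
per_branch_norm = jax.vmap(rms_norm, in_axes=(0, 0))
out = per_branch_norm(branches, scales)
print(out.shape)  # (3, 8, 16)
```

The vmapped call keeps the branches fully decoupled: no statistics or parameters are shared across the branch axis.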
Tests
- unit test against the reference implementation
- log: https://paste.googleplex.com/5905570101067776
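The depthwise, causal `ShortConv` described above could be sketched as follows; the function signature, parameter shapes, and kernel size are illustrative assumptions, not the code in `engram.py`:

```python
import jax
import jax.numpy as jnp

# Hedged sketch of a depthwise causal short convolution.
def short_conv(x: jnp.ndarray, kernel: jnp.ndarray) -> jnp.ndarray:
    """x: [batch, seq, channels]; kernel: [kernel_size, channels].

    Depthwise: each channel is convolved with its own filter (time steps
    are mixed, channels are not). Causal: only past positions contribute,
    via left padding of kernel_size - 1.
    """
    k, c = kernel.shape
    rhs = kernel[:, None, :]  # [kernel_size, in_per_group=1, out=channels]
    return jax.lax.conv_general_dilated(
        x, rhs,
        window_strides=(1,),
        padding=[(k - 1, 0)],          # left-pad only -> causal
        dimension_numbers=('NWC', 'WIO', 'NWC'),
        feature_group_count=c,         # depthwise: one group per channel
    )

x = jnp.ones((1, 5, 4))
kernel = jnp.ones((3, 4)) / 3.0  # "short": small kernel size (e.g. 3)
y = short_conv(x, kernel)
print(y.shape)  # (1, 5, 4)
```

With all-ones input and an averaging kernel, the first outputs are smaller (they only see zero padding plus the few real past steps), which makes the causal left-padding easy to verify.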
Checklist
Before submitting this PR, please make sure (put X in square brackets):
`gemini-review` label.