[bugfix] include image mode and size in tmp image cache key by he-yufeng · Pull Request #9605 · modelscope/ms-swift

he-yufeng · 2026-06-20T00:14:37Z

PR type

Bug Fix
New Feature
Document Updates
More Models or Datasets Support

PR information

Template._save_pil_image() keyed the temp image cache on sha256(image.tobytes()). Image.tobytes() returns only the flattened pixel stream, without the image mode, width, or height. Two images that share the same pixel bytes but differ in shape (e.g. 120x80 and 80x120) therefore hash to the same cache path. Since the method skips saving when the path already exists, the second image silently reuses the first image's PNG, so multimodal inference/training can read the wrong image.

This change prefixes the hash input with the image mode and size, so images with different modes or dimensions get distinct cache files. Behavior for any single image is unchanged (the file is still written once and reused on repeat).
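The collision can be illustrated without Pillow. The following is a minimal sketch, assuming a plain bytes buffer as a stand-in for image.tobytes(); old_key and new_key are hypothetical helper names, not functions in ms-swift, and only the meta format string is taken from this PR.

```python
import hashlib


def old_key(pixels: bytes) -> str:
    """Key derived from the pixel stream only (the pre-fix scheme)."""
    return hashlib.sha256(pixels).hexdigest()


def new_key(mode: str, width: int, height: int, pixels: bytes) -> str:
    """Key that prefixes mode and size, using the format string from this PR."""
    meta = f'{mode}-{width}x{height}-'.encode()
    return hashlib.sha256(meta + pixels).hexdigest()


# Stand-in for image.tobytes(): 120 * 80 RGB pixels (3 bytes each).
# The same buffer is a valid 120x80 image and a valid 80x120 image.
wide_pixels = bytes(i % 251 for i in range(120 * 80 * 3))
tall_pixels = wide_pixels

# Pixel-only keys cannot tell the two shapes apart.
old_collides = old_key(wide_pixels) == old_key(tall_pixels)

# Keys that include mode and size differ.
wide = new_key('RGB', 120, 80, wide_pixels)
tall = new_key('RGB', 80, 120, tall_pixels)
new_collides = wide == tall
```

Under these assumptions, old_collides is true and new_collides is false, which is the difference the fix relies on.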

Experiment results

Added test_save_pil_image_dimension_collision in tests/general/test_template.py: it builds two RGB images with identical pixel bytes but transposed dimensions, saves both, and asserts the cache paths differ and each saved file keeps its own size. The test fails on the previous hash and passes after this change.

$ python -m pytest tests/general/test_template.py::test_save_pil_image_dimension_collision -q
1 passed
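The shape of that check can be sketched with the standard library alone. This is not the test added in the PR: save_once, cache_key, and run are hypothetical helpers, the payload is a size header plus raw bytes standing in for an encoded PNG, and the write-once behavior is modeled on the description above.

```python
import hashlib
import os
import tempfile


def save_once(cache_dir: str, key: str, payload: bytes) -> str:
    """Hypothetical write-once cache: skip the write if the file exists."""
    path = os.path.join(cache_dir, f'{key}.bin')
    if not os.path.exists(path):
        with open(path, 'wb') as f:
            f.write(payload)
    return path


def cache_key(size: tuple, pixels: bytes, with_meta: bool) -> str:
    """Hash the pixels, optionally prefixed with mode and size."""
    meta = f'RGB-{size[0]}x{size[1]}-'.encode() if with_meta else b''
    return hashlib.sha256(meta + pixels).hexdigest()


def run(with_meta: bool) -> tuple:
    """Save a 120x80 and an 80x120 stand-in image; return names and stored sizes."""
    pixels = bytes(120 * 80 * 3)  # identical pixel bytes for both shapes
    names, stored = [], []
    with tempfile.TemporaryDirectory() as cache_dir:
        for size in [(120, 80), (80, 120)]:
            # The header records the size, as a saved PNG would.
            payload = f'{size[0]}x{size[1]}\n'.encode() + pixels
            path = save_once(cache_dir, cache_key(size, pixels, with_meta), payload)
            names.append(os.path.basename(path))
            with open(path, 'rb') as f:
                stored.append(f.readline().decode().strip())
    return names, stored


old_names, old_sizes = run(with_meta=False)
new_names, new_sizes = run(with_meta=True)
```

With the pixel-only key both saves resolve to one file and the second read returns the first image's size; with the prefixed key each file keeps its own size.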

gemini-code-assist

Code Review

This pull request prevents cache collisions for images that share the same flattened pixel bytes but differ in mode or dimensions by prepending metadata (mode, width, and height) to the image bytes before hashing. A unit test has been added to verify this fix. The reviewer suggested using incremental hashing with hasher.update() to avoid unnecessary memory overhead from copying the entire image byte stream during concatenation.


gemini-code-assist · 2026-06-20T00:15:25Z

+        meta = f'{image.mode}-{image.width}x{image.height}-'.encode()
+        img_hash = hashlib.sha256(meta + img_bytes).hexdigest()


Concatenating meta + img_bytes creates a new bytes object in memory, which copies the entire image byte stream. For large images, this can lead to unnecessary memory overhead and performance degradation. Instead, you can update the hash incrementally using hasher.update() to avoid this extra memory allocation.

Suggested change

-        meta = f'{image.mode}-{image.width}x{image.height}-'.encode()
-        img_hash = hashlib.sha256(meta + img_bytes).hexdigest()
+        meta = f'{image.mode}-{image.width}x{image.height}-'.encode()
+        hasher = hashlib.sha256(meta)
+        hasher.update(img_bytes)
+        img_hash = hasher.hexdigest()
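The two forms produce the same digest, because feeding a hash object several buffers in sequence is equivalent to hashing their concatenation. A small sketch, assuming a zero-filled buffer as a stand-in for image.tobytes() and a fixed meta value in the PR's format:

```python
import hashlib

meta = b'RGB-120x80-'
img_bytes = bytes(120 * 80 * 3)  # stand-in for image.tobytes()

# One-shot: builds a temporary bytes object of len(meta) + len(img_bytes).
one_shot = hashlib.sha256(meta + img_bytes).hexdigest()

# Incremental: feeds the two buffers in sequence without concatenating them.
hasher = hashlib.sha256(meta)
hasher.update(img_bytes)
incremental = hasher.hexdigest()

same_digest = one_shot == incremental
```

Since the digests match, switching to the incremental form would not change existing cache file names produced by the concatenating version.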

[bugfix] include image mode and size in tmp image cache key

7454301

gemini-code-assist Bot reviewed Jun 20, 2026

he-yufeng wants to merge 1 commit into modelscope:main from he-yufeng:fix/image-cache-hash-include-dimensions
