[bugfix] guard _repair_ms_bench against an empty messages list by he-yufeng · Pull Request #9608 · modelscope/ms-swift

he-yufeng · 2026-06-20T15:29:45Z

Summary

_repair_ms_bench (the repair_messages hook for the iic/ms_bench dataset) reads messages[0] right after ast.literal_eval, so a row whose messages are empty — e.g. the literal string "[]" — raises IndexError: list index out of range and aborts the whole ms_bench dataset load instead of just skipping that one row.

The function already returns None to skip rows it can't use (the MOSS case), and MessagesPreprocessor drops None rows, so returning None for an empty list is the same, consistent behaviour.

Fix

An early if not messages: return None guard right after the parse.

Test

Added tests/general/test_repair_ms_bench.py (pure unit tests, no network): empty "[]" / [] returns None, the default system message is stripped, a normal conversation passes through, and a MOSS row is skipped. The empty case raises IndexError without the fix.

Verified locally: pytest tests/general/test_repair_ms_bench.py (4 passed); flake8 / isort / yapf clean.

_repair_ms_bench reads messages[0] right after ast.literal_eval, so a row whose messages are empty (e.g. the string "[]") raises IndexError and aborts the whole ms_bench dataset load instead of just skipping that row. The function already returns None to skip rows it can't use (the MOSS case), so do the same for an empty list. Added pure unit tests for the empty, default-system, normal and MOSS cases.

gemini-code-assist

Code Review

This pull request adds a safety check to the _repair_ms_bench function in swift/dataset/dataset/llm.py to return None when messages is empty, preventing potential index out of bounds errors. Additionally, a comprehensive suite of unit tests has been introduced in tests/general/test_repair_ms_bench.py to verify this behavior and other edge cases. There are no review comments, and the changes look solid.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist Bot reviewed Jun 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bugfix] guard _repair_ms_bench against an empty messages list#9608

[bugfix] guard _repair_ms_bench against an empty messages list#9608
he-yufeng wants to merge 1 commit into
modelscope:mainfrom
he-yufeng:fix/repair-ms-bench-empty-messages

he-yufeng commented Jun 20, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

he-yufeng commented Jun 20, 2026

Summary

Fix

Test

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant