feat(litellm): add flatten content option for auto_router compatibility #11133
base: main
When both `litellmFlattenContent` (enabled by default) and `litellmUsePromptCache` are enabled with a model that supports prompt caching, the flattening is undone. The flattening converts array content to strings here, but lines 194-207 below wrap string content back into arrays to add `cache_control`. This means auto_router users who also enable prompt caching will still encounter the embedding model failure this PR intends to fix. Consider either moving the flattening after the prompt caching logic, or having the prompt caching logic skip re-wrapping when flattening is enabled.
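
To make the ordering problem concrete, here is a minimal sketch of the interaction and of the two suggested fixes. It is not the PR's code: the `Message` and `TextPart` types, the `flattenContent` and `addCacheControl` helpers, the `skipRewrap` parameter, and the newline join separator are all hypothetical stand-ins chosen for illustration.

```typescript
// Hypothetical message shapes, loosely modelled on chat-completion messages.
type TextPart = { type: "text"; text: string; cache_control?: { type: "ephemeral" } };
type Message = { role: "system" | "user" | "assistant"; content: string | TextPart[] };

// Hypothetical stand-in for the flattening step: collapse an array of
// text parts into a single string (the separator is an assumption).
function flattenContent(msg: Message): Message {
  if (typeof msg.content === "string") return msg;
  return { ...msg, content: msg.content.map((p) => p.text).join("\n") };
}

// Hypothetical stand-in for the prompt caching step. By default it wraps
// string content back into an array so it can attach cache_control.
// With skipRewrap = true it leaves string content untouched.
function addCacheControl(msg: Message, skipRewrap: boolean): Message {
  if (typeof msg.content === "string") {
    if (skipRewrap) return msg;
    return {
      ...msg,
      content: [{ type: "text", text: msg.content, cache_control: { type: "ephemeral" } }],
    };
  }
  // Array content: mark only the last part as the cache breakpoint.
  const parts = msg.content.map((p, i, arr): TextPart =>
    i === arr.length - 1 ? { ...p, cache_control: { type: "ephemeral" } } : p,
  );
  return { ...msg, content: parts };
}

const input: Message = {
  role: "user",
  content: [
    { type: "text", text: "hello" },
    { type: "text", text: "world" },
  ],
};

// Order described in the review: flatten first, then the caching step
// re-wraps the string, so the content is an array again.
const undone = addCacheControl(flattenContent(input), false);

// Suggestion 1: run the flattening after the prompt caching logic.
const flattenedLast = flattenContent(addCacheControl(input, false));

// Suggestion 2: keep the order, but skip re-wrapping when flattening is on.
const skipped = addCacheControl(flattenContent(input), true);

console.log(Array.isArray(undone.content));
console.log(typeof flattenedLast.content);
console.log(typeof skipped.content);
```

In this sketch both suggestions leave the message with plain string content, but neither keeps a `cache_control` marker on it. Whether prompt caching still has any effect for flattened messages would need to be checked against the actual implementation.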