@vishal-bala commented Dec 16, 2025

This PR generalizes the BaseVectorizer to be agnostic to input modality (it previously supported text inputs exclusively). Building on the new base, it then extends several vectorizers to support multimodal embeddings and renames them so they are no longer text-specific.

BaseVectorizer

Because the BaseVectorizer no longer explicitly expects text inputs, the signature of the embed methods changes from vectorizer.embed(text="lorem ipsum...") to vectorizer.embed(content="lorem ipsum..."). This is a breaking change for existing callers that pass the text keyword argument; those call sites will need to be updated to match the new schema.
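For illustration, a minimal before/after sketch of the keyword change (the model name here is only a placeholder):

from redisvl.utils.vectorize import HFTextVectorizer

vectorizer = HFTextVectorizer(model="sentence-transformers/all-MiniLM-L6-v2")

# Before this PR: the text-only keyword
embedding = vectorizer.embed(text="lorem ipsum...")

# After this PR: the modality-agnostic keyword
embedding = vectorizer.embed(content="lorem ipsum...")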

Caching for multimodal embeddings is supported for all vectorizers introduced in this PR.
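As a minimal sketch of what a cached multimodal vectorizer could look like, assuming the existing cache parameter on vectorizers and the EmbeddingsCache extension work unchanged with the renamed classes:

from redisvl.extensions.cache.embeddings import EmbeddingsCache
from redisvl.utils.vectorize import VoyageAIVectorizer

# Assumption: the cache wiring is unchanged by this PR; only the vectorizer class is new.
cache = EmbeddingsCache(name="vectorizer_cache", redis_url="redis://localhost:6379")
vectorizer = VoyageAIVectorizer(
    model="voyage-multimodal-3.5",
    api_config={"api_key": "your-voyageai-api-key"},
    cache=cache
)

# Repeated calls with the same content should be served from the cache rather than the API.
embedding = vectorizer.embed(content="your input query text here")
embedding = vectorizer.embed(content="your input query text here")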

Multimodal Implementations

The following vectorizers have been renamed so they are no longer explicitly text vectorizers and are no longer defined in the vectorize.text module. Imports and usages of these vectorizers will need to be updated to avoid errors. The CustomTextVectorizer has likewise been renamed and moved to redisvl.utils.vectorize.custom.CustomVectorizer; a usage sketch follows.
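A hedged sketch of the renamed custom vectorizer, assuming its constructor still accepts user-supplied embed/embed_many callables as CustomTextVectorizer did:

from redisvl.utils.vectorize import CustomVectorizer

# Stand-ins for real embedding functions; the three-element vectors are illustrative only.
def my_embed(content):
    return [0.1, 0.2, 0.3]

def my_embed_many(contents):
    return [my_embed(c) for c in contents]

vectorizer = CustomVectorizer(embed=my_embed, embed_many=my_embed_many)
embedding = vectorizer.embed("Hello, world!")
embeddings = vectorizer.embed_many(["Hello", "world!"])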

VoyageAI

Old: redisvl.utils.vectorize.text.voyageai.VoyageAITextVectorizer
New: redisvl.utils.vectorize.voyageai.VoyageAIVectorizer

from redisvl.utils.vectorize import VoyageAIVectorizer

# --- Basic usage
vectorizer = VoyageAIVectorizer(
    model="voyage-3-large",
    api_config={"api_key": "your-voyageai-api-key"} # OR set VOYAGE_API_KEY in your env
)
query_embedding = vectorizer.embed(
    content="your input query text here",
    input_type="query"
)
doc_embeddings = vectorizer.embed_many(
    contents=["your document text", "more document text"],
    input_type="document"
)

# --- Multimodal usage - requires Pillow and voyageai>=0.3.6 (for video)
from PIL import Image
from voyageai.video_utils import Video

vectorizer = VoyageAIVectorizer(
    model="voyage-multimodal-3.5",
    api_config={"api_key": "your-voyageai-api-key"} # OR set VOYAGE_API_KEY in your env
)

# text
text_embedding = vectorizer.embed(
    content="your input query text here",
    input_type="query"
)

# image
image_embedding = vectorizer.embed_image(
    "path/to/your/image.jpg",
    input_type="query"
)
image_embedding = vectorizer.embed(
    Image.open("path/to/your/image.jpg"),
    input_type="query"
)

# video
video_embedding = vectorizer.embed_video(
    "path/to/your/video.mp4",
    input_type="document"
)
video_embedding = vectorizer.embed(
    Video.from_path("path/to/your/video.mp4", model=vectorizer.model),
    input_type="document"
)

Vertex AI

Old: redisvl.utils.vectorize.text.vertexai.VertexAITextVectorizer
New: redisvl.utils.vectorize.vertexai.VertexAIVectorizer

from redisvl.utils.vectorize import VertexAIVectorizer

# Basic usage

vectorizer = VertexAIVectorizer(
    model="textembedding-gecko",
    api_config={
        "project_id": "your_gcp_project_id", # OR set GCP_PROJECT_ID
        "location": "your_gcp_location",     # OR set GCP_LOCATION
    })
embedding = vectorizer.embed("Hello, world!")

# Multimodal usage
from vertexai.vision_models import Image, Video

vectorizer = VertexAIVectorizer(
    model="multimodalembedding@001",
    api_config={
        "project_id": "your_gcp_project_id", # OR set GCP_PROJECT_ID
        "location": "your_gcp_location",     # OR set GCP_LOCATION
    }
)
text_embedding = vectorizer.embed("Hello, world!")

image_embedding = vectorizer.embed(Image.load_from_file("path/to/your/image.jpg"))
image_embedding = vectorizer.embed_image("path/to/your/image.jpg")

video_embedding = vectorizer.embed(Video.load_from_file("path/to/your/video.mp4"))
video_embedding = vectorizer.embed_video("path/to/your/video.mp4")

Amazon Bedrock

Old: redisvl.utils.vectorize.text.bedrock.BedrockTextVectorizer
New: redisvl.utils.vectorize.bedrock.BedrockVectorizer

from redisvl.utils.vectorize import BedrockVectorizer

vectorizer = BedrockVectorizer(
    model="amazon.titan-embed-text-v2:0",
    api_config={
        "aws_access_key_id": "your_access_key",
        "aws_secret_access_key": "your_secret_key",
        "aws_region": "us-east-1"
    }
)

embedding = vectorizer.embed("Hello, world!")

# Multimodal usage
from pathlib import Path
from PIL import Image

vectorizer = BedrockVectorizer(
    model="amazon.titan-embed-image-v1:0",
    api_config={
        "aws_access_key_id": "your_access_key",
        "aws_secret_access_key": "your_secret_key",
        "aws_region": "us-east-1"
    }
)
image_embedding = vectorizer.embed(Path("path/to/your/image.jpg"))
image_embedding = vectorizer.embed(Image.open("path/to/other/image.png"))
image_embedding = vectorizer.embed_image("path/to/your/image.jpg")

# Embedding a list of mixed modalities
embeddings = vectorizer.embed_many(
    ["Hello", "world!", Path("path/to/your/image.jpg"), Image.open("path/to/other/image.png")],
    batch_size=2
)

Hugging Face

While the sentence-transformers package does not explicitly support multimodal usage (it is designed for text-based use cases), some officially supported multimodal models work without issue via the SentenceTransformer class. This PR removes the strict enforcement of text inputs in the HFTextVectorizer to enable these use cases.

from PIL import Image
from redisvl.utils.vectorize import HFTextVectorizer

vectorizer = HFTextVectorizer(model="sentence-transformers/clip-ViT-L-14")
embeddings1 = vectorizer.embed("Hello, world!")
embeddings2 = vectorizer.embed(Image.open("path/to/your/image.jpg"))

Open Topics

Since this PR introduces a few breaking changes, do we want to maintain backwards compatibility (with deprecation warnings) for the syntax that is changing? This includes the items below; a sketch of one possible shim follows the list:

  • vectorizer.embed(text=...) -> vectorizer.embed(content=...)
  • VoyageAITextVectorizer -> VoyageAIVectorizer
  • VertexAITextVectorizer -> VertexAIVectorizer
  • BedrockTextVectorizer -> BedrockVectorizer
  • CustomTextVectorizer -> CustomVectorizer
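
For the first item, one possible (hypothetical) shim would accept the old text keyword, emit a deprecation warning, and forward it to content:

import warnings

from redisvl.utils.vectorize import HFTextVectorizer

# Hypothetical sketch only; this is not the actual implementation.
class CompatVectorizer(HFTextVectorizer):
    def embed(self, content=None, *, text=None, **kwargs):
        if text is not None:
            warnings.warn(
                "The 'text' keyword is deprecated; use 'content' instead.",
                DeprecationWarning,
                stacklevel=2,
            )
            content = text
        return super().embed(content, **kwargs)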

@vishal-bala self-assigned this Dec 16, 2025
Base automatically changed from feat/RAAE-1236/hybrid-search to main December 16, 2025 18:46
@vishal-bala marked this pull request as ready for review December 16, 2025 18:48
@justin-cechmanek left a comment
All the changes look good. Not sure how to best handle the breaking changes around the vectorizer class names and text/content parameter name.

@justin-cechmanek

We can maintain backward compatibility and a deprecation warning by having wrapper classes that extend the newly changed vectorizers.
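
As a sketch of that approach (the class and warning text are illustrative, not the actual implementation):

import warnings

from redisvl.utils.vectorize import VoyageAIVectorizer

class VoyageAITextVectorizer(VoyageAIVectorizer):
    """Deprecated alias kept for backward compatibility with pre-rename imports."""

    def __init__(self, **kwargs):
        warnings.warn(
            "VoyageAITextVectorizer is deprecated; use VoyageAIVectorizer instead.",
            DeprecationWarning,
            stacklevel=2,
        )
        super().__init__(**kwargs)

The same pattern would apply to the other renamed classes, and a keyword alias inside embed() could similarly map text= to content= during a deprecation window.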
