Phonetic vectorization? #13

This is more of a question than a bug report:

My use case is somewhat different from what the documentation covers. I want to search for similar-sounding syllables rather than match text character by character. My plan is to apply some sort of phonetic encoding to my corpus (e.g. Soundex or Metaphone), but I am not certain how to do this in a way that is compatible with neofuzz's Process. It doesn't look like scikit-learn provides an out-of-the-box vectorizer for phonetic encoding of text, and I'm not sure whether the SubWordVectorizer could somehow be leveraged for this.
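
To make the idea concrete, here is a rough, standard-library-only sketch of the kind of phonetic preprocessing I have in mind. The names `soundex` and `phonetic_analyzer` are hypothetical helpers made up for illustration and are not part of neofuzz or scikit-learn. It also encodes whole words with plain American Soundex rather than individual syllables, so it is only an approximation of the real goal.

```python
import re

# Letter -> digit table for American Soundex (vowels, H, W and Y have no code).
_CODES = {}
for _letters, _digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(word):
    """Return the 4-character Soundex code of `word` ('' if it has no ASCII letters)."""
    letters = [c for c in word.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    encoded = letters[0]
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters that share a code
        code = _CODES.get(ch, "")
        if code and code != prev:
            encoded += code
        prev = code  # a vowel resets prev, so a repeated code after it counts again
    return (encoded + "000")[:4]  # pad with zeros, then truncate to 4 characters


def phonetic_analyzer(text):
    """Hypothetical analyzer: split text into words and return one Soundex code per word."""
    return [soundex(token) for token in re.findall(r"[A-Za-z]+", text)]


print(phonetic_analyzer("Robert Rupert Ashcraft"))
```

My assumption is that a callable like `phonetic_analyzer` could be passed as the `analyzer` argument of scikit-learn's `CountVectorizer` or `TfidfVectorizer`, and that the resulting vectorizer could then be handed to `Process`. I have not verified that this works with neofuzz.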

Any pointers on how to achieve this with neofuzz?
