This repository provides a unified interface for comparing genomic language models. It covers loading models, generating embeddings, and saving outputs.
Some models, such as BERTax, require specific methods for inference, while most others are accessible via the Hugging Face Transformers library.
Evaluation uses data derived from the Scorpio-Gene-Taxa dataset.