Change the repository type filter
All
Repositories list
40 repositories
cisnlp.github.io
Public- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
Language-Mixing
Publicmanchu-in-context-mt
PublicGlotWeb
Public🕸 GlotWeb: Web Indexing for Low-Resource Languages -- under construction.MIB-circuit-track
Publicspatial_intuitions
Public- Tracing Multilingual Factual Knowledge Acquisition in Pretraining
MEXA
Public🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual AlignmentGlotCC
Public🕸 GlotCC Dataset and Pipline -- NeurIPS 2024code-specific-neurons
Public💻🔍 How Programming Concepts and Neurons Are Shared in Code Language Modelsoscar-io
Publicungoliant
Publicoscar-tools
PublicLangSAMP
PublicLangSAMP: Language-Script Aware Multilingual Pretraininganalogical_reasoning
PublicTransliteration-PPA
PublicBreaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignmentlohoravens-webpage
PublicMaskLID
Public💬 MaskLID: Code-Switching Language Identification through Iterative Masking -- ACL 2024GlotScript
Public🖋 Resource and Tool for Writing System Identification -- LREC 2024Taxi1500
PublicTransMI
PublicTransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated DataTransliCo
PublicTransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language ModelsSpatial_Schemas
PublicXAMPLER
PublicGlot500
PublicGlot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023