Cohere releases tiny-aya-global, multilingual text model covering 100+ languages
Cohere Labs has released tiny-aya-global, a lightweight text generation model trained to support conversational tasks across 100+ languages. The model is available on Hugging Face under a CC-BY-NC-4.0 license and builds on the tiny-aya-base architecture.
Cohere Labs has released tiny-aya-global, a lightweight multilingual text generation model designed for conversational tasks across 100+ languages.
Model Details
The tiny-aya-global model is available on Hugging Face and supports text generation and conversational applications. It's built as a fine-tuned variant of the tiny-aya-base model, optimized for multilingual performance across a broad language spectrum.
The model covers extensive language support spanning:
- Major European languages: English, Dutch, French, Italian, Portuguese, Romanian, Spanish, Czech, Polish, Ukrainian, Russian, Greek, German, Danish, Swedish, Norwegian, Catalan, Galician, Welsh, Irish, Basque, Croatian, Latvian, Lithuanian, Slovak, Slovenian, Estonian, Finnish, Hungarian, Serbian, Bulgarian
- Middle Eastern and South Asian languages: Arabic, Farsi, Urdu, Turkish, Maltese, Hebrew, Hindi, Marathi, Bengali, Gujarati, Punjabi, Tamil, Telugu, Nepali
- Southeast Asian languages: Tagalog, Malay, Indonesian, Javanese, Khmer, Thai, Lao, Burmese
- East Asian languages: Chinese, Japanese, Korean
- African languages: Amharic, Hausa, Igbo, Malagasy, Shona, Swahili, Wolof, Xhosa, Yoruba, Zulu
Availability and Licensing
The model is distributed under the CC-BY-NC-4.0 license and can be accessed directly from Hugging Face. As of release, the model has generated 1,204 downloads and received 56 likes on the platform, indicating early adoption among developers building multilingual applications.
The model uses the transformers library and safetensors format for compatibility with standard ML tooling.
What This Means
Tiny-aya-global fills a gap in accessible multilingual models by providing a lightweight alternative for organizations needing broad language coverage without deploying massive foundation models. The focus on lower-resource languages alongside major ones suggests Cohere's intent to extend conversational AI capabilities beyond English-dominant markets. For teams building applications in non-English regions, the model's breadth of language support reduces the need to manage separate language-specific models or expensive API calls to large closed models.
Related Articles
NVIDIA Releases Nemotron 3.5 Content Safety: 4B-Parameter Multimodal Model with Custom Policy Enforcement and 140-Langua
NVIDIA has released Nemotron 3.5 Content Safety, a 4B-parameter model built on Google Gemma 3 4B IT that provides multimodal safety classification across approximately 140 languages. The model includes a 128K context window, custom enterprise policy enforcement, auditable reasoning traces, and is releasing its training dataset.
NVIDIA Releases Nemotron 3.5 ASR: 600M-Parameter Streaming Speech Model for 40 Languages
NVIDIA released Nemotron 3.5 ASR, a 600M-parameter speech-to-text model supporting 40 language-locales from a single checkpoint. The model achieves 0.07 seconds to final transcript after speech ends and ranks 2nd in latency among streaming ASR models according to Artificial Analysis benchmarks.
Google DeepMind releases Gemma 4 12B Unified: encoder-free multimodal model with 256K context window
Google DeepMind has released Gemma 4 12B Unified, an encoder-free multimodal model that processes text, images, and audio through a single decoder-only transformer. The model features 11.95 billion parameters, a 256K token context window, and achieves 77.2% on MMLU Pro and 72.0% on LiveCodeBench v6.
ByteDance Open-Sources Bernini-R Video Diffusion Model With Semantic Planning Architecture
ByteDance released Bernini-R, an open-source video generation and editing model that combines an MLLM-based semantic planner with a DiT-based renderer. The model requires Hopper-class GPUs (H100/H800/H200) for optimal performance and supports multiple tasks including text-to-video, video editing, and reference-guided generation.
Comments
Loading...