Cohere releases tiny-aya-global, multilingual text model covering 100+ languages
Cohere Labs has released tiny-aya-global, a lightweight text generation model trained to support conversational tasks across 100+ languages. The model is available on Hugging Face under a CC-BY-NC-4.0 license and builds on the tiny-aya-base architecture.
Cohere Labs has released tiny-aya-global, a lightweight multilingual text generation model designed for conversational tasks across 100+ languages.
Model Details
The tiny-aya-global model is available on Hugging Face and supports text generation and conversational applications. It's built as a fine-tuned variant of the tiny-aya-base model, optimized for multilingual performance across a broad language spectrum.
The model covers extensive language support spanning:
- Major European languages: English, Dutch, French, Italian, Portuguese, Romanian, Spanish, Czech, Polish, Ukrainian, Russian, Greek, German, Danish, Swedish, Norwegian, Catalan, Galician, Welsh, Irish, Basque, Croatian, Latvian, Lithuanian, Slovak, Slovenian, Estonian, Finnish, Hungarian, Serbian, Bulgarian
- Middle Eastern and South Asian languages: Arabic, Farsi, Urdu, Turkish, Maltese, Hebrew, Hindi, Marathi, Bengali, Gujarati, Punjabi, Tamil, Telugu, Nepali
- Southeast Asian languages: Tagalog, Malay, Indonesian, Javanese, Khmer, Thai, Lao, Burmese
- East Asian languages: Chinese, Japanese, Korean
- African languages: Amharic, Hausa, Igbo, Malagasy, Shona, Swahili, Wolof, Xhosa, Yoruba, Zulu
Availability and Licensing
The model is distributed under the CC-BY-NC-4.0 license and can be accessed directly from Hugging Face. As of release, the model has generated 1,204 downloads and received 56 likes on the platform, indicating early adoption among developers building multilingual applications.
The model uses the transformers library and safetensors format for compatibility with standard ML tooling.
What This Means
Tiny-aya-global fills a gap in accessible multilingual models by providing a lightweight alternative for organizations needing broad language coverage without deploying massive foundation models. The focus on lower-resource languages alongside major ones suggests Cohere's intent to extend conversational AI capabilities beyond English-dominant markets. For teams building applications in non-English regions, the model's breadth of language support reduces the need to manage separate language-specific models or expensive API calls to large closed models.
Related Articles
NVIDIA Releases GR00T N1.7, 3B-Parameter Open-Source Humanoid Robot Model Trained on 20,854 Hours of Human Video
NVIDIA released GR00T N1.7, a 3-billion parameter open-source Vision-Language-Action model for humanoid robots with commercial licensing. The model was trained on 20,854 hours of human egocentric video data and demonstrates the first documented scaling law for robot dexterity, where increasing human video data from 1,000 to 20,000 hours more than doubles task completion rates.
Tencent Releases HY-World 2.0: Open-Source Multi-Modal Model Generates 3D Worlds from Text and Images
Tencent has released HY-World 2.0, an open-source multi-modal world model that generates navigable 3D environments from text prompts, single images, multi-view images, or video. The model produces editable 3D assets including meshes and 3D Gaussian Splattings that can be directly imported into game engines like Unity and Unreal Engine.
OpenAI Releases GPT-5.4 Image 2 with 272K Context Window and Image Generation
OpenAI has released GPT-5.4 Image 2, combining the GPT-5.4 reasoning model with image generation capabilities. The multimodal model features a 272K token context window and is priced at $8 per million input tokens and $15 per million output tokens.
OpenAI releases ChatGPT Images 2.0 with 3840x2160 resolution at $30 per 1M output tokens
OpenAI released ChatGPT Images 2.0, pricing output tokens at $30 per million with maximum resolution of 3840x2160 pixels. CEO Sam Altman claims the improvement from gpt-image-1 to gpt-image-2 equals the jump from GPT-3 to GPT-5.
Comments
Loading...