Portugal releases Amália, open-source 9B parameter AI model trained on European Portuguese
Portugal has released Amália, its first national AI model trained specifically for European Portuguese. Built on EuroLLM-9B with 9 billion parameters, the model is fully open-source with weights, datasets, and code published under an open license. The government has committed €5.5m in initial funding through 2027.
Portugal releases Amália, open-source 9B parameter AI model trained on European Portuguese
Portugal has released Amália, its first national AI model trained specifically for European Portuguese, with the model's weights, training data, and source code all published under an open license. The 9 billion parameter model is built on EuroLLM-9B, a European foundation model, and represents a €5.5m initial investment through Portugal's Recovery and Resilience Plan.
Technical specifications
Amália—Automatic Multimodal Language Assistant with Artificial Intelligence—is based on the EuroLLM-9B foundation model. A team of more than 60 researchers and students expanded it with European Portuguese datasets, added a larger context window than the base model, strengthened safety and evaluation systems, and built in multimodal capabilities to handle both images and text.
The test version was completed in September 2025 and presented at the PROPOR conference in Brazil. Funding is secured through the end of 2027.
Deployment model: Infrastructure, not consumer product
Amália is not designed as a consumer-facing chatbot. Instead, it functions as foundational infrastructure that other software can call on. Planned applications include an AI teaching assistant, a virtual guide for Portuguese museums and monuments, a digital assistant for citizen services, and decision-support tools for the Portuguese Navy.
The model is distributed through NOVA University Lisbon, Instituto Superior Técnico, and the universities of Porto, Minho, and Coimbra, coordinated with the Foundation for Science and Technology.
European Portuguese vs Brazilian Portuguese
European Portuguese differs significantly from Brazilian Portuguese in grammar, idiom, and cultural references. Large commercial models, trained predominantly on Brazilian Portuguese, tend to flatten these distinctions. Amália addresses this gap for Portugal's 10 million citizens and public services that require accurate cultural and linguistic register.
The release follows the OpenEuroLLM alliance and broader European efforts to reduce dependence on American and Chinese AI systems for foundational language tasks. It coincides with infrastructure investments including Nscale's €695m data-centre expansion in Portugal with Microsoft.
Open license details
Unlike commercial models accessed through APIs and usage fees, Amália ships with its weights, datasets, and code published under an open license. Governments, universities, and companies can inspect training data, adapt the model, and run it on their own hardware without recurring costs or dependencies on external providers.
This approach enables the auditing and verification that Portugal's government considers necessary for systems integrated into citizen services and military applications.
What this means
Portugal's release of Amália represents a concrete implementation of digital sovereignty—not just stated policy but functioning infrastructure with multi-year funding commitments. The model's value lies in its specificity: accurate European Portuguese for a defined user base, not a scaled-down competitor to GPT-4 or Claude. Success depends on actual adoption by universities, companies, and government departments over the next two years. Publishing the model is the easy part; integration into production systems is where most national AI projects stall. Portugal has structured funding and institutional coordination through 2027, which provides a realistic timeline to determine whether Amália becomes operational infrastructure or remains a well-documented research artifact.
Related Articles
Google launches Gemini 3.1 Flash Lite Image with 4-second generation time, $0.25 per 1M input tokens
Google has released Gemini 3.1 Flash Lite Image, a text-to-image model that generates 1K resolution images in approximately 4 seconds — 2.7× faster than Gemini 3.1 Flash Image. The model is priced at $0.25 per 1M input tokens and $1.50 per 1M output tokens, with a 66K context window and knowledge cutoff of January 2025.
DeepReinforce Releases Ornith-1.0, Open-Source Agentic Coding Model in 9B to 397B Sizes
DeepReinforce has released Ornith-1.0, an MIT-licensed model designed for agentic coding tasks with variants ranging from 9B to 397B parameters. Built on top of Apache 2.0-licensed Gemma 4 and Qwen 3.5 base models, the company claims it achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks.
DeepSeek Releases V4 Models: 1M Context Window, 90% Less KV Cache Than V3
DeepSeek has released two new MoE models: DeepSeek-V4-Pro with 1.6T parameters (49B activated) and DeepSeek-V4-Flash with 284B parameters (13B activated). Both models support a one million token context window and use a hybrid attention architecture that requires only 27% of single-token inference FLOPs and 10% of KV cache compared to DeepSeek-V3.2.
DeepSeek Releases V4-Pro with 1.6T Parameters, 1M Token Context at 27% Inference Cost of V3
DeepSeek has released two Mixture-of-Experts models: V4-Pro with 1.6 trillion parameters (49B activated) and V4-Flash with 284B parameters (13B activated), both supporting 1 million token context windows. V4-Pro requires only 27% of inference FLOPs and 10% of KV cache compared to V3.2 at 1M token context, trained on over 32 trillion tokens.
Comments
Loading...