rag

4 articles tagged with rag

April 7, 2026
model release, Microsoft

Microsoft open-sources Harrier embedding model with 27B parameters, 131K context window

Microsoft's Bing team has open-sourced Harrier, a 27-billion-parameter embedding model that supports over 100 languages and features a 131,072-token context window. The model ranks first on the MTEB v2 multilingual benchmark, outperforming proprietary offerings from OpenAI and Amazon, and is available on Hugging Face under the MIT license.

April 1, 2026

Google DeepMind identifies six attack categories that can hijack autonomous AI agents

A Google DeepMind paper introduces the first systematic framework for 'AI agent traps': attacks that exploit the vulnerabilities autonomous agents acquire through external tools and internet access. The researchers identify six attack categories targeting perception, reasoning, memory, actions, multi-agent networks, and human supervisors, with proof-of-concept demonstrations for each.

March 27, 2026
model release

Chroma releases Context-1, a 20B-parameter retrieval agent for complex multi-hop search

Chroma has released Context-1, a 20B-parameter Mixture of Experts model trained specifically for retrieval tasks that require multi-hop reasoning. The model decomposes complex queries into subqueries, performs parallel tool calls, and actively prunes its own context mid-search. It achieves performance comparable to frontier models at a fraction of the cost, with inference up to 10x faster.

February 28, 2026
model release

Perplexity open-sources embedding models matching Google and Alibaba with lower memory requirements

Perplexity has open-sourced two text embedding models designed to match or exceed the performance of Google's and Alibaba's embeddings while requiring significantly less memory. The move brings competitive embedding technology into the open-source ecosystem.