Olmo
1 article tagged with Olmo
June 25, 2026
researchAi2
AI2 Research: Hybrid Models Excel at Content Words, Transformers Better at Token Repetition
Allen Institute for AI researchers conducted token-level analysis comparing their 7B-parameter Olmo 3 transformer and Olmo Hybrid models. The study finds hybrid architectures show a loss gap advantage of 0.04 on content words (nouns, verbs, adjectives) versus 0.02 on function words, while transformers match or exceed hybrids on repeated tokens and closing braces.