instruction-following
1 article tagged with instruction-following
April 24, 2026
model releaseOpenAI
OpenAI GPT-5.5 scores 93/100 in benchmark test, loses points for ignoring instructions
OpenAI's GPT-5.5 scored 93 out of 100 points in a 10-round benchmark test covering summarization, reasoning, coding, and creative tasks. The model lost points primarily for ignoring specific instructions, such as using unauthorized sources when asked to summarize from a single news outlet.