synthid

7 articles tagged with synthid

May 27, 2026

model release

Google launches Gemini Omni, multimodal AI video generator with avatar cloning and physics modeling

Google has released Gemini Omni, a multimodal AI video generation tool that accepts text, images, audio, and video as inputs. The first tier, Gemini Omni Flash, includes avatar cloning that creates digital versions of users and incorporates physics modeling for realistic motion.

May 27, 2026 · 1:51 AM

May 20, 2026

model release

Google releases Gemini Omni Flash video generation model with conversational editing, withholds speech synthesis

Google DeepMind released Gemini Omni Flash, the first model in its new Omni family that generates and edits video from image, audio, video, and text inputs. The model is rolling out to Gemini app subscribers and YouTube Shorts with a 10-second clip limit, while speech-editing capabilities remain withheld pending safety testing.

May 20, 2026 · 9:20 AM

May 19, 2026

product updateOpenAI

OpenAI adopts C2PA metadata standard and Google's SynthID watermarking for AI image detection

OpenAI is joining the C2PA open standard and embedding Google DeepMind's invisible SynthID watermark in all AI-generated images from its models. The company is launching a public verification tool that checks for both C2PA metadata and SynthID watermarks, though detection only works for images created by OpenAI's own products.

May 19, 2026 · 11:05 PM

product update

Google adds SynthID AI detection to Search and Chrome, verified 50 million times since launch

Google is integrating its SynthID AI detection technology into Search and Chrome, starting with Lens, AI Mode, and Circle to Search today. According to Google, SynthID verification tools have been used 50 million times globally since the feature was added to Gemini for checking images, video, and audio.

May 19, 2026 · 6:09 PM

April 15, 2026

model releaseGoogle DeepMind

Google DeepMind releases Gemini 3.1 Flash TTS with audio tags for precise speech control across 70+ languages

Google DeepMind launched Gemini 3.1 Flash TTS, a text-to-speech model that achieved an Elo score of 1,211 on the Artificial Analysis TTS leaderboard. The model introduces audio tags that allow developers to control vocal style, pace, and delivery through natural language commands embedded in text input, with support for 70+ languages.

April 15, 2026 · 4:21 PM

March 25, 2026

changelog

Google's Lyria 3 Pro extends AI music generation to 3-minute songs with structural control

Google released Lyria 3 Pro, an updated music generation model capable of creating full 3-minute songs—six times longer than the 30-second limit of its predecessor launched last month. The new version adds granular control over song structure, allowing users to specify intros, verses, choruses, and bridges. It's available now for paid Gemini users, enterprise customers, and developers via API.

March 25, 2026 · 5:35 PM

model release

Google launches Lyria 3 Pro, extending AI music generation to 3-minute tracks

Google announced Lyria 3 Pro, an upgraded music generation model capable of creating tracks up to three minutes long—a tenfold increase from Lyria 3's 30-second maximum. The model adds structural music understanding (intros, verses, choruses, bridges) and rolls out to Gemini app paid subscribers, Google Vids, ProducerAI, and enterprise tools including Vertex AI, the Gemini API, and AI Studio.

March 25, 2026 · 4:50 PM

← Back to all news