synthid
7 articles tagged with synthid
Google launches Gemini Omni, multimodal AI video generator with avatar cloning and physics modeling
Google has released Gemini Omni, a multimodal AI video generation tool that accepts text, images, audio, and video as inputs. The first tier, Gemini Omni Flash, includes avatar cloning that creates digital versions of users and incorporates physics modeling for realistic motion.
Google releases Gemini Omni Flash video generation model with conversational editing, withholds speech synthesis
Google DeepMind released Gemini Omni Flash, the first model in its new Omni family that generates and edits video from image, audio, video, and text inputs. The model is rolling out to Gemini app subscribers and YouTube Shorts with a 10-second clip limit, while speech-editing capabilities remain withheld pending safety testing.
OpenAI adopts C2PA metadata standard and Google's SynthID watermarking for AI image detection
OpenAI is joining the C2PA open standard and embedding Google DeepMind's invisible SynthID watermark in all AI-generated images from its models. The company is launching a public verification tool that checks for both C2PA metadata and SynthID watermarks, though detection only works for images created by OpenAI's own products.
Google adds SynthID AI detection to Search and Chrome, verified 50 million times since launch
Google is integrating its SynthID AI detection technology into Search and Chrome, starting with Lens, AI Mode, and Circle to Search today. According to Google, SynthID verification tools have been used 50 million times globally since the feature was added to Gemini for checking images, video, and audio.
Google DeepMind releases Gemini 3.1 Flash TTS with audio tags for precise speech control across 70+ languages
Google DeepMind launched Gemini 3.1 Flash TTS, a text-to-speech model that achieved an Elo score of 1,211 on the Artificial Analysis TTS leaderboard. The model introduces audio tags that allow developers to control vocal style, pace, and delivery through natural language commands embedded in text input, with support for 70+ languages.
Google's Lyria 3 Pro extends AI music generation to 3-minute songs with structural control
Google released Lyria 3 Pro, an updated music generation model capable of creating full 3-minute songs—six times longer than the 30-second limit of its predecessor launched last month. The new version adds granular control over song structure, allowing users to specify intros, verses, choruses, and bridges. It's available now for paid Gemini users, enterprise customers, and developers via API.
Google launches Lyria 3 Pro, extending AI music generation to 3-minute tracks
Google announced Lyria 3 Pro, an upgraded music generation model capable of creating tracks up to three minutes long—a tenfold increase from Lyria 3's 30-second maximum. The model adds structural music understanding (intros, verses, choruses, bridges) and rolls out to Gemini app paid subscribers, Google Vids, ProducerAI, and enterprise tools including Vertex AI, the Gemini API, and AI Studio.