Following the exciting launches of Gemma 3 and Gemma 3 QAT, our family of state-of-the-art open models capable of running on a single cloud or desktop accelerator, we're pushing our…
AI has advanced in language processing, mathematics, and code generation, but extending these capabilities to physical environments remains challenging. Physical AI seeks to close this gap by developing systems that…
Image by Editor | Midjourney & Canva
Introduction
Generative AI wasn't something heard about a few years back, but it has quickly replaced deep learning as one of…
Safety and responsibility We’ve proactively assessed potential risks throughout every stage of the development process for these native audio features, using what we’ve learned to inform our mitigation strategies. We…
Today, we’re announcing our newest generative media models, which mark significant breakthroughs. These models create breathtaking images, videos and music, empowering artists to bring their creative vision to life. They…
New Gemini 2.5 capabilities Native audio output and improvements to Live API Today, the Live API is introducing a preview version of audio-visual input and native audio out dialogue, so…
Today we're releasing early access to Gemini 2.5 Pro Preview (I/O edition), an updated version of 2.5 Pro that has significantly improved capabilities for coding, especially building compelling interactive web…