Google’s annual developer conference, I/O, took place today with a variety of announcements reiterating the company’s continued focus on artificial intelligence (AI).
From advancements in its Gemini AI model to new developer tools and integrations, Google I/O 2024 brought innovations aimed at helping users and developers get the most out of AI.
Focus on User Experience
Smarter Search
Google Search is adding AI Overviews, which will give users summaries in response to their search queries, while the ability to use Gemini for trip planning is meant to make organizing travel more seamless.
Google plans to utilize generative AI to organize entire search results pages, further simplifying information access.
Google Lens Video Search
Users can now search by recording a video with Google Lens. They can ask questions about the video content while recording, and Google’s AI will attempt to find relevant answers.
Circle to Search Solves Math Problems
Users can now circle math problems on their Android devices and receive step-by-step guidance for solving them, though not complete solutions.
Enhanced Google Photos
“Ask Photos,” powered by Gemini, lets users search their photo library using natural language queries. Instead of relying on manual tagging, this AI-powered search understands photo content and metadata, making it easier to find specific photos.
Project Astra
This multimodal AI assistant is designed to be a do-everything virtual assistant capable of understanding what it sees through your device’s camera, remembering locations of your belongings, and completing tasks on your behalf.
Scam Call Detection
Google is integrating Gemini Nano, its on-device AI model, into a future version of Android. This technology will listen for “conversation patterns commonly associated with scams” in real time, alerting users to potential fraud attempts during calls.
Developer Tools and Integrations
Gemini Everywhere
Developers will have access to Gemini’s capabilities through various platforms. The Places API on Google Maps will allow developers to show AI-generated summaries of locations within their own apps.
Also, Gemini Nano will be embedded directly into Chrome 126, enabling developers to create on-device AI features.
Project IDX Open Beta
This next-generation browser-based development environment now has integrations with Google Maps Platform, Chrome DevTools, and Lighthouse, simplifying app development and debugging.
Firebase Genkit
This new open-source framework simplifies the process of building AI-powered applications in JavaScript/TypeScript. Developers can leverage Genkit for tasks like content generation, summarization, text translation, and image creation.
AI Model Upgrades
Gemini 1.5 Pro
This model can now analyze documents, codebases, videos, and audio recordings up to twice as large as those the initial models could handle, making it better suited to complex tasks.
Gems for Custom Chatbots
This feature allows users to personalize Gemini’s responses and areas of expertise.
Gemini Live
This new experience allows for in-depth voice chats with Gemini. Users can interrupt Gemini, ask clarifying questions, and benefit from its real-time speech adaptation. Gemini Live can also see and respond to a user’s surroundings through photos or videos captured by their smartphone camera. Paid subscribers can access information from PDFs using Gemini.
AI Assistant in Chrome
Chrome on desktop will integrate Gemini Nano, a lightweight version of the AI model, to generate text for social media posts, product reviews, and more directly within the browser.
AI Overviews in Google Search
Google Search will display “AI Overviews” with summarized answers from the web, similar to other AI search tools.
Imagen 3
Google’s latest image generation model comes with improved accuracy in translating text prompts into images and offers more creative and detailed results. Imagen 3 produces fewer artifacts and errors, making it a powerful tool for visual content creation.
Gemma 2
The next generation of Google’s Gemma models will launch with a 27 billion parameter model in June.
Veo Video Generation
This AI model can generate 1080p video clips based on text prompts. Veo can capture different visual styles and even make edits to existing footage, opening doors for innovative video creation.
SynthID Upgrades
Google’s AI watermarking tool, SynthID, will now embed watermarks into Veo-generated videos and detect other AI-generated videos.
Beyond AI
Pixel 8a
The latest addition to the Pixel line has the Tensor G3 chip and starts at $499.
Pixel Tablet
Google’s Pixel Tablet is now available for purchase, both with and without the detachable base.
Google I/O 2024 focused on AI and its integration into user experiences and developer tools, with the aim of enhancing search, content creation, communication, and application development.