Google announces Project Astra as "AI agent for everyday life"
GG
By
PublishedMay 15, 2024
No headings found
At Google's I/O 2024, Demis Hassabis, CEO of deepmind gave his first ever appearances on stage about Google’s "Universal AI agent for everyday life” or in short ‘Project Astra’. Project Astra would be loaded with features like object recognition made up of AI agents as they call it. And, in this article, we will discuss further about Google’s Project Astra in more detail.
Google's Project Astra Overview
Google's new AI progress, particularly with Gemini models, is a big deal in tech. Gemini 1.5 Flash, which you can try out now, makes those large language models faster and cheaper. Plus, with improvements to Gemini 1.5 Pro, like a bigger context window and better responses, tasks such as translation and coding become even smoother.
What can it do?
The Project Astra can describe things, give you information, and help you in everyday life as well. Through the Project Astra, you can simply show what's in front of you, and it will answer pretty accurately. It can also help you with your assignments or even with your coding.
What Else from the I/O?
Expanding Gemini's Reach
Gemini Nano's evolution to incorporate image understanding brings multimodal capabilities to smartphone applications, underscoring Google's commitment to versatility and accessibility. Additionally, the introduction of Gemma 2 and PaliGemma models, optimized for TPUs and GPUs respectively, showcases Google's dedication to optimizing AI performance across various hardware platforms.
Google Search Gets Smarter
Google's Gemini Nano now understands images, making smartphone apps even smarter. This shows Google's focus on making AI useful for everyone. Also, the new Gemma 2 and PaliGemma models, optimized for TPUs and GPUs (computer chips), show how Google is making AI work better on all sorts of devices.
Veo, Imagen 3 and SynthID
Google unveils Veo, a groundbreaking model that turns text into high-quality 1080p videos. Veo can understand both regular language and cinematic terms, letting users create videos that match their style and vision. Though it's only available to a select few for now, Veo plans to open up to more creators with a waitlist.
Google's latest innovation, Imagen 3, takes text and turns it into images with better natural language skills, promising top-notch image quality. The new SynthID tech now also protects text and video content, ensuring it's authentic and safe, especially in videos made by Veo.
Google Photos introduces Ask Photos, enabling conversational image search and automatic generation of image highlights. Meanwhile, Gemini's integration into Android phones as the default AI assistant brings multimodal support and enhanced accessibility features to users' fingertips.
Google's Project Astra: What’s next?
Google has big plans for Gemini, aiming to make it even better with features like Gemini Live for live voice chats and Gems for personalized AI help. Plus, integrating Gemini into Google Workspace means smarter help with all kinds of work tasks, showing Google's dedication to making AI boost productivity for everyone.
Meanwhile, watch our review of Honor Magic V2
Article Last updated: May 15, 2024
Best Tech Deals
No Active Polls
There are currently no polls available. Check back later for new polls to participate in!
Polls will appear here when available
How did we do with this article?
😍
Loved it!
😕
Needs work
Conversation
We’d love to hear your thoughts! Let's keep it respectful and on-topic. Any inappropriate remarks may be removed. Happy commenting! Privacy Policy
Be the first to share your thoughts-start the conversation!