Emerging Technology
After Anthropic, Google Plans To Develop AI That Can Control Computers
Updated on Mon, Oct 28, 2024
Now, Google is set to revolutionize the way we interact with our personal computers by developing an artificial intelligence system that can control actions on a web browser.
The innovative project, known as Project Jarvis, aims to automate tasks such as research gathering, online shopping and flight bookings, streamlining users' everyday online activities.
The company plans to showcase this groundbreaking technology in December, coinciding with the supposed launch of its next flagship Gemini large language model (LLM).
So, what is this AI agent all about?
What Is Google's AI Agent Capable Of?
Google's upcoming AI agent is designed to take over web browser operations, potentially handling tasks such as booking flights, online shopping or even organizing research on a topic — tasks that would usually require substantial human input.
The concept behind this AI involves creating a digital assistant that can not only understand natural language commands but also execute meaningful actions autonomously. For example, if a user asks the AI to research a travel destination and book suitable flights, the AI could complete the task without needing further instructions.
This AI development closely follows Anthropic's recent release of their Claude 3.5 model, which has similar capabilities, including performing tasks like moving a cursor, typing text and managing software applications on computers.
Google's efforts seem to be aimed at refining and potentially extending these capabilities for broader use. This presents Google with an opportunity to take on Anthropic in the AI consumer market.
However, Anthropic already seems to be at the forefront of AI-powered agents.
Anthropic's Dominance In AI Agents
Anthropic's new AI model, Claude 3.5, brought the concept of "AI agents" into the spotlight with its ability to handle complete workflows autonomously.
For instance, it can interact with computer screens by interpreting visual information, clicking buttons and even filling out forms — all actions that typically need human attention. These features represent a step beyond the AI chatbots we're familiar with, which have traditionally been used for simple conversational or search-related tasks.
Claude 3.5 introduced the "Computer Use" feature allowing developers to integrate its capabilities into existing systems via an API. This model, described by Anthropic as being at an experimental stage is only available to developers.
However, it has set a precedent in AI agent's evolving role in managing digital interactions. Google's response seems to be a clear indication that the tech giant does not intend to fall behind in this space.
What Is Google's Ambition?
Google's AI agent reportedly leverages its advancements in large language models (LLMs) and integrates them with capabilities to perform end-to-end actions on a user's behalf. By merging the intelligence of natural language processing with autonomous interaction capabilities, Google seeks to position this technology as a game-changer for productivity and automation.
Sundar Pichai, the CEO of Google, mentioned at Google’s I/O 2024 event that universal AI agents, such as Google's upcoming Project Astra, will bring significant advancements to productivity by combining multimodal capabilities with the ability to complete end-to-end tasks autonomously. Pichai stated, " Still, we are in the early days of the AI platform shift. We see so much opportunity ahead, for creators, for developers, for startups, for everyone. Helping to drive those opportunities is what our Gemini era is all about."
Surprisingly, Google has previously partnered with Anthropic and has funded the AI startup's initial development efforts. The due is now poised to develop its own versions of AI agents to challenge the leader, OpenAI, in the emerging autonomous AI landscape.
What Does It Mean For The Future?
The integration of AI into daily tasks is becoming increasingly prevalent. With Google's Project Jarvis and Anthropic’s Computer Use, the tech giants want to enhance user experiences by allowing AI agents to handle mundane tasks/
This will free up time for users to focus on more important activities and engage in creative, strategic thinking. As companies race to develop similar technologies, we may see a shift in AI market’s dynamics.
With AI becoming a standard feature in web browsers, can Google's new AI agent revolutionize productivity and redefine human-computer interaction?
Let us know your thoughts in the comments below!
First published on Mon, Oct 28, 2024
Liked what you read? That’s only the tip of the tech iceberg!
Explore our vast collection of tech articles including introductory guides, product reviews, trends and more, stay up to date with the latest news, relish thought-provoking interviews and the hottest AI blogs, and tickle your funny bone with hilarious tech memes!
Plus, get access to branded insights from industry-leading global brands through informative white papers, engaging case studies, in-depth reports, enlightening videos and exciting events and webinars.
Dive into TechDogs' treasure trove today and Know Your World of technology like never before!
Disclaimer - Reference to any specific product, software or entity does not constitute an endorsement or recommendation by TechDogs nor should any data or content published be relied upon. The views expressed by TechDogs' members and guests are their own and their appearance on our site does not imply an endorsement of them or any entity they represent. Views and opinions expressed by TechDogs' Authors are those of the Authors and do not necessarily reflect the view of TechDogs or any of its officials. While we aim to provide valuable and helpful information, some content on TechDogs' site may not have been thoroughly reviewed for every detail or aspect. We encourage users to verify any information independently where necessary.
Trending TD NewsDesk
CES 2026 Updates: Intel, Atlas, Smart Bricks, And More
Intel Launches Next-Generation PC Chip at CES 2026
CES 2026 Is Here: Latest Reveals From Samsung, LG, And Plaud!
AWS re:Invent 2025: Amazon & Google Bring Multicloud Service For Faster Connectivity
Grok Is Under Fire As France And India Complain About Sexualized Deepfake Images
Join Our Newsletter
Get weekly news, engaging articles, and career tips-all free!
By subscribing to our newsletter, you're cool with our terms and conditions and agree to our Privacy Policy.
Join The Discussion