Trending

OpenAI to Integrate Shopify Seamlessly into ChatGPT for In‑Chat Shopping

Red Hat Bets on Open SLMs and Inference Optimization for Responsible, Enterprise‑Ready AI

OpenAI’s o3 and o4‑mini–Reasoning Models Exhibit Increased Hallucination

Table of Contents

Amazon Nova Act: An AI Agent That Can Control a Web Browser

Read Time: 2 minutes

Table of Contents

Amazon introduces Nova Act, an AI agent capable of independently navigating web browsers and completing tasks. With the Nova Act SDK, developers can build custom AI applications, taking on competitors like OpenAI and Anthropic.

Amazon has introduced Nova Act, a general-purpose AI agent capable of controlling a web browser and independently performing simple actions. Alongside Nova Act, Amazon is launching the Nova Act SDK, a developer toolkit for building agent prototypes. While currently available as a research preview, this AI-powered agent will also form the backbone of the upcoming Alexa+ upgrade, enhancing Amazon’s voice assistant with generative AI capabilities.

Competing with AI Leaders

Nova Act positions Amazon in direct competition with AI agents from companies like OpenAI and Anthropic. These agents are designed to assist users by navigating the web and completing various online tasks. Amazon asserts that Nova Act has outperformed its competitors in internal tests. For instance, on the ScreenSpot Web Text benchmark, Nova Act scored 94%, surpassing OpenAI’s CUA at 88% and Anthropic’s Claude 3.7 Sonnet at 90%.

Practical Applications and Developer Access

With Nova Act, developers can build AI agents capable of performing actions like ordering food, booking reservations, or filling out forms. The Nova Act SDK provides customizable tools that developers can use to define the agent’s workflows, including when human intervention may be necessary. Developers can access the SDK via the newly launched Nova website, nova.amazon.com, which also serves as a showcase for Amazon’s Nova foundation models.

AGI Vision and Leadership

Amazon’s San Francisco-based AGI lab, co-led by former OpenAI researchers David Luan and Pieter Abbeel, spearheaded the development of Nova Act. Both Luan and Abbeel have extensive AI backgrounds, with Luan previously founding Adept and Abbeel co-founding Covariant. According to Luan, AI agents like Nova Act are crucial in achieving artificial general intelligence (AGI) — AI systems capable of performing any task a human can do on a computer.

Challenges and Expectations

The AI agent landscape remains competitive, and Nova Act will face scrutiny over its reliability and effectiveness. Early reports suggest that previous AI agents from OpenAI and Anthropic struggled with autonomy, often making errors and requiring human intervention. With the upcoming release of Alexa+, Amazon aims to demonstrate whether Nova Act has solved these challenges.

community

Get Instant Domain Overview
Discover your competitors‘ strengths and leverage them to achieve your own success