OpenAI is reportedly planning to release artificial intelligence (AI) agents that can conduct tasks on computer systems. According to a report, the company is working on several agent-related research projects, one of which is called an “Operator” that can execute multi-step actions on a computer. The AI agents are said to be released in January 2025 as a research preview for developers. The company reportedly plans to make its AI agents accessible through a native application programming interface (API), which developers can use to create software and apps.
OpenAI’s AI Agent
AI agents have become a recent trend in the AI field. These are small AI models that have a limited but specialized knowledge base and can use specific software to execute tasks like mimicking keystrokes, button clicks, etc. Due to the specialized nature of the models, they can complete tasks with accuracy and speed.
According to a Bloomberg report, OpenAI has developed a new AI agent called Operator that can complete tasks on computers. Citing people familiar with the matter, the publication claimed that users will be able to tell the AI agent to perform complex tasks like writing code or booking a ticket, and he will be able to execute them.
On Wednesday, OpenAI officials reportedly revealed plans to release the tool as a research preview in January 2025. It is being said that the company will create a new API for developers through which developers will get access to it.
Notably, OpenAI is reportedly working on several agent-related research projects, which are nearing completion. It is said that such an agent is capable of executing tasks in a web browser. Details about other projects are not known at this time.
OpenAI CEO Sam Altman mentioned AI agents as a focus of the company during a question-and-answer session on Reddit earlier this month. Responding to a user, he said, “We will have better and better models.” But I think the next big breakthrough that will be realized will be agents.”
OpenAI’s competitor Anthropic released native AI agents last month. Dubbed Computer Access, these agents can understand and interact with computers, essentially allowing them to take control and complete tasks on the PC. These agents are built on the upgraded version of Cloud 3.5 Sonnet.