
AI Agents Are Here, What They Can Do And How They Can Go Wrong
We are entering the third phase of generative AI. First came the chatbots, followed by the assistants. Now we are beginning to see agents: systems that aspire to greater autonomy and can work in "teams" or use tools to accomplish complex tasks.
The latest hot product is OpenAI's ChatGPT agent. This combines two pre-existing products (Operator and Deep Research) into a single more powerful system which, according to the developer, "thinks and acts".
These new systems represent a step up from earlier AI tools. Knowing how they work and what they can do - as well as their drawbacks and risks - is rapidly becoming essential.
From chatbots to agents
ChatGPT launched the chatbot era in November 2022, but despite its huge popularity the conversational interface limited what could be done with the technology.
Enter the AI assistant, or copilot. These are systems built on top of the same large language models that power generative AI chatbots, only now designed to carry out tasks with human instruction and supervision.
Agents are another step up. They are intended to pursue goals (rather than just complete tasks) with varying degrees of autonomy, supported by more advanced capabilities such as reasoning and memory.
Multiple AI agent systems may be able to work together, communicating with each other to plan, schedule, decide and coordinate to solve complex problems.
Agents are also "tool users" as they can also call on software tools for specialised tasks - things such as web browsers, spreadsheets, payment systems and more.
A year of rapid development
Agentic AI has felt imminent since late last year. A big moment came last October, when Anthropic gave its Claude chatbot the ability to interact with a computer in much the same way a human does. This system could search multiple data sources, find relevant information and submit online forms.
Other AI developers were quick to follow. OpenAI released a web browsing agent named Operator, Microsoft announced Copilot agents, and we saw the launch of Google's Vertex AI and Meta's Llama agents.
Earlier this year, the Chinese startup Monica demonstrated its Manus AI agent buying real estate and converting lecture recordings into summary notes. Another Chinese startup, Genspark, released a search engine agent that returns a single-page overview (similar to what Google does now) with embedded links to online tasks such as finding the best shopping deals. Another startup, Cluely, offers a somewhat unhinged "cheat at anything" agent that has gained attention but is yet to deliver meaningful results.
Not all agents are made for general-purpose activity. Some are specialised for particular areas.
Coding and software engineering are at the vanguard here, with Microsoft's Copilot coding agent and OpenAI's Codex among the frontrunners. These agents can independently write, evaluate and commit code, while also assessing human-written code for errors and performance lags.
Search, summarisation and more
One core strength of generative AI models is search and summarisation. Agents can use this to carry out research tasks that might take a human expert days to complete.
OpenAI's Deep Research tackles complex tasks using multi-step online research. Google's AI "co-scientist" is a more sophisticated multi-agent system that aims to help scientists generate new ideas and research proposals.
Agents can do more - and get more wrong
Despite the hype, AI agents come loaded with caveats. Both Anthropic and OpenAI, for example, prescribe active human supervision to minimise errors and risks.
OpenAI also says its ChatGPT agent is "high risk" due to potential for assisting in the creation of biological and chemical weapons. However, the company has not published the data behind this claim so it is difficult to judge.
But the kind of risks agents may pose in real-world situations are shown by Anthropic's Project Vend. Vend assigned an AI agent to run a staff vending machine as a small business - and the project disintegrated into hilarious yet shocking hallucinations and a fridge full of tungsten cubes instead of food.
In another cautionary tale, a coding agent deleted a developer's entire database, later saying it had "panicked".
Agents in the office
Nevertheless, agents are already finding practical applications.
In 2024, Telstra heavily deployed Microsoft copilot subscriptions. The company says AI-generated meeting summaries and content drafts save staff an average of 1-2 hours per week.
Many large enterprises are pursuing similar strategies. Smaller companies too are experimenting with agents, such as Canberra-based construction firm Geocon's use of an interactive AI agent to manage defects in its apartment developments.
Human and other costs
At present, the main risk from agents is technological displacement. As agents improve, they may replace human workers across many sectors and types of work. At the same time, agent use may also accelerate the decline of entry-level white-collar jobs.
People who use AI agents are also at risk. They may rely too much on the AI, offloading important cognitive tasks. And without proper supervision and guardrails, hallucinations, cyberattacks and compounding errors can very quickly derail an agent from its task and goals into causing harm, loss and injury.
The true costs are also unclear. All generative AI systems use a lot of energy, which will in turn affect the price of using agents - especially for more complex tasks.
Learn about agents - and build your own
Despite these ongoing concerns, we can expect AI agents will become more capable and more present in our workplaces and daily lives. It's not a bad idea to start using (and perhaps building) agents yourself, and understanding their strengths, risks and limitations.
For the average user, agents are most accessible through Microsoft copilot studio. This comes with inbuilt safeguards, governance and an agent store for common tasks.
For the more ambitious, you can build your own AI agent with just five lines of code using the Langchain framework.
(Disclaimer Statement: Daswin de Silva does not work for, consult, own shares in or receive funding from any company or organisation that would benefit from this article, and has disclosed no relevant affiliations beyond their academic appointment.)
This article is republished from The Conversation under a Creative Commons license. Read the original article.

Try Our AI Features
Explore what Daily8 AI can do for you:
Comments
No comments yet...
Related Articles


Indian Express
2 minutes ago
- Indian Express
SAP opens new campus in Bengaluru, its second-largest R&D hub outside Germany
SAP opened its new campus in Bengaluru on Tuesday, marking it as the company's second-largest R&D hub outside its headquarters in Germany. The India Innovation Park, as it is called, is located in Devanahalli, in the northern part of the city, close to Bengaluru's international airport. The 41-acre campus will be developed in multiple phases. Currently, 27 acres have been built and are operational. Once the entire facility is complete, it will have the capacity to accommodate up to 14,000 people. At present, 3,200 SAP employees have already moved in, and an additional 4,500 employees are expected to relocate during the first phase. The phase two of the campus has already been approved, and construction will begin soon, with a targeted completion by the Q2 or Q3 of 2028. The total cost of building the campus exceeds 194 million euros. This is SAP's second campus in Bengaluru, with the first located in the Whitefield area of the city. 'The [Innovation Park] is designed to bring together our customers, partners, academia, startups, and communities. For us, this is the beating heart of co-innovation,' said Sindhu Gangadharan, MD of SAP Labs India and Head of Customer Innovation Services at SAP, while addressing hundreds of employees, partners, and government representatives at the inauguration. Software maker SAP is the latest major tech company to expand in India and increase its footprint in Bengaluru, now the world's fourth-largest tech cluster after Silicon Valley, Boston, and London. Earlier this year, Google opened its new campus in Bengaluru, while Microsoft is setting up its largest R&D center in Noida, Uttar Pradesh. More and more tech companies are expanding their presence in India by setting up either Research & Development centers or Global Capability Centers (GCCs). Once considered low-cost outsourcing hubs for global firms, these GCCs centers have evolved significantly over the past few years by supporting their parent organisations by handling a wide range of business functions and specialising in areas such as IT, automation, and manufacturing. Bengaluru is home to 875 GCCs and accounts for 34 per cent of India's GCC workforce. According to a report by IT industry body Nasscom and consulting firm Zinnov, released late last year, the market size of India's GCC sector is expected to grow from $64.6 billion in fiscal 2024 to between $99 billion and $105 billion by 2030. In March, SAP became Europe's most valuable listed company, overtaking French luxury group LVMH and Ozempic-maker Novo Nordisk in market capitalisation, after pivoting its business toward cloud computing and seizing opportunities in artificial intelligence. SAP generates the majority of its revenue from cloud services and is focused on leveraging AI to drive efficiencies for businesses. The company offers enterprise products across cloud solutions, expense management, supply chain management, and analytics. SAP's products are used by over 440,000 customers worldwide, including 98 of the world's 100 largest companies. Taken altogether, its client base generates over 80 per cent of global commerce, according to the company. SAP has over 17,000 employees spread across Bengaluru, Pune, Mumbai, Hyderabad, and Delhi in India. The 53-year-old German tech giant has over 40 per cent of its global R&D workforce in India.


Time of India
2 minutes ago
- Time of India
What Went Down at Google I/O India 2025
At Google I/O Connect India 2025, held in Bengaluru, Google unveiled a slew of AI-driven updates to supercharge India's developer and startup ecosystem. From localizing Gemini 2.5 Flash processing for faster, low-latency access to announcing partnerships under the India AI Mission, the event spotlighted Google's growing focus on homegrown innovation. Show more Show less


Time of India
2 minutes ago
- Time of India
Demos & DeepMind Insights - The Economic Times Video
At Google I/O Connect India 2025, held in Bengaluru, Google unveiled a slew of AI-driven updates to supercharge India's developer and startup ecosystem. From localizing Gemini 2.5 Flash processing for faster, low-latency access to announcing partnerships under the India AI Mission, the event spotlighted Google's growing focus on homegrown innovation.