logo
#

Latest news with #LeslieD'Monte

Mint Explainer: Is OpenAI exaggerating the powers of its new ChatGPT Agent?
Mint Explainer: Is OpenAI exaggerating the powers of its new ChatGPT Agent?

Mint

time2 days ago

  • Business
  • Mint

Mint Explainer: Is OpenAI exaggerating the powers of its new ChatGPT Agent?

Leslie D'Monte OpenAI has flagged the agent as high-risk under its safety framework. Is this just marketing hype or a sign that AI is genuinely becoming more powerful and autonomous? OpenAI CEO Sam Altman. Photo AFP Gift this article On Thursday, OpenAI launched its autonomous ChatGPT Agent, a tool that's capable of finding and buying things online, managing your calendar, and booking you an appointment with a doctor. It's essentially a digital assistant that doesn't just provide information but complete actual tasks. On Thursday, OpenAI launched its autonomous ChatGPT Agent, a tool that's capable of finding and buying things online, managing your calendar, and booking you an appointment with a doctor. It's essentially a digital assistant that doesn't just provide information but complete actual tasks. That being said, OpenAI has flagged the agent as high-risk under its safety framework, warning it could potentially be used to create dangerous biological or chemical substances. Is this just marketing hype, timed to build momentum for the launch of GPT-5, or a sign that AI agents are genuinely becoming more powerful and autonomous, akin to the agents who protect the computer-generated world of The Matrix? What is ChatGPT Agent? Say you want to rearrange your calendar, find a doctor and schedule an appointment, or research competitors and deliver a report. ChatGPT Agent can now do it for you. Also Read | Deep research with AI is days' worth of work in minutes The agent can browse websites, run code, analyse data, and even create slide decks or spreadsheets—all based on your instructions. It combines the strengths of OpenAI's earlier tools—operator (which could navigate the web) and deep research (which could analyse and summarise information)—into a single system. You stay in control throughout: ChatGPT asks for permission before doing anything important, and you can stop or take over at any time. This new capability is available to Pro, Plus, and Team users through the tools dropdown. How does it work? ChatGPT Auses a powerful set of tools to complete tasks, including a visual browser to interact with websites like a human, a text-based browser for reasoning-heavy searches, a terminal for code execution, and direct application programming interface (API) access. It can also connect to apps such as Gmail or GitHub to fetch relevant information. You can log in to websites within the agent's browser, allowing it to dig deeper into personalised content. All of this runs on its own virtual computer, which keeps track of context even across multiple tools. The agent can switch between browsers, download and edit files, and adapt its methods to complete tasks quickly and accurately. It's built for back-and-forth collaboration—you can step in anytime to guide or change the task, and ChatGPT can ask for more input when needed. If a task takes time, you'll get updates and a notification on your phone once it's done. Has OpenAI tested its performance? OpenAI said on Humanity's Last Exam (HLE), which tests expert-level reasoning across subjects, ChatGPT Agent achieved a new high score of 41.6, rising to 44.4 when multiple attempts were run in parallel and the most confident response was selected. On FrontierMath, the toughest known math benchmark, the agent scored 27.4% using tools such as a code-executing terminal—far ahead of previous models. In real-world tasks, ChatGPT agent performs at or above human levels in about half of the cases, based on OpenAI's internal evaluations. These tasks include building financial models, analysing competitors, and identifying suitable sites for green hydrogen projects. ChatGPT Agent also outperforms others on specialised tests such as DSBench for data science, and the SpreadsheetBench for spreadsheet editing (45.5% vs Copilot Excel's 20.0%). On BrowseComp and WebArena, which test browsing skills, the agent achieves the highest scores to date, according to OpenAI. What are some of the things it can do? Consider the case of travel planning. The agent won't just suggest ideas but navigate booking websites, fill out forms, and even make reservations one you give it permission. You can also ask it to read your emails, find meeting invitations, and automatically schedule appointments in your calendar, or even draft and send follow-up emails. This level of coordination typically required juggling between apps, but the agent manages it in a single conversational flow. Another example involves shopping and price comparison. You can tell the agent to 'order the best-reviewed smartphone under ₹ 15,000", and it can search online stores, compare prices and reviews, and proceed to checkout on a preferred platform. Customer support and task automation are other examples, where the agent is used to troubleshoot an issue, log into support portals, and even file return or refund requests. How are AI agents typically built? Unlike basic chat bots, AI agents are autonomous systems that can plan, reason, and complete complex, multi-step tasks with minimal input—such as coding, data analysis, or generating reports. They are built by combining ways to take in information, think, and take action. Developers begin by deciding what the agent should do, following which the agent collects data like such as or images from its environment. AI agents use large language models (LLMs) like GPT-4 as their core 'brain", which allows them to understand and respond to natural language instructions. To allow AI agents to take action, developers connect the LLM to things like a web browser, code editor, calculator, and APIs for services such as Gmail or Slack. Frameworks like LangChain help integrate these parts, and keep track of information. Some AI agents learn from experience and get better over time. Testing and careful setup make sure they work well and follow rules. Does ChatGPT Agent have credible competition? Google's Project Astra, part of its Gemini AI line, is developing a multimodal assistant that can see, hear, and respond in real time. Gemini CLI is an open-source AI agent that brings Google's Gemini model directly to the terminal for fast, lightweight access. It integrates with Gemini Code Assist, offering developers on all plans AI-powered coding in both VS Code and the command line. Microsoft is embedding Copilot into Windows, Office, and Teams, giving its agent access to workflows, system controls, and productivity tools, soon enhanced by a dedicated Copilot Runtime. Meta is building more socially focused agents within messaging and the metaverse, which could evolve into utility tools. Apple is revamping Siri through Apple Intelligence, combining GPT-level reasoning with strict privacy features and deep on-device integration. Other smart agents include Oracle's Miracle Agent, IBM's Watson tools, Agentforce from Salesforce Anthropic's Claude 3.5, and Perplexity AI's action-oriented agents through its Comet project, blending search with agentic behaviour. The competitive advantage, though, may go to companies that can integrate these AI agents into everyday applications and call for action with a single, unified tool – a task that ChatGPT Agent has demonstrated. Why did OpenAI warn that ChatGPT Agent could be used to trigger biological warfare? OpenAI claimed ChatGPT Agent's superior capabilities could, in theory, be misused to help someone create dangerous biological or chemical substances. However, it clarified that there was no solid evidence it could actually do so. Regardless, OpenAI is activating the highest level of safety measures under its internal 'preparedness framework'. These include thorough threat modeling to anticipate potential misuse, special training to ensure the model refuses harmful requests, and constant monitoring using automated systems that watch for risky behaviour. There are also clear procedures in place for suspicious activity. Should we take this risk seriously? Ja-Nae Duane, AI expert and MIT Research Fellow and co-author of SuperShifts, said the more autonomous the agent, the more permissions and access rights it would require. For example, buying a dress requires wallet access; scheduling an event requires calendar and contact list access. 'While standard ChatGPT already presents privacy risks, the risks from ChatGPT Agent are exponentially higher because people will be granting it access rights to external tools containing personal information (like calendar, email, wallet, and more). There's a significant gap between the pace of AI development and AI literacy; many people haven't even fully understood ChatGPT's existing privacy risks, and now they're being introduced to a feature with exponentially more risks," he said. Also Read | Google's Veo 3 brings the era of video on command Duane added that the key risks included data leaks, mistaken actions, prompt injection, and account compromise, especially when handling sensitive information. Malicious actors, he warned, could exploit them by manipulating inputs, abusing tool access, stealing credentials, or poisoning data to bias outputs. Poor third-party integration and an over-reliance of them could worsen the impact, while the agent's 'black box" nature would make it hard to trace errors, he added. In the wrong hands, these agents could be weaponised for fraud, phishing, or even to generate malware. What are the other concern areas for enterprises? Developers are increasingly deploying AI agents across IT, customer service, and enterprise workflows. According to Nasscom, 46% of Indian firms are experimenting with these agents, particularly in IT, HR, and finance, while manufacturing leads in robotics, quality control, and automation. Beyond concerns around hallucinations, security, privacy, and copyright or intellectual property (IP) violations, a key challenge for businesses is ensuring a return on investment. Gartner noted that many so-called agentic use cases could be handled by simpler tools and predicted that more than 40% of such projects would be scrapped by 2027 over high costs, unclear value, or inadequate risk controls. Of the thousands of vendors in this space, only around 130 are seen as credible; many engage in 'agent washing" by repackaging chatbots, robotic process automation (RPA), or basic assistants as autonomous agents. Nasscom corroborated these concerns, highlighting that 62% of enterprises were still only testing agents in-house. Why is 'humans-in-the-loop' a must? OpenAI CEO Sam Altman advised granting agents only the minimum access needed for each task, not blanket permissions. Nasscom believes that to scale responsibly, enterprises must prioritise human-AI collaboration, trust, and data readiness. It has recommended firms adopt AI agents with a 'human-in-the-loop" approach, reflecting the need for oversight and contextual judgment. According to Duane, users must understand both the tool's strengths and its limits, especially when handling sensitive data. Caution is key, as misuse could have serious consequences. She also emphasised the importance of AI literacy, noting that AI was evolving far faster than most people's understanding of how to use it responsibly. Also Read | Mint Primer: Are firms wasting their money on AI agents? Topics You May Be Interested In

Are firms wasting their money on AI agents?
Are firms wasting their money on AI agents?

Mint

time07-07-2025

  • Business
  • Mint

Are firms wasting their money on AI agents?

Next Story Leslie D'Monte Most firms are still trying out AI agents, hailed by Big Tech as game changers. Analysts warn that many of these may be scrapped within two years as they hide high costs, uncertain returns, and weak risk controls. Can firms unlock value through human-AI agent teamwork? Google and Microsoft recently called AI agents the next big shift, unveiling Project Mariner, Gemini tools, Azure AI Foundry and NLWeb. Gift this article Why is Big Tech bullish on AI agents? Unlike basic chatbots, AI agents are autonomous systems that can plan, reason and complete complex multi-step tasks with minimal input—such as coding, data analysis and generating reports. Developers use them across IT, customer support, and enterprise workflows. Google and Microsoft recently called AI agents the next big shift, unveiling Project Mariner, Gemini tools, Azure AI Foundry and NLWeb. Salesforce, Amazon, IBM, and Meta are also building these platforms to automate workflows and enhance productivity. Nasscom has said 46% of firms are experimenting with AI agents, mainly in IT. Also read | Global poverty: How to deal with funding cuts What should firms be wary of? Many so-called agentic use cases today can be done with simpler tools, says Gartner, which predicts over 40% of such projects will be scrapped by 2027 due to high costs, vague value or weak risk controls. It adds that of thousands of vendors, only about 130 are seen as credible; and many engage in 'agent washing"—rebranding chatbots, robotic process automation (RPA), or assistants as agents without real autonomy. Most current systems also lack the maturity to deliver complex outcomes or sustained return on investment (ROI). Nasscom echoes this, noting 62% of enterprises are only doing in-house agent testing. Also read | How India's $12 bn R&D push could reshape its tech future How big is the market for AI agents? The global AI agents market, valued at $5.4 billion in 2024, is pegged to touch $50.3 billion by 2030, per Grand View Research. North America led with 40.1% revenue share, while Asia-Pacific is the fastest-growing region. Machine learning, single-agent systems, and ready-to-deploy agents dominated through tech, system type, and deployment model, respectively. Also read | Shades of grey: Inside the world of pre-IPO trading How can companies unlock real value? AI agent adoption is hindered by privacy concerns, regulation gaps, and limited focus on talent readiness. Integration with legacy enterprise software remains complex and costly. Regardless, Gartner predicts 15% of daily work decisions will be made autonomously by agentic AI by 2028, up from 0% in 2024. It also urges agentic AI-use only when the returns are clear. Nasscom sees strong potential in real-time decision-making and agility. Key focus areas include data governance and AI risk protocols. Also read | China's economy beats the gloom. Can it do more? Can humans make AI agents more effective? Though AI agents are designed to be autonomous, Nasscom's recent study of over 100 global enterprises reveals how businesses are transitioning from early-stage GenAI applications towards more goal-oriented, human-plus, AI agentic systems. It believes that to scale responsibly, enterprises must prioritize human-AI collaboration, trust and data readiness. Nasscom adds that most (77%) firms adopt AI agents with a 'human-in-the-loop" approach, reflecting the need for oversight and contextual judgment. Also read | Residential sales: Where have all the buyers gone? Topics You May Be Interested In Catch all the Business News, Market News, Breaking News Events and Latest News Updates on Live Mint. Download The Mint News App to get Daily Market Updates.

Are phones that can stream live content without Wi-Fi or internet the next big thing?
Are phones that can stream live content without Wi-Fi or internet the next big thing?

Mint

time29-04-2025

  • Mint

Are phones that can stream live content without Wi-Fi or internet the next big thing?

{{^loggedIn}} {{/loggedIn}} {{#loggedIn}} {{/loggedIn}} Next Story Business News/ Companies / Are phones that can stream live content without Wi-Fi or internet the next big thing? Leslie D'Monte Direct-to-mobile (D2M) technology allows smartphones to stream content directly from satellites or broadcast towers, eliminating the need for mobile data or Wi-Fi. India is pioneering this technology, with commercial D2M-ready phones set to provide access to multimedia content for millions of users. Direct-to-mobile (D2M) technology allows phones to receive broadcast content from satellites or broadcast towers, without relying on mobile data or Wi-Fi. Gift this article When you watch a live Indian Premier League cricket match or a movie on your smartphone, the device uses mobile data or Wi-Fi to stream the content. But if everyone tries to stream the same event through the mobile internet, the network could potentially crash. When you watch a live Indian Premier League cricket match or a movie on your smartphone, the device uses mobile data or Wi-Fi to stream the content. But if everyone tries to stream the same event through the mobile internet, the network could potentially crash. Now, what if your smartphone could stream live TV, emergency alerts or video content directly from satellites or broadcast towers – without relying on mobile data or Wi-Fi – much like how TVs pick up over-the-air signals? That is the promise of direct-to-mobile (D2M) technology, which allows phones to receive broadcast content from satellites or broadcast towers. India is among the first countries to leverage the technology. Last January, the ministry of information and broadcasting said it was planning a pilot test for D2M broadcast transmissions which, as outlined in a 2022 paper by the Indian Institute of Technology (IIT) Kanpur, does not depend on data connectivity. The approach is already in use for emergency government notifications. And on 28 April, Lava International and HMD said they are planning commercial launches of D2M-ready phones for Indian consumers even as trials are under way. HMD is working with IIT Kanpur-incubated FreeStream Technologies, Tejas Networks and Sinclair, while Lava is collaborating with Tejas. Also Read | Trai for interoperable set-top boxes, voluntary sharing of broadcasting infra A D2M-capable phone operates with a special chip that can tune into broadcast signals such as satellite or TV frequencies. The phone has software that decodes the broadcasts and displays the content. The broadcast signals can be transmitted over the 5G Broadcast standard—a next-generation evolution of the LTE broadcast and eMBMS systems. LTE (long-term evolution) broadcast and eMBMS (evolved multimedia broadcast multicast services) technologies allow mobile networks to send the same video, TV, or alert to many phones at once, instead of sending it separately to each user. Smoother, bigger Newer technology such as 5G Broadcast allows even smoother, bigger broadcasts directly to smartphones without needing Wi-Fi or mobile data. However, for voice calls, texting, and internet browsing, the phone will still use regular mobile networks (like 4G/5G) or Wi-Fi, as it currently does. The new HMD devices – low-cost feature phones, dongles, smartphones and tablets – will be powered by Tejas technology, tested extensively over live networks by Prasar Bharati in collaboration with IIT Kanpur and Tejas. Chipsets (SL-3000 from Tejas unit Saankhya Labs) power these devices, alongside a core network platform that delivers targeted ads, emergency alerts, educational content and more, according to Parag Naik, EVP at Tejas Networks. According to Ravi Kunwar, VP and CEO of HMD India and APAC, this will enable the broadcast of a 'vast array of multimedia content" like over-the-top (OTT) content, live TV, video, audio and text messages directly without the need for Wi-Fi or the internet. D2M technology is expected to be lapped up by the 80-90 million 'TV dark homes,' or those that do not have TV sets, across India. That said, the growth of the global D2M technology market is being driven by the increasing use of mobile devices and applications as well as remote work and collaboration, according to market research firm DataHorizzon Research. The firm segments the market into healthcare, e-commerce and marketing, highlighting that telemedicine has emerged as a significant player with healthcare providers delivering services directly to users' mobile devices. Nevertheless, D2M technology is still in its early stages. Companies including Qualcomm, Samsung, and Indian telecommunications giant Jio and the Department of Telecommunications are testing the technology. While Lava's and HMD's dedicated D2M-only phones are yet to hit the market, newer 5G phones are incorporating D2M features, either through chipset upgrades or software updates. Also Read | Ministry of information and broadcasting to pilot D2M broadcast technology These phones are more like D2M-ready devices, which require specific infrastructure such as broadcast towers and spectrum to fully function. D2M-capable phones will likely be priced similarly to standard 5G smartphones. Expect to pay an extra ₹ 1,000-2,000 due to added hardware like dedicated receivers or antennas, if required. China has shown interest in 5G Broadcast while South Korea and Japan have explored 5G Broadcast and similar technologies that may pave the way for D2M-like services. For now, though, India appears to have taken the lead in this space. Topics You May Be Interested In Catch all the Business News , Corporate news , Breaking News Events and Latest News Updates on Live Mint. Download The Mint News App to get Daily Market Updates.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into a world of global content with local flavor? Download Daily8 app today from your preferred app store and start exploring.
app-storeplay-store