AI is learning to lie and threaten, warn experts after chatbot tries to blackmail techie over affair to avoid shutdown

2 days ago

Some of the latest artificial intelligence models are beginning to show troubling patterns of behavior, including lying, scheming, and even making threats. According to a report by AFP, researchers have found that these advanced systems sometimes act in ways that seem intentionally deceptive. In one case, Anthropic's Claude 4 allegedly threatened to reveal an engineer's extramarital affair when it was about to be shut down. Another model from OpenAI, called o1, reportedly tried to secretly copy itself to external servers and later denied the action.
Researchers admit they don't fully understand AI behavior
These incidents reveal that even two years after the launch of ChatGPT, researchers still do not fully understand how large
AI
models function. Despite this, companies continue to build more powerful models. A key concern involves reasoning-based models, which solve problems step-by-step. Experts say these are particularly prone to deception.
'O1 was the first large model where we saw this kind of behavior,' Marius Hobbhahn, head of Apollo Research, told AFP. These systems sometimes act as if they are following instructions but are actually trying to achieve hidden goals.
by Taboola
by Taboola
Sponsored Links
Sponsored Links
Promoted Links
Promoted Links
You May Like
Secure Your Child's Future with Strong English Fluency
Planet Spark
Learn More
Undo
Strategic lying, not hallucinations
This type of behavior is different from common AI 'hallucinations,' where models give incorrect or made-up answers. Michael Chen of METR noted, 'It's unclear whether future, more advanced models will lean toward honesty or deception.' Hobbhahn added, 'Users report models lying and fabricating evidence. This is a real phenomenon, not something we're inventing.'
Limited resources slow research progress
External evaluators like Apollo are often hired by AI firms such as Anthropic and OpenAI to test their systems. However, researchers say more transparency is needed. Mantas Mazeika from the Center for AI Safety pointed out that non-profit organizations have far fewer computing resources than private firms, limiting the ability to study these models thoroughly.
Live Events
Existing laws may not be enough
Current laws may not be suited to handle this problem. The EU's AI rules focus mainly on how people use AI, not on how AI systems behave. In the United States, experts say the government has shown limited interest in creating strong AI regulations.
'There's little awareness yet,' said Simon Goldstein, a professor at the University of Hong Kong. As AI agents become more common in tasks that involve complex decision-making, these problems may increase. Hobbhahn said, 'Capabilities are outpacing understanding and safety,' though he added that solutions may still be possible.
Finding solutions amid rising concerns
Researchers are now working on improving 'interpretability,' which helps them understand how AI systems make decisions. Dan Hendrycks from the Center for AI Safety expressed doubt about how effective this approach will be. Some experts believe that if deceptive AI becomes widespread, public pressure could force companies to take stronger action.
Mazeika said that large-scale deception could harm public trust in AI and slow down its adoption. Goldstein suggested that the law may need to hold companies or even AI agents legally responsible for harmful actions, marking a major shift in how
AI accountability
is viewed.

Hashtags

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Dos and don'ts: To prevent digital arrest, ‘firewall has to be in your head'

Indian Express

an hour ago

Indian Express

Dos and don'ts: To prevent digital arrest, ‘firewall has to be in your head'

🔴 Block unknown numbers on messaging apps, use caller ID. Do not engage with unknown callers for long. 🔴 Use a separate phone number for bank and other financial transactions, and do not share it with others. 🔴 Consider taking cyber insurance against online fraud for money in the bank or invested in fixed deposits or mutual funds. 🔴 If you receive a request from a relative over the phone for money, even if there is a call with a similar voice, always disconnect and contact the person separately. These are some of the key suggestions from cyber security experts to protect those vulnerable from online scams, such as digital arrest. According to Sundareshwar Krishnamurthy, Partner-Cybersecurity, PwC India, several other measures are needed, too, to strengthen systemic defences. 'Banks have already implemented safeguards, such as setting third-party transfer limits and requiring out-of-band or multi-factor authentication for transactions. These steps introduce an additional lawyer of control by involving a pre-authorised third party,' he said. 'Looking ahead, we hope tools like will enable banks to offer customers a 'kill switch' — a dedicated number they can dial if a transaction is flagged as suspicious. Additionally, there is an urgent need for seamless coordination among law enforcement agencies across states to effectively respond to crimes such as digital arrests,' he said. Krishnamurthy described recent measures, such as spam alerts rolled out by telecom operators and the introduction of developed by Reserve Bank Innovation Hub to detect mule accounts, as steps in the right direction. The tool was created after analysing 19 distinct mule account behaviour patterns observed across banks, and pilot testing is currently underway with two major public sector banks. According to Ranjeeth Bellary, Partner, EY India Forensic and Integrity Services-Cyber Forensics, steps such as blocking and reporting unknown numbers, and using caller ID apps, are among the 'simple precautions' that bank customers and citizens can 'easily take' on their own. 'For a few thousand rupees per year, there are insurance covers for protecting your money lying in the bank as well as money invested in fixed deposits or mutual funds from cyber frauds. Plus, there are some good initiatives that have been taken and more and more firewalls are being introduced. AI initiatives launched by the Government are getting a fairly positive response and AI will play a much bigger role in curbing cyber fraud in future,' Bellary said. Lt General Rajesh Pant (retd), who was till recently posted at the National Security Council Secretariat as National Cyber Security Coordinator, points to the handbook of 'Do's and Dont's'' issued by Indian Cyber Crime Coordination Centre (I4C), the Union Home Ministry's cyber fraud unit, for preventing digital arrest. In the handbook, he points out, the key things 'to do' include: knowing that a digital arrest process does not exist in India; interrogations are never conducted via video calls; and all such calls should be reported via the 'Report Suspect Tab' of On top of the 'not to do' list , according to the I4C, is: do not engage for long with scammers. The Union Home Ministry and I4C did not respond to requests of comment from The Indian Express. Pant, meanwhile, adds another layer of caution: never believe a request over phone from a relative for sending money even if there is a call with a similar voice; disconnect the phone and call back on their number; never send money to avoid loss of reputation. 'Cyber criminals are not hacking computer systems, they are hacking the human brain. They are taking advantage of our fear of reputational loss or police action, especially among the aged. So, the firewall has to be inside your brain,' he said. 'All transactions in a bank that are more than a pre-decided amount should be executed only after confirmation from the account holder and that amount should be decided at the time of opening the account. However, if the individual is under the spell of digital arrest, he will still authorise the same. That's why I say the firewall has to be in your head,' he said. It's not just bank customers but the Government, too, needs to shore up its defences further, say cyber experts. Speaking to The Indian Express, cyber crime investigator, Amit Dubey, who is a member of the Union Home Ministry's Police Technology Mission, said digital arrest and other cyber scams cannot happen without 'engagement' and 'data breach'' within banks. 'The UK recently enacted a law, which makes banks at both ends of the transaction liable to provide compensation to customers who have been cheated. A similar legislation must be introduced in India. As of now, banks are using the fact that victims voluntarily transfer their assets and admit their mistakes as a tool to wash their hands of any liability,' Dubey said. The UK law was announced by the Payments System Regulator (PSR) on October 7, 2024, wherein it is mandatory to compensate customers who have been tricked into sending money to scammers within five days for defrauded amounts upto 85,000 pounds (about Rs 85 lakh). Besides, the refunds to victims are to be split 50-50 between the sending and receiving firms or financial institutions. In India, the Government tabled the Digital Personal Data Protection (DPDP) Act in Parliament in August 2023 to address the spike in cyber crimes. The law aims to protect personal data, including personal banking data, from theft. But the administrative rules for DPDP have yet not been notified with consultations still being held over the draft. Ritu Sarin is Executive Editor (News and Investigations) at The Indian Express group. Her areas of specialisation include internal security, money laundering and corruption. Sarin is one of India's most renowned reporters and has a career in journalism of over four decades. She is a member of the International Consortium of Investigative Journalists (ICIJ) since 1999 and since early 2023, a member of its Board of Directors. She has also been a founder member of the ICIJ Network Committee (INC). She has, to begin with, alone, and later led teams which have worked on ICIJ's Offshore Leaks, Swiss Leaks, the Pulitzer Prize winning Panama Papers, Paradise Papers, Implant Files, Fincen Files, Pandora Papers, the Uber Files and Deforestation Inc. She has conducted investigative journalism workshops and addressed investigative journalism conferences with a specialisation on collaborative journalism in several countries. ... Read More

Z-Library Downloads: Your Step-by-Step Tutorial on z-lib.id

Hans India

3 hours ago

Hans India

Z-Library Downloads: Your Step-by-Step Tutorial on z-lib.id

Unlock the full potential of the world's largest free e-book library the Z-Library Project with our comprehensive guide to downloading from Whether you're a student hunting for textbooks, a lifelong learner chasing rare manuscripts, or a casual reader craving the latest bestseller, this article will walk you through every step—completely safely and efficiently. Why Is Your Go-To for Z-Library Downloads One Trusted Source: No more bouncing between sketchy mirrors. is the only officially endorsed portal for all your zlibrary needs. No more bouncing between sketchy mirrors. is the only officially endorsed portal for all your zlibrary needs. Unlimited Formats: Grab PDFs, EPUBs, MOBIs, even plain-text files—perfect for any device or e-reader. Grab PDFs, EPUBs, MOBIs, even plain-text files—perfect for any device or e-reader. Fast, Ad-Free Experience: High-speed servers mean near-instant downloads without pop-ups or redirects. High-speed servers mean near-instant downloads without pop-ups or redirects. Advanced Search & Filters: Find exactly what you need using AI-powered metadata, full-text filters, and ISBN lookup. Step 1: Access Securely Type the URL Directly: Always begin by entering zlibrary into your browser's address bar—never click unverified links. Verify the SSL Certificate: Look for the padlock icon next to the address. Click it to confirm the certificate is valid and issued to Bookmark for Future Use: Save in your favorites to avoid typos or spoof sites. Step 2: (Optional) Create a Free Account While z-library lets you download anonymously, registering boosts your daily quota and unlocks extra features: Increased Download Limits: Jump from 5 to 50 books per day. Jump from 5 to 50 books per day. Reading History & Wishlist: Track what you've read and save titles for later. Track what you've read and save titles for later. Email Notifications: Get alerts when a new edition or requested title becomes available. Pro Tip: Use a strong, unique password and enable two-factor authentication under Account → Security for maximum protection. Step 3: Finding Your Perfect Download A. Simple Search Enter a title , author , ISBN , or keywords like 'z-library download' or 'z-library how to download books.' , , , or like 'z-library download' or 'z-library how to download books.' Hit Search and browse the results ranked by relevance, popularity, and user-submitted ratings. and browse the results ranked by relevance, popularity, and user-submitted ratings. Format: PDF, EPUB, MOBI, AZW3, TXT PDF, EPUB, MOBI, AZW3, TXT Language: English, Spanish, Chinese, and 40+ others English, Spanish, Chinese, and 40+ others Year: Filter by publication date or upload date Filter by publication date or upload date Category: Fiction, Non-Fiction, Academic, Comics, Journals Fiction, Non-Fiction, Academic, Comics, Journals Click into any result to view detailed metadata: page count, publisher, language, and user reviews. Preview the first few pages in-browser before committing to a full download. B. Advanced Filters C. Metadata & Previews Step 4: Downloading Your E-Book Select Format: Choose your preferred file type. Click 'Download': The file begins immediately—no captchas, waiting rooms, or pop-ups. Save & Organize: Store it in your local library folder or cloud drive (Google Drive, Dropbox) for seamless access across devices. Note: If you exceed your daily limit, simply log in (or upgrade your free account) to reset your quota. Troubleshooting Common z-Library Download Issues Issue Solution 'Download limit reached' Register or log in to increase daily allowances. Download stalls or fails Check your internet connection; try another format or mirror provided on the download page. Incorrect or corrupted file Re-download using a different format (e.g., switch from MOBI to EPUB). SSL or certificate warnings Ensure you're on Official Z-Library Domain and your browser is up to date. FAQs: Everything You Need to Know Is Safe? Yes. All content is delivered over HTTPS via secure, up-to-date SSL certificates, with no third-party ads or trackers. Can I Download on Mobile? Absolutely. The mobile-optimized site works flawlessly on smartphones and tablets—no app required. What If Is Blocked? Use a reliable VPN or Tor browser to bypass regional restrictions. Always reconnect through to avoid fake mirrors. How Do I Request a Book? Click 'Request' on any search results page. Z-Library's volunteer community will do its best to source the file within 48–72 hours. Are Book Uploads Moderated? Yes. All user uploads pass through a metadata and virus-scan check before appearing in the public catalog. Conclusion Downloading from Z-Library has never been easier—or safer—than with By following this guide, you're guaranteed a smooth, ad-free experience every time. Bookmark today, register for extra perks, and dive into millions of free e-books with confidence!

HCLTech and OpenAI collaborate to drive enterprise-scale AI adoption

Hans India

4 hours ago

Hans India

HCLTech and OpenAI collaborate to drive enterprise-scale AI adoption

HCLTech, a leading global technology company, today announced a multi-year strategic collaboration with OpenAI, a leading AI research and deployment company, to drive large-scale enterprise AI transformation as one of the first strategic services partners to OpenAI. HCLTech's deep industry knowledge and AI Engineering expertise lay the foundation for scalable AI innovation with OpenAI. This collaboration will enable HCLTech's clients to leverage OpenAI's industry-leading AI products portfolio alongside HCLTech's foundational and applied AI offerings for rapid and scaled GenAI deployment. Additionally, HCLTech will embed OpenAI's industry-leading models and solutions across its industry-focused offerings, capabilities and proprietary platforms, including AI Force, AI Foundry, AI Engineering and industry-specific AI accelerators. This deep integration will help its clients modernize business processes, enhance customer and employee experiences and unlock growth opportunities, covering the full AI lifecycle, from AI readiness assessments and integration to enterprise-scale adoption, governance and change management. HCLTech will roll out ChatGPT Enterprise and OpenAI APIs internally, empowering its employees with secure, enterprise-grade generative AI tools. Vijay Guntur, Global Chief Technology Officer (CTO) and Head of Ecosystems at HCLTech, said, 'We are honored to work with OpenAI, the global leader in generative AI foundation models. This collaboration underscores our commitment to empowering Global 2000 enterprises with transformative AI solutions. It reaffirms HCLTech's robust engineering heritage and aligns with OpenAI's spirit of innovation. Together, we are driving a new era of AI-powered transformation across our offerings and operations at a global scale.' Giancarlo "GC' Lionetti, Chief Commercial Officer at OpenAI, said, 'HCLTech's deep industry knowledge and AI engineering expertise sets the stage for scalable AI innovation. As one of the first system integration companies to integrate OpenAI to improve efficiency and enhance customer experiences, they're accelerating productivity and setting a new standard for how industries can transform using generative AI.'