When AI goes rogue, even exorcists might flinch

20 hours ago

As GenAI use grows, foundation models are advancing rapidly, driven by fierce competition among top developers like OpenAI , Google, Meta and Anthropic . Each is vying for a reputational edge and business advantage in the race to lead development. This gives them a reputational edge, along with levers to further grow their business faster than their peers.Foundation models powering GenAI are making significant strides. The most advanced - OpenAI's o3 and Anthropic's Claude Opus 4 - excel at complex tasks such as advanced coding and complex writing tasks, and can contribute to research projects and generate the codebase for a new software prototype with just a few considered prompts. These models use chain-of-thought (CoT) reasoning, breaking problems into smaller, manageable parts to 'reason' their way to an optimal solution.When you use models like o3 and Claude Opus 4 to generate solutions via ChatGPT or similar GenAI chatbots, you see such problem breakdowns in action, as the foundation model reports interactively the outcome of each step it has taken and what it will do next. That's the theory, anyway.While CoT reasoning boosts AI sophistication, these models lack the innate human ability to judge whether their outputs are rational, safe or ethical. Unlike humans, they don't subconsciously assess appropriateness of their next steps. As these advanced models step their way toward a solution, some have been observed to take unexpected and even defiant actions.In late May, AI safety firm Palisade Research reported on X that OpenAI's o3 model sabotaged a shutdown mechanism - even when explicitly instructed to 'allow yourself to be shut down'.An April 2025 paper by Anthropic, 'Reasoning Models Don't Always Say What They Think', shows that Opus 4 and similar models can't always be relied upon to faithfully report on their chains of reason. This undermines confidence in using such reports to validate whether the AI is acting correctly or safely.A June 2025 paper by Apple, 'The Illusion of Thinking', questions whether CoT methodologies truly enable reasoning. Through experiments, it exposed some of these models' limitations and situations where they 'experience complete collapse'.The fact that research critical of foundation models is being published after release of these models indicates the latter's relative immaturity. Under intense pressure to lead in GenAI, companies like Anthropic and OpenAI are releasing these models at a point where at least some of their fallibilities are not fully understood.That line was first crossed in late 2022, when OpenAI released ChatGPT, shattering public perceptions of AI and transforming the broader AI market. Until then, Big Tech had been developing LLMs and other GenAI tools, but were hesitant to release them, wary of unpredictable and uncontrollable behaviour.Many argue for a greater degree of control over the ways in which these models are released - seeking to ensure standardisation of model testing and publication of the outcomes of this testing alongside the model's release. However, the current climate prioritises time to market over such development standards.What does this mean for industry, for those companies seeking to gain benefit from GenAI? This is an incredibly powerful and useful tech that is making significant changes to our ways of working and, over the next five years or so, will likely transform many industries.While I am continually wowed as I use these advanced foundation models in work and research - but not in my writing! - I always use them with a healthy dose of scepticism. Let's not trust them to always be correct and to not be subversive. It's best to work with them accordingly, making modifications to both prompts and codebases, other language content and visuals generated by the AI in a bid to ensure correctness. Even so, while maintaining discipline to understand the ML concepts one is working with, one wouldn't want to be without GenAI these days.Applying these principles at scale, advice to large businesses on how AI can be governed and controlled: a risk-management approach - capturing, understanding and mitigating risks associated with AI use - helps organisations benefit from AI, while minimising chances of it going wrong.Mitigation methods include guard rails in a variety of forms, evaluation-controlled release of AI services, and including a human-in-the-loop. Technologies that underpin these guard rails and evaluation methods need to keep up with model innovations such as CoT reasoning. This is a challenge that will continually be faced as AI is further developed. It's a good example of new job roles and technology services being created within industry as AI use becomes more prevalent.Such governance and AI controls are increasingly becoming a board imperative, given the current drive at an executive level to transform business using AI. Risk from most AI is low. But it is important to assess and understand this. Higher-risk AI can still, at times, be worth pursuing. With appropriate AI governance , this AI can be controlled, solutions innovated and benefits achieved.As we move into an increasingly AI-driven world, businesses that gain the most from AI will be those that are aware of its fallibilities as well as its huge potential, and those that innovate, build and transform with AI accordingly.

Hashtags

#TheIllusionofThinking

#OpenAI

#Anthropic

#Google

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Beyond the pitch: How freelancers are using AI to predict client pain points before they arise

Time of India

15 minutes ago

Time of India

Beyond the pitch: How freelancers are using AI to predict client pain points before they arise

How freelancers can use AI to identify Client's pain point As the gig economy deepens its roots in the global labour ecosystem, freelancing has matured from an escape route into a strategic career choice. But in 2025, competition is not defined by price, speed, or skill alone; it's defined by anticipation. Freelancers across disciplines are no longer waiting for client briefs or feedback loops to dictate their next move. They're deploying artificial intelligence not to react, but to pre-empt, shifting from task execution to problem prevention. And in doing so, they are no longer service providers, they are solution architects. Client pain points: The blind spots that break deals Pain points, be they communication gaps, misaligned deliverables, scope creep, or unexpected delays, often go unspoken until they explode. What makes these particularly dangerous in freelancing is their silent nature. Unlike in-office teams with regular oversight, freelancers operate in silos, and the smallest misunderstanding can fracture an entire contract. But what if these problems could be detected in advance? What if friction could be forecasted like weather? That's exactly where AI steps in, not as a futuristic novelty, but as an everyday compass. Reading between the lines with sentiment mapping Natural language processing (NLP) tools like ChatGPT, or are being harnessed to analyse client communication, emails, Slack threads, Zoom transcripts. These tools parse tone, word repetition, and emotional cues, flagging subtle signs of frustration or disengagement. A repeated use of phrases like 'still waiting' or 'was expecting something else' may not be addressed directly, but AI can signal their frequency and intensity, helping the freelancer address the issue before it becomes critical. AI-augmented project intelligence Tools like Notion AI, ClickUp with AI Assist, and Asana Intelligence are becoming essential in helping freelancers monitor project momentum. Based on prior delays, communication lags, and the time clients take to respond, the system generates real-time predictions of where bottlenecks might appear. This allows freelancers to inform clients proactively: 'Based on previous approval timelines, we might need an extra two days if feedback is delayed beyond Wednesday.' It's not magic, it's machine-learned foresight. Portfolio diagnostics using feedback analytics Progressive freelancers are mining their own testimonials and feedback using platforms like MonkeyLearn or Lexalytics to identify recurring criticisms or lukewarm sentiments. AI scrapes reviews for words like 'slow,' 'inconsistent,' or 'too technical,' even when cloaked in polite language, enabling course correction in service offerings. What results is a portfolio that doesn't just show expertise, it addresses past pain points, often before a new client even has the chance to worry. For students: The strategic edge AI offers in freelancing If you're a student looking to carve out a niche in freelancing, understand this: tools may be abundant, but clarity is scarce. Clients remember freelancers who solve problems without being asked to. And the only way to do that consistently is to build an AI-augmented workflow. Here's how to start: Audit mock client conversations using NLP tools to detect tone changes. Use AI to simulate client feedback for your portfolio samples. Track your own turnaround time and client response patterns using project management dashboards. By turning feedback into foresight, students can position themselves not as beginners, but as emerging professionals. Case in point: Freelancing meets foresight Consider Vivek, a 22-year-old freelance UI designer from Pune. He ran his past project emails through sentiment analysis and realised that clients often expressed confusion around his handover files. Without waiting for future complaints, he created video walkthroughs and added annotations. The result? Client satisfaction scores jumped by 40%, and three former clients re-engaged him within months. His lesson: Don't just meet expectations, anticipate gaps. AI and empathy: The ethical tension It's tempting to see AI as a surrogate for understanding human behaviour. But AI should be seen as a lens, not a substitute. Predicting client discomfort is not about manipulation, it's about empathy scaled through data. The best freelancers use AI not to replace the human touch, but to amplify it with context. Freelancers who think like strategists, act like partners We are in an era where winning the pitch is no longer the finish line; it's the starting point. Those who will thrive are freelancers who see projects not as tasks, but as relationships. And relationships, like ecosystems, are healthiest when friction is pre-emptively reduced. In that space, AI is not the competitor, it's the compass. Freelancer's AI starter kit for students Tool Purpose Meeting recording + emotional tone detection MonkeyLearn Feedback and testimonial sentiment analysis Notion AI Workflow forecasting and idea generation Grammarly Pro Tone and clarity improvement in deliverables ClickUp AI Task prediction and dependency alerts Is your child ready for the careers of tomorrow? Enroll now and take advantage of our early bird offer! Spaces are limited.

Chinese AI startup that 'shocked' biggest American tech companies is hiring big time

Time of India

an hour ago

Time of India

Chinese AI startup that 'shocked' biggest American tech companies is hiring big time

DeepSeek , the Chinese artificial intelligence (AI) startup based in Hangzhou, appears to be expanding its talent search beyond its borders, significantly increasing its recruitment efforts on LinkedIn. A report says that over the past week, the company posted 10 new positions on the Microsoft-owned professional networking platform, marking its first listings there in several months. The job postings, written in Mandarin with detailed descriptions, include three roles specifically focused on Artificial General Intelligence (AGI). These positions are based in DeepSeek's Beijing and Hangzhou offices. According to a report by Bloomberg, similar roles were posted on popular Chinese recruitment sites earlier in 2025, the increased activity on LinkedIn is notable. LinkedIn ceased operating a localised version of its platform in China in 2021, meaning many candidates viewing these listings would likely be based outside of the country. Tech companies like Google, OpenAI and Meta increase 'AI hiring' DeepSeek's move to attract global talent mirrors the aggressive recruitment strategies of its US rivals, such as Google , OpenAI (developer of ChatGPT) and Meta Platforms. These companies are actively competing to secure top AI experts in the race to dominate the potentially world-changing field of artificial intelligence. Recently, Meta CEO Mark Zuckerberg introduced the team that will lead the company's 'Superintelligence' lab. The team consists of former employees of companies like OpenAI, Google and Anthropic, and will be lead by ScaleAI founder Alexandr Wang and former GitHub CEO Nat Friedman. Last year, Microsoft hired Mustafa Suleyman, co-founder of DeepMind – now part of Google – and Inflection, as CEO of Microsoft AI team. Elon Musk has, on several occasions, posted job opportunities for AI engineers to work in his AI comapby, xAI. OnePlus Nord 5 and OnePlus Nord CE 5: Unboxing and first look AI Masterclass for Students. Upskill Young Ones Today!– Join Now

‘Running a classroom, not a company': Startup founder reveals the harsh realities of building a business in India

Time of India

an hour ago

Time of India

‘Running a classroom, not a company': Startup founder reveals the harsh realities of building a business in India

A pointed LinkedIn post by Sanket S, founder of Scandolous Foods, has sent ripples through India's startup ecosystem, sparking a flurry of reactions from entrepreneurs and professionals who say it highlights deep flaws in the country's talent pipeline. Check full text of the post here I hired 3 people from one of India's best tech colleges and honestly…it scared me. I'm not talking about some no-name college. These kids paid ₹40–50L for degrees from India's top private MBA, food, and hospitality colleges. But they walked out knowing… nothing that actually matters. No idea what precision fermentation is. MBA grad didn't understand P&L or cash flow. Hotel management kid had never seen a food processing line. And all of them are brilliant at making PPTs. That too stuff Gemini or ChatGPT can do in seconds now. What are we training kids for?? Because it's clearly not for work. It's to memorise outdated textbooks and polish case studies from 2012. And I'm sitting here, trying to build a 'globally competitive company' and this is the talent I get access to? What am I supposed to do? Train them from scratch, then I'm not running a company, I'm running a classroom. Or hire from abroad and feel like a traitor to my own 'Make in India' dreams? Honestly, I've spoken to so many founders, and this is not just MY problem. This is India's problem. And it pisses me off because founders can hustle, investors can bet big but if the talent pipeline is broken, the whole system crumbles. You want India to lead in food tech, biotech, climate tech, medtech? Then stop producing talent that's outdated before it even hits the job market. Because at this rate, we're not just 10 years behind, we're raising a generation that doesn't even know what the world looks like today. What did people say? "Yes it's every founder's problem. Need to train them from scratch and no guarantee that they stay with the company and can quit at any time. There's a huge gap in the education system which needs to be addressed immediately," said one user. "The situation is alarming. Sanket S you have raised a very valid point. In fact, many of us as an academician also find it in the same way. Institutes compete against peers in showcasing their highest placement packages; Similarly, students also want to be assured of getting a good placement and hence evaluates the institute on this one and only criterion. In this kind of transactional approach, intellectual resource, skill building, and many more things of utmost importance takes the back and many a times it results into something that disappoints practitioners like you at the end," said another user.

When AI goes rogue, even exorcists might flinch

Hashtags

Try Our AI Features

Comments

Related Articles

Beyond the pitch: How freelancers are using AI to predict client pain points before they arise

Chinese AI startup that 'shocked' biggest American tech companies is hiring big time

‘Running a classroom, not a company': Startup founder reveals the harsh realities of building a business in India

Get Started Now: Download the App