Chain Of Thought For Reasoning Models Might Not Work Out Long-Term

01-07-2025

chain, isolated, 3d rendering
New reasoning models have something interesting and compelling called 'chain of thought.' What that means, in a nutshell, is that the engine spits out a line of text attempting to tell the user what the LLM is 'thinking about' as it completes a task.
For instance, if you ask a model a question like: 'what does (X) person do at (X) company?' you might get a chain of thought with items like this, given the system knows how to find the relevant info:
That's a simple example, but chain of thought has been something people rely on quite a bit over the last couple of years.
However, experts are now looking at the limitations of chain of thought reasoning, and suggesting that maybe this resource is lulling us into a false sense of security when it comes to trusting the results that we get from AI.
Language is Limited
One way to describe the limitations of reasoning chains of thought is that language itself is not precise, or particularly easily benchmarked.
Language is clunky. And there are hundreds of languages used across the globe. So the idea that a machine could precisely explain its working in a particular language is an idea with various strict limitations.
Take a look at this excerpt from a paper released by Anthropic, which is actually an academic treatise written by a number of authors.
This and other sources suggest that the chain of thought is just not sophisticated enough to really be accurate, especially as models get larger and demonstrate higher performance levels.
Or check out this idea posed by Melanie Mitchell at Substack in 2023, as these CoT systems were really about to take off:
'Reasoning is a central aspect of human intelligence, and robust domain-independent reasoning abilities have long been a key goal for AI systems,' Mitchell wrote. 'While large language models (LLMs) are not explicitly trained to reason, they have exhibited 'emergent' behaviors that sometimes look like reasoning. But are these behaviors actually driven by true abstract reasoning abilities, or by some other less robust and generalizable mechanism—for example, by memorizing their training data and later matching patterns in a given problem to those found in training data?'
Mitchell then asked why this matters.
'If robust general-purpose reasoning abilities have emerged in LLMs, this bolsters the claim that such systems are an important step on the way to trustworthy general intelligence,' she added. 'On the other hand, if LLMs rely primarily on memorization and pattern-matching rather than true reasoning, then they will not be generalizable—we can't trust them to perform well on 'out of distribution' tasks, those that are not sufficiently similar to tasks they've seen in the training data.'
Testing Faithfulness?
Alan Turing came up with the Turing test in the mid-1900s – the idea that you can measure how good computers are at acting like humans. There are also a lot of things that you can also measure about LLMs using high-level test sets – you can tell how good the models are at math, or at higher-level thought problems.
But how do you tell if the machine is being truthful, or in the words of the authors, 'faithful'?
The above paper goes into the ins and outs of testing faithfulness of LLM models. When I read this entire explanation, what I got is that faithfulness is subjective in a way that math and stochastics are not. Then we have a limited ability to really understand whether machines are being faithful to us in their results.
Think of it this way – we know that for their responses to our questions or statements, LLMs are going on the Internet and looking very broadly at what humans have written. Then they're imitating that. So they're imitating the technically correct knowledge, they imitate the way in which they produce results, they imitate the ways that humans talk – And they also might imitate the ways that humans hedge, the ways that humans hide information, the sins of omission for which we always castigate the media, and a human's propensity to just lie or dissemble in quite sophisticated, or alternately, simple ways.
Chasing Incentives
Furthermore, the authors of the paper point out that LLMs might chase incentives in the same way humans do. They might highlight some particular information that's inaccurate, or less accurate, to get a reward or a prize. Again, they may have learned this directly from us, from how we act on the Internet.
The authors of the above paper call it 'reward hacking.'
'Reward hacking is an undesired behavior:' they write. 'Even though it might produce rewards on one given task, the behavior that generates them is very unlikely to generalize to other tasks (to use the same example, other video games probably don't have that same bug). This makes the model at best useless and at worst potentially dangerous, since maximizing rewards in real-world tasks might mean ignoring important safety considerations (consider a self-driving car that maximizes its 'efficiency' reward by speeding or running red lights).'
At best, useless, and at worst, potentially dangerous. That does not sound good.
Philosophy of Technology
Here's another important aspect of this that I think deserves attention.
This whole idea of evaluating chains of thought is not a technical idea. It doesn't have to do with how many parameters the machine has, or how they're weighted, or how to do particular math problems. It has more to do with the training data and how it's used intuitively. In other words, this debate covers more of the black box of stuff that quants don't engage in when they evaluate models.
That makes me think that we actually do need something I've called for many times – a new army of paid philosophers to figure out how to interact with AI. Rather than mathematicians, we need more people who are willing to think deeply and apply human concepts and ideas, often intuitive ones, and ones based on society and history of civilization, to AI. We are woefully behind in this, because we've solely focused on hiring people who can write Python.
I'll get off the soapbox, but in figuring out how to move beyond chains of thought, we may end up having to realign more of our efforts when it comes to AI job training.

Hashtags

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

SAP to Acquire SmartRecruiters: Integrating Innovative Talent Acquisition Portfolio Will Help Customers Attract and Retain Top Talent

Yahoo

13 minutes ago

Yahoo

SAP to Acquire SmartRecruiters: Integrating Innovative Talent Acquisition Portfolio Will Help Customers Attract and Retain Top Talent

WALLDORF, Germany and SAN FRANCISCO, Aug. 1, 2025 /PRNewswire/ -- SAP (NYSE: SAP) and SmartRecruiters today announced that SAP has entered into an agreement to acquire SmartRecruiters, a leading talent acquisition (TA) software provider. SmartRecruiters' deep expertise in high-volume recruiting, recruitment automation, and AI-enabled candidate experience and engagement are considered an ideal addition to the SAP SuccessFactors human capital management (HCM) suite. The planned acquisition will strengthen SAP's all-in-one HCM suite, so customers have the tools they need to attract and retain top talent in an increasingly competitive landscape. SmartRecruiters' powerful, user-friendly interfaces and seamless workflows will complement SAP's robust HR tools – improving decision-making, reducing time-to-hire and providing a better experience for candidates. Embedded analytics and AI-driven recommendations from both companies will provide rich insights into talent pools, hiring bottlenecks and workforce planning. "Hiring the right people is not just an HR priority – it's a business priority. With this planned acquisition, we will help our customers attract and hire the best talent so they can advance their talent acquisition agendas with speed and agility, while lowering their total cost of ownership," said Muhammad Alam, member of the Executive Board of SAP SE, SAP Product & Engineering. "Customers will be able to manage the entire candidate lifecycle — from sourcing and interviewing to onboarding and beyond — all in a single system to streamline the experience for recruiters, hiring managers, and, in particular, candidates." Customers can expect enhanced and AI-enabled recruiting and hiring capabilities, making applicant tracking and candidate screening more efficient. Data-driven hiring and recruitment analytics will flow directly into SAP's existing HCM tools, providing a single system of record and harmonized data for compliant, seamless operations. The SmartRecruiters portfolio will also continue to be available standalone for the foreseeable future. SmartRecruiters' Software-as-a-Service solutions and platform enable more than 4,000 organizations globally to efficiently manage their hiring workflows end-to-end, offering a compelling experience to recruiters, hiring managers and candidates. SmartRecruiters CEO, Rebecca Carr said, "SmartRecruiters' mission has always been to make hiring easy. Joining forces with SAP presents a tremendous opportunity for enterprises worldwide to benefit from our industry leading approach to talent acquisition. I couldn't be more excited for the opportunity this planned acquisition presents for our customers, partners and employees as we build the future of hiring together." The transaction is expected to close in the fourth quarter of 2025, subject to customary closing conditions, including regulatory approvals. Terms of the transaction were not disclosed. J.P. Morgan served as exclusive financial advisor to SmartRecruiters. Visit the SAP News Center. Get SAP news via LinkedIn and Bluesky. About SAPAs a global leader in enterprise applications and business AI, SAP (NYSE:SAP) stands at the nexus of business and technology. For over 50 years, organizations have trusted SAP to bring out their best by uniting business-critical operations spanning finance, procurement, HR, supply chain, and customer experience. For more information, visit About SmartRecruiters SmartRecruiters is the Recruiting AI Company that transforms hiring for the world's leading enterprises. Built for global scale, SmartRecruiters delivers an AI-powered hiring platform that automates and optimizes the entire talent acquisition process, ensuring faster and smarter hiring decisions. More than 4,000 organizations, including Amazon, Visa, and McDonald's, rely on SmartRecruiters to build winning teams. For more information, visit This document contains forward-looking statements, which are predictions, projections, or other statements about future events. These statements are based on current expectations, forecasts, and assumptions that are subject to risks and uncertainties that could cause actual results and outcomes to materially differ. Additional information regarding these risks and uncertainties may be found in our filings with the Securities and Exchange Commission, including but not limited to the risk factors section of SAP's 2024 Annual Report on Form 20-F. © 2025 SAP SE. All rights and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP SE or an SAP affiliate company in Germany and other countries. Please see for additional trademark information and notices. Note to editors:To preview and download broadcast-standard stock footage and press photos digitally, please visit On this platform, you can find high resolution material for your media channels. For customers interested in learning more about SAP products: Global Customer Center: +49 180 534-34-24United States Only: 1 (800) 872-1SAP (1-800-872-1727) For more information, press only:Joellen Perry, +1 (626)-265-0370, ETDaniel Reinhardt, +49 151 168 10 157, CESTVictoria Dixon, +1 (703) 288 6020, PT SAP Press Room; press@ Please consider our privacy policy. If you received this press release in your e-mail and you wish to unsubscribe to our mailing list please contact press@ and write Unsubscribe in the subject line. Logo: View original content: SOURCE SAP SE Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data

The fellowship offering job-hunting grads an AI training lifeline

Fast Company

14 minutes ago

Fast Company

The fellowship offering job-hunting grads an AI training lifeline

In early March, Volkan Çinar, a chemistry postdoc at MIT, received an email recruiting him to train AI models. Çinar studies carbon-carbon bonds formation in graphene. Given the stiff competition for jobs in academia, Çinar was no longer sure if his dream of working in academia made sense. So he was receptive to the email's pitch. The email came from Handshake, the job search platform which connects 18 million students from 1,600 higher ed institutions to career opportunities, introducing its new MOVE (Model Validation Expert) Fellowship. The new program gives Handshake an entrée into the high end of AI model training, the hot sector that's seen Meta acquire a 49% stake in Scale for more than $14 billion and Surge bootstrap itself to $1 billion in revenue. For talent like Çinar, MOVE offers better money than teaching and comes with AI training. 'I'd never considered working in AI,' Çinar says. 'But given that I'm exploring other positions, I thought I'd give it a try,' even if it meant the risk of paving the road for AI models to take over his field. What to expect from the program The Fellowship's acceptance rate and pay range How to make yourself competitive for an AI gig A better way to source expert talent for AI labs

Metro by T-Mobile is the smartest switch you'll make this year

Digital Trends

14 minutes ago

Digital Trends

Metro by T-Mobile is the smartest switch you'll make this year

Let's cut right to it. Phone plans are getting complicated and expensive. Between hidden fees, activation charges, and plans that look like a great deal until you read the fine print, most people pay more than they should. But Metro by T-Mobile is doing things differently. And honestly? It's refreshing. Metro just dropped two great deals, and if you're tired of overpaying for your phone bill, now is the time to make the switch. The $40 Unlimited Plan — Free 5G Phone Included Yes, you read that right. Bring your number to Metro, pay just $40 a month, and you're getting unlimited 5G data and a free 5G phone. No promo code, no gimmicks, no activation fee. And here's the kicker: That $40 rate? It's locked in for five years. That's what Metro calls the '$40, PERIOD' plan. For once something where the price will not creep up every year (if not sooner). Metro's not only giving you a high-speed, unlimited data plan on the nation's best network, they're making sure your bill doesn't balloon six months from now. That kind of long-term value doesn't exist with other carriers. Did we mention the phone is free? Because that's worth repeating. You can choose from phones like the Samsung Galaxy A16 or Moto G Power and walk out with a 5G device on day one without spending a dime extra. The $25 BYOD Plan — Keep Your Phone, Cut Your Bill Already love your phone? Metro has something for you too. The $25 Bring Your Own Device (BYOD) plan is for customers who want to keep what they've got and save big while doing it. You'll get unlimited talk, text, and 5G data for just $25 a month. Taxes and fees included. The price? Locked in for five years! Let's go! And you don't need to jump through hoops to get it. No ID required. No credit check. No activation fee. Just bring your phone, sign up online, turn on AutoPay, and you're good to go. First month is $30, and after that it drops to $25/month with AutoPay. Both Plans, One Powerful Network Here's the part you don't want to miss: Metro runs on the T-Mobile network, the fastest, most reliable in the country. Watching videos, gaming, FaceTiming, or just making calls? You'll be doing it with serious speed and coverage. Plus, both plans include T-Mobile Tuesdays (hello, free perks), Scam Shield protection, and unlimited everything. The choice is yours: want a brand-new 5G phone or just want to stop overpaying for the one you already have, Metro by T-Mobile has a plan for you. No contracts. No surprises. Just unbeatable value and a network that delivers. Switching? Yeah, it's kind of a no-brainer.

Chain Of Thought For Reasoning Models Might Not Work Out Long-Term

Hashtags

Try Our AI Features

Comments

Related Articles

SAP to Acquire SmartRecruiters: Integrating Innovative Talent Acquisition Portfolio Will Help Customers Attract and Retain Top Talent

The fellowship offering job-hunting grads an AI training lifeline

Metro by T-Mobile is the smartest switch you'll make this year

Get Started Now: Download the App