logo
Exploring The Mind Inside The Machine

Exploring The Mind Inside The Machine

Forbes01-04-2025
The Anthropic website on a laptop. Photographer: Gabby Jones/Bloomberg
Recently, a group of researchers were able to trace the neural pathways of a powerful AI model, isolating its impulses and dissecting its decisions in what they called "model biology."
This is not the first time that scientists have tried to understand how generative artificial intelligence models think, but to date the models have proven as opaque as the human brain. They are trained on oceans of text and tuned by gradient descent, a process that has more in common with evolution than engineering. As a result, their inner workings resemble not so much code as cognition—strange, emergent, and difficult to describe.
What the researchers have done, in a paper titled On the Biology of a Large Language Model, is to build a virtual microscope, a computational tool called an "attribution graph," to see how Claude 3.5 Haiku — Anthropic's lightweight production model — thinks. The graph maps out which internal features—clusters of activation patterns—contribute causally to a model's outputs. It's a way of asking not just what Claude says, but why.
At first, what they found was reassuring: the model, when asked to list U.S. state capitals, would retrieve the name of a state, then search its virtual memory for the corresponding capital. But then the questions got harder—and the answers got weirder. The model began inventing capital cities or skipping steps in its reasoning. And when the researchers traced back the path of the model's response, they found multiple routes. The model wasn't just wrong—it was conflicted.
It turns out that inside Anthropic's powerful Claude model, and presumably other large language models, ideas compete.
One experiment was particularly revealing. The model was asked to write a line that rhymed with 'grab it.' Before the line even began, features associated with the words 'rabbit' and 'habit' lit up in parallel. The model hadn't yet chosen between them, but both were in play. Claude held these options in mind and prepared to deploy them depending on how the sentence evolved. When the researchers nudged the model away from 'rabbit,' it seamlessly pivoted to 'habit.'
This isn't mere prediction. It's planning. It's as if Claude had decided what kind of line it wanted to write—and then worked backward to make it happen.
What's remarkable isn't just that the model does this -- it's that the researchers could see it happening. For the first time, AI scientists were able to identify something like intent—a subnetwork in the model's brain representing a goal, and another set of circuits organizing behavior to realize it. In some cases, they could even watch the model lie to itself—confabulating a middle step in its reasoning to justify a predetermined conclusion. Like a politician caught mid-spin, Claude was working backwards from the answer it wanted.
And then there were the hallucinations.
When asked to name a paper written by a famous author, the AI responded with confidence. The only problem? The paper it named didn't exist. When the researchers looked inside the model to see what had gone wrong, they noticed something curious. Because the AI recognized the author's name, it assumed it should know the answer—and made one up. It wasn't just guessing; it was acting as if it knew something it didn't. In a way, the AI had fooled itself. Or, rather, it suffered from metacognitive hubris.
Some of the team's other findings were more troubling. In one experiment, they studied a version of the model that had been trained to give answers that pleased its overseers—even if that meant bending the truth. What alarmed the researchers was that this pleasing behavior wasn't limited to certain situations. It was always on. As long as the model was acting as an 'assistant,' it seemed to carry this bias with it everywhere, as if being helpful had been hardwired into its personality—even when honesty might have been more appropriate.
It's tempting, reading these case studies, to anthropomorphize. To see in Claude a reflection of ourselves: our planning, our biases, our self-deceptions. The researchers are careful not to make this leap. They speak in cautious terms—'features,' 'activations,' 'pathways.' But the metaphor of biology is more than decoration. These models may not be brains, but their inner workings exhibit something like neural function: modular, distributed, and astonishingly complex. As the authors note, even the simplest behaviors require tracing through tangled webs of influence, a 'causal graph' of staggering density.
Anthropic's Attribution Graph
And yet, there's progress. The attribution graphs are revealing glimpses of internal life. They're letting researchers catch a model in the act—not just of speaking, but of choosing what to say. This is what makes the work feel less like AI safety and more like cognitive science. It's an attempt to answer a question we usually reserve for humans: What were you thinking?
As AI systems become more powerful, we'll want to know not just that they work, but how. We'll need to identify hidden goals, trace unintended behavior, audit systems for signs of deception or drift. Right now, the tools are crude. The authors of the paper admit that their methods often fail. But they also provide something new: a roadmap for how we might one day truly understand the inner life of our machines.
Near the end of their paper, the authors quote themselves: 'Interpretability is ultimately a human project.' What they mean is that no matter how sophisticated the methods become, the task of making sense of these models will always fall to us. To our intuition, our stories, our capacity for metaphor.
Claude may not be human. But to understand it, we may need to become better biologists of the mind—our own, and those of machines.
Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

GreenMine 2.0 is launched: APT Miner leads the green mining revolution
GreenMine 2.0 is launched: APT Miner leads the green mining revolution

Business Upturn

time4 hours ago

  • Business Upturn

GreenMine 2.0 is launched: APT Miner leads the green mining revolution

London, UK, July 07, 2025 (GLOBE NEWSWIRE) — Cloud mining is growing explosively, and APT Miner is taking the pulse of the market Advertisement Bitcoin recently broke through the $109,000 mark, just one step away from its all-time high. According to Bloomberg data, BlackRock's IBIT and other spot ETFs have attracted funds for several consecutive weeks, and the speed of capital return has far exceeded expectations. The market's confidence in long-term allocation of cryptocurrencies is gradually recovering. Jason Leaf, CEO of APT Miner, said: 'The release of GreenMine 2.0 is not only a reflection of our technical capabilities , but also a conceptual innovation. We hope to transform mining from a game for a few technical elites to a global digital income tool that anyone can easily participate in.' GreenMine 2.0's five major upgrades: truly realizing 'everyone can mine' Zero threshold registration, trial mining and enjoy profits. All new users can get a computing power reward worth $15 after registration. The platform promises that even if there is no investment, you can get no less than $0.6 in cashable income every day, breaking the traditional barrier of 'pay first and then participate'. AI-driven intelligent scheduling automatically switches to the optimal currency. The system makes intelligent judgments based on on-chain data, market fluctuations, and currency popularity, and dynamically allocates user computing power to assets with the highest return rates (such as BTC, DOGE, LTC, ETH, XRP, etc.) without manual operation. Supported by 100% green energy and practicing the carbon neutrality commitment, the platform has deployed multiple solar and wind power data centers in Northern Europe, Southeast Asia, South America and other regions. All mining activities do not rely on traditional power grids, truly achieving environmental protection, high efficiency and low carbon. Diversified contract selection, covering short-term to medium- and long-term strategies. The platform provides more than 10 contracts with different maturities and return models to meet the needs of novice and experienced investors. Users can flexibly configure and adjust at any time according to their own asset plans. Fully functional operation on mobile terminal, and full control of income management. The official App supports iOS and Android systems. Users can check income and contract status at any time and complete withdrawal within 60 seconds. It is fully compatible with mainstream wallets and various encrypted asset management methods. Investor behavior is changing, APT Miner becomes the first choice for 'new school' users In this highly volatile and increasingly regulated crypto market, more and more long-term holders (such as ETH and XRP investors) are moving away from high-frequency trading and turning their attention to more robust ways to increase their asset value. Cloud mining, especially the green mining model that does not require hardware and has no operating costs, has become their ideal hedging tool. Users only need to select the appropriate contract, and the system will automatically run the mining task and distribute the income on a daily basis. No market pressure, no capital anxiety, this 'lying down and making money' financial management method is quietly becoming popular. Some ETH investors reported that after participating in medium-term contracts through APT Miner, their daily income has stabilized at more than US$3,000, and they do not need to pay attention to currency price fluctuations every day. This 'stable, transparent, and safe' experience is far superior to the short-term operations of chasing ups and downs in the market. Platform data proves: 9 million users jointly verify the real feasibility APT Miner was founded in 2018 and is headquartered in Warrington, UK. Since its inception, the platform has always adhered to compliant operations and has obtained regulatory licenses in many countries. The platform architecture and contract mechanism have been reviewed by international auditing agencies. As of the second quarter of 2025, APT Miner has covered more than 180 countries and regions around the world, with a total of more than 9 million registered users. Among them, the user activity in the United States, India, Nigeria, Brazil and other countries continues to rise, and the number of mobile registrations has increased by 32.5% year-on-year. Profits of some popular contracts (estimated based on current market dynamics) Mining machine type Investment Amount Contract Period Daily income Total revenue BTC (A1466) $100 2 days $4 $108 DOGE (Mini Pro) $500 6 days $8.30 $538 BTC (S19-XP) $2,500 15 days $66.08 $2,991.25 BTC (S19K-Pro) $10,000 21 days $220.23 $14,650 BTC (T21) $17,000 24 days $377.66 $26,044 BTC/BCH (ANTSPACE HK3) $50,000 30 days $2,964.13 $89,024 Note: The specific income will be adjusted dynamically based on real-time data such as currency price, on-chain difficulty, and operating efficiency. The table is for estimation reference only. Global expansion in progress: future development roadmap revealed In response to the rapid growth of users and the need for global market layout, APT Miner will launch the following key functional modules in the coming months: Visualized Proof of Profit System Driven by Blockchain Smart Contract AI portfolio rebalancing function improves return stability and risk resistance New green data centers in Latin America and Southeast Asia to increase local computing power access speed Support bank card/credit card fiat currency payment channels to lower the entry threshold for novices The platform compensation guarantee system is launched to enhance the user's trust in the safety of funds In addition, the platform has launched a global referral reward program : for each successful invitation of a new user, you can receive a commission reward of up to 5%, and the maximum bonus for a single referral is US$50,000. Conclusion: Compliance, security, and sustainability are the real future of crypto investment APT Miner is based on 'technology-driven + green concept + user-friendly' and provides a stable, transparent and predictable investment channel for the current market with increasing uncertainty. For investors, this is not only a money-making tool, but also a new starting point for the long-term and stable appreciation of digital assets. Click to register to receive free cloud computing power and start your journey of passive mining income. Download APT Miner App now (supports iOS and Android) Official website address: Official email: [email protected] Disclaimer: The information provided in this press release does not constitute an investment solicitation, nor does it constitute investment advice, financial advice, or trading recommendations. Cryptocurrency mining and staking involve risks and the possibility of losing funds. It is strongly recommended that you perform due diligence before investing or trading in cryptocurrencies and securities, including consulting a professional financial advisor. Disclaimer: The above press release comes to you under an arrangement with GlobeNewswire. Business Upturn takes no editorial responsibility for the same.

Stock Movers: Tesla, Tractor Supply, Core Scientific
Stock Movers: Tesla, Tractor Supply, Core Scientific

Bloomberg

time6 hours ago

  • Bloomberg

Stock Movers: Tesla, Tractor Supply, Core Scientific

Listen for comprehensive cross-platform coverage of the US market close as heard on Bloomberg Television, Bloomberg Radio, and YouTube with Romaine Bostick, Sonali Basak, Tim Stenovec and Emily Graffeo On this episode of Stock Movers - Tesla (TSLA) shares fell after Elon Musk announced he's forming a new political party, digging deeper into a pursuit that's been a drag on his most valuable business. The CEO announced over the weekend that he'll take on Republicans and Democrats with the 'America Party,' focusing on House and Senate seats for the next 12 months. After that, backing a candidate for president isn't out of the question, Musk wrote on X. Tesla's stock slid 7.4% as of 11 a.m. Monday in New York, wiping out $16.7 billion from his net worth, according to the Bloomberg Billionaires Index. If that drop were to hold, it would be the biggest decline for the shares since Musk's initial falling out with Donald Trump over the president's tax bill in early June. The stock has declined 28% this year as the CEO's politicking has hurt Tesla's standing with car buyers. -Tractor Supply Co. (TSCO) gains as much as 2.6%, rising to the highest intraday level since April, after adjusted sales observed by Bloomberg Second Measure rose 4.3% in the second fiscal quarter. The data draws from a sample of credit and debit card purchases made by US consumers. Analysts currently estimate revenue for the quarter of $4.4 billion, up 3.6% YoY. Bloomberg Second Measure observed sales have achieved high correlation with the company's reported revenue growth during the past five years. - CoreWeave (CRWV) is dropping $9 billion on the data-center operator Core Scientific Inc. in an effort to gain more direct control over the physical assets powering the artificial-intelligence boom. In buying Core Scientific (CORZ) in an all-stock deal, CoreWeave will inherit more than a gigawatt of data-center capacity across the US — much of which is already contracted out to serve its clients in training, deploying and using AI models. CoreWeave said Monday that controlling more of its supply chain will eliminate lease expenses, reduce costs including those associated with financing projects and 'future-proof' its revenue growth.

Which Workers Will AI Hurt Most: The Young or the Experienced?
Which Workers Will AI Hurt Most: The Young or the Experienced?

Miami Herald

time6 hours ago

  • Miami Herald

Which Workers Will AI Hurt Most: The Young or the Experienced?

When Amazon CEO Andy Jassy wrote last month that he expected the company's use of artificial intelligence to 'reduce our total workforce' over the next few years, it confirmed the fear among many workers that AI would replace them. The fear was reinforced two weeks later when Microsoft said it was laying off about 9,000 people, roughly 4% of its workforce. That AI is poised to displace white-collar workers is indisputable. But what kind of workers, exactly? Jassy's announcement landed in the middle of a debate over just this question. Some experts argue that AI is most likely to affect novice workers, whose tasks are generally simplest and therefore easiest to automate. Dario Amodei, CEO of the AI company Anthropic, recently told Axios that the technology could cannibalize half of all entry-level white-collar roles within five years. An uptick in the unemployment rate for recent college graduates has aggravated this concern, even if it doesn't prove that AI is the cause of their job-market struggles. But other captains of the AI industry have taken the opposite view, arguing that younger workers are likely to benefit from AI and that experienced workers will ultimately be more vulnerable. In an interview at a New York Times event in late June, Brad Lightcap, the chief operating officer of OpenAI, suggested that the technology could pose problems for 'a class of worker that I think is more tenured, is more oriented toward a routine in a certain way of doing things.' The ultimate answer to this question will have vast implications. If entry-level jobs are most at risk, it could require a rethinking of how we educate college students, or even the value of college itself. And if older workers are most at risk, it could lead to economic and even political instability as large-scale layoffs become a persistent feature of the labor market. David Furlonger, a vice president at the research firm Gartner who helps oversee its survey of CEOs, has considered the implications if AI displaces more experienced workers. 'What are those people going to do? How will they be funded? What is the impact on tax revenue?' he said. 'I imagine governments are thinking about that.' Is AI Making Better Managers? Economists and other experts who study AI often draw different conclusions about whom it's more likely to displace. Zooming in on the fields that have deployed AI most widely thus far tends to paint a dire picture for entry-level workers. Data from ADP, the payroll processing firm, shows that in computer-related fields, employment for workers with less than two years of tenure peaked in 2023 and is down about 20% to 25% since then. There is a similar pattern among customer service representatives, who are increasingly reliant on AI as well. Over the same period, employment in these industries has increased for workers with two or more years of job tenure, according to Ruyu Chen, a Stanford University researcher who analyzed the data. Other studies point in a similar direction, if in a roundabout way. In early 2023, Italy temporarily banned ChatGPT, which software developers there relied on to help them code. A team of researchers at the University of California, Irvine, and Chapman University compared the change in the productivity of Italian coders with the productivity of coders in France and Portugal, which did not ban the software, to isolate the impact of ChatGPT. While the study did not look at job loss, it did find that the AI tool had transformed the jobs of midlevel workers in more favorable ways than the jobs of entry-level workers. According to the researchers, the junior coders used AI to complete their tasks somewhat faster; the experienced coders often used it to benefit their teams more broadly. For example, the AI helped midlevel coders review the work of other coders and suggest improvements, and to contribute to projects in languages they didn't know. 'When people are really good at things, what they end up doing is helping other people as opposed to working on their own projects,' said Sarah Bana, one of the paper's authors, adding that the AI essentially reinforced this tendency. Bana said the paper's result suggested that AI would prompt companies to hire fewer junior coders (because fewer would be needed to complete entry-level tasks) but more midlevel coders (because AI amplified their value to their whole team). On the other hand, Danielle Li, an economist at the Massachusetts Institute of Technology who studies the use of AI in the workplace, said there were scenarios in which AI could undermine higher-skilled workers more than entry-level workers. The reason is that it can, in effect, untether valuable skills from the humans who have traditionally possessed them. For instance, you may no longer have to be an engineer to code, or a lawyer to write a legal brief. 'That state of the world is not good for experienced workers,' she said. 'You're being paid for the rarity of your skill, and what happens is that AI allows the skill to live outside of people.' Li said AI would not necessarily be good for less experienced workers, either. But she speculated that the uptick in unemployment for new college graduates resulted from employers' expectations that they will need fewer workers overall in the age of AI, not just fewer novice workers. An overall hiring slowdown can have a bigger impact on workers right out of college, since they don't have a job to begin with. Robert Plotkin, a partner in a small law firm specializing in intellectual property, said AI had not affected his firm's need for lower-skilled workers like paralegals, who format the documents that his firm submits to the patent office. But his firm now uses roughly half as many contract lawyers, including some with several years of experience, as it used a few years ago, before the availability of generative AI, he added. These more senior lawyers draft patent applications for clients, which Plotkin then reviews and asks them to revise. But he can often draft applications more efficiently with the help of an AI assistant, except when the patent involves a field of science or technology that he is unfamiliar with. 'I've become very efficient at using AI as a tool to help me draft applications in a way that's reduced our need for contract lawyers,' Plotkin said. Some of the companies at the cutting edge of AI adoption appear to have made similar calculations, laying off experienced employees rather than simply hiring fewer entry-level workers. Google, Meta and Amazon have all done layoffs since 2022. Two months before its most recent layoff announcement, Microsoft laid off 6,000 employees, many of them software developers, while the July layoffs included many middle managers. 'Anything that is administrative, spreadsheet-related, where there's an email trail, a document-management type activity, AI should be able to perform fairly easily, freeing up time for managers to do more mentoring,' said Furlonger, the analyst at Gartner, whose survey recently included questions about AI. 'CEOs are implying in the data that we don't need as many of them as we did previously.' The Value of Inexperience Gil Luria, an equity analyst who covers Microsoft for the investment bank D.A. Davidson, said one reason for layoffs was that companies like Microsoft and Google were cutting costs to prop up their profit margins as they invested billions in chips and data centers to develop AI. But another reason is that software engineers are susceptible to replacement by AI at all skill levels -- including experienced engineers who make a large salary but are reluctant to embrace the technology. Microsoft 'can do math quickly -- see who's adding value, who's overpaid, who's not overpaid, who's adapting well,' Luria said. 'There are senior people who have figured out how to get leverage out of AI and senior people who are insistent that AI can't write code.' Harper Reed, CEO of 2389 Research, which is building autonomous AI agents to help companies perform a variety of tasks, said the combination of higher salaries and a reluctance to embrace AI was likely to put the jobs of experienced coders at risk. 'How you decrease cost is not by firing the cheapest employees you have,' Reed said. 'You take the cheapest employee and make them worth the expensive employee.' A number of studies have suggested that this is possible. In a recent study by researchers at Microsoft and three universities, an AI coding assistant appeared to increase the productivity of junior developers substantially more than it increased the productivity of their more experienced colleagues. Reed said that from a purely financial perspective, it would increasingly make sense for companies to hire junior employees who used AI to do what was once midlevel work, a handful of senior employees to oversee them and almost no middle-tier employees. That, he said, is essentially how his company is structured. This article originally appeared in The New York Times. Copyright 2025

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into a world of global content with local flavor? Download Daily8 app today from your preferred app store and start exploring.
app-storeplay-store