DeepSeek: Smarter Software Vs. More Compute

07-05-2025

Daniel A. Keller, CEO and President of InFlux Technologies Limited. Cofounder of Flux.
When ChatGPT was released by OpenAI in 2022, it was the peak expression of AI chatbots built on large language models (LLMs). With an accessible interface and absolutely no need for external gadgets, it was the power of interactive AI in the palms of users, literally!
Barely five days after its launch, ChatGPT passed the 1 million user milestone. (For context, that took Facebook 10 months to achieve.) Of course, there were a few problems, like occasional lag and hallucinations, but version after version, ChatGPT continued to expand its frontiers.
There were also apprehensions about the development cost of GPT-4, estimated at somewhere between $48 million and $71 million. But it was all completely justifiable. Sixteen thousand H100 GPUs don't come cheap, and salaries have to be paid.
Or was it?
Rise Of The Deep
On January 20, 2025, the world woke up to news that would change the trajectory of AI technology. A little-known Chinese company had launched DeepSeek R1, an AI with capabilities comparable to OpenAI's ChatGPT.
And the shocker?
The initial reports claimed it did so with fewer, cheaper and older GPUs at a development cost of only $5.6 million. The news sent shock waves across the markets. By the following Monday, Nvidia, the biggest supplier of AI GPUs, had lost almost $600 billion in market value as investors started reconsidering their options. The Nasdaq index and companies like Microsoft and Alphabet also fell sharply. Within a week, DeepSeek had overtaken ChatGPT to become the most downloaded application on the Apple App Store.
But since then, DeepSeek has come under scrutiny, with the head of Google's DeepMind calling its claims "exaggerated" and one critic suggesting it actually cost DeepSeek over $1 billion to create its AI model.
Nevertheless, DeepSeek's arrival has caused a shift. The investment rationale for the supply chain had been quite simple: more spending meant better outcomes for AI.
Until now.
The Paradigm Shift
DeepSeek's story is exceptional for several reasons. First, due to the United States' efforts to stem the flow of advanced AI technology to competing nations, the Biden administration restricted the export of GPUs to China, limiting the availability of advanced AI GPUs like the A100 and the H100. As a result, DeepSeek presumably had to rely on less sophisticated but more available GPUs like the H800.
The ability of DeepSeek to turn this crippling limitation into one of the marvels of AI innovation raises a critical question: Are ingenuity and better software architecture a more sustainable alternative to advanced but expensive GPUs?
GPU availability (especially of advanced chips like the H100) is one of the rate-limiting steps for AI research and development; even in the U.S., Nvidia, the top producer of GPUs globally, continues to struggle to meet demand. A breakthrough that demonstrates that companies and research labs can maximize their computing power and cut down costs is a game-changer for the entire industry, but how exactly did DeepSeek achieve this?
Flipping The Game
Before DeepSeek's emergence in AI, it had always been a game of who was bigger. Bigger financial investments translate into bigger LLMs, which in turn require more compute resources and, hopefully, deliver bigger innovative strides.
However, DeepSeek's approach was counterintuitive. Instead of slapping on more compute and developing bigger models, the Chinese company focused on optimizing for a more efficient use of available resources. This included enhancing its model abilities through reinforcement learning, leveraging improved software architecture and optimizing its algorithm.
Rather than overwhelming prevailing challenges with sheer brute power, DeepSeek turned the game on its head. Early benchmarks showed it was 20 times more efficient and far less compute-intensive than its more prominent competitors.
Since it relied heavily on reinforcement learning, DeepSeek-R1 also reduced the need for large teams of human reviewers and supervised fine-tuning, keeping operating costs to a minimum.
Another important design choice was DeepSeek's use of a mixture-of-experts (MoE) architecture. MoE combines multiple expert sub-models and uses selective gating to activate only the most relevant parameters for each input. For context, the DeepSeek MoE framework comprises around 671 billion parameters; however, only about 37 billion of them (roughly 5.5%) are activated for any given token.
Picture a diverse team of seasoned experts across different disciplines. When needed, the gating mechanism dynamically selects the best combination of experts to solve the problem.
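For readers who want to see the idea in code, here is a minimal sketch of top-k gating in plain Python. It is a generic illustration of mixture-of-experts routing, not DeepSeek's actual implementation: the toy experts, the random gating vectors and the names `top_k_gate` and `moe_forward` are all assumptions made up for this example.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_gate(gate_scores, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    top = sorted(range(len(gate_scores)),
                 key=lambda i: gate_scores[i], reverse=True)[:k]
    weights = softmax([gate_scores[i] for i in top])
    return list(zip(top, weights))


def moe_forward(x, experts, gate_weights, k=2):
    """Route input x to k experts and blend their outputs by gate weight."""
    # One gate score per expert: dot product of x with that expert's gating vector.
    scores = [sum(w * xi for w, xi in zip(gw, x)) for gw in gate_weights]
    routed = top_k_gate(scores, k)
    out = [0.0] * len(x)
    for idx, weight in routed:
        y = experts[idx](x)  # only the selected experts are ever evaluated
        out = [o + weight * yi for o, yi in zip(out, y)]
    return out, [idx for idx, _ in routed]


def make_expert(scale):
    """Hypothetical toy expert: scales every input value by a fixed factor."""
    return lambda x: [scale * xi for xi in x]


random.seed(0)
NUM_EXPERTS, DIM = 8, 4
experts = [make_expert(s) for s in range(1, NUM_EXPERTS + 1)]
gate_weights = [[random.uniform(-1, 1) for _ in range(DIM)]
                for _ in range(NUM_EXPERTS)]

output, chosen = moe_forward([0.5, -1.0, 2.0, 0.1], experts, gate_weights, k=2)
print("experts used:", chosen)
print("output:", output)
```

With eight experts and k=2, six of the eight expert functions are never called for this input, which is where the compute saving comes from.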
The result?
Dynamic routing and allocation cut the amount of computation the model requires, because experts that are not selected for an input are never run. This approach also improves efficiency, promotes seamless scalability and supports progressive fine-tuning of different expert components for specific problems.
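A back-of-the-envelope calculation shows how much routing can save. The sketch below uses hypothetical round numbers (64 experts of 1 billion parameters each, 4 experts selected per input, and 6 billion parameters that are always active); none of these figures describes DeepSeek's real configuration, and `active_fraction` is a helper invented for this example.

```python
def active_fraction(num_experts, top_k, expert_params, shared_params):
    """Return (fraction, active, total) parameter counts for one routed input.

    shared_params covers everything that always runs (for example attention
    layers); only top_k of the num_experts expert blocks are evaluated.
    """
    total = shared_params + num_experts * expert_params
    active = shared_params + top_k * expert_params
    return active / total, active, total


# Hypothetical configuration, chosen only to make the arithmetic easy to follow.
frac, active, total = active_fraction(
    num_experts=64,
    top_k=4,
    expert_params=1_000_000_000,
    shared_params=6_000_000_000,
)
print(f"{active / 1e9:.0f}B of {total / 1e9:.0f}B parameters active ({frac:.1%})")
```

Under these assumptions, roughly one-seventh of the model's parameters do work on any single input, while the full parameter count remains available across different inputs.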
Implications For The Broader AI Industry
Compute-efficient AI solutions encourage democratization, allowing for dynamic innovations from different quarters. This could, in turn, promote cheaper access to AI resources, breaking Big Tech's monopoly on AI innovation.
DeepSeek's open-source nature provides a level playing field for researchers to engage in deep R&D without breaking the bank. Its lower energy requirements and smaller carbon footprint can also positively drive environmentally sustainable designs for data centers in the near future.
However, as revolutionary as the emergence of DeepSeek has been, there are also a few drawbacks (on top of the doubts about its claims).
First, while DeepSeek's open-source nature encourages technology sharing and participation, it also means malicious actors can repurpose it, raising fresh concerns about heightened misinformation, deepfakes and other sinister possibilities.
Another danger hinges on data sovereignty and the possibility of the Chinese government mining users' data.
Rounding Off
While DeepSeek has demonstrated capabilities that are comparable to OpenAI's ChatGPT in many ways, its long-term effect on AI technology, compute and market dynamics remains to be seen.
Whatever the future might hold, DeepSeek's successful deployment of a powerful open-source model has introduced a new level playing field for innovation in the AI industry. As this filters into the mainstream, its ripple effect could determine the face of the next iteration of artificial intelligence.
Forbes Technology Council is an invitation-only community for world-class CIOs, CTOs and technology executives.
