DeepSeek's distilled new R1 AI model can run on a single GPU

29-05-2025

DeepSeek's updated R1 reasoning AI model might be getting the bulk of the AI community's attention this week. But the Chinese AI lab also released a smaller, 'distilled' version of its new R1, DeepSeek-R1-0528-Qwen3-8B, that DeepSeek claims beats comparably-sized models on certain benchmarks.
The smaller updated R1, which was built using the Qwen3-8B model Alibaba launched in May as a foundation, performs better than Google's Gemini 2.5 Flash on AIME 2025, a collection of challenging math questions.
DeepSeek-R1-0528-Qwen3-8B also nearly matches Microsoft's recently released Phi 4 reasoning plus model on another math skills test, HMMT.
So-called distilled models like DeepSeek-R1-0528-Qwen3-8B are generally less capable than their full-sized counterparts. On the plus side, they're far less computationally demanding. According to the cloud platform NodeShift, Qwen3-8B requires a GPU with 40GB-80GB of RAM to run (e.g., an Nvidia H100). The full-sized new R1 needs around a dozen 80GB GPUs.
DeepSeek trained DeepSeek-R1-0528-Qwen3-8B by taking text generated by the updated R1 and using it to fine-tune Qwen3-8B. In a dedicated webpage for the model on the AI dev platform Hugging Face, DeepSeek describes DeepSeek-R1-0528-Qwen3-8B as 'for both academic research on reasoning models and industrial development focused on small-scale models.'
DeepSeek-R1-0528-Qwen3-8B is available under a permissive MIT license, meaning it can be used commercially without restriction. Several hosts, including LM Studio, already offer the model through an API.

Hashtags

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Turkey unveils domestically built hypersonic missile

Yahoo

22 minutes ago

Yahoo

Turkey unveils domestically built hypersonic missile

STORY: :: Turkey unveiled its first domestically built hypersonic missile at an international arms fair :: July 22, 2025 :: Istanbul, Turkey :: The missiles typically launch a warhead that travels at five times the speed of sound at low altitudes Footage released by the Turkish missile manufacturer Roketsan showed the new weapon, known as the Tayfun Block-4, being showcased at the International Defence Industry Fair (IDEF). Hypersonic missiles typically launch a warhead that travels at more than five times the speed of sound or about 3,850 mph, often maneuvering at relatively low altitudes. The United States, China, Russia and other countries have also been developing hypersonic weapons in recent years.

Want to Rank in AI Search? Focus on These Sources

Entrepreneur

24 minutes ago

Entrepreneur

Want to Rank in AI Search? Focus on These Sources

As AI platforms like ChatGPT and Perplexity reshape how users discover information, brands must shift from traditional SEO to strategic AI citation optimization to remain visible. Opinions expressed by Entrepreneur contributors are their own. The way brands earn visibility through search is becoming unpredictable. As conversational AI platforms like ChatGPT, Perplexity, Gemini and Claude become primary entry points to information, it's clear they don't draw from the same sources or deem authority in the same way. What one system cites, another may ignore. For enterprise brands, this fragmentation means search optimization must become multidimensional. It is no longer sufficient to rank well on Google — it is essential to be cited across the various AI engines that users increasingly consult. Brands that align with each model's source preferences thrive; those that aren't cited disappear from view. Reddit is now a key citation engine Since Reddit began licensing its data to OpenAI and Google, it has quickly become a rich source for LLMs. Reddit's licensing revenue surged from $12.3 million to $81.6 million in less than a year as AI firms tapped its massive, topic-organized archives. LinkedIn data shows Reddit citations in ChatGPT increased by 436%, making it the platform's second most-used citation source behind Wikipedia at about 5.9% overall. Related: Everything You Need to Know About Reddit for Businesses in 2025 Big media brands are essential for trust signals Visibility in AI tools like ChatGPT depends on how well brands "interface with the minds of AI agents." High-profile coverage in outlets like The Wall Street Journal and The New York Times strengthens credibility signals that LLMs rely on. AI search reshapes referral traffic A TechCrunch report confirms that many websites saw organic traffic decline in 2024 due to AI-generated search results that deliver answers directly instead of driving clicks. Meanwhile, surveys show AI search referrals to US retail sites surged 1,300% during the holiday season, with users engaging more deeply with the content. Distinct citation patterns across engines data, cited by Search Engine Journal, indicates ChatGPT provides about 2.6 citations per response, Gemini about 6.1 and Perplexity around 6.6. A recent arXiv study confirms that different LLMs show varying preferences — OpenAI models cite Reuters and AP News most, while Perplexity often cites BBC. Related: Perplexity CEO Says AI Coding Tools Cut Work Time From 'Four Days to Literally One Hour' What this means for established brands Treat Reddit and niche forums as owned media: Community content such as how-to posts and genuine use-case anecdotes now surface in AI-generated responses. Reddit's structured, user-generated content is officially part of OpenAI's source pool. Brands should actively engage in these spaces to seed real-world case studies. Earn coverage in top-tier media: AI systems rely on signals of reliability that come from recognized media outlets. If you aren't mentioned in leading news outlets, you risk being invisible to AI responses. Optimize for conversational formats and structured data: Brands need to produce short, answer-ready content. SEO experts report that adopting schema markup (especially FAQ and JSON-LD) helps LLMs recognize and extract content, with a clear impact on citation frequency. Monitor citations across AI platforms: Traffic metrics are no longer sufficient to gauge visibility. Brands should track where they are being mentioned, referenced, or recommended in ChatGPT, Perplexity, Gemini, Claude and Google's "AI Overviews." Why brands must adapt now First impressions are now made by AI: Users increasingly turning to AI for answers, forming opinions based on what the AI cites, even before visiting a website. If your brand isn't cited, it may as well not exist in the moment that matters. AI visibility requires strategic alignment: This is not just marketing or PR. It's PR, content and community working in sync to influence AI citation outcomes. It demands an integrated strategy that prioritizes narrative framing, thought leadership in respected outlets, structured content and direct participation in forums. Quality matters more than volume: A deep, authoritative case study in a niche media outlet can carry more weight in citation algorithms than hundreds of shallow blog posts. Excelling in depth and reputation matters more than churning regardless of quality. Visibility is existential: AI tools are redefining the digital shelf. Unlike traditional paid ads or search rankings, citation in a conversational AI answer propels your brand into the user's decision frame. Ignored by AI, a brand risks fading into irrelevance. How to act now Audit your presence: Ask major AI platforms: "What are the top [product/service category] brands?" If you're missing, reassess your representation strategy. Ask major AI platforms: "What are the top [product/service category] brands?" If you're missing, reassess your representation strategy. Secure mentions in top media: Pursue place-based thought leadership in outlets like WSJ, NYT, FT, Reuters, Bloomberg and Washington Post. AI engines trust these sources more than niche blogs. Pursue place-based thought leadership in outlets like WSJ, NYT, FT, Reuters, Bloomberg and Washington Post. AI engines trust these sources more than niche blogs. Publish structured, AI-ready content: Create concise explainers, FAQs, comparison guides under 200 words, tagged with appropriate schema. Make your content easy for machines to parse and quote. Create concise explainers, FAQs, comparison guides under 200 words, tagged with appropriate schema. Make your content easy for machines to parse and quote. Engage community platforms authentically: Contribute real-world expertise in subreddits and specialized forums. Guide conversations — don't spam, and align posts with user intent and brand messaging. Contribute real-world expertise in subreddits and specialized forums. Guide conversations — don't spam, and align posts with user intent and brand messaging. Implement AI visibility monitoring: Use tools to track mentions, tone and volume of citations. Adjust content and engagement strategies based on what resonates — and be ready to pivot. Use tools to track mentions, tone and volume of citations. Adjust content and engagement strategies based on what resonates — and be ready to pivot. Measure sentiment directionally: Monitor tone in AI-generated mentions. Positive framing earns citations more consistently than neutral or negative narratives. Brands that act now to optimize their visibility in LLM ecosystems will control the narrative and establish authority before competitors. Those that don't adapt risk fading into silence — irrelevant at the very moment when AI serves as the first touchpoint with users.

Geek Wire

24 minutes ago

Geek Wire

From grief to innovation: Seattle tech vets building personal AI tool with persistent memory and privacy

GeekWire's startup coverage documents the Pacific Northwest entrepreneurial scene. Sign up for our weekly startup newsletter , and check out the GeekWire funding tracker and venture capital directory . Seattle tech and business leader Mary Jesse, CEO of ACME Brains, a new startup developing a personal context engine for AI systems. Mary Jesse couldn't sleep. Grieving after her husband's unexpected death from late-stage cancer, the longtime Seattle business and engineering leader typed three words into ChatGPT: 'I am sad.' The AI's surprisingly compassionate response helped her through that difficult moment, and others that followed, validating her experience and reassuring her that she could get through it. 'It was just really simple,' she recalled. 'But it was so helpful.' It also revealed the popular chatbot's limitations. Jesse found that ChatGPT couldn't easily resurface the context of their past conversations. She worried about the privacy implications, as well. Jesse said she wouldn't normally share such a personal story publicly, but the experience was the basis for what would become her next venture. She and two other tech industry veterans, Alan Caplan and Bob Bergstrom, this week unveiled their new Seattle-based startup, called ACME Brains, which is building what they call a 'personal context engine.' They say the patent-pending AI system will remember key details over time, and give users control over their data. The first product to use the system, currently under development by the startup, is called nexie. It's a personal AI assistant designed to seamlessly resurface information from past conversations, without requiring users to manually search through threads or craft elaborate prompts to maintain continuity over time. Nexie is currently in early development with a working prototype. The company plans to start alpha testing soon, followed by a beta program focused on gathering user feedback prior to a future public launch. The three co-founders bring a broad range of experience to the startup: Jesse, CEO, began her career in the wireless industry, in engineering and leadership roles at McCaw Cellular and AT&T Wireless before co-founding the mobile infrastructure company RadioFrame Networks. She has led and advised early-stage startups and was CEO of MTI, a global provider of smart locks and security systems. Bob Bergstrom, chief scientist. Bergstrom, chief scientist, has worked as both a software engineer and patent attorney for more than four decades. Earlier in his career, he conducted scientific research in x-ray crystallography, and he has since focused on intellectual property strategy and software development. Caplan, COO, was Amazon's first general counsel, starting in its early days after working with Jesse at McCaw Cellular. He led several business units at Amazon, including Kitchen, Payments, and Corporate Development, and went on to hold senior leadership roles at Blue Origin and Vulcan. Alan Caplan, COO. Jesse envisions nexie as everything from a digital journal to a travel companion or even a lightweight system for tracking personal contacts and relationships, depending on a user's needs. The subscription-based service will be available in free and premium versions. It won't rely on advertising or data monetization, a deliberate departure from many consumer tech platforms. While companies like OpenAI, Anthropic, and Google are all investing heavily in AI memory features, Jesse said ACME Brains is taking a different approach. Rather than embedding memory within a large language model, its architecture keeps the user's data separate and under their control — seeking to be more efficient and secure. Jesse sees nexie not as a competitor to the big AI platforms and existing LLMs, but as a tool that can enhance them — making their output more useful and meaningful for personal use. Over time, she believes the underlying system ACME Brains is developing could serve as a kind of 'personal credential,' carrying private, user-controlled data and context across AI apps and platforms. The Seattle-based startup has been bootstrapped by its founders so far, with about 11 people working across technology development, marketing, and operations, primarily in a virtual capacity. Jesse said ACME Brains expects a public launch of nexie by late 2025 or early 2026, with sign-ups for future beta testing now available at

DeepSeek's distilled new R1 AI model can run on a single GPU

Hashtags

Try Our AI Features

Comments

Related Articles

Turkey unveils domestically built hypersonic missile

Want to Rank in AI Search? Focus on These Sources

From grief to innovation: Seattle tech vets building personal AI tool with persistent memory and privacy

Get Started Now: Download the App