Snorkel AI Raises $100 Million To Build Better Evaluators For AI Models

Forbes, 29-05-2025
Snorkel AI CEO Alex Ratner said his company is placing more emphasis on helping subject matter experts build datasets and models for evaluating AI systems.
Alex Ratner, CEO of Snorkel AI, remembers a time when data labeling, the grueling task of adding context to swathes of raw data and grading an AI model's responses, was considered 'janitorial' work among AI researchers. But that quickly changed when ChatGPT stunned the world in 2022 and breathed new life (and billions of dollars) into a string of startups rushing to supply human-labeled data to the likes of OpenAI and Anthropic to train capable models.
Now, the crowded field of data labeling appears to be undergoing another shift. Fewer companies are training large language models from scratch, leaving that task to the tech giants. Instead, they are fine-tuning models and building applications in areas like software development, healthcare and finance, creating demand for specialized data. AI chatbots no longer just write essays and haikus; they're being tasked with high-stakes jobs like helping physicians make diagnoses or screening loan applications, and they're making more mistakes. Assessing a model's performance has become crucial for businesses to trust and ultimately adopt AI, Ratner said. 'Evaluation has become the new entry point,' he told Forbes.
That urgency for measuring AI's abilities across very specific use cases has sparked a new direction for Snorkel AI, which is shifting gears to help enterprises create evaluation systems and datasets to test their AI models and adjust them accordingly. Data scientists and subject matter experts within an enterprise use Snorkel's software to curate and generate thousands of prompt and response pairs as examples of what a correct answer looks like to a query. The AI model is then evaluated according to that dataset, and trained on it to improve overall quality.
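The evaluation step described above can be pictured with a small Python sketch. It is illustrative only: the article does not say how Snorkel scores models, so the exact-match scoring, the sample question-and-answer pairs, and the names `evaluate`, `normalize` and `toy_model` are all assumptions made for this example.

```python
# Illustrative sketch only: scoring a model against an expert-curated
# set of prompt/response pairs. Exact-match scoring and every name
# below are assumptions, not Snorkel's actual method.

def normalize(text):
    """Lowercase and collapse whitespace so trivial differences don't count."""
    return " ".join(text.lower().split())


def evaluate(model, gold_pairs):
    """Return the fraction of prompts the model answers as the experts did."""
    if not gold_pairs:
        raise ValueError("gold_pairs must not be empty")
    correct = sum(
        normalize(model(prompt)) == normalize(expected)
        for prompt, expected in gold_pairs
    )
    return correct / len(gold_pairs)


# Hypothetical expert-written examples of what a correct answer looks like.
GOLD_PAIRS = [
    ("When is my bill due?", "On the 1st of each month"),
    ("Is there a late fee?", "Yes"),
    ("Can I pay by card?", "Yes"),
    ("Do you offer paper bills?", "No"),
]


def toy_model(prompt):
    """Stand-in for a real chatbot: answers 'Yes' to everything."""
    return "yes"


accuracy = evaluate(toy_model, GOLD_PAIRS)
```

Here the stand-in model matches the expert answer on two of the four prompts, so `accuracy` is 0.5. Real evaluations of free-form answers typically need looser scoring than exact match, such as expert rubrics or a grading model.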
The company has now raised $100 million in a Series D funding round led by New York-based VC firm Addition at a $1.3 billion valuation, a 30% increase from its $1 billion valuation in 2021. The relatively small change in valuation could be a sign that the company hasn't grown as investors expected, but Ratner said it's a result of a 'healthy correction in the broader market.' Snorkel AI declined to disclose revenue.
Customer support experts at a large telecommunications company have used Snorkel AI to evaluate and fine-tune its chatbot to answer billing-related questions and schedule appointments, Ratner told Forbes. Loan officers at one of the top three U.S. banks have used Snorkel to train an AI system that mined databases to answer questions about large institutional customers, improving its accuracy from 25% to 93%, Ratner said. For nascent AI startup Rox, which didn't have the manpower or time to evaluate its AI system for salespeople, Snorkel helped improve accuracy by between 10% and 12%, Rox cofounder Sriram Sridharan told Forbes.
It's a new focus for the once-buzzy company, which spun out of the Stanford Artificial Intelligence Lab in 2019 with a product that helped experts classify thousands of images and text. But since the launch of ChatGPT in 2022, the startup has been largely overshadowed by bigger rivals as more companies flooded the data labeling space. Scale AI, which also offers data labeling and evaluation services, is reportedly in talks to finalize a share sale at a $25 billion valuation, up from its $13.8 billion valuation a year ago. Other competitors include Turing, which doubled its valuation to $2.2 billion from 2021, and Invisible Technologies, which booked $134 million in 2024 revenue without raising much from VCs at all.
Snorkel has faced macro challenges too: As AI models like those powering ChatGPT got better, they could label data on a massive scale for free, shrinking the size of the market further. Ratner acknowledged that Snorkel saw a brief period of slow growth right after OpenAI launched ChatGPT and said enterprises had paused pilots with some vendors to consider using AI models for labeling directly. But he said Snorkel's business bounced back in 2023 and has grown since.
Ratner said Snorkel's differentiator is its emphasis on bringing in subject matter experts, either its own or those within a company, and using a proprietary method called 'programmatic labeling' to automatically assign labels to massive troves of data through simple keywords or bits of code rather than by hand. The aim is to help time-crunched experts like doctors and lawyers label data faster and more economically.
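To make the idea of labeling through keywords or bits of code concrete, here is a minimal Python sketch. Snorkel's method is proprietary and the article does not describe its internals, so this shows only the general pattern under stated assumptions: the keyword rules, the label names and the simple majority vote are all hypothetical choices for illustration.

```python
# Illustrative sketch only: keyword rules written by an expert vote on a
# label for each piece of text. The rules, label names and majority vote
# are assumptions, not Snorkel's proprietary implementation.
from collections import Counter

ABSTAIN = None  # a rule returns this when it has no opinion


def rule_refund(text):
    return "billing" if "refund" in text.lower() else ABSTAIN


def rule_invoice(text):
    return "billing" if "invoice" in text.lower() else ABSTAIN


def rule_appointment(text):
    return "scheduling" if "appointment" in text.lower() else ABSTAIN


RULES = [rule_refund, rule_invoice, rule_appointment]


def programmatic_label(text):
    """Apply every rule and return the most common non-abstaining vote."""
    votes = [rule(text) for rule in RULES]
    votes = [vote for vote in votes if vote is not ABSTAIN]
    if not votes:
        return ABSTAIN  # no rule fired; leave for a human to review
    # On a tie, Counter keeps the label that was voted for first.
    return Counter(votes).most_common(1)[0][0]


messages = [
    "I need a refund for my last invoice",
    "Can I book an appointment for Tuesday?",
    "Hello there",
]
labels = [programmatic_label(message) for message in messages]
```

With these three rules, `labels` comes out as `["billing", "scheduling", None]`. An expert writes a handful of rules once and the code applies them to every record, which is where the time savings over manual labeling would come from.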
As it leans into evaluation, which also requires data generation, Snorkel has started hiring tens of thousands of skilled contractors like STEM professors, lawyers, accountants and fiction writers to create specialized datasets for multiple AI developers, who then use the datasets to evaluate their models (Ratner declined to say which frontier AI labs Snorkel works with). They can also use this data to add new functionality to their chatbots, like the ability to break down and 'reason' about a difficult query or conduct in-depth research on a topic, Ratner said.
But even when it comes to building specialized evaluations, Snorkel faces fierce competition, both new and old. The top AI companies have released a number of public benchmarks and open source datasets to evaluate their models. LMArena, a popular leaderboard for evaluating AI model performance, recently spun out as a new company and raised $100 million in seed funding from top investors at a hefty $600 million valuation, according to Bloomberg. Plus, companies like Scale, Turing and Invisible all offer evaluation services. But Ratner said that unlike its rivals, Snorkel was built around human experts right from the start.
Saam Motamedi, a partner at Greylock who participated in the round, said these new specialized dataset services are a fast-growing part of Snorkel's business as the industry shifts to what's called 'post training' — the process of tweaking the model's performance for certain applications. AI has already soaked up most of the internet data, making datasets custom-made by domain experts even more valuable. 'I think that market tailwind has proven to be a really good one for Snorkel,' he said.