logo
An AI data trap catches Perplexity impersonating Google

An AI data trap catches Perplexity impersonating Google

Business Insider9 hours ago
If you want to succeed in AI, a good hack would be to impersonate Google. You just can't get caught.
This is what just happened to Perplexity, a startup that competes with ChatGPT, Google's Gemini, and other generative AI services.
Quality data is crucial for success in AI, but tech companies don't want to pay for this, so they crawl the web and scrape information for free, often without permission. This has sparked a backlash by some content creators and others interested in preserving the incentives that built the web.
Cloudflare and its CEO, Matthew Prince, have stormed into this battle with new features that help websites block unwanted AI bot crawlers. Cloudflare is an infrastructure, security, and software company that helps run about 20% of the internet. It thrives when the web does well, hence its interest in helping sites get paid for content.
Some Cloudflare customers recently complained to the company that Perplexity was evading these blocks and continued to scrape and collect data without permission.
So, CloudFlare set a digital trap and caught this startup red-handed, according to a Monday blog describing the escapade.
"Some supposedly 'reputable' AI companies act more like North Korean hackers," Prince wrote on X on Monday. "Time to name, shame, and hard block them."
Perplexity didn't respond to a request for comment.
The bait: Honeytrap domains and locked doors
Cloudflare created entirely new, unpublished websites and configured them with robots.txt files that explicitly blocked all crawlers — including Perplexity's declared bots, PerplexityBot and Perplexity-User. These test sites had no public links, search engine entries, or metadata that would normally make them discoverable.
Yet, when Cloudflare queried Perplexity's AI with questions about these specific sites, the startup's service responded with detailed information that could only have come from those restricted pages. The conclusion? Perplexity had accessed the content despite being clearly told not to.
The cloak: How Perplexity masked its crawl
Perplexity initially crawled these sites using its official user-agent string, complying with standard protocols. However, Cloudflare said it discovered that once blocked, Perplexity resorted to stealth tactics.
Cloudflare found that Perplexity began deploying undeclared crawlers disguised as normal web browsers and sending requests from unknown or rotated IP addresses and unofficial ASNs, [what is ASN? write out on first ref?] which are crucial identifiers that help route internet traffic efficiently.
When its official crawlers were blocked, Perplexity also used a generic web browser designed to impersonate Google's Chrome browser on Apple Mac computers. (Business Insider asked Google whether it has told Perplexity to stop impersonating Chrome. Google did not respond).
According to Cloudflare, Perplexity has been making millions of such "stealth" requests daily across tens of thousands of web domains.
This behavior not only violated web standards, but also betrays the fundamental trust that underpins the functioning of the open web, Cloudflare explained.
The comparison: How OpenAI gets it right
To emphasize what good bot behavior looks like, Cloudflare compared Perplexity's conduct to that of OpenAI's crawlers, which scrape data for developing ChatGPT and giant AI models such as the upcoming GPT-5.
When OpenAI's bots encountered a robots.txt file or a similar block, they simply backed off. No circumvention. No masking. No backdoor crawling, according to Cloudflare tests.
The Fallout: De-verification and blocking
As a result of these findings, Cloudflare has de-listed Perplexity as a verified bot and rolled out new detection and blocking techniques across its network.
Cloudflare's takedown serves as a cautionary tale in the AI arms race. While the web shifts toward stronger control over data access and usage, actors who flout these evolving norms may find themselves not just blocked, but publicly called out.
In an era where AI systems are hungry for training data, Cloudflare's sting operation is a signal to startups and established players alike: Respect the rules of the web, or risk being exposed.
Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

Whatever you do, don't buy a Google Pixel phone right now
Whatever you do, don't buy a Google Pixel phone right now

Android Authority

time43 minutes ago

  • Android Authority

Whatever you do, don't buy a Google Pixel phone right now

C. Scott Brown / Android Authority Google's Pixel phones are often among the most highly-regarded in the Android world. The most recent Pixel 9 series is no exception to that rule, and, if anything, it's the best example of it yet. Whether we're talking about the baseline Pixel 9 or any of the flagship Pixel 9 Pro models, the current slate of Google Pixel phones is mighty impressive. That all said, if you're in the market for a new Android phone right now, you absolutely should not buy a Google Pixel — and there's a very good reason for that. Buy a Pixel 9 now or wait for the Pixel 10? 0 votes Buy a Pixel 9, the Pixel 10 rumors don't look good. NaN % Wait for the Pixel 10! It's almost here! NaN % Why buying a Google Pixel is a bad idea right now Ryan Haines / Android Authority If you closely follow the Pixel world, you may already know the answer to this one. Even so, it's an answer worth repeating. Last month, Google announced it would be holding a Made by Google event on Wednesday, August 20. Made by Google events are where Google traditionally announces its latest slate of Pixel hardware, and this year, that'll be the Google Pixel 10 series — including the Pixel 10, Pixel 10 Pro, Pixel 10 Pro XL, and Pixel 10 Pro Fold. In other words, replacements for all of the mainline Pixel 9 handsets available today. This means we're now in that awkward time where a company's current generation of phones is still available, but the new models are right around the corner. Sometimes, if we're not expecting a significant upgrade for the new phones, it can make sense to still buy the current generation. However, based on what we know about the Pixel 10 series so far, purchasing a Pixel 9 before the Pixel 10gets here would be a huge mistake. There are a few reasons for this, the most significant of which is Google's new Tensor G5 chip. Robert Triggs / Android Authority All past and present Pixel phones — including the Pixel 9 series — have been held back to some degree by Google's Tensor chips. Between lacking horsepower, poor battery life, and disappointing thermal management, Tensor chips have never performed on the same level as their Qualcomm Snapdragon counterparts. Word on the street is that the Tensor G5 inside all of Google's Pixel 10 models will be the first 3nm Tensor chip and the first Tensor chip manufactured by TSMC rather than Samsung Foundry. If both of these points are true, the Tensor G5 could be dramatically more powerful and efficient than any Tensor chip that has come before it — potentially resolving the performance and efficiency gap that Pixels have had since the Pixel 6 and its Tensor G1 silicon. Another major upgrade expected for all Pixel 10 models is Qi2 magnetic charging. For the first time in a major Android phone, every Pixel 10 will reportedly have magnets built into its backside, allowing you to use magnetic chargers and other accessories without requiring a magnetic case (just like Apple has offered for years with MagSafe on the iPhone). Purchasing a Pixel 9 before the Pixel 10 gets here would be a huge mistake. This feature alone may have convinced me to buy a Pixel 10 when it goes on sale, and I have a feeling it's something a lot of people will have a hard time living without once they try it. Between magnetic charging stands, wallets, car mounts, and more, the convenience of being able to use all of them without needing a specific case is incredible — and it'll give the Pixel 10 a unique capability no other major Android phone currently offers. And there's more, too. For the baseline Pixel 10, specifically, we can likely expect a significant camera upgrade in the form of a new 5x telephoto camera — something the Pixel 9 lacks entirely. We should also see battery and charging upgrades for all Pixel 10 models. The Pixel 10 Pro Fold is rumored to be the first foldable with an IP68 rating, and Google's new Magic Cue feature should lend the Pixel 10 handsets some AI magic. Do yourself a favor and wait for the Pixel 10 series At the time of publication, we're just a little over two weeks away from Google's Pixel 10 event. That means just two more weeks to wait before a new round of Pixel phones with a dramatically improved chipset, a game-changing charging/accessory system, bigger batteries, faster charging, upgraded cameras, and more. Although the Pixel 10 series will physically resemble its Pixel 9 predecessors, the internal changes we're expecting are nothing short of significant. You could buy a Pixel 9, but doing so would mean missing out on everything mentioned above when the Pixel 10 family arrives on August 20. As someone who's been reviewing and writing about phones for over a decade — and someone who's generally just a fan of Pixel phones — I'd strongly recommend holding off on buying a new Pixel phone until the Pixel 10 lineup is available. Robert Triggs / Android Authority On the one hand, if you wait to buy any of the Pixel 10 models, you'll be getting a phone that's better than its respective Pixel 9 predecessor in a multitude of ways. Plus, with rumors suggesting no major price increases, you'll pay the same amount that you would for a Pixel 9 today. On top of all that, even if you still want to buy a Pixel 9 once the Pixel 10 is revealed, you'll almost certainly be able to find last year's Pixels substantially discounted once the new models are here. Looked at this way, there's no tangible benefit of buying a Google Pixel right now. If anything, it puts you at a disadvantage. Buying a Google Pixel generally isn't a bad idea, but at this moment in time, it is — at least until August 20 rolls around. Follow

Gemini is getting ready to work with our favorite Google AI tool (APK teardown)
Gemini is getting ready to work with our favorite Google AI tool (APK teardown)

Android Authority

time43 minutes ago

  • Android Authority

Gemini is getting ready to work with our favorite Google AI tool (APK teardown)

Andy Walker / Android Authority TL;DR NotebookLM asks users to upload collections of documents which are organized into notebooks. Right now, NotebookLM draws only from these notebooks when answering your questions. Gemini support for notebook uploads may let users combine those resources with the wider internet. NotebookLM is arguably one of Google's most useful AI tools, and has helped introduce us to impressive features like Audio Overviews. Perhaps its greatest strength stems from how focused it is, pulling information from the resources you specifically provide it with, rather than broadly crawling the internet as a whole. And while we're glad that it doesn't look like NotebookLM is about to seriously change anything there, we have spotted one new way that might let us expand the horizons of our notebooks — just a little. ⚠️ An APK teardown helps predict features that may arrive on a service in the future based on work-in-progress code. However, it is possible that such predicted features may not make it to a public release. As Google continues to feel out its approach to AI-fueled tools and services, one recurring theme we've seen involves a lot of after-the-fact overlap: Google likes to port many of its best features across to its other solutions. Audio Overviews are a prime example of that, and after debuting with NotebookLM, they've expanded to stuff like the Gemini app itself. Today, we're flipping that expansion on its head a bit as we spot the Gemini app for Android getting ready to import NotebookLM notebooks. We're spotting developers' early work in this direction with the version beta build of the Google Android app, and while you won't see any of this popping up just yet, we've coaxed out a preview of how parts of the interface could appear. So far, we're able to see the option to import a notebook, but that's about as much as we've gotten out of it at the moment — the notebooks we've already created with NotebookLM don't appear to populate the list where they should. Why would you want to do this? While you're currently able to use NotebookLM's 'chat' option to ask questions about your data, that's limited to the information provided within. We could see this Gemini feature emerging as a way to go a little bit beyond that, starting with the notebook you upload but staying open to the possibility of getting answers from external sources. Even if that's not ultimately going to be the case, this could still be a benefit for heavy Gemini users, letting them stay right in their preferred app while tapping in to NotebookLM-style answers. Right now, it's hard to say for sure what Google's plans are, but hopefully we'll be able to check this out in a more functional form soon and start getting to the bottom of it. Follow

Pixel 10 series users could get free access to Google's smartest features (APK teardown)
Pixel 10 series users could get free access to Google's smartest features (APK teardown)

Android Authority

timean hour ago

  • Android Authority

Pixel 10 series users could get free access to Google's smartest features (APK teardown)

TL;DR Code within the latest version of the Google app suggests the upcoming Pixel 10 series could come with a free trial of Google AI Pro. The length of the trial is still unconfirmed, but based on past promotions, it could be between six and twelve months. Google is set to announce the Pixel 10 series flagships in the coming weeks. While the phones are well-rounded in specs, Google's trump card is the software experience, with the company going all in on providing meaningful AI features. Many of those features are expected to be on-device, but a few may require cloud access and potentially even a Google AI Pro plan. Thankfully, it seems that Google could bless Pixel 10 series buyers with a free trial of Google AI Pro, letting them enjoy all of the company's AI features. ⚠️ An APK teardown helps predict features that may arrive on a service in the future based on work-in-progress code. However, it is possible that such predicted features may not make it to a public release. Within Google app v16.30.59, we spotted code suggesting Pixel 10 series users would be getting a trial of Google AI Pro plan with their purchase, giving them access to all the good Gemini perks they would need on their phone. AssembleDebug / Android Authority This list of devices dictates which devices are eligible for free Google AI offers. We spotted the Galaxy Z Fold 7 and Flip 7 as new additions to this list recently, and Samsung eventually went on to announce that Fold 7, Flip 7, and Flip 7 FE buyers can get six months of Google AI Pro with 2TB of cloud storage for free with their phone purchase. It's unclear what the Pixel 10 series devices will offer in terms of trial tier and duration. Based on past trends, we speculate that Pixel 10 series users could get between six and twelve months of Google AI Pro with their phone purchase. Note that the Pixel 9 Pro phones came with one year of free Google AI Pro, while the base Pixel 9 came with a six-month trial, so Google could opt for a similar split again. We hope to learn more when the Pixel 10 series flagships launch in the coming weeks. Follow

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into a world of global content with local flavor? Download Daily8 app today from your preferred app store and start exploring.
app-storeplay-store