8–12 Jun 2026
Helsinki, Finland
Europe/Helsinki timezone

Traffic at 3am: Armed with a Toothpick against AI Scrapers

11 Jun 2026, 11:45
5m
Concert Hall

Concert Hall

Speaker

Cheryl Andrea Fernando (SURF)

Description

Observing random spikes in website traffic at odd hours? If you go to investigate what’s causing it, it’s just an endless amount of requests from a battalion of AI scrapers visiting your website for content to train their machine learning models on. It’s like the plague, they hunt you for your content and you’re out there battling it with a toothpick. In a more technical view, your toothpick is the robots.txt, a file that politely indicates to bots to not scrape your website content. But it’s not like most AI scrapers care (only some good ones do), it was just a suggestion anyway. As it is a growing concern where content and website owners are tired of having their content scraped without their approval and dealing with the overhead costs that AI scrapers bring, it’s good to know what possible mitigation strategies are available. Do we play it safe, monetize, or obliterate these scrapers? How do we deal with stealthy destructive scrapers that use residential proxies? Is it worth trying such tactics or is the soup of training data for ML models poisoning itself? This talk answers these questions and many more, as well as analyzes the future scope of tackling these AI scrapers.

What will the TNC audience take away from your talk?

Understanding the destructive nature of AI scrapers and its stealth with residential proxies, the pros and cons of the different mitigation tactics available from Big Tech and open source projects, future scope and addressing the complex problem of either fully blocking or allowing AI scrapers.

Are you a first time speaker at TNC? Yes

Primary author

Presentation materials