New: a project analyzing human language usage by scraping the web is shutting down because "generative AI has polluted the data." It's going to become much harder to analyze human use of language with the rise of AI-generated stuff being everywhere


404 Media
Project Analyzing Human Language Usage Shuts Down Because ‘Generative AI Has Polluted the Data’
Wordfreq shuts down because "I don’t think anyone has reliable information about post-2021 language usage by humans.”
