โก๏ธ๐ซ๐ท NEW - Mistral AI accused of stealing 70 TB of protected books to feed its AI.
According to these court documents, Guillaume Lample allegedly orchestrated the downloading of approximately 70 TB of data from Library Genesis, a pirate platform listing millions of copyrighted books and scientific articles.
In the fall of 2022, Meta's research teams were urgently trying to catch up with OpenAI and ChatGPT. According to internal exchanges revealed by Mediapart, when a researcher objected to these methods, stating, "I don't think we should use pirated works, it's a red line," Guillaume Lample reportedly replied: "Everyone uses LibGen. That's what OpenAI does with GPT3, what Google does with Palm, what DeepMind does with Chinchilla. So we're going to do it too."
