Nvidia accused of trying to cut a deal with Anna’s Archive for high‑speed access to the massive pirated book haul — allegedly chased stolen data to fuel its LLM

Nvidia accused of trying to cut a deal with Anna’s Archive for high‑speed access to the massive pirated book haul — allegedly chased stolen data to fuel its LLM

Court documents appear to show Nvidia management green lit the deal, despite Anna’s Archive’s warnings.

When you purchase through links on our site, we may earn an affiliate commission. Here’s how it works .

(Image credit: Nvidia) Share Share by: Copy link Facebook X Whatsapp Reddit Pinterest Flipboard Email Share this article 2 Join the conversation Follow us Add us as a preferred source on Google Nvidia has been accused of offering to pay for ‘high-speed access’ to Anna’s Archive , a notorious ‘shadow library’ portal, bursting with copyright-infringing materials. Documents published by TorrentFreak appear to show the Nvidia Data Strategy Team reaching out regarding payments for ‘high-speed access’ to Anna’s Archive. Moreover, if the documents are genuine, they indicate that green team management approved the payment plan “within a week.”

Nvidia, like other AI industry giants, is very interested in gaining access to the largest sources of human knowledge to improve LLM training quality. The likes of Meta and Anthropic have previously been found with their fingers all over pirated content. These super-wealthy firms jealously guard their own technologies, so evidence that they seem to have little or no regard for the intellectual property of others would be a source of irony.

TorrentFreak notes that the email snippets it has shared have been precipitated during the discovery phase of an ongoing class action lawsuit where Nvidia is accused of copyright infringement by training its models on content from the Books3 dataset, including copyrighted works taken from pirate site Bibliotik.

Nvidia decries 'far-fetched' reports of smuggling in face of DeepSeek training reports

Chinese AI startup gets access to 2,300 banned Blackwell GPUs by exploiting cloud loophole

Former Chinese gaming company with China govt ties accused of smuggling banned AI GPUs

In that case, Nvidia is defending its actions under ‘fair use,’ but the new evidence showing Anna’s Archive correspondence looks compelling. In fact, the authors behind the Books3 class action have filed an amended complaint significantly expanding the scope of the lawsuit, says TorrentFreak.

One of the most damning pieces of correspondence between Nvidia reps and Anna’s Archive is shown above. The snippet appears to show an unnamed Nvidia exec inquiring about the use of Anna’s Archive for LLM training.

Probably worse, though, is the section of the new court filing which alleges that “Within a week of contacting Anna’s Archive, and days after being warned by Anna’s Archive of the illegal nature of their collections, Nvidia management gave ‘the green light’ to proceed with the piracy.”

The proposed deal would mean providing Nvidia with high-speed access to ~500TB of data for LLM training. We don’t see evidence that the deal actually went through, or that any payments went to Anna’s Archive.

Get Tom's Hardware's best news and in-depth reviews, straight to your inbox.

Key considerations

  • Investor positioning can change fast
  • Volatility remains possible near catalysts
  • Macro rates and liquidity can dominate flows

Reference reading

More on this site

Informational only. No financial advice. Do your own research.

Leave a Comment