Lydie.cc - Data Preservation



Never trust a computer that you do not own
. The "cloud" is a network of computers you don't own:
  • Data in the cloud is being used to train Al.
  • Hackers can get it.
  • The corporation running it can wipe whatever they want.
  • Fascists are actively trying to ban commercial servers and services.
  • Media is disappearing, like on YouTube, streaming services and the US government.
Download everything. Buy hard drives and hoard as you go. Mirror that data once or twice. Store a copy powered off and unplugged. Burn the super important stuff to archival grade M-DISCs. Run AirSonic or something open source to permit remote access of your media.  And if you have some terabytes to spare, download and seed some torrents of public government information that has since been scrubbed from official websites, links below.



Fascists are deleting public data - help me preserve it!


We're facing an unprecedented challenge: important public information is being systematically removed from government websites by the US fascist administration. A community of digital archivists, myself among them, is racing against time to download and share this data through torrents, preventing its permanent loss.

Much of this essential data is sourced from sites like https://sciop.net/datasets/.

I've been contributing by running a torrent server, but its existing tired hard drives are dropping like flies. I have the benefit of free solar power and a free server with unmetered bandwidth, but without new storage, this vital preservation work will halt.

Every donation will help me acquire "renewed" datacenter grade 14TB hard drives, allowing me to continue my efforts to archive and protect public information for future generations. Currently, renewed Western Digital 14TB drives are about $160 ea, and the server can hold 8 of them. The GoFundMe goal is $1356 because that would fill the entire server with 112TB of storage.

07-13-2025: Donations have enabled me to purchase 5x 14TB drives so far!!  Only 3x to go!

Join me in saving our shared history!  https://gofund.me/17c41dda

You can find plenty of updates about this project on Mastodon: https://tech.lgbt/@Lydie


The torrents here are seeded by my torrent server 24/7
.  Please keep your downloads seeding as well! For torrents that I (re)created, I compress the data using 7z Ultra.

US Government Data Archive Links - this is where I got the data from originally:


https://commoncrawl.org/blog/january-2025-crawl-archive-now-available

https://meta.wikimedia.org/wiki/Data_dump_torrents#English_Wikipedia

https://github.com/end-of-term/eot2024

https://github.com/internetarchive/dweb-mirror

https://archive.org/details/20250128-cdc-datasets

https://wiki.archiveteam.org/index.php/Main_Page

https://github.com/lisawilliams/NIH_Data

https://archive.org/details/academictorrents_c5bf370a90cae548d5a306c1be7d79186b9f60b9

In general, I'm now hosting torrents supplied by SciOp. If you download torrents from there, you're likely pulling from my server!  Start here: https://sciop.net/datasets/


Personal data hoarding:

My personal archive consists of 172TB (and always growing). Below, the MyBooks are primary storage and 1st mirror (104TB).  The tall black unit is an air-gapped mirror (32TB).  The small black unit is the new air-gapped offsite mirror (36TB). There is no RAID in use.  DriveBender is still running the main NAS and air-gapped copy.  Stablebit DrivePool is doing the off-site mirror and torrent seed server.  All drives are standard NTFS.  If one drive dies, it's an easy recovery.  I've had two 8TB drives suffer instant death in the past year, and was back up in < 12 hours.


bulk cd/dvd ripping tower

junkbox server - torrent seeding server

off-site air-gapped NAS mirror

Off-line NAS mirror

primary and mirror live drives

been at this a long time !!

Go Home