Lydie.cc - Data Preservation
|
|
Never trust a computer that you do not own. The "cloud" is a network of computers you don't own:
Fascists are deleting public data - help me preserve it! We're facing an unprecedented challenge: important public information is being systematically removed from government websites by the US fascist administration. A community of digital archivists, myself among them, is racing against time to download and share this data through torrents, preventing its permanent loss. Much of this essential data is sourced from sites like https://sciop.net/datasets/. I've been contributing by running a torrent server, but its existing tired hard drives are dropping like flies. I have the benefit of free solar power and a free server with unmetered bandwidth, but without new storage, this vital preservation work will halt. Every donation will help me acquire "renewed" datacenter grade 14TB hard drives, allowing me to continue my efforts to archive and protect public information for future generations. Currently, renewed Western Digital 14TB drives are about $160 ea, and the server can hold 8 of them. The GoFundMe goal is $1356 because that would fill the entire server with 112TB of storage. 07-13-2025: Donations have enabled me to purchase 5x 14TB drives so far!! Only 3x to go! Join me in saving our shared history! https://gofund.me/17c41dda You can find plenty of updates about this project on Mastodon: https://tech.lgbt/@Lydie |
|
The torrents here are seeded by my torrent server 24/7. Please keep your downloads seeding as well! For torrents that I (re)created, I compress the data using 7z Ultra.
|
|
US Government Data Archive Links - this is where I got the data from originally: https://commoncrawl.org/blog/january-2025-crawl-archive-now-available https://meta.wikimedia.org/wiki/Data_dump_torrents#English_Wikipedia https://github.com/end-of-term/eot2024 https://github.com/internetarchive/dweb-mirror https://archive.org/details/20250128-cdc-datasets https://wiki.archiveteam.org/index.php/Main_Page https://github.com/lisawilliams/NIH_Data https://archive.org/details/academictorrents_c5bf370a90cae548d5a306c1be7d79186b9f60b9 In general, I'm now hosting torrents supplied by SciOp. If you download torrents from there, you're likely pulling from my server! Start here: https://sciop.net/datasets/ |
|
Personal data hoarding: My personal archive consists of 172TB (and always growing). Below, the MyBooks are primary storage and 1st mirror (104TB). The tall black unit is an air-gapped mirror (32TB). The small black unit is the new air-gapped offsite mirror (36TB). There is no RAID in use. DriveBender is still running the main NAS and air-gapped copy. Stablebit DrivePool is doing the off-site mirror and torrent seed server. All drives are standard NTFS. If one drive dies, it's an easy recovery. I've had two 8TB drives suffer instant death in the past year, and was back up in < 12 hours. |
|
![]() bulk cd/dvd ripping tower |
![]() junkbox server - torrent seeding server |
![]() off-site air-gapped NAS mirror |
![]() Off-line NAS mirror |
![]() primary and mirror live drives |
![]() been at this a long time !! |