It's A Digital Disease!

21 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
101
 
 
The original post: /r/datahoarder by /u/Samseurynck on 2025-03-09 19:53:26.

Hey all!

I'm wondering if anyone has been compiling a list of datasets that have been deleted since inauguration day. I don't need the sets themselves, but their names.

Anyone know of somewhere I might find this?

102
 
 
The original post: /r/datahoarder by /u/Darkleader22 on 2025-03-09 19:42:57.
103
 
 
The original post: /r/datahoarder by /u/timabell on 2025-03-09 19:27:24.

I'm guessing that there will be some people here who like me have a healthy lack of trust in cloud "backups" and proprietary backup formats. I've been working on a tool to help me back up my laptop home folder to a usb disk.

https://github.com/timabell/disk-hog-backup

I'd love to know if anyone else thinks like me, and if anyone else would find this useful.

I'd be open to any alpha testing and feedback.

I'm a linux user, but it would be cool to get it to support windows and mac too.

This is my first post here, bit I think it might be a bit of a spiritual home. I lost a lot of data from cheap CD-R disks many years ago (it literally peeled off) and have been paranoid about data loss ever since.

104
 
 
The original post: /r/datahoarder by /u/Dry_Inflation307 on 2025-03-09 17:26:02.
105
 
 
The original post: /r/datahoarder by /u/PsychologicalBass738 on 2025-03-09 17:56:56.

Thx for all your good advice and stressing on the importance of backup and following 3-2-1 in my previous post here. Totally get the risk of data lost thing, but what are some good strategies to properly backup things like photos, videos, important files that I can’t afford to lose? Any device or platform that works well for you guys? Preferablly something that doesn't take lots of effort and time to do everyday, automatically backup will be ideal. I’m also looking for a budget-friendly option to start with—something that works now and can scale up as my data grows. Thanks so much!

106
 
 
The original post: /r/datahoarder by /u/PizzaK1LLA on 2025-03-09 15:18:07.

Hi all, I'm the developer of SeekDownloader, I'd like you present to you a commandline tool I've been developing for 6 months so far, recently opensourced it, It's a easy to use tool to automatically download from the Soulseek network, with a simple goal, automation.

When selecting your music library(ies) by using the parameters -m/-M it will only try to download what music you're missing from your library, avoiding duplicate music/downloads, this is the main power of the entire tool, skipping music you already own and only download what you're missing out on.

With this example you could download all the songs of deadmau5, only the ones you're missing

There are way more features/parameters on my project page

dotnet SeekDownloader \

--soulseek-username "John" \

--soulseek-password "Doe" \

--soulseek-listen-port 12345 \

--download-file-path "~/Downloads" \

--music-library "~/Music" \

--search-term "deadmau5"

Project, https://github.com/MusicMoveArr/SeekDownloader

Come take a look and say hi :)

107
 
 
The original post: /r/datahoarder by /u/redditunderground1 on 2025-03-09 14:12:24.

https://preview.redd.it/wwq44x3x6one1.jpg?width=550&format=pjpg&auto=webp&s=b5ec38c8f60ffecbd8f7d952a45cb7d44209155a

I read that HDD's lose magnetism over time and they must be re-recorded periodically to preserve the data. On 3.8.2025 I tested a retired Toshiba 500gb HDD that was formatted and filled up to about 98% capacity with photos and videos on 2.17.2015. After it was retired, it was put in a ziplock bag and stored in a garage where temperatures ranged from 45F to 85F for the 10-year period. It was not run during that time. When I looked at it, all the data (photos / videos) were fine.

I didn't do any drive software tests on it, as I didn't have any to use. I downloaded some drive software awhile back and it took over my computer, so I was happy to get rid of it. I archive audio, photos, videos and text files. Either they work or they don't work...those are the tests I'm using here.

I then decided to do a microwave test on the HDD. I had originated this use of microwaving drives by accident. Last year I had ordered a 4TB Samsung SSD and it had problems from the start. But I was hopeful the bugs would work out and tried to use it anyway. I was transferring a 1.8TB file to it and it jammed near the end of the transfer. I was horrified to find out it would not let me delete my data before sending it back for a refund. Hence the microwave came into my head. It was a natural offshoot from using the microwave to treat moldy and mildewed paper, which I do regularly.

I can't tell you how long this original microwave test was on the 4TB SSD, but it was just a few seconds. I didn't know what would happen or if it would wreck the microwave, so it was short. When I plugged the SSD into the computer it would not show up. I was happy with the results and gave it a little more microwave radiation after that for good measure.

All we hear about nowadays is EMP danger with digital, so that also inspired me to do some microwave tests. Here are the tests for the HDD, SD card and thumb drives tested in a 1000-watt Samsung microwave.

Toshiba 500gb HDD

1 second microwave test: Passed (I don't think the microwave does much microwaving in the first second.)

2 second microwave test: Passed

3 second microwave test: Failed - drive made a loud pop and sparks near the cord port. Computer would not recognize the drive. (Drive was microwaved without the cord.)

Generic 4GB SD Card

1 second microwave test: Passed (I don't think the microwave does much microwaving in the first second.)

2 second microwave test: Passed

3 second microwave test: Passed / Failed (?) Some sparks. One computer would not recognize about 80% of the files and they only showed up as icons. When I clicked on an icon it would not load and it said the file was corrupted. Another computer played everything fine.

4 second microwave test: Failed - card made lots of sparks, plastic started to melt in spots on both sides of the card and there was a strong burned plastic smell. Both computers would not recognize the drive.

Note: This test should be rerun with multiple cards for 3, 4, 5, 6 second tests to pinpoint the failure. I used 1 card and it received a total of 9 seconds of microwaving before it failed. (Not counting the 1 second test.)

Generic 8GB Thumb Drives

I used 2 thumb drives for this test.

Thumb drive #1

1 second microwave test: Passed (I don't think the microwave does much microwaving in the first second.)

2 second microwave test: Failed - drive made an audio sound when inserting into the USB port, but the computer would not recognize it. I tried it on 2 computers.

Thumb drive #2

3 second microwave test: Failed. Drive made a loud pop and sparks inside of the USB connector. Both computers would not recognize the drive.

The rest of the HDD magnetism tests will be 12-year, 15-year, 18-year, 20-years and 22-years...if I'm still around.

108
 
 
109
 
 
The original post: /r/datahoarder by /u/salty_greens on 2025-03-09 12:18:33.

Hi everyone, I just got a Orico 3588C3 external hard drive enclosure. In the description it says it supports 18TB hard drive, but I’m planning to buy a 20TB drive if possible. Would it cause compatibility problems? Should I just buy the 18TB drive?

110
 
 
The original post: /r/datahoarder by /u/Ratathosk on 2025-03-09 12:03:51.

Hi everyone,

Long time lurker here. I hope this fits sorry if it doesn't.

I've got loads of TB of hi-res photos at this point and i'm getting a bit nervous about losing it all in a fire. I'm a 3-2 but not -1 of the backup rule. I've been thinking about making a lower-res online last-ditch backup but i have no idea what tools people use for that. The amount is probably too much for me to afford to keep in original size online, it's at least 2 TBs worth of "curated photos".

Say i've got 100 gb of online backup space, is there a good tool people use so you can adapt the end file size and convert all the photos in batches so it'll "fit"?

English isn't my first language so maybe i'm just missing a keyword in my searches.

111
 
 
The original post: /r/datahoarder by /u/KongoOtto on 2025-03-09 11:28:56.

I#m thinking about buying some 20 or 22 TB of the Toshiba MG10F Series.

Any thoughts or experience with those drives?

112
 
 
The original post: /r/datahoarder by /u/GlaciarWish on 2025-03-09 09:21:29.

Hello everyone,

I deleted one of my media folders by mistake.

Thankfully no impact as I preform weekly snapraid sync and scrub.

While restoring data I noticed inodes are not being restored for hardlinks creating duplicate remuxes in my case. Snapraid is not restoring the inodes unfortunately it seems.

Going forward, I will probably start using syslinks.

My only concern I have many files that matches torrents by 99.9% then download slightly different media - I had no issues with hardlinks setup.

Will this work with syslinks when file download extra media at 99.9%?

I am worried another drive will crash or upgraded (in process) then I will end up with many hardlinks not linking anymore and creating dupes which is already stressful for me.

I know there is Apps like jdupe but I am not sure how accurate are they?

Fyi only I am talking about +6000 hardlinks between cross-seed and Plex.

113
 
 
The original post: /r/datahoarder by /u/ViperSteele on 2025-03-09 04:44:32.

Data Hoarding Setup Review

Hi, I'm considering my first serious data hoarding setup. I'm planning on using a DAS because I really only use my computer in my home office. I don't watch our TVs much and if I do it something me and the wife want to watch together. I have a Philips 34" that I enjoy watching shows and reading on. So I don't think a NAS is something for me. And I don't have the knowledge or time to jump into a NAS setup.

Equipment

  • MacBook Air M2 1TB
  • DAS: Terramaster D2-320. Reasons: Easy to use for beginners, Thunderbolt connection for fast speeds with my MacBook Air, compact and aesthetically pleasing on my desk
  • 2 × 8TB Western Digital Blue HDD. Reason: Good price point and 8TB seems like a reasonable starting capacity. Reason: Will use Backblaze for redundancy/backup

Backup Strategy

  • Backblaze Personal $100 bucks a year to back up both my MacBook and external drives
  • RAID 0
  • Considering an additional external drive for Time Machine backups
  • This provides redundancy if any of my 8TB drives fail

What do you think of this strategy? Any critiques or personal anecdotes about similar setups would be appreciated. I haven't purchased anything yet, so I'm open to advice or scenarios I might not have considered. Thanks in advance!

114
 
 
The original post: /r/datahoarder by /u/artesons on 2025-03-09 04:26:46.
115
 
 
The original post: /r/datahoarder by /u/Empty_Use6095 on 2025-03-09 03:36:40.

Hello to all i thought that this would be the best place for me to ask this kind of question. I have just picked up a buffalo terastation pro NAS its got 4 bays and is pretty old. i know it will not work with windows 11 but is there a way i could get this to work with a linux distro. Any help will be greatly appreciated

116
 
 
The original post: /r/datahoarder by /u/OptionSuspicious3428 on 2025-03-09 02:25:38.

Looking to create my DAS on the cheap, this housing a good deal? What other options should i be looking for?

https://preview.redd.it/ejpcaf5arkne1.png?width=1765&format=png&auto=webp&s=b28fae9a27ca59d9c451132854f04e0680c627a4

117
 
 
The original post: /r/datahoarder by /u/FruitLong8561 on 2025-03-08 14:34:05.

Hi! I was wondering about the best methods used currently to fully digitize a scanned book rather than adding an OCR layer to a scanned image.

I was thinking of a tool that first does a quick scan of the file to OCR the text and preserve images and then flags low-confidence OCR results to allow humans to review it and make quick corrections then outputting a digital structured text file (like an epub) instead of a searchable bitmap image with a text layer.

I’d prefer an open-sourced solution or at the very least one with a reasonably-priced option for individuals that want to use it occasionally without paying an expensive business subscription.

If no such tool exists what is used nowadays for cleaning up/preprocessing scanned images and applying OCR while keeping the final file as light and compressed as possible? The solution I've tried (ilovepdf ocr) ends up turning a 100MB file into a 600MB one and the text isn't even that accurate.

I know that there's software for adding OCR (like Tesseract, OCRmyPDF, Acrobat, and FineReader) and programs to compress the PDF, but I wanted to hear some opinions from people who have already done this kind of thing before wasting time trying every option available to know what will give me the best results in 2025.

118
 
 
The original post: /r/datahoarder by /u/uboofs on 2025-03-09 03:17:40.
119
 
 
The original post: /r/datahoarder by /u/throwaway69xx420 on 2025-03-09 03:06:10.

Howdy peeps

I've began my foray into data hoarding. I'm at the point where I need to upgrade from 12tb! I recently bought the Seagate Expansion 24tb external drive from Best Buy for $279. I currently only have a Dell Optiplex acting as a server for the usual stuff.

Curious what are the pros and cons of chucking? Should I chuck before/after the warranty on the Seagate drive? Some concerns (not sure how accurate these are) are that I will void the warranty once I shuck so it brings up a question of whether I should shuck now or wait until the warranty expires. Another concern is that I might break my drive in the process. Any advice and tips would be appreciated! Thank you friends!

120
 
 
The original post: /r/datahoarder by /u/Forward-Inflation-77 on 2025-03-09 02:02:30.

I am just starting the process of digitizing my family photo albums. I realize this will be a project that will probably take me months if not years. Not really sure how many photos I have to do, guessing easily in the thousands.

I have started out using scan speeder and doing 4, 5 or 6 pictures at a time and saving as a TIFF file using a Brother 2900 flatbed scanner but didn't realize could only do a few scans on the free edition. Don't mind spending the $30 for a 1 year license. But I realize it's possible this may not get done in a years time. Even doing multiple at a time, still time consuming. I know there are photo scanners specifically made for projects like this but they are several hundred dollars. Not sure if I want to invest that much just for a one time project. Need to look into a service that does this, for those that have used a service, what did it cost? Do places like walmart do stuff like this? Or will it take a specialized service? I have used auto splitter but I liked scan speeder better. Of course, would have to pay for auto splitter as well and that is a 2 year license vs the 1 year on scan speeder.

When buying a photo scanner, I have read that it is not good idea to use the ADF on regular printers to scan them, there is a chance it could damage pictures. Isn't that how the photo scanners scan pictures, being fed through machine? Or are the photo scanners more delicate than your typical AIO printer?

For the pictures that have writing on the back, how does one go about preserving that? I know I could scan both front and back but that would make 2 different photos. How do you keep track of which one goes with which picture? Would naming the picture with what it says on back a good way to go about that? One time consuming thing about this is most of the pictures are in sleeves instead of just boxes.

121
 
 
The original post: /r/datahoarder by /u/DougPedersen on 2025-03-08 23:50:48.

I pulled the black plastic case apart, and found the 12TB drive. Disconnected the USB interface board. Then connected it to my Dell Desktop .. it worked GREAT!! Any thoughts on if the USB interface board could have been the only issue? Or maybe the drive is ready to fail again? I tried plugging it in via USB before pulling it apart, and the computer could not even recognize it.

122
 
 
The original post: /r/datahoarder by /u/jku2017 on 2025-03-08 23:18:23.

Out of 4 drives in got from Amazon, one makes a lot of repeated noises and never initializes. Are ironwolfs good quality?

123
 
 
The original post: /r/datahoarder by /u/Alpha_Datura on 2025-03-08 22:03:49.

It doesn't matter if it is Acronis True Image 2015, 2016, or 2021, it seems to sporadically use sector by sector, or it doesn't do it at all. Can anyone help me figure out how to specify it reliably? I usually use the 2016 version, but instructions for any version would be great.

Thank you for your time!

edit: If there is a version that has a checkbox for sector by sector copy, I would like to know which one to get!

124
 
 
The original post: /r/datahoarder by /u/Warcraft_Fan on 2025-03-08 21:51:53.

Title should have been reworded. Sorry the title sounded too much like tech support help not a valuable information post

I have 2 of these drive and I noticed some oddity with it. If they are connected to my motherboard's SATA ports, they stop working after sleep. They still spin up but any attempt to access the drive gets "can't find file at specified location" error.

Asus Prime x570 Pro. I've tried updating SATA driver, changing ports to AHCI, hot swap, etc and simply nothing will work after I sleep the PC. Using Windows 11, up to date

But when I moved the drives to Dell H310 (cross flashed to LSI IT firmware), it always worked fine after sleep. I tried to ue Google and got a few results on MG drives, they seem to not like Asus SATA ports for some reason.

Just passing info if anyone else had issues with MG drives (or any other drives) with Asus motherboard, and you have trouble accessing them after sleeping, get a HBA and use that instead of onboard SATA.

125
 
 
The original post: /r/datahoarder by /u/PsiNexus on 2025-03-08 21:48:46.

Today was UPS battery swap-out day, and when I powered my system back up, one of my 3 shucked WD drives was no longer detected by my server by my 4 bay USB enclosure. I pulled all 3 drives and put them into their original WD Easy store enclosures, and again only 2 of the 3 drives were detected (this time by my laptop, not the server). When looking at lsblk, the problem drive is reading as 0 GB and was not spinning up on connection, whereas the other two drives immediately spun up when I plugged them in via USB to my laptop.

The strange thing is that when I installed the non-functioning drive in my gaming tower it spun up and was immediately detected, with data accessible. A few power cycles confirmed it would keep spinning up. Smartctl does not show any red flags and the short test passes without issue. However, it still does not spin up over USB.

Does anyone have any ideas about what might be going on? The drive is 5 years old, so failure isn't unlikely, but it's confusing that it works in the desktop. I'm not concerned about data loss at this point, I have backups and it's a parity drive for SnapRAID anyways. And to get ahead of things, I agree that USB is not the way, but sometimes it's what you have, can afford, and has been reliable for 7 years.

Thanks for your time!

view more: ‹ prev next ›