Should I bother with RAID?

Dust0741@lemmy.world · 2 months ago

Should I bother with RAID?

𝕽𝖚𝖆𝖎𝖉𝖍𝖗𝖎𝖌𝖍 · 2 months ago

RAID 1 is mirroring. If you accidentally delete a file, or it becomes corrupted (for reasons other than drive failure), RAID 1 will faithfully replicate that deletion or corruption to both drives. RAID 1 only protects you from drive failure.

Implement backups before RAID. If you have an extra drive, use it for backups first.

There is only one case where it’s smart to use RAID on a machine with no backups, and that’s RAID 0 (striping, no redundancy) on a read-only server where the data is being replicated in from somewhere else. All other RAID levels only protect against drive failure, and not against the far more common causes of data loss: user- or application-caused data corruption.
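To make the mirroring-versus-backup point concrete, here is a toy sketch in Python. It is not how any real RAID implementation works; the `Mirror` class and the file name are made up purely for illustration. It only shows that a mirror survives a drive failure but faithfully repeats a delete, while a separate point-in-time copy does not.

```python
import copy


class Mirror:
    """Toy RAID 1 (hypothetical): every write and delete hits all live drives."""

    def __init__(self):
        self.drives = [{}, {}]

    def write(self, name, data):
        for drive in self.drives:
            if drive is not None:
                drive[name] = data

    def delete(self, name):
        # The mirror cannot tell an accident from intent; it deletes everywhere.
        for drive in self.drives:
            if drive is not None:
                drive.pop(name, None)

    def fail_drive(self, index):
        self.drives[index] = None

    def read(self, name):
        for drive in self.drives:
            if drive is not None and name in drive:
                return drive[name]
        return None


mirror = Mirror()
mirror.write("notes.txt", "v1")

# A backup is a separate point-in-time copy, not a live replica.
backup = copy.deepcopy(mirror.drives[0])

mirror.fail_drive(0)
survived_drive_failure = mirror.read("notes.txt")  # still served by drive 1

mirror.delete("notes.txt")  # accidental delete
after_delete = mirror.read("notes.txt")  # gone from the mirror
restored = backup.get("notes.txt")  # still in the backup
```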

whodatdair@lemmy.blahaj.zone · edit-2 2 months ago

I know it’s not totally relevant, but I once convinced a company to run their log aggregators on 75 servers, each with 15 disks in RAID 0.

We relied on the app layer to make sure there were at least 3 copies of the data, and if a node’s array shat the bed, the rest of the cluster would heal and re-replicate what was lost. Once the DC people swapped the disk, we had automation to rebuild the array and add the host back into the cluster.
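A rough sketch of that "at least 3 copies, heal on node loss" idea, in Python. The comment doesn't say which software did this or how it placed data, so the random placement, the `place` and `heal` helpers, and the chunk count are all assumptions for illustration only.

```python
import random

REPLICAS = 3  # from the comment: at least 3 copies of the data


def place(chunks, nodes, rng):
    """Assign each chunk to REPLICAS distinct nodes (random placement is an assumption)."""
    return {chunk: set(rng.sample(nodes, REPLICAS)) for chunk in chunks}


def heal(placement, dead_node, live_nodes, rng):
    """Drop the dead node and re-replicate its chunks onto other live nodes."""
    for holders in placement.values():
        holders.discard(dead_node)
        while len(holders) < REPLICAS:
            candidates = [n for n in live_nodes if n not in holders]
            holders.add(rng.choice(candidates))
    return placement


rng = random.Random(0)
nodes = [f"node{i}" for i in range(75)]
placement = place(range(1000), nodes, rng)  # 1000 chunks is an arbitrary number

dead = "node7"  # this node's RAID 0 array just died
live = [n for n in nodes if n != dead]
placement = heal(placement, dead, live, rng)
```

After `heal`, every chunk is back to three holders and none of them is the dead node, which is the property the cluster relied on instead of per-host redundancy.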

It was glorious - 75 servers each taking 1/75th of the read/write operations, and each server splitting its share further across 15 disks. Each query had the potential to have ~1100 disks respond in concert, each with a tiny slice of the data you asked for. It was SO fast.
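The arithmetic behind those numbers, as a quick Python sketch. The server and disk counts come from the comment; the per-disk failure probability is a made-up figure, included only to show why a 15-disk RAID 0 array needs redundancy somewhere else.

```python
servers = 75
disks_per_server = 15

total_disks = servers * disks_per_server  # 1125, i.e. the "~1100" above
share_per_server = 1 / servers            # fraction of each query per server
share_per_disk = 1 / total_disks          # fraction of each query per disk

# Assumption: each disk independently has a 2% chance of failing in some period.
p_disk = 0.02

# RAID 0 has no redundancy, so the array is lost if ANY of its disks fails.
p_array_loss = 1 - (1 - p_disk) ** disks_per_server
```

Under that assumed 2% figure, a single 15-disk array is far more likely to fail than any one disk, which is why the three copies at the app layer mattered.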

𝕽𝖚𝖆𝖎𝖉𝖍𝖗𝖎𝖌𝖍 · 2 months ago

And that, kids, is a great use of RAID: under some other form of data redundancy.

Great story!

Decipher0771@lemmy.ca · 2 months ago

Big ELK stack?