I have a server running Debian with 24 TB of storage. I would ideally like to back up all of it, though much of it is torrents, so only the ones with low seeders really need backed up. I know about the 321 rule but it sounds like it would be expensive. What do you do for backups? Also if anyone uses tape drives for backups I am kinda curious about that potentially for offsite backups in a safe deposit box or something.

TLDR: title.

Edit: You have mentioned borg and rsync, and while borg looks good, I want to go with rsync as it seems to be more actively maintained. I would like to also have my backups encrypted, but rsync doesn’t seem to have that built in. Does anyone know what to do for encrypted backups?

  • pe1uca@lemmy.pe1uca.dev
    link
    fedilink
    English
    arrow-up
    11
    ·
    6 months ago

    Well, I’m just starting with serious backups, AFAIK you only need to backup the data which you can’t replicate.

    Low seeded torrents are just hard to get, but not impossible. Personal photos, your notes, any other files generated by you are the ones which need backups.

    • taladar@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      7
      ·
      6 months ago

      Ideally you want to backup everything that you didn’t explicitly exclude since otherwise there is always something you forgot.

      • pe1uca@lemmy.pe1uca.dev
        link
        fedilink
        English
        arrow-up
        2
        ·
        6 months ago

        Well, I have my personal data in a specific folder, everything there is backed up.
        General media is in another one, which isn’t included.

  • Deckweiss@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    ·
    edit-2
    6 months ago

    The software borgbackup does some insane compression.

    It is more effective if you backup multiple machines tbh (my 3 linux computers with ~600gb used each get compressed down to a single ~350gb backup, because most of the files are the same programs and data over and over again)

    But it might do a decent enough job in your case.

    So one of the solutions might be getting a NAS and setting up borgbackup.

    You could also get a second one and put it in your parents or best friends home for an offsite backup.

    That way you don’t have to buy as large of a drive capacity, but will only have fixed costst (+electricity) instead of ongoing costs for some rented server storage.

    I guess that would be about 400$ per such a device, if you get a used office pc and buy new drives for it.


    Tape seems to be about half the price per TB, but then you need special reader/writer for it, which are usually connected via SAS and are FUCKING EXPENSIVE (over 4000$ as far as I can see).

    It only outscales HDDs in price after like ~600TB

  • ErwinLottemann@feddit.de
    link
    fedilink
    English
    arrow-up
    3
    ·
    6 months ago

    to your edit: rsync is a tool to copy/move files, borg is a backup utility. there are scripts that use rsync to create proper backups, but if you want to go by ‘more actively maintained’ you should look into how these scripts are maintained, not rsync itself.
    on the other hand - borg is actively maintained, there even are releases in the last two days, one stable and one beta. it also fulfills your ‘encrypted backup’ requirement and has a versioned backups built in.
    tl;dr comparing borg backup and rsync is comparing apples and oranges

  • capital@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    ·
    edit-2
    6 months ago

    My use case is basically the same as yours.

    I do restic to Wasabi.

    I’ve been on restic for a few years now and have never had an issue. I started out using Google Drive for the backend but that was though my college which went away eventually so I swapped over to Wasabi but I’m considering B2.

    It’s actively maintained and encrypted.

    There are a handful of backends it supports but can be extended by writing to an rclone backend.

  • solrize@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    ·
    edit-2
    6 months ago

    I’ve been using Borg and Hetzner Storage Box. There are some small VPS hosts that actually beat Hetzner’s pricing but I have been happy with Hetzner so am staying there for now. With 24TB of data you could also look at Hetzner’s SX64 dedicated server. It has a 6 core Ryzen cpu and 4x 16TB HDD’s for 81 euro/month. You could set it up as RAID 10 which would give you around 29 TiB of usable storage, and then you also have a fairly beefy processor that you can use for transcoding and stuff like that. You don’t want to seed from it since Hetzner is sticky about complaints that they might get.

    Tape drives are too expensive unless you have 100s of TB of data, I think. Hard drives are too unreliable. If you leave one in a closet for a few years, there’s a good chance it won’t spin back up.

    • dan@upvote.au
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      6 months ago

      for 81 euro/month.

      You can probably find something cheaper from their auction servers.

      I’ve got a storage VPS with HostHatch for my backups. It’s one of their Black Friday deals from a few years ago - 10TB storage for $10/month. Not sure they’ll offer that pricing again, but they did have something similar for around double the price during sales last year (still a good deal!)

      Tape drives are too expensive unless you have 100s of TB of data, I think

      The drives are expensive, and some manufacturers have expensive proprietary software, but the tapes themselves are cheaper per TB than hard drives, and they usually have a 20 or 30 year life guarantee. People seem to think tapes is old technology but modern tapes can fit 18TB uncompressed (they say 45 TB compressed but idk).

      The default tier of AWS glacier uses tape, which is why data retrieval takes a few hours from when you submit the request to when you can actually download the data, and costs a lot.

      • mea_rah@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        6 months ago

        The default tier of AWS glacier uses tape, which is why data retrieval takes a few hours from when you submit the request to when you can actually download the data, and costs a lot.

        AFAIK Glacier is unlikely to be tape based. A bunch of offline drives is more realistic scenario. But generally it’s not public knowledge unless you found some trustworthy source for the tape theory?

  • TedZanzibar@feddit.uk
    link
    fedilink
    English
    arrow-up
    3
    ·
    6 months ago

    Short answer: figure out how much of that is actually irreplaceable and then find a friend or friends who’d be willing to set aside some of their storage space for your backups in exchange for you doing the same.

    Tailscale makes the networking logistics incredibly simple and then you can do the actual backups however you see fit.

  • hperrin@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    6 months ago

    I have a machine at my parents’ house that has a single 20TB drive in it. I’ll log in once in a while and initiate an rsync to bring that up to current with my RAID at home. The specific reason I do it manually is in case there’s a ransomware attack. I won’t copy bad data. That’s also the reason I start it from the backup machine. The main machine doesn’t connect, the backup machine does, so ransomware wouldn’t cross that virtual boundary.

  • lorentz@feddit.it
    link
    fedilink
    English
    arrow-up
    2
    ·
    6 months ago

    I use rclone, which is essentially rsync for cloud services. It supports encrypion out of the box.

    • bandwidthcrisis@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      6 months ago

      I like the versatility of rclone.

      It can copy to a cloud service directly.

      I can chain an encryption process to that, so it encrypts then backs up.

      I can then mount the encrypted, remote files so that I can easily get to them locally easily (e.g. I could run diff or md5 on select files as naturally as if they were local).

      And it supports the rsync --backup options so that it can move locally deleted files elsewhere on the backup instead of deleting them there. I can set up a dir structure such as Oldfiles/20240301 Oldfiles/20240308 Etc that preserve deletions.

  • sloppy_diffuser@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    2
    ·
    6 months ago

    Important stuff (about 150G) is synced to all my machines and a b2 Backblaze bucket.

    I have a rented seed box for those low seeder torrents.

    The stuff I can download again is only on a mirrored lvm pool with an lvmcache. I don’t have any redundancy for my monerod data which is on an nvme.

    I’m moving towards an immutable OS with 30 days of snapshots. While not the main reason, it does push one to practicing better sync habits.

  • rambos@lemm.ee
    link
    fedilink
    English
    arrow-up
    1
    ·
    6 months ago

    I use Kopia to backup all personal data (nextcloud, immich, configs, etc) daily to another disk in the same server and also to backblaze B2. Its not proper 321 but feels good enough. I dont backup downloadable content because its expensive

  • Decronym@lemmy.decronym.xyzB
    link
    fedilink
    English
    arrow-up
    1
    ·
    edit-2
    5 months ago

    Acronyms, initialisms, abbreviations, contractions, and other phrases which expand to something larger, that I’ve seen in this thread:

    Fewer Letters More Letters
    Git Popular version control system, primarily for code
    NAS Network-Attached Storage
    PSU Power Supply Unit
    RAID Redundant Array of Independent Disks for mass storage
    SSD Solid State Drive mass storage
    VPS Virtual Private Server (opposed to shared hosting)

    6 acronyms in this thread; the most compressed thread commented on today has 10 acronyms.

    [Thread #642 for this sub, first seen 30th Mar 2024, 20:45] [FAQ] [Full list] [Contact] [Source code]

  • taladar@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    1
    ·
    6 months ago

    I have just been using Borg with a Hetzner Storagebox as the target. That has the advantage of being off-site and not using up a lot of space since it deduplicates. It also encrypts the backup. It might take a while for the initial backup at 24TB though depending on your connection.

    • poncho@lemmynsfw.com
      link
      fedilink
      English
      arrow-up
      0
      ·
      6 months ago

      Damn never heard of them looks great. Is there any catch or is it like a small company that might go out of business in a few years? I still haven’t had to backup more then 4tb but once I do get up to those numbers they might be the best option compared to offsite hard drives like I been doing

      • buedi@kbin.social
        link
        fedilink
        arrow-up
        1
        ·
        6 months ago

        As mentioned already, Hetzner is a very big Hoster in Germany. I am a customer since nearly 15 years now and in all that time they also rised the prices only once for the package I use (and I think it was only recently in 2023 or so where it went from 4,90€ to 5,39€). Also their Storage Box seems to be not only one of the cheapest out there I have seen, but as far as I remember, you do not have to pay for the traffic if you want to restore your data, like it is with other hosters. Also they had a good service, were responsive if I opened a Ticket in the past and I can not remember if I had ever problems with the service I use (Web Hosting package).

        • 7Sea_Sailor@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          1
          ·
          edit-2
          6 months ago

          Can confirm that there is 0 ingress or egress fees, since this is not an S3 container storage server, but a simple FTP server that also has a borg&restic module. So it simply doesnt fall into the e/ingress cost model.

      • dan@upvote.au
        link
        fedilink
        English
        arrow-up
        1
        ·
        6 months ago

        is it like a small company that might go out of business in a few years?

        Hetzner is one of the largest hosting companies in the world.

      • qaz@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        6 months ago

        I have been using their nextcloud service for several years now and it works great.

  • cybersandwich@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    edit-2
    6 months ago

    I don’t have nearly that much worth backing up(5TB–and realistically only 2TB is probably critical), but I have a Synology Nas(12TB raid 1) and truenas (zfs striped/mirrored) that I back my stuff to (and they back up to each other).

    Then I have a raspberry pi with a USB drive (8tb) at my parents house 4 hours away, that my Synology backs up to (over tailscale).

    Oh, and I have a USB HDD(8tb) that I plug in and backup my Synology Nas to and throw in my fireproof safe. But thats a manual backup I do once every quarter or 6 months if I remember. That’s a very very last resort backup.

    My offsite is at my parents.

    And no, I have not tested it because I don’t know how I’m actually supposed to do that.

  • narc0tic_bird@lemm.ee
    link
    fedilink
    English
    arrow-up
    1
    ·
    6 months ago

    I backup my /home folder on my PC to my NAS using restic (used to use borg, but restic is more flexible). I backup somewhat important data to an external SSD on a weekly basis and very important data to cloud storage on a nightly basis. I don’t backup my *arr media at all (unless you count the automated snapshots on my NAS), as it’s not really important to me and can simply be redownloaded in most cases.

    So I don’t and wouldn’t apply the 321 rule to all data as it’s simply too expensive for the amount of data I have and it’d take months to upload with my non-fiber internet connection. But you should definitely apply it to data that’s important to you.