::  Posts  ::  RSS  ::  ◂◂RSS  ::  Contact

Backup Strategy

January 22nd, 2019
tech
After reading a post from someone who nearly lost all their data to a joint NAS and external hard drive failure, I decided to think through my data durability.

Most of my stuff lives in the cloud: email on Gmail, docs in Google Docs, photos in Google Photos, data in Google Drive. I trust Google a lot for this sort of thing, both given their public reputation and as an employee. I think Google is extremely unlikely to lose or corrupt my data.

The next biggest place where I have things is on the server that runs jefftk.com. I back this up to my laptop with a cronjob that looks like:

rsync -t -l -r -c ps:jtk/ www-jefftk-com/ --delete-after
rsync -t -l -r -c ps:tc/ www-trycontra-com/ --delete-after
rsync -t -l -r -c ps:fr/ www-freeraisins-com/ --delete-after
...
rsync -t -l -r -c ps:kf/ www-kingfisherband-com/ --delete-after
rsync -t -l -r -c ps: ps-rest/ --delete-after
rsync -t -l -r -c ps:/etc/ ps-etc/ --delete-after

This copies each of my websites, the rest of my home directory, and the contents of /etc. It's set to run once a day, if my laptop is on then, which it is more often than not. It copies them into a subdirectory of ~/Google Drive, which means it then all gets synced to Drive.
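A job like this could be wrapped in a small script along the following lines. This is a minimal sketch rather than the actual setup: the `BACKUP_ROOT` directory name, the `sync_one` and `backup_all` helpers, the `--run` flag, and the time of day in the crontab comment are all assumptions made for illustration.

```shell
#!/bin/sh
# Hypothetical wrapper around the rsync commands above (not the actual script).

# Assumed location: the post only says "a subdirectory of ~/Google Drive".
BACKUP_ROOT="${BACKUP_ROOT:-$HOME/Google Drive/server-backup}"
# Overridable so the calls can be previewed, e.g. RSYNC=echo.
RSYNC="${RSYNC:-rsync}"

# sync_one SRC DEST: mirror remote SRC into BACKUP_ROOT/DEST.
#   -t keep modification times   -l copy symlinks as symlinks
#   -r recurse                   -c compare files by checksum
#   --delete-after: remove local files that no longer exist remotely,
#                   once the transfer has finished
sync_one() {
    "$RSYNC" -t -l -r -c --delete-after "$1" "$BACKUP_ROOT/$2"
}

# Run every sync; keep going after a failure but report it in the exit status.
backup_all() {
    mkdir -p "$BACKUP_ROOT" || return 1
    status=0
    sync_one ps:jtk/ www-jefftk-com/ || status=1
    sync_one ps:tc/ www-trycontra-com/ || status=1
    # ... one line per remaining site ...
    sync_one ps: ps-rest/ || status=1
    sync_one ps:/etc/ ps-etc/ || status=1
    return "$status"
}

# Only run when asked to, so the file can also be sourced for testing.
if [ "${1:-}" = "--run" ]; then
    backup_all
fi

# Example crontab entry (the time of day is an assumption):
#   0 12 * * * /path/to/backup-server.sh --run
```

Running the script with `RSYNC=echo` prints the arguments each rsync call would receive instead of transferring anything, which is a cheap way to check the paths before trusting `--delete-after`.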

I also have a cronjob set up to run twice a month that backs up the comments on my blog posts. It pulls out the /wsgi/json-comments/ URLs from my blog posts, fetches them all, and saves them. When Google Plus gets turned off, I'm planning to configure my comment system to serve those comments from my backup.
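The extract-fetch-save steps could look roughly like the sketch below. It is not the real job: it assumes the posts are available as local files, that the comment URLs are site-relative paths made of letters, digits, and `_ . / -`, and that the base URL is passed in by the caller. The function names, output file naming, and crontab schedule are hypothetical.

```shell
#!/bin/sh
# Hypothetical comment backup: find comment URLs in post files, fetch, save.

# extract_comment_paths FILE...: print each distinct /wsgi/json-comments/ path.
# The character class is an assumption about what the paths contain.
extract_comment_paths() {
    grep -h -o '/wsgi/json-comments/[A-Za-z0-9_./-]*' "$@" | sort -u
}

# save_comments BASE_URL OUT_DIR FILE...: fetch every path found in the files
# and write each response to its own file under OUT_DIR.
save_comments() {
    base_url=$1
    out_dir=$2
    shift 2
    mkdir -p "$out_dir" || return 1
    extract_comment_paths "$@" | while IFS= read -r path; do
        # Flatten the path into a file name: drop the leading slash,
        # then turn the remaining slashes into underscores.
        name=$(printf '%s' "$path" | sed 's|^/||; s|/|_|g')
        # -f: fail on HTTP errors, -sS: quiet but still report errors
        curl -fsS "$base_url$path" -o "$out_dir/$name.json" ||
            echo "failed to fetch $path" >&2
    done
}

# Example crontab entry for twice a month (days and time are assumptions):
#   0 3 1,15 * * /path/to/backup-comments.sh
```

A call might look like `save_comments https://www.jefftk.com ~/comment-backup posts/*.html`, where both directory names are placeholders.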

Finally, code lives on GitHub plus local checkouts on my server, laptop, or both.

Everything on my laptop is temporary, and anything important is under ~/Google Drive. All my dotfiles, including my ~/.full_history, are symlinks into ~/Google Drive.
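Setting up a dotfile this way can be sketched as a small helper. The `dotfiles` subdirectory and the `link_dotfile` function are assumptions for illustration; the post does not say where under ~/Google Drive the files live.

```shell
#!/bin/sh
# Hypothetical helper: keep a dotfile in the synced directory and leave a
# symlink to it in the home directory.

# Assumed subdirectory of ~/Google Drive.
DOTFILES_DIR="${DOTFILES_DIR:-$HOME/Google Drive/dotfiles}"

# link_dotfile NAME: make ~/NAME a symlink to DOTFILES_DIR/NAME.
link_dotfile() {
    name=$1
    target="$DOTFILES_DIR/$name"
    link="$HOME/$name"
    mkdir -p "$DOTFILES_DIR" || return 1

    # A real file is still sitting in the home directory.
    if [ -e "$link" ] && [ ! -L "$link" ]; then
        if [ -e "$target" ]; then
            # Two copies exist; refuse rather than overwrite either one.
            echo "refusing: both $link and $target exist" >&2
            return 1
        fi
        # First run: move the file into the synced directory.
        mv "$link" "$target" || return 1
    fi

    if [ ! -e "$target" ]; then
        echo "nothing to link: $target does not exist" >&2
        return 1
    fi

    # -s symbolic, -f replace an existing link, -n do not follow an
    # existing link that points at a directory
    ln -sfn "$target" "$link"
}
```

For example, `link_dotfile .full_history` would move an existing ~/.full_history into the synced directory on first use and replace it with a symlink.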

The main weakness in this setup is its dependency on Drive. Specifically:

  • For the portion of Google Drive synced to my laptop, if I accidentally deleted or changed things locally, that change could be synced up. If I noticed right away, there's the Drive Trash, which keeps things for 30 days, or I could contact support. Something that corrupted files in a subtle way could be pretty bad.

  • Very occasionally people have lost access to their Google accounts, for example by forgetting their password or getting mistaken for a fake account. If this happened to me it would be a disaster: so much of what I use is a Google service.

I think both of these are too unlikely to be worth mitigating, though. They could happen, but they're much less likely than the sort of failure (~1%/year) that you're trying to avoid by backing things up.

Update 2019-01-26: the day after posting this I spilled water on my laptop, but it was actually fine.


