What Do You Use For Data Backup?

by Cliff Ravenscraft on September 22, 2008

This weekend, I was following my friends on Twitter and noticed that our friend Leo Leporte experienced something that all true geeks fear most, DATA LOSS!

Here’s how I saw it go down on Twitter:

First off, let me say that my thoughts and prayers go out to Leo during this time of loss! Weirdly enough, I almost mean that as much seriously as much as I do in a lighthearted way as I would express the same sentiments to someone who has lost someone near and dear to them.

I thought that the reason why people used a Raid 5 backup (RAID 5 : Combines three or more disks in a way that protects data against loss of any one disk; the storage capacity of the array is reduced by one disk.) was because hardware such as an individual hard drive is susceptible to failure and so that if all the data is mirrored on all three or more drives, if one drive fails, you have two or more good copies of the data. However, it would appear that a simple failure of the hardware known as the “Raid 5 Card” can completely wipe out the benefit of having more than one external drive backup. This is exactly why I’m not at all fascinated by Raid 5 or Drobos!

To many of us geeks, we put so much of who we are into ones and zeroes. Luckily, Leo didn’t loose any of his own content, just all the blood sweat and tears poured into the systems that were in place to deliver the content. However, the contributions from hundreds, if not thousands, of people who left comments on his blog or posts in his forums have been forever lost. At least all those done since the last “good backups” of each of the sites he maintains.

I’ve lost data to hard drive and other hardware failures in the past. Each time, I say that I’ve learned my lesson and take an oath to do daily/weekly backups of all important data. That plan usually lasts for a few weeks and then live without adequate backup protection continues.

I almost lost all hope of adequate backups back in the Tape Drive Backup Days. It seemed that these backup tapes almost never completed a successful recoverable backup of data. However, today, things to do seem a little more hopeful.

In the past, I relied so heavily on the operating system and computer program backups. Today, it appears that as long as I keep my most important data organized in a single directory on my drive, I could recover within hours if I simply had a solid backup of that folder with all the sub folders of data that I have saved.

The question is, what is the best technique of backup up this valuable data? Is it external drives, Raid5, Drobo? Is it online backup? I think that for true security, it is best to use more than one approach. However, so many of us today, still do all our computing with no such backup program in place at all.

Since starting my own business, data backup has been a high priority on my project list. At the core, I have an external hard drive hooked up to my computer and I use the paid version of Allway Sync to keep my Important Docs and Podcast Archives folders consistently synced with my external backup drive. I must say that I really do love this little software package as it gives me a ton of control over what gets synced and I can choose to have it sync both ways so that when I delete something on my local drive it keeps it on the backup or it deletes the it off the backup drive as well.

The only issue here is that if I have a fire in my home, I’m going to loose both my hard drive and the external drive both and therefore loose all my data regardless of how diligent I am with keeping everything backed up.

I could get more than one external drive and allow for multiple backups to multiple external drive and store all but the current one off site. However, who on earth will keep up with that? This is why I’m looking into also creating an online, off site, backup solution in conjunction with my external drive backup.

I’m currently looking at Carbonite Online Backup, Mozy Online Backup, as well as using Amazon’s S3 Web Services with a program called Jungle Disk.

As I move forward into into my search for the right backup solution that works best for me, I’ll do my best to come share what I learn with you.

Of course, I really would encourage some community discussion here in the comments section. What sort of things are you doing in the area of backup?

  • I have to second Rachel's recommendations about Memeo. No, I don't work for them (or for Tanagra, their old company name). Memeo is the solution to "I meant to do it." Set it and forget it. (Although note that the initial backup can take "forever," depending on how many files need to be protected.)

    Memeo has saved my bacon numerous times. Even a complete hard drive failure on my wife's machine (although a nuisance) was uneventful data-wise -- nothing was lost. Once all of the software and drivers were reinstalled, Memeo restored everything in a matter of 75 minutes. (That sounds like a lot of time, but my wife's machine has boatloads of music and family photos.)

    We have legit copies of Memeo on all our machines -- except for my Ubuntu laptop. Rachel, put in a good word with your bosses about coming up with a Linux/Debian/Ubuntu equivalent.
  • My company, Memeo, addresses exactly the concerns that are raised in this blog. All too often people commit to backing up their data regularly, but never do (I am just as guilty of this). Our software solves this problem by including a unique feature as part of our backup tool that will instantaneously backup your files without any prompting. So the moment you save/modify/add a file it is backed up to any destination you like, whether that is an external hard drive, USB drive, iPod, FTP site, Network drive, etc. Memeo thinks the best strategy for a backup plan is to have both an external hard drive and online storage. So just in case your computer and your back up drive is stolen or broken Memeo recommends using a secure off-site backup option (which they also provide). It is the best of both worlds, fast local backup and safe, online backup.
  • If you do decide to go with Mozy, ensure that you get your 10% discount. The latest discount codes are available at http://www.unlimitedbackup.net.

    Thanks
    Martin
  • Data loss is always a painful thing. At IDrive, we provide secure online backup for situations such as this. Also, if you need your data restored FAST, we have a service where we can overnight you a hard-drive with all your files on it, then you just mail the hard-drive back when you're done. So give www.IDrive.com a try. 2GB accounts are totally free. 150GB accounts are $4.95/mo.
  • From my limited knowledge in the area of RAID I know there are several different settings for RAID.

    One is to combine the storage of all the devices so that they show up as one large drive on your Operating System. This option is what I believe Leo was using, but I may be wrong. I do know that on this setting if your RAID card goes down you lose all your data on all the drives.

    Another is where you have at least 2 drives (or more) with the exact same storage capacity and they show up on your Operating System as 1 drive with the size of one of the single drives. The reason it only has the space of one of the drives is because it mirrors all the data onto all the drives attached so if one of them goes down or your RAID card goes down you do not lose any data. This is the backup (maybe there are more) setting of RAID. It is basically to keep you safe if one of the drive's die.

    That's my knowledge of RAID. I'm sure there are other ways that RAID works but those are the 2 I know about. I have an uncle who was using the first setting and lost a bunch of data so now he is using the second one for backup's.
  • I forgot to mention other ways I also backup. Of course I use CD/DVD's to start files, I also use a USB key to keep a copy of a password protected and encrypted password database that has so many passwords that are so long I couldn't tell you them if you forced me.

    A true and critical backup should always off site via physically. transporting it or backing up online. The "backups" I have at home or more for opps moments as if my primary hardware goes bad but if my house caught on fire, etc. it wouldn't make a difference.

    I haven't noticed any of those Jungle Disk issues someone else mentioned. Of course I don't backup everything via Jungle disk that would take forever and cost more but I do backup about 2-3 gigs worth of stuff. The cost is very minimal but I have always said if you lost your primary copy almost all backup solutions will be a bargain.
  • Hubert Kirchgaessner
    The Drobo's do two things really well:
    * they isolate you from the complexity of RAID (which they are using, of course), through proprietary extensions to the RAID model
    * they know how to charge an arm and a leg for a solution that is anything but original. It is VERY pricey

    Having said this, you can use Drobo's to achieve data security: you would have to buy two of them, plus two of their Ethernet addons; equip them with roughly equivalent disk space; store them in physically separate locations (so that one disaster cannot destroy both of them) that both have high-speed Internet access (and I do mean high-speed); and then use their web update model to keep both boxes in sync. Expensive, but it is safe.

    At home, I am running a server (Windows 2003, but you could use any technology for that). All 3 machines on our network are backed up to that server every night. I keep backup copies of "static" material (e.g. older pictures, the bulk of the mp3 library, etc.) on two sets of DVD's, one set is kept out of house. The server uses Mozy to run a backup of frequently changing data every night into "the cloud". I regard this combination as reasonably safe.
  • Not to harp on it, but Wikipedia has a great article on RAID. To quote the entry:
    RAID systems with redundancy continue working without interruption when one, or sometimes more, disks of the array fail, although they are vulnerable to further failures. When the bad disk is replaced by a new one the array is rebuilt while the system continues to operate normally. Some systems have to be shut down when removing or adding a drive; others support hot swapping, allowing drives to be replaced without powering down. RAID with hot-swap drives is often used in high availability systems, where it is important that the system keeps running as much of the time as possible.
    RAID is not a good alternative to backing up data. Data may become damaged or destroyed without harm to the drive(s) on which it is stored. For example, part of the data may be overwritten by a system malfunction; a file may be damaged or deleted by user error or malice and not noticed for days or weeks; and of course the entire array is at risk of catastrophes such as theft, flood, and fire.
  • Awesome clarification Robert! Thanks for the note!
  • I have to say something.

    RAID was never meant to be a backup solution. It is a solution for redundancy/high availability. RAID (not the same thing as a drobo) keeps things running. Drives are more likely to fail than the data sub-system, IDE, SCSI, SATA. So RAIDs, just keep your data available, not as backup/archives. A drobo is just a way to have lots of room for data. Anyone that things their data is "safe" on a drobo by itself has been mislead. I use JungleDisk, but you can use anything that can be located off site. Tapes, external drives, Iomega Jazz drives (or whatever) are just a part of a smart solution. RAID keeps your data available, backups keep your data secure.

    Make copies. Put them somewhere else. Data safe.
    Use RAID. Data available.
  • Jeremy
    I bought jungle disk hoping that I would have more ability to automate backups. My experience with JungleDisk has not been all that positive. It does create a drive letter for you, which is kind of cool. It's painfully slow, though.

    Also, browsing your files is not all that simple. It uses whacky naming conventions, whereas a piece of software like S3 Bucket Explorer will let you name the fulders/buckets whatever you want.

    The problem with S3 Bucket Explorer -- as slick as it is -- is that it does not provide a drive letter for you to integrate with your backup solution(s). All operations have to be manual...at least for the time being (even though they promised the backup/sync feature back in early 08 and it never happened).

    I also have tried Carbonite. It has the flat annual rate going for it, though I suspect that's a ruse to get new customers and then jack up the price -- after all, who's going to want to back up all their data again once it's off-site? I have yet to find a software bundle that is not ugly and slow when it comes to recovering your files.

    As for JungleDisk, getting that monthly bill from Amazon Web Services can hurt a little bit, but you have to view it as a security investment. I also use SyncBack locally. It sounds like we have pretty much the same setup, Cliff.

    Even given all that, I prefer S3 Bucket Explorer since i can just drag a folder and tell it to overwrite all similar files. It works like Windows Explorer, so there's no learning curve. There's no whacky 24-character hashes before my folder name.

    Good luck!
    Jeremy
  • I current do three different backups. Time Machine for the daily, "Whoops" I didn't mean to delete the file" situations. I also to a SuperDuper! backup about once a week on a firewire drive so I will have a bootable full backup and finally I also use Jungle Disk using Amazon S3 for those really important files that would practically bring down my business if I lost that stuff and had nothing else. It's important to have the most important and critical things stored remotely.
  • I've been using something called SuperDuper! for backing up my MacBook Pro. It's creates a bootable backup of my hard drive so if this one every fails, I have a fresh back up ready to go. It has a smart back up feature which will only copy those files which are modified since the last back up. It has the ability to schedule when you start the backup and if you want to get really geeky it's even scriptable.

    For the other machines in the house, I'm thinking about getting a time capsule and just letting them use that.
blog comments powered by Disqus

Previous post:

Next post: