Hi everyone. I was considering backup options to Glacier Deep Archive, and wanted to know:
- Which software do you use to encrypt client-side, obfuscate, compress and deduplicate the data before you send it to S3?
- What is the difference between
Restore Requests (bulk)
andOutbound data transfer
and which one will I be using when I want to pull my data from AWS?
I’ll be storing approximately 8TB or so of data, which is why I was looking at inexpensive ways to back it up other than buying an HDD outright.
Thanks!
Recently I looked into the same thing, since AWS caught my eye with their apparently ridiculously low prices. Then I found this (presumably indepdenant) review, that changed my view on things: https://b3n.org/b2-vs-s3-nas-backup/
After reading that, I won’t go with AWS. I’m currently considering to abuse the OneDrive Office Family plan, which costs 99 $ a year for 6 TB of storage (split across 6 accounts), which comes down to 1,40 $ per month per TB. A price that I have not seen beaten by other storage / backup providers.
The problem with AWS is that one need to bring all storage from glacier to regular S3 (which is quite expensive), and then their egress costs are massive. I read more about it and completely agree that AWS is not worth it at all when pulling data in.
However, BackBlaze is quite expensive. I’d be paying $50 for my storage, which is simply not how much I’d like to pay. At this rate, I’d get a 10TB Ironwolf drive in an enclosure from Amazon for my cold storage.
TBH, I was looking at other providers, and Dropbox looks much less expensive at $20 for 9TB. It’s a lot more than the 8TB I thought I’d be able to get away with for Glacier storage, but at least it’s not $50. I will take a look at scaleaway, but I simply trust the bigger players to be around and keep my files safe than the smaller companies.
Edit: Scaleway seems less expensive than Dropbox with their glacier option. I really hope I can trust this company because this is an excellent price and I might even be willing to pay as much if it can be kept secure and safe. Now, I just need to read more about the value proposition of Dropbox vs Scaleway Glacier.
Regarding Dropbox: Where are you seeing 9 TB for 20 $? I’m in the EU so my pricing may vary, but all I can see is the Business plan for 16 € per month per user, with a minimum of 3 users in the plan, making it cost 48 € instead. Do you have access to something else?
Regarding Backblaze: Agreed that 50$ a month is a rough bill to pay, that sums up very fast if youre counting across the years. But their storage is also a lot more reliable than one single hard drive stored in a bank locker, with them always checking their arrays and replacing aged drives.
Regarding Scaleway: If im reading their pricing chart right, it would cost roughly 2 € / month / TB for glacier storage, and 9 € / TB when restoring from glacier to standard storage? A big questionmark for me is how ingress works. If I’m using this for backups in case of total system failure, i’ll want to upload differential backups (borg/duplicati) every couple days. How is that going to work with pricing, is that all running through standard storage driving up monthly cost, do I have to manually manage file history and deletion of older stuff or does my backup software handle that? Plus, you loose out on the instant file access that you get with Backblaze, or with something hacky like Dropbox / Onedrive. I’m still undecided which I value more, money or fast access.
Hello ! Just adding my two cents for Scaleway. I’ve used them personally for some services (and probably will add s3 storage in the near future)
It’s seems pretty reliable in my opinion.
If I were to pay $50 a month for 8TB, I’d have paid about $200 in 4 months. I can get a 10TB-12TB external drive (with Ironwolf/WD Red Pro CMR inside) for about $200.
I should have worded that properly, apologies.
That class of storage is very expensive to get your data back. Buying a drive will be cheaper.
I think that people would be using the service as a last resort, like when all other local or physical offsite backups fail.
In that sense, the cost to recover shouldn’t be the main factor when considering it.
Is there a less expensive alternative for Cloud storage with a decent SLA? I don’t want to go for the smaller companies, and BackBlaze is quite expensive too!
With my Synology NAS, I use icloud e2 for cloud storage. Reasonably priced, and it integrates with Synology’s Hyperbackup software.
But my needs are relatively small, sending < 5TB to my cloud backup. A few more TB and I may start looking at other options.
Wait, I’m looking at the data retrieval cost (bulk request) and it says it’s priced at $0.0025 per GB? That comes out to about $21 for a retrieval! Am i missing something important?
Take a look at the calculations here https://www.arqbackup.com/aws-glacier-pricing.html
It explains it a bit better. You have to factor in how many requests you need too. So both file sizes and amount of files.
Lots of answers in the comment about this particular storage type/vendor. Regardless, to answer your original question, rclone. Hands down. If you spend 30-60 minutes actually reading their documentation, you are set and understand so much more of what’s going on under the hood.
Thanks, I do know of rclone
and intend to study it. I was just wondering if the likes of borg
, duplicati
etc could be used.
Can’t speak for those but I tried Kopia and it did the job okay. Ultimately tho I landed on rclone.
-
I don’t encrypt before I push to S3. Probably bad practice on my part. I just rely on AWS encryption to secure my data. My backups are low-risk (imo). That said, I lock down the bucket so that only my account can access the objects. Compression I use
tar cjf
(bzip). Protip: Once the tar file is made, runtar ljf $archiveFile > archiveFile-ls.txt
and store the resulting file along with the tar file in standard storage. That way you know what is in the archive. -
Both.
Restore Requests
is to copy the data out from Glacier into Standard storage. Note that I said copy. When you perform a restore, your original object stays in glacier and AWS creates a copy to somewhere in S3 that you specify. Once the restore is complete, you can then download the copied object like any S3 object, triggering theOutbound data transfer
fee.
- I use Kopia (https://kopia.io/)
I’ve been using S3 but I’m considering Cloudflare R2 as it might be a bit cheaper
Really? Their pricing is even more expensive than AWS’ S3 Glacier Archive! I’d much rather use BackBlaze B2 than pay that much!