I figured most of you could relate to this.
I was updating my Proxmox servers from 7.4 to 8. First one went without problems. That second one though… Yea, not so much… I THINK it’s GRUB but not sure yet.
Now my Nextcloud, NAS, main reverse proxy and half my DNS went down. And no time to fix it before work. Lovely 🤕 Well I now know what I’ll be doing when I get home.
Out of morbid curiosity, what are some of y’all’s self-hosting horror stories?
I have a beefy Unraid server for Docker containers and VMs. The idea was to have it replace all my computers. At home the VMs output the image to a monitor, so that’s my desktop. Remotely, I connect my phone to my home VPN, plug it into a lapdock, and use it as a thin client to connect to my VMs. NoMachine for Linux/work, Moonlight for Windows/gaming.
Well, it’s been over a year of my server not being able to reach an uptime higher than 15 days, and I have no fucking idea why. There are no traces of any error anywhere.
I’m using 3 Ubiquiti APs and running my own management instance on my server in a Docker container.
I still haven’t been able to figure out why, except for maybe crappy Ubiquiti firmware, but if that container goes down or loses connectivity then the APs flood my router with traffic and my whole network goes down.
Even wired connections don’t work since the router is locked up, and when my server comes back up it won’t be able to reestablish connection because the router is still dead.
The only way I’ve found to fix it is to power cycle the APs which is obviously a huge pain.
Can’t get any support from Ubiquiti on it since I’m not using one of their controllers even though it’s obviously a firmware issue. Definitely do not recommend.
That’s an odd one. I’ve dealt with UniFi at a lot of scales and never heard of them acting up when the controller goes down. Do you perhaps use a guest network with an intercept page? That’s the only thing I can imagine possibly causing any issue.
My first power outage was a very bad experience since I was absolutely not prepared for it.
I have no UPS since the grid is very stable here (it’s been the one and only outage in 5 years). The outage was short, but I had forgotten to enable the BIOS option on my server to power on when plugged in. So my server stayed shut down after electricity was restored. Of course, I happened to be away for the whole week when this happened, with no way to access my server physically.
This is the event that made me learn about and start using a KVM that I can use remotely.
Had my entire home setup (all my arr services, Nextcloud, Home Assistant, monitoring, etc.) running in my k3s setup on like 5 VMs at home. Had Velero backups of it, etc.
Fast forward to, I have no idea what happened, and my masters just died. Nothing would sync anymore, etc. Nobody in the k3s community had an idea either. So I lost my entire cluster, and the backups weren’t too useful since the cluster itself was dead.
Rebuilt with Talos. But man that sucked.
It is times like these that the love I have for my PiKVM is renewed ever stronger.