this post was submitted on 11 Mar 2025
131 points (99.2% liked)

Lemmy.ca's Main Community



Welcome to the lemmy.ca/c/main community!

All new users on lemmy.ca are automatically subscribed to this community, so this is the place to read announcements, make suggestions, and chat about the goings-on of lemmy.ca.

For support requests specific to lemmy.ca, you can use !lemmy_ca_support@lemmy.ca.


founded 4 years ago

Sorry for the downtime! Unfortunately our secondary firewall took over for some reason, and haproxy failed to properly come up.

I'll be scheduling a maintenance window in the next few days to do some further digging, so I can make sure this is fully resolved.

all 20 comments
[–] BCsven@lemmy.ca 36 points 1 day ago (1 children)

As Canadian as it gets :) Apologizing for the free service you offer.

Don't sweat it, everyone here appreciates the effort put in to run this, even if they don't reach out and express it.

[–] Shadow@lemmy.ca 24 points 1 day ago (3 children)

I abhor the fact that status.lemmy.ca says anything other than 100% uptime.

[–] DarkSirrush@lemmy.ca 7 points 1 day ago (1 children)

During your next maintenance, put in a second, secret uptime counter, and once it hits a reasonable amount of time, swap them. Nobody will notice!

[–] Tlaloc_Temporal@lemmy.ca 4 points 1 day ago

Uptime counter failover, indistinguishable from a clock.

[–] corsicanguppy@lemmy.ca 2 points 1 day ago

The high work ethic is plain to see. Nice debug and recovery.

[–] Grappling7155@lemmy.ca 2 points 1 day ago

Even hyperscaler cloud service providers don’t aim for 100%. Don’t sweat it.

[–] slothrop@lemmy.ca 20 points 1 day ago (1 children)

I had the shakes, deetees, drools and bends for a while.
This is the only instance I'm not banned on.
.
.
.
.

/s

[–] Shadow@lemmy.ca 28 points 1 day ago

Oh hold on, let me go fix that....

[–] whoisearth@lemmy.ca 18 points 1 day ago (3 children)

Wearing my Ops hat (it's what I do for a living, can't help it), I have a few questions, and I'm more than happy to assist depending on the answers.

  1. What TZ are you in?
  2. Are you open to volunteer support staff in other TZ?
  3. What monitoring solutions are you using? Nagios, Zabbix, etc.
  4. PagerDuty has a free tier. Do you have alerting and escalation set up?
  5. Do you have run books on the associated issues that come up?
  6. Could access be limited for a volunteer support staff?

I woke up just before 7am EDT and noticed the status page makes reference to PDT, so I can only assume you're based out of BC. Given that Canada spans so many TZs, it may be worthwhile to investigate a more mature support model that gives you some breathing room.

[–] Shadow@lemmy.ca 14 points 1 day ago* (last edited 1 day ago)

  1. Myself, Otter and the server are all in Vancouver. Smorks is out east and mp3 is in the middle of all of us. Between us we actually have pretty good coverage most days.
  2. Yes, if someone has professional SRE experience and wants to help out, I'm open to it. There's very little ongoing maintenance as things just run smoothly 99.999% of the time, but some additional eyes can't hurt, and if someone wants to build new stuff there are things I'd like to do =)
  3. Betterstack at the moment, combined with some custom health-check scripts which write to our Discord, and an improperly configured alert config on my phone that didn't wake me up.
  4. Oh neat, I didn't realize PagerDuty had a free tier, I'll check it out.
  5. Nope. There haven't been issues that come up; except for this one stupid OPNsense issue, everything has been amazingly stable.
  6. To some degree yes.
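
For anyone curious what a health-check script that writes to Discord (point 3) could look like, here is a minimal sketch. It is not the actual lemmy.ca script: `CHECK_URL`, `WEBHOOK_URL` and every function name are made up for illustration. The only external detail it relies on is that a Discord webhook accepts a JSON POST with a `content` field.

```python
import json
import urllib.error
import urllib.request

# Hypothetical values for illustration; neither comes from the thread.
CHECK_URL = "https://lemmy.ca/"
WEBHOOK_URL = "https://discord.com/api/webhooks/<id>/<token>"  # placeholder


def http_status(url, timeout=10.0):
    """Return the HTTP status code for url, or None if nothing answered."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status (4xx/5xx).
        return err.code
    except (urllib.error.URLError, OSError):
        # DNS failure, refused connection, timeout, and so on.
        return None


def is_healthy(status):
    """Treat any 2xx or 3xx response as healthy."""
    return status is not None and 200 <= status < 400


def build_alert(url, status):
    """Build the JSON body for a Discord webhook message."""
    reason = "no response" if status is None else f"HTTP {status}"
    return {"content": f"Health check failed for {url}: {reason}"}


def send_alert(webhook_url, payload):
    """POST the payload to the webhook as JSON and return the status code."""
    req = urllib.request.Request(
        webhook_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Explicit User-Agent, since some services reject urllib's default.
            "User-Agent": "healthcheck-sketch/0.1",
        },
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=10) as resp:
        return resp.status


def run_check(url, webhook_url, fetch=http_status, notify=send_alert):
    """Probe url once; send an alert and return False if it looks down."""
    status = fetch(url)
    if is_healthy(status):
        return True
    notify(webhook_url, build_alert(url, status))
    return False
```

Calling `run_check(CHECK_URL, WEBHOOK_URL)` from cron or a systemd timer every minute or so would be one way to use it. The `fetch` and `notify` parameters exist only so the logic can be exercised without touching the network.
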
[–] Sturgist@lemmy.ca 9 points 1 day ago

Not admin team, but I think I remember them saying the servers are in Vancouver.

[–] skankhunt42@lemmy.ca 6 points 1 day ago

I'm personally not willing to be on call but I was up around the same time and willing to help if they need it. I've been a Linux Admin for 13+ years and have my own k8s cluster in the basement.

I also don't mind the downtime. Thank you guys for everything you're doing!

[–] Album@lemmy.ca 6 points 1 day ago

Thank you for keeping the lights on!

[–] SneakyWeasel@lemmy.ca 7 points 1 day ago

All good. Made me fall asleep easier last night as I didn't doom scroll. Haha

[–] jerkface@lemmy.ca 5 points 1 day ago
[–] GameGod@lemmy.ca 4 points 1 day ago

You need that SRE team you said you don't have. :)

[–] Blaze@lemmy.dbzer0.com 4 points 1 day ago
[–] sloppychops@lemmy.ca 3 points 1 day ago

Love you. Mwaaaaah! 😘

[–] jerkface@lemmy.ca 2 points 1 day ago

more like nshaproxy