this post was submitted on 06 Sep 2023
1064 points (99.3% liked)

Technology

59389 readers
4184 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] Semi-Hemi-Demigod@kbin.social 173 points 1 year ago (6 children)

Sysadmin pro tip: Keep a 1-10GB file of random data named DELETEME on your data drives. Then if this happens you can get some quick breathing room to fix things.

Also, set up alerts for disk space.

[–] Dkarma@lemmy.world 86 points 1 year ago (4 children)

The answer here is not storage it is better alerting.

[–] nickhammes@lemmy.world 30 points 1 year ago (1 children)

Why not both? Alerting to find issues quickly, a bit of extra storage so you have more options available in case of an outage, and maybe some redundancy for good measure.

[–] RupeThereItIs@lemmy.world 14 points 1 year ago (1 children)

A system this critical is on a SAN, if you're properly alerting adding a bit more storage space is a 5 minute task.

It should also have a DR solution, yes.

[–] nightwatch_admin@feddit.nl 1 points 1 year ago (1 children)

A system this critical is on a hypervisor with tight storage “because deduplication” (I’m not making this up).

[–] RupeThereItIs@lemmy.world 5 points 1 year ago (2 children)

This is literally what I do for a living. Yes deduplication and thin provisioning.

This is still a failure of monitoring or slow response to it.

You keep your extra capacity handy on the storage array, not with some junk files on the filesystem.

You also need to know how over provisioned you are and when you're likely to run out of capacity... you know this from monitoring.

Then when management fails to react promptly to your warnings. Shit like this happens.

[–] Semi-Hemi-Demigod@kbin.social 3 points 1 year ago

Then when management fails to react promptly to your warnings. Shit like this happens.

The important part is that you have your warnings in writing, and BCC them to a personal email so you can cover your ass

[–] nightwatch_admin@feddit.nl 1 points 1 year ago

Exactly, I was being sarcastic about management’s “solution”

[–] Agent641@lemmy.world 18 points 1 year ago (1 children)

Yes, alert me when disk space is about to run out so I can ask for a massive raise and quit my job when they dont give it to me.

Then when TSHTF they pay me to come back.

[–] XTornado@lemmy.ml 1 points 1 year ago

That high hourly rate is really satisfying, I guess....not been there.

[–] ipkpjersi@lemmy.ml 15 points 1 year ago (1 children)

A lot of companies have minimal alerting or no alerting at all. It's kind of wild. I literally have better alerting in my home setup than many companies do lol

[–] RagingRobot@lemmy.world 2 points 1 year ago (1 children)

It's certainly cheaper to not have any but it will limit growth substantially

[–] ipkpjersi@lemmy.ml 1 points 1 year ago (1 children)

I have free monitoring I set up myself though lol

[–] IonAddis@lemmy.world 4 points 1 year ago (1 children)

I imagine it's a case where if you're knowledgeable, yeah it's free. But if you have to hire people knowledgeable to implement the free solution, you still have to pay the people. And companies love to balk at that!

[–] ipkpjersi@lemmy.ml 2 points 1 year ago

I think it's that and any IT employees they have would not be allowed to work on it because they would be working on other stuff because companies wouldn't prioritize that, since they don't know how important it is until it's too late.

[–] looz@sopuli.xyz 9 points 1 year ago (2 children)

There's cases where disk fills up quicker than one can reasonably react, even if alerts are in place. And sometimes culprit is something you can't just go and kill.

And sometimes culprit is something you can’t just go and kill.

That's what the Yakuza is for.

[–] afraid_of_zombies@lemmy.world -1 points 1 year ago

Had an issue like that a few years back. A stand alone device that was filling up quickly. The poorly designed device could only be flushed via USB sticks. I told them that they had to do it weekly. Guess what they didn't do. Looking back I should have made it alarm and flash once a week on a timer.

[–] dx1@lemmy.world 53 points 1 year ago

The real pro tip is to segregate the core system and anything on your system that eats up disk space into separate partitions, along with alerting, log rotation, etc. And also to not have a single point of failure in general. Hard to say exact what went wrong w/ Toyota but they probably could have planned better for it in a general way.

[–] Lem453@lemmy.ca 31 points 1 year ago* (last edited 1 year ago) (3 children)

Even better, cron job every 5 mins and if total remaining space falls to 5% auto delete the file and send a message to sys admin

[–] Semi-Hemi-Demigod@kbin.social 21 points 1 year ago

Sends a message and gets the services ready for potential shutdown. Or implements a rate limit to keep the service available but degraded.

[–] gazter@aussie.zone 4 points 1 year ago

Also, if space starts decreasing much more rapidly than normal.

[–] bug@lemmy.one 3 points 1 year ago

At that point just set the limit a few gig higher and don't have the decoy file at all

[–] Maximilious@kbin.social 28 points 1 year ago* (last edited 1 year ago) (2 children)

10GB is nothing in an enterprise datastore housing PBs of data. 10GB is nothing for my 80TB homelab!

[–] Semi-Hemi-Demigod@kbin.social 28 points 1 year ago (1 children)

It not going to bring the service online, but it will prevent a full disk from letting you do other things. In some cases SSH won’t work with a full disk.

[–] GhostlyPixel@lemmy.world 29 points 1 year ago (2 children)

It’s all fun and games until tab autocomplete stops working because of disk space

[–] TrenchcoatFullofBats@belfry.rip 11 points 1 year ago

The real apocalypse

[–] model_tar_gz@lemmy.world 2 points 1 year ago

Tab complete in vim go lolllllooolol NO

[–] idunnololz@lemmy.world 11 points 1 year ago (1 children)

It's nothing for my homework folder.

[–] mohammed_alibi@lemmy.world 3 points 1 year ago

That's an incredible collection of homework!

[–] z00s@lemmy.world 8 points 1 year ago

Or make the file a little larger and wait until you're up for a promotion..

[–] mkhopper@lemmy.world 8 points 1 year ago