this post was submitted on 07 Jul 2024
840 points (92.7% liked)

linuxmemes

21675 readers
1292 users here now

Hint: :q!


Sister communities:


Community rules (click to expand)

1. Follow the site-wide rules

2. Be civil
  • Understand the difference between a joke and an insult.
  • Do not harrass or attack members of the community for any reason.
  • Leave remarks of "peasantry" to the PCMR community. If you dislike an OS/service/application, attack the thing you dislike, not the individuals who use it. Some people may not have a choice.
  • Bigotry will not be tolerated.
  • These rules are somewhat loosened when the subject is a public figure. Still, do not attack their person or incite harrassment.
  • 3. Post Linux-related content
  • Including Unix and BSD.
  • Non-Linux content is acceptable as long as it makes a reference to Linux. For example, the poorly made mockery of sudo in Windows.
  • No porn. Even if you watch it on a Linux machine.
  • 4. No recent reposts
  • Everybody uses Arch btw, can't quit Vim, and wants to interject for a moment. You can stop now.
  •  

    Please report posts and comments that break these rules!


    Important: never execute code or follow advice that you don't understand or can't verify, especially here. The word of the day is credibility. This is a meme community -- even the most helpful comments might just be shitposts that can damage your system. Be aware, be smart, don't fork-bomb your computer.

    founded 2 years ago
    MODERATORS
     
    you are viewing a single comment's thread
    view the rest of the comments
    [–] [email protected] 305 points 5 months ago (6 children)

    Fake news.

    Both Windows and Linux have their respective SIGTERM and SIGKILL equivalents. And both usually try SIGTERM before resorting to SIGKILL. That's what systemd's dreaded "a stop job is running" is. It waits a minute or so for the SIGTERM to be honoured before SIGKILLing the offending process.

    [–] [email protected] 45 points 5 months ago (19 children)

    Also fake because zombie processes.

    I once spent several angry hours researching zombie processes in a quest to kill them by any means necessary. Ended up rebooting, which was a sort of baby-with-the bath-water solution.

    Zombie processes still infuriate me. While I'm not a Rust developer, nor do I particularly care about the language, I'm eagerly watching Redox OS, as it looks like the micro kernel OS with the best chance to make to it useful desktop status. A good micro kernel would address so so many of the worst aspects of Linux.

    [–] [email protected] 82 points 5 months ago* (last edited 5 months ago) (1 children)

    Zombie processes are already dead. They aren't executing, the kernel is just keeping a reference to them so their parent process can check their return code (waitpid).

    All processes becomes zombies briefly after they exit, just usually their parents wait on them correctly. If their parents exit without waiting on the child, then the child gets reparented to init, which will wait on it. If the parent stays alive, but doesn't wait on the child, then it will remain zombied until the parent exits and triggers the reparenting.

    Its not really Linux's fault if processes don't clean up their children correctly, and I'm 99% sure you can zombie a child on redox given its a POSIX OS.

    Edit: https://gist.github.com/cameroncros/8ae3def101efc08be2cd69846d9dcc81 - Rust program to generate orphans.

    [–] [email protected] 3 points 5 months ago (1 children)

    I haven’t tried this, but if you just need the parent to call waitpid on the child’s pid then you should be able to do that by attaching to the process via gdb, breaking, and then manually invoking waitpid and continuing.

    [–] [email protected] 8 points 5 months ago* (last edited 5 months ago)

    I think that should do it. I'll try later today and report back.

    Of course, this risks getting into an even worse state, because if the parent later tries to correctly wait for its child, the call will hang.

    Edit: Will clean up the orphan/defunct process.

    If the parent ever tried to wait, they would either get ECHILD if there are no children, or it would block until a child exited.

    Will likely cause follow on issues - reaping someone elses children is generally frowned upon :D.

    [–] [email protected] 25 points 5 months ago

    Zombie processes are hilarious. They are the unkillable package delivery person of the Linux system. They have some data that must be delivered before they can die. Before they are allowed to die.

    Sometimes just listening to them is all they want. (Strace or redirect their output anywhere.)

    Sometimes, the whole village has to burn. (Reboot)

    [–] [email protected] 15 points 5 months ago* (last edited 5 months ago) (2 children)

    Performance is the major flaw with microkernels that have prevented the half-dozen or more serious attempts at this to succeed.

    Incurring context switching for low-level operations is just too slow.

    An alternative might be a safe/provable language for kernel and drivers where the compiler can guarantee properties of kernel modules instead of requiring hardware guarantees, and it ends up in one address space/protection boundary. But then the compiler (and its output) becomes a trusted component.

    [–] [email protected] 3 points 5 months ago (4 children)

    Thank you. Came here to say this. Microkernels are great for limited scope devices like microcontrollers but really suffer in general computing.

    load more comments (4 replies)
    [–] [email protected] 2 points 5 months ago

    where the compiler can guarantee properties of kernel modules instead of requiring hardware guarantees

    Then you would need to move compiler to kernel. Well, there is one: BPF(and derivatives). It's turing-incomplete by design.

    [–] [email protected] 9 points 5 months ago (3 children)

    RedoxOS would likely never become feature complete enough to be a stable, useful and daily-drivable OS. It's currently a hobbyist OS that is mainly used as a testbed for OS programming in Rust.

    If the RedoxOs devs could port the Cosmic DE, they'd become one of the best Toy OS and maybe become used on some serious projects . This could give them enough funds to become a viable OS used by megacorps on infrastructures where security is critical and it may lead it to develop into a truly daily drivable OS.

    [–] [email protected] 11 points 5 months ago (2 children)

    I believe the next step after that would be to wake up

    [–] [email protected] 5 points 5 months ago

    Hey, we can always dream!

    load more comments (1 replies)
    [–] [email protected] 4 points 5 months ago

    HURD 2, the return of rust

    [–] [email protected] 3 points 5 months ago (1 children)

    They are planning to port Cosmic DE, and have already ported several applications from Cosmic including the file manager and text editor if I remember correctly.

    [–] [email protected] 2 points 5 months ago* (last edited 5 months ago)

    They've shown the COSMIC terminal working on their last showcase video

    [–] jaybone 8 points 5 months ago (1 children)

    What does this have to do with Rust? Or redox, or micro kernels or Linux?

    load more comments (1 replies)
    [–] [email protected] 3 points 5 months ago

    I don't think a microkernel will help with zombies.

    [–] [email protected] 2 points 5 months ago (2 children)

    Ok, how change of kernel would fix userspace program not reading return value? And if you just want to use microkernel, then use either HURD or whatever DragonflyBSD uses.

    But generally microkernels are not solution to problems most people claim they would solve, especially in post-meltdown era.

    [–] [email protected] 2 points 5 months ago (1 children)

    This particular issue could be solved in most cases in a monolithic kernel. That it isn't, is by design. But it's a terrible design decision, because it can lead to situations where (for example) a zombie process locks a mount point and prevents unmounting because the kernel insists it's still in use by the zombie process. Which the kernel provides no mechanism for terminating.

    It is provable via experiment in Linux by use of fuse filesystems. Create a program that is guaranteed to become a zombie. Run it within a filesystem mounted by an in-kernel module, like a remote nfs mount. You now have a permanently mounted NFS mount point. Now, use mount something using fuse, say a WebDAV remote point. Run the same zombie process there. Again, the mount point is unmountable. Now, kill the fuse process itself. The mount point will be unmounted and disappear.

    This is exactly how microkernels work. Every module is killable, crashable, upgradable - all without forcing a reboot or affecting any processes not using the module. And in a well-designed microkernel, even processes using the module can in many cases continue functioning as if the restarted kernel module never changed.

    Fuse is really close to the capabilities of microkernels, except it's only filesystems. In a microkernel, nearly everything is like fuse. A linux kernel compiled such that everything is a loadable module, and not hard linked into the kernel, is close to a microkernel, except without the benefits of actually being a microkernel.

    Microkernels are better. Popularity does not prove superiority, except in the metric of popularity.

    [–] [email protected] 2 points 5 months ago* (last edited 5 months ago) (1 children)

    This particular issue could be solved in most cases in a monolithic kernel. That it isn't, is by design.

    It was(see CLONE_DETATCHED here) and is(source)

    Create a program that is guaranteed to become a zombie. Run it within a filesystem mounted by an in-kernel module, like a remote nfs mount. You now have a permanently mounted NFS mount point.

    Ok, this is not really good implementation. I'm not sure that standard requires zombie processes to keep mountpoints(unless its executable is located in that fs) untill return value is read. Unless there is call to get CWD of another process. Oh, wait. Can't ptrace issue syscall on behalf of zombie process or like that? Or use vfs of that process? If so, then it makes sense to keep mountpoint.

    Every module is killable, crashable, upgradable - all without forcing a reboot or affecting any processes not using the module.

    except without the benefits of actually being a microkernel.

    Except Linux does it too. If graphics module crashes, I still can SSH into system. And when I developed driver for RK3328 TRNG, it crashed a lot. Replaced it without reboot.

    Microkernels are better. Popularity does not prove superiority, except in the metric of popularity.

    As I said, we live in post-meltdown world. Microkernels are MUCH slower.

    [–] [email protected] 2 points 5 months ago* (last edited 5 months ago) (1 children)

    As I said, we live in post-meltdown world. Microkernels are MUCH slower.

    I've heard this from several people, but you're the lucky number by which I'd heard it enough that I bothered to gather some references to refute this.

    First, this is an argument that derived from first generation microkernels, and in particular, MINIX, which - as a teaching aid OS, never tried to play the benchmark game. It's been repeated, like dogma, through several iterations of microkernels which have, in the interim, largely erased most of those performance leads of monolithic kernels. One paper notes that, once the working code exceeds the L2 cache size, there is marginal advantage to the monolithic structure. A second paper running benchmarks on L^4^Linux vs Linux concluded that the microkernel penalty was only about 5%-10% slower for applications than the Linux monolithic kernel.

    This is not MUCH slower, and - indeed - unless you're doing HPC applications, is close enough to be unnoticeable.

    Edit: I was originally going to omit this, as it's propaganda from a vested interest, and includes no concrete numbers, but this blog entry from a product manager at QNX specifically mentions using microkernels in HPC problem spaces, which I thought was interesting, so I'm post-facto including it.

    load more comments (1 replies)
    [–] [email protected] 1 points 5 months ago (1 children)

    But generally microkernels are not solution to problems most people claim they would solve, especially in post-meltdown era.

    Can you elaborate? I am not an OS design expert, and I thought microkernels had some advantages.

    [–] [email protected] 2 points 5 months ago (1 children)

    Can you elaborate? I am not an OS design expert, and I thought microkernels had some advantages.

    Many people think that microcernels are only way to run one program on multiple machines without modyfing them. Counterexample to such statement is Plan 9, which had such capability with monolithic kernel.

    [–] [email protected] 2 points 5 months ago (1 children)

    That's not something I ever associated with microkernels to be honest. That's just clustering.

    I was more interested in having minimal kernels with a bunch of processes handling low level stuff like file systems that could be restarted if they died. The other cool thing was virtualized kernels.

    [–] [email protected] 1 points 5 months ago

    Well, even monolithic Linux can restart fs driver if it dies. I think.

    load more comments (12 replies)
    [–] Mojave 35 points 5 months ago (2 children)

    Clicking end task in windows task manager has definitely let the hanging task live in its non-responsive state for multiple hours before.

    [–] [email protected] 15 points 5 months ago (2 children)

    Been a while since I've been on Windows but I distinctly remember some button to kill a task without waiting. Maybe they removed it to make Windows soooo much more user friendly.

    [–] [email protected] 19 points 5 months ago (1 children)

    Off the top of my head: right click the task and hit end process. That has literally never failed me. Back in windows XP it might sometimes not actually kill the process but then there was always the "kill process tree" button to fall back on.

    [–] [email protected] 7 points 5 months ago (1 children)

    Yep, typically "Kill Process Tree" was like the nuke from orbit. You'll likely destroy any unsaved data, but it works nice when steam has 12 processes running at once.

    [–] Aux 3 points 5 months ago

    It's not really a nuke as some processes might be protected. The nuke is to use debugger privileges. Far Manager can kill processes using debugger privileges, that will literally nuke anything and in an instant: the app won't even receive any signals or anything.

    [–] [email protected] 7 points 5 months ago

    The normal Windows task manager's 'end task' button just politely asks the app to close - but then later will tell the user if the app is unresponsive, and offer to brutally murder it instead.

    There is also the sysinternals Process Monitor, which is basically 'expert' version of the task manager. Procmon does allow you to just kill a task outright.

    [–] Aux 9 points 5 months ago

    The end task doesn't terminate the app, it only sends a message to the window to close itself. The app will then decide what to do on its own. For example, if the app has multiple windows open, it might close the active one, but still continue running with other windows open. Or it might ignore the message completely.

    [–] [email protected] 11 points 5 months ago (1 children)

    That’s what systemd’s dreaded “a stop job is running” is

    The worst part of that is that you can't quickly login to check what it is (so maybe you can prevent it in the future?), or kill it anyway because it's likely to be something stupid and unimportant. And if it actually was important, well... it's gonna be shot in the head in a minute anyway, and there's nothing you can do to prevent it, so what's the point of delaying?

    [–] [email protected] 21 points 5 months ago (1 children)

    so what's the point of delaying?

    In the best case the offending process actually does shut down cleanly before the time is up. Like, some databases like redis keep written data in memory for fast access before actually writing the data to disc. If you were to kill such a process before all the data is written you'd lose it.

    So, admins of servers like these might even opt to increase the timeout, depending on their configuration and disc speed.

    [–] [email protected] 12 points 5 months ago* (last edited 5 months ago)

    I know what it theoretically is for, I still think it's a bad implementation.

    1. It often doesn't tell you clearly what it is waiting for.
    2. It doesn't allow you to checkout what's going on with the process that isn't responding, because logins are already disabled
    3. It doesn't allow you to cancel the wait and terminate the process anyway. 9/10 when I get it, it has been because of something stupid like a stale NFS mount or a bug in a unit file.
    4. If it is actually something important, like your Redis example, it doesn't allow you to cancel the shutdown, or to give it more time. Who's to say that your Redis instance will be able to persist its state to disk within 90 seconds, or any arbitrary time?

    Finally, I think that well written applications should be resilient to being terminated unexpectedly. If, like in your Redis example, you put data in memory without it being backed by persistent storage, you should expect to lose it. After all, power outages and crashes do happen as well.

    [–] [email protected] 8 points 5 months ago (1 children)

    Stop jobs are a systemdism and they're nice. I think the desktop environment kills its children on its own during reboot and it might not be as nice. Graphical browsers often complain about being killed after a reboot in GNOME.

    [–] [email protected] 3 points 5 months ago* (last edited 5 months ago) (1 children)

    AFAIK running firefox in a terminal and pressing ^C (SIGINT) has kind of the same effect as logging out or poweroff in GNOME (SIGTERM, if you're using systemd). This gives the browser (or other processes with crash recovery) enough time to save all its data and exit gracefully for the crash recovery the next time they are run.

    Please correct me if I'm wrong

    [–] [email protected] 5 points 5 months ago

    SIGTERM, if you're using systemd

    SIGTERM it was since original init

    [–] ConstantPain 3 points 5 months ago

    Windows gives you the option to kill on shutdown if the app is trying to delay the process. I think it's ideal.

    [–] dorumon 2 points 5 months ago

    BTW you can control systemd and how fast it chooses SIGKILL after sending SIGTERM. I don't know why people complain so much about it. It's really just there such that things on your computer end properly without any sort of data corruption or something bad going on after a reboot or the next time you turn on your computer.