serenity

mirror of https://github.com/SerenityOS/serenity.git synced 2025-01-24 02:12:09 -05:00

Author	SHA1	Message	Date
Andreas Kling	f598bbbb1d	Kernel: Prevent executing I/O instructions in userspace All threads were running with iomapbase=0 in their TSS, which the CPU interprets as "there's an I/O permission bitmap starting at offset 0 into my TSS". Because of that, any bits that were 1 inside the TSS would allow the thread to execute I/O instructions on the port with that bit index. Fix this by always setting the iomapbase to sizeof(TSS32), and also setting the TSS descriptor's limit to sizeof(TSS32), effectively making the I/O permissions bitmap zero-length. This should make it no longer possible to do I/O from userspace. :^)	2020-01-01 17:31:41 +01:00
Andreas Kling	fd740829d1	Kernel: Switch to eagerly restoring x86 FPU state on context switch Lazy FPU restore is well known to be vulnerable to timing attacks, and eager restore is a lot simpler anyway, so let's just do it eagerly.	2020-01-01 16:54:21 +01:00
Andreas Kling	54d182f553	Kernel: Remove some unnecessary leaking of kernel pointers into dmesg There's a lot more of this and we need to stop printing kernel pointers anywhere but the debug console.	2019-12-31 01:22:00 +01:00
Andreas Kling	610f3ad12f	Kernel: Add a basic thread boosting mechanism This patch introduces a syscall: int set_thread_boost(int tid, int amount) You can use this to add a permanent boost value to the effective thread priority of any thread with your UID (or any thread in the system if you are the superuser.) This is quite crude, but opens up some interesting opportunities. :^)	2019-12-30 19:23:13 +01:00
Andreas Kling	50677bf806	Kernel: Refactor scheduler to use dynamic thread priorities Threads now have numeric priorities with a base priority in the 1-99 range. Whenever a runnable thread is not scheduled, its effective priority is incremented by 1. This is tracked in Thread::m_extra_priority. The effective priority of a thread is m_priority + m_extra_priority. When a runnable thread is scheduled, its m_extra_priority is reset to zero and the effective priority returns to base. This means that a lower-priority thread will always eventually get scheduled to run, once its effective priority becomes high enough to exceed the base priority of threads "above" it. The previous values for ThreadPriority (Low, Normal and High) are now replaced as follows: Low -> 10 Normal -> 30 High -> 50 In other words, it will take 20 ticks for a "Low" priority thread to get to "Normal" effective priority, and another 20 to reach "High". This is not perfect, and I've used some quite naive data structures, but I think the mechanism will allow us to build various new and interesting optimizations, and we can figure out better data structures later on. :^)	2019-12-30 18:46:17 +01:00
Andreas Kling	9e55bcb7da	Kernel: Make kernel memory regions be non-executable by default From now on, you'll have to request executable memory specifically if you want some.	2019-12-25 22:41:34 +01:00
Andreas Kling	52deb09382	Kernel: Enable PAE (Physical Address Extension) Introduce one more (CPU) indirection layer in the paging code: the page directory pointer table (PDPT). Each PageDirectory now has 4 separate PageDirectoryEntry arrays, governing 1 GB of VM each. A really neat side-effect of this is that we can now share the physical page containing the >=3GB kernel-only address space metadata between all processes, instead of lazily cloning it on page faults. This will give us access to the NX (No eXecute) bit, allowing us to prevent execution of memory that's not supposed to be executed.	2019-12-25 13:35:57 +01:00
Conrad Pankoff	0fdbe08637	Kernel: Fix debug message and kernel stack region names in thread setup	2019-12-24 01:28:38 +01:00
Conrad Pankoff	0cb89f5927	Kernel: Mark kernel stack regions as... stack regions	2019-12-24 01:28:38 +01:00
Conrad Pankoff	b557aab884	Kernel: Move ring0 stacks out of kmalloc_eternal This allows us to use all the same fun memory protection features as the rest of the system for ring0 processes. Previously a ring0 process could over- or underrun its stack and nobody cared, since kmalloc_eternal is the wild west of memory.	2019-12-24 01:28:38 +01:00
Conrad Pankoff	3aaeff483b	Kernel: Add a size argument to validate_read_from_kernel	2019-12-24 01:28:38 +01:00
Andreas Kling	523fd6533e	Kernel: Unlock the Process when exit()ing If there are more threads in a process when exit()ing, we need to give them a chance to unwind any kernel stacks. This means we have to unlock the process lock before giving control to the scheduler. Fixes #891 (together with all of the other "no more main thread" work.)	2019-12-22 12:38:01 +01:00
Andreas Kling	f4978b2be1	Kernel: Use IntrusiveList to make WaitQueue allocation-free :^)	2019-12-22 12:38:01 +01:00
Andreas Kling	4b8851bd01	Kernel: Make TID's be unique PID's This is a little strange, but it's how I understand things should work. The first thread in a new process now has TID == PID. Additional threads subsequently spawned in that process all have unique TID's generated by the PID allocator. TIDs are now globally unique.	2019-12-22 12:38:01 +01:00
Andreas Kling	16812f0f98	Kernel: Get rid of "main thread" concept The idea of all processes reliably having a main thread was nice in some ways, but cumbersome in others. More importantly, it didn't match up with POSIX thread semantics, so let's move away from it. This patch gets rid of Process::main_thread() and now we just have a bunch of Thread objects floating around each Process. When the finalizer nukes the last Thread in a Process, it will also tear down the Process. There are a bunch more things to fix around this, but this is where we get started :^)	2019-12-22 12:37:58 +01:00
Andreas Kling	3012b224f0	Kernel: Fix intermittent assertion failure in sys$exec() While setting up the main thread stack for a new process, we'd incur some zero-fill page faults. This was to be expected, since we allocate a huge stack but lazily populate it with physical pages. The problem is that page fault handlers may enable interrupts in order to grab a VMObject lock (or to page in from an inode.) During exec(), a process is reorganizing itself and will be in a very unrunnable state if the scheduler should interrupt it and then later ask it to run again. Which is exactly what happens if the process gets pre-empted while the new stack's zero-fill page fault grabs the lock. This patch fixes the issue by creating new main thread stacks before disabling interrupts and going into the critical part of exec().	2019-12-18 23:03:23 +01:00
Andreas Kling	7a64f55c0f	Kernel: Fix get_register_dump_from_stack() after IRQ entry changes I had to change the layout of RegisterDump a little bit to make the new IRQ entry points work. This broke get_register_dump_from_stack() which was expecting the RegisterDump to be badly aligned due to a goofy extra 16 bits which are no longer there.	2019-12-15 17:58:53 +01:00
Andreas Kling	b32e961a84	Kernel: Implement a simple process time profiler The kernel now supports basic profiling of all the threads in a process by calling profiling_enable(pid_t). You finish the profiling by calling profiling_disable(pid_t). This all works by recording thread stacks when the timer interrupt fires and the current thread is in a process being profiled. Note that symbolication is deferred until profiling_disable() to avoid adding more noise than necessary to the profile. A simple "/bin/profile" command is included here that can be used to start/stop profiling like so: $ profile 10 on ... wait ... $ profile 10 off After a profile has been recorded, it can be fetched in /proc/profile There are various limits (or "bugs") on this mechanism at the moment: - Only one process can be profiled at a time. - We allocate 8MB for the samples, if you use more space, things will not work, and probably break a bit. - Things will probably fall apart if the profiled process dies during profiling, or while extracting /proc/profile	2019-12-11 20:36:56 +01:00
Andrew Kaster	9058962712	Kernel: Allow setting thread names The main thread of each kernel/user process will take the name of the process. Extra threads will get a fancy new name "ProcessName[<tid>]". Thread backtraces now list the thread name in addition to tid. Add the thread name to /proc/all (should it get its own proc file?). Add two new syscalls, set_thread_name and get_thread_name.	2019-12-08 14:09:29 +01:00
Andreas Kling	8bb98aa31b	Kernel: Use a WaitQueue to implement finalizer wakeup This gets rid of the special "Lurking" thread state and replaces it with a generic WaitQueue :^)	2019-12-01 19:17:17 +01:00
Andreas Kling	5859e16e53	Kernel: Use a dedicated thread state for wait-queued threads Instead of using the generic block mechanism, wait-queued threads now go into the special Queued state. This fixes an issue where signal dispatch would unblock a wait-queued thread (because signal dispatch unblocks blocked threads) and cause confusion since the thread only expected to be awoken by the queue.	2019-12-01 16:02:58 +01:00
Andreas Kling	f067730f6b	Kernel: Add a WaitQueue for Thread queueing/waking and use it for Lock The kernel's Lock class now uses a proper wait queue internally instead of just having everyone wake up regularly to try to acquire the lock. We also keep the donation mechanism, so that whenever someone tries to take the lock and fails, that thread donates the remainder of its timeslice to the current lock holder. After unlocking a Lock, the unlocking thread calls WaitQueue::wake_one, which unblocks the next thread in queue.	2019-12-01 12:07:43 +01:00
Andreas Kling	f75a6b9daa	Kernel: Demangle kernel C++ symbols correctly again I broke this while implementing module linking. Also move the actual demangling work to AK, in AK::demangle(const char*)	2019-11-29 14:59:15 +01:00
Andreas Kling	e34ed04d1e	Kernel+LibPthread+LibC: Create secondary thread stacks in userspace Have pthread_create() allocate a stack and pass it to the kernel instead of this work happening in the kernel. The more of this we can do in userspace, the better. This patch also unexposes the raw create_thread() and exit_thread() syscalls since they are now only used by LibPthread anyway.	2019-11-17 17:29:20 +01:00
Andreas Kling	794758df3a	Kernel: Implement some basic stack pointer validation VM regions can now be marked as stack regions, which is then validated on syscall, and on page fault. If a thread is caught with its stack pointer pointing into anything that's not a Region with its stack bit set, we'll crash the whole process with SIGSTKFLT. Userspace must now allocate custom stacks by using mmap() with the new MAP_STACK flag. This mechanism was first introduced in OpenBSD, and now we have it too, yay! :^)	2019-11-17 12:15:43 +01:00
Andreas Kling	73d6a69b3f	Kernel: Release the big process lock while yielding in sys$yield() Otherwise, a thread calling sched_yield() will prevent other threads in that process from entering the kernel.	2019-11-16 12:18:59 +01:00
Andreas Kling	cb5021419e	Kernel: Move Thread::m_joinee_exit_value into the JoinBlocker There's no need for this to be a permanent Thread member. Just use a reference in the JoinBlocker instead.	2019-11-14 21:04:34 +01:00
Andreas Kling	69efa3f630	Kernel+LibPthread: Implement pthread_join() It's now possible to block until another thread in the same process has exited. We can also retrieve its exit value, which is whatever value it passed to pthread_exit(). :^)	2019-11-14 20:58:23 +01:00
Sergey Bugaev	1e1ddce9d8	Kernel: Unwind kernel stacks before dying While executing in the kernel, a thread can acquire various resources that need cleanup, such as locks and references to RefCounted objects. This cleanup normally happens on the exit path, such as in destructors for various RAII guards. But we weren't calling those exit paths when killing threads that have been executing in the kernel, such as threads blocked on reading or sleeping, thus causing leaks. This commit changes how killing threads works. Now, instead of killing a thread directly, one is supposed to call thread->set_should_die(), which will unblock it and make it unwind the stack if it is blocked in the kernel. Then, just before returning to the userspace, the thread will automatically die.	2019-11-14 20:05:58 +01:00
Andreas Kling	083c5f8b89	Kernel: Rework Process::Priority into ThreadPriority Scheduling priority is now set at the thread level instead of at the process level. This is a step towards allowing processes to set different priorities for threads. There's no userspace API for that yet, since only the main thread's priority is affected by sched_setparam().	2019-11-06 16:30:06 +01:00
Andreas Kling	49635e62fa	LibELF: Move AK/ELF/ into Libraries/LibELF/ Let's arrange things like this instead. It didn't feel right for all of the ELF handling code to live in AK.	2019-11-06 13:42:38 +01:00
Drew Stratford	5efbb4ae95	Kernel: Fix bug in Thread::dispatch_signal(). dispatch_signal() expected a RegisterDump on the kernel stack. However in certain cases, like just after a clone, this was not the case and dispatch_signal() would instead write to an incorrect user stack pointer. We now use the thread's TSS in situations where the RegisterDump may not be valid, fixing the issue.	2019-11-04 10:12:59 +01:00
Drew Stratford	44f22c99ef	Thread.cpp: add method get_RegisterDump_from_stack(). This refactors some of the RegisterDump code from dispatch_signal into a stand-alone function, allowing for better reuse.	2019-11-04 10:12:59 +01:00
Andreas Kling	cc68654a44	Kernel+LibC: Implement clock_gettime() and clock_nanosleep() Only the CLOCK_MONOTONIC clock is supported at the moment, and it only has millisecond precision. :^)	2019-11-02 19:34:06 +01:00
Andreas Kling	904c871727	Kernel: Allow userspace stacks to grow up to 4 MB by default Make userspace stacks lazily allocated and allow them to grow up to 4 megabytes. This avoids a lot of silly crashes we were running into with software expecting much larger stacks. :^)	2019-10-31 13:57:07 +01:00
Andrew Kaster	98c86e5109	Kernel: Move E2BIG calculation from Thread to Process Thread::make_userspace_stack_for_main_thread is only ever called from Process::do_exec, after all the fun ELF loading and TSS setup has occurred. The calculations in there that check if the combined argv + envp size will exceed the default stack size are not used in the rest of the stack setup. So, it should be safe to move this to the beginning of do_exec and bail early with -E2BIG, just like the man pages say. Additionally, advertise this limit in limits.h to be a good POSIX.1 citizen. :)	2019-10-23 07:45:41 +02:00
Andreas Kling	40beb4c5c0	Kernel: Don't leak an FPU state buffer for every spawned thread We were leaking 512 bytes of kmalloc memory for every new thread. This patch fixes that, and also makes sure to zero out the FPU state buffer after allocating it, and finally also makes the LogStream operator<< for Thread look a little bit nicer. :^)	2019-10-13 14:36:55 +02:00
Drew Stratford	c136fd3fe2	Kernel: Send SIGSEGV on seg-fault Now programs can catch the SIGSEGV signal when they segfault. This commit also introduced the send_urgent_signal_to_self method, which is needed to send signals to a thread when handling exceptions caused by the same thread.	2019-10-07 16:39:47 +02:00
Andreas Kling	d5f3972012	Kernel: No need to manually deallocate kernel stack Region in ~Thread() Since we're keeping this Region in an OwnPtr, it will be torn down when we get to ~OwnPtr anyway.	2019-09-27 19:10:52 +02:00
Drew Stratford	b65bedd610	Kernel: Change m_blockers to m_blocker. Because of the way signals now work there should not be more than one blocker per thread. This changes the blocker and thread class to reflect that.	2019-09-09 08:35:43 +02:00
Drew Stratford	e529042895	Kernel: Remove redundant kernel/user signal stacks. Due to the changes in signal handling m_kernel_stack_for_signal_handler_region and m_signal_stack_user_region are no longer necessary, and so, have been removed. I've also removed the similarly redundant m_tss_to_resume_kernel.	2019-09-09 08:35:43 +02:00
Andreas Kling	e386579436	Kernel: Fix bitrotted code behind #ifdef SIGNAL_DEBUG	2019-09-08 14:29:59 +02:00
Andreas Kling	899233a925	Kernel: Handle running programs that don't have a TLS image Programs without a PT_TLS header won't have a master TLS image for us to copy, so we shouldn't try to copy the m_master_tls_region then.	2019-09-07 17:06:25 +02:00
Andreas Kling	ec6bceaa08	Kernel: Support thread-local storage This patch adds support for TLS according to the x86 System V ABI. Each thread gets a thread-specific memory region, and the GS segment register always points _to a pointer_ to the thread-specific memory. In other words, to access thread-local variables, userspace programs start by dereferencing the pointer at [gs:0]. The Process keeps a master copy of the TLS segment that new threads should use, and when a new thread is created, they get a copy of it. It's basically whatever the PT_TLS program header in the ELF says.	2019-09-07 15:55:36 +02:00
Drew Stratford	95fe775d81	Kernel: Add SysV stack alignment to signal trampoline In both dispatch signal and asm_signal_trampoline we now ensure that the stack is 16 byte aligned, as per the System V ABI.	2019-09-05 16:37:09 +02:00
Drew Stratford	81d0f96f20	Kernel: Use user stack for signal handlers. This commit drastically changes how signals are handled. In the case that an unblocked thread is signaled it works much in the same way as previously. However, when a blocking syscall is interrupted, we set up the signal trampoline on the user stack, complete the blocking syscall, return down the kernel stack and then jump to the handler. This means that from the kernel stack's perspective, we only ever get one system call deep. The signal trampoline has also been changed in order to properly store the return value from system calls. This is necessary due to the new way we exit from signaled system calls.	2019-09-05 16:37:09 +02:00
Drew Stratford	259a1d56b0	Thread: added member m_kernel_stack_top. This value stores the top of a thread's kernel_stack.	2019-09-05 16:37:09 +02:00
Andreas Kling	77737be7b3	Kernel: Stop eagerly loading entire executables We were forced to do this because the page fault code would fall apart when trying to generate a backtrace for a non-current thread. This issue has been fixed for a while now, so let's go back to lazily loading executable pages which should make everything a little better.	2019-08-15 10:29:44 +02:00
Andreas Kling	83fdad25ed	Kernel: For signal-killed threads, dump backtrace from finalizer thread Instead of dumping the dying thread's backtrace in the signal handling code, wait until we're finalizing the thread. Since signalling happens during scheduling, the less work we do there the better. Basically the less that happens during a scheduler pass the better. :^)	2019-08-06 19:45:08 +02:00
Andreas Kling	5e01ebfc56	Kernel: Clean up thread stacks when a thread dies We were forgetting where we put the userspace thread stacks, so added a member called Thread::m_userspace_thread_stack to keep track of it. Then, in ~Thread(), we now deallocate the userspace, kernel and signal stacks (if present.) Out of curiosity, the "init_stage2" process doesn't have a kernel stack which I found surprising. :^)	2019-08-01 20:17:12 +02:00
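
The dynamic-priority rule described in commit 50677bf806 can be illustrated with a small standalone sketch. This is not the kernel's code: `SketchThread` and `pick_next` are hypothetical names, the two fields only mirror `m_priority` and `m_extra_priority` from the commit message, and the tie-breaking rule (the earlier thread in the list wins) is an assumption made here for determinism.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a kernel thread. Only the two priority fields
// named in the commit message are modeled.
struct SketchThread {
    int base_priority;      // plays the role of m_priority (1-99)
    int extra_priority = 0; // plays the role of m_extra_priority

    explicit SketchThread(int base)
        : base_priority(base)
    {
    }

    int effective_priority() const { return base_priority + extra_priority; }
};

// One scheduling decision over a list of runnable threads:
// - pick the thread with the highest effective priority
//   (assumption: on a tie, the earlier thread in the list wins),
// - reset the chosen thread's extra priority to zero,
// - add 1 to the extra priority of every runnable thread that was passed over.
// Returns the index of the chosen thread.
inline std::size_t pick_next(std::vector<SketchThread>& runnable)
{
    assert(!runnable.empty());

    std::size_t chosen = 0;
    for (std::size_t i = 1; i < runnable.size(); ++i) {
        if (runnable[i].effective_priority() > runnable[chosen].effective_priority())
            chosen = i;
    }

    for (std::size_t i = 0; i < runnable.size(); ++i) {
        if (i == chosen)
            runnable[i].extra_priority = 0;
        else
            ++runnable[i].extra_priority;
    }
    return chosen;
}
```

With one "Low" (10) and one "Normal" (30) thread, the Normal thread is picked 20 times in a row in this sketch while the Low thread's effective priority climbs from 10 to 30, which matches the "20 ticks" figure in the commit message; what happens at the moment of the tie depends on the assumed tie-breaking rule.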

111 commits