|
ok deraadt@
|
|
The kernel is not quite ready for timeout_in_nsec(). Remove it and
kclock_nanotime(). Both are unused.
Prompted by jsg@.
ok kn@
|
|
|
|
allow us to turn off the screen on Apple Silicon laptops until we have a
proper display controller driver.
ok kettenis@ patrick@
|
|
|
|
Move up the comment explaining the different locks to account for all structs.
OK millert mvs
|
|
ok patrick@
|
|
During resume, it isn't necessarily a problem if the UTC time we get
from inittodr(9) lags behind the system UTC clock. In particular, if
the active timecounter's frequency is low enough, tc_delta() might not
overflow across a brief suspend.
Remove the misleading warning message. The code is behaving as
intended, just not in a way I anticipated when I added the warning
message a few years ago.
Discovered by kettenis@. Root cause isolated with kettenis@.
Link: https://marc.info/?l=openbsd-tech&m=166790845619897&w=2
ok mlarkin@ kettenis@
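For a rough sense of scale: with an illustratively low counter frequency
and a full 32-bit counter mask, tc_delta() takes many hours to wrap, so
a brief suspend cannot overflow it. A small worked example (the numbers
are assumptions, not tied to any specific timecounter):

	#include <stdint.h>
	#include <stdio.h>

	int
	main(void)
	{
		uint64_t freq = 32768;			/* counter ticks per second */
		uint64_t wrap = (1ULL << 32) / freq;	/* seconds until a 32-bit delta wraps */

		/* 2^32 / 32768 = 131072 seconds, i.e. about 36 hours. */
		printf("delta wraps after %llu seconds (~%.1f hours)\n",
		    (unsigned long long)wrap, wrap / 3600.0);
		return 0;
	}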
|
|
Removes a lock around an atomic write; this lock was causing slowdowns
since the lock being requested is nearly always unavailable because it
is held while the VM is running.
Noticed by claudio@, help from mpi@, dlg@ and claudio@.
ok dv
|
|
ifconfig(8) -C is the only user in base and the if_clone_attach() comment
explains how this list is being built during autoconf(9).
After that it is only ever read. Multiple threads may traverse the list in
parallel and reading the `int' count is atomic.
OK mvs
|
|
After this mechanical move, I can unlock the individual SIOCG* in there.
OK mvs
|
|
Switch arm64 to the clockintr(9) subsystem.
- Remove the custom per-CPU clock interrupt schedule from agtimer(4).
- Remove the custom randomized statclock() pieces from agtimer(4).
- Add agtimer_rearm(), agtimer_trigger(), and wire up agtimer_intrclock.
There is one wart:
- The AArch64 spec says that a value written to CNTV_TVAL_EL0 is
"treated as a signed 32-bit integer" [1]. kettenis@ doesn't know
what to make of this. I'm capping the value at INT32_MAX for
now. It's possible I am misreading this, though.
Tested by kettenis@ on his Apple M1 mini. Tested by me on my
Raspberry Pi 4B.
Link: https://marc.info/?l=openbsd-tech&m=166776342503304&w=2
[1] "Arm Architecture Reference Manual for A-profile architecture"
issue I.a, section D17.11.27 ("CNTV_TVAL_EL0").
ok kettenis@
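A minimal sketch of the capping described above; the helper name and the
nanosecond-to-cycle conversion are illustrative, not the exact agtimer(4)
code:

	#include <stdint.h>

	/* Clamp a one-shot interval to what CNTV_TVAL_EL0 can represent. */
	static inline uint64_t
	agtimer_tval_clamp(uint64_t nsecs, uint64_t freq_hz)
	{
		/* assumes nsecs is small enough that the product fits in 64 bits */
		uint64_t cycles = nsecs * freq_hz / 1000000000ULL;

		if (cycles > INT32_MAX)
			cycles = INT32_MAX;	/* TVAL is a signed 32-bit downcount */
		return cycles;
	}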
|
|
|
|
Switch amd64 to the clockintr(9) subsystem. There are lots of little
changes, but the big ones are listed here.
When using the local apic timer:
- Run the timer in one-shot mode.
- lapic_delay() is gone. We can't use it to delay(9) when running
the timer in one-shot mode.
- Add a randomized statclock(); stathz = hz.
- Add support for switching to profhz when profiling is enabled;
profhz = stathz * 10.
When using the i8254/mc146818:
- i8254's clockintr() no longer has a monopoly on hardclock().
- mc146818's rtcintr() no longer has a monopoly on statclock().
- In profiling mode, the statclock() will drift very slightly
because (profhz = 1024) does not divide evenly into one billion.
We could avoid this by setting (profhz = 512) instead and
programming the RTC to run at that rate.
Early revisions reviewed by mlarkin@. Extensively tested by mlarkin@
on a variety of physical and virtual hardware. Additional testing
from dv@ and jmc@.
Link: https://marc.info/?l=openbsd-tech&m=166776339203279&w=2
ok kettenis@ mlarkin@
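The drift arithmetic is easy to check: one second is 10^9 ns = 2^9 * 5^9,
so 512 divides it evenly while 1024 leaves a remainder. A tiny standalone
check (not kernel code):

	#include <stdio.h>

	int
	main(void)
	{
		/* 10^9 / 1024 leaves a remainder, so a 1024 Hz statclock drifts. */
		printf("profhz=1024: %u ns/tick, remainder %u\n",
		    1000000000U / 1024, 1000000000U % 1024);
		/* 512 = 2^9 divides 10^9 = 2^9 * 5^9 evenly: no drift. */
		printf("profhz=512:  %u ns/tick, remainder %u\n",
		    1000000000U / 512, 1000000000U % 512);
		return 0;
	}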
|
|
parking CPUs in a WFE/WFI loop.
ok deraadt@, mlarkin@
|
|
found in pfsync_insert_state(). It is caused by two packets which happen
to belong to the same session. Think of a UDP stream or two TCP SYN
packets transmitted almost simultaneously. The first such packet wins
the state lock and inserts the state into the table. The second packet
waits for the state lock as a reader. As soon as the first packet is
done with state creation it drops the lock and is about to send an
S_INS message to its peer via pfsync. The second packet meanwhile
obtains the state lock as a reader. It finds the state created by the
first packet. Later the second packet also finds out the state needs to
be updated, because sync_state is still set to PFSYNC_S_NONE. The
second packet puts the state on the snapshot list, marking it as S_UPD.
All this happens before the first packet has a chance to make progress.
Think of the first packet losing the CPU after dropping the write lock.
Once the first packet gets running again it trips the KASSERT() because
sync_state is set to S_UPD.
tested by hrvoje@
OK dlg@
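To make the interleaving concrete, here is a contrived userland
reconstruction of the race as described; the rwlock, the sync_state
values and the thread bodies are stand-ins for pf's locking, not the
actual pfsync code:

	#include <assert.h>
	#include <pthread.h>
	#include <unistd.h>

	enum { S_NONE, S_INS, S_UPD };		/* stand-ins for PFSYNC_S_* */

	static pthread_rwlock_t state_lock = PTHREAD_RWLOCK_INITIALIZER;
	static int state_exists;
	static int sync_state = S_NONE;

	static void *
	first_packet(void *arg)
	{
		(void)arg;
		pthread_rwlock_wrlock(&state_lock);
		state_exists = 1;		/* create and insert the state */
		pthread_rwlock_unlock(&state_lock);

		usleep(10000);			/* simulate losing the CPU here */

		/* pfsync_insert_state(): queue an S_INS message for the peer. */
		assert(sync_state == S_NONE);	/* trips: the reader already set S_UPD */
		sync_state = S_INS;
		return NULL;
	}

	static void *
	second_packet(void *arg)
	{
		(void)arg;
		pthread_rwlock_rdlock(&state_lock);
		if (state_exists && sync_state == S_NONE)
			sync_state = S_UPD;	/* queue a state update */
		pthread_rwlock_unlock(&state_lock);
		return NULL;
	}

	int
	main(void)
	{
		pthread_t a, b;

		pthread_create(&a, NULL, first_packet, NULL);
		usleep(1000);			/* let the writer insert first */
		pthread_create(&b, NULL, second_packet, NULL);
		pthread_join(b, NULL);
		pthread_join(a, NULL);		/* the assert aborts before this */
		return 0;
	}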
|
|
Another mechanical diff without semantic changes to avoid churn in actual
unlocking diffs.
OK mpi
|
|
We can't use the HPET to delay(9) after we halt it during suspend.
Disable acpihpet_delay() before we halt the HPET and reenable it after
we restart the HPET during resume.
ok mlarkin@
|
|
Not all of the clocks with a delay(9) implementation necessarily keep
ticking across suspend/resume. We need a clean way to reverse
delay_init() during suspend when those clocks stop ticking.
Hence, delay_fini(). delay_fini() resets delay_func() to
i8254_delay() if the given function pointer is the active delay(9)
implementation.
ok mlarkin@
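A minimal userland model of the described semantics; delay_func,
delay_init() and i8254_delay() here are stand-ins for the kernel's
(the real delay_init() also takes a quality argument):

	#include <stdio.h>

	static void i8254_delay(int usec) { printf("i8254 delay %d\n", usec); }
	static void hpet_delay(int usec)  { printf("hpet delay %d\n", usec); }

	static void (*delay_func)(int) = i8254_delay;

	static void
	delay_init(void (*fn)(int))
	{
		delay_func = fn;
	}

	static void
	delay_fini(void (*fn)(int))
	{
		/* Only fall back if fn is the active implementation. */
		if (delay_func == fn)
			delay_func = i8254_delay;
	}

	int
	main(void)
	{
		delay_init(hpet_delay);		/* boot: a better clock shows up */
		delay_fini(hpet_delay);		/* suspend: the HPET is halted */
		delay_func(10);			/* safely back on i8254_delay */
		return 0;
	}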
|
|
Not all of the clocks with a delay(9) implementation necessarily keep
ticking across suspend/resume. We need a clean way to reverse
delay_init() during suspend when those clocks stop ticking.
Hence, delay_fini(). delay_fini() resets delay_func() to
i8254_delay() if the given function pointer is the active delay(9)
implementation.
ok mlarkin@
|
|
ok mpi@, jsg@, phessler@, patrick@
|
|
mostly been there, it only needed to be hooked up to our infrastructure.
With this I can e.g. correctly see the lid state on the x13s.
ok kettenis@
|
|
|
|
This is a mechanical diff without semantic changes, locking ioctls
individually inside ifioctl() rather than all of them around it.
This allows us to unlock ioctls one by one.
OK mpi
|
|
|
|
Accesses to data structures used by these syscalls are serialized by the
VM map lock with the exception of file mappings which are still protected
by the KERNEL_LOCK().
Unlocking this set of syscalls improves most userland workloads.
Tested by many including robert@ (since 2 years), mlarkin@, kn@, sdk@,
jca@, aoyama@, naddy@, Scott Bennett and others. Thanks to all!
Joint work with kn@.
ok robert@, aja@, kettenis@, kn@, deraadt@, beck@
|
|
any, don't try and print it, and especially don't error out.
Tested on Lenovo x13s (myself) and Pinebook Pro (kn@)
ok kn@
|
|
UEFI already initializes those, so we can simply make use of that.
That said, the ctrl/dbi region isn't the first in the register list, so
instead try and look it up first and use it if available. Furthermore,
the ATU region isn't part of the ctrl/dbi region, so if we are able to
retrieve a separate reg for the ATU, use that instead. Some reshuffling
is necessary to make that work.
Tested on my Lenovo x13s and the MacchiatoBin
ok kettenis@
|
|
cell is used as a mask for SMR to match a number of IDs. So far we have
asserted that it's always 1, so loosen the restriction and pass both cells
instead of only the sid.
ok kettenis@
|
|
ok patrick@
|
|
hrvoje popovski showed me pfsync blowing up with this. im backing
it out quickly in case something else at the hackathon makes it
harder to do later.
kn@ agrees
|
|
to monitor state changes of the kernel device tree
input from and ok dlg@, deraadt@
|
|
this also avoids holding NET_LOCK too long.
the main change is done by running the purge tasks in systqmp instead
of systq. the pf state list was recently reworked so iteration over
the state can be done without blocking insertions.
however, scanning a lot of states can still take a lot of time, so
this also makes the state list scanner yield if it has spent too
much time running.
the other purge tasks for source nodes, rules, and fragments have
been moved to their own timeout/task pair to simplify the time
accounting.
in my environment, before this change pf purges often took 10 to
50ms. the softclock thread it runs next to often took a similar
amount of time, presumably because they ended up spinning waiting
for each other. after this change the pf_purges are more like 6 to
12ms, and dont block softclock. most of the variability in the runs
now seems to come from contention on the net lock.
tested by me sthen@ chris@
ok sashan@ kn@ claudio@
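the yielding scan might look roughly like this; an illustrative
standalone sketch with invented names, not the actual pf_purge code:

	#include <stdbool.h>
	#include <stddef.h>
	#include <stdint.h>
	#include <time.h>

	#define PURGE_BUDGET_NSEC	1000000ULL	/* ~1ms per run */

	struct state { struct state *next; /* ... */ };

	static struct state *purge_cursor;	/* resume point between runs */

	static uint64_t
	now_nsec(void)
	{
		struct timespec ts;

		clock_gettime(CLOCK_MONOTONIC, &ts);
		return (uint64_t)ts.tv_sec * 1000000000ULL + ts.tv_nsec;
	}

	/* Returns true if the scan finished, false if it yielded early. */
	static bool
	purge_scan(struct state *head)
	{
		uint64_t deadline = now_nsec() + PURGE_BUDGET_NSEC;
		struct state *s = purge_cursor != NULL ? purge_cursor : head;

		for (; s != NULL; s = s->next) {
			/* ... expire s if its timeout has passed ... */
			if (now_nsec() > deadline) {
				purge_cursor = s;	/* pick up here next run */
				return false;		/* caller reschedules the task */
			}
		}
		purge_cursor = NULL;
		return true;
	}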
|
|
The read/write register routines for SVM didn't acknowledge RAX in
the VMCB as the de facto RAX state. When writing gprs, vmm should
update RAX in the VMCB. When reading, it should set the guest
regs state based on the VMCB.
Needed for proper mmio emulation in userland.
ok mlarkin@
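A hedged sketch of the described fix; the structure and register names
imitate vmm(4) conventions but are assumptions, not the actual diff:

	#include <stdint.h>

	struct vmcb { uint64_t v_rax; /* ... */ };	/* stand-in for the real VMCB */
	enum { VCPU_REGS_RAX = 0 };

	static void
	vcpu_writeregs_svm(struct vmcb *vmcb, const uint64_t *gprs)
	{
		/* RAX's canonical home is the VMCB, not the gpr save area. */
		vmcb->v_rax = gprs[VCPU_REGS_RAX];
	}

	static void
	vcpu_readregs_svm(const struct vmcb *vmcb, uint64_t *gprs)
	{
		gprs[VCPU_REGS_RAX] = vmcb->v_rax;
	}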
|
|
(SRTT) instead of the timestamp option. Since the timestamp option is
disabled on some OSes (e.g. Windows) or dropped by some
firewalls/routers, in such cases the window size had been fixed at
16KB, which keeps throughput very low on high-latency networks.
Also switch "tcp_now" from a 2HZ tick counter to binuptime in
milliseconds to calculate the SRTT more accurately.
tested by krw matthieu jmatthew dlg djm stu stsp
ok claudio
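A sketch of the binuptime-to-milliseconds conversion described above;
struct bintime here mirrors the kernel's layout, and the helper name is
invented:

	#include <stdint.h>

	struct bintime {
		int64_t sec;
		uint64_t frac;		/* 64-bit binary fraction of a second */
	};

	static inline uint64_t
	bintime_to_ms(const struct bintime *bt)
	{
		/* frac * 1000 / 2^64, computed without 128-bit arithmetic */
		return (uint64_t)bt->sec * 1000 +
		    (((bt->frac >> 32) * 1000) >> 32);
	}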
|
|
Added in 2017 to
Reduce contention on the NET_LOCK() by moving the nd6 address expiration
task to the `softnettq`.
This should no longer be needed thanks to sys/net/if.c r1.652 in 2022:
Activate parallel IP forwarding. Start 4 softnet tasks. Limit the
usage to the number of CPUs.
Nothing in nd6_expire() or nd6_expire_timer_update() requires protection by
the kernel lock.
The interface list and per-interface address lists remain protected by the
net lock.
Tests by Hrvoje
OK mpi
|
|
hidden uses.
|
|
|
|
Based on a diff from gerhard@, ok kettenis@
|
|
of permitted addresses, done via .nofault* sections that end up in
the linked kernel's rodata.
ok deraadt@ kettenis@
|
|
|
|
it wraps pf_state_export and has the same arguments and return type.
pfsync can just call pf_state_export instead.
ok clang
|
|
Mischa Peters reported a performance regression in 7.2 when hosting
numerous guests under vmm(4). While iterating through the list of
vms to service an ioctl, vmm was triggering excessive wakeup calls
due to the refcnt hitting zero.
Much guidance from dlg@ and testing from Mischa. OK mlarkin@.
|
|
for em 82575, 82576, i350, and i210.
Additional testing by Hrvoje Popovski
OK dlg@
|
|
this is straightening the deck chairs. the state import and export
code are used by both the pf ioctls and pfsync, but the export code
is in pf.c and the import code is in if_pfsync. if pfsync was
disabled then the ioctl stuff wouldnt link.
moving the import code to pf.c makes it more symmetrical(?) and
robust.
tweaks and ok from kn@ sashan@
|
|
ok kettenis@
|
|
ok kettenis@
|
|
this provides a 1:1 relationship of pfopen() calls to pfclose()
calls. in turn, this makes it a lot easier to track stuff allocated
by a process and then clean it up if that process goes away
unexpectedly. the unique dev_t provided by the cloning machinery
gives us a good identifier to track this state with too.
discussed with h2k22
ok sashan@
deraadt@ agrees this is a good time to put this in
|
|
Tested on LUNA-88K2 with 4bpp/8bpp framebuffer by me.
|
|
on ACPI.
ok kettenis@
|