src - OpenBSD base system

Age	Commit message (Collapse)	Author
2019-10-17	in6_setsockaddr and in6_setpeeraddr can't fail, so let them return void.	David Gwynne
	this also brings them in line with the AF_INET equivalents. ok visa@ bluhm@
2019-10-16	tsleep(9) -> tsleep_nsec(9)	Martin Pieuchot
	ok cheloha@, visa@
2019-10-07	ip_ether.c is empty, and now unlinked from the build.	David Gwynne
	ok jca@ deraadt@ claudio@ visa@
2019-10-04	gif shouldn't include netinet/ip_ether.h, cos gif doesnt do etherip.	David Gwynne
	ip_ether.h is where netinet/ip_ipip.h got the forward declaration for struct tdb from though, so fix that before cutting ip_ether.h out of gif.
2019-10-04	get rid of prototypes for mplsip_input and mplsip_output. they don't exist.	David Gwynne

2019-09-30	remove the "copy function" argument to bpf_mtap_hdr.	David Gwynne
	it was previously (ab)used by pflog, which has since been fixed. apart from that nothing else used it, so we can trim the cruft. ok kn@ claudio@ visa@ visa@ also made sure i fixed ipw(4) so i386 won't break.
2019-09-02	Fix a route use after free in multicast route. Move the rt_mcast_del()	Alexander Bluhm
	out of the rtable_walk(). This avoids recursion to prevent stack overflow. Also it allows freeing the route outside of the walk. Now mrt_mcast_del() frees the route only when it is deleted from the routing table. If that fails, it must not be freed. After the route is returned by mfc_find(), it is reference counted. Then we need a rtfree(), but not in the other caes. Move rt_timer_remove_all() into rt_mcast_del(). OK mpi@
2019-08-06	When we needed the kernel lock for local IP packet delivery, mpi@	Alexander Bluhm
	introduced a queue to grab the lock for multiple packets. Now we have only netlock for both IP and protocol input. So the queue is not necessary anymore. It just switches CPU and decreases performance. So remove the inet and inet6 ip queue for local packets. To get TCP running on loopback, we have to queue once between TCP input and output of the two sockets. So use the loopback queue in looutput() unconditionally. OK visa@
2019-07-25	Add IFXF_AUTOCONF4 to if_xflags to match IFXF_AUTOCONF6. Let	Kenneth R Westerback
	ifconfig set/unset it. ok deraadt@ kmos@
2019-07-17	Introduce ETHER_IS_BROADCAST/ANYADDR/EQ() and use them where appropriate.	Martin Pieuchot
	ok dlg@, sthen@, millert@
2019-07-15	Initialize struct inpcb pool not on demand, but during initialization.	Alexander Bluhm
	Removes a global variable and avoids MP problems. OK mpi@ visa@
2019-07-12	Count the number of TCP SACK options that were dropped due to the	Alexander Bluhm
	sack hole list length or pool limit. OK claudio@
2019-07-10	Received SACK options are managed by a linked list at the TCP socket.	Alexander Bluhm
	There is a global tunable limit net.inet.tcp.sackholelimit, default is 32768. If an attacker manages to attach all these sack holes to a few TCP connections, the lists may grow long. Traversing them might cause higher CPU consumption on the victim machine. In practice such a situation is hard to create as the TCP retransmit and 2*msl timer flush the list periodically. For additional protection, enforce a per connection limit of 128 SACK holes in the list. reported by Reuven Plevinsky and Tal Vainshtein discussed with claudio@ and procter@; OK deraadt@
2019-07-08	free(9) sizes for M_RTABLE.	Martin Pieuchot
	ok kn@
2019-07-05	add ac_trunkport to arpcom so trunks can coordinate owning an interface	David Gwynne
	Ethernet interfaces can be used by trunk(4), and i'm about to commit a new aggr(4) driver which should not be able to use an interface while trunk owns it and visa versa.
2019-06-21	Prevent recursions by not deleting entries inside rtable_walk(9).	Martin Pieuchot
	rtable_walk(9) now passes a routing entry back to the caller when a non zero value is returned and if it asked for it. This allows us to call rtdeletemsg()/rtrequest_delete() from the caller without creating a recursion because of rtflushclone(). Multicast code hasn't been adapted and is still possibly creating recursions. However multicast route entries aren't cloned so if a recursion exists it isn't because of rtflushclone(). Fix stack exhaustion triggered by the use of "-msave-args". Issue reported by DÃ¡niel LÃ©vai on bugs@ confirmed by and ok bluhm@.
2019-06-13	In arp_rtrequest and nd6_rtrequest return early if the RTF_MPLS flag is	Claudio Jeker
	set. These mpls routes use the rt_llinfo structure to store the MPLS label and would confuse the arp and nd6 code. OK bluhm@ anton@ Reported-by: syzbot+927e93a362f3ae33dd9c@syzkaller.appspotmail.com
2019-06-13	Copy the user provided sockaddr into a normalized sockaddr in rtrequest()	Claudio Jeker
	before adding it to the routing table. The rtable code is doing memcmp() of those rt_dest sockaddrs so it is important that they are stored in a canonical form. To do this struct domain is extended to include the sockaddr size for this address family. OK bluhm@ anton@ Reported-by: syzbot+10fe9cd8d0211c562ead@syzkaller.appspotmail.com
2019-06-10	use m_microtime instead of microtime for SO_TIMESTAMP socketopt handling	David Gwynne
	drivers can set ph_timestamp when packets are received by the hardware, which should be more accurate and cheaper than getting the clock when the packet is queued on the socket.
2019-06-10	Use mallocarray(9) & put some free(9) sizes for M_IPMOPTS allocations.	Martin Pieuchot
	ok semarie@, visa@
2019-06-04	Add missing NULL check for the protocol control block (pcb) pointer in	anton
	mrt{6,}_ioctl. Calling shutdown(2) on the socket prior to the ioctl command can cause it to be NULL. ok bluhm@ claudio@ Reported-by: syzbot+bdc489ecb509995a21ed@syzkaller.appspotmail.com Reported-by: syzbot+156405fdea9f2ab15d40@syzkaller.appspotmail.com
2019-05-11	unbreak the build without IPSEC.	Sebastian Benoit
	ok claudio@ deraadt@
2019-04-28	Removes the KERNEL_LOCK() from bridge(4)'s output fast-path.	Martin Pieuchot
	This redefines the ifp <-> bridge relationship. No lock can be currently used across the multiples contexts where the bridge has tentacles to protect a pointer, use an interface index. Tested by various, ok dlg@, visa@
2019-04-23	a first cut at converting some virtual ethernet interfaces to if_vinput	David Gwynne
	this let's input processing bypass ifiqs. there's a performance benefit from this, and it will let me tweak the backpressure detection mechanism that ifiqs use without impacting on a stack of virtual interfaces. ive tested all of these except mpw, which i will end up testing soon anyway.
2019-04-22	In in_cksum() and in6_cksum() convert types to C99 style and make	Alexander Bluhm
	both functions consistent. In in_cksum() panic if len is longer than mbuf, but in in6_cksum() do not panic if off and len match exactly to the end of mbuf. OK claudio@
2019-04-05	In debug mode print TCP flag names to console correctly.	Alexander Bluhm
	from Mitchell Krome
2019-02-13	change rt_ifa_add and rt_ifa_del so they take an rdomain argument.	David Gwynne
	this allows mpls interfaces (mpe, mpw) to pass the rdomain they wish the local label to be in, rather than have it implicitly forced to 0 by these functions. right now they'll pass 0, but it will soon be possible to have them rx packets in other rdomains. previously the functions used ifp->if_rdomain for the rdomain. everything other than mpls still passes ifp->if_rdomain. ok mpi@
2019-02-10	remove the implict RTF_MPATH flag that rt_ifa_add() sets on new routes.	David Gwynne
	MPLS interfaces (ab)use rt_ifa_add for adding the local MPLS label that they listen on for incoming packets, while every other use of rt_ifa_add is for adding addresses on local interfaces. MPLS does this cos the addresses involved are in basically the same shape as ones used for setting up local addresses. It is appropriate for interfaces to want RTF_MPATH on local addresses, but in the MPLS case it means you can have multiple local things listening on the same label, which doesn't actually work. mpe in particular keeps track of in use labels to it can handle collisions, however, mpw does not. It is currently possible to have multiple mpw interfaces on the same local label, and sharing the same label as mpe or possible normal forwarding labels. Moving the RTF_MPATH flag out of rt_ifa_add means all the callers that still want it need to pass it themselves. The mpe and mpw callers are left alone without the flag, and will now get EEXIST from rt_ifa_add when a label is already in use. ok (and a huge amount of patience and help) mpi@ claudio@ is ok with the idea, but saw a much much earlier solution to the problem
2019-02-06	Fix a possible mbuf leak in tcp_usrreq(). Make the error handling	Alexander Bluhm
	more consistent to the other protocols' usrreq functions. OK visa@ claudio@
2019-02-04	Avoid an mbuf double free in the oob soreceive() path. In the	Alexander Bluhm
	usrreq functions move the mbuf m_freem() logic to the release block instead of distributing it over the switch statement. Then the goto release in the initial check, whether the pcb still exists, will not free the mbuf for the PRU_RCVD, PRU_RVCOOB, PRU_SENSE command. OK claudio@ mpi@ visa@ Reported-by: syzbot+8e7997d4036ae523c79c@syzkaller.appspotmail.com
2019-01-20	Refresh arp entries that are about to expire. Once their life time is less	Claudio Jeker
	then 1/8 of net.inet.ip.arptimeout the system will send out a arp request about every 30 seconds until either the entry is updated or expired. Not refreshing arp entries will result in packet drop every time a entry expires which is not ideal for important gateway entries. Came up with this after a discussion with deraadt@. OK benno@ deraadt@
2019-01-18	Bring back the ip_pcbopts() refactor. Pad the option buffer and therefor	Claudio Jeker
	the mbuf to the next word length as it is required by the standard. Also use the correct offset from the input mbuf. OK visa@, input & OK bluhm@
2019-01-18	Revert Rev 1.351, the change is not quite right yet.	Claudio Jeker

2019-01-08	Botched up an if conditional in the last commit. The IP length needs to	Claudio Jeker
	bigger than the IP header len to be valid. With this I can traceroute again.
2019-01-07	Validate the version, and all length fields of IP packets passed to a raw socket	Claudio Jeker
	with INP_HDRINCL. There is no reason to allow badly constructed packets through our network stack. Especially since they may trigger diagnostic checks further down the stack. Now EINVAL is returned instead which was already used for some checks that happened before. OK florian@ Reported-by: syzbot+0361ed02deed123667cb@syzkaller.appspotmail.com
2019-01-06	Rewrite ip_pcbopts() to fill a fresh mbuf with the ip options instead	Claudio Jeker
	of fiddling with the user supplied mbuf and then copy it at the end. OK visa@
2019-01-03	Replace a funky 'else switch' construct into something that is equal but	Claudio Jeker
	a lot easier to read. The if can simply return the error and so the else branch is no longer needed. Input and OK dhill@
2018-12-25	rework icmp6_error() to be closer to icmp_error()	denis
	input & OK mpi@
2018-12-20	Replace a wrong poor mans m_trailingspace() with the real thing. The mbuf	Claudio Jeker
	passed to ip_pcbopts could be a cluster and so the size check is all wrong. found by Greg Steuck; OK bluhm@ Reported-by: syzbot+c2543ae6b6692a5843e3@syzkaller.appspotmail.com eVS: ----------------------------------------------------------------------
2018-12-17	Switch from timeout_add with tvtohz to just timeout_add_tv. Now this change	Claudio Jeker
	will reduce the sleep time by one tick which doesn't matter in the common case. The code never passes a true 0 timeval to timeout_add_tv so the code will always sleep for at least 1 tick which is good enough. OK kn@, florian@, visa@, cheloha@
2018-12-11	split ether_output into resolution, encapsulation, and output functions	David Gwynne
	if if_output can be overridden on ethernet interfaces, it will allow things like vlan to do it's packet encapsulation during output before putting the packet directly on the underlying interface for output. this has two benefits. first, it can avoid having ether_output on pseudo interfaces recurse, which makes profiling of the network stack a lot clearer. secondly, and more importantly, it allows pseudo ethernet interface packet encapsulation to by run concurrently by the stack, rather than having packets unnecessarily serialied by an ifq. this diff just splits ether_output up, it doesnt have any interface take advantage of it yet. tweaks and ok claudio@
2018-12-04	Use m_align() and while there reorder the pkthdr initalisation a bit.	Claudio Jeker
	This also makes the IPv4 and IPv6 code more similar. OK phessler@
2018-12-03	In PRU_DISCONNECT don't fall through into PRU_ABORT since the latter frees	Claudio Jeker
	the inpcb apart from the disconnect. Just call soisdisconnected() and clear the inp->inp_faddr since the socket is still valid after a disconnect. Problem found by syzkaller via Greg Steuck OK visa@ Fixes: Reported-by: syzbot+2cd350dfe5c96f6469f2@syzkaller.appspotmail.com Reported-by: syzbot+139ac2d7d3d60162334b@syzkaller.appspotmail.com Reported-by: syzbot+02168317bd0156c13b69@syzkaller.appspotmail.com Reported-by: syzbot+de8d2459ecf4cdc576a1@syzkaller.appspotmail.com
2018-11-30	MH_ALIGN -> m_align. In revarprequest() set the ph_rtableid so that	Claudio Jeker
	the function is doing the same initialisation as arprequest(). OK bluhm@
2018-11-28	Further cleanup of icmp_do_error.	Claudio Jeker
	- Use m_align() since it handles all cases - Use same rounding logic in the size check as in m_align() so all data will filt always. - consolidate pkthdr initalisation into one place - use m_prepend() instead of direct pointer manipulation (including the panic in case an underflow happens). OK bluhm@
2018-11-19	Retire dom_rtkeylen from struct domain. Nothing is using this anymore.	Claudio Jeker
	It was used by the original patricia tree. OK mpi@
2018-11-14	provide ip_tos_patch() for setting ip_tos and patching the ipv4 cksum.	David Gwynne
	previously the gif code would patch the tos field and not recalc the cksum, which would cause ip input code to drop the packet due to a cksum failure. the ipip code patched ip_tos and unconditionally recalculated the cksum, making it correct, but also wiping out any errors that may have been present before the recalculation. updating the cksum rather than replacing it lets cksum failures still fire. ip_tos_patch() is provided in the ecn code since it's because of ecn propagation that we need to update the tos field. internally it works like pf_patch_8 and pf_cksum_fixup, but since pf is optional it rolls its own code. procter may fix that in the future... ok claudio@
2018-11-10	Do not translate the EACCES error from pf(4) to EHOSTUNREACH anymore.	Alexander Bluhm
	It also translated a documented send(2) EACCES case erroneously. This was too much magic and always prone to errors. from Jan Klemkow; man page jmc@; OK claudio@
2018-11-09	M_LEADINGSPACE() and M_TRAILINGSPACE() are just wrappers for	Claudio Jeker
	m_leadingspace() and m_trailingspace(). Convert all callers to call directly the functions and remove the defines. OK krw@, mpi@
2018-11-09	Remove the last few XXX rdomain markers. Even those functions respect the	Claudio Jeker
	rdomain now and are therefor rdomain save. OK mpi@