src - OpenBSD base system

Age	Commit message (Collapse)	Author
2014-07-01	take the biglock before calling the xs completion handler.	David Gwynne
	should be safe to call the midlayer io path without the biglock now.
2014-07-01	take the biglock when calling an adapters scsi_cmd handler.	David Gwynne

2014-07-01	start on being able to safely run io through the midlayer without	David Gwynne
	the kernel biglock. the plan is to have the midlayer assume its running without the biglock, but that it cant call adapters or devices without taking the biglock first. this diff just wraps the calls to the adapter iopool get and put handlers up in the biglock. this is safe now because of kettenis' commit to src/sys/kern/init_main.c r1.120. ive been running this in various places since early 2011.
2014-05-01	move pointer use to after a NULL pointer check	Jonathan Gray
	ok dlg@
2014-04-22	factor out the code that figures out whether you're probing or detaching	David Gwynne
	a whole bus, a target, or a specific lun on a target from the bioctl and scsi_req paths. i want to reuse this factored code for something claudio wants.
2014-04-20	make the status handler more like rdac and emc. the big functional change	David Gwynne
	is to check xs->status on completion to make sure it worked.
2014-04-19	move scsi_xs_put after checks that use fields in the xs	Jonathan Matthew
	ok dlg@
2014-04-19	implement emc_mpath_checksense() according to what my cx500 throws.	David Gwynne
	tested by jmatthew@
2014-04-17	rework this to implement the active path checks when mpath asks for	David Gwynne
	it rather than on attach. just need to implement a sense handler to detect failover and this is done. thanks to jmatthew@ for plugging this together again for me.
2014-04-03	massage the preferred path detection to happen when mpath asks for	David Gwynne
	a paths status, rather than on attach. the status it returns depends on the type of device you have. hds provides two types of arrays, symmetric and asymmetric. on a symmetric device you can shove io down any path to any port on any controller and it will work. on symmetric devices we say all paths are part of the same group, and unconditionally return active path status to any check request. on asymmetric devices we group paths by which controller in teh array they connect to. the controllers return whether theyre providing a preferred path via a couple of status bits in a hds specific vpd page, so we query that and return the state of the bits. unfortunately hds arrays dont report change of lun ownership in any way, so we dont currently have any way of failing over at the moment. ill have to think about the least worst way to handle that. tested by deraadt@ on hppa
2014-04-02	whitespace fix, no functional change	David Gwynne

2014-04-02	skey == SKEY_ILLEGAL_REQUEST && ASC_ASCQ(sense) == 0x9401 means	David Gwynne
	invalid request due to current lu ownership
2014-02-19	If a disk returns a size of 0, treat it as an error to let the	Martin Pieuchot
	driver re-probe for its capacity. Allow to fully recognized Lexar JumpDrive S33 USB 3.0 sticks. ok krw@, dlg@
2014-02-13	if an attached sd(4) is readonly, make sure it's noticable in the	Alexander Hall
	dmesg, or write operations just fail with EACCES for no obvious reason ok krw@ tedu@
2014-01-31	SUNW SUNWGS INT FCBPL can be considered an asym device now we can uniquely	David Gwynne
	identify them over multiple paths using their wwnn.
2014-01-31	if a device doesnt have device ids or serial numbers, try using node_wwn to	David Gwynne
	generate a devid. if its an fc device this is good enough.
2014-01-30	SGI branded seagate disks work fine	David Gwynne

2014-01-27	poison the io "allocated" by the default pool allocator so any attempt to	David Gwynne
	use it should cause a fault. based on discussion with miod@
2014-01-18	rename scsi_ioh_runqueue to scsi_iopool_run, and make it available	David Gwynne
	outside scsi_base.c. this will allow adapters to restrict access to iopool resources based on some state, and then kick the pending requests on the pool when the state comes good again. ive been avoiding this for a long time, but it is the least worst way to deal with some uses of XS_NO_CCB. discussion with kettenis@ helped me decide this was right.
2013-12-06	Add a DVACT_WAKEUP op to the *_activate() API. This is called after the	Theo de Raadt
	kernel resumes normal (non-cold, able to run processes, etc) operation. Previously we were relying on specific DVACT_RESUME op's in drivers creating callback/threads themselves, but that has become too common, indicating the need for a built-in mechanism. ok dlg kettenis, tested by a sufficient amount of people
2013-11-26	1 << 31 cleanup. Eitan Adler pointed out that there has been a	Theo de Raadt
	resurrection of the bad idiom in the tree. sufficient review by miod, kettenis, tedu
2013-11-23	fix format string; OK deraadt@	Gleydson Soares

2013-11-01	Sprinkle (long long) casts where %lld is being used to print daddr_t	Kenneth R Westerback
	variables. Some random whitespace/knf repairs encountered on the way. ok miod@ on inspection, feedback & more suggestions from millert@
2013-10-07	typo	Miod Vallat

2013-10-03	Print daddr_t variables with %lld, u_int64_t variables with %llu.	Kenneth R Westerback

2013-10-02	Use u_int64_t instead of daddr_t parameters to sd_cmd_rw*() functions.	Kenneth R Westerback
	Ditto disksize field of sd_softc and a couple of local calculation variables. scsi/* now daddr_t clean except where they really are 512-byte blocks.
2013-09-27	scsi_size() is now used only by cd(4). So move it from scsi_base.c	Kenneth R Westerback
	to cd.c and call it cd_size(), like sd_size() lives in sd.c. Tweak some daddr_t variables to u_int64_t on the way, when they are for disk sector numbers, not 512-byte block numbers.
2013-09-19	Tweak types to keep daddr_t address and sector address separate.	Kenneth R Westerback
	Prefer DL_ macros over handrolling. Fix the loop to allow for bigger (highly unlikely) bunches of bits to be broken up into rw_10 sized (<= UINT32_MAX sectors) chunks. Add check to make sure i/o request starts at a sector address.
2013-09-15	cddump() takes a daddr_t parameter. Call that parameter 'blkno' and not	Kenneth R Westerback
	'secno'. This is what sddump() already does and consistant is good. No function change.
2013-09-15	Use DL_SECTOBLK() and DL_BLKTOSEC() to clarify code and remove	Kenneth R Westerback
	repeated handrolling of same code. Use daddr_t variable to calculate daddr_t return values, and u_int64_t variables to calculate disk sector values. No functional change.
2013-09-08	fix next path selection so if the current path is NULL (which can occur if	David Gwynne
	paths are lost and groups become empty) we dont try and do stuff with it that causes null derefs and awesome panics.
2013-09-03	DELL MD3060e works	David Gwynne

2013-08-29	rename scsi_sem_{enter,leave} to scsi_pending_{start,finish}. these are	David Gwynne
	the wrappers around handling of pending work, theyre not semaphores. names from tedu@ ok krw@ guenther@
2013-08-27	make path driver match routes return 8 so they will definitely be higher	David Gwynne
	than the real device drivers. ses returns 3 on some dells, which could be confusing for autoconf if it has to decide between that and a path driver.
2013-08-27	get rid of the different path scheduler types, which simplifies the	David Gwynne
	code that picks the next path. we assume roundrobin within a group of paths now. the asym sym(4) devices work around this by putting every path in its own group.
2013-08-27	these were forgotten in the change from pointing paths to groups instead	David Gwynne
	of devices. fixes compilation when theyre enabled. how embarrassment.
2013-08-27	make scsi_sem_leave only run again once, no matter how many times	David Gwynne
	other things scsi_sem_enter. the things protected by this do as much work as they can, so they only need to be told to try again once. this isnt a semaphore anymore (and probably never was) so there's a name change coming too.
2013-08-26	implement handling of group failover.	David Gwynne
	if a controller sends sense data back, the path driver can tell mpath that its indicating failover which kicks off an iteration over all the groups until one says its active. if no groups claim to be active, a timeout fires the process off again after a second. you can start controller handover on rdac (well, an md3200i is all i had to test with, others might need more work) and everything keeps going. ill try to get to emc and hds working when i can poke hardware again.
2013-08-26	feng shui	David Gwynne

2013-08-26	all paths are considered active, not in some unknown state.	David Gwynne

2013-08-26	all FUJITSU MA disks ive found seem ok with being behind mpath.	David Gwynne

2013-08-26	pull rdac_c9 apart and use its guts to implement the status check	David Gwynne
	handler for the mpath midlayer to call. the status check is completely event driven. a group is considered active if the VOLACCESSCTL vpd page has some bits set.
2013-08-26	rename rdac_c8 to rdac_extdevid and use less magic numbers in the process.	David Gwynne

2013-08-26	when i first imagined how paths on mpath worked, i thought the	David Gwynne
	midlayer would be able to call things on paths to explicitely online or offline them. turns out thats not how the Real World(tm) works, instead its better to wait for failure and probe for the status of paths, and pick the active group of paths from that. there's even evidence that the mechanisms for forcing controllers into active/passive roles from the scsi initiator are being deprecated. they expect hosts to be able to cope with arbitrary controller role changes and failover accordingly. this replaces the online and offline function pointers in the path_ops structure with a status check function pointer. instead of returning a state, the checker is expected to call mpath_path_status() when its finished figuring out what the state is.
2013-08-26	my DELL MD3000i seems to return skey illegal request + asc 0x94 +	David Gwynne
	ascq 0x01, or skey unit attention + asc 0x8b + ascq 0x02 when i tell it to change controller ownership of a volume. i wish i knew what the numbers really meant, but alas, there's no doco cos this is all magical and unique apparently. anyway, empirically this can be used in rdac_checksense to return MPATH_SENSE_FAILOVER.
2013-08-26	checksense handlers in path drivers can return MPATH_SENSE_DECLINED	David Gwynne
	(who can tell ive spent time in web servers) to say they decline interpreting the sense data, or MPATH_SENSE_FAILOVER to say the sense data is from the controller saying its failed over. all path drivers currently decline handling sense data.
2013-08-26	free the dev slot on group allocation failure if we're building a new dev.	David Gwynne

2013-08-26	introduce the idea of groups of paths. mpath had stuff to managed	David Gwynne
	devices and paths. devices are what mpath presents as targets on its scsibus, and paths are the things attached to hardware controllers that are available to shove io down to the actual real target. all paths were considered usable for handling io on behalf of a device. this adds groups in between devices and paths. only paths on the first group in the list will now be used to handle io now. sym devices will only have one group. asym devices will treat each path as a different group. rdac, emc, and hds will group paths based on which controller in the array theyre connected to. in the future we will intercept sense data from passive controllers and use that to start running checks to pick a new primary group so we can handle controller failover situations. the group id in hds(4) is currently busted, everything else should be correct.
2013-08-26	rdac_groupid queries which controller the path is attached to, which we'll	David Gwynne
	use as the group id later on.
2013-08-26	now that mpath is attached before any hardware, we can simplify the code.	David Gwynne
	firstly, move the array of targets that mpath presents into the softc. secondly, when paths call the mpath api we can simply check if the softc global is not null rather than walk through autoconf data. mpath will either have already attached or will never attach in the future.