summaryrefslogtreecommitdiff
path: root/sys/scsi
AgeCommit message (Collapse)Author
2014-04-03massage the preferred path detection to happen when mpath asks forDavid Gwynne
a paths status, rather than on attach. the status it returns depends on the type of device you have. hds provides two types of arrays, symmetric and asymmetric. on a symmetric device you can shove io down any path to any port on any controller and it will work. on symmetric devices we say all paths are part of the same group, and unconditionally return active path status to any check request. on asymmetric devices we group paths by which controller in teh array they connect to. the controllers return whether theyre providing a preferred path via a couple of status bits in a hds specific vpd page, so we query that and return the state of the bits. unfortunately hds arrays dont report change of lun ownership in any way, so we dont currently have any way of failing over at the moment. ill have to think about the least worst way to handle that. tested by deraadt@ on hppa
2014-04-02whitespace fix, no functional changeDavid Gwynne
2014-04-02skey == SKEY_ILLEGAL_REQUEST && ASC_ASCQ(sense) == 0x9401 meansDavid Gwynne
invalid request due to current lu ownership
2014-02-19If a disk returns a size of 0, treat it as an error to let theMartin Pieuchot
driver re-probe for its capacity. Allow to fully recognized Lexar JumpDrive S33 USB 3.0 sticks. ok krw@, dlg@
2014-02-13if an attached sd(4) is readonly, make sure it's noticable in theAlexander Hall
dmesg, or write operations just fail with EACCES for no obvious reason ok krw@ tedu@
2014-01-31SUNW SUNWGS INT FCBPL can be considered an asym device now we can uniquelyDavid Gwynne
identify them over multiple paths using their wwnn.
2014-01-31if a device doesnt have device ids or serial numbers, try using node_wwn toDavid Gwynne
generate a devid. if its an fc device this is good enough.
2014-01-30SGI branded seagate disks work fineDavid Gwynne
2014-01-27poison the io "allocated" by the default pool allocator so any attempt toDavid Gwynne
use it should cause a fault. based on discussion with miod@
2014-01-18rename scsi_ioh_runqueue to scsi_iopool_run, and make it availableDavid Gwynne
outside scsi_base.c. this will allow adapters to restrict access to iopool resources based on some state, and then kick the pending requests on the pool when the state comes good again. ive been avoiding this for a long time, but it is the least worst way to deal with some uses of XS_NO_CCB. discussion with kettenis@ helped me decide this was right.
2013-12-06Add a DVACT_WAKEUP op to the *_activate() API. This is called after theTheo de Raadt
kernel resumes normal (non-cold, able to run processes, etc) operation. Previously we were relying on specific DVACT_RESUME op's in drivers creating callback/threads themselves, but that has become too common, indicating the need for a built-in mechanism. ok dlg kettenis, tested by a sufficient amount of people
2013-11-261 << 31 cleanup. Eitan Adler pointed out that there has been aTheo de Raadt
resurrection of the bad idiom in the tree. sufficient review by miod, kettenis, tedu
2013-11-23fix format string; OK deraadt@Gleydson Soares
2013-11-01Sprinkle (long long) casts where %lld is being used to print daddr_tKenneth R Westerback
variables. Some random whitespace/knf repairs encountered on the way. ok miod@ on inspection, feedback & more suggestions from millert@
2013-10-07typoMiod Vallat
2013-10-03Print daddr_t variables with %lld, u_int64_t variables with %llu.Kenneth R Westerback
2013-10-02Use u_int64_t instead of daddr_t parameters to sd_cmd_rw*() functions.Kenneth R Westerback
Ditto disksize field of sd_softc and a couple of local calculation variables. scsi/* now daddr_t clean except where they really are 512-byte blocks.
2013-09-27scsi_size() is now used only by cd(4). So move it from scsi_base.cKenneth R Westerback
to cd.c and call it cd_size(), like sd_size() lives in sd.c. Tweak some daddr_t variables to u_int64_t on the way, when they are for disk sector numbers, not 512-byte block numbers.
2013-09-19Tweak types to keep daddr_t address and sector address separate.Kenneth R Westerback
Prefer DL_ macros over handrolling. Fix the loop to allow for bigger (highly unlikely) bunches of bits to be broken up into rw_10 sized (<= UINT32_MAX sectors) chunks. Add check to make sure i/o request starts at a sector address.
2013-09-15cddump() takes a daddr_t parameter. Call that parameter 'blkno' and notKenneth R Westerback
'secno'. This is what sddump() already does and consistant is good. No function change.
2013-09-15Use DL_SECTOBLK() and DL_BLKTOSEC() to clarify code and removeKenneth R Westerback
repeated handrolling of same code. Use daddr_t variable to calculate daddr_t return values, and u_int64_t variables to calculate disk sector values. No functional change.
2013-09-08fix next path selection so if the current path is NULL (which can occur ifDavid Gwynne
paths are lost and groups become empty) we dont try and do stuff with it that causes null derefs and awesome panics.
2013-09-03DELL MD3060e worksDavid Gwynne
2013-08-29rename scsi_sem_{enter,leave} to scsi_pending_{start,finish}. these areDavid Gwynne
the wrappers around handling of pending work, theyre not semaphores. names from tedu@ ok krw@ guenther@
2013-08-27make path driver match routes return 8 so they will definitely be higherDavid Gwynne
than the real device drivers. ses returns 3 on some dells, which could be confusing for autoconf if it has to decide between that and a path driver.
2013-08-27get rid of the different path scheduler types, which simplifies theDavid Gwynne
code that picks the next path. we assume roundrobin within a group of paths now. the asym sym(4) devices work around this by putting every path in its own group.
2013-08-27these were forgotten in the change from pointing paths to groups insteadDavid Gwynne
of devices. fixes compilation when theyre enabled. how embarrassment.
2013-08-27make scsi_sem_leave only run again once, no matter how many timesDavid Gwynne
other things scsi_sem_enter. the things protected by this do as much work as they can, so they only need to be told to try again once. this isnt a semaphore anymore (and probably never was) so there's a name change coming too.
2013-08-26implement handling of group failover.David Gwynne
if a controller sends sense data back, the path driver can tell mpath that its indicating failover which kicks off an iteration over all the groups until one says its active. if no groups claim to be active, a timeout fires the process off again after a second. you can start controller handover on rdac (well, an md3200i is all i had to test with, others might need more work) and everything keeps going. ill try to get to emc and hds working when i can poke hardware again.
2013-08-26feng shuiDavid Gwynne
2013-08-26all paths are considered active, not in some unknown state.David Gwynne
2013-08-26all FUJITSU MA disks ive found seem ok with being behind mpath.David Gwynne
2013-08-26pull rdac_c9 apart and use its guts to implement the status checkDavid Gwynne
handler for the mpath midlayer to call. the status check is completely event driven. a group is considered active if the VOLACCESSCTL vpd page has some bits set.
2013-08-26rename rdac_c8 to rdac_extdevid and use less magic numbers in the process.David Gwynne
2013-08-26when i first imagined how paths on mpath worked, i thought theDavid Gwynne
midlayer would be able to call things on paths to explicitely online or offline them. turns out thats not how the Real World(tm) works, instead its better to wait for failure and probe for the status of paths, and pick the active group of paths from that. there's even evidence that the mechanisms for forcing controllers into active/passive roles from the scsi initiator are being deprecated. they expect hosts to be able to cope with arbitrary controller role changes and failover accordingly. this replaces the online and offline function pointers in the path_ops structure with a status check function pointer. instead of returning a state, the checker is expected to call mpath_path_status() when its finished figuring out what the state is.
2013-08-26my DELL MD3000i seems to return skey illegal request + asc 0x94 +David Gwynne
ascq 0x01, or skey unit attention + asc 0x8b + ascq 0x02 when i tell it to change controller ownership of a volume. i wish i knew what the numbers really meant, but alas, there's no doco cos this is all magical and unique apparently. anyway, empirically this can be used in rdac_checksense to return MPATH_SENSE_FAILOVER.
2013-08-26checksense handlers in path drivers can return MPATH_SENSE_DECLINEDDavid Gwynne
(who can tell ive spent time in web servers) to say they decline interpreting the sense data, or MPATH_SENSE_FAILOVER to say the sense data is from the controller saying its failed over. all path drivers currently decline handling sense data.
2013-08-26free the dev slot on group allocation failure if we're building a new dev.David Gwynne
2013-08-26introduce the idea of groups of paths. mpath had stuff to managedDavid Gwynne
devices and paths. devices are what mpath presents as targets on its scsibus, and paths are the things attached to hardware controllers that are available to shove io down to the actual real target. all paths were considered usable for handling io on behalf of a device. this adds groups in between devices and paths. only paths on the first group in the list will now be used to handle io now. sym devices will only have one group. asym devices will treat each path as a different group. rdac, emc, and hds will group paths based on which controller in the array theyre connected to. in the future we will intercept sense data from passive controllers and use that to start running checks to pick a new primary group so we can handle controller failover situations. the group id in hds(4) is currently busted, everything else should be correct.
2013-08-26rdac_groupid queries which controller the path is attached to, which we'llDavid Gwynne
use as the group id later on.
2013-08-26now that mpath is attached before any hardware, we can simplify the code.David Gwynne
firstly, move the array of targets that mpath presents into the softc. secondly, when paths call the mpath api we can simply check if the softc global is not null rather than walk through autoconf data. mpath will either have already attached or will never attach in the future.
2013-08-26make scsi_{xsh,ioh}_{add,del} return whether they moved somethingDavid Gwynne
on or off the queues so things calling them can tell if something is or isnt going to happen.
2013-08-25move the prototypes of scsi_sem_{enter,leave} into a header so thingsDavid Gwynne
other than scsi_base.c can use them.
2013-07-05Tweak calculations of resid to avoid unsigned vs signed comparison andKenneth R Westerback
make code clearer. Pointed out by brad@ and his friend llvm.
2013-06-11final removal of daddr64_t. daddr_t has been 64 bit for a long enoughTheo de Raadt
test period; i think 3 years ago the last bugs fell out. ok otto beck others
2013-06-10spelling in comment fix (for real, there's no code change here).David Gwynne
2013-06-10we dont need mpath ccbs if all theyre used for is storing xfers on a list.David Gwynne
we can just use the SIMPLEQ bits that are in xfers directly. this cuts out a bunch of pools and iopools goo. less is more.
2013-06-10toshiba disks in an md1220 work fineDavid Gwynne
2013-06-06Fix EOM handling. Makes Bacula much happier by allowing the use ofKenneth R Westerback
both 'Fast Forward Space File = yes' and 'Hardware End of Medium = yes'. Mostly taken from FreeBSD. Constant prodding by robert@, testing actual backup and restore by ajacoutot@.
2013-05-30typo: thing -> thin provisioningTed Unangst