diff options
author | Oliver O'Halloran <oohall@gmail.com> | 2019-09-03 12:15:55 +0200 |
---|---|---|
committer | Michael Ellerman <mpe@ellerman.id.au> | 2019-09-05 06:22:38 +0200 |
commit | b104af5a7687060792ca398bb86b033057afce75 (patch) | |
tree | 10b677425d96097a9ea4e089024f3145964d53fb /arch/powerpc/kernel/eeh_event.c | |
parent | powerpc/eeh: Make permanently failed devices non-actionable (diff) | |
download | linux-b104af5a7687060792ca398bb86b033057afce75.tar.xz linux-b104af5a7687060792ca398bb86b033057afce75.zip |
powerpc/eeh: Check slot presence state in eeh_handle_normal_event()
When a device is surprise removed while undergoing IO we will probably
get an EEH PE freeze due to MMIO timeouts and other errors. When a freeze
is detected we send a recovery event to the EEH worker thread which will
notify drivers, and perform recovery as needed.
In the event of a hot-remove we don't want recovery to occur since there
isn't a device to recover. The recovery process is fairly long due to
the number of wait states (required by PCIe) which causes problems when
devices are removed and replaced (e.g. hot swapping of U.2 NVMe drives).
To determine if we need to skip the recovery process we can use the
get_adapter_state() operation of the hotplug_slot to determine if the
slot contains a device or not, and if the slot is empty we can skip
recovery entirely.
One thing to note is that the slot being EEH frozen does not prevent the
hotplug driver from working. We don't have the EEH recovery thread
remove any of the devices since it's assumed that the hotplug driver
will handle tearing down the slot state.
Signed-off-by: Oliver O'Halloran <oohall@gmail.com>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
Link: https://lore.kernel.org/r/20190903101605.2890-5-oohall@gmail.com
Diffstat (limited to 'arch/powerpc/kernel/eeh_event.c')
0 files changed, 0 insertions, 0 deletions