diff options
author | Lukasz Dorau <lukasz.dorau@intel.com> | 2011-09-01 15:10:34 +0200 |
---|---|---|
committer | NeilBrown <neilb@suse.de> | 2011-09-06 07:19:58 +0200 |
commit | ba714450698a966d184f5337235b100cbfa8685e (patch) | |
tree | b6a1f482af99439ef2cbc5e2d4afbbfa71cd1ae7 /mdmon.h | |
parent | imsm: support 'missing' devices at Create (diff) | |
download | mdadm-ba714450698a966d184f5337235b100cbfa8685e.tar.xz mdadm-ba714450698a966d184f5337235b100cbfa8685e.zip |
FIX: Mdmon crashes after changing RAID level from 1 to 0
Description of the bug:
Sometimes mdmon crashes after changing RAID level from 1 to 0 (takeover).
Cause of the bug:
The managemon marks an active_array for removal from monitoring
by assigning a->container to NULL value (in the "manage_member" function).
Sometimes (during stress test) it happens right when the monitor
is in the "read_and_act" function and a->container pointer is in use.
This causes the monitor crashes.
Solution:
The active array has to be marked for removal in another way
than setting NULL pointer when it can be in use.
A new field "to_remove" was added to the "active_array" structure.
It is used in the managemon to mark a container to remove
(instead of the old assigment: a->container = NULL)
and monitor checks it to determine if the array should be removed.
The field "to_remove" should be checked in some other places
to avoid managing of the array which is going to be removed.
Signed-off-by: Lukasz Dorau <lukasz.dorau@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>
Diffstat (limited to 'mdmon.h')
-rw-r--r-- | mdmon.h | 1 |
1 files changed, 1 insertions, 0 deletions
@@ -28,6 +28,7 @@ struct active_array { struct mdinfo info; struct supertype *container; struct active_array *next, *replaces; + int to_remove; int action_fd; int resync_start_fd; |