An alarming ZFS status message and what is usually going on with itSuppose that you have a ZFS pool with redundancy (mirroring or ZFS's
version of RAID 5 or RAID 6), and that someday you run '
(This has been re-linewrapped for my convenience.) The rest of the What this generally really means is something like this:
(I have to admit that Sun's own error explanation page for this is pretty good, too. This is unfortunately somewhat novel, which explains why I didn't look at it before now.) I assume that ZFS throws up this alarming status message even though it automatically handled the issue because it doesn't want to hide that a problem happened from you. While the problem might just be a temporary glitch (we've seen this a few times on our iSCSI based fileservers), it might instead be an indication of a more serious issue that you should look into, so at least you need to know that something happened. (And even temporary glitches shouldn't happen all that often, or ideally at all; if they do, you have a problem somewhere.) Sidebar: Our experience with these errorsWe've seen a few of these temporary glitches with our iSCSI based
fileservers. So far our procedure to deal with
this is to note down at least which disk had the checksum errors
(sometimes we save the full ' On our fileservers, my suspicions are on the hardware or driver for the
onboard nVidia Ethernet ports. The fileservers periodically report that
they lost and then immediately regained the link on (In contributing evidence, the Linux iSCSI backends, running on very similar hardware, also had problems with their onboard nVidia Ethernet ports under sufficient load.) |
These are my WanderingThoughts GettingAround This is part of CSpace, and is written by ChrisSiebenmann. * * * Atom feeds are available; see the bottom of most pages. Categories: links, linux, programming, python, snark, solaris, spam, sysadmin, tech, unix, web |