Just in case it is of general interest: ZFS mirroring was the culprit in our case
rob at nofocus.org
Tue Nov 13 10:55:38 EST 2007
...though, we have seen super-greediness of ZFS when resilvering. ;)
On Nov 13, 2007, at 09:17, Dale Ghent wrote:
> Interesting. What's your kernel patch level?
> We're running on 125101-10 with the exact same configuration as you
> (mirrored to two arrays, in separate buildings even) and haven't seen
> this problem.
> On Nov 13, 2007, at 1:23 AM, Pascal Gienger wrote:
>> Our latency problems went away like a miracle when we detached one
>> half of the mirror (so it is no longer a mirror).
>> Read rates doubled (not per device, the total read rate!) and latency
>> is down. No more latency problems.
>> When attaching the volume again, resilvering brings the system to a
>> halt - reads and writes block for seconds (!). We will work directly
>> with Sun to solve the problem. Their "lowest I/O-priority to resilver
>> disks" does not seem to be effective. It really blocks the kernel and
>> you end up with thousands of locks in "zfs_zget".
>> We have two SAN volumes in different buildings which are NOT the
>> bottleneck, tests show it.
>> Cyrus Home Page: http://cyrusimap.web.cmu.edu/
>> Cyrus Wiki/FAQ: http://cyrusimap.web.cmu.edu/twiki
>> List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
> Dale Ghent
> Specialist, Storage and UNIX Systems
> UMBC - Office of Information Technology
> ECS 201 - x51705