1 May 2011 11:36
Re: RAID6 r-m-w, op-journaled fs, SSDs
Dave Chinner <david <at> fromorbit.com>
2011-05-01 09:36:59 GMT
2011-05-01 09:36:59 GMT
On Sat, Apr 30, 2011 at 04:27:48PM +0100, Peter Grandi wrote: > Regardless, op-journaled file system designs like JFS and XFS > write small records (way below a stripe set size, and usually > way below a chunk size) to the journal when they queue > operations, XFS will write log-stripe-unit sized records to disk. If the log buffers are not full, it pads them. Supported log-sunit sizes are up to 256k. > even if sometimes depending on design and options > may "batch" the journal updates (potentially breaking safety > semantics). Also they do small write when they dequeue the > operations from the journal to the actual metadata records > involved. > > How bad can this be when the journal is say internal for a > filesystem that is held on wide-stride RAID6 set? I suspect very > very bad, with apocalyptic read-modify-write storms, eating IOPS. Not bad at all, because the journal writes are sequential, and XFS can have multiple log IOs in progress at once (up to 8 x 256k = 2MB). So in general while metadata operations are in progress, XFS will fill full stripes with log IO and you won't get problems with RMW. > Where are studies or even just impressions of anedoctes on how > bad this is? Just buy decent RAID hardware with a BBWC and journal IO does not(Continue reading)
RSS Feed