1 Jul 2002 01:19
Re: Block size optimization - let rsync find the optimal blocksize by itself.
Donovan Baarda <abo <at> minkirri.apana.org.au>
2002-06-30 23:19:26 GMT
2002-06-30 23:19:26 GMT
On Sun, Jun 30, 2002 at 06:23:10PM +0200, Olivier Lachambre wrote: [...] > Well, the first comment: during my work, I wanted to verify that the > theorical optimal block size sqrt(24*n/Q) given by Andrew Tridgell in his > PHd Thesis was actually the good one, and when doing the tests on randomly > generated & modified files I discovered that the size sqrt(78*n/Q) is the > actual optimal block size, I tried to understand this by reading all the > thesis, then quite a lot of documentation about rsync but I just can't > figure out why the theorical & experimental optimal block sizes so much > don't match. I _really_ don't think it's coming from my tests, there must be > somewhat else. Maybe the rsync developpers have just changed some part of > the algorithm. And also, even without using data compression during the > sync, rsync is always more efficient as it should be theorically, actually > between 1.5 and 2 times more efficient. Nobody will complain about that but > I'd be happy if someone would be nice enough to explain me this thing. I believe that the compression option turns on compression of transfered data, including file lists, instruction streams etc. Even with compression turned off, the miss data in the delta is still compressed. Another thing to be aware of is when rsync compresses miss data, it also compresses all the hit data to prime the compressor with context, and throws away the compressed output for the hit data. Perhaps this "context compression" is affecting the optimal block size? > Now the auto-optimization algorithm when updating many files at a time. > Let's consider a set of files to be updated. We will consider only the files > which have been changed since the last update (e.g. we can find the other > ones by sending a MD5 sum for each file and trying to match it). We sync the(Continue reading)
RSS Feed