SMP Version of tar
Wojciech Puchar
wojtek at wojtek.tensor.gdynia.pl
Wed Oct 10 20:46:29 UTC 2012
>
> Tim is correct in that gzip datastream allows for concatenation of
> compressed blocks of data, so you might break the input stream into
> a bunch of blocks [A, B, C, etc], and then can append those together
> into [A.gz, B.gz, C.gz, etc], and when uncompressed, you will get
> the original input stream.
> I think that Wojciech's point is that the compressed data stream for
> the single datastream is different than the compressed data
> stream of [A.gz, B.gz, C.gz, etc]. Both will decompress to the same
> thing, but the intermediate compressed representation will be different.
So, after your response, it is clear that a tar.gz generated in parallel
will differ from the single-stream one and compress slightly worse (by a
negligible amount), and it WILL be compatible with standard gzip, since
gzip can decompress a file made of multiple concatenated streams, which I
wasn't aware of.
That's good. At the same time, parallel tar will fall back to a single
thread when unpacking a standard .tar.gz. Not a big deal, as gzip
decompression is ultrafast and I/O is usually the limit.
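
To make the concatenation property discussed above concrete, here is a minimal Python sketch. It is not how any parallel tar is implemented; the function name parallel_gzip, the block size, and the worker count are assumptions chosen for illustration. It compresses each block as an independent gzip member, joins the members, and lets a standard gzip decoder read the result back.

```python
import gzip
from concurrent.futures import ThreadPoolExecutor


def parallel_gzip(data: bytes, block_size: int = 1 << 20, workers: int = 4) -> bytes:
    """Compress fixed-size blocks independently and concatenate the members.

    Hypothetical helper for illustration only: block_size and workers are
    arbitrary defaults, not values taken from any real tool.
    """
    # Split the input into blocks [A, B, C, ...].
    blocks = [data[i:i + block_size] for i in range(0, len(data), block_size)]
    # Compress each block on its own; pool.map preserves the input order.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        members = list(pool.map(gzip.compress, blocks))
    # The result is [A.gz, B.gz, C.gz, ...] back to back.
    return b"".join(members)


# Small demo input; real tar streams would be far larger.
sample = b"freebsd-hackers " * 50_000
single = gzip.compress(sample)
multi = parallel_gzip(sample, block_size=64 * 1024)

# A standard decoder reads all concatenated members as one stream.
restored = gzip.decompress(multi)
print(len(sample), len(single), len(multi), restored == sample)
```

The printed sizes let you compare the single-stream and multi-member outputs for your own data. Each member carries its own header and trailer and starts with an empty dictionary, which is where the small loss in compression ratio comes from.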