CHANG Wei-ling1, YUN Xiao-chun2, FANG Bin-xing1, et al. HitIct:Chinese corpus for the evaluation of lossless compression algorithms[J]. 2009, 30(3): 42-47.
CHANG Wei-ling1, YUN Xiao-chun2, FANG Bin-xing1, et al. HitIct:Chinese corpus for the evaluation of lossless compression algorithms[J]. 2009, 30(3): 42-47.DOI:
a Chinese corpus for the evaluation of lossless compression algorithms based on ANSI code
was proposed.In accordance with the principle of application representativeness
Complementary principle and openness principle
a large number of candidate files were obtained from the Internet
and then average compression ratio
average correlation coefficient
compression ratio correlation coefficient and standard deviation were used to select the files that give the most accurate indication of the overall performance of compression algorithms.Experimental results show that this collection has a good representativeness and stability
and can be used as the supplementary test set of the main benchmark for comparing compression methods.