Thinking in Java: File Compression

Latest recommended article published on 2019-07-24 00:02:06

License: CC 4.0 BY-SA

Article tags:

#Java #Thinking in java #IO #IO structure #Java compression

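This article takes an in-depth look at compression in Java, covering the support for the GZIP and ZIP formats and how to compress and decompress files with the Java class library. It also introduces the Adler32 and CRC32 checksum algorithms and provides concrete code examples.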

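Compression: java.util.zip
1. The classes in the Java I/O library support reading and writing streams in compressed formats. You can wrap them around other I/O classes to provide compression.

2. These classes are not derived from Reader and Writer; they are part of the InputStream and OutputStream inheritance hierarchies.

3. The inheritance hierarchy is as follows. (As the diagram shows, compression always operates on byte streams; a character stream can be passed through a conversion stream such as InputStreamReader or OutputStreamWriter and then compressed.)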
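The remark about conversion streams can be sketched as follows. This is a minimal illustration, not code from the book: the class name `GzipTextDemo` and its two helper methods are hypothetical, and UTF-8 is an assumed encoding. An `OutputStreamWriter` turns characters into bytes before they reach `GZIPOutputStream`, and an `InputStreamReader` does the reverse on the way back.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.OutputStreamWriter;
import java.io.Reader;
import java.io.Writer;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical demo class: compresses text by bridging a Writer to a byte stream.
public class GzipTextDemo {

    // chars -> OutputStreamWriter (UTF-8 bytes) -> GZIPOutputStream -> in-memory sink
    static byte[] compress(String text) throws IOException {
        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        try (Writer w = new OutputStreamWriter(
                new GZIPOutputStream(sink), StandardCharsets.UTF_8)) {
            w.write(text);
        } // closing the writer also finishes the GZIP stream
        return sink.toByteArray();
    }

    // compressed bytes -> GZIPInputStream -> InputStreamReader (UTF-8) -> chars
    static String decompress(byte[] gz) throws IOException {
        StringBuilder sb = new StringBuilder();
        try (Reader r = new InputStreamReader(
                new GZIPInputStream(new ByteArrayInputStream(gz)),
                StandardCharsets.UTF_8)) {
            char[] buf = new char[1024];
            int n;
            while ((n = r.read(buf)) != -1) {
                sb.append(buf, 0, n);
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) throws IOException {
        String text = "caf\u00e9 \u4e2d\u6587 compression";
        byte[] gz = compress(text);
        System.out.println("round trip ok: " + decompress(gz).equals(text));
    }
}
```

Naming the charset explicitly on both sides keeps the round trip independent of the platform default encoding.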

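5. Using GZIP: the compression classes are straightforward to use. Wrap the output stream in a GZIPOutputStream or ZipOutputStream, and wrap the input stream in a GZIPInputStream or ZipInputStream. Everything else is ordinary I/O reading and writing. The example mixes character-oriented and byte-oriented streams: the input (in) uses the Reader classes, whereas the GZIPOutputStream constructor accepts only an OutputStream object, not a Writer.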

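6. ZIP:
The Java library's support for the Zip format is more extensive. With it you can easily store multiple files, and there is even a separate class (ZipFile) that makes reading a Zip file easier. The library uses the standard Zip format, so it works smoothly with the compression tools currently downloadable from the Internet. In addition, the Checksum classes are used to calculate and verify a file's checksum. There are two types: Adler32 (faster) and CRC32 (slower, but slightly more accurate).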
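To make the two checksum types concrete, here is a small sketch that runs the same bytes through both. The class name `ChecksumDemo` and the `checksum` helper are hypothetical; `Adler32`, `CRC32`, and the `Checksum` interface are the real java.util.zip types.

```java
import java.nio.charset.StandardCharsets;
import java.util.zip.Adler32;
import java.util.zip.CRC32;
import java.util.zip.Checksum;

// Hypothetical demo class: computes a checksum with any Checksum implementation.
public class ChecksumDemo {

    static long checksum(Checksum algo, byte[] data) {
        algo.reset();                       // start from a clean state
        algo.update(data, 0, data.length);  // feed all bytes
        return algo.getValue();             // 32-bit value returned as a long
    }

    public static void main(String[] args) {
        byte[] data = "hello zip".getBytes(StandardCharsets.US_ASCII);
        System.out.println("Adler32: " + Long.toHexString(checksum(new Adler32(), data)));
        System.out.println("CRC32:   " + Long.toHexString(checksum(new CRC32(), data)));

        // Flipping a single byte should change both values.
        data[0] ^= 1;
        System.out.println("Adler32 after change: " + Long.toHexString(checksum(new Adler32(), data)));
        System.out.println("CRC32 after change:   " + Long.toHexString(checksum(new CRC32(), data)));
    }
}
```

Because both classes implement `Checksum`, either can be passed to `CheckedOutputStream` or `CheckedInputStream` without changing the surrounding code.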

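7. Differences between ZIP and GZIP:

The main differences are what each format stores and where it is typically used. ZIP is an archive format: one file holds many entries, each compressed separately, and it is the common choice on Windows. GZIP compresses a single stream, so on Linux and other Unix-like systems it is usually combined with tar (.tar.gz). Both are based on the DEFLATE algorithm, but because a .tar.gz compresses all files as one stream, it often ends up smaller than the equivalent .zip, which is convenient for cloud storage or for offering downloads. A .tar.gz is normally unpacked on Linux; unpacking it with some Windows programs may lose Unix-specific attributes such as permissions or symbolic links.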
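The structural difference can be seen directly from Java. The sketch below is an illustration under assumptions: the class name `ZipVsGzipDemo` and its helper methods are hypothetical. It writes one GZIP stream and one ZIP archive in memory, then reads the entry names back from the ZIP; the GZIP output has no entries to list.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.zip.GZIPOutputStream;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical demo class contrasting the two formats.
public class ZipVsGzipDemo {

    // GZIP: a single compressed stream; GZIPOutputStream stores no file names.
    static byte[] gzip(byte[] data) throws IOException {
        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        try (GZIPOutputStream gz = new GZIPOutputStream(sink)) {
            gz.write(data);
        }
        return sink.toByteArray();
    }

    // ZIP: an archive of named entries, each compressed on its own.
    static byte[] zip(Map<String, byte[]> files) throws IOException {
        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        try (ZipOutputStream zos = new ZipOutputStream(sink)) {
            for (Map.Entry<String, byte[]> f : files.entrySet()) {
                zos.putNextEntry(new ZipEntry(f.getKey()));
                zos.write(f.getValue());
                zos.closeEntry();
            }
        }
        return sink.toByteArray();
    }

    // Lists the entry names stored in a ZIP archive.
    static List<String> entryNames(byte[] zipBytes) throws IOException {
        List<String> names = new ArrayList<>();
        try (ZipInputStream zis = new ZipInputStream(new ByteArrayInputStream(zipBytes))) {
            ZipEntry e;
            while ((e = zis.getNextEntry()) != null) {
                names.add(e.getName());
            }
        }
        return names;
    }

    public static void main(String[] args) throws IOException {
        Map<String, byte[]> files = new LinkedHashMap<>();
        files.put("a.txt", "first".getBytes(StandardCharsets.UTF_8));
        files.put("b.txt", "second".getBytes(StandardCharsets.UTF_8));

        System.out.println("zip entries: " + entryNames(zip(files)));
        System.out.println("gzip bytes:  " + gzip(files.get("a.txt")).length);
    }
}
```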

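8. JAR files: Java ARchive files. The JAR format resembles the Zip format: it collects a group of files into a single compressed file. Like everything else in Java, JAR files are cross-platform, so you need not worry about platform issues. Sound and image files can be included alongside class files. JAR files are particularly useful when dealing with the Internet. A JAR file usually holds compressed Java files, which improves network transfer efficiency; for security, each entry in a JAR file can be digitally signed.
A JAR file consists of a group of compressed files together with a "manifest" that describes them (you can create the manifest yourself or let the jar program generate it automatically).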
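JAR files can also be produced programmatically with the java.util.jar classes, which extend the Zip classes. The following is a rough sketch, not the jar tool's own behavior: the class name `JarDemo`, its helpers, and the "Created-By" value are hypothetical. It writes a manifest plus one entry in memory and reads the manifest back.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.jar.Attributes;
import java.util.jar.JarEntry;
import java.util.jar.JarInputStream;
import java.util.jar.JarOutputStream;
import java.util.jar.Manifest;

// Hypothetical demo class: builds a tiny JAR with a hand-made manifest.
public class JarDemo {

    static byte[] buildJar(String entryName, byte[] content) throws IOException {
        Manifest mf = new Manifest();
        // Manifest-Version should be set; without it the main attributes are not written out.
        mf.getMainAttributes().put(Attributes.Name.MANIFEST_VERSION, "1.0");
        mf.getMainAttributes().putValue("Created-By", "JarDemo (hypothetical)");

        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        // This constructor writes META-INF/MANIFEST.MF as the first entry.
        try (JarOutputStream jos = new JarOutputStream(sink, mf)) {
            jos.putNextEntry(new JarEntry(entryName));
            jos.write(content);
            jos.closeEntry();
        }
        return sink.toByteArray();
    }

    // Reads the Manifest-Version attribute back from the JAR bytes.
    static String manifestVersion(byte[] jar) throws IOException {
        try (JarInputStream jis = new JarInputStream(new ByteArrayInputStream(jar))) {
            Manifest mf = jis.getManifest();
            return mf.getMainAttributes().getValue(Attributes.Name.MANIFEST_VERSION);
        }
    }

    // Returns the name of the first regular entry (the manifest is consumed separately).
    static String firstEntryName(byte[] jar) throws IOException {
        try (JarInputStream jis = new JarInputStream(new ByteArrayInputStream(jar))) {
            JarEntry e = jis.getNextJarEntry();
            return e == null ? null : e.getName();
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] jar = buildJar("hello.txt", "hello jar".getBytes(StandardCharsets.UTF_8));
        System.out.println("Manifest-Version: " + manifestVersion(jar));
        System.out.println("First entry: " + firstEntryName(jar));
    }
}
```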

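I/O wrapping diagram: (The design of the I/O library is a mixed blessing. Understanding it takes some effort, and the core of it is the Decorator pattern.)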

Code examples:

package Unit18IO系统.压缩;

import java.io.BufferedOutputStream;
import java.io.BufferedReader;
import java.io.FileInputStream;
import java.io.FileNotFoundException;
import java.io.FileOutputStream;
import java.io.FileReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

public class 用GZIP进行简单压缩 {

	public static void main(String[] args) throws FileNotFoundException, IOException{
		if(args.length == 0 ){
			System.out.println(
					"Usage:\n GZIPcompress file\n" +
					"\tUses GZIP compression to compress " +
					"the file to test.gz");
			System.exit(1);
		}
		
		BufferedReader in = new BufferedReader(
				new FileReader(args[0]));
		
		BufferedOutputStream out = new BufferedOutputStream(
				new GZIPOutputStream(
						new FileOutputStream("test.gz")));
		
		System.out.println("Writing file");
		
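		// Caveat: in.read() returns a char, but out.write(int) keeps only its low 8 bits,
		// so this copy loop is reliable only for ASCII text.
		int c;
		while((c = in.read()) != -1)
			out.write(c);
		in.close();
		out.close();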
		
		System.out.println("Reading file");
		BufferedReader in2 = new BufferedReader(
				new InputStreamReader(
						// Decompression mirrors compression: GZIPInputStream wraps an
						// InputStream, just as GZIPOutputStream wraps an OutputStream.
						new GZIPInputStream(
								new FileInputStream("test.gz"))));
		
		String s;
		while((s = in2.readLine()) != null)
			System.out.println(s);
		in2.close();
	}
}
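For files that are not plain ASCII text, a byte-oriented variant avoids the Reader entirely. The version below is a sketch under assumptions rather than the book's code: the class name `GzipFileDemo`, its helper methods, and the ".gz" output naming are hypothetical. It copies raw bytes and relies on try-with-resources to close the streams.

```java
import java.io.BufferedInputStream;
import java.io.BufferedOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical demo class: GZIP compression using byte streams only.
public class GzipFileDemo {

    // Copies all bytes from in to out in fixed-size chunks.
    static void copy(InputStream in, OutputStream out) throws IOException {
        byte[] buf = new byte[8192];
        int n;
        while ((n = in.read(buf)) != -1) {
            out.write(buf, 0, n);
        }
    }

    static void compress(Path src, Path dst) throws IOException {
        try (InputStream in = new BufferedInputStream(Files.newInputStream(src));
             OutputStream out = new GZIPOutputStream(
                     new BufferedOutputStream(Files.newOutputStream(dst)))) {
            copy(in, out);
        } // closing the GZIPOutputStream writes the GZIP trailer
    }

    static void decompress(Path src, Path dst) throws IOException {
        try (InputStream in = new GZIPInputStream(
                     new BufferedInputStream(Files.newInputStream(src)));
             OutputStream out = new BufferedOutputStream(Files.newOutputStream(dst))) {
            copy(in, out);
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.out.println("Usage: GzipFileDemo file");
            System.exit(1);
        }
        Path src = Paths.get(args[0]);
        Path gz = Paths.get(args[0] + ".gz");
        compress(src, gz);
        System.out.println(Files.size(src) + " bytes -> " + Files.size(gz) + " bytes");
    }
}
```

Since no character decoding takes place, the decompressed file is byte-for-byte identical to the input regardless of its encoding.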

Compressing multiple files:

package Unit18IO系统.压缩;

import java.io.BufferedInputStream;
import java.io.BufferedOutputStream;
import java.io.BufferedReader;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.FileReader;
import java.util.Date;
import java.util.Enumeration;
import java.util.zip.Adler32;
import java.util.zip.CheckedInputStream;
import java.util.zip.CheckedOutputStream;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

public class 用Zip进行多文件保存 {

	public static void main(String[] args) throws Exception{
		
		args = new String[]{"E:\\xxxx\\1.txt","E:\\xxxx\\2.txt"};
		
		FileOutputStream f = 
				new FileOutputStream("test.zip");
		CheckedOutputStream csum = 
				new CheckedOutputStream(f , new Adler32());
		// Adler32 and CRC32 are both checksum algorithms; Adler32 is used here. The resulting checksum value is used to verify data integrity.
		
		ZipOutputStream zos = new ZipOutputStream(csum);
		BufferedOutputStream out = 
				new BufferedOutputStream(zos);
		
		zos.setComment("A test of java Zipping");
		
		// ZipInputStream has no corresponding getComment(); ZipFile.getComment() (Java 7+) can read it back
		
		for(String arg : args){
			System.out.println("writing file " + arg);
			
			BufferedReader in = 
					new BufferedReader(new FileReader(arg));
			
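			// A ZipEntry holds the metadata of one file in the archive: name, size, timestamp, CRC checksum, extra field data, and so on.
			ZipEntry zet = new ZipEntry(arg);
			zet.setComment("hello");
			zet.setTime(new Date().getTime());
			
			// Start a new entry; the bytes written next are compressed into it by the ZipOutputStream.
			zos.putNextEntry(zet);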
			
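			// Note: in.read() returns a char but write(int) keeps only its low 8 bits, so this is reliable only for ASCII text.
			int c;
			while((c = in.read()) != -1)
				out.write(c);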
			
			in.close();
			out.flush();
		}
		out.close();
		//Checksum valid only after the file has been closed!
		System.out.println("Checksum: " + csum.getChecksum().getValue());
		//Now extract the file;
		System.out.println("-------------Reading file--------------");
	
		FileInputStream fi = new FileInputStream("test.zip");
		CheckedInputStream csumi = 
				new CheckedInputStream(fi , new Adler32());
		
		ZipInputStream in2 = new ZipInputStream(csumi);
		BufferedInputStream bis = new BufferedInputStream(in2);
		
		ZipEntry ze;
		while((ze = in2.getNextEntry()) != null){
			
			System.out.println("Reading file "+ ze);
			int x;
			while((x = bis.read()) != -1)
				System.out.write(x);
		}
		
		if(args.length == 1)
			System.out.println("CheckSum:" + csumi.getChecksum().getValue());
	
		bis.close();
		//Alternative way to open and read Zip files
		ZipFile zf = new ZipFile("test.zip");
		Enumeration<? extends ZipEntry> e = zf.entries();
		
		while(e.hasMoreElements()){
			ZipEntry ze2 = e.nextElement();
			System.out.println("File: " + ze2);
			System.out.println(ze2.getCrc() + ", time: " + ze2.getTime());
			//...and extract the data as before
		}
		zf.close();
	}
}
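The listing stops at "extract the data as before" in the ZipFile loop. One way that step might look is sketched below, using `ZipFile.getInputStream(ZipEntry)` to read each entry. The class name `ZipFileReadDemo`, its helpers, and the sample entry names are hypothetical, and the entries are assumed to contain UTF-8 text.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Enumeration;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;
import java.util.zip.ZipOutputStream;

// Hypothetical demo class: reads every entry of a Zip archive through ZipFile.
public class ZipFileReadDemo {

    // Writes each map entry as one UTF-8 text file inside the archive.
    static void writeZip(Path zip, Map<String, String> files) throws IOException {
        try (ZipOutputStream zos = new ZipOutputStream(Files.newOutputStream(zip))) {
            for (Map.Entry<String, String> f : files.entrySet()) {
                zos.putNextEntry(new ZipEntry(f.getKey()));
                zos.write(f.getValue().getBytes(StandardCharsets.UTF_8));
                zos.closeEntry();
            }
        }
    }

    // Returns entry name -> decompressed text for all non-directory entries.
    static Map<String, String> readAll(Path zip) throws IOException {
        Map<String, String> contents = new LinkedHashMap<>();
        try (ZipFile zf = new ZipFile(zip.toFile())) {
            Enumeration<? extends ZipEntry> entries = zf.entries();
            while (entries.hasMoreElements()) {
                ZipEntry entry = entries.nextElement();
                if (entry.isDirectory()) {
                    continue;
                }
                ByteArrayOutputStream buf = new ByteArrayOutputStream();
                try (InputStream in = zf.getInputStream(entry)) {
                    byte[] chunk = new byte[4096];
                    int n;
                    while ((n = in.read(chunk)) != -1) {
                        buf.write(chunk, 0, n);
                    }
                }
                contents.put(entry.getName(),
                        new String(buf.toByteArray(), StandardCharsets.UTF_8));
            }
        }
        return contents;
    }

    public static void main(String[] args) throws IOException {
        Path zip = Files.createTempFile("zipfile-demo", ".zip");
        Map<String, String> files = new LinkedHashMap<>();
        files.put("1.txt", "first file");
        files.put("2.txt", "second file");
        writeZip(zip, files);
        readAll(zip).forEach((name, text) -> System.out.println(name + " -> " + text));
        Files.delete(zip);
    }
}
```

Unlike ZipInputStream, ZipFile reads the archive's central directory, so entries can be looked up by name with `getEntry` instead of being scanned in order.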