XML文件解析工具化进阶版

原创于 2020-02-02 19:06:47 发布 · 194 阅读

0 ·

CC 4.0 BY-SA版权

文章标签：

#xml #java

Java 专栏收录该内容

23 篇文章

订阅专栏

在之前的《XML文件解析工具化基础篇》中，是根据标签名来解析XML文件的，然而在很多中情况下可能会存在父子标签相同的情况，此时，若继续使用标签名来解析就有些问题了。所以，再此，提出一种新的解析方法，根据文件标签层次来解析。

一、思路

不要在之前的工具上进行过大的改动，只需加上层次解析的方法即可。

若是按层次解析，由于要适配大多数的XML文件，可又不清楚一个文件究竟有多少层。为了解决这个问题，我们将文件分为两部分，根和其他。若是有多层，可以在parseElement(Element element)中再进行调用该方法。

1.对根解析

public void parseRoot(Document document) {
	Element root = (Element) document.getChildNodes().item(0);
	dealElement(root, 0);
}

在此处，在解释一次dealElement(element,index)方法，用于对element做进一步解析，例如取属性、取textContent或者进一步取Element。
而在此处，element指代的就是root，因为是根，所以下标为0.

2.对其他解析

public void parseElement(Element element) {
	NodeList nodeList = element.getChildNodes();
	for(int index=0;index<nodeList.getLength();index++) {
		Node node  = nodeList.item(index);
		if(node instanceof Element) {
			dealElement((Element)node, index);
		}
	}
}

相必大家对为什么要对node instanceof Element进行判断都有一些疑问吧
当我对node的类进行输出时，会出现下面这种情况：
在这里插入图片描述
在上面这张图中，有两种Text和Element两种。

在XML文件中,我们将整个文件看成一个节点列表NodeList，然而对于节点来说，它不光包括元素Element，还包括Text,然而我们只需要处理Element,所以在获取节点之后，要对其进行判断，选出Element进行处理。
如果大家还是不理解Node和Element的区别的话，可以看看这篇文章《xml中Node和Element的区别》。

二、工具

import java.io.IOException;
import java.io.InputStream;

import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;

import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.SAXException;

public abstract class XmlParser {
	private static final DocumentBuilderFactory dbf;
	private static DocumentBuilder db;
	
	static {
		dbf = DocumentBuilderFactory.newInstance();
		try {
			db = dbf.newDocumentBuilder();
		} catch (ParserConfigurationException e) {
			e.printStackTrace();
		}
	}
	
	//获取需要读取的文件
	private static Document getDocument(InputStream is) {
		try {
			Document document = db.parse(is);
		} catch (SAXException e) {
			e.printStackTrace();
		} catch (IOException e) {
			e.printStackTrace();
		}
		return null;		
	}
	
	public static Document getDocument(String xmlPath) {
		InputStream is = XmlParser.class.getResourceAsStream(xmlPath);
		if(is == null) {
			System.out.println("文件路径["+xmlPath+"]不存在！");
			return null;
		}		
		return getDocument(is);
	}
	
	public  abstract void dealElement(Element element,int index);		
	
	//按标签解析
	public void parseTag(Document document,String tagName) {
		NodeList nodeList = document.getElementsByTagName(tagName);
		for(int index=0;index < nodeList.getLength();index++) {
			Element ele = (Element) nodeList.item(index);
			dealElement(ele,index);
		}
	}

	public void parseTag(Element element,String tagName) {
		NodeList nodeList = element.getElementsByTagName(tagName);
		for(int index=0;index<nodeList.getLength();index++) {
			Element ele = (Element) nodeList.item(index);
			dealElement(ele,index);
		}
	}
	
	//按层次解析
	public void parseRoot(Document document) {
		Element root = (Element) document.getChildNodes().item(0);
		dealElement(root, 0);
	}
	
	public void parseElement(Element element) {
		NodeList nodeList = element.getChildNodes();
		for(int index=0;index<nodeList.getLength();index++) {
			Node node  = nodeList.item(index);
			if(node instanceof Element) {
				dealElement((Element)node, index);
			}
		}
	}
}