一: 提取word全部文本
word分为doc,docx。2007以后的版本用XWPFDocument都可以适配
String filePath = "D:/My Documents/Desktop/123.docx";
FileInputStream fileInputStream = new FileInputStream(filePath);
XWPFDocument doc = new XWPFDocument(fileInputStream);
XWPFWordExtractor extractor = new XWPFWordExtractor(doc);
String text = extractor.getText(); //获取word全部文本
System.out.println(text);
此时结果为全部word内容
二:提取word里面的段
public static List<Map<String,Object>> getXParagraph(XWPFDocument doc){
List<Map<</