String urlPath = request.getScheme()+"://"+request.getServerName()+":"+request.getServerPort()+request.getContextPath() +"/ssl/test.htm";
URL url = new URL(urlPath);
URLConnection rulConnection = url.openConnection();
HttpURLConnection httpUrlConnection = (HttpURLConnection) rulConnection;
// step 1
Document document = new Document();
// step 2
PdfWriter writer = PdfWriter.getInstance(document, new FileOutputStream("D:/TEST.pdf"));
// step 3
document.open();
// step 4
XMLWorkerHelper.getInstance().parseXHtml(writer, document,httpUrlConnection.getInputStream(), Charset.forName("UTF-8"));
// step 5
document.close();
Maven:
<dependency> <groupId>com.itextpdf</groupId> <artifactId>itext-xtra</artifactId> <version>5.5.3</version> </dependency> <dependency> <groupId>com.itextpdf</groupId> <artifactId>itextpdf</artifactId> <version>5.5.3</version> </dependency> <dependency> <groupId>com.itext</groupId> <artifactId>itext-pdfa</artifactId> <version>5.5.3</version> </dependency> <dependency> <groupId>com.itext</groupId> <artifactId>itext-asian</artifactId> <version>5.1.1</version> </dependency> <dependency> <groupId>com.itextpdf</groupId> <artifactId>xmlworker</artifactId> <version>5.5.3</version> </dependency>
本文介绍了一种使用Java从远程URL抓取网页并将其转换为PDF文件的方法。通过使用iText库,结合HTTP URL连接获取HTML内容,再利用XMLWorker解析HTML并写入PDF中。文中还列举了必要的Maven依赖项。
1117

被折叠的 条评论
为什么被折叠?



