
TextToPdfContentTransformer text -> pdf http://www.pdfbox.org/ PDFBox
TextMiningContentTransformer doc -> txt http://www.textmining.org/ TextMining
StringExtractingContentTransformer textual formats (text/plain, application/x-javascript, text/*) -> txt
RuntimeExecutableContentTransformer RuntimeExec() dynamically executes external operating-system command-line commands
PoiHssfContentTransformer XLS -> Text http://jakarta.apache.org/poi/ POI
PdfToImageContentTransformer PDF -> PNG http://www.pdfbox.org/ PDFBox
PdfBoxContentTransformer PDF -> Text http://www.pdfbox.org/ PDFBox
OpenOfficeContentTransformer conversion between OpenOffice-supported formats:
Word/RTF/OpenDocument Text -> PDF/Word/RTF/OpenDocument Text;
Excel/OpenDocument Spreadsheet -> PDF/Excel/OpenDocument Spreadsheet;
PowerPoint/OpenDocument Presentation -> PDF/Flash/PowerPoint/OpenDocument Presentation;
http://sourceforge.net/projects/joott/ JODConverter
MediaWikiContentTransformer MEDIAWIKI -> HTML http://matheclipse.org/en/Java_Wikipedia_API
MailContentTransformer MSG -> TEXT
HtmlParserContentTransformer HTML -> TEXT
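
The RuntimeExecutableContentTransformer entry above hands the conversion to an external command-line program. The following is a minimal sketch of that idea using only the Java standard library, not the transformer's actual implementation. The class name ExternalCommandTransformer, the ${source} and ${target} placeholder names, and the "convert" command in main are all assumptions made for illustration.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical helper: fills a command template and runs it as an external process.
public class ExternalCommandTransformer {

    /** Replaces every ${name} placeholder in each token of the template. */
    public static List<String> fillTemplate(List<String> template, Map<String, String> values) {
        List<String> command = new ArrayList<>();
        for (String token : template) {
            String filled = token;
            for (Map.Entry<String, String> e : values.entrySet()) {
                filled = filled.replace("${" + e.getKey() + "}", e.getValue());
            }
            command.add(filled);
        }
        return command;
    }

    /** Runs the filled command, discards its output, and returns the exit code. */
    public static int transform(List<String> template, Path source, Path target)
            throws IOException, InterruptedException {
        List<String> command = fillTemplate(template,
                Map.of("source", source.toString(), "target", target.toString()));
        Process process = new ProcessBuilder(command)
                .redirectErrorStream(true)                        // merge stderr into stdout
                .redirectOutput(ProcessBuilder.Redirect.DISCARD)  // avoid blocking on unread output
                .start();
        return process.waitFor();
    }

    public static void main(String[] args) {
        // Illustrative template only; any installed converter could be substituted.
        List<String> template = List.of("convert", "${source}", "${target}");
        System.out.println(fillTemplate(template,
                Map.of("source", "in.pdf", "target", "out.png")));
    }
}
```

Passing the command as a list of tokens rather than a single string keeps file names containing spaces intact, because no shell is involved in splitting the arguments.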
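This article lists a series of file format conversion tools, including PDF to image, text to PDF, and document to text. It also covers conversion between multiple formats through OpenOffice, as well as parsers for specific formats such as HTML and mail content.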



