How to convert web pages and word doc to PDF files?

最新推荐文章于 2024-03-24 20:34:08 发布

转载最新推荐文章于 2024-03-24 20:34:08 发布 · 117 阅读

0 ·

CC 4.0 BY-SA版权

原文链接：http://www.cnblogs.com/open-coder/archive/2012/08/24/2653567.html

doc2pdf

How to convert the web pages and word document to PDF files with text / image format kept?

Sometimes, I will do some research on the internet and save some pages locally for afterward reviewing. And some problems like the text font, page layout, images are missing, or even could not open this address and so on, a lots of problems appear. So the best solution to deal with it is try to save them all at the first time. You may think we could copy the whole content from the web page into the word and save it as a world file. Yes, it works, at least all text will be saved. But something still missing, I tried several times like this before, but the result was not so good. I need to re-arrange the text and images layout in the word, that was a time consuming work for some guys that were not good at text layout with word.

There is a software named “CutePDF writer” which run based on the Print service. All most every program have implemented “Print” feature(“File” -> “Print”). With this kind of software you could convert a word document to PDF file, convert web pages to PDF file. And the same time, the created PDF files will have the same text font, passage layout, images, text style with the original one. I have cleaned the document about how to use CutePDF, you could download the document from the top link.

转载于:https://www.cnblogs.com/open-coder/archive/2012/08/24/2653567.html

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

abc2912333

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
分享

复制链接

分享到 QQ

分享到新浪微博

扫一扫
举报

举报

PyMuPDF 操作手册 - 01 从PDF中提取文本

专注于医院数据分析技术与系统开发的创作与分享。

06-17

1352

PyMuPDF Ver 1.24.4 操作手册 - 01 从PDF中提取文本

TPM零知识学习十 —— tpm全安装流程复盘（中）

phmatthaus的专栏

01-12

1720

TPM零知识学习十 —— tpm全安装流程复盘（中）

参与评论您还未登录，请先登录后发表或查看评论

word convert to html,How to Convert HTML to Word

weixin_39536010的博客

06-21

252

Why Use C#/VB.NET to Convert HTML to Word?Save HTML file content into Word document can be very easy only through copy and paste. Or users can right click on the html document and then choose edit. It...

word convert to html,How to Convert a Word Document to HTML

weixin_34466348的博客

06-21

255

What To KnowFile > Save As. Select a location. Name the file, and select .html as the type. Press Save.Editors like Dreamweaver can convert a Word document to HTML.This article explains how to use ...

How to convert docx/odt to pdf/html with Java?

wanderman1836的专栏

07-10

2541

How to convert docx/odt to pdf/html with Java? décembre 6, 2012angelozerrLaisser un commentaireGo to comments 31 Votes How to convert d

python读取pdf内容转word,如何使用python将txt文件或PDF文件转换成Word文档？

weixin_34556619的博客

03-26

454

您可以使用GroupDocs.Conversion Cloud，它提供了Python SDK文本/PDF到DOC/DOCX converrion和许多其他常见的文件格式，从一种格式到另一种格式，而不依赖于任何第三方工具或软件。下面是示例Python代码。# Import moduleimport groupdocs_conversion_cloud# Get your app_sid and ap...

How to get the number of pages in a Word Document

weixin_34267123的博客

11-19

620

2019独角兽企业重金招聘Python工程师标准>>> ...

linux、windows word转成pdf 来获取总页数 + POI修改word内容

qijingpei的博客

07-11

3316

背景因为本来用的是POI，调研了一些POI的api，虽然有一些获取总页数的方法，但是一旦word里有图片获得的总页数就不准确了。看有大牛提到过用转pdf的方法来获取word总页数，但是只适用于windows平台下，但我们甲方的服务器是Linux的，所以才采用了另一款转pdf的工具–Libreoffice 解决方法 1.对于有图片的word，可以先把它转换成pdf， 2.然后再读取pd...

XMLmind Word To XML Manual

weixin_30932215的博客

06-05

5039

XMLmind Word To XML Manual Hussein ShafiePixware SARL91 rue Gambetta,78120 Rambouillet,France,Phone: +33 (0)1 30 59 81 44,Web: http://www.xmlmind.com/w2x/Email: mailto:w2x-support@xmlmind.com(publ...

How to make an altcoin

weixin_39788534的博客

04-14

2659

pip install pip==9.0.3 Complete output from command python setup.py egg_info: Traceback (most recent call last): File "<string>", line 1, in <module> File "/tmp/pip-install...

c#使用word、excel、pdf ——转

09-09

445

一、C# Word操作引入Word COM组件菜单=》项目=》添加引用=》COM=》Microsoft Word 11.0 Object Libraryusing Word = Microsoft.Office.Interop.Word;1、功能：将数据以自制表格形式插入WORD中2、主要程序代码如下：创建新Wordobject oMissing = System.Reflecti...

Zotero Community Contribution Guide: Contributing to Open Source Communities and Building a ...

# Zotero Community Contribution Guide: Giving Back to the Open Source Community ...The Zotero community is a vibrant group composed of researchers, scholars, and developers dedicated to the collaborativ

（第75天）AutoUpgrade 升级：11GR2 到 19C

LuciferLiu_DBA

03-24

843

Oracle 从 12CR2 开始就开始大力推广 AutoUpgrade 升级工具，但是实际生产中使用并不广泛，但是这个工具已经发展的非常成熟了。本文介绍一下如何使用 AutoUpgrade 工具进行快速升级，操作非常简单。

09-15

from pdf2image import convert_from_path from paddleocr import PaddleOCR from cnocr import CnOcr except Exception as e: logger.critical(f"依赖初始化失败: {str(e)}") import tkinter.messagebox as ...

新型1D-LBP的高效方法，采用混合深度学习方法检测轴承失效.zip