word清除html标记,清除從word粘貼過來多餘html代碼的ClearWord函數

本文介绍了一种使用CFScript实现的功能,类似于FCKEditor的javascript功能,用于清除从Word粘贴到网页内容中的多余HTML代码。这个CleanWord函数通过正则表达式匹配并移除不必要的样式属性和标签,确保网页内容入库时不包含冗余信息,提高存储效率和执行性能。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

我们在利用FckEditor编辑器的时候会有一个清除从Word粘贴过来的多余html代码的功能,它是利用javascript编写的。有了这项功能以后,我们的网页内容可以直接从Word拷贝粘贴而不用担心内容裏会有一大堆多余的东西佔据资料库空间影响网页执行的性能了。

那么,我们参照了Fckeditor的javascript功能编写了CFScript功能版本的ClearWord函数,利用该函数在页面内容添加入库时可以直接进行清除冗余操作了。

function CleanWord(html)

{

html = REReplaceNocase(html,"

s*

","","all");

html = REReplaceNocase(html,"

.*?

"," ","all");

html = REReplaceNocase(html,"s*mso-[^:]+:[^;"]+;?","","all");

html = REReplaceNocase(html,"s*MARGIN: 0cm 0cm 0pts*;","","all");

html = REReplaceNocase(html,"s*MARGIN: 0cm 0cm 0pts*"","""","all");

html = REReplaceNocase(html,"s*TEXT-INDENT: 0cms*;","","all");

html = REReplaceNocase(html,"s*TEXT-INDENT: 0cms*"","""","all");

html = REReplaceNocase(html,"s*TEXT-ALIGN: [^s;]+;?"","""","all");

html = REReplaceNocase(html,"s*PAGE-BREAK-BEFORE: [^s;]+;?"","""","all");

html = REReplaceNocase(html,"s*FONT-VARIANT: [^s;]+;?"","""","all");

html = REReplaceNocase(html,"s*tab-stops:[^;"]*;?","","all");

html = REReplaceNocase(html,"s*tab-stops:[^"]*","","all");

html = REReplaceNocase(html,"s*face="[^"]*"","","all");

html = REReplaceNocase(html,"s*face=[^ >]*","","all");

html = REReplaceNocase(html,"s*FONT-FAMILY:[^;"]*;?","","all");

html = REReplaceNocase(html,"]*) class=([^ |>]*)([^>]*)","<1","all");

html = REReplaceNocase(html,"]*) style="([^""]*)"([^>]*)","<1","all");

html = REReplaceNocase(html,"s*style="s*"","","all");

html = REReplaceNocase(html,"]*>s* s*"," ","all");

html = REReplaceNocase(html,"]*>","","all");

html = REReplaceNocase(html,"]*) lang=([^ |>]*)([^>]*)","<1","all");

html = REReplaceNocase(html,"(.*?)","1","all");

html = REReplaceNocase(html,"(.*?)","1","all");

html = REReplaceNocase(html,"]*>","","all");

html = REReplaceNocase(html,"?w+:[^>]*>","","all");

html = REReplaceNocase(html,"s*","","all");

html = REReplaceNocase(html,"

]*)>","","all");

html = REReplaceNocase(html,"

]*)>","","all");

html = REReplaceNocase(html,"

]*)>","","all");

html = REReplaceNocase(html,"

]*)>","","all");

html = REReplaceNocase(html,"

]*)>","","all");

html = REReplaceNocase(html,"

]*)>","","all");

html = REReplaceNocase(html,"","

","all");
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值