1. Match tags except <em> and <strong>
Match:
</?(?!(?:em|strong)\b)[a-zA-Z](?:[^>"']|"[^"]*"|'[^']*')*>
Replace:
NULL
eg.
<b fdjfkjdk>inikkk</b fdjkfjk>
<i fff>nihao</i nihao>
<em ff>nihao</em nihao>
<big ff>nihao</big nihao>
result:
inikkk
nihao
<em ff>nihao</em nihao>
nihao
2. Match tags except <em> and <stong>, and any tags that contain attributes
Match:
</?(?!(?:em|strong)\s*>)[a-zA-Z](?:[^>"']|"[^"]*"|'[^']*')*>
Replace:
NULL
eg.
<b fdjfkjdk>inikkk</b fdjkfjk>
<i fff>nihao</i nihao>
<em ff>nihao</em nihao>
<big ff>nihao</big nihao>
<strong>nihao</strong>
result:
inikkk
nihao
nihao
nihao
<strong>nihao</strong>
3. Whitelist specific attributes
Match all tags except <a> <em> <strong>, with two exceptions,
Any <a> tags that have attributes other than href or title should be matached
Match:
</?(?!(?:em|strong|a(?:\s+(:href|title)\s*=\s*(?:"[^"]*"|'[^']*'))*)\s*>)[a-zA-Z](?:[^>"']|"[^"]*"|'[^']*')*>
Replace:
NULL
eg.
<b fdjfkjdk>inikkk</b fdjkfjk>
<i fff>nihao</i nihao>
<em ff>nihao</em nihao>
<big ff>nihao</big nihao>
<strong>nihao</strong>
<a nihao>fdjkf</nihao>
<a href="2222">fdjkf</nihao>
Result:
inikkk
nihao
nihao
nihao
<strong>nihao</strong>
fdjkf
<a href="2222">fdjkf <===A little error, </a> is also removed, can anyone supply solution? thanks.