Stripping tags and removing nodes with DOMXPath in PHP
Question:
I'm trying to work with DOMDocument, but I'm running into some problems. I have a string like this:
Some Content to keep
<span class="ice-cts-1 ice-del" data-changedata="" data-cid="5" data-time="1414514760583" data-userid="1" data-username="Site Administrator" undefined="Site Administrator">
This content should remain, but span around it should be stripped
</span>
Keep this content too
<span>
<span class="ice-cts-1 ice-ins" data-changedata="" data-cid="2" data-time="1414512278297" data-userid="1" data-username="Site Administrator" undefined="Site Administrator">
This whole node should be deleted
</span>
</span>
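What I want to do: if a span has a class like ice-del, keep the content inside it but remove the span tags. If it has ice-ins, remove the whole node.
If it is just an empty span (<span></span>), it can be removed as well. Here is the code I have so far: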
// this gets the string shown above
$getVal = $array['body'][0][$a];
$dom = new DOMDocument;
$dom->loadHTML($getVal);
$xPath = new DOMXPath($dom);
$delNodes = $xPath->query('//span[@class="ice-cts-1 ice-del"]');
$insNodes = $xPath->query('//span[@class="ice-cts-1 ice-ins"]');
foreach ($insNodes as $span) {
    // reject these changes, so remove the whole node
    $span->parentNode->removeChild($span);
}
foreach ($delNodes as $span) {
    // accept these changes, so just strip out the tags but keep the content
}
$newString = $dom->saveHTML();
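So my code works for removing the whole span node, but how do I take a node and strip out its tags while keeping its content?
Also, how would I remove the empty spans? I'm sure I could do this with a regex or a string replace, but I'd rather do it with the DOM.
Thanks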
Answer:
No, I wouldn't recommend regex. I strongly suggest building on what you already have and sticking with this HTML parser. In this case, you can use ->replaceChild:
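$dom = new DOMDocument;
$dom->loadHTML($getVal);
$xPath = new DOMXPath($dom);
$spans = $xPath->query('//span');
foreach ($spans as $span) {
    // read the class attribute as a string ('' when the span has no class)
    $class = $xPath->evaluate('string(./@class)', $span);
    if (strpos($class, 'ice-ins') !== false || $class == '') {
        // rejected insertion, or a span with no class: remove the whole node
        $span->parentNode->removeChild($span);
    } elseif (strpos($class, 'ice-del') !== false) {
        // accepted change: replace the span with a text node holding its text content
        $span->parentNode->replaceChild(new DOMText($span->nodeValue), $span);
    }
}
$newString = $dom->saveHTML();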
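
Two things to be aware of in the snippet above: replacing the span with new DOMText($span->nodeValue) flattens any markup nested inside it to plain text, and the $class == '' test removes every span that has no class, whether or not it is actually empty. If either matters for your data, here is a minimal sketch of a variant that moves the child nodes out of the span instead and only drops spans that contain nothing but whitespace. The function name cleanIceMarkup and the ice-root wrapper div are made up for illustration, and the sketch assumes ASCII input, since loadHTML is given no charset hint.

```php
<?php
// Hypothetical helper; the name and the wrapper id are illustrative only.
function cleanIceMarkup(string $html): string
{
    $dom = new DOMDocument();
    // Collect parser warnings internally instead of emitting them.
    $previous = libxml_use_internal_errors(true);
    // Wrap the fragment so only its contents can be serialized afterwards.
    $dom->loadHTML('<div id="ice-root">' . $html . '</div>');
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xPath = new DOMXPath($dom);

    // Match a class as a whole token, so "ice-ins" does not match "ice-inside".
    $byClass = function (string $token): string {
        return '//span[contains(concat(" ", normalize-space(@class), " "), " ' . $token . ' ")]';
    };

    // 1. Rejected insertions: remove the whole node.
    foreach ($xPath->query($byClass('ice-ins')) as $span) {
        $span->parentNode->removeChild($span);
    }

    // 2. Accepted deletions: move every child in front of the span, then drop the span.
    foreach ($xPath->query($byClass('ice-del')) as $span) {
        while ($span->firstChild) {
            $span->parentNode->insertBefore($span->firstChild, $span);
        }
        $span->parentNode->removeChild($span);
    }

    // 3. Spans left with only whitespace; repeat so nested empty spans go too.
    do {
        $empties = $xPath->query('//span[not(*) and normalize-space(.) = ""]');
        foreach ($empties as $span) {
            $span->parentNode->removeChild($span);
        }
    } while ($empties->length > 0);

    // Serialize only the wrapper's children, not the html/body scaffolding.
    $root = $xPath->query('//div[@id="ice-root"]')->item(0);
    $out = '';
    foreach ($root->childNodes as $child) {
        $out .= $dom->saveHTML($child);
    }
    return $out;
}

// Usage, with $getVal holding the string from the question:
// $newString = cleanIceMarkup($getVal);
```

Because the function serializes only the children of the wrapper div, the result does not carry the doctype, html and body tags that a plain $dom->saveHTML() adds around a fragment. Note that step 3 removes any whitespace-only span, including ones with attributes, so tighten that XPath if some empty spans are meaningful in your markup.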