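Simple code for scraping page information with PHP
This uses PHP's DOM functions to scrape information from a single page. Here it only extracts the a tags. That part works, but following the extracted links to crawl further pages is not implemented. Criticism and suggestions are welcome.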
<?php
// Report fatal errors only; this hides the warnings DOMDocument emits for malformed real-world HTML.
error_reporting(E_ERROR);
$pages = file_get_contents('http://www.php100.com');
if ($pages === false) {
    exit('Failed to fetch the page.');
}
$doc = new DOMDocument();                   // parses the fetched page
$new_doc = new DOMDocument('1.0', 'utf-8'); // collects the extracted links
$doc->loadHTML($pages);
$links = $doc->getElementsByTagName('a');
foreach ($links as $link) {
    // Add the link text as a text node so that characters such as & are escaped correctly.
    $node = $new_doc->createElement('a');
    $node->appendChild($new_doc->createTextNode($link->nodeValue));
    $newnode = $new_doc->appendChild($node);
    $newnode->setAttribute('href', $link->getAttribute('href'));
    $newnode->setAttribute('style', 'display:block;margin-left:30px;');
}
echo $new_doc->saveHTML();
?>
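The missing piece mentioned at the top, following the scraped links to other pages, first needs each href turned into an absolute URL. What follows is only a minimal sketch of that step, not a full crawler. The helper name `extractLinks` and its parameters are hypothetical. It assumes the base URL has a scheme and a host, ignores ports, and resolves relative paths against the site root rather than the current directory.

```php
<?php
// Hypothetical helper: returns the unique absolute http(s) URLs found in
// the <a href="..."> attributes of $html, in document order.
function extractLinks(string $html, string $baseUrl): array
{
    if (trim($html) === '') {
        return [];
    }

    // Collect libxml parse warnings internally instead of printing them.
    $previous = libxml_use_internal_errors(true);
    $doc = new DOMDocument();
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    // Assumption: $baseUrl contains both a scheme and a host.
    $base = parse_url($baseUrl);
    $origin = $base['scheme'] . '://' . $base['host'];

    $seen = [];
    foreach ($doc->getElementsByTagName('a') as $a) {
        $href = trim($a->getAttribute('href'));
        if ($href === '' || $href[0] === '#') {
            continue; // empty link or in-page anchor
        }
        if (preg_match('#^https?://#i', $href)) {
            $url = $href;                          // already absolute
        } elseif (strpos($href, '//') === 0) {
            $url = $base['scheme'] . ':' . $href;  // protocol-relative
        } elseif ($href[0] === '/') {
            $url = $origin . $href;                // root-relative
        } elseif (preg_match('#^[a-z][a-z0-9+.-]*:#i', $href)) {
            continue; // other schemes such as mailto: or javascript:
        } else {
            $url = $origin . '/' . $href;          // simplification: resolved against the site root
        }
        $seen[$url] = true; // array keys remove duplicates
    }
    return array_keys($seen);
}

// Usage: print the links found in a small HTML snippet.
$sample = '<a href="/news.html">News</a> <a href="#top">Top</a>';
foreach (extractLinks($sample, 'http://www.php100.com') as $url) {
    echo $url, "\n";
}
```

A crawler could push these URLs onto a queue, fetch each one with file_get_contents, and run the same extraction again, keeping a list of visited URLs so that no page is fetched twice.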