php 爬虫 php爬虫 解决方案 » 免费领取超大流量手机卡,每月29元包185G流量+100分钟通话, 中国电信官方发货 LZ 是 php 懒虫 php能实现爬虫? 利用CURL抓取,但不能称之为爬虫<?php/** * @filename eeeeeeeeeeeeeeeeeeee.php * @encoding UTF-8 * @author Guo Bingbing * @copyright http://www.webapp123.com * @datetime 2014-6-5 9:13:13 * @version 1.0 * @Description */$url = "http://202.114.18.218/Main.aspx";$post_data = array ("programId" => "西区","txtyq" => "西十二舍",'txtld'=>'2层','Txtroom'=>'202','ImageButton1.x'=>44,'ImageButton1.y'=>15,'__VIEWSTATE'=>'/wEPDwULLTEyNjgyMDA1OTgPZBYCAgMPZBYKAgEPEA8WBh4NRGF0YVRleHRGaWVsZAUM5qW85qCL5Yy65Z+fHg5EYXRhVmFsdWVGaWVsZAUM5qW85qCL5Yy65Z+fHgtfIURhdGFCb3VuZGdkEBUGBuS4nOWMugbopb/ljLoM6Z+16IuR5LqM5pyfDOmfteiLkeS4gOacnwbntKvoj5gLLeivt+mAieaLqS0VBgbkuJzljLoG6KW/5Yy6DOmfteiLkeS6jOacnwzpn7Xoi5HkuIDmnJ8G57Sr6I+YAi0xFCsDBmdnZ2dnZxYBAgFkAgUPEA8WBh8ABQbmpbzlj7cfAQUG5qW85Y+3HwJnZBAVBAnopb/lha3oiI0J6KW/5LiJ6IiNDOilv+WNgeS6jOiIjQst6K+36YCJ5oupLRUECeilv+WFreiIjQnopb/kuInoiI0M6KW/5Y2B5LqM6IiNAi0xFCsDBGdnZ2cWAQICZAIJDxAPFgYfAAUJ5qW85bGC5Y+3HwEFCealvOWxguWPtx8CZ2QQFQcEMeWxggQy5bGCBDPlsYIENOWxggQ15bGCBDblsYILLeivt+mAieaLqS0VBwQx5bGCBDLlsYIEM+WxggQ05bGCBDXlsYIENuWxggItMRQrAwdnZ2dnZ2dnZGQCFw88KwANAGQCGQ88KwANAGQYAwUeX19Db250cm9sc1JlcXVpcmVQb3N0QmFja0tleV9fFgIFDEltYWdlQnV0dG9uMQUMSW1hZ2VCdXR0b24yBQlHcmlkVmlldzEPZ2QFCUdyaWRWaWV3Mg9nZLrpGga8HGSxHLUfEq03ORnKKqlo','__EVENTVALIDATION'=>'/wEWGQK/v+E5Auitx54JAtzWxOgGArnQx+gGAqGLoZoFAt2ds6UGAu24zOsNAuvCofMFApf71rcEAv365MwHAv36lIUBAtig2PsOApSUsNoIAoOU+OMOAoKU+OMOAoGU+OMOAoCU+OMOAoeU+OMOAoaU+OMOAo+UvJ4CAvrV2qsGAtLCmdMIAtLC1eQCAuzR9tkMAuzRirUFeF4+SvCc9iqd47hEgn0pgLyBk4k=');$ch = curl_init();curl_setopt($ch, CURLOPT_URL, $url);curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);curl_setopt($ch, CURLOPT_POST, 1);curl_setopt($ch, CURLOPT_POSTFIELDS, $post_data);$output = curl_exec($ch);curl_close($ch);print_r($output); 我原来也是怎这么写的。可还是没法运行报什么错?我这边完全正常,还需要注意:__VIEWSTATE,__EVENTVALIDATION 应该从源站上匹配抓取 <table cellspacing="0" rules="all" border="1" id="GridView2" height="118" width="90%"> <tr> <th scope="col">抄表数据</th><th scope="col">抄表时间</th> </tr><tr> <td>249.3</td><td>2014-6-5 7:31:47</td> </tr><tr> <td>249.7</td><td>2014-6-4 7:23:41</td> </tr><tr> <td>249.9</td><td>2014-6-3 7:30:45</td> </tr><tr> <td>249.9</td><td>2014-6-2 7:31:34</td> </tr><tr> <td>250.2</td><td>2014-6-1 7:30:57</td> </tr><tr> <td>250.5</td><td>2014-5-31 7:23:49</td> </tr><tr> <td>250.6</td><td>2014-5-30 7:31:10</td> </tr> </table>代码是正确的,但提交的数据必须是utf-8编码的 我原来也是怎这么写的。可还是没法运行报什么错?我这边完全正常,还需要注意:__VIEWSTATE,__EVENTVALIDATION 应该从源站上匹配抓取就是我这边是空白的,应该什么也没抓取到,__VIEWSTATE,__EVENTVALIDATION 也改成了自己这边的 我原来也是怎这么写的。可还是没法运行报什么错?我这边完全正常,还需要注意:__VIEWSTATE,__EVENTVALIDATION 应该从源站上匹配抓取就是我这边是空白的,应该什么也没抓取到,__VIEWSTATE,__EVENTVALIDATION 也改成了自己这边的先确定下自己是否开启了错误报告,display_errors 、error_reporting就算没抓取水电信息,也是会有输出的,而你说空白页面,那只可能是你没有开启错误显示! 别用 php 自己写了,LZ 考虑一下 Nutch 和 Solr 。 PHP如何用正则表达式匹配圆括号 求救:帮忙写个curl登录获得个人信息的代码!十万火急~ PHP COOKIE的问题 关于MySql重新安装的问题 如何判断文件是否正被其他程序打开? 讨论:如何实现以下功能? MySQL 返回:Got error 127 from table handler 你是php高手吗,先看看这段代码吧! 关于邮件列表 正则匹配<a>标签内的所有内容 RewriteRule . index.php 这句话什么意思 验证码刷新时,session为什么不改变?
* @filename eeeeeeeeeeeeeeeeeeee.php
* @encoding UTF-8
* @author Guo Bingbing
* @copyright http://www.webapp123.com
* @datetime 2014-6-5 9:13:13
* @version 1.0
* @Description
*/
$url = "http://202.114.18.218/Main.aspx";
$post_data = array ("programId" => "西区","txtyq" => "西十二舍",'txtld'=>'2层','Txtroom'=>'202','ImageButton1.x'=>44,'ImageButton1.y'=>15,'__VIEWSTATE'=>'/wEPDwULLTEyNjgyMDA1OTgPZBYCAgMPZBYKAgEPEA8WBh4NRGF0YVRleHRGaWVsZAUM5qW85qCL5Yy65Z+fHg5EYXRhVmFsdWVGaWVsZAUM5qW85qCL5Yy65Z+fHgtfIURhdGFCb3VuZGdkEBUGBuS4nOWMugbopb/ljLoM6Z+16IuR5LqM5pyfDOmfteiLkeS4gOacnwbntKvoj5gLLeivt+mAieaLqS0VBgbkuJzljLoG6KW/5Yy6DOmfteiLkeS6jOacnwzpn7Xoi5HkuIDmnJ8G57Sr6I+YAi0xFCsDBmdnZ2dnZxYBAgFkAgUPEA8WBh8ABQbmpbzlj7cfAQUG5qW85Y+3HwJnZBAVBAnopb/lha3oiI0J6KW/5LiJ6IiNDOilv+WNgeS6jOiIjQst6K+36YCJ5oupLRUECeilv+WFreiIjQnopb/kuInoiI0M6KW/5Y2B5LqM6IiNAi0xFCsDBGdnZ2cWAQICZAIJDxAPFgYfAAUJ5qW85bGC5Y+3HwEFCealvOWxguWPtx8CZ2QQFQcEMeWxggQy5bGCBDPlsYIENOWxggQ15bGCBDblsYILLeivt+mAieaLqS0VBwQx5bGCBDLlsYIEM+WxggQ05bGCBDXlsYIENuWxggItMRQrAwdnZ2dnZ2dnZGQCFw88KwANAGQCGQ88KwANAGQYAwUeX19Db250cm9sc1JlcXVpcmVQb3N0QmFja0tleV9fFgIFDEltYWdlQnV0dG9uMQUMSW1hZ2VCdXR0b24yBQlHcmlkVmlldzEPZ2QFCUdyaWRWaWV3Mg9nZLrpGga8HGSxHLUfEq03ORnKKqlo','__EVENTVALIDATION'=>'/wEWGQK/v+E5Auitx54JAtzWxOgGArnQx+gGAqGLoZoFAt2ds6UGAu24zOsNAuvCofMFApf71rcEAv365MwHAv36lIUBAtig2PsOApSUsNoIAoOU+OMOAoKU+OMOAoGU+OMOAoCU+OMOAoeU+OMOAoaU+OMOAo+UvJ4CAvrV2qsGAtLCmdMIAtLC1eQCAuzR9tkMAuzRirUFeF4+SvCc9iqd47hEgn0pgLyBk4k=');
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
curl_setopt($ch, CURLOPT_POST, 1);
curl_setopt($ch, CURLOPT_POSTFIELDS, $post_data);
$output = curl_exec($ch);
curl_close($ch);
print_r($output);
<tr>
<th scope="col">抄表数据</th><th scope="col">抄表时间</th>
</tr><tr>
<td>249.3</td><td>2014-6-5 7:31:47</td>
</tr><tr>
<td>249.7</td><td>2014-6-4 7:23:41</td>
</tr><tr>
<td>249.9</td><td>2014-6-3 7:30:45</td>
</tr><tr>
<td>249.9</td><td>2014-6-2 7:31:34</td>
</tr><tr>
<td>250.2</td><td>2014-6-1 7:30:57</td>
</tr><tr>
<td>250.5</td><td>2014-5-31 7:23:49</td>
</tr><tr>
<td>250.6</td><td>2014-5-30 7:31:10</td>
</tr>
</table>
代码是正确的,但提交的数据必须是utf-8编码的
就算没抓取水电信息,也是会有输出的,而你说空白页面,那只可能是你没有开启错误显示!