如何获取这个页面的指定内容 本帖最后由 cnaaa8 于 2012-06-09 13:27:59 编辑 解决方案 » 免费领取超大流量手机卡,每月29元包185G流量+100分钟通话, 中国电信官方发货 file_get_contents(); 获取到源码后再用正则提取。 我知道是file_get_contents 但是如何提取出来啊 分析HTML如果你不会分析HTML的话说了你可能也无从下手。 <a href="#" class="J_show_list J_MakePoint" data-point-val="tbrate.2.3.1" data-param="timeLine=-30&result=1">23323</a>正则匹配这个就可以了。 $url = 'http://rate.taobao.com/user-rate-39512.htm';$s = file_get_contents($url);preg_match_all('#class="J_show_list J_MakePoint".+tbrate\.([\d.]+?).+>(\d+)</a>#isU', $s, $r);print_r($rArray( [0] => Array ( [0] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.2.1" data-param='timeLine=-7&result=1'>5250</a> [1] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.2.2" data-param='timeLine=-7&result=0'>13</a> [2] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.2.3" data-param='timeLine=-7&result=-1'>8</a> [3] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.3.1" data-param='timeLine=-30&result=1'>23123</a> [4] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.3.2" data-param='timeLine=-30&result=0'>74</a> [5] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.3.3" data-param='timeLine=-30&result=-1'>36</a> [6] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.4.1" data-param='timeLine=-210&result=1'>294425</a> [7] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.4.2" data-param='timeLine=-210&result=0'>1616</a> [8] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.4.3" data-param='timeLine=-210&result=-1'>796</a> ) [1] => Array ( /** 很显然标识数据类别的这节是不能少的 **/ [0] => 2.2.1 [1] => 2.2.2 [2] => 2.2.3 [3] => 2.3.1 [4] => 2.3.2 [5] => 2.3.3 [6] => 2.4.1 [7] => 2.4.2 [8] => 2.4.3 ) [2] => Array ( [0] => 5250 [1] => 13 [2] => 8 [3] => 23123 [4] => 74 [5] => 36 [6] => 294425 [7] => 1616 [8] => 796 )) 我发现PHP版一直删我楼,艹了 我要同时匹配这两段字符,该如何写正则呢<a href="http://ratehis.taobao.com/user-rate-39512--isarchive|true--detailed|1--goodNeutralOrBad|1--timeLine|-211--receivedOrPosted|0--buyerOrSeller|0.htm#RateType" class="J_MakePoint" data-point-val="tbrate.2.5.1">4508530</a><a href="#" class="J_show_list J_MakePoint" data-point-val="tbrate.2.4.1" data-param='timeLine=-210&result=1'>289048</a>提示:第一个的:class="J_MakePoint" data-point-val="tbrate.2.5.1">4508530</a>第二个的:class="J_show_list J_MakePoint" data-point-val="tbrate.2.4.1" data-param='timeLine=-210&result=1'>289048</a>都是需要提取<a>标签内的数字,如第一个中搞定4508530和第二个中的289048,用一个正则匹配 who know pdp php如何抓取网页中的数据? php mail的的问题 关于php + C++ 调用的问题或多数以什么样子的方式使用 谁给推荐一个php集成环境(win7 64bit) 分享一个wordpress中文主题:problog1.2最终版,主打商务,简洁 各位大虾请教下zencart在IE8浏览器兼容问题 抓取淘宝省市数据库地址 求一个URL解析函数 一个sql查询相关的问题 自己写的原创php内容,没用人看 难:求一段代码的正则
如果你不会分析HTML的话说了你可能也无从下手。
(
[0] => Array
(
[0] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.2.1" data-param='timeLine=-7&result=1'>5250</a>
[1] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.2.2" data-param='timeLine=-7&result=0'>13</a>
[2] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.2.3" data-param='timeLine=-7&result=-1'>8</a>
[3] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.3.1" data-param='timeLine=-30&result=1'>23123</a>
[4] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.3.2" data-param='timeLine=-30&result=0'>74</a>
[5] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.3.3" data-param='timeLine=-30&result=-1'>36</a>
[6] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.4.1" data-param='timeLine=-210&result=1'>294425</a>
[7] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.4.2" data-param='timeLine=-210&result=0'>1616</a>
[8] => class="J_show_list J_MakePoint" data-point-val="tbrate.2.4.3" data-param='timeLine=-210&result=-1'>796</a>
) [1] => Array
( /** 很显然标识数据类别的这节是不能少的 **/
[0] => 2.2.1
[1] => 2.2.2
[2] => 2.2.3
[3] => 2.3.1
[4] => 2.3.2
[5] => 2.3.3
[6] => 2.4.1
[7] => 2.4.2
[8] => 2.4.3
) [2] => Array
(
[0] => 5250
[1] => 13
[2] => 8
[3] => 23123
[4] => 74
[5] => 36
[6] => 294425
[7] => 1616
[8] => 796
))
我要同时匹配这两段字符,该如何写正则呢
<a href="http://ratehis.taobao.com/user-rate-39512--isarchive|true--detailed|1--goodNeutralOrBad|1--timeLine|-211--receivedOrPosted|0--buyerOrSeller|0.htm#RateType" class="J_MakePoint" data-point-val="tbrate.2.5.1">4508530</a><a href="#" class="J_show_list J_MakePoint" data-point-val="tbrate.2.4.1" data-param='timeLine=-210&result=1'>289048</a>提示:
第一个的:class="J_MakePoint" data-point-val="tbrate.2.5.1">4508530</a>
第二个的:class="J_show_list J_MakePoint" data-point-val="tbrate.2.4.1" data-param='timeLine=-210&result=1'>289048</a>
都是需要提取<a>标签内的数字,如第一个中搞定4508530和第二个中的289048,用一个正则匹配