这与防采集无关,而与他用的 UI 有关
data-poi 的值是(url编码到达 js 时已还原)
[{"shopid":566878,"name":"\u80e4\u9686\u6c47","address":"\u6c5f\u5e72\u533a\u65b0\u5858\u8def\u676d\u6d77\u8def238\u53f7\uff08\u68ee\u79be\u5e7f\u573a\uff09","range":"\u94b1\u6c5f\u65b0\u57ce","rangeid":5455,"disid":58,"disname":"\u6c5f\u5e72\u533a","dpshopid":0,"mapurl":"","trafficinfo":"","phone":"0571-86963688\/86963788","latlng":"[30.253548,120.206872]","city":50,"url":"","poiid":6307803,"poilevel":{"avgscore":"4.8","fbcount":1471},"cityname":"\u676d\u5dde","status":0,"subwayname":"","subwaydis":0,"subwayslug":"","appointmentDay":0}]json 解码后是 Array
(
    [0] => stdClass Object
        (
            [shopid] => 566878
            [name] => 胤隆汇
            [address] => 江干区新塘路杭海路238号(森禾广场)
            [range] => 钱江新城
            [rangeid] => 5455
            [disid] => 58
            [disname] => 江干区
            [dpshopid] => 0
            [mapurl] => 
            [trafficinfo] => 
            [phone] => 0571-86963688/86963788
            [latlng] => [30.253548,120.206872]
            [city] => 50
            [url] => 
            [poiid] => 6307803
            [poilevel] => stdClass Object
                (
                    [avgscore] => 4.8
                    [fbcount] => 1471
                )            [cityname] => 杭州
            [status] => 0
            [subwayname] => 
            [subwaydis] => 0
            [subwayslug] => 
            [appointmentDay] => 0
        ))

解决方案 »

  1.   

    只是json后的一种数据格式,你可以在json解码转过来就可以了,然后页面时采用的ajax进行局部数据输出的,当你只是采集的时候无法获取到对应的文字内容很正常。
      

  2.   

    再仔细分析了下他的页面代码,里面用JS调用的combo文件起到了解释的作用。这是YUI框架的原因啊