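Urgent, please help me out, everyone.
The target site requires logging in with a registered account before anything can be scraped.
Requirements: 1: PHP submits the POST data and then retrieves the content I want.
2: Ideally I call a URL like http://127.0.0.1/test.php?id=1 , where the 1 is a variable.
3: The POST data also contains a variable, for example aaaaaaaaaaaaa=1 , where the 1 is a variable.
I want to set the variable in http://127.0.0.1/test.php?id=1 and have the variable in the POST data take the same value. For example: when I visit http://127.0.0.1/test.php?id=5 the POST data automatically becomes aaaaaaaaaaaaa=5
When I visit http://127.0.0.1/test.php?id=6 the POST data automatically becomes aaaaaaaaaaaaa=6
......
......
When I pass http://127.0.0.1/test.php?id=1000 the POST data automatically becomes aaaaaaaaaaaaa=1000

Here is a packet capture for reference:

http://www.baidu.com/flow.php

POST /flow.php HTTP/1.1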
Host: www.baidu.com
User-Agent: Mozilla/5.0 (Windows NT 5.1; rv:12.0) Gecko/20100101 Firefox/12.0
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language: zh-cn,zh;q=0.8,en-us;q=0.5,en;q=0.3
Accept-Encoding: gzip, deflate
Connection: keep-alive
Referer: http://www.baidu.com/flow.php?step=consignee
Cookie: real_ipd=123.9.3.58; 1006079161_ID=27fe6f1019a9b989de8d3401ea188f714cd0e834; 1006079161[visit_times]=1; 1006079161[history]=94788%2C94683; 53kf_61423877_keyword=http://www.baidu.com/s?ie=utf-8&bs=site%3Ajojo2u.com&f=8&rsv_bp=1&rsv_spt=3&wd=site%3Akanggou.com&rsv_n=2&rsv_sug3=1&rsv_sug1=1&rsv_sug4=53&inputT=603; onliner_zdfq61423877=0; Hm_lvt_69218a49e43c7e68acc8f8ab6b614a53=1358251922; Hm_lpvt_69218a49e43c7e68acc8f8ab6b614a53=1358251980; TNuv=1358251921934; ac=http%3A//www.baidu.com/flow.php%3Fstep%3Dconsignee; CNZZDATA3559069=cnzz_eid=8609596-1358251912-http%253A%252F%252Fwww.baidu.com%252Fs%253Fie%253Dutf-8%2526bs%253Dsite%25253Ajojo2u.com%2526f%253D8%2526rsv_bp%253D1%2526rsv_spt%253D3%2526wd%253Dsite%25253Akanggou.com%2526rsv_n%253D2%2526rsv_sug3%253D1%2526rsv_sug1%253D1%2526rsv_sug4%253D53%2526inputT%253D603&ntime=1358251912&cnzz_a=5&retime=1358251980219&sin=http%253A%252F%252Fwww.baidu.com%252Fs%253Fie%253Dutf-8%2526bs%253Dsite%25253Ajojo2u.com%2526f%253D8%2526rsv_bp%253D1%2526rsv_spt%253D3%2526wd%253Dsite%25253Akanggou.com%2526rsv_n%253D2%2526rsv_sug3%253D1%2526rsv_sug1%253D1%2526rsv_sug4%253D53%2526inputT%253D603&ltime=1358251980219&rtime=0
Content-Type: application/x-www-form-urlencoded
Content-Length: 293

aaaaaaaaaaaaa=100

HTTP/1.1 302 Moved Temporarily
Server: XT-server/0.0
Date: Tue, 15 Jan 2013 12:13:11 GMT
Content-Type: text/html; charset=gbk
Transfer-Encoding: chunked
Connection: keep-alive
Cache-Control: private
Location: flow.php?step=checkout
----------------------------------------------------------
http://www.baidu.com/flow.php?step=checkout

GET /flow.php?step=checkout HTTP/1.1
Host: www.baidu.com
User-Agent: Mozilla/5.0 (Windows NT 5.1; rv:12.0) Gecko/20100101 Firefox/12.0
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language: zh-cn,zh;q=0.8,en-us;q=0.5,en;q=0.3
Accept-Encoding: gzip, deflate
Connection: keep-alive
Referer: http://www.baidu.com/flow.php?step=consignee
Cookie: real_ipd=123.9.3.58; 1006079161_ID=27fe6f1019a9b989de8d3401ea188f714cd0e834; 1006079161[visit_times]=1; 1006079161[history]=94788%2C94683; 53kf_61423877_keyword=http://www.baidu.com/s?ie=utf-8&bs=site%3Ajojo2u.com&f=8&rsv_bp=1&rsv_spt=3&wd=site%3Akanggou.com&rsv_n=2&rsv_sug3=1&rsv_sug1=1&rsv_sug4=53&inputT=603; onliner_zdfq61423877=0; Hm_lvt_69218a49e43c7e68acc8f8ab6b614a53=1358251922; Hm_lpvt_69218a49e43c7e68acc8f8ab6b614a53=1358251980; TNuv=1358251921934; ac=http%3A//www.baidu.com/flow.php%3Fstep%3Dconsignee; CNZZDATA3559069=cnzz_eid=8609596-1358251912-http%253A%252F%252Fwww.baidu.com%252Fs%253Fie%253Dutf-8%2526bs%253Dsite%25253Ajojo2u.com%2526f%253D8%2526rsv_bp%253D1%2526rsv_spt%253D3%2526wd%253Dsite%25253Akanggou.com%2526rsv_n%253D2%2526rsv_sug3%253D1%2526rsv_sug1%253D1%2526rsv_sug4%253D53%2526inputT%253D603&ntime=1358251912&cnzz_a=5&retime=1358251980219&sin=http%253A%252F%252Fwww.baidu.com%252Fs%253Fie%253Dutf-8%2526bs%253Dsite%25253Ajojo2u.com%2526f%253D8%2526rsv_bp%253D1%2526rsv_spt%253D3%2526wd%253Dsite%25253Akanggou.com%2526rsv_n%253D2%2526rsv_sug3%253D1%2526rsv_sug1%253D1%2526rsv_sug4%253D53%2526inputT%253D603&ltime=1358251980219&rtime=0

HTTP/1.1 200 OK
Server: XT-server/0.0
Date: Tue, 15 Jan 2013 12:13:12 GMT
Content-Type: text/html; charset=gbk
Transfer-Encoding: chunked
Connection: keep-alive
Cache-Control: private
Content-Encoding: gzip

PHP: simulate a POST submission and fetch the data
function getMyPage($u) {
    if ($u == '') {
        return '';
    }
    $r = parse_url($u);
    $Cai = curl_init();
    $Opt = array(
        CURLOPT_URL => $u,
        CURLOPT_REFERER => $r['scheme'].'://'.$r['host'],
        CURLOPT_FAILONERROR => 1,
        CURLOPT_FOLLOWLOCATION => 1,
        CURLOPT_RETURNTRANSFER => 1, // return the response as a string instead of printing it directly
        CURLOPT_TIMEOUT => 30,
        CURLOPT_SSL_VERIFYHOST => 2, // check that the certificate matches the host name (the value 1 is no longer supported)
        CURLOPT_SSL_VERIFYPEER => 0, // 0 turns off verification of the certificate itself; leave it on outside of testing
        //CURLOPT_PROXY => '222.73.173.50:8080',
        //CURLOPT_PROXYUSERPWD => 'username:password',
        //CURLOPT_PORT => 80, // port
        //CURLOPT_POST => 1, // send a regular POST request of type application/x-www-form-urlencoded, just like a form submission
        //CURLOPT_POSTFIELDS => $postfields, // all data to send, e.g. 'g=yes&username=see7di&password=www.7di.net999&id=&code=' (old PHP versions uploaded a file with an @-prefixed file name; PHP 5.5+ uses CURLFile)
        //CURLOPT_COOKIE => GetCook($url),
        //CURLOPT_USERAGENT => 'Mozilla/5.0 (Windows NT 6.1; WOW64; rv:9.0.1) Gecko/20100101 Firefox/9.0.1',
        //CURLOPT_COOKIEJAR => dirname(__FILE__).$cookie_jar,
        //CURLOPT_COOKIEFILE => dirname(__FILE__).$cookie_jar,
    );
    curl_setopt_array($Cai, $Opt);
    $html = curl_exec($Cai);
    // curl_exec() returns false on failure; an empty page is not an error
    if ($html === false) {
        die('<br>Error number: '.curl_errno($Cai).'<br>Error: '.curl_error($Cai));
    }
    curl_close($Cai);
    return $html;
}
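Since the target site requires a login, the commented-out POST and cookie-jar options in the function above are the relevant ones. The following is a minimal sketch of how they could be combined, not a drop-in solution: the helper names `encodeFormFields` and `requestWithSession`, the login URL, and the login field names are all hypothetical and would have to be replaced with what the real login form uses.

```php
<?php
// Assumption: helper names, URLs and login field names are illustrative only.

// Encode form fields as application/x-www-form-urlencoded.
function encodeFormFields(array $fields)
{
    return http_build_query($fields, '', '&');
}

// Perform one request that shares cookies through the file $cookieJar.
// Pass $postFields = null for a plain GET request.
function requestWithSession($url, $cookieJar, $postFields = null)
{
    $ch = curl_init($url);
    $options = array(
        CURLOPT_RETURNTRANSFER => true,
        CURLOPT_FOLLOWLOCATION => true,
        CURLOPT_TIMEOUT        => 30,
        CURLOPT_COOKIEFILE     => $cookieJar, // cookies are read from this file
        CURLOPT_COOKIEJAR      => $cookieJar, // cookies are written back when the handle is released
    );
    if ($postFields !== null) {
        $options[CURLOPT_POST]       = true;
        $options[CURLOPT_POSTFIELDS] = encodeFormFields($postFields);
    }
    curl_setopt_array($ch, $options);

    $html  = curl_exec($ch);
    $error = curl_error($ch);
    unset($ch); // release the handle so the cookie jar is flushed to disk

    if ($html === false) {
        throw new RuntimeException('curl error: ' . $error);
    }
    return $html;
}

// Usage sketch (hypothetical URLs and fields, not executed here):
// $jar = dirname(__FILE__) . '/cookies.txt';
// requestWithSession('http://example.com/login.php', $jar,
//     array('username' => 'me', 'password' => 'secret'));
// echo requestWithSession('http://example.com/flow.php', $jar,
//     array('aaaaaaaaaaaaa' => 5));
```

The first call logs in and stores the session cookies in the jar; every later call that points at the same jar file is sent as the logged-in user.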
$url = 'http://www.sina.com.cn';
$user_id = 1; // placeholder: set this to the value you want to send
$postData = array(
    'user_id' => $user_id
);
$curlPost = http_build_query($postData); // encode the fields as application/x-www-form-urlencoded
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch, CURLOPT_HEADER, 0);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
curl_setopt($ch, CURLOPT_POST, 1);
curl_setopt($ch, CURLOPT_POSTFIELDS, $curlPost);
$info = curl_exec($ch); // the response body, or false on failure
curl_close($ch);
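To connect this to what the question asks for, the `id` from `test.php?id=N` can be validated and reused as the value of the `aaaaaaaaaaaaa` POST field. The sketch below shows one possible way, under stated assumptions: the helper names `buildPostBody` and `postAndFetch` are made up for illustration, the cookie string is a placeholder to be copied from a logged-in browser session, and the target URL, referer and field name are simply taken from the capture as posted.

```php
<?php
// Assumption: helper names and the cookie placeholder are illustrative only.

// Turn the id from the query string into the POST body.
// Returns null unless the id consists of digits only.
function buildPostBody($id, $field = 'aaaaaaaaaaaaa')
{
    if (!is_string($id) || !preg_match('/^\d+\z/', $id)) {
        return null;
    }
    return http_build_query(array($field => $id), '', '&');
}

// Send the POST with the session cookie and return the final page.
function postAndFetch($url, $body, $cookie)
{
    $ch = curl_init($url);
    curl_setopt_array($ch, array(
        CURLOPT_POST           => true,
        CURLOPT_POSTFIELDS     => $body,
        CURLOPT_COOKIE         => $cookie, // value of the Cookie header of a logged-in session
        CURLOPT_REFERER        => 'http://www.baidu.com/flow.php?step=consignee',
        CURLOPT_FOLLOWLOCATION => true,    // follow the 302 to flow.php?step=checkout
        CURLOPT_ENCODING       => '',      // accept compressed responses and decode them
        CURLOPT_RETURNTRANSFER => true,
        CURLOPT_TIMEOUT        => 30,
    ));
    $html  = curl_exec($ch);
    $error = curl_error($ch);
    if ($html === false) {
        throw new RuntimeException('curl error: ' . $error);
    }
    return $html;
}

// test.php?id=5 sends aaaaaaaaaaaaa=5, test.php?id=6 sends aaaaaaaaaaaaa=6, and so on.
if (isset($_GET['id'])) {
    $body = buildPostBody($_GET['id']);
    if ($body === null) {
        http_response_code(400);
        exit('id must be a number');
    }
    $cookie = 'PASTE_COOKIE_HEADER_VALUE_HERE'; // placeholder
    $html = postAndFetch('http://www.baidu.com/flow.php', $body, $cookie);
    header('Content-Type: text/html; charset=gbk'); // the capture shows a GBK page
    echo $html;
}
```

Restricting `id` to digits keeps arbitrary input from being forwarded to the target site. By default libcurl switches to GET when it follows a 302, which matches the second request in the capture.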
Here is a complete curl example; have a look and study it:
http://bbs.csdn.net/topics/260025593
If it is still unclear, join the group to discuss: 223494678