~~~~~~如何提取视频网站中,网页里面的视频的地址??? 只能知道是 /player.swf?vid=UNKSnz87GrWQOez2L/NyCg&flag=1&p=96313&k=8,具体的flv文件是没法得到的 解决方案 » 免费领取超大流量手机卡,每月29元包185G流量+100分钟通话, 中国电信官方发货 用抓包工具得到FLV文件源地址.再分析.如果与写义的JS变量有关系,就直接用.如果与JS变量无关.那么,就要得到player.swf 文件,再取出 ActionScript 再分析了. 6rooms 的 flv 地址我曾经抓过,你需要先分析一下他的包,然后就可以用socket来模拟播放,从而分析server 返回的内容得到 flv 地址了,不过这些网站经常变的,采集规则也需要实时调整,否则就会采不到内容了,下面是采集 iask 的视频的 flv 地址的记录:采集列表页面地址: http://v.iask.com/c/%D7%D4%C5%C4/mr/1---------- php-5.1.2.debug ----------建立 CollectorFactory 建立 CollectorFactory 成功建立 iask.com --- ListCollectorListCollector 建立 iask.com --- ListCollector 成功采集 iask.com --- http://v.iask.com/c/%D7%D4%C5%C4/mr/1Use Host: v.iask.comcreate socket succeed v.iask.comSending request:GET /c/%D7%D4%C5%C4/mr/1 HTTP/1.1Accept: image/gif, image/x-xbitmap, image/jpeg, image/pjpeg, application/msword, application/x-shockwave-flash, application/vnd.ms-excel, application/vnd.ms-powerpoint, */*Accept-Language: zh-cnAccept-Encoding: gzip, deflateUser-Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)Host: v.iask.comConnection: CloseReading response:HTTP/1.1 200 OKDate: Thu, 28 Jun 2007 00:36:16 GMTServer: Apache/2.0.59 (Unix) PHP/4.4.3X-Powered-By: PHP/4.4.3Set-Cookie: PHPSESSID=a9710af20887538f4dbbfe50ba279f24; path=/Expires: Thu, 19 Nov 1981 08:52:00 GMTCache-Control: no-store, no-cache, must-revalidate, post-check=0, pre-check=0Pragma: no-cacheVary: Accept-EncodingContent-Encoding: gzipContent-Length: 4540Connection: closeContent-Type: text/html; charset=GB2312 采集 iask.com --- http://v.iask.com/c/%D7%D4%C5%C4/mr/1 成功Array( [max] => 1536888 [min] => 1031400 [id] => Array ( [0] => 1536888 [1] => 1534550 [2] => 1071033 [3] => 1070687 [4] => 1063903 [5] => 1063870 [6] => 1063867 [7] => 1063632 [8] => 1048791 [9] => 1040141 [10] => 1036389 [11] => 1035601 [12] => 1034794 [13] => 1031977 [14] => 1031405 [15] => 1031400 ) [link] => Array ( [0] => /b/1536888.html [1] => /b/1534550.html [2] => /b/1071033.html [3] => /b/1070687.html [4] => /b/1063903.html [5] => /b/1063870.html [6] => /b/1063867.html [7] => /b/1063632.html [8] => /b/1048791.html [9] => /b/1040141.html [10] => /b/1036389.html [11] => /b/1035601.html [12] => /b/1034794.html [13] => /b/1031977.html [14] => /b/1031405.html [15] => /b/1031400.html ) [thumb] => Array ( [0] => http://image2.sina.com.cn/kusou/v/8/88/88/1536888_1.jpg [1] => http://image2.sina.com.cn/kusou/v/0/50/50/1534550_1.jpg [2] => http://image2.sina.com.cn/kusou/v/3/33/33/1071033_1.jpg [3] => http://image2.sina.com.cn/kusou/v/7/87/87/1070687_1.jpg [4] => http://image2.sina.com.cn/kusou/v/3/3/3/1063903_1.jpg [5] => http://image2.sina.com.cn/kusou/v/0/70/70/1063870_1.jpg [6] => http://image2.sina.com.cn/kusou/v/7/67/67/1063867_1.jpg [7] => http://image2.sina.com.cn/kusou/v/2/32/32/1063632_1.jpg [8] => http://image2.sina.com.cn/kusou/v/1/91/91/1048791_1.jpg [9] => http://image2.sina.com.cn/kusou/v/1/41/41/1040141_1.jpg [10] => http://image2.sina.com.cn/kusou/v/9/89/89/1036389_1.jpg [11] => http://image2.sina.com.cn/kusou/v/1/1/1/1035601_1.jpg [12] => http://image2.sina.com.cn/kusou/v/4/94/94/1034794_1.jpg [13] => http://image2.sina.com.cn/kusou/v/7/77/77/1031977_1.jpg [14] => http://image2.sina.com.cn/kusou/v/5/5/5/1031405_1.jpg [15] => http://image2.sina.com.cn/kusou/v/0/0/0/1031400_1.jpg ))建立 iask.com --- DeailCollectorDetailCollector 建立 iask.com --- DetailCollector 成功采集 iask.com --- detail --- /b/1040141.htmlUse Host: v.iask.comcreate socket succeed v.iask.comSending request:GET /b/1040141.html HTTP/1.1Accept: image/gif, image/x-xbitmap, image/jpeg, image/pjpeg, application/msword, application/x-shockwave-flash, application/vnd.ms-excel, application/vnd.ms-powerpoint, */*Accept-Language: zh-cnAccept-Encoding: gzip, deflateUser-Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)Host: v.iask.comConnection: CloseReading response:HTTP/1.1 200 OKDate: Thu, 28 Jun 2007 00:36:16 GMTServer: Apache/2.0.59 (Unix) PHP/4.4.3X-Powered-By: PHP/4.4.3Set-Cookie: iask_v_vihi=x1040141; expires=Sat, 28 Jul 2007 00:36:16 GMTSet-Cookie: PHPSESSID=701b7a699252cc667fde17542a9779f8; path=/Expires: Thu, 19 Nov 1981 08:52:00 GMTCache-Control: no-store, no-cache, must-revalidate, post-check=0, pre-check=0Pragma: no-cacheVary: Accept-EncodingContent-Encoding: gzipContent-Length: 6943Connection: closeContent-Type: text/html; charset=GB2312<br><br>采集到:http://v.iask.com/b/vs_player.swf?vid=1040141<br>Use Host: v.iask.comcreate socket succeed v.iask.comSending request:GET /v_ask.php?test=0&vid=1040141 HTTP/1.1Accept: */*Referer: http://v.iask.com/b/vs_player.swf?vid=1040141x-flash-version: 9,0,16,0Accept-Encoding: gzip, deflateUser-Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)Host: v.iask.comConnection: CloseReading response:HTTP/1.1 200 OKDate: Thu, 28 Jun 2007 00:36:17 GMTServer: Apache/2.0.59 (Unix) PHP/4.4.3X-Powered-By: PHP/4.4.3Vary: Accept-EncodingContent-Encoding: gzipContent-Length: 180Connection: closeContent-Type: text/html; charset=GB2312 采集 iask.com --- detail --- /b/1040141.html 成功Array( [name] => 回应:朋友别哭-吕方 [type] => flv [swf] => http://v.iask.com/b/vs_player.swf?vid=1040141 [flv] => http://d19.v.iask.com/f/1/2087c58be71390bf1e655c7f5055c6e31040141.flv [length] => 48 [intro] => )normal exitTotal time used: 1.0520489215851Output completed (1 sec consumed) - Normal Termination 虽然只有两三行,但我还是不懂。求解。标题。要短。 帮忙近来看下,在线等。 怎么样用XMLHTTP方法提交表单数据,使用POST方法 【闭包问题】 关于javascrip的赋值问题 问题:F5能屏蔽掉吗? 如何做到网站主页刚出来就显示一个页面,或一个图片,几秒钟就没了!!! 这个递归问题出在哪,请高手帮忙看看 WdatePicker怎么输整点 JavaScript正则表达式问题 紧急求助getElementsByTagName怎样获得包含特殊字符的字段 为什么是number,而不是3呢?
如果与写义的JS变量有关系,就直接用.如果与JS变量无关.那么,就要得到player.swf 文件,再取出 ActionScript 再分析了.
建立 CollectorFactory 成功建立 iask.com --- ListCollectorListCollector
建立 iask.com --- ListCollector 成功采集 iask.com --- http://v.iask.com/c/%D7%D4%C5%C4/mr/1
Use Host: v.iask.comcreate socket succeed v.iask.com
Sending request:GET /c/%D7%D4%C5%C4/mr/1 HTTP/1.1
Accept: image/gif, image/x-xbitmap, image/jpeg, image/pjpeg, application/msword, application/x-shockwave-flash, application/vnd.ms-excel, application/vnd.ms-powerpoint, */*
Accept-Language: zh-cn
Accept-Encoding: gzip, deflate
User-Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)
Host: v.iask.com
Connection: CloseReading response:HTTP/1.1 200 OK
Date: Thu, 28 Jun 2007 00:36:16 GMT
Server: Apache/2.0.59 (Unix) PHP/4.4.3
X-Powered-By: PHP/4.4.3
Set-Cookie: PHPSESSID=a9710af20887538f4dbbfe50ba279f24; path=/
Expires: Thu, 19 Nov 1981 08:52:00 GMT
Cache-Control: no-store, no-cache, must-revalidate, post-check=0, pre-check=0
Pragma: no-cache
Vary: Accept-Encoding
Content-Encoding: gzip
Content-Length: 4540
Connection: close
Content-Type: text/html; charset=GB2312
采集 iask.com --- http://v.iask.com/c/%D7%D4%C5%C4/mr/1 成功
Array
(
[max] => 1536888
[min] => 1031400
[id] => Array
(
[0] => 1536888
[1] => 1534550
[2] => 1071033
[3] => 1070687
[4] => 1063903
[5] => 1063870
[6] => 1063867
[7] => 1063632
[8] => 1048791
[9] => 1040141
[10] => 1036389
[11] => 1035601
[12] => 1034794
[13] => 1031977
[14] => 1031405
[15] => 1031400
) [link] => Array
(
[0] => /b/1536888.html
[1] => /b/1534550.html
[2] => /b/1071033.html
[3] => /b/1070687.html
[4] => /b/1063903.html
[5] => /b/1063870.html
[6] => /b/1063867.html
[7] => /b/1063632.html
[8] => /b/1048791.html
[9] => /b/1040141.html
[10] => /b/1036389.html
[11] => /b/1035601.html
[12] => /b/1034794.html
[13] => /b/1031977.html
[14] => /b/1031405.html
[15] => /b/1031400.html
) [thumb] => Array
(
[0] => http://image2.sina.com.cn/kusou/v/8/88/88/1536888_1.jpg
[1] => http://image2.sina.com.cn/kusou/v/0/50/50/1534550_1.jpg
[2] => http://image2.sina.com.cn/kusou/v/3/33/33/1071033_1.jpg
[3] => http://image2.sina.com.cn/kusou/v/7/87/87/1070687_1.jpg
[4] => http://image2.sina.com.cn/kusou/v/3/3/3/1063903_1.jpg
[5] => http://image2.sina.com.cn/kusou/v/0/70/70/1063870_1.jpg
[6] => http://image2.sina.com.cn/kusou/v/7/67/67/1063867_1.jpg
[7] => http://image2.sina.com.cn/kusou/v/2/32/32/1063632_1.jpg
[8] => http://image2.sina.com.cn/kusou/v/1/91/91/1048791_1.jpg
[9] => http://image2.sina.com.cn/kusou/v/1/41/41/1040141_1.jpg
[10] => http://image2.sina.com.cn/kusou/v/9/89/89/1036389_1.jpg
[11] => http://image2.sina.com.cn/kusou/v/1/1/1/1035601_1.jpg
[12] => http://image2.sina.com.cn/kusou/v/4/94/94/1034794_1.jpg
[13] => http://image2.sina.com.cn/kusou/v/7/77/77/1031977_1.jpg
[14] => http://image2.sina.com.cn/kusou/v/5/5/5/1031405_1.jpg
[15] => http://image2.sina.com.cn/kusou/v/0/0/0/1031400_1.jpg
))建立 iask.com --- DeailCollectorDetailCollector
建立 iask.com --- DetailCollector 成功采集 iask.com --- detail --- /b/1040141.html
Use Host: v.iask.comcreate socket succeed v.iask.com
Sending request:GET /b/1040141.html HTTP/1.1
Accept: image/gif, image/x-xbitmap, image/jpeg, image/pjpeg, application/msword, application/x-shockwave-flash, application/vnd.ms-excel, application/vnd.ms-powerpoint, */*
Accept-Language: zh-cn
Accept-Encoding: gzip, deflate
User-Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)
Host: v.iask.com
Connection: CloseReading response:HTTP/1.1 200 OK
Date: Thu, 28 Jun 2007 00:36:16 GMT
Server: Apache/2.0.59 (Unix) PHP/4.4.3
X-Powered-By: PHP/4.4.3
Set-Cookie: iask_v_vihi=x1040141; expires=Sat, 28 Jul 2007 00:36:16 GMT
Set-Cookie: PHPSESSID=701b7a699252cc667fde17542a9779f8; path=/
Expires: Thu, 19 Nov 1981 08:52:00 GMT
Cache-Control: no-store, no-cache, must-revalidate, post-check=0, pre-check=0
Pragma: no-cache
Vary: Accept-Encoding
Content-Encoding: gzip
Content-Length: 6943
Connection: close
Content-Type: text/html; charset=GB2312<br><br>采集到:http://v.iask.com/b/vs_player.swf?vid=1040141<br>
Use Host: v.iask.comcreate socket succeed v.iask.com
Sending request:GET /v_ask.php?test=0&vid=1040141 HTTP/1.1
Accept: */*
Referer: http://v.iask.com/b/vs_player.swf?vid=1040141
x-flash-version: 9,0,16,0
Accept-Encoding: gzip, deflate
User-Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)
Host: v.iask.com
Connection: CloseReading response:HTTP/1.1 200 OK
Date: Thu, 28 Jun 2007 00:36:17 GMT
Server: Apache/2.0.59 (Unix) PHP/4.4.3
X-Powered-By: PHP/4.4.3
Vary: Accept-Encoding
Content-Encoding: gzip
Content-Length: 180
Connection: close
Content-Type: text/html; charset=GB2312
采集 iask.com --- detail --- /b/1040141.html 成功
Array
(
[name] => 回应:朋友别哭-吕方
[type] => flv
[swf] => http://v.iask.com/b/vs_player.swf?vid=1040141
[flv] => http://d19.v.iask.com/f/1/2087c58be71390bf1e655c7f5055c6e31040141.flv
[length] => 48
[intro] =>
)normal exit
Total time used: 1.0520489215851
Output completed (1 sec consumed) - Normal Termination