比如我要抓google中的信息
我先在google中输入"xx"
把返回的所有的数据中,取数据中域名
比如返回如下格式:事实恒等式
http://www.xxx.com/dddd.asp?....我要取www.xxx.com
请教如何取这样的值之前有参考过这个代码
http://blog.csdn.net/zou_seafarer/archive/2008/01/23/2061405.aspx
但是好象要抓google的比较的难
不太一样,很多垃圾信息!
请大侠帮下忙!
我先在google中输入"xx"
把返回的所有的数据中,取数据中域名
比如返回如下格式:事实恒等式
http://www.xxx.com/dddd.asp?....我要取www.xxx.com
请教如何取这样的值之前有参考过这个代码
http://blog.csdn.net/zou_seafarer/archive/2008/01/23/2061405.aspx
但是好象要抓google的比较的难
不太一样,很多垃圾信息!
请大侠帮下忙!
Dim x() As String, i As Long, j As Long, url As String
url = "http://www.google.cn/search?num=100&q=" & InputBox("Please enter something to search:", "Info", "china")
With CreateObject("Msxml2.XMLHTTP")
For j = 0 To 5
.Open "GET", url & "&start=" & j * 100, False
.Send
x = Filter(Filter(Split(.responseText, "</a></nobr></div>"), "<br><span class=a>www."), ".com/ -")
For i = 0 To UBound(x)
x(i) = Replace(Replace("www." & Split(Split(x(i), "<br><span class=a>www.")(1), ".com/ -")(0) & ".com", "<b>", ""), "</b>", "")
If Not x(i) Like "*.*.*.*" Then Debug.Print x(i)
Next
Next
End With
End Sub代码返回:www.china.com
www.chinatoday.com
www.chinanews.com
www.cctv.com
www.chinabig.com
www.china-airlines.com
www.chinavista.com
www.chinamobile.com
www.china-window.com
www.chinatour.com
www.googlechinablog.com
www.chinats.com
www.mtv-china.com
www.china-pub.com
www.cs-air.com
www.century21cn.com
www.gmchina.com
www.phpchina.com
www.ednchina.com
www.ce-air.com
www.fortunechina.com
www.china-channel.com
www.cswa.com
www.ctshk.com
www.eetchina.com
www.chinapages.com
www.h2o-china.com
www.chinaxinhuaair.com
www.travelchinaguide.com
www.cmbchina.com
www.expo-china.com
www.cnwa.com
www.east-china-fair.com
www.cechinamag.com
www.yhachina.com
www.esmchina.com
www.ellechina.com
www.chinainfobank.com
www.iic-china.com
www.china-designer.com
www.china-inc.com
www.digitalchina.com
www.chinabyte.com
www.nflchina.com
www.china-review.com
www.chinaeducationexpo.com
www.summerpalace-china.com
www.china-drama.com
www.china-insurance.com
www.china-fire.com
www.sourcecodechina.com
www.china-training.com
www.sinopec.com
www.chinaunionpay.com
www.china-bluestar.com
www.scichina.com
www.wowchina.com
www.atexpochina.com
www.china-flower.com
www.aluminiumchina.com
www.china-chigo.com
www.china-cdt.com
www.ed-china.com
www.china-holiday.com
www.nihao.com
www.xinhuanet.com
www.linuxworldchina.com
www.cwtc.com
www.china-gift.com
www.chinashoesexpo.com
www.chtf.com
www.1stchina.com
www.chinabadminton.com
www.sohochina.com
www.cetv.com
www.ceramsoc.com
www.cnipr.com
www.chinafotopress.com
www.china-lushan.com
www.zyjl-china.com
www.sd-china.com
www.expatscn.com
www.chinaenvironment.com
www.dongfang-china.com
www.china-weldnet.com
www.leehom-cn.com
www.promosalons-china.com
www.chinae.com
www.yz-china.com
www.china-ef.com
www.china-logisticsnet.com
www.umlchina.com
www.china-spacenews.com
www.nari-china.com
www.meetchina.com
www.chinapharmex.com
www.china-up.com
www.ciec-expo.com
www.cicgf.com
www.chinabiodiversity.com
www.chinaplaza.com
www.avexchina.com
www.lawinfochina.com
www.made-in-china.com
www.china-vn.com
www.allinprint.com
www.chinacareer.com
www.chinaacademyofart.com
www.china-htdl.com
www.tdkchina.com
www.china-ah.com
www.snweb.com
www.china-leather.com
www.china-askpro.com
www.china-fpsa.com
www.baking-china.com
www.china-arbitration.com
www.china-shufajia.com
www.horizon-china.com
www.smartcards-china.com
www.china-customs.com
www.china-machine.com
www.mozillaonline.com
www.archivesandwarehouse.com
www.citic.com
www.sac-china.com
www.china-kaidi.com
非VBS正则达式组件莫属!
没代码。。LZ GG下吧。
Dim x() As String, i As Long, j As Long, url As String
url = "http://www.google.cn/search?num=100&q=" & InputBox("Please enter something to search:", "Info", "Olympic")
For j = 0 To 5
WebBrowser1.Navigate url & "&start=" & j * 100
While WebBrowser1.Busy
DoEvents
Wend
x = Filter(Filter(Split(WebBrowser1.Document.body.innerhtml, "</a></nobr></div>", , vbTextCompare), "<br><span class=a>www.", , vbTextCompare), ".com/ -", , vbTextCompare)
For i = 0 To UBound(x)
x(i) = Replace(Replace("www." & Split(Split(x(i), "<br><span class=a>www.", , vbTextCompare)(1), ".com/ -", , vbTextCompare)(0) & ".com", "<b>", "", , , vbTextCompare), "</b>", "", , , vbTextCompare)
If Not x(i) Like "*.*.*.*" Then List1.AddItem x(i)
Next
NextEnd Sub
呵呵
还是csdn牛人多
先谢!!!