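For example, I want to scrape information from Google.
I first search Google for "xx",
then take the domain name out of every result that comes back.
For example, if a result looks like this:
http://www.xxx.com/dddd.asp?.... I want to get www.xxx.com
How can I extract that value? I have already looked at this code:
http://blog.csdn.net/zou_seafarer/archive/2008/01/23/2061405.aspx
but scraping Google seems to be harder.
The page is laid out differently and contains a lot of junk!
Any help from the experts would be much appreciated!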
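
For the narrower part of the question, getting www.xxx.com out of a single URL, a minimal sketch in VB6 might look like the following. The function name GetHost is hypothetical, and the sketch assumes a plain host name with no user-info part and no IPv6 literal.

```vb
' Hypothetical helper: returns the host part of a URL.
Function GetHost(ByVal url As String) As String
    Dim p As Long
    ' Drop the scheme ("http://", "https://") if present.
    p = InStr(url, "://")
    If p > 0 Then url = Mid$(url, p + 3)
    ' Cut at the first path, query or fragment delimiter.
    p = InStr(url, "/")
    If p > 0 Then url = Left$(url, p - 1)
    p = InStr(url, "?")
    If p > 0 Then url = Left$(url, p - 1)
    p = InStr(url, "#")
    If p > 0 Then url = Left$(url, p - 1)
    ' Drop an explicit port such as ":8080".
    p = InStr(url, ":")
    If p > 0 Then url = Left$(url, p - 1)
    GetHost = url
End Function
```

Calling GetHost("http://www.xxx.com/dddd.asp?....") returns www.xxx.com. This only handles one URL at a time; finding the result URLs inside Google's HTML is a separate step.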

Solutions »

  1.   

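    In your VB program, either run the search through the WebBrowser control or talk to the site yourself over HTTP. Whatever the site sends back is just a string, so you can process it however you like.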
      

  2.   

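    Sub Getall()
        Dim x() As String, i As Long, j As Long, url As String
        url = "http://www.google.cn/search?num=100&q=" & InputBox("Please enter something to search:", "Info", "china")
        With CreateObject("Msxml2.XMLHTTP")
            ' Fetch up to six result pages, 100 results per page.
            For j = 0 To 5
                .Open "GET", url & "&start=" & j * 100, False   ' False = synchronous request
                .Send
                ' Split the page into result entries and keep only those whose displayed URL is www.<name>.com/
                x = Filter(Filter(Split(.responseText, "</a></nobr></div>"), "<br><span class=a>www."), ".com/ -")
                For i = 0 To UBound(x)
                    ' Take the text between "www." and ".com/ -", then strip the <b> highlight tags.
                    x(i) = Replace(Replace("www." & Split(Split(x(i), "<br><span class=a>www.")(1), ".com/ -")(0) & ".com", "<b>", ""), "</b>", "")
                    ' Skip names containing three or more dots (for example www.a.b.com).
                    If Not x(i) Like "*.*.*.*" Then Debug.Print x(i)
                Next
            Next
        End With
    End Sub

    The code returns:
    www.china.com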
    www.chinatoday.com
    www.chinanews.com
    www.cctv.com
    www.chinabig.com
    www.china-airlines.com
    www.chinavista.com
    www.chinamobile.com
    www.china-window.com
    www.chinatour.com
    www.googlechinablog.com
    www.chinats.com
    www.mtv-china.com
    www.china-pub.com
    www.cs-air.com
    www.century21cn.com
    www.gmchina.com
    www.phpchina.com
    www.ednchina.com
    www.ce-air.com
    www.fortunechina.com
    www.china-channel.com
    www.cswa.com
    www.ctshk.com
    www.eetchina.com
    www.chinapages.com
    www.h2o-china.com
    www.chinaxinhuaair.com
    www.travelchinaguide.com
    www.cmbchina.com
    www.expo-china.com
    www.cnwa.com
    www.east-china-fair.com
    www.cechinamag.com
    www.yhachina.com
    www.esmchina.com
    www.ellechina.com
    www.chinainfobank.com
    www.iic-china.com
    www.china-designer.com
    www.china-inc.com
    www.digitalchina.com
    www.chinabyte.com
    www.nflchina.com
    www.china-review.com
    www.chinaeducationexpo.com
    www.summerpalace-china.com
    www.china-drama.com
    www.china-insurance.com
    www.china-fire.com
    www.sourcecodechina.com
    www.china-training.com
    www.sinopec.com
    www.chinaunionpay.com
    www.china-bluestar.com
    www.scichina.com
    www.wowchina.com
    www.atexpochina.com
    www.china-flower.com
    www.aluminiumchina.com
    www.china-chigo.com
    www.china-cdt.com
    www.ed-china.com
    www.china-holiday.com
    www.nihao.com
    www.xinhuanet.com
    www.linuxworldchina.com
    www.cwtc.com
    www.china-gift.com
    www.chinashoesexpo.com
    www.chtf.com
    www.1stchina.com
    www.chinabadminton.com
    www.sohochina.com
    www.cetv.com
    www.ceramsoc.com
    www.cnipr.com
    www.chinafotopress.com
    www.china-lushan.com
    www.zyjl-china.com
    www.sd-china.com
    www.expatscn.com
    www.chinaenvironment.com
    www.dongfang-china.com
    www.china-weldnet.com
    www.leehom-cn.com
    www.promosalons-china.com
    www.chinae.com
    www.yz-china.com
    www.china-ef.com
    www.china-logisticsnet.com
    www.umlchina.com
    www.china-spacenews.com
    www.nari-china.com
    www.meetchina.com
    www.chinapharmex.com
    www.china-up.com
    www.ciec-expo.com
    www.cicgf.com
    www.chinabiodiversity.com
    www.chinaplaza.com
    www.avexchina.com
    www.lawinfochina.com
    www.made-in-china.com
    www.china-vn.com
    www.allinprint.com
    www.chinacareer.com
    www.chinaacademyofart.com
    www.china-htdl.com
    www.tdkchina.com
    www.china-ah.com
    www.snweb.com
    www.china-leather.com
    www.china-askpro.com
    www.china-fpsa.com
    www.baking-china.com
    www.china-arbitration.com
    www.china-shufajia.com
    www.horizon-china.com
    www.smartcards-china.com
    www.china-customs.com
    www.china-machine.com
    www.mozillaonline.com
    www.archivesandwarehouse.com
    www.citic.com
    www.sac-china.com
    www.china-kaidi.com
      

  3.   

    For this kind of problem,
    nothing beats the VBScript regular expression component!
    I have no code at hand, though. OP, try googling it.
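
    As a rough illustration of the regular expression approach mentioned above (not the replier's code), a sketch using the VBScript.RegExp component through late binding could look like this. The procedure name PrintHosts and the pattern are assumptions made for illustration. The pattern matches every absolute URL in the page, including Google's own links, so the output would still need filtering.

    ```vb
    ' Sketch: print each distinct host name found in a block of HTML.
    ' Late binding is used, so no project reference is required.
    Sub PrintHosts(ByVal html As String)
        Dim re As Object, m As Object, seen As Object, host As String
        Set re = CreateObject("VBScript.RegExp")
        Set seen = CreateObject("Scripting.Dictionary")
        re.Global = True        ' find every match, not just the first
        re.IgnoreCase = True
        ' Group 1 is everything between "http(s)://" and the next
        ' "/", ":", "?", "#", quote, angle bracket or whitespace.
        re.Pattern = "https?://([^/:?#""'<>\s]+)"
        For Each m In re.Execute(html)
            host = LCase$(m.SubMatches(0))
            If Not seen.Exists(host) Then
                seen.Add host, True
                Debug.Print host
            End If
        Next
    End Sub
    ```

    It could be called with the response text of an HTTP request or with the innerHTML of a WebBrowser document.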
      

  4.   

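    Private Sub Command1_Click()
        Dim x() As String, i As Long, j As Long, url As String
        url = "http://www.google.cn/search?num=100&q=" & InputBox("Please enter something to search:", "Info", "Olympic")
        ' Load up to six result pages, 100 results per page.
        For j = 0 To 5
            WebBrowser1.Navigate url & "&start=" & j * 100
            ' Wait until the page has finished loading; Busy alone can be False right after Navigate.
            Do While WebBrowser1.Busy Or WebBrowser1.ReadyState <> READYSTATE_COMPLETE
                DoEvents
            Loop
            ' Split the page into result entries and keep only those whose displayed URL is www.<name>.com/
            x = Filter(Filter(Split(WebBrowser1.Document.body.innerhtml, "</a></nobr></div>", , vbTextCompare), "<br><span class=a>www.", , vbTextCompare), ".com/ -", , vbTextCompare)
            For i = 0 To UBound(x)
                ' Take the text between "www." and ".com/ -", then strip the <b> highlight tags.
                x(i) = Replace(Replace("www." & Split(Split(x(i), "<br><span class=a>www.", , vbTextCompare)(1), ".com/ -", , vbTextCompare)(0) & ".com", "<b>", "", , , vbTextCompare), "</b>", "", , , vbTextCompare)
                ' Skip names containing three or more dots.
                If Not x(i) Like "*.*.*.*" Then List1.AddItem x(i)
            Next
        Next
    End Sub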
      

  5.   

    Thank you~~~
    Haha,
    CSDN really does have a lot of experts.
    Thanks in advance!!!