想获取一个网站上所有栏目的链接该怎么做呢 想获取一个网站上所有栏目的链接该怎么做呢,包括栏目下的相关栏目请高手指点下谢谢 解决方案 » 免费领取超大流量手机卡,每月29元包185G流量+100分钟通话, 中国电信官方发货 private void button1_Click(object sender, EventArgs e) { for (int i = 0; i < webBrowser1.Document.Links.Count;i++ ) { string str = webBrowser1.Document.Links[i].OuterHtml; int a1 = str.IndexOf("href=\"") + 6; int a2 = str.LastIndexOf( "\" target"); listBox1.Items.Add(str.Substring(a1, a2-a1)); } } 有html么可以正则匹配出所有的a或者用dom加载,然后拉出所有的超链接 用正则表达式匹配<a href=''></a> 抓取页面通过正则获取连接和文字System.Net.HttpWebRequest request = (System.Net.HttpWebRequest)System.Net.WebRequest.Create(url); request.UserAgent = "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 2.0.50727; .NET CLR 3.0.04506.648; .NET CLR 3.5.21022)"; System.Net.WebResponse response = request.GetResponse(); System.IO.Stream resStream = response.GetResponseStream(); System.IO.StreamReader sr = new System.IO.StreamReader(resStream, encoding); string html = (sr.ReadToEnd()); resStream.Close(); sr.Close(); string strPattern=@"a[\s]+href=(? <Link>[^\s>]+)[^>]*>(? <Text>[^ <]*) </a>"; MatchCollection Matches=Regex.Matches(html,strPattern,RegexOptions.IgnoreCase|RegexOptions.Compiled); foreach(Match mc in Matches) { Console.Write(mc.Groups["Link"].Value.ToString().Trim()); Console.Write(mc.Groups["Text"].Value.ToString().Trim()); } 急急急!!!!在线等 c# ? 大家说说学习.net的心得!!! ContextMenuStrip的二级菜单怎么访问 紧急求助c#问题!!快来帮忙呀!! C#硬件按钮拍照,截取底层按钮按下windows消息怎么做? GGJJ:如何进行广告轮流播放? 如何结合这两种SQL方法? 请问如何将这个DataSet(XML信息)表中数据保存到数据库的表中? 谁能告诉我,ilasm.exe在那里有啊!!!急用(在线等待) 用c#开发一个模拟3D场景开车驾驶的程序 求解决方案 C#的一个很简单的问题。。请高手帮忙。小弟无尽感激
{
for (int i = 0; i < webBrowser1.Document.Links.Count;i++ )
{
string str = webBrowser1.Document.Links[i].OuterHtml;
int a1 = str.IndexOf("href=\"") + 6;
int a2 = str.LastIndexOf( "\" target");
listBox1.Items.Add(str.Substring(a1, a2-a1));
}
}
或者用dom加载,然后拉出所有的超链接
System.Net.HttpWebRequest request = (System.Net.HttpWebRequest)System.Net.WebRequest.Create(url);
request.UserAgent = "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 2.0.50727; .NET CLR 3.0.04506.648; .NET CLR 3.5.21022)";
System.Net.WebResponse response = request.GetResponse();
System.IO.Stream resStream = response.GetResponseStream();
System.IO.StreamReader sr = new System.IO.StreamReader(resStream, encoding);
string html = (sr.ReadToEnd());
resStream.Close();
sr.Close();
string strPattern=@"a[\s]+href=(? <Link>[^\s>]+)[^>]*>(? <Text>[^ <]*) </a>";
MatchCollection Matches=Regex.Matches(html,strPattern,RegexOptions.IgnoreCase|RegexOptions.Compiled);
foreach(Match mc in Matches)
{
Console.Write(mc.Groups["Link"].Value.ToString().Trim());
Console.Write(mc.Groups["Text"].Value.ToString().Trim());
}