目标页面如下</div>
<div id="rs"><table cellpadding="0"><tr><th rowspan="2" class="tt">友情链接</th><th><a href="s?wd=%C1%F7%C1%BF%20seo&rsp=0&oq=seo&f=1&rsv_ers=xn1&rs_src=0">小牛论坛</a>
</th><td></td><th><a href="fdw%D3%C5%BB%AF&rsp=1&oq=sdf&f=1&rsv_ers=xn1&rs_src=0">126邮箱 </a>
</th><td></td><th><a href="wtwet9%A4%BE%DF&rsp=2&oaasd&f=1&rsv_ers=xn1&rs_src=0">汽车之家 </a>
</th><td></td><th><a href="sfasf4%BA%BAseo&rsp=3&odfa1&rsv_ers=xn1&rs_src=0">武汉 交友</a>
</th><td></td><th><a href="sadfas3%D6%DDseo&rsp=4&oqdsfsa=1&rsv_ers=xn1&rs_src=0">广州信息</a>
</th></tr><tr><th><a href="dfasdfB1%B1%BE%A9seo&rsp=5&oq=seo&f=1&rsv_ers=xn1&rs_src=0">北京之家</a>
</th><td></td><th><a href="dfasfaD%B8seo&rsp=6&oq=ssfa&rsv_ers=xn1&rs_src=0">好多美女</a>
</th><td></td><th><a href="dfs2%DB%CC%B3&rsp=sdfa=1&rsv_ers=xn1&rs_src=0">湘 菜款</a>
</th><td></td><th><a href="sdf%C5%E0%D1%B5&rsp=8&oqdsfa1&rsv_ers=xn1&rs_src=0">在线资料</a>
</th><td></td><th><a href="dsfA%AF%BC%D2%D7%AFseo&rspfsdfeo&f=1&rsv_ers=xn1&rs_src=0">五笔之家</a>
</th></tr></table></div>正则表达式的值怎么写才得到以下结果:
126邮箱
汽车 之家
武汉 交友
广州信息
北京之家
好多美女
湘 菜款
在线资料
五笔之家我用的表达式是:
static ArrayList GetPageKeywords(string htmlCode)
{
ArrayList al = new ArrayList();
string strRegex = "(<a href=\"s?w.*?>).*?(</a>)"; //正则表达式; 这句取值失败,..
Regex r = new Regex(strRegex, RegexOptions.IgnoreCase);
MatchCollection mc = r.Matches(htmlCode);
foreach (Match m in mc)
al.Add(m);
return al;
}请说说下,怎么取才成功得到上面那结果
<div id="rs"><table cellpadding="0"><tr><th rowspan="2" class="tt">友情链接</th><th><a href="s?wd=%C1%F7%C1%BF%20seo&rsp=0&oq=seo&f=1&rsv_ers=xn1&rs_src=0">小牛论坛</a>
</th><td></td><th><a href="fdw%D3%C5%BB%AF&rsp=1&oq=sdf&f=1&rsv_ers=xn1&rs_src=0">126邮箱 </a>
</th><td></td><th><a href="wtwet9%A4%BE%DF&rsp=2&oaasd&f=1&rsv_ers=xn1&rs_src=0">汽车之家 </a>
</th><td></td><th><a href="sfasf4%BA%BAseo&rsp=3&odfa1&rsv_ers=xn1&rs_src=0">武汉 交友</a>
</th><td></td><th><a href="sadfas3%D6%DDseo&rsp=4&oqdsfsa=1&rsv_ers=xn1&rs_src=0">广州信息</a>
</th></tr><tr><th><a href="dfasdfB1%B1%BE%A9seo&rsp=5&oq=seo&f=1&rsv_ers=xn1&rs_src=0">北京之家</a>
</th><td></td><th><a href="dfasfaD%B8seo&rsp=6&oq=ssfa&rsv_ers=xn1&rs_src=0">好多美女</a>
</th><td></td><th><a href="dfs2%DB%CC%B3&rsp=sdfa=1&rsv_ers=xn1&rs_src=0">湘 菜款</a>
</th><td></td><th><a href="sdf%C5%E0%D1%B5&rsp=8&oqdsfa1&rsv_ers=xn1&rs_src=0">在线资料</a>
</th><td></td><th><a href="dsfA%AF%BC%D2%D7%AFseo&rspfsdfeo&f=1&rsv_ers=xn1&rs_src=0">五笔之家</a>
</th></tr></table></div>正则表达式的值怎么写才得到以下结果:
126邮箱
汽车 之家
武汉 交友
广州信息
北京之家
好多美女
湘 菜款
在线资料
五笔之家我用的表达式是:
static ArrayList GetPageKeywords(string htmlCode)
{
ArrayList al = new ArrayList();
string strRegex = "(<a href=\"s?w.*?>).*?(</a>)"; //正则表达式; 这句取值失败,..
Regex r = new Regex(strRegex, RegexOptions.IgnoreCase);
MatchCollection mc = r.Matches(htmlCode);
foreach (Match m in mc)
al.Add(m);
return al;
}请说说下,怎么取才成功得到上面那结果
{
ArrayList al = new ArrayList();
string strRegex = "(?i)(?<=<a[^>]*?>)[^<>]+?((?=</a>))";
Regex r = new Regex(strRegex, RegexOptions.IgnoreCase);
MatchCollection mc = r.Matches(htmlCode);
foreach (Match m in mc)
al.Add(m.Value);
return al;
}
MatchCollection matches=Regex.Matches(s, @"(?is)(?<=<div[^>]+id=""rs""[^>]*>.+?<a[^>]+>).+?(?=</a>.+?</div>)");
foreach (Match match in matches)
Response.Write(match.Value + "<br/>");
输出:
小牛论坛
126邮箱
汽车之家
武汉 交友
广州信息
北京之家
好多美女
湘 菜款
在线资料
五笔之家
/*
</div>
<div id="rs"><table cellpadding="0"><tr><th rowspan="2" class="tt">友情链接</th><th><a href="s?wd=%C1%F7%C1%BF%20seo&rsp=0&oq=seo&f=1&rsv_ers=xn1&rs_src=0">小牛论坛</a>
</th><td></td><th><a href="fdw%D3%C5%BB%AF&rsp=1&oq=sdf&f=1&rsv_ers=xn1&rs_src=0">126邮箱 </a>
</th><td></td><th><a href="wtwet9%A4%BE%DF&rsp=2&oaasd&f=1&rsv_ers=xn1&rs_src=0">汽车之家 </a>
</th><td></td><th><a href="sfasf4%BA%BAseo&rsp=3&odfa1&rsv_ers=xn1&rs_src=0">武汉 交友</a>
</th><td></td><th><a href="sadfas3%D6%DDseo&rsp=4&oqdsfsa=1&rsv_ers=xn1&rs_src=0">广州信息</a>
</th></tr><tr><th><a href="dfasdfB1%B1%BE%A9seo&rsp=5&oq=seo&f=1&rsv_ers=xn1&rs_src=0">北京之家</a>
</th><td></td><th><a href="dfasfaD%B8seo&rsp=6&oq=ssfa&rsv_ers=xn1&rs_src=0">好多美女</a>
</th><td></td><th><a href="dfs2%DB%CC%B3&rsp=sdfa=1&rsv_ers=xn1&rs_src=0">湘 菜款</a>
</th><td></td><th><a href="sdf%C5%E0%D1%B5&rsp=8&oqdsfa1&rsv_ers=xn1&rs_src=0">在线资料</a>
</th><td></td><th><a href="dsfA%AF%BC%D2%D7%AFseo&rspfsdfeo&f=1&rsv_ers=xn1&rs_src=0">五笔之家</a>
</th></tr></table></div>
*/结果
/*
1 <th><a href="s?wd=%C1%F7%C1%BF%20seo&rsp=0&oq=seo&f=1&rsv_ers=xn1&rs_src=0">小牛论坛</a>
</th> 小牛论坛
2 <th><a href="fdw%D3%C5%BB%AF&rsp=1&oq=sdf&f=1&rsv_ers=xn1&rs_src=0">126邮箱 </a>
</th> 126邮箱
3 <th><a href="wtwet9%A4%BE%DF&rsp=2&oaasd&f=1&rsv_ers=xn1&rs_src=0">汽车之家 </a>
</th> 汽车之家
4 <th><a href="sfasf4%BA%BAseo&rsp=3&odfa1&rsv_ers=xn1&rs_src=0">武汉 交友</a>
</th> 武汉 交友
5 <th><a href="sadfas3%D6%DDseo&rsp=4&oqdsfsa=1&rsv_ers=xn1&rs_src=0">广州信息</a>
</th> 广州信息
6 <th><a href="dfasdfB1%B1%BE%A9seo&rsp=5&oq=seo&f=1&rsv_ers=xn1&rs_src=0">北京之家</a>
</th> 北京之家
7 <th><a href="dfasfaD%B8seo&rsp=6&oq=ssfa&rsv_ers=xn1&rs_src=0">好多美女</a>
</th> 好多美女
8 <th><a href="dfs2%DB%CC%B3&rsp=sdfa=1&rsv_ers=xn1&rs_src=0">湘 菜款</a>
</th> 湘 菜款
9 <th><a href="sdf%C5%E0%D1%B5&rsp=8&oqdsfa1&rsv_ers=xn1&rs_src=0">在线资料</a>
</th> 在线资料
10 <th><a href="dsfA%AF%BC%D2%D7%AFseo&rspfsdfeo&f=1&rsv_ers=xn1&rs_src=0">五笔之家</a>
</th> 五笔之家
*/
能给讲解一下吗?多谢.
正则基础之——环视