如何提取width="3%" height="28" class="border">1.</td>中的"1",
如何提取(<a href="http://mp3.baidu.com/m?tn=baidump3&ct=134217728&lm=_1&li=500&word=%CA%C4%D1%D4" target=_blank>誓言</a>)的"誓言",会用正则表达式或能解决问题的请来
如何提取(<a href="http://mp3.baidu.com/m?tn=baidump3&ct=134217728&lm=_1&li=500&word=%CA%C4%D1%D4" target=_blank>誓言</a>)的"誓言",会用正则表达式或能解决问题的请来
Regex regex=new Regex(@"^.+>(.+)<.+$");
Match match=regex.Match(s);
string result=match.Group[1].value;
按你的写结果什么都没有啊
按你的写结果什么都没有啊
====================================
string s = "(<a href=\"http://mp3.baidu.com/m?tn=baidump3&ct=134217728&lm=_1&li=500&word=%CA%C4%D1%D4\" target=_blank>誓言</a>)";
Regex regex=new Regex(@"^.+>(.+)<.+$");
Match match=regex.Match(s);
string result=match.Groups[1].Value;
MessageBox.Show(result);不可能
恩,是能的,是我刚才没搞好的问题,那第一那个"1"怎么提取啊?
只是把s改为"width=\"3%\" height=\"28\" class=\"border\">1.</td>";只是这样取出来会是“1.”,多一个“.”号,自己再把它去了
============================================
那你就直接用截取所得的字符串,
不知道你要做什么...
private void button1_Click_1(object sender, System.EventArgs e)
{
string Url = "http://list.mp3.baidu.com/topso/mp3topsong.html?id=1#top2"; //得到指定Url的源码
string strResult="";
try
{
HttpWebRequest request = (HttpWebRequest)WebRequest.Create(Url);//声明一个HttpWebRequest请求
request.Timeout = 10000;//设置连接超时时间
request.Headers.Set("Pragma", "no-cache");
HttpWebResponse response = (HttpWebResponse)request.GetResponse();
Stream streamReceive = response.GetResponseStream();
Encoding encoding = Encoding.GetEncoding("GB2312");
StreamReader streamReader = new StreamReader(streamReceive,encoding);
strResult = streamReader.ReadToEnd();
}
catch
{
MessageBox.Show("出错");
}
int iBodyStart = strResult.IndexOf("<body", 0);
int start = strResult.IndexOf("歌曲TOP500", iBodyStart);
int iStart = strResult.IndexOf("<table", start);
int iEnd = strResult.IndexOf("</table>", iStart);
string strWeb = strResult.Substring(iStart, iEnd-iStart+8).Trim();
string zhuanxian = strWeb.Replace("-","_");
string zhuan = zhuanxian.Replace("<td","-");
string[] fen = zhuan.Split('-');;
string a1=fen[1];
Regex regex=new Regex(@"^.+>(.+)<.+$");
Match match=regex.Match(a1);
string result=match.Groups[1].Value;
richTextBox1.Text=result; }
string zhuanxian = strWeb.Replace("-","_");
string zhuan = zhuanxian.Replace("<td","-");
string[] fen = zhuan.Split('-');;
string a1=fen[1];
都删了,直接用Match match=regex.Match(strWeb);
str=str.Split('<')[1].Split('>')[0];
不是好了?何必这么麻烦?
就一个网页 在javascript 你可以参考参考
汗打反了……