在以下“原文本”中提取夹在“起始代码”和“终止代码”之间的所有html代码
谢谢 * * * 起始代码: * * *
<TD stylt="padding-bottom:10px;"><table width="99%" border="0" align="center" cellpadding="0" cellspacing="0"> * * * 终止代码: * * *
<td width="57"><a href="/ent/lscf/index.shtml">滚动新闻</a></td>
<td width="12">|</td>
<td width="57"><a href="/ent/mx/index.shtml">明 星</a> </td> * * * 原文本: * * *<TABLE width=1004 border=0 cellPadding=0 cellSpacing=0 bgcolor="#FFFFFF" class="menuborder">
<TBODY>
<TR>
<TD stylt="padding-bottom:10px;"><table width="99%" border="0" align="center" cellpadding="0" cellspacing="0">
<tr class="menubg1">
<td height="1"></td>
</tr>
<tr>
<td height="20" valign="bottom" class="menubg7"><table width="99%" border="0" align="center" cellpadding="0" cellspacing="0" class="font01">
<tr class="menubg4">
<td class="texttx001"><script language="javascript" src="http://www.ce.cn/include/ce_menu1.js"></script></td>
</tr>
</table></td>
</tr>
<tr class="menubg2">
<td height="1"></td>
</tr>
<tr>
<td height="20" valign="bottom" class="menubg8"><table width="99%" border="0" align="center" cellpadding="0" cellspacing="0" class="font01">
<tr class="menubg5">
<td class="texttx001"><script language="javascript" src="http://www.ce.cn/include/ce_menu2.js"></script></td>
</tr>
</table></td>
</tr>
<tr class="menubg3">
<td height="1"></td>
</tr>
<tr>
<td height="20" valign="bottom" class="menubg9"><table width="99%" border="0" align="center" cellpadding="0" cellspacing="0" class="font02">
<tr class="menubg6">
<td class="texttx001"><script language="javascript" src="http://www.ce.cn/include/ce_menu3.js"></script></td>
</tr>
</table></td>
</tr>
<tr bgcolor="#FFFFFF">
<td height="1"></td>
</tr>
<tr>
<td height="25"><script language="javascript" src="http://www.ce.cn/include/ce_menu4.js"></script></td>
</tr>
</table> </TD>
</TR>
</TBODY>
</TABLE>
<table width="1004" border="0" cellspacing="0" cellpadding="0" class="font-white">
<tr>
<td width="2" height="26" bgcolor="#333333"></td>
<td width="217" rowspan="2"><a href="http://www.ce.cn/ent/"><img src="/ent/images/ent2006_logo.gif" width="217" height="53" border="0"></a></td>
<td height="26" align="left" valign="middle" background="/ent/images/ent2006_bg001.gif"><table width="100%" height="26" border="0" cellpadding="0" cellspacing="0">
<tr align="center">
<td width="57"><a href="/ent/index.shtml">首 页</a></td>
<td width="12">|</td>
<td width="57"><a href="/ent/lscf/index.shtml">滚动新闻</a></td>
<td width="12">|</td>
<td width="57"><a href="/ent/mx/index.shtml">明 星</a> </td>
<td width="12">|</td>
<td width="57"><a href="/ent/dy/index.shtml">电 影</a></td>
<td width="12">|</td>
<td width="57"><a href="/ent/ds/index.shtml">电 视</a></td>
<td width="12">|</td>
<td width="57"><a href="/ent/yspl/index.shtml">影视评论</a></td>
<td width="12">|</td>
<td width="57"><a href="/ent/yyst/index.shtml">音 乐</a></td>
<td width="12">|</td>
<td width="57"><a href="http://www.ce.cn/ent/ylzk/No12/index.shtml" target="_blank">娱乐周刊</a></td>
</trs_outline>
<td width="12">|</td>
<td width="57"><a href="/ent/ztjx/index.shtml">专 题</a></td>
<td width="12">|</td>
<td width="57"><a href="http://www.chinaonly.cn/pint/">演出票</a></td>
<td width="12"> </td>
<td width="57"> </td>
</tr>
</table></td>
<td width="5" bgcolor="#6d6d6d"></td>
</tr>
<tr>
<td height="26" bgcolor="#333333"></td>
<td height="27" align="left" valign="middle" background="/ent/images/ent2006_bg002.gif"><marquee border="0" align="middle" scrollamount="2" scrolldelay="25" behavior="scroll" width="100%" onMouseMove="this.stop()" onMouseOut="this.start()">
谢谢 * * * 起始代码: * * *
<TD stylt="padding-bottom:10px;"><table width="99%" border="0" align="center" cellpadding="0" cellspacing="0"> * * * 终止代码: * * *
<td width="57"><a href="/ent/lscf/index.shtml">滚动新闻</a></td>
<td width="12">|</td>
<td width="57"><a href="/ent/mx/index.shtml">明 星</a> </td> * * * 原文本: * * *<TABLE width=1004 border=0 cellPadding=0 cellSpacing=0 bgcolor="#FFFFFF" class="menuborder">
<TBODY>
<TR>
<TD stylt="padding-bottom:10px;"><table width="99%" border="0" align="center" cellpadding="0" cellspacing="0">
<tr class="menubg1">
<td height="1"></td>
</tr>
<tr>
<td height="20" valign="bottom" class="menubg7"><table width="99%" border="0" align="center" cellpadding="0" cellspacing="0" class="font01">
<tr class="menubg4">
<td class="texttx001"><script language="javascript" src="http://www.ce.cn/include/ce_menu1.js"></script></td>
</tr>
</table></td>
</tr>
<tr class="menubg2">
<td height="1"></td>
</tr>
<tr>
<td height="20" valign="bottom" class="menubg8"><table width="99%" border="0" align="center" cellpadding="0" cellspacing="0" class="font01">
<tr class="menubg5">
<td class="texttx001"><script language="javascript" src="http://www.ce.cn/include/ce_menu2.js"></script></td>
</tr>
</table></td>
</tr>
<tr class="menubg3">
<td height="1"></td>
</tr>
<tr>
<td height="20" valign="bottom" class="menubg9"><table width="99%" border="0" align="center" cellpadding="0" cellspacing="0" class="font02">
<tr class="menubg6">
<td class="texttx001"><script language="javascript" src="http://www.ce.cn/include/ce_menu3.js"></script></td>
</tr>
</table></td>
</tr>
<tr bgcolor="#FFFFFF">
<td height="1"></td>
</tr>
<tr>
<td height="25"><script language="javascript" src="http://www.ce.cn/include/ce_menu4.js"></script></td>
</tr>
</table> </TD>
</TR>
</TBODY>
</TABLE>
<table width="1004" border="0" cellspacing="0" cellpadding="0" class="font-white">
<tr>
<td width="2" height="26" bgcolor="#333333"></td>
<td width="217" rowspan="2"><a href="http://www.ce.cn/ent/"><img src="/ent/images/ent2006_logo.gif" width="217" height="53" border="0"></a></td>
<td height="26" align="left" valign="middle" background="/ent/images/ent2006_bg001.gif"><table width="100%" height="26" border="0" cellpadding="0" cellspacing="0">
<tr align="center">
<td width="57"><a href="/ent/index.shtml">首 页</a></td>
<td width="12">|</td>
<td width="57"><a href="/ent/lscf/index.shtml">滚动新闻</a></td>
<td width="12">|</td>
<td width="57"><a href="/ent/mx/index.shtml">明 星</a> </td>
<td width="12">|</td>
<td width="57"><a href="/ent/dy/index.shtml">电 影</a></td>
<td width="12">|</td>
<td width="57"><a href="/ent/ds/index.shtml">电 视</a></td>
<td width="12">|</td>
<td width="57"><a href="/ent/yspl/index.shtml">影视评论</a></td>
<td width="12">|</td>
<td width="57"><a href="/ent/yyst/index.shtml">音 乐</a></td>
<td width="12">|</td>
<td width="57"><a href="http://www.ce.cn/ent/ylzk/No12/index.shtml" target="_blank">娱乐周刊</a></td>
</trs_outline>
<td width="12">|</td>
<td width="57"><a href="/ent/ztjx/index.shtml">专 题</a></td>
<td width="12">|</td>
<td width="57"><a href="http://www.chinaonly.cn/pint/">演出票</a></td>
<td width="12"> </td>
<td width="57"> </td>
</tr>
</table></td>
<td width="5" bgcolor="#6d6d6d"></td>
</tr>
<tr>
<td height="26" bgcolor="#333333"></td>
<td height="27" align="left" valign="middle" background="/ent/images/ent2006_bg002.gif"><marquee border="0" align="middle" scrollamount="2" scrolldelay="25" behavior="scroll" width="100%" onMouseMove="this.stop()" onMouseOut="this.start()">
解决方案 »
- 接口访问速度的问题
- 怎么移除Grid里所选中的一行
- 如何实现格式多样的对话框
- 求远程医疗系统建设的相关资料
- 为什么打开.aspx报应用程序出错?
- 用Reflector反编译的.dll文件中的中文全部显示为Ucode编码,请问如何显示出汉字?用程序编码显示也行!急!!
- 求助:在后台编码文件中怎么用不了application存储的变量呀。
- ASP.NET安装问题
- 使用treeview控件,我想给它们的节点图标都换一下,可它默认图标都在上面,怎么去掉默认的。
- 我如何访问web.config?
- 我如何用正则表达式匹配<span>与</span>之间的内容?谢谢!!!
- 继承自ProfileBase的自定义类,无法保存到Membership数据库中?
<TBODY>
<TR>
<TD stylt=""padding-bottom:10px;""><table width=""99%"" border=""0"" align=""center"" cellpadding=""0"" cellspacing=""0"">
<tr class=""menubg1"">
<td height=""1""></td>
</tr>
<tr>
<td height=""20"" valign=""bottom"" class=""menubg7""><table width=""99%"" border=""0"" align=""center"" cellpadding=""0"" cellspacing=""0"" class=""font01"">
<tr class=""menubg4"">
<td class=""texttx001""><script language=""javascript"" src=""http://www.ce.cn/include/ce_menu1.js""></script></td>
</tr>
</table></td>
</tr>
<tr class=""menubg2"">
<td height=""1""></td>
</tr>
<tr>
<td height=""20"" valign=""bottom"" class=""menubg8""><table width=""99%"" border=""0"" align=""center"" cellpadding=""0"" cellspacing=""0"" class=""font01"">
<tr class=""menubg5"">
<td class=""texttx001""><script language=""javascript"" src=""http://www.ce.cn/include/ce_menu2.js""></script></td>
</tr>
</table></td>
</tr>
<tr class=""menubg3"">
<td height=""1""></td>
</tr>
<tr>
<td height=""20"" valign=""bottom"" class=""menubg9""><table width=""99%"" border=""0"" align=""center"" cellpadding=""0"" cellspacing=""0"" class=""font02"">
<tr class=""menubg6"">
<td class=""texttx001""><script language=""javascript"" src=""http://www.ce.cn/include/ce_menu3.js""></script></td>
</tr>
</table></td>
</tr>
<tr bgcolor=""#FFFFFF"">
<td height=""1""></td>
</tr>
<tr>
<td height=""25""><script language=""javascript"" src=""http://www.ce.cn/include/ce_menu4.js""></script></td>
</tr>
</table> </TD>
</TR>
</TBODY>
</TABLE>
<table width=""1004"" border=""0"" cellspacing=""0"" cellpadding=""0"" class=""font-white"">
<tr>
<td width=""2"" height=""26"" bgcolor=""#333333""></td>
<td width=""217"" rowspan=""2""><a href=""http://www.ce.cn/ent/""><img src=""/ent/images/ent2006_logo.gif"" width=""217"" height=""53"" border=""0""></a></td>
<td height=""26"" align=""left"" valign=""middle"" background=""/ent/images/ent2006_bg001.gif""><table width=""100%"" height=""26"" border=""0"" cellpadding=""0"" cellspacing=""0"">
<tr align=""center"">
<td width=""57""><a href=""/ent/index.shtml"">首 页</a></td>
<td width=""12"">|</td>
<td width=""57""><a href=""/ent/lscf/index.shtml"">滚动新闻</a></td>
<td width=""12"">|</td>
<td width=""57""><a href=""/ent/mx/index.shtml"">明 星</a> </td>
<td width=""12"">|</td>
<td width=""57""><a href=""/ent/dy/index.shtml"">电 影</a></td>
<td width=""12"">|</td>
<td width=""57""><a href=""/ent/ds/index.shtml"">电 视</a></td>
<td width=""12"">|</td>
<td width=""57""><a href=""/ent/yspl/index.shtml"">影视评论</a></td>
<td width=""12"">|</td>
<td width=""57""><a href=""/ent/yyst/index.shtml"">音 乐</a></td>
<td width=""12"">|</td>
<td width=""57""><a href=""http://www.ce.cn/ent/ylzk/No12/index.shtml"" target=""_blank"">娱乐周刊</a></td>
</trs_outline>
<td width=""12"">|</td>
<td width=""57""><a href=""/ent/ztjx/index.shtml"">专 题</a></td>
<td width=""12"">|</td>
<td width=""57""><a href=""http://www.chinaonly.cn/pint/"">演出票</a></td>
<td width=""12""> </td>
<td width=""57""> </td>
</tr>
</table></td>
<td width=""5"" bgcolor=""#6d6d6d""></td>
</tr>
<tr>
<td height=""26"" bgcolor=""#333333""></td>
<td height=""27"" align=""left"" valign=""middle"" background=""/ent/images/ent2006_bg002.gif""><marquee border=""0"" align=""middle"" scrollamount=""2"" scrolldelay=""25"" behavior=""scroll"" width=""100%"" onMouseMove=""this.stop()"" onMouseOut=""this.start()"">
";
string sStart = @"<TD stylt=""padding-bottom:10px;""><table width=""99%"" border=""0"" align=""center"" cellpadding=""0"" cellspacing=""0"">";
string sEnd = @"<td width=""57""><a href=""/ent/lscf/index.shtml"">滚动新闻</a></td>
<td width=""12"">|</td>
<td width=""57""><a href=""/ent/mx/index.shtml"">明 星</a> </td>"; str = System.Text.RegularExpressions.Regex.Replace(str,@"[\s\S]*?"+sStart+@"([\s\S]*)?"+sEnd+@"[\s\S]*?$","$1"); Response.Write(str);
------说错了莫怪。
可我这个问题 你能帮我吗? 帮我一下 我立即给你发代码http://community.csdn.net/Expert/TopicView1.asp?id=5253088
<td height="1"></td>
</tr>
<tr>
结束是:
<td width="57"><a href="/ent/index.shtml">首 页</a></td>
<td width="12">|</td>
------说错了莫怪。
*********************
你那个问题我可以解决,不过比较烦一些
其实了比较好解决,难的在于分页的重写参见:http://www.valca.cn
我是用VB:
'sContent是原文本
Dim sTr1, sTr2 As String
sTr1 = Trim(TextBox1.Text)
sTr2 = Trim(TextBox2.Text)
If sTr1 <> "" Then
Dim sMatch As Match = Regex.Match(sContent, "^" + sTr1 + "([\s\S]*)" + sTr2 + "$", RegexOptions.Multiline)
strOut.Text = sMatch.Groups(0).Value
strOut.Text += sMatch.Groups(1).Value
End If
------说错了莫怪。
Dim sMatch As Match = Regex.Match(sContent, "^" + sTr1 + "([\s\S]*)" + sTr2 + "$", RegexOptions.Multiline)===》Dim sMatch As Match = Regex.Match(sContent, @"[\s\S]*?"+sTr1 +@"([\s\S]*)?"+sTr2 +@"[\s\S]*?$",RegexOptions.Multiline)
------说错了莫怪。
Dim sTr1, sTr2 As String
sTr1 = Trim(TextBox1.Text)
sTr2 = Trim(TextBox2.Text)
If sTr1 <> "" Then
Dim sMatch As Match = Regex.Match(sContent, sTr1 + "([\s\S]*)" + sTr2)
strOut.Text = sMatch.Groups(0).Value
strOut.Text += sMatch.Groups(1).Value
End If以上代码不行
Dim sTr1, sTr2 As String
sTr1 = Trim(TextBox1.Text)
sTr2 = Trim(TextBox2.Text)
sContent = Trim(TextBox3.Text)
If sTr1 <> "" Then
Dim sMatch As Match = Regex.Match(sContent, "[\s\S]*?" + sTr1 + "([\s\S]*)?" + sTr2 + "[\s\S]*?$", RegexOptions.Multiline)
strOut.Text = sMatch.Groups(0).Value
strOut.Text += sMatch.Groups(1).Value
End If******************
VB不用@,以上代码捕得的还是空的
------说错了莫怪。
可能就是str1和str2的问题了输入的内容要和原文一模一样,多一个空格或回车都不行,你可先从原文中copy一段,注
------说错了莫怪。
**********************
我确实是copy的,这点常识还是有的
你那里测试可以提取吗?
原文本是用WebRequest取的,使用的是默认编码,这个没问题吧
<TD stylt="padding-bottom:10px;"><table width="99%" border="0" align="center" cellpadding="0" cellspacing="0"> * * * 终止代码: * * *
用这个代码即可
嘎嘎public string caijiclass(int q1,int q2,string kai,string guan,string zong) //数据采集类2
{
string urlxxx;
try
{
q1 = zong.IndexOf(kai, q2);
q1 += kai.Length;
q2 = zong.IndexOf(guan, q1);
urlxxx=zong.Substring(q1, q2 - q1);
}
catch
{
urlxxx="无法读取" ;
}
return urlxxx;
}string ddd=caijiclass(0,0,开始字符,关闭字符,总字符);