数据是这个:
<p class="zdct7">汗 <span class="diczx4">拼音:</span><span class="dicpy">gān <script language="JavaScript" type="text/javascript">spf("gan1");</script> hán <script language="JavaScript" type="text/javascript">spf("han2");</script> hàn <script language="JavaScript" type="text/javascript">spf("han4");</script> </span>
要匹配出:gān
hán
hàn
<p class="zdct7">汗 <span class="diczx4">拼音:</span><span class="dicpy">gān <script language="JavaScript" type="text/javascript">spf("gan1");</script> hán <script language="JavaScript" type="text/javascript">spf("han2");</script> hàn <script language="JavaScript" type="text/javascript">spf("han4");</script> </span>
要匹配出:gān
hán
hàn
MatchCollection mc = Regex.Matches(str, @">([^<\u4e00-\u9fa5\(\)]+)<");
foreach (Match m in mc)
{
if (m.Groups[1].Value.Trim().Length != 0)
{
Console.WriteLine(m.Groups[1].Value.Trim());
}
}
/*
gān
hán
hàn
请按任意键继续. . .
*/
就像这样......
......
......
<tr>
<td class="zuib100"><p class="U6C57">汗</p></td>
<td>
<p class="zdct7">汗 <span class="diczx4">拼音:</span><span class="dicpy">gān <script language="JavaScript" type="text/javascript">spf("gan1");</script> hán <script language="JavaScript" type="text/javascript">spf("han2");</script> hàn <script language="JavaScript" type="text/javascript">spf("han4");</script> </span><p><span class="diczx4">部首:</span>氵,<span class="diczx4">部外笔画:</span>3,<span class="diczx4">总笔画:</span>6 <font size=3 color=#D1B7A7>;</font> <span class="diczx4">繁体部首:</span>水,<span class="diczx4">部外笔画:</span>3,<span class="diczx4">总笔画:</span>7</p>
</td>
</tr>
......
......
......