Here is the data:
<p class="zdct7">汗 <span class="diczx4">拼音:</span><span class="dicpy">gān <script language="JavaScript" type="text/javascript">spf("gan1");</script>  hán <script language="JavaScript" type="text/javascript">spf("han2");</script>  hàn <script language="JavaScript" type="text/javascript">spf("han4");</script>  </span>
I want to extract: gān
hán
hàn

Solutions »

  1.   

    string str="<p class=\"zdct7\">汗 <span class=\"diczx4\">拼音:</span><span class=\"dicpy\">gān <script language=\"JavaScript\" type=\"text/javascript\">spf(\"gan1\");</script>  hán <script language=\"JavaScript\" type=\"text/javascript\">spf(\"han2\");</script>  hàn <script language=\"JavaScript\" type=\"text/javascript\">spf(\"han4\");</script>  </span>";
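    // Requires: using System; using System.Text.RegularExpressions;
    // Capture each text run between '>' and '<' that contains no CJK ideographs
    // (\u4e00-\u9fa5) and no parentheses; this skips the labels and the spf(...) calls.
    MatchCollection mc = Regex.Matches(str, @">([^<\u4e00-\u9fa5\(\)]+)<");
    foreach (Match m in mc)
    {
        string value = m.Groups[1].Value.Trim();
        if (value.Length != 0)   // skip whitespace-only runs
        {
            Console.WriteLine(value);
        }
    }
    /*
    Output:
    gān
    hán
    hàn
    */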
      

  2.   

    To the poster in reply #2: what if the data I need to match is embedded in a complete HTML file? How should I handle that?
    For example:
    ......
    ......
      <tr>
    <td class="zuib100"><p class="U6C57">&#27735;</p></td>
    <td>
    <p class="zdct7">汗 <span class="diczx4">拼音:</span><span class="dicpy">gān <script language="JavaScript" type="text/javascript">spf("gan1");</script>  hán <script language="JavaScript" type="text/javascript">spf("han2");</script>  hàn <script language="JavaScript" type="text/javascript">spf("han4");</script>  </span><p><span class="diczx4">部首:</span>氵,<span class="diczx4">部外笔画:</span>3,<span class="diczx4">总笔画:</span>6 <font size=3 color=#D1B7A7>;</font> <span class="diczx4">繁体部首:</span>水,<span class="diczx4">部外笔画:</span>3,<span class="diczx4">总笔画:</span>7</p>
    </td>
      </tr>
    ......
    ......
    ......
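
One possible approach for the whole-page case, offered as a sketch rather than a definitive answer: first isolate the `<span class="dicpy">` block, then remove the `<script>` elements inside it and split what is left on whitespace. The class name `PinyinExtractor` and the method `ExtractPinyin` are hypothetical names introduced for this example, and the sample page is shortened from the one posted above. The sketch assumes the attribute is written exactly as `class="dicpy"` and that no other `<span>` is nested inside that block.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

public static class PinyinExtractor
{
    // Hypothetical helper (not from the thread): returns the readings found
    // inside every <span class="dicpy">...</span> block of an HTML page.
    public static List<string> ExtractPinyin(string html)
    {
        var results = new List<string>();

        // Step 1: isolate each dicpy span. The lazy (.*?) stops at the first
        // </span>, which assumes no <span> is nested inside the block.
        // RegexOptions.Singleline lets '.' match across line breaks.
        foreach (Match span in Regex.Matches(
            html, @"<span\s+class=""dicpy"">(.*?)</span>", RegexOptions.Singleline))
        {
            // Step 2: drop each <script>...</script> element with its content.
            string text = Regex.Replace(
                span.Groups[1].Value, @"<script\b[^>]*>.*?</script>", " ",
                RegexOptions.Singleline | RegexOptions.IgnoreCase);

            // Step 3: the remainder is whitespace-separated readings.
            // A null separator array makes Split use whitespace.
            results.AddRange(
                text.Split((char[])null, StringSplitOptions.RemoveEmptyEntries));
        }
        return results;
    }

    public static void Main()
    {
        // Shortened sample page; the surrounding rows are omitted.
        string html = @"<tr>
<td class=""zuib100""><p class=""U6C57"">&#27735;</p></td>
<td>
<p class=""zdct7"">汗 <span class=""diczx4"">拼音:</span><span class=""dicpy"">gān <script language=""JavaScript"" type=""text/javascript"">spf(""gan1"");</script>  hán <script language=""JavaScript"" type=""text/javascript"">spf(""han2"");</script>  hàn <script language=""JavaScript"" type=""text/javascript"">spf(""han4"");</script>  </span><p><span class=""diczx4"">部首:</span>氵</p>
</td>
</tr>";

        // For this sample the list holds three entries: gān, hán, hàn.
        foreach (string reading in ExtractPinyin(html))
        {
            Console.WriteLine(reading);
        }
    }
}
```

Anchoring on the `dicpy` class keeps the rest of the page out of the match, so the character-class trick from reply #1 is no longer needed. If the real pages vary in attribute order or quoting, an HTML parser is likely to be more robust than a regular expression.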