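I want to use a regular expression on http://www.microbell.com/docdetail_381785.html to extract the report author (the value after 研究报告作者),
i.e. 陈梦根,蔡笑. Only the author is needed. An earlier thread covers this: http://topic.csdn.net/u/20100910/16/eca75610-5cef-444c-b7f4-c843583eeae2.html
With the pattern from that thread, extraction works when the input is only the short fragment, but nothing is matched when the input is the full page source.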

Solutions »

  1.   


        <div class="index_main">
         <div class="indexSidebarLeft">
           <div class="class_listdt" style="_height:1610px;" >
             <div class="Breadcrumbs" style="text-align:left">位置:<a href="index.html" target="_blank">首 页</a> &gt; <a href="microns_4_1.html" target="_blank">宏观策略</a></div>
             <div class="new_zhengjing">
                     
         <table width="97%" border="0" cellpadding="0" cellspacing="0">
                       <tr>
                         <td width="7%"></td>
                         <td width="93%" rowspan="2" style="color:#F00000; font-size:15px;"><strong>研究报告:宏源证券-8月经济形势述评:经济回落态势有所放缓-100913</strong></td>
                </tr>
                       <tr>
                         <td align="center"><img src="images/bgicon11.gif" width="16" height="16" /></td>
                </tr>
                     </table>
             </div>
             <div style=" clear:both"></div>
             <div class="baogaonews"><table width="649" height="83" border="0" align="center" cellpadding="0" cellspacing="0" class="tablebg" style=" float:right; margin-top:5px;" >
    <tr>
                         <td width="75" height="20" ><span class=" STYLE4">股票名称:</span></td>
                <td width="214"><span class=" STYLE4"><b></b></span></td>
                <td width="105"><span class=" STYLE4">研究报告作者:</span></td>
    <td width="255"><span class=" STYLE4"><b>陈梦根,蔡笑</b></span></td>
               </tr>
                    <tr>
                         <td height="15"><span class=" STYLE4">股票代码:</span></td>
                          <td><span class=" STYLE4"><b></b></span></td>
                          <td><span class=" STYLE4">研究报告出处:</span></td>
                          <td><span class=" STYLE4"><b>宏源证券</b></span></td>
                </tr>
                    <tr>
                         <td height="18"><span class=" STYLE4">推荐评级:</span></td>
                          <td><span class=" STYLE4"><b>无</b></span></td>
                          <td><span class=" STYLE4">研究报告栏目:</span></td>
                          <td><span class=" STYLE4"><b>宏观策略</b></span></td>
                </tr>
                    <tr>
                      <td colspan="3" class="STYLE4">研究报告上传时间:
                        <b>2010-9-13 17:08:57</b></td>
                      <td>&nbsp;</td>
            </tr>
                   </table>
             </div>
               <div style=" clear:both"></div>
      

  2.   

    A regex should be able to do this. See this thread: http://topic.csdn.net/u/20100910/16/eca75610-5cef-444c-b7f4-c843583eeae2.html
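
One likely reason a pattern works on the short fragment but fails on the full page is that the full source has line breaks and indentation between the label cell and the value cell, as in the markup quoted in answer 1. The following is a minimal sketch in Python (the thread does not say which language the asker uses, so that choice is an assumption). It anchors on the 研究报告作者 label and allows arbitrary whitespace between tags. The names `AUTHOR_RE` and `extract_author` are hypothetical, and the sample string is a trimmed copy of the quoted markup, not the live page.

```python
import re

# Trimmed excerpt of the markup quoted in answer 1 (not fetched from the site).
html = """
<tr>
    <td width="105"><span class=" STYLE4">研究报告作者:</span></td>
    <td width="255"><span class=" STYLE4"><b>陈梦根,蔡笑</b></span></td>
</tr>
<tr>
    <td><span class=" STYLE4">研究报告出处:</span></td>
    <td><span class=" STYLE4"><b>宏源证券</b></span></td>
</tr>
"""

# Anchor on the label, then skip whitespace and tags up to the <b> in the next cell.
# [::] accepts either an ASCII or a full-width colon after the label.
AUTHOR_RE = re.compile(
    r"研究报告作者[::]\s*</span>\s*</td>"   # end of the label cell
    r"\s*<td[^>]*>\s*<span[^>]*>\s*<b>"      # opening tags of the value cell
    r"(.*?)"                                 # the author text (non-greedy)
    r"</b>",
    re.DOTALL,                               # let . span newlines if the value wraps
)


def extract_author(page):
    """Return the author string, or None if the label is not found."""
    match = AUTHOR_RE.search(page)
    return match.group(1).strip() if match else None


print(extract_author(html))
```

If this still returns nothing on the downloaded page, it may be worth checking that the page was decoded with the correct character encoding before matching, since the Chinese label will not match against wrongly decoded text.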