java 正则表达式 Matcher matcher = Pattern.compile("(?<=href=\").*?(?=\")").matcher("<a href=\"www.123.com\"/>");if (matcher.find())System.out.println(matcher.group()); 解决方案 » 免费领取超大流量手机卡,每月29元包185G流量+100分钟通话, 中国电信官方发货 我 要匹配 的是 一个 网页 上 的一些 数据再 c# 中 能找到所有的但是在java 中 只 找到 一个 这是我的代码String reg = "(?<=<span class=\"color_red price\") id=\"[\\w\\d]*\" productId=\"[\\d]*\" yhdPrice=\"[\\d]*\" productUnit=\"\" diaperNum=\"[\\d]*\" >[\\s]*?(?<price>¥[\\d]*)[\\s]*?</span>[\\s]*?<span id=\"[\\w\\d]*\" class=\"[\\w]*\"></span>[\\s]*?<del id=\"[\\w\\d]*\"></del>[\\s]*?</div>[\\s]*?<p class=\"title\">[\\s]*?<a class=\"title\" id=\"[\\w\\d]*\" pmId=\"[\\d]*\" href=\"http://item\\.yhd\\.com/item/[\\d]*\" target=\"[\\w]*\" title=\"[\\s\\S]*?\" onClick=\"[\\w]*\\(\'[\\d]+\',\'[\\w\\d]*\'\\);\"[\\s]*?singleFreeFlag=\"[\\d]*\">[\\s]*?(?<name>[\\s\\S]*?)[\\s]*?</a>[\\s]*?</p>[\\s]*?<div class=\"item_act clearfix\">[\\s]*?<div class=\"shopping_act fl\" id=\"[\\w\\d]*\" isnumBuy=\"[\\d]+\">[\\s]*?<a class=\"buy_btn\" id=\"[\\w\\d]*\" buyModule=\"[\\d]+\" index=\"[\\d]+\" specialType=\"[\\d]+\" productCode=\"[\\d]+\" sellProductId=\"[\\d]+\" merchantId=\"[\\d]+\"[\\s]*?inshop=\"[\\d]+\" pminfo=\"[\\d]+\" shoppingcount=\"[\\d]+\" href=\"http://item\\.yhd\\.com/item/[\\d]+\"[\\s]*?onclick=\"[\\w]*\\(\'[\\d]+\',\'[\\w\\d]*\'\\);\" ispromotion=\"[\\d]+\">[\\s\\S]*?</a>[\\s]*?</div>[\\s\\S]*?</div>[\\s\\S]*?<p class=\"comment\" id=\"[\\w\\d]*\" experienceCount=\"[\\d]+\" positiveRate=\"[\\d]+\" totalNum=\"[\\d]+\">[\\s]*?<i></i>[\\s]*?<a id=\"[\\w\\d]*\" pmId=\"[\\d]+\" href=\"http://item\\.yhd\\.com/item/[\\d]+\" target=\"_blank\" onClick=\"addTrackPositionToCookie\\(\'[\\d]+\',\'[\\w\\d]*\'\\);\">[\\s]*?(?<count>[\\d]+)</a>[\\s]*?(?=<span class=\"positiveRatio\" title=\"[\\s\\S]*\"><i></i>(?<good>[\\d]+%)</span>[\\s]*?</p>)";Pattern p = Pattern.compile(reg); Matcher m = p.matcher(html); while(m.find()){ System.out.println(m.group("name")); System.out.println(m.groupCount()); }分组 的值 是 4 参考 Matcher matcher = Pattern.compile("<a href=\"(.*?)\" title=\"(.*?)\".*?>(.*?)</a").matcher( "<a href=\"www.aaa.com\" title=\"ta\">aaaa</a><a href=aaaa/><a href=\"www.bbb.com\" title=\"tb\">ccc</a><a href=\"www.ccc.com\" title=\"tc\">bbb</a>"); while (matcher.find()) System.out.println(matcher.group(1)+" "+matcher.group(2)+" "+matcher.group(3)); 有 QQ 吗 高手 我QQ 397870376测试代码 太长了 <li class="search_item" id="producteg_16081" data-tcd="5.909759" data-tcs="3.0" ><!-- s 电器类标题hover3行显示,添加class:electrical_item_box--><div class="search_item_box" comproid="16081" id="itemSearchResultCon_16081" combine_type="1"productUnit="" diaperNum="500"><input id="shop_16081" type="hidden" value="0" /><input id="serise_16081" type="hidden" value="0" /><input id="shop_pm_909759" type="hidden" value="0" /><a class="search_prod_img" defaultFlag="1" id="pdlink1_16081" parentId=0 pmId="909759" isSnapProduct="0" isOverSea="0" isOTCorRX="0" isReserve="0" isGrouponProv="0" grouponId="0" grouponType="0" href="http://item.yhd.com/item/909759" target="_blank" onClick="addTrackPositionToCookie('1','pro_0');" isOne2more="0"><img width="200" height="200" src="http://d7.yihaodianimg.com/N02/M01/D5/05/CgQCsFNgrQiAdYt4AAM09ZWqxfs02301_200*200.jpg" alt="正品五粮液 浓香型 52度 500ml"/><span class="tag_video none"></span></a><div class="item_promotion_text" id="promostyle_909759" title="" promotionTag="0"><div class="tip" id="promotion_909759" title="正品保证,假一赔十。" pnameSubtitle="正品保证,假一赔十。" pstock="" stockFlag="" ruleType="0" target="_blank" isFirst="0" isD50="0" isPhoneCategory="0">正品保证,假一赔十。</div></div><div class="pricebox clearfix"><span class="color_red price" id="price0_16081" productId="16081" yhdPrice="569" productUnit="" diaperNum="500" >¥569</span><span id="unit_price_16081" class="unit_price"></span><del id="listprice0_16081"></del></div><p class="title"><a class="title" id="pdlink2_16081" pmId="909759" href="http://item.yhd.com/item/909759" target="_blank" title="正品五粮液 浓香型 52度 500ml 正品保证,假一赔十。" onClick="addTrackPositionToCookie('1','pro_0');" singleFreeFlag="0">正品五粮液 浓香型 52度 500ml 正品保证,假一赔十。</a></p><div class="item_act clearfix"><div class="shopping_act fl" id="shopping_act_16081" isnumBuy="0"><a class="buy_btn" id="buyButton_16081" buyModule="0" index="0" specialType="0" productCode="0000160816" sellProductId="16081" merchantId="2" inshop="0" pminfo="909759" shoppingcount="1" href="http://item.yhd.com/item/909759"onclick="addTrackPositionToCookie('1','pro_0');" ispromotion="0">查看详情</a></div></div><p class="comment" id="comment_16081" experienceCount="4379" positiveRate="94" totalNum="0"><i></i><a id="pdlinkcomment_16081" pmId="909759" href="http://item.yhd.com/item/909759" target="_blank" onClick="addTrackPositionToCookie('1','search_exp_0');">4379</a><span class="positiveRatio" title="好评率94%"><i></i>94%</span></p><div class="owner clearfix"><span id = "merchant_16081" class="o_1">1号店自营</span></div><div class="item_status clearfix"></div><u class="bg_border"></u></div></li><li class="search_item" id="producteg_12540334" data-tcd="5.39365062" data-tcs="3.0" ><!-- s 电器类标题hover3行显示,添加class:electrical_item_box--><div class="search_item_box" comproid="12540334" id="itemSearchResultCon_12540334" combine_type="1"productUnit="" diaperNum="0"><input id="shop_12540334" type="hidden" value="0" /><input id="serise_12540334" type="hidden" value="0" /><input id="shop_pm_39365062" type="hidden" value="0" /><a class="search_prod_img" defaultFlag="1" id="pdlink1_12540334" parentId=0 pmId="39365062" isSnapProduct="0" isOverSea="0" isOTCorRX="0" isReserve="0" isGrouponProv="0" grouponId="0" grouponType="0" href="http://item.yhd.com/item/39365062" target="_blank" onClick="addTrackPositionToCookie('1','pro_1');" isOne2more="0"><img width="200" height="200" src="http://d9.yihaodianimg.com/N02/M07/B6/BC/CgQCsVIpqVeASU0fAALBHIb-Nnw65500_200*200.jpg" alt="五粮液 浓香型52度 500ml*6瓶/整箱"/><span class="tag_video none"></span></a><div class="item_promotion_text" id="promostyle_39365062" title="" promotionTag="0"><div class="tip" id="promotion_39365062" title="自营正品保证 假一赔十" pnameSubtitle="自营正品保证 假一赔十" pstock="" stockFlag="" ruleType="0" target="_blank" isFirst="0" isD50="0" isPhoneCategory="0">自营正品保证 假一赔十</div></div><div class="pricebox clearfix"><span class="color_red price" id="price0_12540334" productId="12540334" yhdPrice="3339" productUnit="" diaperNum="0" >¥3339</span><span id="unit_price_12540334" class="unit_price"></span><del id="listprice0_12540334"></del></div><p class="title"><a class="title" id="pdlink2_12540334" pmId="39365062" href="http://item.yhd.com/item/39365062" target="_blank" title="五粮液 浓香型52度 500ml*6瓶/整箱 自营正品保证 假一赔十" onClick="addTrackPositionToCookie('1','pro_1');" singleFreeFlag="0">五粮液 浓香型52度 500ml*6瓶/整箱 自营正品保证 假一赔十</a></p><div class="item_act clearfix"><div class="shopping_act fl" id="shopping_act_12540334" isnumBuy="0"><a class="buy_btn" id="buyButton_12540334" buyModule="0" index="1" specialType="0" productCode="0125403341" sellProductId="12540334" merchantId="2" inshop="0" pminfo="39365062" shoppingcount="1" href="http://item.yhd.com/item/39365062"onclick="addTrackPositionToCookie('1','pro_1');" ispromotion="0">查看详情</a></div></div><p class="comment" id="comment_12540334" experienceCount="69" positiveRate="100" totalNum="0"><i></i><a id="pdlinkcomment_12540334" pmId="39365062" href="http://item.yhd.com/item/39365062" target="_blank" onClick="addTrackPositionToCookie('1','search_exp_1');">69</a><span class="positiveRatio" title="好评率100%"><i></i>100%</span></p><div class="owner clearfix"><span id = "merchant_12540334" class="o_1">1号店自营</span></div><div class="item_status clearfix"></div><u class="bg_border"></u></div></li>匹配 价格 名称 好评率 销售量 名称 匹配 <a class="title" id="pdlink2_16081" pmId="909759" href="http://item.yhd.com/item/909759" target="_blank" title="正品五粮液 浓香型 52度 500ml 正品保证,假一赔十。" onClick="addTrackPositionToCookie('1','pro_0');" singleFreeFlag="0">正品五粮液 浓香型 52度 500ml 正品保证,假一赔十。</a>要 正品五粮液 浓香型 52度 500ml 正品保证,假一赔十。价格 要¥569销售量 <a id="pdlinkcomment_12540334" pmId="39365062" href="http://item.yhd.com/item/39365062" target="_blank" onClick="addTrackPositionToCookie('1','search_exp_1');">69</a>69好评率 <span class="positiveRatio" title="好评率100%"><i></i>100%</span> 100% Matcher matcher = Pattern .compile( "<li[^>]+?search_item[\\s\\S]*?yhdprice=\"(.*?)\"[\\s\\S]+?diapernum=\"(.*?)\"[\\s\\S]+?title=\"(.*?)\"[\\s\\S]+?pdlinkcommen.+?>\\s*(\\d*)\\s*<[\\s\\S]*?好评率([\\d\\.%]*)[\\s\\S]+?(?=bg_border)",Pattern.CASE_INSENSITIVE) .matcher(html); while (matcher.find()){//System.out.println(matcher.group()); System.out.println(matcher.group(1) + " " + matcher.group(2) + " " + matcher.group(3) + " " + matcher.group(4) + " " + matcher.group(5));} 怎么判断返回值isEmpty() 有用过点聚的Weboffice进行word合并的大侠吗 spring的依赖与注入通过IOC实现 hibernate+struts的一个问题 求用NetBeans工具和JSF框架的一个小程序 jsp生成excel用GB2312数据显不完,用GBK能显示完,什么原因? JAVA的好处到底在哪?? java 生成xml问题,急 FLASH加载的BTF文件怎么反编译! 帮个忙:在servlet的destroy()里面关闭Connection是否可行?? JQPrint打印表单内容 关于hibernate自动创建表的问题
再 c# 中 能找到所有的
但是在java 中 只 找到 一个
这是我的代码String reg = "(?<=<span class=\"color_red price\") id=\"[\\w\\d]*\" productId=\"[\\d]*\" yhdPrice=\"[\\d]*\" productUnit=\"\" diaperNum=\"[\\d]*\" >[\\s]*?(?<price>¥[\\d]*)[\\s]*?</span>[\\s]*?<span id=\"[\\w\\d]*\" class=\"[\\w]*\"></span>[\\s]*?<del id=\"[\\w\\d]*\"></del>[\\s]*?</div>[\\s]*?<p class=\"title\">[\\s]*?<a class=\"title\" id=\"[\\w\\d]*\" pmId=\"[\\d]*\" href=\"http://item\\.yhd\\.com/item/[\\d]*\" target=\"[\\w]*\" title=\"[\\s\\S]*?\" onClick=\"[\\w]*\\(\'[\\d]+\',\'[\\w\\d]*\'\\);\"[\\s]*?singleFreeFlag=\"[\\d]*\">[\\s]*?(?<name>[\\s\\S]*?)[\\s]*?</a>[\\s]*?</p>[\\s]*?<div class=\"item_act clearfix\">[\\s]*?<div class=\"shopping_act fl\" id=\"[\\w\\d]*\" isnumBuy=\"[\\d]+\">[\\s]*?<a class=\"buy_btn\" id=\"[\\w\\d]*\" buyModule=\"[\\d]+\" index=\"[\\d]+\" specialType=\"[\\d]+\" productCode=\"[\\d]+\" sellProductId=\"[\\d]+\" merchantId=\"[\\d]+\"[\\s]*?inshop=\"[\\d]+\" pminfo=\"[\\d]+\" shoppingcount=\"[\\d]+\" href=\"http://item\\.yhd\\.com/item/[\\d]+\"[\\s]*?onclick=\"[\\w]*\\(\'[\\d]+\',\'[\\w\\d]*\'\\);\" ispromotion=\"[\\d]+\">[\\s\\S]*?</a>[\\s]*?</div>[\\s\\S]*?</div>[\\s\\S]*?<p class=\"comment\" id=\"[\\w\\d]*\" experienceCount=\"[\\d]+\" positiveRate=\"[\\d]+\" totalNum=\"[\\d]+\">[\\s]*?<i></i>[\\s]*?<a id=\"[\\w\\d]*\" pmId=\"[\\d]+\" href=\"http://item\\.yhd\\.com/item/[\\d]+\" target=\"_blank\" onClick=\"addTrackPositionToCookie\\(\'[\\d]+\',\'[\\w\\d]*\'\\);\">[\\s]*?(?<count>[\\d]+)</a>[\\s]*?(?=<span class=\"positiveRatio\" title=\"[\\s\\S]*\"><i></i>(?<good>[\\d]+%)</span>[\\s]*?</p>)";Pattern p = Pattern.compile(reg);
Matcher m = p.matcher(html);
while(m.find()){
System.out.println(m.group("name"));
System.out.println(m.groupCount());
}
分组 的值 是 4
"<a href=\"www.aaa.com\" title=\"ta\">aaaa</a><a href=aaaa/><a href=\"www.bbb.com\" title=\"tb\">ccc</a><a href=\"www.ccc.com\" title=\"tc\">bbb</a>");
while (matcher.find())
System.out.println(matcher.group(1)+" "+matcher.group(2)+" "+matcher.group(3));
我QQ 397870376
测试代码 太长了
<li class="search_item" id="producteg_16081" data-tcd="5.909759" data-tcs="3.0" >
<!-- s 电器类标题hover3行显示,添加class:electrical_item_box-->
<div class="search_item_box" comproid="16081" id="itemSearchResultCon_16081" combine_type="1"
productUnit="" diaperNum="500">
<input id="shop_16081" type="hidden" value="0" />
<input id="serise_16081" type="hidden" value="0" />
<input id="shop_pm_909759" type="hidden" value="0" />
<a class="search_prod_img" defaultFlag="1" id="pdlink1_16081" parentId=0 pmId="909759" isSnapProduct="0" isOverSea="0" isOTCorRX="0" isReserve="0" isGrouponProv="0" grouponId="0" grouponType="0" href="http://item.yhd.com/item/909759" target="_blank" onClick="addTrackPositionToCookie('1','pro_0');" isOne2more="0">
<img width="200" height="200" src="http://d7.yihaodianimg.com/N02/M01/D5/05/CgQCsFNgrQiAdYt4AAM09ZWqxfs02301_200*200.jpg" alt="正品五粮液 浓香型 52度 500ml"/>
<span class="tag_video none"></span>
</a>
<div class="item_promotion_text" id="promostyle_909759" title="" promotionTag="0">
<div class="tip" id="promotion_909759" title="正品保证,假一赔十。" pnameSubtitle="正品保证,假一赔十。" pstock="" stockFlag="" ruleType="0" target="_blank" isFirst="0" isD50="0" isPhoneCategory="0">
正品保证,假一赔十。
</div>
</div>
<div class="pricebox clearfix">
<span class="color_red price" id="price0_16081" productId="16081" yhdPrice="569" productUnit="" diaperNum="500" >
¥569
</span>
<span id="unit_price_16081" class="unit_price"></span>
<del id="listprice0_16081"></del>
</div>
<p class="title">
<a class="title" id="pdlink2_16081" pmId="909759" href="http://item.yhd.com/item/909759" target="_blank" title="正品五粮液 浓香型 52度 500ml 正品保证,假一赔十。" onClick="addTrackPositionToCookie('1','pro_0');" singleFreeFlag="0">
正品五粮液 浓香型 52度 500ml 正品保证,假一赔十。
</a>
</p>
<div class="item_act clearfix">
<div class="shopping_act fl" id="shopping_act_16081" isnumBuy="0">
<a class="buy_btn" id="buyButton_16081" buyModule="0" index="0" specialType="0" productCode="0000160816" sellProductId="16081" merchantId="2" inshop="0" pminfo="909759" shoppingcount="1" href="http://item.yhd.com/item/909759"
onclick="addTrackPositionToCookie('1','pro_0');" ispromotion="0">查看详情</a>
</div>
</div>
<p class="comment" id="comment_16081" experienceCount="4379" positiveRate="94" totalNum="0">
<i></i>
<a id="pdlinkcomment_16081" pmId="909759" href="http://item.yhd.com/item/909759" target="_blank" onClick="addTrackPositionToCookie('1','search_exp_0');">
4379</a>
<span class="positiveRatio" title="好评率94%"><i></i>94%</span>
</p>
<div class="owner clearfix">
<span id = "merchant_16081" class="o_1">1号店自营</span>
</div>
<div class="item_status clearfix">
</div>
<u class="bg_border"></u>
</div>
</li>
<li class="search_item" id="producteg_12540334" data-tcd="5.39365062" data-tcs="3.0" >
<!-- s 电器类标题hover3行显示,添加class:electrical_item_box-->
<div class="search_item_box" comproid="12540334" id="itemSearchResultCon_12540334" combine_type="1"
productUnit="" diaperNum="0">
<input id="shop_12540334" type="hidden" value="0" />
<input id="serise_12540334" type="hidden" value="0" />
<input id="shop_pm_39365062" type="hidden" value="0" />
<a class="search_prod_img" defaultFlag="1" id="pdlink1_12540334" parentId=0 pmId="39365062" isSnapProduct="0" isOverSea="0" isOTCorRX="0" isReserve="0" isGrouponProv="0" grouponId="0" grouponType="0" href="http://item.yhd.com/item/39365062" target="_blank" onClick="addTrackPositionToCookie('1','pro_1');" isOne2more="0">
<img width="200" height="200" src="http://d9.yihaodianimg.com/N02/M07/B6/BC/CgQCsVIpqVeASU0fAALBHIb-Nnw65500_200*200.jpg" alt="五粮液 浓香型52度 500ml*6瓶/整箱"/>
<span class="tag_video none"></span>
</a>
<div class="item_promotion_text" id="promostyle_39365062" title="" promotionTag="0">
<div class="tip" id="promotion_39365062" title="自营正品保证 假一赔十" pnameSubtitle="自营正品保证 假一赔十" pstock="" stockFlag="" ruleType="0" target="_blank" isFirst="0" isD50="0" isPhoneCategory="0">
自营正品保证 假一赔十
</div>
</div>
<div class="pricebox clearfix">
<span class="color_red price" id="price0_12540334" productId="12540334" yhdPrice="3339" productUnit="" diaperNum="0" >
¥3339
</span>
<span id="unit_price_12540334" class="unit_price"></span>
<del id="listprice0_12540334"></del>
</div>
<p class="title">
<a class="title" id="pdlink2_12540334" pmId="39365062" href="http://item.yhd.com/item/39365062" target="_blank" title="五粮液 浓香型52度 500ml*6瓶/整箱 自营正品保证 假一赔十" onClick="addTrackPositionToCookie('1','pro_1');" singleFreeFlag="0">
五粮液 浓香型52度 500ml*6瓶/整箱 自营正品保证 假一赔十
</a>
</p>
<div class="item_act clearfix">
<div class="shopping_act fl" id="shopping_act_12540334" isnumBuy="0">
<a class="buy_btn" id="buyButton_12540334" buyModule="0" index="1" specialType="0" productCode="0125403341" sellProductId="12540334" merchantId="2" inshop="0" pminfo="39365062" shoppingcount="1" href="http://item.yhd.com/item/39365062"
onclick="addTrackPositionToCookie('1','pro_1');" ispromotion="0">查看详情</a>
</div>
</div>
<p class="comment" id="comment_12540334" experienceCount="69" positiveRate="100" totalNum="0">
<i></i>
<a id="pdlinkcomment_12540334" pmId="39365062" href="http://item.yhd.com/item/39365062" target="_blank" onClick="addTrackPositionToCookie('1','search_exp_1');">
69</a>
<span class="positiveRatio" title="好评率100%"><i></i>100%</span>
</p>
<div class="owner clearfix">
<span id = "merchant_12540334" class="o_1">1号店自营</span>
</div>
<div class="item_status clearfix">
</div>
<u class="bg_border"></u>
</div>
</li>匹配 价格 名称 好评率 销售量
名称 匹配 <a class="title" id="pdlink2_16081" pmId="909759" href="http://item.yhd.com/item/909759" target="_blank" title="正品五粮液 浓香型 52度 500ml 正品保证,假一赔十。" onClick="addTrackPositionToCookie('1','pro_0');" singleFreeFlag="0">
正品五粮液 浓香型 52度 500ml 正品保证,假一赔十。
</a>
要 正品五粮液 浓香型 52度 500ml 正品保证,假一赔十。价格 要¥569
销售量 <a id="pdlinkcomment_12540334" pmId="39365062" href="http://item.yhd.com/item/39365062" target="_blank" onClick="addTrackPositionToCookie('1','search_exp_1');">
69</a>
69好评率 <span class="positiveRatio" title="好评率100%"><i></i>100%</span> 100%
Matcher matcher = Pattern
.compile(
"<li[^>]+?search_item[\\s\\S]*?yhdprice=\"(.*?)\"[\\s\\S]+?diapernum=\"(.*?)\"[\\s\\S]+?title=\"(.*?)\"[\\s\\S]+?pdlinkcommen.+?>\\s*(\\d*)\\s*<[\\s\\S]*?好评率([\\d\\.%]*)[\\s\\S]+?(?=bg_border)",Pattern.CASE_INSENSITIVE)
.matcher(html);
while (matcher.find()){//System.out.println(matcher.group());
System.out.println(matcher.group(1) + " " + matcher.group(2)
+ " " + matcher.group(3) + " " + matcher.group(4) + " " + matcher.group(5));}