I'm using a regular expression to extract table rows from a web page:
String buf = "<tr class=labcdkkkw><td>dd<td></tr><tr class....></tr>......";
Pattern pattern = Pattern.compile("<tr.*</tr>");
Matcher matcher = pattern.matcher(buf);
while (matcher.find()) {
System.out.println(matcher.group(0));
}
The result is:
<tr class=labcdkkkw><td>dd<td></tr><tr class....></tr>
But I want every <tr>...</tr> pair split out separately. How can I do that? For the result above, what I actually want is:
<tr class=labcdkkkw><td>dd<td></tr>
<tr class....></tr>
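public static void main(String[] args) {
    String buf = "<tr class=labcdkkkw><td>dd<td></tr><tr class....></tr>......";
    // .*? is a reluctant (non-greedy) quantifier: it stops at the first </tr>, not the last one.
    // (?i) at the start makes the whole pattern case-insensitive.
    Matcher m = Pattern.compile("(?i)<tr.*?>.*?</tr>").matcher(buf);
    while (m.find()) {
        System.out.println(m.group());
    }
}
Result: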
<tr class=labcdkkkw><td>dd<td></tr>
<tr class....></tr>
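
The difference between the two patterns comes down to greedy `.*` versus reluctant `.*?`. Here is a minimal, self-contained sketch of that idea. The class name `TrSplitter` and method `splitRows` are made up for illustration, and the `DOTALL` flag is my own assumption for rows that span several lines (by default `.` does not match line breaks). It also assumes rows are not nested; for messy real-world HTML a proper parser is probably the safer choice.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class TrSplitter {
    // Reluctant .*? stops at the nearest </tr>.
    // CASE_INSENSITIVE also matches <TR>; DOTALL lets '.' cross line breaks (assumption).
    private static final Pattern ROW =
            Pattern.compile("<tr\\b.*?</tr>", Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Hypothetical helper: returns each <tr>...</tr> block as its own string.
    public static List<String> splitRows(String html) {
        List<String> rows = new ArrayList<>();
        Matcher m = ROW.matcher(html);
        while (m.find()) {
            rows.add(m.group());
        }
        return rows;
    }

    public static void main(String[] args) {
        String buf = "<tr class=labcdkkkw><td>dd<td></tr><tr class....></tr>......";

        // Greedy .* runs to the LAST </tr>, so both rows come back as one match.
        Matcher greedy = Pattern.compile("<tr.*</tr>").matcher(buf);
        while (greedy.find()) {
            System.out.println("greedy:    " + greedy.group());
        }

        // Reluctant .*? yields one match per row.
        for (String row : splitRows(buf)) {
            System.out.println("reluctant: " + row);
        }
    }
}
```

With the sample string from the question, the greedy loop prints a single line containing both rows, while the reluctant version prints the two rows separately.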