String regex = "[a-zA-z]+://[^\\s]*>";
Pattern p = Pattern.compile(regex);
String a="<http://tieba.baidu.com/f?kz=811750444>微软证实IE9将支持Canvas绘图音视频标签技术<http://tieba.baidu.com/f?kz=811750444#sub>"; Matcher m = p.matcher(a);
Vector links = new Vector();
while (m.find()) {
for (int i = 0; i <= m.groupCount(); i++) {
String tmp = m.group();
tmp = tmp.replaceAll(">", "");
tmp = tmp.replaceAll("'", "");
System.out.println(tmp);
}
}为什么我提取出来的是http://tieba.baidu.com/f?kz=811750444微软证实IE9将支持Canvas绘图音视频标签技术<http://tieba.baidu.com/f?kz=811750444#sub
请大家帮忙看看。。感激不尽啊
Pattern p = Pattern.compile(regex);
String a="<http://tieba.baidu.com/f?kz=811750444>微软证实IE9将支持Canvas绘图音视频标签技术<http://tieba.baidu.com/f?kz=811750444#sub>"; Matcher m = p.matcher(a);
Vector links = new Vector();
while (m.find()) {
for (int i = 0; i <= m.groupCount(); i++) {
String tmp = m.group();
tmp = tmp.replaceAll(">", "");
tmp = tmp.replaceAll("'", "");
System.out.println(tmp);
}
}为什么我提取出来的是http://tieba.baidu.com/f?kz=811750444微软证实IE9将支持Canvas绘图音视频标签技术<http://tieba.baidu.com/f?kz=811750444#sub
请大家帮忙看看。。感激不尽啊
改为
String regex = "[a-zA-z]+://[^\\s>]*>";
否则你的[^\\s]*>将匹配tieba.baidu.com/f?kz=811750444>微软证实IE9将支持Canvas绘图音视频标签技术<http://tieba.baidu.com/f?kz=811750444#sub>这一长串