My code looks like this:

import java.io.BufferedReader;
import java.io.InputStreamReader;
import java.net.URL;
import java.util.ArrayList;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

static ArrayList<String> goodlist = new ArrayList<String>();
static ArrayList<String> badlist = new ArrayList<String>(); // used below but missing from my original paste

// Compile the pattern once instead of once per input line.
// Note: because of the greedy leading .*, it captures at most one href per line (the last one).
static Pattern p = Pattern.compile("(.*href=?[\"])(.*?)([\"].*)");

public static void urlList(String url) throws Exception {
    String link = null;
    URL myurl = new URL(url);
    if (checkLink(myurl)) {
        BufferedReader in = new BufferedReader(new InputStreamReader(myurl.openStream()));
        String inputLine;
        while ((inputLine = in.readLine()) != null) {
            Matcher m = p.matcher(inputLine);
            while (m.find()) {
                if (!m.group(2).startsWith("http")) {
                    // relative link: glue it onto the page URL
                    link = url + "/" + m.group(2);
                } else {
                    link = m.group(2);
                }
                // normalize .../index.html to the bare directory URL before the
                // dedup check; the original stripped it afterwards and so added
                // the same link twice
                if (link.contains("/index.html")) {
                    link = link.replaceAll("/index.html", "");
                }
                if (!goodlist.contains(link)) {
                    goodlist.add(link);
                }
            }
        }
        in.close();
    } else {
        badlist.add(url); // the original added link here, which is still null at this point
    }
}

Calling it with String str = "http://www.yahoo.co.uk"; pl.urlList(str); fills goodlist with all the subpages found at that URL.
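For reference, a minimal driver for the fragment above might look like the sketch below. The checkLink stand-in is invented here just to make the fragment self-contained; the real helper is not shown in the post.

// hypothetical stand-in for the real checkLink helper, which the post does not show
static boolean checkLink(URL u) {
    return u.getProtocol().startsWith("http");
}

public static void main(String[] args) throws Exception {
    urlList("http://www.yahoo.co.uk");
    for (int i = 0; i < goodlist.size(); i++) {
        System.out.println(goodlist.get(i)); // every link found on the start page
    }
}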
My problem is that each subpage contains subpages of its own, so to find them all I presumably need recursion. My idea was to add the following right after in.close() in the code above:

in.close();
// System.out.println(goodlist.size());
for (int i = 0; i < goodlist.size(); i++) {
    System.out.println(goodlist.get(i));
    urlList(goodlist.get(i)); // recurse into each page that has been found
}

But that does not seem to work: it fails with an error at runtime. Could someone take a look and tell me what is wrong with this recursion? Thanks.
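A likely reason the plain recursive version dies at runtime: every recursive call walks the whole of goodlist from index 0 and recurses into pages that were already fetched, so the same URLs are downloaded over and over until the stack overflows; and new URL(...) throws MalformedURLException the moment the regex captures an href such as javascript:... or a fragment that the string concatenation turns into garbage. Below is a minimal sketch of a recursion with both problems guarded, reusing goodlist, badlist, and urlList from above; the visited set, the try/catch, and the name crawlRecursive are additions for illustration, not part of the original post.

import java.util.ArrayList;
import java.util.HashSet;

static HashSet<String> visited = new HashSet<String>(); // pages already fetched

public static void crawlRecursive(String url) {
    if (!visited.add(url)) {
        return; // add() returns false when the URL was seen before
    }
    try {
        urlList(url); // appends this page's links to goodlist
    } catch (Exception e) {
        badlist.add(url); // malformed or unreachable URL: record it and move on
        return;
    }
    // snapshot the list, because urlList keeps growing goodlist while we recurse
    ArrayList<String> snapshot = new ArrayList<String>(goodlist);
    for (int i = 0; i < snapshot.size(); i++) {
        crawlRecursive(snapshot.get(i));
    }
}

Even with the guard, a deep site can still blow the stack, which is why the loop-based rewrite below is the safer shape.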
So I reworked it to drive the crawl with a loop over goodlist (which keeps growing as links are found) instead of recursing:

public static void crawl(String url) throws Exception { // enclosing signature was missing from the paste; the name is a guess
    goodlist.add(url); // seed the list with the start URL
    for (int i = 0; i < goodlist.size(); i++) {
        // goodlist grows while we walk it; the index loop picks up the new entries
        extractUrls(goodlist.get(i)); // cast no longer needed with ArrayList<String>
    }
}

public static void extractUrls(String url) throws Exception {
    String link = null;
    URL myurl = new URL(url);
    if (checkLink(myurl)) {
        BufferedReader in = new BufferedReader(new InputStreamReader(myurl.openStream()));
        String inputLine;
        while ((inputLine = in.readLine()) != null) {
            Matcher m = p.matcher(inputLine); // same pattern as in urlList above
            while (m.find()) {
                if (!m.group(2).startsWith("http")) {
                    link = url + "/" + m.group(2);
                } else {
                    link = m.group(2);
                }
                if (link.contains("/index.html")) {
                    link = link.replaceAll("/index.html", "");
                }
                if (!goodlist.contains(link)) {
                    goodlist.add(link);
                }
            }
        }
        in.close();
    } else {
        badlist.add(url); // again: link would still be null here
    }
}
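One detail worth noting about the driver: the index-based loop is what makes modifying goodlist during the walk legal. A for-each (or explicit Iterator) over an ArrayList is fail-fast and throws ConcurrentModificationException if the list is structurally modified mid-iteration, while re-reading size() on every pass simply picks up the appended entries. A tiny illustration (not from the post):

ArrayList<String> list = new ArrayList<String>();
list.add("a");
for (int i = 0; i < list.size(); i++) {
    if (list.size() < 3) {
        list.add(list.get(i) + "x"); // appending while looping is fine with an index
    }
}
// for (String s : list) { list.add(s); } // this form would throw ConcurrentModificationException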
This goodlist is simply the list of all the hyperlinks that have been found. It looks fine to me; could an expert help me analyze it? And what sort of things should I be paying attention to in a program like this?
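A few things that commonly bite this kind of crawler, none of them shown in the post: goodlist.contains(...) is a linear scan, so deduplication cost grows quadratically with the number of links; one malformed href aborts the whole run because new URL(...) throws out of the loop; and without a host check the crawl happily wanders off to every external site it links to. A hardened variant of the driver, sketched under those assumptions (crawlSameHost, the seen set, and the cap are inventions for illustration):

import java.net.URL;
import java.util.ArrayList;
import java.util.HashSet;

public static void crawlSameHost(String start) throws Exception {
    String host = new URL(start).getHost(); // stay on the starting site
    HashSet<String> seen = new HashSet<String>(); // O(1) dedup instead of goodlist.contains
    ArrayList<String> worklist = new ArrayList<String>();
    worklist.add(start);
    seen.add(start);
    for (int i = 0; i < worklist.size() && i < 1000; i++) { // hard cap so a huge site cannot run forever
        String url = worklist.get(i);
        int before = goodlist.size();
        try {
            extractUrls(url); // appends newly found links to goodlist
        } catch (Exception e) {
            badlist.add(url); // one bad page should not stop the crawl
            continue;
        }
        for (int j = before; j < goodlist.size(); j++) { // only the links this page added
            String link = goodlist.get(j);
            if (!seen.add(link)) {
                continue; // already queued or visited
            }
            try {
                if (host.equals(new URL(link).getHost())) {
                    worklist.add(link); // follow links on the same host only
                }
            } catch (Exception malformed) {
                badlist.add(link); // e.g. javascript: or a broken relative path
            }
        }
    }
}

Beyond that, a real crawler should also respect robots.txt and rate-limit its requests rather than hammering the server in a tight loop.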