Chinese text is garbled when extracting web page content with Java

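Today I ran a small experiment in Java to extract the content of a web page, but for some reason the Chinese text in the extracted content comes out garbled.
Any advice from the experts would be much appreciated.
The source code is as follows: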
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.net.MalformedURLException;
import java.net.URL;

public class WebContent {

    // Downloads the page at htmlurl and returns its text, decoded as GB2312.
    public String getOneHtml(String htmlurl) throws IOException {
        StringBuilder sb = new StringBuilder();
        try {
            URL url = new URL(htmlurl);
            // The charset given here must match the encoding the page is served in.
            try (BufferedReader in = new BufferedReader(
                    new InputStreamReader(url.openStream(), "gb2312"))) {
                String temp;
                while ((temp = in.readLine()) != null) {
                    sb.append(temp);
                }
            }
        } catch (MalformedURLException me) {
            System.out.println("Invalid URL, please check your input: " + me.getMessage());
            throw me;
        } catch (IOException e) {
            e.printStackTrace();
            throw e;
        }
        return sb.toString();
    }

    // main must declare IOException because getOneHtml can throw it.
    public static void main(String[] args) throws IOException {
        WebContent web = new WebContent();
        String webcontent = web.getOneHtml("http://www.baidu.com/");
        System.out.println(webcontent);
    }
}

Solutions »


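I don't get any garbled text. What encoding is your Java source file saved in?
I don't know whether you are using Eclipse or MyEclipse. If you are, go to Window -> Preferences -> General -> Workspace and change the "Text file encoding" value in the right-hand pane to GBK. That should fix it.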
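To check whether the IDE or console encoding is really the culprit, something like the following might help. It is only a rough sketch: the class name ConsoleEncodingCheck and the helper decodeWith are made up for illustration, and it assumes the JDK in use ships the GBK charset. It prints the JVM default charset and shows how bytes written in one encoding look when read back in another.

```java
import java.io.PrintStream;
import java.io.UnsupportedEncodingException;
import java.nio.charset.Charset;

// Hypothetical helper class, not part of the original post.
class ConsoleEncodingCheck {

    // Encodes text with one charset and decodes the bytes with another,
    // which is what happens when the writer and the reader disagree.
    static String decodeWith(String text, String writeCharset, String readCharset) {
        byte[] bytes = text.getBytes(Charset.forName(writeCharset));
        return new String(bytes, Charset.forName(readCharset));
    }

    public static void main(String[] args) throws UnsupportedEncodingException {
        // Charset the JVM falls back to when none is given explicitly.
        System.out.println("Default charset: " + Charset.defaultCharset());

        // Two Chinese characters written as Unicode escapes, so the encoding
        // of this source file cannot affect them.
        String text = "\u4e2d\u6587";
        System.out.println("GBK read as GBK:   " + decodeWith(text, "GBK", "GBK"));
        System.out.println("GBK read as UTF-8: " + decodeWith(text, "GBK", "UTF-8"));

        // Print through a stream with an explicit charset (GBK is an assumption,
        // taken from the suggestion above) instead of relying on the default.
        PrintStream out = new PrintStream(System.out, true, "GBK");
        out.println(text);
    }
}
```

If the last line shows up correctly only after the console encoding is switched to GBK, the problem is on the display side rather than in the download code.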
It is probably a problem with the encoding of the file.
Reply #5 is about garbled characters inside the source code itself.
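That one is a matter of the encoding used by the program you write the code in. Another issue is that Chinese text can also come out garbled when displayed on a JSP page.
You need to convert the encoding once when sending the data to the JSP, and convert it again when displaying it.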
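The reply does not say which encodings are involved, so the following is only a guess at the kind of conversion meant. It assumes the common case where UTF-8 bytes were decoded as ISO-8859-1 somewhere along the way; the class Transcode and both method names are hypothetical. Because ISO-8859-1 maps every byte to exactly one character, the original bytes can be recovered and decoded again with the right charset.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, not part of the original post.
class Transcode {

    // Simulates a component that wrongly decoded UTF-8 bytes as ISO-8859-1.
    static String misdecode(String original) {
        byte[] utf8 = original.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, StandardCharsets.ISO_8859_1);
    }

    // Undoes the wrong decoding: gets the raw bytes back through ISO-8859-1
    // and decodes them as UTF-8 (both charsets are assumptions).
    static String repair(String garbled) {
        byte[] raw = garbled.getBytes(StandardCharsets.ISO_8859_1);
        return new String(raw, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String original = "\u4e2d\u6587"; // two Chinese characters as escapes
        String garbled = misdecode(original);
        System.out.println("Garbled:  " + garbled);
        System.out.println("Repaired: " + repair(garbled));
    }
}
```

This only works when the wrong decoding was lossless. If the text was decoded with a charset that replaces unknown bytes, the original cannot be recovered this way.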
You need to be clear about whether the page, the IDE, and the streams all use the same encoding.
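One way to keep the stream in step with the page, instead of hardcoding "gb2312", might be to read the charset from the HTTP Content-Type header. The sketch below is an illustration only: the class CharsetSniffer and its method are invented names, and it does not look at any charset declared inside the HTML itself.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.Locale;

// Hypothetical helper class, not part of the original post.
class CharsetSniffer {

    // Extracts the charset parameter from a Content-Type value such as
    // "text/html; charset=GBK"; returns fallback if it is missing or unknown.
    static Charset charsetFromContentType(String contentType, Charset fallback) {
        if (contentType == null) {
            return fallback;
        }
        for (String part : contentType.split(";")) {
            String p = part.trim();
            if (p.toLowerCase(Locale.ROOT).startsWith("charset=")) {
                String name = p.substring("charset=".length()).trim().replace("\"", "");
                try {
                    return Charset.forName(name);
                } catch (IllegalArgumentException e) {
                    // Covers both illegal and unsupported charset names.
                    return fallback;
                }
            }
        }
        return fallback;
    }

    public static void main(String[] args) {
        Charset fallback = StandardCharsets.UTF_8;
        System.out.println(charsetFromContentType("text/html; charset=UTF-8", fallback));
        System.out.println(charsetFromContentType("text/html", fallback));
    }
}
```

In the code from the question, the header value could come from url.openConnection().getContentType(), and the resulting Charset could be passed to the InputStreamReader constructor in place of the fixed name.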