用spider抓取网页后,存入数据库,但抓取的网页是乱码,怎么解决????

解决方案 »

  1.   

    得到的数据用
    URLEncoder.encode(str,"UTF-8") 试试呗
      

  2.   

    你说的方法都试过,乱码是没了,但是整个页面都变成下面这样了。%3C%21DOCTYPE+html+PUBLIC+%22-%2F%2FW3C%2F%2FDTD+XHTML+1.0+Transitional%2F%2FEN%22+%22http%3A%2F%2Fwww.w3.org%2FTR%2Fxhtml1%2FDTD%2Fxhtml1-transitional.dtd%22%3E%0A%3C%21--%5B704%2C2%2C184889%5D+published+at+2012-10-19+11%3A29%3A43+from+%23182+by+7118--%3E%0A%0A%3C%21--+LLTJ_MT%3Aname+%3D%22%3F%3F%3F%A7%DB%3F%3F%3F%3F%3F%3F%A6%B1%3F%22+--%3E%0D%0A%3C%21--+LLTJ_ZT%3Aurl%3D%22http%3A%2F%2Ftravel.sina.com.cn%2Fz%2FChinaNationalTravel%2Findex.shtml%22%3B+name%3D%22%3F%3F%3F%A7%DB%3F%3F%3F%3F%3F%3F%A6%B1%3F%2CChina+National+Travel%22%3Btype%3D%22ZW%22%3B+--%3E%0D%0A%0A%3Chtml+xmlns%3D%22http%3A%2F%2Fwww.w3.org%2F1999%2Fxhtml%22%3E%0D%0A%3Chead%3E%0D%0A%3Cmeta+http-equ.........
      

  3.   

    ....转成GBK呢?  然后看看数据库的编码
      

  4.   


    都试过,数据库存的是2进制编码,初步怀疑是因为接口调用的时候编码不是utf-8所导致的。