报的错误如下:
org.htmlparser.util.EncodingChangeException: character mismatch (new: [0xfeff] != old: [0x9518锘]) for encoding change from gbk to UTF-8 at character offset 0
at org.htmlparser.lexer.InputStreamSource.setEncoding(InputStreamSource.java:280)
at org.htmlparser.lexer.Page.setEncoding(Page.java:865)
at org.htmlparser.tags.MetaTag.doSemanticAction(MetaTag.java:150)
at org.htmlparser.scanners.TagScanner.scan(TagScanner.java:69)
at org.htmlparser.scanners.CompositeTagScanner.scan(CompositeTagScanner.java:160)
at org.htmlparser.util.IteratorImpl.nextNode(IteratorImpl.java:92)
at org.htmlparser.Parser.parse(Parser.java:701)
at extraction.Clawer.GetHtmlStr(Clawer.java:76)
at extract_suning.ExtracBaseInfo.main(ExtracBaseInfo.java:28)不是很明白,求解释,求解决!!
org.htmlparser.util.EncodingChangeException: character mismatch (new: [0xfeff] != old: [0x9518锘]) for encoding change from gbk to UTF-8 at character offset 0
at org.htmlparser.lexer.InputStreamSource.setEncoding(InputStreamSource.java:280)
at org.htmlparser.lexer.Page.setEncoding(Page.java:865)
at org.htmlparser.tags.MetaTag.doSemanticAction(MetaTag.java:150)
at org.htmlparser.scanners.TagScanner.scan(TagScanner.java:69)
at org.htmlparser.scanners.CompositeTagScanner.scan(CompositeTagScanner.java:160)
at org.htmlparser.util.IteratorImpl.nextNode(IteratorImpl.java:92)
at org.htmlparser.Parser.parse(Parser.java:701)
at extraction.Clawer.GetHtmlStr(Clawer.java:76)
at extract_suning.ExtracBaseInfo.main(ExtracBaseInfo.java:28)不是很明白,求解释,求解决!!
我是抓取苏宁易购的网页,所以网页是无法修改;那class编码该改成什么呢?