As the title says: which jar should I use to parse the content of a PDF? My code is as follows:
import java.io.ByteArrayOutputStream;
import java.io.File;
import java.io.OutputStreamWriter;

import org.pdfbox.pdmodel.PDDocument;
import org.pdfbox.util.PDFTextStripper;

public class PdfToText {
    // Extracts the text of a PDF file; returns a fallback message if extraction fails.
    public static String getTxt(File f) throws Exception {
        String ts = "Extraction failed......";
        PDDocument doc = null;
        try {
            doc = PDDocument.load(f);
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            OutputStreamWriter writer = new OutputStreamWriter(out);
            PDFTextStripper stripper = new PDFTextStripper();
            stripper.writeText(doc.getDocument(), writer);
            // Close the writer before reading the bytes so buffered text is flushed.
            writer.close();
            ts = new String(out.toByteArray());
            /* PDFParser parser = new PDFParser(f);
            parser.parse();
            PDDocument document = parser.getPDDocument();
            PDFTextStripper stripper = new PDFTextStripper();
            ts = stripper.getText(document);*/
        } catch (Exception e) {
            e.printStackTrace();
        } finally {
            // Release the document even if extraction threw.
            if (doc != null) {
                doc.close();
            }
        }
        return ts;
    }

    /**
     * @param args
     * @throws Exception
     */
    public static void main(String[] args) throws Exception {
        File file = new File("C:\\项目构架师的资料.pdf");
        try {
            System.out.println(PdfToText.getTxt(file));
        } catch (Exception e) {
            e.printStackTrace();
        }
    }
}
The error is as follows:
Exception in thread "main" java.lang.NoClassDefFoundError: org/bouncycastle/jce/provider/BouncyCastleProvider
at org.pdfbox.pdmodel.PDDocument.openProtection(PDDocument.java:905)
at org.pdfbox.pdmodel.PDDocument.decrypt(PDDocument.java:489)
at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:204)
at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:174)
at JavaBean.PdfToText.getTxt(PdfToText.java:24)
at JavaBean.PdfToText.main(PdfToText.java:50)
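I searched online and read that this class is in the JCE package that ships with JDK 1.4. I looked through the JCE package but could not find the class. I also downloaded bouncycastle.jar, and the class is not in there either. Does anyone know where it is?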
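To confirm whether a downloaded jar really contains the class named in the stack trace, one option is to look for the matching entry inside the jar. The following is only a minimal sketch: the class name `JarClassCheck` and the method `containsClass` are made up for illustration, the jar path is assumed to be passed on the command line, and it assumes Java 7 or later (for try-with-resources).

```java
import java.io.File;
import java.io.IOException;
import java.util.jar.JarFile;

// Hypothetical helper, not part of PDFBox or Bouncy Castle.
public class JarClassCheck {

    /** Returns true if the jar has an entry for the given fully qualified class name. */
    public static boolean containsClass(File jar, String className) throws IOException {
        // A class a.b.C is stored in a jar as the entry a/b/C.class
        String entryName = className.replace('.', '/') + ".class";
        try (JarFile jarFile = new JarFile(jar)) {
            return jarFile.getEntry(entryName) != null;
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 1) {
            System.out.println("usage: java JarClassCheck <path-to-jar>");
            return;
        }
        boolean found = containsClass(
                new File(args[0]),
                "org.bouncycastle.jce.provider.BouncyCastleProvider");
        System.out.println(found ? "class is in this jar" : "class is NOT in this jar");
    }
}
```

If the check reports that the class is not in the jar, the jar is probably not the Bouncy Castle provider jar, and adding it to the classpath will not remove the NoClassDefFoundError.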
Solutions »
Try switching to JDK 1.6 and see if that helps.
http://www.bouncycastle.org/latest_releases.html
This is probably the one you need, haha.
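After adding a jar from that page to the classpath, a quick way to see whether the JVM can now find the class, and which jar it came from, is to load it by name. This is just a sketch under assumptions: `ClasspathCheck` and `locate` are illustrative names, and it assumes Java 7 or later for the multi-catch. The Bouncy Castle provider is normally shipped in a jar whose name starts with `bcprov`.

```java
import java.security.CodeSource;

// Hypothetical diagnostic helper, not part of PDFBox or Bouncy Castle.
public class ClasspathCheck {

    /** Reports where a class is loaded from, or that it cannot be loaded. */
    public static String locate(String className) {
        try {
            // Load without initializing, so no static initializers run.
            Class<?> c = Class.forName(className, false, ClasspathCheck.class.getClassLoader());
            CodeSource src = c.getProtectionDomain().getCodeSource();
            // Classes from the JDK itself usually have no code source.
            return src == null ? "found (JDK class)" : "found in " + src.getLocation();
        } catch (ClassNotFoundException | LinkageError e) {
            return "missing: " + e;
        }
    }

    public static void main(String[] args) {
        System.out.println(locate("org.bouncycastle.jce.provider.BouncyCastleProvider"));
    }
}
```

Run it with the same classpath as the PDF program. If it prints "missing", the provider jar is still not visible to the JVM.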