Usage: java -jar pdfbox-app-x.y.z.jar ExtractText [OPTIONS] <PDF file> [Text File]
-password <password> Password to decrypt document
-encoding <output encoding> (ISO-8859-1,UTF-16BE,UTF-16LE,...)
-console Send text to console instead of file
-html Output in HTML format instead of raw text
-sort Sort the text before writing
-ignoreBeads Disables the separation by beads
-force Enables pdfbox to ignore corrupt objects
-debug Enables debug output about the time consumption of every stage
-startPage <number> The first page to start extraction(1 based)
-endPage <number> The last page to extract(inclusive)
-nonSeq Enables the new non-sequential parser
<PDF file> The PDF document to use
[Text File] The file to write the text to
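This output appears every time I run the program in the debugger. How do I go on to actually use those features? Do I only need to change the path, or do I have to modify other code? [I'm a beginner, so any pointers from the experts would be appreciated.]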
Solutions »
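The text in the question is the tool's own usage message. A likely cause (an assumption, since the poster's run configuration is not shown) is that ExtractText was started without the required <PDF file> argument, in which case no code changes should be needed, only arguments. Below is a minimal sketch that assembles such a command line using only options listed in the usage text. The class name ExtractTextArgs and the file names input.pdf and output.txt are hypothetical placeholders, not part of PDFBox.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper (not part of PDFBox): builds the command line
// described by the usage message, using only flags that it lists.
final class ExtractTextArgs {

    static List<String> build(String jarPath, String pdfPath, String textPath,
                              int startPage, int endPage) {
        List<String> cmd = new ArrayList<>();
        cmd.add("java");
        cmd.add("-jar");
        cmd.add(jarPath);
        cmd.add("ExtractText");
        // [OPTIONS] come before the file arguments.
        cmd.add("-sort");
        cmd.add("-startPage");
        cmd.add(Integer.toString(startPage)); // 1-based
        cmd.add("-endPage");
        cmd.add(Integer.toString(endPage));   // inclusive
        // <PDF file> is required; [Text File] is optional.
        cmd.add(pdfPath);
        if (textPath != null) {
            cmd.add(textPath);
        }
        return cmd;
    }

    public static void main(String[] args) {
        // Placeholder paths; replace with the real jar and PDF locations.
        List<String> cmd = build("pdfbox-app-x.y.z.jar", "input.pdf", "output.txt", 1, 3);
        System.out.println(String.join(" ", cmd));
    }
}
```

When running from an IDE instead of the jar, the same idea applies: put everything after `ExtractText` (options, then the PDF path, then the optional text file path) into the run configuration's program arguments field.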