Problem embedding Word in IE!!

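It looks like you already know how to view Word documents online in the browser.
I don't really know the answer to your question either. Here is an article I have not studied closely yet, so I am not sure whether it can do what you need: "Four weapons for extracting Word and PDF content with Java" (reposted)
1. Using jacob
jacob is a bridge: middleware that connects Java to COM or Win32 functions. jacob cannot extract Word, Excel and similar files by itself. It needs a DLL, but you do not have to write one, because jacob's author ships one along with the library. Download the jacob jar and DLL: http://www.matrix.org.cn/down_view.asp?id=13 Once you have downloaded jacob and put the files in place (the DLL on the PATH, the jar on the classpath), you can write your own extraction program. Here is a simple example: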
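import java.io.File;
import com.jacob.com.*;
import com.jacob.activeX.*;
/**
 * Title: pdf extraction
 * Description: email:[email protected]
 * Copyright: Matrix Copyright (c) 2003
 * Company: Matrix.org.cn
 * @author chris
 * @version 1.0,who use this example pls remain the declare
 */
// Despite the title above, this example converts a Word document to HTML.
public class FileExtracter {
    public static void main(String[] args) {
        // Start Word through COM; Word must be installed and the jacob DLL must be on the PATH.
        ActiveXComponent component = new ActiveXComponent("Word.Application");
        String inFile = "c:\\test.doc";
        String tpFile = "c:\\temp.htm";
        String otFile = "c:\\temp.xml"; // not used in this example
        boolean flag = false;           // set to true once the conversion succeeds
        try {
            component.setProperty("Visible", new Variant(false));
            // The Documents collection of the Word application.
            Dispatch wordacc = component.getProperty("Documents").toDispatch();
            // Documents.Open(FileName, ConfirmConversions=false, ReadOnly=true)
            Dispatch wordfile = Dispatch.invoke(wordacc, "Open", Dispatch.Method,
                    new Object[]{inFile, new Variant(false), new Variant(true)},
                    new int[1]).toDispatch();
            // SaveAs with file format 8, which is HTML in Word's WdSaveFormat.
            Dispatch.invoke(wordfile, "SaveAs", Dispatch.Method,
                    new Object[]{tpFile, new Variant(8)}, new int[1]);
            // Close the document without saving changes to the source file.
            Variant f = new Variant(false);
            Dispatch.call(wordfile, "Close", f);
            flag = true;
        } catch (Exception e) {
            e.printStackTrace();
        } finally {
            // Always shut Word down, otherwise a hidden WINWORD process stays behind.
            component.invoke("Quit", new Variant[] {});
        }
    }
}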
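A common stumbling block with the jacob setup described above is that the DLL is not actually in a directory the JVM searches. The following is only a rough sketch, using nothing but the standard library, of how one might check this before starting Word. The class name NativeLibLocator and the file name "jacob.dll" are assumptions for illustration; use whatever file name the download really contains.

```java
import java.io.File;
import java.util.regex.Pattern;

public class NativeLibLocator {
    /**
     * Returns the first directory in pathList that contains a file named
     * fileName, or null if no directory does. pathList uses the platform
     * separator (";" on Windows, ":" elsewhere).
     */
    public static File findOnPath(String fileName, String pathList) {
        if (pathList == null) {
            return null;
        }
        for (String dir : pathList.split(Pattern.quote(File.pathSeparator))) {
            if (dir.isEmpty()) {
                continue;
            }
            if (new File(dir, fileName).isFile()) {
                return new File(dir);
            }
        }
        return null;
    }

    public static void main(String[] args) {
        // "jacob.dll" is an assumed name, not taken from the article.
        String dllName = args.length > 0 ? args[0] : "jacob.dll";
        File hit = findOnPath(dllName, System.getProperty("java.library.path"));
        System.out.println(hit == null
                ? dllName + " not found on java.library.path"
                : dllName + " found in " + hit.getAbsolutePath());
    }
}
```

On Windows the default java.library.path normally includes the directories from PATH, so a "not found" result here is a hint that the DLL was copied to the wrong place.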
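2. Using Apache POI to extract Word and Excel
POI is an Apache project, but even with POI you may find the work tedious. Not to worry, a simpler interface is provided here. Download the wrapped POI package: http://www.matrix.org.cn/down_view.asp?id=14 After downloading, put it on your classpath. Here is an example of how to use it: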
import java.io.*;
import org.textmining.text.extraction.WordExtractor;
/**
 * Title: word extraction
 * Description: email:[email protected]
 * Copyright: Matrix Copyright (c) 2003
 * Company: Matrix.org.cn
 * @author chris
 * @version 1.0,who use this example pls remain the declare
 */
// Despite its name, this class extracts text from a Word document, not a PDF.
public class PdfExtractor {
    public PdfExtractor() {
    }
    public static void main(String args[]) throws Exception {
        FileInputStream in = new FileInputStream("c:\\a.doc");
        try {
            WordExtractor extractor = new WordExtractor();
            // Returns the plain text of the whole document.
            String str = extractor.extractText(in);
            System.out.println("the result length is" + str.length());
            System.out.println("the result is" + str);
        } finally {
            in.close();
        }
    }
}
3. pdfbox, for extracting PDF files
pdfbox's support for Chinese is still poor, though. First download pdfbox: http://www.matrix.org.cn/down_view.asp?id=12 Here is an example of extracting a PDF file with pdfbox:
import org.pdfbox.pdmodel.PDDocument;
import org.pdfbox.pdfparser.PDFParser;
import java.io.*;
import org.pdfbox.util.PDFTextStripper;
import java.util.Date;
/**
 * Title: pdf extraction
 * Description: email:[email protected]
 * Copyright: Matrix Copyright (c) 2003
 * Company: Matrix.org.cn
 * @author chris
 * @version 1.0,who use this example pls remain the declare
 */
public class PdfExtracter {
    public PdfExtracter() {
    }
    public String GetTextFromPdf(String filename) throws Exception {
        String temp = null;
        PDDocument pdfdocument = null;
        FileInputStream is = new FileInputStream(filename);
        PDFParser parser = new PDFParser(is);
        parser.parse();
        pdfdocument = parser.getPDDocument();
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        OutputStreamWriter writer = new OutputStreamWriter(out);
        PDFTextStripper stripper = new PDFTextStripper();
        stripper.writeText(pdfdocument.getDocument(), writer);
        writer.close();
        byte[] contents = out.toByteArray();
        String ts = new String(contents);
        System.out.println("the string length is" + contents.length + "\n");
        return ts;
    }
    public static void main(String args[]) {
        PdfExtracter pf = new PdfExtracter();
        PDDocument pdfdocument = null;
        try {
            String ts = pf.GetTextFromPdf("c:\\a.pdf");
            System.out.println(ts);
        } catch (Exception e) {
            e.printStackTrace();
        }
    }
}
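One detail in the pdfbox example above is that the OutputStreamWriter and the final new String(contents) both rely on the platform default charset. That is self-consistent, but it can lose characters on machines whose default charset cannot represent Chinese. Below is a minimal standard-library sketch of two ways to sidestep this. The class name CharsetRoundTrip is made up for illustration, and this does not address pdfbox's own Chinese extraction limits; it only concerns the stream handling around the stripper.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.StringWriter;
import java.io.Writer;
import java.nio.charset.StandardCharsets;

public class CharsetRoundTrip {
    /** Writes text through a byte stream and decodes it again, both as UTF-8. */
    public static String viaBytes(String text) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        Writer writer = new OutputStreamWriter(out, StandardCharsets.UTF_8);
        writer.write(text);
        writer.close();
        return new String(out.toByteArray(), StandardCharsets.UTF_8);
    }

    /** Collects text directly as characters, so no charset is involved at all. */
    public static String viaStringWriter(String text) throws IOException {
        StringWriter writer = new StringWriter();
        writer.write(text);
        writer.close();
        return writer.toString();
    }

    public static void main(String[] args) throws IOException {
        String sample = "PDF \u4e2d\u6587"; // "PDF" followed by two Chinese characters
        System.out.println(viaBytes(sample).equals(sample));
        System.out.println(viaStringWriter(sample).equals(sample));
    }
}
```

Since the stripper in the example writes to a Writer, passing it a StringWriter instead of the OutputStreamWriter would be the simpler of the two options.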
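4. Extracting PDF files that contain Chinese: xpdf
xpdf is an open-source project, and we can invoke it natively to extract text from Chinese PDF files. Download the xpdf package: http://www.matrix.org.cn/down_view.asp?id=15 You also need to download the Chinese support patch: http://www.matrix.org.cn/down_view.asp?id=16 After installing the Chinese patch as described in its readme, you can write the Java program that makes the native call. Here is an example of how to call it: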
import java.io.*;
/**
 * Title: pdf extraction
 * Description: email:[email protected]
 * Copyright: Matrix Copyright (c) 2003
 * Company: Matrix.org.cn
 * @author chris
 * @version 1.0,who use this example pls remain the declare
 */
public class PdfWin {
    public PdfWin() {
    }
    public static void main(String args[]) throws Exception {
        String PATH_TO_XPDF = "C:\\Program Files\\xpdf\\pdftotext.exe";
        String filename = "c:\\a.pdf";
        // "-" as the output file makes pdftotext write the text to stdout.
        String[] cmd = new String[] { PATH_TO_XPDF, "-enc", "UTF-8", "-q", filename, "-" };
        Process p = Runtime.getRuntime().exec(cmd);
        BufferedInputStream bis = new BufferedInputStream(p.getInputStream());
        InputStreamReader reader = new InputStreamReader(bis, "UTF-8");
        StringWriter out = new StringWriter();
        char[] buf = new char[10000];
        int len;
        while ((len = reader.read(buf)) >= 0) {
            // Accumulate every chunk; buf alone only holds the most recent read.
            out.write(buf, 0, len);
            System.out.println("the length is" + len);
        }
        reader.close();
        // Build the result from everything that was read, not from the raw buffer.
        String ts = out.toString();
        System.out.println("the str is" + ts);
    }
}
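For the xpdf approach, the part that is easy to get wrong is collecting the child process's output: every chunk that is read has to be appended, and the process's error stream should not be left unread. Here is a small sketch of that pattern using only the standard library. The class and method names (ProcessOutput, readAll, run) are hypothetical, and run is not tied to pdftotext; it simply takes whatever command line you pass it.

```java
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.StringReader;
import java.nio.charset.StandardCharsets;

public class ProcessOutput {
    /** Reads everything from the reader into one String, chunk by chunk. */
    public static String readAll(Reader reader) throws IOException {
        StringBuilder sb = new StringBuilder();
        char[] buf = new char[4096];
        int len;
        while ((len = reader.read(buf)) >= 0) {
            sb.append(buf, 0, len); // append only the part of buf that was filled
        }
        return sb.toString();
    }

    /** Runs a command and returns its standard output decoded as UTF-8. */
    public static String run(String... cmd) throws IOException, InterruptedException {
        ProcessBuilder pb = new ProcessBuilder(cmd);
        // Send the child's stderr to our own stderr so it cannot fill up and block.
        pb.redirectError(ProcessBuilder.Redirect.INHERIT);
        Process p = pb.start();
        String text;
        try (Reader r = new InputStreamReader(p.getInputStream(), StandardCharsets.UTF_8)) {
            text = readAll(r);
        }
        p.waitFor();
        return text;
    }

    public static void main(String[] args) throws Exception {
        // Demonstrates readAll without needing any external program.
        System.out.println(readAll(new StringReader("chunk one, chunk two")));
    }
}
```

With this in place, something like run(PATH_TO_XPDF, "-enc", "UTF-8", "-q", filename, "-") would be the equivalent of the loop in the example above, assuming pdftotext is installed at that path.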


=============================================================================
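Software name: NTKO Office Document Control
Version: 1.6
Author: NTKO
Author email: [email protected],[email protected],[email protected]
Website: http://www.ntko.com
Language: Chinese
License: demo version
Application platforms: Domino, IIS, Apache, WebSphere, etc.
Server operating systems: windows, linux, unix
Client operating systems: windows98+, win2k, me, xp, 2003, nt
Supported server-side languages: asp, lotusscript, jsp, php, perl, etc.
Supported client-side languages: vbscript, j/javascript
Size: 67kb
Description: New in NTKO Office Document Control V1.6: full support for print preview! Excellent performance, a low price, and a standard, efficient programming interface let you build distinctive solutions at minimal cost. The NTKO Office Document Control helps you quickly create and deploy browser-based office and document management solutions that integrate closely with Microsoft Office. With it, Word, Excel and other Office documents can be handled directly in the browser window. For example, you can manage all templates centrally on the server and let your staff create Office documents online. The control supports forced change tracking and all Office menus, needs no FTP, and integrates seamlessly with Domino, which it supports without any server-side programming. It works with a range of back-end web servers and programming languages.
Feature highlights:
1) Full support for print preview.
2) Edited Office documents can be saved directly to the web server.
3) All Office menus are supported, matching what users are used to.
4) Online editing of all Office document types, including Microsoft Word, Excel and PowerPoint.
5) Lightweight: less than 100K, with no other DLLs required.
6) Clients only need win95, IE4.0 and Office97 or later.
7) Supports HTTP and HTTPS over SSL.
8) Supports forced change tracking, document protection, copy prevention and many other functions.
9) Can be scripted with JavaScript and VBScript.
10) Works with a range of back-end web servers and programming languages.
11) No temporary files are created when downloading from or uploading to a URL.
12) Ships with a rich VBScript and JavaScript function library; saving to the server follows RFC 1867 (Form-based File Upload in HTML).
13) Automatic Session Cookie support: the current page's Session Cookie is used when talking to the server.
14) No FTP needed; integrates seamlessly with Domino.
15) Domino is supported without any server-side programming.
16) Simple, standard programming with no custom data formats.
17) Because no temporary files are used, saving to the server does not trigger warnings from antivirus software such as Norton Antivirus.
And more.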
==============================================================================
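If you only need to view the document, you can use <iframe src="*.doc">
If you need online editing, you can build your own control and embed Word through OLE. [Basically: use the CFrameWnd class to create a frame that supports Doc/View, use COleClientItem to embed Word, and use the CWinInet classes with CHttpFile to download and upload the file.]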
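For the view-only case mentioned above, if the page is generated on the server side in Java, the iframe tag could be produced by a small helper like the one below. This is only an illustrative sketch: the class name IframeSnippet, the width and height parameters, and the example path are all made up. Whether IE actually shows the document inline or offers a download depends on the client (for example, whether Word is installed), which this code cannot control.

```java
public class IframeSnippet {
    /** Escapes the characters that are special inside an HTML attribute value. */
    static String escapeAttr(String s) {
        return s.replace("&", "&amp;")
                .replace("\"", "&quot;")
                .replace("<", "&lt;")
                .replace(">", "&gt;");
    }

    /** Builds an iframe tag whose src points at the given document URL. */
    public static String iframeFor(String docUrl, int width, int height) {
        return "<iframe src=\"" + escapeAttr(docUrl) + "\" width=\"" + width
                + "\" height=\"" + height + "\"></iframe>";
    }

    public static void main(String[] args) {
        // "/docs/report.doc" is a hypothetical path used only for this demo.
        System.out.println(iframeFor("/docs/report.doc", 800, 600));
    }
}
```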
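Use the 办公之星 (Office Star) control (http://www.standsoft.com); the client does not need Office installed.
Software name: 办公之星 control (OStarOCX)
Version: V1.02
Last updated: 2003.9.18
Runtime environment: Win9x/2000/Nt/xp
Supported languages: VC/VB/PB/Delphi/ASP/JSP
Interface language: Simplified Chinese
License: trial software
Size: 1036KB
Official website: http://www.standsoft.com
Email: [email protected]
Author: 深圳智岛软件有限公司 (Shenzhen Zhidao Software Co., Ltd.)
Demo project download: http://www.oapro.com/ostarocx/download_demo.asp
Online demo: http://www.oapro.com/ostarocx/product_demo.asp
Description: OStarOCX is a document processing control with mixed text and graphics layout (a Word-like control) released by Zhidao Software (http://www.standsoft.com). It fully wraps the text, graphics, table and chart features of 办公之星 and exposes several hundred programming interfaces. In ERP, CRM, HR, OA and similar systems, output features that are very hard to develop, such as document processing, statistics and reports, can be implemented easily with OStarOCX, and its WYSIWYG output matches the real printed result. OStarOCX works with the various development tools on Windows (such as VB, VC, Delphi, C++Builder, PB and Java) and has been heavily optimized for web development environments.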