本来想实现word to html的,小弟的水平比较差,呵呵, 只能转txt了.
代码
=======================
import java.io.*;
import org.textmining.text.extraction.*;
public class WordToTxt
{
public static void main(String[] args)
{
String paths = new String("D:\\com\\wordtohtml\\doc\\doc.doc");
try
{
FileInputStream in = new FileInputStream(paths);
WordExtractor extractor = new WordExtractor();
System.out.println(in.available());
String str = extractor.extractText(in);
System.out.println(str);
java.io.FileWriter fw=new java.io.FileWriter("doc.txt");
fw.write(str);
fw.close();
} catch (Exception e)
{
e.printStackTrace();
}
}
}
本贴相关附件如下:
tm-extractors-0.4.jar搜索更多相关主题的帖子: