
text-mining
A Java-based library that extracts text from Microsoft Word for Windows binary documents, including Word 1.0, 2.0, 4.0, 6.0, 95, 97, 2000, XP, and 2003. It also extracts text from fast-saved files.
Project Information
- License: GNU Lesser General Public License (LGPL)
- 11 stars
- Source control: Subversion (SVN)
Labels:
- Word
- Office
- Textextraction