Java document parser API for extracting text, images, metadata, and encoding from databases, Word, Excel, presentations, PDF, email, EPUB, and ZIP files. ... DOC, DOCX, DOCM, DOT, DOTX, DOTM. Spreadsheets: XLS, XLSX, XLSM ... PST, EMLX, TXT, DOCM, PPS, MSG, XLSM, DOTM, DOCX, OTT, RTF, MD, BMP, CSV, ODS ...