Java document parser API to extract text, images, metadata & encoding from databases, Word, Excel, presentations, PDF, email, EPUB and ZIP files....Email : PST, OST, EML, EMLX, MSG eBook Formats : EPUB, FB2, CHM...OTS POT PST EMLX TXT DOCM PPS MSG XLSM DOTM DOCX OTT RTF MD BMP...