Java document parser API to extract text, images, metadata, and encoding from databases, Word, Excel, presentations, PDF, email, EPUB, and ZIP files. ... DOTM, DOTX, DOCM, RTF, ODT, OTT, TXT, MD, WordprocessingML (XML) ... DOC, DOCX, DOT, DOTX, DOTM, OTT, ODT. Spreadsheets: XLS, XLSX ...