Java document parser API to extract text, images, metadata & encoding from databases, Word, Excel, presentations, PDF, email, EPUB and ZIP files....DOT, DOTX, DOTM Spreadsheets : XLS, XLSX, XLSM, XLSB, XLT, XLTX...CSV ODS XLAM XLSB DOC JP2 PPSX XLS TIFF XLT TIF EPUB PNG ODT JPEG...