Java document parser API to extract text, images, metadata & encoding from databases, Word, Excel, presentations, PDF, email, EPUB and ZIP files....XLSB DOC JP2 PPSX XLS TIFF XLT TIF EPUB PNG ODT JPEG GZ DOT GIF...FB2 RTF CSV CHM XHTML BZ2 EPUB TIF PPSM JPEG DOT BMP OST POTM DOC...