Java document parser API to extract text, images, metadata & encoding from databases, Word, Excel, presentations, PDF, email, EPUB and ZIP files....containers like ZIP archives, OST/PST mail data files, eBooks,...Portable Formats : PDF Email : PST, OST, EML, EMLX, MSG eBook Formats...