C# .NET document parsing API to extract text, images, metadata & encoding from databases, PDF, Word, Excel, presentations, web, email, EPUB & zip file formats.... RTF Markup : HTML, XHTML, MHTML, MD, XML Portable Formats :...TAR PPSX XLSB XLSX DOC CHM MHTML DOCM GIF PPTX JPEG OTS DOT Extract...