C# .NET document parsing API to extract text, images, metadata & encoding from databases, PDF, Word, Excel, presentations, web, email, EPUB & zip file formats....OTP Text : TXT, RTF Markup : HTML, XHTML, MHTML, MD, XML Portable...XML XHTML TXT XLS TIF PPT POTM HTML EML MSG GZ PPSM RAR DOCX POTX...