C# .NET document parsing API To extract text, images, metadata & encoding from databases, PDF, Word, Excel, presentations, web, email, EPUB & zip file formats.... DOTX, DOCM, RTF, ODT, OTT, TXT, MD, WordprocessingML (XML) Spreadsheets...Encrypted PDF DOM-Based : XML, HTML, XHTML, MHTML Compression &...