C# .NET document parsing API to extract text, images, metadata & encoding from databases, PDF, Word, Excel, presentations, web, email, EPUB & zip file formats....: UTF32 LE, UTF32 BE, UTF16 LE, UTF16 BE, UTF8, and UTF7 Content...: UTF32 LE, UTF32 BE, UTF16 LE, UTF16 BE, UTF8, and ANSI Text...