Supported File Formats The following tables indicate the file formats from which GroupDocs.Parser for Java can extract data.
Word Processing Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode DOC Microsoft Word Document DOT Microsoft Word Document Template DOCX Office Open XML Document DOCM Office Open XML Macro-Enabled Document DOTX Office Open XML Document Template DOTM Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format PDF Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode PDF Portable Document Format File Markup Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode XHTML Extensible Hypertext Markup Language File MHTML MIME HTML File MD Markdown (Formatted Text is Not supported) XML XML File Ebook Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode CHM Compiled HTML Help File EPUB Digital E-Book File Format FB2 FictionBook 2....Editor Product Solution GroupDocs...Events Acquisition GroupDocs Documentation / GroupDocs.Parser Product...
This Documentation section explains features of EditableDocument class when editing Document with GroupDocs.Editor for Java API....Editor Product Solution GroupDocs...Acquisition GroupDocs Documentation / GroupDocs.Editor Product Family...
Add watermark to PDF, images and Documents. Watermarking Solution for Microsoft Office, PDF, OpenDocument, Images and etc....Documents Watermark Solution Add text and image watermarks for...for your documents and images. Search and modify document watermarks...
This article explains how to create instance of the EditableDocument class from HTML files from disk or from HTML markup with resources using GroupDocs.Editor for .NET API....Editor Product Solution GroupDocs...Acquisition GroupDocs Documentation / GroupDocs.Editor Product Family...
Compare and merge more than two TXT files in C# .NET applications. Retrieve differences summary in content, text & style of TXT files, images and Document formats....NET documents comparison API to detect the...files and export to a final document with a detailed summary of...
This article gives the knowledge of the API methods which can be used to perform operations about Document passwords or password dictionary....Editor Product Solution GroupDocs...Events Acquisition GroupDocs Documentation / GroupDocs.Search Product...
Document attributes is a special feature designed for marking indexed Documents with text labels without the need for re-indexing....Editor Product Solution GroupDocs...Events Acquisition GroupDocs Documentation / GroupDocs.Search Product...