The following tables indicate the file formats from which GroupDocs.Parser for Java can extract data.
Tip: Can’t find your file format? We’re here to help! Please post a request on our Free Support Forum, and our team will assist you.

Each table has the same columns:
Document Type
Parse Document by Template
Extract Text (Accurate)
Extract Text (Raw)
Extract Structured Text and Formatted Text
Extract Text Areas
Extract Metadata
Extract Images
Extract Containers and Attachments
Parse Form Data
Extract Table of Contents
Scan Barcode

Word Processing
DOC    Microsoft Word Document
DOT    Microsoft Word Document Template
DOCX   Office Open XML Document
DOCM   Office Open XML Macro-Enabled Document
DOTX   Office Open XML Document Template
DOTM   Office Open XML Document Macro-Enabled Template
TXT    Plain Text
ODT    Open Document Text
OTT    Open Document Text Template
RTF    Rich Text Format

PDF
PDF    Portable Document Format File

Markup
XHTML  Extensible HyperText Markup Language File
MHTML  MIME HTML File
MD     Markdown (formatted text is not supported)
XML    XML File

Ebook
CHM    Compiled HTML Help File
EPUB   Digital E-Book File Format
FB2    FictionBook 2...