The following tables indicate the file formats from which GroupDocs.Parser for Java can extract data. You can use the input below to filter supported formats by extension.
Tip: Can’t find your file format? We’re here to help! Please post a request on our Free Support Forum, and our team will assist you.

Each table below lists the document type against these features: Parse Document by Template, Extract Text (Accurate), Extract Text (Raw), Extract Structured Text and Formatted Text, Extract Text Areas, Extract Metadata, Extract Images, Extract Containers and Attachments, Parse Form Data, Extract Table of Contents, Scan Barcode.

Word Processing
  DOC    Microsoft Word Document
  DOT    Microsoft Word Document Template
  DOCX   Office Open XML Document
  DOCM   Office Open XML Macro-Enabled Document
  DOTX   Office Open XML Document Template
  DOTM   Office Open XML Document Macro-Enabled Template
  TXT    Plain text
  ODT    Open Document Text
  OTT    Open Document Text Template
  RTF    Rich Text Format

PDF
  PDF    Portable Document Format File

Markup
  XHTML  Extensible Hypertext Markup Language File
  MHTML  MIME HTML File
  MD     Markdown (Formatted Text is not supported)
  XML    XML File

Ebook
  CHM    Compiled HTML Help File
  EPUB   Digital E-Book File Format
  FB2    FictionBook 2...
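The filter-by-extension behaviour mentioned above can be sketched as a plain lookup over the listed formats. This is only an illustration using the Java standard library, not part of GroupDocs.Parser: the class name `SupportedFormatFilter` and the method `filterByExtension` are hypothetical, and the prefix-matching rule is an assumption about how such a filter might behave. The map holds only the extensions and descriptions that appear in the listing; FB2 is left out because its description is truncated in the source.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper, not a GroupDocs.Parser class.
final class SupportedFormatFilter {

    // Extension -> description, copied from the listing (insertion order preserved).
    static final Map<String, String> FORMATS = new LinkedHashMap<>();
    static {
        FORMATS.put("DOC", "Microsoft Word Document");
        FORMATS.put("DOT", "Microsoft Word Document Template");
        FORMATS.put("DOCX", "Office Open XML Document");
        FORMATS.put("DOCM", "Office Open XML Macro-Enabled Document");
        FORMATS.put("DOTX", "Office Open XML Document Template");
        FORMATS.put("DOTM", "Office Open XML Document Macro-Enabled Template");
        FORMATS.put("TXT", "Plain text");
        FORMATS.put("ODT", "Open Document Text");
        FORMATS.put("OTT", "Open Document Text Template");
        FORMATS.put("RTF", "Rich Text Format");
        FORMATS.put("PDF", "Portable Document Format File");
        FORMATS.put("XHTML", "Extensible Hypertext Markup Language File");
        FORMATS.put("MHTML", "MIME HTML File");
        FORMATS.put("MD", "Markdown");
        FORMATS.put("XML", "XML File");
        FORMATS.put("CHM", "Compiled HTML Help File");
        FORMATS.put("EPUB", "Digital E-Book File Format");
    }

    // Returns the formats whose extension starts with the query.
    // Assumed rule: case-insensitive prefix match, an optional leading dot is ignored.
    static Map<String, String> filterByExtension(String query) {
        String prefix = query.trim().toUpperCase(Locale.ROOT);
        if (prefix.startsWith(".")) {
            prefix = prefix.substring(1);
        }
        Map<String, String> matches = new LinkedHashMap<>();
        for (Map.Entry<String, String> entry : FORMATS.entrySet()) {
            if (entry.getKey().startsWith(prefix)) {
                matches.put(entry.getKey(), entry.getValue());
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        // Lists the six Word formats whose extension begins with "DO".
        filterByExtension(".do").forEach((ext, name) -> System.out.println(ext + "  " + name));
    }
}
```

A prefix match was chosen so that partial input narrows the list as it is typed; an exact-match lookup would be an equally reasonable choice.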
In this article, you will learn how to get default load options for a source format with GroupDocs.Conversion for Java API.
Reading Matroska format-specific properties
The GroupDocs.Metadata API supports extracting format-specific information from MKV files.
The following are the steps to read native MKV metadata.
1. Load an MKV video
2. Get the root metadata package
3. Extract the native metadata package using the MatroskaRootPackage.getMatroskaPackage method
4. Read the Matroska metadata properties on different levels of the format structure
advanced_usage.managing_metadata_for_specific_formats.video.matroska.MatroskaReadNativeMetadataProperties
try (Metadata metadata = new Metadata(Constants.InputMkv)) {
    MatroskaRootPackage root = metadata.getRootPackageGeneric();
    // Read the EBML header
    System....
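For context on the "EBML header" that the snippet above starts to read: a Matroska file normally begins with an EBML header element, whose element ID is the four bytes 0x1A 0x45 0xDF 0xA3. The sketch below checks only for that ID using the Java standard library. It does not use GroupDocs.Metadata and is not a substitute for it; the class name `EbmlMagicCheck` and the method `startsWithEbmlId` are hypothetical names introduced for illustration.

```java
// Hypothetical helper, independent of GroupDocs.Metadata.
final class EbmlMagicCheck {

    // Element ID of the EBML header, the first element of a Matroska file.
    private static final byte[] EBML_ID = {(byte) 0x1A, (byte) 0x45, (byte) 0xDF, (byte) 0xA3};

    // Returns true if the given bytes begin with the EBML header element ID.
    static boolean startsWithEbmlId(byte[] data) {
        if (data == null || data.length < EBML_ID.length) {
            return false;
        }
        for (int i = 0; i < EBML_ID.length; i++) {
            if (data[i] != EBML_ID[i]) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // In-memory stand-ins for the first bytes of two files.
        byte[] mkvLike = {(byte) 0x1A, (byte) 0x45, (byte) 0xDF, (byte) 0xA3, 0x01};
        byte[] riffLike = {0x52, 0x49, 0x46, 0x46}; // "RIFF", e.g. an AVI container

        System.out.println(startsWithEbmlId(mkvLike));  // prints "true"
        System.out.println(startsWithEbmlId(riffLike)); // prints "false"
    }
}
```

A check like this only confirms the container signature; reading DocType, versions, segment information, or tags requires parsing the EBML structure itself, which is the part the library handles.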
This page contains descriptions of all character types. Character types differ in how characters of these types are indexed.
In this article, you will learn how to apply a watermark to document pages when converting a document with GroupDocs.Conversion for Node.js via Java API.
This page describes how to edit annotation properties, such as position, size, and appearance, when annotating documents with GroupDocs.Annotation for Java.
Extracted data is stored in an instance of the DocumentData class.
This article explains how to get a list of indexed documents from an index, and how to get the text of indexed documents in HTML or plain text format.
Following this guide, you will learn how to obtain basic document metadata like page count, size, and file type before editing it with GroupDocs.Editor for Node.js via Java API.