The following tables indicate the file formats from which GroupDocs.Parser for Java can extract data. You can use the input below to filter supported formats by extension.
Tip Can’t find your file format?
We’re here to help! Please post a request on our Free Support Forum, and our team will assist you. Word Processing Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode DOC Microsoft Word Document DOT Microsoft Word Document Template DOCX Office Open XML Document DOCM Office Open XML Macro-Enabled Document DOTX Office Open XML Document Template DOTM Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format Pdf Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode Pdf Portable Document Format File Markup Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode XHTML Extensible Hypertext Markup Language File MHTML MIME HTML File MD Markdown (Formatted Text is Not supported) XML XML File Ebook Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode CHM Compiled HTML Help File EPUB Digital E-Book File Format FB2 FictionBook 2....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition Ask AI...
This article gives the knowledge about the fuzzy search definition, and its capabilities in GroupDocs.Search using Java search API....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition Ask AI...
GroupDocs.Metadata for Java provides functionality that allows working with different kinds of presentations such as PPT, PPTX, POTM, POTX, etc. For the full list of supported presentation formats please refer to Supported document formats.
Detecting the exact type of a presentation The following sample of code will help you to detect the exact type of a loaded presentation and extract some additional file format information.
Load a presentation Extract the root metadata package Use the getPresentationType method to obtain file format information advanced_usage....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition Ask AI...
Reading matroska format-specific properties The GroupDocs.Metadata API supports extracting format-specific information from MKV files.
The following are the steps to read native MKV metadata.
Load an MKV video Get the root metadata package Extract the native metadata package using the MatroskaRootPackage.getMatroskaPackage method Read the Matroska metadata properties on different levels of the format structure advanced_usage.managing_metadata_for_specific_formats.video.matroska.MatroskaReadNativeMetadataProperties
try (Metadata metadata = new Metadata(Constants.InputMkv)) { MatroskaRootPackage root = metadata.getRootPackageGeneric(); // Read the EBML header System....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition Ask AI...
This article shows how to clear existing text or image watermarks....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition Ask AI...
Note GroupDocs.Parser is a feature-reach document data parsing API. Here you may find description of the most important features. Parse Document by Template GroupDocs.Parser allows to parse documents by user-defined templates.
It is easy to crate a template with data field definitions, table definitions. Then it’s easy to use the template (just pass the Template object to parseByTemplate(Template) method) and extract data such as prices, invoices, tables from your typical documents....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition Ask AI...
This is an introduction into edit document techniques explanation like main stages of document opening, editing and saving results within Java applications....Signature Product Solution GroupDocs...Us Contact Customers Legal Security Events Acquisition Ask AI...