The following tables indicate the file formats from which GroupDocs.Parser for Java can extract data. You can use the input below To filter supported formats by extension.
Tip Can’t find your file format?
We’re here To help! Please post a request on our Free Support Forum, and our team will assist you. Word Processing Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode DOC Microsoft Word Document DOT Microsoft Word Document Template DOCX Office Open XML Document DOCM Office Open XML Macro-Enabled Document DOTX Office Open XML Document Template DOTM Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format PDF Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode PDF Portable Document Format File Markup Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode XHtml Extensible Hypertext Markup Language File MHtml MIME Html File MD Markdown (Formatted Text is Not supported) XML XML File Ebook Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode CHM Compiled Html Help File EPUB Digital E-Book File Format FB2 FictionBook 2....Hypertext Markup Language File MIME HTML File Markdown (Formatted Text...Contents Scan Barcode Compiled HTML Help File Digital E-Book File...
This page describes how To detect document file type, size and calculate pages count when annotate documents or images with GroupDocs.Annotation....all formats except Email and Html. Width and height are the same...
Programmatically compare two or more CSV files in Java. Learn To accept, reject and highlight the changes. Compare password-protected CSV files using Java API....generates separate CSV and HTML files. The HTML output file highlights...
Learn how To render XLSX as PDF using Python. This tuTorial explains how To convert XLSX To PDF in Python for secure and portable document output....tutorial on how to render PDF as HTML using Python and unlock new...
Learn how To extract Text from PDF using Python. This guide walks through setup and code needed To extract text from PDF in Python without installing extra software....for_html_view() to prepare view settings...options using ViewInfoOptions.for_html_view() . By setting extract_text...
This article provides a detailed guide on how To extract text from MHtml using C#. Moreover, it includes code example for efficient text extraction from MHtml in C#....from MHTML using C# MHTML (MIME HTML) files, a web archive format...
This article provides a guide on how To extract text from MHtml using Java, along with a sample code example for efficient text extraction from MHtml in Java....MHTML using Java MHTML (MIME HTML) files, a web archive format...