The following tables indicate the file formats from which GroupDocs.Parser for Java can extract data.
Tip: Can't find your file format? We're here to help! Please post a request on our Free Support Forum, and our team will assist you.

Each table lists document types against the following feature columns:
Parse Document by Template
Extract Text (Accurate)
Extract Text (Raw)
Extract Structured Text and Formatted Text
Extract Text Areas
Extract Metadata
Extract Images
Extract Containers and Attachments
Parse Form Data
Extract Table of Contents
Scan Barcode

Word Processing
DOC - Microsoft Word Document
DOT - Microsoft Word Document Template
DOCX - Office Open XML Document
DOCM - Office Open XML Macro-Enabled Document
DOTX - Office Open XML Document Template
DOTM - Office Open XML Document Macro-Enabled Template
TXT - Plain text
ODT - Open Document Text
OTT - Open Document Text Template
RTF - Rich Text Format

PDF
PDF - Portable Document Format File

Markup
XHTML - Extensible Hypertext Markup Language File
MHTML - MIME HTML File
MD - Markdown (Formatted Text is not supported)
XML - XML File

Ebook
CHM - Compiled HTML Help File
EPUB - Digital E-Book File Format
FB2 - FictionBook 2...
This article describes how to insert an edited presentation slide into an existing PowerPoint presentation. ... method. Emitting HTML and CSS markup, which represents a content ...
Let's quickly walk through how to convert CSV to HTML using Java. We will cover the essential steps and provide a code example to export CSV to HTML using Java. ... HTML serves as the standard markup language for constructing web ...
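As a rough illustration of what a CSV-to-HTML export involves, here is a minimal sketch that uses only the Java standard library. It is not the API the article above relies on, which this listing does not show. The class name `CsvToHtml` and its methods are hypothetical, and the parser assumes comma-separated fields with optional double quotes and no line breaks inside quoted fields.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, standard library only: turns CSV text into an HTML table.
public class CsvToHtml {

    // Escape characters that have special meaning in HTML.
    public static String escape(String s) {
        return s.replace("&", "&amp;")
                .replace("<", "&lt;")
                .replace(">", "&gt;")
                .replace("\"", "&quot;");
    }

    // Split one CSV line into fields, honoring quoted fields and "" escapes.
    // Assumption: a quoted field never spans more than one line.
    public static List<String> splitLine(String line) {
        List<String> fields = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        boolean inQuotes = false;
        for (int i = 0; i < line.length(); i++) {
            char c = line.charAt(i);
            if (inQuotes) {
                if (c == '"' && i + 1 < line.length() && line.charAt(i + 1) == '"') {
                    current.append('"'); // doubled quote inside a quoted field
                    i++;
                } else if (c == '"') {
                    inQuotes = false;    // closing quote
                } else {
                    current.append(c);
                }
            } else if (c == '"') {
                inQuotes = true;         // opening quote
            } else if (c == ',') {
                fields.add(current.toString());
                current.setLength(0);
            } else {
                current.append(c);
            }
        }
        fields.add(current.toString());
        return fields;
    }

    // Render the CSV as a table; the first non-empty line becomes the header row.
    public static String toHtmlTable(String csv) {
        StringBuilder html = new StringBuilder("<table>\n");
        boolean header = true;
        for (String line : csv.split("\\r?\\n")) {
            if (line.isEmpty()) {
                continue; // skip blank lines
            }
            String tag = header ? "th" : "td";
            header = false;
            html.append("  <tr>");
            for (String field : splitLine(line)) {
                html.append('<').append(tag).append('>')
                    .append(escape(field))
                    .append("</").append(tag).append('>');
            }
            html.append("</tr>\n");
        }
        return html.append("</table>\n").toString();
    }

    public static void main(String[] args) {
        System.out.print(toHtmlTable("name,note\nAda,\"x < y, z\"\n"));
    }
}
```

Escaping each field before writing it keeps characters such as `<` or `&` in the data from being interpreted as markup. A dedicated conversion library would typically also handle styling, encodings, and multi-line fields, which this sketch leaves out.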
This article explains how to convert HTML to PPTX using C#. It also provides a code example that illustrates how to export HTML to PPTX in C#. ... using C#. HTML is a popular markup language used to create web ...
In this article, we will explain in detail how to convert DOCX to HTML using Node.js. You will also be provided with a code example to export DOCX to HTML in Node.js. ... formats. HTML, or HyperText Markup Language, stands as the backbone ...
If you want to learn how to render HTML as PDF using Python, this topic is for you. Learn how to convert HTML to PDF in Python for printing, sharing, or archiving. ... to PDF in Python, allowing markup-based files to be transformed ...
Let's quickly look at how to convert CSV to HTML using C#. We will provide the essential instructions and a code example to export CSV to HTML using C#. ... using C#. HTML is the standard markup language for building web pages ...
Quickly learn how to render HTML as PNG using Python. You will get the code and steps to convert HTML to PNG in Python with custom dimensions for the visual output. ... not only reflect the original markup and design intent but also ...
Let's convert HTML to TXT using Python with clean extraction. Easily export HTML to TXT in Python to capture readable text for indexing, logs, and lightweight storage. ... need readable content without markup, images, or complex layout ...