The following tables indicate the file formats from which GroupDocs.Parser for Java can extract data.
Tip: Can't find your file format? We're here to help! Please post a request on our Free Support Forum, and our team will assist you.

For each document type, the tables cover the following features: Parse Document by Template, Extract Text (Accurate), Extract Text (Raw), Extract Structured Text and Formatted Text, Extract Text Areas, Extract Metadata, Extract Images, Extract Containers and Attachments, Parse Form Data, Extract Table of Contents, and Scan Barcode.

Word Processing
DOC: Microsoft Word Document
DOT: Microsoft Word Document Template
DOCX: Office Open XML Document
DOCM: Office Open XML Macro-Enabled Document
DOTX: Office Open XML Document Template
DOTM: Office Open XML Document Macro-Enabled Template
TXT: Plain text
ODT: Open Document Text
OTT: Open Document Text Template
RTF: Rich Text Format

PDF
PDF: Portable Document Format File

Markup
XHTML: Extensible Hypertext Markup Language File
MHTML: MIME HTML File
MD: Markdown (formatted text is not supported)
XML: XML File

Ebook
CHM: Compiled HTML Help File
EPUB: Digital E-Book File Format
FB2: FictionBook 2...
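As a rough illustration of checking a file against the formats listed above before handing it to a parser, here is a minimal sketch in plain Java. It does not use the GroupDocs.Parser API: the class `SupportedFormats` and the method `categoryOf` are hypothetical names introduced for this example, and the lookup covers only the extensions that appear in this section.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;

// Hypothetical helper (not part of GroupDocs.Parser): maps a file extension
// to the category under which it is listed in this section.
final class SupportedFormats {

    // Extension (lower case) -> category name, copied from the lists above.
    private static final Map<String, String> CATEGORY_BY_EXTENSION = new LinkedHashMap<>();

    private static void register(String category, String... extensions) {
        for (String ext : extensions) {
            CATEGORY_BY_EXTENSION.put(ext, category);
        }
    }

    static {
        register("Word Processing",
                "doc", "dot", "docx", "docm", "dotx", "dotm", "txt", "odt", "ott", "rtf");
        register("PDF", "pdf");
        register("Markup", "xhtml", "mhtml", "md", "xml");
        register("Ebook", "chm", "epub", "fb2");
    }

    // Returns the category for a file name, or empty if the name has no
    // extension or the extension is not in the lists above.
    static Optional<String> categoryOf(String fileName) {
        int dot = fileName.lastIndexOf('.');
        if (dot < 0 || dot == fileName.length() - 1) {
            return Optional.empty();
        }
        String ext = fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        return Optional.ofNullable(CATEGORY_BY_EXTENSION.get(ext));
    }

    public static void main(String[] args) {
        for (String name : new String[] {"report.DOCX", "notes.md", "photo.xyz"}) {
            System.out.println(name + " -> " + categoryOf(name).orElse("not in this list"));
        }
    }
}
```

An extension match only indicates that the format is listed here; which features are available for a given format is determined by the tables, not by this lookup.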