The following tables indicate the file formats from which GroupDocs.Parser for Java can extract data. You can use the input below to filter supported formats by extension.
Tip Can’t find your file format?
We’re here to help! Please post a request on our Free Support Forum, and our team will assist you.

Columns in each table: Document Type, Parse Document by Template, Extract Text (Accurate), Extract Text (Raw), Extract Structured Text and Formatted Text, Extract Text Areas, Extract Metadata, Extract Images, Extract Containers and Attachments, Parse Form Data, Extract Table of Contents, Scan Barcode.

Word Processing
DOC: Microsoft Word Document
DOT: Microsoft Word Document Template
DOCX: Office Open XML Document
DOCM: Office Open XML Macro-Enabled Document
DOTX: Office Open XML Document Template
DOTM: Office Open XML Document Macro-Enabled Template
TXT: Plain text
ODT: Open Document Text
OTT: Open Document Text Template
RTF: Rich Text Format

PDF
PDF: Portable Document Format File

Markup
XHTML: Extensible Hypertext Markup Language File
MHTML: MIME HTML File
MD: Markdown (Formatted Text is not supported)
XML: XML File

Ebook
CHM: Compiled HTML Help File
EPUB: Digital E-Book File Format
FB2: FictionBook 2....
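Filtering the listed formats by extension can be sketched in plain Java. This is only an illustration: `FormatLookup` and `describe` are hypothetical names, not part of GroupDocs.Parser, and the map holds only the extensions and descriptions that appear in full in the tables above. A hit means the format is listed, not that every feature is available for it.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;

// Hypothetical helper for illustration; it is not a GroupDocs.Parser class.
final class FormatLookup {
    // Extension -> description, copied from the tables above.
    private static final Map<String, String> FORMATS = new LinkedHashMap<>();

    static {
        FORMATS.put("DOC", "Microsoft Word Document");
        FORMATS.put("DOT", "Microsoft Word Document Template");
        FORMATS.put("DOCX", "Office Open XML Document");
        FORMATS.put("DOCM", "Office Open XML Macro-Enabled Document");
        FORMATS.put("DOTX", "Office Open XML Document Template");
        FORMATS.put("DOTM", "Office Open XML Document Macro-Enabled Template");
        FORMATS.put("TXT", "Plain text");
        FORMATS.put("ODT", "Open Document Text");
        FORMATS.put("OTT", "Open Document Text Template");
        FORMATS.put("RTF", "Rich Text Format");
        FORMATS.put("PDF", "Portable Document Format File");
        FORMATS.put("XHTML", "Extensible Hypertext Markup Language File");
        FORMATS.put("MHTML", "MIME HTML File");
        FORMATS.put("MD", "Markdown");
        FORMATS.put("XML", "XML File");
        FORMATS.put("CHM", "Compiled HTML Help File");
        FORMATS.put("EPUB", "Digital E-Book File Format");
    }

    // Accepts a file name ("report.docx") or a bare extension ("docx"), case-insensitively.
    static Optional<String> describe(String fileNameOrExtension) {
        if (fileNameOrExtension == null) {
            return Optional.empty();
        }
        int dot = fileNameOrExtension.lastIndexOf('.');
        String ext = (dot >= 0 ? fileNameOrExtension.substring(dot + 1) : fileNameOrExtension)
                .trim()
                .toUpperCase(Locale.ROOT);
        return Optional.ofNullable(FORMATS.get(ext));
    }

    public static void main(String[] args) {
        System.out.println(describe("report.docx").orElse("not listed"));
        System.out.println(describe("archive.zip").orElse("not listed"));
    }
}
```

A check like this could run before a file is handed to the parser, so that formats missing from the tables are rejected early.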
It supports DOCX, DOCM, DOC, DOT, DOTM, XLS, XLSX, PDF, PPT, JPG, PNG, HTML, EML and many more...
Convert text files to HTML, PDF, PNG, or JPEG using the GroupDocs.Viewer Python API. Learn how to render various file formats....
Split worksheets into pages when rendering spreadsheets to HTML, PDF, and image formats using GroupDocs.Viewer Python API....
Note: In this article, we will use GroupDocs.Assembly to generate a Numbered List report in Word Processing Document format based on the use case: Working with a Business Case.
Note: The code uses some of the objects defined in The Business Layer.
Numbered List in Microsoft Word Document
Creating a Numbered List
By following the steps below, you can create a Numbered List template in MS Word 2013.
In your document, write a sentence like “We provide support for the following products:”....
This article provides step-by-step instructions on merging PPTX presentations using GroupDocs.Merger for Python via .NET.... Create a new Merger instance with the first...
Note: GroupDocs.Parser is a feature-rich document data parsing API. Here you can find descriptions of its most important features. Parse Document by Template: GroupDocs.Parser lets you parse documents by user-defined templates.
It is easy to create a template with data field definitions and table definitions. You can then use the template (just pass the Template object to the parseByTemplate(Template) method) to extract data such as prices, invoices, and tables from your typical documents....
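The idea of a template built from field definitions can be illustrated with a plain-Java sketch. It does not use the GroupDocs.Parser API: `TemplateSketch`, `extract` and `invoiceTemplate` are hypothetical names, and regular expressions stand in for the library's field definitions. Each field is a name paired with a pattern that has one capture group.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Conceptual illustration only; not the GroupDocs.Parser Template class.
final class TemplateSketch {

    // Applies every field definition to the text and collects the first match of each.
    static Map<String, String> extract(Map<String, Pattern> template, String text) {
        Map<String, String> result = new LinkedHashMap<>();
        for (Map.Entry<String, Pattern> field : template.entrySet()) {
            Matcher matcher = field.getValue().matcher(text);
            if (matcher.find()) {
                result.put(field.getKey(), matcher.group(1));
            }
        }
        return result;
    }

    // A hypothetical invoice template with two field definitions.
    static Map<String, Pattern> invoiceTemplate() {
        Map<String, Pattern> template = new LinkedHashMap<>();
        template.put("InvoiceNumber", Pattern.compile("Invoice\\s+No\\.?\\s*:?\\s*(\\S+)"));
        template.put("Total", Pattern.compile("Total\\s*:?\\s*\\$?([0-9]+(?:\\.[0-9]{2})?)"));
        return template;
    }

    public static void main(String[] args) {
        String text = "Invoice No: INV-1001\nTotal: $249.50";
        // prints {InvoiceNumber=INV-1001, Total=249.50}
        System.out.println(extract(invoiceTemplate(), text));
    }
}
```

The same template object can be reused across many documents with the same layout, which is the main appeal of the template approach.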
GroupDocs.Editor for .NET is a C# API that lets you edit Microsoft Office documents such as Word, Excel, or PowerPoint, or PDF files, without third-party software installed....
Hi Team,
We are using:
import com.groupdocs.merger.Merger;
import com.groupdocs.merger.domain.options.JoinOptions;
with the code below to merge two PDF documents:
Map<String, String> map1 = null;
ByteArrayO…...resulting document: Merger merger = new Merger(firstPdfInputStream);...