GroupDocs.Parser provides the functionality to extract data from Microsoft Office Word documents. Both classic (doc, dot) and Open XML (docx, dotx) formats are supported. Also LibreOffice Writer (OpenOffice.org Writer) formats and RTF are supported.
The following table provides the list of supported formats:
Format Description DOC Microsoft Office Word Document DOT Microsoft Office Word Document Template DOCX Microsoft Office Open XML Document DOCM Microsoft Office Open XML Macro-Enabled Document DOTX Microsoft Office Open XML Document Template DOTM Microsoft Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...Purchase Buy Now Pricing Information Free Trials Temporary License...usage / Extract data from various formats / Extract data from...
The following tables indicate the file formats from which GroupDocs.Parser for Java can extract data. You can use the input below to filter supported formats by extension.
Tip Can’t find your file format?
We’re here to help! Please post a request on our Free Support Forum, and our team will assist you. Word Processing Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode DOC Microsoft Word Document DOT Microsoft Word Document Template DOCX Office Open XML Document DOCM Office Open XML Macro-Enabled Document DOTX Office Open XML Document Template DOTM Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format PDF Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode PDF Portable Document Format File Markup Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode XHTML Extensible Hypertext Markup Language File MHTML MIME HTML File MD Markdown (Formatted Text is Not supported) XML XML File Ebook Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode CHM Compiled HTML Help File EPUB Digital E-Book File Format FB2 FictionBook 2....Purchase Buy Now Pricing Information Free Trials Temporary License...GroupDocs.Parser for Java can extract data. You can use the input...
We offer multiple solutions on how you can run GroupDocs.Editor examples, by building your own or using our back-end or front-end Java examples out-of-the-box....Purchase Buy Now Pricing Information Free Trials Temporary License...file, extract the folders on your local disk. The extracted files...
Download from GitHub The complete examples package of GroupDocs.Parser is hosted on GitHub. You can either download the ZIP file from here or clone the repository of GitHub using your favorite GitHub client.
In case you download the ZIP file, extract the folders on your local disk. The extracted files and folders will look like following image:
Open IntelliJ Idea and import the project to get started with it. Archived project file can also be imported....Purchase Buy Now Pricing Information Free Trials Temporary License...file, extract the folders on your local disk. The extracted files...
This section decsribes how to get started with GroupDocs.Merger for Java library...Purchase Buy Now Pricing Information Free Trials Temporary License...file, extract the folders on your local disk. The extracted files...
Quickly learn the workflow to extract images from Excel in Java and develop the parsing application to get images from Excel using Java in a few lines of code....Redaction Product Family How to Extract Images from Excel in Java This...the stepwise instructions to extract images from Excel in Java and...
Get familiar yourself with the process to add polyline annotation using Java and easily consume APIs to develop a capability to add polyline annotation in Java....had published an article on extracting annotations from PDF documents...documents in Java, see how to Extract Annotations from PDF in Java...
This article contains a good example of numeric flag or enumeration in Node.js via Java....Purchase Buy Now Pricing Information Free Trials Temporary License...interpreters to metadata properties extracted from various formats. To get...
It supports DOCX, DOCM, DOC, DOT, DOTM, XLS, XLSX, PDF, PPT, JPG, PNG, HTML, EML and many more...Purchase Buy Now Pricing Information Free Trials Temporary License...GroupDocs.Parser for .NET can extract data. You can use the input...
In this article, we will guide you on how to render DOCX as PNG using C# without installing extra software. We will also provide code to render DOCX to PNG using C#....DOCX as PNG allows you to extract specific images, graphics,...as PNG using C# With the information in this article, you can...