The following tables indicate the file formats from which Groupdocs.Parser for Java can extract data. You can use the input below to filter supported formats by extension.
Tip Can’t find your file format?
We’re here to help! Please post a request on our Free Support Forum, and our team will assist you. Word Processing Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode DOC Microsoft Word Document DOT Microsoft Word Document Template DOCX Office Open XML Document DOCM Office Open XML Macro-Enabled Document DOTX Office Open XML Document Template DOTM Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format PDF Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode PDF Portable Document Format File Markup Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode XHTML Extensible Hypertext Markup Language File MHTML MIME HTML File MD Markdown (Formatted Text is Not supported) XML XML File Ebook Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode CHM Compiled HTML Help File EPUB Digital E-Book File Format FB2 FictionBook 2....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article offers guidance on how to add point annotation in PDF using C#. Additionally, it includes code and steps to insert point annotation to PDF in C#....Answers by API GroupDocs.Total Product Family GroupDocs.Conversion...Family GroupDocs.Annotation Product Family GroupDocs.Viewer Product...
Groupdocs.Metadata for Java provides functionality that allows working with different kinds of Presentations such as PPT, PPTX, POTM, POTX, etc. For the full list of supported Presentation formats please refer to Supported document formats.
Detecting the exact type of a Presentation The following sample of code will help you to detect the exact type of a loaded Presentation and extract some additional file format information.
Load a Presentation Extract the root metadata package Use the getPresentationType method to obtain file format information advanced_usage....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article provides step-by-step guide on how to add point annotation in PDF using Java. It also includes code example to insert point annotation to PDF in Java....Answers by API GroupDocs.Total Product Family GroupDocs.Conversion...Family GroupDocs.Annotation Product Family GroupDocs.Viewer Product...
Learn how to extract metadata from PowerPoint Presentations (.ppt, .pptx) using Groupdocs.Parser for .NET. Extract document properties like author, title, creation date, and comments from Presentation files....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
Groupdocs.Metadata for .NET provides functionality that allows working with different kinds of Presentations such as PPT, PPTX, POTM, POTX, etc. For the full list of supported Presentation formats please refer to Supported Document Formats.
Detecting the exact type of a Presentation The following sample of code will help you to detect the exact type of a loaded Presentation and extract some additional file format information.
Load a Presentation Extract the root metadata package Use the FileType property to obtain file format information AdvancedUsage....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article demonstrates how you can convert to and from project management formats with Groupdocs.Conversion for Java....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
To extract a text from Microsoft Office PowerPoint Presentations getText and getText(int) method is used. These methods allow to extract a text from the entire Presentation or a text from the selected slide.
Here are the steps to extract a text from Microsoft Office PowerPoint Presentations:
Instantiate Parser object for the initial Presentation; Call getText method and obtain TextReader object; Read a text from reader. Warning getText method returns null value if text extraction isn’t supported for the document....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...