The following tables indicate the file formats from which GroupDocs.Parser for Java can extract data.
Tip: Can't find your file format? We're here to help! Please post a request on our Free Support Forum, and our team will assist you.

For each document type, the tables cover the following features: Parse Document by Template, Extract Text (Accurate), Extract Text (Raw), Extract Structured Text and Formatted Text, Extract Text Areas, Extract Metadata, Extract Images, Extract Containers and Attachments, Parse Form Data, Extract Table of Contents, Scan Barcode.

Word Processing
DOC: Microsoft Word Document
DOT: Microsoft Word Document Template
DOCX: Office Open XML Document
DOCM: Office Open XML Macro-Enabled Document
DOTX: Office Open XML Document Template
DOTM: Office Open XML Document Macro-Enabled Template
TXT: Plain text
ODT: Open Document Text
OTT: Open Document Text Template
RTF: Rich Text Format

PDF
PDF: Portable Document Format File

Markup
XHTML: Extensible Hypertext Markup Language File
MHTML: MIME HTML File
MD: Markdown (formatted text is not supported)
XML: XML File

Ebook
CHM: Compiled HTML Help File
EPUB: Digital E-Book File Format
FB2: FictionBook 2...
This article explains the main principles and stages of editing documents programmatically with the GroupDocs.Editor for .NET API. ...document of some format (like DOCX, PDF, or XLSX, for example). At this...
Full list of document formats GroupDocs.Conversion for Python via .NET can load and save: 20+ format families including Word Processing, Spreadsheet, Presentation, PDF, Image, CAD, Email, eBook, Web, and more.
It supports DOCX, DOCM, DOC, DOT, DOTM, XLS, XLSX, PDF, PPT, JPG, PNG, HTML, EML, and many more.
GroupDocs.Comparison for Python via .NET is AI agent and LLM friendly: machine-readable documentation, an MCP server, AGENTS.md shipped inside the pip package, and runnable code examples for AI-driven document review and change-detection pipelines. ...and today, between a reviewed PDF and the latest export. GroupDocs...
This article explains how to detect the document file type and calculate the number of pages when converting a file with GroupDocs.Conversion for Node.js via Java.
Read this article to learn how to load and convert PST/OST documents with advanced options using the GroupDocs.Conversion for .NET API.