The following tables indicate the file formats from which GroupDocs.Parser for Java can extract data. You can use the input below to filter supported formats by extension.
Tip Can’t find your file format?
We’re here to help! Please post a request on our Free Support Forum, and our team will assist you. Word Processing Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode DOC Microsoft Word Document DOT Microsoft Word Document Template DOCX Office Open XML Document DOCM Office Open XML Macro-Enabled Document DOTX Office Open XML Document Template DOTM Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format PDF Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode PDF Portable Document Format File Markup Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode XHTML Extensible Hypertext Markup Language File MHTML MIME HTML File MD Markdown (Formatted Text is Not supported) XML XML File Ebook Document Type Parse Document by Template Extract Text (Accurate) Extract Text (Raw) Extract Structured Text and Formatted Text Extract Text Areas Extract Metadata Extract Images Extract Containers and Attachments Parse Form Data Extract Table of Contents Scan Barcode CHM Compiled HTML Help File EPUB Digital E-Book File Format FB2 FictionBook 2....Document Scalar Vector Graphics file Scalar Vector Graphics file...
Learn how to exclude system pre-installed fonts from HTML markup to reduce rendered document size when rendering documents using GroupDocs.Viewer for .NET....Mobipocket (MOBI), AZW3) LaTeX (TEX) Vector images (SVG) Sometimes including...Microsoft OneNote ONE format Vector images with font support (SVG)...
In this article, you will learn how to convert Publisher formats to other formats with GroupDocs.Conversion for .NET....including text, raster and vector graphics. These can be used...
In this article, you will learn how to convert Publisher formats to other formats with GroupDocs.Conversion for Node.js via Java....including text, raster and vector graphics. These can be used...
In this article, you will learn how to convert Publisher formats to other formats with GroupDocs.Conversion for Java....including text, raster and vector graphics. These can be used...
Prepare PDF, Word, and Excel documents as Markdown for retrieval-augmented generation with LLMs...C["Markdown"] D["Text Chunking"] E["Vector Embeddings"] F["LLM Query"]...
Effectively grasp the understanding to render SVG as PDF using Java. Furthermore, explore detailed step-by-step instructions and Java code for rendering SVG to PDF....the conversion of Scalable Vector Graphics ( SVG ) to Portable...
Explore what’s new in GroupDocs.Comparison for Python via .NET 25.12. Available now on Pypi and GroupDocs website....) [Enhancement] Preserving vector graphics from the target PDF...
Let's convert XLSX to GIF using Python in a few easy steps without installing extra APIs. Also, provided the code to effortlessly export XLSX to GIF in Python....export Excel data into scalable vector images—perfect for professional...