The following tables indicate the file formats from which GroupDocs.Parser for Java can extract data. You can use the input below to filter supported formats by extension.
Tip: Can’t find your file format? We’re here to help! Please post a request on our Free Support Forum, and our team will assist you.

Each table has the following columns: Document Type, Parse Document by Template, Extract Text (Accurate), Extract Text (Raw), Extract Structured Text and Formatted Text, Extract Text Areas, Extract Metadata, Extract Images, Extract Containers and Attachments, Parse Form Data, Extract Table of Contents, Scan Barcode.

Word Processing
DOC: Microsoft Word Document
DOT: Microsoft Word Document Template
DOCX: Office Open XML Document
DOCM: Office Open XML Macro-Enabled Document
DOTX: Office Open XML Document Template
DOTM: Office Open XML Document Macro-Enabled Template
TXT: Plain text
ODT: Open Document Text
OTT: Open Document Text Template
RTF: Rich Text Format

PDF
PDF: Portable Document Format File

Markup
XHTML: Extensible Hypertext Markup Language File
MHTML: MIME HTML File
MD: Markdown (Formatted Text is not supported)
XML: XML File

Ebook
CHM: Compiled HTML Help File
EPUB: Digital E-Book File Format
FB2: FictionBook 2....
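Filtering the supported formats by extension can be illustrated with a minimal sketch. It is only an illustration: the `FormatFilter` class and its `filterByExtension` method are hypothetical names, not part of the GroupDocs.Parser for Java API, and the map holds a hand-copied subset of the formats listed above.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper, NOT part of GroupDocs.Parser: filters a hand-copied
// subset of the formats listed above by extension prefix.
public class FormatFilter {
    // Extension -> description, copied from the format lists (subset only).
    static final Map<String, String> FORMATS = new LinkedHashMap<>();
    static {
        FORMATS.put("DOC", "Microsoft Word Document");
        FORMATS.put("DOT", "Microsoft Word Document Template");
        FORMATS.put("DOCX", "Office Open XML Document");
        FORMATS.put("TXT", "Plain text");
        FORMATS.put("ODT", "Open Document Text");
        FORMATS.put("RTF", "Rich Text Format");
        FORMATS.put("PDF", "Portable Document Format File");
        FORMATS.put("XML", "XML File");
        FORMATS.put("EPUB", "Digital E-Book File Format");
    }

    // Returns the extensions that start with the query,
    // ignoring case and an optional leading dot.
    static List<String> filterByExtension(String query) {
        String q = query.trim().toUpperCase(Locale.ROOT);
        if (q.startsWith(".")) {
            q = q.substring(1);
        }
        List<String> matches = new ArrayList<>();
        for (String ext : FORMATS.keySet()) {
            if (ext.startsWith(q)) {
                matches.add(ext);
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        // Print every listed format whose extension starts with "do".
        for (String ext : filterByExtension(".do")) {
            System.out.println(ext + " - " + FORMATS.get(ext));
        }
    }
}
```

A prefix match is used here so that a partial query such as `.do` still narrows the list; an exact-match lookup would be `FORMATS.containsKey(...)` instead.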
This article demonstrates how to edit MOBI files using the C# programming language. ... paginal mode in the resultant HTML document. ... allows to enable exporting language information in HTML. This is very useful for books...
GroupDocs Blog | Document Automation Solutions for .NET & Java Developers. Recent content on GroupDocs Blog. GroupDocs.Total for .NET 25.7 – ... scanning results in respective HTML files. Efficient Text Search ... scanning results in respective HTML files. Metadata Scrubbing - ...
We are delighted to announce the release of GroupDocs.Assembly for .NET 19.1. The latest version allows you to dynamically merge table cells that have equal textual contents within your reports. Furthermore, we have introduced optional comments within the syntax tags, which can provide a human-readable explanation. For more details on the latest version, please see the release notes.
Features Introduced
Merging Table Cells Dynamically
Using the latest version, you can tell the API to merge the table cells that have equal textual contents. ... Processing Documents Emails with HTML and RTF Bodies. For more details...
The GroupDocs team is pleased to release the GroupDocs.Text for .NET API, which allows users to extract text from files and documents of various formats. The API offers simple syntax and easy-to-use methods, so text extraction operations take only a few lines of code.
Why Was GroupDocs.Text Developed?
Suppose you are developing a text search or text analysis system. Wouldn’t it be great if your system could read or analyze a document even if no document reader is installed on your system? ... (XLS/XLSX/XLSM/XLSB/CSV/ODS), TXT, HTML, MHTML. For more details on supported...
I am currently using GroupDocs.Viewer for .NET to render and view email (msg) files. However, I have encountered an issue: when viewing emails, any hyperlinks within the email body appear as plain text and are not clicka… ... render the input MSG to the HTML format using HtmlViewOptions...
In this tutorial, you will learn how to convert PDF to Word in C#. We will use a simple console application to convert PDF to DOC using C#. ... In addition, EPUB, XLSX, PPTX, HTML, XML, and a number of other formats...
This article explains the basics of creating a plugin for the Contao CMS using the Contao Viewer plugin as an example. You can download the completed plugin from Contao.
Requirements: Contao CMS, PHP 5.3, JavaScript.
Preparation
To create a GroupDocs Viewer plugin for Contao CMS:
Install Contao CMS. The installation of this CMS is simple.
To allow GroupDocs to appear, go to Admin > Setting > Security settings > Allowed HTML tags and just add at the end...
Functionality
Create three folders: html, config and dca. Place an...
You will learn how to convert PDF to Word in C# in this tutorial. We'll utilize a simple console application for PDF to DOC conversion using C#. ... addition, EPUB, XLSX, PPTX, HTML, XML, and a variety of other...