GroupDocs.Parser extracts data from Microsoft Office Word documents. Both the classic (DOC, DOT) and Open XML (DOCX, DOTX) formats are supported, as are the LibreOffice Writer (OpenOffice.org Writer) formats and RTF.
The following table provides the list of supported formats:
Format  Description
DOC     Microsoft Office Word Document
DOT     Microsoft Office Word Document Template
DOCX    Microsoft Office Open XML Document
DOCM    Microsoft Office Open XML Macro-Enabled Document
DOTX    Microsoft Office Open XML Document Template
DOTM    Microsoft Office Open XML Document Macro-Enabled Template
TXT     Plain Text
ODT     Open Document Text
OTT     Open Document Text Template
RTF     Rich Text Format
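The supported formats listed above can be mirrored in a small lookup that checks a file name before it is handed to the parser. The following is a minimal sketch only: `WordFormats`, `describe`, and `isSupported` are hypothetical names introduced for illustration and are not part of the GroupDocs.Parser API, and the check assumes the file extension reflects the actual format.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;

// Hypothetical helper (not part of GroupDocs.Parser): maps a file extension
// to the format description given in the supported-formats table.
final class WordFormats {
    private static final Map<String, String> FORMATS = new LinkedHashMap<>();

    static {
        FORMATS.put("doc", "Microsoft Office Word Document");
        FORMATS.put("dot", "Microsoft Office Word Document Template");
        FORMATS.put("docx", "Microsoft Office Open XML Document");
        FORMATS.put("docm", "Microsoft Office Open XML Macro-Enabled Document");
        FORMATS.put("dotx", "Microsoft Office Open XML Document Template");
        FORMATS.put("dotm", "Microsoft Office Open XML Document Macro-Enabled Template");
        FORMATS.put("txt", "Plain Text");
        FORMATS.put("odt", "Open Document Text");
        FORMATS.put("ott", "Open Document Text Template");
        FORMATS.put("rtf", "Rich Text Format");
    }

    private WordFormats() {
    }

    // Returns the table description for the file's extension, or empty when
    // the name has no extension or the extension is not in the table.
    static Optional<String> describe(String fileName) {
        int dot = fileName.lastIndexOf('.');
        if (dot < 0 || dot == fileName.length() - 1) {
            return Optional.empty();
        }
        // Extensions are compared case-insensitively: "DOCX" and "docx" match.
        String extension = fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        return Optional.ofNullable(FORMATS.get(extension));
    }

    // True when the extension appears in the supported-formats table.
    static boolean isSupported(String fileName) {
        return describe(fileName).isPresent();
    }
}
```

A caller might use `WordFormats.isSupported(path)` as a cheap pre-check and skip files whose extension is not in the table. Because the check looks only at the file name, a mislabeled file would still need to be handled when it is opened.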