GroupDocs.Parser can extract data from Microsoft Office Word documents. Both the classic (DOC, DOT) and Open XML (DOCX, DOTX) formats are supported, along with LibreOffice Writer (OpenOffice.org Writer) formats and RTF.
The following table provides the list of supported formats:
| Format | Description |
|--------|-------------|
| DOC | Microsoft Office Word Document |
| DOT | Microsoft Office Word Document Template |
| DOCX | Microsoft Office Open XML Document |
| DOCM | Microsoft Office Open XML Macro-Enabled Document |
| DOTX | Microsoft Office Open XML Document Template |
| DOTM | Microsoft Office Open XML Document Macro-Enabled Template |
| TXT | Plain Text |
| ODT | Open Document Text |
| OTT | Open Document Text Template |
| RTF | Rich Text Format |
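As a rough illustration of how the list above might be used, the following sketch checks a file name against the formats in the table before handing the file to an extractor. The names `SUPPORTED_FORMATS`, `describe_format`, and `is_supported` are hypothetical helpers invented for this example; they are not part of the GroupDocs.Parser API, and the check looks only at the file extension, not at the file contents.

```python
from pathlib import Path
from typing import Optional

# Extensions and descriptions copied from the table of supported formats.
# This mapping is an illustrative assumption, not a GroupDocs.Parser structure.
SUPPORTED_FORMATS = {
    ".doc": "Microsoft Office Word Document",
    ".dot": "Microsoft Office Word Document Template",
    ".docx": "Microsoft Office Open XML Document",
    ".docm": "Microsoft Office Open XML Macro-Enabled Document",
    ".dotx": "Microsoft Office Open XML Document Template",
    ".dotm": "Microsoft Office Open XML Document Macro-Enabled Template",
    ".txt": "Plain Text",
    ".odt": "Open Document Text",
    ".ott": "Open Document Text Template",
    ".rtf": "Rich Text Format",
}


def describe_format(file_name: str) -> Optional[str]:
    """Return the table description for the file's extension, or None."""
    # Path.suffix yields the last extension, e.g. ".docx"; lower-case it so
    # "REPORT.DOCX" and "report.docx" are treated the same way.
    return SUPPORTED_FORMATS.get(Path(file_name).suffix.lower())


def is_supported(file_name: str) -> bool:
    """Return True when the extension appears in the table above."""
    return describe_format(file_name) is not None
```

A caller could run `is_supported("contract.docx")` before opening a file and skip anything whose extension is not in this list. Because the check is purely name-based, a renamed or corrupted file would still pass it.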