GroupDocs.Parser provides the functionality to extract data from Microsoft Office Word Documents. Both classic (doc, dot) and Open XML (docx, dotx) formats are supported. Also LibreOffice Writer (OpenOffice.org Writer) formats and RTF are supported.
The following table provides the list of supported formats:
Format Description DOC Microsoft Office Word Document DOT Microsoft Office Word Document Template DOCX Microsoft Office Open XML Document DOCM Microsoft Office Open XML Macro-Enabled Document DOTX Microsoft Office Open XML Document Template DOTM Microsoft Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...Editor Product Solution GroupDocs...Acquisition Ask AI GroupDocs Documentation / GroupDocs.Parser Product...
This article explains how to rearrange Document pages for PDF, Word, Excel, PowerPoint and many other file types using GroupDocs.Merger for .NET....Editor Product Solution GroupDocs...Acquisition Ask AI GroupDocs Documentation / GroupDocs.Merger Product...
GroupDocs.Merger for Python via .NET is a powerful API to merge several Documents into one, split single Document to multiple Documents, reorder or replace Document pages, change page orientation, manage Document protection, render Documents as images and more....Editor Product Solution GroupDocs...Events Acquisition GroupDocs Documentation / GroupDocs.Merger Product...
This section contains advanced examples and demonstrates all possible features for signing Documents with various electronic signature types, setup different options and visual effects....Editor Product Solution GroupDocs...Acquisition Ask AI GroupDocs Documentation / GroupDocs.Signature Product...
GroupDocs.Search has the ability to remove individual Documents from the index that are indexed from a stream or structure....Editor Product Solution GroupDocs...Acquisition Ask AI GroupDocs Documentation / GroupDocs.Search Product...
This topic describes how to optimize PDF file using the GroupDocs.Viewer Java API by removing the unused (orphaned) resources and thus to reduce the file size....Editor Product Solution GroupDocs...Acquisition Ask AI GroupDocs Documentation / GroupDocs.Viewer Product...
To extract metadata from Microsoft Office Word Documents getMetadata method is used. This method allows to extract the following metadata:
Name Description title The title of the Document. subject The subject of the Document. keywords The keyword of the Document. comments The comments of the Document. content-status The content status of the Document. category The category of the Document. company The company of the Document. manager The manager of the Document....Editor Product Solution GroupDocs...Acquisition Ask AI GroupDocs Documentation / GroupDocs.Parser Product...
Edit Metadata in CSharp on GroupDocs Blog | Document Automation Solutions for .NET & Java Developers Recent content in Edit Metadata in CSharp on GroupDocs Blog | Document Automation Solutions for ......CSharp on GroupDocs Blog | Document Automation Solutions for ...CSharp on GroupDocs Blog | Document Automation Solutions for ...