To extract text from Microsoft Office Word documents, use the getText and getText(int) methods. These methods extract text from the entire document or from a selected page. The TextOptions parameter is ignored for Microsoft Office Word documents.
Here are the steps to extract text from a Microsoft Office Word document:
Instantiate a Parser object for the initial document; call the getText method and obtain a TextReader object; read the text from the reader.
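The three steps above can be sketched in Java roughly as follows. This is a minimal sketch, assuming GroupDocs.Parser for Java is on the classpath and that getText returns null when text extraction is not supported for the document; the file name sample.docx is a hypothetical placeholder, not something taken from the original article.

```java
import com.groupdocs.parser.Parser;
import com.groupdocs.parser.data.TextReader;

public class ExtractWordText {
    public static void main(String[] args) throws Exception {
        // Hypothetical input path; replace it with a real .doc or .docx file.
        String path = "sample.docx";

        // Step 1: instantiate a Parser object for the document.
        try (Parser parser = new Parser(path)) {
            // Step 2: call getText and obtain a TextReader object.
            try (TextReader reader = parser.getText()) {
                // Step 3: read the text from the reader.
                // Assumption: a null reader means extraction is not supported.
                System.out.println(reader == null
                        ? "Text extraction isn't supported"
                        : reader.readToEnd());
            }
        }
    }
}
```

To read a single page instead of the whole document, the getText(int) overload mentioned above would take the place of getText() in step 2.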
Learn how to easily extract table content from Word documents (.doc, .docx) using GroupDocs.Parser for .NET.
This article explains how to search for encrypted electronic signatures in QR-code electronic signatures. This topic contains an example of using standard encryption, serialization, class definition, and search for embedded objects in QR codes with the GroupDocs.Signature API.
This article explains how to detect the document file type and calculate the number of pages when converting a file with GroupDocs.Conversion for Java.
In this article, you will learn how to extract compressed files and convert them to the desired format with GroupDocs.Conversion for .NET.
This article explains how to detect the document's file type and calculate the number of pages when converting a file with GroupDocs.Conversion for .NET.
This article describes the procedure for preserving and exporting all built-in and custom styles in the source WordProcessing document during editing.
GroupDocs.Metadata for .NET provides functionality that allows working with different kinds of WordProcessing documents such as DOC, DOCX, ODT, etc. For the full list of supported document formats please refer to Supported Document Formats.
Detecting the exact type of a document
The following code sample helps you detect the exact type of a loaded document and extract some additional file format information.
Load a WordProcessing document; extract the root metadata package; use the FileType property to obtain file format information. AdvancedUsage...
This article explains how to sign a document electronically with generated Stamp signatures using the GroupDocs.Signature for Python via .NET API.