GroupDocs.Parser provides the functionality to extract data from Microsoft Office Word documents. Both classic (doc, dot) and Open XML (docx, dotx) formats are supported. Also LibreOffice Writer (OpenOffice.org Writer) formats and RTF are supported.
The following table provides the list of supported formats:
Format Description DOC Microsoft Office Word Document DOT Microsoft Office Word Document Template DOCX Microsoft Office Open XML Document DOCM Microsoft Office Open XML Macro-Enabled Document DOTX Microsoft Office Open XML Document Template DOTM Microsoft Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...Comparison Product Solution GroupDocs...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
This article explains that how to optimize index to reduce the number of segments in an index using Java....Comparison Product Solution GroupDocs...over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with our...
This article gives the knowledge of the API methods which can be used to perform operations about spelling corrector....Comparison Product Solution GroupDocs...over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with our...
This article demonstrate that how to save computing resources, you can notify the index about the renaming of the document, and then the document will not be reindexed during the update operation...Comparison Product Solution GroupDocs...over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with our...
Learn advanced logging techniques available in the redaction API...Comparison Product Solution GroupDocs...formats like PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
This article gives the knowledge of the API methods which can be used to perform operations about Alphabets using Java....Comparison Product Solution GroupDocs...over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with our...
This article explains that how to extract attachments from PDF documents...Comparison Product Solution GroupDocs...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
An interface is used to receive the information about errors, warnings and events which occur while data extraction....Comparison Product Solution GroupDocs...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
This article gives the knowledge of the case sensitive search which allows you to find words considering uppercase and lowercase letters as distinct using Java....Comparison Product Solution GroupDocs...over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with our...