GroupDocs.Parser provides the functionality To extract data from Microsoft Office Word documents. Both classic (doc, dot) and Open XML (docx, dotx) formats are supported. Also LibreOffice Writer (OpenOffice.org Writer) formats and RTF are supported.
The following table provides the list of supported formats:
Format Description DOC Microsoft Office Word Document DOT Microsoft Office Word Document Template DOCX Microsoft Office Open XML Document DOCM Microsoft Office Open XML Macro-Enabled Document DOTX Microsoft Office Open XML Document Template DOTM Microsoft Office Open XML Document Macro-Enabled Template TXT Plain text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
This article explains that how To extract images from Microsoft Office Word (.doc, .docx) documents...data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
Using the GroupDocs.Metadata for Node.js via Java you can easily search metadata and extract desired metadata properties from Pdf, DOCX, PPTX, XLSX, images, audio, video and many other files of different types in your Java solution....metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
This article explains the usage of SetProperties method is used To update or add metadata. You can easily add metadata To phoTos, Pdfs or you can update or add data To mp3 files....easily add metadata to photos, pdfs or you can update or add data...metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
Programmatically render animated GIF and APNG images To HTML web page in C# with play and pause feature. Build your own animation viewer using .NET....documents, spreadsheets, PDF, presentations, emails, ZIP archives, Visio...
This article explains that how To integrate any paid or free OCR solution in Java....formats like PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
This article shows how To access IPTC metadata in a file of any supported format....pdf. Reading basic IPTC IIM properties...metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails, images...
Java API To remove all or selective metadata properties of DOCX, XLSX, PPTX, Pdf documents, JPEG, PNG, WebP images, Email, eBooks, Visio Drawings, Zip, etc....spreadsheets, presentations, PDF files, images, emails, eBooks, drawings...