Sort Score
Result 10 results
Languages All
Labels All
Results 711 - 720 of 1,632 for

text extraction

(0.06 sec)
  1. Existing objects in word processing document | ...

    Remove, inspect, and modify shapes (potential watermarks) in Word documents using Python via .NET....for shapes matching image or text criteria and removes the found...Create and initialize image or text search criteria Find possible...

    docs.groupdocs.com/watermark/python-net/existin...
  2. Releasing GroupDocs.Parser for Java – A Conveni...

    We are pleased to announce that the first version of GroupDocs.Parser for Java has been released. GroupDocs.Parser for Java allows the Java developers to extract raw and formatted Text from the popular document formats. The API also supports working with containers such as ZIP and email containers. You can also access the metadata attached to the documents using a few lines of code. Please continue to read more about the features and the file formats supported by the API....Java developers to extract raw and formatted text from the popular...Java. Extract text from various document formats Extract main...

    blog.groupdocs.com/parser/releasing-groupdocs.p...
  3. GroupDocs.Markdown for .NET — Export Documents ...

    Export PDF, Word, Excel, HTML, and more to Markdown with an on‑premises .NET API. First public release is published....work well with HTML or plain text, these native document formats...that generates text and answers based on large text corpora. RAG...

    blog.groupdocs.com/markdown/groupdocs-markdown-...
  4. Working with fonts in GroupDocs.Viewer for .NET

    Learn how to get list of used fonts, specify or replace missing fonts, exclude fonts...user adds some text to the DOCX document, this text always has some...} } Please note that font extraction is supported only for the...

    blog.groupdocs.com/viewer/working-with-fonts/
  5. Get a list of changes | GroupDocs

    This article explains how to get a collection of changes between compared documents when using GroupDocs.Comparison for Node.js via Java....) const text = ( change . getText () || ''...trim (); // Normalized change text const src = ( change . getSourceText...

    docs.groupdocs.com/comparison/nodejs-java/get-l...
  6. Extract TOC from EPUB Documents using GroupDocs...

    It gives us immense pleasure to announce the release of version 18.4 of GroupDocs.Text for .NET. The latest version allows extracting the table of contents from the EPUB documents. Furthermore, we have added the feature of detecting media type of .one file. Following sections provide details about the newly added features. Extracting TOC from EPUB Documents Using version 18.4, you can now extract TOC from the EPUB documents. To access the TOC, TableOfContents property of **EpubPackage **class is used....Text for .NET. The latest version allows extracting the table...the newly added features. Extracting TOC from EPUB Documents #...

    blog.groupdocs.com/parser/extract-toc-from-epub...
  7. GroupDocs.Parser for .NET

    This API allows you to perform Text search and index any type of file format using C# .NET language on any platform....using C# Extract Text from DOCM using C# Extract Text from MHTML...using C# Extract Text from TXT using C# Extract Text from EPUB...

    kb.groupdocs.com/parser/net/page/2/
  8. Highlighting search results | GroupDocs

    This article gives knowledge on how to highlight search results in the Text of a document....in the text of a document. Hit highlighting in the text of entire...document can be highlighted in the text of the document using the method...

    docs.groupdocs.com/search/nodejs-java/highlight...
  9. Extract data from Microsoft Office Word documen...

    GroupDocs.Parser provides the functionality to extract data from Microsoft Office Word documents. Both classic (doc, dot) and Open XML (docx, dotx) formats are supported. Also LibreOffice Writer (OpenOffice.org Writer) formats and RTF are supported. The following table provides the list of supported formats: Format Description DOC Microsoft Office Word Document DOT Microsoft Office Word Document Template DOCX Microsoft Office Open XML Document DOCM Microsoft Office Open XML Macro-Enabled Document DOTX Microsoft Office Open XML Document Template DOTM Microsoft Office Open XML Document Macro-Enabled Template TXT Plain Text ODT Open Document Text OTT Open Document Text Template RTF Rich Text Format More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...usage / Extract data from various formats / Extract data from...Microsoft Office Word documents Extract data from Microsoft Office...

    docs.groupdocs.com/parser/net/extract-data-from...
  10. Enabling language information | GroupDocs

    Follow this guide to learn how to edit Word documents using locale information and apply spell-checkers to document content written in different languages using GroupDocs.Editor for Node.js and Java....contain text in multiple languages. Unlike plain text documents...of every piece of text. allows extracting and exporting this...

    docs.groupdocs.com/editor/nodejs-java/enabling-...