Sort Score
Result 10 results
Languages All
Labels All
Results 941 - 950 of 1,629 for

text extraction

(0.12 sec)
  1. GroupDocs.Viewer for Python overview | GroupDocs

    GroupDocs.Viewer for Python: View files (DOCX, PDF, etc.) as HTML, PNG, JPEG, PDF. Cross-platform library for Python apps.... Extract text from PDF files and images...

    docs.groupdocs.com/viewer/python-net/product-ov...
  2. Convert PDF files | GroupDocs

    This article demonstrates how to convert PDF to Word, Excel, PowerPoint, HTML, image and other formats with GroupDocs.Conversion for .NET....PDF files is not limited to text only, it could be hyperlinks...content manipulations and editing text, tables, images, lists, etc...

    docs.groupdocs.com/conversion/net/convert/pdf/
  3. Indexing with stop words | GroupDocs

    Stop words are frequently used words that do not carry a semantic meaning and can be removed from an index to reduce its size. You can enable or disable the use of stop words by calling the setUseStopWords method of the IndexSettings class. The default value is true, meaning that stop words are filtered during indexing and not added to the index. A list of stop words to use during indexing can be specified in the stop word dictionary....need to keep all text information extracted from documents, and...

    docs.groupdocs.com/search/nodejs-java/indexing-...
  4. sample.chm

    4.13.4 ‘Internet protocol’ This option allows the user to select between the old and new Internet protocols and addressing schemes (IPv4 and IPv6). The selected protocol will be used for most outgo......the server to request coloured text. If you have a particularly...mode controls what happens when text printed in a PuTTY window reaches...

    docs.groupdocs.com/viewer/nodejs-java/_sample_f...
  5. Working with metadata in Presentations | GroupDocs

    GroupDocs.Metadata for Java provides functionality that allows working with different kinds of presentations such as PPT, PPTX, POTM, POTX, etc. For the full list of supported presentation formats please refer to Supported document formats. Detecting the exact type of a presentation The following sample of code will help you to detect the exact type of a loaded presentation and extract some additional file format information. Load a presentation Extract the root metadata package Use the getPresentationType method to obtain file format information advanced_usage....a loaded presentation and extract some additional file format...information. a presentation Extract the root metadata package Use...

    docs.groupdocs.com/metadata/java/working-with-m...
  6. Working with metadata in PNG images | GroupDocs

    Reading PNG metadata properties The GroupDocs.Metadata API supports extracting format-specific information from PNG images. The following are the steps to read the native PNG metadata. Load a PNG image Get the root metadata package Extract the native metadata package using PngRootPackage.PngPackage Read the PNG metadata properties AdvancedUsage.ManagingMetadataForSpecificFormats.Image.Png.PngReadTextChunks using (Metadata metadata = new Metadata(Constants.InputPng)) { var root = metadata.GetRootPackage(); foreach (var chunk in root.PngPackage.TextChunks) { Console.WriteLine(chunk.Keyword); Console.WriteLine(chunk.Text); var compressedChunk = chunk as PngCompressedTextChunk; if (compressedChunk !...Metadata API supports extracting format-specific information...the root metadata package Extract the native metadata package...

    docs.groupdocs.com/metadata/net/working-with-me...
  7. Update EPUB Metadata using C# – .NET Metadata E...

    Discover ways to view and edit EPUB eBook metadata using C#. Programmatically edit EPUB specific properties and Dublin Core items using C#....digital ebook format that contains text, images, and multimedia elements...using the Metadata class. Extract the metadata root package using...

    blog.groupdocs.com/metadata/view-edit-epub-meta...
  8. GroupDocs.Viewer Product Family on GroupDocs Bl...

    GroupDocs.Viewer Product Family on GroupDocs Blog | Document Automation Solutions for .NET & Java Developers Recent content in GroupDocs.Viewer Product Family on GroupDocs Blog | Document Automatio......present documents, including text formatting and images, in a...flat document, including the text, fonts, vector graphics, raster...

    blog.groupdocs.com/categories/groupdocs.viewer-...
  9. Working with worksheet backgrounds | GroupDocs

    Extract, remove, and watermark worksheet backgrounds using Python via .NET....Leave feedback On this page Extracting information about all worksheet...applies a centered, rotated text watermark to each worksheet’s...

    docs.groupdocs.com/watermark/python-net/working...
  10. Convert HTML to MHTML using Python

    Convert HTML to MHTML using Python for complete web archiving. Easily export HTML to MHTML in Python and bundle images, styles, and fonts into a single file.... Looking for a plain text companion output? See how to...HTML to TXT using Python to extract readable content from your...

    kb.groupdocs.com/conversion/python/convert-html...