Results 941 - 950 of 1,680 for "text extraction" (0.19 sec)
  1. Working with metadata in PNG images | GroupDocs

    Reading PNG metadata properties. The GroupDocs.Metadata API supports extracting format-specific information from PNG images. The steps to read the native PNG metadata are: (1) load a PNG image; (2) get the root metadata package; (3) extract the native metadata package using PngRootPackage.PngPackage; (4) read the PNG metadata properties. AdvancedUsage.ManagingMetadataForSpecificFormats.Image.Png.PngReadTextChunks: using (Metadata metadata = new Metadata(Constants.InputPng)) { var root = metadata.GetRootPackage<PngRootPackage>(); foreach (var chunk in root.PngPackage.TextChunks) { Console.WriteLine(chunk.Keyword); Console.WriteLine(chunk.Text); var compressedChunk = chunk as PngCompressedTextChunk; if (compressedChunk !...

    docs.groupdocs.com/metadata/net/working-with-me...
  2. Working with metadata in Presentations | GroupDocs

    GroupDocs.Metadata for Java supports working with different kinds of presentations such as PPT, PPTX, POTM, and POTX. For the full list of supported presentation formats, refer to Supported document formats. Detecting the exact type of a presentation: the following code sample detects the exact type of a loaded presentation and extracts some additional file format information. The steps are: (1) load a presentation; (2) extract the root metadata package; (3) use the getPresentationType method to obtain file format information. advanced_usage....

    docs.groupdocs.com/metadata/java/working-with-m...
  3. Indexing with stop words | GroupDocs

    Stop words are frequently used words that do not carry a semantic meaning and can be removed from an index to reduce its size. You can enable or disable the use of stop words by calling the setUseStopWords method of the IndexSettings class. The default value is true, meaning that stop words are filtered during indexing and not added to the index. A list of stop words to use during indexing can be specified in the stop word dictionary....need to keep all text information extracted from documents, and...

    docs.groupdocs.com/search/nodejs-java/indexing-...
  4. Convert PDF to DOCX using C#

    Easily convert PDF to DOCX using C# with our step-by-step guide, without installing extra tools. We also provide a code example to export PDF to DOCX using C#.... Whether you are extracting text, modifying formatting,...

    kb.groupdocs.com/conversion/net/convert-pdf-to-...
  5. Get Document Information using Java

    Learn how to get document information using Java. Improve the efficiency of document management by mastering the technique of how to get file properties using Java.... Whether you’re handling text files, spreadsheets, presentations...information, allowing them to extract metadata, fetch properties...

    kb.groupdocs.com/metadata/java/get-document-inf...
  6. Working with metadata in PDF documents | GroupDocs

    Detecting the version of a PDF document: the following code sample detects the PDF version of a loaded document and extracts some additional file format information. The steps are: (1) load a PDF document; (2) extract the root metadata package; (3) use the getPdfType method to obtain file format information. advanced_usage.managing_metadata_for_specific_formats.document.pdf.PdfReadFileFormatProperties: try (Metadata metadata = new Metadata(Constants.InputPdf)) { PdfRootPackage root = metadata.getRootPackageGeneric(); System.out.println(root.getPdfType().getFileFormat()); System.out.println(root.getPdfType().getVersion()); System.out.println(root.getPdfType().getMimeType()); System.out.println(root.getPdfType().getExtension()); } Reading built-in metadata properties: to access the built-in metadata of a PDF document, use the getDocumentProperties method defined in the DocumentRootPackage class. ...

    docs.groupdocs.com/metadata/java/working-with-m...
  7. GroupDocs.Viewer for Node.js overview | GroupDocs

    GroupDocs.Viewer for Node.js enables you to convert files to HTML, PNG, JPEG, and PDF file formats, and to save attachments without relying on third-party applications. GroupDocs.Viewer for Node.js is a cross-platform class library.... Extract text from PDF files and images...

    docs.groupdocs.com/viewer/nodejs-java/product-o...
  8. Get Document Information using C#

    Discover the process of how to get document information using C#. Enhance document management efficiency by learning how to get file properties using C#....Whether you’re dealing with text documents, spreadsheets, presentations...programmatically, enabling them to extract metadata, retrieve properties...

    kb.groupdocs.com/metadata/net/get-document-info...
  9. How to Convert DOCX to TXT using Java

    This guide provides easy steps to convert DOCX to TXT using Java, along with a code example showing how to export DOCX to TXT in Java....collaborating with basic text editors, extracting unformatted content...

    kb.groupdocs.com/conversion/java/how-to-convert...
  10. Convert PDF to DOCX using Java

    Effortlessly convert PDF to DOCX using Java, eliminating the need for additional tools. We also provide a code example to export PDF to DOCX using Java.... Whether you’re extracting text, adjusting formatting,...

    kb.groupdocs.com/conversion/java/convert-pdf-to...
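
The stop word behaviour described in result 3 ("Indexing with stop words") can be illustrated with a minimal, library-independent sketch. This is not the GroupDocs.Search implementation and does not call its API; the class StopWordFilterSketch, the method termsToIndex, and the small dictionary are hypothetical names chosen for illustration. The boolean flag only mirrors the idea behind the setUseStopWords setting mentioned in the snippet: when it is true, stop words are dropped before terms reach the index.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class StopWordFilterSketch {
    // Hypothetical stop word dictionary, for illustration only.
    static final Set<String> STOP_WORDS = Set.of("a", "an", "and", "is", "of", "the", "to");

    // Splits text into lowercase terms. When useStopWords is true,
    // terms found in the stop word dictionary are not returned,
    // so they would never be added to an index.
    static List<String> termsToIndex(String text, boolean useStopWords) {
        List<String> terms = new ArrayList<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
            if (token.isEmpty()) {
                continue; // split() can yield an empty leading token
            }
            if (useStopWords && STOP_WORDS.contains(token)) {
                continue; // filtered out: stop word
            }
            terms.add(token);
        }
        return terms;
    }

    public static void main(String[] args) {
        String text = "The size of an index is reduced";
        System.out.println(termsToIndex(text, true));  // stop words filtered
        System.out.println(termsToIndex(text, false)); // every term kept
    }
}
```

With filtering enabled, only the content-bearing terms remain, which is why the snippet says removing stop words reduces index size; with filtering disabled, every term is kept, as would be needed for exact phrase matching that includes common words.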