Sort Score
Result 10 results
Languages All
Labels All
Results 941 - 950 of 1,632 for

text extraction

(0.12 sec)
  1. How to Convert DOCX to TXT using Java

    In this guide, we will provide easy steps to convert DOCX to TXT using Java. Also, we will provide you a code example representing how to export DOCX to TXT in Java....collaborating with basic text editors, extracting unformatted content...

    kb.groupdocs.com/conversion/java/how-to-convert...
  2. Working with metadata in Presentations | GroupDocs

    GroupDocs.Metadata for Java provides functionality that allows working with different kinds of presentations such as PPT, PPTX, POTM, POTX, etc. For the full list of supported presentation formats please refer to Supported document formats. Detecting the exact type of a presentation The following sample of code will help you to detect the exact type of a loaded presentation and extract some additional file format information. Load a presentation Extract the root metadata package Use the getPresentationType method to obtain file format information advanced_usage....a loaded presentation and extract some additional file format...information. a presentation Extract the root metadata package Use...

    docs.groupdocs.com/metadata/java/working-with-m...
  3. sample.chm

    4.13.4 ‘Internet protocol’ This option allows the user to select between the old and new Internet protocols and addressing schemes (IPv4 and IPv6). The selected protocol will be used for most outgo......the server to request coloured text. If you have a particularly...mode controls what happens when text printed in a PuTTY window reaches...

    docs.groupdocs.com/viewer/nodejs-java/_sample_f...
  4. Working with metadata in PDF documents | GroupDocs

    Detecting the version of a PDF document The following sample of code will help you to detect the PDF version a loaded document and extract some additional file format information. Load a PDF document Extract the root metadata package Use the getPdfType method to obtain file format information advanced_usage.managing_metadata_for_specific_formats.document.pdf.PdfReadFileFormatProperties try (Metadata metadata = new Metadata(Constants.InputPdf)) { PdfRootPackage root = metadata.getRootPackageGeneric(); System.out.println(root.getPdfType().getFileFormat()); System.out.println(root.getPdfType().getVersion()); System.out.println(root.getPdfType().getMimeType()); System.out.println(root.getPdfType().geTextension()); } Reading built-in metadata properties To access built-in metadata of a PDF document, please use the getDocumentProperties method defined in the DocumentRootPackage class....version a loaded document and extract some additional file format...information. a PDF document Extract the root metadata package Use...

    docs.groupdocs.com/metadata/java/working-with-m...
  5. Convert XLSX to XLS using Python

    Learn how to convert XLSX to XLS using Python. Step-by-step guide and code example is included in this topic for how to export XLSX to XLS in Python....need to extract spreadsheet content into a plain text format for...

    kb.groupdocs.com/conversion/python/convert-xlsx...
  6. Convert HTML to MHTML using Python

    Convert HTML to MHTML using Python for complete web archiving. Easily export HTML to MHTML in Python and bundle images, styles, and fonts into a single file.... Looking for a plain text companion output? See how to...HTML to TXT using Python to extract readable content from your...

    kb.groupdocs.com/conversion/python/convert-html...
  7. GroupDocs.Viewer for Java overview | GroupDocs

    GroupDocs.Viewer for Java enables you to convert files to HTML, PNG, JPEG, PDF file formats, and save attachments without relying on third-party applications. GroupDocs.Viewer for Java is cross-platform class library.... Extract text from PDF files and images...

    docs.groupdocs.com/viewer/java/product-overview/
  8. GroupDocs.Viewer for Python overview | GroupDocs

    GroupDocs.Viewer for Python: View files (DOCX, PDF, etc.) as HTML, PNG, JPEG, PDF. Cross-platform library for Python apps.... Extract text from PDF files and images...

    docs.groupdocs.com/viewer/python-net/product-ov...
  9. Indexing with stop words | GroupDocs

    This article demonstrates that how you can enable or disable the use of stop words by setting a value using search API...need to keep all text information extracted from documents, and...

    docs.groupdocs.com/search/net/indexing-with-sto...
  10. Indexing with stop words | GroupDocs

    Stop words are frequently used words that do not carry a semantic meaning and can be removed from an index to reduce its size. You can enable or disable the use of stop words by calling the setUseStopWords method of the IndexSettings class. The default value is true, meaning that stop words are filtered during indexing and not added to the index. A list of stop words to use during indexing can be specified in the stop word dictionary....need to keep all text information extracted from documents, and...

    docs.groupdocs.com/search/nodejs-java/indexing-...