Detecting the version of a PDF document The following sample of code will help you to detect the PDF version a loaded document and Extract some additional file format information.
Load a PDF document Extract the root metadata package Use the FileType property to obtain file format information AdvancedUsage.ManagingMetadataForSpecificFormats.Document.Pdf.PdfReadFileFormatProperties
using (Metadata metadata = new Metadata(Constants.InputPdf)) { var root = metadata.GetRootPackage(); Console.WriteLine(root.FileType.FileFormat); Console.WriteLine(root.FileType.Version); Console.WriteLine(root.FileType.MimeType); Console.WriteLine(root.FileType.Extension); } Reading built-in metadata properties To access built-in metadata of a PDF document, please use the DocumentProperties property defined in the DocumentRootPackage class....version a loaded document and extract some additional file format...information. a PDF document Extract the root metadata package Use...
Detecting the version of a PDF document The following sample of code will help you to detect the PDF version a loaded document and Extract some additional file format information.
Load a PDF document Extract the root metadata package Use the getPdfType method to obtain file format information advanced_usage.managing_metadata_for_specific_formats.document.pdf.PdfReadFileFormatProperties
try (Metadata metadata = new Metadata(Constants.InputPdf)) { PdfRootPackage root = metadata.getRootPackageGeneric(); System.out.println(root.getPdfType().getFileFormat()); System.out.println(root.getPdfType().getVersion()); System.out.println(root.getPdfType().getMimeType()); System.out.println(root.getPdfType().getExtension()); } Reading built-in metadata properties To access built-in metadata of a PDF document, please use the getDocumentProperties method defined in the DocumentRootPackage class....version a loaded document and extract some additional file format...information. a PDF document Extract the root metadata package Use...
This article explains how to access EXIF metadata in a file of any supported format....that contains EXIF metadata Extract the EXIF metadata package using...direct access to the EXIF tags extracted from a file. a file that contains...
This article explains how to load PDF, Word, Excel, PowerPoint documents from a stream when using GroupDocs.Parser for Python via .NET....Parser ( stream ) as parser : # Extract text from the document text_reader...is not None : # Print the extracted text print ( text_reader )...
GroupDocs Blog - GroupDocs Blog | Document Automation Solutions for .NET & Java Developers...often have requirements to extract text from various documents...We have already discussed extracting ZIP archives, counting words...
This article explains that how to search text from Microsoft OneNote sections presentations...usage / Extract data from various formats / Extract data from...section. For example, text extraction isn’t supported for Zip archive...
GroupDocs.Viewer for Python via .NET is a powerful API supporting 180+ file formats, enabling developers to easily render documents to PDF, HTML, JPG, or PNG....NET Latest Answers Extract Text from PDF using Python...
To search a keyword in Microsoft Office Word documents search(String) method is used. This method returns the collection of SearchResult objects. For details, see Search Text.
Here are the steps to search a keyword in Microsoft Office Word document:
Instantiate Parser object for the initial document; Call search(String) method and obtain the collection of SearchResult objects; Iterate through the collection and get the position and text. Warning search(String) method returns null value if search isn’t supported for the document....Usage / Extract data from various formats / Extract data from...document. For example, text extraction isn’t supported for Zip archive...
To search a keyword in EPUB e-books Search(String) method is used. This method returns the collection of SearchResult objects....usage / Extract data from various formats / Extract data from...e-book. For example, text extraction isn’t supported for Zip archive...
We are about to release GroupDocs.Search for Java, a full-featured, back-end API that could easily be utilized by Java developers in their projects. It is a fascinating document search API that Extracts text and metadata from documents. Furthermore, it performs advanced searching and indexing operations on the basis of fuzzy and synonym algorithms. API also supports full-text search.
Features Offered by GroupDocs.Search for JavaGroupDocs will keep on updating this API with new features....fascinating document search API that extracts text and metadata from documents...events Extract the list of indexed documents Extract document...