API allows creating of full-text and / or metadata index on documents. To index only metadata without main content of documents, you only need to set IndexType.MetadataIndex when creating an index....to search over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more...
This article explains that how to extract Markdown formatted text from document page....extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails...
This article explains the method which can be used when for some reason files have non-standard extensions or if its format is supported, but not pre-configured....document formats like PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails...
Reading BMP header properties The GroupDocs.Metadata API supports extracting format-specific information from BMP file headers.
The following are the steps to read the header of a BMP file.
Load a BMP image Get the root metadata package Extract the native metadata package using the BmpRootPackage.getBmpHeader method Read the BMP header properties advanced_usage.managing_metadata_for_specific_formats.image.bmp.BmpReadHeaderProperties
try (Metadata metadata = new Metadata(Constants.InputBmp)) { BmpRootPackage root = metadata.getRootPackageGeneric(); System.out.println(root.getBmpHeader().getBitsPerPixel()); System.out.println(root.getBmpHeader().getColorsImportant()); System.out.println(root.getBmpHeader().getHeaderSize()); System.out.println(root.getBmpHeader().getImageSize()); System.out.println(root.getBmpHeader().getPlanes()); } More resources GitHub examples You may easily run the code above and see the feature in action in our GitHub examples:...and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails...
Reading DICOM metadata properties The GroupDocs.Metadata API supports extracting format-specific information from DICOM images.
The following are the steps to read the native DICOM metadata.
Load a DICOM image Get the root metadata package Extract the native metadata package using the DicomRootPackage.getDicomPackage method Read the DICOM metadata properties examples.advanced_usage.managing_metadata_for_specific_formats.image.dicom.DicomReadNativeMetadataProperties
try (Metadata metadata = new Metadata(Constants.InputDicom)) { DicomRootPackage root = metadata.getRootPackageGeneric(); if (root.getDicomPackage() != null) { System.out.println(root.getDicomPackage().getBitsAllocated()); System.out.println(root.getDicomPackage().getReds()); System.out.println(root.getDicomPackage().getGreens()); System.out.println(root.getDicomPackage().getBlues()); System....and edit metadata of PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails...
Search text by literal or regex and highlight found text in the document, loaded to the GroupDocs.Viewer for .NET...WordProcessing family (DOC, DOCX, ODT, RTF, etc.), Spreadsheet...loading two distinct documents — DOCX and PDF, and performing search...
This guide demonstrates how to edit content of PDF files like a common text documents using a GroupDocs.Editor for .NET....While you can freely edit any DOCX document by adding, removing...“truly” text documents like DOCX or TXT. Concluding: Editing...