This article shows how to extract data from documents of various formats including PDF, Microsoft Word (DOC, DOCX), Excel (XLS, XLSX), LibreOffice formats etc....getName () + ": " ); PageTextArea area = data . get ( i ). getPageArea...println ( area == null ? "Not a template field" : area . getText...