Monthly Archives: February 2013

Simple Example of Extracting Metadata and Text from PDF Using PDFBox

Below is a simple example of how to pull text and metadata our of a pdf file using PDFBox. Much simpler to understand than using Poi with DOC and DOCX–but maybe that’s just me!     import org.apache.pdfbox.pdmodel.PDDocument; import org.apache.pdfbox.pdmodel.PDDocumentInformation; … Continue reading

Posted in Uncategorized | 2 Comments

Examples of Extracting DOC and DOCX Metadata and Text Using Poi

UPDATE: this post was written in early 2013. At the time it was written, it worked perfectly. However, I have no idea if this will work in your situation, today, as things probably have changed…If it does work for you, … Continue reading

Posted in Uncategorized | 2 Comments