
Read Data from PDF File and Display on Console in Java
There are several libraries for reading data from a PDF in Java. Let us see how to read data from a PDF document and display it on the console using the Apache PDFBox library.
You can extract text using the getText() method of the PDFTextStripper class, which extracts all the text from a given PDF document. To use it, follow these steps:
Load an existing PDF document using the static method load() of the PDDocument class. (This is the PDFBox 2.x API; in PDFBox 3.x it was replaced by Loader.loadPDF().)
Instantiate the PDFTextStripper class.
Read the contents of the PDF document into a String using the getText() method of the PDFTextStripper class.
Finally, close the document using the close() method of the PDDocument class as shown below.
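Before the load step, it can help to confirm that the file really is a PDF, so that a wrong path or file type is reported clearly. The sketch below is an optional addition and not part of PDFBox: the class name PdfHeaderCheck and the method looksLikePdf are hypothetical, and it uses only the standard library. It relies on the fact that a well-formed PDF file begins with the ASCII marker "%PDF-". Treat it as a quick heuristic rather than full validation, since it rejects files that have stray bytes before the header.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;

public class PdfHeaderCheck {
    // A well-formed PDF file starts with the ASCII marker "%PDF-".
    private static final byte[] PDF_MAGIC = "%PDF-".getBytes(StandardCharsets.US_ASCII);

    // Hypothetical helper: returns true if the path is a regular file
    // whose first bytes are exactly "%PDF-".
    public static boolean looksLikePdf(Path path) throws IOException {
        if (!Files.isRegularFile(path)) {
            return false;
        }
        try (InputStream in = Files.newInputStream(path)) {
            byte[] head = new byte[PDF_MAGIC.length];
            int total = 0;
            // read() may return fewer bytes than requested, so loop until
            // the buffer is full or the end of the file is reached.
            while (total < head.length) {
                int n = in.read(head, total, head.length - total);
                if (n < 0) {
                    break;
                }
                total += n;
            }
            return total == head.length && Arrays.equals(head, PDF_MAGIC);
        }
    }

    public static void main(String[] args) throws IOException {
        // The path is an assumption: pass your own file as the first argument.
        Path path = Paths.get(args.length > 0 ? args[0] : "D://Sample.pdf");
        if (looksLikePdf(path)) {
            System.out.println(path + " looks like a PDF");
        } else {
            System.out.println(path + " is missing or does not start with %PDF-");
        }
    }
}
```

If the check passes, the same path can then be handed to PDDocument.load(), which still performs its own parsing and throws an IOException when the document cannot be read.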
Example
Assume we have a PDF named Sample.pdf in the directory D://.
The following Java program reads the contents of the above-mentioned PDF document and displays them on the console.
import java.io.File;
import java.io.IOException;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.text.PDFTextStripper;

public class PdfToConsole {
   public static void main(String[] args) throws IOException {
      // Loading an existing document
      File file = new File("D://Sample.pdf");
      PDDocument document = PDDocument.load(file);

      // Instantiate PDFTextStripper class
      PDFTextStripper pdfStripper = new PDFTextStripper();

      // Retrieving text from PDF document
      String text = pdfStripper.getText(document);
      System.out.println(text);

      // Closing the document
      document.close();
   }
}
Output
Tutorials Point originated from the idea that there exists a class of readers who respond better to online content and prefer to learn new skills at their own pace from the comforts of their drawing rooms. The journey commenced with a single tutorial on HTML in 2006 and elated by the response it generated, we worked our way to adding fresh tutorials to our repository which now proudly flaunts a wealth of tutorials and allied articles on topics ranging from programming languages to web designing to academics and much more.