Apache PDFBox ® - A Java PDF Library

The Apache PDFBox ® library is an open source Java tool for working with PDF documents. This project allows creation of new PDF documents, manipulation of existing documents and the ability to extract content from documents. Apache PDFBox also includes several command-line utilities. Apache PDFBox is published under the Apache License v2.0.

Apache PDFBox 3.0.3 released
2024-08-08

The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 3.0.3. It is available for download at:

See the full release notes for details about this release.

The Migration Guide shall give users coming from PDFBox 2.0.x an overview about things to look at when switching over. More details to come.

Getting Help

To get help on using PDFBox, please Subscribe to the Users Mailing List and post your questions there. We're happy to help.

The project is a volunteer effort and we're always looking for interested people to help us improve PDFBox. There are a multitude of ways that you can help us depending on your skills. Subscribe to the Mailing Lists and find out how you can help.

Features

Extract Unicode text from PDF files.

Split a single PDF into many files or merge multiple PDF files.

Extract data from PDF forms or fill a PDF form.

Validate PDF files against the PDF/A-1b standard.

Print a PDF file using the standard Java printing API.

Save PDFs as image files, such as PNG or JPEG.

Create a PDF from scratch, with embedded fonts and images.

Digitally sign PDF files.

News

Apache PDFBox 2.0.32 released
2024-07-24

The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.32. It is available for download at:

See the full release notes for details about this release.

Apache PDFBox 2.0.31 released
2024-03-24

The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.31. It is available for download at:

See the full release notes for details about this release.

Apache PDFBox 3.0.2 released
2024-03-14

The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 3.0.2. It is available for download at:

See the full release notes for details about this release.

The Migration Guide shall give users coming from PDFBox 2.0.x an overview about things to look at when switching over. More details to come.

Apache PDFBox 3.0.1 released
2023-11-30

The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 3.0.1. It is available for download at:

See the full release notes for details about this release.

The Migration Guide shall give users coming from PDFBox 2.0.x an overview about things to look at when switching over. More details to come.

Apache PDFBox 2.0.30 released
2023-11-04

The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.30. It is available for download at:

See the full release notes for details about this release.

Copyright © 2009–2024 The Apache Software Foundation. Licensed under the Apache License, Version 2.0.
Apache PDFBox, PDFBox, Apache, the Apache feather logo and the Apache PDFBox project logos are trademarks of The Apache Software Foundation.