Designing a PDF Audiobook using Python

In this code, a simple implementation of PDF Audiobook is shown. PDF text is read to the user as audio using this code.

Introduction

Reading stories or essays or any text can be arduous, however an audio reading of the text is convenient and doesnt require as much concentration as reading requires. In this project, I implemented a simple PDF to audio converter. This code scans page(s) of PDF and reads it using audio, to the user. While this project is good for simple text reading, it does not perform good if a scientific paper with equations is given to it because equations are not supported to be read in pytesseract OCR library which we used to convert image to text.

Project Flow

Here is the project flow diagram:

First, we take the PDF file and convert each page into image using PyMuPDF software.
Then, we take the image(s) and scan the text in the image using Pytesseract OCR software.
Then, we use Google Text to Speech (gTTS) library to convert text to audio file.
Lastly, we get the Pygame mixer to play the audio file loud.

Prerequisite software

The software libraries required to run this code can be installed using:

pip install -r requirements.txt

Conclusion

It was seen that the code performs really well in reading straightforward PDF text files, however, if equations are involved in the text, then the reader cannot properly read the equations. Hence, the code is good for simple text but not for scientific papers as it will fumble reading the equations. However, text will be read just fine.

Please give a star to the repo to let me know if the work helped you.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
PDF_reading_software.py		PDF_reading_software.py
README.md		README.md
audiobook image.png		audiobook image.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Designing a PDF Audiobook using Python

Introduction

Project Flow

Prerequisite software

Conclusion

About

Releases

Packages

Languages

shayanalibhatti/Designing-a-PDF-Audiobook-using-Python

Folders and files

Latest commit

History

Repository files navigation

Designing a PDF Audiobook using Python

Introduction

Project Flow

Prerequisite software

Conclusion

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages