Regarding the PDF layering issue #143643
Replies: 3 comments
-
Thanks for posting in the GitHub Community, @Hzjboss404! We’ve moved your post to our Programming Help 🧑💻 category, which is more appropriate for this type of discussion. Please review our guidelines about the Programming Help category for more information.
-
Hi @Hzjboss404, here is one possible solution to that issue. OCRmyPDF is a command-line tool designed specifically for adding OCR layers to PDFs without converting them into images. It preserves the original PDF content, including vector graphics and fonts, while adding a hidden text layer that makes the document searchable.
Hope this helps! Thank you.
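In case it is useful, here is a minimal sketch of how the OCRmyPDF suggestion above could be scripted from Python. It assumes the `ocrmypdf` command is installed and on your PATH. The helper names `build_ocr_command` and `add_text_layer`, the file names, and the `eng` default language are my own placeholders, not anything from the original post.

```python
from __future__ import annotations

import shutil
import subprocess
from pathlib import Path


def build_ocr_command(src: Path, dst: Path, language: str = "eng") -> list[str]:
    """Return the argv list for a basic OCRmyPDF run (does not execute it).

    `language` is a Tesseract language code; "eng" is only an assumed default.
    """
    return ["ocrmypdf", "--language", language, str(src), str(dst)]


def add_text_layer(src: Path, dst: Path, language: str = "eng") -> None:
    """Run OCRmyPDF on `src` and write the searchable result to `dst`.

    Raises FileNotFoundError if the tool is not installed, and
    subprocess.CalledProcessError if OCRmyPDF exits with an error.
    """
    if shutil.which("ocrmypdf") is None:
        raise FileNotFoundError("ocrmypdf not found on PATH")
    subprocess.run(build_ocr_command(src, dst, language), check=True)
```

You would call it as `add_text_layer(Path("input.pdf"), Path("output.pdf"))`. One caveat, as far as I know: by default OCRmyPDF stops with an error on pages that already contain text, and its `--force-ocr` option rasterizes the page, which would defeat the goal of keeping the vector fonts. It is worth testing on a copy of the file first and checking the OCRmyPDF documentation for the mode that fits your PDF.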
-
Select Topic Area
Product Feedback
Body
Regarding the PDF layering issue: I want to run OCR on a PDF without converting the original PDF into an image. The original PDF uses vector fonts, which stay sharp at any zoom level, but I can only select one character at a time and cannot search the text. My goal is to overlay the OCR text on the original PDF while leaving its content as it is. The software mentioned, UMI-OCR, skips over vector fonts; what I need is for the vector fonts to be recognized as if they were images, with the resulting text layer overlaid on the original PDF. That way I keep the clarity of the vector fonts and also gain search functionality.