1,524 questions
0
votes
1
answer
57
views
Random alignment when overlaying pdfs using pypdf
I'm trying to overlay 1 pdf onto multiple different pdf files. All of them have the exact same size (in terms of page dimensions) and they are all landscape. However, when overlaying them, the stamp ...
0
votes
2
answers
161
views
Merging pdfs with pypdf does not create an output file
I attempted to build a python "pdf envelope" script. I'm using the pypdf library to attempt to take the first command line argument as name of new file to be created, and subsequent ...
2
votes
0
answers
165
views
Can't find PDF text
I'm trying to come up with a nice feature for my work colleagues, that every day in the morning, we would get in Slack the menu of the day... inspired by https://github.com/lemedege/LunchBot/blob/main/...
0
votes
0
answers
96
views
Convert Form field objects to Text using PyPdf2
I have a pregenerated PDF with some form fields that I use PyPdf2 to fill. All fields are marked with readonly flag
form_fields = {"my_field": "Hello, World"}
pdf_content = open('...
1
vote
2
answers
135
views
How can I insert an image with a clickable hyperlink on the first page of a PDF using Python?
I'm trying to programmatically add a clickable image icon to the top-left corner of the first page of a PDF file using Python. I want the icon to link to a URL.
what I tried using reportlab and PyPDF2:...
1
vote
1
answer
195
views
Stuck during PyPDF3 installation
I want to install PyPDF3 on my Linux Mint, but the error message I get is very long but doesn't help me much. Any hints?
kav@webdev:~$ cat /etc/issue
Linux Mint 21.1 Vera \n \l
kav@webdev:~$ python3 -...
0
votes
0
answers
71
views
Why is the embedded image extracted from pypdf of lower resolution than when i used pdf2image to conver the entire pdf page?
I have a pdf document which I want to extract embedded images to be used.
I used the code below which to extract the embedded images from the first page of the pdf document:
from pypdf import ...
0
votes
2
answers
108
views
Add XMP dta to PDF file via pypdf
I need to add XMP data to a PDF file obtained through pypdf. In question reading/writing XML metadata on PDF files through pypdf, KyleBrooks answered that this is impossible. But that was in 2009. ...
2
votes
0
answers
113
views
Issue with Editable PDF Fields Not Displaying in Adobe Acrobat
I am filling in editable fields in a PDF file using Python (PyPDF library) and saving the file with the code below. However, when I open the PDF, the filled fields do not appear. If I open the saved ...
0
votes
0
answers
76
views
Need a function to overlay a pdf (containing comments) as a layer on another pdf
I tried overlaying a pdf(marked) which has comments in it as a layer on another pdf(master) using fitz and pyPDF. As expected, I'm getting the marked pdf overlayed on the master pdf as a toggleable ...
0
votes
0
answers
332
views
How to Extract Data (Including Checkboxes) from PDF Form Using Python (PyPDF2, OpenCV Not Working)
I’m trying to extract data from a PDF form that includes text fields, checkboxes and other input elements. The form looks like a structured intake form.
PyMuPDF
def extract_text_from_pdf(pdf_path):
...
-1
votes
1
answer
109
views
extraction of redline of word document converted to pdf using python library
I have a set of documents that were edited in Microsoft Word. Take for example a document that says "This is a test document." The document is then edited, with track changes on, to read &...
0
votes
0
answers
90
views
Python PyPDF not cropping as intended
I was trying to write a program that takes a PDF and splits each page into two pages. The original PDF is landscape and has two pages side-by-side 'per page', like this:
I want the PDF like this:
I ...
0
votes
1
answer
156
views
pypdf or pikepdf advice needed on bookmarks
I am sorry but I am unable to understand how to rearrange bookmarks in PDF document.
I have PDF document with medical records which was created by importing new and new items from individual ...
0
votes
0
answers
104
views
PDF form checkboxes checking using python and pdfrw
Has anyone experience with pdf form checkbox checking? The case is that within a django application, based on annotations of pdf forms I map and identify checkboxes I want to either check, or leave ...