Not able to read the pdf files

arvin_90 · September 9, 2022, 6:57am

Hello
I have PDF files that I can’t open. I’ve tried numerous libraries without success. I was able to open other PDF files before, but not these. I believe these files contain an e-signature, which is why they cannot be read.
Please assist me in opening these files.

Thank you in advance.

JDLH · September 9, 2022, 8:01pm

Hello, @arvin_90 :

I understand that there are many varieties of PDF file. It is not surprising to find a set of PDF files that don’t work with techniques that worked for previous PDF files.

As far as I know, the Python standard library does not have any modules for operating on PDF files as data. Thus you will have to use some third-party module which is able to operate on your PDF files. You can search PyPI for “PDF” and look at the various packages which it suggests.

Does anyone reading this thread know of a Python library for operating on PDF files which can handle advanced topics like protected and signed PDF files?

Are you able to open these PDF files with non-Python PDF applications, such as Adobe Acrobat or Ghostscript? That might help you understand which PDF features you need support for in the Python module you choose.

I’m sorry I can’t give you a direct answer. Hopefully this might help move your investigation forward a step or two.
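
One rough way to see which PDF features a file uses, sketched here with only the standard library: search the raw bytes for the names that usually accompany each feature. The names `MARKERS`, `scan_pdf_bytes`, and `scan_pdf_file` are made up for this illustration. Treat it as a heuristic rather than a parser, since a marker can be hidden inside a compressed object stream and a hit does not prove the feature is actually active.

```python
from pathlib import Path

# PDF names that usually show up in the raw file when a feature is in use.
# Heuristic only: compressed object streams can hide some of these.
MARKERS = {
    "encrypted": b"/Encrypt",     # encryption dictionary referenced in the trailer
    "signed": b"/ByteRange",      # byte range covered by a digital signature
    "form_fields": b"/AcroForm",  # interactive form
    "xfa_form": b"/XFA",          # XFA form data
    "fonts": b"/Font",            # font resources, needed for real text
}


def scan_pdf_bytes(data: bytes) -> dict:
    """Report which feature markers occur in the raw bytes of a PDF."""
    # The %PDF- header is expected near the start of the file.
    report = {"is_pdf": b"%PDF-" in data[:1024]}
    for name, marker in MARKERS.items():
        report[name] = marker in data
    return report


def scan_pdf_file(path) -> dict:
    """Convenience wrapper that reads the file from disk first."""
    return scan_pdf_bytes(Path(path).read_bytes())
```

Calling `scan_pdf_file` on one working file and one failing file, then comparing the two reports, may show which feature the failing files have in common.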

arvin_90 · September 10, 2022, 5:33am

Thank you for responding.

Yes, I can open the PDF in Adobe Acrobat and I tried searching the Python libraries but had no luck.

JDLH · September 10, 2022, 11:22pm

Tell us more about your requirements.

Do you want to open these PDF files using Python code? Why is it important that you open them with Python? There are PDF tools that do not involve Python.

Can you use Acrobat to open these files, and save them without digital signatures or protection? Can you do this to one or two files, to figure out a method? Can you do this to all of the files, to remove the parts of the PDF file which your Python module cannot work with?

Hazrat · September 11, 2022, 3:11pm

I tried searching for a package that can open an e-signed PDF file, but there doesn’t seem to be much hope.

There are many packages that can:

verify
e-sign
automatically scan
compress

You may refer to this source for some help:

github.com/Abhiramborige/Crypto-systems/blob/master/digital_signature.py

from Crypto.Util.number import *
from random import *
from hashlib import sha1

# Hash of message in SHA1   
def hash_function(message):
    hashed=sha1(message.encode("UTF-8")).hexdigest()
    return hashed

# Modular Multiplicative Inverse
def mod_inverse(a, m) : 
    a=a%m; 
    for x in range(1,m) : 
        if((a*x)%m==1) : 
            return(x) 
    return(1)

# Global parameters are q,p, and g
def parameter_generation():

This file has been truncated.

arvin_90 · September 12, 2022, 6:59am

Because I want to extract the data, and save it to an excel file.
I am using PyPDF2 to read the file, but when I try to print the text, nothing appears. I have the same problem in both Colab and VS Code.
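
A minimal diagnostic sketch for the empty output, assuming PyPDF2 2.x or later is installed (the versions that provide `PdfReader`, `is_encrypted`, `decrypt`, and `extract_text`). The helper names `page_text_lengths` and `looks_image_only` are made up for this illustration. It counts how many characters each page yields. If every page yields zero, the pages may be scanned images with no text layer, in which case text extraction alone will not return anything.

```python
def page_text_lengths(path):
    """Return the number of extracted characters for each page of a PDF."""
    # Third-party dependency, imported here so the helper below works without it.
    from PyPDF2 import PdfReader

    reader = PdfReader(path)
    if reader.is_encrypted:
        # Assumption: the file uses an empty user password. If it does not,
        # reading the pages below will raise an error instead.
        reader.decrypt("")
    # extract_text() can return an empty string for image-only pages.
    return [len((page.extract_text() or "").strip()) for page in reader.pages]


def looks_image_only(lengths):
    """True when the file has pages but none of them produced any text."""
    return len(lengths) > 0 and all(n == 0 for n in lengths)
```

For example, `looks_image_only(page_text_lengths("signed.pdf"))` would check a single file, where `signed.pdf` is a placeholder name.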
