PDF File reader

I’d use https://anonfiles.com/ and then send me the link.

PM it, if you rather not share it with all comers.

anonfiles is not available

Google drive? Dropbox?

Thank you.

As I suspected, that PDF does not contain any text; it’s an image of the text.

You may be able to use some OCR software on it, but that’s not something that I cam help you with.

ok thanks anyway !

import pytesseract
from pdf2image import convert_from_path

def extract_text_from_scanned_pdf(pdf_path):
    # Convert each page of the PDF to images
    images = convert_from_path(pdf_path)

    extracted_text = ''
    for image in images:
        # Perform OCR on each image
        text = pytesseract.image_to_string(image, lang='eng')
        extracted_text += text

    return extracted_text

def search_mix_design_number(text, desired_mix_design_number):
    # Search for the desired mix design number in the extracted text
    if desired_mix_design_number in text:
        return True
        return False

# Example usage
pdf_path = 'C:/Users/Operator/Onedrive/Desktop/Batch Reports/New Folder/1.pdf'  # Replace with the path to your PDF file
desired_mix_design_number = 'Mix Design Number 50'  # Replace with the desired mix design number

# Extract text from the scanned PDF
text = extract_text_from_scanned_pdf(pdf_path)

# Search for the mix design number
mix_design_found = search_mix_design_number(text, desired_mix_design_number)

# Print the result
if mix_design_found:
    print("Mix Design Number found in the extracted text")
    print("Mix Design Number not found in the extracted text")

this works

Thank you for sharing the script.

I’ve not tried it myself, but it’s good to know that it’s here if need it.