This is my submission for Tolstoy's Coding Assessment. My script uses a mix of Tesseract and Google Cloud Vision to extract the relevant fields from the PDF.

pandas==2.3.2
pdf2image==1.17.0
pillow==11 ...
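
As a rough illustration of how Tesseract and Google Cloud Vision might be combined, here is a minimal sketch. It is not the submission's actual logic. It assumes that Tesseract runs first and that Cloud Vision is only called when the Tesseract output looks too short. The function names, the `min_chars` threshold, and the use of `pytesseract` and `google-cloud-vision` are all assumptions for illustration; only `pdf2image` appears in the dependency list above.

```python
import io
from typing import Callable, List

# An OCR backend is any callable that takes a page image and returns text.
OcrFn = Callable[[object], str]


def ocr_with_fallback(page, primary: OcrFn, fallback: OcrFn, min_chars: int = 20) -> str:
    """Return the primary OCR text, or the fallback text if the primary
    result is shorter than min_chars (a hypothetical quality heuristic)."""
    text = primary(page).strip()
    if len(text) >= min_chars:
        return text
    return fallback(page).strip()


def tesseract_ocr(page) -> str:
    """Local OCR via pytesseract (assumed dependency, imported lazily)."""
    import pytesseract
    return pytesseract.image_to_string(page)


def cloud_vision_ocr(page) -> str:
    """Remote OCR via Google Cloud Vision (assumed dependency; needs credentials)."""
    from google.cloud import vision
    buffer = io.BytesIO()
    page.save(buffer, format="PNG")  # page is expected to be a PIL image
    client = vision.ImageAnnotatorClient()
    response = client.document_text_detection(
        image=vision.Image(content=buffer.getvalue())
    )
    return response.full_text_annotation.text


def extract_pdf_text(pdf_path: str, dpi: int = 300) -> List[str]:
    """Rasterize each PDF page and OCR it, one text string per page."""
    from pdf2image import convert_from_path
    pages = convert_from_path(pdf_path, dpi=dpi)
    return [ocr_with_fallback(p, tesseract_ocr, cloud_vision_ocr) for p in pages]
```

Because the backends are passed in as plain callables, the fallback rule can be exercised with stub functions, without installing either OCR engine or configuring Cloud credentials.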