Train a Tesseract model on the provided files.
Using Python. and OCR PDFs and return accuracy score for each page along with the raw text as well as the location of each word box. I have already all the python file - [login to view URL] it will create images of all pdf then [login to view URL] it will do optical character recognition, then [login to view URL] it will create the bounding boxes around each text.
"I want to continue the project where the program is more automated." automated process and Successfully test Postgres SQL DB functions. I need a highly experienced freelancer.
Please write top of the proposal time and money.
11 freelancers are bidding on average $239/hour for this job
We have a few OCR capable in extracting targeted text. As a stand alone. Not linked to an API. Contact me if you are interested for further information and demo
I am phd in statistics and have 5 years experience in data analytics. For one project, we needed to pull data from pdf files but data was in image format. I used Tesseract to pull data. I work with postgres database