
Generate a synthetic dataset for Chinese and English (Python)

$30-250 USD

Closed
Posted about 3 years ago

Paid on delivery
Hi, I am looking for a freelancer to create an OCR dataset for segmenting Chinese and English text from any image. The dataset should be generated with Python and be compatible with PyTorch. A lot of the code already exists; see the "What already exists?" section.

Tasks that need to be completed:

1. Extract sentence fragments from a dialogue text file and a dictionary database.
2. Add each sentence to an image, with the entire sentence inside a bounding box (placed randomly, without overlap).
3. Generate a fixed dataset inside the `dataset/` directory.
4. Create a PyTorch dataset with random Chinese and English sentences for semantic segmentation.

What already exists?

- [login to view URL] (synthetic single-character dataset)
- [login to view URL] (Chinese dictionary)
- different dialogue txt files for English and Chinese
- [login to view URL]

I have created a private GitHub repo for this project. You can get access to it for further details before the project begins.

If you are interested in this project, please start your bid with "OCR PROJECT". (There are many bots bidding.)
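As an illustration of task 2 (random placement without overlap), one possible approach is rejection sampling: draw a random position for each sentence box and retry if it collides with a box that is already placed. The sketch below is not the client's code. The function names, the `(x, y, w, h)` box format, and the class ids (1 for Chinese, 2 for English, 0 for background) are all assumptions made for the example. In a real pipeline the box sizes would come from the rendered text (for instance via Pillow's `ImageDraw.textbbox`), and the mask would be converted to a tensor inside a PyTorch `Dataset`.

```python
import random


def boxes_overlap(a, b):
    """Return True if two (x, y, w, h) boxes share any pixel area."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    return ax < bx + bw and bx < ax + aw and ay < by + bh and by < ay + ah


def place_boxes(sizes, image_w, image_h, rng, max_tries=200):
    """Place one box per (w, h) size at a random, non-overlapping position.

    Returns a list aligned with `sizes`. An entry is None when the box is
    larger than the image or no free position was found in `max_tries`.
    """
    placed = []
    result = []
    for w, h in sizes:
        box = None
        if w <= image_w and h <= image_h:
            for _ in range(max_tries):
                candidate = (
                    rng.randint(0, image_w - w),  # randint is inclusive
                    rng.randint(0, image_h - h),
                    w,
                    h,
                )
                if not any(boxes_overlap(candidate, p) for p in placed):
                    box = candidate
                    placed.append(box)
                    break
        result.append(box)
    return result


def rasterize_mask(boxes, labels, image_w, image_h):
    """Build a per-pixel label mask (list of rows); 0 means background."""
    mask = [[0] * image_w for _ in range(image_h)]
    for box, label in zip(boxes, labels):
        if box is None:
            continue  # sentence could not be placed
        x, y, w, h = box
        for row in range(y, y + h):
            mask[row][x:x + w] = [label] * w
    return mask


# Hypothetical usage: three sentence boxes on a 200x100 image.
rng = random.Random(0)  # fixed seed so a "fixed dataset" is reproducible
sizes = [(40, 12), (60, 12), (30, 10)]
labels = [1, 2, 1]  # assumed ids: 1 = Chinese, 2 = English
boxes = place_boxes(sizes, 200, 100, rng)
mask = rasterize_mask(boxes, labels, 200, 100)
```

Rejection sampling is simple and works well while the image is sparsely filled. For densely packed images a grid-based or packing strategy would likely be needed, since the retry loop fails more often as free space shrinks.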
Project ID: 29879822

About the project

3 proposals
Remote project
Active 3 yrs ago

3 freelancers are bidding on average $279 USD for this job
***"OCR PROJECT"*** Greetings, and thank you for posting this job. I understand your requirements and would like to help with a smart solution. I am an expert Python developer with 5+ years of experience in similar work, including Python, data processing, OCR, data science, and image processing. I can create an OCR dataset for segmenting Chinese and English text from any image, and a PyTorch dataset with random Chinese and English sentences for semantic segmentation. I hope for a positive response and look forward to your reply. Regards, Virang
$140 USD in 15 days
4.9 (35 reviews)
6.6
Hi, I'm a native Chinese speaker and machine learning engineer with 4 years of working experience. I've been involved in a Chinese OCR project with a leading tech company. For further discussion, please contact me via inmail.
$556 USD in 14 days
0.0 (0 reviews)
0.0

About the client

Lübbecke, Germany
5.0
11
Payment method verified
Member since Apr 3, 2021
