Wikipedia data dump miner

$15-25 USD / hour

Closed
Posted over 6 years ago

I'm looking for a Wikipedia and machine learning expert.

- Are you an expert in Wikipedia dump files?
- Do you love to write scripts that automate extractions?
- Which scripting languages do you already know? Python, Bash?
- You will work closely with our teams building user experiences and collaborative machine learning algorithms.

What do you think of this first task?

1. Given two languages, say en and zh,
2. a page category, like Living people,
3. and a specified WP dump date,
4. generate a set of sets of name strings, where each set has all of the en redirects and zh redirects for a given pair of en-zh linked titles.

For example, the set for Vladimir Putin's page would have all his redirects in English as well as his page name in Chinese and all of its redirects.

If you like that as a starting task, please give me an hour estimate for it and we can start a contract with that as the first task. Going forward, we have a bunch of tasks of this kind.
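
The grouping step of the task above can be sketched roughly as follows. This is only a minimal illustration, not the client's or any bidder's implementation. It assumes the dump has already been parsed into plain Python structures: a set of en titles in the category, an en-to-zh interlanguage link mapping, and one redirect-to-target mapping per language. The function names, the argument layout, and the sample titles are all hypothetical, and including the en page title itself in each set is an assumption.

```python
from collections import defaultdict


def invert_redirects(redirects):
    """Map each target title to the set of titles that redirect to it."""
    by_target = defaultdict(set)
    for source, target in redirects.items():
        by_target[target].add(source)
    return by_target


def build_name_sets(category_titles, langlinks, en_redirects, zh_redirects):
    """Return one frozenset of names per en-zh linked page in the category.

    category_titles: set of en page titles in the chosen category
    langlinks:       dict mapping en title -> linked zh title
    en_redirects:    dict mapping en redirect title -> en target title
    zh_redirects:    dict mapping zh redirect title -> zh target title
    """
    en_by_target = invert_redirects(en_redirects)
    zh_by_target = invert_redirects(zh_redirects)

    name_sets = []
    for en_title in sorted(category_titles):
        zh_title = langlinks.get(en_title)
        if zh_title is None:
            continue  # page has no en-zh link, so there is no pair to emit
        names = {en_title, zh_title}  # assumption: keep both page titles
        names |= en_by_target.get(en_title, set())
        names |= zh_by_target.get(zh_title, set())
        name_sets.append(frozenset(names))
    return name_sets


# Toy data for illustration only; not taken from a real dump.
category_titles = {"Vladimir Putin", "Some Person"}
langlinks = {"Vladimir Putin": "弗拉基米尔·普京"}
en_redirects = {
    "Putin": "Vladimir Putin",
    "V. Putin": "Vladimir Putin",
    "Unrelated": "Other Page",
}
zh_redirects = {"普京": "弗拉基米尔·普京"}

name_sets = build_name_sets(category_titles, langlinks, en_redirects, zh_redirects)
```

In a real run the four inputs would have to be extracted from the dump for the chosen date, for example from the category, redirect, and interlanguage link tables that Wikipedia publishes alongside the page dumps. That parsing step is deliberately left out of this sketch.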
Project ID: 14845502

About the project

10 proposals
Remote project
Active 7 yrs ago

10 freelancers are bidding on average $21 USD/hour for this job
I can write a script using Python's requests library that will generate a set of sets of name strings, based on your specified criteria. The requests library is really powerful and offers features such as persistent sessions (for fast querying). I can complete the initial task in 6 hours. If you are interested, we can talk via chat and I can tell you more about my previous (similar) work!
$25 USD in 20 days
5.0 (8 reviews)
4.5
I would like to offer you my expertise, as I have done a number of academic projects and I am a professional in the field. Contact me and I’ll show you what I am capable of.
$22 USD in 40 days
4.0 (18 reviews)
3.1
I'm CTO at datascraping [dot] club. We provide data scraping and website scraping services, and have a lot of experience with machine learning and data scraping in general. I would love to chat about your project and share my experience. Thanks
$22 USD in 40 days
5.0 (1 review)
2.2
Hello, my name is Michael. I represent the Ukraine-based IT company Webbook Inc, which provides IT services for international business. We have carefully reviewed the requirements in the job description, so our devs can work on your project without delay. We have years of experience working on projects built on any available CMS or from scratch, with core PHP and PHP frameworks (Yii/Yii2, Laravel, CodeIgniter), JavaScript, jQuery, AJAX, HTML5, CSS3, Bootstrap, JavaScript frameworks, 3D design, graphic design, etc. However, I would like to discuss the requirements and functionality in detail to get a better understanding of the time frame and price. We would be glad to chat with you and discuss everything in detail. Contact us and we will reply immediately. Waiting for your reply! Best regards, Webbook team
$22 USD in 40 days
0.0 (0 reviews)
0.0
I have hands-on expertise in Python (Beautiful Soup) web crawling. I am also a data engineer whose day job involves creating data pipelines for extraction and transformation.
$27 USD in 30 days
0.0 (0 reviews)
0.0
I have been editing Wikipedia for more than 3 years, and I have used Pywikibot with my own scripts. Besides that, I love everything related to Wikipedia and I will do this job with love :).
$15 USD in 40 days
0.0 (0 reviews)
0.0

About the client

Beijing, China
5.0
1
Member since Apr 7, 2017

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)