
Crawler: Scrapy Optimization

$250-750 USD

In Progress
Posted almost 10 years ago

Paid on delivery
I need my existing scraper/crawler code optimized. These are the problems I need resolved:

* Prevent random timeouts on links that actually work. For example, some links that go to LinkedIn cause the crawler to time out.
* Minimize CPU usage and make the crawler more efficient.
* Make the crawler insert into the database efficiently. You are free to use either MySQL or MongoDB, whichever you think is best.
* Use a different method for checking URL uniqueness (crawler options). Do not use MySQL for this because it is too expensive. One option is Redis.
* Use a different database for storing crawler jobs, e.g. Redis.

I have included details of our scraper build so that you have some history of the data we are scraping.
Project ID: 5884265
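
As a rough illustration of the URL-uniqueness requirement, the sketch below hashes a normalized URL and keeps the hashes in a set. The names `url_fingerprint` and `SeenUrls` and the normalization rules are assumptions made for this example, not part of the client's code. The in-memory set stands in for a Redis set: with Redis, the `add` method could map to a single `SADD` call, which reports whether the member was newly added.

```python
import hashlib
from urllib.parse import urlsplit, urlunsplit


def url_fingerprint(url: str) -> str:
    """Return a stable 40-character key for a URL (hypothetical rules)."""
    parts = urlsplit(url.strip())
    # Lower-case scheme and host, drop the fragment, default the path to "/".
    normalized = urlunsplit((
        parts.scheme.lower(),
        parts.netloc.lower(),
        parts.path or "/",
        parts.query,
        "",
    ))
    return hashlib.sha1(normalized.encode("utf-8")).hexdigest()


class SeenUrls:
    """In-memory stand-in for a Redis set keyed by URL fingerprint."""

    def __init__(self):
        self._seen = set()

    def add(self, url: str) -> bool:
        """Record the URL; return True only the first time it is seen."""
        key = url_fingerprint(url)
        if key in self._seen:
            return False
        self._seen.add(key)
        return True
```

Storing fixed-length hashes rather than full URLs keeps memory use predictable, and a set lookup avoids a database query per discovered link.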

About the project

12 proposals
Remote project
Active 10 yrs ago

12 freelancers are bidding on average $567 USD for this job
I am a senior Python developer with more than 4 years of experience working for different employers, and I have always delivered to the best of my ability. I am proficient in all areas of Python, including distributed computing, data mining, imaging, network programming, offensive security tool development, web application development, and more.
$555 USD in 12 days
4.8 (19 reviews)
6.1
Hello, I can optimize your scraper code. I will be using MySQL. Thanks.
$526 USD in 5 days
4.8 (82 reviews)
6.2
Hi. I have strong experience in Python and can help you with this using Scrapy, BeautifulSoup, Selenium, and so on. Please see the attachment. It is sample source code for scraping products from an e-commerce site and for scraping doctors' personal info, developed with the Scrapy framework and BeautifulSoup. I have the skills to help you. Please hire me and I will be ready to start. Thanks. Regards.
$777 USD in 10 days
4.9 (23 reviews)
5.4
Hi, can you show the code of your Scrapy project? Do you re-request pages that time out? Best regards, Ilshat
$736 USD in 10 days
4.7 (22 reviews)
5.1
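
The question above about re-requesting pages that time out can be illustrated with a small retry helper. This is a minimal sketch under stated assumptions: `fetch_with_retries` and its parameters are hypothetical names, and `fetch` is a placeholder for whatever HTTP client the crawler actually uses, assumed to raise `TimeoutError` on a timeout. If the project is built on Scrapy, its `RETRY_TIMES` and `DOWNLOAD_TIMEOUT` settings cover similar ground.

```python
import time


def fetch_with_retries(fetch, url, retries=3, base_delay=1.0, sleep=time.sleep):
    """Call fetch(url), retrying on TimeoutError with exponential backoff.

    `fetch` is any callable that returns a response or raises TimeoutError.
    `sleep` is injectable so the waiting can be skipped in tests.
    """
    for attempt in range(retries + 1):
        try:
            return fetch(url)
        except TimeoutError:
            if attempt == retries:
                raise  # out of retries: let the caller handle the failure
            # Wait 1x, 2x, 4x ... the base delay before the next attempt.
            sleep(base_delay * (2 ** attempt))
```

Backing off between attempts gives a slow or rate-limiting site time to recover instead of hitting it again immediately.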
A proposal has not yet been provided
$666 USD in 10 days
5.0 (11 reviews)
4.5
My name is Duane and I am an application developer with 7 years’ experience in the IT industry. I am the project manager for a small software company, Water Web Design LLC. The technologies we use are HTML5, CSS3, C, C++, PHP, MySQL, JavaScript, Ajax, Java, and Python. We develop websites, mobile applications, and desktop applications. I am ready to start working on this project and would like to make this a long-term business relationship.
$444 USD in 10 days
2.5 (2 reviews)
3.3
Hi, I am very pleased to help you. I'm the right person for your crawler :-) I'm experienced in Python and advanced in SQL optimization.

* Prevent random timeouts on links that actually work. For example, some links that go to LinkedIn cause the crawler to time out.
  Solution: retry the request when it times out.
* Minimize CPU usage and make the crawler more efficient.
  Solution: add gevent (non-blocking async I/O) support and multithreading.
* Make the crawler insert into the database efficiently. You are free to use either MySQL or MongoDB, whichever you think is best.
  Solution: optimize the SQL statements :-)
* Use a different method for checking URL uniqueness (crawler options). Do not use MySQL for this because it is too expensive. One option is Redis.
  Solution: I have created a distributed crawler using Redis, so porting my code to your crawler should be enough.
* Use a different database for storing crawler jobs, e.g. Redis.
  Solution: porting my Redis code should cover this as well.

I'll keep in touch with you if I am awarded. Thank you :-)
$277 USD in 10 days
5.0 (2 reviews)
1.3
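
One common reading of the "optimize SQL statements" suggestion in the proposal above is to batch inserts instead of issuing one statement and one commit per row. The sketch below is only an illustration: it uses Python's built-in `sqlite3` as a stand-in for MySQL, and the `pages` table, its columns, and the `insert_pages` helper are hypothetical. With a MySQL driver the placeholders would typically be `%s` and the statement `INSERT IGNORE`.

```python
import sqlite3


def insert_pages(conn, rows, batch_size=500):
    """Insert a list of (url, title) tuples in batches.

    Each batch runs in its own transaction. Rows whose URL already
    exists are skipped. Returns the number of rows submitted.
    """
    sql = "INSERT OR IGNORE INTO pages (url, title) VALUES (?, ?)"
    total = 0
    for start in range(0, len(rows), batch_size):
        batch = rows[start:start + batch_size]
        with conn:  # commits on success, rolls back on error
            conn.executemany(sql, batch)
        total += len(batch)
    return total


# Example setup with an in-memory database (assumed schema).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE pages (url TEXT PRIMARY KEY, title TEXT)")
insert_pages(conn, [("http://example.com/", "Example")])
```

Committing once per batch rather than once per row usually reduces round trips and disk syncs, which tends to matter more than tuning individual statements.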
My expertise includes data mining, image processing, web and desktop application development, network tools, and more. Try me and you will see. :)
$333 USD in 15 days
0.0 (0 reviews)
0.0
Hey there! Looking forward to being hired! I have many years of experience in Python. Let me know if you're interested.
$744 USD in 8 days
0.0 (0 reviews)
0.0

About the client

Chicago, United States
5.0
108
Payment method verified
Member since Sep 7, 2010
