I need to have a search engine + website interface built that will aggregate information from about 10 different e-commerce websites. Each website sells products, and I want my search engine to be able to search for products and provide the user with links to the products (on the other websites). This project will involve:
- developing a method for scraping product info from the relevant websites and then implementing the scraping
- analyzing the product info by a set of attributes (product title, price, URL, product size, product description, product seller, product location)
- placing this data in a database on which the search engine can perform searches
- creating an easy to use interface for the user to enter his search criteria
- enabling Google AdSense on the righthand column of the website so that I can generate click-through revenue
- connecting the website with a standard out-of-the-box webanalytics software package so that I can track what is going on, traffic, etc. Nothing fancy, though, but it needs to be integrated
- other basic webdesign work, pretty standard, nothing fancy
As part of this assignment, I need you to scrape the sites and place the data in a database. I would provide you with a list of sites.
No shopping cart or e-commerce functionality will be necessary. This is just an "aggregation" model. My search engine will provide the user with the available products, and the user will then click-through to the other websites.
## Deliverables
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
## Platform
I am flexible on this as long as you use a generally accepted platform that is well-known. I will look to you for advice on what would be quickest and most reliable.