This project involves working in C/C++, MySql Database and Java.
We need an application in C/C++ through which we can store some URLs in database. It can than pick the URLs from the databse and crawl through the websites while picking some content up. Some servers does not allow this kind of activity on their sites, the system should be able to act as a browser rather then an application. The crawling process must be controlable through some variables that we set in the application. For example, we must be able to set how many pages does the application crawl on a certain URL, the depth of links, how many links to follow. A full list of these variables will be supplied later.
A front-end is required where I can set the variables. A java front-end would be prefered.
## Deliverables
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
4) All rights to the software. Once the work has been done I would want the coder to delete all source code from his PC. If further changes are required I will send the piece of code that requires changes
## Platform
The programe must be platform independant. It must run on linux as well as windows.