Blog aggregator similar to Techmeme

Closed Posted Dec 2, 2012 Paid on delivery

I am looking to develop a platform similar to [url removed, login to view] but for a different industry. I have a collection of approximately 1200 blogs that can be used to seed it.

The site would behave much like Techmeme in that it would:

1) scrape websites/blogs for data

2) collect and index the data

3) cluster related articles/posts, either algorithmically or by way of machine learning

4) present the data in real time in a structured and easy to navigate way

5) provide a backend that would allow an administrator/user some "editorializing", such as tagging one article/post in a cluster as the top story. The backend also needs to manage all aspects of the website: sponsorships, users, database updates, adding new URLs to the scraper, etc.

6) provide a means to organize and promote sponsorships throughout the site.
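
To make requirement 3 more concrete, here is a minimal sketch of one way related posts could be grouped: TF-IDF vectors compared by cosine similarity, with a greedy single-pass grouping. This is an illustration only, not how Techmeme works and not a Mahout API. The function names, the stopword list, and the 0.3 threshold are all assumptions chosen for the example.

```python
import math
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not a standard list).
STOPWORDS = {"a", "an", "the", "of", "to", "in", "and", "for", "on", "is"}

def tokenize(text):
    """Lowercase the text, keep alphanumeric words, drop stopwords."""
    return [w for w in re.findall(r"[a-z0-9]+", text.lower()) if w not in STOPWORDS]

def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict of term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    # Document frequency: number of documents each term appears in.
    df = Counter(term for toks in tokenized for term in set(toks))
    n = len(docs)
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        # Smoothed IDF keeps the weight positive even for very common terms.
        vectors.append({t: c * (math.log((1 + n) / (1 + df[t])) + 1) for t, c in tf.items()})
    return vectors

def cosine(a, b):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def cluster(docs, threshold=0.3):
    """Greedy single-pass clustering.

    Each post joins the first cluster whose seed (first) post is at least
    `threshold` similar; otherwise it starts a new cluster.
    Returns a list of clusters, each a list of indices into `docs`.
    """
    vectors = tfidf_vectors(docs)
    clusters = []
    for i, vec in enumerate(vectors):
        for members in clusters:
            if cosine(vec, vectors[members[0]]) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters
```

Calling `cluster(list_of_headlines)` returns groups of indices, and the first index in each group could serve as a default "top story" until an editor overrides it in the backend. A production system would likely need a more robust method and tuning against real posts.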

Based on my research, this project could be accomplished using a combination of Apache Nutch, Solr, Hadoop, and Mahout.
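
As a rough picture of what the scrape and index steps involve, the sketch below parses an RSS feed and builds a small inverted index over post titles using only the Python standard library. It is a toy stand-in for discussion, not the Nutch or Solr API. The sample feed, the example.com URLs, and all function names are hypothetical.

```python
import re
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical feed content; a real crawler would fetch this over HTTP.
SAMPLE_FEED = """<rss version="2.0"><channel><title>Example Blog</title>
<item><title>Acme opens solar factory</title><link>http://example.com/a</link></item>
<item><title>Central bank raises rates</title><link>http://example.com/b</link></item>
</channel></rss>"""

def parse_rss(xml_text):
    """Extract a list of {'title', 'link'} dicts from an RSS 2.0 document."""
    root = ET.fromstring(xml_text)
    posts = []
    for item in root.iter("item"):
        posts.append({
            "title": (item.findtext("title") or "").strip(),
            "link": (item.findtext("link") or "").strip(),
        })
    return posts

def build_index(posts):
    """Map each lowercase title word to the set of post positions containing it."""
    index = defaultdict(set)
    for pos, post in enumerate(posts):
        for word in re.findall(r"[a-z0-9]+", post["title"].lower()):
            index[word].add(pos)
    return index

def search(index, query):
    """Return sorted positions of posts whose titles contain every query word."""
    words = re.findall(r"[a-z0-9]+", query.lower())
    if not words:
        return []
    hits = set.intersection(*(index.get(w, set()) for w in words))
    return sorted(hits)
```

For example, `search(build_index(parse_rss(SAMPLE_FEED)), "solar factory")` looks up posts whose titles contain both words. With roughly 1200 seed blogs, a real deployment would need scheduled fetching, deduplication, and full-text indexing, which is where the tools named above come in.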

This will likely be deployed on a platform like Amazon AWS.

Type of Website: News Media / Informational Content

Other Skills: Hadoop, Mahout, Nutch, Solr, Lucene, Java

Apache Solr, Hadoop, Java, Website Design

Project ID: #4003882

About the project

11 proposals Remote project Active Jan 8, 2013

11 freelancers are bidding on average $4436 for this job

onelinewebdeUK

Hi, my name's Mark from The iDevelopment Team, based in Jersey. We specialise in web design & development and have proven reviews in this field. We would be pleased to help. Hope to hear from you soon and thank y...

$4950 USD in 45 days
(23 Reviews)
7.1
sritechnocrat

Sri Technocrat is marvelous in its quality. We have been maintaining the quality in every field whether it is services or training. We have proved our stability. We have been working with the same grace & quality. Our More

$3000 USD in 40 days
(30 Reviews)
6.5
buzzcoder

I'm interested in this project, please check your PM.

$5000 USD in 45 days
(40 Reviews)
5.4
ngcomp

Certified from Cloudera for Hadoop/NoSQL.

$5000 USD in 30 days
(1 Review)
3.6
pragyaatech

Sir, we have some suggestions to make.

$5000 USD in 90 days
(2 Reviews)
3.0
QehZ081TY

We are freelance software developers. If you contact me I can give a quote for your project and we can discuss the details. [Removed by Admin]

$4000 USD in 1 day
(0 Reviews)
0.0
srin123

I have very good knowledge of Hadoop and optimized Amazon EC2 setup, and I also have very good experience building community websites, from database design to front-end design.

$3800 USD in 45 days
(0 Reviews)
0.0
softomaniac2011

Dear Client, we have provided a brief SOW in PMB. Please check it and let me know your feedback. Cheers

$5000 USD in 45 days
(0 Reviews)
0.0
orda

Hello, we have built big projects like this and have worked with Solr, Lucene, and Java. My proposal and portfolio are in the PM. Thanks, Andrew

$5000 USD in 90 days
(0 Reviews)
0.8
patodirahul

I am very interested in this. Please check your personal messages.

$4000 USD in 50 days
(0 Reviews)
0.0