RapidMiner help

$10-30 USD

Closed
Posted almost 4 years ago

Paid on delivery
You will be working on a real dataset of spam emails. The dataset records the frequency of different words used in the emails, and each row represents one email. The last attribute, “is_spam”, indicates whether the email is spam.

Tasks:
1. Import the data from the [login to view URL] file into RapidMiner.
2. Measure the accuracy of a Decision Tree without any data pre-processing. Also try changing the parameters of the classifier and note which parameters give the best results.
3. Measure accuracy after normalizing the data (using range and z-score normalization).
4. Measure accuracy after discretizing the numeric attributes into 2, 3, and 4 bins (each).
5. Measure accuracy after reducing dimensions (use the “Weight by ...” and “Select by Weights” operators). Try different weighting schemes.
6. Measure accuracy using all of the pre-processing steps excluding dimensionality reduction (tasks 3 to 4).
7. Measure accuracy using all of the pre-processing steps including dimensionality reduction (tasks 3 to 5).
8. Try to find the combination of pre-processing steps that gives the best results.
9. Measure accuracy using a Neural Network classifier and suitable data pre-processing steps.
10. Measure accuracy using any 3 classifiers not used in the previous tasks, with suitable data pre-processing steps.
11. For measuring accuracy in all of the steps listed above, use 10-fold cross-validation (X-Validation). Use your student ID as the randomization seed wherever possible.

Deliverables:
1. RapidMiner process files (rmp files) for each task. The name of the file should be “[login to view URL]”. If a task has multiple sub-tasks (like tasks 3, 4, 5, 8, and 10), the names should be like “[login to view URL]”, “[login to view URL]”, “[login to view URL]”, ...
2. A report about the dataset and its characteristics. Discuss in detail which pre-processing steps were useful and which classifier produced the best results. The report should also include the results of all the tasks in the form of a table. You can generate the plots for comparing results using Microsoft Excel.
3. A PowerPoint presentation of your project.
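For readers unfamiliar with the pre-processing named in the tasks, the following is a minimal Python sketch of range normalization, z-score normalization, binning into a fixed number of bins, and a seeded 10-fold split. It is not RapidMiner code and does not reproduce RapidMiner's operators. All function names are hypothetical, equal-width bins and the population standard deviation are assumptions (the posting does not specify either), and the accuracy helper pools correct predictions over all folds rather than averaging per-fold scores.

```python
import random
import statistics


def range_normalize(values):
    """Rescale values linearly to [0, 1] (range / min-max normalization)."""
    lo, hi = min(values), max(values)
    if hi == lo:  # constant attribute: nothing to rescale
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def z_score_normalize(values):
    """Subtract the mean and divide by the population standard deviation (assumed)."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    if std == 0:
        return [0.0 for _ in values]
    return [(v - mean) / std for v in values]


def discretize_equal_width(values, n_bins):
    """Map each value to a bin index 0..n_bins-1 using equal-width bins (assumed)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0 for _ in values]
    width = (hi - lo) / n_bins
    # The maximum value would land in bin n_bins, so clamp it into the last bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def k_fold_indices(n_rows, k=10, seed=0):
    """Shuffle row indices with a fixed seed and deal them into k folds."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def cross_validated_accuracy(labels, fit_predict, k=10, seed=0):
    """Pooled accuracy over k folds.

    fit_predict(train_idx, test_idx) is a hypothetical callable that trains on
    the training rows and returns one predicted label per test row.
    """
    correct = 0
    for test_idx in k_fold_indices(len(labels), k, seed):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(labels)) if i not in held_out]
        predictions = fit_predict(train_idx, test_idx)
        correct += sum(p == labels[i] for p, i in zip(predictions, test_idx))
    return correct / len(labels)
```

Passing the same seed to `k_fold_indices` for every experiment keeps the folds identical across pre-processing variants, so differences in accuracy come from the pre-processing rather than from a different split. That is the role the student ID seed plays in task 11.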
Project ID: 26533495

About the project

1 proposal
Remote project
Active 4 yrs ago

1 freelancer is bidding on average $25 USD for this job
Hi, this is Gaurav Kumar. I am a computer engineer working part time for some pocket money. Thank you for the opportunity.
$25 USD in 5 days
0.0 (0 reviews)

About the client

Medina, Saudi Arabia
5.0
1
Payment method verified
Member since Nov 25, 2019

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)