Data Mining - Decision Tree Classifier and Pattern Mining.

Completed Posted 4 years ago Paid on delivery
Completed Paid on delivery

Develop a single [login to view URL] file to perform following 3 tasks. Each task is invoked using a different command line command.

Task 1: Build a decision tree classifier for an input dataset in the form of redwine.csv. Assume all attributes except the class are numerical, and the last attribute is the class. The output should be written to a file called [login to view URL] describing the accuracy of the classifier when tested on the input dataset and also the parameters used to build the classifier. [login to view URL] will perform this task when given these 2 command line arguments: 1, [login to view URL]; here [login to view URL] is the name of an input dataset.

Task 2: Find top 10 patterns from an input dataset D similar in form to the itemized data file produced in previous project (refer to [login to view URL]), except that the class values may be categorical. Assume the values of the last column are the classes. For convenience, here we refer to one class as P and the other as N. If the input file uses other class names, treat the lexically smaller as P and the other as N. (Assume there are just two classes.) The top 10 patterns should contain the top 5 among the single item patterns, plus the top 5 among all non-singleton patterns. The expectation is that you can find the correct top 5 single-item patterns, but you might not be able to find the true top 5 among all non-singleton patterns. The ranking is determined using the function GR(X) * sup(X,D), where X is a pattern and GR(X) is defined as (sup(X,P)+1)/(sup(X,N)+1). The top 10 patterns must be written to an output file called top10EPs.csv. For each of top 10 patterns X={x1,x2,…,xk}, there should be a line in this file containing the following:

GR(X) * sup(X,D), GR(X), sup(X,D), x1,x2,…,xk

[login to view URL] will perform this task when given these 2 command line arguments: 2, [login to view URL]; here [login to view URL] is the name of an input dataset.

Task 3: Given an input dataset D3 similar in form to [login to view URL] and a pattern X, build a decision tree classifier on mds(X,D3). Recall that mds(X,D) is the subset of instances in D satisfying all single-attribute conditions in X. The output is written to a file called [login to view URL] describing the accuracy of the classifier when tested on mds([login to view URL]) as a test dataset, the number of instances in mds([login to view URL]), and also the parameters used to build the classifier. [login to view URL] should perform this task when given these 3 commandline arguments: 3, [login to view URL], X

Here [login to view URL] is the name of an input dataset and X is a pattern. When specifying patterns, we will refer to the attributes as A1,…,An, in the left to right order as they appear in the input data. Any pattern will be described in the following form:

Ai1 CMP Val1, Ai2 CMP Val2, …, Ai3 CMP Val3

Here CMP is either < or >, Ai1, Ai2, …, Ai3 are attributes, and Val1, …, and Val3 are numerical values. A pattern can contain one or more attributes. An example pattern is A3<5, A6>4.

All outputs are written to the folder where the jar file is located.

Correctness, efficiency, and quality of the findings (for Task 2) will be important factors for marking and evaluation.

You should use Weka for classification and you can use any frequent pattern mining programs written in Java in your program. You cannot use emerging/contrast pattern mining programs written by others; of course you can implement your own.

Include a [login to view URL] file indicate which frequent pattern mining program you are using (indicate developed by whom and the source URL).

All the above information is included in [login to view URL]

Data Mining Data Processing Data Science Data Analytics Java

Project ID: #24443923

About the project

11 proposals Remote project Active 4 years ago

Awarded to:

apr159

Hello My name is Alejandro Pasos. I have a MS in computer science at University of California. Right now I am working with java weka so I am clear about that is required on your task. I am ready to start, I read your More

$250 USD in 14 days
(46 Reviews)
5.6

11 freelancers are bidding on average $195 for this job

dvcontact

Hi, Would you like to see a quick demo of your Decision tree classifier created for pattern mining and analysis before you award the project? You can consider it professionally done in a few hours. I can continue tell More

$150 USD in 5 days
(34 Reviews)
6.1
anchalsingh0005

Hello, ‌Hope you doing well.I have checked all your requirements and we able to do this and deliver in time.I have 5 years of experience in these types of work. So, I believe we can do that work with your support. ‌Reg More

$100 USD in 7 days
(30 Reviews)
4.9
MohammedSulhi

I am an experienced Data Scientist and Machine Learning Engineer. Deep learning, Artificial intelligence, machine learning, Data structures, and algorithms are my major fields. I finished specializations on Data Scienc More

$100 USD in 2 days
(6 Reviews)
3.5
hayat38402

I have extensive experience in the domains of:  Neural Network/ Natural Language Processing  Machine learning/Data Mining  Deep Learning and Computer Vision  Image Recognition & Artificial Intelligence  AI text a More

$280 USD in 7 days
(2 Reviews)
2.8
merinsinha

Hi i am senior Devoloper. For Professional works -Hire me. As 10+ years experiences in these field. I can give good quality work. I have read the guidelines of your work.I believe that i can provide you the best qualit More

$250 USD in 5 days
(2 Reviews)
0.4
ajcsgi82893181

plz give me a chance. plzzzzz

$155 USD in 3 days
(0 Reviews)
0.0
sednalondon

My proposal is founded in my long experience in the C++ code and the collection of Machine Learning tools that I have to apply to this kind of problem. Classification task using Supervised/Unsupervised Machine Learning More

$290 USD in 3 days
(0 Reviews)
0.0
DjSalman

I am a java Expert & have got plenty of experience in java, data processing, data mining, data science, data analytics kinds of work I assure you that i can do this work as I have recently done this sort of work Looki More

$218 USD in 7 days
(0 Reviews)
0.0
ranjithkumargkp

I have good experience on machine learning field and believe me I'm going to deliver you on time , my skills are SQL, python , r , SAS Tableau for visualization , I have good command over machine learning algorithms o More

$105 USD in 15 days
(0 Reviews)
0.0