icon

Usetutoringspotscode to get 8% OFF on your first order!

data mining

data miningPaper details:
Experimentation with Classification: Choose a dataset that is well suited for classification. You can use any dataset that you would like to classify. A good number of datasets can be found in the UCI machine learning data repository but feel free to use any dataset that you want. Make sure that you select a dataset that has a class variable. Then use a tool such as R to classify the dataset. The specific requirements for the assignment are as follows:

• Choose a dataset that is of interest to you and is well suited for classification
• Give a brief description of the dataset
• Test at least 3 classification algorithms. There are many algorithms available for R.
o A good resource for R can be found at the Data Mining Algorithms in R Wikibook
? http://en.wikibooks.org/wiki/Data_Mining_Algorithms_In_R/Classification
o Also the caret package in R would is a great place to start experimenting with classification methods.
• Design an experiment using training and testing (holdout method), cross-validation, or the bootstrap method.
• Compare the results of three or more classification methods using the same experimental setup using one or more classification evaluation methods discussed in class. The metrics that you choose are up to you and can include accuracy, error rate, sensitivity, specificity, precision, recall, and F measure.
• Write a report that describes your experiment and results. The report should be in IEEE conference paper format and should include an introductory section that details the dataset and the objectives of the analysis, a methodology section that explains the approach that you are using to mine the dataset including the algorithms and parameters (e.g. confidence and support) as well as any steps that you had to take to preprocess the data, a results section that shows the results of your analysis and any interesting patterns that you found, and a conclusion section that summarizes your results and discusses the limitations of your approach and any difficulties that you had with your experiment.
o Links to format templates:
http://www.ieee.org/conferences_events/conferences/publishing/templates.html
http://www.acm.org/sigs/publications/proceedings-templates

You can leave a response, or trackback from your own site.

Leave a Reply

Powered by WordPress | Designed by: Premium WordPress Themes | Thanks to Themes Gallery, Bromoney and Wordpress Themes