CS2220 Introduction to Computational Biology Weka Introduction Xiaoli Li

Apr 09, 2022

Transcript
Page 1: CS2220 Introduction to Computational Biology

CS2220 Introduction to Computational Biology 

Weka Introduction

Xiaoli Li

Page 2: CS2220 Introduction to Computational Biology

http://www.kdnuggets.com/polls/2012/analytics-data-mining-big-data-software.html

Most Popular Tools for Data Mining

Page 3: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in Java

http://www.cs.waikato.ac.nz/~ml/weka/

Page 4: CS2220 Introduction to Computational Biology

Select the Data for Exploring

Page 5: CS2220 Introduction to Computational Biology

We can see and edit our data

Page 6: CS2220 Introduction to Computational Biology

Actual Weka data format

Example: Predict if we want to play
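The screenshot of the data file is not preserved in this transcript. Weka's native file format is ARFF: a header that names the relation and declares each attribute, followed by an `@data` section with one comma-separated instance per line. The sketch below is a minimal illustration in plain Python, not Weka's own loader. The ARFF text is written in the style of the nominal weather data distributed with Weka; the four rows are illustrative and may not match the slide, and `parse_arff` is a hypothetical helper that handles nominal attributes only.

```python
# Illustrative ARFF text in the style of Weka's nominal weather data.
# The rows are examples for this sketch, not a copy of the slide.
ARFF_TEXT = """\
@relation weather.symbolic

@attribute outlook {sunny, overcast, rainy}
@attribute temperature {hot, mild, cool}
@attribute humidity {high, normal}
@attribute windy {TRUE, FALSE}
@attribute play {yes, no}

@data
sunny,hot,high,FALSE,no
overcast,hot,high,FALSE,yes
rainy,mild,high,FALSE,yes
rainy,cool,normal,TRUE,no
"""


def parse_arff(text):
    """Parse a small ARFF file that contains nominal attributes only.

    Returns (relation, attributes, rows), where attributes maps each
    attribute name to its list of allowed values.
    """
    relation, attributes, rows = None, {}, []
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("%"):  # skip blanks and comments
            continue
        lower = line.lower()
        if in_data:
            rows.append([v.strip() for v in line.split(",")])
        elif lower.startswith("@relation"):
            relation = line.split(None, 1)[1]
        elif lower.startswith("@attribute"):
            _, name, spec = line.split(None, 2)
            attributes[name] = [v.strip() for v in spec.strip("{}").split(",")]
        elif lower.startswith("@data"):
            in_data = True
    return relation, attributes, rows


relation, attributes, rows = parse_arff(ARFF_TEXT)
print(relation)
print(list(attributes))
print(len(rows), "instances; class of the first instance:", rows[0][-1])
```

By convention the last attribute (`play` here) is treated as the class label to predict.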

Page 7: CS2220 Introduction to Computational Biology

Class label: To Play or Not to Play?

Page 8: CS2220 Introduction to Computational Biology

Outlook: To Play or Not to Play?
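The per-attribute slides relate one attribute at a time to the class label; their screenshots are not preserved here. A reasonable reading is that each shows how the class values are distributed across the values of that attribute. The sketch below computes such a table in plain Python. The `records` list is hypothetical data made up for illustration, and `class_counts_by_value` is not a Weka function.

```python
from collections import Counter, defaultdict

# Hypothetical (outlook, play) pairs, for illustration only.
records = [
    ("sunny", "no"), ("sunny", "no"), ("sunny", "yes"),
    ("overcast", "yes"), ("overcast", "yes"),
    ("rainy", "yes"), ("rainy", "no"),
]


def class_counts_by_value(pairs):
    """Count class labels separately for each attribute value."""
    counts = defaultdict(Counter)
    for value, label in pairs:
        counts[value][label] += 1
    return counts


counts = class_counts_by_value(records)
for value, by_class in counts.items():
    print(value, dict(by_class))
```

An attribute value whose instances nearly all share one class (such as `overcast` in this made-up data) is a strong hint for a classifier.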

Page 9: CS2220 Introduction to Computational Biology

Temperature: To Play or Not to Play?

Page 10: CS2220 Introduction to Computational Biology

Humidity: To Play or Not to Play?

Page 11: CS2220 Introduction to Computational Biology

Windy: To Play or Not to Play?

Page 12: CS2220 Introduction to Computational Biology

Classification: To Play or Not to Play?

Page 13: CS2220 Introduction to Computational Biology

J48 Decision Tree: To Play or Not to Play?

Actual tree structure
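J48 is Weka's implementation of the C4.5 decision-tree learner, which chooses splits using an entropy-based criterion (gain ratio, a normalised form of information gain). The sketch below computes plain information gain for a single attribute. It is meant only to show the quantity involved and does not reproduce J48. Both datasets are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(pairs):
    """Information gain of splitting on an attribute.

    pairs is a list of (attribute_value, class_label) tuples.
    """
    labels = [label for _, label in pairs]
    groups = {}
    for value, label in pairs:
        groups.setdefault(value, []).append(label)
    # Weighted average entropy of the subsets produced by the split.
    remainder = sum(len(g) / len(pairs) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


# Hypothetical attribute that separates the classes perfectly.
perfect = [("a", "yes"), ("a", "yes"), ("b", "no"), ("b", "no")]
# Hypothetical attribute that carries no information about the class.
useless = [("a", "yes"), ("a", "no"), ("b", "yes"), ("b", "no")]

print(information_gain(perfect))
print(information_gain(useless))
```

A tree learner of this family evaluates every candidate attribute in this way, splits on the best one, and then repeats the process within each branch.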

Page 14: CS2220 Introduction to Computational Biology

Decision Tree: To Play or Not to Play?

Training set has been used as test set

Page 15: CS2220 Introduction to Computational Biology

Decision Tree: To Play or Not to Play?

Leave-one-out CV

Tree structure
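Evaluating on the training set tends to overstate accuracy, because the model has already seen every test instance. Leave-one-out cross-validation instead holds out each instance in turn, trains on the rest, and tests on the held-out instance. The sketch below contrasts the two estimates. It uses a 1-nearest-neighbour classifier as a stand-in (not the J48 tree from the slides) because it makes the effect easy to see, and the one-feature dataset is hypothetical.

```python
def nearest_label(train, x):
    """Label of the training point closest to x (1-nearest-neighbour)."""
    return min(train, key=lambda pair: abs(pair[0] - x))[1]


def resubstitution_accuracy(data):
    """Accuracy when the training set is also used as the test set."""
    hits = sum(nearest_label(data, x) == y for x, y in data)
    return hits / len(data)


def leave_one_out_accuracy(data):
    """Accuracy when each instance is predicted from all the others."""
    hits = 0
    for i, (x, y) in enumerate(data):
        train = data[:i] + data[i + 1:]  # everything except instance i
        hits += nearest_label(train, x) == y
    return hits / len(data)


# Hypothetical (feature, label) pairs, for illustration only.
data = [(10, "no"), (15, "no"), (25, "yes"),
        (60, "yes"), (64, "yes"), (70, "no")]

print("training set as test set:", resubstitution_accuracy(data))
print("leave-one-out:", leave_one_out_accuracy(data))
```

On this made-up data the training-set estimate is perfect, since every point is its own nearest neighbour, while leave-one-out misclassifies the two points whose nearest remaining neighbour has the other label.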

Page 16: CS2220 Introduction to Computational Biology

Decision Tree: To Play or Not to Play?

Change the parameter of J48

Page 17: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in Java

Page 18: CS2220 Introduction to Computational Biology

Pattern Discovery: Clustering

Find “natural” groupings of instances given unlabeled data
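The transcript does not record which clustering algorithm the slides use. The sketch below assumes k-means, a common choice that Weka also offers, and implements it in plain Python rather than through Weka. The points and the initial centroids are hypothetical; real k-means implementations usually pick the initial centroids at random.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, centroids, iterations=10):
    """Basic k-means with fixed starting centroids.

    Returns the final centroids and the clusters from the last
    assignment step (consistent with the centroids once converged).
    """
    clusters = []
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: squared_distance(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster;
        # an empty cluster keeps its previous centroid.
        centroids = [
            tuple(sum(c) / len(cluster) for c in zip(*cluster)) if cluster else old
            for cluster, old in zip(clusters, centroids)
        ]
    return centroids, clusters


# Hypothetical 2-D points forming three loose groups.
points = [(1, 1), (1, 2), (2, 1),
          (5, 5), (5, 6), (6, 5),
          (9, 1), (9, 2), (8, 1)]
centroids, clusters = kmeans(points, centroids=[(0, 0), (5, 7), (10, 0)])

for centre, members in zip(centroids, clusters):
    print(centre, members)
```

No class labels are used anywhere in the procedure: the groups emerge from distances alone, which is what distinguishes clustering from classification.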

Page 19: CS2220 Introduction to Computational Biology

Example: 3 Types of IRIS Plant {Setosa, Versicolor & Virginica}

Page 20: CS2220 Introduction to Computational Biology

Example: 3 Types of IRIS Plant {Setosa, Versicolor & Virginica}

Page 21: CS2220 Introduction to Computational Biology

Clustering: 3 Types of IRIS Plant {Setosa, Versicolor & Virginica}

Page 22: CS2220 Introduction to Computational Biology

Clustering: 3 Types of IRIS Plant {Setosa, Versicolor & Virginica}

Page 23: CS2220 Introduction to Computational Biology

Clustering: 3 Types of IRIS Plant {Setosa, Versicolor & Virginica}

Page 24: CS2220 Introduction to Computational Biology

Clustering: 3 Types of IRIS Plant {Setosa, Versicolor & Virginica}

Page 25: CS2220 Introduction to Computational Biology

Clustering: 3 Types of IRIS Plant {Setosa, Versicolor & Virginica}

Page 26: CS2220 Introduction to Computational Biology

Association Rules (Unsupervised Learning)

Finding groups of items that tend to occur together
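Association rules of the form "antecedent → consequent" are usually ranked by two measures: support (the fraction of transactions containing all items in the rule) and confidence (among transactions containing the antecedent, the fraction that also contain the consequent). The sketch below computes both in plain Python. The `baskets` are hypothetical and are not the supermarket data used in the slides, and no rule-mining algorithm is implemented here.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimated probability of the consequent given the antecedent."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


# Hypothetical shopping baskets, for illustration only.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

print("support(bread, milk):", support(baskets, {"bread", "milk"}))
print("confidence(bread -> milk):", confidence(baskets, {"bread"}, {"milk"}))
```

A rule miner searches for all rules whose support and confidence exceed user-chosen thresholds; the two functions above only score a rule that is already given.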

Page 27: CS2220 Introduction to Computational Biology

WEKA – Data Mining Software in Java

Page 28: CS2220 Introduction to Computational Biology

Example: Supermarket Purchases

Page 29: CS2220 Introduction to Computational Biology

Example: Supermarket Purchases

Page 30: CS2220 Introduction to Computational Biology

Association Rules: Supermarket Purchases

Page 31: CS2220 Introduction to Computational Biology

Association Rules: Supermarket Purchases

Page 32: CS2220 Introduction to Computational Biology

Contact: [email protected] if you have questions