CS2220 Introduction to Computational Biology Weka Introduction Xiaoli Li
CS2220 Introduction to Computational Biology
Weka Introduction
Xiaoli Li
http://www.kdnuggets.com/polls/2012/analytics-data-mining-big-data-software.html
Most Popular Tools for Data Mining
WEKA – Data Mining Software in Java
http://www.cs.waikato.ac.nz/~ml/weka/
WEKA – Data Mining Software in JavaSelect the Data for Exploring
We can see and edit our data
WEKA – Data Mining Software in Java
Actual Weka data format
Example: Predict if we want to play
WEKA – Data Mining Software in JavaClass label: To Play or Not to Play?
WEKA – Data Mining Software in JavaOutlook: To Play or Not to Play?
WEKA – Data Mining Software in Java
http://www.cs.waikato.ac.nz/~ml/weka/
Temperature: To Play or Not to Play?
WEKA – Data Mining Software in Java
http://www.cs.waikato.ac.nz/~ml/weka/
Humidity: To Play or Not to Play?
WEKA – Data Mining Software in JavaWindy: To Play or Not to Play?
WEKA – Data Mining Software in JavaClassification: To Play or Not to Play?
WEKA – Data Mining Software in JavaJ48 Decision Tree: To Play or Not to Play?
Actual tree structure
WEKA – Data Mining Software in JavaDecision Tree: To Play or Not to Play?
Training set has been used as test set
WEKA – Data Mining Software in JavaDecision Tree: To Play or Not to Play?
Leave one out CVTree structure
WEKA – Data Mining Software in JavaDecision Tree: To Play or Not to Play?
Change the parameter of J48
WEKA – Data Mining Software in Java
Pattern Discovery: ClusteringFind “natural” grouping of instances
given un-labeled data
WEKA – Data Mining Software in JavaExample: 3 Types of IRIS Plant{Setosa, Versicolor & Virginica}
WEKA – Data Mining Software in JavaExample: 3 Types of IRIS Plant{Setosa, Versicolor & Virginica}
WEKA – Data Mining Software in JavaClustering: 3 Types of IRIS Plant{Setosa, Versicolor & Virginica}
WEKA – Data Mining Software in Java
http://www.cs.waikato.ac.nz/~ml/weka/
Clustering: 3 Types of IRIS Plant{Setosa, Versicolor & Virginica}
WEKA – Data Mining Software in Java
http://www.cs.waikato.ac.nz/~ml/weka/
Clustering: 3 Types of IRIS Plant{Setosa, Versicolor & Virginica}
WEKA – Data Mining Software in Java
http://www.cs.waikato.ac.nz/~ml/weka/
Clustering: 3 Types of IRIS Plant{Setosa, Versicolor & Virginica}
WEKA – Data Mining Software in Java
http://www.cs.waikato.ac.nz/~ml/weka/
Clustering: 3 Types of IRIS Plant{Setosa, Versicolor & Virginica}
Association Rules (Unsupervised Learning)Finding groups of items that tend to
occur together
WEKA – Data Mining Software in Java
WEKA – Data Mining Software in JavaExample: Supermarket Purchases
WEKA – Data Mining Software in JavaExample: Supermarket Purchases
WEKA – Data Mining Software in Java
http://www.cs.waikato.ac.nz/~ml/weka/
Association Rules: Supermarket Purchases
WEKA – Data Mining Software in JavaAssociation Rules: Supermarket Purchases
Contact: [email protected] if you have questions