1 UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY Tutorial 1: Introduction to WEKA and YALE TIES443: Introduction to DM 1 Tutorial 1 Tutorial 1 Prototyping DM Techniques Prototyping DM Techniques with WEKA and YALE with WEKA and YALE Open Open- Source Software Source Software Department of Mathematical Information Technology University of Jyväskylä Mykola Pechenizkiy Course webpage: http://www.cs.jyu.fi/~mpechen/TIES443 TIES443 November 7, 2006 UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY Tutorial 1: Introduction to WEKA and YALE TIES443: Introduction to DM 2 Contents Contents • Brief Review of DM Software – Commercial – Open-source • WEKA http://www.cs.waikato.ac.nz/~ml/weka/index.html • YALE http://rapid-i.com / • The R Project for Statistical Computing http://www.r-project.org / • Pentaho – whole BI solutions. http://www.pentaho.com/ – Matlab – Sami will tell you more during the 2nd Tutorial • WEKA vs. YALE Comparison – Exploration – Experimentation – Visualization • 1 st Assignment http://www.cs.jyu.fi/~mpechen/TIES443/tutorials/assignment1.pdf UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY Tutorial 1: Introduction to WEKA and YALE TIES443: Introduction to DM 3 Data Mining Software Data Mining Software • Many providers of commercial DM software – SAS Enterprise Miner, SPSS Clementine, Statistica Data Miner, MS SQL Server, Polyanalyst, KnowledgeSTUDIO, … – IBM Intelligent Miner. • Universities can now receive free copies of DB2 and Intelligent Miner for educational or research purposes. – See http://www.kdnuggets.com/software/suites.html for a list • Open Source: – WEKA (Waikato Environment for Knowledge Analysis) – YALE (Yet Another Learning Environment) – Many others • MLC++, Minitab, AlphaMiner, Rattle, KNIME – The Pentaho BI project – • “a pioneering initiative by the Open Source development community to provide organizations with a comprehensive set of BI capabilities that enable them to radically improve business performance, efficiency, and effectiveness.” UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY Tutorial 1: Introduction to WEKA and YALE TIES443: Introduction to DM 4 Data Mining with WEKA Data Mining with WEKA Copyright: Martin Kramer ([email protected]) The following slides are from http://prdownloads.sourceforge.net/weka/weka.ppt by Eibe Frank UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY Tutorial 1: Introduction to WEKA and YALE TIES443: Introduction to DM 5 WEKA: the software WEKA: the software • Machine learning/data mining software written in Java (distributed under the GNU Public License) • Used for research, education, and applications • Complements “Data Mining” book by Witten & Frank – http://www.cs.waikato.ac.nz/~ml/weka/book.html • Main features: – Comprehensive set of data pre-processing tools, learning algorithms and evaluation methods – Graphical user interfaces (incl. data visualization) – Environment for comparing learning algorithms UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY Tutorial 1: Introduction to WEKA and YALE TIES443: Introduction to DM 6 @relation heart-disease-simplified @attribute age numeric @attribute sex { female, male} @attribute chest_pain_type { typ_angina, asympt, non_anginal, atyp_angina} @attribute cholesterol numeric @attribute exercise_induced_angina { no, yes} @attribute class { present, not_present} @data 63,male,typ_angina,233,no,not_present 67,male,asympt,286,yes,present 67,male,asympt,229,yes,present 38,female,non_anginal,?,no,not_present ... WEKA only deals with WEKA only deals with “flat flat” files files
33
Embed
Prototyping DM Techniques with WEKA and YALE …mpechen/courses/TIES443/tutorials/tutorial1.pdf · Prototyping DM Techniques with WEKA and YALE ... – SAS Enterprise Miner, SPSS
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
1
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 1
Tutorial 1Tutorial 1
Prototyping DM Techniques Prototyping DM Techniques with WEKA and YALE with WEKA and YALE OpenOpen--Source SoftwareSource Software
Department of Mathematical Information TechnologyUniversity of Jyväskylä
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 3
Data Mining SoftwareData Mining Software
• Many providers of commercial DM software– SAS Enterprise Miner, SPSS Clementine, Statistica Data Miner, MS SQL Server, Polyanalyst, KnowledgeSTUDIO, …
– IBM Intelligent Miner. • Universities can now receive free copies of DB2 and Intelligent Miner for educational or research purposes.
– See http://www.kdnuggets.com/software/suites.html for a list
• Open Source:– WEKA (Waikato Environment for Knowledge Analysis)– YALE (Yet Another Learning Environment)– Many others
• MLC++, Minitab, AlphaMiner, Rattle, KNIME
– The Pentaho BI project –• “a pioneering initiative by the Open Source development communityto provide organizations with a comprehensive set of BI capabilities that enable them to radically improve business performance, efficiency, and effectiveness.”
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 4
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 142
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 143
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 144
25
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 145
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 146
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 147
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 148
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 149
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 150
26
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 151
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 152
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 153
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 154
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 155
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 156
27
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 157
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 158
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 159
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 160
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 161
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 162
Conclusion: try it yourself!Conclusion: try it yourself!
• WEKA is available at
http://www.cs.waikato.ac.nz/ml/weka
� Also has a list of projects based on WEKA
� YALE has different interfaces and ideas behind but it also integrates all available DM techniques from WEKA
28
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 163
YALE YALE –– Yet Another Learning Yet Another Learning
EnvironmentEnvironment
Artificial Intelligence Unit of the University of Dortmund.
The following slides are compiled from screenshots and related descriptions available from YALE pages http://rapid-i.com/
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 164
Features of YALEFeatures of YALE
• freely available open-source knowledge discovery environment
• 100% pure Java (runs on every major platform and operating system)
• KD processes are modeled as simple operator trees which is both intuitive and powerful
• operator trees or subtrees can be saved as building blocksfor later re-use
• internal XML representation ensures standardized interchange format of data mining experiments
• simple scripting language allowing for automatic large-scale experiments
• multi-layered data view concept ensures efficient and transparent data handling
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 165
Features of YALEFeatures of YALE
• Flexibility in using YALE: – graphical user interface (GUI) for interactive prototyping
– command line mode (batch mode) for automated large-scale applications
– Java API to ease usage of YALE from your own programs
• simple plugin and extension mechanisms, some plugins already exists and you can easily add your own
• powerful plotting facility offering a large set of sophisticated high-dimensional visualization techniques for data and models
• more than 350 machine learning, evaluation, in- and output, pre- and post-processing, and visualization operators plus numerous meta optimization schemes
• machine learning library WEKA fully integrated
• YALE’s potential application include text mining, multimedia mining, feature engineering, data stream mining and tracking drifting concepts, development of ensemble methods, and distributed data mining.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 166
Experiment Setup Experiment Setup the initial operator tree which only consist of a
root node.
The lower part of the YALE main frame serves for displaying and viewing log and error messages.
The "Tree View" tab is the most often used editor for YALE experiments. Left: the current operator tree. Right: a table with the parameters of the currently selected operator.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 167
After the learning operator "J48", a breakpoint indicates that the intermediate results can be inspected. Due to the modular concept of YALE, it is always possible to inspect and save intermediate results, e.g. the results for each individual run in a cross validation
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 168
add new operators to the experiment:• directly from the context menu of its parent. •the new operator dialogshown in this screenshot. Several search constrains exist and a short description for each operator is shown
29
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 169
The operator trees are coded and represented by a simple XML format. The XML editor tab allows for fast and direct manipulations of the current experiment.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 170
The "Box View" - is another viewer for YALE experiments. • the box format is an intuitive way of representing the nesting of the operators.• but editing is not possible
All views can also be printed and exported to a wide range of graphic formats including jpg, png, ps and pdf.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 171
"Monitor" tab provides an overview of the currently used memory and is an important tool for large-scale data mining tasks on huge data sets. The amount of used memory during an experiment run can even be logged in the same way like all other provided logging values.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 172
Data can be imported from several file formats with the attribute editor. Other file formats like Arff, C45, csv, and dBase can be loaded with specialized operators. Attribute Editor can be used to create meta data descriptions from almost arbitrary file formats. These meta data descriptions can then be used for an input operator which actually loads the data.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 173
Additional attributes (features) can easily be constructed from your data. YALE provides several approaches to construct the best feature space automatically. These approaches range from feature space transformations like PCA, GHA, ICA or the kernel versions to standard feature selection techniques to several evolutionary approaches for feature construction and extraction.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 174
Help features to ease the learning phase for new users:• An online tutorial, • tool tip texts, • a beginner and expert mode, operator info screens, • a GUI manual, • and the YALE tutorial.
30
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 175
Data Visualization Data Visualization
Each time a data set is presented in the results tab (e.g. after loading it), several views appear: a meta data view describing all attributes, a data view showing the actual data and a plot view providing a large set of (high-dimensional) plotters for the data set at hand.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 176
The basic scatter plotter: Two of the attribute are used as axes, the class label attribute is used for colorization. The legend at the top maps the colors used to the classes or, in case of a real-valued color plot column, to the corresponding real values.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 177
The standard scatter plotter even allows jittering, zooming, and displaying example ids. Double-clicking a data point opens a visualizer. The standard example visualizer is presented here.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 178
2D scatter plots can be put together to a scatter plot matrixwhere for all pairs of dimensions a usual scatter plot is drawn. This plotter is only available for less then 10 dimensions. For higher number of dimensions one of the other high-dimensional data plotter presented below should be used.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 179
A 3D scatter plot exists similar to the colorized 2D scatter plot discussed above. The viewport can be rotated and zoomed to fit your needs. The built-in 2D and 3D plotters are a quick and easy way to view your numerical and nominal results, even as online plot at experiment runtime!
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 180
SOM (Self-Organizing Map) plotter which uses a Kohonen net for dimensionality reduction. Plotting of the U-, the P-, and the U*-Matrix are supported with different color schemes. The data points can be colorized by one of the data columns, e.g. with the prediction label.
31
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 181
SOM (Self-Organizing Map) plotter which uses a Kohonennet for dimensionality reduction. a gray scale color scheme was used to plot the U-Matrix.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 182
The parallel plotter prints the axes of all dimensions parallel to each other. This is the natural visualization technique for series data but can also be useful for other types of data. The main advantage of parallel plots is that a very high number of dimensions can be visualized with this technique. The dimensions are colorized with the feature weights. The more yellow a dimension is marked, the more important this column is.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 183
quartile plots (also known as box plots) are often used for experiment results like performance values but it is possible to summarize the statistical properties of data columns in general with this type of plot.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 184
Histogram plots(also known as distribution plots)
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 185
RadViz is another high-dimensional data plotter where the data columns are placed as radial dimension anchors. Each data point is connected to each anchor with a spring corresponding to the feature values. This will lead to a fixed position in the two-dimensional plane. Again, weights are used to mark the more important columns.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 186
A survey plot is a sort of vertical histogram matrix also suitable for a large number of dimensions. Each line corresponds to one data point and can be colorized by one of the columns. The length of each section corresponds to the value of the data point for that dimension. For up to three dimensions the order of the histograms can be selected.
32
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 187
Andrews curves are another way of visualizing high-dimensional data. Each data point is projected onto a set of orthogonal trigonometric functions and displayed as a curve. It is known that Andrews curves preserve distances, so they have many uses for data analysis and exploration. Often outliers and hidden patterns can be well detected in these plots.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 188
Visualization of Models and other Results Visualization of Models and other Results
The result of a learning step is called model. Some models provide a graphical representation of the learned hypothesis. This screenshot presents a learned decision tree for the widely known "labor negotiations" data set from the UCI repository. Results like learned models, performance values, data sets or selected attributes are displayed when the experiment is completed or a breakpoint is reached
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 189
In cases where no graphical representation of a learned model is available, at least a textual description of the learned model is presented. In this screenshot you see a Stacking model consisting of a rule model (the upper half) and a neural network (starts at the lower half). Both base models are described by simple and understandable texts.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 190
This is a density plot (similar to a contour plot) of the decision function of a Support Vector Machine (SVM). Almost all SVM implementations in YALE provide a table and a plot view of the learned model. In this screenshot, red points refer to support vectors, blue points to normal training examples. Bluish regions will be predicted negative, reddish regions will be predicted positive.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 191
only the support vectors are shown colorized by the preditcted function value for the corresponding data point. Examples on the red side will be predicted positive; examples on the blue side will be predicted negative. There is a perfectly linear separation in two of the dimensions and it seems to be that the parameters were not chosen optimal since the number of support vectors is rather high.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 192
alpha values (Lagrange multipliers) of the SVM are plotted against the function values and colorized with the true label. We applied a slight jittering to make more points visible. This model seems to be "well-learned", since only few points have a alpha value not equal to zero and these are the points with function values approximately 0.
33
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 193
This surface plot presents the result of a meta optimization experiment: the parameters of one of the operators are optimized. the plot can be rotated and zoomed.
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 194
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 195
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 196
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 197
UNIVERSITY OF JYVÄSKYLÄ DEPARTMENT OF MATHEMATICAL INFORMATION TECHNOLOGY
Tutorial 1: Introduction to WEKA and YALETIES443: Introduction to DM 198
• My advise for you is to come back to this assignment and WEKA and YALE tools after each forthcoming lecture to see how the things are implemented and can be used in practice.