Top Banner
Students Academic Performance Knowledge Discovery from Data
31

Students academic performance using clustering technique

Apr 12, 2017

Download

Education

saniacorreya
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Students academic performance using clustering technique

Students Academic Performance

Knowledge Discovery from Data

Page 2: Students academic performance using clustering technique

Introduction..

Our project aim is to find students academic performance

and find out whether there is any general pattern in their

marks and performance.

So here ,We are analyzing both internal and external

marks of a student.

We did the following KDD preprocessing steps to mine

our data.

Page 3: Students academic performance using clustering technique

Learning the application domain

Learning the application domain is the first step in KDD process .

Need to have a clear understanding about the application

domain and our objectives.

The institution considered for mining is MCA batch of Rajagiri

College of Social Sciences.

We collected all previous year academic record from the

department of computer science

Page 4: Students academic performance using clustering technique

Create a target data set: data selection

 We selected 2007-2010 batch marks for analysing the

pattern.

There were around 45 records(45 students).

Both the internal and external marks of each student were

selected, in order to find out the performance pattern.

Page 5: Students academic performance using clustering technique

Internal & External Dataset

Page 6: Students academic performance using clustering technique

Data cleaning & preprocessing

Data cleaning is the step where noise and irrelevant data are

removed from the large data set.

This is a very important pre-processing step because our

outcome would be dependent on the quality of selected data.

Remove duplicate records, enter logically correct values for

missing records(absent students), remove unnecessary data

fields and standardize data format.

Page 7: Students academic performance using clustering technique

There was no much duplicate data or unnecessary data in the

collected record . The dataset was partially cleaned.

Student internal mark and external mark were stored in

different records.

By applying data integration these records were integrated

into one record.

The new dataset consist of internal mark details and external

mark details of each student in one record.

Page 8: Students academic performance using clustering technique
Page 9: Students academic performance using clustering technique

Data reduction & transformation

Data is transformed into appropriate form for making it ready for

data mining step.

The dataset contains marks of 5 theory paper and 2 lab paper of

all 5 semesters.

These marks are transformed into sum of internal marks and sum

of external marks of each student for the easiness of analysing

the pattern.

Page 10: Students academic performance using clustering technique
Page 11: Students academic performance using clustering technique

Cluster Analysis The data mining technique we used here is clustering. A cluster is a collection of data objects that are similar to

one another within same cluster and are dissimilar to objects in other cluster.

We first partitioned the set of data into groups based on data similarity and then assign labels

Choosing functions of data mining

Page 12: Students academic performance using clustering technique

K-MEANS Partitioning The K-means algorithm takes input parameter k and

partitions the set of n objects into k clusters. Here we selected no: of cluster as 4 Objects are distributed to a cluster based on cluster

center to which it is nearest. For each semester we found out the clusters separately

and labeled them as students Excellent, Good, Fair and Poor

Choosing mining algorithms

Page 13: Students academic performance using clustering technique

The Tool used for pattern evaluation is ORANGE

Page 14: Students academic performance using clustering technique

Orange Cluster Analysis

Page 15: Students academic performance using clustering technique

No of cluster selected is 4

Page 16: Students academic performance using clustering technique

Semester 1

poor

Fair

Good

Excellent

Page 17: Students academic performance using clustering technique

Semester 2

Page 18: Students academic performance using clustering technique

Semester 3

Page 19: Students academic performance using clustering technique

Semester 4

Page 20: Students academic performance using clustering technique

Semester 5

Page 21: Students academic performance using clustering technique

Centroid Analysis

Page 22: Students academic performance using clustering technique

Semester 1

Page 23: Students academic performance using clustering technique

Semester 2

Page 24: Students academic performance using clustering technique

Semester 3

Page 25: Students academic performance using clustering technique

Semester 4

Page 26: Students academic performance using clustering technique

Semester 5

Page 27: Students academic performance using clustering technique

Combined Centroid Analysis

Page 28: Students academic performance using clustering technique

Data mining search for patterns of interest

From the mining process we found that “All the 5 semester clusters followed the same pattern of performance”.

A student with high internal mark has higher external marks and a student with less internal marks has less external marks.

There is a direct relation between the internal and the external marks.

At some case this evaluation is not valid, cases like Being absent for internal exam and scoring high marks for

the externals (vice versa)

Page 29: Students academic performance using clustering technique

CONCLUSION

A students performance in his university exam can be predicted with the help of his internal marks. There is a direct relation between the internal and the external marks.

A student with low internals will get low marks for externals too

Page 30: Students academic performance using clustering technique

Use of discovered knowledgerepresentation

Page 31: Students academic performance using clustering technique

Thank You