Proceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013)

Students’ comprehensive evaluation system based on data mining method
He yongrong

Bian xiangjuan

School of Computer Science
Zhejiang International Studies University
Hangzhou, China
hangdianproe@163.com

School of Computer Science
Zhejiang International Studies University
Hangzhou, China
bianxiangjuan@163.com

Abstract—the paper researched comprehensive quality
evaluation system of college student, and introduces data
mining method to solve mess students data. Firstly, the paper
construct students’ data warehouse, then construct
comprehensive multi-dimensions OLAP (Online Analysis
Process) model, and through classification deterministic
method, we can get useful message from the system to guide
college manage students, and some Employers can find their
satisfied employees.

graduation information and the information of the students
adapt the society need. So, we use recent 5 years different
department’s students graduation situation, course
arrangement, employment tendency as basic data, the data
warehouse structure is like figure 2.

Keywords-Data mining, comprehensive evaluation, OLAP

I. INTRODUCTION
With the higher education change from elite education to
popular education, there are more and more student’s
management problems. The students’ data is huge and
complicated, and students’ state and development is also
can’t forecast. So the college managers hope to get a
students’ management system which has auxiliary
deterministic ability and data mining technology just provide
the effective method to solve the problem. Data mining
method try to find useful information from huge data, this is
the processes which find relationship between model and
data from huge data[1], this model and relationship can use to
forecast. Under this background, utilize data mining
technology in students’ management system, construct
perfect students’ evaluation system, which can improve the
students management level and speed up students’
management work specialized[2]. The paper firstly design
students data warehouse, then construct students’ data
mining model which include: students information data
mining, students’ course selection data mining, students’
obtain employee chance data mining, at last, we can obtain
data mining result to get some reason about: the factors
influencing student achievement, the factors influencing
students course selection and factors influencing students
obtain work chance, The whole process is just like figure 1.
II. STUDENTS COMPREHENSIVE EVALUATION DATA
WAREHOUSE CONSTRUCTION

To finish data mining work, the first thing is prepare data,
according to different department and spatiality’s students’
information data, so all collective database should be
reorganized and classified, systematized, to finish these work,
the only thing to do is transfer these data to data warehouse.
As teaching manager, their work often face deterministic
analysis, their most attractive information is student’s

Figure 1: The data mining process in the system

Figure 2:Students comprehensive evaluation data warehouse structure table

The student’s comprehensive data warehouse was
divided as computer department database, english
department etc, then divided as thirteen data mart, the whole
process:
Step 1 : construct data warehouse model function:
determine system main body, and relationship with these
bodies, refine the body’s every lever, such as , educational
administration management system mainly divided as

Published by Atlantis Press, Paris, France.
© the authors
0723
Proceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013)

students’ achievement analysis theme, course arrangement
theme, students’ arranged employment situation;
Step 2: data warehouse physical database’s construction:
define the system physical database model, not only record’s
type, default value, and constraints relationship, but also
some indexes and physical view;
Step 3: data extraction, transformation, and integration:
adopt Microsoft SQLSever tools extracting appoint record,
delete unqualified data, and processing preliminary data
integration.
Step 4: data import: when creating data warehouse, data
transforming function of Microsoft SQLSever is necessary,
because data from other database should be selected,
processing, and loading to data warehouse.
III. CONSTRUCTION

THE DETERMINISTIC TREE OF

STUDENT’S COMPREHENSIVE EVALUATION SYSTEM

The paper utilize deterministic classified theory to
construct deterministic tree of student’s comprehensive
evaluation system, to realize qualitative analysis, the whole
data classified process is like figure 3:

label N,”X.atrributes=V with probability 1” else For each
attribute
A
in
table
compute
AVG
ENTROPY(A,attributes,table);
AS=the
attributes
for
which
AVG
ENTROPY(A,attributes,table)is minimal;
If (AVG ENTROPY (A, attributes, table) is not substantially
smaller than ENTROPY (attributes, table)) then Label N
with most common value of attributes in table (deterministic
tree) r with frequencies of attributes in table (probabilistic
tree) ;
Else label N with AS;
FOR EACH VALUE V of AS DO
N1=ID3(SUBTABLE(table,A,V),attributes));
IF(N1!=null)then make an arc from N to N1 labeled
V;
End
End
End
Return N;
End

Figure 3: The whole process of data mining

Data mining classification is the key step of data mining
application, to realize it, a suitable algorithm should firstly be
selected, and also a proper program must be found to realize
this algorithm. The paper adopt the famous deterministic
classified method: ID3 algorithm [3], the algorithm trains all
samples from root node, select an attribute to divide these
samples, every value of attribute generates a branch, and
then transforms branch attribute value of samples subset to a
new node. This is a recursive process which has been used at
every node, until every sample at node has been divided to a
type. The figure 4 is showing the generating process of
deterministic
tree. ID3 algorithm is most typical
deterministic tree algorithm; it was first put forward by
J.R.Quinlan. The main core ideology of the algorithm is the
greedy search algorithm which selected maximum
information gain as current deterministic attribute. ID3
algorithm can keep depth of every branch of the
deterministic tree is least, here give the ID3 algorithm
detailed description:
ID3(table,attributes)
//input:training set table,attributes;
//output:Deterministic tree;
{if(table is empty)then Return(null);
N=a new node;//create node;
If(there are no predictive attributes in table)//the first
situation
then label N with most common value of attributes in
table(deterministic tree)or with frequencies of attributes in
table(probabilistic tree);
else If(all instances in table have the same value V of
attributes)then/the second situation

Figure 4 the generating process of deterministic tree.

The deterministic tree’s generation passed two stages:
studying and testing. At studying stage, deterministic tree
adopt top-down way’s to finish its recursive process, when
the tree began to generate, all data is in root node, then
divide by recursive way, until branch node generated. The
second stage is testing stage; this stage’s main aim is to
delete some noise and unmoral data. To stop the recursive
process, there are must be satisfied a condition: the data at
same node should must be same type; no other attribute can
be divided by data.
Students comprehensive evolution’s class was divided by
three areas: moral level, academic level, extracurricular
practice level [4], the order is: moral level, academic level,
extracurricular practice level; all the rules are constructed by
if/then language:
If (moral level1) then

Published by Atlantis Press, Paris, France.
© the authors
0724
Proceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013)

(if academic level1 or 2) and (extracurricular
practice level1) then (comprehensive quality1 )
(if academic level1 or 2) and (extracurricular practice
level2) then (comprehensive quality2 )
(if academic level1 or 3) and (extracurricular
practice level1) then (comprehensive quality2 )
else(comprehensive quality3)
If (moral level2) then
(if academic level1) and (extracurricular practice
level1)
then
(comprehensive
quality1
)
(if academic level1) and (extracurricular practice level2)
then
(comprehensive
quality2
)
(if academic level2) and (extracurricular practice level1)
then (comprehensive quality2 )
(if academic level2) and (extracurricular practice
level2) then (comprehensive quality3 )
(if academic level3 or 4) and (extracurricular
practice levela) then (comprehensive quality3 )
else (comprehensive quality4)
If (moral level3) then
(if academic level1) and (extracurricular practice
level1) then (comprehensive quality3 )
Else
else (comprehensive quality4)
According to classified rules, we can get deterministic
tree like figure 5:

Figure 6 the students comprehensive system functions

Fig 7 Integral rendering of The System

Figure 5 Students comprehensive evolution’s deterministic tree

IV. STUDENTS COMPREHENSIVE EVOLUTION PROTOTYPE
SYSTEM DEVELOPMENT

The system adopt multi-dimension model , the model can
be seen as three-dimensional model, database model was
adopted, the processed data can make following online
analyzing easily, the system can be searched by student
number, scheme, school year. The system was divided as
three isolated aero: moral, academic, extracurricular practice.
The moral data often come from themselves’ evaluation and
peer assessment and evaluated by instructor; The academic
data includes core courses, required courses, and elective
courses, when computing scores, can give different weight to
Corresponding course, and standardized all the score, at last
give the result by orders; extracurricular practice adopt bonus
system, divide some little project, every project has its upper
limit, then standardized the scores. The whole system
functional model is like figure 6:The Fig 7 and Fig 8 are the
system working interface.

Fig 8 Rendering of Evaluation Result

V. CONCLUSION
Blend the existing management system in a suit perfect
students management information system and adopt data
mining technology to get some useful information, which
can widely used in students comprehensive evaluation,
graduate interview recommendation and enrollment analysis
etc, it can improve students management level and speed the
students specialization .

Published by Atlantis Press, Paris, France.
© the authors
0725
Proceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013)

ACKNOWLEDGMENT
The paper was support by Zhejiang new talent program
project: “Student integration capability evaluating system
based on data mining technology”, The National Natural
Science Foundation of China “MEMS Multi-fields Uniform
Simulation Model Construction and Optimism Research”
(61100101), the paper was also supported by Key Discipline
of The Ocean Mechatronic Equipments Technology.
REFERENCES
[1] Bodea, Constanta-Nicoleta, Bodea etc. Student performance in online
project management courses: A data mining approach:3rd World Summit
on the Knowledge Society, WSKS 2010,2010, 470-479
[2] Ogor, Emmanuel N. Student academic performance monitoring and
evaluation using data mining techniques . Electronics, Robotics and
Automotive Mechanics Conference, CERMA 2007,2007,354-359
[3] Vialardi, César ; Chue, Jorge; Peche, Juan Pablo etc. A data mining
approach to guide students through the enrollment process based on
academic performance, User Modeling and User-Adapted Interaction, v 21,
n 1-2, p 217-248, April 2011
[4] Zhang, Zhiyu1 . Study and analysis of data mining technology in college
courses students failed,2010 IEEE International Conference on Intelligent
Computing and Integrated Systems, ICISS2010,2010. 800-802
[5] YANG Hong-ying, LUO Huan, HE Qiang. Data Mining in Network
Education Based on Decision Tree. Computer Knowledge and Technology,
Vol.6,No.10,April 2010, pp.2313-2314

Published by Atlantis Press, Paris, France.
© the authors
0726

More Related Content

PDF
J48 and JRIP Rules for E-Governance Data
PDF
IJCSI-10-6-1-288-292
PDF
EFFICIENCY OF DECISION TREES IN PREDICTING STUDENT’S ACADEMIC PERFORMANCE
PDF
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
PDF
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
PDF
A Comparative Study of Educational Data Mining Techniques for Skill-based Pre...
PDF
Paper-Allstate-Claim-Severity
PDF
Performance Evaluation of Different Data Mining Classification Algorithm and ...
J48 and JRIP Rules for E-Governance Data
IJCSI-10-6-1-288-292
EFFICIENCY OF DECISION TREES IN PREDICTING STUDENT’S ACADEMIC PERFORMANCE
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
EXTRACTING USEFUL RULES THROUGH IMPROVED DECISION TREE INDUCTION USING INFORM...
A Comparative Study of Educational Data Mining Techniques for Skill-based Pre...
Paper-Allstate-Claim-Severity
Performance Evaluation of Different Data Mining Classification Algorithm and ...

What's hot (20)

PDF
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
PDF
IRJET- Ordinal based Classification Techniques: A Survey
PDF
Data mining techniques a survey paper
PDF
ADABOOST ENSEMBLE WITH SIMPLE GENETIC ALGORITHM FOR STUDENT PREDICTION MODEL
PDF
PREDICTING STUDENT ACADEMIC PERFORMANCE IN BLENDED LEARNING USING ARTIFICIAL ...
PDF
E1802023741
PDF
Some Imputation Methods to Treat Missing Values in Knowledge Discovery in Dat...
PPT
PDF
An Influence of Measurement Scale of Predictor Variable on Logistic Regressio...
PDF
Data mining techniques
PDF
Comparison on PCA ICA and LDA in Face Recognition
PDF
DATA MINING on WEKA
PDF
B0930610
PDF
HOLISTIC EVALUATION OF XML QUERIES WITH STRUCTURAL PREFERENCES ON AN ANNOTATE...
PDF
Student Performance Data Mining Project Report
PDF
PROPOSAL OF A TWO WAY SORTING ALGORITHM AND PERFORMANCE COMPARISON WITH EXIST...
PDF
Feature selection in multimodal
PDF
Ijatcse71852019
DOC
Record matching over multiple query result - Document
PDF
A novel hybrid feature selection approach
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IRJET- Ordinal based Classification Techniques: A Survey
Data mining techniques a survey paper
ADABOOST ENSEMBLE WITH SIMPLE GENETIC ALGORITHM FOR STUDENT PREDICTION MODEL
PREDICTING STUDENT ACADEMIC PERFORMANCE IN BLENDED LEARNING USING ARTIFICIAL ...
E1802023741
Some Imputation Methods to Treat Missing Values in Knowledge Discovery in Dat...
An Influence of Measurement Scale of Predictor Variable on Logistic Regressio...
Data mining techniques
Comparison on PCA ICA and LDA in Face Recognition
DATA MINING on WEKA
B0930610
HOLISTIC EVALUATION OF XML QUERIES WITH STRUCTURAL PREFERENCES ON AN ANNOTATE...
Student Performance Data Mining Project Report
PROPOSAL OF A TWO WAY SORTING ALGORITHM AND PERFORMANCE COMPARISON WITH EXIST...
Feature selection in multimodal
Ijatcse71852019
Record matching over multiple query result - Document
A novel hybrid feature selection approach

Viewers also liked (17)

PPT
"What TIME is it?" by Caitlin McGowan
PPTX
Ekologi Kerusakan Lingkungan
PDF
Πρόσκληση υποβολής αιτήσεων ενίσχυσης έτους 2014 ''Εγκατάσταση Νέων Αγροτών''
PDF
[TIK] Network hardware
PPTX
05. Παρουσίαση της αίτησης συμμετοχής στην ΑΓΡΩΓΟΣ Αναπτυξιακή - Πάτρα, 07/02...
PDF
Οδηγός Επιχειρηματικότητας και εργασίας για Νέους Κτηνοτρόφους
ODP
Cassandra at Digby
PPTX
Kemiskinan dan Keterbelakangan
DOCX
Ekonomi teknik # 2
ODP
Nature of organizing
PDF
Μελέτη διάγνωσης των αναγκών της τοπικής αγοράς εργασίας
DOCX
Makalah softskill 2 rate of return
DOCX
Nilai Waktu dari Uang
PPT
Disability Project - ASD
DOCX
Tugas ekonomi teknik # 1
DOCX
Makalah pendidikan kewarganegaraan
DOCX
"MENGANALISIS SUKU BUNGA"
"What TIME is it?" by Caitlin McGowan
Ekologi Kerusakan Lingkungan
Πρόσκληση υποβολής αιτήσεων ενίσχυσης έτους 2014 ''Εγκατάσταση Νέων Αγροτών''
[TIK] Network hardware
05. Παρουσίαση της αίτησης συμμετοχής στην ΑΓΡΩΓΟΣ Αναπτυξιακή - Πάτρα, 07/02...
Οδηγός Επιχειρηματικότητας και εργασίας για Νέους Κτηνοτρόφους
Cassandra at Digby
Kemiskinan dan Keterbelakangan
Ekonomi teknik # 2
Nature of organizing
Μελέτη διάγνωσης των αναγκών της τοπικής αγοράς εργασίας
Makalah softskill 2 rate of return
Nilai Waktu dari Uang
Disability Project - ASD
Tugas ekonomi teknik # 1
Makalah pendidikan kewarganegaraan
"MENGANALISIS SUKU BUNGA"

Similar to Cs268 (20)

PDF
Predicting students' performance using id3 and c4.5 classification algorithms
PDF
RESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATION
PDF
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
PDF
Data Mining Techniques for School Failure and Dropout System
PDF
L016136369
DOCX
Perfomance Comparison of Decsion Tree Algorithms to Findout the Reason for St...
PDF
DATA MINING METHODOLOGIES TO STUDY STUDENT'S ACADEMIC PERFORMANCE USING THE...
PDF
Student Performance Evaluation in Education Sector Using Prediction and Clust...
PDF
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
PDF
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
PDF
A Survey on the Classification Techniques In Educational Data Mining
PDF
Analyzing undergraduate students’ performance in various perspectives using d...
PDF
Learning Strategy with Groups on Page Based Students' Profiles
PDF
Learning Strategy with Groups on Page Based Students' Profiles
PDF
Fd33935939
PDF
Fd33935939
PDF
Learning strategy with groups on page based students' profiles
PDF
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
PDF
Analysis on Student Admission Enquiry System
PDF
Analysis on Student Admission Enquiry System
Predicting students' performance using id3 and c4.5 classification algorithms
RESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATION
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Data Mining Techniques for School Failure and Dropout System
L016136369
Perfomance Comparison of Decsion Tree Algorithms to Findout the Reason for St...
DATA MINING METHODOLOGIES TO STUDY STUDENT'S ACADEMIC PERFORMANCE USING THE...
Student Performance Evaluation in Education Sector Using Prediction and Clust...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Model for Predicting Students’ Academic Performance using a Hybrid of K-mea...
A Survey on the Classification Techniques In Educational Data Mining
Analyzing undergraduate students’ performance in various perspectives using d...
Learning Strategy with Groups on Page Based Students' Profiles
Learning Strategy with Groups on Page Based Students' Profiles
Fd33935939
Fd33935939
Learning strategy with groups on page based students' profiles
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
Analysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry System

Recently uploaded (20)

PPTX
How to Convert Tickets Into Sales Opportunity in Odoo 18
PDF
The AI Revolution in Customer Service - 2025
PPTX
Blending method and technology for hydrogen.pptx
PDF
ment.tech-Siri Delay Opens AI Startup Opportunity in 2025.pdf
PDF
Human Computer Interaction Miterm Lesson
PDF
NewMind AI Weekly Chronicles – August ’25 Week IV
PDF
Rapid Prototyping: A lecture on prototyping techniques for interface design
PDF
CXOs-Are-you-still-doing-manual-DevOps-in-the-age-of-AI.pdf
PDF
SaaS reusability assessment using machine learning techniques
PDF
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
PDF
Advancing precision in air quality forecasting through machine learning integ...
PPTX
Report in SIP_Distance_Learning_Technology_Impact.pptx
PDF
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
PDF
A hybrid framework for wild animal classification using fine-tuned DenseNet12...
PDF
EIS-Webinar-Regulated-Industries-2025-08.pdf
PDF
Transform-Your-Factory-with-AI-Driven-Quality-Engineering.pdf
PPTX
How to use fields_get method in Odoo 18
PDF
The-Future-of-Automotive-Quality-is-Here-AI-Driven-Engineering.pdf
PDF
substrate PowerPoint Presentation basic one
PDF
Introduction to MCP and A2A Protocols: Enabling Agent Communication
How to Convert Tickets Into Sales Opportunity in Odoo 18
The AI Revolution in Customer Service - 2025
Blending method and technology for hydrogen.pptx
ment.tech-Siri Delay Opens AI Startup Opportunity in 2025.pdf
Human Computer Interaction Miterm Lesson
NewMind AI Weekly Chronicles – August ’25 Week IV
Rapid Prototyping: A lecture on prototyping techniques for interface design
CXOs-Are-you-still-doing-manual-DevOps-in-the-age-of-AI.pdf
SaaS reusability assessment using machine learning techniques
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
Advancing precision in air quality forecasting through machine learning integ...
Report in SIP_Distance_Learning_Technology_Impact.pptx
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
A hybrid framework for wild animal classification using fine-tuned DenseNet12...
EIS-Webinar-Regulated-Industries-2025-08.pdf
Transform-Your-Factory-with-AI-Driven-Quality-Engineering.pdf
How to use fields_get method in Odoo 18
The-Future-of-Automotive-Quality-is-Here-AI-Driven-Engineering.pdf
substrate PowerPoint Presentation basic one
Introduction to MCP and A2A Protocols: Enabling Agent Communication

Cs268

  • 1. Proceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013) Students’ comprehensive evaluation system based on data mining method He yongrong Bian xiangjuan School of Computer Science Zhejiang International Studies University Hangzhou, China [email protected] School of Computer Science Zhejiang International Studies University Hangzhou, China [email protected] Abstract—the paper researched comprehensive quality evaluation system of college student, and introduces data mining method to solve mess students data. Firstly, the paper construct students’ data warehouse, then construct comprehensive multi-dimensions OLAP (Online Analysis Process) model, and through classification deterministic method, we can get useful message from the system to guide college manage students, and some Employers can find their satisfied employees. graduation information and the information of the students adapt the society need. So, we use recent 5 years different department’s students graduation situation, course arrangement, employment tendency as basic data, the data warehouse structure is like figure 2. Keywords-Data mining, comprehensive evaluation, OLAP I. INTRODUCTION With the higher education change from elite education to popular education, there are more and more student’s management problems. The students’ data is huge and complicated, and students’ state and development is also can’t forecast. So the college managers hope to get a students’ management system which has auxiliary deterministic ability and data mining technology just provide the effective method to solve the problem. Data mining method try to find useful information from huge data, this is the processes which find relationship between model and data from huge data[1], this model and relationship can use to forecast. Under this background, utilize data mining technology in students’ management system, construct perfect students’ evaluation system, which can improve the students management level and speed up students’ management work specialized[2]. The paper firstly design students data warehouse, then construct students’ data mining model which include: students information data mining, students’ course selection data mining, students’ obtain employee chance data mining, at last, we can obtain data mining result to get some reason about: the factors influencing student achievement, the factors influencing students course selection and factors influencing students obtain work chance, The whole process is just like figure 1. II. STUDENTS COMPREHENSIVE EVALUATION DATA WAREHOUSE CONSTRUCTION To finish data mining work, the first thing is prepare data, according to different department and spatiality’s students’ information data, so all collective database should be reorganized and classified, systematized, to finish these work, the only thing to do is transfer these data to data warehouse. As teaching manager, their work often face deterministic analysis, their most attractive information is student’s Figure 1: The data mining process in the system Figure 2:Students comprehensive evaluation data warehouse structure table The student’s comprehensive data warehouse was divided as computer department database, english department etc, then divided as thirteen data mart, the whole process: Step 1 : construct data warehouse model function: determine system main body, and relationship with these bodies, refine the body’s every lever, such as , educational administration management system mainly divided as Published by Atlantis Press, Paris, France. © the authors 0723
  • 2. Proceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013) students’ achievement analysis theme, course arrangement theme, students’ arranged employment situation; Step 2: data warehouse physical database’s construction: define the system physical database model, not only record’s type, default value, and constraints relationship, but also some indexes and physical view; Step 3: data extraction, transformation, and integration: adopt Microsoft SQLSever tools extracting appoint record, delete unqualified data, and processing preliminary data integration. Step 4: data import: when creating data warehouse, data transforming function of Microsoft SQLSever is necessary, because data from other database should be selected, processing, and loading to data warehouse. III. CONSTRUCTION THE DETERMINISTIC TREE OF STUDENT’S COMPREHENSIVE EVALUATION SYSTEM The paper utilize deterministic classified theory to construct deterministic tree of student’s comprehensive evaluation system, to realize qualitative analysis, the whole data classified process is like figure 3: label N,”X.atrributes=V with probability 1” else For each attribute A in table compute AVG ENTROPY(A,attributes,table); AS=the attributes for which AVG ENTROPY(A,attributes,table)is minimal; If (AVG ENTROPY (A, attributes, table) is not substantially smaller than ENTROPY (attributes, table)) then Label N with most common value of attributes in table (deterministic tree) r with frequencies of attributes in table (probabilistic tree) ; Else label N with AS; FOR EACH VALUE V of AS DO N1=ID3(SUBTABLE(table,A,V),attributes)); IF(N1!=null)then make an arc from N to N1 labeled V; End End End Return N; End Figure 3: The whole process of data mining Data mining classification is the key step of data mining application, to realize it, a suitable algorithm should firstly be selected, and also a proper program must be found to realize this algorithm. The paper adopt the famous deterministic classified method: ID3 algorithm [3], the algorithm trains all samples from root node, select an attribute to divide these samples, every value of attribute generates a branch, and then transforms branch attribute value of samples subset to a new node. This is a recursive process which has been used at every node, until every sample at node has been divided to a type. The figure 4 is showing the generating process of deterministic tree. ID3 algorithm is most typical deterministic tree algorithm; it was first put forward by J.R.Quinlan. The main core ideology of the algorithm is the greedy search algorithm which selected maximum information gain as current deterministic attribute. ID3 algorithm can keep depth of every branch of the deterministic tree is least, here give the ID3 algorithm detailed description: ID3(table,attributes) //input:training set table,attributes; //output:Deterministic tree; {if(table is empty)then Return(null); N=a new node;//create node; If(there are no predictive attributes in table)//the first situation then label N with most common value of attributes in table(deterministic tree)or with frequencies of attributes in table(probabilistic tree); else If(all instances in table have the same value V of attributes)then/the second situation Figure 4 the generating process of deterministic tree. The deterministic tree’s generation passed two stages: studying and testing. At studying stage, deterministic tree adopt top-down way’s to finish its recursive process, when the tree began to generate, all data is in root node, then divide by recursive way, until branch node generated. The second stage is testing stage; this stage’s main aim is to delete some noise and unmoral data. To stop the recursive process, there are must be satisfied a condition: the data at same node should must be same type; no other attribute can be divided by data. Students comprehensive evolution’s class was divided by three areas: moral level, academic level, extracurricular practice level [4], the order is: moral level, academic level, extracurricular practice level; all the rules are constructed by if/then language: If (moral level1) then Published by Atlantis Press, Paris, France. © the authors 0724
  • 3. Proceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013) (if academic level1 or 2) and (extracurricular practice level1) then (comprehensive quality1 ) (if academic level1 or 2) and (extracurricular practice level2) then (comprehensive quality2 ) (if academic level1 or 3) and (extracurricular practice level1) then (comprehensive quality2 ) else(comprehensive quality3) If (moral level2) then (if academic level1) and (extracurricular practice level1) then (comprehensive quality1 ) (if academic level1) and (extracurricular practice level2) then (comprehensive quality2 ) (if academic level2) and (extracurricular practice level1) then (comprehensive quality2 ) (if academic level2) and (extracurricular practice level2) then (comprehensive quality3 ) (if academic level3 or 4) and (extracurricular practice levela) then (comprehensive quality3 ) else (comprehensive quality4) If (moral level3) then (if academic level1) and (extracurricular practice level1) then (comprehensive quality3 ) Else else (comprehensive quality4) According to classified rules, we can get deterministic tree like figure 5: Figure 6 the students comprehensive system functions Fig 7 Integral rendering of The System Figure 5 Students comprehensive evolution’s deterministic tree IV. STUDENTS COMPREHENSIVE EVOLUTION PROTOTYPE SYSTEM DEVELOPMENT The system adopt multi-dimension model , the model can be seen as three-dimensional model, database model was adopted, the processed data can make following online analyzing easily, the system can be searched by student number, scheme, school year. The system was divided as three isolated aero: moral, academic, extracurricular practice. The moral data often come from themselves’ evaluation and peer assessment and evaluated by instructor; The academic data includes core courses, required courses, and elective courses, when computing scores, can give different weight to Corresponding course, and standardized all the score, at last give the result by orders; extracurricular practice adopt bonus system, divide some little project, every project has its upper limit, then standardized the scores. The whole system functional model is like figure 6:The Fig 7 and Fig 8 are the system working interface. Fig 8 Rendering of Evaluation Result V. CONCLUSION Blend the existing management system in a suit perfect students management information system and adopt data mining technology to get some useful information, which can widely used in students comprehensive evaluation, graduate interview recommendation and enrollment analysis etc, it can improve students management level and speed the students specialization . Published by Atlantis Press, Paris, France. © the authors 0725
  • 4. Proceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013) ACKNOWLEDGMENT The paper was support by Zhejiang new talent program project: “Student integration capability evaluating system based on data mining technology”, The National Natural Science Foundation of China “MEMS Multi-fields Uniform Simulation Model Construction and Optimism Research” (61100101), the paper was also supported by Key Discipline of The Ocean Mechatronic Equipments Technology. REFERENCES [1] Bodea, Constanta-Nicoleta, Bodea etc. Student performance in online project management courses: A data mining approach:3rd World Summit on the Knowledge Society, WSKS 2010,2010, 470-479 [2] Ogor, Emmanuel N. Student academic performance monitoring and evaluation using data mining techniques . Electronics, Robotics and Automotive Mechanics Conference, CERMA 2007,2007,354-359 [3] Vialardi, César ; Chue, Jorge; Peche, Juan Pablo etc. A data mining approach to guide students through the enrollment process based on academic performance, User Modeling and User-Adapted Interaction, v 21, n 1-2, p 217-248, April 2011 [4] Zhang, Zhiyu1 . Study and analysis of data mining technology in college courses students failed,2010 IEEE International Conference on Intelligent Computing and Integrated Systems, ICISS2010,2010. 800-802 [5] YANG Hong-ying, LUO Huan, HE Qiang. Data Mining in Network Education Based on Decision Tree. Computer Knowledge and Technology, Vol.6,No.10,April 2010, pp.2313-2314 Published by Atlantis Press, Paris, France. © the authors 0726