/13 / 48 2007

Data Mining

Data Mining

.

.

"

:

.
:
.1
.2
.3
.4
.5


"
.1

.2
.3
.4

DM

"
1996 .
"
" .


.

.

"
.1
.2
.3
.4

.
.

.

.

"

"
Decision Tree Clustering
"

" -

" -

:
.1 "
.
.2
.


" -
277 of 1530 (18%)


.

" -
19
1985 - 2003.
1997 - 2001.

" -


"
.

" -

:
: Microsoft SQL Server 2000
Decision Tree Clustering
.
:
.

-
.1
.2
.3
.4
.5
.6

:

.

.
.

.


.
.



.7


19
3
.

Data Mining

Essence Of Data Mining

Data, Information, Knowledge
(Wu, 2000: 1).
(Seiner, 2002: 2)

(Daft, 2001: 258).

DM, Gold Mining

(Noonan, 2000: 6)

(Soni & Tang & Yang, 2002: 1).
Data Mining
Knowledge Discovery in Databases (KDD)
Fayyad

(Zaiane, 1999: 3) (Houston & Others, 1999: 438).
DM:
(Information Discovery, Inc., 2000: 4) (Ramachandran, 2001: 2)
1- Executives
2- End Users
3- Analysts




(Saarenvirta, 2001: 1) Decision Support System (DSS)
(Reactive)
(Proactive)
(Rob & Coronel, 2000: 609).
DM
Data Warehouse (DW)
(Romney & Steinbart, 2000: 599).
Artificial Intelligence (AI)

(Avison & Shah, 1997: 327).

Types Of Data Mining


:
(Information Discovery, Inc., 2000: 2-3) (Ramachandran, 2001: 1):
1- Discovery
2- Predictive Modeling
3- Forensic Analysis

(Ahola & Rinta-Runsala, 2001: 3):
1- Exploratory Analysis


2- Predictive Analysis

(Lehman, 2001: 7).
(Information Discovery, Inc., 2000: 5) (Ramachandran, 2001: 2):
1- Episodic Mining
2- Strategic Mining
3- Continuous Mining

Techniques Of Data Mining



:
(Brand & Gerritsen, 1998: 1-3) (Edelstein, 1997: 3) (Ramachandran, 2001: 3-5) (Atre, 2001: 2) (Two Crows, 1999: 6-15):
1- Classification
Decision Tree, Nearest Neighbor, Regression.
2- Association
Market Basket Analysis.


3- Sequential Analysis
Link Analysis.
4- Clustering
K-Means, Neural Networks.
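
K-Means, named above under clustering, can be illustrated with a short sketch. The Python below is not taken from the study, which relied on the clustering algorithm shipped with Microsoft SQL Server 2000; it is only a minimal version of the standard K-Means loop (assign each point to its nearest centre, then move each centre to the mean of its members). The function names `nearest` and `k_means`, the sample points, and the fixed seed are all assumptions made for illustration.

```python
import random


def nearest(point, centroids):
    """Return the index of the centroid closest to `point` (squared Euclidean)."""
    distances = [sum((a - b) ** 2 for a, b in zip(point, c)) for c in centroids]
    return distances.index(min(distances))


def k_means(points, k, iterations=20, seed=0):
    """Hypothetical helper: split numeric tuples into k clusters."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # assumption: start from k of the data points
    for _ in range(iterations):
        # Assignment step: attach every point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        # Update step: move each centroid to the mean of its members;
        # an empty cluster keeps its previous centroid.
        updated = [
            tuple(sum(axis) / len(members) for axis in zip(*members)) if members else old
            for members, old in zip(clusters, centroids)
        ]
        if updated == centroids:  # nothing moved, so the loop has converged
            break
        centroids = updated
    # Final assignment against the centroids that are returned.
    clusters = [[] for _ in range(k)]
    for p in points:
        clusters[nearest(p, centroids)].append(p)
    return centroids, clusters


# Illustrative data only: two visibly separate groups of points.
points = [(1, 1), (1, 2), (2, 1), (9, 9), (9, 10), (10, 9)]
centroids, clusters = k_means(points, k=2)
```

The number of clusters K has to be chosen in advance, which is the main practical decision when applying the technique.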

(Two Crows, 1999: 10-15):
- Decision Trees
- Neural Networks
Input Layer, Hidden Layer, Output Layer.
- Regression
- Time Series
- Rule Induction
- K Nearest Neighbor (K-NN)


- Discriminant Analysis
- Boosting
- Genetic Algorithms
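
Of the techniques listed above, K Nearest Neighbor is simple enough to show in a few lines. The sketch below is an illustration only and is not the method applied in the study: a new record is labelled by majority vote among the K closest labelled records. The function name `knn_classify`, the two numeric features, and the sample records are hypothetical; only the "good" and "bad" risk labels echo the labels that appear in the study's results.

```python
from collections import Counter


def knn_classify(training, query, k=3):
    """Label `query` by majority vote among its k nearest training records.

    training: list of (features, label) pairs, features being numeric tuples.
    """
    by_distance = sorted(
        training,
        key=lambda record: sum((a - b) ** 2 for a, b in zip(record[0], query)),
    )
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


# Hypothetical records: two made-up numeric features per customer.
records = [
    ((1.0, 9.0), "bad"), ((2.0, 8.0), "bad"), ((1.5, 7.5), "bad"),
    ((8.0, 2.0), "good"), ((9.0, 1.0), "good"), ((7.5, 1.5), "good"),
]
label = knn_classify(records, (2.0, 7.0), k=3)
```

An odd K avoids tied votes when there are two classes, and features measured on very different scales would normally be rescaled before distances are computed.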

" -

Data Mining Process

DM:
(Brand & Gerritsen, 1998: 3) (Two Crows, 1999: 22) (Saarenvirta, 2001: 6) (Skalak, 2001: 1)

1- Define business problem

2- Build data mining database
DW, Data Mart.
50% - 90%.

3- Explore data

4- Prepare data for modeling


5- Build model

6- Evaluate model

7- Deploy model and results

:

.
) (1

.
.

DW

(1) Data Mining
Source: Rob, Peter & Coronel, Carlos, Data Base Systems Design, Implementation and Management, Fourth Edition, Course Technology, 2000, p. 611.

" -


Data Mining Applications

(Avison & Shah, 1997: 328) (Ramachandran, 2001: 3) (Wu, 2002: 2) (Two Crows, 2002: 1):
1- Banking
2- Financial
3- Telecommunications
4- Marketing
5- Insurance and Health Care
6- Medicine
7- Transportation
8- Retailing
9- Customer Relationship Management
10- Quality Control or Error Analysis
11- Hiring
12- Electronic Commerce
13- Food Service Menu Analysis
14- Warranty Analysis
15- Student Recruiting and Retention

Strategies of Data Mining Success


:
(Noonan, 2000: 4)
1-
(Skalak, 2001: 2)
2- (Hermiz, 1999: 3).
(Small, 1997: 6).
3-
(Small, 1997: 7).
4- (Skalak, 2001: 2)
(Noonan, 2000: 3).


5-
(White Cross, 2001: 5)

6-
(Hermiz, 1999: 4).
(Skalak, 2001: 2).
7- (Smyth, 2001: 5).

"

Performance

(2000: 330).
1-
(1999: 211).
2- Productivity
(2001: 42).
3- Effectiveness

(Robbins & Coulter, 1999: 9)

(Saunders, 1997: 264).

(1999: 208-211) (2001: 48-50):
-1


.
-2
" .
-3

.
-4

.

"

:
(Rambaldi & Bautista, 2000: 14) (1986: 77) (2001: 50-52)
-1

.
-2 .
-3 .
-4 .
-5
".

"


(2001: 76).
(Kunstelj & Others, 2001: 10).

(1).
(1995: 77):
(2000: 54-79)
1- Historical Standard

2- Industry Standard

.



.

.
(Saunders, 1997: 264)



.

" -



" .

.
SQL Server 2000 (Structured Query Language Server 2000)
Database Management System

(Gunderloy & Jorden, 2000: 263).
(Tiedrich, 2000: 14) (Soni & Others, 2001: 2) (Bloor Research, 2001: 102):
1- Microsoft Decision Tree (MDT)
2- Microsoft Clustering

Data Mining.


.

"

"
(Seidman, 2001: 114)
.
Risk ) (2
Level


Low
Branch
Babil
.
Content Navigator



.

(2) Risk
Attributes
(2)
97 bad
35% 180 64.64%
0.36%.
Node Path
. .

.


" -

"


(Seidman, 2001: 146) .

.
Node Attribute Set .
(Cluster 1, Cluster 2)

Node path .
Risk Cluster 1
bad (3)
39 12
bad 31.18% 27 good
68.82% 0 . 0%
Node path


"
.

.


(3) Risk

" -


" " :
-1 :
-
-
-2 :
-
- /
- /
-3 :
- /
- / ) (
- /



" (Rose, 1999:159) .
1997 . 2001
"
50.7% 2001
18.8%. 2000

" "
"


" .

.


.

.


.




.






.


" -
.1
.2
.3
.4
.5
.6
.7


.

.

.

.


.


.

.

" -
.1
.2
.3
.4

.5

.6


.
SQL Server 2000
.


.

.

.

.

.
Level
"
. Branch


.
.
.7

28 2
.


.
.8


/ 2001 699%
50.7%.

" -
.1

.2

.3

.4
.5

.6


.

.


(DW)

.



.

" "
.



.


.


" -
.1
.
.2
.
.3 " "
Insightful Miner (I Miner).

References
" -
.1
.2
.3
.4
.5
.6

" "
.2000
" "
.2001
"
" 1983-1979 .1986
" "
.1995
"
" . 2000
"
"
.1999

"

Journals

1. Ahola, Jussi & Rinta-Runsala, Esa: Data Mining Case Studies in Customer Profiling, Research Report TTE1-2001-29, Version 1.0, 2001.
2. Atre, Shaku: Defining Today's Data Mining, Executive Update, Business Intelligence Advisory Service, CUTTER, Vol. 1, No. 4, 2001.
3. Brand, Estelle & Gerritsen, Rob: Data Mining Solutions, DBMS Magazine, 1998.
4. Edelstein, Herb: Mining For Gold, Information Week, April 1997.


5. Hermiz, Keith B.: Critical Success Factors For Data Mining Projects, DM Review Magazine, ES Media Group, Feb. 1999.
6. Houston, Andrea L. & Others: Medical Data Mining on the Internet: Research on a Cancer Information System, Artificial Intelligence Review 13:437-466, 2000.
7. Information Discovery, Inc.: A Characterization of Data Mining Technologies and Processes, DM Review Magazine, EC Media Group, 2000.
8. Lehman, J.T.: Future Tense, Intelligent Enterprise Magazine, Oct. 2001.
9. Noonan, Jack: Data Mining Strategies, DM Review Magazine, EC Media Group, July 2000.
10. Rambaldi, Giacomo & Bautista, Mike: Monitoring and Evaluation: Beyond Record-Keeping, Special Reports, Suhay, July-September 2000.
11. Saarenvirta, Gary: Operation Data Mining, DB2 Magazine, Summer 2001.
12. Seiner, Robert S.: The IRM Test: How Did IT Get This Way? A Self-Help Test, DM Review Magazine, March 2002.
13. Skalak, David: Data Mining Blunders Exposed!, DB2 Magazine, Summer 2001.
14. Small, Robert D.: Debunking Data Mining Myths, Information Week, CMP Media, Jan. 1997.
15. Wu, Jonathan: What is Data Mining?, DM Review Magazine, EC Media Group, August 2000.
16. Wu, Jonathan: The Value in Mining Data, DM Review Magazine, Feb. 2002.

Internet

1. Bloor Research: Databases on evaluation & comparison, 2001.
2. Kunstelj, Mateja & Leben, Anamarija & Vintar, Mirko: Influences Of Information Technology On The Quality Of Public Services, NISPAcee Annual Conference, 10-12/5/2001, Jurmala, Latvia.
3. Ramachandran M, Pushpa: Mining for Gold, Wipro Technologies, December 2001. http://www.Wipro.com
4. Smyth, Padhraic: Data Mining At The Interface of Computer Science And Statistics, 2001.
5. Soni, Sanjay & Tang, Zhaohui & Yang, Jim: Performance Study of Microsoft Data Mining Algorithms, Microsoft Corp., 2001.


6. Tiedrich, Alan H.: Microsoft Corp. SQL Server 2000 Analysis Services, Datapro Information Services, Gartner Group, Inc., November 2000.
7. Two Crows Corporation: Introduction to Data Mining and Knowledge Discovery, Third Edition, 1999.
8. Two Crows Corporation: Data Mining Applications, 2002. http://www.Towcrows.com
9. White Cross: Mining Very Large Databases to Support Knowledge Exploration, Version 1, January 5, 2001. http://www.Whitecross.com
10. Zaiane, Osmar R.: Introduction to Data Mining, CMPUT 690 Principles of Knowledge Discovery in Databases, University of Alberta, 1999.

Books

1. Avison, David & Shah, Hanifa: The Information Systems Development Life Cycle, McGraw-Hill, UK, 1997.
2. Daft, Richard L.: Organization Theory and Design, Seventh Edition, South-Western College Publishing, U.S.A., 2001.
3. Gunderloy, Mike & Jorden, Joseph L.: Mastering SQL Server 2000, SYBEX Inc., U.S.A., 2000.
4. Hempel, George H. & Simonson, Donald G.: Bank Management, Text and Cases, Fifth Edition, John Wiley & Sons, Inc., U.S.A., 1999.
5. Revsine, Lawrence & Collins, Daniel W. & Johnson, W. Bruce: Financial Reporting & Analysis, Prentice Hall, Inc., U.S.A., 1999.
6. Rob, Peter & Coronel, Carlos: Data Base Systems Design, Implementation and Management, Fourth Edition, Course Technology, 2000.
7. Robbins, Stephen & Coulter, Mary: Management, Sixth Edition, Prentice-Hall, U.S.A., 1999.
8. Romney, Marshall B. & Steinbart, Paul John: Accounting Information Systems, Eighth Edition, Prentice-Hall, Inc., U.S.A., 2000.
9. Rose, Peter S.: Commercial Bank Management, Fourth Edition, McGraw-Hill, Inc., Singapore, 1999.
10. Saunders, Anthony: Financial Institutions Management, Second Edition, McGraw-Hill, Inc., U.S.A., 1997.
11. Seidman, Claude: Data Mining with Microsoft SQL Server 2000, Microsoft Corp., U.S.A., 2001.


) (1




) (

-1
-2


- -1


-2

-1

-2

-3


-4




-1



-2


-3

-4

-5

-6

:
(Revsine & Others, 1999: 160-174)
(1995: 78)
(Hempel & Simonson, 1999: 67) (2000: 54-79)
