Welcome to Scribd!

Datamining

Uploaded by

api-19626062

0% found this document useful (0 votes)

160 views18 pages

Data Mining is the process of finding correlations or patterns among dozens of fields in large relational databases. Data Mining is one of several terms, including knowledge discovery, knowledge extraction, data archaeology, information harvesting and even data dredging. A Data Mining algorithm is used to identify items that occur together in a given event or record.

Original Description:

Original Title

datamining

Copyright

Available Formats

PPT, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Attribution Non-Commercial (BY-NC)

Available Formats

Download as PPT, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

160 views18 pages

Datamining

Uploaded by

api-19626062

Copyright:

Attribution Non-Commercial (BY-NC)

Available Formats

Download as PPT, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 18

Search inside document

What is Data Mining ?

By
Saurabh Jain
General Concept of Data Mining
• Most organization have accumulated a great
deal of data, but, what they really want is
information
• Data mining is the process of finding
correlations or patterns among dozens of
fields in large relational databases.
General Concept of Data Mining
(cont’d)

• Data mining uses sophisticated statistical

analysis and modeling techniques to
uncover patterns and relationship hidden in
organization database.
• Data mining is one of several terms,
including knowledge discovery, knowledge
extraction, data archaeology, information
harvesting and even data dredging
How Does Data Mining Work?

Algorithms Technologies
• Associations • Neural networks
• Classifications • Decision trees
• Sequential discovery • Rule induction
• Clustering • Data visualization
Algorithms
1. Associations
- This is used to identify items that occur
together in a given event or record.
- This technique is often used for market
analysis, rules hidden between the attributes
Ex. “When people buy a hammer they also buy
nails 50% of the time.”
Algorithms (cont’d)

2. Classifications
This is used to classify database records
into a number of predefined classes based
on certain criteria.
Ex. “Customers with excellent credit history
have a debt/equity ratio of less than 10%”
Algorithms (cont’d)

3. Sequential Discovery
This helps identify patterns in time series.

Ex. “60% of customers buy TVs followed by

8mm camcorders.”
Algorithms (cont’d)

4. Clustering
This is used to segment the database into
different clusters, based on a set of
attributes.

Ex. “Understand the target market used to

classify new data”
Technologies
1. Neural networks
This trains the net on a
training dataset and
then use it to make
predictions.
Technologies (cont’d)
2. Decision trees
• A way of representing
a series of rule that
lead to value or class.
• This segregates the
data based on the
value of the variables.
Technologies (cont’d)

2. Decision Trees(cont’d)
- Cannot use continuous data
- Used for model understanding rather than
prediction
Technologies (cont’d)
3. Rule induction
All possible patterns in the database are
systematically pulled out and then the
accuracy and the coverage are calculated.
Ex.
IF breakfast cereals, then milk : accuracy 90%, coverage 15%
IF Friday and male and diapers, then beer:

accuracy 60%, coverage 0.1%

Technologies (cont’d)
4. Data visualization
• Graphics tools are used to illustrate data
relationships.
• Gain a deeper, intuitive understanding of
the data by presenting a picture for users.
Current Limitations

• Cost, Time and Effort

– Data Mining setup can be expensive
– Many man-hour of development are needed.
– Some of their functions involve steep learning
curves for the end-users.
– Extensive training and practices are still needed
for most users
Current Limitations (cont’d)
• Low-end software
– These have limited query capabilities and its
inability to perform multidimensional analyses.
– Many of the current methods are not truly
interactive and cannot incorporate prior
knowledge
• Large databases
– The large size presents problems in terms of
finding efficient algorithms for association rules.
Conclusion
• Data mining assists user finding patterns and
relationships in the data.
• Data mining is a powerful tool, not magic.
• Organize the large volumes of data into some form of
categories. => To avoid the GIGO, data should have
minimal missing values.
• Current Limitations : Cost, Time, effort, Low-end
software, and Large databases
Any Questions ????????
THANKING YOU !!!

Some Solutions To Enderton Logic
Document16 pages
Some Solutions To Enderton Logic
Jason
100% (1)
Din 48204
Document3 pages
Din 48204
Thanh Dang
100% (4)
Cable Schedule - Instrument - Surfin - Malanpur-R0
Document3 pages
Cable Schedule - Instrument - Surfin - Malanpur-R0
arunpandey1686
No ratings yet
Reclaimer PDF
Document8 pages
Reclaimer PDF
Siti Nurhidayati
No ratings yet
Management Information System: Dr. Anand Vyas
Document10 pages
Management Information System: Dr. Anand Vyas
SUFIYAN KHAN
No ratings yet
ML Lect1
Document51 pages
ML Lect1
physics lover
100% (1)
Data Mining
Document29 pages
Data Mining
Miel9226
No ratings yet
Unit 3: by Dr. Anand Vyas
Document20 pages
Unit 3: by Dr. Anand Vyas
Prince Singh
No ratings yet
Data Mining
Document3 pages
Data Mining
hadnica2
No ratings yet
Data Mining and Warehousing-1
Document43 pages
Data Mining and Warehousing-1
Vijay Kumar Saini
No ratings yet
Data Mining Concepts and Applications: Six Factors Behind The Sudden Rise in Popularity of Data Mining
Document36 pages
Data Mining Concepts and Applications: Six Factors Behind The Sudden Rise in Popularity of Data Mining
Ongudi Tiberius
No ratings yet
Lec 01 Data Mining
Document25 pages
Lec 01 Data Mining
Musa Savage
No ratings yet
Unit 1 - Big Data Technologies
Document89 pages
Unit 1 - Big Data Technologies
prakash N
No ratings yet
III CS Datamining - Unlocked
Document68 pages
III CS Datamining - Unlocked
Jana Jana
No ratings yet
Data Science PDF
Document11 pages
Data Science PDF
sredhar s
No ratings yet
Recommender System - Module 2 - Data Mining Techniques in Recommender System
Document58 pages
Recommender System - Module 2 - Data Mining Techniques in Recommender System
DainikMitra
No ratings yet
Recommender System - Module 2 - Data Mining Techniques in Recommender System
Document58 pages
Recommender System - Module 2 - Data Mining Techniques in Recommender System
DainikMitra
No ratings yet
Data Mining Techniques and Applications
Document16 pages
Data Mining Techniques and Applications
lokesh Koppanathi
No ratings yet
Data Mining Implementation
Document9 pages
Data Mining Implementation
akhmad faiz al khairi
No ratings yet
Data Mining
Document15 pages
Data Mining
akashsharma9011328268
No ratings yet
Authenticating and Reducing False Hits in Mining
Document37 pages
Authenticating and Reducing False Hits in Mining
Ujwala Bhoga
No ratings yet
Data Mining and Data Analysis UNIT-1 Notes For Print
Document22 pages
Data Mining and Data Analysis UNIT-1 Notes For Print
padma
No ratings yet
Data Mining
Document13 pages
Data Mining
Sunaina Bondlewad
No ratings yet
Presentation 1
Document28 pages
Presentation 1
Nisar Mohammad
No ratings yet
1 DataScience
Document91 pages
1 DataScience
Aman Singh
No ratings yet
Data Mining Fall-2019 Qs Ans
Document10 pages
Data Mining Fall-2019 Qs Ans
Happy Plants BD
No ratings yet
LECTURE NOTES ON DATA MINING and DATA WA
Document84 pages
LECTURE NOTES ON DATA MINING and DATA WA
Ali Azfar
No ratings yet
p144 Data Mining
Document11 pages
p144 Data Mining
jnanesh582
100% (3)
Datamining: by Guan Hang Su Cs157A Section 2 Fall 2005
Document31 pages
Datamining: by Guan Hang Su Cs157A Section 2 Fall 2005
lonelygirl
0% (1)
Intelligent Techniques Assignment
Document7 pages
Intelligent Techniques Assignment
Mano_Bili89
No ratings yet
HaftamuA ArticleReview
Document39 pages
HaftamuA ArticleReview
znabugrmay20adi
No ratings yet
Data Mining Slides
Document65 pages
Data Mining Slides
Kriwaczf
No ratings yet
Current Trends
Document35 pages
Current Trends
icecoolberge
No ratings yet
Unit 3
Document34 pages
Unit 3
varsha.j2177
No ratings yet
Data Mining
Document20 pages
Data Mining
NITIN KALRA
No ratings yet
ITS 3233 Business Intelligent: Data Mining
Document12 pages
ITS 3233 Business Intelligent: Data Mining
yanani
No ratings yet
Analytics and Business Intelligence
Document8 pages
Analytics and Business Intelligence
oureducation.in
No ratings yet
Data Mining and Its Techniques: A Review Paper: Maria Shoukat (MS Student)
Document7 pages
Data Mining and Its Techniques: A Review Paper: Maria Shoukat (MS Student)
mariashoukat
No ratings yet
KMSPquickreviewfinal
Document47 pages
KMSPquickreviewfinal
vk
No ratings yet
Part 1 - Introduction To Big Data
Document24 pages
Part 1 - Introduction To Big Data
asarisetya
No ratings yet
1 DataScience1 91
Document91 pages
1 DataScience1 91
Dikshant Chitara
No ratings yet
Data Mining and Warehousing
Document29 pages
Data Mining and Warehousing
Ayesha Waris
No ratings yet
CPE 445-Internet of Things - Chapter 7
Document39 pages
CPE 445-Internet of Things - Chapter 7
fa20-bce-046
No ratings yet
Data Mining
Document11 pages
Data Mining
Rahul Kalyankar
No ratings yet
12 When To Use Data Mining
Document19 pages
12 When To Use Data Mining
Muhammad Wildam
No ratings yet
Data Mining
Document14 pages
Data Mining
Ankit Gupta
No ratings yet
Unit 3 Data Mining
Document21 pages
Unit 3 Data Mining
badaltanwarr
No ratings yet
Data Warehouse Presentation
Document28 pages
Data Warehouse Presentation
Prasad Dhanikonda
No ratings yet
Data Mining Techniques Unit-1
Document122 pages
Data Mining Techniques Unit-1
Rohan Singh
No ratings yet
What Is Data Mining Again?: Unsuspected Relationships Summarize Understandable and Useful Models
Document29 pages
What Is Data Mining Again?: Unsuspected Relationships Summarize Understandable and Useful Models
Joseph Conteh
No ratings yet
Data Mining Overview
Document24 pages
Data Mining Overview
zoo7675
No ratings yet
# Understanding DM Architecture, KDD & DM Tools
Document29 pages
# Understanding DM Architecture, KDD & DM Tools
Dan Masanga
No ratings yet
1 ST Review Document
Document37 pages
1 ST Review Document
sumanice
No ratings yet
Data Mining Techniques: By-Priyank Yadav CSE
Document8 pages
Data Mining Techniques: By-Priyank Yadav CSE
Sudhakar Tripathi
No ratings yet
DIgitization Week 7
Document6 pages
DIgitization Week 7
Ilion Barboso
No ratings yet
Introduction To Data Mining For Business Analytics
Document51 pages
Introduction To Data Mining For Business Analytics
Sherwin Lopez
No ratings yet
Unit 2
Document10 pages
Unit 2
Yatin6004
No ratings yet
Data Mining Notes
Document14 pages
Data Mining Notes
rishikeshgondcool5
No ratings yet
Data Warehousing and Data Mining
Document84 pages
Data Warehousing and Data Mining
AnishSahni
No ratings yet
Data Mining Notes
Document75 pages
Data Mining Notes
Aravind Rossi
No ratings yet
Why We Need Data Mining?
Document39 pages
Why We Need Data Mining?
Bhanu Royce
No ratings yet
Data Science - Fundamentals and Components
Document21 pages
Data Science - Fundamentals and Components
Banuroopa Velkumar
No ratings yet
Big Data Modeling and Management Systems
From Everand
Big Data Modeling and Management Systems
Alexander Afriyie
No ratings yet
Data Mining: Fundamentals and Applications
From Everand
Data Mining: Fundamentals and Applications
Fouad Sabry
No ratings yet
TOK Assessed Student Work
Document10 pages
TOK Assessed Student Work
Peter Jun Park
100% (1)
EUCLID
Document3 pages
EUCLID
Nandini Mourya
No ratings yet
Real Options BV Lec 14
Document49 pages
Real Options BV Lec 14
Anuranjan Tirkey
No ratings yet
Python Cheat Sheet-1
Document8 pages
Python Cheat Sheet-1
Revathy
No ratings yet
Role of Micro-Financing in Women Empowerment: An Empirical Study of Urban Punjab
Document16 pages
Role of Micro-Financing in Women Empowerment: An Empirical Study of Urban Punjab
Anum Zubair
No ratings yet
EHVAC
Document16 pages
EHVAC
sidharthchandak16
No ratings yet
Description - GB - 98926286 - Hydro EN-Y 40-250-250 JS-ADL-U3-A
Document6 pages
Description - GB - 98926286 - Hydro EN-Y 40-250-250 JS-ADL-U3-A
george
No ratings yet
Terasaki FDP 2013
Document40 pages
Terasaki FDP 2013
MannyBaldonadoDeJesus
100% (1)
Lecture No. 11
Document15 pages
Lecture No. 11
Sayeda Jabbin
No ratings yet
Registration Form - Synergies in Communication - 6th Edition - 2017-Drobot Ana
Document3 pages
Registration Form - Synergies in Communication - 6th Edition - 2017-Drobot Ana
Ana Irina
No ratings yet
End Points Subrogados
Document3 pages
End Points Subrogados
Agustina Andrade
No ratings yet
Project Synopsis On LAN Connection
Document15 pages
Project Synopsis On LAN Connection
ডৰাজবংশী
No ratings yet
Defenders of The Empire v1.4
Document13 pages
Defenders of The Empire v1.4
Iker Antolín Medina
No ratings yet
(Database Management Systems) : Biag, Marvin, B. BSIT - 202 September 6 2019
Document7 pages
(Database Management Systems) : Biag, Marvin, B. BSIT - 202 September 6 2019
Marcos Jeremy
No ratings yet
Cryptography Lab DA-1
Document19 pages
Cryptography Lab DA-1
Gautam Thothathri 19MIC0092
No ratings yet
Chapter 3 Payroll
Document5 pages
Chapter 3 Payroll
Pheng Tiosen
100% (2)
Historical Roots of The "Whitening" of Brazil
Document23 pages
Historical Roots of The "Whitening" of Brazil
FernandoMascarenhas
No ratings yet
BGP PDF
Document100 pages
BGP PDF
Jeya Chandran
No ratings yet
Denial of LOI & LOP For Ayurveda Colleges Under 13A For AY-2021-22 As On 18.02.2022
Document1 page
Denial of LOI & LOP For Ayurveda Colleges Under 13A For AY-2021-22 As On 18.02.2022
Gbp Gbp
No ratings yet
Modelsim
Document47 pages
Modelsim
Kishor Kumar
No ratings yet
GSM BSC6000 Performance Statistics
Document72 pages
GSM BSC6000 Performance Statistics
Ali Alshwal
No ratings yet
DLL - English 5 - Q3 - W8
Document8 pages
DLL - English 5 - Q3 - W8
Merlyn S. Al-os
No ratings yet
Clevite Bearing Book EB-40-07
Document104 pages
Clevite Bearing Book EB-40-07
lowelowel
No ratings yet
FDP VLSI Design at Deep Submicron Node PDF
Document2 pages
FDP VLSI Design at Deep Submicron Node PDF
praneethshub
No ratings yet
Rubric For Audio Speech Delivery
Document2 pages
Rubric For Audio Speech Delivery
Marie Sol Pangan
No ratings yet
3
Document76 pages
3
Uday Shankar
No ratings yet