Professional Documents
Culture Documents
DISEASE PREDICTION
USING
NAVE BAYES CLASSIFIER
PRESENTED BY:-
AMITESH GAURAV
ASHOK RAJAK
SHANU SONI
ABSTRACT
The main objective of this research is to develop an Intelligent System using data
mining modeling technique, name, Naive Bayes.
It is implemented as web based application in this user answers the predefined
questions.
It retrieves hidden data from stored database and compares the user values with trained
data set.
It can answer complex queries for diagnosing heart disease and thus assist healthcare
practitioners to make intelligent clinical decisions which traditional decision support
systems cannot.
By providing effective treatments, it also helps to reduce treatment costs.
INTRODUCTION
The Bayes theorem was developed and named for THOMAS BAYES (1702-1761).
Naive because it is based on independence assumption.
Describes what makes something "evidence" and how much evidence it is.
Bayesian Classifiers are statistical classifiers.
They can predict the probability that a data item is a member of a particular class.
Original
Belief
Observation
=
New Belief
EXAMPLE
Out of the 1030 women who get positive mammographies only 80 actually
have breast cancer, therefore, the probability is 80/1030 or 7.767%
USING BAYES
ALGORITHM
P(AB) P(B)
P(A)
P(B), P(AB), and P(AB) are known. P(A) is needed to find P(BA).
P(A) = P(AB) P(B) + P(AB) P(B)
P(A) = (0.8) ( 0.01) + (0.096) (0.99)
P(A) = 0.1030
P(BA) =
(0.8) (0.01)
(0.1030)
P(BA) = 0.07767
Why to prefer naive Bayes implementation :1) When the data is high.
2) When the attributes are independent of each other.
3) When we expect more efficient output, as compared to other methods
output.
DATA SOURCE
Predictable attribute:1. Diagnosis (value 0: <50% diameter narrowing (no heart disease); value 1: >50% diameter narrowing
(has heart disease))
IMPLEMENTATION OF BAYESIAN
CLASSIFICATION
The Nave Bayes Classifier technique is mainly applicable when the dimensionality of the inputs is high.
Despite its simplicity, Naive Bayes can often outperform more sophisticated classification methods.
Nave Bayes model recognizes the characteristics of patients with heart disease.
It shows the probability of each input attribute for the predictable state.
CONCLUSION
Decision Support in Heart Disease Prediction System is developed using Naive Bayesian
Classification .
The system extracts hidden knowledge from a historical heart disease database.
This model could answer complex queries, each with its own strength with ease of model
interpretation and an easy access to detailed information and accuracy.
The system is expandable in the sense that more number of records or attributes can be
incorporated and new significant rules can be generated using underlying Data Mining
technique.
Presently the system has been using 9 attributes of medical diagnosis.
It can also incorporate other data mining techniques and additional attributes for prediction.
PROJECT REFERENCES
http://www.tutorialspoint.com/data_mining/dm_bayesian_classification.htm
https://en.wikipedia.org/wiki/Statistical_classification
jmlr.csail.mit.edu/proceedings/papers/v6/mani10a/mani10a.pdf
http://www.cse.sc.edu/~rose/587/PPT/NaiveBayes
http://ic.unicamp.br/~rocha/teaching/2011s2/.../naive-bayes-classifier.pdf