Professional Documents
Culture Documents
Health
Why statistics
Introduction
In all walks of life, especially health care, we are under
constant threat of being overwhelmed by data. But
unless we are going to practice on the basis of hunches,
we need these data to make good decisions and so
optimize our treatment or policy plans. So, our first
objective must be to understand a little about data.
Data do not arise by magic; data collection is
determined by a human, with all his or her prejudices
and proneness to error. We rely on instruments, such as
questionnaires, mercury barometers and
spectrophotometers, to measure and record variables of
interest, such as peoples ages or blood pressures, or
amounts of chloro-fluorocarbons in the atmosphere.
But all instruments have limitations on their accuracy.
Introduction (contd..)
Humans tend to accept a body of raw data at
face value; a computer print-out of numbers
can appear to be an unalterable truth. It
may, in fact, be junk. Subsequent statistical
processing will not recycle junk into truth
(garbage in, garbage outprinciple).
The next problem is: how do we extract
meaningful information from masses of raw
data? (How do we see the forest, when the
trees keep getting in the way?)
Definition
Statistics
A science that deals with principles & methods
for the collection, presentation, analysis &
interpretation of data
A science that deals with the study of
aggregates or total or population
Bio-Statistics
Statistical methods used in the field of Health,
Biology, Medicine, Public Health
Some Basics
Measurement Scales
Depending on the nature of the variable, we have
different measurement scales.
Firstly, not all data are numbers, though they are often
represented as numbers. choose to code males as 1
and females as 2, especially when entering the data
into a computer. In this example, the numbers 1 and 2
have none of the usual properties of numbers
Two observations are equal if both are female (or both
male) and non-equal otherwise. Such a scale is called
nominal or categorical, and because of this lack of
mathematical properties it is called the weakest
scale.
Measurement of scales
(contd..)
A slightly stronger scale, one that uses the
mathematical notion of ordering is the ordinal scale.
An example of a variable measured on an ordinal scale
might be the position of an examination candidate in
the results order of merit. The number of the ranked
position tells us nothing about how much difference
exists between successive positions on the scale.
Interval scale: It possesses strong mathematical
properties due to the fact that in this scale equal
differences between points represent equal differences
in the measured quantity. For example, the difference
between 12 metres and 11 metres is the same as the
difference between 4 metres and 3 metres.
Measurement of scales
The ratio scale is considered a refinement of the
interval scale. In this scale, the order and size of
interval are important, but the ratio between two
measures also has meaning. This occurs when
there is a true zero point associated with the
scale.
For example,
temperature in degrees Celsius (interval scale)
Temperature in Degree Kelvin (ratio scale).
Simplification of data
The stronger measures can always be collapsed down to
form a weaker measure but not vice versa. Eg. Height of
students
when data are simplified or summarised, information is always
lost, understanding may be gained.
Here, if we measure heights on enough people, we may be
overwhelmed by the number of individual measurements, but
being able to say that x% are tall and (100 - x)% are short
may give a useful insight into this aspect of the population.
Of course, using this nominal scale of measuring height we
are no longer able to say what an individuals actual height is.
The reason we stress scales of measure is that the information
content of data depends on the scale, and different
descriptive techniques and different statistical tests are
appropriate to different scales
Descriptive statistics
Descriptive statistics includes methods for
presenting and summarising data.
These allow us to digest and understand
large quantities of data, and to effectively
communicate to others important aspects
of our research.
Descriptive statistics
Frequency Distributions and Data
Presentation
Table
grams/day
frequency
0-9
0.125
10-19
20-29
30-39
40-59
60-99
frequency
relative
125
125/1000 =
250
400
150
50
25
250/1000
400/1000
150/1000
50/1000
25/1000
=
=
=
=
=
0.250
0.400
0.150
0.050
0.025
Bar diagram
Inferential statistics
The inferential approach helps to decide
whether the outcome of the study is a result of
factors planned within design of the study or
determined by chance.
The two approaches are often used
sequentially in that first data are described
with descriptive statistics, and then additional
statistical manipulations are done to make
inferences about the likelihood that the
outcome the outcome was due to chance
through inferential statistics.
Statistical applications in
Health
Normal values of a characteristic
How to classify an individual as healthy
or sick?
Needs treatment or not ?
Based on the normal values of certain
clinical, laboratory & other measurements
Normal here is of statistical concept and
depends on the distribution of the
characteristic in the Population
Statistical applications in
Health
Most often in Health research, we need to
know whether the level of a parameter is
same between two or more groups
Eg.
Is the birth wt same in male &
female?
Is it same in babies born to
educated &
uneducated mothers?
We can answer this only by statistical
means (Testing of hypotheses)
Statistical applications in
Health
Sometimes, the interest could be the
relation between 2 variables Eg. Weeks
of gestation & Birth weight of the
newborn
We wish to know whether a change in
one brings a change in the other
Statistical methods that help in this are:
Exploratory - Scatter plot
Quantification Correlation Coefficient
Statistical applications in
Health
Quantifying the influence of one factor on another
How much birth weight can be gained if the
delivery can be postponed by a week?
How likely the new therapy improve
clinical
outcomes?
What is risk of breast cancer in a woman, if
her
presence of
cotinine in urine
Statistical applications in
Health
Agreement between 2 diagnostic
tests
Statistical applications in
Health
Development of new drugs and treatment modalities
What is the tolerable dose of a new drug?
What are the pharmaco-kinetics of the drug?
What is the effect of new drug/treatment in
condition?
treating a
with the
Statistical applications in
Health
Ensuring maximum benefit of
diagnosis and treatment with
minimum cost, based on available
resources
(Health Economics & Operational
Research)
Statistical applications in
Health
(Vital statistics)
Statistical applications in
Health
Population dynamics
Growth
Rural Urban components
Migration
Age composition
Expectation of life at birth
(Demography)
Statistical applications in
Health
Estimating the magnitude of various diseases
Distribution w.r. to age, place, time
Identifying the possible causative factors
Principles of different study designs for different
objectives
Control of Confounding, Bias
Role of any interactions
(Epidemiological methods)
Statistical applications in
Health
Estimating the potency and relative potency
of drugs
What is the dose at which 50% of subjects
respond? (LD50, ED50)
Eg. If the relative potency of a drug (A)
compared to another drug (B) is 1.5, then 1
unit of drug A is equivalent to 1.5 units of
drug B
(Biological Assays)
Statistical applications in
Health
Maintaining the quality of:
Drugs
Laboratory Instruments
Surgical Instruments
Statistical applications in
Health
Synthesis of data from separate but
similar, comparable studies to have a
quantitative summary of pooled results
Aim is to integrate the findings, pool the
data and find overall trend of results
Statistical applications in
Health
Use of current best evidence derived from
published clinical & epidemiological research in
management of patients, with due attention to:
Balance of risks & benefits of diagnostic tests
Alternative treatment regimens
Each patients unique circumstances, including
baseline risk, co-morbidities & personal
preferences
Population vs Sample
Note that results of sample are of interest only
if they tell us about the population from which
the sample is drawn
Intuitively, the bigger the Sample, the more
confident we are about the applicability of
results to the population
The more the Sample is representative of the
Population (Total), the more confident we are
So for a given investigation, clear definition of
Population & Sample and careful selection of
the Sample are very important
Qualitative Variables
Since Qualitative variables can a take a few values or
categories, they are also called Categorical variables
For example, any individual can have either A or B or
AB or O blood group, but no in between values
Examples: Race, smoking status, gender etc.
Quantitative Variables
Quantitative variables as the name indicates
can take any value on a continuous
spectrum.
For convenience sake we measure them in
some units (rounding off) however finer
they are (height in metres, cms, mm etc.)
But the actual values can lie on a continuous
spectrum. So they are also called Continuous
variables
Examples: Blood sugar, serum creatinine,
age, income, height, weight etc.
Variables
Qualitative
Quantitative
Continuous
Eg.
Temperature
Discrete
Eg.
Viral load
Nominal
Eg.
Bl. group
Ordinal
Eg.
Stage of dis.
Thank you