Professional Documents
Culture Documents
What is Statistics?
1.1
What is Statistics?
Statisticsisawaytogetinformationfromdata.
1.2
What is Statistics?
Statisticsisawaytogetinformationfromdata
Statistics
Data
Information
Statisticsisatoolforcreatingnewunderstandingfromasetof
numbers.
Definitions:OxfordEnglishDictionary
Copyright 2009 Cengage Learning
1.3
1.4
1.5
1.6
1.7
1.8
1.9
Descriptive Statistics
Descriptivestatisticsdealswithmethodsoforganizing,
summarizing,andpresentingdatainaconvenientand
informativeway.
Oneformofdescriptivestatisticsusesgraphicaltechniques,
whichallowstatisticspractitionerstopresentdatainwaysthat
makeiteasyforthereadertoextractusefulinformation.
Chapter2introducesseveralgraphicalmethods.
1.10
Descriptive Statistics
Anotherformofdescriptivestatisticsusesnumerical
techniquestosummarizedata.
Themeanandmedianarepopularnumericaltechniquesto
describethelocationofthedata.
Therange,variance,andstandarddeviationmeasurethe
variabilityofthedata
Chapter4introducesseveralnumericalstatisticalmeasures
thatdescribedifferentfeaturesofthedata.
Copyright 2009 Cengage Learning
1.11
studentshasofferedPepsiColaanexclusivityagreementthat
wouldgivePepsiexclusiverightstosellitsproductsatall
universityfacilitiesforthenextyearwithanoptionforfuture
years.
Inreturn,theuniversitywouldreceive35%oftheoncampus
revenuesandanadditionallumpsumof$200,000peryear.
Pepsihasbeengiven2weekstorespond.
1.12
Pepsicurrentlysellsanaverageof22,000cansperweek(over
the40weeksoftheyearthattheuniversityoperates).
Thecanssellforanaverageof75centseach.Thecosts
includinglaboramountto20centspercan.
Pepsiisunsureofitsmarketsharebutsuspectsitis
considerablylessthan50%.
Copyright 2009 Cengage Learning
1.13
Pepsiwouldsell88,000(22,000is25%of88,000)cansper
weekor3,520,000cansperyear.
Theprofitorlosscanbecalculated.
Theonlyproblemisthatwedonotknowhowmanysoft
drinksaresoldweeklyattheuniversity.
Copyright 2009 Cengage Learning
1.14
Accordingly,sheorganizesasurveythatasks500studentsto
keeptrackofthenumberofsoftdrinkstheypurchaseinthe
next7days.
Theresponsesarestoredinafileonthediskthataccompanies
thisbook.Case12.1
1.15
Inferential statistics
TheinformationwewouldliketoacquireinCase12.1isan
estimateofannualprofitsfromtheexclusivityagreement.The
dataarethenumbersofcansofsoftdrinksconsumedin7days
bythe500studentsinthesample.
Wewanttoknowthemeannumberofsoftdrinksconsumed
byall50,000studentsoncampus.
Toaccomplishthisgoalweneedanotherbranchofstatistics
inferentialstatistics.
1.16
Inferential statistics
Inferentialstatisticsisabodyofmethodsusedtodraw
conclusionsorinferencesaboutcharacteristicsofpopulations
basedonsampledata.Thepopulationinquestioninthiscase
isthesoftdrinkconsumptionoftheuniversity's50,000
students.Thecostofinterviewingeachstudentwouldbe
prohibitiveandextremelytimeconsuming.Statistical
techniquesmakesuchendeavorsunnecessary.Instead,wecan
sampleamuchsmallernumberofstudents(thesamplesizeis
500)andinferfromthedatathenumberofsoftdrinks
consumedbyall50,000students.Wecanthenestimateannual
profitsforPepsi.
1.17
Example 12.5
Whenanelectionforpoliticalofficetakesplace,thetelevision
networkscancelregularprogrammingandinsteadprovide
electioncoverage.
Whentheballotsarecountedtheresultsarereported.
However,forimportantofficessuchaspresidentorsenatorin
largestates,thenetworksactivelycompetetoseewhichwill
bethefirsttopredictawinner.
1.18
Example 12.5
Thisisdonethroughexitpolls,whereinarandomsampleof
voterswhoexitthepollingboothisaskedforwhomthey
voted.
Fromthedatathesampleproportionofvoterssupportingthe
candidatesiscomputed.
Astatisticaltechniqueisappliedtodeterminewhetherthereis
enoughevidencetoinferthattheleadingcandidatewillgarner
enoughvotestowin.
1.19
Example 12.5
TheexitpollresultsfromthestateofFloridaduringthe2000
yearelectionswererecorded(onlythevotesoftheRepublican
candidateGeorgeW.BushandtheDemocratAlbertGore).
Supposethattheresults(765peoplewhovotedforeitherBush
orGore)werestoredonafileonthedisk.(1=Goreand2=
Bush)
Xm1205
Thenetworkanalystswouldliketoknowwhethertheycan
concludethatGeorgeW.BushwillwinthestateofFlorida.
1.20
Example 12.5
Example12.5describesaverycommonapplicationof
statisticalinference.
Thepopulationthetelevisionnetworkswantedtomake
inferencesaboutistheapproximately5millionFloridianswho
votedforBushorGoreforpresident.
Thesampleconsistedofthe765peoplerandomlyselectedby
thepollingcompanywhovotedforeitherofthetwomain
candidates.
1.21
Example 12.5
Thecharacteristicofthepopulationthatwewouldliketo
knowistheproportionofthetotalelectoratethatvotedfor
Bush.
Specifically,wewouldliketoknowwhethermorethan50%
oftheelectoratevotedforBush(countingonlythosewho
votedforeithertheRepublicanorDemocraticcandidate).
1.22
Example 12.5
Becausewewillnotaskeveryoneofthe5millionactual
votersforwhomtheyvoted,wecannotpredicttheoutcome
with100%certainty.
Asamplethatisonlyasmallfractionofthesizeofthe
populationcanleadtocorrectinferencesonlyacertain
percentageofthetime.
Youwillfindthatstatisticspractitionerscancontrolthat
fractionandusuallysetitbetween90%and99%.
1.23
Sample
Asampleisasetofdatadrawnfromthe
population.
Potentiallyverylarge,butlessthanthepopulation.
E.g.asampleof765votersexitpolledonelectionday.
Copyright 2009 Cengage Learning
1.24
1.25
Sample
Subset
Parameter
Statistic
PopulationshaveParameters,
SampleshaveStatistics.
Copyright 2009 Cengage Learning
1.26
Descriptive Statistics
aremethodsoforganizing,summarizing,andpresenting
datainaconvenientandinformativeway.Thesemethods
include:
GraphicalTechniques(Chapter2),and
NumericalTechniques(Chapter4).
Theactualmethoduseddependsonwhatinformationwe
wouldliketoextract.Areweinterestedin
measure(s)ofcentrallocation?and/or
measure(s)ofvariability(dispersion)?
DescriptiveStatisticshelpstoanswerthesequestions
Copyright 2009 Cengage Learning
1.27
Inferential Statistics
DescriptiveStatisticsdescribethedatasetthatsbeing
analyzed,butdoesntallowustodrawanyconclusionsor
makeanyinterferencesaboutthedata.Henceweneed
anotherbranchofstatistics:inferentialstatistics.
Inferentialstatisticsisalsoasetofmethods,butitisusedto
drawconclusionsorinferencesaboutcharacteristicsof
populationsbasedondatafromasample.
1.28
Statistical Inference
Statisticalinferenceistheprocessofmakinganestimate,
prediction,ordecisionaboutapopulationbasedonasample.
Population
Sample
Inference
Statistic
Parameter
WhatcanweinferaboutaPopulationsParameters
basedonaSamplesStatistics?
Copyright 2009 Cengage Learning
1.29
Statistical Inference
Weusestatisticstomakeinferencesaboutparameters.
Therefore,wecanmakeanestimate,prediction,ordecision
aboutapopulationbasedonsampledata.
Thus,wecanapplywhatweknowaboutasampletothe
largerpopulationfromwhichitwasdrawn!
1.30
Statistical Inference
Rationale:
Largepopulationsmakeinvestigatingeachmemberimpractical
andexpensive.
Easierandcheapertotakeasampleandmakeestimatesaboutthe
populationfromthesample.
However:
Suchconclusionsandestimatesarenotalwaysgoingtobecorrect.
Forthisreason,webuildintothestatisticalinferencemeasuresof
reliability,namelyconfidencelevelandsignificancelevel.
1.31
Whenthepurposeofthestatisticalinferenceistodrawa
conclusionaboutapopulation,thesignificancelevel
measureshowfrequentlytheconclusionwillbewronginthe
longrun.
E.g.a5%significancelevelmeansthat,inthelongrun,thistype
ofconclusionwillbewrong5%ofthetime.
1.32
1.33
Inthiscase,ourconfidencelevelis95%(19/20=0.95),
whileoursignificancelevelis5%.
1.34
1.35