You are on page 1of 18

COMP527: Data Mining

COMP527: Data Mining

DrRobertSanderson (azaroth@liv.ac.uk)

Dept.ofComputerScience UniversityofLiverpool 2008

Isn'tthisslideterriblyuseful?Areyouintherightplace?No?Goingtostayanyway?Goon!Goodforyou!

Introduction to the Course

January 18, 2008

Slide 1

COMP527: Data Mining IntroductiontotheCourse

COMP527: Data Mining


InputPreprocessing AttributeSelection AssociationRuleMining ARM:APrioriandDataStructures ARM:Improvements ARM:AdvancedTechniques Clustering:Challenges,Basics Clustering:Agglomerative/Divisive Clustering:AdvancedAlgorithms HybridApproaches GraphMining,WebMining TextMining:Challenges,Basics TextMining:TextasData TextMining:TextasLanguage RevisionforExam

IntroductiontoDataMining IntroductiontoTextMining GeneralDataMiningIssues DataWarehousing Classification:Challenges,Basics Classification:Rules Classification:Trees Classification:Trees2 Classification:Bayes Classification:NeuralNetworks Classification:SVM Classification:Evaluation Classification:Evaluation2 Regression,Prediction

Introduction to the Course

January 18, 2008

Slide 2

COMP527: Data Mining

Today's Topics

Me,You:Introductions Lectures Tutorials References CourseSummary Assessment SomethingFun*

*Oratleastmorefun,hopefully

Introduction to the Course

January 18, 2008

Slide 3

COMP527: Data Mining

Introductions Dr.RobertSanderson

Office: 1.04,AshtonBuilding Extension: 54252[external:7954252] Email: azaroth@liv.ac.uk Web: http://www.csc.liv.ac.uk/~azaroth/ Hours: 10:00to18:00,notThursday Emailforatime,orshowupatanytimeknowingthat Imightnotbethere. Where'syouraccentfrom:NewZealand

Introduction to the Course

January 18, 2008

Slide 4

COMP527: Data Mining

Not Me!

SoyouwenttoWaikato? YourPhDisinDataMining? ...ComputerScience? ...Science?Math?Engineering? YouatleastwriteJava? ...C++? WhatsortofCSLecturerareyou?!

Introduction to the Course

January 18, 2008

Slide 5

COMP527: Data Mining

Me!

WenttoUniversityofCanterbury(NZ,notKent) ...ButIdoknowIanWittenquitewell. PhDisinFrench/History ...ButfocusedonComputingintheHumanities/Informatics Python! InformationScience:InformationRetrieval,DataMining,Text Mining,XML,Databases,Interoperability,GridProcessing, DigitalPreservation...

Introduction to the Course

January 18, 2008

Slide 6

COMP527: Data Mining

You!

...

Introduction to the Course

January 18, 2008

Slide 7

COMP527: Data Mining

Lectures

LectureSlots: Monday: Tuesday: Thursday: 1011am Here 1011am Here 910amHere

Courserequirement:30hoursoflectures SemesterTimetable: 8weeksclass,3weekseaster,4weeksclass.

Dates: 21stJanuaryto11thofMarch(Rob@conferenceon14th) 7thAprilto21stApril (Butmayrunto25th?)


Introduction to the Course January 18, 2008 Slide 8

COMP527: Data Mining

Tutorials/Lab Sessions

Location:
Lab 6, Tuesdays 3-4pm (just before departmental seminar)

Aims:
Provide time for practical experience Answer any questions from lectures/reading Informal self-assessment exercises

Software:
Data mining 'workbench' software WEKA installed on Windows image. May be available under Linux. Freely downloadable from University of Waikato:

http://www.cs.waikato.ac.nz/ml/weka/
Introduction to the Course January 18, 2008 Slide 9

COMP527: Data Mining

Course Web Sites

Departmental Home Page:


http://www.csc.liv.ac.uk/teaching/modules/newmscs2/comp527.html

Lecture Notes, Assignments, Exercises: http://www.csc.liv.ac.uk/~azaroth/courses/current/comp527/

Introduction to the Course

January 18, 2008

Slide 10

COMP527: Data Mining

Reference Texts

Witten,IanandEibeFrank,DataMining:PracticalMachineLearningToolsand Techniques,SecondEdition,MorganKaufmann,2005 Dunham,MargaretH,DataMining:IntroductoryandAdvancedTopics,Prentice Hall,2003

Introduction to the Course

January 18, 2008

Slide 11

COMP527: Data Mining

Frequently Used Resources

HanandKamber,DataMining:ConceptsandTechniques,Second Edition,MorganKaufmann,2006 Berry,Browne,LectureNotesinDataMining,WorldScientific,2006 BerryandLinoff,DataMiningTechniques,SecondEdition,Wiley,2004 Zhang,AssociationRuleMining,Springer,2002 Konchady,TextMiningApplicationProgramming,Thomson,2006 Weissetal.,TextMining:PredictiveMethodsforAnalyzing UnstructuredInformation,Springer,2005 Inmon,BuildingtheDataWarehouse,Wiley,1993 KDD (http://www.kdd2007.com) PAKDD (http://lamda.nju.edu.cn/conf/PAKDD07/) PKDD (http://www.ecmlpkdd2008.org/)
January 18, 2008 Slide 12

Introduction to the Course

COMP527: Data Mining

Frequently Used Websites http://citeseer.ist.psu.edu/ http://www.kdnuggets.com/ http://kdd.ics.uci.edu/

CiteSeer: KDNuggets: UCIRepository:

(plusfollowlinktoMachineLearningArchive)

Wikipedia: MathWorld: GoogleScholar: NaCTeM:

http://en.wikipedia.org/wiki/Data_mining http://mathworld.wolfram.com/ http://scholar.google.com/ http://www.nactem.ac.uk/


January 18, 2008 Slide 13

Introduction to the Course

COMP527: Data Mining

Course Summary

Introduction, Basics: Data Warehousing: Classification: Input Preprocessing: Association Rule Mining Clustering: Hybrid Approaches: Graph Mining: Text Mining: Revision: Total:

4 lectures 1 lecture 10 lectures 2 lectures 4 lectures 3 lectures 1 lecture 1 lecture 3 lectures 1 lecture 30 lectures

Introduction to the Course

January 18, 2008

Slide 14

COMP527: Data Mining

Assessment

75% End of Year Exam: 2 hours Short Answer and/or Essays Choose 4 of 5 sections 25% Continuous Assessment: 12% Assignment 1 (Due 2008-03-10 16:00:00) 13% Assignment 2 (Due 2008-04-25 16:00:00) Self assessment exercises Weekly (or as desired) during tutorial session

Introduction to the Course

January 18, 2008

Slide 15

COMP527: Data Mining

And Now...

... what you've all been waiting for ...

Something Fun! *

* (Or more fun than the rest of the lecture at least, your mileage may vary, opinions expressed herein bla bla bla)
Introduction to the Course January 18, 2008 Slide 16

COMP527: Data Mining

Nomic Mao The Rules:

Eachplayerisdealt7cardsbythedealer Thefirstpersontohavenocardsinhandwins Everyturn,eachplayerdiscardsacard Playstartswiththepersontotheleftofthedealerandproceeds totheleft Thedealerandthenthewinnerofeachroundmakesasecret rule Ifyoubreakarule,youreceiveapenaltyfromtherule'screator Thepenaltyis:Youmustdrawonecard

Introduction to the Course

January 18, 2008

Slide 17

COMP527: Data Mining

Advanced Rules

Laterrulesmayoverturnearlierrules,eithercompletelyorinpart Eachrulemayonlychangeoneaspectofthegameplay Penaltyconditionsforbreakingrulesinclude: Illegalcardplayed(egblackonred) Proceduralerror(egplayingoutofturn) Incorrectpenalty(egwhenalaterruleenablesaplay) Eachruleisnumbered(eg:ProceduralerrorunderRule3) Whentakingapenaltyforplayingoutofturn,ordiscarding multiplecards,youmustreturnthestateofthegametoasitwas beforethepenaltyandthenthepenaltyisincurred.

Introduction to the Course

January 18, 2008

Slide 18

You might also like