Assignment 6

Uploaded by

j__d


An assignment to understand Weka


Learning Objectives:
1. Get practice with a more realistically scaled machine learning problem than those in prior assignments. This gives you the opportunity to apply many of the techniques we have explored so far.
2. Apply features of TagHelper and Weka for processing raw text data.
3. Investigate issues related to feature space design.

Description:
Before we watch a movie, we often read reviews of it. Movie reviews typically also indicate whether the movie is good or bad (with thumbs up/down symbols) or provide a rating on a scale of 0-5 or 0-10. A machine learning algorithm that decides whether a review is positive or negative takes the review text as input and predicts the polarity of the review. In this assignment, we are given a set of reviews of several movies, each labeled as positive or negative. The goal is to build a classifier that correctly assigns a positive or negative tag to each movie review text. Use MovieReviews.xls.

Step-by-Step Guide:
1. Complete the week 6 and 7 assigned readings, and review the lecture slides from week 7, especially where instructions for using TagHelper tools were given.
2. Manually examine some examples from the movie review data and note likely features that could predict whether a review is negative or positive.
3. Read the file into TagHelper tools and configure the customization panel so that you are using only unigram features, with attribute selection to keep the top 200 of these attributes, and do the classification using SMO. After you have run TagHelper tools, you will find a performance report and output file in the OUTPUT directory as well as a .arff file in the ARFF directory. (Rename it as your baseline .arff file and set it aside with the performance report.)
   a. Make a note of the baseline performance as indicated in the performance report.
4. Do an error analysis and determine where the machine learning algorithm is making mistakes.
5. Load the .xls file into TagHelper tools again. Based on your error analysis, configure TagHelper to try to compensate for the confusions you observed. You may wish to create some new features using the Advanced Feature Editor. Now run TagHelper to obtain a new .arff file and performance report. Label the .arff file Final.arff.
6. Compare the performance obtained in Step 5 with that of Step 3 using the Experimenter to determine whether any observed difference in performance is statistically significant.
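To make the baseline feature space in Step 3 more concrete, here is a minimal Python sketch of binary unigram features followed by top-k attribute selection. It is not TagHelper's implementation: the chi-squared ranking, the tokenizer, the "pos"/"neg" label strings, and every function name below are assumptions chosen for illustration, and the assignment does not say which selection metric TagHelper applies.

```python
import re
from collections import Counter


def unigrams(text):
    """Lowercase the text and return the set of distinct word tokens."""
    return set(re.findall(r"[a-z']+", text.lower()))


def chi_squared(n11, n10, n01, n00):
    """Chi-squared statistic for a 2x2 term-by-class contingency table."""
    n = n11 + n10 + n01 + n00
    denom = (n11 + n10) * (n01 + n00) * (n11 + n01) * (n10 + n00)
    if denom == 0:
        return 0.0
    return n * (n11 * n00 - n10 * n01) ** 2 / denom


def top_k_unigrams(reviews, labels, k=200):
    """Return the k unigrams most associated with the class label.

    Labels are assumed to be the strings "pos" and "neg" (hypothetical).
    """
    pos_docs = sum(1 for y in labels if y == "pos")
    neg_docs = len(labels) - pos_docs
    pos_df, neg_df = Counter(), Counter()  # document frequencies per class
    for text, y in zip(reviews, labels):
        (pos_df if y == "pos" else neg_df).update(unigrams(text))

    scores = {}
    for term in set(pos_df) | set(neg_df):
        n11 = pos_df[term]        # positive reviews containing the term
        n10 = neg_df[term]        # negative reviews containing the term
        n01 = pos_docs - n11      # positive reviews without the term
        n00 = neg_docs - n10      # negative reviews without the term
        scores[term] = chi_squared(n11, n10, n01, n00)

    # Sort by descending score; break ties alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:k]


def to_vector(text, vocabulary):
    """Binary presence/absence vector over the selected unigrams."""
    present = unigrams(text)
    return [1 if term in present else 0 for term in vocabulary]


# Toy data standing in for rows of MovieReviews.xls (invented examples).
reviews = [
    "a great and moving film",
    "great acting, great story",
    "a dull and boring film",
    "boring plot, dull acting",
]
labels = ["pos", "pos", "neg", "neg"]

vocabulary = top_k_unigrams(reviews, labels, k=3)
print(vocabulary)
print(to_vector("a great film", vocabulary))
```

On real data, k would be 200 as in Step 3, and the resulting vectors would be passed to a classifier such as SMO. Looking at which unigrams survive selection is also a useful starting point for the error analysis in Step 4.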

Deliverables:
1. Your baseline .arff file and your final .arff file after experimentation and modification.
2. A write-up of your experimentation process that includes:
   a. Your observations from your initial exploratory analysis of the data.
   b. A description of your baseline performance and error analysis.
   c. A description of what you tried for improving results over your baseline and why you thought it would work.
   d. A comparison of the results of your final approach with your baseline approach.
   e. Was it significantly better?

NOTE: Your report should display your understanding of the concepts and the logic/process you have chosen to uncover the hidden features in the text. Your report should explain why a particular technique seemed to be working or not working. The final performance (high or low accuracy) plays a secondary role in reviewing your report.
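For the "significantly better?" question, it may help to see what a paired comparison over cross-validation folds computes. The sketch below implements the corrected resampled paired t-statistic, t = mean(d) / sqrt((1/k + n_test/n_train) * var(d)), where d holds the per-fold differences in accuracy. As far as I know, the Experimenter offers a test of this kind, but treat that and the function name as assumptions; this is an illustration and not a replacement for running the Experimenter.

```python
import math
from statistics import mean, variance


def corrected_paired_t(baseline_scores, final_scores, test_train_ratio):
    """Corrected resampled paired t-statistic and its degrees of freedom.

    baseline_scores, final_scores: per-fold accuracies from the same folds.
    test_train_ratio: n_test / n_train, e.g. 1/9 for 10-fold cross-validation.
    """
    if len(baseline_scores) != len(final_scores):
        raise ValueError("both score lists must cover the same folds")
    diffs = [f - b for b, f in zip(baseline_scores, final_scores)]
    k = len(diffs)
    var = variance(diffs)  # sample variance (divides by k - 1)
    if var == 0:
        raise ValueError("all differences are identical; t is undefined")
    t = mean(diffs) / math.sqrt((1.0 / k + test_train_ratio) * var)
    return t, k - 1


# Invented per-fold accuracies for illustration only (not real results).
baseline = [0.80, 0.82, 0.78, 0.81, 0.79, 0.80, 0.83, 0.77, 0.80, 0.81]
final = [0.83, 0.84, 0.80, 0.82, 0.83, 0.81, 0.85, 0.80, 0.82, 0.84]

t_stat, dof = corrected_paired_t(baseline, final, test_train_ratio=1 / 9)
print(f"t = {t_stat:.3f} with {dof} degrees of freedom")
```

With 10 folds there are 9 degrees of freedom, so a two-tailed test at the 0.05 level calls the difference significant when |t| exceeds about 2.262. The correction term n_test/n_train widens the variance estimate because folds share training data and are therefore not independent samples.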
