
COB Colloquium

4-29-14
Justin Jee
Speaker: Constantin Aliferis
Title: Frontier Problems in Feature Selection for Big Data Analytics
What is big data? Is it defined by the 3 Vs: Volume, Velocity, and
A(V)ailability? Well, perhaps. Aliferis presents a model in which big data
really means global or population-level data with millions of dimensions,
as opposed to small data, which is clinical or demographic data based on
selected biomarkers. Analyzing such data to develop a predictive model has
many facets, including filtering, correlation, and so on. Nevertheless,
such approaches are widely used in advertising, finance, and numerous
other industries.
The methods boil down to regularization, dimensionality reduction,
model selection parsimony criteria, feature construction, and, most
importantly, feature selection. The typical formulation of the feature
selection problem for supervised learning is to find the smallest set of
variables that yields maximum predictivity for the response T. A related
way of stating the problem is via irrelevancy: if variable A is
irrelevant for T, we can safely drop it from further consideration
without compromising our ability to predict T.
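As a concrete reading of that formulation, here is a toy sketch that scores every feature subset by cross-validation and then returns the smallest subset whose predictivity is within a small tolerance of the best. The synthetic dataset, classifier, and tolerance are illustrative assumptions, and exhaustive search is only feasible for a handful of variables, never for millions of dimensions:

    from itertools import combinations

    from sklearn.datasets import make_classification
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import cross_val_score

    # Small synthetic problem: 6 candidate features, binary response T.
    X, T = make_classification(n_samples=300, n_features=6, n_informative=2,
                               n_redundant=2, random_state=0)

    # Score every non-empty feature subset by cross-validated accuracy.
    scores = {}
    for size in range(1, X.shape[1] + 1):
        for subset in combinations(range(X.shape[1]), size):
            clf = LogisticRegression(max_iter=1000)
            scores[subset] = cross_val_score(clf, X[:, list(subset)], T, cv=5).mean()

    # Smallest subset whose predictivity is within tolerance of the maximum.
    best = max(scores.values())
    winner = min((s for s, v in scores.items() if v >= best - 0.01), key=len)
    print(f"best accuracy {best:.3f}; smallest near-optimal subset: {winner}")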
Aliferis defines his terms. Strong relevancy: if variable A is strongly
relevant for T, we should never drop it from consideration; otherwise we
compromise our ability to predict T (it carries unique information). Weak
relevancy: weakly relevant variables contain non-unique information, and
dropping them from consideration will not compromise the ability to
predict T. Wrappers try to solve feature selection by searching in the
space of feature subsets and evaluating each one with a user-specified
classifier and loss function estimator. Filter feature selection looks at
properties of the data (not a classifier). There is no filter or wrapper
that works universally; these must be tailored to the particular loss
function.
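A minimal scikit-learn sketch of the two styles (the estimator, scoring function, and number of features to keep are illustrative choices, not anything prescribed in the talk):

    from sklearn.datasets import make_classification
    from sklearn.feature_selection import (SelectKBest, SequentialFeatureSelector,
                                           f_classif)
    from sklearn.linear_model import LogisticRegression

    X, T = make_classification(n_samples=500, n_features=20, random_state=0)

    # Filter: ranks features by a property of the data alone (here, an
    # ANOVA F-statistic against T); no classifier is consulted.
    filter_sel = SelectKBest(score_func=f_classif, k=5).fit(X, T)

    # Wrapper: searches the space of feature subsets, scoring each candidate
    # with a user-specified classifier and loss (here, greedy forward search
    # with logistic regression and cross-validated accuracy).
    wrapper_sel = SequentialFeatureSelector(
        LogisticRegression(max_iter=1000), n_features_to_select=5,
        direction="forward", cv=5).fit(X, T)

    print("filter keeps:", filter_sel.get_support(indices=True))
    print("wrapper keeps:", wrapper_sel.get_support(indices=True))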
He notes that correlation is not synonymous with causation. He
connects the two via the Markov blanket. The Markov boundary of T, the
set of variables that renders all other variables irrelevant, is the
optimal solution to the feature selection problem. However, the Markov
boundary is rarely known a priori. Computational causal discovery is an
old field; Aliferis describes advances in path analysis, structural
equation modeling, and the work of Pearl, Verma, and others.
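Markov blanket induction algorithms such as IAMB, which Aliferis co-developed, discover the blanket through conditional-independence tests. Below is a toy IAMB-style sketch on synthetic discrete data; the plug-in conditional-mutual-information estimator, the fixed dependence threshold, and the generated network are all simplifying assumptions for illustration, not the talk's actual method:

    from collections import Counter

    import numpy as np

    def cond_mi(data, x, t, z):
        # Plug-in estimate of I(x; t | z) from empirical counts (discrete data).
        n, z = len(data), list(z)
        xtz = Counter(map(tuple, data[:, [x, t] + z]))
        xz = Counter(map(tuple, data[:, [x] + z]))
        tz = Counter(map(tuple, data[:, [t] + z]))
        zn = Counter(map(tuple, data[:, z])) if z else {(): n}
        mi = 0.0
        for (xv, tv, *zv), c in xtz.items():
            zv = tuple(zv)
            mi += (c / n) * np.log(c * zn[zv] / (xz[(xv, *zv)] * tz[(tv, *zv)]))
        return mi

    def iamb(data, target, threshold=0.02):
        mb = []
        candidates = [v for v in range(data.shape[1]) if v != target]
        # Forward phase: greedily admit the variable most dependent on the
        # target given the current blanket, until none passes the threshold.
        while True:
            open_vars = [v for v in candidates if v not in mb]
            if not open_vars:
                break
            scored = {v: cond_mi(data, v, target, mb) for v in open_vars}
            best = max(scored, key=scored.get)
            if scored[best] < threshold:
                break
            mb.append(best)
        # Backward phase: evict false positives that became conditionally
        # independent of the target once the rest of the blanket was admitted.
        for v in list(mb):
            rest = [w for w in mb if w != v]
            if cond_mi(data, v, target, rest) < threshold:
                mb.remove(v)
        return sorted(mb)

    # Synthetic network: A and B are parents of T, C is a child of T with
    # spouse S, and D is irrelevant. The Markov blanket of T is {A, B, C, S}.
    rng = np.random.default_rng(0)
    n = 5000
    A, B, S, D = (rng.integers(0, 2, n) for _ in range(4))
    flip = lambda p: (rng.random(n) < p).astype(int)   # random bit flips
    T = (A | B) ^ flip(0.1)       # noisy OR of its parents
    C = (T | S) ^ flip(0.1)       # child of T, co-parented by spouse S
    data = np.column_stack([A, B, S, C, D, T])
    print("recovered blanket of T:", iamb(data, target=5))  # expect [0, 1, 2, 3]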

Common techniques for feature selection include univariate filtering,
heuristic search, forward-backward stepwise procedures, and PCA.
Aliferis quickly walks through an empirical comparison of over 100
different algorithms. Apart from Markov blanket selection, no single
algorithm dominated across all cases.
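Worth noting is that PCA is the odd one out in that list of common techniques: it constructs new features rather than selecting original variables, so its components are not directly interpretable as, say, individual biomarkers. A minimal sketch (the synthetic data and component count are arbitrary choices):

    from sklearn.datasets import make_classification
    from sklearn.decomposition import PCA

    X, T = make_classification(n_samples=400, n_features=30, random_state=1)

    # PCA reduces dimensionality by building linear combinations of all 30
    # inputs; no original column is dropped or "selected".
    pca = PCA(n_components=10).fit(X)
    X_reduced = pca.transform(X)
    print("10 components capture",
          round(float(pca.explained_variance_ratio_.sum()), 2),
          "of the total variance")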
