Image Searches, Abstraction, Invariance: 36-350: Data Mining 2 September 2009

Image Searches,
Abstraction, Invariance
36-350: Data Mining
2 September 2009
1
• Medical: x-rays, brain imaging, histology (“do
these look like cancerous cells?”)
• Satellite imagery
• Fingerprints
• Finding illustrations for lectures...
2
Searching for Images by
Searching for Text
• Assume there’s text accompanying the images
(“annotation”)
• tags
• Search those text records with the query phrase
• Take images which appear close to the query
phrase on highly-ranked records
• This how Google does it

3
Sometimes
this works
perfectly...
4
...and
sometimes
it doesn’t;
depends on
the text!
5
Searching for images by
representing images
• For text, we only cared about features, and
only worked with feature vectors
• Define numerical features for images and

everything carries over
• Abstraction
6
Abstraction
• Remove some of the details but keep others

• Kept details = features
• Then act on abstracta
• Hopes:
• Simplifies problem
• Lets you treat many problems similarly
7
Abstract level: feature vectors
Similarity Dimensionality
Classification Clustering etc.
matching Reduction
v1 v2 v3 v4 v5 v6
BoW
BoW
BoW
BoW
BoW
BoW
Text 1 Text 2 Text 3 Text 4 Text 5 Text 6
Concrete level: meaningful objects

8
matching Reduction
v1 v2 v3 v4 v5 v6
Topics
Topics
Topics
Topics
Topics
Topics
Text 1 Text 2 Text 3 Text 4 Text 5 Text 6

9
matching Reduction
v1 v2 v3 v4 v5 v6
Bitmap
Bitmap
Bitmap
Bitmap
Bitmap
Bitmap
Pic. 1 Pic. 2 Pic. 3 Pic. 4 Pic. 5 Pic.6

10
matching Reduction
v1 v2 v3 v4 v5 v6
Bag of colors
Bag of colors
Bag of colors
Bag of colors
Bag of colors
Bag of colors
Pic. 1 Pic. 2 Pic. 3 Pic. 4 Pic. 5 Pic.6

11
matching Reduction
v1 v2 v3 v4 v5 v6
Motifs
Motifs
Motifs
Motifs
Motifs
Motifs
Network 1 Network 2 Network 3 Network 4 Network 5 Network 6

12
• Need to find right (relevant) representation
• Representation = concrete/abstract
interface
• Go read The Sciences of the Artificial!
• Great methods at the abstract level
generally fail if the representation is bad
• missing what’s relevant
• including what’s irrelevant
• •comparing apples to kangaroos
both multicellular sexually-reproducing carbon-based lifeforms...
• A lot of your work will be designing

representations
13
matching Reduction
v1 v2 v3 v4 v5 v6
Bag of colors
Bitmap
Topics
Motifs
BoW
BoW
Social
Text 1 Text 2 Text 3 Network
Pic. 1 Pic. 2
14
flower1 flower2 flower3
tiger1 tiger2 tiger3
ocean1 ocean2 ocean3

15
Euclidean Distance of
Images
• Image is MxN pixels, each with 3 color
components, so a 3MN vector
• Euclidean distance possible, and OK for

some kinds of noise-removal
• but hopeless even at grouping flower1 with

flower2
• or slight changes in perspective, lighting...
16
Bag of Colors
• “If it works, try it some more”
• For each possible color, count how many
pixels there are of that color
• Use Euclidean distance on color-count

vectors
• Too many colors, so quantize them down to

a manageable number (like stemming, or
combining synonyms)
17
flower1
flower2
flower3
flower4
flower5
flower6
flower7 Multidimensional scaling
flower ocean tiger
flower8
ocean5 ocean6
1.0
flower9
tiger1 ocean1
tiger2 flower4 flower9 ocean4 ocean3
tiger3
0.5
tiger4 ocean7
tiger5
flower7
tiger6 flower3 tiger6 ocean2
V2
0.0
tiger7 flower2
tiger8 flower6 tiger4
flower8
tiger9 tiger2
ocean1 tiger1
−0.5
flower1
ocean2 tiger7
tiger8
ocean3 tiger9
ocean4 flower5
tiger5
−1.0
ocean5 tiger3
ocean6
ocean7 −1.0 −0.5 0.0 0.5 1.0
flower1
flower2
flower3
flower4
flower5
flower6
flower7
flower8
flower9
tiger1
tiger2
tiger3
tiger4
tiger5
tiger6
tiger7
tiger8
tiger9
ocean1
ocean2
ocean3
ocean4
ocean5
ocean6
ocean7
V1
Distances between images MDS plot of images

18
Representation and
Invariance
• Invariances of a representation = how can we

change the underlying object without changing
the representation?
• What differences does the representation ignore?
19
Invariants of bags of words
• Punctuation and word order
• Universal words (exact count of “the”, “of”,
“to”, ...), if using inverse document frequency
• Word-endings, if using stemming

• Grammar, context, word proximity ...
• “Send lawyers, guns and money” vs.
“Sending the Guns’ lawyers for the
money”
20
Invariants of bags of
colors
• Small changes in orientation, pose, some rotations

• Small amounts of color noise or weird colors
• Texture
21
Same color counts, different textures
22
Non-invariants
• Lighting, shadows
• Occlusion, 3D effects
• Blurring
• There are good ways to deal with blur
(from astronomy)
• but full vision is very, very hard
23
• Breaking an invariance is easy
• e.g., add features for textures
• or sub-divide the image and do color-
counts on each part
• Adding invariances is hard

• often need to go back to scratch and
chose a different representation
24
Similarity
search
with real
images
from the
web
(“retrievr”,
see notes)
25
26
• Typically works better with more restricted
domains (actually pretty good for medical
images)
27

Image Searches, Abstraction, Invariance: 36-350: Data Mining 2 September 2009

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Image Searches, Abstraction, Invariance: 36-350: Data Mining 2 September 2009

Uploaded by

Copyright:

Available Formats

Image Searches,

• This how Google does it

• Define numerical features for images and

• Remove some of the details but keep others

Concrete level: meaningful objects

Concrete level: meaningful objects

Concrete level: meaningful objects

Concrete level: meaningful objects

Concrete level: meaningful objects

• A lot of your work will be designing

tiger1 tiger2 tiger3

ocean1 ocean2 ocean3

• Euclidean distance possible, and OK for

• but hopeless even at grouping flower1 with

• or slight changes in perspective, lighting...

• Use Euclidean distance on color-count

• Too many colors, so quantize them down to

Distances between images MDS plot of images

• Invariances of a representation = how can we

• What differences does the representation ignore?

• Word-endings, if using stemming

• Small changes in orientation, pose, some rotations

• but full vision is very, very hard

• Adding invariances is hard

You might also like