Mallikarjun Hangarge
Document Image: Why?
• Paperless Solution
– Efficient transfer
– Organization
– Convenience
• Access to a variety of content
– Universal reader – email, attachments, spreadsheets
– Don’t need original applications
How Do We Acquire Document Images?
• Scanner
• Camera
• Smartphones
Where Do We Find Them?
Everywhere
What Can We Do with Them?
• Can we access them?
– Search
– Browse
– “Read”
• Can we index and retrieve them?
In their basic form, not really!
• We can
– View
– Print
– Not much else
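The limitation above can be made concrete with a minimal sketch. It assumes a hypothetical 3x5 bitmap font (the glyph shapes, `render`, and `to_bytes` are illustrative names, not part of any scanner or library) and shows that once text is rasterized, only pixel intensities remain, so a plain text search over the image data finds nothing.

```python
# Hypothetical 3x5 bitmap font; the glyph shapes are illustrative only.
FONT = {
    "H": ["101", "101", "111", "101", "101"],
    "I": ["111", "010", "010", "010", "111"],
}

def render(text):
    """Rasterize text into rows of 0/1 pixels, one blank column between glyphs."""
    rows = []
    for r in range(5):
        rows.append("0".join(FONT[ch][r] for ch in text))
    return [[int(p) for p in row] for row in rows]

def to_bytes(image):
    """Flatten the pixel grid into raw bytes (0 = background, 255 = ink)."""
    return bytes(255 * p for row in image for p in row)

image = render("HI")
raw = to_bytes(image)

# The character codes are gone: the data holds only pixel intensities.
found = b"HI" in raw

# The image can still be "viewed" (here, printed as ASCII art) ...
for row in image:
    print("".join("#" if p else "." for p in row))
# ... but searching it for the text it depicts fails.
print("search for 'HI' in pixel data:", found)
```

A person reading the printed grid sees the word immediately, while the search reports `False`. Making the image searchable requires an additional recognition step that maps pixels back to characters.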
Why?
1. Image ID
2. Structure
3. Decomposition
4. Handwriting
5. Stamps/Logos
6. Zone Classification
[Figure: document image processing overview. Labels: Query, Documents, Layout Similarity, Ranked Results, Genre Classification, Class Results, Images w/ Text, Images w/o Text, Page Decomposition, Zone Segmentation, Zone Labeling, Hand/Machine, Signature Detection, Noise]