Professional Documents
Culture Documents
Code
Github:
https://github.com/wthoolihan/MLUC-2012-Examples
Search Intelligence
Search Intelligence
Search Intelligence
Get the most out of our XML in search
Approach 1: GUI
Search Intelligence
Get the most out of our XML in search
Approach 1: GUI
Search Intelligence
Get the most out of our XML in search
Approach 2: Syntax
Search Intelligence
Get the most out of our XML in search
Approach 2: Syntax
Search Intelligence
Get the most out of our XML in search
Approach 3: Facets
Search Intelligence
Get the most out of our XML in search
Approach 3: Facets, constraints, filters
Search Intelligence
Get the most out of our XML in search
Infer (Search Intelligence)
Enrich
Translate into more complex query Gain speed, accuracy
Custom constraints Custom grammar Custom snippets Custom term handling Search operators
Constraint:
Parser:
let $custom-parser-output := my:parse($qtext) search:resolve( $custom-parser-output, $options )
Execute search:
2nd half of search:search() accepts cts:query XML as input
search:parse() Strategy
1. Call search:parse() 2. Analyze and enrich the query XML 3. Call search:resolve()
Problem
Users will enter citation numbers, abbreviations, etc. expecting complete results Text editorial content follows different conventions Detect special cases pre-search and enrich query
Solution
do-stuff-here($q)
search:parse() Strategy
Weakness
Limited to single word token
Similar to custom term handling
Query:
Same as before: "1996 United States Olypmics" Start with the search:parse()output
Output:
Output:
Combining Examples
local:thsr-expand-runs($runs, $q-thsr) /local:resolve-runs($expanded)/local:detect-year($runs)
Takeaway
1. No added GUI 2. Didn't ask the user for additional input 3. Able to build more robust query before executing search
Detect entities
Transform text into element-based query Less false positives and exclusions Leverage indexes:
"New York Times"
Hazards
Chaos
Daisy chained transformations can have unintended consequences
Performance
Pre-search transformations need to be fast make sure to leverage indexes as much as possible Larger queries do take longer
Questions