
The CUAHSI HIS and Water Data Center: Providing the Shale Network with Data Services for Research Collaboration
www.cuahsi.org wdc.cuahsi.org
What is CUAHSI HIS?
The CUAHSI Hydrologic Information System (HIS) is an internet-based system for sharing hydrologic data. It consists of three components: a client (HydroDesktop), a data server stack (HydroServer), and a central metadata registry (HydroCatalog). These components use WaterML, an XML-based language, to transmit metadata and data. WaterML has recently been accepted as an international standard for time-series data by the Open Geospatial Consortium. HIS uses a Service-Oriented Architecture (SOA) like those mandated by US government agencies for distribution of government-collected data. The SOA provides an environment similar to search engines like Google, but specifically for water data sources.
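As a rough illustration of how an XML format can carry both metadata and data in one response, the sketch below parses a small, made-up document with Python's standard library. The element and attribute names, site code, and values are assumptions invented for this example; they are loosely in the spirit of WaterML but do not reproduce the official schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified XML in the spirit of WaterML. Element names,
# attribute names, and all values are illustrative assumptions, not the
# official WaterML schema or real measurements.
SAMPLE_RESPONSE = """
<timeSeriesResponse>
  <timeSeries>
    <site code="EXAMPLE:SITE01" name="Example Creek"
          latitude="41.20" longitude="-77.10"/>
    <variable code="EXAMPLE:SpCond" name="Specific conductance"
              units="uS/cm"/>
    <values>
      <value dateTime="2012-06-01T12:00:00">215.0</value>
      <value dateTime="2012-06-02T12:00:00">221.5</value>
      <value dateTime="2012-06-03T12:00:00">218.3</value>
    </values>
  </timeSeries>
</timeSeriesResponse>
"""


def parse_time_series(xml_text):
    """Return site metadata, variable metadata, and (time, value) pairs."""
    root = ET.fromstring(xml_text.strip())
    series = root.find("timeSeries")
    site = dict(series.find("site").attrib)
    variable = dict(series.find("variable").attrib)
    values = [
        (node.get("dateTime"), float(node.text))
        for node in series.find("values").findall("value")
    ]
    return {"site": site, "variable": variable, "values": values}


result = parse_time_series(SAMPLE_RESPONSE)
print(result["site"]["name"], result["variable"]["units"], len(result["values"]))
```

Because the metadata (site, variable, units) travels with the values, a client can interpret the numbers without consulting the server a second time.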

Jon Pollak, Alva Couch, Jennifer Arrigo


The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI)
WDC Mission
Providing Production Quality Water Data Resources
The mission of the WDC is to empower scientists to discover, use, store, and share water data by:
providing simple and effective data discovery tools useful to researchers and educators in a variety of water-related disciplines;
providing simple and cost-effective data publication mechanisms for projects that do not desire to run their own data servers, and providing long-term archiving of university research data;
providing educational and outreach resources that focus on data-driven and place-based learning;
working with government data providers and decision-makers to develop and support data standards that make more water data more easily accessible to the water research and education community;
developing alternative data discovery interfaces, such as web-based search clients and mobile applications, that enhance the accessibility of water data to diverse audiences.

Using CUAHSI HIS: The HydroDesktop Client


A service-oriented architecture enables different data access clients to be customized for specific purposes. Currently, the main data access client is HydroDesktop, a Windows program that combines open-source GIS software with a data discovery client that searches the HIS Central catalog.

HydroDesktop can search for data by:
Where? Geography
When? Time of Measurement
What? Property Measured
Who? Data Source

GIS coverages included with the HydroDesktop download:
Political Boundaries (Country, U.S. State, U.S. County)
U.S. HUC8
Ability to delineate watersheds using EPA web services
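The four search dimensions (where, when, what, who) can be thought of as independent filters applied to catalog records. The sketch below is a minimal illustration of that idea only; the `SeriesRecord` type, the `search` function, and the catalog entries are hypothetical and do not represent HydroDesktop's code or the HIS Central API.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class SeriesRecord:
    """Hypothetical catalog entry describing one time series."""
    source: str        # Who?   data source
    variable: str      # What?  property measured
    latitude: float    # Where? site location
    longitude: float
    start: date        # When?  period of record
    end: date


# Made-up records for illustration only.
CATALOG = [
    SeriesRecord("Agency A", "Streamflow", 41.0, -77.5,
                 date(2005, 1, 1), date(2012, 12, 31)),
    SeriesRecord("University B", "Specific conductance", 41.3, -76.9,
                 date(2011, 4, 1), date(2012, 9, 30)),
    SeriesRecord("Volunteer Group C", "Specific conductance", 39.9, -80.1,
                 date(2010, 6, 1), date(2010, 8, 31)),
]


def search(catalog, bbox=None, period=None, variable=None, source=None):
    """Filter records; bbox is (west, south, east, north), period is (start, end)."""
    results = []
    for rec in catalog:
        if bbox is not None:
            west, south, east, north = bbox
            inside = west <= rec.longitude <= east and south <= rec.latitude <= north
            if not inside:
                continue
        if period is not None:
            begin, finish = period
            # Keep records whose period of record overlaps the requested period.
            if rec.end < begin or rec.start > finish:
                continue
        if variable is not None and rec.variable != variable:
            continue
        if source is not None and rec.source != source:
            continue
        results.append(rec)
    return results


hits = search(
    CATALOG,
    bbox=(-78.0, 40.5, -76.0, 42.0),
    period=(date(2012, 1, 1), date(2012, 12, 31)),
    variable="Specific conductance",
)
for rec in hits:
    print(rec.source, rec.variable)
```

Each filter is optional, so a user can start with geography alone and add time, variable, or source constraints as needed.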

Providing Data Services for the Shale Network


HydroServer is a software suite that publishes ODM data. The CUAHSI Water Data Center team hosts an instance of this software on our server, HydroPortal, for the Shale Network. New data is regularly catalogued in the HIS Central data catalog to enable data discovery.
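ODM is a relational format, so data values are stored together with the site and variable metadata needed to interpret them. The sketch below uses SQLite to show the general idea. The table and column names are loosely modeled on ODM but heavily simplified (for example, units are stored inline here), and the rows are invented; this is an assumption-laden illustration, not the full ODM schema or the Shale Network database.

```python
import sqlite3

# In-memory database with a heavily simplified, ODM-like layout.
# Table and column names are illustrative assumptions.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Sites (
    SiteID    INTEGER PRIMARY KEY,
    SiteCode  TEXT UNIQUE NOT NULL,
    SiteName  TEXT,
    Latitude  REAL,
    Longitude REAL
);
CREATE TABLE Variables (
    VariableID   INTEGER PRIMARY KEY,
    VariableCode TEXT UNIQUE NOT NULL,
    VariableName TEXT,
    Units        TEXT
);
CREATE TABLE DataValues (
    ValueID       INTEGER PRIMARY KEY,
    SiteID        INTEGER NOT NULL REFERENCES Sites(SiteID),
    VariableID    INTEGER NOT NULL REFERENCES Variables(VariableID),
    LocalDateTime TEXT NOT NULL,
    DataValue     REAL NOT NULL
);
""")

# Made-up example rows.
conn.execute("INSERT INTO Sites VALUES (1, 'SITE01', 'Example Creek', 41.2, -77.1)")
conn.execute("INSERT INTO Variables VALUES (1, 'SpCond', 'Specific conductance', 'uS/cm')")
conn.executemany(
    "INSERT INTO DataValues (SiteID, VariableID, LocalDateTime, DataValue) "
    "VALUES (?, ?, ?, ?)",
    [
        (1, 1, "2012-06-01T12:00:00", 215.0),
        (1, 1, "2012-06-02T12:00:00", 221.5),
    ],
)

# Join values back to their metadata so each number is self-describing.
rows = conn.execute("""
    SELECT s.SiteName, v.VariableName, v.Units, d.LocalDateTime, d.DataValue
    FROM DataValues AS d
    JOIN Sites AS s ON s.SiteID = d.SiteID
    JOIN Variables AS v ON v.VariableID = d.VariableID
    ORDER BY d.LocalDateTime
""").fetchall()

for row in rows:
    print(row)
```

Keeping sites and variables in their own tables means the metadata is entered once and every data value refers to it by key.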

[Diagram: How the Web Works. Catalog (Google); Web Server (CNN.com); Browser (Firefox); HTML: web language for text and pictures.]
[Diagram: How HIS Works.]
[Figure labels: GIS Layers; Map Interface; Data Values; Access]

Figure 1: A prototype faceted-search web client that lets users refine search results by filtering on one metadata facet at a time. Using almost all metadata fields as search filters greatly speeds data discovery for specific geographic regions and quickly indicates the extent of the data of interest.
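Faceted search of this kind can be sketched as two small operations: counting the values available for a facet, and narrowing the result set by one facet value at a time. The code below is a minimal illustration under that assumption; the field names and records are hypothetical and are not taken from the prototype client.

```python
from collections import Counter

# Made-up metadata records; field names are illustrative assumptions.
RECORDS = [
    {"source": "Agency A", "variable": "Streamflow", "region": "North"},
    {"source": "Agency A", "variable": "Specific conductance", "region": "North"},
    {"source": "University B", "variable": "Specific conductance", "region": "North"},
    {"source": "Volunteer Group C", "variable": "pH", "region": "South"},
]


def facet_counts(records, field):
    """Count how many records carry each value of one metadata field."""
    return Counter(rec[field] for rec in records)


def refine(records, field, value):
    """Keep only the records whose field equals the chosen facet value."""
    return [rec for rec in records if rec[field] == value]


# Refine one facet at a time, inspecting the remaining options after each step.
step1 = refine(RECORDS, "variable", "Specific conductance")
remaining_sources = facet_counts(step1, "source")
step2 = refine(step1, "source", "University B")

print(dict(remaining_sources))
print(step2)
```

Showing the counts after each refinement is what lets a user see the extent of matching data before committing to a further filter.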

Achieving the Mission

Providing Data Services for the Shale Network
The Shale Network is developing a database of Pennsylvania waters in the gas production region, both to bring together this network of researchers and citizen scientists and to turn water quantity and quality data into knowledge.

HydroCatalog

The Shale Network data is registered in the CUAHSI HIS Central Catalog; data published by the Shale Network is discoverable and accessible in clients like HydroDesktop alongside over 100 other water data sources, including government agencies, academic researchers, and citizen scientist groups.

Do You Have Data to Publish?

Contribute to the HIS Catalog:


Do you have water quality or quantity data in the Marcellus Shale region?
Consider publishing your data with the Shale Network

[Diagram labels, How HIS Works: HydroServer; Data access; HydroDesktop]

The WDC will pursue its mission at the nexus of numerous stakeholders including water researchers, other data centers, sensor vendors, government data providers, and policymakers.

[Diagram label, How HIS Works: WaterML: web language for water data]

Using CUAHSI HIS: The HIS Central Metadata Catalog


Access over 100 hydrologic data sources, including universities and state, provincial, and federal agencies, with one catalog!

Data Sources Available
Year      Data sources
2008      28
2009      39
2010      56
2011      77
2012      97
2013 Q1   103

The CUAHSI Water Data Center


CUAHSI HIS has seen increasing adoption by the water science community over time, with 23 new services registered by data providers in 2012, and interest and cooperation from government agencies in making their data available through the system. With the development phase ended, CUAHSI's role as a community organization is to maintain and operate this system over the long term through the Water Data Center.

The Shale Network team is aggregating water quality and quantity data from many different sources into a single database. The primary data sources include, but are not limited to, those in the diagram at left.

Do you have data you wish to publish independently?


Set up your own HydroServer and register your data with the HIS Catalog

Want to publish data but can't maintain a server?


Contact CUAHSI User Support Specialist Jon Pollak (jpollak@cuahsi.org) to inquire about other data publication options

Visit wdc.cuahsi.org for more information!

What is the Water Data Center?


The Water Data Center (WDC) will be a virtual deployment of CUAHSI HIS to the Microsoft Azure cloud and will include support personnel for software development, data curation, and user support. A five-year proposal has been funded by NSF with a start date of April 1, 2013.

[Chart: Time Series Downloaded, by month from Jan-11 to Mar-13; vertical axis from 0 to 2,000,000]

Data is stored in a format known as the Observations Data Model (ODM). This relational database allows data and metadata to be stored, retrieved, and unambiguously interpreted.

Acknowledgements
This work has been supported by National Science Foundation grants EAR 07-53921 to CUAHSI and EAR 06-22374 to the University of Texas. CUAHSI HIS has been developed by a large team of colleagues from the University of Texas, the San Diego Supercomputer Center, Utah State University, Idaho State University, Drexel University, City College of New York, and the University of South Carolina. The CUAHSI Water Data Center is a facility funded by NSF grant number 1248152.
