DataStream Publishes Open Data Standard to Support Water Science
February 13, 2020
The publication of an open data standard is enabling valuable freshwater data to be organized, accessed, and shared in a harmonized way. This data standard underpins DataStream, a growing online platform for sharing water data collected by Canada’s diverse water monitoring and research community.
“The publication of this open data standard comes at a time when vast amounts of water quality data are being generated across sectors and jurisdictions,” said Carolyn DuBois, Water Program Director at The Gordon Foundation. “When brought together these datasets can generate powerful new insights into environmental change across distances and timescales that are beyond the scope of any one monitoring initiative alone.”
The open data standard helps advance these efforts by making it easier to share and integrate water data. It provides a common language and consistent structure for organizing data so there is clarity around what is being presented and how. Without structure, data cannot be used to its full potential, including in applications that leverage machine learning.
Our hope is that more and more research initiatives in Canada will adopt this and other relevant data standards.
“As we were building DataStream, we faced a very real challenge – everyone had their own way of organizing and describing their data,” explained DuBois. “We were fortunate to find that the United States Environmental Protection Agency and the United States Geological Survey had developed and rolled out a system to address this challenge for virtually every water quality parameter we encountered. The open data standard that we have published is based on this US data schema with some adaptations based on feedback from our users here in Canada.”
The data standard is incorporated into DataStream’s digital infrastructure, which is currently used by over 80 different monitoring groups to publish their results. DataStream is a growing online platform that leverages cutting edge blockchain technology, providing a reliable home for water data. Free and open for anyone to use, visitors to the site can query, visualize and download this data in a consistent format.
Now that the standard is openly available, software developers anywhere in the world can use it to develop their own freshwater research tools. It also means researchers and others with water data expertise can contribute and improve upon this standard moving forward.
“Our hope is that more and more research initiatives in Canada will adopt this and other relevant data standards rather than re-invent the wheel,” said DuBois. “It’s a practical and important step towards more open, collaborative scientific research practices that can support freshwater stewardship.”
DataStream is an online, open access platform for sharing water data. It is built with communities, policy-makers and researchers in mind, and designed to make it easy for diverse monitoring and research groups to share, visualize, and download data.
DataStream is led by The Gordon Foundation and carried out in collaboration with monitoring networks and regional partners – The Government of the Northwest Territories (DataStream's founding partner), the Atlantic Water Network, and the Lake Winnipeg Foundation – who are instrumental in growing the Mackenzie, Atlantic, and Lake Winnipeg DataStream hubs.
The Gordon Foundation is a charitable organization dedicated to protecting Canada's water and empowering Canada's North. For more information visit www.gordonfoundation.ca.
To access the open data standard, visit The Gordon Foundation’s Github site.
For more information please contact:
Gordon Shallard-Brown, Communications Manager, The Gordon Foundation
firstname.lastname@example.org 416.601.4776 ext. 230
Meghan joined us at the beginning of the year right after finishing her master's degree at the University of Waterloo. Her studies focused on nutrient contamination in the Lake Erie basin. She used long-term data and process-based models to predict past, present, and future nitrogen storage in the surrounding sub-basins of Lake Erie. Meghan will be contributing to the continued development of DataStream by working with data contributors and users across the Great Lakes region and beyond.
In May, the DataStream team gathered in Toronto for the 66th Annual Conference on Great Lakes Research, hosted by the International Association for Great Lakes Research (IAGLR).