Tag Archives: spatial data infrastructures

Preservation of Geospatial Data Primer

This document (French and English) is the last in a series that I wrote while in Canada on the preservation of geospatial data and I just received the finals today.  Fitting, since I have now been in Ireland for exactly one year today.  The past is however always part of the present and the future is it not?  In my view, the preservation of data should be part of any spatial data infrastructure and open data strategy.  It is simply part of the lifecycle management of a nations knowledge resources.  Data are modern artifacts as important as manuscripts, films or paintings.  If we invest so much in their capture, then we should also invest in their long term maintenance.

This primer is part of a series of Operational Policy documents developed by GeoConnections intended to inform Canadian Geospatial Data Infrastructure (CGDI) stakeholders about the nature and scope of digital geospatial data archiving and preservation and the realities, challenges and good practices of related operational policies.  GeoConnections produces a number of excellent documents on a wide range of contemporary data topics such as VGI, managing sensitive environmental data, data licences, data access, best practices for sharing data, open source, and a host of many others that are very relevant to governments world wide.

This primer starts by examining preservation responsibilities, legislation, acts, directives and policies.  3 preservation frameworks were also discussed:

  1. the Reference Model for an Open Archival Information System (OAIS) (CCSDS, 2012), developed by the Consultative Committee for Space Data Systems (CCSDS);
  2. the European Long Term Preservation of Earth Observation Space Data: European LTDP Common Guidelines (LTDP Working Group, 2012), developed by the Long Term Data Preservation (LTDP) Working Group of the Ground Segment Coordination Body (GSCB); and
  3. the Trustworthy Repositories Audit & Certification (TRAC) Audit and Certification: Criteria and Checklist (OCLC and CRL, 2007), developed by the Center for Research Libraries and the Online Computer Library Center, Inc.

The stucture of the document loosely follows the The International Research on Permanent Authentic Records in Electronic Systems (InterPARES) 2 record creator and preserver guidelines.  The work is grounded in the stufy of four cases were and includes challenges and best practices :

  1. The Canada centre for Remote Sensing (CCRS) Earth Observation Data Management System (EODMS)
  2. Land Information Ontario (LIO) Geographic Information Archive (GIA)
  3. Department of Fisheries and Oceans Canada (DFO), the Integrated Science Data Management Service (ISDM)
  4. International Polar Year (IPY) Data Preservation

Finally, the section on Establishing a Geospatial Data Preservation System guides data creators and preservers through a series of processes based on the frameworks, case studies, and guidelines.

GeoConnections has been studying the preservation and archiving of geospatial data since 2005. The following are the three reports in this series.

  1. Archiving, management and preservation of geospatial data summary report and recommendations (2005)
  2. Geospatial Data Archiving and Preservation – Research and Recommendations Executive Summary. (2011), Tracey P. Lauriault and Ed Kennedy, Hickling Arthurs and Low (HAL) NOTE – if you email me or GeoConnections, we can send you the full document.
  3. Geospatial Data Preservation Primer GeoConnections (2013) Tracey P. Lauriault, Ed Kennedy, with digital preservation subject matter expertise from Yvette Hackett, Library and Archives Canada Retired, reviewed by Marcel Fortin, Geographic Information Systems (GIS) & Map Librarian. Map and Data Library, University of Toronto. Hickling Arthurs and Low (HAL)

These documents are not for the faint at heart, but they inform practioners in all sectors, they are governmentality in action and are the datasets upon which critical data studies take shape.

 

New paper: Small data, data infrastructures and big data

The first Programmable City Working Paper has been published on SSRN, written by Rob Kitchin and Tracey P. Lauriault, and concerns the relationship between small and big data, the scaling-up of small data into data infrastructures, and how to conceptualize and make sense of such infrastructures.

Small data, data infrastructures and big data

Abstract
The production of academic knowledge has progressed for the past few centuries using small data studies characterized by sampled data generated to answer specific questions.  It is a strategy that has been remarkably successful, enabling the sciences, social sciences and humanities to advance in leaps and bounds.  This approach is presently being challenged by the development of big data.  Small data studies will, however, continue to be important in the future because of their utility in answering targeted queries.  Nevertheless, small data are being made more big data-like through the development of new data infrastructures that pool, scale and link small data in order to create larger datasets, encourage sharing and re-use, and open them up to combination with big data and analysis using big data analytics.  This paper examines the logic and value of small data studies, their relationship to emerging big data and data science, and the implications of scaling small data into data infrastructures, with a focus on spatial data examples.  The final section provides a framework for conceptualizing and making sense of data and data infrastructures.

Key words: big data, small data, data infrastructures, data politics, spatial data infrastructures, cyber-infrastructures, epistemology

Download the paper