Pilot Data Management Study - DM4(B)T: Data Management for (Build)TEDDI(NET) using Semantic Technologies

Project: Research council

Description

CONTEXT EPSRC funded 22 projects over two calls in 2010 and 2012 to investigate `Transforming Energy Demand through Digital Innovation' (TEDDI) as a means to find how and how people use energy in homes and what can be done reduce energy consumption. As a result a lot of data is being collected at different levels of detail in a variety of housing up and down the UK, but the mode, detail and quantity are largely defined by the needs of each individual project. At the same time, the research councils (RCUK) are defining guidelines for what happens to data generated by projects they fund, for which universities are then defining policies and finally researchers are then taking concrete actions to store, preserve and document data for future reference. The problem at this current time is that there is relatively little awareness, limited experience and only emerging practice of how to incorporate data management into much of (physical) science research. This is in stark contrast to established procedures for data formats and sharing in the biosciences, stemming from international collaboration on the Human Genome Project, and in the social sciences, where data from national surveys, including census data, have been centrally archived for many years. Consequently, current solutions adopted by (Build)TEDDI projects may be able to meet a minimal interpretation of the requirements, but not effectively deliver the desired data legacy, such as (for example) the means to execute trans-project queries, or being able to cite the results of such queries for the sake of reproducibility. AIMS AND OBJECTIVES The challenges described above, which we address in DM4(B)T in the microcosm of the TEDDI projects, are tackled in three ways: 1. Raising awareness with those who are responsible for data management (principal investigators), 2. Developing a framework to guide the process of making the choices for how to go about implementing data management and 3. Demonstrating example tools that will enable researchers to bring together and re-analyse data from different projects more easily, which together will help researchers (i) to satisfy funding and institutional guidelines for data management, (ii) begin the process of forming a data management culture in science research and (iii) create a substantial case study in science data management which can inform the three primary stakeholders (researchers, institutions and research councils) across a range of issues (see Recommendations below). Key activities and outputs: 1. Workshops (i) to gather information about current practice, (ii) present data management problems and outline analysis and solutions and (iii) to disseminate knowledge of tools and (new) practices to support effective data management. 2. Tools and techniques: to allow researchers to harness both the variety and volume of data being collected specifically within the (Build)TEDDI projects. The tools will be made available open-source for access by other researchers to expand and adapt. 3. Recommendations: these will take the form of an online report to identify routes to facilitate a sustainable data legacy (management, curation and citation) for projects in the science and engineering domain. APPLICATIONS AND BENEFITS 1. (Build)TEDDI projects will benefit directly from the above activities and outputs to meet institutional and research council requirements. 2. Other researchers will benefit from being able to access (Build)TEDDI data. 3. The outputs will benefit the wider research community in science and engineering through the provision of an easy-to-adopt (and adapt) data management methodology.
StatusFinished
Effective start/end date1/03/1631/08/17

Fingerprint

Information management
Innovation
Semantics
Social sciences
Energy utilization
Genes

Keywords

  • Energy
  • Energy Efficiency
  • Library and information studies
  • Information and Knowledge Management