What Can Bioinformatics Scientists Learn from Librarians? Using Library and Archival Methodologies t
Updated: Sep 29, 2022
Ene Belleh, Children's Hospital of Philadelphia; Jennae Luecke, Children's Hospital of Philadelphia; Allison Olsen, Children's Hospital of Philadelphia; Julianna Pakstis, Children's Hospital of Philadelphia
At the Children's Hospital of Philadelphia (CHOP), a team of librarians and archivists is developing an innovative biomedical research data archives and discovery platform to preserve and deliver CHOP Omics data. This work is part of Arcus, a strategic initiative of the CHOP Research Institute dedicated to making research data more broadly available within the institution. Arcus' data preservation and reuse work aims to increase collaboration, drive new discoveries, and reduce costs. Arcus is an end to end program that supports management of Omics data throughout its entire lifecycle. This support is accomplished when the Arcus Library Science team ingests, describes, and displays information about these Omics datasets. The data life cycle begins when Archivists, in communication with study teams, organize data and related information into a project template, which creates a consistent yet flexible structure that can organize a wide range of data contributions. In addition to the data, the project template includes protocol files authored by the study team, which describes file creators, filetypes, pipelines, and workflows. Once organized, automated steps can be used to create metadata description. Created files include JSON metadata records that store descriptive information about the data and its analysis in a structured way. The dataTypes set of fields compactly describes this complex data. Currently, the team is partnering with application development staff to build a Data Catalog that will make use of the metadata and project template structure, where researchers can find and request data for use in new research. This new research data will also be contributed back to the archives for future reuse. This poster will showcase visual examples of these methods for preserving and reusing Omics data.