A Framework for a Standard-Enabled FAIR Data Management Workflow for Synthetic Biology. Journal Article uri icon

Overview

abstract

  • Synthetic biology laboratories generate diverse forms of data and metadata throughout a project's life cycle, such as sequences, models, protocols, images, and time-series measurements. Unfortunately, these assets are scattered across spreadsheets, proprietary exports, custom scripts, etc. found in varied locations such as shared drives. Inconsistent metadata and data formats hinder provenance, reuse, security, compliance, automation, and scale-up. The central gap is a coherent way to link data, metadata, and code so they remain findable, accessible, interoperable, and reusable (FAIR). This perspective considers current practices through semistructured interviews with synthetic biology researchers in laboratories across the United States, and the findings were used to provide guidance to create a framework for an integrated data management workflow. This framework maps common data types to community standards that allow machine-accessible metadata, version control, and standards-compliant repositories. This perspective also offers a catalog of potential software solutions and stepwise adoption guidelines that turn the proposed framework into a day-to-day practice, democratizing the generation of standardized data. The result is that users gain a template that raises data to FAIR status, strengthens traceability for regulatory or defense contexts, and provides a stronger foundation for training machine learning models.

publication date

  • January 7, 2026

Date in CU Experts

  • January 7, 2026 11:52 AM

Full Author List

  • Vitalis C; Vidal G; Samineni SP; Fontanarrosa P; Myers CJ

author count

  • 5

Other Profiles

Electronic International Standard Serial Number (EISSN)

  • 2161-5063