Using an Open Source Python Toolbox (Signac) to Manage High Dimensional Research Data
Date
2023-04-14
Authors
Contributor
Advisor
Department
Instructor
Depositor
Speaker
Researcher
Consultant
Interviewer
Narrator
Transcriber
Annotator
Journal Title
Journal ISSN
Volume Title
Publisher
Volume
Number/Issue
Starting Page
Ending Page
Alternative Title
Abstract
Many research fields have entered the age of Big Data. For some researchers, big data means computationally generating large datasets with high dimensional parameter sweeps; for others, big data means generating terabytes of experimental data with many different types of metadata based on experimental conditions. Recording and storing these data in an organized way for future analysis can be challenging, as many ad hoc solutions might help the exact current situation but hurt one's progress later on. Having battled these challenges, I want to share my experience working with an open-source data management system based on Python called Signac. Signac was first developed in the Glotzer Group at the University of Michigan, where I was a graduate student, to help manage different kinds of molecular dynamics simulations, but later extended to support many different kinds of data. In this talk, I want to briefly talk about the design philosophies of Signac and give a quick demonstration of how one could use Signac to help with their research based on my personal experiences.
Description
2023 Symposium for Caring for Data in Hawaiʻi Presentation
Keywords
data management
Citation
Extent
20 minutes
Format
Video
Geographic Location
Time Period
Related To
Related To (URI)
Table of Contents
Rights
http://rightsstatements.org/vocab/InC/1.0/
Rights Holder
Local Contexts
Collections
Email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.