Data Science For Molecular Genetics And Communication In The Natural Sciences
| dc.contributor.advisor | Kantar, Michael B. | |
| dc.contributor.author | Bartlett, Bjarne | |
| dc.contributor.department | Molecular Biosciences and Bioengineering | |
| dc.date.accessioned | 2022-10-19T22:36:15Z | |
| dc.date.available | 2022-10-19T22:36:15Z | |
| dc.date.issued | 2022 | |
| dc.description.abstract | “By 2025, it’s estimated that 463 exabytes of data will be created each day globally – that’s the equivalent of 212,765,957 DVDs per day!” -World Economic Forum Data science refers to the study of increasingly large and complex datasets. Data that are too large for standard tools (e.g., Excel, Google Sheets) to analyze are often referred to as “big data.” While big data exists across many areas and is thought to be the path to answering many questions, there is still no consensus on the fundamental principles and skills needed to interact with big data. Further, skills to study big data are not universally taught systematically at the college level–the resulting gap in skills leaves students unable to analyze the same big data that are touted as the way to answer complex questions. This dissertation proposes a plan to close the big data knowledge gap by incorporating data science principles from diverse disciplines into a biology curriculum. Specifically, essential information was distilled from three independent study systems in cancer diagnostics, plant genomics, and academic publishing. Each study system contributed a different perspective on skills and knowledge from analyzing big data. From these systems, I identified three critical areas that are central to using big data effectively. From these diverse perspectives, I developed a model to assist instructors in constructing curricula that will work in many different biological contexts. I piloted the use of these principles in a summer course. I found that by incorporating instruction developed across knowledge areas, meaningful data science instruction can occur in any curriculum at any student level. | |
| dc.description.degree | Ph.D. | |
| dc.identifier.uri | https://hdl.handle.net/10125/103919 | |
| dc.language | eng | |
| dc.publisher | University of Hawaii at Manoa | |
| dc.subject | Bioinformatics | |
| dc.title | Data Science For Molecular Genetics And Communication In The Natural Sciences | |
| dc.type | Thesis | |
| dc.type.dcmi | Text | |
| local.identifier.alturi | http://dissertations.umi.com/hawii:11493 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Bartlett_hawii_0085A_11493.pdf
- Size:
- 7.21 MB
- Format:
- Adobe Portable Document Format
