Actions Panel
Forecasting Costs for Preserving and Promoting Access to Biomedical Data: Meeting 3 In Person(Invite Only)
Date and time
Location
Keck Center, Room 206
500 Fifth St. NW Washington, D.C 20001Description
Committee Meeting #3: Forecasting Costs for Preserving, Archiving, and Promoting Access to Biomedical Data (In Person, invite only)
The National Academies of Sciences, Engineering, and Medicine invites you to participate in a consensus study committee meeting on May 6, 2019 from 10am-1:40pm ET. Over the course of the study, the committee will develop and demonstrate a framework for forecasting long-term costs for preserving, archiving, and accessing various types of biomedical data and estimating potential future benefits to research.
Download the Agenda and Meeting Materials
Learn more on the study webpage.
This study is sponsored by the U.S. National Library of Medicine of the National Institutes of Health.
This study is organized by the National Academies' Board on Mathematical Sciences and Analytics. Learn more about BMSA and sign up for our mailing list at nas.edu/bmsa.
If you are unable to attend in person, we encourage you to register as a remote participant.
Open session remote login: https://nasem.zoom.us/j/471850063
10:00 Welcome, introductions, and statement of meeting objectives
David Chu, Committee Chair
10:05 Disruptors in Digital Archiving: Presentation from the U.S. National Archives
Leslie Johnston, Director of Digital Preservation, U.S. National Archives
Prompting Questions:
- What models do you use to budget for data preservation?
- How do you factor in unexpected cost or budget allocation fluctuations related to data preservation?
- What disruptors have affected appraisal/reappraisal and redaction decisions, how?
- How have those disruptions affected decisions regarding preservation of existing data? Planning for future data?
- If you employ a cloud-based strategy, what happens if a cloud vendor’s services are no longer available?
- How do you think about format obsolescence?
11:05 Disruptors in the Cloud
Vamshidhar Kommineni, Principal Project Manager, Azure Blob Storage, Microsoft
Prompting questions:
- What changes in technologies, data volumes & types, and data uses might appear in the next 5-10-25 years that would be disruptive to cost models and risk assessment for data preservation, archiving and access?
- How do you forecast total cost of ownership of a cloud-based archive based archive over a 5-year life span? Over 10 years?
- What specific steps does your organization take to prepare for any of these eventualities?
12:05 Lunch—available for purchase in the refectory
1:00 Indicators of data management costs at CERN
Simone Campana, Deputy Project Leader of the Worldwide Computing Grid
Prompting questions:
- How does CERN determine what the lifespan of data saved?
- CERN has long time lines and the data generating rate is reported to be 25 PetaB/year. How does CERN plan for storage costs? What is CERN's idea of a planning tool?
- Zenodo - a general-purpose open-access repository - is run "as a marginal activity" - What does that imply for cost forecasting (e.g., how can CERN assume that it remains marginal)?
- How has the archival infrastructure evolved at CERN? How do they expect it to evolve? How open is CERN about its forecasting assumptions?
1:40 Open session adjourns