Agenda

Slides for the presentations are available in an Open Science Framework project.

Videos for the sessions are linked below in the Agenda.

Wednesday June 17 - Plenary Sessions

Video: https://www.youtube.com/watch?v=ivpI5bmHSmY&feature=youtu.be

All times Eastern Daylight Time (EDT) (UTC−04:00)
 
9:45am - 10:00am
Coffee via Zoom
 
10:00am - 10:10am
Welcome
Mercè Crosas (Chief Data Science and Technology Officer, IQSS)
 
10:10am – 10:30am
Global COVID-19 Data Sharing
Short presentations of some of the popular COVID-19 datasets shared publicly using Dataverse.
European COVID-19 Datasets and CoronaWhy Community - Vyacheslav Tykhonov (Senior Data Scientist, DANS-KNAW)
China Data Lab - Resources for COVID-19 - Wendy Guan (Executive Director, Center for Geographic Analysis, Harvard University)
COVID-19 data from Population Council - Thoai Ngo (Director, Poverty, Gender, and Youth Program, Population Council)
 
10:30am - 11:30am
Panel on Improving and Standardizing COVID-19 Data Sharing
This panel will include recommendations on sharing COVID-19 data from three perspectives: from the Research Data Alliance community, from the National Institutes of Health, and from researchers who work on understanding the effectiveness of interventions.
Hilary Hanahoe (Secretary General, Research Data Alliance): Recommendations from RDA working group
Susan Gregurick (Director of the Office of Data Science Strategy, NIH): Recommendations from NIH
Caroline Buckee and Satchit Balsari (Harvard T.H. Chan School of Public Health): Recommendations and needs from researchers
 
11:30am – 11:45am
Break
 
11:45am – 12:15pm
Dataverse New Features and Future Plans
 
12:15pm - 1:15pm
Lightning Video Presentations from Dataverse Repositories Worldwide
 
1:15pm - 1:30pm
#Dataverse2020: What's Ahead on Thursday and Friday

Thursday June 18 - Breakout Sessions

All times Eastern Daylight Time (EDT) (UTC−04:00)

8:00am - 9:30am
Core Trust Seal: https://youtu.be/abJ7a4wm_5o
Ceilyn Boyd (Harvard Library, Harvard University)
Philipp Conzett (UiT The Arctic University of Norway)
Sebastian Karcher (Qualitative Data Repository)
Sonia Barbosa (IQSS, Harvard University)
The Core Trust Seal (CTS, https://www.coretrustseal.org/) is the most widely used certification for data repositories. CTS is made up of a peer-reviewed self-assessment of a repository’s facilities, organization, and policies. Several of CTS’s requirements can be met based on Dataverse’s existing capabilities. Moreover, a number of repositories in the Dataverse community have already obtained the certification and can share their experience. Whether you are considering applying for CTS certification or whether you’ve already started the process, this session is for you!

8:00am - 9:30am
GDCC/Governance Session 2: https://youtu.be/LuO4EbGfRPg
Mercè Crosas (IQSS, Harvard University)
Jonathan Crabtree (Odum Institute, UNC Chapel Hill)
The session is open to all existing Global Dataverse Community Consortium (GDCC) members and others interested in becoming a member or learning more about GDCC. It will address the following topics:
•    Update on DataCite's new fee model and how it affects some GDCC members
•    Changes in GDCC membership from fiscal year to calendar year
•    Governance of GDCC and community contributions, including the proposal for a new external advisory board
•    Current efforts in community collaboration projects
•    Community members' input for prioritizing next new services
•    Q&A

Two identical sessions will be provided in order to give an opportunity for all GDCC members to participate. The other session is offered on Wednesday at 6:00pm EDT.

9:45am - 11:15am
Supporting Sensitive Data in Dataverse: https://youtu.be/1Uqbls419Go
Mercè Crosas (IQSS, Harvard University)
Marion Wittenberg (DANS)
Tania Schlatter (IQSS, Harvard University)
This session will present the current status of how Dataverse will support sensitive data, including an overview of methodology and a review of tools under development in the community. Participants will hear updates from Laura Huis in ’t Veld (DANS) about a DataTags recommendation tool, James Honaker (CRCS, Harvard University) about the OpenDP library of algorithms for privacy-preserving metadata, and Ilya Baldin (RENCI/UNC Chapel Hill) about the ImPACT project (Infrastructure for Privacy-Assured CompuTations) for trusted remote storage system components. There will be small group discussions of community members' use cases for sensitive data, followed by a report-out to the larger group and Q&A with presenters.

11:30am - 1:00pm
DevOps/Installation Deployments/Automated Testing: https://youtu.be/OTiEkaDc7j8
Don Sizemore (Systems Programmer/Analyst, Odum Institute UNC Chapel Hill)
Slava Tykhonov (DANS-KNAW/SSHOC Dataverse)
Over the last several years, the Dataverse Project has seen a number of new initiatives in the areas of DevOps, Installation Deployments, and Automated Testing. Attendees will discuss how the Dataverse Community currently uses technologies such as Vagrant, Ansible, Docker, Kubernetes, Jenkins, and more to help with the development process and running Dataverse in production and non-production environments. After a review of the history and current processes around these tools, we’ll work collaboratively to identify new directions, technologies, and use cases to support Dataverse into the future.

1:15pm - 2:45pm 
Geospatial Data: https://youtu.be/Ufc7oXA6ZJg
Eugene Barsky (The University of British Columbia)
Ben Lewis (Harvard Geospatial Library)
Bonnie Burns (Head of the Harvard Map Collection)
“BFFs – Dataverse and Geospatial” – this session will bring together those working with Dataverse repository services for geospatial data as well as those exploring the possibility. The main focus is to share best practices, solve common problems, and address technical issues with integrating geospatial data into a Dataverse repository and associated services. We will hear from developers of the Canadian Geodisy project, contributors to the Harvard Geospatial Library, Dataverse developers, and many others. Please join us to learn more about how Dataverse can help you store and serve your geospatial data now, and provide valuable feedback to the Dataverse team as they move forward with new services.
Geospatial Data Discovery and Exploration at CSU Libraries, Kevin Worthington (CSU)
Geospatial Data Use Cases, Wim Hugo (DANS)
Harvard Dataverse/Harvard Geospatial Library Integration, Maura Carbone (Harvard), Marc McGee (Harvard)
Integration of Dataverse and Geospatial Applications at UCLA, Jamie Jamison (UCLA), Kristian Allen (UCLA), and Zhiyuan Yao (UCLA)
Geodisy, Paul Dante (UBC)
Ideas for Supporting Geospatial Previews in Dataverse, Jim Myers (GDCC)

3:00pm - 4:30pm
Encapsulation: https://youtu.be/AybUKlImUIs
Ana Trisovic (IQSS, Harvard University)
Craig Willis (WholeTale)
Sandra Gesing (University of Notre Dame)
Dataverse has a long-term commitment to preserving research artifacts to enable research transparency (FAIR principles), reproducibility, and reuse. In recent years, virtual containers have become an integral part of the research process, yet there is no standardized way of disseminating them. This session aims to investigate a sustainable way of capturing and storing virtual containers while complying with FAIR principles.

To learn more about this issue in the context of Dataverse developments, see the document at https://arxiv.org/abs/2005.02985. The session will feature short expert talks and an open discussion. The session schedule is:
•    Introduction and the Dataverse perspective on encapsulation, Ana Trisovic
•    The mybinder.org experience, Tim Head (JupyterHub and Binder)
•    Describing and preserving access to research software dependencies with Emulation as a Service Infrastructure (EaaSI), Euan Cochrane (Yale University Library)
•    Singularity: Containers for Scientific Compute, Vanessa Sochat (Stanford Research Computing Center)
•    Archiving reproducible research with the ReproZip ecosystem, Vicky Steeves (NYU Libraries and the Center for Data Science)
•    Open Discussion and Outro

4:45pm - 5:45pm
New Feature Demo: https://youtu.be/N_zcjYnNQ8o
Sonia Barbosa (IQSS, Harvard University)
Danny Brooke (IQSS, Harvard University)
Come and learn about the Dataverse releases and features from over the last year! 

6:00pm - 7:00pm
Flexible Metadata Session 1: https://youtu.be/3_SRARQiBr8
Dataverse has a rich and extensible set of metadata capabilities, and many members of our community are interested in how to best leverage these capabilities to support researchers and in further expanding Dataverse’s metadata capabilities. This session will be separated into two hour-long breakouts to provide an opportunity to discuss what is currently possible, to learn about the range of potential new features being proposed, and to engage with others to help identify priorities and discuss how these ideas can be integrated into a coherent design and roadmap for long-term development.

Session One (this session): Panelists will give brief talks on metadata projects that have been recently implemented or are near-term opportunities. This discussion will focus on near-term metadata projects and possibilities that can be executed in the latest version.
A New DarwinCore Metadata Block, Olga Kurek (Mammal Research Institute of the Polish Academy of Sciences)
DDI Updates, Steve McEachern (Australian Data Archive)
Dataverse North Best Practices, Martine Gagnon (Dataverse North) and Alexandra Cooper (Dataverse North)

Session Two: A panel of speakers will give brief talks on longer-term metadata issues and projects that have major design implications. Discussion in this session will focus on opportunities, challenges, and priorities for long-term features and design development to facilitate metadata integration and exchange. Flexible Metadata Session 2 is offered on Friday at 8:30am EDT.
Leveraging External Metadata Services, Jonas Kahle (WZB Berlin Social Science Center)
Submitted Metadata Issues, Philipp Conzett (UiT The Arctic University of Norway/DataverseNO)
Support for External Vocabularies, Richard Fulss (International Maize and Wheat Improvement Center (CIMMYT))

Friday June 19 - Breakout Sessions

All times Eastern Daylight Time (EDT) (UTC−04:00)

8:30am - 9:30am
Flexible Metadata Session 2: https://youtu.be/fOzhYfEr7Rk
Dataverse has a rich and extensible set of metadata capabilities, and many members of our community are interested in how to best leverage these capabilities to support researchers and in further expanding Dataverse’s metadata capabilities. This session will be separated into two hour-long breakouts to provide an opportunity to discuss what is currently possible, to learn about the range of potential new features being proposed, and to engage with others to help identify priorities and discuss how these ideas can be integrated into a coherent design and roadmap for long-term development.

Session One: Panelists will give brief talks on metadata projects that have been recently implemented or are near-term opportunities. This discussion will focus on near-term metadata projects and possibilities that can be executed in the latest version. Session One is offered on Thursday at 6:00pm EDT.
A New DarwinCore Metadata Block, Olga Kurek (Mammal Research Institute of the Polish Academy of Sciences)
DDI Updates, Steve McEachern (Australian Data Archive)
Dataverse North Best Practices, Martine Gagnon (Dataverse North) and Alexandra Cooper (Dataverse North)

Session Two (this session): A panel of speakers will give brief talks on longer-term metadata issues and projects that have major design implications. Discussion in this session will focus on opportunities, challenges, and priorities for long-term features and design development to facilitate metadata integration and exchange.
Leveraging External Metadata Services, Jonas Kahle (WZB Berlin Social Science Center)
Submitted Metadata Issues, Philipp Conzett (UiT The Arctic University of Norway/DataverseNO)
Support for External Vocabularies, Richard Fulss (International Maize and Wheat Improvement Center (CIMMYT))

9:45am - 11:15am
External Tools: https://youtu.be/YH4I_kldmGI
Kaitlin Newson (Scholars Portal)
Phil Durbin (IQSS, Harvard University)
The external tools framework enables developers to create tools that connect to Dataverse to preview, explore, and curate datasets. In this session, we’ll review how external tools work in Dataverse, learn about external tools that have been developed by the community, and discuss ways we might enhance and expand on this powerful Dataverse feature.

11:30am - 1:00pm
Remote Storage/Large Datasets: https://youtu.be/LHyiA3JeiwE
Anna Dabrowski (Texas Advanced Computing Center)
Meghan Goodchild (Scholars Portal)
Gustavo Durand (IQSS, Harvard University)
The Remote Storage/Large Datasets breakout session will include brief presentations from community members and group discussion about improving data transfer mechanisms and storage options for Dataverse repositories. Presentations will cover data upload developments, including Globus integration and direct upload to S3, as well as Trusted Remote Storage Agents (TRSA) for sensitive data, and managing multiple data stores. This session is an opportunity to hear about and provide feedback on the efforts of community members, discuss policy and service implications for managing repositories, and talk about further coordinating our efforts.

1:15pm - 1:45pm
Closing Session/Social Event: https://youtu.be/l5y2L16PNxs

Wednesday June 17 - Breakout Session

All times Eastern Daylight Time (EDT) (UTC−04:00)

6:00pm - 7:30pm
GDCC/Governance Session 1
Mercè Crosas (Chief Data Science and Technology Officer, IQSS)
Jonathan Crabtree (Assistant Director of Cyberinfrastructure, Odum Institute UNC Chapel Hill)
The session is open to all existing Global Dataverse Community Consortium (GDCC) members and others interested in becoming a member or learning more about GDCC. It will address the following topics:
•    Update on DataCite's new fee model and how it affects some GDCC members
•    Changes in GDCC membership from fiscal year to calendar year
•    Governance of GDCC and community contributions, including the proposal for a new external advisory board
•    Current efforts in community collaboration projects
•    Community members' input for prioritizing next new services
•    Q&A

Two identical sessions will be provided in order to give an opportunity for all GDCC members to participate. The other session is offered on Thursday at 8:00am EDT.