Agenda

Slides for the presentations are available in an Open Science Framework project.

Videos for the sessions are linked below in the agenda.

Tuesday June 14 - Plenary Sessions

All times Eastern Daylight Time (EDT) (UTC−04:00)

Video: https://youtu.be/YghUP5mMwMU

8:00am - 8:15am
Coffee via Zoom

8:15am - 8:30am
Community Update Videos
Dwayne Liburd (IQSS, Harvard University)

8:30am - 9:00am
Dataverse Software - New Features and Future Plans
Gustavo Durand (IQSS, Harvard University)

9:00am - 9:30am
Welcome
Gary King (IQSS, Harvard University)

9:30am - 10:20am
Keynote
Session Chairs: Steven McEachern, Janet McDougall (the Australian Data Archive)
Aapooyaki Bonnie Healy (Blackfoot Confederacy)

10:25am - 11:10am
Indigenous Data Management: Indigenous Sovereignty in Action
Session Chairs: Steven McEachern, Janet McDougall (the Australian Data Archive)
Sarah Giesbrecht (The University of Winnipeg)
Anatsiisinsski (Chyloe Healy) (Blackfoot Confederacy Tribal Council)
The necessity of Indigenous data sovereignty and governance for reconciliation cannot be overstated, and while there has been a fair amount of research promoting their importance, the practical and technical details of Indigenous data management are less widely known. The Indigenous Data Management: Indigenous Sovereignty in Action project, funded by the SSHRC Connection Grant for Special Research Data Management, is intended to bring together experts in Indigenous data and information management to share their knowledge through a series of webinars and compile this information into an online toolkit. The webinars, which took place last year, introduced community members to the basics of research data management; ways to Indigenize and decolonize data management; and practical methods and technologies for data organization, management, and controlling access. The online toolkit, which is currently in development, will compile these lessons into a resource we hope will be useful to both non-academic and academic audiences. We hope these tools will be wide-reaching and particularly valuable for Indigenous organizations, communities, and researchers on their journeys to Indigenous data sovereignty and self-determination in research.

For a discussion on applying Indigenous Data Sovereignty principles to RDM, we will provide an overview of our project so far and identify the key concerns for Indigenous data management to advance Indigenous data sovereignty; this presentation will utilize breakout rooms for discussion.

11:15am - 12:00pm
NIH/GREI: DMP/RDM Updates
Session Chair: Sonia Barbosa (IQSS, Harvard University)
Julie Goldman (HMS Countway Library, Harvard University) 
Melissa Renee Korf (Office of Research Administration, HMS, Harvard University)
The National Institutes of Health (NIH) has published a new Policy for Data Management and Sharing which will become effective for all competing applications that are submitted to the NIH on or after January 25, 2023. The new policy will require a data management and sharing plan for ALL NIH-funded projects, an expansion from the current requirement for such a plan for projects over $500K in annual direct costs, and strongly encourages the use of established repositories to the extent possible for preserving and sharing scientific data. The Dataverse Project at IQSS is joining the NIH Office of Data Science Strategy (ODSS) and five other generalist repositories in launching the Generalist Repository Ecosystem Initiative (GREI) with the intention of supplementing domain-specific repositories in support of the NIH biomedical data ecosystem for data sharing. This update will provide an overview of Harvard University’s work to date to create services, tools, and resources to facilitate the creation of data management and sharing plans, publicize Harvard Dataverse as an option for data sharing, and ultimately support compliance with the NIH Policy.

12:00pm - 12:30pm
Lunch Break

12:35pm - 2:00pm
Keynote: Local Contexts - TRADITIONAL KNOWLEDGE LABELS & NOTICES
Session Chairs: Steven McEachern, Janet McDougall (the Australian Data Archive)
Jane Anderson (New York University)

Plenary remarks by Steven McEachern

#Dataverse2022: What's Ahead Today, Wednesday and Thursday
Sonia Barbosa (IQSS, Harvard University)

Wednesday June 15 - Breakout Sessions

All times Eastern Daylight Time (EDT) (UTC−04:00)

8:00am - 10:00am
Updates from the Global Dataverse Community Consortium (GDCC) Session 2: https://youtu.be/sLoOrSMHK4Y
Jon Crabtree (Odum Institute, UNC Chapel Hill), Philipp Conzett (UiT), James Myers (GDCC, QDR)
The session is open to all existing Global Dataverse Community Consortium (GDCC) members and others interested in becoming a member or learning more about GDCC. It will address the following topics:

  • Financial updates
  • Ongoing projects
  • GDCC website
  • Governance and Advisory Board
  • Dataverse Community Survey 2022

Research Data Publishing Ethics - Outputs from FORCE11 & COPE Working Group: https://youtu.be/sLoOrSMHK4Y?t=3658
Katie Mika (IQSS, Harvard University)

The Dataverse Project for Managing and Sharing Your Research Data: https://youtu.be/sLoOrSMHK4Y?t=4412
Sonia Barbosa (IQSS, Harvard University)

10:00am - 11:30am (Concurrent Sessions)
Tools/Curation/Metadata Session 1: https://youtu.be/aIC_cfkWgZk
Session Chair: Slava Tykhonov (DANS-KNAW)

10:00am - 12:00pm (Concurrent Sessions)
Curation and/or Workflows Session 1: https://youtu.be/4b-1jwxKb9A
Session Chair: Sebastian Karcher (QDR)
Session Moderator: Dorothea Iglezakis (University of Stuttgart)
This session showcases the breadth of curation efforts taking place at different Dataverse installations. Topics discussed will include outreach to users such as user surveys and engagement dashboards, curation for difficult data such as large datasets or highly sensitive data, as well as approaches to improve machine-accessibility of data through standardized licenses and FAIR workflows.
The session will be structured as two sets of three talks, with time for discussion after each set of talks.

  • Ceilyn Boyd, Katie Mika (Harvard Library): Curating Large Library Collections in the Harvard Dataverse Repository
  • Mahmood Shad (FASRC, Harvard): FAIR Computational Workflow Support in Dataverse
  • Christina Chan-Park (Baylor University), Laura Sare (Texas A&M University), Laura Waugh (Texas State University): Getting to Know You: Results of the Texas Data Repository User Survey
  • Wim Hugo (DANS-KNAW): Licences and Automation
  • Kathryn Claypool (ASU): Using Metadata to Catalog: When Protected Data Can’t Be Stored in Dataverse
  • Saadiah Ghazali (WorldFish): WorldFish Dataverse Performance

12:00pm - 12:15pm
Break

12:15pm - 1:00pm
Harvard Data Commons: https://youtu.be/S6tbrqOU8QM
Session Chair: Sonia Barbosa (IQSS, Harvard University)
At Harvard, the Dataverse and DASH (Digital Access to Scholarship at Harvard) systems provide researchers a way to share data and publications (including article preprints, reports, and theses) and to indicate the relationships between them. Until now, these systems have been independent, requiring researchers to enter duplicate information or contact a repository manager to indicate bi-directional data-publication links. Through the Harvard Data Commons project, we’ve developed a Confederation of Open Access Repositories (COAR) Notify/Linked Data Notification-based mechanism for these repositories to inform each other of new materials potentially related to their holdings, allowing administrators to easily create back-links as appropriate. This talk will include a demonstration of the work and highlight the new capabilities in Dataverse that can be leveraged for other inter-application notifications.

  • Andrew Woods (Harvard University): Overview: Harvard Data Commons and Integrating Dataverse and DSpace

1:00pm - 1:30pm
Lunch Break

1:30pm - 3:30pm
Metadata Session 1: https://youtu.be/eeCMN_5Nj_0
Session Chair: Jennifer Doty (Emory University)
Session Moderator: José Carvalho (Universidade de Aveiro)
Learn about how Dataverse installations around the globe are developing and enhancing robust Dataverse metadata functionality for improved data user experiences.

  • Slava Tykhonov, Jerry de Vries, Andrea Scharnhorst (DANS-KNAW): Flexible Metadata Schemes Support for Dataverse
  • Wim Hugo (DANS-KNAW): Where Does Metadata End, and the Graph Begin?
  • Oliver Bertuch (Forschungszentrum Jülich GmbH): HERMES + DATAVERSE - ARE WE READY FOR RESEARCH SOFTWARE?
  • John Huck (University of Alberta Library), Kaitlin Newson (Scholars Portal): One to Beam Up: Migrating a Dataverse Repository to a Shared Installation
  • Eugene Barsky, Paul Lesack (UBC): Dryad2Dataverse
  • Jan Range (University of Stuttgart): EasyDataverse: Generation of a User-friendly Python Interface to a Dataverse Installation
  • Dorothea Iglezakis, Anett Seeland, Anne Kreuter, Usman Sikander Mirza (University of Stuttgart): Metadata2Dataverse

3:45pm - 4:45pm (Concurrent Sessions)
Preservation Session 1: https://youtu.be/qY3ZUwLqL8Y
Session Chair: Miguel Angel Mardero (IBICT)
Session Moderator: Amber Leahey (Scholars Portal)

  • Grant Hurley (Scholars Portal): Fixity in the Cloud: Preservation Planning for Scholars Portal Dataverse
  • Tiago Braga, Tatiana Canelhas Pignataro (IBICT/UFRJ): Hipátia: a model to implement trusted archival digital repositories
  • James Myers (GDCC, QDR), Andrew Woods (Harvard University): There and Back Again: Recent Updates to Dataverse’s BagIt-based Archiving Capabilities

3:45pm - 5:45pm (Concurrent Sessions)
Geospatial Session 1: https://youtu.be/EACJJaV3O1c
Session Chair: Kevin Worthington (Colorado State)
Data with a geospatial component offers many opportunities to enhance search, discovery and exploration online. This session will highlight efforts in this area which could help shape the way geospatial data is supported within Dataverse.

  • Kaitlin Newson (Scholars Portal): GeoJSON Preview
  • Bert Spaan (allmaps.org): Allmaps: curating, georeferencing and exploring for IIIF maps
  • Fu Xiaokang (CGA): Dataverse Enables Effective Sharing of COVID-19 Data
  • Kevin Worthington (Colorado State): IR Geospatial Data Support Comparison
  • Wim Hugo (DANS-KNAW): Where Does Metadata End, and the Graph Begin? (6/15/2022 only)
  • James Myers (GDCC, QDR): Experimenting with geospatial indexing (6/15/2022 only)

Thursday June 16 - Breakout Sessions

All times Eastern Daylight Time (EDT) (UTC−04:00)

8:15am - 9:15am (Concurrent Sessions)
Software Session 2: https://youtu.be/-Re_elg67e8
Session Chair: Vaidas Morkevičius (Kaunas University of Technology)
Session Moderator: Deirdre Kirmis (Arizona State University Library)
This session is intended to present and discuss various enhancements and challenges related to Dataverse software.

8:15am - 10:15am (Concurrent Sessions)
Integrations/Tools Session 2: https://youtu.be/3ek7F_Dxcjk
Session Chair: Pedro Miguel Oliveira Bento Príncipe (University of Minho)

  • Baptiste Rouxel (Sciences Po): Creating a Customizable Open Source Dataverse Widget for Website Integration   
  • Ines Schmahl, Florian Rhiem (Forschungszentrum Jülich): From Lab Notebook to Publication - A Lighthouse Workflow for Research Data
  • Dan Willoughby (Storj): Using an Alternative to S3 - Connecting Dataverse to Storj
  • Yves Rozenholc (Université Paris Cité, France), Slava Tykhonov (DANS-KNAW, the Netherlands): Building the COVID-19 Museum Using Dataverse
  • Stefan Kasberger (AUSSDA): pyDataverse: Doing Tests, Data Migrations and Other API Stuff
  • James Myers (GDCC, QDR): Large Data Support; Globus
  • Ana Trisovic (IQSS, Harvard University): Dataverse Integration With GitHub via a GitHub Action

10:30am - 12:30pm
Metadata Session 2: https://youtu.be/eeCMN_5Nj_0
Session Chair: Jennifer Doty (Emory University)
Session Moderator: José Carvalho (Universidade de Aveiro)
Note-taker: Julian Gautier (IQSS, Harvard University)

  • Slava Tykhonov, Jerry de Vries, Andrea Scharnhorst (DANS-KNAW): Flexible Metadata Schemes Support for Dataverse
  • Wim Hugo (DANS-KNAW): Where Does Metadata End, and the Graph Begin?
  • Oliver Bertuch (Forschungszentrum Jülich GmbH): HERMES + DATAVERSE - ARE WE READY FOR RESEARCH SOFTWARE?
  • John Huck (University of Alberta Library), Kaitlin Newson (Scholars Portal): One to Beam Up: Migrating a Dataverse Repository to a Shared Installation
  • Eugene Barsky, Paul Lesack (UBC): Dryad2Dataverse
  • Jan Range (University of Stuttgart): EasyDataverse: Generation of a User-friendly Python Interface to a Dataverse Installation
  • Dorothea Iglezakis, Anett Seeland, Anne Kreuter, Usman Sikander Mirza (University of Stuttgart): Metadata2Dataverse

12:30pm - 1:15pm
Lunch Break

1:15pm - 2:15pm (Concurrent Sessions)
Preservation Session 2: https://youtu.be/qY3ZUwLqL8Y
Session Chair: Miguel Angel Mardero (IBICT)
Session Moderator: Amber Leahey (Scholars Portal)

  • Grant Hurley (Scholars Portal): Fixity in the Cloud: Preservation Planning for Scholars Portal Dataverse
  • Tiago Braga, Tatiana Canelhas Pignataro (IBICT/UFRJ): Hipátia: a model to implement trusted archival digital repositories
  • James Myers (GDCC, QDR), Andrew Woods (Harvard University): There and Back Again: Recent Updates to Dataverse’s BagIt-based Archiving Capabilities

1:15pm - 2:15pm
Geospatial Session 1: https://youtu.be/EACJJaV3O1c
Session Chair: Kevin Worthington (Colorado State)
Data with a geospatial component offers many opportunities to enhance search, discovery and exploration online. This session will highlight efforts in this area which could help shape the way geospatial data is supported within Dataverse.

  • Kaitlin Newson (Scholars Portal): GeoJSON Preview
  • Bert Spaan (allmaps.org): Allmaps: curating, georeferencing and exploring for IIIF maps
  • Fu Xiaokang (CGA): Dataverse Enables Effective Sharing of COVID-19 Data
  • Kevin Worthington (Colorado State): IR Geospatial Data Support Comparison

2:30pm - 4:00pm (Concurrent Sessions)
Tools/Curation/Metadata Session 2: https://youtu.be/aIC_cfkWgZk
Session Chair: Slava Tykhonov (DANS-KNAW)

2:30pm - 4:30pm (Concurrent Sessions)
Curation and/or Workflows Session 2: https://youtu.be/4b-1jwxKb9A
This session showcases the breadth of curation efforts taking place at different Dataverse installations. Topics discussed will include outreach to users such as user surveys and engagement dashboards, curation for difficult data such as large datasets or highly sensitive data, as well as approaches to improve machine-accessibility of data through standardized licenses and FAIR workflows.

The session will be structured as two sets of three talks, with time for discussion after each set of talks.

Session Chair: Sebastian Karcher (QDR)
Session Moderator: Dorothea Iglezakis (University of Stuttgart)

  • Ceilyn Boyd, Katie Mika (Harvard Library): Curating Large Library Collections in the Harvard Dataverse Repository
  • Mahmood Shad (Harvard, FASRC): FAIR Computational Workflow Support in Dataverse
  • Christina Chan-Park (Baylor University), Laura Sare (Texas A&M University), Laura Waugh (Texas State University): Getting to Know You: Results of the Texas Data Repository User Survey
  • Wim Hugo (DANS-KNAW): A Minimum Set of Machine-Readable and Actionable Licenses
  • Kathryn Claypool (ASU): Using Metadata to Catalog: When Protected Data Can’t Be Stored in Dataverse
  • Saadiah Ghazali (WorldFish): WorldFish Dataverse Performance

4:30pm - 5:00pm
Closing Session
Jonathan Crabtree (Odum Institute for Research in Social Science at UNC Chapel Hill)

Tuesday June 14 - Breakout Sessions

All times Eastern Daylight Time (EDT) (UTC−04:00)

2:00pm - 3:00pm (Concurrent Sessions)
Software Session 1: https://youtu.be/-Re_elg67e8
Session Chair: Vaidas Morkevičius (Kaunas University of Technology)
Session Moderator: Deirdre Kirmis (Arizona State University Library)
This session is intended to present and discuss various enhancements and challenges related to Dataverse software.

2:00pm - 4:00pm (Concurrent Sessions)
Integrations/Tools Session 1: https://youtu.be/3ek7F_Dxcjk
Session Chair: Pedro Miguel Oliveira Bento Príncipe (University of Minho)

  • Baptiste Rouxel (Sciences Po): Creating a Customizable Open Source Dataverse Widget for Website Integration   
  • Ines Schmahl, Florian Rhiem (Forschungszentrum Jülich): From Lab Notebook to Publication - A Lighthouse Workflow for Research Data
  • Dan Willoughby (Storj): Using an Alternative to S3 - Connecting Dataverse to Storj
  • Yves Rozenholc (Université Paris Cité, France), Slava Tykhonov (DANS-KNAW, the Netherlands): The COVID-19 Museum... a use case for the Dataverse community
  • Stefan Kasberger (AUSSDA): Doing tests, data migrations and other API stuff with the help of pyDataverse
  • James Myers (GDCC, QDR): Adding Lots of Zeros to the Size of Datafiles
  • Ana Trisovic (IQSS, Harvard University): Dataverse Integration With GitHub via a GitHub Action

4:00pm - 6:00pm
Updates from the Global Dataverse Community Consortium (GDCC) Session 1: https://youtu.be/sLoOrSMHK4Y
Jon Crabtree (Odum Institute, UNC Chapel Hill), Philipp Conzett (UiT), and James Myers (GDCC, QDR)
The session is open to all existing Global Dataverse Community Consortium (GDCC) members and others interested in becoming a member or learning more about GDCC. It will address the following topics:

  • Financial updates
  • Ongoing projects
  • GDCC website
  • Governance and Advisory Board
  • Dataverse Community Survey 2022

Research Data Publishing Ethics - Outputs from FORCE11 & COPE Working Group: https://youtu.be/sLoOrSMHK4Y?t=3658
Katie Mika (IQSS, Harvard University)

The Dataverse Project for Managing and Sharing Your Research Data: https://youtu.be/sLoOrSMHK4Y?t=4412
Sonia Barbosa (IQSS, Harvard University)