Usability Specialist, Data Science at IQSS, Harvard University
June 9 - Pre-Meeting Workshop: Towards a Common Deposit API (the Dataverse example)
For the past few years Dataverse has been using the SWORD protocol as the standard for a Data Deposit API, but is this the standard all repositories should use for Data Deposit APIs? We will discuss the good parts and the challenges of this approach. Additionally this presentation will lead into the Panel Discussion consisting of various stakeholders from publishers, domain and general repositories, funding agencies, researchers, and industry.
June 10 - Introduction to Dataverse 4.0 / Where We Are
Highlights of the latest version of Dataverse as well as an overview of current and upcoming collaborations for the future of Dataverse.
Data in Brief, an Open Access journal published by Elsevier, exclusively publishes data articles wherein researchers describe their datasets. Data in Brief requires that all data be made publicly available either directly with the article as supplementary files or in a public repository. Data in Brief has teamed up with DataVerse to provide a venue for authors to archive and openly share their data.
Project Manager, Deputy Research Librarian at Fudan University Library
Session 2: Dataverse Internationalization
Zhang Jilong & Yin Shenqin will discuss the internationalization development work done by Fudan University to support a Chinese language user interface in Dataverse. Additionally, the practice of data curation at Fudan University will be presented, as well as the branding and dissemination of Dataverse in China.
June 11 - Session 4: Big Data Repository for Structural Biology: Challenges and Opportunities
SBGrid (Morin et al., 2013, eLIFE and www.sbgrid.org) is a Harvard based structural biology global computing consortium with a primary focus on the curation of research software. Dr. Sliz will discuss a recent SBGrid project that aims to establish a repository for experimental datasets from SBGrid laboratories. Issues of handling large data volumes, data validation and repository sustainability will be addressed in this talk.
Session 1: Preservation Tools: Integration with Archivematica
Scholars Portal, a program of the Ontario Council of University Libraries (OCUL), provides the technical infrastructure to store, preserve, and provide access to shared digital library collections in Ontario - including hosting a local instance of Dataverse since 2011. As part of a national project known as Portage (a project of the Canadian Association of Research Libraries), Scholars Portal is partnering with Artefactual Systems, Dataverse, the University of British Columbia, the University of Alberta, and others, to integrate Dataverse with preservation software Archivematica. When completed, this project will facilitate the long-term preservation of research data according to the Open Archival Information System (OAIS) Reference Model.
Software Developer and Architect, International Institute of Social History, Amsterdam, Netherlands
Session 4: Data analysis in Dataverse & visualization of datasets on historical maps
The CLIO INFRA project (http://www.clio-infra.eu) hosted at the International Institute of Social History (IISH, http://www.socialhistory.org) integrates a number of data(hubs) on global social, economic and institutional indicators over the past five centuries. Considering a switch to Dataverse, the IISH is currently developing visualization tools that fit on top of Dataverse, especially historical maps with temporally accurate borders, such as provided in the NLGIS (http://nlgis.nl), Labour conflicts (http://laborconflicts.socialhistory.org) projects.
This talk will introduce the Dataverse widget developed to extract, transform and load data from datasets storage to platform independent internal data warehouses with possibilities to validate and check the quality of research datasets.
Isaac Peral Distinguished Researcher, Centro de Biotecnología y Genómica de Plantas, Universidad Politecnica de Madrid
Pre-Meeting Workshop: Metadata Profiles for Repositories: the Meta-Metadata
It would be useful to be able to discover what kinds of data are contained in the myriad general-purpose public data repositories. It would be even better if it were possible to query that data and/or have that data conform to a particular context-dependent data format. This was the ambition of the Data FAIRport project. I will be demonstrating the "strawman" demonstration of a fully-functional Data FAIRport, where the meta/data in a public repository can be "projected" into one of a number of different context-dependent formats, such that it can be cross-queried in combination with the (potentially "projected") data from other repositories.