About
Launched:
2007
Record Updated:
Apr 25, 2024
The Dataverse Project is an open-source web application to share, preserve, cite, explore, and analyze research data. It facilitates making data available to others and allows you to replicate others' work more easily. Researchers, journals, data authors, publishers, data distributors, and affiliated institutions all receive academic credit and web visibility.
A Dataverse repository is the software installation, which then hosts multiple virtual archives called Dataverse collections. Each Dataverse collection contains datasets, and each dataset contains descriptive metadata and data files (including documentation and code that accompany the data). As an organizing method, Dataverse collections may also contain other Dataverse collections.
Mission
The mission of the Dataverse Project is to revolutionize the way data is managed and shared by automating tasks traditionally carried out by professional archivists. Our goal is to empower data creators by providing services that allow them to receive proper credit for their data while ensuring long-term preservation. We aim to eliminate the dilemma researchers faced in the past, where they had to choose between control and credit or preservation. With the Dataverse Project, we break this dichotomy by creating a Dataverse collection on your website that maintains your branding and URL, offers academic citation for the data, and provides full credit and visibility. Dataverse in addition to what is mentioned above, also has fully-fledged support for quality control, e.g., in the form of curation workflows. Simultaneously, our Dataverse repositories, backed by institutions, guarantee long-term preservation.
Key Achievements
The Dataverse UI is being redesigned as a single-page application (SPA) in which internal and external Dataverse services will be delivered solely through improved APIs. This change will improve UI responsiveness and empower the Dataverse community to develop their own UIs and applications using extended Dataverse API endpoints. Read more about our roadmap for the Dataverse infrastructure redesign in: Restructuring the Dataverse UI as a Single-Page Application: https://docs.google.com/document/d/19pbENuYyHErEmblbFGQ47_uJpTfqVKbn9O0QftVqeeU/edit#heading=h.9b7lzr4a7odc
Generalist Repository Ecosystem Initiative (GREI). Multi-year grant starting in 2022 to develop collaborative approaches for data management and sharing through inclusion and enrichment of generalist repositories in the NIH data ecosystem, including the Harvard Dataverse Repository. The announcement has details of areas being explored: https://datascience.nih.gov/data-ecosystem/generalist-repository-ecosystem-initiative
CAFE GRANT:The BUSPH-HSPH Climate Change and Health Research Coordinating Center (CAFÉ) is a three-year cooperative agreement between the Boston University School of Public Health, the Harvard T.H. Chan School of Public Health, and the National Institutes of Health. CAFÉ aims to Convene, Accelerate, Foster, and Expand the climate and health community of practice, both in the US and globally. This collection serves the climate and health COP as a repository for datasets of any kind that enable broad, interdisciplinary research in the area of climate and health: https://github.com/Climate-CAFE/