Thoth Open Metadata

Thoth Open Metadata
United Kingdom of Great Britain and Northern Ireland

About

Launched: 2020
Record Updated: Nov 15, 2024
Format conversion tool or service
Open scholarly dataset
Web archiving system
Thoth is a metadata management and distribution platform and service designed to address the challenges faced by small- to medium-sized publishers in getting open access works into the book supply chain. It offers comprehensive solutions to streamline metadata creation, management, dissemination, and archiving workflows to improve the discoverability and accessibility of scholarly books.
The Thoth Open Archiving Network (TOAN) serves as a vital resource for publishers seeking to preserve their publications for the long-term. TOAN prodides transparent workflows to automatically archive their publications in multiple repository locations, thereby filling a significant gap in preservation infrastructure. This initiative provides an essential safeguard against the risk of complete loss of publishers' catalogues, supporting the long-term accessibility and preservation of scholarly works.

Mission

Thoth is uniquely designed as a metadata management and distribution platform to address the complexities and hurdles involved in integrating Open Access works into the book supply chain.
Our mission is:

  • To lower the entry barrier to good metadata management and practices for small/medium OA publishers who are currently struggling to produce their metadata to all the various different specifications that each distributing platform requires;
  • To help distribute Open Access books, which have been systematically excluded from a book supply chain that was created for closed books;
  • To expose quality and first-hand metadata, using industry standards, publicly for anyone to consume.

Key Achievements

As of Q4/2024, more than 30 independent, scholar-led, university and library publishers from across the globe are using Thoth's platform, open APIs, and services to create, manage, and disseminate high-quality open metadata for their open access books and chapters. Doing so, they are able to leverage powerful PID integrations of DOIs, ORCiDs, and RORs, and controlled vocabularies such as Thema to have their metadata readily available for free in a variety of platform-specific flavours of industry-standard formats such as ONIX, MARC21/MARCXML, KBART, BibTeX, CSV, Crossref XML, etc.
Over the past year, Thoth has established close relationships with multiple open infrastructures such as Crossref, OAPEN, DOAB, Open Book Collective, OASPA, and OPERAS.
The Thoth database has entries for more than 2.4k books and 12.5k chapters from >45 publishers, and growing. More than 30 international libraries are supporting Thoth through Open Book Collective membership.

Technical Attributes

Maintenance Status

Actively Maintained

Open Code Repository

Implemented

Technical Documentation

Implemented

Code License

Implemented

Open Data Statement

Implemented

Technical Attribute Statements

Programming Languages

  • rust

Technology Readiness Level

  • Actual system proven in operational environment

Code Licenses Used

  • Apache License, Version 2.0

Content Licensing

  • Creative commons licenses

Standards

Metadata

  • JSON
  • KBART
  • MARC
  • MARC XML
  • ONIX
Other:
ONIX 2.1, ONIX 3.0, MARC21, MARCXML, KBART, JSON, CSV, BibTeX, Crossref XML DOI deposit records for books and chapters, Crossmark,

Persistent Identifier

  • ORCiD
  • Research Organization Registry
Other:
DOIs

Security

running on Amazon AWS, so all standards that are supported by AWS.

Metrics

Thoth is currently working on a provision of usage metrics via the OPERAS Metrics service

Hosting Options

  • Through solution only

Integrations

  • Creative Commons Licenses
  • Crossref
  • Directory of Open Access Books (DOAB)
  • Janeway
  • OAPEN Library
  • Open Monograph Press (OMP)
  • Research Organization Registry (ROR)
  • Zenodo
Other:
Internet Archive, Figshare

Community Engagement

Code of Conduct

Implemented

Community Engagement

Implemented

Community Statements

User Contribution Pathways

  • Contribute funds
  • Contribute to code
  • Contribute to documentation
  • Contribute to governance
  • Contribute to user research or user testing
  • Contribute to working groups or interest groups

Community Engagement Activities

  • Annual meetings
  • Blogs
  • Community calls
  • Conference participation
  • Interest, working, user, or advisory groups
  • Social media
  • User research
  • Webinars and training

Policies & Governance

Governance Summary

Thoth Open Metadata has been set up as a UK Community Interest Company (CIC) limited by guarantee, to ensure that the community’s interest are being met, no private gain (such as paying out dividends to its membership) is sought, and any surplus or assets are used principally for the benefit of the community. Thoth has been formed as a “large membership” CIC, which means that the members have ultimate control over the activities of the CIC. It also requires there to be more members than Directors and allows for the membership to grow as the company grows. These members collectively decide when and how the overall membership will grow. Another key feature of a Community Interest Company is the Asset Lock, which guarantees transfer of resources to a predefined set of organisations in the case of the CIC facing insolvency. The Membership approves the appointments of Thoth’s Board of Directors, which is responsible for proper management of the company’s business.

Policies

Commitment to Equity & Inclusion

In Progress

Privacy Policy

Implemented

Web Accessibility Statement

In Progress

Governance Structure & Processes

Implemented

Transparent Pricing and Cost Expectations

Implemented

Policy Statements

Board Structure

Board of Directors

Board Level

The Board of Directors is responsible for the overall management of the non-profit CIC.

Community Governance

  • Formal

Additional Information

Organizational History

Thoth emerged out of the collaborative efforts of the Community-led Open Publication Infrastructures for Monographs (COPIM) project. Together with nearly two dozen partners from the academic and open publishing world we are currently part of the three-year Open Book Futures funded by the Research England Development Fund/UKRI and Arcadia, led by Lancaster University.
Thoth is increasingly integrated with other platforms such as the Open Book Collective, PKP’s OMP, and the Internet Archive (the latter via the Thoth Open Archiving Network). Moreover, Thoth actively engages with like-minded open infrastructures to foster partnerships to enable small-to-medium-sized publsihers to participate in the wider book supply chain. Through collaborations, such as Thoth's Crossref Sponsorship and our OPERAS, OAPEN and OASPA memberships, we seek to provide valuable resources and services to publishers from across the globe.
https://thoth.pub/about/partners

Organizational Structure

Business or Ownership Model

Non-profit organization

Full-time Staff

1-5

Volunteers

0

Non-profit Status

Community interest company (CIC)

Current Affiliations

Community-led Open Publication Infrastructures for Monographs (COPIM)
Copim Open Book Futures
Open Book Collective

Funding

Primary Funding Source

  • Contributions