- Nov 2024
-
www.nature.com www.nature.com
-
comprehensive policies supporting community-agreed practices
TRSP Desirable Characteristics
-
Providing data services e.g. portal and machine interfaces, data download or server-side processing.
TRSP Desirable Characteristics
-
A TRUSTworthy repository needs to focus on serving its target user community. Each user community likely has differing expectations from their community repositories, depending in part on the community’s maturity regarding data management and sharing. A TRUSTworthy repository is embedded in its target user community’s data practices, and so can respond to evolving community requirements
TRSP Desirable Characteristics
-
TRUSTworthy repositories take responsibility for the stewardship of their data holdings and for serving their user community.
TRSP Desirable Characteristics
-
Terms of use, both for the repository and for the data holdings.
TRSP Desirable Characteristics
-
The adoption of technological capabilities should be completed in conjunction with the organizational, managerial and stewardship capabilities that facilitate the continuing use of a data repository’s holdings
TRSP Desirable Characteristics
-
Adhering to the designated community’s metadata and curation standards, along with providing stewardship of the data holdings e.g. technical validation, documentation, quality control, authenticity protection, and long-term persistence.
TRSP Desirable Characteristics
-
A repository depends on the interaction of people, processes, and technologies to support secure, persistent, and reliable services. Its activities and functions are supported by software, hardware, and technical services.
TRSP Desirable Characteristics
-
Implementing relevant and appropriate standards, tools, and technologies for data management and curation
TRSP Desirable Characteristics
-
Implementing relevant data metrics and making these available to users.
TRSP Desirable Characteristics
-
Having plans and mechanisms in place to prevent, detect, and respond to cyber or physical security threats
TRSP Desirable Characteristics
-
Providing (or contributing to) community catalogues to facilitate data discovery.
TRSP Desirable Characteristics
-
To fulfill this role, TDRs must demonstrate essential and enduring capabilities necessary to enable access and reuse of data over time for the communities they serve
TRSP desirable Characteristics
Tags
- TRSP-TRUST-Discovery
- TRSP
- Services
- Authenticity
- API Provision
- User Focus
- TRSP-TRUST-Reuse
- Responsibility
- Quality Assurance
- TRSP-TRUST-Reliable
- API
- User Support
- Documentation
- Persistence
- TRSP-TRUST-Technology
- Support
- FAIR, CARE, TRUST - Adoption, Implementation, and Deployment
- TRSP-TRUST-Metrics
- TRSP-TRUST-Security
- TRSP-TRUST-Planning
- Transfer
- TRUST
- Portal
- Standards
- Community
- Metadata
- Terms of Use
- Validation
Annotators
URL
-
-
zenodo.org zenodo.org
-
Encoding of PID Policy
-
TRSP Desirable Characteristics PID services and PID Managers SHOULD have clear versioning policies.
-
TRSP Desirable Characteristics The PID manager MUST provide the functionality required to maintain PID attributes.
-
TRSP Desirable Characteristics PID Service SHOULD resolve at least p percent of PIDs in a randomised sample, where p is determined by community and dependency expectations.
-
TRSP Desirable Characteristics PID Service Providers MUST have a clear sustainability and succession plan with an exit strategy that guarantees the continuity of the resolution of its PIDs registered with the service.
-
TRSP Desirable Characteristics PID Service Providers SHOULD document a summary of their maintenance and availability provisions publicly.
-
TRSP Desirable Characteristics PID Services MUST meet 999 availability and uptime.
-
TRSP Desirable Characteristics The PID Manager MUST maintain the integrity of the relationship between entities and their PIDs, in conformance to a PID Scheme defined by a PID Authority.
-
TRSP Desirable Characteristics PID Manager MUST ensure that the entity remains linked to the PID. In case that the entity being identified is deleted or ceases to exist, tombstone information needs to be included in the PID attribute set.
-
TRSP Desirable Characteristics PID Services SHOULD aim for a persistence median that is acceptable to and aligns with community and dependency expectations.
-
TRSP Desirable Characteristics The PID manager SHOULD provide policies and contractual arrangements for transfer of ownership should the owner no longer be able to assume responsibilities in compliance with the policy.
-
TRSP Desirable Characteristics Services MUST be available to all researchers in the EU.
-
TRSP Desirable Characteristics The basic services of PID registration and resolution SHALL have no cost to end users.
-
TRSP Desirable Characteristics Physical and conceptual entities MUST be represented via a digital representation (e.g. landing page, metadata, attribute set, database index) to have a presence in the digital landscape.
-
TRSP Desirable Characteristics PID Authorities and Services MUST agree to be certified with a mutually agreed frequency in respect of policy compliance.
-
TRSP Desirable Characteristics A PID Service infrastructure MUST be at a minimum technology readiness level of 8. This applies to basic services (registration, resolution).
-
TRSP Desirable Characteristics The PID Manager MUST maintain entity metadata as accurately as possible in collaboration with the PID Owner. This copy is the authoritative version.
Tags
- Resolvable
- TRSP
- Services
- PID Persistence
- No Cost
- Quality Assurance
- Content Negotiation
- TRL
- Functionality
- Tombstone
- Documentation
- Maturity
- Persistence
- Ownership
- Continuity
- Free
- End User
- Certification
- Availability
- Resolution Percentage
- Uptime
- Metadata Maintenance
- Versioning
- PID
- Sustainability
- Transfer
- Accuracy
- Digital Representation
- Metadata
- EU
Annotators
URL
-
-
zenodo.org zenodo.org
-
TRSP Desirable Characteristics It is recognised that a repository may offer different levels of curation to different digital objects.
-
TRSP Desirable Characteristics
Support to users during or after submission: does the repository have a contact point (e.g. helpdesk email or contact form) to assist data depositors and data users?
-
TRSP Desirable Characteristics
Deposit and access agreements or licenses.
-
TRSP Desirable Characteristics The repository is managed on well-supported operating systems and other core infrastructural software and hardware appropriate to the services it provides to its Designated Community.
-
TRSP Desirable Characteristics The repository applies documented processes to ensure data and metadata storage and integrity.
-
TRSP Desirable Characteristics Digital object management takes place according to defined workflows from deposit to access
-
TRSP Desirable Characteristics
The repository must obtain all necessary rights from the depositor, and demonstrate that there are sufficient controls in place to ensure they are applied and monitored.
-
TRSP Desirable Characteristics
The repository addresses technical quality and standards compliance, and ensures that sufficient information is available for end users to make quality-related evaluations.
-
TRSP Desirable Characteristics The repository assumes responsibility for long-term preservation and manages this function in a planned and documented way.
-
TRSP Desirable Characteristics
The repository accepts data and metadata based on defined criteria to ensure relevance and understandability for users.
-
TRSP Requirements and Desirable Characteristics
-
TRSP Desirable Characteristics
The repository approach to changing and versioning data and metadata. How the approach and records of changes are communicated to data depositors and users.
-
TRSP Desirable Characteristics
The repository has adequate funding and sufficient numbers of staff managed through ac lear system of governance to effectively carry out the mission.
-
TRSP Desirable Characteristics
The repository guarantees the authenticity of the digital objects and provides provenance information.
-
TRSP Desirable Characteristics
The provenance information and audit trails recorded for data and metadata processing and versioning.
-
TRSP Desirable Characteristics
The repository enables reuse of the digital objects over time, ensuring that appropriate information is available to support understanding and use.
-
TRSP Desirable Characteristics
The repository enables users to discover the digital objects and refer to them in a persistent way through proper citation.
-
TRSP Desirable Characteristics
Identification checks for depositors.
-
TRSP Desireable Characteristics
... repository must identify the skills necessary to deliver the services it offers, and source and maintain those skills either as internal resources or through external engagement.
Tags
- TRSP-CTS-Disaster Recovery
- TRSP
- TRSP-CTS-Expertise
- TRSP-CTS-Preservation
- TSRP-CTS Reuse
- TSRP-CTS Identity Verification
- TRSP-CTS-Rights
- Licence
- TRSP-CTS-Versioning
- CTS
- User Support
- TRSP-CTS-Quality Assurance
- TSRP-CTS Authenticity
- TRSP-CTS-Workflow
- TSRP-CTS Provenance
- TRSP-CTS-Discovery
- DRS-CTM
- TRSP-CTS-Deposit
- Guidance
- Access Conditions
- TRSP-CTS Appraisal
- TRSP-CTS-
- TRSP-CTS-Adequate Resources
- Level of Curation
- TRSP-CTS-Technical
- TRSP-CTS-Storage Procedures
- TRSP-CTS-Planning
Annotators
URL
-
-
www.gida-global.org www.gida-global.org
-
TRSP Desirable Characteristics Data governance should take into account the potential future use and future harm based on ethical frameworks grounded in the values and principles of the relevant Indigenous community. Metadata should acknowledge the provenance and purpose and any limitations or obligations in secondary use inclusive of issues of consent.
-
TRSP Desirable Characteristics Ethical processes address imbalances in power, resources, and how these affect the expression of Indigenous rights and human rights. Ethical processes must include representation from relevant Indigenous communities
-
TRSP Desirable Characteristics Ethical data are data that do not stigmatise or portray Indigenous Peoples, cultures, or knowledges in terms of deficit. Ethical data are collected and used in ways that align with Indigenous ethical frameworks and with rights affirmed in UNDRIP. Assessing ethical benefits and harms should be done from the perspective of the Indigenous Peoples, nations, or communities to whom the data relate
-
TRSP Desirable Characteristics Resources must be provided to generate data grounded in the languages, worldviews, and lived experiences (including values and principles) of Indigenous Peoples.
-
TRSP Desirable Characteristics Use of Indigenous data invokes a reciprocal responsibility to enhance data literacy within Indigenous communities and to support the development of an Indigenous data workforce and digital infrastructure to enable the creation, collection, management, security, governance, and application of data
-
TRSP Desirable Characteristics Indigenous data use is unviable unless linked to relationships built on respect, reciprocity, trust, and mutual understanding, as defined by the Indigenous Peoples to whom those data relate. Those working with Indigenous data are responsible for ensuring that the creation, interpretation, and use of those data uphold, or are respectful of, the dignity of Indigenous nations and communities.
-
TRSP Desirable Characteristics Indigenous Peoples have the right to develop cultural governance protocols for Indigenous data and be active leaders in the stewardship of, and access to, Indigenous data especially in the context of Indigenous Knowledge
-
TRSP Desirable Characteristics Indigenous Peoples have the right to data that are relevant to their world views and empower self-determination and effective self-governance. Indigenous data must be made available and accessible to Indigenous nations and communities in order to support Indigenous governance.
-
TRSP Desirable Characteristics Indigenous Peoples have rights and interests in both Indigenous Knowledge and Indigenous data. Indigenous Peoples have collective and individual rights to free, prior, and informed consent in the collection and use of such data, including the development of data policies and protocols for collection.
-
TRSP Desirable Characteristics Indigenous data are grounded in community values, which extend to society at large. Any value created from Indigenous data should benefit Indigenous communities in an equitable manner and contribute to Indigenous aspirations for wellbeing.
-
TRSP Desirable Characteristics Data enrich the planning, implementation, and evaluation processes that support the service and policy needs of Indigenous communities. Data also enable better engagement between citizens, institutions, and governments to improve decision-making. Ethical use of open data has the capacity to improve transparency and decision-making by providing Indigenous nations and communities with a better understanding of their peoples, territories, and resources. It similarly can provide greater insight into third-party policies and programs affecting Indigenous Peoples.
-
TRSP Desirable Characteristics Governments and institutions must actively support the use and reuse of data by Indigenous nations and communities by facilitating the establishment of the foundations for Indigenous innovation, value generation, and the promotion of local self-determined development processes
-
TRSP Desirable Characteristics Indigenous Peoples’ rights and wellbeing should be the primary concern.
-
TRSP Desirable Characteristics Data ecosystems shall be designed and function in ways that enable Indigenous Peoples to derive benefit from the data.
-
TRSP Desirable Characteristics Indigenous Peoples’ rights and interests in Indigenous data must be recognised and their authority to control such data be empowered. Indigenous data governance enables Indigenous Peoples and governing bodies to determine how Indigenous Peoples, as well as Indigenous lands, territories, resources, knowledges and geographical indicators, are represented and identified within data.
-
TRSP Desirable Characteristics Those working with Indigenous data have a responsibility to share how those data are used to support Indigenous Peoples’ self determination and collective benefit. Accountability requires meaningful and openly available evidence of these efforts and the benefits accruing to Indigenous Peoples.
Tags
- TRSP
- Responsibility
- Postitive Relations
- No Harm
- Inclusive
- Governance
- Justice
- Collective Benefit
- Authority to Control
- FAIR, CARE, TRUST - Adoption, Implementation, and Deployment
- Data for Governance
- Grounded
- Capacity
- Capability
- CARE
- Data Governance
- Ethics
- Governed
- Equitable
- Rights
- Future Use
- Subject
Annotators
URL
-
-
zenodo.org zenodo.org
-
TRSP Desirable Characteristics
Mechanism and process to make and track edits to a dataset after deposition: does the repository enable modification to the submitted dataset (e.g., to correct it or append additional information)? Is there a process to distinguish, link and access all versions of the data?
-
TRSP Desirable Characteristics
Licence or terms of use for reuse of existing data in the repository; these can be the same for every dataset in the repository, or vary from dataset to dataset: what are the conditions under which one can reuse the data?
-
TRSP Desirable Characteristics
Policy that details how the preservation of the data is ensured: does the repository have a page, or document that describes this?
-
TRSP Desirable Characteristics
Deposition of data: are there any restrictions (e.g. by location, country, organisation, etc.) or can anyone from anywhere deposit data?
-
TRSP Desirable Characteristics
Contact info (for the person, depositor, producer or owner, ideally with ORCID; or organisation) of the data: does the repository keep and show this information?
-
TRSP Desirable Characteristics
A mechanism to enable link datasets to related articles or pre-prints: does the repository enable this at submission stage or post-submission?
-
TRSP Desirable Characteristics
Mechanism and process for sharing deposited data via a link anonymously (or otherwise depending on journal policy regarding open/closed review): does the repository allow confidential access to the data for peer-review? Does the repository also have capability for double blind peer review?
-
TRSP Desirable Characteristics
Contact (person or organisation) for the record in FAIRsharing that describes the repository: has the owner or maintainer of the repository claimed the record and vetted its descriptions? Although records in FAIRsharing are curated by its in-house team, the participation of the owner or maintainer of the repository helps verifying the information and tracking the evolution of the resource."
-
TRSP Desirable Characteristics
Plan that gives information about sustainability plans for the repository: does the repository have a webpage, or document that describes these?
-
TRSP Desirable Characteristics
The life cycle status of the repository: is it still being developed or in production and accepting data submissions? The latter does not exclude that some (re)development, enhancement or maintenance may be ongoing, as happens in any mature system.
-
TRSP Desirable Characteristics The higher-level subject areas/disciplines the repository covers, as well as cross-disciplinary domains, such as the types or data, technology and study.
-
TRSP Desirable Characteristics
Globally unique and Persistent IDentifiers (PIDs): does the repository assign them to the deposited data? Which type of identifier schema is used?
-
TRSP Desirable Characteristics
The type of funding (e.g. grants, donations, memberships) and the organisation(s) that fund the repository.
-
TRSP Desirable Characteristics Review and annotation of the data performed by the repository (e.g. via a data submission tool that enforces some curation, or by its curation team): are there a set of minimum curation steps that repository performs on the submitted data? Is there a webpage or document that describes the type of curation done? This criteria is also related to the Data and Metadata Standards criteria.
-
TRSP Desirable Characteristics
The community-defined standards the repository implements to enable the representation of data and/or metadata in a consistent, machine readable form (e.g. via models, formats, schemas, vocabularies, ontologies). These standards facilitate the discovery and interpretation of data and/or metadata.
-
TRSP Desirable Characteristics
Data access mechanisms and terms to define access at repository and/or dataset level: what is the process through which access can be requested (and granted)? For example, is the data is freely available or subject to a request and approval process.
-
10.5281/zenodo.4084762
TRSP Desirable Characteristics
Certification schemes and/or community badges that assess certain aspects of the repository (e.g., its fitness, trustworthiness, adoption): does the repository have any?
Tags
- TRSP
- Services
- Contact
- Deposit
- Pre-publication
- Machine-Readable
- Preservation
- TRL
- Licence
- Author
- Persistence
- Certification
- Domain
- Curation
- Technology Readiness Level
- DRS-CTM
- Status
- Access Conditions
- Financial
- Annotation
- Sustainable Operational Revenue
- Curator
- Accessible
- Versioning
- PID
- Conditions
- Policy
- Contributor
- Related Identifier Support
- Sustainability
- Discipline
- Level of Curation
- Metadata
Annotators
URL
-
-
-
TRSP Desirable Characteristics
Policies that explain the repository’s commitment and processes that ensure the long-term preservation, fitness, and availability of datasets.
-
TRSP Desirable Characteristics
Earning a certification, typically by means of a third party audit or community endorsement process, presents evidence that a repository meets a formal standard and adheres to a set of best, professional practices.
-
TRSP Desirable Characteristics
The terms of reuse of datasets that are provided by a repository
-
TRSP Desirable Characteristics
Policies for who can view and access a dataset and under what conditions.
-
TRSP Desirable Characteristics
Policies that explain what datasets the repository will accept for deposit, from whom, and under what conditions, including costs.
-
TRSP Desirable Characteristics
Curatorial services performed by repository functionality or personnel that enhance or otherwise add value to datasets and create purposeful collections of data.
-
TRSP Desirable Characteristics
Format/s of the metadata that describes datasets in a repository
-
TRSP Desirable Characteristics
The repository provides or utilises persistent identifiers.
-
TRSP Desirable Characteristics
Application Programming Interfaces (APIs), service endpoints, and other protocol interfaces that enable machine access to a repository.
-
TRSP Desirable Characteristics
The subject classification of datasets in a repository.
-
TRSP Desirable Characteristics
A general description of the data repository.
-
TRSP Desirable Characteristics
Contact information for the data repository
-
TRSP Desirable Characteristics
The organization responsible for the data repository
-
TRSP Desirable Characteristics
The native language of the user interface of the repository
-
TRSP Desirable Characteristics
The country in which the repository operates
-
TRSP Desirable Characteristics
The uniform resource locator (URL) that serves as the homepage and primary entry point for accessing the repository on the World Wide Web.
-
TRSP Desirable Characteristics The name that the repository provides and what users commonly call the repository.
-
- Sep 2024
-
ipres2024.pubpub.org ipres2024.pubpub.org
-
As oil becomes more expensive to extract, less and less energy will be available.
This is simply not true. Less energy will be available from fossil sources, but in many countries renewable energy is price competitive already and likely to become more so.
-
-
en.wikipedia.org en.wikipedia.org
-
Efficient - process management includes deliberate process optimization/improvement.
Maturity Level 5
-
Capable - the process is quantitatively managed in accordance with agreed-upon metrics.
Maturity Level 4
-
Defined - the process is defined/confirmed as a standard business process
Maturity Level 3
-
Repeatable - the process is at least documented sufficiently such that repeating the same steps may be attempted.
Maturity Level 2
-
Initial (chaotic, ad hoc, individual heroics) - the starting point for use of a new or undocumented repeat process.
Maturity Level 1
-
-
fairsharing.org fairsharing.org
-
ISO 24619:2011 specifies requirements for the persistent identifier (PID) framework and for using PIDs for referencing and citing documents, files and language resources (e.g digital dictionaries, text corpora, linguistic annotated corpora). A PID is an electronic identification referring to or citing electronic documents, files, resources, resource collections such as books, articles, papers, images etc. ISO 24619:2011 also addresses issues of persistence and granularity of references to resources, first by requiring that persistent references be implemented by using a PID framework and further by imposing requirements on any PID frameworks used for this purpose.
Tags
Annotators
URL
-
-
status.ror.org status.ror.org
-
All Systems Operational
Example of availability
-
- Aug 2024
-
archiveshub.jisc.ac.uk archiveshub.jisc.ac.uk
-
The Anatomy of References
ARCHON Schema definition
-
We cannot guarantee persistence of identifiers, because we do not generate and maintain the data
ARCHON is not guaranteed to be persistent
-
- Jul 2024
-
www.wikidata.org www.wikidata.org
-
DataCite
-
- Jun 2024
-
zenodo.org zenodo.org
-
TRSP Desirable Characteristics
-
TRSP Desirable Characteristics
-
TRSP Desirable Characteristics
-
TRSP Desirable Characteristics
-
TRSP Desirable Characteristics
-
TRSP Desirable Characteristics
-
TRSP Desirable Characteristics
-
TRSP Desirable Characteristics
-
-
www.nature.com www.nature.com
-
Managing the intellectual property rights of data producers
TRSP Desirable Characteristics
-
and the security of the system and its content
TRSP Desirable Characteristics
-
TRSP Desirable Characteristics
-
Ensuring sustainability of a TRUSTworthy repository is necessary to ensure uninterrupted access to its valuable data holdings for current and future user communities. Continued access to data is dependent upon the ability of the repository to provide services over time, and to respond with new or improved services to meet evolving user community requirements.
TRSP Desirable Characteristics
-
Planning sufficiently for risk mitigation, business continuity, disaster recovery, and succession.
TRSP Desirable Characteristics
-
Continued access to data is dependent upon the ability of the repository to provide services over time, and to respond with new or improved services to meet evolving user community requirements
TRSP Desirable Characteristics
-
Managing the intellectual property rights of data producers, the protection of sensitive information resources, and the security of the system and its content.
TRSP Desirable Characteristics
-
repositories should ensure that, at a minimum, the mission statement and scope of the repository are clearly stated
TRSP Desirable Characteristics
-
- May 2024
-
www.inchi-trust.org www.inchi-trust.org
-
free open-source software to load and write chemical structures
PID Standard Software
-
-
pbs.twimg.com pbs.twimg.com
-
PID Start
Tags
Annotators
URL
-
-
drive.google.com drive.google.com
-
PID Start Date
-
-
en.wikipedia.org en.wikipedia.org
-
It was created in 2001
PID Started
Tags
Annotators
URL
-
-
www.wikidata.org www.wikidata.org
-
4,463,574
PID Count
Tags
Annotators
URL
-
-
www.wikidata.org www.wikidata.orgarXiv1
-
2,400,000
PID Count
-
-
www.islrn.org www.islrn.orgISLRN1
-
The International Standard Language Resource Number (ISLRN) is a new, unique and universal identification schema for Language Resources which provides Language Resources with unique names using a standardized nomenclature. It also ensures Language Resources are identified, and consequently recognized with proper references in activities within Human Language Technologies, as well as in documents and scientific papers.
-
- Feb 2024
-
boulderlamp.com boulderlamp.com
-
Green spectrum also penetrates deep into plants driving photosynthesis where other spectrum cannot.
Call b.s. when you see it ... plants are green because they reflect most of the green frequency range - opposite of penetration.
-
- Jan 2024
-
info.orcid.org info.orcid.org
-
The ORCID Researcher Advisory Council (ORAC) is a diverse group of researchers who provide valuable perspectives and advice to ORCID staff and the ORCID Board to ensure that ORCID provides value and utility to researchers and facilitates research and innovation.
PID - Governance
-
-
www.dona.net www.dona.net
-
Mr. Wittenburg started by thanking the Executive Director who had provided useful inputs relating to EOSCdiscussions in Western Europe. In Mr. Wittenburg's view the DO Architecture technology was now facingtwo main challenges: (i) building such infrastructure was taking more time than it had initially beenexpected; and (ii) building such infrastructure was still a highly dynamic issue as there were many differentsuggestions and positions among the players. For Mr. Wittenburg, a DONA strength was that it was an islandof stability in this dynamic environment
Interesting.
-
- Nov 2023
-
zenodo.org zenodo.org
-
Sustainable resource discovery
Tags
Annotators
URL
-
-
help.figshare.com help.figshare.com
-
Figshare supports version control of all publicly available data. Any privately stored data can also be altered or deleted as you wish.
PID Versioning
-
-
-
The artist looked at the producerThe producer sat backHe said, "What we have got here, is a perfect trackBut we don't have a vocalAnd we don't have a songIf we could get these things accomplishedNothing else could go wrong"So he balanced the ashtrayAs he picked up the phoneAnd said, "Send me a songwriterWho's drifted far from homeAnd make sure that he's hungryMake sure he's aloneSend me a cheeseburgerAnd a new Rolling Stone"
Presentation quotes
-
- Oct 2023
-
igsn.github.io igsn.github.io
-
In a journal article or manuscript a sample identified by IGSN SSH000SUA may look like this (tagged IGSN): IGSN:SSH000SUA
Manuscript tagging
-
Unlike many other persistent identifiers, an IGSN is used not only used by machines but also needs to be handled by humans.
Human-readable
Tags
Annotators
URL
-
-
www.hyam.net www.hyam.net
-
DOIs have a business model. LSIDs currently do not. Without a business model (read funding) we should stick to something that doesn’t have the implementation/adoption impediment of LSIDs and make the best of it (i.e. just have a usage policy for HTTP URIs).
-
Without some kind of persistence mechanism the only advantage of LSIDs is that they look like they are supposed to be persistent. Unfortunately, because many people are using UUIDs as their object identifiers LSIDs actually look like something you wouldn’t want to look at let alone expose to a user! CoL actually hide them because they look like this: urn:lsid:catalogueoflife.org:taxon:d755ba3e-29c1-102b-9a4a-00304854f820:ac2009
-
The act of minting an LSID indicates that you intend to try to make it permanent or at least never re-use it for another resource.
Tags
Annotators
URL
-
-
riojournal.com riojournal.com
-
The preferred PID scheme In consideration of the foregoing, the strongest option across the studied major dimensions of the available Handle System PID schemes and operational modes is for DiSSCo to use DOIs to identify Digital Specimens. The case for choosing DOI comes out slightly more strongly than choosing ePIC for reasons related to the substantial achievements, operational experience and reputation of DOI/ IDF to date. Operating under another Handle-system prefix than those used by IDF and ePIC is the substantially weakest option because of the difficulties associated with introducing an identifier that is not perceived to be a DOI. The term ‘DOI’ is trademarked by the IDF and thus not available for describing other identifiers. The practical and sensible avenue to explore further are the options to establish and become an RA member of the DOI Foundation (option A5) and to enter a strategic alliance at the level of the DOI Foundation (option A1). These options are likely most effective when actioned in combination.
Preferred PID Scheme
-
When digitized, each resulting ‘Digital Specimen’ must be persistently and unambiguously identified. Subsequent events or transactions associated with the Digital Specimen, such as annotation and/or modification by a scientist must be recorded, stored and also unambiguously identified.
Workflows
-
Persistent identifiers (PID) to identify digital representations of physical specimens in natural science collections (i.e., digital specimens) unambiguously and uniquely on the Internet are one of the mechanisms for digitally transforming collections-based science.
Use case
-
Information quality is the strongest factor to influence organizational benefits through perceived usefulness and user satisfaction
Information quality=precision
-
Digital Specimen
Entity
-
Appropriate identifiers Requirement: PIDs appropriate to the digital object type being persistently identified.
Appropriate
-
Governance
GOvernance
-
Persistence
Persistence
-
Trust
Trust
-
Available Handle-based PID schemes
Topology if Handle-based schemes
-
Scalability
Characteristics
-
Collectively the IDF and its RA members assume the long-term responsibility to maintain and sustain the DOI PID scheme for everyone.
Characteristics
-
-
content.iospress.com content.iospress.com
-
Which identifiers are FAIR enough?
Comparing FAIRness of PIDs
-
digital or physical objects, or concepts
-
-
-
Compact identifiers are a longstanding informal convention in bioinformatics. To be used as globally unique, persistent, web-resolvable identifiers, they require a commonly agreed namespace registry with maintenance rules and clear governance; a set of redirection rules for converting namespace prefixes, provider codes and local identifiers to resolution URLs; and deployed production-quality resolvers with long-term sustainability.
Characteristics
Tags
Annotators
URL
-
-
-
Wittenburg, P., Hellström, M., Zwölf, C.-M., Abroshan, H., Asmi, A., Di Bernardo, G., Couvreur, D., Gaizer, T., Holub, P., Hooft, R., Häggström, I., Kohler, M., Koureas, D., Kuchinke, W., Milanesi, L., Padfield, J., Rosato, A., Staiger, C., van Uytvanck, D., & Weigel, T. (2017). Persistent identifiers: Consolidated assertions. Status of November, 2017. Zenodo. https://doi.org/10.5281/zenodo.1116189
Characteristics
Tags
Annotators
URL
-
-
webdoc.sub.gwdg.de webdoc.sub.gwdg.de
-
Breaking of links is mostly due to administrative changes at the referencedInternet node
Benefits
-
Over time the risk grows that the document is no longer accessible at the loca-tion given as reference. Web servers that follow the HTTP protocol then givethe notorious reply: ‘404 not found’. This resembles the situation of a book in a– very large – library that is not on the shelf at the position indicated in the cata-logue. How is it to be found?
PID Issues
-
In the mid 1990s, a number of schemes were developed that, rather than rely-ing on the precise address of a document, introduced the idea of name spaces forrecording the names and locations of documents.
History
-
-
www.crossref.org www.crossref.org
-
Archives. The Member shall use best efforts to contract with a third-party archive or other content host (an "Archive") (a list of which can be found here) for such Archive to preserve the Member's Content and, in the event that the Member ceases to host the Member's Content, to make such Content available for persistent linking.
Characteristics
-
Maintaining and Updating Metadata.
Characteristics
Tags
Annotators
URL
-
-
datacite.org datacite.org
-
withdrawing content withoutposting a notification (“Tombstone Page”) andupdating the record's URL/metadata with DataCite
Characteristics
-
Commitment to data persistence
Characteristics
-
Maintaining and Updating Metadata.
Characteristics
-
-
www.pidconsortium.net www.pidconsortium.net
-
ePIC PID service, and an organisation, that provides a PID service and is not an ePIC member, can ask for a certification, that it provides its PID service along the lines of the ePIC rules and policies.
Characteristics
-
ePIC has rules and policies, how to provide PID services and how to ensure the reliability, that is necessary for the persistency of access to digital objects via ePIC PIDs.
Characteristics
-
Registered production PIDs will not be deleted. They are used as a kind of tombstone even if the underlying data is not available anymore.
Characteristics
-
Persistent identifiers (PIDs) are an abstraction layer that arbitrates between the reference of a digital object and its location.
Definition
-
-
datascience.codata.org datascience.codata.org
-
Scalability – in the data/metadata stored;
Characteristics
-
-
en.wikipedia.org en.wikipedia.org
-
ORCID was first announced in 2009
Persistence Maximum
Tags
Annotators
URL
-
-
en.wikipedia.org en.wikipedia.org
-
The developer and administrator of the DOI system is the International DOI Foundation (IDF), which introduced it in 2000
Persistence Maximum
-
-
www.ietf.org www.ietf.org
-
5.2. Key ? was(DESCRIPTION) when(DATE) resync This "metadata" command form provides nothing more than a way to carry a Key along with its description. The form is a "no-op" (except when "resync" is present) in the sense that the Key is treated as an adorned URL (as if no THUMP request were present). This form is designed as a passive data structrue that pairs a hyperlink with its metadata so that a formatted description might be surfaced by a client-side trigger event such as a "mouse-over". It is passive in the sense that selecting ("clicking on") the URL should result in ordinary access via the Key-as-pure-link as if no THUMP request were present. The form is effectively a metadata cache, and the DATE of last extraction tells how fresh it is. The "was" pseudo-command takes multiple arguments separated by "|", the first argument identifying the kind of DESCRIPTION that follows, e.g,
ARK Kernel Metadata Query
-
-
www.ietf.org www.ietf.org
-
To resolve a Compact ARK (ie, an ARK beginning "ark:") it must initially be promoted to a Mapping ARK so that it becomes actionable. On the web, this means finding a suitable web Resolver Service to prepend to the compact form of the identifier in order to convert it to a URL (cf [CURIE]). (This is more or less true for any type of identifier not already in URL form.)
Characteristics
-
-
www.pidconsortium.net www.pidconsortium.net
-
CLARIN: European Research Infrastructure for Language Resources and Technology CNIC: Computer Network Information Center, Chinese Academy of Sciences, China CSC: IT Center for Science CSCS: Swiss National Supercomputing Centre DKRZ: Deutsches Klimarechenzentrum GRNET: Greek Research and Technology Network GWDG: Gesellschaft für wissenschaftliche Datenverarbeitung Göttingen SND: Swedish National Data Center SURF: SURF is the collaborative ICT organization for higher education and research in the Netherlands
ePIC Members
-
-
support.datacite.org support.datacite.org
-
When content underlying a DOI is updated, we recommend updating the DOI metadata and, for major changes, assigning a new DOI. For minor content changes, the same DOI may be used with updated metadata. A new DOI is not required. For major content changes, we recommend assigning a new DOI and linking the new DOI to the previous DOI with related identifiers.
Characteristics
-
To enable easy usability for both humans and machines, a DOI should resolve to a landing page that contains information about the DOI being resolved. It is the responsibility of the entity creating the DOI to provide such a landing page. The following are best practices for creating well-formed DOI landing pages.
Characteristics
-
there may be infrequent cases where it is not desirable for the item described by a DOI to be available publicly, such as in the case of research retraction. In these cases, it is best practice to still provide a "tombstone page", which is a special type of landing page describing the item that has been removed.
Characteristics
-
-
datascience.codata.org datascience.codata.org
-
certification: If certified, acronym for certification organization or standard (e.g., TRAC, TDR, DSA) and year of certification.
Certification This potentially includes many of the features of PIDs already listed
-
succession: The plan for dealing with sudden loss of provider viability, including set-aside funding and length of time that operations would be able to support continued operation while a successor provider is found to keep references intact.
Succession
-
mission: One sentence mission statement of the organization.
Mission
-
business model: For profit (FP) or not for profit (NP).
Characteristics
-
name: Full name of the provider organization. identifier: Unique identifier for the organization.
Provider Identity
-
check character (CC): A check character is incorporated in the assigned identifier to guard against common transcription errors.
Mitigation
-
inflection: a change to the ending of an object’s id string in order to obtain a reference to content related to the originally referenced content.
Expectations A form of content negotiation
-
landing: content intended mostly for human consumption, such as an object description and links to primary information (e.g., an image file or a spreadsheet), to alternate versions and formats, and to related information; from “landing page”, this is intended to support a browsing experience of an abstract overall view of the object.
Expectations
-
plunging: content intended as primary object information, often required or directly usable by software; from “below the landing page”, this is intended to support an immersive object experience that bypasses any browsing step.
Expectations
-
introversioned: a kind of intraversioned content for which the version identifier (within the object identifier) is opaque, e.g., “http://doi.org/10.2345/678”, which happens to be version 4.
Expectations
-
intraversioned: a version identifier is part of the id string, e.g., “http://doi.org/10.2345/67.V4”.
Expectations
-
extraversioned: a version identifier is separate from the id string, so that the actionable id does not lead to specific version without human intervention, e.g., “http://doi.org/10.2345/67”, Version 4.
Expectations
-
Precisely when such assignment will be triggered depends on policy that will differ across objects, collections, and providers.
Trigger
-
waxing: change that is limited to appending content in a way that does not in itself disrupt or displace previously recorded content. Examples of waxing objects include live sensor-based data feeds, citation databases, and serial publications.
Expectations Dynamic Citation
-
subinfinite: due to succession arrangements, the object is expected to be available beyond the provider organization’s lifetime.
Expectations
-
lifetime: the object is expected to be available as long as the provider exists.
Expectations
-
-
ieeexplore.ieee.org ieeexplore.ieee.org
-
We propose a novel approach relying on persistent, timestamped, adapted queries to versioned and timestamped data sources
Expectations
-
-
dans.knaw.nl dans.knaw.nl
-
Datafiles can be published with a suitable embargo period, forinstance to allow completion of publications or research basedon the dataset, or to respect contracts made by the depositorwith third parties concerning intellectual property rights.DANS encourages embargo periods of 6 months or less.
Expectations
-
If a published dataset is improved by amendments to thedata files of the dataset, a major version increment iscreated with a record of changes. In cases where it isnecessary to disable access to earlier versions, these can bedeaccessioned
Expectations
-