pidapalooza



Join the Party!
Martin Fisch - The audience is shaking

Main Stage: Plenary Sessions



carly

Carly Strasser

Reaching Nirvana: The Future of Identifiers

Session Info

clifford

Clifford Tatum

Towards governance of PID portability for research evaluation

Session Info

herbert

Herbert Van de Sompel

Signposting pattern for PIDs

Session Info

jonathan

Jonathan Clark

PIDvasive - What’s possible when everything has a Persistent Identifier?

Session Info

simon

Simon Porter

Exploring the relationship between Persistent Identifiers and Research Information citizenship

Session Info

PIDapalooza 2016 Lineup

PID Stories: Which strategies worked? Which strategies failed? Tell us your horror stories! Share your victories! Emerging PIDs: Long-term identifiers are no longer just for digital objects. What additional use cases are you working on? PIDagogy: How do you teach, learn, persuade, discuss, and improve adoption? What's it mean to build a pedagogy for PIDs? PID Sci: They live in the lab. They live in the field. Let's talk about practical applications of PIDs in the real-life research world. I-Word: What would make PID systems "interoperate" optimally? Maybe just standardized link/relation types? Persistence: Is open infrastructure the key to achieving persistence? What are the frontiers of ‘persistence’? Org-ID: There are tons of different approaches to organizational identifers. So they get their own track.



Tuesday, 8th November

Start End Session
18:00 20:30 Rehersal: meet-up and chill-out with fellow festival goers at Bryggjan Brugghus Brewpub (cash bar).



Wednesday, 9th November

Start End Session Stage 1: Helka I Stage 2: Helka II Stage 3: Esja II
09:00 09:30 Coffee and registration - Box Office opens at 9:00 - Conference Lobby/Katla II
09:30 10:00 Opening Act Lighting the eternal (persistent) flame - Main Stage: Katla II
10:00 10:30 Richard Wynne: Tales from PID Adoption Paul Jessop: Party Identification Daniel Mietchen: Making ethics data FAIR
10:30 11:00 Christopher Brown: Organisation IDs for UK research Christophe Blanchi: Multi-Stakeholder Global Handle Registry - Enabling Digital Sovereignty Martin Fenner: PIDs for Projects
11:00 11:30 Posters Session - Break - Snack Bar opens at 11:00 - Conference Lobby/Katla II
11:30 12:00 Ramona Walls: Prefix Commons Cory Craig: ORCID & Hyper-authored Articles Arthur Smith: Wikidata and Persistent Identifiers
12:00 12:30 Trisha Adamus: ORCID Promotion in a University Setting Markus Stocker: Persistent Identification of Instruments Daniel Goroff: Identifying Legal Entities
12:30 13:15 Lunch - Concession Stand opens at 12:30 - Súlnasalur restaurant
13:15 14:00 Main Stage Plenary Jonathan Clark: PIDvasive - What’s possible when everything has a persistent identifier? - Main Stage: Katla II
14:00 14:30 Ginny Hendricks, Alice Meadows, Laura Rueda: Persistent Identifiers 101 – Developing a PIDriculum (full hour) Cynthia L. Chandler, Adam Shepherd, Robert Arko: Persistent Identifiers and Linked Ocean Data (full hour) Susan White-DePace, Michael Hofmockel: Two part discussion on Society for Science at User Research Facilities (SSURF) and Driving Data Sharing among Large R&D Organizations (full hour)
14:30 15:00
15:00 15:30 Posters Session - Break - Snack Bar opens at 15:00 - Conference Lobby/Katla II
15:30 16:00 Joe Wass: What happens when PIDs aren't there: Stories from Crossref Event Data Max Ogden: Public key cryptography Tim Clark, Nick Juty: Assign locally, resolve globally: prefix-based collection access
16:00 16:30 Maaike Duine: PID use and the human perspective Manfred Kohler: Frequency and Variability Andres Mori: Global Research Identifier Database (GRID)
16:30 17:15 Main Stage Plenary Simon Porter: Exploring the relationship between persistent identifiers and research information citizenship - Main Stage: Katla II
17:15 17:30 Encore - Closing Remarks - ain't no party like a persistent party, cuz a persistent party don't stop
17:30 19:00 Afterparty in the hotel lobby. Please don't break anything.



Thursday, 10th November

Start End Session Stage 1: Helka I Stage 2: Helka II Stage 3: Esja II
09:00 09:30 Coffee and registration - Box Office opens at 9:00 - Conference Lobby/Katla II
09:30 09:45 Opening Act Welcome Back
09:45 10:30 Main Stage Plenary Clifford Tatum: Towards governance of PID portability for research evaluation - Main Stage: Katla II
10:30 11:00 Ulrich Schwardmann: PID Information Types will leverage Interoperability between PID systems John Kunze: What do we mean by persistence? Salvatore Mele: Challenges of Measuring PID Adoption
11:00 11:30 Mieneke van der Salm: Digital Author Identifier Stephanie Simms: PIDs in DMPs: spinning tracks with syntax Tom Gillespie: Scientific Protocols
11:30 12:00 Ted Lawless: Publications and ORCIDs Xinjian Guo: Yale Persistent Linking Service Using Handles Arthur Smith: Organisation Identifiers - APS experience
12:00 12:45 Lunch - Concession Stand opens at 12:00 - Súlnasalur restaurant
12:45 13:30 Plenary - Main Stage Herbert Van de Sompel: Signposting for Persistent Identifiers - Main Stage: Katla II
13:30 14:00 Crossref, DataCite, ORCID and ORG IDS (full hour) Kerstin Lehnert: Let's get physical: PIDs for Samples & Collections Paul Vierkant: Identifiers on the rise in Germany
14:00 14:30 Najko Jahn: Using DOIs to assess institutional support for OA fees Gustavo Durand: PIDs throughout the Dataverse
14:30 15:00 Posters Session - Break - Snack Bar opens at 14:30 - Conference Lobby/Katla II
15:00 15:45 Plenary - Main Stage Carly Strasser: Reaching Nirvana. The Future of Identifiers - Main Stage: Katla II
15:45 16:00 Mike Drop - Closing Remarks - you don't have to go home, but you can't stay here

Festival Lineup



as

Adam Shepherd

am

Alice Meadows

am

Andres Mori

as

Arthur Smith

cb

Christophe Blanchi

cb

Christopher Brown

cc

Cory Craig

cc

Cynthia L. Chandler

dg

Daniel Goroff

dm

Daniel Mietchen

jw

Joe Wass

jk

John Kunze

kl

Kerstin Lehnert

lh

Lambert Heller

lr

Laura Rueda

md

Maaike Duine

mh

Michael Hofmockel

mvds

Mieneke van der Salm

mk

Manfred Kohler

ms

Markus Stocker

mf

Martin Fenner

mo

Max Ogden

nj

Najko Jahn

nj

Nick Juty

pj

Paul Jessop

pv

Paul Vierkant

rw

Ramona Walls

rw

Richard Wynne

ra

Robert Arko

sm

Salvatore Mele

ss

Stephanie Simms

Susan White-DePace

tl

Ted Lawless

Tim Clark

tg

Tom Gillespie

ta

Trisha Adamus

us

Ulrich Schwardmann

xg

Xinjian Guo





Main Stage: Plenary Session Details



PIDvasive - what’s possible when everything has a persistent identifier?

Jonathan Clark - International DOI Foundation

Day 1 Plenary Session: Knowing what’s what and who’s is an old problem. Thankfully we have made huge progress and many communities have now understood and agreed persistent identifier standards. But what would a future look like if everything has an identifier? How will I know that I can trust the information? How can I know how persistent an identifier really is? And what about the object it describes, how persistent is that? What if that object is physical and not digital? How will my machine know all this? What if digital objects can roam freely, and can self-describe and interact automagically?

This session on the future will attempt to predict the unpredictable and imagine the unimaginable. And maybe, just maybe we might discover that this future is closer than we might think.


Exploring the relationship between persistent identifiers and research information citizenship

Simon Porter - Digital Science

Day 1 Plenary Session: To attach a persistent identifier to a representation of a person, Organisation or research object is to provide that representation a way to travel across research systems and Organizational boundaries. The background infrastructure required for this to happen is as much social as it is technical. This presentation will attempt to unpack the social element of this infrastructure through the concept of research information citizenship. As research information citizens, what are our obligations to the research data that we consume, create and communicate? Particular focus will be on unpacking the use of ORCID and GRID identifiers in terms of trust, roles and responsibilities within the research community.


Towards governance of PID portability for research evaluation

Clifford Tatum - CWTS, Leiden University

Day 2 Plenary Session: Increased adoption of PIDs in the academic enterprise coincides with increasing demand for Research Information (RI) related to research evaluation. Collection and analysis of RI often occurs at the level of institution (e.g. with the use of CRIS systems), where public information about research output is linked to private information related to financial and human resources. This creates an interoperability dilemma, whereby increased expectations for open science are in conflict with privacy, security, and commercial concerns for certain kinds of data, such as personal details about staff, data gathered from human subjects, and licensed content. In this presentation I propose a shift in focus from interoperability of research infrastructure to portability of PIDs as a way out of this dilemma.


Signposting pattern for PIDs

Herbert Van de Sompel - Los Alamos National Laboratory

Day 2 Plenary Session: PIDs need to be used to achieve their intended persistence. Our research found that a disturbing percentage of references to papers that have DOIs actually use the landing page HTTP URI instead of the DOI HTTP URI. It can safely be assumed that the same problem exists for other types of PIDs. The net result is that the true potential of PIDs is not realized. In order to ameliorate this problem we propose to convey the PID in the HTTP Link response header of all resources identified by the PID, including the landing page and content resources such as "the PDF" and "the dataset". This allows tools, such as citation managers, to auto-discover and use the PID. Similarly, the HTTP Link header can be used to allow tools to auto-discover, from the landing page, which resources are part of a PID-identified object. These and other uses of the HTTP Link header to achieve a coarse yet meaningful level of interoperability in the scholarly communication system are promoted by the “Signposting the Scholarly Web” effort, http://signposting.org.


Reaching Nirvana: The Future of Identifiers

Carly Strasser - Gordon and Betty Moore Foundation

Day 2 Plenary Session: After crowd surfing through two days of acts, I will present PIDapalooza's "Greatest Hits" of takeaways, lessons learned, points for discussion, and new directions. Put your hands up for plenty of music puns woven into aspirational ideas for the future of identifiers.




PID Stories - Session Details




SSURF

Susan White-DePace, Michael Hofmockel

Two part discussion on Society for Science at User Research Facilities (SSURF) and Driving Data Sharing among Large R&D Organizations (full hour)

Part I Susan White-DePace: The Society for Science at User Research Facilities (SSURF) is a non-profit organization representing scientific user facilities across the United States and abroad. SSURF seeks to help steward the resolution of our technical challenges, security concerns, conflicting policies and identity management challenges that currently blocks us from working together. The resolution of these blockers will clear the way for these research facilities to do more by sharing in the creation of the many common wheels they all need. A comprehensive robust open Identity management system is the root of the matter.

Part II Michael Hofmockel: Driving Data Sharing among Large R&D Organizations


From DAI to ORCID: Implementation and beyond in Leiden

Mieneke van der Salm

In the Netherlands, there is a relatively long history of working with author identifiers. Since 2008, virtually all researchers affiliated with a Dutch university have been assigned a so-called DAI, a Digital Author Identifier. As the DAI system was custom-built for a Dutch situation, many universities in the Netherlands are currently in the process of implementing the more international ORCID system. With the aid of SURF, five Dutch universities have collectively become members of the ORCID organisation through a national consortium.

At Leiden University, one of the institutions in this consortium, the implementation of the ORCID system is a central activity within a much broader programme which will ultimately lead to the ability to present the scholarly output of all of our researchers in a broader context. When publications, data sets, funding programmes and researchers can all be identified unambiguously via PIDs, it becomes possible to visualise the many connections that exist between these objects and these people, and to provide a more complete and more coherent overview of the results of research projects.


Identifiers on the rise in Germany

Paul Vierkant

Identifiers on the rise in Germany shall treat the current development of identifiers in Germany's research landscape. Introducing ORCID in Germany's Universities and research organizations finds a lot of interest and a quick uptake. The talks illustrates also the challenges for personal identifiers taking especially German history into account. Within the talk I will present the latest results from a study on the usage and spread of ORCID in academic institutions in Germany in the course of the ORCID DE project. The presentation will also touch upon the discussion about of the recently presented ""Kerndatensatz Forschung"" (Research Core Dataset) recommended by the German Council of Science and Humanities aiming to gather coherent information about research activities also using authority files such as ORCID, DOI and organization identifiers (http://www.forschungsinfo.de/kerndatensatz/en/index.php?home).


Tales from the PID Adoption Frontier

Richard Wynne

One of the services performed by Scholarly journal publishers is to transform unstructured author manuscripts (usually prepared in Word format) into structured content with valid PIDs. How does this happen? The Editorial Manager submission system contains more than a million author records with validated ORCIDs and supports emerging standards such as CRediT and Open Funder Registry. This presentation tells the story of real world technical and operational challenges associated with collecting and using PIDs. We’ll explore the good, the bad and the ugly and talk about new frontiers.


Making ethics data FAIR

Daniel Mietchen

So far, most of the ecosystem of ethical review of past, present and future research is basically hidden. This secrecy is a barrier to communicating ethical aspects of research, establishing best practices and standards, and educating researchers and the public about this topic. Not surprisingly, it has thus been suggested to "[m]ake all documentation around ethical approval and consent freely available" (https://doi.org/10.1136/bmj.i4626), which is where persistent (and preferably unique) identifers come in handy.

Assigning PIDs to important components of the ethical review process (e.g. IRB or other ethics committee, review requests, reviews, IRB decisions) and making that information accessible to humans and machines could pave the way for ethical aspects of research to be included more prominently in research communication. For research data, the FAIR principles (http://doi.org/10.1038/sdata.2016.18) have been established to ensure that data are Findable, Accessible, Interoperable and Reusable.


Digital Author Identifier

Peter Verhaar

In the Netherlands, there is a relatively long history of working with author identifiers. Since 2005, virtually all researchers affiliated with a Dutch university have been assigned a so-called DAI, a Digital Author Identifier. As the DAI system was custom-built for a Dutch situation, many universities in the Netherlands are currently in the process of implementing the more international ORCID system. With the aid of SURF, five Dutch universities have collectively become members of the ORCID organisation through a national consortium. At Leiden University, one of the institutions in this consortium, the implementation of the ORCID system is a central activity within a much broader programme which will ultimately lead to the ability to present the scholarly output of all of our researchers in a broader context. When publications, data sets, funding programmes and researchers can all be identified unambiguously via PIDs, it becomes possible to visualise the many connections that exist between these entities and to provide a more complete and more coherent overview of the results of research projects.


What happens when PIDs aren't there: Stories from Crossref Event Data

Joe Wass

Crossref Event Data (CED) tracks the use of scholarly publications across the web. We use PIDs to reference publications. In some places, like Wikipedia, PIDs are a commonplace sight. But in places like Twitter and Facebook, they are few and far between. CED looks for article landing pages and tries to convert them back into PIDs. How do we track publications on platforms that don't encourage the use of PIDs? Hear examples, methods, success and horror stories as well as our plans for CED.


Marine ecosystem science and identifiers

Cynthia Chandler

Marine ecosystem science requires access to a wide variety of data. Research topics include investigation of complex food webs, sustainable fisheries science, coastal and marine ecosystem studies that contribute to better informed management decisions, and climate change vulnerability and potential mitigation. Such research requires access to data from the natural, social and economic sciences and often large distributed teams of investigators with expertise in a broad range of disciplines.

As data are made more freely available and discoverable on the Web, it is likely that data are being used and re-used far from their point of origin. That distance defined in time, space and research domain increases the need for robust metadata (e.g. structured documentation) to provide proper context for the data. A group of complementary marine research data repositories in the USA have been integrating use of Persistent Identifiers (PIDs) into their metadata catalogs. Use of Open Researcher and Contributor ID (ORCiD) and Digital Object Identifiers (DOIs) have shown great potential to disambiguate persons and research objects respectively. In addition Open Funder Registry (FundRef) codes are used to identify funding sources, and International Geo Sample Numbers (IGSNs) help to track the provenance of physical samples. The PIDs are integrated into the metadata records of the repository catalogs, published out as Linked Open Data and enhanced with semantic markup to provide additional, critical context for improved discovery of resources of interest. In this session we propose to review progress made thus far, reflecting on what has worked or failed, and thoughts on logical next steps. Use of PIDs has resulted in the ability to validate metadata records by comparing the contents with trusted, authoritative sources. The PIDs also serve as reliable facts in Linked Open Data resources that can be connected with related research results, e.g. published data sets with literature publications.




Emerging PIDs - Session Details



Party Identification

Paul Jessop

Party identification seems to be an idea whose time has come. ORCID is the currency of scholarly publishing. ISNI is gaining traction in the entertainment industry and in memory institutions. But there is a myriad of proprietary schemes surrounding these industry standards. This session will explore new use cases for party identification (such as in the visual arts), the need for interoperability between standard schemes and the role of proprietary systems (and the ability of standard identifiers to bridge between them).


PID for Projects

Martin Fenner - DataCite

Projects are a very common of scientific collaboration, but we currently have no established way to describe them with persistent identifiers. In this presentation we will propose ways to use existing PID systems to better describe projects, including the description of all project members and the collection of all project outputs.


PIDs in DMPs: spinning tracks with syntax

Stephanie Simms

Data management plans (DMPs) are now an integral part of good research practice, yet most of their potential remains locked up in static text files. By repositioning DMPs as dynamic, machine-actionable inventories of digital research outputs, people, organizations, funders, etc., they could provide greater benefits to all stakeholders. Persistent identifiers are a key ingredient in the transition to machine-actionable DMPs, enabling information to be passed across existing workflows and systems to plan resources, connect outputs, and automate reporting and monitoring.


Public key cryptography

Max Ogden

In the Dat project we want to be able to use identifiers to find duplicate copies of datasets around the world to increase download speeds. By taking the hash of the content of the data, and signing it with the public key of the user who published it, and using these as the basis for our dataset identifiers, we are able to verify identity of authorship as well as integrity of data. In this talk we will discuss the pros and cons of our identifier scheme, how we hope to standardize them, and what centralized vs decentralized infrastructure is required to make a system like this usable to the average user.


PIDs for Scientific Protocols

Tom Gillespie

Research protocols represent the complex 4-dimensional processes that are the heart of experimental science. Protocols are the substrate for critical evaluation of scientific data, yet their current representation in the digital world is in desperate need of modernization. Many different groups have recognized this and various new digital representations for protocols are starting to emerge. This panel will focus on challenges that a system for identifying scientific protocols must address. Some of the issues include the conflation of PID metadata with data that should be in the artifact (related to citation vs accession), and achieving representation agnostic identification. In addition, some of the important questions for discussion include: when do we issue a new protocol identifier? Who inputs metadata? Who can update metadata? And, why not DOIs?




PIDagogy - Session Details



Persistent Identifiers 101 – Developing a PIDriculum

Ginny Hendricks, Alice Meadows, Laura Rueda

In this interactive session we will develop ideas for a set of educational resources on persistent identifiers. These resources could be used by anyone who wants to learn about PIDs themselves or who is responsible for teaching others about them. After a brief introduction, attendees will be divided into groups to brainstorm and plan what should be included in different types of resources (a video, a quiz, an infographic, and more) and who and will develop them. Help us devise the best tools possible for transforming New PIDs on the Block into PIDlates Instructors!


Utilization of ORCIDs

Trisha Adamus

This session will focus on the evolution of the ORCID campaign on the University of Wisconsin-Madison campus and share the lessons learned during a pilot project with the School of Nursing. This ongoing pilot project includes the development and implementation of promotional materials, hands on instructional sessions and pilot assessment. The pilot will be used to inform campaign expansion for the entire School of Nursing and subsequently the University of Wisconsin-Madison.


PID use and the human perspective

Maaike Duine

Our presentation focuses on how to encourage PID use from a human perspective. THOR – an EU project funded under the Horizon 2020 programme – endeavours to overcome the cultural barriers to PID adoption by organising training activities for a wide range of stakeholders. Our ambassador network is critical to achieving this goal. In our talk, a representative from THOR and one of our ambassadors will discuss the benefits of tapping into their disciplinary network to promote the use of PIDs in a way that is tailored to a specific research community. What kind of training and materials work best? How can you ensure people will see the benefits and continue to use PIDs? The session will end with sharing best practices and an interactive discussion on how to integrate PIDagogy in the Research Data Science Curriculum.




PID Sci - Session Details



Physical samples

Kerstin Lehnert

The use of persistent, resolvable identifiers for physical samples and collections ensures unambiguous citation of these samples to a) give credit for their acquisition or production and curation; b) link to digital metadata that describe the sample and where the sample can be accessed (sample repositories); c) link to data generated by the study of the samples; and d) link and integrate data acquired throughout the analytical history of a sample. The International Geo Sample Number is an example for a sample and collection PID with rapidly growing adoption in the Earth Sciences. Other science domains or commercial fields use different PIDs for their samples. This session invites participants, who are interested to learn about the application, architecture, and governance of the IGSN system, and who will engage in a discussion to coordinate and align different PID systems that are in use or considered for samples and sample collections.


Instruments

Markus Stocker

This talk attempts to make a case for persistent identifiers for instruments and instrument deployments. Preserving data about instruments and instrument deployments, such as sensing devices used in environmental monitoring, arguably increases the fitness of observation data for reuse. In fact, granular data about instruments and their deployments support researchers in their decision for whether or not observation data collected in deployments are fit for reuse in scientific investigations. For instance, studies may have more or less stringent requirements on reliability of measurement or may require deployment locations within a certain region. In traditional data publications, data about instruments and their deployments are represented as metadata about collected observation data. This representation can be challenged with an alternative one that represents observation data and instrument deployment data as distinct information objects. The alternative representation could drive interesting and potentially important future developments. First, it underscores the possibility of persistent identification of instrument deployment data - akin to, but distinct from, persistent identification of observation data. Second, the community may decide to specify required and desired sets of attributes about instruments and instrument deployments. Such specification could increase the contextual information available to researchers. Third, new resources may be developed that specialize on preserving instrument and instrument deployment data, and on resolving requests for such data.


Frequency and Variability

Manfred Kohler

In life sciences, a high number of experiments produces a lot of small data sets. This high frequency combined with a huge variability of experiments demand a high workload on documentation. But, as expected, the quality of documentation is low. Thus we propose a more simplified way to create useful descriptions of experimental work by using predefines templates accessible via PIDs. The templates itself can be used to create a graph database showing the hierarchical structure of experimental conditions, useful for finding the correct experimental template as well as for correct analytical combinations of experimental work based on semantic information across different research areas. The intended method could be used for new as well as for existing experiments. The presentation should be used as a starter for a community based discussion.




I-Word - Session Details



Challenges of Measuring PID Adoption

Salvatore Mele

The EC-funded THOR project (http://project-thor.eu) aims to improve the interoperability of PIDs for researchers and data. To assess project progress and prepare for a sustainable future, THOR built an evaluation component into the project. But in order to have a meaningful evaluation, you must have something to measure and you must know how to measure it effectively. In the nascent field of PID interoperability, which information is missing? What do we not know (yet)? And what steps can we take to know the unknowable? This presentation will outline the challenges of measuring PID adoption that were encountered in the process of evaluating the THOR project alongside its continued evolution.


PID Information Types will leverage Interoperability between PID systems

Ulrich Schwardmann

Metadata together with PIDs becomes the most important driver for research data reuse and sharing on one hand and for parametrization of data management workflows on the other hand. Whereus findability is improved by a rich and deeply structured metadata set, parametrization of worklows especially with many data sets needs simpler and much faster accessible metadata for efficiency reasons. Furthermore communities have often their own demands on valuable metadata from the scientific point of view as well as for instance for access regulation on recent generated research data. This often does not fit into predefined schemas and rises the need for community specific policies. All this can be covered by utilizing the database behind the Handle PID system to maintain so called PID information types (PIT), and aggregate them into community specific lists of mandatory and optional properties. The proposed talk will describe, how such PID Information Types can be defined in data type registries together with automatically generated schemas and how PIDs can be enriched with such type instances, and eventually how this can substantially improve the interoperability and richness of metadata between PID systems as well as the trustworthyness in the data policies of research data repositories.


Prefix Commons

Ramona Walls, Chris Mungall, Julie McMurry, Michel Dumontier, Melissa Haendel

1) The confusion with identifiers often starts with the basics, including what the “identifier” even is. We routinely observe what “stuff falls in and out” of identifiers: all kinds of characters get prepended, subtracted, or altered when identifiers are referenced outside--and sometimes even inside--their database of origin. Assignment of identifiers is not coordinated across datasets or repositories, leading to the possibility of clashes on the same identifier, and clashes on the same prefix. When one finds a shortform database identifier “in the wild”, there are no universally accepted methods to:

Resolve it to a meaningful and authoritative web page
Reference it in an external database
Discover references to it in external databases
Cite/mention it in the literature
Discover citations/mentions of it in the literature
Expand it to an http/https URI (Uniform Resource Locators which are critical in web-based data integration efforts such as the smartAPI BD2K interoperability supplement project, a project to link API outputs together.)

65% of the 1,499 data repositories registered in Re3Data, are characterized as having no (declared) persistent http identifier system. The same can be inferred for 47% of the 561 collections in Identifiers.org; however, it should be said that this is precisely why identifiers.org exists--to compensate for the lack of such systems from constituent providers. Because of the lack of documentation and transparency, it is often difficult to determine what the identifier system even is and whether or not it is backed by a commitment to persistence.

PrefixCommons is a cross-cutting framework to aggregate, document, and harmonize identifier prefixes from multiple sources, most notably Identifiers.org, the OBOFoundry, Bioportal, and prefix.cc. Prefix consistency and lack of global collisions are helpful to aid human understanding; however, neither of these features is required in order to make the existing situation better than it is. In PrefixCommons we have begun to develop tools for composing and validating sets of prefixes as used in different contexts. It also provides a framework for iteratively approaching a more unified standard for minting new identifier strategies.


Publications and ORCIDs

Ted Lawless

Over six million publications in the Web of Science are related to at least one author identified with an ORCID and more are added each month. Existing APIs allow customers to query the Web of Science and integrate data about affiliated researchers into their systems. How can this ORCID integration be leveraged? What are the next steps in utilizing identifiers to make connections across systems? Examples of potential data reuse will be presented and discussed.


Multi-Stakeholder Global Handle Registry - Enabling Digital Sovereignty

Christophe Blanchi

The DONA Foundation (DONA) was constituted in Geneva in early 2014 in part to administer and maintain the stable operation of the Global Handle Registry (GHR) along with multiple parties around the globe known as the Multi-Primary Administrators (MPAs). Responsibility for the GHR, previously held solely by CNRI in Reston Virginia USA, was transferred to the DONA Foundation in May, 2014. Since then, five MPAs have been authorized and credentialed by DONA to provide global handle services based on their credential. New organizations are currently in the process of being considered for authorization as future MPAs by the DONA Board of Directors.

The MPA-based approach to operating the GHR is key to enabling operations and management of the Handle System on a distributed multi-stakeholder basis. In this session, we will describe the operation and administration of the GHR on a multi-primary basis, under the overall administration of DONA, and will present some of the related standards, operating policies and procedures.


Using DOIs and Crossref to assess the evolving institutional support for open access publication fees

Najko Jahn

Persistent identifier not only reference digital objects, but come with useful metadata. In my session, I present how research libraries can re-use Crossref's API and standardized metadata to keep track of recent developments in scholarly communication. More specifically, I address the problem of institutional spending for open access publication fees and journal subscriptions in Germany, and how to compare this expenditure across institutions by re-using Crossref's bibliographic index. As a tool, rOpenSci's rcrossref client is used. Several open questions to the PID community remain including versioning of metadata, in particular with regard to publisher and journal titles, as well as the current status of licensing information coverage in Crossref's index.




Persistence - Session Details



Wikidata and Persistent Identifiers

Arthur Smith - American Physical Society

Wikidata as an interoperability solution. Wikidata is the structured database solution (now about 4 years old) from the WikiMedia Foundation and is already the central source for linking articles across the many different language-specific wikipedia's. Wikidata allows additional structured data, including a wide variety of external identifiers, to be attached to entities. As a free, open, standards-based (RDF and SPARQL supported), and universally editable database, Wikidata may provide an ideal solution for interlinking and providing an interoperable interface for at least some persistent identifiers.


Assign locally, resolve globally: prefix-based collection access

Tim Clark, Nick Juty

This talk will discuss a joint project of the California Digital Library and the European Bioinformatics Institute, with other collaborators, to develop a common approach for global resolution of locally-assigned accession numbers, based on a shared registry of defined resource prefixes and provider codes. We believe our approach will enable academic publishers to implement the Joint Declaration of Data Citation Principles (JDDCP) more rapidly, easily and effectively for data in key bioinformatics resources.


What do we mean

John Kunze

What do we mean by Persistence? We don't have well-defined terms to talk about the various flavors of persistence that digital service providers support. How do we make "persistent" reference to objects for which limited change is not a bug, but a feature? And how do we characterize such change? In this talk we invite feedback and testing on a draft set of persistence terms designed to help providers set user expectations and to inform users in their long-term linking choices.


ORCID & Hyper-authored Articles: What are the options?

Cory Craig

The number of published hyper-authored papers (those with hundreds or thousands of authors) is on the rise. What are the options for display of ORCID iDs in these multi-authored works? I will give an overview of the trends, issues, and logistics in hyper-authored papers with the goal of identifying options for including and displaying PIDs in these works.


Broken URLs

Xinjian Guo

"The Yale Persistent Linking Service (YPLS) allows for creation of permanent links or handles that can be resolved to target URLs via Handle.Net Registry (HNR) by CNRI under the DONA Foundation. Broken URLs or outdated bookmarks often make the exploration and discovery of digital data very frustrating. Persistent links provide a mechanism for ensuring that links to digital assets will continue to resolve and provide access to digital content even if the location of the content and the underlying software change over time. The handle syntax is prefix/local-unique-identifier which makes the handle globally unique as the prefix Yale has registered is unique. YPLS supports two types of handle formats, one is based on a string of several alphanumeric characters generated with NOID (Nice Opaque IDentifier) minter, for example 10079/sqv9sf1, another is the semantically meaningful handles widely used by the libraries for online catalogs and finding aids, for example 10079/bibid/161348. A web interface was developed for users to create, update, delete, or resolve a single handle or a batch of handles using an input file. The web service API allows any system on campus to generate and manage handles programmatically.


Blockchain

Lambert Heller

Blockchain technology is - quiet possibly - about to disrupt the software related service infrastructure of science. (Good overview: S Bartling & B Fecher 2016, http://doi.org/10.5281/zenodo.60223) . This is true especially in a field like PIDs, where executive powers are concentrated in the hands of few trusted agents. The mere availability of blockchain begs the question: Why don't we separate the production, updates and record keeping of IDs (which can now be done in a distributed way, without a need for any trusted central agents) from curating collections of PID'ed objects which meet certain criteria? What could be reached by e.g. coining PIDs for newly created objects directly by e.g. optical digital sensors, right away from the lab bench? Or by blockchain-based person identifiers which allow for pseudonymous (but otherwise open) publications, contributions to peer review or grant applications? - And assumed blockchain-based PIDs will prevail in some of these exemplary application fields - how do we make sure that traditionally organized PIDs and blockchain-based PIDs stay interoperable?




Org-ID - Session Details



JISC

Christopher Brown

Following the publication of the report “Review of selected organisational IDs and development of use cases for the Jisc CASRAI-UK Organisational Identifiers Working Group”, Jisc has been reviewing its recommendations and engaging with UK stakeholders and international partners on the requirements for an orgID. Although there are many technical issues to resolve with the adoption of an existing, or new, orgID, any solution will only gain traction when implemented by key stakeholders. Can we wait for a perfect solution or should we promote one that satisfies most requirements but can be improved? Are higher education institutions in the UK waiting for guidance from research funders before making a decision, while developing short-term solutions? Whatever is the best solution, it will only be adopted when key stakeholders have been persuaded to adopt it and integrate it with their existing systems. Establishing the benefits are key to selling any solution, but to achieve a consensus requires community engagement. This session will highlight the work Jisc has undertaken to understand the benefits and the work engaging with stakeholders and the community.


Identifying Legal Entities

Daniel Goroff

The Global Legal Entity Identifier (LEI) System is designed to uniquely and unambiguously identify participants in financial transactions. The ISO 17442 standard defines a set of attributes or legal entity reference data that are the most essential elements of identification. The Common Data File format provides the specificity needed for implementation of the ISO standard. The reporting of LEI and legal entity reference data is conducted daily by LEI issuing organizations using the Common Data File format.

The LEI code itself is neutral, with no embedded intelligence or country codes which would create unnecessary complexity for users. Four key principles underlie the LEI: 1) It is a global standard; 2) A single, unique identifier is assigned to each legal entity; 3) It is supported by high data quality; and 4) The dataset is a public good, available free of charge to all users."


GRID

Andres Mori

The Global Research Identifier Database (GRID) is a free (CC-BY), manually curated database of organizations associated with research. Its main purpose is to assist in issues with disambiguation, integration and analysis of data relevant to organizations featured in the scientific process. We will discuss the challenges in terms of data coverage, structure and quality presented by a persistent identifier database at a global scale. The benefits of offering various data formats for distribution will also be addressed, specifically in terms of API and open linked data (RDF).


Organisation Identifiers - APS experience

Arthur Smith

Issues of granularity, hierarchy, naming, varieties of roles, geographic location and change over time make institutional identifiers surprisingly complex to handle, and standards (de facto or otherwise) in this area have so far proven inadequate to the problem of allowing researchers to unambiguously and uniquely identify their institutions in publication or other venues. Ideally this session would examine existing and new institutional identifiers and the important use cases they need to support, and help chart a path to solving the real problems in this area.

Open identifiers deserve their own festival


Why build an open identifier infrastructure? So that anyone can use it to create cool tools and services for the research community. This year's festival themes include:

PID stories

Which strategies worked? Which strategies failed? Tell us your horror stories! Share your victories!

Emerging PIDs

Long-term identifiers are no longer just for digital objects. We have use cases for people, organizations, vocabulary terms, and more. What additional use cases are you working on?

PIDagogy

It’s a challenge for those who provide PID services and tools to engage the wider community. How do you teach, learn, persuade, discuss, and improve adoption? What's it mean to build a pedagogy for PIDs?

PID Sci

They live in the lab. They live in the field. Let's talk about practical applications of PIDs in the real-life research world.

The I-word

What would make heterogeneous PID systems "interoperate" optimally? Would standardized metadata and APIs across PID types solve many of the problems, and if so, how would that be achieved? What about standardized link/relation types?

Persistence

So many factors affect persistence: mission, oversight, funding, succession, redundancy, governance. Is open infrastructure for scholarly communication the key to achieving persistence? What are the frontiers of ‘persistence’?

Org IDs

There are tons of different approaches to organizational identifers. So they get their own track.




Register now


PIDapalooza is a two-day Persistent Identifier festival, happening for the first time this year: November 9-10, 2016. California Digital Library, Crossref, DataCite, and ORCID invite you to Iceland for a mixture of PID demos, workshops, brainstorming and updates on the state of the art.

Where:
Radisson Blu Saga Hotel
Hagatorg, 107
Reykjavík, Iceland

When:
9th and 10th November 2016


Register here!

Can't wait? Contact us!



@pidapalooza
#pidapalooza


info@pidapalooza.org

Crossref Crossref Crossref CDL