Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents
stylenone

Note: A copy of this document is formatted in MS Word and is attached. An earlier draft is available at https://docs.google.com/document/d/1IoECf3-PmhE2aVf2ZR83QanZtsMTw7Aa6FCILi9Aq8s/edit#

Definitions

Distributed Identity refers to a particular method of verifying one’s identity and personal attributes to a relying party. 

Distributed Identity is mainly used for accessing online services but there is no theoretical obstacle to presenting a digital identity in a non-virtual setting like a physical entry at a gate.

Distributed Identity is most useful at first-time access to a service, like at registration, because it allows for the user - the Holder - to share verifiable attestations and claims about their identity and attributes which may have been self-asserted or granted. These, in turn, should indeed be verified by the relying party. The relying party may then store some of the information gathered in a local account and issue local credentials, in which case in later interactions distributed identity plays a lesser role. However, it may also be the case that the claims are not stored, or that they need to be updated or maintained, and then Distributed Identity plays an equal role in subsequent interactions as in the first one. 

The attestations and claims are granted to a person by a community or authority or by herself. These are stored by the user on their device, typically in a wallet application or browser extension and then presented to the relying party when accessing the service.

...

The verification could use a digital infrastructure like digital signature verification on a signed piece of data or consultation with a registry or a distributed ledger. 

Motivation

There are several advantages to be gained from a Distributed Identity system in Research and Education (DI4R) setting:

  • Better attribute aggregation: in a DI4R setting, attribute aggregation happens within the user wallet. This enables attributes from more sources with the user in perfect control of the release.
  • Easier integration for the identity and service providers: Providers need not federate - they can decide to provide or consume user information or stop doing that at any time and it is only up to the user whether they want to provide attributes or not. 
  • No tracking by IdP: In a SAML or OIDC setting, the Identity Provider can track in real-time where its users are logging in. In DI4R, the issuer cannot track any subsequent usage of the issued information and thus learn about the user's behaviour.
  • Easier compliance with GDPR:
    • The user holds control over cards and can easily delete them.
    • For the IdP there is not much difference, except in terms of less control - the IdP cannot know/limit what happens with the credentials once issued - they can only track their inclusion into the user's wallet (including after a claim has expired or is revoked).
    • The IdP's ability to control attribute release improves privacy and data protection.
    • Not having a proxy (in the long run -proxies are temporarily very useful) is also a big advantage.
    • The authorization is decoupled from providing attributes.
    • The service is responsible for asking for only what it needs and whom it trusts and is responsible for claims regarding verification, authorization and GDPR-complied handling of released information.
  • Easier in the ecosystem to exchange information without top-level trust route approval - basically a mesh-like federation. 
    • Explanation: we came up with tagging in eduGAIN so that we don’t break the trust model, while entities can still express additional content.

...

There is a Distributed Identity solution provider already in use in the EU, mainly in the Netherlands: the I Release My Attributes (IRMA) by Privacy By Design foundation. 

Since the components of that system are available on GitHub, and several key components are already Licensed with Apache 2.0, it is natural that we opted to experiment with that. 

Therefore, from this point on, we specifically refer to IRMA as our prospective DI4R solution.

...

  1. Usually, the session starts with the user performing some action on the website (e.g. clicking on "Log in with IRMA").
  2. The requestor sends its session request (containing the attributes to be disclosed or issued, or message to be signed) to the IRMA server. Depending on its configuration, the IRMA server accepts the session request only if the session request is authentic (e.g. a validly signed session request JWT) from an authorised requestor.
  3. The IRMA server accepts the request and assigns a session token (a random string) to it. It returns the contents of the QR code that the front-end must display: the URL to itself and the session token.
  4. The front-end (irma-frontend) receives and displays the QR code, which is scanned by the IRMA app.
  5. The IRMA app requests the session request from step 1, receiving the attributes to be disclosed or issued, or the message to be signed.
  6. The IRMA server returns the session request.
  7. The IRMA app displays the attributes to be disclosed or issued, or the message to be signed, and asks the user if she wants to proceed.
  8. The user accepts.
  9. The IRMA server performs the IRMA protocol with the IRMA app, issuing new attributes to the user, receiving and verifying attributes from the user's IRMA app, or receiving and verifying an attribute-based signature made by the user's app.
  10. The session status (DONE, CANCELLED, TIMEOUT), along with disclosed and verified attributes or signatures depending on the session type, are returned to the requestor.

Use of Proxies

DI4R in Proxy Approach proxy approach is a logical extension of the available data sources for a service, for which the multi-protocol proxy was created in the first place. 

In this arrangement, the sole source of all information is the Proxy from the SP's point of view.

...

IRMA implements the Idemix Protocol to handle pseudonymous attribute handling. The Idemix protocol provides a way for users to use verifiable pseudonymous identifiers, coupled with certain attributes at services, without revealing their identity. With Idemix, this includes non-traceability across services, by the virtue of the user having credentials targeted to each service. 

The credentials are always issued by an Issuer Organization. The User contacts the Issuer and registers an account and establishes a pseudonym. If the user is eligible for certain attributes, a credential will be issued containing the pseudonym and the attributes to the user. 

Then, the user can prove the possession of such credentials without actually revealing them to a verifier organization. 

Use Cases (Generalised)

The following use case descriptions present some ideas of how the system may be used in an academic setting.

...

In a traditional SAML flow, the following happens. The goal is to enable user Aladár (A) to manage the authorisation of user Béla (B) authorization to service S, but in a way that this information is not maintained in S but in an external source, the Membership Management Service (MMS). 

  1. A logs in on the web interface of the MMS, a SAML SP and an account are created.
  2. A creates a Virtual Organization / Community / Group - terminology depends on the actual tool but let us call it (VO)
  3. A wants to invite B to his VO. In order to do this, he needs an email address to B. This email address serves as a trust anchor for the moment, therefore it needs to really belong to B and not be compromised.
  4. A sends an email invitation to B with a link containing a token. The email is sent by the MMS system.
  5. B follows the link to the web interface of the MMS, prompted for login. may already have a login (for previous participation in other VOs) or needs to create a new one. may log in with a federated account but it could be the case that there is none, and a local account is created or a VHO account is used. This scenario is made possible by the fact that really the access to the email inbox is what provides the trust for the VO membership.
  6. After creating/accessing a local account, the token sent in the link is processed and B's account is now associated with the VO. 
  7. will eventually access a service that needs this membership information, commonly called entitlement.
    1. The service will perform a login flow.
    2. with With B's user identifier queries the MMS back-end, for instance, a SAML AA or an integration. This requires the usage of the same user identifier that was used at the MMS, typically a common OIDC/SAML source.
  8. A may revoke the entitlement at any time, which will take effect at the next session: the service accessed will query the MMS and will not get the entitlement.

...

With the introduction of DI4R, the flow may be significantly simplified. 

  1. creates a VO at the MMS service.
  2. A sends an invitation to B to the VO.
    1. an An email is sent to B from the MMS.
    2. a A card is registered to the registry?
  3. visits the link and receives the card, which is added to the wallet.
  4. visits a service that needs the entitlement and presents the card.
  5. (the The card is verified in the common registry, therefore revocation is possible.)

With this solution, does not have to use the same login (i.e. the MMS and the target S do do not need to be in the same federation). Possibly, B can receive the card at a page maintained by the DI4R provider. 

Gliffy Diagram
displayNameMMS Flow
nameMMS Flow
pagePin5

...

  • The certificate of acceptance will contain the name of the author and the metadata of the article, therefore this can be handled as a simple card. 
  • The certificate of editing will also contain the name of the author and the edited issue, making it very similar to the certificate of acceptance.
  • The certificate of review should also be connected to the person who did the review but it should not reveal what the review entailed. The way for doing that is to be in a large enough set of people so that the k-anonymity is sufficiently high. Otherwise, based on the exact timing and the fields of interest of a reviewer an author might be able to guess who did their review. Therefore, only larger time ranges (e.g., a year range) should be revealed. Smaller journals may want to pool themselves together and issue a certificate that only says that the review was done in one of the journals in question. 

Gliffy Diagram
macroId185be960-9022-4171-bab6-ace1919b0ce9
displayNameJournal to IRMA
nameJournal to IRMA
pagePin4

...

  1. User A (the sponsor already in collaboration)sends an email invitation to user B  (outside the technical collaboration). 
    1. A describes B on a form at the VHO and inputs the email address of B.
    2. the The email is sent out.
  2. User B visits the VHO service and receives a card that describes their identity as stated by A.
    1. visits the service, reviews the data stated about her by A, and receives a card.
    2. The card gets registered to the registry.
  3. User can now access services within the collaboration.
    1. B attempts to access the service.
    2. the The service verifies the card in the registry and allows access. 

Gliffy Diagram
macroId15fcc9b6-e7c1-4e1a-8560-7f9075a0c94e
displayNameVHO case
nameVHO case
pagePin2

...

With this method, the Verifier no longer trusts an IdP (something that is exposed on the public internet) but trusts the authentication and the possession of the wallet. Arguably, this provides the opportunity for a stronger level of assurance (i.e. two factors to the wallet+possession of the device).

Issues to Address and Discussion Discussion

Assurance

Since many sources can provide IRMA attributes, the IRMA platform does not standardise levels of assurance beyond individual profiles. Assurance levels are provided by using the corresponding schema-defined credential attributes, that is, IRMA passes on the level of assurance provided by sources only if these levels are incorporated into the used schema and implemented by the IRMA issuer. 

For example, the attribute “assurancelevel” is used in schemas that provide data from passports or ID cards, and it conveys the levels set by the document issuer or an intermediate entity that collected and verified the information provided with the credential. This level is in line with eIDAS. Some other schemes use “digidlevel” to provide the level from the Dutch Digital ID (digid.nl), which is the assurance with which identity is verified in the Dutch population register.

...

When it comes to traditional authentication sessions, the need for separate authentication factors for high-stakes sessions has been long acknowledged. 

Login names and passwords may fall prey to a hacked browser or operating system, in which case an independent channel - a one-time password challenge, a push notification on a secondary device, paper-based factors or even something as weak as an interceptable SMS can provide a crucial last line of defence. 

Most of the currently trending second-factor options assume the primary channel to be a desktop or laptop computer and the mobile as a secondary device. 

However, IRMA is a mobile-only application, so the user is less likely to have a second mobile device that may act as an independent source of authentication. 

This is a drawback as no independent channel is provided for the user's access to their information. The information is stored on the user device, defended by a PIN, which, should the mobile operating system get exposed, will also be compromised, and the private information can be stolen.

...

Another important factor to consider is the very nature of the distributed identity system - there are no huge amounts of sensitive data on any one device, instead, all the devices store their owner’s data only, making them much less valuable targets. 

Moreover, the client devices normally don’t provide server functionality at all, that is, they are usually not addressable under a well-known DNS name, and there are no ports exposed.

Besides all this, the mobile device is much more personal than the desktop or laptop device and is much better tied to the user, also, by the virtue of providing the primary means of personal communication, a mobile device is much sooner reported and their connection remotely deactivated when stolen or lost. 

This shows that the lack of a second-factor option in the case of a mobile wallet may be compensated by other factors.

...

An alternative wallet in this context means a non-smartphone implementation of a wallet. While having the ability to use alternative wallets seems a necessity as the user base grows, a non-smartphone implementation comes with several challenges. 

One such challenge is the QR-Code reading for which the smartphone is especially well suited, but is not impossible in another architecture either, i.e. a browser extension. Another challenge is the safe storage of the ‘cards’, but that also can be done.

Attribute Revocation

Revocation is enabled per credential type in the IRMA scheme. If so, the properly configured issuer’s IRMA  IRMA will issue revocation-enabled credentials of that type. If the user has a revocation-enabled credential then proving non-revocation is not required; instead, they can just disclose attributes from the credential, which is much cheaper. Non-revocation is still ensured by using revocation update messages which are created whenever an issuer performs a revocation, which also distributes issuer-related information that is updated at the time of revocation and is necessary to disclose attributes.

...

  • Multi-valued attributes
  • Alternative wallets
  • Scalable schema definition for a size of a federation like eduGAIN (of issuers)
    • 5k or more entities
  • Peer-to-peer claims (cards)
  • Pixie dusting - claiming that someone is your co-worker, club member, etc. 
  • Conventions on prefixes for wildcards used on attribute names
  • Use of multiple schemas and schema selector
    • UX is not hard but needs to be done well
    • allows Allows for different universes of DI4R
  • Enhanced presentation of cards
    • Since the user is in charge of exposing cards in their wallet to the service, it is important to present these cards, their content and their source in a clear but informative way. This requires further establishing of a standardised and scalable way to specify their presentation and access to supporting information which is also interoperable with existing identity infrastructures and trust frameworks.
  • Usability testing/evaluation
    • DI4R is a new concept, so it is a reasonable question whether the users understand the flow at all and the benefits that justify the adoption of changes. Should be done with appropriate early adopters such as researchers involved in Open Science.