Skip to content

Customisations

User-defined prefixes

A namespace prefix is a mapping from a prefix to a namespace URL. For example

owl: http://www.w3.org/2002/07/owl#

Tripper already include a default list of predefined prefixes. Additional prefixed can be provided in two ways.

With the prefixes argument

Several functions in the API (like save_dict(), as_jsonld() and TableDoc.parse_csv()) takes a prefixes argument with which additional namespace prefixes can provided.

This may be handy when used from the Python API.

With custom context

Additional prefixes can also be provided via a custom JSON-LD context as a "prefix": "namespace URL" mapping.

See User-defined keywords for how this is done.

User-defined keywords

Tripper already include a long list of predefined keywords, that are defined in the default JSON-LD context. A description of how to define new concepts in the JSON-LD context is given by JSON-LD 1.1 document, and can be tested in the JSON-LD Playground.

A new custom keyword can be added by providing mapping in a custom JSON-LD context from the keyword to the IRI of the corresponding concept in an ontology.

Lets assume that you already have a domain ontology with base IRI http://example.com/myonto#, that defines the concepts for the keywords you want to use for the data documentation.

First, you can add the prefix for the base IRI of your domain ontology to a custom JSON-LD context

"myonto": "http://example.com/myonto#",

How the keywords should be specified in the context depends on whether they correspond to a data property or an object property in the ontology and whether a given datatype is expected.

Simple literal

Simple literals keywords correspond to data properties with no specific datatype (just a plain string).

Assume you want to add the keyword batchNumber to relate documented samples to the number assigned to the batch they are taken from. It corresponds to the data property http://example.com/myonto#batchNumber in your domain ontology. By adding the following mapping to your custom JSON-LD context, batchNumber becomes available as a keyword for your data documentation:

"batchNumber": "myonto:batchNumber",

Literal with specific datatype

If batchNumber must always be an integer, you can specify this by replacing the above mapping with the following:

"batchNumber": {
    "@id": "myonto:batchNumber",
    "@type": "xsd:integer"
},

Here "@id" refer to the IRI batchNumber is mapped to and "@type" its datatype. In this case we use xsd:integer, which is defined in the W3C xsd vocabulary.

Object property

Object properties are relations between two individuals in the knowledge base.

If you want to say more about the batches, you may want to store them as individuals in the knowledge base. In that case, you may want to add a keyword fromBatch which relate your sample to the batch it was taken from. In your ontology you may define fromBatch as a object property with IRI: http://example.com/myonto/fromBatch.

"fromBatch": {
    "@id": "myonto:fromBatch",
    "@type": "@id"
},

Here the special value "@id" for the "@type" means that the value of fromBatch must be an IRI.

Providing a custom context

Custom context can be provided for all the interfaces described in the section Documenting a resource.

Python dict

Both for the single-resource and multi-resource dicts, you can add a "@context" key to the dict who's value is

  • a string containing a resolvable URL to the custom context,
  • a dict with the custom context or
  • a list of the aforementioned strings and dicts.

For example

{
    "@context": [
        # URL to a JSON file, typically a domain-specific context
        "https://json-ld.org/contexts/person.jsonld",

        # Local context
        {
            "fromBatch": {
                "@id": "myonto:fromBatch",
                "@type": "@id"
            }
        }
    ],

    # Documenting of the resource using keywords defined in the context
    ...
}

Note that the default context is always included and doesn't need to be specified explicitly.

YAML file

Since the YAML representation is just a YAML serialisation of a multi-resource dict, custom context can be provided by adding a "@context" keyword.

For example, the following YAML file defines a custom context defining the myonto prefix as well as the batchNumber and fromBatch keywords. An additional "kb" prefix (used for documented resources) is defined with the prefixes keyword.

---

# Custom context
"@context":
  myonto: http://example.com/myonto#

  batchNumber:
    "@id": myonto:batchNumber
    "@type": xsd:integer

  fromBatch:
    "@id": myonto:fromBatch
    "@type": "@id"


# Additional prefixes
prefixes:
  kb: http://example.com/kb#


resources:
  # Samples
  - "@id": kb:sampleA
    "@type": chameo:Sample
    fromBatch: kb:batch1

  - "@id": kb:sampleB
    "@type": chameo:Sample
    fromBatch: kb:batch1

  - "@id": kb:sampleC
    "@type": chameo:Sample
    fromBatch: kb:batch2

  # Batches
  - "@id": kb:batch1
    "@type": myonto:Batch
    batchNumber: 1

  - "@id": kb:batch2
    "@type": myonto:Batch
    batchNumber: 2

You can save this context to a triplestore with

>>> from tripper import Triplestore
>>> from tripper.dataset import save_datadoc
>>>
>>> ts = Triplestore("rdflib")
>>> save_datadoc(  # doctest: +ELLIPSIS
...     ts,
...      "https://raw.githubusercontent.com/EMMC-ASBL/tripper/refs/heads/master/tests/input/custom_context.yaml",
... )
AttrDict(...)

The content of the triplestore should now be

>>> print(ts.serialize())
@prefix chameo: <https://w3id.org/emmo/domain/characterisation-methodology/chameo#> .
@prefix kb: <http://example.com/kb#> .
@prefix myonto: <http://example.com/myonto#> .
@prefix owl: <http://www.w3.org/2002/07/owl#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
<BLANKLINE>
kb:sampleA a owl:NamedIndividual,
        chameo:Sample ;
    myonto:fromBatch kb:batch1 .
<BLANKLINE>
kb:sampleB a owl:NamedIndividual,
        chameo:Sample ;
    myonto:fromBatch kb:batch1 .
<BLANKLINE>
kb:sampleC a owl:NamedIndividual,
        chameo:Sample ;
    myonto:fromBatch kb:batch2 .
<BLANKLINE>
kb:batch2 a myonto:Batch,
        owl:NamedIndividual ;
    myonto:batchNumber 2 .
<BLANKLINE>
kb:batch1 a myonto:Batch,
        owl:NamedIndividual ;
    myonto:batchNumber 1 .
<BLANKLINE>
<BLANKLINE>

Table

The __init__() method of the TableDoc class takes a context argument with witch user-defined context can be provided. The value of the context argument is the same as for the @context key of a Python dict.

User-defined resource types

TODO

Extending the list of predefined resource types it not implemented yet.

Since JSON-LD is not designed for categorisation, new resource types should not be added in a custom JSON-LD context. Instead, the list of available resource types should be stored and retrieved from the knowledge base.