If you wish to contribute to the linguistic linked data cloud and have your resource included in the cloud, please fill in the form at LOD Cloud
Linguistic Linked Open Data is a movement about publishing data for linguistics and natural language processing using the following principles:
The primary benefits of LLOD have been identified as:
If you wish to contribute to the linguistic linked data cloud and have your resource included in the cloud, please fill in the form at LOD Cloud
This document guides the process of creating a linked data (LD) version of a lexical resource, particularly a bilingual dictionary. It contains advice on the vocabularies selection, RDF generation process, and publication of the results. Read More
The process of creating a Linked Data (LD) version of a lexical resource, in particular BabelNet, is describe in these guidelines. These guidelines contain advice on the vocabularies selection, RDF generation process and publication of the results. This document describes the models used and the design decisions taken during the conversion of BabelNet into the well-known lemon representation. More in general, we will describe common patterns that naturally emerge when converting a lexical resource into RDF format. Read More
This document describes proposed best practices for transforming multilingual terminologies, particulary those available in TBX format, into a Linked Data version. The best practices are exemplified using the well-known IATE resource. The document describes the models and vocabularies to be used and describes the conversion using a running example. Read More
This document describes best practices to follow for the generation of Linked Data text corpora, using the NLP Interchange Format (NIF). NIF is an RDF/OWL-based format that aims to achieve interoperability between NLP tools, language resources and annotations. It can be used to assign URIs to strings and annotate the resulting resources. The Brown corpus serves as example throughout these guidelines. Read More
This document describes best practices to follow for the implementation of RESTful NLP web services that rely on the NLP Interchange Format (NIF). „NIF is an RDF/OWL-based format that aims to achieve interoperability between NLP tools language resources and annotations.“ As a proof-of-concept, we have implemented NIF wrappers for the Stanford POS tagger and Stanford parser. Read More
This best practice describes how to exploit Linguistic Linked Data resources. The suggested steps for exploitation comprise: search and discovery of relevant resources, verifying the license of the dataset, navigating to the distribution of the data (download or SPARQL endpoint), extraction of the data that is relevant for a particular purpose or application. Read More
This document recommends best practices for building Linguistic Linked Open Data (LLOD) aware web services. LLOD denotes the representation of linguistic resources in accordance with linked data principles. These principles include that entities are to be identified by HTTP URIs providing RDF-based information about the entity including links to related entities. LLOD-aware services consume, process and produce such resources. LLOD-aware services are services that consume resources available as Linked Data as Input and output an RDF resource that can in turn be published as Linked Data. Read More
Lemon is an RDF model for representing lexical information relative to ontologies. This guides helps you get starte with using the lemon model. Read More