DMLEX on Wikibase
A serialization of the DMLEX model, for LexBib Wikibase
See our full paper about the experiments described on these pages from November 2025 here.
This page describes how lexical resource datasets that follow the DMLEX model are represented on this Wikibase instance. The aim of these experiments is to present DMLEX datasets to users on collaboratively editable entity pages, and to allow SPARQL querying of the content.
This model is heavily inspired by the DMLEX Ontology (the RDF serialization of DMLEX deploying Ontolex-Lemon). The Wikibase model for lexical entries is built on top of the three Ontolex-Lemon core classes entry, sense and form.
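As an illustration of how the three classes hang together, the following query is a minimal sketch that walks from entries to their senses and forms. It reuses the entry pattern (ldp:P5 lwb:Q111), wikibase:lemma and ontolex:sense from the queries further down this page. The predicates ontolex:lexicalForm, ontolex:representation and skos:definition are assumptions taken from the standard Wikibase lexeme RDF mapping; whether the uploaded DMLex data populates all of them is not confirmed here.

```sparql
PREFIX lwb: <https://lexbib.elex.is/entity/>
PREFIX ldp: <https://lexbib.elex.is/prop/direct/>

# Sketch: list entries together with their senses and forms.
SELECT ?entry ?lemma ?sense ?definition ?form ?representation
WHERE {
  # Same entry pattern as in the lemma query on this page.
  ?entry ldp:P5 lwb:Q111 ;
         wikibase:lemma ?lemma .
  # Senses and forms are optional, so entries lacking either still appear.
  OPTIONAL {
    ?entry ontolex:sense ?sense .
    # Assumption: sense glosses are exposed as skos:definition.
    OPTIONAL { ?sense skos:definition ?definition . }
  }
  OPTIONAL {
    # Assumption: forms are linked via ontolex:lexicalForm.
    ?entry ontolex:lexicalForm ?form .
    OPTIONAL { ?form ontolex:representation ?representation . }
  }
}
LIMIT 50
```

Because senses and forms are joined independently, an entry with several of each yields one row per sense and form combination; the LIMIT only keeps the sample small.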
For some notes about the 2024 preliminary experiments, see this page.
DMLEX on LexBib Wikibase
DMLex dictionaries
- A copy of the script for DMLex-to-Wikibase transformation and upload is found here.
- A log of encountered issues is found here: https://github.com/dlindem/dmlex-wikibase/blob/main/comments/dmlex_comments.md
The following dictionaries have been uploaded from DMLex 3.0 XML sources to this Wikibase. See the corresponding pages for details.
- Veliki slovensko-nemški slovar (SLDE-V 3.2): upload complete; the page contains detailed information
- Priročni slovensko-angleški slovar (SLEN-P 3.2): upload complete; the page contains detailed information
- Veliki angleško-slovenski slovar (ENSL-V 3.2): upload complete; the page contains detailed information
- Slovensko-angleški slovar (SLEN 3.2): upload complete; the page contains detailed information
- Slovensko-srbohrvaški slovar (SLSH 3.3): upload complete; the page contains detailed information
- Srbohrvaško-slovenski slovar: TBD
Coarse statistics
This query lists DMLex resources on this Wikibase and counts entries and senses.
PREFIX lwb: <https://lexbib.elex.is/entity/>
PREFIX ldp: <https://lexbib.elex.is/prop/direct/>
SELECT ?resource ?resourceLabel (COUNT(DISTINCT ?entry) AS ?entries) (COUNT(DISTINCT ?sense) AS ?senses)
WHERE {
  ?resource ldp:P5 lwb:Q100 .
  ?entry ldp:P207 ?resource ;
         ontolex:sense ?sense .   # only entries with at least one sense are counted
  SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
}
# ?entries and ?senses are aggregates, so they are not grouping keys.
GROUP BY ?resource ?resourceLabel
ORDER BY DESC(?entries)
Slovene lemmata across dictionaries
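This query produces a Slovene lemma list. Entries that share the same lemma and part of speech (lempos) are grouped together, and for each group the query reports the number and names of the resources it occurs in.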
PREFIX lwb: <https://lexbib.elex.is/entity/>
PREFIX ldp: <https://lexbib.elex.is/prop/direct/>
SELECT ?lemma (LANG(?lemma) AS ?lang) ?posLabel
       (COUNT(DISTINCT ?resource) AS ?num_of_resources)
       (GROUP_CONCAT(DISTINCT ?resourceName; SEPARATOR=", ") AS ?in_resource)
WHERE {
  ?entry ldp:P5 lwb:Q111 ; ldp:P207 ?resource ; dct:language lwb:Q209 ;
         wikibase:lemma ?lemma ; wikibase:lexicalCategory ?pos .
  ?resource rdfs:label ?resourceName . FILTER(LANG(?resourceName) = "en")
  SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
}
# ?num_of_resources and ?in_resource are aggregates, so they are not grouping keys.
GROUP BY ?lemma ?posLabel
ORDER BY LCASE(?lemma)
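A possible variation, given here only as a sketch, restricts the list to lempos groups that occur in more than one dictionary. It uses the same properties and items as the query above and adds a HAVING clause. To stay independent of the label service it groups on the ?pos entity rather than ?posLabel, which is a simplification of this example and not part of the original query.

```sparql
PREFIX lwb: <https://lexbib.elex.is/entity/>
PREFIX ldp: <https://lexbib.elex.is/prop/direct/>

# Sketch: Slovene lemma/part-of-speech pairs attested in at least two resources.
SELECT ?lemma ?pos (COUNT(DISTINCT ?resource) AS ?num_of_resources)
WHERE {
  ?entry ldp:P5 lwb:Q111 ;
         ldp:P207 ?resource ;
         dct:language lwb:Q209 ;
         wikibase:lemma ?lemma ;
         wikibase:lexicalCategory ?pos .
}
GROUP BY ?lemma ?pos
# Keep only groups that span more than one resource.
HAVING (COUNT(DISTINCT ?resource) > 1)
ORDER BY DESC(?num_of_resources) LCASE(?lemma)
```

Raising the threshold in the HAVING clause narrows the list further, for example to lemmata shared by all uploaded Slovene-source dictionaries.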