LexVoc: Difference between revisions

From LexBib
(Created page with "= LexVoc Vocabulary of Lexicographic Terms = LexVoc is... == Graph Visualisations == * Graph view, complete, coloured: [https://lexbib.elex.is/query/#%23defaultView%3AGraph%0A...")
 
Line 1: Line 1:
= LexVoc Vocabulary of Lexicographic Terms =
= LexVoc Vocabulary of Lexicographic Terms =
LexVoc is...
We have started to develop LexVoc, a controlled vocabulary of Lexicography-related terms that shall be used as content descriptors, and linked to the corresponding bibliographical items. We have defined English preferred and alternative lexicalizations, and represented relations between terms according to the [https://www.w3.org/2004/02/skos/ W3C SKOS standard].
 
Sources for LexVoc have been the following:
 
# An [[Item:Q14502|updated and extended version]] of the index of “Bibliografía Temática de la Lexicografía” ([https://www.zotero.org/groups/1892855/lexbib/collections/UAS9GNZS/items/MC56SE35/collection Córdoba Rodríguez 2003]) translated to English (2003 e-bibliography [https://www.udc.es/grupos/lexicografia/bibliografia/tematica.html here])
# The [[Item:Q15134|typology of dictionaries]] by Engelberg and Storrer ([[Item:Q11281|2016]])
# The ''[[Item:Q14506|Glossary of Lexicographic Terms]]'' by Kipfer ([[Item:Q13652|2013]])
# The [[Item:Q14504|index]] of the volume ''Using Online Dictionaries'' ([[Item:Q3589|Müller-Spitzer 2014]])
# The [[Item:Q14512|''Linguistic Property'' branch]] of the [http://linguistics-ontology.org/ GOLD ontology]
 
We have merged all concepts stemming from sources (1) to (5), and set relations between them, so that terms can be represented as nodes in a single graph, with SKOS relations as edges.
 
In a second step, we have extended the vocabulary with a [[Item:Q14510|manually revised subset]] of salient term candidates, extracted from a corpus compiled using all English full texts present in the collection used for Elexifinder version 2 (Spring 2021).
 
We have then extended the vocabulary further, using term extraction results from subsets of our English full texts. This has been done for the field of [[Item:Q15007|dictionary digitization]] (narrower of [[Item:Q14318|dictionary making]]).
 
LexVoc main branches as they are defined today are listed below. Names of natural languages have experimentally been part of the vocabulary. This branch currently is not maintained, and not used as Elexifinder category, since natural languages as search filter are available through wikification (Elexifinder ''concepts'', not ''categories''.)
 
== Elexifinder Categories ==
 
Terms that serve as Elexifinder category belong to the first three [[Property:P72|skos:broader]] hierarchy levels below the [[Item:Q1|root concept]]. Terms deeper in the hierarchy are considered in article indexation, and so are closeMatch terms without own broader-hierarchy, but the assigned category visible on Elexifinder will be the corresponding broader category of the third level below root.
 
== Graph Visualisations ==
== Graph Visualisations ==
* Graph view, complete, coloured: [https://lexbib.elex.is/query/#%23defaultView%3AGraph%0APREFIX%20lwb%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fentity%2F%3E%0APREFIX%20ldp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fdirect%2F%3E%0APREFIX%20lp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2F%3E%0APREFIX%20lps%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fstatement%2F%3E%0APREFIX%20lpq%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fqualifier%2F%3E%0APREFIX%20lpr%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Freference%2F%3E%0A%0Aselect%20%3Fs%20%3FsLabel%20%0A%3Frgb%0A%3Fo%20%3FoLabel%20%0A%23%3Fdistance%0A%23%3FedgeLabel%20%0Awhere%20%7B%0A%20%20%3Fs%20ldp%3AP5%20lwb%3AQ7.%0A%20%20%3Fs%20ldp%3AP72%2a%20lwb%3AQ1%20.%20%23%20root%20node%20of%20the%20representation%2C%20Q1%20for%20%22Lexicography%22%0A%20%20%3Fs%20ldp%3AP72%20%3Fo%20.%0A%20%20%23FILTER%20%28%3Fp%20%3D%20ldp%3AP72%29%0A%20%20%3Fs%20rdfs%3Alabel%20%3FsLabel%20.%0A%20%20FILTER%20%28lang%28%3FsLabel%29%3D%22en%22%20%7C%7C%20lang%28%3FsLabel%29%3D%22de%22%29%0A%20%20%3Fo%20rdfs%3Alabel%20%3FoLabel%20.%0A%20%20FILTER%20%28lang%28%3FoLabel%29%3D%22en%22%20%7C%7C%20lang%28%3FoLabel%29%3D%22de%22%29%0A%20%20%0A%20%20%7B%20select%20%3Fs%20%28count%20%28%3Fbroader%29%20as%20%3Fdistance%29%20%0A%20%20%20%20%20%20where%20%7B%20%3Fs%20ldp%3AP5%20lwb%3AQ7.%0A%20%20%20%20OPTIONAL%20%7B%20%3Fs%20ldp%3AP72%2B%20%3Fbroader%20.%20%7D%0A%20%20%20%20%20%20%20%20%20%20%20%20%7D%20GROUP%20BY%20%3Fs%20%3Fdistance%0A%20%20%7D%0A%20%20%0A%20%20BIND%20%28%0A%20%20COALESCE%28%0A%20%20%20%20IF%28%3Fo%20%3D%20lwb%3AQ21886%20%2C%20%220000CC%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%221%22%20%2C%20%22FF9999%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%222%22%20%2C%20%22FFB266%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%223%22%20%2C%20%22FFFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%224%22%20%2C%20%22CCFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%225%22%20%2C%20%22CCFFE5%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%226%22%20%2C%20%22CCFFE5%22%2C%201%2F0%29%2C%0A%20%20%20%20%22FFFFFF%22%0A%20%20%29%20AS%20%3Frgb%0A%29%0A%20%20%20%0A%20%20%7D%20GROUP%20BY%20%0A%20%20%20%20%3Fs%20%3FsLabel%20%0A%20%20%20%20%20%3Frgb%0A%20%20%20%20%3Fo%20%3FoLabel%0A%20%20%20%23%20%3Fdistance%20%0A%20%20%23%20%20%3FedgeLabel%20%0A%20%20%20%20 Build live].
* Graph view, complete, coloured: [https://lexbib.elex.is/query/#%23defaultView%3AGraph%0APREFIX%20lwb%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fentity%2F%3E%0APREFIX%20ldp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fdirect%2F%3E%0APREFIX%20lp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2F%3E%0APREFIX%20lps%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fstatement%2F%3E%0APREFIX%20lpq%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fqualifier%2F%3E%0APREFIX%20lpr%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Freference%2F%3E%0A%0Aselect%20%3Fs%20%3FsLabel%20%0A%3Frgb%0A%3Fo%20%3FoLabel%20%0A%23%3Fdistance%0A%23%3FedgeLabel%20%0Awhere%20%7B%0A%20%20%3Fs%20ldp%3AP5%20lwb%3AQ7.%0A%20%20%3Fs%20ldp%3AP72%2a%20lwb%3AQ1%20.%20%23%20root%20node%20of%20the%20representation%2C%20Q1%20for%20%22Lexicography%22%0A%20%20%3Fs%20ldp%3AP72%20%3Fo%20.%0A%20%20%23FILTER%20%28%3Fp%20%3D%20ldp%3AP72%29%0A%20%20%3Fs%20rdfs%3Alabel%20%3FsLabel%20.%0A%20%20FILTER%20%28lang%28%3FsLabel%29%3D%22en%22%20%7C%7C%20lang%28%3FsLabel%29%3D%22de%22%29%0A%20%20%3Fo%20rdfs%3Alabel%20%3FoLabel%20.%0A%20%20FILTER%20%28lang%28%3FoLabel%29%3D%22en%22%20%7C%7C%20lang%28%3FoLabel%29%3D%22de%22%29%0A%20%20%0A%20%20%7B%20select%20%3Fs%20%28count%20%28%3Fbroader%29%20as%20%3Fdistance%29%20%0A%20%20%20%20%20%20where%20%7B%20%3Fs%20ldp%3AP5%20lwb%3AQ7.%0A%20%20%20%20OPTIONAL%20%7B%20%3Fs%20ldp%3AP72%2B%20%3Fbroader%20.%20%7D%0A%20%20%20%20%20%20%20%20%20%20%20%20%7D%20GROUP%20BY%20%3Fs%20%3Fdistance%0A%20%20%7D%0A%20%20%0A%20%20BIND%20%28%0A%20%20COALESCE%28%0A%20%20%20%20IF%28%3Fo%20%3D%20lwb%3AQ21886%20%2C%20%220000CC%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%221%22%20%2C%20%22FF9999%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%222%22%20%2C%20%22FFB266%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%223%22%20%2C%20%22FFFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%224%22%20%2C%20%22CCFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%225%22%20%2C%20%22CCFFE5%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%226%22%20%2C%20%22CCFFE5%22%2C%201%2F0%29%2C%0A%20%20%20%20%22FFFFFF%22%0A%20%20%29%20AS%20%3Frgb%0A%29%0A%20%20%20%0A%20%20%7D%20GROUP%20BY%20%0A%20%20%20%20%3Fs%20%3FsLabel%20%0A%20%20%20%20%20%3Frgb%0A%20%20%20%20%3Fo%20%3FoLabel%0A%20%20%20%23%20%3Fdistance%20%0A%20%20%23%20%20%3FedgeLabel%20%0A%20%20%20%20 Build live].
* Graph view, Elexifinder categories only (upper three hierarchy levels): [https://lexbib.elex.is/query/#%23defaultView%3AGraph%0A%23%20this%20shows%20only%20three%20levels%20of%20skos%3Abroader%20relation%20below%20the%20specified%20root%20node.%0APREFIX%20lwb%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fentity%2F%3E%0APREFIX%20ldp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fdirect%2F%3E%0APREFIX%20lp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2F%3E%0APREFIX%20lps%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fstatement%2F%3E%0APREFIX%20lpq%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fqualifier%2F%3E%0APREFIX%20lpr%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Freference%2F%3E%0APREFIX%20lno%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fnovalue%2F%3E%0A%0A%0A%0Aselect%20distinct%20%3Fs%20%3FsLabel%20%3Ft%20%3FtLabel%0A%0Awhere%20%7B%0A%20%20%3Fs%20ldp%3AP5%20lwb%3AQ7.%0A%20%20%0A%20%20%7B%20lwb%3AQ1%20ldp%3AP73%2B%20%3Ft.%20%7D%20%0A%20%20UNION%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%0A%20%20%7B%20lwb%3AQ1%20ldp%3AP73%2Fldp%3AP73%20%3Ft.%20%7D%20%0A%20%20UNION%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%0A%20%20%7B%20lwb%3AQ1%20ldp%3AP73%2Fldp%3AP73%2Fldp%3AP73%20%3Ft.%20%7D%20%0A%20%20%0A%20%20%3Ft%20ldp%3AP72%20%3Fs.%0A%20%20%3Fs%20rdfs%3Alabel%20%3FsLabel%20.%20FILTER%20%28lang%28%3FsLabel%29%3D%22en%22%29%0A%20%20%3Ft%20rdfs%3Alabel%20%3FtLabel%20.%20FILTER%20%28lang%28%3FtLabel%29%3D%22en%22%29%0A%20%20%0A%20%20%7D Query].
== Update Workflow ==
As soon as a new term is linked to a member of the main SKOS graph using [[Property:P72|skos:broader]] or [[Property:P77|skos:closeMatch]], it is considered in subsequent iterations of article full text indexation.

Revision as of 21:54, 23 August 2021

LexVoc Vocabulary of Lexicographic Terms

We have started to develop LexVoc, a controlled vocabulary of Lexicography-related terms that shall be used as content descriptors, and linked to the corresponding bibliographical items. We have defined English preferred and alternative lexicalizations, and represented relations between terms according to the W3C SKOS standard.

Sources for LexVoc have been the following:

  1. An updated and extended version of the index of “Bibliografía Temática de la Lexicografía” (Córdoba Rodríguez 2003) translated to English (2003 e-bibliography here)
  2. The typology of dictionaries by Engelberg and Storrer (2016)
  3. The Glossary of Lexicographic Terms by Kipfer (2013)
  4. The index of the volume Using Online Dictionaries (Müller-Spitzer 2014)
  5. The Linguistic Property branch of the GOLD ontology

We have merged all concepts stemming from sources (1) to (5), and set relations between them, so that terms can be represented as nodes in a single graph, with SKOS relations as edges.

In a second step, we have extended the vocabulary with a manually revised subset of salient term candidates, extracted from a corpus compiled using all English full texts present in the collection used for Elexifinder version 2 (Spring 2021).

We have then extended the vocabulary further, using term extraction results from subsets of our English full texts. This has been done for the field of dictionary digitization (narrower of dictionary making).

LexVoc main branches as they are defined today are listed below. Names of natural languages have experimentally been part of the vocabulary. This branch currently is not maintained, and not used as Elexifinder category, since natural languages as search filter are available through wikification (Elexifinder concepts, not categories.)

Elexifinder Categories

Terms that serve as Elexifinder category belong to the first three skos:broader hierarchy levels below the root concept. Terms deeper in the hierarchy are considered in article indexation, and so are closeMatch terms without own broader-hierarchy, but the assigned category visible on Elexifinder will be the corresponding broader category of the third level below root.

Graph Visualisations

  • Graph view, complete, coloured: Build live.
  • Graph view, Elexifinder categories only (upper three hierarchy levels): Query.

Update Workflow

As soon as a new term is linked to a member of the main SKOS graph using skos:broader or skos:closeMatch, it is considered in subsequent iterations of article full text indexation.