LexVoc: Difference between revisions
Revision as of 22:29, 23 August 2021
LexVoc Vocabulary of Lexicographic Terms
The vocabulary is being developed for different purposes:
- Elexifinder categories, complementary to Elexifinder concepts (which are obtained through automatic wikification).
- Content-descriptive indexing of LexBib Zotero bibliographical items.
- Concepts in a tree graph with SKOS relations as edges, and “Lexicography” as root node. This concept tree can be seen as a ramification of the “Lexicography” subject heading as listed in cross-domain library vocabularies; see e.g. “Lexicography” in LCSH, or “Lexicography” in BLL.
LexVoc terms have English preferred and alternative lexicalizations; relations between them are represented according to the W3C SKOS standard.
- Graph view, complete, coloured: Build live.
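The term model described above (preferred and alternative English labels, SKOS relations as edges) could be sketched roughly as follows. This is only an illustration, not the LexVoc data model itself: the class name, the identifiers `root`, `c1`, `c2` and the alternative label are hypothetical placeholders.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Concept:
    """One vocabulary term, modelled loosely on a skos:Concept."""
    concept_id: str                                       # hypothetical identifier
    pref_label: str                                       # English skos:prefLabel
    alt_labels: List[str] = field(default_factory=list)   # English skos:altLabel values
    broader: List[str] = field(default_factory=list)      # ids of skos:broader targets


def broader_edges(concepts: List[Concept]) -> List[Tuple[str, str, str]]:
    """Return one (narrower_id, relation, broader_id) triple per skos:broader link."""
    return [(c.concept_id, "skos:broader", b) for c in concepts for b in c.broader]


# Placeholder data: ids and the alternative label are invented for illustration.
vocabulary = [
    Concept("root", "lexicography"),
    Concept("c1", "dictionary making", broader=["root"]),
    Concept("c2", "dictionary digitization",
            alt_labels=["digitization of dictionaries"], broader=["c1"]),
]

for edge in broader_edges(vocabulary):
    print(edge)
```

Keeping labels on the node and relations in a separate edge list mirrors the "terms as nodes, SKOS relations as edges" view used for the graph representations.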
Sources for LexVoc have been the following:
- An updated and extended version of the index of “Bibliografía Temática de la Lexicografía” (Córdoba Rodríguez 2003) translated to English (2003 e-bibliography here), members here
- The typology of dictionaries by Engelberg and Storrer (2016), members here
- The Glossary of Lexicographic Terms by Kipfer (2013), members here
- The index of the volume Using Online Dictionaries (Müller-Spitzer 2014), members here
- The Linguistic Property branch of the GOLD ontology, members here
We have merged all concepts stemming from sources (1) to (5), and set relations between them, so that terms can be represented as nodes in a single graph, with SKOS relations as edges.
In a second step, we have extended the vocabulary with a manually revised subset (members) of salient term candidates, extracted from a corpus compiled using all English full texts present in the collection used for Elexifinder version 2 (Spring 2021).
We have then extended the vocabulary further, using term extraction results from subsets of our English full texts. So far, this has been done for the field of dictionary digitization (a narrower concept of dictionary making).
The LexVoc main branches as currently defined are listed below. Names of natural languages were included in the vocabulary on an experimental basis. This branch is currently not maintained and is not used as an Elexifinder category, since natural languages are already available as a search filter through wikification (as Elexifinder concepts, not categories).
Elexifinder Categories
Terms that serve as Elexifinder categories belong to the first three skos:broader hierarchy levels below the root concept. Terms deeper in the hierarchy are also considered in article indexing, as are closeMatch terms that have no broader hierarchy of their own, but the category shown on Elexifinder will be the corresponding broader category at the third level below the root.
- Graph view, Elexifinder categories only (upper three hierarchy levels): Query.
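The category rule above can be illustrated with a small sketch. It is not the actual Elexifinder implementation. It assumes that each term has at most one skos:broader parent (SKOS itself allows several), and the function names, the dictionaries and the term ids are hypothetical.

```python
from typing import Dict, List, Optional


def path_to_root(term: str, broader: Dict[str, str], root: str) -> Optional[List[str]]:
    """Follow skos:broader links upwards; return [term, ..., root] or None if root is unreachable."""
    path = [term]
    seen = {term}
    while path[-1] != root:
        parent = broader.get(path[-1])
        if parent is None or parent in seen:  # dead end or cycle
            return None
        path.append(parent)
        seen.add(parent)
    return path


def visible_category(term: str, broader: Dict[str, str], close_match: Dict[str, str],
                     root: str, max_level: int = 3) -> Optional[str]:
    """Return the term shown as category: the term itself if it sits within
    max_level levels below root, otherwise its ancestor at level max_level."""
    # A closeMatch term without a broader link of its own borrows its match's position.
    anchor = term if term in broader else close_match.get(term)
    if anchor is None:
        return None
    path = path_to_root(anchor, broader, root)
    if path is None:
        return None
    depth = len(path) - 1  # number of levels below root
    if depth <= max_level:
        return anchor
    return path[-1 - max_level]  # path[-1] is root (level 0)


# Hypothetical chain: root > A > B > C > D > E, plus X closeMatch E.
broader = {"A": "root", "B": "A", "C": "B", "D": "C", "E": "D"}
close_match = {"X": "E"}
print(visible_category("B", broader, close_match, "root"))
print(visible_category("E", broader, close_match, "root"))
print(visible_category("X", broader, close_match, "root"))
```

In this sketch, a closeMatch term matched to a concept within the first three levels is reported under the matched concept rather than under its own id; whether Elexifinder behaves this way is an assumption.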
LexVoc main branches
The following concepts are directly linked to the root node “Lexicography” using skos:broader. These are the Elexifinder main categories. The two narrower levels below each top-level concept are also treated as (visible) Elexifinder categories. Click on the “Graph” links to get a graph representation with the top-level concept as root node.
- Dictionary Structure (Graph)
- Dictionary Type (Graph)
- Dictionary Making (Graph)
- Dictionary Use (Graph)
- Dictionary Function (Graph)
- Dictionary Distribution Type (Graph)
- Linguistic Property (Graph)
- NLP / Corpus Linguistics (Graph)
- Software <TBD> (Graph)
Update Workflow
As soon as a new term is linked to a member of the main SKOS graph using skos:broader or skos:closeMatch, it is taken into account in subsequent iterations of article full-text indexing.
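A minimal sketch of this eligibility rule, under stated assumptions: links are given as (term, relation, target) triples, membership in the main graph is propagated transitively along skos:broader only, and a skos:closeMatch link makes a term eligible without making it a graph member. The function name and the sample ids are hypothetical.

```python
from typing import Iterable, Set, Tuple

Link = Tuple[str, str, str]  # (term, relation, target)


def indexable_terms(links: Iterable[Link], root: str) -> Set[str]:
    """Return all terms that would be considered in full-text indexing."""
    links = list(links)
    members = {root}
    changed = True
    # Grow the main graph until no further skos:broader link attaches a new term.
    while changed:
        changed = False
        for term, relation, target in links:
            if relation == "skos:broader" and target in members and term not in members:
                members.add(term)
                changed = True
    # closeMatch terms are eligible when their match is a member of the main graph.
    matched = {term for term, relation, target in links
               if relation == "skos:closeMatch" and target in members}
    return members | matched


links = [
    ("a", "skos:broader", "root"),
    ("b", "skos:broader", "a"),
    ("x", "skos:closeMatch", "b"),
    ("y", "skos:broader", "orphan"),  # not connected to the main graph
]
print(sorted(indexable_terms(links, "root")))
```

The term `y` stays out of the result because its only link points to a concept that is not connected to the root.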