LexVoc: Difference between revisions

From LexBib
mNo edit summary
(17 intermediate revisions by the same user not shown)
Line 2: Line 2:
The vocabulary is being developed for different purposes:
The vocabulary is being developed for different purposes:


* [[Elexifinder]] ''categories'', complementary to [[Elexifinder]] ''concepts'' (obtained through automatic wikifikation).
* [[Elexifinder]] ''categories'' are derived from LexVoc, and are available for search and filter functions in the [[Elexifinder]] app, complementary to [[Elexifinder]] ''concepts'' (obtained through automatic wikifikation).
* Content describing indexation of [[LexBib Zotero]] bibliographical items.
* Content describing indexation of [[LexBib Zotero]] bibliographical items.
* Concepts in a tree graph with SKOS relations as edges, and “[[Item:Q1|Lexicography]]” as root node. This concept tree can be seen as ramification of the “Lexicography” subject heading as listed in cross-domain library vocabularies; see e.g. “[https://id.loc.gov/authorities/subjects/sh85076357.html Lexicography]” in [https://id.loc.gov/authorities/subjects.html LCSH], or “[https://www.blldb-online.de/blldb/suche/kapitel_vollsuche.xml?DD1=11&SF1=&OP1=and&DD2=1&SF2=&OP2=and&DD3=5&SF3=&OP3=and&DD4=17&SF4=Lexicography&JV=1971&JB=2009&PT=0&language=0&S=3&A=10&x=24&y=6&Sprache=de&Skript=ergebnis&modus=1&step=1&Complexity=0&contenttype=text%2Fhtml&like=1&vid=8B958983-3E0C-42AB-8568-C772D3418EFE Lexicography]” in [https://data.linguistik.de/bll/index.html BLL].
* Concepts in a tree graph with SKOS relations as edges, and “[[Item:Q1|Lexicography]]” as root node. This concept tree can be seen as ramification of the “Lexicography” subject heading as listed in cross-domain library vocabularies; see e.g. “[https://id.loc.gov/authorities/subjects/sh85076357.html Lexicography]” in [https://id.loc.gov/authorities/subjects.html LCSH], or “[https://www.blldb-online.de/blldb/suche/kapitel_vollsuche.xml?DD1=11&SF1=&OP1=and&DD2=1&SF2=&OP2=and&DD3=5&SF3=&OP3=and&DD4=17&SF4=Lexicography&JV=1971&JB=2009&PT=0&language=0&S=3&A=10&x=24&y=6&Sprache=de&Skript=ergebnis&modus=1&step=1&Complexity=0&contenttype=text%2Fhtml&like=1&vid=8B958983-3E0C-42AB-8568-C772D3418EFE Lexicography]” in [https://data.linguistik.de/bll/index.html BLL].
Line 24: Line 24:
We have then extended the vocabulary further, using term extraction results from subsets of our English full texts. This has been done for the field of [[Item:Q15007|dictionary digitization]] (narrower of [[Item:Q14318|dictionary making]]).
We have then extended the vocabulary further, using term extraction results from subsets of our English full texts. This has been done for the field of [[Item:Q15007|dictionary digitization]] (narrower of [[Item:Q14318|dictionary making]]).


LexVoc main branches as they are defined today are listed below. Names of natural languages have experimentally been part of the vocabulary. This branch currently is not maintained, and not used as [[Elexifinder]] category, since natural languages as search filter are available through wikification ([[Elexifinder]] ''concepts'', not ''categories''.)
LexVoc main branches as they are defined today are listed below. Names of natural languages have been part of the vocabulary in LexBib experimental version 2, and articles have been indexed with language names found in their full texts ([https://data.lexbib.org/wiki/Item:Q385 example]). This branch currently is not maintained, and not used as [[Elexifinder]] category, since natural languages as search filter are available through wikification ([[Elexifinder]] ''concepts'', not ''categories''.)


== Elexifinder Categories ==
== Elexifinder Categories ==


Terms that serve as [[Elexifinder]] category belong to the first three [[Property:P72|skos:broader]] hierarchy levels below the [[Item:Q1|root concept]]. Terms deeper in the hierarchy are considered in article indexation, and so are closeMatch terms without own broader-hierarchy, but the assigned category visible on Elexifinder will be the corresponding broader category of the third level below root.
Terms that serve as [[Elexifinder]] category belong to the first three [[Property:P72|skos:broader]] hierarchy levels below the [[Item:Q1|root concept]]. Terms deeper in the hierarchy are considered in article indexation, and so are [[Property:P77|closeMatch]] terms without own broader-hierarchy, but the assigned category visible on [[Elexifinder]] will be the corresponding broader category of the third level below root.


* Graph view, Elexifinder categories only (upper three hierarchy levels): [https://lexbib.elex.is/query/#%23defaultView%3AGraph%0A%23%20this%20shows%20only%20three%20levels%20of%20skos%3Abroader%20relation%20below%20the%20specified%20root%20node.%0APREFIX%20lwb%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fentity%2F%3E%0APREFIX%20ldp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fdirect%2F%3E%0APREFIX%20lp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2F%3E%0APREFIX%20lps%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fstatement%2F%3E%0APREFIX%20lpq%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fqualifier%2F%3E%0APREFIX%20lpr%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Freference%2F%3E%0APREFIX%20lno%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fnovalue%2F%3E%0A%0A%0A%0Aselect%20distinct%20%3Fs%20%3FsLabel%20%3Ft%20%3FtLabel%0A%0Awhere%20%7B%0A%20%20%3Fs%20ldp%3AP5%20lwb%3AQ7.%0A%20%20%0A%20%20%7B%20lwb%3AQ1%20ldp%3AP73%2B%20%3Ft.%20%7D%20%0A%20%20UNION%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%0A%20%20%7B%20lwb%3AQ1%20ldp%3AP73%2Fldp%3AP73%20%3Ft.%20%7D%20%0A%20%20UNION%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%0A%20%20%7B%20lwb%3AQ1%20ldp%3AP73%2Fldp%3AP73%2Fldp%3AP73%20%3Ft.%20%7D%20%0A%20%20%0A%20%20%3Ft%20ldp%3AP72%20%3Fs.%0A%20%20%3Fs%20rdfs%3Alabel%20%3FsLabel%20.%20FILTER%20%28lang%28%3FsLabel%29%3D%22en%22%29%0A%20%20%3Ft%20rdfs%3Alabel%20%3FtLabel%20.%20FILTER%20%28lang%28%3FtLabel%29%3D%22en%22%29%0A%20%20%0A%20%20%7D Query].
* Graph view, [[Elexifinder]] categories only (upper three hierarchy levels): [https://lexbib.elex.is/query/#%23defaultView%3AGraph%0A%23%20this%20shows%20only%20three%20levels%20of%20skos%3Abroader%20relation%20below%20Q1.%0APREFIX%20lwb%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fentity%2F%3E%0APREFIX%20ldp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fdirect%2F%3E%0APREFIX%20lp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2F%3E%0APREFIX%20lps%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fstatement%2F%3E%0APREFIX%20lpq%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fqualifier%2F%3E%0APREFIX%20lpr%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Freference%2F%3E%0APREFIX%20lno%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fnovalue%2F%3E%0A%0Aselect%20distinct%20%0A%3Fs%20%3FsLabel%20%0A%23%3Fdistance%20%0A%3Frgb%20%0A%3Ft%20%3FtLabel%0A%0Awhere%20%7B%0A%20%20%3Fs%20ldp%3AP5%20lwb%3AQ7.%0A%20%20%0A%20%20%7B%20lwb%3AQ1%20ldp%3AP73%20%3Ft.%20%7D%20%0A%20%20UNION%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%0A%20%20%7B%20lwb%3AQ1%20ldp%3AP73%2Fldp%3AP73%20%3Ft.%20%7D%20%0A%20%20UNION%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%0A%20%20%7B%20lwb%3AQ1%20ldp%3AP73%2Fldp%3AP73%2Fldp%3AP73%20%3Ft.%20%7D%20%0A%20%20%0A%20%20%3Ft%20ldp%3AP72%20%3Fs.%0A%20%20%3Fs%20rdfs%3Alabel%20%3FsLabel%20.%20FILTER%20%28lang%28%3FsLabel%29%3D%22en%22%29%0A%20%20%3Ft%20rdfs%3Alabel%20%3FtLabel%20.%20FILTER%20%28lang%28%3FtLabel%29%3D%22en%22%29%0A%20%20%0A%20%20%23%20distance%20from%20root%20node%20%22Lexicography%22%20%28number%20of%20broader%20concepts%29%0A%20%20%7B%20select%20%3Fs%20%28count%20%28%3Fbroader%29%20as%20%3Fdistance%29%20where%20%7B%0A%20%20%20%20%20%20OPTIONAL%20%7B%3Fs%20ldp%3AP72%2B%20%3Fbroader%20.%20%7D%7D%20GROUP%20BY%20%3Fs%20%3Fdistance%20%7D%0A%0A%20%20%23%20colouring%0A%20%20BIND%20%28%0A%20%20COALESCE%28%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%221%22%20%2C%20%22CCFFE5%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%222%22%20%2C%20%22FF9999%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%223%22%20%2C%20%22FFB266%22%2C%201%2F0%29%2C%0A%20%20%20%23%20IF%28str%28%3Fdistance%29%3D%223%22%20%2C%20%22FFFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20%22FFFF99%22%0A%20%20%29%20AS%20%3Frgb%0A%20%20%29%0A%20%20%7D%20GROUP%20BY%20%0A%20%20%3Fs%20%3FsLabel%20%20%20%0A%20%20%23%3Fdistance%0A%20%20%3Frgb%20%0A%20%20%3Ft%20%3FtLabel%0A%20%20 Query].


== LexVoc main branches ==
== LexVoc main branches ==


The following concepts are directly linked to root node “Lexicography” using [[Property:P72|skos:broader]]. These are [[Elexifinder]] main categories. Two narrower levels below each top-level concept are also considered as (visible) [[Elexifinder]] category. Click on the “Graph” links to get a graph representation, with the top-level concept as root node.
The following concepts are directly linked to [[Item:Q1|root node “Lexicography”]] using [[Property:P72|skos:broader]]. These are [[Elexifinder]] main categories. Two narrower levels below each top-level concept are also considered as (visible) [[Elexifinder]] category. Click on the “Graph” links to get a graph representation, with the top-level concept as root node.


* Dictionary Structure ([https://lexbib.elex.is/query/#%23defaultView%3AGraph%0A%23%20Graph%20view%20with%20specified%20root%20node%0APREFIX%20lwb%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fentity%2F%3E%0APREFIX%20ldp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fdirect%2F%3E%0APREFIX%20lp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2F%3E%0APREFIX%20lps%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fstatement%2F%3E%0APREFIX%20lpq%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fqualifier%2F%3E%0APREFIX%20lpr%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Freference%2F%3E%0APREFIX%20lno%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fnovalue%2F%3E%0A%0Aselect%20%3Fs%20%3FsLabel%0A%3Frgb%0A%3FedgeLabel%20%3Fo%20%3FoLabel%20where%20%7B%0A%20%20%0A%20%20BIND%20%28lwb%3AQ14291%20as%20%3Froot%29%20%23%20specify%20root%20node%20for%20the%20representation%20here%2C%20e.g.%20%22lwb%3AQ1%22%20for%20%22Lexicography%22%0A%20%20%0A%20%20%3Fo%20ldp%3AP5%20lwb%3AQ7.%0A%20%20%3Froot%20ldp%3AP73%2a%20%3Fo.%0A%20%20%3Fs%20%3Fp%20%3Fo%20.%0A%20%20%20%20%0A%20%20%7B%3Froot%20ldp%3AP73%2a%20%3Fs%20.%7D%20UNION%20%7B%3Fs%20ldp%3AP77%7Cldp%3AP5%20%3Fo.%7D%0A%20%0A%20%20%3Fs%20rdfs%3Alabel%20%3FsLabel%20.%20FILTER%20%28lang%28%3FsLabel%29%3D%22en%22%29%0A%20%20%3Fedge%20wikibase%3AdirectClaim%20%3Fp%20.%0A%20%20FILTER%20%28%3Fp%20%3D%20ldp%3AP73%20%7C%7C%20%3Fp%20%3D%20ldp%3AP77%20%7C%7C%20%3Fp%20%3D%20ldp%3AP5%29%20%23%20skos%3Anarrower%20%2F%20skos%3AcloseMatch%20%2F%20instance%20of%0A%20%20%3Fedge%20rdfs%3Alabel%20%3FedgeLabel%20.%0A%20%20%3Fo%20rdfs%3Alabel%20%3FoLabel%20.%20FILTER%20%28lang%28%3FoLabel%29%3D%22en%22%29%0A%20%0A%20%20%23%20distance%20from%20root%20node%20%22Lexicography%22%20%28number%20of%20broader%20concepts%29%0A%20%20%7B%20select%20%3Fs%20%28count%20%28%3Fbroader%29%20as%20%3Fdistance%29%20where%20%7B%0A%20%20%20%20%20%20OPTIONAL%20%7B%3Fs%20ldp%3AP72%2B%20%3Fbroader%20.%20%7D%7D%20GROUP%20BY%20%3Fs%20%3Fdistance%20%7D%0A%0A%20%20%23%20colouring%0A%20%20BIND%20%28%0A%20%20COALESCE%28%0A%20%20%20%20IF%28%3Fs%20%3D%20%3Froot%20%2C%20%220000CC%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28%3Fp%20%3D%20ldp%3AP77%2C%20%22CCFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28%3Fp%20%3D%20ldp%3AP5%2C%20%22CCFFE5%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%221%22%20%2C%20%22FF9999%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%222%22%20%2C%20%22FFB266%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%223%22%20%2C%20%22FFFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%224%22%20%2C%20%22CCFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%225%22%20%2C%20%22CCFFE5%22%2C%201%2F0%29%2C%0A%20%20%20%20%22FFFF99%22%0A%20%20%29%20AS%20%3Frgb%0A%29%0A%20%20%7D%20GROUP%20BY%20%3Fs%20%3FsLabel%20%3Fdistance%20%3Frgb%20%3FedgeLabel%20%3Fo%20%3FoLabel%0A Graph])
* Dictionary Structure ([https://lexbib.elex.is/query/#%23defaultView%3AGraph%0A%23%20Graph%20view%20with%20specified%20root%20node%0APREFIX%20lwb%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fentity%2F%3E%0APREFIX%20ldp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fdirect%2F%3E%0APREFIX%20lp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2F%3E%0APREFIX%20lps%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fstatement%2F%3E%0APREFIX%20lpq%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fqualifier%2F%3E%0APREFIX%20lpr%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Freference%2F%3E%0APREFIX%20lno%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fnovalue%2F%3E%0A%0Aselect%20%3Fs%20%3FsLabel%0A%3Frgb%0A%3FedgeLabel%20%3Fo%20%3FoLabel%20where%20%7B%0A%20%20%0A%20%20BIND%20%28lwb%3AQ14291%20as%20%3Froot%29%20%23%20specify%20root%20node%20for%20the%20representation%20here%2C%20e.g.%20%22lwb%3AQ1%22%20for%20%22Lexicography%22%0A%20%20%0A%20%20%3Fo%20ldp%3AP5%20lwb%3AQ7.%0A%20%20%3Froot%20ldp%3AP73%2a%20%3Fo.%0A%20%20%3Fs%20%3Fp%20%3Fo%20.%0A%20%20%20%20%0A%20%20%7B%3Froot%20ldp%3AP73%2a%20%3Fs%20.%7D%20UNION%20%7B%3Fs%20ldp%3AP77%7Cldp%3AP5%20%3Fo.%7D%0A%20%0A%20%20%3Fs%20rdfs%3Alabel%20%3FsLabel%20.%20FILTER%20%28lang%28%3FsLabel%29%3D%22en%22%29%0A%20%20%3Fedge%20wikibase%3AdirectClaim%20%3Fp%20.%0A%20%20FILTER%20%28%3Fp%20%3D%20ldp%3AP73%20%7C%7C%20%3Fp%20%3D%20ldp%3AP77%20%7C%7C%20%3Fp%20%3D%20ldp%3AP5%29%20%23%20skos%3Anarrower%20%2F%20skos%3AcloseMatch%20%2F%20instance%20of%0A%20%20%3Fedge%20rdfs%3Alabel%20%3FedgeLabel%20.%0A%20%20%3Fo%20rdfs%3Alabel%20%3FoLabel%20.%20FILTER%20%28lang%28%3FoLabel%29%3D%22en%22%29%0A%20%0A%20%20%23%20distance%20from%20root%20node%20%22Lexicography%22%20%28number%20of%20broader%20concepts%29%0A%20%20%7B%20select%20%3Fs%20%28count%20%28%3Fbroader%29%20as%20%3Fdistance%29%20where%20%7B%0A%20%20%20%20%20%20OPTIONAL%20%7B%3Fs%20ldp%3AP72%2B%20%3Fbroader%20.%20%7D%7D%20GROUP%20BY%20%3Fs%20%3Fdistance%20%7D%0A%0A%20%20%23%20colouring%0A%20%20BIND%20%28%0A%20%20COALESCE%28%0A%20%20%20%20IF%28%3Fs%20%3D%20%3Froot%20%2C%20%220000CC%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28%3Fp%20%3D%20ldp%3AP77%2C%20%22CCFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28%3Fp%20%3D%20ldp%3AP5%2C%20%22CCFFE5%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%221%22%20%2C%20%22FF9999%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%222%22%20%2C%20%22FFB266%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%223%22%20%2C%20%22FFFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%224%22%20%2C%20%22CCFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%225%22%20%2C%20%22CCFFE5%22%2C%201%2F0%29%2C%0A%20%20%20%20%22FFFF99%22%0A%20%20%29%20AS%20%3Frgb%0A%29%0A%20%20%7D%20GROUP%20BY%20%3Fs%20%3FsLabel%20%3Fdistance%20%3Frgb%20%3FedgeLabel%20%3Fo%20%3FoLabel%0A Graph])
Line 46: Line 46:
* Software <TBD> ([https://lexbib.elex.is/query/#%23defaultView%3AGraph%0A%23%20Graph%20view%20with%20specified%20root%20node%0APREFIX%20lwb%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fentity%2F%3E%0APREFIX%20ldp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fdirect%2F%3E%0APREFIX%20lp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2F%3E%0APREFIX%20lps%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fstatement%2F%3E%0APREFIX%20lpq%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fqualifier%2F%3E%0APREFIX%20lpr%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Freference%2F%3E%0APREFIX%20lno%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fnovalue%2F%3E%0A%0Aselect%20%3Fs%20%3FsLabel%0A%3Frgb%0A%3FedgeLabel%20%3Fo%20%3FoLabel%20where%20%7B%0A%20%20%0A%20%20BIND%20%28lwb%3AQ14281%20as%20%3Froot%29%20%23%20specify%20root%20node%20for%20the%20representation%20here%2C%20e.g.%20%22lwb%3AQ1%22%20for%20%22Lexicography%22%0A%20%20%0A%20%20%3Fo%20ldp%3AP5%20lwb%3AQ7.%0A%20%20%3Froot%20ldp%3AP73%2a%20%3Fo.%0A%20%20%3Fs%20%3Fp%20%3Fo%20.%0A%20%20%20%20%0A%20%20%7B%3Froot%20ldp%3AP73%2a%20%3Fs%20.%7D%20UNION%20%7B%3Fs%20ldp%3AP77%7Cldp%3AP5%20%3Fo.%7D%0A%20%0A%20%20%3Fs%20rdfs%3Alabel%20%3FsLabel%20.%20FILTER%20%28lang%28%3FsLabel%29%3D%22en%22%29%0A%20%20%3Fedge%20wikibase%3AdirectClaim%20%3Fp%20.%0A%20%20FILTER%20%28%3Fp%20%3D%20ldp%3AP73%20%7C%7C%20%3Fp%20%3D%20ldp%3AP77%20%7C%7C%20%3Fp%20%3D%20ldp%3AP5%29%20%23%20skos%3Anarrower%20%2F%20skos%3AcloseMatch%20%2F%20instance%20of%0A%20%20%3Fedge%20rdfs%3Alabel%20%3FedgeLabel%20.%0A%20%20%3Fo%20rdfs%3Alabel%20%3FoLabel%20.%20FILTER%20%28lang%28%3FoLabel%29%3D%22en%22%29%0A%20%0A%20%20%23%20distance%20from%20root%20node%20%22Lexicography%22%20%28number%20of%20broader%20concepts%29%0A%20%20%7B%20select%20%3Fs%20%28count%20%28%3Fbroader%29%20as%20%3Fdistance%29%20where%20%7B%0A%20%20%20%20%20%20OPTIONAL%20%7B%3Fs%20ldp%3AP72%2B%20%3Fbroader%20.%20%7D%7D%20GROUP%20BY%20%3Fs%20%3Fdistance%20%7D%0A%0A%20%20%23%20colouring%0A%20%20BIND%20%28%0A%20%20COALESCE%28%0A%20%20%20%20IF%28%3Fs%20%3D%20%3Froot%20%2C%20%220000CC%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28%3Fp%20%3D%20ldp%3AP77%2C%20%22CCFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28%3Fp%20%3D%20ldp%3AP5%2C%20%22CCFFE5%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%221%22%20%2C%20%22FF9999%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%222%22%20%2C%20%22FFB266%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%223%22%20%2C%20%22FFFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%224%22%20%2C%20%22CCFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%225%22%20%2C%20%22CCFFE5%22%2C%201%2F0%29%2C%0A%20%20%20%20%22FFFF99%22%0A%20%20%29%20AS%20%3Frgb%0A%29%0A%20%20%7D%20GROUP%20BY%20%3Fs%20%3FsLabel%20%3Fdistance%20%3Frgb%20%3FedgeLabel%20%3Fo%20%3FoLabel%0A Graph])
* Software <TBD> ([https://lexbib.elex.is/query/#%23defaultView%3AGraph%0A%23%20Graph%20view%20with%20specified%20root%20node%0APREFIX%20lwb%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fentity%2F%3E%0APREFIX%20ldp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fdirect%2F%3E%0APREFIX%20lp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2F%3E%0APREFIX%20lps%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fstatement%2F%3E%0APREFIX%20lpq%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fqualifier%2F%3E%0APREFIX%20lpr%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Freference%2F%3E%0APREFIX%20lno%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fnovalue%2F%3E%0A%0Aselect%20%3Fs%20%3FsLabel%0A%3Frgb%0A%3FedgeLabel%20%3Fo%20%3FoLabel%20where%20%7B%0A%20%20%0A%20%20BIND%20%28lwb%3AQ14281%20as%20%3Froot%29%20%23%20specify%20root%20node%20for%20the%20representation%20here%2C%20e.g.%20%22lwb%3AQ1%22%20for%20%22Lexicography%22%0A%20%20%0A%20%20%3Fo%20ldp%3AP5%20lwb%3AQ7.%0A%20%20%3Froot%20ldp%3AP73%2a%20%3Fo.%0A%20%20%3Fs%20%3Fp%20%3Fo%20.%0A%20%20%20%20%0A%20%20%7B%3Froot%20ldp%3AP73%2a%20%3Fs%20.%7D%20UNION%20%7B%3Fs%20ldp%3AP77%7Cldp%3AP5%20%3Fo.%7D%0A%20%0A%20%20%3Fs%20rdfs%3Alabel%20%3FsLabel%20.%20FILTER%20%28lang%28%3FsLabel%29%3D%22en%22%29%0A%20%20%3Fedge%20wikibase%3AdirectClaim%20%3Fp%20.%0A%20%20FILTER%20%28%3Fp%20%3D%20ldp%3AP73%20%7C%7C%20%3Fp%20%3D%20ldp%3AP77%20%7C%7C%20%3Fp%20%3D%20ldp%3AP5%29%20%23%20skos%3Anarrower%20%2F%20skos%3AcloseMatch%20%2F%20instance%20of%0A%20%20%3Fedge%20rdfs%3Alabel%20%3FedgeLabel%20.%0A%20%20%3Fo%20rdfs%3Alabel%20%3FoLabel%20.%20FILTER%20%28lang%28%3FoLabel%29%3D%22en%22%29%0A%20%0A%20%20%23%20distance%20from%20root%20node%20%22Lexicography%22%20%28number%20of%20broader%20concepts%29%0A%20%20%7B%20select%20%3Fs%20%28count%20%28%3Fbroader%29%20as%20%3Fdistance%29%20where%20%7B%0A%20%20%20%20%20%20OPTIONAL%20%7B%3Fs%20ldp%3AP72%2B%20%3Fbroader%20.%20%7D%7D%20GROUP%20BY%20%3Fs%20%3Fdistance%20%7D%0A%0A%20%20%23%20colouring%0A%20%20BIND%20%28%0A%20%20COALESCE%28%0A%20%20%20%20IF%28%3Fs%20%3D%20%3Froot%20%2C%20%220000CC%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28%3Fp%20%3D%20ldp%3AP77%2C%20%22CCFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28%3Fp%20%3D%20ldp%3AP5%2C%20%22CCFFE5%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%221%22%20%2C%20%22FF9999%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%222%22%20%2C%20%22FFB266%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%223%22%20%2C%20%22FFFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%224%22%20%2C%20%22CCFF99%22%2C%201%2F0%29%2C%0A%20%20%20%20IF%28str%28%3Fdistance%29%3D%225%22%20%2C%20%22CCFFE5%22%2C%201%2F0%29%2C%0A%20%20%20%20%22FFFF99%22%0A%20%20%29%20AS%20%3Frgb%0A%29%0A%20%20%7D%20GROUP%20BY%20%3Fs%20%3FsLabel%20%3Fdistance%20%3Frgb%20%3FedgeLabel%20%3Fo%20%3FoLabel%0A Graph])


== Update Workflow ==
== LexVoc development and full text indexation process ==


As soon as a new term is linked to a member of the main SKOS graph using [[Property:P72|skos:broader]] or [[Property:P77|skos:closeMatch]], it is considered in subsequent iterations of article full text indexation.
LexVoc, in its actual state, is by no means complete nor finished. It is ongoing work. On Elexis LexMeet, you can contribute to the [https://meet.elex.is/groups/lexicographic-concepts-vocabulary/ discussion about LexVoc structure], i.e. the definition of main branches, and the inner organization of these. We also call for collaboration for [[LexVoc translation on Lexonomy]] (see our [https://lexbib.org/Kosem-Lindemann-fullpaper-printed.pdf 2021 Euralex paper]).
 
Regarding inclusion of new terms: As soon as a new term (i.e. a member of [[Item:Q7|Class "Term"]], and of at least one [[Item:Q33|skos:Collection]]) is linked to a member of the main SKOS graph using [[Property:P72|skos:broader]] or [[Property:P77|skos:closeMatch]], it is considered in subsequent iterations of article full text indexation. On the other hand, if a term has no [[Property:P72|skos:broader]] or [[Property:P77|skos:closeMatch]] relation to another term, it is not (any more) considered for article full text indexation. An article full text indexation iteration (performed locally by [[User:DavidL]]) thus reflects the state of LexVoc at that particular point of time.
 
Full text indexation with LexVoc terms, version 3, is currently in preparation. In this version 3, at the moment, you can see in how many articles a term (i.e., at least one of the labels of that term) have been found in LexBib full texts (see below). You can see single articles and associated LexVoc terms in version 2 (on a single bibliographical item's wikipage ([[Item:Q13936|example]]), click on the value of "LexBib v2 legacy ID".)
 
In how many articles have we found LexVoc terms (narrowers of "[[Item:Q1|Lexicography]]")? [https://lexbib.elex.is/query/#PREFIX%20lwb%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fentity%2F%3E%0APREFIX%20ldp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fdirect%2F%3E%0APREFIX%20lp%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2F%3E%0APREFIX%20lps%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fstatement%2F%3E%0APREFIX%20lpq%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fqualifier%2F%3E%0APREFIX%20lpr%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Freference%2F%3E%0APREFIX%20lno%3A%20%3Chttp%3A%2F%2Flexbib.elex.is%2Fprop%2Fnovalue%2F%3E%0A%0Aselect%20%3FTerm%20%3FenPrefLabel%20%28group_concat%28%3FenAltLabel%3BSEPARATOR%3D%22%3B%22%29%20as%20%3FenAltLabels%29%20%3Fcorpus_hits%0A%0Awhere%20%7B%0A%20%20%3FTerm%20ldp%3AP5%20lwb%3AQ7%3B%0A%20%20rdfs%3Alabel%20%3FenPrefLabel.%20filter%28lang%28%3FenPrefLabel%29%3D%22en%22%29%0A%20%7B%20%3FTerm%20ldp%3AP72%2a%20lwb%3AQ1%20.%7D%20%23%20present%20in%20narrower-broader-tree%20with%20%22Lexicography%22%20as%20root%20node%0A%20%20%20UNION%0A%20%7B%20%3FTerm%20ldp%3AP77%20%3FcloseMatch.%20%3FcloseMatch%20ldp%3AP72%2a%20lwb%3AQ1%20.%20%7D%20%23%20includes%20closeMatch%20items%20without%20own%20broader-rels%0A%20%0A%20OPTIONAL%20%7B%20%3FTerm%20skos%3AaltLabel%20%3FenAltLabel.%0A%20%20%20%20%20%20%20%20%20%20%20%20filter%28lang%28%3FenAltLabel%29%3D%22en%22%29%20%7D%0A%20OPTIONAL%20%7B%20%3FTerm%20ldp%3AP109%20%3Fcorpus_hits.%7D%0A%0A%20%20%7D%20group%20by%20%3FTerm%20%3FenPrefLabel%20%3Fcorpus_hits%0A%20%20%20%20order%20by%20DESC%20%28xsd%3Ainteger%28%3Fcorpus_hits%29%29 Query].
 
== Stop-labels ==
 
Some term lexicalizations (SKOS ''labels'') are very ambiguous, and do not yield proper results (many false positives and/or too many distinct word senses relevant in the domain of Lexicography). At the time being, the approach towards that problem is a label stop-list, which is accessible [https://github.com/elexis-eu/elexifinder/blob/master/wikibase/stopterms.py here]. These term labels can be excluded in article full text indexation (if a term has several labels, the other labels are not affected); but currently, in order to review the stoplist, they aren't excluded.

Revision as of 08:36, 13 September 2021

LexVoc Vocabulary of Lexicographic Terms

The vocabulary is being developed for different purposes:

  • Elexifinder categories are derived from LexVoc, and are available for search and filter functions in the Elexifinder app, complementary to Elexifinder concepts (obtained through automatic wikifikation).
  • Content describing indexation of LexBib Zotero bibliographical items.
  • Concepts in a tree graph with SKOS relations as edges, and “Lexicography” as root node. This concept tree can be seen as ramification of the “Lexicography” subject heading as listed in cross-domain library vocabularies; see e.g. “Lexicography” in LCSH, or “Lexicography” in BLL.

LexVoc terms have English preferred and alternative lexicalizations; relations between them are represented according to the W3C SKOS standard. Lexicalizations in other languages have been drafted from BabelNet and Wikidata, and will be manually validated and completed (see LexVoc translation on Lexonomy).

Sources for LexVoc have been the following:

  1. An updated and extended version of the index of Bibliografía Temática de la Lexicografía (Córdoba Rodríguez 2003) translated to English, members here
  2. The typology of dictionaries by Engelberg and Storrer (2016), members here
  3. The Glossary of Lexicographic Terms by Kipfer (2013), members here
  4. The index of the volume Using Online Dictionaries (Müller-Spitzer 2014), members here
  5. The Linguistic Property branch of the GOLD ontology, members here

We have merged all concepts stemming from sources (1) to (5), and set relations between them, so that terms can be represented as nodes in a single graph, with SKOS relations as edges.

In a second step, we have extended the vocabulary with a manually revised subset (members) of salient term candidates, extracted from a corpus compiled using all English full texts present in the collection used for Elexifinder version 2 (Spring 2021).

We have then extended the vocabulary further, using term extraction results from subsets of our English full texts. This has been done for the field of dictionary digitization (narrower of dictionary making).

LexVoc main branches as they are defined today are listed below. Names of natural languages have been part of the vocabulary in LexBib experimental version 2, and articles have been indexed with language names found in their full texts (example). This branch currently is not maintained, and not used as Elexifinder category, since natural languages as search filter are available through wikification (Elexifinder concepts, not categories.)

Elexifinder Categories

Terms that serve as Elexifinder category belong to the first three skos:broader hierarchy levels below the root concept. Terms deeper in the hierarchy are considered in article indexation, and so are closeMatch terms without own broader-hierarchy, but the assigned category visible on Elexifinder will be the corresponding broader category of the third level below root.

LexVoc main branches

The following concepts are directly linked to root node “Lexicography” using skos:broader. These are Elexifinder main categories. Two narrower levels below each top-level concept are also considered as (visible) Elexifinder category. Click on the “Graph” links to get a graph representation, with the top-level concept as root node.

  • Dictionary Structure (Graph)
  • Dictionary Type (Graph)
  • Dictionary Making (Graph)
  • Dictionary Use (Graph)
  • Dictionary Function (Graph)
  • Dictionary Distribution Type (Graph)
  • Linguistic Property (Graph)
  • NLP / Corpus Linguistics (Graph)
  • Software <TBD> (Graph)

LexVoc development and full text indexation process

LexVoc, in its actual state, is by no means complete nor finished. It is ongoing work. On Elexis LexMeet, you can contribute to the discussion about LexVoc structure, i.e. the definition of main branches, and the inner organization of these. We also call for collaboration for LexVoc translation on Lexonomy (see our 2021 Euralex paper).

Regarding inclusion of new terms: As soon as a new term (i.e. a member of Class "Term", and of at least one skos:Collection) is linked to a member of the main SKOS graph using skos:broader or skos:closeMatch, it is considered in subsequent iterations of article full text indexation. On the other hand, if a term has no skos:broader or skos:closeMatch relation to another term, it is not (any more) considered for article full text indexation. An article full text indexation iteration (performed locally by User:DavidL) thus reflects the state of LexVoc at that particular point of time.

Full text indexation with LexVoc terms, version 3, is currently in preparation. In this version 3, at the moment, you can see in how many articles a term (i.e., at least one of the labels of that term) have been found in LexBib full texts (see below). You can see single articles and associated LexVoc terms in version 2 (on a single bibliographical item's wikipage (example), click on the value of "LexBib v2 legacy ID".)

In how many articles have we found LexVoc terms (narrowers of "Lexicography")? Query.

Stop-labels

Some term lexicalizations (SKOS labels) are very ambiguous, and do not yield proper results (many false positives and/or too many distinct word senses relevant in the domain of Lexicography). At the time being, the approach towards that problem is a label stop-list, which is accessible here. These term labels can be excluded in article full text indexation (if a term has several labels, the other labels are not affected); but currently, in order to review the stoplist, they aren't excluded.