Anonymous

LexBib bibliodata workflow overview: Difference between revisions

From LexBib
no edit summary
No edit summary
Line 91: Line 91:


A [https://github.com/elexis-eu/elexifinder/tree/master/wikibase/lexvoc-lexonomy set of python scripts] performs transformations from and to Lexonomy XML format. This is needed for [[LexVoc translation on Lexonomy]].
A [https://github.com/elexis-eu/elexifinder/tree/master/wikibase/lexvoc-lexonomy set of python scripts] performs transformations from and to Lexonomy XML format. This is needed for [[LexVoc translation on Lexonomy]].
* [https://github.com/elexis-eu/elexifinder/blob/master/wikibase/lexvoc-lexonomy/buildlexonomy.py buildlexonomy.py] builds 38 bilingual Lexonomy XML dictionaries with LexVoc data.
* [https://github.com/elexis-eu/elexifinder/blob/master/wikibase/lexvoc-lexonomy/buildlexonomy.py buildlexonomy.py] builds 38 bilingual Lexonomy XML dictionaries out of LexVoc SKOS data.
* [https://github.com/elexis-eu/elexifinder/blob/master/wikibase/lexvoc-lexonomy/mergeddict2lwb.py mergeddict3lwb.py] collects translation equivalents from Lexonomy XML (merged on Lexonomy server), and writes them to LexBib wikibase.
* [https://github.com/elexis-eu/elexifinder/blob/master/wikibase/lexvoc-lexonomy/mergeddict2lwb.py mergeddict3lwb.py] collects translation equivalents from Lexonomy XML (merged on Lexonomy server), and writes them to LexBib wikibase.
* [https://github.com/elexis-eu/elexifinder/blob/master/wikibase/lexvoc-lexonomy/getdicts.py getdicts.py] collects translation equivalents from Lexonomy XML (single dictionary).
* [https://github.com/elexis-eu/elexifinder/blob/master/wikibase/lexvoc-lexonomy/getdicts.py getdicts.py] collects translation equivalents from Lexonomy XML (single dictionary).
* [https://github.com/elexis-eu/elexifinder/blob/master/wikibase/lexvoc-lexonomy/statsfrommergeddict.py statsfrommergeddict.py] and getstats.py produce data rows about translation progress
* [https://github.com/elexis-eu/elexifinder/blob/master/wikibase/lexvoc-lexonomy/statsfrommergeddict.py statsfrommergeddict.py] and getstats.py produce data rows about translation progress
= Outlook =
The following tasks are planned, and awaiting a detailed workflow design:
* Alignment of [[Item:Q5|person]] (and [[Item:Q11|organization]]) items to Wikidata and VIAF:
** This can be done using Open Refine. An experiment using a subset of LexBib showed that about 25% of LexBib persons are found on Wikidata, and around 40% on VIAF. Person entity data on Wikidata contains ORCID identifiers, among other person metadata, like birth (and death) date, affiliations, etc. Person entity data on VIAF contains reference to authored publications (of all domains).
** For persons not found on Wikidata, new Wikidata person items can be created.
* Alignment of [[Item:Q6|event]] items to Wikidata:
** This has been done for EURALEX and eLex conference series (Wikidata items have been created and described, [https://www.wikidata.org/wiki/Q100594538 example]).
* Alignment of [[Item:Q3|bibliographical items]] to Wikidata (creation of Wikidata items and transfer of bibliodata):
** A DOI-matching experiment has revealed that so far less than 1% of LexBib bibliographical items are found on Wikidata.
** A transfer of bibliodata and author items to Wikidata enables a use of tools like [https://scholia.toolforge.org/ Scholia] for lexicographical articles.
** A transfer of bibliodata to Wikidata enables its inclusion in [https://meta.wikimedia.org/wiki/WikiCite WikiCite] and [https://opencitations.net/ OpenCitations], i.e. into open citation graphs.
** Registering DOI (via [https://www.crossref.org/ Crossref] or [https://support.datacite.org/ DataCite], see [https://support.datacite.org/docs/datacite-or-crossref comparison]) for LexBib articles that do not have such an identifier (the vast majority) would include LexBib articles in citation graphs maintained by commercial providers (Web of Science, Scopus).
* Development of a metadata model for Lexical Resources such as dictionaries, and cataloguing of dictionaries:
** Regarding the data model, work is in progress, see [[Dictionaries|Dictionaries in LexBib]], and [[LexMeta]].
** Regarding cataloguing, first experiments have been carried out e.g. using datasets from [https://glottolog.org/langdoc Glottolog], an open repository that contains several thousand of dictionary metadata sets, and [https://www.owid.de/obelex/dict/en Obelex-dict], a catalogue of e-dictionaries.
* Definition and representation of bibliographic item relations:
** Citation relation (BibItem A cites BibItem B)
** Review relation (BibItem A reviews BibItem B or Lexical Resource C)