Anonymous

LexVoc: Difference between revisions

From LexBib
Line 64: Line 64:
== Stop-labels ==
== Stop-labels ==


Some term lexicalizations (SKOS ''labels'') are very ambiguous, and do not yield proper results (many false positives and/or too many distinct word senses relevant in the domain of Lexicography). At the time being, the approach towards that problem is a label stop-list, which is accessible [https://github.com/elexis-eu/elexifinder/blob/master/wikibase/stopterms.py here]. These term labels can be excluded in article full text indexation (if a term has several labels, the other labels are not affected); but currently, in order to review the stoplist, they aren't excluded.
Some term lexicalizations (SKOS ''labels'') are very ambiguous, and do not yield proper results (many false positives and/or too many distinct word senses relevant in the domain of Lexicography). At the time being, the approach towards that problem is a label stop-list, which is accessible [https://github.com/elexis-eu/elexifinder/blob/master/wikibase/stopterms.py here]. These term labels are excluded in article full text indexation (if a term has several labels, the other labels are not affected). This is currently our only approach towards ambiguous term labels; we assume that articles will contain a sufficient amount of other indexable terms, and that siblings, narrowers and broaders of the stoplist term will be among them, so that term indexation will still be meaningful.