LexBib bibliodata workflow overview: Difference between revisions

From LexBib


* All bibliodata is stored in [[LexBib Zotero]], which is a "group" on the Zotero platform. The group page is public, but item attachments (PDF, TXT) are restricted to registered group members. Member registration is restricted to members of the project.
* For scraping publication metadata from web pages (e.g. article 'landing pages' in journal or publisher portals), the Zotero software includes so-called [https://www.zotero.org/support/translators translators], which ingest bibliodata as single items or in batches. Zotero also tries to harvest the PDF and, if it finds one, produces a TXT version.
* We transform bibliodata that reaches us in other representation formats or as tabular data to [https://en.wikipedia.org/wiki/RIS_(file_format) RIS format], using our [https://github.com/elexis-eu/elexifinder/tree/master/BibDataConverters own converters]. Zotero imports RIS directly; if needed, items can also be exported as RIS, manipulated using regular expressions, and re-imported.
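
The conversion of tabular data to RIS can be illustrated with a minimal Python sketch. It is not the code of the converters linked above. The column names (<code>title</code>, <code>authors</code>, <code>year</code>, <code>journal</code>, <code>doi</code>), the semicolon as author separator, the default item type <code>JOUR</code>, and the helper names are all assumptions made for this example. The last function shows the kind of regular-expression clean-up that could be applied to an exported RIS file before re-import.

```python
import csv
import io
import re

# Hypothetical mapping from table columns to RIS tags (assumed for this sketch).
COLUMN_TO_TAG = {"title": "TI", "year": "PY", "journal": "T2", "doi": "DO"}


def row_to_ris(row, item_type="JOUR"):
    """Build one RIS record (TY ... ER) from a dict representing a table row."""
    lines = [f"TY  - {item_type}"]
    # Assumption: authors share one cell, separated by semicolons.
    for author in (row.get("authors") or "").split(";"):
        if author.strip():
            lines.append(f"AU  - {author.strip()}")
    for column, tag in COLUMN_TO_TAG.items():
        value = (row.get(column) or "").strip()
        if value:
            lines.append(f"{tag}  - {value}")
    lines.append("ER  - ")  # every RIS record ends with an ER line
    return "\n".join(lines)


def table_to_ris(csv_text):
    """Convert CSV text with a header row into RIS text, one record per row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return "\n\n".join(row_to_ris(row) for row in reader) + "\n"


def strip_doi_prefix(ris_text):
    """Regex clean-up example: reduce DOI URLs in DO fields to the bare DOI."""
    pattern = r"^(DO  - )https?://(?:dx\.)?doi\.org/"
    return re.sub(pattern, r"\1", ris_text, flags=re.MULTILINE)


if __name__ == "__main__":
    # Placeholder data, not a real publication.
    sample = (
        "title,authors,year,journal,doi\n"
        'A Title,"Doe, Jane; Roe, Richard",2020,Some Journal,'
        "https://doi.org/10.1234/abcd\n"
    )
    print(strip_doi_prefix(table_to_ris(sample)))
```

Each row becomes one record that opens with a <code>TY</code> line and closes with an <code>ER</code> line, which is the structure Zotero expects when importing RIS.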


==Manual curation==