plazi / drosophilid-data

Repository for cleanup, enhancement, and preparation of Drosophlid data

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Define needed metadata fields

tcatapano opened this issue · comments

@myrmoteras: Ive created this private repo for the prep of the Drosophilid data. The first step is to define the target metadata fields and what cleaning needs to be done. Take a look at https://raw.githubusercontent.com/plazi/drosophilid-data/master/BibData-Drosophila.corrected.tab.csv and let me know what you think needs to be done.

Right now the fields are:

ID-taxodros|author|year|title|pagination|volume|journalOrPublisher|formatAndMeta|pubDate|libNumber|issue|editor|volumeTitle

@tcatapano Ideally we need to be able to provide the data that is required in CrossRef, DataCite, and now Wikicite to create depositions that can be used by these services. I assume that taxodros has all the fields, including the type of publications? Also, how is the names of the author provided? how in the case where there are multiple authors?

this is an example of the elements that are provided by Taxodros, and those that need be added to create a valid deposition.

It seems at a first glance that only the author names have to be parsed into separate fields, e.g. Abd El-Halim, A.S. & Mostafa, A.A. & Allam, K.A.M.

EXAMPLE
ID-taxodros| abd el-halim et al., 2005|
author| Abd El-Halim, A.S. & Mostafa, A.A. & Allam, K.A.M.|
year| 2005|
title| Dipterous flies species and their densities in fourteen Egyptian governorates.|
pagination| 351-362|
volume| 35|
journalOrPublisher| J. Egypt. Soc. Parasitol.|
formatAndMeta| pdf|
pubDate| |
libNumber| |
issue| |
editor| |
|volumeTitle| |

reuquired to be added for the upload at Zenodo
uploadType | publication
PublicationType | Journal article
Description | uploaded by Plazi from Taxodros
AccessRight | open access
License | either CC BY or licence not specified
Communities | Biodiversity Literature Repository

@myrmoteras I missed the notification of your comments. I'll get to work on the conversion targetting the Zenodo upload first

see #2: authors have been split into own columns

for required zenodo fields see #3