Research Repository

Abstract

Integrating Collector and Author Roles Across Specimen and Publication Datasets.

Public Deposited

Creator

Nicolson, Nicky
Paton, Alan
Phillips, Sarah
Tucker, Allan

2019

Abstract

This work builds on the outputs of a collector data-mining exercise applied to GBIF-mobilised herbarium specimen metadata, which uses unsupervised learning (clustering) to identify collectors from minimal metadata associated with field-collected specimens (the Darwin Core terms , and ). Here, we outline methods to integrate these data-mined collector entities (a large-scale dataset, aggregated from multiple sources and created programmatically) with a dataset of author entities from the International Plant Names Index (a smaller-scale, single-source dataset, created via editorial management). The integration process asserts a generic "scientist" entity with activities in different stages of the species description process: collecting and name publication. We present techniques to investigate specialisations, including content (taxa of study) and activity stages, examining whether individuals focus on collecting, name publication, or both. Finally, we discuss generalisations of this initially herbarium-focussed data-mining and record linkage process to enable applications in a wider context, particularly in zoological datasets.
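The abstract does not specify how collector and author entities are matched, so the following is only a minimal sketch of one possible record linkage step, not the authors' method. It assumes each entity carries a "Surname, Forenames" name string and a range of active years; the `Collector` and `Author` schemas, the `name_key` blocking rule (surname plus first initial), and the 10-year `slack` are all hypothetical choices made for illustration, and the sample records are invented.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple
import unicodedata


@dataclass(frozen=True)
class Collector:
    """A data-mined collector entity (hypothetical schema)."""
    collector_id: str
    name: str        # "Surname, Forenames" or "Surname, Initials"
    first_year: int  # earliest collecting year in the cluster
    last_year: int   # latest collecting year in the cluster


@dataclass(frozen=True)
class Author:
    """An editorially managed author entity (hypothetical schema)."""
    author_id: str
    name: str        # "Surname, Forenames"
    first_year: int  # earliest name publication year
    last_year: int   # latest name publication year


def name_key(name: str) -> Tuple[str, str]:
    """Reduce a name to (surname, first initial), lowercased and ASCII-folded."""
    folded = (
        unicodedata.normalize("NFKD", name)
        .encode("ascii", "ignore")
        .decode("ascii")
    )
    surname, _, forenames = folded.partition(",")
    return surname.strip().lower(), forenames.strip()[:1].lower()


def years_compatible(c: Collector, a: Author, slack: int = 10) -> bool:
    """True if the two activity periods overlap, allowing `slack` years either side."""
    return c.first_year <= a.last_year + slack and a.first_year <= c.last_year + slack


def link_scientists(
    collectors: List[Collector], authors: List[Author]
) -> List[Tuple[str, str]]:
    """Return (collector_id, author_id) pairs sharing a name key and compatible years."""
    authors_by_key: Dict[Tuple[str, str], List[Author]] = {}
    for a in authors:
        authors_by_key.setdefault(name_key(a.name), []).append(a)

    links = []
    for c in collectors:
        for a in authors_by_key.get(name_key(c.name), []):
            if years_compatible(c, a):
                links.append((c.collector_id, a.author_id))
    return links


# Invented records, for illustration only.
collectors = [
    Collector("c1", "Example, A.", 1990, 2005),
    Collector("c2", "Sample, B.", 1850, 1870),
]
authors = [
    Author("a1", "Example, Ada", 1995, 2015),
    Author("a2", "Sample, Bea", 1990, 2000),
]
print(link_scientists(collectors, authors))  # [('c1', 'a1')]
```

Each linked pair could then be treated as a single "scientist" entity with both collecting and name publication activity. The second pair is rejected because the collecting and publishing periods are more than a century apart, which shows why a name match alone is a weak signal.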

Items:

Thumbnail	File name	Date uploaded	Visibility	File Size	Actions
	BISS_article_35866.pdf	2022-02-09	Public	62 KB	Download

Metadata

Resource Type: Abstract
Creator: Nicolson, Nicky
Paton, Alan
Phillips, Sarah
Tucker, Allan
Date published: 2019-06-13
Institution: Royal Botanic Gardens, Kew
Series name: Biodiversity Information Science and Standards
Volume: 3
Pagination: e35866
Publisher: Pensoft Publishers
Place of publication: Sofia, Bulgaria
eISSN: 2535-0897
Official URL: https://doi.org/10.3897/biss.3.35866
Related URL: https://biss.pensoft.net/article/35866/
Licence: CC BY 4.0 Attribution
Rights statement: In Copyright
DOI: 10.3897/biss.3.35866
Alternate identifier: e35866 (type: article_number)
Keywords: Collectors
Data-mining
Herbarium specimen data
IPNI
Additional Information: This article is part of: ST08 - More than Names: Identifying and Crediting People in Biodiversity Data. Edited by Simon Chagnoux, David Shorthouse, Anne Thessen.