News

barroso watermark

barRoso 1.0.0

Initial Release

The first official release of the barRoso R package — a comprehensive toolkit for standardizing, harmonizing, and preparing plant specimen records for research and reconciliation.

Highlights

  • barroso_std(): Unified function to clean and standardize herbarium records across multiple fields (collector, geography, taxonomy, etc.).
  • barroso_flag_duplicates(): Flag potential duplicate specimens across herbaria using metadata patterns.
  • barroso_cat(): Combine and reconcile specimen records from multiple virtual herbaria (e.g., GBIF, SEINet, REFLORA, JABOT, speciesLink).
  • barroso_labels(): Generate printable herbarium labels from cleaned fieldbook data, with embedded maps and taxonomic authority retrieval.
  • Standardize collector names and collection numbers using regex-based parsing
  • Harmonize taxonomic, geographic, and temporal fields
  • Flag and remove potential duplicates across herbarium records
  • Generate herbarium labels from fieldbook data
  • Integrate with external taxonomic databases (e.g., LCVP, WFO)
  • Prepare large-scale biodiversity datasets for publication and analysis
  • Optimized for datasets from REFLORA, speciesLink, and JABOT.
  • Supports integration with tidyverse workflows for downstream analyses.
  • Test coverage >95%, continuous integration via GitHub Actions.

Philosophy

Unlike other tools that aggressively clean (and discard) records, barRoso focuses on standardization first — ensuring that all specimens, even misidentified or ambiguous ones, remain usable and discoverable. Standardization also enables better duplicate detection and data reconciliation without losing valuable information.

Infrastructure

Feedback

Please report bugs or feature requests here:
https://github.com/DBOSlab/barRoso/issues