News
barRoso 1.0.0
Initial Release
The first official release of the barRoso R package — a comprehensive toolkit for standardizing, harmonizing, and preparing plant specimen records for research and reconciliation.
Highlights
barroso_std(): Unified function to clean and standardize herbarium records across multiple fields (collector, geography, taxonomy, etc.).barroso_flag_duplicates(): Flag potential duplicate specimens across herbaria using metadata patterns.barroso_cat(): Combine and reconcile specimen records from multiple virtual herbaria (e.g., GBIF, SEINet, REFLORA, JABOT, speciesLink).barroso_labels(): Generate printable herbarium labels from cleaned fieldbook data, with embedded maps and taxonomic authority retrieval.- Standardize collector names and collection numbers using regex-based parsing
- Harmonize taxonomic, geographic, and temporal fields
- Flag and remove potential duplicates across herbarium records
- Generate herbarium labels from fieldbook data
- Integrate with external taxonomic databases (e.g., LCVP, WFO)
- Prepare large-scale biodiversity datasets for publication and analysis
- Optimized for datasets from REFLORA, speciesLink, and JABOT.
- Supports integration with tidyverse workflows for downstream analyses.
- Test coverage >95%, continuous integration via GitHub Actions.
Philosophy
Unlike other tools that aggressively clean (and discard) records, barRoso focuses on standardization first — ensuring that all specimens, even misidentified or ambiguous ones, remain usable and discoverable. Standardization also enables better duplicate detection and data reconciliation without losing valuable information.
Infrastructure
- MIT license.
- GitHub Actions: R-CMD-check, test coverage, continuous integration.
- Website: barRoso documentation site
Feedback
Please report bugs or feature requests here:
https://github.com/DBOSlab/barRoso/issues