Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

First Commit for Biodiveristy Heritage Library #2450

Merged
merged 15 commits into from
Nov 26, 2024
40 changes: 40 additions & 0 deletions datasets/bhl-open-data.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,40 @@
Name: "Biodiversity Heritage Library Metadata and Page Images"
Description: The Biodiversity Heritage Library (BHL) is the world’s largest open access digital library for biodiversity literature and archives. BHL operates as a worldwide consortium of natural history, botanical, research, and national libraries working together to digitize the natural history literature held in their collections and make it freely available for open access.
Documentation: Documentation can be found at <a href="https://github.com/gbhl/bhl-open-data">our GitHub repository</a>.
Contact: [email protected]
ManagedBy: "[The Biodiversity Heritage Library](https://biodiversitylibrary.org/)"
UpdateFrequency: "Metadata is updated monthly. Images are updated weekly."
Tags:
- biodiversity
- bioinformatics
- life sciences
License: Public Domain, CC0, or Creative Commons. Exact licenses are found in the related metadta files and <a href="https://github.com/gbhl/bhl-open-data">documentation</a>.
Citation:
Resources:
- Description: Image files (JPEG-2000) and associated metadata describing the image and the book or article that contains the image.
ARN: arn:aws:s3:::bhl-open-data
Region: us-east-2
Type: S3 Bucket
DataAtWork:
Tools & Applications:
- Title: "BioStor"
URL: https://biostor.org/
AuthorName: Roderic Page
AuthorURL: https://scholar.google.com/citations?user=4Z5WABAAAAAJ&hl=en
- Title: "BHLIndex"
URL: https://github.com/gnames/bhlindex
AuthorName: Global Names
AuthorURL: https://globalnames.org/
Publications:
- Title: "Unearthing the Past for a Sustainable Future: Extracting and transforming data in the Biodiversity Heritage Library for climate action"
URL: https://doi.org/10.3897/biss.7.112436
AuthorName: Dearborn J, Lichtenberg M, Richard J, deVeer J, Trizna M, Mika K
AuthorURL: https://biss.pensoft.net/article/112436/list/9/
- Title: "Understanding BHL Through Metadata: Patterns of Bio-Diverse Knowledge Production"
URL: https://www.ncbi.nlm.nih.gov/pubmed?cmd=DetailsSearch&term=29853643[PMID]
AuthorName: Lidia Ponce de la Vega
AuthorURL: https://lidiapv.com/
- Title: "AI models are getting better and better at reading handwriting, but how can we find handwritten text to begin with?"
URL: https://doi.org/10.25573/data.23523495.v1
AuthorName: Mike Trizna, JJ Dearborn
AuthorURL: https://datascience.si.edu/people/mike-trizna