diff --git a/datasets/software-heritage.yaml b/datasets/software-heritage.yaml index 761e14aaf..2fa120d41 100644 --- a/datasets/software-heritage.yaml +++ b/datasets/software-heritage.yaml @@ -13,6 +13,7 @@ Description: | Debian), and language-specific package managers (e.g., PyPI). Crawling information is also included, providing timestamps about when and where all archived source code artifacts have been observed in the wild. + Author and committer information is anonymized. Documentation: https://docs.softwareheritage.org/devel/swh-dataset/graph/athena.html Contact: aws@softwareheritage.org ManagedBy: Software Heritage @@ -24,8 +25,10 @@ Tags: - free software - digital preservation License: | - Creative Commons Attribution 4.0 International. - + The term "Software Heritage Graph Dataset" designates the internal structure of the Software Heritage archive, and explicitly excludes the file contents. + The "Software Heritage Graph Dataset" is distributed under the Creative Commons Attribution 4.0 International license. + For terms of use of all other contents found in the S3 buckets, contact datasets@softwareheritage.org + By accessing the dataset, you agree with the Software Heritage [Ethical Charter for using the archive data](https://www.softwareheritage.org/legal/users-ethical-charter/), @@ -44,5 +47,14 @@ Resources: Type: S3 Bucket DataAtWork: Tutorials: + - Title: Using the Software Heritage Graph Dataset + URL: https://docs.softwareheritage.org/devel/swh-dataset/graph/index.html + AuthorName: The Software Heritage team Tools & Applications: + - Title: The SWH-Graph module + URL: https://docs.softwareheritage.org/devel/swh-graph/index.html + AuthorName: The Software Heritage team Publications: + - Title: The Software Heritage Graph Dataset + URL: https://dx.doi.org/10.1145/3379597.3387510 + AuthorName: Antoine Pietri, Diomidis Spinellis, Stefano Zacchiroli