Update glossary and graphviz for repo/workflows #191

Merged
merged 11 commits into from
Mar 28, 2024

joverlee521
Contributor

Description of proposed changes

With the push for pathogen repos to adhere to the pathogen-repo-guide, the glossary and graphviz for repositories and workflows need to be updated.

This will also make it easier to standardize terminology in the upcoming ingest tutorials.

Related issue(s)

Resolves #190

Checklist

  • Checks pass

@joverlee521
Contributor Author

Main changes are

Leaving this as a draft for discussion.

@joverlee521 joverlee521 requested a review from a team March 2, 2024 02:23
Member

@victorlin victorlin left a comment

Thanks for starting this PR, these changes are much needed! The suggestions below are based on the idea of going full send in the direction of multi-workflow pathogen repos, instead of keeping existing wording that assumes each pathogen repo = 1 phylo workflow. This means drawing clear boundaries around the types of workflows to align with pathogen-repo-guide, and at least acknowledging the presence of ingest/nextclade workflows if not walking through them in detail like the phylo workflow.

src/reference/glossary.rst (outdated, resolved)
src/reference/glossary.rst (outdated, resolved)
src/reference/glossary.rst (resolved)
With the push for pathogen repos to adhere to the pathogen-repo-guide,
the glossary and graphviz for repositories and workflows need to be
updated.

This will also make it easier to use standardized terminology in the
upcoming ingest tutorials.

Keep the term `workflow` general and link out to individual workflow
terms that summarize their uses.

Based on suggestion from @victorlin in review
#191 (comment)

Updating term references that used "workflow" to "phylogenetic workflow"
where appropriate to match new definitions defined in previous commit.

Based on @victorlin's suggestion in review
#191 (comment)

This will pave the way for the addition of ingest workflow tutorials
in the near future.

Keep the term `dataset` general and link out to individual dataset
terms that summarize their uses.

Based on suggestion from @victorlin in review
#191 (comment)

Updating term references that used "dataset" to "phylogenetic dataset"
where appropriate to match new definitions defined in previous commit.
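
For readers unfamiliar with how this pattern looks in reStructuredText, the "general term that links out to specific terms" approach described in the commits above might be sketched roughly as follows. This is an illustration only, not the merged content of `src/reference/glossary.rst`: the definition wording is assumed, and only the term names come from this conversation.

```rst
.. Hypothetical sketch; the definition text below is assumed, not copied
   from the Nextstrain glossary.

.. glossary::

   workflow
      General term for a set of steps that produces some output.
      See the specific kinds: :term:`phylogenetic workflow`,
      :term:`ingest workflow`, and :term:`Nextclade workflow`.

   phylogenetic workflow
   Nextstrain workflow
      (Assumed wording.) The workflow that produces a phylogenetic dataset.
      Two term lines above one definition make the second an alias.

   ingest workflow
      (Assumed wording.) Placeholder definition.

   Nextclade workflow
      (Assumed wording.) Placeholder definition.
```

In Sphinx, the `:term:` role resolves against entries declared in a `glossary` directive, so other pages can link to the specific term (for example, ``:term:`phylogenetic workflow```) rather than the general one.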
@joverlee521 joverlee521 marked this pull request as ready for review March 21, 2024 23:35
@joverlee521
Contributor Author

Major updates after initial feedback from @victorlin:

I'm planning to add the ingest tutorials in a separate branch based on these changes so they can link back to these new terms.

Contributor

@j23414 j23414 left a comment

I can get behind the use of the terms. As far as I understand, I'd just need to practice the new phrasing:

  • “phylogenetic analysis” -> “phylogenetic workflow” (to focus on the code)
  • “phylogenetic build” -> “phylogenetic dataset” (to focus on the final files)

@joverlee521 joverlee521 mentioned this pull request Mar 22, 2024
1 task
src/reference/glossary.rst (outdated, resolved)
src/reference/glossary.rst (outdated, resolved)
src/reference/glossary.rst (outdated, resolved)
src/reference/glossary.rst (outdated, resolved)
src/reference/glossary.rst (outdated, resolved)
src/reference/glossary.rst (outdated, resolved)
joverlee521 and others added 4 commits March 28, 2024 12:21
Originally suggested to create individual terms for each build
(phylogenetic/ingest/Nextclade) to match the individual terms for
"workflow" and "dataset", but seeing the terms written out made it
seem like overkill.

Instead, just adding the aliases to clarify the term covers all three
types of builds, as suggested by @victorlin in review.¹

¹ #191 (comment)

The colloquial term "Nextstrain workflow" usually refers to a
phylogenetic workflow, since we used to have only the phylogenetic
workflow.
@joverlee521 joverlee521 merged commit 1770e24 into master Mar 28, 2024
4 checks passed
@joverlee521 joverlee521 deleted the update-glossary-workflow branch March 28, 2024 20:15
@tsibley
Member

tsibley commented Apr 18, 2024

This is great! Nice work.

Development

Successfully merging this pull request may close these issues.

Update glossary terms workflow and workflow repository,
4 participants