Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Using LLM outputs to enhance CL terms and structure #2081

Closed
1 of 6 tasks
dosumis opened this issue Aug 1, 2023 · 4 comments · Fixed by #2530
Closed
1 of 6 tasks

Using LLM outputs to enhance CL terms and structure #2081

dosumis opened this issue Aug 1, 2023 · 4 comments · Fixed by #2530
Assignees
Labels

Comments

@dosumis
Copy link
Contributor

dosumis commented Aug 1, 2023

Cell Guide has draft cell type descriptions from ChatGPT
e.g. Acinar Cell Of Salivary Gland

CL def: An acinar cell of salivary gland.
CL parent class def:

ChatGPT:

Acinar cells of the salivary gland are a critical part of the oral physiology and the digestive mechanism. These specialized cells are concentrated towards the terminal ends of the salivary glands, where they form sac-like acini or clusters, giving them their name. They are amphicrine in nature, which means they play both secretory and endocrine roles.

The primary function of the acinar cells of the salivary gland is to manufacture and secrete saliva. This substance, comprised mainly of water, electrolytes, mucins, and enzymes, is vital for the initial stages of digestion and for the regular maintenance of oral health. The acinar cells produce the two main types of saliva: serous and mucinous. The serous type, predominantly produced by the parotid gland, contains the enzyme amylase responsible for the initial breakdown of complex carbohydrates. The mucinous type, more common in the submandibular and sublingual glands, contains mucin that aids in lubrication, facilitating the process of mastication and bolus formation.

Moreover, acinar cells also express and respond to various endocrine, paracrine, and autocrine hormones that govern the quantity and composition of the saliva produced. This implies sensitivity and modulation according to the internal and external environment, dietary habits, and even circadian rhythms. Impairment of acinar cells, whether due to damage or related to a systemic condition such as Sjögren's Syndrome, could disrupt the optimal functioning of the oral and digestive systems. In conclusion, acinar cells of the salivary gland, despite their diminutive size, have a substantial role in oral health and overall homeostasis.

This clearly needs curating for accuracy and the addition of references and some details may be too human-specific for CL or may better belong in an extended knowledge graph, but reading it also immediately suggests some obvious improvements we can make.

  1. Having some more information about morphology location and function would make the def more useful.
  2. There is currently no textual or logical connection to saliva secretion, but this can easily be added
  3. We can add subtypes by function - serous & mucinous (it might be worth requesting more specific GO terms)
  4. We could add subtypes by gland (parotid, submandibular and sublingual glands)
  5. What evidence is there for endocrine function? Is this sufficient to add additional links to GO via capable_of?

One way we could home in on some references is via another LLM-based resource, perplexity:

https://www.perplexity.ai/search/f5e5f8a6-cf6e-4b67-b28b-320173bc0725?s=c

Acinar cells of salivary glands are essential for the production and secretion of saliva, which plays a vital role in digestion and maintaining oral health. These cells are found in the salivary glands, including the parotid, submandibular, and sublingual glands 1. Acinar cells can be classified into two types: serous and mucinous 1. Serous acinar cells secrete an isotonic, protein-rich fluid, while mucinous acinar cells secrete mucin, a lubricant 1.

The salivary glands are made up of lobules, with each lobule containing many secretory acinar cells that form rounded secretory units with a central duct 1 . Acinar cells initially produce an isotonic, plasma-like fluid that is further modified by duct cells through the reabsorption of most of the NaCl and secretion of KHCO3 2 . Due to the ducts being relatively impermeable to water, the final saliva is hypotonic 2. Acinar cells are surrounded by myoepithelial cells, which display 4 to 8 processes that wrap around the acini and intercalated ducts 4. These myoepithelial cells rhythmically contract to squeeze saliva from the acinar units through the duct system and release it into the oral cavity 4. In addition to myoepithelial cells, acinar cells are also surrounded by an extracellular matrix, immune cells, stromal cells, myofibroblasts, and nerve fibers 4.

The primary function of acinar cells is to regulate ion channel activity in various cellular domains to generate the appropriate fluid secretion in salivary gland 6 . The receptor-evoked Ca2+ signal in salivary gland acinar cells is unique in that it starts at the apical pole and then rapidly increases across the cell 6 . This polarized Ca2+ signal can be attributed to the polarized arrangement of Ca2+ channels, transporters, and signaling proteins 6

The references may not be ideal primary refs - but typically have references to good primary refs. This description also provides more detail on secretory mechanisms (which might suggest more GO links) as well as useful context and material for the structure and function of related cell types - the myoepithelial cells and duct cells of the salivary glands.


Actions

Tasks

No tasks being tracked yet.
@dosumis dosumis added the Epic label Aug 1, 2023
@addiehl
Copy link
Contributor

addiehl commented Aug 1, 2023

I think these LLM descriptions read great, but definitely need review by trained curators to avoid possible hallucinations and verify facts and references. Clearly, they provide excellent starting material for in depth descriptions of cell types as well as suggesting additional subclasses, as David points out. But human review is key, because published cell type descriptions will be the feedstock of future LLMs and we don't want to create a cycle of misinformation that becomes difficult to correct as it gets embedded in other resources and search engine indices.

Copy link

This issue has not seen any activity in the past 6 months; it will be closed automatically in one year from now if no action is taken.

@dosumis
Copy link
Contributor Author

dosumis commented May 10, 2024

@Caroline-99 - had forgotten about this ticket. Has some relevant detail.

Let's keep open for now. Working through this example, even with the limited tech stack we have, could be a useful exercise.

We will also need a new epic with an outline roadmap, but I think that needs more discussion.

@dosumis
Copy link
Contributor Author

dosumis commented Aug 27, 2024

New plan:

  • We will accept referenced, reviewed CellGuide defs after a brief review to avoid any that clearly refer to the wrong term or are only appropriate for a parent class.
  • These will be added using dcterms:Description and prepended with a boilerplate disclaimer.
  • The process of addition will be managed using a simple DOSDP template to generate merged text. This will be a one off addition + merge rather than maintaining as dynamic content in spreadsheets (may need some discussion)
  • Extended descriptions may be edited and new ones added as we work on terms for
  • Current content in comments should be reviewed for addition to extended defs
  • We will run OntoGPT/CurateGPT across CellGuide extended descriptions with the aim of matching to GO terms.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Development

Successfully merging a pull request may close this issue.

3 participants