Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Review all tips for relevance to deep learning and biology #215

Open
agitter opened this issue Jun 8, 2020 · 1 comment
Open

Review all tips for relevance to deep learning and biology #215

agitter opened this issue Jun 8, 2020 · 1 comment

Comments

@agitter
Copy link
Collaborator

agitter commented Jun 8, 2020

To keep the scope of this article focused, the tips should be about deep learning in biology. Several tips only pertain to one or the other. I recommend we add more biology examples to the tips that are mostly about deep learning to keep the article relevant.

I'm less sure about what to do for the tips that describe best practices for machine learning or statistics but are not specific to deep learning. For example, the first paragraphs of Tip 4 are all essential but not deep learning-specific.

@Benjamin-Lee
Copy link
Owner

I recommend we add more biology examples to the tips that are mostly about deep learning to keep the article relevant.

I agree 1000%. I'm not sure if I've mentioned it elsewhere, but examples/anecdotes are critical for cementing concepts. When I CS50, they introduced the idea of an integer overflow early on (in lecture 1!), which is kind of abstract. However, they make it very concrete and show why you really have to pay attention with examples such as the Patriot missile battery failure which killed 28 people and the 787 overflow bug.

For each tip, let's ensure we have at least one biology-related example. I took a look and here's what I saw:

  1. Suppose that there are robust heritability estimates for a phenotype that suggest that the genetic contribution is modest but a deep learning model predicts the phenotype with very high accuracy. The model may be capturing signal unrelated to genetic mechanisms underlying the phenotype. In this case, a possible explanation is that people with similar genetic markers may have shared exposures.

  2. As an illustration, Rajkomar et al. [13] found that simpler baseline models achieved performance comparable with that of DL in a number of clinical prediction tasks using electronic health records, which may be a surprise to many.

  3. Lots of biological applications but no examples.
  4. A poignant example of this lesson is...

Does someone want to take the lead on figuring out some examples?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants