Skip to content

Understanding the HG002 truthset #117

Answered by ACEnglish
KasperH2 asked this question in Q&A
Discussion options

You must be logged in to vote

Hello,

I'll try my best to answer your questions with the disclaimer that I'm not the official maintainer of the GIAB data.

It appears that the DEL/INS SV counts in the manuscript are incorrect. 9641 is the count of PASS/Tier1 SVs which breaks down to

   4199 DEL
   5442 INS

From my understanding, the REPTYPE isn't a technical definition the same as SVTYPE's DEL/INS as much as it is an annotation. It would appear you have found edge cases where the REPTYPE annotation could be incorrect. However, it may be useful for you to explore how many of these are Tier1 SVs. Generally, I choose not to stray outside of the PASS/Tier1 regions.

$ bcftools view  -i "REPTYPE == 'DUP' & SVTYPE == 'DEL'" T…

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@KasperH2
Comment options

@ACEnglish
Comment options

@KasperH2
Comment options

@ACEnglish
Comment options

Answer selected by KasperH2
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants