While I agree adding this would be beneficial, it's the least problematic bit about the spec!
It certainly would be good if the original authors could add more about it. One thing that confused me lots is the "Auxiliary data", which changes format depending on the thing being indexed. (IIRC it's tabix data for VCF and some BAI-related format for BCF). I assume it's meant to be generic, but it also makes it largely unparseable without custom knowledge.
It is clear from examination of .csi files that they are stored as BGZF (why?), although this is not mentioned and is at odds with the current behaviour of BAI.
I used bcftools 1.19 to index a BCF file and tried to parse the resulting CSI index file according to the spec https://github.com/samtools/hts-specs/blob/26347448cadff3cf40982d60fe2a97f20d2543ea/CSIv1.tex#L20C28-L20C33. It did not work as expected. After running
hexdump -C
on the csi file, I realized it is not a plain binary file as described in the CSIv1 spec. But it seems consistent with the spec after decompressing it:
bgzip -cd test.bcf.csi | hexdump -C
Could we add a sentence to the spec to point this out for future readers? Or is it not part of the spec?
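For anyone hitting the same problem, here is a minimal Python sketch of the workaround described above: decompress first, then read the fixed header fields in the order CSIv1 lists them (magic, min_shift, depth, l_aux, aux, n_ref). The function name read_csi_header is made up for illustration, and the sketch relies on BGZF being a series of ordinary gzip members, so the standard gzip module can decompress it. It leaves the auxiliary data as raw bytes and does not parse the per-reference bins.

```python
import gzip
import struct


def read_csi_header(raw: bytes) -> dict:
    """Read the fixed CSIv1 header fields from raw file bytes.

    Hypothetical helper for illustration; accepts either the BGZF-compressed
    file contents or the already decompressed payload.
    """
    # gzip (and therefore BGZF) data starts with the bytes 1f 8b.
    # gzip.decompress handles concatenated members, which is what BGZF is.
    if raw[:2] == b"\x1f\x8b":
        raw = gzip.decompress(raw)

    if raw[:4] != b"CSI\x01":
        raise ValueError("not a CSIv1 index (bad magic)")

    # Three little-endian int32 values follow the 4-byte magic.
    min_shift, depth, l_aux = struct.unpack_from("<iii", raw, 4)

    # The auxiliary data is format-dependent, so keep it as opaque bytes.
    aux = raw[16:16 + l_aux]
    (n_ref,) = struct.unpack_from("<i", raw, 16 + l_aux)

    return {
        "min_shift": min_shift,
        "depth": depth,
        "l_aux": l_aux,
        "aux": aux,
        "n_ref": n_ref,
    }
```

Usage would be something like read_csi_header(open("test.bcf.csi", "rb").read()), using the file name from the commands above.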