Nextclade variant calling info

Hi @corneliusroemer,

I have a follow-up question to this post. I am doing a retrospective analysis of SARS-CoV-2 (SC2) data to look at variant severity. In the model, I am grouping sequences by clade and looking at hospitalization (yes/no). I would like to understand how the ‘good’ vs ‘mediocre’ vs ‘bad’ values of qc.overallStatus relate to the accuracy of lineage calls, and whether any further work has been done on a confidence estimate for lineage calls. Essentially, which sequences should be excluded because the lineage call can’t be trusted, and is qc.overallStatus the best indicator of this?

Thank you in advance!