I am looking to extract the nucleotide mutations from my data downloaded from GISAID. How can I get the nucleotide mutations ? Which python script,command,file or procedure I need to execute to get the mutation data ?
If you want to compare to another sequence, then a simple approach would be to align your sequences of interest and compare them to get the changes.
@james I have used the next clade web application, Is there a way to do it locally for a larger dataset ? or could i run specific scripts which can do it ?
yes, there is a cli that you can run locally. We just released an alpha version of a faster reimplentation:
(post withdrawn by author, will be automatically deleted in 24 hours unless flagged)
@rneher Thanks for the information regarding the nextclade tool. I have used the nextclade API and it filters the data or sequences based on some precondtions . I think there are as follows :
" By default, sequences less than 27,000 bases in length or with more than 3,000
N (unknown) bases are omitted from the analysis.** For a basic QC and preliminary analysis of your sequence data, you can use clades.nextstrain.org. This tool will check your sequences for excess divergence, clustered differences from the reference, and missing or ambiguous data. In addition, it will assign nextstrain clades and call mutations relative to the reference."
I am looking for a way to take all the sequences from GISAID and run analysis based on the conditions of my research with my advisor. Is there a way to not use the precondition and filtering? I wish to use my own conditions and filtering condtions for the analysis.