[Non-Nextstrain survey] What are the pain points in getting data ready for phylogenomic analysis?

Gavin · May 18, 2023, 11:45pm

Hi all!

I’m new to this forum, so please accept my apologies if this post is off topic.

I’m a developer of the open source software tool cogent3 for genomic data wrangling and molecular evolutionary analyses. (cogent3 is a direct descendant of PyCogent, of which I was co-lead developer with Rob Knight.)

I’m running a survey to evaluate what the major computational challenges being faced by our community are in terms of getting data ready for phylogenomic analyses.

If this is of interest to you, you can fill out the survey at https://forms.gle/VSt8TKdWtzUfe5A99

It will take <2 minutes.

Please forward to any colleagues who you think might be interested!

thank you!

Gavin

corneliusroemer · May 18, 2023, 11:53pm

Filling it out just now. Would it be possible to share your results? If not in raw then maybe in aggregate form? We all love data

Some feedback:

Do you really mean “species” here or in fact “samples/sequences”? If I analyze 2000 SARS-CoV-2 virus genomes, those are of one species. What should I tick?

image1660×486 12 KB
What if I analyze protein-coding RNA, as in SARS-CoV-2?

image1556×492 13.9 KB

Done!

Gavin · May 19, 2023, 1:08am

Great comments, updated the form for both questions!

Happy to share the results of course! Probably be a few weeks, but feel free to pester if I haven’t delivered them by then.

Topic		Replies	Views
Phylogeny analysis of RSV A and B Help and Getting Started	0	76	February 18, 2025
1 fundamental (maybe naive) question on nextStrain	1	457	May 19, 2021
Help needed to bulid Pylogenetic analysis for viruses	2	463	March 8, 2021
Help for phylogenetic tree about Dengue Help and Getting Started	15	953	April 6, 2023
Guide to filtering GISAID data for division-specific SARS-CoV-2 builds Help and Getting Started	3	1557	April 17, 2024

[Non-Nextstrain survey] What are the pain points in getting data ready for phylogenomic analysis?

Related topics