Comparison of SARS-CoV-2 variants with INSaFLU and galaxyproject
Development of workflows for NGS data analysis have facilitated the study of sequences. Such workflows have their own advantages and challenges based on the algorithms they use. As a part of this study for Biohackathon 2020, we have compared the SARS-CoV-2 variant outputs of INSaFLU workflow with those analyzed by galaxyproject/SARS-CoV-2. Within 24 samples, 597 variants were found to be shared between two workflows, with almost half of them found within the coding sequence of replicase polyprotein 1ab. Within the shared variants, number of non-synonymous variants were considerably higher and nearly half of the variants were multiallelic. Prospective studies could help us evaluate the accuracy of these variants.