Integrated analysis of protein sequence and structure redefines viral diversity and the taxonomy of the Flaviviridae

Peter Simmonds, Anamarija Butković, Joe Grove, Richard Mayne, Jonathon C O Mifsud, Martin Beer, Jens Bukh, J Felix Drexler, Amit Kapoor, Volker Lohmann, Donald B Smith, Jack T Stapleton, Nikos Vasilakis, Jens H Kuhn

Abstract

The Flaviviridae are a family of non-segmented positive-sense enveloped RNA viruses containing significant pathogens including hepatitis C virus and yellow fever virus. Recent large-scale metagenomic surveys have identified many diverse RNA viruses related to classical orthoflaviviruses and pestiviruses but quite different genome lengths and configurations, and with a hugely expanded host range that spans multiple animal phyla, including molluscs, cnidarians and stramenopiles,, and plants. Grouping of RNA-directed RNA polymerase (RdRP) hallmark gene sequences of flavivirus and 'flavi-like' viruses into four divergent clades and multiple lineages within them was congruent with helicase gene phylogeny, PPHMM profile comparisons, and comparison of RdRP protein structure predicted by AlphFold2. These results support their classification into the established order, Amarillovirales, in three families (Flaviviridae, Pestiviridae, and Hepaciviridae), and 14 genera. This taxonomic framework informed by RdRP hallmark gene evolutionary relationships provides a stable reference from which major genome re-organisational events can be understood.

OriginalsprogEngelsk
TidsskriftbioRxiv : the preprint server for biology
ISSN2692-8205
DOI
StatusUdgivet - 18 jan. 2025

Fingeraftryk

Dyk ned i forskningsemnerne om 'Integrated analysis of protein sequence and structure redefines viral diversity and the taxonomy of the Flaviviridae'. Sammen danner de et unikt fingeraftryk.

Citationsformater