PUBLICATION

Phylogeny of teleost connexins reveals highly inconsistent intra- and interspecies use of nomenclature and misassemblies in recent teleost chromosome assemblies

Authors
Mikalsen, S.O., Tausen, M., Í Kongsstovu, S.
ID
ZDB-PUB-200403-38
Date
2020
Source
BMC Genomics   21: 223 (Journal)
Registered Authors
Keywords
Connexins, Genome duplication, Mammals, Nomenclature, Ohnologs, Orthologs, Paralogs, Phylogenetic trees, Teleosts
MeSH Terms
  • Animals
  • Connexins/genetics*
  • Fish Proteins/genetics
  • Fishes/genetics*
  • Multigene Family
  • Phylogeny
  • Sequence Analysis, DNA/methods*
  • Terminology as Topic
PubMed
32160866 Full text @ BMC Genomics
Abstract
Based on an initial collecting of database sequences from the gap junction protein gene family (also called connexin genes) in a few teleosts, the naming of these sequences appeared variable. The reasons could be (i) that the structure in this family is variable across teleosts, or (ii) unfortunate naming. Rather clear rules for the naming of genes in fish and mammals have been outlined by nomenclature committees, including the naming of orthologous and ohnologous genes. We therefore analyzed the connexin gene family in teleosts in more detail. We covered the range of divergence times in teleosts (eel, Atlantic herring, zebrafish, Atlantic cod, three-spined stickleback, Japanese pufferfish and spotted pufferfish; listed from early divergence to late divergence).
The gene family pattern of connexin genes is similar across the analyzed teleosts. However, (i) several nomenclature systems are used, (ii) specific orthologous groups contain genes that are named differently in different species, (iii) several distinct genes have the same name in a species, and (iv) some genes have incorrect names. The latter includes a human connexin pseudogene, claimed as GJA4P, but which in reality is Cx39.2P (a delta subfamily gene often called GJD2like). We point out the ohnologous pairs of genes in teleosts, and we suggest a more consistent nomenclature following the outlined rules from the nomenclature committees. We further show that connexin sequences can indicate some errors in two high-quality chromosome assemblies that became available very recently.
Minimal consistency exists in the present practice of naming teleost connexin genes. A consistent and unified nomenclature would be an advantage for future automatic annotations and would make various types of subsequent genetic analyses easier. Additionally, roughly 5% of the connexin sequences point out misassemblies in the new high-quality chromosome assemblies from herring and cod.
Genes / Markers
Figures
Show all Figures
Expression
Phenotype
Mutations / Transgenics
Human Disease / Model
Sequence Targeting Reagents
Fish
Antibodies
Orthology
Engineered Foreign Genes
Mapping