Protein Family Expansions and Biological Complexity
MetadataShow full item record
During the course of evolution, new proteins are produced very largely as the result of gene duplication, divergence and, in many cases, combination. This means that proteins or protein domains belong to families or, in cases where their relationships can only be recognised on the basis of structure, superfamilies whose members descended from a common ancestor. The size of superfamilies can vary greatly. Also, during the course of evolution organisms of increasing complexity have arisen. In this paper we determine the identity of those superfamilies whose relative sizes in different organisms are highly correlated to the complexity of the organisms. As a measure of the complexity of 38 uni- and multicellular eukaryotes we took the number of different cell types of which they are composed. Of 1,219 superfamilies, there are 194 whose sizes in the 38 organisms are strongly correlated with the number of cell types in the organisms. We give outline descriptions of these superfamilies. Half are involved in extracellular processes or regulation and smaller proportions in other types of activity. Half of all superfamilies have no significant correlation with complexity. We also determined whether the expansions of large superfamilies correlate with each other. We found three large clusters of correlated expansions: one involves expansions in both vertebrates and plants, one just in vertebrates, and one just in plants. Our work identifies important protein families and provides one explanation of the discrepancy between the total number of genes and the apparent physiological complexity of eukaryotic organisms.
Christine Vogel is with Medical Research Council Laboratory of Molecular Biology and UT Austin, Cyrus Chothia is with Medical Research Council Laboratory of Molecular Biology.
CitationVogel C, Chothia C (2006) Protein Family Expansions and Biological Complexity. PLoS Comput Biol 2(5): e48. doi:10.1371/journal.pcbi.0020048
The following license files are associated with this item:
Showing items related by title, author, creator and subject.
A Universal Trend of Reduced mRNA Stability near the Translation-Initiation Site in Prokaryotes and Eukaryotes Gu, Wanjun; Zhou, Tong; Wilke, Claus O. (Public Library of Science, 2010-02-05)Recent studies have suggested that the thermodynamic stability of mRNA secondary structure near the start codon can regulate translation efficiency in Escherichia coli, and that translation is more efficient the less stable ...
Wall, Michael E.; Raghavan, Sindhu; Cohn, Judith D.; Dunbar, John (Public Library of Science, 2011-11-17)Recent studies have noted extensive inconsistencies in gene start sites among orthologous genes in related microbial genomes. Here we provide the first documented evidence that imposing gene start consistency improves the ...
Zhang, Jin, doctor of plant biology (2015-08)Plastid genomes of angiosperms are highly conserved in both genome organization and nucleotide substitution rates. Geraniaceae have highly rearranged genomes and elevated nucleotide substitution rates, which provides an ...