Introduction
- de Jong R.M.
- Bazzacco P.
- Poelarends G.J.
- Johnson Jr., W.H.
- Kim Y.J.
- Burks E.A.
- Serrano H.
- Thunnissen A.M.
- Whitman C.P.
- Dijkstra B.W.
- Fingerle-Rowson G.
- Kaleswarapu D.R.
- Schlander C.
- Kabgani N.
- Brocks T.
- Reinart N.
- Busch R.
- Schütz A.
- Lue H.
- Du X.
- Liu A.
- Xiong H.
- Chen Y.
- Nemajerova A.
- Hallek M.
- et al.

- Poelarends G.J.
- Serrano H.
- Person M.D.
- Johnson Jr., W.H.
- Murzin A.G.
- Whitman C.P.
Results and discussion
A large-scale comparison of 11,395 sequences reveals new structural and functional features of the TSF

- Mashiyama S.T.
- Malabanan M.M.
- Akiva E.
- Bhosle R.
- Branch M.C.
- Hillerich B.
- Jagessar K.
- Kim J.
- Patskovsky Y.
- Seidel R.D.
- Stead M.
- Toro R.
- Vetting M.W.
- Almo S.C.
- Armstrong R.N.
- Babbitt P.C.
- van der Meer J.Y.
- Poddar H.
- Baas B.J.
- Miao Y.
- Rahimi M.
- Kunzendorf A.
- van Merkerk R.
- Tepper P.G.
- Geertsema E.M.
- Thunnissen A.M.
- Quax W.J.
- Poelarends G.J.
Subgroupings may be useful for classifying newly discovered TSF members
Domain structure and sequence length variation across the TSF
Some TSF sequences lack an N-terminal proline

Phylogenetic representation in the TSF

Some MIF proteins in higher eukaryotes function as cytokines
Structure–function relationships between the 4-OT and cis-CaaD subgroups of the TSF
- Poelarends G.J.
- Serrano H.
- Person M.D.
- Johnson Jr., W.H.
- Murzin A.G.
- Whitman C.P.
Linkers between the 4-OT and cis-CaaD subgroups identify a similarity path between them

![]() |
Kinetic analysis showed loss of tautomerase activity in the enzymes linking 4-OT to cis-CaaD

Phylogenetic reconstruction of the linker set

Additional linkers among other TSF subgroups

Summary
Experimental procedures
Sequence data collection
Construction of sequence similarity networks
Division of the TSF into subgroups based on sequence similarity
Construction of the structure similarity network
Creation of HMMs
Identification of TSF members without an N-terminal proline
Identification of linkers
Creation of linker control networks
Structure characterization
Creation of the structure-guided alignment
Phylogenetic tree construction of the 4-OT and cis-CaaD subgroups
Kinetic analysis of linkers
Author contributions
Acknowledgments
Supplementary Material
References
- Divergent evolution of enzymatic function: Mechanistically diverse superfamilies and functionally distinct suprafamilies.Annu. Rev. Biochem. 2001; 70 (11395407): 209-246
- Toward mechanistic classification of enzyme functions.Curr. Opin. Chem. Biol. 2011; 15 (21489855): 435-442
- Structural classification of proteins: New superfamilies.Curr. Opin. Struct. Biol. 1996; 6 (8804825): 386-394
- The chemical versatility of the β-α-β fold: Catalytic promiscuity and divergent evolution in the tautomerase superfamily.Cell. Mol. Life Sci. 2008; 65 (18695941): 3606-3618
- Catalytic role of the amino-terminal proline in 4-oxalocrotonate tautomerase: Affinity labeling and heteronuclear NMR studies.Biochemistry. 1996; 35 (8547260): 803-813
- The 4-oxalocrotonate tautomerase family of enzymes: How nature makes new enzymes using a β-α-β structural motif.Arch. Biochem. Biophys. 2002; 402 (12051677): 1-13
- 4-Oxalocrotonate tautomerase: pH dependence of catalysis and pKa values of active site residues.Biochemistry. 1996; 35 (8547261): 814-823
- Enzymatic ketonization of 2-hydroxymuconate: Specificity and mechanism investigated by the crystal structures of two isomerases.Biochemistry. 1996; 35 (8547259): 792-802
- Pro-1 of macrophage migration inhibitory factor functions as a catalytic base in the phenylpyruvate tautomerase activity.Biochemistry. 1999; 38 (10353846): 7346-7354
- Mechanism of the phenylpyruvate tautomerase activity of macrophage migration inhibitory factor: Properties of the P1G, P1A, Y95F, and N97A mutants.Biochemistry. 2000; 39 (10933783): 9671-9678
- Crystal structure of macrophage migration inhibitory factor complexed with (E)-2-fluoro-p-hydroxycinnamate at 1.8 Å resolution: Implications for enzymatic catalysis and inhibition.Biochemistry. 1999; 38 (10360941): 7444-7452
- The X-ray structure of trans-3-chloroacrylic acid dehalogenase reveals a novel hydration mechanism in the tautomerase superfamily.J. Biol. Chem. 2004; 279 (14701869): 11546-11552
- Reactions of trans-3-chloroacrylic acid dehalogenase with acetylene substrates: Consequences of and evidence for a hydration reaction.Biochemistry. 2003; 42 (12873137): 8762-8773
- Crystal structures of native and inactivated cis-3-chloroacrylic acid dehalogenase: Structural basis for substrate specificity and inactivation by (R)-oxirane-2-carboxylate.J. Biol. Chem. 2007; 282 (17121835): 2440-2449
- The hydratase activity of malonate semialdehyde decarboxylase: Mechanistic and evolutionary implications.J. Am. Chem. Soc. 2004; 126 (15571384): 15658-15659
- The roles of active-site residues in the catalytic mechanism of trans-3-chloroacrylic acid dehalogenase: A kinetic, NMR, and mutational analysis.Biochemistry. 2004; 43 (15065850): 4082-4091
- MIF, a controversial cytokine: A review of structural features, challenges, and opportunities for drug development.Expert Opin. Ther. Targets. 2016; 20 (27762152): 1463-1475
- The macrophage migration inhibitory factor-glucocorticoid dyad: Regulation of inflammation and immunity.Mol. Endocrinol. 2007; 21 (17389748): 1267-1280
- Crystal structure of macrophage migration inhibitory factor from human lymphocyte at 2.1 Å resolution.FEBS Lett. 1996; 389 (8766818): 145-148
- MIF is a pituitary-derived cytokine that potentiates lethal endotoxaemia.Nature. 1993; 365 (8413654): 756-759
- Disulfide analysis reveals a role for macrophage migration inhibitory factor (MIF) as thiol-protein oxidoreductase.J. Mol. Biol. 1998; 280 (9653033): 85-102
- The immunoregulatory mediator macrophage migration inhibitory factor (MIF) catalyzes a tautomerization reaction.Mol. Med. 1996; 2 (8900542): 143-149
- The non-mammalian MIF superfamily.Immunobiology. 2017; 222 (27780588): 473-482
- A tautomerase-null macrophage migration-inhibitory factor (MIF) gene knock-in mouse model reveals that protein interactions and not enzymatic activity mediate MIF-dependent growth regulation.Mol. Cell Biol. 2009; 29 (19188446): 1922-1932
- Crystal structure of 4-oxalocrotonate tautomerase inactivated by 2-oxo-3-pentynoate at 2.4 Å resolution: Analysis and implications for the mechanism of inactivation and catalysis.Biochemistry. 1998; 37 (9778344): 14692-14700
- Kinetic and structural characterization of a heterohexamer 4-oxalocrotonate tautomerase from Chloroflexus aurantiacus J-10-fl: Implications for functional and structural diversity in the tautomerase superfamily.Biochemistry. 2010; 49 (20465238): 5016-5027
- Stereospecific alkylation of cis-3-chloroacrylic acid dehalogenase by (R)-oxirane-2-carboxylate: Analysis and mechanistic implications.Biochemistry. 2004; 43 (15170356): 7187-7196
- Crystal structures of the wild-type, P1A mutant, and inactivated malonate semialdehyde decarboxylase: A structural basis for the decarboxylase and hydratase activities.Biochemistry. 2005; 44 (16274229): 14818-14827
- The crystal structure of YdcE, a 4-oxalocrotonate tautomerase homologue from Escherichia coli, confirms the structural basis for oligomer diversity.Biochemistry. 2002; 41 (12356301): 12010-12024
- An indigoidine biosynthetic gene cluster from Streptomyces chromofuscus ATCC 49982 contains an unusual IndB homologue.J. Ind. Microbiol. Biotechnol. 2013; 40 (23053349): 159-168
- Characterization of Cg10062 from Corynebacterium glutamicum: implications for the evolution of cis-3-chloroacrylic acid dehalogenase activity in the tautomerase superfamily.Biochemistry. 2008; 47 (18598055): 8139-8147
- Cloning, expression, and characterization of a cis-3-chloroacrylic acid dehalogenase: Insights into the mechanistic, structural, and evolutionary relationship between isomer-specific 3-chloroacrylic acid dehalogenases.Biochemistry. 2004; 43 (14730981): 759-772
- The structure-function linkage database.Nucleic Acids Res. 2014; 42 (24271399): D521-D530
- Using sequence similarity networks for visualization of relationships across diverse protein superfamilies.PLoS ONE. 2009; 4 (19190775)e4345
- Pythoscape: A framework for generation of large protein similarity networks.Bioinformatics. 2012; 28 (22962345): 2845-2846
- Profile hidden Markov models.Bioinformatics. 1998; 14 (9918945): 755-763
- An atlas of the thioredoxin fold class reveals the complexity of function-enabling adaptations.PLoS Comput. Biol. 2009; 5 (19851441)e1000541
- Inference of functional properties from large-scale analysis of enzyme superfamilies.J. Biol. Chem. 2012; 287 (22069325): 35-42
- Protein similarity networks reveal relationships among sequence, structure, and function within the Cupin superfamily.PLoS ONE. 2013; 8 (24040257)e74477
- Large-scale determination of sequence, structure, and function relationships in cytosolic glutathione transferases across the biosphere.PLoS Biol. 2014; 12 (24756107)e1001843
- Using mutability landscapes of a promiscuous tautomerase to guide the engineering of enantioselective Michaelases.Nat. Commun. 2016; 7 (26952338)10911
- Effects of mutations of the active site arginine residues in 4-oxalocrotonate tautomerase on the pKa values of active site residues and on the pH dependence of catalysis.Biochemistry. 1999; 38 (10493803): 12358-12366
- The structural basis for the perturbed pKa of the catalytic base in 4-oxalocrotonate tautomerase: Kinetic and structural effects of mutations of Phe-50.Biochemistry. 2001; 40 (11329265): 1984-1995
- Kinetic, stereochemical, and structural effects of mutations of the active site arginine residues in 4-oxalocrotonate tautomerase.Biochemistry. 1999; 38 (10493802): 12343-12357
- Evolution of enzymatic activity in the tautomerase superfamily: Mechanistic and structural consequences of the L8R mutation in 4-oxalocrotonate tautomerase.Biochemistry. 2006; 45 (16784221): 7700-7708
- Identification and characterization of new family members in the tautomerase superfamily: Analysis and implications.Arch. Biochem. Biophys. 2014; 564 (25219626): 189-196
- Mechanistic characterization of a bacterial malonate semialdehyde decarboxylase: Identification of a new activity on the tautomerase superfamily.J. Biol. Chem. 2003; 278 (14506256): 48674-48683
- Extent of N-terminal methionine excision from Escherichia coli proteins is governed by the side-chain length of the penultimate amino acid.Proc. Natl. Acad. Sci. U.S.A. 1989; 86 (2682640): 8247-8251
- Kinetic and structural characterization of DmpI from Helicobacter pylori and Archaeoglobus fulgidus, two 4-oxalocrotonate tautomerase family members.Bioorg. Chem. 2010; 38 (20709352): 252-259
- Molecular cloning of human d-dopachrome tautomerase cDNA: N-terminal proline is essential for enzyme activation.Biochem. Biophys. Res. Commun. 1998; 243 (9480844): 538-544
- The d-dopachrome tautomerase (DDT) gene product is a cytokine and functional homolog of macrophage migration inhibitory factor (MIF).Proc. Natl. Acad. Sci. U.S.A. 2011; 108 (21817065): E577-E585
- d-Dopachrome tautomerase (D-DT or MIF-2): Doubling the MIF cytokine family.Cytokine. 2012; 59 (22507380): 10-17
- Phylogeny and evolution of plant macrophage migration inhibitory factor/d-dopachrome tautomerase-like proteins.BMC Evol. Biol. 2015; 15 (25888527): 64
- Identification and biochemical characterization of macrophage migration inhibitory factor-2 (MIF-2) homologue of human lymphatic filarial parasite, Wuchereria bancrofti.Acta Trop. 2015; 142 (25446175): 71-78
- Stereospecific ketonization of 2-hydroxymuconate by 4-oxalocrotonate tautomerase and 5-carboxymethyl-2-hydroxymuconate isomerase.J. Am. Chem. Soc. 1992; 114: 10104-10110
- Phenylpyruvate tautomerase activity of trans-3-chloroacrylic acid dehalogenase: Evidence for an enol intermediate in the dehalogenase reaction?.Biochemistry. 2007; 46 (17661448): 9596-9604
- Kinetic and structural characterization of a cis-3-chloroacrylic acid dehalogenase homologue in Pseudomonas sp. UW4: A potential step between subgroups in the tautomerase superfamily.Arch. Biochem. Biophys. 2017; 636 (29111295): 50-56
- The relation between the divergence of sequence and structure in proteins.EMBO J. 1986; 5 (3709526): 823-826
- Annotation error in public databases: Misannotation of molecular function in enzyme superfamilies.PLoS Comput. Biol. 2009; 5 (20011109)e1000605
- InterPro in 2017: Beyond protein family and domain annotations.Nucleic Acids Res. 2017; 45 (27899635): D190-D199
- Gapped BLAST and PSI-BLAST: A new generation of protein database search programs.Nucleic Acids Res. 1997; 25 (9254694): 3389-3402
- Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.Bioinformatics. 2006; 22 (16731699): 1658-1659
- Cytoscape: A software environment for integrated models of biomolecular interaction networks.Genome Res. 2003; 13 (14597658): 2498-2504
- TM-align: A protein structure alignment algorithm based on the TM-score.Nucleic Acids Res. 2005; 33 (15849316): 2302-2309
- The Pfam protein families database: Towards a more sustainable future.Nucleic Acids Res. 2016; 44 (26673716): D279-D285
- How significant is a protein structure similarity with TM-score = 0.5?.Bioinformatics. 2010; 26 (20164152): 889-895
- Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.Mol. Syst. Biol. 2011; 7 (21988835): 539
- Accelerated profile HMM searches.PLoS Comput. Biol. 2011; 7 (22039361)e1002195
- UCSF Chimera: A visualization system for exploratory research and analysis.J. Comput. Chem. 2004; 25 (15264254): 1605-1612
- MUSCLE: Multiple sequence alignment with high accuracy and high throughput.Nucleic Acids Res. 2004; 32 (15034147): 1792-1797
- MrBayes 3: Bayesian phylogenetic inference under mixed models.Bioinformatics. 2003; 19 (12912839): 1572-1574
- UniProt: The universal protein knowledgebase.Nucleic Acids Res. 2017; 45 (27899622): D158-D169
Article info
Publication history
Footnotes
This work was supported by Grants F-1334 (to CPW) and F-1778 (to Y. J. Z.) from the Robert A. Welch Foundation and by National Institutes of Health Grants GM-41239 (to C. P. W.) and GM-60595 (to P. C. B). The authors declare that they have no conflicts of interest with the contents of this article. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
This article contains Figs. S1–S15 and File S1.
The atomic coordinates and structure factors (codes 5UIF, 5UNQ, and 6BLM) have been deposited in the Protein Data Bank (http://wwpdb.org/).
Identification
Copyright
User license
Creative Commons Attribution (CC BY 4.0) |
Permitted
- Read, print & download
- Redistribute or republish the final article
- Text & data mine
- Translate the article
- Reuse portions or extracts from the article in other works
- Sell or re-use for commercial purposes
Elsevier's open access license policy