Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Krishna, R. et al. Generalized biomolecular modeling and design with RoseTTAFold All-Atom. Science 384, eadl2528 (2024).
Devine, P. N. et al. Extending the application of biocatalysis to meet the challenges of drug development. Nat. Rev. Chem. 2, 409–421 (2018).
Kissman, E. N. et al. Expanding chemistry through in vitro and in vivo biocatalysis. Nature 631, 37–48 (2024).
Neugebauer, M. E. et al. A family of radical halogenases for the engineering of amino-acid-based products. Nat. Chem. Biol. 15, 1009–1016 (2019).
Khersonsky, O., Roodveldt, C. & Tawfik, D. Enzyme promiscuity: evolutionary and mechanistic aspects. Curr. Opin. Chem. Biol. 10, 498–508 (2006).
Gerlt, J. A. & Babbitt, P. C. Divergent evolution of enzymatic function: mechanistically diverse superfamilies and functionally distinct suprafamilies. Annu. Rev. Biochem. 70, 209–246 (2001).
Aharoni, A. et al. The ‘evolvability’ of promiscuous protein functions. Nat. Genet. 37, 73–76 (2005).
Lee, D., Redfern, O. & Orengo, C. Predicting protein function from sequence and structure. Nat. Rev. Mol. Cell Biol. 8, 995–1005 (2007).
Yu, T. et al. Enzyme function prediction using contrastive learning. Science 379, 1358–1363 (2023).
Altschul, S. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997).
Pál, C., Papp, B. & Lercher, M. J. An integrated view of protein evolution. Nat. Rev. Genet. 7, 337–348 (2006).
Tokuriki, N. & Tawfik, D. S. Protein dynamism and evolvability. Science 324, 203–207 (2009).
Davidi, D., Longo, L. M., Jabłońska, J., Milo, R. & Tawfik, D. S. A bird’s-eye view of enzyme evolution: chemical, physicochemical, and physiological considerations. Chem. Rev. 118, 8786–8797 (2018).
Lin, Z. et al. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science 379, 1123–1130 (2023).
Yoon, P. H. et al. Structure-guided discovery of ancestral CRISPR–Cas13 ribonucleases. Science 385, 538–543 (2024).
Nomburg, J. et al. Birth of protein folds and functions in the virome. Nature 633, 710–717 (2024).
Gaschignard, G. et al. AlphaFold2-guided description of CoBaHMA, a novel family of bacterial domains within the heavy-metal-associated superfamily. Proteins 92, 776–794 (2024).
Babbitt, P. C. et al. A functionally diverse enzyme superfamily that abstracts the α protons of carboxylic acids. Science 267, 1159–1161 (1995).
Waldron, K. J. & Robinson, N. J. How do bacterial cells ensure that metalloproteins get the correct metal?. Nat. Rev. Microbiol. 7, 25–35 (2009).
Hekkelman, M. L., De Vries, I., Joosten, R. P. & Perrakis, A. AlphaFill: enriching AlphaFold models with ligands and cofactors. Nat. Methods 20, 205–213 (2023).
Cheng, Y. et al. Co-evolution-based prediction of metal-binding sites in proteomes by machine learning. Nat. Chem. Biol. 19, 548–555 (2023).
Dürr, S. L., Levy, A. & Rothlisberger, U. Metal3D: a general deep learning framework for accurate metal ion location prediction in proteins. Nat. Commun. 14, 2713 (2023).
Laveglia, V., Bazayeva, M., Andreini, C. & Rosato, A. Hunting down zinc(II)-binding sites in proteins with distance matrices. Bioinformatics 39, btad653 (2023).
Vaillancourt, F. H., Yeh, E., Vosburg, D. A., O’Connor, S. E. & Walsh, C. T. Cryptic chlorination by a non-haem iron enzyme during cyclopropyl amino acid biosynthesis. Nature 436, 1191–1194 (2005).
Matthews, M. L. et al. Direct nitration and azidation of aliphatic carbons by an iron-dependent halogenase. Nat. Chem. Biol. 10, 209–215 (2014).
Gomez, C. A., Mondal, D., Du, Q., Chan, N. & Lewis, J. C. Directed evolution of an iron(II)- and α-ketoglutarate-dependent dioxygenase for site-selective azidation of unactivated aliphatic C−H bonds. Angew. Chem. 135, e202301370 (2023).
Dunwell, J. M., Culham, A., Carter, C. E., Sosa-Aguirre, C. R. & Goodenough, P. W. Evolution of functional diversity in the cupin superfamily. Trends Biochem. Sci. 26, 740–746 (2001).
Dunwell, J. M., Purvis, A. & Khuri, S. Cupins: the most functionally diverse protein superfamily?. Phytochemistry 65, 7–17 (2004).
Galperin, M. Y. & Koonin, E. V. Divergence and convergence in enzyme evolution. J. Biol. Chem. 287, 21–28 (2012).
Iyer, L. M., Abhiman, S., De Souza, R. F. & Aravind, L. Origin and evolution of peptide-modifying dioxygenases and identification of the wybutosine hydroxylase/hydroperoxidase. Nucleic Acids Res. 38, 5261–5279 (2010).
Uberto, R. & Moomaw, E. W. Protein similarity networks reveal relationships among sequence, structure, and function within the cupin superfamily. PLoS One 8, e74477 (2013).
Krebs, C., Galonić Fujimori, D., Walsh, C. T. & Bollinger, J. M. Non-heme Fe(IV)–oxo intermediates. Acc. Chem. Res. 40, 484–492 (2007).
Kipouros, I. & Chang, M. yannikipouros/hal-discovery: Metal-coordination mining pipeline for radical halogenases (v1.0). Zenodo https://doi.org/10.5281/zenodo.19737459 (2026).
Hillwig, M. L. & Liu, X. A new family of iron-dependent halogenases acts on freestanding substrates. Nat. Chem. Biol. 10, 921–923 (2014).
Zhao, C. et al. An Fe2+ – and α-ketoglutarate-dependent halogenase acts on nucleotide substrates. Angew. Chem. Int. Ed. 59, 9478–9484 (2020).
Kim, C. Y. et al. The chloroalkaloid (−)-acutumine is biosynthesized via a Fe(II)- and 2-oxoglutarate-dependent halogenase in Menispermaceae plants. Nat. Commun. 11, 1867 (2020).
Glasser, N. R., Cui, D., Risser, D. D., Okafor, C. D. & Balskus, E. P. Accelerating the discovery of alkyl halide-derived natural products using halide depletion. Nat. Chem. 16, 173–182 (2024).
Zallot, R., Oberg, N. & Gerlt, J. A. The EFI web resource for genomic enzymology tools: leveraging protein, genome, and metagenome databases to discover novel enzymes and metabolic pathways. Biochemistry 58, 4169–4182 (2019).
Khare, D. et al. Conformational switch triggered by α-ketoglutarate in a halogenase of curacin A biosynthesis. Proc. Natl Acad. Sci. USA 107, 14099–14104 (2010).
Price, M. N. & Arkin, A. P. PaperBLAST: text mining papers for information about homologs. mSystems 2, e00039-17 (2017).
Mansky, J. et al. The influence of genes on the “killer plasmid” of Dinoroseobacter shibae on its symbiosis with the dinoflagellate Prorocentrum minimum. Front. Microbiol. 12, 804767 (2022).
Wang, H. et al. Identification of genetic modules mediating the Jekyll and Hyde interaction of Dinoroseobacter shibae with the dinoflagellate Prorocentrum minimum. Front. Microbiol. 6, 1262 (2015).
Wienhausen, G. et al. The overlooked role of a biotin precursor for marine bacteria—desthiobiotin as an escape route for biotin auxotrophy. ISME J. 16, 2599–2609 (2022).
Matthews, M. L. et al. Substrate positioning controls the partition between halogenation and hydroxylation in the aliphatic halogenase, SyrB2. Proc. Natl Acad. Sci. USA 106, 17723–17728 (2009).
Li, M. H. et al. RLA8—a new and highly effective quadruple PPAR-α/γ/δ and GPR40 agonist to reverse nonalcoholic steatohepatitis and fibrosis. J. Pharmacol. Exp. Ther. 369, 67–77 (2019).
Mitchell, A. J. et al. Structural basis for halogenation by iron- and 2-oxo-glutarate-dependent enzyme WelO5. Nat. Chem. Biol. 12, 636–640 (2016).
Büchler, J. et al. Algorithm-aided engineering of aliphatic halogenase WelO5* for the asymmetric late-stage functionalization of soraphens. Nat. Commun. 13, 371 (2022).
Nakamura, H., Schultz, E. E. & Balskus, E. P. A new strategy for aromatic ring alkylation in cylindrocyclophane biosynthesis. Nat. Chem. Biol. 13, 916–921 (2017).
Chiang, C.-Y. et al. Copper-dependent halogenase catalyses unactivated C−H bond functionalization. Nature 638, 126–132 (2025).

