Lead Principal Investigators: Alexander Iakounine (Yakunin), Alexei Savchenko, Cheryl Arrowsmith, Aled Edwards
In all sequenced genomes, a large fraction of predicted genes (up to 50% in some genomes) encodes proteins of unknown function. Even in yeast, a major model organism, there are over 1,000 uncharacterized genes (Pena-Castillo, L., and Hughes, T., 2007, Genetics 176:7-14). Many global strategies are already being used to infer function (protein interactions, gene expression analysis, protein localization, and gene knock-outs).
This project is devoted to structural and biochemical characterization of unknown yeast proteins and domains. Structural proteomics is focused on the determination of three-dimensional (3D) structures of annotated and un-annotated proteins, whereas biochemical proteomics represents a straightforward approach to find the biochemical function of unknown genes by direct analysis of biochemical activity of the purified protein. Both approaches have already revealed many unexpected functional inferences, and their combined use (including work in our laboratory) for functional annotation has been especially effective.

All purified proteins are submitted to crystallization trials and/or tested for amenability to NMR analysis by heteronuclear single quantum coherence (HSQC) spectroscopy. Successfully crystallized proteins are produced as Se-Met or heavy metal (Hg, Pt) derivatives, and their multi-wavelength anomalous diffraction (MAD) datasets are produced at the synchrotron radiation source (Advanced Photon Source, Structural Biology Center, Argonne National Laboratory, USA). Small hypothetical proteins (<23 kDa) with good HSQC spectra are labeled with 13C and 15N, and resonances are assigned by using conventional triple resonance techniques. Obtained 3D structures of hypothetical proteins are analyzed for the presence of putative active sites and are compared (DALI search) with the structures of known proteins available in the PDB to identify structural homologues and to generate hypotheses about possible biochemical function of these proteins. Our Structural Proteomics lab (Savchenko, Arrowsmith, Edwards) has gained significant experience in structural characterization of unknown proteins and produced 17 structures of yeast proteins (including 12 structures from this project) (Table 2). Our goal here is to generate structures and hypotheses for at least 25 uncharacterized yeast proteins representing different protein families (Table 3).
Proteins that have detectable activity are further characterized using secondary screens with natural substrates (substrate profiling) (Fig. 3). Specifically, phosphatases are screened for activity against a panel of 91 phosphorylated substrates (Table 4), phosphodiesterases with 22 substrates (Table 5), and esterases with 37 substrates (Table 6). The spectrophotometric assays are developed for 96-well plates, can be performed quickly and require only a few micrograms of protein. We demonstrated the feasibility and merits of this approach for hydrolases and oxidoreductases, two very broad and important classes of enzymes, and identified over 200 new enzymes.
General and specific enzymatic assays are also used to rapidly test the hypotheses generated by the Structural Proteomics and other groups about the biochemical function of particular proteins. All proteins with identified enzymatic activity are characterized biochemically (substrate and metal profiles, reaction products, cofactors, inhibitors) and their kinetic parameters (Km, kcat, kcat/Km) are obtained to determine if they correlate with the in vivo concentrations of these metabolites. In combination with information produced by other groups of this project (genetic arrays, protein interactions), the obtained biochemical data are used to generate models about cellular roles of these proteins. In this project, we identified enzymatic activity in over 30 yeast proteins and biochemically characterized all soluble members of the HAD-like phosphatase family (Table 7, Table 8).
We invite all researchers wishing to check their proteins for catalytic activity to send them to us for screening (conditions and terms of collaboration, Fig. 4).