Monday, January 3, 2011

Protein Data Bank

   The Protein Data Bank (PDB) is a repository for the 3-D structural data of large biological molecules, such as proteins and nucleic acids, which makes it a key resource in areas of structural biology such as structural genomics. The data, typically obtained by X-ray crystallography or NMR spectroscopy, are submitted by biologists and biochemists from around the world, since most major scientific journals and some funding agencies, such as the NIH in the USA, now require scientists to submit their structure data to the PDB. The archive contains information about experimentally determined structures of proteins, nucleic acids, and complex assemblies. As a member of the Worldwide Protein Data Bank (wwPDB), the RCSB PDB curates and annotates PDB data according to agreed-upon standards. Here are some examples of proteins and their 3-D structures:

1.  HtrA
HtrA, also known as DegP and probably identical to the Do protease, is a heat shock-induced serine protease that is active in the periplasm of Escherichia coli. Homologues of HtrA have been described in a wide range of bacteria and in eukaryotes. Its chief role is to degrade misfolded proteins in the periplasm. Substrate recognition probably involves the recently described PDZ domains in the C-terminal half of HtrA and, we suspect, has much in common with the substrate recognition system of the tail-specific protease, Prc (which also possesses a PDZ domain). The expression of htrA is regulated by a complex set of signal transduction pathways, which includes an alternative sigma factor, RpoE; an anti-sigma factor, RseA; a two-component regulatory system, CpxRA; and two phosphoprotein phosphatases, PrpA and PrpB. Mutations in the htrA genes of Salmonella, Brucella and Yersinia cause decreased survival in mice and/or macrophages, and htrA mutants can act as vaccines, as cloning hosts and as carriers of heterologous antigens.





2. LonA
The structure of a recombinant construct consisting of residues 1-245 of Escherichia coli Lon protease, the prototypical member of the A-type Lon family, is reported. This construct encompasses all or most of the N-terminal domain of the enzyme. The structure was solved by SeMet SAD to 2.6 Å resolution utilizing trigonal crystals that contained one molecule in the asymmetric unit. The molecule consists of two compact subdomains and a very long C-terminal α-helix. The structure of the first subdomain (residues 1-117), which consists mostly of β-strands, is similar to that of the shorter fragment previously expressed and crystallized, whereas the second subdomain is almost entirely helical. The fold and spatial relationship of the two subdomains, with the exception of the C-terminal helix, closely resemble the structure of BPP1347, a 203-amino-acid protein of unknown function from Bordetella parapertussis, and more distantly several other proteins. It was not possible to refine the structure to satisfactory convergence; however, since almost all of the Se atoms could be located on the basis of their anomalous scattering, the correctness of the overall structure is not in question. The structure reported here was also compared with the structures of the putative substrate-binding domains of several proteins, showing topological similarities that should help in defining the binding sites used by Lon substrates.




3. Clp
Members of the Clp family of molecular chaperones and protease regulatory subunits contain homologous regions with properties expected for substrate-binding domains. Fragments corresponding to these sequences are stably and independently folded for Lon, ClpA, and ClpY. The corresponding regions from ClpB and ClpX are unstable. All five fragments exhibit distinct patterns of binding to three proteins that are protease substrates in vivo: the heat shock transcription factor σ32, the SOS mutagenesis protein UmuD, and Arc repressor bearing the SsrA degradation tag. Recognition of UmuD is mediated through peptide sequences within a 24-residue N-terminal region, whereas recognition of both σ32 and SsrA-tagged Arc requires sequences at the C terminus. These results indicate that the Clp proteases use the same mechanism of substrate discrimination and suggest that these related ATP-dependent bacterial proteases scrutinize accessible or disordered regions of potential substrates for the presence of specific targeting sequences.
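
To give a feel for what the "3-D structural data" in the PDB look like, here is a minimal Python sketch that reads atom coordinates from text in the classic fixed-column PDB file format. The `Atom` class, the `parse_atoms` function and the sample lines are my own illustration; the coordinates are made up and do not come from any real entry, including the three proteins above.

```python
from dataclasses import dataclass


@dataclass
class Atom:
    name: str            # atom name, e.g. "CA" for an alpha carbon
    residue: str         # three-letter residue name, e.g. "MET"
    chain: str           # chain identifier
    residue_number: int  # residue sequence number
    x: float             # orthogonal coordinates in angstroms
    y: float
    z: float


def parse_atoms(pdb_text: str) -> list[Atom]:
    """Collect atoms from ATOM/HETATM records using their fixed columns."""
    atoms = []
    for line in pdb_text.splitlines():
        # Skip everything that is not a coordinate record (HEADER, REMARK, ...).
        if not line.startswith(("ATOM  ", "HETATM")):
            continue
        atoms.append(Atom(
            name=line[12:16].strip(),         # columns 13-16
            residue=line[17:20].strip(),      # columns 18-20
            chain=line[21],                   # column 22
            residue_number=int(line[22:26]),  # columns 23-26
            x=float(line[30:38]),             # columns 31-38
            y=float(line[38:46]),             # columns 39-46
            z=float(line[46:54]),             # columns 47-54
        ))
    return atoms


# Invented sample records, for illustration only (not a real PDB entry).
SAMPLE = "\n".join([
    "REMARK made-up coordinates for illustration",
    "ATOM      1  N   MET A   1      11.104  13.207   2.100  1.00 20.00",
    "ATOM      2  CA  MET A   1      12.560  13.329   2.250  1.00 20.00",
    "ATOM      3  CA  GLY A   2      15.000  15.000   2.250  1.00 20.00",
])

atoms = parse_atoms(SAMPLE)
alpha_carbons = [a for a in atoms if a.name == "CA"]
for atom in alpha_carbons:
    print(atom.residue, atom.residue_number, atom.x, atom.y, atom.z)
```

Each coordinate line describes one atom, so a full structure is simply a long list of such records; keeping only the alpha carbons, as in the last lines, is a common way to get one point per residue and trace the protein backbone.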




