The regulatory proteome

For a living cell to function in its environment, a large number of regulatory processes are needed, including regulation of cell proliferation, cell differentiation and cell death. The underlying mechanisms include regulation of gene expression, as well as, different post-translational modifications that control for example activity, stability, localization or degradation of the protein. An important class of regulatory proteins are transcription factors that determine when genes are switched on and off. About 1500 human transcription factors are known and these proteins are often differentially expressed in different parts of the human body.

Regulation of gene expression

Transcription factors are regulatory proteins, and they are considered to be the most diverse and important mechanism of gene regulation. According to the TF class database and with data in both the UniProt and Ensembl databases, 1496 human genes are classified as transcription factors. They have DNA-binding domains that bind, specifically and with extreme affinity, to consensus DNA sequences and thereby activate (or in rare cases inhibit) transcription. Transcription factors are classified into families either based on the highly conserved sequences of the DNA binding domains, or on their three-dimensional protein structure. These structural motifs result in their specificity for the consensus sequence and the major classes include 778 proteins with zinc-coordinating DNA-binding domains (zinc-finger proteins), 173 proteins with basic domains (helix-loop-helix and leucine-zipper factors), and 392 proteins with helix-turn-helix domains (homeodomain factors). In Table 1, the transcription factors are classified according to structural motif as in the TFclass database.

Table 1. Structural classification of transcription factors.

Structural motif Number of genes
Yet undefined DNA-binding domains 19
Basic domains 173
Zinc-coordinating DNA-binding domains 778
Helix-turn-helix domains 392
Other all-alpha-helical DNA-binding domains 46
alpha-Helices exposed by beta-structures 13
Immunoglobulin fold 62
beta-Hairpin exposed by an alpha/beta-scaffold 14
beta-Sheet binding to DNA 5
beta-Barrel DNA-binding domains 3

Examples of transcription factors from different structural classes

The zinc-finger is a structural motif in which one or more zinc ions stabilize the protein fold as exemplified by the three-dimensional schematic representation of the estrogen receptor ESR1 (purple with zinc-ions in red) binding to DNA. ESR1 is a nuclear hormone receptor, here shown to be expressed in glandular cells and cells in endometrial stroma of the uterus by staining with the antibody CAB000037.



Zink-finger

The structural motif known as the leucine-zipper consists of a leucine repeat region, which forms an alpha helix with a hydrophobic region responsible for dimerization. Here exemplified by a three-dimensional schematic representation of the proto-oncogene JUN (purple) binding as a homodimer to DNA.

JUN is a basic leucine-zipper factor, here shown to be expressed in glandular cells of the colon by immunohistochemical staining by using the antibody CAB007780.



Leucine-zipper


JUN

The helix-turn-helix motif is a DNA binding motif composed of two α-helices, which make contacts with DNA and are joined by a short turn. The three-dimensional schematic representation shows the transcription factor GBX1 (purple) binding to DNA. GBX1 is a homeo-domain factor, here shown to be expressed in follicle cells of the ovary by using the antibody HPA055783.



Helix-turn-helix

Post-translational modifications of proteins

Post-translational modifications (PTM) are chemical modifications such as phosphorylation, glycosylation, ubiquitination, nitrosylation, methylation, acetylation, lipidation and proteolysis, which regulate activity, stability, localization and interaction of proteins. Most modifications are mediated by enzymes, which add or remove functional groups, proteins, lipids or sugars to or from amino acid side chains or cleave peptide bonds to remove specific sequences or subunits.

Phosphorylation

Phosphorylation is one of the most important post-translational modifications and plays a critical role in regulation of cell cycle, growth and differentiation, apoptosis and signal transduction pathways. It is a reversible process involving phosphorylation and dephosphorylation of a variety of substrates including proteins, lipids and carbohydrates. Protein activity and function are commonly regulated by phosphorylation on serine, threonine or tyrosine residues, which functions either by inducing conformational changes that regulate the catalytic activity or by recruiting other proteins that bind and recognize phosphomotifs. Kinases are the proteins mediating phosphorylation, this large group is predicted to include 501 enzymes.

TSSK1B is a serine/threonine-protein kinase required during spermatid development, here shown to be expressed in mature sperm in epididymis and in testis. The protein expression is visualised by immunohistochemistry using the antibody HPA027827.


TSSK1B - epididymis

TSSK1B - testis

Relevant links and publications

Uhlén M et al., Tissue-based map of the human proteome. Science (2015)
PubMed: 25613900 DOI: 10.1126/science.1260419