Phosphotyrosine recognition domains: the typical, the atypical and the versatile
© Kaneko et al.; licensee BioMed Central Ltd. 2012
Received: 13 August 2012
Accepted: 9 October 2012
Published: 7 November 2012
Skip to main content
© Kaneko et al.; licensee BioMed Central Ltd. 2012
Received: 13 August 2012
Accepted: 9 October 2012
Published: 7 November 2012
SH2 domains are long known prominent players in the field of phosphotyrosine recognition within signaling protein networks. However, over the years they have been joined by an increasing number of other protein domain families that can, at least with some of their members, also recognise pTyr residues in a sequence-specific context. This superfamily of pTyr recognition modules, which includes substantial fractions of the PTB domains, as well as much smaller, or even single member fractions like the HYB domain, the PKCδ and PKCθ C2 domains and RKIP, represents a fascinating, medically relevant and hence intensely studied part of the cellular signaling architecture of metazoans. Protein tyrosine phosphorylation clearly serves a plethora of functions and pTyr recognition domains are used in a similarly wide range of interaction modes, which encompass, for example, partner protein switching, tandem recognition functionalities and the interaction with catalytically active protein domains. If looked upon closely enough, virtually no pTyr recognition and regulation event is an exact mirror image of another one in the same cell. Thus, the more we learn about the biology and ultrastructural details of pTyr recognition domains, the more does it become apparent that nature cleverly combines and varies a few basic principles to generate a sheer endless number of sophisticated and highly effective recognition/regulation events that are, under normal conditions, elegantly orchestrated in time and space. This knowledge is also valuable when exploring pTyr reader domains as diagnostic tools, drug targets or therapeutic reagents to combat human diseases.
Intracellular communication is transmitted via networks of molecules that execute information transfer using protein-mediated interactions. Post-translational modifications (PTMs) such as protein phosphorylation, acetylation, methylation and ubiquitination confer spatiotemporal dynamics to cell signaling . Among these PTMs, the tyrosine phosphorylation signaling system in eukaryotes, especially in mammalian species, has been extensively studied owing to its importance in numerous cellular functions including differentiation, proliferation, motility and apoptosis as well as its therapeutic potential. In particular, mutations and aberrant expression of kinases are frequently associated with tumourigenesis [2, 3].
Signaling proteins often possess a cassette-like architecture made up of catalytic domains and/or protein interaction modules . One important group of protein-protein interaction modules are the autonomous domains that recognize phosphorylated tyrosine (pTyr) residues at specific sites on their target molecules . These pTyr-binding protein modules and their targets are a part of an elaborate pTyr signaling system that consists of three major components that help relay molecular messages . The pTyr signaling system is activated when a stimulus reaches catalytic proteins that act as “writers” of phosphorylation, the protein tyrosine kinases (PTKs). Most PTKs are phosphorylated on themselves to attain an active state, and subsequently phosphorylate other substrate proteins. A second group of proteins that contain modular domains are capable of recognizing or “reading” this phospho-modification information and thereby linking the kinase signal to downstream molecules. The phosphorylation can be subsequently “erased” by a third group of proteins, the protein tyrosine phosphatases (PTPs), therefore terminating the signal [7, 8].
The human genome harbours 90 PTKs , hundreds of pTyr-recognition domains that include 121 members of the Src homology 2 (SH2) domain family [10–12], and more than 10,000 tyrosine phosphorylation sites [12, 13]. These signaling components form an enormous network of pTyr signaling that is both robust and dynamic. Nature is equipped with multiple strategies to reduce possible misfiring of pTyr signals due to the complexity of pTyr signaling network. First, PTKs can remain inactive until they are stimulated by a proper cue, most typically by association of a specific ligand molecule to the PTK. For example, receptor tyrosine kinases (RTKs), transmembrane proteins consisting of an extracellular ligand binding site and an intracellular tyrosine kinase domain, are designed inactive until a ligand binds to the extracellular site of the RTK, which often induces RTK oligomerization . Structural studies have also revealed the presence of both active and inactive conformations for many cytoplasmic and receptor kinases [15, 16]. Moreover, the level of PTP activity is very high in cells, thereby ensuring that pTyr sites can be rapidly dephosphorylated .
Second, each component of the pTyr signaling circuit, i.e., a PTK, PTP or pTyr-binding module, possesses substrate or ligand recognition specificity to narrow down potential interaction partners [18–23]. Since all three components are the modules that bind to linear motif peptides, the specificity at the molecular or atomic level is defined by the ligand peptide sequence and can also be contributed by conformation of the peptide. The interaction specificity is further enhanced by spatiotemporal regulation of the network components, including tissue-specific or cell cycle-dependent protein expression, protein localization to subcellular compartments or a scaffold protein, and protein inactivation involving receptor internalization (endocytosis) and/or protein degradation. These multiple layers of regulatory mechanisms are essential for coordinating such a complex functional network [1, 15, 24, 25]. However, an unintended activation of pTyr signaling, or misfiring, may occur when a circuit component malfunctions, most commonly due to mutations, overexpression or loss of a component or an element, as we will discuss some cases below.
This review will focus on the pTyr "reader" proteins contributing to this complex system. In addition to the SH2 and phosphotyrosine-binding (PTB) domains, the two archetype domain families known for pTyr binding, recent studies have shown that at least a handful of additional protein modules are capable of reading out the tyrosine phosphorylation. Here we review and explore the structure, function, specificity and therapeutic potentials of a number of typical and atypical members of the superfamily of pTyr-binding protein modules.
The Src homology 2 (SH2) domains, a non-catalytic module containing ~100 amino acids, was first discovered by insertion-mutation analysis of the v fps/fes oncogene from the Fujinami sarcoma virus  (for historical perspectives of tyrosine phosphorylation studies, refer to reviews by Pawson  and Hunter ). Soon thereafter, the SH2 domain was identified in oncogenes such as v crk and in the endogenous cytoplasmic proteins phospholipase Cγ1 (PLCγ1) and the Ras GTPase activating protein (RasGAP) [28, 29]. SH2 domains have since been identified in a wide range of eukaryotic species, including yeast, but primarily in metazoans [7, 11]. A recent tally finds 121 SH2 in 111 proteins in the human genome . Proteins containing SH2 domains include those that function as kinases, adaptors, phosphatases, ubiquitin ligases, transcription factors, guanine nucleotide exchange factors and phospholipid-based secondary signaling molecules [5, 12, 30]. Studies in almost two decades have demonstrated the tyrosine phosphorylation-dependent nature of typical SH2 domain-ligand interactions [31–34], the central role played by SH2 domains in connecting activated receptor tyrosine kinases, such as the epithelial growth factor receptor (EGFR) and the platelet-derived growth factor receptor (PDGFR), to cytoplasmic signaling molecules [29, 35]. In addition, kinase SH2 domains are essential in regulating the catalytic activity of cytoplasmic kinases as exemplified for the Src family as well as the Fes and Abl kinases . A growing picture illustrates that kinase SH2 domains may regulate catalytic activity utilizing diverse mechanisms [16, 36]. These and other lines of work establish the SH2 domain as a key player in the cellular signaling system in a pTyr-dependent manner .
SH2 domains are equipped for the specific recognition of a subset of pTyr–containing ligands [20, 22, 53–55]. A number of studies have established that binding affinity of an SH2 domain to a pTyr-containing ligand is moderate, with the typical affinity range between 0.1 μM and 10 μM for equilibrium dissociation constant values (K D) [56–60]. This moderate affinity is considered to be crucial for allowing transient association and dissociation events in cell signaling. Indeed, artificially increased affinity using an engineered SH2 domain (called the pTyr superbinder) has been shown to cause detrimental consequences to cells . While the pTyr-binding pocket, which is present in the N-terminal half and highly conserved in the SH2 domain, provides the basal affinity for ligand binding with approximately a half of the total binding free energy , the hydrophobic pocket present in the C-terminal half of an SH2 domain provides specificity towards a hydrophobic residue in a peptide ligand. Recent studies suggest that the major positional specificity of an SH2 domain is conferred by the EF and BG loops which regulate ligand access to specificity pockets in an SH2 domain. Thus, distinct loop composition and configuration determines whether an SH2 domain has specificity for a residue at the second, third or fourth position C-terminal to the pTyr residue [10, 11, 20, 40, 63, 64]. The wealth of experimentally solved SH2 domain-ligand complex structures allows the systematic ultrastructural investigation into how variations in the specificity pocket leads to distinct specificities with bioinformatics tools. Meanwhile, additional subtlety and sophistication in pTyr-peptide discrimination have been demonstrated by a recent study highlighting the importance of permissive and non-permissive residues proximal to pTyr in the ligand sequence . The study provided evidence that local sequence context provides an additional layer of specificity enhancement beyond the general sequence motifs uncovered by regular degenerate peptide library screens [20, 66].
The yeast genome encodes only one SH2 domain-containing protein, the transcription factor Spt6 . The C-terminal region of the protein, initially predicted to contain a single SH2 domain, binds to the C-terminal domain (CTD) of the RNA polymerase II . The CTD consists of an abundance of repeats (52 in human, 26 in yeast) of the heptad sequence YSPTSPS, in which each tyrosine, serine, and threonine residues can be phosphorylated. Furthermore, prolines are subjected to cis trans isomerization, adding another layer of complexity and dynamics to the CTD [67, 68]. The SH2 domain of Spt6 has been considered a prototypical SH2 domain for several reasons. (I) Spt6 is present in yeast that does not contain a PTK, (II) Yoh et al. demonstrated that the Spt6 SH2 domain region binds to Ser-phosphorylated CTD, and (III) Spt6 is conserved in eukaryotes including slime moulds and plants [7, 11, 52, 69]. Indeed, the recently solved structures of the C-terminal region of Spt6 revealed that the region actually contains two SH2 domains in tandem that are intimately associated with each other (Figure 1C) [48–51]. The phospho-binding pocket of the N-terminal SH2 domain, which contributes to CTD phosphopeptide binding, contains an arginine that is invariant among eukaryotic SH2 domains [49, 50, 52]. In contrast, the corresponding pocket in the C-terminal SH2 domain lacks an arginine, and NMR titration studies suggest that this pocket is not used for peptide binding. Instead, a positively charged patch on the surface of the C-terminal SH2 domain participates in CTD binding (Figure 1C) [49, 50]. Interestingly, the tandem SH2 domains have shown low affinities (with dissociation constants in the millimolar range) for both pTyr- and pSer-containing peptides derived from the CTD . It is proposed that the binding of the Spt6 tandem SH2 domains to the polymerase may be significantly enhanced in vivo as the CTD contains numerous repeats of the phosphorylated heptad sequence that can increase the effective local concentration of the binding target for the tandem SH2 domains . From an evolutionary standpoint, it is likely that the Spt6 SH2 domains provided the prototype for a family of modular domain for the phospho-specific interaction that have later evolved to be specific for phosphotyrosine. Notwithstanding this viewpoint, SH2 domains have been identified in abundance in protozoans such as choanoflagellates [7, 8, 70].
Although SH2 domains are the largest group of pTyr-binding modules [5, 10], it has been shown that certain SH2 domains have the ability to bind ligands in a tyrosine phosphorylation-independent manner [10, 34, 71–78]. For example, the SH2 domains of tensins and SAP bind to both phosphorylated and non-phosphorylated forms of ligand peptides [73–75, 78, 79]. Some SH2 domains feature a secondary site located outside of the primary ligand-binding site to engage a pTyr-ligand protein using two sites [76, 80]. In this regard, Anderson and colleagues identified a second binding site on the C-terminal SH2 domain of the phosphoinositide 3-kinase p85α subunit that binds the Raf family kinase member A-Raf in a phosphorylation-independent manner . The authors posit that the second binding site increases the target selectivity of the SH2 domain. Biochemical and structural analysis illustrated pTyr-independent interaction between the N-terminal SH2 domain of PLCγ1 and the tyrosine kinase domain of the fibroblast growth factor receptor (FGFR) via a secondary binding site, in addition to the canonical pTyr-binding primary site . In another scheme, an SH2 domain may contain sequence motifs that are recognized by distinct types of modular domains. A long, proline-rich insertion into the DE loop of the Crk SH2 domain (Figure 1B) is recognized by the Src homolgy 3 (SH3) domain of the Abl kinase . Sequence analysis of Crk orthologs suggests that this insertion is of recent evolutionary origin as it is identified only in mammalian species . This provides an example of loop evolution that enriches the pTyr signaling network by introducing a novel interaction.
Among the 90 PTKs in the human genome, 32 are cytoplasmic tyrosine kinases . Notably, 28 of them also contain an SH2 domain in tandem with the kinase domain (with the exceptions of TNK1, ACK, FAK and Pyk2). This suggests a strong physical-functional relationship between the kinase and SH2 domains. Mayer et al. demonstrated that a kinase-associated SH2 domain promotes phosphorylation of substrates, which they termed processive phosphorylation . In the course of processive phosphorylation, the kinase domain of a cytoplasmic tyrosine kinase phosphorylates a substrate on the tyrosine site, that is then tightly bound by the SH2 domain, allowing the associated kinase domain to carry out further phosphorylation of the substrate (or a second substrate molecule associated with the first substrate) [19, 83]. Thus the assembly of SH2-kinase domain cassettes allows the physical association required for function of the two domains [4, 5, 15]. Moreover, recent bioinformatics analysis of protein sequences for 330 bona fide SH2-binding motifs revealed that tyrosine phosphorylation sites in human proteome are significantly enriched in the vicinity of the SH2 domain-binding sites , which may indicate that processive phosphorylation is a rather common phenomenon. Processive phosphorylation is perhaps unique to the SH2-kinase domain combination, because the PTB domain (see below) does not coexist with a tyrosine kinase domain in any human protein . As we discuss later, direct intramolecular interaction between the kinase and SH2 domains is essential for activity in some PTKs, which represents another physical and functional interaction between the two domains and a potential target for treatment in cancer cells.
The phosphotyrosine-binding (PTB) or phosphotyrosine-interacting domain, first identified in the adaptor protein Shc [97–99], is the second largest family of pTyr-binding modules. Approximately 60 human proteins contain a PTB domain . Although none have been identified from yeast or plants so far, two PTB domains have been found in the slime mould Dictyostelium discoideum, and 31 in the choanoflagellate Monosiga brevicollis. The proteins harbouring a PTB domain strictly act as adaptors or molecular scaffolds , with possible exceptions at least in M. brevicollis, where multiple tyrosine kinases that contain a PTB domain have been identified . Indeed, biochemical studies indicate that the PTB domain of the M. brevicollis tyrosine kinase HMTK1 assists in targeting of a pTyr-containing substrate peptide . While a third of all PTB-containing proteins identified contain a single copy of the domain, the remaining two-thirds feature a PTB domain occurring in combination with other modular domains such as SH2, SH3, PDZ or SAM . PTB domain-containing proteins are involved in a host of signaling processes, including those involving receptor tyrosine kinases, cytokines and lipoprotein receptors, and cellular functions such as cell division and cell-cell adhesion .
Similar to the SH2 domain, PTB domains may bind tyrosine phosphorylation sites in cellular proteins. Proteomic studies have revealed that the two domain families may in fact target overlapping pTyr sites with micromolar affinities [57–59, 113]. However, unlike SH2 domains, the specificity of a PTB domain is primarily focused towards amino acids N-terminal to the pTyr residue in a peptide, most commonly in an NPXpY or NPXY sequence motif . Moreover, the majority of PTB domains prefer a non-phosphorylated tyrosine residue. In fact, phosphorylation of a peptide is inhibitory to binding to some members of the Dab-1-like PTB domain group . Interestingly, a number of PTB domains can bind the head groups of inositol phosphates with varying affinities, a function observed also in some of the structurally similar PH domains [85, 110]. PTB domains appear to commonly bind to phospholipids through a patch of basic residues on the surface of the domain, although the actual residues that bind to the phospholipid are variable or not resolved for most PTB domains [85, 103]. Interestingly, a phosphopeptide and a phospholipid can compete against each other in binding to a PTB domain, as has been shown for the Shc PTB domain .
Numb is an adaptor and endocytic protein that plays an important role in asymmetric cell division and embryogenesis . It contains a Dab-1-like PTB domain indispensable for its function. Biochemical and structural studies have suggested that the Numb PTB domain is capable of binding to either non-phosphorylated sequences that contain an NXX[Y/F] motif or pTyr-containing sequences [104, 111, 116, 117]. Structural analysis unraveled the molecular basis of promiscuous binding by the Numb PTB domain to peptides that possess distinct primary and secondary structures [104, 111]. Figure 4B shows the complex structure of the Numb PTB domain in complex with a non-phosphorylated peptide that contains the sequence NMSF derived from the Numb-associated kinase (NAK) . Interestingly, mutation of the sequence from NMSF to NAAF resulted in 15-fold increase in binding affinity, implying that the physiological association between the Numb PTB domain and NAK may not be optimized for high affinity . Moreover, the Numb PTB domain provides a binding site for another domain in an isoform-specific manner. The p72 and p66 isoforms of Numb contain an 11-residue insert within the PTB domain after the α2 helix, which serves as a binding site for the PDZ domain of LNX (Ligand-of-Numb), an E3 ubiquitin ligase required for isoform-specific Numb degradation . Recently, we found that Numb serves as a new player in epithelial to mesenchymal transition, a critical step in cancer progression and metastasis [94, 119]. The Numb PTB domain binds to the N751VYYY motif located in the cytoplasmic region of E-cadherin, but phosphorylation of the motif by Src results in dissociation of the PTB domain, suggesting that Src activation negatively regulates the interaction between Numb and E-cadherin .
Unlike SH2 domains, which often coexist with a kinase or phosphatase domain in a protein, the human PTB domain-containing proteins invariably lack a catalytic domain, with the single exception of the presence of a PTP-like domain in the tensin family (see a later section for detail) [85, 120, 121]. Tensins 1–4 are focal adhesion proteins that contain an SH2 domain and a PTB domain in tandem . Similar to the tensin-like lipid and tyrosine phosphatase PTEN , tensins 1–3 have been identified as tumour suppressors. Tensins interact with another tumour suppressor, deleted in liver cancer 1 (DLC1), and suppress focal adhesions and cell migration in various cancers [122, 124]. The SH2 domain of tensin3 is itself phosphorylated by the Src kinase . Lowy and colleagues suggest that tyrosine phosphorylation on the tensin3 SH2 domain provides a mechanism for controlling ligand binding and that phosphorylation of the SH2 domain by the Src kinase endows tensin3 with proto-oncogenic properties . The study also indicated that binding to DLC1 was not dependent on tyrosine phosphorylation of the SH2 domain of tensin3, whereas binding to some other ligands such as the focal adhesion kinase was enhanced by the SH2 phosphorylation. Another study showed that the SH2 of tensin2 has the ability to bind non-phosphorylated DLC1 . These two observations indicate that the phosphorylation state of the ligand proteins as well as that of the SH2 domain can regulate SH2-ligand interactions in tensins [78, 125]. Moreover, the PTB domain of tensin2 displays a novel peptide binding mode. Although it was observed that the tensin1 PTB domain can bind an NPXY peptide in a canonical manner , as well as a DLC1 peptide via a yet uncharacterized mode , the tensin2 PTB domain, which can also bind the NPXY motif , has been determined to utilize its N-terminal helix (α1) to engage a non-NPXY site in DLC1 . The binding surface in the latter PTB domain is opposite to the canonical NPXY peptide binding site and the peptide ligand adopts an elongated conformation rather than the conventional β-turn structure (Figure 4C) . These observations indicate the presence of peptide binding surface specializations among the tensin family PTB domains.
The PH domain was first described in pleckstrin, a substrate of protein kinase C (PKC) [131, 132]. It has since been identified in a large number of signaling and cytoskeleton-associated proteins. By virtue of its ability to bind phospholipids, in particular inositol phosphates, the PH domain plays an important role in targeting the corresponding protein to the plasma membrane [103, 133]. Interestingly, some PH domains are found to bind to proteins in a phosphorylation-dependent manner [110, 133–137]. The PH domain of GEP100 (also known as BRAG2 or IQSEC1), a guanine nucleotide exchange factor for the small GTP-binding protein Arf6, binds to the EGF receptor residues pTyr1068 and pTyr1086, both of which are part of a YXNQ motif . These two pTyr sites are also known to recruit the adaptor proteins Grb2 and Shc as well as the transcription factor STAT3 via their SH2 and/or PTB domains [139, 140]. However, it has been shown that the GEP100 pathway may not necessarily interfere with the Grb2 pathway in breast cancer . Co-expression of GEP100 and Arf6 turned non-invasive MCF7 cells to become invasive upon EGF stimulation [138, 142]. Therefore, this pathway is a potential new target for breast cancer therapeutics. The crystal structure of the GEP100 PH domain alone has been solved (PDB database entry 3QWM), but a PH domain – pTyr-peptide complex structure is still missing, since synthetic pTyr1068 and 1086 peptides immobilised on a carrier membrane can directly bind to the GEP100 PH domain, but have a surprisingly low binding affinity to it in solution (S.F. et al., unpublished data). The reason for this currently remains unclear. By contrast, the native EGFR receptor is effectively precipitated by the GEP100 PH domain upon activation, i.e., when phosphorylated on pTyr1068 and/or pTyr1086.
Raf-1 kinase inhibitory protein (RKIP), also known as phosphatidylethanolamine binding protein 1, suppresses Raf-1 kinase activity in the MAP kinase pathway . RKIP binds to a 24 amino acid stretch in the N-terminal of the Raf-1 kinase which has a central element (S338SYY341) regulated by phosphorylation . Substitutions in the ligand binding site of RKIP compromise the stability of the phosphorylated Raf-RKIP complex. Additionally, mutation of the Ser residues or Tyr341 on the Raf-1 kinase motif disrupts its interaction with RKIP. This suggests that RKIP contains a novel phospho-amino acid binding domain . The structure of RKIP bound to pTyr shows that RKIP indeed contains a deep pocket molded complementary to the shape of the pTyr side chain (Figure 6B) . NMR titration studies further confirmed that the pocket region is the binding site for the tri-phosphorylated Raf-1 peptide, pS338pSYpY, with a K D value of 45 μM . However, direct, physiological or structural, evidence for complex formation is required before RKIP can be classified as a bona fide pTyr binder.
In the above sections, we have explored that, in addition to the SH2 and PTB domains, there are more than a handful of examples of atypical domains that can recognize and bind pTyr-containing target proteins. So does nature have more domain members capable of pTyr-epitope binding that remain to be identified? Recent advances in proteomic technologies are starting to shed light on this question. Christofk et al. employed the SILAC (stable isotope labeling of amino acids in cell culture) technique in combination with a pTyr-peptide library to identify novel pTyr-binding proteins from mammalian cell lysates . Whereas the majority of proteins identified were SH2 or PTB domain-containing proteins, as expected, the screening also identified pyruvate kinase, a protein that does not have any known pTyr-recognition domain. Pyruvate kinase (PK) regulates the final rate-limiting step in glycolysis that converts cellular metabolism from an anaerobic to an aerobic process. The M2 isoform of the PK (PKM2) is the only form of PK used for glycolysis in cancer cells, and it is also the only one of the four isoforms that has the ability to bind to pTyr targets [167, 168]. The structure of hPKM2 illustrates a homo-tetramer with each monomer binding a fructose-1,6-bisphosphate (FBP) molecule in a site distal to the active site (Figure 6C) [150, 169]. Phosphotyrosine ligand binding may be a key event in modulating PK activity regulated by the allosteric activator FBP . The study demonstrated that PKM2 binds pTyr peptides on the lip of the FBP binding pocket and acts as a negative regulator of PK activity (Figure 6C) . Binding of the phosphopeptide releases FBP from the active tetrameric form of PKM2 and dissociates the enzyme into inactive dimers .
More recently, Christofk et al. identified a handful of potential pTyr-binding proteins from SILAC experiments. These include the serine/threonine-protein kinase WNK1, 5-formyltetrahydrofolate cyclo-ligase, glycerol-3-phosphate dehydrogenase, vimentin, 2,4-dienoyl-CoA reductase, and the T-complex protein 1 subunit η . Although biochemical characterization is required to ascertain if these candidates are true pTyr-binders, results from this study suggest that there may be more pTyr recognition domains or proteins in nature that await to be discovered.
New insights into the molecular basis of cancer were generated already decades ago by the analysis of SH2-domain containing oncoproteins such as v-Src [27, 171]. SH2-domain containing proteins have been implicated in many diseases, including immune-related disorders, metabolic syndromes, osteopathologic conditions and different cancers [10, 172]. Mutations in the SH2 domains that cause malignancies fall usually into one of three categories: missense mutations of amino acids involved in target binding, mutations in residues that regulate catalytic activity or mutations on the SH2 domain, distal to the module core, that affect the architectural integrity of the SH2 domain . A number of disease-causing mutations on SH2 domains have been reported as compiled by Liu et al.  and Lappalainen, et al. , including mutations related to the Noonan syndrome, juvenile myelomonocytic leukemia and the X-linked lymphoproliferative syndrome. X-chromosome-linked agammaglobulinemia (XLA) is an example of a disease where mutations are present in the SH2 domain of the Bruton's tyrosine kinase (BTK), and many of these mutations are located on the pTyr-ligand binding site, including the indispensable arginine of the pTyr-binding pocket as well as BG loop residues [16, 173]. Moreover, Hong et al. demonstrated that the BTK SH2 domain binds to phospholipids and the XLA-causing mutations alter lipid binding selectivity . In another example, multiple point mutations within the transcription factor STAT3 SH2 domain have been identified and linked to large granular lymphocytic leukemia and the hyper-IgE syndrome [175, 176]. In conjunction with the identification and study of SH2 domain-related diseases, small molecule inhibitors of SH2 domains, SH2 domain-containing proteins or SH2 binding partners, are being developed with some success as therapeutic reagents, although the development of phosphomimetics faces hurdles due to the strong charge of the phosphate group [177, 178]. Besides SH2 domains, mutations in the PTB domain have also been linked to diseases such as coronary heart disease and type II diabetes .
Biochemical and structural studies have demonstrated that direct intramolecular interactions between the SH2 and kinase domains are required for kinase activation in some PTKs [15, 16, 36]. In the Fes kinase, electrostatic interactions and shape complementarity between residues from the SH2 domain and the αC helix of the kinase domain stabilize the active state of the kinase. Similarly, in the Abl kinase, interaction between Ile164 of the SH2 domain and Thr291/Tyr331 in the kinase domain N lobe is essential for activity . Subsequently, this interface has been investigated intensively as a target for cancer intervention. A successful approach was the creation of a protein-based agent called the monobody, derived from the fibronectin type III (FN3) domain with engineered loops designed for high affinity binding to a specific target molecule . Koide and colleagues created a monobody engineered to selectively bind the SH2 domain of the Abl kinase . They further generated another monobody that targets the SH2-kinase interface that involves Ile164 and then connected the two monobodies with a linker. The resulting tandem monobody disrupted the interface between the SH2 and kinase domains to inhibit the catalytic activity of the deregulated fusion kinase Bcr-Abl, both in vitro and in vivo.
Due to its independently folding nature, a modular domain can often be successfully used as a tool in proteomic research. Jadwin et al. termed this "domainomics” . In particular, applications of phosphotyrosine-recognition domains have been reported for therapeutic and diagnostic purposes. SH2 and PTB domain-based probes have been developed and used to profile global pTyr landscapes using innovative assays such as the peptide dot blotting, Far Western blotting and oligonucleotide tagged multiplex (OTM) assays [183–186]. The OTM assay is designed for quantitative multiplexed profiling of tyrosine-phosphorylated proteins from cell extracts using DNA-tagged SH2 domains, and has been successfully harnessed to discriminate tyrosine phosphorylation states in tumour cell lines and leukemia samples from patients . Similarly, Machida et al. reported application of domain-based probe technology to profile lung cancer cell lines, and demonstrated significant correlation between EGFR mutations and the Grb2 SH2 and ShcA PTB domain probe signals, suggesting a diagnostic value . A potential for application of SH2 domains as non-invasive intracellular imaging or as inhibitor reagents has also been demonstrated. An SH2 domain from Grb2 or PLCγ1 fused with a protein transduction tag (the TAT tag)  was used for SH2 domain delivery into cells. These TAT-tagged SH2 domains showed anti-tumour effects [189, 190]. In addition, the TAT-tagged Grb2 SH2 domain can be used as a molecular probe for monitoring EGFR localization in cells .
SH2 domains have also been incorporated as a biosensor molecule that functions in live cells. Wang et al. created a bipartite Src reporter biosensor protein that contains an SH2 domain and a Src kinase substrate fused, respectively, with the cyan and green fluorescent proteins . Upon phosphorylation of the substrate by endogenous Src kinase, the SH2 domain of the biosensor binds to the substrate in cis, and produces a change in the emission spectrum. Using the Src reporter, the authors successfully visualized kinase activity in cells. In another example, an engineered adenocarcinoma-derived cell line that expresses an EGFR biosensor has been developed . This cell line expresses the Grb2 SH2 domain fused to the green fluorescent protein, and can be used to monitor EGFR internalization upon EGF stimulation. The cell line may provide a useful tool for high-throughput drug screening since effects of drug candidates on modulating EGFR activity can be monitored in live cells.
The genome sequence analysis of M. brevicollis, a unicellular choanoflagellate that possesses a PTK signalling system , brought a big surprise since the numbers of PTKs, PTPs and SH2 domains are comparable to or greater than those in mammals [6, 100]. Numerous PTK, PTP, SH2 and PTB domain-containing proteins have been also identified in other pre-metazoan species [196, 197], challenging the simple notion of "expansion from yeast to man" in terms of the evolution of phosphotyrosine signalling. Since M. brevicollis contains a plethora of putative PTK toolkit proteins, many of which have no orthologs in humans [6, 100, 196], it suggests that multiple evolutionary roads, which remain to be elucidated, may have been used to mix and match different pTyr signalling components in order to build the PTK signalling pathways and networks unique to a particular species. For example, bacteria may possess their own phosphotyrosine signalling systems as bacterial tyrosine kinases are found to be unrelated to eukaryotic PTKs [198, 199]. Nevertheless, bacteria do not only rely on their own repertoire of pTyr signalling components. Some pathogenic bacteria, for example the enteropathongenic Escherichia coli (EPEC) and Helicobacter pylori, are known to hijack pTyr signalling system components of their host by injecting a virulence factor into the host cell during infection [200, 201]. The injected bacterial protein is then phosphorylated by a host PTK, and subsequently recruits SH2 domains of host proteins to rewire the host signalling network for the bacteria's benefit. The injected bacterial effectors have evolved to be readily phosphorylated by host PTKs, and to recruit a number of host SH2 domains after being phosphorylated [201, 202]. For instance, EPEC injects a transmembrane effector protein Tir (translocated intimin receptor) to host cells upon infection. The cytoplasmic YDXV motif of Tir is then phosphorylated and recruits the SH2 domain of the adaptor protein NCK, which results in pathological actin reorganization [200, 203]. It is noteworthy that a similar scenario is seen in the vaccinia virus. A tyrosine residue in the viral membrane protein named A36R is phosphorylated by host PTKs, and the viral protein subsequently recruits the NCK and Grb2 SH2 domains to manipulate downstream actin regulation of the host cell [204, 205]. Intervention of pTyr-mediated host-pathogen interactions could be a novel therapeutic strategy, if sufficiently selective agents can be developed.
Much of what has been learnt in past decades about pTyr reader domains, their interactions and the networks that they serve can now be utilised and also warrants further exploration into several directions. Firstly, we need to make an effort to put more ‘puzzle pieces’ together to start seeing a more complete picture. This includes analysing in much more detail how the compartmentalisation of pTyr writers and readers, i.e. the architecture of the pTyr signalling machinery, generates highly specific signals in vivo. Protein modification reader domains should also be useful in a range of clinical settings, for example for molecular diagnoses and monitoring, but this potential remains untapped in clinical routine settings. As demonstrated for the SH2 domains, e.g., specificity-switching mutants or affinity-enhanced "pTyr superbinders" [40, 61, 63], engineering of pTyr-binding domains may yield protein-based reagents with desired specificities and affinities towards tyrosine phosphorylated targets, which may provide an array of novel agents for research and clinical purposes. Furthermore, we are just beginning to realise that at least some protein modification reader domains have the capacity to decipher combinations of modifications rather than recognising just single site changes, and further research in this area promises to provide additional clues to understanding the elegant ways in which cells manage to read and process a huge number of incoming signals and to translate them into a wide spectrum of biochemical and biological responses. Hence, typical and atypical pTyr reader domains should be considered to be ‘old dogs up to new tricks’ which we are just starting to grasp.
Work from the Li laboratory was supported by grants from the Canadian Cancer Society, Canadian Institute of Health Research and Genome Canada. SSL holds a Canada Research Chair in Functional Genomics and Cellular Proteomics. SMF acknowledges funding by the Breast Cancer Campaign (UK) and the EU FP7 programme. We apologize to those whose work is relevant to this review but who are not cited because of space considerations or our oversight.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.