RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy194
         (525 letters)



>gnl|CDD|212892 cd11959, SH3_Cortactin, Src homology 3 domain of Cortactin.
           Cortactin was originally identified as a substrate of
           Src kinase. It is an actin regulatory protein that binds
           to the Arp2/3 complex and stabilizes branched actin
           filaments. It is involved in cellular processes that
           affect cell motility, adhesion, migration, endocytosis,
           and invasion. It is expressed ubiquitously except in
           hematopoietic cells, where the homolog hematopoietic
           lineage cell-specific 1 (HS1) is expressed instead.
           Cortactin contains an N-terminal acidic domain, several
           copies of a repeat domain found in cortactin and HS1, a
           proline-rich region, and a C-terminal SH3 domain. The
           N-terminal region interacts with the Arp2/3 complex and
           F-actin, and is crucial in regulating branched actin
           assembly. Cortactin also serves as a scaffold and
           provides a bridge to the actin cytoskeleton for membrane
           trafficking and signaling proteins that bind to its SH3
           domain. Binding partners for the SH3 domain of cortactin
           include dynamin2, N-WASp, MIM, FGD1, among others. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score =  120 bits (303), Expect = 2e-33
 Identities = 47/53 (88%), Positives = 49/53 (92%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           TAVALYDYQA+ DDEISFDPDDIITNIEMIDEGWWRG C G+YGLFPANYV L
Sbjct: 1   TAVALYDYQAADDDEISFDPDDIITNIEMIDEGWWRGVCRGKYGLFPANYVEL 53


>gnl|CDD|213006 cd12073, SH3_HS1, Src homology 3 domain of Hematopoietic lineage
           cell-specific protein 1.  HS1, also called HCLS1
           (hematopoietic cell-specific Lyn substrate 1), is a
           cortactin homolog expressed specifically in
           hematopoietic cells. It is an actin regulatory protein
           that binds the Arp2/3 complex and stabilizes branched
           actin filaments. It is required for cell spreading and
           signaling in lymphocytes. It regulates cytoskeletal
           remodeling that controls lymphocyte trafficking, and it
           also affects tissue invasion and infiltration of
           leukemic B cells. Like cortactin, HS1 contains an
           N-terminal acidic domain, several copies of a repeat
           domain found in cortactin and HS1, a proline-rich
           region, and a C-terminal SH3 domain. The N-terminal
           region binds the Arp2/3 complex and F-actin, while the
           C-terminal region acts as an adaptor or scaffold that
           can connect varied proteins that bind the SH3 domain
           within the actin network. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score =  104 bits (261), Expect = 1e-27
 Identities = 39/53 (73%), Positives = 43/53 (81%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSLQ 524
           AVALYDYQ   DDEISFDP + IT+IEM+DEGWW+G CHG  GLFPANYV L 
Sbjct: 3   AVALYDYQGEGDDEISFDPQETITDIEMVDEGWWKGTCHGHRGLFPANYVELL 55


>gnl|CDD|212893 cd11960, SH3_Abp1_eu, Src homology 3 domain of eumetazoan
           Actin-binding protein 1.  Abp1, also called drebrin-like
           protein, is an adaptor protein that functions in
           receptor-mediated endocytosis and vesicle trafficking.
           It contains an N-terminal actin-binding module, the
           actin-depolymerizing factor (ADF) homology domain, a
           helical domain, and a C-terminal SH3 domain. Mammalian
           Abp1, unlike yeast Abp1, does not contain an acidic
           domain that interacts with the Arp2/3 complex. It
           regulates actin dynamics indirectly by interacting with
           dynamin and WASP family proteins. Abp1 deficiency causes
           abnormal organ structure and function of the spleen,
           heart, and lung of mice. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 54

 Score =  101 bits (254), Expect = 1e-26
 Identities = 40/54 (74%), Positives = 42/54 (77%), Gaps = 1/54 (1%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYC-HGQYGLFPANYVSL 523
            A ALYDYQA+ D EISFDP DIIT+IE IDEGWWRG    G YGLFPANYV L
Sbjct: 1   RARALYDYQAADDTEISFDPGDIITDIEQIDEGWWRGTGPDGTYGLFPANYVEL 54


>gnl|CDD|212753 cd11819, SH3_Cortactin_like, Src homology 3 domain of Cortactin and
           related proteins.  This subfamily includes cortactin,
           Abp1 (actin-binding protein 1), hematopoietic lineage
           cell-specific protein 1 (HS1), and similar proteins.
           These proteins are involved in regulating actin dynamics
           through direct or indirect interaction with the Arp2/3
           complex, which is required to initiate actin
           polymerization. They all contain at least one C-terminal
           SH3 domain. Cortactin and HS1 bind Arp2/3 and actin
           through an N-terminal region that contains an acidic
           domain and several copies of a repeat domain found in
           cortactin and HS1. Abp1 binds actin via an N-terminal
           actin-depolymerizing factor (ADF) homology domain. Yeast
           Abp1 binds Arp2/3 directly through two acidic domains.
           Mammalian Abp1 does not directly interact with Arp2/3;
           instead, it regulates actin dynamics indirectly by
           interacting with dynamin and WASP family proteins. The
           C-terminal region of these proteins acts as an adaptor
           or scaffold that can connect membrane trafficking and
           signaling proteins that bind the SH3 domain within the
           actin network. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 54

 Score =  101 bits (254), Expect = 2e-26
 Identities = 37/54 (68%), Positives = 39/54 (72%), Gaps = 1/54 (1%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYC-HGQYGLFPANYVSL 523
            A ALYDYQA+ D+EISF   DIIT IE IDEGWW G    GQ GLFPANYV L
Sbjct: 1   RAKALYDYQAAEDNEISFVEGDIITQIEQIDEGWWLGVNAKGQKGLFPANYVEL 54


>gnl|CDD|214620 smart00326, SH3, Src homology 3 domains.  Src homology 3 (SH3)
           domains bind to target proteins through sequences
           containing proline and hydrophobic amino acids.
           Pro-containing polypeptides may bind to SH3 domains in 2
           different binding orientations.
          Length = 56

 Score = 80.3 bits (199), Expect = 5e-19
 Identities = 29/54 (53%), Positives = 35/54 (64%), Gaps = 1/54 (1%)

Query: 469 GYTAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH-GQYGLFPANYV 521
           G    ALYDY A   DE+SF   DIIT +E  D+GWW+G    G+ GLFP+NYV
Sbjct: 2   GPQVRALYDYTAQDPDELSFKKGDIITVLEKSDDGWWKGRLGRGKEGLFPSNYV 55


>gnl|CDD|212690 cd00174, SH3, Src Homology 3 domain superfamily.  Src Homology 3
           (SH3) domains are protein interaction domains that bind
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. Thus, they
           are referred to as proline-recognition domains (PRDs).
           SH3 domains are less selective and show more diverse
           specificity compared to other PRDs. They have been shown
           to bind peptide sequences that lack the PxxP motif;
           examples include the PxxDY motif of Eps8 and the
           RKxxYxxY sequence in SKAP55. SH3 domain containing
           proteins play versatile and diverse roles in the cell,
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies, among others. Many members of this
           superfamily are adaptor proteins that associate with a
           number of protein partners, facilitating complex
           formation and signal transduction.
          Length = 51

 Score = 79.8 bits (198), Expect = 7e-19
 Identities = 30/51 (58%), Positives = 36/51 (70%), Gaps = 1/51 (1%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH-GQYGLFPANY 520
            A ALYDY+A  DDE+SF   DIIT +E  D+GWW G  + G+ GLFPANY
Sbjct: 1   YARALYDYEAQDDDELSFKKGDIITVLEKDDDGWWEGELNGGREGLFPANY 51


>gnl|CDD|212894 cd11961, SH3_Abp1_fungi_C2, Second C-terminal Src homology 3 domain
           of Fungal Actin-binding protein 1.  Abp1 is an adaptor
           protein that functions in receptor-mediated endocytosis
           and vesicle trafficking. It contains an N-terminal
           actin-binding module, the actin-depolymerizing factor
           (ADF) homology domain, a central proline-rich region,
           and a C-terminal SH3 domain (many yeast Abp1 proteins
           contain two C-terminal SH3 domains). Yeast Abp1 also
           contains two acidic domains that bind directly to the
           Arp2/3 complex, which is required to initiate actin
           polymerization. The SH3 domain of yeast Abp1 binds and
           localizes the kinases, Ark1p and Prk1p, which facilitate
           actin patch disassembly following vesicle
           internalization. It also mediates the localization to
           the actin patch of the synaptojanin-like protein, Sjl2p,
           which plays a key role in endocytosis. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 53

 Score = 76.4 bits (188), Expect = 1e-17
 Identities = 31/53 (58%), Positives = 38/53 (71%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
            A ALYDY A+ D+E+SF  +D I NIE +D+ WW G CHG  GLFP+NYV L
Sbjct: 1   WAKALYDYDAAEDNELSFFENDKIINIEFVDDDWWLGECHGSRGLFPSNYVEL 53


>gnl|CDD|202162 pfam02218, HS1_rep, Repeat in HS1/Cortactin.  The function of this
           repeat is unknown. Seven copies are found in cortactin
           and four copies are found in HS1. The repeats are always
           found amino terminal to an SH3 domain pfam00018.
          Length = 37

 Score = 72.1 bits (177), Expect = 3e-16
 Identities = 32/37 (86%), Positives = 35/37 (94%)

Query: 158 GFGGKFGVQKDRQDKSAVGWDHQEKIEKHESQKDYAK 194
           GFGGKFGVQKDRQDKSAVG+D+Q K EKHESQKDY+K
Sbjct: 1   GFGGKFGVQKDRQDKSAVGFDYQGKTEKHESQKDYSK 37



 Score = 62.0 bits (151), Expect = 8e-13
 Identities = 26/37 (70%), Positives = 29/37 (78%)

Query: 121 GFGGKFGVQNDRVDKSAVTWDHKEVIEKHASQKDYSS 157
           GFGGKFGVQ DR DKSAV +D++   EKH SQKDYS 
Sbjct: 1   GFGGKFGVQKDRQDKSAVGFDYQGKTEKHESQKDYSK 37



 Score = 61.3 bits (149), Expect = 2e-12
 Identities = 24/34 (70%), Positives = 30/34 (88%)

Query: 195 GFGGKFGIESDRQDKSAVGWDHVEKVDKHQSQKD 228
           GFGGKFG++ DRQDKSAVG+D+  K +KH+SQKD
Sbjct: 1   GFGGKFGVQKDRQDKSAVGFDYQGKTEKHESQKD 34



 Score = 58.2 bits (141), Expect = 2e-11
 Identities = 23/35 (65%), Positives = 28/35 (80%)

Query: 84  GYGGKFGVEKDRMDESAVGHDYVAQLHQHESQSDY 118
           G+GGKFGV+KDR D+SAVG DY  +  +HESQ DY
Sbjct: 1   GFGGKFGVQKDRQDKSAVGFDYQGKTEKHESQKDY 35


>gnl|CDD|212723 cd11789, SH3_Nebulin_family_C, C-terminal Src Homology 3 domain of
           the Nebulin family of proteins.  Nebulin family proteins
           contain multiple nebulin repeats, and may contain an
           N-terminal LIM domain and/or a C-terminal SH3 domain.
           They have molecular weights ranging from 34 to 900 kD,
           depending on the number of nebulin repeats, and they all
           bind actin. They are involved in the regulation of actin
           filament architecture and function as stabilizers and
           scaffolds for cytoskeletal structures with which they
           associate, such as long actin filaments or focal
           adhesions. Nebulin family proteins that contain a
           C-terminal SH3 domain include the giant filamentous
           protein nebulin, nebulette, Lasp1, and Lasp2. Lasp2,
           also called LIM-nebulette, is an alternatively spliced
           variant of nebulette. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 68.5 bits (168), Expect = 7e-15
 Identities = 27/51 (52%), Positives = 35/51 (68%), Gaps = 2/51 (3%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH--GQYGLFPANYV 521
            A+YDY A+ DDE+SF   D+I N+E+ID+GW  G     GQ G+ PANYV
Sbjct: 3   RAMYDYAAADDDEVSFQEGDVIINVEIIDDGWMEGTVQRTGQSGMLPANYV 53


>gnl|CDD|212759 cd11825, SH3_PLCgamma, Src homology 3 domain of Phospholipase C
           (PLC) gamma.  PLC catalyzes the hydrolysis of
           phosphatidylinositol (4,5)-bisphosphate [PtdIns(4,5)P2]
           to produce Ins(1,4,5)P3 and diacylglycerol (DAG) in
           response to various receptors. Ins(1,4,5)P3 initiates
           the calcium signaling cascade while DAG functions as an
           activator of PKC. PLCgamma catalyzes this reaction in
           tyrosine kinase-dependent signaling pathways. It is
           activated and recruited to its substrate at the
           membrane. Vertebrates contain two forms of PLCgamma,
           PLCgamma1, which is widely expressed, and PLCgamma2,
           which is primarily found in haematopoietic cells.
           PLCgamma contains a Pleckstrin homology (PH) domain
           followed by an elongation factor (EF) domain, two
           catalytic regions of PLC domains that flank two tandem
           SH2 domains, followed by a SH3 domain and C2 domain. The
           SH3 domain of PLCgamma1 directly interacts with
           dynamin-1 and can serve as a guanine nucleotide exchange
           factor (GEF). It also interacts with Cbl, inhibiting its
           phosphorylation and activity. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 54

 Score = 67.0 bits (164), Expect = 3e-14
 Identities = 29/52 (55%), Positives = 33/52 (63%), Gaps = 1/52 (1%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQY-GLFPANYV 521
           T  ALYDY+A   DE+SF    IITN+E  D GWWRG   G+    FPANYV
Sbjct: 1   TVKALYDYRAQRPDELSFCKHAIITNVEKEDGGWWRGDYGGKKQKWFPANYV 52


>gnl|CDD|212757 cd11823, SH3_Nostrin, Src homology 3 domain of Nitric Oxide
           Synthase TRaffic INducer.  Nostrin is expressed in
           endothelial and epithelial cells and is involved in the
           regulation, trafficking and targeting of endothelial NOS
           (eNOS). It facilitates the endocytosis of eNOS by
           coordinating the functions of dynamin and the
           Wiskott-Aldrich syndrome protein (WASP). Increased
           expression of Nostrin may be correlated to preeclampsia.
           Nostrin contains an N-terminal F-BAR domain and a
           C-terminal SH3 domain. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 53

 Score = 63.9 bits (156), Expect = 3e-13
 Identities = 25/49 (51%), Positives = 32/49 (65%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
            ALY Y A+ +DE+S  P DII   E  D+GWW G  +G+ G+FPA YV
Sbjct: 3   KALYSYTANREDELSLQPGDIIEVHEKQDDGWWLGELNGKKGIFPATYV 51


>gnl|CDD|212739 cd11805, SH3_GRB2_like_C, C-terminal Src homology 3 domain of
           Growth factor receptor-bound protein 2 (GRB2) and
           related proteins.  This family includes the adaptor
           protein GRB2 and related proteins including Drosophila
           melanogaster Downstream of receptor kinase (DRK),
           Caenorhabditis elegans Sex muscle abnormal protein 5
           (Sem-5), GRB2-related adaptor protein (GRAP), GRAP2, and
           similar proteins. Family members contain an N-terminal
           SH3 domain, a central SH2 domain, and a C-terminal SH3
           domain. GRB2/Sem-5/DRK is a critical signaling molecule
           that regulates the Ras pathway by linking tyrosine
           kinases to the Ras guanine nucleotide releasing protein
           Sos (son of sevenless), which converts Ras to the active
           GTP-bound state. GRAP2 plays an important role in T cell
           receptor (TCR) signaling by promoting the formation of
           the SLP-76:LAT complex, which couples the TCR to the Ras
           pathway. GRAP acts as a negative regulator of T cell
           receptor (TCR)-induced lymphocyte proliferation by
           downregulating the signaling to the Ras/ERK pathway. The
           C-terminal SH3 domains (SH3c) of GRB2 and GRAP2 have
           been shown to bind to classical PxxP motif ligands, as
           well as to non-classical motifs. GRB2 SH3c binds Gab2
           (Grb2-associated binder 2) through epitopes containing
           RxxK motifs, while the SH3c of GRAP2 binds to the
           phosphatase-like protein HD-PTP via a RxxxxK motif. SH3
           domains are protein interaction domains that typically
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 63.8 bits (156), Expect = 3e-13
 Identities = 22/49 (44%), Positives = 29/49 (59%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVS 522
           ALYD+      E+ F   DIIT ++  D  WW+G   G+ G+FPANYV 
Sbjct: 4   ALYDFNPQEPGELEFRRGDIITVLDSSDPDWWKGELRGRVGIFPANYVQ 52


>gnl|CDD|212706 cd11772, SH3_OSTF1, Src Homology 3 domain of metazoan osteoclast
           stimulating factor 1.  OSTF1, also named OSF or SH3P2,
           is a signaling protein containing SH3 and ankyrin-repeat
           domains. It acts through a Src-related pathway to
           enhance the formation of osteoclasts and bone
           resorption. It also acts as a negative regulator of cell
           motility. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 63.9 bits (156), Expect = 3e-13
 Identities = 22/48 (45%), Positives = 31/48 (64%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           ALYDY+A   DE+SF+  D++   +  D  WW+  C G+ GL P+NYV
Sbjct: 4   ALYDYEAQHPDELSFEEGDLLYISDKSDPNWWKATCGGKTGLIPSNYV 51


>gnl|CDD|215659 pfam00018, SH3_1, SH3 domain.  SH3 (Src homology 3) domains are
           often indicative of a protein involved in signal
           transduction related to cytoskeletal organisation. First
           described in the Src cytoplasmic tyrosine kinase. The
           structure is a partly opened beta barrel.
          Length = 47

 Score = 62.6 bits (153), Expect = 8e-13
 Identities = 24/47 (51%), Positives = 30/47 (63%), Gaps = 1/47 (2%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH-GQYGLFPA 518
           VALYDY A   DE+SF   DII  +E  D+GWW+G    G+ GL P+
Sbjct: 1   VALYDYTAREPDELSFKKGDIIIVLEKSDDGWWKGRLKGGKEGLIPS 47


>gnl|CDD|212774 cd11840, SH3_Intersectin_5, Fifth Src homology 3 domain (or SH3E)
           of Intersectin.  Intersectins (ITSNs) are adaptor
           proteins that function in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction.
           They are essential for initiating clathrin-coated pit
           formation. They bind to many proteins through their
           multidomain structure and facilitate the assembly of
           multimeric complexes. Vertebrates contain two ITSN
           proteins, ITSN1 and ITSN2, which exist in alternatively
           spliced short and long isoforms. The short isoforms
           contain two Eps15 homology domains (EH1 and EH2), a
           coiled-coil region and five SH3 domains (SH3A-E), while
           the long isoforms, in addition, contain RhoGEF (also
           called Dbl-homologous or DH), Pleckstrin homology (PH)
           and C2 domains. ITSN1 and ITSN2 are both widely
           expressed, with variations depending on tissue type and
           stage of development. The fifth SH3 domain (or SH3E) of
           ITSN1 has been shown to bind many protein partners
           including SGIP1, Sos1, dynamin1/2, CIN85, c-Cbl, SHIP2,
           N-WASP, and synaptojanin-1, among others. SH3 domains
           are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 62.4 bits (152), Expect = 9e-13
 Identities = 25/49 (51%), Positives = 32/49 (65%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           +AL+ Y A  +DE+SF   DII  +   D  WWRG  +GQ GLFP+NYV
Sbjct: 3   IALFPYTAQNEDELSFQKGDIINVLSKDDPDWWRGELNGQTGLFPSNYV 51


>gnl|CDD|212810 cd11877, SH3_PIX, Src Homology 3 domain of Pak Interactive eXchange
           factors.  PIX proteins are Rho guanine nucleotide
           exchange factors (GEFs), which activate small GTPases by
           exchanging bound GDP for free GTP. They act as GEFs for
           both Cdc42 and Rac 1, and have been implicated in cell
           motility, adhesion, neurite outgrowth, and cell
           polarity. Vertebrates contain two proteins from the PIX
           subfamily, alpha-PIX and beta-PIX. Alpha-PIX, also
           called ARHGEF6, is localized in dendritic spines where
           it regulates spine morphogenesis. Mutations in the
           ARHGEF6 gene cause X-linked intellectual disability in
           humans. Beta-PIX play roles in regulating neuroendocrine
           exocytosis, focal adhesion maturation, cell migration,
           synaptic vesicle localization, and insulin secretion.
           PIX proteins contain an N-terminal SH3 domain followed
           by RhoGEF (also called Dbl-homologous or DH) and
           Pleckstrin Homology (PH) domains, and a C-terminal
           leucine-zipper domain for dimerization. The SH3 domain
           of PIX binds to an atypical PxxxPR motif in
           p21-activated kinases (PAKs) with high affinity. The
           binding of PAKs to PIX facilitate the localization of
           PAKs to focal complexes and also localizes PAKs to PIX
           targets Cdc43 and Rac, leading to the activation of
           PAKs. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 62.3 bits (152), Expect = 1e-12
 Identities = 21/48 (43%), Positives = 35/48 (72%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           A ++++ + +DE+SFD  DIIT  ++++ GWW G  +G+ G FP+NYV
Sbjct: 4   AKFNFEGTNEDELSFDKGDIITVTQVVEGGWWEGTLNGKTGWFPSNYV 51


>gnl|CDD|212700 cd11766, SH3_Nck_2, Second Src Homology 3 domain of Nck adaptor
           proteins.  Nck adaptor proteins regulate actin
           cytoskeleton dynamics by linking proline-rich effector
           molecules to protein tyrosine kinases and phosphorylated
           signaling intermediates. They contain three SH3 domains
           and a C-terminal SH2 domain. They function downstream of
           the PDGFbeta receptor and are involved in Rho GTPase
           signaling and actin dynamics. Vertebrates contain two
           Nck adaptor proteins: Nck1 (also called Nckalpha) and
           Nck2 (also called Nckbeta or Growth factor
           receptor-bound protein 4, Grb4), which show partly
           overlapping functions but also bind distinct targets.
           Their SH3 domains are involved in recruiting downstream
           effector molecules, such as the N-WASP/Arp2/3 complex,
           which when activated induces actin polymerization that
           results in the production of pedestals, or protrusions
           of the plasma membrane. The second SH3 domain of Nck
           appears to prefer ligands containing the APxxPxR motif.
           SH3 domains are protein interaction domains that usually
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially a PxxP motif. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 62.3 bits (152), Expect = 1e-12
 Identities = 23/52 (44%), Positives = 34/52 (65%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVS 522
            AV  ++Y+A  +DE+S    D +  +E   +GWWRG C+GQ G FP+NYV+
Sbjct: 1   PAVVKFNYEAQREDELSLRKGDRVLVLEKSSDGWWRGECNGQVGWFPSNYVT 52


>gnl|CDD|212760 cd11826, SH3_Abi, Src homology 3 domain of Abl Interactor proteins.
            Abl interactor (Abi) proteins are adaptor proteins
           serving as binding partners and substrates of Abl
           tyrosine kinases. They are involved in regulating actin
           cytoskeletal reorganization and play important roles in
           membrane-ruffling, endocytosis, cell motility, and cell
           migration. They localize to sites of actin
           polymerization in epithelial adherens junction and
           immune synapses, as well as to the leading edge of
           lamellipodia. Vertebrates contain two Abi proteins, Abi1
           and Abi2. Abi1 displays a wide expression pattern while
           Abi2 is highly expressed in the eye and brain. Abi
           proteins contain a homeobox homology domain, a
           proline-rich region, and a SH3 domain. The SH3 domain of
           Abi binds to a PxxP motif in Abl. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 52

 Score = 61.2 bits (149), Expect = 3e-12
 Identities = 27/49 (55%), Positives = 32/49 (65%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           VALYDY A  DDE+SF   DII   +  D+GW+ G  +G  GLFP NYV
Sbjct: 3   VALYDYTADKDDELSFQEGDIIYVTKKNDDGWYEGVLNGVTGLFPGNYV 51


>gnl|CDD|212754 cd11820, SH3_STAM, Src homology 3 domain of Signal Transducing
           Adaptor Molecules.  STAMs were discovered as proteins
           that are highly phosphorylated following cytokine and
           growth factor stimulation. They function in cytokine
           signaling and surface receptor degradation, as well as
           regulate Golgi morphology. They associate with many
           proteins including Jak2 and Jak3 tyrosine kinases, Hrs,
           AMSH, and UBPY. STAM adaptor proteins contain VHS
           (Vps27, Hrs, STAM homology), ubiquitin interacting
           (UIM), and SH3 domains. There are two vertebrate STAMs,
           STAM1 and STAM2, which may be functionally redundant;
           vertebrate STAMs contain ITAM motifs. They are part of
           the endosomal sorting complex required for transport
           (ESCRT-0). STAM2 deficiency in mice did not cause any
           obvious abnormality, while STAM1 deficiency resulted in
           growth retardation. Loss of both STAM1 and STAM2 in mice
           proved lethal, indicating that STAMs are important for
           embryonic development. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 54

 Score = 61.3 bits (149), Expect = 3e-12
 Identities = 23/49 (46%), Positives = 35/49 (71%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVS 522
           ALYD++A+ D+E++F   +IIT ++  D  WW+G  H   GLFPAN+V+
Sbjct: 5   ALYDFEAAEDNELTFKAGEIITVLDDSDPNWWKGSNHRGEGLFPANFVT 53


>gnl|CDD|212806 cd11873, SH3_CD2AP-like_1, First Src Homology 3 domain (SH3A) of
           CD2-associated protein and similar proteins.  This
           subfamily is composed of the first SH3 domain (SH3A) of
           CD2AP, CIN85 (Cbl-interacting protein of 85 kDa), and
           similar domains. CD2AP and CIN85 are adaptor proteins
           that bind to protein partners and assemble complexes
           that have been implicated in T cell activation, kidney
           function, and apoptosis of neuronal cells. They also
           associate with endocytic proteins, actin cytoskeleton
           components, and other adaptor proteins involved in
           receptor tyrosine kinase (RTK) signaling. CD2AP and the
           main isoform of CIN85 contain three SH3 domains, a
           proline-rich region, and a C-terminal coiled-coil
           domain. All of these domains enable CD2AP and CIN85 to
           bind various protein partners and assemble complexes
           that have been implicated in many different functions.
           SH3A of both proteins bind to an atypical PXXXPR motif
           at the C-terminus of Cbl and the cytoplasmic domain of
           the cell adhesion protein CD2. CIN85 SH3A binds to
           internal proline-rich motifs within the proline-rich
           region; this intramolecular interaction serves as a
           regulatory mechanism to keep CIN85 in a closed
           conformation, preventing the recruitment of other
           proteins. CIN85 SH3A has also been shown to bind
           ubiquitin. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 60.7 bits (148), Expect = 4e-12
 Identities = 21/51 (41%), Positives = 33/51 (64%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
             +  +DY A   DE++    DIITN++ ++EGWW G  +G+ G+FP N+V
Sbjct: 1   EVIVEFDYDAEEPDELTLKVGDIITNVKKMEEGWWEGTLNGKRGMFPDNFV 51


>gnl|CDD|212883 cd11950, SH3_GRAP2_C, C-terminal Src homology 3 domain of
           GRB2-related adaptor protein 2.  GRAP2 is also called
           GADS (GRB2-related adapter downstream of Shc), GrpL,
           GRB2L, Mona, or GRID (Grb2-related protein with insert
           domain). It is expressed specifically in the
           hematopoietic system. It plays an important role in T
           cell receptor (TCR) signaling by promoting the formation
           of the SLP-76:LAT complex, which couples the TCR to the
           Ras pathway. It also has roles in antigen-receptor and
           tyrosine kinase mediated signaling. GRAP2 is unique from
           other GRB2-like adaptor proteins in that it can be
           regulated by caspase cleavage. It contains an N-terminal
           SH3 domain, a central SH2 domain, and a C-terminal SH3
           domain. The C-terminal SH3 domain of GRAP2 binds to
           different motifs found in substrate peptides including
           the typical PxxP motif in hematopoietic progenitor
           kinase 1 (HPK1), the RxxK motif in SLP-76 and HPK1, and
           the RxxxxK motif in phosphatase-like protein HD-PTP. SH3
           domains are protein interaction domains that typically
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 60.6 bits (147), Expect = 4e-12
 Identities = 24/50 (48%), Positives = 34/50 (68%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
             ALYD++A  DDE+ F+  D+I  ++  +  WW+G  HG+ GLFPANYV
Sbjct: 2   VRALYDFEALEDDELGFNSGDVIEVLDSSNPSWWKGRLHGKLGLFPANYV 51


>gnl|CDD|212895 cd11962, SH3_Abp1_fungi_C1, First C-terminal Src homology 3 domain
           of Fungal Actin-binding protein 1.  Abp1 is an adaptor
           protein that functions in receptor-mediated endocytosis
           and vesicle trafficking. It contains an N-terminal
           actin-binding module, the actin-depolymerizing factor
           (ADF) homology domain, a central proline-rich region,
           and a C-terminal SH3 domain (many yeast Abp1 proteins
           contain two C-terminal SH3 domains). Yeast Abp1 also
           contains two acidic domains that bind directly to the
           Arp2/3 complex, which is required to initiate actin
           polymerization. The SH3 domain of yeast Abp1 binds and
           localizes the kinases, Ark1p and Prk1p, which facilitate
           actin patch disassembly following vesicle
           internalization. It also mediates the localization to
           the actin patch of the synaptojanin-like protein, Sjl2p,
           which plays a key role in endocytosis. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 54

 Score = 60.6 bits (147), Expect = 5e-12
 Identities = 29/53 (54%), Positives = 36/53 (67%), Gaps = 1/53 (1%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGY-CHGQYGLFPANYVSL 523
           AV LYDY+   D+EI     +I+TNIEM+DE WW G    G+ GLFP+NYV L
Sbjct: 2   AVVLYDYEKDEDNEIELVEGEIVTNIEMVDEDWWMGTNSKGESGLFPSNYVEL 54


>gnl|CDD|212882 cd11949, SH3_GRB2_C, C-terminal Src homology 3 domain of Growth
           factor receptor-bound protein 2.  GRB2 is a critical
           signaling molecule that regulates the Ras pathway by
           linking tyrosine kinases to the Ras guanine nucleotide
           releasing protein Sos (son of sevenless), which converts
           Ras to the active GTP-bound state. It is ubiquitously
           expressed in all tissues throughout development and is
           important in cell cycle progression, motility,
           morphogenesis, and angiogenesis. In lymphocytes, GRB2 is
           associated with antigen receptor signaling components.
           GRB2 contains an N-terminal SH3 domain, a central SH2
           domain, and a C-terminal SH3 domain. The C-terminal SH3
           domain of GRB2 binds to Gab2 (Grb2-associated binder 2)
           through epitopes containing RxxK motifs, as well as to
           the proline-rich C-terminus of FGRF2. SH3 domains are
           protein interaction domains that typically bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 59.9 bits (145), Expect = 8e-12
 Identities = 22/52 (42%), Positives = 30/52 (57%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVS 522
              AL+D+    D E+ F   D I  ++  D  WW+G CHGQ G+FP NYV+
Sbjct: 1   YVQALFDFDPQEDGELGFRRGDFIEVMDNSDPNWWKGACHGQTGMFPRNYVT 52


>gnl|CDD|212761 cd11827, SH3_MyoIe_If_like, Src homology 3 domain of Myosins Ie,
           If, and similar proteins.  Myosins Ie (MyoIe) and If
           (MyoIf) are nonmuscle, unconventional, long tailed,
           class I myosins containing an N-terminal motor domain
           and a myosin tail with TH1, TH2, and SH3 domains. MyoIe
           interacts with the endocytic proteins, dynamin and
           synaptojanin-1, through its SH3 domain; it may play a
           role in clathrin-dependent endocytosis. In the kidney,
           MyoIe is critical for podocyte function and normal
           glomerular filtration. Mutations in MyoIe is associated
           with focal segmental glomerulosclerosis, a disease
           characterized by massive proteinuria and progression to
           end-stage kidney disease. MyoIf is predominantly
           expressed in the immune system; it plays a role in
           immune cell motility and innate immunity. Mutations in
           MyoIf may be associated with the loss of hearing. The
           MyoIf gene has also been found to be fused to the MLL
           (Mixed lineage leukemia) gene in infant acute myeloid
           leukemias (AML). SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 53

 Score = 59.4 bits (144), Expect = 1e-11
 Identities = 24/48 (50%), Positives = 29/48 (60%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           ALY Y A   DE+SF+  DII  ++    GWW G   G+ GLFP NYV
Sbjct: 4   ALYAYDAQDTDELSFNEGDIIEILKEDPSGWWTGRLRGKEGLFPGNYV 51


>gnl|CDD|212737 cd11803, SH3_Endophilin_A, Src homology 3 domain of Endophilin-A.
           Endophilins play roles in synaptic vesicle formation,
           virus budding, mitochondrial morphology maintenance,
           receptor-mediated endocytosis inhibition, and endosomal
           sorting. They are classified into two types, A and B.
           Vertebrates contain three endophilin-A isoforms (A1, A2,
           and A3). Endophilin-A proteins are enriched in the brain
           and play multiple roles in receptor-mediated
           endocytosis. They tubulate membranes and regulate
           calcium influx into neurons to trigger the activation of
           the endocytic machinery. They are also involved in the
           sorting of plasma membrane proteins, actin filament
           assembly, and the uncoating of clathrin-coated vesicles
           for fusion with endosomes. Endophilins contain an
           N-terminal N-BAR domain (BAR domain with an additional
           N-terminal amphipathic helix), followed by a variable
           region containing proline clusters, and a C-terminal SH3
           domain. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 58.4 bits (142), Expect = 3e-11
 Identities = 23/50 (46%), Positives = 29/50 (58%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
             ALYD++   + E+ F   DIIT    IDE W+ G  +GQ G FP NYV
Sbjct: 3   CRALYDFEPENEGELGFKEGDIITLTNQIDENWYEGMVNGQSGFFPVNYV 52


>gnl|CDD|212866 cd11933, SH3_Nebulin_C, C-terminal Src Homology 3 domain of
           Nebulin.  Nebulin is a giant filamentous protein
           (600-900 kD) that is expressed abundantly in skeletal
           muscle. It binds to actin thin filaments and regulates
           its assembly and function. Nebulin was thought to be
           part of a molecular ruler complex that is critical in
           determining the lengths of actin thin filaments in
           skeletal muscle since its length, which varies due to
           alternative splicing, correlates with the length of thin
           filaments in various muscle types. Recent studies
           indicate that nebulin regulates thin filament length by
           stabilizing the filaments and preventing
           depolymerization. Mutations in nebulin can cause
           nemaline myopathy, characterized by muscle weakness
           which can be severe and can lead to neonatal lethality.
           Nebulin contains an N-terminal LIM domain, many nebulin
           repeats/super repeats, and a C-terminal SH3 domain. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 58

 Score = 57.3 bits (138), Expect = 7e-11
 Identities = 27/55 (49%), Positives = 36/55 (65%), Gaps = 2/55 (3%)

Query: 469 GYTAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH--GQYGLFPANYV 521
           G +  A+YDY+A+ DDE+SF   D I N++ IDEGW  G     G+ G+ PANYV
Sbjct: 1   GKSFRAMYDYRAADDDEVSFKDGDTIVNVQTIDEGWMYGTVQRTGKTGMLPANYV 55


>gnl|CDD|212815 cd11882, SH3_GRAF-like, Src Homology 3 domain of GTPase Regulator
           Associated with Focal adhesion kinase and similar
           proteins.  This subfamily is composed of Rho GTPase
           activating proteins (GAPs) with similarity to GRAF.
           Members contain an N-terminal BAR domain, followed by a
           Pleckstrin homology (PH) domain, a Rho GAP domain, and a
           C-terminal SH3 domain. Although vertebrates harbor four
           Rho GAPs in the GRAF subfamily including GRAF, GRAF2,
           GRAF3, and Oligophrenin-1 (OPHN1), only three are
           included in this model. OPHN1 contains the BAR, PH and
           GAP domains, but not the C-terminal SH3 domain. GRAF and
           GRAF2 show GAP activity towards RhoA and Cdc42. GRAF
           influences Rho-mediated cytoskeletal rearrangements and
           binds focal adhesion kinase. GRAF2 regulates
           caspase-activated p21-activated protein kinase-2. The
           SH3 domain of GRAF and GRAF2 binds PKNbeta, a target of
           the small GTPase Rho. SH3 domains bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs; they play a role in the
           regulation of enzymes by intramolecular interactions,
           changing the subcellular localization of signal pathway
           components and mediate multiprotein complex assemblies.
          Length = 54

 Score = 56.5 bits (137), Expect = 1e-10
 Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 1/54 (1%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDE-GWWRGYCHGQYGLFPANYVSL 523
            A ALY  +A  + E+SF+P  IITN++  DE GW  G  +G+ GL P NYV  
Sbjct: 1   RARALYACKAEDESELSFEPGQIITNVQPSDEPGWLEGTLNGRTGLIPENYVEF 54


>gnl|CDD|212770 cd11836, SH3_Intersectin_1, First Src homology 3 domain (or SH3A)
           of Intersectin.  Intersectins (ITSNs) are adaptor
           proteins that function in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction.
           They are essential for initiating clathrin-coated pit
           formation. They bind to many proteins through their
           multidomain structure and facilitate the assembly of
           multimeric complexes. Vertebrates contain two ITSN
           proteins, ITSN1 and ITSN2, which exist in alternatively
           spliced short and long isoforms. The short isoforms
           contain two Eps15 homology domains (EH1 and EH2), a
           coiled-coil region and five SH3 domains (SH3A-E), while
           the long isoforms, in addition, contain RhoGEF (also
           called Dbl-homologous or DH), Pleckstrin homology (PH)
           and C2 domains. ITSN1 and ITSN2 are both widely
           expressed, with variations depending on tissue type and
           stage of development. The first SH3 domain (or SH3A) of
           ITSN1 has been shown to bind many proteins including
           Sos1, dynamin1/2, CIN85, c-Cbl, PI3K-C2, SHIP2, N-WASP,
           and CdGAP, among others. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 56.2 bits (136), Expect = 2e-10
 Identities = 26/55 (47%), Positives = 30/55 (54%), Gaps = 10/55 (18%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDE------GWWRGYCHGQYGLFPANYVS 522
           ALY ++A   DEISF P DII     +DE      GW  G   G+ G FPANYV 
Sbjct: 4   ALYAFEARNPDEISFQPGDII----QVDESQVAEPGWLAGELKGKTGWFPANYVE 54


>gnl|CDD|212904 cd11971, SH3_Abi1, Src homology 3 domain of Abl Interactor 1.
           Abi1, also called e3B1, is a central regulator of actin
           cytoskeletal reorganization through interactions with
           many protein complexes. It is part of WAVE, a
           nucleation-promoting factor complex, that links Rac 1
           activation to actin polymerization causing lamellipodia
           protrusion at the plasma membrane. Abi1 interact with
           formins to promote protrusions at the leading edge of
           motile cells. It also is a target of alpha4 integrin,
           regulating membrane protrusions at sites of integrin
           engagement. Abi proteins are adaptor proteins serving as
           binding partners and substrates of Abl tyrosine kinases.
           They are involved in regulating actin cytoskeletal
           reorganization and play important roles in
           membrane-ruffling, endocytosis, cell motility, and cell
           migration. Abi proteins contain a homeobox homology
           domain, a proline-rich region, and a SH3 domain. The SH3
           domain of Abi binds to a PxxP motif in Abl. SH3 domains
           are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 59

 Score = 56.2 bits (135), Expect = 2e-10
 Identities = 26/49 (53%), Positives = 32/49 (65%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           VA+YDY    DDE+SF    II  I+  D+GW+ G C+G  GLFP NYV
Sbjct: 3   VAIYDYSKDKDDELSFMEGAIIYVIKKNDDGWYEGVCNGVTGLFPGNYV 51


>gnl|CDD|212749 cd11815, SH3_Eve1_2, Second Src homology 3 domain of ADAM-binding
           protein Eve-1.  Eve-1, also called SH3 domain-containing
           protein 19 (SH3D19) or EEN-binding protein (EBP), exists
           in multiple alternatively spliced isoforms. The longest
           isoform contains five SH3 domain in the C-terminal
           region and seven proline-rich motifs in the N-terminal
           region. It is abundantly expressed in skeletal muscle
           and heart, and may be involved in regulating the
           activity of ADAMs (A disintegrin and metalloproteases).
           Eve-1 interacts with EEN, an endophilin involved in
           endocytosis and may be the target of the MLL-EEN fusion
           protein that is implicated in leukemogenesis. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 52

 Score = 55.3 bits (133), Expect = 4e-10
 Identities = 21/50 (42%), Positives = 32/50 (64%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           AV L+D+ A   D++S +  +I+  +E ID  W+RG C    G+FPAN+V
Sbjct: 2   AVVLHDFPAEHSDDLSLNSGEIVYLLEKIDTEWYRGKCKNTTGIFPANHV 51


>gnl|CDD|212868 cd11935, SH3_Nebulette_C, C-terminal Src Homology 3 domain of
           Nebulette and LIM-nebulette (or Lasp2).  Nebulette is a
           cardiac-specific protein that localizes to the Z-disc.
           It interacts with tropomyosin and is important in
           stabilizing actin thin filaments in cardiac muscles.
           Polymorphisms in the nebulette gene are associated with
           dilated cardiomyopathy, with some mutations resulting in
           severe heart failure. Nebulette is a 107kD protein that
           contains an N-terminal acidic region, multiple nebulin
           repeats, and a C-terminal SH3 domain. LIM-nebulette,
           also called Lasp2 (LIM and SH3 domain protein 2), is an
           alternatively spliced variant of nebulette. Although it
           shares a gene with nebulette, Lasp2 is not transcribed
           from a muscle-specific promoter, giving rise to its
           multiple tissue expression pattern with highest amounts
           in the brain. It can crosslink actin filaments and it
           affects cell spreading. Lasp2 is a 34kD protein
           containing an N-terminal LIM domain, three nebulin
           repeats, and a C-terminal SH3 domain. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 58

 Score = 55.0 bits (132), Expect = 5e-10
 Identities = 24/52 (46%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH--GQYGLFPANYVSL 523
           A+YDY A  +DE+SF   D I N++ IDEGW  G     G+ G+ PANY+  
Sbjct: 5   AMYDYSAQDEDEVSFRDGDYIVNVQPIDEGWMYGTVQRTGRTGMLPANYIEF 56


>gnl|CDD|212928 cd11995, SH3_Intersectin1_5, Fifth Src homology 3 domain (or SH3E)
           of Intersectin-1.  Intersectin-1 (ITSN1) is an adaptor
           protein that functions in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction. It
           plays a role in clathrin-coated pit (CCP) formation. It
           binds to many proteins through its multidomain structure
           and facilitate the assembly of multimeric complexes.
           ITSN1 localizes in membranous organelles, CCPs, the
           Golgi complex, and may be involved in the cell membrane
           trafficking system. It exists in alternatively spliced
           short and long isoforms. The short isoform contains two
           Eps15 homology domains (EH1 and EH2), a coiled-coil
           region and five SH3 domains (SH3A-E), while the long
           isoform, in addition, contains RhoGEF (also called
           Dbl-homologous or DH), Pleckstrin homology (PH) and C2
           domains. The fifth SH3 domain (or SH3E) of ITSN1 has
           been shown to bind many protein partners including
           SGIP1, Sos1, dynamin1/2, CIN85, c-Cbl, SHIP2, N-WASP,
           and synaptojanin-1, among others. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 54

 Score = 55.0 bits (132), Expect = 5e-10
 Identities = 24/51 (47%), Positives = 32/51 (62%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           + +YDY A  DDE++F    II  +   D  WW+G  +GQ GLFP+NYV L
Sbjct: 4   IGMYDYTAQNDDELAFSKGQIINVLNKEDPDWWKGELNGQVGLFPSNYVKL 54


>gnl|CDD|212777 cd11843, SH3_PACSIN, Src homology 3 domain of Protein kinase C and
           Casein kinase Substrate in Neurons (PACSIN) proteins.
           PACSINs, also called Synaptic dynamin-associated
           proteins (Syndapins), act as regulators of cytoskeletal
           and membrane dynamics. They bind both dynamin and
           Wiskott-Aldrich syndrome protein (WASP), and may provide
           direct links between the actin cytoskeletal machinery
           through WASP and dynamin-dependent endocytosis.
           Vetebrates harbor three isoforms with distinct
           expression patterns and specific functions. PACSINs
           contain an N-terminal F-BAR domain and a C-terminal SH3
           domain. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 54.7 bits (132), Expect = 6e-10
 Identities = 26/49 (53%), Positives = 33/49 (67%), Gaps = 1/49 (2%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDE-GWWRGYCHGQYGLFPANYV 521
           ALYDY+    DE+SF   DI+T +E  DE GW +G   G+ GL+PANYV
Sbjct: 4   ALYDYEGQESDELSFKAGDILTKLEEEDEQGWCKGRLDGRVGLYPANYV 52


>gnl|CDD|212746 cd11812, SH3_AHI-1, Src Homology 3 domain of Abelson helper
           integration site-1 (AHI-1).  AHI-1, also called
           Jouberin, is expressed in high levels in the brain,
           gonad tissues, and skeletal muscle. It is an adaptor
           protein that interacts with the small GTPase Rab8a and
           regulates it distribution and function, affecting cilium
           formation and vesicle transport. Mutations in the AHI-1
           gene can cause Joubert syndrome, a disorder
           characterized by brainstem malformations, cerebellar
           aplasia/hypoplasia, and retinal dystrophy. AHI-1
           variation is also associated with susceptibility to
           schizophrenia and type 2 diabetes mellitus progression.
           AHI-1 contains WD40 and SH3 domains. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 52

 Score = 54.4 bits (131), Expect = 6e-10
 Identities = 25/52 (48%), Positives = 32/52 (61%), Gaps = 1/52 (1%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRG-YCHGQYGLFPANYV 521
           T VALYDY A+  DE++    DII  +   ++ WW G   +GQ G FPANYV
Sbjct: 1   TVVALYDYTANRSDELTIHRGDIIRVLYKDNDNWWFGSLVNGQQGYFPANYV 52


>gnl|CDD|212720 cd11786, SH3_SH3RF_1, First Src Homology 3 domain of SH3 domain
           containing ring finger proteins.  This model represents
           the first SH3 domain of SH3RF1 (or POSH), SH3RF2 (or
           POSHER), SH3RF3 (POSH2), and similar domains. Members of
           this family are scaffold proteins that function as E3
           ubiquitin-protein ligases. They all contain an
           N-terminal RING finger domain and multiple SH3 domains;
           SH3RF1 and SH3RF3 have four SH3 domains while SH3RF2 has
           three. SH3RF1 plays a role in calcium homeostasis
           through the control of the ubiquitin domain protein
           Herp. It may also have a role in regulating death
           receptor mediated and JNK mediated apoptosis. SH3RF3
           interacts with p21-activated kinase 2 (PAK2) and
           GTP-loaded Rac1. It may play a role in regulating JNK
           mediated apoptosis in certain conditions. SH3RF2 acts as
           an anti-apoptotic regulator of the JNK pathway by
           binding to and promoting the degradation of SH3RF1. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 54.3 bits (131), Expect = 7e-10
 Identities = 23/50 (46%), Positives = 32/50 (64%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           A ALY+Y+     ++SF   DII   + IDE W+ G C+G+ G FPA+YV
Sbjct: 2   AKALYNYEGKEPGDLSFKKGDIILLRKRIDENWYHGECNGKQGFFPASYV 51


>gnl|CDD|212752 cd11818, SH3_Eve1_5, Fifth Src homology 3 domain of ADAM-binding
           protein Eve-1.  Eve-1, also called SH3 domain-containing
           protein 19 (SH3D19) or EEN-binding protein (EBP), exists
           in multiple alternatively spliced isoforms. The longest
           isoform contains five SH3 domain in the C-terminal
           region and seven proline-rich motifs in the N-terminal
           region. It is abundantly expressed in skeletal muscle
           and heart, and may be involved in regulating the
           activity of ADAMs (A disintegrin and metalloproteases).
           Eve-1 interacts with EEN, an endophilin involved in
           endocytosis and may be the target of the MLL-EEN fusion
           protein that is implicated in leukemogenesis. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 50

 Score = 54.4 bits (131), Expect = 7e-10
 Identities = 24/49 (48%), Positives = 31/49 (63%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANY 520
           A ALYD+    +DE+SF   DIIT +E IDE W  G   G+ G+FP N+
Sbjct: 2   ARALYDFTGENEDELSFKAGDIITELESIDEEWMSGELRGKSGIFPKNF 50


>gnl|CDD|212712 cd11778, SH3_Bzz1_2, Second Src Homology 3 domain of Bzz1 and
           similar domains.  Bzz1 (or Bzz1p) is a
           WASP/Las17-interacting protein involved in endocytosis
           and trafficking to the vacuole. It physically interacts
           with type I myosins and functions in the early steps of
           endocytosis. Together with other proteins, it induces
           membrane scission in yeast. Bzz1 contains an N-terminal
           F-BAR (FES-CIP4 Homology and Bin/Amphiphysin/Rvs), a
           central coiled-coil, and two C-terminal SH3 domains.
           This model represents the second C-terminal SH3 domain.
           SH3 domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 51

 Score = 54.4 bits (131), Expect = 7e-10
 Identities = 24/48 (50%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMID-EGWWRGYCHGQYGLFPANY 520
           ALYDY+A  DDEIS    D I  I   D  GW  G  +G  GLFP +Y
Sbjct: 4   ALYDYEAQGDDEISIRVGDRIAVIRGDDGSGWTYGEINGVKGLFPTSY 51


>gnl|CDD|212762 cd11828, SH3_ARHGEF9_like, Src homology 3 domain of ARHGEF9-like
           Rho guanine nucleotide exchange factors.  Members of
           this family contain a SH3 domain followed by RhoGEF
           (also called Dbl-homologous or DH) and Pleckstrin
           Homology (PH) domains. They include the Rho guanine
           nucleotide exchange factors ARHGEF9, ASEF (also called
           ARHGEF4), ASEF2, and similar proteins. GEFs activate
           small GTPases by exchanging bound GDP for free GTP.
           ARHGEF9 specifically activates Cdc42, while both ASEF
           and ASEF2 can activate Rac1 and Cdc42. ARHGEF9 is highly
           expressed in the brain and it interacts with gephyrin, a
           postsynaptic protein associated with GABA and glycine
           receptors. ASEF plays a role in angiogenesis and cell
           migration. ASEF2 is important in cell migration and
           adhesion dynamics. ASEF exists in an autoinhibited form
           and is activated upon binding of the tumor suppressor
           APC (adenomatous polyposis coli), leading to the
           activation of Rac1 or Cdc42. In its autoinhibited form,
           the SH3 domain of ASEF forms an extensive interface with
           the DH and PH domains, blocking the Rac binding site.
           SH3 domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 53.5 bits (129), Expect = 1e-09
 Identities = 19/53 (35%), Positives = 30/53 (56%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
            A AL+D+     +E+ F   D+I  ++M D+ WW G    + G FPA++V L
Sbjct: 1   LAEALWDHVTMDPEELGFKAGDVIEVLDMSDKDWWWGSIRDEEGWFPASFVRL 53


>gnl|CDD|213018 cd12142, SH3_D21-like, Src Homology 3 domain of SH3
           domain-containing protein 21 (SH3D21) and similar
           proteins.  N-terminal SH3 domain of the uncharacterized
           protein SH3 domain-containing protein 21, and similar
           uncharacterized domains, it belongs to the CD2AP-like_3
           subfamily of proteins. The CD2AP-like_3 subfamily is
           composed of the third SH3 domain (SH3C) of CD2AP, CIN85
           (Cbl-interacting protein of 85 kDa), and similar
           domains. CD2AP and CIN85 are adaptor proteins that bind
           to protein partners and assemble complexes that have
           been implicated in T cell activation, kidney function,
           and apoptosis of neuronal cells. They also associate
           with endocytic proteins, actin cytoskeleton components,
           and other adaptor proteins involved in receptor tyrosine
           kinase (RTK) signaling. CD2AP and the main isoform of
           CIN85 contain three SH3 domains, a proline-rich region,
           and a C-terminal coiled-coil domain. All of these
           domains enable CD2AP and CIN85 to bind various protein
           partners and assemble complexes that have been
           implicated in many different functions. SH3C of both
           proteins have been shown to bind to ubiquitin. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 53.2 bits (128), Expect = 2e-09
 Identities = 22/49 (44%), Positives = 29/49 (59%), Gaps = 2/49 (4%)

Query: 475 LYDYQASADDEISFDPDDIITNI--EMIDEGWWRGYCHGQYGLFPANYV 521
           L+DY   A DE++    D+I  I  E  DEGWW G  +G+ G FP N+V
Sbjct: 5   LFDYNPVAPDELALKKGDVIEVISKETEDEGWWEGELNGRRGFFPDNFV 53


>gnl|CDD|212903 cd11970, SH3_PLCgamma1, Src homology 3 domain of Phospholipase C
           (PLC) gamma 1.  PLCgamma1 is widely expressed and is
           essential in growth and development. It is activated by
           the TrkA receptor tyrosine kinase and functions as a key
           regulator of cell differentiation. It is also the
           predominant PLCgamma in T cells and is required for T
           cell and NK cell function. PLCs catalyze the hydrolysis
           of phosphatidylinositol (4,5)-bisphosphate
           [PtdIns(4,5)P2] to produce Ins(1,4,5)P3 and
           diacylglycerol (DAG). Ins(1,4,5)P3 initiates the calcium
           signaling cascade while DAG functions as an activator of
           PKC. PLCgamma contains a Pleckstrin homology (PH) domain
           followed by an elongation factor (EF) domain, two
           catalytic regions of PLC domains that flank two tandem
           SH2 domains, followed by a SH3 domain and C2 domain. The
           SH3 domain of PLCgamma1 directly interacts with
           dynamin-1 and can serve as a guanine nucleotide exchange
           factor (GEF). It also interacts with Cbl, inhibiting its
           phosphorylation and activity. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 60

 Score = 53.5 bits (128), Expect = 2e-09
 Identities = 24/49 (48%), Positives = 34/49 (69%), Gaps = 1/49 (2%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGL-FPANYV 521
           AL+DY+A  +DE++F  + II N+E  + GWWRG   G+  L FP+NYV
Sbjct: 8   ALFDYKAQREDELTFTKNAIIQNVEKQEGGWWRGDYGGKKQLWFPSNYV 56


>gnl|CDD|212758 cd11824, SH3_PSTPIP1, Src homology 3 domain of
           Proline-Serine-Threonine Phosphatase-Interacting Protein
           1.  PSTPIP1, also called CD2 Binding Protein 1 (CD2BP1),
           is mainly expressed in hematopoietic cells. It is a
           binding partner of the cell surface receptor CD2 and
           PTP-PEST, a tyrosine phosphatase which functions in cell
           motility and Rac1 regulation. It also plays a role in
           the activation of the Wiskott-Aldrich syndrome protein
           (WASP), which couples actin rearrangement and T cell
           activation. Mutations in the gene encoding PSTPIP1 cause
           the autoinflammatory disorder known as PAPA (pyogenic
           sterile arthritis, pyoderma gangrenosum, and acne)
           syndrome. PSTPIP1 contains an N-terminal F-BAR domain,
           PEST motifs, and a C-terminal SH3 domain. SH3 domains
           are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 53.1 bits (128), Expect = 2e-09
 Identities = 21/50 (42%), Positives = 28/50 (56%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
              LYDY A  DDE+S    D++  IE  ++GWW    +GQ GL P  Y+
Sbjct: 2   YSVLYDYTAQEDDELSISKGDVVAVIEKGEDGWWTVERNGQKGLVPGTYL 51


>gnl|CDD|212715 cd11781, SH3_Sorbs_1, First Src Homology 3 domain of Sorbin and SH3
           domain containing (Sorbs) proteins and similar domains. 
           This family, also called the vinexin family, is composed
           predominantly of adaptor proteins containing one sorbin
           homology (SoHo) and three SH3 domains. Members include
           the first SH3 domains of Sorbs1 (or ponsin), Sorbs2 (or
           ArgBP2), Vinexin (or Sorbs3), and similar domains. They
           are involved in the regulation of cytoskeletal
           organization, cell adhesion, and growth factor
           signaling. Members of this family bind multiple partners
           including signaling molecules like c-Abl, c-Arg, Sos,
           and c-Cbl, as well as cytoskeletal molecules such as
           vinculin and afadin. They may have overlapping
           functions. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 53.1 bits (128), Expect = 2e-09
 Identities = 21/51 (41%), Positives = 31/51 (60%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
            A ALY ++A +  E+S    DII     ID+ W+ G  +G+ G+FPA+YV
Sbjct: 1   KARALYPFKAQSAKELSLKKGDIIYIRRQIDKNWYEGEHNGRVGIFPASYV 51


>gnl|CDD|212897 cd11964, SH3_STAM1, Src homology 3 domain of Signal Transducing
           Adaptor Molecule 1.  STAM1 is part of the endosomal
           sorting complex required for transport (ESCRT-0) and is
           involved in sorting ubiquitinated cargo proteins from
           the endosome. It may also be involved in the regulation
           of IL2 and GM-CSF mediated signaling, and has been
           implicated in neural cell survival. STAMs were
           discovered as proteins that are highly phosphorylated
           following cytokine and growth factor stimulation. They
           function in cytokine signaling and surface receptor
           degradation, as well as regulate Golgi morphology. They
           associate with many proteins including Jak2 and Jak3
           tyrosine kinases, Hrs, AMSH, and UBPY. STAM adaptor
           proteins contain VHS (Vps27, Hrs, STAM homology),
           ubiquitin interacting (UIM), and SH3 domains. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 53.0 bits (127), Expect = 2e-09
 Identities = 21/49 (42%), Positives = 34/49 (69%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVS 522
           A+YD++A+ D+E++F   DIIT ++  D  WW+G      GLFP+N+V+
Sbjct: 5   AIYDFEAAEDNELTFKAGDIITILDDSDPNWWKGETPQGTGLFPSNFVT 53


>gnl|CDD|212929 cd11996, SH3_Intersectin2_5, Fifth Src homology 3 domain (or SH3E)
           of Intersectin-2.  Intersectin-2 (ITSN2) is an adaptor
           protein that functions in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction. It
           plays a role in clathrin-coated pit (CCP) formation. It
           binds to many proteins through its multidomain structure
           and facilitate the assembly of multimeric complexes.
           ITSN2 also functions as a specific GEF for Cdc42
           activation in epithelial morphogenesis, and is required
           in mitotic spindle orientation. It exists in
           alternatively spliced short and long isoforms. The short
           isoform contains two Eps15 homology domains (EH1 and
           EH2), a coiled-coil region and five SH3 domains
           (SH3A-E), while the long isoform, in addition, contains
           RhoGEF (also called Dbl-homologous or DH), Pleckstrin
           homology (PH) and C2 domains. The fifth SH3 domain (or
           SH3E) of ITSN2 is expected to bind protein partners,
           similar to ITSN1 which has been shown to bind many
           protein partners including SGIP1, Sos1, dynamin1/2,
           CIN85, c-Cbl, SHIP2, N-WASP, and synaptojanin-1, among
           others. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 54

 Score = 52.7 bits (126), Expect = 3e-09
 Identities = 22/51 (43%), Positives = 33/51 (64%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           +A+YDY A+ +DE+SF    +I  +   D  WW+G  +G  GLFP+NYV +
Sbjct: 4   IAMYDYTANNEDELSFSKGQLINVLNKDDPDWWQGEINGVTGLFPSNYVKM 54


>gnl|CDD|212750 cd11816, SH3_Eve1_3, Third Src homology 3 domain of ADAM-binding
           protein Eve-1.  Eve-1, also called SH3 domain-containing
           protein 19 (SH3D19) or EEN-binding protein (EBP), exists
           in multiple alternatively spliced isoforms. The longest
           isoform contains five SH3 domain in the C-terminal
           region and seven proline-rich motifs in the N-terminal
           region. It is abundantly expressed in skeletal muscle
           and heart, and may be involved in regulating the
           activity of ADAMs (A disintegrin and metalloproteases).
           Eve-1 interacts with EEN, an endophilin involved in
           endocytosis and may be the target of the MLL-EEN fusion
           protein that is implicated in leukemogenesis. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 51

 Score = 52.4 bits (126), Expect = 3e-09
 Identities = 20/50 (40%), Positives = 32/50 (64%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
            VA +D++   +DE+SF   D+IT  E + E W +G  +G+ G+FP N+V
Sbjct: 2   CVARFDFEGEQEDELSFSEGDVITLKEYVGEEWAKGELNGKIGIFPLNFV 51


>gnl|CDD|212808 cd11875, SH3_CD2AP-like_3, Third Src Homology 3 domain (SH3C) of
           CD2-associated protein and similar proteins.  This
           subfamily is composed of the third SH3 domain (SH3C) of
           CD2AP, CIN85 (Cbl-interacting protein of 85 kDa), and
           similar domains. CD2AP and CIN85 are adaptor proteins
           that bind to protein partners and assemble complexes
           that have been implicated in T cell activation, kidney
           function, and apoptosis of neuronal cells. They also
           associate with endocytic proteins, actin cytoskeleton
           components, and other adaptor proteins involved in
           receptor tyrosine kinase (RTK) signaling. CD2AP and the
           main isoform of CIN85 contain three SH3 domains, a
           proline-rich region, and a C-terminal coiled-coil
           domain. All of these domains enable CD2AP and CIN85 to
           bind various protein partners and assemble complexes
           that have been implicated in many different functions.
           SH3C of both proteins have been shown to bind to
           ubiquitin. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 52.4 bits (126), Expect = 4e-09
 Identities = 21/55 (38%), Positives = 35/55 (63%), Gaps = 2/55 (3%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNI--EMIDEGWWRGYCHGQYGLFPANYVSL 523
            A  L+DY+A  +DE++    DI+T +  +  D+GWW+G  +G+ G+FP N+V  
Sbjct: 1   KARVLFDYEAENEDELTLREGDIVTILSKDCEDKGWWKGELNGKRGVFPDNFVEP 55


>gnl|CDD|212714 cd11780, SH3_Sorbs_3, Third (or C-terminal) Src Homology 3 domain
           of Sorbin and SH3 domain containing (Sorbs) proteins and
           similar domains.  This family, also called the vinexin
           family, is composed predominantly of adaptor proteins
           containing one sorbin homology (SoHo) and three SH3
           domains. Members include the third SH3 domains of Sorbs1
           (or ponsin), Sorbs2 (or ArgBP2), Vinexin (or Sorbs3),
           and similar domains. They are involved in the regulation
           of cytoskeletal organization, cell adhesion, and growth
           factor signaling. Members of this family bind multiple
           partners including signaling molecules like c-Abl,
           c-Arg, Sos, and c-Cbl, as well as cytoskeletal molecules
           such as vinculin and afadin. They may have overlapping
           functions. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 52.3 bits (126), Expect = 4e-09
 Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH--GQYGLFPANYV 521
           ALY Y    +DE+     DI+  +E  D+GW+ G     G +G FP NYV
Sbjct: 4   ALYSYTPQNEDELELREGDIVYVMEKCDDGWFVGTSERTGLFGTFPGNYV 53


>gnl|CDD|212985 cd12052, SH3_CIN85_1, First Src Homology 3 domain (SH3A) of
           Cbl-interacting protein of 85 kDa.  CIN85, also called
           SH3 domain-containing kinase-binding protein 1 (SH3KBP1)
           or CD2-binding protein 3 (CD2BP3) or Ruk, is an adaptor
           protein that is involved in the downregulation of
           receptor tyrosine kinases by facilitating endocytosis
           through interaction with endophilin-associated ubiquitin
           ligase Cbl proteins. It is also important in many other
           cellular processes including vesicle-mediated transport,
           cytoskeletal remodelling, apoptosis, cell adhesion and
           migration, and viral infection, among others. CIN85
           exists as multiple variants from alternative splicing;
           the main variant contains three SH3 domains, a
           proline-rich region, and a C-terminal coiled-coil
           domain. All of these domains enable CIN85 to bind
           various protein partners and assemble complexes that
           have been implicated in many different functions. This
           alignment model represents the first SH3 domain (SH3A)
           of CIN85; SH3A binds to internal proline-rich motifs
           within the proline-rich region. This intramolecular
           interaction serves as a regulatory mechanism to keep
           CIN85 in a closed conformation, preventing the
           recruitment of other proteins. SH3A has also been shown
           to bind ubiquitin and to an atypical PXXXPR motif at the
           C-terminus of Cbl and the cytoplasmic end of the cell
           adhesion protein CD2. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 53

 Score = 52.2 bits (125), Expect = 4e-09
 Identities = 23/50 (46%), Positives = 32/50 (64%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           A+  +DY+A  +DE++    DIIT I+  D GWW G   G+ GLFP N+V
Sbjct: 2   AIVEFDYKAQHEDELTITVGDIITKIKKDDGGWWEGEIKGRRGLFPDNFV 51


>gnl|CDD|212867 cd11934, SH3_Lasp1_C, C-terminal Src Homology 3 domain of LIM and
           SH3 domain protein 1.  Lasp1 is a cytoplasmic protein
           that binds focal adhesion proteins and is involved in
           cell signaling, migration, and proliferation. It is
           overexpressed in several cancer cells including breast,
           ovarian, bladder, and liver. In cancer cells, it can be
           found in the nucleus; its degree of nuclear localization
           correlates with tumor size and poor prognosis. Lasp1 is
           a 36kD protein containing an N-terminal LIM domain, two
           nebulin repeats, and a C-terminal SH3 domain. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 59

 Score = 51.9 bits (124), Expect = 6e-09
 Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 2/50 (4%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH--GQYGLFPANYV 521
           A+YDY A+ +DE+SF   D I N++ ID+GW  G     G  G+ PANYV
Sbjct: 7   AVYDYNAADEDEVSFQDGDTIVNVQQIDDGWMYGTVERTGDTGMLPANYV 56


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 57.9 bits (141), Expect = 7e-09
 Identities = 35/104 (33%), Positives = 61/104 (58%), Gaps = 9/104 (8%)

Query: 361 IQNSNKEEEE--KEKQRMVQEEIKR------KEL-ERKEELEKEQIRIKEEQENIKKKKE 411
           ++ + KE E   KE     +EEI +      KEL ER+ EL+K + R+ +++EN+ +K E
Sbjct: 44  LEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLE 103

Query: 412 QEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEEL 455
             EK  +E E+ +K L +KQQE ++ + E +  + EQ + +E +
Sbjct: 104 LLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERI 147



 Score = 49.4 bits (119), Expect = 3e-06
 Identities = 44/185 (23%), Positives = 82/185 (44%), Gaps = 55/185 (29%)

Query: 274 ENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSD 333
           E  AK+  EE++K +E  K++  L+ +   E+  KL      EL+ +  E++K       
Sbjct: 37  EEEAKRILEEAKKEAEAIKKEALLEAK---EEIHKLRNEFEKELRERRNELQKLEK---- 89

Query: 334 VPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELE 393
                                       +   KEE    K     E ++++E    EELE
Sbjct: 90  ----------------------------RLLQKEENLDRK----LELLEKRE----EELE 113

Query: 394 KEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLME 453
           K++  ++++Q+ ++KK+  EE E   +E+ Q+L     +    L AEE A+    E L+E
Sbjct: 114 KKEKELEQKQQELEKKE--EELEELIEEQLQEL-----ERISGLTAEE-AK----EILLE 161

Query: 454 ELRQQ 458
           ++ ++
Sbjct: 162 KVEEE 166



 Score = 39.8 bits (94), Expect = 0.003
 Identities = 27/96 (28%), Positives = 52/96 (54%), Gaps = 7/96 (7%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKL 426
           +E E+E +R+++E  K  E  +KE L    +  KEE   ++ + E+E +ER+   E QKL
Sbjct: 34  KEAEEEAKRILEEAKKEAEAIKKEAL----LEAKEEIHKLRNEFEKELRERR--NELQKL 87

Query: 427 LLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNE 462
             +  Q+E+ L   +   L ++E  +E+  ++   +
Sbjct: 88  EKRLLQKEENLD-RKLELLEKREEELEKKEKELEQK 122



 Score = 36.7 bits (86), Expect = 0.028
 Identities = 45/186 (24%), Positives = 82/186 (44%), Gaps = 69/186 (37%)

Query: 269 LRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKL 328
           L AK E          + R+E EKE R+ ++++     QKLE+R    L  KEE +++KL
Sbjct: 60  LEAKEEIH--------KLRNEFEKELRERRNEL-----QKLEKR----LLQKEENLDRKL 102

Query: 329 NGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELER 388
                                            +   K EEE EK+              
Sbjct: 103 ---------------------------------ELLEKREEELEKKE------------- 116

Query: 389 KEELEKEQIRIKEEQENIKKKKEQEEKERKE-----QEENQKLLLKKQQEEDRLKAEEQA 443
            +ELE++Q  +++++E +++  E++ +E +       EE +++LL+K +EE R +A    
Sbjct: 117 -KELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEARHEAAVLI 175

Query: 444 RLLEQE 449
           + +E+E
Sbjct: 176 KEIEEE 181



 Score = 33.2 bits (77), Expect = 0.37
 Identities = 37/137 (27%), Positives = 59/137 (43%), Gaps = 24/137 (17%)

Query: 268 NLRAKFENLAKQTEE-----ESRKRSEEEKEKRKL----KDQIDLEQAQKLEERRLSELK 318
            LR +FE   ++        E R   +EE   RKL    K + +LE+ +K  E++  EL+
Sbjct: 68  KLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELE 127

Query: 319 IKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQ 378
            KEEE+E+ +          S  T     + +L++            +EE   E   +  
Sbjct: 128 KKEEELEELIEEQLQELERISGLTAEEAKEILLEKV-----------EEEARHEAAVL-- 174

Query: 379 EEIKRKELERKEELEKE 395
             IK  E E KEE +K+
Sbjct: 175 --IKEIEEEAKEEADKK 189


>gnl|CDD|212933 cd12000, SH3_CASS4, Src homology 3 domain of CAS (Crk-Associated
           Substrate) scaffolding protein family member 4.  CASS4,
           also called HEPL (HEF1-EFS-p130Cas-like), localizes to
           focal adhesions and plays a role in regulating FAK
           activity, focal adhesion integrity, and cell spreading.
           It is most abundant in blood cells and lung tissue, and
           is also found in high levels in leukemia and ovarian
           cell lines. CAS proteins function as molecular scaffolds
           to regulate protein complexes that are involved in many
           cellular processes. They share a common domain structure
           that includes an N-terminal SH3 domain, an unstructured
           substrate domain that contains many YxxP motifs, a
           serine-rich four-helix bundle, and a FAT-like C-terminal
           domain. The SH3 domain of CAS proteins binds to diverse
           partners including FAK, FRNK, Pyk2, PTP-PEST, DOCK180,
           among others. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 57

 Score = 51.4 bits (123), Expect = 8e-09
 Identities = 25/55 (45%), Positives = 33/55 (60%), Gaps = 3/55 (5%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMI---DEGWWRGYCHGQYGLFPANYVSL 523
           A ALYD +A   DE++F   DI+T +E      EGWW+   HG+ GL PAN + L
Sbjct: 3   ARALYDNKADCSDELAFRRGDILTVLEQNVPGSEGWWKCLLHGRQGLAPANRLQL 57


>gnl|CDD|212896 cd11963, SH3_STAM2, Src homology 3 domain of Signal Transducing
           Adaptor Molecule 2.  STAM2, also called EAST (Epidermal
           growth factor receptor-associated protein with SH3 and
           TAM domain) or Hbp (Hrs binding protein), is part of the
           endosomal sorting complex required for transport
           (ESCRT-0). It plays a role in sorting mono-ubiquinated
           endosomal cargo for trafficking to the lysosome for
           degradation. It is also involved in the regulation of
           exocytosis. STAMs were discovered as proteins that are
           highly phosphorylated following cytokine and growth
           factor stimulation. They function in cytokine signaling
           and surface receptor degradation, as well as regulate
           Golgi morphology. They associate with many proteins
           including Jak2 and Jak3 tyrosine kinases, Hrs, AMSH, and
           UBPY. STAM adaptor proteins contain VHS (Vps27, Hrs,
           STAM homology), ubiquitin interacting (UIM), and SH3
           domains. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 57

 Score = 51.2 bits (122), Expect = 1e-08
 Identities = 21/49 (42%), Positives = 33/49 (67%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVS 522
           ALYD++A  D+E++F   +II  ++  D  WW+G  H   GLFP+N+V+
Sbjct: 6   ALYDFEAVEDNELTFKHGEIIIVLDDSDANWWKGENHRGVGLFPSNFVT 54


>gnl|CDD|212992 cd12059, SH3_MLK1-3, Src Homology 3 domain of Mixed Lineage Kinases
           1, 2, and 3.  MLKs 1, 2, and 3 are Serine/Threonine
           Kinases (STKs), catalyzing the transfer of the
           gamma-phosphoryl group from ATP to S/T residues on
           protein substrates. MLKs act as mitogen-activated
           protein kinase kinase kinases (MAP3Ks, MKKKs, MAPKKKs),
           which phosphorylate and activate MAPK kinases (MAPKKs or
           MKKs or MAP2Ks), which in turn phosphorylate and
           activate MAPKs during signaling cascades that are
           important in mediating cellular responses to
           extracellular signals. MLKs play roles in immunity and
           inflammation, as well as in cell death, proliferation,
           and cell cycle regulation. Little is known about the
           specific function of MLK1, also called MAP3K9. It is
           capable of activating the c-Jun N-terminal kinase
           pathway. Mice lacking both MLK1 and MLK2 are viable,
           fertile, and have normal life spans. MLK2, also called
           MAP3K10, is abundant in brain, skeletal muscle, and
           testis. It functions upstream of the MAPK, c-Jun
           N-terminal kinase. It binds hippocalcin, a
           calcium-sensor protein that protects neurons against
           calcium-induced cell death. Both MLK2 and hippocalcin
           may be associated with the pathogenesis of Parkinson's
           disease. MLK3, also called MAP3K11, is highly expressed
           in breast cancer cells and its signaling through c-Jun
           N-terminal kinase has been implicated in the migration,
           invasion, and malignancy of cancer cells. It also
           functions as a negative regulator of Inhibitor of
           Nuclear Factor-KappaB Kinase (IKK) and thus, impacts
           inflammation and immunity. MLKs contain an SH3 domain, a
           catalytic kinase domain, a leucine zipper, a
           proline-rich region, and a CRIB domain that mediates
           binding to GTP-bound Cdc42 and Rac. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 58

 Score = 50.9 bits (122), Expect = 1e-08
 Identities = 21/54 (38%), Positives = 34/54 (62%), Gaps = 5/54 (9%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMI-----DEGWWRGYCHGQYGLFPANYVS 522
           A++DY+ASA+DE++    D +  +        DEGWW G  + + G+FP+NYV+
Sbjct: 4   AVFDYEASAEDELTLRRGDRVEVLSKDSAVSGDEGWWTGKINDRVGIFPSNYVT 57


>gnl|CDD|212775 cd11841, SH3_SH3YL1_like, Src homology 3 domain of SH3 domain
           containing Ysc84-like 1 (SH3YL1) protein.  SH3YL1
           localizes to the plasma membrane and is required for
           dorsal ruffle formation. It binds phosphoinositides
           (PIs) with high affinity through its N-terminal SYLF
           domain (also called DUF500). In addition, SH3YL1
           contains a C-terminal SH3 domain which has been reported
           to bind to N-WASP, dynamin 2, and SHIP2 (a PI
           5-phosphatase). SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 54

 Score = 50.9 bits (122), Expect = 1e-08
 Identities = 21/54 (38%), Positives = 28/54 (51%), Gaps = 2/54 (3%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDE--GWWRGYCHGQYGLFPANYVS 522
              ALY ++     ++SF   D IT +   D    WW G   G+ G+FPANYVS
Sbjct: 1   EVTALYSFEGQQPCDLSFQAGDRITVLTRTDSQFDWWEGRLRGRVGIFPANYVS 54


>gnl|CDD|219499 pfam07653, SH3_2, Variant SH3 domain.  SH3 (Src homology 3) domains
           are often indicative of a protein involved in signal
           transduction related to cytoskeletal organisation. First
           described in the Src cytoplasmic tyrosine kinase. The
           structure is a partly opened beta barrel.
          Length = 53

 Score = 50.5 bits (122), Expect = 1e-08
 Identities = 17/51 (33%), Positives = 29/51 (56%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVS 522
              ++DY A+  +E+S    D++  ++  D GWW G   G+ GL P++YV 
Sbjct: 2   GRVIFDYVATDPNELSLKKGDVVKVLDKDDNGWWEGERGGRRGLVPSSYVE 52


>gnl|CDD|212730 cd11796, SH3_DNMBP_N3, Third N-terminal Src homology 3 domain of
           Dynamin Binding Protein, also called Tuba.  DNMBP or
           Tuba is a cdc42-specific guanine nucleotide exchange
           factor (GEF) that contains four N-terminal SH3 domains,
           a central RhoGEF [or Dbl homology (DH)] domain followed
           by a Bin/Amphiphysin/Rvs (BAR) domain, and two
           C-terminal SH3 domains. It provides a functional link
           between dynamin and key regulatory proteins of the actin
           cytoskeleton. It plays an important role in regulating
           cell junction configuration. The four N-terminal SH3
           domains of DNMBP binds the GTPase dynamin, which plays
           an important role in the fission of endocytic vesicles.
           SH3 domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 51

 Score = 50.0 bits (120), Expect = 2e-08
 Identities = 18/50 (36%), Positives = 30/50 (60%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           A  L D  A  D+E+     D++T   ++D+GW+RG  +G+ G+FP  +V
Sbjct: 2   ARVLQDLSAQLDEELDLREGDVVTITGILDKGWFRGELNGRRGIFPEGFV 51


>gnl|CDD|212751 cd11817, SH3_Eve1_4, Fourth Src homology 3 domain of ADAM-binding
           protein Eve-1.  Eve-1, also called SH3 domain-containing
           protein 19 (SH3D19) or EEN-binding protein (EBP), exists
           in multiple alternatively spliced isoforms. The longest
           isoform contains five SH3 domain in the C-terminal
           region and seven proline-rich motifs in the N-terminal
           region. It is abundantly expressed in skeletal muscle
           and heart, and may be involved in regulating the
           activity of ADAMs (A disintegrin and metalloproteases).
           Eve-1 interacts with EEN, an endophilin involved in
           endocytosis and may be the target of the MLL-EEN fusion
           protein that is implicated in leukemogenesis. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 50

 Score = 50.2 bits (120), Expect = 2e-08
 Identities = 19/49 (38%), Positives = 29/49 (59%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANY 520
           AVALYD+    ++++SF   D I   E +D  W RG  +G+ G+FP  +
Sbjct: 2   AVALYDFTGETEEDLSFQRGDRILVTEHLDAEWSRGRLNGREGIFPRAF 50


>gnl|CDD|212905 cd11972, SH3_Abi2, Src homology 3 domain of Abl Interactor 2.  Abi2
           is highly expressed in the brain and eye. It regulates
           actin cytoskeletal reorganization at adherens junctions
           and dendritic spines, which is important in cell
           morphogenesis, migration, and cognitive function. Mice
           deficient with Abi2 show defects in orientation and
           migration of lens fibers, neuronal migration, dendritic
           spine morphology, as well as deficits in learning and
           memory. Abi proteins are adaptor proteins serving as
           binding partners and substrates of Abl tyrosine kinases.
           They are involved in regulating actin cytoskeletal
           reorganization and play important roles in
           membrane-ruffling, endocytosis, cell motility, and cell
           migration. Abi proteins contain a homeobox homology
           domain, a proline-rich region, and a SH3 domain. The SH3
           domain of Abi binds to a PxxP motif in Abl. SH3 domains
           are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 61

 Score = 50.0 bits (119), Expect = 3e-08
 Identities = 24/49 (48%), Positives = 31/49 (63%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           VA+YDY    +DE+SF    II  I+  D+GW+ G  +G  GLFP NYV
Sbjct: 6   VAIYDYTKDKEDELSFQEGAIIYVIKKNDDGWYEGVMNGVTGLFPGNYV 54


>gnl|CDD|212767 cd11833, SH3_Stac_1, First C-terminal Src homology 3 domain of SH3
           and cysteine-rich domain-containing (Stac) proteins.
           Stac proteins are putative adaptor proteins that contain
           a cysteine-rich C1 domain and one or two SH3 domains at
           the C-terminus. There are three mammalian members
           (Stac1, Stac2, and Stac3) of this family. Stac1 and
           Stac3 contain two SH3 domains while Stac2 contains a
           single SH3 domain at the C-terminus. This model
           represents the first C-terminal SH3 domain of Stac1 and
           Stac3, and the single C-terminal SH3 domain of Stac2.
           Stac1 and Stac2 have been found to be expressed
           differently in mature dorsal root ganglia (DRG) neurons.
           Stac1 is mainly expressed in peptidergic neurons while
           Stac2 is found in a subset of nonpeptidergic and all
           trkB+ neurons. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 53

 Score = 49.8 bits (119), Expect = 3e-08
 Identities = 19/51 (37%), Positives = 31/51 (60%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           T VALY ++   ++++   P D IT ++  +E WW+G    + G FPAN+V
Sbjct: 1   TYVALYKFKPQENEDLEMRPGDKITLLDDSNEDWWKGKIEDRVGFFPANFV 51


>gnl|CDD|212807 cd11874, SH3_CD2AP-like_2, Second Src Homology 3 domain (SH3B) of
           CD2-associated protein and similar proteins.  This
           subfamily is composed of the second SH3 domain (SH3B) of
           CD2AP, CIN85 (Cbl-interacting protein of 85 kDa), and
           similar domains. CD2AP and CIN85 are adaptor proteins
           that bind to protein partners and assemble complexes
           that have been implicated in T cell activation, kidney
           function, and apoptosis of neuronal cells. They also
           associate with endocytic proteins, actin cytoskeleton
           components, and other adaptor proteins involved in
           receptor tyrosine kinase (RTK) signaling. CD2AP and the
           main isoform of CIN85 contain three SH3 domains, a
           proline-rich region, and a C-terminal coiled-coil
           domain. All of these domains enable CD2AP and CIN85 to
           bind various protein partners and assemble complexes
           that have been implicated in many different functions.
           SH3B of both proteins have been shown to bind to Cbl. In
           the case of CD2AP, its SH3B binds to Cbl at a site
           distinct from the c-Cbl/SH3A binding site. The CIN85
           SH3B also binds ubiquitin. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 53

 Score = 49.3 bits (118), Expect = 5e-08
 Identities = 17/48 (35%), Positives = 28/48 (58%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
            L+ Y    +DE+     D I  +  ++EGWW G  +G+ G+FP+N+V
Sbjct: 4   VLFSYTPQNEDELELKVGDTIEVLGEVEEGWWEGKLNGKVGVFPSNFV 51


>gnl|CDD|212776 cd11842, SH3_Ysc84p_like, Src homology 3 domain of Ysc84p and
           similar fungal proteins.  This family is composed of the
           Saccharomyces cerevisiae proteins, Ysc84p (also called
           LAS17-binding protein 4, Lsb4p) and Lsb3p, and similar
           fungal proteins. They contain an N-terminal SYLF domain
           (also called DUF500) and a C-terminal SH3 domain. Ysc84p
           localizes to actin patches and plays an important in
           actin polymerization during endocytosis. The N-terminal
           domain of both Ysc84p and Lsb3p can bind and bundle
           actin filaments. A study of the yeast SH3 domain
           interactome predicts that the SH3 domains of Lsb3p and
           Lsb4p may function as molecular hubs for the assembly of
           endocytic complexes. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 49.0 bits (117), Expect = 5e-08
 Identities = 24/55 (43%), Positives = 32/55 (58%), Gaps = 2/55 (3%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMID--EGWWRGYCHGQYGLFPANYVSL 523
            AVALYD+      +++F   DIIT ++  D    WW G   G+ G+FPANYV L
Sbjct: 1   KAVALYDFAGEQPGDLAFQKGDIITILKKSDSQNDWWTGRIGGREGIFPANYVEL 55


>gnl|CDD|212884 cd11951, SH3_GRAP_C, C-terminal Src homology 3 domain of
           GRB2-related adaptor protein.  GRAP is a GRB-2 like
           adaptor protein that is highly expressed in lymphoid
           tissues. It acts as a negative regulator of T cell
           receptor (TCR)-induced lymphocyte proliferation by
           downregulating the signaling to the Ras/ERK pathway. It
           has been identified as a regulator of TGFbeta signaling
           in diabetic kidney tubules and may have a role in the
           pathogenesis of the disease. GRAP contains an N-terminal
           SH3 domain, a central SH2 domain, and a C-terminal SH3
           domain. The C-terminal SH3 domains (SH3c) of the related
           proteins, GRB2 and GRAP2, have been shown to bind to
           classical PxxP motif ligands, as well as to
           non-classical motifs. GRB2 SH3c binds Gab2
           (Grb2-associated binder 2) through epitopes containing
           RxxK motifs, while the SH3c of GRAP2 binds to the
           phosphatase-like protein HD-PTP via a RxxxxK motif. SH3
           domains are protein interaction domains that typically
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 48.6 bits (116), Expect = 8e-08
 Identities = 21/48 (43%), Positives = 27/48 (56%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           A YD+ A    ++SF   DII  ++  D  WWRG   G+ G FP NYV
Sbjct: 4   AQYDFSAEDPSQLSFRRGDIIEVLDCPDPNWWRGRISGRVGFFPRNYV 51


>gnl|CDD|212990 cd12057, SH3_CIN85_3, Third Src Homology 3 domain (SH3C) of
           Cbl-interacting protein of 85 kDa.  CIN85, also called
           SH3 domain-containing kinase-binding protein 1 (SH3KBP1)
           or CD2-binding protein 3 (CD2BP3) or Ruk, is an adaptor
           protein that is involved in the downregulation of
           receptor tyrosine kinases by facilitating endocytosis
           through interaction with endophilin-associated ubiquitin
           ligase Cbl proteins. It is also important in many other
           cellular processes including vesicle-mediated transport,
           cytoskeletal remodelling, apoptosis, cell adhesion and
           migration, and viral infection, among others. CIN85
           exists as multiple variants from alternative splicing;
           the main variant contains three SH3 domains, a
           proline-rich region, and a C-terminal coiled-coil
           domain. All of these domains enable CIN85 to bind
           various protein partners and assemble complexes that
           have been implicated in many different functions. This
           alignment model represents the third SH3 domain (SH3C)
           of CIN85. SH3C has been shown to bind ubiquitin. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 56

 Score = 48.4 bits (115), Expect = 9e-08
 Identities = 22/51 (43%), Positives = 33/51 (64%), Gaps = 2/51 (3%)

Query: 475 LYDYQASADDEISFDPDDIITNI--EMIDEGWWRGYCHGQYGLFPANYVSL 523
           L+ Y+A  +DE++    DI+T I  + ID GWW G  +G+ G+FP N+V L
Sbjct: 5   LFPYEAQNEDELTIKEGDIVTLISKDCIDAGWWEGELNGRRGVFPDNFVKL 55


>gnl|CDD|212902 cd11969, SH3_PLCgamma2, Src homology 3 domain of Phospholipase C
           (PLC) gamma 2.  PLCgamma2 is primarily expressed in
           haematopoietic cells, specifically in B cells. It is
           activated by tyrosine phosphorylation by B cell receptor
           (BCR) kinases and is recruited to the plasma membrane
           where its substrate is located. It is required in
           pre-BCR signaling and in the maturation of B cells. PLCs
           catalyze the hydrolysis of phosphatidylinositol
           (4,5)-bisphosphate [PtdIns(4,5)P2] to produce
           Ins(1,4,5)P3 and diacylglycerol (DAG). Ins(1,4,5)P3
           initiates the calcium signaling cascade while DAG
           functions as an activator of PKC. PLCgamma contains a
           Pleckstrin homology (PH) domain followed by an
           elongation factor (EF) domain, two catalytic regions of
           PLC domains that flank two tandem SH2 domains, followed
           by a SH3 domain and C2 domain. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 48.3 bits (115), Expect = 1e-07
 Identities = 24/56 (42%), Positives = 30/56 (53%), Gaps = 9/56 (16%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYG-----LFPANYV 521
           T  ALYDY+A   DE+SF    +I N+     GWW+    G YG      FP+NYV
Sbjct: 1   TVKALYDYRAKRSDELSFCKGALIHNVSKETGGWWK----GDYGGKVQHYFPSNYV 52


>gnl|CDD|212696 cd11762, SH3_FCHSD_2, Second Src Homology 3 domain of FCH and
           double SH3 domains proteins.  This group is composed of
           FCH and double SH3 domains protein 1 (FCHSD1) and
           FCHSD2. These proteins have a common domain structure
           consisting of an N-terminal F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs), two SH3, and C-terminal
           proline-rich domains. They have only been characterized
           in silico and their functions remain unknown. This group
           also includes the insect protein, nervous wreck, which
           acts as a regulator of synaptic growth signaling. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 57

 Score = 48.2 bits (115), Expect = 1e-07
 Identities = 21/49 (42%), Positives = 32/49 (65%), Gaps = 4/49 (8%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEM----IDEGWWRGYCHGQYGLFPA 518
           ALYDY+A +D+E+SF    II  +      +D+GWW G  +G+ G+FP+
Sbjct: 4   ALYDYEAQSDEELSFPEGAIIRILRKDDNGVDDGWWEGEFNGRVGVFPS 52


>gnl|CDD|212984 cd12051, SH3_DOCK1_5_A, Src Homology 3 domain of Class A Dedicator
           of Cytokinesis proteins 1 and 5.  Dock1, also called
           Dock180, and Dock5 are class A DOCKs and are atypical
           guanine nucleotide exchange factors (GEFs) that lack the
           conventional Dbl homology (DH) domain. Dock1 interacts
           with the scaffold protein Elmo and the resulting complex
           functions upstream of Rac in many biological events
           including phagocytosis of apoptotic cells, cell
           migration and invasion. Dock5 functions upstream of Rac1
           to regulate osteoclast function. All DOCKs contain two
           homology domains: the DHR-1 (Dock homology region-1),
           also called CZH1 (CED-5, Dock180, and MBC-zizimin
           homology 1), and DHR-2 (also called CZH2 or Docker). The
           DHR-1 domain binds
           phosphatidylinositol-3,4,5-triphosphate while DHR-2
           contains the catalytic activity for Rac and/or Cdc42.
           Class A DOCKs also contain an SH3 domain at the
           N-terminal region and a PxxP motif at the C-terminus;
           they are specific GEFs for Rac. The SH3 domain of Dock1
           binds to DHR-2 in an autoinhibitory manner; binding of
           Elmo to the SH3 domain of Dock1 exposes the DHR-2 domain
           and promotes GEF activity. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 56

 Score = 47.9 bits (114), Expect = 1e-07
 Identities = 22/56 (39%), Positives = 33/56 (58%), Gaps = 4/56 (7%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYC---HGQYGLFPANYVSLQ 524
            VA+Y+Y A   DE+S    D +  +E   EGW+RGY      + G+FPA+Y+ L+
Sbjct: 2   GVAIYNYDARGPDELSLQIGDTVHILETY-EGWYRGYTLRKKSKKGIFPASYIHLK 56


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 54.4 bits (130), Expect = 1e-07
 Identities = 42/194 (21%), Positives = 89/194 (45%), Gaps = 5/194 (2%)

Query: 272  KFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGH 331
            K +   K  E +    +++ +EK+K  +    E+ +K EE++ +E   K EE   K    
Sbjct: 1523 KADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEE--DKNMAL 1580

Query: 332  SDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEE 391
                 +   E   +     L +        +    EE + + + + + E ++K++E+ ++
Sbjct: 1581 RKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKK 1640

Query: 392  LEKEQIRIKEE---QENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQ 448
             E E+ +  EE    E   K K  EE ++ E+++ +    KK +E+++  AE   +  E+
Sbjct: 1641 KEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEE 1700

Query: 449  ERLMEELRQQGTNE 462
             +  EEL+++   E
Sbjct: 1701 AKKAEELKKKEAEE 1714



 Score = 52.8 bits (126), Expect = 4e-07
 Identities = 43/198 (21%), Positives = 92/198 (46%), Gaps = 1/198 (0%)

Query: 265  KPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEI 324
            K +  + K +   K  E +    +++ +E +K  +    E+ +K +E + +E   K EE 
Sbjct: 1504 KAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEK 1563

Query: 325  EKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRK 384
            +K                     K   +    + + +    K+ + +E ++  + +IK +
Sbjct: 1564 KKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAE 1623

Query: 385  ELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQAR 444
            EL++ EE +K+  ++K+++   KKK E+ +K  +E+ + +     K+ EED+ KAEE  +
Sbjct: 1624 ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA-EEENKIKAAEEAKKAEEDKKKAEEAKK 1682

Query: 445  LLEQERLMEELRQQGTNE 462
              E E+   E  ++   E
Sbjct: 1683 AEEDEKKAAEALKKEAEE 1700



 Score = 51.7 bits (123), Expect = 9e-07
 Identities = 62/237 (26%), Positives = 114/237 (48%), Gaps = 17/237 (7%)

Query: 270  RAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLN 329
             A+ E + K  EEE + ++EE K+  + K + +  +  + E++++ +LK KE E +KK  
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE 1650

Query: 330  GHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERK 389
                  L  + E   +      K+   D    + + K EE+++K     E +K++  E K
Sbjct: 1651 -----ELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA---AEALKKEAEEAK 1702

Query: 390  EELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQE 449
            +    E+++ KE +E   KKK +E K+ +E+ + +    KK+ EED+ KAEE  +  E++
Sbjct: 1703 K---AEELKKKEAEE---KKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEK 1756

Query: 450  RLMEELRQQGTNEDTEEDLGYTAV---ALYDYQASADDEISFDPDDIITNIEMIDEG 503
            + +  L+++   +  E      AV    L +       E+     DI  N   I EG
Sbjct: 1757 KKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEG 1813



 Score = 49.8 bits (118), Expect = 3e-06
 Identities = 44/207 (21%), Positives = 87/207 (42%), Gaps = 8/207 (3%)

Query: 265  KPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDL-EQAQKLEERRLSELKIKEEE 323
            K    + K +   K+ EE  +     + E     D+ +  E+  +  E++  E K K + 
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382

Query: 324  IEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKR 383
             +KK         +             LK+  +       + K+ EEK+K     E  K+
Sbjct: 1383 AKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK---ADEAKKK 1439

Query: 384  KELERKEELEKEQIRIKEEQENIKKKKEQ----EEKERKEQEENQKLLLKKQQEEDRLKA 439
             E  +K +  K++    ++ E  KKK E+    +E ++K +E  +    KK+ EE + KA
Sbjct: 1440 AEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKA 1499

Query: 440  EEQARLLEQERLMEELRQQGTNEDTEE 466
            +E  +  E ++  +E ++    +  +E
Sbjct: 1500 DEAKKAAEAKKKADEAKKAEEAKKADE 1526



 Score = 47.8 bits (113), Expect = 1e-05
 Identities = 51/206 (24%), Positives = 101/206 (49%), Gaps = 16/206 (7%)

Query: 277  AKQTEEESRKRSEE----EKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHS 332
            AK+  EE++K+++E     + K+K  +    E+A+K +E + +E   K +E +K      
Sbjct: 1488 AKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKK 1547

Query: 333  DVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEEL 392
               L  + E      K   ++          + ++ EE +K    + E   K  E ++++
Sbjct: 1548 ADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKM 1607

Query: 393  EKEQIRIKEEQENIKK---KKEQEEKERKEQ-------EENQKLLLKKQQEEDRLKAEEQ 442
            + E+ + K E+  IK    KK +EEK++ EQ       E+ +   LKK +EE+++KA E+
Sbjct: 1608 KAEEAK-KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666

Query: 443  ARLLEQE-RLMEELRQQGTNEDTEED 467
            A+  E++ +  EE ++   +E    +
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAE 1692



 Score = 47.8 bits (113), Expect = 1e-05
 Identities = 48/195 (24%), Positives = 95/195 (48%), Gaps = 14/195 (7%)

Query: 278  KQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSEL--------KIKEEEIEKKLN 329
            K+ EE+ +    ++ E+ K       E+A+K EE R+ E+        K+K EE +K   
Sbjct: 1558 KKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEE 1617

Query: 330  GHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEE-----KEKQRMVQEEIKRK 384
                       E     V+ + K+   +    +   K EEE      E+ +  +E+ K+ 
Sbjct: 1618 AKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKA 1677

Query: 385  ELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQAR 444
            E  +K E ++++     ++E  + KK +E K+++ +E+ +   LKK +EE+++KAEE  +
Sbjct: 1678 EEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKK 1737

Query: 445  LLEQE-RLMEELRQQ 458
              E++ +  EE ++ 
Sbjct: 1738 EAEEDKKKAEEAKKD 1752



 Score = 47.4 bits (112), Expect = 2e-05
 Identities = 36/192 (18%), Positives = 82/192 (42%), Gaps = 2/192 (1%)

Query: 277  AKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPL 336
            A+      R+ + + +E RK  +    E+ +K +E + +E K K +E +KK         
Sbjct: 1260 ARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADE 1319

Query: 337  SPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQ 396
            +              K+   +      + K E E         E K +  E+K+E  K+ 
Sbjct: 1320 AKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKK- 1378

Query: 397  IRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELR 456
             +    ++  ++KK+ +E ++K +E+ +K    K+    + KA+E  +  E+++  +E +
Sbjct: 1379 -KADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAK 1437

Query: 457  QQGTNEDTEEDL 468
            ++       ++ 
Sbjct: 1438 KKAEEAKKADEA 1449



 Score = 46.3 bits (109), Expect = 4e-05
 Identities = 44/200 (22%), Positives = 88/200 (44%), Gaps = 8/200 (4%)

Query: 265  KPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEI 324
            K    + K E L K+  EE +K  E +K + + K +   E  +  E+++ +E   K EE 
Sbjct: 1627 KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED 1686

Query: 325  EKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRK 384
            EKK                    + + K+   +    +   K EEE + +    EE K++
Sbjct: 1687 EKK-----AAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIK---AEEAKKE 1738

Query: 385  ELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQAR 444
              E K++ E+ +   +E+++    KKE+E+K  + ++E + ++ ++  EED  +  E  +
Sbjct: 1739 AEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDK 1798

Query: 445  LLEQERLMEELRQQGTNEDT 464
             ++          +G  E  
Sbjct: 1799 KIKDIFDNFANIIEGGKEGN 1818



 Score = 45.5 bits (107), Expect = 7e-05
 Identities = 58/253 (22%), Positives = 116/253 (45%), Gaps = 23/253 (9%)

Query: 218  EKVDKHQSQKDDNRQVVTSSKVKDLIAANSTNASKENIKPKPNIGHVKPSNLRAKFENLA 277
            +K D  + + ++ ++   ++K +   AA+   A++E    K      K    + K +   
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEE----KAEAAEKKKEEAKKKADAAK 1384

Query: 278  KQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLS 337
            K+ EE+ +K  E +K+  + K + D  +     +++  E K K EE +K          +
Sbjct: 1385 KKAEEK-KKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEA 1443

Query: 338  PSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERK--EELEKE 395
               +          K+   +    + + K+ EE +K     +E K+K  E K  +E +K+
Sbjct: 1444 KKADE--------AKKKAEEAKKAEEAKKKAEEAKKA----DEAKKKAEEAKKADEAKKK 1491

Query: 396  QIRIKEEQENIKKKKEQEEK--ERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLME 453
                K++ +  KK  E ++K  E K+ EE +K    K+ EE + KA+E A+  E+++  +
Sbjct: 1492 AEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAK-KADE-AKKAEEKKKAD 1549

Query: 454  ELRQQGTNEDTEE 466
            EL++    +  EE
Sbjct: 1550 ELKKAEELKKAEE 1562



 Score = 43.6 bits (102), Expect = 3e-04
 Identities = 38/208 (18%), Positives = 95/208 (45%), Gaps = 4/208 (1%)

Query: 265  KPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEI 324
            + + ++A+    A + ++   K+  +E +K + K + D  + +  E ++  E K K EE 
Sbjct: 1268 RQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEA 1327

Query: 325  EKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRK 384
            +KK +                  ++      ++    +    E++++E ++      K+ 
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKA 1387

Query: 385  ELERK-EELEKEQIRIKEEQENIKKKKEQEEKE---RKEQEENQKLLLKKQQEEDRLKAE 440
            E ++K +E +K+    K++ + +KK    ++K    +K+ EE +K    K++ E+  KA+
Sbjct: 1388 EEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKAD 1447

Query: 441  EQARLLEQERLMEELRQQGTNEDTEEDL 468
            E  +  E+ +  EE +++       ++ 
Sbjct: 1448 EAKKKAEEAKKAEEAKKKAEEAKKADEA 1475



 Score = 42.8 bits (100), Expect = 5e-04
 Identities = 39/194 (20%), Positives = 84/194 (43%), Gaps = 8/194 (4%)

Query: 282  EESRKRSEE---EKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSP 338
            EE+RK  +    E  K+  + + D E+A+K EE R +E   K EE            +  
Sbjct: 1215 EEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKA 1274

Query: 339  STETPPVPVKSILKQPTSDGIPIQNSNKEEEE---KEKQRMVQEEIKRK--ELERKEELE 393
                    +K   ++  +D        K+ +E   K ++    +E K+K  E ++K +  
Sbjct: 1275 EEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAA 1334

Query: 394  KEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLME 453
            K++    ++     K + +   +  E  E +    +K++EE + KA+   +  E+++  +
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKAD 1394

Query: 454  ELRQQGTNEDTEED 467
            E +++   +  + D
Sbjct: 1395 EAKKKAEEDKKKAD 1408



 Score = 42.1 bits (98), Expect = 9e-04
 Identities = 47/210 (22%), Positives = 95/210 (45%), Gaps = 27/210 (12%)

Query: 265  KPSNLRAKFENLAKQTEE-----ESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKI 319
            K +    K E   K+ EE     E++K++EE K+  + K + +  + +  E ++ +E K 
Sbjct: 1451 KKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKK 1510

Query: 320  KEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQE 379
            K +E +K            + E                    +    EE++K  +    E
Sbjct: 1511 KADEAKKAEEAKKADEAKKAEEAKKAD---------------EAKKAEEKKKADELKKAE 1555

Query: 380  EIKRKELERK-------EELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQ 432
            E+K+ E ++K       EE +   +R  EE +  ++ + +E  +  E+E+  K    K+ 
Sbjct: 1556 ELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKA 1615

Query: 433  EEDRLKAEEQARLLEQERLMEELRQQGTNE 462
            EE ++KAEE  +  E+++ +E+L+++   E
Sbjct: 1616 EEAKIKAEELKKAEEEKKKVEQLKKKEAEE 1645



 Score = 40.9 bits (95), Expect = 0.002
 Identities = 57/294 (19%), Positives = 127/294 (43%), Gaps = 22/294 (7%)

Query: 180  QEKIEKHESQKDYAKGFGGKFGIESDRQDKSAVGWDHVEKVDKHQSQKDDNRQVVTSSKV 239
            +   ++ E+ ++ A+    K      + D +    +  +K D+ + + +++++     K 
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK 1412

Query: 240  KDLIAANSTNASKENIKPKPNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKD 299
                   +  A K+  + K      K +    K +   K+ EE   K++EE K+K +   
Sbjct: 1413 AAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE--AKKAEEAKKKAEEAK 1470

Query: 300  QIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGI 359
            + D  + +  E ++  E K K EE +KK +                   +  K+   +  
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKAD--------------EAKKAAEAKKKADEAK 1516

Query: 360  PIQNSNKEEEEK--EKQRMVQEEIKRKELERKEELEK-EQIRIKEEQENIKKKKEQEEKE 416
              + + K +E K  E+ +   E  K +E ++ +EL+K E+++  EE++  ++ K+ EE +
Sbjct: 1517 KAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDK 1576

Query: 417  R---KEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEED 467
                ++ EE +K    + +E  +L  EE+    E+ +  EE + +       E+
Sbjct: 1577 NMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEE 1630



 Score = 37.4 bits (86), Expect = 0.021
 Identities = 41/199 (20%), Positives = 81/199 (40%), Gaps = 18/199 (9%)

Query: 253  ENIKPKPNIGHVKPSNLRAKFENLAKQTEE----ESRKRSEEEKEKRKLKDQIDLEQAQK 308
            E +K       +K +    K E   K+ EE    E  ++   E  K++ ++    E+ +K
Sbjct: 1650 EELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709

Query: 309  LEE---RRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSN 365
             E    ++  ELK  EEE + K             +          K+  +     +   
Sbjct: 1710 KEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKK 1769

Query: 366  KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKK-----------KKEQEE 414
             EE  KEK+ +++EE+  ++ +R+ E++K+   I +   NI +            KE E+
Sbjct: 1770 AEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMED 1829

Query: 415  KERKEQEENQKLLLKKQQE 433
               KE  +++ + L++   
Sbjct: 1830 SAIKEVADSKNMQLEEADA 1848



 Score = 32.8 bits (74), Expect = 0.51
 Identities = 48/188 (25%), Positives = 84/188 (44%), Gaps = 15/188 (7%)

Query: 271  AKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNG 330
            AK +N A +  EE+  ++EE K+    K +   E  +  E ++ +E   K EE  K    
Sbjct: 1083 AKEDNRADEATEEAFGKAEEAKKTETGKAE---EARKAEEAKKKAEDARKAEEARKA--- 1136

Query: 331  HSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKE 390
              D   +          +  + +   D    + + K E+ K+     +   K +E+ + E
Sbjct: 1137 -EDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKK----AEAARKAEEVRKAE 1191

Query: 391  ELEK-EQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQE 449
            EL K E  R  E     +++++ EE  + E    +K    K+ EE +  AEE A+  E+E
Sbjct: 1192 ELRKAEDARKAEAARKAEEERKAEEARKAEDA--KKAEAVKKAEEAKKDAEE-AKKAEEE 1248

Query: 450  RLMEELRQ 457
            R  EE+R+
Sbjct: 1249 RNNEEIRK 1256



 Score = 29.7 bits (66), Expect = 4.5
 Identities = 51/190 (26%), Positives = 88/190 (46%), Gaps = 15/190 (7%)

Query: 282  EESRKRSEEEKEK--RKLKDQIDLEQAQKLEERRLSELKIKEEEIEK--KLNGHSDVPLS 337
            EE+RK  + ++ +  RK +D    E+A+K E+ + +E   K EE+ K  +L    D   +
Sbjct: 1143 EEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKA 1202

Query: 338  PSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQ--------EEIKRKELERK 389
             +        K+   +   D    +   K EE K+     +        EEI++ E  R 
Sbjct: 1203 EAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARM 1262

Query: 390  EELEKEQIRIK-EEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQ 448
                + Q  IK EE     + K+ EEK  K+ +E +K   KK+ +E + KAEE  +  E 
Sbjct: 1263 AHFARRQAAIKAEEARKADELKKAEEK--KKADEAKKAEEKKKADEAKKKAEEAKKADEA 1320

Query: 449  ERLMEELRQQ 458
            ++  EE +++
Sbjct: 1321 KKKAEEAKKK 1330


>gnl|CDD|212991 cd12058, SH3_MLK4, Src Homology 3 domain of Mixed Lineage Kinase 4.
            MLK4 is a Serine/Threonine Kinase (STK), catalyzing the
           transfer of the gamma-phosphoryl group from ATP to S/T
           residues on protein substrates. MLKs act as
           mitogen-activated protein kinase kinase kinases (MAP3Ks,
           MKKKs, MAPKKKs), which phosphorylate and activate MAPK
           kinases (MAPKKs or MKKs or MAP2Ks), which in turn
           phosphorylate and activate MAPKs during signaling
           cascades that are important in mediating cellular
           responses to extracellular signals. MLKs play roles in
           immunity and inflammation, as well as in cell death,
           proliferation, and cell cycle regulation. The specific
           function of MLK4 is yet to be determined. Mutations in
           the kinase domain of MLK4 have been detected in
           colorectal cancers. MLK4 contains an SH3 domain, a
           catalytic kinase domain, a leucine zipper, a
           proline-rich region, and a CRIB domain that mediates
           binding to GTP-bound Cdc42 and Rac. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 58

 Score = 48.0 bits (114), Expect = 1e-07
 Identities = 23/54 (42%), Positives = 33/54 (61%), Gaps = 5/54 (9%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMI-----DEGWWRGYCHGQYGLFPANYVS 522
           ALYDY+AS +DE+S    D++  +        D+GWW G    + G+FPANYV+
Sbjct: 4   ALYDYEASGEDELSLRRGDVVEVLSQDAAVSGDDGWWAGKIRHRLGIFPANYVT 57


>gnl|CDD|212835 cd11902, SH3_Nck2_2, Second Src Homology 3 domain of Nck2 adaptor
           protein.  Nck2 (also called Nckbeta or Growth factor
           receptor-bound protein 4, Grb4) plays a crucial role in
           connecting signaling pathways of tyrosine kinase
           receptors and important effectors in actin dynamics and
           cytoskeletal remodeling. It binds neuronal signaling
           proteins such as ephrinB and Disabled-1 (Dab-1)
           exclusively. Nck adaptor proteins regulate actin
           cytoskeleton dynamics by linking proline-rich effector
           molecules to protein tyrosine kinases and phosphorylated
           signaling intermediates. They contain three SH3 domains
           and a C-terminal SH2 domain. They function downstream of
           the PDGFbeta receptor and are involved in Rho GTPase
           signaling and actin dynamics. Vertebrates contain two
           Nck adaptor proteins: Nck1 (also called Nckalpha) and
           Nck2, which show partly overlapping functions but also
           bind distinct targets. The second SH3 domain of Nck
           appears to prefer ligands containing the APxxPxR motif.
           SH3 domains are protein interaction domains that usually
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially a PxxP motif. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 47.7 bits (113), Expect = 2e-07
 Identities = 21/50 (42%), Positives = 29/50 (58%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           A   + Y A  +DE+S      +T +E   +GWWRG  +GQ G FP+NYV
Sbjct: 3   AFVKFAYVAEREDELSLVKGSRVTVMEKCSDGWWRGSYNGQIGWFPSNYV 52


>gnl|CDD|212743 cd11809, SH3_srGAP, Src homology 3 domain of Slit-Robo GTPase
           Activating Proteins.  Slit-Robo GTPase Activating
           Proteins (srGAPs) are Rho GAPs that interact with Robo1,
           the transmembrane receptor of Slit proteins. Slit
           proteins are secreted proteins that control axon
           guidance and the migration of neurons and leukocytes.
           Vertebrates contain three isoforms of srGAPs (srGAP1-3),
           all of which are expressed during embryonic and early
           development in the nervous system but with different
           localization and timing. A fourth member has also been
           reported (srGAP4, also called ARHGAP4). srGAPs contain
           an N-terminal F-BAR domain, a Rho GAP domain, and a
           C-terminal SH3 domain. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 53

 Score = 47.4 bits (113), Expect = 2e-07
 Identities = 20/52 (38%), Positives = 30/52 (57%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           A A +DY   ++ E+SF   D +T    + + WWRG  +GQ GL P  Y++L
Sbjct: 2   ATAQFDYTGRSERELSFKKGDSLTLYRQVSDDWWRGQLNGQDGLVPHKYITL 53


>gnl|CDD|212771 cd11837, SH3_Intersectin_2, Second Src homology 3 domain (or SH3B)
           of Intersectin.  Intersectins (ITSNs) are adaptor
           proteins that function in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction.
           They are essential for initiating clathrin-coated pit
           formation. They bind to many proteins through their
           multidomain structure and facilitate the assembly of
           multimeric complexes. Vertebrates contain two ITSN
           proteins, ITSN1 and ITSN2, which exist in alternatively
           spliced short and long isoforms. The short isoforms
           contain two Eps15 homology domains (EH1 and EH2), a
           coiled-coil region and five SH3 domains (SH3A-E), while
           the long isoforms, in addition, contain RhoGEF (also
           called Dbl-homologous or DH), Pleckstrin homology (PH)
           and C2 domains. ITSN1 and ITSN2 are both widely
           expressed, with variations depending on tissue type and
           stage of development. The second SH3 domain (or SH3B) of
           ITSN1 has been shown to bind WNK and CdGAP. SH3 domains
           are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 47.4 bits (113), Expect = 2e-07
 Identities = 23/54 (42%), Positives = 31/54 (57%), Gaps = 2/54 (3%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH-GQYGLFPANYVSL 523
           TA ALY ++A  ++ +SF   DIIT +E   E WW G    G+ G FP +YV  
Sbjct: 1   TATALYPWRAKKENHLSFAKGDIITVLEQ-QEMWWFGELEGGEEGWFPKSYVKE 53


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 52.8 bits (127), Expect = 2e-07
 Identities = 32/192 (16%), Positives = 74/192 (38%), Gaps = 7/192 (3%)

Query: 274 ENLAKQTEEESRKRSEEEKEKRKLKDQI-DLEQAQKLEERRLSELKIKEEEIEKKL-NGH 331
           + + K+     +K  E++ ++ KL+ Q+  LE      E +L E     +++ K++ + +
Sbjct: 41  KQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLN 100

Query: 332 SDVPLSPSTETPPVPVKSIL---KQPTSDGIPIQNSNKEEEEKEKQRMVQ--EEIKRKEL 386
           + +      E       +      Q +    P       E+ +   R+      +     
Sbjct: 101 ARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARA 160

Query: 387 ERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLL 446
           ER + L+    ++   +  I  ++ +      EQ   Q  L +  +E  +  A+  + L 
Sbjct: 161 ERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELS 220

Query: 447 EQERLMEELRQQ 458
             ++ +EELR  
Sbjct: 221 ADQKKLEELRAN 232



 Score = 34.7 bits (80), Expect = 0.12
 Identities = 14/90 (15%), Positives = 35/90 (38%)

Query: 374 QRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQE 433
           QR  Q ++ +   ERK+ L +    +  +Q+ +++ +  E + + E    +    K ++ 
Sbjct: 194 QRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREA 253

Query: 434 EDRLKAEEQARLLEQERLMEELRQQGTNED 463
               +A        + +   E  +    E 
Sbjct: 254 AAAAEAAAARARAAEAKRTGETYKPTAPEK 283



 Score = 31.6 bits (72), Expect = 1.0
 Identities = 21/162 (12%), Positives = 50/162 (30%), Gaps = 32/162 (19%)

Query: 260 NIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKI 319
             G + P+      + L    ++ +  R+E   E+ +L     L   Q+ ++ +L++L  
Sbjct: 151 YYGALNPARAERI-DALKATLKQLAAVRAEIAAEQAELTT---LLSEQRAQQAKLAQLLE 206

Query: 320 KEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQE 379
           + ++   +LN                    +               K EE +  +  ++ 
Sbjct: 207 ERKKTLAQLN------------------SELSAD----------QKKLEELRANESRLKN 238

Query: 380 EIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
           EI   E    +  E               + ++  +  K   
Sbjct: 239 EIASAEAAAAKAREAAAAAEAAAARARAAEAKRTGETYKPTA 280


>gnl|CDD|212755 cd11821, SH3_ASAP, Src homology 3 domain of ArfGAP with SH3 domain,
           ankyrin repeat and PH domain containing proteins.  ASAPs
           are Arf GTPase activating proteins (GAPs) and they
           function in regulating cell growth, migration, and
           invasion. They contain an N-terminal BAR domain,
           followed by a Pleckstrin homology (PH) domain, an Arf
           GAP domain, ankyrin (ANK) repeats, and a C-terminal SH3
           domain. Vertebrates contain at least three members,
           ASAP1, ASAP2, and ASAP3, but some ASAP3 proteins do not
           seem to harbor a C-terminal SH3 domain. ASAP1 and ASAP2
           show GTPase activating protein (GAP) activity towards
           Arf1 and Arf5. They do not show GAP activity towards
           Arf6, but are able to mediate Arf6 signaling by binding
           stably to GTP-Arf6. ASAP3 is an Arf6-specific GAP. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 46.9 bits (112), Expect = 3e-07
 Identities = 20/50 (40%), Positives = 28/50 (56%), Gaps = 3/50 (6%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQ---YGLFPANY 520
           ALYD QA  DDE++F   +II      D+ WW G+  G     G+FP ++
Sbjct: 4   ALYDCQADNDDELTFSEGEIIVVTGEEDDEWWEGHIEGDPSRRGVFPVSF 53


>gnl|CDD|212979 cd12046, SH3_p67phox_C, C-terminal (or second) Src Homology 3
           domain of the p67phox subunit of NADPH oxidase.
           p67phox, also called Neutrophil cytosol factor 2
           (NCF-2), is a cytosolic subunit of the phagocytic NADPH
           oxidase complex (also called Nox2 or gp91phox) which
           plays a crucial role in the cellular response to
           bacterial infection. NADPH oxidase catalyzes the
           transfer of electrons from NADPH to oxygen during
           phagocytosis forming superoxide and reactive oxygen
           species. p67phox plays a regulatory role and contains
           N-terminal TPR, first SH3 (or N-terminal or central
           SH3), PB1, and C-terminal SH3 domains. It binds, via its
           C-terminal SH3 domain, to a proline-rich region of
           p47phox and upon activation, this complex assembles with
           flavocytochrome b558, the Nox2-p22phox heterodimer.
           Concurrently, RacGTP translocates to the membrane and
           interacts with the TPR domain of p67phox, which leads to
           the activation of NADPH oxidase. The PB1 domain of
           p67phox binds to its partner PB1 domain in p40phox, and
           this facilitates the assembly of p47phox-p67phox at the
           membrane. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 47.1 bits (112), Expect = 3e-07
 Identities = 18/49 (36%), Positives = 31/49 (63%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           VAL+ Y+AS  +++ F   D+I  +  ++E W  G C G+ G+FP+ +V
Sbjct: 3   VALFSYEASQPEDLEFQKGDVILVLSKVNEDWLEGQCKGKIGIFPSAFV 51


>gnl|CDD|212772 cd11838, SH3_Intersectin_3, Third Src homology 3 domain (or SH3C)
           of Intersectin.  Intersectins (ITSNs) are adaptor
           proteins that function in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction.
           They are essential for initiating clathrin-coated pit
           formation. They bind to many proteins through their
           multidomain structure and facilitate the assembly of
           multimeric complexes. Vertebrates contain two ITSN
           proteins, ITSN1 and ITSN2, which exist in alternatively
           spliced short and long isoforms. The short isoforms
           contain two Eps15 homology domains (EH1 and EH2), a
           coiled-coil region and five SH3 domains (SH3A-E), while
           the long isoforms, in addition, contain RhoGEF (also
           called Dbl-homologous or DH), Pleckstrin homology (PH)
           and C2 domains. ITSN1 and ITSN2 are both widely
           expressed, with variations depending on tissue type and
           stage of development. The third SH3 domain (or SH3C) of
           ITSN1 has been shown to bind many proteins including
           dynamin1/2, CIN85, c-Cbl, SHIP2, Reps1, synaptojanin-1,
           and WNK, among others. The SH3C of ITSN2 has been shown
           to bind the K15 protein of Kaposi's sarcoma-associated
           herpesvirus. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 52

 Score = 47.0 bits (112), Expect = 3e-07
 Identities = 17/51 (33%), Positives = 30/51 (58%), Gaps = 1/51 (1%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           +ALY Y+++   +++F+  D+I   +  D  WW G    + G+FP+NYV  
Sbjct: 3   IALYPYESNEPGDLTFNAGDVILVTKK-DGEWWTGTIGDRTGIFPSNYVRP 52


>gnl|CDD|212820 cd11887, SH3_Bbc1, Src Homology 3 domain of Bbc1 and similar
           domains.  This subfamily is composed of Saccharomyces
           cerevisiae Bbc1p, also called Mti1p (Myosin tail
           region-interacting protein), and similar proteins. Bbc1p
           interacts with and regulates type I myosins in yeast,
           Myo3p and Myo5p, which are involved in actin
           cytoskeletal reorganization. It also binds and inhibits
           Las17, a WASp family protein that functions as an
           activator of the Arp2/3 complex. Bbc1p contains an
           N-terminal SH3 domain. SH3 domains bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs; they play a role in the
           regulation of enzymes by intramolecular interactions,
           changing the subcellular localization of signal pathway
           components and mediate multiprotein complex assemblies.
          Length = 60

 Score = 47.0 bits (112), Expect = 3e-07
 Identities = 19/61 (31%), Positives = 31/61 (50%), Gaps = 13/61 (21%)

Query: 470 YTAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQY---------GLFPANY 520
           +   ALY Y++  +D+++FD   +IT  E  D  W+     G+Y         G+FP N+
Sbjct: 2   FKVKALYPYESDHEDDLNFDVGQLITVTEEEDADWY----FGEYVDSNGNTKEGIFPKNF 57

Query: 521 V 521
           V
Sbjct: 58  V 58


>gnl|CDD|212809 cd11876, SH3_MLK, Src Homology 3 domain of Mixed Lineage Kinases.
           MLKs are Serine/Threonine Kinases (STKs), catalyzing the
           transfer of the gamma-phosphoryl group from ATP to S/T
           residues on protein substrates. MLKs act as
           mitogen-activated protein kinase kinase kinases (MAP3Ks,
           MKKKs, MAPKKKs), which phosphorylate and activate MAPK
           kinases (MAPKKs or MKKs or MAP2Ks), which in turn
           phosphorylate and activate MAPKs during signaling
           cascades that are important in mediating cellular
           responses to extracellular signals. MLKs play roles in
           immunity and inflammation, as well as in cell death,
           proliferation, and cell cycle regulation. Mammals have
           four MLKs (MLK1-4), mostly conserved in vertebrates,
           which contain an SH3 domain, a catalytic kinase domain,
           a leucine zipper, a proline-rich region, and a CRIB
           domain that mediates binding to GTP-bound Cdc42 and Rac.
           SH3 domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 58

 Score = 47.1 bits (112), Expect = 3e-07
 Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 5/55 (9%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMI-----DEGWWRGYCHGQYGLFPANYVS 522
            AL+DY A  +DE++      +  +        DEGWW G    + G+FP+NYV+
Sbjct: 3   TALFDYDARGEDELTLRRGQPVEVLSKDAAVSGDEGWWTGKIGDKVGIFPSNYVA 57


>gnl|CDD|212697 cd11763, SH3_SNX9_like, Src Homology 3 domain of Sorting Nexin 9
           and similar proteins.  Sorting nexins (SNXs) are Phox
           homology (PX) domain containing proteins that are
           involved in regulating membrane traffic and protein
           sorting in the endosomal system. SNXs differ from each
           other in their lipid-binding specificity, subcellular
           localization and specific function in the endocytic
           pathway. This subfamily consists of SH3 domain
           containing SNXs including SNX9, SNX18, SNX33, and
           similar proteins. SNX9 is localized to plasma membrane
           endocytic sites and acts primarily in clathrin-mediated
           endocytosis, while SNX18 is localized to peripheral
           endosomal structures, and acts in a trafficking pathway
           that is clathrin-independent but relies on AP-1 and
           PACS1. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 46.6 bits (111), Expect = 4e-07
 Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 474 ALYDYQASADDEISFDPDDIITNI-EMIDEGWWRGY-CHGQYGLFPANYV 521
           ALYD+ +    E+S    +++T   + + +GW  G    G+ GLFP++YV
Sbjct: 4   ALYDFDSQPSGELSLRAGEVLTITRQDVGDGWLEGRNSRGEVGLFPSSYV 53


>gnl|CDD|212987 cd12054, SH3_CD2AP_2, Second Src Homology 3 domain (SH3B) of
           CD2-associated protein.  CD2AP, also called CMS (Cas
           ligand with Multiple SH3 domains) or METS1
           (Mesenchyme-to-Epithelium Transition protein with SH3
           domains), is a cytosolic adaptor protein that plays a
           role in regulating the cytoskeleton. It is critical in
           cell-to-cell union necessary for kidney function. It
           also stabilizes the contact between a T cell and
           antigen-presenting cells. It is primarily expressed in
           podocytes at the cytoplasmic face of the slit diaphragm
           and serves as a linker anchoring podocin and nephrin to
           the actin cytoskeleton. CD2AP contains three SH3
           domains, a proline-rich region, and a C-terminal
           coiled-coil domain. All of these domains enable CD2AP to
           bind various protein partners and assemble complexes
           that have been implicated in many different functions.
           This alignment model represents the second SH3 domain
           (SH3B) of CD2AP. SH3B binds to c-Cbl in a site (TPSSRPLR
           is the core binding motif) distinct from the c-Cbl/SH3A
           binding site. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 46.1 bits (109), Expect = 6e-07
 Identities = 20/47 (42%), Positives = 30/47 (63%)

Query: 475 LYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           L++Y    +DE+     DII   E ++EGWW G  +G+ GLFP+N+V
Sbjct: 6   LFEYVPQNEDELELKVGDIIDINEEVEEGWWSGTLNGKSGLFPSNFV 52


>gnl|CDD|212747 cd11813, SH3_SGSM3, Src Homology 3 domain of Small G protein
           Signaling Modulator 3.  SGSM3 is also called
           Merlin-associated protein (MAP), RUN and SH3
           domain-containing protein (RUSC3), RUN and TBC1
           domain-containing protein 3 (RUTBC3), Rab
           GTPase-activating protein 5 (RabGAP5), or Rab GAP-like
           protein (RabGAPLP). It is expressed ubiquitously and
           functions as a regulator of small G protein RAP- and
           RAB-mediated neuronal signaling. It is involved in
           modulating NGF-mediated neurite outgrowth and
           differentiation. It also interacts with the tumor
           suppressor merlin and may play a role in the
           merlin-associated suppression of cell growth. SGSM3
           contains TBC, SH3, and RUN domains. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 53

 Score = 46.0 bits (109), Expect = 7e-07
 Identities = 24/52 (46%), Positives = 30/52 (57%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           A AL D++   DDE+ F  +DIIT I   DE  W G  +G  G FPA +V L
Sbjct: 2   AKALLDFERHDDDELGFRKNDIITIISQKDEHCWVGELNGLRGWFPAKFVEL 53


>gnl|CDD|212778 cd11844, SH3_CAS, Src homology 3 domain of CAS (Crk-Associated
           Substrate) scaffolding proteins.  CAS proteins function
           as molecular scaffolds to regulate protein complexes
           that are involved in many cellular processes including
           migration, chemotaxis, apoptosis, differentiation, and
           progenitor cell function. They mediate the signaling of
           integrins at focal adhesions where they localize, and
           thus, regulate cell invasion and survival.
           Over-expression of these proteins is implicated in poor
           prognosis, increased metastasis, and resistance to
           chemotherapeutics in many cancers such as breast, lung,
           melanoma, and glioblastoma. CAS proteins have also been
           linked to the pathogenesis of inflammatory disorders,
           Alzheimer's, Parkinson's, and developmental defects.
           They share a common domain structure that includes an
           N-terminal SH3 domain, an unstructured substrate domain
           that contains many YxxP motifs, a serine-rich four-helix
           bundle, and a FAT-like C-terminal domain. Vertebrates
           contain four CAS proteins: BCAR1 (or p130Cas), NEDD9 (or
           HEF1), EFS (or SIN), and CASS4 (or HEPL). The SH3 domain
           of CAS proteins binds to diverse partners including FAK,
           FRNK, Pyk2, PTP-PEST, DOCK180, among others. SH3 domains
           are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 56

 Score = 45.8 bits (109), Expect = 8e-07
 Identities = 22/55 (40%), Positives = 30/55 (54%), Gaps = 3/55 (5%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMID---EGWWRGYCHGQYGLFPANYVSL 523
           A ALYD  A + DE++F   DI+T +E      EGWW     G+ G+ P N + L
Sbjct: 2   ARALYDNVAESPDELAFRRGDILTVLEQNTAGLEGWWLCSLRGRQGIAPGNRLKL 56


>gnl|CDD|212738 cd11804, SH3_GRB2_like_N, N-terminal Src homology 3 domain of
           Growth factor receptor-bound protein 2 (GRB2) and
           related proteins.  This family includes the adaptor
           protein GRB2 and related proteins including Drosophila
           melanogaster Downstream of receptor kinase (DRK),
           Caenorhabditis elegans Sex muscle abnormal protein 5
           (Sem-5), GRB2-related adaptor protein (GRAP), GRAP2, and
           similar proteins. Family members contain an N-terminal
           SH3 domain, a central SH2 domain, and a C-terminal SH3
           domain. GRB2/Sem-5/DRK is a critical signaling molecule
           that regulates the Ras pathway by linking tyrosine
           kinases to the Ras guanine nucleotide releasing protein
           Sos (son of sevenless), which converts Ras to the active
           GTP-bound state. GRAP2 plays an important role in T cell
           receptor (TCR) signaling by promoting the formation of
           the SLP-76:LAT complex, which couples the TCR to the Ras
           pathway. GRAP acts as a negative regulator of T cell
           receptor (TCR)-induced lymphocyte proliferation by
           downregulating the signaling to the Ras/ERK pathway. The
           N-terminal SH3 domain of GRB2 binds to Sos and
           Sos-derived proline-rich peptides. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 52

 Score = 45.4 bits (108), Expect = 9e-07
 Identities = 21/52 (40%), Positives = 33/52 (63%), Gaps = 3/52 (5%)

Query: 472 AVALYDYQASADDEISFDPDDI--ITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           AVA +D++A+A+DE+SF    I  + N+E  D  W++    G+ GL P NY+
Sbjct: 2   AVAKHDFKATAEDELSFKKGSILKVLNMED-DPNWYKAELDGKEGLIPKNYI 52


>gnl|CDD|212803 cd11870, SH3_p67phox-like_C, C-terminal Src Homology 3 domain of
           the p67phox subunit of NADPH oxidase and similar
           proteins.  This subfamily is composed of p67phox, NADPH
           oxidase activator 1 (Noxa1), and similar proteins.
           p67phox, also called Neutrophil cytosol factor 2
           (NCF-2), and Noxa1 are homologs and are the cytosolic
           subunits of the phagocytic (Nox2) and nonphagocytic
           (Nox1) NADPH oxidase complexes, respectively. NADPH
           oxidase catalyzes the transfer of electrons from NADPH
           to oxygen during phagocytosis forming superoxide and
           reactive oxygen species. p67phox and Noxa1 play
           regulatory roles. p67phox contains N-terminal TPR, first
           SH3 (or N-terminal or central SH3), PB1, and C-terminal
           SH3 domains. Noxa1 has a similar domain architecture
           except it is lacking the N-terminal SH3 domain. The TPR
           domain of both binds activated GTP-bound Rac, while the
           C-terminal SH3 domain of p67phox and Noxa1 binds the
           polyproline motif found at the C-terminus of p47phox and
           Noxo1, respectively. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 53

 Score = 45.6 bits (108), Expect = 9e-07
 Identities = 16/49 (32%), Positives = 28/49 (57%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           VAL+ Y+A   +++ F   D I  +  ++E W  G+  G+ G+FP  +V
Sbjct: 3   VALHRYEAQGPEDLGFREGDTIDVLSEVNEAWLEGHSDGRVGIFPKCFV 51


>gnl|CDD|212919 cd11986, SH3_Stac3_1, First C-terminal Src homology 3 domain of SH3
           and cysteine-rich domain-containing protein 3 (Stac3).
           Stac proteins are putative adaptor proteins that contain
           a cysteine-rich C1 domain and one or two SH3 domains at
           the C-terminus. There are three mammalian members
           (Stac1, Stac2, and Stac3) of this family. Stac1 and
           Stac3 contain two SH3 domains while Stac2 contains a
           single SH3 domain at the C-terminus. Stac1 and Stac2
           have been found to be expressed differently in mature
           dorsal root ganglia (DRG) neurons. Stac1 is mainly
           expressed in peptidergic neurons while Stac2 is found in
           a subset of nonpeptidergic and all trkB+ neurons. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 45.7 bits (108), Expect = 9e-07
 Identities = 20/49 (40%), Positives = 30/49 (61%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           VALY ++A   D++ F P + IT I+  +E WWRG    + G FP N++
Sbjct: 3   VALYRFKALEKDDLDFHPGERITVIDDSNEEWWRGKIGEKTGYFPMNFI 51


>gnl|CDD|212834 cd11901, SH3_Nck1_2, Second Src Homology 3 domain of Nck1 adaptor
           protein.  Nck1 (also called Nckalpha) plays a crucial
           role in connecting signaling pathways of tyrosine kinase
           receptors and important effectors in actin dynamics and
           cytoskeletal remodeling. It binds and activates RasGAP,
           resulting in the downregulation of Ras. It is also
           involved in the signaling of endothilin-mediated
           inhibition of cell migration. Nck adaptor proteins
           regulate actin cytoskeleton dynamics by linking
           proline-rich effector molecules to protein tyrosine
           kinases and phosphorylated signaling intermediates. They
           contain three SH3 domains and a C-terminal SH2 domain.
           They function downstream of the PDGFbeta receptor and
           are involved in Rho GTPase signaling and actin dynamics.
           Vertebrates contain two Nck adaptor proteins: Nck1 (also
           called Nckalpha) and Nck2, which show partly overlapping
           functions but also bind distinct targets. The second SH3
           domain of Nck appears to prefer ligands containing the
           APxxPxR motif. SH3 domains are protein interaction
           domains that usually bind to proline-rich ligands with
           moderate affinity and selectivity, preferentially a PxxP
           motif. They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 45.4 bits (107), Expect = 1e-06
 Identities = 20/51 (39%), Positives = 30/51 (58%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVS 522
           A   ++Y A  +DE+S      +  +E   +GWWRG  +GQ G FP+NYV+
Sbjct: 4   AYVKFNYTAEREDELSLVKGTKVIVMEKCSDGWWRGSYNGQVGWFPSNYVT 54


>gnl|CDD|212764 cd11830, SH3_VAV_2, C-terminal (or second) Src homology 3 domain of
           VAV proteins.  VAV proteins function both as cytoplasmic
           guanine nucleotide exchange factors (GEFs) for Rho
           GTPases and scaffold proteins and they play important
           roles in cell signaling by coupling cell surface
           receptors to various effector functions. They play key
           roles in processes that require cytoskeletal
           reorganization including immune synapse formation,
           phagocytosis, cell spreading, and platelet aggregation,
           among others. Vertebrates have three VAV proteins (VAV1,
           VAV2, and VAV3). VAV proteins contain several domains
           that enable their function: N-terminal calponin homology
           (CH), acidic, RhoGEF (also called Dbl-homologous or DH),
           Pleckstrin Homology (PH), C1 (zinc finger), SH2, and two
           SH3 domains. The SH3 domain of VAV is involved in the
           localization of proteins to specific sites within the
           cell, by interacting with proline-rich sequences within
           target proteins. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 54

 Score = 45.3 bits (107), Expect = 1e-06
 Identities = 20/52 (38%), Positives = 29/52 (55%), Gaps = 1/52 (1%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNI-EMIDEGWWRGYCHGQYGLFPANYV 521
           TA A YD+ A    E+S    D++    +   +GWWRG  +G+ G FP+ YV
Sbjct: 1   TAKARYDFCARDMRELSLKEGDVVKIYNKKGQQGWWRGEINGRIGWFPSTYV 52


>gnl|CDD|212989 cd12056, SH3_CD2AP_3, Third Src Homology 3 domain (SH3C) of
           CD2-associated protein.  CD2AP, also called CMS (Cas
           ligand with Multiple SH3 domains) or METS1
           (Mesenchyme-to-Epithelium Transition protein with SH3
           domains), is a cytosolic adaptor protein that plays a
           role in regulating the cytoskeleton. It is critical in
           cell-to-cell union necessary for kidney function. It
           also stabilizes the contact between a T cell and
           antigen-presenting cells. It is primarily expressed in
           podocytes at the cytoplasmic face of the slit diaphragm
           and serves as a linker anchoring podocin and nephrin to
           the actin cytoskeleton. CD2AP contains three SH3
           domains, a proline-rich region, and a C-terminal
           coiled-coil domain. All of these domains enable CD2AP to
           bind various protein partners and assemble complexes
           that have been implicated in many different functions.
           This alignment model represents the third SH3 domain
           (SH3C) of CD2AP. SH3C has been shown to bind ubiquitin.
           SH3 domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 57

 Score = 45.2 bits (107), Expect = 1e-06
 Identities = 19/52 (36%), Positives = 33/52 (63%), Gaps = 2/52 (3%)

Query: 474 ALYDYQASADDEISFDPDDII--TNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           AL+ Y+ + +DE+ F   +II   + +  + GWW+G  +G+ G+FP N+VS 
Sbjct: 6   ALFHYEGTNEDELDFKEGEIILIISKDTGEPGWWKGELNGKEGVFPDNFVSQ 57


>gnl|CDD|212994 cd12061, SH3_betaPIX, Src Homology 3 domain of beta-Pak Interactive
           eXchange factor.  Beta-PIX, also called Rho guanine
           nucleotide exchange factor 7 (ARHGEF7) or Cool (Cloned
           out of Library)-1, activates small GTPases by exchanging
           bound GDP for free GTP. It acts as a GEF for both Cdc42
           and Rac 1, and plays important roles in regulating
           neuroendocrine exocytosis, focal adhesion maturation,
           cell migration, synaptic vesicle localization, and
           insulin secretion. PIX proteins contain an N-terminal
           SH3 domain followed by RhoGEF (also called
           Dbl-homologous or DH) and Pleckstrin Homology (PH)
           domains, and a C-terminal leucine-zipper domain for
           dimerization. The SH3 domain of PIX binds to an atypical
           PxxxPR motif in p21-activated kinases (PAKs) with high
           affinity. The binding of PAKs to PIX facilitate the
           localization of PAKs to focal complexes and also
           localizes PAKs to PIX targets Cdc43 and Rac, leading to
           the activation of PAKs. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 54

 Score = 45.1 bits (106), Expect = 1e-06
 Identities = 19/48 (39%), Positives = 31/48 (64%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           A +++Q + +DE+SF   D+I    + + GWW G  +G+ G FP+NYV
Sbjct: 4   AKFNFQQTNEDELSFSKGDVIHVTRVEEGGWWEGTHNGRTGWFPSNYV 51


>gnl|CDD|212853 cd11920, SH3_Sorbs2_1, First Src Homology 3 domain of Sorbin and
           SH3 domain containing 2 (Sorbs2), also called
           Arg-binding protein 2 (ArgBP2).  Sorbs2 or ArgBP2 is an
           adaptor protein containing one sorbin homology (SoHo)
           and three SH3 domains. It regulates actin-dependent
           processes including cell adhesion, morphology, and
           migration. It is expressed in many tissues and is
           abundant in the heart. Like vinexin, it is found in
           focal adhesion where it interacts with vinculin and
           afadin. It also localizes in epithelial cell stress
           fibers and in cardiac muscle cell Z-discs. Sorbs2 has
           been implicated to play roles in the signaling of c-Arg,
           Akt, and Pyk2. Other interaction partners of Sorbs2
           include c-Abl, flotillin, spectrin, dynamin 1/2,
           synaptojanin, PTP-PEST, among others. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 55

 Score = 45.4 bits (107), Expect = 1e-06
 Identities = 20/50 (40%), Positives = 31/50 (62%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           A A+YD++A    E+SF   D +  +  ID+ W+ G  HG+ G+FP +YV
Sbjct: 3   ARAVYDFKAQTSKELSFKKGDTVYILRKIDQNWYEGEHHGRVGIFPISYV 52


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 50.7 bits (122), Expect = 1e-06
 Identities = 29/111 (26%), Positives = 58/111 (52%), Gaps = 9/111 (8%)

Query: 361 IQNSNKEEEEKEKQRMVQ--EEIK--RKELE-----RKEELEKEQIRIKEEQENIKKKKE 411
           I+ + KE E  +K+ +++  EE+   R ELE     R+ EL++ + R+ + +E + +K E
Sbjct: 38  IEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKME 97

Query: 412 QEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNE 462
             +K+ +  E+ +K L  K++  D  + E +  + EQ   +E +      E
Sbjct: 98  SLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEE 148



 Score = 43.8 bits (104), Expect = 2e-04
 Identities = 40/188 (21%), Positives = 86/188 (45%), Gaps = 43/188 (22%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKK 327
                 E LAK+  EE++K +E  K++  L+ + ++ + +   ER   ELK +  E+++ 
Sbjct: 25  KKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELER---ELKERRNELQRL 81

Query: 328 LNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELE 387
                                             +    + EE   ++M   + K + LE
Sbjct: 82  ----------------------------------ERRLLQREETLDRKMESLDKKEENLE 107

Query: 388 RKE-ELEKEQIRIKEEQENIKKKKEQEEKERKE-----QEENQKLLLKKQQEEDRLKAEE 441
           +KE EL  ++  + E++E +++   ++ +E +      QEE +++LL++ +EE R +A +
Sbjct: 108 KKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEEAKEILLEEVEEEARHEAAK 167

Query: 442 QARLLEQE 449
             + +E+E
Sbjct: 168 LIKEIEEE 175



 Score = 31.0 bits (71), Expect = 1.6
 Identities = 29/97 (29%), Positives = 53/97 (54%), Gaps = 9/97 (9%)

Query: 375 RMVQEEIKRKELERKEELEKEQI-RIKEEQENIKKKKEQEEKE--RKEQEENQKLLLKKQ 431
            ++++ I  K+L   EEL K  I   K+E E +KK+   E KE   K + E ++ L   +
Sbjct: 16  YLLRKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELEREL---K 72

Query: 432 QEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDL 468
           +  + L+  E+ RLL++E  ++  R+  + +  EE+L
Sbjct: 73  ERRNELQRLER-RLLQREETLD--RKMESLDKKEENL 106


>gnl|CDD|212721 cd11787, SH3_SH3RF_2, Second Src Homology 3 domain of SH3 domain
           containing ring finger proteins.  This model represents
           the second SH3 domain of SH3RF1 (or POSH), SH3RF2 (or
           POSHER), SH3RF3 (POSH2), and similar domains. Members of
           this family are scaffold proteins that function as E3
           ubiquitin-protein ligases. They all contain an
           N-terminal RING finger domain and multiple SH3 domains;
           SH3RF1 and SH3RF3 have four SH3 domains while SH3RF2 has
           three. SH3RF1 plays a role in calcium homeostasis
           through the control of the ubiquitin domain protein
           Herp. It may also have a role in regulating death
           receptor mediated and JNK mediated apoptosis. SH3RF3
           interacts with p21-activated kinase 2 (PAK2) and
           GTP-loaded Rac1. It may play a role in regulating JNK
           mediated apoptosis in certain conditions. SH3RF2 acts as
           an anti-apoptotic regulator of the JNK pathway by
           binding to and promoting the degradation of SH3RF1. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 44.2 bits (105), Expect = 2e-06
 Identities = 18/53 (33%), Positives = 29/53 (54%), Gaps = 3/53 (5%)

Query: 471 TAVALYDYQASADDE---ISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANY 520
              ALYD++   +DE   ++F   D+IT I  +DE W  G    + G+FP ++
Sbjct: 1   QCKALYDFEMKDEDEKDCLTFKKGDVITVIRRVDENWAEGRLGDKIGIFPISF 53


>gnl|CDD|212724 cd11790, SH3_Amphiphysin, Src Homology 3 domain of Amphiphysin and
           related domains.  Amphiphysins function primarily in
           endocytosis and other membrane remodeling events. They
           exist in several isoforms and mammals possess two
           amphiphysin proteins from distinct genes. Amphiphysin I
           proteins, enriched in the brain and nervous system,
           contain domains that bind clathrin, Adaptor Protein
           complex 2 (AP2), dynamin, and synaptojanin. They
           function in synaptic vesicle endocytosis. Human
           autoantibodies to amphiphysin I hinder GABAergic
           signaling and contribute to the pathogenesis of
           paraneoplastic stiff-person syndrome. Some amphiphysin
           II isoforms, also called Bridging integrator 1 (Bin1),
           are localized in many different tissues and may function
           in intracellular vesicle trafficking. In skeletal
           muscle, Bin1 plays a role in the organization and
           maintenance of the T-tubule network. Mutations in Bin1
           are associated with autosomal recessive centronuclear
           myopathy. Amphiphysins contain an N-terminal BAR domain
           with an additional N-terminal amphipathic helix (an
           N-BAR), a variable central domain, and a C-terminal SH3
           domain. The SH3 domain of amphiphysins bind proline-rich
           motifs present in binding partners such as dynamin,
           synaptojanin, and nsP3. It also belongs to a subset of
           SH3 domains that bind ubiquitin in a site that overlaps
           with the peptide binding site. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 64

 Score = 44.6 bits (106), Expect = 2e-06
 Identities = 21/58 (36%), Positives = 28/58 (48%), Gaps = 7/58 (12%)

Query: 470 YTAVALYDYQASADDEISFDPDDIITNIEM-----IDEGWWRGY--CHGQYGLFPANY 520
           Y   A +DY A   DE++F+  D+I  I        DEGW  G     G  G+FP N+
Sbjct: 3   YKVRATHDYTAEDTDELTFEKGDVILVIPFDDPEEQDEGWLMGVKESTGCRGVFPENF 60


>gnl|CDD|212860 cd11927, SH3_SH3RF1_1, First Src Homology 3 domain of SH3 domain
           containing ring finger protein 1, an E3
           ubiquitin-protein ligase.  SH3RF1 is also called POSH
           (Plenty of SH3s) or SH3MD2 (SH3 multiple domains protein
           2). It is a scaffold protein that acts as an E3
           ubiquitin-protein ligase. It plays a role in calcium
           homeostasis through the control of the ubiquitin domain
           protein Herp. It may also have a role in regulating
           death receptor mediated and JNK mediated apoptosis.
           SH3RF1 also enhances the ubiquitination of ROMK1
           potassium channel resulting in its increased
           endocytosis. It contains an N-terminal RING finger
           domain and four SH3 domains. This model represents the
           first SH3 domain, located at the N-terminal half, of
           SH3RF1. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 54

 Score = 44.2 bits (104), Expect = 3e-06
 Identities = 19/52 (36%), Positives = 29/52 (55%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           A ALY+Y+     ++ F   DII     +DE W+ G  +G +G FP N+V +
Sbjct: 3   AKALYNYEGKEPGDLKFSKGDIIILRRQVDENWYHGEVNGIHGFFPTNFVQI 54


>gnl|CDD|212849 cd11916, SH3_Sorbs1_3, Third (or C-terminal) Src Homology 3 domain
           of Sorbin and SH3 domain containing 1 (Sorbs1), also
           called ponsin.  Sorbs1 is also called ponsin, SH3P12, or
           CAP (c-Cbl associated protein). It is an adaptor protein
           containing one sorbin homology (SoHo) and three SH3
           domains. It binds Cbl and plays a major role in
           regulating the insulin signaling pathway by enhancing
           insulin-induced phosphorylation of Cbl. Sorbs1, like
           vinexin, localizes at cell-ECM and cell-cell adhesion
           sites where it binds vinculin, paxillin, and afadin. It
           may function in the control of cell motility. Other
           interaction partners of Sorbs1 include c-Abl, Sos,
           flotillin, Grb4, ataxin-7, filamin C, among others. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 59

 Score = 44.6 bits (105), Expect = 3e-06
 Identities = 23/56 (41%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 470 YTAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH--GQYGLFPANYVSL 523
           Y+  ALY Y    DDE+     DI+  +E  D+GW+ G      Q+G FP NYV L
Sbjct: 2   YSYQALYSYAPQNDDELELRDGDIVDVMEKCDDGWFVGTSRRTKQFGTFPGNYVKL 57


>gnl|CDD|212988 cd12055, SH3_CIN85_2, Second Src Homology 3 domain (SH3B) of
           Cbl-interacting protein of 85 kDa.  CIN85, also called
           SH3 domain-containing kinase-binding protein 1 (SH3KBP1)
           or CD2-binding protein 3 (CD2BP3) or Ruk, is an adaptor
           protein that is involved in the downregulation of
           receptor tyrosine kinases by facilitating endocytosis
           through interaction with endophilin-associated ubiquitin
           ligase Cbl proteins. It is also important in many other
           cellular processes including vesicle-mediated transport,
           cytoskeletal remodelling, apoptosis, cell adhesion and
           migration, and viral infection, among others. CIN85
           exists as multiple variants from alternative splicing;
           the main variant contains three SH3 domains, a
           proline-rich region, and a C-terminal coiled-coil
           domain. All of these domains enable CIN85 to bind
           various protein partners and assemble complexes that
           have been implicated in many different functions. This
           alignment model represents the second SH3 domain (SH3B)
           of CIN85. SH3B has been shown to bind Cbl proline-rich
           peptides and ubiquitin. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 53

 Score = 44.2 bits (104), Expect = 3e-06
 Identities = 16/48 (33%), Positives = 28/48 (58%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
             + Y    +DE+     DII  +  ++EGWW G  +G+ G+FP+N++
Sbjct: 4   VAFSYLPQNEDELELKVGDIIEVVGEVEEGWWEGVLNGKTGMFPSNFI 51


>gnl|CDD|212861 cd11928, SH3_SH3RF3_1, First Src Homology 3 domain of SH3 domain
           containing ring finger 3, an E3 ubiquitin-protein
           ligase.  SH3RF3 is also called POSH2 (Plenty of SH3s 2)
           or SH3MD4 (SH3 multiple domains protein 4). It is a
           scaffold protein with E3 ubiquitin-protein ligase
           activity. It was identified in the screen for
           interacting partners of p21-activated kinase 2 (PAK2).
           It may play a role in regulating JNK mediated apoptosis
           in certain conditions. It also interacts with GTP-loaded
           Rac1. SH3RF3 is highly homologous to SH3RF1; it also
           contains an N-terminal RING finger domain and four SH3
           domains. This model represents the first SH3 domain,
           located at the N-terminal half, of SH3RF3. SH3 domains
           are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 54

 Score = 44.1 bits (104), Expect = 3e-06
 Identities = 17/50 (34%), Positives = 27/50 (54%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           ALY Y+     ++ F+  DII     +DE W+ G  +G +G  PA+Y+  
Sbjct: 5   ALYSYEGKEPGDLKFNKGDIIILRRKVDENWYHGELNGCHGFLPASYIQC 54


>gnl|CDD|212889 cd11956, SH3_srGAP4, Src homology 3 domain of Slit-Robo GTPase
           Activating Protein 4.  srGAP4, also called ARHGAP4, is
           highly expressed in hematopoietic cells and may play a
           role in lymphocyte differentiation. It is able to
           stimulate the GTPase activity of Rac1, Cdc42, and RhoA.
           In the nervous system, srGAP4 has been detected in
           differentiating neurites and may be involved in axon and
           dendritic growth. srGAPs are Rho GAPs that interact with
           Robo1, the transmembrane receptor of Slit proteins. Slit
           proteins are secreted proteins that control axon
           guidance and the migration of neurons and leukocytes.
           srGAPs contain an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 55

 Score = 44.1 bits (104), Expect = 3e-06
 Identities = 19/52 (36%), Positives = 26/52 (50%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           AVA +DY      E+SF   D++         WWRG  +G  GL P  Y+S+
Sbjct: 4   AVACFDYTGRTAQELSFKRGDVLLLHSKASSDWWRGEHNGMRGLIPHKYISV 55


>gnl|CDD|212769 cd11835, SH3_ARHGAP32_33, Src homology 3 domain of Rho
           GTPase-activating proteins 32 and 33, and similar
           proteins.  Members of this family contain N-terminal PX
           and Src Homology 3 (SH3) domains, a central Rho GAP
           domain, and C-terminal extensions. RhoGAPs (or ARHGAPs)
           bind to Rho proteins and enhance the hydrolysis rates of
           bound GTP. ARHGAP32 is also called RICS, PX-RICS,
           p250GAP, or p200RhoGAP. It is a Rho GTPase-activating
           protein for Cdc42 and Rac1, and is implicated in the
           regulation of postsynaptic signaling and neurite
           outgrowth. PX-RICS, a variant of RICS that contain PX
           and SH3 domains, is the main isoform expressed during
           neural development. It is involved in neural functions
           including axon and dendrite extension, postnatal
           remodeling, and fine-tuning of neural circuits during
           early brain development. ARHGAP33, also called sorting
           nexin 26 or TCGAP (Tc10/CDC42 GTPase-activating
           protein), is widely expressed in the brain where it is
           involved in regulating the outgrowth of axons and
           dendrites and is regulated by the protein tyrosine
           kinase Fyn. It is translocated to the plasma membrane in
           adipocytes in response to insulin and may be involved in
           the regulation of insulin-stimulated glucose transport.
           SH3 domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 54

 Score = 44.0 bits (104), Expect = 3e-06
 Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEM---IDEGWWRGYCHGQYGLFPANYV 521
           A  +  Y A A DE+S +  DI++ I+M    +  WWRG    Q G FP+  V
Sbjct: 2   AHVIKRYTAQAPDELSLEVGDIVSVIDMPPPEESTWWRGKKGFQVGFFPSECV 54


>gnl|CDD|212790 cd11856, SH3_p47phox_like, Src homology 3 domains of the p47phox
           subunit of NADPH oxidase and similar domains.  This
           family is composed of the tandem SH3 domains of p47phox
           subunit of NADPH oxidase and Nox Organizing protein 1
           (NoxO1), the four SH3 domains of Tks4 (Tyr kinase
           substrate with four SH3 domains), the five SH3 domains
           of Tks5, the SH3 domain of obscurin, Myosin-I,  and
           similar domains. Most members of this group also contain
           Phox homology (PX) domains, except for obscurin and
           Myosin-I. p47phox and NoxO1 are regulators of the
           phagocytic NADPH oxidase complex (also called Nox2 or
           gp91phox) and nonphagocytic NADPH oxidase Nox1,
           respectively. They play roles in the activation of their
           respective NADPH oxidase, which catalyzes the transfer
           of electrons from NADPH to molecular oxygen to form
           superoxide. Tks proteins are Src substrates and
           scaffolding proteins that play important roles in the
           formation of podosomes and invadopodia, the dynamic
           actin-rich structures that are related to cell migration
           and cancer cell invasion. Obscurin is a giant muscle
           protein that plays important roles in the organization
           and assembly of the myofibril and the sarcoplasmic
           reticulum. Type I myosins (Myosin-I) are actin-dependent
           motors in endocytic actin structures and actin patches.
           They play roles in membrane traffic in endocytic and
           secretory pathways, cell motility, and mechanosensing.
           Myosin-I contains an N-terminal actin-activated ATPase,
           a phospholipid-binding TH1 (tail homology 1) domain, and
           a C-terminal extension which includes an F-actin-binding
           TH2 domain, an SH3 domain, and an acidic peptide that
           participates in activating the Arp2/3complex. The SH3
           domain of myosin-I is required for myosin-I-induced
           actin polymerization. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 53

 Score = 43.8 bits (104), Expect = 3e-06
 Identities = 20/50 (40%), Positives = 28/50 (56%), Gaps = 4/50 (8%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYC--HGQYGLFPANY 520
           VA+ DY+A  DDEIS    +++  +E  D GWW  Y     + G  PA+Y
Sbjct: 3   VAIADYEAQGDDEISLQEGEVVEVLEKNDSGWW--YVRKGDKEGWVPASY 50


>gnl|CDD|212779 cd11845, SH3_Src_like, Src homology 3 domain of Src kinase-like
           Protein Tyrosine Kinases.  Src subfamily members include
           Src, Lck, Hck, Blk, Lyn, Fgr, Fyn, Yrk, Yes, and Brk.
           Src (or c-Src) proteins are cytoplasmic (or
           non-receptor) PTKs which are anchored to the plasma
           membrane. They contain an N-terminal SH4 domain with a
           myristoylation site, followed by SH3 and SH2 domains, a
           tyr kinase domain, and a regulatory C-terminal region
           containing a conserved tyr. They are activated by
           autophosphorylation at the tyr kinase domain, but are
           negatively regulated by phosphorylation at the
           C-terminal tyr by Csk (C-terminal Src Kinase). However,
           Brk lacks the N-terminal myristoylation sites. Src
           proteins are involved in signaling pathways that
           regulate cytokine and growth factor responses,
           cytoskeleton dynamics, cell proliferation, survival, and
           differentiation. They were identified as the first
           proto-oncogene products, and they regulate cell
           adhesion, invasion, and motility in cancer cells, and
           tumor vasculature, contributing to cancer progression
           and metastasis. Src kinases are overexpressed in a
           variety of human cancers, making them attractive targets
           for therapy. They are also implicated in acute
           inflammatory responses and osteoclast function. Src,
           Fyn, Yes, and Yrk are widely expressed, while Blk, Lck,
           Hck, Fgr, Lyn, and Brk show a limited expression
           pattern. This subfamily also includes Drosophila Src42A,
           Src oncogene at 42A (also known as Dsrc41) which
           accumulates at sites of cell-cell or cell-matrix
           adhesion, and participates in Drosphila development and
           wound healing. It has been shown to promote tube
           elongation in the tracheal system, is essential for
           proper cell-cell matching during dorsal closure, and
           regulates cell-cell contacts in developing Drosophila
           eyes. The SH3 domain of Src kinases contributes to
           substrate recruitment by binding adaptor
           proteins/substrates, and regulation of kinase activity
           through an intramolecular interaction. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 52

 Score = 43.7 bits (104), Expect = 4e-06
 Identities = 21/51 (41%), Positives = 29/51 (56%), Gaps = 4/51 (7%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH---GQYGLFPANY 520
           VALYDY+A  DD++SF   D +  ++  D  WW    H   G+ G  P+NY
Sbjct: 3   VALYDYEARTDDDLSFKKGDRLQILDDSDGDWWLA-RHLSTGKEGYIPSNY 52


>gnl|CDD|212881 cd11948, SH3_GRAP_N, N-terminal Src homology 3 domain of
           GRB2-related adaptor protein.  GRAP is a GRB-2 like
           adaptor protein that is highly expressed in lymphoid
           tissues. It acts as a negative regulator of T cell
           receptor (TCR)-induced lymphocyte proliferation by
           downregulating the signaling to the Ras/ERK pathway. It
           has been identified as a regulator of TGFbeta signaling
           in diabetic kidney tubules and may have a role in the
           pathogenesis of the disease. GRAP contains an N-terminal
           SH3 domain, a central SH2 domain, and a C-terminal SH3
           domain. The N-terminal SH3 domain of the related protein
           GRB2 binds to Sos and Sos-derived proline-rich peptides.
           SH3 domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 54

 Score = 44.0 bits (104), Expect = 4e-06
 Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 1/53 (1%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDE-GWWRGYCHGQYGLFPANYVSL 523
           AVALY +QA+  DE+ F   DI+  + M D+  W++    G+ G  P NY+ +
Sbjct: 2   AVALYSFQATESDELPFQKGDILKILNMEDDQNWYKAELQGREGYIPKNYIKV 54


>gnl|CDD|212923 cd11990, SH3_Intersectin2_2, Second Src homology 3 domain (or SH3B)
           of Intersectin-2.  Intersectin-2 (ITSN2) is an adaptor
           protein that functions in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction. It
           plays a role in clathrin-coated pit (CCP) formation. It
           binds to many proteins through its multidomain structure
           and facilitate the assembly of multimeric complexes.
           ITSN2 also functions as a specific GEF for Cdc42
           activation in epithelial morphogenesis, and is required
           in mitotic spindle orientation. It exists in
           alternatively spliced short and long isoforms. The short
           isoform contains two Eps15 homology domains (EH1 and
           EH2), a coiled-coil region and five SH3 domains
           (SH3A-E), while the long isoform, in addition, contains
           RhoGEF (also called Dbl-homologous or DH), Pleckstrin
           homology (PH) and C2 domains. The second SH3 domain (or
           SH3B) of ITSN2 is expected to bind protein partners,
           similar to ITSN1 which has been shown to bind WNK and
           CdGAP. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 52

 Score = 43.9 bits (103), Expect = 4e-06
 Identities = 23/52 (44%), Positives = 30/52 (57%), Gaps = 1/52 (1%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           A AL  + A  D+ ++F  +DIIT +E   E WW G  HG  G FP +YV L
Sbjct: 2   AQALCSWTAKKDNHLNFSKNDIITVLEQ-QENWWFGEVHGGRGWFPKSYVKL 52


>gnl|CDD|212980 cd12047, SH3_Noxa1_C, C-terminal Src Homology 3 domain of NADPH
           oxidase activator 1.  Noxa1 is a homolog of p67phox and
           is a cytosolic subunit of the nonphagocytic NADPH
           oxidase complex Nox1, which catalyzes the transfer of
           electrons from NADPH to molecular oxygen to form
           superoxide. Noxa1 is co-expressed with Nox1 in colon,
           stomach, uterus, prostate, and vascular smooth muscle
           cells, consistent with its regulatory role. It does not
           interact with p40phox, unlike p67phox, making Nox1
           activity independent of p40phox, unlike Nox2. Noxa1
           contains TPR, PB1, and C-terminal SH3 domains, but lacks
           the central SH3 domain that is present in p67phox. The
           TPR domain binds activated GTP-bound Rac. The C-terminal
           SH3 domain binds the polyproline motif found at the
           C-terminus of Noxo1, a homolog of p47phox. SH3 domains
           are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 43.7 bits (103), Expect = 4e-06
 Identities = 15/49 (30%), Positives = 27/49 (55%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           VA +DY A   +++ F   D I  +  +++ W  G+C G+ G+FP  + 
Sbjct: 3   VAQHDYSAQGPEDLEFSQGDTIDILSEVNQEWLEGHCDGRIGIFPKCFA 51


>gnl|CDD|212922 cd11989, SH3_Intersectin1_2, Second Src homology 3 domain (or SH3B)
           of Intersectin-1.  Intersectin-1 (ITSN1) is an adaptor
           protein that functions in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction. It
           plays a role in clathrin-coated pit (CCP) formation. It
           binds to many proteins through its multidomain structure
           and facilitate the assembly of multimeric complexes.
           ITSN1 localizes in membranous organelles, CCPs, the
           Golgi complex, and may be involved in the cell membrane
           trafficking system. It exists in alternatively spliced
           short and long isoforms. The short isoform contains two
           Eps15 homology domains (EH1 and EH2), a coiled-coil
           region and five SH3 domains (SH3A-E), while the long
           isoform, in addition, contains RhoGEF (also called
           Dbl-homologous or DH), Pleckstrin homology (PH) and C2
           domains. The second SH3 domain (or SH3B) of ITSN1 has
           been shown to bind WNK and CdGAP. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 52

 Score = 43.5 bits (102), Expect = 4e-06
 Identities = 23/52 (44%), Positives = 33/52 (63%), Gaps = 1/52 (1%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           A ALY ++A  D+ ++F+ +D+IT +E  D  WW G   GQ G FP +YV L
Sbjct: 2   AQALYPWRAKKDNHLNFNKNDVITVLEQQDM-WWFGEVQGQKGWFPKSYVKL 52


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 48.9 bits (117), Expect = 5e-06
 Identities = 44/197 (22%), Positives = 93/197 (47%), Gaps = 11/197 (5%)

Query: 276 LAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVP 335
           LAK  E        EE+  R  ++  +L++  +  E+ + ELK + EE+ ++L    +  
Sbjct: 231 LAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEEL 290

Query: 336 LSPSTETPPV-PVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEK 394
           L    E   +    S+L++   +   ++N  +E EE+ ++   + E  ++ELE +E L +
Sbjct: 291 LELKEEIEELEGEISLLRERLEE---LENELEELEERLEELKEKIEALKEELEERETLLE 347

Query: 395 EQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEE----DRLKAEEQARLLEQER 450
           E   +++    +++ KE+ E++     E  + L +  +EE    +   AE +  L E +R
Sbjct: 348 E---LEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKR 404

Query: 451 LMEELRQQGTNEDTEED 467
            +E L ++        +
Sbjct: 405 EIESLEERLERLSERLE 421



 Score = 48.2 bits (115), Expect = 1e-05
 Identities = 38/190 (20%), Positives = 81/190 (42%), Gaps = 16/190 (8%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQI-DLEQAQKLEERRLSELKIKEEEIEK 326
            L  +   L  Q E+   +    + E R L+D + +L +  +  ER+L ELK +   +E+
Sbjct: 671 ELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEE 730

Query: 327 KLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKEL 386
           +L                     + +         +   + +E  E+     E ++    
Sbjct: 731 ELEQLQ---------------SRLEELEEELEELEEELEELQERLEELEEELESLEEALA 775

Query: 387 ERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLL 446
           + KEE+E+ + + +  QE +++ +E+ E+  +  +  ++ L   +Q  +RL+ E +    
Sbjct: 776 KLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEE 835

Query: 447 EQERLMEELR 456
           E E L E+L 
Sbjct: 836 EIEELEEKLD 845



 Score = 48.2 bits (115), Expect = 1e-05
 Identities = 46/201 (22%), Positives = 83/201 (41%), Gaps = 25/201 (12%)

Query: 269 LRAKFENLAKQTE--EESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEK 326
           L  + E L +Q E  E  ++   E +E         L++ +K  E    EL   EEE+E+
Sbjct: 198 LEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEE 257

Query: 327 KLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKEL 386
                               ++   K+       I+    E EE  ++    +E   +  
Sbjct: 258 LQEE----------------LEEAEKE-------IEELKSELEELREELEELQEELLELK 294

Query: 387 ERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLL 446
           E  EELE E   ++E  E ++ + E+ E+  +E +E  + L ++ +E + L  E +  L 
Sbjct: 295 EEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLA 354

Query: 447 EQERLMEELRQQGTNEDTEED 467
           E E   EEL ++ +    E +
Sbjct: 355 ELEEAKEELEEKLSALLEELE 375



 Score = 44.7 bits (106), Expect = 1e-04
 Identities = 40/200 (20%), Positives = 88/200 (44%), Gaps = 5/200 (2%)

Query: 269 LRAKFENLAKQTEEESRKRSEEEKEKRKLKDQID-LEQAQKLEERRLSELKIKEEEIEKK 327
           LR + E L ++  E   +  E E E   L+++++ LE   +  E RL ELK K E ++++
Sbjct: 279 LREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEE 338

Query: 328 LNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELE 387
           L     +          +  +   ++       +    +E  E  ++ + + E +  E+ 
Sbjct: 339 LEERETLLEELEQLLAEL--EEAKEELEEKLSALLEELEELFEALREELAELEAELAEIR 396

Query: 388 RKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLE 447
            + E  K +I  +  +E +++  E+ E  ++E +E +  L + Q E + L  E +    +
Sbjct: 397 NELEELKREI--ESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQ 454

Query: 448 QERLMEELRQQGTNEDTEED 467
            E L + L++        ++
Sbjct: 455 LEELRDRLKELERELAELQE 474



 Score = 43.5 bits (103), Expect = 2e-04
 Identities = 43/201 (21%), Positives = 85/201 (42%), Gaps = 3/201 (1%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKK 327
            L  +   L ++ EE   +  E E+   +LK++I+  + +  E   L E   +     ++
Sbjct: 299 ELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEE 358

Query: 328 LNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELE 387
                +  LS   E      +++ ++       +     E EE +++    EE   +  E
Sbjct: 359 AKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSE 418

Query: 388 RKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLE 447
           R E+L++E   ++ E E ++ + E+  +E +E EE  + L  + +E   L+ E      E
Sbjct: 419 RLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKE---LERELAELQEE 475

Query: 448 QERLMEELRQQGTNEDTEEDL 468
            +RL +EL       D  E  
Sbjct: 476 LQRLEKELSSLEARLDRLEAE 496



 Score = 42.8 bits (101), Expect = 4e-04
 Identities = 47/213 (22%), Positives = 96/213 (45%), Gaps = 15/213 (7%)

Query: 267 SNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQID--LEQAQKLEERR------LSELK 318
             L  + E L ++ EE   +    E+   KLK++I+   E+ Q L+E        L E +
Sbjct: 747 EELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAE 806

Query: 319 IKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQ 378
            + + +E++L            E         L++   +     +  +EE E+ ++ + +
Sbjct: 807 RRLDALERELESLEQRRERLEQE------IEELEEEIEELEEKLDELEEELEELEKELEE 860

Query: 379 EEIKRKELE-RKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRL 437
            + + +ELE  KEELE E   ++EE+E ++++  + E E  E +E  + L ++ +E +  
Sbjct: 861 LKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAK 920

Query: 438 KAEEQARLLEQERLMEELRQQGTNEDTEEDLGY 470
               +  L E E  +EE  +     + E ++  
Sbjct: 921 LERLEVELPELEEELEEEYEDTLETELEREIER 953



 Score = 41.2 bits (97), Expect = 0.001
 Identities = 51/197 (25%), Positives = 99/197 (50%), Gaps = 10/197 (5%)

Query: 267 SNLRAKFENLAKQTEEESRK-RSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIE 325
             LR + E L +Q EE  R+  + EE+ ++      +LE+  +  E  L EL+ + EE+E
Sbjct: 705 EELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELE 764

Query: 326 KKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKE 385
           ++L    +       E      + + ++  +    ++   +E EE E++    E      
Sbjct: 765 EELESLEEALAKLKEEI-----EELEEKRQALQEELEELEEELEEAERRLDALERELESL 819

Query: 386 LERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ----EENQKLLLKKQQEEDRLKAEE 441
            +R+E LE+E   ++EE E +++K ++ E+E +E     EE ++ L + + E++ L+ E 
Sbjct: 820 EQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDEL 879

Query: 442 QARLLEQERLMEELRQQ 458
           +    E+E L EELR+ 
Sbjct: 880 KELEEEKEELEEELREL 896



 Score = 37.8 bits (88), Expect = 0.017
 Identities = 41/193 (21%), Positives = 95/193 (49%), Gaps = 17/193 (8%)

Query: 267 SNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQID--------LEQAQKLEERRLSELK 318
           + L+ + E L ++ +    +  E E+E  + + ++D        LEQ ++  E+ + EL+
Sbjct: 775 AKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELE 834

Query: 319 IKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRM-V 377
            + EE+E+KL+   +       E     ++ + ++        +    E +E E+++  +
Sbjct: 835 EEIEELEEKLDELEEELEELEKE-----LEELKEELEELEAEKEELEDELKELEEEKEEL 889

Query: 378 QEEIKRKELER---KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEE 434
           +EE++  E E    KEE+EK + R++E +  +++ + +  +  +E EE  +  L+ + E 
Sbjct: 890 EEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELER 949

Query: 435 DRLKAEEQARLLE 447
           +  + EE+   L 
Sbjct: 950 EIERLEEEIEALG 962



 Score = 37.4 bits (87), Expect = 0.021
 Identities = 37/228 (16%), Positives = 81/228 (35%), Gaps = 41/228 (17%)

Query: 218 EKVDKHQSQKDDNRQVVTSSKVKDLIAANSTNASKENIKPKPNIGHVKPSNLRAKFENLA 277
           E++++ + + +  ++ +   +             +E  +            L   FE L 
Sbjct: 323 ERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALR 382

Query: 278 KQTEEESRKRSEEEKEKRKLKDQID-LEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPL 336
           ++  E   + +E   E  +LK +I+ LE+  +    RL +LK + +E+E +L        
Sbjct: 383 EELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAEL-------- 434

Query: 337 SPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQ 396
                                           EE + +     E   +  E+ EEL    
Sbjct: 435 --------------------------------EELQTELEELNEELEELEEQLEELRDRL 462

Query: 397 IRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQAR 444
             ++ E   ++++ ++ EKE    E     L  +Q+    ++A  +A 
Sbjct: 463 KELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEAL 510



 Score = 36.2 bits (84), Expect = 0.054
 Identities = 51/236 (21%), Positives = 98/236 (41%), Gaps = 54/236 (22%)

Query: 230 NRQVVTSSKVKDLIAANSTNASKENIKPKPNIGHV---KPSNLRAKFENLAKQTEEESRK 286
           N + V    ++DL+A +       +I  +  +  +   KP   R   E  A  ++ + RK
Sbjct: 115 NGEKVRLKDIQDLLADSGIGKESYSIVSQGKVEEIINAKPEERRKLIEEAAGVSKYKERK 174

Query: 287 RSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVP 346
              EE E++       LE+ +      L  L+   EE+EK+L       L          
Sbjct: 175 ---EEAERK-------LERTE----ENLERLEDLLEELEKQLE-----KLE--------- 206

Query: 347 VKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELE----RKEELEKEQIRIKEE 402
                              ++ E+ E+ + ++ E++  EL     + +EL KE   ++EE
Sbjct: 207 -------------------RQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEE 247

Query: 403 QENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
              ++++ E+ ++E +E E+  + L  + +E      E Q  LLE +  +EEL  +
Sbjct: 248 LSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGE 303



 Score = 35.8 bits (83), Expect = 0.060
 Identities = 36/150 (24%), Positives = 72/150 (48%), Gaps = 8/150 (5%)

Query: 361 IQNSNKEEEEKEKQ----RMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKE 416
           +++     EE  +Q        EE+KR+    +EELE+ Q R++E +E +++ +E+ E+ 
Sbjct: 697 LRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEEL 756

Query: 417 RKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGT----NEDTEEDLGYTA 472
           ++  EE ++ L   ++   +LK E +    +++ L EEL +        E   + L    
Sbjct: 757 QERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALEREL 816

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDE 502
            +L   +   + EI    ++I    E +DE
Sbjct: 817 ESLEQRRERLEQEIEELEEEIEELEEKLDE 846



 Score = 33.5 bits (77), Expect = 0.30
 Identities = 36/151 (23%), Positives = 71/151 (47%), Gaps = 13/151 (8%)

Query: 268  NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQI-DLEQAQKLEERRLSELKIKEEEIEK 326
             L  + + L ++ EE   +  E E E  +LK++I  L +  +  E +L  L+++  E+E+
Sbjct: 874  ELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEE 933

Query: 327  KLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKEL 386
            +L    +  L    E         L++      P+     EE E+ ++R        +EL
Sbjct: 934  ELEEEYEDTLETELER----EIERLEEEIEALGPVNLRAIEEYEEVEER-------YEEL 982

Query: 387  ERK-EELEKEQIRIKEEQENIKKKKEQEEKE 416
            + + E+LE+ + ++ E  E + K+K +  KE
Sbjct: 983  KSQREDLEEAKEKLLEVIEELDKEKRERFKE 1013



 Score = 30.8 bits (70), Expect = 2.1
 Identities = 24/101 (23%), Positives = 48/101 (47%), Gaps = 5/101 (4%)

Query: 366 KEEEEKEKQRM--VQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEEN 423
           KE +E+ ++++   +E ++R E    EELEK+  +++ + E  ++ +E   K    + E 
Sbjct: 171 KERKEEAERKLERTEENLERLEDLL-EELEKQLEKLERQAEKAERYQEL--KAELRELEL 227

Query: 424 QKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDT 464
             LL K ++    L+  E+     +E L E   +    E  
Sbjct: 228 ALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKE 268



 Score = 28.9 bits (65), Expect = 7.8
 Identities = 33/176 (18%), Positives = 82/176 (46%), Gaps = 22/176 (12%)

Query: 265  KPSNLRAKFENLAKQTEE-ESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEE 323
            +   L  + E L ++ EE E+ K   E++ K   +++ +LE+  +  E  L+ELK + E+
Sbjct: 850  ELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEK 909

Query: 324  IEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQE-EIK 382
            + ++L                  ++ +  +       ++   ++  E E +R ++  E +
Sbjct: 910  LRERLE------------ELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEE 957

Query: 383  RKEL--------ERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKK 430
             + L        E  EE+E+    +K ++E++++ KE+  +  +E ++ ++   K+
Sbjct: 958  IEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKEKRERFKE 1013


>gnl|CDD|212885 cd11952, SH3_iASPP, Src Homology 3 (SH3) domain of Inhibitor of
           ASPP protein (iASPP).  iASPP, also called
           RelA-associated inhibitor (RAI), is an oncoprotein that
           inhibits the apoptotic transactivation potential of p53.
           It is upregulated in human breast cancers expressing
           wild-type p53, in acute leukemias regardless of the p53
           mutation status, as well as in ovarian cancer where it
           is associated with poor patient outcome and
           chemoresistance. iASPP is also a binding partner and
           negative regulator of p65RelA, which promotes cell
           proliferation and inhibits apoptosis; p65RelA has the
           opposite effect on cell growth compared to the p53
           family. It contains a proline-rich region, four ankyrin
           (ANK) repeats, and an SH3 domain at its C-terminal half.
           The SH3 domain and the ANK repeats of iASPP contribute
           to the p53 binding site; they bind to the DNA binding
           domain of p53. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 56

 Score = 43.4 bits (102), Expect = 6e-06
 Identities = 21/52 (40%), Positives = 27/52 (51%), Gaps = 2/52 (3%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEG--WWRGYCHGQYGLFPANYVSL 523
           AL+DY A   DE+SF   D++T +    EG  WW     G+ G  P NY  L
Sbjct: 5   ALWDYSAEFPDELSFKEGDMVTVLRKDGEGTDWWWASLCGREGYVPRNYFGL 56


>gnl|CDD|212936 cd12003, SH3_EFS, Src homology 3 domain of CAS (Crk-Associated
           Substrate) scaffolding protein family member, Embryonal
           Fyn-associated Substrate.  EFS is also called HEFS,
           CASS3 (Cas scaffolding protein family member 3) or SIN
           (Src-interacting protein). It was identified based on
           interactions with the Src kinases, Fyn and Yes. It plays
           a role in thymocyte development and acts as a negative
           regulator of T cell proliferation. CAS proteins function
           as molecular scaffolds to regulate protein complexes
           that are involved in many cellular processes. They share
           a common domain structure that includes an N-terminal
           SH3 domain, an unstructured substrate domain that
           contains many YxxP motifs, a serine-rich four-helix
           bundle, and a FAT-like C-terminal domain. The SH3 domain
           of CAS proteins binds to diverse partners including FAK,
           FRNK, Pyk2, PTP-PEST, DOCK180, among others. SH3 domains
           are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 62

 Score = 43.3 bits (102), Expect = 6e-06
 Identities = 21/57 (36%), Positives = 30/57 (52%), Gaps = 3/57 (5%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMID---EGWWRGYCHGQYGLFPANYVSLQQ 525
           A ALYD  A + +E+SF   D++  ++       GWW    HGQ G+ PAN + L  
Sbjct: 3   AKALYDNAAESPEELSFRRGDVLMVLKREHGSLPGWWLCSLHGQQGIAPANRLRLLP 59


>gnl|CDD|212805 cd11872, SH3_DOCK_AB, Src Homology 3 domain of Class A and B
           Dedicator of Cytokinesis proteins.  DOCK proteins are
           atypical guanine nucleotide exchange factors (GEFs) that
           lack the conventional Dbl homology (DH) domain. They are
           divided into four classes (A-D) based on sequence
           similarity and domain architecture: class A includes
           Dock1, 2 and 5; class B includes Dock3 and 4; class C
           includes Dock6, 7, and 8; and class D includes Dock9, 10
           and 11. All DOCKs contain two homology domains: the
           DHR-1 (Dock homology region-1), also called CZH1 (CED-5,
           Dock180, and MBC-zizimin homology 1), and DHR-2 (also
           called CZH2 or Docker). The DHR-1 domain binds
           phosphatidylinositol-3,4,5-triphosphate while DHR-2
           contains the catalytic activity for Rac and/or Cdc42.
           This subfamily includes only Class A and B DOCKs, which
           also contain an SH3 domain at the N-terminal region and
           a PxxP motif at the C-terminus. Class A/B DOCKs are
           mostly specific GEFs for Rac, except Dock4 which
           activates the Ras family GTPase Rap1, probably
           indirectly through interaction with Rap regulatory
           proteins. The SH3 domain of class A/B DOCKs have been
           shown to bind Elmo, a scaffold protein that promotes GEF
           activity of DOCKs by releasing DHR-2 autoinhibition by
           the intramolecular SH3 domain. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 56

 Score = 42.6 bits (101), Expect = 1e-05
 Identities = 17/56 (30%), Positives = 31/56 (55%), Gaps = 4/56 (7%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYC---HGQYGLFPANYVSLQ 524
            VA+Y++Q   + ++S    D +  +E   EGW+RG+        G+FP +YV ++
Sbjct: 2   GVAIYNFQGDGEHQLSLQVGDTVQILE-ECEGWYRGFSLRNKSLKGIFPKSYVHIK 56


>gnl|CDD|212946 cd12013, SH3_RIM-BP_3, Third Src homology 3 domain of
           Rab3-interacting molecules (RIMs) binding proteins.
           RIMs binding proteins (RBPs, RIM-BPs) associate with
           calcium channels present in photoreceptors, neurons, and
           hair cells; they interact simultaneously with specific
           calcium channel subunits, and active zone proteins, RIM1
           and RIM2. RIMs are part of the matrix at the presynaptic
           active zone and are associated with synaptic vesicles
           through their interaction with the small GTPase Rab3.
           RIM-BPs play a role in regulating synaptic transmission
           by serving as adaptors and linking calcium channels with
           the synaptic vesicle release machinery. RIM-BPs contain
           three SH3 domains and two to three fibronectin III
           repeats. Invertebrates contain one, while vertebrates
           contain at least two RIM-BPs, RIM-BP1 and RIM-BP2.
           RIM-BP1 is also called peripheral-type benzodiazapine
           receptor associated protein 1 (PRAX-1). Mammals contain
           a third protein, RIM-BP3. RIM-BP1 and RIM-BP2 are
           predominantly expressed in the brain where they display
           overlapping but distinct expression patterns, while
           RIM-BP3 is almost exclusively expressed in the testis
           and is essential in spermiogenesis. The SH3 domains of
           RIM-BPs bind to the PxxP motifs of RIM1, RIM2, and
           L-type (alpha1D) and N-type (alpha1B) calcium channel
           subunits. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity. They play versatile and diverse roles
           in the cell including the regulation of enzymes,
           changing the subcellular localization of signaling
           pathway components, and mediating the formation of
           multiprotein complex assemblies.
          Length = 61

 Score = 42.8 bits (101), Expect = 1e-05
 Identities = 23/57 (40%), Positives = 34/57 (59%), Gaps = 8/57 (14%)

Query: 473 VALYDY-------QASADDEISFDPDDIITNI-EMIDEGWWRGYCHGQYGLFPANYV 521
           VAL+DY          A+ E+SF   DIIT   EM ++G++ G  +GQ GL P+N++
Sbjct: 3   VALFDYDPRESSPNVDAEVELSFRAGDIITVFGEMDEDGFYYGELNGQRGLVPSNFL 59


>gnl|CDD|212918 cd11985, SH3_Stac2_C, C-terminal Src homology 3 domain of SH3 and
           cysteine-rich domain-containing protein 2 (Stac2).  Stac
           proteins are putative adaptor proteins that contain a
           cysteine-rich C1 domain and one or two SH3 domains at
           the C-terminus. There are three mammalian members
           (Stac1, Stac2, and Stac3) of this family. Stac2 contains
           a single SH3 domain at the C-terminus unlike Stac1 and
           Stac3, which contain two C-terminal SH3 domains. Stac1
           and Stac2 have been found to be expressed differently in
           mature dorsal root ganglia (DRG) neurons. Stac1 is
           mainly expressed in peptidergic neurons while Stac2 is
           found in a subset of nonpeptidergic and all trkB+
           neurons. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 42.6 bits (100), Expect = 1e-05
 Identities = 16/51 (31%), Positives = 29/51 (56%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           + VALY +    ++++   P D +  ++  +E WW+G    + G FPAN+V
Sbjct: 1   SYVALYKFLPQENNDLPLQPGDRVMVVDDSNEDWWKGKSGDRVGFFPANFV 51


>gnl|CDD|212701 cd11767, SH3_Nck_3, Third Src Homology 3 domain of Nck adaptor
           proteins.  This group contains the third SH3 domain of
           Nck, the first SH3 domain of Caenorhabditis elegans
           Ced-2 (Cell death abnormality protein 2), and similar
           domains. Nck adaptor proteins regulate actin
           cytoskeleton dynamics by linking proline-rich effector
           molecules to protein tyrosine kinases and phosphorylated
           signaling intermediates. They contain three SH3 domains
           and a C-terminal SH2 domain. They function downstream of
           the PDGFbeta receptor and are involved in Rho GTPase
           signaling and actin dynamics. Vertebrates contain two
           Nck adaptor proteins: Nck1 (also called Nckalpha) and
           Nck2 (also called Nckbeta or Growth factor
           receptor-bound protein 4, Grb4), which show partly
           overlapping functions but also bind distinct targets.
           Their SH3 domains are involved in recruiting downstream
           effector molecules, such as the N-WASP/Arp2/3 complex,
           which when activated induces actin polymerization that
           results in the production of pedestals, or protrusions
           of the plasma membrane. The third SH3 domain of Nck
           appears to prefer ligands with a PxAPxR motif. SH3
           domains are protein interaction domains that usually
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially a PxxP motif. Ced-2 is a
           cell corpse engulfment protein that interacts with Ced-5
           in a pathway that regulates the activation of Ced-10, a
           Rac small GTPase.
          Length = 56

 Score = 42.7 bits (101), Expect = 1e-05
 Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 3/55 (5%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEM--IDEGWWRGY-CHGQYGLFPANYVSL 523
            VALY +    D+E+SF+  + +  IE    D  WW+     G  GL P NYV +
Sbjct: 2   VVALYPFTGENDEELSFEKGERLEIIEKPEDDPDWWKARNALGTTGLVPRNYVEV 56


>gnl|CDD|212783 cd11849, SH3_SPIN90, Src homology 3 domain of SH3 protein
           interacting with Nck, 90 kDa (SPIN90).  SPIN90 is also
           called NCK interacting protein with SH3 domain
           (NCKIPSD), Dia-interacting protein (DIP), 54 kDa
           vimentin-interacting protein (VIP54), or
           WASP-interacting SH3-domain protein (WISH). It is an
           F-actin binding protein that regulates actin
           polymerization and endocytosis. It associates with the
           Arp2/3 complex near actin filaments and determines
           filament localization at the leading edge of
           lamellipodia. SPIN90 is expressed in the early stages of
           neuronal differentiation and plays a role in regulating
           growth cone dynamics and neurite outgrowth. It also
           interacts with IRSp53 and regulates cell motility by
           playing a role in the formation of membrane protrusions.
           SPIN90 contains an N-terminal SH3 domain, a proline-rich
           domain, and a C-terminal VCA (verprolin-homology and
           cofilin-like acidic) domain. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 53

 Score = 42.3 bits (100), Expect = 1e-05
 Identities = 16/49 (32%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGY-CHGQYGLFPANYV 521
           ALYD++++  + +SF   +    +E  +  WW      G+ G  PANYV
Sbjct: 4   ALYDFKSAEPNTLSFSEGETFLLLERSNAHWWLVTNHSGETGYVPANYV 52


>gnl|CDD|212716 cd11782, SH3_Sorbs_2, Second Src Homology 3 domain of Sorbin and
           SH3 domain containing (Sorbs) proteins and similar
           domains.  This family, also called the vinexin family,
           is composed predominantly of adaptor proteins containing
           one sorbin homology (SoHo) and three SH3 domains.
           Members include the second SH3 domains of Sorbs1 (or
           ponsin), Sorbs2 (or ArgBP2), Vinexin (or Sorbs3), and
           similar domains. They are involved in the regulation of
           cytoskeletal organization, cell adhesion, and growth
           factor signaling. Members of this family bind multiple
           partners including signaling molecules like c-Abl,
           c-Arg, Sos, and c-Cbl, as well as cytoskeletal molecules
           such as vinculin and afadin. They may have overlapping
           functions. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 42.3 bits (100), Expect = 1e-05
 Identities = 20/50 (40%), Positives = 29/50 (58%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           A A Y++ A    E+SF   D+IT    +DE W+ G   G+ G+FP +YV
Sbjct: 2   ARAKYNFNADTGVELSFRKGDVITLTRRVDENWYEGRIGGRQGIFPVSYV 51


>gnl|CDD|212909 cd11976, SH3_VAV1_2, C-terminal (or second) Src homology 3 domain
           of VAV1 protein.  VAV1 is expressed predominantly in the
           hematopoietic system and it plays an important role in
           the development and activation of B and T cells. It is
           activated by tyrosine phosphorylation to function as a
           guanine nucleotide exchange factor (GEF) for Rho GTPases
           following cell surface receptor activation, triggering
           various effects such as cytoskeletal reorganization,
           transcription regulation, cell cycle progression, and
           calcium mobilization. It also serves as a scaffold
           protein and has been shown to interact with Ku70, Socs1,
           Janus kinase 2, SIAH2, S100B, Abl gene, ZAP-70, SLP76,
           and Syk, among others. VAV proteins contain several
           domains that enable their function: N-terminal calponin
           homology (CH), acidic, RhoGEF (also called
           Dbl-homologous or DH), Pleckstrin Homology (PH), C1
           (zinc finger), SH2, and two SH3 domains. The C-terminal
           SH3 domain of Vav1 interacts with a wide variety of
           proteins including cytoskeletal regulators (zyxin),
           RNA-binding proteins (Sam68), transcriptional
           regulators, viral proteins, and dynamin 2. SH3 domains
           are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 54

 Score = 42.2 bits (99), Expect = 2e-05
 Identities = 24/52 (46%), Positives = 30/52 (57%), Gaps = 1/52 (1%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDE-GWWRGYCHGQYGLFPANYV 521
           TA A YD+ A    E+S    DII  +    + GWWRG  +G+ G FPANYV
Sbjct: 1   TAKARYDFCARDRSELSLKEGDIIKILNKKGQQGWWRGEIYGRVGWFPANYV 52


>gnl|CDD|212931 cd11998, SH3_PACSIN1-2, Src homology 3 domain of Protein kinase C
           and Casein kinase Substrate in Neurons 1 (PACSIN1) and
           PACSIN 2.  PACSIN 1 or Syndapin I (Synaptic
           dynamin-associated protein I) is expressed specifically
           in the brain and is localized in neurites and synaptic
           boutons. It binds the brain-specific proteins dynamin I,
           synaptojanin, synapsin I, and neural Wiskott-Aldrich
           syndrome protein (nWASP), and functions as a link
           between the cytoskeletal machinery and synaptic vesicle
           endocytosis. PACSIN 1 interacts with huntingtin and may
           be implicated in the neuropathology of Huntington's
           disease. PACSIN 2 or Syndapin II is expressed
           ubiquitously and is involved in the regulation of
           tubulin polymerization. It associates with Golgi
           membranes and forms a complex with dynamin II which is
           crucial in promoting vesicle formation from the
           trans-Golgi network. PACSINs act as regulators of
           cytoskeletal and membrane dynamics. Vetebrates harbor
           three isoforms with distinct expression patterns and
           specific functions. PACSINs contain an N-terminal F-BAR
           domain and a C-terminal SH3 domain. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 56

 Score = 42.2 bits (99), Expect = 2e-05
 Identities = 26/50 (52%), Positives = 31/50 (62%), Gaps = 2/50 (4%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDE-GWWRGYC-HGQYGLFPANYV 521
           ALYDY     DE+SF   D +T +E  DE GW +G    GQ GL+PANYV
Sbjct: 5   ALYDYDGQEQDELSFKAGDELTKLEDEDEQGWCKGRLDSGQVGLYPANYV 54


>gnl|CDD|212705 cd11771, SH3_Pex13p_fungal, Src Homology 3 domain of fungal
           peroxisomal membrane protein Pex13p.  Pex13p, located in
           the peroxisomal membrane, contains two transmembrane
           regions and a C-terminal SH3 domain. It binds to the
           peroxisomal targeting type I (PTS1) receptor Pex5p and
           the docking factor Pex14p through its SH3 domain. It is
           essential for both PTS1 and PTS2 protein import pathways
           into the peroxisomal matrix. Pex13p binds Pex14p, which
           contains a PxxP motif, in a classical fashion to the
           proline-rich ligand binding site of its SH3 domain. It
           binds the WxxxF/Y motif of Pex5p in a novel site that
           does not compete with Pex14p binding. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 60

 Score = 42.3 bits (100), Expect = 2e-05
 Identities = 20/57 (35%), Positives = 29/57 (50%), Gaps = 7/57 (12%)

Query: 474 ALYDYQASADD-EISFDPDDII-----TNIEMIDEGWWRGYCH-GQYGLFPANYVSL 523
           ALYD+     + E+S    DI+     T+    D  WW+G    G+ G FP+NYV +
Sbjct: 4   ALYDFTPENPEMELSLKKGDIVAVLSKTDPLGRDSEWWKGRTRDGRIGWFPSNYVEV 60


>gnl|CDD|212717 cd11783, SH3_SH3RF_3, Third Src Homology 3 domain of SH3 domain
           containing ring finger 1 (SH3RF1), SH3RF3, and similar
           domains.  SH3RF1 (or POSH) and SH3RF3 (or POSH2) are
           scaffold proteins that function as E3 ubiquitin-protein
           ligases. They contain an N-terminal RING finger domain
           and four SH3 domains. This model represents the third
           SH3 domain, located in the middle of SH3RF1 and SH3RF3,
           and similar domains. SH3RF1 plays a role in calcium
           homeostasis through the control of the ubiquitin domain
           protein Herp. It may also have a role in regulating
           death receptor mediated and JNK mediated apoptosis.
           SH3RF3 interacts with p21-activated kinase 2 (PAK2) and
           GTP-loaded Rac1. It may play a role in regulating JNK
           mediated apoptosis in certain conditions. SH3 domains
           are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 42.0 bits (99), Expect = 2e-05
 Identities = 20/51 (39%), Positives = 28/51 (54%), Gaps = 2/51 (3%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYC--HGQYGLFPANYV 521
           VALY Y+    DE+     ++ T  E   +GW++G     GQ G+FP NYV
Sbjct: 3   VALYPYKPQKPDELELRKGEMYTVTEKCQDGWFKGTSLRTGQSGVFPGNYV 53


>gnl|CDD|212935 cd12002, SH3_NEDD9, Src homology 3 domain of CAS (Crk-Associated
           Substrate) scaffolding protein family member, Neural
           precursor cell Expressed, Developmentally Down-regulated
           9.  NEDD9 is also called human enhancer of filamentation
           1 (HEF1) or CAS-L (Crk-associated substrate in
           lymphocyte). It was first described as a gene
           predominantly expressed in early embryonic brain, and
           was also isolated from a screen of human proteins that
           regulate filamentous budding in yeast, and as a tyrosine
           phosphorylated protein in lymphocytes. It promotes
           metastasis in different solid tumors. NEDD9 localizes in
           focal adhesions and associates with FAK and Abl kinase.
           It also interacts with SMAD3 and the proteasomal
           machinery which allows its rapid turnover; these
           interactions are not shared by other CAS proteins. CAS
           proteins function as molecular scaffolds to regulate
           protein complexes that are involved in many cellular
           processes. They share a common domain structure that
           includes an N-terminal SH3 domain, an unstructured
           substrate domain that contains many YxxP motifs, a
           serine-rich four-helix bundle, and a FAT-like C-terminal
           domain. The SH3 domain of CAS proteins binds to diverse
           partners including FAK, FRNK, Pyk2, PTP-PEST, DOCK180,
           among others. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 57

 Score = 41.9 bits (98), Expect = 2e-05
 Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMID---EGWWRGYCHGQYGLFPANYVSL 523
           A ALYD      +E++F   DI+T IE      EGWW    HG+ G+ P N + L
Sbjct: 2   ARALYDNVPECAEELAFRKGDILTVIEQNTGGLEGWWLCSLHGRQGIAPGNRLKL 56


>gnl|CDD|212708 cd11774, SH3_Sla1p_2, Second Src Homology 3 domain of the fungal
           endocytic adaptor protein Sla1p.  Sla1p facilitates
           endocytosis by playing a role as an adaptor protein in
           coupling components of the actin cytoskeleton to the
           endocytic machinery. It interacts with Abp1p, Las17p and
           Pan1p, which are activator proteins of actin-related
           protein 2/3 (Arp2/3). Sla1p contains multiple domains
           including three SH3 domains, a SAM (sterile alpha motif)
           domain, and a Sla1 homology domain 1 (SHD1), which binds
           to the NPFXD motif that is found in many integral
           membrane proteins such as the Golgi-localized
           Arf-binding protein Lsb5p and the P4-ATPases, Drs2p and
           Dnf1p. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 52

 Score = 42.1 bits (99), Expect = 2e-05
 Identities = 19/52 (36%), Positives = 28/52 (53%), Gaps = 1/52 (1%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHG-QYGLFPANYV 521
            A ALYDY    ++E+SF+  D +   +  D  W     +G Q+G  PANY+
Sbjct: 1   QAKALYDYDKQTEEELSFNEGDTLDVYDDSDSDWILVGFNGTQFGFVPANYI 52


>gnl|CDD|212921 cd11988, SH3_Intersectin2_1, First Src homology 3 domain (or SH3A)
           of Intersectin-2.  Intersectin-2 (ITSN2) is an adaptor
           protein that functions in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction. It
           plays a role in clathrin-coated pit (CCP) formation. It
           binds to many proteins through its multidomain structure
           and facilitate the assembly of multimeric complexes.
           ITSN2 also functions as a specific GEF for Cdc42
           activation in epithelial morphogenesis, and is required
           in mitotic spindle orientation. It exists in
           alternatively spliced short and long isoforms. The short
           isoform contains two Eps15 homology domains (EH1 and
           EH2), a coiled-coil region and five SH3 domains
           (SH3A-E), while the long isoform, in addition, contains
           RhoGEF (also called Dbl-homologous or DH), Pleckstrin
           homology (PH) and C2 domains. The first SH3 domain (or
           SH3A) of ITSN2 is expected to bind many protein
           partners, similar to ITSN1 which has been shown to bind
           Sos1, dynamin1/2, CIN85, c-Cbl, PI3K-C2, SHIP2, N-WASP,
           and CdGAP, among others. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 57

 Score = 42.2 bits (99), Expect = 2e-05
 Identities = 23/54 (42%), Positives = 29/54 (53%), Gaps = 10/54 (18%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDE------GWWRGYCHGQYGLFPANYV 521
           ALY ++A   DE+SF+  DII     +DE      GW  G   G +G FP NYV
Sbjct: 6   ALYPFEARNHDEMSFNAGDII----QVDEKTVGEPGWLYGSFQGNFGWFPCNYV 55


>gnl|CDD|212993 cd12060, SH3_alphaPIX, Src Homology 3 domain of alpha-Pak
           Interactive eXchange factor.  Alpha-PIX, also called Rho
           guanine nucleotide exchange factor 6 (ARHGEF6) or Cool
           (Cloned out of Library)-2, activates small GTPases by
           exchanging bound GDP for free GTP. It acts as a GEF for
           both Cdc42 and Rac 1, and is localized in dendritic
           spines where it regulates spine morphogenesis. It
           controls dendritic length and spine density in the
           hippocampus. Mutations in the ARHGEF6 gene cause
           X-linked intellectual disability in humans. PIX proteins
           contain an N-terminal SH3 domain followed by RhoGEF
           (also called Dbl-homologous or DH) and Pleckstrin
           Homology (PH) domains, and a C-terminal leucine-zipper
           domain for dimerization. The SH3 domain of PIX binds to
           an atypical PxxxPR motif in p21-activated kinases (PAKs)
           with high affinity. The binding of PAKs to PIX
           facilitate the localization of PAKs to focal complexes
           and also localizes PAKs to PIX targets Cdc43 and Rac,
           leading to the activation of PAKs. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 58

 Score = 41.9 bits (98), Expect = 2e-05
 Identities = 18/48 (37%), Positives = 30/48 (62%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           A ++++ + +DE+S    DII    + + GWW G  +G+ G FP+NYV
Sbjct: 6   ARFNFKQTNEDELSVCKGDIIYVTRVEEGGWWEGTLNGKTGWFPSNYV 53


>gnl|CDD|212934 cd12001, SH3_BCAR1, Src homology 3 domain of the CAS
           (Crk-Associated Substrate) scaffolding protein family
           member, Breast Cancer Anti-estrogen Resistance 1.
           BCAR1, also called p130cas or CASS1, is the founding
           member of the CAS family of scaffolding proteins and was
           originally identified through its ability to associate
           with Crk. The name BCAR1 was designated because the
           human gene was identified in a screen for genes that
           promote resistance to tamoxifen. It is widely expressed
           and its deletion is lethal in mice. It plays a role in
           regulating cell motility, survival, proliferation,
           transformation, cancer progression, and bacterial
           pathogenesis. CAS proteins function as molecular
           scaffolds to regulate protein complexes that are
           involved in many cellular processes. They share a common
           domain structure that includes an N-terminal SH3 domain,
           an unstructured substrate domain that contains many YxxP
           motifs, a serine-rich four-helix bundle, and a FAT-like
           C-terminal domain. The SH3 domain of CAS proteins binds
           to diverse partners including FAK, FRNK, Pyk2, PTP-PEST,
           DOCK180, among others. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 68

 Score = 42.0 bits (98), Expect = 2e-05
 Identities = 23/59 (38%), Positives = 32/59 (54%), Gaps = 3/59 (5%)

Query: 468 LGYTAVALYDYQASADDEISFDPDDIITNIEMIDE---GWWRGYCHGQYGLFPANYVSL 523
           L   A ALYD  A + DE+SF   DI+T +E   +   GWW    HG+ G+ P N + +
Sbjct: 1   LNVLAKALYDNVAESPDELSFRKGDIMTVLERDTQGLDGWWLCSLHGRQGIVPGNRLKI 59


>gnl|CDD|212773 cd11839, SH3_Intersectin_4, Fourth Src homology 3 domain (or SH3D)
           of Intersectin.  Intersectins (ITSNs) are adaptor
           proteins that function in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction.
           They are essential for initiating clathrin-coated pit
           formation. They bind to many proteins through their
           multidomain structure and facilitate the assembly of
           multimeric complexes. Vertebrates contain two ITSN
           proteins, ITSN1 and ITSN2, which exist in alternatively
           spliced short and long isoforms. The short isoforms
           contain two Eps15 homology domains (EH1 and EH2), a
           coiled-coil region and five SH3 domains (SH3A-E), while
           the long isoforms, in addition, contain RhoGEF (also
           called Dbl-homologous or DH), Pleckstrin homology (PH)
           and C2 domains. ITSN1 and ITSN2 are both widely
           expressed, with variations depending on tissue type and
           stage of development. The fourth SH3 domain (or SH3D) of
           ITSN1 has been shown to bind SHIP2, Numb, CdGAP, and
           N-WASP. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 58

 Score = 41.6 bits (98), Expect = 2e-05
 Identities = 17/57 (29%), Positives = 27/57 (47%), Gaps = 5/57 (8%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHG-----QYGLFPANYVSL 523
           A  +  + A+A++++S     ++   +    GWW G         Q G FPANYV L
Sbjct: 2   AQVIAPFTATAENQLSLAVGQLVLVRKKSPSGWWEGELQARGKKRQIGWFPANYVKL 58


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 46.0 bits (109), Expect = 3e-05
 Identities = 26/85 (30%), Positives = 47/85 (55%), Gaps = 4/85 (4%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
            Q   K  +  E+QR  +E+ + +EL++K+    EQ R+K+ ++  ++   QE+K++ E+
Sbjct: 67  QQQQQKSAKRAEEQRKKKEQQQAEELQQKQA--AEQERLKQLEK--ERLAAQEQKKQAEE 122

Query: 421 EENQKLLLKKQQEEDRLKAEEQARL 445
              Q  L +KQ EE   KA   A+ 
Sbjct: 123 AAKQAALKQKQAEEAAAKAAAAAKA 147



 Score = 36.7 bits (85), Expect = 0.023
 Identities = 20/82 (24%), Positives = 38/82 (46%), Gaps = 12/82 (14%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           + EE ++KQ   QE +K  +LE++    +EQ          KK+ E+  K+   +++  +
Sbjct: 88  QAEELQQKQAAEQERLK--QLEKERLAAQEQ----------KKQAEEAAKQAALKQKQAE 135

Query: 426 LLLKKQQEEDRLKAEEQARLLE 447
               K     + KAE +A+   
Sbjct: 136 EAAAKAAAAAKAKAEAEAKRAA 157



 Score = 33.6 bits (77), Expect = 0.22
 Identities = 25/82 (30%), Positives = 46/82 (56%), Gaps = 9/82 (10%)

Query: 377 VQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDR 436
           V E+  R++ ++K     E+ R K+EQ+      + EE ++K+  E ++L   KQ E++R
Sbjct: 60  VVEQYNRQQQQQKSAKRAEEQRKKKEQQ------QAEELQQKQAAEQERL---KQLEKER 110

Query: 437 LKAEEQARLLEQERLMEELRQQ 458
           L A+EQ +  E+      L+Q+
Sbjct: 111 LAAQEQKKQAEEAAKQAALKQK 132



 Score = 29.8 bits (67), Expect = 3.6
 Identities = 12/82 (14%), Positives = 36/82 (43%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
               +  E++  +++ +E +  +E +++ E   +Q  +K++Q      K     + K + 
Sbjct: 92  LQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEA 151

Query: 422 ENQKLLLKKQQEEDRLKAEEQA 443
           E ++     ++     K + +A
Sbjct: 152 EAKRAAAAAKKAAAEAKKKAEA 173



 Score = 29.0 bits (65), Expect = 6.1
 Identities = 36/168 (21%), Positives = 65/168 (38%), Gaps = 47/168 (27%)

Query: 279 QTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSEL---KIKEEEIEKKLNGHSDVP 335
           Q +++S KR+EE+++K++ +   +L+Q Q  E+ RL +L   ++  +E +K         
Sbjct: 68  QQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKK--------- 118

Query: 336 LSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKE 395
                                     Q     ++   KQ+  +E   +     K + E E
Sbjct: 119 --------------------------QAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAE 152

Query: 396 QIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQA 443
             R          KK   E ++K + E      KK   E + KAE +A
Sbjct: 153 AKRAAAA-----AKKAAAEAKKKAEAEAA----KKAAAEAKKKAEAEA 191



 Score = 29.0 bits (65), Expect = 6.7
 Identities = 34/167 (20%), Positives = 73/167 (43%), Gaps = 4/167 (2%)

Query: 277 AKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPL 336
           AK+ EE+ +K+ +++ E+ + K   + E+ ++LE+ RL+  + K++  E           
Sbjct: 74  AKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQ 133

Query: 337 SPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQ 396
           +   E       +   +  ++      + K+   + K++   E  K+   E K++ E E 
Sbjct: 134 A--EEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEA 191

Query: 397 IRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQA 443
                 +   KKK E E K++   E  +K   + +    +  AE +A
Sbjct: 192 AAKAAAEA--KKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKA 236


>gnl|CDD|212924 cd11991, SH3_Intersectin1_3, Third Src homology 3 domain (or SH3C)
           of Intersectin-1.  Intersectin-1 (ITSN1) is an adaptor
           protein that functions in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction. It
           plays a role in clathrin-coated pit (CCP) formation. It
           binds to many proteins through its multidomain structure
           and facilitate the assembly of multimeric complexes.
           ITSN1 localizes in membranous organelles, CCPs, the
           Golgi complex, and may be involved in the cell membrane
           trafficking system. It exists in alternatively spliced
           short and long isoforms. The short isoform contains two
           Eps15 homology domains (EH1 and EH2), a coiled-coil
           region and five SH3 domains (SH3A-E), while the long
           isoform, in addition, contains RhoGEF (also called
           Dbl-homologous or DH), Pleckstrin homology (PH) and C2
           domains. The third SH3 domain (or SH3C) of ITSN1 has
           been shown to bind many proteins including dynamin1/2,
           CIN85, c-Cbl, SHIP2, Reps1, synaptojanin-1, and WNK,
           among others. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 52

 Score = 41.1 bits (96), Expect = 3e-05
 Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 1/51 (1%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           VA+Y Y+++   +++F   D+I   +  D  WW G    + G+FP+NYV  
Sbjct: 3   VAMYTYESNEQGDLTFQQGDVILVTKK-DGDWWTGTVGDKTGVFPSNYVRP 52


>gnl|CDD|212725 cd11791, SH3_UBASH3, Src homology 3 domain of Ubiquitin-associated
           and SH3 domain-containing proteins, also called TULA (T
           cell Ubiquitin LigAnd) family of proteins.  UBASH3 or
           TULA proteins are also referred to as Suppressor of T
           cell receptor Signaling (STS) proteins. They contain an
           N-terminal UBA domain, a central SH3 domain, and a
           C-terminal histidine phosphatase domain. They bind c-Cbl
           through the SH3 domain and to ubiquitin via UBA. In some
           vertebrates, there are two TULA family proteins, called
           UBASH3A (also called TULA or STS-2) and UBASH3B (also
           called TULA-2 or STS-1), which show partly overlapping
           as well as distinct functions. UBASH3B is widely
           expressed while UBASH3A is only found in lymphoid cells.
           UBASH3A facilitates apoptosis induced in T cells through
           its interaction with the apoptosis-inducing factor AIF.
           UBASH3B is an active phosphatase while UBASH3A is not.
           SH3 domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 59

 Score = 41.1 bits (97), Expect = 3e-05
 Identities = 17/54 (31%), Positives = 22/54 (40%), Gaps = 6/54 (11%)

Query: 474 ALYDYQASADDEISFDPDDII----TNIEMIDEGWWRGYCH--GQYGLFPANYV 521
            LY Y    +DE+   P D I      ++   +GW  G     G  GL P NY 
Sbjct: 4   VLYPYTPQEEDELELVPGDYIYVSPEELDSSSDGWVEGTSWLTGCSGLLPENYT 57


>gnl|CDD|212911 cd11978, SH3_VAV3_2, C-terminal (or second) Src homology 3 domain
           of VAV3 protein.  VAV3 is ubiquitously expressed and
           functions as a phosphorylation-dependent guanine
           nucleotide exchange factor (GEF) for RhoA, RhoG, and
           Rac1. It has been implicated to function in the
           hematopoietic, bone, cerebellar, and cardiovascular
           systems. VAV3 is essential in axon guidance in neurons
           that control blood pressure and respiration. It is
           overexpressed in prostate cancer cells and it plays a
           role in regulating androgen receptor transcriptional
           activity. VAV proteins contain several domains that
           enable their function: N-terminal calponin homology
           (CH), acidic, RhoGEF (also called Dbl-homologous or DH),
           Pleckstrin Homology (PH), C1 (zinc finger), SH2, and two
           SH3 domains. The SH3 domain of VAV is involved in the
           localization of proteins to specific sites within the
           cell, by interacting with proline-rich sequences within
           target proteins. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 56

 Score = 41.2 bits (96), Expect = 3e-05
 Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 1/51 (1%)

Query: 472 AVALYDYQASADDEISFDPDDIIT-NIEMIDEGWWRGYCHGQYGLFPANYV 521
           A+A YD+ A    E+S    D++    +M   GWWRG  +G+ G FP+ YV
Sbjct: 3   AIARYDFCARDMRELSLLKGDVVKIYTKMSTNGWWRGEVNGRVGWFPSTYV 53


>gnl|CDD|212910 cd11977, SH3_VAV2_2, C-terminal (or second) Src homology 3 domain
           of VAV2 protein.  VAV2 is widely expressed and functions
           as a guanine nucleotide exchange factor (GEF) for RhoA,
           RhoB and RhoG and also activates Rac1 and Cdc42. It is
           implicated in many cellular and physiological functions
           including blood pressure control, eye development,
           neurite outgrowth and branching, EGFR endocytosis and
           degradation, and cell cluster morphology, among others.
           It has been reported to associate with Nek3. VAV
           proteins contain several domains that enable their
           function: N-terminal calponin homology (CH), acidic,
           RhoGEF (also called Dbl-homologous or DH), Pleckstrin
           Homology (PH), C1 (zinc finger), SH2, and two SH3
           domains. The SH3 domain of VAV is involved in the
           localization of proteins to specific sites within the
           cell, by interacting with proline-rich sequences within
           target proteins. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 58

 Score = 41.2 bits (96), Expect = 3e-05
 Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 2/53 (3%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMI--DEGWWRGYCHGQYGLFPANYV 521
           TAVA Y++ A    E+S    D++     I  D+GWW+G  +G+ G FP+ YV
Sbjct: 2   TAVARYNFAARDMRELSLREGDVVRIYSRIGGDQGWWKGETNGRIGWFPSTYV 54


>gnl|CDD|212731 cd11797, SH3_DNMBP_N4, Fourth N-terminal Src homology 3 domain of
           Dynamin Binding Protein, also called Tuba.  DNMBP or
           Tuba is a cdc42-specific guanine nucleotide exchange
           factor (GEF) that contains four N-terminal SH3 domains,
           a central RhoGEF [or Dbl homology (DH)] domain followed
           by a Bin/Amphiphysin/Rvs (BAR) domain, and two
           C-terminal SH3 domains. It provides a functional link
           between dynamin and key regulatory proteins of the actin
           cytoskeleton. It plays an important role in regulating
           cell junction configuration. The four N-terminal SH3
           domains of DNMBP bind the GTPase dynamin, which plays an
           important role in the fission of endocytic vesicles. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 50

 Score = 40.9 bits (96), Expect = 3e-05
 Identities = 18/46 (39%), Positives = 27/46 (58%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFP 517
            VALY +QA   +E+ F+  D I  I  +++GW  G   G+ G+FP
Sbjct: 2   GVALYRFQALEPNELDFEVGDRIRIIATLEDGWLEGELKGRRGIFP 47


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 45.4 bits (108), Expect = 4e-05
 Identities = 25/93 (26%), Positives = 49/93 (52%), Gaps = 10/93 (10%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKE- 419
           I+    + E  E ++ +  E +++E +  E  E+        QE++K+  E+ E ER++ 
Sbjct: 206 IEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSY------QEHVKQLIEKMEAEREKL 259

Query: 420 QEENQKLLLKKQQE-EDRLKA--EEQARLLEQE 449
             E +++L  K QE E+ LK   + +A  L++E
Sbjct: 260 LAEQERMLEHKLQEQEELLKEGFKTEAESLQKE 292



 Score = 29.2 bits (66), Expect = 5.3
 Identities = 23/87 (26%), Positives = 46/87 (52%), Gaps = 10/87 (11%)

Query: 379 EEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKE--QEENQKLLLKKQQEEDR 436
           EE+ ++ L  KE +E E I   ++    K+K  + E+ + E  + E + L  K+++EE  
Sbjct: 175 EEVLQEFLNSKEAVE-EAILQTDQALTAKEKAIEAERAKAEAAEAEQELLREKQKEEEQM 233

Query: 437 LKA-----EEQARLLEQERLMEELRQQ 458
           ++A     +E  + L ++  ME  R++
Sbjct: 234 MEAQERSYQEHVKQLIEK--MEAEREK 258



 Score = 28.4 bits (64), Expect = 8.6
 Identities = 17/72 (23%), Positives = 37/72 (51%), Gaps = 13/72 (18%)

Query: 366 KEEEEKEKQRMVQEEIKRKELER--KEELEKEQIRIKEEQENIKKKKEQEEK-------- 415
           +E++++E+Q M  +E   +E  +   E++E E+ ++  EQE + + K QE++        
Sbjct: 224 REKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQEQEELLKEGFK 283

Query: 416 ---ERKEQEENQ 424
              E  ++E   
Sbjct: 284 TEAESLQKEIQD 295


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 45.8 bits (109), Expect = 4e-05
 Identities = 43/230 (18%), Positives = 83/230 (36%), Gaps = 28/230 (12%)

Query: 237 SKVKDLIAANSTNASKENIKPKPNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRK 296
              +  + ++ T  SK        +   + S L  +          E+  ++     +  
Sbjct: 58  RDPQISVVSSVTKLSKGPKDEPKQVKIPRQSGLSEE--------VAEAEAKATSVAAEAT 109

Query: 297 LKDQIDLEQAQKLEERRLSELKIKEE-----EIEKKLNGHSDV-PLSPSTETPPVPVKSI 350
               I  E  + LEE     LK         E+    N   D        +     + S 
Sbjct: 110 TPKSIQ-ELVEALEELLEELLKETASDPVVQELVSIFNDLIDSIKEDNLKDDLESLIASA 168

Query: 351 LKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKK 410
            ++       +     EEEE+     ++  +K K  E   +LE+E +   E +E   +K+
Sbjct: 169 KEELDQLSKKLAELKAEEEEE-----LERALKEKREELLSKLEEELLARLESKEAALEKQ 223

Query: 411 EQEEKERKEQEENQKLLLKKQQEEDRLKAE-EQARLLEQERLMEELRQQG 459
            + E ER+++E  +K        E++L+ E E+     +++L  EL  Q 
Sbjct: 224 LRLEFEREKEELRKKY-------EEKLRQELERQAEAHEQKLKNELALQA 266



 Score = 30.0 bits (68), Expect = 3.0
 Identities = 38/161 (23%), Positives = 66/161 (40%), Gaps = 36/161 (22%)

Query: 296 KLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPT 355
           +LK + + E  + L+E+R   L   EEE+  +L                   ++ L++  
Sbjct: 181 ELKAEEEEELERALKEKREELLSKLEEELLARLESK----------------EAALEK-- 222

Query: 356 SDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEE----------LEKEQIRIKEEQEN 405
                 Q   + E EKE+ R   EE  R+ELER+ E            +     +E  + 
Sbjct: 223 ------QLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIELQREFNKE 276

Query: 406 IKKKKEQEEKERKEQEENQKLLLK--KQQEEDRLKAEEQAR 444
           IK+K E+E   R  +       LK  ++  + R +AE++  
Sbjct: 277 IKEKVEEERNGRLAKLAELNSRLKGLEKALDSRSEAEDENH 317


>gnl|CDD|212817 cd11884, SH3_MYO15, Src Homology 3 domain of Myosin XV.  This
           subfamily is composed of proteins with similarity to
           Myosin XVa. Myosin XVa is an unconventional myosin that
           is critical for the normal growth of mechanosensory
           stereocilia of inner ear hair cells. Mutations in the
           myosin XVa gene are associated with nonsyndromic hearing
           loss. Myosin XVa contains a unique N-terminal extension
           followed by a motor domain, light chain-binding IQ
           motifs, and a tail consisting of a pair of MyTH4-FERM
           tandems separated by a SH3 domain, and a PDZ domain. SH3
           domains bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs;
           they play a role in the regulation of enzymes by
           intramolecular interactions, changing the subcellular
           localization of signal pathway components and mediate
           multiprotein complex assemblies.
          Length = 56

 Score = 40.8 bits (96), Expect = 4e-05
 Identities = 17/53 (32%), Positives = 22/53 (41%), Gaps = 3/53 (5%)

Query: 472 AVALYDYQASADDEISFDPDDIIT---NIEMIDEGWWRGYCHGQYGLFPANYV 521
            VA+  Y       +SF   D+I        +D GW  G   G+ G FP  YV
Sbjct: 2   VVAVRAYITRDQTLLSFHKGDVIKLLPKEGPLDPGWLFGTLDGRSGAFPKEYV 54


>gnl|CDD|212789 cd11855, SH3_Sho1p, Src homology 3 domain of High osmolarity
           signaling protein Sho1p.  Sho1p (or Sho1), also called
           SSU81 (Suppressor of SUA8-1 mutation), is a yeast
           membrane protein that regulates adaptation to high salt
           conditions by activating the HOG (high-osmolarity
           glycerol) pathway. High salt concentrations lead to the
           localization to the membrane of the MAPKK Pbs2, which is
           then activated by the MAPKK Ste11 and in turn, activates
           the MAPK Hog1. Pbs2 is localized to the membrane though
           the interaction of its PxxP motif with the SH3 domain of
           Sho1p. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 40.5 bits (95), Expect = 6e-05
 Identities = 21/56 (37%), Positives = 32/56 (57%), Gaps = 4/56 (7%)

Query: 471 TAVALYDYQASADD--EISFDPDDIITNIEMIDEGWWRG-YCHGQYGLFPANYVSL 523
            A ALY Y AS DD  E+SF+  +I+  +      WW+    +G+ G+ P+NY+ L
Sbjct: 1   RARALYPYDASPDDPNELSFEKGEIL-EVSDTSGKWWQARKSNGETGICPSNYLQL 55


>gnl|CDD|212703 cd11769, SH3_CSK, Src Homology 3 domain of C-terminal Src kinase.
           CSK is a cytoplasmic (or nonreceptor) tyr kinase
           containing the Src homology domains, SH3 and SH2,
           N-terminal to the catalytic tyr kinase domain. They
           negatively regulate the activity of Src kinases that are
           anchored to the plasma membrane. To inhibit Src kinases,
           CSK is translocated to the membrane via binding to
           specific transmembrane proteins, G-proteins, or adaptor
           proteins near the membrane. CSK catalyzes the tyr
           phosphorylation of the regulatory C-terminal tail of Src
           kinases, resulting in their inactivation. It is
           expressed in a wide variety of tissues and plays a role,
           as a regulator of Src, in cell proliferation, survival,
           and differentiation, and consequently, in cancer
           development and progression. In addition, CSK also shows
           Src-independent functions. It is a critical component in
           G-protein signaling, and plays a role in cytoskeletal
           reorganization and cell migration. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 57

 Score = 40.4 bits (95), Expect = 6e-05
 Identities = 16/56 (28%), Positives = 32/56 (57%), Gaps = 2/56 (3%)

Query: 469 GYTAVALYDYQASADDEISFDPDDIITNIEMI-DEGWWRG-YCHGQYGLFPANYVS 522
           G   +A Y++  ++++++ F   DI+T + +  D  W++     G+ G+ PANYV 
Sbjct: 1   GTECIAKYNFNGASEEDLPFKKGDILTIVAVTKDPNWYKAKNKDGREGMIPANYVQ 56


>gnl|CDD|212740 cd11806, SH3_PRMT2, Src homology 3 domain of Protein arginine
           N-methyltransferase 2.  PRMT2, also called HRMT1L1,
           belongs to the arginine methyltransferase protein
           family. It functions as a coactivator to both estrogen
           receptor alpha (ER-alpha) and androgen receptor (AR),
           presumably through arginine methylation. The ER-alpha
           transcription factor is involved in cell proliferation,
           differentiation, morphogenesis, and apoptosis, and is
           also implicated in the development and progression of
           breast cancer. PRMT2 and its variants are upregulated in
           breast cancer cells and may be involved in modulating
           the ER-alpha signaling pathway during formation of
           breast cancer. PRMT2 also plays a role in regulating the
           function of E2F transcription factors, which are
           critical cell cycle regulators, by binding to the
           retinoblastoma gene product (RB). It contains an
           N-terminal SH3 domain and an AdoMet binding domain. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 40.5 bits (95), Expect = 7e-05
 Identities = 14/50 (28%), Positives = 26/50 (52%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVS 522
           VA+ D+ A+ D ++SF+  D +  +      WW    +G  G  PA+++ 
Sbjct: 3   VAIADFVATDDSQLSFESGDKLLVLRKPSVDWWWAEHNGCCGYIPASHLH 52


>gnl|CDD|212763 cd11829, SH3_GAS7, Src homology 3 domain of Growth Arrest Specific
           protein 7.  GAS7 is mainly expressed in the brain and is
           required for neurite outgrowth. It may also play a role
           in the protection and migration of embryonic stem cells.
           Treatment-related acute myeloid leukemia (AML) has been
           reported resulting from mixed-lineage leukemia
           (MLL)-GAS7 translocations as a complication of primary
           cancer treatment. GAS7 contains an N-terminal SH3
           domain, followed by a WW domain, and a central F-BAR
           domain. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 52

 Score = 40.2 bits (94), Expect = 7e-05
 Identities = 17/49 (34%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 474 ALYDYQASADDE-ISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
            LY +      + +SF+  ++I  ++  D GWW G   G  G FPA+YV
Sbjct: 4   TLYAFTGEQHQQGLSFEAGELIRVLQAPDGGWWEGEKDGLRGWFPASYV 52


>gnl|CDD|212852 cd11919, SH3_Sorbs1_1, First Src Homology 3 domain of Sorbin and
           SH3 domain containing 1 (Sorbs1), also called ponsin.
           Sorbs1 is also called ponsin, SH3P12, or CAP (c-Cbl
           associated protein). It is an adaptor protein containing
           one sorbin homology (SoHo) and three SH3 domains. It
           binds Cbl and plays a major role in regulating the
           insulin signaling pathway by enhancing insulin-induced
           phosphorylation of Cbl. Sorbs1, like vinexin, localizes
           at cell-ECM and cell-cell adhesion sites where it binds
           vinculin, paxillin, and afadin. It may function in the
           control of cell motility. Other interaction partners of
           Sorbs1 include c-Abl, Sos, flotillin, Grb4, ataxin-7,
           filamin C, among others. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 40.3 bits (94), Expect = 8e-05
 Identities = 18/52 (34%), Positives = 30/52 (57%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           A A +D++A    E+     DI+   + ID+ W+ G  HG+ G+FP +Y+ L
Sbjct: 3   ARAKFDFKAQTLKELPLQKGDIVYIYKQIDQNWYEGEHHGRVGIFPRSYIEL 54


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 45.1 bits (107), Expect = 8e-05
 Identities = 45/232 (19%), Positives = 100/232 (43%), Gaps = 12/232 (5%)

Query: 230 NRQVVTSSKVKDLIAANSTNASKENIKPKPNIGHVKPSNLRAKFENLAKQTEEESRKRSE 289
              +     +K+ +            + +     ++   L+ +   L  + +E      E
Sbjct: 518 IELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEE 577

Query: 290 EEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKS 349
               + + ++  +L +  K  +++L EL+ +  ++E+ L            E     ++S
Sbjct: 578 LRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELES 637

Query: 350 ILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKK 409
            L++           N + E +E  +   EE++ K  E + E+ +E  RI+ E++  +K 
Sbjct: 638 ELEK----------LNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKL 687

Query: 410 KEQEEKERKEQEENQKL--LLKKQQEEDRLKAEEQARLLEQERLMEELRQQG 459
           +E E+ E + ++  ++L  LLKK  E ++L  E ++R  E E L +EL +  
Sbjct: 688 EELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLE 739



 Score = 42.4 bits (100), Expect = 5e-04
 Identities = 43/193 (22%), Positives = 90/193 (46%), Gaps = 16/193 (8%)

Query: 271 AKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLE-ERRLSELKIKEEEIEKKLN 329
            + E L ++ EE+  +  E E+E  +L+++++  +A   E E  L +LK  EE +EK   
Sbjct: 277 RELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEE 336

Query: 330 GHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERK 389
               +                L +  ++   +     +E E+  + +  E+   K LER 
Sbjct: 337 KLEKLE----------SELEELAEEKNELAKLLEERLKELEERLEEL--EKELEKALERL 384

Query: 390 EELEKEQIRIKEEQENIKK---KKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLL 446
           ++LE+    +KEE   +     + ++E +E +++ E  +  L++ +EE +   E+  +L 
Sbjct: 385 KQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLE 444

Query: 447 EQERLMEELRQQG 459
            +E ++ EL   G
Sbjct: 445 SKELMIAELAGAG 457



 Score = 42.1 bits (99), Expect = 8e-04
 Identities = 44/198 (22%), Positives = 89/198 (44%), Gaps = 11/198 (5%)

Query: 269 LRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKL 328
           LR + E L K+  E   +  E  + +  LK++++ E+ +KLE       ++KE+   ++L
Sbjct: 499 LREEIEELEKELRELEEELIELLELEEALKEELE-EKLEKLENLLEELEELKEKLQLQQL 557

Query: 329 NGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELER 388
                       E     +K +L++       +  + KEE E+ ++R+ + + K KELE 
Sbjct: 558 KEELRQLEDRLQE-----LKELLEELR-----LLRTRKEELEELRERLKELKKKLKELEE 607

Query: 389 KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQ 448
           +    +E ++  E  E   + +E EE+   E E+       ++  +  L+  E+     +
Sbjct: 608 RLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELE 667

Query: 449 ERLMEELRQQGTNEDTEE 466
             +  EL++    E  EE
Sbjct: 668 AEIRRELQRIENEEQLEE 685



 Score = 39.4 bits (92), Expect = 0.005
 Identities = 34/193 (17%), Positives = 82/193 (42%), Gaps = 29/193 (15%)

Query: 270 RAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLN 329
             + E   +  EE   +  EE++   +LK ++   ++ +LE  ++ E +++E E      
Sbjct: 227 EEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELE------ 280

Query: 330 GHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERK 389
                            +   L++       ++   +E EE+ +      E   + LE+ 
Sbjct: 281 ----------------RLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKL 324

Query: 390 EELEKEQIRIKEEQENIKKKKE----QEEKERKEQEENQKLLLKKQQEEDRLKAEEQARL 445
           + LE+   +++E+ E ++ + E    ++ +  K  EE  K L   ++  + L+ E +  L
Sbjct: 325 KSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKEL---EERLEELEKELEKAL 381

Query: 446 LEQERLMEELRQQ 458
              ++L E +++ 
Sbjct: 382 ERLKQLEEAIQEL 394



 Score = 39.0 bits (91), Expect = 0.007
 Identities = 32/193 (16%), Positives = 84/193 (43%), Gaps = 11/193 (5%)

Query: 267 SNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLE-ERRLSELKIKEEEIE 325
                + + L +  +E   + +E      +++++++  + +  E ER L EL+ + +++E
Sbjct: 378 EKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLE 437

Query: 326 KKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKE 385
           +++N      L  +          +  Q             EE EKE   + + E++  E
Sbjct: 438 EQINQLESKELMIAELAGAGEKCPVCGQEL----------PEEHEKELLELYELELEELE 487

Query: 386 LERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARL 445
            E   E E+ ++R + E+   + ++ +EE     + E       +++ E      E+   
Sbjct: 488 EELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEE 547

Query: 446 LEQERLMEELRQQ 458
           L+++  +++L+++
Sbjct: 548 LKEKLQLQQLKEE 560



 Score = 35.9 bits (83), Expect = 0.052
 Identities = 42/206 (20%), Positives = 88/206 (42%), Gaps = 16/206 (7%)

Query: 272 KFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGH 331
           K+E L++  +E  ++  + + E+ + +    LE  + L E    ELK + +++E+     
Sbjct: 168 KYEKLSELLKEVIKEA-KAKIEELEGQLSELLEDIEDLLEALEEELK-ELKKLEEIQEEQ 225

Query: 332 SDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEE 391
            +  L    E         L++       +    +E+E  E+ +    EI+  ELE  + 
Sbjct: 226 EEEELEQEIE--------ALEER------LAELEEEKERLEELKARLLEIESLELEALKI 271

Query: 392 LEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERL 451
            E+E   ++   E +++K E+ E+  +E EE ++ L   +   + L+   +     +ERL
Sbjct: 272 REEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERL 331

Query: 452 MEELRQQGTNEDTEEDLGYTAVALYD 477
            +   +    E   E+L      L  
Sbjct: 332 EKLEEKLEKLESELEELAEEKNELAK 357



 Score = 35.5 bits (82), Expect = 0.084
 Identities = 33/181 (18%), Positives = 76/181 (41%), Gaps = 22/181 (12%)

Query: 288 SEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPV 347
             E++     + +++  + +   E+  +EL+ + EE+EK+L                   
Sbjct: 470 EHEKELLELYELELEELEEELSREKEEAELREEIEELEKELR------------------ 511

Query: 348 KSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIK 407
             + ++        +   +E EEK ++     E   +  E KE+L+ +Q++ +  Q   +
Sbjct: 512 -ELEEELIELLELEEALKEELEEKLEKLENLLE---ELEELKEKLQLQQLKEELRQLEDR 567

Query: 408 KKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEED 467
            ++ +E  E       +K  L++ +E  +   ++   L E+   +EEL Q     + E +
Sbjct: 568 LQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENE 627

Query: 468 L 468
           L
Sbjct: 628 L 628



 Score = 33.2 bits (76), Expect = 0.37
 Identities = 27/89 (30%), Positives = 49/89 (55%), Gaps = 2/89 (2%)

Query: 369 EEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLL 428
           E+ EK   + +E+ ++   + EELE +   + E+ E++ +  E+E KE K+ EE Q+   
Sbjct: 167 EKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQE 226

Query: 429 KKQQEEDRLKAEEQARLLEQERLMEELRQ 457
           +++ E++    EE  RL E E   E L +
Sbjct: 227 EEELEQEIEALEE--RLAELEEEKERLEE 253



 Score = 32.8 bits (75), Expect = 0.52
 Identities = 38/165 (23%), Positives = 76/165 (46%), Gaps = 5/165 (3%)

Query: 269 LRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKL 328
           LR + E L +  E     + + ++ + +L    +L Q+ +L E   +EL+  EEE+E +L
Sbjct: 581 LRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAE-NELEEAEEELESEL 639

Query: 329 NGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELER 388
              +            +       +     I  +    E EE+ ++++ + E   +ELE+
Sbjct: 640 EKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQ 699

Query: 389 K----EELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLK 429
                EEL K+   I++  E ++ +K + E+ +KE E+ +K L  
Sbjct: 700 LREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKALEL 744



 Score = 32.4 bits (74), Expect = 0.71
 Identities = 42/203 (20%), Positives = 84/203 (41%), Gaps = 21/203 (10%)

Query: 267 SNLRAKFENLAKQTEEESRKRSEEEKEKRKLKD-QIDLEQAQKLEERRLSELKIKEEEIE 325
             L   +E   ++ EEE  +  EE + + ++++ + +L + ++     L   +  +EE+E
Sbjct: 473 KELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELE 532

Query: 326 KKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKE 385
           +KL    ++                L++       +Q    +EE ++ +  +QE  +  E
Sbjct: 533 EKLEKLENL----------------LEELEELKEKLQLQQLKEELRQLEDRLQELKELLE 576

Query: 386 LERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARL 445
             R     KE++    E+    KKK +E +ER  Q E     L +  E    + E +   
Sbjct: 577 ELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEE----LLQSLELSEAENELEEAE 632

Query: 446 LEQERLMEELRQQGTNEDTEEDL 468
            E E  +E+L  Q   E+  +  
Sbjct: 633 EELESELEKLNLQAELEELLQAA 655



 Score = 29.0 bits (65), Expect = 8.2
 Identities = 40/170 (23%), Positives = 71/170 (41%), Gaps = 20/170 (11%)

Query: 271 AKFENLAKQTEEESRKRSEEEKEKR--KLKDQIDLEQAQK-LEERRLSELKIKEEEIEKK 327
           ++ E L +  E    +   EE E+      ++++L+   + L +  L EL+ K EE+E +
Sbjct: 610 SQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAE 669

Query: 328 LNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELE 387
           +                      ++        ++   + EEE E Q   + E   K+L 
Sbjct: 670 IRR----------------ELQRIENEEQLEEKLEELEQLEEELE-QLREELEELLKKLG 712

Query: 388 RKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRL 437
             E+L +E    K E E +KK+ E+ EK  +  EE ++ L K     D L
Sbjct: 713 EIEQLIEELESRKAELEELKKELEKLEKALELLEELREKLGKAGLRADIL 762


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
            found at the N terminus of SMC proteins. The SMC
            (structural maintenance of chromosomes) superfamily
            proteins have ATP-binding domains at the N- and
            C-termini, and two extended coiled-coil domains separated
            by a hinge in the middle. The eukaryotic SMC proteins
            form two kind of heterodimers: the SMC1/SMC3 and the
            SMC2/SMC4 types. These heterodimers constitute an
            essential part of higher order complexes, which are
            involved in chromatin and DNA dynamics. This family also
            includes the RecF and RecN proteins that are involved in
            DNA metabolism and recombination.
          Length = 1162

 Score = 45.0 bits (106), Expect = 9e-05
 Identities = 41/224 (18%), Positives = 86/224 (38%), Gaps = 9/224 (4%)

Query: 218  EKVDKHQSQKDDNRQVVTSSKVKDLIAANSTNASKENIKPKPNIGHVKPSNLRAKFENLA 277
            E+  K +++  +  Q++   + K         A +   + K      +      +     
Sbjct: 808  EEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKE 867

Query: 278  KQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLS 337
            +  +E   K  E E++K K + +   E+ ++ ++    E +      EK+      +   
Sbjct: 868  ELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEE 927

Query: 338  PSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQI 397
                         L    +D    +  NKEEEE+  +R+         L  KEEL    +
Sbjct: 928  AIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRL---------LLAKEELGNVNL 978

Query: 398  RIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEE 441
                E E  +++  ++E +++  EE +K LL++  EE   + +E
Sbjct: 979  MAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKE 1022



 Score = 43.8 bits (103), Expect = 2e-04
 Identities = 42/192 (21%), Positives = 86/192 (44%), Gaps = 13/192 (6%)

Query: 278 KQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLS 337
              +E   K   E  +   L+ Q ++++ ++  +    ELK  + E E+ L         
Sbjct: 677 LAEQELQEKAESELAKNEILRRQEEIKKKEQRIKE---ELKKLKLEKEELLADKVQEAQD 733

Query: 338 PSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQI 397
              E        +L+Q   +    +  ++ ++E+E++   +  +K KEL  +EE  ++  
Sbjct: 734 KINEEL-----KLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKL- 787

Query: 398 RIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARL--LEQERLMEEL 455
             K E+E  +K K QEE+ R  +EE ++     ++E+  ++ EE+ +   LE+  L  + 
Sbjct: 788 --KVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKE 845

Query: 456 RQQGTNEDTEED 467
            Q+      EE 
Sbjct: 846 EQKLEKLAEEEL 857



 Score = 42.6 bits (100), Expect = 5e-04
 Identities = 43/203 (21%), Positives = 86/203 (42%), Gaps = 9/203 (4%)

Query: 268 NLRAKFENLA-KQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEK 326
            L  + ENLA    + E  K  E + +++  K     +  +KLE    + L +   ++ +
Sbjct: 178 KLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNE 237

Query: 327 KLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQN-SNKEEEEKEKQRMVQEEIKRKE 385
           +        L    E          KQ       I     KE +E+EK++ +QEE + K 
Sbjct: 238 ERIDLLQELLRDEQEE-----IESSKQELEKEEEILAQVLKENKEEEKEKKLQEE-ELKL 291

Query: 386 LERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDR-LKAEEQAR 444
           L ++EE  K ++   E ++   ++K +E ++  ++ E +    K++ EE      E + +
Sbjct: 292 LAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIK 351

Query: 445 LLEQERLMEELRQQGTNEDTEED 467
              +E   E+L +     +  E+
Sbjct: 352 REAEEEEEEQLEKLQEKLEQLEE 374



 Score = 42.3 bits (99), Expect = 6e-04
 Identities = 52/213 (24%), Positives = 94/213 (44%), Gaps = 9/213 (4%)

Query: 263 HVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEE 322
            +    LR + E +    +E  +   EEE   + LK+  + E+ +KL+E  L  L  +EE
Sbjct: 241 DLLQELLRDEQEEIESSKQELEK---EEEILAQVLKENKEEEKEKKLQEEELKLLAKEEE 297

Query: 323 EIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIK 382
           E++ +L       L          +K   K+       ++   +E EE EK+    E  +
Sbjct: 298 ELKSELL-----KLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKR 352

Query: 383 RKELERKEELEKEQIR-IKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEE 441
             E E +E+LEK Q +  + E+E + KKK + E+     +  ++ L  K +EE   K   
Sbjct: 353 EAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLL 412

Query: 442 QARLLEQERLMEELRQQGTNEDTEEDLGYTAVA 474
           +    E++ L EE +++    +  E+   T   
Sbjct: 413 ELSEQEEDLLKEEKKEELKIVEELEESLETKQG 445



 Score = 42.3 bits (99), Expect = 6e-04
 Identities = 39/195 (20%), Positives = 84/195 (43%), Gaps = 7/195 (3%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKK 327
           NL         +   +  +    ++++     + +LE+ +++  + L E K +E+E + +
Sbjct: 226 NLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQ 285

Query: 328 LNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELE 387
                ++ L    E         L++   D         +E EKE +++ +E  K KE  
Sbjct: 286 ---EEELKLLAKEEEELKSELLKLERRKVDD----EEKLKESEKELKKLEKELKKEKEEI 338

Query: 388 RKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLE 447
            + E E +++ IK E E  ++++ ++ +E+ EQ E + L  KK + E    A +      
Sbjct: 339 EELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEEL 398

Query: 448 QERLMEELRQQGTNE 462
           + +  EE   +   E
Sbjct: 399 ELKNEEEKEAKLLLE 413



 Score = 41.5 bits (97), Expect = 0.001
 Identities = 40/195 (20%), Positives = 90/195 (46%), Gaps = 3/195 (1%)

Query: 265 KPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEI 324
                R + E  A  + E+ +K+   +K   + ++  +L     LEE +L ELK+KE+  
Sbjct: 151 MKPERRLEIEEEAAGSREKRKKKERLKKLIEETENLAELII--DLEELKLQELKLKEQAK 208

Query: 325 EKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDG-IPIQNSNKEEEEKEKQRMVQEEIKR 383
           +          L    E         L +   D    +    +EE E  KQ + +EE   
Sbjct: 209 KALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEIL 268

Query: 384 KELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQA 443
            ++ ++ + E+++ +++EE+  +  K+E+E K    + E +K+  +++ +E   + ++  
Sbjct: 269 AQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLE 328

Query: 444 RLLEQERLMEELRQQ 458
           + L++E+   E  ++
Sbjct: 329 KELKKEKEEIEELEK 343



 Score = 39.6 bits (92), Expect = 0.004
 Identities = 43/206 (20%), Positives = 85/206 (41%), Gaps = 18/206 (8%)

Query: 270 RAKFENLAKQTEEESRKRSEEEKEKRKL-KDQIDLEQAQKLEERRLSELKIKEEEIEKKL 328
           +   E    + + E  + +    +  KL +++IDL Q    +E+   E   +E E E+++
Sbjct: 208 KKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEI 267

Query: 329 NGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELER 388
                       +   +  + +               KEEEE + + +  E  K  + E+
Sbjct: 268 LAQVLKENKEEEKEKKLQEEELKL-----------LAKEEEELKSELLKLERRKVDDEEK 316

Query: 389 KEELEKEQIRIKEEQENIKKKKEQEEKE------RKEQEENQKLLLKKQQEEDRLKAEEQ 442
            +E EKE  ++++E +  K++ E+ EKE      ++E EE ++  L+K QE+     EE 
Sbjct: 317 LKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEEL 376

Query: 443 ARLLEQERLMEELRQQGTNEDTEEDL 468
               + E        +   E+ E   
Sbjct: 377 LAKKKLESERLSSAAKLKEEELELKN 402



 Score = 38.8 bits (90), Expect = 0.007
 Identities = 20/102 (19%), Positives = 45/102 (44%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           K ++++  +++++E     EL    E  K Q    +EQ     +  Q +++ + +EEN  
Sbjct: 169 KRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLL 228

Query: 426 LLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEED 467
            L   +  E+R+   ++    EQE +    ++    E+    
Sbjct: 229 YLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQ 270



 Score = 35.3 bits (81), Expect = 0.085
 Identities = 50/267 (18%), Positives = 107/267 (40%), Gaps = 19/267 (7%)

Query: 199 KFGIESDRQDKSAVGWDHVEKVDKHQSQKDDNRQVVTSSKVKDLIAANSTNASKENIKPK 258
           +        +K       V K +K + ++   ++       K+     S     E  K  
Sbjct: 253 EIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRK-- 310

Query: 259 PNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSE-L 317
                 K      + + L K+ ++E  +  E EKE ++L+ + + E+ ++ +  +L E L
Sbjct: 311 -VDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKL 369

Query: 318 KIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEK--QR 375
           +  EEE+  K    S+   S +        K    +  ++         E  E+E+   +
Sbjct: 370 EQLEEELLAKKKLESERLSSAAKL------KEEELELKNEEEKEAKLLLELSEQEEDLLK 423

Query: 376 MVQEEIKRKELERKEELEKEQIRIKEEQENIK-------KKKEQEEKERKEQEENQKLLL 428
             ++E  +   E +E LE +Q ++ EE+E ++       K K + +K     +E + + L
Sbjct: 424 EEKKEELKIVEELEESLETKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETKLVKL 483

Query: 429 KKQQEEDRLKAEEQARLLEQERLMEEL 455
            +Q E   L+ + +    ++ +  E L
Sbjct: 484 LEQLELLLLRQKLEEASQKESKAREGL 510



 Score = 30.3 bits (68), Expect = 3.1
 Identities = 21/109 (19%), Positives = 48/109 (44%), Gaps = 1/109 (0%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
           I     E   + ++       KRK+ ER ++L +E   + E   ++++ K QE K +++ 
Sbjct: 148 IAMMKPERRLEIEEEAAGSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQA 207

Query: 421 EENQKLLLKKQQEEDRLKAEEQARLL-EQERLMEELRQQGTNEDTEEDL 468
           ++  +    K++ E   +       L   E  ++ L++   +E  E + 
Sbjct: 208 KKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIES 256


>gnl|CDD|212920 cd11987, SH3_Intersectin1_1, First Src homology 3 domain (or SH3A)
           of Intersectin-1.  Intersectin-1 (ITSN1) is an adaptor
           protein that functions in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction. It
           plays a role in clathrin-coated pit (CCP) formation. It
           binds to many proteins through its multidomain structure
           and facilitate the assembly of multimeric complexes.
           ITSN1 localizes in membranous organelles, CCPs, the
           Golgi complex, and may be involved in the cell membrane
           trafficking system. It exists in alternatively spliced
           short and long isoforms. The short isoform contains two
           Eps15 homology domains (EH1 and EH2), a coiled-coil
           region and five SH3 domains (SH3A-E), while the long
           isoform, in addition, contains RhoGEF (also called
           Dbl-homologous or DH), Pleckstrin homology (PH) and C2
           domains. The first SH3 domain (or SH3A) of ITSN1 has
           been shown to bind many proteins including Sos1,
           dynamin1/2, CIN85, c-Cbl, PI3K-C2, SHIP2, N-WASP, and
           CdGAP, among others. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 40.0 bits (93), Expect = 1e-04
 Identities = 23/53 (43%), Positives = 30/53 (56%), Gaps = 10/53 (18%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDE------GWWRGYCHGQYGLFPANY 520
           ALY ++A + DEI+  P DI+    M+DE      GW  G   G+ G FPANY
Sbjct: 4   ALYPFEARSHDEITIQPGDIV----MVDESQTGEPGWLGGELKGKTGWFPANY 52


>gnl|CDD|212707 cd11773, SH3_Sla1p_1, First Src Homology 3 domain of the fungal
           endocytic adaptor protein Sla1p.  Sla1p facilitates
           endocytosis by playing a role as an adaptor protein in
           coupling components of the actin cytoskeleton to the
           endocytic machinery. It interacts with Abp1p, Las17p and
           Pan1p, which are activator proteins of actin-related
           protein 2/3 (Arp2/3). Sla1p contains multiple domains
           including three SH3 domains, a SAM (sterile alpha motif)
           domain, and a Sla1 homology domain 1 (SHD1), which binds
           to the NPFXD motif that is found in many integral
           membrane proteins such as the Golgi-localized
           Arf-binding protein Lsb5p and the P4-ATPases, Drs2p and
           Dnf1p. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 57

 Score = 39.7 bits (93), Expect = 1e-04
 Identities = 19/54 (35%), Positives = 27/54 (50%), Gaps = 7/54 (12%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWR-------GYCHGQYGLFPANY 520
           ALYDY+   +DE++   DDI+  +E  D+ WW+              GL PA Y
Sbjct: 4   ALYDYEPQTEDELTIQEDDILYLLEKSDDDWWKVKLKVNSSDDDEPVGLVPATY 57


>gnl|CDD|212986 cd12053, SH3_CD2AP_1, First Src Homology 3 domain (SH3A) of
           CD2-associated protein.  CD2AP, also called CMS (Cas
           ligand with Multiple SH3 domains) or METS1
           (Mesenchyme-to-Epithelium Transition protein with SH3
           domains), is a cytosolic adaptor protein that plays a
           role in regulating the cytoskeleton. It is critical in
           cell-to-cell union necessary for kidney function. It
           also stabilizes the contact between a T cell and
           antigen-presenting cells. It is primarily expressed in
           podocytes at the cytoplasmic face of the slit diaphragm
           and serves as a linker anchoring podocin and nephrin to
           the actin cytoskeleton. CD2AP contains three SH3
           domains, a proline-rich region, and a C-terminal
           coiled-coil domain. All of these domains enable CD2AP to
           bind various protein partners and assemble complexes
           that have been implicated in many different functions.
           This alignment model represents the first SH3 domain
           (SH3A) of CD2AP. SH3A binds to the PXXXPR motif present
           in c-Cbl and the cytoplasmic domain of cell adhesion
           protein CD2. Its interaction with CD2 anchors CD2 at
           sites of cell contact. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 56

 Score = 39.8 bits (93), Expect = 1e-04
 Identities = 19/47 (40%), Positives = 31/47 (65%), Gaps = 1/47 (2%)

Query: 476 YDYQASADDEISFDPDDIITNIEMIDE-GWWRGYCHGQYGLFPANYV 521
           YDY A  +DE++    +II N++ ++E GW  G  +G+ G+FP N+V
Sbjct: 6   YDYDAVHEDELTIRVGEIIRNVKKLEEEGWLEGELNGRRGMFPDNFV 52


>gnl|CDD|212880 cd11947, SH3_GRAP2_N, N-terminal Src homology 3 domain of
           GRB2-related adaptor protein 2.  GRAP2 is also called
           GADS (GRB2-related adapter downstream of Shc), GrpL,
           GRB2L, Mona, or GRID (Grb2-related protein with insert
           domain). It is expressed specifically in the
           hematopoietic system. It plays an important role in T
           cell receptor (TCR) signaling by promoting the formation
           of the SLP-76:LAT complex, which couples the TCR to the
           Ras pathway. It also have roles in antigen-receptor and
           tyrosine kinase mediated signaling. GRAP2 is unique from
           other GRB2-like adaptor proteins in that it can be
           regulated by caspase cleavage. It contains an N-terminal
           SH3 domain, a central SH2 domain, and a C-terminal SH3
           domain. The N-terminal SH3 domain of the related protein
           GRB2 binds to Sos and Sos-derived proline-rich peptides.
           SH3 domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 52

 Score = 39.4 bits (92), Expect = 1e-04
 Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           A   +D+ AS +DE+SF   D++  I   D+ W++   +G+ G  P N+V
Sbjct: 2   ARGKFDFTASGEDELSFKKGDVL-KILSSDDIWFKAELNGEEGYVPKNFV 50


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 44.7 bits (106), Expect = 1e-04
 Identities = 51/195 (26%), Positives = 92/195 (47%), Gaps = 11/195 (5%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEER-RLSELKIKEEEIEK 326
           NL  + E L  Q EE   K  E  +E  +L+++++  + +       L EL+ + EE+E 
Sbjct: 313 NLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELES 372

Query: 327 KLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKEL 386
           +L    +   +  ++     V  +  Q  S    I+      E  E +R   ++   + L
Sbjct: 373 RLEELEEQLETLRSK-----VAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELL 427

Query: 387 ERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ--EENQKLLLKKQQEEDRLKAEE--- 441
           ++ EE E ++++ + E+   + ++ QEE ER E+  EE ++ L + +Q  D  + E    
Sbjct: 428 KKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQL 487

Query: 442 QARLLEQERLMEELR 456
           QARL   ERL E L 
Sbjct: 488 QARLDSLERLQENLE 502



 Score = 44.3 bits (105), Expect = 1e-04
 Identities = 40/217 (18%), Positives = 83/217 (38%), Gaps = 29/217 (13%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLE-ERRLSELKIKEEEIEK 326
            L  K E L ++  E  +  +E  KE  +L+++++  + +  E  R++S L+     +E 
Sbjct: 681 ELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEA 740

Query: 327 KLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKEL 386
           ++    +                           I   +KE  E E +    EE   +  
Sbjct: 741 EVEQLEE--------------------------RIAQLSKELTELEAEIEELEERLEEAE 774

Query: 387 ERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLL 446
           E   E E E   ++ + E +K++ +   +   E      LL ++           + R+ 
Sbjct: 775 EELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIA 834

Query: 447 EQERLMEELRQQGTNEDTEEDLGYTAVALYDYQASAD 483
             ER +E+L +Q   E+  ED+   A  + + +   +
Sbjct: 835 ATERRLEDLEEQ--IEELSEDIESLAAEIEELEELIE 869



 Score = 43.5 bits (103), Expect = 3e-04
 Identities = 45/206 (21%), Positives = 88/206 (42%), Gaps = 39/206 (18%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLE-ERRLSELKIKEEEIEK 326
            L  +   L +Q +    + +  E++  +L+ Q++  +++  E    L+EL+ K EE+++
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKE 351

Query: 327 KLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKEL 386
           +L                                ++   +E E + ++   + E   ++L
Sbjct: 352 ELES------------------------------LEAELEELEAELEELESRLEELEEQL 381

Query: 387 ERKE----ELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQ 442
           E       +LE +   +  E E ++ + E+ E  R+  ++  + LLKK +E +    E Q
Sbjct: 382 ETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAE--LKELQ 439

Query: 443 ARLLEQERLMEELRQQGTNEDTEEDL 468
           A L E E  +EEL  Q   E  EE L
Sbjct: 440 AELEELEEELEEL--QEELERLEEAL 463



 Score = 42.4 bits (100), Expect = 6e-04
 Identities = 41/196 (20%), Positives = 86/196 (43%), Gaps = 12/196 (6%)

Query: 267 SNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEK 326
           + L A+ E L ++  + S++ +E E E  +L+++  LE+A++      +E++  E +IE+
Sbjct: 736 ARLEAEVEQLEERIAQLSKELTELEAEIEELEER--LEEAEEELAEAEAEIEELEAQIEQ 793

Query: 327 KLNGHSDVPLSPSTETPPVPVKSILKQP----TSDGIPIQNSNKEEEEKEKQRMVQEEIK 382
                       +       +++ L        +    +++  +     E++    EE  
Sbjct: 794 LKEELK------ALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQI 847

Query: 383 RKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQ 442
            +  E  E L  E   ++E  E ++ + E    ER   EE   LL  + +E      E +
Sbjct: 848 EELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELE 907

Query: 443 ARLLEQERLMEELRQQ 458
           ++  E  R +EELR++
Sbjct: 908 SKRSELRRELEELREK 923



 Score = 41.6 bits (98), Expect = 0.001
 Identities = 40/190 (21%), Positives = 76/190 (40%), Gaps = 46/190 (24%)

Query: 269 LRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKL 328
           LR + E L ++ +E   +  E   E ++L  +  LE+        +SEL+ + EE++K+L
Sbjct: 237 LREELEELQEELKEAEEELEELTAELQEL--EEKLEEL----RLEVSELEEEIEELQKEL 290

Query: 329 NGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELER 388
                                                 E    E+Q+ +  E       R
Sbjct: 291 ---------------------------------YALANEISRLEQQKQILRE-------R 310

Query: 389 KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQ 448
              LE++   ++ + E ++ K ++  +E  E EE  + L ++ +  +    E +A L E 
Sbjct: 311 LANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEEL 370

Query: 449 ERLMEELRQQ 458
           E  +EEL +Q
Sbjct: 371 ESRLEELEEQ 380



 Score = 39.3 bits (92), Expect = 0.005
 Identities = 43/205 (20%), Positives = 84/205 (40%), Gaps = 21/205 (10%)

Query: 267 SNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQID-LEQAQKLEERRLSELKIKEEEIE 325
             L  + E L K+ EE SR+ S   K+  +L+ +++ LE+      + L+EL+ + EE+E
Sbjct: 708 EELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELE 767

Query: 326 KKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQE------ 379
           ++L    +       E      +    +   + +  +     E   E +  +        
Sbjct: 768 ERLEEAEE-------ELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAA 820

Query: 380 --EIKRKELERKEELEKEQI-----RIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQ 432
               + + LER+    + ++     +I+E  E+I+    + E+  +  EE +  L     
Sbjct: 821 NLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLN 880

Query: 433 EEDRLKAEEQARLLEQERLMEELRQ 457
           E   L+        E E L EELR+
Sbjct: 881 ERASLEEALALLRSELEELSEELRE 905



 Score = 36.6 bits (85), Expect = 0.042
 Identities = 42/233 (18%), Positives = 87/233 (37%), Gaps = 28/233 (12%)

Query: 267 SNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQID--LEQAQKLEER------RLSELK 318
           + L A+ E L ++ EE   + +E E E  +L+ QI+   E+ + L E        L+ L 
Sbjct: 757 TELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLN 816

Query: 319 IKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQ 378
            +   + ++L          +TE     ++  +++             E+ E     + +
Sbjct: 817 EEAANLRERLESLERR--IAATERRLEDLEEQIEE-----------LSEDIESLAAEIEE 863

Query: 379 EEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLK 438
            E   +ELE   ELE         +E +   + + E+  +E  E +    + ++E + L+
Sbjct: 864 LEELIEELE--SELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELR 921

Query: 439 AE-EQARLLEQERLMEELRQQGT----NEDTEEDLGYTAVALYDYQASADDEI 486
            +  Q  L  +   +     Q         T E+       + D +  A   +
Sbjct: 922 EKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRL 974


>gnl|CDD|212932 cd11999, SH3_PACSIN_like, Src homology 3 domain of an unknown
           subfamily of proteins with similarity to Protein kinase
           C and Casein kinase Substrate in Neurons (PACSIN)
           proteins.  PACSINs, also called Synaptic
           dynamin-associated proteins (Syndapins), act as
           regulators of cytoskeletal and membrane dynamics. They
           bind both dynamin and Wiskott-Aldrich syndrome protein
           (WASP), and may provide direct links between the actin
           cytoskeletal machinery through WASP and
           dynamin-dependent endocytosis. Vetebrates harbor three
           isoforms with distinct expression patterns and specific
           functions. PACSINs contain an N-terminal F-BAR domain
           and a C-terminal SH3 domain. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 56

 Score = 39.5 bits (92), Expect = 1e-04
 Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDE-GWWRGYCH-GQYGLFPANYV 521
           A+YDY     DE+SF   + +  +E  DE GW +G    G  GL+PANYV
Sbjct: 6   AVYDYTGQEPDELSFKAGEELLKVEDEDEQGWCKGVTDGGAVGLYPANYV 55


>gnl|CDD|212940 cd12007, SH3_Yes, Src homology 3 domain of Yes Protein Tyrosine
           Kinase.  Yes (or c-Yes) is a member of the Src subfamily
           of proteins, which are cytoplasmic (or non-receptor)
           PTKs. c-Yes kinase is the cellular homolog of the
           oncogenic protein (v-Yes) encoded by the Yamaguchi 73
           and Esh sarcoma viruses. It displays functional overlap
           with other Src subfamily members, particularly Src. It
           also shows some unique functions such as binding to
           occludins, transmembrane proteins that regulate
           extracellular interactions in tight junctions. Yes also
           associates with a number of proteins in different cell
           types that Src does not interact with, like JAK2 and
           gp130 in pre-adipocytes, and Pyk2 in treated pulmonary
           vein endothelial cells. Although the biological function
           of Yes remains unclear, it appears to have a role in
           regulating cell-cell interactions and vesicle
           trafficking in polarized cells. Src kinases contain an
           N-terminal SH4 domain with a myristoylation site,
           followed by SH3 and SH2 domains, a tyr kinase domain,
           and a regulatory C-terminal region containing a
           conserved tyr. They are activated by autophosphorylation
           at the tyr kinase domain, but are negatively regulated
           by phosphorylation at the C-terminal tyr by Csk
           (C-terminal Src Kinase). The SH3 domain of Src kinases
           contributes to substrate recruitment by binding adaptor
           proteins/substrates, and regulation of kinase activity
           through an intramolecular interaction. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 58

 Score = 39.6 bits (92), Expect = 1e-04
 Identities = 19/52 (36%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWW--RGYCHGQYGLFPANYVS 522
           VALYDY+A   +++SF   +    I   +  WW  R    G+ G  P+NYV+
Sbjct: 4   VALYDYEARTTEDLSFKKGERFQIINNTEGDWWEARSIATGKNGYIPSNYVA 55


>gnl|CDD|212879 cd11946, SH3_GRB2_N, N-terminal Src homology 3 domain of Growth
           factor receptor-bound protein 2.  GRB2 is a critical
           signaling molecule that regulates the Ras pathway by
           linking tyrosine kinases to the Ras guanine nucleotide
           releasing protein Sos (son of sevenless), which converts
           Ras to the active GTP-bound state. It is ubiquitously
           expressed in all tissues throughout development and is
           important in cell cycle progression, motility,
           morphogenesis, and angiogenesis. In lymphocytes, GRB2 is
           associated with antigen receptor signaling components.
           GRB2 contains an N-terminal SH3 domain, a central SH2
           domain, and a C-terminal SH3 domain. Its N-terminal SH3
           domain binds to Sos and Sos-derived proline-rich
           peptides. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 56

 Score = 39.2 bits (91), Expect = 2e-04
 Identities = 21/54 (38%), Positives = 36/54 (66%), Gaps = 1/54 (1%)

Query: 472 AVALYDYQASADDEISFDPDDIITNI-EMIDEGWWRGYCHGQYGLFPANYVSLQ 524
           A+A YD++A+ADDE+SF   DI+  + E  D+ W++   +G+ G  P NY+ ++
Sbjct: 3   AIAKYDFKATADDELSFKRGDILKVLNEECDQNWYKAELNGKDGFIPKNYIEMK 56


>gnl|CDD|212862 cd11929, SH3_SH3RF2_1, First Src Homology 3 domain of SH3 domain
           containing ring finger 2.  SH3RF2 is also called POSHER
           (POSH-eliminating RING protein) or HEPP1 (heart protein
           phosphatase 1-binding protein). It acts as an
           anti-apoptotic regulator of the JNK pathway by binding
           to and promoting the degradation of SH3RF1 (or POSH), a
           scaffold protein that is required for pro-apoptotic JNK
           activation. It may also play a role in cardiac functions
           together with protein phosphatase 1. SH3RF2 contains an
           N-terminal RING finger domain and three SH3 domains.
           This model represents the first SH3 domain, located at
           the N-terminal half, of SH3RF2. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 54

 Score = 39.2 bits (91), Expect = 2e-04
 Identities = 17/52 (32%), Positives = 29/52 (55%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           A AL +Y+     ++ F+  D+I     +DE W+ G  +G  G+FPA+ V +
Sbjct: 3   AKALCNYRGHNPGDLKFNKGDVILLRRQLDENWYLGEINGVSGIFPASSVEV 54


>gnl|CDD|212945 cd12012, SH3_RIM-BP_2, Second Src homology 3 domain of
           Rab3-interacting molecules (RIMs) binding proteins.
           RIMs binding proteins (RBPs, RIM-BPs) associate with
           calcium channels present in photoreceptors, neurons, and
           hair cells; they interact simultaneously with specific
           calcium channel subunits, and active zone proteins, RIM1
           and RIM2. RIMs are part of the matrix at the presynaptic
           active zone and are associated with synaptic vesicles
           through their interaction with the small GTPase Rab3.
           RIM-BPs play a role in regulating synaptic transmission
           by serving as adaptors and linking calcium channels with
           the synaptic vesicle release machinery. RIM-BPs contain
           three SH3 domains and two to three fibronectin III
           repeats. Invertebrates contain one, while vertebrates
           contain at least two RIM-BPs, RIM-BP1 and RIM-BP2.
           RIM-BP1 is also called peripheral-type benzodiazapine
           receptor associated protein 1 (PRAX-1). Mammals contain
           a third protein, RIM-BP3. RIM-BP1 and RIM-BP2 are
           predominantly expressed in the brain where they display
           overlapping but distinct expression patterns, while
           RIM-BP3 is almost exclusively expressed in the testis
           and is essential in spermiogenesis. The SH3 domains of
           RIM-BPs bind to the PxxP motifs of RIM1, RIM2, and
           L-type (alpha1D) and N-type (alpha1B) calcium channel
           subunits. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity. They play versatile and diverse roles
           in the cell including the regulation of enzymes,
           changing the subcellular localization of signaling
           pathway components, and mediating the formation of
           multiprotein complex assemblies.
          Length = 62

 Score = 39.2 bits (92), Expect = 2e-04
 Identities = 19/59 (32%), Positives = 29/59 (49%), Gaps = 9/59 (15%)

Query: 473 VALYDY--------QASADDEISFDPDDIITNIEMIDE-GWWRGYCHGQYGLFPANYVS 522
           VAL+DY          +A++E+ F    +I      D  G++ G  +G+ GL P N VS
Sbjct: 3   VALFDYDPLTMSPNPDAAEEELPFKEGQLIKVYGDKDADGFYLGEINGRRGLVPCNMVS 61


>gnl|CDD|212845 cd11912, SH3_Bzz1_1, First Src Homology 3 domain of Bzz1 and
           similar domains.  Bzz1 (or Bzz1p) is a
           WASP/Las17-interacting protein involved in endocytosis
           and trafficking to the vacuole. It physically interacts
           with type I myosins and functions in the early steps of
           endocytosis. Together with other proteins, it induces
           membrane scission in yeast. Bzz1 contains an N-terminal
           F-BAR (FES-CIP4 Homology and Bin/Amphiphysin/Rvs), a
           central coiled-coil, and two C-terminal SH3 domains.
           This model represents the first C-terminal SH3 domain.
           SH3 domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 39.1 bits (92), Expect = 2e-04
 Identities = 23/57 (40%), Positives = 32/57 (56%), Gaps = 6/57 (10%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMID-EGWWR---GYCHGQYGLFPANYVSL 523
           TA  LYDY AS DDE+S    + +T +E  D  GW +   G   G+ GL P +Y+ +
Sbjct: 1   TAKVLYDYTASGDDEVSISEGEEVTVLEPDDGSGWTKVRNGS--GEEGLVPTSYIEI 55


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 42.0 bits (99), Expect = 2e-04
 Identities = 28/66 (42%), Positives = 44/66 (66%)

Query: 379 EEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLK 438
              KR +LE K+   +++   +EE+E  KK +E+ E ERKE+EE ++   KK++EE+R +
Sbjct: 4   GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKE 63

Query: 439 AEEQAR 444
            EEQAR
Sbjct: 64  REEQAR 69



 Score = 41.2 bits (97), Expect = 4e-04
 Identities = 33/91 (36%), Positives = 58/91 (63%), Gaps = 7/91 (7%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
           +   + EEE+ ++R   EE +  E + +EELE+E+ + KEE+E    +KE+EE+ RKEQE
Sbjct: 18  RQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE----RKEREEQARKEQE 73

Query: 422 ENQKLLLK---KQQEEDRLKAEEQARLLEQE 449
           E +KL      +++  D+L A+E++  L ++
Sbjct: 74  EYEKLKSSFVVEEEGTDKLSADEESNELLED 104


>gnl|CDD|212798 cd11864, SH3_PEX13_eumet, Src Homology 3 domain of eumetazoan
           Peroxisomal biogenesis factor 13.  PEX13 is a peroxin
           and is required for protein import into the peroxisomal
           matrix and membrane. It is an integral membrane protein
           that is essential for the localization of PEX14 and the
           import of proteins containing the peroxisome matrix
           targeting signals, PTS1 and PTS2. Mutations of the PEX13
           gene in humans lead to a wide range of peroxisome
           biogenesis disorders (PBDs), the most severe of which is
           known as Zellweger syndrome (ZS), a severe multisystem
           disorder characterized by hypotonia, psychomotor
           retardation, and neuronal migration defects. PEX13
           contains two transmembrane regions and a C-terminal SH3
           domain. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 58

 Score = 38.8 bits (91), Expect = 2e-04
 Identities = 21/56 (37%), Positives = 28/56 (50%), Gaps = 5/56 (8%)

Query: 471 TAVALYDYQASADDEISFDPDDIIT----NIEMIDEGWWRGYCHGQ-YGLFPANYV 521
            A A YD+ A ++DE+SF   D +      ++    GW      GQ  GL PANYV
Sbjct: 1   VARAEYDFVAESEDELSFRAGDKLRLAPKELQPRVRGWLLATVDGQKIGLVPANYV 56


>gnl|CDD|212930 cd11997, SH3_PACSIN3, Src homology 3 domain of Protein kinase C and
           Casein kinase Substrate in Neurons 3 (PACSIN3).  PACSIN
           3 or Syndapin III (Synaptic dynamin-associated protein
           III) is expressed ubiquitously and regulates glucose
           uptake in adipocytes through its role in GLUT1
           trafficking. It also modulates the subcellular
           localization and stimulus-specific function of the
           cation channel TRPV4. PACSINs act as regulators of
           cytoskeletal and membrane dynamics. Vetebrates harbor
           three isoforms with distinct expression patterns and
           specific functions. PACSINs contain an N-terminal F-BAR
           domain and a C-terminal SH3 domain. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 56

 Score = 38.8 bits (90), Expect = 3e-04
 Identities = 23/50 (46%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDE-GWWRG-YCHGQYGLFPANYV 521
           ALYDY     DE+SF   + +  I   DE GW +G    G+ GL+PANYV
Sbjct: 6   ALYDYTGQEADELSFKAGEELLKIGEEDEQGWCKGRLLSGRIGLYPANYV 55


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 43.4 bits (103), Expect = 3e-04
 Identities = 43/246 (17%), Positives = 83/246 (33%), Gaps = 52/246 (21%)

Query: 236 SSKVKDLIAANSTNASKENIKPKPNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKR 295
            +K+ + +    +   K N   +               E L K  EEE     + EKE +
Sbjct: 52  LTKLSEALDKLRSYLPKLNPLREEKKKVSV-----KSLEELIKDVEEEL---EKIEKEIK 103

Query: 296 KLKDQIDLEQAQKLEERRLSELKIKEEEIE---------KKLNGHSDV----------PL 336
           +L+++I      +LE   + EL+ + E +E           L G   V           L
Sbjct: 104 ELEEEI-----SELENE-IKELEQEIERLEPWGNFDLDLSLLLGFKYVSVFVGTVPEDKL 157

Query: 337 SPSTETPPVPVKSILKQ-PTSDGIPIQNSNKEEEEKEKQ--------------RMVQEEI 381
                   V     +        + +    +  +E E++                  E I
Sbjct: 158 EELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGTPSELI 217

Query: 382 KRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEE 441
           +  + E  EE+EKE+  + EE + + KK  +E     E  E +    ++ +   +    +
Sbjct: 218 REIK-EELEEIEKERESLLEELKELAKKYLEELLALYEYLEIEL---ERAEALSKFLKTD 273

Query: 442 QARLLE 447
           +   +E
Sbjct: 274 KTFAIE 279


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 43.2 bits (102), Expect = 3e-04
 Identities = 28/112 (25%), Positives = 59/112 (52%), Gaps = 4/112 (3%)

Query: 352 KQPTSDGIPIQNSNKEEEEKEKQRMVQE-EIKRKELERK-EELEKEQIRIKEEQENIKKK 409
           K+ T +    + +  E+  K+ +  V+  E +  EL+R+ EEL++E  +++ E E  +++
Sbjct: 406 KEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRRE 465

Query: 410 KEQEEKERKEQEENQKLL--LKKQQEEDRLKAEEQARLLEQERLMEELRQQG 459
              + ++ +E     + +  L+K+ EE + + EE  R L + R M +L   G
Sbjct: 466 VRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELSG 517



 Score = 34.7 bits (80), Expect = 0.15
 Identities = 36/196 (18%), Positives = 81/196 (41%), Gaps = 23/196 (11%)

Query: 269 LRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKL 328
            + K E + ++  E      + E+ K  +     L +A       LS++K +E   EK+ 
Sbjct: 357 YKPKLEKVERKLPELG-IWKDVERIKALVIRGYPLAEA-------LSKVKEEERPREKEG 408

Query: 329 NGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKR----K 384
               +       E     ++  +++        +NS  + E +E +R +++        +
Sbjct: 409 TEEEERREITVYEKRIKKLEETVERLE-----EENSELKRELEELKREIEKLESELERFR 463

Query: 385 ELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDR------LK 438
              R +  +  +IR ++ +    +K+ +E+K+R E+ E +   L+K ++ +       +K
Sbjct: 464 REVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVK 523

Query: 439 AEEQARLLEQERLMEE 454
             E+  L   E   EE
Sbjct: 524 VVEKLTLEAIEEAEEE 539



 Score = 32.0 bits (73), Expect = 0.83
 Identities = 23/93 (24%), Positives = 52/93 (55%), Gaps = 1/93 (1%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           +  E++  +   + EI   E +R ++LE+   R++EE   +K++ E+ ++E ++ E   +
Sbjct: 402 RPREKEGTEEEERREITVYE-KRIKKLEETVERLEEENSELKRELEELKREIEKLESELE 460

Query: 426 LLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
              ++ +++ R   E +AR    ERL +EL ++
Sbjct: 461 RFRREVRDKVRKDREIRARDRRIERLEKELEEK 493



 Score = 31.2 bits (71), Expect = 1.6
 Identities = 39/165 (23%), Positives = 71/165 (43%), Gaps = 14/165 (8%)

Query: 267 SNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQK------LEERRLSELKIK 320
                + E L K+ EE+ ++  E E++  +L+    LE + K      +E+  L  ++  
Sbjct: 477 RARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLTLEAIEEA 536

Query: 321 EEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEE 380
           EEE      G   +   PS        + I K+P +     + S+   EE  K  +    
Sbjct: 537 EEEY-GIKEGDVILVEDPSGGGARTAEELIDKKPRAIIRGEEMSHAAAEEFFKNEIPVLP 595

Query: 381 IKRKELERKEEL---EKEQIRIKEEQENIKKKKEQEEKERKEQEE 422
               ++ R +E    + E++R   E    + KK  EE+ER+++EE
Sbjct: 596 EGDVQIIRLDEFAVVDSEELRRAIE----EWKKRFEERERRQKEE 636



 Score = 30.1 bits (68), Expect = 3.9
 Identities = 26/83 (31%), Positives = 43/83 (51%), Gaps = 9/83 (10%)

Query: 375 RMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEE 434
             V+EE + +E E  EE E+ +I + E++      K+ EE   + +EEN +L    ++E 
Sbjct: 395 SKVKEEERPREKEGTEEEERREITVYEKRI-----KKLEETVERLEEENSEL----KREL 445

Query: 435 DRLKAEEQARLLEQERLMEELRQ 457
           + LK E +    E ER   E+R 
Sbjct: 446 EELKREIEKLESELERFRREVRD 468



 Score = 29.3 bits (66), Expect = 6.9
 Identities = 24/101 (23%), Positives = 48/101 (47%), Gaps = 13/101 (12%)

Query: 371 KEKQRMVQEEIKRKELERKEELEK----------EQIRIKEEQENIKKKKEQEEKERKEQ 420
           K K   V+ ++    + +  E  K          E +   +E+E  ++K+  EE+ER+E 
Sbjct: 358 KPKLEKVERKLPELGIWKDVERIKALVIRGYPLAEALSKVKEEERPREKEGTEEEERREI 417

Query: 421 EENQKLLLKKQQEEDRLKAEE---QARLLEQERLMEELRQQ 458
              +K + K ++  +RL+ E    +  L E +R +E+L  +
Sbjct: 418 TVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESE 458


>gnl|CDD|212816 cd11883, SH3_Sdc25, Src Homology 3 domain of Sdc25/Cdc25 guanine
           nucleotide exchange factors.  This subfamily is composed
           of the Saccharomyces cerevisiae guanine nucleotide
           exchange factors (GEFs) Sdc25 and Cdc25, and similar
           proteins. These GEFs regulate Ras by stimulating the
           GDP/GTP exchange on Ras. Cdc25 is involved in the
           Ras/PKA pathway that plays an important role in the
           regulation of metabolism, stress responses, and
           proliferation, depending on available nutrients and
           conditions. Proteins in this subfamily contain an
           N-terminal SH3 domain as well as REM (Ras exchanger
           motif) and RasGEF domains at the C-terminus. SH3 domains
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs; they play a
           role in the regulation of enzymes by intramolecular
           interactions, changing the subcellular localization of
           signal pathway components and mediate multiprotein
           complex assemblies.
          Length = 55

 Score = 38.4 bits (90), Expect = 3e-04
 Identities = 19/55 (34%), Positives = 26/55 (47%), Gaps = 5/55 (9%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRG--YCHGQY---GLFPANY 520
             VALYD+   + +++SF   DII  +     GWW G           G FP+NY
Sbjct: 1   VVVALYDFTPKSKNQLSFKAGDIIYVLNKDPSGWWDGVIISSSGKVKRGWFPSNY 55


>gnl|CDD|212827 cd11894, SH3_FCHSD2_2, Second Src Homology 3 domain of FCH and
           double SH3 domains protein 2.  FCHSD2 has a domain
           structure consisting of an N-terminal F-BAR (FES-CIP4
           Homology and Bin/Amphiphysin/Rvs), two SH3, and
           C-terminal proline-rich domains. It has only been
           characterized in silico and its function is unknown. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 56

 Score = 38.4 bits (89), Expect = 4e-04
 Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 3/48 (6%)

Query: 474 ALYDYQASADDEISFDPDDIITNIE---MIDEGWWRGYCHGQYGLFPA 518
           ALYDY+   DDE+SF    II  +      D+G+W G  +G+ G+FP+
Sbjct: 4   ALYDYEGQTDDELSFPEGAIIRILNKENQDDDGFWEGEFNGRIGVFPS 51


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 43.1 bits (102), Expect = 4e-04
 Identities = 38/196 (19%), Positives = 76/196 (38%), Gaps = 33/196 (16%)

Query: 269 LRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKL 328
           L+ + E L +      ++    + E ++L+ +I+  +        L +L+    ++E +L
Sbjct: 735 LKERLEELEEDLSSLEQEIENVKSELKELEARIEELEED------LHKLEEALNDLEARL 788

Query: 329 NGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELER 388
           +                 +  I  +           +K EEE  +      EI++K    
Sbjct: 789 SHS--------------RIPEIQAE----------LSKLEEEVSRIEARLREIEQKLNRL 824

Query: 389 KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEE---QARL 445
             E E  +  I+E QE     KEQ +   KE E       + ++E + L+A     ++RL
Sbjct: 825 TLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRL 884

Query: 446 LEQERLMEELRQQGTN 461
            + ++  +EL  Q   
Sbjct: 885 GDLKKERDELEAQLRE 900



 Score = 40.8 bits (96), Expect = 0.002
 Identities = 37/153 (24%), Positives = 78/153 (50%), Gaps = 7/153 (4%)

Query: 268  NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQI-DLEQAQKLEERRLSELKIKEEEIEK 326
             L A   +L  +  +  ++R E E + R+L+ +I +LE   + + +RLSELK K E +E+
Sbjct: 872  ELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEE 931

Query: 327  KLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKEL 386
            +L+   D                 ++        +Q   +E    E   M+  +   + L
Sbjct: 932  ELSEIEDPKGEDEEIPEEELSLEDVQAE------LQRVEEEIRALEPVNMLAIQEYEEVL 985

Query: 387  ERKEELEKEQIRIKEEQENIKKKKEQEEKERKE 419
            +R +EL++++ +++EE++ I ++ E+ EK+++E
Sbjct: 986  KRLDELKEKRAKLEEERKAILERIEEYEKKKRE 1018



 Score = 39.3 bits (92), Expect = 0.006
 Identities = 36/198 (18%), Positives = 89/198 (44%), Gaps = 18/198 (9%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKK 327
           N++++ + L  + EE      + E+    L+ ++   +  +++   LS+L+ +   IE +
Sbjct: 755 NVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQ-AELSKLEEEVSRIEAR 813

Query: 328 LNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELE 387
           L    +  L+  T       K  L++   +    +   KE+ +  ++ +     K++ELE
Sbjct: 814 LR-EIEQKLNRLT-----LEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELE 867

Query: 388 RKEE--------LEKEQIRIKEEQENIKKKKEQEEKERKEQE---ENQKLLLKKQQEEDR 436
            + E        LE     +K+E++ ++ +  + E++ +E E   E ++  L + + +  
Sbjct: 868 EELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLE 927

Query: 437 LKAEEQARLLEQERLMEE 454
              EE + + + +   EE
Sbjct: 928 ALEEELSEIEDPKGEDEE 945



 Score = 37.4 bits (87), Expect = 0.021
 Identities = 47/205 (22%), Positives = 86/205 (41%), Gaps = 31/205 (15%)

Query: 269 LRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKL 328
           L  + E L +Q E   R+ +  E+E  KL      E+  +LE+R L E++   EE+ KK+
Sbjct: 228 LLKEKEALERQKEAIERQLASLEEELEKLT-----EEISELEKR-LEEIEQLLEELNKKI 281

Query: 329 NGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELER 388
                       E   + VK  + +  ++ I     +  E+E+E +   +   K +    
Sbjct: 282 K--------DLGEEEQLRVKEKIGELEAE-IASLERSIAEKERELEDAEERLAKLEAEID 332

Query: 389 KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKL----------------LLKKQQ 432
           K   E E++  + E+E  ++ K  EE    ++E                     L   ++
Sbjct: 333 KLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYRE 392

Query: 433 EEDRLKAEEQARLLEQERLMEELRQ 457
           + ++LK E      E +RL EEL++
Sbjct: 393 KLEKLKREINELKRELDRLQEELQR 417



 Score = 36.6 bits (85), Expect = 0.035
 Identities = 33/185 (17%), Positives = 88/185 (47%), Gaps = 23/185 (12%)

Query: 267 SNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEE--------RRLSELK 318
             L A+ E L ++ EEE ++R +  +E  +LK++++  +A+  E           L + +
Sbjct: 332 DKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYR 391

Query: 319 IKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQ 378
            K E++++++N           +     ++  L++ + +   +  +    E K  +   +
Sbjct: 392 EKLEKLKREINEL---------KRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEE 442

Query: 379 EEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLK 438
           +E K  E++++E       ++++   ++ K +++    ++E +  +K L K Q+E    +
Sbjct: 443 KEDKALEIKKQEW------KLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAE 496

Query: 439 AEEQA 443
           A+ +A
Sbjct: 497 AQARA 501



 Score = 35.8 bits (83), Expect = 0.062
 Identities = 24/94 (25%), Positives = 45/94 (47%), Gaps = 4/94 (4%)

Query: 363 NSNKEEEEKEKQRMVQEEIKRKELERK-EELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
           +S + E  + + R+ +   +  +  RK  E+EKE  ++++E+E  K K+  EE E     
Sbjct: 691 SSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEE--KLKERLEELEEDLSS 748

Query: 422 ENQKLLLKKQQEEDRLKAEEQARLLEQERLMEEL 455
             Q++   K  E   L+A  +    +  +L E L
Sbjct: 749 LEQEIENVK-SELKELEARIEELEEDLHKLEEAL 781



 Score = 32.7 bits (75), Expect = 0.55
 Identities = 17/99 (17%), Positives = 44/99 (44%), Gaps = 8/99 (8%)

Query: 369 EEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLL 428
            E  + + ++E +        E L++E   ++ E   I+ + ++  +E  +       + 
Sbjct: 671 SEPAELQRLRERL--------EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIE 722

Query: 429 KKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEED 467
           K+ ++ ++ + + + RL E E  +  L Q+  N  +E  
Sbjct: 723 KEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELK 761



 Score = 32.0 bits (73), Expect = 1.0
 Identities = 22/88 (25%), Positives = 47/88 (53%), Gaps = 4/88 (4%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL 427
           EEE++++  + EE      E KEELE  +  ++E  +   + +++ +  R++ E+ ++ +
Sbjct: 346 EEERKRRDKLTEEYA----ELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREI 401

Query: 428 LKKQQEEDRLKAEEQARLLEQERLMEEL 455
            + ++E DRL+ E Q    E   L   +
Sbjct: 402 NELKRELDRLQEELQRLSEELADLNAAI 429



 Score = 31.6 bits (72), Expect = 1.3
 Identities = 33/191 (17%), Positives = 77/191 (40%), Gaps = 38/191 (19%)

Query: 267 SNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLE-ERRLSELKIKEEEIE 325
           S + A+   + ++    + ++   EKE ++L++Q    + Q    E+ +  L  K+EE+E
Sbjct: 808 SRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELE 867

Query: 326 KKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKE 385
           ++L                                 +       + E +     ++K++ 
Sbjct: 868 EEL---------------------------------EELEAALRDLESRL---GDLKKER 891

Query: 386 LERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARL 445
            E + +L + + +I+E +  I+KK+++   E K + E  +  L + ++      E     
Sbjct: 892 DELEAQLRELERKIEELEAQIEKKRKR-LSELKAKLEALEEELSEIEDPKGEDEEIPEEE 950

Query: 446 LEQERLMEELR 456
           L  E +  EL+
Sbjct: 951 LSLEDVQAELQ 961



 Score = 30.8 bits (70), Expect = 2.2
 Identities = 36/202 (17%), Positives = 84/202 (41%), Gaps = 21/202 (10%)

Query: 263 HVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQID-----LEQAQKLEERRLSEL 317
             + ++L        ++ E+   + ++ E E  KL  +I+     +E+ +K  ++   E 
Sbjct: 300 EAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEY 359

Query: 318 KIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMV 377
              +EE+E       +V            +K   ++       ++   +E  E +++   
Sbjct: 360 AELKEELEDLRAELEEV--DKEFAETRDELKDYREK-------LEKLKREINELKRELDR 410

Query: 378 QEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRL 437
            +E  ++  E   +L      I+ +   ++++KE +  E K+QE        +Q   D  
Sbjct: 411 LQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWK-----LEQLAADLS 465

Query: 438 KAEEQARLLEQE--RLMEELRQ 457
           K E++   L++E  R+ +EL +
Sbjct: 466 KYEQELYDLKEEYDRVEKELSK 487


>gnl|CDD|212802 cd11869, SH3_p40phox, Src Homology 3 domain of the p40phox subunit
           of NADPH oxidase.  p40phox, also called Neutrophil
           cytosol factor 4 (NCF-4), is a cytosolic subunit of the
           phagocytic NADPH oxidase complex (also called Nox2 or
           gp91phox) which plays a crucial role in the cellular
           response to bacterial infection. NADPH oxidase catalyzes
           the transfer of electrons from NADPH to oxygen during
           phagocytosis forming superoxide and reactive oxygen
           species. p40phox positively regulates NADPH oxidase in
           both phosphatidylinositol-3-phosphate (PI3P)-dependent
           and PI3P-independent manner. It contains an N-terminal
           PX domain, a central SH3 domain, and a C-terminal PB1
           domain that interacts with p67phox. The SH3 domain of
           p40phox binds to canonical polyproline and noncanonical
           motifs at the C-terminus of p47phox. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 54

 Score = 38.2 bits (89), Expect = 4e-04
 Identities = 15/52 (28%), Positives = 30/52 (57%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           A AL+D+  ++  E++F   D+I  +  +++ W  G   G  G+FP ++V +
Sbjct: 2   AEALFDFTGNSKLELNFKAGDVIFLLSRVNKDWLEGTVRGATGIFPLSFVKI 53


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 42.8 bits (101), Expect = 4e-04
 Identities = 22/95 (23%), Positives = 47/95 (49%)

Query: 372 EKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQ 431
            K + VQ + +R + + KE+LE++  +  E    I+++K++++   +E  +  +  L++Q
Sbjct: 373 GKHQDVQRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQ 432

Query: 432 QEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEE 466
            E  +L+  E+   LE      + R        EE
Sbjct: 433 LEAGKLEFNEEEYELELRLGRLKQRLDSATATPEE 467



 Score = 40.1 bits (94), Expect = 0.003
 Identities = 33/138 (23%), Positives = 55/138 (39%), Gaps = 16/138 (11%)

Query: 362 QNSNKEEEEKEKQRMVQ-EEIKRKELERKEELEK---EQIRIKEEQENIKKKKEQEEKER 417
               K+ EE+  Q   + EE KR E E +  L++   +  R++ EQ+++K K E    ER
Sbjct: 620 VAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAER 679

Query: 418 KEQEENQ--------KLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDLG 469
           K+Q E Q        K LL++QQ         +    E         Q     + +  L 
Sbjct: 680 KQQAETQLRQLDAQLKQLLEQQQA---FLEALKDDFRELRTERLAKWQV-VEGELDNQLA 735

Query: 470 YTAVALYDYQASADDEIS 487
             + A+   +  A   + 
Sbjct: 736 QLSAAIEAARTQAKARLK 753



 Score = 32.0 bits (73), Expect = 1.0
 Identities = 40/203 (19%), Positives = 79/203 (38%), Gaps = 33/203 (16%)

Query: 268 NLRAKFENLAKQTE--EESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIE 325
           ++R++ E +  + +      +  + + E+ K K  I  +  + LE+       I+EE+  
Sbjct: 356 SIRSELEEVEARLDALTGKHQDVQRKYERLKQK--IKEQLERDLEKNNERLAAIREEKDR 413

Query: 326 KKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEK--------EKQRMV 377
           +K     D+                L+      +        EEE          KQR+ 
Sbjct: 414 QKAAIEEDL--------------QALESQLRQQLEAGKLEFNEEEYELELRLGRLKQRLD 459

Query: 378 QEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRL 437
                 +ELE+ E  ++   + +EEQE  +   EQ + E ++       L K++ E    
Sbjct: 460 SATATPEELEQLEINDEALEKAQEEQEQAEANVEQLQSELRQ-------LRKRRDEALEA 512

Query: 438 KAEEQARLLEQERLMEELRQQGT 460
               + RLL+  + ++EL  Q +
Sbjct: 513 LQRAERRLLQLRQALDELELQLS 535



 Score = 30.4 bits (69), Expect = 3.3
 Identities = 28/200 (14%), Positives = 80/200 (40%), Gaps = 28/200 (14%)

Query: 269 LRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKL 328
           LR + +   +  +    K+ + E++  +   +++  + ++ E    + LK    +++ +L
Sbjct: 605 LRERLQQAEEALQSAVAKQKQAEEQLVQANAELE--EQKRAEAEARTALKQARLDLQ-RL 661

Query: 329 NGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELER 388
                                  +Q   D + +  + ++++ + + R +  ++K+   ++
Sbjct: 662 QNE--------------------QQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQ 701

Query: 389 KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQ 448
           +  LE  +   +E +     K +  E E   Q       L    E  R +A+ + + L++
Sbjct: 702 QAFLEALKDDFRELRTERLAKWQVVEGELDNQLAQ----LSAAIEAARTQAKARLKELKK 757

Query: 449 ERLMEELRQQGTNEDTEEDL 468
           +    EL     + +T ++L
Sbjct: 758 QY-DRELASLDVDPNTVKEL 776


>gnl|CDD|212906 cd11973, SH3_ASEF, Src homology 3 domain of APC-Stimulated guanine
           nucleotide Exchange Factor.  ASEF, also called ARHGEF4,
           exists in an autoinhibited form and is activated upon
           binding of the tumor suppressor APC (adenomatous
           polyposis coli). GEFs activate small GTPases by
           exchanging bound GDP for free GTP. ASEF can activate
           Rac1 or Cdc42. Truncated ASEF, which is found in
           colorectal cancers, is constitutively active and has
           been shown to promote angiogenesis and cancer cell
           migration. ASEF contains a SH3 domain followed by RhoGEF
           (also called Dbl-homologous or DH) and Pleckstrin
           Homology (PH) domains. In its autoinhibited form, the
           SH3 domain of ASEF forms an extensive interface with the
           DH and PH domains, blocking the Rac binding site. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 73

 Score = 38.8 bits (90), Expect = 4e-04
 Identities = 21/68 (30%), Positives = 33/68 (48%)

Query: 457 QQGTNEDTEEDLGYTAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLF 516
           Q   NE   +     A AL+D+    D E+ F   D+I  ++  ++ WW G      G F
Sbjct: 5   QLAINELISDGSVVCAEALWDHVTMDDQELGFKAGDVIEVMDATNKEWWWGRVLDSEGWF 64

Query: 517 PANYVSLQ 524
           PA++V L+
Sbjct: 65  PASFVRLR 72


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 41.0 bits (97), Expect = 4e-04
 Identities = 22/78 (28%), Positives = 46/78 (58%), Gaps = 3/78 (3%)

Query: 363 NSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKK--KKEQEEKERKEQ 420
              K  + ++ +  +++EIK+ E E KEELEK    ++ + E I+K  ++E++ +E++  
Sbjct: 111 GMRKALQAEQGKSELEQEIKKLE-EEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHA 169

Query: 421 EENQKLLLKKQQEEDRLK 438
           +E   L  + QQ + +L+
Sbjct: 170 DEIAFLKKQNQQLKSQLE 187



 Score = 35.6 bits (83), Expect = 0.027
 Identities = 18/79 (22%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 373 KQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQ 432
           ++ +  E+ K +  +  ++LE+E+  +++    ++ K E  EK  +E+EE Q    +   
Sbjct: 113 RKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEK--REEEERQIEEKRHAD 170

Query: 433 EEDRLKAEEQARLLEQERL 451
           E   LK + Q    + E++
Sbjct: 171 EIAFLKKQNQQLKSQLEQI 189



 Score = 29.9 bits (68), Expect = 2.0
 Identities = 21/67 (31%), Positives = 39/67 (58%), Gaps = 7/67 (10%)

Query: 389 KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQ 448
           ++ L+ EQ + + EQE  K ++E+EE E++  E      L+ + E    K EE+ R +E+
Sbjct: 113 RKALQAEQGKSELEQEIKKLEEEKEELEKRVAE------LEAKLEAIE-KREEEERQIEE 165

Query: 449 ERLMEEL 455
           +R  +E+
Sbjct: 166 KRHADEI 172


>gnl|CDD|212857 cd11924, SH3_Vinexin_2, Second Src Homology 3 domain of Vinexin,
           also called Sorbin and SH3 domain containing 3 (Sorbs3).
            Vinexin is also called Sorbs3, SH3P3, and
           SH3-containing adapter molecule 1 (SCAM-1). It is an
           adaptor protein containing one sorbin homology (SoHo)
           and three SH3 domains. Vinexin was first identified as a
           vinculin binding protein; it is co-localized with
           vinculin at cell-ECM and cell-cell adhesion sites. There
           are several splice variants of vinexin: alpha, which
           contains the SoHo and three SH3 domains and displays
           tissue-specific expression; and beta, which contains
           only the three SH3 domains and is widely expressed.
           Vinexin alpha stimulates the accumulation of F-actin at
           focal contact sites. Vinexin also promotes keratinocyte
           migration and wound healing. The SH3 domains of vinexin
           have been reported to bind a number of ligands including
           vinculin, WAVE2, DLG5, Abl, and Cbl. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 56

 Score = 38.0 bits (88), Expect = 4e-04
 Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 2/54 (3%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHG--QYGLFPANYVSL 523
           AVA Y ++   + E+SF   + I  I  ++E W+ G   G  + G+FPA+YV +
Sbjct: 3   AVAQYTFKGDLEVELSFRKGEHICLIRKVNENWYEGRITGTGRQGIFPASYVQV 56


>gnl|CDD|212854 cd11921, SH3_Vinexin_1, First Src Homology 3 domain of Vinexin,
           also called Sorbin and SH3 domain containing 3 (Sorbs3).
            Vinexin is also called Sorbs3, SH3P3, and
           SH3-containing adapter molecule 1 (SCAM-1). It is an
           adaptor protein containing one sorbin homology (SoHo)
           and three SH3 domains. Vinexin was first identified as a
           vinculin binding protein; it is co-localized with
           vinculin at cell-ECM and cell-cell adhesion sites. There
           are several splice variants of vinexin: alpha, which
           contains the SoHo and three SH3 domains and displays
           tissue-specific expression; and beta, which contains
           only the three SH3 domains and is widely expressed.
           Vinexin alpha stimulates the accumulation of F-actin at
           focal contact sites. Vinexin also promotes keratinocyte
           migration and wound healing. The SH3 domains of vinexin
           have been reported to bind a number of ligands including
           vinculin, WAVE2, DLG5, Abl, and Cbl. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 55

 Score = 38.0 bits (88), Expect = 5e-04
 Identities = 18/48 (37%), Positives = 30/48 (62%)

Query: 476 YDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           +D+QA +  E++    DI+   + +D+ W  G  HG+ G+FPANYV +
Sbjct: 7   FDFQAQSPKELTLQKGDIVYIHKEVDKNWLEGEHHGRVGIFPANYVEV 54


>gnl|CDD|212718 cd11784, SH3_SH3RF2_3, Third Src Homology 3 domain of SH3 domain
           containing ring finger 2.  SH3RF2 is also called POSHER
           (POSH-eliminating RING protein) or HEPP1 (heart protein
           phosphatase 1-binding protein). It acts as an
           anti-apoptotic regulator of the JNK pathway by binding
           to and promoting the degradation of SH3RF1 (or POSH), a
           scaffold protein that is required for pro-apoptotic JNK
           activation. It may also play a role in cardiac functions
           together with protein phosphatase 1. SH3RF2 contains an
           N-terminal RING finger domain and three SH3 domains.
           This model represents the third SH3 domain, located in
           the middle, of SH3RF2. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 37.8 bits (88), Expect = 5e-04
 Identities = 19/52 (36%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYC--HGQYGLFPANYVS 522
           VAL+ Y A   +E+     + +  +    EGW RG     G+ G+FP+NYVS
Sbjct: 3   VALHSYSAHRPEELELQKGEGVRVLGKFQEGWLRGLSLVTGRVGIFPSNYVS 54


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 41.9 bits (99), Expect = 6e-04
 Identities = 25/73 (34%), Positives = 42/73 (57%), Gaps = 12/73 (16%)

Query: 379 EEIKRK-ELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRL 437
            E+ RK +  R+EE EK    I +  E  ++++ QE+KE K++EE       ++ +  +L
Sbjct: 255 PEVLRKVDKTREEEEEK----ILKAAEEERQEEAQEKKEEKKKEE-------REAKLAKL 303

Query: 438 KAEEQARLLEQER 450
             EEQ +L E+ER
Sbjct: 304 SPEEQRKLEEKER 316



 Score = 40.3 bits (95), Expect = 0.002
 Identities = 20/59 (33%), Positives = 39/59 (66%)

Query: 363 NSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
           +  +EEEE++  +  +EE + +  E+KEE +KE+   K  + + +++++ EEKERK+Q 
Sbjct: 262 DKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQA 320



 Score = 37.6 bits (88), Expect = 0.012
 Identities = 21/64 (32%), Positives = 38/64 (59%), Gaps = 8/64 (12%)

Query: 370 EKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLK 429
           +K ++   ++ +K  E ER+EE +++    KEE    KKK+E+E K  K   E Q+ L +
Sbjct: 262 DKTREEEEEKILKAAEEERQEEAQEK----KEE----KKKEEREAKLAKLSPEEQRKLEE 313

Query: 430 KQQE 433
           K+++
Sbjct: 314 KERK 317



 Score = 31.5 bits (72), Expect = 1.00
 Identities = 17/64 (26%), Positives = 30/64 (46%), Gaps = 6/64 (9%)

Query: 266 PSNLRAKFENLAKQTEEESRKRSEEEK-EKRKLKDQIDLEQAQKLEERRLSELKIKEEEI 324
              +  K +   ++ EE+  K +EEE+ E+ + K     ++ +K EER     K+  EE 
Sbjct: 254 SPEVLRKVDKTREEEEEKILKAAEEERQEEAQEK-----KEEKKKEEREAKLAKLSPEEQ 308

Query: 325 EKKL 328
            K  
Sbjct: 309 RKLE 312



 Score = 30.7 bits (70), Expect = 1.8
 Identities = 14/52 (26%), Positives = 27/52 (51%), Gaps = 7/52 (13%)

Query: 268 NLRAKFE------NLAKQTEEESRKRSEEEKEKRK-LKDQIDLEQAQKLEER 312
             R + E         ++ EE   K+ E++KE+R+    ++  E+ +KLEE+
Sbjct: 263 KTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEK 314



 Score = 30.3 bits (69), Expect = 2.3
 Identities = 10/49 (20%), Positives = 28/49 (57%)

Query: 407 KKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEEL 455
            +   + +K R+E+EE      +++++E+  + +E+ +  E+E  + +L
Sbjct: 255 PEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKL 303



 Score = 29.1 bits (66), Expect = 5.5
 Identities = 17/55 (30%), Positives = 32/55 (58%), Gaps = 10/55 (18%)

Query: 365 NKEEEEKEKQRMVQEEIKRKELERK-EELEKEQIRIKEEQENIKKKKEQEEKERK 418
            +E +E+ +++  +EE K++E E K  +L  E+ R  EE       KE++++ RK
Sbjct: 277 EEERQEEAQEK--KEEKKKEEREAKLAKLSPEEQRKLEE-------KERKKQARK 322


>gnl|CDD|212702 cd11768, SH3_Tec_like, Src Homology 3 domain of Tec-like Protein
           Tyrosine Kinases.  The Tec (Tyrosine kinase expressed in
           hepatocellular carcinoma) subfamily is composed of Tec,
           Btk, Bmx (Etk), Itk (Tsk, Emt), Rlk (Txk), and similar
           proteins. They are cytoplasmic (or nonreceptor) tyr
           kinases containing Src homology protein interaction
           domains (SH3, SH2) N-terminal to the catalytic tyr
           kinase domain. Most Tec subfamily members (except Rlk)
           also contain an N-terminal pleckstrin homology (PH)
           domain, which binds the products of PI3K and allows
           membrane recruitment and activation. In addition, some
           members contain the Tec homology (TH) domain, which
           contains proline-rich and zinc-binding regions. Tec
           kinases are expressed mainly by haematopoietic cells,
           although Tec and Bmx are also found in endothelial
           cells. B-cells express Btk and Tec, while T-cells
           express Itk, Txk, and Tec. Collectively, Tec kinases are
           expressed in a variety of myeloid cells such as mast
           cells, platelets, macrophages, and dendritic cells. Each
           Tec kinase shows a distinct cell-type pattern of
           expression. The function of Tec kinases in lymphoid
           cells have been studied extensively. They play important
           roles in the development, differentiation, maturation,
           regulation, survival, and function of B-cells and
           T-cells. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 54

 Score = 37.6 bits (88), Expect = 6e-04
 Identities = 16/50 (32%), Positives = 26/50 (52%), Gaps = 1/50 (2%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGY-CHGQYGLFPANYV 521
           VALYD+Q     ++  +  +    ++  +E WWR    +G  G  P+NYV
Sbjct: 3   VALYDFQPIEPGDLPLEKGEEYVVLDDSNEHWWRARDKNGNEGYIPSNYV 52


>gnl|CDD|212997 cd12064, SH3_GRAF, Src Homology 3 domain of GTPase Regulator
           Associated with Focal adhesion kinase.  GRAF, also
           called Rho GTPase activating protein 26 (ARHGAP26),
           Oligophrenin-1-like (OPHN1L) or GRAF1, is a GAP with
           activity towards RhoA and Cdc42 and is only weakly
           active towards Rac1. It influences Rho-mediated
           cytoskeletal rearrangements and binds focal adhesion
           kinase (FAK), which is a critical component of integrin
           signaling. It is essential for the major
           clathrin-independent endocytic pathway mediated by
           pleiomorphic membranes. GRAF contains an N-terminal BAR
           domain, followed by a Pleckstrin homology (PH) domain, a
           Rho GAP domain, and a C-terminal SH3 domain. The SH3
           domain of GRAF binds PKNbeta, a target of the small
           GTPase Rho. SH3 domains bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs; they play a role in the regulation of
           enzymes by intramolecular interactions, changing the
           subcellular localization of signal pathway components
           and mediate multiprotein complex assemblies.
          Length = 56

 Score = 37.8 bits (87), Expect = 6e-04
 Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDE-GWWRGYCHGQYGLFPANYVSL 523
           A ALY  +A  D E+SF    +  N+    E GW  G  +G+ GL P NYV  
Sbjct: 3   AKALYACKAEHDSELSFTAGTVFDNVHPSQEPGWLEGTLNGKTGLIPENYVEF 55


>gnl|CDD|212785 cd11851, SH3_RIM-BP, Src homology 3 domains of Rab3-interacting
           molecules (RIMs) binding proteins.  RIMs binding
           proteins (RBPs, RIM-BPs) associate with calcium channels
           present in photoreceptors, neurons, and hair cells; they
           interact simultaneously with specific calcium channel
           subunits, and active zone proteins, RIM1 and RIM2. RIMs
           are part of the matrix at the presynaptic active zone
           and are associated with synaptic vesicles through their
           interaction with the small GTPase Rab3. RIM-BPs play a
           role in regulating synaptic transmission by serving as
           adaptors and linking calcium channels with the synaptic
           vesicle release machinery. RIM-BPs contain three SH3
           domains and two to three fibronectin III repeats.
           Invertebrates contain one, while vertebrates contain at
           least two RIM-BPs, RIM-BP1 and RIM-BP2. RIM-BP1 is also
           called peripheral-type benzodiazapine receptor
           associated protein 1 (PRAX-1). Mammals contain a third
           protein, RIM-BP3. RIM-BP1 and RIM-BP2 are predominantly
           expressed in the brain where they display overlapping
           but distinct expression patterns, while RIM-BP3 is
           almost exclusively expressed in the testis and is
           essential in spermiogenesis. The SH3 domains of RIM-BPs
           bind to the PxxP motifs of RIM1, RIM2, and L-type
           (alpha1D) and N-type (alpha1B) calcium channel subunits.
           SH3 domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 62

 Score = 37.7 bits (88), Expect = 7e-04
 Identities = 20/59 (33%), Positives = 31/59 (52%), Gaps = 9/59 (15%)

Query: 473 VALYDYQAS-------ADDEISFDPDDIITNIEMIDE-GWWRGY-CHGQYGLFPANYVS 522
           VALYDY           ++E+SF   D++     +DE G++ G    G+ GL P+N+V 
Sbjct: 3   VALYDYNPETMSPNDDPEEELSFHAGDVVRVYGPMDEDGFYYGELEGGRKGLVPSNFVQ 61


>gnl|CDD|212908 cd11975, SH3_ARHGEF9, Src homology 3 domain of the Rho guanine
           nucleotide exchange factor ARHGEF9.  ARHGEF9, also
           called PEM2 or collybistin, selectively activates Cdc42
           by exchanging bound GDP for free GTP. It is highly
           expressed in the brain and it interacts with gephyrin, a
           postsynaptic protein associated with GABA and glycine
           receptors. Mutations in the ARHGEF9 gene cause X-linked
           mental retardation with associated features like
           seizures, hyper-anxiety, aggressive behavior, and
           sensory hyperarousal. ARHGEF9 contains a SH3 domain
           followed by RhoGEF (also called Dbl-homologous or DH)
           and Pleckstrin Homology (PH) domains. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 62

 Score = 37.8 bits (87), Expect = 7e-04
 Identities = 17/53 (32%), Positives = 32/53 (60%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           +A A++D+   A+ E++F   D+I  ++  ++ WW G    + G FPA++V L
Sbjct: 6   SAEAVWDHVTMANRELAFKAGDVIKVLDASNKDWWWGQIDDEEGWFPASFVRL 58


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 41.5 bits (97), Expect = 7e-04
 Identities = 29/84 (34%), Positives = 51/84 (60%), Gaps = 6/84 (7%)

Query: 366 KEEEEKEKQRMVQEEIKRK---ELERKEELEKEQIRIKEEQ---ENIKKKKEQEEKERKE 419
            E++ K+K+  V EE+K K   E ER ++LEKE+++ +E+Q   E  +K+ + E+K+++E
Sbjct: 77  GEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEE 136

Query: 420 QEENQKLLLKKQQEEDRLKAEEQA 443
           Q        KK+ E  + KA  +A
Sbjct: 137 QARKAAAEQKKKAEAAKAKAAAEA 160



 Score = 39.5 bits (92), Expect = 0.004
 Identities = 29/92 (31%), Positives = 47/92 (51%), Gaps = 5/92 (5%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKL- 426
           EE K KQ   QE +K+ E ER +  E+++   + E++   ++K+QEE+ RK   E +K  
Sbjct: 90  EELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKA 149

Query: 427 ---LLKKQQEEDRLKAEEQA-RLLEQERLMEE 454
                K   E  +LKA  +A +  E+     E
Sbjct: 150 EAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAE 181



 Score = 38.8 bits (90), Expect = 0.006
 Identities = 24/84 (28%), Positives = 47/84 (55%), Gaps = 2/84 (2%)

Query: 369 EEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL- 427
           ++  + +  Q   K+ E +RK++ E+    +K +Q   +++ +Q EKER + +E QK   
Sbjct: 62  QQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAE 121

Query: 428 -LKKQQEEDRLKAEEQARLLEQER 450
             +KQ + ++ + EEQAR    E+
Sbjct: 122 EAEKQAQLEQKQQEEQARKAAAEQ 145



 Score = 37.6 bits (87), Expect = 0.012
 Identities = 33/171 (19%), Positives = 72/171 (42%), Gaps = 41/171 (23%)

Query: 279 QTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSP 338
           Q+++ S K+ E++++K++ +   +L+  Q  E+ RL +L+ +  + +++           
Sbjct: 68  QSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQ----------- 116

Query: 339 STETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIR 398
                                      K+ EE EKQ  ++++ +++E  RK   E+++  
Sbjct: 117 --------------------------QKQAEEAEKQAQLEQK-QQEEQARKAAAEQKK-- 147

Query: 399 IKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQE 449
            K E    K   E  + +   + + +     K  EE + KAE  A   + E
Sbjct: 148 -KAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAE 197


>gnl|CDD|212863 cd11930, SH3_SH3RF1_2, Second Src Homology 3 domain of SH3 domain
           containing ring finger protein 1, an E3
           ubiquitin-protein ligase.  SH3RF1 is also called POSH
           (Plenty of SH3s) or SH3MD2 (SH3 multiple domains protein
           2). It is a scaffold protein that acts as an E3
           ubiquitin-protein ligase. It plays a role in calcium
           homeostasis through the control of the ubiquitin domain
           protein Herp. It may also have a role in regulating
           death receptor mediated and JNK mediated apoptosis.
           SH3RF1 also enhances the ubiquitination of ROMK1
           potassium channel resulting in its increased
           endocytosis. It contains an N-terminal RING finger
           domain and four SH3 domains. This model represents the
           second SH3 domain, located C-terminal of the first SH3
           domain at the N-terminal half, of SH3RF1. SH3 domains
           are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 37.3 bits (86), Expect = 7e-04
 Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 4/52 (7%)

Query: 474 ALYDYQ---ASAD-DEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           ALYD++     AD D + F  DDI+T I  +DE W  G    + G+FP +YV
Sbjct: 4   ALYDFEVKDKEADKDCLPFAKDDILTVIRRVDENWAEGMLGDKIGIFPISYV 55


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 41.4 bits (98), Expect = 8e-04
 Identities = 27/96 (28%), Positives = 58/96 (60%), Gaps = 5/96 (5%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEE-----QENIKKKKEQEEKERKEQEE 422
            +E+E+   + E I+ ++    +E  ++Q +++EE     +E I++K+E++E+ER+E+ +
Sbjct: 95  LQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELK 154

Query: 423 NQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
             +   +K + E+  +AE + R  E+ER +  LR Q
Sbjct: 155 ILEYQREKAEREEEREAERRERKEEKEREVARLRAQ 190



 Score = 40.3 bits (95), Expect = 0.002
 Identities = 25/93 (26%), Positives = 56/93 (60%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           + EEE+E +R  ++E K +E+ R    ++E    +EE + ++    QEE ERKE+++ ++
Sbjct: 164 EREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEKE 223

Query: 426 LLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
              K+++++  L+   + ++ E+E  ++E R +
Sbjct: 224 EAEKRRRQKQELQRAREEQIEEKEERLQEERAE 256



 Score = 39.5 bits (93), Expect = 0.004
 Identities = 44/189 (23%), Positives = 87/189 (46%), Gaps = 32/189 (16%)

Query: 276 LAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVP 335
           L +Q EE  ++R EE +E+ + ++Q+D    +  EE      + +E++ + +        
Sbjct: 76  LQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREE------ 129

Query: 336 LSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEE---EKEKQRMVQEEIKRKELERKEEL 392
                      +    ++        +   KEEE   E+E++  + E  + K  ER+EE 
Sbjct: 130 -----------IDEFNEE--------RIERKEEEKEREREEELKILEYQREKA-EREEER 169

Query: 393 EKEQIRIKEEQENIKKK-KEQEEKERKEQEENQKLLLKKQQEED--RLKAEEQARLLEQE 449
           E E+   KEE+E    + + Q+E+   E+EE  +L     QEE   + + +E+    ++ 
Sbjct: 170 EAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEKEEAEKRR 229

Query: 450 RLMEELRQQ 458
           R  +EL++ 
Sbjct: 230 RQKQELQRA 238



 Score = 36.4 bits (85), Expect = 0.033
 Identities = 24/93 (25%), Positives = 55/93 (59%), Gaps = 1/93 (1%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
            EEEE+E++R  +    R  L+ + E E+E+ R +E +E ++++++ +E   + QEE++ 
Sbjct: 56  AEEEERERKRKEERREGRAVLQEQIE-EREKRRQEEYEERLQEREQMDEIIERIQEEDEA 114

Query: 426 LLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
              +K++++ +L+ E      E+    EE +++
Sbjct: 115 EAQEKREKQKKLREEIDEFNEERIERKEEEKER 147



 Score = 36.0 bits (84), Expect = 0.041
 Identities = 30/99 (30%), Positives = 63/99 (63%), Gaps = 6/99 (6%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEE-----QENIKKKKEQEEKERKEQ 420
           +EE ++ +  + QEE +RKE ++++E  +++ R K+E     +E I++K+E+ ++ER E+
Sbjct: 198 REELDELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEE 257

Query: 421 EENQKLLLKKQQEEDRL-KAEEQARLLEQERLMEELRQQ 458
           E  ++ +L+KQ E++ L +   + R +++     EL QQ
Sbjct: 258 EAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQ 296



 Score = 34.5 bits (80), Expect = 0.12
 Identities = 29/113 (25%), Positives = 57/113 (50%), Gaps = 5/113 (4%)

Query: 365 NKEEEEKEKQRMV-----QEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKE 419
            + E +++++R       QE+I+ +E  R+EE E+     ++  E I++ +E++E E +E
Sbjct: 59  EERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQE 118

Query: 420 QEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDLGYTA 472
           + E QK L ++  E +  + E +    E+ER  E    +   E  E +    A
Sbjct: 119 KREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREA 171



 Score = 33.3 bits (77), Expect = 0.24
 Identities = 21/93 (22%), Positives = 56/93 (60%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           K+E ++ ++  ++E+ +R + ER EE  + +  ++++ E+ + ++E  EK R ++ E+++
Sbjct: 232 KQELQRAREEQIEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRR 291

Query: 426 LLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
            L ++ +E++  +A E+   LE+   + E   +
Sbjct: 292 ELEQQIEEKEERRAAEREEELEEGERLREEEAE 324



 Score = 33.0 bits (76), Expect = 0.32
 Identities = 25/86 (29%), Positives = 54/86 (62%), Gaps = 3/86 (3%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIR---IKEEQENIKKKKEQEEKERKEQEE 422
           + EEE E++RM++++ + +ELE++   ++   R    +E ++ I++K+E+   ER+E+ E
Sbjct: 254 RAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEELE 313

Query: 423 NQKLLLKKQQEEDRLKAEEQARLLEQ 448
             + L +++ E      EE+ RLL++
Sbjct: 314 EGERLREEEAERQARIEEERQRLLKE 339



 Score = 33.0 bits (76), Expect = 0.37
 Identities = 21/90 (23%), Positives = 54/90 (60%)

Query: 377 VQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDR 436
           ++E+ + K  E++EE   +++  +E  + + +++E+E K ++E+ E + +L ++ +E ++
Sbjct: 26  IEEKKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREK 85

Query: 437 LKAEEQARLLEQERLMEELRQQGTNEDTEE 466
            + EE    L++   M+E+ ++   ED  E
Sbjct: 86  RRQEEYEERLQEREQMDEIIERIQEEDEAE 115



 Score = 32.6 bits (75), Expect = 0.53
 Identities = 19/100 (19%), Positives = 53/100 (53%), Gaps = 8/100 (8%)

Query: 365 NKEEEEKEKQRMVQ------EEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERK 418
            K EE++E++R+ +       +   +E ER+ + ++E+   +   +   +++E+  +E  
Sbjct: 32  IKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEY 91

Query: 419 EQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
           E+   ++  + +  E  R++ E++A   E+    ++LR++
Sbjct: 92  EERLQEREQMDEIIE--RIQEEDEAEAQEKREKQKKLREE 129



 Score = 31.8 bits (73), Expect = 0.88
 Identities = 26/94 (27%), Positives = 51/94 (54%), Gaps = 4/94 (4%)

Query: 367 EEEEKEKQRMVQEEIKRKELERK-EELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           E E KE+Q+  +E  KR+  +++ +   +EQI  KEE+   ++++ +EE ER+   E Q 
Sbjct: 212 EYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERL--QEERAEEEAERERMLEKQA 269

Query: 426 LLLKKQQEEDRLKAEEQARL-LEQERLMEELRQQ 458
              + +QE    +  ++     E E+ +EE  ++
Sbjct: 270 EDEELEQENAEKRRMKRLEHRRELEQQIEEKEER 303



 Score = 31.4 bits (72), Expect = 1.0
 Identities = 27/102 (26%), Positives = 58/102 (56%), Gaps = 12/102 (11%)

Query: 366 KEEEEKEKQRMVQEEIKRKE-------LERKEELEKEQIRIKEEQENIKKKKEQEE--KE 416
           ++E E  + R  QEE + +         +  +E  + + R KE++E  K++++++E  + 
Sbjct: 179 EKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRA 238

Query: 417 RKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
           R+EQ E ++  L++++ E+     E+ R+LE++   EEL Q+
Sbjct: 239 REEQIEEKEERLQEERAEEE---AERERMLEKQAEDEELEQE 277


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 41.6 bits (98), Expect = 8e-04
 Identities = 40/215 (18%), Positives = 71/215 (33%), Gaps = 26/215 (12%)

Query: 283 ESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELK-----IKEEEIEKKLNGHSDVPLS 337
           E+  +S EE  K   K   ++E+  +  E RLSEL      +++   E +   + D  LS
Sbjct: 91  EAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLEDLLEELEPLAYLDFDLS 150

Query: 338 --PSTETPPVPVKSILKQPTSDGIPIQNSNKEEE---EKEKQRMV-----QEEIKRKELE 387
                +   V +   L +       +     E     E  +  +V       E   K  +
Sbjct: 151 LLRGLKFLLVRL--GLVRREKLEALVGVIEDEVALYGENVEASVVIVVAHGAEDLDKVSK 208

Query: 388 RKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLE 447
              EL  E   + E      +   + E+   E ++  + L  + +    L  +    LL 
Sbjct: 209 ILNELGFELYEVPEFDGGPSELISELEEVIAEIQDELESLRSELEA---LAEKIAEELLA 265

Query: 448 QERLMEELRQQGTNEDTEEDLGYTAVALYDYQASA 482
              ++E  +  G   D    L  T    Y      
Sbjct: 266 VREILEIEKALG---DVLSKLARTE---YTLAIEG 294


>gnl|CDD|212828 cd11895, SH3_FCHSD1_2, Second Src Homology 3 domain of FCH and
           double SH3 domains protein 1.  FCHSD1 has a domain
           structure consisting of an N-terminal F-BAR (FES-CIP4
           Homology and Bin/Amphiphysin/Rvs), two SH3, and
           C-terminal proline-rich domains. It has only been
           characterized in silico and its function is unknown. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 58

 Score = 37.3 bits (86), Expect = 9e-04
 Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 4/51 (7%)

Query: 472 AVALYDYQASADDEISFDPDDIITNI----EMIDEGWWRGYCHGQYGLFPA 518
           A ALY Y   + +E+SF    +I  +    + +D+G+WRG   G+ G+FP+
Sbjct: 2   ARALYSYTGQSPEELSFPEGALIRLLPRAQDGVDDGFWRGEFGGRVGVFPS 52


>gnl|CDD|212742 cd11808, SH3_Alpha_Spectrin, Src homology 3 domain of Alpha
           Spectrin.  Spectrin is a major structural component of
           the red blood cell membrane skeleton and is important in
           erythropoiesis and membrane biogenesis. It is a
           flexible, rope-like molecule composed of two subunits,
           alpha and beta, which consist of many spectrin-type
           repeats. Alpha and beta spectrin associate to form
           heterodimers and tetramers; spectrin tetramer formation
           is critical for red cell shape and deformability.
           Defects in alpha spectrin have been associated with
           inherited hemolytic anemias including hereditary
           spherocytosis (HSp), hereditary elliptocytosis (HE), and
           hereditary pyropoikilocytosis (HPP). Alpha spectrin
           contains a middle SH3 domain and a C-terminal EF-hand
           binding motif in addition to multiple spectrin repeats.
           SH3 domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 37.1 bits (86), Expect = 0.001
 Identities = 19/49 (38%), Positives = 28/49 (57%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           VALYDYQ  +  E+S    DI+T +   ++ WW+   + + G  PA YV
Sbjct: 3   VALYDYQEKSPREVSMKKGDILTLLNSSNKDWWKVEVNDRQGFVPAAYV 51


>gnl|CDD|212898 cd11965, SH3_ASAP1, Src homology 3 domain of ArfGAP with SH3
           domain, ankyrin repeat and PH domain containing protein
           1.  ASAP1 is also called DDEF1 (Development and
           Differentiation Enhancing Factor 1), AMAP1, centaurin
           beta-4, or PAG2. an Arf GTPase activating protein (GAP)
           with activity towards Arf1 and Arf5 but not Arf6.
           However, it has been shown to bind GTP-Arf6 stably
           without GAP activity. It has been implicated in cell
           growth, migration, and survival, as well as in tumor
           invasion and malignancy. It binds paxillin and
           cortactin, two components of invadopodia which are
           essential for tumor invasiveness. It also binds focal
           adhesion kinase (FAK) and the SH2/SH3 adaptor CrkL.
           ASAP1 contains an N-terminal BAR domain, followed by a
           Pleckstrin homology (PH) domain, an Arf GAP domain,
           ankyrin (ANK) repeats, and a C-terminal SH3 domain. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 57

 Score = 37.3 bits (86), Expect = 0.001
 Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 3/52 (5%)

Query: 475 LYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQ---YGLFPANYVSL 523
           +YD QA  DDE++F   ++I      D+ WW G+  GQ    G+FP ++V +
Sbjct: 5   IYDCQADNDDELTFVEGEVIIVTGEEDQEWWIGHIEGQPERKGVFPVSFVHI 56


>gnl|CDD|212692 cd11758, SH3_CRK_N, N-terminal Src Homology 3 domain of Ct10
           Regulator of Kinase adaptor proteins.  CRK adaptor
           proteins consists of SH2 and SH3 domains, which bind
           tyrosine-phosphorylated peptides and proline-rich
           motifs, respectively. They function downstream of
           protein tyrosine kinases in many signaling pathways
           started by various extracellular signals, including
           growth and differentiation factors. Cellular CRK (c-CRK)
           contains a single SH2 domain, followed by N-terminal and
           C-terminal SH3 domains. It is involved in the regulation
           of many cellular processes including cell growth,
           motility, adhesion, and apoptosis. CRK has been
           implicated in the malignancy of various human cancers.
           The N-terminal SH3 domain of CRK binds a number of
           target proteins including DOCK180, C3G, SOS, and cABL.
           The CRK family includes two alternatively spliced
           protein forms, CRKI and CRKII, that are expressed by the
           CRK gene, and the CRK-like (CRKL) protein, which is
           expressed by a distinct gene (CRKL). SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 55

 Score = 37.0 bits (86), Expect = 0.001
 Identities = 16/54 (29%), Positives = 27/54 (50%), Gaps = 1/54 (1%)

Query: 470 YTAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGY-CHGQYGLFPANYVS 522
               AL+D+  + D+++ F   +I+T I   +E WW      G+ G+ P  YV 
Sbjct: 1   EYVRALFDFPGNDDEDLPFKKGEILTVIRKPEEQWWNARNSEGKTGMIPVPYVE 54


>gnl|CDD|212939 cd12006, SH3_Fyn_Yrk, Src homology 3 domain of Fyn and Yrk Protein
           Tyrosine Kinases.  Fyn and Yrk (Yes-related kinase) are
           members of the Src subfamily of proteins, which are
           cytoplasmic (or non-receptor) PTKs. Fyn, together with
           Lck, plays a critical role in T-cell signal transduction
           by phosphorylating ITAM (immunoreceptor tyr activation
           motif) sequences on T-cell receptors, ultimately leading
           to the proliferation and differentiation of T-cells. In
           addition, Fyn is involved in the myelination of neurons,
           and is implicated in Alzheimer's and Parkinson's
           diseases. Yrk has been detected only in chickens. It is
           primarily found in neuronal and epithelial cells and in
           macrophages. It may play a role in inflammation and in
           response to injury. Src kinases contain an N-terminal
           SH4 domain with a myristoylation site, followed by SH3
           and SH2 domains, a tyr kinase domain, and a regulatory
           C-terminal region containing a conserved tyr. They are
           activated by autophosphorylation at the tyr kinase
           domain, but are negatively regulated by phosphorylation
           at the C-terminal tyr by Csk (C-terminal Src Kinase).
           The SH3 domain of Src kinases contributes to substrate
           recruitment by binding adaptor proteins/substrates, and
           regulation of kinase activity through an intramolecular
           interaction. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 56

 Score = 37.0 bits (85), Expect = 0.001
 Identities = 19/52 (36%), Positives = 29/52 (55%), Gaps = 2/52 (3%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWW--RGYCHGQYGLFPANYVS 522
           VALYDY+A  +D++SF   +    +   +  WW  R    G+ G  P+NYV+
Sbjct: 4   VALYDYEARTEDDLSFHKGEKFQILNSSEGDWWEARSLTTGETGYIPSNYVA 55


>gnl|CDD|212850 cd11917, SH3_Sorbs2_3, Third (or C-terminal) Src Homology 3 domain
           of Sorbin and SH3 domain containing 2 (Sorbs2), also
           called Arg-binding protein 2 (ArgBP2).  Sorbs2 or ArgBP2
           is an adaptor protein containing one sorbin homology
           (SoHo) and three SH3 domains. It regulates
           actin-dependent processes including cell adhesion,
           morphology, and migration. It is expressed in many
           tissues and is abundant in the heart. Like vinexin, it
           is found in focal adhesion where it interacts with
           vinculin and afadin. It also localizes in epithelial
           cell stress fibers and in cardiac muscle cell Z-discs.
           Sorbs2 has been implicated to play roles in the
           signaling of c-Arg, Akt, and Pyk2. Other interaction
           partners of Sorbs2 include c-Abl, flotillin, spectrin,
           dynamin 1/2, synaptojanin, PTP-PEST, among others. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 61

 Score = 36.9 bits (85), Expect = 0.001
 Identities = 19/50 (38%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQ--YGLFPANYV 521
           ALY+Y    +DE+     D+I  +E  D+GW+ G       +G FP NYV
Sbjct: 9   ALYNYMPRNEDELELREGDVIDVMEKCDDGWFVGTSRRTKFFGTFPGNYV 58


>gnl|CDD|212695 cd11761, SH3_FCHSD_1, First Src Homology 3 domain of FCH and double
           SH3 domains proteins.  This group is composed of FCH and
           double SH3 domains protein 1 (FCHSD1) and FCHSD2. These
           proteins have a common domain structure consisting of an
           N-terminal F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs), two SH3, and C-terminal
           proline-rich domains. They have only been characterized
           in silico and their functions remain unknown. This group
           also includes the insect protein, nervous wreck, which
           acts as a regulator of synaptic growth signaling. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 57

 Score = 37.0 bits (86), Expect = 0.001
 Identities = 17/53 (32%), Positives = 26/53 (49%), Gaps = 2/53 (3%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMID-EGWWRGY-CHGQYGLFPANYV 521
           T   LY Y+A   DE++    + +  IE  D +GW +     G+ G  P NY+
Sbjct: 3   TCKVLYSYEAQRPDELTITEGEELEVIEDGDGDGWVKARNKSGEVGYVPENYL 55


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 40.6 bits (95), Expect = 0.001
 Identities = 23/119 (19%), Positives = 58/119 (48%), Gaps = 3/119 (2%)

Query: 335 PLSPSTETPPVPVKSILKQPTSDGIPI-QNSNKEEEEKEKQRMVQEEIKRKELERKEELE 393
            L  S +  P     I++    D   + Q +N+ +++K+     ++E ++K  ++ EE E
Sbjct: 22  SLYHSVKPEPGGGGEIIQAVLVDPGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAE 81

Query: 394 KEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRL--KAEEQARLLEQER 450
           K++   +  Q+ ++++   E+  ++ ++  ++   K++Q E+    +A E     E E 
Sbjct: 82  KQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEA 140



 Score = 39.0 bits (91), Expect = 0.004
 Identities = 19/84 (22%), Positives = 42/84 (50%), Gaps = 4/84 (4%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           +   E+ +Q+ +++    ++  ++ E   +Q   K++Q    K K+  E + K + E + 
Sbjct: 83  QRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAE- 141

Query: 426 LLLKKQQEEDRLKAEEQARLLEQE 449
              KK +EE + +AEE+A+     
Sbjct: 142 ---KKAKEEAKKQAEEEAKAKAAA 162



 Score = 38.3 bits (89), Expect = 0.008
 Identities = 36/175 (20%), Positives = 69/175 (39%), Gaps = 35/175 (20%)

Query: 277 AKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPL 336
           A + E+E +K+ E++ E+ + +   +  + ++LE+R  +E   K+ E   K         
Sbjct: 62  AAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK--------- 112

Query: 337 SPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQ 396
                                    +   + EE K KQ     E K K     E+  KE+
Sbjct: 113 ----------------------QAEEKQKQAEEAKAKQA---AEAKAKAEAEAEKKAKEE 147

Query: 397 IRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQE-EDRLKAEEQARLLEQER 450
            + + E+E   K   + +K+  E ++  +   K + E + + KAEE     E  +
Sbjct: 148 AKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAK 202



 Score = 37.5 bits (87), Expect = 0.012
 Identities = 21/97 (21%), Positives = 49/97 (50%), Gaps = 4/97 (4%)

Query: 360 PIQNSNKEEEEKE--KQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKER 417
             +   K E++ E  +++   E+ ++KELE++   EK   + ++  +  ++K++Q E+ +
Sbjct: 66  EQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAK 125

Query: 418 KEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEE 454
            +Q    K   K + E ++   EE  +  E+E   + 
Sbjct: 126 AKQAAEAK--AKAEAEAEKKAKEEAKKQAEEEAKAKA 160



 Score = 37.5 bits (87), Expect = 0.014
 Identities = 22/93 (23%), Positives = 40/93 (43%), Gaps = 7/93 (7%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           K  ++ E+     EE +++  E K +   E    K + E   +KK +EE +++ +EE   
Sbjct: 102 KAAKQAEQAAKQAEEKQKQAEEAKAKQAAEA---KAKAEAEAEKKAKEEAKKQAEEEA-- 156

Query: 426 LLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
               K   E + KA E  +  E E   +   + 
Sbjct: 157 --KAKAAAEAKKKAAEAKKKAEAEAKAKAEAKA 187



 Score = 36.4 bits (84), Expect = 0.035
 Identities = 39/186 (20%), Positives = 70/186 (37%), Gaps = 34/186 (18%)

Query: 279 QTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSP 338
           Q +++   + E+E++K+  +   + E+ +  E+ R  EL+ +    +             
Sbjct: 56  QQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAA----------K 105

Query: 339 STETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKE-LERKEELE---K 394
             E      +   KQ        +   K+  E + +   + E K KE  +++ E E   K
Sbjct: 106 QAEQAAKQAEEKQKQAE------EAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAK 159

Query: 395 EQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAE--------EQARLL 446
                K++    KKK E E K + E +       K + EE + KAE        E A   
Sbjct: 160 AAAEAKKKAAEAKKKAEAEAKAKAEAKA------KAKAEEAKAKAEAAKAKAAAEAAAKA 213

Query: 447 EQERLM 452
           E E   
Sbjct: 214 EAEAAA 219



 Score = 31.7 bits (72), Expect = 0.79
 Identities = 35/179 (19%), Positives = 66/179 (36%), Gaps = 48/179 (26%)

Query: 276 LAKQTEEESRKRSEEEKEKRKLKDQI-DLEQAQKLEERRLSELKIKEEEIEKKLNGHSDV 334
           L +Q EE  ++R+ E+  +++L+ +    + A++ E+      + +++  E K       
Sbjct: 73  LEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAK------- 125

Query: 335 PLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKE-LERKEELE 393
                                          K+  E + +   + E K KE  +++ E E
Sbjct: 126 ------------------------------AKQAAEAKAKAEAEAEKKAKEEAKKQAEEE 155

Query: 394 ---KEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQE 449
              K     K++    KKK E E K + E +       K + EE + KAE        E
Sbjct: 156 AKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKA------KAKAEEAKAKAEAAKAKAAAE 208



 Score = 31.0 bits (70), Expect = 1.7
 Identities = 39/202 (19%), Positives = 64/202 (31%), Gaps = 47/202 (23%)

Query: 249 NASKENIKPKPNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQK 308
            A +   +        K    RA  E  AKQ E+ +++  E++K+  + K +   E   K
Sbjct: 76  QAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAK 135

Query: 309 LE---ERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSN 365
            E   E++  E   K+ E E                                        
Sbjct: 136 AEAEAEKKAKEEAKKQAEEEA--------------------------------------- 156

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           K +   E ++   E  K+ E E K + E +    K + E  K K E  +   K   E   
Sbjct: 157 KAKAAAEAKKKAAEAKKKAEAEAKAKAEAKA---KAKAEEAKAKAEAAKA--KAAAEAAA 211

Query: 426 LLLKKQQEEDRLKAEEQARLLE 447
               +       +AE +A   E
Sbjct: 212 KAEAEAAAAAAAEAERKADEAE 233


>gnl|CDD|212888 cd11955, SH3_srGAP1-3, Src homology 3 domain of Slit-Robo GTPase
           Activating Proteins 1, 2, and 3.  srGAP1, also called
           Rho GTPase-Activating Protein 13 (ARHGAP13), is a Cdc42-
           and RhoA-specific GAP and is expressed later in the
           development of central nervous system tissues. srGAP2 is
           expressed in zones of neuronal differentiation. It plays
           a role in the regeneration of neurons and axons. srGAP3,
           also called MEGAP (MEntal disorder associated
           GTPase-Activating Protein), is a Rho GAP with activity
           towards Rac1 and Cdc42. It impacts cell migration by
           regulating actin and microtubule cytoskeletal dynamics.
           The association between srGAP3 haploinsufficiency and
           mental retardation is under debate. srGAPs are Rho GAPs
           that interact with Robo1, the transmembrane receptor of
           Slit proteins. Slit proteins are secreted proteins that
           control axon guidance and the migration of neurons and
           leukocytes. srGAPs contain an N-terminal F-BAR domain, a
           Rho GAP domain, and a C-terminal SH3 domain. SH3 domains
           are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 36.5 bits (84), Expect = 0.001
 Identities = 15/50 (30%), Positives = 23/50 (46%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           A+A +DY   +  E+SF     +       + WW G  +G  GL P  Y+
Sbjct: 2   AIAKFDYVGRSARELSFKKGASLLLYHRASDDWWEGRHNGIDGLVPHQYI 51


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 40.8 bits (95), Expect = 0.001
 Identities = 25/94 (26%), Positives = 47/94 (50%), Gaps = 1/94 (1%)

Query: 373 KQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQ 432
           K+R  QE+ KR + + KEEL+K+QI   + Q+     ++  +K+R E  + Q+      +
Sbjct: 204 KERESQEDAKRAQ-QLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPK 262

Query: 433 EEDRLKAEEQARLLEQERLMEELRQQGTNEDTEE 466
             D    +E  ++ E ++   E  Q    ++ EE
Sbjct: 263 PADTSSPKEDKQVAENQKREIEKAQIEIKKNDEE 296



 Score = 40.0 bits (93), Expect = 0.003
 Identities = 39/198 (19%), Positives = 81/198 (40%), Gaps = 38/198 (19%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKK 327
           N R    +L ++  +E  KR+++ KE+   K QID ++AQ+  +        + +E+ +K
Sbjct: 195 NFRRDMTDLKERESQEDAKRAQQLKEELDKK-QIDADKAQQKADFAQDNADKQRDEVRQK 253

Query: 328 LNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELE 387
                ++P    T +P                                    E K+    
Sbjct: 254 QQEAKNLPKPADTSSPK-----------------------------------EDKQVAEN 278

Query: 388 RKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLE 447
           +K E+EK QI IK+  E   K K+ +  + K++ +  +   + ++ E + K E  A   +
Sbjct: 279 QKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAE--D 336

Query: 448 QERLMEELRQQGTNEDTE 465
            ++   ++  Q T+ + +
Sbjct: 337 LQKTKPQVEAQPTSLNED 354



 Score = 31.9 bits (72), Expect = 0.85
 Identities = 29/121 (23%), Positives = 48/121 (39%), Gaps = 18/121 (14%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
           +   K+ +  + Q+         + +R E  +K+Q    E +   K       KE K+  
Sbjct: 221 ELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQ----EAKNLPKPADTSSPKEDKQVA 276

Query: 422 ENQKLLLKKQQEEDRLKAEE--------------QARLLEQERLMEELRQQGTNEDTEED 467
           ENQK  ++K Q E +   EE              +++  E+E   +EL  Q   E   ED
Sbjct: 277 ENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAED 336

Query: 468 L 468
           L
Sbjct: 337 L 337


>gnl|CDD|212851 cd11918, SH3_Vinexin_3, Third (or C-terminal) Src Homology 3 domain
           of Vinexin, also called Sorbin and SH3 domain containing
           3 (Sorbs3).  Vinexin is also called Sorbs3, SH3P3, and
           SH3-containing adapter molecule 1 (SCAM-1). It is an
           adaptor protein containing one sorbin homology (SoHo)
           and three SH3 domains. Vinexin was first identified as a
           vinculin binding protein; it is co-localized with
           vinculin at cell-ECM and cell-cell adhesion sites. There
           are several splice variants of vinexin: alpha, which
           contains the SoHo and three SH3 domains and displays
           tissue-specific expression; and beta, which contains
           only the three SH3 domains and is widely expressed.
           Vinexin alpha stimulates the accumulation of F-actin at
           focal contact sites. Vinexin also promotes keratinocyte
           migration and wound healing. The SH3 domains of vinexin
           have been reported to bind a number of ligands including
           vinculin, WAVE2, DLG5, Abl, and Cbl. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 58

 Score = 36.9 bits (85), Expect = 0.001
 Identities = 16/51 (31%), Positives = 28/51 (54%), Gaps = 2/51 (3%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH--GQYGLFPANYVS 522
           A+Y Y+   +DE+     D +  ++  D+GW+ G      ++G FP NYV+
Sbjct: 6   AVYQYRPQNEDELELREGDRVDVMQQCDDGWFVGVSRRTQKFGTFPGNYVA 56


>gnl|CDD|212949 cd12016, SH3_Tks_2, Second Src homology 3 domain of Tyrosine kinase
           substrate (Tks) proteins.  Tks proteins are Src
           substrates and scaffolding proteins that play important
           roles in the formation of podosomes and invadopodia, the
           dynamic actin-rich structures that are related to cell
           migration and cancer cell invasion. Vertebrates contain
           two Tks proteins, Tks4 (Tyr kinase substrate with four
           SH3 domains) and Tks5 (Tyr kinase substrate with five
           SH3 domains), which display partially overlapping but
           non-redundant functions. Both associate with the ADAMs
           family of transmembrane metalloproteases, which function
           as sheddases and mediators of cell and matrix
           interactions. Tks5 interacts with N-WASP and Nck, while
           Tks4 is essential for the localization of MT1-MMP
           (membrane-type 1 matrix metalloproteinase) to
           invadopodia. Tks proteins contain an N-terminal Phox
           homology (PX) domain and four or five SH3 domains. This
           model characterizes the second SH3 domain of Tks
           proteins. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 54

 Score = 36.3 bits (84), Expect = 0.002
 Identities = 15/49 (30%), Positives = 26/49 (53%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           +    Y+A  +DEI F+   ++  I+   +GWW+    G+ G  PA Y+
Sbjct: 4   ITTQAYKAENEDEIGFETGVVVEVIQKNLDGWWKIRYQGKEGWAPATYL 52


>gnl|CDD|212859 cd11926, SH3_SH3RF1_3, Third Src Homology 3 domain of SH3 domain
           containing ring finger 1, an E3 ubiquitin-protein
           ligase.  SH3RF1 is also called POSH (Plenty of SH3s) or
           SH3MD2 (SH3 multiple domains protein 2). It is a
           scaffold protein that acts as an E3 ubiquitin-protein
           ligase. It plays a role in calcium homeostasis through
           the control of the ubiquitin domain protein Herp. It may
           also have a role in regulating death receptor mediated
           and JNK mediated apoptosis. SH3RF1 also enhances the
           ubiquitination of ROMK1 potassium channel resulting in
           its increased endocytosis. It contains an N-terminal
           RING finger domain and four SH3 domains. This model
           represents the third SH3 domain, located in the middle,
           of SH3RF1. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 36.5 bits (84), Expect = 0.002
 Identities = 16/52 (30%), Positives = 27/52 (51%), Gaps = 2/52 (3%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH--GQYGLFPANYVS 522
           VA+Y Y    +DE+     ++    E   +GW++G      + G+FP NYV+
Sbjct: 3   VAIYPYTPRKEDELELRKGEMFLVFERCQDGWFKGTSMHTSKIGVFPGNYVA 54


>gnl|CDD|213017 cd12141, SH3_DNMBP_C2, Second C-terminal Src homology 3 domain of
           Dynamin Binding Protein, also called Tuba, and similar
           domains.  DNMBP or Tuba is a cdc42-specific guanine
           nucleotide exchange factor (GEF) that contains four
           N-terminal SH3 domains, a central RhoGEF [or Dbl
           homology (DH)] domain followed by a Bin/Amphiphysin/Rvs
           (BAR) domain, and two C-terminal SH3 domains. It
           provides a functional link between dynamin, Rho GTPase
           signaling, and actin dynamics. It plays an important
           role in regulating cell junction configuration. The
           C-terminal SH3 domains of DNMBP bind to N-WASP and
           Ena/VASP proteins, which are key regulatory proteins of
           the actin cytoskeleton. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 57

 Score = 36.3 bits (84), Expect = 0.002
 Identities = 15/52 (28%), Positives = 27/52 (51%), Gaps = 4/52 (7%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEG----WWRGYCHGQYGLFPANYV 521
           A+Y ++A + +E+S   +  +  +E  D      WW    +GQ G  P+NY+
Sbjct: 4   AVYTFKARSPNELSVSANQRVRILEFSDLTGNKEWWLAEANGQKGYVPSNYI 55


>gnl|CDD|212925 cd11992, SH3_Intersectin2_3, Third Src homology 3 domain (or SH3C)
           of Intersectin-2.  Intersectin-2 (ITSN2) is an adaptor
           protein that functions in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction. It
           plays a role in clathrin-coated pit (CCP) formation. It
           binds to many proteins through its multidomain structure
           and facilitate the assembly of multimeric complexes.
           ITSN2 also functions as a specific GEF for Cdc42
           activation in epithelial morphogenesis, and is required
           in mitotic spindle orientation. It exists in
           alternatively spliced short and long isoforms. The short
           isoform contains two Eps15 homology domains (EH1 and
           EH2), a coiled-coil region and five SH3 domains
           (SH3A-E), while the long isoform, in addition, contains
           RhoGEF (also called Dbl-homologous or DH), Pleckstrin
           homology (PH) and C2 domains. The third SH3 domain
           (SH3C) of ITSN2 has been shown to bind the K15 protein
           of Kaposi's sarcoma-associated herpesvirus. SH3 domains
           are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 52

 Score = 36.1 bits (83), Expect = 0.002
 Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           +ALY Y +S   +++F+  + I  +   D  WW G    + G+FP+NYV
Sbjct: 3   IALYPYSSSEPGDLTFNEGEEIL-VTQKDGEWWTGSIEDRTGIFPSNYV 50


>gnl|CDD|212999 cd12066, SH3_GRAF3, Src Homology 3 domain of GTPase Regulator
           Associated with Focal adhesion kinase 3.  GRAF3 is also
           called Rho GTPase activating protein 42 (ARHGAP42) or
           ARHGAP10-like. Though its function has not been
           characterized, it may be a GAP with activity towards
           RhoA and Cdc42, based on its similarity to GRAF and
           GRAF2. It contains an N-terminal BAR domain, followed by
           a Pleckstrin homology (PH) domain, a Rho GAP domain, and
           a C-terminal SH3 domain. The SH3 domain of GRAF and
           GRAF2 binds PKNbeta, a target of the small GTPase Rho.
           SH3 domains bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs;
           they play a role in the regulation of enzymes by
           intramolecular interactions, changing the subcellular
           localization of signal pathway components and mediate
           multiprotein complex assemblies.
          Length = 55

 Score = 36.2 bits (83), Expect = 0.002
 Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 472 AVALYDYQASADDEISFDPDDIITNI-EMIDEGWWRGYCHGQYGLFPANYV 521
           A A+Y  +A    E+SF    I +N+   ++ GW +    G+ GL P NYV
Sbjct: 2   AKAMYSCKAEHSHELSFPQGAIFSNVYPSVEPGWLKATYEGKTGLVPENYV 52


>gnl|CDD|212741 cd11807, SH3_ASPP, Src homology 3 domain of Apoptosis Stimulating
           of p53 proteins (ASPP).  The ASPP family of proteins
           bind to important regulators of apoptosis (p53, Bcl-2,
           and RelA) and cell growth (APCL, PP1). They share
           similarity at their C-termini, where they harbor a
           proline-rich region, four ankyrin (ANK) repeats, and an
           SH3 domain. Vertebrates contain three members of the
           family: ASPP1, ASPP2, and iASPP. ASPP1 and ASPP2
           activate the apoptotic function of the p53 family of
           tumor suppressors (p53, p63, and p73), while iASPP is an
           oncoprotein that specifically inhibits p53-induced
           apoptosis. The expression of ASPP proteins is altered in
           tumors; ASPP1 and ASPP2 are downregulated whereas iASPP
           is upregulated is some cancer types. ASPP proteins also
           bind and regulate protein phosphatase 1 (PP1), and this
           binding is competitive with p53 binding. The SH3 domain
           and the ANK repeats of ASPP contribute to the p53
           binding site; they bind to the DNA binding domain of
           p53. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 57

 Score = 35.8 bits (83), Expect = 0.003
 Identities = 18/53 (33%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEG---WWRGYCHGQYGLFPANYVSL 523
           AL+DY+A   DE+SF   D +T +   D+    WW    + + G  P N + L
Sbjct: 5   ALFDYEAENGDELSFREGDELTVLRKGDDDETEWWWARLNDKEGYVPRNLLGL 57


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 38.7 bits (91), Expect = 0.003
 Identities = 38/123 (30%), Positives = 65/123 (52%), Gaps = 25/123 (20%)

Query: 361 IQNSNKEEEE--KEKQRMVQEEIKR------KEL-ERKEELEKEQIRI--KEEQ-----E 404
           I+ + KE E   KE     +EEI +      +EL ER+ EL++++ R+  KEE      E
Sbjct: 40  IEEAKKEAEALKKEALLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDE 99

Query: 405 NIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLE---------QERLMEEL 455
           +++KK+E  E++ KE    Q+ L +K++E + L  E+Q  L           +E L+EE+
Sbjct: 100 SLEKKEESLEEKEKELAARQQQLEEKEEELEELIEEQQQELERISGLTAEEAKEILLEEV 159

Query: 456 RQQ 458
             +
Sbjct: 160 EAE 162


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 40.0 bits (94), Expect = 0.003
 Identities = 31/256 (12%), Positives = 81/256 (31%), Gaps = 15/256 (5%)

Query: 181  EKIEKHESQKDYAKGFGGKF-GIESDRQDKSAVGWDHVEKVDKHQSQKDDNRQVVTSSKV 239
            E +E+ E  ++       +       +  K                 K   ++   SS  
Sbjct: 1139 EALEEQEEVEEKEIAKEQRLKSKTKGKASKLRK-----------PKLKKKEKKKKKSSAD 1187

Query: 240  KDLIAANSTNASKENIKPKPNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKD 299
            K   A+   N+ + +   K  +     +       +  +  EE+  K  +   ++ K K 
Sbjct: 1188 KSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKK 1247

Query: 300  QIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGI 359
                + ++  +E    +L  + +         +     P     P    +   +P+S   
Sbjct: 1248 NNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTK 1307

Query: 360  PIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKE 419
                   E          +++   K+  RK++ +    +    Q +   ++ +++K    
Sbjct: 1308 KKVKKRLEGSLAAL---KKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSS 1364

Query: 420  QEENQKLLLKKQQEED 435
             E++    +   ++ED
Sbjct: 1365 SEDDDDSEVDDSEDED 1380



 Score = 32.3 bits (74), Expect = 0.81
 Identities = 35/163 (21%), Positives = 72/163 (44%), Gaps = 17/163 (10%)

Query: 287  RSEEEKEKRKLKDQIDLEQAQKLEERRLSEL-KIKEEEI---EKKLNGHSDVPLSPSTET 342
             + E       K  + +++ +KL   R  ++ K K E+I   E++     D       E 
Sbjct: 1022 INGELVITNAKKKDL-VKELKKLGYVRFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEE 1080

Query: 343  PPVPVKS---ILKQP----TSDGIPIQNSNKEEEEKEKQRMVQEEIK---RKELER-KEE 391
                  S   +L  P    T + +   N+  E++EKE +++     K    ++L++ +E 
Sbjct: 1081 ELGAAVSYDYLLSMPIWSLTKEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEA 1140

Query: 392  LEK-EQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQE 433
            LE+ E++  KE  +  + K + + K  K ++   K   KK+++
Sbjct: 1141 LEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKK 1183


>gnl|CDD|212858 cd11925, SH3_SH3RF3_3, Third Src Homology 3 domain of SH3 domain
           containing ring finger 3, an E3 ubiquitin-protein
           ligase.  SH3RF3 is also called POSH2 (Plenty of SH3s 2)
           or SH3MD4 (SH3 multiple domains protein 4). It is a
           scaffold protein with E3 ubiquitin-protein ligase
           activity. It was identified in the screen for
           interacting partners of p21-activated kinase 2 (PAK2).
           It may play a role in regulating JNK mediated apoptosis
           in certain conditions. It also interacts with GTP-loaded
           Rac1. SH3RF3 is highly homologous to SH3RF1; it also
           contains an N-terminal RING finger domain and four SH3
           domains. This model represents the third SH3 domain,
           located in the middle, of SH3RF3. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 57

 Score = 35.7 bits (82), Expect = 0.003
 Identities = 18/52 (34%), Positives = 29/52 (55%), Gaps = 2/52 (3%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYC--HGQYGLFPANYVS 522
           +ALY Y+   +DE+     ++   IE   +GW++G     G  G+FP NYV+
Sbjct: 4   LALYAYKPQKNDELELRKGEMYRVIEKCQDGWFKGTSLRTGVSGVFPGNYVT 55


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 39.8 bits (94), Expect = 0.003
 Identities = 28/163 (17%), Positives = 68/163 (41%), Gaps = 35/163 (21%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKK 327
            L A  E L ++ E+++ +     KE  KLK+++  E+ +KL+E     L+  E+E ++ 
Sbjct: 520 ELIASLEELERELEQKAEEAEALLKEAEKLKEEL-EEKKEKLQEEEDKLLEEAEKEAQQA 578

Query: 328 LNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELE 387
           +                                     K+E ++  + + Q +       
Sbjct: 579 IK----------------------------------EAKKEADEIIKELRQLQKGGYASV 604

Query: 388 RKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKK 430
           +  EL + + R+ +  E  +KKK++++++++E +   ++    
Sbjct: 605 KAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLS 647



 Score = 39.4 bits (93), Expect = 0.005
 Identities = 22/88 (25%), Positives = 47/88 (53%), Gaps = 10/88 (11%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELER-KEELEKEQIRIKEEQENIKKKKEQEEKE---- 416
           +   +E E+K ++     E   KE E+ KEELE+++ +++EE++ + ++ E+E ++    
Sbjct: 526 EELERELEQKAEE----AEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE 581

Query: 417 -RKEQEENQKLLLKKQQEEDRLKAEEQA 443
            +KE +E  K L + Q+         + 
Sbjct: 582 AKKEADEIIKELRQLQKGGYASVKAHEL 609



 Score = 36.0 bits (84), Expect = 0.054
 Identities = 36/171 (21%), Positives = 77/171 (45%), Gaps = 35/171 (20%)

Query: 282 EESRKRSEEEKEKRKLKDQI-DLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPST 340
           EE++K   E+KEK  L + I  LE+ ++  E++  E +   +E EK              
Sbjct: 505 EEAKKLIGEDKEK--LNELIASLEELERELEQKAEEAEALLKEAEKLK------------ 550

Query: 341 ETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRM---VQEEIKRKELERKEELEKEQI 397
                  + + ++        +   +EEE+K  +      Q+ IK  + E  E +++ + 
Sbjct: 551 -------EELEEK--------KEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQ 595

Query: 398 RIKEEQENIKKKKEQEEKER--KEQEENQKLLLKKQQEEDRLKAEEQARLL 446
             K    ++K  +  E ++R  K  E+ +K   K++++++ LK  ++ + L
Sbjct: 596 LQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYL 646



 Score = 30.6 bits (70), Expect = 2.5
 Identities = 23/99 (23%), Positives = 48/99 (48%), Gaps = 17/99 (17%)

Query: 372 EKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQ 431
           E+     E+ K +E E    L KE  ++KEE E        E+KE+ ++EE++ L   ++
Sbjct: 526 EELERELEQ-KAEEAE---ALLKEAEKLKEELE--------EKKEKLQEEEDKLLEEAEK 573

Query: 432 QEEDRLKA--EEQARLLEQERLMEELRQQGTNEDTEEDL 468
           + +  +K   +E   ++   + + +L++ G       +L
Sbjct: 574 EAQQAIKEAKKEADEII---KELRQLQKGGYASVKAHEL 609


>gnl|CDD|212941 cd12008, SH3_Src, Src homology 3 domain of Src Protein Tyrosine
           Kinase.  Src (or c-Src) is a cytoplasmic (or
           non-receptor) PTK and is the vertebrate homolog of the
           oncogenic protein (v-Src) from Rous sarcoma virus.
           Together with other Src subfamily proteins, it is
           involved in signaling pathways that regulate cytokine
           and growth factor responses, cytoskeleton dynamics, cell
           proliferation, survival, and differentiation. Src also
           play a role in regulating cell adhesion, invasion, and
           motility in cancer cells, and tumor vasculature,
           contributing to cancer progression and metastasis.
           Elevated levels of Src kinase activity have been
           reported in a variety of human cancers. Several
           inhibitors of Src have been developed as anti-cancer
           drugs. Src is also implicated in acute inflammatory
           responses and osteoclast function. Src kinases contain
           an N-terminal SH4 domain with a myristoylation site,
           followed by SH3 and SH2 domains, a tyr kinase domain,
           and a regulatory C-terminal region containing a
           conserved tyr. They are activated by autophosphorylation
           at the tyr kinase domain, but are negatively regulated
           by phosphorylation at the C-terminal tyr by Csk
           (C-terminal Src Kinase). The SH3 domain of Src kinases
           contributes to substrate recruitment by binding adaptor
           proteins/substrates, and regulation of kinase activity
           through an intramolecular interaction. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 56

 Score = 35.5 bits (81), Expect = 0.003
 Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 2/54 (3%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGY--CHGQYGLFPANYVS 522
           T VALYDY++  + ++SF   + +  +   +  WW  +    GQ G  P+NYV+
Sbjct: 1   TFVALYDYESRTETDLSFKKGERLQIVNNTEGDWWLAHSLTTGQTGYIPSNYVA 54


>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 548

 Score = 39.8 bits (93), Expect = 0.004
 Identities = 37/195 (18%), Positives = 73/195 (37%), Gaps = 21/195 (10%)

Query: 264 VKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEE 323
           V+    +      A+QT E     +E E E    K +    + +  +   L+E  I+EE+
Sbjct: 248 VELEVEQQPAGKTAEQTREVKIILAETEAEVAAWKAE---TRREAEQAEILAEQAIQEEK 304

Query: 324 IEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKR 383
            + +     +V  + + E   + V  I +Q  ++   ++   +       QR  QEE   
Sbjct: 305 AQAEQ----EVQHAKALEAREMRVGLIERQKETE---LEPQERSYFINAAQRQAQEEA-- 355

Query: 384 KELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ---EENQKLLLKKQQE-EDRLKA 439
                K      +    + +  ++  +E EE ER EQ       +   ++Q E   R +A
Sbjct: 356 -----KAAANIAEAIGAQAEAAVETARETEEAERAEQAALVAAAEAAEQEQVEIAVRAEA 410

Query: 440 EEQARLLEQERLMEE 454
            +     +   +  E
Sbjct: 411 AKAEAEAQAAEIKAE 425



 Score = 33.3 bits (76), Expect = 0.36
 Identities = 20/116 (17%), Positives = 44/116 (37%), Gaps = 14/116 (12%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKE--------QIRIKEEQENIKKKKEQ 412
            + +  E+E +  + +   E++   +ER++E E E            ++ QE  K     
Sbjct: 302 EEKAQAEQEVQHAKALEAREMRVGLIERQKETELEPQERSYFINAAQRQAQEEAKAAANI 361

Query: 413 EEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLE--QERLMEELRQQGTNEDTEE 466
            E    + E      ++  +E +  +  EQA L+   +    E++      E  + 
Sbjct: 362 AEAIGAQAEAA----VETARETEEAERAEQAALVAAAEAAEQEQVEIAVRAEAAKA 413



 Score = 32.5 bits (74), Expect = 0.64
 Identities = 29/200 (14%), Positives = 66/200 (33%), Gaps = 18/200 (9%)

Query: 304 EQAQKLEERRL--SELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPI 361
             AQ L++  +  +E + + E    + N  + +      + P        ++        
Sbjct: 215 RIAQVLQDAEIAENEAEKETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKIILAET 274

Query: 362 QN--SNKEEEEKEKQRMVQEEIKRKELERKEELEKE-QIRIKEEQEN-----IKKKKEQE 413
           +   +  + E + +    +   ++   E K + E+E Q     E        I+++KE E
Sbjct: 275 EAEVAAWKAETRREAEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIERQKETE 334

Query: 414 EKER-------KEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLME-ELRQQGTNEDTE 465
            + +         Q + Q+         + + A+ +A +       E E  +Q       
Sbjct: 335 LEPQERSYFINAAQRQAQEEAKAAANIAEAIGAQAEAAVETARETEEAERAEQAALVAAA 394

Query: 466 EDLGYTAVALYDYQASADDE 485
           E      V +     +A  E
Sbjct: 395 EAAEQEQVEIAVRAEAAKAE 414



 Score = 32.1 bits (73), Expect = 0.73
 Identities = 23/140 (16%), Positives = 50/140 (35%), Gaps = 6/140 (4%)

Query: 359 IPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERK 418
           I I  +N++ +  E +   Q   K  E  R+ ++      I  E E      + E +   
Sbjct: 236 IAIAEANRDAKLVELEVEQQPAGKTAEQTREVKI------ILAETEAEVAAWKAETRREA 289

Query: 419 EQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDLGYTAVALYDY 478
           EQ E       ++++    +  + A+ LE   +   L ++    + E       +     
Sbjct: 290 EQAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIERQKETELEPQERSYFINAAQR 349

Query: 479 QASADDEISFDPDDIITNIE 498
           QA  + + + +  + I    
Sbjct: 350 QAQEEAKAAANIAEAIGAQA 369


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 39.5 bits (93), Expect = 0.004
 Identities = 23/81 (28%), Positives = 54/81 (66%), Gaps = 5/81 (6%)

Query: 365 NKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQ 424
             ++  K+ +++V++  K++E E+KE+ +K     K+E+E  ++K+++EE++ +E+EE  
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE-- 461

Query: 425 KLLLKKQQEEDRLKAEEQARL 445
               ++++EE+  K ++QA L
Sbjct: 462 ---AEEEKEEEEEKKKKQATL 479



 Score = 38.4 bits (90), Expect = 0.009
 Identities = 20/68 (29%), Positives = 44/68 (64%)

Query: 364 SNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEEN 423
           + K ++  EK    +EE K+++ ++    +K++   +EE+E  +++KE+EE+E +E++E 
Sbjct: 409 TKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468

Query: 424 QKLLLKKQ 431
           ++   KKQ
Sbjct: 469 EEEKKKKQ 476



 Score = 31.4 bits (72), Expect = 1.1
 Identities = 18/74 (24%), Positives = 42/74 (56%)

Query: 349 SILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKK 408
             L         I+   ++ E+K ++   +++ K    ++KEE E+E+   KEE++  ++
Sbjct: 400 EFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459

Query: 409 KKEQEEKERKEQEE 422
           ++ +EEKE +E+++
Sbjct: 460 EEAEEEKEEEEEKK 473



 Score = 29.5 bits (67), Expect = 4.8
 Identities = 19/69 (27%), Positives = 40/69 (57%), Gaps = 4/69 (5%)

Query: 378 QEEIK----RKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQE 433
           +EEI+     K+  +K +   E+   K E+E  +KKK+    ++KE+EE ++   K++++
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455

Query: 434 EDRLKAEEQ 442
           E+  +  E+
Sbjct: 456 EEEEEEAEE 464


>gnl|CDD|212864 cd11931, SH3_SH3RF3_2, Second Src Homology 3 domain of SH3 domain
           containing ring finger 3, an E3 ubiquitin-protein
           ligase.  SH3RF3 is also called POSH2 (Plenty of SH3s 2)
           or SH3MD4 (SH3 multiple domains protein 4). It is a
           scaffold protein with E3 ubiquitin-protein ligase
           activity. It was identified in the screen for
           interacting partners of p21-activated kinase 2 (PAK2).
           It may play a role in regulating JNK mediated apoptosis
           in certain conditions. It also interacts with GTP-loaded
           Rac1. SH3RF3 is highly homologous to SH3RF1; it also
           contains an N-terminal RING finger domain and four SH3
           domains. This model represents the second SH3 domain,
           located C-terminal of the first SH3 domain at the
           N-terminal half, of SH3RF3. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 35.3 bits (81), Expect = 0.004
 Identities = 19/52 (36%), Positives = 29/52 (55%), Gaps = 4/52 (7%)

Query: 474 ALYDYQASADDE----ISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           ALYD++    D+    ++F  D+I+T I  +DE W  G    + G+FP  YV
Sbjct: 4   ALYDFEIKDKDQDKDCLTFTKDEILTVIRRVDENWAEGMLGDKIGIFPILYV 55


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 39.6 bits (92), Expect = 0.004
 Identities = 34/111 (30%), Positives = 47/111 (42%), Gaps = 9/111 (8%)

Query: 335 PLSPSTETPPVPVKSILKQPTS-DGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELE 393
           P SP T   P PV  +  QPT+   +PI  +N       ++     E KRK   R  EL+
Sbjct: 421 PQSPGTSYGPEPVGPVPPQPTNPYVMPISMANMVYPGHPQEH--GHERKRK---RGGELK 475

Query: 394 KEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQAR 444
           +E I   E  + +KK KE++E   KE E        K+  E   K      
Sbjct: 476 EELI---ETLKLVKKLKEEQESLAKELEATAHKSEIKKIAESEFKNAGAKT 523


>gnl|CDD|212735 cd11801, SH3_JIP1_like, Src homology 3 domain of JNK-interacting
           proteins 1 and 2, and similar domains.  JNK-interacting
           proteins (JIPs) function as scaffolding proteins for
           c-Jun N-terminal kinase (JNK) signaling pathways. They
           bind to components of Mitogen-activated protein kinase
           (MAPK) pathways such as JNK, MKK, and several MAP3Ks
           such as MLK and DLK. There are four JIPs (JIP1-4); all
           contain a JNK binding domain. JIP1 and JIP2 also contain
           SH3 and Phosphotyrosine-binding (PTB) domains. Both are
           highly expressed in the brain and pancreatic beta-cells.
           JIP1 functions as an adaptor linking motor to cargo
           during axonal transport and also is involved in
           regulating insulin secretion. JIP2 form complexes with
           fibroblast growth factor homologous factors (FHFs),
           which facilitates activation of the p38delta MAPK. The
           SH3 domain of JIP1 homodimerizes at the interface
           usually involved in proline-rich ligand recognition,
           despite the lack of this motif in the domain itself. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 35.4 bits (82), Expect = 0.004
 Identities = 19/53 (35%), Positives = 26/53 (49%), Gaps = 2/53 (3%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYC--HGQYGLFPANYV 521
           T  AL+ +    +DEI  D  D +   +  D+ W  G     GQ G+FPA YV
Sbjct: 1   THRALHKFIPRHEDEIELDIGDPVYVEQEADDLWCEGTNLRTGQRGIFPAAYV 53


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 38.9 bits (91), Expect = 0.004
 Identities = 16/52 (30%), Positives = 27/52 (51%), Gaps = 5/52 (9%)

Query: 410 KEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTN 461
           K++EE+ER+ +   +  L   +Q   + KA+E     EQ+   EE R +  N
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAKE-----EQKAESEETRHRAAN 167


>gnl|CDD|220716 pfam10359, Fmp27_WPPW, RNA pol II promoter Fmp27 protein domain.
           Fmp27_WPPW is a conserved domain of a family of proteins
           involved in RNA polymerase II transcription initiation.
           It contains characteristic HQR and WPPW sequence motifs.
           and is towards the C-terminal in members which contain
           Fmp27_SW pfam10305.
          Length = 470

 Score = 39.2 bits (92), Expect = 0.004
 Identities = 30/120 (25%), Positives = 54/120 (45%), Gaps = 11/120 (9%)

Query: 370 EKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLK 429
              K R VQ E+     ER ++LE+   +IK+ +E +   +  + +E +   E  +  L 
Sbjct: 154 GSNKPRRVQRELLE---ERLKQLEE---QIKKLEEKLDDLELNDTEELQSDLEELEEELS 207

Query: 430 KQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDLGYTAVALYDYQASADDEISFD 489
             +E  RL+  E+  LLE     EE   + ++ DTE  +  +  +L    +S +    F 
Sbjct: 208 VLKE--RLEFLEK--LLEDLERSEESSDRSSSTDTESSIADSLSSL-SNASSDESVSDFH 262


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 37.2 bits (87), Expect = 0.005
 Identities = 21/77 (27%), Positives = 42/77 (54%), Gaps = 6/77 (7%)

Query: 360 PIQNSNKEEEEKEKQRM-----VQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEE 414
            ++ +   +   +KQ+      +Q +   KE+++     K Q  +K E E  K++K++  
Sbjct: 48  TLKFTEAVKVSIKKQKKINPKRLQRQA-AKEVKKPGISTKAQQALKLEHERNKQEKKKRS 106

Query: 415 KERKEQEENQKLLLKKQ 431
           KE+KE+E+ +K  LK+Q
Sbjct: 107 KEKKEEEKERKRQLKQQ 123


>gnl|CDD|212855 cd11922, SH3_Sorbs1_2, Second Src Homology 3 domain of Sorbin and
           SH3 domain containing 1 (Sorbs1), also called ponsin.
           Sorbs1 is also called ponsin, SH3P12, or CAP (c-Cbl
           associated protein). It is an adaptor protein containing
           one sorbin homology (SoHo) and three SH3 domains. It
           binds Cbl and plays a major role in regulating the
           insulin signaling pathway by enhancing insulin-induced
           phosphorylation of Cbl. Sorbs1, like vinexin, localizes
           at cell-ECM and cell-cell adhesion sites where it binds
           vinculin, paxillin, and afadin. It may function in the
           control of cell motility. Other interaction partners of
           Sorbs1 include c-Abl, Sos, flotillin, Grb4, ataxin-7,
           filamin C, among others. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 58

 Score = 35.4 bits (81), Expect = 0.005
 Identities = 17/54 (31%), Positives = 29/54 (53%), Gaps = 2/54 (3%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHG--QYGLFPANYVSL 523
           A+A +++      E+SF   + IT +  +DE W+ G   G  + G+FP  YV +
Sbjct: 3   AIAKFNFNGDTQVEMSFRKGERITLLRQVDENWYEGRIPGTSRQGIFPITYVDV 56


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 39.5 bits (92), Expect = 0.005
 Identities = 22/101 (21%), Positives = 42/101 (41%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
            N N  +  +EK    Q++  R EL+ K   +KE++        +K++  QE  +  ++ 
Sbjct: 9   ANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEV 68

Query: 422 ENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNE 462
           +       KQ  E     EE  + ++ E L + +      E
Sbjct: 69  KKSTKEESKQLLEVLKTKEEHQKEIQYEILQKTIPSFEPKE 109



 Score = 29.9 bits (67), Expect = 4.0
 Identities = 37/171 (21%), Positives = 76/171 (44%), Gaps = 38/171 (22%)

Query: 278 KQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLS 337
               +E     +++K + +LK+++                  K+EE+             
Sbjct: 14  NDRMQEKDDEKQDQKNRMELKEKV----------------LDKKEEV------------- 44

Query: 338 PSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKE---ELEK 394
             T+    PVK   +Q + + + I +  K+  ++E ++++ E +K KE  +KE   E+ +
Sbjct: 45  -VTDNVDSPVK---EQSSQENLKIADEVKKSTKEESKQLL-EVLKTKEEHQKEIQYEILQ 99

Query: 395 EQIRIKEEQENIKKKKEQEE-KERKEQEENQKLLLKKQQEEDRLKAEEQAR 444
           + I   E +E+I KK E  + ++ K+Q +  +L   KQ    R   E++ R
Sbjct: 100 KTIPSFEPKESILKKLEDIKPEQAKKQTKLFRLFEPKQLPIYRANGEKELR 150



 Score = 29.1 bits (65), Expect = 6.4
 Identities = 25/139 (17%), Positives = 60/139 (43%), Gaps = 1/139 (0%)

Query: 382 KRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKE-QEENQKLLLKKQQEEDRLKAE 440
           +R+      +  +E+   K++Q+N  + KE+   +++E   +N    +K+Q  ++ LK  
Sbjct: 6   RREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIA 65

Query: 441 EQARLLEQERLMEELRQQGTNEDTEEDLGYTAVALYDYQASADDEISFDPDDIITNIEMI 500
           ++ +   +E   + L    T E+ ++++ Y  +          + I    +DI       
Sbjct: 66  DEVKKSTKEESKQLLEVLKTKEEHQKEIQYEILQKTIPSFEPKESILKKLEDIKPEQAKK 125

Query: 501 DEGWWRGYCHGQYGLFPAN 519
               +R +   Q  ++ AN
Sbjct: 126 QTKLFRLFEPKQLPIYRAN 144



 Score = 29.1 bits (65), Expect = 7.4
 Identities = 19/113 (16%), Positives = 47/113 (41%), Gaps = 3/113 (2%)

Query: 219 KVDKHQSQKDDNRQVVTSSKVKDLIAANSTNASKENIKPKPNIGHVKPSNLRAKFENLAK 278
             +K   ++D   ++    KV D      T+     +K + +  ++K ++   K      
Sbjct: 17  MQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTKEES 76

Query: 279 QTEEESRKRSEEEK---EKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKL 328
           +   E  K  EE +   +   L+  I   + ++   ++L ++K ++ + + KL
Sbjct: 77  KQLLEVLKTKEEHQKEIQYEILQKTIPSFEPKESILKKLEDIKPEQAKKQTKL 129


>gnl|CDD|212998 cd12065, SH3_GRAF2, Src Homology 3 domain of GTPase Regulator
           Associated with Focal adhesion kinase 2.  GRAF2, also
           called Rho GTPase activating protein 10 (ARHGAP10) or
           PS-GAP, is a GAP with activity towards Cdc42 and RhoA.
           It regulates caspase-activated p21-activated protein
           kinase-2 (PAK-2p34). GRAF2 interacts with PAK-2p34,
           leading to its stabilization and decrease of cell death.
           It is highly expressed in skeletal muscle, and is
           involved in alpha-catenin recruitment at cell-cell
           junctions. GRAF2 contains an N-terminal BAR domain,
           followed by a Pleckstrin homology (PH) domain, a Rho GAP
           domain, and a C-terminal SH3 domain. The SH3 domain of
           GRAF binds PKNbeta, a target of the small GTPase Rho.
           SH3 domains bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs;
           they play a role in the regulation of enzymes by
           intramolecular interactions, changing the subcellular
           localization of signal pathway components and mediate
           multiprotein complex assemblies.
          Length = 54

 Score = 35.0 bits (80), Expect = 0.005
 Identities = 19/53 (35%), Positives = 29/53 (54%), Gaps = 1/53 (1%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDE-GWWRGYCHGQYGLFPANYVSL 523
           A A+Y  +A    E+SF+   I  ++ +  E GW  G  +G+ GL P NYV +
Sbjct: 2   AKAVYPCEAEHSSELSFEVGAIFEDVTLSREPGWLEGTLNGKRGLIPENYVEI 54


>gnl|CDD|212886 cd11953, SH3_ASPP2, Src Homology 3 (SH3) domain of Apoptosis
           Stimulating of p53 protein 2.  ASPP2 is the full length
           form of the previously-identified tumor supressor,
           p53-binding protein 2 (p53BP2). ASPP2 activates the
           apoptotic function of the p53 family of tumor
           suppressors (p53, p63, and p73). It plays a central role
           in regulating apoptosis and cell growth; ASPP2-deficient
           mice show postnatal death. Downregulated expression of
           ASPP2 is frequently found in breast tumors, lung cancer,
           and diffuse large B-cell lymphoma where it is correlated
           with a poor clinical outcome. ASPP2 contains a
           proline-rich region, four ankyrin (ANK) repeats, and an
           SH3 domain at its C-terminal half. The SH3 domain and
           the ANK repeats of ASPP2 contribute to the p53 binding
           site; they bind to the DNA binding domain of p53. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 57

 Score = 34.9 bits (80), Expect = 0.006
 Identities = 19/53 (35%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEG---WWRGYCHGQYGLFPANYVSL 523
           AL+DY+  +DDE+SF   D +T +   DE    WW    + + G  P N + L
Sbjct: 5   ALWDYEGESDDELSFKEGDCMTILRREDEDETEWWWARLNDKEGYVPRNLLGL 57


>gnl|CDD|212907 cd11974, SH3_ASEF2, Src homology 3 domain of APC-Stimulated guanine
           nucleotide Exchange Factor 2.  ASEF2, also called
           Spermatogenesis-associated protein 13 (SPATA13), is a
           GEF that localizes with actin at the leading edge of
           cells and is important in cell migration and adhesion
           dynamics. GEFs activate small GTPases by exchanging
           bound GDP for free GTP. ASEF2 can activate both Rac 1
           and Cdc42, but only Rac1 activation is necessary for
           increased cell migration and adhesion turnover. Together
           with APC (adenomatous polyposis coli) and Neurabin2, a
           scaffold protein that binds F-actin, it is involved in
           regulating HGF-induced cell migration. ASEF2 contains a
           SH3 domain followed by RhoGEF (also called
           Dbl-homologous or DH) and Pleckstrin Homology (PH)
           domains. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 54

 Score = 34.7 bits (79), Expect = 0.006
 Identities = 18/52 (34%), Positives = 29/52 (55%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           A AL+D+    D E++F   D+I  +E  ++ WW G    +   FPA++V L
Sbjct: 3   AEALWDHVTMDDQELAFKAGDVIRVLEASNKDWWWGRNEDREAWFPASFVRL 54


>gnl|CDD|212727 cd11793, SH3_ephexin1_like, Src homology 3 domain of ephexin-1-like
           SH3 domain containing Rho guanine nucleotide exchange
           factors.  Members of this family contain RhoGEF (also
           called Dbl-homologous or DH), Pleckstrin Homology (PH),
           and C-terminal SH3 domains. They include the Rho guanine
           nucleotide exchange factors ARHGEF5, ARHGEF16, ARHGEF19,
           ARHGEF26, ARHGEF27 (also called ephexin-1), and similar
           proteins, and are also called ephexins because they
           interact directly with ephrin A receptors. GEFs interact
           with Rho GTPases via their DH domains to catalyze
           nucleotide exchange by stabilizing the nucleotide-free
           GTPase intermediate. They play important roles in
           neuronal development. The SH3 domains of ARHGEFs play an
           autoinhibitory role through intramolecular interactions
           with a proline-rich region N-terminal to the DH domain.
           SH3 domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 34.6 bits (80), Expect = 0.006
 Identities = 13/52 (25%), Positives = 27/52 (51%), Gaps = 2/52 (3%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRG--YCHGQYGLFPANYV 521
              ++ Y A   DE++ +  D++  +  + +GW+ G     G+ G FP++Y 
Sbjct: 2   VQCVHAYTAQQPDELTLEEGDVVNVLRKMPDGWYEGERLRDGERGWFPSSYT 53


>gnl|CDD|212837 cd11904, SH3_Nck1_3, Third Src Homology 3 domain of Nck1 adaptor
           protein.  Nck1 (also called Nckalpha) plays a crucial
           role in connecting signaling pathways of tyrosine kinase
           receptors and important effectors in actin dynamics and
           cytoskeletal remodeling. It binds and activates RasGAP,
           resulting in the downregulation of Ras. It is also
           involved in the signaling of endothilin-mediated
           inhibition of cell migration. Nck adaptor proteins
           regulate actin cytoskeleton dynamics by linking
           proline-rich effector molecules to protein tyrosine
           kinases and phosphorylated signaling intermediates. They
           contain three SH3 domains and a C-terminal SH2 domain.
           They function downstream of the PDGFbeta receptor and
           are involved in Rho GTPase signaling and actin dynamics.
           Vertebrates contain two Nck adaptor proteins: Nck1 (also
           called Nckalpha) and Nck2, which show partly overlapping
           functions but also bind distinct targets. The third SH3
           domain of Nck appears to prefer ligands with a PxAPxR
           motif. SH3 domains are protein interaction domains that
           usually bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially a PxxP motif.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 57

 Score = 34.6 bits (79), Expect = 0.008
 Identities = 20/57 (35%), Positives = 34/57 (59%), Gaps = 3/57 (5%)

Query: 470 YTAVALYDYQASADDEISFDPDDIITNIEMI--DEGWWR-GYCHGQYGLFPANYVSL 523
           +   ALY + +S D+E++F+  +++  IE    D  WW+    +GQ GL P NYV++
Sbjct: 1   HVVQALYPFSSSNDEELNFEKGEVMDVIEKPENDPEWWKCRKANGQVGLVPKNYVTV 57


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 38.5 bits (90), Expect = 0.008
 Identities = 21/102 (20%), Positives = 42/102 (41%), Gaps = 3/102 (2%)

Query: 367 EEEEKEKQRMVQEEIKRK-ELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
            E+    QR    E++R  E+ERK  L K + +IK+    +K    + +    ++  + +
Sbjct: 16  NEDVSAFQRKFVNEVRRCDEVERK--LRKLESKIKKLGIPLKDTGGKPDVPPSKEFLDLE 73

Query: 426 LLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEED 467
             +   + E +   E    L ++   +EE       E +  D
Sbjct: 74  EEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLD 115



 Score = 36.2 bits (84), Expect = 0.043
 Identities = 41/228 (17%), Positives = 79/228 (34%), Gaps = 32/228 (14%)

Query: 250 ASKENIKPKPNIGHVKPSNLRAKFENLAKQTEE-ESRKRSEEEKEKRKLKDQIDL--EQA 306
             K  I  K   G       + +F +L ++  + E+  +  EE     L+ +I+   E  
Sbjct: 47  IKKLGIPLKDTGGKPDVPPSK-EFLDLEEEILDLEAEIKEVEEN-LESLEKEINELEEWL 104

Query: 307 QKLEERRLSELKIKEEEIE--------KKLNGHSDVPLSPSTET-PPVPVKSILKQPTSD 357
             L+E +    +  EE  E        K L G   + L           +++  ++    
Sbjct: 105 NVLDEEKSFLDENLEELSELSNLDIDFKYLRGAEGLKLGFVAGVINREKLEA-FERELWR 163

Query: 358 ---GIPIQNSNKEEEEKEKQRMV------QEEIKRKELERKEELEKEQIRIKE------E 402
              G  I+ +  EE  ++ ++ V       +E   K  +  +    E   + E      E
Sbjct: 164 ACRGY-IRQAEIEEPLEDPKKTVFIIFFVGKEDLDKVKKILDSFGFELYDVPETEGERSE 222

Query: 403 QENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQER 450
             +   K+ +E +   EQ E   L     +  D L A ++    E+  
Sbjct: 223 LISKVNKRIEELQRVLEQTE-SHLEKVLVKIADELLAWDEQVSKEKAV 269



 Score = 35.0 bits (81), Expect = 0.091
 Identities = 43/174 (24%), Positives = 62/174 (35%), Gaps = 36/174 (20%)

Query: 309 LEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEE 368
            E RR  E++ K  ++E K+     +PL  +   P VP                    EE
Sbjct: 28  NEVRRCDEVERKLRKLESKIKKL-GIPLKDTGGKPDVPPSKEFLD------------LEE 74

Query: 369 EEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLL 428
           E  +    ++ EIK  E E  E LEKE I   EE  N+  +++    E  E+      L 
Sbjct: 75  EILD----LEAEIKEVE-ENLESLEKE-INELEEWLNVLDEEKSFLDENLEELSELSNLD 128

Query: 429 KK----QQEE-----------DRLKAEEQARLLEQERLMEELRQQGTNEDTEED 467
                 +  E           +R K E   R L   R      +Q   E+  ED
Sbjct: 129 IDFKYLRGAEGLKLGFVAGVINREKLEAFEREL--WRACRGYIRQAEIEEPLED 180


>gnl|CDD|212830 cd11897, SH3_SNX18, Src Homology 3 domain of Sorting nexin 18.
           SNX18 is localized to peripheral endosomal structures,
           and acts in a trafficking pathway that is
           clathrin-independent but relies on AP-1 and PACS1. It
           binds FIP5 and is required for apical lumen formation.
           It may also play a role in axonal elongation. SNXs are
           Phox homology (PX) domain containing proteins that are
           involved in regulating membrane traffic and protein
           sorting in the endosomal system. SNX18 also contains BAR
           and SH3 domains. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 34.6 bits (79), Expect = 0.008
 Identities = 21/54 (38%), Positives = 30/54 (55%), Gaps = 2/54 (3%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMID-EGWWRGY-CHGQYGLFPANYVSL 523
           A ALYD+++    EIS    ++++     D EGW  G    G  GLFPA+YV +
Sbjct: 2   ARALYDFRSENPGEISLREHEVLSLCSEQDIEGWLEGVNSRGDRGLFPASYVEV 55


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 38.5 bits (90), Expect = 0.009
 Identities = 21/73 (28%), Positives = 34/73 (46%), Gaps = 2/73 (2%)

Query: 359  IPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERK 418
            I I+N N+EE+E   Q    E  K K+   +  L  ++  I+E+      KK + +K+ K
Sbjct: 1518 IEIENRNQEEKEPAGQGE-LESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYK 1576

Query: 419  EQ-EENQKLLLKK 430
               E      LK+
Sbjct: 1577 SNTEAELDFFLKR 1589



 Score = 32.8 bits (75), Expect = 0.64
 Identities = 23/89 (25%), Positives = 44/89 (49%), Gaps = 8/89 (8%)

Query: 363  NSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQI------RIKEEQENIKKKKEQEEKE 416
            N NK+  +K+K  +   E K  E+E + + EKE          KE++ N++     +EK 
Sbjct: 1497 NVNKKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKN 1556

Query: 417  RKEQEENQKLLLKKQQEEDRLKAEEQARL 445
             +E     +  +KK++ + + K+  +A L
Sbjct: 1557 IEEDYA--ESDIKKRKNKKQYKSNTEAEL 1583


>gnl|CDD|212800 cd11866, SH3_SKAP1-like, Src Homology 3 domain of Src
           Kinase-Associated Phosphoprotein 1 and similar proteins.
            This subfamily is composed of SKAP1, SKAP2, and similar
           proteins. SKAP1 and SKAP2 are immune cell-specific
           adaptor proteins that play roles in T- and B-cell
           adhesion, respectively, and are thus important in the
           migration of T- and B-cells to sites of inflammation and
           for movement during T-cell conjugation with
           antigen-presenting cells. Both SKAP1 and SKAP2 bind to
           ADAP (adhesion and degranulation-promoting adaptor
           protein), among many other binding partners. They
           contain a pleckstrin homology (PH) domain, a C-terminal
           SH3 domain, and several tyrosine phosphorylation sites.
           The SH3 domain of SKAP1 is necessary for its ability to
           regulate T-cell conjugation with antigen-presenting
           cells and the formation of LFA-1 clusters. SKAP1 binds
           primarily to a proline-rich region of ADAP through its
           SH3 domain; its degradation is regulated by ADAP. A
           secondary interaction occurs via the ADAP SH3 domain and
           the RKxxYxxY motif in SKAP1. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 53

 Score = 34.3 bits (79), Expect = 0.009
 Identities = 19/50 (38%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query: 474 ALYDYQASADDEISFDPDDIITNI--EMIDEGWWRGYCHGQYGLFPANYV 521
            L+D   +  DE+SF   D+I  I  E    GWW G  +G+ GL P +Y+
Sbjct: 4   GLWDCSGNEPDELSFKRGDLIYIISKEYDSFGWWVGELNGKVGLVPKDYL 53


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 38.5 bits (90), Expect = 0.010
 Identities = 42/201 (20%), Positives = 76/201 (37%), Gaps = 7/201 (3%)

Query: 267  SNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEK 326
            S+L+     L  +  E  +  S +  E  + K ++     + L    L E    E     
Sbjct: 895  SSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLP 954

Query: 327  KLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEE--KEKQRMVQEEIKRK 384
            +LN   +V       +     + +LK+ T   I ++  NK   E    K+ + +   +  
Sbjct: 955  ELNKLHEVESKLKETSEEY--EDLLKKST---ILVREGNKANSELKNFKKELAELSKQYG 1009

Query: 385  ELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQAR 444
             L+   +  KE      E ++  K    E  E    +  QKL      E ++L+A  +A 
Sbjct: 1010 ALQESTKQLKELPVEVAELQSASKIISSESTELSILKPLQKLKGLLLLENNQLQARYKAL 1069

Query: 445  LLEQERLMEELRQQGTNEDTE 465
             L +E  + + +Q    E TE
Sbjct: 1070 KLRRENSLLDDKQLYQLESTE 1090



 Score = 33.5 bits (77), Expect = 0.30
 Identities = 36/175 (20%), Positives = 63/175 (36%), Gaps = 14/175 (8%)

Query: 287  RSEEEKEKRKL--KDQIDLEQAQKLE--ERRLSELKIKEEEIEKKLNGHSDVPLSPSTET 342
            RS + K++  L  K+ I L+ AQ++E  ER+L ELKI  + I                  
Sbjct: 853  RSLKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKSISS----------LKLVNL 902

Query: 343  PPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEE 402
                    LK+  S  +      K E     ++++      +    +     E  ++ E 
Sbjct: 903  ELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEV 962

Query: 403  QENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQ 457
            +  +K+  E+ E   K+     +   K   E    K E      +   L E  +Q
Sbjct: 963  ESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQ 1017


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 37.9 bits (88), Expect = 0.011
 Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 2/89 (2%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
           +++ N E    E      +E+K++E    +ELE+ +    +    +   + QEEKE+ E 
Sbjct: 55  LESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGEL--VELQEEKEQLEN 112

Query: 421 EENQKLLLKKQQEEDRLKAEEQARLLEQE 449
           EE Q L      + + L+ E+  + LE +
Sbjct: 113 EELQYLREYNLFDRNNLQLEDNLQSLELQ 141



 Score = 34.5 bits (79), Expect = 0.14
 Identities = 23/83 (27%), Positives = 45/83 (54%), Gaps = 3/83 (3%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           ++E +  KQ + + E +  E+   E L+ E   +K+E+E +  + E+ EKE  +      
Sbjct: 42  EKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDL---DG 98

Query: 426 LLLKKQQEEDRLKAEEQARLLEQ 448
            L++ Q+E+++L+ EE   L E 
Sbjct: 99  ELVELQEEKEQLENEELQYLREY 121


>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2).  All proteins
           in this family for which functions are known are flap
           endonucleases that generate the 3' incision next to DNA
           damage as part of nucleotide excision repair. This
           family is related to many other flap endonuclease
           families including the fen1 family. This family is based
           on the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1034

 Score = 38.0 bits (88), Expect = 0.013
 Identities = 44/254 (17%), Positives = 89/254 (35%), Gaps = 25/254 (9%)

Query: 200 FGIESDRQDKSAV---------GWDHVEKVDKHQSQKDDNRQVVTSSKVKDLIAANSTNA 250
           FG+ S R+DK A+         G     +  + Q++          S +    ++++   
Sbjct: 527 FGLPSQREDKLAIPTEGTQNLQGISDHPEQFEFQNELSPLETKNNESNL----SSDAETE 582

Query: 251 SKENIKPKPNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLE 310
              N +         PS     +E       +E R  +E   +   + +  DL       
Sbjct: 583 GSPNPEMPSWSSVTVPSEALDNYETTNPSNAKEVRNFAETGIQTTNVGESADLLLISNPM 642

Query: 311 ERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEE 370
           E    E + +E E +       +V    ST    VP KS     + +      +     E
Sbjct: 643 EVEPMESEKEESESDGSFI---EVDSVSSTLELQVPSKSQPTDESEENAE---NKVASIE 696

Query: 371 KEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKK 430
            E ++ +++      L   E  E   + + EE+++    K + +    E+ E  +  L  
Sbjct: 697 GEHRKEIED------LLFDESEEDNIVGMIEEEKDADDFKNEWQDISLEELEALEANLLA 750

Query: 431 QQEEDRLKAEEQAR 444
           +Q   + + ++Q R
Sbjct: 751 EQNSLKAQKQQQKR 764


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 38.2 bits (89), Expect = 0.013
 Identities = 34/202 (16%), Positives = 71/202 (35%), Gaps = 15/202 (7%)

Query: 291 EKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSI 350
           +   R LK    L++  +       E++  E+ ++ +L            +      +  
Sbjct: 730 KLFGRFLKAFNALKKLYEFAYTLEEEIQYLEDLLKARLEKDYQSKYLDLIDNLNDARE-- 787

Query: 351 LKQPTSDGIPIQNSNKEEEEKEKQRMVQEEI-------KRKELERKEELEKEQIRIKEEQ 403
           +K   S  I + + + E E   KQ +  + I         K    +   E  +  IKE  
Sbjct: 788 IKDKESKAIDLDDIDFELELIGKQEINIDYILELLQTFNDKNGAYESLKELIERIIKEWI 847

Query: 404 ENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNED 463
           E++++KK+  E       E       K  + D  +  E+  +L ++    +   +    +
Sbjct: 848 EDLRQKKKLIE----RLIEAINQYRAK--KLDTAEKLEELYILAKKEEEFKQFAEEEGLN 901

Query: 464 TEEDLGYTAVALYDYQASADDE 485
            EE   Y  +AL   +   +  
Sbjct: 902 EEELAFYDDLALNGGKLPENGT 923


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 38.1 bits (89), Expect = 0.013
 Identities = 22/81 (27%), Positives = 46/81 (56%), Gaps = 1/81 (1%)

Query: 387 ERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLL 446
           ER EELE E   ++EE E ++++ E+ E   + ++  ++L  +++  E+ + AE +  + 
Sbjct: 475 ERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELI-AERRETIE 533

Query: 447 EQERLMEELRQQGTNEDTEED 467
           E+    EELR++    + E +
Sbjct: 534 EKRERAEELRERAAELEAEAE 554


>gnl|CDD|212792 cd11858, SH3_Myosin-I_fungi, Src homology 3 domain of Type I fungal
           Myosins.  Type I myosins (myosin-I) are actin-dependent
           motors in endocytic actin structures and actin patches.
           They play roles in membrane traffic in endocytic and
           secretory pathways, cell motility, and mechanosensing.
           Saccharomyces cerevisiae has two myosins-I, Myo3 and
           Myo5, which are involved in endocytosis and the
           polarization of the actin cytoskeleton. Myosin-I
           contains an N-terminal actin-activated ATPase, a
           phospholipid-binding TH1 (tail homology 1) domain, and a
           C-terminal extension which includes an F-actin-binding
           TH2 domain, an SH3 domain, and an acidic peptide that
           participates in activating the Arp2/3complex. The SH3
           domain of myosin-I is required for myosin-I-induced
           actin polymerization. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 33.9 bits (78), Expect = 0.013
 Identities = 15/35 (42%), Positives = 21/35 (60%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWW 505
           T  ALYD+  S  +E+S   DDI+  ++  D GWW
Sbjct: 1   TYKALYDFAGSVANELSLKKDDIVYIVQKEDNGWW 35


>gnl|CDD|212839 cd11906, SH3_BTK, Src Homology 3 domain of Bruton's tyrosine
           kinase.  BTK is a cytoplasmic (or nonreceptor) tyr
           kinase containing Src homology protein interaction
           domains (SH3, SH2) N-terminal to the catalytic tyr
           kinase domain. It also contains an N-terminal pleckstrin
           homology (PH) domain, which binds the products of PI3K
           and allows membrane recruitment and activation, and the
           Tec homology (TH) domain with proline-rich and
           zinc-binding regions. Btk is expressed in B-cells, and a
           variety of myeloid cells including mast cells,
           platelets, neutrophils, and dendrictic cells. It
           interacts with a variety of partners, from cytosolic
           proteins to nuclear transcription factors, suggesting a
           diversity of functions. Stimulation of a diverse array
           of cell surface receptors, including antigen engagement
           of the B-cell receptor (BCR), leads to PH-mediated
           membrane translocation of Btk and subsequent
           phosphorylation by Src kinase and activation. Btk plays
           an important role in the life cycle of B-cells including
           their development, differentiation, proliferation,
           survival, and apoptosis. Mutations in Btk cause the
           primary immunodeficiency disease, X-linked
           agammaglobulinaemia (XLA) in humans. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 55

 Score = 33.6 bits (77), Expect = 0.014
 Identities = 16/52 (30%), Positives = 24/52 (46%), Gaps = 1/52 (1%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGY-CHGQYGLFPANYV 521
             VALYDY      ++     +    +E  +  WWR    +G+ G  P+NYV
Sbjct: 2   KVVALYDYTPMNAQDLQLRKGEEYVILEESNLPWWRARDKNGREGYIPSNYV 53


>gnl|CDD|212728 cd11794, SH3_DNMBP_N1, First N-terminal Src homology 3 domain of
           Dynamin Binding Protein, also called Tuba.  DNMBP or
           Tuba is a cdc42-specific guanine nucleotide exchange
           factor (GEF) that contains four N-terminal SH3 domains,
           a central RhoGEF [or Dbl homology (DH)] domain followed
           by a Bin/Amphiphysin/Rvs (BAR) domain, and two
           C-terminal SH3 domains. It provides a functional link
           between dynamin and key regulatory proteins of the actin
           cytoskeleton. It plays an important role in regulating
           cell junction configuration. The four N-terminal SH3
           domains of DNMBP binds the GTPase dynamin, which plays
           an important role in the fission of endocytic vesicles.
           SH3 domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 51

 Score = 33.6 bits (77), Expect = 0.016
 Identities = 15/51 (29%), Positives = 28/51 (54%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
              A++D+  S  +E+     D+I  ++++DE W  G   G  G FP+++V
Sbjct: 1   VVRAIFDFCPSVSEELPLFAGDVIEVLKVVDEFWLLGTKEGVTGQFPSSFV 51


>gnl|CDD|212856 cd11923, SH3_Sorbs2_2, Second Src Homology 3 domain of Sorbin and
           SH3 domain containing 2 (Sorbs2), also called
           Arg-binding protein 2 (ArgBP2).  Sorbs2 or ArgBP2 is an
           adaptor protein containing one sorbin homology (SoHo)
           and three SH3 domains. It regulates actin-dependent
           processes including cell adhesion, morphology, and
           migration. It is expressed in many tissues and is
           abundant in the heart. Like vinexin, it is found in
           focal adhesion where it interacts with vinculin and
           afadin. It also localizes in epithelial cell stress
           fibers and in cardiac muscle cell Z-discs. Sorbs2 has
           been implicated to play roles in the signaling of c-Arg,
           Akt, and Pyk2. Other interaction partners of Sorbs2
           include c-Abl, flotillin, spectrin, dynamin 1/2,
           synaptojanin, PTP-PEST, among others. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 57

 Score = 33.7 bits (77), Expect = 0.016
 Identities = 17/54 (31%), Positives = 31/54 (57%), Gaps = 2/54 (3%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHG--QYGLFPANYVSL 523
           AVA Y++ A  + E+S    D +  ++ +D+ W+ G   G  + G+FP +YV +
Sbjct: 3   AVAKYNFNADTNVELSLRKGDRVVLLKQVDQNWYEGKIPGTNRQGIFPVSYVEV 56


>gnl|CDD|212865 cd11932, SH3_SH3RF2_2, Second Src Homology 3 domain of SH3 domain
           containing ring finger 2.  SH3RF2 is also called POSHER
           (POSH-eliminating RING protein) or HEPP1 (heart protein
           phosphatase 1-binding protein). It acts as an
           anti-apoptotic regulator of the JNK pathway by binding
           to and promoting the degradation of SH3RF1 (or POSH), a
           scaffold protein that is required for pro-apoptotic JNK
           activation. It may also play a role in cardiac functions
           together with protein phosphatase 1. SH3RF2 contains an
           N-terminal RING finger domain and three SH3 domains.
           This model represents the second SH3 domain, located
           C-terminal of the first SH3 domain at the N-terminal
           half, of SH3RF2. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 57

 Score = 33.7 bits (77), Expect = 0.017
 Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 6/54 (11%)

Query: 474 ALYDY------QASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           ALY++      +  + D + F  DDIIT I  +DE W  G    Q G+FP  +V
Sbjct: 4   ALYNFDLKEKNREESKDCLKFQKDDIITVISRVDENWAEGKLGDQVGIFPILFV 57


>gnl|CDD|212838 cd11905, SH3_Tec, Src Homology 3 domain of Tec (Tyrosine kinase
           expressed in hepatocellular carcinoma).  Tec is a
           cytoplasmic (or nonreceptor) tyr kinase containing Src
           homology protein interaction domains (SH3, SH2)
           N-terminal to the catalytic tyr kinase domain. It also
           contains an N-terminal pleckstrin homology (PH) domain,
           which binds the products of PI3K and allows membrane
           recruitment and activation, and the Tec homology (TH)
           domain, which contains proline-rich and zinc-binding
           regions. It is more widely-expressed than other Tec
           subfamily kinases. Tec is found in endothelial cells,
           both B- and T-cells, and a variety of myeloid cells
           including mast cells, erythroid cells, platelets,
           macrophages and neutrophils. Tec is a key component of
           T-cell receptor (TCR) signaling, and is important in
           TCR-stimulated proliferation, IL-2 production and
           phospholipase C-gamma1 activation. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 56

 Score = 33.6 bits (77), Expect = 0.018
 Identities = 15/53 (28%), Positives = 28/53 (52%), Gaps = 1/53 (1%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGY-CHGQYGLFPANYVS 522
             VA+YD+Q +   ++  +  +    +E  D  WW+    +G+ G  P+NYV+
Sbjct: 2   IVVAMYDFQPTEPHDLRLETGEEYVILEKNDVHWWKARDKYGKEGYIPSNYVT 54


>gnl|CDD|212744 cd11810, SH3_RUSC1_like, Src homology 3 domain of RUN and SH3
           domain-containing proteins 1 and 2.  RUSC1 and RUSC2,
           that were originally characterized in silico. They are
           adaptor proteins consisting of RUN, leucine zipper, and
           SH3 domains. RUSC1, also called NESCA (New molecule
           containing SH3 at the carboxy-terminus), is highly
           expressed in the brain and is translocated to the
           nuclear membrane from the cytoplasm upon stimulation
           with neurotrophin. It plays a role in facilitating
           neurotrophin-dependent neurite outgrowth. It also
           interacts with NEMO (or IKKgamma) and may function in
           NEMO-mediated activation of NF-kB. RUSC2, also called
           Iporin, is expressed ubiquitously with highest amounts
           in the brain and testis. It interacts with the small
           GTPase Rab1 and the Golgi matrix protein GM130, and may
           function in linking GTPases to certain intracellular
           signaling pathways. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 50

 Score = 33.2 bits (76), Expect = 0.019
 Identities = 14/47 (29%), Positives = 22/47 (46%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANY 520
           AL  + A+   ++SF   DI+  I  +D+ W         GL P +Y
Sbjct: 4   ALCHHVATDSGQLSFRKGDILRVIARVDDDWLLCTRGSTKGLVPLSY 50


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 37.5 bits (88), Expect = 0.020
 Identities = 23/100 (23%), Positives = 43/100 (43%), Gaps = 2/100 (2%)

Query: 345 VPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQE 404
           V  K +L++    GI +++ +   EE+E ++  +E  +  E E K E E+      EE+ 
Sbjct: 15  VSSKELLEKLKELGIEVKSHSSTVEEEEARK--EEAKREAEEEAKAEAEEAAAAEAEEEA 72

Query: 405 NIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQAR 444
             +       +E  E     +   +  ++E    AE  AR
Sbjct: 73  KAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAAR 112



 Score = 35.2 bits (82), Expect = 0.085
 Identities = 23/132 (17%), Positives = 48/132 (36%)

Query: 270 RAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLN 329
           + + E  AK   EE+     EE+ K +       E+A +      +  +  E+E  +   
Sbjct: 49  KREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAE 108

Query: 330 GHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERK 389
             +  P +          K   K+P           K  + + ++R  +   K+K+ ++ 
Sbjct: 109 AAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKP 168

Query: 390 EELEKEQIRIKE 401
            E    ++ I E
Sbjct: 169 TEKIPREVVIPE 180



 Score = 31.7 bits (73), Expect = 1.2
 Identities = 21/134 (15%), Positives = 54/134 (40%), Gaps = 1/134 (0%)

Query: 288 SEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVP-LSPSTETPPVP 346
            EEE  K + K + + E   + EE   +E + + +          +    + + E    P
Sbjct: 39  EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARP 98

Query: 347 VKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENI 406
            +    +P         + K  ++K+  +  +++ KRK     +  +  + R +      
Sbjct: 99  AEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRR 158

Query: 407 KKKKEQEEKERKEQ 420
           ++KK++++K  ++ 
Sbjct: 159 RRKKKKKQKPTEKI 172


>gnl|CDD|212878 cd11945, SH3_Endophilin_B1, Src homology 3 domain of Endophilin-B1.
            Endophilin-B1, also called Bax-interacting factor 1
           (Bif-1) or SH3GLB1 (SH3-domain GRB2-like endophilin B1),
           is localized mainly to the Golgi apparatus. It is
           involved in the regulation of many biological events
           including autophagy, tumorigenesis, nerve growth factor
           (NGF) trafficking, neurite outgrowth, mitochondrial
           outer membrane dynamics, and cell death. Endophilins
           play roles in synaptic vesicle formation, virus budding,
           mitochondrial morphology maintenance, receptor-mediated
           endocytosis inhibition, and endosomal sorting. They
           contain an N-terminal N-BAR domain (BAR domain with an
           additional N-terminal amphipathic helix), followed by a
           variable region containing proline clusters, and a
           C-terminal SH3 domain. Endophilin-B1 forms homo- and
           heterodimers (with endophilin-B2) through its BAR
           domain. It interacts with amphiphysin 1 and dynamin 1
           through its SH3 domain. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 61

 Score = 33.5 bits (76), Expect = 0.022
 Identities = 19/54 (35%), Positives = 27/54 (50%), Gaps = 2/54 (3%)

Query: 472 AVALYDYQASADDEISFDPDDIIT--NIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           A  LYDY A+   E+S   D++IT  ++  +D  W  G    Q G  P  Y+ L
Sbjct: 6   ARVLYDYDAANSTELSLLADEVITVYSVPGMDSDWLMGERGNQKGKVPITYLEL 59


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 37.0 bits (86), Expect = 0.023
 Identities = 24/105 (22%), Positives = 46/105 (43%), Gaps = 13/105 (12%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQIRIK-------EEQENIKKKKEQ-EEKERK 418
             E KE     + + +++    +EEL++E+ R+           E +   + Q EE+E+ 
Sbjct: 54  LLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKA 113

Query: 419 EQEENQKLLLKKQQEEDRLK-----AEEQARLLEQERLMEELRQQ 458
                 +L   ++Q ++ L        EQAR L  + L  EL ++
Sbjct: 114 LSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAELEEE 158


>gnl|CDD|180941 PRK07352, PRK07352, F0F1 ATP synthase subunit B; Validated.
          Length = 174

 Score = 35.7 bits (83), Expect = 0.024
 Identities = 20/93 (21%), Positives = 46/93 (49%), Gaps = 10/93 (10%)

Query: 372 EKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEEN-----QKL 426
           E++  + + +K  E    E L +    + E Q+ + + +++ E+ R + +        ++
Sbjct: 50  ERREAILQALKEAE----ERLRQAAQALAEAQQKLAQAQQEAERIRADAKARAEAIRAEI 105

Query: 427 LLKKQQEEDRLKAEEQARL-LEQERLMEELRQQ 458
             +  ++  RLK    A L  EQER++ +LR++
Sbjct: 106 EKQAIEDMARLKQTAAADLSAEQERVIAQLRRE 138


>gnl|CDD|212957 cd12024, SH3_NoxO1_2, Second or C-terminal Src homology 3 domain of
           NADPH oxidase (Nox) Organizing protein 1.  Nox
           Organizing protein 1 (NoxO1) is a critical regulator of
           enzyme kinetics of the nonphagocytic NADPH oxidase Nox1,
           which catalyzes the transfer of electrons from NADPH to
           molecular oxygen to form superoxide. Nox1 is expressed
           in colon, stomach, uterus, prostate, and vascular smooth
           muscle cells. NoxO1 is involved in targeting activator
           subunits (such as NoxA1) to Nox1. It is co-localized
           with Nox1 in the membranes of resting cells and directs
           the subcellular localization of Nox1. NoxO1 contains an
           N-terminal Phox homology (PX) domain, tandem SH3 domains
           (N-SH3 and C-SH3), and a C-terminal proline-rich region
           (PRR). This model characterizes the second SH3 domain
           (or C-SH3) of NoxO1. The tandem SH3 domains of NoxO1
           interact with the PRR of p22phox, which also complexes
           with Nox1. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 33.1 bits (76), Expect = 0.024
 Identities = 14/49 (28%), Positives = 24/49 (48%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANY 520
             A   Y+A  +DE+S     ++  ++  D GWW    +G+ G  P+ Y
Sbjct: 2   YYATRAYEAQKEDELSVPAGVVVEVLQKSDNGWWLIRYNGRAGYVPSMY 50


>gnl|CDD|212819 cd11886, SH3_BOI, Src Homology 3 domain of fungal BOI-like
           proteins.  This subfamily includes the Saccharomyces
           cerevisiae proteins BOI1 and BOI2, and similar proteins.
           They contain an N-terminal SH3 domain, a Sterile alpha
           motif (SAM), and a Pleckstrin homology (PH) domain at
           the C-terminus. BOI1 and BOI2 interact with the SH3
           domain of Bem1p, a protein involved in bud formation.
           They promote polarized cell growth and participates in
           the NoCut signaling pathway, which is involved in the
           control of cytokinesis. SH3 domains bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs; they play a role in the
           regulation of enzymes by intramolecular interactions,
           changing the subcellular localization of signal pathway
           components and mediate multiprotein complex assemblies.
          Length = 55

 Score = 33.1 bits (76), Expect = 0.025
 Identities = 18/53 (33%), Positives = 30/53 (56%), Gaps = 5/53 (9%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDE---GWWRG--YCHGQYGLFPANY 520
           + ++D+ A ++DE++  P D I  IE  +E   GW+ G     G+ GLFP  +
Sbjct: 3   IVIHDFNARSEDELTLKPGDKIELIEDDEEFGDGWYLGRNLRTGETGLFPVVF 55


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 36.9 bits (86), Expect = 0.026
 Identities = 20/45 (44%), Positives = 32/45 (71%), Gaps = 2/45 (4%)

Query: 386 LERKEEL--EKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLL 428
           L+ KEEL  EKE+    +EQ+ ++K K+QEEK++KE E+ +K  +
Sbjct: 547 LDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKI 591



 Score = 36.2 bits (84), Expect = 0.041
 Identities = 16/40 (40%), Positives = 28/40 (70%), Gaps = 1/40 (2%)

Query: 378 QEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKER 417
           +EE++R E E KE L++++   K +++  KKKKE E+ E+
Sbjct: 550 KEELQR-EKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 34.2 bits (79), Expect = 0.17
 Identities = 22/66 (33%), Positives = 37/66 (56%), Gaps = 5/66 (7%)

Query: 388 RKEELEKEQIRI--KEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARL 445
           R E L    IRI  K +  ++ K  ++EE +R   E+ +K  LK+Q+   +LK +E+ + 
Sbjct: 524 RDEWLPNLGIRIEDKPDGPSVWKLDDKEELQR---EKEEKEALKEQKRLRKLKKQEEKKK 580

Query: 446 LEQERL 451
            E E+L
Sbjct: 581 KELEKL 586



 Score = 29.6 bits (67), Expect = 5.3
 Identities = 30/133 (22%), Positives = 55/133 (41%), Gaps = 31/133 (23%)

Query: 267 SNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEK 326
              R +  + AK       K    +K+K++L     L+   KL +  L  L I+ E    
Sbjct: 490 LRFRDEVRDAAKAEM----KLISLDKKKKQL-----LQLCDKLRDEWLPNLGIRIE---- 536

Query: 327 KLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKEL 386
                 D P  PS        K        D   +Q   +E+E  ++Q+ +++  K++E 
Sbjct: 537 ------DKPDGPSV------WKLD------DKEELQREKEEKEALKEQKRLRKLKKQEEK 578

Query: 387 ERKEELEKEQIRI 399
           ++KE  + E+ +I
Sbjct: 579 KKKELEKLEKAKI 591



 Score = 29.2 bits (66), Expect = 6.8
 Identities = 15/45 (33%), Positives = 30/45 (66%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKK 410
           K ++++E QR  +E+   KE +R  +L+K++ + K+E E ++K K
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 36.9 bits (85), Expect = 0.027
 Identities = 27/116 (23%), Positives = 39/116 (33%), Gaps = 9/116 (7%)

Query: 378 QEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQ--KLLLKKQQEED 435
           Q    R EL   +  EK     + E    + +K ++E+E   QE     + L K QQE  
Sbjct: 82  QLRALRTELGTAQG-EKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELA 140

Query: 436 RLKAEEQ------ARLLEQERLMEELRQQGTNEDTEEDLGYTAVALYDYQASADDE 485
           RL  + Q        L EQ R +E   Q       +     T +            
Sbjct: 141 RLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSA 196



 Score = 30.8 bits (69), Expect = 2.2
 Identities = 33/190 (17%), Positives = 78/190 (41%), Gaps = 5/190 (2%)

Query: 270 RAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLN 329
            A+ E    + E E+ ++      +   K Q +L +  K  +   + LK   E+  ++L 
Sbjct: 106 AARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQ-RRQLE 164

Query: 330 GHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQE-EIKRKELER 388
             +    +   +       + LK    D + ++++  E+E +         + + +EL R
Sbjct: 165 AQAQSLQASQKQLQASA--TQLKSQVLD-LKLRSAQIEQEAQNLATRANAAQARTEELAR 221

Query: 389 KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQ 448
           +    ++  +  ++++    +K Q+   R EQ   ++  L++ +       +E A+L   
Sbjct: 222 RAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQARLEQEVAQLEAY 281

Query: 449 ERLMEELRQQ 458
            +    LRQQ
Sbjct: 282 YQAYVRLRQQ 291



 Score = 28.8 bits (64), Expect = 7.2
 Identities = 25/190 (13%), Positives = 65/190 (34%), Gaps = 5/190 (2%)

Query: 269 LRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKL 328
           LRA    L     E+    +E E  + +L+      +A + E     +   K ++   +L
Sbjct: 83  LRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARL 142

Query: 329 NGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELER 388
              +        +T    +    +Q  +    +Q S K+ +    Q   Q    +    +
Sbjct: 143 TKQAQ-----DLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQ 197

Query: 389 KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQ 448
            E+  +         +   ++  +     ++  +  +    +  ++ +  A    ++ E+
Sbjct: 198 IEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRER 257

Query: 449 ERLMEELRQQ 458
           ER ++ L   
Sbjct: 258 ERQLQRLETA 267


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 36.6 bits (84), Expect = 0.028
 Identities = 41/181 (22%), Positives = 79/181 (43%), Gaps = 7/181 (3%)

Query: 274 ENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSD 333
               ++ E+E  +   EE E+ +   + + +   +  E    E K  E E E+K    S 
Sbjct: 108 NETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSL 167

Query: 334 VPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELE 393
              +    T  +                  + KE E+ ++++  QE     E  +K+  E
Sbjct: 168 EENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQ--QEAALELEELKKKREE 225

Query: 394 KEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLME 453
           + ++  +EEQ       +QEE +RK +EE +K  LK++ E  R +A E+ + + ++ L E
Sbjct: 226 RRKVLEEEEQRR-----KQEEADRKSREEEEKRRLKEEIERRRAEAAEKRQKVPEDGLSE 280

Query: 454 E 454
           +
Sbjct: 281 D 281



 Score = 29.6 bits (66), Expect = 5.0
 Identities = 44/192 (22%), Positives = 81/192 (42%), Gaps = 4/192 (2%)

Query: 271 AKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNG 330
           A  E LA++ E    + SE  + +++ K     +   +   R   +   + E +E++   
Sbjct: 58  ALLERLARREERRDERFSEALERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKE 117

Query: 331 HSDVPLSPSTETPPVP---VKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELE 387
            S        ET  V     K+  +         +    EEEEK K+  ++E        
Sbjct: 118 ESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTH 177

Query: 388 RKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL-LKKQQEEDRLKAEEQARLL 446
           + +  E    R   E   ++  KE E+ ++K+QE   +L  LKK++EE R   EE+ +  
Sbjct: 178 KLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRR 237

Query: 447 EQERLMEELRQQ 458
           +QE    + R++
Sbjct: 238 KQEEADRKSREE 249



 Score = 29.2 bits (65), Expect = 6.1
 Identities = 24/100 (24%), Positives = 45/100 (45%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL 427
           E+   +   V+EE K +  E +EE+E+ +   K EQ+N  +  E+ +KE KE E  ++  
Sbjct: 102 EDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEK 161

Query: 428 LKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEED 467
            K+   E+        +L   E        +G   +  ++
Sbjct: 162 PKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKE 201


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 37.0 bits (86), Expect = 0.029
 Identities = 47/188 (25%), Positives = 92/188 (48%), Gaps = 23/188 (12%)

Query: 273 FENLAKQTEEESRKRSEEEKEKRKLKDQIDL--EQAQKLEE--RRLSELKIKEEEIEKKL 328
           ++NL +  +E  R+    EK  ++ ++  +L  E+ ++LEE  R ++E+  +  E+ ++L
Sbjct: 164 YKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREEL 223

Query: 329 NGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELE- 387
                             ++  +K+       I+   KE E  E     + E K +ELE 
Sbjct: 224 EK----------------LEKEVKELEELKEEIEELEKELESLEG-SKRKLEEKIRELEE 266

Query: 388 RKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLE 447
           R EEL+KE   ++E+ + +K+ KE+ E+  K  E  ++ L + ++ E RL   E+  +  
Sbjct: 267 RIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEE-EING 325

Query: 448 QERLMEEL 455
            E  ++EL
Sbjct: 326 IEERIKEL 333



 Score = 35.4 bits (82), Expect = 0.091
 Identities = 27/104 (25%), Positives = 52/104 (50%), Gaps = 6/104 (5%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
           I+   +  E  EK     E I+    E+++ELE+    I E    + + +E+ EK  KE 
Sbjct: 171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEV 230

Query: 421 EENQKL------LLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
           +E ++L      L K+ +  +  K + + ++ E E  +EEL+++
Sbjct: 231 KELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKE 274



 Score = 34.7 bits (80), Expect = 0.13
 Identities = 35/154 (22%), Positives = 76/154 (49%), Gaps = 26/154 (16%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKK 327
            L  ++    K+ E+E ++  E+E++ RK   +++    ++ E  +L EL  + +E+E+K
Sbjct: 452 ELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEK 511

Query: 328 LNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKR--KE 385
           L  ++   L                         + + + E+ KEK   ++ EIK   KE
Sbjct: 512 LKKYNLEELE------------------------KKAEEYEKLKEKLIKLKGEIKSLKKE 547

Query: 386 LERKEELEKEQIRIKEEQENIKKKKEQEEKERKE 419
           LE+ EEL+K+   ++++ + ++++  +  KE +E
Sbjct: 548 LEKLEELKKKLAELEKKLDELEEELAELLKELEE 581



 Score = 34.3 bits (79), Expect = 0.18
 Identities = 54/195 (27%), Positives = 96/195 (49%), Gaps = 22/195 (11%)

Query: 269 LRAKFENLAKQTEEESRKRSEEEKEKRKLKDQID--LEQAQKLEE--RRLSELKIKEEEI 324
           L+ + E L K+ E     + + E++ R+L+++I+   ++ ++LEE  + L ELK K EE 
Sbjct: 236 LKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEY 295

Query: 325 EKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRK 384
            K L+   +  L    E     ++  L +   +   I+   KE EEKE++    EE+K+K
Sbjct: 296 IK-LSEFYEEYLDELRE-----IEKRLSRLEEEINGIEERIKELEEKEERL---EELKKK 346

Query: 385 ELERKEELEK---------EQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEED 435
             E ++ LE+         E    KEE E +KK+      E+ E+E  +    K++ EE+
Sbjct: 347 LKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEE 406

Query: 436 RLKAEEQARLLEQER 450
             K   +   L++E 
Sbjct: 407 ISKITARIGELKKEI 421



 Score = 29.3 bits (66), Expect = 6.5
 Identities = 49/208 (23%), Positives = 100/208 (48%), Gaps = 21/208 (10%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEK-RKLKDQI-----DLEQAQKLEERRLSELKIKE 321
            L  K +    +  E+  +  E+ KEK  KLK +I     +LE+ ++L+ ++L+EL+ K 
Sbjct: 507 ELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELK-KKLAELEKKL 565

Query: 322 EEIEKKLNGHSDVPLSPSTETPPVPVKSI--LKQPTSDGIPIQNSNKEEEEKEKQR---- 375
           +E+E++L            E+     + +  L+   ++ + ++++ KE E +EK+     
Sbjct: 566 DELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLE 625

Query: 376 --------MVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL 427
                    + E  KR E  RKE  E E+   +EE E ++++  +  +E        + L
Sbjct: 626 EELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEEL 685

Query: 428 LKKQQEEDRLKAEEQARLLEQERLMEEL 455
            K+++E  +   + +  L E+E+  +EL
Sbjct: 686 EKRREEIKKTLEKLKEELEEREKAKKEL 713



 Score = 28.5 bits (64), Expect = 9.9
 Identities = 37/166 (22%), Positives = 79/166 (47%), Gaps = 30/166 (18%)

Query: 285 RKRSEEEKE---KRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTE 341
           R+ +EE ++   +    +   +E+  K  E +  +L+ +  E+EK L   S++       
Sbjct: 443 RELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESEL------- 495

Query: 342 TPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKE 401
              + +K + +Q             +E E++ ++   EE+++K  E  E+L+++ I++K 
Sbjct: 496 ---IKLKELAEQ------------LKELEEKLKKYNLEELEKKA-EEYEKLKEKLIKLKG 539

Query: 402 EQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLE 447
           E +++KK    E ++ +E ++    L KK  E +   AE    L E
Sbjct: 540 EIKSLKK----ELEKLEELKKKLAELEKKLDELEEELAELLKELEE 581


>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B.  A
           small number of taxonomically diverse prokaryotic
           species, including Methanosarcina barkeri, have what
           appears to be a second ATP synthase, in addition to the
           normal F1F0 ATPase in bacteria and A1A0 ATPase in
           archaea. These enzymes use ion gradients to synthesize
           ATP, CC and in principle may run in either direction.
           This model represents the F0 subunit B of this apparent
           second ATP synthase.
          Length = 246

 Score = 35.8 bits (83), Expect = 0.031
 Identities = 27/100 (27%), Positives = 52/100 (52%), Gaps = 5/100 (5%)

Query: 360 PIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKE 419
           PI ++    E+K    +   + K++E E++    +E+    ++Q  +   K +EE     
Sbjct: 29  PILDAMDAREKKIAGELADADTKKREAEQERREYEEKNEELDQQREVLLTKAKEEA---- 84

Query: 420 QEENQKLLLKKQQEEDRLKAEEQARLL-EQERLMEELRQQ 458
           Q E Q+LL + ++E D ++ + Q  L  EQ  L +ELR++
Sbjct: 85  QAERQRLLDEAREEADEIREKWQEALRREQAALSDELRRR 124


>gnl|CDD|212916 cd11983, SH3_Shank2, Src homology 3 domain of SH3 and multiple
           ankyrin repeat domains protein 2.  Shank2, also called
           ProSAP1 (Proline-rich synapse-associated protein 1) or
           CortBP1 (Cortactin-binding protein 1), is found in
           neurons, glia, endocrine cells, liver, and kidney. It
           plays a role in regulating dendritic spine volume and
           branching and postsynaptic clustering. Mutations in the
           Shank2 gene are associated with autism spectrum disorder
           and mental retardation. Shank proteins carry scaffolding
           functions through multiple sites of protein-protein
           interaction in its domain architecture, including
           ankyrin (ANK) repeats, a long proline rich region, as
           well as SH3, PDZ, and SAM domains. The SH3 domain of
           Shank binds GRIP, a scaffold protein that binds AMPA
           receptors and Eph receptors/ligands. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 52

 Score = 32.6 bits (74), Expect = 0.032
 Identities = 15/49 (30%), Positives = 22/49 (44%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           V +  YQ   + EI     D +  + + + G+W G   G  G FPA  V
Sbjct: 4   VVVKSYQPQVEGEIPLHKGDRVKVLSIGEGGFWEGSARGHVGWFPAECV 52


>gnl|CDD|212887 cd11954, SH3_ASPP1, Src Homology 3 domain of Apoptosis Stimulating
           of p53 protein 1.  ASPP1, like ASPP2, activates the
           apoptotic function of the p53 family of tumor
           suppressors (p53, p63, and p73). In addition, it
           functions in the cytoplasm to regulate the nuclear
           localization of the transcriptional cofactors YAP and
           TAZ by inihibiting their phosphorylation; YAP and TAZ
           are important regulators of cell expansion,
           differentiation, migration, and invasion. ASPP1 is
           downregulated in breast tumors expressing wild-type p53.
           It contains a proline-rich region, four ankyrin (ANK)
           repeats, and an SH3 domain at its C-terminal half. The
           SH3 domain and the ANK repeats of ASPP1 contribute to
           the p53 binding site; they bind to the DNA binding
           domain of p53. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 57

 Score = 32.7 bits (74), Expect = 0.032
 Identities = 19/55 (34%), Positives = 27/55 (49%), Gaps = 3/55 (5%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEG---WWRGYCHGQYGLFPANYVSL 523
             AL+DY+A   DE+SF   D IT +   D+    WW    + + G  P N + L
Sbjct: 3   VYALWDYEAQNADELSFQEGDAITILRRKDDSETEWWWARLNDKEGYVPKNLLGL 57


>gnl|CDD|213009 cd12076, SH3_Tks4_2, Second Src homology 3 domain of Tyrosine
           kinase substrate with four SH3 domains.  Tks4, also
           called SH3 and PX domain-containing protein 2B
           (SH3PXD2B) or HOFI, is a Src substrate and scaffolding
           protein that plays an important role in the formation of
           podosomes and invadopodia, the dynamic actin-rich
           structures that are related to cell migration and cancer
           cell invasion. It is required in the formation of
           functional podosomes, EGF-induced membrane ruffling, and
           lamellipodia generation. It plays an important role in
           cellular attachment and cell spreading. Tks4 is
           essential for the localization of MT1-MMP (membrane-type
           1 matrix metalloproteinase) to invadopodia. It contains
           an N-terminal Phox homology (PX) domain and four SH3
           domains. This model characterizes the second SH3 domain
           of Tks4. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 54

 Score = 32.7 bits (74), Expect = 0.033
 Identities = 16/49 (32%), Positives = 26/49 (53%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
             +Y Y A   DEI+ +   ++  I+   EGWW+    G+ G  PA+Y+
Sbjct: 4   TVIYPYTARDQDEINLEKGAVVEVIQKNLEGWWKIRYQGKEGWAPASYL 52


>gnl|CDD|216249 pfam01025, GrpE, GrpE. 
          Length = 165

 Score = 35.3 bits (82), Expect = 0.034
 Identities = 19/68 (27%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
           +   +EEE ++++  ++EE++  E E  EEL+   +R+  E EN +K+ E  E+E  ++ 
Sbjct: 1   EEKEEEEELEDEEEALEEELEELE-EEIEELKDRLLRLLAEFENYRKRTE-REREEAKKY 58

Query: 422 ENQKLLLK 429
             +K    
Sbjct: 59  AIEKFAKD 66


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 35.9 bits (83), Expect = 0.037
 Identities = 21/84 (25%), Positives = 44/84 (52%), Gaps = 6/84 (7%)

Query: 377 VQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDR 436
           +QE+ ++ ELE K     ++   ++ Q  +++ + + E+ RK  +E  +   ++ Q +D 
Sbjct: 59  MQEKTRQAELEAK----IKEYEAQQAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDE 114

Query: 437 LKAEEQARLLEQERLMEE--LRQQ 458
           L  +   + LEQ+R   E  L+ Q
Sbjct: 115 LARKRYQKELEQQRRQNEELLKMQ 138



 Score = 35.5 bits (82), Expect = 0.046
 Identities = 25/92 (27%), Positives = 49/92 (53%), Gaps = 5/92 (5%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL 427
           ++E E+QR   EE+ + + E    L +E +R   E+E ++ ++E  E+E + + EN   +
Sbjct: 121 QKELEQQRRQNEELLKMQEE--SVLRQEAMRRATEEEILEMRRETIEEEAELEREN---I 175

Query: 428 LKKQQEEDRLKAEEQARLLEQERLMEELRQQG 459
             K + E R +A+E+    +  R M +L+   
Sbjct: 176 RAKIEAEARGRAKEERENEDINREMLKLKANE 207



 Score = 33.2 bits (76), Expect = 0.27
 Identities = 21/89 (23%), Positives = 50/89 (56%), Gaps = 3/89 (3%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL 427
           ++E  ++R  Q+E++++   + EEL K Q      QE +++  E+E  E + +   ++  
Sbjct: 112 QDELARKRY-QKELEQQRR-QNEELLKMQEESVLRQEAMRRATEEEILEMRRETIEEEAE 169

Query: 428 LKKQQEEDRLKAEEQARLLEQERLMEELR 456
           L+++    +++AE + R  ++ER  E++ 
Sbjct: 170 LERENIRAKIEAEARGR-AKEERENEDIN 197



 Score = 29.3 bits (66), Expect = 4.7
 Identities = 21/105 (20%), Positives = 53/105 (50%), Gaps = 13/105 (12%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQ------ENIKKKKEQ-- 412
           ++ +  E EE+ K    Q + +++  + ++EL +++ + + EQ      E +K ++E   
Sbjct: 84  LERARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQNEELLKMQEESVL 143

Query: 413 -EEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELR 456
            +E  R+  EE + L ++++  E+  + E +     + ++  E R
Sbjct: 144 RQEAMRRATEE-EILEMRRETIEEEAELEREN---IRAKIEAEAR 184


>gnl|CDD|212766 cd11832, SH3_Shank, Src homology 3 domain of SH3 and multiple
           ankyrin repeat domains (Shank) proteins.  Shank proteins
           carry scaffolding functions through multiple sites of
           protein-protein interaction in its domain architecture,
           including ankyrin (ANK) repeats, a long proline rich
           region, as well as SH3, PDZ, and SAM domains. They bind
           a variety of membrane and cytosolic proteins, and exist
           in alternatively spliced isoforms. They are highly
           enriched in postsynaptic density (PSD) where they
           interact with the cytoskeleton and with postsynaptic
           membrane receptors including NMDA and glutamate
           receptors. They are crucial in the construction and
           organization of the PSD and dendritic spines of
           excitatory synapses. There are three members of this
           family (Shank1, Shank2, Shank3) which show distinct and
           cell-type specific patterns of expression. Shank1 is
           brain-specific; Shank2 is found in neurons, glia,
           endocrine cells, liver, and kidney; Shank3 is widely
           expressed. The SH3 domain of Shank binds GRIP, a
           scaffold protein that binds AMPA receptors and Eph
           receptors/ligands. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 50

 Score = 32.4 bits (74), Expect = 0.039
 Identities = 13/46 (28%), Positives = 23/46 (50%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPA 518
           +A+  Y    + EIS    D +  + + + G+W G   G+ G FP+
Sbjct: 3   IAVKSYSPQEEGEISLHKGDRVKVLSIGEGGFWEGSVRGRTGWFPS 48


>gnl|CDD|223407 COG0330, HflC, Membrane protease subunits, stomatin/prohibitin
           homologs [Posttranslational modification, protein
           turnover, chaperones].
          Length = 291

 Score = 35.9 bits (83), Expect = 0.039
 Identities = 18/99 (18%), Positives = 35/99 (35%), Gaps = 2/99 (2%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           +E +   +++M  E  KR E+   E   +  I   E +        + E E +     + 
Sbjct: 173 EEVQAAMEKQMAAERDKRAEILEAEGEAQAAILRAEGEAEAAIILAEAEAEAEVIARAEA 232

Query: 426 LLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDT 464
              K       L+    A     +R +EEL +     ++
Sbjct: 233 DAAKIIAAA--LREAPAAPQALAQRYLEELLEIALAGNS 269


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 36.5 bits (85), Expect = 0.039
 Identities = 14/71 (19%), Positives = 34/71 (47%), Gaps = 1/71 (1%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEK-ERKE 419
              + + EEE + +     E K  + E++   E++     EE+E+  K ++ ++    KE
Sbjct: 229 TSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKE 288

Query: 420 QEENQKLLLKK 430
           +++ +    +K
Sbjct: 289 KKDEELFWFEK 299



 Score = 30.7 bits (70), Expect = 2.1
 Identities = 14/68 (20%), Positives = 32/68 (47%), Gaps = 1/68 (1%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           KE  E E++    +       E K   ++++   +E+     ++KE  +K  ++ ++ + 
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDK-TEDLDKLEI 285

Query: 426 LLLKKQQE 433
           L  KK +E
Sbjct: 286 LKEKKDEE 293



 Score = 30.0 bits (68), Expect = 4.4
 Identities = 18/76 (23%), Positives = 38/76 (50%), Gaps = 7/76 (9%)

Query: 373 KQRMVQEEIKRKELERKEELEK--EQIRIKEEQENIKKK-----KEQEEKERKEQEENQK 425
           K+     E + +E E   E+E   E    K+EQE   ++      E++E   K ++ ++ 
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283

Query: 426 LLLKKQQEEDRLKAEE 441
            +LK++++E+    E+
Sbjct: 284 EILKEKKDEELFWFEK 299


>gnl|CDD|212977 cd12044, SH3_SKAP1, Src Homology 3 domain of Src Kinase-Associated
           Phosphoprotein 1.  SKAP1, also called SKAP55 (Src
           kinase-associated protein of 55kDa), is an immune
           cell-specific adaptor protein that plays an important
           role in T-cell adhesion, migration, and integrin
           clustering. It is expressed exclusively in
           T-lymphocytes, mast cells, and macrophages. Binding
           partners include ADAP (adhesion and
           degranulation-promoting adaptor protein), Fyn, Riam,
           RapL, and RasGRP. It contains a pleckstrin homology (PH)
           domain, a C-terminal SH3 domain, and several tyrosine
           phosphorylation sites. The SH3 domain of SKAP1 is
           necessary for its ability to regulate T-cell conjugation
           with antigen-presenting cells and the formation of LFA-1
           clusters. SKAP1 binds primarily to a proline-rich region
           of ADAP through its SH3 domain; its degradation is
           regulated by ADAP. A secondary interaction occurs via
           the ADAP SH3 domain and the RKxxYxxY motif in SKAP1. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 32.5 bits (74), Expect = 0.041
 Identities = 17/50 (34%), Positives = 25/50 (50%), Gaps = 2/50 (4%)

Query: 474 ALYDYQASADDEISFDPDDIITNI--EMIDEGWWRGYCHGQYGLFPANYV 521
            L+D      DE+SF   D+I  +  E    GWW G  +G  G+ P +Y+
Sbjct: 4   GLWDCFGDNPDELSFQRGDLIYILSKEYNMYGWWVGELNGIVGIVPKDYL 53


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 35.2 bits (81), Expect = 0.044
 Identities = 32/92 (34%), Positives = 47/92 (51%), Gaps = 2/92 (2%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEE--N 423
           +  E  E Q  + E +K  +      L   Q R++E +   K+KKE+ E+  KE EE  +
Sbjct: 112 EIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELED 171

Query: 424 QKLLLKKQQEEDRLKAEEQARLLEQERLMEEL 455
           +K LL+K  EE   K E +    E E L+EEL
Sbjct: 172 EKDLLEKLLEEKEKKKELEELKEELEELLEEL 203



 Score = 29.1 bits (65), Expect = 4.2
 Identities = 16/57 (28%), Positives = 38/57 (66%)

Query: 378 QEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEE 434
            +  ++++ ER EELEKE   +++E++ ++K  E++EK+++ +E  ++L    ++ E
Sbjct: 148 LKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELLEELE 204



 Score = 28.3 bits (63), Expect = 7.2
 Identities = 22/127 (17%), Positives = 49/127 (38%), Gaps = 1/127 (0%)

Query: 199 KFGIESDRQDKSAVGWDHVEKVDKHQSQKDDNRQVVTSSKVKDLIAANSTNASKENIKPK 258
            F     +     +         K +  K    ++V   ++++ I     +  +      
Sbjct: 79  TFENNKGKLKLRLIEESRELTKKKGKKVKKSILEIVEIDELQEFIDELLKSDKEILPLLL 138

Query: 259 PNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELK 318
             +G  +   L+ K +   ++ EE  ++  E E EK  L+  ++ ++ +K  E    EL+
Sbjct: 139 -YLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELE 197

Query: 319 IKEEEIE 325
              EE+E
Sbjct: 198 ELLEELE 204



 Score = 27.9 bits (62), Expect = 9.5
 Identities = 24/115 (20%), Positives = 59/115 (51%), Gaps = 7/115 (6%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERK-- 418
           I   N + + K +      E+ +K+ ++ ++   E + I E QE I +  + +++     
Sbjct: 78  ITFENNKGKLKLRLIEESRELTKKKGKKVKKSILEIVEIDELQEFIDELLKSDKEILPLL 137

Query: 419 ---EQEENQKLLLKKQQEEDRLKAEEQA--RLLEQERLMEELRQQGTNEDTEEDL 468
               QE  ++L  K++++++RL+  E+    L +++ L+E+L ++   +   E+L
Sbjct: 138 LYLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEEL 192


>gnl|CDD|213001 cd12068, SH3_MYO15B, Src Homology 3 domain of Myosin XVb.  Myosin
           XVb, also called KIAA1783, was named based on its
           similarity with myosin XVa. It is a transcribed and
           unprocessed pseudogene whose predicted amino acid
           sequence contains mutated or deleted amino acid residues
           that are normally conserved and important for myosin
           function. The related myosin XVa is important for normal
           growth of mechanosensory stereocilia of inner ear hair
           cells. Myosin XVa contains a unique N-terminal extension
           followed by a motor domain, light chain-binding IQ
           motifs, and a tail consisting of a pair of MyTH4-FERM
           tandems separated by a SH3 domain, and a PDZ domain. SH3
           domains bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs;
           they play a role in the regulation of enzymes by
           intramolecular interactions, changing the subcellular
           localization of signal pathway components and mediate
           multiprotein complex assemblies.
          Length = 55

 Score = 32.5 bits (74), Expect = 0.045
 Identities = 19/53 (35%), Positives = 26/53 (49%), Gaps = 2/53 (3%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEM--IDEGWWRGYCHGQYGLFPANYVSL 523
           VAL  Y       +SF   D+I  + M  ++ GW  G   G+ GLFPA+ V  
Sbjct: 3   VALRSYITDDKSLLSFHRGDLIKLLPMAGLEPGWQFGSTGGRSGLFPADIVQP 55


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 35.8 bits (83), Expect = 0.045
 Identities = 21/93 (22%), Positives = 41/93 (44%), Gaps = 1/93 (1%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIK-KKKEQEEKERKEQEENQK 425
           E    E  ++  + +K +   +KE L + + RI + +E I+ K++  EE +R   +    
Sbjct: 48  EGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRERIEELKRALAQRRSD 107

Query: 426 LLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
           L     Q E R  ++ +    E +R   +L   
Sbjct: 108 LSSASYQLEKRRASQLEKLQDEIKRTRSKLNAL 140



 Score = 32.0 bits (73), Expect = 0.66
 Identities = 26/135 (19%), Positives = 59/135 (43%), Gaps = 18/135 (13%)

Query: 347 VKSILKQPTSDGIPIQN---SNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQ 403
           V+  L+  T++   +       +   +KE+   ++  I +     KEE+E+++ RI+E +
Sbjct: 43  VEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQL----KEEIEQKRERIEELK 98

Query: 404 ENIKKKKE-------QEEKERKEQEENQKLLLKKQQEE----DRLKAEEQARLLEQERLM 452
             + +++        Q EK R  Q E  +  +K+ + +      L AE+++ L  +   +
Sbjct: 99  RALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRTRSKLNALHSLLAEKRSFLCRELAKL 158

Query: 453 EELRQQGTNEDTEED 467
             LR+       +  
Sbjct: 159 FPLRRVIRGRKGDSS 173



 Score = 30.0 bits (68), Expect = 3.0
 Identities = 24/102 (23%), Positives = 40/102 (39%), Gaps = 4/102 (3%)

Query: 369 EEKEKQRMVQEEIKRKELERK--EELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKL 426
           E +     +  E   +EL++K  E LE       +   ++ K +   +KER  Q   +  
Sbjct: 24  ELRLDLARLLLE--NEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARIS 81

Query: 427 LLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDL 468
            LK++ E+ R + EE  R L Q R           +     L
Sbjct: 82  QLKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQL 123


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 35.5 bits (82), Expect = 0.048
 Identities = 21/85 (24%), Positives = 48/85 (56%)

Query: 372 EKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQ 431
           E+    Q+E++ +  + +E++ + Q  ++E +E   + +E+ ++E +E +  +K   + +
Sbjct: 1   EEAEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELE 60

Query: 432 QEEDRLKAEEQARLLEQERLMEELR 456
           +E  RL+ E  A   E+ERL  E+ 
Sbjct: 61  EENRRLEEEAAASEEERERLEAEVD 85



 Score = 30.9 bits (70), Expect = 1.2
 Identities = 26/95 (27%), Positives = 50/95 (52%), Gaps = 3/95 (3%)

Query: 364 SNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEEN 423
           + KE EE E+  +  EE  ++E E  + LEK+   ++EE   ++++    E+ER+  E  
Sbjct: 24  AQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEERERLEAE 83

Query: 424 QKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
              + +   E  +L+ E + +  E  +L +ELR+ 
Sbjct: 84  ---VDEATAEVAKLEEEREKKEAETRQLQQELREA 115



 Score = 30.9 bits (70), Expect = 1.3
 Identities = 27/104 (25%), Positives = 49/104 (47%), Gaps = 5/104 (4%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
           +      E +EK +  +EE +  E +  E LE+E  R++EE    ++++E+ E E  E  
Sbjct: 30  EYEETALELEEKLKQEEEEAQLLEKKADE-LEEENRRLEEEAAASEEERERLEAEVDEAT 88

Query: 422 ENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTE 465
                 + K +EE   K  E  +L ++ R  +E  ++   E  E
Sbjct: 89  A----EVAKLEEEREKKEAETRQLQQELREAQEAHERARQELLE 128


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 35.8 bits (83), Expect = 0.050
 Identities = 34/159 (21%), Positives = 70/159 (44%), Gaps = 27/159 (16%)

Query: 276 LAKQTEEESRKRSEEEKEKRKLKDQIDL---------EQAQKLEERRLSELKIKEEEIEK 326
           L ++ E+E +   + ++ +R  + +++          E ++ L E    + + + ++   
Sbjct: 199 LQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESA 258

Query: 327 KLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEE----IK 382
                S+       E    PV+   K+ T         NKE+  KE +R  +EE     K
Sbjct: 259 WEGFESE------YEPINKPVRP--KRKTK-----AQRNKEKRRKELEREAKEEKQLKKK 305

Query: 383 RKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
             +L R +E+ KE  + KE+    KK++ +E  E+K+ +
Sbjct: 306 LAQLARLKEIAKEVAQ-KEKARARKKEQRKERGEKKKLK 343


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 35.5 bits (82), Expect = 0.051
 Identities = 24/126 (19%), Positives = 60/126 (47%), Gaps = 31/126 (24%)

Query: 364 SNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEE--QENIKKKKEQEEKERKE-- 419
           S K+ + ++K + + EE +++E E++ E  + + R+ +E  +E  ++K +Q  K+R    
Sbjct: 94  SAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKH 153

Query: 420 ---------------------------QEENQKLLLKKQQEEDRLKAEEQARLLEQERLM 452
                                      ++  Q+  LKK +++ + + EE+ +  ++++  
Sbjct: 154 KKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEE 213

Query: 453 EELRQQ 458
           EE +Q+
Sbjct: 214 EERKQK 219



 Score = 31.6 bits (72), Expect = 0.78
 Identities = 28/154 (18%), Positives = 67/154 (43%), Gaps = 34/154 (22%)

Query: 270 RAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSE-LKIKEEEIEKKL 328
           + K + L ++ +++ R++  EE E R           Q+L + +  E  + K ++  K+ 
Sbjct: 101 QKKLQKLLEEKQKQEREKEREEAELR-----------QRLAKEKYEEWCRQKAQQAAKQ- 148

Query: 329 NGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELER 388
                                  K+         + + + E    Q   ++ ++  EL++
Sbjct: 149 -----------------RTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKK 191

Query: 389 KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEE 422
            ++ +++    +EE+   ++KK+QEE+ERK++ E
Sbjct: 192 LKQQQQK----REEERRKQRKKQQEEEERKQKAE 221



 Score = 30.9 bits (70), Expect = 1.5
 Identities = 18/80 (22%), Positives = 40/80 (50%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
           Q + K+   K K+   +          K E    Q   K+  +  + KK +++++++E+E
Sbjct: 143 QQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEE 202

Query: 422 ENQKLLLKKQQEEDRLKAEE 441
             ++   ++++EE + KAEE
Sbjct: 203 RRKQRKKQQEEEERKQKAEE 222



 Score = 28.5 bits (64), Expect = 8.0
 Identities = 26/142 (18%), Positives = 62/142 (43%), Gaps = 17/142 (11%)

Query: 334 VPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEE---KEKQRMVQEEIKRKELERKE 390
           VP    + T      S     +   + ++   +E  E     KQ   Q+++++   E+++
Sbjct: 54  VPSLSLSSTASSLSDSSTYSRSLKEVKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQK 113

Query: 391 ELEKEQIRIKEEQENIKKKKEQEEKERKEQE--------------ENQKLLLKKQQEEDR 436
           +  +++    E ++ + K+K +E   +K Q+              E+    L    + +R
Sbjct: 114 QEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPER 173

Query: 437 LKAEEQARLLEQERLMEELRQQ 458
             ++E+A+   QE  +++L+QQ
Sbjct: 174 NVSQEEAKKRLQEWELKKLKQQ 195


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 35.7 bits (82), Expect = 0.059
 Identities = 33/200 (16%), Positives = 63/200 (31%), Gaps = 11/200 (5%)

Query: 274 ENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQK--LEERRLSELKIKEEEIEKKL--- 328
           E L K   +ES +  +E +++ + K+QI L++ +K  +   RL EL+ +   +       
Sbjct: 454 EKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHP 513

Query: 329 NGHSDVPLSPSTETPPV------PVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIK 382
           N       +P   T  +        +    +            +    KE+ + +Q+   
Sbjct: 514 NPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFS 573

Query: 383 RKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQ 442
                     E                ++  E E     E   LL K Q E+D       
Sbjct: 574 ILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLH 633

Query: 443 ARLLEQERLMEELRQQGTNE 462
            +   QE  ++         
Sbjct: 634 LQQCSQELALKLTALHALQL 653



 Score = 34.2 bits (78), Expect = 0.19
 Identities = 31/212 (14%), Positives = 69/212 (32%), Gaps = 10/212 (4%)

Query: 265 KPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEI 324
            P     + + L K+ +       + ++    L  + + ++ Q  +++ L +L+ + EE+
Sbjct: 213 MPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEEL 272

Query: 325 EKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRK 384
             +     +                         I  Q      E + K R   + + ++
Sbjct: 273 RAQEAVLEE--TQERINRARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKR 330

Query: 385 ELERKEELEKEQIRIKEE----QENIKKKKEQEEKERKEQEENQKL----LLKKQQEEDR 436
               K++   E+ R   +    QE   +   +     +E    Q      +   QQ++  
Sbjct: 331 AAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKTT 390

Query: 437 LKAEEQARLLEQERLMEELRQQGTNEDTEEDL 468
           L  + Q+   E + L  E     T      DL
Sbjct: 391 LTQKLQSLCKELDILQREQATIDTRTSAFRDL 422



 Score = 30.7 bits (69), Expect = 2.3
 Identities = 21/90 (23%), Positives = 40/90 (44%), Gaps = 3/90 (3%)

Query: 365 NKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEEN- 423
                  +   +    +     ERK+ LEKE   ++E  +  ++      ++R+ QEE  
Sbjct: 197 ELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQL 256

Query: 424 --QKLLLKKQQEEDRLKAEEQARLLEQERL 451
             Q+LL + +   + L+A+E      QER+
Sbjct: 257 KKQQLLKQLRARIEELRAQEAVLEETQERI 286



 Score = 30.3 bits (68), Expect = 3.4
 Identities = 21/181 (11%), Positives = 63/181 (34%), Gaps = 19/181 (10%)

Query: 274 ENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSD 333
                  E   R   E     R++  Q      Q    + +  L+ ++  + +KL     
Sbjct: 347 LQTLHSQEIHIRDAHEVATSIREISCQ------QHTLTQHIHTLQQQKTTLTQKLQSLCK 400

Query: 334 VPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELE 393
                           IL++  +      ++ ++ + +      Q+E++++  E      
Sbjct: 401 E-------------LDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAI 447

Query: 394 KEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLME 453
               + ++ ++   ++  Q  KER++Q + ++ +  ++  +  +       L E+   + 
Sbjct: 448 TCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLC 507

Query: 454 E 454
            
Sbjct: 508 G 508



 Score = 29.6 bits (66), Expect = 5.9
 Identities = 14/93 (15%), Positives = 32/93 (34%), Gaps = 6/93 (6%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEELEKEQIR-----IKEEQENIKKKKEQEEKERKEQEE 422
           E+  E + M+  E      + + E + + +R       +E           +    ++  
Sbjct: 601 EKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERV 660

Query: 423 NQKLLLKKQQEEDRLKAEEQARLLEQERLMEEL 455
            +  L  +   +  L A  Q  L + +   E+L
Sbjct: 661 REHALSIRVLPK-ELLASRQLALQKMQSEKEQL 692


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
           appears in eukaryotes as well as bacteria and tends to
           be found near the C-terminus of the metalloprotease M16C
           (pfam05193).
          Length = 248

 Score = 34.9 bits (81), Expect = 0.061
 Identities = 12/46 (26%), Positives = 24/46 (52%), Gaps = 1/46 (2%)

Query: 393 EKEQIRIKEEQENIKKKKEQ-EEKERKEQEENQKLLLKKQQEEDRL 437
             E+   +EE+E ++KKK    E+++++  E    L + Q   + L
Sbjct: 6   TLEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQSTPEDL 51



 Score = 29.1 bits (66), Expect = 4.5
 Identities = 14/51 (27%), Positives = 25/51 (49%), Gaps = 6/51 (11%)

Query: 404 ENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEE 454
           E +++K  +EE+E+ E+            EED+ K  E+   LE+ +   E
Sbjct: 5   ETLEEKLNEEEREKLEK------KKSSLSEEDKEKIIERGLELEELQSTPE 49


>gnl|CDD|212915 cd11982, SH3_Shank1, Src homology 3 domain of SH3 and multiple
           ankyrin repeat domains protein 1.  Shank1, also called
           SSTRIP (Somatostatin receptor-interacting protein), is a
           brain-specific protein that plays a role in the
           construction of postsynaptic density (PSD) and the
           maturation of dendritic spines. Mice deficient in Shank1
           show altered PSD composition, thinner PSDs, smaller
           dendritic spines, and weaker basal synaptic
           transmission, although synaptic plasticity is normal.
           They show increased anxiety and impaired fear memory,
           but also show better spatial learning. Shank proteins
           carry scaffolding functions through multiple sites of
           protein-protein interaction in its domain architecture,
           including ankyrin (ANK) repeats, a long proline rich
           region, as well as SH3, PDZ, and SAM domains. The SH3
           domain of Shank binds GRIP, a scaffold protein that
           binds AMPA receptors and Eph receptors/ligands. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 52

 Score = 31.9 bits (72), Expect = 0.061
 Identities = 17/51 (33%), Positives = 29/51 (56%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           T +A+  YQ+ A+ EIS    + I  + + + G+W G   G+ G FP++ V
Sbjct: 2   TFMAVKPYQSQAEGEISLSKGEKIKVLSVGEGGFWEGQVKGRVGWFPSDCV 52


>gnl|CDD|212736 cd11802, SH3_Endophilin_B, Src homology 3 domain of Endophilin-B.
           Endophilins play roles in synaptic vesicle formation,
           virus budding, mitochondrial morphology maintenance,
           receptor-mediated endocytosis inhibition, and endosomal
           sorting. They are classified into two types, A and B.
           Vertebrates contain two endophilin-B isoforms.
           Endophilin-B proteins are cytoplasmic proteins expressed
           mainly in the heart, placenta, and skeletal muscle.
           Endophilins contain an N-terminal N-BAR domain (BAR
           domain with an additional N-terminal amphipathic helix),
           followed by a variable region containing proline
           clusters, and a C-terminal SH3 domain. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 52

 Score = 31.9 bits (73), Expect = 0.065
 Identities = 19/52 (36%), Positives = 24/52 (46%), Gaps = 2/52 (3%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMI--DEGWWRGYCHGQYGLFPANY 520
            A  LYDY A    E+S   D++IT  E+   DE +  G    Q G  P  Y
Sbjct: 1   KARVLYDYDAEDSTELSLLADEVITVYELPGMDEDYMMGERGSQRGKVPVAY 52


>gnl|CDD|227515 COG5188, PRP9, Splicing factor 3a, subunit 3 [RNA processing and
           modification].
          Length = 470

 Score = 35.4 bits (81), Expect = 0.066
 Identities = 30/147 (20%), Positives = 50/147 (34%), Gaps = 10/147 (6%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKL 426
           EE E  +  + +   +  +L R E   + QIRI E  E I  +    E ER         
Sbjct: 12  EEMEIIENAIAERIQRNPKLPRDELRLERQIRIFENMERISNQIWLVEHERPTG------ 65

Query: 427 LLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDLGYTAVALYDYQASADDEI 486
              K Q   R+       L    + +  L  +G  +   E  G   +  Y+  A    ++
Sbjct: 66  --LKDQMMKRINRSIDRDLYGFYKKLGALNVEGKLDGEIEVKGLRDLGYYESSAPKARDL 123

Query: 487 SFDPDDIITNIEMIDEGWWRGYCHGQY 513
             D +      E+  +G        +Y
Sbjct: 124 --DVEAAFKGSELYGDGMELERIFRKY 148


>gnl|CDD|212978 cd12045, SH3_SKAP2, Src Homology 3 domain of Src Kinase-Associated
           Phosphoprotein 2.  SKAP2, also called SKAP55-Related
           (SKAP55R) or SKAP55 homolog (SKAP-HOM or SKAP55-HOM), is
           an immune cell-specific adaptor protein that plays an
           important role in adhesion and migration of B-cells and
           macrophages. Binding partners include ADAP (adhesion and
           degranulation-promoting adaptor protein), YopH, SHPS1,
           and HPK1. SKAP2 has also been identified as a substrate
           for lymphoid-specific tyrosine phosphatase (Lyp), which
           has been implicated in a wide variety of autoimmune
           diseases. It contains a pleckstrin homology (PH) domain,
           a C-terminal SH3 domain, and several tyrosine
           phosphorylation sites. Like SKAP1, SKAP2 is expected to
           bind primarily to a proline-rich region of ADAP through
           its SH3 domain; its degradation may be regulated by
           ADAP. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 31.8 bits (72), Expect = 0.068
 Identities = 18/49 (36%), Positives = 22/49 (44%), Gaps = 2/49 (4%)

Query: 475 LYDYQASADDEISFDPDDIITNI--EMIDEGWWRGYCHGQYGLFPANYV 521
           L+D      DE+SF   D I  +  E    GWW G   G  GL P  Y+
Sbjct: 5   LWDCTGDQPDELSFKRGDTIYILSKEYNRFGWWVGEMKGTIGLVPKAYI 53


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 35.7 bits (82), Expect = 0.071
 Identities = 72/385 (18%), Positives = 125/385 (32%), Gaps = 92/385 (23%)

Query: 112 HESQSDYKTGFGGKFGVQNDRVDKSAVTWDHKEVIEKHASQKDYSSGFGGKFGVQKDRQD 171
            +             G+ N     +       EVIE H+++   SS              
Sbjct: 259 GKDHHHSHNHQHHSIGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTN----GSKETT 314

Query: 172 KSAVGWDHQEKIEKHESQKDYAKGFGGKFGIESDRQDKSAVGWDHVEKVDKHQSQKDDNR 231
            ++          K       AK         S +   SA G   V       ++    R
Sbjct: 315 SNSSSAAAGSIGSKSSK---SAKHSNRNKSNSSPKSHSSANG--SVPSSSVSDNESKQKR 369

Query: 232 QVVTSSKVKDLIAANSTNASKENIKPK-PNIGHVKPSNLRAKFENLAKQTEEESRKRSEE 290
              +SS  +D     S  ++   ++   P      PS +    +++ K   E  + R  E
Sbjct: 370 ASKSSSGARDSKKDASGMSANGTVENCIPENKISTPSAIERLEQDIKKLQAELQQARQNE 429

Query: 291 EKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSI 350
            + + ++     LE++ K +   L +LK + + ++ KLN                     
Sbjct: 430 SELRNQISLLTSLERSLKSD---LGQLKKENDMLQTKLNSMV------------------ 468

Query: 351 LKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQE---NIK 407
                             ++K+KQ M               +EK   R+K E +   N +
Sbjct: 469 ----------------SAKQKDKQSMQS-------------MEK---RLKSEADSRVNAE 496

Query: 408 KKKEQEEKERKEQEENQK----------------LLLKKQQEEDRLK--------AEEQA 443
           K+  +E+K +KE+EE                   L   KQ  E  +K         EE+ 
Sbjct: 497 KQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEEC 556

Query: 444 RLLEQERLMEELRQQGTNEDTEEDL 468
           R+LE+E   +ELR+   +E   E L
Sbjct: 557 RMLEKE--AQELRKYQESEKETEVL 579


>gnl|CDD|212955 cd12022, SH3_p47phox_2, Second or C-terminal Src homology 3 domain
           of the p47phox subunit of NADPH oxidase, also called
           Neutrophil Cytosolic Factor 1.  p47phox, or NCF1, is a
           cytosolic subunit of the phagocytic NADPH oxidase
           complex (also called Nox2 or gp91phox), which plays a
           key role in the ability of phagocytes to defend against
           bacterial infections. NADPH oxidase catalyzes the
           transfer of electrons from NADPH to oxygen during
           phagocytosis forming superoxide and reactive oxygen
           species. p47phox is required for activation of NADH
           oxidase and plays a role in translocation. It contains
           an N-terminal Phox homology (PX) domain, tandem SH3
           domains (N-SH3 and C-SH3), a polybasic/autoinhibitory
           region, and a C-terminal proline-rich region (PRR). This
           model characterizes the second SH3 domain (or C-SH3) of
           p47phox. In its inactive state, the tandem SH3 domains
           interact intramolecularly with the autoinhibitory
           region; upon activation, the tandem SH3 domains are
           exposed through a conformational change, resulting in
           their binding to the PRR of p22phox and the activation
           of NADPH oxidase. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 53

 Score = 31.7 bits (72), Expect = 0.071
 Identities = 13/49 (26%), Positives = 23/49 (46%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           + +  Y A  +DE++    + I  I  + +GWW        G FP+ Y+
Sbjct: 3   ITIKAYTAVEEDELTLLEGEAIEVIHKLLDGWWVVRKGEVTGYFPSMYL 51


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 35.4 bits (82), Expect = 0.072
 Identities = 37/177 (20%), Positives = 66/177 (37%), Gaps = 14/177 (7%)

Query: 250 ASKENIKPKPNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQA--- 306
             ++ IK   +    K      K EN  ++ E+E     E EK   +L+ + +L  A   
Sbjct: 275 FERDKIKQLASELEKKLEKELKKLENKLEKQEDE---LEELEKAAEELRQKGELLYANLQ 331

Query: 307 --QKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSI-LKQPTSDGIPIQN 363
             ++  +          EEI+ +L    D   +PS        K   LK    + +  Q 
Sbjct: 332 LIEEGLKSVRLADFYGNEEIKIEL----DKSKTPSENAQRYFKKYKKLKGAKVN-LDRQL 386

Query: 364 SNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
           S  +E     +       K +  +  EE+ +E I     +   KK+K++E  E+   
Sbjct: 387 SELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEKFRW 443



 Score = 34.7 bits (80), Expect = 0.14
 Identities = 37/193 (19%), Positives = 68/193 (35%), Gaps = 38/193 (19%)

Query: 269 LRAKFENLAKQTEEESRKRSEEEKEKRKLKDQID-LEQAQKLEERRLSELKIKEEEIEKK 327
           L  KFE   K  +  S    + EKE +KL+++++  E   +  E+   EL+ K E +   
Sbjct: 271 LDEKFERD-KIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYAN 329

Query: 328 LNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELE 387
           L                     +++           S +  +         EEIK +  +
Sbjct: 330 LQ--------------------LIE-------EGLKSVRLADFYG-----NEEIKIELDK 357

Query: 388 RKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEE--DRLKAEEQARL 445
            K   E  Q   K+ ++    K   + +  + +E        K   E  +  KA E+ R 
Sbjct: 358 SKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIR- 416

Query: 446 LEQERLMEELRQQ 458
            E+      L+ +
Sbjct: 417 -EELIEEGLLKSK 428


>gnl|CDD|224308 COG1390, NtpE, Archaeal/vacuolar-type H+-ATPase subunit E [Energy
           production and conversion].
          Length = 194

 Score = 34.3 bits (79), Expect = 0.076
 Identities = 25/97 (25%), Positives = 42/97 (43%), Gaps = 9/97 (9%)

Query: 363 NSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEE 422
               E+  K+  R  +EE +    E +EE EK +   K E E     +E   K  KE E 
Sbjct: 1   LMELEKLIKKILREAEEEAEEILEEAREEAEKIKEEAKREAEEA--IEEILRKAEKEAE- 57

Query: 423 NQKLLLKKQQEEDRLKAEEQARLLE-QERLMEELRQQ 458
                 ++Q+       E + +LLE +E ++E + + 
Sbjct: 58  -----RERQRIISSALLEARRKLLEAKEEILESVFEA 89


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 34.7 bits (80), Expect = 0.077
 Identities = 24/75 (32%), Positives = 44/75 (58%)

Query: 363 NSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEE 422
           + N EE+  ++    +E+  ++E    EE+EKE+I    E+ N  K +E +++  K +EE
Sbjct: 10  HENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEE 69

Query: 423 NQKLLLKKQQEEDRL 437
           N+KL  + +  +DRL
Sbjct: 70  NKKLENELEALKDRL 84



 Score = 32.0 bits (73), Expect = 0.54
 Identities = 20/75 (26%), Positives = 39/75 (52%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
            + +  +EE+K K+  ++ E   KE   ++  E  +++I+E ++   K KE+ +K   E 
Sbjct: 18  CKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKKLENEL 77

Query: 421 EENQKLLLKKQQEED 435
           E  +  LL+   E D
Sbjct: 78  EALKDRLLRTVAEYD 92


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 34.1 bits (79), Expect = 0.078
 Identities = 15/63 (23%), Positives = 28/63 (44%), Gaps = 3/63 (4%)

Query: 371 KEKQRMVQEEIKRKELER---KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL 427
             KQ + +      ELE    + EL+ +  R K+E+   + + ++ E + KE E      
Sbjct: 48  SIKQHLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHA 107

Query: 428 LKK 430
           L +
Sbjct: 108 LHQ 110



 Score = 29.1 bits (66), Expect = 3.5
 Identities = 11/62 (17%), Positives = 22/62 (35%), Gaps = 4/62 (6%)

Query: 391 ELEKEQIRI--KEEQENIKKKKEQEEKERKEQEENQKLLLK--KQQEEDRLKAEEQARLL 446
           EL  + + +     +  ++ K  + +KE+       K L    K+ E +   A  Q    
Sbjct: 55  ELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALHQHHRF 114

Query: 447 EQ 448
             
Sbjct: 115 AL 116


>gnl|CDD|212799 cd11865, SH3_Nbp2-like, Src Homology 3 domain of Saccharomyces
           cerevisiae Nap1-binding protein 2 and similar fungal
           proteins.  This subfamily includes Saccharomyces
           cerevisiae Nbp2 (Nucleosome assembly protein 1
           (Nap1)-binding protein 2), Schizosaccharomyces pombe
           Skb5, and similar proteins. Nbp2 interacts with Nap1,
           which is essential for maintaining proper nucleosome
           structures in transcription and replication. It is also
           the binding partner of the yeast type II protein
           phosphatase Ptc1p and serves as a scaffolding protein
           that brings seven kinases in close contact to Ptc1p.
           Nbp2 plays a role many cell processes including
           organelle inheritance, mating hormone response, cell
           wall stress, mitotic cell growth at elevated
           temperatures, and high osmolarity. Skb5 interacts with
           the p21-activated kinase (PAK) homolog Shk1, which is
           critical for fission yeast cell viability. Skb5
           activates Shk1 and plays a role in regulating cell
           morphology and growth under hypertonic conditions. Nbp2
           and Skb5 contain an SH3 domain. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 31.7 bits (72), Expect = 0.079
 Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 2/54 (3%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGY--CHGQYGLFPANYVSL 523
           AVALYD++   D+E+ F    I+  +    +GW        G+ GL P  +VS 
Sbjct: 2   AVALYDFEPEHDNELGFAEGQILFILYKHGQGWLIAEDESGGKTGLVPEEFVSY 55


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 35.2 bits (81), Expect = 0.079
 Identities = 23/98 (23%), Positives = 50/98 (51%)

Query: 347 VKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENI 406
            K   K P +   P +    E  ++E++   Q + ++K+ + K + E +  + KEE +  
Sbjct: 81  EKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140

Query: 407 KKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQAR 444
           +  KE+E+++ K+ EE +    +K++E  R K+  +  
Sbjct: 141 RPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKP 178



 Score = 31.8 bits (72), Expect = 0.90
 Identities = 33/190 (17%), Positives = 76/190 (40%), Gaps = 15/190 (7%)

Query: 263 HVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEE 322
            V+    +          E ++    EEEKEK            Q  EE++  + K KEE
Sbjct: 79  RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKE-----------QVKEEKKKKKEKPKEE 127

Query: 323 EIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIK 382
             ++K    +     P  E      K + +    +    +   + +   +K    +   K
Sbjct: 128 PKDRKPKEEAKEK-RPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNK 186

Query: 383 RKEL---ERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKA 439
           +KE    E++ +  +E ++ K E+ ++ +++E+EE + K++E     + + +  +    +
Sbjct: 187 KKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEIS 246

Query: 440 EEQARLLEQE 449
              +  L++ 
Sbjct: 247 RRSSSSLKKP 256



 Score = 31.0 bits (70), Expect = 1.8
 Identities = 40/214 (18%), Positives = 78/214 (36%), Gaps = 8/214 (3%)

Query: 248 TNASKENIKPKPNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQ 307
           +  S +    +   G  K    + K     K    +  ++ +E+ ++ K K +   ++  
Sbjct: 69  SKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEP 128

Query: 308 KLEERRL-----SELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQ 362
           K  + +         K KE+E EKK+    D       E   V  KS  K+P     P +
Sbjct: 129 KDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRER--VRAKSRPKKPPKKKPPNK 186

Query: 363 NSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEE 422
                EEEK++Q   +E +K K  E     E+E+     +         +E++ R+  E 
Sbjct: 187 KKEPPEEEKQRQA-AREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEI 245

Query: 423 NQKLLLKKQQEEDRLKAEEQARLLEQERLMEELR 456
           +++     ++ +              +R     R
Sbjct: 246 SRRSSSSLKKPDPSPSMASPETRESSKRTETRPR 279


>gnl|CDD|212926 cd11993, SH3_Intersectin1_4, Fourth Src homology 3 domain (or SH3D)
           of Intersectin-1.  Intersectin-1 (ITSN1) is an adaptor
           protein that functions in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction. It
           plays a role in clathrin-coated pit (CCP) formation. It
           binds to many proteins through its multidomain structure
           and facilitate the assembly of multimeric complexes.
           ITSN1 localizes in membranous organelles, CCPs, the
           Golgi complex, and may be involved in the cell membrane
           trafficking system. It exists in alternatively spliced
           short and long isoforms. The short isoform contains two
           Eps15 homology domains (EH1 and EH2), a coiled-coil
           region and five SH3 domains (SH3A-E), while the long
           isoform, in addition, contains RhoGEF (also called
           Dbl-homologous or DH), Pleckstrin homology (PH) and C2
           domains. The fourth SH3 domain (or SH3D) of ITSN1 has
           been shown to bind SHIP2, Numb, CdGAP, and N-WASP. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 65

 Score = 32.0 bits (72), Expect = 0.080
 Identities = 18/57 (31%), Positives = 26/57 (45%), Gaps = 5/57 (8%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHG-----QYGLFPANYVSL 523
           A  +  Y A+  ++++  P  +I   +    GWW G         Q G FPANYV L
Sbjct: 6   AQVIASYTATGPEQLTLAPGQLILIRKKNPGGWWEGELQARGKKRQIGWFPANYVKL 62


>gnl|CDD|212877 cd11944, SH3_Endophilin_B2, Src homology 3 domain of Endophilin-B2.
            Endophilin-B2, also called SH3GLB2 (SH3-domain
           GRB2-like endophilin B2), is a cytoplasmic protein that
           interacts with the apoptosis inducer Bax. It is
           overexpressed in prostate cancer metastasis and has been
           identified as a cancer antigen with potential utility in
           immunotherapy. Endophilins play roles in synaptic
           vesicle formation, virus budding, mitochondrial
           morphology maintenance, receptor-mediated endocytosis
           inhibition, and endosomal sorting. They contain an
           N-terminal N-BAR domain (BAR domain with an additional
           N-terminal amphipathic helix), followed by a variable
           region containing proline clusters, and a C-terminal SH3
           domain. Endophilin-B2 forms homo- and heterodimers (with
           endophilin-B1) through its BAR domain. The related
           protein endophilin-B1 interacts with amphiphysin 1 and
           dynamin 1 through its SH3 domain. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 55

 Score = 31.5 bits (71), Expect = 0.082
 Identities = 18/54 (33%), Positives = 28/54 (51%), Gaps = 2/54 (3%)

Query: 472 AVALYDYQASADDEISFDPDDIIT--NIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           A  LYDY+A+   E++   D++IT  ++  +D  W  G    Q G  P  Y+ L
Sbjct: 2   ARVLYDYEAADSSELALLADELITVYSLPGMDPDWLIGERGNQKGKVPVTYLEL 55


>gnl|CDD|240439 PTZ00491, PTZ00491, major vault protein; Provisional.
          Length = 850

 Score = 35.4 bits (82), Expect = 0.082
 Identities = 19/60 (31%), Positives = 28/60 (46%), Gaps = 6/60 (10%)

Query: 404 ENIKKKKEQE---EKERKEQEENQKLLLKKQQEEDRLKAEEQAR-LLEQERLMEELRQQG 459
           E   K +E     + E  EQE   +  L++Q+  D+ KAEEQ   LLE +     +   G
Sbjct: 658 EITTKSQEAAARHQAELLEQEA--RGRLERQKMHDKAKAEEQRTKLLELQAESAAVESSG 715


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 35.3 bits (81), Expect = 0.087
 Identities = 33/125 (26%), Positives = 53/125 (42%), Gaps = 11/125 (8%)

Query: 352 KQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKE 411
           +   S    IQ  N+  +++ + +  Q E++RK     E  E+E  R  E +  +K  +E
Sbjct: 62  ELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEE 121

Query: 412 QE---EKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLM--------EELRQQGT 460
            E   E E  E EE  KLL  K   E      E+   L++ +           E++ +  
Sbjct: 122 LEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQ 181

Query: 461 NEDTE 465
           N DTE
Sbjct: 182 NADTE 186



 Score = 33.4 bits (76), Expect = 0.34
 Identities = 26/105 (24%), Positives = 48/105 (45%), Gaps = 9/105 (8%)

Query: 363 NSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEE 422
           ++  E  E+E  R ++ E++ K LE  E+  + +    EE+  + K K   E  + + E+
Sbjct: 96  STLAENYERELDRNLELEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEK 155

Query: 423 NQKLL--------LKKQQEEDRLKAEEQ-ARLLEQERLMEELRQQ 458
             +L         +K    E + +A+     L   E  +EELR+Q
Sbjct: 156 EDQLKEAKESISRIKNDLSEMQCRAQNADTELKLLESELEELREQ 200



 Score = 33.4 bits (76), Expect = 0.37
 Identities = 40/198 (20%), Positives = 87/198 (43%), Gaps = 21/198 (10%)

Query: 269 LRAKFENLAKQTEEESRKRSEEEKEKRKLKDQID-------LEQAQKLEERRLSELKIKE 321
           L  + + L +  ++   + +E E+E + LKD++D        E+  +L+E + S  +IK 
Sbjct: 112 LEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKN 171

Query: 322 EEIEKKLNGHSDVPLSPSTETPPVPVKSILK----------QPTSDGIPIQNSNKEEEEK 371
           +  E +    +        E+    ++  L+          +        Q S+ +   K
Sbjct: 172 DLSEMQCRAQNADTELKLLESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVK 231

Query: 372 EKQRMVQEEIKRKELERK-EELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKK 430
            K   ++EE+KR E + +  +  KEQ+    E E       +E ++ +  +E+   LLK+
Sbjct: 232 IKH--LEEELKRYEQDAEVVKSMKEQLLQIPELERELAALREENRKLRSMKED-NELLKE 288

Query: 431 QQEEDRLKAEEQARLLEQ 448
           + E+ + + E   ++ E+
Sbjct: 289 ELEDLQSRLERFEKMREK 306


>gnl|CDD|212829 cd11896, SH3_SNX33, Src Homology 3 domain of Sorting Nexin 33.
           SNX33 interacts with Wiskott-Aldrich syndrome protein
           (WASP) and plays a role in the maintenance of cell shape
           and cell cycle progression. It modulates the shedding
           and endocytosis of cellular prion protein (PrP(c)) and
           amyloid precursor protein (APP). SNXs are Phox homology
           (PX) domain containing proteins that are involved in
           regulating membrane traffic and protein sorting in the
           endosomal system. SNX33 also contains BAR and SH3
           domains. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 31.5 bits (71), Expect = 0.087
 Identities = 19/54 (35%), Positives = 32/54 (59%), Gaps = 2/54 (3%)

Query: 472 AVALYDYQASADDEISFDPD-DIITNIEMIDEGWWRGY-CHGQYGLFPANYVSL 523
           A ALY +Q+   +EI+   + +++   E   +GW +G    G+ GLFPA+YV +
Sbjct: 2   ARALYSFQSENKEEINIQENEELVIFSENSLDGWLQGQNSRGETGLFPASYVEI 55


>gnl|CDD|212954 cd12021, SH3_p47phox_1, First or N-terminal Src homology 3 domain
           of the p47phox subunit of NADPH oxidase, also called
           Neutrophil Cytosolic Factor 1.  p47phox, or NCF1, is a
           cytosolic subunit of the phagocytic NADPH oxidase
           complex (also called Nox2 or gp91phox), which plays a
           key role in the ability of phagocytes to defend against
           bacterial infections. NADPH oxidase catalyzes the
           transfer of electrons from NADPH to oxygen during
           phagocytosis forming superoxide and reactive oxygen
           species. p47phox is required for activation of NADH
           oxidase and plays a role in translocation. It contains
           an N-terminal Phox homology (PX) domain, tandem SH3
           domains (N-SH3 and C-SH3), a polybasic/autoinhibitory
           region, and a C-terminal proline-rich region (PRR). This
           model characterizes the first SH3 domain (or N-SH3) of
           p47phox. In its inactive state, the tandem SH3 domains
           interact intramolecularly with the autoinhibitory
           region; upon activation, the tandem SH3 domains are
           exposed through a conformational change, resulting in
           their binding to the PRR of p22phox and the activation
           of NADPH oxidase. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 53

 Score = 31.5 bits (71), Expect = 0.089
 Identities = 16/52 (30%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 470 YTAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           Y A+A  DY+ S+  E++    D++  +E  + GWW      + G  PA+Y+
Sbjct: 2   YRAIA--DYEKSSKSEMALKTGDVVEVVEKSENGWWFCQLKAKRGWVPASYL 51


>gnl|CDD|212948 cd12015, SH3_Tks_1, First Src homology 3 domain of Tyrosine kinase
           substrate (Tks) proteins.  Tks proteins are Src
           substrates and scaffolding proteins that play important
           roles in the formation of podosomes and invadopodia, the
           dynamic actin-rich structures that are related to cell
           migration and cancer cell invasion. Vertebrates contain
           two Tks proteins, Tks4 (Tyr kinase substrate with four
           SH3 domains) and Tks5 (Tyr kinase substrate with five
           SH3 domains), which display partially overlapping but
           non-redundant functions. Both associate with the ADAMs
           family of transmembrane metalloproteases, which function
           as sheddases and mediators of cell and matrix
           interactions. Tks5 interacts with N-WASP and Nck, while
           Tks4 is essential for the localization of MT1-MMP
           (membrane-type 1 matrix metalloproteinase) to
           invadopodia. Tks proteins contain an N-terminal Phox
           homology (PX) domain and four or five SH3 domains. This
           model characterizes the first SH3 domain of Tks
           proteins. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 31.2 bits (71), Expect = 0.095
 Identities = 18/51 (35%), Positives = 24/51 (47%), Gaps = 2/51 (3%)

Query: 470 YTAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANY 520
           Y  VA  DY+    +EIS    D++  IE  + GWW      + G  PA Y
Sbjct: 2   YVVVA--DYKKQQPNEISLRAGDVVDVIEKNENGWWFVSLEDEQGWVPATY 50


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 35.3 bits (82), Expect = 0.097
 Identities = 36/119 (30%), Positives = 60/119 (50%), Gaps = 18/119 (15%)

Query: 364 SNKEEEEKEKQRMVQEEIKRKEL---------ERKEELEKEQIRIKEEQ----ENIKKKK 410
           S  EE ++  +R++Q EI+R+ L         ER E+LEKE   ++EE     E  K +K
Sbjct: 402 SKPEELDELDRRIIQLEIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEK 461

Query: 411 E--QEEKERKEQEENQKLLLKK-QQEEDRLKAEE--QARLLEQERLMEELRQQGTNEDT 464
              Q  ++ KE+ E  +L L++ ++E D  KA E    +L E E+ ++    +   E  
Sbjct: 462 AAIQGIQQIKEEIEQVRLELEQAEREGDLAKAAELQYGKLPELEKRLQAAEAKLGEETK 520


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 35.1 bits (80), Expect = 0.100
 Identities = 25/90 (27%), Positives = 53/90 (58%), Gaps = 5/90 (5%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKL 426
           E+E   ++ +  + +++K +ER E  E+E++  +E  E I++++ + E+  +E+ E  +L
Sbjct: 441 EKENAHRKALEMKILEKKRIERLEREERERLE-RERMERIERERLERERLERERLERDRL 499

Query: 427 LLKKQQEEDRLKAEEQARLLEQERLMEELR 456
              ++   DRL+ E   RL E++RL +  R
Sbjct: 500 ---ERDRLDRLERERVDRL-ERDRLEKARR 525


>gnl|CDD|212938 cd12005, SH3_Lck, Src homology 3 domain of Lck Protein Tyrosine
           Kinase.  Lck is a member of the Src subfamily of
           proteins, which are cytoplasmic (or non-receptor) PTKs.
           Lck is expressed in T-cells and natural killer cells. It
           plays a critical role in T-cell maturation, activation,
           and T-cell receptor (TCR) signaling. Lck phosphorylates
           ITAM (immunoreceptor tyr activation motif) sequences on
           several subunits of TCRs, leading to the activation of
           different second messenger cascades. Phosphorylated
           ITAMs serve as binding sites for other signaling factor
           such as Syk and ZAP-70, leading to their activation and
           propagation of downstream events. In addition, Lck
           regulates drug-induced apoptosis by interfering with the
           mitochondrial death pathway. The apototic role of Lck is
           independent of its primary function in T-cell signaling.
           Src kinases contain an N-terminal SH4 domain with a
           myristoylation site, followed by SH3 and SH2 domains, a
           tyr kinase domain, and a regulatory C-terminal region
           containing a conserved tyr. They are activated by
           autophosphorylation at the tyr kinase domain, but are
           negatively regulated by phosphorylation at the
           C-terminal tyr by Csk (C-terminal Src Kinase). The SH3
           domain of Src kinases contributes to substrate
           recruitment by binding adaptor proteins/substrates, and
           regulation of kinase activity through an intramolecular
           interaction. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 54

 Score = 31.3 bits (71), Expect = 0.10
 Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 3/52 (5%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYC--HGQYGLFPANYVS 522
           VALY Y+ S D ++ F+  + +  +E   E WW+      GQ G  P N+V+
Sbjct: 3   VALYSYEPSHDGDLGFEKGEKLRILEQSGE-WWKAQSLTTGQEGFIPFNFVA 53


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 34.2 bits (78), Expect = 0.11
 Identities = 20/97 (20%), Positives = 43/97 (44%), Gaps = 5/97 (5%)

Query: 337 SPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQ 396
             +   P     +I +  +    P +  N  EE  +    +   ++ K  + K+E    +
Sbjct: 146 PTTMTLPIAEHPTITENQSFQPWPERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSK 205

Query: 397 IRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQE 433
           +     QE +++ K +E++ ++EQEE +     KQ+E
Sbjct: 206 L-----QEYLQEHKTEEKQPQEEQEEEEVEEEAKQEE 237


>gnl|CDD|212734 cd11800, SH3_DNMBP_C2_like, Second C-terminal Src homology 3 domain
           of Dynamin Binding Protein, also called Tuba, and
           similar domains.  DNMBP or Tuba is a cdc42-specific
           guanine nucleotide exchange factor (GEF) that contains
           four N-terminal SH3 domains, a central RhoGEF [or Dbl
           homology (DH)] domain followed by a Bin/Amphiphysin/Rvs
           (BAR) domain, and two C-terminal SH3 domains. It
           provides a functional link between dynamin, Rho GTPase
           signaling, and actin dynamics. It plays an important
           role in regulating cell junction configuration. The
           C-terminal SH3 domains of DNMBP bind to N-WASP and
           Ena/VASP proteins, which are key regulatory proteins of
           the actin cytoskeleton. Also included in this subfamily
           is the second C-terminal SH3 domain of Rho guanine
           nucleotide exchange factor 37 (ARHGEF37), whose function
           is still unknown. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 57

 Score = 31.2 bits (71), Expect = 0.11
 Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEG----WWRGYCHGQYGLFPANY 520
            ALY ++A +  E+S     ++T +E  D      WW     G+ G  P+NY
Sbjct: 3   YALYTFEARSPGELSVTEGQVVTVLEKHDLKGNPEWWLVEDRGKQGYVPSNY 54


>gnl|CDD|212899 cd11966, SH3_ASAP2, Src homology 3 domain of ArfGAP with SH3
           domain, ankyrin repeat and PH domain containing protein
           2.  ASAP2 is also called DDEF2 (Development and
           Differentiation Enhancing Factor 2), AMAP2, centaurin
           beta-3, or PAG3. It mediates the functions of Arf
           GTPases vial dual mechanisms: it exhibits GTPase
           activating protein (GAP) activity towards class I (Arf1)
           and II (Arf5) Arfs; and it binds class III Arfs
           (GTP-Arf6) stably without GAP activity. It binds
           paxillin and is implicated in Fcgamma receptor-mediated
           phagocytosis in macrophages and in cell migration. ASAP2
           contains an N-terminal BAR domain, followed by a
           Pleckstrin homology (PH) domain, an Arf GAP domain,
           ankyrin (ANK) repeats, and a C-terminal SH3 domain. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 56

 Score = 31.5 bits (71), Expect = 0.11
 Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 3/51 (5%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQ---YGLFPANYV 521
           ALY+  A   DE++F   +II      D+ WW G+  G+    G FP ++V
Sbjct: 4   ALYNCVADNPDELTFSEGEIIIVDGEEDKEWWIGHIDGEPTRRGAFPVSFV 54


>gnl|CDD|233221 TIGR00985, 3a0801s04tom, mitochondrial import receptor subunit
           translocase of outer membrane 20 kDa subunit.
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 148

 Score = 33.4 bits (76), Expect = 0.11
 Identities = 19/84 (22%), Positives = 41/84 (48%), Gaps = 1/84 (1%)

Query: 394 KEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEE-QARLLEQERLM 452
           ++++R + +++   +K+     + K+++  +       +  D     E +A  L++ +L 
Sbjct: 38  RKKLRRRRKKQAGAEKQYGGLAKEKQRQRIRPAAAGLAKAPDPTDPSEKEAFFLQEVQLG 97

Query: 453 EELRQQGTNEDTEEDLGYTAVALY 476
           EEL  QGTN D      Y A+ +Y
Sbjct: 98  EELMAQGTNVDEGAVHFYNALKVY 121


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 34.4 bits (79), Expect = 0.12
 Identities = 26/148 (17%), Positives = 53/148 (35%), Gaps = 15/148 (10%)

Query: 223 HQSQKDDNRQVVTSSKVKDLIAANSTNASKENIKPKPNIGHVKPSNLRAKFENLAKQTEE 282
            Q  +   +Q    + +  L      N  +  I P      +   N   K    A++ E 
Sbjct: 289 EQRAQYREKQKEKKAFLWTL----RRNRLRMIITPW-RAPELHAENAEIKKTRTAEKNEA 343

Query: 283 ESRKRSEEEKEK---RKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPS 339
           ++RK+   +K +   R++  +   E+A  +   R    +    + +K       +  SP+
Sbjct: 344 KARKKEIAQKRRAAEREINREARQERAAAMARARA---RRAAVKAKK----KGLIDASPN 396

Query: 340 TETPPVPVKSILKQPTSDGIPIQNSNKE 367
            +TP    +S    P  +       N+E
Sbjct: 397 EDTPSENEESKGSPPQVEATTTAEPNRE 424



 Score = 33.7 bits (77), Expect = 0.21
 Identities = 17/82 (20%), Positives = 29/82 (35%), Gaps = 8/82 (9%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
           I+ +   E+ + K R  +   KR+  ER+   E  Q R             +    R   
Sbjct: 332 IKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQER--------AAAMARARARRAAV 383

Query: 421 EENQKLLLKKQQEEDRLKAEEQ 442
           +  +K L+     ED     E+
Sbjct: 384 KAKKKGLIDASPNEDTPSENEE 405



 Score = 33.3 bits (76), Expect = 0.32
 Identities = 17/96 (17%), Positives = 44/96 (45%)

Query: 371 KEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKK 430
           + + RM+    +  EL  +    K+    ++ +   +KK+  +++   E+E N++   ++
Sbjct: 310 RNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQER 369

Query: 431 QQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEE 466
                R +A   A   +++ L++    + T  + EE
Sbjct: 370 AAAMARARARRAAVKAKKKGLIDASPNEDTPSENEE 405



 Score = 29.4 bits (66), Expect = 5.1
 Identities = 14/69 (20%), Positives = 24/69 (34%), Gaps = 1/69 (1%)

Query: 368 EEEKEKQRMVQEEIKR-KELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKL 426
           +E  +K+R  + EI R    ER   + + + R    +   K   +    E    E  +  
Sbjct: 348 KEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEESK 407

Query: 427 LLKKQQEED 435
               Q E  
Sbjct: 408 GSPPQVEAT 416


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 34.9 bits (80), Expect = 0.12
 Identities = 23/74 (31%), Positives = 42/74 (56%), Gaps = 6/74 (8%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL 427
           E E EK+ + +EE++RK  ++KEE  KE    KE ++    +KE + K + +Q  +   +
Sbjct: 5   ESEAEKKILTEEELERK--KKKEEKAKE----KELKKLKAAQKEAKAKLQAQQASDGTNV 58

Query: 428 LKKQQEEDRLKAEE 441
            KK +++ R +  E
Sbjct: 59  PKKSEKKSRKRDVE 72



 Score = 34.5 bits (79), Expect = 0.15
 Identities = 19/61 (31%), Positives = 40/61 (65%), Gaps = 5/61 (8%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEELEKE-QIRIKEEQE----NIKKKKEQEEKERKEQEE 422
           EEE E+++  +E+ K KEL++ +  +KE + +++ +Q     N+ KK E++ ++R  ++E
Sbjct: 15  EEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVEDE 74

Query: 423 N 423
           N
Sbjct: 75  N 75



 Score = 32.6 bits (74), Expect = 0.58
 Identities = 21/71 (29%), Positives = 38/71 (53%), Gaps = 3/71 (4%)

Query: 384 KELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQA 443
            E E+K   E+E  R K+++E   K KE+E K+ K  ++  K  L+ QQ  D     +++
Sbjct: 6   SEAEKKILTEEELERKKKKEE---KAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKS 62

Query: 444 RLLEQERLMEE 454
               ++R +E+
Sbjct: 63  EKKSRKRDVED 73



 Score = 29.1 bits (65), Expect = 6.8
 Identities = 22/82 (26%), Positives = 38/82 (46%), Gaps = 7/82 (8%)

Query: 388 RKEELEKEQIRIKEEQENIKKKKEQE--EKERKEQEENQKLLLKKQQEEDRLKAEEQARL 445
            + E E E+  + EE+   KKKKE++  EKE K+ +  QK    K Q +         + 
Sbjct: 2   SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKK 61

Query: 446 LEQERLMEELRQQGTNEDTEED 467
            E++      R++   ++  ED
Sbjct: 62  SEKKS-----RKRDVEDENPED 78


>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH.  This
           bacillus clade of FliH proteins is not found by the Pfam
           FliH model pfam02108, but is closely related to the
           sequences identified by that model. Sequences identified
           by this model are observed in flagellar operons in an
           analogous position relative to other flagellar operon
           genes.
          Length = 255

 Score = 34.4 bits (79), Expect = 0.12
 Identities = 26/79 (32%), Positives = 45/79 (56%), Gaps = 5/79 (6%)

Query: 381 IKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAE 440
           I  +++    E E+E    ++E E I +K E E  +  EQ E Q   +++Q E++R + E
Sbjct: 20  IPLRQVTDVPETEQELADEEQEFEQILEKAEAEAAQIIEQAEAQAAAIREQIEQERAQWE 79

Query: 441 EQARLLEQERLMEELRQQG 459
           E     E+ERL++E +Q+G
Sbjct: 80  E-----ERERLIQEAKQEG 93


>gnl|CDD|212869 cd11936, SH3_UBASH3B, Src homology 3 domain of Ubiquitin-associated
           and SH3 domain-containing protein B.  UBASH3B, also
           called Suppressor of T cell receptor Signaling (STS)-1
           or T cell Ubiquitin LigAnd (TULA)-2 is an active
           phosphatase that is expressed ubiquitously. The
           phosphatase activity of UBASH3B is essential for its
           roles in the suppression of TCR signaling and the
           regulation of EGFR. It also interacts with Syk and
           functions as a negative regulator of platelet
           glycoprotein VI signaling. TULA proteins contain an
           N-terminal UBA domain, a central SH3 domain, and a
           C-terminal histidine phosphatase domain. They bind c-Cbl
           through the SH3 domain and to ubiquitin via UBA. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 62

 Score = 31.2 bits (70), Expect = 0.13
 Identities = 20/58 (34%), Positives = 24/58 (41%), Gaps = 6/58 (10%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEM----IDEGWWRG--YCHGQYGLFPANYVS 522
           T   +Y Y    DDE+   P D I    M      EGW  G     G  GL P NY++
Sbjct: 3   TLQVIYPYTPQNDDELELVPGDYIFMSPMEQTSTSEGWIYGTSLTTGCSGLLPENYIT 60


>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
           Provisional.
          Length = 1109

 Score = 34.6 bits (80), Expect = 0.13
 Identities = 41/204 (20%), Positives = 75/204 (36%), Gaps = 41/204 (20%)

Query: 251 SKENIKPKPNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLE 310
           + E  +P+    ++    L  +   ++ Q  E+SR+  +E+   R++ D +     Q+ E
Sbjct: 89  NNERDEPRSVPPNMSTDALEQEILQVSSQLLEKSRQAQQEQDRAREISDSLSQLPQQQTE 148

Query: 311 ERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEE 370
            RR         EIE++L          +  TP                           
Sbjct: 149 ARRQL------NEIERRLQT------LGTPNTP-------------------------LA 171

Query: 371 KEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKK 430
           + +   +Q E    +    +ELE  Q+     QE  + + E  +K R +Q +     L+ 
Sbjct: 172 QAQLTALQAESAALKA-LVDELELAQLSANNRQELARLRSELAKK-RSQQLDAYLQALRN 229

Query: 431 QQEEDRLKAEEQARLLEQERLMEE 454
           Q    R +  E+A  LE   L+ E
Sbjct: 230 QLNSQRQREAERA--LESTELLAE 251


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 34.8 bits (80), Expect = 0.14
 Identities = 32/180 (17%), Positives = 57/180 (31%), Gaps = 23/180 (12%)

Query: 271 AKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNG 330
           A +  L +    E  + +E   E R  +D I   +       RL E K  E+E+      
Sbjct: 205 ATYHKLLESRRAEHARLAELRSELRADRDHIRALRDAVELWPRLQEWKQLEQELT----- 259

Query: 331 HSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKE 390
                            +        DG+ ++   +E   ++ +   + +     L   +
Sbjct: 260 -------------RRREELATFPR--DGV-LRLEKREAHLQKTEA--EIDALLVRLAELK 301

Query: 391 ELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQER 450
           +L  + I  KE       +  Q+  E K         L   + + R K E      E ER
Sbjct: 302 DLASQLIPAKEAVLQALVRLHQQLSEIKASAFELTETLAGIEADLRDKEEAAGNGFEAER 361


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 34.5 bits (80), Expect = 0.14
 Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 10/77 (12%)

Query: 387 ERKEELEKEQIRIKEEQENIKK------KKEQEEKERKEQEENQKLLLKKQQEEDRLKAE 440
           E  +ELE+E   ++EE E IKK      KK + +   +++E+ +KLL  K++  + L+  
Sbjct: 334 EELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEELEEL 393

Query: 441 EQARLLEQERLMEELRQ 457
           E+    E + L EEL  
Sbjct: 394 EE----ELKELKEELES 406



 Score = 33.0 bits (76), Expect = 0.41
 Identities = 22/74 (29%), Positives = 43/74 (58%), Gaps = 3/74 (4%)

Query: 369 EEKEKQRMVQEEIKRKELERKEELEK--EQIRIKEEQENIKKKKEQEEKERKEQEENQKL 426
           E KE+ + ++EE+K  E E  E+++K  +++  K   +   +K+EQ EK  + +E+  + 
Sbjct: 331 ELKEELKELEEELKELE-EELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEE 389

Query: 427 LLKKQQEEDRLKAE 440
           L + ++E   LK E
Sbjct: 390 LEELEEELKELKEE 403


>gnl|CDD|212875 cd11942, SH3_JIP2, Src homology 3 domain of JNK-interacting protein
           2.  JNK-interacting protein 2 (JIP2) is also called
           Mitogen-activated protein kinase 8-interacting protein 2
           (MAPK8IP2) or Islet-brain-2 (IB2). It is widely
           expressed in the brain, where it forms complexes with
           fibroblast growth factor homologous factors (FHFs),
           which facilitates activation of the p38delta MAPK. JIP2
           is enriched in postsynaptic densities and may play a
           role in motor and cognitive function. In addition to a
           JNK binding domain, JIP2 also contains SH3 and
           Phosphotyrosine-binding (PTB) domains. The SH3 domain of
           the related protein JIP1 homodimerizes at the interface
           usually involved in proline-rich ligand recognition,
           despite the lack of this motif in the domain itself. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 31.0 bits (70), Expect = 0.15
 Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH--GQYGLFPANY 520
           T  A++ +    +DE+  D DD +      D+ W+RGY    G+ G+FPA Y
Sbjct: 1   THRAVFRFIPRHEDELELDVDDPLLVEAEEDDYWYRGYNMRTGERGIFPAFY 52


>gnl|CDD|227635 COG5325, COG5325, t-SNARE complex subunit, syntaxin [Intracellular
           trafficking and secretion].
          Length = 283

 Score = 34.0 bits (78), Expect = 0.15
 Identities = 32/154 (20%), Positives = 62/154 (40%), Gaps = 14/154 (9%)

Query: 267 SNLRAKFENLAKQTEEESRK----RSEEEKEKRKLKDQI--DLEQAQKLEERRLSELKIK 320
           + +R     L K   + +      +SE+E E  +L  ++  DL++ +K+ + +   L+  
Sbjct: 48  TAVRRSISRLGKVYAKHTEPSFSDKSEKEDEIDELSKKVNQDLQRCEKILKTKYKNLQSS 107

Query: 321 --EEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQ 378
             + ++ + LN           ++       +L+              EEEE E+     
Sbjct: 108 FLQSKLLRDLNTECMEGQRIQQKSAQFRKYQVLQAKFLRNKNNDQHPLEEEEDEESL--S 165

Query: 379 EEIKRKELER----KEELEKEQIRIKEEQENIKK 408
               ++ L++     EELE +QI I E  E IK 
Sbjct: 166 SLGSQQTLQQQGLSNEELEYQQILITERDEEIKN 199


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 33.8 bits (78), Expect = 0.17
 Identities = 20/78 (25%), Positives = 37/78 (47%), Gaps = 4/78 (5%)

Query: 383 RKELERKEELEKEQIRIKEEQENIKKKKEQE--EKERKEQEENQKLLLKKQQEEDRLKAE 440
            K        E  + R+ EE+  +KKK  ++  EKE++ +E  Q L+ ++++   +L  E
Sbjct: 295 LKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKL--E 352

Query: 441 EQARLLEQERLMEELRQQ 458
           E  + LE      E  + 
Sbjct: 353 EIQKKLEDLEKRLEKLKS 370



 Score = 29.6 bits (67), Expect = 4.8
 Identities = 14/68 (20%), Positives = 40/68 (58%), Gaps = 1/68 (1%)

Query: 363 NSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEE 422
           +     +E  + R+ +EE + K+   ++  EKE+ R++E ++N+ +++++   + +E ++
Sbjct: 298 SGEPSLKEIHEARLNEEERELKKKFTEKIREKEK-RLEELEQNLIEERKELNSKLEEIQK 356

Query: 423 NQKLLLKK 430
             + L K+
Sbjct: 357 KLEDLEKR 364



 Score = 29.2 bits (66), Expect = 4.9
 Identities = 15/72 (20%), Positives = 32/72 (44%), Gaps = 6/72 (8%)

Query: 360 PIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERK- 418
              +  +  EE+ + +    E K +E E++ E  ++ +     +E  +   + EE ++K 
Sbjct: 304 KEIHEARLNEEERELKKKFTE-KIREKEKRLEELEQNLI----EERKELNSKLEEIQKKL 358

Query: 419 EQEENQKLLLKK 430
           E  E +   LK 
Sbjct: 359 EDLEKRLEKLKS 370


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
           coiled-coil proteins that are involved in pre-rRNA
           processing.
          Length = 105

 Score = 32.0 bits (73), Expect = 0.18
 Identities = 23/72 (31%), Positives = 42/72 (58%), Gaps = 1/72 (1%)

Query: 369 EEKEKQRMVQEEIKRKELERKEELEKE-QIRIKEEQENIKKKKEQEEKERKEQEENQKLL 427
           E++ ++R+ Q+ IK +E E K+E E E Q RI+  +E    K+E+E  E+   + + K +
Sbjct: 31  EKRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEKERYEKMAAKMHAKKV 90

Query: 428 LKKQQEEDRLKA 439
            + ++ E R K 
Sbjct: 91  ERLKRREKRNKL 102


>gnl|CDD|149180 pfam07960, CBP4, CBP4.  The CBP4 in S. cerevisiae is essential for
           the expression and activity of ubiquinol-cytochrome c
           reductase. This family appears to be fungal specific.
          Length = 128

 Score = 32.4 bits (74), Expect = 0.19
 Identities = 24/67 (35%), Positives = 35/67 (52%), Gaps = 3/67 (4%)

Query: 350 ILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKK 409
           ILK+ +    PI  +   E   EK+R V+E  K K  E ++E   E  RI+EE E  + +
Sbjct: 65  ILKETSKSDKPIWKTGPIESPWEKKRNVREFQKTKAEEAQKE---ELERIREELEEARAQ 121

Query: 410 KEQEEKE 416
            E+  KE
Sbjct: 122 SEEMRKE 128


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 33.0 bits (76), Expect = 0.19
 Identities = 29/82 (35%), Positives = 49/82 (59%), Gaps = 10/82 (12%)

Query: 382 KRKELER-KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKK---QQEEDRL 437
            +  LE+ K+ELE+ + RI E Q  I    E+ +K R+E EE  +LL +    ++E  +L
Sbjct: 67  LKTRLEKLKKELEELKQRIAELQAQI----EKLKKGREETEERTELLEELKQLEKELKKL 122

Query: 438 KAE-EQARLLEQERLMEELRQQ 458
           KAE E+    + ER+ E+L+++
Sbjct: 123 KAELEKYEKNDPERI-EKLKEE 143



 Score = 29.5 bits (67), Expect = 3.0
 Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 1/65 (1%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
           I     + E+ +K R   EE   + LE  ++LEKE  ++K E E  +K   +  ++ KE+
Sbjct: 85  IAELQAQIEKLKKGREETEER-TELLEELKQLEKELKKLKAELEKYEKNDPERIEKLKEE 143

Query: 421 EENQK 425
            +  K
Sbjct: 144 TKVAK 148


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 33.8 bits (78), Expect = 0.20
 Identities = 24/146 (16%), Positives = 57/146 (39%), Gaps = 10/146 (6%)

Query: 311 ERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEE 370
           E  L  +K  EE + K LN  + +   P        +K  +         ++    E   
Sbjct: 153 EENLEGMKRDEELLNKDLNLINSI--KPKLRKKLQALKEEIAS-------LRQLADELNL 203

Query: 371 KEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKK 430
            +   + +   + + L  K   +++Q+   +++        +    +K +   +    +K
Sbjct: 204 CDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEK 263

Query: 431 QQEEDRL-KAEEQARLLEQERLMEEL 455
            +EE R   A+E ++L  +  L+++L
Sbjct: 264 IREECRGWSAKEISKLKAKVSLLQKL 289


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 34.3 bits (78), Expect = 0.21
 Identities = 34/118 (28%), Positives = 62/118 (52%), Gaps = 9/118 (7%)

Query: 320  KEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQE 379
             EE        +S +P    ++ P +  +SI        +P+    +E+ E +  R+V+E
Sbjct: 968  SEEMGYGAEEDYS-LPREIESKLP-LDKRSIAVVSRRIELPVPPECREKHEIKD-RIVKE 1024

Query: 380  EIK-RKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDR 436
             IK ++E ER E L++ +     E+E  KK+KE+E++ RK   +N K + KK+ ++ R
Sbjct: 1025 RIKDQEEKERMESLQRAK-----EEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077



 Score = 29.3 bits (65), Expect = 6.5
 Identities = 35/191 (18%), Positives = 75/191 (39%), Gaps = 8/191 (4%)

Query: 253 ENIKPKPNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEER 312
           E++ P+  I   K  + ++   +L  Q E E      +   +       + E+  + E  
Sbjct: 508 ESLSPEECIEEYKGESAKSSESDLVVQDEPEDFFDVSKVANE---SISSNHEKLMESEFE 564

Query: 313 RLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKE 372
            L +      +++ +    +      S E     ++   K    D    +NS+  E E+ 
Sbjct: 565 ELKKKWSSLAQLKSRFQKDA---TLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEES 621

Query: 373 KQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQ 432
           +   V  E +    E   E E+E+   K+E+  ++   E EE+   E+++      +K++
Sbjct: 622 RGSSVTAENEESADEVDYETEREENARKKEE--LRGNFELEERGDPEKKDVDWYTEEKRK 679

Query: 433 EEDRLKAEEQA 443
            E++LK     
Sbjct: 680 IEEQLKINRSE 690


>gnl|CDD|212812 cd11879, SH3_Bem1p_2, Second Src Homology 3 domain of Bud emergence
           protein 1 and similar domains.  Members of this
           subfamily bear similarity to Saccharomyces cerevisiae
           Bem1p, containing two Src Homology 3 (SH3) domains at
           the N-terminus, a central PX domain, and a C-terminal
           PB1 domain. Bem1p is a scaffolding protein that is
           critical for proper Cdc42p activation during bud
           formation in yeast. During budding and mating, Bem1p
           migrates to the plasma membrane where it can serve as an
           adaptor for Cdc42p and some other proteins. Bem1p also
           functions as an effector of the G1 cyclin Cln3p and the
           cyclin-dependent kinase Cdc28p in promoting vacuolar
           fusion. SH3 domains bind to proline-rich ligands with
           moderate affinity and selectivity, preferentially to
           PxxP motifs; they play a role in the regulation of
           enzymes by intramolecular interactions, changing the
           subcellular localization of signal pathway components
           and mediate multiprotein complex assemblies.
          Length = 56

 Score = 30.4 bits (69), Expect = 0.21
 Identities = 15/56 (26%), Positives = 24/56 (42%), Gaps = 5/56 (8%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEGWW----RGYCHGQYGLFPANYVSL 523
            + LYD++A   DE+     D I      +  W+     G   G  GL P ++V +
Sbjct: 2   GIVLYDFKAERPDELDAKAGDAIIICAHSNYEWFVAKPIGRL-GGPGLIPVSFVEI 56


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 34.0 bits (78), Expect = 0.21
 Identities = 27/96 (28%), Positives = 53/96 (55%), Gaps = 10/96 (10%)

Query: 364 SNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKE-----RK 418
           S  E+E ++K   +++ +K +E   K+ELE+E   +KE + N K + E+E +E     +K
Sbjct: 521 SALEKELEQKNEHLEKLLKEQEKL-KKELEQEMEELKERERNKKLELEKEAQEALKALKK 579

Query: 419 EQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEE 454
           E E     ++++ +E+   KA+E   + +  +L E 
Sbjct: 580 EVES----IIRELKEKKIHKAKEIKSIEDLVKLKET 611



 Score = 33.6 bits (77), Expect = 0.29
 Identities = 18/64 (28%), Positives = 37/64 (57%)

Query: 386 LERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARL 445
           +E+   LEKE  +  E  E + K++E+ +KE +++ E  K   + ++ E   +A+E  + 
Sbjct: 517 IEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKA 576

Query: 446 LEQE 449
           L++E
Sbjct: 577 LKKE 580



 Score = 33.6 bits (77), Expect = 0.30
 Identities = 27/78 (34%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 357 DGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKE 416
             I  Q      E KE+  ++ E++   E E +++ E  + ++ +EQE +KK+ EQE +E
Sbjct: 496 HFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLE-KLLKEQEKLKKELEQEMEE 554

Query: 417 RKEQEENQKLLLKKQQEE 434
            KE+E N+KL L+K+ +E
Sbjct: 555 LKERERNKKLELEKEAQE 572



 Score = 33.3 bits (76), Expect = 0.40
 Identities = 18/76 (23%), Positives = 39/76 (51%), Gaps = 2/76 (2%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRI-KEEQENIKKKKEQEEKERKEQEENQ 424
           K ++E E++    +E +R +    E+  +E ++  K+E E+I ++  +E+K  K +E   
Sbjct: 543 KLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIREL-KEKKIHKAKEIKS 601

Query: 425 KLLLKKQQEEDRLKAE 440
              L K +E  +   +
Sbjct: 602 IEDLVKLKETKQKIPQ 617



 Score = 29.0 bits (65), Expect = 7.7
 Identities = 31/166 (18%), Positives = 63/166 (37%), Gaps = 14/166 (8%)

Query: 265 KPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLE------ERRLSELK 318
           K  +L    +   K  +E  ++  E ++ +R  K +++ E  + L+      E  + ELK
Sbjct: 530 KNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELK 589

Query: 319 IKEEEIEKKLNGHSDVPLSPSTET--PPVPVKSILKQPTSDGIPIQNSNKE----EEEKE 372
            K+    K++    D+     T+   P  P      +   D + I+   ++    +    
Sbjct: 590 EKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKI-GDKVRIRYFGQKGKIVQILGG 648

Query: 373 KQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERK 418
            +  V     R ++    ELEK       ++  + K  + E KE  
Sbjct: 649 NKWNVTVGGMRMKVHGS-ELEKINKAPPPKKFKVPKTTKPEPKEAS 693


>gnl|CDD|212891 cd11958, SH3_RUSC1, Src homology 3 domain of RUN and SH3
           domain-containing protein 1.  RUSC1, also called NESCA
           (New molecule containing SH3 at the carboxy-terminus),
           is highly expressed in the brain and is translocated to
           the nuclear membrane from the cytoplasm upon stimulation
           with neurotrophin. It plays a role in facilitating
           neurotrophin-dependent neurite outgrowth. It also
           interacts with NEMO (or IKKgamma) and may function in
           NEMO-mediated activation of NF-kB. RUSC proteins are
           adaptor proteins consisting of RUN, leucine zipper, and
           SH3 domains. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 51

 Score = 30.2 bits (68), Expect = 0.21
 Identities = 15/49 (30%), Positives = 24/49 (48%), Gaps = 1/49 (2%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVS 522
           AL D  A ++ ++SF   + +  +  +DE W R     + GL P  Y S
Sbjct: 4   ALCD-HAGSESQLSFRKGEELQVLGTVDEDWIRCRRGDREGLVPVGYTS 51


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 34.0 bits (78), Expect = 0.21
 Identities = 22/77 (28%), Positives = 41/77 (53%)

Query: 391  ELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQER 450
            ELE E++++ E+ +  + +KE+E  E + QE+N       +Q+++RL+ EE+ +   QER
Sbjct: 2560 ELENEKLKLLEKAKIEESRKERERIESETQEDNTDEEQINRQQQERLQKEEEQKAYSQER 2619

Query: 451  LMEELRQQGTNEDTEED 467
            L  E+            
Sbjct: 2620 LNREVSGTDDTNKNHNT 2636


>gnl|CDD|212709 cd11775, SH3_Sla1p_3, Third Src Homology 3 domain of the fungal
           endocytic adaptor protein Sla1p.  Sla1p facilitates
           endocytosis by playing a role as an adaptor protein in
           coupling components of the actin cytoskeleton to the
           endocytic machinery. It interacts with Abp1p, Las17p and
           Pan1p, which are activator proteins of actin-related
           protein 2/3 (Arp2/3). Sla1p contains multiple domains
           including three SH3 domains, a SAM (sterile alpha motif)
           domain, and a Sla1 homology domain 1 (SHD1), which binds
           to the NPFXD motif that is found in many integral
           membrane proteins such as the Golgi-localized
           Arf-binding protein Lsb5p and the P4-ATPases, Drs2p and
           Dnf1p. The third SH3 domain of Sla1p can bind ubiquitin
           while retaining the ability to bind proline-rich
           ligands; monoubiquitination of target proteins signals
           internalization and sorting through the endocytic
           pathway. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 57

 Score = 30.4 bits (69), Expect = 0.21
 Identities = 16/57 (28%), Positives = 28/57 (49%), Gaps = 5/57 (8%)

Query: 471 TAVALYDYQASADDEISFDPDD--IITNIEMIDEGWW--RGYCHGQYGLFPANYVSL 523
               LYD+ A +DDE++    D   I + +   + WW       G+ G+ PA+Y+ +
Sbjct: 2   RGKVLYDFDAQSDDELTVKEGDVVYILDDKKSKD-WWMVENVSTGKEGVVPASYIEI 57


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 34.2 bits (79), Expect = 0.22
 Identities = 26/115 (22%), Positives = 53/115 (46%), Gaps = 6/115 (5%)

Query: 393 EKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLK---AEEQARLLEQE 449
           E  +  +   Q+ +   K+Q E + +E+ ++Q L   +QQE   L+   AE + +  E E
Sbjct: 138 EDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELE 197

Query: 450 RLMEELRQQGTNEDTEEDLGYTAVALYDYQASADDEISFDPDDIITNIEMIDEGW 504
             +E+L+++      E       +     QA+   E+S +   I+ + ++   GW
Sbjct: 198 AQLEQLQEKAAETSQERKQKRKEITD---QAAKRLELSEEETRILIDQQLRKAGW 249



 Score = 29.5 bits (67), Expect = 5.4
 Identities = 21/119 (17%), Positives = 58/119 (48%), Gaps = 12/119 (10%)

Query: 360 PIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKE 419
           P    N     +++   ++++++ +  E+ +     + + +E         E EEK+++ 
Sbjct: 137 PEDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQEL 196

Query: 420 QEENQKLLLK-----KQQEEDRLKAEEQA----RLLEQE-RLM--EELRQQGTNEDTEE 466
           + + ++L  K     +++++ R +  +QA     L E+E R++  ++LR+ G   D++ 
Sbjct: 197 EAQLEQLQEKAAETSQERKQKRKEITDQAAKRLELSEEETRILIDQQLRKAGWEADSKT 255


>gnl|CDD|212942 cd12009, SH3_Blk, Src homology 3 domain of Blk Protein Tyrosine
           Kinase.  Blk is a member of the Src subfamily of
           proteins, which are cytoplasmic (or non-receptor) PTKs.
           It is expressed specifically in B-cells and is involved
           in pre-BCR (B-cell receptor) signaling. Src kinases
           contain an N-terminal SH4 domain with a myristoylation
           site, followed by SH3 and SH2 domains, a tyr kinase
           domain, and a regulatory C-terminal region containing a
           conserved tyr. They are activated by autophosphorylation
           at the tyr kinase domain, but are negatively regulated
           by phosphorylation at the C-terminal tyr by Csk
           (C-terminal Src Kinase). The SH3 domain of Src kinases
           contributes to substrate recruitment by binding adaptor
           proteins/substrates, and regulation of kinase activity
           through an intramolecular interaction. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 54

 Score = 30.2 bits (68), Expect = 0.23
 Identities = 13/54 (24%), Positives = 26/54 (48%), Gaps = 3/54 (5%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWW--RGYCHGQYGLFPANYVS 522
             +A YD+  S + ++     + +  ++  D  WW  +    G+ G  P+NYV+
Sbjct: 1   CVIAQYDFVPSNERDLQLKKGEKLQVLKS-DGEWWLAKSLTTGKEGYIPSNYVA 53


>gnl|CDD|212844 cd11911, SH3_CIP4-like, Src Homology 3 domain of Cdc42-Interacting
           Protein 4.  This subfamily is composed of
           Cdc42-Interacting Protein 4 (CIP4), Formin Binding
           Protein 17 (FBP17), FormiN Binding Protein 1-Like
           (FNBP1L), and similar proteins. CIP4 and FNBP1L are
           Cdc42 effectors that bind Wiskott-Aldrich syndrome
           protein (WASP) and function in endocytosis. CIP4 and
           FBP17 bind to the Fas ligand and may be implicated in
           the inflammatory response. CIP4 may also play a role in
           phagocytosis. It functions downstream of Cdc42 in
           PDGF-dependent actin reorganization and cell migration,
           and also regulates the activity of PDGFRbeta. It uses
           Src as a substrate in regulating the invasiveness of
           breast tumor cells. CIP4 may also play a role in the
           pathogenesis of Huntington's disease. Members of this
           subfamily typically contain an N-terminal F-BAR
           (FES-CIP4 Homology and Bin/Amphiphysin/Rvs) domain, a
           central Cdc42-binding HR1 domain, and a C-terminal SH3
           domain. The SH3 domain of CIP4 associates with Gapex-5,
           a Rab31 GEF. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 30.3 bits (69), Expect = 0.23
 Identities = 15/55 (27%), Positives = 29/55 (52%), Gaps = 2/55 (3%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMID-EGWWR-GYCHGQYGLFPANYVSL 523
           T  ALYD+  +++  +S +  +I+  +E    +GW R    +G  G  P +Y+ +
Sbjct: 1   TCTALYDFDGTSEGTLSMEEGEILLVLEEDGGDGWTRVRKNNGDEGYVPTSYIEV 55


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 32.2 bits (74), Expect = 0.23
 Identities = 18/78 (23%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 369 EEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKE---RKEQEENQK 425
           E +E++    +E+ +   + K + EK+Q   K E E + K   +E+ E    K ++E ++
Sbjct: 1   EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60

Query: 426 LLLKKQQEEDRLKAEEQA 443
           L  +  + E + +A++  
Sbjct: 61  LEAELARRELKAEAKKML 78



 Score = 28.4 bits (64), Expect = 3.7
 Identities = 16/78 (20%), Positives = 41/78 (52%)

Query: 391 ELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQER 450
           E E+E+    +E +    K++ + ++++E+++++   L K   E++ + E +    E E 
Sbjct: 1   EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60

Query: 451 LMEELRQQGTNEDTEEDL 468
           L  EL ++    + ++ L
Sbjct: 61  LEAELARRELKAEAKKML 78


>gnl|CDD|212704 cd11770, SH3_Nephrocystin, Src Homology 3 domain of Nephrocystin
           (or Nephrocystin-1).  Nephrocystin contains an SH3
           domain involved in signaling pathways that regulate cell
           adhesion and cytoskeletal organization. It is a protein
           that in humans is associated with juvenile
           nephronophthisis, an inherited kidney disease
           characterized by renal fibrosis that lead to chronic
           renal failure in children. It is localized in cell-cell
           junctions in renal duct cells, and is known to interact
           with Ack1, an activated Cdc42-associated kinase. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 54

 Score = 30.4 bits (69), Expect = 0.23
 Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 1/50 (2%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGY-CHGQYGLFPANYV 521
            AL D+QA  + ++SF   +++  I    +GWW      G  GL P  Y+
Sbjct: 3   EALSDFQAEQEGDLSFKKGEVLRIISKRADGWWLAENSKGNRGLVPKTYL 52


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 33.7 bits (78), Expect = 0.24
 Identities = 24/106 (22%), Positives = 49/106 (46%), Gaps = 10/106 (9%)

Query: 359 IPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENI---KKKKEQEEK 415
           I I  S     + E +RMV++  +      +++  KE+I  K E E      +K  +EE 
Sbjct: 493 ITITAS-SGLSDDEIERMVKDAEEYAA---EDKKRKERIEAKNEAEEYVYSLEKSLKEEG 548

Query: 416 ERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTN 461
           ++  + + +K+    +  ++ L+ E++    E E   EEL++    
Sbjct: 549 DKLPEADKKKVEEAIEWLKEELEGEDKE---EIEAKTEELQKVVQP 591


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 33.8 bits (78), Expect = 0.24
 Identities = 38/178 (21%), Positives = 82/178 (46%), Gaps = 18/178 (10%)

Query: 267 SNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDL--EQAQKLEERRLSELKIKEEEI 324
             L  K+E+  ++  EE     + EKE  + K  ++      + LEE     L+   +++
Sbjct: 288 EELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEI----LEKNLQKL 343

Query: 325 EKKL-NGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKR 383
           E+KL +  + + L   T+     ++SI     +    I+  N++ +  +K+   + + K+
Sbjct: 344 EEKLKDPSTSIELESITD----LIESINDIIDAINELIREHNEKIDNLKKE---KNKAKK 396

Query: 384 K-ELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAE 440
           K  L    EL+++    ++E++ ++K     EKE K+ E   K     ++E   L+ +
Sbjct: 397 KLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIK---ALEKEIKELEKQ 451



 Score = 31.9 bits (73), Expect = 0.94
 Identities = 25/103 (24%), Positives = 50/103 (48%), Gaps = 10/103 (9%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKE-QEENQKL 426
           EE  E +  ++E         K+EL+K + +I++ +  I+KK+E+ EK + +  ++  K 
Sbjct: 88  EENIEIEAQIEE--------LKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKK 139

Query: 427 LLKKQQEEDRLKAE-EQARLLEQERLMEELRQQGTNEDTEEDL 468
           L KK         +    +   +E+L++EL+    N  +   L
Sbjct: 140 LAKKYDSNLSEALKGLNYKKNFKEKLLKELKSVILNASSLLSL 182


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 33.6 bits (77), Expect = 0.24
 Identities = 17/103 (16%), Positives = 48/103 (46%), Gaps = 3/103 (2%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           K++E  + +  +  + K ++    E++++ ++ +++E   ++  +    ++ KE E  +K
Sbjct: 62  KKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEK 121

Query: 426 LLLKKQQEEDRL---KAEEQARLLEQERLMEELRQQGTNEDTE 465
            L +  +E  +     AE   +  E  +  E L+ +   +  E
Sbjct: 122 ELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEKKLEE 164



 Score = 33.3 bits (76), Expect = 0.32
 Identities = 43/205 (20%), Positives = 78/205 (38%), Gaps = 7/205 (3%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEE-KEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEK 326
           N   +FE  A +   + R   ++E  +  +       EQ     E    ++K  E  ++ 
Sbjct: 42  NTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNE----QIKQFELALQD 97

Query: 327 KLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNK-EEEEKEKQRMVQEEIKRKE 385
           ++     + L    +   + +        S  +  Q  N  E  EK+++    EE  + E
Sbjct: 98  EIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFE 157

Query: 386 LERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARL 445
            E+K E   E  R K E++  +   + E KE +EQ E++  +LKK +    L   +Q + 
Sbjct: 158 NEKKLEESLELEREKFEEQLHEANLDLEFKENEEQRESKWAILKKLKRRAEL-GSQQVQG 216

Query: 446 LEQERLMEELRQQGTNEDTEEDLGY 470
              E   E   +     D  E    
Sbjct: 217 EALELPNESFIRSKFPSDINEKADK 241



 Score = 32.9 bits (75), Expect = 0.39
 Identities = 32/174 (18%), Positives = 69/174 (39%), Gaps = 5/174 (2%)

Query: 295 RKLKDQIDLEQA-QKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQ 353
            K +D I  EQA  K  E+      +   E  +K           + +     +  + +Q
Sbjct: 16  PKCEDSIKGEQARYKQIEQEDQSRILNTLEEFEKEANEKRAQYRSAKKKE---LSQLEEQ 72

Query: 354 PTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQE 413
             +     +N   E+ ++ +  +  E  K + LE     + +++ + E++ +   K+ Q+
Sbjct: 73  LINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQK 132

Query: 414 EKERKEQEENQKLLLKKQQEEDRLKAEEQARL-LEQERLMEELRQQGTNEDTEE 466
           + +   +   +K    K +E  + + E++    LE ER   E +    N D E 
Sbjct: 133 QLQNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEF 186



 Score = 29.8 bits (67), Expect = 4.0
 Identities = 22/114 (19%), Positives = 48/114 (42%), Gaps = 7/114 (6%)

Query: 357 DGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKE 416
           D I  + +  ++ E+E Q  +   ++  E E  E+  + +   K+E         Q E++
Sbjct: 20  DSIKGEQARYKQIEQEDQSRILNTLEEFEKEANEKRAQYRSAKKKELS-------QLEEQ 72

Query: 417 RKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDLGY 470
              Q++ QK L  +Q ++  L  +++   LE   L+   + +      +E    
Sbjct: 73  LINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDEL 126


>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
           outer membrane].
          Length = 170

 Score = 32.8 bits (75), Expect = 0.25
 Identities = 19/68 (27%), Positives = 37/68 (54%), Gaps = 3/68 (4%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKEL--ERKEELEKEQIRIKEEQENIKKKKEQEEKERKE 419
           +    ++E K K+  +Q++ K + L    K E E ++ +      N K+++ +++  R+E
Sbjct: 59  ELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEK-LVNAFNKKQQEYEKDLNRRE 117

Query: 420 QEENQKLL 427
            EE QKLL
Sbjct: 118 AEEEQKLL 125



 Score = 30.1 bits (68), Expect = 1.6
 Identities = 23/78 (29%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 370 EKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLK 429
           E E ++  Q+E+++ + E K +  K Q   K E  + + K E E K+ K      K   +
Sbjct: 50  ESEFKK-RQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVNAFNKKQQE 108

Query: 430 KQQEEDRLKAEEQARLLE 447
            +++ +R +AEE+ +LLE
Sbjct: 109 YEKDLNRREAEEEQKLLE 126



 Score = 28.9 bits (65), Expect = 4.7
 Identities = 19/105 (18%), Positives = 46/105 (43%)

Query: 349 SILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKK 408
           S      +  I I N  +  +E  + + V  +++ +  +R++EL+K Q  +K ++  ++ 
Sbjct: 17  SASAAQAAPKIAIVNLGRIFQESPQAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQD 76

Query: 409 KKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLME 453
             + E    + + E +    K     ++ + E +  L  +E   E
Sbjct: 77  DGKMEALSDRAKAEAEIKKEKLVNAFNKKQQEYEKDLNRREAEEE 121


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 33.7 bits (77), Expect = 0.26
 Identities = 23/134 (17%), Positives = 52/134 (38%), Gaps = 8/134 (5%)

Query: 337 SPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQ 396
           +P    P   V+S  K+        QN+  +E+       +      K    KEE++ + 
Sbjct: 10  APRGFAPRTTVESSQKRI------QQNNGDKEDSSTSTSSLSVSAVEKTSNAKEEIQVDF 63

Query: 397 IRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEE--DRLKAEEQARLLEQERLMEE 454
               E      + +++ E E+ + +  +   + K++    D    ++ +   + +   E 
Sbjct: 64  QHNSESAVEEVEAEDEIEVEQNQSDVLKSSSIVKEESISTDMDGIDDDSLDRKLKLEREN 123

Query: 455 LRQQGTNEDTEEDL 468
           LR++   E  EE+ 
Sbjct: 124 LRKREIEELAEENF 137



 Score = 31.4 bits (71), Expect = 1.4
 Identities = 21/54 (38%), Positives = 31/54 (57%), Gaps = 8/54 (14%)

Query: 380 EIKRKELER--KEELEKEQIRIKEEQENIKKKKEQEEKER----KEQEENQKLL 427
           E KR+ELE+  KEE E+E  R  EEQ   +++K   E +R     E E+ ++ L
Sbjct: 252 EEKRRELEKLAKEEAERE--RQAEEQRRREEEKAAMEADRAQAKAEVEKRREKL 303



 Score = 31.4 bits (71), Expect = 1.4
 Identities = 21/64 (32%), Positives = 39/64 (60%), Gaps = 6/64 (9%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKL 426
           EE+ +E +++ +EE +R   ER+ E   EQ R +EE+  ++  + Q + E +++ E  + 
Sbjct: 252 EEKRRELEKLAKEEAER---ERQAE---EQRRREEEKAAMEADRAQAKAEVEKRREKLQN 305

Query: 427 LLKK 430
           LLKK
Sbjct: 306 LLKK 309



 Score = 29.8 bits (67), Expect = 4.6
 Identities = 19/69 (27%), Positives = 38/69 (55%), Gaps = 6/69 (8%)

Query: 390 EELEKEQIRIKEEQENIKK-KKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQ 448
           +E   E   ++E++  ++K  KE+ E+ER+ +E+ ++   K   E DR +A+      E 
Sbjct: 242 DEHSFEDFLLEEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAK-----AEV 296

Query: 449 ERLMEELRQ 457
           E+  E+L+ 
Sbjct: 297 EKRREKLQN 305



 Score = 29.5 bits (66), Expect = 5.8
 Identities = 18/49 (36%), Positives = 27/49 (55%), Gaps = 5/49 (10%)

Query: 414 EKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNE 462
           E++R+E E+     L K++ E   +AEEQ R  E++  ME  R Q   E
Sbjct: 252 EEKRRELEK-----LAKEEAERERQAEEQRRREEEKAAMEADRAQAKAE 295


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 33.9 bits (78), Expect = 0.26
 Identities = 28/134 (20%), Positives = 60/134 (44%), Gaps = 19/134 (14%)

Query: 350 ILKQPTSDGIPIQNSNKEEEE-------KEKQRMVQEEIKRKELERKEELEKEQIRIKEE 402
           I+ +  SDGI   + N   +E        E++R+    I+R+E+  + E  +    I EE
Sbjct: 786 IITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELEL---INEE 842

Query: 403 QENIKKKKEQEEKERKEQEENQ-------KLLLKKQQEEDRLKAEEQARLLEQERL--ME 453
           Q+N+K++ + E  E +E EE            ++ + +  +L     A+ +  + +   +
Sbjct: 843 QKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSD 902

Query: 454 ELRQQGTNEDTEED 467
           E++Q       ++ 
Sbjct: 903 EIKQDEKTTKKKKK 916



 Score = 32.3 bits (74), Expect = 0.71
 Identities = 36/175 (20%), Positives = 67/175 (38%), Gaps = 26/175 (14%)

Query: 245 ANSTNASKENIKPKPNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLE 304
            N+ NA K  I  +     +  S++  + + L  Q  E   +R +  + +R+       E
Sbjct: 776 FNNPNAEKILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEE 835

Query: 305 QAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNS 364
                EE++  + +IK E  E +                         Q  +     +N 
Sbjct: 836 LELINEEQKNLKQEIKLELSEIQEA-------------------EEEIQNIN-----ENK 871

Query: 365 NKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKE 419
           N+  E K   + + + I  K++  K  L      IK++++  KKKK+  EK  +E
Sbjct: 872 NEFVEFKNDPKKLNKLIIAKDVLIK--LVISSDEIKQDEKTTKKKKKDLEKTDEE 924



 Score = 30.4 bits (69), Expect = 2.6
 Identities = 18/105 (17%), Positives = 45/105 (42%), Gaps = 2/105 (1%)

Query: 363 NSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEE 422
           N N E+     + ++ + I   ++  + +   +QI I+ E+E +K  + Q E+     EE
Sbjct: 778 NPNAEKILIITE-ILSDGINNSDINDRPQELIDQI-IESEEERLKALRIQREEMLMRPEE 835

Query: 423 NQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEED 467
            + +  +++  +  +K E       +E +      +    + + D
Sbjct: 836 LELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKND 880


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 32.7 bits (74), Expect = 0.28
 Identities = 26/125 (20%), Positives = 64/125 (51%), Gaps = 8/125 (6%)

Query: 342 TPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKE 401
           T PV    I +     G+P+    +++    K+  +++  K+  + R+E++EK +   +E
Sbjct: 80  TTPVASHLINRAAYDTGVPLAIRIRDQLRSVKKDDIKK--KKSLIIRQEQIEKARQEREE 137

Query: 402 EQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTN 461
            +E ++ ++ +E+ + +E +E Q      ++E +    EEQ+   E E + ++  +  ++
Sbjct: 138 LEERMEWERREEKIDEREDQEEQ------EREREEQTIEEQSDDSEHEIIEQDESETESD 191

Query: 462 EDTEE 466
           +D  E
Sbjct: 192 DDKTE 196



 Score = 28.5 bits (63), Expect = 7.6
 Identities = 17/84 (20%), Positives = 46/84 (54%)

Query: 384 KELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQA 443
           + +++ +  +K+ + I++EQ    +++ +E +ER E E  ++ + +++ +E++ +  E+ 
Sbjct: 108 RSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQ 167

Query: 444 RLLEQERLMEELRQQGTNEDTEED 467
            + EQ    E    +    +TE D
Sbjct: 168 TIEEQSDDSEHEIIEQDESETESD 191


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 33.1 bits (76), Expect = 0.28
 Identities = 24/74 (32%), Positives = 40/74 (54%), Gaps = 5/74 (6%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
            EE+E+EK ++  +   +K L+ K E +++  R KEE+        + E++  E E  +K
Sbjct: 42  DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKG-----LRELEEDTPEDELAEK 96

Query: 426 LLLKKQQEEDRLKA 439
           L L+K QEE  L  
Sbjct: 97  LRLRKLQEESDLNN 110



 Score = 29.6 bits (67), Expect = 3.7
 Identities = 15/74 (20%), Positives = 37/74 (50%), Gaps = 1/74 (1%)

Query: 386 LERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLK-KQQEEDRLKAEEQAR 444
           ++ K + E E   +K+  +  + ++++EEK +   +   K  LK K +E+++ K E++ +
Sbjct: 20  VKDKWDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEK 79

Query: 445 LLEQERLMEELRQQ 458
            L +        + 
Sbjct: 80  GLRELEEDTPEDEL 93


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 32.2 bits (74), Expect = 0.30
 Identities = 20/83 (24%), Positives = 48/83 (57%), Gaps = 9/83 (10%)

Query: 375 RMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKE---QEEKERKEQE------ENQK 425
           +  Q++++++  +R+ ELEK +  +++ +E ++K      +  +E+KE+E      E Q+
Sbjct: 17  KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQR 76

Query: 426 LLLKKQQEEDRLKAEEQARLLEQ 448
              K QQ+  + + EE  ++L++
Sbjct: 77  KQQKLQQDLQKRQQEELQKILDK 99


>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0).  Part of the CF(0)
           (base unit) of the ATP synthase. The base unit is
           thought to translocate protons through membrane (inner
           membrane in mitochondria, thylakoid membrane in plants,
           cytoplasmic membrane in bacteria). The B subunits are
           thought to interact with the stalk of the CF(1)
           subunits. This domain should not be confused with the ab
           CF(1) proteins (in the head of the ATP synthase) which
           are found in pfam00006.
          Length = 132

 Score = 31.8 bits (73), Expect = 0.30
 Identities = 20/81 (24%), Positives = 40/81 (49%), Gaps = 6/81 (7%)

Query: 379 EEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLK 438
           EE  ++      E E++  + + E   I    ++E ++ KE+   +      Q++ +RL 
Sbjct: 43  EERLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQKLKEEILAE-----AQKDAERLL 97

Query: 439 AEEQARL-LEQERLMEELRQQ 458
              +A +  E+E+ + ELRQQ
Sbjct: 98  ESARAEIEQEKEQALAELRQQ 118


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 33.1 bits (76), Expect = 0.31
 Identities = 20/106 (18%), Positives = 44/106 (41%), Gaps = 6/106 (5%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKL 426
            +  ++++    EE K +    KEE++K    I   +E + + + +     +        
Sbjct: 16  RKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGE--- 72

Query: 427 LLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDLGYTA 472
             ++++E     AE +A L   +  + E R +  +  T+ D GY  
Sbjct: 73  --EEEEEAKAEAAEFRAYLRGGDDALAEER-KALSTGTDSDGGYLV 115


>gnl|CDD|233459 TIGR01541, tape_meas_lam_C, phage tail tape measure protein, lambda
           family.  This model represents a relatively
           well-conserved region near the C-terminus of the tape
           measure protein of a lambda and related phage. This
           protein, which controls phage tail length, is typically
           about 1000 residues in length. Both low-complexity
           sequence and insertion/deletion events appear common in
           this family. Mutational studies suggest a ruler or
           template role in the determination of phage tail length.
           Similar behavior is attributed to proteins from
           distantly related or unrelated families in other phage
           [Mobile and extrachromosomal element functions, Prophage
           functions].
          Length = 332

 Score = 32.9 bits (75), Expect = 0.33
 Identities = 30/141 (21%), Positives = 53/141 (37%), Gaps = 17/141 (12%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEE--------------QENI 406
           +Q+ + E     K   + EE +RK LE  ++L +    I+ +              +E +
Sbjct: 26  LQSRSDEIIALIKLEKLLEEAERKALEALKKLAEATASIRAQNKRQLDRFGLGDKQRERL 85

Query: 407 KKKKEQEEKERKEQEENQKLLLKKQQ-EEDRLKAEEQARLLEQERLMEELRQQGTNEDTE 465
             + + +   RK+Q +  K +  K     D  K +  A        + EL      ED  
Sbjct: 86  DARLQIDRTFRKQQRDLNKAMTAKGLAGSDLYKEQLAAIKAALNEALAELHAYYAAEDAL 145

Query: 466 E-DLGYTAV-ALYDYQASADD 484
           + D    A   L DY  +A +
Sbjct: 146 QGDWLAGARSGLADYGETATN 166



 Score = 29.4 bits (66), Expect = 5.3
 Identities = 14/75 (18%), Positives = 36/75 (48%)

Query: 384 KELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQA 443
           K L+ + +     I++++  E  ++K  +  K+  E   + +   K+Q +   L  +++ 
Sbjct: 24  KSLQSRSDEIIALIKLEKLLEEAERKALEALKKLAEATASIRAQNKRQLDRFGLGDKQRE 83

Query: 444 RLLEQERLMEELRQQ 458
           RL  + ++    R+Q
Sbjct: 84  RLDARLQIDRTFRKQ 98


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 33.3 bits (76), Expect = 0.33
 Identities = 37/186 (19%), Positives = 76/186 (40%), Gaps = 18/186 (9%)

Query: 256 KPKPNIGHVKPSNLRAKFENLAKQTEEESRKR---SEEEKEKRKLKDQIDLEQA------ 306
            PK  +          K ++ +K+T  E  +     +    KR    + ++  +      
Sbjct: 167 PPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKT 226

Query: 307 -QKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVK-SILKQPTSDGIPIQNS 364
            +K E++  SE  +KEE  E+   G  DV L   +  P    +     +P   G    + 
Sbjct: 227 KEKKEKKEASESTVKEESEEES--GKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSE 284

Query: 365 -----NKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKE 419
                 ++E+ K  ++M+++E + +E+E   E   E+   +E +     KKE+E++E   
Sbjct: 285 EETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTV 344

Query: 420 QEENQK 425
             +  +
Sbjct: 345 SPDGGR 350



 Score = 28.6 bits (64), Expect = 9.6
 Identities = 22/84 (26%), Positives = 37/84 (44%), Gaps = 6/84 (7%)

Query: 360 PIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKE 419
                 +E EE+  +R V  E +  E    +E E E    K   E    ++E EEKE+++
Sbjct: 236 SESTVKEESEEESGKRDVILEDESAEPTGLDEDEDED-EPKPSGERSDSEEETEEKEKEK 294

Query: 420 QEENQKLLLKKQQEEDRLKAEEQA 443
           ++      LKK  E++    E + 
Sbjct: 295 RKR-----LKKMMEDEDEDEEMEI 313


>gnl|CDD|221514 pfam12297, EVC2_like, Ellis van Creveld protein 2 like protein.
           This family of proteins is found in eukaryotes. Proteins
           in this family are typically between 571 and 1310 amino
           acids in length. There are two conserved sequence
           motifs: LPA and ELH. EVC2 is implicated in Ellis van
           Creveld chondrodysplastic dwarfism in humans. Mutations
           in this protein can give rise to this congenital
           condition. LIMBIN is a protein which shares around 80%
           sequence homology with EVC2 and it is implicated in a
           similar condition in bovine chondrodysplastic dwarfism.
          Length = 429

 Score = 33.3 bits (76), Expect = 0.33
 Identities = 23/109 (21%), Positives = 44/109 (40%), Gaps = 19/109 (17%)

Query: 366 KEEEEKEKQRMVQEEIKRKE-------LERKEELEKEQIR----IKEEQENIKKKKEQEE 414
           +E  E E+   + +    +E       L+    LE+E ++    +++E++  K  ++   
Sbjct: 240 REMAEMEQAEELLKRAPEREAVECSSLLDTLHGLEQEHLQRSLLLQQEEDFAKAHRQLAV 299

Query: 415 KERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQ--------ERLMEEL 455
            +R E        LK    +  LK EE   LL++        E LM+  
Sbjct: 300 FQRVELHSIFFTQLKSATFKGELKPEEAKSLLQEYSKIQETIEELMDFF 348


>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
           Members of this family of small ribosomal proteins
           possess one of three conserved blocks of sequence found
           in proteins that stimulate the dissociation of guanine
           nucleotides from G-proteins, leaving open the
           possibility that MRP-S27 might be a functional partner
           of GTP-binding ribosomal proteins.
          Length = 417

 Score = 33.2 bits (76), Expect = 0.33
 Identities = 25/136 (18%), Positives = 49/136 (36%), Gaps = 9/136 (6%)

Query: 303 LEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQ 362
           L+  Q + E+  SEL   E  ++K          S   E   +      KQ     +   
Sbjct: 282 LDLLQSILEKLSSELD--EASLKKLQEAIDKSESSELDEDLLLEEVEKSKQKFEPLLAQY 339

Query: 363 -------NSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEK 415
                   S  +   K + R +    +R     KE+LE  + R +      +K  ++++K
Sbjct: 340 GESFQEWRSKLQALAKVESRSLLSVEERLPTIEKEDLELYEQRQQLWFFENRKLWQRKKK 399

Query: 416 ERKEQEENQKLLLKKQ 431
            R++ +E  +     +
Sbjct: 400 LREQADEEYQQRHATR 415



 Score = 30.5 bits (69), Expect = 2.5
 Identities = 24/95 (25%), Positives = 42/95 (44%), Gaps = 13/95 (13%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEEL-EKEQIRIKEE---QENIKKKK---EQEEKERKEQ 420
            EE EK +   E +  +  E  +E   K Q   K E     +++++    E+E+ E  EQ
Sbjct: 322 LEEVEKSKQKFEPLLAQYGESFQEWRSKLQALAKVESRSLLSVEERLPTIEKEDLELYEQ 381

Query: 421 E------ENQKLLLKKQQEEDRLKAEEQARLLEQE 449
                  EN+KL  +K++  ++   E Q R   + 
Sbjct: 382 RQQLWFFENRKLWQRKKKLREQADEEYQQRHATRA 416


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 33.1 bits (76), Expect = 0.35
 Identities = 22/88 (25%), Positives = 39/88 (44%), Gaps = 12/88 (13%)

Query: 371 KEKQR-------MVQEEIKRKELERKEELEKEQIR-----IKEEQENIKKKKEQEEKERK 418
           +E QR       +V E I   +  R+ + E + +R     + +E   +K  KE   +   
Sbjct: 19  RESQRRRFASVELVDEVIALDKEWRQRQFELDSLRKEFNKLNKEVAKLKIAKEDATELIA 78

Query: 419 EQEENQKLLLKKQQEEDRLKAEEQARLL 446
           E +E +K + +K+ E    KA   A+L 
Sbjct: 79  ETKELKKEITEKEAEVQEAKAALDAKLK 106


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 32.7 bits (75), Expect = 0.36
 Identities = 21/63 (33%), Positives = 40/63 (63%), Gaps = 5/63 (7%)

Query: 367 EEEEKEKQRMVQEEI-KRKELERKE--ELEKEQIRIKEEQENIKKKKEQEEKERKEQEEN 423
           EEEE  + R  Q +  +R+  E  E   LE+ + R +EE+E  ++KK+ +E++++E+E  
Sbjct: 150 EEEELAELRQQQRQFEQRRNAELAETQRLEEAERRRREEKE--RRKKQDKERKQREKETA 207

Query: 424 QKL 426
           +K+
Sbjct: 208 EKI 210



 Score = 31.2 bits (71), Expect = 1.3
 Identities = 15/56 (26%), Positives = 31/56 (55%), Gaps = 6/56 (10%)

Query: 401 EEQENIKKKKEQEEKERK-EQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEEL 455
           EE   +++++ Q E+ R  E  E Q+L     +E +R + EE+ R  +Q++  ++ 
Sbjct: 152 EELAELRQQQRQFEQRRNAELAETQRL-----EEAERRRREEKERRKKQDKERKQR 202



 Score = 28.5 bits (64), Expect = 9.4
 Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 8/50 (16%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEK 415
           +  E  E QR+ + E +R+E       EKE+ R K+++E  +++KE  EK
Sbjct: 168 RNAELAETQRLEEAERRRRE-------EKER-RKKQDKERKQREKETAEK 209


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 33.2 bits (76), Expect = 0.36
 Identities = 29/192 (15%), Positives = 76/192 (39%), Gaps = 13/192 (6%)

Query: 280 TEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPS 339
            +E+  +  E+  EKR   D  D ++ ++ + +R+ E  +K +    KLN  +   +S  
Sbjct: 246 PDEKDSEILEDTLEKR-SDDLHDYDEDEEYKFKRVREYDMKVKSKATKLNELALFFVSDE 304

Query: 340 TET---PPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMV-QEEIKRKELERKEELEKE 395
                  P+  +  L++        +  N       ++    Q  +K +    KE   ++
Sbjct: 305 NGVVYYKPLRSRVELRR--------RRVNDVIRPLVREHNNDQLNVKLRNPSTKESKMRD 356

Query: 396 QIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEEL 455
           + R + +  + ++  E E++E +++ +  +    +  EE+  ++ E         +  + 
Sbjct: 357 KRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDS 416

Query: 456 RQQGTNEDTEED 467
             +   E   + 
Sbjct: 417 ESKADKESASDS 428



 Score = 32.8 bits (75), Expect = 0.51
 Identities = 17/121 (14%), Positives = 51/121 (42%), Gaps = 17/121 (14%)

Query: 347 VKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENI 406
           ++ ++++  +D + ++  N   +E + +   +  +   + E  +E E E+    EEQ + 
Sbjct: 328 IRPLVREHNNDQLNVKLRNPSTKESKMRDKRRARLDPIDFEEVDEDEDEE----EEQRSD 383

Query: 407 KKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEE 466
           + ++E+ E    E+E +Q       +    + ++            E    + +  D++ 
Sbjct: 384 EHEEEEGEDS--EEEGSQSREDGSSESSSDVGSDS-----------ESKADKESASDSDS 430

Query: 467 D 467
           +
Sbjct: 431 E 431


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 31.4 bits (72), Expect = 0.37
 Identities = 30/108 (27%), Positives = 58/108 (53%), Gaps = 15/108 (13%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKEL---ERKEELEKEQIRIKEE--------QENIKKK 409
           ++   + EE +      +EE +R+E    +R+EELEK++  ++E         +EN  K+
Sbjct: 3   LEKRREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKR 62

Query: 410 KEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQ 457
           +  E+K  +E++  +    +K++E   LKAE +    E E+L E+L +
Sbjct: 63  RRAEKKAEEEKKLRK----EKEEEIKELKAELEELKAEIEKLEEKLEE 106



 Score = 30.3 bits (69), Expect = 0.87
 Identities = 27/93 (29%), Positives = 57/93 (61%), Gaps = 3/93 (3%)

Query: 369 EEKEKQRMVQEEI--KRKELERKEELEKEQIR-IKEEQENIKKKKEQEEKERKEQEENQK 425
           E++ +   VQ  +  KR+E ER+EEL K++   +++++E +++   + +K  KE E  ++
Sbjct: 4   EKRREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRR 63

Query: 426 LLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
              KK +EE +L+ E++  + E +  +EEL+ +
Sbjct: 64  RAEKKAEEEKKLRKEKEEEIKELKAELEELKAE 96


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 33.1 bits (75), Expect = 0.38
 Identities = 20/62 (32%), Positives = 38/62 (61%), Gaps = 2/62 (3%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
           +    EEEE++K +M+    K+K+L +K  ++    + +E+ EN+KKKK+Q  K++K   
Sbjct: 532 KRKVDEEEEEKKLKMIMMSNKQKKLYKK--MKYSNAKKEEQAENLKKKKKQIAKQKKLDS 589

Query: 422 EN 423
           + 
Sbjct: 590 KK 591



 Score = 31.6 bits (71), Expect = 1.2
 Identities = 19/66 (28%), Positives = 33/66 (50%), Gaps = 2/66 (3%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQ--IRIKEEQENIKKKKEQEEKERKEQEENQ 424
           E+   E     +++   +EL+ ++ELE E   I+  E  E  K   + + K+RK  EE +
Sbjct: 481 EDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEE 540

Query: 425 KLLLKK 430
           +  LK 
Sbjct: 541 EKKLKM 546



 Score = 28.9 bits (64), Expect = 7.3
 Identities = 20/75 (26%), Positives = 40/75 (53%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKL 426
           E  E +K     +  KRK  E +EE + + I +  +Q+ + KK +    +++EQ EN K 
Sbjct: 517 ETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKK 576

Query: 427 LLKKQQEEDRLKAEE 441
             K+  ++ +L +++
Sbjct: 577 KKKQIAKQKKLDSKK 591


>gnl|CDD|212870 cd11937, SH3_UBASH3A, Src homology 3 domain of Ubiquitin-associated
           and SH3 domain-containing protein A.  UBASH3A is also
           called Cbl-Interacting Protein 4 (CLIP4), T cell
           Ubiquitin LigAnd (TULA), or T cell receptor Signaling
           (STS)-2. It is only found in lymphoid cells and exhibits
           weak phosphatase activity. UBASH3A facilitates T
           cell-induced apoptosis through interaction with the
           apoptosis-inducing factor AIF. It is involved in
           regulating the level of phosphorylation of the
           zeta-associated protein (ZAP)-70 tyrosine kinase. TULA
           proteins contain an N-terminal UBA domain, a central SH3
           domain, and a C-terminal histidine phosphatase domain.
           They bind c-Cbl through the SH3 domain and to ubiquitin
           via UBA. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 60

 Score = 30.0 bits (67), Expect = 0.38
 Identities = 20/56 (35%), Positives = 23/56 (41%), Gaps = 6/56 (10%)

Query: 471 TAVALYDYQASADDEISFDPDDII----TNIEMIDEGWWRGYCH--GQYGLFPANY 520
           T  AL+ Y+    DE+   P D I    T      EGW  G  H  G  G  P NY
Sbjct: 2   TLRALFQYKPQNIDELMLSPGDYIFVDPTQQSEASEGWVIGISHRTGCRGFLPENY 57


>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 406

 Score = 33.0 bits (76), Expect = 0.38
 Identities = 20/68 (29%), Positives = 31/68 (45%), Gaps = 4/68 (5%)

Query: 395 EQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLK----KQQEEDRLKAEEQARLLEQER 450
           E++   +E     +  E +E   K ++   KLL K       EE RL+AE      E +R
Sbjct: 1   EELDALDEILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDR 60

Query: 451 LMEELRQQ 458
           L EE+ + 
Sbjct: 61  LREEIERL 68


>gnl|CDD|212693 cd11759, SH3_CRK_C, C-terminal Src Homology 3 domain of Ct10
           Regulator of Kinase adaptor proteins.  CRK adaptor
           proteins consists of SH2 and SH3 domains, which bind
           tyrosine-phosphorylated peptides and proline-rich
           motifs, respectively. They function downstream of
           protein tyrosine kinases in many signaling pathways
           started by various extracellular signals, including
           growth and differentiation factors. Cellular CRK (c-CRK)
           contains a single SH2 domain, followed by N-terminal and
           C-terminal SH3 domains. It is involved in the regulation
           of many cellular processes including cell growth,
           motility, adhesion, and apoptosis. CRK has been
           implicated in the malignancy of various human cancers.
           The C-terminal SH3 domain of CRK has not been shown to
           bind any target protein; it acts as a negative regulator
           of CRK function by stabilizing a structure that inhibits
           the access by target proteins to the N-terminal SH3
           domain. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes by intramolecular interactions,
           changing the subcellular localization of signal pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 57

 Score = 29.8 bits (67), Expect = 0.39
 Identities = 13/48 (27%), Positives = 23/48 (47%), Gaps = 5/48 (10%)

Query: 476 YDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYVSL 523
           YD  A     ++ +  D++   ++   G W G  +G+ G FP  +V L
Sbjct: 15  YDKTA-----LALEVGDLVKVTKINVSGQWEGELNGKVGHFPFTHVEL 57


>gnl|CDD|212937 cd12004, SH3_Lyn, Src homology 3 domain of Lyn Protein Tyrosine
           Kinase.  Lyn is a member of the Src subfamily of
           proteins, which are cytoplasmic (or non-receptor) PTKs.
           Lyn is expressed in B lymphocytes and myeloid cells. It
           exhibits both positive and negative regulatory roles in
           B cell receptor (BCR) signaling. Lyn, as well as Fyn and
           Blk, promotes B cell activation by phosphorylating ITAMs
           (immunoreceptor tyr activation motifs) in CD19 and in Ig
           components of BCR. It negatively regulates signaling by
           its unique ability to phosphorylate ITIMs
           (immunoreceptor tyr inhibition motifs) in cell surface
           receptors like CD22 and CD5. Lyn also plays an important
           role in G-CSF receptor signaling by phosphorylating a
           variety of adaptor molecules. Src kinases contain an
           N-terminal SH4 domain with a myristoylation site,
           followed by SH3 and SH2 domains, a tyr kinase domain,
           and a regulatory C-terminal region containing a
           conserved tyr. They are activated by autophosphorylation
           at the tyr kinase domain, but are negatively regulated
           by phosphorylation at the C-terminal tyr by Csk
           (C-terminal Src Kinase). The SH3 domain of Src kinases
           contributes to substrate recruitment by binding adaptor
           proteins/substrates, and regulation of kinase activity
           through an intramolecular interaction. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 56

 Score = 30.0 bits (67), Expect = 0.39
 Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWW--RGYCHGQYGLFPANYVS 522
           VALY Y    +D++SF   + +  IE   E WW  R     + G  P+NYV+
Sbjct: 3   VALYPYDGIHEDDLSFKKGEKLKVIEEHGE-WWKARSLTTKKEGFIPSNYVA 53


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 33.4 bits (76), Expect = 0.40
 Identities = 34/201 (16%), Positives = 77/201 (38%), Gaps = 15/201 (7%)

Query: 273  FENLAKQTEEESRK--RSEEEKEKRKLKDQIDLEQAQKLE-ERRLSELKIKEEEIEKKLN 329
            FE   ++ EE +    +S+EE E+ ++ +   ++   K++ +   +  +  EE  +K + 
Sbjct: 4067 FEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEENTDKGIV 4126

Query: 330  GHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQE-EIKRKELER 388
            G ++               +       + +    S  +E   E  R  Q      +E ++
Sbjct: 4127 GENE-----ELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSEADRQYQSLGDHLREWQQ 4181

Query: 389  KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQ 448
               +  E   + E Q       E    +  E+E+ Q L      E+D++K+ ++     Q
Sbjct: 4182 ANRI-HEWEDLTESQSQAFDDSEFMHVKEDEEEDLQAL---GNAEKDQIKSIDRDESANQ 4237

Query: 449  ERLMEELRQQGTNEDTEEDLG 469
                + +      ED  +++G
Sbjct: 4238 NP--DSMNSTNIAEDEADEVG 4256



 Score = 32.3 bits (73), Expect = 0.87
 Identities = 40/210 (19%), Positives = 77/210 (36%), Gaps = 39/210 (18%)

Query: 272  KFENLAKQTEEESRKRSEEEKEKRKLKDQID---LEQAQKLEERRLSELKI---KEEEIE 325
              E    +TE++S ++S    E   +  + D   LE   + E+    E+      ++EI+
Sbjct: 3911 PNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQ 3970

Query: 326  KKLNGHSDVPLSPSTETPPVPVKSILKQPTSD----------GIPIQNSNKEEEEKEKQR 375
              +  ++  P  P  E   +P    L +   D           +   + NKEE + EK  
Sbjct: 3971 PDIQENNSQPP-PENEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDE 4029

Query: 376  MVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEED 435
             +Q+E     LE    L+++      +Q++     E +EK  ++  E             
Sbjct: 4030 PMQDE---DPLEENNTLDED-----IQQDDFSDLAEDDEKMNEDGFEEN----------- 4070

Query: 436  RLKAEEQARLLEQERLMEELRQQGTNEDTE 465
                +E     E     +E  +QG   + +
Sbjct: 4071 ---VQENEESTEDGVKSDEELEQGEVPEDQ 4097



 Score = 31.5 bits (71), Expect = 1.3
 Identities = 67/336 (19%), Positives = 126/336 (37%), Gaps = 42/336 (12%)

Query: 165  VQKDRQDKSAVGWDHQEKIEKHESQKDYAKGFGGKFGIESDRQDKSAVGWDHVEKVDKHQ 224
            V K+  +K+    D QEK ++ E   D     G    I+ D Q+ ++      E +D  +
Sbjct: 3936 VSKEDDNKALEDKDRQEKEDEEEMSDD----VGIDDEIQPDIQENNSQPPPENEDLDLPE 3991

Query: 225  SQKDDNRQVVTSSKVK----DLIAANSTN--ASKENIKPKPNIGHVKPSNLRAKFENLAK 278
              K D ++   S        D+ AA+     A  E  +P  +   ++ +N   +      
Sbjct: 3992 DLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDD 4051

Query: 279  QTE--EESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPL 336
             ++  E+  K +E+  E+   +++   E   K +E    EL+  E   ++ ++ H     
Sbjct: 4052 FSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDE----ELEQGEVPEDQAIDNH----- 4102

Query: 337  SPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQ 396
                             P  D      S + +EE   + +V E  +  E +         
Sbjct: 4103 -----------------PKMDAKSTFASAEADEENTDKGIVGENEELGEEDGVRGNGTAD 4145

Query: 397  IRIKEEQENIKKKKE-QEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEEL 455
               ++ QE+    KE   E +R+ Q     L  ++ Q+ +R+  E +     Q +  ++ 
Sbjct: 4146 GEFEQVQEDTSTPKEAMSEADRQYQSLGDHL--REWQQANRIH-EWEDLTESQSQAFDDS 4202

Query: 456  RQQGTNEDTEEDLGYTAVALYDYQASADDEISFDPD 491
                  ED EEDL     A  D   S D + S + +
Sbjct: 4203 EFMHVKEDEEEDLQALGNAEKDQIKSIDRDESANQN 4238



 Score = 31.1 bits (70), Expect = 2.0
 Identities = 23/113 (20%), Positives = 46/113 (40%), Gaps = 2/113 (1%)

Query: 357  DGIPIQNSNKEEEE--KEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEE 414
            + IP   SN   E+   E       E ++K  E+     +  +  KE+     + K+++E
Sbjct: 3893 NDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQE 3952

Query: 415  KERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEED 467
            KE +E+  +   +  + Q + +    +     E   L E+L+      D  +D
Sbjct: 3953 KEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKD 4005



 Score = 29.2 bits (65), Expect = 7.1
 Identities = 26/130 (20%), Positives = 56/130 (43%), Gaps = 9/130 (6%)

Query: 320  KEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQE 379
            +++++E+  N       S   E+    ++S +   T D +  +N N + EE+ +   + E
Sbjct: 3833 EDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQD--LDE 3890

Query: 380  EIK------RKELERKEELEKEQIRIKE-EQENIKKKKEQEEKERKEQEENQKLLLKKQQ 432
            E+          L  K   E  +  + E EQ++ ++     E +   +E++ K L  K +
Sbjct: 3891 EVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDR 3950

Query: 433  EEDRLKAEEQ 442
            +E   + E  
Sbjct: 3951 QEKEDEEEMS 3960


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 33.1 bits (76), Expect = 0.40
 Identities = 13/80 (16%), Positives = 33/80 (41%)

Query: 379 EEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLK 438
           + +K     R   ++ +  ++    +  KK   + E+ + ++ E  K + K + ++    
Sbjct: 12  DLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKI 71

Query: 439 AEEQARLLEQERLMEELRQQ 458
            E +  L E +  + EL   
Sbjct: 72  EEIKKELKELKEELTELSAA 91



 Score = 31.2 bits (71), Expect = 1.5
 Identities = 23/109 (21%), Positives = 55/109 (50%), Gaps = 2/109 (1%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERK-EELEKEQ-IRIKEEQENIKKKKEQEEKERKE 419
             +     + + ++++  + +RK+L  + EEL+ ++    K+  +   +KK++ E+ +KE
Sbjct: 18  LKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKE 77

Query: 420 QEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDL 468
            +E ++ L +       L+AE Q +LL    +  E    G +E+   ++
Sbjct: 78  LKELKEELTELSAALKALEAELQDKLLSIPNIPHESVPVGKDEEDNLEV 126



 Score = 30.8 bits (70), Expect = 1.7
 Identities = 26/108 (24%), Positives = 40/108 (37%), Gaps = 15/108 (13%)

Query: 267 SNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEE-----RRLSELKIKE 321
             L  + + L  + EE   KR+E  K+  K K Q   +  +  +E       L+EL    
Sbjct: 33  IALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAAL 92

Query: 322 EEIEKKLNG---------HSDVPLSPSTETPPVPVKSILKQPTSDGIP 360
           + +E +L           H  VP+    E   + VK     P  D  P
Sbjct: 93  KALEAELQDKLLSIPNIPHESVPVG-KDEEDNLEVKRWGTPPVFDFKP 139


>gnl|CDD|212981 cd12048, SH3_DOCK3_B, Src Homology 3 domain of Class B Dedicator of
           Cytokinesis 3.  Dock3, also called modifier of cell
           adhesion (MOCA), and presenilin binding protein (PBP),
           is a class B DOCK and is an atypical guanine nucleotide
           exchange factor (GEF) that lacks the conventional Dbl
           homology (DH) domain. It regulates N-cadherin dependent
           cell-cell adhesion, cell polarity, and neuronal
           morphology. It promotes axonal growth by stimulating
           actin polymerization and microtubule assembly. All DOCKs
           contain two homology domains: the DHR-1 (Dock homology
           region-1), also called CZH1 (CED-5, Dock180, and
           MBC-zizimin homology 1), and DHR-2 (also called CZH2 or
           Docker). The DHR-1 domain binds
           phosphatidylinositol-3,4,5-triphosphate while DHR-2
           contains the catalytic activity for Rac and/or Cdc42.
           Class B DOCKs also contain an SH3 domain at the
           N-terminal region and a PxxP motif at the C-terminus;
           Dock3 is a specific GEFs for Rac. The SH3 domain of
           Dock3 binds to DHR-2 in an autoinhibitory manner;
           binding of the scaffold protein Elmo to the SH3 domain
           of Dock3 exposes the DHR-2 domain and promotes GEF
           activity. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 56

 Score = 29.9 bits (67), Expect = 0.42
 Identities = 12/26 (46%), Positives = 17/26 (65%), Gaps = 3/26 (11%)

Query: 502 EGWWRGYCHGQ---YGLFPANYVSLQ 524
           EGW+RG    +    G+FPANY+ L+
Sbjct: 31  EGWYRGVSIKKPNVKGIFPANYIHLK 56


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 32.0 bits (72), Expect = 0.43
 Identities = 31/100 (31%), Positives = 63/100 (63%), Gaps = 3/100 (3%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEE---QENIKKKKEQEEKERK 418
           Q   +  E++E+ R+ +EE+KR+  E +   E+E  R +EE   ++  K K++ EE+E++
Sbjct: 41  QEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQ 100

Query: 419 EQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
           EQEE +++  +K++ E R + E +   LE+E+  +++ Q+
Sbjct: 101 EQEEQERIQKQKEEAEARAREEAERMRLEREKHFQQIEQE 140


>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 433 to 538 amino acids in length. This domain is
           found associated with pfam00616, pfam00168. This domain
           has two conserved sequence motifs: DLQ and PLSFQNP.
          Length = 489

 Score = 32.8 bits (74), Expect = 0.44
 Identities = 23/81 (28%), Positives = 43/81 (53%), Gaps = 3/81 (3%)

Query: 385 ELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQAR 444
                EE  +E  + ++ ++ I + KE+     +  EE ++ LL ++Q+  +L  E QAR
Sbjct: 351 IEYETEENREEGTQAEKYEQEIARLKERLRVSVRRLEEYERRLLGQEQQMQKLLQEYQAR 410

Query: 445 LLEQERLMEELRQQGTNEDTE 465
           L + E   E LR+Q   +D++
Sbjct: 411 LEDSE---ERLRRQQEEKDSQ 428


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 32.7 bits (75), Expect = 0.48
 Identities = 19/93 (20%), Positives = 42/93 (45%), Gaps = 10/93 (10%)

Query: 379 EEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLK 438
           E+++ K    K +L++ +  I   +     K    + + K + +N+KL  + ++++   K
Sbjct: 280 EKLQEKIKALKYQLKRLKKMILLFEMISDLK---RKLKSKFERDNEKLDAEVKEKKKEKK 336

Query: 439 AEEQ-----ARLLEQERLMEELRQQGTNEDTEE 466
            EE+      RL E+   +E   Q    E+ + 
Sbjct: 337 KEEKKKKQIERLEERIEKLE--VQATDKEENKT 367



 Score = 30.0 bits (68), Expect = 3.5
 Identities = 17/75 (22%), Positives = 46/75 (61%), Gaps = 6/75 (8%)

Query: 366 KEEEEKEKQRMVQEEI---KRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEE 422
           K + ++ K+ ++  E+    +++L+ K E + E++  + +++  KK+K++EEK++K+ E 
Sbjct: 290 KYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEK--KKEKKKEEKKKKQIER 347

Query: 423 -NQKLLLKKQQEEDR 436
             +++   + Q  D+
Sbjct: 348 LEERIEKLEVQATDK 362


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 32.9 bits (75), Expect = 0.48
 Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 11/78 (14%)

Query: 385 ELER-KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEE--NQKLLLKKQQEEDRLKAEE 441
           EL R +E+LE+EQ      +E  ++ K + E+ERK +EE   Q+  ++  Q   RL A E
Sbjct: 4   ELRRAREKLEREQ------RERKQRAKLKLERERKAKEEAAKQREAIEAAQRSRRLDAIE 57

Query: 442 QARLLEQERLMEELRQQG 459
               ++ ++ M+E  Q G
Sbjct: 58  AQ--IKADQQMQESLQAG 73



 Score = 29.5 bits (66), Expect = 4.6
 Identities = 23/83 (27%), Positives = 44/83 (53%), Gaps = 8/83 (9%)

Query: 383 RKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQ---KLLLKKQQEEDRLKA 439
           R++LER++   K++ ++K E+E  +K KE+  K+R+  E  Q   +L   + Q +   + 
Sbjct: 9   REKLEREQRERKQRAKLKLERE--RKAKEEAAKQREAIEAAQRSRRLDAIEAQIKADQQM 66

Query: 440 EEQ---ARLLEQERLMEELRQQG 459
           +E     R +   R+ E +  QG
Sbjct: 67  QESLQAGRGIVFSRIFEAVSFQG 89


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 31.4 bits (71), Expect = 0.48
 Identities = 29/88 (32%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 363 NSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIK-EEQENIKKKKEQEEKERKEQE 421
           ++ +E+EE ++     EE +RK   RK E E+E +R    ++  IKKK+E EE+ +   E
Sbjct: 31  DAEEEDEEIQEALREAEE-ERKAKHRKMEEEREVMRQGIRDKYGIKKKEEDEEEPQAAAE 89

Query: 422 ENQKLLLKKQQEEDRLKAEEQARLLEQE 449
           E +  L +K++  + L AE      E+E
Sbjct: 90  EEEGRLGRKKKTPEELAAEAGEEDEEEE 117


>gnl|CDD|220182 pfam09328, Phytochelatin_C, Domain of unknown function (DUF1984).
           Members of this family of functionally uncharacterized
           domains are found at the C-terminus of plant
           phytochelatin synthases.
          Length = 264

 Score = 32.4 bits (74), Expect = 0.48
 Identities = 19/63 (30%), Positives = 32/63 (50%), Gaps = 7/63 (11%)

Query: 251 SKENIKPKPNIGHVKPSNLRAKFENLAK------QTEEESRKRSEEEKEKRKLKDQIDLE 304
             EN+K    +  V   +L A F +  K      + EE +   S+EEKE+  LK+++ L+
Sbjct: 33  KSENLKDVQEVLSVVFKSLPANFGDFIKWVAEVRRQEEGNSSLSKEEKERLALKEEV-LQ 91

Query: 305 QAQ 307
           Q +
Sbjct: 92  QVR 94


>gnl|CDD|212822 cd11889, SH3_Cyk3p-like, Src Homology 3 domain of Cytokinesis
           protein 3 and similar proteins.  Cytokinesis protein 3
           (Cyk3 or Cyk3p) is a component of the actomyosin ring
           independent cytokinesis pathway in yeast. It interacts
           with Inn1 and facilitates its recruitment to the bud
           neck, thereby promoting cytokinesis. Cyk3p contains an
           N-terminal SH3 domain and a C-terminal
           transglutaminase-like domain. The Cyk3p SH3 domain binds
           to the C-terminal proline-rich region of Inn1. SH3
           domains bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs;
           they play a role in the regulation of enzymes by
           intramolecular interactions, changing the subcellular
           localization of signal pathway components and mediate
           multiprotein complex assemblies.
          Length = 53

 Score = 29.4 bits (66), Expect = 0.49
 Identities = 15/50 (30%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYC--HGQYGLFPANYV 521
           A+Y +    + ++ F   D+I  + + D  WW G    +G  G+FP+N+V
Sbjct: 4   AVYSWAGETEGDLGFLEGDLIEVLSIGDGSWWSGKLRRNGAEGIFPSNFV 53


>gnl|CDD|213010 cd12077, SH3_Tks5_2, Second Src homology 3 domain of Tyrosine
           kinase substrate with five SH3 domains.  Tks5, also
           called SH3 and PX domain-containing protein 2A
           (SH3PXD2A) or Five SH (FISH), is a scaffolding protein
           and Src substrate that is localized in podosomes, which
           are electron-dense structures found in Src-transformed
           fibroblasts, osteoclasts, macrophages, and some invasive
           cancer cells. It binds and regulates some members of the
           ADAMs family of transmembrane metalloproteases, which
           function as sheddases and mediators of cell and matrix
           interactions. It is required for podosome formation,
           degradation of the extracellular matrix, and cancer cell
           invasion. Tks5 contains an N-terminal Phox homology (PX)
           domain and five SH3 domains. This model characterizes
           the second SH3 domain of Tks5. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 54

 Score = 29.2 bits (65), Expect = 0.53
 Identities = 16/49 (32%), Positives = 24/49 (48%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           V +  Y +   DEI F+    +  I+   EGWW     G+ G  PA+Y+
Sbjct: 4   VTVQPYTSQGKDEIGFEKGVTVEVIQKNLEGWWYIRYLGKEGWAPASYL 52


>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana and other plants. This region is
           often found associated with Agenet domains and may
           contain coiled-coil.
          Length = 190

 Score = 31.7 bits (72), Expect = 0.55
 Identities = 23/84 (27%), Positives = 45/84 (53%), Gaps = 6/84 (7%)

Query: 366 KEEEEKEKQRMVQEEIKRKELE-RKEELEKEQIRIKEEQENIKKKKEQEEKE-RKEQEEN 423
            EE +  ++ + ++EI R+EL+    ELE++ + ++ +   +K+KKE E+KE  + + E 
Sbjct: 109 LEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEIARLKSEA 168

Query: 424 QKLLLKKQQEEDRLKAEEQARLLE 447
            K+    +QE    + E Q     
Sbjct: 169 SKI----KQELVDAELEFQTTAAA 188



 Score = 29.0 bits (65), Expect = 4.6
 Identities = 16/73 (21%), Positives = 33/73 (45%), Gaps = 3/73 (4%)

Query: 371 KEKQRMVQEEIKRKE---LERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL 427
           K+ Q    EE K  E    E++   ++    I E +  I + + Q    ++++E   K +
Sbjct: 102 KDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEI 161

Query: 428 LKKQQEEDRLKAE 440
            + + E  ++K E
Sbjct: 162 ARLKSEASKIKQE 174



 Score = 28.2 bits (63), Expect = 8.3
 Identities = 18/67 (26%), Positives = 39/67 (58%), Gaps = 4/67 (5%)

Query: 392 LEKEQIRIKEEQENIKKKKEQEEKERKEQE----ENQKLLLKKQQEEDRLKAEEQARLLE 447
           L+ +Q +  EE++ ++K+  ++E  R+E +    E ++ +L+ Q++   LK +++A   E
Sbjct: 101 LKDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKE 160

Query: 448 QERLMEE 454
             RL  E
Sbjct: 161 IARLKSE 167


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 32.5 bits (74), Expect = 0.56
 Identities = 19/67 (28%), Positives = 37/67 (55%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL 427
           E  K+K    Q+E K    + KE+  +EQ  ++E  E  K+++EQ     +++EE Q++ 
Sbjct: 116 ENTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMN 175

Query: 428 LKKQQEE 434
            +K ++ 
Sbjct: 176 KRKNKQA 182



 Score = 30.9 bits (70), Expect = 1.3
 Identities = 23/101 (22%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
           Q  NK+  +K K++  +E+ + +E    E+ E+EQ R+  ++E    ++EQ+  +RK ++
Sbjct: 126 QKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKE----EEEQQMNKRKNKQ 181

Query: 422 ENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNE 462
                L         L A+ +   ++ E  +E+ + +  N 
Sbjct: 182 ALLDELETSTLPAAELIAQHKKNSVKLEMQVEKPKPEKPNT 222



 Score = 29.4 bits (66), Expect = 5.1
 Identities = 24/69 (34%), Positives = 36/69 (52%), Gaps = 8/69 (11%)

Query: 404 ENIKKKKEQEEKERKEQEENQKLLLKKQQE--EDRLKAE----EQARLLEQERLMEELRQ 457
           EN KKK E  +KE K+  +  K    ++QE  E+ L+ E    EQ RLL Q+   EE +Q
Sbjct: 116 ENTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQK--EEEEQQ 173

Query: 458 QGTNEDTEE 466
               ++ + 
Sbjct: 174 MNKRKNKQA 182


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 31.1 bits (71), Expect = 0.61
 Identities = 23/73 (31%), Positives = 47/73 (64%), Gaps = 6/73 (8%)

Query: 397 IRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQ------ARLLEQER 450
           I I+E++E  +++KE++E++ ++++E  K L++K++ E RL+  E+      AR  + E 
Sbjct: 12  IDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDED 71

Query: 451 LMEELRQQGTNED 463
             EEL++Q   +D
Sbjct: 72  YDEELKEQERWDD 84



 Score = 29.6 bits (67), Expect = 1.8
 Identities = 21/105 (20%), Positives = 51/105 (48%), Gaps = 1/105 (0%)

Query: 370 EKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLK 429
           +K  + +  EE + ++   KEE E+++ + KE  + + +K+E+E++  + ++   K L +
Sbjct: 6   DKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLAR 65

Query: 430 KQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDLGYTAVA 474
              +ED    E + +    + + + LR++    D +    Y    
Sbjct: 66  YADDED-YDEELKEQERWDDPMAQFLRKKKEKTDKKGKPLYKGPF 109


>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106.  This
           archaebacterial protein family has no known function.
           Members are predicted to be integral membrane proteins.
          Length = 168

 Score = 31.5 bits (72), Expect = 0.64
 Identities = 10/49 (20%), Positives = 23/49 (46%), Gaps = 6/49 (12%)

Query: 368 EEEKEKQRMVQEEIKRKEL------ERKEELEKEQIRIKEEQENIKKKK 410
           + EK ++R  + + + +EL         ++ EK Q  + E+Q+ +    
Sbjct: 41  KMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQKEMMMDM 89



 Score = 28.8 bits (65), Expect = 4.4
 Identities = 10/46 (21%), Positives = 27/46 (58%), Gaps = 2/46 (4%)

Query: 399 IKEEQENIKKKKEQEEKERKE-QEENQKLLLKK-QQEEDRLKAEEQ 442
           I  + E  +K++++ +K  +E ++   KL  KK ++ ++ L  +++
Sbjct: 38  IDRKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQK 83


>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP).  This family consists of
           several PV-1 (PLVAP) proteins which seem to be specific
           to mammals. PV-1 is a novel protein component of the
           endothelial fenestral and stomatal diaphragms. The
           function of this family is unknown.
          Length = 442

 Score = 32.3 bits (73), Expect = 0.65
 Identities = 44/196 (22%), Positives = 82/196 (41%), Gaps = 16/196 (8%)

Query: 276 LAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVP 335
             K+  EE  +   + +E +  + Q+  E  QK++   +   K K E         S +P
Sbjct: 188 AGKRQAEEQLEACGKARELQHQEQQLAEENLQKVQALCIPLDKDKFEADLLNAWRDSIIP 247

Query: 336 LSPST-----ETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELE-RK 389
            S          P  P  + +++   D +P   S+K EE     R   E + R+  + ++
Sbjct: 248 RSLDNLGYNLHHPLGPELASIRR-ACDHLPGIMSSKIEELARGLRADIERVARENADLQR 306

Query: 390 EELEKEQIRIKEEQENIKKKKEQEEKERKEQEE---------NQKLLLKKQQEEDRLKAE 440
           ++LE EQ     ++   K +KE + +E K Q E          +K  L+ +++    + E
Sbjct: 307 QKLEAEQAAQAAQEAKAKAEKEAQAREAKLQAECARQTQLALEEKAALRAERDNLAKELE 366

Query: 441 EQARLLEQERLMEELR 456
            + R  EQ R+   +R
Sbjct: 367 AKKREAEQLRMELAIR 382


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
           contains an extraordinarily long coiled-coil domain in
           its C-terminal half and is ubiquitously expressed. It is
           a novel binding partner of several syntaxin family
           members and is possibly involved in Ca2+-dependent
           exocytosis in neuroendocrine cells. Gamma-taxilin,
           described as leucine zipper protein Factor Inhibiting
           ATF4-mediated Transcription (FIAT), localises to the
           nucleus in osteoblasts and dimerises with ATF4 to form
           inactive dimers, thus inhibiting ATF4-mediated
           transcription.
          Length = 309

 Score = 32.0 bits (73), Expect = 0.67
 Identities = 34/132 (25%), Positives = 60/132 (45%), Gaps = 24/132 (18%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKE---- 416
           +Q  NK  +E+ K R  +EE KRKEL  K +   + I+ + E+ +    K  EE E    
Sbjct: 83  LQKHNKTLKEENKARSDEEEEKRKELTEKFQSTLKDIQDQMEEHSNPNHKLCEENEELRE 142

Query: 417 --------------------RKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELR 456
                               + +  E Q    K +Q +++ + E++    E+E L++E  
Sbjct: 143 KLKELIEQYELREQHFEKLLKTKDLEQQLNDAKLEQAQEQAEQEQEKHKREKEILLKEAA 202

Query: 457 QQGTNEDTEEDL 468
           Q  T ++TE++L
Sbjct: 203 QVSTLKETEKEL 214


>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242). 
          Length = 401

 Score = 32.2 bits (73), Expect = 0.67
 Identities = 29/119 (24%), Positives = 50/119 (42%), Gaps = 13/119 (10%)

Query: 375 RMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEE 434
           +++Q +I RK  E+ E L   +  I+E Q+   + K Q +K   +    + + +K +   
Sbjct: 71  QLLQTKI-RKLTEQDEGLRNIRESIEERQKESDQLKLQNQKLVNQLGHARGVFIKTKGRY 129

Query: 435 DRLKA------EEQARL------LEQERLMEELRQQGTNEDTEEDLGYTAVALYDYQAS 481
            +L+       EE   L      L QE   +    Q  N +  E L Y      +YQA+
Sbjct: 130 QKLEELSRHLKEENQCLQIQLDALVQECNEKIEENQELNRELAETLAYQQELNDEYQAT 188



 Score = 30.3 bits (68), Expect = 3.0
 Identities = 31/126 (24%), Positives = 56/126 (44%), Gaps = 14/126 (11%)

Query: 352 KQPTSDGIPIQNSNKEEEEKEKQRMVQ-EEIKRKELERKEELEKEQIRIKEEQENIKKKK 410
           +Q  SD + +QN      +K   ++     +  K   R ++LE+    +KEE + ++ + 
Sbjct: 97  RQKESDQLKLQN------QKLVNQLGHARGVFIKTKGRYQKLEELSRHLKEENQCLQIQL 150

Query: 411 EQEEKERKEQEENQKLLLKKQQE----EDRLKAEEQARLLEQERLMEELRQQ--GTNEDT 464
           +   +E  E+ E  + L ++  E    +  L  E QA  +EQ   M + RQ   G  E  
Sbjct: 151 DALVQECNEKIEENQELNRELAETLAYQQELNDEYQATFVEQ-HNMLDKRQAYIGKLEAK 209

Query: 465 EEDLGY 470
            +DL  
Sbjct: 210 VQDLMC 215


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 31.6 bits (72), Expect = 0.68
 Identities = 20/71 (28%), Positives = 42/71 (59%)

Query: 379 EEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLK 438
            E K+K+ E  EE+EK +   +E+Q+   KKK+ ++K+ K++++       K +++D  +
Sbjct: 58  TEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKE 117

Query: 439 AEEQARLLEQE 449
           AE++   L + 
Sbjct: 118 AEDKLEDLTKS 128


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 32.3 bits (74), Expect = 0.71
 Identities = 23/88 (26%), Positives = 50/88 (56%), Gaps = 3/88 (3%)

Query: 369 EEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKK-KEQEEKERKEQEENQKLL 427
           E+ E Q +++     K L++  EL K+++ + + Q  I KK +E+ EK ++E    ++  
Sbjct: 174 EKDELQEVLETVNIEKRLKKALELLKKELELLKLQNKITKKVEEKMEKTQREYYLREQ-- 231

Query: 428 LKKQQEEDRLKAEEQARLLEQERLMEEL 455
           LK  ++E  ++ +++  L + +  +EEL
Sbjct: 232 LKAIKKELGIEKDDKDELEKLKEKLEEL 259


>gnl|CDD|220410 pfam09798, LCD1, DNA damage checkpoint protein.  This is a family
           of proteins which regulate checkpoint kinases. In
           Schizosaccharomyces pombe this protein is called Rad26
           and in Saccharomyces cerevisiae it is called LCD1.
          Length = 648

 Score = 32.5 bits (74), Expect = 0.71
 Identities = 20/61 (32%), Positives = 33/61 (54%), Gaps = 9/61 (14%)

Query: 391 ELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQER 450
            L+++    K+E+ N +K +  E KE+ +QE     L K +QE   L+ E +  +LEQ  
Sbjct: 8   MLQQQ----KQEERNKQKSRVNELKEKHDQE-----LQKLKQELQSLEDERKFLVLEQRG 58

Query: 451 L 451
           L
Sbjct: 59  L 59


>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 472

 Score = 32.0 bits (73), Expect = 0.72
 Identities = 15/54 (27%), Positives = 31/54 (57%), Gaps = 4/54 (7%)

Query: 410 KEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQE----RLMEELRQQG 459
              +E + +E+ +N  +  K++++E+    EE+   +E E     LMEEL+++G
Sbjct: 337 PNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEELKEKG 390


>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase.  This domain family is found in
           eukaryotes, and is approximately 140 amino acids in
           length. The family is found in association with
           pfam00069. Polo-like kinase 1 (Plx1) is essential during
           mitosis for the activation of Cdc25C, for spindle
           assembly, and for cyclin B degradation. This family is
           Polo kinase kinase (PKK) which phosphorylates Polo
           kinase and Polo-like kinase to activate them. PKK is a
           serine/threonine kinase.
          Length = 142

 Score = 30.8 bits (70), Expect = 0.73
 Identities = 24/83 (28%), Positives = 44/83 (53%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKL 426
            +E +   + ++ E K +    KE L+ E+  +K+E E + + +EQE+K  K ++E Q+ 
Sbjct: 47  TQELRRLPKRIRAEQKTRLKMFKESLKIEKKELKQEVEKLPRFQEQEKKRMKAEKEEQEQ 106

Query: 427 LLKKQQEEDRLKAEEQARLLEQE 449
             +KQ+ E   K EE      Q+
Sbjct: 107 KHQKQEREFLAKQEENLEEALQQ 129


>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
          Length = 874

 Score = 32.4 bits (75), Expect = 0.75
 Identities = 23/67 (34%), Positives = 35/67 (52%), Gaps = 14/67 (20%)

Query: 384 KELERKEELEKEQIRIKEEQENIKKKKEQE-----------EKERKEQEENQKLLLKKQQ 432
            EL R   LEKE  ++++E E ++KK   E           EKER++  E ++ L K ++
Sbjct: 811 AELAR---LEKELAKLEKEIERVEKKLSNEGFVAKAPEEVVEKEREKLAEYEEKLAKLKE 867

Query: 433 EEDRLKA 439
              RLKA
Sbjct: 868 RLARLKA 874


>gnl|CDD|212713 cd11779, SH3_Irsp53_BAIAP2L, Src Homology 3 domain of Insulin
           Receptor tyrosine kinase Substrate p53, Brain-specific
           Angiogenesis Inhibitor 1-Associated Protein 2
           (BAIAP2)-Like proteins, and similar proteins.  Proteins
           in this family include IRSp53, BAIAP2L1, BAIAP2L2, and
           similar proteins. They all contain an
           Inverse-Bin/Amphiphysin/Rvs (I-BAR) or IMD domain in
           addition to the SH3 domain. IRSp53, also known as
           BAIAP2, is a scaffolding protein that takes part in many
           signaling pathways including Cdc42-induced filopodia
           formation, Rac-mediated lamellipodia extension, and
           spine morphogenesis. IRSp53 exists as multiple splicing
           variants that differ mainly at the C-termini. BAIAP2L1,
           also called IRTKS (Insulin Receptor Tyrosine Kinase
           Substrate), serves as a substrate for the insulin
           receptor and binds the small GTPase Rac. It plays a role
           in regulating the actin cytoskeleton and colocalizes
           with F-actin, cortactin, VASP, and vinculin. IRSp53 and
           IRTKS also mediate the recruitment of effector proteins
           Tir and EspFu, which regulate host cell actin
           reorganization, to bacterial attachment sites. BAIAP2L2
           co-localizes with clathrin plaques but its function has
           not been determined. The SH3 domains of IRSp53 and IRTKS
           have been shown to bind the proline-rich C-terminus of
           EspFu. SH3 domains are protein interaction domains that
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 57

 Score = 28.8 bits (65), Expect = 0.77
 Identities = 18/53 (33%), Positives = 26/53 (49%), Gaps = 7/53 (13%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDE---GWWRG--YCHGQYGLFPANYV 521
           ALY + A  + ++SF+  D+IT +    E   GW  G     G+ G FP  Y 
Sbjct: 5   ALYPHAAGGETQLSFEEGDVITLL--GPEPRDGWHYGENERSGRRGWFPIAYT 55


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 32.1 bits (74), Expect = 0.77
 Identities = 14/53 (26%), Positives = 28/53 (52%), Gaps = 5/53 (9%)

Query: 387 ERKEELEKEQIRIKEEQENIKKKKEQEEKERKE-QEENQKLLLKKQQEEDRLK 438
           ER E LE+    ++E+   ++ K    E E ++ + E ++L    + E ++LK
Sbjct: 1   ERLEALEERNSELEEQIRQLELKLRDLEAENEKLERELERL----KSELEKLK 49


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 31.5 bits (72), Expect = 0.78
 Identities = 12/39 (30%), Positives = 19/39 (48%), Gaps = 1/39 (2%)

Query: 387 ERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           E K+    E+    EE E     KE+E  E +++E  +K
Sbjct: 158 EWKDAKLLEEFA-AEEAEAAAAAKEEESAEGEKKESKKK 195



 Score = 28.8 bits (65), Expect = 5.7
 Identities = 11/38 (28%), Positives = 17/38 (44%), Gaps = 4/38 (10%)

Query: 383 RKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
            K+ +  EE   E+     E     K++E  E E+KE 
Sbjct: 159 WKDAKLLEEFAAEE----AEAAAAAKEEESAEGEKKES 192


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 31.8 bits (72), Expect = 0.80
 Identities = 26/106 (24%), Positives = 51/106 (48%), Gaps = 10/106 (9%)

Query: 355 TSDGIPIQNSNKEEEEKEKQRMVQEEIKRKE--LERKEELEKEQIRIKEEQENIKKKKEQ 412
           T   +P     KE+ E+ K+++ +E  K KE  L+  EELE E   ++E  + ++ +  +
Sbjct: 123 TLQRVPEYMDLKEDYEELKEKL-EELQKEKEELLKELEELEAEYEEVQERLKRLEVENSR 181

Query: 413 EEKERKEQEENQKLLLKKQQEEDRL-KAEEQARLLEQERLMEELRQ 457
            E+  K      KL  +    + R  + E    L E+E + + +++
Sbjct: 182 LEEMLK------KLPGEVYDLKKRWDELEPGVELPEEELISDLVKE 221


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 32.0 bits (73), Expect = 0.81
 Identities = 23/92 (25%), Positives = 44/92 (47%), Gaps = 4/92 (4%)

Query: 374 QRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL---LKK 430
           ++ ++EE   K LE  ++   ++  +   +E  +K  EQ E+  + QEE +K +   +KK
Sbjct: 440 KKFLEEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKK 499

Query: 431 QQEEDRLKAEEQARLLEQER-LMEELRQQGTN 461
             E D     E+  + E+    +E L  +  N
Sbjct: 500 IPEVDTYLLLEELGINEETYEKLEALLAKKNN 531


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 31.3 bits (71), Expect = 0.81
 Identities = 21/76 (27%), Positives = 46/76 (60%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
           I+ ++++EEEKE      +  + ++    EE EKE+   + E+E+I +++E+EE  ++E+
Sbjct: 33  IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEE 92

Query: 421 EENQKLLLKKQQEEDR 436
           +E +   +K+Q+    
Sbjct: 93  KEEEAEDVKQQEVFSF 108


>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b.  This model
           describes the F1/F0 ATP synthase b subunit in bacteria
           only. Scoring just below the trusted cutoff are the
           N-terminal domains of Mycobacterial b/delta fusion
           proteins and a subunit from an archaeon, Methanosarcina
           barkeri, in which the ATP synthase homolog differs in
           architecture and is not experimentally confirmed. This
           model helps resolve b from the related b' subunit.
           Within the family is an example from a
           sodium-translocating rather than proton-translocating
           ATP synthase [Energy metabolism, ATP-proton motive force
           interconversion].
          Length = 147

 Score = 30.8 bits (70), Expect = 0.83
 Identities = 19/96 (19%), Positives = 45/96 (46%), Gaps = 7/96 (7%)

Query: 360 PIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIR-IKEEQENI-----KKKKEQE 413
           P+  + +  ++K    +   E  +KE    ++  +  ++  K+E + I     K+  E  
Sbjct: 19  PLAKAIETRQKKIADGLASAERAKKEAALAQKKAQVILKEAKDEAQEIIENANKRGSEIL 78

Query: 414 EKERKE-QEENQKLLLKKQQEEDRLKAEEQARLLEQ 448
           E+ + E +EE +K+  + + E +  K + +  L +Q
Sbjct: 79  EEAKAEAREEREKIKAQARAEIEAEKEQAREELRKQ 114


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 32.0 bits (73), Expect = 0.86
 Identities = 22/107 (20%), Positives = 50/107 (46%), Gaps = 4/107 (3%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           + E  K+++   + E  R+ELE +EE ++E+     ++   ++K   E  E++ + +  K
Sbjct: 381 RAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLK 440

Query: 426 LLLKKQQEE----DRLKAEEQARLLEQERLMEELRQQGTNEDTEEDL 468
              K + +E    D  +  E     + E++  +L ++      EE+ 
Sbjct: 441 KENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEE 487



 Score = 30.0 bits (68), Expect = 3.6
 Identities = 27/95 (28%), Positives = 44/95 (46%), Gaps = 11/95 (11%)

Query: 365 NKEEEEKEKQRMVQEEIK--RKELERKEELEKEQIRIKEEQENIKKKK----EQEEKERK 418
               EE E +++  EE K  R EL +  EL   +    +  + IK KK     ++EK ++
Sbjct: 183 LTPFEELELKKLSPEEAKARRAELRKMRELLFREEAKAKRIKKIKSKKYHRVHKKEKLKE 242

Query: 419 EQEENQKLLLKKQQEEDRLKAEEQARLLEQERLME 453
           E +E ++L      + D   A E+   LE+ R  E
Sbjct: 243 ELKEFEEL-----VKADPEAALEELEKLERRRAEE 272


>gnl|CDD|235777 PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl carrier
           protein subunit; Validated.
          Length = 155

 Score = 30.9 bits (71), Expect = 0.87
 Identities = 18/89 (20%), Positives = 29/89 (32%), Gaps = 11/89 (12%)

Query: 295 RKLKDQIDLEQAQKLEERRLSELKIKEEE----IEKKLNGHSDVPLSPSTETPPVPVKSI 350
           RK+K  I+L     ++E  +SE +IKE E    I +            +   P     + 
Sbjct: 4   RKIKKLIEL-----VDESGISEFEIKEGEESVRISRAAAAPVAPVAQQAAAAPVAAAPAA 58

Query: 351 LKQPTSDGIPIQNSNKEEEEKEKQR--MV 377
                +       +   E E       MV
Sbjct: 59  AAAAAAAPAAAPAAAAAEAEGHVVTSPMV 87


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 31.6 bits (72), Expect = 0.90
 Identities = 14/69 (20%), Positives = 31/69 (44%)

Query: 390 EELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQE 449
           E+ ++++  ++E+Q  ++ K E     + E E     L  ++ E++ L A   A+     
Sbjct: 151 EQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASAL 210

Query: 450 RLMEELRQQ 458
                L +Q
Sbjct: 211 GEKAALEEQ 219


>gnl|CDD|205150 pfam12925, APP_E2, E2 domain of amyloid precursor protein.  The E2
           domain is the largest of the conserved domains of the
           amyloid precursor protein. The structure of E2 consists
           of two coiled-coil sub-structures connected through a
           continuous helix, and bears an unexpected resemblance to
           the spectrin family of protein structures.E 2 can
           reversibly dimerise in solution, and the dimerisation
           occurs along the longest dimension of the molecule in an
           antiparallel orientation, which enables the N-terminal
           substructure of one monomer to pack against the
           C-terminal substructure of a second monomer. The high
           degree of conservation of residues at the putative dimer
           interface suggests that the E2 dimer observed in the
           crystal could be physiologically relevant. Heparin
           sulfate proteoglycans, the putative ligands for the
           precursor present in extracellular matrix, bind to E2 at
           a conserved and positively charged site near the dimer
           interface.
          Length = 193

 Score = 31.2 bits (71), Expect = 0.91
 Identities = 31/113 (27%), Positives = 49/113 (43%), Gaps = 11/113 (9%)

Query: 353 QPTSDGI-PIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKE----QIRIKEEQENIK 407
            PTSD + P  +  KE+ E E+ +  ++ ++ K  ER  ++ KE    + + K   +   
Sbjct: 6   SPTSDAVDPYFSEPKEDNEHERYKEAKKRLEEKHRERMTQVMKEWEEAESQYKNLPKADP 65

Query: 408 KKKEQEEKERKE--QEENQKLLLKKQQEEDRLKAEEQAR----LLEQERLMEE 454
           K  +   KE  E  QE  Q L  +   E  +L    Q R    L E+ R   E
Sbjct: 66  KAAQLMRKELTERFQETVQTLEQEAAAERQQLVETHQQRVEAHLNERRRAALE 118


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 31.9 bits (73), Expect = 0.92
 Identities = 16/58 (27%), Positives = 35/58 (60%)

Query: 378 QEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEED 435
           +EE++ +  E  +E E E+   + E++ ++KK ++  K+ K++E   + LL+ + EE 
Sbjct: 67  EEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEEP 124


>gnl|CDD|212818 cd11885, SH3_SH3TC, Src Homology 3 domain of SH3 domain and
           tetratricopeptide repeat-containing (SH3TC) proteins and
           similar domains.  This subfamily is composed of
           vertebrate SH3TC proteins and hypothetical fungal
           proteins containing BAR and SH3 domains. Mammals contain
           two SH3TC proteins, SH3TC1 and SH3TC2. The function of
           SH3TC1 is unknown. SH3TC2 is localized in Schwann cells
           in the peripheral nervous system, where it interacts
           with Rab11 and plays a role in peripheral nerve
           myelination. Mutations in SH3TC2 are associated with
           Charcot-Marie-Tooth disease type 4C, a severe hereditary
           peripheral neuropathy with symptoms that include
           progressive scoliosis, delayed age of walking, muscular
           atrophy, distal weakness, and reduced nerve conduction
           velocity. SH3 domains bind to proline-rich ligands with
           moderate affinity and selectivity, preferentially to
           PxxP motifs; they play a role in the regulation of
           enzymes by intramolecular interactions, changing the
           subcellular localization of signal pathway components
           and mediate multiprotein complex assemblies.
          Length = 55

 Score = 28.8 bits (65), Expect = 0.92
 Identities = 15/55 (27%), Positives = 23/55 (41%), Gaps = 4/55 (7%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEG--WWRGY--CHGQYGLFPANYV 521
           +  A  D++     E+SF   D I  I  +  G  W+ G     G+ G  P N+ 
Sbjct: 1   SCTAKMDFEGVEPGELSFRQGDSIEIIGDLIPGLQWFVGRSKSSGRVGFVPTNHF 55


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 31.7 bits (72), Expect = 0.92
 Identities = 17/78 (21%), Positives = 43/78 (55%), Gaps = 2/78 (2%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKE--QEEKERK 418
           ++++     +K ++ + +   K ++  +K    K++    EE E   K+++  QEE E+K
Sbjct: 356 VKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKK 415

Query: 419 EQEENQKLLLKKQQEEDR 436
           ++E+  +   K+Q++E +
Sbjct: 416 QKEQADEDKEKRQKDERK 433



 Score = 30.9 bits (70), Expect = 1.7
 Identities = 14/61 (22%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 365 NKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQ 424
           NK+ ++  K+    ++      E + + ++E+++ +EE E  +K++  E+KE+++++E +
Sbjct: 375 NKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLK-QEENEKKQKEQADEDKEKRQKDERK 433

Query: 425 K 425
           K
Sbjct: 434 K 434



 Score = 29.4 bits (66), Expect = 5.3
 Identities = 14/57 (24%), Positives = 33/57 (57%), Gaps = 2/57 (3%)

Query: 387 ERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQA 443
           +R+E L++   ++++  + + + K++ +    E+ E +    K +QEE+  K +EQA
Sbjct: 366 KRQELLKEYNKKLQDYTKKLGEVKDETDAS--EEAEAKAKEEKLKQEENEKKQKEQA 420


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 32.0 bits (73), Expect = 0.93
 Identities = 13/88 (14%), Positives = 33/88 (37%), Gaps = 5/88 (5%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKK-----KEQEEKE 416
           Q+S    + K +   +  E +    + +  +  E+  +K +   ++         Q  K 
Sbjct: 851 QDSGIRNDVKHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKY 910

Query: 417 RKEQEENQKLLLKKQQEEDRLKAEEQAR 444
            +E+   +++      EE   +A+E   
Sbjct: 911 NEEKSAQERMPGADSPEELMKRAKEYQD 938


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 30.4 bits (69), Expect = 0.96
 Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 4/61 (6%)

Query: 382 KRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEE 441
           K KE  R+++L+ +    KEE E  K+K+E+  K   E E+ +  L KK+ E   +K E+
Sbjct: 67  KAKEKLRRDKLKAK----KEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEEK 122

Query: 442 Q 442
           +
Sbjct: 123 K 123


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 31.2 bits (71), Expect = 0.98
 Identities = 33/142 (23%), Positives = 65/142 (45%), Gaps = 11/142 (7%)

Query: 288 SEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVP-LSPSTETPPVP 346
           +E+E  KR L+   +LE+A++   +   E K+  E+ +K  N  +         E   + 
Sbjct: 50  TEDELRKRDLRA--ELEEAER-AHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIK 106

Query: 347 VKSILKQPTS-DGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQEN 405
            K I +   + D     +S+  +   +       E +   L R  ELEK    IK+E+  
Sbjct: 107 RKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLR--ELEK----IKKERAE 160

Query: 406 IKKKKEQEEKERKEQEENQKLL 427
            K+++E+E+   +E+   +++L
Sbjct: 161 EKEREEEEKAAEEEKAREEEIL 182


>gnl|CDD|212950 cd12017, SH3_Tks_3, Third Src homology 3 domain of Tyrosine kinase
           substrate (Tks) proteins.  Tks proteins are Src
           substrates and scaffolding proteins that play important
           roles in the formation of podosomes and invadopodia, the
           dynamic actin-rich structures that are related to cell
           migration and cancer cell invasion. Vertebrates contain
           two Tks proteins, Tks4 (Tyr kinase substrate with four
           SH3 domains) and Tks5 (Tyr kinase substrate with five
           SH3 domains), which display partially overlapping but
           non-redundant functions. Both associate with the ADAMs
           family of transmembrane metalloproteases, which function
           as sheddases and mediators of cell and matrix
           interactions. Tks5 interacts with N-WASP and Nck, while
           Tks4 is essential for the localization of MT1-MMP
           (membrane-type 1 matrix metalloproteinase) to
           invadopodia. Tks proteins contain an N-terminal Phox
           homology (PX) domain and four or five SH3 domains. This
           model characterizes the third SH3 domain of Tks
           proteins. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 53

 Score = 28.6 bits (64), Expect = 1.0
 Identities = 14/45 (31%), Positives = 23/45 (51%)

Query: 477 DYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           ++QA+  D ISF     +  I+    GWW     G+ G  P++Y+
Sbjct: 7   EFQATIQDGISFQKGQKVEVIDKNPSGWWYVKIDGKEGWAPSSYI 51


>gnl|CDD|214354 CHL00074, rps14, ribosomal protein S14.
          Length = 100

 Score = 29.8 bits (68), Expect = 1.1
 Identities = 13/43 (30%), Positives = 24/43 (55%), Gaps = 5/43 (11%)

Query: 373 KQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEK 415
           K+ ++Q E KR++L +K  L++  ++     + IKK     EK
Sbjct: 3   KKSLIQREKKRQKLVQKYHLKRRSLK-----KEIKKVSSLSEK 40


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 31.9 bits (73), Expect = 1.1
 Identities = 29/149 (19%), Positives = 58/149 (38%), Gaps = 5/149 (3%)

Query: 277 AKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPL 336
            +QT E    +  E  EK + +D+      ++ + RR  E +  ++E  K LN       
Sbjct: 647 QQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQE-AKALNVEEQSVQ 705

Query: 337 SPSTETPPVPVKSILKQ-PTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKE 395
               E     V+   KQ   +  + I+ S  EE       +V+E +  + + ++    + 
Sbjct: 706 ETEQEERVQQVQPRRKQRQLNQKVRIEQSVAEEAV---APVVEETVAAEPVVQEVPAPRT 762

Query: 396 QIRIKEEQENIKKKKEQEEKERKEQEENQ 424
           ++         +   EQ+E+   E  +N 
Sbjct: 763 ELVKVPLPVVAQTAPEQDEENNAENRDNN 791



 Score = 30.8 bits (70), Expect = 2.0
 Identities = 25/97 (25%), Positives = 48/97 (49%), Gaps = 13/97 (13%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
              N+EE  + +++  Q+  + +E ++ E  EK     + + E  +++  + E++R+  +
Sbjct: 632 GRENREENRRNRRQAQQQTAETRESQQAEVTEKA----RTQDE--QQQAPRRERQRRRND 685

Query: 422 ENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
           E ++     QQE   L  EEQ+    QE   EE  QQ
Sbjct: 686 EKRQA----QQEAKALNVEEQS---VQETEQEERVQQ 715



 Score = 29.6 bits (67), Expect = 4.9
 Identities = 29/143 (20%), Positives = 57/143 (39%), Gaps = 13/143 (9%)

Query: 333 DVPLSPSTETPPVPVKSILKQPTSDGIPIQ------------NSNKEEEEKEKQRMVQEE 380
           DVP +P+   P  PV +   +  +   P Q                  EE + Q     +
Sbjct: 536 DVPPAPTPAEPAAPVVAAAPKAAAATPPAQPGLLSRFFGALKALFSGGEETKPQEQPAPK 595

Query: 381 IKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ-EENQKLLLKKQQEEDRLKA 439
            + K   +++  +  Q   ++  E    +  +  +E +E  EEN++   + QQ+    + 
Sbjct: 596 AEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRNRRQAQQQTAETRE 655

Query: 440 EEQARLLEQERLMEELRQQGTNE 462
            +QA + E+ R  +E +Q    E
Sbjct: 656 SQQAEVTEKARTQDEQQQAPRRE 678


>gnl|CDD|179302 PRK01558, PRK01558, V-type ATP synthase subunit E; Provisional.
          Length = 198

 Score = 30.9 bits (70), Expect = 1.1
 Identities = 22/57 (38%), Positives = 28/57 (49%), Gaps = 1/57 (1%)

Query: 379 EEIKRKELERKEELEKEQIR-IKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEE 434
            +IK+  LE  E L  E I   KEE E I  K E+E KE K + E +    K+   E
Sbjct: 10  NKIKKDGLEEAERLANEIILEAKEEAEEIIAKAEEEAKELKAKAEKEANDYKRHALE 66


>gnl|CDD|212788 cd11854, SH3_Fus1p, Src homology 3 domain of yeast cell fusion
           protein Fus1p.  Fus1p is required at the cell surface
           for cell fusion during the mating response in yeast. It
           requires Bch1p and Bud7p, which are Chs5p-Arf1p binding
           proteins, for localization to the plasma membrane. It
           acts as a scaffold protein to assemble a cell surface
           complex which is involved in septum degradation and
           inhibition of the NOG pathway to promote cell fusion.
           The SH3 domain of Fus1p interacts with Bin1p, a formin
           that controls the assembly of actin cables in response
           to Cdc42 signaling. It has been shown to bind the motif,
           R(S/T)(S/T)SL, instead of PxxP motifs. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 56

 Score = 28.4 bits (64), Expect = 1.1
 Identities = 7/34 (20%), Positives = 15/34 (44%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGW 504
               +  ++ S DDE+     + +  +   D+GW
Sbjct: 1   LMTVISTFEPSLDDELLIKVGETVRVLAEYDDGW 34


>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
          Length = 613

 Score = 31.6 bits (73), Expect = 1.2
 Identities = 20/87 (22%), Positives = 38/87 (43%), Gaps = 18/87 (20%)

Query: 390 EELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQE-----------EDRLK 438
           E   K    +     ++ K+ E+E++E++E EE  K LL++ +E             RL 
Sbjct: 460 EFDGKPFKSVARGDLDLGKEDEEEKEEKEEAEEEFKPLLERLKEALGDKVKDVRLSHRLT 519

Query: 439 -------AEEQARLLEQERLMEELRQQ 458
                  A+E     + E+L++   Q+
Sbjct: 520 DSPACLVADEGDMSTQMEKLLKAAGQE 546


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 30.5 bits (69), Expect = 1.2
 Identities = 17/68 (25%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKL 426
            EE +++Q+ V +E + KE  + ++ +KE+   K+  E + KKK  +  +   ++  +K 
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKE-KKVAEKLAKKKSTKTTKNTTKKATKKT 150

Query: 427 LLKKQQEE 434
             KK++ +
Sbjct: 151 TTKKEEGK 158


>gnl|CDD|153342 cd07658, F-BAR_NOSTRIN, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Nitric Oxide Synthase
           TRaffic INducer (NOSTRIN).  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Nitric Oxide Synthase TRaffic
           INducer (NOSTRIN) is expressed in endothelial and
           epithelial cells and is involved in the regulation,
           trafficking and targeting of endothelial NOS (eNOS).
           NOSTRIN facilitates the endocytosis of eNOS by
           coordinating the functions of dynamin and the
           Wiskott-Aldrich syndrome protein (WASP). Increased
           expression of NOSTRIN may be correlated to preeclampsia.
           NOSTRIN contains an N-terminal F-BAR domain and a
           C-terminal SH3 domain. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
           The F-BAR domain of NOSTRIN is necessary and sufficient
           for its membrane association and is responsible for its
           subcellular localization.
          Length = 239

 Score = 31.2 bits (71), Expect = 1.2
 Identities = 25/95 (26%), Positives = 48/95 (50%), Gaps = 9/95 (9%)

Query: 365 NKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQ 424
            + E+ K K+++       +E E+ ++  ++  +   +Q+ + K K+  E + KE E   
Sbjct: 121 WRSEQIKVKKKLHG---LARENEKLQDQVEDNKQSCTKQKMLNKLKKSAEVQDKEDE--- 174

Query: 425 KLLLKKQQ-EEDRLKAEEQARLLEQERLMEELRQQ 458
           KL  K+++ EE RLKAE +         +E LR +
Sbjct: 175 KLEAKRKKGEESRLKAENEYY--TCCVRLERLRLE 207


>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
          Length = 444

 Score = 31.2 bits (71), Expect = 1.2
 Identities = 26/114 (22%), Positives = 42/114 (36%), Gaps = 13/114 (11%)

Query: 358 GIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEE--LEKEQIRIK-EEQENIKKKKEQEE 414
             P +  N + E  E    +  EIK +  E  +E          + EE+E    +K  E 
Sbjct: 205 YPPSRGYNLKSELVEYSSEIVNEIKEEVCETDDESAYVSLDAEEEFEEEEEKPAEKSTES 264

Query: 415 KERKEQEENQK---LLLKKQQE-----EDRLKAEEQARLLEQERLMEELRQQGT 460
             +  +E +       L   +E     E+R KA E   L + E  +  L + G 
Sbjct: 265 TFQLSKETSIAKESKELPDGEEIEFGNEERFKAPEI--LFKPELPISGLEEAGK 316


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 30.9 bits (70), Expect = 1.2
 Identities = 22/121 (18%), Positives = 54/121 (44%), Gaps = 32/121 (26%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKK--------KKEQEEKERK 418
           EEEE+E    + + ++    ++KE+  + Q R  E  +N+ K        + ++ ++E+K
Sbjct: 91  EEEEREWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQK 150

Query: 419 EQEENQKL------------------------LLKKQQEEDRLKAEEQARLLEQERLMEE 454
            +   ++                         +L+++++E++ K +E  R  ++E+ M  
Sbjct: 151 ARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKKKVKEAKRREKEEKRMAA 210

Query: 455 L 455
           L
Sbjct: 211 L 211



 Score = 29.4 bits (66), Expect = 4.2
 Identities = 16/87 (18%), Positives = 44/87 (50%), Gaps = 3/87 (3%)

Query: 387 ERKEELEKEQIRIKEEQENIKKK-KEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARL 445
           E+ EELE+E+         + ++ +EQ++++   ++  +  + K   +  ++ A+ +A+ 
Sbjct: 85  EKLEELEEEEREWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQK 144

Query: 446 LEQERLM--EELRQQGTNEDTEEDLGY 470
            ++E+     + R++    +  E  GY
Sbjct: 145 RKREQKARAAKERKERLVAEAREHFGY 171


>gnl|CDD|212711 cd11777, SH3_CIP4_Bzz1_like, Src Homology 3 domain of
           Cdc42-Interacting Protein 4, Bzz1 and similar domains.
           This subfamily is composed of Cdc42-Interacting Protein
           4 (CIP4) and similar proteins such as Formin Binding
           Protein 17 (FBP17) and FormiN Binding Protein 1-Like
           (FNBP1L), as well as yeast Bzz1 (or Bzz1p). CIP4 and
           FNBP1L are Cdc42 effectors that bind Wiskott-Aldrich
           syndrome protein (WASP) and function in endocytosis.
           CIP4 and FBP17 bind to the Fas ligand and may be
           implicated in the inflammatory response. CIP4 may also
           play a role in phagocytosis. Bzz1 is also a
           WASP/Las17-interacting protein involved in endocytosis
           and trafficking to the vacuole. It physically interacts
           with type I myosins and functions in the early steps of
           endocytosis. Members of this subfamily contain an
           N-terminal F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain as well as at least one
           C-terminal SH3 domain. Bzz1 contains a second SH3 domain
           at the C-terminus. SH3 domains are protein interaction
           domains that bind to proline-rich ligands with moderate
           affinity and selectivity, preferentially to PxxP motifs.
           They play versatile and diverse roles in the cell
           including the regulation of enzymes, changing the
           subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 28.3 bits (63), Expect = 1.2
 Identities = 14/53 (26%), Positives = 26/53 (49%), Gaps = 2/53 (3%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEM-IDEGWWR-GYCHGQYGLFPANYVSL 523
            ALY +  S++  IS    + ++ +E    +GW R     G+ G  P +Y+ +
Sbjct: 3   KALYAFVGSSEGTISMTEGEKLSLVEEDKGDGWTRVRRDTGEEGYVPTSYIRI 55


>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
           region.  Members of this family have very varied
           localisations within the eukaryotic cell. pinin is known
           to localise at the desmosomes and is implicated in
           anchoring intermediate filaments to the desmosomal
           plaque. SDK2/3 is a dynamically localised nuclear
           protein thought to be involved in modulation of
           alternative pre-mRNA splicing. memA is a tumour marker
           preferentially expressed in human melanoma cell lines. A
           common feature of the members of this family is that
           they may all participate in regulating protein-protein
           interactions.
          Length = 131

 Score = 30.1 bits (68), Expect = 1.2
 Identities = 21/84 (25%), Positives = 48/84 (57%), Gaps = 2/84 (2%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
           +Q  ++EE     +   + EI++K LE +E+ E+E++R KE++E  ++++ ++ + RK +
Sbjct: 20  LQKFSQEESRLTSKEKRRAEIEQK-LEEQEKQEREELR-KEKRELFEERRRKQLELRKLE 77

Query: 421 EENQKLLLKKQQEEDRLKAEEQAR 444
           ++ +   L++   E  L      R
Sbjct: 78  QKMEDEKLQETWHEHNLALANFIR 101


>gnl|CDD|151935 pfam11498, Activator_LAG-3, Transcriptional activator LAG-3.  The
           C.elegans Notch pathway, involved in the control of
           growth, differentiation and patterning in animal
           development, relies on either of the receptors GLP-1 or
           LIN-12. Both these receptors promote signalling by the
           recruitment of LAG-3 to target promoters, where it then
           acts as a transcriptional activator. LAG-3 works as a
           ternary complex together with the DNA binding protein,
           LAG-1.
          Length = 476

 Score = 31.5 bits (70), Expect = 1.2
 Identities = 12/61 (19%), Positives = 41/61 (67%)

Query: 398 RIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQ 457
           ++ ++Q  ++  ++QE + ++ +++ Q+ ++ + Q++ + + ++Q  LL+Q++ M +L+Q
Sbjct: 324 QLAQQQNKMRLLQQQEMEMQRIEQQRQQQIMHQHQQQQQQEHQQQQMLLQQQQQMHQLQQ 383

Query: 458 Q 458
            
Sbjct: 384 H 384



 Score = 29.9 bits (66), Expect = 3.5
 Identities = 20/97 (20%), Positives = 46/97 (47%), Gaps = 2/97 (2%)

Query: 346 PVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQEN 405
           P     K P   G      +        Q + Q   ++ ++   ++ E E  RI  EQ+ 
Sbjct: 292 PEGETKKSPMEAGGDRMPQSAPPPAMNPQHIAQLAQQQNKMRLLQQQEMEMQRI--EQQR 349

Query: 406 IKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQ 442
            ++   Q +++++++ + Q++LL++QQ+  +L+   Q
Sbjct: 350 QQQIMHQHQQQQQQEHQQQQMLLQQQQQMHQLQQHHQ 386


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 31.6 bits (71), Expect = 1.2
 Identities = 16/40 (40%), Positives = 25/40 (62%)

Query: 364 SNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQ 403
           + K EE  EK +   E+  R+E ER++E EKE+ R +E +
Sbjct: 578 AKKREEAVEKAKREAEQKAREEREREKEKEKERERERERE 617



 Score = 31.2 bits (70), Expect = 1.6
 Identities = 16/44 (36%), Positives = 28/44 (63%), Gaps = 2/44 (4%)

Query: 407 KKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQER 450
           KK++E  EK ++E E  QK   ++++E+++ K  E+ R  E ER
Sbjct: 579 KKREEAVEKAKREAE--QKAREEREREKEKEKEREREREREAER 620



 Score = 30.8 bits (69), Expect = 2.3
 Identities = 34/131 (25%), Positives = 60/131 (45%), Gaps = 13/131 (9%)

Query: 314 LSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEK 373
           L  ++IKEE +++          +   E+PP P +S   +PT    P   S      K  
Sbjct: 506 LPPIQIKEEPLDE----------AEEPESPPPPPRSPSPEPTVVNTPSHASQSARFYKHL 555

Query: 374 QRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQE 433
            R       R +L     L   ++  K E+   K K+E E+K R+E+E  ++   ++++E
Sbjct: 556 DRGYNS-CARTDLYFTP-LASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKERERE 613

Query: 434 EDRLKAEEQAR 444
            +R +AE  A+
Sbjct: 614 RER-EAERAAK 623


>gnl|CDD|212836 cd11903, SH3_Nck2_3, Third Src Homology 3 domain of Nck2 adaptor
           protein.  Nck2 (also called Nckbeta or Growth factor
           receptor-bound protein 4, Grb4) plays a crucial role in
           connecting signaling pathways of tyrosine kinase
           receptors and important effectors in actin dynamics and
           cytoskeletal remodeling. It binds neuronal signaling
           proteins such as ephrinB and Disabled-1 (Dab-1)
           exclusively. Nck adaptor proteins regulate actin
           cytoskeleton dynamics by linking proline-rich effector
           molecules to protein tyrosine kinases and phosphorylated
           signaling intermediates. They contain three SH3 domains
           and a C-terminal SH2 domain. They function downstream of
           the PDGFbeta receptor and are involved in Rho GTPase
           signaling and actin dynamics. Vertebrates contain two
           Nck adaptor proteins: Nck1 (also called Nckalpha) and
           Nck2, which show partly overlapping functions but also
           bind distinct targets. The third SH3 domain of Nck
           appears to prefer ligands with a PxAPxR motif. SH3
           domains are protein interaction domains that usually
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially a PxxP motif. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 59

 Score = 28.5 bits (63), Expect = 1.3
 Identities = 17/50 (34%), Positives = 27/50 (54%), Gaps = 3/50 (6%)

Query: 475 LYDYQASADDEISFDPDDIITNIEMI--DEGWWR-GYCHGQYGLFPANYV 521
           LY + +  ++E++F+  + +  IE    D  WW+     GQ GL P NYV
Sbjct: 6   LYPFSSVTEEELNFEKGETMEVIEKPENDPEWWKCKNSRGQVGLVPKNYV 55


>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 486

 Score = 31.3 bits (71), Expect = 1.3
 Identities = 28/114 (24%), Positives = 58/114 (50%), Gaps = 13/114 (11%)

Query: 345 VPVKSILKQPTSDGIPIQNSNKEEEEKE--KQRMVQEEIKR---------KELERKEELE 393
           VP+  +LK+ +      Q   KEEE++E    +++ + IK          K  E KEE  
Sbjct: 358 VPISELLKEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEG 417

Query: 394 KEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLE 447
           K  I++++ +E+    + +  K+     E ++  +KK++E+++ + E   ++LE
Sbjct: 418 KITIKVEKSEEDTLDLEIKSIKKYFPFIEFEE--VKKEKEKEKERPEFVEKVLE 469


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
           sigma54-dependent transcription [Transcription / Signal
           transduction mechanisms].
          Length = 225

 Score = 30.7 bits (70), Expect = 1.3
 Identities = 20/102 (19%), Positives = 51/102 (50%), Gaps = 3/102 (2%)

Query: 364 SNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEEN 423
           +   ++ ++ ++M+++ I+  E E  +   +   +    Q+ +++K E+ +   ++ EE 
Sbjct: 16  NELLDKAEDPEKMLEQAIRDMESELAK-ARQALAQAIARQKQLERKLEEAQARAEKLEEK 74

Query: 424 QKLLLKKQQEEDRLKAEEQARLLEQ--ERLMEELRQQGTNED 463
            +L L+   E+   +A E+ + LE   + L  EL+Q     +
Sbjct: 75  AELALQAGNEDLAREALEEKQSLEDLAKALEAELQQAEEQVE 116


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 31.4 bits (70), Expect = 1.3
 Identities = 24/69 (34%), Positives = 37/69 (53%), Gaps = 4/69 (5%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQE----EKERKEQE 421
           K++ E+EKQ+  QE+ K   +  K  +E EQ + K E E     KEQ+    E E+  QE
Sbjct: 183 KQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKDFIKEAEQNCQE 242

Query: 422 ENQKLLLKK 430
            + +  +KK
Sbjct: 243 NHNQFFIKK 251



 Score = 28.7 bits (63), Expect = 8.3
 Identities = 27/96 (28%), Positives = 56/96 (58%), Gaps = 3/96 (3%)

Query: 363 NSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKER-KEQE 421
           N +  E E+E+Q+  QE  K+K  +   EL   QI+ ++E++  +++K++ E+E+ K   
Sbjct: 145 NKSGIELEQEEQKTEQE--KQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSN 202

Query: 422 ENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQ 457
              K  ++ +QE+ + + E+Q  + EQ+  ++E  Q
Sbjct: 203 IANKNAIELEQEKQKTENEKQDLIKEQKDFIKEAEQ 238


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 29.9 bits (68), Expect = 1.3
 Identities = 18/50 (36%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 398 RIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLE 447
           R+  E     KK E+EE+E +++ E + L   K++EE R +A+ Q R L 
Sbjct: 69  RLMREMLLFWKKNEKEERELRKRAEKEALEQAKKEEELR-EAKRQQRKLN 117


>gnl|CDD|211344 cd02867, PseudoU_synth_TruB_4, Pseudouridine synthase homolog 4.
           This group consists of Eukaryotic TruB proteins similar
           to Saccharomyces cerevisiae Pus4. S. cerevisiae Pus4,
           makes psi55 in the T loop of both cytoplasmic and
           mitochondrial tRNAs. Psi55 is almost universally
           conserved.  Pseudouridine synthases catalyze the
           isomerization of specific uridines in an RNA molecule to
           pseudouridines (5-ribosyluracil, psi).
          Length = 312

 Score = 31.3 bits (71), Expect = 1.3
 Identities = 28/124 (22%), Positives = 45/124 (36%), Gaps = 9/124 (7%)

Query: 335 PLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKEL---ERKEE 391
           PL    E   V V  +L +   +  P+     EEE K+ +R V + + ++     E  E 
Sbjct: 163 PLPRPIERRQVVVSELLVKDWIEPGPLFTRTVEEEGKQYERSVVKMLGKELKTFAEVTEL 222

Query: 392 LEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERL 451
               +    EE E        EE +RK + E +        E   L+++   R       
Sbjct: 223 TATAEGDPVEEVEAT-----HEESKRKSEVEEEANEKSLGPEARSLESDA-GRGSFSPAA 276

Query: 452 MEEL 455
           M  L
Sbjct: 277 MVRL 280


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 31.3 bits (71), Expect = 1.3
 Identities = 19/86 (22%), Positives = 43/86 (50%), Gaps = 4/86 (4%)

Query: 359 IPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERK 418
           I I++S + ++E E  ++ +   +R+E E  +   +E++  ++ +   +  + +E     
Sbjct: 368 IDIKDSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLP 427

Query: 419 EQEENQKLLLKKQQEEDRLKAEEQAR 444
           E E   +  L++ Q E     EE+AR
Sbjct: 428 EDEFEDEDELEEAQPE----EEEEAR 449


>gnl|CDD|180700 PRK06800, fliH, flagellar assembly protein H; Validated.
          Length = 228

 Score = 31.0 bits (70), Expect = 1.3
 Identities = 16/72 (22%), Positives = 40/72 (55%), Gaps = 4/72 (5%)

Query: 367 EEEEKEKQRMVQEEIKRKELER-KEELEKEQIRIKEEQENIKKKKEQEEKERKEQE---E 422
           E EE+ ++   +   ++K L +   +L +EQ +++ E++ +   +EQ ++  ++Q    E
Sbjct: 31  EVEEEIQKDHEELLAQQKSLHKELNQLRQEQQKLERERQQLLADREQFQEHVQQQMKEIE 90

Query: 423 NQKLLLKKQQEE 434
             +   +K+Q+E
Sbjct: 91  AARQQFQKEQQE 102



 Score = 30.2 bits (68), Expect = 2.0
 Identities = 16/81 (19%), Positives = 44/81 (54%)

Query: 385 ELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQAR 444
           E+E +EE++K+   +  +Q+++ K+  Q  +E+++ E  ++ LL  +++      ++   
Sbjct: 29  EVEVEEEIQKDHEELLAQQKSLHKELNQLRQEQQKLERERQQLLADREQFQEHVQQQMKE 88

Query: 445 LLEQERLMEELRQQGTNEDTE 465
           +    +  ++ +Q+   E TE
Sbjct: 89  IEAARQQFQKEQQETAYEWTE 109


>gnl|CDD|233242 TIGR01035, hemA, glutamyl-tRNA reductase.  This enzyme, together
           with glutamate-1-semialdehyde-2,1-aminomutase
           (TIGR00713), leads to the production of
           delta-amino-levulinic acid from Glu-tRNA [Biosynthesis
           of cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 417

 Score = 31.2 bits (71), Expect = 1.3
 Identities = 26/111 (23%), Positives = 47/111 (42%), Gaps = 11/111 (9%)

Query: 358 GIPIQNSNKEEEEKEKQRM-VQEEIKR-KELERKEELEK---------EQIRIKEEQENI 406
            +  +N  +  EE EK    V+EE    K+  R  E+E          E +R KE ++ +
Sbjct: 302 PVVEENLAERREEAEKAEEIVEEETAEFKQWLRSLEVEPTIKALRSLAEIVREKELEKAL 361

Query: 407 KKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQ 457
           KK     +   +  E+  + L+ K      ++ ++ A   E E  +E L+ 
Sbjct: 362 KKLPGLSKDVEEVLEDLARKLINKLLHAPTVRLKQLADKEESEVCLEALKN 412


>gnl|CDD|182791 PRK10865, PRK10865, protein disaggregation chaperone; Provisional.
          Length = 857

 Score = 31.4 bits (71), Expect = 1.3
 Identities = 28/98 (28%), Positives = 53/98 (54%), Gaps = 19/98 (19%)

Query: 375 RMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEE 434
           RM Q + K +EL+R   L++  I++K EQ+ + K+ ++  K+R +    +  L  K+++ 
Sbjct: 402 RM-QIDSKPEELDR---LDRRIIQLKLEQQALMKESDEASKKRLDMLNEE--LSDKERQY 455

Query: 435 DRLKAE---EQARL---------LEQERL-MEELRQQG 459
             L+ E   E+A L         LEQ ++ +E+ R+ G
Sbjct: 456 SELEEEWKAEKASLSGTQTIKAELEQAKIAIEQARRVG 493


>gnl|CDD|181435 PRK08457, motB, flagellar motor protein MotB; Reviewed.
          Length = 257

 Score = 30.8 bits (70), Expect = 1.4
 Identities = 17/71 (23%), Positives = 29/71 (40%), Gaps = 11/71 (15%)

Query: 387 ERKEELEKEQIRI---------KEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRL 437
            + E L+ E I+I         K+  + I       E+E  E +  +     +Q+  +R 
Sbjct: 43  AKVEALKTEFIKIFDYTAKPEVKQPVKVIPPDSGDAEEEMDELKSLKASQNTQQETIERR 102

Query: 438 KAEEQARLLEQ 448
           K E  A  L+Q
Sbjct: 103 KGEGGA--LDQ 111


>gnl|CDD|234000 TIGR02760, TraI_TIGR, conjugative transfer relaxase protein TraI.
            This protein is a component of the relaxosome complex. In
            the process of conjugative plasmid transfer the
            realaxosome binds to the plasmid at the oriT (origin of
            transfer) site. The relaxase protein TraI mediates the
            single-strand nicking and ATP-dependent unwinding
            (relaxation, helicase activity) of the plasmid molecule.
            These two activities reside in separate domains of the
            protein.
          Length = 1960

 Score = 31.4 bits (71), Expect = 1.4
 Identities = 33/155 (21%), Positives = 59/155 (38%), Gaps = 8/155 (5%)

Query: 245  ANSTNASKENIKPKPNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLE 304
            AN        +  K N+  + P  +      +     EE++ +   EK   K K   D  
Sbjct: 1794 ANPKATVVIAVGGKNNLSPIIPKFIPKNVVIVLDNDGEEAKSQRAIEKIINKFKQ--DNI 1851

Query: 305  QAQKLEERRLSELKIKEEEIEKKLN---GHSDVPLSPSTETPPVPVKSILKQPTSDGIPI 361
             A+ +     ++  I EEE++K+L       +       +       S+LK P SD I I
Sbjct: 1852 SARIVFPDDWND--IGEEELQKQLMRAISSIEDKDIEIPKAIADFE-SVLKMPNSDIISI 1908

Query: 362  QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQ 396
             ++ + E EK     +     + +   + +L KE 
Sbjct: 1909 IDNARTEREKSDLEQIVANEHKSQQALERDLNKEF 1943


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 31.2 bits (70), Expect = 1.4
 Identities = 34/168 (20%), Positives = 64/168 (38%), Gaps = 16/168 (9%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKK 327
            L+ KF   A      SRK+ E      +  D+      +K         K    EI + 
Sbjct: 37  RLQNKFLLAAG-----SRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEP 91

Query: 328 LNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELE 387
           L   S++             K   K P         ++ + EE++ ++ V+   KR++++
Sbjct: 92  LEEGSEL----VVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVR---KRRKVK 144

Query: 388 RKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEED 435
           + +E  ++Q      +  +   +E E     E E  ++L L+K   ED
Sbjct: 145 KMDEDVEDQ----GSESEVSDVEESEFVTSLENESEEELDLEKDDGED 188


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 31.1 bits (71), Expect = 1.4
 Identities = 26/114 (22%), Positives = 48/114 (42%), Gaps = 22/114 (19%)

Query: 358 GIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENI----------- 406
            I I  S+   EE E +RMV+E     E ++K    KE+I  +   +++           
Sbjct: 489 SITITASSGLSEE-EIERMVKEAEANAEEDKKR---KEEIEARNNADSLAYQAEKTLKEA 544

Query: 407 KKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGT 460
             K   EEKE+ E+   +   LK+  + + ++  +       E L + L++   
Sbjct: 545 GDKLPAEEKEKIEKAVAE---LKEALKGEDVEEIKAK----TEELQQALQKLAE 591


>gnl|CDD|212927 cd11994, SH3_Intersectin2_4, Fourth Src homology 3 domain (or SH3D)
           of Intersectin-2.  Intersectin-2 (ITSN2) is an adaptor
           protein that functions in exo- and endocytosis, actin
           cytoskeletal reorganization, and signal transduction. It
           plays a role in clathrin-coated pit (CCP) formation. It
           binds to many proteins through its multidomain structure
           and facilitate the assembly of multimeric complexes.
           ITSN2 also functions as a specific GEF for Cdc42
           activation in epithelial morphogenesis, and is required
           in mitotic spindle orientation. It exists in
           alternatively spliced short and long isoforms. The short
           isoform contains two Eps15 homology domains (EH1 and
           EH2), a coiled-coil region and five SH3 domains
           (SH3A-E), while the long isoform, in addition, contains
           RhoGEF (also called Dbl-homologous or DH), Pleckstrin
           homology (PH) and C2 domains. The fourth SH3 domain (or
           SH3D) of ITSN2 is expected to bind protein partners,
           similar to ITSN1 which has been shown to bind SHIP2,
           Numb, CdGAP, and N-WASP. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 59

 Score = 28.4 bits (63), Expect = 1.4
 Identities = 17/51 (33%), Positives = 25/51 (49%), Gaps = 5/51 (9%)

Query: 478 YQASADDEISFDPDDIITNIEMIDEGWWRGYCHG-----QYGLFPANYVSL 523
           Y AS  +++S  P  +I  ++    GWW G         Q G FPA++V L
Sbjct: 8   YVASGVEQLSLSPGQLILILKKNSSGWWLGELQARGKKRQKGWFPASHVKL 58


>gnl|CDD|212795 cd11861, SH3_DLG-like, Src Homology 3 domain of Disks large homolog
           proteins.  The DLG-like proteins are scaffolding
           proteins that cluster at synapses and are also called
           PSD (postsynaptic density)-95 proteins or SAPs
           (synapse-associated proteins). They play important roles
           in synaptic development and plasticity, cell polarity,
           migration and proliferation. They are members of the
           MAGUK (membrane-associated guanylate kinase) protein
           family, which is characterized by the presence of a core
           of three domains: PDZ, SH3, and guanylate kinase (GuK).
           The GuK domain in MAGUK proteins is enzymatically
           inactive; instead, the domain mediates protein-protein
           interactions and associates intramolecularly with the
           SH3 domain. DLG-like proteins contain three PDZ domains
           and varying N-terminal regions. All DLG proteins exist
           as alternatively-spliced isoforms. Vertebrates contain
           four DLG proteins from different genes, called DLG1-4.
           DLG4 and DLG2 are found predominantly at postsynaptic
           sites and they mediate surface ion channel and receptor
           clustering. DLG3 is found axons and some presynaptic
           terminals. DLG1 interacts with AMPA-type glutamate
           receptors and is critical in their maturation and
           delivery to synapses. The SH3 domain of DLG4 binds and
           clusters the kainate subgroup of glutamate receptors via
           two proline-rich sequences in their C-terminal tail. It
           also binds AKAP79/150 (A-kinase anchoring protein). SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 61

 Score = 28.4 bits (64), Expect = 1.5
 Identities = 13/37 (35%), Positives = 17/37 (45%), Gaps = 5/37 (13%)

Query: 474 ALYDYQASADD-----EISFDPDDIITNIEMIDEGWW 505
           AL+DY  S D       +SF   DI+      D+ WW
Sbjct: 4   ALFDYDPSRDSGLPSQGLSFKFGDILHVTNASDDEWW 40


>gnl|CDD|223294 COG0216, PrfA, Protein chain release factor A [Translation,
           ribosomal structure and biogenesis].
          Length = 363

 Score = 31.0 bits (71), Expect = 1.5
 Identities = 26/102 (25%), Positives = 40/102 (39%), Gaps = 16/102 (15%)

Query: 306 AQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTE------------TPPVPVKSILKQ 353
            +     +L  L  + EE+E  L   SD  +    +             P V      K+
Sbjct: 1   MKPSLLEKLESLLERYEELEALL---SDPEVISDPDEYRKLSKEYAELEPIVEKYREYKK 57

Query: 354 PTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKE 395
              D    +    EE++ E + M +EEIK  E  + EELE+E
Sbjct: 58  AQEDLEDAKEMLAEEKDPEMREMAEEEIKELE-AKIEELEEE 98


>gnl|CDD|132150 TIGR03106, trio_M42_hydro, hydrolase, peptidase M42 family.  This
           model describes a subfamily of MEROPS peptidase family
           M42, a glutamyl aminopeptidase family that also includes
           the cellulase CelM from Clostridium thermocellum and
           deblocking aminopeptidases that can remove acylated
           amino acids. Members of this family occur in a three
           gene cassette with an amidotransferase (TIGR03104)in the
           asparagine synthase (glutamine-hydrolyzing) family, and
           a probable acetyltransferase (TIGR03103) in the GNAT
           family.
          Length = 343

 Score = 30.9 bits (70), Expect = 1.5
 Identities = 16/37 (43%), Positives = 18/37 (48%), Gaps = 4/37 (10%)

Query: 194 KGFGGKFGIESDRQDKSAVGWDHVE-KVDKHQSQKDD 229
           K  G  F  E D Q     GWDHVE +VD   S + D
Sbjct: 118 KASGHAFNEEIDSQ---PTGWDHVEVRVDARASCRAD 151


>gnl|CDD|239451 cd03335, TCP1_alpha, TCP-1 (CTT or eukaryotic type II) chaperonin
           family, alpha subunit. Chaperonins are involved in
           productive folding of proteins. They share a common
           general morphology, a double toroid of 2 stacked rings.
           In contrast to bacterial group I chaperonins (GroEL),
           each ring of the eukaryotic cytosolic chaperonin (CTT)
           consists of eight different, but homologous subunits.
           Their common function is to sequester nonnative proteins
           inside their central cavity and promote folding by using
           energy derived from ATP hydrolysis. The best studied in
           vivo substrates of CTT are actin and tubulin.
          Length = 527

 Score = 31.1 bits (71), Expect = 1.5
 Identities = 17/46 (36%), Positives = 26/46 (56%)

Query: 292 KEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLS 337
           K K KL  Q+ +   +KLE+ R  E  I +E I+K L   ++V L+
Sbjct: 235 KTKMKLGVQVVVTDPEKLEKIRQRESDITKERIKKILAAGANVVLT 280


>gnl|CDD|212811 cd11878, SH3_Bem1p_1, First Src Homology 3 domain of Bud emergence
           protein 1 and similar domains.  Members of this
           subfamily bear similarity to Saccharomyces cerevisiae
           Bem1p, containing two Src Homology 3 (SH3) domains at
           the N-terminus, a central PX domain, and a C-terminal
           PB1 domain. Bem1p is a scaffolding protein that is
           critical for proper Cdc42p activation during bud
           formation in yeast. During budding and mating, Bem1p
           migrates to the plasma membrane where it can serve as an
           adaptor for Cdc42p and some other proteins. Bem1p also
           functions as an effector of the G1 cyclin Cln3p and the
           cyclin-dependent kinase Cdc28p in promoting vacuolar
           fusion. SH3 domains bind to proline-rich ligands with
           moderate affinity and selectivity, preferentially to
           PxxP motifs; they play a role in the regulation of
           enzymes by intramolecular interactions, changing the
           subcellular localization of signal pathway components
           and mediate multiprotein complex assemblies.
          Length = 54

 Score = 28.0 bits (63), Expect = 1.5
 Identities = 19/50 (38%), Positives = 25/50 (50%), Gaps = 3/50 (6%)

Query: 474 ALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCH---GQYGLFPANY 520
           ALYDY+A    E+SF   D    I   D+G W    +   G+ GL P +Y
Sbjct: 4   ALYDYRAQTPGELSFSKGDFFHVIGEEDQGEWYEATNPVTGKRGLVPKSY 53


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 30.4 bits (69), Expect = 1.6
 Identities = 19/75 (25%), Positives = 39/75 (52%), Gaps = 1/75 (1%)

Query: 360 PIQNSNKEEEEKEKQRMVQEEIKRKELERKE-ELEKEQIRIKEEQENIKKKKEQEEKERK 418
             +    + EE E+    +EE   +E   +E ELE+EQ  I E +  +++ K++  + + 
Sbjct: 5   EQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQA 64

Query: 419 EQEENQKLLLKKQQE 433
           E E  +K   ++++E
Sbjct: 65  EFENLRKRTEREREE 79



 Score = 30.0 bits (68), Expect = 2.3
 Identities = 20/73 (27%), Positives = 36/73 (49%), Gaps = 8/73 (10%)

Query: 352 KQPTSDGIPIQNSNKEEEEKEKQRMVQEE-IKRKELERKEELEKE-------QIRIKEEQ 403
           K    D    + + K EEE+ ++   +EE    +E +   ELE +        +R + E 
Sbjct: 7   KTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQAEF 66

Query: 404 ENIKKKKEQEEKE 416
           EN++K+ E+E +E
Sbjct: 67  ENLRKRTEREREE 79


>gnl|CDD|213000 cd12067, SH3_MYO15A, Src Homology 3 domain of Myosin XVa.  Myosin
           XVa is an unconventional myosin that is critical for the
           normal growth of mechanosensory stereocilia of inner ear
           hair cells. Mutations in the myosin XVa gene are
           associated with nonsyndromic hearing loss. Myosin XVa
           contains a unique N-terminal extension followed by a
           motor domain, light chain-binding IQ motifs, and a tail
           consisting of a pair of MyTH4-FERM tandems separated by
           a SH3 domain, and a PDZ domain. SH3 domains bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs; they play a
           role in the regulation of enzymes by intramolecular
           interactions, changing the subcellular localization of
           signal pathway components and mediate multiprotein
           complex assemblies.
          Length = 80

 Score = 28.6 bits (64), Expect = 1.6
 Identities = 11/21 (52%), Positives = 13/21 (61%)

Query: 501 DEGWWRGYCHGQYGLFPANYV 521
           D GW  G  HG+ G+FPA  V
Sbjct: 58  DFGWKFGAIHGRSGVFPAELV 78


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 31.3 bits (71), Expect = 1.6
 Identities = 21/115 (18%), Positives = 45/115 (39%), Gaps = 12/115 (10%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
               ++E E  ++   + + + + LER+ +  + ++   +     +  +E E     + E
Sbjct: 285 LGRARDELETAREEERELDARTEALEREADALRTRLEALQGSPAYQDAEELERARA-DAE 343

Query: 422 ENQKLL---------LKKQQEEDRLKA-EEQARLLEQERLMEELRQQGTNEDTEE 466
             Q             + + EE+R +  EE  RL + ER +   R+Q      E 
Sbjct: 344 ALQAAAADARQAIREAESRLEEERRRLDEEAGRLDDAERELRAAREQ-LARAAER 397


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 30.0 bits (68), Expect = 1.7
 Identities = 15/68 (22%), Positives = 35/68 (51%), Gaps = 8/68 (11%)

Query: 380 EIKRKELERKEELEKEQ------IRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQE 433
             ++K++E   +  +E+      +   EEQ+ I  +K +E+ E++E +   +L  +   E
Sbjct: 30  RYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELK--NQLFSENATE 87

Query: 434 EDRLKAEE 441
            + +KA +
Sbjct: 88  NNTVKATK 95


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 31.2 bits (70), Expect = 1.7
 Identities = 22/73 (30%), Positives = 33/73 (45%), Gaps = 9/73 (12%)

Query: 404 ENIKKKKEQEEKE-RKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEE-------L 455
           E  KKKKE+E +E + +   N +      QE   ++    AR L  ++LM         L
Sbjct: 461 EPKKKKKEKEGEELQPKLVMNSEDASLASQEGMSIRGNS-ARHLVMQKLMRTNRSNVIVL 519

Query: 456 RQQGTNEDTEEDL 468
           R   T +D +E L
Sbjct: 520 RNMVTPQDIDEFL 532


>gnl|CDD|192910 pfam12001, DUF3496, Domain of unknown function (DUF3496).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is about 110
           amino acids in length.
          Length = 111

 Score = 29.4 bits (66), Expect = 1.8
 Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 5/80 (6%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL 427
           E E  K +  QE+    ELE+ ++L  E+++I++   N   K  +     +  E + KLL
Sbjct: 13  ESELSKMKTSQEDFNTTELEKYKQLYLEELKIRKSLSNKLTKTNE-----RLAEVSTKLL 67

Query: 428 LKKQQEEDRLKAEEQARLLE 447
           L+K+Q    L   E   +LE
Sbjct: 68  LEKEQSRSLLSTLETRPVLE 87


>gnl|CDD|212699 cd11765, SH3_Nck_1, First Src Homology 3 domain of Nck adaptor
           proteins.  Nck adaptor proteins regulate actin
           cytoskeleton dynamics by linking proline-rich effector
           molecules to protein tyrosine kinases and phosphorylated
           signaling intermediates. They contain three SH3 domains
           and a C-terminal SH2 domain. They function downstream of
           the PDGFbeta receptor and are involved in Rho GTPase
           signaling and actin dynamics. Vertebrates contain two
           Nck adaptor proteins: Nck1 (also called Nckalpha) and
           Nck2 (also called Nckbeta or Growth factor
           receptor-bound protein 4, Grb4), which show partly
           overlapping functions but also bind distinct targets.
           Their SH3 domains are involved in recruiting downstream
           effector molecules, such as the N-WASP/Arp2/3 complex,
           which when activated induces actin polymerization that
           results in the production of pedestals, or protrusions
           of the plasma membrane. The first SH3 domain of Nck
           proteins preferentially binds the PxxDY sequence, which
           is present in the CD3e cytoplasmic tail. This binding
           inhibits phosphorylation by Src kinases, resulting in
           the downregulation of TCR surface expression. SH3
           domains are protein interaction domains that usually
           bind to proline-rich ligands with moderate affinity and
           selectivity, preferentially a PxxP motif. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 51

 Score = 27.8 bits (62), Expect = 1.8
 Identities = 19/53 (35%), Positives = 28/53 (52%), Gaps = 6/53 (11%)

Query: 472 AVALYDYQASADDEISFDPDDIITNIEMIDEG--WWR-GYCHGQYGLFPANYV 521
            VA YDY A  D E+S   ++ +T   ++D+   WW+      Q G  P+NYV
Sbjct: 2   VVAKYDYTAQGDQELSIKKNEKLT---LLDDSKHWWKVQNSSNQTGYVPSNYV 51


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 30.7 bits (69), Expect = 1.8
 Identities = 15/107 (14%), Positives = 49/107 (45%), Gaps = 2/107 (1%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           KE+ +   +++V  + +     ++ +L++++  + E+ +   +++  E  +  E+ E+  
Sbjct: 106 KEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSD--DRQETLEAGKVHEETEDSY 163

Query: 426 LLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDLGYTA 472
            + +   E+ +   +E+A   E E   E + +    +   +D+    
Sbjct: 164 HVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEED 210


>gnl|CDD|223618 COG0544, Tig, FKBP-type peptidyl-prolyl cis-trans isomerase
           (trigger factor) [Posttranslational modification,
           protein turnover, chaperones].
          Length = 441

 Score = 30.7 bits (70), Expect = 1.8
 Identities = 24/97 (24%), Positives = 49/97 (50%), Gaps = 4/97 (4%)

Query: 374 QRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQE 433
           + +V+ EI     +  ++L+++ I   E     +++  +E KE  E+     LLL++  +
Sbjct: 303 ESLVEAEIDNLLKQALQQLQQQGIDSLEASGESEEELREEFKEEAEKRVKLGLLLEEIAK 362

Query: 434 EDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDLGY 470
           E++L+  E+    E +  +EEL +Q   E  EE +  
Sbjct: 363 EEKLEVTEE----EIKAEIEELARQYGGEQPEEVIKL 395


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 30.0 bits (68), Expect = 1.8
 Identities = 20/84 (23%), Positives = 41/84 (48%), Gaps = 10/84 (11%)

Query: 371 KEKQRMVQEEIK-----RKEL-----ERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
            E+Q  + +++      ++E      E ++ELE+ + +  E  E  KK+ EQ  +E K +
Sbjct: 36  DERQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAE 95

Query: 421 EENQKLLLKKQQEEDRLKAEEQAR 444
            E +   +K+  E +    +E+A 
Sbjct: 96  AEEELERIKEAAEAEIEAEKERAL 119



 Score = 27.6 bits (62), Expect = 9.7
 Identities = 19/91 (20%), Positives = 45/91 (49%), Gaps = 1/91 (1%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
            +EE +      ++E++    +  E +E+ +   ++  E IK + E+E +  KE  E + 
Sbjct: 52  LKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELERIKEAAEAEI 111

Query: 426 LLLKKQQEED-RLKAEEQARLLEQERLMEEL 455
              K++  E+ R +  E A  + ++ L +++
Sbjct: 112 EAEKERALEELRAEVAELAVAIAEKLLGKKV 142


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 31.0 bits (70), Expect = 1.9
 Identities = 29/116 (25%), Positives = 63/116 (54%), Gaps = 10/116 (8%)

Query: 363 NSNKEE---EEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQ---EEK- 415
            + K+E   E KE+ R  +EEI+++    ++EL++ + R+ E   ++ +K E    +EK 
Sbjct: 67  KALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTERATSLDRKDENLSSKEKT 126

Query: 416 -ERKEQE--ENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDL 468
            E KEQ   +  K + +++++ ++L+ +++A L     L +   ++    +TE  L
Sbjct: 127 LESKEQSLTDKSKHIDEREEQVEKLEEQKKAELERVAALSQAEAREIILAETENKL 182


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 30.4 bits (69), Expect = 1.9
 Identities = 14/53 (26%), Positives = 35/53 (66%), Gaps = 1/53 (1%)

Query: 409 KKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTN 461
           + E+EE+E++++EE ++LL   +   ++ + +++A+ + Q+   EE+R +  N
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEM-QKEEDEEMRHRAAN 172



 Score = 29.2 bits (66), Expect = 4.5
 Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 6/71 (8%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKK------EQEEKERKEQE 421
           + EKE +R       RK+L   E+LE+E+   ++E+E  +  +      EQ   ++K +E
Sbjct: 98  DSEKEDERYEITSDVRKQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKE 157

Query: 422 ENQKLLLKKQQ 432
             ++   + + 
Sbjct: 158 MQKEEDEEMRH 168



 Score = 29.2 bits (66), Expect = 4.6
 Identities = 14/67 (20%), Positives = 31/67 (46%), Gaps = 2/67 (2%)

Query: 356 SDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKE--QE 413
            D      S+  ++ +  +++ +EE ++++ E +E L +      E+    +K KE  +E
Sbjct: 102 EDERYEITSDVRKQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKE 161

Query: 414 EKERKEQ 420
           E E    
Sbjct: 162 EDEEMRH 168


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 30.5 bits (69), Expect = 2.0
 Identities = 23/78 (29%), Positives = 39/78 (50%), Gaps = 9/78 (11%)

Query: 378 QEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEK---------ERKEQEENQKLLL 428
           +  +K    ERKE+  KEQ   KE ++  KKKKE   K           ++ +E +K  L
Sbjct: 272 ESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQKLQEERKEQL 331

Query: 429 KKQQEEDRLKAEEQARLL 446
            K ++E++ + +E  + L
Sbjct: 332 AKLRKEEKEREKEYEQEL 349


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 30.1 bits (68), Expect = 2.0
 Identities = 29/102 (28%), Positives = 52/102 (50%), Gaps = 8/102 (7%)

Query: 332 SDVPLSPSTETPPVPVKSILKQ---PTSDGIPI-----QNSNKEEEEKEKQRMVQEEIKR 383
             +        PPVP    LK    PT  G P        S  E  EKE    V++E + 
Sbjct: 90  PAIYDKVVVPRPPVPQPKGLKMRFFPTGYGAPDGPPSELGSESETSEKETTAKVEKEAEV 149

Query: 384 KELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           +E E+KE+ +K++++ +++++  KK+K  E K  K++++ +K
Sbjct: 150 EEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191


>gnl|CDD|212710 cd11776, SH3_PI3K_p85, Src Homology 3 domain of the p85 regulatory
           subunit of Class IA Phosphatidylinositol 3-kinases.
           Class I PI3Ks convert PtdIns(4,5)P2 to the critical
           second messenger PtdIns(3,4,5)P3. They are heterodimers
           and exist in multiple isoforms consisting of one
           catalytic subunit (out of four isoforms) and one of
           several regulatory subunits. Class IA PI3Ks associate
           with the p85 regulatory subunit family, which contains
           SH3, RhoGAP, and SH2 domains. The p85 subunits recruit
           the PI3K p110 catalytic subunit to the membrane, where
           p110 phosphorylates inositol lipids. Vertebrates harbor
           two p85 isoforms, called alpha and beta. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 72

 Score = 28.2 bits (63), Expect = 2.0
 Identities = 20/67 (29%), Positives = 28/67 (41%), Gaps = 17/67 (25%)

Query: 474 ALYDYQASADDEISFDPDDIIT--NIEMI-------------DEGWWRGYCH--GQYGLF 516
           ALYDY+   D++I     D++   N E++              EGW  G     G+ G F
Sbjct: 5   ALYDYEKERDEDIILKTGDVLVVENPELLALGVPDGKETVPKPEGWLEGKNERTGERGDF 64

Query: 517 PANYVSL 523
           P  YV  
Sbjct: 65  PGTYVEF 71


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 30.7 bits (70), Expect = 2.2
 Identities = 20/97 (20%), Positives = 44/97 (45%), Gaps = 8/97 (8%)

Query: 363 NSNKEEEEKEKQRMVQEEIKRK-ELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
            S   EE   +   V    +    L    +L  E  R+++E E ++K+ ++ EK+     
Sbjct: 782 LSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKL---- 837

Query: 422 ENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
            N+  + K  +E   +  +E+ +L E +  + +L ++
Sbjct: 838 SNEGFVAKAPEE---VVEKEKEKLAEYQVKLAKLEER 871



 Score = 30.3 bits (69), Expect = 3.0
 Identities = 43/175 (24%), Positives = 72/175 (41%), Gaps = 36/175 (20%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKK 327
           NLRA+  NL+     +      EE E R   ++ D++    LEE  +             
Sbjct: 736 NLRAEM-NLSPSAPLKVVLVGSEELEDRLEANEDDIKGLANLEELEI------------- 781

Query: 328 LNGHSDVPLSPSTETPPVPVKSILKQPTSDGIP--IQNSNKEEEEKEKQRMVQEEIKRKE 385
                   LSP  E PPV V ++       G    +  +   +   E  R+       KE
Sbjct: 782 --------LSPDPEEPPVEVTAV-----VGGAELFLPLAGLIDLAAELARL------EKE 822

Query: 386 LER-KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKA 439
           LE+ ++E+++ + ++  E    K  +E  EKE+++  E Q  L K ++    LKA
Sbjct: 823 LEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAKLEERLAVLKA 877


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 30.6 bits (69), Expect = 2.3
 Identities = 28/86 (32%), Positives = 49/86 (56%), Gaps = 8/86 (9%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEE--KERKEQEEN 423
           K+ EE EK       I ++  + +EELEKE  ++ +E+E++ K+   EE  KE ++ EEN
Sbjct: 368 KDVEEGEKT------IVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEEN 421

Query: 424 QKLLLKKQQEEDRLKAEEQARLLEQE 449
               L K+++E   K + +A  +E+ 
Sbjct: 422 FLKELSKEEKELLEKLKMEASEVEKL 447


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 30.7 bits (69), Expect = 2.3
 Identities = 28/103 (27%), Positives = 50/103 (48%), Gaps = 5/103 (4%)

Query: 347 VKSIL--KQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQE 404
           VK IL   + T+D +    S KE   K+ ++ ++ E   +   ++EE E++   I +   
Sbjct: 103 VKGILTLGKKTNDDLEKIESKKESINKKNEKKIKNEASLQVKTQREEKEEKDF-IADCWR 161

Query: 405 NIKKKKEQEEKERKEQEENQKL--LLKKQQEEDRLKAEEQARL 445
           N+ KK E+  KE  E  + +K    + K+ E  ++ A E   L
Sbjct: 162 NLYKKNEERFKEYLENFKRKKFKRKILKEFENAKINASEIVGL 204


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 30.4 bits (69), Expect = 2.3
 Identities = 23/120 (19%), Positives = 47/120 (39%), Gaps = 8/120 (6%)

Query: 347 VKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENI 406
            K +  +        +    E+  +E     +E I+  E +  E +E  +   KEE    
Sbjct: 27  FKVLSIKEKERLREEEEEQVEQLREEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKK 86

Query: 407 KKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEE 466
             +     ++ + Q E +    +++ EE+  +  E+A+        EE  ++G  E  EE
Sbjct: 87  TDEASSIIEKLQMQIERE----QEEWEEELERLIEEAK----AEGYEEGYEKGREEGLEE 138



 Score = 29.6 bits (67), Expect = 3.6
 Identities = 23/97 (23%), Positives = 46/97 (47%), Gaps = 7/97 (7%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQE-NIKKKKEQEEKERKEQ 420
             + +  EE E+      E        +EE ++E ++  +E    I+K + Q E+E++E 
Sbjct: 55  DEAKEIIEEAEEDAFEIVE------AAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEW 108

Query: 421 EENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQ 457
           EE  + L+++ + E   +  E+ R    E + E + Q
Sbjct: 109 EEELERLIEEAKAEGYEEGYEKGREEGLEEVRELIEQ 145


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 30.7 bits (70), Expect = 2.3
 Identities = 19/96 (19%), Positives = 45/96 (46%), Gaps = 4/96 (4%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEE--QENIKKKKEQEEKERKEQEENQ 424
            E+    +++ Q  ++  ELE++   ++   R+  E  +   K   +++E E+ ++E   
Sbjct: 506 REQRHLAEQLQQLRMRLSELEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELEA 565

Query: 425 KLL-LKKQQEEDRLKAEEQARLLEQ-ERLMEELRQQ 458
           +L  L +   E R +     + LEQ +  ++ L  +
Sbjct: 566 RLESLSESVSEARERRMALRQQLEQLQARIQRLAAR 601



 Score = 29.2 bits (66), Expect = 6.5
 Identities = 14/58 (24%), Positives = 28/58 (48%), Gaps = 3/58 (5%)

Query: 368  EEEKEKQRMVQEEIK---RKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEE 422
               +E+ R  ++E+         R+ +LEK+    + E +N+ KK  + E++  E  E
Sbjct: 1052 SGAEERARARRDELHARLSANRSRRNQLEKQLTFCEAEMDNLTKKLRKLERDYHEMRE 1109


>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
          Length = 201

 Score = 30.0 bits (68), Expect = 2.4
 Identities = 11/45 (24%), Positives = 28/45 (62%), Gaps = 1/45 (2%)

Query: 390 EELEKEQIRIKEEQENIKK-KKEQEEKERKEQEENQKLLLKKQQE 433
           E++++ Q  +KE Q+  ++ ++  + K+ K+ +E Q  ++  Q+E
Sbjct: 72  EKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQMEMMDDQRE 116


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 30.7 bits (69), Expect = 2.4
 Identities = 39/197 (19%), Positives = 82/197 (41%), Gaps = 20/197 (10%)

Query: 262 GHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLE---ERRLSELK 318
           G  +   +  + E  A+ TE+E    + EE E+ + + + + E   ++E   +R+ +E +
Sbjct: 708 GETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHE 767

Query: 319 IKEE----------EIEKKLNGH--SDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNK 366
            + E          EI+   +G    D       E           +         +  +
Sbjct: 768 GETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTE 827

Query: 367 EEEEKEKQRMV---QEEIKRKE--LERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
            ++E  +Q +    Q E K+ E  ++     +      +EE+E  ++++E+EE+E +E+E
Sbjct: 828 VKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEE 887

Query: 422 ENQKLLLKKQQEEDRLK 438
           E  +  L  +  E R K
Sbjct: 888 EENEEPLSLEWPETRQK 904


>gnl|CDD|238252 cd00446, GrpE, GrpE is the adenine nucleotide exchange factor of
           DnaK (Hsp70)-type ATPases. The GrpE dimer binds to the
           ATPase domain of Hsp70 catalyzing the dissociation of
           ADP, which enables rebinding of ATP, one step in the
           Hsp70 reaction cycle in protein folding. In eukaryotes,
           only the mitochondrial Hsp70, not the cytosolic form, is
           GrpE dependent.
          Length = 137

 Score = 29.4 bits (67), Expect = 2.4
 Identities = 12/39 (30%), Positives = 22/39 (56%), Gaps = 1/39 (2%)

Query: 389 KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL 427
            EEL+ + +R   E EN +K+ E+E +E ++    +K  
Sbjct: 1   LEELKDKLLRALAEFENYRKRTEREREEARKY-AIEKFA 38


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 30.1 bits (68), Expect = 2.4
 Identities = 15/92 (16%), Positives = 32/92 (34%), Gaps = 12/92 (13%)

Query: 334 VPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELE 393
           V  +P+ E  P P     +               E E   +    +  ++ + E+K +  
Sbjct: 48  VLEAPTEEPQPEPEPPEEQPKP----------PTEPETPPEPTPPKPKEKPKPEKKPKKP 97

Query: 394 KEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           K +   K + +   K K Q + ++   +   K
Sbjct: 98  KPK--PKPKPKPKPKVKPQPKPKKPPSKTAAK 127


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 29.6 bits (67), Expect = 2.5
 Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 9/92 (9%)

Query: 370 EKEKQRMVQE----EIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK 425
           E E +R+       + + ++LER+  L + +     E++  KK K  E+K + E+EE Q+
Sbjct: 63  EAEIERLQNTIERLKTQLEDLERELALLQAK-----ERQLEKKLKTLEQKLKNEKEEVQR 117

Query: 426 LLLKKQQEEDRLKAEEQARLLEQERLMEELRQ 457
           L    QQ + +   E + R  E E+L E L Q
Sbjct: 118 LKNIIQQRKTQYNHELKKRDREIEKLKERLLQ 149


>gnl|CDD|212797 cd11863, SH3_CACNB, Src Homology 3 domain of Voltage-dependent
           L-type calcium channel subunit beta.  Voltage-dependent
           calcium channels (Ca(V)s) are multi-protein complexes
           that regulate the entry of calcium into cells. They
           impact muscle contraction, neuronal migration, hormone
           and neurotransmitter release, and the activation of
           calcium-dependent signaling pathways. They are composed
           of four subunits: alpha1, alpha2delta, beta, and gamma.
           The beta subunit is a soluble and intracellular protein
           that interacts with the transmembrane alpha1 subunit. It
           facilitates the trafficking and proper localization of
           the alpha1 subunit to the cellular plasma membrane.
           Vertebrates contain four different beta subunits from
           distinct genes (beta1-4); each exists as multiple splice
           variants. All are expressed in the brain while other
           tissues show more specific expression patterns. The beta
           subunits show similarity to MAGUK (membrane-associated
           guanylate kinase) proteins in that they contain SH3 and
           inactive guanylate kinase (GuK) domains; however, they
           do not appear to contain a PDZ domain. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 62

 Score = 27.6 bits (62), Expect = 2.5
 Identities = 11/45 (24%), Positives = 16/45 (35%), Gaps = 7/45 (15%)

Query: 470 YTAVALYDYQASADD-------EISFDPDDIITNIEMIDEGWWRG 507
           +       Y  S DD        +SF+  D +   E  +  WW G
Sbjct: 1   FAVRTNVGYDGSLDDDSPVPGYAVSFEAKDFLHIKEKYNNDWWIG 45


>gnl|CDD|235551 PRK05667, dnaG, DNA primase; Validated.
          Length = 580

 Score = 30.6 bits (70), Expect = 2.5
 Identities = 15/84 (17%), Positives = 38/84 (45%), Gaps = 4/84 (4%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKL 426
           +   +E   +++     +E+  ++         KE ++ ++K ++Q  +ER E+   ++ 
Sbjct: 500 DAGLEELAALLESLAVWEEISEEDIAA----LEKELKDALEKLRDQLLEERLEELIAKER 555

Query: 427 LLKKQQEEDRLKAEEQARLLEQER 450
           LL+        + E    L+E +R
Sbjct: 556 LLEGHGLSSEERLELLQLLIELKR 579


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 29.6 bits (67), Expect = 2.6
 Identities = 18/82 (21%), Positives = 37/82 (45%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
           +    ++  E +++    ++         E L+KE  RIK+   N  +  E+  + R+E 
Sbjct: 82  LAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRELREEL 141

Query: 421 EENQKLLLKKQQEEDRLKAEEQ 442
            E ++     + E +RL+  EQ
Sbjct: 142 AELKQENEALEAENERLQENEQ 163


>gnl|CDD|212917 cd11984, SH3_Shank3, Src homology 3 domain of SH3 and multiple
           ankyrin repeat domains protein 3.  Shank3, also called
           ProSAP2 (Proline-rich synapse-associated protein 2), is
           widely expressed. It plays a role in the formation of
           dendritic spines and synapses. Haploinsufficiency of the
           Shank3 gene causes the 22q13 deletion/Phelan-McDermid
           syndrome, and variants of Shank3 have been implicated in
           autism spectrum disorder, schizophrenia, and
           intellectual disability. Shank proteins carry
           scaffolding functions through multiple sites of
           protein-protein interaction in its domain architecture,
           including ankyrin (ANK) repeats, a long proline rich
           region, as well as SH3, PDZ, and SAM domains. The SH3
           domain of Shank binds GRIP, a scaffold protein that
           binds AMPA receptors and Eph receptors/ligands. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 52

 Score = 27.2 bits (60), Expect = 2.8
 Identities = 14/51 (27%), Positives = 26/51 (50%)

Query: 471 TAVALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           T +A+  Y    + EI  +  + +  + + + G+W G   G+ G FPA+ V
Sbjct: 2   TFIAVKAYSPQGEGEIQLNRGERVKVLSIGEGGFWEGTVKGRTGWFPADCV 52


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 30.3 bits (68), Expect = 2.9
 Identities = 28/140 (20%), Positives = 65/140 (46%), Gaps = 2/140 (1%)

Query: 278 KQTEEESRKRSEEEKEKRKLKDQIDLEQAQ-KLEERRLSELKIKEEEIEKKLNGHSDVPL 336
           ++ E    ++   + E  K KD ++  + + +  E++L ELK  E +  ++ N  ++  L
Sbjct: 594 EELETLKSEKENLDGELSKCKDDLEESKNKLQETEKKLEELK-SELDASQESNSLAETQL 652

Query: 337 SPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQ 396
               E+                   +  +  E+E EK+R   +E+  K  E +E++E+ +
Sbjct: 653 KCMVESYESLELRAKDLEAEHKSLQEKISSLEDELEKERQNHQELIAKCRELEEKIERAE 712

Query: 397 IRIKEEQENIKKKKEQEEKE 416
                ++ + +++K ++EKE
Sbjct: 713 QEENMQKLDEEEQKIKQEKE 732


>gnl|CDD|212781 cd11847, SH3_Brk, Src homology 3 domain of Brk (Breast tumor
           kinase) Protein Tyrosine Kinase (PTK), also called PTK6.
            Brk is a cytoplasmic (or non-receptor) PTK with limited
           homology to Src kinases. It has been found to be
           overexpressed in a majority of breast tumors. It plays
           roles in normal cell differentiation, proliferation,
           survival, migration, and cell cycle progression. Brk
           substrates include RNA-binding proteins (SLM-1/2,
           Sam68), transcription factors (STAT3/5), and signaling
           molecules (Akt, paxillin, IRS-4). Src kinases in general
           contain an N-terminal SH4 domain with a myristoylation
           site, followed by SH3 and SH2 domains, a tyr kinase
           domain, and a regulatory C-terminal region containing a
           conserved tyr; they are activated by autophosphorylation
           at the tyr kinase domain, but are negatively regulated
           by phosphorylation at the C-terminal tyr by Csk
           (C-terminal Src Kinase). However, Brk lacks the
           N-terminal myristoylation site. The SH3 domain of Src
           kinases contributes to substrate recruitment by binding
           adaptor proteins/substrates, and regulation of kinase
           activity through an intramolecular interaction. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 58

 Score = 27.5 bits (61), Expect = 2.9
 Identities = 11/33 (33%), Positives = 17/33 (51%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWW 505
            AL+D++A  D+E+SF   D     E   + W 
Sbjct: 3   KALWDFKARGDEELSFQAGDQFRIAERSGDWWT 35


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 29.6 bits (67), Expect = 2.9
 Identities = 19/103 (18%), Positives = 50/103 (48%)

Query: 365 NKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQ 424
            + E ++ ++R+ + E K   ++ + EL    I I+  +E I   +++  +  +E E+ +
Sbjct: 64  LESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLE 123

Query: 425 KLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEED 467
           K +   ++  +RL+          E  + E+R++G    ++ +
Sbjct: 124 KEIEDLKERLERLEKNLAEAEARLEEEVAEIREEGQELSSKRE 166



 Score = 28.9 bits (65), Expect = 6.0
 Identities = 18/104 (17%), Positives = 47/104 (45%), Gaps = 11/104 (10%)

Query: 363 NSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEE 422
           N   E  E E + + + ++ + E E +E  E+ + R +E+   +K ++E      + Q  
Sbjct: 44  NKALEALEIELEDL-ENQVSQLESEIQEIRERIK-RAEEKLSAVKDERELRALNIEIQIA 101

Query: 423 NQKL---------LLKKQQEEDRLKAEEQARLLEQERLMEELRQ 457
            +++         L+++ ++ ++   + + RL   E+ + E   
Sbjct: 102 KERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEA 145



 Score = 28.5 bits (64), Expect = 6.8
 Identities = 34/181 (18%), Positives = 70/181 (38%), Gaps = 24/181 (13%)

Query: 272 KFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGH 331
             + L  + +    +  E  K  +K K +++         + L  L+I+ E++E +++  
Sbjct: 11  AIQKLDLEKDRLEPRIKEIRKALKKAKAELEALN------KALEALEIELEDLENQVSQL 64

Query: 332 SDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEE 391
                          ++ I ++       +     E E +     +Q        ER   
Sbjct: 65  E------------SEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAK-----ERINS 107

Query: 392 LEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERL 451
           LE E   + EE E ++K+ E + KER E+ E      + + EE+  +  E+ + L  +R 
Sbjct: 108 LEDELAELMEEIEKLEKEIE-DLKERLERLEKNLAEAEARLEEEVAEIREEGQELSSKRE 166

Query: 452 M 452
            
Sbjct: 167 E 167


>gnl|CDD|212821 cd11888, SH3_ARHGAP9_like, Src Homology 3 domain of Rho
           GTPase-activating protein 9 and similar proteins.  This
           subfamily is composed of Rho GTPase-activating proteins
           including mammalian ARHGAP9, and vertebrate ARHGAPs 12
           and 27. RhoGAPs (or ARHGAPs) bind to Rho proteins and
           enhance the hydrolysis rates of bound GTP. ARHGAP9
           functions as a GAP for Rac and Cdc42, but not for RhoA.
           It negatively regulates cell migration and adhesion. It
           also acts as a docking protein for the MAP kinases Erk2
           and p38alpha, and may facilitate cross-talk between the
           Rho GTPase and MAPK pathways to control actin
           remodeling. ARHGAP27, also called CAMGAP1, shows GAP
           activity towards Rac1 and Cdc42. It binds the adaptor
           protein CIN85 and may play a role in clathrin-mediated
           endocytosis. ARHGAP12 has been shown to display GAP
           activity towards Rac1. It plays a role in regulating
           HFG-driven cell growth and invasiveness. ARHGAPs in this
           subfamily contain SH3, WW, Pleckstin homology (PH), and
           RhoGAP domains. SH3 domains bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs; they play a role in the regulation of
           enzymes by intramolecular interactions, changing the
           subcellular localization of signal pathway components
           and mediate multiprotein complex assemblies.
          Length = 54

 Score = 27.3 bits (61), Expect = 3.1
 Identities = 11/54 (20%), Positives = 19/54 (35%), Gaps = 4/54 (7%)

Query: 471 TAVALYD--YQASADDEISFDPDDIITNIEMIDEGWW--RGYCHGQYGLFPANY 520
             V LY   Y      ++S    +    ++  ++ WW  R     +    PA Y
Sbjct: 1   YVVVLYPFEYTGKDGRKVSIKEGERFLLLKKSNDDWWQVRRPGDSKPFYVPAQY 54


>gnl|CDD|176962 CHL00019, atpF, ATP synthase CF0 B subunit.
          Length = 184

 Score = 29.4 bits (67), Expect = 3.1
 Identities = 25/101 (24%), Positives = 45/101 (44%), Gaps = 24/101 (23%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
           I+NS    EE+ ++ + + E  R  L       + ++   E + N   + E+E       
Sbjct: 64  IRNS----EERREEAIEKLEKARARLR------QAELEADEIRVNGYSEIERE------- 106

Query: 421 EENQKLLLKKQQEED--RLKAEEQARL-LEQERLMEELRQQ 458
               K  L  Q +ED  RL+  +   +  EQ+R + ++RQQ
Sbjct: 107 ----KENLINQAKEDLERLENYKNETIRFEQQRAINQVRQQ 143


>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 366 and 597 amino
           acids in length. There is a single completely conserved
           residue R that may be functionally important.
          Length = 297

 Score = 29.9 bits (68), Expect = 3.2
 Identities = 13/61 (21%), Positives = 29/61 (47%), Gaps = 8/61 (13%)

Query: 387 ERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEE--------DRLK 438
           E++E+L+  Q R++  +  +     Q E ++   E N++  + ++Q           RL+
Sbjct: 186 EKREKLDAAQARLETLKARLDAAIAQLEAQKAALERNRQAAVAEKQARIEANDGLLARLE 245

Query: 439 A 439
           A
Sbjct: 246 A 246


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 30.1 bits (68), Expect = 3.2
 Identities = 12/63 (19%), Positives = 32/63 (50%)

Query: 360 PIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKE 419
             +  NK++  K+K +  ++++   +    E + + Q    ++    KKKK++++K++  
Sbjct: 12  QQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNL 71

Query: 420 QEE 422
            E 
Sbjct: 72  GEA 74


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 29.7 bits (67), Expect = 3.3
 Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 2/62 (3%)

Query: 389 KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQ--EEDRLKAEEQARLL 446
           K E EKE      E E  KKK+E E K++KE EE + +  K  Q  +E +L +  Q +L 
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTKLT 132

Query: 447 EQ 448
           E+
Sbjct: 133 EE 134


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 30.1 bits (68), Expect = 3.3
 Identities = 24/107 (22%), Positives = 49/107 (45%), Gaps = 15/107 (14%)

Query: 352 KQPTSDGIPIQNSNKEEEEKEKQRM--VQEEIKRKELERKEELEKEQIRIKEEQENIKKK 409
           + P S    I+     E +K   ++   +  +K ++L+  E  E+ +  +K  Q+  KK+
Sbjct: 124 ETPASAPSSIKEQKSSELKKVDSQLPDPRALLKGEDLKTVETAEEIEKSLKAMQQKWKKR 183

Query: 410 KEQEEKERKEQEENQKL-------------LLKKQQEEDRLKAEEQA 443
           K+    ++  +E  ++L             L K ++E D+LK E +A
Sbjct: 184 KKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKA 230


>gnl|CDD|227429 COG5098, COG5098, Chromosome condensation complex Condensin,
           subunit D2 [Chromatin structure and dynamics / Cell
           division and chromosome partitioning].
          Length = 1128

 Score = 30.3 bits (68), Expect = 3.3
 Identities = 12/54 (22%), Positives = 21/54 (38%)

Query: 392 LEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARL 445
           L ++ +   E Q N  K   QE     E+E  Q     + + E       ++R+
Sbjct: 431 LWEKNLGSAENQLNSLKSGLQETLCDGEKEVEQDEGQCRTELEGSFNKSAESRI 484


>gnl|CDD|224234 COG1315, COG1315, Uncharacterized conserved protein [Function
           unknown].
          Length = 543

 Score = 30.1 bits (68), Expect = 3.3
 Identities = 20/87 (22%), Positives = 35/87 (40%), Gaps = 2/87 (2%)

Query: 377 VQEEIKRKELERKEELEKEQIRIKEEQENIK--KKKEQEEKERKEQEENQKLLLKKQQEE 434
           +  EI  +  E  EE+   + R+K+  + +    K + E K+     + + LL      +
Sbjct: 404 MDPEIVERLKELTEEISLHEERLKKLTKLLVALVKVKIESKKNILPPDKESLLTAVNNTK 463

Query: 435 DRLKAEEQARLLEQERLMEELRQQGTN 461
             L+   +    E E L EEL   G  
Sbjct: 464 ITLRNSIEKIKAELEGLQEELEVVGIE 490


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 30.0 bits (68), Expect = 3.3
 Identities = 25/97 (25%), Positives = 44/97 (45%), Gaps = 12/97 (12%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELER-KEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
               +E  + ++     E+    EL+R KE+L+K        QE + K K+ EE E + Q
Sbjct: 182 DALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLL------QEIMIKVKKLEELEEELQ 235

Query: 421 EENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQ 457
           E   K+     ++    K+E    + E E+ +E+ R 
Sbjct: 236 ELESKI-----EDLTNKKSELNTEIAEAEKKLEQCRG 267



 Score = 28.4 bits (64), Expect = 8.7
 Identities = 13/60 (21%), Positives = 30/60 (50%), Gaps = 1/60 (1%)

Query: 268 NLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLE-ERRLSELKIKEEEIEK 326
             + K + L ++   + +K  E E+E ++L+ +I+    +K E    ++E + K E+   
Sbjct: 208 RAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRG 267


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 29.9 bits (68), Expect = 3.3
 Identities = 17/52 (32%), Positives = 30/52 (57%)

Query: 387 ERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLK 438
           E KEEL K    IKE+     KKK +E+K +K +++ ++    K++++   K
Sbjct: 363 ELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414


>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein.  Bacillus subtilis stage V
           sporulation protein R is involved in spore cortex
           formation. Little is known about cortex biosynthesis,
           except that it depends on several sigma E controlled
           genes, including spoVR.
          Length = 427

 Score = 29.9 bits (68), Expect = 3.4
 Identities = 21/81 (25%), Positives = 35/81 (43%), Gaps = 15/81 (18%)

Query: 371 KEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERK----EQEENQKL 426
           K   ++  EE K ++ ER+E L+ +   +       ++K E+E   RK    E +E+   
Sbjct: 170 KRPIKLSLEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKRFPEEPQEDILY 229

Query: 427 LLKKQQEEDRLKAEEQARLLE 447
            L           E+ A LLE
Sbjct: 230 FL-----------EKNAPLLE 239


>gnl|CDD|212969 cd12036, SH3_MPP5, Src Homology 3 domain of Membrane Protein,
           Palmitoylated 5 (or MAGUK p55 subfamily member 5).
           MPP5, also called PALS1 (Protein associated with Lin7)
           or Nagie oko protein in zebrafish or Stardust in
           Drosophila, is a scaffolding protein which associates
           with Crumbs homolog 1 (CRB1), CRB2, or CRB3 through its
           PDZ domain and with PALS1-associated tight junction
           protein (PATJ) or multi-PDZ domain protein 1 (MUPP1)
           through its L27 domain. The resulting tri-protein
           complexes are core proteins of the Crumb complex, which
           localizes at tight junctions or subapical regions, and
           is involved in the maintenance of apical-basal polarity
           in epithelial cells and the morphogenesis and function
           of photoreceptor cells. MPP5 is critical for the proper
           stratification of the retina and is also expressed in T
           lymphocytes where it is important for TCR-mediated
           activation of NFkB. Drosophila Stardust exists in
           several isoforms, some of which show opposing functions
           in photoreceptor cells, which suggests that the relative
           ratio of different Crumbs complexes regulates
           photoreceptor homeostasis. MPP5 contains two L27 domains
           followed by the core of three domains characteristic of
           MAGUK (membrane-associated guanylate kinase) proteins:
           PDZ, SH3, and guanylate kinase (GuK). In addition, it
           also contains the Hook (Protein 4.1 Binding) motif in
           between the SH3 and GuK domains. The GuK domain in MAGUK
           proteins is enzymatically inactive; instead, the domain
           mediates protein-protein interactions and associates
           intramolecularly with the SH3 domain. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 63

 Score = 27.4 bits (61), Expect = 3.4
 Identities = 15/46 (32%), Positives = 19/46 (41%), Gaps = 7/46 (15%)

Query: 474 ALYDYQASADDEI-------SFDPDDIITNIEMIDEGWWRGYCHGQ 512
           A +DY    D  I       SF   DI+  I   D  WW+ Y  G+
Sbjct: 4   AHFDYDPEDDPYIPCRELGLSFQKGDILHVISQEDPNWWQAYREGE 49


>gnl|CDD|237478 PRK13709, PRK13709, conjugal transfer nickase/helicase TraI;
            Provisional.
          Length = 1747

 Score = 30.1 bits (68), Expect = 3.6
 Identities = 27/149 (18%), Positives = 56/149 (37%), Gaps = 11/149 (7%)

Query: 329  NGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEK-------QRMVQEEI 381
            N  S V +  + E  P    +I        IP  +        E        QR  +E I
Sbjct: 1594 NPDSGVVVRIAGEGRPWNPGAITGGRVWGDIPDNSVQPGAGNGEPVTAEVLAQRQAEEAI 1653

Query: 382  KRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE----ENQKLLLKKQQEEDRL 437
            +R+   R +E+ ++    K +  + K ++   +   +E++      ++  L +    +  
Sbjct: 1654 RRETERRADEIVRKMAENKPDLPDGKTEQAVRDIAGQERDRAAISEREAALPESVLREPQ 1713

Query: 438  KAEEQARLLEQERLMEELRQQGTNEDTEE 466
            +  E  R + +E L+ E  QQ   +   +
Sbjct: 1714 REREAVREVARENLLRERLQQMERDMVRD 1742


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 29.3 bits (66), Expect = 3.6
 Identities = 20/102 (19%), Positives = 47/102 (46%), Gaps = 12/102 (11%)

Query: 367 EEEEKEKQRMVQEEIKRKELER------------KEELEKEQIRIKEEQENIKKKKEQEE 414
            E  + KQ++ + + + ++LE             KE+L+  +  +K  +E ++  +E  +
Sbjct: 79  GELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIK 138

Query: 415 KERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELR 456
           +  KE  E ++ L ++ +E        Q  L   +  ++EL+
Sbjct: 139 ELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQ 180



 Score = 29.0 bits (65), Expect = 4.4
 Identities = 18/83 (21%), Positives = 37/83 (44%), Gaps = 9/83 (10%)

Query: 382 KRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLL-----LKKQQEEDR 436
              +  R ++LE+ Q  + E ++ +    E +E+    +E   +L      LK+  +  R
Sbjct: 64  APVQSVRPQKLEELQGELSELKQQLS---ELQEELEDLEERIAELESELEDLKEDLQLLR 120

Query: 437 -LKAEEQARLLEQERLMEELRQQ 458
            L    + RL   E  ++EL ++
Sbjct: 121 ELLKSLEERLESLEESIKELAKE 143



 Score = 28.2 bits (63), Expect = 7.9
 Identities = 19/109 (17%), Positives = 50/109 (45%), Gaps = 1/109 (0%)

Query: 360 PIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKE 419
           P+Q+   ++ E+ +  + + + +  EL+ + E  +E+I   E +    K+  Q  +E  +
Sbjct: 65  PVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLK 124

Query: 420 QEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEEDL 468
             E ++L   ++  ++  K   + R   +E + E   +    ++  + L
Sbjct: 125 SLE-ERLESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRL 172


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
           protein 2.  THO and TREX form a eukaryotic complex which
           functions in messenger ribonucleoprotein metabolism and
           plays a role in preventing the transcription-associated
           genetic instability. Tho2, along with four other
           subunits forms THO.
          Length = 296

 Score = 29.6 bits (67), Expect = 3.7
 Identities = 11/69 (15%), Positives = 35/69 (50%), Gaps = 7/69 (10%)

Query: 393 EKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLM 452
           +KE  R++++ + +       +K++KE++  + L+        +L+ E +  +   E+  
Sbjct: 30  DKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLI-------KKLEEELKKHIEHNEKTK 82

Query: 453 EELRQQGTN 461
           + L ++ ++
Sbjct: 83  KRLSEEKSS 91


>gnl|CDD|224958 COG2047, COG2047, Uncharacterized protein (ATP-grasp superfamily)
           [General function prediction only].
          Length = 258

 Score = 29.7 bits (67), Expect = 3.8
 Identities = 15/53 (28%), Positives = 28/53 (52%), Gaps = 5/53 (9%)

Query: 389 KEELEK--EQIRIKEEQENIKKKKEQEEK---ERKEQEENQKLLLKKQQEEDR 436
           K  LE   + + I+ + E ++++ ++ E+   + KE EE Q+    K  EED 
Sbjct: 202 KAVLEVLCKMLGIEIDMEALEERAKEMEEIIEKLKEMEEMQEAQEPKPGEEDL 254


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 30.0 bits (67), Expect = 3.8
 Identities = 29/162 (17%), Positives = 66/162 (40%), Gaps = 5/162 (3%)

Query: 287 RSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVP 346
           R+ E  +  +  D   +E+  K  +++  + K KE + +KK        L  + +  P  
Sbjct: 179 RNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPAS 238

Query: 347 VKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENI 406
             S+ +   +      +    + E ++ +  + E  +K  + K++ ++     KE++E  
Sbjct: 239 AASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQR-----KEKEEKK 293

Query: 407 KKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQ 448
           KKKK    +        ++ +     EE+ L      +LL +
Sbjct: 294 KKKKHHHHRCHHSDGGAEQPVQNGAVEEEPLPPMSSYKLLAE 335


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 29.9 bits (68), Expect = 3.9
 Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 7/69 (10%)

Query: 367 EEEEKEKQRMVQEEI-KRKEL-----ERKEELEKEQIRIKEEQENIKKK-KEQEEKERKE 419
           +EE ++  R ++E   +R EL        +  E +   +  E + +K+K KE E    + 
Sbjct: 35  DEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEKLKELEAALDEL 94

Query: 420 QEENQKLLL 428
           + E   LLL
Sbjct: 95  EAELDTLLL 103


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 28.9 bits (65), Expect = 3.9
 Identities = 23/86 (26%), Positives = 54/86 (62%), Gaps = 3/86 (3%)

Query: 382 KRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEE 441
           +R+E ER E ++++    K+E E+ + ++++EEK+RK++E+  K   K+Q+++ + K ++
Sbjct: 45  RRREYERLELMDEK---WKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKK 101

Query: 442 QARLLEQERLMEELRQQGTNEDTEED 467
           +A+   ++   E  +    + D EE+
Sbjct: 102 KAKKGNKKEEKEGSKSSEESSDEEEE 127


>gnl|CDD|217957 pfam04196, Bunya_RdRp, Bunyavirus RNA dependent RNA polymerase.
           The bunyaviruses are enveloped viruses with a genome
           consisting of 3 ssRNA segments (called L, M and S). The
           nucleocapsid protein is encode on the small (S) genomic
           RNA. The L segment codes for an RNA polymerase. This
           family contains the RNA dependent RNA polymerase on the
           L segment.
          Length = 735

 Score = 30.1 bits (68), Expect = 4.0
 Identities = 36/201 (17%), Positives = 63/201 (31%), Gaps = 29/201 (14%)

Query: 258 KPNIGHVKPSNLRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEER----R 313
           K    H + + L      L K+  E  RK  E+E +    +D ++ +    L        
Sbjct: 165 KNKGLHGEVNVLAK----LLKKILEWERKFREKEPKYLGYEDPVNGDIKVHLLSESYLKD 220

Query: 314 LSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEK 373
           L    +    +E +  G        ST       K +L++     +    SN  +   E 
Sbjct: 221 LCRHALLRLRLEYQ-EGFCAPITELSTLESS---KLLLERIEKSAVLKALSNFSKHWYEP 276

Query: 374 Q----------RMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEEN 423
           +          + +  E+        + +E  +    +   N  K    + KE  E+   
Sbjct: 277 KGSDLRNISELKEMSLEVGEDNYHLSKVIETIETIRFDNLYNFNKTLVLKLKEESEELIE 336

Query: 424 QK------LLLKKQQEE-DRL 437
                    L KKQQ   DR 
Sbjct: 337 MAKKCMHICLFKKQQTGGDRE 357


>gnl|CDD|212968 cd12035, SH3_MPP1-like, Src Homology 3 domain of Membrane Protein,
           Palmitoylated 1 (or MAGUK p55 subfamily member 1)-like
           proteins.  This subfamily includes MPP1, CASK
           (Calcium/calmodulin-dependent Serine protein Kinase),
           Caenorhabditis elegans lin-2, and similar proteins. MPP1
           and CASK are scaffolding proteins from the MAGUK
           (membrane-associated guanylate kinase) protein family,
           which is characterized by the presence of a core of
           three domains: PDZ, SH3, and guanylate kinase (GuK). In
           addition, they also have the Hook (Protein 4.1 Binding)
           motif in between the SH3 and GuK domains. The GuK domain
           in MAGUK proteins is enzymatically inactive; instead,
           the domain mediates protein-protein interactions and
           associates intramolecularly with the SH3 domain. CASK
           and lin-2 also contain an N-terminal
           calmodulin-dependent kinase (CaMK)-like domain and two
           L27 domains. MPP1 is ubiquitously-expressed and plays
           roles in regulating neutrophil polarity, cell shape,
           hair cell development, and neural development and
           patterning of the retina. CASK is highly expressed in
           the mammalian nervous system and plays roles in synaptic
           protein targeting, neural development, and gene
           expression regulation. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 62

 Score = 27.0 bits (60), Expect = 4.0
 Identities = 14/41 (34%), Positives = 18/41 (43%), Gaps = 7/41 (17%)

Query: 474 ALYDYQASADDEI-------SFDPDDIITNIEMIDEGWWRG 507
           A +DY  S DD I       +F   DI+  I   D  WW+ 
Sbjct: 4   AQFDYDPSKDDLIPCQQAGIAFKTGDILQIISKDDHNWWQA 44


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 28.8 bits (65), Expect = 4.0
 Identities = 19/87 (21%), Positives = 48/87 (55%), Gaps = 4/87 (4%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
           + + K+ E++ K+   + + K KEL+++E+  ++Q     E+    K++E ++K+++ Q+
Sbjct: 34  KAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQ 93

Query: 422 ENQKLLLKKQQEEDRLKAEEQARLLEQ 448
           + Q      QQE  + + E    + ++
Sbjct: 94  KQQAA----QQELQQKQQELLQPIYDK 116


>gnl|CDD|115072 pfam06391, MAT1, CDK-activating kinase assembly factor MAT1.  MAT1
           is an assembly/targeting factor for cyclin-dependent
           kinase-activating kinase (CAK), which interacts with the
           transcription factor TFIIH. The domain found to the
           N-terminal side of this domain is a C3HC4 RING finger.
          Length = 200

 Score = 29.3 bits (66), Expect = 4.0
 Identities = 27/102 (26%), Positives = 55/102 (53%), Gaps = 8/102 (7%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
           +  NK+   + K+R+ +E+   +ELE+  E EKE    KEE+    +K+EQE+K  KE++
Sbjct: 74  EKENKDSIMRNKRRLTREQ---EELEQALEEEKEM---KEEKRLHLQKEEQEQKMAKEKD 127

Query: 422 ENQKL--LLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTN 461
           + + +  L       + + A+ + +  + E  +E+L ++   
Sbjct: 128 KQEIIDELETSNLPANVIIAQHKKQSKQLESQVEKLERKKRV 169


>gnl|CDD|191675 pfam07047, OPA3, Optic atrophy 3 protein (OPA3).  This family
           consists of several optic atrophy 3 (OPA3) proteins.
           OPA3 deficiency causes type III 3-methylglutaconic
           aciduria (MGA) in humans. This disease manifests with
           early bilateral optic atrophy, spasticity,
           extrapyramidal dysfunction, ataxia, and cognitive
           deficits, but normal longevity.
          Length = 134

 Score = 28.4 bits (64), Expect = 4.1
 Identities = 14/41 (34%), Positives = 27/41 (65%), Gaps = 1/41 (2%)

Query: 372 EKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQ 412
           E QR  ++E  +KE E ++ELE+ + R+ E +  I+++K +
Sbjct: 95  EYQRQSRKE-AKKEEELQQELEELEARVGELELEIERQKAR 134



 Score = 27.6 bits (62), Expect = 9.3
 Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 4/40 (10%)

Query: 363 NSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEE 402
            S KE +++E+ +   EE++     R  ELE E  R K  
Sbjct: 99  QSRKEAKKEEELQQELEELE----ARVGELELEIERQKAR 134


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 29.4 bits (66), Expect = 4.1
 Identities = 13/73 (17%), Positives = 29/73 (39%), Gaps = 7/73 (9%)

Query: 362 QNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
            N +K   E   +   Q++   K   +     +   R+K+     KK+++++ K++K   
Sbjct: 23  ANKDKGNPEIYNKINSQDKAIEK--FKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK--- 77

Query: 422 ENQKLLLKKQQEE 434
               L +   Q  
Sbjct: 78  --VPLQVNPAQLF 88


>gnl|CDD|212841 cd11908, SH3_ITK, Src Homology 3 domain of Interleukin-2-inducible
           T-cell Kinase.  ITK (also known as Tsk or Emt) is a
           cytoplasmic (or nonreceptor) tyr kinase containing Src
           homology protein interaction domains (SH3, SH2)
           N-terminal to the catalytic tyr kinase domain. It also
           contains an N-terminal pleckstrin homology (PH) domain,
           which binds the products of PI3K and allows membrane
           recruitment and activation, and the Tec homology (TH)
           domain, which contains proline-rich and zinc-binding
           regions. ITK is expressed in T-cells and mast cells, and
           is important in their development and differentiation.
           Of the three Tec kinases expressed in T-cells, ITK plays
           the predominant role in T-cell receptor (TCR) signaling.
           It is activated by phosphorylation upon TCR crosslinking
           and is involved in the pathway resulting in
           phospholipase C-gamma1 activation and actin
           polymerization. It also plays a role in the downstream
           signaling of the T-cell costimulatory receptor CD28, the
           T-cell surface receptor CD2, and the chemokine receptor
           CXCR4. In addition, ITK is crucial for the development
           of T-helper(Th)2 effector responses. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 56

 Score = 26.9 bits (59), Expect = 4.2
 Identities = 14/50 (28%), Positives = 27/50 (54%), Gaps = 1/50 (2%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGY-CHGQYGLFPANYV 521
           +ALYDYQ +   E++   ++    ++  +  WWR    +G  G  P++Y+
Sbjct: 4   IALYDYQTNDPQELALRYNEEYHLLDSSEIHWWRVQDKNGHEGYVPSSYL 53


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 29.6 bits (67), Expect = 4.4
 Identities = 25/97 (25%), Positives = 52/97 (53%), Gaps = 10/97 (10%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEK---EQIRIKEEQEN----IKKKKEQE 413
           ++   KEEE +EK     ++I  K  E  EE+ K    + +++EE+E     ++K  +  
Sbjct: 96  LEREQKEEEVREKH---NKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQMEILEKALKSY 152

Query: 414 EKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQER 450
            K  KE+ ++ + L K  Q+E   + +++ +++E+ R
Sbjct: 153 LKIVKEENKSLQRLAKALQKESEERTQDETKMIEEYR 189



 Score = 29.6 bits (67), Expect = 4.9
 Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 6/64 (9%)

Query: 385 ELERK-EELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQA 443
            L+RK +ELE+EQ   KEE+   K  K+  EK  ++ EE  K +  K + +   + E+Q 
Sbjct: 88  GLQRKLKELEREQ---KEEEVREKHNKKIIEKFGEDLEEVYKFM--KGEAKVEEEEEKQM 142

Query: 444 RLLE 447
            +LE
Sbjct: 143 EILE 146


>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 29.0 bits (65), Expect = 4.4
 Identities = 18/78 (23%), Positives = 38/78 (48%), Gaps = 1/78 (1%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
            + + + E EK    +      +    +KE LEK +  + E  E +K  ++++ + +  +
Sbjct: 49  KKAAAQSEIEKFCANIADAAADQLYALQKELLEKLK-DLAEVNERLKALEKKKAELKDLE 107

Query: 421 EENQKLLLKKQQEEDRLK 438
           EE + +L  KQ E+ +L 
Sbjct: 108 EEREGILRSKQAEDGKLV 125


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 29.3 bits (65), Expect = 4.4
 Identities = 16/47 (34%), Positives = 26/47 (55%), Gaps = 3/47 (6%)

Query: 406 IKKKKEQ---EEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQE 449
           I K+KE+   EEKER+EQ   Q   + +Q E+  ++ +    +  QE
Sbjct: 94  IVKRKEKRLLEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQE 140


>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
           ArcB; Provisional.
          Length = 779

 Score = 29.9 bits (68), Expect = 4.4
 Identities = 23/88 (26%), Positives = 46/88 (52%), Gaps = 19/88 (21%)

Query: 367 EEEEKEKQRMVQ--EEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQ 424
           EE  +   R+V   EE++ ++LE   +L          ++NI +   QE  ER++ EE +
Sbjct: 81  EESRQRLSRLVAKLEEMRERDLELNVQL----------KDNIAQLN-QEIAEREKAEEAR 129

Query: 425 KLLLKK------QQEEDRLKAEEQARLL 446
           +   ++      ++EE +++ E+Q+ LL
Sbjct: 130 QEAFEQLKNEIKEREETQIELEQQSSLL 157


>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
           secretion].
          Length = 407

 Score = 29.6 bits (67), Expect = 4.4
 Identities = 10/62 (16%), Positives = 26/62 (41%), Gaps = 7/62 (11%)

Query: 274 ENLAKQTEEE--SRKRSEEEKEKRKLKDQIDLEQ-----AQKLEERRLSELKIKEEEIEK 326
              AK+ E    S    + ++ K   ++  ++ +      ++LEE     + + ++EI+ 
Sbjct: 91  RKAAKKIERSTPSLIERKTQEVKDSGEEIAEMMRDEKVPIRELEEIPPEFVALYKQEIQS 150

Query: 327 KL 328
             
Sbjct: 151 PT 152


>gnl|CDD|227453 COG5124, COG5124, Protein predicted to be involved in meiotic
           recombination [Cell division and chromosome partitioning
           / General function prediction only].
          Length = 209

 Score = 29.2 bits (65), Expect = 4.4
 Identities = 23/102 (22%), Positives = 50/102 (49%), Gaps = 12/102 (11%)

Query: 347 VKSILKQPTSDGI----PIQNSNK----EEEEKEKQRMVQEEIKRKELERKEELEKEQIR 398
           VK +L+Q   DG+        SN     + +  +K     E +K    ++ +E++++   
Sbjct: 45  VKDLLQQLVDDGVVSVEKCGTSNIYWSFKSQTLQKLYDSSELLK----KKIQEVKQDIAT 100

Query: 399 IKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAE 440
            KEE +  K  + ++  E ++    + LL K+++E+D +K +
Sbjct: 101 YKEEIDKEKATRRKKFTEGQKNYNREALLEKRKKEQDEIKKK 142


>gnl|CDD|233686 TIGR02016, BchX, chlorophyllide reductase iron protein subunit X.
           This model represents the X subunit of the three-subunit
           enzyme, (bacterio)chlorophyllide reductase. This enzyme
           is responsible for the reduction of the chlorin B-ring
           and is closely related to the protochlorophyllide
           reductase complex which reduces the D-ring. Both of
           these complexes in turn are homologous to nitrogenase.
           This subunit is homologous to the nitrogenase component
           II, or "iron" protein [Energy metabolism,
           Photosynthesis].
          Length = 296

 Score = 29.4 bits (66), Expect = 4.5
 Identities = 14/53 (26%), Positives = 20/53 (37%), Gaps = 7/53 (13%)

Query: 273 FENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELK-IKEEEI 324
           FE LA    +    R       R L     L   + L E R+ +L    EE++
Sbjct: 241 FEELAGNVADAPPLR------PRPLSPDALLALFETLPETRVVDLVGATEEDL 287


>gnl|CDD|219951 pfam08644, SPT16, FACT complex subunit (SPT16/CDC68).  Proteins in
           this family are subunits the FACT complex. The FACT
           complex plays a role in transcription initiation and
           promotes binding of TATA-binding protein (TBP) to a TATA
           box in chromatin.
          Length = 152

 Score = 28.7 bits (65), Expect = 4.7
 Identities = 17/40 (42%), Positives = 26/40 (65%), Gaps = 1/40 (2%)

Query: 399 IKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLK 438
           IKE Q+ +KK+ E E KE+ +  E +KL+L + +   RLK
Sbjct: 87  IKELQKRVKKR-ETERKEKADVVEQEKLILNRNKRPPRLK 125


>gnl|CDD|237561 PRK13923, PRK13923, putative spore coat protein regulator protein
           YlbO; Provisional.
          Length = 170

 Score = 28.6 bits (64), Expect = 4.7
 Identities = 24/102 (23%), Positives = 41/102 (40%), Gaps = 1/102 (0%)

Query: 320 KEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQE 379
           KE +  ++  G S   L  + +T    + S +   T + +  +   K +EE+EK     +
Sbjct: 69  KERKELRRQLGFSPSNLPDNVKTGDEIITSGISDLTLEDVLSEQIGKLQEEEEKLSWENQ 128

Query: 380 EIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE 421
            +K+ EL   EE  +  I I      +    E E K     E
Sbjct: 129 TLKQ-ELAITEEDYRALIVIMNRARRMAILVEDERKSLAAFE 169


>gnl|CDD|213013 cd12080, SH3_MPP1, Src Homology 3 domain of Membrane Protein,
           Palmitoylated 1 (or MAGUK p55 subfamily member 1).
           MPP1, also called 55 kDa erythrocyte membrane protein
           (p55), is a ubiquitously-expressed scaffolding protein
           that plays roles in regulating neutrophil polarity, cell
           shape, hair cell development, and neural development and
           patterning of the retina. It was originally identified
           as an erythrocyte protein that stabilizes the actin
           cytoskeleton to the plasma membrane by forming a complex
           with 4.1R protein and glycophorin C. MPP1 is one of
           seven vertebrate homologs of the Drosophila Stardust
           protein, which is required in establishing cell
           polarity, and it contains the three domains
           characteristic of MAGUK (membrane-associated guanylate
           kinase) proteins: PDZ, SH3, and guanylate kinase (GuK).
           In addition, it also contains the Hook (Protein 4.1
           Binding) motif in between the SH3 and GuK domains. The
           GuK domain in MAGUK proteins is enzymatically inactive;
           instead, the domain mediates protein-protein
           interactions and associates intramolecularly with the
           SH3 domain. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 62

 Score = 26.8 bits (59), Expect = 4.8
 Identities = 18/56 (32%), Positives = 22/56 (39%), Gaps = 11/56 (19%)

Query: 474 ALYDYQASADDEI-------SFDPDDIITNIEMIDEGWWRGYCHGQ----YGLFPA 518
           A +DY    D+ I        F   DII  I   D  WW+G   G      GL P+
Sbjct: 4   AQFDYDPKKDNLIPCKEAGLKFQTGDIIQIINKDDSNWWQGRVEGSGEESAGLIPS 59


>gnl|CDD|235239 PRK04172, pheS, phenylalanyl-tRNA synthetase subunit alpha;
           Provisional.
          Length = 489

 Score = 29.4 bits (67), Expect = 4.9
 Identities = 26/129 (20%), Positives = 51/129 (39%), Gaps = 30/129 (23%)

Query: 310 EERRLSELKIKEE----EIEKKLNGHSDVP------------------LSPSTETPPVPV 347
           E R L+ LK   E    E+++ L    +V                   +    +    P 
Sbjct: 76  ERRLLNALKDGGEVSLDELKEALLDKKEVGIALGNLARKGWAKIEKGKVILKPDAYEDPE 135

Query: 348 KSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIK 407
           +  LK        +   +KEE  +E  ++++E  KRK +E KE  E+  + + +    + 
Sbjct: 136 EKALKA-------LAEGDKEELSEEDLKVLKELKKRKLVEEKERTERS-VELTDAGLELL 187

Query: 408 KKKEQEEKE 416
           K+  + ++E
Sbjct: 188 KEGIELKEE 196


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 28.4 bits (64), Expect = 4.9
 Identities = 13/37 (35%), Positives = 21/37 (56%)

Query: 368 EEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQE 404
           +E  E QR   EE K  E E K + E+ + R +E+++
Sbjct: 90  DERAEAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 29.6 bits (67), Expect = 5.0
 Identities = 26/126 (20%), Positives = 59/126 (46%), Gaps = 19/126 (15%)

Query: 378 QEEIKRKELERKEELEKEQI-RIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDR 436
           +EE+ ++E ER ++LE E++ R++ E+E+     ++EE++ KE  ++         +++ 
Sbjct: 260 EEELAKEEAERLKKLEAERLRRMRGEEED-----DEEEEDSKESADD--------LDDEF 306

Query: 437 LKAEEQARLLEQERLMEELRQQGTNEDTEEDLGYTAVALYDYQASADDEISFDPDDIITN 496
              ++    L Q    EE  + G +++ EED               D ++S + +D    
Sbjct: 307 EPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEE-----EEEDVDLSDEEEDEEDE 361

Query: 497 IEMIDE 502
               ++
Sbjct: 362 DSDDED 367


>gnl|CDD|234801 PRK00591, prfA, peptide chain release factor 1; Validated.
          Length = 359

 Score = 29.3 bits (67), Expect = 5.1
 Identities = 16/34 (47%), Positives = 24/34 (70%), Gaps = 2/34 (5%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRI 399
           +EE + E + M +EE+K  E ER EELE E+++I
Sbjct: 69  EEESDPEMREMAKEELKELE-ERLEELE-EELKI 100


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 29.2 bits (66), Expect = 5.1
 Identities = 14/48 (29%), Positives = 29/48 (60%)

Query: 388 RKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEED 435
           +  ELEK + R +E +E ++K  E+ + E  +++  + + + K+Q ED
Sbjct: 95  KSGELEKRRERREEAEEELEKAIEEGDDEEIKKQSKRTVRVTKEQNED 142


>gnl|CDD|213007 cd12074, SH3_Tks5_1, First Src homology 3 domain of Tyrosine kinase
           substrate with five SH3 domains.  Tks5, also called SH3
           and PX domain-containing protein 2A (SH3PXD2A) or Five
           SH (FISH), is a scaffolding protein and Src substrate
           that is localized in podosomes, which are electron-dense
           structures found in Src-transformed fibroblasts,
           osteoclasts, macrophages, and some invasive cancer
           cells. It binds and regulates some members of the ADAMs
           family of transmembrane metalloproteases, which function
           as sheddases and mediators of cell and matrix
           interactions. It is required for podosome formation,
           degradation of the extracellular matrix, and cancer cell
           invasion. Tks5 contains an N-terminal Phox homology (PX)
           domain and five SH3 domains. This model characterizes
           the first SH3 domain of Tks5. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 53

 Score = 26.6 bits (58), Expect = 5.2
 Identities = 14/49 (28%), Positives = 24/49 (48%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHGQYGLFPANYV 521
           V + +Y+   + EIS    +++  IE  + GWW      + G  PA Y+
Sbjct: 3   VVVSNYEKQENSEISLQAGEVVDVIEKNESGWWFVSTAEEQGWVPATYL 51


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 28.8 bits (65), Expect = 5.3
 Identities = 15/66 (22%), Positives = 37/66 (56%)

Query: 367 EEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKL 426
           +  EKE + + +   K K+ E KEEL++    +K   + +K K  + E  ++ +++ ++L
Sbjct: 59  DYREKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKEL 118

Query: 427 LLKKQQ 432
           + + ++
Sbjct: 119 IKEGKK 124


>gnl|CDD|212726 cd11792, SH3_Fut8, Src homology 3 domain of
           Alpha1,6-fucosyltransferase (Fut8).  Fut8 catalyzes the
           alpha1,6-linkage of a fucose residue from a donor
           substrate to N-linked oligosaccharides on glycoproteins
           in a process called core fucosylation, which is crucial
           for growth factor receptor-mediated biological
           functions. Fut8-deficient mice show severe growth
           retardation, early death, and a pulmonary emphysema-like
           phenotype. Fut8 is also implicated to play roles in
           aging and cancer metastasis. It contains an N-terminal
           coiled-coil domain, a catalytic domain, and a C-terminal
           SH3 domain. The SH3 domain of Fut8 is located in the
           lumen and its role in glycosyl transfer is unclear. SH3
           domains are protein interaction domains that bind to
           proline-rich ligands with moderate affinity and
           selectivity, preferentially to PxxP motifs. They play
           versatile and diverse roles in the cell including the
           regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 55

 Score = 26.4 bits (59), Expect = 5.4
 Identities = 16/52 (30%), Positives = 23/52 (44%), Gaps = 10/52 (19%)

Query: 473 VALYDYQASADDEISFDPDDIITNIEMIDEGWWRGYCHG------QYGLFPA 518
           VA+Y ++    DEI     DII     +    W GY  G      + GL+P+
Sbjct: 3   VAIYPHKPRNHDEIELRVGDIIG----VAGNHWDGYSKGRNRRTGKTGLYPS 50


>gnl|CDD|217689 pfam03717, PBP_dimer, Penicillin-binding Protein dimerisation
           domain.  This domain is found at the N terminus of Class
           B High Molecular Weight Penicillin-Binding Proteins. Its
           function has not been precisely defined, but is strongly
           implicated in PBP polymerisation. The domain forms a
           largely disordered 'sugar tongs' structure.
          Length = 167

 Score = 28.4 bits (64), Expect = 5.4
 Identities = 18/72 (25%), Positives = 32/72 (44%), Gaps = 11/72 (15%)

Query: 264 VKPSNLRAKFENLAKQ-------TEEESRKRSEEEKEKRK----LKDQIDLEQAQKLEER 312
           V P  +  K    A++         EE  K+ +E+  K      LK  +  E+A +++E 
Sbjct: 27  VDPKEVEDKKAEEAEKLAKILGLDPEEILKKLKEKDAKSYEPVVLKRNLSPEEAARIKEL 86

Query: 313 RLSELKIKEEEI 324
           +L  + I+EE  
Sbjct: 87  KLPGVSIEEEPK 98


>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
           (Spore_III_AF).  This family represents the stage III
           sporulation protein AF (Spore_III_AF) of the bacterial
           endospore formation program, which exists in some but
           not all members of the Firmicutes (formerly called
           low-GC Gram-positives). The C-terminal region of these
           proteins is poorly conserved.
          Length = 185

 Score = 28.8 bits (65), Expect = 5.8
 Identities = 27/133 (20%), Positives = 60/133 (45%), Gaps = 14/133 (10%)

Query: 272 KFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGH 331
           + E+  K+ +   R    EE  K+ L+ Q++ ++ ++    ++ +++++ +E  +  N  
Sbjct: 64  EIESKKKEIQASQRAYILEEYAKQ-LEKQVE-KKLKEEYGVKVKDVEVEIDEDLESNNFD 121

Query: 332 ---SDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELER 388
               +V L   ++      +    +     I  Q S  +EEE+E +    E+IK   L  
Sbjct: 122 IKEVNVTLKEESK------EKQKSKVEPVVIDTQTSKPKEEEEESE--EAEKIKN-FLAD 172

Query: 389 KEELEKEQIRIKE 401
           +  + KEQI +  
Sbjct: 173 EYNIPKEQIEVYI 185


>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
           This family consists of the N-terminal region of the
           prokaryotic fibronectin-binding protein. Fibronectin
           binding is considered to be an important virulence
           factor in streptococcal infections. Fibronectin is a
           dimeric glycoprotein that is present in a soluble form
           in plasma and extracellular fluids; it is also present
           in a fibrillar form on cell surfaces. Both the soluble
           and cellular forms of fibronectin may be incorporated
           into the extracellular tissue matrix. While fibronectin
           has critical roles in eukaryotic cellular processes,
           such as adhesion, migration and differentiation, it is
           also a substrate for the attachment of bacteria. The
           binding of pathogenic Streptococcus pyogenes and
           Staphylococcus aureus to epithelial cells via
           fibronectin facilitates their internalisation and
           systemic spread within the host.
          Length = 447

 Score = 29.1 bits (66), Expect = 5.8
 Identities = 40/181 (22%), Positives = 78/181 (43%), Gaps = 50/181 (27%)

Query: 270 RAKFENLAKQTEEESRKRSEEEKEKRKLKDQID-LEQAQKLEERR---------LSELKI 319
           R + +      E+  R  +E EK ++KL+     LE+A+  E  R         L  +K 
Sbjct: 287 RDRVKQKRSDLEK--RVENELEKLEKKLEKLEKELEEAENAENYRLYGELLTANLYAIKK 344

Query: 320 KEEEIEKK---LNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNS----NKEEEEKE 372
             +E+E           +PL P+        KS          P +N+     K ++ K 
Sbjct: 345 GMKEVELPNYYTGEEITIPLDPA--------KS----------PSENAQKYYKKYKKLKR 386

Query: 373 KQRMVQEEIK--RKELERKEELE-----------KEQIRIKEEQENIKKKKEQEEKERKE 419
               V+E+I+  ++E+E  E +E            E+IR +  ++   K+K++++K++K+
Sbjct: 387 AVEAVEEQIEETKEEIEYLESVEAQLENAESLEDLEEIREELIEQGYLKEKKRKKKKKKK 446

Query: 420 Q 420
           +
Sbjct: 447 K 447



 Score = 29.1 bits (66), Expect = 6.1
 Identities = 19/86 (22%), Positives = 36/86 (41%), Gaps = 6/86 (6%)

Query: 411 EQEEKERKEQEENQKL-LLKKQQEEDRLKAEEQARLLEQERLMEELRQQG----TNEDTE 465
           ++ E++R +Q+ +     ++ + E+   K E+  + LE+    E  R  G     N    
Sbjct: 283 KKAERDRVKQKRSDLEKRVENELEKLEKKLEKLEKELEEAENAENYRLYGELLTANLYAI 342

Query: 466 EDLGYTAVALYDYQASADDEISFDPD 491
           +  G   V L +Y    +  I  DP 
Sbjct: 343 KK-GMKEVELPNYYTGEEITIPLDPA 367


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 28.2 bits (63), Expect = 6.0
 Identities = 12/62 (19%), Positives = 28/62 (45%), Gaps = 1/62 (1%)

Query: 350 ILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQI-RIKEEQENIKK 408
           I ++ +S     + + +E  ++ +Q    E     + ++K E+ K +  + K + E  K 
Sbjct: 87  IFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146

Query: 409 KK 410
           K 
Sbjct: 147 KP 148


>gnl|CDD|236914 PRK11466, PRK11466, hybrid sensory histidine kinase TorS;
           Provisional.
          Length = 914

 Score = 29.5 bits (66), Expect = 6.0
 Identities = 18/61 (29%), Positives = 28/61 (45%), Gaps = 15/61 (24%)

Query: 283 ESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKE--------------EEIEKKL 328
           E  +R   E    +L D IDLE   ++ E RLS L++++                +EK+L
Sbjct: 178 EQDQRQAAESALDRLID-IDLEYVNQMNELRLSALRVQQMVMNLGLEQIQKNAPTLEKQL 236

Query: 329 N 329
           N
Sbjct: 237 N 237


>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein.  This family consists of several
           eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
           to suppress GCR2 and is highly expressed in the muscle
           and heart. The function of this family is unknown
           although it has been speculated that SGT1 may be
           functionally analogous to the Gcr2p protein of
           Saccharomyces cerevisiae which is known to be a
           regulatory factor of glycolytic gene expression.
          Length = 557

 Score = 29.3 bits (66), Expect = 6.0
 Identities = 27/155 (17%), Positives = 54/155 (34%), Gaps = 10/155 (6%)

Query: 347 VKSILKQPTSDGI---PIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQ 403
             S+L+     G     ++ S + +E  EK             E   +L +  +    ++
Sbjct: 308 WSSLLESLKEKGYFKDNLEGSEEYKELLEKAEAYFSSTASSPGEEILDLLQLDLEELRDE 367

Query: 404 ENIKKKKEQEE-----KERKEQEENQKLLLKKQQEEDRLKAEEQARLLE-QERLMEELRQ 457
           E     ++ +       +  + E  ++   KK  + ++  A E   L E    + E L +
Sbjct: 368 EQDLPPEDDDSWLNISPDELDSELQERQGDKKDLKSNKEDANEVDDLEEVVSSMEEFLNK 427

Query: 458 QGTNEDTEEDLGYTAVALYDYQASADDEISFDPDD 492
             + E  E               S D ++SFD D+
Sbjct: 428 VSSFEGAEFADDEDEDDDEP-DDSEDKDVSFDEDE 461


>gnl|CDD|221232 pfam11802, CENP-K, Centromere-associated protein K.  CENP-K is one
           of seven new CENP-A-nucleosome distal (CAD) centromere
           components (the others being CENP-L, CENP-O, CENP-P,
           CENP-Q, CENP-R and CENP-S) that are identified as
           assembling on the CENP-A nucleosome associated complex,
           NAC. The CENP-A NAC is essential, as disruption of the
           complex causes errors of chromosome alignment and
           segregation that preclude cell survival despite
           continued centromere-derived mitotic checkpoint
           signalling. CENP-K is centromere-associated through its
           interaction with one or more components of the CENP-A
           NAC.
          Length = 273

 Score = 29.1 bits (65), Expect = 6.0
 Identities = 26/92 (28%), Positives = 46/92 (50%), Gaps = 7/92 (7%)

Query: 366 KEEEEKEKQ--RMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEEN 423
           KEE +K +Q   MV   ++ K  + KE+LE+EQ +  +EQ+ I         ER+ + +N
Sbjct: 96  KEELQKLRQDLEMVLSSVQSKNEKLKEDLEREQ-QWLDEQQQILDSL----NERQSELKN 150

Query: 424 QKLLLKKQQEEDRLKAEEQARLLEQERLMEEL 455
           Q +   + +    LK +       +E+L+  L
Sbjct: 151 QVVTFSESRIFQELKTKMLRIKEYKEKLLSAL 182


>gnl|CDD|219111 pfam06625, DUF1151, Protein of unknown function (DUF1151).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 122

 Score = 27.8 bits (62), Expect = 6.0
 Identities = 16/41 (39%), Positives = 24/41 (58%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENI 406
           KEEEE ++ +   E    K  ++ E+LE E+ +  EEQE I
Sbjct: 70  KEEEEAKRLQSELERELMKRAQKLEQLELEKAKEDEEQEQI 110



 Score = 27.8 bits (62), Expect = 7.0
 Identities = 18/62 (29%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 382 KRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEE 441
           ++ EL+R  E  K    +KE++E  + K+ Q E ER+  +  QK L + + E+ +   E+
Sbjct: 49  QKPELQRVLEKRKRDQVLKEQKEEEEAKRLQSELERELMKRAQK-LEQLELEKAKEDEEQ 107

Query: 442 QA 443
           + 
Sbjct: 108 EQ 109



 Score = 27.4 bits (61), Expect = 9.2
 Identities = 21/71 (29%), Positives = 46/71 (64%), Gaps = 2/71 (2%)

Query: 386 LERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARL 445
           L +K EL++   + K +Q  + K++++EE+ ++ Q E ++ L+K+ Q+ ++L+ E+    
Sbjct: 47  LNQKPELQRVLEKRKRDQ--VLKEQKEEEEAKRLQSELERELMKRAQKLEQLELEKAKED 104

Query: 446 LEQERLMEELR 456
            EQE++ E L+
Sbjct: 105 EEQEQIPEFLK 115


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 29.3 bits (66), Expect = 6.2
 Identities = 40/183 (21%), Positives = 73/183 (39%), Gaps = 34/183 (18%)

Query: 274 ENLAKQTEEESRKRSEEEKEKRKLKDQID-LEQ--AQKLEERRLSELKIKE--------- 321
                 TEE+  K  E E E   L++QID LE    + L E +    KIK+         
Sbjct: 254 PPAEMDTEEDRTK--EREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVR 311

Query: 322 ---EEIEKKLNGHSD-VPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMV 377
              EE+E++       V L P  E     +++++                  E+  +   
Sbjct: 312 QENEELEEEYKIKKRTVELLPDAENNVAKLQALVVA--------------SSERLLELAQ 357

Query: 378 QEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRL 437
           Q E  R  L   +E    + + + +++  +++ ++ +K R + EE +  L  K+Q   +L
Sbjct: 358 QWEAHRTPL--IDEYRSLKEKNRNKEDETQRQLDEIKKLRNKIEELESELQTKEQLYKQL 415

Query: 438 KAE 440
             E
Sbjct: 416 LDE 418


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 605

 Score = 29.4 bits (66), Expect = 6.2
 Identities = 20/113 (17%), Positives = 42/113 (37%), Gaps = 15/113 (13%)

Query: 360 PIQNSNKEE--EEKEKQRMVQEEIKRKELERKEELEKEQIRIKE-------------EQE 404
            ++N   E   + ++K     + I  K++   EE  K  +                  Q 
Sbjct: 367 KLENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQN 426

Query: 405 NIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQ 457
           N ++  ++EE   KE  +  + + K  +  D L+  +Q    +   L + + Q
Sbjct: 427 NKEQDIKKEELLEKEFVKKSEKIPKNDELLDNLELAKQKFFNKDIELSKNMLQ 479


>gnl|CDD|148750 pfam07321, YscO, Type III secretion protein YscO.  This family
           contains the bacterial type III secretion protein YscO,
           which is approximately 150 residues long. YscO has been
           shown to be required for high-level expression and
           secretion of the anti-host proteins V antigen and Yops
           in Yersinia pestis.
          Length = 152

 Score = 28.2 bits (63), Expect = 6.2
 Identities = 24/87 (27%), Positives = 45/87 (51%), Gaps = 2/87 (2%)

Query: 380 EIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQE-ENQKLLLKKQQEEDRLK 438
           E++   +  KE LEK Q ++   +EN    ++Q  + ++  E E Q+L   +QQ ++  K
Sbjct: 58  ELQGATVLLKE-LEKWQQQVGLLRENEASLEQQLAEAKQRLEAERQRLRQARQQLQEARK 116

Query: 439 AEEQARLLEQERLMEELRQQGTNEDTE 465
           A+E+   L ++   E   Q+   E+ E
Sbjct: 117 AQEKFAELARQEQAEAQAQRQYLEELE 143


>gnl|CDD|219925 pfam08598, Sds3, Sds3-like.  Repression of gene transcription is
           mediated by histone deacetylases containing
           repressor-co-repressor complexes, which are recruited to
           promoters of target genes via interactions with
           sequence-specific transcription factors. The
           co-repressor complex contains a core of at least seven
           proteins. This family represents the conserved region
           found in Sds3, Dep1 and BRMS1-homologue p40 proteins.
          Length = 184

 Score = 28.4 bits (64), Expect = 6.4
 Identities = 21/78 (26%), Positives = 43/78 (55%), Gaps = 3/78 (3%)

Query: 378 QEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRL 437
            E +K  E  R + L+  ++R + + E I+++ E E +  K++ E +K LL+++      
Sbjct: 45  LEPLKDLEERRDDRLKVAELRREYKLECIEREYEAERQAAKQEFEKEKRLLRERL---LE 101

Query: 438 KAEEQARLLEQERLMEEL 455
           + EE+   LE+ER   ++
Sbjct: 102 ELEEKIYRLEEERRSLDI 119


>gnl|CDD|224559 COG1645, COG1645, Uncharacterized Zn-finger containing protein
           [General function prediction only].
          Length = 131

 Score = 27.8 bits (62), Expect = 6.5
 Identities = 12/58 (20%), Positives = 27/58 (46%)

Query: 399 IKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELR 456
           + EE+E   + + QE+  R   E        K++ ++   +E+   ++E  + +E  R
Sbjct: 56  VVEEEEEEVEAEVQEQLRRSRPELPDDSDELKKEIDEDPASEKLLSVIETLKRIEARR 113


>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases.and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain.
           Expression of a construct consisting of only the
           N-terminal domain of Saccharomyces pombe Cdc37 results
           in cellular viability. This indicates that interactions
           with the cochaperone Hsp90 may not be essential for
           Cdc37 function.
          Length = 154

 Score = 28.2 bits (63), Expect = 6.9
 Identities = 19/116 (16%), Positives = 51/116 (43%), Gaps = 11/116 (9%)

Query: 362 QNSNKEEEEKEKQRM----VQEEIKRKELERKEELEK--EQIRIKEEQENIKKK----KE 411
           Q    +   +  + +     +  +     +R ++L K   +  +  E     +     ++
Sbjct: 34  QRDIHQARVERMEEIKNLKYELIMNDHLNKRIDKLLKGLREEELSPETPTYNEMLAELQD 93

Query: 412 QEEKERKEQEENQKLLLKK-QQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEE 466
           Q +KE +E   + + LL++ ++  D+LK E++    + + L +E +++  + DT  
Sbjct: 94  QLKKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKKKIWSVDTHT 149


>gnl|CDD|232958 TIGR00399, metG_C_term, methionyl-tRNA synthetase C-terminal
           region/beta chain.  The methionyl-tRNA synthetase (metG)
           is a class I amino acyl-tRNA ligase. This model
           describes a region of the methionyl-tRNA synthetase that
           is present at the C-terminus of MetG in some species (E.
           coli, B. subtilis, Thermotoga maritima, Methanobacterium
           thermoautotrophicum), and as a separate beta chain in
           Aquifex aeolicus. It is absent in a number of other
           species (e.g. Mycoplasma genitalium, Mycobacterium
           tuberculosis), while Pyrococcus horikoshii has both a
           full length MetG and a second protein homologous to the
           beta chain only. Proteins hit by This model should
           called methionyl-tRNA synthetase beta chain if and only
           if the model metG hits a separate protein not also hit
           by This model [Protein synthesis, tRNA aminoacylation].
          Length = 137

 Score = 27.8 bits (62), Expect = 6.9
 Identities = 16/58 (27%), Positives = 27/58 (46%)

Query: 271 AKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKL 328
            K E L  +  ++  K+ E EK     K+ I ++  +K++ R    LK +  E   KL
Sbjct: 2   KKIEELKLKGAKKKEKKDEGEKALEPQKETITIDDFEKVDLRVGKILKAERVEKSDKL 59


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 29.1 bits (66), Expect = 7.1
 Identities = 24/92 (26%), Positives = 51/92 (55%), Gaps = 14/92 (15%)

Query: 369 EEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQK-LL 427
           + +EKQ +++       L+ KE LEK    +++E + ++ +K   +K +++ E++Q+   
Sbjct: 177 KLEEKQEILET------LDVKERLEKLLDLLEKEIDLLQLEKRIRKKVKEQMEKSQREYY 230

Query: 428 LKKQQEEDRLKA--EEQARLLEQERLMEELRQ 457
           L++Q     LKA  +E     + +  +EELR+
Sbjct: 231 LREQ-----LKAIQKELGEDDDDKDEVEELRE 257


>gnl|CDD|204811 pfam12026, DUF3513, Domain of unknown function (DUF3513).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 192 to 218 amino acids in length. This domain is
           found associated with pfam00018, pfam08824. This domain
           has a conserved QPP sequence motif.
          Length = 208

 Score = 28.6 bits (64), Expect = 7.3
 Identities = 11/39 (28%), Positives = 19/39 (48%), Gaps = 1/39 (2%)

Query: 386 LERKEELEKEQIRIKE-EQENIKKKKEQEEKERKEQEEN 423
           L+ KEE ++ Q  + E       K+ E+ E+E  +  E 
Sbjct: 23  LQGKEEFQRTQKELLEKGPITQLKQFERLEREVAKPAEA 61


>gnl|CDD|224581 COG1667, COG1667, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 254

 Score = 28.7 bits (64), Expect = 7.3
 Identities = 25/81 (30%), Positives = 37/81 (45%), Gaps = 7/81 (8%)

Query: 384 KELERKEELEKEQIRIKEEQENIKKKKE-------QEEKERKEQEENQKLLLKKQQEEDR 436
           K  ER+E LEK  IR+ E +E  +   E        EE E   +EE  K L  K+ +E+ 
Sbjct: 131 KSSEREELLEKYGIRVPEIEEIERILDEAVGSDYFSEEIEEPSREELLKKLGIKEPDENW 190

Query: 437 LKAEEQARLLEQERLMEELRQ 457
           ++   +      E L EE  +
Sbjct: 191 VETILKVFRGVDEELEEEKER 211


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 29.0 bits (65), Expect = 7.5
 Identities = 18/72 (25%), Positives = 35/72 (48%), Gaps = 7/72 (9%)

Query: 399 IKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAE-----EQARLLEQ--ERL 451
           I+ + EN   +  QE  E +   E Q+LL    +  + L+ +     ++   L+Q  E L
Sbjct: 462 IRLQHENKMLRLGQEGSENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVEDL 521

Query: 452 MEELRQQGTNED 463
            + L++QG+  +
Sbjct: 522 QKALQEQGSKSE 533



 Score = 28.6 bits (64), Expect = 9.6
 Identities = 43/187 (22%), Positives = 70/187 (37%), Gaps = 31/187 (16%)

Query: 265 KPSNLRAKFENLAKQTEEESRKRSEEEK-------EKRKLKDQIDLEQAQKLEERRLSEL 317
           K +  R + E   +Q +E   K SEE K       E ++L+++  LE  QK +ER L+E 
Sbjct: 357 KANAARGQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEK--LEALQKEKERLLAER 414

Query: 318 KIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMV 377
               E  E+   G                   + +          N   E    E     
Sbjct: 415 DSLRETNEELRCGQ-------------AQQDQLTQADAGTSPSGDNLAAELLPSE----Y 457

Query: 378 QEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEE-----NQKLLLKKQQ 432
           +E++ R + E K     ++    E    +++  E   +   E E      NQ++L  +QQ
Sbjct: 458 REKLIRLQHENKMLRLGQEGSENERITELQQLLEDANRRNNELETQLRLANQRILELQQQ 517

Query: 433 EEDRLKA 439
            ED  KA
Sbjct: 518 VEDLQKA 524


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 29.2 bits (65), Expect = 7.6
 Identities = 20/112 (17%), Positives = 53/112 (47%), Gaps = 7/112 (6%)

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
            Q   ++ +E + +    ++ K K  E ++++  ++ +++  +E +K  + + +  +   
Sbjct: 195 RQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREIVKSYENELDPLKNRL 254

Query: 421 EENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQ------QGTNEDTEE 466
           +E +  L K  + ++ +KA +  R  + E+   EL        QGT+E   +
Sbjct: 255 KEIEHNLSKIMKLDNEIKALKS-RKKQMEKDNSELELKMEKVFQGTDEQLND 305


>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
          Length = 196

 Score = 28.3 bits (63), Expect = 7.6
 Identities = 23/85 (27%), Positives = 45/85 (52%), Gaps = 5/85 (5%)

Query: 347 VKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQI----RIKEE 402
           V ++      +G P     ++ + +++     EE+K+K L++KE   +E +    R+K E
Sbjct: 16  VNNLSSNEQMEGPPEDEQAQQNQPQQQTVDEIEELKQK-LQQKEVEAQEYLDIAQRLKAE 74

Query: 403 QENIKKKKEQEEKERKEQEENQKLL 427
            EN +K+ E+E+ E  E  + Q +L
Sbjct: 75  FENYRKRTEKEKSEMVEYGKEQVIL 99


>gnl|CDD|227780 COG5493, COG5493, Uncharacterized conserved protein containing a
           coiled-coil domain [Function unknown].
          Length = 231

 Score = 28.3 bits (63), Expect = 7.7
 Identities = 18/95 (18%), Positives = 46/95 (48%), Gaps = 16/95 (16%)

Query: 379 EEIKRKELE-----------RKEELEKEQIRIKE-----EQENIKKKKEQEEKERKEQEE 422
           EEIKR  LE           R E L +   ++        ++++++ +++ E+ +KE  +
Sbjct: 5   EEIKRVLLENPEILVDVLTQRPEILYEVLAKLTPWQQLATKQDVEELRKETEQRQKELAD 64

Query: 423 NQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQ 457
            +  + K++  ++ LK  ++ +  E     E++++
Sbjct: 65  EKLEVRKQKATKEDLKLLQRFQEEEFRATKEDIKR 99


>gnl|CDD|213395 cd12152, F1-ATPase_delta, mitochondrial ATP synthase delta subunit.
            The F-ATPase is found in bacterial plasma membranes,
           mitochondrial inner membranes and in chloroplast
           thylakoid membranes. It has also been found in the
           archaea Methanosarcina barkeri. It uses a proton
           gradient to drive ATP synthesis and hydrolyzes ATP to
           build the proton gradient. The extrinisic membrane
           domain, F1, is composed of alpha, beta, gamma, delta,
           and epsilon subunits with a stoichiometry of 3:3:1:1:1.
           Alpha and beta subunit form the globular catalytic
           moiety, a hexameric ring of alternating subunits. Gamma,
           delta and epsilon subunits form a stalk, connecting F1
           to F0, the integral membrane proton translocating
           domain. In bacteria, which is lacking a eukaryotic
           epsilon subunit homolog, this subunit is called the
           epsilon subunit.
          Length = 123

 Score = 27.5 bits (62), Expect = 7.8
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query: 299 DQIDLEQAQKLEERRLSELKIKEEEIEKK 327
           + ID+E+A++  ER    L   ++E EK 
Sbjct: 84  EDIDVERAEEALERAEERLAQAKDEREKA 112


>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain.  FH proteins control
           rearrangements of the actin cytoskeleton, especially in
           the context of cytokinesis and cell polarisation.
           Members of this family have been found to interact with
           Rho-GTPases, profilin and other actin-assoziated
           proteins. These interactions are mediated by the
           proline-rich FH1 domain, usually located in front of FH2
           (but not listed in SMART). Despite this cytosolic
           function, vertebrate formins have been assigned
           functions within the nucleus. A set of Formin-Binding
           Proteins (FBPs) has been shown to bind FH1 with their WW
           domain.
          Length = 392

 Score = 28.9 bits (65), Expect = 7.9
 Identities = 20/93 (21%), Positives = 41/93 (44%), Gaps = 3/93 (3%)

Query: 379 EEIKRKELERKEELEKEQI--RIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDR 436
           E++     E  ++   E+      E  +   K  E+  K+ +E+EE +K L+K+  E ++
Sbjct: 301 EKLVEYYGEDPKDTSPEEFFKDFNEFLKEFSKAAEENIKKEEEEEERRKKLVKETTEYEQ 360

Query: 437 LKAEEQARLLEQERLMEELRQQGTNEDTEEDLG 469
             + ++ R    +    E    G  +   E+LG
Sbjct: 361 SSSRQKERNPSMD-FEVERDFLGVLDSLLEELG 392


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 29.0 bits (65), Expect = 7.9
 Identities = 23/142 (16%), Positives = 52/142 (36%), Gaps = 5/142 (3%)

Query: 301 IDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIP 360
           IDL+    ++  + S  KI+ E   K    + D+      E      K   K        
Sbjct: 15  IDLKNPLFIKNLKHSSYKIRLESDIKDSLLNLDINKK-LHEKLDKKNKKFNKTDDLKDS- 72

Query: 361 IQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQ 420
            +   K++++ +K+  + ++    +     +   +Q         ++K K + EK +K+ 
Sbjct: 73  KKTKLKQKKKIKKKLHIDDD---YDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKI 129

Query: 421 EENQKLLLKKQQEEDRLKAEEQ 442
             N+    KK++         +
Sbjct: 130 TVNKSTNKKKKKVLSSKDELIK 151


>gnl|CDD|184059 PRK13450, atpC, F0F1 ATP synthase subunit epsilon; Provisional.
          Length = 132

 Score = 27.8 bits (62), Expect = 7.9
 Identities = 16/48 (33%), Positives = 30/48 (62%), Gaps = 3/48 (6%)

Query: 274 ENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELKIKE 321
           E +  +  E ++KR+EE   K   KD+ID+++A+    R ++ +K+KE
Sbjct: 87  EEIDIKRAENAKKRAEERLRK---KDEIDVKRAELALFRAIARIKLKE 131


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 28.9 bits (65), Expect = 8.1
 Identities = 18/96 (18%), Positives = 39/96 (40%), Gaps = 9/96 (9%)

Query: 379 EEIKRKELERKEELEKEQIRIKEEQENIKKKKE--------QEEKERKEQEENQKLLLKK 430
           +++KR+  + K +L KE   ++EE   + KK           +  E   + +N+    + 
Sbjct: 383 QQVKRELQDAKSQLLKELRELEEELAEVDKKISTIPSEEQIAQLLEELGEAQNELFRSEA 442

Query: 431 QQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEE 466
           + EE   + E     +E  R   + + +    +  E
Sbjct: 443 EIEELLRQLETLKEAIEALRKTLDEKTK-QKINAFE 477



 Score = 28.9 bits (65), Expect = 8.8
 Identities = 24/105 (22%), Positives = 39/105 (37%), Gaps = 7/105 (6%)

Query: 366 KEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEE---KERKEQEE 422
            E E  E +   Q E      +    L  E    +   E+++KK   E     E +EQ E
Sbjct: 209 SEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSEGGDLFEEREQLE 268

Query: 423 NQKLLL----KKQQEEDRLKAEEQARLLEQERLMEELRQQGTNED 463
            Q   +    K  + + R  A +   LL    L++  + Q   E+
Sbjct: 269 RQLKEIEAARKANRAQLRELAADPLPLLLIPNLLDSTKAQLQKEE 313


>gnl|CDD|118696 pfam10168, Nup88, Nuclear pore component.  Nup88 can be divided
           into two structural domains; the N-terminal two-thirds
           of the protein has no obvious structural motifs but is
           the region for binding to Nup98, one of the components
           of the nuclear pore. the C-terminal end is a predicted
           coiled-coil domain. Nup88 is overexpressed in tumour
           cells.
          Length = 717

 Score = 29.1 bits (65), Expect = 8.1
 Identities = 30/142 (21%), Positives = 58/142 (40%), Gaps = 16/142 (11%)

Query: 292 KEKRKLKDQIDLEQ--------AQKLEERRLSELKIKEEEIEKKLNG-----HSDVP-LS 337
           +++++L+D  D  +        A+KL E +  E K  +E +  +        +S +P LS
Sbjct: 573 QKEKQLEDIQDCREERKSLSERAEKLAE-KFEEAKYNQELLVNRCKRLLQSANSQLPVLS 631

Query: 338 PSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKE-Q 396
            S       ++ I KQ       I+   K++  +      QE  K+      E+  K   
Sbjct: 632 DSERDMSKELQRINKQLQHLANGIKQVKKKKNYQRYHMASQESPKKSSYTLPEKQHKTIT 691

Query: 397 IRIKEEQENIKKKKEQEEKERK 418
             +KE  E+I +  +Q ++   
Sbjct: 692 EILKELGEHIDRMIKQIKRINN 713


>gnl|CDD|236335 PRK08724, fliD, flagellar capping protein; Validated.
          Length = 673

 Score = 29.1 bits (65), Expect = 8.1
 Identities = 23/90 (25%), Positives = 38/90 (42%), Gaps = 4/90 (4%)

Query: 348 KSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIK 407
            +I K P   G     S    +     +  Q EI++K  + K +L+    + +   E  K
Sbjct: 335 AAIAKAPDVPGWSNTASGTLTDSYVTPKEAQAEIEQKLAQEKAQLDAAVEKGELTPEQAK 394

Query: 408 ----KKKEQEEKERKEQEENQKLLLKKQQE 433
                K E EE+ER E+ +  +  LK+ Q 
Sbjct: 395 QIARAKLEPEERERLEKIDKAQAALKQAQS 424


>gnl|CDD|204831 pfam12137, RapA_C, RNA polymerase recycling family C-terminal.
           This domain is found in bacteria. This domain is about
           360 amino acids in length. This domain is found
           associated with pfam00271, pfam00176. The function of
           this domain is not known, but structurally it forms an
           alpha-beta fold in nature with a central beta-sheet
           flanked by helices and loops, the beta-sheet being
           mainly antiparallel and flanked by four alpha helices,
           among which the two longer helices exhibit a coiled-coil
           arrangement.
          Length = 362

 Score = 28.7 bits (65), Expect = 8.2
 Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 9/62 (14%)

Query: 269 LRAKFENLAKQTEEESRKRSEEEKEKRKLKDQIDLEQAQKLEERRLSELK-----IKEEE 323
           LR + E L ++ E    K +EE+ E+   + +  ++Q    E  RL  LK     I++EE
Sbjct: 276 LRPQIEQLLQKAE----KLAEEQLEEIIEQAKQQMQQQLSAELARLKALKAVNPNIRDEE 331

Query: 324 IE 325
           IE
Sbjct: 332 IE 333


>gnl|CDD|220897 pfam10883, DUF2681, Protein of unknown function (DUF2681).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 81 and 117 amino acids
           in length.
          Length = 87

 Score = 27.0 bits (60), Expect = 8.4
 Identities = 20/59 (33%), Positives = 31/59 (52%), Gaps = 7/59 (11%)

Query: 408 KKKEQEEKERKEQEENQKLLLKKQQEED-------RLKAEEQARLLEQERLMEELRQQG 459
           K K+ + + RK Q EN++L  +K   E        R K EE  R L ++ + E+L+Q G
Sbjct: 24  KLKKAQRENRKLQAENEQLATEKAVAETEVKNAKVRQKNEENTRRLSRDDVDEQLQQHG 82


>gnl|CDD|212873 cd11940, SH3_ARHGEF5_19, Src homology 3 domain of the Rho guanine
           nucleotide exchange factors ARHGEF5 and ARHGEF19.
           ARHGEF5, also called ephexin-3 or TIM (Transforming
           immortalized mammary oncogene), is a potent activator of
           RhoA and it plays roles in regulating cell shape,
           adhesion, and migration. It binds to the SH3 domain of
           Src and is involved in regulating Src-induced podosome
           formation. ARHGEF19, also called ephexin-2 or WGEF
           (weak-similarity GEF), is highly expressed in the
           intestine, liver, heart and kidney. It activates RhoA,
           Cdc42, and Rac 1, and has been shown to activate RhoA in
           the Wnt-PCP (planar cell polarity) pathway. It is
           involved in the regulation of cell polarity and
           cytoskeletal reorganization. ARHGEF5 and ARHGEF19
           contain RhoGEF (also called Dbl-homologous or DH),
           Pleckstrin Homology (PH), and SH3 domains. The SH3
           domains of ARHGEFs play an autoinhibitory role through
           intramolecular interactions with a proline-rich region
           N-terminal to the DH domain. SH3 domains are protein
           interaction domains that bind to proline-rich ligands
           with moderate affinity and selectivity, preferentially
           to PxxP motifs. They play versatile and diverse roles in
           the cell including the regulation of enzymes, changing
           the subcellular localization of signaling pathway
           components, and mediating the formation of multiprotein
           complex assemblies.
          Length = 55

 Score = 25.9 bits (57), Expect = 8.6
 Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 2/46 (4%)

Query: 478 YQASADDEISFDPDDIITNIEMIDEGWWRG--YCHGQYGLFPANYV 521
           Y+A  +DE++ +  DII   +   +GW  G     G+ G FP ++V
Sbjct: 8   YKAQENDELTLEKADIIMVRQQSSDGWLEGVRLSDGERGWFPQSHV 53


>gnl|CDD|132534 TIGR03495, phage_LysB, phage lysis regulatory protein, LysB family.
            Members of this protein family are phage lysis
           regulatory protein, including the well-studied protein
           LysB (lysis protein B) of Enterobacteria phage P2. For
           members of this family, genes are found in phage or in
           prophage regions of bacterial genomes, typically near a
           phage lysozyme or phage holin.
          Length = 135

 Score = 27.7 bits (62), Expect = 8.7
 Identities = 12/55 (21%), Positives = 29/55 (52%)

Query: 389 KEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQA 443
           + EL  +  ++       K+ +E + + R++  + + LL +++Q  +RLK E + 
Sbjct: 39  QAELASKANQLIVLLALAKRNEEAQAQLRQQLAQARALLAQREQRIERLKRENED 93


>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 557

 Score = 28.7 bits (65), Expect = 8.8
 Identities = 28/126 (22%), Positives = 56/126 (44%), Gaps = 18/126 (14%)

Query: 344 PVPVKSILKQPTSDGIPI--QNSNKEEEEKEKQRMV------QEEIKRKELERK----EE 391
           PV   + LK+     I I  Q+ ++   + E QR +       EE+ ++  +      ++
Sbjct: 110 PVS-LAQLKELGQLLIDIHGQHEHQSLLKPELQRQLLDAFAGLEELAQEAYQEAYQAWKQ 168

Query: 392 LEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERL 451
             +E   ++E++    ++ +  + + +E EE     L  Q  ED    EE+ RL   E+L
Sbjct: 169 ARRELEDLQEKERERAQRADLLQFQLEELEE-----LNLQPGEDEELEEERKRLSNSEKL 223

Query: 452 MEELRQ 457
            E ++ 
Sbjct: 224 AEAIQN 229


>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
           unknown].
          Length = 833

 Score = 28.8 bits (64), Expect = 8.8
 Identities = 49/332 (14%), Positives = 104/332 (31%), Gaps = 55/332 (16%)

Query: 169 RQDKSAVGWDHQEKIEKHESQKDYAKGFGGKFGIESDRQDKSAVGWDHVEKVDK------ 222
           R+DK A     +   EK           G         QD       +    D+      
Sbjct: 349 REDKQAALLAAKLAAEKLAR----VTAQGALNARLKLAQDDLTQAELNYAAADQAANQEG 404

Query: 223 HQSQKDDNRQVVTSSKVKDLIAANSTN--ASKENIKPKPNIGHVKPSNLRAKFENLAKQT 280
             + ++D  +V+++ + +  I  N       +   + + N    K   L+A     A Q 
Sbjct: 405 ALNAREDEAEVLSTQEERRDILKNLLADAEKRTARQEELNKALAKAKILQADKAAKAYQE 464

Query: 281 EEESRKRSEEEKEKRKLKDQIDLEQAQK---LEERRLSELKIKEEEIEKKLNGHSDVPLS 337
           +   R+     K     + Q  +  A K     ++++++L   +E+   +          
Sbjct: 465 DILQREAQSRGKTAAAERSQEQMTAALKALLAFQQQIADLSGAKEKASDQ---------- 514

Query: 338 PSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQI 397
                             S     +      +E+ KQR +QE+    E +++      Q+
Sbjct: 515 -----------------KSLLWKAEEQYALLKEEAKQRQLQEQKALLEHKKETLEYTSQL 557

Query: 398 RIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQ 457
                       +  +++  + +   Q    +K++  D  +           + + EL  
Sbjct: 558 -----------AELLDQQADRFELSAQAAGSQKERGSDLYREALAQNAAALNKALNELAA 606

Query: 458 QGTNEDTEED--LGYTAVALYDYQASADDEIS 487
             +  D  +         AL +Y+ SA D  S
Sbjct: 607 YWSALDLLQGDWKAGALSALANYRDSATDVAS 638


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 28.7 bits (64), Expect = 9.0
 Identities = 19/89 (21%), Positives = 41/89 (46%)

Query: 379 EEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDRLK 438
           EE+  K   ++++ E+E+ R ++E+E         +  +      +   +  Q+ E+   
Sbjct: 385 EEVISKYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEES 444

Query: 439 AEEQARLLEQERLMEELRQQGTNEDTEED 467
            EE+    E+E   E+  ++   ED EE+
Sbjct: 445 VEEEEEEEEEEEEEEQESEEEEGEDEEEE 473


>gnl|CDD|222430 pfam13874, Nup54, Nucleoporin complex subunit 54.  This is the
           human Nup54 subunit of the nucleoporin complex,
           equivalent to Nup57 of yeast. Nup54, Nup58 and Nup62 all
           have similar affinities for importin-beta. It seems
           likely that they are the only FG-repeat nucleoporins of
           the central channel, and as such they would form a zone
           of equal affinity spanning the central channel. The
           diffusion of importin-beta import complexes through the
           central channel may be a stochastic process as the
           affinities are similar, whereas movement from
           cytoplasmic fibrils to the central channel and from the
           central channel to the nuclear basket would be
           facilitated by the subtle differences in affinity
           between them.
          Length = 141

 Score = 27.6 bits (62), Expect = 9.0
 Identities = 21/91 (23%), Positives = 42/91 (46%), Gaps = 12/91 (13%)

Query: 377 VQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEENQKLLLKKQQEEDR 436
            +E  +R +L+R EE+++ + R+ E  + +K+        +K   E    + + ++    
Sbjct: 32  FEELKQRVKLQR-EEVKQLRERLHEINDKLKEL-------QKHDLETSARIEEAKRRHTE 83

Query: 437 LKAEEQARLLEQERLMEELRQQGTNEDTEED 467
           L      RLL+    +E LR +G     EE+
Sbjct: 84  L----SHRLLKLAAKLEILRNKGYPLSPEEE 110


>gnl|CDD|235570 PRK05703, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 424

 Score = 28.7 bits (65), Expect = 9.2
 Identities = 17/105 (16%), Positives = 42/105 (40%), Gaps = 8/105 (7%)

Query: 360 PIQNSNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKE 419
            ++        ++     +  +   + E ++  E ++   K   E+ K  +++ ++ R E
Sbjct: 80  LLEKRPSRTNSQDALLQAENALPEWKKELEKPSEPKEEEPKAAAES-KVVQKELDELRDE 138

Query: 420 QEENQKLL------LKKQQEEDRLKAEEQARLLEQERLMEELRQQ 458
            +E + LL      L++ +      AE   RL     L  E+ ++
Sbjct: 139 LKELKNLLEDQLSGLRQVERIPPEFAELYKRLKRSG-LSPEIAEK 182


>gnl|CDD|235751 PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated.
          Length = 871

 Score = 28.7 bits (65), Expect = 9.3
 Identities = 19/50 (38%), Positives = 27/50 (54%), Gaps = 4/50 (8%)

Query: 399 IKEEQENIKKKK-EQEEKERKEQEENQKLLLKKQQEED-RLKAEEQARLL 446
           IK  +    K+K EQ  +E  E+EE   LL + QQ  D   KA+E  R++
Sbjct: 612 IKNFEPGASKRKFEQGRQEALEKEEE--LLSRLQQLPDGEQKAKETKRMI 659


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 27.8 bits (62), Expect = 9.3
 Identities = 16/48 (33%), Positives = 24/48 (50%)

Query: 420 QEENQKLLLKKQQEEDRLKAEEQARLLEQERLMEELRQQGTNEDTEED 467
             E  +L L +QQEE+    EE+    E+    EE  Q+G   + EE+
Sbjct: 65  DREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEE 112


>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family.  This family includes PspA
           a protein that suppresses sigma54-dependent
           transcription. The PspA protein, a negative regulator of
           the Escherichia coli phage shock psp operon, is produced
           when virulence factors are exported through secretins in
           many Gram-negative pathogenic bacteria and its homologue
           in plants, VIPP1, plays a critical role in thylakoid
           biogenesis, essential for photosynthesis. Activation of
           transcription by the enhancer-dependent bacterial
           sigma(54) containing RNA polymerase occurs through ATP
           hydrolysis-driven protein conformational changes enabled
           by activator proteins that belong to the large AAA(+)
           mechanochemical protein family. It has been shown that
           PspA directly and specifically acts upon and binds to
           the AAA(+) domain of the PspF transcription activator.
          Length = 220

 Score = 28.1 bits (63), Expect = 9.6
 Identities = 20/97 (20%), Positives = 50/97 (51%), Gaps = 3/97 (3%)

Query: 364 SNKEEEEKEKQRMVQEEIKRKELERKEELEKEQIRIKEEQENIKKKKEQEEKERKEQEEN 423
               ++ ++ ++M+++ I+  + E  +   +   ++   Q+ +++K E+++++ K+ E  
Sbjct: 15  HEGLDKAEDPEKMLEQAIRDMQSELGK-ARQALAQVIARQKQLERKLEEQKEQAKKLENK 73

Query: 424 QKLLLKKQQEEDRLKAEEQARLLEQ--ERLMEELRQQ 458
            +  L K  EE   +A  +   LE+  E L  +L QQ
Sbjct: 74  ARAALTKGNEELAREALAEIATLEKQAEALETQLTQQ 110


>gnl|CDD|131393 TIGR02340, chap_CCT_alpha, T-complex protein 1, alpha subunit.
           Members of this family, all eukaryotic, are part of the
           group II chaperonin complex called CCT (chaperonin
           containing TCP-1) or TRiC. The archaeal equivalent group
           II chaperonin is often called the thermosome. Both are
           somewhat related to the group I chaperonin of bacterial,
           GroEL/GroES. This family consists exclusively of the CCT
           alpha chain (part of a paralogous family) from animals,
           plants, fungi, and other eukaryotes.
          Length = 536

 Score = 28.5 bits (64), Expect = 9.6
 Identities = 17/47 (36%), Positives = 27/47 (57%)

Query: 291 EKEKRKLKDQIDLEQAQKLEERRLSELKIKEEEIEKKLNGHSDVPLS 337
           +K K  L  QI ++   KLE+ R  E  I +E I+K L+  ++V L+
Sbjct: 238 QKAKMALGVQIVVDDPAKLEQIRQREADITKERIKKILDAGANVVLT 284


>gnl|CDD|146444 pfam03805, CLAG, Cytoadherence-linked asexual protein.  Clag
           (cytoadherence linked asexual gene) is a malaria surface
           protein which has been shown to be involved in the
           binding of Plasmodium falciparum infected erythrocytes
           to host endothelial cells, a process termed
           cytoadherence. The cytoadherence phenomenon is
           associated with the sequestration of infected
           erythrocytes in the blood vessels of the brain, cerebral
           malaria. Clag is a multi-gene family in Plasmodium
           falciparum with at least 9 members identified to date.
           Orthologous proteins in the rodent malaria species
           Plasmodium chabaudi (Lawson D Unpubl. obs.) suggest that
           the gene family is found in other malaria species and
           may play a more generic role in cytoadherence.
          Length = 1282

 Score = 28.9 bits (65), Expect = 9.7
 Identities = 31/176 (17%), Positives = 51/176 (28%), Gaps = 37/176 (21%)

Query: 227 KDDNRQVVTSSKVKDLIAANSTNASKENIKPKPNIGHVKPSNLRAKFENLAKQTEEESRK 286
             D ++ +  SK K +    +    +E I P  +           K+E++ K+       
Sbjct: 41  TSDTKEYLDISKFKIINKDITDGDDEEYIVPTEDST----FEDIIKYEHILKEQVILKYN 96

Query: 287 RSEEEKEKRKL-------------------KDQIDLEQA------------QKLEERRLS 315
               +  KRK+                   K   DL++A            +   E + +
Sbjct: 97  PEISDLIKRKILIVRTLKIIKLMLIPMNAYKKTNDLKKALEELNKVFTNKNKSNNEDKDN 156

Query: 316 ELKIKEEEIEKKLNGHSDVPLSPSTETPPVPVKSILKQPTSDGIPIQNSNKEEEEK 371
            LK+     EK L     V    S          I     +D     N N E  E 
Sbjct: 157 TLKLSGRIFEKLLLSVKTVKRRNSIYNKGDDKIDI--IDYNDLFFTTNPNIEFMEN 210


>gnl|CDD|224400 COG1483, COG1483, Predicted ATPase (AAA+ superfamily) [General
           function prediction only].
          Length = 774

 Score = 28.6 bits (64), Expect = 9.8
 Identities = 17/76 (22%), Positives = 28/76 (36%), Gaps = 7/76 (9%)

Query: 377 VQEEIKRKELERKEELEKEQIRIKEEQENIKKK--KEQEEKERKEQEENQKLLLKKQQEE 434
           + E    + L   EE     +   E  + IK+K  K  E+ E   + E +K+  K  +  
Sbjct: 604 LVEYEAEEILSILEEETARVLACDEVMKTIKEKYKKYGEDVEEISENEVRKIGEKAVE-- 661

Query: 435 DRLKAEEQARLLEQER 450
                EEQ +      
Sbjct: 662 ---DLEEQFKDSFGRV 674


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.306    0.127    0.352 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,876,277
Number of extensions: 2717661
Number of successful extensions: 15080
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8278
Number of HSP's successfully gapped: 2434
Length of query: 525
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 424
Effective length of database: 6,457,848
Effective search space: 2738127552
Effective search space used: 2738127552
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 61 (27.1 bits)