RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1797
         (393 letters)



>gnl|CDD|219649 pfam07933, DUF1681, Protein of unknown function (DUF1681).  This
           family is composed of sequences derived from a number of
           hypothetical eukaryotic proteins of unknown function.
          Length = 158

 Score =  206 bits (527), Expect = 5e-66
 Identities = 80/134 (59%), Positives = 97/134 (72%), Gaps = 4/134 (2%)

Query: 190 RAADWNLQEPQWTGRMRLVTKDKECILKLEDRNSGELFAQCPIDTYPGVAIETVSDSSRY 249
           RAADW L EP WTGR+R+V K     + LED ++GELFAQ P D+Y   A+E V DSSRY
Sbjct: 27  RAADWKLDEPIWTGRLRVVEKGDRVSIVLEDPSTGELFAQAPYDSYN--AVEPVLDSSRY 84

Query: 250 FVLRIKDDSGRQAFIGLGFSDRSDSFDLNVALQDHFKWLKKTEEIEKEKEEPKQE--LDL 307
           FV+R++D +GR+AFIG+GF DRSD+FD NVALQDHFKW+K   E  K+KE    E  LDL
Sbjct: 85  FVIRVQDPTGRKAFIGIGFEDRSDAFDFNVALQDHFKWVKNESEPSKQKESNDSEPKLDL 144

Query: 308 KFKEGETIKINMKI 321
             KEGETI IN+  
Sbjct: 145 SLKEGETITINLGG 158



 Score =  179 bits (455), Expect = 3e-55
 Identities = 68/119 (57%), Positives = 85/119 (71%), Gaps = 4/119 (3%)

Query: 1   MRLVTKDKECILKLEDRNSGELFAQCPIDTYPGVAIETVSDSSRYFVLRIKDDSGRQAFI 60
           +R+V K     + LED ++GELFAQ P D+Y   A+E V DSSRYFV+R++D +GR+AFI
Sbjct: 42  LRVVEKGDRVSIVLEDPSTGELFAQAPYDSYN--AVEPVLDSSRYFVIRVQDPTGRKAFI 99

Query: 61  GLGFSDRSDSFDLNVALQDHFKWLKKTEEIEKEKEEPKQE--LDLKFKEGETIKINMKI 117
           G+GF DRSD+FD NVALQDHFKW+K   E  K+KE    E  LDL  KEGETI IN+  
Sbjct: 100 GIGFEDRSDAFDFNVALQDHFKWVKNESEPSKQKESNDSEPKLDLSLKEGETITINLGG 158



 Score = 99.7 bits (249), Expect = 4e-25
 Identities = 41/65 (63%), Positives = 47/65 (72%), Gaps = 2/65 (3%)

Query: 331 GRQAFIGLGFSDRSDSFDLNVALQDHFKWLKKTEEIEKEKEEPKQE--LDLKFKEGETIK 388
           GR+AFIG+GF DRSD+FD NVALQDHFKW+K   E  K+KE    E  LDL  KEGETI 
Sbjct: 94  GRKAFIGIGFEDRSDAFDFNVALQDHFKWVKNESEPSKQKESNDSEPKLDLSLKEGETIT 153

Query: 389 INMKI 393
           IN+  
Sbjct: 154 INLGG 158


>gnl|CDD|241382 cd13228, PHear_NECAP, NECAP (adaptin-ear-binding coat-associated
           protein) Plextrin Homology (PH) fold with ear-like
           function (PHear) domain.  NECAPs are alpha-ear-binding
           proteins that enrich on clathrin-coated vesicles (CCVs).
           NECAP 1 is expressed in brain and non-neuronal tissues
           and cells while NECAP 2 is ubiquitously expressed. The
           PH-like domain of NECAPs is a protein-binding interface
           that mimics the FxDxF motif binding properties of the
           alpha-ear and is called PHear (PH fold with ear-like
           function) domain. PH domains have diverse functions, but
           in general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 120

 Score =  193 bits (494), Expect = 1e-61
 Identities = 68/105 (64%), Positives = 85/105 (80%), Gaps = 4/105 (3%)

Query: 186 PQPS----RAADWNLQEPQWTGRMRLVTKDKECILKLEDRNSGELFAQCPIDTYPGVAIE 241
           P+ S    RAADW L++P WTGR+R+V K ++C ++LED N+GELFAQCP+D  PG A+E
Sbjct: 16  PRASARGYRAADWGLEKPLWTGRLRVVAKGEKCEIRLEDPNTGELFAQCPVDVDPGTAVE 75

Query: 242 TVSDSSRYFVLRIKDDSGRQAFIGLGFSDRSDSFDLNVALQDHFK 286
            V DSSRYFVLR++D SGR AFIG+GF +RSD+FD NVALQDHFK
Sbjct: 76  PVVDSSRYFVLRVEDGSGRHAFIGIGFRERSDAFDFNVALQDHFK 120



 Score =  164 bits (417), Expect = 4e-50
 Identities = 55/82 (67%), Positives = 69/82 (84%)

Query: 1   MRLVTKDKECILKLEDRNSGELFAQCPIDTYPGVAIETVSDSSRYFVLRIKDDSGRQAFI 60
           +R+V K ++C ++LED N+GELFAQCP+D  PG A+E V DSSRYFVLR++D SGR AFI
Sbjct: 39  LRVVAKGEKCEIRLEDPNTGELFAQCPVDVDPGTAVEPVVDSSRYFVLRVEDGSGRHAFI 98

Query: 61  GLGFSDRSDSFDLNVALQDHFK 82
           G+GF +RSD+FD NVALQDHFK
Sbjct: 99  GIGFRERSDAFDFNVALQDHFK 120



 Score = 65.7 bits (161), Expect = 3e-13
 Identities = 22/28 (78%), Positives = 25/28 (89%)

Query: 331 GRQAFIGLGFSDRSDSFDLNVALQDHFK 358
           GR AFIG+GF +RSD+FD NVALQDHFK
Sbjct: 93  GRHAFIGIGFRERSDAFDFNVALQDHFK 120


>gnl|CDD|241235 cd00900, PH-like, Pleckstrin homology-like domain.  The PH-like
          family includes the PH domain, both the Shc-like and
          IRS-like PTB domains, the ran-binding domain, the EVH1
          domain, a domain in neurobeachin and the third domain
          of FERM. All of these domains have a PH fold, but lack
          significant sequence similarity. They are generally
          involved in targeting to protein to the appropriate
          cellular location or interacting with a binding
          partner. This domain family possesses multiple
          functions including the ability to bind inositol
          phosphates and to other proteins.
          Length = 88

 Score = 42.8 bits (101), Expect = 1e-05
 Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 3/58 (5%)

Query: 12 LKLEDRNSGELFAQCPIDTYPGVAIETVSDSSRYFVLRIKDDSGRQAFIGLGFSDRSD 69
          L L D+    L  + PI     V     SDS R  VLR+KD   ++  +G  F  + +
Sbjct: 27 LILRDKQDDRLKLKIPISDIVKVNPSRASDSDRALVLRLKD---KRESLGFSFKSKEE 81



 Score = 42.8 bits (101), Expect = 1e-05
 Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 3/58 (5%)

Query: 216 LKLEDRNSGELFAQCPIDTYPGVAIETVSDSSRYFVLRIKDDSGRQAFIGLGFSDRSD 273
           L L D+    L  + PI     V     SDS R  VLR+KD   ++  +G  F  + +
Sbjct: 27  LILRDKQDDRLKLKIPISDIVKVNPSRASDSDRALVLRLKD---KRESLGFSFKSKEE 81


>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
          Length = 653

 Score = 37.5 bits (87), Expect = 0.011
 Identities = 29/103 (28%), Positives = 45/103 (43%), Gaps = 14/103 (13%)

Query: 66  DRSDSFDLNVALQDHFKWLKKTEEIEKEKEEPKQELDLKFKEGETIKINMKITKKDGADG 125
             SD   +  A+ +  +WL+K +  EKE+ E KQ      KE E++  N  +TK   A G
Sbjct: 565 SDSDKATIEKAIDEALEWLEKNQLAEKEEFEHKQ------KEVESV-CNPIMTKMYQAAG 617

Query: 126 QGKAKTRPTSTGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVN 168
            G     P   GG+    PG       P  A + +   + +V+
Sbjct: 618 GGMPGGMP---GGMPGGMPGGAG----PAGAGASSGPTVEEVD 653


>gnl|CDD|213013 cd12080, SH3_MPP1, Src Homology 3 domain of Membrane Protein,
           Palmitoylated 1 (or MAGUK p55 subfamily member 1).
           MPP1, also called 55 kDa erythrocyte membrane protein
           (p55), is a ubiquitously-expressed scaffolding protein
           that plays roles in regulating neutrophil polarity, cell
           shape, hair cell development, and neural development and
           patterning of the retina. It was originally identified
           as an erythrocyte protein that stabilizes the actin
           cytoskeleton to the plasma membrane by forming a complex
           with 4.1R protein and glycophorin C. MPP1 is one of
           seven vertebrate homologs of the Drosophila Stardust
           protein, which is required in establishing cell
           polarity, and it contains the three domains
           characteristic of MAGUK (membrane-associated guanylate
           kinase) proteins: PDZ, SH3, and guanylate kinase (GuK).
           In addition, it also contains the Hook (Protein 4.1
           Binding) motif in between the SH3 and GuK domains. The
           GuK domain in MAGUK proteins is enzymatically inactive;
           instead, the domain mediates protein-protein
           interactions and associates intramolecularly with the
           SH3 domain. SH3 domains are protein interaction domains
           that bind to proline-rich ligands with moderate affinity
           and selectivity, preferentially to PxxP motifs. They
           play versatile and diverse roles in the cell including
           the regulation of enzymes, changing the subcellular
           localization of signaling pathway components, and
           mediating the formation of multiprotein complex
           assemblies.
          Length = 62

 Score = 30.3 bits (68), Expect = 0.21
 Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 3/47 (6%)

Query: 97  PKQELDLKFKEGETIKINMKITKKDGADGQGKAKTRPTSTGGLLPPP 143
           P +E  LKF+ G+ I+I   I K D    QG+ +     + GL+P P
Sbjct: 17  PCKEAGLKFQTGDIIQI---INKDDSNWWQGRVEGSGEESAGLIPSP 60


>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein.  PACS-1 is a
           cytosolic sorting protein that directs the localisation
           of membrane proteins in the trans-Golgi network
           (TGN)/endosomal system. PACS-1 connects the clathrin
           adaptor AP-1 to acidic cluster sorting motifs contained
           in the cytoplasmic domain of cargo proteins such as
           furin, the cation-independent mannose-6-phosphate
           receptor and in viral proteins such as human
           immunodeficiency virus type 1 Nef.
          Length = 413

 Score = 32.5 bits (74), Expect = 0.34
 Identities = 16/49 (32%), Positives = 22/49 (44%), Gaps = 8/49 (16%)

Query: 117 ITKKDGADGQGKAKTRPTSTGGLLPPPPGSGMSK---ISPPPAPSQNSS 162
               D  D    +    +S+     PP  SG SK    +PPP+PS +SS
Sbjct: 216 QASADLDDSAALSLFVLSSS-----PPSSSGASKEASATPPPSPSMSSS 259


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 32.6 bits (74), Expect = 0.42
 Identities = 20/70 (28%), Positives = 25/70 (35%), Gaps = 3/70 (4%)

Query: 123  ADGQGKAKTRPTSTGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAP 182
                  A     S  G LPPP  +  +   PPP P   S P+    A     GD     P
Sbjct: 2811 VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG---GDVRRRPP 2867

Query: 183  APAPQPSRAA 192
            + +P    AA
Sbjct: 2868 SRSPAAKPAA 2877



 Score = 31.8 bits (72), Expect = 0.80
 Identities = 16/52 (30%), Positives = 21/52 (40%), Gaps = 3/52 (5%)

Query: 141  PPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAPAPQPSRAA 192
            PPPP        P P    +++P+    A A        AAPAP   P+  A
Sbjct: 2701 PPPPPP---TPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPA 2749



 Score = 31.1 bits (70), Expect = 1.4
 Identities = 11/62 (17%), Positives = 14/62 (22%), Gaps = 1/62 (1%)

Query: 131  TRPTSTGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAPAPQPSR 190
             RP +       P G        PP       P    +  AN           P  +P  
Sbjct: 2596 ARPRAPVDDRGDPRGPAPPS-PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRD 2654

Query: 191  AA 192
              
Sbjct: 2655 DP 2656



 Score = 30.3 bits (68), Expect = 2.5
 Identities = 17/65 (26%), Positives = 23/65 (35%), Gaps = 6/65 (9%)

Query: 132  RPTSTGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPA-----PAP 186
             P +     P PPG   ++ + P  P+  + P       A   G    A P      PAP
Sbjct: 2711 APHALVSATPLPPGPAAARQASPALPAAPAPP-AVPAGPATPGGPARPARPPTTAGPPAP 2769

Query: 187  QPSRA 191
             P  A
Sbjct: 2770 APPAA 2774



 Score = 28.4 bits (63), Expect = 8.6
 Identities = 13/61 (21%), Positives = 19/61 (31%), Gaps = 2/61 (3%)

Query: 133 PTSTGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAPAPQPSRAA 192
           P    G    P  +     + PP P + ++P          WG   + AP   P P    
Sbjct: 259 PVVGEGADRAPETA--RGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPP 316

Query: 193 D 193
            
Sbjct: 317 P 317


>gnl|CDD|224363 COG1446, COG1446, Asparaginase [Amino acid transport and
           metabolism].
          Length = 307

 Score = 31.9 bits (73), Expect = 0.52
 Identities = 21/72 (29%), Positives = 29/72 (40%), Gaps = 5/72 (6%)

Query: 77  LQDHFKWLKKTEEIEKEKEEPKQELDLKFKEGETIKINMKITKKDGA---DGQGKAKTRP 133
             D F   ++ E ++ E++  KQ LD   K  E  +         GA   D  G      
Sbjct: 134 EYDPFTEERRAEWLQAERDAKKQVLD-HSKTYEEPEDPDSKHGTVGAVALDADGNLAA-A 191

Query: 134 TSTGGLLPPPPG 145
           TSTGG+    PG
Sbjct: 192 TSTGGVFLKRPG 203


>gnl|CDD|237537 PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional.
          Length = 440

 Score = 31.4 bits (72), Expect = 0.91
 Identities = 16/85 (18%), Positives = 23/85 (27%), Gaps = 5/85 (5%)

Query: 122 GADGQGKAKTRPTSTGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAA 181
           GA G   A       GG+      +  S     P     S     + ++       T   
Sbjct: 327 GAAGGSGAAGVAAGLGGVARAGASAAAS-----PLRRAASRAAESMKSSFRAGARSTGGG 381

Query: 182 PAPAPQPSRAADWNLQEPQWTGRMR 206
              A   + A       P W  RM+
Sbjct: 382 AGGAAAAAAAGAAAAGPPAWAKRMK 406


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 31.2 bits (71), Expect = 1.1
 Identities = 17/89 (19%), Positives = 25/89 (28%), Gaps = 9/89 (10%)

Query: 122 GADGQGKAKTRPTSTGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAA 181
            A     A     +        P +  S  + PPA +   +P+    A A         A
Sbjct: 385 EAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAP-PAPVAAPAAAAP--------A 435

Query: 182 PAPAPQPSRAADWNLQEPQWTGRMRLVTK 210
            APA  P+  A       Q       +  
Sbjct: 436 AAPAAAPAAVALAPAPPAQAAPETVAIPV 464


>gnl|CDD|222483 pfam13971, MEI4-Rec24, Microtubule-binding domain of katanin.  This
           is the C-terminal domain of katanin - a heterodimeric
           micro-tubule severing ATPase. Katanin is found localised
           at mitotic spindle bodies. This domain binds
           microtubules by interacting specifically with the
           N-terminal domain of p60 katanin.
          Length = 375

 Score = 31.0 bits (70), Expect = 1.2
 Identities = 25/111 (22%), Positives = 38/111 (34%), Gaps = 14/111 (12%)

Query: 282 QDHFKWLKKTEEIEKEKEEPKQELDL--------KFKEGETIK--INMKITVSTYTLGLG 331
           QD   W KK E +E E  + +QEL          K          ++   T S     L 
Sbjct: 53  QDE-GWKKKAELLEAEPLQLRQELLKNRNSAGCAKSGAKVFPAKLLDQDPTSSENDETLL 111

Query: 332 RQAFIGLGFSDRSDSF---DLNVALQDHFKWLKKTEEIEKEKEEPKQELDL 379
            +       S R       D+      H ++L+   E++   E   Q+ DL
Sbjct: 112 EELGPTPPNSQRVPKRPQADIENPFSSHMQFLQCLLELKTLTESVYQKADL 162


>gnl|CDD|219055 pfam06484, Ten_N, Teneurin Intracellular Region.  This family is
           found in the intracellular N-terminal region of the
           Teneurin family of proteins. These proteins are
           'pair-rule' genes and are involved in tissue patterning,
           specifically probably neural patterning. The
           intracellular domain is cleaved in response to
           homophilic interaction of the extracellular domain, and
           translocates to the nucleus. Here it probably carries
           out to some transcriptional regulatory activity. The
           length of this region and the conservation suggests that
           there may be two structural domains here (personal obs:C
           Yeats).
          Length = 370

 Score = 30.4 bits (68), Expect = 1.5
 Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 2/42 (4%)

Query: 152 SPPPAPSQNSSPIHQ-VNANANNWGDFTS-AAPAPAPQPSRA 191
           +PP   S + SP H   + N+ N  ++T  + P+PAP  S  
Sbjct: 244 TPPLPLSHSHSPSHHAASINSLNRSNYTQRSNPSPAPTDSST 285


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 30.6 bits (69), Expect = 1.5
 Identities = 17/68 (25%), Positives = 26/68 (38%), Gaps = 11/68 (16%)

Query: 127 GKAKTRP-TSTGGLLP----PPPGSGMSKISPPPAPSQNSSPIHQVNA------NANNWG 175
           G A + P    G   P    PPP +     +PP  P    + + ++        +A   G
Sbjct: 908 GDAASAPPPGAGPPAPPQAVPPPRTTQPPAAPPRGPDVPPAAVAELRETLADLRSAQRSG 967

Query: 176 DFTSAAPA 183
           DFT+   A
Sbjct: 968 DFTAYGAA 975


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 30.8 bits (69), Expect = 1.6
 Identities = 17/66 (25%), Positives = 27/66 (40%), Gaps = 9/66 (13%)

Query: 123 ADGQGKAKTRPTSTGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAP 182
           +D   + +         +  PPG+ ++  +PPP PS  + P           G   +A P
Sbjct: 164 SDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQ---------GSPIAAQP 214

Query: 183 APAPQP 188
           AP PQ 
Sbjct: 215 APQPQQ 220



 Score = 29.7 bits (66), Expect = 3.0
 Identities = 11/51 (21%), Positives = 15/51 (29%), Gaps = 5/51 (9%)

Query: 142 PPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAPAPQPSRAA 192
           PP  S          P Q+    H   +     G   +  P P+   S  A
Sbjct: 432 PPKASTHPHSGLHSGPPQSPFAQHPFTS-----GGLPAIGPPPSLPTSTPA 477


>gnl|CDD|218056 pfam04388, Hamartin, Hamartin protein.  This family includes the
           hamartin protein which is thought to function as a
           tumour suppressor. The hamartin protein interacts with
           the tuberin protein pfam03542. Tuberous sclerosis
           complex (TSC) is an autosomal dominant disorder and is
           characterized by the presence of hamartomas in many
           organs, such as brain, skin, heart, lung, and kidney. It
           is caused by mutation either TSC1 or TSC2 tumour
           suppressor gene. TSC1 encodes a protein, hamartin,
           containing two coiled-coil regions, which have been
           shown to mediate binding to tuberin. The TSC2 gene codes
           for tuberin pfam03542. These two proteins function
           within the same pathway(s) regulating cell cycle, cell
           growth, adhesion, and vesicular trafficking.
          Length = 667

 Score = 30.7 bits (69), Expect = 1.8
 Identities = 17/72 (23%), Positives = 28/72 (38%), Gaps = 1/72 (1%)

Query: 133 PTSTGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAPAPQPSRAA 192
           P+S  G+  PP   GMS +    +PS  S         +      + + P+ +P P   A
Sbjct: 335 PSSLCGMATPPSSIGMSPLILSLSPSHLSGRAPGTTG-SGKGEPASESTPSTSPPPPGLA 393

Query: 193 DWNLQEPQWTGR 204
           D  ++    T  
Sbjct: 394 DDIVRAIFATSS 405


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 30.1 bits (68), Expect = 2.1
 Identities = 16/67 (23%), Positives = 22/67 (32%), Gaps = 13/67 (19%)

Query: 126 QGKAKTRPTSTGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAPA 185
           +  A   P++  G   P   +     + PP P    +                 A PAPA
Sbjct: 378 ERGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAA-------------ARPAPA 424

Query: 186 PQPSRAA 192
           P P  AA
Sbjct: 425 PAPPAAA 431


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 30.5 bits (69), Expect = 2.1
 Identities = 13/63 (20%), Positives = 24/63 (38%), Gaps = 8/63 (12%)

Query: 141 PPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTS--------AAPAPAPQPSRAA 192
           P P   G       P  S +SS   + ++++ +    +S         +P+ +P PSR  
Sbjct: 298 PSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP 357

Query: 193 DWN 195
              
Sbjct: 358 PPA 360



 Score = 29.4 bits (66), Expect = 4.1
 Identities = 17/69 (24%), Positives = 26/69 (37%), Gaps = 1/69 (1%)

Query: 125 GQGKAKTRPTSTGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAP 184
                +  PT  G   P PP       SPPP+P+ + S + +    +       S   A 
Sbjct: 98  ASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLR-PVGSPGPPPAASPPAAG 156

Query: 185 APQPSRAAD 193
           A   + A+D
Sbjct: 157 ASPAAVASD 165


>gnl|CDD|178439 PLN02847, PLN02847, triacylglycerol lipase.
          Length = 633

 Score = 30.2 bits (68), Expect = 2.3
 Identities = 20/79 (25%), Positives = 36/79 (45%), Gaps = 5/79 (6%)

Query: 89  EIEKEKEEPKQELDLKFKEGETIKINMKITKKDGADGQGKAKTRPTSTGGLLPP----PP 144
           E+EKE +  + E+D + +E E      +IT+++    +        ++  L       PP
Sbjct: 502 ELEKELQRQETEVDAQAQEEEAAAAK-EITEEENVLAKAVESKTSITSSDLSESQQFYPP 560

Query: 145 GSGMSKISPPPAPSQNSSP 163
           G  M  +S PP+ S+N   
Sbjct: 561 GRIMHIVSMPPSDSENDDD 579


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 29.0 bits (65), Expect = 2.5
 Identities = 15/66 (22%), Positives = 21/66 (31%), Gaps = 8/66 (12%)

Query: 126 QGKAKTRPTSTGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAPA 185
              A   P +        P   +    P   P+   +P   V A A       +  P+  
Sbjct: 83  TPPAPPEPVTPPT--AQSPAPAVPTPPPTSTPAVPPAPAAAVPAPA------AAPPPSDP 134

Query: 186 PQPSRA 191
           PQP RA
Sbjct: 135 PQPPRA 140


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 29.8 bits (67), Expect = 2.7
 Identities = 19/66 (28%), Positives = 27/66 (40%), Gaps = 4/66 (6%)

Query: 124 DGQGKAKTRPTSTGGLLP-PPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAP 182
           D       RP       P PPPG+G      P  P ++ +P+  +     +     +A P
Sbjct: 212 DHPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDAPVVPI---RPSAPGPLAAQP 268

Query: 183 APAPQP 188
           APAP P
Sbjct: 269 APAPGP 274


>gnl|CDD|166070 PLN02429, PLN02429, triosephosphate isomerase.
          Length = 315

 Score = 29.4 bits (65), Expect = 3.0
 Identities = 38/134 (28%), Positives = 55/134 (41%), Gaps = 13/134 (9%)

Query: 139 LLPPPPGSGMSKISP---PPAPSQNSSPIHQVNANANNWGDFTSAAPAPAPQPSRAADWN 195
           L  PP  SG+ +ISP     A S + S  H+VN++       +S+  +P    + A    
Sbjct: 6   LTAPPSFSGLRRISPKLDAAAVSSHQSFFHRVNSSTRLVSSSSSSHRSPRGVVAMAGSGK 65

Query: 196 LQEPQWTGRMRLVTKDKECILKL-EDRNSGELFAQCPIDTYPG-VAIETVSDSSRYFVLR 253
                           K+ I KL  D NS  L A   +   P  V I+ V  S      R
Sbjct: 66  FFVGGNWKC----NGTKDSIAKLISDLNSATLEADVDVVVSPPFVYIDQVKSS---LTDR 118

Query: 254 IKDDSGRQAFIGLG 267
           I D SG+ +++G G
Sbjct: 119 I-DISGQNSWVGKG 131


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 29.9 bits (68), Expect = 3.1
 Identities = 13/73 (17%), Positives = 22/73 (30%), Gaps = 15/73 (20%)

Query: 120 KDGADGQGKAKTRPTSTGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTS 179
             G+     A     +       P  +  +K    PAP+  ++                +
Sbjct: 37  GPGSTAAPTAAAAAAAAAA--SAPAAAPAAKAPAAPAPAPPAAAA-------------PA 81

Query: 180 AAPAPAPQPSRAA 192
           A P PA   + AA
Sbjct: 82  APPKPAAAAAAAA 94


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 29.5 bits (66), Expect = 3.6
 Identities = 20/66 (30%), Positives = 28/66 (42%), Gaps = 8/66 (12%)

Query: 132 RPTSTGGLLPPPPGS--GMSKISPPPAPSQNSSPIHQVNANANNWGDFT------SAAPA 183
                G + PPPP S  G+   S   APS +S+P +++   + N           SA PA
Sbjct: 179 GGNIPGAIQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPA 238

Query: 184 PAPQPS 189
             P P 
Sbjct: 239 SIPAPP 244


>gnl|CDD|234942 PRK01318, PRK01318, membrane protein insertase; Provisional.
          Length = 521

 Score = 29.5 bits (67), Expect = 4.1
 Identities = 14/82 (17%), Positives = 25/82 (30%), Gaps = 3/82 (3%)

Query: 41  DSSRYFVLRIKDDSGRQAFIGLGFSDRSDSFDLNVALQDHFKWLKKTEEIEKEKEEPKQE 100
           DSS   VL +   +    F   G +        NV   D   +    + +     + +  
Sbjct: 72  DSSPPVVL-LSPSTEHPYFAQSGLTGAD--GPDNVPNPDRTLYTADGDSLVLADGQNELP 128

Query: 101 LDLKFKEGETIKINMKITKKDG 122
           + L +  G  +      T   G
Sbjct: 129 VTLTWTNGNGLTFTKTYTLDRG 150



 Score = 29.1 bits (66), Expect = 5.4
 Identities = 13/79 (16%), Positives = 25/79 (31%), Gaps = 3/79 (3%)

Query: 245 DSSRYFVLRIKDDSGRQAFIGLGFSDRSDSFDLNVALQDHFKWLKKTEEIEKEKEEPKQE 304
           DSS   VL +   +    F   G +        NV   D   +    + +     + +  
Sbjct: 72  DSSPPVVL-LSPSTEHPYFAQSGLTGAD--GPDNVPNPDRTLYTADGDSLVLADGQNELP 128

Query: 305 LDLKFKEGETIKINMKITV 323
           + L +  G  +      T+
Sbjct: 129 VTLTWTNGNGLTFTKTYTL 147


>gnl|CDD|153327 cd07643, I-BAR_IMD_MIM, Inverse (I)-BAR, also known as the
           IRSp53/MIM homology Domain (IMD), of Missing In
           Metastasis.  The IMD domain, also called
           Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a
           dimerization and lipid-binding module that bends
           membranes and induces membrane protrusions. Members of
           this subfamily include missing in metastasis (MIM) or
           metastasis suppressor 1 (MTSS1), metastasis suppressor
           1-like (MTSSL) or ABBA (Actin-Bundling protein with
           BAIAP2 homology), and similar proteins. They contain an
           N-terminal IMD and a WASP homology 2 (WH2) actin-binding
           motif at the C-terminus. MIM was originally identified
           as a missing transcript from metastatic bladder and
           prostate cancer cells. It is a scaffold protein that
           functions in a signaling pathway between the PDGF
           receptor, Src kinases, and actin assembly. It may also
           function as a cofactor of the Sonic hedgehog (Shh)
           transcriptional pathway and may participate in tumor
           development and progression via this pathway. ABBA
           regulates actin and plasma membrane dynamics to promote
           the extension of radial glia, which is important in
           neuronal migration, axon guidance and neurogenesis. The
           IMD domain of MIM binds and bundles actin filaments,
           binds membranes, and interacts with the small GTPase
           Rac.
          Length = 231

 Score = 29.0 bits (65), Expect = 4.1
 Identities = 16/53 (30%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 73  LNVALQDHFK-WLKKTEEIEKE--KEEPKQELDLKFKEGETIKINMKITKKDG 122
           L   LQ+  + W K   +++K+  KE  K   ++K K  +TI++  K  K  G
Sbjct: 98  LVNPLQEKIEEWKKVANQLDKDHAKEYKKARQEIKKKSSDTIRLQKKARKGKG 150


>gnl|CDD|219262 pfam07001, BAT2_N, BAT2 N-terminus.  This family represents the
           N-terminus (approximately 200 residues) of the
           proline-rich protein BAT2. BAT2 is similar to other
           proteins with large proline-rich domains, such as some
           nuclear proteins, collagens, elastin, and synapsin.
          Length = 189

 Score = 28.7 bits (64), Expect = 4.2
 Identities = 20/78 (25%), Positives = 27/78 (34%), Gaps = 10/78 (12%)

Query: 114 NMKITKKDGADGQGKAKTRPTSTGGLLP-------PPPGSGMSKISPPPAPSQNSSPIHQ 166
           N+ +  KDG     K       +   LP       PP  S  S  SPPP P   +     
Sbjct: 76  NVSLVPKDGTGWASKQDQSDPKSSDALPPQTPESQPPAVSQTSASSPPPPPQPATPL--- 132

Query: 167 VNANANNWGDFTSAAPAP 184
           V   A +W   ++   A 
Sbjct: 133 VPGGAKSWAVASAKPGAQ 150


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 29.3 bits (65), Expect = 4.2
 Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 3/99 (3%)

Query: 290  KTEEIEKEKEEPKQELDLKFKEGETIKINMKITVSTYTLGLGRQAFIGLGFSDRSDSFDL 349
            K EE++K +EE K+   LK KE E  K   ++  +     +           D+  + + 
Sbjct: 1621 KAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEA 1680

Query: 350  NVALQDHFKWLKKTEEIEKEKEEPKQELDLKFKEGETIK 388
              A +D     K  E ++KE EE K+  +LK KE E  K
Sbjct: 1681 KKAEEDE---KKAAEALKKEAEEAKKAEELKKKEAEEKK 1716


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 28.2 bits (63), Expect = 4.2
 Identities = 12/42 (28%), Positives = 15/42 (35%)

Query: 122 GADGQGKAKTRPTSTGGLLPPPPGSGMSKISPPPAPSQNSSP 163
              G G       S+    P  PG G +    P AP  +SS 
Sbjct: 1   STGGDGTGGVSSGSSAPAPPAGPGPGPNAPPAPAAPGVDSSA 42


>gnl|CDD|236504 PRK09418, PRK09418, bifunctional 2',3'-cyclic nucleotide
           2'-phosphodiesterase/3'-nucleotidase precursor protein;
           Reviewed.
          Length = 780

 Score = 29.3 bits (65), Expect = 4.3
 Identities = 20/82 (24%), Positives = 30/82 (36%), Gaps = 7/82 (8%)

Query: 107 EGETIKINMKITKKDGAD---GQGKAKTRPTSTGGL----LPPPPGSGMSKISPPPAPSQ 159
           E E  K  + ITKK+  D   G     T PT  G        PP G G +  +P   P+ 
Sbjct: 642 ENEFAKYAIDITKKNDDDKETGGENPTTPPTGEGDNGENPTTPPTGEGNNGENPTTPPTG 701

Query: 160 NSSPIHQVNANANNWGDFTSAA 181
             +        + + G+   + 
Sbjct: 702 EGNNGGNPTTPSTDEGNNAGSG 723


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 28.9 bits (65), Expect = 4.3
 Identities = 23/100 (23%), Positives = 42/100 (42%), Gaps = 5/100 (5%)

Query: 67  RSDSFDLNVALQDHFKWLKKTEEIEKEKEEPKQELDLKFKEGETIK--INMKITKKDGAD 124
           R    D+N  L+  F+  KK EE E+E++E   +      E E  +   + + ++ D  D
Sbjct: 163 RRADVDVNSMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEED 222

Query: 125 GQGKAKTRPTSTGGLLPPPPGSGMSKISPPPAPSQNSSPI 164
                  +  S+    P  P S + K +   + + +SS  
Sbjct: 223 NDNTPSPKSGSSS---PAKPTSILKKSAAKRSEAPSSSKA 259


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 28.9 bits (64), Expect = 5.1
 Identities = 16/65 (24%), Positives = 24/65 (36%), Gaps = 7/65 (10%)

Query: 137 GGLLPPPPGSGMSKISPPPAP--SQNSSPIHQVNANANNWGDFTSAAPAPAPQPSRAADW 194
            G     P SG +   P P     Q ++P   +  ++        A PAP+  PS    W
Sbjct: 382 SGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSA-----APATPAPSAAPSPRVPW 436

Query: 195 NLQEP 199
           +   P
Sbjct: 437 DDAPP 441


>gnl|CDD|218632 pfam05549, Allexi_40kDa, Allexivirus 40kDa protein. 
          Length = 271

 Score = 28.6 bits (64), Expect = 5.6
 Identities = 8/26 (30%), Positives = 10/26 (38%)

Query: 141 PPPPGSGMSKISPPPAPSQNSSPIHQ 166
              P    +  S  P  S  S PI+Q
Sbjct: 145 LAGPSVSPTPASSHPTASDRSLPIYQ 170


>gnl|CDD|213014 cd12081, SH3_CASK, Src Homology 3 domain of
           Calcium/calmodulin-dependent Serine protein Kinase.
           CASK is a scaffolding protein that is highly expressed
           in the mammalian nervous system and plays roles in
           synaptic protein targeting, neural development, and gene
           expression regulation. CASK interacts with many
           different binding partners including parkin, neurexin,
           syndecans, calcium channel proteins, caskin, among
           others, to perform specific functions in different
           subcellular locations. Disruption of the CASK gene in
           mice results in neonatal lethality while mutations in
           the human gene have been associated with X-linked mental
           retardation. Drosophila CASK is associated with both
           pre- and postsynaptic membranes and is crucial in
           synaptic transmission and vesicle cycling. CASK contains
           an N-terminal calmodulin-dependent kinase (CaMK)-like
           domain, two L27 domains, followed by the core of three
           domains characteristic of MAGUK (membrane-associated
           guanylate kinase) proteins: PDZ, SH3, and guanylate
           kinase (GuK). In addition, it also contains the Hook
           (Protein 4.1 Binding) motif in between the SH3 and GuK
           domains. The GuK domain in MAGUK proteins is
           enzymatically inactive; instead, the domain mediates
           protein-protein interactions and associates
           intramolecularly with the SH3 domain. SH3 domains are
           protein interaction domains that bind to proline-rich
           ligands with moderate affinity and selectivity,
           preferentially to PxxP motifs. They play versatile and
           diverse roles in the cell including the regulation of
           enzymes, changing the subcellular localization of
           signaling pathway components, and mediating the
           formation of multiprotein complex assemblies.
          Length = 62

 Score = 26.4 bits (58), Expect = 5.7
 Identities = 14/47 (29%), Positives = 25/47 (53%), Gaps = 3/47 (6%)

Query: 97  PKQELDLKFKEGETIKINMKITKKDGADGQGKAKTRPTSTGGLLPPP 143
           P ++  ++F+ G+ ++I   I+K D    Q K +     T GL+P P
Sbjct: 17  PCKQAGIRFRVGDILQI---ISKDDHNWWQAKLENSKNGTAGLIPSP 60


>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
           carrier protein subunit; Validated.
          Length = 153

 Score = 27.9 bits (62), Expect = 5.9
 Identities = 14/56 (25%), Positives = 21/56 (37%), Gaps = 13/56 (23%)

Query: 138 GLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAPAPQPSRAAD 193
           G+        +   +P PAP+  S+P                A  APAP P+ A +
Sbjct: 42  GIDLSAVQEQVPTPAPAPAPAVPSAPT-------------PVAPAAPAPAPASAGE 84


>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 598

 Score = 28.7 bits (65), Expect = 6.6
 Identities = 11/56 (19%), Positives = 15/56 (26%)

Query: 137 GGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAPAPQPSRAA 192
           G   P   G G       P     ++      A     G   + A  P    + AA
Sbjct: 390 GPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAA 445


>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1.  This region is found
           in some of the Diaphanous related formins (Drfs). It
           consists of low complexity repeats of around 12
           residues.
          Length = 160

 Score = 27.6 bits (61), Expect = 6.9
 Identities = 15/52 (28%), Positives = 17/52 (32%), Gaps = 10/52 (19%)

Query: 137 GGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAPAPQP 188
            G+ PPPP    +   PPP P   S P             F    PA    P
Sbjct: 119 PGVPPPPPPFPGAPGIPPPPPGMGSPPPPP----------FGFGVPAAPVLP 160



 Score = 27.6 bits (61), Expect = 8.6
 Identities = 10/31 (32%), Positives = 11/31 (35%)

Query: 127 GKAKTRPTSTGGLLPPPPGSGMSKISPPPAP 157
           G     P   G   PPPP  G    + P  P
Sbjct: 130 GAPGIPPPPPGMGSPPPPPFGFGVPAAPVLP 160



 Score = 27.2 bits (60), Expect = 9.9
 Identities = 11/20 (55%), Positives = 13/20 (65%)

Query: 138 GLLPPPPGSGMSKISPPPAP 157
           G+ PPPP  G + I PPP P
Sbjct: 83  GIPPPPPLPGGAGIPPPPPP 102


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 28.8 bits (65), Expect = 6.9
 Identities = 15/74 (20%), Positives = 23/74 (31%), Gaps = 4/74 (5%)

Query: 133 PTSTGGLLPPPPGSGMSKISPPPAPSQNSS--PIHQVNANANNWGDFTSAAPAPAPQPSR 190
             + G   P       +  +P  AP+  ++        A A       + APAPAP P  
Sbjct: 386 GVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQ--PAPAPAPAPAPPS 443

Query: 191 AADWNLQEPQWTGR 204
            A         +  
Sbjct: 444 PAGNAPAGGAPSPP 457


>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
           (DUF2360).  This is the conserved 140 amino acid region
           of a family of proteins conserved from nematodes to
           humans. One C. elegans member is annotated as a
           Daf-16-dependent longevity protein 1 but this could not
           be confirmed. The function is unknown.
          Length = 147

 Score = 27.7 bits (62), Expect = 6.9
 Identities = 10/48 (20%), Positives = 15/48 (31%), Gaps = 7/48 (14%)

Query: 152 SPPPAPSQNSSPIHQVNANANNWGDFTSAAPAPAPQPSRAADWNLQEP 199
            PPPA +  +                 + A +P P  + A      EP
Sbjct: 59  PPPPASAITNGGPPPPPPAR-------AEAASPPPPEAPAEPPAEPEP 99


>gnl|CDD|224705 COG1792, MreC, Cell shape-determining protein [Cell envelope
           biogenesis, outer membrane].
          Length = 284

 Score = 28.1 bits (63), Expect = 7.0
 Identities = 20/110 (18%), Positives = 37/110 (33%), Gaps = 10/110 (9%)

Query: 240 IETVSDSSRYFVLRIKDDSGRQAFIGLGFSDRSDSFD-------LNVALQDHFKWLKKTE 292
           I  V  +    V  +        F+           D       L   L +  + L++ E
Sbjct: 36  IRQVVGTVLSPVQSVVAAPFE--FVDGVLEFLKSLKDLALENEELKKELAELEQLLEEVE 93

Query: 293 EIEKEKEEPKQELDLKFKEGETIKINMKITVSTYTLGLGRQAFIGLGFSD 342
            +E+E +  K+ LD K    +   I  ++ +S       +   I  G +D
Sbjct: 94  SLEEENKRLKELLDFKESSSDYDPIAARV-ISRSPDPWSQTIVIDKGSND 142


>gnl|CDD|132965 cd06634, STKc_TAO2, Catalytic domain of the Protein
           Serine/Threonine Kinase, Thousand-and-one amino acids 2.
            Serine/threonine kinases (STKs), thousand-and-one amino
           acids 2 (TAO2) subfamily, catalytic (c) domain. STKs
           catalyze the transfer of the gamma-phosphoryl group from
           ATP to serine/threonine residues on protein substrates.
           The TAO subfamily is part of a larger superfamily that
           includes the catalytic domains of other protein STKs,
           protein tyrosine kinases, RIO kinases, aminoglycoside
           phosphotransferase, choline kinase, and phosphoinositide
           3-kinase. TAO proteins possess mitogen-activated protein
           kinase (MAPK) kinase kinase (MAPKKK or MAP3K or MKKK)
           activity. MAPK signaling cascades are important in
           mediating cellular responses to extracellular signals.
           Human TAO2 is also known as prostate-derived Ste20-like
           kinase (PSK) and was identified in a screen for
           overexpressed RNAs in prostate cancer. TAO2 activates
           both p38 and c-Jun N-terminal kinase (JNK), by
           phosphorylating and activating the respective MAP/ERK
           kinases (MEKs, also known as MKKs or MAPKKs), MEK3/MEK6
           and MKK4/MKK7. TAO2 contains a long C-terminal extension
           with autoinhibitory segments. It is activated by the
           release of this inhibition and the phosphorylation of
           its activation loop serine. TAO2 functions as a
           regulator of actin cytoskeletal and microtubule
           organization. In addition, it regulates the transforming
           growth factor-activated kinase 1 (TAK1), which is a
           MAPKKK that plays an essential role in the signaling
           pathways of tumor necrosis factor (TNF), interleukin 1
           (IL-1), and Toll-like receptor (TLR).
          Length = 308

 Score = 28.5 bits (63), Expect = 7.1
 Identities = 19/84 (22%), Positives = 36/84 (42%), Gaps = 6/84 (7%)

Query: 143 PPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAPAPQPSRAADWNL------ 196
           PP   M+ +S     +QN SP  Q    +  + +F  +     PQ    ++  L      
Sbjct: 212 PPLFNMNAMSALYHIAQNESPALQSGHWSEYFRNFVDSCLQKIPQDRPTSEVLLKHRFVL 271

Query: 197 QEPQWTGRMRLVTKDKECILKLED 220
           +E   T  M L+ + K+ + +L++
Sbjct: 272 RERPPTVIMDLIQRTKDAVRELDN 295


>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824).  This is
           a repeating domain found in fungal proteins. It is
           proline-rich, and the function is not known.
          Length = 135

 Score = 27.2 bits (60), Expect = 8.6
 Identities = 14/56 (25%), Positives = 17/56 (30%), Gaps = 3/56 (5%)

Query: 141 PPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAP---APQPSRAAD 193
           PPPPGS       P      +       A            P     AP+P RA +
Sbjct: 78  PPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGPGQDPYAPRPRRADE 133


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 28.3 bits (64), Expect = 8.7
 Identities = 19/112 (16%), Positives = 46/112 (41%), Gaps = 22/112 (19%)

Query: 284 HFKWLKKTEEIEKEKEEPKQELDLKFKEGETIKINM------KITVSTYTLGLGRQAFIG 337
             +  +K E++++E+++  +E + + ++     I        +I      L  G  A + 
Sbjct: 550 KEELEEKKEKLQEEEDKLLEEAEKEAQQ----AIKEAKKEADEIIKELRQLQKGGYASVK 605

Query: 338 LGFSDRSDSFDLNVALQDHFKWLKKTEEIEKEKEEPKQELDLKFKEGETIKI 389
                          L +  K L K  E +++K++ ++E   + K G+ +K 
Sbjct: 606 AH------------ELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKY 645


>gnl|CDD|240015 cd04657, Piwi_ago-like, Piwi_ago-like: PIWI domain, Argonaute-like
           subfamily. Argonaute is the central component of the
           RNA-induced silencing complex (RISC) and related
           complexes. The PIWI domain is the C-terminal portion of
           Argonaute and consists of two subdomains, one of which
           provides the 5' anchoring of the guide RNA and the
           other, the catalytic site for slicing.
          Length = 426

 Score = 28.0 bits (63), Expect = 9.1
 Identities = 17/51 (33%), Positives = 23/51 (45%), Gaps = 8/51 (15%)

Query: 176 DFTSAAPAPAPQ-PSRAA-----DWNLQEPQWTGRMRLVTKDKECILKLED 220
           D T  +P      PS AA     DW+L   Q+   +RL +  +E I  LE 
Sbjct: 205 DVTHPSPGDPAGAPSIAAVVASVDWHLA--QYPASVRLQSHRQEIIDDLES 253


>gnl|CDD|219805 pfam08347, CTNNB1_binding, N-terminal CTNNB1 binding.  This region
           tends to appear at the N-terminus of proteins also
           containing DNA-binding HMG (high mobility group) boxes
           (pfam00505) and appears to bind the armadillo repeat of
           CTNNB1 (beta-catenin), forming a stable complex.
           Signaling by Wnt through TCF/LCF is involved in
           developmental patterning, induction of neural tissues,
           cell fate decisions and stem cell differentiation.
           Isoforms of HMG T-cell factors lacking the N-terminal
           CTNNB1-binding domain cannot fulfill their role as
           transcriptional activators in T-cell differentiation.
          Length = 200

 Score = 27.6 bits (61), Expect = 9.2
 Identities = 9/53 (16%), Positives = 12/53 (22%)

Query: 135 STGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAPAPQ 187
              G    PP SG   +  P     N S      ++             P   
Sbjct: 91  QDMGYYKGPPYSGYPFLMLPNPYLPNGSLSPLPPSSNKVPVVQPPHHVHPLTP 143


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 28.1 bits (63), Expect = 9.4
 Identities = 18/78 (23%), Positives = 25/78 (32%), Gaps = 10/78 (12%)

Query: 133 PTSTGGLLPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNWGDFTSAAPAPAPQPSRAA 192
           P+S     PPP     +  SPPP P+    P  Q           ++A P P    +  A
Sbjct: 216 PSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQV------PPLSTAKPTPPSASATPA 269

Query: 193 DWNLQEPQWTGRMRLVTK 210
                    T     + K
Sbjct: 270 ----PIGGITLDDDAIAK 283


>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
          Length = 306

 Score = 27.7 bits (61), Expect = 9.6
 Identities = 14/52 (26%), Positives = 20/52 (38%), Gaps = 3/52 (5%)

Query: 140 LPPPPGSGMSKISPPPAPSQNSSPIHQVNANANNW---GDFTSAAPAPAPQP 188
           LPP     ++ + P  AP+   +P+ Q  A             A P   PQP
Sbjct: 8   LPPGLAGLVANVPPAAAPTPQPNPVIQPQAPVQPGQPGAPQQLAIPTQQPQP 59


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.133    0.387 

Gapped
Lambda     K      H
   0.267   0.0828    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,999,343
Number of extensions: 1960070
Number of successful extensions: 3061
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2980
Number of HSP's successfully gapped: 101
Length of query: 393
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 294
Effective length of database: 6,546,556
Effective search space: 1924687464
Effective search space used: 1924687464
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 60 (26.7 bits)