RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3725
         (760 letters)



>gnl|CDD|241276 cd01243, PH_MRCK, MRCK (myotonic dystrophy-related Cdc42-binding
           kinase) pleckstrin homology (PH) domain.  MRCK is
           thought to be coincidence detector of signaling by Cdc42
           and phosphoinositides. It has been shown to promote
           cytoskeletal reorganization, which affects many
           biological processes. There are 2 members of this
           family: MRCKalpha and MRCKbeta. MRCK consists of a
           serine/threonine kinase domain, a cysteine rich (C1)
           region, a PH domain and a p21 binding motif. The MRCK PH
           domain is responsible for its targeting to cell to cell
           junctions. PH domains have diverse functions, but in
           general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 136

 Score =  248 bits (636), Expect = 7e-79
 Identities = 100/136 (73%), Positives = 111/136 (81%), Gaps = 1/136 (0%)

Query: 270 PLGIDPTRGIGTAYEGYVKVPKTGGVKKGWVRQFVVVCDFKLFLYDISPDRNALPAVYVS 329
           PLGIDPTRGIGTAYEGYVKVPK GGVKKGW RQF VVCDFKLFLYDIS D+N  P+V  S
Sbjct: 1   PLGIDPTRGIGTAYEGYVKVPKPGGVKKGWQRQFAVVCDFKLFLYDISEDKNPQPSVVAS 60

Query: 330 LVLDMRDEDFAVSGVRESDVIHATKKDIPCIFRITTSLMDPPG-TKNHTLMLADSDTEKT 388
            VLDMRDE+F+VS V ESDVIHA KKDIPCIFR+T S +  P   K+  L+LADS+ EK 
Sbjct: 61  QVLDMRDEEFSVSSVLESDVIHANKKDIPCIFRVTASQLHAPLLLKSSQLLLADSEAEKQ 120

Query: 389 KWVVALSELHRILKRN 404
           KWV AL ELHR+LK+N
Sbjct: 121 KWVGALKELHRLLKKN 136


>gnl|CDD|216115 pfam00780, CNH, CNH domain.  Domain found in NIK1-like kinase,
           mouse citron and yeast ROM1, ROM2. Unpublished
           observations.
          Length = 266

 Score =  170 bits (432), Expect = 6e-48
 Identities = 65/242 (26%), Positives = 105/242 (43%), Gaps = 17/242 (7%)

Query: 457 KIYQMDYIPEEQLLVVLAGKQRYVRLVPVRALDGDEVE----WVKIPETKGCLSFTTGPL 512
            + Q+  + E  LL+ L+ KQ  + + P+ ALD  E+       K+PETKGC  F     
Sbjct: 36  SVTQLAVLEEFNLLLSLSDKQ--LYVYPLSALDSKELSSKSAPQKLPETKGCSFFARAGY 93

Query: 513 THTRTQHCLALAVKRQNSSQIILYEITRTKTRHKRL-HEVILPTLAQCIHIFSEGRLCVG 571
              R    L +AVKR     I LYE      +  +L  E  +P     I      +LCVG
Sbjct: 94  CGGR---FLVVAVKRT----IKLYEWYEPLKKFHKLFKEFYVPAEVFSIAFLKP-KLCVG 145

Query: 572 YQSGFSIYKFSQDNRPIPLIHQDNPLVSLLTY-SPVDALLAIELPRGEFLLVFHSLAAYV 630
              GF +           L+   +  +        +  L  + L   EFLL +   A YV
Sbjct: 146 CAKGFELVDLDTGA-TQSLLDPADQSLDFAARKENLKPLAVVRLSDDEFLLCYDEFAVYV 204

Query: 631 DSQGHKSREKEIMYPALPTGASYMDGQLLIFSETHVDVFNAESGDWLQTVNIRRALPLDT 690
           + QG +SR++E+ +   P   +Y+   LL F    +++ + E+G+ +QT+   +   L +
Sbjct: 205 NLQGRRSRDQELEWEGAPESVAYLYPYLLAFHPNFIEIRSVETGELVQTIPGPKIRFLHS 264

Query: 691 RG 692
             
Sbjct: 265 GS 266


>gnl|CDD|215738 pfam00130, C1_1, Phorbol esters/diacylglycerol binding domain (C1
           domain).  This domain is also known as the Protein
           kinase C conserved region 1 (C1) domain.
          Length = 53

 Score = 73.3 bits (180), Expect = 2e-16
 Identities = 24/51 (47%), Positives = 29/51 (56%)

Query: 211 HQFITRTFTSPTKCNHCTSLMVGLTRQGVVCDICGFACHLSCCDKVPPSCP 261
           H F+ RTF SPT C+HC   + GL +QG+ C  CG   H  C   VPP C 
Sbjct: 1   HHFVHRTFKSPTFCDHCGEFLWGLGKQGLKCSWCGLNVHKRCHSLVPPECG 51


>gnl|CDD|214481 smart00036, CNH, Domain found in NIK1-like kinases, mouse citron
           and yeast ROM1, ROM2. 
          Length = 302

 Score = 80.5 bits (199), Expect = 2e-16
 Identities = 50/250 (20%), Positives = 102/250 (40%), Gaps = 33/250 (13%)

Query: 458 IYQMDYIPEEQLLVVLAGKQRYVRLVPVRALDGDEVE------------WVKIPETKGCL 505
           + Q+  + E  +L++++GK+  +   P+ AL   +                KIP+ KGC 
Sbjct: 45  VTQIWVLEENNVLLMISGKKPQLYSHPLSALVEKKEALGSARLVIRKNVLTKIPDVKGCH 104

Query: 506 SFTTGPLTHTRTQHCLALAVKRQNSSQIILYEITRTKTRHKRLHEVILPTLAQCIHIFSE 565
                       +  L L V  Q+S  ++L +      + K      L  L   + +F E
Sbjct: 105 LCAVV-----NGKRSLFLCVALQSS--VVLLQWYNPLKKFKLFKSKFLFPLISPVPVFVE 157

Query: 566 --------GRLCVG-YQSGFSIYKFSQDNRPIPLIHQDNPLVSLLTYSPVDALLAIELPR 616
                     +C+G  + G  + +F +      +  +D  L  L   + +  +  +++PR
Sbjct: 158 LVSSSFERPGICIGSDKGGGDVVQFHESL----VSKEDLSLPFLSEETSLKPISVVQVPR 213

Query: 617 GEFLLVFHSLAAYVDSQGH-KSREKEIMYPALPTGASYMDGQLLIFSETHVDVFNAESGD 675
            E LL +     +V+  G  +SR   + +  +P   +Y    LL F +  +++ + ++G+
Sbjct: 214 DEVLLCYDEFGVFVNLYGKRRSRNPILHWEFMPESFAYHSPYLLAFHDNGIEIRSIKTGE 273

Query: 676 WLQTVNIRRA 685
            LQ +  R  
Sbjct: 274 LLQELADRET 283


>gnl|CDD|237996 cd00029, C1, Protein kinase C conserved region 1 (C1) .
           Cysteine-rich zinc binding domain. Some members of this
           domain family bind phorbol esters and diacylglycerol,
           some are reported to bind RasGTP. May occur in tandem
           arrangement. Diacylglycerol (DAG) is a second messenger,
           released by activation of Phospholipase D. Phorbol
           Esters (PE) can act as analogues of DAG and mimic its
           downstream effects in, for example, tumor promotion.
           Protein Kinases C are activated by DAG/PE, this
           activation is mediated by their N-terminal conserved
           region (C1). DAG/PE binding may be phospholipid
           dependent. C1 domains may also mediate DAG/PE signals in
           chimaerins (a family of Rac GTPase activating proteins),
           RasGRPs (exchange factors for Ras/Rap1), and Munc13
           isoforms (scaffolding proteins involved in exocytosis).
          Length = 50

 Score = 72.5 bits (178), Expect = 4e-16
 Identities = 23/50 (46%), Positives = 31/50 (62%)

Query: 211 HQFITRTFTSPTKCNHCTSLMVGLTRQGVVCDICGFACHLSCCDKVPPSC 260
           H+F+ ++F  PT C+ C   + GL +QG+ C  C   CH  C DKVPPSC
Sbjct: 1   HRFVRKSFFKPTFCDVCRKSIWGLFKQGLRCSWCKVKCHKKCADKVPPSC 50


>gnl|CDD|197519 smart00109, C1, Protein kinase C conserved region 1 (C1) domains
           (Cysteine-rich domains).  Some bind phorbol esters and
           diacylglycerol. Some bind RasGTP. Zinc-binding domains.
          Length = 50

 Score = 65.6 bits (160), Expect = 1e-13
 Identities = 22/50 (44%), Positives = 28/50 (56%)

Query: 211 HQFITRTFTSPTKCNHCTSLMVGLTRQGVVCDICGFACHLSCCDKVPPSC 260
           H+ + RTFT PT C  C   + G  +QG+ C  C   CH  C DKVP +C
Sbjct: 1   HKHVFRTFTKPTFCCVCRKSIWGSFKQGLRCSECKVKCHKKCADKVPKAC 50


>gnl|CDD|241275 cd01242, PH_ROCK, Rho-associated coiled-coil containing protein
           kinase pleckstrin homology (PH) domain.  ROCK is a
           serine/threonine kinase that binds GTP-Rho. It consists
           of a kinase domain, a coiled coil region and a PH
           domain. The ROCK PH domain is interrupted by a C1
           domain. ROCK plays a role in cellular functions, such as
           contraction, adhesion, migration, and proliferation and
           in the regulation of apoptosis. There are two ROCK
           isoforms, ROCK1 and ROCK2. In ROCK2 the Rho Binding
           Domain (RBD) and the PH domain work together in membrane
           localization with RBD receiving the RhoA signal and the
           PH domain receiving the phospholipid signal. PH domains
           have diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 126

 Score = 54.7 bits (132), Expect = 5e-09
 Identities = 31/82 (37%), Positives = 48/82 (58%), Gaps = 7/82 (8%)

Query: 284 EGYVKVPKTGGVKK-GWVRQFVVVCDFKLFLYDISPDR-NALPAVYVSLVLDMRDEDFAV 341
           EG++ +P    +K+ GW +QFVVV   K+  Y+   D+ N+ P     L++D+  + F V
Sbjct: 4   EGWLSIPNKQNIKRHGWKKQFVVVSSKKILFYNSEQDKQNSDP----FLIIDL-SKLFHV 58

Query: 342 SGVRESDVIHATKKDIPCIFRI 363
             V + DVI A  K+IP IF+I
Sbjct: 59  RSVTQGDVIRADAKEIPRIFQI 80


>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
            Rho/Rac/Cdc42-like GTPases [Signal transduction
            mechanisms].
          Length = 1175

 Score = 59.1 bits (143), Expect = 6e-09
 Identities = 54/241 (22%), Positives = 98/241 (40%), Gaps = 18/241 (7%)

Query: 457  KIYQMDYIPEEQLLVVLAGKQRYVRLVPVRALDGDEVEWVK-IPETKGCLSFT-TGPLTH 514
             I Q+  I E +L+++L+ K+ Y    P+  +D    E VK      G +SF   G    
Sbjct: 903  NISQIIVIEEYKLMLLLSDKKLYS--CPLDVIDASTEENVKKSRIVNGHVSFFKQGFCNG 960

Query: 515  TR---TQHCLALAVKRQNSSQIILYEITRTKTRHKRL-----HEVILPTLAQCIHIFSEG 566
             R        +L+         +  +  ++    K L      E+ +P+    +H F + 
Sbjct: 961  KRLVCAVKSSSLSATLAVIEAPLALKKNKSGNLKKALTIELSTELYVPSEPLSVH-FLKN 1019

Query: 567  RLCVGYQSGFSIYKFSQDNRPIPLIHQDNPLVSLLTYSPVDALLAIELPRGEFLLVFHSL 626
            +LC+G + GF I       R   L++  +              +AI    GEFLL +   
Sbjct: 1020 KLCIGCKKGFEIVSLEN-LRTESLLNPADTSPLFFEKKENTKPIAIFRVSGEFLLCYSEF 1078

Query: 627  AAYVDSQGHKSREKEIM-YPALPTGASYMDGQLLIFSETHVDVFNAESGDWLQTV---NI 682
            A +V+ QG + R   I  +   P   +     +L F    +++ + E+G+ ++ +   NI
Sbjct: 1079 AFFVNDQGWRKRTSWIFHWEGEPQEFALSYPYILAFEPNFIEIRHIETGELIRCILGHNI 1138

Query: 683  R 683
            R
Sbjct: 1139 R 1139


>gnl|CDD|215766 pfam00169, PH, PH domain.  PH stands for pleckstrin homology.
          Length = 101

 Score = 50.2 bits (120), Expect = 1e-07
 Identities = 28/117 (23%), Positives = 48/117 (41%), Gaps = 18/117 (15%)

Query: 283 YEGYVKVPKTGGVKKGWVRQFVVVCDFKLFLYDISPDRNALPAVYVSLVLDMRDEDFAVS 342
            EG++   K  G +K W +++ V+ D  L  Y  S   ++ P   + L           S
Sbjct: 3   KEGWLLK-KGSGGRKSWKKRYFVLFDGVLLYYKDSKKSSSRPKGSIPL-----------S 50

Query: 343 GVRESDVIHATKKDIPCIFRITTSLMDPPGTKNHTLMLADSDTEKTKWVVALSELHR 399
           G + + V  +        F I T      G +   L+ A+S+ E+ +WV A+    R
Sbjct: 51  GCQVTKVPDSEDGKRKNCFEIRT------GDRETFLLQAESEEERKEWVKAIRSAIR 101


>gnl|CDD|214574 smart00233, PH, Pleckstrin homology domain.  Domain commonly found
           in eukaryotic signalling proteins. The domain family
           possesses multiple functions including the abilities to
           bind inositol phosphates, and various proteins. PH
           domains have been found to possess inserted domains
           (such as in PLC gamma, syntrophins) and to be inserted
           within other domains. Mutations in Brutons tyrosine
           kinase (Btk) within its PH domain cause X-linked
           agammaglobulinaemia (XLA) in patients. Point mutations
           cluster into the positively charged end of the molecule
           around the predicted binding site for
           phosphatidylinositol lipids.
          Length = 102

 Score = 48.7 bits (116), Expect = 4e-07
 Identities = 30/117 (25%), Positives = 50/117 (42%), Gaps = 17/117 (14%)

Query: 283 YEGYVKVPKTGGVKKGWVRQFVVVCDFKLFLYDISPDRNALPAVYVSLVLDMRDEDFAVS 342
            EG++   K+GG KK W +++ V+ +  L  Y    D+ +        +           
Sbjct: 3   KEGWLYK-KSGGGKKSWKKRYFVLFNSTLLYYKSKKDKKSYKPKGSIDLSGCT------- 54

Query: 343 GVRESDVIHATKKDIPCIFRITTSLMDPPGTKNHTLMLADSDTEKTKWVVALSELHR 399
            VRE+    ++KK  P  F I T        +   L+ A+S+ E+ KWV AL +   
Sbjct: 55  -VREAPDPDSSKK--PHCFEIKT------SDRKTLLLQAESEEEREKWVEALRKAIA 102


>gnl|CDD|241231 cd00821, PH, Pleckstrin homology (PH) domain.  PH domains have
           diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 92

 Score = 43.5 bits (102), Expect = 2e-05
 Identities = 23/111 (20%), Positives = 42/111 (37%), Gaps = 19/111 (17%)

Query: 283 YEGYVKVPKTGGVKKGWVRQFVVVCDFKLFLYDISPDRNALPAVYVSLVLDMRDEDFAVS 342
            EG++K  + G   K W +++ V+ D  L  Y    D +  P   + L   +        
Sbjct: 1   KEGWLKK-RGGKGLKSWKKRWFVLFDDVLLYYKSKKDSSKKPKGLIPLSDGL-------- 51

Query: 343 GVRESDVIHATKKDIPCIFRITTSLMDPPGTKNHTLMLADSDTEKTKWVVA 393
                +V   +    P  F + T     P       + A+S+ E+ +W+ A
Sbjct: 52  -----EVELVSSSGKPNCFELVT-----PDRGRTYYLQAESEEEREEWLEA 92


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 41.2 bits (97), Expect = 0.002
 Identities = 29/140 (20%), Positives = 59/140 (42%), Gaps = 3/140 (2%)

Query: 56  GGPGDKNWRDRRSQKLEKM--ELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICD 113
           GG  +K     + ++L+++  EL  L++ L    +    +  EL      L   ++++ +
Sbjct: 654 GGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEE 713

Query: 114 FRLKMESVSMDLRRKDSQLKELQSRLESGDI-LERPASQISYLDHFLKESRHTGSVESED 172
              ++E +  +L   + +L++LQSRLE  +  LE    ++  L   L+E         E 
Sbjct: 714 LERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEA 773

Query: 173 GDIEDNRVPSIASSRSNLSE 192
                  +  +   R  L E
Sbjct: 774 LAKLKEEIEELEEKRQALQE 793



 Score = 35.8 bits (83), Expect = 0.11
 Identities = 16/80 (20%), Positives = 41/80 (51%), Gaps = 2/80 (2%)

Query: 64  RDRRSQKLEKME--LLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESV 121
            +   ++LE+ E  L  L+  L S  Q + ++ +E+ +   E+   ++++ +   ++E +
Sbjct: 795 LEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEEL 854

Query: 122 SMDLRRKDSQLKELQSRLES 141
             +L     +L+EL++  E 
Sbjct: 855 EKELEELKEELEELEAEKEE 874



 Score = 33.9 bits (78), Expect = 0.40
 Identities = 21/107 (19%), Positives = 50/107 (46%), Gaps = 2/107 (1%)

Query: 73  KMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQL 132
           K E+  L+    +  +   ++ EEL +    L A ++E+     + E +  ++   + ++
Sbjct: 778 KEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEI 837

Query: 133 KELQSRLESGDI-LERPASQISYLDHFLKESR-HTGSVESEDGDIED 177
           +EL+ +L+  +  LE    ++  L   L+E       +E E  ++E+
Sbjct: 838 EELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEE 884



 Score = 33.5 bits (77), Expect = 0.52
 Identities = 24/89 (26%), Positives = 38/89 (42%), Gaps = 4/89 (4%)

Query: 66  RRSQKLEKMELLNLQSSLNSEI----QAKTQISEELSKTRSELIAAQKEICDFRLKMESV 121
               K E  EL   +  L  E+    + K ++ EEL +  SEL   ++EI   R ++E +
Sbjct: 858 LEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEEL 917

Query: 122 SMDLRRKDSQLKELQSRLESGDILERPAS 150
              L R + +L EL+  LE          
Sbjct: 918 EAKLERLEVELPELEEELEEEYEDTLETE 946



 Score = 31.6 bits (72), Expect = 1.8
 Identities = 17/86 (19%), Positives = 41/86 (47%), Gaps = 5/86 (5%)

Query: 60  DKNWRDRRSQKLEKME-----LLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDF 114
            +  ++R  +  E++E     L  L+  +    + +  + EEL +   EL  A++ +   
Sbjct: 753 LEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDAL 812

Query: 115 RLKMESVSMDLRRKDSQLKELQSRLE 140
             ++ES+     R + +++EL+  +E
Sbjct: 813 ERELESLEQRRERLEQEIEELEEEIE 838



 Score = 30.8 bits (70), Expect = 3.5
 Identities = 22/90 (24%), Positives = 41/90 (45%), Gaps = 3/90 (3%)

Query: 64  RDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSM 123
                  L   +L  L+  L    +  +++ EEL + + EL  A+KEI + + ++E +  
Sbjct: 222 LRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELRE 281

Query: 124 DLRRKDSQLKELQSRLESGDILERPASQIS 153
           +L     +L EL+  +E    LE   S + 
Sbjct: 282 ELEELQEELLELKEEIEE---LEGEISLLR 308



 Score = 30.5 bits (69), Expect = 5.0
 Identities = 19/92 (20%), Positives = 41/92 (44%), Gaps = 12/92 (13%)

Query: 61  KNWRDRRSQKLEKMELL------------NLQSSLNSEIQAKTQISEELSKTRSELIAAQ 108
           ++   RR +  +++E L             L+  L    +   ++ EEL +  +E    +
Sbjct: 817 ESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELE 876

Query: 109 KEICDFRLKMESVSMDLRRKDSQLKELQSRLE 140
            E+ +   + E +  +LR  +S+L EL+  +E
Sbjct: 877 DELKELEEEKEELEEELRELESELAELKEEIE 908



 Score = 29.3 bits (66), Expect = 9.3
 Identities = 19/86 (22%), Positives = 40/86 (46%), Gaps = 4/86 (4%)

Query: 60  DKNWRDRRSQKLEKMELLNLQSSLNSEIQAK----TQISEELSKTRSELIAAQKEICDFR 115
           ++   +    + E  EL         EI+       ++ EEL + + EL+  ++EI +  
Sbjct: 242 EELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELE 301

Query: 116 LKMESVSMDLRRKDSQLKELQSRLES 141
            ++  +   L   +++L+EL+ RLE 
Sbjct: 302 GEISLLRERLEELENELEELEERLEE 327


>gnl|CDD|241430 cd13276, PH_AtPH1, Arabidopsis thaliana Pleckstrin homolog (PH) 1
           (AtPH1) PH domain.  AtPH1 is expressed in all plant
           tissue and is proposed to be the plant homolog of human
           pleckstrin. Pleckstrin consists of two PH domains
           separated by a linker region, while AtPH has a single PH
           domain with a short N-terminal extension. AtPH1 binds
           PtdIns3P specifically and is thought to be an adaptor
           molecule since it has no obvious catalytic functions. PH
           domains have diverse functions, but in general are
           involved in targeting proteins to the appropriate
           cellular location or in the interaction with a binding
           partner. They share little sequence conservation, but
           all have a common fold, which is electrostatically
           polarized. Less than 10% of PH domains bind
           phosphoinositide phosphates (PIPs) with high affinity
           and specificity. PH domains are distinguished from other
           PIP-binding domains by their specific high-affinity
           binding to PIPs with two vicinal phosphate groups:
           PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
           results in targeting some PH domain proteins to the
           plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 117

 Score = 38.1 bits (89), Expect = 0.002
 Identities = 32/113 (28%), Positives = 48/113 (42%), Gaps = 24/113 (21%)

Query: 283 YEGYVKVPKTGGVKKGWVRQFVVVCDFKLFLY-DISPDRNALPAVYVSLVLDMRDEDFAV 341
             G++   K GG  K W R++ V+   KLF + D  PD      + +S  L        V
Sbjct: 9   KAGWLT--KQGGSIKTWRRRWFVLKQGKLFYFKDEDPDSEPRGVIDLSDCL-------TV 59

Query: 342 SGVRESDVIHATKKDIPCIFRITTSLMDPPGTKNHTLMLADSDTEKTKWVVAL 394
               E     AT K     F ++T    P  T     ++ADS+ EK +W+ A+
Sbjct: 60  KSAEE-----ATNK--EFAFEVST----PERTF---YLIADSEKEKEEWISAI 98


>gnl|CDD|117396 pfam08826, DMPK_coil, DMPK coiled coil domain like.  This domain is
           found in the myotonic dystrophy protein kinase (DMPK)
           and adopts a coiled coil structure. It plays a role in
           dimerisation.
          Length = 61

 Score = 36.4 bits (84), Expect = 0.003
 Identities = 18/60 (30%), Positives = 37/60 (61%)

Query: 79  LQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQLKELQSR 138
           LQS+L +EI+AK  + EEL K ++  I  + ++ +   K   +  ++R+   +++EL++R
Sbjct: 2   LQSALEAEIRAKQSLQEELEKVKAANINFESKLQEAEAKNRELEAEVRQLKKRMEELRAR 61


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 36.9 bits (86), Expect = 0.010
 Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 2/74 (2%)

Query: 69  QKLEKM-ELLN-LQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLR 126
           + LE+  E+ N  Q     E+    +  EEL   R +L   +KEI   + + ES   +L 
Sbjct: 31  EDLEEQAEIANEAQQKYERELVKHAEDIEELQALRKQLNELKKEIAQLKAEAESAQAELS 90

Query: 127 RKDSQLKELQSRLE 140
             +   +E +  LE
Sbjct: 91  EAEESWEEQKKMLE 104


>gnl|CDD|190531 pfam03126, Plus-3, Plus-3 domain.  This domain is about 90 residues
           in length and is often found associated with the
           pfam02213 domain. The function of this domain is
           uncertain. It is possible that this domain is involved
           in DNA binding as it has three conserved positively
           charged residues, hence this domain has been named the
           plus-3 domain. It is found in yeast Rtf1 which may be a
           transcription elongation factor.
          Length = 105

 Score = 35.3 bits (82), Expect = 0.020
 Identities = 15/54 (27%), Positives = 24/54 (44%), Gaps = 5/54 (9%)

Query: 445 RYVQIARVGDTKKIYQMDYIPEEQLLVVLAGKQRYVRLVPVRAL-DGD--EVEW 495
           R  QI  V + KK YQ+     ++ L V  GK    R+  +  + +    E E+
Sbjct: 47  RLAQITGVKEGKKPYQLGGKTTDKYLKVTHGKSE--RVFRISFVSNSPFTEEEF 98



 Score = 29.2 bits (66), Expect = 2.4
 Identities = 13/41 (31%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 415 KIARVGDTKKIYQMDYIPKEQLLVVLAGKQ-RYVQIARVGD 454
           +I  V + KK YQ+     ++ L V  GK  R  +I+ V +
Sbjct: 50  QITGVKEGKKPYQLGGKTTDKYLKVTHGKSERVFRISFVSN 90


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 37.0 bits (86), Expect = 0.033
 Identities = 16/79 (20%), Positives = 32/79 (40%), Gaps = 16/79 (20%)

Query: 63  WRDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVS 122
           W   R+  +++   L            K +   +  +T+ +L     EI   R K+E + 
Sbjct: 359 WEAHRTPLIDEYRSL------------KEKNRNKEDETQRQL----DEIKKLRNKIEELE 402

Query: 123 MDLRRKDSQLKELQSRLES 141
            +L+ K+   K+L    E+
Sbjct: 403 SELQTKEQLYKQLLDEYEN 421


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 35.0 bits (81), Expect = 0.14
 Identities = 19/101 (18%), Positives = 44/101 (43%), Gaps = 14/101 (13%)

Query: 72  EKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQ 131
            K +L + + +L  E++   Q+ +EL       +   KE      K++ +  ++  K  +
Sbjct: 173 IKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKE------KLKKLLQEIMIKVKK 226

Query: 132 LKELQSRLES--GDILERPA------SQISYLDHFLKESRH 164
           L+EL+  L+     I +         ++I+  +  L++ R 
Sbjct: 227 LEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRG 267


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 34.6 bits (80), Expect = 0.15
 Identities = 28/109 (25%), Positives = 50/109 (45%), Gaps = 13/109 (11%)

Query: 74  MELLNLQSSLNSEIQAKTQISEE----LSKTRSEL-IAAQKEICDFRLKMESVSMDLRRK 128
            + LNL +S+  +++ K Q  +E    L +   EL +    E+   R ++ S+S+ +  K
Sbjct: 167 NKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEK 226

Query: 129 DSQLKELQSRL-ESGDILERPASQISYL-------DHFLKESRHTGSVE 169
             QL+ELQ  L E    +E   ++ S L       +   +E R   + E
Sbjct: 227 RKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREECRGWSAKE 275


>gnl|CDD|219977 pfam08700, Vps51, Vps51/Vps67.  This family includes a presumed
           domain found in a number of components of vesicular
           transport. The VFT tethering complex (also known as GARP
           complex, Golgi associated retrograde protein complex,
           Vps53 tethering complex) is a conserved eukaryotic
           docking complex which is involved recycling of proteins
           from endosomes to the late Golgi. Vps51 (also known as
           Vps67) is a subunit of VFT and interacts with the SNARE
           Tlg1. Cog1_N is the N-terminus of the Cog1 subunit of
           the eight-unit Conserved Oligomeric Golgi (COG) complex
           that participates in retrograde vesicular transport and
           is required to maintain normal Golgi structure and
           function. The subunits are located in two lobes and Cog1
           serves to bind the two lobes together probably via the
           highly conserved N-terminal domain of approximately 85
           residues.
          Length = 87

 Score = 32.3 bits (74), Expect = 0.15
 Identities = 22/72 (30%), Positives = 33/72 (45%), Gaps = 11/72 (15%)

Query: 75  ELLNLQSSLNSEIQAKTQISEELSKT----RSELIAAQKEICDFRLKMESVSMDLRRKDS 130
           EL  ++  L +EI+      EEL +       +LI A   I     KM+S    L +K S
Sbjct: 23  ELRQIEKKLRAEIEEL---QEELRQLVYENYRDLIKAADTI----SKMKSEMESLSQKLS 75

Query: 131 QLKELQSRLESG 142
           +LK+   +L S 
Sbjct: 76  ELKQALEKLSSK 87


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 34.6 bits (80), Expect = 0.21
 Identities = 20/85 (23%), Positives = 43/85 (50%), Gaps = 9/85 (10%)

Query: 65  DRRSQKLE------KMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEI-CDFRLK 117
            R    L+      + +L +L+  L  E +AK ++  +LSK  +E+   + +   +  L+
Sbjct: 256 SRERANLQAQLRQLEHDLDSLREQLEEESEAKAELERQLSKANAEIQQWRSKFESEGALR 315

Query: 118 MESVSMDLRRKDSQ-LKELQSRLES 141
            E +  +L++K +Q + EL+   E+
Sbjct: 316 AEELE-ELKKKLNQKISELEEAAEA 339


>gnl|CDD|240133 cd04863, MtLigD_Pol_like, MtLigD_Pol_like: Polymerase (Pol) domain
           of bacterial LigD proteins similar to Mycobacterium
           tuberculosis (Mt)LigD. The LigD Pol domain belongs to
           the archaeal/eukaryal primase (AEP) superfamily. In
           prokaryotes, LigD along with Ku is required for
           non-homologous end joining (NHEJ)-mediated repair of DNA
           double-strand breaks (DSB). NHEJ-mediated DNA DSB repair
           is error-prone. MtLigD is monomeric and contains an
           N-terminal Pol domain, a central phosphoesterase module,
           and a C-terminal ligase domain. It has been suggested
           that LigD Pol contributes to NHEJ-mediated DNA DSB
           repair in vivo, by filling in short 5'-overhangs with
           ribonucleotides; the filled in termini would then be
           sealed by the associated LigD ligase domain, resulting
           in short stretches of RNA incorporated into the genomic
           DNA. The MtLigD Pol domain is stimulated by manganese,
           is error-prone, and prefers adding rNTPs to dNTPs in
           vitro. The MtLigD Pol domain has been shown to prefer
           DNA gapped substrates containing a 5'-phosphate group at
           the gap.
          Length = 231

 Score = 33.1 bits (76), Expect = 0.39
 Identities = 21/67 (31%), Positives = 29/67 (43%), Gaps = 7/67 (10%)

Query: 670 NAESG--DWLQTVNIRRALPLDTRGSLCFSLANDIPYVVYLANIHQREL-INLMTGGDMR 726
           N  SG  DWL T  +R        G+L + L ND+  + + AN+   EL +   T     
Sbjct: 42  NCPSGAPDWLPTAEVRSE----GSGTLTYPLVNDLATLAWAANLAALELHVPQWTVDADG 97

Query: 727 QPRPKRR 733
            P P  R
Sbjct: 98  NPGPPDR 104


>gnl|CDD|201356 pfam00628, PHD, PHD-finger.  PHD folds into an interleaved type of
           Zn-finger chelating 2 Zn ions in a similar manner to
           that of the RING and FYVE domains. Several PHD fingers
           have been identified as binding modules of methylated
           histone H3.
          Length = 51

 Score = 30.2 bits (68), Expect = 0.40
 Identities = 7/28 (25%), Positives = 12/28 (42%)

Query: 239 VVCDICGFACHLSCCDKVPPSCPVPPDQ 266
           ++CD C    HL+C         +P  +
Sbjct: 15  LLCDGCDRWFHLACLGPPLEPEEIPEGE 42


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 33.6 bits (77), Expect = 0.42
 Identities = 16/86 (18%), Positives = 40/86 (46%), Gaps = 6/86 (6%)

Query: 68  SQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRS------ELIAAQKEICDFRLKMESV 121
              LE++E L  +  L    +   Q+ + L + +        L   ++E+ + R +++ +
Sbjct: 539 ENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKEL 598

Query: 122 SMDLRRKDSQLKELQSRLESGDILER 147
              L+  + +L +L+  L+S ++ E 
Sbjct: 599 KKKLKELEERLSQLEELLQSLELSEA 624



 Score = 33.6 bits (77), Expect = 0.51
 Identities = 12/72 (16%), Positives = 29/72 (40%)

Query: 69  QKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRK 128
            + E  EL      L + ++   ++ E+L      L   ++++     ++E ++ +    
Sbjct: 296 LEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNEL 355

Query: 129 DSQLKELQSRLE 140
              L+E    LE
Sbjct: 356 AKLLEERLKELE 367


>gnl|CDD|241425 cd13271, PH2_TAPP1_2, Tandem PH-domain-containing proteins 1 and 2
           Pleckstrin homology (PH) domain, C-terminal repeat.  The
           binding of TAPP1 (also called PLEKHA1/pleckstrin
           homology domain containing, family A (phosphoinositide
           binding specific) member 1) and TAPP2 (also called
           PLEKHA2) adaptors to PtdIns(3,4)P(2), but not PI(3,4,
           5)P3, function as negative regulators of insulin and
           PI3K signalling pathways (i.e. TAPP/utrophin/syntrophin
           complex). TAPP1 and TAPP2 contain two sequential PH
           domains in which the C-terminal PH domain specifically
           binds PtdIns(3,4)P2 with high affinity. The N-terminal
           PH domain does not interact with any phosphoinositide
           tested. They also contain a C-terminal PDZ-binding motif
           that interacts with several PDZ-binding proteins,
           including PTPN13 (known previously as PTPL1 or FAP-1) as
           well as the scaffolding proteins MUPP1 (multiple
           PDZ-domain-containing protein 1), syntrophin and
           utrophin. PH domains have diverse functions, but in
           general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 114

 Score = 31.2 bits (71), Expect = 0.53
 Identities = 24/106 (22%), Positives = 43/106 (40%), Gaps = 21/106 (19%)

Query: 291 KTGGVKKGWVRQFVVVCDFKLFLYDISPDRNALPAVYVSLVLDMRDEDFAVSGVRESDVI 350
           K G V+K W R++ ++ D  +  Y    D+  L  +       +++    V  V E    
Sbjct: 16  KQGAVRKNWKRRYFILDDNTISYYKSETDKEPLRTIP------LKE----VLKVHECLSG 65

Query: 351 HATKKDIPCIFRITTSLMDPPGTKNHTLML-ADSDTEKTKWVVALS 395
               +D   +F I T+        + T  + ADS  +   W+ A+S
Sbjct: 66  DLLMRDN--LFEIITT--------SRTFYIQADSPEDMHSWIKAIS 101


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 33.5 bits (77), Expect = 0.57
 Identities = 19/77 (24%), Positives = 43/77 (55%), Gaps = 6/77 (7%)

Query: 67  RSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQK---EICDFRLKMESVSM 123
           + ++ E  E+L   + ++SE+    ++ EEL K   E+   ++   EI +   ++ES+  
Sbjct: 196 KEKEKELEEVLREINEISSELP---ELREELEKLEKEVKELEELKEEIEELEKELESLEG 252

Query: 124 DLRRKDSQLKELQSRLE 140
             R+ + +++EL+ R+E
Sbjct: 253 SKRKLEEKIRELEERIE 269



 Score = 31.6 bits (72), Expect = 2.0
 Identities = 22/129 (17%), Positives = 49/129 (37%), Gaps = 14/129 (10%)

Query: 65  DRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMD 124
            +   ++ K E+      L   I+    I E + +   EL    +EI +   ++  +  +
Sbjct: 164 YKNLGEVIK-EIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREE 222

Query: 125 LRRKDSQLKELQSRLESGDILERPASQISYLDHFLKESRHTGSVESEDGDIEDNRVPSIA 184
           L + + ++KEL+   E  + LE+                    +E +  ++E+     I 
Sbjct: 223 LEKLEKEVKELEELKEEIEELEK---------ELESLEGSKRKLEEKIRELEE----RIE 269

Query: 185 SSRSNLSEL 193
             +  + EL
Sbjct: 270 ELKKEIEEL 278



 Score = 30.8 bits (70), Expect = 3.8
 Identities = 23/101 (22%), Positives = 43/101 (42%), Gaps = 24/101 (23%)

Query: 65  DRRSQKLEK-----MELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKME 119
           + R ++LE      +EL + +  L  E +   ++ EEL K   EL   +K + + R ++E
Sbjct: 591 EERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELE 650

Query: 120 -------------------SVSMDLRRKDSQLKELQSRLES 141
                               +S +L    ++L+EL+ R E 
Sbjct: 651 ELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREE 691



 Score = 30.4 bits (69), Expect = 4.8
 Identities = 25/100 (25%), Positives = 45/100 (45%), Gaps = 7/100 (7%)

Query: 96  ELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQLKELQSRLESGDILERPASQISY- 154
           EL     EL   +KE+     +++    +L   + +L+EL+  LE    LE+  S+  Y 
Sbjct: 606 ELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEE---LEKKYSEEEYE 662

Query: 155 --LDHFLKESRHTGSVESEDGDIEDNRVPSIASSRSNLSE 192
              + +L+ SR    + +E  ++E  R   I  +   L E
Sbjct: 663 ELREEYLELSRELAGLRAELEELE-KRREEIKKTLEKLKE 701


>gnl|CDD|206350 pfam14182, YgaB, YgaB-like protein.  The YgaB-like protein family
           includes the B. subtilis YgaB protein, which is
           functionally uncharacterized. This family of proteins is
           found in bacteria. Proteins in this family are
           approximately 90 amino acids in length.
          Length = 79

 Score = 30.4 bits (69), Expect = 0.58
 Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 17/75 (22%)

Query: 68  SQKLEKME-LLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLR 126
           S++++ M+ LL LQS L           E   +   EL+  +K+      ++ES+  ++ 
Sbjct: 7   SEQMKTMDKLLFLQSEL-----------ERCQEIEKELLELEKKA-----ELESIQQEIS 50

Query: 127 RKDSQLKELQSRLES 141
           R   +LKE+Q   E 
Sbjct: 51  RMRKELKEIQELFEK 65


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 33.5 bits (77), Expect = 0.58
 Identities = 28/133 (21%), Positives = 55/133 (41%), Gaps = 12/133 (9%)

Query: 70  KLEKM--ELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRR 127
           +L+++   L  L+  L+S      +I   L +   EL  A ++I +   ++E +  +  +
Sbjct: 675 ELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEK 734

Query: 128 KDSQLKELQSRLESGDILERPASQISYLDHFLKE----SRHTGSVESEDGDIED----NR 179
              +L+EL+  L S  + +   +  S L                +E    D+E     +R
Sbjct: 735 LKERLEELEEDLSS--LEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSR 792

Query: 180 VPSIASSRSNLSE 192
           +P I +  S L E
Sbjct: 793 IPEIQAELSKLEE 805



 Score = 33.1 bits (76), Expect = 0.65
 Identities = 28/117 (23%), Positives = 51/117 (43%), Gaps = 10/117 (8%)

Query: 64  RDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSM 123
           R   S + E  +L    S L   ++   Q+ EEL+K   +L   ++E    + K+  +  
Sbjct: 244 RQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDL--GEEEQLRVKEKIGELEA 301

Query: 124 DLRRKDSQLKELQSRLESGDILERPASQISYLDHFLKESRHTGSVESEDGDIEDNRV 180
           ++   +  + E +  LE  D  ER A   + +D  L E      +E  + +IE+ R 
Sbjct: 302 EIASLERSIAEKERELE--DAEERLAKLEAEIDKLLAE------IEELEREIEEERK 350



 Score = 33.1 bits (76), Expect = 0.75
 Identities = 16/81 (19%), Positives = 36/81 (44%), Gaps = 4/81 (4%)

Query: 64  RDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSK----TRSELIAAQKEICDFRLKME 119
           R+   ++  + +L    + L  E++      EE+ K    TR EL   ++++   + ++ 
Sbjct: 343 REIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREIN 402

Query: 120 SVSMDLRRKDSQLKELQSRLE 140
            +  +L R   +L+ L   L 
Sbjct: 403 ELKRELDRLQEELQRLSEELA 423



 Score = 31.6 bits (72), Expect = 1.8
 Identities = 22/83 (26%), Positives = 39/83 (46%), Gaps = 7/83 (8%)

Query: 69  QKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRK 128
           QKL ++ L   +  L  EIQ   +   +L +    +   +KEI +   K E +  +L   
Sbjct: 819 QKLNRLTLE--KEYLEKEIQELQEQRIDLKEQIKSI---EKEIENLNGKKEELEEELEEL 873

Query: 129 DSQLKELQSRLE--SGDILERPA 149
           ++ L++L+SRL     +  E  A
Sbjct: 874 EAALRDLESRLGDLKKERDELEA 896



 Score = 31.6 bits (72), Expect = 1.9
 Identities = 11/62 (17%), Positives = 32/62 (51%)

Query: 79  LQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQLKELQSR 138
           ++  LN     K  + +E+ + + + I  +++I     ++E+++      + +L+EL++ 
Sbjct: 817 IEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAA 876

Query: 139 LE 140
           L 
Sbjct: 877 LR 878



 Score = 30.4 bits (69), Expect = 4.5
 Identities = 15/90 (16%), Positives = 36/90 (40%), Gaps = 5/90 (5%)

Query: 66  RRSQKLE--KMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSM 123
           R S++L      +  +++ +N   + K   + E+ K   +L     ++  +  ++  +  
Sbjct: 417 RLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKE 476

Query: 124 DLRRKDSQLKELQSRLESGDILERPASQIS 153
           +  R + +L +LQ  L      E  A    
Sbjct: 477 EYDRVEKELSKLQREL---AEAEAQARASE 503



 Score = 29.7 bits (67), Expect = 8.1
 Identities = 18/78 (23%), Positives = 39/78 (50%), Gaps = 7/78 (8%)

Query: 66  RRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDL 125
            RS   ++ EL + +     E  AK +   E+ K  +E+   ++EI + R + + ++ + 
Sbjct: 307 ERSIAEKERELEDAE-----ERLAKLE--AEIDKLLAEIEELEREIEEERKRRDKLTEEY 359

Query: 126 RRKDSQLKELQSRLESGD 143
                +L++L++ LE  D
Sbjct: 360 AELKEELEDLRAELEEVD 377


>gnl|CDD|182106 PRK09841, PRK09841, cryptic autophosphorylating protein tyrosine
           kinase Etk; Provisional.
          Length = 726

 Score = 33.0 bits (75), Expect = 0.62
 Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 23/117 (19%)

Query: 93  ISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRK---------DSQLKELQSRLESGD 143
           +  +L + RSEL  A++++  +R + +SV ++L  K         D+QL EL  R     
Sbjct: 272 LQRQLPEVRSELDQAEEKLNVYRQQRDSVDLNLEAKAVLEQIVNVDNQLNELTFR----- 326

Query: 144 ILERPASQISYLDH----FLKESRHTGSVESEDGDIEDNRVPSIASSRSNLSELTID 196
             E   SQ+   DH     L E R T  +E E   + + RV ++ S++  +  L+ D
Sbjct: 327 --EAEISQLYKKDHPTYRALLEKRQT--LEQERKRL-NKRVSAMPSTQQEVLRLSRD 378


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 31.7 bits (72), Expect = 0.85
 Identities = 19/107 (17%), Positives = 41/107 (38%), Gaps = 9/107 (8%)

Query: 66  RRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLK-------M 118
              Q +   +L  LQ  L+   Q  +++ EEL      +   + E+ D +         +
Sbjct: 64  APVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELL 123

Query: 119 ESVSMDLRRKDSQLKELQSRLESGDILERPASQISYLDHFLKESRHT 165
           +S+   L   +  +KEL   L   ++ +    ++  L   L+  +  
Sbjct: 124 KSLEERLESLEESIKELAKELR--ELRQDLREEVEELREELERLQEN 168


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 32.7 bits (74), Expect = 0.86
 Identities = 35/145 (24%), Positives = 65/145 (44%), Gaps = 15/145 (10%)

Query: 64  RDRRSQKLEKM--ELLNLQSSLNSEIQAKTQISEELSK-------TRSELIAAQKEICDF 114
           R     + E++  +L  LQ  L S  Q    + EEL+        T +EL  ++ E    
Sbjct: 278 RIDLQLENEQLHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTIAELHQSRLESAQM 337

Query: 115 RLKMESVSMDLRRKDSQLKE-----LQSRLESGDILERPASQISYLDHFLKESRHTGSV- 168
            L++  +++ L+   SQ  +      QS     D +++ ++++  L+ +L+E R      
Sbjct: 338 SLQLSQLNLALKEGQSQWAQERETLRQSAEADKDRIQKLSAELLKLEEWLQEERSQREKL 397

Query: 169 ESEDGDIEDNRVPSIASSRSNLSEL 193
           E E G  +D     ++ +R  LSEL
Sbjct: 398 EVELGIEKDCNRVQLSENRRELSEL 422


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 32.3 bits (73), Expect = 1.1
 Identities = 28/149 (18%), Positives = 59/149 (39%), Gaps = 6/149 (4%)

Query: 65  DRRSQKLEKME-LLNLQS-SLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVS 122
           D    +LE+    L  ++ SL +  + K   + E+   R      ++++   + K+E++ 
Sbjct: 346 DALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQ 405

Query: 123 MDLRRKDSQLKELQSRLESGDILERPASQISYLDHFLKE-SRHTGSVESEDGDIEDNRVP 181
              RRK+ +LKE + RL S        + +  L+  L E  R    ++ +    E     
Sbjct: 406 ETFRRKERRLKEEKERLRSLQTDTNTDTALEKLEKALAEKERIIERLKEQRDRDERYEQE 465

Query: 182 SIASSRSNLSEL--TIDQMNNMLRHDRSS 208
              + +    +L   +  +   L  +R  
Sbjct: 466 EFETYKKEFEDLKEEVQNLQLKL-SEREL 493



 Score = 31.5 bits (71), Expect = 2.0
 Identities = 22/83 (26%), Positives = 46/83 (55%), Gaps = 2/83 (2%)

Query: 61  KNWRDRRSQKL--EKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKM 118
           K   DR +Q+L  +K ELL LQ+ L +     + + + L K + +L  A++E    + ++
Sbjct: 286 KAKCDRAAQELSRKKTELLGLQTELETLANQDSDMRQHLDKLKEDLTRAEQEKAILQTEV 345

Query: 119 ESVSMDLRRKDSQLKELQSRLES 141
           +++  +L RK + L +  + L++
Sbjct: 346 DALRYELERKHNTLTKKTASLQA 368


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 31.8 bits (72), Expect = 1.2
 Identities = 13/82 (15%), Positives = 34/82 (41%)

Query: 59  GDKNWRDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKM 118
           G K  +        K +   L+  L    + K ++ +EL +  +E    Q+ +    ++ 
Sbjct: 120 GLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVEN 179

Query: 119 ESVSMDLRRKDSQLKELQSRLE 140
             +   L++   ++ +L+ R +
Sbjct: 180 SRLEEMLKKLPGEVYDLKKRWD 201


>gnl|CDD|218912 pfam06148, COG2, COG (conserved oligomeric Golgi) complex
           component, COG2.  The COG complex comprises eight
           proteins COG1-8. The COG complex plays critical roles in
           Golgi structure and function. The proposed function of
           the complex is to mediate the initial physical contact
           between transport vesicles and their membrane targets. A
           comparable role in tethering vesicles has been suggested
           for at least six additional large multisubunit
           complexes, including the exocyst, a complex that
           mediates trafficking to the plasma membrane. COG2
           structure reveals a six-helix bundle with few conserved
           surface features but a general resemblance to recently
           determined crystal structures of four different exocyst
           subunits. These bundles inCOG2 may act as platforms for
           interaction with other trafficing proteins including
           SNAREs (soluble N-ethylmaleimide factor attachment
           protein receptors) and Rabs.
          Length = 133

 Score = 30.7 bits (70), Expect = 1.3
 Identities = 24/111 (21%), Positives = 46/111 (41%), Gaps = 17/111 (15%)

Query: 66  RRSQKLEKM--ELLNLQSSLNSEI---------------QAKTQISEELSKTRSELIAAQ 108
           R+   LE +  +L      L SE+                    + E+L + R  L+A +
Sbjct: 23  RKGVPLESLRDDLRAYLKLLKSELIELINEDYADFVSLSTNLVGLDEKLERIRVPLLALR 82

Query: 109 KEICDFRLKMESVSMDLRRKDSQLKELQSRLESGDILERPASQISYLDHFL 159
           KE+   R ++E+   ++  K  + K+L+ + ES  +L      +  L+  L
Sbjct: 83  KEVESVRSELEARLEEVEAKLEEKKDLREKKESLKLLLDLDHVVEKLEDLL 133


>gnl|CDD|240439 PTZ00491, PTZ00491, major vault protein; Provisional.
          Length = 850

 Score = 31.9 bits (73), Expect = 1.5
 Identities = 17/77 (22%), Positives = 45/77 (58%), Gaps = 5/77 (6%)

Query: 65  DRRSQKLEKMELLNLQSSLNSEIQAKTQ-ISEELSKTRSELIAAQKEICDFRLKMESVSM 123
           D+   + ++ +LL LQ+  ++ +++  Q  +E L++  + LI A+ E+    L+ +++ +
Sbjct: 690 DKAKAEEQRTKLLELQAE-SAAVESSGQSRAEALAEAEARLIEAEAEVEQAELRAKALRI 748

Query: 124 DLRRKDSQLKELQSRLE 140
                +++L++L+ R E
Sbjct: 749 ---EAEAELEKLRKRQE 762


>gnl|CDD|191851 pfam07798, DUF1640, Protein of unknown function (DUF1640).  This
           family consists of sequences derived from hypothetical
           eukaryotic proteins. A region approximately 100 residues
           in length is featured.
          Length = 177

 Score = 30.7 bits (70), Expect = 1.7
 Identities = 16/78 (20%), Positives = 40/78 (51%), Gaps = 10/78 (12%)

Query: 69  QKLEKMELLNLQSS---LNSEI-QAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMD 124
           Q LEK E  NLQ+    L +++ + K ++ +E++K  + +        D  L+   +  +
Sbjct: 68  QSLEKSEFANLQAENEKLKNDLEKLKNRLRDEITKVTAGV------RLDLNLEKGRIREE 121

Query: 125 LRRKDSQLKELQSRLESG 142
              ++ ++KE  +++++ 
Sbjct: 122 SAEQELKIKETDTKIDTE 139


>gnl|CDD|150852 pfam10241, KxDL, Uncharacterized conserved protein.  This is a
           family of short proteins which are conserved over a
           region of 80 residues. There is a characteristic KxDL
           motif towards the C-terminus. The function is unknown.
          Length = 88

 Score = 29.2 bits (66), Expect = 1.7
 Identities = 16/62 (25%), Positives = 23/62 (37%), Gaps = 3/62 (4%)

Query: 80  QSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQLKELQSRL 139
              L+  I  + QI   L KT   L+     +   RLK      D       LKE++  L
Sbjct: 10  PEDLDRIIAKQAQILGRLEKTNRMLLELN-ALSQARLK--GARADFAEGTRLLKEMKKDL 66

Query: 140 ES 141
           + 
Sbjct: 67  DW 68


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 31.9 bits (72), Expect = 1.8
 Identities = 18/80 (22%), Positives = 40/80 (50%), Gaps = 6/80 (7%)

Query: 68  SQKLEKMELL--NLQSSLNSEIQAKTQISEEL----SKTRSELIAAQKEICDFRLKMESV 121
              L+    L  +   S  SE++ K +  +E+       +S +   +KEI + R K++ V
Sbjct: 697 ISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKV 756

Query: 122 SMDLRRKDSQLKELQSRLES 141
           + D++R  + ++E ++ L +
Sbjct: 757 NRDIQRLKNDIEEQETLLGT 776



 Score = 31.9 bits (72), Expect = 1.8
 Identities = 17/75 (22%), Positives = 35/75 (46%), Gaps = 4/75 (5%)

Query: 86  EIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQLKELQSRLESGDIL 145
           E++   +    L++ ++EL+    E    +L+ +     +R +DS ++ L +RLE  D  
Sbjct: 327 ELEKLNKERRLLNQEKTELLV---EQGRLQLQADRHQEHIRARDSLIQSLATRLEL-DGF 382

Query: 146 ERPASQISYLDHFLK 160
           ER       + +F  
Sbjct: 383 ERGPFSERQIKNFHT 397


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 31.6 bits (72), Expect = 2.1
 Identities = 14/73 (19%), Positives = 36/73 (49%), Gaps = 1/73 (1%)

Query: 69  QKLEKMELLNLQSSLNSEIQAKTQISEELSKT-RSELIAAQKEICDFRLKMESVSMDLRR 127
           +  E  E L    +   E++A+ +  E   +    +L   + ++    L++ S++ ++ R
Sbjct: 345 KLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIER 404

Query: 128 KDSQLKELQSRLE 140
            +++L+ L+ R E
Sbjct: 405 LEARLERLEDRRE 417



 Score = 30.8 bits (70), Expect = 3.2
 Identities = 16/78 (20%), Positives = 41/78 (52%), Gaps = 5/78 (6%)

Query: 64  RDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSM 123
            +   ++L++ E       L +E+Q      E+L + R E+   ++EI + + ++ +++ 
Sbjct: 241 LEELQEELKEAEEE--LEELTAELQELE---EKLEELRLEVSELEEEIEELQKELYALAN 295

Query: 124 DLRRKDSQLKELQSRLES 141
           ++ R + Q + L+ RL +
Sbjct: 296 EISRLEQQKQILRERLAN 313



 Score = 30.0 bits (68), Expect = 6.2
 Identities = 15/78 (19%), Positives = 41/78 (52%), Gaps = 2/78 (2%)

Query: 65  DRRSQKLEKM--ELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVS 122
           +   +++E++  ++ +L + +    +   ++  EL    +E  + ++ +   R ++E +S
Sbjct: 841 EDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELS 900

Query: 123 MDLRRKDSQLKELQSRLE 140
            +LR  +S+  EL+  LE
Sbjct: 901 EELRELESKRSELRRELE 918



 Score = 29.6 bits (67), Expect = 8.6
 Identities = 22/73 (30%), Positives = 40/73 (54%), Gaps = 9/73 (12%)

Query: 71  LEKMELL--NLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRK 128
           +E++E L   L+S L + +  +  + E L+  RSEL    +E+     ++ES   +LRR 
Sbjct: 861 IEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL----RELESKRSELRR- 915

Query: 129 DSQLKELQSRLES 141
             +L+EL+ +L  
Sbjct: 916 --ELEELREKLAQ 926



 Score = 29.3 bits (66), Expect = 9.9
 Identities = 25/101 (24%), Positives = 45/101 (44%), Gaps = 7/101 (6%)

Query: 67  RSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLR 126
             +KLE+ EL  LQ+ L    +   ++ EEL +    L   ++E+       E     L 
Sbjct: 426 LLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREEL-------EEAEQALD 478

Query: 127 RKDSQLKELQSRLESGDILERPASQISYLDHFLKESRHTGS 167
             + +L +LQ+RL+S + L+      S     L +++   S
Sbjct: 479 AAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLS 519


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 30.5 bits (69), Expect = 2.6
 Identities = 10/64 (15%), Positives = 30/64 (46%)

Query: 78  NLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQLKELQS 137
           N  S L+   + K  I  E+    +++   Q +I + + +++    ++++   ++ EL+ 
Sbjct: 35  NQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKE 94

Query: 138 RLES 141
            +  
Sbjct: 95  NIVE 98


>gnl|CDD|214584 smart00249, PHD, PHD zinc finger.  The plant homeodomain (PHD)
           finger is a C4HC3 zinc-finger-like motif found in
           nuclear proteins thought to be involved in epigenetics
           and chromatin-mediated transcriptional regulation. The
           PHD finger binds two zinc ions using the so-called
           'cross-brace' motif and is thus structurally related to
           the RING finger and the FYVE finger. It is not yet known
           if PHD fingers have a common molecular function. Several
           reports suggest that it can function as a
           protein-protein interacton domain and it was recently
           demonstrated that the PHD finger of p300 can cooperate
           with the adjacent BROMO domain in nucleosome binding in
           vitro. Other reports suggesting that the PHD finger is a
           ubiquitin ligase have been refuted as these domains were
           RING fingers misidentified as PHD fingers.
          Length = 47

 Score = 27.6 bits (61), Expect = 2.7
 Identities = 11/47 (23%), Positives = 15/47 (31%), Gaps = 14/47 (29%)

Query: 224 CNHC-----TSLMVGLTRQGVVCDICGFACHLSCCDKVPPSCPVPPD 265
           C+ C        ++        CD C    H +C    PP     PD
Sbjct: 2   CSVCGKPDDGGELLQ-------CDGCDRWYHQTCLG--PPLLEEEPD 39


>gnl|CDD|206675 cd01888, eIF2_gamma, Gamma subunit of initiation factor 2 (eIF2
           gamma).  eIF2 is a heterotrimeric translation initiation
           factor that consists of alpha, beta, and gamma subunits.
           The GTP-bound gamma subunit also binds initiator
           methionyl-tRNA and delivers it to the 40S ribosomal
           subunit. Following hydrolysis of GTP to GDP, eIF2:GDP is
           released from the ribosome. The gamma subunit has no
           intrinsic GTPase activity, but is stimulated by the
           GTPase activating protein (GAP) eIF5, and GDP/GTP
           exchange is stimulated by the guanine nucleotide
           exchange factor (GEF) eIF2B. eIF2B is a heteropentamer,
           and the epsilon chain binds eIF2. Both eIF5 and
           eIF2B-epsilon are known to bind strongly to eIF2-beta,
           but have also been shown to bind directly to eIF2-gamma.
           It is possible that eIF2-beta serves simply as a
           high-affinity docking site for eIF5 and eIF2B-epsilon,
           or that eIF2-beta serves a regulatory role. eIF2-gamma
           is found only in eukaryotes and archaea. It is closely
           related to SelB, the selenocysteine-specific elongation
           factor from eubacteria. The translational factor
           components of the ternary complex, IF2 in eubacteria and
           eIF2 in eukaryotes are not the same protein (despite
           their unfortunately similar names). Both factors are
           GTPases; however, eubacterial IF-2 is a single
           polypeptide, while eIF2 is heterotrimeric. eIF2-gamma is
           a member of the same family as eubacterial IF2, but the
           two proteins are only distantly related. This family
           includes translation initiation, elongation, and release
           factors.
          Length = 197

 Score = 30.3 bits (69), Expect = 2.8
 Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 7/44 (15%)

Query: 617 GEFLLVFHSLAAYVDSQGHKSREKEIMYPALPTGASYMDGQLLI 660
           GE  LV H   ++VD  GH     EI+   + +GA+ MDG LL+
Sbjct: 71  GETKLVRH--VSFVDCPGH-----EILMATMLSGAAVMDGALLL 107


>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
           protein, PEP-CTERM locus subfamily.  Members of this
           protein family belong to the family of polysaccharide
           chain length determinant proteins (pfam02706). All are
           found in species that encode the PEP-CTERM/exosortase
           system predicted to act in protein sorting in a number
           of Gram-negative bacteria, and are found near the epsH
           homolog that is the putative exosortase gene [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides].
          Length = 498

 Score = 30.8 bits (70), Expect = 2.9
 Identities = 20/104 (19%), Positives = 37/104 (35%), Gaps = 6/104 (5%)

Query: 93  ISEELSKTRSELIAAQKEICDFRLK-MESVSMDLRRKDSQLKELQSRLESG--DILERPA 149
           I E++     +L AA+  +  F+ +    +        S++ E Q  LE+   ++ E  A
Sbjct: 166 IDEQIKTYEKKLEAAENRLKAFKQENGGILPDQEGDYYSEISEAQEELEAARLELNEAIA 225

Query: 150 SQISYLDHFLKESRHTGSVESEDGDIEDNRVPSIASSRSNLSEL 193
            + +       E     +  S      D R   I +    L  L
Sbjct: 226 QRDALKRQLGGEEPVLLAGSSVANSELDGR---IEALEKQLDAL 266


>gnl|CDD|215437 PLN02816, PLN02816, mannosyltransferase.
          Length = 546

 Score = 30.8 bits (69), Expect = 2.9
 Identities = 35/128 (27%), Positives = 52/128 (40%), Gaps = 22/128 (17%)

Query: 597 LVSLLTYSPVDALLAIELPRGEFL--LVFHSLAAYVDSQGHKSREKEIMYPALPTGASYM 654
           LV L T++P  ++  I   + + L  L+   LA Y    GHK  E   + P LP      
Sbjct: 299 LVMLFTFTPF-SIAGIIKSKNQKLSALILWVLAIY-SILGHK--EFRFVLPVLPIA---- 350

Query: 655 DGQLLIFSETHVDVFNAESGDWLQTVNIRRALPLDT--------RGSLCFSLANDIPYVV 706
               LIFS                +V  ++ +P           R S+ F LA +IP  +
Sbjct: 351 ----LIFSGYAFAQMEVSGSSSSSSVTKKKQVPRQNHTKWSPKLRLSVYFLLATNIPMAL 406

Query: 707 YLANIHQR 714
           Y++  HQR
Sbjct: 407 YMSLFHQR 414


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 30.4 bits (68), Expect = 4.0
 Identities = 20/96 (20%), Positives = 36/96 (37%), Gaps = 9/96 (9%)

Query: 75  ELLNLQSSLNSEIQAKTQI-------SEELSKTRSELIAAQKEICDFRLKMESVSMDLRR 127
           +L  L++ L +    K            EL K R E  A ++E+   R  +     +L R
Sbjct: 82  QLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELAR 141

Query: 128 KDSQLKELQSRLESGDILERPASQISYLDHFLKESR 163
              Q ++LQ+RL    + E+     +         +
Sbjct: 142 LTKQAQDLQTRL--KTLAEQRRQLEAQAQSLQASQK 175


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
           synthetase class II domain (pfam00587) and represents
           the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 28.3 bits (64), Expect = 5.1
 Identities = 16/81 (19%), Positives = 38/81 (46%), Gaps = 17/81 (20%)

Query: 60  DKNWRDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKME 119
           D+  R+ +  +LE      LQ+  N           ELSK   +    +++      +++
Sbjct: 35  DEERRELQ-VELE-----ELQAERN-----------ELSKEIGKAKKKKEDAEALIAEVK 77

Query: 120 SVSMDLRRKDSQLKELQSRLE 140
            +  +L+  +++L+EL++ L+
Sbjct: 78  ELKDELKALEAELRELEAELD 98


>gnl|CDD|218380 pfam05010, TACC, Transforming acidic coiled-coil-containing protein
           (TACC).  This family contains the proteins TACC 1, 2 and
           3 the genes for which are found concentrated in the
           centrosomes of eukaryotic and may play a conserved role
           in organising centrosomal microtubules. The human TACC
           proteins have been linked to cancer and TACC2 has been
           identified as a possible tumour suppressor (AZU-1). The
           functional homologue (Alp7) in Schizosaccharomyces pombe
           has been shown to be required for organisation of
           bipolar spindles.
          Length = 207

 Score = 29.3 bits (66), Expect = 5.6
 Identities = 13/48 (27%), Positives = 28/48 (58%)

Query: 88  QAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQLKEL 135
            A  +I++  SK ++E  A Q  +   ++K++S+   L +K+ + +EL
Sbjct: 147 IANEEIAQVRSKAKAETAALQASLRKEQMKVQSLEETLEQKNKENEEL 194


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 29.7 bits (67), Expect = 5.9
 Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 7/69 (10%)

Query: 72  EKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQ 131
           E+ EL  L S   ++     Q+ EE  KT ++L  ++      +L+      +LR  +S+
Sbjct: 183 EQAELTTLLSEQRAQQAKLAQLLEERKKTLAQL-NSELSADQKKLE------ELRANESR 235

Query: 132 LKELQSRLE 140
           LK   +  E
Sbjct: 236 LKNEIASAE 244


>gnl|CDD|237653 PRK14276, PRK14276, chaperone protein DnaJ; Provisional.
          Length = 380

 Score = 29.7 bits (67), Expect = 6.3
 Identities = 12/36 (33%), Positives = 16/36 (44%), Gaps = 9/36 (25%)

Query: 219 TSPTKCNHC---------TSLMVGLTRQGVVCDICG 245
           TSP  C  C         T   +G+ R+ V CD+C 
Sbjct: 161 TSPVTCGKCHGSGVITVDTQTPLGMMRRQVTCDVCH 196


>gnl|CDD|227419 COG5087, RTT109, Uncharacterized conserved protein [Function
           unknown].
          Length = 349

 Score = 29.6 bits (66), Expect = 6.6
 Identities = 17/67 (25%), Positives = 22/67 (32%), Gaps = 10/67 (14%)

Query: 90  KTQISEELSKTRSELIAAQKEICDFRLKMESVSM-DLRRKDSQLKELQSRLESGDILERP 148
              I   +S  R   I A+K ICDF L  E      L R     +              P
Sbjct: 284 DKYIFGSISDARRSAIEAKKNICDFELAREYFQGQGLVRPKKNEEANAV---------MP 334

Query: 149 ASQISYL 155
           A+  + L
Sbjct: 335 ATNKNML 341


>gnl|CDD|204194 pfam09304, Cortex-I_coil, Cortexillin I, coiled coil.  Members of
           this family are predominantly found in the
           actin-bundling protein Cortexillin I from Dictyostelium
           discoideum. They adopt a structure consisting of an
           18-heptad-repeat alpha-helical coiled-coil, and are a
           prerequisite for the assembly of Cortexillin I.
          Length = 107

 Score = 28.0 bits (62), Expect = 6.9
 Identities = 16/76 (21%), Positives = 34/76 (44%), Gaps = 3/76 (3%)

Query: 65  DRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMD 124
           ++   +  K EL N  + L + ++++    EEL+K + +L +    +     +       
Sbjct: 3   EKARLEASKNELANKLAGLENSLESEKASQEELAKQKDQLRSLLASL---EAQKAEREQR 59

Query: 125 LRRKDSQLKELQSRLE 140
           LR   +++ EL   LE
Sbjct: 60  LRELQAKIDELLKNLE 75


>gnl|CDD|224264 COG1345, FliD, Flagellar capping protein [Cell motility and
           secretion].
          Length = 483

 Score = 29.7 bits (67), Expect = 6.9
 Identities = 21/93 (22%), Positives = 42/93 (45%), Gaps = 14/93 (15%)

Query: 79  LQSSLNSEIQA----------KTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRK 128
           L+++LN +  +           T I++ L +  + L+A    I     + +S++  ++  
Sbjct: 377 LKAALNEDPDSVREFFGGDGKDTGIAKTLGEGLNSLLAVTGGI--ITGRTDSLNKQIKSL 434

Query: 129 DSQLKELQSRLESGDILERPASQISYLDHFLKE 161
           D  +K L  RLE     ER  +Q + LD  + +
Sbjct: 435 DKDIKSLDKRLE--AAEERYKTQFNTLDDMMTQ 465


>gnl|CDD|235861 PRK06782, PRK06782, flagellar motor switch protein; Reviewed.
          Length = 528

 Score = 29.4 bits (66), Expect = 7.3
 Identities = 18/86 (20%), Positives = 41/86 (47%), Gaps = 6/86 (6%)

Query: 68  SQKLEKMELLNLQSSLNSEI-QAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLR 126
            ++L ++EL  ++  +N  +  A T +SE   + + ++     +    + +M   S++  
Sbjct: 319 EKELTELELSAIKEIMNQMMGHAATAMSEMF-QEKIDISPPTVKFVTLKEEMLGESIE-- 375

Query: 127 RKDSQLKELQSRLESGDILERPASQI 152
               +L ++   LE GD+L+    QI
Sbjct: 376 --VDELVQITFNLEVGDLLQTKMYQI 399


>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin. 
          Length = 237

 Score = 29.1 bits (66), Expect = 7.9
 Identities = 18/39 (46%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 66  RRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSEL 104
           R  QKLEK E+  L+  L +E +    ISEEL +T +EL
Sbjct: 197 RSVQKLEK-EVDRLEDELLAEKEKYKAISEELDQTLAEL 234


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 29.2 bits (66), Expect = 8.2
 Identities = 13/77 (16%), Positives = 28/77 (36%), Gaps = 16/77 (20%)

Query: 64  RDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSM 123
            +  + K E   L   + S++       QI + +     EL    +++   R +++    
Sbjct: 80  SENEALKAENERLQKREQSIDQ------QIQQAVQSETQELTKEIEQLKSERQQLQ---- 129

Query: 124 DLRRKDSQLKELQSRLE 140
                   + +LQ RL 
Sbjct: 130 ------GLIDQLQRRLA 140


>gnl|CDD|151039 pfam10473, Cenp-F_leu_zip, Leucine-rich repeats of kinetochore
           protein Cenp-F/LEK1.  Cenp-F, a centromeric kinetochore,
           microtubule-binding protein consisting of two
           1,600-amino acid-long coils, is essential for the full
           functioning of the mitotic checkpoint pathway. There are
           several leucine-rich repeats along the sequence of LEK1
           that are considered to be zippers, though they do not
           appear to be binding DNA directly in this instance.
          Length = 140

 Score = 28.2 bits (63), Expect = 8.9
 Identities = 16/50 (32%), Positives = 29/50 (58%), Gaps = 3/50 (6%)

Query: 61  KNWRDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKE 110
           +  ++R S+ LEK+   + ++ L  + Q K Q+ EE SKT  E++  Q +
Sbjct: 90  QEKQERVSE-LEKL-NSSTENLLEEKEQEKIQMKEE-SKTAVEMLQTQLK 136


>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain. 
          Length = 225

 Score = 28.9 bits (65), Expect = 9.3
 Identities = 21/77 (27%), Positives = 30/77 (38%), Gaps = 24/77 (31%)

Query: 61  KNWRDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIA-AQKEICDFRL--- 116
           + WR+RR  ++E+ +                   E   K + ELI  AQKEI DF     
Sbjct: 115 RKWRERRDLRIEERD-------------------EASEKKKEELIEKAQKEIDDFYENYN 155

Query: 117 -KMESVSMDLRRKDSQL 132
            K E      R++  Q 
Sbjct: 156 EKKEKTKAQNRKEAEQF 172


>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein.  This family
           consists of the p50 and variable adherence-associated
           antigen (Vaa) adhesins from Mycoplasma hominis. M.
           hominis is a mycoplasma associated with human urogenital
           diseases, pneumonia, and septic arthritis. An adhesin is
           a cell surface molecule that mediates adhesion to other
           cells or to the surrounding surface or substrate. The
           Vaa antigen is a 50-kDa surface lipoprotein that has
           four tandem repetitive DNA sequences encoding a periodic
           peptide structure, and is highly immunogenic in the
           human host. p50 is also a 50-kDa lipoprotein, having
           three repeats A,B and C, that may be a tetramer of
           191-kDa in its native environment.
          Length = 353

 Score = 28.9 bits (64), Expect = 9.6
 Identities = 23/80 (28%), Positives = 43/80 (53%), Gaps = 8/80 (10%)

Query: 65  DRRSQKLEKMELLNLQSSLNSEIQAKTQISEE---LSKTRSELIA-AQKEI---CDFRLK 117
           D ++ ++ K EL    + + S I+   +I+ E   LSKT ++ IA  +K+     DF+ +
Sbjct: 110 DDKNAQIAK-ELAERNAKIQSNIEELKKINNEAFELSKTVNKTIAEVEKKFKIPKDFKEQ 168

Query: 118 MESVSMDLRRKDSQLKELQS 137
           +E+ + DL  K  Q+ E  +
Sbjct: 169 LENFADDLLDKSRQIDEFTT 188


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.134    0.394 

Gapped
Lambda     K      H
   0.267   0.0722    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 38,214,205
Number of extensions: 3749831
Number of successful extensions: 3409
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3344
Number of HSP's successfully gapped: 121
Length of query: 760
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 656
Effective length of database: 6,324,786
Effective search space: 4149059616
Effective search space used: 4149059616
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.1 bits)