RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3835
         (1659 letters)



>gnl|CDD|212794 cd11860, SH3_DLG5, Src homology 3 domain of Disks Large homolog 5.
            DLG5 is a multifunctional scaffold protein that is
            located at sites of cell-cell contact and is involved in
            the maintenance of cell shape and polarity. Mutations in
            the DLG5 gene are associated with Crohn's disease (CD)
            and inflammatory bowel disease (IBD). DLG5 is a member of
            the MAGUK (membrane-associated guanylate kinase) protein
            family, which is characterized by the presence of a core
            of three domains: PDZ, SH3, and guanylate kinase (GuK).
            The GuK domain in MAGUK proteins is enzymatically
            inactive; instead, the domain mediates protein-protein
            interactions and associates intramolecularly with the SH3
            domain. DLG5 contains 4 PDZ domains as well as an
            N-terminal domain of unknown function. SH3 domains are
            protein interaction domains that bind to proline-rich
            ligands with moderate affinity and selectivity,
            preferentially to PxxP motifs. They play versatile and
            diverse roles in the cell including the regulation of
            enzymes, changing the subcellular localization of
            signaling pathway components, and mediating the formation
            of multiprotein complex assemblies.
          Length = 63

 Score =  115 bits (291), Expect = 6e-31
 Identities = 45/65 (69%), Positives = 53/65 (81%), Gaps = 2/65 (3%)

Query: 1334 YMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGIIPSK 1393
            Y+RALF R+ +  D  +L F KDDILYVDNTMFNGV G WRAWLVD +G +++CGIIPSK
Sbjct: 1    YVRALFDRSAENED--ELSFKKDDILYVDNTMFNGVFGQWRAWLVDEEGRKRKCGIIPSK 58

Query: 1394 YKVEE 1398
            YKVEE
Sbjct: 59   YKVEE 63


>gnl|CDD|238492 cd00992, PDZ_signaling, PDZ domain found in a variety of Eumetazoan
            signaling molecules, often in tandem arrangements. May be
            responsible for specific protein-protein interactions, as
            most PDZ domains bind C-terminal polypeptides, and
            binding to internal (non-C-terminal) polypeptides and
            even to lipids has been demonstrated. In this subfamily
            of PDZ domains an N-terminal beta-strand forms the
            peptide-binding groove base, a circular permutation with
            respect to PDZ domains found in proteases.
          Length = 82

 Score = 79.1 bits (196), Expect = 1e-17
 Identities = 27/82 (32%), Positives = 47/82 (57%), Gaps = 3/82 (3%)

Query: 1129 LRRVHIDKS-VEPLGIQIQC--LDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRS 1185
            +R V + K     LG  ++      GG+FVS V     A + GL+VGD++LEV G+++  
Sbjct: 1    VRTVTLRKDPGGGLGFSLRGGKDSGGGIFVSRVEPGGPAERGGLRVGDRILEVNGVSVEG 60

Query: 1186 ATYQLAASVLRQCGNSITMLVQ 1207
             T++ A  +L+  G+ +T+ V+
Sbjct: 61   LTHEEAVELLKNSGDEVTLTVR 82



 Score = 67.6 bits (166), Expect = 9e-14
 Identities = 31/80 (38%), Positives = 38/80 (47%), Gaps = 2/80 (2%)

Query: 1237 PRFLMIETRKCSNLGISLVGGN--AVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRA 1294
             R + +       LG SL GG     GI+V  V+ G      GLR GDRILE NG  +  
Sbjct: 1    VRTVTLRKDPGGGLGFSLRGGKDSGGGIFVSRVEPGGPAERGGLRVGDRILEVNGVSVEG 60

Query: 1295 ATAEEAAYELAKPADKVTVL 1314
             T EEA   L    D+VT+ 
Sbjct: 61   LTHEEAVELLKNSGDEVTLT 80



 Score = 55.3 bits (134), Expect = 2e-09
 Identities = 26/82 (31%), Positives = 45/82 (54%), Gaps = 8/82 (9%)

Query: 812 LVTTQLQLNNSYDHGLTL------ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVD 865
           + T  L+ +     G +L        G+++ ++ PG  A + G L VGDR+L +N  +V+
Sbjct: 1   VRTVTLRKDPGGGLGFSLRGGKDSGGGIFVSRVEPGGPAERGG-LRVGDRILEVNGVSVE 59

Query: 866 GLSSAREGMALLSDSPDVLTIT 887
           GL+   E + LL +S D +T+T
Sbjct: 60  GLTHE-EAVELLKNSGDEVTLT 80



 Score = 54.9 bits (133), Expect = 3e-09
 Identities = 26/72 (36%), Positives = 41/72 (56%), Gaps = 6/72 (8%)

Query: 192 GLQLNNSYDHGLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMA 251
           G  L    D G     G+++ ++ PG  A + G L VGDR+L +N  +V+GL+   E + 
Sbjct: 15  GFSLRGGKDSG----GGIFVSRVEPGGPAERGG-LRVGDRILEVNGVSVEGLTHE-EAVE 68

Query: 252 LLSDSPDVLTIT 263
           LL +S D +T+T
Sbjct: 69  LLKNSGDEVTLT 80



 Score = 34.8 bits (81), Expect = 0.031
 Identities = 9/38 (23%), Positives = 20/38 (52%)

Query: 766 KVNDCIVRVNNLDCSNVSKRMVLETVRSSGTSALMVIR 803
           +V D I+ VN +    ++    +E +++SG    + +R
Sbjct: 45  RVGDRILEVNGVSVEGLTHEEAVELLKNSGDEVTLTVR 82



 Score = 30.2 bits (69), Expect = 1.4
 Identities = 13/42 (30%), Positives = 19/42 (45%), Gaps = 6/42 (14%)

Query: 72  SPNEALGLELVGGRDDPFYPNDSSIYVSNIVKGSVTD-GKLK 112
            P   LG  L GG+D     +   I+VS +  G   + G L+
Sbjct: 9   DPGGGLGFSLRGGKD-----SGGGIFVSRVEPGGPAERGGLR 45


>gnl|CDD|214570 smart00228, PDZ, Domain present in PSD-95, Dlg, and ZO-1/2.  Also
            called DHR (Dlg homologous region) or GLGF (relatively
            well conserved tetrapeptide in these domains). Some PDZs
            have been shown to bind C-terminal polypeptides; others
            appear to bind internal (non-C-terminal) polypeptides.
            Different PDZs possess different binding specificities.
          Length = 85

 Score = 77.0 bits (190), Expect = 6e-17
 Identities = 29/85 (34%), Positives = 46/85 (54%), Gaps = 2/85 (2%)

Query: 1128 ELRRVHIDKSVEPLGIQIQ--CLDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRS 1185
            E R V ++K    LG  +     + GGV VS+V   S A++ GL+VGD +LEV G ++  
Sbjct: 1    EPRLVELEKGGGGLGFSLVGGKDEGGGVVVSSVVPGSPAAKAGLRVGDVILEVNGTSVEG 60

Query: 1186 ATYQLAASVLRQCGNSITMLVQYSP 1210
             T+  A  +L++ G  +T+ V    
Sbjct: 61   LTHLEAVDLLKKAGGKVTLTVLRGG 85



 Score = 67.4 bits (165), Expect = 1e-13
 Identities = 33/73 (45%), Positives = 37/73 (50%), Gaps = 2/73 (2%)

Query: 1244 TRKCSNLGISLVGG--NAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAA 1301
             +    LG SLVGG     G+ V SV  GS    AGLR GD ILE NGT +   T  EA 
Sbjct: 8    EKGGGGLGFSLVGGKDEGGGVVVSSVVPGSPAAKAGLRVGDVILEVNGTSVEGLTHLEAV 67

Query: 1302 YELAKPADKVTVL 1314
              L K   KVT+ 
Sbjct: 68   DLLKKAGGKVTLT 80



 Score = 50.1 bits (120), Expect = 1e-07
 Identities = 22/62 (35%), Positives = 35/62 (56%), Gaps = 2/62 (3%)

Query: 206 ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDVLTITTL 265
             G+ +  + PGS AAK G L VGD +L +N  +V+GL+     + LL  +   +T+T L
Sbjct: 25  GGGVVVSSVVPGSPAAKAG-LRVGDVILEVNGTSVEGLTHLEA-VDLLKKAGGKVTLTVL 82

Query: 266 KP 267
           + 
Sbjct: 83  RG 84



 Score = 50.1 bits (120), Expect = 1e-07
 Identities = 22/62 (35%), Positives = 35/62 (56%), Gaps = 2/62 (3%)

Query: 830 ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDVLTITTL 889
             G+ +  + PGS AAK G L VGD +L +N  +V+GL+     + LL  +   +T+T L
Sbjct: 25  GGGVVVSSVVPGSPAAKAG-LRVGDVILEVNGTSVEGLTHLEA-VDLLKKAGGKVTLTVL 82

Query: 890 KP 891
           + 
Sbjct: 83  RG 84



 Score = 32.4 bits (74), Expect = 0.27
 Identities = 7/40 (17%), Positives = 18/40 (45%)

Query: 766 KVNDCIVRVNNLDCSNVSKRMVLETVRSSGTSALMVIRRR 805
           +V D I+ VN      ++    ++ ++ +G    + + R 
Sbjct: 45  RVGDVILEVNGTSVEGLTHLEAVDLLKKAGGKVTLTVLRG 84


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 79.0 bits (195), Expect = 2e-14
 Identities = 64/315 (20%), Positives = 137/315 (43%), Gaps = 21/315 (6%)

Query: 297 NETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELR 356
                        LE  R Q +    QLE   +E ++L  +   L S  + L+ EL EL 
Sbjct: 691 KSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELE 750

Query: 357 SGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELA 416
                          + L +       + E++++    L++  ++L        ++LE  
Sbjct: 751 ------------EELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEEL 798

Query: 417 QEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQH 476
           +EE+   +++ + + +E  +  + R  L+Q+      + +    + +E +E L +++++ 
Sbjct: 799 EEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKEL 858

Query: 477 EDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAV 536
           E+  +E+    A + +   +LK L EE+    +E   + SE   + +E+EKL E L +  
Sbjct: 859 EELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELE 918

Query: 537 RKIKTLEMENKELVE-----DKKTLSYQIETLKREIASALHDRD----KALKECNDLRER 587
            K++ LE+E  EL E      + TL  ++E     +   +        +A++E  ++ ER
Sbjct: 919 AKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEER 978

Query: 588 FGEFTAAKEECQREA 602
           + E  + +E+ +   
Sbjct: 979 YEELKSQREDLEEAK 993



 Score = 78.6 bits (194), Expect = 2e-14
 Identities = 69/367 (18%), Positives = 148/367 (40%), Gaps = 38/367 (10%)

Query: 301 RRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCC 360
           R      +EL+    +     AQLE   +E  SL+++   L    + L R+L EL     
Sbjct: 660 RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELE---- 715

Query: 361 TQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEV 420
                      + L +   +   + E ++   + L +  ++L        ++LE  +EE+
Sbjct: 716 --------RQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEEL 767

Query: 421 ARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAV 480
             +++    + +E      +R  L++              E  E +E L + +++ +   
Sbjct: 768 ESLEEALAKLKEEIEELEEKRQALQE--------------ELEELEEELEEAERRLDALE 813

Query: 481 KEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIK 540
           +E+ +    R +  ++++ L EE     ++   +  E + + KE+E+L E+L +   + +
Sbjct: 814 RELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKE 873

Query: 541 TLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQR 600
            LE       ++ K L  + E L+ E+     +  +  +E   LRER  E  A  E  + 
Sbjct: 874 ELE-------DELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEV 926

Query: 601 EAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDK 660
           E      E      E     +   E+E        + +     NL A +E + + +  ++
Sbjct: 927 EL-PELEEELEEEYEDTLETELEREIERLEE----EIEALGPVNLRAIEEYEEVEERYEE 981

Query: 661 LQAELQD 667
           L+++ +D
Sbjct: 982 LKSQRED 988



 Score = 77.8 bits (192), Expect = 3e-14
 Identities = 54/321 (16%), Positives = 130/321 (40%), Gaps = 26/321 (8%)

Query: 383 NKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERN 442
                 K E   L +   +L        ++L+  + E+  ++   E++ ++     R+  
Sbjct: 660 RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLE 719

Query: 443 GLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTE 502
            LK++  A   + +       E +E L +++++ E+  + +                L E
Sbjct: 720 ELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLE--------------ELEE 765

Query: 503 ERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIET 562
           E  +  +  + +  E + + ++ + L E+L +   +++  E     L  + ++L  + E 
Sbjct: 766 ELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRER 825

Query: 563 LKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKE 622
           L++EI     + ++  ++ ++L E   E     EE + E      E      E     +E
Sbjct: 826 LEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEE 885

Query: 623 RNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSVNYLFILTESIQE 682
           + E+E        + ++   +  E  +EI++LR+  ++L+A+L+   V     L E ++E
Sbjct: 886 KEELEE-------ELRELESELAELKEEIEKLRERLEELEAKLERLEVELP-ELEEELEE 937

Query: 683 AEVSKRRRDWAFSERDKIEVI 703
                   +       +IE +
Sbjct: 938 EYEDTLETEL----EREIERL 954



 Score = 74.4 bits (183), Expect = 4e-13
 Identities = 53/283 (18%), Positives = 115/283 (40%), Gaps = 11/283 (3%)

Query: 421 ARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAV 480
           +R K+      +E      E   L+ Q      +      E    ++ L ++++Q E+  
Sbjct: 656 SRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELE 715

Query: 481 KEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIK 540
           +++           ++L++L        +E   +  E + + + +E+L E+L      + 
Sbjct: 716 RQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALA 775

Query: 541 TLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQR 600
            L+ E +EL E ++ L  ++E L+ E+  A    D   +E   L +R        EE + 
Sbjct: 776 KLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEE 835

Query: 601 EAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDK 660
           E             + D   +E  E+E    +L  + ++   +  E   E+  L +  ++
Sbjct: 836 EIEELE-------EKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEE 888

Query: 661 LQAELQDQSVNYLFILTESIQEAEVSKRRRDWAFSERDKIEVI 703
           L+ EL++        L E  +E E  + R +   ++ +++EV 
Sbjct: 889 LEEELRELESE----LAELKEEIEKLRERLEELEAKLERLEVE 927



 Score = 72.1 bits (177), Expect = 2e-12
 Identities = 56/298 (18%), Positives = 123/298 (41%), Gaps = 23/298 (7%)

Query: 409 AVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQ----QCTAAIRQWDIALRERNE 464
             + LE  ++ +  ++KQ E + ++   A R +    +    +    + +     +E  E
Sbjct: 184 TEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEE 243

Query: 465 YQEALAKVQQQHEDAVKEINTAM--------------AVRMKATKDLKRLTEERNAAMQE 510
            +E L++++++ E+  +E+  A                   +  ++L  L EE      E
Sbjct: 244 LEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGE 303

Query: 511 YSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKR---EI 567
            SL+    + +  E+E+L E L +   KI+ L+ E +E     + L   +  L+    E+
Sbjct: 304 ISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEEL 363

Query: 568 ASALHDRDKALKECND-LRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEV 626
              L    + L+E  + LRE   E  A   E + E    + E        +   +   ++
Sbjct: 364 EEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDL 423

Query: 627 EAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSVNYLFILTESIQEAE 684
           +    +L  + ++ + +  E N+E++ L +  ++L+  L++        L E +Q  E
Sbjct: 424 KEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELEREL-AELQEELQRLE 480



 Score = 71.7 bits (176), Expect = 3e-12
 Identities = 49/261 (18%), Positives = 111/261 (42%), Gaps = 4/261 (1%)

Query: 377 HYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNT 436
                  + E +++E   L +  ++L      A  ++E  + E+  ++++ E++ +E   
Sbjct: 233 KLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLE 292

Query: 437 ALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKD 496
              E   L+ + +    + +    E  E +E L +++++ E   +E+     +  +  + 
Sbjct: 293 LKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQL 352

Query: 497 LKRLTEERNAAMQEYSLIMSERDSVHK----EMEKLSEDLTQAVRKIKTLEMENKELVED 552
           L  L E +    ++ S ++ E + + +    E+ +L  +L +   +++ L+ E + L E 
Sbjct: 353 LAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEER 412

Query: 553 KKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMR 612
            + LS ++E LK E+     + ++   E  +L E   E     EE +        E A  
Sbjct: 413 LERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAEL 472

Query: 613 MGERDSSRKERNEVEAASMDL 633
             E     KE + +EA    L
Sbjct: 473 QEELQRLEKELSSLEARLDRL 493



 Score = 64.4 bits (157), Expect = 5e-10
 Identities = 50/294 (17%), Positives = 108/294 (36%), Gaps = 34/294 (11%)

Query: 291 LYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDR 350
           L + +  E  +  +   +EL     + +    +LE A +E   L+S+  +L  + + L  
Sbjct: 229 LLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQE 288

Query: 351 ELLEL-----------------RSGCCTQPESDDAGSADSLNQH--YMSALNKYETIKDE 391
           ELLEL                       + E  +    +   +       L + ET+ +E
Sbjct: 289 ELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEE 348

Query: 392 YDALRKRFDDLINTHSTAVDKLELAQEE-VARIKKQCEDMNQEKNTALRERNGLKQQCTA 450
            + L    ++        +  L    EE    ++++  ++  E      E   LK     
Sbjct: 349 LEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELK----- 403

Query: 451 AIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQE 510
                    RE    +E L ++ ++ ED  +E+    A   +   +L+ L EE     ++
Sbjct: 404 ---------REIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQ 454

Query: 511 YSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLK 564
              +      + +E+ +L E+L +  +++ +LE     L  +++        L+
Sbjct: 455 LEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLE 508



 Score = 64.4 bits (157), Expect = 5e-10
 Identities = 60/329 (18%), Positives = 131/329 (39%), Gaps = 38/329 (11%)

Query: 308 KELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDD 367
            EL     + +    +LE   +E S L  +  +L  + +  ++E+ EL+S          
Sbjct: 225 LELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKS---------- 274

Query: 368 AGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQC 427
                           + E +++E + L++   +L         ++ L +E +  ++ + 
Sbjct: 275 ----------------ELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENEL 318

Query: 428 EDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAM 487
           E++ +       +   LK++           L ER    E L ++  + E+A +E+   +
Sbjct: 319 EELEERLEELKEKIEALKEE-----------LEERETLLEELEQLLAELEEAKEELEEKL 367

Query: 488 AVRMKATKDLK-RLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMEN 546
           +  ++  ++L   L EE      E + I +E + + +E+E L E L +   +++ L+ E 
Sbjct: 368 SALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEEL 427

Query: 547 KELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHAR 606
           KEL  + + L  ++E L  E+       ++      +L     E     +  ++E     
Sbjct: 428 KELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLE 487

Query: 607 FEYAMRMGERDSSRKERNEVEAASMDLFG 635
                   E+ +S+  R  +EA    L G
Sbjct: 488 ARLDRLEAEQRASQGVRAVLEALESGLPG 516



 Score = 56.6 bits (137), Expect = 1e-07
 Identities = 38/239 (15%), Positives = 99/239 (41%), Gaps = 11/239 (4%)

Query: 307  LKELEYYRGQHQAAMAQLEAAAQES----SSLRSKYSDLISDKQRLDRELLELRSGCCTQ 362
            ++ELE  R   Q  + +LE   +E+     +L  +   L   ++RL++E+ EL      +
Sbjct: 781  IEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELE-----E 835

Query: 363  PESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVAR 422
               +     D L +       + E +K+E + L    ++L +      ++ E  +EE+  
Sbjct: 836  EIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRE 895

Query: 423  IKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEAL--AKVQQQHEDAV 480
            ++ +  ++ +E          L+ +      +      E  E  E     +++++ E   
Sbjct: 896  LESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLE 955

Query: 481  KEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKI 539
            +EI     V ++A ++ + + E       +   +   ++ + + +E+L ++  +  ++ 
Sbjct: 956  EEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKEKRERFKET 1014



 Score = 43.9 bits (104), Expect = 9e-04
 Identities = 57/383 (14%), Positives = 134/383 (34%), Gaps = 62/383 (16%)

Query: 110 KLKKKEEVAR-IKKQCEDMNQEKNTALRERNGLKQ----QCTAAIRQWDIALRERNEYQE 164
            L++ E++   ++KQ E + ++   A R +    +    +    + +     +E  E +E
Sbjct: 187 NLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEE 246

Query: 165 ALAKVQQQHEDAVKEINTAMAVTNYLDGLQLNNSYDHGLTLESGLYICKISPGSLAAKEG 224
            L++++++ E+  +E+  A      L   +L    +    L+  L   K     +   EG
Sbjct: 247 ELSRLEEELEELQEELEEAEKEIEELKS-ELEELREELEELQEELLELKE---EIEELEG 302

Query: 225 SLAVGDRVLSINNRTVDGLSSAREG--MALLSDSPDVLTITTLKPTSLGEHSPRVHRKSV 282
            +++    L      ++ L    E     + +   ++    TL      E          
Sbjct: 303 EISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLL----EELE-------- 350

Query: 283 NSSTQTMPLYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEA----AAQESSSLRSKY 338
               Q +        E   +    L+ELE      +  +A+LEA       E   L+ + 
Sbjct: 351 ----QLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREI 406

Query: 339 SDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKR 398
             L    +RL   L +                   L +       + E ++ E + L + 
Sbjct: 407 ESLEERLERLSERLED-------------------LKEELKELEAELEELQTELEELNEE 447

Query: 399 FDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIA 458
            ++L        D+L+  + E+A ++++ + + +E ++     + L+            A
Sbjct: 448 LEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLE------------A 495

Query: 459 LRERNEYQEALAKVQQQHEDAVK 481
            +  ++   A+ +  +     V 
Sbjct: 496 EQRASQGVRAVLEALESGLPGVY 518


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 77.4 bits (191), Expect = 5e-14
 Identities = 98/591 (16%), Positives = 223/591 (37%), Gaps = 81/591 (13%)

Query: 112 KKKEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQ 171
           + +EE+  ++++ E + +       E    +Q   AA R+    L +     ++L ++Q+
Sbjct: 444 ELEEELEELQEELERLEEALEELREELEEAEQALDAAERE----LAQLQARLDSLERLQE 499

Query: 172 QHEDAVKEINTAMAVTNYLDGLQLNNSYDHGLTLESGLYICKISPGSLAAKEGSLAVGDR 231
             E   + +   +   + L G+        G+  E    +  +  G  AA E   A+G R
Sbjct: 500 NLEGFSEGVKALLKNQSGLSGI-------LGVLSE----LISVDEGYEAAIE--AALGGR 546

Query: 232 VLSINNRTVDGLSSAREGMALLSDSPDVLTITTLKPTSLGEHSPRVHRKSVNSSTQTMPL 291
              +    V+ L++A++ +A L  + ++  +T L   S+     + +             
Sbjct: 547 ---LQAVVVENLNAAKKAIAFLKQN-ELGRVTFLPLDSIKGTEIQGND------------ 590

Query: 292 YVRQHNETVRRGDHTLKELEYYRGQHQAAMA----------QLEAAAQESSSLRSKYSDL 341
             R+  + +       K+L  +  + + A++           L+ A + +  LR  Y  +
Sbjct: 591 --REILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYR-I 647

Query: 342 ISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDD 401
           ++    LD +L+    G  T       G +   N    S L +   I++    L ++ ++
Sbjct: 648 VT----LDGDLV-RPGGVIT-------GGSAKTNS---SILERRREIEE----LEEKIEE 688

Query: 402 LINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRE 461
           L        +K+   ++ +A ++K+ E++ +E     +E   L +Q +A  +       E
Sbjct: 689 LE-------EKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAE 741

Query: 462 RNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEER-------NAAMQEYSLI 514
             + +E +A++ ++  +   EI        +A ++L     E            +E   +
Sbjct: 742 VEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKAL 801

Query: 515 MSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDR 574
               D +  E+  L+E+      ++++LE          + L  QIE L  +I S   + 
Sbjct: 802 REALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEI 861

Query: 575 DKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLF 634
           ++  +   +L          +   +      R E      E      +R+E+     +L 
Sbjct: 862 EELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELR 921

Query: 635 GKCQKERMDNLEANQEIDRLR-KHTDKLQAELQDQSVNYLFILTESIQEAE 684
            K  +  +        ID L+ + +++    L++       I  +  +EA 
Sbjct: 922 EKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKI-EDDEEEAR 971



 Score = 63.5 bits (155), Expect = 1e-09
 Identities = 60/349 (17%), Positives = 132/349 (37%), Gaps = 54/349 (15%)

Query: 292 YVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQE-SSSLRS---------KYSDL 341
           Y  +  ET R+ + T           +  + +LE    E    L+S         +Y +L
Sbjct: 170 YKERRKETERKLERT-----------RENLDRLEDILNELERQLKSLERQAEKAERYKEL 218

Query: 342 ISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHY---MSALNKYETIKDEYDALRKR 398
            ++ + L+  LL                  + L +        L + E   +E  A  + 
Sbjct: 219 KAELRELELALL--------------VLRLEELREELEELQEELKEAEEELEELTAELQE 264

Query: 399 FDDLINTHSTAV----DKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQ 454
            ++ +      V    +++E  Q+E+  +  +   + Q+K         L++Q      Q
Sbjct: 265 LEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQ 324

Query: 455 WDIALRERNEYQEALAKV-------QQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAA 507
            +    + +E  E LA++       +++ E    E+    A   +    L+ L E+    
Sbjct: 325 LEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETL 384

Query: 508 MQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVE-----DKKTLSYQIET 562
             + + +  +  S++ E+E+L   L +   + + L+ E +EL++     + K L  ++E 
Sbjct: 385 RSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEE 444

Query: 563 LKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAM 611
           L+ E+     + ++  +   +LRE   E   A +  +RE    +     
Sbjct: 445 LEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDS 493



 Score = 60.1 bits (146), Expect = 1e-08
 Identities = 57/310 (18%), Positives = 126/310 (40%), Gaps = 45/310 (14%)

Query: 293  VRQHNETVRRGDHTLKELEYYRGQHQAAMAQ----LEAAAQESSSLRSKYSDLISDKQRL 348
            V Q  E + +    L ELE    + +  + +    L  A  E   L ++   L  + + L
Sbjct: 742  VEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKAL 801

Query: 349  DRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHST 408
               L ELR                                  E   L +   +L     +
Sbjct: 802  REALDELR---------------------------------AELTLLNEEAANLRERLES 828

Query: 409  AVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEA 468
               ++   +  +  +++Q E+++++  +   E   L++       + +  L ER   +EA
Sbjct: 829  LERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEA 888

Query: 469  LAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKL 528
            LA ++ + E+  +E+    + R +  ++L+ L E+          +    D++    E+L
Sbjct: 889  LALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNL---QERL 945

Query: 529  SEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRD-KALKECNDLRER 587
            SE+ +  + + + LE    ++ +D++    +++ L+ +I   L   +  A++E  +L+ER
Sbjct: 946  SEEYSLTLEEAEALE---NKIEDDEEEARRRLKRLENKIK-ELGPVNLAAIEEYEELKER 1001

Query: 588  FGEFTAAKEE 597
            +   TA KE+
Sbjct: 1002 YDFLTAQKED 1011



 Score = 58.9 bits (143), Expect = 2e-08
 Identities = 54/276 (19%), Positives = 100/276 (36%), Gaps = 42/276 (15%)

Query: 432 QEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRM 491
            E+   + E   L+++      +     +   E ++ L +++++ E   KE+        
Sbjct: 673 LERRREIEE---LEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQIS 729

Query: 492 KATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVE 551
              KDL RL  E     +  + +  E   +  E+E+L E L +A  ++   E E      
Sbjct: 730 ALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAE------ 783

Query: 552 DKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAM 611
                   IE L+ +I       ++  +E   LRE   E  A       EA + R     
Sbjct: 784 --------IEELEAQI-------EQLKEELKALREALDELRAELTLLNEEAANLRERLES 828

Query: 612 RMGERDSSRKERNEVEAASMDLFGKCQKERM--DNLEANQEIDRLRKHTDKLQAELQDQS 669
              ER  +  ER   +          Q E +  D      EI+ L +  ++L++EL+   
Sbjct: 829 L--ERRIAATERRLEDLEE-------QIEELSEDIESLAAEIEELEELIEELESELEA-- 877

Query: 670 VNYLFILTESIQEAEVSKRRRDWAFSERDKIEVIDS 705
                +L E     E     R       +++  ++S
Sbjct: 878 -----LLNERASLEEALALLRSELEELSEELRELES 908



 Score = 58.9 bits (143), Expect = 2e-08
 Identities = 65/372 (17%), Positives = 150/372 (40%), Gaps = 37/372 (9%)

Query: 291 LYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDR 350
           L V +  E     +   +EL+    + +   A+L+   ++   LR + S+L  + + L +
Sbjct: 229 LLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQK 288

Query: 351 ELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAV 410
           EL  L            A     L Q       +   ++ + + L  + ++L +      
Sbjct: 289 ELYAL------------ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELA 336

Query: 411 DKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALA 470
           ++L   +E++  +K++ E +  E    L E     ++  + + + +    +    +  +A
Sbjct: 337 EELAELEEKLEELKEELESLEAE----LEELEAELEELESRLEELE---EQLETLRSKVA 389

Query: 471 KVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSE 530
           +++ Q      EI      R++A   L+RL + R    QE   ++ + +    E+++L  
Sbjct: 390 QLELQIASLNNEI-----ERLEAR--LERLEDRRERLQQEIEELLKKLEE--AELKELQA 440

Query: 531 DLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKA---LKECNDLRER 587
           +L +   +++ L+ E + L E  + L  ++E  ++ + +A  +  +    L     L+E 
Sbjct: 441 ELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQEN 500

Query: 588 FGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAA-SMDLFGKCQKERMDNLE 646
              F+       +     +   +  +G            EAA    L G+ Q   ++NL 
Sbjct: 501 LEGFSEGV----KALLKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRLQAVVVENLN 556

Query: 647 -ANQEIDRLRKH 657
            A + I  L+++
Sbjct: 557 AAKKAIAFLKQN 568



 Score = 50.4 bits (121), Expect = 8e-06
 Identities = 49/292 (16%), Positives = 110/292 (37%), Gaps = 37/292 (12%)

Query: 381 ALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRE 440
            + + E +++E + L++   +          +L+  +E++  ++ +  ++ +E     +E
Sbjct: 230 LVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKE 289

Query: 441 RNGLKQQCTAAIRQWDIALRERN--EYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLK 498
              L           +I+  E+     +E LA +++Q E+   ++    +   +  ++L 
Sbjct: 290 LYALAN---------EISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELA 340

Query: 499 RLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSY 558
            L E+               + + +E+E L  +L +   +++ LE   +EL E       
Sbjct: 341 ELEEKL--------------EELKEELESLEAELEELEAELEELESRLEELEE------- 379

Query: 559 QIETLKREIASALHDRDKALKECNDLRERFGEFTAAKE---ECQREAFHARFEYAMRMGE 615
           Q+ETL+ ++A           E   L  R       +E   +   E      E  ++  +
Sbjct: 380 QLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQ 439

Query: 616 RDSSRKERNEVEAASMDLFGKCQKERMDNL--EANQEIDRLRKHTDKLQAEL 665
            +    E    E        +   E +     EA Q +D   +   +LQA L
Sbjct: 440 AELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARL 491



 Score = 38.9 bits (91), Expect = 0.028
 Identities = 53/396 (13%), Positives = 143/396 (36%), Gaps = 58/396 (14%)

Query: 115  EEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHE 174
            E++  ++++  ++ +      +E   L+++     ++ +   R+ +  ++ LA+++ + E
Sbjct: 684  EKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVE 743

Query: 175  DAVKEIN-TAMAVTNYLDGLQLNNSYDHGLTLESGLYICKISPGSLAAKEGSLAVGDRVL 233
               + I   +  +T     ++                + +       A+     +  ++ 
Sbjct: 744  QLEERIAQLSKELTELEAEIEELEER-----------LEEAEEELAEAEAEIEELEAQIE 792

Query: 234  SINNRTV---DGLSSAREGMALLSDSPDVLTITTLKPTSLGEHSPRVHRKSVNSSTQTMP 290
             +        + L   R  + LL++          +  SL        R+  +   Q   
Sbjct: 793  QLKEELKALREALDELRAELTLLNEE---AANLRERLESLERRIAATERRLEDLEEQI-- 847

Query: 291  LYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQL----EAAAQESSSLRSKYSDLISDKQ 346
                + +E +      ++ELE    + ++ +  L     +  +  + LRS+  +L  + +
Sbjct: 848  ---EELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELR 904

Query: 347  RLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRF------- 399
             L+ +  ELR               + L +       + E ++   D L++R        
Sbjct: 905  ELESKRSELR------------RELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLT 952

Query: 400  -DDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIA 458
             ++     +   D  E A+  + R++ + +++      A+ E   LK++           
Sbjct: 953  LEEAEALENKIEDDEEEARRRLKRLENKIKELGPVNLAAIEEYEELKER----------- 1001

Query: 459  LRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKAT 494
                   +E L + ++  E+A++EI+     R K T
Sbjct: 1002 YDFLTAQKEDLTEAKETLEEAIEEIDREARERFKDT 1037



 Score = 38.5 bits (90), Expect = 0.042
 Identities = 60/330 (18%), Positives = 118/330 (35%), Gaps = 46/330 (13%)

Query: 114  KEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIA---LRERNEYQEALAKVQ 170
            ++++AR++ + E + +      +E   L+ +      + + A   L E     E L    
Sbjct: 732  RKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQI 791

Query: 171  QQHEDAVKEINTAMAVTNYLDGLQ--LNNSYDHGLTLESGLYICKISPGSLAAKEGSLAV 228
            +Q ++ +K +  A      LD L+  L    +    L   L   +     +AA E  L  
Sbjct: 792  EQLKEELKALREA------LDELRAELTLLNEEAANLRERL---ESLERRIAATERRLED 842

Query: 229  GDRVLSINNRTVDGLSSAREGMALLSDSPDVLTITTLKPTSLGEHSPRVHRKSVNSSTQT 288
             +  +   +  ++ L++  E    L +  + L        +    S       + S  + 
Sbjct: 843  LEEQIEELSEDIESLAAEIEE---LEELIEELESELEALLNERA-SLEEALALLRSELEE 898

Query: 289  MPLYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQE-SSSLRSKYSDLI----- 342
            +   +R+           L+EL     Q +  +  LE         L  +YS  +     
Sbjct: 899  LSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEA 958

Query: 343  ------SDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALR 396
                   D++   R L  L                  L    ++A+ +YE +K+ YD L 
Sbjct: 959  LENKIEDDEEEARRRLKRLE------------NKIKELGPVNLAAIEEYEELKERYDFLT 1006

Query: 397  KRFDDLINTHSTAVDKLELAQEEVARIKKQ 426
             + +DL    + A + LE A EE+ R  ++
Sbjct: 1007 AQKEDL----TEAKETLEEAIEEIDREARE 1032



 Score = 34.6 bits (80), Expect = 0.55
 Identities = 64/436 (14%), Positives = 144/436 (33%), Gaps = 64/436 (14%)

Query: 308 KELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDD 367
           ++LE  R +      Q+ +   E   L ++   L   ++RL +E+ EL          + 
Sbjct: 379 EQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKEL 438

Query: 368 AGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVA---RIK 424
               + L +       + E +++  + LR+  ++       A  +L   Q  +    R++
Sbjct: 439 QAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQ 498

Query: 425 KQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQH-------- 476
           +  E  ++     L+ ++GL           ++ +     Y+ A+               
Sbjct: 499 ENLEGFSEGVKALLKNQSGLSG---ILGVLSEL-ISVDEGYEAAIEAALGGRLQAVVVEN 554

Query: 477 -EDAVKEIN----------TAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEM 525
              A K I           T + +      +++    E    ++ +  +  +      ++
Sbjct: 555 LNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKL 614

Query: 526 EKL----------SEDLTQAVRKIKTLEMENKELVEDK---------------------- 553
            K            +DL  A+   K L    + +  D                       
Sbjct: 615 RKALSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLDGDLVRPGGVITGGSAKTNSSILE 674

Query: 554 -----KTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFE 608
                + L  +IE L+ +IA       +  KE  +L E   +     EE  R+    R +
Sbjct: 675 RRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKD 734

Query: 609 YAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQ 668
            A    E +   +   ++     +L  + ++      EA +E+       ++L+A+++  
Sbjct: 735 LARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQL 794

Query: 669 SVNYLFILTESIQEAE 684
               L  L E++ E  
Sbjct: 795 K-EELKALREALDELR 809


>gnl|CDD|214504 smart00072, GuKc, Guanylate kinase homologues.  Active enzymes
            catalyze ATP-dependent phosphorylation of GMP to GDP.
            Structure resembles that of adenylate kinase. So-called
            membrane-associated guanylate kinase homologues (MAGUKs)
            do not possess guanylate kinase activities; instead at
            least some possess protein-binding functions.
          Length = 174

 Score = 65.8 bits (161), Expect = 5e-12
 Identities = 39/185 (21%), Positives = 76/185 (41%), Gaps = 28/185 (15%)

Query: 1481 GALSDTVTDKLLQDFPDKFVRCAP--------------EIMHCPQAAMEKGLADNLFVDY 1526
            G    T+  +L+Q+ PD F R                 +     +   E  +   LF+++
Sbjct: 2    GVGKGTLLAELIQEIPDAFERVVSHTTRPPRPGEVNGVDYHFVSKEEFEDDIKSGLFLEW 61

Query: 1527 -RKKGAYFECTSVAAVKDACDKNNAHVILDVSLAGVERLHRQHVYPIVLLIKFKSTKQIK 1585
               +G Y+  TS   ++   +K   H +LD+   GV++L +  +YPIV+ I   S+++++
Sbjct: 62   GEYEGNYYG-TSKETIRQVAEKGK-HCLLDIDPQGVKQLRKAQLYPIVIFIAPPSSEELE 119

Query: 1586 EIKEIRDGRYSSLDKVTAKAAKEMYEHGLKIECEHKHYISAVIP-AGVNIAYMCTQIQTA 1644
                 R          T++  ++      K   +  H    VI    +  AY   +++  
Sbjct: 120  RRLRQR-------GTETSERIQKRLAAAQKEA-QEYHLFDYVIVNDDLEDAY--EELKEI 169

Query: 1645 VELEQ 1649
            +E EQ
Sbjct: 170  LEAEQ 174


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 70.9 bits (174), Expect = 5e-12
 Identities = 66/306 (21%), Positives = 130/306 (42%), Gaps = 27/306 (8%)

Query: 379 MSALNKYETIKD-----EYDA-LRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQ 432
           MS + + + I +     E+D    K  ++L       +++L+L  +E    ++Q E + +
Sbjct: 150 MSPVERRKIIDEIAGVAEFDRKKEKALEEL-EEVEENIERLDLIIDEK---RQQLERLRR 205

Query: 433 EKNTALR---------ERNG--LKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVK 481
           E+  A R         E  G  L ++  A  RQ +   R+    +E L K+ ++  +  K
Sbjct: 206 EREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEK 265

Query: 482 EINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKT 541
            +     +  +  K +K L EE    ++E      +   +  E+  L   + +  R+++ 
Sbjct: 266 RLEEIEQLLEELNKKIKDLGEEEQLRVKE------KIGELEAEIASLERSIAEKERELED 319

Query: 542 LEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQRE 601
            E    +L  +   L  +IE L+REI      RDK  +E  +L+E   +  A  EE  +E
Sbjct: 320 AEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKE 379

Query: 602 AFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKL 661
               R E      + +  ++E NE++     L  + Q+   +  + N  I  +    ++L
Sbjct: 380 FAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINEL 439

Query: 662 QAELQD 667
           + E +D
Sbjct: 440 EEEKED 445



 Score = 57.0 bits (138), Expect = 9e-08
 Identities = 55/313 (17%), Positives = 125/313 (39%), Gaps = 29/313 (9%)

Query: 330 ESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETI- 388
           E   L  +   L   K+ ++R+L  L      +           L +         E + 
Sbjct: 224 EGYELLKEKEALERQKEAIERQLASLE-----EELEKLTEEISELEKRLEEIEQLLEELN 278

Query: 389 -------KDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRER 441
                  ++E   ++++  +L    ++    +   + E+   +++   +  E +  L E 
Sbjct: 279 KKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEI 338

Query: 442 NGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKD-LKRL 500
             L+++    I +     + R++  E  A+++++ ED   E+   +      T+D LK  
Sbjct: 339 EELERE----IEEER---KRRDKLTEEYAELKEELEDLRAELE-EVDKEFAETRDELKDY 390

Query: 501 TEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQI 560
            E+     +E + +  E D + +E+++LSE+L      I  +E +  EL E+K+  + +I
Sbjct: 391 REKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEI 450

Query: 561 ETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSR 620
           +  + ++     D  K  +E  DL+E +       +  ++E    + E A    +  +S 
Sbjct: 451 KKQEWKLEQLAADLSKYEQELYDLKEEY-------DRVEKELSKLQRELAEAEAQARASE 503

Query: 621 KERNEVEAASMDL 633
           +      A    L
Sbjct: 504 ERVRGGRAVEEVL 516



 Score = 53.9 bits (130), Expect = 7e-07
 Identities = 69/397 (17%), Positives = 158/397 (39%), Gaps = 64/397 (16%)

Query: 322  AQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSA 381
            A+L+   +    L+ + S L S+ +R++  L EL        E  DA       +     
Sbjct: 674  AELQRLRERLEGLKRELSSLQSELRRIENRLDELSQ------ELSDASRKIGEIE----- 722

Query: 382  LNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRER 441
              + E ++ E + L++R ++L    S+   ++E  + E+  ++ + E++ ++ +      
Sbjct: 723  -KEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEAL 781

Query: 442  NGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLT 501
            N L+ +   +  +      E ++ +E +++++ +  +  +++N       + T + + L 
Sbjct: 782  NDLEAR--LSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLN-------RLTLEKEYLE 832

Query: 502  EERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIE 561
            +E     ++   +  +  S+ KE+E L+    +   +++ LE   ++L      L  + +
Sbjct: 833  KEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERD 892

Query: 562  TLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRK 621
             L+ ++       ++   +    R+R  E  A  E  + E           + E +  + 
Sbjct: 893  ELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEE-----------LSEIEDPKG 941

Query: 622  ERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQD-QSVNYLFILTESI 680
            E  E+    + L    Q E                   +++ E++  + VN L     +I
Sbjct: 942  EDEEIPEEELSL-EDVQAEL-----------------QRVEEEIRALEPVNML-----AI 978

Query: 681  QEAEVSKRR-------RDWAFSERDKI-EVIDSYHTK 709
            QE E   +R       R     ER  I E I+ Y  K
Sbjct: 979  QEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKK 1015



 Score = 50.1 bits (120), Expect = 1e-05
 Identities = 48/230 (20%), Positives = 95/230 (41%), Gaps = 39/230 (16%)

Query: 309 ELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDA 368
           E+            +LE A +  + L ++   L+++ + L+RE+ E R            
Sbjct: 302 EIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEER------------ 349

Query: 369 GSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCE 428
              D L         +Y  +K+E + LR   +++    +   D+L+  +E++       E
Sbjct: 350 KRRDKLT-------EEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKL-------E 395

Query: 429 DMNQEKNTALRERNGL---KQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINT 485
            + +E N   RE + L    Q+ +  +   + A+    E    + +++++ ED   EI  
Sbjct: 396 KLKREINELKRELDRLQEELQRLSEELADLNAAI-AGIE--AKINELEEEKEDKALEIK- 451

Query: 486 AMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQA 535
                 K    L++L  + +   QE   +  E D V KE+ KL  +L +A
Sbjct: 452 ------KQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEA 495



 Score = 48.5 bits (116), Expect = 3e-05
 Identities = 58/345 (16%), Positives = 121/345 (35%), Gaps = 60/345 (17%)

Query: 301  RRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCC 360
             R D   +EL     +      ++E   QE   L+ +  +L  D   L++E+  ++S   
Sbjct: 702  NRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKS--- 758

Query: 361  TQPESDDAGSADSLNQHYMSALNKYE-TIKDEYDALRKRFDDLINTHSTAVDKLELAQEE 419
               E  +    ++  +     L+K E  + D    L       I    + ++      EE
Sbjct: 759  ---ELKEL---EARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLE------EE 806

Query: 420  VARIKKQCEDMNQ-------EKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEAL--- 469
            V+RI+ +  ++ Q       EK    +E   L++Q           ++   +  E L   
Sbjct: 807  VSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQ----RIDLKEQIKSIEKEIENLNGK 862

Query: 470  -AKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKL 528
              +++++ E+    +    +      K+   L  +     ++   + ++ +   K + +L
Sbjct: 863  KEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSEL 922

Query: 529  SEDLTQAVRKIKTLEMENKELVE------DKKTLSYQIETLKREIASALHD--------- 573
               L     ++  +E    E  E        + +  +++ ++ EI  AL           
Sbjct: 923  KAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEI-RALEPVNMLAIQEY 981

Query: 574  -------------RDKALKECNDLRERFGEFTAAKEECQREAFHA 605
                         R K  +E   + ER  E+   K E   EAF A
Sbjct: 982  EEVLKRLDELKEKRAKLEEERKAILERIEEYEKKKREVFMEAFEA 1026



 Score = 47.0 bits (112), Expect = 1e-04
 Identities = 54/336 (16%), Positives = 121/336 (36%), Gaps = 68/336 (20%)

Query: 308  KELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDD 367
            KE+E    + +    +LE   ++ SSL  +  ++ S+ + L+  + EL            
Sbjct: 723  KEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELE----------- 771

Query: 368  AGSADSLNQHYMSALNKYE-TIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQ 426
                          L+K E  + D    L       I    + ++      EEV+RI+ +
Sbjct: 772  ------------EDLHKLEEALNDLEARLSHSRIPEIQAELSKLE------EEVSRIEAR 813

Query: 427  CEDMNQ-------EKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEAL----AKVQQQ 475
              ++ Q       EK    +E   L++Q           ++   +  E L     +++++
Sbjct: 814  LREIEQKLNRLTLEKEYLEKEIQELQEQ----RIDLKEQIKSIEKEIENLNGKKEELEEE 869

Query: 476  HEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQA 535
             E+    +    +      K+   L  +     ++   + ++ +   K + +L   L   
Sbjct: 870  LEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEAL 929

Query: 536  VRKIKTLEMENKELVE------DKKTLSYQIETLKREIAS-------ALHDRDKALKECN 582
              ++  +E    E  E        + +  +++ ++ EI +       A+ + ++ LK  +
Sbjct: 930  EEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLD 989

Query: 583  DL----------RERFGEFTAAKEECQREAFHARFE 608
            +L          R+   E     E+ +RE F   FE
Sbjct: 990  ELKEKRAKLEEERKAILERIEEYEKKKREVFMEAFE 1025



 Score = 43.5 bits (103), Expect = 0.001
 Identities = 41/219 (18%), Positives = 93/219 (42%), Gaps = 19/219 (8%)

Query: 464 EYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHK 523
           E+     K  ++ E+  + I     +  +  + L+RL  ER  A + Y  ++ E+     
Sbjct: 167 EFDRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKA-ERYQALLKEKREY-- 223

Query: 524 EMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECND 583
           E  +L ++     R+ + +E +   L E+ + L+ +I  L++ +       ++  K+  D
Sbjct: 224 EGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKD 283

Query: 584 L--------RERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRK----ERNEVEAASM 631
           L        +E+ GE  A     +R    +  E    + + +        E +++ A   
Sbjct: 284 LGEEEQLRVKEKIGELEAEIASLER----SIAEKERELEDAEERLAKLEAEIDKLLAEIE 339

Query: 632 DLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSV 670
           +L  + ++ER    +  +E   L++  + L+AEL++   
Sbjct: 340 ELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDK 378



 Score = 37.7 bits (88), Expect = 0.067
 Identities = 49/277 (17%), Positives = 98/277 (35%), Gaps = 41/277 (14%)

Query: 291  LYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDR 350
            L  R  +  +      L +LE    + +A + ++E      +  +      I + Q    
Sbjct: 784  LEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRI 843

Query: 351  ELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAV 410
            +L E       + E+ +    + L +           ++     L+K  D+L        
Sbjct: 844  DLKEQIKSIEKEIENLNGKKEE-LEEELEELEAALRDLESRLGDLKKERDELEAQLRELE 902

Query: 411  DKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALA 470
             K+E  + ++ + +K+  ++ + K  AL E         + I        E  E + +L 
Sbjct: 903  RKIEELEAQIEKKRKRLSEL-KAKLEALEEEL-------SEIEDPKGEDEEIPEEELSLE 954

Query: 471  KVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSE 530
             VQ +                +  ++++ L      A+QEY       + V K +++L E
Sbjct: 955  DVQAE--------------LQRVEEEIRALEPVNMLAIQEY-------EEVLKRLDELKE 993

Query: 531  DLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREI 567
                   K   LE E K ++E       + E  KRE+
Sbjct: 994  -------KRAKLEEERKAILE----RIEEYEKKKREV 1019


>gnl|CDD|201332 pfam00595, PDZ, PDZ domain (Also known as DHR or GLGF).  PDZ domains
            are found in diverse signaling proteins.
          Length = 80

 Score = 61.5 bits (150), Expect = 1e-11
 Identities = 29/79 (36%), Positives = 39/79 (49%), Gaps = 2/79 (2%)

Query: 1240 LMIETRKCSNLGISLVGGNA--VGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATA 1297
            + +E      LG SLVGG+    GI+V  V  G    + GL+ GDRIL  NG DL   + 
Sbjct: 2    VTLEKSGRGGLGFSLVGGSDGDPGIFVSEVLPGGAAEAGGLQEGDRILSINGQDLENLSH 61

Query: 1298 EEAAYELAKPADKVTVLAQ 1316
            +EA   L     +VT+   
Sbjct: 62   DEAVLALKGSGGEVTLTIL 80



 Score = 54.2 bits (131), Expect = 6e-09
 Identities = 19/73 (26%), Positives = 33/73 (45%), Gaps = 2/73 (2%)

Query: 1137 SVEPLGIQIQ--CLDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRSATYQLAASV 1194
                LG  +        G+FVS V     A   GLQ GD++L + G ++ + ++  A   
Sbjct: 8    GRGGLGFSLVGGSDGDPGIFVSEVLPGGAAEAGGLQEGDRILSINGQDLENLSHDEAVLA 67

Query: 1195 LRQCGNSITMLVQ 1207
            L+  G  +T+ + 
Sbjct: 68   LKGSGGEVTLTIL 80



 Score = 45.3 bits (108), Expect = 8e-06
 Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 2/58 (3%)

Query: 208 GLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDVLTITTL 265
           G+++ ++ PG  AA+ G L  GDR+LSIN + ++ LS     +  L  S   +T+T L
Sbjct: 25  GIFVSEVLPGG-AAEAGGLQEGDRILSINGQDLENLSHDEA-VLALKGSGGEVTLTIL 80



 Score = 45.3 bits (108), Expect = 8e-06
 Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 2/58 (3%)

Query: 832 GLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDVLTITTL 889
           G+++ ++ PG  AA+ G L  GDR+LSIN + ++ LS     +  L  S   +T+T L
Sbjct: 25  GIFVSEVLPGG-AAEAGGLQEGDRILSINGQDLENLSHDEA-VLALKGSGGEVTLTIL 80


>gnl|CDD|238080 cd00136, PDZ, PDZ domain, also called DHR (Dlg homologous region) or
            GLGF (after a conserved sequence motif). Many PDZ domains
            bind C-terminal polypeptides, though binding to internal
            (non-C-terminal) polypeptides and even to lipids has been
            demonstrated. Heterodimerization through PDZ-PDZ domain
            interactions adds to the domain's versatility, and PDZ
            domain-mediated interactions may be modulated dynamically
            through target phosphorylation. Some PDZ domains play a
            role in scaffolding supramolecular complexes. PDZ domains
            are found in diverse signaling proteins in bacteria,
            archebacteria, and eurkayotes. This CD contains two
            distinct structural subgroups with either a N- or
            C-terminal beta-strand forming the peptide-binding groove
            base. The circular permutation placing the strand on the
            N-terminus appears to be found in Eumetazoa only, while
            the C-terminal variant is found in all three kingdoms of
            life, and seems to co-occur with protease domains. PDZ
            domains have been named after PSD95(post synaptic density
            protein), DlgA (Drosophila disc large tumor suppressor),
            and ZO1, a mammalian tight junction protein.
          Length = 70

 Score = 58.9 bits (143), Expect = 8e-11
 Identities = 29/65 (44%), Positives = 39/65 (60%), Gaps = 1/65 (1%)

Query: 1250 LGISLVGGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYEL-AKPA 1308
            LG S+ GG   G+ V SV+ GS    AGL+ GD IL  NGTD++  T E+ A  L  +  
Sbjct: 3    LGFSIRGGTEGGVVVLSVEPGSPAERAGLQAGDVILAVNGTDVKNLTLEDVAELLKKEVG 62

Query: 1309 DKVTV 1313
            +KVT+
Sbjct: 63   EKVTL 67



 Score = 54.6 bits (132), Expect = 3e-09
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 1140 PLGIQIQCLDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRSATYQLAASVLRQC- 1198
             LG  I+    GGV V +V   S A + GLQ GD +L V G ++++ T +  A +L++  
Sbjct: 2    GLGFSIRGGTEGGVVVLSVEPGSPAERAGLQAGDVILAVNGTDVKNLTLEDVAELLKKEV 61

Query: 1199 GNSITMLV 1206
            G  +T+ V
Sbjct: 62   GEKVTLTV 69



 Score = 43.4 bits (103), Expect = 2e-05
 Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 5/69 (7%)

Query: 201 HGLTL----ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDS 256
            G ++    E G+ +  + PGS A + G L  GD +L++N   V  L+       L  + 
Sbjct: 3   LGFSIRGGTEGGVVVLSVEPGSPAERAG-LQAGDVILAVNGTDVKNLTLEDVAELLKKEV 61

Query: 257 PDVLTITTL 265
            + +T+T  
Sbjct: 62  GEKVTLTVR 70



 Score = 43.4 bits (103), Expect = 2e-05
 Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 5/69 (7%)

Query: 825 HGLTL----ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDS 880
            G ++    E G+ +  + PGS A + G L  GD +L++N   V  L+       L  + 
Sbjct: 3   LGFSIRGGTEGGVVVLSVEPGSPAERAG-LQAGDVILAVNGTDVKNLTLEDVAELLKKEV 61

Query: 881 PDVLTITTL 889
            + +T+T  
Sbjct: 62  GEKVTLTVR 70


>gnl|CDD|201353 pfam00625, Guanylate_kin, Guanylate kinase. 
          Length = 183

 Score = 60.4 bits (147), Expect = 4e-10
 Identities = 35/153 (22%), Positives = 65/153 (42%), Gaps = 19/153 (12%)

Query: 1475 RPVILVGAL---SDTVTDKLLQDFPDKFVRCAPEIMHCPQA--------------AMEKG 1517
            RP++L G        +   LL ++P+KF          P+                ME  
Sbjct: 3    RPIVLSGPSGVGKSHIKKALLDEYPEKFGYSVSHTTRPPRPGEVDGKDYHFVSKEEMEND 62

Query: 1518 LADNLFVDYRKKGAYFECTSVAAVKDACDKNNAHVILDVSLAGVERLHRQHVYPIVLLIK 1577
            ++ N F++Y +    +  TS  A++    ++    ILDV + GV++L +  + PI + IK
Sbjct: 63   ISANEFLEYAEFNGNYYGTSKEAIEQIA-ESGKICILDVDIQGVKQLRKAELSPISVFIK 121

Query: 1578 FKSTKQI-KEIKEIRDGRYSSLDKVTAKAAKEM 1609
              S K + + +K     +   ++K    A +E 
Sbjct: 122  PPSLKVLQRRLKRRGTEQEEKINKRMEAAEQEF 154


>gnl|CDD|233695 TIGR02037, degP_htrA_DO, periplasmic serine protease, Do/DeqQ family.
             This family consists of a set proteins various
            designated DegP, heat shock protein HtrA, and protease
            DO. The ortholog in Pseudomonas aeruginosa is designated
            MucD and is found in an operon that controls mucoid
            phenotype. This family also includes the DegQ (HhoA)
            paralog in E. coli which can rescue a DegP mutant, but
            not the smaller DegS paralog, which cannot. Members of
            this family are located in the periplasm and have
            separable functions as both protease and chaperone.
            Members have a trypsin domain and two copies of a PDZ
            domain. This protein protects bacteria from thermal and
            other stresses and may be important for the survival of
            bacterial pathogens.// The chaperone function is dominant
            at low temperatures, whereas the proteolytic activity is
            turned on at elevated temperatures [Protein fate, Protein
            folding and stabilization, Protein fate, Degradation of
            proteins, peptides, and glycopeptides].
          Length = 428

 Score = 57.6 bits (140), Expect = 3e-08
 Identities = 49/200 (24%), Positives = 72/200 (36%), Gaps = 41/200 (20%)

Query: 1141 LGIQIQCLDS-----------GGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRSA--- 1186
            LG+ IQ + S            G  V+ V   S A + GL+ GD +  V G  + S    
Sbjct: 236  LGVTIQEVTSDLAKSLGLEKQRGALVAQVLPGSPAEKAGLKAGDVITSVNGKPISSFADL 295

Query: 1187 TYQLAAS---------VLRQCGN-SITMLVQYSPDKYHELEGSGSSSAENESVSGRGSGE 1236
               +            +LR+    +IT+ +  SP+         SSS     ++      
Sbjct: 296  RRAIGTLKPGKKVTLGILRKGKEKTITVTLGASPE------EQASSSNPFLGLTVANLSP 349

Query: 1237 PRFLMIETRKCSNLGISLVGGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAAT 1296
                  E RK   L      G+  G+ V  V SGS    AGL+ GD IL  N   + +  
Sbjct: 350  ------EIRKELRLK-----GDVKGVVVTKVVSGSPAARAGLQPGDVILSVNQQPVSSVA 398

Query: 1297 AEEAAYELAKPADKVTVLAQ 1316
                    AK   +V +L  
Sbjct: 399  ELRKVLARAKKGGRVALLIL 418



 Score = 32.2 bits (74), Expect = 2.5
 Identities = 18/46 (39%), Positives = 26/46 (56%), Gaps = 4/46 (8%)

Query: 208 GLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALL 253
           G+ + K+  GS AA+ G L  GD +LS+N + V   SS  E   +L
Sbjct: 363 GVVVTKVVSGSPAARAG-LQPGDVILSVNQQPV---SSVAELRKVL 404



 Score = 32.2 bits (74), Expect = 2.5
 Identities = 18/46 (39%), Positives = 26/46 (56%), Gaps = 4/46 (8%)

Query: 832 GLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALL 877
           G+ + K+  GS AA+ G L  GD +LS+N + V   SS  E   +L
Sbjct: 363 GVVVTKVVSGSPAARAG-LQPGDVILSVNQQPV---SSVAELRKVL 404



 Score = 31.4 bits (72), Expect = 3.9
 Identities = 26/102 (25%), Positives = 44/102 (43%), Gaps = 13/102 (12%)

Query: 780 SNVSKRMVLETVRSSGTSALMVIRRRRVGARSLVTTQLQLNNSYDHGLTLESGLYICKIS 839
           SN++K  V++ +   G      ++R  +G      T   L  S   GL  + G  + ++ 
Sbjct: 215 SNMAKN-VVDQLIEGGK-----VKRGWLGVTIQEVTS-DLAKSL--GLEKQRGALVAQVL 265

Query: 840 PGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSP 881
           PGS A K G L  GD + S+N + +   SS  +    +    
Sbjct: 266 PGSPAEKAG-LKAGDVITSVNGKPI---SSFADLRRAIGTLK 303



 Score = 31.0 bits (71), Expect = 6.3
 Identities = 16/56 (28%), Positives = 26/56 (46%), Gaps = 4/56 (7%)

Query: 202 GLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSP 257
           GL  + G  + ++ PGS A K G L  GD + S+N + +   SS  +    +    
Sbjct: 252 GLEKQRGALVAQVLPGSPAEKAG-LKAGDVITSVNGKPI---SSFADLRRAIGTLK 303


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 57.1 bits (138), Expect = 7e-08
 Identities = 50/305 (16%), Positives = 126/305 (41%), Gaps = 22/305 (7%)

Query: 390 DEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCT 449
           ++Y+ L +   ++I      +++LE    E+    +   +  +E+   L++   ++++  
Sbjct: 167 EKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEE-- 224

Query: 450 AAIRQWDIALRERNEYQEALAKVQ---QQHEDAVKEINTAMAVRMKATKDLKRLTEERNA 506
              ++ +   +E    +E LA+++   ++ E+    +    ++ ++A K  +    E   
Sbjct: 225 ---QEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELER 281

Query: 507 AMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMEN---KELVEDKKTLSYQIETL 563
            ++E    +   + + +E+E+L E+L      ++ LE      K L E  + L  ++E L
Sbjct: 282 LLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKL 341

Query: 564 KREIASA---LHDRDKALKE-CNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSS 619
           + E+       ++  K L+E   +L ER  E        + E    R +      +    
Sbjct: 342 ESELEELAEEKNELAKLLEERLKELEERLEELEK-----ELEKALERLKQLEEAIQELKE 396

Query: 620 RKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSVNYLFILTES 679
                E+ AA  ++  + ++   +  E  +E++ L +   KL+ ++       L I   +
Sbjct: 397 ELA--ELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELA 454

Query: 680 IQEAE 684
               +
Sbjct: 455 GAGEK 459



 Score = 52.8 bits (127), Expect = 2e-06
 Identities = 65/430 (15%), Positives = 168/430 (39%), Gaps = 63/430 (14%)

Query: 304 DHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQP 363
           +  ++ELE      +A + +LE   ++  SL  +   L    ++L+ EL EL        
Sbjct: 297 EREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEEL------AE 350

Query: 364 ESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARI 423
           E ++              L + E   +E +   ++  + +     A   ++  +EE+A +
Sbjct: 351 EKNELAKLLE------ERLKELEERLEELEKELEKALERLKQLEEA---IQELKEELAEL 401

Query: 424 KKQCEDMNQEKNTALRERNGLKQQCT------AAIRQWDIALRERNEYQEALA------- 470
               E++ +E     +E   L+++          + +    L  +      LA       
Sbjct: 402 SAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCP 461

Query: 471 ------------KVQQQHEDAVKEINTAMAVRMKATK---DLKRLTEERNAAMQEYSLIM 515
                       ++ + +E  ++E+   ++   +  +   +++ L +E     +E   ++
Sbjct: 462 VCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELL 521

Query: 516 SERDSVHKEMEKLSEDLTQAVRKIKTL--EMENKELVEDKKTLSYQIETLKREIASALHD 573
              +++ +E+E+  E L   + +++ L  +++ ++L E+ + L  +++ LK  +      
Sbjct: 522 ELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLL 581

Query: 574 RDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDL 633
           R +  +   +LRER  E     +E +     ++ E  +   +     +  NE+E A  +L
Sbjct: 582 RTRKEEL-EELRERLKELKKKLKELEER--LSQLEELL---QSLELSEAENELEEAEEEL 635

Query: 634 FGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSVNYLFILTESIQEAEVSKRRRDWA 693
                +  ++ L    E++ L +       EL+++       L   I+         +  
Sbjct: 636 -----ESELEKLNLQAELEELLQ---AALEELEEKVEE----LEAEIRRELQRIENEEQL 683

Query: 694 FSERDKIEVI 703
             + +++E +
Sbjct: 684 EEKLEELEQL 693



 Score = 47.8 bits (114), Expect = 5e-05
 Identities = 49/294 (16%), Positives = 121/294 (41%), Gaps = 42/294 (14%)

Query: 301 RRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCC 360
              +H  + LE Y  + +    +L    +E+  LR +  +L  + + L+ EL+EL     
Sbjct: 467 LPEEHEKELLELYELELEELEEELSREKEEAE-LREEIEELEKELRELEEELIEL----- 520

Query: 361 TQPESDDAGSADSLNQHYMSALNKYETIK------------DEYDALRKRFDDLINTH-- 406
            + E       +   +   + L + E +K             + +   +   +L+     
Sbjct: 521 LELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRL 580

Query: 407 -STAVDKLELAQEEVARIKK---QCEDMNQEKNTALRERNGLKQ--------QCTAAIRQ 454
             T  ++LE  +E +  +KK   + E+   +    L+     +         +   +  +
Sbjct: 581 LRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELE 640

Query: 455 WDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLI 514
               L  + E +E L    ++ E+ V+E+   +   ++  ++ ++L E+     +E   +
Sbjct: 641 ---KLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKL----EELEQL 693

Query: 515 MSERDSVHKEMEKLSEDLT---QAVRKIKTLEMENKELVEDKKTLSYQIETLKR 565
             E + + +E+E+L + L    Q + ++++ + E +EL ++ + L   +E L+ 
Sbjct: 694 EEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKALELLEE 747


>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
           This family consists of several plant proteins of
           unknown function. Several sequences in this family are
           described as being "myosin heavy chain-like".
          Length = 484

 Score = 56.2 bits (135), Expect = 1e-07
 Identities = 74/343 (21%), Positives = 141/343 (41%), Gaps = 49/343 (14%)

Query: 307 LKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESD 366
           LKELE  +   +    +LE A +E    +             D EL +LR+      E  
Sbjct: 63  LKELESTKRTVEDLKLKLEKAEKEEQQAKQ------------DSELAKLRA------EEL 104

Query: 367 DAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQ 426
           + G  +   + Y++A  + +++K+E   +R+ +D L+     A+ + E A       +K+
Sbjct: 105 EQGIQELEVERYITATAELDSVKEELRKIRQEYDALVEERDAALKRAEEAICASKVNEKK 164

Query: 427 CEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTA 486
            E++ +E                 A  +   A  E+++ +E   K  ++ E  ++ +   
Sbjct: 165 VEELTKEIIAMKESLERAHAAHLEAEEERIGAALEKDQDRETYEKELKEAEKELERLKQD 224

Query: 487 MAVRMKATKDLKRLTE--ERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKT--- 541
               +   KDL++L E      ++Q+   IM+   SV KE+E+   +L +A  + K+   
Sbjct: 225 ----LDPEKDLEKLAEASAELESLQKEISIMASVASVLKELEEAKANLEKAAEEEKSLRN 280

Query: 542 --------LEMENKELVEDKKTLSYQIETLKREIASALHDR---------DKALKECNDL 584
                   LE E KEL E +     + E    E AS+L             KA ++  ++
Sbjct: 281 LVESLKQELEEEKKELEELR-----EKEGEAEEAASSLEAELNRTKSEKESKAREKMVEI 335

Query: 585 RERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVE 627
             +  + ++  EE ++EA  AR E      E + ++      E
Sbjct: 336 PLKLQQASSEAEEARKEAEAAREELRKLKEEAEQTKAALETAE 378



 Score = 48.1 bits (114), Expect = 3e-05
 Identities = 61/261 (23%), Positives = 110/261 (42%), Gaps = 13/261 (4%)

Query: 413 LELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKV 472
           L LAQEE  ++KK+ E   +EK   L+E    K+       + + A +E  + ++     
Sbjct: 38  LMLAQEEANKLKKELEVAEKEKLQVLKELESTKRTVEDLKLKLEKAEKEEQQAKQDSELA 97

Query: 473 QQQHEDAVKEI-NTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSED 531
           + + E+  + I    +   + AT +L  + EE     QEY  ++ ERD+  K  E     
Sbjct: 98  KLRAEELEQGIQELEVERYITATAELDSVKEELRKIRQEYDALVEERDAALKRAE----- 152

Query: 532 LTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERF-GE 590
             +A+   K  E + +EL ++   +   +E        A  +R  A  E +  RE +  E
Sbjct: 153 --EAICASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEEERIGAALEKDQDRETYEKE 210

Query: 591 FTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKE----RMDNLE 646
              A++E +R       E  +      S+  E  + E + M       KE    + +  +
Sbjct: 211 LKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKEISIMASVASVLKELEEAKANLEK 270

Query: 647 ANQEIDRLRKHTDKLQAELQD 667
           A +E   LR   + L+ EL++
Sbjct: 271 AAEEEKSLRNLVESLKQELEE 291



 Score = 48.1 bits (114), Expect = 4e-05
 Identities = 58/295 (19%), Positives = 108/295 (36%), Gaps = 7/295 (2%)

Query: 380 SALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARI-KKQCEDMNQEKNTAL 438
           S     E +K + +   K           A  + E  ++ +  +  ++      E ++  
Sbjct: 68  STKRTVEDLKLKLEKAEKEEQQAKQDSELAKLRAEELEQGIQELEVERYITATAELDSVK 127

Query: 439 RERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLK 498
            E   ++Q+  A + + D AL+   E   A    +++ E+  KEI        +A     
Sbjct: 128 EELRKIRQEYDALVEERDAALKRAEEAICASKVNEKKVEELTKEIIAMKESLERAHAAHL 187

Query: 499 RLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAV---RKIKTLEMENKELVEDKKT 555
              EER  A  E        +   KE EK  E L Q +   + ++ L   + EL   +K 
Sbjct: 188 EAEEERIGAALEKDQDRETYEKELKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKE 247

Query: 556 LSYQ--IETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRM 613
           +S    + ++ +E+  A  + +KA +E   LR          EE ++E    R +     
Sbjct: 248 ISIMASVASVLKELEEAKANLEKAAEEEKSLRNLVESLKQELEEEKKELEELREKEGEAE 307

Query: 614 GERDSSRKERNEV-EAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQD 667
               S   E N            K  +  +   +A+ E +  RK  +  + EL+ 
Sbjct: 308 EAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRK 362



 Score = 37.7 bits (87), Expect = 0.049
 Identities = 58/277 (20%), Positives = 114/277 (41%), Gaps = 21/277 (7%)

Query: 308 KELEYYRGQHQAA--MAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCC--TQP 363
           KELE  +        + +L  A+ E  SL+ + S + S    + +EL E ++      + 
Sbjct: 216 KELERLKQDLDPEKDLEKLAEASAELESLQKEISIMASVA-SVLKELEEAKANLEKAAEE 274

Query: 364 ESDDAGSADSLNQHYMSALNKYETIKDEYDALR-KRFDDLINTHSTAVDKLELAQEEVAR 422
           E       +SL Q       + E  K E + LR K  +      S   +      E+ ++
Sbjct: 275 EKSLRNLVESLKQ-------ELEEEKKELEELREKEGEAEEAASSLEAELNRTKSEKESK 327

Query: 423 IKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKE 482
            +++  ++  +   A  E    +++  AA  +      E  + + AL   + + E A+KE
Sbjct: 328 AREKMVEIPLKLQQASSEAEEARKEAEAAREELRKLKEEAEQTKAALETAELRLEAALKE 387

Query: 483 INTAMAVRMKATKDLKRLTEERNAAMQEYS----LIMSERDSV---HKEMEKLSE-DLTQ 534
              A A    A  ++K L E   +A  +      L + E + +    +E E+L+E  +  
Sbjct: 388 AEAAKAAEALALAEIKALQESEESAKADSPRKITLSLEEYEELSKKAEEAEELAEKKVAA 447

Query: 535 AVRKIKTLEMENKELVEDKKTLSYQIETLKREIASAL 571
           A+ +++  +    E ++  +  S +IE  K  +  AL
Sbjct: 448 ALAQVEEAKESENESLKKLEEASKEIEEKKAALEEAL 484



 Score = 33.8 bits (77), Expect = 0.91
 Identities = 40/227 (17%), Positives = 81/227 (35%), Gaps = 27/227 (11%)

Query: 461 ERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDL---KRLTEERNAAMQEYSLIMSE 517
                +  L   Q++     KE+  A   +++  K+L   KR  E+    +++      E
Sbjct: 30  RVLVKETELMLAQEEANKLKKELEVAEKEKLQVLKELESTKRTVEDLKLKLEKAE---KE 86

Query: 518 RDSVHKEMEKLSEDLTQAVRKIKTLEMENKEL-VEDKKTLSYQIETLKREIASALHDRDK 576
                ++ E       +  + I+ LE+E       +  ++  ++  +++E  + + +RD 
Sbjct: 87  EQQAKQDSELAKLRAEELEQGIQELEVERYITATAELDSVKEELRKIRQEYDALVEERDA 146

Query: 577 ALKEC--------------NDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSR-- 620
           ALK                 +L +       + E        A  E      E+D  R  
Sbjct: 147 ALKRAEEAICASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEEERIGAALEKDQDRET 206

Query: 621 --KERNEVEAASMDLFGKC--QKERMDNLEANQEIDRLRKHTDKLQA 663
             KE  E E     L      +K+     EA+ E++ L+K    + +
Sbjct: 207 YEKELKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKEISIMAS 253


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 54.7 bits (132), Expect = 4e-07
 Identities = 84/409 (20%), Positives = 160/409 (39%), Gaps = 68/409 (16%)

Query: 293 VRQHNETVRRGDHTLKE-LEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRE 351
           V    E +   D  L++ LE  R   QA   + E+  +++  L  +  +L  +   L+ E
Sbjct: 312 VEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESE 371

Query: 352 LLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVD 411
           L E R     + E                   + E +++E + LR+RF D       A D
Sbjct: 372 LEEAREAVEDRRE-------------------EIEELEEEIEELRERFGDAPVDLGNAED 412

Query: 412 KLELAQEEVARIKKQCEDM---------NQEKNTALRERNGLKQQCTAAIRQWDIA---- 458
            LE  +EE   ++++  ++           E+  AL E  G   +C   +          
Sbjct: 413 FLEELREERDELREREAELEATLRTARERVEEAEALLEA-GKCPECGQPVEGSPHVETIE 471

Query: 459 -LRERNEYQEA-LAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMS 516
             RER E  EA L  ++++ E+  + +  A  +  +A   ++RL E R    +   LI  
Sbjct: 472 EDRERVEELEAELEDLEEEVEEVEERLERAEDLV-EAEDRIERLEERREDLEE---LIAE 527

Query: 517 ERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDR-D 575
            R+++ ++ E+  E   +A      LE E +E               KRE A+   +  +
Sbjct: 528 RRETIEEKRERAEELRERA----AELEAEAEE---------------KREAAAEAEEEAE 568

Query: 576 KALKECNDLRERFGEFTAAKEECQR--EAFHARFEYAMRMGERDSSRKERNEVEAASMDL 633
           +A +E  +L  +  E     E  +R      A  +    +      R+   E+     + 
Sbjct: 569 EAREEVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKREALAELNDERRER 628

Query: 634 FGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSVNYLFILTESIQE 682
             + ++ER   LEA  +  R+    ++ + + + ++  YL  + E + E
Sbjct: 629 LAE-KRERKRELEAEFDEARI----EEAREDKE-RAEEYLEQVEEKLDE 671



 Score = 45.4 bits (108), Expect = 3e-04
 Identities = 42/193 (21%), Positives = 84/193 (43%), Gaps = 17/193 (8%)

Query: 516 SERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRD 575
           SE   + +E+E+  E   QA     T +  ++ L E ++    ++ETL+ EI        
Sbjct: 213 SELAELDEEIERYEEQREQARE---TRDEADEVLEEHEERRE-ELETLEAEIEDLRETIA 268

Query: 576 KALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFG 635
           +  +E  +L E   +     EE + E      E  +   + ++    R E+E    +L  
Sbjct: 269 ETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRD 328

Query: 636 KCQKERMDNLEANQEIDRLRKHTDKLQ---AELQDQSVNYLFILTESIQEAEVSKRRRDW 692
           + ++ R+     N+E + LR+  D L+    EL++++      L   ++EA  +   R  
Sbjct: 329 RLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAE----LESELEEAREAVEDR-- 382

Query: 693 AFSERDKIEVIDS 705
               R++IE ++ 
Sbjct: 383 ----REEIEELEE 391



 Score = 41.9 bits (99), Expect = 0.003
 Identities = 63/381 (16%), Positives = 133/381 (34%), Gaps = 66/381 (17%)

Query: 309 ELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDA 368
           E+E Y  Q + A    + A +       +  +L    + L+ E+ +LR    T  E++  
Sbjct: 221 EIERYEEQREQARETRDEADEVLEEHEERREEL----ETLEAEIEDLRE---TIAETER- 272

Query: 369 GSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLEL----------AQE 418
                          + E + +E   LR+R ++L       + +  L           +E
Sbjct: 273 ---------------EREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARRE 317

Query: 419 EVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHED 478
           E+    ++  D  +E   A +  N   +            L ER E      +++++  +
Sbjct: 318 ELEDRDEELRDRLEECRVAAQAHNEEAESLREDADD----LEERAE------ELREEAAE 367

Query: 479 AVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRK 538
              E+  A        ++++ L EE     + +     +  +    +E+L E+  +   +
Sbjct: 368 LESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELRER 427

Query: 539 IKTLEM----------ENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERF 588
              LE           E + L+E  K                + +  + ++E     E  
Sbjct: 428 EAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPHVETIEEDRERVEELEAELE-- 485

Query: 589 GEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEAN 648
                 +E  + E    R E  +   +R    +ER E          +   ER + +E  
Sbjct: 486 ---DLEEEVEEVEERLERAEDLVEAEDRIERLEERRE-------DLEELIAERRETIEEK 535

Query: 649 QE-IDRLRKHTDKLQAELQDQ 668
           +E  + LR+   +L+AE +++
Sbjct: 536 RERAEELRERAAELEAEAEEK 556



 Score = 39.6 bits (93), Expect = 0.014
 Identities = 58/319 (18%), Positives = 122/319 (38%), Gaps = 50/319 (15%)

Query: 308 KELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSG--------- 358
           +E+E  R +   A   L  A      LR +  +L   +  L+  L   R           
Sbjct: 391 EEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLE 450

Query: 359 ---C--CTQP--ESDDAGSADSLNQHYMSALNKYETIKDEYDALR-------------KR 398
              C  C QP   S    + +   +       + E +++E + +               R
Sbjct: 451 AGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDR 510

Query: 399 FDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIA 458
            + L        + +   +E +   +++ E++ +       E    ++    A  + + A
Sbjct: 511 IERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEA 570

Query: 459 LRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTE------ERNAAMQEYS 512
             E  E    LA+++++ E +++ I T +A    A  +++RL E      E N   +E  
Sbjct: 571 REEVAELNSKLAELKERIE-SLERIRTLLAAIADAEDEIERLREKREALAELNDERRER- 628

Query: 513 LIMSERDSVHKEM---------EKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETL 563
             ++E+    +E+         E+  ED  +A   ++ +E +  EL E++  L  +I  +
Sbjct: 629 --LAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAV 686

Query: 564 KREIAS--ALHDRDKALKE 580
           + E+     L +R +AL+ 
Sbjct: 687 ENELEELEELRERREALEN 705



 Score = 34.6 bits (80), Expect = 0.48
 Identities = 37/183 (20%), Positives = 68/183 (37%), Gaps = 13/183 (7%)

Query: 526 EKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKR---EIASALHDRDKALKECN 582
           EK  +DL     ++  LE E  EL E+ +    Q E  +    E    L + ++  +E  
Sbjct: 198 EKEEKDL---HERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELE 254

Query: 583 DLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERM 642
            L     +      E +RE    R E A  + +    R+   E+E    DL  +   +  
Sbjct: 255 TLEAEIEDLRETIAETERE----REELAEEVRDL---RERLEELEEERDDLLAEAGLDDA 307

Query: 643 DNLEANQEIDRLRKHTDKLQAELQDQSVNYLFILTESIQEAEVSKRRRDWAFSERDKIEV 702
           D        + L    ++L+  L++  V       E+    E +    + A   R++   
Sbjct: 308 DAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAE 367

Query: 703 IDS 705
           ++S
Sbjct: 368 LES 370


>gnl|CDD|212793 cd11859, SH3_ZO, Src homology 3 domain of the Tight junction
            proteins, Zonula occludens (ZO) proteins.  ZO proteins
            are scaffolding proteins that associate with each other
            and with other proteins of the tight junction, zonula
            adherens, and gap junctions. They play roles in
            regulating cytoskeletal dynamics at these cell junctions.
            They are considered members of the MAGUK
            (membrane-associated guanylate kinase) protein family,
            which is characterized by the presence of a core of three
            domains: PDZ, SH3, and guanylate kinase (GuK). The GuK
            domain in MAGUK proteins is enzymatically inactive;
            instead, the domain mediates protein-protein interactions
            and associates intramolecularly with the SH3 domain.
            Vertebrates contain three ZO proteins (ZO-1, ZO-2, and
            ZO-3) with redundant and non-redundant roles. They
            contain three PDZ domains, followed by SH3 and GuK
            domains; in addition, ZO-1 and ZO-2 contains a
            proline-rich (PR) actin binding domain at the C-terminus
            while ZO-3 contains this PR domain between the second and
            third PDZ domains. The C-terminal regions of the three ZO
            proteins are unique. The SH3 domain of ZO-1 has been
            shown to bind ZONAB, ZAK, afadin, and Galpha12. SH3
            domains are protein interaction domains that bind to
            proline-rich ligands with moderate affinity and
            selectivity, preferentially to PxxP motifs. They play
            versatile and diverse roles in the cell including the
            regulation of enzymes, changing the subcellular
            localization of signaling pathway components, and
            mediating the formation of multiprotein complex
            assemblies.
          Length = 62

 Score = 46.9 bits (112), Expect = 1e-06
 Identities = 18/64 (28%), Positives = 33/64 (51%), Gaps = 2/64 (3%)

Query: 1334 YMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGIIPSK 1393
            Y+R  F    +     +L F K ++ +V +T++ G  G W+A  V  +    + G+IP+K
Sbjct: 1    YIRTHFDY--EKPAKGELSFKKGEVFHVVDTLYQGTVGSWQAVRVGRNHQELERGVIPNK 58

Query: 1394 YKVE 1397
             + E
Sbjct: 59   SRAE 62


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 51.7 bits (124), Expect = 3e-06
 Identities = 78/360 (21%), Positives = 142/360 (39%), Gaps = 60/360 (16%)

Query: 318 QAAMAQLEAAAQESSSLRS-KYSDLISDKQRLDRELLELRSG------CCTQPESDDAGS 370
           Q  +AQLE ++   + L S KY+ L S  ++L  E   L +        C + E +    
Sbjct: 220 QERLAQLEGSSLGPNQLGSKKYNLLQSQLEQLQEENFRLEAARDDYRIKCEELEKE-LAE 278

Query: 371 ADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDM 430
               N    S   + + +KDE D LR+           + DK +  + +V   KK+ ED+
Sbjct: 279 LQHRNDELTSLAAESQALKDEIDVLRE-----------SSDKAKKLEAQVETYKKKLEDL 327

Query: 431 NQEKNT--ALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMA 488
           N  +     L ERN +  Q T    Q +  L++ N  +  L   ++Q ++   +++    
Sbjct: 328 NDLRRQVKLLEERNAMYMQNT---VQLEEELKKANAARGQLETYKRQVQELHAKLSEESK 384

Query: 489 VRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKE 548
              K   + KRL E+  A  +E   +++ERDS           L +   +++  + +  +
Sbjct: 385 KADKLEFEYKRLEEKLEALQKEKERLLAERDS-----------LRETNEELRCGQAQQDQ 433

Query: 549 LVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFE 608
           L +     S   + L  E+  +            + RE+           Q E       
Sbjct: 434 LTQADAGTSPSGDNLAAELLPS------------EYREKL-------IRLQHE------N 468

Query: 609 YAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQ 668
             +R+G+  S  +   E++    D   +  +       ANQ I  L++  + LQ  LQ+Q
Sbjct: 469 KMLRLGQEGSENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVEDLQKALQEQ 528



 Score = 47.5 bits (113), Expect = 6e-05
 Identities = 99/502 (19%), Positives = 193/502 (38%), Gaps = 84/502 (16%)

Query: 132 NTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQ---QHEDAVKEINTAMAVTN 188
             A  E++ L Q+C    +Q  +   E+N  Q+   K+Q+   Q E +    N   +   
Sbjct: 182 KEAQEEKDELAQRCHELDKQVLLLQEEKNSLQQENEKLQERLAQLEGSSLGPNQLGSKKY 241

Query: 189 YLDGLQLNNSYDHGLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSARE 248
            L   QL    +    LE+     +I    L          ++ L+      D L+S   
Sbjct: 242 NLLQSQLEQLQEENFRLEAARDDYRIKCEEL----------EKELAELQHRNDELTSLAA 291

Query: 249 GMALLSDSPDVLTITTLK----PTSLGEHSPRVH-----RKSVNSSTQTMPLYV---RQH 296
               L D  DVL  ++ K       +  +  ++      R+ V    +   +Y+    Q 
Sbjct: 292 ESQALKDEIDVLRESSDKAKKLEAQVETYKKKLEDLNDLRRQVKLLEERNAMYMQNTVQL 351

Query: 297 NETVRRGDHTLKELEYYR-------GQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRL- 348
            E +++ +    +LE Y+        +      + +    E   L  K   L  +K+RL 
Sbjct: 352 EEELKKANAARGQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEKLEALQKEKERLL 411

Query: 349 -DRELL-----ELRSG-----CCTQPESDDAGSADSLNQHYMSA---------------- 381
            +R+ L     ELR G       TQ ++  + S D+L    + +                
Sbjct: 412 AERDSLRETNEELRCGQAQQDQLTQADAGTSPSGDNLAAELLPSEYREKLIRLQHENKML 471

Query: 382 -LNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRE 440
            L +  +  +    L++  +D    ++    +L LA + +  +++Q ED+ +    AL+E
Sbjct: 472 RLGQEGSENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVEDLQK----ALQE 527

Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRL 500
           +    +  +    + +  L + +E  E L K ++Q E+   + +  ++ ++   +   + 
Sbjct: 528 QGSKSEDSSLLKSKLEEHLEQLHEANEELQKKREQIEELEPDQDQNLSRKIAELEAALQK 587

Query: 501 TEERNAAMQE-YSLIMSERDSVHK-----------EMEKLSEDLTQAVRKIKTLEMENKE 548
            +E   AM+E Y   + +   V K           E++ L + LT+  ++I+ LE E   
Sbjct: 588 KDEDMRAMEERYKKYVEKAREVIKTLDPKQNPASPEIQLLKKQLTERDKRIRHLESE--- 644

Query: 549 LVEDKKTLSYQIETLKREIASA 570
             E  K +  Q E L   I SA
Sbjct: 645 -YEKAKPMREQEEKL---IVSA 662



 Score = 39.4 bits (92), Expect = 0.018
 Identities = 41/219 (18%), Positives = 89/219 (40%), Gaps = 15/219 (6%)

Query: 382 LNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRER 441
           L K    + + +  +++  +L    S    K +  + E  R++++ E + +EK   L ER
Sbjct: 355 LKKANAARGQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEKLEALQKEKERLLAER 414

Query: 442 NGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLT 501
           + L++       +      ++++  +A A      ++   E+  +   R K    L RL 
Sbjct: 415 DSLRE----TNEELRCGQAQQDQLTQADAGTSPSGDNLAAELLPS-EYREK----LIRLQ 465

Query: 502 EERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIE 561
            E         L + +  S ++ + +L + L  A R+   LE + +   +    L  Q+E
Sbjct: 466 HENKM------LRLGQEGSENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVE 519

Query: 562 TLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQR 600
            L++ +       + +    + L E   +   A EE Q+
Sbjct: 520 DLQKALQEQGSKSEDSSLLKSKLEEHLEQLHEANEELQK 558



 Score = 39.0 bits (91), Expect = 0.025
 Identities = 55/240 (22%), Positives = 97/240 (40%), Gaps = 25/240 (10%)

Query: 435 NTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQ---QHEDAVKEINTAMAVR- 490
             A  E++ L Q+C    +Q  +   E+N  Q+   K+Q+   Q E +    N   + + 
Sbjct: 182 KEAQEEKDELAQRCHELDKQVLLLQEEKNSLQQENEKLQERLAQLEGSSLGPNQLGSKKY 241

Query: 491 MKATKDLKRLTEER---NAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENK 547
                 L++L EE     AA  +Y +   E +    E++  +++LT    + + L+ E  
Sbjct: 242 NLLQSQLEQLQEENFRLEAARDDYRIKCEELEKELAELQHRNDELTSLAAESQALKDEID 301

Query: 548 ELVE--DK-KTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFH 604
            L E  DK K L  Q+ET K           K L++ NDLR +        EE       
Sbjct: 302 VLRESSDKAKKLEAQVETYK-----------KKLEDLNDLRRQ----VKLLEERNAMYMQ 346

Query: 605 ARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAE 664
              +    + + +++R +    +    +L  K  +E     +   E  RL +  + LQ E
Sbjct: 347 NTVQLEEELKKANAARGQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEKLEALQKE 406



 Score = 30.5 bits (69), Expect = 8.5
 Identities = 32/167 (19%), Positives = 64/167 (38%), Gaps = 19/167 (11%)

Query: 506 AAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKR 565
            A+QE   +MS+        E       Q    +K    + KE  E+K  L+ +   L +
Sbjct: 148 TAIQE---LMSKEQGSSPSRESAGNLDQQ----LKKALEDLKEAQEEKDELAQRCHELDK 200

Query: 566 EIASALHDRDKALKECNDLRERFGEFTAAKEE----CQREAFHARFEYAMRMGERDSSRK 621
           ++     +++   +E   L+ER  +   +         ++    + +        +  ++
Sbjct: 201 QVLLLQEEKNSLQQENEKLQERLAQLEGSSLGPNQLGSKKYNLLQSQL-------EQLQE 253

Query: 622 ERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQ 668
           E   +EAA  D   KC++   +  E     D L     + QA L+D+
Sbjct: 254 ENFRLEAARDDYRIKCEELEKELAELQHRNDELTSLAAESQA-LKDE 299


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 52.0 bits (125), Expect = 3e-06
 Identities = 83/469 (17%), Positives = 178/469 (37%), Gaps = 61/469 (13%)

Query: 295 QHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISD-KQRL----D 349
           +  E ++R +  L +L   R        QL   A    SL     +     ++ +     
Sbjct: 508 EALEALQRAERRLLQL---RQALDELELQLSPQA---GSLLHFLRNEAPGWEESIGKVIS 561

Query: 350 RELLELRSGCCTQPESDDAGSADSLN--QHYMSALNKYETIKDEYDALRKRFDDLINTHS 407
            ELLE        P+  +   +D+L      +  L+  +   +E   LR+R         
Sbjct: 562 PELLER---TDLDPQLVEGSDSDTLYGVSLDLQRLDVPDYAANE-TELRERLQQAEEALQ 617

Query: 408 TAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALR-ERNEYQ 466
           +AV K + A+E++ +   + E+  + +  A        +Q    +++     +  +++ +
Sbjct: 618 SAVAKQKQAEEQLVQANAELEEQKRAEAEARTA----LKQARLDLQRLQNEQQSLKDKLE 673

Query: 467 EALAKVQQQHEDAVKEINTAMAVRMKATKDLK-----RLTEERNAAMQEYSLIMSERDSV 521
            A+A+ +QQ E  +++++  +   ++  +           E R   + ++ ++  E D+ 
Sbjct: 674 LAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAKWQVVEGELDNQ 733

Query: 522 HKEMEKLSEDL-TQAVRKIKTLEMENKELVEDK-------KTLSYQIETLKREIASALHD 573
             ++    E   TQA  ++K L+ +    +          K L  QIE L+  I      
Sbjct: 734 LAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVR 793

Query: 574 RDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSS----RKERNEVEAA 629
           R +  +    ++E +    + +EE            A+++ E +SS    ++E   +   
Sbjct: 794 RPEVREYRAFMQETWLHRDSLREERPN--------LAIQLRELESSAEELQQELTRLIKD 845

Query: 630 SMDLFGKCQKERMDN----LEANQEIDRLRKHTDKLQAELQDQSVNYL-FILTESIQEAE 684
           +     K ++ER        + ++ +  LR    +L    +  + N     ++E + + E
Sbjct: 846 TKLRRKKLEQERKALEKQLDQLDELLRGLRDEMRQLAELKEPANANQAEGSISERLDQLE 905

Query: 685 VSKRRRDWAFSERDK-IEVIDSYHTKHAGRHKRGKELDKNGGTWPKARS 732
             KR+R     +  K +E  D     H      G  L +    W K RS
Sbjct: 906 EFKRKRKRLSGDLKKFLERFDGVIKDH-----SGSGLYET---WQKLRS 946



 Score = 40.1 bits (94), Expect = 0.014
 Identities = 65/407 (15%), Positives = 131/407 (32%), Gaps = 79/407 (19%)

Query: 311 EYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGS 370
           E  +        QLEA   E +    +       + RL R    L S   T  E +    
Sbjct: 420 EDLQALESQLRQQLEAGKLEFNEEEYEL------ELRLGRLKQRLDSATATPEELEQLEI 473

Query: 371 ADSLNQHYMSALNKY----ETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQ 426
            D   +       +     E ++ E   LRKR D+ +     A  +L   ++ +  ++ Q
Sbjct: 474 NDEALEKAQEEQEQAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQ 533

Query: 427 C--------EDMNQEK-----------NTALRERNGLKQQCTAAIR-------QWDIALR 460
                      +  E            +  L ER  L  Q               D+   
Sbjct: 534 LSPQAGSLLHFLRNEAPGWEESIGKVISPELLERTDLDPQLVEGSDSDTLYGVSLDLQRL 593

Query: 461 ERNEY-------QEALAKVQ---QQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQE 510
           +  +Y       +E L + +   Q      K+    +       ++ KR   E   A+++
Sbjct: 594 DVPDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQ 653

Query: 511 YSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASA 570
             L + +R    ++  K         +    +    ++     + L  Q++ L  +  + 
Sbjct: 654 ARLDL-QRLQNEQQSLK--------DKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAF 704

Query: 571 LHDRDKALKECNDLRERFGEFTAA--KEECQREAFHARFEYAMRMGERDSSRKERNEVEA 628
           L    +ALK+  D RE   E  A     E + +   A+   A+      +  K R +   
Sbjct: 705 L----EALKD--DFRELRTERLAKWQVVEGELDNQLAQLSAAIE--AARTQAKARLK--- 753

Query: 629 ASMDLFGKCQKERMDNLEAN----QEIDRLRKHTDKLQAELQDQSVN 671
                  + +K+    L +       +  L++  ++L+  ++  +V 
Sbjct: 754 -------ELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVR 793



 Score = 38.5 bits (90), Expect = 0.037
 Identities = 45/271 (16%), Positives = 91/271 (33%), Gaps = 50/271 (18%)

Query: 411 DKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAI----RQWDIALRERNEYQ 466
           ++ E  QE   R+++Q   +  +   A  E N       A +     + ++   ++  ++
Sbjct: 281 EEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELLEDQKGAFE 340

Query: 467 EALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEME 526
           +A  +  Q   D +  I + +    +    L  LT +     ++Y           +  +
Sbjct: 341 DADIEQLQADLDQLPSIRSELE---EVEARLDALTGKHQDVQRKYE----------RLKQ 387

Query: 527 KLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRE 586
           K+ E L + + K            E    +  + +  K  I   L          + LR+
Sbjct: 388 KIKEQLERDLEKNN----------ERLAAIREEKDRQKAAIEEDLQ------ALESQLRQ 431

Query: 587 RFGEFTAAKEECQREAFHARFEYAMRMGERDSSRK------ERNEVEAASMDLFGKCQKE 640
           +     A K E   E +       +R+G             E  E    + +   K Q+E
Sbjct: 432 QLE---AGKLEFNEEEYE----LELRLGRLKQRLDSATATPEELEQLEINDEALEKAQEE 484

Query: 641 ----RMDNLEANQEIDRLRKHTDKLQAELQD 667
                 +  +   E+ +LRK  D+    LQ 
Sbjct: 485 QEQAEANVEQLQSELRQLRKRRDEALEALQR 515



 Score = 37.4 bits (87), Expect = 0.076
 Identities = 71/420 (16%), Positives = 148/420 (35%), Gaps = 67/420 (15%)

Query: 313 YRGQHQAAMAQLEAAAQESSSLRSKYSDLISD-KQRLDRELLELRSGCCTQPESDDAGSA 371
            R + +   A+L+A   +   ++ KY  L    K++L+R+L +         E  D   A
Sbjct: 357 IRSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKA 416

Query: 372 DSLNQHY---MSALNK-YETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIK--- 424
             + +      S L +  E  K E++      +  +      +D      EE+ +++   
Sbjct: 417 A-IEEDLQALESQLRQQLEAGKLEFNEEEYELELRLGRLKQRLDSATATPEELEQLEIND 475

Query: 425 KQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAV---- 480
           +  E   +E+  A      L+ +     ++ D AL      +  L +++Q  ++      
Sbjct: 476 EALEKAQEEQEQAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQLS 535

Query: 481 ----------------------KEINTAMAVRMKATKDL--KRLTEERNAAMQEYSLIMS 516
                                 K I+  +  R     DL  + +    +  +   SL + 
Sbjct: 536 PQAGSLLHFLRNEAPGWEESIGKVISPELLER----TDLDPQLVEGSDSDTLYGVSLDLQ 591

Query: 517 --ERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDR 574
             +         +L E L QA   +++   + K+  E     + ++E  KR  A A    
Sbjct: 592 RLDVPDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEAR--- 648

Query: 575 DKALKECNDLRER-FGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDL 633
             ALK+     +R   E  + K++ +                +  +  +  +++A    L
Sbjct: 649 -TALKQARLDLQRLQNEQQSLKDKLEL----------AIAERKQQAETQLRQLDAQLKQL 697

Query: 634 FGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSV---NYLFILTESIQEAEVSKRRR 690
                +++   LEA +  D  R+   +  A+ Q       N L  L+ +I+ A    + R
Sbjct: 698 ----LEQQQAFLEALK--DDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKAR 751


>gnl|CDD|238489 cd00989, PDZ_metalloprotease, PDZ domain of bacterial and plant zinc
            metalloprotases, presumably membrane-associated or
            integral membrane proteases, which may be involved in
            signalling and regulatory mechanisms. May be responsible
            for substrate recognition and/or binding, as most PDZ
            domains bind C-terminal polypeptides, and binding to
            internal (non-C-terminal) polypeptides and even to lipids
            has been demonstrated. In this subfamily of
            protease-associated PDZ domains a C-terminal beta-strand
            forms the peptide-binding groove base, a circular
            permutation with respect to PDZ domains found in
            Eumetazoan signaling proteins.
          Length = 79

 Score = 46.1 bits (110), Expect = 4e-06
 Identities = 24/80 (30%), Positives = 35/80 (43%), Gaps = 8/80 (10%)

Query: 1250 LGISLVGGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELAK-PA 1308
            LG    GG  +   +  V  GS    AGL+ GDRIL  NG   +  + E+    + + P 
Sbjct: 3    LGFV-PGGPPIEPVIGEVVPGSPAAKAGLKAGDRILAINGQ--KIKSWEDLVDAVQENPG 59

Query: 1309 DKVTVLAQYSVERYNEVKDK 1328
              +T+    +VER  E    
Sbjct: 60   KPLTL----TVERNGETITL 75



 Score = 34.1 bits (79), Expect = 0.060
 Identities = 15/53 (28%), Positives = 28/53 (52%), Gaps = 1/53 (1%)

Query: 1155 VSTVSEHSLASQVGLQVGDQLLEVCGINMRSATYQLAASVLRQCGNSITMLVQ 1207
            +  V   S A++ GL+ GD++L + G  ++S    L  +V    G  +T+ V+
Sbjct: 16   IGEVVPGSPAAKAGLKAGDRILAINGQKIKSWE-DLVDAVQENPGKPLTLTVE 67



 Score = 31.8 bits (73), Expect = 0.45
 Identities = 14/29 (48%), Positives = 20/29 (68%), Gaps = 1/29 (3%)

Query: 213 KISPGSLAAKEGSLAVGDRVLSINNRTVD 241
           ++ PGS AAK G L  GDR+L+IN + + 
Sbjct: 18  EVVPGSPAAKAG-LKAGDRILAINGQKIK 45



 Score = 31.8 bits (73), Expect = 0.45
 Identities = 14/29 (48%), Positives = 20/29 (68%), Gaps = 1/29 (3%)

Query: 837 KISPGSLAAKEGSLAVGDRVLSINNRTVD 865
           ++ PGS AAK G L  GDR+L+IN + + 
Sbjct: 18  EVVPGSPAAKAG-LKAGDRILAINGQKIK 45


>gnl|CDD|221961 pfam13180, PDZ_2, PDZ domain. 
          Length = 81

 Score = 44.9 bits (107), Expect = 9e-06
 Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 4/78 (5%)

Query: 1250 LGISLVGGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELAKPAD 1309
            +G+ +V     G+ V SV+ GS    AGL+ GD IL  +G  + + T         KP D
Sbjct: 3    IGVRVVQNEGTGVTVVSVKEGSPAAKAGLKPGDIILSIDGKKVNSLTELIEVILNGKPGD 62

Query: 1310 KVTVLAQYSVERYNEVKD 1327
             V +    +V R  + K 
Sbjct: 63   TVKL----TVYRDGKKKT 76



 Score = 34.5 bits (80), Expect = 0.041
 Identities = 20/59 (33%), Positives = 33/59 (55%), Gaps = 4/59 (6%)

Query: 206 ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSP-DVLTIT 263
            +G+ +  +  GS AAK G L  GD +LSI+ + V+ L+   E   +L+  P D + +T
Sbjct: 12  GTGVTVVSVKEGSPAAKAG-LKPGDIILSIDGKKVNSLTELIE--VILNGKPGDTVKLT 67



 Score = 34.5 bits (80), Expect = 0.041
 Identities = 20/59 (33%), Positives = 33/59 (55%), Gaps = 4/59 (6%)

Query: 830 ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSP-DVLTIT 887
            +G+ +  +  GS AAK G L  GD +LSI+ + V+ L+   E   +L+  P D + +T
Sbjct: 12  GTGVTVVSVKEGSPAAKAG-LKPGDIILSIDGKKVNSLTELIE--VILNGKPGDTVKLT 67



 Score = 32.6 bits (75), Expect = 0.24
 Identities = 16/47 (34%), Positives = 28/47 (59%)

Query: 1141 LGIQIQCLDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRSAT 1187
            +G+++   +  GV V +V E S A++ GL+ GD +L + G  + S T
Sbjct: 3    IGVRVVQNEGTGVTVVSVKEGSPAAKAGLKPGDIILSIDGKKVNSLT 49


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 50.0 bits (119), Expect = 1e-05
 Identities = 89/440 (20%), Positives = 157/440 (35%), Gaps = 62/440 (14%)

Query: 304 DHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQP 363
           +   KE+E  R    A  A+ + AAQE S  +++   L ++ + L  +  ++R       
Sbjct: 269 NRLDKEVEAERSALAAMKAKCDRAAQELSRKKTELLGLQTELETLANQDSDMR------- 321

Query: 364 ESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARI 423
                   D L +    A  +   ++ E DALR   +   NT +     L+ AQEE A  
Sbjct: 322 -----QHLDKLKEDLTRAEQEKAILQTEVDALRYELERKHNTLTKKTASLQAAQEEKATY 376

Query: 424 KKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEI 483
             + EDM        R+R             ++   R+    Q+ +  +Q+      + +
Sbjct: 377 AGEIEDM--------RDR-------------YEKTERKLRVLQKKIENLQETFRRKERRL 415

Query: 484 NTAMAVRMKATKDL---KRLTEERNAAMQEYSLIMSERDSVHKEMEKL-SEDLTQAVRKI 539
                 R+++ +         E+   A+ E   I+          E+   E+     ++ 
Sbjct: 416 KEE-KERLRSLQTDTNTDTALEKLEKALAEKERIIERLKEQRDRDERYEQEEFETYKKEF 474

Query: 540 KTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEEC- 598
           + L+ E + L         Q+E LK E++    ++    K+ +DL     E    +E+  
Sbjct: 475 EDLKEEVQNLQLKLSERELQLELLKEEVSKLASNQL---KQRSDLERAHIELEKIREKHE 531

Query: 599 ----QREAFHARFEYAMRMGE--------RDSSRKERNEVEAASMDLFGKCQKERMDNLE 646
               + +   A  E A R           R  S   RNEV+   +D   K ++ER D   
Sbjct: 532 KLEKELKRLRANPESADRGSAVDAGTSRSRADSAGARNEVDRL-LDRLEKAEQERDD--- 587

Query: 647 ANQEIDRLRKHTDKLQAELQDQSVNYLFILTE----SIQEAEVSKRRRDWAFSERDKIEV 702
              E  RL K  +K Q  L  Q         E    S +  E ++R       E    + 
Sbjct: 588 TEMEAGRLAKELEKAQRHLTKQQEKTEATRIEFERKSAELLEEAERLEKSEAEEETLRQS 647

Query: 703 IDSYHTKHAGRHKRGKELDK 722
               H + A  +     + K
Sbjct: 648 TQIGHAQAAAHNHIEHHVQK 667



 Score = 33.4 bits (76), Expect = 1.4
 Identities = 54/282 (19%), Positives = 107/282 (37%), Gaps = 19/282 (6%)

Query: 384 KYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNG 443
           + E  + E + L+K  +   +   ++++ ++       + ++        + T  +E+  
Sbjct: 4   QLENAQRENELLKKELERKQSKLGSSMNSIKTFWSPELKRERALRKDEAARITVWKEQLR 63

Query: 444 LKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEE 503
           + Q+     +    AL+E  + Q  L ++QQ+ E +  E      +  K+     R+  E
Sbjct: 64  VTQEENQHAQLTVQALQEELKSQRELNRLQQKLEKSGNETGELQQLE-KSEGQFSRIQAE 122

Query: 504 RNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETL 563
           R     E  L+    + +   ME+L + L     +IK L    +      K      E L
Sbjct: 123 RERQAMELFLLRDTLEKLQLRMERLQQTLDARDEEIKKLLEMLQSKGLSAKAEEEDSERL 182

Query: 564 KREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKER 623
            R     + + +        L E+  +    +    RE  H + +      ERD ++ E 
Sbjct: 183 GR-----IREAEDQQSHLEVLLEQKEK----EHMMLREEIHQKLQ-----MERDDAKTEA 228

Query: 624 NEVEAASMDLFGKCQKERMDNLE-ANQEIDRLRKHTDKLQAE 664
           ++     +D      KE    LE A  EI RL+   D  +A+
Sbjct: 229 SQ---KLIDEKDTKIKEFEKMLEKAENEIYRLQSRCDTSEAD 267


>gnl|CDD|238487 cd00987, PDZ_serine_protease, PDZ domain of tryspin-like serine
            proteases, such as DegP/HtrA, which are oligomeric
            proteins involved in heat-shock response, chaperone
            function, and apoptosis. May be responsible for substrate
            recognition and/or binding, as most PDZ domains bind
            C-terminal polypeptides, though binding to internal
            (non-C-terminal) polypeptides and even to lipids has been
            demonstrated. In this subfamily of protease-associated
            PDZ domains a C-terminal beta-strand forms the
            peptide-binding groove base, a circular permutation with
            respect to PDZ domains found in Eumetazoan signaling
            proteins.
          Length = 90

 Score = 44.2 bits (105), Expect = 2e-05
 Identities = 22/53 (41%), Positives = 28/53 (52%)

Query: 1261 GIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELAKPADKVTV 1313
            G+ V SV  GS    AGL+ GD IL  NG  +++      A    KP DKVT+
Sbjct: 25   GVLVASVDPGSPAAKAGLKPGDVILAVNGKPVKSVADLRRALAELKPGDKVTL 77



 Score = 36.5 bits (85), Expect = 0.010
 Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 202 GLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSARE 248
           GL    G+ +  + PGS AAK G L  GD +L++N + V  ++  R 
Sbjct: 19  GLKDTKGVLVASVDPGSPAAKAG-LKPGDVILAVNGKPVKSVADLRR 64



 Score = 36.5 bits (85), Expect = 0.010
 Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 826 GLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSARE 872
           GL    G+ +  + PGS AAK G L  GD +L++N + V  ++  R 
Sbjct: 19  GLKDTKGVLVASVDPGSPAAKAG-LKPGDVILAVNGKPVKSVADLRR 64



 Score = 31.8 bits (73), Expect = 0.55
 Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 13/79 (16%)

Query: 1141 LGIQIQCL-----------DSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRSATYQ 1189
            LG+ +Q L           D+ GV V++V   S A++ GL+ GD +L V G  ++S    
Sbjct: 3    LGVTVQDLTPDLAEELGLKDTKGVLVASVDPGSPAAKAGLKPGDVILAVNGKPVKSVA-D 61

Query: 1190 LAASVLR-QCGNSITMLVQ 1207
            L  ++   + G+ +T+ V 
Sbjct: 62   LRRALAELKPGDKVTLTVL 80


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 46.1 bits (110), Expect = 4e-05
 Identities = 21/105 (20%), Positives = 43/105 (40%), Gaps = 11/105 (10%)

Query: 460 RERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERD 519
               + ++ L +++Q+  +   +I      R +        TEER   ++E   +  E  
Sbjct: 69  TRLEKLKKELEELKQRIAELQAQIEKLKKGREE--------TEERTELLEELKQLEKELK 120

Query: 520 SVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLK 564
            +  E+EK  ++  + + K+K    E K   E     +  I +LK
Sbjct: 121 KLKAELEKYEKNDPERIEKLK---EETKVAKEAANRWTDNIFSLK 162



 Score = 30.7 bits (70), Expect = 4.0
 Identities = 13/68 (19%), Positives = 28/68 (41%), Gaps = 8/68 (11%)

Query: 108 DGKLKKKEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALA 167
               K K  + ++KK+ E++ Q           L+ Q     +  +    ER E  E L 
Sbjct: 62  QALNKLKTRLEKLKKELEELKQRI-------AELQAQIEKLKKGREETE-ERTELLEELK 113

Query: 168 KVQQQHED 175
           +++++ + 
Sbjct: 114 QLEKELKK 121


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 47.0 bits (112), Expect = 6e-05
 Identities = 39/235 (16%), Positives = 83/235 (35%), Gaps = 15/235 (6%)

Query: 359 CCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQE 418
                 + D      + +   +   K    +D+   L K+   L    ++   +L    +
Sbjct: 28  AAAFSAAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETAD 87

Query: 419 EVARIKKQCEDMNQEKN---TALRERNG-LKQQCTAAIRQWD---IALRERNEYQEALAK 471
           ++ +++KQ  D+N   N      RE+   L +Q  A  R       AL    E  +A   
Sbjct: 88  DLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPE--DAQRS 145

Query: 472 V--QQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLS 529
           V     +          +       K L  +  E  A   E + ++SE+ +   ++ +L 
Sbjct: 146 VRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLL 205

Query: 530 EDLTQAVRK----IKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKE 580
           E+  + + +    +   + + +EL  ++  L  +I + +   A A      A   
Sbjct: 206 EERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAA 260



 Score = 36.6 bits (85), Expect = 0.11
 Identities = 37/212 (17%), Positives = 69/212 (32%), Gaps = 28/212 (13%)

Query: 295 QHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLE 354
              + +R       +LE      +  +A LEA   E++    K    I+D   L+  L  
Sbjct: 49  ALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIAD---LNARLNA 105

Query: 355 LRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLE 414
           L      Q        A+ L     S  N          AL    +D   +   A+    
Sbjct: 106 LE----VQEREQRRRLAEQLAALQRSGRN-------PPPALLVSPEDAQRSVRLAIYYGA 154

Query: 415 LAQEEVARI---KKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAK 471
           L      RI   K   + +   +     E+  L              L E+   Q  LA+
Sbjct: 155 LNPARAERIDALKATLKQLAAVRAEIAAEQAELTT-----------LLSEQRAQQAKLAQ 203

Query: 472 VQQQHEDAVKEINTAMAVRMKATKDLKRLTEE 503
           + ++ +  + ++N+ ++   K  ++L+     
Sbjct: 204 LLEERKKTLAQLNSELSADQKKLEELRANESR 235



 Score = 33.9 bits (78), Expect = 0.70
 Identities = 42/217 (19%), Positives = 74/217 (34%), Gaps = 33/217 (15%)

Query: 517 ERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDK 576
           +   + KE+  L + + +   +   LE + K L  +       I +L+ ++     D  K
Sbjct: 39  QLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETE-------IASLEAQLIETADDLKK 91

Query: 577 ALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMG------ERDSSRKERNEVEAAS 630
             K+  DL  R       + E QR     +     R G         S    +  V  A 
Sbjct: 92  LRKQIADLNARLNALEVQERE-QRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAI 150

Query: 631 MDLFGKCQKER---MDNLEANQ-EIDRLRKHTDKLQAELQDQSVNYLFILTESIQEAEVS 686
              +G     R   +D L+A   ++  +R      QAEL          L+E   +    
Sbjct: 151 --YYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTL-------LSEQRAQQAKL 201

Query: 687 KRRRDWAFSERDKIEVIDSYHTKHAGRHKRGKELDKN 723
            +       ER K       +++ +   K+ +EL  N
Sbjct: 202 AQLL----EERKKTL--AQLNSELSADQKKLEELRAN 232


>gnl|CDD|212961 cd12028, SH3_ZO-3, Src homology 3 domain of the Tight junction
            protein, Zonula occludens protein 3.  ZO-3 is a
            scaffolding protein that associates with other ZO
            proteins and other proteins of the tight junction, zonula
            adherens, and gap junctions. ZO proteins play roles in
            regulating cytoskeletal dynamics at these cell junctions.
            ZO-3 is critical for epidermal barrier function. It
            regulates cyclin D1-dependent cell proliferation. It is
            considered a member of the MAGUK (membrane-associated
            guanylate kinase) protein family, which is characterized
            by the presence of a core of three domains: PDZ, SH3, and
            guanylate kinase (GuK). The GuK domain in MAGUK proteins
            is enzymatically inactive; instead, the domain mediates
            protein-protein interactions and associates
            intramolecularly with the SH3 domain. The C-terminal
            region of ZO-3 is the smallest of the three ZO proteins.
            The SH3 domain of the related protein ZO-1 has been shown
            to bind ZONAB, ZAK, afadin, and Galpha12. SH3 domains are
            protein interaction domains that bind to proline-rich
            ligands with moderate affinity and selectivity,
            preferentially to PxxP motifs. They play versatile and
            diverse roles in the cell including the regulation of
            enzymes, changing the subcellular localization of
            signaling pathway components, and mediating the formation
            of multiprotein complex assemblies.
          Length = 65

 Score = 42.2 bits (99), Expect = 7e-05
 Identities = 22/67 (32%), Positives = 33/67 (49%), Gaps = 2/67 (2%)

Query: 1331 DSFYMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGII 1390
            DSFY+R  F    D      L F + ++ +V +TM  G  G W A  +  D    + GII
Sbjct: 1    DSFYIRTHFDYEPD--PPSGLSFTRGEVFHVLDTMHRGKLGSWLAVRMGRDLREMEKGII 58

Query: 1391 PSKYKVE 1397
            P++ + E
Sbjct: 59   PNQSRAE 65


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 47.4 bits (113), Expect = 8e-05
 Identities = 57/264 (21%), Positives = 112/264 (42%), Gaps = 42/264 (15%)

Query: 382 LNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKN--TALR 439
           + + E I++      K  ++++   +    +L   +EE+ +++K+ +++ + K     L 
Sbjct: 185 IKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELE 244

Query: 440 ERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKR 499
           +     +    + R+ +  +RE  E  E L K  ++ E+ VKE+           K+LK 
Sbjct: 245 KELESLEG---SKRKLEEKIRELEERIEELKKEIEELEEKVKEL-----------KELKE 290

Query: 500 LTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQ 559
             EE     + Y   + E   + K + +L E++     +IK LE + + L E KK    +
Sbjct: 291 KAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKK----K 346

Query: 560 IETLKREIAS-----ALHDRDKALK-ECNDLRERFGEFT------------AAKEECQRE 601
           ++ L++ +        L++  KA K E   L++R    T             AKEE + E
Sbjct: 347 LKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEE 406

Query: 602 AFHARFEYAMRMGERDSSRKERNE 625
                 +   R+GE     KE  +
Sbjct: 407 ----ISKITARIGELKKEIKELKK 426



 Score = 46.6 bits (111), Expect = 1e-04
 Identities = 52/231 (22%), Positives = 106/231 (45%), Gaps = 34/231 (14%)

Query: 384 KYETIKDEYDALR----------KRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE 433
           +YE +K++   L+          ++ ++L    +    KL+  +EE+A + K+ E++  E
Sbjct: 526 EYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFE 585

Query: 434 KNTALRERNG-LKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMK 492
               L ER   L+      +   D A +E    ++ L K++++ + A +E+        +
Sbjct: 586 SVEELEERLKELEPFYNEYLELKD-AEKELEREEKELKKLEEELDKAFEELA-------E 637

Query: 493 ATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVED 552
             K L+ L +E     ++YS          +E E+L E+  +  R++  L  E +EL + 
Sbjct: 638 TEKRLEELRKELEELEKKYS---------EEEYEELREEYLELSRELAGLRAELEELEKR 688

Query: 553 KKTLSYQIETLKREIAS------ALHDRDKALKECNDLRERFGEFTAAKEE 597
           ++ +   +E LK E+         L   +KAL+   +LRE+  ++ A  +E
Sbjct: 689 REEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKVKKYKALLKE 739



 Score = 38.9 bits (91), Expect = 0.030
 Identities = 48/213 (22%), Positives = 89/213 (41%), Gaps = 38/213 (17%)

Query: 459 LRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSER 518
           ++ R E  E   K  +  E+ +KE            K+L+ +  E N    E   +  E 
Sbjct: 174 IKRRIERLEKFIKRTENIEELIKE----------KEKELEEVLREINEISSELPELREEL 223

Query: 519 DSVHK---EMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRD 575
           + + K   E+E+L E++ +  +++++LE   ++L E  + L  +IE LK+EI        
Sbjct: 224 EKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEI-------- 275

Query: 576 KALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFG 635
                  +L E+  E    KE+ +        EY       +    E  E+E     L  
Sbjct: 276 ------EELEEKVKELKELKEKAE--------EYIKLSEFYEEYLDELREIEKRLSRLEE 321

Query: 636 KCQ--KERMDNLEA-NQEIDRLRKHTDKLQAEL 665
           +    +ER+  LE   + ++ L+K   +L+  L
Sbjct: 322 EINGIEERIKELEEKEERLEELKKKLKELEKRL 354



 Score = 38.5 bits (90), Expect = 0.035
 Identities = 59/393 (15%), Positives = 139/393 (35%), Gaps = 91/393 (23%)

Query: 308 KELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDD 367
           K L     +      +++    E      +  +L    + L++ L EL            
Sbjct: 314 KRLSRLEEEINGIEERIK----ELEEKEERLEELKKKLKELEKRLEELEE---------- 359

Query: 368 AGSADSLNQHYMSALNKYETIKDEYDALRKRFDDL--------INTHSTAVDKLELAQEE 419
                           + +  K+E + L+KR   L        +     A +++E    +
Sbjct: 360 ----------RHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISK 409

Query: 420 VARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRER--NEYQEALAKVQQQHE 477
           +     + +   +E   A+ E    K +C    R+     R+    EY   L +++++ +
Sbjct: 410 ITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELK 469

Query: 478 DA----------VKEINTAMA-----VRMKATKDLKRLTEER---------NAAMQEYSL 513
           +           ++E+   +      +++K   +  +  EE+             +EY  
Sbjct: 470 EIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEK 529

Query: 514 IMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIA----S 569
           +  +   +  E++ L ++L     K++ L+ +  EL +    L  ++  L +E+      
Sbjct: 530 LKEKLIKLKGEIKSLKKEL----EKLEELKKKLAELEKKLDELEEELAELLKELEELGFE 585

Query: 570 ALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAA 629
           ++ + ++ LKE       + E   A++E +R              E    +K   E++ A
Sbjct: 586 SVEELEERLKELEPFYNEYLELKDAEKELER--------------EEKELKKLEEELDKA 631

Query: 630 SMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQ 662
             +L            E  + ++ LRK  ++L+
Sbjct: 632 FEEL-----------AETEKRLEELRKELEELE 653


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 47.1 bits (112), Expect = 8e-05
 Identities = 35/268 (13%), Positives = 85/268 (31%), Gaps = 16/268 (5%)

Query: 402 LINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRE 461
           L+        +     E +   ++  E+  +  + A       ++Q   A+   +  L  
Sbjct: 671 LLEEAEKQAARKASLDERLRDAERALEEAEERHDEAQSALEAWQEQWYDALL--EAGLGG 728

Query: 462 RNEYQEALAKVQQQHE--DAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIM--SE 517
           R     AL  ++      + ++  +          +DL R  EE  A  +  +  M  + 
Sbjct: 729 RASPAGALDALELLQNIKEKLQAADDLRQRIAAMERDLARFEEEVEALAEAVAPEMLGTP 788

Query: 518 RDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKA 577
            D   + +++  +         + L  E +E  ++    +  ++  +  + + L      
Sbjct: 789 ADETARALKQRLKRARDTAAAAEKLAEEIEEAEKEVSEAAAALDEAEARLTALLR--AAR 846

Query: 578 LKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKC 637
                +L        A +          R     R   R         + A +  L    
Sbjct: 847 CTTIEELL------AAVERSDTYRELRKRIAALERTLVRAGGGLSLEALVAEAAALDPDE 900

Query: 638 QKERMDNLEANQEIDRLRKHTDKLQAEL 665
              R++ L   ++I+ L +  ++L  E+
Sbjct: 901 LPARLEEL--ARDIEELEEELNELAQEV 926



 Score = 42.4 bits (100), Expect = 0.002
 Identities = 57/360 (15%), Positives = 123/360 (34%), Gaps = 59/360 (16%)

Query: 293 VRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDREL 352
           +++  E        ++E++      +  +  L+ A +E ++LR +   L  +KQRL+R  
Sbjct: 156 LKELKELEAE----IREVQLKTRTWKDLVKALDEAEEELANLRKELRQLEKEKQRLERL- 210

Query: 353 LELRSGCCTQPESDDAGSADSLN--QHYMSALNKYETIKDEYDALRKRFDDLINTHSTAV 410
                                L   +     L     + D      +R+++       A 
Sbjct: 211 ---------------RRLLPLLAERKALEQQLAALGEVIDLPPDAVERYEEARAELRAAR 255

Query: 411 DKLELAQEEVARIKKQCEDMN-----QEKNTALRERNGLKQQCTAAIRQWDIALRE--RN 463
             LEL  E +  ++ + ++++       +  A+   +  + +   A  + D+  RE    
Sbjct: 256 RNLELLTERLEALQAELDEISLDEELLAQAAAIEALHQQRGEYRNA--EQDLPDREGEIA 313

Query: 464 EYQEALAKVQQQ-----HEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSER 518
             +EA A +  Q      E+AV+ +  ++A +    + +  L + + A  Q         
Sbjct: 314 NAREAAAALLAQIGPDADEEAVESLRPSLAAK----ETVTELEKRKEALDQALKSARDAL 369

Query: 519 DSVHKEMEKLSEDLTQAVRKIKTLEMENK-ELVEDKKTLSYQIETLKREIASALHDRDKA 577
           +   +E++++   L           +            +   +  L++E+A        A
Sbjct: 370 EERERELKQVRAQLAALPTVTVPEPLRAALADALRAGDIDAALAALEQEVA-------VA 422

Query: 578 LKECNDLRERFGEFTAAKEEC---------QREAFHARFEYAMRMGERDSSRKERNEVEA 628
            +E      R G +    EE          Q +AF    E    M  +   R    E+E 
Sbjct: 423 KRELAQALSRLGLWRGDLEELVALPVPSAEQVQAFLKEAE--EIMQAKRQKRDRLLELEE 480



 Score = 34.7 bits (80), Expect = 0.57
 Identities = 62/391 (15%), Positives = 127/391 (32%), Gaps = 34/391 (8%)

Query: 309 ELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDR----ELLELRSGCCTQPE 364
             E  R + +AA   LE   +   +L+++  ++  D++ L +    E L  + G     E
Sbjct: 243 RYEEARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAAAIEALHQQRGEYRNAE 302

Query: 365 SD---DAGSADSLNQHYMSALNKYETIKDEYD--------ALRKRFDDLINTHSTAVDKL 413
            D     G   +  +   + L +     DE          A ++   +L          L
Sbjct: 303 QDLPDREGEIANAREAAAALLAQIGPDADEEAVESLRPSLAAKETVTELEKRKEALDQAL 362

Query: 414 ELAQEEVARIKKQCEDMNQEKNT--ALRERNGLKQQCTAAIRQWDIALRERNEYQEALAK 471
           + A++ +   +++ + +  +      +     L+     A+R  DI        QE    
Sbjct: 363 KSARDALEERERELKQVRAQLAALPTVTVPEPLRAALADALRAGDIDAALAALEQEVAVA 422

Query: 472 VQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSED 531
            +++   A+  +        +         E+  A ++E   IM  +      + +L ED
Sbjct: 423 -KRELAQALSRLGLWRGDLEELVALPVPSAEQVQAFLKEAEEIMQAKRQKRDRLLELEED 481

Query: 532 LTQAVRKIKTL----EMENKELVEDKKTLSYQIETLKREIASALHDR----DKALKECND 583
           L     +++ L     +  +E V   +     +    R   + L       ++A++  + 
Sbjct: 482 LETLELQLRHLDAAGAVPTEEEVAAARARRDALWQDIRRGYALLEPTASAYEEAVESADQ 541

Query: 584 LRER-FGEFTAAKE----ECQREAFHARFEYA---MRMGERDSSRKERNEVEAASMDLFG 635
           L +R   E     E      Q EA   R E     + + E   +          +     
Sbjct: 542 LADRLLREAQLVGELQSLRQQEEAARRRLEQLEKELEVLELALAALREAWQAQWAAAGLP 601

Query: 636 KCQKERMDNLEANQEIDRLRKHTDKLQAELQ 666
               E  D L          +   K +AEL 
Sbjct: 602 LTPAEMEDWLAERATAREQVRAYFKARAELD 632



 Score = 31.7 bits (72), Expect = 4.4
 Identities = 33/168 (19%), Positives = 67/168 (39%), Gaps = 10/168 (5%)

Query: 365 SDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLIN-THSTAVDKLELAQEEVARI 423
            D A +A+ L +    A  +        D    R   L+     T +++L  A E     
Sbjct: 804 RDTAAAAEKLAEEIEEAEKEVSEAAAALDEAEARLTALLRAARCTTIEELLAAVERSDTY 863

Query: 424 KKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEI 483
           ++  + +   + T +R   GL  +   A    + A  + +E    L ++ +  E+  +E+
Sbjct: 864 RELRKRIAALERTLVRAGGGLSLEALVA----EAAALDPDELPARLEELARDIEELEEEL 919

Query: 484 NTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSED 531
           N        A ++L R+     AA      + +ER+S+  ++  L+E 
Sbjct: 920 NELAQEVGAAKQELARMDGGSTAA-----ELEAERESLLAQLRDLAER 962


>gnl|CDD|222290 pfam13654, AAA_32, AAA domain.  This family includes a wide variety
           of AAA domains including some that have lost essential
           nucleotide binding residues in the P-loop.
          Length = 509

 Score = 46.3 bits (111), Expect = 1e-04
 Identities = 33/153 (21%), Positives = 66/153 (43%), Gaps = 38/153 (24%)

Query: 461 ERNEYQEALAK----VQQQHEDAVKEIN------------TAMAVRMKATKDLKRLTEER 504
           E  EY+    +     Q++ E+A +E+             T         KD + LTEE 
Sbjct: 109 ESEEYEARKEEIEEEFQEKREEAFEELEEEAKEKGFALVRTPGGFVFAPLKDGEPLTEEE 168

Query: 505 NAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLK 564
             A+ E      ER+ + +++++L E+L + +R+++ LE E +E          ++  L 
Sbjct: 169 FEALPE-----EEREELEEKIDELEEELQEILRQLRELEREARE----------KLRELD 213

Query: 565 REIASALHDRDKALKEC-NDLRERFGEFTAAKE 596
           RE+A        A+    ++L+E++ +      
Sbjct: 214 REVALF------AVGPLLDELKEKYADLPKVLA 240


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 45.0 bits (107), Expect = 1e-04
 Identities = 25/180 (13%), Positives = 62/180 (34%), Gaps = 22/180 (12%)

Query: 481 KEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIK 540
            E +       +  K LK+   E  A  +    +  E + +  ++ +L  ++ +   +IK
Sbjct: 17  LEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIK 76

Query: 541 TLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQR 600
             E +   + +++     ++  L  EI  A    +    E  +L E   +     E+   
Sbjct: 77  RAEEKLSAVKDER-----ELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIED--- 128

Query: 601 EAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDK 660
                          ++   +    +  A   L  +  + R +  E + + + L++  D 
Sbjct: 129 --------------LKERLERLEKNLAEAEARLEEEVAEIREEGQELSSKREELKEKLDP 174



 Score = 43.5 bits (103), Expect = 4e-04
 Identities = 36/170 (21%), Positives = 69/170 (40%), Gaps = 15/170 (8%)

Query: 460 RERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEY-----SLI 514
               E ++AL K + + E   K +  A+ + ++  ++     E     ++E        +
Sbjct: 24  PRIKEIRKALKKAKAELEALNKAL-EALEIELEDLENQVSQLESEIQEIRERIKRAEEKL 82

Query: 515 MSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDR 574
            + +D   +E+  L+ ++  A  +I +LE E  EL+E+ + L  +IE LK  +       
Sbjct: 83  SAVKDE--RELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLE------ 134

Query: 575 DKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERN 624
            +  K   +   R  E  A   E  +E    R E   ++     S  ER 
Sbjct: 135 -RLEKNLAEAEARLEEEVAEIREEGQELSSKREELKEKLDPELLSEYERI 183


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 46.2 bits (109), Expect = 1e-04
 Identities = 45/216 (20%), Positives = 85/216 (39%), Gaps = 4/216 (1%)

Query: 452 IRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEY 511
           + Q D    +    +  L   Q +   A  E   A +   KA ++ + + +E  AA Q  
Sbjct: 73  VFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNL 132

Query: 512 SLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASAL 571
           +    E   + K+ + L   L     + + LE + + L   +K L      LK ++    
Sbjct: 133 AKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLK 192

Query: 572 HDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASM 631
               +  +E  +L  R     A  EE  R A  A  + A  + +RD+   ++ +  AA  
Sbjct: 193 LRSAQIEQEAQNLATRANAAQARTEELARRA-AAAQQTAQAIQQRDAQISQKAQQIAARA 251

Query: 632 DLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQD 667
           +   + ++ ++  LE      RL +   +L+A  Q 
Sbjct: 252 EQIRE-RERQLQRLE--TAQARLEQEVAQLEAYYQA 284



 Score = 43.1 bits (101), Expect = 0.001
 Identities = 19/138 (13%), Positives = 51/138 (36%)

Query: 412 KLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAK 471
           +L   + E+   + +      E+  A  E    +Q+  A  ++   A +   + Q+ LA+
Sbjct: 82  QLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELAR 141

Query: 472 VQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSED 531
           + +Q +D    + T    R +     + L   +       + + S+   +     ++ ++
Sbjct: 142 LTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQE 201

Query: 532 LTQAVRKIKTLEMENKEL 549
                 +    +   +EL
Sbjct: 202 AQNLATRANAAQARTEEL 219



 Score = 38.9 bits (90), Expect = 0.024
 Identities = 36/238 (15%), Positives = 79/238 (33%), Gaps = 41/238 (17%)

Query: 298 ETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRS 357
             V + D    +L   R +   A  +  AA  E  + RS+       K R +RE +    
Sbjct: 71  SGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSE-----LQKARQEREAVRQEL 125

Query: 358 GCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQ 417
                          +  Q+   A  +   +  +   L+ R                   
Sbjct: 126 --------------AAARQNLAKAQQELARLTKQAQDLQTR------------------- 152

Query: 418 EEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHE 477
             +  + +Q   +  +  +    +  L Q     ++   + L+ R+   E  A+      
Sbjct: 153 --LKTLAEQRRQLEAQAQSLQASQKQL-QASATQLKSQVLDLKLRSAQIEQEAQNLATRA 209

Query: 478 DAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQA 535
           +A +     +A R  A +   +  ++R+A + + +  ++ R    +E E+  + L  A
Sbjct: 210 NAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETA 267


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 45.1 bits (107), Expect = 2e-04
 Identities = 52/274 (18%), Positives = 105/274 (38%), Gaps = 43/274 (15%)

Query: 327 AAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYE 386
             +E+S L  K  +L +      REL E                A  L +       + +
Sbjct: 32  LRKEASELAEKRDELNAKV----RELREK---------------AQELREERDEINEEVQ 72

Query: 387 TIKDEYDALRKRFDDL---INTHSTAVDKLELAQEEVARIKK---QCEDMNQEKNTALRE 440
            +K++ D +  +  +L           ++  L    +  +++   + E   Q       E
Sbjct: 73  ELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTPEE 132

Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRL 500
              L Q+              R E ++A  K  +++E   +       ++ KA +  +++
Sbjct: 133 ERELVQKIK----------ELRKELEDA-KKALEENEKLKELKAEIDELKKKAREIHEKI 181

Query: 501 TEERNAAMQEYSLIM----SERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTL 556
            E  N A QEY   M     E D + KE ++L E+  +  +KI  L  E + L  + + L
Sbjct: 182 QELANEA-QEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELREL 240

Query: 557 SYQIETLKREIASA--LHDRDKALKECNDLRERF 588
             +I+ L+ +  +A     R++  +   ++ E+F
Sbjct: 241 EKKIKALRAKEKAAKRREKREELKERAEEIYEKF 274



 Score = 42.0 bits (99), Expect = 0.002
 Identities = 47/269 (17%), Positives = 115/269 (42%), Gaps = 36/269 (13%)

Query: 380 SALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALR 439
               K + +K+E + L+++ D+L    S   +K +    +V  ++++ +++ +E++    
Sbjct: 10  ELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINE 69

Query: 440 ERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVR---MKATKD 496
           E   LK+              +R+E    L ++++++ +  ++ N             ++
Sbjct: 70  EVQELKE--------------KRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLERE 115

Query: 497 LKRL----------TEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMEN 546
           ++RL           EE    +Q+   +  +     K+  + +E L +   +I  L+ + 
Sbjct: 116 IERLEKKQQTSVLTPEEERELVQKIKEL-RKELEDAKKALEENEKLKELKAEIDELKKKA 174

Query: 547 KELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHAR 606
           +E+ E  + L+ + +    E+     + D+  KE ++L E F E +   +E   E  + +
Sbjct: 175 REIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQ 234

Query: 607 FEY--------AMRMGERDSSRKERNEVE 627
            E         A+R  E+ + R+E+ E  
Sbjct: 235 NELRELEKKIKALRAKEKAAKRREKREEL 263



 Score = 41.6 bits (98), Expect = 0.003
 Identities = 68/327 (20%), Positives = 134/327 (40%), Gaps = 72/327 (22%)

Query: 411 DKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALA 470
            K +  +EE+  +K++ +++ +E +    +R+ L  +      +      ER+E  E + 
Sbjct: 13  LKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEEV- 71

Query: 471 KVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSE 530
              Q+ ++   EIN  +    K  ++LK    E N                 + ++ L  
Sbjct: 72  ---QELKEKRDEINAKLQELRKEYRELKEKRNEFN--------------LGGRSIKSLER 114

Query: 531 DLTQAVRKIKTLEM---ENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRER 587
           ++ +  +K +T  +   E +ELV+       +I+ L++E    L D  KAL+E   L+E 
Sbjct: 115 EIERLEKKQQTSVLTPEEERELVQ-------KIKELRKE----LEDAKKALEENEKLKEL 163

Query: 588 FGEFTAAKEECQREAFHARF--------EYAMRM----GERDSSRKERNEVEAASMDLFG 635
             E    K++      H +         EY   M     E D  RKE +E+         
Sbjct: 164 KAEIDELKKK--AREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHE------- 214

Query: 636 KCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSVNYLFILTESIQEAEVSKRRRDWAFS 695
                  + +E +++ID L +    LQ EL++     L    ++++  E + +RR+    
Sbjct: 215 -------EFVELSKKIDELHEEFRNLQNELRE-----LEKKIKALRAKEKAAKRREKREE 262

Query: 696 ERDKIEVIDSYHTKHAGRHKRGKELDK 722
            +++ E I         + KRG++L  
Sbjct: 263 LKERAEEIYE-------KFKRGEKLTT 282



 Score = 36.2 bits (84), Expect = 0.10
 Identities = 50/236 (21%), Positives = 99/236 (41%), Gaps = 19/236 (8%)

Query: 494 TKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDK 553
              L  L  +R    +E   +  +RD + KE  +L+E   +   K++ L  + +EL E++
Sbjct: 5   LDKLDELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREER 64

Query: 554 KTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAA---KEECQREAFHARFEYA 610
             ++ +++ LK +         +  KE  +L+E+  EF       +  +RE    R E  
Sbjct: 65  DEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREI--ERLEK- 121

Query: 611 MRMGERDSSRKERNEVEAASMDLFGKCQKERMD---NLEANQEIDRLRKHTDKLQAELQD 667
            +     +  +ER  V+        + +KE  D    LE N+++  L+   D+L+ + ++
Sbjct: 122 KQQTSVLTPEEERELVQKIK-----ELRKELEDAKKALEENEKLKELKAEIDELKKKARE 176

Query: 668 QSVNYLFILTESIQEAEVSKRRRDWAFSERDKI-EVIDSYHTKHAGRHKRGKELDK 722
                   + E   EA+         F E D++ +  D  H +     K+  EL +
Sbjct: 177 ----IHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHE 228


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 44.9 bits (106), Expect = 4e-04
 Identities = 59/322 (18%), Positives = 134/322 (41%), Gaps = 36/322 (11%)

Query: 362 QPESDDAGSAD-SLNQHYMSALNKYETIKDEYDALRKRFDDL---INTHSTAVDKLELAQ 417
           Q E D   S D +  +  +  + +  +++  YD L+   D L   I+      +KL+ + 
Sbjct: 137 QGEMDSLISGDPAQRKKILDEILEINSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSN 196

Query: 418 EEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHE 477
            E+  IKKQ  D  +  +  L+E   L  +   A+  ++      NE   +L  ++ ++E
Sbjct: 197 LELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELS-SLEDMKNRYE 255

Query: 478 DAVKEINTAMAVRMKATKDLKRLTEERNA-----------AMQEYSLIMSERDSVHKEME 526
             +K   + +++ ++     K L E                + +Y    ++ ++  + + 
Sbjct: 256 SEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILS 315

Query: 527 KLSEDLTQ---AVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDK---ALKE 580
            +  ++ +    ++K+  L+ +  + ++ K     + + L  +I             LK 
Sbjct: 316 NIDAEINKYHAIIKKLSVLQKDYNDYIKKKS----RYDDLNNQILELEGYEMDYNSYLKS 371

Query: 581 CNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKE 640
              L+++  E++  K   +  AF +     ++  + D+ +KE NE+     D+  K    
Sbjct: 372 IESLKKKIEEYS--KNIERMSAFISEI-LKIQEIDPDAIKKELNEINVKLQDISSK---- 424

Query: 641 RMDNLEANQEIDRLRKHTDKLQ 662
            + +L  NQ I  LR++ D+L 
Sbjct: 425 -VSSL--NQRIRALRENLDELS 443



 Score = 39.9 bits (93), Expect = 0.015
 Identities = 63/366 (17%), Positives = 128/366 (34%), Gaps = 60/366 (16%)

Query: 383 NKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCED--MNQEKNTALRE 440
           N Y   K  YD L  +  +L          L+     +  +KK+ E+   N E+ +A   
Sbjct: 339 NDYIKKKSRYDDLNNQILELEGYEMDYNSYLK----SIESLKKKIEEYSKNIERMSAFIS 394

Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRL 500
                Q+        D   +E NE    L    Q     V  +N     R++A ++    
Sbjct: 395 EILKIQEIDP-----DAIKKELNEINVKL----QDISSKVSSLN----QRIRALRE-NLD 440

Query: 501 TEERNAAMQEYSLIMS------ERDSVHKEMEKLSEDLTQAVRKIKTLEME----NKELV 550
              RN  M     +          +  +  +   +E  ++   KI+ +E+E    ++++V
Sbjct: 441 ELSRNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIV 500

Query: 551 EDKKTLSY-------QIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQR--- 600
           + KK   Y       +      +I SA  D +    + N+L+++  ++   K   +    
Sbjct: 501 DLKKRKEYLESEEINKSINEYNKIESARADLEDIKIKINELKDKHDKYEEIKNRYKSLKL 560

Query: 601 EAFHARF-EYAMRMGERDS-----SRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRL 654
           E   ++   +   +          +R   NE++    DL     + R+  +E        
Sbjct: 561 EDLDSKRTSWLNALAVISLIDIETNRSRSNEIKKQLNDL-----ESRLQEIEIG--FPDD 613

Query: 655 RKHTDKLQAELQDQSVNYLFILTESIQEAEVSKRRRDWAFSERDKIEVIDSYHTKHAGRH 714
           + + DK   E+++++ N          E + +K   +     R KI+       +     
Sbjct: 614 KSYIDKSIREIENEANNL----NNKYNEIQENKILIE---KLRGKIDNYKKQIAEIDSII 666

Query: 715 KRGKEL 720
              KE+
Sbjct: 667 PDLKEI 672


>gnl|CDD|212795 cd11861, SH3_DLG-like, Src Homology 3 domain of Disks large homolog
            proteins.  The DLG-like proteins are scaffolding proteins
            that cluster at synapses and are also called PSD
            (postsynaptic density)-95 proteins or SAPs
            (synapse-associated proteins). They play important roles
            in synaptic development and plasticity, cell polarity,
            migration and proliferation. They are members of the
            MAGUK (membrane-associated guanylate kinase) protein
            family, which is characterized by the presence of a core
            of three domains: PDZ, SH3, and guanylate kinase (GuK).
            The GuK domain in MAGUK proteins is enzymatically
            inactive; instead, the domain mediates protein-protein
            interactions and associates intramolecularly with the SH3
            domain. DLG-like proteins contain three PDZ domains and
            varying N-terminal regions. All DLG proteins exist as
            alternatively-spliced isoforms. Vertebrates contain four
            DLG proteins from different genes, called DLG1-4. DLG4
            and DLG2 are found predominantly at postsynaptic sites
            and they mediate surface ion channel and receptor
            clustering. DLG3 is found axons and some presynaptic
            terminals. DLG1 interacts with AMPA-type glutamate
            receptors and is critical in their maturation and
            delivery to synapses. The SH3 domain of DLG4 binds and
            clusters the kainate subgroup of glutamate receptors via
            two proline-rich sequences in their C-terminal tail. It
            also binds AKAP79/150 (A-kinase anchoring protein). SH3
            domains are protein interaction domains that bind to
            proline-rich ligands with moderate affinity and
            selectivity, preferentially to PxxP motifs. They play
            versatile and diverse roles in the cell including the
            regulation of enzymes, changing the subcellular
            localization of signaling pathway components, and
            mediating the formation of multiprotein complex
            assemblies.
          Length = 61

 Score = 39.6 bits (93), Expect = 4e-04
 Identities = 22/63 (34%), Positives = 29/63 (46%), Gaps = 6/63 (9%)

Query: 1334 YMRALFSRTGDLGDAL---QLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGII 1390
            Y+RALF         L    L F   DIL+V N         W+A  V  +G  ++ G+I
Sbjct: 1    YVRALFDYDPSRDSGLPSQGLSFKFGDILHVTNA---SDDEWWQARRVTPNGEEEEVGVI 57

Query: 1391 PSK 1393
            PSK
Sbjct: 58   PSK 60


>gnl|CDD|214620 smart00326, SH3, Src homology 3 domains.  Src homology 3 (SH3)
            domains bind to target proteins through sequences
            containing proline and hydrophobic amino acids.
            Pro-containing polypeptides may bind to SH3 domains in 2
            different binding orientations.
          Length = 56

 Score = 39.4 bits (93), Expect = 4e-04
 Identities = 18/64 (28%), Positives = 25/64 (39%), Gaps = 10/64 (15%)

Query: 1331 DSFYMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGII 1390
            +   +RAL+  T    D  +L F K DI+ V           W  W       R + G+ 
Sbjct: 1    EGPQVRALYDYTAQDPD--ELSFKKGDIITVLEK----SDDGW--WKGRLG--RGKEGLF 50

Query: 1391 PSKY 1394
            PS Y
Sbjct: 51   PSNY 54


>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
           Provisional.
          Length = 569

 Score = 44.4 bits (106), Expect = 5e-04
 Identities = 76/356 (21%), Positives = 137/356 (38%), Gaps = 78/356 (21%)

Query: 384 KYETIKDEYDALRKRFDDLINTHS--TAVDKLELAQEEVARIKKQCEDMNQEKN-----T 436
           + E +KD Y  LRK    L N  S   A+D+LE   +++  ++++     +         
Sbjct: 141 EVEQLKDLYRELRKSL--LANRFSFGPALDELE---KQLENLEEEFSQFVELTESGDYVE 195

Query: 437 ALRERNGLKQQCTAAIRQWDIA-----LRE-RNEYQEALAKVQQQHEDAV---------- 480
           A    + L+++  AA+ Q  +      L+E + E  + L +++  + + V          
Sbjct: 196 AREILDQLEEE-LAALEQ-IMEEIPELLKELQTELPDQLQELKAGYRELVEEGYHLDHLD 253

Query: 481 --KEINTAMAVRMKATKDLKRL----TEERNAAMQE-----YSLIMSE---RDSVHKEME 526
             KEI        +    L+ L     EE+N  +QE     Y ++  E   R  V K  +
Sbjct: 254 IEKEIQDLKEQIDENLALLEELDLDEAEEKNEEIQERIDQLYDILEREVKARKYVEKNSD 313

Query: 527 KLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRE 586
            L + L  A  + K L+ E   + +     SY +        S L    +  K+   L +
Sbjct: 314 TLPDFLEHAKEQNKELKEEIDRVKQ-----SYTLN------ESELESVRQLEKQLESLEK 362

Query: 587 RFGEFTAAKEECQREAF---HARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMD 643
           ++ E T    E Q  A+       E  +         K+  E+E     L    Q  R D
Sbjct: 363 QYDEITERIAE-QEIAYSELQEELEEIL---------KQLEEIEKEQEKLSEMLQGLRKD 412

Query: 644 NLEANQEIDRLRK--HTDKLQAE------LQDQSVNYLFILTESIQE--AEVSKRR 689
            LEA ++++R R   H  K   E      L +  +   F +++ I+    E+ ++ 
Sbjct: 413 ELEAREKLERYRNKLHEIKRYLEKSNLPGLPEDYLEMFFEVSDEIEALAEELEEKP 468



 Score = 30.6 bits (70), Expect = 8.4
 Identities = 53/337 (15%), Positives = 115/337 (34%), Gaps = 88/337 (26%)

Query: 391 EYDALRKRFDDLINTHSTAVDK-LELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCT 449
           +++  R+++D+++      +++ L  A+E   + +              + ++ + +   
Sbjct: 65  KFEEWRQKWDEIVTNSLPDIEEQLFEAEELNDKFR------------FRKAKHEINE--- 109

Query: 450 AAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERN--AA 507
                 D+   +  +  E L ++ +  E   +E+     +  +  K L  L    +   A
Sbjct: 110 -IESLLDLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSL--LANRFSFGPA 166

Query: 508 MQEYSLIMSERDSVHKEMEKLSE--DLTQAVRKIKTLEMENKEL-----------VEDKK 554
           + E    +   +    +  +L+E  D  +A   +  LE E   L            E + 
Sbjct: 167 LDELEKQLENLEEEFSQFVELTESGDYVEAREILDQLEEELAALEQIMEEIPELLKELQT 226

Query: 555 TLSYQIETLK---RE------------IASALHDRDKALKECNDLRERFGEFTAAKEECQ 599
            L  Q++ LK   RE            I   + D  + + E   L E   +   A+E+ +
Sbjct: 227 ELPDQLQELKAGYRELVEEGYHLDHLDIEKEIQDLKEQIDENLALLEEL-DLDEAEEKNE 285

Query: 600 R-----EAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRL 654
                 +  +   E  +         K R  VE            +  D L     ++  
Sbjct: 286 EIQERIDQLYDILEREV---------KARKYVE------------KNSDTLP--DFLEHA 322

Query: 655 RKHTDKLQAELQ--DQSVNYLFILTESIQEAEVSKRR 689
           ++   +L+ E+    QS    + L ES    E+   R
Sbjct: 323 KEQNKELKEEIDRVKQS----YTLNES----ELESVR 351


>gnl|CDD|238488 cd00988, PDZ_CTP_protease, PDZ domain of C-terminal processing-,
            tail-specific-, and tricorn proteases, which function in
            posttranslational protein processing, maturation, and
            disassembly or degradation, in Bacteria, Archaea, and
            plant chloroplasts. May be responsible for substrate
            recognition and/or binding, as most PDZ domains bind
            C-terminal polypeptides, and binding to internal
            (non-C-terminal) polypeptides and even to lipids has been
            demonstrated. In this subfamily of protease-associated
            PDZ domains a C-terminal beta-strand forms the
            peptide-binding groove base, a circular permutation with
            respect to PDZ domains found in Eumetazoan signaling
            proteins.
          Length = 85

 Score = 39.9 bits (94), Expect = 6e-04
 Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 1/54 (1%)

Query: 1261 GIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELA-KPADKVTV 1313
            G+ + SV  GS    AG++ GD I+  +G  +   + E+    L  K   KV +
Sbjct: 14   GLVITSVLPGSPAAKAGIKAGDIIVAIDGEPVDGLSLEDVVKLLRGKAGTKVRL 67



 Score = 35.7 bits (83), Expect = 0.020
 Identities = 14/68 (20%), Positives = 39/68 (57%), Gaps = 2/68 (2%)

Query: 1141 LGIQIQCLDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRSATYQLAASVLR-QCG 1199
            +G++++  D GG+ +++V   S A++ G++ GD ++ + G  +   + +    +LR + G
Sbjct: 4    IGLELKY-DDGGLVITSVLPGSPAAKAGIKAGDIIVAIDGEPVDGLSLEDVVKLLRGKAG 62

Query: 1200 NSITMLVQ 1207
              + + ++
Sbjct: 63   TKVRLTLK 70



 Score = 34.1 bits (79), Expect = 0.067
 Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 4/48 (8%)

Query: 202 GLTL---ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSA 246
           GL L   + GL I  + PGS AAK G +  GD +++I+   VDGLS  
Sbjct: 5   GLELKYDDGGLVITSVLPGSPAAKAG-IKAGDIIVAIDGEPVDGLSLE 51



 Score = 34.1 bits (79), Expect = 0.067
 Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 4/48 (8%)

Query: 826 GLTL---ESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSA 870
           GL L   + GL I  + PGS AAK G +  GD +++I+   VDGLS  
Sbjct: 5   GLELKYDDGGLVITSVLPGSPAAKAG-IKAGDIIVAIDGEPVDGLSLE 51


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 41.5 bits (98), Expect = 0.001
 Identities = 22/104 (21%), Positives = 46/104 (44%), Gaps = 18/104 (17%)

Query: 467 EALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEME 526
           E L ++QQ+  +  +E+        +  + L  L +E     QE S + +E + + KE+ 
Sbjct: 66  ERLPELQQELAELQEELA-------ELQEQLAELQQENQELKQELSTLEAELERLQKELA 118

Query: 527 KLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASA 570
           ++ +    A+           EL E+ + L  ++  LK+E  + 
Sbjct: 119 RIKQLSANAI-----------ELDEENRELREELAELKQENEAL 151



 Score = 32.7 bits (75), Expect = 1.1
 Identities = 21/86 (24%), Positives = 42/86 (48%), Gaps = 2/86 (2%)

Query: 383 NKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQ--EKNTALRE 440
            +   ++++   L++   +L    ST   +LE  Q+E+ARIK+   +  +  E+N  LRE
Sbjct: 80  EELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRELRE 139

Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQ 466
                +Q   A+   +  L+E  + +
Sbjct: 140 ELAELKQENEALEAENERLQENEQRR 165



 Score = 32.7 bits (75), Expect = 1.1
 Identities = 26/139 (18%), Positives = 58/139 (41%), Gaps = 39/139 (28%)

Query: 416 AQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQ 475
           A+E +  ++++  ++ QE+   L+E+    QQ             E  E ++ L+ ++ +
Sbjct: 64  ARERLPELQQELAEL-QEELAELQEQLAELQQ-------------ENQELKQELSTLEAE 109

Query: 476 HEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQA 535
            E   KE+             +K+L+       +E                +L E+L + 
Sbjct: 110 LERLQKELAR-----------IKQLSANAIELDEEN--------------RELREELAEL 144

Query: 536 VRKIKTLEMENKELVEDKK 554
            ++ + LE EN+ L E+++
Sbjct: 145 KQENEALEAENERLQENEQ 163



 Score = 30.0 bits (68), Expect = 7.2
 Identities = 15/94 (15%), Positives = 39/94 (41%), Gaps = 4/94 (4%)

Query: 412 KLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAK 471
           +L   QEE+A +++Q  ++ QE     +E + L+ +    + +    L    +      +
Sbjct: 74  ELAELQEELAELQEQLAELQQENQELKQELSTLEAE----LERLQKELARIKQLSANAIE 129

Query: 472 VQQQHEDAVKEINTAMAVRMKATKDLKRLTEERN 505
           + +++ +  +E+            + +RL E   
Sbjct: 130 LDEENRELREELAELKQENEALEAENERLQENEQ 163


>gnl|CDD|212690 cd00174, SH3, Src Homology 3 domain superfamily.  Src Homology 3
            (SH3) domains are protein interaction domains that bind
            proline-rich ligands with moderate affinity and
            selectivity, preferentially to PxxP motifs. Thus, they
            are referred to as proline-recognition domains (PRDs).
            SH3 domains are less selective and show more diverse
            specificity compared to other PRDs. They have been shown
            to bind peptide sequences that lack the PxxP motif;
            examples include the PxxDY motif of Eps8 and the RKxxYxxY
            sequence in SKAP55. SH3 domain containing proteins play
            versatile and diverse roles in the cell, including the
            regulation of enzymes, changing the subcellular
            localization of signaling pathway components, and
            mediating the formation of multiprotein complex
            assemblies, among others. Many members of this
            superfamily are adaptor proteins that associate with a
            number of protein partners, facilitating complex
            formation and signal transduction.
          Length = 51

 Score = 37.4 bits (88), Expect = 0.002
 Identities = 18/61 (29%), Positives = 26/61 (42%), Gaps = 10/61 (16%)

Query: 1334 YMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGIIPSK 1393
            Y RAL+       D  +L F K DI+ V           W  W  + +G R+  G+ P+ 
Sbjct: 1    YARALYDYEAQDDD--ELSFKKGDIITVLEK----DDDGW--WEGELNGGRE--GLFPAN 50

Query: 1394 Y 1394
            Y
Sbjct: 51   Y 51


>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein. 
          Length = 312

 Score = 41.8 bits (99), Expect = 0.002
 Identities = 56/263 (21%), Positives = 108/263 (41%), Gaps = 37/263 (14%)

Query: 309 ELEYYRGQHQAAMAQLEAA-AQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDD 367
           ++E  R +  A  ++L +   QE   LR +  +L +++ RL  E+  LR           
Sbjct: 33  KIEELRQKKSAEPSRLYSLYEQEIRELRKQLDELTNERARLQLEIDNLR----------- 81

Query: 368 AGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQC 427
             +A+   + Y   LN  ++ +++   LRK  D+          K+E  QEE+A +KK  
Sbjct: 82  -LAAEDFREKYEDELNLRQSAENDIVGLRKDLDEATLARVDLEMKIESLQEELAFLKKNH 140

Query: 428 ED-----MNQEKNTALRE-RNGLKQQCTAAIR----QWD-IALRERNEYQEALAKVQQQH 476
           E+      +Q +     E     K   T A+     Q++ +A + R   QEA    + + 
Sbjct: 141 EEEVRELQSQIQGQVNVEMDAARKLDLTKALAEIRAQYEELAEKNR---QEAEEWYKSKL 197

Query: 477 EDAVKEINT-AMAVRMKAT--KDLKRLTEERNAAMQEYSLIMSERDSVHKEM----EKLS 529
           E+  +       A+R       +L+R  +       E   +  ++ S+ +++    E+  
Sbjct: 198 EELQQAAARNGDALRSAKEEITELRRQIQSLEI---ELQSLKKQKASLERQLAELEERYE 254

Query: 530 EDLTQAVRKIKTLEMENKELVED 552
            +L      I  LE E ++L  +
Sbjct: 255 LELADYQDTISELEEELQQLKAE 277


>gnl|CDD|212964 cd12031, SH3_DLG1, Src Homology 3 domain of Disks Large homolog 1.
            DLG1, also called synapse-associated protein 97 (SAP97),
            is a scaffolding protein that clusters at synapses and
            plays an important role in synaptic development and
            plasticity. DLG1 plays roles in regulating cell polarity,
            proliferation, migration, and cycle progression. It
            interacts with AMPA-type glutamate receptors and is
            critical in their maturation and delivery to synapses. It
            also interacts with PKCalpha and promotes wound healing.
            DLG1 is a member of the MAGUK (membrane-associated
            guanylate kinase) protein family, which is characterized
            by the presence of a core of three domains: PDZ, SH3, and
            guanylate kinase (GuK). The GuK domain in MAGUK proteins
            is enzymatically inactive; instead, the domain mediates
            protein-protein interactions and associates
            intramolecularly with the SH3 domain. DLG1 contains three
            PDZ domains. SH3 domains are protein interaction domains
            that bind to proline-rich ligands with moderate affinity
            and selectivity, preferentially to PxxP motifs. They play
            versatile and diverse roles in the cell including the
            regulation of enzymes, changing the subcellular
            localization of signaling pathway components, and
            mediating the formation of multiprotein complex
            assemblies.
          Length = 67

 Score = 38.1 bits (88), Expect = 0.002
 Identities = 30/69 (43%), Positives = 39/69 (56%), Gaps = 6/69 (8%)

Query: 1332 SFYMRALFS--RTGDLGDALQ-LRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCG 1388
            S Y+RALF   +T D G   Q L F   DIL+V N   +     W+A  V ADG  ++ G
Sbjct: 2    SLYVRALFDYDKTKDSGLPSQGLNFKFGDILHVVNASDDE---WWQARQVTADGESEEIG 58

Query: 1389 IIPSKYKVE 1397
            +IPSK +VE
Sbjct: 59   VIPSKRRVE 67


>gnl|CDD|212959 cd12026, SH3_ZO-1, Src homology 3 domain of the Tight junction
            protein, Zonula occludens protein 1.  ZO-1 is a
            scaffolding protein that associates with other ZO
            proteins and other proteins of the tight junction, zonula
            adherens, and gap junctions. ZO proteins play roles in
            regulating cytoskeletal dynamics at these cell junctions.
            ZO-1 plays an essential role in embryonic development. It
            regulates the assembly and dynamics of the cortical
            cytoskeleton at cell-cell junctions. It is considered a
            member of the MAGUK (membrane-associated guanylate
            kinase) protein family, which is characterized by the
            presence of a core of three domains: PDZ, SH3, and
            guanylate kinase (GuK). The GuK domain in MAGUK proteins
            is enzymatically inactive; instead, the domain mediates
            protein-protein interactions and associates
            intramolecularly with the SH3 domain. The C-terminal
            region of ZO-1 is the largest of the three ZO proteins
            and contains an actin-binding region and domains of
            unknown function designated alpha and ZU5. The SH3 domain
            of ZO-1 has been shown to bind ZONAB, ZAK, afadin, and
            Galpha12. SH3 domains are protein interaction domains
            that bind to proline-rich ligands with moderate affinity
            and selectivity, preferentially to PxxP motifs. They play
            versatile and diverse roles in the cell including the
            regulation of enzymes, changing the subcellular
            localization of signaling pathway components, and
            mediating the formation of multiprotein complex
            assemblies.
          Length = 65

 Score = 37.8 bits (87), Expect = 0.003
 Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 2/67 (2%)

Query: 1331 DSFYMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGII 1390
            DSFY+R  F    +      L F+K ++  V +T++NG  G W A  +  +    + GII
Sbjct: 1    DSFYIRTHFEYEKE--SPYGLSFNKGEVFRVVDTLYNGKLGSWLAIRIGKNHKEVERGII 58

Query: 1391 PSKYKVE 1397
            P+K + E
Sbjct: 59   PNKNRAE 65


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 41.6 bits (98), Expect = 0.004
 Identities = 74/343 (21%), Positives = 139/343 (40%), Gaps = 71/343 (20%)

Query: 345 KQRLDRELLELRSGCCTQPESDDAGSA-DSLNQHYMSALNKYETIKDEYDA-LRKR---F 399
           ++RL+ E  EL++        ++A +A +      + A  +   I+ E +  L ++   F
Sbjct: 463 RRRLEAEKDELQAAL------EEAEAALELEESKVLRAQVELSQIRSEIERRLAEKEEEF 516

Query: 400 DDLINTHSTAVDKLELAQE-------EVARIKKQCE-DMNQEKNTALRERNGLKQQCTAA 451
           ++    H  A++ L+   E       E +R+KK+ E D+N+           L+     A
Sbjct: 517 ENTRKNHQRAIESLQATLEAEAKGKAEASRLKKKLEGDINE-----------LEIALDHA 565

Query: 452 IRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEY 511
            +    A +   +YQ+ + ++Q Q E+  +          +  ++   + E R  A++  
Sbjct: 566 NKANAEAQKNVKKYQQQVKELQTQVEEEQR--------AREDAREQLAVAERRATALEA- 616

Query: 512 SLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETL---KR--- 565
                       E+E+L   L QA R  K  E E  E  E    L+ Q  +L   KR   
Sbjct: 617 ------------ELEELRSALEQAERARKQAETELAEASERVNELTAQNSSLIAQKRKLE 664

Query: 566 -EIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERN 624
            E+A+   D D+A+ E     ER        ++ Q +A     E      E++ S  +  
Sbjct: 665 GELAALQSDLDEAVNELKAAEERA-------KKAQADAARLAEELRQ---EQEHS--QHL 712

Query: 625 EVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQD 667
           E     ++   K  + R+D  EA   +   +K   KL+A +++
Sbjct: 713 ERLRKQLESQVKELQVRLDEAEAAA-LKGGKKMIQKLEARVRE 754



 Score = 37.7 bits (88), Expect = 0.058
 Identities = 76/395 (19%), Positives = 139/395 (35%), Gaps = 85/395 (21%)

Query: 322 AQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCC---TQPESDDAGSADSLNQHY 378
           AQL     +  SLR +  +    K  L+R+L +  +      ++ ES+ A  A+ L +  
Sbjct: 264 AQLRQLEHDLDSLREQLEEESEAKAELERQLSKANAEIQQWRSKFESEGALRAEELEELK 323

Query: 379 MSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTAL 438
                K   +++  +A   + D L  T S    +LE  Q E+ R      ++ +++    
Sbjct: 324 KKLNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAAASELEKKQKNFD 383

Query: 439 RERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLK 498
           +     K++      + D A RE       L +++ + E+   ++             L+
Sbjct: 384 KILAEWKRKVDELQAELDTAQREARNLSTELFRLKNELEELKDQVE-----------ALR 432

Query: 499 RLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSY 558
           R                 E  ++  E+  L++ L +  R +  LE   + L         
Sbjct: 433 R-----------------ENKNLQDEIHDLTDQLGEGGRNVHELEKARRRL--------- 466

Query: 559 QIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDS 618
             E  K E+ +AL + + AL+       R           Q E    R E   R+ E++ 
Sbjct: 467 --EAEKDELQAALEEAEAALELEESKVLR----------AQVELSQIRSEIERRLAEKEE 514

Query: 619 SRKERNEVEAASMDLFGKCQKERMDNLEAN--------QEIDRLRK-------------- 656
                 E E        K  +  +++L+A          E  RL+K              
Sbjct: 515 ------EFENTR-----KNHQRAIESLQATLEAEAKGKAEASRLKKKLEGDINELEIALD 563

Query: 657 HTDKLQAELQDQSVNYLFILTESIQEAEVSKRRRD 691
           H +K  AE Q     Y   + E   + E  +R R+
Sbjct: 564 HANKANAEAQKNVKKYQQQVKELQTQVEEEQRARE 598



 Score = 36.9 bits (86), Expect = 0.11
 Identities = 57/314 (18%), Positives = 128/314 (40%), Gaps = 49/314 (15%)

Query: 308 KELEYYRGQHQAAMAQLEAAAQESSS--------LRSKYSDLISDKQRLDRELLELRSGC 359
           K+LE    QH+ A+A L    Q++ +        L+ + +    +K +L  E+ +L +  
Sbjct: 109 KDLEEANLQHEEALATLRKKHQDAINELSEQIEQLQKQKAKAEKEKSQLQAEVDDLLAQL 168

Query: 360 CTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEE 419
                +    +A+   +   S L++ +   DE   L+++ +DL +  S    +L   Q E
Sbjct: 169 --DQIAKAKLNAEKKAKQLESQLSELQVKLDE---LQRQLNDLTSQKS----RL---QSE 216

Query: 420 VARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRER----------------- 462
            + + +Q E+   + +   + ++ L+ Q   A R  +   RER                 
Sbjct: 217 NSDLTRQLEEAEAQVSNLSKLKSQLESQLEEAKRSLEEESRERANLQAQLRQLEHDLDSL 276

Query: 463 -----------NEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEY 511
                       E +  L+K   + +    +  +  A+R +  ++LK+   ++ + ++E 
Sbjct: 277 REQLEEESEAKAELERQLSKANAEIQQWRSKFESEGALRAEELEELKKKLNQKISELEEA 336

Query: 512 SLIM-SERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASA 570
           +    ++ DS+ K   +L  +L     +++       EL + +K     +   KR++   
Sbjct: 337 AEAANAKCDSLEKTKSRLQSELEDLQIELERANAAASELEKKQKNFDKILAEWKRKVDEL 396

Query: 571 LHDRDKALKECNDL 584
             + D A +E  +L
Sbjct: 397 QAELDTAQREARNL 410


>gnl|CDD|203371 pfam06005, DUF904, Protein of unknown function (DUF904).  This
           family consists of several bacterial and archaeal
           hypothetical proteins of unknown function.
          Length = 72

 Score = 36.9 bits (86), Expect = 0.006
 Identities = 18/56 (32%), Positives = 30/56 (53%)

Query: 521 VHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDK 576
             + +E+L   + QAV  I  L+ME +EL E+ + LS + E L+ E      +R+ 
Sbjct: 2   SLELLEELESKIQQAVETIALLQMEIEELKEENEQLSEENEELEEENKKLKEERNA 57



 Score = 29.2 bits (66), Expect = 2.8
 Identities = 12/49 (24%), Positives = 26/49 (53%)

Query: 399 FDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQ 447
            ++L +    AV+ + L Q E+  +K++ E +++E      E   LK++
Sbjct: 6   LEELESKIQQAVETIALLQMEIEELKEENEQLSEENEELEEENKKLKEE 54


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 40.8 bits (96), Expect = 0.006
 Identities = 28/192 (14%), Positives = 62/192 (32%), Gaps = 30/192 (15%)

Query: 308 KELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDD 367
           + ++ Y    + A+ +LE+          + S+   D + L   L  L            
Sbjct: 289 ELIDKYESHIEKALEELES----ILDTEKENSEFKLDVEELKALLEALEE---------- 334

Query: 368 AGSADSLNQH------YMSALNKYETIKDEYDALRKRFD---DLINTHSTAVDKLELAQE 418
               +   Q         S   + E+I D  +++    D   +LI  H+  +D L+  + 
Sbjct: 335 --ILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKN 392

Query: 419 EVAR-----IKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQ 473
           +  +     +  + ++         +            I+Q +  ++   +  + L K  
Sbjct: 393 KAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEKQL 452

Query: 474 QQHEDAVKEINT 485
              E    EIN 
Sbjct: 453 TNIEPTADEINK 464



 Score = 38.1 bits (89), Expect = 0.045
 Identities = 51/278 (18%), Positives = 107/278 (38%), Gaps = 48/278 (17%)

Query: 339 SDLISDKQRLD-----RELLELRSGC--CTQPESDDAGSADSLNQHYMSALNKYETIKDE 391
           S+LI++    D      E  E    C  C Q  +++      L  H+           +E
Sbjct: 232 SELINNLGNSDWVKEGLEYHEEGDTCPFCQQTITEE--RKAELEAHF----------DEE 279

Query: 392 YDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAA 451
           Y  L ++ ++LI+ + + ++K     E +   +K+  +   +          L++     
Sbjct: 280 YQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEI---- 335

Query: 452 IRQWDIALRERNEY------QEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERN 505
           + +    L E+ +        E++  + +   D +  IN  +    +  + +  L +E+N
Sbjct: 336 LEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIR---EHNEKIDNLKKEKN 392

Query: 506 AAMQEYSLIMSERDSV-HKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLK 564
            A          +  +    + +L ED+    ++ K LE     L ++ K L  +I+ L+
Sbjct: 393 KA----------KKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALE 442

Query: 565 REIAS---ALHDRDKALKECNDLRERFG--EFTAAKEE 597
           +EI      L + +    E N L + +G   F   KE 
Sbjct: 443 KEIKELEKQLTNIEPTADEINKLLKAYGFGNFELEKEA 480


>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
           Synaptonemal complex protein 1 (SCP-1) is the major
           component of the transverse filaments of the
           synaptonemal complex. Synaptonemal complexes are
           structures that are formed between homologous
           chromosomes during meiotic prophase.
          Length = 787

 Score = 40.9 bits (95), Expect = 0.006
 Identities = 82/458 (17%), Positives = 178/458 (38%), Gaps = 44/458 (9%)

Query: 272 EHSPRVHRKSVNSSTQTMPLYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQES 331
           +H    ++K +N   + + L + Q  E     ++ +K+L +   + +    QLE    E 
Sbjct: 225 QHLEEEYKKEINDKEKQVSLLLIQITEK----ENKMKDLTFLLEESRDKANQLE----EK 276

Query: 332 SSLRSK-YSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKD 390
           + L+ +   +LI  K  L +EL +++             +  +L +    A      + +
Sbjct: 277 TKLQDENLKELIEKKDHLTKELEDIKMSL-----QRSMSTQKALEEDLQIATKTICQLTE 331

Query: 391 EYDALRKRFDDLINTHSTAVDKLELAQ---EEVARIKKQCEDMN-----------QEKNT 436
           E +A  +  +     HS  V + E      EE+ R ++Q  + N           Q+K++
Sbjct: 332 EKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKSS 391

Query: 437 ALRERNGLKQQCTAAIRQWDIALRERNEY---QEALAKVQQQHEDAVKEINTAMAVRMKA 493
            L E    K      + +    L E  +    ++   K+ ++ +   +E+   +  R K 
Sbjct: 392 ELEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEELKGKEQELIFLLQAREKE 451

Query: 494 TKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDK 553
             DL+        + + Y     E + +  E+EK      +       L +ENKEL ++ 
Sbjct: 452 IHDLEIQLTAIKTSEEHYL---KEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEA 508

Query: 554 KTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRM 613
             ++ +++  + +I +     ++ LK+  +L E+        E  + E      E   ++
Sbjct: 509 SDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVKCKL 568

Query: 614 GERD----SSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQS 669
            + +    S   E  + E     L  KC   +      N+ I+ L +    L+ +   ++
Sbjct: 569 DKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAEN 628

Query: 670 VNYLFILTESIQEAEVSKRRRDWAFSERDKIEVIDSYH 707
                    +  E +V+K   + A +++   E+ID+Y 
Sbjct: 629 KQL------NAYEIKVNKLELELASAKQKFEEIIDNYQ 660


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 40.5 bits (96), Expect = 0.007
 Identities = 40/183 (21%), Positives = 79/183 (43%), Gaps = 23/183 (12%)

Query: 402 LINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRE---------RNGLKQQCTAAI 452
           +      A   LE A++E   IKK+     +E+   LR          RN L++      
Sbjct: 33  IKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKL----- 87

Query: 453 RQWDIALRERN--EYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQE 510
            +  +  +E N     E L K +++ E   KE+        K  ++L+ L EE+   ++ 
Sbjct: 88  -EKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146

Query: 511 YSLIMSE--RDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVED--KKTLSYQIETLKRE 566
            S + +E  ++ + +++E+ +    +A   IK +E E KE  +   K+ L+  I+    +
Sbjct: 147 ISGLTAEEAKEILLEKVEEEAR--HEAAVLIKEIEEEAKEEADKKAKEILAQAIQRCAAD 204

Query: 567 IAS 569
             +
Sbjct: 205 HVA 207


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
           contains an extraordinarily long coiled-coil domain in
           its C-terminal half and is ubiquitously expressed. It is
           a novel binding partner of several syntaxin family
           members and is possibly involved in Ca2+-dependent
           exocytosis in neuroendocrine cells. Gamma-taxilin,
           described as leucine zipper protein Factor Inhibiting
           ATF4-mediated Transcription (FIAT), localises to the
           nucleus in osteoblasts and dimerises with ATF4 to form
           inactive dimers, thus inhibiting ATF4-mediated
           transcription.
          Length = 309

 Score = 40.1 bits (94), Expect = 0.007
 Identities = 64/311 (20%), Positives = 131/311 (42%), Gaps = 34/311 (10%)

Query: 379 MSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTAL 438
             +L+K +T +++ + L KR  +L+         L+  Q+   +++K+ + +  E + A+
Sbjct: 11  NQSLSKLDTPEEKLETLIKRLAELLKEMKRLEKDLKKLQKRGEQLQKEKDKLQSEHSKAI 70

Query: 439 RERNGLKQQCTAAIRQ----WDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKAT 494
             R+ L+  C    +      +      +E +E   ++ ++ +  +K+I   M       
Sbjct: 71  LARSKLESLCRELQKHNKTLKEENKARSDEEEEKRKELTEKFQSTLKDIQDQMEEHSNPN 130

Query: 495 KDLKRLTEERNAAMQE----YSLIMSERDSVHKEME---KLSE-DLTQAVRKIKTLEMEN 546
             L    EE    ++E    Y L     + + K  +   +L++  L QA  + +  + ++
Sbjct: 131 HKLCEENEELREKLKELIEQYELREQHFEKLLKTKDLEQQLNDAKLEQAQEQAEQEQEKH 190

Query: 547 KELVEDKKTLSYQIETLKREIASALHDRDKALKE-CNDLRERFGEFTAAKEECQREAFHA 605
           K   E     + Q+ TLK        + +K L+   N   ++F EF     +   E F  
Sbjct: 191 KREKEILLKEAAQVSTLK--------ETEKELRAQLNLYSDKFKEFQDTLNK-SNEVF-L 240

Query: 606 RFEYAMRMGERDSSRKERNEVEAASMDL-FGKCQKERMDNLEANQ----EIDRLRKHTDK 660
            F+  M   E+ S + ++ E E  +    + K  K  ++  E  Q    EI++L+K  DK
Sbjct: 241 TFKQEM---EKMSKKIKKLEKENLAWKRKWEKSNKALLEMAEERQKRDKEIEKLKKKIDK 297

Query: 661 LQA---ELQDQ 668
           L+     LQ +
Sbjct: 298 LEKLCRALQAE 308


>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16).  Autophagy is a
           ubiquitous intracellular degradation system for
           eukaryotic cells. During autophagy, cytoplasmic
           components are enclosed in autophagosomes and delivered
           to lysosomes/vacuoles. ATG16 (also known as Apg16) has
           been shown to be bind to Apg5 and is required for the
           function of the Apg12p-Apg5p conjugate in the yeast
           autophagy pathway.
          Length = 194

 Score = 39.1 bits (91), Expect = 0.009
 Identities = 36/196 (18%), Positives = 64/196 (32%), Gaps = 37/196 (18%)

Query: 375 NQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEK 434
               +  L        + D   K F +L   +    D+  L + E   ++ +  +     
Sbjct: 2   RDEILRRLK-------DRDRREKAFSELFEQYRQLADRTRLLKVENDALQAEKYEQQSSH 54

Query: 435 NTALRERNGLKQQCTAAIRQWDIALRE--------RNEYQEALAKVQQQHEDAVKEINTA 486
           +++            A + Q    LRE        R E  + L  +  + E   +EI   
Sbjct: 55  SSSPSADGPGSDAAIAEMEQKLAKLREELTELHKKRGELAQRLLLLNDELEQLRREIQQ- 113

Query: 487 MAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDL-----------TQA 535
                   K +  L  E  +   E    + +     +E EK +E L              
Sbjct: 114 ------LEKTIAELRSEITSLETE----IRDLREELQEKEKDNETLQDELISLNIELNAL 163

Query: 536 VRKIKTLEMENKELVE 551
             K++ L+ EN+ELVE
Sbjct: 164 EEKLRKLQKENQELVE 179



 Score = 31.3 bits (71), Expect = 2.6
 Identities = 33/181 (18%), Positives = 66/181 (36%), Gaps = 22/181 (12%)

Query: 430 MNQEKNTALRERNGLKQQCTAAIRQWDIALR-------ERNEYQEALAKVQQQHEDAVKE 482
              E    L++R+  ++  +    Q+            E +  Q    + Q  H  +   
Sbjct: 1   WRDEILRRLKDRDRREKAFSELFEQYRQLADRTRLLKVENDALQAEKYEQQSSHSSSPSA 60

Query: 483 INTAMAVRMKATKD-LKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKT 541
                   +   +  L +L EE     ++   +      ++ E+E+L  ++ Q  + I  
Sbjct: 61  DGPGSDAAIAEMEQKLAKLREELTELHKKRGELAQRLLLLNDELEQLRREIQQLEKTIAE 120

Query: 542 LEMENKELVEDKKTL-------SYQIETLKREIAS------ALHDRDKALKECND-LRER 587
           L  E   L  + + L           ETL+ E+ S      AL ++ + L++ N  L ER
Sbjct: 121 LRSEITSLETEIRDLREELQEKEKDNETLQDELISLNIELNALEEKLRKLQKENQELVER 180

Query: 588 F 588
           +
Sbjct: 181 W 181


>gnl|CDD|223864 COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer
            membrane].
          Length = 406

 Score = 40.0 bits (94), Expect = 0.010
 Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 5/80 (6%)

Query: 1250 LGISLVGGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELA-KPA 1308
            +GI L   +  G+ V S   GS    AG++ GD I++ +G  +   + +EA   +  KP 
Sbjct: 102  IGIELQMEDIGGVKVVSPIDGSPAAKAGIKPGDVIIKIDGKSVGGVSLDEAVKLIRGKPG 161

Query: 1309 DKVTVLAQYSVERYNEVKDK 1328
             KVT+    ++ R    K  
Sbjct: 162  TKVTL----TILRAGGGKPF 177



 Score = 37.3 bits (87), Expect = 0.057
 Identities = 18/75 (24%), Positives = 40/75 (53%), Gaps = 1/75 (1%)

Query: 1134 IDKSVEPLGIQIQCLDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRSATYQLAAS 1193
                   +GI++Q  D GGV V +  + S A++ G++ GD ++++ G ++   +   A  
Sbjct: 95   TSGEFGGIGIELQMEDIGGVKVVSPIDGSPAAKAGIKPGDVIIKIDGKSVGGVSLDEAVK 154

Query: 1194 VLR-QCGNSITMLVQ 1207
            ++R + G  +T+ + 
Sbjct: 155  LIRGKPGTKVTLTIL 169


>gnl|CDD|212960 cd12027, SH3_ZO-2, Src homology 3 domain of the Tight junction
            protein, Zonula occludens protein 2.  ZO-2 is a
            scaffolding protein that associates with other ZO
            proteins and other proteins of the tight junction, zonula
            adherens, and gap junctions. ZO proteins play roles in
            regulating cytoskeletal dynamics at these cell junctions.
            ZO-2 plays an essential role in embryonic development. It
            is critical for the blood-testis barrier integrity and
            male fertility. It also regulates the expression of
            cyclin D1 and cell proliferation. It is considered a
            member of the MAGUK (membrane-associated guanylate
            kinase) protein family, which is characterized by the
            presence of a core of three domains: PDZ, SH3, and
            guanylate kinase (GuK). The GuK domain in MAGUK proteins
            is enzymatically inactive; instead, the domain mediates
            protein-protein interactions and associates
            intramolecularly with the SH3 domain. The C-terminal
            region of ZO-2 contains an actin-binding region and a
            domain of unknown function designated beta. The SH3
            domain of the related protein ZO-1 has been shown to bind
            ZONAB, ZAK, afadin, and Galpha12. SH3 domains are protein
            interaction domains that bind to proline-rich ligands
            with moderate affinity and selectivity, preferentially to
            PxxP motifs. They play versatile and diverse roles in the
            cell including the regulation of enzymes, changing the
            subcellular localization of signaling pathway components,
            and mediating the formation of multiprotein complex
            assemblies.
          Length = 63

 Score = 36.0 bits (83), Expect = 0.010
 Identities = 22/68 (32%), Positives = 38/68 (55%), Gaps = 5/68 (7%)

Query: 1330 GDSFYMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGI 1389
            GDSF++R  F    +L  +L   F + +I  V +T+++G  G+W A  +   G   + G+
Sbjct: 1    GDSFFIRTHFEYEKELPQSLA--FTRGEIFRVVDTLYDGKLGNWLAVRI---GNELEKGL 55

Query: 1390 IPSKYKVE 1397
            IP+K + E
Sbjct: 56   IPNKSRAE 63


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
           Provisional.
          Length = 2722

 Score = 40.2 bits (94), Expect = 0.011
 Identities = 48/248 (19%), Positives = 96/248 (38%), Gaps = 35/248 (14%)

Query: 372 DSLNQHYMSALNKYETIKDEYDALRKRFDDL------INTHSTAVDKLELAQEEVARIKK 425
           + L +     L   ET+  +    R   +D+      I  +   +  +    +E+  I +
Sbjct: 539 EGLIELIKYYLQSIETLIKDEKLKRSMKNDIKNKIKYIEENVDHIKDIISLNDEIDNIIQ 598

Query: 426 QCEDMNQE----KNTALRERNGLKQQCTAAIRQ-WDIALRE------------RNEYQEA 468
           Q E++  E    K   + E+N L+++    + + +   L+E            +  Y EA
Sbjct: 599 QIEELINEALFNKEKFINEKNDLQEKVKYILNKFYKGDLQELLDELSHFLDDHKYLYHEA 658

Query: 469 LAK--VQQQHEDAVKEINTAMAVRMKAT------KDLKRLTEERNAAMQEYSLIMSERDS 520
            +K  +Q     +  E        MK+       K+LK   E +N    + ++I  + ++
Sbjct: 659 KSKEDLQTLLNTSKNEYEKLE--FMKSDNIDNIIKNLK--KELQNLLSLKENIIKKQLNN 714

Query: 521 VHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKE 580
           + +++       T     +K+   E KE  E  +   +QI   K E    L++ DK L +
Sbjct: 715 IEQDISNSLNQYTIKYNDLKSSIEEYKEEEEKLEVYKHQIINRKNEFILHLYENDKDLPD 774

Query: 581 CNDLRERF 588
             +  E F
Sbjct: 775 GKNTYEEF 782



 Score = 33.3 bits (76), Expect = 1.6
 Identities = 42/256 (16%), Positives = 99/256 (38%), Gaps = 59/256 (23%)

Query: 337  KYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALR 396
            KYSD I  KQR   +L+E                              Y       + L+
Sbjct: 1910 KYSDDIDKKQRDCNKLVE-------------------------DGNEIYLKSTAINE-LK 1943

Query: 397  KRFDDLINTHSTAVDKLELAQEEVARIKK-QCED------MNQEKNTALRE-RNGLKQQC 448
               + + N  S   +K++    +++ + K  C D      + +E+   L++ RN   Q+ 
Sbjct: 1944 NMINSVKNKESAISNKIDNVSNKLSELNKITCNDESYDEILEKEEYEELKDLRNSFNQEK 2003

Query: 449  TAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAM 508
               +    +     N+ +E     +   ++  K + T     +KA++++K++ E +  + 
Sbjct: 2004 AETLNNLKL-----NKIKEDFNSYKNLLDELEKSVKT-----LKASENIKKIVENKKTS- 2052

Query: 509  QEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIA 568
                 I +  + + KE+E ++  L + ++K   +E+     +         I+ ++ +I+
Sbjct: 2053 --IDAINTNIEDIEKEIESINPSLDELLKKGHKIEISRYTSI---------IDNVQTKIS 2101

Query: 569  ---SALHDRDKALKEC 581
                 ++D +K  +  
Sbjct: 2102 NDSKNINDIEKKAQIY 2117



 Score = 32.9 bits (75), Expect = 1.9
 Identities = 42/210 (20%), Positives = 75/210 (35%), Gaps = 39/210 (18%)

Query: 374 LNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE 433
           +N +Y   +  Y T    Y    K++ D IN        +E   EE+ +  +QC      
Sbjct: 118 VNIYYFKEMIHYAT---SYYDDLKKYSDKINE------DVEPLNEEIIKNIEQC---LGN 165

Query: 434 KNTALRERNGLKQQCTAAIRQ--WDIALRE----RNEYQEALAKVQQQHEDAVKEINTAM 487
           KN        L+      +R+  +D    E    +  +   L   ++ ++  +K+IN  +
Sbjct: 166 KNDLDNLIIVLENPEKYNVRKTLYDEKFNEYKNKKEAFYNCLKNKKEDYDKKIKKINNEI 225

Query: 488 AVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVH-KEMEK---------LSEDLTQAVR 537
                  K LK +    N    +  + M E   +   E+             E L     
Sbjct: 226 ------RKLLKNIKCTGNMCKTDTYVDMVELYLLRVNEVPSNNYDNYLNRAKELLESGSD 279

Query: 538 KIKTLEMENKELVEDKKTLSYQIETLKREI 567
            I  ++ E    + D KT+ Y I  ++ EI
Sbjct: 280 LINKIKKE----LGDNKTI-YSINFIQEEI 304



 Score = 32.9 bits (75), Expect = 2.3
 Identities = 45/237 (18%), Positives = 101/237 (42%), Gaps = 31/237 (13%)

Query: 384  KYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNG 443
            + + +   Y+ L K+ DDLI       D +EL  + +    K+ E+   +  + L +   
Sbjct: 999  EIDKLNVNYNILNKKIDDLIKKQH--DDIIELIDKLIKEKGKEIEEKVDQYISLLEK--- 1053

Query: 444  LKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMA----VRMKATKDLKR 499
            +K + ++     DI   +  + +E +  ++++ E  +K+I+        ++ K+ + +  
Sbjct: 1054 MKTKLSSFHFNIDIKKYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVVN 1113

Query: 500  LTEERNAAMQEYSLIMSERDSVHKEMEKLSEDL-TQAVRKIKTLEMENKEL--------- 549
              +E+N   + Y+      + ++K+MEK  ++L    +  I   E+   E+         
Sbjct: 1114 ADKEKNKQTEHYNKKKKSLEKIYKQMEKTLKELENMNLEDITLNEVNEIEIEYERILIDH 1173

Query: 550  --------VEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEEC 598
                     +  KT+  +IE+ K++I      +    KE ND    F E+ A  ++ 
Sbjct: 1174 IVEQINNEAKKSKTIMEEIESYKKDIDQV---KKNMSKERNDHLTTF-EYNAYYDKA 1226


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 39.2 bits (92), Expect = 0.014
 Identities = 27/136 (19%), Positives = 50/136 (36%), Gaps = 33/136 (24%)

Query: 490 RMKATKDLK-RLTEERNAAMQEYSLIMSERDSVHKEMEKLSE------------------ 530
           RMK  + LK  L E      ++Y L+M E + ++    KL +                  
Sbjct: 138 RMKLLEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDE 197

Query: 531 -------DLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECND 583
                  +L +A  K+K L  E    V+  + L  +++ L+ +I            + ++
Sbjct: 198 LEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIE-------DLTNKKSE 250

Query: 584 LRERFGEFTAAKEECQ 599
           L     E     E+C+
Sbjct: 251 LNTEIAEAEKKLEQCR 266



 Score = 33.8 bits (78), Expect = 0.67
 Identities = 40/207 (19%), Positives = 85/207 (41%), Gaps = 40/207 (19%)

Query: 491 MKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELV 550
             A+ D+K L +++   ++ ++ + +++   ++   KL E L       + L+   + L 
Sbjct: 105 FSASPDVKLLMDKQFQLVKTFARLEAKKMW-YEWRMKLLEGL------KEGLDENLEGLK 157

Query: 551 EDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYA 610
           ED K L  ++E L   I   L DR  AL+E         +    ++E +           
Sbjct: 158 EDYKLLMKELELLN-SIKPKLRDRKDALEE------ELRQLKQLEDELED---------- 200

Query: 611 MRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQSV 670
                      +  E++ A   L    +K   + +   ++++ L +   +L+++++D   
Sbjct: 201 ----------CDPTELDRAKEKL----KKLLQEIMIKVKKLEELEEELQELESKIED-LT 245

Query: 671 NYLFILTESIQEAE-VSKRRRDWAFSE 696
           N    L   I EAE   ++ R + F E
Sbjct: 246 NKKSELNTEIAEAEKKLEQCRGFTFKE 272



 Score = 32.3 bits (74), Expect = 2.0
 Identities = 24/115 (20%), Positives = 56/115 (48%), Gaps = 5/115 (4%)

Query: 386 ETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTAL-RERNGL 444
           E +K++Y  L K  + L +      D+ +  +EE+ ++K+  +++     T L R +  L
Sbjct: 154 EGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKL 213

Query: 445 KQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKR 499
           K+      ++  I +++  E +E L +++ + ED   + +       +A K L++
Sbjct: 214 KKLL----QEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQ 264



 Score = 31.5 bits (72), Expect = 3.1
 Identities = 18/87 (20%), Positives = 38/87 (43%), Gaps = 3/87 (3%)

Query: 390 DEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCT 449
            E D  +++   L+      V KLE  +EE+  ++ + ED+  +K+    E    +++  
Sbjct: 204 TELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKK-L 262

Query: 450 AAIRQWDIALRERNEYQEALAKVQQQH 476
              R +    +E  + +E L  +Q   
Sbjct: 263 EQCRGFTF--KEIEKLKEQLKLLQSLT 287



 Score = 31.1 bits (71), Expect = 4.9
 Identities = 31/155 (20%), Positives = 61/155 (39%), Gaps = 20/155 (12%)

Query: 411 DKLELAQEEVARIKKQCEDMN------QEKNTALRERNGLKQQCTAAIRQWDIALRERNE 464
           + LE  +E+   + K+ E +N      +++  AL E     +Q    +   D    +R  
Sbjct: 151 ENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDR-- 208

Query: 465 YQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHK- 523
            +E L K+ Q+    VK++        +    ++ LT +++    E   I      + + 
Sbjct: 209 AKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTE---IAEAEKKLEQC 265

Query: 524 ------EMEKLSE--DLTQAVRKIKTLEMENKELV 550
                 E+EKL E   L Q++   K  ++    L 
Sbjct: 266 RGFTFKEIEKLKEQLKLLQSLTGWKITKLSGNTLS 300


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 39.6 bits (92), Expect = 0.014
 Identities = 49/284 (17%), Positives = 113/284 (39%), Gaps = 36/284 (12%)

Query: 412 KLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAK 471
           K    + +V+ ++ + E +      +  +   L++Q   A    +    ERN    AL  
Sbjct: 158 KFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSSESMSAERN----ALLA 213

Query: 472 VQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSED 531
            + +++  + E+           +D++ LT++     QE   ++     +  E+E++  +
Sbjct: 214 QRAENQQRILELE----------QDIQTLTQK----KQENDRVLEGTQDIEAELERMKGE 259

Query: 532 LTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEF 591
           L Q ++K  T++  ++E       L  + E L  ++ +     + + ++   L E     
Sbjct: 260 LKQRLKK-MTIQRRDEETERIDLQL--ENEQLHEDLRTLQERLESSQQKAGLLGEELASL 316

Query: 592 TAAKEECQREAFHARFEYA-MRMGERDSSRKERNEV-----EAASMDLFGKCQKERMDNL 645
            + ++    E   +R E A M +     +   +        E  ++    +  K+R+  L
Sbjct: 317 GSLRDHTIAELHQSRLESAQMSLQLSQLNLALKEGQSQWAQERETLRQSAEADKDRIQKL 376

Query: 646 EANQEIDRL-------RKHTDKLQAELQDQSVNYLFILTESIQE 682
            A  E+ +L       R   +KL+ EL  +       L+E+ +E
Sbjct: 377 SA--ELLKLEEWLQEERSQREKLEVELGIEKDCNRVQLSENRRE 418



 Score = 38.4 bits (89), Expect = 0.038
 Identities = 67/319 (21%), Positives = 126/319 (39%), Gaps = 50/319 (15%)

Query: 384 KYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNG 443
           K+  ++ +   +R R + L      + +K E  +E+    +   E M+ E+N  L +R  
Sbjct: 158 KFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSSESMSAERNALLAQRAE 217

Query: 444 LKQQC----------TAAIRQWDIAL-----------RERNEYQEALAKVQQQHEDAVKE 482
            +Q+           T   ++ D  L           R + E ++ L K+  Q  D   E
Sbjct: 218 NQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTIQRRDEETE 277

Query: 483 I------NTAMAVRMKATKDLKRLTEERNAAM-QEYSLIMSERDSVHKEMEKLSEDLTQA 535
                  N  +   ++  ++    ++++   + +E + + S RD    E+ +   +  Q 
Sbjct: 278 RIDLQLENEQLHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTIAELHQSRLESAQM 337

Query: 536 VRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAK 595
             ++  L   N  L E +   + + ETL++   SA  D+D+  K   +L +   E    +
Sbjct: 338 SLQLSQL---NLALKEGQSQWAQERETLRQ---SAEADKDRIQKLSAELLKL--EEWLQE 389

Query: 596 EECQREAFHARF--EYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEI-- 651
           E  QRE        E      +   +R+E +E+ +A   L    QKE+    E  QE+  
Sbjct: 390 ERSQREKLEVELGIEKDCNRVQLSENRRELSELRSALRVL----QKEKEQLQEEKQELLD 445

Query: 652 ------DRLRKHTDKLQAE 664
                  RL K  D+  AE
Sbjct: 446 YIRVLELRLDKEADEKWAE 464


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 39.4 bits (92), Expect = 0.016
 Identities = 31/132 (23%), Positives = 53/132 (40%), Gaps = 22/132 (16%)

Query: 386 ETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE-----KNTALRE 440
           E  K  Y   ++  + LI   S    +LE   E + ++ K+ E + +E     +    RE
Sbjct: 500 EQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERE 559

Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAV---KEINTAMAVRMKATKDL 497
           RN   +                 E QEAL  ++++ E  +   KE     A  +K+ +DL
Sbjct: 560 RNKKLEL--------------EKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDL 605

Query: 498 KRLTEERNAAMQ 509
            +L E +    Q
Sbjct: 606 VKLKETKQKIPQ 617



 Score = 31.7 bits (72), Expect = 3.9
 Identities = 28/106 (26%), Positives = 54/106 (50%), Gaps = 9/106 (8%)

Query: 482 EINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKT 541
           EIN  +         L++  E++N  +++   ++ E++ + KE+E+  E+L +  R  K 
Sbjct: 512 EINVLIE----KLSALEKELEQKNEHLEK---LLKEQEKLKKELEQEMEELKERERNKKL 564

Query: 542 LEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRER 587
            E+E KE  E  K L  ++E++ RE+      + K +K   DL + 
Sbjct: 565 -ELE-KEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKL 608


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 39.7 bits (92), Expect = 0.017
 Identities = 53/264 (20%), Positives = 112/264 (42%), Gaps = 16/264 (6%)

Query: 414  ELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQ 473
            EL + E  +  ++ +   ++KN ALR+    K+   A I +      E  + +   AK  
Sbjct: 1556 ELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKA 1615

Query: 474  QQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQ-----EYSLIMSERDSVHKEMEKL 528
            ++ +   +E+  A   + K  +  K+  EE+  A +     E + I +  ++   E +K 
Sbjct: 1616 EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKK 1675

Query: 529  -SEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRER 587
             +E+  +A    K      K+  E+ K    + E LK++ A      ++  K   + + +
Sbjct: 1676 KAEEAKKAEEDEKKAAEALKKEAEEAK----KAEELKKKEAEEKKKAEELKKAEEENKIK 1731

Query: 588  FGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEA 647
              E     EE +++A  A+ +      E+      + E E  + ++  + +KE +   E 
Sbjct: 1732 AEEAKKEAEEDKKKAEEAKKD----EEEKKKIAHLKKEEEKKAEEI--RKEKEAVIEEEL 1785

Query: 648  NQEIDRLRKHTDKLQAELQDQSVN 671
            ++E ++ R   DK   ++ D   N
Sbjct: 1786 DEEDEKRRMEVDKKIKDIFDNFAN 1809


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 39.6 bits (93), Expect = 0.018
 Identities = 59/275 (21%), Positives = 107/275 (38%), Gaps = 45/275 (16%)

Query: 298  ETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRS 357
            +  RR     + L  +  Q Q   +QLE A +  S+L      L+     L  E L  R 
Sbjct: 841  QLNRRRVELERALADHESQEQQQRSQLEQAKEGLSALNR----LLPRLNLLADETLADRV 896

Query: 358  GCCTQPESDDAGSADSLNQHYMSALNKYETI-------KDEYDALRKRFDDLINTHSTAV 410
                + + D+A  A    Q + +AL + E I        ++++ L++ +     T   A 
Sbjct: 897  EEI-REQLDEAEEAKRFVQQHGNALAQLEPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAK 955

Query: 411  DKLELAQEEVARIK----KQCEDM---NQEKNTALRERNGLKQ------QCTAAIRQWDI 457
             +     E V R      +   +M   N + N  LR+R  L+Q      +    +RQ   
Sbjct: 956  QQAFALTEVVQRRAHFSYEDAAEMLAKNSDLNEKLRQR--LEQAEQERTRAREQLRQAQA 1013

Query: 458  ALRERNEYQEAL-----AKVQQQHEDAVKEIN-------TAMAVRMKATKD-----LKRL 500
             L + N+   +L     AK +Q  ++  +E+        +    R +A +D     L   
Sbjct: 1014 QLAQYNQVLASLKSSYDAK-RQMLQELKQELQDLGVPADSGAEERARARRDELHARLSAN 1072

Query: 501  TEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQA 535
               RN   ++ +   +E D++ K++ KL  D  + 
Sbjct: 1073 RSRRNQLEKQLTFCEAEMDNLTKKLRKLERDYHEM 1107



 Score = 33.8 bits (78), Expect = 1.1
 Identities = 50/248 (20%), Positives = 96/248 (38%), Gaps = 38/248 (15%)

Query: 314 RGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLD------------RELLELRSGCCT 361
             Q +   A+ EAA +E   L+S+ +D    +Q LD             + LE     C 
Sbjct: 375 DEQQEENEARAEAAEEEVDELKSQLADY---QQALDVQQTRAIQYQQAVQALERAKQLCG 431

Query: 362 QPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVA 421
            P+     + D L +       K +   +E  +L ++          A  + E A + V 
Sbjct: 432 LPDLTADNAEDWLEE----FQAKEQEATEELLSLEQKLSV----AQAAHSQFEQAYQLVR 483

Query: 422 RI-----KKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRE-RNEYQEALAKVQQQ 475
           +I     + +  D+ +E    LRE+  L +Q    ++Q  + L E     ++     QQ+
Sbjct: 484 KIAGEVSRSEAWDVARELLRRLREQRHLAEQ----LQQLRMRLSELEQRLRQ-----QQR 534

Query: 476 HEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQA 535
            E  + E    +   +    +L++L EE  A ++  S  +SE       + +  E L   
Sbjct: 535 AERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQAR 594

Query: 536 VRKIKTLE 543
           ++++    
Sbjct: 595 IQRLAARA 602


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 39.3 bits (92), Expect = 0.019
 Identities = 57/330 (17%), Positives = 101/330 (30%), Gaps = 67/330 (20%)

Query: 364 ESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLE----LAQEE 419
            S      ++L         KYE +  E   LR   ++   +  +   K         EE
Sbjct: 204 PSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSEGGDLFEE 263

Query: 420 VARIKKQCEDMNQEKNTALRERNGLKQQC----------TAAIRQWDI------------ 457
             ++++Q +++   +     +   L               +   Q               
Sbjct: 264 REQLERQLKEIEAARKANRAQLRELAADPLPLLLIPNLLDSTKAQLQKEEQSQQNQLTQE 323

Query: 458 ALRER-NEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMS 516
            L ER  E  E+L K+     + VKEI   +A   K       +                
Sbjct: 324 ELEERDKELLESLPKLALP-AEHVKEIAAELAEIDKPATTDSEIPH-------------- 368

Query: 517 ERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDK 576
                  E+ +L E L Q V+         +EL + K  L  ++  L+ E+A      DK
Sbjct: 369 --RLSGSELTQL-EVLIQQVK---------RELQDAKSQLLKELRELEEELAEV----DK 412

Query: 577 ALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGK 636
            +       E+  +      E Q E F +  E      E     +   E    +++   K
Sbjct: 413 KI-STIPSEEQIAQLLEELGEAQNELFRSEAEIE----ELLRQLETLKE----AIEALRK 463

Query: 637 CQKERMDNLEANQEIDRLRKHTDKLQAELQ 666
              E+        E++R     DK +  L+
Sbjct: 464 TLDEKTKQKINAFELERAITIADKAKKTLK 493



 Score = 32.0 bits (73), Expect = 3.6
 Identities = 16/59 (27%), Positives = 27/59 (45%)

Query: 526 EKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDL 584
            K SE  +  + +I+ LE E KE  E  + L+ +I  L+ E+  A    +   K+    
Sbjct: 198 RKKSELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSE 256


>gnl|CDD|212963 cd12030, SH3_DLG4, Src Homology 3 domain of Disks Large homolog 4.
            DLG4, also called postsynaptic density-95 (PSD95) or
            synapse-associated protein 90 (SAP90), is a scaffolding
            protein that clusters at synapses and plays an important
            role in synaptic development and plasticity. It is
            responsible for the membrane clustering and retention of
            many transporters and receptors such as potassium
            channels and PMCA4b, a P-type ion transport ATPase, among
            others. DLG4 is a member of the MAGUK
            (membrane-associated guanylate kinase) protein family,
            which is characterized by the presence of a core of three
            domains: PDZ, SH3, and guanylate kinase (GuK). The GuK
            domain in MAGUK proteins is enzymatically inactive;
            instead, the domain mediates protein-protein interactions
            and associates intramolecularly with the SH3 domain. DLG4
            contains three PDZ domains. The SH3 domain of DLG4 binds
            and clusters the kainate subgroup of glutamate receptors
            via two proline-rich sequences in their C-terminal tail.
            It also binds AKAP79/150 (A-kinase anchoring protein).
            SH3 domains are protein interaction domains that bind to
            proline-rich ligands with moderate affinity and
            selectivity, preferentially to PxxP motifs. They play
            versatile and diverse roles in the cell including the
            regulation of enzymes, changing the subcellular
            localization of signaling pathway components, and
            mediating the formation of multiprotein complex
            assemblies.
          Length = 66

 Score = 35.3 bits (81), Expect = 0.020
 Identities = 28/71 (39%), Positives = 38/71 (53%), Gaps = 10/71 (14%)

Query: 1332 SFYMRALFS--RTGDLG---DALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQ 1386
             FY+RALF   +T D G    AL  RF   D+L+V   +  G    W+A  V +D   ++
Sbjct: 1    GFYIRALFDYDKTKDCGFLSQALSFRF--GDVLHV---IDAGDEEWWQARRVHSDSETEE 55

Query: 1387 CGIIPSKYKVE 1397
             G IPSK +VE
Sbjct: 56   IGFIPSKRRVE 66


>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ.  Members of
           this family are the FliJ protein found, in nearly every
           case, in the midst of other flagellar biosynthesis genes
           in bacgterial genomes. Typically the fliJ gene is found
           adjacent to the gene for the flagellum-specific ATPase
           FliI. Sequence scoring in the gray zone between trusted
           and noise cutoffs include both probable FliJ proteins
           and components of bacterial type III secretion systems.
          Length = 141

 Score = 36.9 bits (86), Expect = 0.021
 Identities = 26/121 (21%), Positives = 47/121 (38%), Gaps = 32/121 (26%)

Query: 473 QQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEY--------------------- 511
           +++ E A  E+  A A   +    L++L + R    Q+                      
Sbjct: 12  EKEEEQAKLELAKAQAEFERLETQLQQLIKYREEYEQQALEKVGAGTSALELSNYQRFIR 71

Query: 512 ---SLIMSERDSVH---KEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKR 565
                I  ++  +    +E+E   E L +A R++K LE      +++KK   Y+ E  KR
Sbjct: 72  QLDQRIQQQQQELALLQQEVEAKRERLLEARRELKALEK-----LKEKKQKEYRAEEAKR 126

Query: 566 E 566
           E
Sbjct: 127 E 127


>gnl|CDD|212962 cd12029, SH3_DLG3, Src Homology 3 domain of Disks Large homolog 3.
            DLG3, also called synapse-associated protein 102
            (SAP102), is a scaffolding protein that clusters at
            synapses and plays an important role in synaptic
            development and plasticity. Mutations in DLG3 cause
            midgestational embryonic lethality in mice and may be
            associated with nonsyndromic X-linked mental retardation
            in humans. It interacts with the NEDD4 (neural precursor
            cell-expressed developmentally downregulated 4) family of
            ubiquitin ligases and promotes apical tight junction
            formation. DLG3 is a member of the MAGUK
            (membrane-associated guanylate kinase) protein family,
            which is characterized by the presence of a core of three
            domains: PDZ, SH3, and guanylate kinase (GuK). The GuK
            domain in MAGUK proteins is enzymatically inactive;
            instead, the domain mediates protein-protein interactions
            and associates intramolecularly with the SH3 domain. DLG3
            contains three PDZ domains. SH3 domains are protein
            interaction domains that bind to proline-rich ligands
            with moderate affinity and selectivity, preferentially to
            PxxP motifs. They play versatile and diverse roles in the
            cell including the regulation of enzymes, changing the
            subcellular localization of signaling pathway components,
            and mediating the formation of multiprotein complex
            assemblies.
          Length = 67

 Score = 35.1 bits (80), Expect = 0.022
 Identities = 30/69 (43%), Positives = 37/69 (53%), Gaps = 6/69 (8%)

Query: 1332 SFYMRALFS--RTGDLGDALQ-LRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCG 1388
            S Y+RALF   RT D     Q L F   DIL+V N   +     W+A LV   G  +Q G
Sbjct: 2    SLYVRALFDYDRTRDSCLPSQGLSFSYGDILHVINASDDE---WWQARLVTPHGESEQIG 58

Query: 1389 IIPSKYKVE 1397
            +IPSK +VE
Sbjct: 59   VIPSKKRVE 67


>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
          Length = 977

 Score = 39.1 bits (91), Expect = 0.026
 Identities = 61/292 (20%), Positives = 120/292 (41%), Gaps = 46/292 (15%)

Query: 428 EDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQ-----HEDAVKE 482
           ++  Q +NT+LR    L Q+ T++    + A  +R+E   A+   QQ       + +  +
Sbjct: 70  DENGQLENTSLRTVMELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQTNSKDGEQLSDFQ 129

Query: 483 INTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTL 542
           +   + +   A K++  L + R  A+++   I++E++++  ++  L   L++   +IK  
Sbjct: 130 LEDLVGMIQNAEKNILLLNQARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLA 189

Query: 543 EME--NKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEE--- 597
             E  + E++E+      Q+E L+ E+        +   E   +     E    KEE   
Sbjct: 190 AQEKIHVEILEE------QLEKLRNELLI------RGATEGLCVHSLSKELDVLKEENML 237

Query: 598 --CQREAFHARFEYAMRMGERDSS-RKERNEVEAASMDLFGK-----------------C 637
                +   A         ER     KER+ ++A+  +L  K                 C
Sbjct: 238 LKDDIQFLKAELIEVAETEERVFKLEKERSLLDASLRELESKFIVAQEDVSKLSPLQYDC 297

Query: 638 QKERMDNLEANQEIDRLRKHTDKLQAELQ--DQSVNYLFILTESIQEAEVSK 687
             E+++NL+    +DR     +K    L       + +  L  S++EA VSK
Sbjct: 298 WWEKVENLQD--LLDRATNQVEKAALVLDQNQDLRDKVDKLEASLKEANVSK 347



 Score = 34.5 bits (79), Expect = 0.64
 Identities = 74/361 (20%), Positives = 134/361 (37%), Gaps = 79/361 (21%)

Query: 331 SSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKD 390
           SS  + K    I+ KQR     L+  +    Q E+    +   L Q         ++   
Sbjct: 43  SSQQKKKRGKNIAPKQRSSNSKLQSNTDENGQLENTSLRTVMELPQ---------KSTSS 93

Query: 391 EYDALRKRFDDLINTHSTAVDK--LELAQEEVARIKKQCEDM-----NQEKNTALRERNG 443
           + D    R     +    A+D      +++       Q ED+     N EKN  L     
Sbjct: 94  DDD--HNRASMQRDEAIAAIDNEQQTNSKDGEQLSDFQLEDLVGMIQNAEKNILL----- 146

Query: 444 LKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEE 503
           L Q    A+   +  L E+   Q  +  ++ +  +    I  A   ++      ++L + 
Sbjct: 147 LNQARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAAQEKIHVEILEEQLEKL 206

Query: 504 RNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETL 563
           RN  +       +E   VH   ++L             L+ EN  L +D       I+ L
Sbjct: 207 RNELLIR---GATEGLCVHSLSKELD-----------VLKEENMLLKDD-------IQFL 245

Query: 564 KREIASALHDRDKAL----KECN----DLRERFGEFTAAKEEC------QREAFHARFEY 609
           K E+   + + ++ +    KE +     LRE   +F  A+E+       Q + +  + E 
Sbjct: 246 KAEL-IEVAETEERVFKLEKERSLLDASLRELESKFIVAQEDVSKLSPLQYDCWWEKVEN 304

Query: 610 AMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQDQS 669
              + +R +     N+VE A++             L+ NQ+   LR   DKL+A L++ +
Sbjct: 305 LQDLLDRAT-----NQVEKAAL------------VLDQNQD---LRDKVDKLEASLKEAN 344

Query: 670 V 670
           V
Sbjct: 345 V 345



 Score = 34.1 bits (78), Expect = 0.76
 Identities = 48/266 (18%), Positives = 110/266 (41%), Gaps = 46/266 (17%)

Query: 374 LNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE 433
           LNQ  + AL   E I  E +AL+ + + L    S    +++LA +E   ++   E + + 
Sbjct: 147 LNQARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAAQEKIHVEILEEQLEKL 206

Query: 434 KNTALRERNGLKQQCTAAIRQWDIALRERN------------------EYQEALAKVQQQ 475
           +N  L           +  ++ D+ L+E N                  E +E + K++++
Sbjct: 207 RNELLIRGATEGLCVHSLSKELDV-LKEENMLLKDDIQFLKAELIEVAETEERVFKLEKE 265

Query: 476 H---EDAVKEINTAMAVR---------------MKATKDLKRLTEERNAAMQEYSLIMSE 517
               + +++E+ +   V                 +  ++L+ L +     +++ +L++ +
Sbjct: 266 RSLLDASLRELESKFIVAQEDVSKLSPLQYDCWWEKVENLQDLLDRATNQVEKAALVLDQ 325

Query: 518 RDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDK-KTLSYQIETLKREIASALHDRDK 576
              +  +++KL   L +A   +        EL++ K K L  +++    EI S +    +
Sbjct: 326 NQDLRDKVDKLEASLKEA--NVSKFSSYKVELLQQKLKLLEERLQASDHEIHSYIQLYQE 383

Query: 577 ALKECNDLRERFGEFTAAKEECQREA 602
           ++KE  D   +       KEE ++ +
Sbjct: 384 SIKEFQDTLSKL------KEESKKRS 403


>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
           binding.  This is the C-terminal conserved coiled coil
           region of a family of TATA element modulatory factor 1
           proteins conserved in eukaryotes. The proteins bind to
           the TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMF1_TATA_bd is the most conserved
           part of the TMFs. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells. The Rab6-binding
           domain appears to be the same region as this C-terminal
           family.
          Length = 121

 Score = 36.0 bits (84), Expect = 0.028
 Identities = 21/88 (23%), Positives = 42/88 (47%), Gaps = 13/88 (14%)

Query: 484 NTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLE 543
           N  +  R+ +T  L+RL  E  +   E + + +ERD   +E+          V+  +  E
Sbjct: 15  NVQLVERLSST--LRRLEGELASLKDELARLEAERDEARQEI----------VKLTEENE 62

Query: 544 MENKELVEDKKTLSYQIETLKREIASAL 571
            E +EL ++ + L  ++E L++   + L
Sbjct: 63  -ELRELKKEIEELEKELEDLEQRYETTL 89



 Score = 29.1 bits (66), Expect = 7.3
 Identities = 20/106 (18%), Positives = 41/106 (38%), Gaps = 29/106 (27%)

Query: 318 QAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQH 377
            + + +LE    E +SL+ + + L +++    +E+++L                      
Sbjct: 23  SSTLRRLEG---ELASLKDELARLEAERDEARQEIVKL---------------------- 57

Query: 378 YMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARI 423
                 +   +K E + L K  +DL   + T    LEL  E+  R+
Sbjct: 58  -TEENEELRELKKEIEELEKELEDLEQRYETT---LELLGEKSERV 99



 Score = 29.1 bits (66), Expect = 8.0
 Identities = 25/105 (23%), Positives = 43/105 (40%), Gaps = 29/105 (27%)

Query: 450 AAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQ 509
           + +R+ +    E    ++ LA+++ + ++A +EI                  EE     +
Sbjct: 24  STLRRLEG---ELASLKDELARLEAERDEARQEIVKLT-----------EENEELRELKK 69

Query: 510 EYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEM--ENKELVED 552
           E            +E+EK  EDL Q  R   TLE+  E  E VE+
Sbjct: 70  EI-----------EELEKELEDLEQ--RYETTLELLGEKSERVEE 101


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 38.4 bits (89), Expect = 0.034
 Identities = 46/179 (25%), Positives = 85/179 (47%), Gaps = 27/179 (15%)

Query: 400 DDLINTHST----AVDKLELAQEEVARIKKQCEDMNQEKNTALRERN-----GLKQQCTA 450
           + L+  H+     AV   E A+ E A +K+Q ED+  +K  A  +R       LK +C  
Sbjct: 2   EALVKQHAKVAEEAVSGWEKAEAEAAALKQQLEDVTLKKLAA-EDRASHLDGALK-ECMR 59

Query: 451 AIRQWDIALRERNE--YQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAM 508
            +RQ    +RE  E   Q+A  +  ++ E    E+   +    +  K+L R   E N A+
Sbjct: 60  QLRQ----VREEQEQKIQDAATEKTKEWEKVKAELEKKLL---ELQKELARAAAE-NDAL 111

Query: 509 QEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREI 567
              S  + E+    K + +LSE+ ++A  +I+ L+   +   ++  +L Y++  L +E+
Sbjct: 112 ---SRSLQEKS---KMIMELSEEKSRAESEIEDLKARLESAEKENSSLKYEVHVLSKEL 164


>gnl|CDD|212965 cd12032, SH3_DLG2, Src Homology 3 domain of Disks Large homolog 2.
            DLG2, also called postsynaptic density-93 (PSD93) or
            Channel-associated protein of synapse-110 (chapsyn 110),
            is a scaffolding protein that clusters at synapses and
            plays an important role in synaptic development and
            plasticity. The DLG2 delta isoform binds inwardly
            rectifying potassium Kir2 channels, which determine
            resting membrane potential in neurons. It regulates the
            spatial and temporal distribution of Kir2 channels within
            neuronal membranes. DLG2 is a member of the MAGUK
            (membrane-associated guanylate kinase) protein family,
            which is characterized by the presence of a core of three
            domains: PDZ, SH3, and guanylate kinase (GuK). The GuK
            domain in MAGUK proteins is enzymatically inactive;
            instead, the domain mediates protein-protein interactions
            and associates intramolecularly with the SH3 domain. DLG2
            contains three PDZ domains. SH3 domains are protein
            interaction domains that bind to proline-rich ligands
            with moderate affinity and selectivity, preferentially to
            PxxP motifs. They play versatile and diverse roles in the
            cell including the regulation of enzymes, changing the
            subcellular localization of signaling pathway components,
            and mediating the formation of multiprotein complex
            assemblies.
          Length = 74

 Score = 34.7 bits (79), Expect = 0.036
 Identities = 27/71 (38%), Positives = 39/71 (54%), Gaps = 6/71 (8%)

Query: 1332 SFYMRALF--SRTGDLGDALQ-LRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCG 1388
            S Y+RA+F   ++ D G   Q L F   DIL+V N   +     W+A  V  DG  ++ G
Sbjct: 5    SLYVRAMFDYEKSKDSGLPSQGLSFRYGDILHVINASDDE---WWQARRVTPDGDSEEMG 61

Query: 1389 IIPSKYKVEEE 1399
            +IPSK +VE +
Sbjct: 62   VIPSKRRVERK 72


>gnl|CDD|185320 PRK15422, PRK15422, septal ring assembly protein ZapB; Provisional.
          Length = 79

 Score = 34.7 bits (79), Expect = 0.044
 Identities = 22/62 (35%), Positives = 35/62 (56%), Gaps = 7/62 (11%)

Query: 526 EKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLR 585
           EKL   + QA+  I  L+ME +EL E   +LS       +E+ +A H R++  +E N L+
Sbjct: 7   EKLEAKVQQAIDTITLLQMEIEELKEKNNSLS-------QEVQNAQHQREELERENNHLK 59

Query: 586 ER 587
           E+
Sbjct: 60  EQ 61


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 38.1 bits (89), Expect = 0.046
 Identities = 30/197 (15%), Positives = 72/197 (36%), Gaps = 21/197 (10%)

Query: 376 QHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCED------ 429
              +    K     +     + ++D+L+    T   ++E   +E+  +    ED      
Sbjct: 201 NKNIEEQRKKN--GENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALN 258

Query: 430 -MNQE------------KNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQH 476
            +N              K   + E+ G+   CT  I +    + +  +  + L    ++ 
Sbjct: 259 KLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKL 318

Query: 477 EDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAV 536
           + A+ E+   M    + +K L  L  + +   Q    ++ +   V   +E+L  +     
Sbjct: 319 DTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNA 378

Query: 537 RKIKTLEMENKELVEDK 553
            ++  L+ E  ++V+ K
Sbjct: 379 EELAKLQDELDKIVKTK 395



 Score = 33.4 bits (77), Expect = 1.0
 Identities = 42/240 (17%), Positives = 88/240 (36%), Gaps = 36/240 (15%)

Query: 380 SALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKN---- 435
           S ++K    KD+   L ++   L         +++   + +   +K+  +    K     
Sbjct: 166 SEMDK--LNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYD 223

Query: 436 TALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHED---AVKEINTAMAVRMK 492
             + E   +K +                E  + L  +    ED   A+ ++NTA A ++K
Sbjct: 224 ELVEEAKTIKAE--------------IEELTDELLNLVMDIEDPSAALNKLNTAAA-KIK 268

Query: 493 A-----TKDLKRLTE--ERNAAMQEYS----LIMSERDSVHKEMEKLSEDLTQAVRKIKT 541
           +      K +K   +        Q+ S     I   +D + KE++   E L  A+ +++ 
Sbjct: 269 SKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKL-KELQHSLEKLDTAIDELEE 327

Query: 542 LEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQRE 601
           +  E  E  +    L  +I T K+ + + +    K      +L+  F +      + Q E
Sbjct: 328 IMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDE 387


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 35.7 bits (83), Expect = 0.053
 Identities = 28/156 (17%), Positives = 64/156 (41%), Gaps = 24/156 (15%)

Query: 409 AVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEA 468
             ++    Q E+ R+ ++ ED  ++      +     +    A          + +Y+  
Sbjct: 1   LEEEKSSLQAELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEA----------QQKYERE 50

Query: 469 LAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKL 528
           L     +H + ++E+        +  K++ +L  E  +A  E S      +S  ++ + L
Sbjct: 51  L----VKHAEDIEELQALRKQLNELKKEIAQLKAEAESAQAELS---EAEESWEEQKKML 103

Query: 529 SEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLK 564
            ++L++  ++I+ L  +NK L +       QIE L 
Sbjct: 104 EDELSELEKRIEELNEQNKLLHD-------QIELLS 132


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 37.9 bits (88), Expect = 0.059
 Identities = 54/254 (21%), Positives = 86/254 (33%), Gaps = 38/254 (14%)

Query: 308 KELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLD----------RELLELRS 357
           +ELE  R   +A  A    A Q     R   S L  +++RLD          REL   R 
Sbjct: 333 EELERARADAEALQAAAADARQAI---REAESRLEEERRRLDEEAGRLDDAERELRAARE 389

Query: 358 GCCTQPESDDAGSADSLN------QHYMSALNKYETIKDEYD---ALRKRFDDLINTHST 408
                 E      A +           +  L   +  + + D   A R     L+     
Sbjct: 390 QLARAAERAGLSPAHTAEPDAALAAQELQELGALDARRQDADRVIAQRSEQVALLRRRDD 449

Query: 409 AVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQC-------TAAIRQWDIALRE 461
             D+   A+   A  + + +++++E   A   R  L  +         A +  W   LRE
Sbjct: 450 VADR---AEATHAAARARRDELDEEAEQA-AARAELADEAVHREGARLAWVDAWQAQLRE 505

Query: 462 -----RNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMS 516
                 ++   ALA +         E    +AV          LT ER A      ++  
Sbjct: 506 LTILAVDDQPGALADLDSWDALLQGEAPVRVAVYSAVQPLADELTRERAALRLAEEVLEE 565

Query: 517 ERDSVHKEMEKLSE 530
           ERD++  E E+L +
Sbjct: 566 ERDALRTERERLEQ 579



 Score = 34.0 bits (78), Expect = 0.83
 Identities = 35/237 (14%), Positives = 66/237 (27%), Gaps = 31/237 (13%)

Query: 397 KRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWD 456
            ++D L      A D+LE A+EE   +  + E + +E +        L         Q  
Sbjct: 276 TQYDQLSRDLGRARDELETAREEERELDARTEALEREADALRTRLEAL---------QGS 326

Query: 457 IALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMS 516
            A ++  E + A A  +     A            +  ++ +RL EE             
Sbjct: 327 PAYQDAEELERARADAEALQAAAADARQAIREAESRLEEERRRLDEEAGRLDD------- 379

Query: 517 ERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDK 576
                     +L     Q  R  +   +      E    L+ Q       + +   D D+
Sbjct: 380 -------AERELRAAREQLARAAERAGLSPAHTAEPDAALAAQELQELGALDARRQDADR 432

Query: 577 ALKECNDLRERFGEFTAA----KEECQREAFHARFEYAMRMGERDSSRKERNEVEAA 629
            + +    R              +  +     AR        E + +       + A
Sbjct: 433 VIAQ----RSEQVALLRRRDDVADRAEATHAAARARRDELDEEAEQAAARAELADEA 485



 Score = 32.9 bits (75), Expect = 2.1
 Identities = 36/208 (17%), Positives = 67/208 (32%), Gaps = 14/208 (6%)

Query: 307 LKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQ-PES 365
             + +        A  +LE A +E   L ++   L  +   L   L  L+     Q  E 
Sbjct: 275 QTQYDQLSRDLGRARDELETAREEERELDARTEALEREADALRTRLEALQGSPAYQDAEE 334

Query: 366 DDAGSADSLNQHYMSALNKYETIKDE----YDALRKRFDDLINTHSTAVDKLELAQEEVA 421
            +   AD+      +A         E     +  R+R D+       A  +L  A+E++A
Sbjct: 335 LERARADAEALQ--AAAADARQAIREAESRLEEERRRLDEEAGRLDDAERELRAAREQLA 392

Query: 422 RIKKQC--EDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDA 479
           R  ++      +  +  A       + Q   A+             +     + ++ +D 
Sbjct: 393 RAAERAGLSPAHTAEPDAALAAQ--ELQELGALDARRQDADRVIAQRSEQVALLRRRDDV 450

Query: 480 VKEINTAMAVRMKATKDLKRLTEERNAA 507
                   A   +A +D   L EE   A
Sbjct: 451 ADRAEATHA-AARARRD--ELDEEAEQA 475


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 37.6 bits (87), Expect = 0.059
 Identities = 45/225 (20%), Positives = 87/225 (38%), Gaps = 35/225 (15%)

Query: 243 LSSAREGMALLSDSPDVLTITTLKPTSLGEHSPRVHRKS---------VNSSTQTMPLYV 293
           ++  R+G+  + +S D       K  +    S ++ ++          V S    M   +
Sbjct: 411 VTKERDGLRAILNSYD-------KELTETSVSGQLMKRLEEAEDLVQKVQSHLAKMENQL 463

Query: 294 RQHNETV----RRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLD 349
            +  E V     R +    E++  + Q  +    L    + +++LR K   L  ++ RL 
Sbjct: 464 SELEEDVGQQKDRNNTLETEIKLLKEQLVSNERLLSFVKEATNALRLKIETLERERDRLR 523

Query: 350 RELLELRSGCCTQPESDDAGSAD------SLN---QHYMSALNKYETIKDEYDALRKRFD 400
           +E   L           D  ++       SLN   +    A    E ++ E + L++R  
Sbjct: 524 QEKSLLEMKLEHLCLQGDYNASRTKVLHMSLNPASEAEQIAKQTIEALQAECEKLKERLQ 583

Query: 401 DLINTHSTAVDKLE-----LAQEEVARIKKQCEDMNQEKNTALRE 440
            L    S   D  +     ++ +E+A++KKQ E   + KN  L+E
Sbjct: 584 ALEEGKSQPGDLEKAVGSHISSKEIAQLKKQVESAEK-KNQRLKE 627



 Score = 31.8 bits (72), Expect = 3.7
 Identities = 50/282 (17%), Positives = 101/282 (35%), Gaps = 61/282 (21%)

Query: 380 SALNKYETIKDEYDALRKRFDDL-INTHSTAVDKLELAQEEVARIKK-----QCEDMNQE 433
            A N+    ++E   L+ + D   +   +   D+L+ A+E ++RIK      QC   N +
Sbjct: 125 KAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNAD 184

Query: 434 KNTAL--RERNGLKQQCTAAIRQWDIALRERNEYQEALA--------------KVQQQHE 477
               L   E   L++Q     ++   A ++        A              ++++  +
Sbjct: 185 TELKLLESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQ 244

Query: 478 D-AVKEINTAMAVRMKATKDLKRLTEERNA---AMQEYSLIMSE---------------R 518
           D  V +      +++   +       E N    +M+E + ++ E               R
Sbjct: 245 DAEVVKSMKEQLLQIPELERELAALREENRKLRSMKEDNELLKEELEDLQSRLERFEKMR 304

Query: 519 DSVHK---EMEKLSEDLTQ-----------------AVRKIKTLEMENKELVEDKKTLSY 558
           + +     E EKL  +L                     R+I  L+ E  +L E   ++S 
Sbjct: 305 EKLADLELEKEKLENELKSWKSLLQDIGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISS 364

Query: 559 QIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQR 600
             + L+  +     +R KA+ E  +L+++     A     QR
Sbjct: 365 SAKQLETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQR 406



 Score = 31.8 bits (72), Expect = 3.7
 Identities = 41/240 (17%), Positives = 89/240 (37%), Gaps = 18/240 (7%)

Query: 324 LEAAAQESSSLRSKYSDLISDKQRLDRELLE----LRSGCCTQPESDDAGSADS-LNQHY 378
           L++  +    +R K +DL  +K++L+ EL      L+         DD       L    
Sbjct: 293 LQSRLERFEKMREKLADLELEKEKLENELKSWKSLLQDIGLNLRTPDDLSRRIVVLQNEE 352

Query: 379 MSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTAL 438
           +    K  +I      L      L      AV ++   ++++  +K     + +      
Sbjct: 353 LQLKEKNGSISSSAKQLETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVT 412

Query: 439 RERNGLKQQCTA-----AIRQWDIALRERNE--------YQEALAKVQQQHEDAVKEINT 485
           +ER+GL+    +             L +R E         Q  LAK++ Q  +  +++  
Sbjct: 413 KERDGLRAILNSYDKELTETSVSGQLMKRLEEAEDLVQKVQSHLAKMENQLSELEEDVGQ 472

Query: 486 AMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEME 545
                     ++K L E+  +  +  S +    +++  ++E L  +  +  ++   LEM+
Sbjct: 473 QKDRNNTLETEIKLLKEQLVSNERLLSFVKEATNALRLKIETLERERDRLRQEKSLLEMK 532


>gnl|CDD|238026 cd00071, GMPK, Guanosine monophosphate kinase (GMPK, EC 2.7.4.8),
            also known as guanylate kinase (GKase), catalyzes the
            reversible phosphoryl transfer from adenosine
            triphosphate (ATP) to guanosine monophosphate (GMP) to
            yield adenosine diphosphate (ADP) and guanosine
            diphosphate (GDP). It plays an essential role in the
            biosynthesis of guanosine triphosphate (GTP). This enzyme
            is also important for the activation of some antiviral
            and anticancer agents, such as acyclovir, ganciclovir,
            carbovir, and thiopurines.
          Length = 137

 Score = 35.6 bits (83), Expect = 0.060
 Identities = 26/121 (21%), Positives = 53/121 (43%), Gaps = 24/121 (19%)

Query: 1476 PVILVGALS----DTVTDKLLQDFPDKFV-------RCAP---EIM----HC-PQAAMEK 1516
             ++L G  S     T+  +LL++F   F        R  P   E+     H   +   E+
Sbjct: 1    LIVLSGP-SGVGKSTLLKRLLEEFDPNFGFSVSHTTR-KPRPGEVDGVDYHFVSKEEFER 58

Query: 1517 GLADNLFVDYRKKGAYFECTSVAAVKDACDKNNAHVILDVSLAGVERLHRQH--VYPIVL 1574
             + +  F+++ +    +  TS AAV++A  +    VIL++ + G  ++ + +     I +
Sbjct: 59   LIENGEFLEWAEFHGNYYGTSKAAVEEALAEGK-IVILEIDVQGARQVKKSYPDAVSIFI 117

Query: 1575 L 1575
            L
Sbjct: 118  L 118


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 36.9 bits (86), Expect = 0.097
 Identities = 47/226 (20%), Positives = 93/226 (41%), Gaps = 37/226 (16%)

Query: 390 DEYDALRKRFDDLIN--THSTAVDKLEL----AQEEVARIKKQCEDMNQEKNTALRERNG 443
                L   F+DLI+        D LE     A+EE+ ++ K+  ++  E+         
Sbjct: 136 PVVQELVSIFNDLIDSIKEDNLKDDLESLIASAKEELDQLSKKLAELKAEE--EEELERA 193

Query: 444 LKQQCTAAIRQWDIALRERNEYQEA--LAKVQQQHEDAVKEINTAMAVRMKATKDLKRLT 501
           LK++    + + +  L  R E +EA    +++ + E   +E+      +++   +L+R  
Sbjct: 194 LKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQ--ELERQA 251

Query: 502 EE-----RNA-AMQEYSL-----------IMSERDSVHKEMEKLS---EDLTQAVRKIKT 541
           E      +N  A+Q   L           +  ER+    ++ +L+   + L +A+     
Sbjct: 252 EAHEQKLKNELALQAIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKGLEKALDSRSE 311

Query: 542 LEMENKELVEDKKTLSYQIETLKREIASALHDRDKAL-KECNDLRE 586
            E EN ++    + L   +E LK  + S      + L KE + L+E
Sbjct: 312 AEDENHKV----QQLWLAVEALKSALKSGSAGSPRPLVKELDALKE 353


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 37.1 bits (87), Expect = 0.10
 Identities = 17/100 (17%), Positives = 45/100 (45%), Gaps = 10/100 (10%)

Query: 384 KYETIKDEYDALRKRFDDLINT-HSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERN 442
             E ++++ + L++  D L+      A   ++ A++E   I K+   + +    +++   
Sbjct: 549 LKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKA-- 606

Query: 443 GLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKE 482
               +   A ++    L + NE +E   K Q++ ++ +K 
Sbjct: 607 ---HELIEARKR----LNKANEKKEKKKKKQKEKQEELKV 639


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 36.8 bits (85), Expect = 0.10
 Identities = 24/122 (19%), Positives = 52/122 (42%), Gaps = 15/122 (12%)

Query: 453 RQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYS 512
            +   A +ER+ Y++ L+K++ Q+ +     N           +L  L +E    + E  
Sbjct: 36  SELRDAEKERDTYKQYLSKLESQNVE---ISNYE-----ALDSELDELKKEEERLLDELE 87

Query: 513 LIMSERDSVHKEM-------EKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKR 565
            +  E D +  E+       E+L  +  Q +R+    +  N +L ++ ++L  Q E    
Sbjct: 88  ELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNLQSLELQYEYSLN 147

Query: 566 EI 567
           ++
Sbjct: 148 QL 149


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 36.9 bits (85), Expect = 0.10
 Identities = 56/396 (14%), Positives = 124/396 (31%), Gaps = 54/396 (13%)

Query: 314 RGQHQAAMAQLEAAAQESSSLR----SKYSDLISDKQRLDRELLELRSGCCTQPESDD-- 367
             + ++   + E     S  L              KQ L++EL  LR       +S    
Sbjct: 186 FAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYL 245

Query: 368 ------AGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVA 421
                         Q       + E ++ +   L +  + +    +     L    + V 
Sbjct: 246 TQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERI--NRARKAAPLAAHIKAVT 303

Query: 422 RIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVK 481
           +I++Q + ++ E  + +R R  L  +  A ++Q         E +  L  +  Q      
Sbjct: 304 QIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQ----QSSIEEQRRLLQTLHSQEIHIRD 359

Query: 482 EINTAMAVRMKATKD------LKRLTEERNAAMQEYSLIMSERDSVHKEMEK-LSEDLTQ 534
               A ++R  + +       +  L +++    Q+   +  E D + +E     +     
Sbjct: 360 AHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAF 419

Query: 535 AVRKIK--------TLEMENKELVEDKKTLSYQIETLK----REIASALHDRDKALKECN 582
              + +         L+    EL     T + Q E L+    +E A +L +R++ L+   
Sbjct: 420 RDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKE 479

Query: 583 DLRERFGEFTAAKEE------------CQREAFHARFEYAMRMGERDSSRKERNEVEAAS 630
            +  +     A                C            +      + R +R E   A 
Sbjct: 480 QIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQ 539

Query: 631 M-----DLFGKCQKERMDNLEANQEIDRLRKHTDKL 661
           +     D++ +   ER       +++  +++    L
Sbjct: 540 LETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSIL 575



 Score = 35.7 bits (82), Expect = 0.24
 Identities = 40/287 (13%), Positives = 85/287 (29%), Gaps = 24/287 (8%)

Query: 412 KLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAK 471
           +++  ++  A+++   ED+  +  +  ++R  LK+Q     + + I  +  N  +E +  
Sbjct: 529 RMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPN 588

Query: 472 VQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSED 531
           +Q                      +   L  +         + +  +    +E+      
Sbjct: 589 LQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQ-CSQELALKLTA 647

Query: 532 LTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEF 591
           L              +E V +       +             R  AL++    +E+   +
Sbjct: 648 LHA------LQLTLTQERVREHALSIRVLPKELLAS------RQLALQKMQSEKEQLTYW 695

Query: 592 TAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEI 651
                +CQ         +           +E NE+E AS  L              NQ +
Sbjct: 696 KEMLAQCQ-TLLRELETHIEE------YDREFNEIENASSSLGSDLAAR---EDALNQSL 745

Query: 652 DRLRKHTD-KLQAELQDQSVNYLFILTESIQEAEVSKRRRDWAFSER 697
             L       L+A  +    N   +       AE+S    +  F  R
Sbjct: 746 KELMHQARTVLKARTEAHFNNNEEVTAALQTGAELSHLAAEIQFFNR 792


>gnl|CDD|185272 PRK15374, PRK15374, pathogenicity island 1 effector protein SipB;
           Provisional.
          Length = 593

 Score = 36.5 bits (84), Expect = 0.12
 Identities = 46/203 (22%), Positives = 80/203 (39%), Gaps = 33/203 (16%)

Query: 369 GSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCE 428
           G A      Y +++ K +T K  YDA  K+    +      +  L+ A    A+ +   E
Sbjct: 134 GEAQEATDLYEASIKKTDTAKSVYDAAEKK----LTQAQNKLQSLDPADPGYAQAEAAVE 189

Query: 429 DMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMA 488
              Q    A   +  L +   A ++    A  +  +    L K Q           TA A
Sbjct: 190 ---QAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQ----------GTANA 236

Query: 489 VRMKATKDLKRLTEERN-AAMQEYSLIMSER-DSVHKEMEK-LSEDLT--QAVRKIKTLE 543
               A+++     E+ N + +   +++M+   + V K  E+ L  DL    A+++ +  E
Sbjct: 237 ----ASQNQVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAE 292

Query: 544 MENKELVEDKKTLSYQIETLKRE 566
           ME       KK+  +Q ET K E
Sbjct: 293 ME-------KKSAEFQEETRKAE 308



 Score = 31.1 bits (70), Expect = 6.5
 Identities = 17/116 (14%), Positives = 40/116 (34%), Gaps = 4/116 (3%)

Query: 473 QQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDL 532
           ++      KE  TA+    +AT   +   ++ + A   Y     +      +++ L    
Sbjct: 119 KEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAAEKKLTQAQNKLQSLDPAD 178

Query: 533 TQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERF 588
               +    +E   KE  E K+     ++        A  D     ++ +++  +F
Sbjct: 179 PGYAQAEAAVEQAGKEATEAKE----ALDKATDATVKAGTDAKAKAEKADNILTKF 230


>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
           Nexins.  BAR domains are dimerization, lipid binding and
           curvature sensing modules found in many different
           proteins with diverse functions. Sorting nexins (SNXs)
           are Phox homology (PX) domain containing proteins that
           are involved in regulating membrane traffic and protein
           sorting in the endosomal system. SNXs differ from each
           other in their lipid-binding specificity, subcellular
           localization and specific function in the endocytic
           pathway. A subset of SNXs also contain BAR domains. The
           PX-BAR structural unit determines the specific membrane
           targeting of SNXs. BAR domains form dimers that bind to
           membranes, induce membrane bending and curvature, and
           may also be involved in protein-protein interactions.
          Length = 218

 Score = 35.8 bits (83), Expect = 0.13
 Identities = 32/217 (14%), Positives = 67/217 (30%), Gaps = 27/217 (12%)

Query: 384 KYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVAR----IKKQCEDMNQEKNTALR 439
           ++E  KD    L ++   L    S    +L   + E+        K    + + +     
Sbjct: 5   EFEEAKDYILKLEEQLKKL----SKQAQRLVKRRRELGSALGEFGKALIKLAKCEEE--- 57

Query: 440 ERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKR 499
               L +  +   +  +  L      +    +   +  + +KE         +   D   
Sbjct: 58  VGGELGEALSKLGKAAE-ELSSL--SEAQANQELVKLLEPLKEYLRYCQAVKETLDD--- 111

Query: 500 LTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQ 559
               R  A+     +  +  S   ++EKL         K++ LE E +E     +    +
Sbjct: 112 ----RADALLTLQSLKKDLASKKAQLEKLKAAPGIKPAKVEELEEELEEAESALEEARKR 167

Query: 560 IETLKREIASAL--HDRDKALKECNDLRERFGEFTAA 594
            E +   +   L     ++A     DL+    EF   
Sbjct: 168 YEEISERLKEELKRFHEERA----RDLKAALKEFARL 200



 Score = 31.2 bits (71), Expect = 4.1
 Identities = 9/58 (15%), Positives = 23/58 (39%)

Query: 376 QHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE 433
                AL   +++K +  + + + + L         K+E  +EE+   +   E+  + 
Sbjct: 110 DDRADALLTLQSLKKDLASKKAQLEKLKAAPGIKPAKVEELEEELEEAESALEEARKR 167


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
            functions are known are involvedin recombination,
            recombinational repair, and/or non-homologous end
            joining.They are components of an exonuclease complex
            with MRE11 homologs. This family is distantly related to
            the SbcC family of bacterial proteins.This family is
            based on the phylogenomic analysis of JA Eisen (1999,
            Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 36.9 bits (85), Expect = 0.13
 Identities = 59/370 (15%), Positives = 145/370 (39%), Gaps = 38/370 (10%)

Query: 351  ELLELRSGCCT------QPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLIN 404
            +L +    CC       Q E++       L      A +K ++ + E     KR D+++ 
Sbjct: 671  QLTDENQSCCPVCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLG 730

Query: 405  THSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNE 464
                    ++L ++E+  ++ + + +N++     R +N +                   E
Sbjct: 731  LAPGRQSIIDLKEKEIPELRNKLQKVNRDIQ---RLKNDI-------------------E 768

Query: 465  YQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKE 524
             QE L       E++ K   T + +  +   +LK +  ER  A Q   L  S+ D   ++
Sbjct: 769  EQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDV--ERKIAQQAAKLQGSDLDRTVQQ 826

Query: 525  MEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDL 584
            + +  ++    +  + +    N++L++D++     +++   E+ S        L+     
Sbjct: 827  VNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQF 886

Query: 585  RERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDN 644
             E+  E +   +   RE   A+ E    +       ++  E   +S +   K  ++++++
Sbjct: 887  EEQLVELSTEVQSLIREIKDAK-EQDSPLETFLEKDQQEKEELISSKETSNKKAQDKVND 945

Query: 645  LEANQEIDRLRKHTDKLQAELQDQSVNYLF-----ILTESIQEAEVSKRRRDWAFSERDK 699
            ++  +++  +  +   ++ ++QD   +YL      + T + Q  E  K +       R  
Sbjct: 946  IK--EKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLM 1003

Query: 700  IEVIDSYHTK 709
             + ID+   +
Sbjct: 1004 RQDIDTQKIQ 1013



 Score = 31.2 bits (70), Expect = 6.5
 Identities = 73/393 (18%), Positives = 149/393 (37%), Gaps = 47/393 (11%)

Query: 325  EAAAQESSSLRSKYSDLISDKQRLDRELLELRS-GCCTQPESDDAGSADSLNQHYMSALN 383
            +AA  + S L      +  +KQ    EL  + S     +    D        QH  S  N
Sbjct: 811  QAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQ---QEQIQHLKSKTN 867

Query: 384  KYETIK---DEYDALRKRFDDLINTHSTAVDKL----ELAQEEVARIKKQCEDMNQEKNT 436
            + ++ K         R++F++ +   ST V  L    + A+E+ + ++   E   QEK  
Sbjct: 868  ELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEE 927

Query: 437  ALRERNGLKQQCTAAIRQWDIALRERNEYQEAL-AKVQQQHEDAVK----EINTAMAVRM 491
             +  +    ++    +      ++  + Y + +  K+Q   +D +K    E+NT  A   
Sbjct: 928  LISSKETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLE 987

Query: 492  KATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVE 551
            +  K  +++ E+     Q+        D+   +   L ++LT   R+ +  E+E +    
Sbjct: 988  ECEKHQEKINEDMRLMRQDI-------DTQKIQERWLQDNLTLRKRENELKEVEEELKQH 1040

Query: 552  DKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYA- 610
             K+    Q+  +K+E      + D   +       R   +    +  ++E    +F  A 
Sbjct: 1041 LKEMGQMQVLQMKQEHQKLEENIDLIKRNHVLALGRQKGYEKEIKHFKKELREPQFRDAE 1100

Query: 611  ---------MRMGERDSS--RKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTD 659
                     MR  E  +         ++ A M    K    +M+  E N+ I  L + T 
Sbjct: 1101 EKYREMMIVMRTTELVNKDLDIYYKTLDQAIM----KFHSMKME--EINKIIRDLWRST- 1153

Query: 660  KLQAELQDQSVNYLFILTESIQEAEVSKRRRDW 692
                  + Q + Y+ I +++ +    S +RR++
Sbjct: 1154 -----YRGQDIEYIEIRSDADENVSASDKRRNY 1181


>gnl|CDD|225616 COG3074, COG3074, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 79

 Score = 33.4 bits (76), Expect = 0.13
 Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 1/56 (1%)

Query: 525 MEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKE 580
            EKL   + QA+  I  L+ME +EL E   +LS +++  + +   AL   ++ LKE
Sbjct: 6   FEKLEAKVQQAIDTITLLQMEIEELKEKNNSLSQEVQNAQHQR-EALERENEQLKE 60


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 36.7 bits (85), Expect = 0.14
 Identities = 50/306 (16%), Positives = 94/306 (30%), Gaps = 35/306 (11%)

Query: 394 ALRKRFDDLINTHSTAVDKLELAQEEVARIKKQC--EDMNQEKNTALRERNGLKQQCTAA 451
           A        I + S AV + + A  ++     +    D   E    L  ++ ++Q     
Sbjct: 503 AQIVFLSAEIKSSSRAVREEKAAVTDIPEELARLLITDELPELAVDLLVQSRIRQHWQQL 562

Query: 452 IRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVR--------------MKATKDL 497
            +  D         +   A  +    +   E   A+                 +   KDL
Sbjct: 563 RKALDQLEAAYEALEGRFAAAEAAMAEWQSEWEEALDELGLSRELSPEQQLDILSTMKDL 622

Query: 498 KRLTEERNAAMQEYSLIMSERDSVHKEMEKLSE----------DLTQAVRKIKTLEMENK 547
           K+L +++     + + +  E+ +  + +E L             L    R     E++ +
Sbjct: 623 KKLMQKKAELTHQVARLREEQAAFEERVEGLLAVLEAQFIDLSTLFCVQRLRVAAELQKE 682

Query: 548 E---LVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFH 604
           E    +E     + ++    R            L +C           AA+EE Q     
Sbjct: 683 EARLALEGNIERTKELNDELRAELELHRKEILDLFDCGTADTEDAFREAAREEQQLTQRE 742

Query: 605 ARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAE 664
           +R E      E  +      E    S  L  +  KE    L     ID L +  ++L A+
Sbjct: 743 SRLESLEAQLEGVA-----AEAYELSASLDQRELKEEELALLEEA-IDALDEEVEELHAQ 796

Query: 665 LQDQSV 670
           +   S 
Sbjct: 797 VAALSR 802



 Score = 34.0 bits (78), Expect = 0.95
 Identities = 42/321 (13%), Positives = 97/321 (30%), Gaps = 40/321 (12%)

Query: 316 QHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLN 375
           Q    + Q      E+    + Y  L+  ++     L ELR          D     +L 
Sbjct: 185 QLLEKLKQERNEIDEAEKEYATYHKLLESRRAEHARLAELR-----SELRADRDHIRALR 239

Query: 376 QHY--MSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE 433
                   L +++ ++ E   L +R ++L       V +LE  +  + + + + + +   
Sbjct: 240 DAVELWPRLQEWKQLEQE---LTRRREELATFPRDGVLRLEKREAHLQKTEAEIDALLVR 296

Query: 434 KNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKA 493
                   + L        ++  +    R   Q +  K              A       
Sbjct: 297 LAELKDLASQLIP-----AKEAVLQALVRLHQQLSEIKASAFELTETLAGIEADL----- 346

Query: 494 TKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDK 553
                R  EE            +ER    + +E +    +   R++K  E    + +++K
Sbjct: 347 -----RDKEEAAGNG-----FEAERVHDLRSLECMLRYQSSQ-RELKQTEAAYCKRLDEK 395

Query: 554 KTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFT------AAKEECQREAFHARF 607
           +    + E   R+    L D ++ ++  ++ RE                        A  
Sbjct: 396 RLFEDEAEEEARQ---RLADDEEEVRAGDEAREEKIAANSQVIDKEEVCNLYDRRDTAWQ 452

Query: 608 EYAMRMGERDSSRKERNEVEA 628
           +      ++ +  +++ E   
Sbjct: 453 KQRFLREKQTAFERQKTEHTK 473



 Score = 32.5 bits (74), Expect = 2.7
 Identities = 44/295 (14%), Positives = 99/295 (33%), Gaps = 27/295 (9%)

Query: 322 AQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSA 381
           A+++++++     ++  +D+  +  RL                + D      + QH+   
Sbjct: 510 AEIKSSSRAVREEKAAVTDIPEELARLLITDE-------LPELAVDLLVQSRIRQHWQQL 562

Query: 382 LNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRER 441
               + ++  Y+AL  RF       +    + E A +E+  + ++     Q    +  + 
Sbjct: 563 RKALDQLEAAYEALEGRFAAAEAAMAEWQSEWEEALDELG-LSRELSPEQQLDILSTMKD 621

Query: 442 NGLKQQCTAAI-RQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRL 500
                Q  A +  Q      E+  ++E +  +    E    +++T   V  +     +  
Sbjct: 622 LKKLMQKKAELTHQVARLREEQAAFEERVEGLLAVLEAQFIDLSTLFCV-QRLRVAAELQ 680

Query: 501 TEERNAAMQEYSLIMSE--------RDSVHKEMEKL--------SEDLTQAVRKIKTLEM 544
            EE   A++       E         +   KE+  L         +   +A R+ + L  
Sbjct: 681 KEEARLALEGNIERTKELNDELRAELELHRKEILDLFDCGTADTEDAFREAAREEQQLTQ 740

Query: 545 ENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQ 599
               L   +  L   +     E++++L  R+   +E   L E         EE  
Sbjct: 741 RESRLESLEAQLE-GVAAEAYELSASLDQRELKEEELALLEEAIDALDEEVEELH 794


>gnl|CDD|139585 PRK13460, PRK13460, F0F1 ATP synthase subunit B; Provisional.
          Length = 173

 Score = 35.0 bits (80), Expect = 0.14
 Identities = 31/126 (24%), Positives = 53/126 (42%), Gaps = 11/126 (8%)

Query: 455 WDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLI 514
           WD+ L+  +E    +            +IN A  +R++A   LK      N+A  E + I
Sbjct: 38  WDVILKALDERASGVQN----------DINKASELRLEAEALLKDYEARLNSAKDEANAI 87

Query: 515 MSE-RDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHD 573
           ++E +    K   KL E+    V+  K   ++  EL + K     Q + ++  I  A   
Sbjct: 88  VAEAKSDALKLKNKLLEETNNEVKAQKDQAVKEIELAKGKALSQLQNQIVEMTITIASKV 147

Query: 574 RDKALK 579
            +K LK
Sbjct: 148 LEKQLK 153


>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP).  This family consists of
           several PV-1 (PLVAP) proteins which seem to be specific
           to mammals. PV-1 is a novel protein component of the
           endothelial fenestral and stomatal diaphragms. The
           function of this family is unknown.
          Length = 442

 Score = 36.2 bits (83), Expect = 0.15
 Identities = 56/265 (21%), Positives = 109/265 (41%), Gaps = 20/265 (7%)

Query: 375 NQHYMSALNKYET-IKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE 433
           N  +++A+   E   +D++  + K  D L+      V  LE+   E+A+ K  C   +++
Sbjct: 129 NNRFIAAIILSEKQCQDQFKDMNKSCDALLFKLGEKVKTLEM---EIAKEKAIC---SKD 182

Query: 434 KNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKA 493
           K + L  +   ++Q  A  +  ++  +E+   +E L KVQ       K+   A  +    
Sbjct: 183 KESLLAGKRQAEEQLEACGKARELQHQEQQLAEENLQKVQALCIPLDKDKFEADLLNAWR 242

Query: 494 TKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKI-KTLEMENKELVED 552
              + R  +  N     +  +  E  S+ +  + L   ++  + ++ + L  + + +  +
Sbjct: 243 DSIIPRSLD--NLGYNLHHPLGPELASIRRACDHLPGIMSSKIEELARGLRADIERVARE 300

Query: 553 KKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMR 612
              L  Q    ++   +A   + KA KE    RE        + EC R+   A  E A  
Sbjct: 301 NADLQRQKLEAEQAAQAAQEAKAKAEKEAQ-ARE-----AKLQAECARQTQLALEEKAAL 354

Query: 613 MGERDSSRKE----RNEVEAASMDL 633
             ERD+  KE    + E E   M+L
Sbjct: 355 RAERDNLAKELEAKKREAEQLRMEL 379


>gnl|CDD|223343 COG0265, DegQ, Trypsin-like serine proteases, typically periplasmic,
            contain C-terminal PDZ domain [Posttranslational
            modification, protein turnover, chaperones].
          Length = 347

 Score = 36.0 bits (83), Expect = 0.16
 Identities = 17/53 (32%), Positives = 25/53 (47%)

Query: 1261 GIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELAKPADKVTV 1313
            G  V  V  GS    AG++ GD I   NG  + + +   AA    +P D+V +
Sbjct: 271  GAVVLGVLPGSPAAKAGIKAGDIITAVNGKPVASLSDLVAAVASNRPGDEVAL 323



 Score = 34.1 bits (78), Expect = 0.63
 Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 1/60 (1%)

Query: 200 DHGLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDV 259
             GL + +G  +  + PGS AAK G +  GD + ++N + V  LS     +A      +V
Sbjct: 263 ALGLPVAAGAVVLGVLPGSPAAKAG-IKAGDIITAVNGKPVASLSDLVAAVASNRPGDEV 321



 Score = 34.1 bits (78), Expect = 0.63
 Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 1/60 (1%)

Query: 824 DHGLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDV 883
             GL + +G  +  + PGS AAK G +  GD + ++N + V  LS     +A      +V
Sbjct: 263 ALGLPVAAGAVVLGVLPGSPAAKAG-IKAGDIITAVNGKPVASLSDLVAAVASNRPGDEV 321


>gnl|CDD|226127 COG3599, DivIVA, Cell division initiation protein [Cell division
           and chromosome partitioning].
          Length = 212

 Score = 35.1 bits (81), Expect = 0.17
 Identities = 25/190 (13%), Positives = 59/190 (31%), Gaps = 22/190 (11%)

Query: 388 IKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQE-KNTALRERNGLKQ 446
            ++E D      DD+I       D  E   +E   ++ + +++ +E K  A  E +   Q
Sbjct: 24  DEEEVDEF---LDDVI-------DDYEQLLDENEDLEDEIDELKEELKEAADAEDSQAIQ 73

Query: 447 QCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNA 506
           Q      +     +      + + K        V       A R      + +  E+   
Sbjct: 74  QAETEAEE---LKQAAEAEADDILKRASAQAQRVFGKAQYKADRYLEDAKIAQELEDLKR 130

Query: 507 AMQEY-----SLIMSERDS-VHKEMEKLSEDLTQAVRKI--KTLEMENKELVEDKKTLSY 558
             Q       S I ++  S   ++ +++       V     +   + +    +  +    
Sbjct: 131 QAQVERQRLRSDIEAQLASAKQEDWDEILRSTVDEVEAANEEAERLADAAQADADRLRDE 190

Query: 559 QIETLKREIA 568
               +  ++A
Sbjct: 191 CDIYVDTKLA 200


>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana and other plants. This region is
           often found associated with Agenet domains and may
           contain coiled-coil.
          Length = 190

 Score = 35.1 bits (81), Expect = 0.17
 Identities = 18/80 (22%), Positives = 31/80 (38%)

Query: 509 QEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIA 568
            + +  + ER  + KE+ +      +   +I  LE +  EL      L  + E   +EIA
Sbjct: 103 DDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEIA 162

Query: 569 SALHDRDKALKECNDLRERF 588
               +  K  +E  D    F
Sbjct: 163 RLKSEASKIKQELVDAELEF 182



 Score = 30.9 bits (70), Expect = 3.8
 Identities = 34/150 (22%), Positives = 57/150 (38%), Gaps = 34/150 (22%)

Query: 523 KEMEKLSEDLTQAV--RKIKTLEMENKELVE---DKKTLSYQIETLKREIASALHDRDKA 577
           +E++KL  D + +      K       EL +   + + L  ++  L     S   D+ K 
Sbjct: 57  EEVKKLQIDDSISEFESLSKCF----SELEKHGFNVQALQSRLNKL----LSLKDDQTKK 108

Query: 578 LKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKC 637
           L+E    R+   +  A KE  ++E      E   ++ E      +R            K 
Sbjct: 109 LEE----RKGLEKEIAEKEISRQELDSEIAELERKILE-----LQRQAALL-------KE 152

Query: 638 QKERMDNLEANQEIDRLRKHTDKLQAELQD 667
           +KE  D      EI RL+    K++ EL D
Sbjct: 153 KKEAEDK-----EIARLKSEASKIKQELVD 177


>gnl|CDD|219499 pfam07653, SH3_2, Variant SH3 domain.  SH3 (Src homology 3) domains
            are often indicative of a protein involved in signal
            transduction related to cytoskeletal organisation. First
            described in the Src cytoplasmic tyrosine kinase. The
            structure is a partly opened beta barrel.
          Length = 53

 Score = 32.1 bits (74), Expect = 0.18
 Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 12/65 (18%)

Query: 1334 YMRALFSRTGDLGDALQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGIIPSK 1393
            Y R +F       D  +L   K D++ V     +     W  W  +  G R   G++PS 
Sbjct: 1    YGRVIFDYVAT--DPNELSLKKGDVVKV----LDKDDNGW--WEGERGGRR---GLVPSS 49

Query: 1394 YKVEE 1398
            Y VEE
Sbjct: 50   Y-VEE 53


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 36.1 bits (83), Expect = 0.19
 Identities = 54/337 (16%), Positives = 135/337 (40%), Gaps = 39/337 (11%)

Query: 372 DSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMN 431
           D   + Y S L   +  +     L+  F+  ++  +T +  L+   + +    ++   ++
Sbjct: 235 DYFTESYKSFLKLEDNYEPSEQELKLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMKIS 294

Query: 432 QEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRM 491
           Q+  T   +   LK          +   ++  E+   L K++ + E   +EI       +
Sbjct: 295 QKIKTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKA-----L 349

Query: 492 KATKD-LKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELV 550
           ++  D L +   ++  + +++ L+  ER+ + +E++K++    +  + +K+ ++E + + 
Sbjct: 350 QSNIDELHKQLRKQGISTEQFELMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIF 409

Query: 551 EDKKTLSYQIETLKREI----ASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHAR 606
           +  +    Q ++L + I    +   H+ + +  + N   E+     +   E  +     +
Sbjct: 410 KSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLKINI--EQLFPKGSGINESIK-----K 462

Query: 607 FEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEA--NQEIDRLRKHTDKLQAE 664
               +     D  ++     E  S+ L     +E + NL+   N+    L K   +L   
Sbjct: 463 SILEL----NDEIQERIKTEENKSITL-----EEDIKNLKHDINELTQILEKLELEL--- 510

Query: 665 LQDQSVNYLFILTESIQEAEVSKRRRDWAFSERDKIE 701
                 N  F L++   E E+  +R      E +K+E
Sbjct: 511 ---SEANSKFELSKEENERELVAQR-----IEIEKLE 539



 Score = 31.9 bits (72), Expect = 4.0
 Identities = 44/316 (13%), Positives = 108/316 (34%), Gaps = 20/316 (6%)

Query: 280 KSVNSSTQTMPLYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYS 339
           K+ N +         + ++ ++      + L+    +++  +  ++  +QE      K  
Sbjct: 277 KTQNDNLYEKIQEAMKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLK 336

Query: 340 DLISDKQRLDRELLELRSGCCTQPESDDAGSAD--SLNQHYMSALNKYETIKDEYDALRK 397
             I  K+   + L         Q       +     +NQ       + + I  + D L K
Sbjct: 337 SEIELKEEEIKALQSNIDELHKQLRKQGISTEQFELMNQEREKLTRELDKINIQSDKLTK 396

Query: 398 RFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWD- 456
                         KLE AQ     ++K     +       R R+ +      +  + + 
Sbjct: 397 SVKS---------RKLE-AQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLKINI 446

Query: 457 -----IALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEY 511
                         ++++ ++  + ++ +K          +  K+LK    E    +++ 
Sbjct: 447 EQLFPKGSGINESIKKSILELNDEIQERIKTEENKSITLEEDIKNLKHDINELTQILEKL 506

Query: 512 SLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASAL 571
            L +SE +S  +  ++ +E    A ++I+  ++E  EL +        I   ++ + S  
Sbjct: 507 ELELSEANSKFELSKEENERELVA-QRIEIEKLEK-ELNDLNLLSKTSILDAEQLVQSTE 564

Query: 572 HDRDKALKECNDLRER 587
              D+   + N  R +
Sbjct: 565 IKLDELKVDLNRKRYK 580


>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
           protein.  Members of this protein family are homologs of
           the HlyD membrane fusion protein of type I secretion
           systems. Their occurrence in prokaryotic genomes is
           associated with the occurrence of a novel class of
           microcin (small bacteriocins) with a leader peptide
           region related to nitrile hydratase. We designate the
           class of bacteriocin as Nitrile Hydratase Leader
           Microcin, or NHLM. This family, therefore, is designated
           as NHLM bacteriocin system secretion protein. Some but
           not all NHLM-class putative microcins belong to the TOMM
           (thiazole/oxazole modified microcin) class as assessed
           by the presence of the scaffolding protein and/or
           cyclodehydratase in the same gene clusters [Transport
           and binding proteins, Amino acids, peptides and amines,
           Cellular processes, Biosynthesis of natural products].
          Length = 421

 Score = 36.0 bits (83), Expect = 0.20
 Identities = 37/161 (22%), Positives = 66/161 (40%), Gaps = 15/161 (9%)

Query: 395 LRKRFDDLINTHSTAVDKLELAQEEV----ARIKKQCEDMNQEKNTALRERNGLKQQCTA 450
           LR+R    +      + +L+   EEV     R+K+  E   Q+   AL E  G  ++  A
Sbjct: 94  LRER----LQESYQKLTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGRLREELA 149

Query: 451 AIRQWDIALR---ERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERN-- 505
           A+ +     R    R        ++ QQ     ++     A + +A   L+   +ERN  
Sbjct: 150 ALSREVGKQRGLLSRGLATFKRDRILQQQWR-EEQEKYDAADKARAIYALQTKADERNLE 208

Query: 506 AAMQEYSLIMSERDSV-HKEMEKLSEDLTQAVRKIKTLEME 545
             +Q  S    +   V  KE+E +   + +A  +I+ LE +
Sbjct: 209 TVLQSLSQADFQLAGVAEKELETVEARIKEARYEIEELENK 249


>gnl|CDD|223821 COG0750, COG0750, Predicted membrane-associated Zn-dependent
            proteases 1 [Cell envelope biogenesis, outer membrane].
          Length = 375

 Score = 35.6 bits (82), Expect = 0.20
 Identities = 21/76 (27%), Positives = 29/76 (38%), Gaps = 4/76 (5%)

Query: 1252 ISLVGGNAVGI--YVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELAKPAD 1309
            +  V G        V  V   S    AGLR GDRI+  +G  +  A+ ++    L   A 
Sbjct: 119  LFFVIGLVPVASPVVGEVAPKSAAALAGLRPGDRIVAVDGEKV--ASWDDVRRLLVAAAG 176

Query: 1310 KVTVLAQYSVERYNEV 1325
             V  L    V R +  
Sbjct: 177  DVFNLLTILVIRLDGE 192


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 34.1 bits (79), Expect = 0.22
 Identities = 20/98 (20%), Positives = 41/98 (41%), Gaps = 2/98 (2%)

Query: 390 DEYDALRKRFDDLINTHSTAVDKLELA-QEEVARIKKQCEDMNQEKNTALRERNGLKQQC 448
               A +K+ +         + K E   Q+E  +++KQ   +++E   A ++    KQQ 
Sbjct: 31  PAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQE 90

Query: 449 TAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTA 486
               +Q       + + QE L  +  + + A+KE+   
Sbjct: 91  LQQKQQ-AAQQELQQKQQELLQPIYDKIDKAIKEVAKE 127



 Score = 29.1 bits (66), Expect = 9.9
 Identities = 23/103 (22%), Positives = 50/103 (48%), Gaps = 5/103 (4%)

Query: 483 INTAMAVR-MKATKDL-KRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIK 540
           ++    +    A K   K+L +E      E      E     ++++K +  L++  RK K
Sbjct: 21  VDVQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAK 80

Query: 541 TLEMENK--ELVEDKKTLSYQIETLKREIASALHDR-DKALKE 580
             E++ K  EL + ++    +++  ++E+   ++D+ DKA+KE
Sbjct: 81  QQELQQKQQELQQKQQAAQQELQQKQQELLQPIYDKIDKAIKE 123


>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
           (DUF3340).  This presumed domain is found at the
           C-terminus of tail specific proteases. Its function is
           unknown. This family is found in bacteria and
           eukaryotes. This presumed domain is typically between 88
           to 187 amino acids in length.
          Length = 139

 Score = 33.7 bits (78), Expect = 0.26
 Identities = 19/99 (19%), Positives = 36/99 (36%), Gaps = 18/99 (18%)

Query: 463 NEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEE---RNAAMQE--YSLIMSE 517
            +    L +++Q+H+            R+    + + L E+         +   SL  +E
Sbjct: 31  GDISPFLPELRQKHKA-----------RIAKDPEFQYLNEDIAWLKEERDDKTVSLNEAE 79

Query: 518 RDSVHKEMEKLSEDLTQAVRKIKTLE--MENKELVEDKK 554
           R +  +E EK         RK    E      +L +DK+
Sbjct: 80  RKAEREEDEKKRLKRENERRKALGEEPLKSLDDLPKDKE 118


>gnl|CDD|212785 cd11851, SH3_RIM-BP, Src homology 3 domains of Rab3-interacting
            molecules (RIMs) binding proteins.  RIMs binding proteins
            (RBPs, RIM-BPs) associate with calcium channels present
            in photoreceptors, neurons, and hair cells; they interact
            simultaneously with specific calcium channel subunits,
            and active zone proteins, RIM1 and RIM2. RIMs are part of
            the matrix at the presynaptic active zone and are
            associated with synaptic vesicles through their
            interaction with the small GTPase Rab3. RIM-BPs play a
            role in regulating synaptic transmission by serving as
            adaptors and linking calcium channels with the synaptic
            vesicle release machinery. RIM-BPs contain three SH3
            domains and two to three fibronectin III repeats.
            Invertebrates contain one, while vertebrates contain at
            least two RIM-BPs, RIM-BP1 and RIM-BP2. RIM-BP1 is also
            called peripheral-type benzodiazapine receptor associated
            protein 1 (PRAX-1). Mammals contain a third protein,
            RIM-BP3. RIM-BP1 and RIM-BP2 are predominantly expressed
            in the brain where they display overlapping but distinct
            expression patterns, while RIM-BP3 is almost exclusively
            expressed in the testis and is essential in
            spermiogenesis. The SH3 domains of RIM-BPs bind to the
            PxxP motifs of RIM1, RIM2, and L-type (alpha1D) and
            N-type (alpha1B) calcium channel subunits. SH3 domains
            are protein interaction domains that bind to proline-rich
            ligands with moderate affinity and selectivity. They play
            versatile and diverse roles in the cell including the
            regulation of enzymes, changing the subcellular
            localization of signaling pathway components, and
            mediating the formation of multiprotein complex
            assemblies.
          Length = 62

 Score = 31.9 bits (73), Expect = 0.26
 Identities = 18/71 (25%), Positives = 26/71 (36%), Gaps = 24/71 (33%)

Query: 1335 MRALFS---RTGDLGDAL--QLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQ---- 1385
            M AL+     T    D    +L FH  D++ V   M             D DGF      
Sbjct: 2    MVALYDYNPETMSPNDDPEEELSFHAGDVVRVYGPM-------------DEDGFYYGELE 48

Query: 1386 --QCGIIPSKY 1394
              + G++PS +
Sbjct: 49   GGRKGLVPSNF 59


>gnl|CDD|226483 COG3975, COG3975, Predicted protease with the C-terminal PDZ domain
            [General function prediction only].
          Length = 558

 Score = 35.5 bits (82), Expect = 0.29
 Identities = 21/113 (18%), Positives = 35/113 (30%), Gaps = 4/113 (3%)

Query: 1243 ETRKCSNLGISLVGGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAA--TAEEA 1300
            + R+   LG+  V        +  V  G   Y AGL  GD+I+  NG   +       + 
Sbjct: 446  KPREAYYLGLK-VKSEGGHEKITFVFPGGPAYKAGLSPGDKIVAINGISDQLDRYKVNDK 504

Query: 1301 AYELAKPADK-VTVLAQYSVERYNEVKDKPGDSFYMRALFSRTGDLGDALQLR 1352
                     +    L +   +   +    P        L +    LG+    R
Sbjct: 505  IQVHVFREGRLREFLVKLGGDPTAQYIILPIGDRNPAQLANAAAWLGEIASAR 557


>gnl|CDD|153337 cd07653, F-BAR_CIP4-like, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Cdc42-Interacting Protein
           4 and similar proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. This subfamily is composed of
           Cdc42-Interacting Protein 4 (CIP4), Formin Binding
           Protein 17 (FBP17), FormiN Binding Protein 1-Like
           (FNBP1L), and similar proteins. CIP4 and FNBP1L are
           Cdc42 effectors that bind Wiskott-Aldrich syndrome
           protein (WASP) and function in endocytosis. CIP4 and
           FBP17 bind to the Fas ligand and may be implicated in
           the inflammatory response. CIP4 may also play a role in
           phagocytosis. Members of this subfamily typically
           contain an N-terminal F-BAR domain and a C-terminal SH3
           domain. In addition, some members such as FNBP1L contain
           a central Cdc42-binding HR1 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 251

 Score = 34.9 bits (81), Expect = 0.30
 Identities = 35/155 (22%), Positives = 72/155 (46%), Gaps = 9/155 (5%)

Query: 400 DDLINTHSTAVDKLELAQEEVAR-IKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIA 458
           +D+   H    + L      V + +K    ++ QE+   L E + L+Q+  ++I+Q + +
Sbjct: 75  NDIAGQHELIAENL---NSNVCKELKTLISELRQERKKHLSEGSKLQQKLESSIKQLEKS 131

Query: 459 LRE-RNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSE 517
            +     ++EA  K +Q++E A  ++N   A   KA  +    T+    A  EY+  + +
Sbjct: 132 KKAYEKAFKEAE-KAKQKYEKADADMNLTKADVEKAKANANLKTQAAEEAKNEYAAQLQK 190

Query: 518 RDSVHKEMEKLSEDLTQAVRKIKTLEME-NKELVE 551
            +   ++ +  S DL Q   K++ L+ +     VE
Sbjct: 191 FN--KEQRQHYSTDLPQIFDKLQELDEKRINRTVE 223


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 33.8 bits (78), Expect = 0.30
 Identities = 28/113 (24%), Positives = 53/113 (46%), Gaps = 7/113 (6%)

Query: 458 ALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSE 517
           A R + E Q  LA+ +Q+ E+A ++   A  +  +A K+ +++ EE  A  +E    + E
Sbjct: 49  AERLKEEAQALLAEYEQELEEAREQ---ASEIIEQAKKEAEQIAEEIKAEAEEELERIKE 105

Query: 518 RD--SVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIA 568
                +  E E+  E+L   V ++     E  +L+  K   + Q + +   IA
Sbjct: 106 AAEAEIEAEKERALEELRAEVAELAVAIAE--KLLGKKVDEAAQKDLIDAFIA 156


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 34.3 bits (79), Expect = 0.36
 Identities = 42/216 (19%), Positives = 85/216 (39%), Gaps = 28/216 (12%)

Query: 458 ALRERNEYQEALAKVQQQHEDAVKE----INTAMAVRMK---ATKDLKRLTEERNAAMQE 510
           A RE+ E +E + ++++    A KE      TA+ +  K     ++ + L ++ +   +E
Sbjct: 3   AEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEE 62

Query: 511 YSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASA 570
              +  E  +  +E E+L  ++ +A  ++  LE E ++   +          L++E+  A
Sbjct: 63  NRRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAET-------RQLQQELREA 115

Query: 571 LHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAAS 630
               ++A       R+   E  AA       A     E      +         E +   
Sbjct: 116 QEAHERA-------RQELLEAAAAPTAPPHVAAPVNGEQLEP-DDNGEEASADLETDPDM 167

Query: 631 MDLFGKCQKERMDNLEANQEIDRLRKHTDKLQAELQ 666
            D     ++ER+   E N+   RL+     L++EL 
Sbjct: 168 KDR---SEEERVTYAEKNE---RLQTQLQALKSELA 197



 Score = 32.8 bits (75), Expect = 1.4
 Identities = 42/203 (20%), Positives = 76/203 (37%), Gaps = 20/203 (9%)

Query: 381 ALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRE 440
            L +YE    E +   K+ ++           LE   +E+    ++ E   +E   +  E
Sbjct: 27  ELEEYEETALELEEKLKQEEE-------EAQLLEKKADELEEENRRLE---EEAAASEEE 76

Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRL 500
           R  L+ +   A  +  +A  E     E   K + +     +E+  A     +A ++L   
Sbjct: 77  RERLEAEVDEATAE--VAKLE-----EEREKKEAETRQLQQELREAQEAHERARQELLEA 129

Query: 501 TEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQI 560
                A     + +  E+       E+ S DL          E E     E  + L  Q+
Sbjct: 130 AAAPTAPPHVAAPVNGEQLEPDDNGEEASADLETDPDMKDRSEEERVTYAEKNERLQTQL 189

Query: 561 ETLKREIASALHDRDKALKECND 583
           + LK E+A+A   RD++ +  ND
Sbjct: 190 QALKSELAAA---RDESKETAND 209


>gnl|CDD|150503 pfam09840, DUF2067, Uncharacterized protein conserved in archaea
           (DUF2067).  This domain, found in various archaeal
           proteins, has no known function.
          Length = 190

 Score = 33.9 bits (78), Expect = 0.38
 Identities = 15/72 (20%), Positives = 32/72 (44%), Gaps = 3/72 (4%)

Query: 497 LKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTL 556
           L+RL+         Y  +  + + +  ++    ++L +A+R++K L  E +     K   
Sbjct: 15  LERLSRLLKDI---YLSVEIKGNKLKVKVFGTEKELKEAIRRVKELAREVRRKYRRKGLF 71

Query: 557 SYQIETLKREIA 568
            Y +  L RE+ 
Sbjct: 72  EYPLSDLFRELG 83


>gnl|CDD|226081 COG3551, COG3551, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 402

 Score = 35.0 bits (80), Expect = 0.39
 Identities = 21/155 (13%), Positives = 55/155 (35%), Gaps = 24/155 (15%)

Query: 388 IKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQ 447
           ++ E+       + +      +      +   + R  +Q  D+                 
Sbjct: 268 LEREFSRAMPLLEYVGELLWQSASSHNES---MTRFSEQVADLAMS-------------- 310

Query: 448 CTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAA 507
              A RQ +  LR+R     A     +Q+E A++E++  ++V  +     +R   E +  
Sbjct: 311 -YTAQRQLNDGLRDRLLEAGARD---EQNERAMRELSRRLSVCREELVSAQRNLAETDNL 366

Query: 508 MQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTL 542
           +++     +   ++   ++      T+ +R +  L
Sbjct: 367 LRDRDDTHARLMAI---LDSRFWRFTKPLRNLSRL 398


>gnl|CDD|223910 COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility
           and secretion / Signal transduction mechanisms].
          Length = 408

 Score = 34.6 bits (79), Expect = 0.47
 Identities = 21/112 (18%), Positives = 53/112 (47%), Gaps = 3/112 (2%)

Query: 463 NEYQEALAKVQQQHEDAVKEINT---AMAVRMKATKDLKRLTEERNAAMQEYSLIMSERD 519
            E    + ++Q +  DAV+ +      ++  +K  ++      E  AA++E S ++SE  
Sbjct: 292 KEIGLLIEEIQNEAADAVEHMEESASEVSEGVKLVEETGSSLGEIAAAIEEVSQLISEIA 351

Query: 520 SVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASAL 571
           +  +E   + E++  ++ ++  +  EN   VE+    S +++ L  ++   +
Sbjct: 352 AATEEQTAVLEEINASIEELDDVTQENAAAVEELAAASEELKELAEKLLELV 403


>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
           binding.  This is the middle region of a family of TATA
           element modulatory factor 1 proteins conserved in
           eukaryotes that contains at its N-terminal section a
           number of leucine zippers that could potentially form
           coiled coil structures. The whole proteins bind to the
           TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells.
          Length = 74

 Score = 31.5 bits (72), Expect = 0.50
 Identities = 16/48 (33%), Positives = 28/48 (58%)

Query: 523 KEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASA 570
           +E EKLS+   +    IK L  +NKEL ++   L  ++E L++E+ + 
Sbjct: 19  EEGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELENL 66


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 34.1 bits (79), Expect = 0.52
 Identities = 64/327 (19%), Positives = 132/327 (40%), Gaps = 50/327 (15%)

Query: 386 ETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLK 445
           + IK E     +R D+++       ++L+   EE  R +K+ E+  + +   L+E+  ++
Sbjct: 30  KRIKAEEKEEERRIDEMME-----EERLKALAEEEERERKRKEERREGRAV-LQEQ--IE 81

Query: 446 QQCTAAIRQWDIALRERNEYQEALAKVQQQHE-----------DAVKEINTAMAVRMKAT 494
           ++      +++  L+ER +  E + ++Q++ E              +EI+     R++  
Sbjct: 82  EREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERK 141

Query: 495 KDLKRLTEERNAAMQEYSL--------IMSERDSVHKEMEKLSEDLTQAVRKIKTLEMEN 546
           ++ K    E    + EY            +ER    +E E+    L     + +    E 
Sbjct: 142 EEEKEREREEELKILEYQREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDEREEL 201

Query: 547 KELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEEC---QREAF 603
            EL  D     YQ E  ++E      + +K  ++  +L+    E    KEE    +R   
Sbjct: 202 DELRADL----YQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEE 257

Query: 604 HARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQA 663
            A  E  +     D    E  E E A        +K RM  LE  +E+++  +  ++ +A
Sbjct: 258 EAERERMLEKQAED----EELEQENA--------EKRRMKRLEHRRELEQQIEEKEERRA 305

Query: 664 ELQDQSVNYLFILTESIQEAEVSKRRR 690
             +++ +       E ++E E  ++ R
Sbjct: 306 AEREEELEEG----ERLREEEAERQAR 328



 Score = 32.2 bits (74), Expect = 2.6
 Identities = 24/136 (17%), Positives = 51/136 (37%), Gaps = 5/136 (3%)

Query: 413 LELAQEEVARIKKQCEDMNQEKNTALRE--RNGLKQQCTAAIRQWDI-ALRERNEYQEAL 469
            +L QEE  R ++Q E    EK    ++  +   ++Q      +       E  E +  L
Sbjct: 206 ADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAERERML 265

Query: 470 AKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAA-MQEYSLIMSERDSVHKEMEKL 528
            K  +  E   +        R++  ++L++  EE+      E    + E + + +E  + 
Sbjct: 266 EKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEELEEGERLREEEAER 325

Query: 529 SEDLTQA-VRKIKTLE 543
              + +   R +K   
Sbjct: 326 QARIEEERQRLLKEHA 341


>gnl|CDD|238491 cd00991, PDZ_archaeal_metalloprotease, PDZ domain of archaeal zinc
            metalloprotases, presumably membrane-associated or
            integral membrane proteases, which may be involved in
            signalling and regulatory mechanisms. May be responsible
            for substrate recognition and/or binding, as most PDZ
            domains bind C-terminal polypeptides, and binding to
            internal (non-C-terminal) polypeptides and even to lipids
            has been demonstrated. In this subfamily of
            protease-associated PDZ domains a C-terminal beta-strand
            forms the peptide-binding groove base, a circular
            permutation with respect to PDZ domains found in
            Eumetazoan signaling proteins.
          Length = 79

 Score = 31.6 bits (72), Expect = 0.53
 Identities = 18/54 (33%), Positives = 25/54 (46%)

Query: 1261 GIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELAKPADKVTVL 1314
            G+ +  V  GS   +A L TGD I   NGT +        A +  KP + +TV 
Sbjct: 11   GVVIVGVIVGSPAENAVLHTGDVIYSINGTPITTLEDFMEALKPTKPGEVITVT 64



 Score = 30.4 bits (69), Expect = 1.3
 Identities = 14/50 (28%), Positives = 22/50 (44%), Gaps = 1/50 (2%)

Query: 221 AKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDVLTITTLKPTSL 270
           A+   L  GD + SIN   +  L    E +       +V+T+T L  T+ 
Sbjct: 23  AENAVLHTGDVIYSINGTPITTLEDFMEALKPTKPG-EVITVTVLPSTTK 71



 Score = 30.4 bits (69), Expect = 1.3
 Identities = 14/50 (28%), Positives = 22/50 (44%), Gaps = 1/50 (2%)

Query: 845 AKEGSLAVGDRVLSINNRTVDGLSSAREGMALLSDSPDVLTITTLKPTSL 894
           A+   L  GD + SIN   +  L    E +       +V+T+T L  T+ 
Sbjct: 23  AENAVLHTGDVIYSINGTPITTLEDFMEALKPTKPG-EVITVTVLPSTTK 71


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 33.8 bits (78), Expect = 0.61
 Identities = 31/132 (23%), Positives = 61/132 (46%), Gaps = 12/132 (9%)

Query: 467 EALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEE----RNAA--MQEYSLIMSERDS 520
           E L  +++  E   K++N   +++ K  K L+ L EE    R  A  +     +  E + 
Sbjct: 154 ENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPL--ELEK 211

Query: 521 VHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASA----LHDRDK 576
             +E+  LS  +++  ++++ L+ E +EL    + L+ +   L  EIA A       R  
Sbjct: 212 ARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREECRGW 271

Query: 577 ALKECNDLRERF 588
           + KE + L+ + 
Sbjct: 272 SAKEISKLKAKV 283



 Score = 31.2 bits (71), Expect = 4.6
 Identities = 21/128 (16%), Positives = 46/128 (35%), Gaps = 26/128 (20%)

Query: 375 NQHYMSALNKYETIKDE----YDALRKRFDDLIN----THSTAVDKLELAQEE------- 419
            +     LN   +IK +      AL++    L       +     +LE A++E       
Sbjct: 163 EELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVK 222

Query: 420 VARIKKQCEDMNQEKN---TALRERNGLKQQCTAAIRQWDIALRER--------NEYQEA 468
           ++  +KQ E++ QE      A+      K +    I + +    E         ++ +  
Sbjct: 223 ISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREECRGWSAKEISKLKAK 282

Query: 469 LAKVQQQH 476
           ++ +Q+  
Sbjct: 283 VSLLQKLT 290


>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B.  A
           small number of taxonomically diverse prokaryotic
           species, including Methanosarcina barkeri, have what
           appears to be a second ATP synthase, in addition to the
           normal F1F0 ATPase in bacteria and A1A0 ATPase in
           archaea. These enzymes use ion gradients to synthesize
           ATP, CC and in principle may run in either direction.
           This model represents the F0 subunit B of this apparent
           second ATP synthase.
          Length = 246

 Score = 33.5 bits (77), Expect = 0.69
 Identities = 35/176 (19%), Positives = 67/176 (38%), Gaps = 27/176 (15%)

Query: 458 ALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSE 517
           A +ER EY+E   ++ QQ E  +     A      A  + +RL +E   A +E   I  +
Sbjct: 55  AEQERREYEEKNEELDQQREVLL---TKAKEE---AQAERQRLLDE---AREEADEIREK 105

Query: 518 R-DSVHKEMEKLSEDLTQAVRKI------KTLEMENKELVEDK--KTLSYQIETLKRE-- 566
             +++ +E   LS++L +           K L       +E++       ++ TL  +  
Sbjct: 106 WQEALRREQAALSDELRRRTGAEVFAIARKVLTDLADTDLEERMVDVFVQRLRTLDPDEK 165

Query: 567 --IASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSR 620
             +A AL D    +     +R  F      +E+  R+         +R+  +    
Sbjct: 166 AALAEALADSGNPVL----VRSAFELPEEQREQI-RDTIRETLGPEIRLRFQTEPD 216



 Score = 32.7 bits (75), Expect = 1.2
 Identities = 26/93 (27%), Positives = 37/93 (39%), Gaps = 19/93 (20%)

Query: 583 DLRER--FGEFTAAKEECQREAFHARFEYAMRMGERDSSR-----KERNEVEAASMDLFG 635
           D RE+   GE   A +  +REA   R EY  +  E D  R     K + E +A    L  
Sbjct: 35  DAREKKIAGELADA-DTKKREAEQERREYEEKNEELDQQREVLLTKAKEEAQAERQRLLD 93

Query: 636 KCQKERMDNLEANQEIDRLRKH-TDKLQAELQD 667
                     EA +E D +R+   + L+ E   
Sbjct: 94  ----------EAREEADEIREKWQEALRREQAA 116


>gnl|CDD|238490 cd00990, PDZ_glycyl_aminopeptidase, PDZ domain associated with
            archaeal and bacterial M61 glycyl-aminopeptidases. May be
            responsible for substrate recognition and/or binding, as
            most PDZ domains bind C-terminal polypeptides, and
            binding to internal (non-C-terminal) polypeptides and
            even to lipids has been demonstrated. In this subfamily
            of protease-associated PDZ domains a C-terminal
            beta-strand is presumed to form the peptide-binding
            groove base, a circular permutation with respect to PDZ
            domains found in Eumetazoan signaling proteins.
          Length = 80

 Score = 31.2 bits (71), Expect = 0.73
 Identities = 14/44 (31%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 1141 LGIQIQCLDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCGINMR 1184
            LG+ +   + G   V+ V + S A + GL  GD+L+ V G  + 
Sbjct: 3    LGLTLDK-EEGLGKVTFVRDDSPADKAGLVAGDELVAVNGWRVD 45


>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361).
           This is a region of 120 amino acids that is conserved in
           a family of proteins found from plants to fungi. The
           function is not known.
          Length = 114

 Score = 31.8 bits (73), Expect = 0.80
 Identities = 21/81 (25%), Positives = 36/81 (44%), Gaps = 15/81 (18%)

Query: 462 RNEYQEALAKVQQQHEDA-VKEINTAMAV---------RMKATKDLKRLTEERNAAMQEY 511
           R E + AL  ++++ EDA  ++    MA          R KAT+ LK+L ++      E 
Sbjct: 23  RVEKERALEALKKELEDAERRKKRKKMAKKYHMVRFFERKKATRRLKQLEKKLEETSDE- 81

Query: 512 SLIMSERDSVHKEMEKLSEDL 532
                E   + +++ K   DL
Sbjct: 82  ----KEIKDLKEQLSKAEVDL 98


>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription factor
            N terminal domain.  The N terminus of the PEA3
            transcription factors is implicated in transactivation
            and in inhibition of DNA binding. Transactivation is
            potentiated by activation of the Ras/MAP kinase and
            protein kinase A signalling cascades. The N terminal
            region contains conserved MAP kinase phosphorylation
            sites.
          Length = 336

 Score = 33.7 bits (77), Expect = 0.82
 Identities = 38/193 (19%), Positives = 55/193 (28%), Gaps = 37/193 (19%)

Query: 887  TTLKPTSLGEHSPRVHRKSVNSSTQTMPLYVRYGDQPPPDSYQVPKTSTKSH------DP 940
            + + P             S +S              PPP S       ++        +P
Sbjct: 141  SPVNPQETVRQLQPSGPLSNSSPPSPHTPLPNQSPLPPPMSSPDSSYPSEHRFQRQLSEP 200

Query: 941  AKPSPSSSGGGGKWDSLREKMQGVRRRGGGEEGKRSNRNS-SPPVTFEQEQVDAIAELDS 999
              P P   G G      R+      R+        S      PP  F+QE  D + E   
Sbjct: 201  CLPFPPPPGRGS-----RDGRPPYHRQM-------SEPLVPYPPQGFKQEYHDPLYE--- 245

Query: 1000 FVHTHSPSDGSSPHY---HKAPR---------SLASAHSHPGG-PNNGAGGTSYETSSYH 1046
                  P+ G  PH     + PR            S++    G  +N   G SY+     
Sbjct: 246  --EAGVPNQGPFPHPMMIKQEPRDFTYDSEVPGCHSSYGRAEGFYSNRHEGFSYDKEPRL 303

Query: 1047 SYDAAGTFPRKKE 1059
             YD     P + E
Sbjct: 304  FYDDTCVVPERLE 316


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 33.9 bits (78), Expect = 0.84
 Identities = 20/112 (17%), Positives = 39/112 (34%), Gaps = 5/112 (4%)

Query: 523 KEMEKLSEDLTQAVRKIKT--LEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKE 580
           KE+ +   +L + +RK+       E +E   +   L  +I+ L  EI       D+   +
Sbjct: 2   KELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61

Query: 581 CNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMD 632
                E  G     +E     A    +   +R G+   + + +        D
Sbjct: 62  PAASGEGGGGEEEEEEAKAEAAEFRAY---LRGGDDALAEERKALSTGTDSD 110


>gnl|CDD|219925 pfam08598, Sds3, Sds3-like.  Repression of gene transcription is
           mediated by histone deacetylases containing
           repressor-co-repressor complexes, which are recruited to
           promoters of target genes via interactions with
           sequence-specific transcription factors. The
           co-repressor complex contains a core of at least seven
           proteins. This family represents the conserved region
           found in Sds3, Dep1 and BRMS1-homologue p40 proteins.
          Length = 184

 Score = 32.7 bits (75), Expect = 0.91
 Identities = 23/132 (17%), Positives = 47/132 (35%), Gaps = 22/132 (16%)

Query: 379 MSALNKYETIKDEYDALRKR-FDDLINTHSTAVDKLELAQEEVA----RIKKQCEDMNQE 433
             AL++   I+ ++  LR + + + +    T   +LEL  +          K  E+   +
Sbjct: 1   QEALDELAKIEKQFAELRDQLYRERLAQLQT---ELELLLQGTHPEYLEPLKDLEERRDD 57

Query: 434 KNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKA 493
           +      R   K +C               EY+      +Q+ E   + +   +    + 
Sbjct: 58  RLKVAELRREYKLECI------------EREYEAERQAAKQEFEKEKRLLRERL--LEEL 103

Query: 494 TKDLKRLTEERN 505
            + + RL EER 
Sbjct: 104 EEKIYRLEEERR 115


>gnl|CDD|192111 pfam08647, BRE1, BRE1 E3 ubiquitin ligase.  BRE1 is an E3 ubiquitin
           ligase that has been shown to act as a transcriptional
           activator through direct activator interactions.
          Length = 96

 Score = 31.1 bits (71), Expect = 0.94
 Identities = 17/61 (27%), Positives = 33/61 (54%), Gaps = 3/61 (4%)

Query: 497 LKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQ---AVRKIKTLEMENKELVEDK 553
             RLT E+  A Q+Y   M  +D++  E +KL+  L +    + ++K LE E +  +++ 
Sbjct: 33  KLRLTAEKAKADQKYFAAMRSKDALDAENKKLNTQLNKSSELIEQLKDLENELRRRIKEL 92

Query: 554 K 554
           +
Sbjct: 93  E 93


>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 1104

 Score = 33.8 bits (77), Expect = 0.96
 Identities = 33/177 (18%), Positives = 67/177 (37%), Gaps = 16/177 (9%)

Query: 395 LRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQ 454
                       + A  K+E A+ E      +  +M    +  +++  G +    +A R+
Sbjct: 300 QETLVRQFTVEQTQAKSKVESAKIE----TDRAREMETLAHDNVKQIVGAQHGILSAKRE 355

Query: 455 WDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLI 514
              A+ +R     A A +       VK +  A      A +  + L E  NAA       
Sbjct: 356 G--AVDKRRTISTARAGLDA----LVKGLGGA------APESAEELLELNNAARLTVDEY 403

Query: 515 MSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASAL 571
            + R+++    ++  ED T+AV + K  + E   L +    + Y++  ++  +   L
Sbjct: 404 PAAREALESAGQRNVEDRTRAVDEFKAADQELSSLSKGSSNIEYRLLQVRENLCQDL 460



 Score = 31.5 bits (71), Expect = 5.5
 Identities = 43/242 (17%), Positives = 82/242 (33%), Gaps = 30/242 (12%)

Query: 295 QHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLE 354
           +  E ++R  HTL  L   + + + A    + A     +L +  +       R+ +E  E
Sbjct: 249 EQVEDIKRQIHTLDPLVQLKNRREKAQQSKDHANALKKALPTVGN-------RIKKEEQE 301

Query: 355 LRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLE 414
                                     A +K E+ K E      R  ++       V ++ 
Sbjct: 302 TL--------------VRQFTVEQTQAKSKVESAKIE----TDRAREMETLAHDNVKQIV 343

Query: 415 LAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQ 474
            AQ  +   K+  E    ++ T    R GL             +  E  E   A      
Sbjct: 344 GAQHGILSAKR--EGAVDKRRTISTARAGLDALVKGLGGAAPESAEELLELNNAARLTVD 401

Query: 475 QHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQ 534
           ++  A + + +A     +  +D  R  +E  AA QE S +     ++   + ++ E+L Q
Sbjct: 402 EYPAAREALESA---GQRNVEDRTRAVDEFKAADQELSSLSKGSSNIEYRLLQVRENLCQ 458

Query: 535 AV 536
            +
Sbjct: 459 DL 460


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
           sigma54-dependent transcription [Transcription / Signal
           transduction mechanisms].
          Length = 225

 Score = 33.1 bits (76), Expect = 0.98
 Identities = 36/203 (17%), Positives = 75/203 (36%), Gaps = 17/203 (8%)

Query: 468 ALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEK 527
            + ++  + ED  K +  A+        +L +  +    A+     +  + +      EK
Sbjct: 14  NINELLDKAEDPEKMLEQAI---RDMESELAKARQALAQAIARQKQLERKLEEAQARAEK 70

Query: 528 LSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRER 587
           L E    A++     E   +E +E+K++L    + L+ E+  A    +K  K+   L ++
Sbjct: 71  LEEKAELALQAGN--EDLAREALEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQK 128

Query: 588 FGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEA 647
             E  A KE    +A  A  +   ++              +++M  F +  +E+++  EA
Sbjct: 129 IAELRAKKE--ALKARKAAAKAQEKVNRSLGGGS-----SSSAMAAFER-MEEKIEEREA 180

Query: 648 N---QEIDRLRKHTDKLQAELQD 667
                         D L  E   
Sbjct: 181 RAEAAAELAEGSGDD-LDKEFAQ 202


>gnl|CDD|169691 PRK09173, PRK09173, F0F1 ATP synthase subunit B; Validated.
          Length = 159

 Score = 32.4 bits (74), Expect = 1.0
 Identities = 22/73 (30%), Positives = 36/73 (49%), Gaps = 7/73 (9%)

Query: 456 DIALRERNEYQEALAKVQQQHEDA---VKEINTAMAVRMKA-TKDLKRLTEE---RNAAM 508
             A R R E Q+ LA+ Q++ ++A     +I  A     +A T + KR TEE   R   +
Sbjct: 43  AEARRLREEAQQLLAEYQRKRKEAEKEAADIVAAAEREAEALTAEAKRKTEEYVARRNKL 102

Query: 509 QEYSLIMSERDSV 521
            E  +  +E D++
Sbjct: 103 AEQKIAQAETDAI 115


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 33.7 bits (78), Expect = 1.0
 Identities = 26/130 (20%), Positives = 49/130 (37%), Gaps = 34/130 (26%)

Query: 386 ETIKDEYDALRKRFDDLINTHSTAVDK-----LELAQEEVARIKKQCEDMNQEKNTALRE 440
             ++ + DAL K+          A DK     LE     + +I +Q E+  Q        
Sbjct: 39  ADVQAQLDALNKQ------KLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQ-------- 84

Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRL 500
              LKQQ   A  +    LR+    Q  L  ++  +++  +E  + +++R      L+  
Sbjct: 85  ---LKQQLAQAPAK----LRQ---AQAELEALKDDNDEETRETLSTLSLR-----QLESR 129

Query: 501 TEERNAAMQE 510
             +    +Q 
Sbjct: 130 LAQTLDQLQN 139



 Score = 32.2 bits (74), Expect = 3.5
 Identities = 24/117 (20%), Positives = 52/117 (44%), Gaps = 6/117 (5%)

Query: 370 SADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCED 429
           +   + Q  + ALNK + ++ E   +++   DL  T +  +DK++  +EE  ++K+Q   
Sbjct: 37  TEADV-QAQLDALNKQKLLEAEDKLVQQ---DLEQTLAL-LDKIDRQKEETEQLKQQLAQ 91

Query: 430 MNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTA 486
              +   A  E   LK       R+  ++     + +  LA+   Q ++A  ++   
Sbjct: 92  APAKLRQAQAELEALKDDNDEETRE-TLSTLSLRQLESRLAQTLDQLQNAQNDLAEY 147


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 33.4 bits (76), Expect = 1.1
 Identities = 65/280 (23%), Positives = 117/280 (41%), Gaps = 63/280 (22%)

Query: 322 AQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSA 381
           A+L+ A Q  S LR++ S L S ++ L  +L +L+                         
Sbjct: 420 AELQQARQNESELRNQISLLTSLERSLKSDLGQLKK------------------------ 455

Query: 382 LNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCED-MNQEKNTALRE 440
                    E D L+ + + +++  +   DK  +   E  R+K + +  +N EK  A  E
Sbjct: 456 ---------ENDMLQTKLNSMVS--AKQKDKQSMQSME-KRLKSEADSRVNAEKQLA-EE 502

Query: 441 RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRL 500
           +   K++   A R    A   R E  E+L + +Q  E  +K++       +K  ++  R+
Sbjct: 503 KKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEH----DLKLKEEECRM 558

Query: 501 TEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKI--KTLEMENKELVED--KKTL 556
            E+    +++Y           +E EK +E L  A++ +  K L +EN    E   K  L
Sbjct: 559 LEKEAQELRKY-----------QESEKETEVLMSALQAMQDKNLMLENSLSAETRLKLDL 607

Query: 557 SYQIETLKR--EIAS-ALHDRDKALKECNDLRERFGEFTA 593
              +  +KR  EIA  A++ R+    E  DL+ +  +  A
Sbjct: 608 FSALGDVKRQLEIAHGAIYKRE---AEIVDLKSKIADILA 644



 Score = 33.4 bits (76), Expect = 1.4
 Identities = 35/167 (20%), Positives = 68/167 (40%), Gaps = 13/167 (7%)

Query: 496 DLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKT 555
           D+K+L  E   A Q  S + ++   +      L  DL Q  ++   L+ +   +V  K+ 
Sbjct: 414 DIKKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAKQK 473

Query: 556 LSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGE 615
               ++++++ + S    R  A K+  + ++R       K+E +  A  A  + A    E
Sbjct: 474 DKQSMQSMEKRLKSEADSRVNAEKQLAEEKKR-------KKEEEETAARAAAQAAASREE 526

Query: 616 RDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRLRKHTDKLQ 662
              S K+  +      DL  + +K   D     +E   L K   +L+
Sbjct: 527 CAESLKQAKQ------DLEMEIKKLEHDLKLKEEECRMLEKEAQELR 567


>gnl|CDD|227406 COG5074, COG5074, t-SNARE complex subunit, syntaxin [Intracellular
           trafficking and secretion].
          Length = 280

 Score = 32.9 bits (75), Expect = 1.3
 Identities = 49/288 (17%), Positives = 98/288 (34%), Gaps = 82/288 (28%)

Query: 297 NETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELR 356
           N+ +   +  + +++     H+  + ++     +S  LR    +  S    L R L    
Sbjct: 31  NKNLSVYEKEINQID---NLHKDLLTEVFEE--QSRKLRRSLDNFSSQTTDLQRNL---- 81

Query: 357 SGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELA 416
                     D  SA+    H           + + + +R++F  LI      +D     
Sbjct: 82  --------KKDIKSAERDGIHL-------ANKQAQAENVRQKFLKLIQ-DYRIIDS-NYR 124

Query: 417 QEEVARIKKQCEDMN-----QEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAK 471
           +EE  + ++Q           E   A+ + NG +    A +         R E + ALA+
Sbjct: 125 EEEKEQARRQYIIAQPEATEDEVEAAINDVNGQQVFSQALLNA-----NRRGEAKTALAE 179

Query: 472 VQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSED 531
           VQ +H                  +++K++                         EK   +
Sbjct: 180 VQARH------------------QEIKKI-------------------------EKTMAE 196

Query: 532 LTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALK 579
           LTQ    ++ L +E +E V+    +   +E  +  +   +   DKA+K
Sbjct: 197 LTQLFNDMEELVIEQQENVD---VIDKNVEDAQENVEQGVGHTDKAVK 241


>gnl|CDD|219083 pfam06548, Kinesin-related, Kinesin-related.  This family
           represents a region within kinesin-related proteins from
           higher plants. Many family members also contain the
           pfam00225 domain. Kinesins are ATP-driven microtubule
           motor proteins that produce directed force. Some family
           members are associated with the phragmoplast, a
           structure composed mainly of microtubules that executes
           cytokinesis in higher plants.
          Length = 488

 Score = 33.3 bits (76), Expect = 1.3
 Identities = 40/213 (18%), Positives = 89/213 (41%), Gaps = 34/213 (15%)

Query: 340 DLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRF 399
           +L +++   ++  +EL S         +    + L      A+  +  I ++Y  L++++
Sbjct: 303 ELEANRSLAEKLEMELES---------EKKCTEELKDALQRAMQGHARIIEQYAELQEKY 353

Query: 400 DDLINTHSTAVDKLELAQEEVARI-KKQCEDMNQEKNTALRERNGLKQQCTA--AIRQWD 456
           +DL+  H   ++ +E  ++  A+   K             R  N L  + +A  A R+  
Sbjct: 354 NDLLEKHRRIMEGIEDVKKAAAKAGVKGAG---------SRFANALAAELSALRAERE-- 402

Query: 457 IALRERNEYQEALAKVQQQHEDAVKEINTA--MAVRMKATKDLKRLTEERNAAMQEYSLI 514
              +ER   ++    +Q Q  D  + +  A  + VR++  ++   L +ER    ++    
Sbjct: 403 ---KERELLKKENKSLQIQLRDTAEAVQAAGELLVRLREAEEAATLAQERAMLAEQ---- 455

Query: 515 MSERDSVHKEMEKLSEDLTQAVRKIKTLEMENK 547
             E + + K++EKL       +  +K    E+K
Sbjct: 456 --ENEKLKKQIEKLKRKHKMEIVTMKQYLAESK 486



 Score = 33.3 bits (76), Expect = 1.3
 Identities = 39/232 (16%), Positives = 74/232 (31%), Gaps = 45/232 (19%)

Query: 344 DKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLI 403
           + +  +  LL      C + E  DA   +  +   +  +    + K     L        
Sbjct: 59  ESEARESTLLCSS---CKKKEDTDAVVQEISDCINLQLVPVDNSEKAVEKVL-------- 107

Query: 404 NTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQ------CTAAIRQW-- 455
                             R + + E++  E+   + + N L QQ      C A I Q   
Sbjct: 108 --------------AGAIRREMELEELCVEQAAKIEQLNRLVQQYKHERECNALIDQTKE 153

Query: 456 ----------DIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERN 505
                     D  L       E    +  +H+  +K         +K T +LKR+ EE  
Sbjct: 154 DKIARLESLMDGVLPTEEFIDEEFLSLMNEHK-LLKSDYENHPEVLKETIELKRIQEELE 212

Query: 506 AAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLS 557
           +  + +   M E++ + +E++ L   L   +   K+       L+     L 
Sbjct: 213 SF-KNFYFDMGEKEVLLEEIQDLKSQLQCMLSSSKSTARTRSSLLARSFQLR 263


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 33.4 bits (77), Expect = 1.4
 Identities = 12/102 (11%), Positives = 37/102 (36%), Gaps = 13/102 (12%)

Query: 383 NKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERN 442
           N    ++ E   L+++ +      + +    E  Q+E+  ++    ++ +++     +  
Sbjct: 142 NLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLE 201

Query: 443 GLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEIN 484
            L+++                  QE   K ++  + A K + 
Sbjct: 202 QLQEKAAETS-------------QERKQKRKEITDQAAKRLE 230



 Score = 32.6 bits (75), Expect = 2.2
 Identities = 30/159 (18%), Positives = 54/159 (33%), Gaps = 51/159 (32%)

Query: 411 DKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALA 470
           + L   Q+EV  +K+Q E   +EK                   Q       + +   AL 
Sbjct: 142 NLLHALQQEVLTLKQQLELQAREK------------------AQSQALAEAQQQELVALE 183

Query: 471 KVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSE 530
            +  + E+  +E+             L++L E+     QE               +K  E
Sbjct: 184 GLAAELEEKQQELEA----------QLEQLQEKAAETSQE-------------RKQKRKE 220

Query: 531 DLTQAVRKIKTLEMENKELVE----------DKKTLSYQ 559
              QA ++++  E E + L++          D KTL + 
Sbjct: 221 ITDQAAKRLELSEEETRILIDQQLRKAGWEADSKTLRFS 259


>gnl|CDD|225387 COG2831, FhaC, Hemolysin activation/secretion protein [Intracellular
            trafficking and secretion].
          Length = 554

 Score = 33.1 bits (76), Expect = 1.5
 Identities = 24/119 (20%), Positives = 35/119 (29%), Gaps = 16/119 (13%)

Query: 1315 AQYSVERYNEVKDKPGDS----FYMRALFSRTGDLGDALQLRF-HKDDILYVDNTMFNGV 1369
             +  +E   +V D  G S      +     R      +L L   H+    Y+D+T     
Sbjct: 296  YRQVLEGPFDVLDYQGKSQNLSLRLSHPLLRNRSSKLSLGLGLSHRRSENYLDDTEIEVQ 355

Query: 1370 PGHWRAWLVDADGFRQQCGIIPSKYKVEEELLLRRSLGDL------ESDARRATRRSFF 1422
                 A  +   G R    +       +  L  RR LG L      E      T R   
Sbjct: 356  RRRLTAVELGLSGRRY---LGGGTL--DASLGYRRGLGALGASKDDEETNGEPTSRFKK 409


>gnl|CDD|232932 TIGR00345, GET3_arsA_TRC40, transport-energizing ATPase,
           TRC40/GET3/ArsA family.  Members of this family are
           ATPases that energize transport, although with different
           partner proteins for different functions. Recent
           findings show that TRC40 (GET3 in yeast) in involved in
           the insertion of tail-anchored membrane proteins in
           eukaryotes. A similar function is expected for members
           of this family in archaea. However, the earliest
           discovery of a function for this protein family is ArsA,
           an arsenic resistance protein that partners with ArsB
           (see pfam02040) for As(III) efflux [Hypothetical
           proteins, Conserved].
          Length = 284

 Score = 32.4 bits (74), Expect = 1.5
 Identities = 14/65 (21%), Positives = 26/65 (40%), Gaps = 3/65 (4%)

Query: 324 LEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALN 383
            +AA +E    R+K  + I         L +   G    P  D+  + D   +H   A N
Sbjct: 55  PQAALEE---YRAKLVEQIKGNLPDGDMLGDQLEGAALSPGIDEIAAFDEFLKHMTDAEN 111

Query: 384 KYETI 388
           +++ +
Sbjct: 112 EFDVV 116


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 33.0 bits (76), Expect = 1.6
 Identities = 35/157 (22%), Positives = 71/157 (45%), Gaps = 17/157 (10%)

Query: 408 TAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRER----- 462
            A   +E A++E   +KK  E + + K    + R  L+++     R+ ++   ER     
Sbjct: 33  LAKRIIEEAKKEAETLKK--EALLEAKEEVHKLRAELERE--LKERRNELQRLERRLLQR 88

Query: 463 ----NEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSE- 517
               +   E+L K ++  E   KE++       +  ++L+ L  E+   ++  S +  E 
Sbjct: 89  EETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEE 148

Query: 518 -RDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDK 553
            ++ + +E+E   E   +A + IK +E E KE  + K
Sbjct: 149 AKEILLEEVE--EEARHEAAKLIKEIEEEAKEEADKK 183


>gnl|CDD|216263 pfam01044, Vinculin, Vinculin family. 
          Length = 850

 Score = 33.2 bits (76), Expect = 1.7
 Identities = 46/252 (18%), Positives = 83/252 (32%), Gaps = 49/252 (19%)

Query: 478 DAVKEINTAMAVRMKA---TKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQ 534
           D +K++   +   MK       ++   E R+   ++          +   M ++      
Sbjct: 192 DTLKKLAPILITAMKIFVRHPGVEEARENRDYTAKQ----------MSDAMNEIIR---- 237

Query: 535 AVRKIKTLEMENKELVEDKKTLSYQIETLKREI----ASALHDRDKALKECNDLRERFGE 590
            V ++ +  +  K L      L+  ++ L   I    A+ + +R +   E        G 
Sbjct: 238 -VLQLTSYTVMKKALSGIGSELAAALDWLDDPIIIDPATYIEERCRPSLEERIESIISGA 296

Query: 591 FTAAKEECQREAFHARFEYAMRMGERDSSRKERNE-VEAASMDL---FGKCQKERMDNLE 646
              A   C R                 +   E N     A  DL   + K    + D   
Sbjct: 297 ALMADSLCTRRN------------RGQAIVAECNGAGRQALQDLLSEYMKLAGRKDDLDA 344

Query: 647 ANQEIDRLRKHTDKLQAELQDQSVN-----YLFILTESIQEAEVSKRRRDWAFSERDKIE 701
               IDR+RK T  L+ +L+   V+     +L   T      E +K  R+  F E+ +  
Sbjct: 345 LCDAIDRMRKKTRDLRRQLRKAVVDHVSDSFLDTTTPLKLLIEAAKAGREENFEEKAQN- 403

Query: 702 VIDSYHTKHAGR 713
                  +HA R
Sbjct: 404 -----FQEHANR 410


>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
           lipoprotein.  This protein family occurs in strictly
           within a subset of Gram-negative bacterial species with
           the proposed PEP-CTERM/exosortase system, analogous to
           the LPXTG/sortase system common in Gram-positive
           bacteria. This protein occurs in a species if and only
           if a transmembrane histidine kinase (TIGR02916) and a
           DNA-binding response regulator (TIGR02915) also occur.
           The present of tetratricopeptide repeats (TPR) suggests
           protein-protein interaction, possibly for the regulation
           of PEP-CTERM protein expression, since many PEP-CTERM
           proteins in these genomes are preceded by a proposed DNA
           binding site for the response regulator.
          Length = 899

 Score = 33.1 bits (76), Expect = 1.7
 Identities = 54/293 (18%), Positives = 94/293 (32%), Gaps = 58/293 (19%)

Query: 315 GQHQAAMAQLEAAAQ-ESSSLRSKYSDLIS--DKQRLDRELLELRSGCCTQP-------- 363
           G    A+A LE AAQ +    R+    ++S     + D+ L   +     QP        
Sbjct: 411 GDPSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAKKLEKKQPDNASLHNL 470

Query: 364 -------ESDDAGSADSLNQ-------HYMSALN--KYETIKDEYDALRKRFDDLINTHS 407
                  + D A + ++  +        + +A N  + +  +   D   +RF+ ++    
Sbjct: 471 LGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDP 530

Query: 408 TAVD------KLELAQ----EEVARIKKQCEDMNQEKNTAL------RERNGLKQQCTAA 451
             +        L L      E VA ++K  E   QE   AL        +  LK+     
Sbjct: 531 KNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQLKKALAIL 590

Query: 452 IRQWDIALRERNEYQEALAKVQQQ---HEDAVKEINTAMAVRMKATKDLKRLTEERNAAM 508
               D A  +  E    L + Q        AV      +A++  +   L  L     A  
Sbjct: 591 NEAADAA-PDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSALALLLL-----ADA 644

Query: 509 QEYSLIMSERDSVHKEMEK---LSEDLTQAVRKIKTLEMENKELVEDKKTLSY 558
                +M         +++   L  D T+A   +  L +  K     KK    
Sbjct: 645 YA---VMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTESAKKIAKS 694


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 32.8 bits (75), Expect = 2.0
 Identities = 38/253 (15%), Positives = 82/253 (32%), Gaps = 28/253 (11%)

Query: 389 KDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQC 448
              +    K F+ L       + +     EE  +     ED+ + +     +   L    
Sbjct: 729 AKLFGRFLKAFNALKK-----LYEFAYTLEEEIQYL---EDLLKARLEKDYQSKYLD--- 777

Query: 449 TAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVR-----MKATKDLKRLTEE 503
              +       RE  +  E+ A      +  ++ I            +  T + K    E
Sbjct: 778 ---LIDNLNDAREIKDK-ESKAIDLDDIDFELELIGKQEINIDYILELLQTFNDKNGAYE 833

Query: 504 RNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETL 563
               + E  +     D   K+  KL E L +A+ + +  +++  E +E+   L+ + E  
Sbjct: 834 SLKELIERIIKEWIEDLRQKK--KLIERLIEAINQYRAKKLDTAEKLEELYILAKKEEEF 891

Query: 564 KREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKER 623
           K+       + ++ L   +DL    G+      E        +      + E++    + 
Sbjct: 892 KQFAEEEGLNEEE-LAFYDDLALNGGKLPENGTELVE-----KLAKEKSLREKNKDDWKA 945

Query: 624 NEVEAASMDLFGK 636
            E   A +  F +
Sbjct: 946 KEEVEAKLKAFFR 958


>gnl|CDD|227398 COG5066, SCS2, VAMP-associated protein involved in inositol
            metabolism [Intracellular trafficking and secretion].
          Length = 242

 Score = 31.9 bits (72), Expect = 2.2
 Identities = 20/76 (26%), Positives = 26/76 (34%), Gaps = 8/76 (10%)

Query: 1053 TFPRKKENQR-----FRIPSNPSVAFKTSGTKMSNGSIEKTSERGSPMPTFHVEVLSPGL 1107
            TF     N+       +  S   V FK   T   +  +        PM T  VEV+  GL
Sbjct: 10   TFYVPLTNKSKEMFSVQNNSPEPVGFKVKTTAPKDYCVRPNMGLIEPMSTVEVEVILQGL 69

Query: 1108 RQNKRNSLPDYCCSHK 1123
                    PD+ C  K
Sbjct: 70   ---TEEPAPDFKCRDK 82


>gnl|CDD|153341 cd07657, F-BAR_Fes_Fer, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Fes (feline sarcoma) and
           Fer (Fes related) tyrosine kinases.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Fes (feline sarcoma), also called
           Fps (Fujinami poultry sarcoma), and Fer (Fes related)
           are cytoplasmic (or nonreceptor) tyrosine kinases that
           play roles in haematopoiesis, inflammation and immunity,
           growth factor signaling, cytoskeletal regulation, cell
           migration and adhesion, and the regulation of cell-cell
           interactions. Although Fes and Fer show redundancy in
           their biological functions, they show differences in
           their expression patterns. Fer is ubiquitously expressed
           while Fes is expressed predominantly in myeloid and
           endothelial cells. Fes and Fer contain an N-terminal
           F-BAR domain, an SH2 domain, and a C-terminal catalytic
           kinase domain. F-BAR domains form banana-shaped dimers
           with a positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules. The
           F-BAR domain of Fes is critical in its role in
           microtubule nucleation and bundling.
          Length = 237

 Score = 32.0 bits (73), Expect = 2.3
 Identities = 23/117 (19%), Positives = 45/117 (38%), Gaps = 11/117 (9%)

Query: 370 SADSLNQHYMSALNKY----ETIKDEYDALRKRFDDLINTHSTAVDKLELA----QEEVA 421
            A++L    +  L          K  Y   R++ D+     +  V+KL+       E+  
Sbjct: 84  HAEALESGTLDKLTLLIKDKRKAKKAYQEERQQIDEQYKKLTDEVEKLKSEYQKLLEDYK 143

Query: 422 RIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHED 478
             K + E+   +     R+ +  + +   A R+  +     N+Y  AL + Q+  ED
Sbjct: 144 AAKSKFEEAVVKGGRGGRKLDKARDKYQKACRKLHLC---HNDYVLALLEAQEHEED 197


>gnl|CDD|212969 cd12036, SH3_MPP5, Src Homology 3 domain of Membrane Protein,
            Palmitoylated 5 (or MAGUK p55 subfamily member 5).  MPP5,
            also called PALS1 (Protein associated with Lin7) or Nagie
            oko protein in zebrafish or Stardust in Drosophila, is a
            scaffolding protein which associates with Crumbs homolog
            1 (CRB1), CRB2, or CRB3 through its PDZ domain and with
            PALS1-associated tight junction protein (PATJ) or
            multi-PDZ domain protein 1 (MUPP1) through its L27
            domain. The resulting tri-protein complexes are core
            proteins of the Crumb complex, which localizes at tight
            junctions or subapical regions, and is involved in the
            maintenance of apical-basal polarity in epithelial cells
            and the morphogenesis and function of photoreceptor
            cells. MPP5 is critical for the proper stratification of
            the retina and is also expressed in T lymphocytes where
            it is important for TCR-mediated activation of NFkB.
            Drosophila Stardust exists in several isoforms, some of
            which show opposing functions in photoreceptor cells,
            which suggests that the relative ratio of different
            Crumbs complexes regulates photoreceptor homeostasis.
            MPP5 contains two L27 domains followed by the core of
            three domains characteristic of MAGUK
            (membrane-associated guanylate kinase) proteins: PDZ,
            SH3, and guanylate kinase (GuK). In addition, it also
            contains the Hook (Protein 4.1 Binding) motif in between
            the SH3 and GuK domains. The GuK domain in MAGUK proteins
            is enzymatically inactive; instead, the domain mediates
            protein-protein interactions and associates
            intramolecularly with the SH3 domain. SH3 domains are
            protein interaction domains that bind to proline-rich
            ligands with moderate affinity and selectivity,
            preferentially to PxxP motifs. They play versatile and
            diverse roles in the cell including the regulation of
            enzymes, changing the subcellular localization of
            signaling pathway components, and mediating the formation
            of multiprotein complex assemblies.
          Length = 63

 Score = 29.3 bits (66), Expect = 2.3
 Identities = 16/45 (35%), Positives = 23/45 (51%), Gaps = 3/45 (6%)

Query: 1349 LQLRFHKDDILYVDNTMFNGVPGHWRAWLVDADGFRQQCGIIPSK 1393
            L L F K DIL+V +      P  W+A+    +  +   G+IPSK
Sbjct: 21   LGLSFQKGDILHVISQ---EDPNWWQAYREGEEDNQSLAGLIPSK 62


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 32.0 bits (73), Expect = 2.4
 Identities = 49/260 (18%), Positives = 95/260 (36%), Gaps = 31/260 (11%)

Query: 371 ADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDM 430
           A  L+    +  +K   ++ E   ++   + L N       K++  Q+E+ + K + +  
Sbjct: 26  AALLSDKIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKK- 84

Query: 431 NQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVR 490
            Q++   L+E                  + ER E  +  A+  Q +  A   I+  + + 
Sbjct: 85  LQKEIAELKEN-----------------IVERQELLKKRARAMQVNGTATSYID--VILN 125

Query: 491 MKATKDL-KRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKEL 549
            K+  DL  R+T        +  ++    +   ++ + L E       K++TL     EL
Sbjct: 126 SKSFSDLISRVTAISVIVDADKKIL----EQQKEDKKSLEEKQAALEDKLETLVALQNEL 181

Query: 550 VEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEY 609
                +L+ Q       IA+       AL E   L E+     A  E    EA  A+ E 
Sbjct: 182 ETQLNSLNSQKAEKNALIAALAAKEASALGEKAALEEQ----KALAEAAAAEA--AKQEA 235

Query: 610 AMRMGERDSSRKERNEVEAA 629
           A +   ++ +  +     A 
Sbjct: 236 AAKAAAQEQAALQAAATAAQ 255


>gnl|CDD|191863 pfam07818, HCNGP, HCNGP-like protein.  This family comprises
           sequences bearing significant similarity to the mouse
           transcriptional regulator protein HCNGP. This protein is
           localised to the nucleus and is thought to be involved
           in the regulation of beta-2-microglobulin genes.
          Length = 96

 Score = 30.0 bits (68), Expect = 2.4
 Identities = 17/92 (18%), Positives = 35/92 (38%), Gaps = 12/92 (13%)

Query: 100 NIVKGSVTDGKLKKKEEVARIKKQCEDMNQ--EKNTALR---------ERNGLKQQCTA- 147
              KG  +     K E   ++K+Q    NQ  + +   R         +   + +  T  
Sbjct: 3   PEPKGECSPELNAKIEHFLKLKRQGTHFNQKLQNSKEFRNPSILEKLMDFFDIDEYGTNY 62

Query: 148 AIRQWDIALRERNEYQEALAKVQQQHEDAVKE 179
               +D  +    +Y E LAK Q++ ++  ++
Sbjct: 63  PKDIFDPHVFPSWDYYEELAKEQKEEQEKREK 94


>gnl|CDD|220484 pfam09943, DUF2175, Uncharacterized protein conserved in archaea
            (DUF2175).  This domain, found in various hypothetical
            archaeal proteins, has no known function.
          Length = 101

 Score = 30.1 bits (68), Expect = 2.4
 Identities = 22/92 (23%), Positives = 38/92 (41%), Gaps = 6/92 (6%)

Query: 1520 DNLFVDYRKKGAYFECTSVAAVKDACDKNNAHVILDVSLAGVERLHRQHVYPIVLLIKFK 1579
              LF    K   ++EC     V+DA  K          L  + RL       IVL  + +
Sbjct: 15   GELFTFTSKGPVHYECL----VEDASKKIKGDK--PEDLVALLRLEDYLHEGIVLAKELE 68

Query: 1580 STKQIKEIKEIRDGRYSSLDKVTAKAAKEMYE 1611
               + +EIK++       L+K+ AK  +++ +
Sbjct: 69   QVAESEEIKKVIRQVRKDLEKLAAKLTRKIEK 100


>gnl|CDD|185252 PRK15354, PRK15354, type III secretion system protein SsaK;
           Provisional.
          Length = 224

 Score = 31.6 bits (71), Expect = 2.5
 Identities = 23/84 (27%), Positives = 37/84 (44%), Gaps = 22/84 (26%)

Query: 594 AKEECQREAFHARFEYAMRMGE---RDSSRKERNEVEAASMDLFGKCQKERMDNLEANQE 650
           A+E+  +   HA    A R  E   RD+ R +R +                   +E  QE
Sbjct: 34  AQEQQAKRVSHAIVSSAYRKAEKIIRDAYRYQREQ------------------KVEQQQE 75

Query: 651 IDRLRKHT-DKLQAELQDQSVNYL 673
           +  LRK+T +K++ E  +Q V +L
Sbjct: 76  LACLRKNTLEKMEVEWLEQHVKHL 99


>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0).  Part of the CF(0)
           (base unit) of the ATP synthase. The base unit is
           thought to translocate protons through membrane (inner
           membrane in mitochondria, thylakoid membrane in plants,
           cytoplasmic membrane in bacteria). The B subunits are
           thought to interact with the stalk of the CF(1)
           subunits. This domain should not be confused with the ab
           CF(1) proteins (in the head of the ATP synthase) which
           are found in pfam00006.
          Length = 132

 Score = 30.7 bits (70), Expect = 2.5
 Identities = 21/77 (27%), Positives = 35/77 (45%), Gaps = 5/77 (6%)

Query: 462 RNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEY--SLIMSERD 519
             +    LA+ +QQ   A  E +    +   A K+ ++L EE  A  Q+    L+ S R 
Sbjct: 46  LKQAAALLAEAEQQLAQARAEAS---EIINNAKKEAQKLKEEILAEAQKDAERLLESARA 102

Query: 520 SVHKEMEKLSEDLTQAV 536
            + +E E+   +L Q V
Sbjct: 103 EIEQEKEQALAELRQQV 119


>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region.  This family
           represents an N-terminal conserved region found in
           several huntingtin-associated protein 1 (HAP1)
           homologues. HAP1 binds to huntingtin in a polyglutamine
           repeat-length-dependent manner. However, its possible
           role in the pathogenesis of Huntington's disease is
           unclear. This family also includes a similar N-terminal
           conserved region from hypothetical protein products of
           ALS2CR3 genes found in the human juvenile amyotrophic
           lateral sclerosis critical region 2q33-2q34.
          Length = 307

 Score = 32.1 bits (73), Expect = 2.6
 Identities = 42/180 (23%), Positives = 72/180 (40%), Gaps = 31/180 (17%)

Query: 516 SERDSVHKEMEKLSEDLTQAVR---------KIKTLEMENKELVEDKKTLSYQIETLKRE 566
           SE +S      +  E  + +           K+K LE EN+ L  +   L  +  T + +
Sbjct: 138 SEDESSESTPLRPQESSSSSHGCFQLEALQEKLKLLEEENEHLRSEASHLKTETVTYEEK 197

Query: 567 IASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEV 626
               ++D  K L+E ND      E  A K E                 + +  ++E   +
Sbjct: 198 EQQLVNDCVKQLREANDQIASLSEELAKKTE-----------------DLERQQEEITHL 240

Query: 627 EAASMDLFGKCQKERMDNLEANQ----EIDRLRKHTDKLQAELQDQSVNYLFILTESIQE 682
            +  +DL  KC+   ++N E  Q      D  R+   +LQ ELQD+    + +L E+ +E
Sbjct: 241 LSQIVDLQKKCKSYALENEELQQHLAAAKDAQRQLQAELQ-ELQDKYAECMEMLHEAQEE 299


>gnl|CDD|165222 PHA02896, PHA02896, A-type inclusion like protein; Provisional.
          Length = 616

 Score = 32.3 bits (73), Expect = 2.7
 Identities = 25/65 (38%), Positives = 36/65 (55%), Gaps = 10/65 (15%)

Query: 615 ERDSSRKER-NEVEAA----SMDLFG---KCQKERMDNLEANQEIDRLRKHTDKLQAELQ 666
           ++D S  +R N+ + A      D FG   +  KE+MDNLE +   D LRKH  +L  +L 
Sbjct: 544 DKDKSDDDRLNDADKAMRKLEADGFGDRYRKIKEKMDNLEDD--YDDLRKHAIELPKKLD 601

Query: 667 DQSVN 671
           +QS N
Sbjct: 602 NQSGN 606


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 32.0 bits (73), Expect = 2.7
 Identities = 52/303 (17%), Positives = 95/303 (31%), Gaps = 68/303 (22%)

Query: 318 QAAMAQLEAAAQESSSLRSKYSDL--ISDKQRLDRELLELRSGCCTQPESDDAGSADSLN 375
           Q A   L  AAQ  ++  S   DL  +  K RLD      R     + E     +A+   
Sbjct: 196 QKAAEYLRQAAQAGTTGSSSADDLADLLSKFRLDSWDKGSRFTRTEKVEQAATAAAEVPP 255

Query: 376 QHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKN 435
                       +  E D  ++R  +L           E  QE++  ++   E++  E  
Sbjct: 256 A----------EMDTEEDRTKEREAEL-----------EALQEQIDELESSIEEVLSE-- 292

Query: 436 TALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAV--------------K 481
                     +   + I+Q +  L    +  E L +  +  +  V               
Sbjct: 293 ---------IKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENNVAKLQA 343

Query: 482 EINTAM-----------AVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSE 530
            +  +            A R     + + L E+      E    + E   +  ++E+L  
Sbjct: 344 LVVASSERLLELAQQWEAHRTPLIDEYRSLKEKNRNKEDETQRQLDEIKKLRNKIEELES 403

Query: 531 DLTQAVRKIKTLEMENKELVEDKKTLSY---------QIETLKREIASALHDRDKALKEC 581
           +L    +  K L  E +   +     +Y          I+  K +I   L D     KE 
Sbjct: 404 ELQTKEQLYKQLLDEYENAPKSVSRSAYTRRILEIIKNIKKQKEDIDKILSDTRSLQKEI 463

Query: 582 NDL 584
           N++
Sbjct: 464 NNI 466



 Score = 32.0 bits (73), Expect = 2.9
 Identities = 29/177 (16%), Positives = 63/177 (35%), Gaps = 37/177 (20%)

Query: 481 KEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIK 540
            E++T      +   +L+ L E+ +        ++SE  ++  ++++++E+LT   ++ +
Sbjct: 256 AEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENE 315

Query: 541 TLEMENK------ELVED-----------------------------KKTLSYQIETLKR 565
            LE E K      EL+ D                             +  L  +  +LK 
Sbjct: 316 ELEEEYKIKKRTVELLPDAENNVAKLQALVVASSERLLELAQQWEAHRTPLIDEYRSLKE 375

Query: 566 EIASALHDRDKALKECNDLRERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKE 622
           +  +   +  + L E   LR +  E     E   +E  + +         +  SR  
Sbjct: 376 KNRNKEDETQRQLDEIKKLRNKIEE--LESELQTKEQLYKQLLDEYENAPKSVSRSA 430


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 31.8 bits (72), Expect = 2.8
 Identities = 27/114 (23%), Positives = 45/114 (39%), Gaps = 24/114 (21%)

Query: 490 RMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKEL 549
           R+    DLK   EE    ++E   +  E++ + KE+E+L  +  +   ++K LE+EN  L
Sbjct: 126 RVPEYMDLKEDYEELKEKLEE---LQKEKEELLKELEELEAEYEEVQERLKRLEVENSRL 182

Query: 550 VEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEECQREAF 603
            E                        K   E  DL++R+ E     E  + E  
Sbjct: 183 EEM---------------------LKKLPGEVYDLKKRWDELEPGVELPEEELI 215


>gnl|CDD|236802 PRK10942, PRK10942, serine endoprotease; Provisional.
          Length = 473

 Score = 32.0 bits (73), Expect = 2.9
 Identities = 36/149 (24%), Positives = 61/149 (40%), Gaps = 36/149 (24%)

Query: 1152 GVFVSTVSEHSLASQVGLQVGDQLLEVCGINMRS------------ATYQLAASVLRQCG 1199
            G FVS V  +S A++ G++ GD +  + G  + S               +L   +LR  G
Sbjct: 312  GAFVSQVLPNSSAAKAGIKAGDVITSLNGKPISSFAALRAQVGTMPVGSKLTLGLLRD-G 370

Query: 1200 NSITMLVQYSPDKYHELEGSGSSSAENESVSGRGSGEPRFLMIETRKCSNLGISLVGGNA 1259
              + + V        EL+ S  +  ++ ++         F  IE  + SN       G  
Sbjct: 371  KPVNVNV--------ELQQSSQNQVDSSNI---------FNGIEGAELSN------KGGD 407

Query: 1260 VGIYVHSVQSGSLGYSAGLRTGDRILEYN 1288
             G+ V +V+ G+     GL+ GD I+  N
Sbjct: 408  KGVVVDNVKPGTPAAQIGLKKGDVIIGAN 436



 Score = 31.7 bits (72), Expect = 3.6
 Identities = 16/58 (27%), Positives = 32/58 (55%), Gaps = 3/58 (5%)

Query: 192 GLQLNNSYDHGLTLES--GLYICKISPGSLAAKEGSLAVGDRVLSINNRTVDGLSSAR 247
           G +LN+     + +++  G ++ ++ P S AAK G  A GD + S+N + +   ++ R
Sbjct: 294 GTELNSELAKAMKVDAQRGAFVSQVLPNSSAAKAGIKA-GDVITSLNGKPISSFAALR 350


>gnl|CDD|148810 pfam07412, Geminin, Geminin.  This family contains the eukaryotic
           protein geminin (approximately 200 residues long).
           Geminin inhibits DNA replication by preventing the
           incorporation of MCM complex into prereplication
           complex, and is degraded during the mitotic phase of the
           cell cycle. It has been proposed that geminin inhibits
           DNA replication during S, G2, and M phases and that
           geminin destruction at the metaphase-anaphase transition
           permits replication in the succeeding cell cycle.
          Length = 200

 Score = 31.4 bits (71), Expect = 3.0
 Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 10/68 (14%)

Query: 498 KRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLS 557
           K + EER  A+ E            +E EKL +++ Q   +I  L+ EN+EL E  + + 
Sbjct: 110 KEVAEERRKALYE----------ALQENEKLHKEIEQKDEEIARLKKENEELAELAEHVQ 159

Query: 558 YQIETLKR 565
           Y  E ++R
Sbjct: 160 YMAEMIER 167


>gnl|CDD|221998 pfam13234, rRNA_proc-arch, rRNA-processing arch domain.  Mtr4 is
           the essential RNA helicase, and is an exosome-activating
           cofactor. This arch domain is carried in Mtr4 and Ski2
           (the cytosolic homologue of Mtr4). The arch domain is
           required for proper 5.8S rRNA processing, and appears to
           function independently of canonical helicase activity.
          Length = 266

 Score = 31.8 bits (73), Expect = 3.0
 Identities = 26/118 (22%), Positives = 51/118 (43%), Gaps = 34/118 (28%)

Query: 478 DAVKEINTAMAVRMKATKDLKRLTEERNA---AMQEYSLIMSER--------DSVHKEME 526
             +  I+   +VR+   KDL+   E R +   +++E    + +R        D + ++M+
Sbjct: 158 SCIDAIS---SVRLYLPKDLRSE-EARESVLKSLKE----VKKRFPDGIPLLDPI-EDMK 208

Query: 527 KLSEDLTQAVRKIKTLE--MENKELVED------------KKTLSYQIETLKREIASA 570
              +   + +RKI+ LE  + +  L               K  L  +I+ LK++I SA
Sbjct: 209 IKDDSFKKLLRKIEVLESRLLSNPLHNSPRLEELYDQYSKKVELEEEIKALKKKIKSA 266


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
           the bacterial cell cycle, the tubulin-like cell-division
           protein FtsZ polymerises into a ring structure that
           establishes the location of the nascent division site.
           EzrA modulates the frequency and position of FtsZ ring
           formation.
          Length = 559

 Score = 32.1 bits (74), Expect = 3.0
 Identities = 72/308 (23%), Positives = 124/308 (40%), Gaps = 58/308 (18%)

Query: 384 KYETIKDEYDALRKRFDDLINTHS--TAVDKLELAQEEVARIKKQCEDMNQEKN--TALR 439
           + E +KD+Y  LRK    L  + S   A+D+LE   +E+    +Q  ++ +  +   A  
Sbjct: 137 EVEELKDKYRELRKTL--LAKSFSYGPALDELEKQLDELEEEFEQFVELTESGDYLEARE 194

Query: 440 ERNGLKQQCTAAIRQWDIA-----LRE-RNEYQEALAKVQQ------------QHEDAVK 481
               L+++ T A+ Q  +      L+E +NE+ + L +++              H D  K
Sbjct: 195 VLLKLEEE-TDALEQ-KMEEIPPLLKELQNEFPDQLEELKAGYREMTEEGYHFDHLDIEK 252

Query: 482 EINTAMAVRMKATKDLKRL----TEERNAAMQE-----YSLIMSE---RDSVHKEMEKLS 529
           E+        +    L+ L     EE N  ++E     Y ++  E   +  V K ++KL+
Sbjct: 253 ELQDLKEQIDQNLALLEELDLDEAEEENEEIEERIDTLYDILEKEVKAKKFVEKNIDKLT 312

Query: 530 EDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECND-LRERF 588
           + L  A  + K L +E   L +     SY +   + E    L  + + L+   D L ER 
Sbjct: 313 DFLEHAREQNKQLLLELDRLQQ-----SYTLNEDELETVRELEKQLEELEAQYDQLVERI 367

Query: 589 GEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEAN 648
            E   A  E Q        E            K+  E+E    +L    Q  R D LEA 
Sbjct: 368 AEKKVAYSELQ-----EELEEIE---------KQLEEIEKEQEELSESLQGLRKDELEAR 413

Query: 649 QEIDRLRK 656
           +++   R+
Sbjct: 414 EKLQEYRQ 421



 Score = 31.8 bits (73), Expect = 4.0
 Identities = 60/335 (17%), Positives = 115/335 (34%), Gaps = 86/335 (25%)

Query: 391 EYDALRKRFDDLINTHSTAVDK-LELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCT 449
            ++  R+++DD++      +++ L  A+E   + +       + K  A++E   L     
Sbjct: 61  TFEEWRQKWDDIVTNSLPDIEELLFEAEELNDKFR-----FLKAKK-AIKEIEELLDTIE 114

Query: 450 AAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERN--AA 507
             I Q            E L ++++  E   KE+        +  K L  L +  +   A
Sbjct: 115 EDIEQ----------ILEELNELKESEEKNRKEVEELKDKYRELRKTL--LAKSFSYGPA 162

Query: 508 MQEYSLIMSERDSVHKEMEKLSE--DLTQAVRKIKTLEMENKELVED-----------KK 554
           + E    + E +   ++  +L+E  D  +A   +  LE E   L +            + 
Sbjct: 163 LDELEKQLDELEEEFEQFVELTESGDYLEAREVLLKLEEETDALEQKMEEIPPLLKELQN 222

Query: 555 TLSYQIETLK---------------REIASALHDRDKALKECNDLRERFGEFTAAKEECQ 599
               Q+E LK                +I   L D  + + +   L E   +   A+EE +
Sbjct: 223 EFPDQLEELKAGYREMTEEGYHFDHLDIEKELQDLKEQIDQNLALLEEL-DLDEAEEENE 281

Query: 600 R-----EAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEIDRL 654
                 +  +   E                EV+A       K  ++ +D L     ++  
Sbjct: 282 EIEERIDTLYDILE---------------KEVKA------KKFVEKNIDKLT--DFLEHA 318

Query: 655 RKHTDKLQAELQ--DQSVNYLFILTESIQEAEVSK 687
           R+   +L  EL    QS    + L E   E E  +
Sbjct: 319 REQNKQLLLELDRLQQS----YTLNED--ELETVR 347


>gnl|CDD|237296 PRK13188, PRK13188, bifunctional UDP-3-O-[3-hydroxymyristoyl]
            N-acetylglucosamine
            deacetylase/(3R)-hydroxymyristoyl-[acyl-carrier-protein]
            dehydratase; Reviewed.
          Length = 464

 Score = 31.8 bits (73), Expect = 3.1
 Identities = 15/33 (45%), Positives = 21/33 (63%), Gaps = 1/33 (3%)

Query: 1148 LDSGGVFVSTVSEHSLASQVGLQVGDQLLEVCG 1180
            L+  GV V TV EH LA+  GL + + L+E+ G
Sbjct: 67   LEKNGVKVHTV-EHVLAALYGLGIDNCLIELDG 98


>gnl|CDD|206365 pfam14197, Cep57_CLD_2, Centrosome localisation domain of PPC89.
           The N-terminal region of the fission yeast spindle pole
           body protein PPC89 has low similarity to the human Cep57
           protein. The CLD or centrosome localisation domain of
           Cep57 and PPC89 is found at the N-terminus. This region
           localises to the centrosome internally to gamma-tubulin,
           suggesting that it is either on both centrioles or on a
           centromatrix component. This N-terminal region can also
           multimerise with the N-terminus of other Cep57
           molecules. The C-terminal part, Family Cep57_MT_bd,
           pfam06657, is the microtubule-binding region of Cep57
           and PPC89.
          Length = 69

 Score = 28.8 bits (65), Expect = 3.4
 Identities = 15/63 (23%), Positives = 26/63 (41%), Gaps = 1/63 (1%)

Query: 524 EMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECND 583
           E   L   L  A RK+   E+E K L  ++ +   Q+     E    L   ++AL++  +
Sbjct: 6   ENLTLRNRLDSATRKVAVHEIELKRLNRERDSALRQLGVAYLECQE-LKAENEALRKELE 64

Query: 584 LRE 586
              
Sbjct: 65  ELR 67


>gnl|CDD|204632 pfam11365, DUF3166, Protein of unknown function (DUF3166).  This
           eukaryotic family of proteins has no known function.
          Length = 97

 Score = 29.8 bits (67), Expect = 3.4
 Identities = 14/55 (25%), Positives = 27/55 (49%), Gaps = 9/55 (16%)

Query: 323 QLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQH 377
           QL+   +E++ +R K + L  +K +++ EL + +S           G  DS  +H
Sbjct: 9   QLQFVKEEAALMRKKLAKLEEEKDKVEGELQKYKSK---------YGDLDSGAEH 54


>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1.  This is
           the N-terminal 200 residues of a set of proteins
           conserved from yeasts to humans. Most of the proteins in
           this entry have an RhoGEF pfam00621 domain at their
           C-terminal end.
          Length = 158

 Score = 30.9 bits (70), Expect = 3.5
 Identities = 36/125 (28%), Positives = 58/125 (46%), Gaps = 8/125 (6%)

Query: 464 EYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHK 523
           E  ++LA   Q+ E    E+        + +   +R  E R  A Q+      E +   K
Sbjct: 36  ENLDSLASENQELE---VELELLREDNERLSTQYEREKELRKQAEQKLLEFEDELEQEKK 92

Query: 524 EMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRD-KALKECN 582
           E++K  EDL + VR+   LE++ K L +    L  +   LK+E  ++LH+R  K LK   
Sbjct: 93  ELQKKIEDLEENVRQ---LELKAKNLSDQVSRLEERETELKKEY-NSLHERYTKLLKNYV 148

Query: 583 DLRER 587
           +  ER
Sbjct: 149 EYVER 153


>gnl|CDD|182723 PRK10779, PRK10779, zinc metallopeptidase RseP; Provisional.
          Length = 449

 Score = 31.6 bits (72), Expect = 3.7
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query: 1267 VQSGSLGYSAGLRTGDRILEYNGTDLRA 1294
            VQ  S    AGL+ GDRI++ +G  L  
Sbjct: 228  VQPNSAASKAGLQAGDRIVKVDGQPLTQ 255


>gnl|CDD|218852 pfam06008, Laminin_I, Laminin Domain I.  coiled-coil structure. It
           has been suggested that the domains I and II from
           laminin A, B1 and B2 may come together to form a triple
           helical coiled-coil structure.
          Length = 263

 Score = 31.2 bits (71), Expect = 3.7
 Identities = 41/220 (18%), Positives = 73/220 (33%), Gaps = 36/220 (16%)

Query: 475 QHEDAVKEINTAMAVRM---------KATKDLKRLTEERNAAMQEYSLIMSERDSVHKEM 525
           Q +  +  +N   A+             T+ L       +A  ++   +      + ++ 
Sbjct: 1   QLDSVLLSLNLTGALPAPYKILYNIENLTQQLSGRLSPISATKRQLEDLEKALSKLAQDA 60

Query: 526 EKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALKECNDLR 585
           E L +   QA+ K + +    +  +   KTL+  I+ L+  I           KE  +  
Sbjct: 61  ENLQKKANQALAKAQKVNAAIERTLGHAKTLAEAIKNLRDNI-----------KEILEKL 109

Query: 586 ERFGE--FTAAKEECQREAFHARFEYAMRMGERD-SSRKERNEVE--------AASMDLF 634
              GE  F        R    A+      + ERD   + +  E E              F
Sbjct: 110 AGLGENAFALPGSALSRMLAEAQR-MLGEIRERDFQKQLQNAEAELKAAEELLNRVQTWF 168

Query: 635 GKCQKERMDNLEANQEIDRLRKHTDKLQA--ELQDQSVNY 672
              Q+E     EA +  D L ++  KL    EL D++   
Sbjct: 169 QTPQEENEGLFEALR--DDLNEYEAKLSDLRELLDEAQAK 206


>gnl|CDD|214016 cd12923, iSH2_PI3K_IA_R, Inter-Src homology 2 (iSH2) helical domain
           of Class IA Phosphoinositide 3-kinase Regulatory
           subunits.  PI3Ks catalyze the transfer of the
           gamma-phosphoryl group from ATP to the 3-hydroxyl of the
           inositol ring of D-myo-phosphatidylinositol (PtdIns) or
           its derivatives. They play an important role in a
           variety of fundamental cellular processes, including
           cell motility, the Ras pathway, vesicle trafficking and
           secretion, immune cell activation, and apoptosis. They
           are classified according to their substrate specificity,
           regulation, and domain structure. Class IA PI3Ks are
           heterodimers of a p110 catalytic (C) subunit and a
           p85-related regulatory (R) subunit. The R subunit
           down-regulates PI3K basal activity, stabilizes the C
           subunit, and plays a role in the activation downstream
           of tyrosine kinases. All R subunits contain two SH2
           domains that flank an intervening helical domain (iSH2),
           which binds to the N-terminal adaptor-binding domain
           (ABD) of the catalytic subunit. In vertebrates, there
           are three genes (PIK3R1, PIK3R2, and PIK3R3) that encode
           for different Class IA PI3K R subunits.
          Length = 152

 Score = 30.7 bits (70), Expect = 3.7
 Identities = 17/74 (22%), Positives = 30/74 (40%), Gaps = 14/74 (18%)

Query: 503 ERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIET 562
           E+   M+   L+ S    + +  E+L EDL + V   + LE E              + +
Sbjct: 73  EKQRLMENNELLKSRLKELEESKEQLEEDLRKQVAYNRELERE--------------MNS 118

Query: 563 LKREIASALHDRDK 576
           LK E+      +D+
Sbjct: 119 LKPELMQLRKQKDQ 132


>gnl|CDD|226115 COG3587, COG3587, Restriction endonuclease [Defense mechanisms].
          Length = 985

 Score = 32.1 bits (73), Expect = 3.7
 Identities = 99/568 (17%), Positives = 188/568 (33%), Gaps = 110/568 (19%)

Query: 130 EKNTALRERNGLKQQC------TAAIRQWDIALRERNEYQEALA-----KVQQQHEDAVK 178
           EK  AL ++ G+K          A+ R +D      NE    L        +Q++ + +K
Sbjct: 372 EKEEALFDKRGIKALSLFFIDKIASYRSYD------NELPAKLKGLFEKLYKQKYAEVLK 425

Query: 179 EINTAMAVTNYLDGLQLNNSYDHGLTLESGLYICKISPGSLAAKEGSLAVGD-------- 230
           +        +YL     +    HG     G +       S   KEG  A+ D        
Sbjct: 426 KELDRSFYRDYLQSTDDDILKVHG-----GYF-------SKDNKEGDEAIEDEIDLILKD 473

Query: 231 --RVLSINN--RTVDGLSSAREGMALLSDSPDVLTITTLKPTSLGEHSPRVHRKSVNSST 286
             ++LS +   R +    + REG     D+P+V TI  L+ +            S  S  
Sbjct: 474 KEKLLSFDEPLRFIFSKWALREGW----DNPNVFTICKLRSSG-----------SEISKL 518

Query: 287 QT----MPLYVRQHNETVRRGDHTLKELEYYRGQHQAAMAQLEAAAQESSSLRSKYSDLI 342
           Q     + L V ++ E V +      EL     + +      +A  +E +     +    
Sbjct: 519 QEVGRGLRLAVNENGERVTKDFDFPNELTVLVNESEKDFV--KALQKEIN--DESFIKQG 574

Query: 343 SDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDL 402
             K   +   L L+      PE+ +      L +     LN+ E  + E        D L
Sbjct: 575 FAKDEEELSKLILKERGKVLPETLEKLLFGDLIKDDYLDLNENEFAEKE-----PELDKL 629

Query: 403 INTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWD------ 456
            +    A +K++   ++      +  +  + K   ++ R    ++  A    W+      
Sbjct: 630 KDELYEAFEKIKDLIKDRLIGADRIRNAKERKKEKIKIRKENFKEFKAL---WEKINHKA 686

Query: 457 ---IALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLT------EERNA- 506
                +       E +  + ++   + ++I+     ++    +   +       E  +A 
Sbjct: 687 VLQYQIDSEALIDEIIKNILREFAVSAEQISYTTQKKVITMNNGLIIEEAGEYFEREDAI 746

Query: 507 --AMQEYSLIMSE--------RDSVHKEMEKLSED-LTQAVRKIKTLEMEN-KELVEDKK 554
             +  +Y   +S         R S+HK +E + E+ L +     K    E  K+L     
Sbjct: 747 VWSSMKYHEFLSALANKTKLPRKSLHKVLEAIKENKLNE----FKKNPQEGLKQLSNIFL 802

Query: 555 TLSYQ-----IETLKREIASALHDRDKALKECNDLRERF-GEFTAAKEECQREAFHARFE 608
            +        I    R         D    E  ++RE    E    KE     A    +E
Sbjct: 803 EIQLLNIVHKISYQMRLTTIKNSKTDAFYDEDGEIREELKAELGRKKENANSPAREYLYE 862

Query: 609 YAMRMGERDSSRKERNEVEAASMDLFGK 636
                 + +  ++   E+E   + +FGK
Sbjct: 863 EIFYQVDSEIEKEFVEELENTGVIVFGK 890


>gnl|CDD|188670 cd08715, RGS_RGS1, Regulator of G protein signaling (RGS) domain
           found in the RGS1 protein.  The RGS (Regulator of
           G-protein Signaling) domain is an essential part of the
           RGS1 protein. RGS1 is a member of the R4/RGS subfamily
           of the RGS family, a diverse group of multifunctional
           proteins that regulate cellular signaling events
           downstream of G-protein coupled receptors (GPCRs). RGS
           proteins play critical regulatory roles as GTPase
           activating proteins (GAPs) of the heterotrimeric
           G-protein G-alpha subunits. The RGS domain controls
           G-protein signaling by accelerating the GTPase activity
           of the G-alpha subunit which leads to G protein
           deactivation and promotes desensitization. As a major
           G-protein regulator, RGS domain containing proteins are
           involved in many crucial cellular processes such as
           regulation of intracellular trafficking, glial
           differentiation, embryonic axis formation, skeletal and
           muscle development, and cell migration during early
           embryogenesis.  RGS 1 is expressed predominantly in
           hematopoietic compartments, including T and B
           lymphocytes, and may play a major role in
           chemokine-mediated homing of lymphocytes to secondary
           lymphoid organs. In addition, RGS1 interacts with
           calmodulin and 14-3-3 protein outside of the GPCR
           pathway.
          Length = 114

 Score = 29.9 bits (67), Expect = 3.8
 Identities = 15/45 (33%), Positives = 19/45 (42%)

Query: 476 HEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDS 520
             DA K+IN     R    K +K  T       Q+   I+ ERDS
Sbjct: 57  QSDAAKQINIDFRTRESTAKKIKAPTPTCFDEAQKVIYILMERDS 101


>gnl|CDD|153339 cd07655, F-BAR_PACSIN, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Protein kinase C and
           Casein kinase Substrate in Neurons (PACSIN) proteins.
           F-BAR domains are dimerization modules that bind and
           bend membranes and are found in proteins involved in
           membrane dynamics and actin reorganization. Protein
           kinase C and Casein kinase Substrate in Neurons (PACSIN)
           proteins, also called Synaptic dynamin-associated
           proteins (Syndapins), act as regulators of cytoskeletal
           and membrane dynamics. They bind both dynamin and
           Wiskott-Aldrich syndrome protein (WASP), and may provide
           direct links between the actin cytoskeletal machinery
           through WASP and dynamin-dependent endocytosis.
           Vetebrates harbor three isoforms with distinct
           expression patterns and specific functions. PACSINs
           contain an N-terminal F-BAR domain and a C-terminal SH3
           domain. F-BAR domains form banana-shaped dimers with a
           positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 258

 Score = 31.1 bits (71), Expect = 3.8
 Identities = 26/105 (24%), Positives = 44/105 (41%), Gaps = 10/105 (9%)

Query: 459 LRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSL----I 514
            +E  E ++  AK Q+     +K++  A      A K  K   ++ N A  + SL    +
Sbjct: 111 FKETKEAEDGFAKAQKPWAKLLKKVEKAKKAYHAACKAEKSAQKQENNAKSDTSLSPDQV 170

Query: 515 MSERDSVHK---EMEKLSEDLTQAVRKIKTLE---MENKELVEDK 553
              +D V K   E+ K  +   +A+  +       ME+ E V DK
Sbjct: 171 KKLQDKVEKCKQEVSKTKDKYEKALEDLNKYNPRYMEDMEQVFDK 215


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 30.2 bits (69), Expect = 3.9
 Identities = 14/110 (12%), Positives = 39/110 (35%), Gaps = 9/110 (8%)

Query: 379 MSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTAL 438
           +      +  + + +   K+    +      + KL+       +++K    +++      
Sbjct: 10  LQESPAGKAAQKQLEKEFKKRQAELEKLEKELQKLK------EKLQKDAATLSEAAREKK 63

Query: 439 RER-NGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAM 487
            +      Q+     ++    L++R   QE L K+  +   A+KE+    
Sbjct: 64  EKELQKKVQEFQRKQQKLQQDLQKR--QQEELQKILDKINKAIKEVAKKK 111


>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family.  In a
           subset of endospore-forming members of the Firmcutes,
           members of this protein family are found, several to a
           genome. Two very strongly conserved sequences regions
           are separated by a highly variable linker region. Much
           of the linker region was excised from the seed alignment
           for this model. A characterized member is the
           prespore-specific transcription RsfA from Bacillus
           subtilis, previously called YwfN, which is controlled by
           sigma factor F and seems to fine-tune expression of some
           genes in the sigma-F regulon. A paralog in Bacillus
           subtilis is designated YlbO [Regulatory functions, DNA
           interactions, Cellular processes, Sporulation and
           germination].
          Length = 161

 Score = 30.5 bits (69), Expect = 4.1
 Identities = 24/121 (19%), Positives = 55/121 (45%), Gaps = 11/121 (9%)

Query: 455 WDIALRERNEYQEALAKVQQQHEDAVKEINT-AMAVRMKATKDLKRLTEERNAAMQEYSL 513
           W+  +R++  Y+EA+   ++Q ++  +E  +  +   +   ++LK       A  +E   
Sbjct: 51  WNAYVRKQ--YEEAIELAKKQRKELKREAGSLTLQDVISFLQNLKTTNPSDQALQKENER 108

Query: 514 IMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHD 573
           + ++ +S+ K  E+L        ++++ L      + ED +TL   I    R++A    D
Sbjct: 109 LKNQNESLQKRNEELE-------KELEKLRQRLSTIEEDYQTL-IDIMDRARKLAVVEDD 160

Query: 574 R 574
            
Sbjct: 161 E 161


>gnl|CDD|234035 TIGR02860, spore_IV_B, stage IV sporulation protein B.  SpoIVB, the
            stage IV sporulation protein B of endospore-forming
            bacteria such as Bacillus subtilis, is a serine
            proteinase, expressed in the spore (rather than mother
            cell) compartment, that participates in a proteolytic
            activation cascade for Sigma-K. It appears to be
            universal among endospore-forming bacteria and occurs
            nowhere else [Cellular processes, Sporulation and
            germination].
          Length = 402

 Score = 31.5 bits (72), Expect = 4.1
 Identities = 15/54 (27%), Positives = 28/54 (51%), Gaps = 16/54 (29%)

Query: 1252 ISLV-GGNAVGIYVHSVQSGSLGYS---------------AGLRTGDRILEYNG 1289
            I ++ GG ++G+ +++     +G+S               AG++ GDRIL+ NG
Sbjct: 89   IKVIPGGQSIGVKLNTKGVLVVGFSDIETEKGKIHSPGEEAGIQIGDRILKING 142


>gnl|CDD|222066 pfam13353, Fer4_12, 4Fe-4S single cluster domain.  This family
          includes proteins containing domains which bind to
          iron-sulfur clusters. Members include bacterial
          ferredoxins, various dehydrogenases, and various
          reductases. The structure of the domain is an
          alpha-antiparallel beta sandwich.
          Length = 140

 Score = 30.2 bits (69), Expect = 4.3
 Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 2/35 (5%)

Query: 57 QEWDTEVLDMNLTNLSPNEALGLELVGGRDDPFYP 91
          + +  E+ D  + +L      GL L+GG  +P  P
Sbjct: 34 EPFTEELEDEIIEDLKDPYISGLTLLGG--EPLLP 66


>gnl|CDD|233816 TIGR02302, aProt_lowcomp, TIGR02302 family protein.  Members of
           this family are long (~850 residue) bacterial proteins
           from the alpha Proteobacteria. Each has 2-3 predicted
           transmembrane helices near the N-terminus and a long
           C-terminal region that includes stretches of
           Gln/Gly-rich low complexity sequence, predicted by TMHMM
           to be outside the membrane. In Bradyrhizobium japonicum,
           two tandem reading frames are together homologous the
           single members found in other species; the cutoffs
           scores are set low enough that the longer scores above
           the trusted cutoff and the shorter above the noise
           cutoff for this model.
          Length = 851

 Score = 31.5 bits (71), Expect = 4.4
 Identities = 34/143 (23%), Positives = 60/143 (41%), Gaps = 33/143 (23%)

Query: 437 ALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKD 496
           ALR+           I   D++  ER      L   Q   +DA++        R  + ++
Sbjct: 478 ALRDVADNLWSLALGIEDGDLSDAER-----RLRAAQDALKDALE--------RGASDEE 524

Query: 497 LKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELV--EDKK 554
           +K+LT++  AAMQ Y             M +L++ L    +++      N +++  +D +
Sbjct: 525 IKQLTDKLRAAMQTY-------------MRQLAQQLRNNPQQLARPLDPNTKVLRQQDLQ 571

Query: 555 TLSYQIETLKREIASALHDRDKA 577
            +  QIE L R       DRD+A
Sbjct: 572 NMMDQIENLARS-----GDRDQA 589


>gnl|CDD|234999 PRK02101, PRK02101, hypothetical protein; Validated.
          Length = 255

 Score = 31.0 bits (71), Expect = 4.5
 Identities = 18/75 (24%), Positives = 33/75 (44%), Gaps = 5/75 (6%)

Query: 1290 TDLRAATAEEAAYELAKPADKVTVLAQYSVERYNEVKDKPGDSFYMRALFSRTGDLGDAL 1349
              LR  + ++ A  L K +DK   LA  +  R+ + + +   +    A+ + TGD+ D L
Sbjct: 36   DALRELSPDDLA-SLMKISDK---LAGLNAARFQDWQPEFTTANARPAILAFTGDVYDGL 91

Query: 1350 QLR-FHKDDILYVDN 1363
                  + D+ Y   
Sbjct: 92   DADSLSEADLDYAQQ 106


>gnl|CDD|177681 PLN00049, PLN00049, carboxyl-terminal processing protease;
            Provisional.
          Length = 389

 Score = 31.2 bits (71), Expect = 4.5
 Identities = 18/53 (33%), Positives = 23/53 (43%)

Query: 1256 GGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYELAKPA 1308
             G   G+ V +   G     AG+R GD IL  +GT     +  EAA  L  P 
Sbjct: 98   DGPPAGLVVVAPAPGGPAARAGIRPGDVILAIDGTSTEGLSLYEAADRLQGPE 150


>gnl|CDD|182820 PRK10898, PRK10898, serine endoprotease; Provisional.
          Length = 353

 Score = 31.1 bits (71), Expect = 4.8
 Identities = 22/81 (27%), Positives = 40/81 (49%), Gaps = 10/81 (12%)

Query: 802 IRRRRV--GARSLVTTQLQLNNSYDHGLTLESGLYICKISPGSLAAKEGSLAVGDRVLSI 859
           IR  RV  G   +   ++   ++   G+    G+ + ++SP   AAK G + V D ++S+
Sbjct: 248 IRDGRVIRGYIGIGGREIAPLHAQGGGIDQLQGIVVNEVSPDGPAAKAG-IQVNDLIISV 306

Query: 860 NNR-------TVDGLSSAREG 873
           NN+       T+D ++  R G
Sbjct: 307 NNKPAISALETMDQVAEIRPG 327



 Score = 30.7 bits (70), Expect = 6.4
 Identities = 17/60 (28%), Positives = 32/60 (53%), Gaps = 8/60 (13%)

Query: 197 NSYDHGLTLESGLYICKISPGSLAAKEGSLAVGDRVLSINNR-------TVDGLSSAREG 249
           ++   G+    G+ + ++SP   AAK G + V D ++S+NN+       T+D ++  R G
Sbjct: 269 HAQGGGIDQLQGIVVNEVSPDGPAAKAG-IQVNDLIISVNNKPAISALETMDQVAEIRPG 327


>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR).  This
           family consists of several mammalian alpha helical
           coiled-coil rod HCR proteins. The function of HCR is
           unknown but it has been implicated in psoriasis in
           humans and is thought to affect keratinocyte
           proliferation.
          Length = 739

 Score = 31.6 bits (71), Expect = 4.9
 Identities = 62/291 (21%), Positives = 111/291 (38%), Gaps = 39/291 (13%)

Query: 406 HSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEY 465
           H+ +  +L   Q++VA ++++    +QE+    R     + +         +   E +  
Sbjct: 329 HADSTCQL---QKQVAELQEEVTSQSQEQAILQRSLQDKEAEVEVERMNSKVLQGELSRA 385

Query: 466 QEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEM 525
           QEA  ++QQQ   A ++              L+ + E  +++ Q     M+E +    ++
Sbjct: 386 QEARRRLQQQVASAEEQ--------------LQLVVEAVHSSQQGLLATMAEVEVALAQL 431

Query: 526 EKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALHDRDKALK-ECNDL 584
             LS  L+ AVRK+ T+    + LV  K  L+     L++E +       + L  E   L
Sbjct: 432 PSLSNRLSYAVRKVHTI----QGLVARKVALA----QLRQESSPPAPAEAEDLSTELRQL 483

Query: 585 RERFGEFTAAKEECQREAFHARFEYAMRMGERDSSRKERNEVEAASMDLFGKCQKERMDN 644
           RE      A   E Q     AR         R+   +ER  +   +  L  + Q+ +   
Sbjct: 484 REERNRLDA---ELQ---LSARLIQQEVGRAREQGEEERQRLAEVAQQLERELQESQESL 537

Query: 645 LEANQEIDRLRK-------HTDKLQAELQDQSVNYLFILTESIQEAEVSKR 688
             A  +++  R            L+ EL  Q   Y   L E + E E   R
Sbjct: 538 ASAGAQLEAARAGQQESTEEAASLRQELTQQQEVYGQALQEKVAEVETRLR 588


>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
           in cytoskeletal structure; family members include
           spectrin, alpha-actinin and dystrophin; the spectrin
           repeat forms a three helix bundle with the second helix
           interrupted by proline in some sequences; the repeats
           are independent folding units; tandem repeats are found
           in differing numbers and arrange in an antiparallel
           manner to form dimers; the repeats are defined by a
           characteristic tryptophan (W) residue in helix A and a
           leucine (L) at the carboxyl end of helix C and separated
           by a linker of 5 residues; two copies of the repeat are
           present here.
          Length = 213

 Score = 30.9 bits (70), Expect = 4.9
 Identities = 25/147 (17%), Positives = 58/147 (39%), Gaps = 11/147 (7%)

Query: 463 NEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVH 522
           NE  E L +      + ++E    +  R +    L+ L EER   ++E   +        
Sbjct: 60  NELGEQLIEEGHPDAEEIQERLEELNQRWEE---LRELAEERRQRLEEALDLQQF----F 112

Query: 523 KEMEKLSEDLTQAVRKIKTLEM-ENKELVEDKKTLSYQIETLKREIASALHDRDKALKEC 581
           ++ + L + L +    + + ++ ++ E VE+      + + L+ E+ +         +  
Sbjct: 113 RDADDLEQWLEEKEAALASEDLGKDLESVEELLK---KHKELEEELEAHEPRLKSLNELA 169

Query: 582 NDLRERFGEFTAAKEECQREAFHARFE 608
            +L E        + E + E  + R+E
Sbjct: 170 EELLEEGHPDADEEIEEKLEELNERWE 196


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 31.7 bits (72), Expect = 5.0
 Identities = 37/180 (20%), Positives = 46/180 (25%), Gaps = 16/180 (8%)

Query: 920  GDQPPPDSYQVPKTSTKSHDPAKPSPSSSGGGGKWDSLREKMQGVRRRGGGEEGKRSNRN 979
             + P         +S+       PSPSS G G    S R        R   E    S  +
Sbjct: 275  WNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSR---ESSSSSTSS 331

Query: 980  SSPPVTFEQEQVDAIAELDSFVHTHSPSDGSSPHYHKAPRSLASAHSHPGGPNNGAGGTS 1039
            SS         V             SPS    P                   +  A    
Sbjct: 332  SSESS--RGAAVSPGPSPSR-----SPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGR 384

Query: 1040 YETSSYHSYDAAGTFPRKKENQRF---RIPSNPSVAFKTSGTKMSNGSIEKTSERGSPMP 1096
              T        AG   R+    RF   R   +P  A   SG   +   +   S  G P P
Sbjct: 385  -PTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPS--GEPWP 441


>gnl|CDD|222469 pfam13949, ALIX_LYPXL_bnd, ALIX V-shaped domain binding to HIV.
           The binding of the LYPxL motif of late HIV p6Gag and
           EIAV p9Gag to this domain is necessary for viral
           budding.This domain is generally central between an
           N-terminal Bro1 domain, pfam03097 and a C-terminal
           proline-rich domain. The retroviruses thus used this
           domain to hijack the ESCRT system of the cell.
          Length = 292

 Score = 31.1 bits (71), Expect = 5.2
 Identities = 40/275 (14%), Positives = 96/275 (34%), Gaps = 60/275 (21%)

Query: 320 AMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLELRSGCCTQPESDDAGSADSLNQHYM 379
               L+    E   LR+KY +                    T+P S        L     
Sbjct: 48  IKELLDEEESEDEQLRAKYGERW------------------TRPPS------SELTAELR 83

Query: 380 SALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQ--EKNTA 437
             L KY              +    + +    KLE  +E +  +    E++      ++ 
Sbjct: 84  EELRKYRGY----------LEQASESDAQLRSKLEEHEENLELLSGPEEELEALLPSSSP 133

Query: 438 LRERNGLKQ--QCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATK 495
            +     +Q  +    + + +    ER +  E L   ++  +D + ++   +A+    + 
Sbjct: 134 SKTPEVSEQISRLRELLNKLNELKAEREKLLEELK--EKAQDDDISKL--LIALNKLGSS 189

Query: 496 DLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKT 555
           + ++L EE    ++++      ++ V + + K  E L +       ++  N+E ++++K 
Sbjct: 190 NEEQLFEEE---LKKFD---PLQERVEQNLSKQEELLKE-------IQEANEEFLQERK- 235

Query: 556 LSYQIETLKREIASALHDRDKALKECNDLRERFGE 590
                ++ ++E    L   + A  +  +L     E
Sbjct: 236 ----SDSEQKEREKVLQKLENAYDKYKELSSNLEE 266


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 31.4 bits (71), Expect = 5.2
 Identities = 41/174 (23%), Positives = 77/174 (44%), Gaps = 15/174 (8%)

Query: 416 AQEEVARIKKQCE-DMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQ 474
           A++E   ++ + E D    K TA RE   LK++     ++      E  +Y+E + +  +
Sbjct: 40  AEQEAVNLRGKAERDAEHIKKTAKRESKALKKELLLEAKE------EARKYREEIEQEFK 93

Query: 475 QHEDAVKEINTAMAVRMKA--TKDLKRLTEERNAAMQEYSLIMSER--DSVHKEMEKLSE 530
                +K+I + +  R  +   KD    ++E+    +E SL    +  D   +++EKL E
Sbjct: 94  SERQELKQIESRLTERATSLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEE 153

Query: 531 DLTQAVRKIKTLEMENKE---LVEDKKTLSYQIETLKREIASALHDR-DKALKE 580
                + ++  L         L E +  L+++I T  RE    + DR DK  K+
Sbjct: 154 QKKAELERVAALSQAEAREIILAETENKLTHEIATRIREAEREVKDRSDKMAKD 207


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 31.1 bits (70), Expect = 5.3
 Identities = 22/104 (21%), Positives = 45/104 (43%), Gaps = 5/104 (4%)

Query: 411 DKLELAQEEVARIKK----QCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQ 466
            + +  ++    +K     + E + Q +   L+ +   KQ   A  +Q  +  +++ E  
Sbjct: 80  QRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAE-KQAQLEQKQQEEQA 138

Query: 467 EALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQE 510
              A  Q++  +A K    A A ++KA  + K+  EE   A +E
Sbjct: 139 RKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEE 182


>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
           The small mechanosensitive channel, MscS, is a part of
           the turgor-driven solute efflux system that protects
           bacteria from lysis in the event of osmotic shock. The
           MscS protein alone is sufficient to form a functional
           mechanosensitive channel gated directly by tension in
           the lipid bilayer. The MscS proteins are heptamers of
           three transmembrane subunits with seven converging M3
           domains, and this MscS_porin is towards the N-terminal
           of the molecules. The high concentration of negative
           charges at the extracellular entrance of the pore helps
           select the cations for efflux.
          Length = 239

 Score = 30.7 bits (70), Expect = 5.3
 Identities = 34/198 (17%), Positives = 80/198 (40%), Gaps = 22/198 (11%)

Query: 362 QPESDDAGSADSLNQHYMSALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVA 421
            P  +       L Q  ++ L++ +  K + D  +K+ DD       A  +L   ++E+ 
Sbjct: 11  LPTPEQKALIQDLEQA-LNFLDEIDKSKQKADQYQKQIDD-------APKELRELRQELE 62

Query: 422 RIKKQCEDMNQE-KNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAV 480
            +KK    +  E  N +L +   L+Q+    + Q      +  +    L ++Q + E A 
Sbjct: 63  ALKKTDAPVFPELANLSLSQ---LEQRLAQTLSQLQELQEQLQQENSQLIELQTRPERAQ 119

Query: 481 KEINTAMAVRMKATKDLKR-------LTEERNAAMQ-EYSLIMSERDSVHKEMEKLSE-- 530
           ++++ A     +    L+        L + +   +Q E + + ++ + +  E+   +   
Sbjct: 120 QQLSEARRRLQEIRNRLQALSPGGTPLAQAQRTLLQAELAALKAQIEELELELLSNNNRQ 179

Query: 531 DLTQAVRKIKTLEMENKE 548
           +L +  R +    +E  E
Sbjct: 180 ELLRLQRDLLKKRIERLE 197


>gnl|CDD|147120 pfam04803, Cor1, Cor1/Xlr/Xmr conserved region.  Cor1 is a
           component of the chromosome core in the meiotic prophase
           chromosomes. Xlr is a lymphoid cell specific protein.
           Xlm is abundantly transcribed in testis in a
           tissue-specific and developmentally regulated manner.
           The protein is located in the nuclei of spermatocytes,
           early in the prophase of the first meiotic division, and
           later becomes concentrated in the XY nuclear subregion
           where it is in particular associated with the axes of
           sex chromosomes.
          Length = 130

 Score = 29.6 bits (67), Expect = 5.4
 Identities = 23/102 (22%), Positives = 39/102 (38%), Gaps = 12/102 (11%)

Query: 396 RKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQW 455
           RKR +             E   E+V   K Q E   Q+ N          QQ     +QW
Sbjct: 8   RKRLETYTKASFKGS---EQKLEQV--WKTQQEQR-QKLN------QEYSQQFLTLFQQW 55

Query: 456 DIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDL 497
           +   ++  E +E L+ + +Q +   ++       R+K  K+L
Sbjct: 56  EADAQKFEEQEEKLSNMFRQQQKLFQQARIVQNQRLKTIKEL 97


>gnl|CDD|223400 COG0323, MutL, DNA mismatch repair enzyme (predicted ATPase) [DNA
           replication, recombination, and repair].
          Length = 638

 Score = 31.2 bits (71), Expect = 5.5
 Identities = 34/153 (22%), Positives = 55/153 (35%), Gaps = 33/153 (21%)

Query: 384 KYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNTALRERNG 443
            YE +K+E   + +    LI        +LEL+ EE             E++    E+ G
Sbjct: 484 LYEKLKNELGNVGELQPLLIP------IRLELSPEEADV---------LEEHKEELEKLG 528

Query: 444 LKQQ----CTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKR 499
            + +     + A+R     L +    +     +       + E            KDLK 
Sbjct: 529 FEIESFGENSVAVRSVPAMLGKAEVQELIRELLDD-----LLE---------GKLKDLKE 574

Query: 500 LTEERNAAMQEYSLIMSERDSVHKEMEKLSEDL 532
           L EE  A M   S + + R+   +EM  L  DL
Sbjct: 575 LLEELAATMACRSAVKAGRELSAEEMNALLRDL 607


>gnl|CDD|221288 pfam11882, DUF3402, Domain of unknown function (DUF3402).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 350 to 473 amino acids in length. This domain is
           found associated with pfam07923.
          Length = 402

 Score = 31.1 bits (71), Expect = 5.6
 Identities = 17/90 (18%), Positives = 38/90 (42%), Gaps = 6/90 (6%)

Query: 479 AVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRK 538
           ++ E     + +++     K+L EER   M+       ER    + +     D++++V++
Sbjct: 3   SIIEAGELFSKKVRMKLATKQLWEEREKFMKY------ERGWDTESLVGDPLDISESVKE 56

Query: 539 IKTLEMENKELVEDKKTLSYQIETLKREIA 568
           +K     +    + KK    ++ T   EI 
Sbjct: 57  LKLEMYTSLAEDQPKKEEIERLSTDSEEIP 86


>gnl|CDD|169982 PRK09578, PRK09578, periplasmic multidrug efflux lipoprotein
           precursor; Reviewed.
          Length = 385

 Score = 30.9 bits (70), Expect = 5.6
 Identities = 13/41 (31%), Positives = 21/41 (51%)

Query: 314 RGQHQAAMAQLEAAAQESSSLRSKYSDLISDKQRLDRELLE 354
           R     A+A+ EAA   +   R +Y DL+ D+   +R+  E
Sbjct: 103 RDAAAGALAKAEAAHLAALDKRRRYDDLVRDRAVSERDYTE 143


>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869).  This
           bacterial family of proteins has no known function.
          Length = 131

 Score = 29.5 bits (67), Expect = 5.7
 Identities = 12/60 (20%), Positives = 30/60 (50%), Gaps = 4/60 (6%)

Query: 417 QEEVARIKKQCEDMNQEKNTALRE-RNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQ 475
           QE++ +++++ + +  +   A++E R    QQ     +Q +   +ER +  E   ++  Q
Sbjct: 26  QEQLTQLEQELQQLEFQGQKAIKEIRKQSAQQIEQIQQQVE---QERAKRLEQKNQLLFQ 82


>gnl|CDD|233008 TIGR00524, eIF-2B_rel, eIF-2B alpha/beta/delta-related
           uncharacterized proteins.  This model, eIF-2B_rel,
           describes half of a superfamily, where the other half
           consists of eukaryotic translation initiation factor 2B
           (eIF-2B) subunits alpha, beta, and delta. It is unclear
           whether the eIF-2B_rel set is monophyletic, or whether
           they are all more closely related to each other than to
           any eIF-2B subunit because the eIF-2B clade is highly
           derived. Members of this branch of the family are all
           uncharacterized with respect to function and are found
           in the Archaea, Bacteria, and Eukarya, although a number
           are described as putative translation intiation factor
           components. Proteins found by eIF-2B_rel include at
           least three clades, including a set of uncharacterized
           eukaryotic proteins, a set found in some but not all
           Archaea, and a set universal so far among the Archaea
           and closely related to several uncharacterized bacterial
           proteins [Unknown function, General].
          Length = 303

 Score = 30.9 bits (70), Expect = 5.8
 Identities = 19/64 (29%), Positives = 30/64 (46%), Gaps = 2/64 (3%)

Query: 378 YMSALNKYETIKDEYDA-LRKRFDDLINTHSTAVDKLELAQEEVARIKKQCEDMNQEKNT 436
            ++A        +E+   L K  D L++T  TAV+ L  A E V    +  E + + K +
Sbjct: 31  ALAARKIETDNVEEFKEDLEKAADFLLSTRPTAVN-LFWALERVLNSAENGESVEEAKES 89

Query: 437 ALRE 440
            LRE
Sbjct: 90  LLRE 93


>gnl|CDD|236729 PRK10636, PRK10636, putative ABC transporter ATP-binding protein;
           Provisional.
          Length = 638

 Score = 31.3 bits (71), Expect = 6.1
 Identities = 25/76 (32%), Positives = 35/76 (46%), Gaps = 3/76 (3%)

Query: 464 EYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHK 523
           +YQ+ L+ VQ+Q     +      A   +A KD KR   E     Q    +  E   + K
Sbjct: 514 DYQQWLSDVQKQENQTDEAPKENNANSAQARKDQKRREAELRTQTQP---LRKEIARLEK 570

Query: 524 EMEKLSEDLTQAVRKI 539
           EMEKL+  L QA  K+
Sbjct: 571 EMEKLNAQLAQAEEKL 586


>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein.  This family
           consists of the p50 and variable adherence-associated
           antigen (Vaa) adhesins from Mycoplasma hominis. M.
           hominis is a mycoplasma associated with human urogenital
           diseases, pneumonia, and septic arthritis. An adhesin is
           a cell surface molecule that mediates adhesion to other
           cells or to the surrounding surface or substrate. The
           Vaa antigen is a 50-kDa surface lipoprotein that has
           four tandem repetitive DNA sequences encoding a periodic
           peptide structure, and is highly immunogenic in the
           human host. p50 is also a 50-kDa lipoprotein, having
           three repeats A,B and C, that may be a tetramer of
           191-kDa in its native environment.
          Length = 353

 Score = 30.8 bits (69), Expect = 6.2
 Identities = 35/184 (19%), Positives = 65/184 (35%), Gaps = 20/184 (10%)

Query: 410 VDKLELAQEEVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEAL 469
           + KL  A E     KK  +D N +    L ERN   Q     +++ +      NE  E  
Sbjct: 92  ISKLSAAVENAKNEKKAIDDKNAQIAKELAERNAKIQSNIEELKKIN------NEAFELS 145

Query: 470 AKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLS 529
             V +   +  K+       + +       L ++     +  ++  ++      E+E   
Sbjct: 146 KTVNKTIAEVEKKFKIPKDFKEQLENFADDLLDKSRQIDEFTTVTSTQEGFTLAELESFK 205

Query: 530 EDLTQAVRKIKT-----LEMENKEL---------VEDKKTLSYQIETLKREIASALHDRD 575
           E  T     +K+     L+    EL         VE+ K LS++I      +   + + +
Sbjct: 206 EITTTWFNGMKSEWARVLDAWKNELTEINSIIKGVEELKKLSHEISEFSNSVKKTISELE 265

Query: 576 KALK 579
           K  K
Sbjct: 266 KKFK 269


>gnl|CDD|235076 PRK02842, PRK02842, light-independent protochlorophyllide reductase
            subunit N; Provisional.
          Length = 427

 Score = 31.0 bits (71), Expect = 6.2
 Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 9/56 (16%)

Query: 1446 YSDSG-----TLNEDSVSLASYQRVIRLNSEPV-LRPVILVGALSDTVTDKLLQDF 1495
            YS SG     T  ED+V LA+   V      P     ++LVG+L+D V D+L  +F
Sbjct: 135  YSGSGLETTFTQGEDAV-LAAL--VPFCPEAPADHPSLVLVGSLADVVEDQLTLEF 187


>gnl|CDD|203998 pfam08606, Prp19, Prp19/Pso4-like.  This regions is found
           specifically in PRP19-like protein. The region
           represented by this family covers the sequence
           implicated in self-interaction and a coiled-coiled
           motif. PRP19-like proteins form an oligomer that is
           necessary for spliceosome assembly.
          Length = 70

 Score = 28.4 bits (64), Expect = 6.2
 Identities = 12/45 (26%), Positives = 23/45 (51%), Gaps = 7/45 (15%)

Query: 555 TLSYQIETLKREIASALHDRD-------KALKECNDLRERFGEFT 592
            L  Q++  ++E++ AL+  D       + +KE ++ RE   E T
Sbjct: 26  ELRKQLDQTRQELSHALYQHDAACRVIARLIKERDEAREALAELT 70



 Score = 28.0 bits (63), Expect = 8.2
 Identities = 18/55 (32%), Positives = 28/55 (50%), Gaps = 7/55 (12%)

Query: 116 EVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQ 170
           E   ++KQ +   QE + AL        Q  AA R     ++ER+E +EALA++ 
Sbjct: 23  ETFELRKQLDQTRQELSHAL-------YQHDAACRVIARLIKERDEAREALAELT 70



 Score = 28.0 bits (63), Expect = 8.2
 Identities = 18/55 (32%), Positives = 28/55 (50%), Gaps = 7/55 (12%)

Query: 419 EVARIKKQCEDMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQ 473
           E   ++KQ +   QE + AL        Q  AA R     ++ER+E +EALA++ 
Sbjct: 23  ETFELRKQLDQTRQELSHAL-------YQHDAACRVIARLIKERDEAREALAELT 70


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 29.5 bits (67), Expect = 6.2
 Identities = 20/117 (17%), Positives = 45/117 (38%), Gaps = 22/117 (18%)

Query: 481 KEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIK 540
           KE++ A+A      +  +   +     + +    MS  +    E+EKL ++L +   ++ 
Sbjct: 11  KEVDKAIAKEKAKWEKKQEEKKSEAEKLAK----MSAEEKAEYELEKLEKELEELEAELA 66

Query: 541 TLEMENK---------------ELV--EDKKTLSYQIETLKREIASALHD-RDKALK 579
             E++ +               +LV  ED +     ++  K+    A+     + LK
Sbjct: 67  RRELKAEAKKMLSEKGLPVELLDLVVGEDAEETKANVKAFKKLFDKAVEAGVKERLK 123


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 30.8 bits (70), Expect = 6.3
 Identities = 17/88 (19%), Positives = 36/88 (40%), Gaps = 11/88 (12%)

Query: 460 RERNEYQEALAKVQQQHEDA--VKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSE 517
           +E     +   +++++      +KEI   +A + KA    K   +ER         +   
Sbjct: 290 KELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKK----KLKRR 345

Query: 518 RDSVHKEMEK-----LSEDLTQAVRKIK 540
           +   HK  E      L ++L+ ++R +K
Sbjct: 346 KLGKHKYPEPPLEVVLPDELSGSLRTLK 373


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 30.3 bits (69), Expect = 6.3
 Identities = 14/71 (19%), Positives = 33/71 (46%), Gaps = 3/71 (4%)

Query: 497 LKRLTEERNAAMQEYSLIMSERDSVH--KEMEKLSEDLTQAVRKIKTLEMENKELVEDKK 554
           L R+ +E    +  Y   + E       ++  +  +  ++  ++IK LE E +EL +   
Sbjct: 86  LLRVRDELRMTIAAYQ-TLYESSVAFGMRKALQAEQGKSELEQEIKKLEEEKEELEKRVA 144

Query: 555 TLSYQIETLKR 565
            L  ++E +++
Sbjct: 145 ELEAKLEAIEK 155


>gnl|CDD|219526 pfam07701, HNOBA, Heme NO binding associated.  The HNOBA domain is
           found associated with the HNOB domain and pfam00211 in
           soluble cyclases and signalling proteins. The HNOB
           domain is predicted to function as a heme-dependent
           sensor for gaseous ligands, and transduce diverse
           downstream signals, in both bacteria and animals.
          Length = 209

 Score = 30.3 bits (69), Expect = 6.5
 Identities = 23/81 (28%), Positives = 37/81 (45%), Gaps = 24/81 (29%)

Query: 493 ATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVED 552
           AT+DL  L E+  A           +  + K +EKL + L +A R+++          E 
Sbjct: 151 ATRDLILLGEQARA-----------QLKLKKRLEKLKDKLEEAHRELE---------EEK 190

Query: 553 KKT--LSYQIETLKREIASAL 571
           KKT  L Y +  L + +A+ L
Sbjct: 191 KKTDDLLYSM--LPKSVANRL 209


>gnl|CDD|184236 PRK13679, PRK13679, hypothetical protein; Provisional.
          Length = 168

 Score = 29.9 bits (68), Expect = 6.7
 Identities = 14/50 (28%), Positives = 26/50 (52%), Gaps = 3/50 (6%)

Query: 387 TIKDEYDALRKRFDD---LINTHSTAVDKLELAQEEVARIKKQCEDMNQE 433
            I+D  ++ RKR+D    LI  H T  +  E++ E++  I ++   +  E
Sbjct: 12  KIQDFANSYRKRYDPHYALIPPHITLKEPFEISDEQLDSIVEELRAIASE 61


>gnl|CDD|218270 pfam04795, PAPA-1, PAPA-1-like conserved region.  Family of
           proteins with a conserved region found in PAPA-1, a
           PAP-1 binding protein.
          Length = 89

 Score = 28.5 bits (64), Expect = 6.7
 Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 4/76 (5%)

Query: 110 KLKKKEEVARIKK-QCEDMNQE-KNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALA 167
              K+ E AR +K Q E   +E K   + +   LK+Q     R+  I   +  + QE  A
Sbjct: 2   MQLKRAEAARRRKNQSEKRAEEEKMETINKL--LKKQARRKKREDKIKKGDEKKAQEKAA 59

Query: 168 KVQQQHEDAVKEINTA 183
           + ++     ++ ++  
Sbjct: 60  RAEKAPPPMIRYVSGP 75


>gnl|CDD|217145 pfam02616, ScpA_ScpB, ScpA/B protein.  ScpA and ScpB participate in
           chromosomal partition during cell division. It may act
           via the formation of a condensin-like complex containing
           smc that pull DNA away from mid-cell into both cell
           halves. These proteins are part of the Kleisin
           superfamily.
          Length = 177

 Score = 30.0 bits (68), Expect = 6.8
 Identities = 24/114 (21%), Positives = 47/114 (41%), Gaps = 10/114 (8%)

Query: 485 TAMAVRMKATKDLKRLTEERNAAMQEY-SLIMSERDSVHKEMEKLSEDLTQAVRKIKTLE 543
            ++ +RMKA   L+  +++    + E+   +  E   V K +  L     + V++ +   
Sbjct: 43  ASILLRMKAEALLRSFSKKPV-TLDEFIKTLEEELVEVEKSVLDLIHAFNKLVKRKRKNR 101

Query: 544 MENKELVEDKKTLSYQIETLKREIASALHDRDK----ALKECNDLRERFGEFTA 593
           +  K   E    L   +E +  E+ + L  R K     LK+    ++  G F A
Sbjct: 102 LRKKNETE----LEVSVEEVLEELLAFLGVRLKEVLTKLKKFPSKKDLVGAFVA 151


>gnl|CDD|214599 smart00283, MA, Methyl-accepting chemotaxis-like domains
           (chemotaxis sensory transducer).  Thought to undergo
           reversible methylation in response to attractants or
           repellants during bacterial chemotaxis.
          Length = 262

 Score = 30.3 bits (69), Expect = 7.5
 Identities = 15/63 (23%), Positives = 31/63 (49%)

Query: 478 DAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVR 537
           D V+EI  A   +   ++++    +E     QE + +  E  +  +E+  L+E+L + V 
Sbjct: 200 DLVQEIAAATDEQAAGSEEVNAAIDEIAQVTQETAAMSEEISAAAEELSGLAEELDELVE 259

Query: 538 KIK 540
           + K
Sbjct: 260 RFK 262


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 30.9 bits (71), Expect = 7.5
 Identities = 18/93 (19%), Positives = 37/93 (39%), Gaps = 7/93 (7%)

Query: 462 RNEYQEALAKVQQQHEDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSV 521
             +  E    + +Q++   K     +  R+   K   +L E+    ++   L        
Sbjct: 217 PEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQHKKYAKLREKLKEELKSLRL-------T 269

Query: 522 HKEMEKLSEDLTQAVRKIKTLEMENKELVEDKK 554
            K++++L E L    ++++  E E   LVE  K
Sbjct: 270 SKQIDELVEQLRDINKRVRGQERELLRLVERLK 302


>gnl|CDD|148314 pfam06632, XRCC4, DNA double-strand break repair and V(D)J
           recombination protein XRCC4.  This family consists of
           several eukaryotic DNA double-strand break repair and
           V(D)J recombination protein XRCC4 sequences. In the
           non-homologous end joining pathway of DNA double-strand
           break repair, the ligation step is catalyzed by a
           complex of XRCC4 and DNA ligase IV. It is thought that
           XRCC4 and ligase IV are essential for alignment-based
           gap filling, as well as for final ligation of the
           breaks.
          Length = 331

 Score = 30.6 bits (69), Expect = 7.8
 Identities = 55/282 (19%), Positives = 110/282 (39%), Gaps = 38/282 (13%)

Query: 477 EDAVKEINTAMAVRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAV 536
            D  +E +     R K   +L++          EY+   S ++S+H   EK  +D++  +
Sbjct: 50  SDISQEADDMAMEREKYVDELRKALVSGAGPAGEYTFNFS-KESLHFSYEKNLKDVSFRL 108

Query: 537 RKIKTLEMENKELVEDKKTLSYQIETLKREIASALH---DRDKALKECNDLRERFGEFTA 593
              K  ++ N   V  ++ + Y ++T+    A   H   + ++ L++ ND++ RF +  +
Sbjct: 109 GSFKLEKVPNPAEV-IRELICYCLDTIAELQAKNEHLQKENERLLRDWNDIQGRFEKCVS 167

Query: 594 AKEECQREAFHARFEYAM--RMGERDSSRKERNEVEAASMDLFGKCQKERMDNLEANQEI 651
            KE  + +  + RF   +  +  +  S  K  +E +     +    ++++  +  +++  
Sbjct: 168 GKEALETD-LYKRFILVLNEKKAKIRSLHKLLSEAQELEKSI----KQKKETSACSDKTP 222

Query: 652 DRLRKHTDKLQAELQDQSVNYLFILTESIQEAEVSKRRRDWAFSERDKIEVIDSYHTKHA 711
           D   K+         D S       T+  QEA         A   +D   +     T  A
Sbjct: 223 DEDSKY---------DGS-------TDEEQEAPPKPSESMPAAVSKDDSLISSPDITDIA 266

Query: 712 GRHKRGKELDKNGGTWPKARSGPVIENGTGTILHPRRHKERL 753
              KR + + KN GT PK            T     + KE+ 
Sbjct: 267 PSRKRRQRMQKNLGTEPKMA----------TQEQQLQEKEKP 298


>gnl|CDD|222389 pfam13801, Metal_resist, Heavy-metal resistance.  This is a
           metal-binding protein which is involved in resistance to
           heavy-metal ions. The protein forms a four-helix hooked
           hairpin, consisting of two long alpha helices each
           flanked by a shorter alpha helix. It binds a metal ion
           in a type-2 like centre. It contains two copies of an
           LTXXQ motif.
          Length = 125

 Score = 29.1 bits (66), Expect = 7.9
 Identities = 21/88 (23%), Positives = 28/88 (31%), Gaps = 9/88 (10%)

Query: 583 DLRERFGEFTAAKEECQREAFHARFEYAMR--MGERDSSRKERNEVEAASMDLFGKCQKE 640
             R   G    A    QR A  A      R     R + R  R E+ A         + E
Sbjct: 30  RRRPGLGPAALALPPEQRRALRAALRAHRREVRPLRRALRAARRELLALL-------RAE 82

Query: 641 RMDNLEANQEIDRLRKHTDKLQAELQDQ 668
             D       + R R+    LQA +Q+ 
Sbjct: 83  PFDPAALRAALARQREARAALQARVQEA 110


>gnl|CDD|218452 pfam05130, FlgN, FlgN protein.  This family includes the FlgN
           protein and export chaperone involved in flagellar
           synthesis.
          Length = 140

 Score = 29.3 bits (66), Expect = 8.1
 Identities = 25/97 (25%), Positives = 42/97 (43%), Gaps = 12/97 (12%)

Query: 514 IMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKELVEDKKTLSYQIETLKREIASALH- 572
           ++ E   + +E+ +L E+  +A++      +E  EL E+K+ L  Q+E L+ E    L  
Sbjct: 5   LLEEELELLEELLELLEEEQEALKAGDIEALE--ELTEEKQELLAQLEQLEEERQQLLAE 62

Query: 573 ----DRDKALKEC-----NDLRERFGEFTAAKEECQR 600
                    L E       +LRE + E     EE Q 
Sbjct: 63  LGLSPGPAGLSELLAAEDEELRELWQELKELLEELQE 99


>gnl|CDD|225575 COG3031, PulC, Type II secretory pathway, component PulC
            [Intracellular trafficking and secretion].
          Length = 275

 Score = 30.3 bits (68), Expect = 8.9
 Identities = 15/61 (24%), Positives = 25/61 (40%), Gaps = 2/61 (3%)

Query: 1256 GGNAVGIYVHSVQSGSLGYSAGLRTGDRILEYNGTDLRAATAEEAAYEL--AKPADKVTV 1313
                 G      + GSL Y +GL+ GD  +  N  DL          ++    P+ ++TV
Sbjct: 203  NEKIEGYRFEPGKDGSLFYKSGLQRGDIAVAINNLDLTDPEDMFRLLQMLRNMPSLQLTV 262

Query: 1314 L 1314
            +
Sbjct: 263  I 263


>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
           division and chromosome partitioning].
          Length = 570

 Score = 30.4 bits (69), Expect = 9.2
 Identities = 41/229 (17%), Positives = 78/229 (34%), Gaps = 23/229 (10%)

Query: 372 DSLNQHYM---SALNKYETIKDEYDALRKRFDDLINTHSTAVDKLELAQEEVARIKKQCE 428
           + + + Y    + L      + E   L    D+++             Q+ +  I+K   
Sbjct: 333 ERVKESYRLAETELGSVRKFEKELKELESVLDEILENIEAQEVAYSELQDNLEEIEKALT 392

Query: 429 DMNQEKNTALRERNGLKQQCTAAIRQWDIALRERNEYQEALAKVQQQHEDAVKEINTAMA 488
           D+  E+         L++    A    +   R +++  E    +++ +   + E      
Sbjct: 393 DIEDEQEKVQEHLTSLRKDELEA---RENLERLKSKLHEIKRYMEKSNLPGLPE------ 443

Query: 489 VRMKATKDLKRLTEERNAAMQEYSLIMSERDSVHKEMEKLSEDLTQAVRKIKTLEMENKE 548
                             A  E   +M E   V   ME +S  +  A   + TLE E +E
Sbjct: 444 ----------TFLSLFFTAGHEIQDLMKELSEVPINMEAVSALVDIATEDMNTLEDETEE 493

Query: 549 LVEDKKTLSYQIETLKREIASALHDRDKALKECNDLRERFGEFTAAKEE 597
           +VE    L+ Q+        S   +  K+L E   L E   ++ A+ E 
Sbjct: 494 VVE-NAVLAEQLIQYGNRYRSRNAEVAKSLNEAERLFENAFDYDASFEI 541


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.129    0.366 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 82,784,939
Number of extensions: 8263287
Number of successful extensions: 8258
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7588
Number of HSP's successfully gapped: 552
Length of query: 1659
Length of database: 10,937,602
Length adjustment: 110
Effective length of query: 1549
Effective length of database: 6,058,662
Effective search space: 9384867438
Effective search space used: 9384867438
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 66 (29.1 bits)