RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16785
         (1642 letters)



>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
            chromosome partitioning].
          Length = 1163

 Score = 50.5 bits (121), Expect = 9e-06
 Identities = 78/392 (19%), Positives = 148/392 (37%), Gaps = 59/392 (15%)

Query: 851  VSDLKTRLKDVEKKL------LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAE-LEHKT 903
            VS  K R ++ E+KL      LE  E++  E EKQ      L +L  Q   A    E K 
Sbjct: 167  VSKYKERKEEAERKLERTEENLERLEDLLEELEKQ------LEKLERQAEKAERYQELKA 220

Query: 904  MIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 963
              E    + ALL AKLKE+++E++   E+    ++++ E+++ ++  + EI  L   L  
Sbjct: 221  --ELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEE 278

Query: 964  SLAENHILNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLL 1023
               E   L  ++  +              L+  I L+++             + +  + L
Sbjct: 279  LREELEELQEELLELKEEIE--------ELEGEISLLRE------------RLEELENEL 318

Query: 1024 VDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTSKDSANIQLDHAETN 1083
             +LE+++++    I+  K+  +LE  ET +        E  +     +     L      
Sbjct: 319  EELEERLEELKEKIEALKE--ELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEE 376

Query: 1084 TNEESPADQDETNSTTTL---DKSHIQKEIANNLPKVWRILIELLNTHQEQYENN----- 1135
              E    +  E  +       +   +++EI          L E L    E+ E+      
Sbjct: 377  LFEALREELAELEAELAEIRNELEELKREI--------ESLEERLERLSERLEDLKEELK 428

Query: 1136 DVNENFESCYKNIKTNTGKTHQVISVSQTFIRLKHLILEKNHLTNQLAKLKTLNNHLEIR 1195
            ++    E     ++           + +   +L+ L      L  +LA+L+     LE  
Sbjct: 429  ELEAELEELQTELEEL------NEELEELEEQLEELRDRLKELERELAELQEELQRLEKE 482

Query: 1196 LNHSEQRLNTVTTELKKTWNVVNKLKLQHKQL 1227
            L+  E RL+ +  E + +  V   L+     L
Sbjct: 483  LSSLEARLDRLEAEQRASQGVRAVLEALESGL 514



 Score = 47.4 bits (113), Expect = 8e-05
 Identities = 54/255 (21%), Positives = 106/255 (41%), Gaps = 20/255 (7%)

Query: 34  VSDLKTRLKDVEKKL------LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAE-LEHKT 86
           VS  K R ++ E+KL      LE  E++  E EKQ      L +L  Q   A    E K 
Sbjct: 167 VSKYKERKEEAERKLERTEENLERLEDLLEELEKQ------LEKLERQAEKAERYQELKA 220

Query: 87  MIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 146
             E    + ALL AKLKE+++E++   E+    ++++ E+++ ++  + EI  L   L  
Sbjct: 221 --ELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEE 278

Query: 147 SLAENHILNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLL 206
              E   L  ++  +        +   I+   L++   +     ++ +          + 
Sbjct: 279 LREELEELQEELLELKEEIEE--LEGEIS---LLRERLEELENELEELEERLEELKEKIE 333

Query: 207 VDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTKAVESNQTSTSKDSA 266
              E+  ++ETL  + ++ L +LE  +  +   +   +E  +   +A+         + A
Sbjct: 334 ALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELA 393

Query: 267 NIQLDHAETNTNEES 281
            I+ +  E     ES
Sbjct: 394 EIRNELEELKREIES 408



 Score = 39.7 bits (93), Expect = 0.018
 Identities = 28/118 (23%), Positives = 64/118 (54%), Gaps = 4/118 (3%)

Query: 37  LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 96
           LK +++ ++++L E  E +  E E+ +   ++ +E  E++ +A   E + + E    + A
Sbjct: 328 LKEKIEALKEELEER-ETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELA 386

Query: 97  LLEAKLKEIQEEI---QAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAEN 151
            LEA+L EI+ E+   + ++E  +E  +++ E  ++++ +  E+    E L + L E 
Sbjct: 387 ELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEEL 444



 Score = 38.9 bits (91), Expect = 0.027
 Identities = 23/150 (15%), Positives = 69/150 (46%), Gaps = 2/150 (1%)

Query: 33  KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAEL--EHKTMIEK 90
           ++ +L+  L++ E++L     E++   +++    +++ EL E+     E   E +  +E+
Sbjct: 794 ELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE 853

Query: 91  ELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 150
              +   L+ +L+E++ E +   ++ KE +++  E+E+ ++  + E+  L E +      
Sbjct: 854 LEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRER 913

Query: 151 NHILNSQMTLVNNLFSNMLINPMINLDMLI 180
              L +++  +      +        +  +
Sbjct: 914 LEELEAKLERLEVELPELEEELEEEYEDTL 943



 Score = 38.2 bits (89), Expect = 0.047
 Identities = 23/147 (15%), Positives = 67/147 (45%), Gaps = 2/147 (1%)

Query: 853 DLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAEL--EHKTMIEKELA 910
           +L+  L++ E++L     E++   +++    +++ EL E+     E   E +  +E+   
Sbjct: 797 ELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEK 856

Query: 911 KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHI 970
           +   L+ +L+E++ E +   ++ KE +++  E+E+ ++  + E+  L E +         
Sbjct: 857 ELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916

Query: 971 LNSQMTLVNNLFSNMLINPMINLDMLI 997
           L +++  +      +        +  +
Sbjct: 917 LEAKLERLEVELPELEEELEEEYEDTL 943



 Score = 37.4 bits (87), Expect = 0.087
 Identities = 24/114 (21%), Positives = 58/114 (50%), Gaps = 9/114 (7%)

Query: 38  KTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNAL 97
           K  LK++E++L E   +++   E+  +   +LR L +          +   + E  K  L
Sbjct: 666 KRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKREL 725

Query: 98  ---------LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 142
                    L+++L+E++EE++   E+ +E ++++ E+E+ ++  ++ +  L E
Sbjct: 726 AALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKE 779



 Score = 37.4 bits (87), Expect = 0.087
 Identities = 24/114 (21%), Positives = 58/114 (50%), Gaps = 9/114 (7%)

Query: 855 KTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNAL 914
           K  LK++E++L E   +++   E+  +   +LR L +          +   + E  K  L
Sbjct: 666 KRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKREL 725

Query: 915 ---------LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 959
                    L+++L+E++EE++   E+ +E ++++ E+E+ ++  ++ +  L E
Sbjct: 726 AALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKE 779



 Score = 36.2 bits (84), Expect = 0.19
 Identities = 32/135 (23%), Positives = 69/135 (51%), Gaps = 9/135 (6%)

Query: 33  KVSDLKTRLKDVEKKLLESPEEIKI------EYEKQMTNYKKLRELYEQRAAAAELEHKT 86
            ++ LK  ++++E+K     EE++       E E+++   ++  E  EQR    E E + 
Sbjct: 773 ALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEE 832

Query: 87  MIEK-ELAKNAL--LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNEN 143
           + E+ E  +  L  LE +L+E+++E++   E+ +E + +  E+E  ++  ++E   L E 
Sbjct: 833 LEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEE 892

Query: 144 LTSSLAENHILNSQM 158
           L    +E   L  ++
Sbjct: 893 LRELESELAELKEEI 907



 Score = 34.7 bits (80), Expect = 0.61
 Identities = 29/128 (22%), Positives = 62/128 (48%), Gaps = 2/128 (1%)

Query: 850 LVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 909
           L+ +L+ +L+++E++L E   E+    E+      +L EL E+     E   +     E 
Sbjct: 703 LLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEE 762

Query: 910 AKNAL--LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 967
            +  L  LE  L +++EEI+   EK +  ++++ E+E+ ++  +  ++ L   L S    
Sbjct: 763 LEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQR 822

Query: 968 NHILNSQM 975
              L  ++
Sbjct: 823 RERLEQEI 830



 Score = 33.9 bits (78), Expect = 0.82
 Identities = 35/199 (17%), Positives = 86/199 (43%), Gaps = 15/199 (7%)

Query: 34  VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELA 93
           + +    L++  ++L E  EE++ E E       KL+E  E+     +   + + E E  
Sbjct: 742 LEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEE 801

Query: 94  KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHI 153
                E +L  ++ E+++  ++ +  +++I E+E+ I+  +++++ L E L     E   
Sbjct: 802 LE-EAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEE 860

Query: 154 LNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKI 213
           L  ++  +                 L   +++      +L    E+ +  S L +L+++I
Sbjct: 861 LKEELEELEAEKEE-----------LEDELKELEEEKEEL--EEELRELESELAELKEEI 907

Query: 214 -DQETLAIQNDKQLKQLEN 231
                   + + +L++LE 
Sbjct: 908 EKLRERLEELEAKLERLEV 926


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 44.7 bits (106), Expect = 4e-04
 Identities = 25/114 (21%), Positives = 66/114 (57%), Gaps = 2/114 (1%)

Query: 33  KVSDLKTRLKDVEKKLLESPEEI-KIEYEKQMTNYKKLRELYEQRAAAAELEH-KTMIEK 90
           +VS+L+  +++++K+L     EI ++E +KQ+   +      +     A+LE  ++ +++
Sbjct: 275 EVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE 334

Query: 91  ELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 144
              + A LE KL+E++EE+++   + +E + ++ E+E  ++  ++++  L   +
Sbjct: 335 LAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV 388



 Score = 44.3 bits (105), Expect = 7e-04
 Identities = 25/113 (22%), Positives = 65/113 (57%), Gaps = 2/113 (1%)

Query: 851 VSDLKTRLKDVEKKLLESPEEI-KIEYEKQMTNYKKLRELYEQRAAAAELEH-KTMIEKE 908
           VS+L+  +++++K+L     EI ++E +KQ+   +      +     A+LE  ++ +++ 
Sbjct: 276 VSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDEL 335

Query: 909 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 961
             + A LE KL+E++EE+++   + +E + ++ E+E  ++  ++++  L   +
Sbjct: 336 AEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV 388



 Score = 37.0 bits (86), Expect = 0.12
 Identities = 32/129 (24%), Positives = 59/129 (45%), Gaps = 20/129 (15%)

Query: 34  VSDLKTRLKDVEKKL------LESPEEIKIEYEKQMTN----------YKKLRELYEQRA 77
           +S  K R K+ E+KL      L+  E+I  E E+Q+ +          YK+L+    +  
Sbjct: 167 ISKYKERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRELE 226

Query: 78  AAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQL----EKYKENKKQILEMEKNIQLK 133
            A  +     + +EL +      + +E  EE+ A+L    EK +E + ++ E+E+ I+  
Sbjct: 227 LALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEEL 286

Query: 134 DDEINNLNE 142
             E+  L  
Sbjct: 287 QKELYALAN 295



 Score = 36.6 bits (85), Expect = 0.15
 Identities = 21/130 (16%), Positives = 59/130 (45%), Gaps = 2/130 (1%)

Query: 840 LATTRSVILPLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAE- 898
           L      +   +  L+  L+++ +++    +++     +     +++ +L ++       
Sbjct: 703 LRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAE 762

Query: 899 -LEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 957
             E +  +E+   + A  EA+++E++ +I+   E+ K  ++ + E+   + L ++E  NL
Sbjct: 763 IEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANL 822

Query: 958 NENLTSSLAE 967
            E L S    
Sbjct: 823 RERLESLERR 832



 Score = 35.8 bits (83), Expect = 0.27
 Identities = 20/119 (16%), Positives = 57/119 (47%), Gaps = 2/119 (1%)

Query: 34  VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAE--LEHKTMIEKE 91
           +  L+  L+++ +++    +++     +     +++ +L ++         E +  +E+ 
Sbjct: 714 LEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEA 773

Query: 92  LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 150
             + A  EA+++E++ +I+   E+ K  ++ + E+   + L ++E  NL E L S    
Sbjct: 774 EEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERR 832



 Score = 34.6 bits (80), Expect = 0.52
 Identities = 22/140 (15%), Positives = 58/140 (41%), Gaps = 11/140 (7%)

Query: 33  KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAE----------L 82
           ++S L+ + + + ++L     +++ E E Q+   +   +   +  A  E           
Sbjct: 296 EISRLEQQKQILRERLANLERQLE-ELEAQLEELESKLDELAEELAELEEKLEELKEELE 354

Query: 83  EHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 142
             +  +E+  A+   LE++L+E++E+++    K  + + QI  +   I+  +  +  L +
Sbjct: 355 SLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLED 414

Query: 143 NLTSSLAENHILNSQMTLVN 162
                  E   L  ++    
Sbjct: 415 RRERLQQEIEELLKKLEEAE 434



 Score = 34.6 bits (80), Expect = 0.57
 Identities = 26/133 (19%), Positives = 59/133 (44%), Gaps = 12/133 (9%)

Query: 854 LKTRLKDVEKKLLESPEEI------KIEYEKQMT-NYKKLRELYEQRAAAAELEHKTMIE 906
           L+ RL ++E++L E   ++        E  +++    +KL EL E+  +         +E
Sbjct: 307 LRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEA-----ELE 361

Query: 907 KELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLA 966
           +  A+   LE++L+E++E+++    K  + + QI  +   I+  +  +  L +       
Sbjct: 362 ELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ 421

Query: 967 ENHILNSQMTLVN 979
           E   L  ++    
Sbjct: 422 EIEELLKKLEEAE 434



 Score = 34.3 bits (79), Expect = 0.68
 Identities = 19/80 (23%), Positives = 38/80 (47%), Gaps = 2/80 (2%)

Query: 67  KKLRELYEQRAAAAEL--EHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQIL 124
           ++L EL E+   A E   E    +++   K   L  ++ E++EEI+   ++      +I 
Sbjct: 239 EELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEIS 298

Query: 125 EMEKNIQLKDDEINNLNENL 144
            +E+  Q+  + + NL   L
Sbjct: 299 RLEQQKQILRERLANLERQL 318



 Score = 34.3 bits (79), Expect = 0.68
 Identities = 19/80 (23%), Positives = 38/80 (47%), Gaps = 2/80 (2%)

Query: 884 KKLRELYEQRAAAAEL--EHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQIL 941
           ++L EL E+   A E   E    +++   K   L  ++ E++EEI+   ++      +I 
Sbjct: 239 EELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEIS 298

Query: 942 EMEKNIQLKDDEINNLNENL 961
            +E+  Q+  + + NL   L
Sbjct: 299 RLEQQKQILRERLANLERQL 318



 Score = 33.1 bits (76), Expect = 1.7
 Identities = 63/367 (17%), Positives = 134/367 (36%), Gaps = 54/367 (14%)

Query: 851  VSDLKTRLKDVEKKL------LESPEEIKIEYEKQMTNYKKLRELYEQ-RAAAAELEHKT 903
            +S  K R K+ E+KL      L+  E+I  E E+Q+ + ++  E  E+ +   AEL    
Sbjct: 167  ISKYKERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRE-- 224

Query: 904  MIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 963
             +E  L    L E +            E+ +E ++++ E E       +E+  L   L  
Sbjct: 225  -LELALLVLRLEELR------------EELEELQEELKEAE-------EELEELTAELQE 264

Query: 964  SLAENHILNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLIN--NHEINDTVS 1021
               +   L  +++                L+  I+ +Q     + + I+    +      
Sbjct: 265  LEEKLEELRLEVS---------------ELEEEIEELQKELYALANEISRLEQQKQILRE 309

Query: 1022 LLVDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTSKDSANIQLDHAE 1081
             L +LE+++++    ++  +        E       +E ++    S   +   ++ +  E
Sbjct: 310  RLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEE 369

Query: 1082 TNTNEESPADQDETNST----TTLDKSHIQKEIANNLPKVWRILIELLNTHQEQYENNDV 1137
              +  E   +Q ET  +      L  + +  EI     ++ R+        QE  E    
Sbjct: 370  LESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLK- 428

Query: 1138 NENFESCYKNIKTNTGKTHQVISVSQTFIRLKHLILEKNHLTNQLAKLKTLNNHLEIRLN 1197
             +  E+  K ++    +  + +   Q    L+ L      L  +L + +   +  E  L 
Sbjct: 429  -KLEEAELKELQAELEELEEELEELQE--ELERLEEALEELREELEEAEQALDAAERELA 485

Query: 1198 HSEQRLN 1204
              + RL+
Sbjct: 486  QLQARLD 492


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 43.4 bits (103), Expect = 8e-04
 Identities = 27/113 (23%), Positives = 53/113 (46%), Gaps = 4/113 (3%)

Query: 41  LKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEA 100
            K+V+ K L SP   K E E    ++ +L+E  ++     +   + + + +     L + 
Sbjct: 305 GKEVKAKELGSPAGTKTEIE-VGVDFPELKEELKELEEELKELEEELEKIKKLLKKLPKK 363

Query: 101 KLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHI 153
              ++  E + QLEK  E K+++ E    ++  ++E+  L E L S  +E  I
Sbjct: 364 ARGQLPPEKREQLEKLLETKEKLSEE---LEELEEELKELKEELESLYSEGKI 413



 Score = 43.4 bits (103), Expect = 8e-04
 Identities = 27/113 (23%), Positives = 53/113 (46%), Gaps = 4/113 (3%)

Query: 858 LKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEA 917
            K+V+ K L SP   K E E    ++ +L+E  ++     +   + + + +     L + 
Sbjct: 305 GKEVKAKELGSPAGTKTEIE-VGVDFPELKEELKELEEELKELEEELEKIKKLLKKLPKK 363

Query: 918 KLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHI 970
              ++  E + QLEK  E K+++ E    ++  ++E+  L E L S  +E  I
Sbjct: 364 ARGQLPPEKREQLEKLLETKEKLSEE---LEELEEELKELKEELESLYSEGKI 413


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
            found at the N terminus of SMC proteins. The SMC
            (structural maintenance of chromosomes) superfamily
            proteins have ATP-binding domains at the N- and
            C-termini, and two extended coiled-coil domains separated
            by a hinge in the middle. The eukaryotic SMC proteins
            form two kind of heterodimers: the SMC1/SMC3 and the
            SMC2/SMC4 types. These heterodimers constitute an
            essential part of higher order complexes, which are
            involved in chromatin and DNA dynamics. This family also
            includes the RecF and RecN proteins that are involved in
            DNA metabolism and recombination.
          Length = 1162

 Score = 42.3 bits (99), Expect = 0.003
 Identities = 58/364 (15%), Positives = 116/364 (31%), Gaps = 23/364 (6%)

Query: 850  LVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 909
              S  K + K+  KKL+E  E +            +  +L EQ   A E        +  
Sbjct: 164  AGSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELE 223

Query: 910  AKNALLEAKLKEIQEEIQAQLEKY----KENKKQILEMEKNIQLKDDEINNLNENLTSSL 965
             +N L    LK  +E I    E      +E +    E+EK  ++    +    E      
Sbjct: 224  EENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKK 283

Query: 966  AENHILNSQMTLVNNLFSNMLI---NPMINLDMLIKLIQDNYLLIIDLINN-HEINDTVS 1021
             +   L         L S +L      + + + L +  ++   L  +L     EI +   
Sbjct: 284  LQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEK 343

Query: 1022 LLVDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTSKDSANIQLDHAE 1081
             L +LE K + E    +  ++L++             +  E        +S  +      
Sbjct: 344  ELKELEIKREAEEEEEEQLEKLQEKLE----------QLEEELLAKKKLESERLSSAAKL 393

Query: 1082 TNTNEESPADQDETNSTTTLDKSHIQKEIANNLPKVWRILIELLNTHQEQYENNDVNENF 1141
                E    +++E  +   L+ S  ++++     K   + I        + +   + E  
Sbjct: 394  KE-EELELKNEEEKEAKLLLELSEQEEDLLKEEKKE-ELKIVEELEESLETKQGKLTEEK 451

Query: 1142 ESCYKNIKTNTGKTHQVISVSQTFIRLKHLILEKNHLTNQLAKLKTLNNHLEIRLNHSEQ 1201
            E   K          +   + ++   LK   L K     +L  L+        + + + +
Sbjct: 452  EELEKQALKLLKDKLE---LKKSEDLLKETKLVKLLEQLELLLLRQKLEEASQKESKARE 508

Query: 1202 RLNT 1205
             L  
Sbjct: 509  GLAV 512



 Score = 42.3 bits (99), Expect = 0.003
 Identities = 38/287 (13%), Positives = 95/287 (33%), Gaps = 21/287 (7%)

Query: 34  VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELA 93
            S  K + K+  KKL+E  E +            +  +L EQ   A E        +   
Sbjct: 165 GSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEE 224

Query: 94  KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHI 153
           +N L    LK  +E I    E  ++ +++I   ++ ++ +++ +  + +       E  +
Sbjct: 225 ENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKL 284

Query: 154 LNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKI 213
              +                + L    +    + LL ++     +         +L+K  
Sbjct: 285 QEEE----------------LKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLE 328

Query: 214 DQETLAIQN----DKQLKQLENYETPMSSAMVEAVESNQTSTKAVESNQTSTSKDSANIQ 269
            +     +     +K+LK+LE           +  +  +   +  E        +S  + 
Sbjct: 329 KELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLS 388

Query: 270 LDHAETNTNEESPADQDETNSTTTLDKSHIQKEITRVISNSLISGLS 316
                    E    +++E  +   L+ S  ++++ +      +  + 
Sbjct: 389 SAAKLKE-EELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVE 434



 Score = 34.2 bits (78), Expect = 0.72
 Identities = 21/135 (15%), Positives = 48/135 (35%), Gaps = 4/135 (2%)

Query: 33  KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLREL----YEQRAAAAELEHKTMI 88
           ++      LK  E KL E  ++    Y+ +     +   L    Y +         + ++
Sbjct: 188 ELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELL 247

Query: 89  EKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSL 148
             E  +    + +L++ +E +   L++ KE +K+    E+ ++L   E   L   L    
Sbjct: 248 RDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLE 307

Query: 149 AENHILNSQMTLVNN 163
                   ++     
Sbjct: 308 RRKVDDEEKLKESEK 322



 Score = 30.7 bits (69), Expect = 8.1
 Identities = 23/128 (17%), Positives = 45/128 (35%), Gaps = 8/128 (6%)

Query: 25  SSTSMVVFKVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELY--------EQR 76
                   ++ +L+  LK++E K     EE +   + Q    +   EL            
Sbjct: 329 KELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLS 388

Query: 77  AAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDE 136
           +AA   E +  ++ E  K A L  +L E +E++  + +K +    + LE     +     
Sbjct: 389 SAAKLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKLT 448

Query: 137 INNLNENL 144
                   
Sbjct: 449 EEKEELEK 456


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
            recombination, and repair].
          Length = 908

 Score = 40.9 bits (96), Expect = 0.006
 Identities = 67/447 (14%), Positives = 169/447 (37%), Gaps = 40/447 (8%)

Query: 853  DLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAE--LEHKTMIEKELA 910
             L+    ++E+   E  E  K+  E+     ++L EL ++   A E   + +  I++   
Sbjct: 337  KLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKE 396

Query: 911  KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS------- 963
            + A L A L+EIQEE++   ++ +E ++++ E+E+ I+  +++IN L             
Sbjct: 397  ELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGA 456

Query: 964  ----SLAENHILNSQMTLVNNLFSNML--INPMINLDMLIKLIQDNYLLIIDLINNHEIN 1017
                 +    +       +  L+   L  +   ++ +     +++    +   +   E  
Sbjct: 457  GEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEE 516

Query: 1018 DTVSL--LVDLEKKIDQETLAIQNDKQLKQ--LENYETPMSSAMVEAVESNQTSTSKDSA 1073
                L     L+++++++   ++N  +  +   E  +       +  +E       +   
Sbjct: 517  LIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELL- 575

Query: 1074 NIQLDHAETNTNEESPADQDETNSTTTLDKSHIQKEIANNLPKVWRILIELLNTHQEQYE 1133
                   E      +  ++ E       +     KE+   L +    L ELL + +    
Sbjct: 576  -------EELRLLRTRKEELEELRERLKELKKKLKELEERLSQ----LEELLQSLELSEA 624

Query: 1134 NNDVNENFESCYKNIKTNTGKTHQVISVSQTFIR--LKHLILEKNHLTNQLAKLKTLNNH 1191
             N++ E  E   ++      +   + +  +  ++  L+ L  +   L  ++ +       
Sbjct: 625  ENEL-EEAEEELESEL----EKLNLQAELEELLQAALEELEEKVEELEAEIRRELQR-IE 678

Query: 1192 LEIRLNHSEQRLNTVTTELKKTWNVVNKLKLQHKQLYTHEKILKYELKHKRNIINDLKID 1251
             E +L    + L  +  EL++    + +L  +  ++    + L+        +  +L+  
Sbjct: 679  NEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELE-K 737

Query: 1252 LEYVREKWDLVKEKNEQNELDYKVLRK 1278
            LE   E  + ++EK  +  L   +LR 
Sbjct: 738  LEKALELLEELREKLGKAGLRADILRN 764



 Score = 39.4 bits (92), Expect = 0.022
 Identities = 28/113 (24%), Positives = 59/113 (52%), Gaps = 2/113 (1%)

Query: 36  DLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAE--LEHKTMIEKELA 93
            L+    ++E+   E  E  K+  E+     ++L EL ++   A E   + +  I++   
Sbjct: 337 KLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKE 396

Query: 94  KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 146
           + A L A L+EIQEE++   ++ +E ++++ E+E+ I+  +++IN L      
Sbjct: 397 ELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELM 449



 Score = 34.7 bits (80), Expect = 0.53
 Identities = 79/445 (17%), Positives = 160/445 (35%), Gaps = 36/445 (8%)

Query: 864  KLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQ 923
              L+S  + + E   ++   +K  +L E      E+  +   + E  +  L E       
Sbjct: 147  AFLKSKPKERKEILDELFGLEKYEKLSEL---LKEVIKEAKAKIEELEGQLSELLEDIED 203

Query: 924  EEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNEN---LTSSLAENHILNSQMTLVNN 980
                 + E  +  K + ++ E+  +  + EI  L E    L         L +++  + +
Sbjct: 204  LLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIES 263

Query: 981  LFSNMLINPMINLDMLIKLIQDNYLLIIDL-INNHEINDTVSLLVDLEKKIDQETLAIQN 1039
            L    L      L  L +L+++    I  L     EI +    L  L   +++    ++ 
Sbjct: 264  LELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEK 323

Query: 1040 DKQL-KQLENYETPMSSAMVE----AVESNQTSTSKDS-ANIQLDHAETNTNEESPADQD 1093
             K L ++LE  E  +     E    A E N+ +   +       +  E    E   A + 
Sbjct: 324  LKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALER 383

Query: 1094 ETNSTTTLDKSHIQKEIANNLPKVWRILIELLNTHQEQYENNDVNENFESCYKNIKT--- 1150
                   + +  +++E+A     +  I  EL    +E  E     E  E   K ++    
Sbjct: 384  LKQLEEAIQE--LKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQIN 441

Query: 1151 ---NTGKTHQVISVSQ-------TFIRLKHLILEKNHLTNQLAKLKTLNNHLEIRLNHSE 1200
               +       ++ +          +  +H          +L +L+      E+     E
Sbjct: 442  QLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELE-----EELSREKEE 496

Query: 1201 QRLNTVTTELKKTWNVVNKLKLQHKQLYTHEKILKYELKHKRNIINDLKIDLEYVREKWD 1260
              L     EL+K    + +  ++  +L   E+ LK EL+ K   + +L  +LE ++EK  
Sbjct: 497  AELREEIEELEKELRELEEELIELLEL---EEALKEELEEKLEKLENLLEELEELKEKLQ 553

Query: 1261 LVKEKNEQNELDYKVLRKEFALRRL 1285
            L + K E  +L+ ++   +  L  L
Sbjct: 554  LQQLKEELRQLEDRLQELKELLEEL 578



 Score = 32.8 bits (75), Expect = 2.1
 Identities = 27/128 (21%), Positives = 65/128 (50%), Gaps = 16/128 (12%)

Query: 33  KVSDLKTRLKDVEKKLLESPEEIKI---------EYEKQMTNYKKLRELYEQRAAAAELE 83
           ++ +   RL+++E+++ E  EE++          E  +++ + ++  E  E++    E E
Sbjct: 285 ELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESE 344

Query: 84  HKTMIEKELAKNALLEAKLKEIQE-------EIQAQLEKYKENKKQILEMEKNIQLKDDE 136
            + + E++     LLE +LKE++E       E++  LE+ K+ ++ I E+++ +      
Sbjct: 345 LEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAA 404

Query: 137 INNLNENL 144
           +  + E L
Sbjct: 405 LEEIQEEL 412



 Score = 32.0 bits (73), Expect = 3.1
 Identities = 27/127 (21%), Positives = 64/127 (50%), Gaps = 16/127 (12%)

Query: 851 VSDLKTRLKDVEKKLLESPEEIKI---------EYEKQMTNYKKLRELYEQRAAAAELEH 901
           + +   RL+++E+++ E  EE++          E  +++ + ++  E  E++    E E 
Sbjct: 286 LEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESEL 345

Query: 902 KTMIEKELAKNALLEAKLKEIQE-------EIQAQLEKYKENKKQILEMEKNIQLKDDEI 954
           + + E++     LLE +LKE++E       E++  LE+ K+ ++ I E+++ +      +
Sbjct: 346 EELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAAL 405

Query: 955 NNLNENL 961
             + E L
Sbjct: 406 EEIQEEL 412



 Score = 32.0 bits (73), Expect = 3.6
 Identities = 16/98 (16%), Positives = 49/98 (50%), Gaps = 7/98 (7%)

Query: 850 LVSDLKTRLKDVEKKL--LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEK 907
            + +L+ +++++E ++       E + + E+++   ++L E  EQ     E      + K
Sbjct: 655 ALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEE-----LLK 709

Query: 908 ELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEK 945
           +L +   L  +L+  + E++   ++ ++ +K +  +E+
Sbjct: 710 KLGEIEQLIEELESRKAELEELKKELEKLEKALELLEE 747



 Score = 32.0 bits (73), Expect = 3.6
 Identities = 16/98 (16%), Positives = 49/98 (50%), Gaps = 7/98 (7%)

Query: 33  KVSDLKTRLKDVEKKL--LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEK 90
            + +L+ +++++E ++       E + + E+++   ++L E  EQ     E      + K
Sbjct: 655 ALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEE-----LLK 709

Query: 91  ELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEK 128
           +L +   L  +L+  + E++   ++ ++ +K +  +E+
Sbjct: 710 KLGEIEQLIEELESRKAELEELKKELEKLEKALELLEE 747



 Score = 30.9 bits (70), Expect = 7.5
 Identities = 23/112 (20%), Positives = 52/112 (46%), Gaps = 13/112 (11%)

Query: 33  KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 92
           ++ +L+  L+++E++L+E  E  +   E+     +KL  L E+     E      +++EL
Sbjct: 502 EIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEEL 561

Query: 93  AKNALLEAKLK-------------EIQEEIQAQLEKYKENKKQILEMEKNIQ 131
            +      +LK             E  EE++ +L++ K+  K++ E    ++
Sbjct: 562 RQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLE 613



 Score = 30.5 bits (69), Expect = 8.8
 Identities = 23/119 (19%), Positives = 53/119 (44%), Gaps = 13/119 (10%)

Query: 843 TRSVILPLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHK 902
             + +   + +L+  L+++E++L+E  E  +   E+     +KL  L E+     E    
Sbjct: 495 EEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQL 554

Query: 903 TMIEKELAKNALLEAKLK-------------EIQEEIQAQLEKYKENKKQILEMEKNIQ 948
             +++EL +      +LK             E  EE++ +L++ K+  K++ E    ++
Sbjct: 555 QQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLE 613


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 40.6 bits (96), Expect = 0.009
 Identities = 22/111 (19%), Positives = 52/111 (46%), Gaps = 14/111 (12%)

Query: 37  LKTRLKDVEKKLLESPEEIKIEYEKQMTNYK--------KLRELYEQRAAAAELEH---- 84
           L+ + ++ E  L E+ E++K E E++    +        +  +  +Q    A+ E     
Sbjct: 532 LEQKAEEAEALLKEA-EKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEII 590

Query: 85  KTMIEKELAKNALLEAK-LKEIQEEIQAQLEKYKENKKQILEMEKNIQLKD 134
           K + + +    A ++A  L E ++ +    EK ++ KK+  E ++ +++ D
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGD 641



 Score = 40.6 bits (96), Expect = 0.009
 Identities = 22/111 (19%), Positives = 52/111 (46%), Gaps = 14/111 (12%)

Query: 854 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYK--------KLRELYEQRAAAAELEH---- 901
           L+ + ++ E  L E+ E++K E E++    +        +  +  +Q    A+ E     
Sbjct: 532 LEQKAEEAEALLKEA-EKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEII 590

Query: 902 KTMIEKELAKNALLEAK-LKEIQEEIQAQLEKYKENKKQILEMEKNIQLKD 951
           K + + +    A ++A  L E ++ +    EK ++ KK+  E ++ +++ D
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGD 641


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 38.4 bits (90), Expect = 0.029
 Identities = 16/56 (28%), Positives = 30/56 (53%), Gaps = 1/56 (1%)

Query: 53  EEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEE 108
           EE++ EYE+++  Y+  +E++E      E + K  I+K       L  +L E++ E
Sbjct: 68  EELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKG-KDEEALAEELLELEAE 122



 Score = 38.4 bits (90), Expect = 0.029
 Identities = 16/56 (28%), Positives = 30/56 (53%), Gaps = 1/56 (1%)

Query: 870 EEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEE 925
           EE++ EYE+++  Y+  +E++E      E + K  I+K       L  +L E++ E
Sbjct: 68  EELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKG-KDEEALAEELLELEAE 122


>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
           UL32.  The large phosphorylated protein (UL32-like) of
           herpes viruses is the polypeptide most frequently
           reactive in immuno-blotting analyses with antisera when
           compared with other viral proteins.
          Length = 777

 Score = 38.7 bits (90), Expect = 0.032
 Identities = 30/126 (23%), Positives = 41/126 (32%), Gaps = 8/126 (6%)

Query: 571 ILEARNARLKRLEDQ-------CKNLFHKMSLTSQRSDTLSERLEELHDYYGTSNERHSP 623
           I E     +K L          C+N+  K  +     DTL ERLEE  D Y  S E   P
Sbjct: 222 INELCKRLVKELRRCPSFASVYCRNIL-KHPVDGTSVDTLLERLEEDFDIYEDSLEYDDP 280

Query: 624 RRHREGSQDSDDTRLRSPENEPLSGVSANVPDSQENSQESDDTRRRSAETNESHSGVSGN 683
                   D DD        +PL  ++ +   S +     +D               S +
Sbjct: 281 GLESTDEDDDDDGDSSLQTFKPLLDLTGSSLWSDDEESGDEDGDGSGFAPEPLIKTDSRS 340

Query: 684 VSDLQD 689
              L D
Sbjct: 341 NDTLVD 346


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 38.5 bits (90), Expect = 0.033
 Identities = 28/127 (22%), Positives = 58/127 (45%), Gaps = 15/127 (11%)

Query: 36  DLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYE------------QRAAAAELE 83
           DL+  L++V+K+  E+ +E+K +Y +++   K  RE+ E            QR +    +
Sbjct: 368 DLRAELEEVDKEFAETRDELK-DYREKLEKLK--REINELKRELDRLQEELQRLSEELAD 424

Query: 84  HKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNEN 143
               I    AK   LE + ++   EI+ Q  K ++    + + E+ +    +E + + + 
Sbjct: 425 LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKE 484

Query: 144 LTSSLAE 150
           L+    E
Sbjct: 485 LSKLQRE 491



 Score = 38.5 bits (90), Expect = 0.033
 Identities = 28/127 (22%), Positives = 58/127 (45%), Gaps = 15/127 (11%)

Query: 853 DLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYE------------QRAAAAELE 900
           DL+  L++V+K+  E+ +E+K +Y +++   K  RE+ E            QR +    +
Sbjct: 368 DLRAELEEVDKEFAETRDELK-DYREKLEKLK--REINELKRELDRLQEELQRLSEELAD 424

Query: 901 HKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNEN 960
               I    AK   LE + ++   EI+ Q  K ++    + + E+ +    +E + + + 
Sbjct: 425 LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKE 484

Query: 961 LTSSLAE 967
           L+    E
Sbjct: 485 LSKLQRE 491



 Score = 38.1 bits (89), Expect = 0.050
 Identities = 44/229 (19%), Positives = 98/229 (42%), Gaps = 25/229 (10%)

Query: 34   VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEH--------- 84
            VS ++ RL+++E+KL     E +   ++     ++  +L EQ  +  +            
Sbjct: 807  VSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEEL 866

Query: 85   KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 144
            +  +E+  A    LE++L ++++E         E + Q+ E+E+ I+  + +I    + L
Sbjct: 867  EEELEELEAALRDLESRLGDLKKER-------DELEAQLRELERKIEELEAQIEKKRKRL 919

Query: 145  TSSLAENHILNSQMTLVNNLFSNMLINPMINLDM--LIKLIQDNYLLIIDL--INNHEIN 200
            +   A+   L  +++ + +        P   L +  +   +Q     I  L  +N   I 
Sbjct: 920  SELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAIQ 979

Query: 201  D---TVSLLVDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVES 246
            +    +  L +L++K  +  L  +    L+++E YE       +EA E+
Sbjct: 980  EYEEVLKRLDELKEK--RAKLEEERKAILERIEEYEKKKREVFMEAFEA 1026



 Score = 38.1 bits (89), Expect = 0.050
 Identities = 44/229 (19%), Positives = 98/229 (42%), Gaps = 25/229 (10%)

Query: 851  VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEH--------- 901
            VS ++ RL+++E+KL     E +   ++     ++  +L EQ  +  +            
Sbjct: 807  VSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEEL 866

Query: 902  KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 961
            +  +E+  A    LE++L ++++E         E + Q+ E+E+ I+  + +I    + L
Sbjct: 867  EEELEELEAALRDLESRLGDLKKER-------DELEAQLRELERKIEELEAQIEKKRKRL 919

Query: 962  TSSLAENHILNSQMTLVNNLFSNMLINPMINLDM--LIKLIQDNYLLIIDL--INNHEIN 1017
            +   A+   L  +++ + +        P   L +  +   +Q     I  L  +N   I 
Sbjct: 920  SELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAIQ 979

Query: 1018 D---TVSLLVDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVES 1063
            +    +  L +L++K  +  L  +    L+++E YE       +EA E+
Sbjct: 980  EYEEVLKRLDELKEK--RAKLEEERKAILERIEEYEKKKREVFMEAFEA 1026



 Score = 30.8 bits (70), Expect = 7.3
 Identities = 24/112 (21%), Positives = 53/112 (47%), Gaps = 6/112 (5%)

Query: 33  KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 92
           ++  L+ RL+ + K+ L S +      E ++    +      ++    E E    +E+E 
Sbjct: 675 ELQRLRERLEGL-KRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKE-IEQLEQEE 732

Query: 93  AKN----ALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 140
            K       LE  L  +++EI+    + KE + +I E+E+++   ++ +N+L
Sbjct: 733 EKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDL 784



 Score = 30.4 bits (69), Expect = 9.5
 Identities = 26/131 (19%), Positives = 57/131 (43%), Gaps = 21/131 (16%)

Query: 33  KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAA-AELEH--KTMIE 89
           ++SD   ++ ++EK++ +  +E +   E+     + L  L ++     +EL+     + E
Sbjct: 710 ELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEE 769

Query: 90  KELAKNALLEA-----------KLKEIQEEIQAQLEKYKENKKQILEMEKNI-------Q 131
            E   + L EA           ++ EIQ E+    E+    + ++ E+E+ +       +
Sbjct: 770 LEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKE 829

Query: 132 LKDDEINNLNE 142
             + EI  L E
Sbjct: 830 YLEKEIQELQE 840



 Score = 30.4 bits (69), Expect = 9.8
 Identities = 33/152 (21%), Positives = 61/152 (40%), Gaps = 28/152 (18%)

Query: 836 ESNILATTRSVILPLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLREL------ 889
           E + L +    I   + +L   L D  +K+ E  +EI+   +++    ++L EL      
Sbjct: 689 ELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSS 748

Query: 890 YEQRAAAAELEHKTMI----EKELAKNALLEA-----------KLKEIQEEIQAQLEKYK 934
            EQ     + E K +     E E   + L EA           ++ EIQ E+    E+  
Sbjct: 749 LEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVS 808

Query: 935 ENKKQILEMEKNI-------QLKDDEINNLNE 959
             + ++ E+E+ +       +  + EI  L E
Sbjct: 809 RIEARLREIEQKLNRLTLEKEYLEKEIQELQE 840


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 37.9 bits (88), Expect = 0.039
 Identities = 26/93 (27%), Positives = 43/93 (46%), Gaps = 4/93 (4%)

Query: 37  LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 96
            K  L  +E++L+   +E K  + +Q+    +L    E     A        +KEL    
Sbjct: 62  KKKELSQLEEQLINQKKEQKNLFNEQIKQ-FELALQDEIAKLEALELLNLEKDKELE--- 117

Query: 97  LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKN 129
           LLE +L E+ +E+Q QL+   E  ++  E  KN
Sbjct: 118 LLEKELDELSKELQKQLQNTAEIIEKKRENNKN 150



 Score = 37.9 bits (88), Expect = 0.039
 Identities = 26/93 (27%), Positives = 43/93 (46%), Gaps = 4/93 (4%)

Query: 854 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 913
            K  L  +E++L+   +E K  + +Q+    +L    E     A        +KEL    
Sbjct: 62  KKKELSQLEEQLINQKKEQKNLFNEQIKQ-FELALQDEIAKLEALELLNLEKDKELE--- 117

Query: 914 LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKN 946
           LLE +L E+ +E+Q QL+   E  ++  E  KN
Sbjct: 118 LLEKELDELSKELQKQLQNTAEIIEKKRENNKN 150



 Score = 31.7 bits (72), Expect = 4.0
 Identities = 27/177 (15%), Positives = 72/177 (40%), Gaps = 12/177 (6%)

Query: 55  IKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLE 114
            K   ++  +      E +E+ A     ++++  +KEL++   LE +L   ++E +    
Sbjct: 29  YKQIEQEDQSRILNTLEEFEKEANEKRAQYRSAKKKELSQ---LEEQLINQKKEQKNLFN 85

Query: 115 KYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQMTLVNNLFSNMLINPMI 174
           +  +  +  L+ ++  +L+  E+ NL ++    L E   L+     +     N       
Sbjct: 86  EQIKQFELALQ-DEIAKLEALELLNLEKDKELELLEKE-LDELSKELQKQLQNTAEI--- 140

Query: 175 NLDMLIKLIQDNYLLIIDLINNHEINDTVSLL-VDLEKKIDQETLAIQNDKQLKQLE 230
              +  K   +     +   N  ++ +++ L     E+++ +  L ++  +  +Q E
Sbjct: 141 ---IEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEEQRE 194



 Score = 31.7 bits (72), Expect = 4.0
 Identities = 27/177 (15%), Positives = 72/177 (40%), Gaps = 12/177 (6%)

Query: 872  IKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLE 931
             K   ++  +      E +E+ A     ++++  +KEL++   LE +L   ++E +    
Sbjct: 29   YKQIEQEDQSRILNTLEEFEKEANEKRAQYRSAKKKELSQ---LEEQLINQKKEQKNLFN 85

Query: 932  KYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQMTLVNNLFSNMLINPMI 991
            +  +  +  L+ ++  +L+  E+ NL ++    L E   L+     +     N       
Sbjct: 86   EQIKQFELALQ-DEIAKLEALELLNLEKDKELELLEKE-LDELSKELQKQLQNTAEI--- 140

Query: 992  NLDMLIKLIQDNYLLIIDLINNHEINDTVSLL-VDLEKKIDQETLAIQNDKQLKQLE 1047
               +  K   +     +   N  ++ +++ L     E+++ +  L ++  +  +Q E
Sbjct: 141  ---IEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEEQRE 194


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 38.1 bits (89), Expect = 0.048
 Identities = 74/417 (17%), Positives = 148/417 (35%), Gaps = 54/417 (12%)

Query: 870  EEIKIEYEKQMTNYKKLRELYEQ-RAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQA 928
                I+ +K +   KKLRE  E   +  AE+  +       AK      K + I  +   
Sbjct: 816  LACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLLKKETIYLQSAQ 875

Query: 929  QLEKYKENKKQILEMEKNIQLKDD--EINNL---NENLTSSLAENHILNSQMTLVNNLFS 983
            ++E         L   +  +LK D   I++L   N  L S + E         L  +L S
Sbjct: 876  RVE---------LAERQLQELKIDVKSISSLKLVNLELESEIIE---------LKKSLSS 917

Query: 984  NMLINPMINLDMLIKL--IQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQETLAIQNDK 1041
            +++ N     +++ +L  + +N  L       +     ++ L ++E K+ + +   ++  
Sbjct: 918  DLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYED-- 975

Query: 1042 QLKQLENYETPMSSAMVEAVESNQTSTSKDSANIQLDHAETNTNEESPADQDETNSTTTL 1101
             LK+        + A  E     +           L  +     E      +  +++  +
Sbjct: 976  LLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSASKII 1035

Query: 1102 DKSHIQKEIANNLPKVWRILIELLNTHQEQYEN------NDVNENFESCYKNIKTNTGKT 1155
                 +  I   L K+  +L+   N  Q +Y+       N + ++ +        N  KT
Sbjct: 1036 SSESTELSILKPLQKLKGLLLLENNQLQARYKALKLRRENSLLDDKQLYQLESTENLLKT 1095

Query: 1156 HQVISVSQTFIRLKHLILEKNHLTNQLAKLKTLNNHLEI---------RLNHSEQRLNTV 1206
               I+V    +  ++L+   N L   +A++  LN   EI          L    Q+L+ +
Sbjct: 1096 ---INVKDLEVTNRNLVKPANVLQFIVAQMIKLNLLQEISKFLSQLVNTLEPVFQKLSVL 1152

Query: 1207 TTELKKTWNVVN--------KLKLQHKQLYTHEKILKYELKHKRNIINDLKIDLEYV 1255
              EL   +   N              ++      +   + K   + +NDLK +L  +
Sbjct: 1153 QLELDGLFWEANLEALPSPPPFAALSEKRLYQSALYDEKSKLSSSEVNDLKNELIAL 1209



 Score = 33.5 bits (77), Expect = 1.3
 Identities = 37/189 (19%), Positives = 73/189 (38%), Gaps = 28/189 (14%)

Query: 53  EEIKIEYEKQMTNYKKLRELYEQ-RAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQA 111
               I+ +K +   KKLRE  E   +  AE+  +       AK      K + I  +   
Sbjct: 816 LACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLLKKETIYLQSAQ 875

Query: 112 QLEKYKENKKQILEMEKNIQLKDD--EINNL---NENLTSSLAENHILNSQMTLVNNLFS 166
           ++E         L   +  +LK D   I++L   N  L S + E         L  +L S
Sbjct: 876 RVE---------LAERQLQELKIDVKSISSLKLVNLELESEIIE---------LKKSLSS 917

Query: 167 NMLINPMINLDMLIKL--IQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQETLAIQNDK 224
           +++ N     +++ +L  + +N  L       +     ++ L ++E K+ + +   ++  
Sbjct: 918 DLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYED-- 975

Query: 225 QLKQLENYE 233
            LK+     
Sbjct: 976 LLKKSTILV 984



 Score = 32.0 bits (73), Expect = 4.2
 Identities = 29/176 (16%), Positives = 66/176 (37%), Gaps = 21/176 (11%)

Query: 1119 RILIELLNTHQEQYENNDVNENFESCYKNIKTNTGKTHQVISVSQTFIRLKHLILEKNHL 1178
            +    +    ++  E     ++  S       N     ++I + ++      LI      
Sbjct: 872  QSAQRVELAERQLQELKIDVKSISSLKL---VNLELESEIIELKKSL--SSDLIENLEFK 926

Query: 1179 TNQLAKLKTLNNHLEI--------RLNHSEQRLNTVTTELKKTWNVVNKLKLQHKQLYTH 1230
            T  +A+LK L N++++               +L+ V ++LK+T         +++ L   
Sbjct: 927  TELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKET-------SEEYEDLLKK 979

Query: 1231 EKILKYELKHKRNIINDLKIDL-EYVREKWDLVKEKNEQNELDYKVLRKEFALRRL 1285
              IL  E     + + + K +L E  ++   L +   +  EL  +V   + A + +
Sbjct: 980  STILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSASKII 1035



 Score = 31.6 bits (72), Expect = 5.2
 Identities = 71/454 (15%), Positives = 134/454 (29%), Gaps = 74/454 (16%)

Query: 848  LPLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEK 907
            L   ++L  RLK +   +     +++     +     +L +L+E                
Sbjct: 923  LEFKTELIARLKKLLNNI-----DLEEGPSIEYVKLPELNKLHEV--------------- 962

Query: 908  ELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 967
                    E+KLKE  EE +  L+K     ++  +    ++    E+  L++   +    
Sbjct: 963  --------ESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQES 1014

Query: 968  NHILNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLE 1027
               L      V  L S         +             +  L     + +         
Sbjct: 1015 TKQLKELPVEVAELQSAS------KIISSESTELSILKPLQKLKGLLLLENNQLQARYKA 1068

Query: 1028 KKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTSKDSANIQLDHAET----N 1083
             K+ +E   +  DKQL QLE+ E  + +  V+ +E    +  K +  +Q   A+      
Sbjct: 1069 LKLRRENSLLD-DKQLYQLESTENLLKTINVKDLEVTNRNLVKPANVLQFIVAQMIKLNL 1127

Query: 1084 TNEES-----------PADQDETNSTTTLDKSHIQKEIANNLPKVWRILIEL-------- 1124
              E S           P  Q  +     LD    +  +           +          
Sbjct: 1128 LQEISKFLSQLVNTLEPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKRLYQSAL 1187

Query: 1125 --LNTHQEQYENNDVNENFESCYKNIKTNTGKTHQVISVSQTFIRLKHLILEKNHLTNQL 1182
                +     E ND+     + +  I +   +  +          LK LI E    T   
Sbjct: 1188 YDEKSKLSSSEVNDLKNELIALFSKIFSGWPRGDK----------LKKLISEGWVPTEYS 1237

Query: 1183 AKLKTLNNHLEIRLNHSEQRLNTVTTELKKTWNVVNKLKLQHKQLYTHEKILKYELKH-K 1241
              LK  NN L  + +      N     L  +  + N L     +       +   L++  
Sbjct: 1238 TSLKGFNN-LNKKFDTPASMSNEKLLSLLNS--IDNLLSSYKLEEEVLPATINSLLQYIN 1294

Query: 1242 RNIINDLKIDLEYVREKWDLVKEKNEQNELDYKV 1275
              + N L+     +R K       N +   D+  
Sbjct: 1295 VGLFNALRTKASSLRWKSATEVNYNSEELDDWCR 1328



 Score = 30.8 bits (70), Expect = 8.4
 Identities = 41/228 (17%), Positives = 77/228 (33%), Gaps = 31/228 (13%)

Query: 23   LTSSTSMVVFKVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAEL 82
            L  S S  + +  + KT L    KKLL +  +++     +     +L +L+E        
Sbjct: 911  LKKSLSSDLIENLEFKTELIARLKKLLNN-IDLEEGPSIEYVKLPELNKLHEV------- 962

Query: 83   EHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 142
                            E+KLKE  EE +  L+K     ++  +    ++    E+  L++
Sbjct: 963  ----------------ESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSK 1006

Query: 143  NLTSSLAENHILNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDT 202
               +       L      V  L S         +             +  L     + + 
Sbjct: 1007 QYGALQESTKQLKELPVEVAELQSAS------KIISSESTELSILKPLQKLKGLLLLENN 1060

Query: 203  VSLLVDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTS 250
                     K+ +E   +  DKQL QLE+ E  + +  V+ +E    +
Sbjct: 1061 QLQARYKALKLRRENSLLD-DKQLYQLESTENLLKTINVKDLEVTNRN 1107


>gnl|CDD|223778 COG0706, YidC, Preprotein translocase subunit YidC [Intracellular
           trafficking and secretion].
          Length = 314

 Score = 36.7 bits (85), Expect = 0.097
 Identities = 15/41 (36%), Positives = 23/41 (56%)

Query: 100 AKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 140
           AK++E+Q +I+   EKYK   KQ  + E     K  ++N L
Sbjct: 137 AKMQELQPKIKEIQEKYKGTDKQKQQQEMMKLYKKHKVNPL 177



 Score = 36.7 bits (85), Expect = 0.097
 Identities = 15/41 (36%), Positives = 23/41 (56%)

Query: 917 AKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 957
           AK++E+Q +I+   EKYK   KQ  + E     K  ++N L
Sbjct: 137 AKMQELQPKIKEIQEKYKGTDKQKQQQEMMKLYKKHKVNPL 177


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
           sigma54-dependent transcription [Transcription / Signal
           transduction mechanisms].
          Length = 225

 Score = 35.8 bits (83), Expect = 0.11
 Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 11/89 (12%)

Query: 54  EIKIEYEKQMTNYKKL-RELYEQRAAAAELEHKTMI-----EKELAKNAL-----LEAKL 102
           + +    + +   K+L R+L E +A A +LE K  +      ++LA+ AL     LE   
Sbjct: 42  KARQALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGNEDLAREALEEKQSLEDLA 101

Query: 103 KEIQEEIQAQLEKYKENKKQILEMEKNIQ 131
           K ++ E+Q   E+ ++ KKQ+  +E+ I 
Sbjct: 102 KALEAELQQAEEQVEKLKKQLAALEQKIA 130



 Score = 35.8 bits (83), Expect = 0.11
 Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 11/89 (12%)

Query: 871 EIKIEYEKQMTNYKKL-RELYEQRAAAAELEHKTMI-----EKELAKNAL-----LEAKL 919
           + +    + +   K+L R+L E +A A +LE K  +      ++LA+ AL     LE   
Sbjct: 42  KARQALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGNEDLAREALEEKQSLEDLA 101

Query: 920 KEIQEEIQAQLEKYKENKKQILEMEKNIQ 948
           K ++ E+Q   E+ ++ KKQ+  +E+ I 
Sbjct: 102 KALEAELQQAEEQVEKLKKQLAALEQKIA 130


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 35.8 bits (83), Expect = 0.17
 Identities = 22/105 (20%), Positives = 50/105 (47%), Gaps = 3/105 (2%)

Query: 41  LKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQ-RAAAAELEHKTMIEKELAK--NAL 97
           L+ +++ L E+ E +K +Y+  M   + L  +  + R     LE +    K+L       
Sbjct: 142 LEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDC 201

Query: 98  LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 142
              +L   +E+++  L++     K++ E+E+ +Q  + +I +L  
Sbjct: 202 DPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTN 246



 Score = 35.8 bits (83), Expect = 0.17
 Identities = 22/105 (20%), Positives = 50/105 (47%), Gaps = 3/105 (2%)

Query: 858 LKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQ-RAAAAELEHKTMIEKELAK--NAL 914
           L+ +++ L E+ E +K +Y+  M   + L  +  + R     LE +    K+L       
Sbjct: 142 LEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDC 201

Query: 915 LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 959
              +L   +E+++  L++     K++ E+E+ +Q  + +I +L  
Sbjct: 202 DPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTN 246


>gnl|CDD|181362 PRK08296, PRK08296, hypothetical protein; Provisional.
          Length = 603

 Score = 35.8 bits (83), Expect = 0.20
 Identities = 23/72 (31%), Positives = 32/72 (44%), Gaps = 9/72 (12%)

Query: 444 DEKDIVKGNVVDYEATKQTRTESPSEEQKSKQLIRQDRITEE------TDESRPFESA-E 496
           D+ +I  G + DY    + R E+  E         +DRI EE       DE + F+    
Sbjct: 305 DDLEIPLGYIKDYIGRLR-RGETI-ERPTEALRAERDRIVEEYRDLLDGDERKAFDELLG 362

Query: 497 CSRTSQPEVENH 508
            +RT  P VENH
Sbjct: 363 LARTVYPYVENH 374


>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
           protein turnover, chaperones].
          Length = 579

 Score = 35.8 bits (83), Expect = 0.21
 Identities = 29/103 (28%), Positives = 40/103 (38%), Gaps = 18/103 (17%)

Query: 51  SPEEIK--IEYEKQMTNY-KKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQE 107
           S EEI+  +E  +      KK REL E R  A  L +    EK L +   +  + KE  E
Sbjct: 484 SDEEIERMVEDAEANAALDKKFRELVEARNEAESLIYSL--EKALKEIVKVSEEEKEKIE 541

Query: 108 EIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 150
           E    LE+  E +K             +EI    E L     +
Sbjct: 542 EAITDLEEALEGEK-------------EEIKAKIEELQEVTQK 571



 Score = 35.8 bits (83), Expect = 0.21
 Identities = 29/103 (28%), Positives = 40/103 (38%), Gaps = 18/103 (17%)

Query: 868 SPEEIK--IEYEKQMTNY-KKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQE 924
           S EEI+  +E  +      KK REL E R  A  L +    EK L +   +  + KE  E
Sbjct: 484 SDEEIERMVEDAEANAALDKKFRELVEARNEAESLIYSL--EKALKEIVKVSEEEKEKIE 541

Query: 925 EIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 967
           E    LE+  E +K             +EI    E L     +
Sbjct: 542 EAITDLEEALEGEK-------------EEIKAKIEELQEVTQK 571


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 35.9 bits (82), Expect = 0.24
 Identities = 51/268 (19%), Positives = 104/268 (38%), Gaps = 22/268 (8%)

Query: 1209 ELKKTWNVVNKLKLQHKQLYTHEKILKYELK---HKRNIINDLKIDLEYVREKWDLVKEK 1265
            E+ K +    K+K +  +     KI   ELK    ++  +  LK      ++K + +K+ 
Sbjct: 1596 EVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA 1655

Query: 1266 NEQNELDYKVL--------RKEFALRRLQYDGGRSTSESGISTDTEDDNVEIVETNDESE 1317
             E+N++             +K    ++ + D  +  +E+      E    E ++  +  E
Sbjct: 1656 EEENKIKAAEEAKKAEEDKKKAEEAKKAEEDE-KKAAEALKKEAEEAKKAEELKKKEAEE 1714

Query: 1318 DDKSEVVETQSEHDVPHTNSVMSIEEDDQDEVEEL----ESETNVSAINNEQEVERTEVT 1373
              K+E ++   E +           E+D+ + EE     E +  ++ +  E+E +  E+ 
Sbjct: 1715 KKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIR 1774

Query: 1374 DNTSAVIPSRETIENEAVPVSVSPQ-EDIVDTITTSRTDHIESHQEINSSRET-----KE 1427
                AVI      E+E   + V  + +DI D          E +  IN S+E      KE
Sbjct: 1775 KEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKE 1834

Query: 1428 IPGTSDANVNYSQQFLNNKITIKTIGGK 1455
            +  + +  +  +  F  +K       G+
Sbjct: 1835 VADSKNMQLEEADAFEKHKFNKNNENGE 1862



 Score = 34.7 bits (79), Expect = 0.62
 Identities = 59/309 (19%), Positives = 122/309 (39%), Gaps = 16/309 (5%)

Query: 857  RLKDVEKKLLESPEEIKIEYEKQMTNYKKL-RELYEQRAAAAELEHKTMIEKELAKNALL 915
            +LK  E +  +  EE+K   E+      +  ++  E +  A E +     EK+ A+    
Sbjct: 1637 QLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696

Query: 916  EAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQM 975
            EA+  +  EE++ +  + K+  +++ + E+  ++K +E     E       E      + 
Sbjct: 1697 EAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEK 1756

Query: 976  TLVNNLFSNMLINPMINLDMLIKLIQ---DNYLLIIDLINNHEINDTVSLLVDLEKKIDQ 1032
              + +L                 +I+   D       +  + +I D      ++ +   +
Sbjct: 1757 KKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKE 1816

Query: 1033 ETLAIQNDKQLKQLENYETPMSSAMV--EAVESNQTSTSKDSANIQLDHAETNTNEESPA 1090
              L I + K+++     E   S  M   EA    +   +K++ N +  + E + N+E   
Sbjct: 1817 GNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDL 1876

Query: 1091 DQD------ETNSTTTLDKSHIQKEIANN--LPKVWRILIELLNTHQEQYENNDVNENFE 1142
             +D      E +    +DK  I++EI NN    K   I+ + L+  +++Y   D  E  E
Sbjct: 1877 KEDDEEEIEEADEIEKIDKDDIEREIPNNNMAGKNNDIIDDKLD--KDEYIKRDAEETRE 1934

Query: 1143 SCYKNIKTN 1151
               K  K +
Sbjct: 1935 EIIKISKKD 1943


>gnl|CDD|234212 TIGR03443, alpha_am_amid, L-aminoadipate-semialdehyde
           dehydrogenase.  Members of this protein family are
           L-aminoadipate-semialdehyde dehydrogenase (EC 1.2.1.31),
           product of the LYS2 gene. It is also called
           alpha-aminoadipate reductase. In fungi, lysine is
           synthesized via aminoadipate. Currently, all members of
           this family are fungal.
          Length = 1389

 Score = 35.8 bits (83), Expect = 0.26
 Identities = 52/187 (27%), Positives = 78/187 (41%), Gaps = 38/187 (20%)

Query: 737 NVEAESRADAVVQDR--------IDTRLKSSPGINECTSAVSAVSEESSGLRSVDAIRPV 788
           NVE   RAD  V+ R        IDT L   P + E  + V    +E   L  V  I P 
Sbjct: 693 NVECCGRADDQVKIRGFRIELGEIDTHLSQHPLVRENVTLVRRDKDEEPTL--VSYIVPQ 750

Query: 789 LNHDEMT--RRNVEAESRADAVVQGQEKPRPLTNGV--CLKTCVPMHSTPPESNILATTR 844
              DE+   +  V+ E  +D VV+G  K R L   +   LK  +P ++ P          
Sbjct: 751 DKSDELEEFKSEVDDEESSDPVVKGLIKYRKLIKDIREYLKKKLPSYAIP---------- 800

Query: 845 SVILPLVSDLKTRLKDVEKKLLESPEEIKIEYEK-QMTNYKKLRELYEQRAAAAELEHKT 903
           +VI+PL            KKL  +P   K++       +  +L  + + R+A+A  E  T
Sbjct: 801 TVIVPL------------KKLPLNPNG-KVDKPALPFPDTAQLAAVAKNRSASAADEEFT 847

Query: 904 MIEKELA 910
             E+E+ 
Sbjct: 848 ETEREIR 854


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 35.8 bits (83), Expect = 0.26
 Identities = 23/117 (19%), Positives = 52/117 (44%), Gaps = 7/117 (5%)

Query: 847 ILPLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIE 906
           I  L+  +   +  + + + E  E+I    +K+    KK   L+       +++     +
Sbjct: 359 ITDLIESINDIIDAINELIREHNEKID-NLKKEKNKAKKKLWLHLVAELKEDIDA---YQ 414

Query: 907 KELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 963
           KE      LE  +  +++EI+    + K  +K+I E+EK +   +   + +N+ L +
Sbjct: 415 KEKKG---LEKAINSLEKEIKQLEAEIKALEKEIKELEKQLTNIEPTADEINKLLKA 468



 Score = 35.0 bits (81), Expect = 0.35
 Identities = 26/170 (15%), Positives = 74/170 (43%), Gaps = 14/170 (8%)

Query: 103 KEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQMTLVN 162
           +E+ E+++  ++KY+ + ++ LE  ++I   + E +    ++    A    L   +    
Sbjct: 281 QELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNL 340

Query: 163 NLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQETLAIQN 222
                 L +P  ++++      ++   +I+      IND +  + +L ++ +++   ++ 
Sbjct: 341 QKLEEKLKDPSTSIEL------ESITDLIE-----SINDIIDAINELIREHNEKIDNLKK 389

Query: 223 DKQL--KQLENYETPMSSAMVEAVESNQT-STKAVESNQTSTSKDSANIQ 269
           +K    K+L  +        ++A +  +    KA+ S +    +  A I+
Sbjct: 390 EKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIK 439



 Score = 34.6 bits (80), Expect = 0.57
 Identities = 21/117 (17%), Positives = 51/117 (43%), Gaps = 7/117 (5%)

Query: 30  VVFKVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIE 89
           +   +  +   +  + + + E  E+I    +K+    KK   L+       +++     +
Sbjct: 359 ITDLIESINDIIDAINELIREHNEKID-NLKKEKNKAKKKLWLHLVAELKEDIDA---YQ 414

Query: 90  KELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 146
           KE      LE  +  +++EI+    + K  +K+I E+EK +   +   + +N+ L +
Sbjct: 415 KEKKG---LEKAINSLEKEIKQLEAEIKALEKEIKELEKQLTNIEPTADEINKLLKA 468



 Score = 34.2 bits (79), Expect = 0.76
 Identities = 29/215 (13%), Positives = 87/215 (40%), Gaps = 28/215 (13%)

Query: 920  KEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQMTLVN 979
            +E+ E+++  ++KY+ + ++ LE  ++I   + E +    ++    A    L   +    
Sbjct: 281  QELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNL 340

Query: 980  NLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQETLAIQN 1039
                  L +P  ++++      ++   +I+      IND +  + +L ++ +++   ++ 
Sbjct: 341  QKLEEKLKDPSTSIEL------ESITDLIE-----SINDIIDAINELIREHNEKIDNLKK 389

Query: 1040 DKQL--KQLENYETPMSSAMVEAVESNQTSTSKDSANIQLDHAETNTNEESPADQDETNS 1097
            +K    K+L  +        V  ++ +  +  K+   ++      N+ E+     +    
Sbjct: 390  EKNKAKKKLWLHL-------VAELKEDIDAYQKEKKGLE---KAINSLEKEIKQLEAEIK 439

Query: 1098 TTTLDKSHIQKEIANNLPKVWRILIELLNTHQEQY 1132
                +   ++K++ N  P       + +N   + Y
Sbjct: 440  ALEKEIKELEKQLTNIEP-----TADEINKLLKAY 469



 Score = 33.4 bits (77), Expect = 1.1
 Identities = 30/117 (25%), Positives = 52/117 (44%), Gaps = 8/117 (6%)

Query: 850 LVSDLKTRLKDVEKKLLESPEEI--KIEYEKQMTNYKKLRELYEQRAAAAELEHKTMI-- 905
           L+  L+  L+   +KL E  ++    IE E      + + ++ +        EH   I  
Sbjct: 328 LLEALEEILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDA-INELIREHNEKIDN 386

Query: 906 ---EKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 959
              EK  AK  L    + E++E+I A  ++ K  +K I  +EK I+  + EI  L +
Sbjct: 387 LKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEK 443


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 35.5 bits (82), Expect = 0.27
 Identities = 14/108 (12%), Positives = 42/108 (38%), Gaps = 9/108 (8%)

Query: 38  KTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHK-TMIEKELAKNA 96
               KD + K L+  + +           +++++   +     +++ +  +I K+     
Sbjct: 765 ARLEKDYQSKYLDLIDNLND--------AREIKDKESKAIDLDDIDFELELIGKQEINID 816

Query: 97  LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 144
            +   L+   ++  A     +  ++ I E  ++++ K   I  L E +
Sbjct: 817 YILELLQTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAI 864



 Score = 35.5 bits (82), Expect = 0.27
 Identities = 14/108 (12%), Positives = 42/108 (38%), Gaps = 9/108 (8%)

Query: 855 KTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHK-TMIEKELAKNA 913
               KD + K L+  + +           +++++   +     +++ +  +I K+     
Sbjct: 765 ARLEKDYQSKYLDLIDNLND--------AREIKDKESKAIDLDDIDFELELIGKQEINID 816

Query: 914 LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 961
            +   L+   ++  A     +  ++ I E  ++++ K   I  L E +
Sbjct: 817 YILELLQTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAI 864


>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
           subunit.  CENP-Q is one of the components that assembles
           onto the CENP-A-nucleosome distal (CAD) centromere. The
           centromere, which is the basic element of chromosome
           inheritance, is epigenetically determined in mammals.
           CENP-A, the centromere-specific histone H3 variant,
           assembles an array of nucleosomes and it is this that
           seems to be the prime candidate for specifying
           centromere identity. CENP-A nucleosomes directly recruit
           a proximal CENP-A nucleosome associated complex (NAC)
           comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
           CENP-C and CENP-H. Assembly of the CENP-A NAC at
           centromeres is dependent on CENP-M, CENP-N and CENP-T.
           Additionally, there are seven other subunits which make
           up the CENP-A-nucleosome distal (CAD) centromere,
           CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
           CENP-S, also assembling on the CENP-A NAC. Fta7 is the
           equivalent component of the fission yeast Sim4 complex.
          Length = 159

 Score = 33.9 bits (78), Expect = 0.28
 Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 3/72 (4%)

Query: 82  LEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLN 141
           L  K  +E++LA        L   +EEI+ +  + +++ + + E+EKN +  + E    +
Sbjct: 26  LRRKRALERQLAAELKQLELL---EEEIRREEAELEKDLEYLQELEKNAKALEREREEES 82

Query: 142 ENLTSSLAENHI 153
           +NL   L     
Sbjct: 83  KNLHPVLRLLES 94



 Score = 33.9 bits (78), Expect = 0.28
 Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 3/72 (4%)

Query: 899 LEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLN 958
           L  K  +E++LA        L   +EEI+ +  + +++ + + E+EKN +  + E    +
Sbjct: 26  LRRKRALERQLAAELKQLELL---EEEIRREEAELEKDLEYLQELEKNAKALEREREEES 82

Query: 959 ENLTSSLAENHI 970
           +NL   L     
Sbjct: 83  KNLHPVLRLLES 94


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 35.1 bits (82), Expect = 0.33
 Identities = 25/92 (27%), Positives = 44/92 (47%), Gaps = 3/92 (3%)

Query: 33  KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYE--QRAAAAELEHKTMIEK 90
           K  +L  +L+ +EK+  E  E+ + E E++    +K  E  E        ELE  + +  
Sbjct: 94  KEENLDRKLELLEKRE-EELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTA 152

Query: 91  ELAKNALLEAKLKEIQEEIQAQLEKYKENKKQ 122
           E AK  LLE   +E + E    +++ +E  K+
Sbjct: 153 EEAKEILLEKVEEEARHEAAVLIKEIEEEAKE 184



 Score = 34.8 bits (81), Expect = 0.40
 Identities = 31/119 (26%), Positives = 53/119 (44%), Gaps = 12/119 (10%)

Query: 40  RLKDVEKKLLESPEE---IKIEYEKQMTNYKKLREL-----YEQRAAAAELEHKTMIEKE 91
           R K  E K+ E+ EE   I  E +K+    KK   L       +     E E +     E
Sbjct: 25  RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRER-RNE 83

Query: 92  LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 150
           L K   LE +L + +E +  +LE  ++ ++++ + EK ++ K  E+    E L   + E
Sbjct: 84  LQK---LEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEE 139



 Score = 34.8 bits (81), Expect = 0.40
 Identities = 31/119 (26%), Positives = 53/119 (44%), Gaps = 12/119 (10%)

Query: 857 RLKDVEKKLLESPEE---IKIEYEKQMTNYKKLREL-----YEQRAAAAELEHKTMIEKE 908
           R K  E K+ E+ EE   I  E +K+    KK   L       +     E E +     E
Sbjct: 25  RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRER-RNE 83

Query: 909 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 967
           L K   LE +L + +E +  +LE  ++ ++++ + EK ++ K  E+    E L   + E
Sbjct: 84  LQK---LEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEE 139



 Score = 34.4 bits (80), Expect = 0.55
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 3/88 (3%)

Query: 854 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYE--QRAAAAELEHKTMIEKELAK 911
           L  +L+ +EK+  E  E+ + E E++    +K  E  E        ELE  + +  E AK
Sbjct: 98  LDRKLELLEKRE-EELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAK 156

Query: 912 NALLEAKLKEIQEEIQAQLEKYKENKKQ 939
             LLE   +E + E    +++ +E  K+
Sbjct: 157 EILLEKVEEEARHEAAVLIKEIEEEAKE 184



 Score = 32.8 bits (76), Expect = 1.7
 Identities = 18/110 (16%), Positives = 57/110 (51%), Gaps = 13/110 (11%)

Query: 858 LKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQ-----RAAAAELEHKTM-----IEK 907
           L++ +K+     +E  +E +++     KLR  +E+     R    +LE + +     +++
Sbjct: 44  LEEAKKEAEAIKKEALLEAKEE---IHKLRNEFEKELRERRNELQKLEKRLLQKEENLDR 100

Query: 908 ELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 957
           +L      E +L++ ++E++ + ++ ++ ++++ E+ +    + + I+ L
Sbjct: 101 KLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGL 150



 Score = 31.3 bits (72), Expect = 4.9
 Identities = 20/79 (25%), Positives = 39/79 (49%), Gaps = 9/79 (11%)

Query: 66  YKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILE 125
           Y   +++ E +   AE E K ++E+        + + + I++E  A LE  +E  K   E
Sbjct: 22  YFVRKKIAEAKIKEAEEEAKRILEEA-------KKEAEAIKKE--ALLEAKEEIHKLRNE 72

Query: 126 MEKNIQLKDDEINNLNENL 144
            EK ++ + +E+  L + L
Sbjct: 73  FEKELRERRNELQKLEKRL 91



 Score = 31.3 bits (72), Expect = 4.9
 Identities = 20/79 (25%), Positives = 39/79 (49%), Gaps = 9/79 (11%)

Query: 883 YKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILE 942
           Y   +++ E +   AE E K ++E+        + + + I++E  A LE  +E  K   E
Sbjct: 22  YFVRKKIAEAKIKEAEEEAKRILEEA-------KKEAEAIKKE--ALLEAKEEIHKLRNE 72

Query: 943 MEKNIQLKDDEINNLNENL 961
            EK ++ + +E+  L + L
Sbjct: 73  FEKELRERRNELQKLEKRL 91


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 35.0 bits (81), Expect = 0.35
 Identities = 22/106 (20%), Positives = 61/106 (57%), Gaps = 3/106 (2%)

Query: 37  LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 96
           L+ +++++E+++ E  +EI+ E E+++   K+L+E  E+    +E   +     EL +  
Sbjct: 257 LEEKIRELEERIEELKKEIE-ELEEKVKELKELKEKAEEYIKLSEFYEE--YLDELREIE 313

Query: 97  LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 142
              ++L+E    I+ ++++ +E ++++ E++K ++  +  +  L E
Sbjct: 314 KRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEE 359



 Score = 35.0 bits (81), Expect = 0.35
 Identities = 22/106 (20%), Positives = 61/106 (57%), Gaps = 3/106 (2%)

Query: 854 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 913
           L+ +++++E+++ E  +EI+ E E+++   K+L+E  E+    +E   +     EL +  
Sbjct: 257 LEEKIRELEERIEELKKEIE-ELEEKVKELKELKEKAEEYIKLSEFYEE--YLDELREIE 313

Query: 914 LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 959
              ++L+E    I+ ++++ +E ++++ E++K ++  +  +  L E
Sbjct: 314 KRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEE 359



 Score = 34.3 bits (79), Expect = 0.78
 Identities = 29/118 (24%), Positives = 58/118 (49%), Gaps = 16/118 (13%)

Query: 35  SDLKTRLKDVEKKLLESPEEIKI----------EYEKQMTNYKKLRELYEQRAAAAELEH 84
            +++  +K+ EK+L E   EI            E EK     K+L EL E+     +   
Sbjct: 189 ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELE 248

Query: 85  KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 142
              +E    K   LE K++E++E I+   ++ +E ++++ E+++ ++ K +E   L+E
Sbjct: 249 S--LEGSKRK---LEEKIRELEERIEELKKEIEELEEKVKELKE-LKEKAEEYIKLSE 300



 Score = 34.3 bits (79), Expect = 0.78
 Identities = 29/118 (24%), Positives = 58/118 (49%), Gaps = 16/118 (13%)

Query: 852 SDLKTRLKDVEKKLLESPEEIKI----------EYEKQMTNYKKLRELYEQRAAAAELEH 901
            +++  +K+ EK+L E   EI            E EK     K+L EL E+     +   
Sbjct: 189 ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELE 248

Query: 902 KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 959
              +E    K   LE K++E++E I+   ++ +E ++++ E+++ ++ K +E   L+E
Sbjct: 249 S--LEGSKRK---LEEKIRELEERIEELKKEIEELEEKVKELKE-LKEKAEEYIKLSE 300



 Score = 33.5 bits (77), Expect = 1.0
 Identities = 23/121 (19%), Positives = 54/121 (44%), Gaps = 21/121 (17%)

Query: 37  LKTRLKDVEKKLLESPE------------EIKIEYEKQMTNYKKLRELYEQRAAAAELEH 84
           LK  ++++E+K+ E  E            E   EY  ++   +K     E+         
Sbjct: 271 LKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEING----- 325

Query: 85  KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 144
              IE+ + +    E +L+E++++++   ++ +E +++    E+    K +E+  L + L
Sbjct: 326 ---IEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKK-EELERLKKRL 381

Query: 145 T 145
           T
Sbjct: 382 T 382



 Score = 33.5 bits (77), Expect = 1.0
 Identities = 23/121 (19%), Positives = 54/121 (44%), Gaps = 21/121 (17%)

Query: 854 LKTRLKDVEKKLLESPE------------EIKIEYEKQMTNYKKLRELYEQRAAAAELEH 901
           LK  ++++E+K+ E  E            E   EY  ++   +K     E+         
Sbjct: 271 LKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEING----- 325

Query: 902 KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 961
              IE+ + +    E +L+E++++++   ++ +E +++    E+    K +E+  L + L
Sbjct: 326 ---IEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKK-EELERLKKRL 381

Query: 962 T 962
           T
Sbjct: 382 T 382



 Score = 32.7 bits (75), Expect = 2.0
 Identities = 25/97 (25%), Positives = 51/97 (52%), Gaps = 5/97 (5%)

Query: 33  KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 92
           K+ +L+  L ++ K+L E   E   E E+++   K+L   Y +     + E +  +E+E 
Sbjct: 564 KLDELEEELAELLKELEELGFESVEELEERL---KELEPFYNEYLELKDAEKE--LEREE 618

Query: 93  AKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKN 129
            +   LE +L +  EE+    ++ +E +K++ E+EK 
Sbjct: 619 KELKKLEEELDKAFEELAETEKRLEELRKELEELEKK 655



 Score = 31.2 bits (71), Expect = 6.7
 Identities = 24/94 (25%), Positives = 49/94 (52%), Gaps = 5/94 (5%)

Query: 853 DLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKN 912
           +L+  L ++ K+L E   E   E E+++   K+L   Y +     + E +  +E+E  + 
Sbjct: 567 ELEEELAELLKELEELGFESVEELEERL---KELEPFYNEYLELKDAEKE--LEREEKEL 621

Query: 913 ALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKN 946
             LE +L +  EE+    ++ +E +K++ E+EK 
Sbjct: 622 KKLEEELDKAFEELAETEKRLEELRKELEELEKK 655


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 35.1 bits (81), Expect = 0.40
 Identities = 22/108 (20%), Positives = 54/108 (50%), Gaps = 15/108 (13%)

Query: 41  LKDVEKKLLESPEEIKIEYEK-QMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLE 99
           L  V+++     +E   E E+ ++T Y+K   + +             +E+   +N+ L+
Sbjct: 394 LSKVKEEERPREKEGTEEEERREITVYEK--RIKKLEE---------TVERLEEENSELK 442

Query: 100 AKLKEIQEEI---QAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 144
            +L+E++ EI   +++LE+++   +  +  ++ I+ +D  I  L + L
Sbjct: 443 RELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKEL 490



 Score = 35.1 bits (81), Expect = 0.40
 Identities = 22/108 (20%), Positives = 54/108 (50%), Gaps = 15/108 (13%)

Query: 858 LKDVEKKLLESPEEIKIEYEK-QMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLE 916
           L  V+++     +E   E E+ ++T Y+K   + +             +E+   +N+ L+
Sbjct: 394 LSKVKEEERPREKEGTEEEERREITVYEK--RIKKLEE---------TVERLEEENSELK 442

Query: 917 AKLKEIQEEI---QAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 961
            +L+E++ EI   +++LE+++   +  +  ++ I+ +D  I  L + L
Sbjct: 443 RELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKEL 490


>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1).  This family
            consists of several mammalian dentin matrix protein 1
            (DMP1) sequences. The dentin matrix acidic phosphoprotein
            1 (DMP1) gene has been mapped to human chromosome 4q21.
            DMP1 is a bone and teeth specific protein initially
            identified from mineralised dentin. DMP1 is primarily
            localised in the nuclear compartment of undifferentiated
            osteoblasts. In the nucleus, DMP1 acts as a
            transcriptional component for activation of
            osteoblast-specific genes like osteocalcin. During the
            early phase of osteoblast maturation, Ca(2+) surges into
            the nucleus from the cytoplasm, triggering the
            phosphorylation of DMP1 by a nuclear isoform of casein
            kinase II. This phosphorylated DMP1 is then exported out
            into the extracellular matrix, where it regulates
            nucleation of hydroxyapatite. DMP1 is a unique molecule
            that initiates osteoblast differentiation by
            transcription in the nucleus and orchestrates mineralised
            matrix formation extracellularly, at later stages of
            osteoblast maturation. The DMP1 gene has been found to be
            ectopically expressed in lung cancer although the reason
            for this is unknown.
          Length = 514

 Score = 34.6 bits (79), Expect = 0.42
 Identities = 37/165 (22%), Positives = 69/165 (41%), Gaps = 13/165 (7%)

Query: 1277 RKEFALRRLQYDGGRSTSESGISTDTEDDNVEIVETNDESEDDKSEVVETQSEHDVPHTN 1336
            RK F   R+  + GR   +   + + + D+ E    +   E  +SE  E   E+     +
Sbjct: 266  RKFFRKSRISEEDGRGELDDSNTMEVKSDSTENAGLSQSREHSRSESQEDSEENQSQEDS 325

Query: 1337 SVMSIEEDDQDEVEELESETNVSAINNEQEVERTEVTDNTSAVIPSRETIENEAVPVSVS 1396
                  ++ QD   E   E ++ +  N  E +  EV   +    P   T  +E    S S
Sbjct: 326  ------QEVQDPSSESSQEADLPSQENSSESQE-EVVSESRGDNPDNTTSHSEDQEDSES 378

Query: 1397 PQEDIVDTITTSRTDHI------ESHQEINSSRETKEIPGTSDAN 1435
             +ED +DT ++S +         ES++ ++SS E+ E     +++
Sbjct: 379  SEEDSLDTPSSSESQSTEEQADSESNESLSSSEESPESTEDENSS 423


>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0).  Part of the CF(0)
           (base unit) of the ATP synthase. The base unit is
           thought to translocate protons through membrane (inner
           membrane in mitochondria, thylakoid membrane in plants,
           cytoplasmic membrane in bacteria). The B subunits are
           thought to interact with the stalk of the CF(1)
           subunits. This domain should not be confused with the ab
           CF(1) proteins (in the head of the ATP synthase) which
           are found in pfam00006.
          Length = 132

 Score = 32.6 bits (75), Expect = 0.50
 Identities = 27/103 (26%), Positives = 41/103 (39%), Gaps = 11/103 (10%)

Query: 849 PLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKE 908
           PL   L  R + +   + E+ EE   +          L    EQ+ A A  E   +I   
Sbjct: 23  PLGKILDERKEKIANNIKEA-EERLKQAAA-------LLAEAEQQLAQARAEASEIINN- 73

Query: 909 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKD 951
            AK    + K +EI  E Q   E+  E+ +  +E EK   L +
Sbjct: 74  -AKKEAQKLK-EEILAEAQKDAERLLESARAEIEQEKEQALAE 114



 Score = 31.5 bits (72), Expect = 1.2
 Identities = 25/98 (25%), Positives = 39/98 (39%), Gaps = 11/98 (11%)

Query: 37  LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 96
           L  R + +   + E+ EE   +          L    EQ+ A A  E   +I    AK  
Sbjct: 28  LDERKEKIANNIKEA-EERLKQAAA-------LLAEAEQQLAQARAEASEIINN--AKKE 77

Query: 97  LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKD 134
             + K +EI  E Q   E+  E+ +  +E EK   L +
Sbjct: 78  AQKLK-EEILAEAQKDAERLLESARAEIEQEKEQALAE 114


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 34.5 bits (79), Expect = 0.50
 Identities = 31/105 (29%), Positives = 58/105 (55%), Gaps = 6/105 (5%)

Query: 28  SMVVFKVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAA-AAELEHKT 86
           SMV  K  D K  ++ +EK+L +S  + ++  EKQ+   KK ++  E+ AA AA     +
Sbjct: 466 SMVSAKQKD-KQSMQSMEKRL-KSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAAS 523

Query: 87  MIEK-ELAKNAL--LEAKLKEIQEEIQAQLEKYKENKKQILEMEK 128
             E  E  K A   LE ++K+++ +++ + E+ +  +K+  E+ K
Sbjct: 524 REECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQELRK 568



 Score = 34.5 bits (79), Expect = 0.57
 Identities = 26/95 (27%), Positives = 53/95 (55%), Gaps = 5/95 (5%)

Query: 855 KTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAA-AAELEHKTMIEK-ELAKN 912
           K  ++ +EK+L +S  + ++  EKQ+   KK ++  E+ AA AA     +  E  E  K 
Sbjct: 475 KQSMQSMEKRL-KSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQ 533

Query: 913 AL--LEAKLKEIQEEIQAQLEKYKENKKQILEMEK 945
           A   LE ++K+++ +++ + E+ +  +K+  E+ K
Sbjct: 534 AKQDLEMEIKKLEHDLKLKEEECRMLEKEAQELRK 568


>gnl|CDD|216883 pfam02118, Srg, Srg family chemoreceptor. 
          Length = 275

 Score = 33.7 bits (78), Expect = 0.61
 Identities = 11/45 (24%), Positives = 23/45 (51%), Gaps = 7/45 (15%)

Query: 4   FTILSMIVFKNFILTIGSPLTSSTSMVVFKVSDLKTRLKDVEKKL 48
           F ++  I+    + TI      ++ +  +K++ L  R+K VE+ L
Sbjct: 172 FQLIYFILI--LVFTI-----ITSIVTSYKLAKLSKRIKSVERSL 209


>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS).  This family
           consists of several mammalian selenoprotein S (SelS)
           sequences. SelS is a plasma membrane protein and is
           present in a variety of tissues and cell types. The
           function of this family is unknown.
          Length = 190

 Score = 32.9 bits (75), Expect = 0.90
 Identities = 17/43 (39%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 89  EKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQ 131
           +K + +   LEA    +QE++ A+  K+KE +KQ LE EK  Q
Sbjct: 76  DKVVRRQEALEAARLRMQEDLDAKAAKFKEKQKQ-LEEEKRRQ 117



 Score = 32.9 bits (75), Expect = 0.90
 Identities = 17/43 (39%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 906 EKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQ 948
           +K + +   LEA    +QE++ A+  K+KE +KQ LE EK  Q
Sbjct: 76  DKVVRRQEALEAARLRMQEDLDAKAAKFKEKQKQ-LEEEKRRQ 117


>gnl|CDD|234801 PRK00591, prfA, peptide chain release factor 1; Validated.
          Length = 359

 Score = 33.5 bits (78), Expect = 0.97
 Identities = 26/102 (25%), Positives = 49/102 (48%), Gaps = 23/102 (22%)

Query: 28  SMVVFKVSDLKTRLKDVEKKLLESPEEIK---------IEY---EKQMTNYKKLRELYEQ 75
             ++ K+  L+ R +++E  LL  PE I           EY   E  +  Y++ ++  E 
Sbjct: 2   PSMLDKLEALEERYEELEA-LLSDPEVISDQKRFRKLSKEYAELEPIVEAYREYKQAQED 60

Query: 76  RAAAAEL-------EHKTMIEKELAKNALLEAKLKEIQEEIQ 110
              A E+       E + M ++EL +   LE +L+E++EE++
Sbjct: 61  LEEAKEMLEEESDPEMREMAKEELKE---LEERLEELEEELK 99



 Score = 31.6 bits (73), Expect = 4.0
 Identities = 25/94 (26%), Positives = 45/94 (47%), Gaps = 23/94 (24%)

Query: 853 DLKTRLKDVEKKLLESPEEIK---------IEY---EKQMTNYKKLRELYEQRAAAAEL- 899
            L+ R +++E  LL  PE I           EY   E  +  Y++ ++  E    A E+ 
Sbjct: 10  ALEERYEELEA-LLSDPEVISDQKRFRKLSKEYAELEPIVEAYREYKQAQEDLEEAKEML 68

Query: 900 ------EHKTMIEKELAKNALLEAKLKEIQEEIQ 927
                 E + M ++EL +   LE +L+E++EE++
Sbjct: 69  EEESDPEMREMAKEELKE---LEERLEELEEELK 99


>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family.  This family includes PspA
           a protein that suppresses sigma54-dependent
           transcription. The PspA protein, a negative regulator of
           the Escherichia coli phage shock psp operon, is produced
           when virulence factors are exported through secretins in
           many Gram-negative pathogenic bacteria and its homologue
           in plants, VIPP1, plays a critical role in thylakoid
           biogenesis, essential for photosynthesis. Activation of
           transcription by the enhancer-dependent bacterial
           sigma(54) containing RNA polymerase occurs through ATP
           hydrolysis-driven protein conformational changes enabled
           by activator proteins that belong to the large AAA(+)
           mechanochemical protein family. It has been shown that
           PspA directly and specifically acts upon and binds to
           the AAA(+) domain of the PspF transcription activator.
          Length = 220

 Score = 32.7 bits (75), Expect = 1.1
 Identities = 29/124 (23%), Positives = 63/124 (50%), Gaps = 15/124 (12%)

Query: 41  LKDVEKKLLESPEEIKIEYEKQMTNYKKL-RELYEQRAAAAELEHK-----TMIEKELAK 94
           ++D++ +L ++ + +     + +   K+L R+L EQ+  A +LE+K     T   +ELA+
Sbjct: 32  IRDMQSELGKARQAL----AQVIARQKQLERKLEEQKEQAKKLENKARAALTKGNEELAR 87

Query: 95  NAL-----LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLA 149
            AL     LE + + ++ ++  Q    ++ +KQ+  +E  IQ    +   L   L ++ A
Sbjct: 88  EALAEIATLEKQAEALETQLTQQRSAVEQLRKQLAALETKIQQLKAKKTALKARLKAAKA 147

Query: 150 ENHI 153
           +  +
Sbjct: 148 QEAV 151



 Score = 32.7 bits (75), Expect = 1.1
 Identities = 29/124 (23%), Positives = 63/124 (50%), Gaps = 15/124 (12%)

Query: 858 LKDVEKKLLESPEEIKIEYEKQMTNYKKL-RELYEQRAAAAELEHK-----TMIEKELAK 911
           ++D++ +L ++ + +     + +   K+L R+L EQ+  A +LE+K     T   +ELA+
Sbjct: 32  IRDMQSELGKARQAL----AQVIARQKQLERKLEEQKEQAKKLENKARAALTKGNEELAR 87

Query: 912 NAL-----LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLA 966
            AL     LE + + ++ ++  Q    ++ +KQ+  +E  IQ    +   L   L ++ A
Sbjct: 88  EALAEIATLEKQAEALETQLTQQRSAVEQLRKQLAALETKIQQLKAKKTALKARLKAAKA 147

Query: 967 ENHI 970
           +  +
Sbjct: 148 QEAV 151


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 32.7 bits (75), Expect = 1.2
 Identities = 26/99 (26%), Positives = 53/99 (53%), Gaps = 3/99 (3%)

Query: 33  KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 92
            +  L+  L+D+E ++ +   EI+   E+     +KL  + ++R   A L  +  I KE 
Sbjct: 46  ALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRA-LNIEIQIAKER 104

Query: 93  AKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQ 131
             +  LE +L E+ EEI+   ++ ++ K+++  +EKN+ 
Sbjct: 105 INS--LEDELAELMEEIEKLEKEIEDLKERLERLEKNLA 141



 Score = 32.3 bits (74), Expect = 1.8
 Identities = 26/98 (26%), Positives = 53/98 (54%), Gaps = 3/98 (3%)

Query: 851 VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELA 910
           +  L+  L+D+E ++ +   EI+   E+     +KL  + ++R   A L  +  I KE  
Sbjct: 47  LEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRA-LNIEIQIAKERI 105

Query: 911 KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQ 948
            +  LE +L E+ EEI+   ++ ++ K+++  +EKN+ 
Sbjct: 106 NS--LEDELAELMEEIEKLEKEIEDLKERLERLEKNLA 141


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 33.5 bits (77), Expect = 1.2
 Identities = 28/119 (23%), Positives = 57/119 (47%), Gaps = 4/119 (3%)

Query: 34  VSDLKTRLKDVEKKL--LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKE 91
           + DL+  +++VE++L   E   E +   E+     + L EL  +R    E E +   E+ 
Sbjct: 484 LEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIE-EKRERAEEL 542

Query: 92  LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 150
             + A LEA+ +E +E      E+ +E ++++ E+   +    + I +L   + + LA 
Sbjct: 543 RERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLE-RIRTLLAA 600



 Score = 33.5 bits (77), Expect = 1.2
 Identities = 28/119 (23%), Positives = 57/119 (47%), Gaps = 4/119 (3%)

Query: 851 VSDLKTRLKDVEKKL--LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKE 908
           + DL+  +++VE++L   E   E +   E+     + L EL  +R    E E +   E+ 
Sbjct: 484 LEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIE-EKRERAEEL 542

Query: 909 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 967
             + A LEA+ +E +E      E+ +E ++++ E+   +    + I +L   + + LA 
Sbjct: 543 RERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLE-RIRTLLAA 600


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 33.3 bits (77), Expect = 1.3
 Identities = 27/102 (26%), Positives = 49/102 (48%), Gaps = 20/102 (19%)

Query: 31  VFKVSDLKTRLKDV------EKKLLESPEEIKIEYEKQMTNYKKLRELY--EQ-RAAAAE 81
           + +  D+K RL+ +      E  LL+  + I+ + ++QM   K  RE Y  EQ +A   E
Sbjct: 184 ILETLDVKERLEKLLDLLEKEIDLLQLEKRIRKKVKEQME--KSQREYYLREQLKAIQKE 241

Query: 82  LEHKTMIEKELAKNALLEAKL------KEIQEEIQAQLEKYK 117
           L      + E+ +   L  K+      KE +E+ + +L+K +
Sbjct: 242 LGEDDDDKDEVEE---LREKIEKLKLPKEAKEKAEKELKKLE 280



 Score = 32.9 bits (76), Expect = 1.6
 Identities = 27/97 (27%), Positives = 47/97 (48%), Gaps = 20/97 (20%)

Query: 853 DLKTRLKDV------EKKLLESPEEIKIEYEKQMTNYKKLRELY--EQ-RAAAAELEHKT 903
           D+K RL+ +      E  LL+  + I+ + ++QM   K  RE Y  EQ +A   EL    
Sbjct: 189 DVKERLEKLLDLLEKEIDLLQLEKRIRKKVKEQME--KSQREYYLREQLKAIQKELGEDD 246

Query: 904 MIEKELAKNALLEAKL------KEIQEEIQAQLEKYK 934
             + E+ +   L  K+      KE +E+ + +L+K +
Sbjct: 247 DDKDEVEE---LREKIEKLKLPKEAKEKAEKELKKLE 280


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 33.3 bits (76), Expect = 1.4
 Identities = 34/233 (14%), Positives = 64/233 (27%), Gaps = 15/233 (6%)

Query: 538 TLQREPITSVPTATSNTTLPSEPSVSRTRTPEEILEARNARLKRLEDQCKNLFHKMSLTS 597
              R  + S    ++   +    +V    +     E ++A +      C    H  S  +
Sbjct: 23  GSLRLGLRSNYLQSTVEDIEGPSAVDERTSGVLRDEGKHANILYNSILCNQKKHA-SFLN 81

Query: 598 QRSDTLSERLEELHDYYGTSNERHSPRRHREGSQDSDDTRLRSPENEPLSGVSANVPDSQ 657
           QR     +  +E    Y    +        E S D D   L  P+ +          + Q
Sbjct: 82  QRKSLDDDDDDEFDFLYEDDEDDAGNATSGESSTDDDSL-LELPDRDE--DADTQANNDQ 138

Query: 658 ENSQESDDTRRRSAETNESHSGVSGNVSDL--QDRIDT--RLKSSPEINECTSAVSAVSE 713
            N  + DD+     +     S    +   L  + +  T    K     N+   A    + 
Sbjct: 139 TNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTENTFKFYNFGNDGEEA----AA 194

Query: 714 ESSGLRSVDAIRPVLNHDEMTRRNVEAESRADAV---VQDRIDTRLKSSPGIN 763
           +  G        P+ + D          +  D      +          PG+ 
Sbjct: 195 KDGGKSKSSDPGPLNDSDGQGDDGDPESAEEDKAASNTRAAYTKATSVFPGLY 247


>gnl|CDD|191114 pfam04859, DUF641, Plant protein of unknown function (DUF641).
           Plant protein of unknown function.
          Length = 132

 Score = 31.6 bits (72), Expect = 1.4
 Identities = 17/64 (26%), Positives = 28/64 (43%)

Query: 94  KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHI 153
           K      +   +  EIQ Q    K  +  + ++E  ++ KD EI++L E L   L  N  
Sbjct: 68  KRQKGAPQDAILAAEIQEQRSLLKTYEIMVKKLEAEVRAKDSEIDSLREKLEELLVANSK 127

Query: 154 LNSQ 157
           L  +
Sbjct: 128 LEKR 131



 Score = 31.6 bits (72), Expect = 1.4
 Identities = 17/64 (26%), Positives = 28/64 (43%)

Query: 911 KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHI 970
           K      +   +  EIQ Q    K  +  + ++E  ++ KD EI++L E L   L  N  
Sbjct: 68  KRQKGAPQDAILAAEIQEQRSLLKTYEIMVKKLEAEVRAKDSEIDSLREKLEELLVANSK 127

Query: 971 LNSQ 974
           L  +
Sbjct: 128 LEKR 131


>gnl|CDD|216872 pfam02096, 60KD_IMP, 60Kd inner membrane protein. 
          Length = 193

 Score = 32.1 bits (74), Expect = 1.5
 Identities = 13/30 (43%), Positives = 22/30 (73%), Gaps = 1/30 (3%)

Query: 100 AKLKEIQEEIQAQLEKYKENKKQI-LEMEK 128
           AK++E+Q EI+   EKYK++ +++  EM K
Sbjct: 30  AKMQELQPEIKEIQEKYKDDPQKLQQEMMK 59



 Score = 32.1 bits (74), Expect = 1.5
 Identities = 13/30 (43%), Positives = 22/30 (73%), Gaps = 1/30 (3%)

Query: 917 AKLKEIQEEIQAQLEKYKENKKQI-LEMEK 945
           AK++E+Q EI+   EKYK++ +++  EM K
Sbjct: 30  AKMQELQPEIKEIQEKYKDDPQKLQQEMMK 59


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 31.8 bits (73), Expect = 1.6
 Identities = 16/53 (30%), Positives = 29/53 (54%), Gaps = 2/53 (3%)

Query: 88  IEKELAKNA--LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEIN 138
           I++ LA+ A  LLE +    + E+QA++ +YK+ K +     K ++ K  E  
Sbjct: 49  IKQHLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAE 101



 Score = 31.8 bits (73), Expect = 1.6
 Identities = 16/53 (30%), Positives = 29/53 (54%), Gaps = 2/53 (3%)

Query: 905 IEKELAKNA--LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEIN 955
           I++ LA+ A  LLE +    + E+QA++ +YK+ K +     K ++ K  E  
Sbjct: 49  IKQHLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAE 101


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 31.8 bits (73), Expect = 1.7
 Identities = 30/111 (27%), Positives = 58/111 (52%), Gaps = 22/111 (19%)

Query: 37  LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 96
           LKTRL+ ++K+L E  + I  E + Q+   KK RE  E+R                    
Sbjct: 67  LKTRLEKLKKELEELKQRIA-ELQAQIEKLKKGREETEER------------------TE 107

Query: 97  LLEAK--LKEIQEEIQAQLEKYKENK-KQILEMEKNIQLKDDEINNLNENL 144
           LLE    L++  ++++A+LEKY++N  ++I ++++  ++  +  N   +N+
Sbjct: 108 LLEELKQLEKELKKLKAELEKYEKNDPERIEKLKEETKVAKEAANRWTDNI 158



 Score = 31.8 bits (73), Expect = 1.7
 Identities = 30/111 (27%), Positives = 58/111 (52%), Gaps = 22/111 (19%)

Query: 854 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 913
           LKTRL+ ++K+L E  + I  E + Q+   KK RE  E+R                    
Sbjct: 67  LKTRLEKLKKELEELKQRIA-ELQAQIEKLKKGREETEER------------------TE 107

Query: 914 LLEAK--LKEIQEEIQAQLEKYKENK-KQILEMEKNIQLKDDEINNLNENL 961
           LLE    L++  ++++A+LEKY++N  ++I ++++  ++  +  N   +N+
Sbjct: 108 LLEELKQLEKELKKLKAELEKYEKNDPERIEKLKEETKVAKEAANRWTDNI 158


>gnl|CDD|232831 TIGR00115, tig, trigger factor.  Trigger factor is a
           ribosome-associated molecular chaperone and is the first
           chaperone to interact with nascent polypeptide. Trigger
           factor can bind at the same time as the signal
           recognition particle (SRP), but is excluded by the SRP
           receptor (FtsY). The central domain of trigger factor
           has peptidyl-prolyl cis/trans isomerase activity. This
           protein is found in a single copy in virtually every
           bacterial genome [Protein fate, Protein folding and
           stabilization].
          Length = 408

 Score = 32.5 bits (75), Expect = 1.8
 Identities = 21/102 (20%), Positives = 49/102 (48%), Gaps = 16/102 (15%)

Query: 850 LVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 909
            +  L+ +  D+E+ L ++ EE++ E+ ++     KL                 +I +E+
Sbjct: 307 ALQQLQQQGIDLEEYLKDTEEELREEFREEAERRVKLG----------------LILEEI 350

Query: 910 AKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKD 951
           AK   +E   +E++ EI+   ++Y E+ +++ +  K   L +
Sbjct: 351 AKKEKIEVSEEEVEAEIEELAQQYGEDPEEVKKYYKKNGLLE 392



 Score = 32.5 bits (75), Expect = 1.8
 Identities = 21/100 (21%), Positives = 48/100 (48%), Gaps = 16/100 (16%)

Query: 35  SDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAK 94
             L+ +  D+E+ L ++ EE++ E+ ++     KL                 +I +E+AK
Sbjct: 309 QQLQQQGIDLEEYLKDTEEELREEFREEAERRVKLG----------------LILEEIAK 352

Query: 95  NALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKD 134
              +E   +E++ EI+   ++Y E+ +++ +  K   L +
Sbjct: 353 KEKIEVSEEEVEAEIEELAQQYGEDPEEVKKYYKKNGLLE 392


>gnl|CDD|214369 CHL00118, atpG, ATP synthase CF0 B' subunit; Validated.
          Length = 156

 Score = 31.5 bits (72), Expect = 2.0
 Identities = 19/104 (18%), Positives = 47/104 (45%), Gaps = 19/104 (18%)

Query: 47  KLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQ 106
           ++L    E+  +YE++++  +K  +L       ++ E K ++E EL              
Sbjct: 67  EILAKANELTKQYEQELSKARKEAQL---EITQSQKEAKEIVENEL-------------- 109

Query: 107 EEIQAQLEKYKENKKQILEMEKNIQLK--DDEINNLNENLTSSL 148
           ++ Q  ++       + LE +K   LK  +++++ L++ +   L
Sbjct: 110 KQAQKYIDSLLNEATKQLEAQKEKALKSLEEQVDTLSDQIEEKL 153



 Score = 31.5 bits (72), Expect = 2.0
 Identities = 19/104 (18%), Positives = 47/104 (45%), Gaps = 19/104 (18%)

Query: 864 KLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQ 923
           ++L    E+  +YE++++  +K  +L       ++ E K ++E EL              
Sbjct: 67  EILAKANELTKQYEQELSKARKEAQL---EITQSQKEAKEIVENEL-------------- 109

Query: 924 EEIQAQLEKYKENKKQILEMEKNIQLK--DDEINNLNENLTSSL 965
           ++ Q  ++       + LE +K   LK  +++++ L++ +   L
Sbjct: 110 KQAQKYIDSLLNEATKQLEAQKEKALKSLEEQVDTLSDQIEEKL 153


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 32.3 bits (74), Expect = 2.0
 Identities = 26/90 (28%), Positives = 45/90 (50%), Gaps = 14/90 (15%)

Query: 50  ESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEI 109
           E+ EE  ++ ++ +T  +K  E    +A AAE           A+  LL  K KE ++ +
Sbjct: 186 EAVEEAILQTDQALTAKEKAIEAERAKAEAAE-----------AEQELLREKQKEEEQMM 234

Query: 110 QAQLEKYKENKKQI---LEMEKNIQLKDDE 136
           +AQ   Y+E+ KQ+   +E E+   L + E
Sbjct: 235 EAQERSYQEHVKQLIEKMEAEREKLLAEQE 264



 Score = 32.3 bits (74), Expect = 2.0
 Identities = 26/90 (28%), Positives = 45/90 (50%), Gaps = 14/90 (15%)

Query: 867 ESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEI 926
           E+ EE  ++ ++ +T  +K  E    +A AAE           A+  LL  K KE ++ +
Sbjct: 186 EAVEEAILQTDQALTAKEKAIEAERAKAEAAE-----------AEQELLREKQKEEEQMM 234

Query: 927 QAQLEKYKENKKQI---LEMEKNIQLKDDE 953
           +AQ   Y+E+ KQ+   +E E+   L + E
Sbjct: 235 EAQERSYQEHVKQLIEKMEAEREKLLAEQE 264


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 32.4 bits (74), Expect = 2.2
 Identities = 23/111 (20%), Positives = 52/111 (46%), Gaps = 5/111 (4%)

Query: 34  VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTM-IEKEL 92
           + D +  LK  + K +E+ EEI+   +     +KK ++    +    E + +   I+K+ 
Sbjct: 148 LPDPRALLKGEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKD 207

Query: 93  AKNAL----LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINN 139
            KN L    ++ +  ++++E +A  +K K  K  +   +K ++    E+  
Sbjct: 208 IKNPLELQKIKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQLKADLAELKK 258



 Score = 32.4 bits (74), Expect = 2.2
 Identities = 23/111 (20%), Positives = 52/111 (46%), Gaps = 5/111 (4%)

Query: 851 VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTM-IEKEL 909
           + D +  LK  + K +E+ EEI+   +     +KK ++    +    E + +   I+K+ 
Sbjct: 148 LPDPRALLKGEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKD 207

Query: 910 AKNAL----LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINN 956
            KN L    ++ +  ++++E +A  +K K  K  +   +K ++    E+  
Sbjct: 208 IKNPLELQKIKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQLKADLAELKK 258


>gnl|CDD|218176 pfam04614, Pex19, Pex19 protein family. 
          Length = 244

 Score = 31.9 bits (73), Expect = 2.2
 Identities = 34/162 (20%), Positives = 61/162 (37%), Gaps = 25/162 (15%)

Query: 849  PLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKE 908
             L  DL+  + ++  +  E   E K ++EK +       EL E+ AAA++   K      
Sbjct: 8    ELAKDLQKGMAELLGEDGEEDPETKAQFEKLLK------ELGEEEAAASKEGEK------ 55

Query: 909  LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAEN 968
             +  A         ++ I   LE+ KE+         N      E    +++L + L + 
Sbjct: 56   PSAAAGESGTDPSFKDVISETLERLKESGD-------NADAAASEDKKNSDDLLAQLLKQ 108

Query: 969  HILNSQMTLVNNLFSNMLINPMINL---DML---IKLIQDNY 1004
               N      +   S +L   M  L   ++L   +K + D Y
Sbjct: 109  LGGNGDGGGGDGDMSKLLQGMMEQLSSKEVLYEPLKELNDKY 150



 Score = 31.2 bits (71), Expect = 3.4
 Identities = 33/158 (20%), Positives = 60/158 (37%), Gaps = 25/158 (15%)

Query: 36  DLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKN 95
           DL+  + ++  +  E   E K ++EK +       EL E+ AAA++   K       +  
Sbjct: 12  DLQKGMAELLGEDGEEDPETKAQFEKLLK------ELGEEEAAASKEGEK------PSAA 59

Query: 96  ALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILN 155
           A         ++ I   LE+ KE+         N      E    +++L + L +    N
Sbjct: 60  AGESGTDPSFKDVISETLERLKESGD-------NADAAASEDKKNSDDLLAQLLKQLGGN 112

Query: 156 SQMTLVNNLFSNMLINPMINL---DML---IKLIQDNY 187
                 +   S +L   M  L   ++L   +K + D Y
Sbjct: 113 GDGGGGDGDMSKLLQGMMEQLSSKEVLYEPLKELNDKY 150


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 32.3 bits (74), Expect = 2.3
 Identities = 13/67 (19%), Positives = 25/67 (37%), Gaps = 4/67 (5%)

Query: 67  KKLRELYEQRAAAAELEH----KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQ 122
           K+LRE   + A               E+E A+   L+ ++ ++  EI    E   E + +
Sbjct: 2   KELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61

Query: 123 ILEMEKN 129
                + 
Sbjct: 62  PAASGEG 68



 Score = 32.3 bits (74), Expect = 2.3
 Identities = 13/67 (19%), Positives = 25/67 (37%), Gaps = 4/67 (5%)

Query: 884 KKLRELYEQRAAAAELEH----KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQ 939
           K+LRE   + A               E+E A+   L+ ++ ++  EI    E   E + +
Sbjct: 2   KELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61

Query: 940 ILEMEKN 946
                + 
Sbjct: 62  PAASGEG 68


>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 725

 Score = 32.5 bits (74), Expect = 2.3
 Identities = 43/220 (19%), Positives = 82/220 (37%), Gaps = 8/220 (3%)

Query: 102 LKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNS--QMT 159
           L   +  I      Y ++ K++L +   I+ +  +   L  +L + + E  I N     +
Sbjct: 242 LVSNENLINLLNLLYSKDIKEVLNILNQIKEQGIDPELLLISLINLVKEFIIFNKTKDNS 301

Query: 160 LVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLV-DLEKKIDQETL 218
           L+       L    I+ D   K I+  + L+ DL  +   NDT+ +L+  L    + E  
Sbjct: 302 LLEYYSEEDLEKLKIDDDFAYKFIEILFDLLKDLKISDNPNDTLEILIIKLLALSELEEE 361

Query: 219 AIQNDKQLKQLENYETPMSSAMVEAVESNQTSTKAVESN-----QTSTSKDSANIQLDHA 273
                K  K  EN    +     +    N    K+   N     +T  +K+  +  ++  
Sbjct: 362 DENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKNSSFINKT 421

Query: 274 ETNTNEESPADQDETNSTTTLDKSHIQKEITRVISNSLIS 313
           E         D+    +T  ++  + Q+     I N +IS
Sbjct: 422 ENILTNSPLKDELLEKTTEIINIENPQEFEFGQIGNDIIS 461


>gnl|CDD|234272 TIGR03592, yidC_oxa1_cterm, membrane protein insertase, YidC/Oxa1
           family, C-terminal domain.  This model describes
           full-length from some species, and the C-terminal region
           only from other species, of the YidC/Oxa1 family of
           proteins. This domain appears to be univeral among
           bacteria (although absent from Archaea). The
           well-characterized YidC protein from Escherichia coli
           and its close homologs contain a large N-terminal
           periplasmic domain in addition to the region modeled
           here [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 181

 Score = 31.3 bits (72), Expect = 2.4
 Identities = 10/28 (35%), Positives = 18/28 (64%), Gaps = 1/28 (3%)

Query: 100 AKLKEIQEEIQAQLEKYKENK-KQILEM 126
            K++E+Q +++   EKYK++  K   EM
Sbjct: 29  RKMQELQPKLKEIQEKYKDDPQKLQQEM 56



 Score = 31.3 bits (72), Expect = 2.4
 Identities = 10/28 (35%), Positives = 18/28 (64%), Gaps = 1/28 (3%)

Query: 917 AKLKEIQEEIQAQLEKYKENK-KQILEM 943
            K++E+Q +++   EKYK++  K   EM
Sbjct: 29  RKMQELQPKLKEIQEKYKDDPQKLQQEM 56


>gnl|CDD|225283 COG2427, COG2427, Uncharacterized conserved protein [Function
           unknown].
          Length = 148

 Score = 31.2 bits (71), Expect = 2.4
 Identities = 26/102 (25%), Positives = 43/102 (42%), Gaps = 6/102 (5%)

Query: 96  ALLEAKLKEIQEEIQAQLEKYKENKKQILE-MEKNIQLKDDEINNLNENLTSSLAENHIL 154
             +E   KE  EE+   LEK  ENK  I E M+    LK+  +  L+  L      +   
Sbjct: 4   KTIEQLKKEKLEEL---LEKLAENKDAIKELMDLLKLLKESGL--LDALLGLRAKADIAK 58

Query: 155 NSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINN 196
             +  L   L  N+L N +I L +L  +  +    +++ +  
Sbjct: 59  KLKDELAKELIENLLNNMLIMLGLLSLIDSERLSKLVENLIK 100



 Score = 31.2 bits (71), Expect = 2.4
 Identities = 26/102 (25%), Positives = 43/102 (42%), Gaps = 6/102 (5%)

Query: 913  ALLEAKLKEIQEEIQAQLEKYKENKKQILE-MEKNIQLKDDEINNLNENLTSSLAENHIL 971
              +E   KE  EE+   LEK  ENK  I E M+    LK+  +  L+  L      +   
Sbjct: 4    KTIEQLKKEKLEEL---LEKLAENKDAIKELMDLLKLLKESGL--LDALLGLRAKADIAK 58

Query: 972  NSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINN 1013
              +  L   L  N+L N +I L +L  +  +    +++ +  
Sbjct: 59   KLKDELAKELIENLLNNMLIMLGLLSLIDSERLSKLVENLIK 100


>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
           envelope biogenesis, outer membrane].
          Length = 835

 Score = 32.4 bits (74), Expect = 2.5
 Identities = 21/87 (24%), Positives = 33/87 (37%), Gaps = 10/87 (11%)

Query: 74  EQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQA---QLEKYKENKKQILEMEKNI 130
            + AA        +I++ELA N  L   L +  E + A      +      Q+L+  + I
Sbjct: 46  AEPAAEEAELQAELIQQELAINDQLSQALNQQTERLNALASDDRQLANLLLQLLQSSRTI 105

Query: 131 QLKDDEINNLNENLTSSLAENHILNSQ 157
                E   +   L  SL  + IL  Q
Sbjct: 106 ----REQIAV---LRGSLLLSRILLQQ 125



 Score = 32.4 bits (74), Expect = 2.5
 Identities = 21/87 (24%), Positives = 33/87 (37%), Gaps = 10/87 (11%)

Query: 891 EQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQA---QLEKYKENKKQILEMEKNI 947
            + AA        +I++ELA N  L   L +  E + A      +      Q+L+  + I
Sbjct: 46  AEPAAEEAELQAELIQQELAINDQLSQALNQQTERLNALASDDRQLANLLLQLLQSSRTI 105

Query: 948 QLKDDEINNLNENLTSSLAENHILNSQ 974
                E   +   L  SL  + IL  Q
Sbjct: 106 ----REQIAV---LRGSLLLSRILLQQ 125


>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869).  This
           bacterial family of proteins has no known function.
          Length = 131

 Score = 30.6 bits (70), Expect = 2.6
 Identities = 16/70 (22%), Positives = 33/70 (47%), Gaps = 12/70 (17%)

Query: 68  KLRELYEQRAAAAELEHKTMIEKELA------KNALLEAKLK------EIQEEIQAQLEK 115
            + E +++   A   E  T +E+EL       + A+ E + +      +IQ++++ +  K
Sbjct: 12  IVTERWKEELQAELQEQLTQLEQELQQLEFQGQKAIKEIRKQSAQQIEQIQQQVEQERAK 71

Query: 116 YKENKKQILE 125
             E K Q+L 
Sbjct: 72  RLEQKNQLLF 81



 Score = 30.6 bits (70), Expect = 2.6
 Identities = 16/70 (22%), Positives = 33/70 (47%), Gaps = 12/70 (17%)

Query: 885 KLRELYEQRAAAAELEHKTMIEKELA------KNALLEAKLK------EIQEEIQAQLEK 932
            + E +++   A   E  T +E+EL       + A+ E + +      +IQ++++ +  K
Sbjct: 12  IVTERWKEELQAELQEQLTQLEQELQQLEFQGQKAIKEIRKQSAQQIEQIQQQVEQERAK 71

Query: 933 YKENKKQILE 942
             E K Q+L 
Sbjct: 72  RLEQKNQLLF 81


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 32.1 bits (73), Expect = 2.7
 Identities = 30/90 (33%), Positives = 38/90 (42%), Gaps = 12/90 (13%)

Query: 40  RLKDVEKKLLESPEEIK--IEYEKQMTNYKKLRELYEQRAAAAE-----LEHKTMIEKEL 92
           RLK +EK+ L + E+ K   E  KQ    +K  E  E  A AA       E +       
Sbjct: 102 RLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAE--EAAAKAAAAAKAKAEAEAKRAAAA 159

Query: 93  AKNALLEAKLKEIQEEIQAQLEKYKENKKQ 122
           AK A  EAK K   E   A  +   E KK+
Sbjct: 160 AKKAAAEAKKKAEAE---AAKKAAAEAKKK 186



 Score = 32.1 bits (73), Expect = 2.7
 Identities = 30/90 (33%), Positives = 38/90 (42%), Gaps = 12/90 (13%)

Query: 857 RLKDVEKKLLESPEEIK--IEYEKQMTNYKKLRELYEQRAAAAE-----LEHKTMIEKEL 909
           RLK +EK+ L + E+ K   E  KQ    +K  E  E  A AA       E +       
Sbjct: 102 RLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAE--EAAAKAAAAAKAKAEAEAKRAAAA 159

Query: 910 AKNALLEAKLKEIQEEIQAQLEKYKENKKQ 939
           AK A  EAK K   E   A  +   E KK+
Sbjct: 160 AKKAAAEAKKKAEAE---AAKKAAAEAKKK 186


>gnl|CDD|223407 COG0330, HflC, Membrane protease subunits, stomatin/prohibitin
            homologs [Posttranslational modification, protein
            turnover, chaperones].
          Length = 291

 Score = 31.6 bits (72), Expect = 2.9
 Identities = 28/167 (16%), Positives = 57/167 (34%), Gaps = 19/167 (11%)

Query: 836  ESNILATTRSVILPLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAA 895
             + I A  R ++         ++ DVE K ++ PEE++   EKQM           +R  
Sbjct: 139  RAEINAKIREILDEAADPWGIKVVDVEIKDIDPPEEVQAAMEKQMA---------AERDK 189

Query: 896  AAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEIN 955
             AE+       +     A  EA+   I  E +A+ E     +    ++      +     
Sbjct: 190  RAEILEAEGEAQAAILRAEGEAEAAIILAEAEAEAEVIARAEADAAKIIAAALREAPAAP 249

Query: 956  NLN-ENLTSSLAENHILNSQMTLVNNLFSNMLINPMINLDMLIKLIQ 1001
                +           L   + +     S +++ P      ++ L++
Sbjct: 250  QALAQRY---------LEELLEIALAGNSKVVVVPNSAGGAILGLLE 287



 Score = 30.5 bits (69), Expect = 7.9
 Identities = 25/152 (16%), Positives = 51/152 (33%), Gaps = 19/152 (12%)

Query: 34  VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELA 93
                 ++ DVE K ++ PEE++   EKQM           +R   AE+       +   
Sbjct: 154 ADPWGIKVVDVEIKDIDPPEEVQAAMEKQMA---------AERDKRAEILEAEGEAQAAI 204

Query: 94  KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLN-ENLTSSLAENH 152
             A  EA+   I  E +A+ E     +    ++      +         +          
Sbjct: 205 LRAEGEAEAAIILAEAEAEAEVIARAEADAAKIIAAALREAPAAPQALAQRY-------- 256

Query: 153 ILNSQMTLVNNLFSNMLINPMINLDMLIKLIQ 184
            L   + +     S +++ P      ++ L++
Sbjct: 257 -LEELLEIALAGNSKVVVVPNSAGGAILGLLE 287


>gnl|CDD|237440 PRK13588, PRK13588, flagellin B; Provisional.
          Length = 514

 Score = 32.1 bits (73), Expect = 3.0
 Identities = 22/85 (25%), Positives = 46/85 (54%), Gaps = 7/85 (8%)

Query: 1288 DGGRSTSES-GISTDTEDDNVEIVETNDESEDDKSEVVET------QSEHDVPHTNSVMS 1340
            D  RS S + G +    +D + +V+T D++ D++ ++++T      Q+  D     S  +
Sbjct: 52   DSLRSQSANLGQAIRNANDAIGMVQTADKAMDEQIKILDTIKTKAVQAAQDGQTLESRRA 111

Query: 1341 IEEDDQDEVEELESETNVSAINNEQ 1365
            ++ D Q  +EEL++  N ++ N +Q
Sbjct: 112  LQSDIQRLLEELDNIANTTSFNGQQ 136


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
           SseC is a secreted protein that forms a complex together
           with SecB and SecD on the surface of Salmonella. All
           these proteins are secreted by the type III secretion
           system. Many mucosal pathogens use type III secretion
           systems for the injection of effector proteins into
           target cells. SecB, SseC and SecD are inserted into the
           target cell membrane. where they form a small pore or
           translocon. In addition to SseC, this family includes
           the bacterial secreted proteins PopB, PepB, YopB and
           EspD which are thought to be directly involved in pore
           formation, and type III secretion system translocon.
          Length = 303

 Score = 31.7 bits (72), Expect = 3.0
 Identities = 15/51 (29%), Positives = 28/51 (54%), Gaps = 7/51 (13%)

Query: 71  ELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKK 121
           +L E++A +   + +   +K+       E K +E QE+I+  +EK +E KK
Sbjct: 11  KLAEKQAKSKLQQLERARDKQ-------EKKAEEYQEQIKKAIEKAEEAKK 54



 Score = 31.7 bits (72), Expect = 3.0
 Identities = 15/51 (29%), Positives = 28/51 (54%), Gaps = 7/51 (13%)

Query: 888 ELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKK 938
           +L E++A +   + +   +K+       E K +E QE+I+  +EK +E KK
Sbjct: 11  KLAEKQAKSKLQQLERARDKQ-------EKKAEEYQEQIKKAIEKAEEAKK 54



 Score = 30.9 bits (70), Expect = 6.1
 Identities = 11/46 (23%), Positives = 25/46 (54%), Gaps = 2/46 (4%)

Query: 86  TMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQ 131
            +I K   K A  ++KL++++     Q +K +E ++QI +  +  +
Sbjct: 7   ELISKLAEKQA--KSKLQQLERARDKQEKKAEEYQEQIKKAIEKAE 50



 Score = 30.9 bits (70), Expect = 6.1
 Identities = 11/46 (23%), Positives = 25/46 (54%), Gaps = 2/46 (4%)

Query: 903 TMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQ 948
            +I K   K A  ++KL++++     Q +K +E ++QI +  +  +
Sbjct: 7   ELISKLAEKQA--KSKLQQLERARDKQEKKAEEYQEQIKKAIEKAE 50


>gnl|CDD|216500 pfam01432, Peptidase_M3, Peptidase family M3.  This is the Thimet
            oligopeptidase family, large family of mammalian and
            bacterial oligopeptidases that cleave medium sized
            peptides. The group also contains mitochondrial
            intermediate peptidase which is encoded by nuclear DNA
            but functions within the mitochondria to remove the
            leader sequence.
          Length = 450

 Score = 32.0 bits (73), Expect = 3.2
 Identities = 29/120 (24%), Positives = 53/120 (44%), Gaps = 11/120 (9%)

Query: 1171 LILEKNHLTNQLAKLKTLNNHLEIRLNHSEQRLNTVTTELKKTWNVVNKLKLQHKQLYTH 1230
            L+ E   L  +LAKL    ++ E  L   E ++  +   +    + + +L  + + L   
Sbjct: 34   LLEELLKLRAELAKLLGYPSYAEASL---EDKMAKIPETVY---DFLEELVNKLRPLLHR 87

Query: 1231 EKILKYELKHKRNIINDLKI-DLEYVREKWDLVKEKNEQNELDYKVLRKEFALRRLQYDG 1289
            E  L  +LK K   + +L+  D+ Y  EK    + +   + LD + LR  F L ++   G
Sbjct: 88   ELELLKKLKKKELGLEELQPWDVAYYSEK----QREELYDPLDQEELRPYFPLEQVLEKG 143


>gnl|CDD|233137 TIGR00811, sit, silicon transporter.  Marine diatoms such as
           Cylindrotheca fusiformis encode at least six silicon
           transport protein homologues which exhibit similar size
           and topology. One characterized member of the family
           (Sit1) functions in the energy-dependent uptake of
           either Silicic acid [Si(OH)4] or Silicate [Si(OH)3O-] by
           a Na+ symport mechanism. The system is found in marine
           diatoms which make their "glass houses" out of silicon
           [Transport and binding proteins, Other].
          Length = 545

 Score = 31.9 bits (72), Expect = 3.3
 Identities = 17/60 (28%), Positives = 26/60 (43%), Gaps = 3/60 (5%)

Query: 49  LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEE 108
           L + E   +  E+ + N ++LRE+     +  E   KT  E  LA       KL   +EE
Sbjct: 474 LSALETFSVRRERMLKNIRELREMINNAISDEE---KTTFEAALAIEVKALDKLNAEEEE 530



 Score = 31.9 bits (72), Expect = 3.3
 Identities = 17/60 (28%), Positives = 26/60 (43%), Gaps = 3/60 (5%)

Query: 866 LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEE 925
           L + E   +  E+ + N ++LRE+     +  E   KT  E  LA       KL   +EE
Sbjct: 474 LSALETFSVRRERMLKNIRELREMINNAISDEE---KTTFEAALAIEVKALDKLNAEEEE 530


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 32.1 bits (72), Expect = 3.4
 Identities = 30/203 (14%), Positives = 61/203 (30%), Gaps = 14/203 (6%)

Query: 483 TEETDESRPFESAECSRTSQPEVENHISNTDTIETAYSEMSEPIEELHTERLSTETLQRE 542
           T  T  + P +++  SRT+        + + T        + P  +  ++  +  +    
Sbjct: 475 TGTTSSTLPEDTSPTSRTTSA---TPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPI 531

Query: 543 PITSVPTATSNTTLPSEPSVSRTRTPEEILEARNARLKRLEDQCKNLFH-----KMSLTS 597
            I    TATS  T  +    + +    E     N     +      L       +    S
Sbjct: 532 VIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGS 591

Query: 598 QRSDTLSERLEELHDYYGTSNERHSP---RRHREGSQDSDDTRLRSPENEPLSGVSANVP 654
             +          H    +++   +P     H  G ++  +     P    +S +S    
Sbjct: 592 SPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPG 651

Query: 655 D---SQENSQESDDTRRRSAETN 674
               SQ +   +  T R   E +
Sbjct: 652 PGTTSQVSGPGNSSTSRYPGEVH 674


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 31.9 bits (73), Expect = 3.5
 Identities = 31/133 (23%), Positives = 56/133 (42%), Gaps = 41/133 (30%)

Query: 31  VFKVSDLKTRLKDV------EKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEH 84
           V +  +++ RLK        E +LL+   +I  + E++M   K  RE Y           
Sbjct: 181 VLETVNIEKRLKKALELLKKELELLKLQNKITKKVEEKME--KTQREYY----------- 227

Query: 85  KTMIEKELAKNALLEAKLKEIQEEI------QAQLEKYKENKKQILEMEKNIQLK-DDEI 137
                        L  +LK I++E+      + +LEK KE K + L++ + ++   + E+
Sbjct: 228 -------------LREQLKAIKKELGIEKDDKDELEKLKE-KLEELKLPEEVKKVIEKEL 273

Query: 138 NNLNENLTSSLAE 150
             L   L  S +E
Sbjct: 274 TKL-SLLEPSSSE 285



 Score = 30.7 bits (70), Expect = 7.4
 Identities = 30/128 (23%), Positives = 54/128 (42%), Gaps = 41/128 (32%)

Query: 853 DLKTRLKDV------EKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIE 906
           +++ RLK        E +LL+   +I  + E++M   K  RE Y                
Sbjct: 186 NIEKRLKKALELLKKELELLKLQNKITKKVEEKME--KTQREYY---------------- 227

Query: 907 KELAKNALLEAKLKEIQEEI------QAQLEKYKENKKQILEMEKNIQLK-DDEINNLNE 959
                   L  +LK I++E+      + +LEK KE K + L++ + ++   + E+  L  
Sbjct: 228 --------LREQLKAIKKELGIEKDDKDELEKLKE-KLEELKLPEEVKKVIEKELTKL-S 277

Query: 960 NLTSSLAE 967
            L  S +E
Sbjct: 278 LLEPSSSE 285


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 30.7 bits (70), Expect = 3.6
 Identities = 15/87 (17%), Positives = 40/87 (45%), Gaps = 10/87 (11%)

Query: 69  LRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENK-----KQI 123
           ++++  +  A    +    +EKE  K   L+A+L++ ++E+Q + +K ++       +  
Sbjct: 23  VQKVLSESPAGKAAQ--KQLEKEFKK---LQAELQKKEKELQKEEQKLQKQAATLSEEAR 77

Query: 124 LEMEKNIQLKDDEINNLNENLTSSLAE 150
              ++ +Q K  E+    +     L +
Sbjct: 78  KAKQQELQQKQQELQQKQQAAQQELQQ 104



 Score = 30.7 bits (70), Expect = 3.6
 Identities = 15/87 (17%), Positives = 40/87 (45%), Gaps = 10/87 (11%)

Query: 886 LRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENK-----KQI 940
           ++++  +  A    +    +EKE  K   L+A+L++ ++E+Q + +K ++       +  
Sbjct: 23  VQKVLSESPAGKAAQ--KQLEKEFKK---LQAELQKKEKELQKEEQKLQKQAATLSEEAR 77

Query: 941 LEMEKNIQLKDDEINNLNENLTSSLAE 967
              ++ +Q K  E+    +     L +
Sbjct: 78  KAKQQELQQKQQELQQKQQAAQQELQQ 104



 Score = 29.5 bits (67), Expect = 8.7
 Identities = 18/85 (21%), Positives = 46/85 (54%), Gaps = 4/85 (4%)

Query: 42  KDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAK 101
           K  +K+L +  ++++ E +K+    +K  +  +++AA    E +   ++EL +    + +
Sbjct: 34  KAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQ---KQQE 90

Query: 102 LKEIQEEIQAQL-EKYKENKKQILE 125
           L++ Q+  Q +L +K +E  + I +
Sbjct: 91  LQQKQQAAQQELQQKQQELLQPIYD 115



 Score = 29.5 bits (67), Expect = 8.7
 Identities = 18/85 (21%), Positives = 46/85 (54%), Gaps = 4/85 (4%)

Query: 859 KDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAK 918
           K  +K+L +  ++++ E +K+    +K  +  +++AA    E +   ++EL +    + +
Sbjct: 34  KAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQ---KQQE 90

Query: 919 LKEIQEEIQAQL-EKYKENKKQILE 942
           L++ Q+  Q +L +K +E  + I +
Sbjct: 91  LQQKQQAAQQELQQKQQELLQPIYD 115


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
            cilia and flagella are specialised organelles found at
            the periphery of cells of diverse organisms.
            Intra-flagellar transport (IFT) is required for the
            assembly and maintenance of eukaryotic cilia and
            flagella, and consists of the bidirectional movement of
            large protein particles between the base and the distal
            tip of the organelle. IFT particles contain multiple
            copies of two distinct protein complexes, A and B, which
            contain at least 6 and 11 protein subunits. IFT57 is part
            of complex B but is not, however, required for the core
            subunits to stay associated. This protein is known as
            Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 31.6 bits (72), Expect = 3.7
 Identities = 21/93 (22%), Positives = 35/93 (37%), Gaps = 10/93 (10%)

Query: 1277 RKEFALRRLQYDGGRSTSESGISTDTEDDNVEIVETNDESEDDKSEVVETQSEHDVPHTN 1336
            +K F+ +R +Y       E  +  D  +  +E VE   E E+   +   TQ         
Sbjct: 116  KKGFSFKRPKY-PNEEDEEENVDEDDAEIILEEVEEEVEIEEV-DDDEGTQETKYKRGDT 173

Query: 1337 SVMSIEEDDQDEVEELESETNVSAINNEQEVER 1369
            S+    +D          E+ + A   + EVER
Sbjct: 174  SLTPQAKDVL--------ESLIDAAEWKLEVER 198


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
            consist of three concentric proteinaceous capsid layers.
            The innermost capsid (core) is made of VP2. The genomic
            RNA and the two minor proteins VP1 and VP3 are
            encapsidated within this layer. The N-terminus of
            rotavirus VP2 is necessary for the encapsidation of VP1
            and VP3.
          Length = 887

 Score = 31.8 bits (72), Expect = 3.8
 Identities = 22/143 (15%), Positives = 58/143 (40%), Gaps = 16/143 (11%)

Query: 1291 RSTSESGISTDTEDDNVE---IVETNDESEDDKSEVVETQSEHDVPHTNSVMSIEEDDQ- 1346
             + + +    + +D+  +    +E  ++  D K EVV    +  V   +S  +++  D+ 
Sbjct: 9    ANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEV 68

Query: 1347 DEVEELESETNVSAINNEQEVERTEVTDNTSAVIPSRETIENEAVPVSVSPQEDIVDTIT 1406
             +  + ES+  +  +  ++E ++    +     IP            S  P+E I+  + 
Sbjct: 69   KKSTKEESKQLLEVLKTKEEHQKEIQYEILQKTIP------------SFEPKESILKKLE 116

Query: 1407 TSRTDHIESHQEINSSRETKEIP 1429
              + +  +   ++    E K++P
Sbjct: 117  DIKPEQAKKQTKLFRLFEPKQLP 139


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 30.3 bits (69), Expect = 4.3
 Identities = 26/98 (26%), Positives = 42/98 (42%), Gaps = 11/98 (11%)

Query: 849 PLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKE 908
           P++  L  R   +   L E+ E +K E +        L   YEQ    A  +   +IE+ 
Sbjct: 30  PILKALDERQAKIADDLAEA-ERLKEEAQA-------LLAEYEQELEEAREQASEIIEQ- 80

Query: 909 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKN 946
                  E   +EI+ E + +LE+ KE  +  +E EK 
Sbjct: 81  --AKKEAEQIAEEIKAEAEEELERIKEAAEAEIEAEKE 116



 Score = 30.0 bits (68), Expect = 5.6
 Identities = 18/63 (28%), Positives = 29/63 (46%), Gaps = 3/63 (4%)

Query: 67  KKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEM 126
           + L   YEQ    A  +   +IE+        E   +EI+ E + +LE+ KE  +  +E 
Sbjct: 57  QALLAEYEQELEEAREQASEIIEQ---AKKEAEQIAEEIKAEAEEELERIKEAAEAEIEA 113

Query: 127 EKN 129
           EK 
Sbjct: 114 EKE 116



 Score = 30.0 bits (68), Expect = 6.1
 Identities = 19/82 (23%), Positives = 40/82 (48%), Gaps = 5/82 (6%)

Query: 38  KTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAK-NA 96
              L + E+   E+ + +  EYE+++   ++ RE   +    A+ E + + E+  A+   
Sbjct: 43  ADDLAEAERLKEEA-QALLAEYEQEL---EEAREQASEIIEQAKKEAEQIAEEIKAEAEE 98

Query: 97  LLEAKLKEIQEEIQAQLEKYKE 118
            LE   +  + EI+A+ E+  E
Sbjct: 99  ELERIKEAAEAEIEAEKERALE 120



 Score = 30.0 bits (68), Expect = 6.1
 Identities = 19/82 (23%), Positives = 40/82 (48%), Gaps = 5/82 (6%)

Query: 855 KTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAK-NA 913
              L + E+   E+ + +  EYE+++   ++ RE   +    A+ E + + E+  A+   
Sbjct: 43  ADDLAEAERLKEEA-QALLAEYEQEL---EEAREQASEIIEQAKKEAEQIAEEIKAEAEE 98

Query: 914 LLEAKLKEIQEEIQAQLEKYKE 935
            LE   +  + EI+A+ E+  E
Sbjct: 99  ELERIKEAAEAEIEAEKERALE 120



 Score = 30.0 bits (68), Expect = 7.0
 Identities = 13/60 (21%), Positives = 30/60 (50%), Gaps = 8/60 (13%)

Query: 66  YKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILE 125
           +K + +  ++R A         I  +LA+   L+ + + +  E + +LE+ +E   +I+E
Sbjct: 28  WKPILKALDERQA--------KIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIE 79



 Score = 30.0 bits (68), Expect = 7.0
 Identities = 13/60 (21%), Positives = 30/60 (50%), Gaps = 8/60 (13%)

Query: 883 YKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILE 942
           +K + +  ++R A         I  +LA+   L+ + + +  E + +LE+ +E   +I+E
Sbjct: 28  WKPILKALDERQA--------KIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIE 79


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 31.2 bits (71), Expect = 4.4
 Identities = 22/91 (24%), Positives = 42/91 (46%), Gaps = 9/91 (9%)

Query: 59  YEKQMTNYK-KLRELYEQRAAAAELEHKTMI---EKELAKNALLEAKLKEIQEEIQAQLE 114
           +EK M   + K++ L  Q         K MI   E        L++K +   E++ A+++
Sbjct: 275 HEKSMEKLQEKIKALKYQL-----KRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVK 329

Query: 115 KYKENKKQILEMEKNIQLKDDEINNLNENLT 145
           + K+ KK+  + +K I+  ++ I  L    T
Sbjct: 330 EKKKEKKKEEKKKKQIERLEERIEKLEVQAT 360



 Score = 31.2 bits (71), Expect = 4.4
 Identities = 22/91 (24%), Positives = 42/91 (46%), Gaps = 9/91 (9%)

Query: 876 YEKQMTNYK-KLRELYEQRAAAAELEHKTMI---EKELAKNALLEAKLKEIQEEIQAQLE 931
           +EK M   + K++ L  Q         K MI   E        L++K +   E++ A+++
Sbjct: 275 HEKSMEKLQEKIKALKYQL-----KRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVK 329

Query: 932 KYKENKKQILEMEKNIQLKDDEINNLNENLT 962
           + K+ KK+  + +K I+  ++ I  L    T
Sbjct: 330 EKKKEKKKEEKKKKQIERLEERIEKLEVQAT 360


>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 304

 Score = 31.2 bits (71), Expect = 4.5
 Identities = 12/29 (41%), Positives = 21/29 (72%), Gaps = 1/29 (3%)

Query: 101 KLKEIQEEIQAQLEKYKENK-KQILEMEK 128
           K++E+Q +I+A  +KY  ++ K  LEM+K
Sbjct: 60  KMQELQPQIKALQKKYGNDRQKMALEMQK 88



 Score = 31.2 bits (71), Expect = 4.5
 Identities = 12/29 (41%), Positives = 21/29 (72%), Gaps = 1/29 (3%)

Query: 918 KLKEIQEEIQAQLEKYKENK-KQILEMEK 945
           K++E+Q +I+A  +KY  ++ K  LEM+K
Sbjct: 60  KMQELQPQIKALQKKYGNDRQKMALEMQK 88


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 30.6 bits (69), Expect = 4.6
 Identities = 18/113 (15%), Positives = 46/113 (40%), Gaps = 1/113 (0%)

Query: 10  IVFKNFILTIGSPLTSSTSMVVFKVSDLKTRLKD-VEKKLLESPEEIKIEYEKQMTNYKK 68
           I     +          + + + ++ +L+  + + ++      P  + +  E+      K
Sbjct: 92  IEESRELTKKKGKKVKKSILEIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFK 151

Query: 69  LRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKK 121
            +E  E+     +   +   EK+L +  L E + K+  EE++ +LE+  E  +
Sbjct: 152 RKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELLEELE 204


>gnl|CDD|131798 TIGR02751, PEPCase_arch, phosphoenolpyruvate carboxylase, archaeal
            type.  This family is the archaeal-type
            phosphoenolpyruvate carboxylase, although not every host
            species is archaeal. These sequences bear little
            resemblance to the bacterial/eukaryotic type. The members
            from Sulfolobus solfataricus and Methanothermobacter
            thermautotrophicus were verified experimentally, while
            the activity is known to be present in a number of other
            archaea [Energy metabolism, Other].
          Length = 506

 Score = 31.2 bits (71), Expect = 4.6
 Identities = 25/105 (23%), Positives = 41/105 (39%), Gaps = 11/105 (10%)

Query: 1252 LEYVREK-WDLVKEKNEQNELDYKVLRK--EFALRRLQYDG-GRSTSESGISTDTEDDNV 1307
            L  + E   +++KE        Y  LR    FA R L  +          +S +TED  V
Sbjct: 397  LSELDEDDLEVIKEL-------YPNLRSDLSFAARYLNPESADSLKDIVKLSEETEDMIV 449

Query: 1308 EIVETNDESEDDKSEVVETQSEHDVPHTNSVMSIEEDDQDEVEEL 1352
              V+ +++    K  +V T  EH +       + +E+    V  L
Sbjct: 450  IDVKYDEDEPGIKYNLVYTLEEHLLYTGLIGKAFKEEVFQAVRSL 494


>gnl|CDD|218083 pfam04426, Bul1_C, Bul1 C terminus.  This family contains the C
           terminus of Saccharomyces cerevisiae Bul1. Bul1 binds
           the ubiquitin ligase Rsp5, via an N terminal PPSY motif.
           The complex containing Bul1 and Rsp5 is involved in
           intracellular trafficking of the general amino acid
           permease Gap1, degradation of Rog1 in cooperation with
           Bul2 and GSK-3, and mitochondrial inheritance. Bul1 may
           contain HEAT repeats.
          Length = 226

 Score = 30.8 bits (70), Expect = 4.7
 Identities = 31/118 (26%), Positives = 59/118 (50%), Gaps = 9/118 (7%)

Query: 38  KTRLKDVEKKLLESPEEIKIEYEKQMT-NYKKLRELY---EQRAAAAELEHKTMIEKELA 93
           K +LK+++K   +  ++IK EY+K+   N +KL ELY      +   EL+    I  +LA
Sbjct: 74  KEKLKNIKKTFKDFLKKIK-EYKKKFNENKEKLNELYNLNRTISTNRELKFTDFISTQLA 132

Query: 94  KN----ALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSS 147
            +    A L+  +  + +  + Q+   K  KK  L+ E+++ +  +   ++ E L  S
Sbjct: 133 NDVESLANLKVNVTNLNDIFKKQVVTLKWLKKGSLQYERDVNVNLEFNPDIKETLVPS 190



 Score = 30.8 bits (70), Expect = 4.7
 Identities = 31/118 (26%), Positives = 59/118 (50%), Gaps = 9/118 (7%)

Query: 855 KTRLKDVEKKLLESPEEIKIEYEKQMT-NYKKLRELY---EQRAAAAELEHKTMIEKELA 910
           K +LK+++K   +  ++IK EY+K+   N +KL ELY      +   EL+    I  +LA
Sbjct: 74  KEKLKNIKKTFKDFLKKIK-EYKKKFNENKEKLNELYNLNRTISTNRELKFTDFISTQLA 132

Query: 911 KN----ALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSS 964
            +    A L+  +  + +  + Q+   K  KK  L+ E+++ +  +   ++ E L  S
Sbjct: 133 NDVESLANLKVNVTNLNDIFKKQVVTLKWLKKGSLQYERDVNVNLEFNPDIKETLVPS 190


>gnl|CDD|220767 pfam10459, Peptidase_S46, Peptidase S46.  Dipeptidyl-peptidase 7
           (DPP-7) is the best characterized member of this family.
           It is a serine peptidase that is located on the cell
           surface and is predicted to have two N-terminal
           transmembrane domains.
          Length = 696

 Score = 31.4 bits (72), Expect = 4.9
 Identities = 25/109 (22%), Positives = 43/109 (39%), Gaps = 10/109 (9%)

Query: 840 LATTRSVILPLVSD-LKTRLKDVEKKLLESPEEIKIEYE---KQMTNY-KKLRELYEQRA 894
           +   R    P   + L  R+  +E  +L+ P EI+I+Y      + NY K    + E   
Sbjct: 261 VEFRRDWAYPARIERLGERIALLEAAMLKDP-EIRIKYASKLAGLANYWKNSIGMLEGLK 319

Query: 895 AAAELEHKTMIEKELA----KNALLEAKLKEIQEEIQAQLEKYKENKKQ 939
               L  K   E  L     K+    AK  +  +E+ A   + +E  ++
Sbjct: 320 DLDLLARKQAREAALRAWVKKDPARGAKYGDALDELAALYAERRELARR 368


>gnl|CDD|172503 PRK14002, PRK14002, potassium-transporting ATPase subunit C;
           Provisional.
          Length = 186

 Score = 30.5 bits (69), Expect = 4.9
 Identities = 23/74 (31%), Positives = 30/74 (40%), Gaps = 6/74 (8%)

Query: 746 AVVQDRIDTRLKSSPGINECTSAVSAVSEESSGLRSVDAIRPVLNHDEMTR----RNVEA 801
           A VQ RIDT L   P ++        V+   SGL     I P   + ++ R    R +  
Sbjct: 92  AEVQARIDTFLVHHPYLSRKDIPAEMVTASGSGLDP--NISPQAAYVQVKRVAKARGMSE 149

Query: 802 ESRADAVVQGQEKP 815
           E     V Q  EKP
Sbjct: 150 EKVKQLVDQHVEKP 163


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 31.2 bits (72), Expect = 4.9
 Identities = 27/103 (26%), Positives = 49/103 (47%), Gaps = 16/103 (15%)

Query: 46  KKLLESPEEIKIEYEKQMT-----NYKKLRELYEQRAAAAELEHKTMIEKELAK-NALLE 99
           K + E+PE +K   E         +  +L EL E+R        +T +E+  A+ NAL +
Sbjct: 5   KLIRENPEAVK---EALAKRGFPLDVDELLELDEERREL-----QTELEELQAERNALSK 56

Query: 100 --AKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 140
              + K   E+ +A + + KE K++I  +E  +   + E+  L
Sbjct: 57  EIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAELEEL 99



 Score = 31.2 bits (72), Expect = 4.9
 Identities = 27/103 (26%), Positives = 49/103 (47%), Gaps = 16/103 (15%)

Query: 863 KKLLESPEEIKIEYEKQMT-----NYKKLRELYEQRAAAAELEHKTMIEKELAK-NALLE 916
           K + E+PE +K   E         +  +L EL E+R        +T +E+  A+ NAL +
Sbjct: 5   KLIRENPEAVK---EALAKRGFPLDVDELLELDEERREL-----QTELEELQAERNALSK 56

Query: 917 --AKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 957
              + K   E+ +A + + KE K++I  +E  +   + E+  L
Sbjct: 57  EIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAELEEL 99


>gnl|CDD|233028 TIGR00571, dam, DNA adenine methylase (dam).  All proteins in this
            family for which functions are known are DNA-adenine
            methyltransferases. This family is based on the
            phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
            Stanford University). The DNA adenine methylase (dam) of
            E. coli and related species is instrumental in
            distinguishing the newly synthesized strand during DNA
            replication for methylation-directed mismatch repair.
            This family includes several phage methylases and a
            number of different restriction enzyme chromosomal
            site-specific modification systems [DNA metabolism, DNA
            replication, recombination, and repair].
          Length = 266

 Score = 30.8 bits (70), Expect = 5.1
 Identities = 19/58 (32%), Positives = 30/58 (51%), Gaps = 8/58 (13%)

Query: 1103 KSHIQKEIANNLPKVWRILIE--------LLNTHQEQYENNDVNENFESCYKNIKTNT 1152
            K+ +  EI  +LPK +  L+E          N + ++Y  ND+NE+  + YK IK N 
Sbjct: 11   KTSLLPEIKKHLPKNFNCLVEPFVGGGAVFFNLNPKRYLLNDINEDLINLYKAIKNNV 68


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 31.2 bits (71), Expect = 5.1
 Identities = 29/104 (27%), Positives = 43/104 (41%), Gaps = 16/104 (15%)

Query: 41  LKDVEKKLLESPEEIKIEYEKQMT----------NYKKLRE----LYEQRAAAAELEHKT 86
           LK V        EEIKIE +K  T           YKKL+     L  Q +   E     
Sbjct: 337 LKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYY 396

Query: 87  MIEKELAKNALLEAKLKEIQEEI--QAQLEKYKENKKQILEMEK 128
              K   + A  +  ++EI+EE+  +  L+  K+ +K+    EK
Sbjct: 397 ESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440



 Score = 31.2 bits (71), Expect = 5.1
 Identities = 29/104 (27%), Positives = 43/104 (41%), Gaps = 16/104 (15%)

Query: 858 LKDVEKKLLESPEEIKIEYEKQMT----------NYKKLRE----LYEQRAAAAELEHKT 903
           LK V        EEIKIE +K  T           YKKL+     L  Q +   E     
Sbjct: 337 LKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYY 396

Query: 904 MIEKELAKNALLEAKLKEIQEEI--QAQLEKYKENKKQILEMEK 945
              K   + A  +  ++EI+EE+  +  L+  K+ +K+    EK
Sbjct: 397 ESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 30.8 bits (70), Expect = 5.2
 Identities = 20/171 (11%), Positives = 67/171 (39%), Gaps = 17/171 (9%)

Query: 98  LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQ 157
           ++ +++ +  +I+    K  E +K+I + +  I+    EI  L EN+        +L  +
Sbjct: 50  IQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVE---RQELLKKR 106

Query: 158 M----------TLVNNLFSNMLINPMIN-LDMLIKLIQDNYLLIIDLINNHEINDTVSLL 206
                      + ++ + ++   + +I+ +  +  ++  +  ++     + +  +     
Sbjct: 107 ARAMQVNGTATSYIDVILNSKSFSDLISRVTAISVIVDADKKILEQQKEDKKSLEEKQ-- 164

Query: 207 VDLEKKIDQ-ETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTKAVES 256
             LE K++    L  + + QL  L + +   ++ +        ++     +
Sbjct: 165 AALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASALGEKAA 215



 Score = 30.8 bits (70), Expect = 5.3
 Identities = 21/171 (12%), Positives = 69/171 (40%), Gaps = 17/171 (9%)

Query: 915  LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQ 974
            ++ +++ +  +I+    K  E +K+I + +  I+    EI  L EN+        +L  +
Sbjct: 50   IQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVE---RQELLKKR 106

Query: 975  M----------TLVNNLFSNMLINPMIN-LDMLIKLIQDNYLLIIDLINNHEINDTVSLL 1023
                       + ++ + ++   + +I+ +  +  ++  +  ++     + +  +     
Sbjct: 107  ARAMQVNGTATSYIDVILNSKSFSDLISRVTAISVIVDADKKILEQQKEDKKSLEEKQ-- 164

Query: 1024 VDLEKKIDQ-ETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTSKDSA 1073
              LE K++    L  + + QL  L + +   ++ +        ++  + +A
Sbjct: 165  AALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASALGEKAA 215


>gnl|CDD|223874 COG0803, LraI, ABC-type metal ion transport system, periplasmic
           component/surface adhesin [Inorganic ion transport and
           metabolism].
          Length = 303

 Score = 30.8 bits (70), Expect = 5.3
 Identities = 11/41 (26%), Positives = 18/41 (43%), Gaps = 3/41 (7%)

Query: 83  EHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQI 123
           E+K   EK          KL ++ EE +A+L K    +  +
Sbjct: 159 ENKETYEKNAEA---YLKKLNKLDEEAKAKLSKIPAQRDVV 196



 Score = 30.8 bits (70), Expect = 5.3
 Identities = 11/41 (26%), Positives = 18/41 (43%), Gaps = 3/41 (7%)

Query: 900 EHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQI 940
           E+K   EK          KL ++ EE +A+L K    +  +
Sbjct: 159 ENKETYEKNAEA---YLKKLNKLDEEAKAKLSKIPAQRDVV 196


>gnl|CDD|192445 pfam10018, Med4, Vitamin-D-receptor interacting Mediator subunit 4.
            Members of this family function as part of the Mediator
           (Med) complex, which links DNA-bound transcriptional
           regulators and the general transcription machinery,
           particularly the RNA polymerase II enzyme. They play a
           role in basal transcription by mediating activation or
           repression according to the specific complement of
           transcriptional regulators bound to the promoter.
          Length = 176

 Score = 30.1 bits (68), Expect = 5.6
 Identities = 12/50 (24%), Positives = 31/50 (62%), Gaps = 1/50 (2%)

Query: 97  LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 146
            L +   E++  +  +LE+++EN+++ILE+   ++  D++I ++ + L  
Sbjct: 6   DLISADDELKSAL-KELEEHQENQQRILELRAEVESLDEKIKDILKELKE 54



 Score = 30.1 bits (68), Expect = 5.6
 Identities = 12/50 (24%), Positives = 31/50 (62%), Gaps = 1/50 (2%)

Query: 914 LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 963
            L +   E++  +  +LE+++EN+++ILE+   ++  D++I ++ + L  
Sbjct: 6   DLISADDELKSAL-KELEEHQENQQRILELRAEVESLDEKIKDILKELKE 54


>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
          Length = 865

 Score = 31.2 bits (72), Expect = 5.9
 Identities = 19/54 (35%), Positives = 32/54 (59%), Gaps = 4/54 (7%)

Query: 90  KELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNEN 143
           KELA  ALL+AK  E+ E ++A LE+ KE +K++ +++   +L      +L   
Sbjct: 704 KELA--ALLKAKPSELPERVEALLEELKELEKELEQLKA--KLAAAAAGDLLAQ 753



 Score = 31.2 bits (72), Expect = 5.9
 Identities = 19/54 (35%), Positives = 32/54 (59%), Gaps = 4/54 (7%)

Query: 907 KELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNEN 960
           KELA  ALL+AK  E+ E ++A LE+ KE +K++ +++   +L      +L   
Sbjct: 704 KELA--ALLKAKPSELPERVEALLEELKELEKELEQLKA--KLAAAAAGDLLAQ 753


>gnl|CDD|215960 pfam00509, Hemagglutinin, Haemagglutinin.  Hemagglutinin from
           influenza virus causes membrane fusion of the viral
           membrane with the host membrane. Fusion occurs after the
           host cell internalises the virus by endocytosis. The
           drop of pH causes release of a hydrophobic fusion
           peptide and a large conformational change leading to
           membrane fusion.
          Length = 550

 Score = 31.2 bits (71), Expect = 6.1
 Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 12/76 (15%)

Query: 87  MIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQ-----------ILEMEKNIQLKDD 135
           +IEK   +   +E +  E++  I   L K  ++              +LE +  + L D 
Sbjct: 383 LIEKMNTQFEAIEKEFSELERRIN-NLNKKVDDGITDIWSYNAELLVLLENQHTLDLHDS 441

Query: 136 EINNLNENLTSSLAEN 151
            +NNL E +   L EN
Sbjct: 442 NVNNLYEKVRRQLREN 457



 Score = 31.2 bits (71), Expect = 6.1
 Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 12/76 (15%)

Query: 904 MIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQ-----------ILEMEKNIQLKDD 952
           +IEK   +   +E +  E++  I   L K  ++              +LE +  + L D 
Sbjct: 383 LIEKMNTQFEAIEKEFSELERRIN-NLNKKVDDGITDIWSYNAELLVLLENQHTLDLHDS 441

Query: 953 EINNLNENLTSSLAEN 968
            +NNL E +   L EN
Sbjct: 442 NVNNLYEKVRRQLREN 457


>gnl|CDD|234942 PRK01318, PRK01318, membrane protein insertase; Provisional.
          Length = 521

 Score = 31.0 bits (71), Expect = 6.1
 Identities = 12/28 (42%), Positives = 18/28 (64%), Gaps = 1/28 (3%)

Query: 100 AKLKEIQEEIQAQLEKYKENK-KQILEM 126
           AK+K +Q ++Q   EKYK++  K   EM
Sbjct: 350 AKMKVLQPKMQELKEKYKDDPQKMQQEM 377



 Score = 31.0 bits (71), Expect = 6.1
 Identities = 12/28 (42%), Positives = 18/28 (64%), Gaps = 1/28 (3%)

Query: 917 AKLKEIQEEIQAQLEKYKENK-KQILEM 943
           AK+K +Q ++Q   EKYK++  K   EM
Sbjct: 350 AKMKVLQPKMQELKEKYKDDPQKMQQEM 377


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 30.9 bits (70), Expect = 6.4
 Identities = 18/83 (21%), Positives = 31/83 (37%), Gaps = 4/83 (4%)

Query: 233 ETPMSSAMVEAVESNQTSTKAVE--SNQTSTSKDSANIQLDHAETNTNEESPADQDETNS 290
            +P + A     +S   +T      SN   TSKD+++   D A+ N       D ++   
Sbjct: 21  TSPTAYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQ--DNNDKKF 78

Query: 291 TTTLDKSHIQKEITRVISNSLIS 313
           +T    +     I   I  +L  
Sbjct: 79  STIDSSTSDSNNIIDFIYKNLPQ 101


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 30.9 bits (71), Expect = 6.4
 Identities = 19/93 (20%), Positives = 43/93 (46%), Gaps = 7/93 (7%)

Query: 42  KDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAK 101
             +     E PE++  +++     YKKLR+  E++      +HK   +    +  L E +
Sbjct: 207 DSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQHKKYAKL---REKLKE-E 262

Query: 102 LKEI---QEEIQAQLEKYKENKKQILEMEKNIQ 131
           LK +    ++I   +E+ ++  K++   E+ + 
Sbjct: 263 LKSLRLTSKQIDELVEQLRDINKRVRGQERELL 295



 Score = 30.9 bits (71), Expect = 6.4
 Identities = 19/93 (20%), Positives = 43/93 (46%), Gaps = 7/93 (7%)

Query: 859 KDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAK 918
             +     E PE++  +++     YKKLR+  E++      +HK   +    +  L E +
Sbjct: 207 DSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQHKKYAKL---REKLKE-E 262

Query: 919 LKEI---QEEIQAQLEKYKENKKQILEMEKNIQ 948
           LK +    ++I   +E+ ++  K++   E+ + 
Sbjct: 263 LKSLRLTSKQIDELVEQLRDINKRVRGQERELL 295


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 29.5 bits (67), Expect = 6.7
 Identities = 24/103 (23%), Positives = 50/103 (48%), Gaps = 12/103 (11%)

Query: 42  KDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAK 101
           K  +K+L +  ++ + E EK     +KL+E  ++ AA       T+ E    K    E +
Sbjct: 17  KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAA-------TLSEAAREK---KEKE 66

Query: 102 LKEIQEEIQAQLEKYKE--NKKQILEMEKNIQLKDDEINNLNE 142
           L++  +E Q + +K ++   K+Q  E++K +   +  I  + +
Sbjct: 67  LQKKVQEFQRKQQKLQQDLQKRQQEELQKILDKINKAIKEVAK 109



 Score = 29.5 bits (67), Expect = 6.7
 Identities = 24/103 (23%), Positives = 50/103 (48%), Gaps = 12/103 (11%)

Query: 859 KDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAK 918
           K  +K+L +  ++ + E EK     +KL+E  ++ AA       T+ E    K    E +
Sbjct: 17  KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAA-------TLSEAAREK---KEKE 66

Query: 919 LKEIQEEIQAQLEKYKE--NKKQILEMEKNIQLKDDEINNLNE 959
           L++  +E Q + +K ++   K+Q  E++K +   +  I  + +
Sbjct: 67  LQKKVQEFQRKQQKLQQDLQKRQQEELQKILDKINKAIKEVAK 109


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 30.7 bits (69), Expect = 6.8
 Identities = 18/94 (19%), Positives = 40/94 (42%), Gaps = 3/94 (3%)

Query: 35  SDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAK 94
           S  K   +  +KK  +  EE+K    KQ    ++L++L ++R  A E + +    ++ A+
Sbjct: 72  SSAKKGEQQRKKKEEQVAEELK---PKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQ 128

Query: 95  NALLEAKLKEIQEEIQAQLEKYKENKKQILEMEK 128
               + + +  +   + + +      K   E  K
Sbjct: 129 LEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAK 162



 Score = 30.7 bits (69), Expect = 6.8
 Identities = 18/94 (19%), Positives = 40/94 (42%), Gaps = 3/94 (3%)

Query: 852 SDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAK 911
           S  K   +  +KK  +  EE+K    KQ    ++L++L ++R  A E + +    ++ A+
Sbjct: 72  SSAKKGEQQRKKKEEQVAEELK---PKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQ 128

Query: 912 NALLEAKLKEIQEEIQAQLEKYKENKKQILEMEK 945
               + + +  +   + + +      K   E  K
Sbjct: 129 LEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAK 162


>gnl|CDD|193569 cd08656, M28_like_6, M28 Zn-Peptidases.  Peptidase family M28 (also
            called aminopeptidase Y family), uncharacterized
            subfamily. The M28 family contains aminopeptidases as
            well as carboxypeptidases. They have co-catalytic zinc
            ions; each zinc ion is tetrahedrally co-ordinated, with
            three amino acid ligands plus activated water; one
            aspartate residue binds both metal ions.
          Length = 281

 Score = 30.3 bits (69), Expect = 6.9
 Identities = 10/34 (29%), Positives = 21/34 (61%), Gaps = 3/34 (8%)

Query: 1612 VISSYKPDYSK---LVSHWNCRPLAVKEEEEANE 1642
            +I+S+ P+  K   L +HW+ RP+A ++ +   +
Sbjct: 62   IIASFNPEAKKRILLAAHWDTRPIADQDPDPKKK 95


>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN.  All proteins in this
           family for which functions are known are ATP binding
           proteins involved in the initiation of recombination and
           recombinational repair [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 563

 Score = 30.9 bits (70), Expect = 7.0
 Identities = 25/117 (21%), Positives = 59/117 (50%), Gaps = 9/117 (7%)

Query: 37  LKTRLKDVEKKLLESPEEIKIEYEKQ-MTNYKKLRE--------LYEQRAAAAELEHKTM 87
           L+ +L+++E+  L+  E+  +E E+Q ++N +KLRE        L            + +
Sbjct: 194 LQFQLEELEEADLQPGEDEALEAEQQRLSNLEKLRELSQNALAALRGDVDVQEGSLLEGL 253

Query: 88  IEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 144
            E +LA  ++++  L+E+ E++   L + +E  +++      ++   + +N + E L
Sbjct: 254 GEAQLALASVIDGSLRELAEQVGNALTEVEEATRELQNYLDELEFDPERLNEIEERL 310



 Score = 30.9 bits (70), Expect = 7.0
 Identities = 25/117 (21%), Positives = 59/117 (50%), Gaps = 9/117 (7%)

Query: 854 LKTRLKDVEKKLLESPEEIKIEYEKQ-MTNYKKLRE--------LYEQRAAAAELEHKTM 904
           L+ +L+++E+  L+  E+  +E E+Q ++N +KLRE        L            + +
Sbjct: 194 LQFQLEELEEADLQPGEDEALEAEQQRLSNLEKLRELSQNALAALRGDVDVQEGSLLEGL 253

Query: 905 IEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 961
            E +LA  ++++  L+E+ E++   L + +E  +++      ++   + +N + E L
Sbjct: 254 GEAQLALASVIDGSLRELAEQVGNALTEVEEATRELQNYLDELEFDPERLNEIEERL 310


>gnl|CDD|234780 PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha;
           Validated.
          Length = 339

 Score = 30.5 bits (70), Expect = 7.0
 Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 16/91 (17%)

Query: 850 LVSDLKTRLKDVEKKLLESPEEIKIEY--EK-QMTNY-KKLREL-YEQRAAAAEL--EHK 902
           LV +    +       LE+ E ++++Y  +K ++T   K L +L  E+R  A  L  E K
Sbjct: 8   LVEEALAAIAAA--SDLEALEALRVKYLGKKGELTELLKGLGKLPPEERKEAGALINELK 65

Query: 903 TMIEKELAKNALLEAKLKEI-QEEIQAQLEK 932
             IE      A LE + +E+    + A+L  
Sbjct: 66  QAIE------AALEERKEELEAAALNARLAA 90


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 31.1 bits (71), Expect = 7.1
 Identities = 26/113 (23%), Positives = 47/113 (41%), Gaps = 17/113 (15%)

Query: 32  FKVSDLKTRLKDVEKKLL----ESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTM 87
            ++  L++ L  + + L       P+   +  EK+  + K L EL        E E +  
Sbjct: 43  ERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEEL----IKDVEEELEK- 97

Query: 88  IEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 140
           IEKE+ +   LE ++ E++ EI+       E + + LE   N  L    +   
Sbjct: 98  IEKEIKE---LEEEISELENEIKE-----LEQEIERLEPWGNFDLDLSLLLGF 142


>gnl|CDD|226907 COG4530, COG4530, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 129

 Score = 29.5 bits (66), Expect = 7.2
 Identities = 20/87 (22%), Positives = 29/87 (33%), Gaps = 14/87 (16%)

Query: 1277 RKEFALRRLQYDGGRSTSESGISTDTEDDNVEIVETNDESEDDK----------SEVVET 1326
            R  F              E     D E++ VE+V   D  +D K           E V+ 
Sbjct: 38   RSYFEETTPDAAAEEEEEEVVKEVDAENEEVEVVSLEDADDDPKGGDDLPDLGDDEDVDL 97

Query: 1327 QSEHDVPHTNSVMSIEEDDQDEVEELE 1353
              + D       +  EEDD D+V  + 
Sbjct: 98   DDDDDDT----FLEDEEDDDDDVSGII 120


>gnl|CDD|132552 TIGR03513, GldL_gliding, gliding motility-associated protein GldL. 
           This protein family, GldL, is named for the member from
           Flavobacterium johnsoniae, which is required for a type
           of rapid gliding motility found in certain members of
           the Bacteriodetes. However, members are found also in
           several members of the Bacteriodetes that appear not to
           be motile [Cellular processes, Chemotaxis and motility].
          Length = 202

 Score = 30.2 bits (68), Expect = 7.3
 Identities = 28/131 (21%), Positives = 48/131 (36%), Gaps = 26/131 (19%)

Query: 65  NYKKLRELYEQRAAAAELEHKTMIE--KELAKNALLEAKL-------KEIQEEIQAQLEK 115
           N +K    Y +  +A  L  + +    KE  K+A L   L       +   + +    + 
Sbjct: 69  NKRKEAAAYAEEPSAVGLLSQKLDAMLKEAKKDATLMQSLGNGINNFEGAAKTLAPMTDS 128

Query: 116 YKENKKQILEMEK------------NIQLKD-----DEINNLNENLTSSLAENHILNSQM 158
           Y + KK I +M                QLK      D  N +  N +S   E   + + +
Sbjct: 129 YAQQKKYIEQMSSLAANMEGLNTIYEAQLKGASSHADANNEIAINSSSLKEEMEKMAANL 188

Query: 159 TLVNNLFSNML 169
           T +N ++  ML
Sbjct: 189 TSLNEVYGGML 199



 Score = 30.2 bits (68), Expect = 7.3
 Identities = 28/131 (21%), Positives = 48/131 (36%), Gaps = 26/131 (19%)

Query: 882 NYKKLRELYEQRAAAAELEHKTMIE--KELAKNALLEAKL-------KEIQEEIQAQLEK 932
           N +K    Y +  +A  L  + +    KE  K+A L   L       +   + +    + 
Sbjct: 69  NKRKEAAAYAEEPSAVGLLSQKLDAMLKEAKKDATLMQSLGNGINNFEGAAKTLAPMTDS 128

Query: 933 YKENKKQILEMEK------------NIQLKD-----DEINNLNENLTSSLAENHILNSQM 975
           Y + KK I +M                QLK      D  N +  N +S   E   + + +
Sbjct: 129 YAQQKKYIEQMSSLAANMEGLNTIYEAQLKGASSHADANNEIAINSSSLKEEMEKMAANL 188

Query: 976 TLVNNLFSNML 986
           T +N ++  ML
Sbjct: 189 TSLNEVYGGML 199


>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16).  Autophagy is a
           ubiquitous intracellular degradation system for
           eukaryotic cells. During autophagy, cytoplasmic
           components are enclosed in autophagosomes and delivered
           to lysosomes/vacuoles. ATG16 (also known as Apg16) has
           been shown to be bind to Apg5 and is required for the
           function of the Apg12p-Apg5p conjugate in the yeast
           autophagy pathway.
          Length = 194

 Score = 30.2 bits (68), Expect = 7.4
 Identities = 20/94 (21%), Positives = 44/94 (46%), Gaps = 4/94 (4%)

Query: 48  LLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQE 107
            +   E+   +  +++T   K R    QR     L     +E+   +   LE  + E++ 
Sbjct: 68  AIAEMEQKLAKLREELTELHKKRGELAQRL----LLLNDELEQLRREIQQLEKTIAELRS 123

Query: 108 EIQAQLEKYKENKKQILEMEKNIQLKDDEINNLN 141
           EI +   + ++ ++++ E EK+ +   DE+ +LN
Sbjct: 124 EITSLETEIRDLREELQEKEKDNETLQDELISLN 157



 Score = 30.2 bits (68), Expect = 7.4
 Identities = 20/94 (21%), Positives = 44/94 (46%), Gaps = 4/94 (4%)

Query: 865 LLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQE 924
            +   E+   +  +++T   K R    QR     L     +E+   +   LE  + E++ 
Sbjct: 68  AIAEMEQKLAKLREELTELHKKRGELAQRL----LLLNDELEQLRREIQQLEKTIAELRS 123

Query: 925 EIQAQLEKYKENKKQILEMEKNIQLKDDEINNLN 958
           EI +   + ++ ++++ E EK+ +   DE+ +LN
Sbjct: 124 EITSLETEIRDLREELQEKEKDNETLQDELISLN 157


>gnl|CDD|224202 COG1283, NptA, Na+/phosphate symporter [Inorganic ion transport and
            metabolism].
          Length = 533

 Score = 30.7 bits (70), Expect = 7.4
 Identities = 43/220 (19%), Positives = 81/220 (36%), Gaps = 54/220 (24%)

Query: 835  PESNILATTRSVILPLVSDLKTRLKDVEKKLLESPE--------EIKI--EYEKQMTNYK 884
            P + +LA   + ++P   D +   K +++  L+SP         E+    +  +QM   +
Sbjct: 295  PFTGLLARFVTRLIPGDEDDEIEPKHLDETALDSPVVALANAAREVLRLGDSIEQML--E 352

Query: 885  KLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQL---------EKYKE 935
            +L E  E  A            KE+ K   LE  +  + EEI+  L         E+   
Sbjct: 353  RLYEYIEGDAK---------KVKEIRK---LEDAVDRLYEEIKLYLARLSKEGLSEEESR 400

Query: 936  NKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILN---SQMTLVNNLFSNMLINPMIN 992
               +I++   N++   D I  L E     +A     +    +                  
Sbjct: 401  RWAEIIDAAINLEHIGDIIERLLELADKKIANGRAFSEDGLEE----------------- 443

Query: 993  LDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQ 1032
            LD L  L  +N  L I ++   +  +    LV+ +K++ +
Sbjct: 444  LDALFALTLENLRLAISVLVTGD-LELARRLVERKKRVRR 482


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 31.0 bits (71), Expect = 7.5
 Identities = 14/60 (23%), Positives = 28/60 (46%), Gaps = 3/60 (5%)

Query: 90  KELAKNALLEAKLKEIQ---EEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 146
             L K  LLEA+ K +Q   E+  A L+K    K++  ++++ +     ++      L +
Sbjct: 46  DALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEA 105



 Score = 31.0 bits (71), Expect = 7.5
 Identities = 14/60 (23%), Positives = 28/60 (46%), Gaps = 3/60 (5%)

Query: 907 KELAKNALLEAKLKEIQ---EEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 963
             L K  LLEA+ K +Q   E+  A L+K    K++  ++++ +     ++      L +
Sbjct: 46  DALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEA 105



 Score = 31.0 bits (71), Expect = 7.6
 Identities = 24/116 (20%), Positives = 43/116 (37%), Gaps = 6/116 (5%)

Query: 206 LVDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTKAVESNQTSTSKDS 265
              L++++ Q    ++  +   +LE  +        E + +   S + +ES    T    
Sbjct: 82  TEQLKQQLAQAPAKLRQAQ--AELEALKDDNDEETRETLST--LSLRQLESRLAQTLDQL 137

Query: 266 ANIQLDHAETNTNEESPADQDETNSTTTLDKSHIQKEITRVISNSLISG--LSPSL 319
            N Q D AE N+   S   Q E         S   ++I  ++    + G  L PS 
Sbjct: 138 QNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLQQIRNLLKGGKVGGKALRPSQ 193



 Score = 30.7 bits (70), Expect = 8.0
 Identities = 39/264 (14%), Positives = 92/264 (34%), Gaps = 73/264 (27%)

Query: 33  KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 92
           K+   K   + ++++L ++P +++ + + ++   K               ++     + L
Sbjct: 74  KIDRQKEETEQLKQQLAQAPAKLR-QAQAELEALKD--------------DNDEETRETL 118

Query: 93  AKNAL--LEAKLKEIQE---EIQAQLEKY-----------KENKKQILEMEKNIQLKDDE 136
           +  +L  LE++L +  +     Q  L +Y           +  +  +    + +Q    +
Sbjct: 119 STLSLRQLESRLAQTLDQLQNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLQ----Q 174

Query: 137 INN-LNENLTSSLAENH----ILNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLII 191
           I N L        A       +L ++  L+N              D+  K ++ N  L  
Sbjct: 175 IRNLLKGGKVGGKALRPSQRVLLQAEQALLN-----------AQNDLQRKSLEGNTQL-Q 222

Query: 192 DLINNHEINDTVSLLVDLEKKIDQETLAIQ---NDKQLKQLENYETPMSSAMVEAVESNQ 248
           DL+   +  D ++  +     ++ +   +Q   N K+L   E           + V+  Q
Sbjct: 223 DLLQ--KQRDYLTARIQR---LEHQLQLLQEAINSKRLTLSE-----------KTVQEAQ 266

Query: 249 TSTKAVESNQTSTSKD--SANIQL 270
           +  +A               N+QL
Sbjct: 267 SQDEAARIQANPLVAQELEINLQL 290


>gnl|CDD|214442 MTH00160, ND2, NADH dehydrogenase subunit 2; Provisional.
          Length = 335

 Score = 30.6 bits (70), Expect = 7.5
 Identities = 11/40 (27%), Positives = 23/40 (57%), Gaps = 1/40 (2%)

Query: 144 LTSSLAENHIL-NSQMTLVNNLFSNMLINPMINLDMLIKL 182
           L  +LA   +L +  + ++N +  N + + +IN  +L+KL
Sbjct: 62  LIQALASIILLFSIILNMLNLMLFNNISSMLINSSLLLKL 101



 Score = 30.6 bits (70), Expect = 7.5
 Identities = 11/40 (27%), Positives = 23/40 (57%), Gaps = 1/40 (2%)

Query: 961 LTSSLAENHIL-NSQMTLVNNLFSNMLINPMINLDMLIKL 999
           L  +LA   +L +  + ++N +  N + + +IN  +L+KL
Sbjct: 62  LIQALASIILLFSIILNMLNLMLFNNISSMLINSSLLLKL 101


>gnl|CDD|132062 TIGR03017, EpsF, chain length determinant protein EpsF.  Sequences
           in this family of proteins are members of the chain
           length determinant family (pfam02706) which includes the
           wzc protein from E.coli. This family of proteins are
           homologous to the EpsF protein of the methanolan
           biosynthesis operon of Methylobacillus species strain
           12S. The distribution of this protein appears to be
           restricted to a subset of exopolysaccharide operons
           containing a syntenic grouping of genes including a
           variant of the EpsH exosortase protein. Exosortase has
           been proposed to be involved in the targetting and
           processing of proteins containing the PEP-CTERM domain
           to the exopolysaccharide layer.
          Length = 444

 Score = 30.5 bits (69), Expect = 7.6
 Identities = 25/104 (24%), Positives = 42/104 (40%), Gaps = 22/104 (21%)

Query: 849 PLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEH-KTMIEK 907
           P++ +LKT +   E KL E  +            YK          A AE+   K+ +  
Sbjct: 254 PIIQNLKTDIARAESKLAELSQR----LGPNHPQYK---------RAQAEINSLKSQLNA 300

Query: 908 ELAKNA--------LLEAKLKEIQEEIQAQLEKYKENKKQILEM 943
           E+ K          +L+ +  E++E ++ Q  K  E  +Q  EM
Sbjct: 301 EIKKVTSSVGTNSRILKQREAELREALENQKAKVLELNRQRDEM 344


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 30.8 bits (70), Expect = 7.9
 Identities = 22/95 (23%), Positives = 41/95 (43%), Gaps = 5/95 (5%)

Query: 850 LVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 909
           L++  K  L  + KKL E   E + E E+ +   K+ RE    +     L      E  L
Sbjct: 164 LIASAKEELDQLSKKLAELKAEEEEELERAL---KEKREELLSKLEEELLARLESKEAAL 220

Query: 910 AKN--ALLEAKLKEIQEEIQAQLEKYKENKKQILE 942
            K      E + +E++++ + +L +  E + +  E
Sbjct: 221 EKQLRLEFEREKEELRKKYEEKLRQELERQAEAHE 255


>gnl|CDD|218552 pfam05316, VAR1, Mitochondrial ribosomal protein (VAR1).  This
           family consists of the yeast mitochondrial ribosomal
           proteins VAR1. Mitochondria possess their own ribosomes
           responsible for the synthesis of a small number of
           proteins encoded by the mitochondrial genome. In yeast
           the two ribosomal RNAs and a single ribosomal protein,
           VAR1, are products of mitochondrial genes, and the
           remaining approximately 80 ribosomal proteins are
           encoded in the nucleus. VAR1 along with 15S rRNA are
           necessary for the formation of mature 37S subunits.
          Length = 350

 Score = 30.3 bits (68), Expect = 7.9
 Identities = 27/110 (24%), Positives = 54/110 (49%), Gaps = 11/110 (10%)

Query: 119 NKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQMTLVNNLFS----------NM 168
            K  +L+M  N ++ + +IN  N N+   L E +   +++  +NN+ +          N 
Sbjct: 5   LKNYLLKMNNN-RMNNMDINIKNNNMNKYLNEYNNKGNKLQHINNMNNWNNQLYNYNKNN 63

Query: 169 LINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQETL 218
           +IN  IN  ++ KL+     L +++INN+ IN+     + + K   + T+
Sbjct: 64  VINTWINDKLVNKLLYKLMKLKVNIINNNIINNGQIKKIIINKPKFKHTI 113



 Score = 30.3 bits (68), Expect = 7.9
 Identities = 27/110 (24%), Positives = 54/110 (49%), Gaps = 11/110 (10%)

Query: 936  NKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQMTLVNNLFS----------NM 985
             K  +L+M  N ++ + +IN  N N+   L E +   +++  +NN+ +          N 
Sbjct: 5    LKNYLLKMNNN-RMNNMDINIKNNNMNKYLNEYNNKGNKLQHINNMNNWNNQLYNYNKNN 63

Query: 986  LINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQETL 1035
            +IN  IN  ++ KL+     L +++INN+ IN+     + + K   + T+
Sbjct: 64   VINTWINDKLVNKLLYKLMKLKVNIINNNIINNGQIKKIIINKPKFKHTI 113


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 30.8 bits (70), Expect = 8.0
 Identities = 30/176 (17%), Positives = 53/176 (30%), Gaps = 22/176 (12%)

Query: 441 ETSDEKDIVKGNVVDYEATKQTRTESPSEEQKSKQLIRQDR---ITEETDESRPFESAEC 497
           E     D + G   D +   +   +   E+ +       +R   I EE D  +     + 
Sbjct: 363 EVEARLDALTGKHQDVQRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEEDL 422

Query: 498 SRTSQPEVENHISNTDTIETAYSEMSEPIEELHTERLSTETLQREPITSVPTATSNTTLP 557
                   +         E    E +E   EL   RL     + +      TAT      
Sbjct: 423 QALESQLRQQL-------EAGKLEFNEEEYEL-ELRLGRLKQRLDS----ATATPEELEQ 470

Query: 558 SEPSVSRTRTPEEILEARNARLKRLEDQCKNLFHKMSLTSQRSDTLSERLEELHDY 613
            E +       +E  E   A +++L+ + + L        +R D   E L+     
Sbjct: 471 LEINDEALEKAQEEQEQAEANVEQLQSELRQL-------RKRRDEALEALQRAERR 519


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 30.6 bits (69), Expect = 8.3
 Identities = 29/99 (29%), Positives = 50/99 (50%), Gaps = 11/99 (11%)

Query: 849 PLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKE 908
           PL+  L+  +++ EK +++   +I+ E EK++   +KL +  E       LE      KE
Sbjct: 361 PLLERLEKDVEEGEKTIVKEARQIEEELEKEV---EKLGKEEESLFKRVALEEGL---KE 414

Query: 909 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNI 947
           L ++   E  LKE+ +E +  LEK    K +  E+EK  
Sbjct: 415 LEQD--EENFLKELSKEEKELLEKL---KMEASEVEKLF 448


>gnl|CDD|145900 pfam02994, Transposase_22, L1 transposable element. 
          Length = 370

 Score = 30.4 bits (68), Expect = 8.3
 Identities = 32/140 (22%), Positives = 53/140 (37%), Gaps = 13/140 (9%)

Query: 504 EVENHISNTDTIETAYSEMSEPIEELHTERLSTETLQREPITSVPTATSNTTLPSEPSVS 563
           +VE HI NT+       E+ +   EL  E  S +    E    +        +      +
Sbjct: 85  QVEAHIENTEKCLKELMELKKKARELKEECRSIKKEHMETTLDIENQKKRQGVVDTSFTN 144

Query: 564 RTRTPEEILEARNARLKRLEDQCKNLFHKMSLTSQRSDTLSERLEELHDYYGTSNERHSP 623
           R +  EE       R+   ED  + +        +R   L + ++E+ DY    N R   
Sbjct: 145 RIQEMEE-------RISGAEDSIEEIKRTGKFNEKRKKRLEQNIQEIQDYVRRPNLRII- 196

Query: 624 RRHREGSQDSDDTRLRSPEN 643
                G  +S+D +L+ PEN
Sbjct: 197 -----GVPESEDEQLKGPEN 211


>gnl|CDD|235474 PRK05451, PRK05451, dihydroorotase; Provisional.
          Length = 345

 Score = 30.5 bits (70), Expect = 8.5
 Identities = 11/20 (55%), Positives = 15/20 (75%), Gaps = 1/20 (5%)

Query: 408 PLLRDH-KIKVVAEHISTED 426
           PL R   K+K+V EHI+T+D
Sbjct: 161 PLRRRFPKLKIVFEHITTKD 180


>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3).
            This family contains EBNA-3A, -3B, and -3C which are
            latent infection nuclear proteins important for
            Epstein-Barr virus (EBV)-induced B-cell immortalisation
            and the immune response to EBV infection.
          Length = 254

 Score = 30.1 bits (68), Expect = 8.5
 Identities = 18/59 (30%), Positives = 28/59 (47%), Gaps = 9/59 (15%)

Query: 1281 ALRRLQYDGGRSTSESGISTDTEDDNVEIVETNDESEDDKSEVVETQSEHDVPHTNSVM 1339
            A  R     G++   S + T  E D+   VET+ ESED +SE      + ++P+    M
Sbjct: 192  AFLR-----GQALGLSALRTPGEPDDDATVETSSESEDSESE----SDDEELPYIVPRM 241


>gnl|CDD|216560 pfam01544, CorA, CorA-like Mg2+ transporter protein.  The CorA
           transport system is the primary Mg2+ influx system of
           Salmonella typhimurium and Escherichia coli. CorA is
           virtually ubiquitous in the Bacteria and Archaea. There
           are also eukaryotic relatives of this protein. The
           family includes the MRS2 protein from yeast that is
           thought to be an RNA splicing protein. However its
           membership of this family suggests that its effect on
           splicing is due to altered magnesium levels in the cell.
          Length = 291

 Score = 30.4 bits (69), Expect = 8.8
 Identities = 35/169 (20%), Positives = 66/169 (39%), Gaps = 23/169 (13%)

Query: 7   LSMIVFKNFILTIGSPLTSSTSMVVFKVSDLKTRLKDVEK--------------KLLESP 52
           LS+ +  N ++T+      +   +  ++ +L    K                  ++LE  
Sbjct: 71  LSIFLGDNRLITVRREPLPAVDEIRKRLENLGLGPKSPGDLLYLLLDSIVDRYFEILEKL 130

Query: 53  EEIKIEYEKQMTNYKK---LRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEI 109
           EE   E E ++ +      LREL   R +   L       +E+  N LL      I EE 
Sbjct: 131 EEELDELEDELEDSTTNELLRELLRLRRSLVRLRRSLAPLREVL-NRLLSDDGPLIDEEE 189

Query: 110 QAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQM 158
           +  L    +  +++LEM + ++   + + +L +   S L+    LN  M
Sbjct: 190 KEYLRDLLDELERLLEMAEILR---ERLRSLQDAYLSLLSNR--LNRIM 233


>gnl|CDD|227731 COG5444, COG5444, Uncharacterized conserved protein [Function
           unknown].
          Length = 565

 Score = 30.5 bits (69), Expect = 8.8
 Identities = 25/139 (17%), Positives = 49/139 (35%), Gaps = 14/139 (10%)

Query: 23  LTSSTSMVVFKVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAEL 82
               T     +V   +   KD  +KL +  +E         +  ++L+ +  Q      L
Sbjct: 137 GKLDTEDFKDEVDSARKHAKDTLEKLYKLDQEGMTLMAAVESKMQELKAIIRQ------L 190

Query: 83  EHKTMIEKELAK--------NALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKD 134
           E  T+      K         A  E  + +   E+  Q E Y + K ++ +   +++  D
Sbjct: 191 EEWTIKGGATKKGVPIHYVAKAFAEVTIHKKAAEVALQSETYLDIKTELAKERPDMRDLD 250

Query: 135 DEINNLNENLTSSLAENHI 153
             + + N+     LA   I
Sbjct: 251 KPLESANKTGYEGLAGEDI 269


>gnl|CDD|200398 TIGR04147, GGGPS_Halobact, phosphoglycerol geranylgeranyltransferase,
            putative.  In most archaea, phosphoglycerol
            geranylgeranyltransferase (EC 2.5.1.41), also known as
            GGGP synthase and GGGPS, catalyzes the stereospecific
            first step in the biosynthesis of their characteristic
            membrane diether lipids. However, some groups of archaeal
            GGGPS homologs are more closely related to certain
            bacterial proteins than to each other. This family
            represents the putative GGGPS family as found in the
            Halobacteria.
          Length = 229

 Score = 29.8 bits (67), Expect = 8.9
 Identities = 13/32 (40%), Positives = 19/32 (59%)

Query: 1302 TEDDNVEIVETNDESEDDKSEVVETQSEHDVP 1333
            T  D +EI  T D +E+    VV+  +E+DVP
Sbjct: 29   TGTDAIEIGGTLDMTEEKMQRVVDACAEYDVP 60


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 29.9 bits (68), Expect = 9.0
 Identities = 31/93 (33%), Positives = 49/93 (52%), Gaps = 17/93 (18%)

Query: 37  LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKL----RELYEQ-RAAAAELEHKTMIEKE 91
           LK RLK++EK+L    + +K E E     ++K+     ELY++  AA  +++ KT     
Sbjct: 91  LKARLKELEKEL----KNLKWESEVLEQRFEKVERERDELYDKFEAAIQDVQQKT----- 141

Query: 92  LAKNALLEAKLKEIQEEI---QAQLEKYKENKK 121
             KN LLE KL+ + EE+   +AQL +      
Sbjct: 142 GLKNLLLEQKLEALNEELEKKEAQLNEVLAAAN 174



 Score = 29.9 bits (68), Expect = 9.0
 Identities = 31/93 (33%), Positives = 49/93 (52%), Gaps = 17/93 (18%)

Query: 854 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKL----RELYEQ-RAAAAELEHKTMIEKE 908
           LK RLK++EK+L    + +K E E     ++K+     ELY++  AA  +++ KT     
Sbjct: 91  LKARLKELEKEL----KNLKWESEVLEQRFEKVERERDELYDKFEAAIQDVQQKT----- 141

Query: 909 LAKNALLEAKLKEIQEEI---QAQLEKYKENKK 938
             KN LLE KL+ + EE+   +AQL +      
Sbjct: 142 GLKNLLLEQKLEALNEELEKKEAQLNEVLAAAN 174


>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160.  This
           family consists of several special lobe-specific silk
           protein SSP160 sequences which appear to be specific to
           Chironomus (Midge) species.
          Length = 758

 Score = 30.5 bits (68), Expect = 9.1
 Identities = 12/58 (20%), Positives = 31/58 (53%)

Query: 236 MSSAMVEAVESNQTSTKAVESNQTSTSKDSANIQLDHAETNTNEESPADQDETNSTTT 293
           M SA ++A++S   S     S+  +++ +S +   +++ T++N  +      ++S +T
Sbjct: 90  MISANIQAIQSGSGSASGNSSSSANSTSNSNSTTSNNSTTSSNSTTTTSNSTSSSNST 147


>gnl|CDD|181395 PRK08341, PRK08341, amidophosphoribosyltransferase; Provisional.
          Length = 442

 Score = 30.6 bits (69), Expect = 9.2
 Identities = 18/67 (26%), Positives = 32/67 (47%), Gaps = 5/67 (7%)

Query: 54  EIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQL 113
           +I I +   +TN+  LR  YE R     ++ ++ ++ EL   + L     E  +E +A  
Sbjct: 95  KIAIAHNGTLTNFLPLRRKYESRG----VKFRSSVDTELIGISFLW-HYSETGDEFEAMR 149

Query: 114 EKYKENK 120
           E + E K
Sbjct: 150 EVFNEVK 156



 Score = 30.6 bits (69), Expect = 9.2
 Identities = 18/67 (26%), Positives = 32/67 (47%), Gaps = 5/67 (7%)

Query: 871 EIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQL 930
           +I I +   +TN+  LR  YE R     ++ ++ ++ EL   + L     E  +E +A  
Sbjct: 95  KIAIAHNGTLTNFLPLRRKYESRG----VKFRSSVDTELIGISFLW-HYSETGDEFEAMR 149

Query: 931 EKYKENK 937
           E + E K
Sbjct: 150 EVFNEVK 156


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 30.6 bits (69), Expect = 9.6
 Identities = 55/296 (18%), Positives = 116/296 (39%), Gaps = 26/296 (8%)

Query: 860  DVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAK--NALLEA 917
            +  K+L+E     KI+ + ++  + K+           +L     + KE  K    L + 
Sbjct: 893  NSNKQLVEHLLNNKIDLKNKLEQHMKIINTDNIIQKNEKLNLLNNLNKEKEKIEKQLSDT 952

Query: 918  KLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQMTL 977
            K+  ++ +I+  LE Y ++K+ I     N     ++++   +      +E   LN    +
Sbjct: 953  KINNLKMQIEKTLEYYDKSKENI---NGNDGTHLEKLDKEKDEWEHFKSEIDKLNVNYNI 1009

Query: 978  VNNLFSNMLINPMIN-LDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQETLA 1036
            +N    +++     + ++++ KLI++    I + ++ +     +SLL  ++ K+   +  
Sbjct: 1010 LNKKIDDLIKKQHDDIIELIDKLIKEKGKEIEEKVDQY-----ISLLEKMKTKL--SSFH 1062

Query: 1037 IQNDKQLKQLENYETPMS--SAMVEAVESNQTSTSKDSANIQLDHAE--TNTNEESPADQ 1092
               D +  +    +  +      VEA+             I+    E   N ++E    Q
Sbjct: 1063 FNIDIKKYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVVNADKEK-NKQ 1121

Query: 1093 DETNSTTTLDKSHIQKEIANNLPKVWRILIELLNTHQEQYENNDVNENFESCYKNI 1148
             E  +        I K++   L        EL N + E    N+VNE  E  Y+ I
Sbjct: 1122 TEHYNKKKKSLEKIYKQMEKTLK-------ELENMNLEDITLNEVNE-IEIEYERI 1169


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
           contains an extraordinarily long coiled-coil domain in
           its C-terminal half and is ubiquitously expressed. It is
           a novel binding partner of several syntaxin family
           members and is possibly involved in Ca2+-dependent
           exocytosis in neuroendocrine cells. Gamma-taxilin,
           described as leucine zipper protein Factor Inhibiting
           ATF4-mediated Transcription (FIAT), localises to the
           nucleus in osteoblasts and dimerises with ATF4 to form
           inactive dimers, thus inhibiting ATF4-mediated
           transcription.
          Length = 309

 Score = 30.0 bits (68), Expect = 9.9
 Identities = 28/113 (24%), Positives = 53/113 (46%), Gaps = 1/113 (0%)

Query: 850 LVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 909
           L    ++ LKD++ ++ E         E+     +KL+EL EQ     +   K +  K+L
Sbjct: 108 LTEKFQSTLKDIQDQMEEHSNPNHKLCEENEELREKLKELIEQYELREQHFEKLLKTKDL 167

Query: 910 AKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLT 962
            +  L +AKL++ QE+ + + EK+K  K+ +L+    +    +    L   L 
Sbjct: 168 EQQ-LNDAKLEQAQEQAEQEQEKHKREKEILLKEAAQVSTLKETEKELRAQLN 219


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 30.6 bits (69), Expect = 10.0
 Identities = 31/140 (22%), Positives = 56/140 (40%), Gaps = 13/140 (9%)

Query: 832 STPPESNILATTRSVILP--LVSDLKTRL---KDVEKKLLESPEEIKIEYEKQMTNYKKL 886
             P ES      +   +P  ++   KT     K+    L+E    ++ E E++  + +KL
Sbjct: 478 GIPGESYAFEIAQRYGIPHFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKL 537

Query: 887 R--------ELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKK 938
                    EL ++     E E    +E E      L+A  KE++  I+   EK     K
Sbjct: 538 LKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAK 597

Query: 939 QILEMEKNIQLKDDEINNLN 958
           +I  +E  ++LK+ +     
Sbjct: 598 EIKSIEDLVKLKETKQKIPQ 617


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.309    0.126    0.340 

Gapped
Lambda     K      H
   0.267   0.0724    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 79,343,333
Number of extensions: 7862484
Number of successful extensions: 12221
Number of sequences better than 10.0: 1
Number of HSP's gapped: 10886
Number of HSP's successfully gapped: 1148
Length of query: 1642
Length of database: 10,937,602
Length adjustment: 110
Effective length of query: 1532
Effective length of database: 6,058,662
Effective search space: 9281870184
Effective search space used: 9281870184
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 66 (29.2 bits)