RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy597
         (546 letters)



>gnl|CDD|220173 pfam09311, Rab5-bind, Rabaptin-like protein.  Members of this
           family are predominantly found in Rabaptin and allow for
           binding to the GTPase Rab5. This interaction is
           necessary and sufficient for Rab5-dependent recruitment
           of Rabaptin5 to early endosomal membranes.
          Length = 180

 Score =  171 bits (436), Expect = 8e-51
 Identities = 91/177 (51%), Positives = 115/177 (64%), Gaps = 2/177 (1%)

Query: 77  ELGLNEAQVMTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANS 136
           EL L+ +QV  A+   LQ +EA +QKLR Q+RRL QEN WLR ELA TQQ+LQ+SEQ   
Sbjct: 1   ELQLSFSQVQEAVQDQLQELEASRQKLRAQLRRLQQENQWLRGELALTQQELQASEQEVI 60

Query: 137 QLEEEKKHLEFMSSIRKYDSDIQGDENSGDQKSDKHKQDDPVVDLFPDEDNDDRNNLS-P 195
           QL EE KHL+F+   R+ +       +   ++  K +  DP+ +L P+ + +    +  P
Sbjct: 61  QLPEEVKHLQFLCLQRREELIESRTASEHLEEELKSEI-DPLKELKPNLEEELAAEIELP 119

Query: 196 TPPSQFAQQVNAGYEIPARLRTLHNLVIQYASQGRYEVAVPLCKQALEDLEKTSGKY 252
              S+  Q   + YEIPARLRTLHNLVIQY SQGRYEVAVPLCK ALEDLEK SG  
Sbjct: 120 QIQSEKDQLQASLYEIPARLRTLHNLVIQYESQGRYEVAVPLCKAALEDLEKESGHK 176



 Score = 30.8 bits (70), Expect = 0.98
 Identities = 22/49 (44%), Positives = 29/49 (59%)

Query: 328 GENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDV 376
                A   TL+NL + Y  +G+Y+ A PLCK ALE  EK  G +HPD+
Sbjct: 132 LYEIPARLRTLHNLVIQYESQGRYEVAVPLCKAALEDLEKESGHKHPDL 180


>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat. 
          Length = 78

 Score = 76.3 bits (188), Expect = 3e-17
 Identities = 38/79 (48%), Positives = 45/79 (56%), Gaps = 1/79 (1%)

Query: 289 HPDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKR 348
           HPD+A  LN LALV R    Y EA  LL  AL +  + LGE+H   A  LNNLA LY   
Sbjct: 1   HPDLAAALNNLALVLRRLGDYDEALELLEKALELARE-LGEDHPETARALNNLARLYLAL 59

Query: 349 GKYKEAEPLCKRALEIREK 367
           G Y EA    ++AL +RE 
Sbjct: 60  GDYDEALEYLEKALALREA 78



 Score = 72.8 bits (179), Expect = 4e-16
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 331 HAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQ 390
           H  +AA LNNLA++  + G Y EA  L ++ALE+    LG++HP+ A+ LNNLA L    
Sbjct: 1   HPDLAAALNNLALVLRRLGDYDEALELLEKALELAR-ELGEDHPETARALNNLARLYLAL 59

Query: 391 SKYEEVERYYQRALEIYE 408
             Y+E   Y ++AL + E
Sbjct: 60  GDYDEALEYLEKALALRE 77



 Score = 65.9 bits (161), Expect = 1e-13
 Identities = 30/79 (37%), Positives = 41/79 (51%), Gaps = 1/79 (1%)

Query: 373 HPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQ 432
           HPD+A  LNNLAL+ +    Y+E     ++ALE+   +LG D    A+  NNLA  YL  
Sbjct: 1   HPDLAAALNNLALVLRRLGDYDEALELLEKALELAR-ELGEDHPETARALNNLARLYLAL 59

Query: 433 GKYKEAEILYKQVLTRAHE 451
           G Y EA    ++ L     
Sbjct: 60  GDYDEALEYLEKALALREA 78



 Score = 52.8 bits (127), Expect = 5e-09
 Identities = 27/117 (23%), Positives = 38/117 (32%), Gaps = 39/117 (33%)

Query: 415 DSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHEREFGACDGDNKPIWQDRQKNKAK 474
             ++A   NNLA    + G Y EA                                    
Sbjct: 1   HPDLAAALNNLALVLRRLGDYDEAL----------------------------------- 25

Query: 475 NREKYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 531
              +  ++ALE+   +LG D    A+  NNLA  YL  G Y EA    ++ L     
Sbjct: 26  ---ELLEKALELAR-ELGEDHPETARALNNLARLYLALGDYDEALEYLEKALALREA 78



 Score = 47.4 bits (113), Expect = 3e-07
 Identities = 29/114 (25%), Positives = 39/114 (34%), Gaps = 37/114 (32%)

Query: 212 PARLRTLHNLVIQYASQGRYEVAVPLCKQALEDLEKTSGKYEAAETLEDCALRSRKESYD 271
           P     L+NL +     G Y+ A+ L ++ALE   +                        
Sbjct: 2   PDLAAALNNLALVLRRLGDYDEALELLEKALELAREL----------------------- 38

Query: 272 IVKQAKVAQILGSGGHDHPDVATMLNILALVYRDQNKYKEAANLLNDALTIREK 325
                         G DHP+ A  LN LA +Y     Y EA   L  AL +RE 
Sbjct: 39  --------------GEDHPETARALNNLARLYLALGDYDEALEYLEKALALREA 78



 Score = 35.0 bits (81), Expect = 0.009
 Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 2/46 (4%)

Query: 495 DSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHEREFGACDGD 540
             ++A   NNLA    + G Y EA  L ++ L  A  RE G    +
Sbjct: 1   HPDLAAALNNLALVLRRLGDYDEALELLEKALELA--RELGEDHPE 44



 Score = 27.7 bits (62), Expect = 3.6
 Identities = 12/37 (32%), Positives = 19/37 (51%)

Query: 207 AGYEIPARLRTLHNLVIQYASQGRYEVAVPLCKQALE 243
            G + P   R L+NL   Y + G Y+ A+   ++AL 
Sbjct: 38  LGEDHPETARALNNLARLYLALGDYDEALEYLEKALA 74


>gnl|CDD|205553 pfam13374, TPR_10, Tetratricopeptide repeat. 
          Length = 42

 Score = 61.4 bits (150), Expect = 2e-12
 Identities = 23/42 (54%), Positives = 30/42 (71%)

Query: 334 VAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPD 375
            AA+LNNLA     +G+Y+EAE L + AL IR +VLG +HPD
Sbjct: 1   TAASLNNLARALRAQGRYEEAEELLEEALAIRRRVLGPDHPD 42



 Score = 48.3 bits (116), Expect = 6e-08
 Identities = 19/40 (47%), Positives = 24/40 (60%)

Query: 292 VATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENH 331
            A  LN LA   R Q +Y+EA  LL +AL IR + LG +H
Sbjct: 1   TAASLNNLARALRAQGRYEEAEELLEEALAIRRRVLGPDH 40



 Score = 41.8 bits (99), Expect = 2e-05
 Identities = 18/39 (46%), Positives = 21/39 (53%)

Query: 376 VAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPD 414
            A  LNNLA   + Q +YEE E   + AL I    LGPD
Sbjct: 1   TAASLNNLARALRAQGRYEEAEELLEEALAIRRRVLGPD 39



 Score = 38.7 bits (91), Expect = 2e-04
 Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 1/39 (2%)

Query: 418 VAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHEREFGA 456
            A + NNLA     QG+Y+EAE L ++ L     R  G 
Sbjct: 1   TAASLNNLARALRAQGRYEEAEELLEEALAI-RRRVLGP 38



 Score = 38.7 bits (91), Expect = 2e-04
 Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 1/39 (2%)

Query: 498 VAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHEREFGA 536
            A + NNLA     QG+Y+EAE L ++ L     R  G 
Sbjct: 1   TAASLNNLARALRAQGRYEEAEELLEEALAI-RRRVLGP 38



 Score = 32.5 bits (75), Expect = 0.027
 Identities = 13/36 (36%), Positives = 19/36 (52%)

Query: 215 LRTLHNLVIQYASQGRYEVAVPLCKQALEDLEKTSG 250
             +L+NL     +QGRYE A  L ++AL    +  G
Sbjct: 2   AASLNNLARALRAQGRYEEAEELLEEALAIRRRVLG 37


>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
           34 amino acids
           [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
           X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
           in a variety of organisms including bacteria,
           cyanobacteria, yeast, fungi, plants, and humans in
           various subcellular locations; involved in a variety of
           functions including protein-protein interactions, but
           common features in the interaction partners have not
           been defined; involved in chaperone, cell-cycle,
           transciption, and protein transport complexes; the
           number of TPR motifs varies among proteins (1,3-11,13
           15,16,19); 5-6 tandem repeats generate a right-handed
           helical structure with an amphipathic channel that is
           thought to accomodate an alpha-helix of a target
           protein; it has been proposed that TPR proteins
           preferably interact with WD-40 repeat proteins, but in
           many instances several TPR-proteins seem to aggregate to
           multi-protein complexes; examples of TPR-proteins
           include, Cdc16p, Cdc23p and Cdc27p components of the
           cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
           targeting signals, the Tom70p co-receptor for
           mitochondrial targeting signals, Ser/Thr phosphatase 5C
           and the p110 subunit of O-GlcNAc transferase; three
           copies of the repeat are present here.
          Length = 100

 Score = 60.1 bits (146), Expect = 2e-11
 Identities = 36/110 (32%), Positives = 45/110 (40%), Gaps = 16/110 (14%)

Query: 299 LALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLC 358
           L  +Y     Y EA      AL +            A    NLA  Y K GKY+EA    
Sbjct: 6   LGNLYYKLGDYDEALEYYEKALELDPDN--------ADAYYNLAAAYYKLGKYEEALEDY 57

Query: 359 KRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYE 408
           ++ALE+         PD AK   NL L      KYEE    Y++ALE+  
Sbjct: 58  EKALELD--------PDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDP 99



 Score = 47.4 bits (113), Expect = 7e-07
 Identities = 41/149 (27%), Positives = 57/149 (38%), Gaps = 54/149 (36%)

Query: 378 KQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKE 437
           + L NL  L      Y+E   YY++ALE+        D + A    NLA+ Y K GKY+E
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEKALEL--------DPDNADAYYNLAAAYYKLGKYEE 52

Query: 438 AEILYKQVLTRAHEREFGACDGDNKPIWQDRQKNKAKNREKYYQRALEIYELKLGPDDSN 497
           A                                       + Y++ALE+        D +
Sbjct: 53  AL--------------------------------------EDYEKALEL--------DPD 66

Query: 498 VAKTKNNLASCYLKQGKYKEAEILYKQVL 526
            AK   NL   Y K GKY+EA   Y++ L
Sbjct: 67  NAKAYYNLGLAYYKLGKYEEALEAYEKAL 95



 Score = 41.2 bits (97), Expect = 1e-04
 Identities = 25/78 (32%), Positives = 32/78 (41%), Gaps = 8/78 (10%)

Query: 290 PDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRG 349
           PD A     LA  Y    KY+EA      AL +            A    NL + Y K G
Sbjct: 31  PDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPD--------NAKAYYNLGLAYYKLG 82

Query: 350 KYKEAEPLCKRALEIREK 367
           KY+EA    ++ALE+   
Sbjct: 83  KYEEALEAYEKALELDPN 100


>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
           secretion / Intracellular trafficking and secretion].
          Length = 250

 Score = 49.3 bits (118), Expect = 1e-06
 Identities = 42/149 (28%), Positives = 58/149 (38%), Gaps = 24/149 (16%)

Query: 299 LALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLC 358
           LAL Y  Q  Y +A   L  AL         +           A  Y K G+   A+   
Sbjct: 41  LALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLV--------RAHYYQKLGENDLADESY 92

Query: 359 KRALEIREKVLGKEHPDVAKQLNNL-ALLCQNQSKYEEVERYYQRALEIYELKLGPDDSN 417
           ++AL           P+    LNN  A LC  Q + EE  + ++RAL        P    
Sbjct: 93  RKAL--------SLAPNNGDVLNNYGAFLCA-QGRPEEAMQQFERALA------DPAYGE 137

Query: 418 VAKTKNNLASCYLKQGKYKEAEILYKQVL 446
            + T  NL  C LK G++ +AE   K+ L
Sbjct: 138 PSDTLENLGLCALKAGQFDQAEEYLKRAL 166



 Score = 33.2 bits (76), Expect = 0.27
 Identities = 40/198 (20%), Positives = 67/198 (33%), Gaps = 60/198 (30%)

Query: 329 ENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQ 388
            +    A     LA+ Y ++G Y +A+   ++ALE          P         A   Q
Sbjct: 29  TDRNEAAKARLQLALGYLQQGDYAQAKKNLEKALEH--------DPSYYLAHLVRAHYYQ 80

Query: 389 NQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTR 448
              + +  +  Y++AL      L P++ +V    NN  +    QG+ +EA          
Sbjct: 81  KLGENDLADESYRKAL-----SLAPNNGDVL---NNYGAFLCAQGRPEEAM--------- 123

Query: 449 AHEREFGACDGDNKPIWQDRQKNKAKNREKYYQRALEIYELKLGPDDSNVAKTKNNLASC 508
                                        + ++RAL        P     + T  NL  C
Sbjct: 124 -----------------------------QQFERALA------DPAYGEPSDTLENLGLC 148

Query: 509 YLKQGKYKEAEILYKQVL 526
            LK G++ +AE   K+ L
Sbjct: 149 ALKAGQFDQAEEYLKRAL 166


>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
           PilW.  Members of this family are designated PilF in ref
           (PMID:8973346) and PilW in ref (PMID:15612916). This
           outer membrane protein is required both for pilus
           stability and for pilus function such as adherence to
           human cells. Members of this family contain copies of
           the TPR (tetratricopeptide repeat) domain.
          Length = 234

 Score = 49.3 bits (118), Expect = 1e-06
 Identities = 48/202 (23%), Positives = 84/202 (41%), Gaps = 28/202 (13%)

Query: 329 ENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQ 388
            +    A     LA+ Y ++G  + A+    +ALE          PD       LAL  Q
Sbjct: 25  TDRNKAAKIRVQLALGYLEQGDLEVAKENLDKALEHD--------PDDYLAYLALALYYQ 76

Query: 389 NQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQ-VLT 447
              + E+ E  ++RAL      L P++ +V    NN  +   +QGKY++A   ++Q +  
Sbjct: 77  QLGELEKAEDSFRRAL-----TLNPNNGDV---LNNYGTFLCQQGKYEQAMQQFEQAIED 128

Query: 448 RAHEREFGACDGDNKPIWQDRQKNKAKNREKYYQRALEIYELKLGPDDSNVAKTKNNLAS 507
             + +   + +  N  +   +  +  K  EKY  RAL+I        D    ++   LA 
Sbjct: 129 PLYPQPARSLE--NAGLCALKAGDFDK-AEKYLTRALQI--------DPQRPESLLELAE 177

Query: 508 CYLKQGKYKEAEILYKQVLTRA 529
            Y  +G+YK+A    ++     
Sbjct: 178 LYYLRGQYKDARAYLERYQQTY 199



 Score = 43.5 bits (103), Expect = 1e-04
 Identities = 38/160 (23%), Positives = 65/160 (40%), Gaps = 22/160 (13%)

Query: 290 PDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRG 349
           PD       LAL Y+   + ++A +    ALT+               LNN      ++G
Sbjct: 62  PDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNN--------GDVLNNYGTFLCQQG 113

Query: 350 KYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYEL 409
           KY++A    ++A+E         +P  A+ L N  L       +++ E+Y  RAL+I   
Sbjct: 114 KYEQAMQQFEQAIE------DPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQI--- 164

Query: 410 KLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRA 449
                D    ++   LA  Y  +G+YK+A    ++     
Sbjct: 165 -----DPQRPESLLELAELYYLRGQYKDARAYLERYQQTY 199


>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat. 
          Length = 69

 Score = 43.1 bits (102), Expect = 1e-05
 Identities = 25/73 (34%), Positives = 33/73 (45%), Gaps = 9/73 (12%)

Query: 335 AATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLAL-LCQNQSKY 393
           A  L NL     K G Y EA    ++ALE+         PD A+   NLAL   +    Y
Sbjct: 3   AEALKNLGNALFKLGDYDEAIEAYEKALEL--------DPDNAEAYYNLALAYLKLGKDY 54

Query: 394 EEVERYYQRALEI 406
           EE     ++ALE+
Sbjct: 55  EEALEDLEKALEL 67



 Score = 41.5 bits (98), Expect = 3e-05
 Identities = 25/73 (34%), Positives = 35/73 (47%), Gaps = 9/73 (12%)

Query: 375 DVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGK 434
           D A+ L NL         Y+E    Y++ALE+        D + A+   NLA  YLK GK
Sbjct: 1   DNAEALKNLGNALFKLGDYDEAIEAYEKALEL--------DPDNAEAYYNLALAYLKLGK 52

Query: 435 -YKEAEILYKQVL 446
            Y+EA    ++ L
Sbjct: 53  DYEEALEDLEKAL 65



 Score = 38.8 bits (91), Expect = 4e-04
 Identities = 23/77 (29%), Positives = 29/77 (37%), Gaps = 9/77 (11%)

Query: 291 DVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGK 350
           D A  L  L         Y EA      AL +            A    NLA+ Y K GK
Sbjct: 1   DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDN--------AEAYYNLALAYLKLGK 52

Query: 351 -YKEAEPLCKRALEIRE 366
            Y+EA    ++ALE+  
Sbjct: 53  DYEEALEDLEKALELDP 69



 Score = 33.4 bits (77), Expect = 0.029
 Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 6/54 (11%)

Query: 476 REKYYQRALEIYE--LKLGPDDSNVAKTKNNLASCYLKQGK-YKEAEILYKQVL 526
           +   Y  A+E YE  L+L PD+   A+   NLA  YLK GK Y+EA    ++ L
Sbjct: 15  KLGDYDEAIEAYEKALELDPDN---AEAYYNLALAYLKLGKDYEEALEDLEKAL 65



 Score = 29.2 bits (66), Expect = 0.75
 Identities = 10/30 (33%), Positives = 15/30 (50%)

Query: 497 NVAKTKNNLASCYLKQGKYKEAEILYKQVL 526
           + A+   NL +   K G Y EA   Y++ L
Sbjct: 1   DNAEALKNLGNALFKLGDYDEAIEAYEKAL 30


>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
           lipoprotein.  This protein family occurs in strictly
           within a subset of Gram-negative bacterial species with
           the proposed PEP-CTERM/exosortase system, analogous to
           the LPXTG/sortase system common in Gram-positive
           bacteria. This protein occurs in a species if and only
           if a transmembrane histidine kinase (TIGR02916) and a
           DNA-binding response regulator (TIGR02915) also occur.
           The present of tetratricopeptide repeats (TPR) suggests
           protein-protein interaction, possibly for the regulation
           of PEP-CTERM protein expression, since many PEP-CTERM
           proteins in these genomes are preceded by a proposed DNA
           binding site for the response regulator.
          Length = 899

 Score = 46.2 bits (110), Expect = 4e-05
 Identities = 50/223 (22%), Positives = 82/223 (36%), Gaps = 36/223 (16%)

Query: 225 YASQGRYEVAVPLCKQALEDLEKTSGKYEAAETLEDCALRSRKESYDIVKQAKVAQILGS 284
           Y  Q  Y  A+   ++AL+    +S     A  L    L S          A+  + L +
Sbjct: 713 YLRQKDYPAAIQAYRKALK-RAPSSQ---NAIKLHRALLASGN-------TAEAVKTLEA 761

Query: 285 GGHDHPDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVL 344
               HP+ A +   LA +Y  Q  Y +A       +              A  LNNLA L
Sbjct: 762 WLKTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAPDN--------AVVLNNLAWL 813

Query: 345 YGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRAL 404
           Y +  K   A    +RAL++         P++   L+ L  L   + + +      ++A+
Sbjct: 814 YLEL-KDPRALEYAERALKLA--------PNIPAILDTLGWLLVEKGEADRALPLLRKAV 864

Query: 405 EIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLT 447
            I      P+    A  + +LA   L  G+  EA     ++L 
Sbjct: 865 NI-----APEA---AAIRYHLALALLATGRKAEARKELDKLLN 899



 Score = 35.8 bits (83), Expect = 0.059
 Identities = 57/275 (20%), Positives = 91/275 (33%), Gaps = 70/275 (25%)

Query: 290 PDVATMLNILALVYRDQNKYKEAANLLNDALTIRE------------------------- 324
           PD A  L +LA  Y     Y +A   L  AL ++                          
Sbjct: 632 PDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTESAKKI 691

Query: 325 -KTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNL 383
            K+L + H   A        LY ++  Y  A    ++AL+             A +L+  
Sbjct: 692 AKSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKR------APSSQNAIKLHRA 745

Query: 384 ALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYK 443
            L   N ++  +    + +              N A  +  LA  YL Q  Y +A   Y+
Sbjct: 746 LLASGNTAEAVKTLEAWLK-----------THPNDAVLRTALAELYLAQKDYDKAIKHYQ 794

Query: 444 QVLTRAHEREFGACDGDNKPI-----WQDRQKNKAKNREKYYQRALEIYE--LKLGPDDS 496
            V+ +A          DN  +     W   +            RALE  E  LKL P+  
Sbjct: 795 TVVKKAP---------DNAVVLNNLAWLYLELKDP--------RALEYAERALKLAPNIP 837

Query: 497 NVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 531
            +  T   L    +++G+   A  L ++ +  A E
Sbjct: 838 AILDT---LGWLLVEKGEADRALPLLRKAVNIAPE 869



 Score = 34.3 bits (79), Expect = 0.20
 Identities = 33/154 (21%), Positives = 51/154 (33%), Gaps = 28/154 (18%)

Query: 288 DHPDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGK 347
           D    A +L +  L Y    + + A      AL I  ++L             LA L   
Sbjct: 120 DDEGAAELLALRGLAYLGLGQLELAQKSYEQALAIDPRSL--------YAKLGLAQLALA 171

Query: 348 RGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIY 407
             ++ EA  L    L           P     L     L  +    E        AL  Y
Sbjct: 172 ENRFDEARALIDEVLTA--------DPGNVDALLLKGDLLLSLGNIEL-------ALAAY 216

Query: 408 E--LKLGPDDSNVAKTKNNLASCYLKQGKYKEAE 439
              + L P++  V      LA+  ++ G+++EAE
Sbjct: 217 RKAIALRPNNIAV---LLALATILIEAGEFEEAE 247



 Score = 33.9 bits (78), Expect = 0.25
 Identities = 57/269 (21%), Positives = 87/269 (32%), Gaps = 59/269 (21%)

Query: 289 HPDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKR 348
           +P        LA  Y  + + K+A  +LN+A         +           L       
Sbjct: 563 NPQEIEPALALAQYYLGKGQLKKALAILNEAA--------DAAPDSPEAWLMLGRAQLAA 614

Query: 349 GKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYE 408
           G   +A    K+ L +         PD A  L  LA        Y +     +RALE+  
Sbjct: 615 GDLNKAVSSFKKLLAL--------QPDSALALLLLADAYAVMKNYAKAITSLKRALELKP 666

Query: 409 -----------LKLGPDDSNVAKT------KNN---------LASCYLKQGKYKEAEILY 442
                      L L    +  AK       K +             YL+Q  Y  A   Y
Sbjct: 667 DNTEAQIGLAQLLLAAKRTESAKKIAKSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAY 726

Query: 443 KQVLTRAHEREFGACDGDNKPIWQDRQKNKAKNREKYYQRALEIYE--LKLGPDDSNVAK 500
           ++    A +R   + +            N A+        A++  E  LK  P+D   A 
Sbjct: 727 RK----ALKRAPSSQNAIKLHRALLASGNTAE--------AVKTLEAWLKTHPND---AV 771

Query: 501 TKNNLASCYLKQGKYKEAEILYKQVLTRA 529
            +  LA  YL Q  Y +A   Y+ V+ +A
Sbjct: 772 LRTALAELYLAQKDYDKAIKHYQTVVKKA 800



 Score = 32.0 bits (73), Expect = 1.1
 Identities = 38/151 (25%), Positives = 57/151 (37%), Gaps = 29/151 (19%)

Query: 300 ALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLCK 359
           A  Y  +NKYK A   L +AL                    L  +Y   G Y  AE   +
Sbjct: 29  AKSYLQKNKYKAAIIQLKNALQKDPNDAEARFL--------LGKIYLALGDYAAAEKELR 80

Query: 360 RALEI---REKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDS 416
           +AL +   + +VL    P +A+     A L Q +         +Q+ L+    K   DD 
Sbjct: 81  KALSLGYPKNQVL----PLLAR-----AYLLQGK---------FQQVLDELPGKTLLDDE 122

Query: 417 NVAKTKNNLASCYLKQGKYKEAEILYKQVLT 447
             A+        YL  G+ + A+  Y+Q L 
Sbjct: 123 GAAELLALRGLAYLGLGQLELAQKSYEQALA 153



 Score = 28.9 bits (65), Expect = 9.0
 Identities = 21/91 (23%), Positives = 39/91 (42%), Gaps = 9/91 (9%)

Query: 359 KRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNV 418
            +AL   +K L K+ PD A   N L  +   +    +    +++AL I        + + 
Sbjct: 448 DKALAAAKK-LEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSI--------EPDF 498

Query: 419 AKTKNNLASCYLKQGKYKEAEILYKQVLTRA 449
                NLA   +++G   +A   +++VLT  
Sbjct: 499 FPAAANLARIDIQEGNPDDAIQRFEKVLTID 529


>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
          Length = 291

 Score = 42.9 bits (99), Expect = 3e-04
 Identities = 56/239 (23%), Positives = 81/239 (33%), Gaps = 10/239 (4%)

Query: 295 MLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNL--AVLYGKRGKYK 352
           +L  LA++     K    A  L +A     + LGE   A+      L         G   
Sbjct: 4   LLLALAILLEALAKLLAEALALLEAGLALLELLGELAEALELLEEALELLPNSDLAGLLL 63

Query: 353 EAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLG 412
                  +   + E +   E     + L NLA    N     E    Y+ ALE+ E  L 
Sbjct: 64  LLALALLKLGRLEEALELLEKALELELLPNLAEALLNLGLLLEALGKYEEALELLEKALA 123

Query: 413 PDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHEREFGACDGDNKPIWQDRQKNK 472
            D                + G Y+EA  LY++ L    E    A                
Sbjct: 124 LDPDPDLAEALLALGALYELGDYEEALELYEKALELDPELNELA--------EALLALGA 175

Query: 473 AKNREKYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 531
                  Y+ ALE+ E  L  +  + A+   NL   YLK GKY+EA   Y++ L    +
Sbjct: 176 LLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPD 234



 Score = 39.8 bits (91), Expect = 0.002
 Identities = 57/274 (20%), Positives = 94/274 (34%), Gaps = 16/274 (5%)

Query: 253 EAAETLEDCALRSRKESYDIVKQAKVAQILGSGGHDHPDVATMLNILALVYRDQNKYKEA 312
            A   L +   +   E+  +++       L     +  ++      L             
Sbjct: 6   LALAILLEALAKLLAEALALLEAGLALLELLGELAEALELLEEALELLPNSDLAGLLLLL 65

Query: 313 ANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKE 372
           A  L     + E       A     L NLA      G   EA    + ALE+ EK L  +
Sbjct: 66  ALALLKLGRLEEALELLEKALELELLPNLAEALLNLGLLLEALGKYEEALELLEKALALD 125

Query: 373 HPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQ 432
                 +             YEE    Y++ALE+      P+ + +A+    L +     
Sbjct: 126 PDPDLAEALLALGALYELGDYEEALELYEKALELD-----PELNELAEALLALGALLEAL 180

Query: 433 GKYKEAEILYKQVLTRAHEREFGACDGDNKPIWQDRQKNKAKNREKYYQRALEIYELKLG 492
           G+Y+EA  L ++ L    + +  A                   +   Y+ ALE YE K  
Sbjct: 181 GRYEEALELLEKALKLNPDDDAEA----------LLNLGLLYLKLGKYEEALEYYE-KAL 229

Query: 493 PDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVL 526
             D + A+   NLA   L+ G+Y+EA    ++ L
Sbjct: 230 ELDPDNAEALYNLALLLLELGRYEEALEALEKAL 263



 Score = 34.8 bits (78), Expect = 0.082
 Identities = 55/211 (26%), Positives = 87/211 (41%), Gaps = 21/211 (9%)

Query: 202 AQQVNAGYEIPARLRTLHNLVIQYASQGRYEVAVPLCKQALEDLEKTSGKYEAAETLEDC 261
            ++      +P     L NL +   + G+YE A+ L ++AL                   
Sbjct: 82  LEKALELELLPNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALY 141

Query: 262 ALRSRKESYDIVKQAKVAQILGSGGHDHPDVATMLNILALVYRDQNKYKEAANLLNDALT 321
            L   +E+ ++ ++A           +  ++A  L  L  +     +Y+EA  LL  AL 
Sbjct: 142 ELGDYEEALELYEKALEL------DPELNELAEALLALGALLEALGRYEEALELLEKALK 195

Query: 322 IREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLN 381
           +       N    A  L NL +LY K GKY+EA    ++ALE+         PD A+ L 
Sbjct: 196 L-------NPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELD--------PDNAEALY 240

Query: 382 NLALLCQNQSKYEEVERYYQRALEIYELKLG 412
           NLALL     +YEE     ++ALE+      
Sbjct: 241 NLALLLLELGRYEEALEALEKALELDPDLYN 271



 Score = 33.7 bits (75), Expect = 0.22
 Identities = 49/225 (21%), Positives = 82/225 (36%), Gaps = 24/225 (10%)

Query: 314 NLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEH 373
             L  AL I  + L +  A   A L     L    G+  EA  L + ALE+         
Sbjct: 2   KDLLLALAILLEALAKLLAEALALLEAGLALLELLGELAEALELLEEALEL--------- 52

Query: 374 PDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQG 433
                  +   LL        ++ R  +    + +        N+A+   NL       G
Sbjct: 53  ---LPNSDLAGLLLLLALALLKLGRLEEALELLEKALELELLPNLAEALLNLGLLLEALG 109

Query: 434 KYKEAEILYKQVLTRAHEREFGACDGDNKPIWQDRQKNKAKNREKYYQRALEIYE--LKL 491
           KY+EA  L ++ L    + +            +      A      Y+ ALE+YE  L+L
Sbjct: 110 KYEEALELLEKALALDPDPDLA----------EALLALGALYELGDYEEALELYEKALEL 159

Query: 492 GPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHEREFGA 536
            P+ + +A+    L +     G+Y+EA  L ++ L    + +  A
Sbjct: 160 DPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEA 204


>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat. 
          Length = 34

 Score = 36.6 bits (86), Expect = 7e-04
 Identities = 14/33 (42%), Positives = 17/33 (51%)

Query: 335 AATLNNLAVLYGKRGKYKEAEPLCKRALEIREK 367
           A  L NL   Y K GKY EA    ++ALE+   
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33



 Score = 35.1 bits (82), Expect = 0.003
 Identities = 13/32 (40%), Positives = 17/32 (53%)

Query: 377 AKQLNNLALLCQNQSKYEEVERYYQRALEIYE 408
           AK L NL        KY+E   YY++ALE+  
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNP 32



 Score = 32.8 bits (76), Expect = 0.018
 Identities = 14/33 (42%), Positives = 17/33 (51%)

Query: 419 AKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 451
           AK   NL + YLK GKY EA   Y++ L     
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33



 Score = 32.8 bits (76), Expect = 0.018
 Identities = 14/33 (42%), Positives = 17/33 (51%)

Query: 499 AKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 531
           AK   NL + YLK GKY EA   Y++ L     
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33



 Score = 25.5 bits (57), Expect = 7.3
 Identities = 10/33 (30%), Positives = 11/33 (33%)

Query: 293 ATMLNILALVYRDQNKYKEAANLLNDALTIREK 325
           A  L  L   Y    KY EA      AL +   
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33


>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat. 
          Length = 65

 Score = 36.5 bits (85), Expect = 0.002
 Identities = 16/67 (23%), Positives = 22/67 (32%), Gaps = 8/67 (11%)

Query: 341 LAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYY 400
           LA    + G Y EA    + AL          +P  A+ L  L      Q +  E     
Sbjct: 3   LARAALRAGDYDEALAALEAAL--------ARYPLAAEALLLLGEALLRQGRLAEAAALL 54

Query: 401 QRALEIY 407
           + AL   
Sbjct: 55  RAALAAD 61



 Score = 33.4 bits (77), Expect = 0.029
 Identities = 17/67 (25%), Positives = 22/67 (32%), Gaps = 8/67 (11%)

Query: 299 LALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLC 358
           LA        Y EA   L  AL          +   A  L  L     ++G+  EA  L 
Sbjct: 3   LARAALRAGDYDEALAALEAAL--------ARYPLAAEALLLLGEALLRQGRLAEAAALL 54

Query: 359 KRALEIR 365
           + AL   
Sbjct: 55  RAALAAD 61



 Score = 32.3 bits (74), Expect = 0.065
 Identities = 14/52 (26%), Positives = 18/52 (34%), Gaps = 5/52 (9%)

Query: 400 YQRALEIYE--LKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRA 449
           Y  AL   E  L   P  +        L    L+QG+  EA  L +  L   
Sbjct: 13  YDEALAALEAALARYPLAAEAL---LLLGEALLRQGRLAEAAALLRAALAAD 61



 Score = 32.3 bits (74), Expect = 0.065
 Identities = 14/52 (26%), Positives = 18/52 (34%), Gaps = 5/52 (9%)

Query: 480 YQRALEIYE--LKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRA 529
           Y  AL   E  L   P  +        L    L+QG+  EA  L +  L   
Sbjct: 13  YDEALAALEAALARYPLAAEAL---LLLGEALLRQGRLAEAAALLRAALAAD 61



 Score = 27.6 bits (62), Expect = 3.1
 Identities = 12/36 (33%), Positives = 15/36 (41%)

Query: 288 DHPDVATMLNILALVYRDQNKYKEAANLLNDALTIR 323
            +P  A  L +L      Q +  EAA LL  AL   
Sbjct: 26  RYPLAAEALLLLGEALLRQGRLAEAAALLRAALAAD 61


>gnl|CDD|221958 pfam13176, TPR_7, Tetratricopeptide repeat. 
          Length = 36

 Score = 35.3 bits (82), Expect = 0.003
 Identities = 14/36 (38%), Positives = 21/36 (58%)

Query: 337 TLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKE 372
            L+NL  LY K G Y++A  L +RAL + +    +E
Sbjct: 1   ALSNLGRLYRKLGDYEKAISLYERALALAKDPEDRE 36



 Score = 30.3 bits (69), Expect = 0.17
 Identities = 10/35 (28%), Positives = 17/35 (48%)

Query: 380 LNNLALLCQNQSKYEEVERYYQRALEIYELKLGPD 414
           L+NL  L +    YE+    Y+RAL + +     +
Sbjct: 2   LSNLGRLYRKLGDYEKAISLYERALALAKDPEDRE 36



 Score = 29.9 bits (68), Expect = 0.21
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 423 NNLASCYLKQGKYKEAEILYKQVLTRAHERE 453
           +NL   Y K G Y++A  LY++ L  A + E
Sbjct: 3   SNLGRLYRKLGDYEKAISLYERALALAKDPE 33



 Score = 29.9 bits (68), Expect = 0.21
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 503 NNLASCYLKQGKYKEAEILYKQVLTRAHERE 533
           +NL   Y K G Y++A  LY++ L  A + E
Sbjct: 3   SNLGRLYRKLGDYEKAISLYERALALAKDPE 33



 Score = 27.2 bits (61), Expect = 1.9
 Identities = 9/35 (25%), Positives = 16/35 (45%)

Query: 296 LNILALVYRDQNKYKEAANLLNDALTIREKTLGEN 330
           L+ L  +YR    Y++A +L   AL + +      
Sbjct: 2   LSNLGRLYRKLGDYEKAISLYERALALAKDPEDRE 36


>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats.  Repeats present in 4
           or more copies in proteins. Contain a minimum of 34
           amino acids each and self-associate via a "knobs and
           holes" mechanism.
          Length = 34

 Score = 35.1 bits (82), Expect = 0.003
 Identities = 13/33 (39%), Positives = 16/33 (48%)

Query: 335 AATLNNLAVLYGKRGKYKEAEPLCKRALEIREK 367
           A  L NL   Y K G Y EA    ++ALE+   
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33



 Score = 33.6 bits (78), Expect = 0.011
 Identities = 12/33 (36%), Positives = 16/33 (48%)

Query: 419 AKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 451
           A+   NL + YLK G Y EA   Y++ L     
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33



 Score = 33.6 bits (78), Expect = 0.011
 Identities = 12/33 (36%), Positives = 16/33 (48%)

Query: 499 AKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 531
           A+   NL + YLK G Y EA   Y++ L     
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33



 Score = 32.0 bits (74), Expect = 0.036
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query: 377 AKQLNNLALLCQNQSKYEEVERYYQRALEI 406
           A+ L NL         Y+E   YY++ALE+
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALEL 30



 Score = 25.5 bits (57), Expect = 8.8
 Identities = 9/33 (27%), Positives = 10/33 (30%)

Query: 293 ATMLNILALVYRDQNKYKEAANLLNDALTIREK 325
           A  L  L   Y     Y EA      AL +   
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 37.3 bits (87), Expect = 0.009
 Identities = 12/54 (22%), Positives = 29/54 (53%)

Query: 93  LQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
           L  ++ E  +L+ ++  L ++ A L+ E    +Q+L + E    +L++E   ++
Sbjct: 68  LPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIK 121



 Score = 33.8 bits (78), Expect = 0.12
 Identities = 24/125 (19%), Positives = 57/125 (45%), Gaps = 18/125 (14%)

Query: 29  TNTKTVQQGLEALKNEHTSILNSLSETQHGGPVEEEKRNMVKNSLEIIELGLNEAQ-VMT 87
           ++T + ++ L  L+ E   +              +E+   ++  L  ++    E +  ++
Sbjct: 59  SDTPSARERLPELQQELAEL--------------QEELAELQEQLAELQQENQELKQELS 104

Query: 88  ALAGHLQNVEAEKQKLRT---QVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKH 144
            L   L+ ++ E  +++        L +EN  LR+ELA  +Q+ ++ E  N +L+E ++ 
Sbjct: 105 TLEAELERLQKELARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQENEQR 164

Query: 145 LEFMS 149
             F+ 
Sbjct: 165 RWFLY 169


>gnl|CDD|218602 pfam05478, Prominin, Prominin.  The prominins are an emerging
           family of proteins that among the multispan membrane
           proteins display a novel topology. Mouse prominin and
           human prominin (mouse)-like 1 (PROML1) are predicted to
           contain five membrane spanning domains, with an
           N-terminal domain exposed to the extracellular space
           followed by four, alternating small cytoplasmic and
           large extracellular, loops and a cytoplasmic C-terminal
           domain. The exact function of prominin is unknown
           although in humans defects in PROM1, the gene coding for
           prominin, cause retinal degeneration.
          Length = 807

 Score = 36.5 bits (85), Expect = 0.037
 Identities = 38/195 (19%), Positives = 72/195 (36%), Gaps = 38/195 (19%)

Query: 34  VQQGLEALKNEHTSILNSLSETQHGGPVEEEKRNMVKNSLEIIELGLNEAQVMTALAGHL 93
           + Q  +  +++ +  L+   E   GG + +E ++ V   LE I   +   QV+  +   L
Sbjct: 201 LAQNYQETQDQVSKDLDGAGE-HLGGRIHDELKSHVLPVLEAI---VELGQVLQEMKDGL 256

Query: 94  QNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLEFMSSIRK 153
           ++V    ++LR    +L      +R  L N+  KL  ++   S           +SS+  
Sbjct: 257 ESVNKTLKELREGGSQLRDGLRGVRRRLNNSLNKLCLTQMCAS----------ILSSLSI 306

Query: 154 YDSDIQGDENSGDQKSDKHKQDDPVVDLFPDEDNDDRNNLSPTPPSQFAQQVNAGY---- 209
              D        D                PD   +   NL     + F+  V  G     
Sbjct: 307 LALDA-------DFNQ------------LPDVT-EILENLEEVVKADFSSIVQEGNSTFE 346

Query: 210 EIPARLRTLHNLVIQ 224
           EIP++++   + V+ 
Sbjct: 347 EIPSKVKNQTSSVVP 361


>gnl|CDD|221837 pfam12895, Apc3, Anaphase-promoting complex, cyclosome, subunit 3. 
           Apc3, otherwise known as Cdc27, is one of the subunits
           of the anaphase-promoting complex or cyclosome. The
           anaphase-promoting complex is a multiprotein subunit E3
           ubiquitin ligase complex that controls segregation of
           chromosomes and exit from mitosis in eukaryotes. The
           protein members of this family contain TPR repeats just
           as those of Apc7 do, and it appears that these TPR units
           bind the C-termini of the APC co-activators CDH1 and
           CDC20.
          Length = 80

 Score = 33.0 bits (76), Expect = 0.049
 Identities = 28/99 (28%), Positives = 40/99 (40%), Gaps = 19/99 (19%)

Query: 347 KRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEI 406
            +G Y+ A  L        EK+L     +    L       Q Q         Y+RA E+
Sbjct: 1   DQGNYENAIFLA-------EKLLALTPSNEDAYLLAQCYFLQGQ---------YKRAYEL 44

Query: 407 YELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQV 445
              KL  D+S+  +    LA C LK GKY EA  + ++ 
Sbjct: 45  LR-KLKLDNSSGCRYL--LAQCLLKLGKYDEAIAVLEKA 80



 Score = 28.0 bits (63), Expect = 2.5
 Identities = 18/50 (36%), Positives = 27/50 (54%), Gaps = 3/50 (6%)

Query: 476 REKYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQV 525
            +  Y+RA E+   KL  D+S+  +    LA C LK GKY EA  + ++ 
Sbjct: 34  LQGQYKRAYELLR-KLKLDNSSGCRYL--LAQCLLKLGKYDEAIAVLEKA 80


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 35.8 bits (83), Expect = 0.051
 Identities = 26/126 (20%), Positives = 45/126 (35%), Gaps = 22/126 (17%)

Query: 71  NSLEIIELGLNEAQVMTAL-----AGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQ 125
           +S   IE   +              G     E+ K  LR Q+  ++ +   L  ELA  Q
Sbjct: 105 DSQAAIEFPDDLLSAEDPAVPELIKGQQSLFESRKSTLRAQLELILAQIKQLEAELAGLQ 164

Query: 126 QKLQSSEQ----ANSQLE-----EEKK------HLEFMSSIRKYDSDIQGDENSGDQKSD 170
            +LQ+  Q     + +LE     +EK        LE      +   ++   E   + +  
Sbjct: 165 AQLQALRQQLEVISEELEARRKLKEKGLVSRLELLELERERAEAQGELGRLE--AELEVL 222

Query: 171 KHKQDD 176
           K + D+
Sbjct: 223 KRQIDE 228


>gnl|CDD|227343 COG5010, TadD, Flp pilus assembly protein TadD, contains TPR
           repeats [Intracellular trafficking and secretion].
          Length = 257

 Score = 34.7 bits (80), Expect = 0.082
 Identities = 30/165 (18%), Positives = 48/165 (29%), Gaps = 30/165 (18%)

Query: 372 EHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLK 431
            +P   + L            + E     ++A       L P D    +  N L +   +
Sbjct: 95  AYPKDRELLAAQGKNQIRNGNFGEAVSVLRKAAR-----LAPTD---WEAWNLLGAALDQ 146

Query: 432 QGKYKEAEILYKQVLTRAHEREFGACDGDNKP-----IWQDRQKNKAKNREKYYQRALEI 486
            G++ EA   Y+Q L  A          +        +    + +           A   
Sbjct: 147 LGRFDEARRAYRQALELA--------PNEPSIANNLGMSLLLRGDLEDAE-TLLLPAYL- 196

Query: 487 YELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 531
                   DS V     NLA     QG ++EAE +  Q L     
Sbjct: 197 ----SPAADSRVR---QNLALVVGLQGDFREAEDIAVQELLSEQA 234



 Score = 33.2 bits (76), Expect = 0.26
 Identities = 24/125 (19%), Positives = 43/125 (34%), Gaps = 16/125 (12%)

Query: 317 NDALTIREKTLGENHAAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDV 376
            D+     +     +      L        + G + EA  + ++A  +         P  
Sbjct: 82  ADSSLAVLQKSAIAYPKDRELLAAQGKNQIRNGNFGEAVSVLRKAARLA--------PTD 133

Query: 377 AKQLNNLALLCQNQSKYEEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYK 436
            +  N L        +++E  R Y++ALE+          N     NNL    L +G  +
Sbjct: 134 WEAWNLLGAALDQLGRFDEARRAYRQALEL--------APNEPSIANNLGMSLLLRGDLE 185

Query: 437 EAEIL 441
           +AE L
Sbjct: 186 DAETL 190


>gnl|CDD|221232 pfam11802, CENP-K, Centromere-associated protein K.  CENP-K is one
           of seven new CENP-A-nucleosome distal (CAD) centromere
           components (the others being CENP-L, CENP-O, CENP-P,
           CENP-Q, CENP-R and CENP-S) that are identified as
           assembling on the CENP-A nucleosome associated complex,
           NAC. The CENP-A NAC is essential, as disruption of the
           complex causes errors of chromosome alignment and
           segregation that preclude cell survival despite
           continued centromere-derived mitotic checkpoint
           signalling. CENP-K is centromere-associated through its
           interaction with one or more components of the CENP-A
           NAC.
          Length = 273

 Score = 34.9 bits (80), Expect = 0.095
 Identities = 24/128 (18%), Positives = 54/128 (42%), Gaps = 12/128 (9%)

Query: 23  NQEEIMTNTKTVQQGLEALKNEHTSILNSLSETQH----GGPVEEEKRNMVKNSLEIIEL 78
            ++++     +VQ   E LK +       L E Q         + E +N V         
Sbjct: 102 LRQDLEMVLSSVQSKNEKLKEDLEREQQWLDEQQQILDSLNERQSELKNQVVT------- 154

Query: 79  GLNEAQVMTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQL 138
             +E+++   L   +  ++  K+KL + +   ++E+  L +  A+ ++K +  ++ + QL
Sbjct: 155 -FSESRIFQELKTKMLRIKEYKEKLLSALGEFLEEHFPLPEAQASIKKKRKGIQEPSVQL 213

Query: 139 EEEKKHLE 146
               + LE
Sbjct: 214 ITLHEILE 221


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 35.0 bits (81), Expect = 0.13
 Identities = 24/136 (17%), Positives = 54/136 (39%), Gaps = 1/136 (0%)

Query: 31  TKTVQQGLEALKNEHTSILNSLSETQHGGPVEEEKRNMVKNSLEIIELGLNEAQ-VMTAL 89
            K  ++ LE LK E   +   L   Q       E+   +  ++  IE  +NE +      
Sbjct: 387 LKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDK 446

Query: 90  AGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLEFMS 149
           A  ++  E + ++L   + +  QE   L++E    +++L   ++  ++ E + +  E   
Sbjct: 447 ALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERV 506

Query: 150 SIRKYDSDIQGDENSG 165
              +   ++      G
Sbjct: 507 RGGRAVEEVLKASIQG 522


>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
           [Coenzyme metabolism].
          Length = 400

 Score = 34.3 bits (79), Expect = 0.16
 Identities = 30/113 (26%), Positives = 45/113 (39%), Gaps = 15/113 (13%)

Query: 339 NNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVER 398
             L  L   R +  + EPL K A    EK L K+HP+    L+ L  L      + +   
Sbjct: 296 PRLCRLIP-RLRPGDPEPLIKAA----EKWL-KQHPEDPLLLSTLGRLALKNKLWGKASE 349

Query: 399 YYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHE 451
             + A     LKL P  S        LA    + G+ +EAE + ++ L    +
Sbjct: 350 ALEAA-----LKLRPSAS----DYAELADALDQLGEPEEAEQVRREALLLTRQ 393


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
           receptor (72 kDa mitochondrial outermembrane protein)
           (mitochondrial import receptor for the ADP/ATP carrier)
           (translocase of outermembrane tom70).  [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 34.2 bits (78), Expect = 0.17
 Identities = 29/117 (24%), Positives = 53/117 (45%), Gaps = 9/117 (7%)

Query: 290 PDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRG 349
           P+   + N    +  DQNK+ EA    + A+ + ++T    +  V   +N    L+  + 
Sbjct: 464 PEAPDVYNYYGELLLDQNKFDEAIEKFDTAIELEKETKPM-YMNVLPLINKALALFQWKQ 522

Query: 350 KYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEI 406
            + EAE LC++AL I         P+    +  +A L   Q   +E  + ++RA E+
Sbjct: 523 DFIEAENLCEKALII--------DPECDIAVATMAQLLLQQGDVDEALKLFERAAEL 571


>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat.  This Pfam entry
           includes outlying Tetratricopeptide-like repeats (TPR)
           that are not matched by pfam00515.
          Length = 34

 Score = 29.8 bits (68), Expect = 0.22
 Identities = 11/28 (39%), Positives = 15/28 (53%)

Query: 419 AKTKNNLASCYLKQGKYKEAEILYKQVL 446
           A+   NL   Y K G Y+EA   Y++ L
Sbjct: 1   AEALYNLGLAYYKLGDYEEALEAYEKAL 28



 Score = 29.8 bits (68), Expect = 0.22
 Identities = 11/28 (39%), Positives = 15/28 (53%)

Query: 499 AKTKNNLASCYLKQGKYKEAEILYKQVL 526
           A+   NL   Y K G Y+EA   Y++ L
Sbjct: 1   AEALYNLGLAYYKLGDYEEALEAYEKAL 28



 Score = 29.8 bits (68), Expect = 0.24
 Identities = 13/33 (39%), Positives = 18/33 (54%)

Query: 335 AATLNNLAVLYGKRGKYKEAEPLCKRALEIREK 367
           A  L NL + Y K G Y+EA    ++ALE+   
Sbjct: 1   AEALYNLGLAYYKLGDYEEALEAYEKALELDPN 33



 Score = 28.6 bits (65), Expect = 0.51
 Identities = 12/32 (37%), Positives = 16/32 (50%)

Query: 377 AKQLNNLALLCQNQSKYEEVERYYQRALEIYE 408
           A+ L NL L       YEE    Y++ALE+  
Sbjct: 1   AEALYNLGLAYYKLGDYEEALEAYEKALELDP 32


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 33.8 bits (78), Expect = 0.23
 Identities = 36/157 (22%), Positives = 61/157 (38%), Gaps = 15/157 (9%)

Query: 12  KIESIGKMTVMNQEEIMTNTKTVQQGLEALKNEHTSILNSLSETQHGGPVEEEKRNMVKN 71
           KIE   K+  M ++       T  Q +    +  T I + L E QH     +   +    
Sbjct: 270 KIEQFQKVIKMYEKG--GVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAID---- 323

Query: 72  SLEIIELGLNEAQV--------MTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELAN 123
            LE I    NE           ++     L  +  + +K++  +  L  E     +ELA 
Sbjct: 324 ELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAK 383

Query: 124 TQQKLQSSEQANSQLEEEKKHLEFMSSIRKYDSDIQG 160
            Q +L    +  S+L +EK H   ++ + K DS I+ 
Sbjct: 384 LQDELDKIVKTKSELVKEKYHRGIVTDLLK-DSGIKA 419


>gnl|CDD|205988 pfam13815, Dzip-like_N, Iguana/Dzip1-like DAZ-interacting protein
           N-terminal.  The DAZ gene-product - Deleted in
           Azoospermia - and a closely related sequence are
           required early in germ-cell development in order to
           maintain germ-cell populations. This family is the
           N-terminal region that is the only part of the protein
           in some fungi and lower metazoa.
          Length = 118

 Score = 31.5 bits (72), Expect = 0.34
 Identities = 12/34 (35%), Positives = 17/34 (50%)

Query: 110 LVQENAWLRDELANTQQKLQSSEQANSQLEEEKK 143
           L Q N+ L + L    Q+ +  EQ  S+  EE K
Sbjct: 78  LTQSNSELEERLQEALQEAEEREQELSKQSEELK 111


>gnl|CDD|218915 pfam06156, DUF972, Protein of unknown function (DUF972).  This
           family consists of several hypothetical bacterial
           sequences. The function of this family is unknown.
          Length = 106

 Score = 31.1 bits (71), Expect = 0.37
 Identities = 19/55 (34%), Positives = 32/55 (58%)

Query: 89  LAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKK 143
           L   L  + AE  +L+ Q+  L++ENA LR E  + +++L+  EQ   + E+ KK
Sbjct: 13  LEQQLNQLLAELGELKQQLAELLEENAELRIENEHLRERLEELEQEQKEKEKGKK 67


>gnl|CDD|140213 PTZ00186, PTZ00186, heat shock 70 kDa precursor protein;
           Provisional.
          Length = 657

 Score = 33.1 bits (75), Expect = 0.37
 Identities = 33/149 (22%), Positives = 66/149 (44%), Gaps = 17/149 (11%)

Query: 42  KNEHTSILNSLSETQHGGPVEEEKRNMVKNS----------LEIIELGLN-EAQVMTA-- 88
           K++ T    +++ T +GG  +E+   M+++S           E++E+  N E Q+ TA  
Sbjct: 509 KDKATGKTQNITITANGGLSKEQIEQMIRDSEQHAEADRVKRELVEVRNNAETQLTTAER 568

Query: 89  -LAGHLQNVEAEKQKLRT---QVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKH 144
            L       +AEK+ ++T   ++R+ ++     +D+LA    KLQ +     + E ++  
Sbjct: 569 QLGEWKYVSDAEKENVKTLVAELRKAMENPNVAKDDLAAATDKLQKAVMECGRTEYQQAA 628

Query: 145 LEFMSSIRKYDSDIQGDENSGDQKSDKHK 173
                S        Q  +    Q S++ K
Sbjct: 629 AANSGSSSNSGEQQQQQQQQQQQNSEEKK 657


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 32.6 bits (75), Expect = 0.45
 Identities = 23/98 (23%), Positives = 43/98 (43%), Gaps = 3/98 (3%)

Query: 37  GLEALKNEHTSILNSLSETQHGGPVEEEKRNMVKNSLEIIELGLNEAQV-MTALAGHLQN 95
            LE   N+   + +S++  +      E +R+ ++  L  +      A+     LA  L +
Sbjct: 70  SLERQGNQ--DLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQELDS 127

Query: 96  VEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQ 133
            +    +   QV  L Q+ A LR +LA  +  L +SE+
Sbjct: 128 EKQVSARALAQVELLNQQIAALRRQLAALEAALDASEK 165


>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat. 
          Length = 34

 Score = 28.7 bits (65), Expect = 0.55
 Identities = 13/41 (31%), Positives = 20/41 (48%), Gaps = 8/41 (19%)

Query: 399 YYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAE 439
            Y++ALE+      P+++       NLA   L  G+Y EA 
Sbjct: 1   LYEKALELD-----PNNAEAY---YNLALLLLNLGQYDEAL 33



 Score = 28.7 bits (65), Expect = 0.55
 Identities = 13/41 (31%), Positives = 20/41 (48%), Gaps = 8/41 (19%)

Query: 479 YYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAE 519
            Y++ALE+      P+++       NLA   L  G+Y EA 
Sbjct: 1   LYEKALELD-----PNNAEAY---YNLALLLLNLGQYDEAL 33


>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
           repeats [General function prediction only].
          Length = 484

 Score = 32.8 bits (75), Expect = 0.57
 Identities = 38/200 (19%), Positives = 58/200 (29%), Gaps = 55/200 (27%)

Query: 288 DHPDVATMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGK 347
             PD    L +   +  + NK KEA   L  AL +        ++ +     NLA    K
Sbjct: 335 AQPDNPYYLELAGDILLEANKAKEAIERLKKALALD------PNSPLLQL--NLAQALLK 386

Query: 348 RGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERYYQRALEIY 407
            GK +EA  +  R L           P+                          +A    
Sbjct: 387 GGKPQEAIRILNRYL--------FNDPE-----------------DPNGWDLLAQAYA-- 419

Query: 408 ELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTRAHEREFGACDGDNKPIW-- 465
             +LG    N A+     A  Y   G+ ++A I   +   +              P W  
Sbjct: 420 --ELG----NRAEALLARAEGYALAGRLEQAIIFLMRASQQV---------KLGFPDWAR 464

Query: 466 -QDR--QKNKAKNREKYYQR 482
              R  Q  +     K Y +
Sbjct: 465 ADARIDQLRQQNEEFKPYTK 484


>gnl|CDD|236865 PRK11160, PRK11160, cysteine/glutathione ABC transporter
           membrane/ATP-binding component; Reviewed.
          Length = 574

 Score = 32.1 bits (74), Expect = 0.76
 Identities = 11/54 (20%), Positives = 25/54 (46%), Gaps = 9/54 (16%)

Query: 94  QNVEAEKQKLRTQVRRLVQENAWL---------RDELANTQQKLQSSEQANSQL 138
           Q++   + + R Q+   +Q  A L         R +L  T+Q+  ++++  + L
Sbjct: 190 QDLTHLRAQYRVQLTEWLQGQAELTLFGAEDRYRQQLEQTEQQWLAAQRRQANL 243


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 32.0 bits (73), Expect = 1.0
 Identities = 27/117 (23%), Positives = 53/117 (45%), Gaps = 8/117 (6%)

Query: 38  LEALKNEHTSILNSLSETQHGGPVEEEKRNMVKNSLEIIELGLNEAQ--------VMTAL 89
           LE L+ E   +   L E +        +   ++  LE + L ++E +         + AL
Sbjct: 234 LEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYAL 293

Query: 90  AGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
           A  +  +E +KQ LR ++  L ++   L  +L   + KL    +  ++LEE+ + L+
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELK 350


>gnl|CDD|223054 PHA03359, PHA03359, UL17 tegument protein; Provisional.
          Length = 686

 Score = 31.9 bits (73), Expect = 1.1
 Identities = 38/163 (23%), Positives = 54/163 (33%), Gaps = 29/163 (17%)

Query: 210 EIPARLRTLHNLVIQYA----SQGRYEVAVP----LCKQALEDLEKTSG----KYEAAET 257
            I       ++L+  Y        RY    P    LC+Q    +   +       EAA  
Sbjct: 136 AIGDAAEEYYDLLFTYDELVPPGTRYNADGPRLAALCRQFCAYVRSHAALPPAAKEAAAH 195

Query: 258 LEDCALRSRKESYDIVKQAK-----VAQILGSGGHDHPDVATMLN-----ILALVYRDQN 307
           LE C     + + D V  A        Q+   GG D P     +      IL+L+ R   
Sbjct: 196 LEACLGEDGRSNGDPVLSADAYIDPEEQLTAPGGFDTPAADARIEEDDREILSLIRR--- 252

Query: 308 KYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGKRGK 350
               AA ++     +R     E   AVA+ L   A   G  G 
Sbjct: 253 ----AAEVVASRHPVRAAAPREVGGAVASGLRQGARALGAAGA 291


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 31.8 bits (72), Expect = 1.2
 Identities = 24/69 (34%), Positives = 40/69 (57%), Gaps = 5/69 (7%)

Query: 94  QNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQS--SEQANSQLEEEK-KHLEFMSS 150
           QN + E + L +++  L ++    + ELA  ++KLQS  SEQA+S     K KHLE    
Sbjct: 181 QNADTELKLLESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEE--E 238

Query: 151 IRKYDSDIQ 159
           +++Y+ D +
Sbjct: 239 LKRYEQDAE 247


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
           proteins consist of three coiled-coiled domains and are
           involved in dynein-mediated minus end-directed transport
           from the Golgi apparatus to the endoplasmic reticulum
           (ER). For full functioning they bind with GSK-3beta
           pfam05350 to maintain the anchoring of microtubules to
           the centromere. It appears that amino-acid residues
           437-617 of BicD and the kinase activity of GSK-3 are
           necessary for the formation of a complex between BicD
           and GSK-3beta in intact cells.
          Length = 711

 Score = 31.3 bits (71), Expect = 1.5
 Identities = 26/114 (22%), Positives = 45/114 (39%), Gaps = 5/114 (4%)

Query: 36  QGLEALKNEHTSILNSLSETQHGGPVEEEKRNMVK---NSLEIIELGLNEAQVMTALAGH 92
           + L AL+N   S                E  +  +   N  EI+E     A  ++ +   
Sbjct: 308 EHLNALRNLQASKEQQDGLDSEKDRGSHEDGDYYEVDINGPEILECKYRVA--VSEVGEL 365

Query: 93  LQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
              ++A K +      R   E   LR+E+ N  +KL S E+++ + +E    LE
Sbjct: 366 KAELKALKARYNELQERYEDEKDRLRNEIQNLAEKLLSLEKSSHEDQERSGDLE 419


>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
           Provisional.
          Length = 276

 Score = 30.7 bits (70), Expect = 1.8
 Identities = 13/38 (34%), Positives = 17/38 (44%), Gaps = 4/38 (10%)

Query: 108 RRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHL 145
             L +EN  L+ EL   + +LQ       QLE E   L
Sbjct: 72  FDLREENEELKKELLELESRLQE----LEQLEAENARL 105



 Score = 29.6 bits (67), Expect = 4.1
 Identities = 14/58 (24%), Positives = 26/58 (44%), Gaps = 5/58 (8%)

Query: 83  AQVMTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEE 140
                 ++G  +++ A    LR +   L +E      EL +  Q+L+  E  N++L E
Sbjct: 55  NAPREFVSGVFESL-ASLFDLREENEELKKELL----ELESRLQELEQLEAENARLRE 107


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 30.8 bits (70), Expect = 1.8
 Identities = 13/59 (22%), Positives = 28/59 (47%), Gaps = 1/59 (1%)

Query: 89  LAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQA-NSQLEEEKKHLE 146
           L   ++ +     KL ++   L  EN  L+    +  Q++Q + Q+   +L +E + L+
Sbjct: 64  LVAEVKELRKRLAKLISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLK 122


>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP).  This family consists of
           several PV-1 (PLVAP) proteins which seem to be specific
           to mammals. PV-1 is a novel protein component of the
           endothelial fenestral and stomatal diaphragms. The
           function of this family is unknown.
          Length = 442

 Score = 30.4 bits (68), Expect = 2.5
 Identities = 14/51 (27%), Positives = 30/51 (58%)

Query: 96  VEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
           +E   + LR  + R+ +ENA L+ +    +Q  Q++++A ++ E+E +  E
Sbjct: 283 IEELARGLRADIERVARENADLQRQKLEAEQAAQAAQEAKAKAEKEAQARE 333


>gnl|CDD|203371 pfam06005, DUF904, Protein of unknown function (DUF904).  This
           family consists of several bacterial and archaeal
           hypothetical proteins of unknown function.
          Length = 72

 Score = 28.0 bits (63), Expect = 2.6
 Identities = 12/44 (27%), Positives = 27/44 (61%)

Query: 102 KLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHL 145
           +L +++++ V+  A L+ E+   +++ +   + N +LEEE K L
Sbjct: 8   ELESKIQQAVETIALLQMEIEELKEENEQLSEENEELEEENKKL 51


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 30.0 bits (68), Expect = 2.8
 Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 2/87 (2%)

Query: 62  EEEKRNMVKNSLEIIELGLN-EAQVMTALAGHLQNVEAEK-QKLRTQVRRLVQENAWLRD 119
           +    N +K  L      L  E   +  LA  L   +  + +K R ++R L  + +  R 
Sbjct: 169 DLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRK 228

Query: 120 ELANTQQKLQSSEQANSQLEEEKKHLE 146
           +L   QQ+LQ    A   L  +K  L 
Sbjct: 229 QLEELQQELQELTIAIEALTNKKSELL 255


>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
           binding.  This is the middle region of a family of TATA
           element modulatory factor 1 proteins conserved in
           eukaryotes that contains at its N-terminal section a
           number of leucine zippers that could potentially form
           coiled coil structures. The whole proteins bind to the
           TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells.
          Length = 74

 Score = 27.7 bits (62), Expect = 2.9
 Identities = 14/54 (25%), Positives = 27/54 (50%)

Query: 93  LQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
            + +  ++ K    +++L  +N  L  E+A  ++KL+  E+    LEE  K  E
Sbjct: 21  GEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELENLEERLKRAE 74


>gnl|CDD|235966 PRK07207, PRK07207, ribonucleotide-diphosphate reductase subunit
           alpha; Validated.
          Length = 965

 Score = 30.3 bits (69), Expect = 2.9
 Identities = 29/111 (26%), Positives = 42/111 (37%), Gaps = 22/111 (19%)

Query: 237 LCKQALEDLEKTSGKYEAAETLEDCALRSRKESYDIVKQAKVAQILGSGGHDHPDVATML 296
           L ++A E LE    +   AETL        K  YD V   +V +            A +L
Sbjct: 169 LIEEACEGLEAVDAEPILAETL--------KNLYDGVPMDEVYK------------ALIL 208

Query: 297 NILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLAVLYGK 347
               L+ ++ N     A LL D  T+R + LG        T   +A  Y +
Sbjct: 209 AARTLIEKEPNYSYVTARLLLD--TLRREALGFLGVPEEVTQAEMATRYAE 257


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 29.8 bits (67), Expect = 3.3
 Identities = 20/98 (20%), Positives = 39/98 (39%), Gaps = 2/98 (2%)

Query: 61  VEEEKRNMVKNSLEIIELGLNEAQVMTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDE 120
             E++R+  K  L  +E    E     AL   L  ++ E+++L  ++  L +E+  L  E
Sbjct: 40  DAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGE 99

Query: 121 LANTQQKLQS--SEQANSQLEEEKKHLEFMSSIRKYDS 156
           L   Q++ +   +E+     E        +       S
Sbjct: 100 LVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNLQS 137


>gnl|CDD|234861 PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional.
          Length = 552

 Score = 30.0 bits (69), Expect = 3.3
 Identities = 18/49 (36%), Positives = 22/49 (44%), Gaps = 15/49 (30%)

Query: 313 ANLLN-DALTIREKTLGENHAAVAA-------TLNN-------LAVLYG 346
           A LL+ D LT+  KTL EN A            L+N       LA+L G
Sbjct: 330 AGLLHGDCLTVTGKTLAENLADAPDPDQDVIRPLDNPISPTGGLAILKG 378


>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat. 
          Length = 34

 Score = 26.7 bits (60), Expect = 3.4
 Identities = 10/33 (30%), Positives = 17/33 (51%)

Query: 335 AATLNNLAVLYGKRGKYKEAEPLCKRALEIREK 367
           A     L  +Y + G Y+EA+   ++ALE+   
Sbjct: 1   ARAYYLLGQIYLQLGDYEEAKEYYEKALELDPN 33



 Score = 26.3 bits (59), Expect = 4.4
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 419 AKTKNNLASCYLKQGKYKEAEILYKQVLT 447
           A+    L   YL+ G Y+EA+  Y++ L 
Sbjct: 1   ARAYYLLGQIYLQLGDYEEAKEYYEKALE 29



 Score = 26.3 bits (59), Expect = 4.4
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 499 AKTKNNLASCYLKQGKYKEAEILYKQVLT 527
           A+    L   YL+ G Y+EA+  Y++ L 
Sbjct: 1   ARAYYLLGQIYLQLGDYEEAKEYYEKALE 29


>gnl|CDD|185096 PRK15174, PRK15174, Vi polysaccharide export protein VexE;
           Provisional.
          Length = 656

 Score = 30.1 bits (67), Expect = 3.8
 Identities = 12/47 (25%), Positives = 18/47 (38%), Gaps = 3/47 (6%)

Query: 212 PARLRTLHNLVIQYASQGRYEVAVPLCKQALE---DLEKTSGKYEAA 255
              +R +          G+ E A+PL +Q+L    DL      Y  A
Sbjct: 281 SDNVRIVTLYADALIRTGQNEKAIPLLQQSLATHPDLPYVRAMYARA 327


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 29.5 bits (66), Expect = 4.3
 Identities = 24/89 (26%), Positives = 39/89 (43%), Gaps = 9/89 (10%)

Query: 61  VEEEKRNMVKNSLEIIELGLNEAQVMTALAGHLQNVEAEKQKLRTQVRR---LVQENAWL 117
           V +E +N       ++  GL   Q +       ++ E  K+KL    +    L++E   L
Sbjct: 108 VRKELKN------ALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEEL 161

Query: 118 RDELANTQQKLQSSEQANSQLEEEKKHLE 146
             E    Q++L+  E  NS+LEE  K L 
Sbjct: 162 EAEYEEVQERLKRLEVENSRLEEMLKKLP 190


>gnl|CDD|235315 PRK04841, PRK04841, transcriptional regulator MalT; Provisional.
          Length = 903

 Score = 29.9 bits (68), Expect = 4.4
 Identities = 19/65 (29%), Positives = 29/65 (44%), Gaps = 1/65 (1%)

Query: 384 ALLCQNQSKYEEVERYYQRA-LEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILY 442
           A L Q+Q +Y EV     RA  E+ +  +  D +  A+     A   +  G  +EAE L 
Sbjct: 416 AWLAQSQHRYSEVNTLLARAEQELKDRNIELDGTLQAEFNALRAQVAINDGDPEEAERLA 475

Query: 443 KQVLT 447
           +  L 
Sbjct: 476 ELALA 480


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 29.7 bits (67), Expect = 4.6
 Identities = 13/71 (18%), Positives = 30/71 (42%)

Query: 81  NEAQVMTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEE 140
            E          L+ ++ +  +L + +  ++ E   L  ++    ++L +  Q N +LEE
Sbjct: 260 TEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEE 319

Query: 141 EKKHLEFMSSI 151
           E K  +    +
Sbjct: 320 EYKIKKRTVEL 330


>gnl|CDD|233013 TIGR00540, TPR_hemY_coli, heme biosynthesis-associated TPR protein.
            Members of this protein family are uncharacterized
           tetratricopeptide repeat (TPR) proteins invariably found
           in heme biosynthesis gene clusters. The absence of any
           invariant residues other than Ala argues against this
           protein serving as an enzyme per se. The gene symbol
           hemY assigned in E. coli is unfortunate in that an
           unrelated protein, protoporphyrinogen oxidase (HemG in
           E. coli) is designated HemY in Bacillus subtilis
           [Unknown function, General].
          Length = 367

 Score = 29.6 bits (67), Expect = 4.7
 Identities = 32/105 (30%), Positives = 47/105 (44%), Gaps = 15/105 (14%)

Query: 340 NLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQNQSKYEEVERY 399
            L  LYG R +  +  PL KRA    EK L K+HPD A  L  L  LC  Q  + + + Y
Sbjct: 277 ELLRLYG-RLQPGDPSPLIKRA----EKWL-KKHPDDALLLLALGRLCLRQQLWGKAQSY 330

Query: 400 YQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQ 444
            + +     L L P +    +    LA  + + G  + A   Y++
Sbjct: 331 LEAS-----LSLAPTEEAHLE----LAQLFEQLGDTEAAAQHYRK 366


>gnl|CDD|221956 pfam13174, TPR_6, Tetratricopeptide repeat. 
          Length = 33

 Score = 25.9 bits (58), Expect = 4.7
 Identities = 10/25 (40%), Positives = 14/25 (56%)

Query: 424 NLASCYLKQGKYKEAEILYKQVLTR 448
            LA  YLK G   EA+   +++L R
Sbjct: 5   KLALAYLKLGDTDEAKEALERLLKR 29



 Score = 25.9 bits (58), Expect = 4.7
 Identities = 10/25 (40%), Positives = 14/25 (56%)

Query: 504 NLASCYLKQGKYKEAEILYKQVLTR 528
            LA  YLK G   EA+   +++L R
Sbjct: 5   KLALAYLKLGDTDEAKEALERLLKR 29


>gnl|CDD|224631 COG1717, RPL32, Ribosomal protein L32E [Translation, ribosomal
           structure and biogenesis].
          Length = 133

 Score = 28.5 bits (64), Expect = 5.0
 Identities = 27/92 (29%), Positives = 37/92 (40%), Gaps = 19/92 (20%)

Query: 186 DNDDRNNLSPTPPSQFAQQVNAGYEIPARLRTLHNLVIQYASQGRYEVAVPLCKQALEDL 245
           D+  R  L   PP      V  GY  P  +R LH         G  EV V      ++DL
Sbjct: 47  DSKMRRKLKGKPP-----MVKIGYRSPKAVRGLH-------PSGYEEVLV----HNVKDL 90

Query: 246 EKTSGKYEAAETLEDCALRSRKESYDIVKQAK 277
           EK   + +AA        R R E   I+++A+
Sbjct: 91  EKLDPETQAARIASTVGARKRIE---ILERAR 119


>gnl|CDD|221262 pfam11846, DUF3366, Domain of unknown function (DUF3366).  This
           domain is functionally uncharacterized. This domain is
           found in bacteria. This presumed domain is about 200
           amino acids in length.
          Length = 193

 Score = 28.9 bits (65), Expect = 5.1
 Identities = 21/114 (18%), Positives = 30/114 (26%), Gaps = 22/114 (19%)

Query: 425 LASCYLKQGKYKEAEILYKQVLTRAHEREFGACDGDNKPIWQDR--------------QK 470
               Y+ +  Y   E L +   TR            N  +WQDR                
Sbjct: 64  AVLAYMLRD-YHRVERLTRFEGTR-PANITPLRRVTNPLLWQDRFDWADYATALLLGITP 121

Query: 471 NKAKNREKYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQ 524
                   Y++ AL++   K  P           LA      G   EA     +
Sbjct: 122 QNPALLAPYHEWALKLLRYKPRP------AVYRRLALALALLGDPAEARDWMAE 169


>gnl|CDD|226409 COG3893, COG3893, Inactivated superfamily I helicase [DNA
           replication, recombination, and repair].
          Length = 697

 Score = 29.4 bits (66), Expect = 5.1
 Identities = 30/109 (27%), Positives = 42/109 (38%), Gaps = 14/109 (12%)

Query: 236 PLCKQALEDLEKTSGKYEAAETLEDCALRSRKESYDIVKQAKV--AQILGSGGHDHPDVA 293
           PL +  LE L+      +AA  LE  ALR  K S    +   +  A++LG+   D P   
Sbjct: 399 PLARFGLEFLDLR----DAANVLELAALRGGKVSPGYGELPALLEARLLGTSDDDLPPAW 454

Query: 294 TMLNILALVYRDQNKYKEAANLLNDALTIREKTLGENHAAVAATLNNLA 342
                  L   +       A LL   ++  +  LG  H    A L  LA
Sbjct: 455 RAG----LDPDN----IALARLLLQRISEAKAPLGAGHFEAIADLLALA 495


>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family.  In a
           subset of endospore-forming members of the Firmcutes,
           members of this protein family are found, several to a
           genome. Two very strongly conserved sequences regions
           are separated by a highly variable linker region. Much
           of the linker region was excised from the seed alignment
           for this model. A characterized member is the
           prespore-specific transcription RsfA from Bacillus
           subtilis, previously called YwfN, which is controlled by
           sigma factor F and seems to fine-tune expression of some
           genes in the sigma-F regulon. A paralog in Bacillus
           subtilis is designated YlbO [Regulatory functions, DNA
           interactions, Cellular processes, Sporulation and
           germination].
          Length = 161

 Score = 28.6 bits (64), Expect = 5.2
 Identities = 18/65 (27%), Positives = 32/65 (49%), Gaps = 4/65 (6%)

Query: 93  LQNVE---AEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHL-EFM 148
           LQN++      Q L+ +  RL  +N  L+      +++L+   Q  S +EE+ + L + M
Sbjct: 89  LQNLKTTNPSDQALQKENERLKNQNESLQKRNEELEKELEKLRQRLSTIEEDYQTLIDIM 148

Query: 149 SSIRK 153
              RK
Sbjct: 149 DRARK 153


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 29.7 bits (67), Expect = 5.2
 Identities = 18/94 (19%), Positives = 44/94 (46%), Gaps = 1/94 (1%)

Query: 61  VEEEKRNMVKNSLEIIELGLNEAQVMTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDE 120
           +EE+   +       ++   NE Q +      L+ +E E ++LR ++  L+++   +   
Sbjct: 659 LEEKVEELEAEIRRELQRIENEEQ-LEEKLEELEQLEEELEQLREELEELLKKLGEIEQL 717

Query: 121 LANTQQKLQSSEQANSQLEEEKKHLEFMSSIRKY 154
           +   + +    E+   +LE+ +K LE +  +R+ 
Sbjct: 718 IEELESRKAELEELKKELEKLEKALELLEELREK 751


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 29.6 bits (66), Expect = 5.5
 Identities = 15/58 (25%), Positives = 28/58 (48%)

Query: 89  LAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
            A   Q      Q+   Q+ +  Q+ A   +++   +++LQ  E A ++LE+E   LE
Sbjct: 222 RAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQARLEQEVAQLE 279



 Score = 28.5 bits (63), Expect = 9.7
 Identities = 17/78 (21%), Positives = 32/78 (41%), Gaps = 1/78 (1%)

Query: 70  KNSLEIIELGLNEAQV-MTALAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKL 128
           +  L   +     A+    A    LQ    E++ +R ++    Q  A  + ELA   ++ 
Sbjct: 87  RTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQA 146

Query: 129 QSSEQANSQLEEEKKHLE 146
           Q  +     L E+++ LE
Sbjct: 147 QDLQTRLKTLAEQRRQLE 164


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 28.4 bits (64), Expect = 5.8
 Identities = 14/57 (24%), Positives = 29/57 (50%)

Query: 90  AGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
               +++E   +KL  ++ RL      L+ +L + +++L   +    QLE++ K LE
Sbjct: 49  LEFRESLEETLRKLEAEIERLQNTIERLKTQLEDLERELALLQAKERQLEKKLKTLE 105


>gnl|CDD|216192 pfam00920, ILVD_EDD, Dehydratase family. 
          Length = 521

 Score = 29.4 bits (67), Expect = 6.0
 Identities = 20/56 (35%), Positives = 25/56 (44%), Gaps = 17/56 (30%)

Query: 308 KYKEAANLLN-DALTIREKTLGEN---------HAAVAATLNN-------LAVLYG 346
           K    A LL+ DALT+  +TLGEN            V   L+N       LA+L G
Sbjct: 295 KELLDAGLLHGDALTVTGRTLGENLAEAPILDPDGDVIRPLDNPFKPEGGLAILKG 350


>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
           synthetase/uroporphyrin-III C-methyltransferase;
           Reviewed.
          Length = 656

 Score = 29.3 bits (66), Expect = 6.8
 Identities = 14/57 (24%), Positives = 23/57 (40%), Gaps = 7/57 (12%)

Query: 89  LAGHLQNVEAEKQKLRTQVRRLV-------QENAWLRDELANTQQKLQSSEQANSQL 138
           L    Q  +A+  +LR +  +          + A L  +LA+ Q   Q+ EQ    L
Sbjct: 355 LVQRQQANDAQTAELRVKTEQAQASVHQLDSQFAQLDGKLADAQSAQQALEQQYQDL 411


>gnl|CDD|188247 TIGR02795, tol_pal_ybgF, tol-pal system protein YbgF.  Members of
           this protein family are the product of one of seven
           genes regularly clustered in operons to encode the
           proteins of the tol-pal system, which is critical for
           maintaining the integrity of the bacterial outer
           membrane. The gene for this periplasmic protein has been
           designated orf2 and ybgF. All members of the seed
           alignment were from unique tol-pal gene regions from
           completed bacterial genomes. The architecture of this
           protein is a signal sequence, a low-complexity region
           usually rich in Asn and Gln, a well-conserved region
           with tandem repeats that resemble the tetratricopeptide
           (TPR) repeat, involved in protein-protein interaction.
          Length = 117

 Score = 27.6 bits (62), Expect = 7.2
 Identities = 21/115 (18%), Positives = 39/115 (33%), Gaps = 16/115 (13%)

Query: 337 TLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKEHPDVAKQLNNLALLCQ---NQSKY 393
              + A+L  K G Y +A    +  L        K++P      N    L +    Q  Y
Sbjct: 2   AYYDAALLVLKAGDYADAIQAFQAFL--------KKYPKSTYAPNAHYWLGEAYYAQGDY 53

Query: 394 EEVERYYQRALEIYELKLGPDDSNVAKTKNNLASCYLKQGKYKEAEILYKQVLTR 448
            +  + +   ++ Y     P           L     + G  ++A+   +QV+ R
Sbjct: 54  ADAAKAFLAVVKKY-----PKSPKAPDALLKLGMSLQELGDKEKAKATLQQVIKR 103


>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are about 260 amino acids in
           length.
          Length = 250

 Score = 28.7 bits (65), Expect = 7.2
 Identities = 21/72 (29%), Positives = 34/72 (47%), Gaps = 11/72 (15%)

Query: 80  LNEAQVMTALAGHLQN----VEAEKQKLRTQVRRLVQENAWLR-------DELANTQQKL 128
           L  AQ   A A   Q        + Q+L  ++R+L++E   LR         +AN QQ++
Sbjct: 28  LAAAQQTVAAAAQSQKKIDKWADQTQELLAEIRQLLKEIENLRVYNDQLQRLVANQQQEI 87

Query: 129 QSSEQANSQLEE 140
            S +Q   Q+E+
Sbjct: 88  ASLQQQIEQIEK 99


>gnl|CDD|224705 COG1792, MreC, Cell shape-determining protein [Cell envelope
           biogenesis, outer membrane].
          Length = 284

 Score = 28.9 bits (65), Expect = 7.8
 Identities = 17/51 (33%), Positives = 20/51 (39%), Gaps = 6/51 (11%)

Query: 109 RLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHL--EFMSSIRKYDSD 157
            L  EN  L+ ELA  +Q L+        LEEE K L           D D
Sbjct: 70  DLALENEELKKELAELEQLLEE----VESLEEENKRLKELLDFKESSSDYD 116


>gnl|CDD|184448 PRK14010, PRK14010, potassium-transporting ATPase subunit B;
           Provisional.
          Length = 673

 Score = 28.9 bits (64), Expect = 7.9
 Identities = 25/105 (23%), Positives = 38/105 (36%), Gaps = 13/105 (12%)

Query: 195 PTPPSQFAQQVNAGYEIPARLRTLHNLVIQYASQGRYEVAVPLCKQALEDLEKTSGKYE- 253
           P   S F + V A YE      T     I           V L  +   DL +  G+Y  
Sbjct: 321 PVKSSSFERLVKAAYESSIADDTPEGRSI-----------VKLAYKQHIDLPQEVGEYIP 369

Query: 254 -AAETLEDCALRSRKESYDIVKQAKVAQILGSGGHDHPDVATMLN 297
             AET       + +E Y     + V ++  +GGH   D+  ++ 
Sbjct: 370 FTAETRMSGVKFTTREVYKGAPNSMVKRVKEAGGHIPVDLDALVK 414


>gnl|CDD|240030 cd04673, Nudix_Hydrolase_15, Members of the Nudix hydrolase
           superfamily catalyze the hydrolysis of NUcleoside
           DIphosphates linked to other moieties, X. Enzymes
           belonging to this superfamily require a divalent cation,
           such as Mg2+ or Mn2+, for their activity and contain a
           highly conserved 23-residue nudix motif
           (GX5EX7REUXEEXGU, where U = I, L or V), which functions
           as a metal binding and catalytic site. Substrates of
           nudix hydrolases include intact and oxidatively damaged
           nucleoside triphosphates, dinucleoside polyphosphates,
           nucleotide-sugars and dinucleotide enzymes. These
           substrates are metabolites or cell signaling molecules
           that require regulation during different stages of the
           cell cycle or during periods of stress. In general, the
           role of the nudix hydrolase is to sanitize the
           nucleotide pools and to maintain cell viability, thereby
           serving as surveillance & "house-cleaning" enzymes.
           Substrate specificity is used to define families within
           the superfamily. Differences in substrate specificity
           are determined by the N-terminal extension or by
           residues in variable loop regions. Mechanistically,
           substrate hydrolysis occurs by a nucleophilic
           substitution reaction, with variation in the numbers and
           roles of divalent cations required.
          Length = 122

 Score = 27.6 bits (62), Expect = 8.4
 Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 2/34 (5%)

Query: 250 GKYEAAETLEDCALRSRKESYDIVKQAKVAQILG 283
           GK E  ETLE  ALR   E   +   A+V ++L 
Sbjct: 32  GKVELGETLEQAALRELLEETGLE--AEVGRLLT 63


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 28.8 bits (65), Expect = 8.9
 Identities = 15/54 (27%), Positives = 28/54 (51%)

Query: 93  LQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLE 146
           L  ++ E   L+ Q+    +E A  +      QQ+L + E   ++LEE+++ LE
Sbjct: 144 LHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELE 197


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 28.4 bits (64), Expect = 9.1
 Identities = 21/99 (21%), Positives = 43/99 (43%), Gaps = 4/99 (4%)

Query: 62  EEEKRNMVKNSLEIIELGLN-EAQVMTALAGHLQNV-EAEKQKLRTQVRRLVQENAWLRD 119
           E E  N +K  L   +  L  E + +  L   L++    E  + + ++++L+QE      
Sbjct: 166 ELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVK 225

Query: 120 ELANTQQKLQSSEQANSQLEEEKKHLEFMSSIRKYDSDI 158
           +L   +++LQ  E     L  +K   E  + I + +  +
Sbjct: 226 KLEELEEELQELESKIEDLTNKK--SELNTEIAEAEKKL 262


>gnl|CDD|233734 TIGR02120, GspF, type II secretion system protein F.  This membrane
           protein is a component of the terminal branch complex of
           the general secretion pathway (GSP), also known as
           the"Type II" secretion pathway. The GSP transports
           proteins (generally virulence-associated cell wall
           hydrolases) across the outer membrase of the bacterial
           cell. Transport across the inner membrane is often, but
           not exclusively handled by the Sec system. This model
           was constructed from the broader subfamily model,
           pfam00482 which includes components of pilin complexes
           (PilC) as well as other related genes. GspF is nearly
           always gene clustered with other GSP subunits. Some
           genes from Xylella and Xanthomonas strains score below
           the trusted cutoff due to excessive divergence from the
           family such that a sequence from Deinococcus which does
           not appear to be GspF scores higher [Protein fate,
           Protein and peptide secretion and trafficking].
          Length = 399

 Score = 28.4 bits (64), Expect = 9.2
 Identities = 14/57 (24%), Positives = 23/57 (40%), Gaps = 6/57 (10%)

Query: 51  SLSE--TQHGGPVEEEKRNMVKNSLEIIELGLNEAQVMTALAGHLQNVEAEKQKLRT 105
           SL++   QH        R +V       E       V+  LA +L+  +A + K+ T
Sbjct: 112 SLADALAQHPRDFPPLYRALVAAG----EASGALDAVLERLADYLEERQALRSKITT 164


>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869).  This
           bacterial family of proteins has no known function.
          Length = 131

 Score = 27.6 bits (62), Expect = 9.5
 Identities = 16/59 (27%), Positives = 27/59 (45%), Gaps = 2/59 (3%)

Query: 89  LAGHLQNVEAEKQKLRTQVRRLVQENAWLRDELANTQQKLQSSEQANSQLEEEKKHLEF 147
           L   L  +E E Q+L  Q ++ ++E    +      +Q  Q  EQ  ++  E+K  L F
Sbjct: 25  LQEQLTQLEQELQQLEFQGQKAIKEIR--KQSAQQIEQIQQQVEQERAKRLEQKNQLLF 81


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.128    0.355 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,853,063
Number of extensions: 2604159
Number of successful extensions: 3064
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2881
Number of HSP's successfully gapped: 340
Length of query: 546
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 444
Effective length of database: 6,413,494
Effective search space: 2847591336
Effective search space used: 2847591336
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 61 (27.2 bits)