RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8075
         (1074 letters)



>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain. 
          Length = 372

 Score =  342 bits (880), Expect = e-108
 Identities = 151/381 (39%), Positives = 220/381 (57%), Gaps = 21/381 (5%)

Query: 601 PPRSKLK----SRIPRPYFPNVSVQVNGTIWTELDESKLYASMELEVVDKLFCAYQKNGV 656
            P+ KLK     ++          Q  GT+W ELDE      ++L  +++LF A  K   
Sbjct: 6   KPKKKLKPLHWDKVNPA-------QDRGTVWDELDEESFEKDLDLSELEELFSAKAKKKK 58

Query: 657 PADGSVEDLTKLGKISKSKILSVIDGRRAQNCTILLSKLKMTDEEISKAILSMDSADQLP 716
               S +   K     K K +SV+D +R+QN  ILL KLK+  EEI +AIL MD    L 
Sbjct: 59  ----SKKSEKKSSSKKKKKEISVLDPKRSQNIAILLRKLKLPPEEIVQAILEMD-ESVLG 113

Query: 717 LDMLEQLLKFTPNTEEAALLEEHSTE--TLARADRFLYEISKIPHYEQRLRCLHYKKRFH 774
           L++LE LLK  P  EE   L+E+  +   L RA++FL E+SKIP  E+RL  L +K  F 
Sbjct: 114 LELLENLLKMAPTKEELKKLKEYKGDPSKLGRAEQFLLELSKIPRLEERLEALLFKSTFE 173

Query: 775 FWISDIEPKIKSVMEASREVTRSRRLRKLLELVLALGNYMNKG-ARGNASGFRLTSLNKL 833
             + +++P ++++  AS E+  SR+ +KLLEL+LALGNYMN G  RGNA GF+L+SL KL
Sbjct: 174 EEVEELKPSLETLEAASEELRESRKFKKLLELILALGNYMNSGTRRGNAKGFKLSSLLKL 233

Query: 834 VDTKSSSSRGVTLLHYLVELAEKRFKEVLKLEEDLCHVKEASKVSMKELENDMTQLRNNL 893
            DTKS+ ++  TLLHYLV++  ++  ++L    +L HV++A+KV +++LE D+ +L   L
Sbjct: 234 SDTKSTDNK-TTLLHYLVKIIREKLPDLLDFSSELSHVEKAAKVDLEQLEKDVKELEKGL 292

Query: 894 KEVSREIEFQRVQPLVPGDRFLPVMKEFLTSATCRLSEIEDLFQDMKTRFDRAVRLFGED 953
           K++ RE+E        P D+F+  MKEFL  A  +L ++E L ++    F      FGED
Sbjct: 293 KKLERELELSA-LDEHPDDKFVEKMKEFLEEAEEKLDKLESLLKEAMELFKELTEYFGED 351

Query: 954 NSSIQPEDFFVIFDTFLTTLN 974
                PE+FF I   FL    
Sbjct: 352 PKETSPEEFFKILRDFLRMFK 372


>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain.  FH proteins control
            rearrangements of the actin cytoskeleton, especially in
            the context of cytokinesis and cell polarisation. Members
            of this family have been found to interact with
            Rho-GTPases, profilin and other actin-assoziated
            proteins. These interactions are mediated by the
            proline-rich FH1 domain, usually located in front of FH2
            (but not listed in SMART). Despite this cytosolic
            function, vertebrate formins have been assigned functions
            within the nucleus. A set of Formin-Binding Proteins
            (FBPs) has been shown to bind FH1 with their WW domain.
          Length = 392

 Score =  289 bits (742), Expect = 2e-88
 Identities = 146/451 (32%), Positives = 225/451 (49%), Gaps = 69/451 (15%)

Query: 592  PRPLVRCRQPPRSKLKSRIPRPYFPNVSVQVNGTIWTELDESKLYASMELEVVDKLFCAY 651
            P+P  + +     KL                 GT+W ++DE       +L+ +++LF + 
Sbjct: 4    PKPKKKLKPLHWDKLNPSDLS-----------GTVWDKIDEES---EGDLDELEELF-SA 48

Query: 652  QKNGVPADGSVEDLTKLGKISKSKILSVIDGRRAQNCTILLSKLKMTDEEISKAILSMDS 711
            ++    A   V +   + K   S+   ++D +R+QN  ILL KL M+ EEI +AIL  D 
Sbjct: 49   KEKTKSASKDVSEKKSILKKKASQEFKILDPKRSQNLAILLRKLHMSYEEIKEAILEGDE 108

Query: 712  ADQLPLDMLEQLLKFTPNTEEAALLEEHSTET---LARADRFLYEISKIPHYEQRLRCLH 768
             D L +D+LEQLLK+ P  EE   L E+  E    LARA++FL  IS IP+ E+RL  L 
Sbjct: 109  -DVLSVDLLEQLLKYAPTKEELKKLREYKEEDPEELARAEQFLLLISNIPYLEERLNALL 167

Query: 769  YKKRFHFWISDIEPKIKSVMEASREVTRSRRLRKLLELVLALGNYMNKG-ARGNASGFRL 827
            +K  F   + D++P+I+ V  A  E+  S++ RKLLEL+LA+GNYMN G  RG A GF+L
Sbjct: 168  FKANFEEEVEDLKPQIEKVEAACEELRESKKFRKLLELILAIGNYMNGGSRRGQAYGFKL 227

Query: 828  TSLNKLVDTKSSSSRGVTLLHYLVELAEKRFKEVLKLEEDLCHVKEASKVSMKELENDMT 887
            +SL KL D KS+ ++  TLLH+LV++  K++   L   E+L                   
Sbjct: 228  SSLLKLSDVKSADNK-TTLLHFLVKIIRKKYLGGLSDPENLD------------------ 268

Query: 888  QLRNNLKEVSREIEFQRVQPLVPGDRFLPVMKEFLTSATCRLSEIEDLFQDMKTRFDRAV 947
                                    D+F+ VMK FL +A  +  +++    D+KTRF++ V
Sbjct: 269  ------------------------DKFIEVMKPFLKAAKEKYDKLQKDLSDLKTRFEKLV 304

Query: 948  RLFGEDNSSIQPEDFFVIFDTFLTTLNEAKLDNENMKRRREEEEKRARQDAEMKKRT--- 1004
              +GED     PE+FF  F+ FL   ++A    EN+K + EEEE+R ++  +        
Sbjct: 305  EYYGEDPKDTSPEEFFKDFNEFLKEFSKAA--EENIK-KEEEEEERRKKLVKETTEYEQS 361

Query: 1005 TDRKQGAKENGVVISNGSVKGEFDDLISALR 1035
            + R++    +          G  D L+  L 
Sbjct: 362  SSRQKERNPSMDFEVERDFLGVLDSLLEELG 392


>gnl|CDD|219000 pfam06367, Drf_FH3, Diaphanous FH3 Domain.  This region is found in
           the Formin-like and and diaphanous proteins.
          Length = 197

 Score =  185 bits (472), Expect = 1e-53
 Identities = 72/182 (39%), Positives = 103/182 (56%), Gaps = 6/182 (3%)

Query: 230 GGHKKVLDAMSHYQRFCNERSRFQGIVNDLDKSTGIYRTDVSIKTALMSFLNAVLSYGPG 289
           GGH+KVL+A  +++  C ER RF+ +V  LD S      +V  K A M F+NA+++    
Sbjct: 1   GGHEKVLEATLNFKEVCRERGRFRSLVGALDSS---ENDNVEYKVATMQFINALVNSPE- 56

Query: 290 QHSLEFRLHLRYEVIMLGIQPIIDKLRTHENETLDRHLDFFEMLRHEDESELARKYETDH 349
              L FRLHLR E   LG+  I+DKLR  ENE LD  L  FE  R ED  EL  +++  +
Sbjct: 57  --DLNFRLHLRSEFTRLGLDRILDKLRELENEKLDDQLQIFEENREEDVEELLERFDDVN 114

Query: 350 IETKSATSMFNILRRKLGHTAAYPHFLSLLHHCILLPLEFGCHPQHWLLLDRLAQQLTLQ 409
           ++      +F +L+ K+  T A PHFLS+L H +L+  +    PQ+W LL+ L  Q+ L 
Sbjct: 115 VDLDDPVELFELLKNKVKDTEAEPHFLSILQHLLLIRDDEEELPQYWKLLEELVSQIVLH 174

Query: 410 TE 411
             
Sbjct: 175 RT 176


>gnl|CDD|219001 pfam06371, Drf_GBD, Diaphanous GTPase-binding Domain.  This domain
           is bound to by GTP-attached Rho proteins, leading to
           activation of the Drf protein.
          Length = 187

 Score =  164 bits (417), Expect = 2e-46
 Identities = 69/200 (34%), Positives = 100/200 (50%), Gaps = 26/200 (13%)

Query: 41  LPIPPTNELNTMFEELVEELDLTAPNKAAMLSLPDEKKWQIYCSRK---------MDQTS 91
           L  P  +E+  +FEEL+EE++L    +  M + P EKKWQ+                  S
Sbjct: 1   LQKPDDDEILKLFEELMEEMNLPEEKRRPMRAKPIEKKWQLVVQYLLTNFQGESLSKSLS 60

Query: 92  TTDAPDNPEQYIEQLRNFKSSPLTEHDDTEIKSRTRILDSLRTALRTQPHSFVLRFIECE 151
            ++   +PE Y+++L++                  + L+SLR ALRTQP S+V RFI  E
Sbjct: 61  KSNETGSPEYYVKKLKDGSID-------------QKCLESLRVALRTQPVSWVKRFIGAE 107

Query: 152 GLLTLLEYLKNLD----HYSGQTSLHSSIIGCIKALMNNSTGRSHVLGYPGAIDIIAQSL 207
           GL  LL  L  +                I+ C+KALMNN  G  HVLG+P  I ++A+SL
Sbjct: 108 GLSALLNILSKIVRKKSESDEDLDREYEILKCLKALMNNKFGIDHVLGHPEVILLLARSL 167

Query: 208 STENIKTKIAVLEILGAVCL 227
            +   KT+   LE+L A+CL
Sbjct: 168 DSPRPKTRKLALELLTALCL 187


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 53.2 bits (128), Expect = 8e-07
 Identities = 30/131 (22%), Positives = 64/131 (48%), Gaps = 4/131 (3%)

Query: 401 RLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLA 460
           +L  +L    E  E+ +  L ++  +    L   EEE+ + ++   +L+ E  +L  +  
Sbjct: 734 QLQSRLEELEEELEELEEELEELQER----LEELEEELESLEEALAKLKEEIEELEEKRQ 789

Query: 461 KKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQD 520
             ++EL++  +E E+ +  L  ++  LE         +Q++ ELE+ I EL+ + D  ++
Sbjct: 790 ALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEE 849

Query: 521 ERKRLENLINS 531
           E + LE  +  
Sbjct: 850 ELEELEKELEE 860



 Score = 51.6 bits (124), Expect = 2e-06
 Identities = 31/102 (30%), Positives = 55/102 (53%)

Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
            L   EEE+  A+++ D LE E   L  R  + EQE+++  +E E+L+E L  +++ LE+
Sbjct: 794 ELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE 853

Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
                 E K+++ ELE    EL+      ++E++ LE  +  
Sbjct: 854 LEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRE 895



 Score = 46.6 bits (111), Expect = 7e-05
 Identities = 27/99 (27%), Positives = 55/99 (55%)

Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
           A EEE+   + + +ELE E  +L   L + ++ L++  +E E L+E+L+++K+ +E+   
Sbjct: 727 ALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEE 786

Query: 493 GHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
                ++++ ELE+ + E + R D  + E + LE     
Sbjct: 787 KRQALQEELEELEEELEEAERRLDALERELESLEQRRER 825



 Score = 46.2 bits (110), Expect = 9e-05
 Identities = 31/140 (22%), Positives = 66/140 (47%), Gaps = 7/140 (5%)

Query: 397 LLLDRLAQQ---LTLQTETGEDYDHILLKINVK----KVVHLLAKEEEIVAAKKKADELE 449
            LL+ L +Q   L  Q E  E Y  +  ++        +  L    +E+   +++   LE
Sbjct: 193 DLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLE 252

Query: 450 MENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRIS 509
            E  +L   L + E+E+++   E E+L E L  +++ L +      E + +++ L +R+ 
Sbjct: 253 EELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLE 312

Query: 510 ELKHRADFEQDERKRLENLI 529
           EL++  +  ++  + L+  I
Sbjct: 313 ELENELEELEERLEELKEKI 332



 Score = 46.2 bits (110), Expect = 9e-05
 Identities = 30/98 (30%), Positives = 57/98 (58%)

Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
             EEEI   ++K DELE E  +L   L + ++EL++   EKE+L++ L  +++  E+   
Sbjct: 832 ELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEE 891

Query: 493 GHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
              E + ++AEL++ I +L+ R +  + + +RLE  + 
Sbjct: 892 ELRELESELAELKEEIEKLRERLEELEAKLERLEVELP 929



 Score = 45.1 bits (107), Expect = 2e-04
 Identities = 28/100 (28%), Positives = 58/100 (58%)

Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
            L   +EE+  A+K+ +EL+ E  +L   L + ++EL +  +E E+L+  +S +++RLE+
Sbjct: 254 ELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEE 313

Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLI 529
                 E ++++ EL+++I  LK   +  +   + LE L+
Sbjct: 314 LENELEELEERLEELKEKIEALKEELEERETLLEELEQLL 353



 Score = 45.1 bits (107), Expect = 2e-04
 Identities = 27/111 (24%), Positives = 53/111 (47%), Gaps = 3/111 (2%)

Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
           L   + EI + +++ + L     DL   L + E EL++   E E+L+E L  ++++LE+ 
Sbjct: 399 LEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEEL 458

Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAV 541
                E ++++AEL++ +  L+           RL+ L          +AV
Sbjct: 459 RDRLKELERELAELQEELQRLE---KELSSLEARLDRLEAEQRASQGVRAV 506



 Score = 45.1 bits (107), Expect = 2e-04
 Identities = 27/111 (24%), Positives = 59/111 (53%), Gaps = 1/111 (0%)

Query: 421 LKINVKKVVHLLAKEEEIV-AAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDES 479
           LK  ++ +  LL +    +   +++ +EL+ E   L   L + +  L++  +E E+L+E 
Sbjct: 693 LKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEE 752

Query: 480 LSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
           L  +++RLE+        ++ +A+L++ I EL+ +    Q+E + LE  + 
Sbjct: 753 LEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELE 803



 Score = 44.3 bits (105), Expect = 3e-04
 Identities = 27/104 (25%), Positives = 51/104 (49%), Gaps = 4/104 (3%)

Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
           L   EE +   K+K + L+ E  +    L + EQ L +  + KE+L+E LS + + LE+ 
Sbjct: 318 LEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEEL 377

Query: 491 TAGHLE----AKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
                E     + ++AE+ + + ELK   +  ++  +RL   + 
Sbjct: 378 FEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLE 421



 Score = 43.9 bits (104), Expect = 5e-04
 Identities = 26/102 (25%), Positives = 49/102 (48%)

Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
            L   EEE+   + + ++LE E   L N L   E  L++  ++ E+L+  L  +K  L  
Sbjct: 668 ELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAA 727

Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
                 + + ++ ELE+ + EL+   +  Q+  + LE  + S
Sbjct: 728 LEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELES 769



 Score = 43.9 bits (104), Expect = 5e-04
 Identities = 27/98 (27%), Positives = 51/98 (52%)

Query: 429 VHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLE 488
             L   EE   A +++  ELE E  ++ N L + ++E++   +  E L E L  +K+ L+
Sbjct: 369 ALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELK 428

Query: 489 KETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
           +  A   E + ++ EL + + EL+ + +  +D  K LE
Sbjct: 429 ELEAELEELQTELEELNEELEELEEQLEELRDRLKELE 466



 Score = 42.0 bits (99), Expect = 0.002
 Identities = 20/96 (20%), Positives = 48/96 (50%)

Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
             + E+   +++ +EL+ E ++L   + + E E+    +  E+L+  L  +++RLE+   
Sbjct: 271 ELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKE 330

Query: 493 GHLEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
                K+++ E E  + EL+      ++ ++ LE  
Sbjct: 331 KIEALKEELEERETLLEELEQLLAELEEAKEELEEK 366



 Score = 39.7 bits (93), Expect = 0.010
 Identities = 30/118 (25%), Positives = 55/118 (46%)

Query: 414 EDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEK 473
           E+ + +L ++         AKEE         +ELE     L   LA+ E EL +   E 
Sbjct: 340 EERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNEL 399

Query: 474 EDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
           E+L   +  +++RLE+ +    + K+++ ELE  + EL+   +   +E + LE  +  
Sbjct: 400 EELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEE 457



 Score = 38.5 bits (90), Expect = 0.023
 Identities = 28/100 (28%), Positives = 51/100 (51%), Gaps = 2/100 (2%)

Query: 432 LAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKET 491
           L K E      ++  EL+ E  +L   L   +  L +  +E E+L+E LSR+++ LE+  
Sbjct: 202 LEKLERQAEKAERYQELKAELRELELALLLAK--LKELRKELEELEEELSRLEEELEELQ 259

Query: 492 AGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
               EA++++ EL+  + EL+   +  Q+E   L+  I  
Sbjct: 260 EELEEAEKEIEELKSELEELREELEELQEELLELKEEIEE 299



 Score = 37.0 bits (86), Expect = 0.062
 Identities = 29/106 (27%), Positives = 57/106 (53%), Gaps = 6/106 (5%)

Query: 430  HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
             L   +EEI   +++ +ELE +   L   L + E+EL  + + ++ L+  L R  +RLE+
Sbjct: 899  ELAELKEEIEKLRERLEELEAKLERLEVELPELEEEL--EEEYEDTLETELEREIERLEE 956

Query: 490  ETAG----HLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
            E       +L A ++  E+E+R  ELK + +  ++ +++L  +I  
Sbjct: 957  EIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEE 1002



 Score = 37.0 bits (86), Expect = 0.068
 Identities = 21/99 (21%), Positives = 48/99 (48%)

Query: 432 LAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKET 491
           LA++ E+   +++  ELE +   L   L   + EL       E+L   L  ++ +LE+  
Sbjct: 663 LAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELK 722

Query: 492 AGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
                 ++++ +L+ R+ EL+   +  ++E + L+  + 
Sbjct: 723 RELAALEEELEQLQSRLEELEEELEELEEELEELQERLE 761



 Score = 36.6 bits (85), Expect = 0.085
 Identities = 25/97 (25%), Positives = 49/97 (50%)

Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAG 493
            E+     + KA+  E+E   L  +L +  +EL++  +E   L+E L  +++ LE+    
Sbjct: 209 AEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKE 268

Query: 494 HLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
             E K ++ EL + + EL+      ++E + LE  I+
Sbjct: 269 IEELKSELEELREELEELQEELLELKEEIEELEGEIS 305



 Score = 33.1 bits (76), Expect = 1.1
 Identities = 23/91 (25%), Positives = 50/91 (54%), Gaps = 4/91 (4%)

Query: 399 LDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNR 458
           ++ L ++L   +E  ED    L ++  +    L   + E+    ++ +ELE +  +L +R
Sbjct: 406 IESLEERLERLSERLEDLKEELKELEAE----LEELQTELEELNEELEELEEQLEELRDR 461

Query: 459 LAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
           L + E+EL +  +E + L++ LS ++ RL++
Sbjct: 462 LKELERELAELQEELQRLEKELSSLEARLDR 492


>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator.  In
           eukaryotes, this family of proteins induces
           mitochondrial fission.
          Length = 248

 Score = 49.8 bits (119), Expect = 2e-06
 Identities = 41/146 (28%), Positives = 58/146 (39%), Gaps = 13/146 (8%)

Query: 456 SNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRA 515
            NR   K   L      ++    +L + + RL+     + EA QK++ LED ++ L  RA
Sbjct: 79  PNRHEWKPSPLLPMKLSRQPSVPNLRQEEPRLQSLKPPNEEALQKISALEDELAFL--RA 136

Query: 516 DF----EQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPP 571
                    E     N   S  +  D      S +     P SP   E P   +PPPPPP
Sbjct: 137 QIAKIVAAQE---QSNSTTSDLLSSDESVPSSSTTSF---PISP-PTEEPVLEVPPPPPP 189

Query: 572 ASLPPPPPPRRSSDLPYLTCPRPLVR 597
              PPPP  ++S+    L   R   R
Sbjct: 190 PPPPPPPSLQQSTSAIDLIKERKGQR 215


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 46.3 bits (111), Expect = 8e-05
 Identities = 34/105 (32%), Positives = 52/105 (49%), Gaps = 10/105 (9%)

Query: 424 NVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI 483
            +KK   L AKEE      K  +E E E  +  N L K E+ L QK +  +   E L + 
Sbjct: 53  AIKKEALLEAKEE----IHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKR 108

Query: 484 KDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
           ++ LEK+    LE KQ+  ELE +  EL+      +++ + LE +
Sbjct: 109 EEELEKKEKE-LEQKQQ--ELEKKEEELEEL---IEEQLQELERI 147



 Score = 33.2 bits (77), Expect = 0.90
 Identities = 27/92 (29%), Positives = 45/92 (48%), Gaps = 17/92 (18%)

Query: 424 NVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI 483
           N+ + + LL K EE +  K+K  E + + ++      KKE+EL++  +E+    E +S  
Sbjct: 97  NLDRKLELLEKREEELEKKEKELEQKQQELE------KKEEELEELIEEQLQELERIS-- 148

Query: 484 KDRLEKETAGHLEAKQKVAELEDRISELKHRA 515
              L  E     EAK+ +  LE    E +H A
Sbjct: 149 --GLTAE-----EAKEIL--LEKVEEEARHEA 171


>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
           Provisional.
          Length = 475

 Score = 45.6 bits (108), Expect = 1e-04
 Identities = 24/97 (24%), Positives = 40/97 (41%), Gaps = 12/97 (12%)

Query: 496 EAKQKVAELEDRISELKHRADF----EQDERKRLENLINSGS-VPDDTKAVG--LSRSGG 548
           E +   A+++ +  E++   D       D+++R+E L    + + +  KA+G     + G
Sbjct: 73  EMQVTAAQMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKALGANPVTATG 132

Query: 549 REEPSSPRKPEAPSSCIPPPPPPASLPP-----PPPP 580
              P  P  P  P     P   P S PP      PPP
Sbjct: 133 EPVPQMPASPPGPEGEPQPGNTPVSFPPQGSVAVPPP 169


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 45.4 bits (108), Expect = 2e-04
 Identities = 29/100 (29%), Positives = 51/100 (51%), Gaps = 4/100 (4%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           E EI + ++   E E E  D   RLAK E E+D+   E E+L+  +   + R +K T  +
Sbjct: 300 EAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEY 359

Query: 495 LEAKQKVAELEDRISEL--KHRADFEQ--DERKRLENLIN 530
            E K+++ +L   + E+  +     ++  D R++LE L  
Sbjct: 360 AELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKR 399



 Score = 45.1 bits (107), Expect = 2e-04
 Identities = 19/94 (20%), Positives = 49/94 (52%)

Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
             +EE+   + + +E++ E  +  + L    ++L++  +E  +L   L R+++ L++ + 
Sbjct: 361 ELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSE 420

Query: 493 GHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
              +    +A +E +I+EL+   + +  E K+ E
Sbjct: 421 ELADLNAAIAGIEAKINELEEEKEDKALEIKKQE 454



 Score = 43.5 bits (103), Expect = 8e-04
 Identities = 41/142 (28%), Positives = 67/142 (47%), Gaps = 21/142 (14%)

Query: 397 LLLDRLAQQL-TLQTE--TGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENV 453
           L++D   QQL  L+ E    E Y   LLK   +   + L KE       K+A E + E +
Sbjct: 191 LIIDEKRQQLERLRREREKAERYQ-ALLKEKREYEGYELLKE-------KEALERQKEAI 242

Query: 454 DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA--------GHLEAKQKVAELE 505
           +   +LA  E+EL++ T+E  +L++ L  I+  LE+             L  K+K+ ELE
Sbjct: 243 E--RQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELE 300

Query: 506 DRISELKHRADFEQDERKRLEN 527
             I+ L+     ++ E +  E 
Sbjct: 301 AEIASLERSIAEKERELEDAEE 322



 Score = 42.7 bits (101), Expect = 0.001
 Identities = 23/92 (25%), Positives = 45/92 (48%), Gaps = 2/92 (2%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           E + +  +   + L+ E   L + L + E  LD+ +QE  D    +  I+  +E+     
Sbjct: 675 ELQRLRERL--EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEE 732

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLE 526
            + K+++ ELE+ +S L+   +  + E K LE
Sbjct: 733 EKLKERLEELEEDLSSLEQEIENVKSELKELE 764



 Score = 40.8 bits (96), Expect = 0.004
 Identities = 24/102 (23%), Positives = 50/102 (49%), Gaps = 5/102 (4%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
             +I   +K+ ++LE E   L  RL + E++L    QE E++   L  ++ R+E+     
Sbjct: 715 SRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDL 774

Query: 495 LEAKQKVAELEDRIS-----ELKHRADFEQDERKRLENLINS 531
            + ++ + +LE R+S     E++      ++E  R+E  +  
Sbjct: 775 HKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLRE 816



 Score = 40.1 bits (94), Expect = 0.008
 Identities = 23/107 (21%), Positives = 50/107 (46%), Gaps = 7/107 (6%)

Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIK---DRLEKE 490
            +E+ +  K++   +E E  +L+ +  + E+EL++      DL+  L  +K   D LE +
Sbjct: 838 LQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQ 897

Query: 491 TA----GHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGS 533
                    E + ++ +   R+SELK + +  ++E   +E+      
Sbjct: 898 LRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDE 944



 Score = 38.9 bits (91), Expect = 0.019
 Identities = 23/101 (22%), Positives = 48/101 (47%), Gaps = 10/101 (9%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIK---DRLEKET 491
              I   + K +ELE E  D +  + K+E +L+Q   +    ++ L  +K   DR+EKE 
Sbjct: 426 NAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKEL 485

Query: 492 AGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSG 532
           +   + ++++AE E +    + R        + +E ++ + 
Sbjct: 486 S---KLQRELAEAEAQARASEER----VRGGRAVEEVLKAS 519



 Score = 38.5 bits (90), Expect = 0.023
 Identities = 27/111 (24%), Positives = 49/111 (44%), Gaps = 15/111 (13%)

Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQEL-----DQKTQEKEDLDESLSRIKD 485
           +   + E+   + + +ELE +   L   L   E  L      +   E   L+E +SRI+ 
Sbjct: 753 IENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEA 812

Query: 486 RLE----KETAGHLE---AKQKVAELEDRISELKHRADFEQDERKRLENLI 529
           RL     K     LE    ++++ EL+++  +LK +    +   K +ENL 
Sbjct: 813 RLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQ---IKSIEKEIENLN 860



 Score = 38.5 bits (90), Expect = 0.023
 Identities = 16/95 (16%), Positives = 48/95 (50%)

Query: 436 EEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHL 495
           +E+   ++K ++L+ E  +L   L + ++EL + ++E  DL+ +++ I+ ++ +      
Sbjct: 385 DELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKE 444

Query: 496 EAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
           +   ++ + E ++ +L       + E   L+   +
Sbjct: 445 DKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYD 479



 Score = 37.7 bits (88), Expect = 0.045
 Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 7/73 (9%)

Query: 447 ELEMENVDLSNRLAKKEQELDQKTQEKEDLDE-------SLSRIKDRLEKETAGHLEAKQ 499
            L  E  DL+  +A  E ++++  +EKED           L ++   L K      + K+
Sbjct: 417 RLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKE 476

Query: 500 KVAELEDRISELK 512
           +   +E  +S+L+
Sbjct: 477 EYDRVEKELSKLQ 489



 Score = 35.0 bits (81), Expect = 0.25
 Identities = 20/92 (21%), Positives = 42/92 (45%), Gaps = 7/92 (7%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           E  +    ++  +   +  ++   + + EQE ++  +  E+L+E LS ++  +E      
Sbjct: 701 ENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIE------ 754

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLE 526
              K ++ ELE RI EL+      ++    LE
Sbjct: 755 -NVKSELKELEARIEELEEDLHKLEEALNDLE 785



 Score = 35.0 bits (81), Expect = 0.28
 Identities = 35/178 (19%), Positives = 65/178 (36%), Gaps = 10/178 (5%)

Query: 408 LQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELD 467
           L+    +    +      +    L   EEE+   + +  E+E +   L+      E+E+ 
Sbjct: 777 LEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQ 836

Query: 468 QKTQEKEDLDESLSRIKDRLE------KETAGHLEAKQ-KVAELEDRISELKHRADFEQD 520
           +  +++ DL E +  I+  +E      +E    LE  +  + +LE R+ +LK   D  + 
Sbjct: 837 ELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEA 896

Query: 521 ERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPE---APSSCIPPPPPPASLP 575
           + + LE  I       + K   LS    + E       E           P    SL 
Sbjct: 897 QLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLE 954



 Score = 32.3 bits (74), Expect = 1.6
 Identities = 19/91 (20%), Positives = 41/91 (45%)

Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
             EEE+   +    +LE    DL     + E +L +  ++ E+L+  + + + RL +  A
Sbjct: 865 ELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKA 924

Query: 493 GHLEAKQKVAELEDRISELKHRADFEQDERK 523
                +++++E+ED   E +   + E     
Sbjct: 925 KLEALEEELSEIEDPKGEDEEIPEEELSLED 955



 Score = 31.2 bits (71), Expect = 4.2
 Identities = 21/71 (29%), Positives = 39/71 (54%), Gaps = 6/71 (8%)

Query: 458 RLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADF 517
           RL ++ + L +   E   L   L RI++RL++ +    +A +K+ E+E  I +L+     
Sbjct: 678 RLRERLEGLKR---ELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQE--- 731

Query: 518 EQDERKRLENL 528
           E+  ++RLE L
Sbjct: 732 EEKLKERLEEL 742


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 45.4 bits (108), Expect = 2e-04
 Identities = 30/143 (20%), Positives = 68/143 (47%), Gaps = 8/143 (5%)

Query: 394 QHWLLLDR-------LAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKAD 446
           +  LL+ R       L +      E  E+ + +  ++   +   L     E+   +++ +
Sbjct: 226 ELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEE-KLEELRLEVSELEEEIE 284

Query: 447 ELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELED 506
           EL+ E   L+N +++ EQ+     +   +L+  L  ++ +LE+  +   E  +++AELE+
Sbjct: 285 ELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEE 344

Query: 507 RISELKHRADFEQDERKRLENLI 529
           ++ ELK   +  + E + LE  +
Sbjct: 345 KLEELKEELESLEAELEELEAEL 367



 Score = 43.5 bits (103), Expect = 6e-04
 Identities = 24/92 (26%), Positives = 48/92 (52%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           EE+I   +K   EL  E  +L   L +  +EL++ +++   L + L+R++  +E+     
Sbjct: 690 EEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERI 749

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLE 526
            +  +++ ELE  I EL+ R +  ++E    E
Sbjct: 750 AQLSKELTELEAEIEELEERLEEAEEELAEAE 781



 Score = 41.2 bits (97), Expect = 0.004
 Identities = 30/129 (23%), Positives = 64/129 (49%), Gaps = 6/129 (4%)

Query: 399 LDRLAQQLTLQTETGEDYDHILLKINVKKV-VHLLAKEEEIVAAKKKADELEMENVDLSN 457
           ++ L +++    E   +     L+  + ++   L   EEE+   +K+ +EL  +   L  
Sbjct: 679 IEELEEKIEELEEKIAE-----LEKALAELRKELEELEEELEQLRKELEELSRQISALRK 733

Query: 458 RLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADF 517
            LA+ E E++Q  +    L + L+ ++  +E+      EA++++AE E  I EL+ + + 
Sbjct: 734 DLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQ 793

Query: 518 EQDERKRLE 526
            ++E K L 
Sbjct: 794 LKEELKALR 802



 Score = 40.4 bits (95), Expect = 0.006
 Identities = 25/101 (24%), Positives = 47/101 (46%), Gaps = 7/101 (6%)

Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
           +L +  EI   ++K +ELE +  +L   LA+  +EL++  +E E L + L  +  ++   
Sbjct: 672 ILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQIS-- 729

Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
                  ++ +A LE  + +L+ R      E   LE  I  
Sbjct: 730 -----ALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEE 765



 Score = 40.0 bits (94), Expect = 0.008
 Identities = 24/97 (24%), Positives = 48/97 (49%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           EE+I    +  + L  E  +L   + + E EL+    E+  L+E+L+ ++  LE+ +   
Sbjct: 844 EEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
            E + K +EL   + EL+ +    +   + LE  I++
Sbjct: 904 RELESKRSELRRELEELREKLAQLELRLEGLEVRIDN 940



 Score = 39.3 bits (92), Expect = 0.015
 Identities = 22/97 (22%), Positives = 45/97 (46%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           +EE+ A ++  DEL  E   L+   A   + L+   +     +  L  +++++E+ +   
Sbjct: 795 KEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDI 854

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
                ++ ELE+ I EL+   +   +ER  LE  +  
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNERASLEEALAL 891



 Score = 38.1 bits (89), Expect = 0.029
 Identities = 24/96 (25%), Positives = 45/96 (46%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           EE I    K+  ELE E  +L  RL + E+EL +   E E+L+  + ++K+ L+      
Sbjct: 746 EERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREAL 805

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
            E + ++  L +  + L+ R +  +      E  + 
Sbjct: 806 DELRAELTLLNEEAANLRERLESLERRIAATERRLE 841



 Score = 37.7 bits (88), Expect = 0.037
 Identities = 24/112 (21%), Positives = 53/112 (47%), Gaps = 14/112 (12%)

Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQ-------ELDQKTQEKEDLDESLSRIKDR 486
            E++    +++   LE +  +L  +L + E        EL +  ++ E+L E L  ++  
Sbjct: 300 LEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAE 359

Query: 487 LEKETAGHLEAKQKVAELED-------RISELKHRADFEQDERKRLENLINS 531
           LE+  A   E + ++ ELE+       ++++L+ +     +E +RLE  +  
Sbjct: 360 LEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLER 411



 Score = 37.7 bits (88), Expect = 0.039
 Identities = 24/97 (24%), Positives = 45/97 (46%)

Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
            L    EE+   ++K +EL+ E   L   L + E EL++     E+L+E L  ++ ++ +
Sbjct: 331 KLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQ 390

Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
                     ++  LE R+  L+ R +  Q E + L 
Sbjct: 391 LELQIASLNNEIERLEARLERLEDRRERLQQEIEELL 427



 Score = 37.0 bits (86), Expect = 0.066
 Identities = 29/130 (22%), Positives = 62/130 (47%), Gaps = 4/130 (3%)

Query: 401 RLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLA 460
           +L +++   ++   + +  + ++  +    L   EEE+  A+ + +ELE +   L   L 
Sbjct: 744 QLEERIAQLSKELTELEAEIEELEER----LEEAEEELAEAEAEIEELEAQIEQLKEELK 799

Query: 461 KKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQD 520
              + LD+   E   L+E  + +++RLE         ++++ +LE++I EL    +    
Sbjct: 800 ALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAA 859

Query: 521 ERKRLENLIN 530
           E + LE LI 
Sbjct: 860 EIEELEELIE 869



 Score = 37.0 bits (86), Expect = 0.072
 Identities = 19/92 (20%), Positives = 38/92 (41%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
            E +   + +   L  E  +L  RL   E+ +    +  EDL+E +  + + +E   A  
Sbjct: 802 REALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEI 861

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLE 526
            E ++ + ELE  +  L +     ++    L 
Sbjct: 862 EELEELIEELESELEALLNERASLEEALALLR 893



 Score = 36.6 bits (85), Expect = 0.087
 Identities = 28/129 (21%), Positives = 56/129 (43%), Gaps = 12/129 (9%)

Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK--- 489
             E E+   + + +ELE +   L +++A+ E ++     E E L+  L R++DR E+   
Sbjct: 362 ELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ 421

Query: 490 ---------ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKA 540
                    E A   E + ++ ELE+ + EL+   +  ++  + L   +       D   
Sbjct: 422 EIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAE 481

Query: 541 VGLSRSGGR 549
             L++   R
Sbjct: 482 RELAQLQAR 490



 Score = 35.8 bits (83), Expect = 0.14
 Identities = 24/99 (24%), Positives = 49/99 (49%), Gaps = 5/99 (5%)

Query: 435 EEEIVAAKKKADELEMENVDL-----SNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
           E  +   + + + L+ E  +L        L + + EL++  +E E+L E L R+++ LE+
Sbjct: 406 EARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEE 465

Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
                 EA+Q +   E  +++L+ R D  +  ++ LE  
Sbjct: 466 LREELEEAEQALDAAERELAQLQARLDSLERLQENLEGF 504



 Score = 35.8 bits (83), Expect = 0.16
 Identities = 33/155 (21%), Positives = 63/155 (40%), Gaps = 33/155 (21%)

Query: 399 LDRLAQQLTLQTETGEDYDHILLKINV------KKVVHLLAKEEEIVAAKKKADELEMEN 452
           L+ L +Q+   +E  E     + ++         ++  LL +   +  A       E+E 
Sbjct: 840 LEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALL-RSELEE 898

Query: 453 VDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELK 512
             LS  L + E +  +  +E E+L E L++++ RLE         + ++  L++R+SE +
Sbjct: 899 --LSEELRELESKRSELRRELEELREKLAQLELRLEG-------LEVRIDNLQERLSE-E 948

Query: 513 HRADFE----------------QDERKRLENLINS 531
           +    E                +   KRLEN I  
Sbjct: 949 YSLTLEEAEALENKIEDDEEEARRRLKRLENKIKE 983



 Score = 31.6 bits (72), Expect = 2.9
 Identities = 22/66 (33%), Positives = 36/66 (54%)

Query: 465 ELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKR 524
            L++  +E E+L E L   ++ LE+ TA   E ++K+ EL   +SEL+   +  Q E   
Sbjct: 233 RLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYA 292

Query: 525 LENLIN 530
           L N I+
Sbjct: 293 LANEIS 298


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 43.5 bits (103), Expect = 2e-04
 Identities = 24/93 (25%), Positives = 49/93 (52%), Gaps = 4/93 (4%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           ++E+   +++  EL+ +  +L     + +QEL     E E L + L+RIK    + +A  
Sbjct: 72  QQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIK----QLSANA 127

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLEN 527
           +E  ++  EL + ++ELK   +  + E +RL+ 
Sbjct: 128 IELDEENRELREELAELKQENEALEAENERLQE 160



 Score = 31.9 bits (73), Expect = 1.1
 Identities = 19/52 (36%), Positives = 25/52 (48%), Gaps = 1/52 (1%)

Query: 432 LAKE-EEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSR 482
           L KE   I      A EL+ EN +L   LA+ +QE +    E E L E+  R
Sbjct: 113 LQKELARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQENEQR 164


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 45.3 bits (107), Expect = 2e-04
 Identities = 22/92 (23%), Positives = 29/92 (31%), Gaps = 20/92 (21%)

Query: 545  RSGGREEPSSPRKPEAPSSCIPPPPPPASLPP------PPPPRRSS---------DLPYL 589
            R     + + PR P         P PP+ LPP      PPPP  S               
Sbjct: 2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP 2647

Query: 590  TCPRPLV-----RCRQPPRSKLKSRIPRPYFP 616
               RP       R  +P R++   R  +   P
Sbjct: 2648 PPERPRDDPAPGRVSRPRRARRLGRAAQASSP 2679



 Score = 39.5 bits (92), Expect = 0.011
 Identities = 19/84 (22%), Positives = 28/84 (33%), Gaps = 19/84 (22%)

Query: 549  REEPSSPRKPEAPSSCIP---PPPPPAS----------------LPPPPPPRRSSDLPYL 589
             ++   PR P  PS   P    P PP                  +PPP  PR       +
Sbjct: 2602 VDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV 2661

Query: 590  TCPRPLVRCRQPPRSKLKSRIPRP 613
            + PR   R  +  ++    + PR 
Sbjct: 2662 SRPRRARRLGRAAQASSPPQRPRR 2685



 Score = 38.8 bits (90), Expect = 0.018
 Identities = 19/49 (38%), Positives = 23/49 (46%), Gaps = 4/49 (8%)

Query: 550  EEPSSPRKPEAPSSCIPPPPPPA----SLPPPPPPRRSSDLPYLTCPRP 594
            ++P  P +P+AP    P P PP       PPPPPPR    L   T P  
Sbjct: 2903 DQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG 2951



 Score = 38.4 bits (89), Expect = 0.025
 Identities = 20/68 (29%), Positives = 28/68 (41%), Gaps = 4/68 (5%)

Query: 550  EEPSSPRKPEAPSSCIPPPPPPASLPPPPPPR---RSSDLPYLTCPRPLVRCRQPPRSKL 606
              P    +P  P    PPPPPP    PP  P      +  P    P+P +    P R  +
Sbjct: 2915 PPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAV 2974

Query: 607  -KSRIPRP 613
             + R+P+P
Sbjct: 2975 PRFRVPQP 2982



 Score = 37.6 bits (87), Expect = 0.050
 Identities = 17/36 (47%), Positives = 18/36 (50%)

Query: 548  GREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRS 583
              E P  P+ P  P     PPPPP   PPPPPP R 
Sbjct: 2904 QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939



 Score = 37.2 bits (86), Expect = 0.054
 Identities = 14/53 (26%), Positives = 17/53 (32%), Gaps = 1/53 (1%)

Query: 553  SSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYL-TCPRPLVRCRQPPRS 604
            + P    +  S   PP  P   P P  P      P     P+P      PPR 
Sbjct: 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939



 Score = 36.8 bits (85), Expect = 0.082
 Identities = 26/74 (35%), Positives = 30/74 (40%), Gaps = 3/74 (4%)

Query: 531  SGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLT 590
            S   P D  A  L+ +      +SP  P  P +   P  PP   PPP PP  S  L    
Sbjct: 2800 SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP---PPPGPPPPSLPLGGSV 2856

Query: 591  CPRPLVRCRQPPRS 604
             P   VR R P RS
Sbjct: 2857 APGGDVRRRPPSRS 2870



 Score = 36.8 bits (85), Expect = 0.084
 Identities = 17/70 (24%), Positives = 26/70 (37%), Gaps = 7/70 (10%)

Query: 552  PSSPRKPEA---PSSCIPPPPPPASLPPPPPPR-RSSDLPYLTCPRPLVRCRQPPRSKLK 607
            P++P +P         +       +LPP  P R      P    P+P  +   PP+ +  
Sbjct: 2875 PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAP--PPPQPQPQPPPPPQPQPP 2932

Query: 608  SRIP-RPYFP 616
               P RP  P
Sbjct: 2933 PPPPPRPQPP 2942



 Score = 35.7 bits (82), Expect = 0.17
 Identities = 25/77 (32%), Positives = 32/77 (41%), Gaps = 3/77 (3%)

Query: 539 KAVGLSRSGGREEPSSPRKPEAPSSCIPPPPP-PASLPPPPP-PRRSSDLPYLTCPRPLV 596
            A   +R  G ++ + P  P   S   P P P PAS PPPP  P  S++      P P  
Sbjct: 391 AATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAP-P 449

Query: 597 RCRQPPRSKLKSRIPRP 613
             RQPP    +     P
Sbjct: 450 PERQPPAPATEPAPDDP 466



 Score = 34.9 bits (80), Expect = 0.34
 Identities = 19/50 (38%), Positives = 21/50 (42%), Gaps = 8/50 (16%)

Query: 567  PPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPYFP 616
            PPPP     PP  P RS     +  PRP  R   P    + SR  RP  P
Sbjct: 2551 PPPPLPPAAPPAAPDRS-----VPPPRPAPR---PSEPAVTSRARRPDAP 2592



 Score = 34.9 bits (80), Expect = 0.35
 Identities = 21/92 (22%), Positives = 28/92 (30%), Gaps = 6/92 (6%)

Query: 507  RISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSS-PRKPEAPSSCI 565
              +E            +R  +    G V    +A  L R+     P   PR+  A  +  
Sbjct: 2634 AANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVG 2693

Query: 566  P-----PPPPPASLPPPPPPRRSSDLPYLTCP 592
                   PPPP   P P P    S  P    P
Sbjct: 2694 SLTSLADPPPPPPTPEPAPHALVSATPLPPGP 2725



 Score = 34.5 bits (79), Expect = 0.46
 Identities = 17/72 (23%), Positives = 19/72 (26%), Gaps = 2/72 (2%)

Query: 531  SGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLT 590
             G  P      G    GG      P +  A     P  PP   L  P   R +       
Sbjct: 2843 PGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA--L 2900

Query: 591  CPRPLVRCRQPP 602
             P    R  QP 
Sbjct: 2901 PPDQPERPPQPQ 2912



 Score = 33.4 bits (76), Expect = 0.88
 Identities = 18/55 (32%), Positives = 19/55 (34%), Gaps = 6/55 (10%)

Query: 549  REEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYL-TCPRPLVRCRQPP 602
            R   S    P+ P     P  PP   P P PP      P     PRP     QPP
Sbjct: 2893 RSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP-----QPP 2942



 Score = 33.4 bits (76), Expect = 0.96
 Identities = 9/35 (25%), Positives = 12/35 (34%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDL 586
             + R PE       PPPPP +  P         +
Sbjct: 263 EGADRAPETARGATGPPPPPEAAAPNGAAAPPDGV 297



 Score = 32.6 bits (74), Expect = 1.6
 Identities = 18/53 (33%), Positives = 24/53 (45%), Gaps = 2/53 (3%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPL--VRCRQPP 602
           P++P     P S   P PPP   PP P    + D P     + L  +R R+PP
Sbjct: 431 PATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPP 483



 Score = 31.8 bits (72), Expect = 2.4
 Identities = 19/58 (32%), Positives = 23/58 (39%), Gaps = 9/58 (15%)

Query: 552  PSSPRKPEAP-SSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKS 608
            P  P +P  P ++  PP P P + P   PPRR          RP V      R  L S
Sbjct: 2751 PGGPARPARPPTTAGPPAPAPPAAPAAGPPRRL--------TRPAVASLSESRESLPS 2800



 Score = 31.1 bits (70), Expect = 5.0
 Identities = 14/50 (28%), Positives = 17/50 (34%), Gaps = 2/50 (4%)

Query: 547  GGREEPSSPRKPEAPSSCIPPPPPPASLPP--PPPPRRSSDLPYLTCPRP 594
            GG   P+ P     P +  PP  P A  P     P   S      + P P
Sbjct: 2752 GGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP 2801



 Score = 31.1 bits (70), Expect = 5.2
 Identities = 14/61 (22%), Positives = 19/61 (31%), Gaps = 5/61 (8%)

Query: 535  PDDTKAVGLSRSGGREEPS----SPRKPEAPSSCIPPPPPPASLP-PPPPPRRSSDLPYL 589
            P    A     SG   +P      P +   P   +P P P    P    PP     L  +
Sbjct: 2945 PTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRV 3004

Query: 590  T 590
            +
Sbjct: 3005 S 3005



 Score = 30.7 bits (69), Expect = 6.5
 Identities = 22/72 (30%), Positives = 23/72 (31%), Gaps = 4/72 (5%)

Query: 546  SGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPP----PRRSSDLPYLTCPRPLVRCRQP 601
            +G    P  P  P A      PPP PA  P  P      RR    P    PR  V  R  
Sbjct: 2548 AGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGD 2607

Query: 602  PRSKLKSRIPRP 613
            PR         P
Sbjct: 2608 PRGPAPPSPLPP 2619



 Score = 30.3 bits (68), Expect = 7.4
 Identities = 11/38 (28%), Positives = 14/38 (36%), Gaps = 8/38 (21%)

Query: 551 EPSSPRKPEAPSSCI--------PPPPPPASLPPPPPP 580
           E ++P    AP   +        P   P    PPPP P
Sbjct: 283 EAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAP 320



 Score = 30.3 bits (68), Expect = 7.6
 Identities = 12/53 (22%), Positives = 16/53 (30%), Gaps = 2/53 (3%)

Query: 541  VGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPR 593
             G +R       + P  P  P++    PP    L  P     S     L  P 
Sbjct: 2752 GGPARPARPPTTAGPPAPAPPAAPAAGPPRR--LTRPAVASLSESRESLPSPW 2802



 Score = 30.3 bits (68), Expect = 8.7
 Identities = 18/67 (26%), Positives = 22/67 (32%), Gaps = 6/67 (8%)

Query: 547  GGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKL 606
            GG   P    +   PS    P   PA+ P  PP RR   L      R       PP    
Sbjct: 2853 GGSVAPGGDVRRRPPSR--SPAAKPAA-PARPPVRR---LARPAVSRSTESFALPPDQPE 2906

Query: 607  KSRIPRP 613
            +   P+ 
Sbjct: 2907 RPPQPQA 2913



 Score = 29.9 bits (67), Expect = 9.7
 Identities = 14/51 (27%), Positives = 30/51 (58%), Gaps = 1/51 (1%)

Query: 438  IVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLE 488
            + A +++ADELE    DL+ ++A +      + +   D++ +L R+++R E
Sbjct: 1287 VGATRRRADELEAAAADLAEKMAARRAR-ASRERWAADVEAALDRVENRAE 1336


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 44.8 bits (106), Expect = 3e-04
 Identities = 23/65 (35%), Positives = 25/65 (38%), Gaps = 4/65 (6%)

Query: 538 TKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVR 597
           T A       G   P  P  P+ P    PP PPPAS PP P P  S  L  +  P P   
Sbjct: 94  TLAPASPAREGSPTPPGPSSPDPP----PPTPPPASPPPSPAPDLSEMLRPVGSPGPPPA 149

Query: 598 CRQPP 602
              P 
Sbjct: 150 ASPPA 154



 Score = 43.2 bits (102), Expect = 9e-04
 Identities = 17/83 (20%), Positives = 23/83 (27%), Gaps = 10/83 (12%)

Query: 530 NSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYL 589
           +S S  + + +   S S      +       PS    P P     P  P   R       
Sbjct: 317 SSSSSRESSSSSTSSSSESSRGAAVSP---GPSPSRSPSPSRPPPPADPSSPRKR----- 368

Query: 590 TCPRPLVRCRQPPRSKLKSRIPR 612
             PRP      P  S  +    R
Sbjct: 369 --PRPSRAPSSPAASAGRPTRRR 389



 Score = 40.5 bits (95), Expect = 0.006
 Identities = 24/103 (23%), Positives = 37/103 (35%), Gaps = 6/103 (5%)

Query: 530 NSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYL 589
           +  +V  D  +   +         + R P +P +  PP  PPA+  P PP R S      
Sbjct: 158 SPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASA 217

Query: 590 TCPRPLVRCRQPPRSKLKSRIPRPYFPNVSVQVNGTIWTELDE 632
           + P P      P RS            + S + +G  W   +E
Sbjct: 218 SSPAPA-----PGRSA-ADDAGASSSDSSSSESSGCGWGPENE 254



 Score = 38.6 bits (90), Expect = 0.024
 Identities = 17/89 (19%), Positives = 25/89 (28%), Gaps = 7/89 (7%)

Query: 532 GSVPDDTKAVGLSRSGGREEPSSPRKP-EAPSSCIPPPPPPASLPPPPPPRRSSDLPYLT 590
           G +     A   +     +  S    P     S  P  PP A+  PPP     S      
Sbjct: 800 GRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPA 859

Query: 591 CPRPLVRCR------QPPRSKLKSRIPRP 613
                 R +      +PP  + +     P
Sbjct: 860 AAGGRARGKNGRRRPRPPEPRARPGAAAP 888



 Score = 36.7 bits (85), Expect = 0.079
 Identities = 23/86 (26%), Positives = 26/86 (30%), Gaps = 14/86 (16%)

Query: 531 SGSVPDDTKAVGLSRSGGR----------EEPSSPRKPEAPSSCIPPPPPPASLPPPPPP 580
           S S      AV    S  R           +PSSPRK   PS    P  P AS   P   
Sbjct: 331 SSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSR--APSSPAASAGRPTRR 388

Query: 581 RRSSDLPYLTCPR--PLVRCRQPPRS 604
           R  + +      R          PR 
Sbjct: 389 RARAAVAGRARRRDATGRFPAGRPRP 414



 Score = 34.0 bits (78), Expect = 0.53
 Identities = 24/112 (21%), Positives = 29/112 (25%), Gaps = 13/112 (11%)

Query: 512 KHRADFEQDERKRLENLINSGSVPDDTK---AVGLSRSGGREEPSSPRKPEAP------- 561
              A     +RK   +  + GS                  R   SS  KP A        
Sbjct: 809 ADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAAAGGRARGK 868

Query: 562 SSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRP 613
           +    P PP     P       +         P  R R  PR KL    P P
Sbjct: 869 NGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKL---GPMP 917



 Score = 33.2 bits (76), Expect = 1.1
 Identities = 16/68 (23%), Positives = 19/68 (27%), Gaps = 4/68 (5%)

Query: 531 SGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSD---LP 587
             S      A G +R         P +P A      PP   A+ PP   P          
Sbjct: 852 ESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRV 911

Query: 588 YLTCPRPL 595
            L  P P 
Sbjct: 912 KLG-PMPP 918



 Score = 32.8 bits (75), Expect = 1.3
 Identities = 22/105 (20%), Positives = 27/105 (25%), Gaps = 22/105 (20%)

Query: 531 SGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASL---------------- 574
           SG     ++    S S    E S    P +P S   P  P AS                 
Sbjct: 273 SGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS 332

Query: 575 ------PPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRP 613
                     P    S  P  + P P      P +    SR P  
Sbjct: 333 SESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSS 377



 Score = 32.5 bits (74), Expect = 1.5
 Identities = 17/83 (20%), Positives = 22/83 (26%)

Query: 520 DERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPP 579
            +R         G   +       S         +P  P    S  PP P     PPP P
Sbjct: 62  CDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTP 121

Query: 580 PRRSSDLPYLTCPRPLVRCRQPP 602
           P  S           ++R    P
Sbjct: 122 PPASPPPSPAPDLSEMLRPVGSP 144



 Score = 31.7 bits (72), Expect = 2.9
 Identities = 21/82 (25%), Positives = 24/82 (29%), Gaps = 8/82 (9%)

Query: 540 AVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPR--------RSSDLPYLTC 591
           A G +    R  P+S        S  P P  P S P P  PR        R S     + 
Sbjct: 272 ASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSS 331

Query: 592 PRPLVRCRQPPRSKLKSRIPRP 613
                R          SR P P
Sbjct: 332 SSESSRGAAVSPGPSPSRSPSP 353



 Score = 31.3 bits (71), Expect = 3.6
 Identities = 11/85 (12%), Positives = 19/85 (22%)

Query: 520 DERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPP 579
           D  +     + +G+   D          G    +   +  +  +       PAS      
Sbjct: 46  DSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGS 105

Query: 580 PRRSSDLPYLTCPRPLVRCRQPPRS 604
           P           P        PP  
Sbjct: 106 PTPPGPSSPDPPPPTPPPASPPPSP 130


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 41.6 bits (98), Expect = 0.001
 Identities = 30/93 (32%), Positives = 50/93 (53%), Gaps = 3/93 (3%)

Query: 424 NVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI 483
            VK    L A   EI  AK++ + LE E  +L   + K E+E++   +  E L+++L+  
Sbjct: 84  AVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEA 143

Query: 484 KDRLEKETAGHLEAKQKVAELEDRISELKHRAD 516
           + RLE+E A   E +++  EL  +  ELK + D
Sbjct: 144 EARLEEEVA---EIREEGQELSSKREELKEKLD 173



 Score = 38.5 bits (90), Expect = 0.012
 Identities = 18/95 (18%), Positives = 42/95 (44%)

Query: 424 NVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI 483
           +++  V  L  E + +  + K  E ++  V     L     E+    +    L++ L+ +
Sbjct: 56  DLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAEL 115

Query: 484 KDRLEKETAGHLEAKQKVAELEDRISELKHRADFE 518
            + +EK      + K+++  LE  ++E + R + E
Sbjct: 116 MEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEE 150



 Score = 33.5 bits (77), Expect = 0.38
 Identities = 18/93 (19%), Positives = 35/93 (37%), Gaps = 5/93 (5%)

Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAG 493
            E  I   +K   + + E   L+  L   E EL+    +   L+  +  I++R+++    
Sbjct: 22  LEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKR---- 77

Query: 494 HLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
             E K    + E  +  L       ++    LE
Sbjct: 78  -AEEKLSAVKDERELRALNIEIQIAKERINSLE 109


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 41.5 bits (98), Expect = 0.002
 Identities = 25/101 (24%), Positives = 50/101 (49%), Gaps = 6/101 (5%)

Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKE-DLDESLSRIKDRLEK 489
           L AKE+ I A + KA+  E E       L +K++E +Q  + +E    E + ++ +++E 
Sbjct: 199 LTAKEKAIEAERAKAEAAEAEQ----ELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEA 254

Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
           E    L  ++++ E + +  E   +  F+  E + L+  I 
Sbjct: 255 EREKLLAEQERMLEHKLQEQEELLKEGFKT-EAESLQKEIQ 294


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 40.5 bits (95), Expect = 0.002
 Identities = 28/106 (26%), Positives = 52/106 (49%), Gaps = 4/106 (3%)

Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
            L   + E+   K++  EL+ E  DL  R+A+ E EL+   ++ + L E L  +++RLE 
Sbjct: 73  KLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLES 132

Query: 490 ETAGHLEAKQKVAELE----DRISELKHRADFEQDERKRLENLINS 531
                 E  +++ EL     + + EL+   +  Q+  +RL+  I  
Sbjct: 133 LEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQE 178



 Score = 35.5 bits (82), Expect = 0.076
 Identities = 26/100 (26%), Positives = 50/100 (50%), Gaps = 1/100 (1%)

Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
           LL K        +K +EL+ E  +L  +L++ ++EL+   +   +L+  L  +K+ L+  
Sbjct: 60  LLLKAPVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLL 119

Query: 491 TAGHLEAKQKVAELEDRISEL-KHRADFEQDERKRLENLI 529
                  ++++  LE+ I EL K   +  QD R+ +E L 
Sbjct: 120 RELLKSLEERLESLEESIKELAKELRELRQDLREEVEELR 159


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 41.6 bits (98), Expect = 0.002
 Identities = 33/105 (31%), Positives = 53/105 (50%), Gaps = 10/105 (9%)

Query: 432 LAKEEEIVAAK-KKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
           L ++EE + A+ +K D LE +  +    L+ +E EL++  ++   LD  L R+   L  E
Sbjct: 86  LVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQ---LDNELYRVAG-LTPE 141

Query: 491 TAGHLEAKQKVAELED----RISELKHRADFEQDERKRLENLINS 531
            A  L  K   AELE+    R+ +++  AD E  ERK    L  +
Sbjct: 142 QARKLLLKLLDAELEEEKAQRVKKIEEEADLEA-ERKAQNILAQA 185


>gnl|CDD|222828 PHA01732, PHA01732, proline-rich protein.
          Length = 94

 Score = 38.2 bits (88), Expect = 0.003
 Identities = 13/29 (44%), Positives = 16/29 (55%), Gaps = 5/29 (17%)

Query: 553 SSPRKPEAPSSCIPPPPPPASLPPPPPPR 581
            +P+ PE P      P PPA +PPPPP  
Sbjct: 5   RAPKPPEPP-----APLPPAPVPPPPPAP 28



 Score = 34.7 bits (79), Expect = 0.039
 Identities = 12/28 (42%), Positives = 15/28 (53%)

Query: 560 APSSCIPPPPPPASLPPPPPPRRSSDLP 587
           AP    PP P P +  PPPPP   + +P
Sbjct: 6   APKPPEPPAPLPPAPVPPPPPAPPAPVP 33



 Score = 34.3 bits (78), Expect = 0.054
 Identities = 12/29 (41%), Positives = 12/29 (41%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPP 580
           P  P  P        PPPPPA   P P P
Sbjct: 7   PKPPEPPAPLPPAPVPPPPPAPPAPVPEP 35


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 40.6 bits (95), Expect = 0.003
 Identities = 20/66 (30%), Positives = 22/66 (33%), Gaps = 3/66 (4%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIP 611
           P  P    A  +   PPPPP    PP P     D P    P P      PP   +    P
Sbjct: 45  PPPPSTAAAAPAPAAPPPPPPPAAPPAPQ---PDDPNAAPPPPPADPNAPPPPPVDPNAP 101

Query: 612 RPYFPN 617
            P  P 
Sbjct: 102 PPPAPE 107



 Score = 34.9 bits (80), Expect = 0.21
 Identities = 23/85 (27%), Positives = 28/85 (32%), Gaps = 4/85 (4%)

Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRI 610
            P+    P  P    PP P P      PPP  +   P    P P+     PP +    RI
Sbjct: 54  APAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPAD--PNAPPPPPVDPNAPPPPAPEPGRI 111

Query: 611 PRPYFPNVSVQVNGTIWTELDESKL 635
                    V   G  W E D + L
Sbjct: 112 DNAVGGFSYVVPAG--WVESDATHL 134



 Score = 33.7 bits (77), Expect = 0.51
 Identities = 17/66 (25%), Positives = 21/66 (31%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIP 611
           P++     AP    P     A  P  PPP      P    P        PP +   +  P
Sbjct: 34  PATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPP 93

Query: 612 RPYFPN 617
            P  PN
Sbjct: 94  PPVDPN 99


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 40.3 bits (95), Expect = 0.005
 Identities = 33/104 (31%), Positives = 51/104 (49%), Gaps = 12/104 (11%)

Query: 424 NVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI 483
            +KK   L AKEE      K   ELE E  +  N L + E+ L Q+ +  +   ESL + 
Sbjct: 47  TLKKEALLEAKEE----VHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKK 102

Query: 484 KDRLEKETAGHLEAKQK-VAELEDRISELKHRADFEQDERKRLE 526
           ++ LEK+    L  K+K + E E+ + EL        ++R+ LE
Sbjct: 103 EENLEKKEK-ELSNKEKNLDEKEEELEEL------IAEQREELE 139



 Score = 34.9 bits (81), Expect = 0.22
 Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 14/90 (15%)

Query: 431 LLAKEE----EIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDR 486
           LL +EE    ++ +  KK + LE +       L+ KE+ LD+K +E   L+E ++  ++ 
Sbjct: 85  LLQREETLDRKMESLDKKEENLEKKE----KELSNKEKNLDEKEEE---LEELIAEQREE 137

Query: 487 LEKETA-GHLEAKQKVAELEDRISELKHRA 515
           LE+ +     EAK+ +  LE+   E +H A
Sbjct: 138 LERISGLTQEEAKEIL--LEEVEEEARHEA 165


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 39.9 bits (93), Expect = 0.005
 Identities = 18/102 (17%), Positives = 33/102 (32%), Gaps = 22/102 (21%)

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSR--------- 545
           +E  + V EL+++I   + +  +   ++K    LI SG +  D K V   +         
Sbjct: 17  MEPDETVKELKEKIEAEQGKDAYPVAQQK----LIYSGKILSDDKTVKEYKIKEKDFVVV 72

Query: 546 ---------SGGREEPSSPRKPEAPSSCIPPPPPPASLPPPP 578
                           ++P     P+   P  P       P 
Sbjct: 73  MVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPA 114


>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1.  This region is found
           in some of the Diaphanous related formins (Drfs). It
           consists of low complexity repeats of around 12
           residues.
          Length = 160

 Score = 38.0 bits (88), Expect = 0.008
 Identities = 17/33 (51%), Positives = 17/33 (51%), Gaps = 2/33 (6%)

Query: 547 GGREEPSSPRKPEAPSSCIPPPPPPASLPPPPP 579
           G    P  P  P AP   IPPPPP    PPPPP
Sbjct: 118 GPGVPPPPPPFPGAPG--IPPPPPGMGSPPPPP 148



 Score = 38.0 bits (88), Expect = 0.009
 Identities = 21/52 (40%), Positives = 24/52 (46%), Gaps = 8/52 (15%)

Query: 547 GGREEPSSPRKPEAPSSCIPPPPP---PASLPPPPPPRRSSDLPYLTCPRPL 595
           GG   P  P  P     CIPPPPP      +PPPPP    + +P    P PL
Sbjct: 8   GGVGIPPPP--PLPGGVCIPPPPPLPGGTGIPPPPPLPGGAAIP---PPPPL 54



 Score = 37.6 bits (87), Expect = 0.011
 Identities = 23/64 (35%), Positives = 29/64 (45%), Gaps = 4/64 (6%)

Query: 555 PRKPEAPSSCIPPPPPP----ASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRI 610
           P  P    + IPPPPPP    A++PPPPP      +P    P P      PP   + S  
Sbjct: 86  PPPPLPGGAGIPPPPPPLPGGAAVPPPPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGSPP 145

Query: 611 PRPY 614
           P P+
Sbjct: 146 PPPF 149



 Score = 36.5 bits (84), Expect = 0.030
 Identities = 16/35 (45%), Positives = 17/35 (48%), Gaps = 2/35 (5%)

Query: 547 GGREEPSSPRKPEAPSSCIPPPPPP--ASLPPPPP 579
           GG   P  P  P  P    PPPP P    +PPPPP
Sbjct: 105 GGAAVPPPPPLPGGPGVPPPPPPFPGAPGIPPPPP 139



 Score = 35.7 bits (82), Expect = 0.043
 Identities = 25/73 (34%), Positives = 30/73 (41%), Gaps = 15/73 (20%)

Query: 547 GGREEPSSPRKPEAPSSCIPPPPP---PASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPR 603
           G    P  P  P A  + IPPPPP    A +PPPPP    + +P    P P      PP 
Sbjct: 56  GVAGIPPPPPLPGA--TAIPPPPPLPGAAGIPPPPPLPGGAGIP----PPP------PPL 103

Query: 604 SKLKSRIPRPYFP 616
               +  P P  P
Sbjct: 104 PGGAAVPPPPPLP 116



 Score = 34.9 bits (80), Expect = 0.089
 Identities = 25/71 (35%), Positives = 27/71 (38%), Gaps = 17/71 (23%)

Query: 555 PRKPEAPSSCIPPPPPPASLP-----PPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSR 609
           P  P  P     PPPPP  LP     PPPPP     LP  T   P      PP     + 
Sbjct: 1   PPPPPLPGGVGIPPPPP--LPGGVCIPPPPP-----LPGGTGIPP-----PPPLPGGAAI 48

Query: 610 IPRPYFPNVSV 620
            P P  P V+ 
Sbjct: 49  PPPPPLPGVAG 59



 Score = 33.8 bits (77), Expect = 0.22
 Identities = 28/81 (34%), Positives = 30/81 (37%), Gaps = 27/81 (33%)

Query: 547 GGREEPSSPRKPEAPSSCIPPPP---------PPASLP-----PPPPPRRSSDLPYLTCP 592
           GG   P  P  P    + IPPPP         PP  LP     PPPPP     LP  T  
Sbjct: 20  GGVCIPPPPPLPGG--TGIPPPPPLPGGAAIPPPPPLPGVAGIPPPPP-----LPGATAI 72

Query: 593 RPLVRCRQPPRSKLKSRIPRP 613
            P      PP     + IP P
Sbjct: 73  PP------PPPLPGAAGIPPP 87


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 39.6 bits (93), Expect = 0.011
 Identities = 31/119 (26%), Positives = 57/119 (47%), Gaps = 15/119 (12%)

Query: 421 LKINVKKVVHLLAKEEEIVAAK--------KKADELEMENVDLSNRLAKKEQELDQKTQE 472
           L+   + +  L+A+  E +  K        ++A ELE E  +     A+ E+E ++  +E
Sbjct: 514 LEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREE 573

Query: 473 KEDLDESLSRIKDRLEK-ETAGHLEAKQKVAELEDRISEL----KHRADFEQDERKRLE 526
             +L+  L+ +K+R+E  E    L A   +A+ ED I  L    +  A+   + R+RL 
Sbjct: 574 VAELNSKLAELKERIESLERIRTLLA--AIADAEDEIERLREKREALAELNDERRERLA 630



 Score = 35.8 bits (83), Expect = 0.15
 Identities = 30/119 (25%), Positives = 55/119 (46%), Gaps = 22/119 (18%)

Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
           L+  V++V   L + E++V A+ + + LE    DL   +A++ + +++K +  E+L E  
Sbjct: 487 LEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRER- 545

Query: 481 SRIKDRLEKETAGHLEAKQK-----------VAELEDRISELKHRADFEQDERKRLENL 528
                 LE E     EA  +           VAEL  +++ELK       +  + LE +
Sbjct: 546 ---AAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELK-------ERIESLERI 594



 Score = 35.8 bits (83), Expect = 0.17
 Identities = 21/91 (23%), Positives = 44/91 (48%)

Query: 436 EEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHL 495
           EE+   +++ +ELE E  DL       + + +     +E+L++    ++DRLE+      
Sbjct: 279 EEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQ 338

Query: 496 EAKQKVAELEDRISELKHRADFEQDERKRLE 526
              ++   L +   +L+ RA+  ++E   LE
Sbjct: 339 AHNEEAESLREDADDLEERAEELREEAAELE 369



 Score = 34.2 bits (79), Expect = 0.48
 Identities = 22/79 (27%), Positives = 37/79 (46%), Gaps = 5/79 (6%)

Query: 449 EMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKET---AGHLEAKQKVAELE 505
           E E  DL  RL   E EL +  +E E  +E   + ++  ++       H E ++++  LE
Sbjct: 198 EKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLE 257

Query: 506 DRISELKHRADFEQDERKR 524
             I +L  R    + ER+R
Sbjct: 258 AEIEDL--RETIAETERER 274



 Score = 32.7 bits (75), Expect = 1.2
 Identities = 31/96 (32%), Positives = 48/96 (50%), Gaps = 10/96 (10%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           EE +   ++K DEL  E  DL   +   E EL+    E E+L E    +++R+E   A +
Sbjct: 659 EEYLEQVEEKLDELREERDDLQAEIGAVENELE----ELEELRERREALENRVEALEALY 714

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
            EA+    ELE    +L  RA+  Q   + LE ++N
Sbjct: 715 DEAE----ELESMYGDL--RAELRQRNVETLERMLN 744



 Score = 31.9 bits (73), Expect = 2.3
 Identities = 21/108 (19%), Positives = 54/108 (50%), Gaps = 11/108 (10%)

Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQ------KTQEKEDLDESLSRIKDR 486
            K E    A+++A+E   E  +L+++LA+ ++ ++             D ++ + R++++
Sbjct: 555 EKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREK 614

Query: 487 LEKETAGHLEAKQKVAELEDRISELKHRADFE-----QDERKRLENLI 529
            E     + E ++++AE  +R  EL+   D       +++++R E  +
Sbjct: 615 REALAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYL 662



 Score = 30.8 bits (70), Expect = 5.1
 Identities = 21/87 (24%), Positives = 39/87 (44%), Gaps = 7/87 (8%)

Query: 435 EEEIVAAKKKADELE--MENV-----DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRL 487
           EE     +++A ELE  +E       D    + + E+E+++  +   D    L   +D L
Sbjct: 355 EERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFL 414

Query: 488 EKETAGHLEAKQKVAELEDRISELKHR 514
           E+      E +++ AELE  +   + R
Sbjct: 415 EELREERDELREREAELEATLRTARER 441



 Score = 30.0 bits (68), Expect = 8.5
 Identities = 26/100 (26%), Positives = 48/100 (48%), Gaps = 12/100 (12%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLE--KETA 492
           + E V A++  +ELE  + +L +RL +         +E E L E    +++R E  +E A
Sbjct: 308 DAEAVEARR--EELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEA 365

Query: 493 GHLE-----AKQKVAELEDRISELKHRADFEQDERKRLEN 527
             LE     A++ V +  + I EL+   +   + R+R  +
Sbjct: 366 AELESELEEAREAVEDRREEIEELEEEIE---ELRERFGD 402



 Score = 30.0 bits (68), Expect = 9.7
 Identities = 22/112 (19%), Positives = 46/112 (41%), Gaps = 21/112 (18%)

Query: 432 LAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIK------- 484
           L +   ++AA   A++      ++     K+E   +   + +E L E   R +       
Sbjct: 591 LERIRTLLAAIADAED------EIERLREKREALAELNDERRERLAEKRERKRELEAEFD 644

Query: 485 -DRLEKETAGHLEAKQKVAELEDRISELKHRADFEQD-------ERKRLENL 528
             R+E+       A++ + ++E+++ EL+   D  Q        E + LE L
Sbjct: 645 EARIEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEEL 696


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 39.3 bits (91), Expect = 0.013
 Identities = 35/141 (24%), Positives = 62/141 (43%), Gaps = 14/141 (9%)

Query: 405 QLTLQTETGEDYDHIL--LKINVKKVVHLLAK----EEEIVAAKKKADELEMENVDLSNR 458
           QL    E  + Y++ L  LK  +K++ H L+K    + EI A K +  ++E +N +L  +
Sbjct: 232 QLESSREIVKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELK 291

Query: 459 LAKKEQELDQKTQEKEDL--------DESLSRIKDRLEKETAGHLEAKQKVAELEDRISE 510
           + K  Q  D++  +            +  L   +  LEK         Q+  EL      
Sbjct: 292 MEKVFQGTDEQLNDLYHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQGR 351

Query: 511 LKHRADFEQDERKRLENLINS 531
           L+ +AD  Q+  +  ++LI S
Sbjct: 352 LQLQADRHQEHIRARDSLIQS 372



 Score = 31.2 bits (70), Expect = 3.6
 Identities = 34/151 (22%), Positives = 67/151 (44%), Gaps = 20/151 (13%)

Query: 399 LDRLAQQLTLQT-ETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSN 457
           LDR  QQ+  +  E   + D ++ KI + + + +  ++E+I   K K +EL+ E + +  
Sbjct: 820 LDRTVQQVNQEKQEKQHELDTVVSKIELNRKL-IQDQQEQIQHLKSKTNELKSEKLQIGT 878

Query: 458 RLAKKEQELDQKTQEKEDLDESLSRIKDRLEK----------------ETAGHLEAKQKV 501
            L +++Q  +Q  +   ++   +  IKD  E+                E     E   K 
Sbjct: 879 NLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKK 938

Query: 502 AELEDRISELKHRADFEQDERKRLENLINSG 532
           A  +D+++++K +        K +EN I  G
Sbjct: 939 A--QDKVNDIKEKVKNIHGYMKDIENKIQDG 967



 Score = 30.4 bits (68), Expect = 7.2
 Identities = 15/60 (25%), Positives = 26/60 (43%)

Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
              K       KK+   L+ E  DL  +L K +QE++Q         +     KD+++K+
Sbjct: 489 KAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKD 548


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 38.7 bits (90), Expect = 0.015
 Identities = 12/30 (40%), Positives = 15/30 (50%)

Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPP 580
              SP +P AP + IP PP P  +    PP
Sbjct: 226 PGPSPAQPSAPPASIPAPPIPPVIQYVAPP 255


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 38.9 bits (91), Expect = 0.016
 Identities = 27/97 (27%), Positives = 54/97 (55%), Gaps = 3/97 (3%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI--KDRLEKETA 492
           EE I   +K+ +E+  E  ++S+ L +  +EL++  +E ++L+E    I   ++  +   
Sbjct: 192 EELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLE 251

Query: 493 GHLEA-KQKVAELEDRISELKHRADFEQDERKRLENL 528
           G     ++K+ ELE+RI ELK   +  +++ K L+ L
Sbjct: 252 GSKRKLEEKIRELEERIEELKKEIEELEEKVKELKEL 288



 Score = 37.0 bits (86), Expect = 0.064
 Identities = 29/110 (26%), Positives = 61/110 (55%), Gaps = 9/110 (8%)

Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAK-KEQELDQKTQEKEDLDES 479
           L+  ++++  +L KE E++  K+ A++L+    +L  +L K   +EL++K +E E L E 
Sbjct: 478 LRKELRELEKVLKKESELIKLKELAEQLK----ELEEKLKKYNLEELEKKAEEYEKLKEK 533

Query: 480 LSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLI 529
           L     +L+ E     +  +K+ EL+ +++EL+ + D  ++E   L   +
Sbjct: 534 L----IKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKEL 579



 Score = 36.2 bits (84), Expect = 0.11
 Identities = 28/108 (25%), Positives = 56/108 (51%), Gaps = 11/108 (10%)

Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
           L+  VK++  L  K EE +   +  +E   E  ++  RL++ E+E++   +  ++L+E  
Sbjct: 278 LEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKE 337

Query: 481 SRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
            R++           E K+K+ ELE R+ EL+ R +  ++ + + E L
Sbjct: 338 ERLE-----------ELKKKLKELEKRLEELEERHELYEEAKAKKEEL 374



 Score = 33.1 bits (76), Expect = 0.89
 Identities = 23/106 (21%), Positives = 50/106 (47%), Gaps = 5/106 (4%)

Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDL-----DESL 480
            + + L   E+E+   +K+  +LE E       LA+ E+ L++  +E E+L     +E  
Sbjct: 602 NEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEY 661

Query: 481 SRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
             +++   + +      + ++ ELE R  E+K   +  ++E +  E
Sbjct: 662 EELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEERE 707



 Score = 33.1 bits (76), Expect = 1.0
 Identities = 24/99 (24%), Positives = 52/99 (52%), Gaps = 3/99 (3%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKT--QEKEDLDESLSRI-KDRLEKET 491
           E+E+   ++K  +L  E  +L   L K+ + +  K   ++ ++L+E L +   + LEK+ 
Sbjct: 465 EKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKA 524

Query: 492 AGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
             + + K+K+ +L+  I  LK   +  ++ +K+L  L  
Sbjct: 525 EEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEK 563



 Score = 32.7 bits (75), Expect = 1.4
 Identities = 27/94 (28%), Positives = 47/94 (50%), Gaps = 1/94 (1%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           EE I   KK+ +ELE E V     L +K +E  + ++  E+  + L  I+ RL +     
Sbjct: 265 EERIEELKKEIEELE-EKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEI 323

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
              ++++ ELE++   L+      ++  KRLE L
Sbjct: 324 NGIEERIKELEEKEERLEELKKKLKELEKRLEEL 357



 Score = 32.3 bits (74), Expect = 1.7
 Identities = 29/102 (28%), Positives = 51/102 (50%), Gaps = 7/102 (6%)

Query: 434 KEEEIVAAKKKADELEMENV-DLSNRLAKKEQ------ELDQKTQEKEDLDESLSRIKDR 486
            EEE+    K+ +EL  E+V +L  RL + E       EL    +E E  ++ L ++++ 
Sbjct: 568 LEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEE 627

Query: 487 LEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
           L+K      E ++++ EL   + EL+ +   E+ E  R E L
Sbjct: 628 LDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYL 669



 Score = 31.2 bits (71), Expect = 3.7
 Identities = 20/82 (24%), Positives = 40/82 (48%), Gaps = 5/82 (6%)

Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIK---DRLEKE 490
             +E+   +KK  E E E   L     +  +EL     E E+L++    IK   ++L++E
Sbjct: 645 LRKELEELEKKYSEEEYEE--LREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEE 702

Query: 491 TAGHLEAKQKVAELEDRISELK 512
                +AK+++ +LE  +  ++
Sbjct: 703 LEEREKAKKELEKLEKALERVE 724



 Score = 30.0 bits (68), Expect = 7.8
 Identities = 18/82 (21%), Positives = 44/82 (53%), Gaps = 1/82 (1%)

Query: 425 VKKVVHLLAKEEEIVAAKKKADELEMENVD-LSNRLAKKEQELDQKTQEKEDLDESLSRI 483
           V + ++ ++ E   +  + +  E E++ ++ L   + + E+EL+     K  L+E +  +
Sbjct: 205 VLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIREL 264

Query: 484 KDRLEKETAGHLEAKQKVAELE 505
           ++R+E+      E ++KV EL+
Sbjct: 265 EERIEELKKEIEELEEKVKELK 286



 Score = 30.0 bits (68), Expect = 9.3
 Identities = 26/92 (28%), Positives = 46/92 (50%), Gaps = 16/92 (17%)

Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
           L+   +      AK+EE+   KK+   L  E +         E+EL++  + KE+++E +
Sbjct: 357 LEERHELYEEAKAKKEELERLKKRLTGLTPEKL---------EKELEELEKAKEEIEEEI 407

Query: 481 SRIKDRLEKETAGHLEAKQKVAELEDRISELK 512
           S+I  R+        E K+++ EL+  I ELK
Sbjct: 408 SKITARIG-------ELKKEIKELKKAIEELK 432


>gnl|CDD|218146 pfam04554, Extensin_2, Extensin-like region. 
          Length = 57

 Score = 34.7 bits (80), Expect = 0.016
 Identities = 17/49 (34%), Positives = 18/49 (36%), Gaps = 4/49 (8%)

Query: 566 PPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPY 614
            PPPP     PPPP    S  P    P      + PP    KS  P  Y
Sbjct: 5   SPPPPVKQYSPPPPYYYKSPPP----PVKSPVYKSPPPPVYKSPPPPKY 49



 Score = 32.8 bits (75), Expect = 0.060
 Identities = 13/35 (37%), Positives = 13/35 (37%)

Query: 553 SSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLP 587
            SP  P        PPPP    PPPP     S  P
Sbjct: 22  KSPPPPVKSPVYKSPPPPVYKSPPPPKYVYKSPPP 56



 Score = 32.4 bits (74), Expect = 0.098
 Identities = 19/59 (32%), Positives = 23/59 (38%), Gaps = 6/59 (10%)

Query: 553 SSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIP 611
            SP  P        PPPP     PPPP        Y + P P+ +   PP+   KS  P
Sbjct: 4   KSPPPPVKQY---SPPPPYYYKSPPPPV---KSPVYKSPPPPVYKSPPPPKYVYKSPPP 56



 Score = 31.3 bits (71), Expect = 0.24
 Identities = 16/51 (31%), Positives = 18/51 (35%), Gaps = 7/51 (13%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
             SP  P    S  PP   P    PPPP  +S        P P    + PP
Sbjct: 12  QYSPPPPYYYKSPPPPVKSPVYKSPPPPVYKSP-------PPPKYVYKSPP 55


>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region.  This family
           represents an N-terminal conserved region found in
           several huntingtin-associated protein 1 (HAP1)
           homologues. HAP1 binds to huntingtin in a polyglutamine
           repeat-length-dependent manner. However, its possible
           role in the pathogenesis of Huntington's disease is
           unclear. This family also includes a similar N-terminal
           conserved region from hypothetical protein products of
           ALS2CR3 genes found in the human juvenile amyotrophic
           lateral sclerosis critical region 2q33-2q34.
          Length = 307

 Score = 38.3 bits (89), Expect = 0.017
 Identities = 50/218 (22%), Positives = 85/218 (38%), Gaps = 28/218 (12%)

Query: 321 ETLDRHLDFFEMLRHE--DESELARKY--ETDHIETKSATSMFNILRRKLGHTAAYPHFL 376
           E L +  D    LRHE   + EL + Y    +  E +S+ S     +     +       
Sbjct: 105 EQLGKARDEILQLRHELNLKDELLQFYSDADEESEDESSESTPLRPQESSSSSHGCFQLE 164

Query: 377 SLLHHCILLPLEFGCHPQHWLLLDRLAQQLTLQTETGEDYDHILLKINVKKV----VHLL 432
           +L     LL        +    L   A  L  +T T E+ +  L+   VK++      + 
Sbjct: 165 ALQEKLKLLE-------EENEHLRSEASHLKTETVTYEEKEQQLVNDCVKQLREANDQIA 217

Query: 433 AKEEEIVAAKKKADELEM---ENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
           +  EE+    KK ++LE    E   L +++   +++      E E+L + L+  KD   +
Sbjct: 218 SLSEELA---KKTEDLERQQEEITHLLSQIVDLQKKCKSYALENEELQQHLAAAKDAQRQ 274

Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLEN 527
             A   E + K AE  + + E        Q+E K L N
Sbjct: 275 LQAELQELQDKYAECMEMLHE-------AQEELKNLRN 305


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 38.1 bits (89), Expect = 0.019
 Identities = 36/142 (25%), Positives = 66/142 (46%), Gaps = 14/142 (9%)

Query: 396 WLLLDRLAQQLTLQTET-GEDYDHILLKINVKKVVH------LLAKEEEIVAAKKKADEL 448
             LL+ L + L    E   EDY  ++ ++ +   +         A EEE+   K+  DEL
Sbjct: 139 MKLLEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDEL 198

Query: 449 EMENVDLSNR----LAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAEL 504
           E  +    +R    L K  QE+  K ++ E+L+E L  ++ ++E  T    E   ++AE 
Sbjct: 199 EDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEA 258

Query: 505 EDRISELKHRADFEQDERKRLE 526
           E ++ + +    F   E ++L+
Sbjct: 259 EKKLEQCRG---FTFKEIEKLK 277



 Score = 35.4 bits (82), Expect = 0.14
 Identities = 16/85 (18%), Positives = 38/85 (44%), Gaps = 4/85 (4%)

Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAG 493
              E+  AK+K  +L  E +    +L + E+EL +   + EDL    S +   + +    
Sbjct: 202 DPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKK 261

Query: 494 HLEAKQ----KVAELEDRISELKHR 514
             + +     ++ +L++++  L+  
Sbjct: 262 LEQCRGFTFKEIEKLKEQLKLLQSL 286


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 38.2 bits (89), Expect = 0.020
 Identities = 18/75 (24%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 452 NVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISEL 511
           N DLS    K+++ L +  ++ +D  + L  +KD  +       +AK++  + E+   + 
Sbjct: 359 NTDLSG--DKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQ 416

Query: 512 KHRADFEQDERKRLE 526
           K +AD ++++R++ E
Sbjct: 417 KEQADEDKEKRQKDE 431


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 37.5 bits (87), Expect = 0.029
 Identities = 16/47 (34%), Positives = 18/47 (38%), Gaps = 6/47 (12%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPP------RRSSDLPYLTCP 592
           P     P +P +  P P  PAS PPP P          S  P L  P
Sbjct: 157 PPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSSHPPLKSP 203



 Score = 35.2 bits (81), Expect = 0.16
 Identities = 15/51 (29%), Positives = 17/51 (33%), Gaps = 3/51 (5%)

Query: 558 PEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKS 608
           P  P +  P  PP A   P  P       P    P    +   PP   LKS
Sbjct: 155 PPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSSHPP---LKS 202



 Score = 33.7 bits (77), Expect = 0.44
 Identities = 16/58 (27%), Positives = 24/58 (41%), Gaps = 3/58 (5%)

Query: 557 KPEAPSSCI--PPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPR 612
           +P  P+  +   PPPP A  P  PP  + +     + P P      P ++  KS  P 
Sbjct: 143 QPPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAP-KSSHPP 199


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 38.0 bits (89), Expect = 0.030
 Identities = 32/97 (32%), Positives = 51/97 (52%), Gaps = 10/97 (10%)

Query: 443 KKADEL---EMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQ 499
           KKA EL   E+E + L N++ KK +E  +KTQ +  L E L  IK  L  E     + K 
Sbjct: 192 KKALELLKKELELLKLQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEK----DDKD 247

Query: 500 KVAELEDRISELKHRADFE---QDERKRLENLINSGS 533
           ++ +L++++ ELK   + +   + E  +L  L  S S
Sbjct: 248 ELEKLKEKLEELKLPEEVKKVIEKELTKLSLLEPSSS 284


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 37.4 bits (87), Expect = 0.033
 Identities = 25/104 (24%), Positives = 30/104 (28%), Gaps = 10/104 (9%)

Query: 518 EQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPP 577
           ++++        NS +      A   S S      SSP  P  PS    P  P  S  PP
Sbjct: 155 DEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPP 214

Query: 578 PPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPYFPNVSVQ 621
            P    SD P             P      S  P P  P     
Sbjct: 215 APSSFQSDTPP----------PSPESPTNPSPPPGPAAPPPPPV 248


>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
          Length = 463

 Score = 37.8 bits (87), Expect = 0.033
 Identities = 28/104 (26%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 526 ENLINSG-SVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLP--------- 575
           E L+  G +V   TK   +S+S       +P + + P +  P P PPA            
Sbjct: 147 EFLVKEGDTVEPGTKVAIISKSEDAASQVTPSQ-KIPETTDPKPSPPAEDKQKPKVESAP 205

Query: 576 ---------PPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRI 610
                     PPPP++S+  P L    P  R R+ P ++L+ R+
Sbjct: 206 VAEKPKAPSSPPPPKQSAKEPQLP---PKERERRVPMTRLRKRV 246


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 37.8 bits (88), Expect = 0.034
 Identities = 28/150 (18%), Positives = 63/150 (42%), Gaps = 6/150 (4%)

Query: 399 LDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNR 458
           L +L +    Q E   + +   L+  + ++      +E +   K +  E+E   ++    
Sbjct: 215 LKKLEEIQEEQEEEELEQEIEALEERLAELEEE---KERLEELKARLLEIESLELEALKI 271

Query: 459 LAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKH-RADF 517
             ++ +EL++  +E E+  E L  ++  +E+        +  + ELE+ + +LK      
Sbjct: 272 REEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERL 331

Query: 518 EQDERKR--LENLINSGSVPDDTKAVGLSR 545
           E+ E K   LE+ +   +   +  A  L  
Sbjct: 332 EKLEEKLEKLESELEELAEEKNELAKLLEE 361



 Score = 36.7 bits (85), Expect = 0.073
 Identities = 38/155 (24%), Positives = 68/155 (43%), Gaps = 18/155 (11%)

Query: 394 QHWLLLDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELE---- 449
           Q    L+    +L  + E  E+    LL+I   ++  L  +EEE+   ++  +ELE    
Sbjct: 232 QEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIE 291

Query: 450 --MENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELED- 506
              E       L ++ + L    +E E+L E L  +++RLEK      + + ++ EL + 
Sbjct: 292 RLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEE 351

Query: 507 ----------RISELKHRAD-FEQDERKRLENLIN 530
                     R+ EL+ R +  E++  K LE L  
Sbjct: 352 KNELAKLLEERLKELEERLEELEKELEKALERLKQ 386



 Score = 35.9 bits (83), Expect = 0.15
 Identities = 30/141 (21%), Positives = 62/141 (43%), Gaps = 11/141 (7%)

Query: 394 QHWLLLDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENV 453
               LL+ L + L       E  + +  K+   +       EE      + A  LE    
Sbjct: 309 GLRALLEELEELLEKLKSLEERLEKLEEKLEKLES----ELEELAEEKNELAKLLEERLK 364

Query: 454 DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQ-------KVAELED 506
           +L  RL + E+EL++  +  + L+E++  +K+ L + +A   E ++       ++ ELE 
Sbjct: 365 ELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELER 424

Query: 507 RISELKHRADFEQDERKRLEN 527
            + EL+      +++  +LE+
Sbjct: 425 ELEELEEEIKKLEEQINQLES 445



 Score = 32.8 bits (75), Expect = 1.2
 Identities = 31/111 (27%), Positives = 56/111 (50%), Gaps = 10/111 (9%)

Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
           L+  + +        EEI   +K+  ELE E ++L       ++EL++K ++ E+L E L
Sbjct: 486 LEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEEL 545

Query: 481 SRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
             +K++L+ +     + K+++ +LEDR+ ELK     E  E  RL      
Sbjct: 546 EELKEKLQLQ-----QLKEELRQLEDRLQELK-----ELLEELRLLRTRKE 586



 Score = 32.4 bits (74), Expect = 1.5
 Identities = 24/106 (22%), Positives = 48/106 (45%), Gaps = 5/106 (4%)

Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKE-----QELDQKTQEKEDLDESLSRIK 484
            L  ++EE+   +++  EL+ +  +L  RL++ E      EL +   E E+ +E L    
Sbjct: 580 LLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESEL 639

Query: 485 DRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
           ++L  +       +  + ELE+++ EL+     E    +  E L  
Sbjct: 640 EKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEE 685



 Score = 32.0 bits (73), Expect = 2.1
 Identities = 29/105 (27%), Positives = 51/105 (48%), Gaps = 14/105 (13%)

Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
           L A EEE+   KK  +  E +       L ++ + L+++  E E+  E L  +K RL + 
Sbjct: 205 LEALEEELKELKKLEEIQEEQEE---EELEQEIEALEERLAELEEEKERLEELKARLLEI 261

Query: 491 TAGHLEA----KQKVAELEDRISELKHRADFEQDERKRLENLINS 531
            +  LEA    ++++ ELE  + EL       +++ +RLE L   
Sbjct: 262 ESLELEALKIREEELRELERLLEEL-------EEKIERLEELERE 299



 Score = 31.7 bits (72), Expect = 3.1
 Identities = 29/126 (23%), Positives = 56/126 (44%), Gaps = 15/126 (11%)

Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDE-- 478
           L+   +++  L  + E + A  ++ +EL  +   L  RL K E++L++   E E+L E  
Sbjct: 293 LEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEK 352

Query: 479 ------------SLSRIKDRLEKETAGHLEAKQKVAE-LEDRISELKHRADFEQDERKRL 525
                        L    + LEKE    LE  +++ E +++   EL   +   ++ ++ L
Sbjct: 353 NELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEEL 412

Query: 526 ENLINS 531
           E L   
Sbjct: 413 EELEKE 418



 Score = 31.3 bits (71), Expect = 3.9
 Identities = 29/115 (25%), Positives = 53/115 (46%), Gaps = 16/115 (13%)

Query: 398 LLDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSN 457
            L++L  Q  L     E+     L+   +KV  L A+    +   +  ++LE    +   
Sbjct: 638 ELEKLNLQAEL-----EELLQAALEELEEKVEELEAEIRRELQRIENEEQLE----EKLE 688

Query: 458 RLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELK 512
            L + E+EL+Q  +E E+L + L  I+  +E+        + + AELE+   EL+
Sbjct: 689 ELEQLEEELEQLREELEELLKKLGEIEQLIEE-------LESRKAELEELKKELE 736



 Score = 30.1 bits (68), Expect = 8.5
 Identities = 28/165 (16%), Positives = 65/165 (39%), Gaps = 32/165 (19%)

Query: 398 LLDRLAQQLTLQTETGEDYDHIL----------LKINVKKVVHLLAKEEEIVAAKKKADE 447
           L +   +   L  E  ++ +  L          L+   +    +   +EE+       +E
Sbjct: 348 LAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEE 407

Query: 448 LEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK--ETAGHLEA-------- 497
           ++ E  +L   L + E+EL++  +E + L+E +++++ +     E AG  E         
Sbjct: 408 IQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQEL 467

Query: 498 ------------KQKVAELEDRISELKHRADFEQDERKRLENLIN 530
                       + ++ ELE+ +S  K  A+  ++  +  + L  
Sbjct: 468 PEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRE 512


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 37.3 bits (87), Expect = 0.034
 Identities = 18/79 (22%), Positives = 36/79 (45%), Gaps = 14/79 (17%)

Query: 455 LSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHR 514
           LS  ++ K+  LD+   +  +L + LS     LE++            +L+D ++ L+  
Sbjct: 44  LSREISGKDSALDRLNSQIAELADLLS-----LERQGN---------QDLQDSVANLRAS 89

Query: 515 ADFEQDERKRLENLINSGS 533
               + ER RL+ L+   +
Sbjct: 90  LSAAEAERSRLQALLAELA 108


>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729).  This
           family of proteins is found in viruses. Proteins in this
           family are typically between 145 and 1707 amino acids in
           length. The family is found in association with
           pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
           There is a single completely conserved residue L that
           may be functionally important.
          Length = 115

 Score = 35.0 bits (81), Expect = 0.036
 Identities = 10/31 (32%), Positives = 12/31 (38%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRR 582
                 P  P S + PP P    P PPP  +
Sbjct: 77  EPPVPGPAGPPSPLAPPAPARKPPLPPPRPQ 107



 Score = 32.3 bits (74), Expect = 0.35
 Identities = 18/68 (26%), Positives = 22/68 (32%), Gaps = 12/68 (17%)

Query: 535 PDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPA-----SLPPPPPPRRSSDLPYL 589
           PDD   VG  R    + P        P S +   PPP+        P PP    +  P  
Sbjct: 37  PDDPPPVGDPRPPVVDTPP-------PVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSP 89

Query: 590 TCPRPLVR 597
             P    R
Sbjct: 90  LAPPAPAR 97



 Score = 31.2 bits (71), Expect = 0.94
 Identities = 15/36 (41%), Positives = 16/36 (44%), Gaps = 4/36 (11%)

Query: 551 EPSSPRKPEAPSSCIPP----PPPPASLPPPPPPRR 582
            P    +P  P    PP    PP PA  PP PPPR 
Sbjct: 71  APPPDPEPPVPGPAGPPSPLAPPAPARKPPLPPPRP 106



 Score = 30.0 bits (68), Expect = 2.4
 Identities = 15/33 (45%), Positives = 15/33 (45%), Gaps = 1/33 (3%)

Query: 551 EPSSPRKPEAPSSCIPPPPP-PASLPPPPPPRR 582
           EP  P     PS   PP P     LPPP P RR
Sbjct: 77  EPPVPGPAGPPSPLAPPAPARKPPLPPPRPQRR 109



 Score = 29.6 bits (67), Expect = 2.7
 Identities = 16/52 (30%), Positives = 17/52 (32%), Gaps = 10/52 (19%)

Query: 562 SSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRP 613
           SSC  PP       PPP             PRP V    PP S +    P  
Sbjct: 26  SSCFSPPESAHPDDPPPVG----------DPRPPVVDTPPPVSAVWVLPPPS 67



 Score = 29.3 bits (66), Expect = 4.1
 Identities = 13/38 (34%), Positives = 13/38 (34%), Gaps = 6/38 (15%)

Query: 543 LSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPP 580
            S SG     S       P S  P  PPP   P PP  
Sbjct: 20  WSTSGFSSCFS------PPESAHPDDPPPVGDPRPPVV 51


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 37.8 bits (88), Expect = 0.036
 Identities = 19/99 (19%), Positives = 44/99 (44%), Gaps = 11/99 (11%)

Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
             E+ I   ++  + LE EN +L       ++EL++  +E E L+  L R +  +  +  
Sbjct: 419 VYEKRIKKLEETVERLEEENSEL-------KRELEELKREIEKLESELERFRREVRDK-- 469

Query: 493 GHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
              + + ++   + RI  L+   + ++   + LE  +  
Sbjct: 470 -VRKDR-EIRARDRRIERLEKELEEKKKRVEELERKLAE 506



 Score = 36.6 bits (85), Expect = 0.083
 Identities = 26/78 (33%), Positives = 45/78 (57%), Gaps = 4/78 (5%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           EEE    K++ +EL+ E   L + L +  +E+  K ++  ++     RI +RLEKE    
Sbjct: 435 EEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRI-ERLEKELE-- 491

Query: 495 LEAKQKVAELEDRISELK 512
            E K++V ELE +++EL+
Sbjct: 492 -EKKKRVEELERKLAELR 508



 Score = 30.4 bits (69), Expect = 6.3
 Identities = 17/130 (13%), Positives = 46/130 (35%), Gaps = 14/130 (10%)

Query: 399 LDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNR 458
           ++R   +L +  +       ++    + + +  + +EE     +   +E   E      R
Sbjct: 364 VERKLPELGIWKDVERIKALVIRGYPLAEALSKVKEEERPREKEGTEEEERREITVYEKR 423

Query: 459 LAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFE 518
           + K E+ +++  +E  +L   L  +K  +EK              LE  +   +     +
Sbjct: 424 IKKLEETVERLEEENSELKRELEELKREIEK--------------LESELERFRREVRDK 469

Query: 519 QDERKRLENL 528
             + + +   
Sbjct: 470 VRKDREIRAR 479


>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
           terminal.  This domain family is found in eukaryotes,
           and is typically between 177 and 207 amino acids in
           length. The family is found in association with
           pfam00778, pfam02377, pfam00610, pfam00595. The segment
           polarity gene dishevelled (dsh) is required for pattern
           formation of the embryonic segments. It is involved in
           the determination of body organisation through the
           Wingless pathway (analogous to the Wnt-1 pathway).
          Length = 202

 Score = 36.5 bits (84), Expect = 0.038
 Identities = 28/91 (30%), Positives = 32/91 (35%), Gaps = 22/91 (24%)

Query: 508 ISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSS-----PRKPEAPS 562
             E    AD  +  RK       SGS  + T   G  RSGGR  PS      P +    S
Sbjct: 83  DGERSRAADGREGGRKSG----GSGSESEHTSRSGSRRSGGRRAPSERSGPPPSEGSVRS 138

Query: 563 SC-------------IPPPPPPASLPPPPPP 580
           S              +PPP  P  L   PPP
Sbjct: 139 SLSHPSSHSSYGAPGVPPPYNPPMLMMKPPP 169


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 36.4 bits (85), Expect = 0.038
 Identities = 32/107 (29%), Positives = 47/107 (43%), Gaps = 14/107 (13%)

Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVD-------LSNRLAKKEQELDQKTQEKEDLDE 478
           KK   L AKEE      K   E E E  +          RL +KE+ LD+K +  E  +E
Sbjct: 51  KKEALLEAKEE----IHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEE 106

Query: 479 SLSRIKDRLEKETAGHLEAKQKVAE--LEDRISELKHRADFEQDERK 523
           SL   +  L       LE K++  E  +E++  EL+  +    +E K
Sbjct: 107 SLEEKEKELAARQQ-QLEEKEEELEELIEEQQQELERISGLTAEEAK 152


>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
          Length = 209

 Score = 36.3 bits (84), Expect = 0.043
 Identities = 15/77 (19%), Positives = 28/77 (36%), Gaps = 5/77 (6%)

Query: 542 GLSRSGGREEPSSPRKPEAPSSCIPP-----PPPPASLPPPPPPRRSSDLPYLTCPRPLV 596
            L ++    EP++  KP   ++ + P      PP   +P P P   ++       P+   
Sbjct: 95  DLQKATSMSEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPAVAAAAAQAAAAPKAPA 154

Query: 597 RCRQPPRSKLKSRIPRP 613
           + R           P+P
Sbjct: 155 KPRAKSPRPAAKAAPKP 171


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 37.4 bits (86), Expect = 0.053
 Identities = 13/51 (25%), Positives = 15/51 (29%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
              P       +  PPP  P S P  PP   S   P  +       C  P 
Sbjct: 453 AQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPG 503



 Score = 35.4 bits (81), Expect = 0.20
 Identities = 41/168 (24%), Positives = 62/168 (36%), Gaps = 19/168 (11%)

Query: 460 AKKEQELDQKTQEKEDLDESLSRIKD--RLEKETAGHLEAKQKVAELEDRISELKHRADF 517
           A   +E ++ T +K    E LSR       E E  G  E+    +  E+  S+ K   D 
Sbjct: 88  ASDTEEPERVTAKKSKTQE-LSRPNSPSEGEGEGEGEGESSDSRSVNEEGSSDPK---DI 143

Query: 518 EQDERKRLENLINSGSVPDDTKAVGLSRSGGREE-------PSSPRKPEAPSSCIPPPPP 570
           +QD R       +S S+P        S S  +++       PS    P A  +   PPP 
Sbjct: 144 DQDNRS------SSPSIPSPQDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPT 197

Query: 571 PASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPYFPNV 618
           P++   PP     +  P     +P         S    R+P P+ P  
Sbjct: 198 PSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQ 245



 Score = 33.9 bits (77), Expect = 0.59
 Identities = 20/67 (29%), Positives = 24/67 (35%), Gaps = 12/67 (17%)

Query: 533 SVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASL------------PPPPPP 580
           S+P  T A     S G + P S        +   PP PP  +             PPPPP
Sbjct: 470 SLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPIQIKEEPLDEAEEPESPPPPP 529

Query: 581 RRSSDLP 587
           R  S  P
Sbjct: 530 RSPSPEP 536


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 34.9 bits (81), Expect = 0.062
 Identities = 20/101 (19%), Positives = 44/101 (43%), Gaps = 11/101 (10%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKE----DLDESLSRIKDRLEKE 490
           +  + A +++ +  E         L KKE+EL +   + +    + +    R + + E+E
Sbjct: 13  QLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEE 72

Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
                E ++++ EL+  + ELK        E ++LE  +  
Sbjct: 73  KKLRKEKEEEIKELKAELEELKA-------EIEKLEEKLEE 106


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 36.9 bits (86), Expect = 0.065
 Identities = 26/80 (32%), Positives = 40/80 (50%), Gaps = 14/80 (17%)

Query: 447 ELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELED 506
           ELE +  +L N+L +KE EL Q              +  +LE E A   + ++K+ ELE 
Sbjct: 1   ELERQKRELENQLYRKESELSQ--------------LSSKLEDEQALVAQLQKKIKELEA 46

Query: 507 RISELKHRADFEQDERKRLE 526
           RI EL+   + E+  R + E
Sbjct: 47  RIRELEEELEAERAARAKAE 66



 Score = 32.7 bits (75), Expect = 1.3
 Identities = 21/91 (23%), Positives = 46/91 (50%), Gaps = 3/91 (3%)

Query: 437 EIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLE 496
           ++ + K +   L+ EN DL+ +L + E ++   ++ K  L+  L   K  LE+E+     
Sbjct: 205 DLTSQKSR---LQSENSDLTRQLEEAEAQVSNLSKLKSQLESQLEEAKRSLEEESRERAN 261

Query: 497 AKQKVAELEDRISELKHRADFEQDERKRLEN 527
            + ++ +LE  +  L+ + + E + +  LE 
Sbjct: 262 LQAQLRQLEHDLDSLREQLEEESEAKAELER 292



 Score = 31.6 bits (72), Expect = 3.0
 Identities = 38/137 (27%), Positives = 61/137 (44%), Gaps = 22/137 (16%)

Query: 399 LDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNR 458
           LD L +QL  ++E   + +  L K N + +    +K E   A +  A+ELE    +L  +
Sbjct: 273 LDSLREQLEEESEAKAELERQLSKANAE-IQQWRSKFESEGALR--AEELE----ELKKK 325

Query: 459 LAKKEQELDQKTQEKEDLDESLSRIKDRLEKE-------------TAGHLEAKQKVAELE 505
           L +K  EL++  +      +SL + K RL+ E              A  LE KQK    +
Sbjct: 326 LNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAAASELEKKQK--NFD 383

Query: 506 DRISELKHRADFEQDER 522
             ++E K + D  Q E 
Sbjct: 384 KILAEWKRKVDELQAEL 400



 Score = 30.0 bits (68), Expect = 9.5
 Identities = 26/94 (27%), Positives = 46/94 (48%), Gaps = 5/94 (5%)

Query: 436 EEIVAAKKKADELEMENVDLSNRLA---KKEQELDQKTQE-KEDLDESLSRIKDRLEK-E 490
           +E V   K A+E   +    + RLA   ++EQE  Q  +  ++ L+  +  ++ RL++ E
Sbjct: 675 DEAVNELKAAEERAKKAQADAARLAEELRQEQEHSQHLERLRKQLESQVKELQVRLDEAE 734

Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKR 524
            A     K+ + +LE R+ EL+   D EQ     
Sbjct: 735 AAALKGGKKMIQKLEARVRELEAELDGEQRRHAE 768


>gnl|CDD|204194 pfam09304, Cortex-I_coil, Cortexillin I, coiled coil.  Members of
           this family are predominantly found in the
           actin-bundling protein Cortexillin I from Dictyostelium
           discoideum. They adopt a structure consisting of an
           18-heptad-repeat alpha-helical coiled-coil, and are a
           prerequisite for the assembly of Cortexillin I.
          Length = 107

 Score = 34.2 bits (78), Expect = 0.081
 Identities = 23/85 (27%), Positives = 44/85 (51%)

Query: 442 KKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKV 501
           K++   LE    +L+N+LA  E  L+ +   +E+L +   +++  L    A   E +Q++
Sbjct: 1   KEEKARLEASKNELANKLAGLENSLESEKASQEELAKQKDQLRSLLASLEAQKAEREQRL 60

Query: 502 AELEDRISELKHRADFEQDERKRLE 526
            EL+ +I EL    + E+  +  LE
Sbjct: 61  RELQAKIDELLKNLELEKLAKMELE 85


>gnl|CDD|165528 PHA03270, PHA03270, envelope glycoprotein C; Provisional.
          Length = 466

 Score = 36.5 bits (84), Expect = 0.081
 Identities = 24/76 (31%), Positives = 31/76 (40%), Gaps = 2/76 (2%)

Query: 538 TKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVR 597
           + A     SG       PR    P+     P  PAS P   PP+     P+  C RP V 
Sbjct: 31  SNASEAPTSGSPGSAEGPRTTPTPTRGKGTPTGPASPPKSGPPKSPPA-PF-RCKRPDVL 88

Query: 598 CRQPPRSKLKSRIPRP 613
            R   R +++ R PRP
Sbjct: 89  ARYGSRVQIRCRFPRP 104



 Score = 33.4 bits (76), Expect = 0.72
 Identities = 26/108 (24%), Positives = 40/108 (37%), Gaps = 8/108 (7%)

Query: 542 GLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQP 601
            +S +       SP   E P +   P     +   P  P +S   P  + P P  RC++P
Sbjct: 29  AVSNASEAPTSGSPGSAEGPRTTPTPTRGKGTPTGPASPPKSG--PPKSPPAP-FRCKRP 85

Query: 602 -PRSKLKSRIP-RPYFPNVSVQVNGTIW---TELDESKLYASMELEVV 644
              ++  SR+  R  FP  + +    IW       E    A +E   V
Sbjct: 86  DVLARYGSRVQIRCRFPRPTAEFRLQIWRVAAAPSEEIRPAPIEPGAV 133


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 36.7 bits (84), Expect = 0.087
 Identities = 33/94 (35%), Positives = 49/94 (52%), Gaps = 10/94 (10%)

Query: 426  KKVVHLLAKEEEIVAAKKKADEL---EMENVDLSNRLAKKEQELDQKTQEKEDLDESLSR 482
            KKV  L  KE E    KKKA+EL   E EN   +   AKK +E  +K +E +  +E   +
Sbjct: 1633 KKVEQLKKKEAE---EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689

Query: 483  IKDRLEKETAGHLEAKQKVAELEDRISELKHRAD 516
              + L+KE     E  +K  EL+ + +E K +A+
Sbjct: 1690 AAEALKKEA----EEAKKAEELKKKEAEEKKKAE 1719



 Score = 33.6 bits (76), Expect = 0.78
 Identities = 25/93 (26%), Positives = 36/93 (38%), Gaps = 13/93 (13%)

Query: 425  VKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIK 484
             KK      K EE     KKADE + +    +    KK     +K +E +   E      
Sbjct: 1301 KKKADEAKKKAEE----AKKADEAKKK----AEEAKKKADAAKKKAEEAKKAAE----AA 1348

Query: 485  DRLEKETAGHLEAKQKVAEL-EDRISELKHRAD 516
                +  A   EA ++ AE  E +  E K +AD
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381



 Score = 33.2 bits (75), Expect = 1.0
 Identities = 31/103 (30%), Positives = 44/103 (42%), Gaps = 10/103 (9%)

Query: 426  KKVVHLLAKEEEIVAAKKKADELE--MENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI 483
            KK      K EE   AKKKAD  +   E    +   AK E E      E  +     +  
Sbjct: 1315 KKADEAKKKAEE---AKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK 1371

Query: 484  KDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
            K    K+ A   +A +K AE + +  E K +A  E+D++K  E
Sbjct: 1372 KKEEAKKKA---DAAKKKAEEKKKADEAKKKA--EEDKKKADE 1409



 Score = 32.4 bits (73), Expect = 1.9
 Identities = 32/112 (28%), Positives = 50/112 (44%), Gaps = 10/112 (8%)

Query: 425  VKKVVHLLAKEEEIVAA---KKKADEL-EMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
             KK      K EE   A   KKKA+E  + E        AKK  E  +K +E +  DE+ 
Sbjct: 1430 KKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAK 1489

Query: 481  SRIKDRLEK--ETAGHLEAKQKVAELEDRISELKHRAD--FEQDERKRLENL 528
             + ++  +K  E     EAK+K  E   + +E   +AD   + +E K+ +  
Sbjct: 1490 KKAEEAKKKADEAKKAAEAKKKADEA--KKAEEAKKADEAKKAEEAKKADEA 1539



 Score = 32.0 bits (72), Expect = 2.3
 Identities = 27/98 (27%), Positives = 42/98 (42%), Gaps = 6/98 (6%)

Query: 433  AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
            A E E    K +A E + E        AKK+ E  +K  E +   E   +  D L+K  A
Sbjct: 1356 ADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAA 1415

Query: 493  GHLEAKQ--KVAELEDRISELKHRADFEQDERKRLENL 528
               +A +  K AE + +  E K +A    +E K+ +  
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKA----EEAKKADEA 1449



 Score = 31.6 bits (71), Expect = 2.8
 Identities = 27/92 (29%), Positives = 41/92 (44%), Gaps = 8/92 (8%)

Query: 425  VKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIK 484
             KK      K EE    KKKADEL+           KK  E  +K +EK+  DE+  + +
Sbjct: 1390 KKKADEAKKKAEE---DKKKADELKKAAAA-----KKKADEAKKKAEEKKKADEAKKKAE 1441

Query: 485  DRLEKETAGHLEAKQKVAELEDRISELKHRAD 516
            +  + + A     + K AE   + +E   +AD
Sbjct: 1442 EAKKADEAKKKAEEAKKAEEAKKKAEEAKKAD 1473



 Score = 30.9 bits (69), Expect = 5.4
 Identities = 27/103 (26%), Positives = 44/103 (42%), Gaps = 13/103 (12%)

Query: 424  NVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI 483
            N  K      K EE    KKKA+E +    D      K  + L ++ +E +  +E    +
Sbjct: 1659 NKIKAAEEAKKAEE---DKKKAEEAKKAEEDEK----KAAEALKKEAEEAKKAEE----L 1707

Query: 484  KDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
            K +  +E     E K+   E + +  E K  A  E+D++K  E
Sbjct: 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEA--EEDKKKAEE 1748



 Score = 30.1 bits (67), Expect = 9.3
 Identities = 31/120 (25%), Positives = 52/120 (43%), Gaps = 11/120 (9%)

Query: 426  KKVVHLLAKEEEIVAAKKKADEL-EMENVDLSNRLAKKEQ----ELDQKTQEKEDLDESL 480
            KK      K  E   AKKKADE  + E    ++   K E+    +  +K +EK+  DE L
Sbjct: 1496 KKKADEAKKAAE---AKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADE-L 1551

Query: 481  SRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKA 540
             + ++  + E     E  +K    ED+   L+   + ++ E  R+E ++         KA
Sbjct: 1552 KKAEELKKAEEKKKAEEAKK--AEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKA 1609


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 36.3 bits (85), Expect = 0.087
 Identities = 28/118 (23%), Positives = 49/118 (41%), Gaps = 26/118 (22%)

Query: 436 EEIVAAKKK---ADELEMENV-----DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRL 487
           E I+   KK    D+ ++  +     +L   L +K +E +   +E E L E L   K++L
Sbjct: 501 ENIIEEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKL 560

Query: 488 EKETAGHL-EAKQKV--------AELEDRISELKHRADFE---------QDERKRLEN 527
           ++E    L EA+++          E ++ I EL+                + RKRL  
Sbjct: 561 QEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNK 618


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 36.5 bits (84), Expect = 0.088
 Identities = 35/183 (19%), Positives = 56/183 (30%), Gaps = 22/183 (12%)

Query: 476 LDESLSRIKDRLEKETAGHLEAK--QKVAELEDRISELKHRADFEQDERKRLENLINSGS 533
           L E    +K+R ++   G   A   Q    L     ELK +  FE   R  L  L     
Sbjct: 316 LIERPESVKERYQQSAGGLTPAAVMQMTDFLMKTQGELKFQ--FEYQFRFELALL----- 368

Query: 534 VPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPR 593
                + + L R+ G   PS    P+       P  P     P P    +         R
Sbjct: 369 -----RLIELVRNDGGVAPSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGA------R 417

Query: 594 PLVRCRQPPRSKLKSRIPRPYFPNVSVQVNGTIWTELDESKLYASMELEVVDKLFCAYQK 653
           P      P  +    + P P   +  +  +       + +     ++L      F  + +
Sbjct: 418 PAELP-SPASAPTPEQQP-PVARSAPLPPSPQASAPRNVASGKPGVDLGSWQGKFMNFTR 475

Query: 654 NGV 656
           NG 
Sbjct: 476 NGS 478


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 36.5 bits (85), Expect = 0.10
 Identities = 22/82 (26%), Positives = 36/82 (43%), Gaps = 10/82 (12%)

Query: 451  ENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAK-------QKVAE 503
            +N DL+ +L    Q L+Q  QE+    E L + + +L +        K       Q + E
Sbjct: 982  KNSDLNEKL---RQRLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQE 1038

Query: 504  LEDRISELKHRADFEQDERKRL 525
            L+  + +L   AD   +ER R 
Sbjct: 1039 LKQELQDLGVPADSGAEERARA 1060


>gnl|CDD|148844 pfam07469, DUF1518, Domain of unknown function (DUF1518).  This
           domain, which is usually found tandemly repeated, is
           found various receptor co-activating proteins.
          Length = 56

 Score = 32.1 bits (73), Expect = 0.11
 Identities = 15/50 (30%), Positives = 19/50 (38%), Gaps = 4/50 (8%)

Query: 553 SSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
            +PR P+AP    P PP       P P   S   P      P++  R  P
Sbjct: 7   GNPRMPQAPPQQFPYPPNYGMGQQPDPAFTSPFSP----QSPMMSPRMGP 52


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 35.7 bits (82), Expect = 0.12
 Identities = 27/122 (22%), Positives = 44/122 (36%), Gaps = 9/122 (7%)

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSS 554
           L  KQ++AE +  +S+L+   +  Q          +S  V D TK         + E + 
Sbjct: 11  LRDKQEIAETQKELSKLQLSHEEAQSSEA------HSFHV-DSTKQPPAPEQVAKHELAD 63

Query: 555 PRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPY 614
              P    +   PP P    P P   ++S   P    P  L   +     +  +    PY
Sbjct: 64  --APLQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPY 121

Query: 615 FP 616
           +P
Sbjct: 122 YP 123



 Score = 34.1 bits (78), Expect = 0.39
 Identities = 17/68 (25%), Positives = 27/68 (39%), Gaps = 3/68 (4%)

Query: 552 PSSPRKPEAPSSCIPPPPP---PASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKS 608
           P+  ++P  P    P PPP   P +  P PPP+      Y + P+     + PP     +
Sbjct: 114 PTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSA 173

Query: 609 RIPRPYFP 616
                 +P
Sbjct: 174 PQVSGLYP 181


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 36.3 bits (83), Expect = 0.12
 Identities = 38/125 (30%), Positives = 66/125 (52%), Gaps = 12/125 (9%)

Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLD-ESLSRIK 484
           K+  H  A E +I+  KK+ + LE E  +   R  ++ + ++++  E+E L+ E L R  
Sbjct: 442 KENAHRKALEMKILE-KKRIERLEREERERLER--ERMERIERERLERERLERERLER-- 496

Query: 485 DRLEKETAGHLEAKQKVAELE-DRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGL 543
           DRLE++    LE +++V  LE DR+ + +  + F     K +EN +++G  P D   VG 
Sbjct: 497 DRLERDRLDRLE-RERVDRLERDRLEKARRNSYF----LKGMENGLSAGGGPGDGPGVGA 551

Query: 544 SRSGG 548
               G
Sbjct: 552 GVGAG 556


>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
          Length = 428

 Score = 35.8 bits (83), Expect = 0.13
 Identities = 31/138 (22%), Positives = 66/138 (47%), Gaps = 17/138 (12%)

Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDE-------SLSRIKD 485
           A++E I   K+  +EL  +  +L  + ++++  L ++  +++ L++       +L+ ++ 
Sbjct: 167 ARQETIAELKQTREELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNERKKTLTGLES 226

Query: 486 RLEKETAGHLEAKQKVAELEDRIS----ELKHRADFEQDE----RKRLENLINSGSV--P 535
            L+K+     E +   + L D I+    E K RA+ E  E    R + +     GS   P
Sbjct: 227 SLQKDQQQLSELRANESRLRDSIARAEREAKARAEREAREAARVRDKQKQAKRKGSTYKP 286

Query: 536 DDTKAVGLSRSGGREEPS 553
            +++   +SR+GG   P 
Sbjct: 287 TESERSLMSRTGGLGRPR 304


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 35.8 bits (83), Expect = 0.14
 Identities = 32/137 (23%), Positives = 60/137 (43%), Gaps = 28/137 (20%)

Query: 399 LDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSN- 457
           LD+L+++L       E+   +   +  K+   L   EEE++A  +  +    + + L   
Sbjct: 172 LDQLSKKLAELKAEEEE--ELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFE 229

Query: 458 ----RLAKK-EQELDQKTQ-EKEDLDESLSR----------------IKDRLEKETAGHL 495
                L KK E++L Q+ + + E  ++ L                  IK+++E+E  G L
Sbjct: 230 REKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIELQREFNKEIKEKVEEERNGRL 289

Query: 496 EAKQKVAELEDRISELK 512
               K+AEL  R+  L+
Sbjct: 290 A---KLAELNSRLKGLE 303


>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
          Length = 318

 Score = 35.3 bits (81), Expect = 0.14
 Identities = 16/48 (33%), Positives = 20/48 (41%), Gaps = 3/48 (6%)

Query: 556 RKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPR 603
           ++  + S   P PP   S P P PPR     P  T P P     +PP 
Sbjct: 2   KRARSRS---PSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPP 46



 Score = 29.9 bits (67), Expect = 8.3
 Identities = 18/52 (34%), Positives = 20/52 (38%), Gaps = 13/52 (25%)

Query: 544 SRSGGREEPSSPRKPEAPSS-------CIPPPPPPASLPP------PPPPRR 582
           SRS       SP +P  P S         P PP P + PP      P  PRR
Sbjct: 6   SRSPSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRR 57


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 35.3 bits (81), Expect = 0.16
 Identities = 18/72 (25%), Positives = 37/72 (51%)

Query: 442 KKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKV 501
           K+K +EL+ E  +L   L + E E ++  +  + L+   SR+++ L+K      + K++ 
Sbjct: 141 KEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRW 200

Query: 502 AELEDRISELKH 513
            ELE  +   + 
Sbjct: 201 DELEPGVELPEE 212



 Score = 32.2 bits (73), Expect = 1.2
 Identities = 30/144 (20%), Positives = 62/144 (43%), Gaps = 17/144 (11%)

Query: 413 GEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQE 472
           G D +HI +++  K++      +  +V A  K  +   E +DL     + +++L++  +E
Sbjct: 97  GHDVEHIDVELVRKEL------KNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKE 150

Query: 473 KEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDE----RKRLENL 528
           KE+L + L  ++   E       E ++++  LE   S L+        E    +KR + L
Sbjct: 151 KEELLKELEELEAEYE-------EVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDEL 203

Query: 529 INSGSVPDDTKAVGLSRSGGREEP 552
                +P++     L +      P
Sbjct: 204 EPGVELPEEELISDLVKETLNLAP 227


>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
          Length = 280

 Score = 35.2 bits (80), Expect = 0.17
 Identities = 15/47 (31%), Positives = 20/47 (42%), Gaps = 2/47 (4%)

Query: 556 RKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
           ++P   S   P P   A  P  P    ++  P +TCP P   C  PP
Sbjct: 74  QRPSGQSPLAPSPACAAPAPACPACAPAAPAPAVTCPAPAPAC--PP 118



 Score = 32.1 bits (72), Expect = 1.4
 Identities = 21/70 (30%), Positives = 24/70 (34%), Gaps = 2/70 (2%)

Query: 554 SPRKPEAPSSCIPPPPP--PASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIP 611
           S + P APS     P P  PA  P  P P  +   P   CP        PP        P
Sbjct: 77  SGQSPLAPSPACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARP 136

Query: 612 RPYFPNVSVQ 621
            P  P  + Q
Sbjct: 137 APACPPSTRQ 146



 Score = 30.6 bits (68), Expect = 4.0
 Identities = 20/72 (27%), Positives = 29/72 (40%), Gaps = 7/72 (9%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPP----PPPPRRSSDLPYLTCPRPLVRCRQPPRSKLK 607
           P+    P    +C PP   PA   P    PP  R+    P L  P+P    +      L 
Sbjct: 112 PAPACPPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTPKPAPAAKP---IFLH 168

Query: 608 SRIPRPYFPNVS 619
           +++P P +P  S
Sbjct: 169 NQLPPPDYPAAS 180



 Score = 30.6 bits (68), Expect = 4.2
 Identities = 21/77 (27%), Positives = 27/77 (35%), Gaps = 17/77 (22%)

Query: 552 PSSPRKPEAPSSCIPPPP-------------PPASLP--PPPPPRRSSDLPYLTCPRPLV 596
           P+      A  +C P  P             PPA+ P  PPP    +   P   CP    
Sbjct: 86  PACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPACPPSTR 145

Query: 597 RCRQPPRSKLKSRIPRP 613
           +C  PP   L +  P P
Sbjct: 146 QC--PPAPPLPTPKPAP 160


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 33.6 bits (77), Expect = 0.17
 Identities = 17/54 (31%), Positives = 26/54 (48%), Gaps = 2/54 (3%)

Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRS 604
           EP +P   ++P+  +P PPP ++   PP P  +   P    P       QPPR+
Sbjct: 89  EPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPS--DPPQPPRA 140



 Score = 33.3 bits (76), Expect = 0.29
 Identities = 22/74 (29%), Positives = 28/74 (37%), Gaps = 4/74 (5%)

Query: 512 KHRADFEQDERKRLENLI---NSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPP 568
           +H    EQ+ RK L  L        V   T              S+P  P AP++ +P P
Sbjct: 66  EHILSMEQEARKILAPLTPPAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAP 125

Query: 569 -PPPASLPPPPPPR 581
              P    PP PPR
Sbjct: 126 AAAPPPSDPPQPPR 139


>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
           Provisional.
          Length = 1109

 Score = 35.8 bits (83), Expect = 0.17
 Identities = 25/79 (31%), Positives = 41/79 (51%), Gaps = 8/79 (10%)

Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNR--LAKKEQELDQKTQEKEDLDESLSRIKDRL- 487
           L A + E  A K   DELE+  +  +NR  LA+   EL +K    + LD  L  ++++L 
Sbjct: 175 LTALQAESAALKALVDELELAQLSANNRQELARLRSELAKK--RSQQLDAYLQALRNQLN 232

Query: 488 ---EKETAGHLEAKQKVAE 503
              ++E    LE+ + +AE
Sbjct: 233 SQRQREAERALESTELLAE 251


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 35.4 bits (81), Expect = 0.17
 Identities = 15/49 (30%), Positives = 20/49 (40%), Gaps = 5/49 (10%)

Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPP--PPPPRRSSDLPYLTCPRPLVR 597
           +P++     AP    P P P  +  P    P       PY+T   PLVR
Sbjct: 228 DPAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGDSGPYVT---PLVR 273


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 35.3 bits (81), Expect = 0.18
 Identities = 26/105 (24%), Positives = 48/105 (45%), Gaps = 14/105 (13%)

Query: 421 LKINVKKVVH-LLAKEEEIVAAKKKADELEM--ENVDLSNRLAKKEQELDQKTQEKE--- 474
           L++ +KK+ H L  KEEE    +K+A EL    E+   +  L    Q +  K    E   
Sbjct: 538 LEMEIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKETEVLMSALQAMQDKNLMLENSL 597

Query: 475 --------DLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISEL 511
                   DL  +L  +K +LE       + + ++ +L+ +I+++
Sbjct: 598 SAETRLKLDLFSALGDVKRQLEIAHGAIYKREAEIVDLKSKIADI 642



 Score = 31.0 bits (70), Expect = 4.4
 Identities = 22/92 (23%), Positives = 42/92 (45%), Gaps = 11/92 (11%)

Query: 437 EIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLE 496
           E+  A++   EL  +   L++     + +L Q  +E + L   L+ +    +K+      
Sbjct: 421 ELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAKQKD------ 474

Query: 497 AKQKVAELEDRI-SELKHRADFE---QDERKR 524
            KQ +  +E R+ SE   R + E    +E+KR
Sbjct: 475 -KQSMQSMEKRLKSEADSRVNAEKQLAEEKKR 505


>gnl|CDD|223428 COG0351, ThiD, Hydroxymethylpyrimidine/phosphomethylpyrimidine
           kinase [Coenzyme metabolism].
          Length = 263

 Score = 34.9 bits (81), Expect = 0.19
 Identities = 16/59 (27%), Positives = 25/59 (42%), Gaps = 11/59 (18%)

Query: 703 SKAILSMDSADQLPLDMLEQLLKF----TPNTEEAALL----EEHSTETLARADRFLYE 753
           S   L  + A +    + E+LL      TPN  EA  L    +  + E +  A + L+E
Sbjct: 112 SGDPLLDEEAVEA---LREELLPLATVVTPNLPEAEALSGLPKIKTEEDMKEAAKLLHE 167


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 34.7 bits (80), Expect = 0.23
 Identities = 16/83 (19%), Positives = 36/83 (43%), Gaps = 8/83 (9%)

Query: 454 DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHL-----EAKQKVAELEDRI 508
              N L +   +L +   E E+L     ++++ LE  T           K +VA  ++R+
Sbjct: 17  CARNSLYELRLDLARLLLENEELK---QKVEEALEGATNEDGKLAADLLKLEVARKKERL 73

Query: 509 SELKHRADFEQDERKRLENLINS 531
           ++++ R    ++E ++    I  
Sbjct: 74  NQIRARISQLKEEIEQKRERIEE 96



 Score = 29.3 bits (66), Expect = 10.0
 Identities = 11/69 (15%), Positives = 34/69 (49%), Gaps = 1/69 (1%)

Query: 447 ELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQK-VAELE 505
             +     +  R+++ ++E++QK +  E+L  +L++ +  L   +    + +   + +L+
Sbjct: 68  RKKERLNQIRARISQLKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQ 127

Query: 506 DRISELKHR 514
           D I   + +
Sbjct: 128 DEIKRTRSK 136


>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like.  This family is a set
           of eukaryotic tropomyosins. Within the yeast Tmp1 and
           Tmp2, biochemical and sequence analyses indicate that
           Tpm2p spans four actin monomers along a filament,
           whereas Tpmlp spans five. Despite its shorter length,
           Tpm2p can compete with Tpm1p for binding to F-actin.
           Over-expression of Tpm2p in vivo alters the axial
           budding of haploids to a bipolar pattern, and this can
           be partially suppressed by co-over-expression of Tpm1p.
           This suggests distinct functions for the two
           tropomyosins, and indicates that the ratio between them
           is important for correct morphogenesis. The family also
           contains higher eukaryote Tmp3 members.
          Length = 143

 Score = 33.3 bits (77), Expect = 0.25
 Identities = 19/73 (26%), Positives = 42/73 (57%), Gaps = 7/73 (9%)

Query: 440 AAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQ 499
            A+++A+ELE +  +L     +KEQE+    ++ + L+E + +++++L+       EAK+
Sbjct: 11  NAQERAEELEEKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQLK-------EAKE 63

Query: 500 KVAELEDRISELK 512
           K+ E E   +  +
Sbjct: 64  KLEESEKLATNAE 76



 Score = 31.0 bits (71), Expect = 1.4
 Identities = 30/102 (29%), Positives = 42/102 (41%), Gaps = 19/102 (18%)

Query: 431 LLAKEEEIVAAKKKADELEME----NVDLSNRLAKKEQELDQKTQEK----------EDL 476
            L KE+EI + +KK  +LE E       L     K E+     T  +          E+L
Sbjct: 30  NLEKEQEITSLQKKNQQLEEEVEKLEEQLKEAKEKLEESEKLATNAEALTRRIQLLEEEL 89

Query: 477 DESLSRIKDRLEK-----ETAGHLEAKQKVAELEDRISELKH 513
           +ES  R+K+  EK     + A   E K K  E E    E K+
Sbjct: 90  EESEKRLKETTEKLREADKKAEESERKVKALENERDEWEEKY 131


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 33.3 bits (77), Expect = 0.25
 Identities = 23/110 (20%), Positives = 47/110 (42%), Gaps = 19/110 (17%)

Query: 423 INVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSR 482
           ++V+K++      +E  A K    +LE E       L K E+EL +  ++       L +
Sbjct: 4   VDVQKIL------QESPAGKAAQKQLEKEFKKRQAELEKLEKELQKLKEK-------LQK 50

Query: 483 IKDRLEKETAGHLEAKQKVAELEDRISELKH-RADFEQDERKRLENLINS 531
               L +        ++K  EL+ ++ E +  +   +QD +KR +  +  
Sbjct: 51  DAATLSEA-----AREKKEKELQKKVQEFQRKQQKLQQDLQKRQQEELQK 95


>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
           carrier protein subunit; Validated.
          Length = 153

 Score = 33.3 bits (76), Expect = 0.26
 Identities = 15/58 (25%), Positives = 26/58 (44%), Gaps = 2/58 (3%)

Query: 537 DTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRP 594
           + K +G+  S  +E+  +P    AP+    P P   + P P P     ++  +T P P
Sbjct: 37  EAKGLGIDLSAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPASAGENV--VTAPMP 92


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 34.8 bits (80), Expect = 0.28
 Identities = 20/92 (21%), Positives = 30/92 (32%), Gaps = 14/92 (15%)

Query: 525 LENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPP--------------PP 570
           +E L+     P   K    + S  R  P+   +P+A ++   PP              P 
Sbjct: 356 IEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR 415

Query: 571 PASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
           P + P P  P  +  L     P        PP
Sbjct: 416 PVAPPVPHTPESAPKLTRAAIPVDEKPKYTPP 447


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 34.7 bits (79), Expect = 0.33
 Identities = 17/76 (22%), Positives = 26/76 (34%), Gaps = 2/76 (2%)

Query: 545 RSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPR-RSSDLP-YLTCPRPLVRCRQPP 602
           +    ++P  P+ PE P     P        P  P      D+P     P      ++PP
Sbjct: 578 KPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637

Query: 603 RSKLKSRIPRPYFPNV 618
             +  S   RP  P +
Sbjct: 638 PPQRPSSPERPEGPKI 653



 Score = 32.4 bits (73), Expect = 1.6
 Identities = 20/69 (28%), Positives = 25/69 (36%), Gaps = 2/69 (2%)

Query: 550 EEPSSPRKPEAPSSCIPPPPP--PASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLK 607
           EEP  P++P +      P  P  P  L  P  P+R         P P  R   P R +  
Sbjct: 592 EEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGP 651

Query: 608 SRIPRPYFP 616
             I  P  P
Sbjct: 652 KIIKSPKPP 660



 Score = 32.4 bits (73), Expect = 1.8
 Identities = 13/30 (43%), Positives = 16/30 (53%)

Query: 550 EEPSSPRKPEAPSSCIPPPPPPASLPPPPP 579
           + PSSP +PE P     P PP +  PP  P
Sbjct: 640 QRPSSPERPEGPKIIKSPKPPKSPKPPFDP 669



 Score = 30.4 bits (68), Expect = 7.1
 Identities = 59/261 (22%), Positives = 90/261 (34%), Gaps = 35/261 (13%)

Query: 395 HWLLLDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVD 454
           H  + DR+     +     ++Y   + KI   + +  L K+     +KKK   +E E+ D
Sbjct: 451 HLRIKDRIPANNDIYMLKFDEYWTRISKIQFTQEIKKLIKK-----SKKKLAPIEEEDSD 505

Query: 455 ----------LSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAEL 504
                      S    K   + + +  E ED  ES    +     ET      K+     
Sbjct: 506 KHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAK 565

Query: 505 E---DRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGRE-EPSSPRKPEA 560
           E    +I  L  + +F +D +            P++ K     RS  R   P SP+ PE 
Sbjct: 566 EHKPSKIPTLSKKPEFPKDPK--------HPKDPEEPKKPKRPRSAQRPTRPKSPKLPEL 617

Query: 561 ---PSSCIPPPPPPASLPPPPP-----PRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPR 612
              P S   P  P +   PPPP     P R      +  P+P    + P   K K +   
Sbjct: 618 LDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYD 677

Query: 613 PYFPNVSVQVNGTIWTELDES 633
            Y    +          LDES
Sbjct: 678 DYLDAAAKSKETKTTVVLDES 698


>gnl|CDD|178768 PLN03229, PLN03229, acetyl-coenzyme A carboxylase carboxyl
           transferase subunit alpha; Provisional.
          Length = 762

 Score = 34.4 bits (79), Expect = 0.33
 Identities = 28/166 (16%), Positives = 60/166 (36%), Gaps = 26/166 (15%)

Query: 392 HPQHWLLLD---RLAQQLTLQTETGEDYDHILLKINVKKVVH----LLAKEEEIVAAKKK 444
              H +L++   +L  +   +     +Y  +  K+++         L  K+ +    K +
Sbjct: 503 QLMHPVLMEKIEKLKDEFNKRLSRAPNYLSLKYKLDMLNEFSRAKALSEKKSKAEKLKAE 562

Query: 445 ADELEMENVDLSNRLAKKEQELDQKTQEK--------EDLDESLSRIKDRLEKETAGHLE 496
            ++   E +D      K E    +             +DL E + ++K  +E E AG L+
Sbjct: 563 INKKFKEVMDRPEIKEKMEALKAEVASSGASSGDELDDDLKEKVEKMKKEIELELAGVLK 622

Query: 497 ----------AKQKVAELEDRISELKHR-ADFEQDERKRLENLINS 531
                      K K    +     L+ +     ++  K++E +I S
Sbjct: 623 SMGLEVIGVTKKNKDTAEQTPPPNLQEKIESLNEEINKKIERVIRS 668


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 34.4 bits (79), Expect = 0.34
 Identities = 35/98 (35%), Positives = 50/98 (51%), Gaps = 5/98 (5%)

Query: 434 KEEEIVAAKKKAD---ELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
           KEE  V  +K +    ELE +N  L   L ++E+   +  QE E+L E     K  LEKE
Sbjct: 510 KEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKE 569

Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
               L+A +K  E+E  I ELK +   +  E K +E+L
Sbjct: 570 AQEALKALKK--EVESIIRELKEKKIHKAKEIKSIEDL 605



 Score = 30.6 bits (69), Expect = 5.8
 Identities = 22/101 (21%), Positives = 42/101 (41%), Gaps = 11/101 (10%)

Query: 410 TETGEDYDHILLKIN--VKKVVHLLAKEEEIV-AAKKKADELEMENVDLSNRLAKKEQEL 466
            E  E+ + ++ K++   K++       E+++   +K   ELE E  +L  R   K+ EL
Sbjct: 507 GEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLEL 566

Query: 467 DQKTQEK--------EDLDESLSRIKDRLEKETAGHLEAKQ 499
           +++ QE         E +   L   K    KE     +  +
Sbjct: 567 EKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVK 607


>gnl|CDD|216368 pfam01213, CAP_N, Adenylate cyclase associated (CAP) N terminal. 
          Length = 313

 Score = 34.0 bits (78), Expect = 0.34
 Identities = 13/40 (32%), Positives = 16/40 (40%)

Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLP 587
           G    +    P   SS    PPPP   PPP  P  S+ + 
Sbjct: 213 GPVAAAKSALPAVSSSAPSAPPPPPPPPPPSVPTISNSVE 252


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 33.5 bits (76), Expect = 0.35
 Identities = 25/89 (28%), Positives = 49/89 (55%), Gaps = 4/89 (4%)

Query: 422 KINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLS 481
           K ++KK   L+ ++E+I  A+++ +ELE E ++   R  K ++  DQ+ QE+E  ++++ 
Sbjct: 112 KDDIKKKKSLIIRQEQIEKARQEREELE-ERMEWERREEKIDEREDQEEQEREREEQTIE 170

Query: 482 RIKDRLEKETAGHLEAKQKVAELEDRISE 510
              D  E E    +E  +   E +D  +E
Sbjct: 171 EQSDDSEHEI---IEQDESETESDDDKTE 196


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 34.6 bits (79), Expect = 0.36
 Identities = 23/89 (25%), Positives = 37/89 (41%), Gaps = 12/89 (13%)

Query: 434 KEEEIVAAKKKADEL--EMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKET 491
           KEEEI A +   DEL  ++    +S       ++ +   QE+E L   L +I  + +K T
Sbjct: 342 KEEEIKALQSNIDELHKQLRKQGIST------EQFELMNQEREKLTRELDKINIQSDKLT 395

Query: 492 AG----HLEAKQKVAELEDRISELKHRAD 516
                  LEA+     LE  + +      
Sbjct: 396 KSVKSRKLEAQGIFKSLEKTLRQYDSLIQ 424


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 34.3 bits (79), Expect = 0.37
 Identities = 21/92 (22%), Positives = 28/92 (30%), Gaps = 12/92 (13%)

Query: 494 HLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPS 553
            L     V+EL DR+  L+  A                 + P                P 
Sbjct: 360 TLAPGAPVSELLDRLEALERGAPAPPSAAWGAPTPAAPAAPPP-----------AAAPPV 408

Query: 554 SPRKPEAPSSCIPPPPPPASLPPPPPPRRSSD 585
            P  P  P++  P P P        PP RS+D
Sbjct: 409 PPAAPARPAAARPAPAPAPP-AAAAPPARSAD 439



 Score = 32.8 bits (75), Expect = 1.1
 Identities = 14/57 (24%), Positives = 18/57 (31%)

Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRS 604
                       AP+   P  PPPA+ PP PP   +        P P       P +
Sbjct: 379 RGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPA 435


>gnl|CDD|223010 PHA03215, PHA03215, nuclear protein UL24; Provisional.
          Length = 262

 Score = 33.7 bits (77), Expect = 0.41
 Identities = 14/52 (26%), Positives = 22/52 (42%)

Query: 533 SVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSS 584
           +VP + +     R  G   P S R+P   ++  P P  PA+   PP     +
Sbjct: 188 AVPLEPRTTRRGRRRGARRPRSRRRPRGGAAGPPVPAEPAAPERPPADPTPT 239


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 33.5 bits (76), Expect = 0.47
 Identities = 24/171 (14%), Positives = 54/171 (31%), Gaps = 23/171 (13%)

Query: 461 KKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQD 520
           +KE+ L ++ + +E + E L RI ++ E++       ++   +   + +          +
Sbjct: 97  RKEKRLLEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQEPTQPA--------GVN 148

Query: 521 ERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKP-----------EAPSSCIPPPP 569
                +  +     P+  + +   R G   +P+   +P             P+   P   
Sbjct: 149 VANNPQPQVQPQFGPNPQQRINPQRFGFPMQPNMGMRPGFNQMPPHMPGMPPNQMRPGFN 208

Query: 570 PPASLPP----PPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPYFP 616
           P   +PP       P    ++              P      +   RP F 
Sbjct: 209 PMPGMPPRPGFNQNPNMMPNMNRPGFRPQPGGFNHPGTPMGPNMQQRPGFN 259


>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201).  This is
           a family of coiled-coil proteins from eukaryotes. The
           function is not known.
          Length = 177

 Score = 32.9 bits (76), Expect = 0.48
 Identities = 24/110 (21%), Positives = 47/110 (42%), Gaps = 15/110 (13%)

Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
           LK+     VH+L         K+K   LE E       L + ++EL +K +    L + L
Sbjct: 68  LKLKYTNTVHILT------HLKEKLHMLEEE-------LERLKKELKEKEERLAKLRKEL 114

Query: 481 SRIKDRLEKETAGHLEAKQKVAEL--EDRISELKHRADFEQDERKRLENL 528
            ++K   +K    + + +Q+   L     + +     +  +++RK +E L
Sbjct: 115 YKVKLERDKLRKQNKKLRQQGGLLHVPALLYDYDKTVEKVEEKRKSVEKL 164



 Score = 30.3 bits (69), Expect = 3.4
 Identities = 26/118 (22%), Positives = 56/118 (47%), Gaps = 4/118 (3%)

Query: 416 YDHILLKINVKKVVHLLAKEEEIVAAKKKAD--ELEMENVDLSNRLAKKEQEL-DQKTQE 472
              I LK  + K+   L ++EE+       D  +L++EN  L+ ++ ++ +EL   K + 
Sbjct: 13  LKLITLKHTLAKIQEKLEQKEELGEGLHLIDFEQLQIENQALNEKIEERNKELKRLKLKY 72

Query: 473 KEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
              +   L+ +K++L          K+++ E E+R+++L+      + ER +L     
Sbjct: 73  TNTVHI-LTHLKEKLHMLEEELERLKKELKEKEERLAKLRKELYKVKLERDKLRKQNK 129


>gnl|CDD|146930 pfam04534, Herpes_UL56, Herpesvirus UL56 protein.  In herpes
           simplex virus type 2, UL56 is thought to be a
           tail-anchored type II membrane protein involved in
           vesicular trafficking. The C terminal hydrophobic region
           is required for association with the cytoplasmic
           membrane, and the N terminal proline-rich region is
           important for the translocation of UL56 to the Golgi
           apparatus and cytoplasmic vesicles.
          Length = 198

 Score = 33.2 bits (75), Expect = 0.48
 Identities = 19/74 (25%), Positives = 26/74 (35%), Gaps = 5/74 (6%)

Query: 535 PDDTKAVGLSRSGGREEPSSPRKPEAPSSC---IPPPPPPASLPPPPPPRRSSDL--PYL 589
           P D ++  +S      E  S   P   + C       P   S P   P  +SS L  P  
Sbjct: 123 PPDVRSDLVSGLPTYAEAPSDHPPTYATVCAARTTEQPMGLSWPFDQPRSQSSGLWRPPT 182

Query: 590 TCPRPLVRCRQPPR 603
              R L R ++  R
Sbjct: 183 VNSRELYRAQRAAR 196


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 32.6 bits (75), Expect = 0.50
 Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 19/109 (17%)

Query: 423 INVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSR 482
           ++V+KV+       E  A K    +LE E   L   L KKE+EL ++ Q+ +    +LS 
Sbjct: 21  VDVQKVL------SESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSE 74

Query: 483 IKDRLEKETAGHLEAKQKVAELEDRISELKH-RADFEQDERKRLENLIN 530
                          K K  EL+ +  EL+  +   +Q+ +++ + L+ 
Sbjct: 75  ------------EARKAKQQELQQKQQELQQKQQAAQQELQQKQQELLQ 111


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 34.1 bits (78), Expect = 0.52
 Identities = 16/51 (31%), Positives = 18/51 (35%), Gaps = 3/51 (5%)

Query: 538 TKAVGLSRSGGREEPSSP-RKPEAPSSCIPPPPPPASLPPPPPPRRSSDLP 587
           T  V  +  G       P   P AP      PPP  + PP  PPR     P
Sbjct: 899 TGRVATAPGGDAASAPPPGAGPPAPPQ--AVPPPRTTQPPAAPPRGPDVPP 947


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 33.0 bits (76), Expect = 0.53
 Identities = 19/73 (26%), Positives = 40/73 (54%), Gaps = 8/73 (10%)

Query: 456 SNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK------ETAGHLEAKQKVAELEDRIS 509
           S  L K +  L++  +E E+L + ++ ++ ++EK      ET    E  +++ +LE  + 
Sbjct: 61  SQALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELK 120

Query: 510 ELKHRADFEQDER 522
           +LK  A+ E+ E+
Sbjct: 121 KLK--AELEKYEK 131


>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
           Provisional.
          Length = 376

 Score = 33.7 bits (77), Expect = 0.53
 Identities = 19/61 (31%), Positives = 22/61 (36%), Gaps = 13/61 (21%)

Query: 545 RSGGREEPSSPR-------KPEAPS----SCIPPPPPPASLPPPPPPRRSSDLPYLTCPR 593
           RS    EP+ P        KP  P+       PP    A  P  PP  +S   P    PR
Sbjct: 52  RSKKENEPAPPSTMIATNTKPFHPAPIDVPPDPPAAQEAVQPTAPPSAQSE--PERNEPR 109

Query: 594 P 594
           P
Sbjct: 110 P 110


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 32.3 bits (74), Expect = 0.53
 Identities = 18/78 (23%), Positives = 34/78 (43%), Gaps = 5/78 (6%)

Query: 440 AAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQ 499
           A +K   EL +++ +    L    ++L++  +E   L       +  L +      E K+
Sbjct: 43  AQQKYEREL-VKHAEDIEELQALRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKK 101

Query: 500 ----KVAELEDRISELKH 513
               +++ELE RI EL  
Sbjct: 102 MLEDELSELEKRIEELNE 119


>gnl|CDD|205921 pfam13747, DUF4164, Domain of unknown function (DUF4164).  This is
           a family of short, approx 100 residue-long, bacterial
           proteins of unknown function. There is several conserved
           LE/LD sequence pairs.
          Length = 89

 Score = 31.0 bits (71), Expect = 0.55
 Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 10/58 (17%)

Query: 471 QEKEDLDESLSRIK---DRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRL 525
               +L+ +L R+    DRLE      LE  + + ELE  I  L         +R RL
Sbjct: 1   TVTTELEAALQRLDAALDRLEAALDRRLERDRDIDELEAEIQRL-------GADRSRL 51


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 33.8 bits (77), Expect = 0.56
 Identities = 29/134 (21%), Positives = 50/134 (37%), Gaps = 21/134 (15%)

Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
           KK     AK  E   AK KA+             AKK+ E + K   ++   E+ ++ K 
Sbjct: 170 KKKAEEAAKAAE--EAKAKAEAAA----------AKKKAEAEAKAAAEKAKAEAEAKAKA 217

Query: 486 RLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKR----LENLINSGSVPDDTKAV 541
             + E A   +A  +  +   +    K  A  +  ERK     L+++    S        
Sbjct: 218 EKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAALDDIFGGLSS-----GK 272

Query: 542 GLSRSGGREEPSSP 555
              ++GG  + +S 
Sbjct: 273 NAPKTGGGAKNASS 286


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 33.7 bits (77), Expect = 0.57
 Identities = 29/112 (25%), Positives = 52/112 (46%), Gaps = 3/112 (2%)

Query: 418 HILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLD 477
           H   +I +++    LA+  E      +  ELE+    L     K E E  +  +E + L 
Sbjct: 84  HKRAQIELERKASTLAENYEREL--DRNLELEVRLKALEELEKKAENEAAEAEEEAKLLK 141

Query: 478 ESLSRIKDRLEKETAGHL-EAKQKVAELEDRISELKHRADFEQDERKRLENL 528
           + L     +L+ E    L EAK+ ++ +++ +SE++ RA     E K LE+ 
Sbjct: 142 DKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTELKLLESE 193


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
           sigma54-dependent transcription [Transcription / Signal
           transduction mechanisms].
          Length = 225

 Score = 33.1 bits (76), Expect = 0.58
 Identities = 24/114 (21%), Positives = 45/114 (39%), Gaps = 17/114 (14%)

Query: 402 LAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAK 461
           LAQ +  Q +     +    +            EE+   A +  +E      DL+    +
Sbjct: 47  LAQAIARQKQLERKLEEAQARAE--------KLEEKAELALQAGNE------DLAREALE 92

Query: 462 KEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRA 515
           ++Q L+   +  E   +      ++L+K+ A     +QK+AEL  +   LK R 
Sbjct: 93  EKQSLEDLAKALEAELQQAEEQVEKLKKQLA---ALEQKIAELRAKKEALKARK 143


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 33.8 bits (78), Expect = 0.60
 Identities = 21/91 (23%), Positives = 27/91 (29%), Gaps = 14/91 (15%)

Query: 518 EQDER---KRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEA-----------PSS 563
             DER    RLE L     V     A   +         +     A           P++
Sbjct: 368 SDDERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAA 427

Query: 564 CIPPPPPPASLPPPPPPRRSSDLPYLTCPRP 594
              P P PA  P PP P  ++       P P
Sbjct: 428 APQPAPAPAPAPAPPSPAGNAPAGGAPSPPP 458


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 32.7 bits (75), Expect = 0.63
 Identities = 20/73 (27%), Positives = 36/73 (49%), Gaps = 8/73 (10%)

Query: 461 KKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHR-----A 515
            +E+  + + +E  + +     +++  E E     E + K+AELE ++ EL+ R     A
Sbjct: 8   VEEEVEETEVEEAVEDEVEEETVEEESEAELLD--EEQAKIAELEAKLDELEERYLRLQA 65

Query: 516 DFE-QDERKRLEN 527
           DFE    R + EN
Sbjct: 66  DFENYKRRIQKEN 78


>gnl|CDD|219111 pfam06625, DUF1151, Protein of unknown function (DUF1151).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 122

 Score = 31.7 bits (72), Expect = 0.63
 Identities = 13/46 (28%), Positives = 25/46 (54%)

Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDES 479
           K ++++  +K+ +E +    +L   L K+ Q+L+Q   EK   DE 
Sbjct: 61  KRDQVLKEQKEEEEAKRLQSELERELMKRAQKLEQLELEKAKEDEE 106


>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
           synthetase/uroporphyrin-III C-methyltransferase;
           Reviewed.
          Length = 656

 Score = 33.5 bits (77), Expect = 0.69
 Identities = 10/43 (23%), Positives = 13/43 (30%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRP 594
           P+  R  +   S      P A+   P PP      P     R 
Sbjct: 277 PAPSRMTDTNDSKSVTSQPAAAAAAPAPPPNPPATPPEPPARR 319



 Score = 32.8 bits (75), Expect = 1.2
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query: 550 EEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRS 583
           +  S   +P A ++   PPP P + PP PP RR 
Sbjct: 287 DSKSVTSQPAAAAAAPAPPPNPPATPPEPPARRG 320



 Score = 30.8 bits (70), Expect = 4.4
 Identities = 11/35 (31%), Positives = 14/35 (40%)

Query: 550 EEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSS 584
            +  S     A ++  P PPP     PP PP R  
Sbjct: 286 NDSKSVTSQPAAAAAAPAPPPNPPATPPEPPARRG 320



 Score = 30.1 bits (68), Expect = 8.6
 Identities = 10/35 (28%), Positives = 12/35 (34%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDL 586
            +S     A +   PP PP     PP    R S  
Sbjct: 291 VTSQPAAAAAAPAPPPNPPATPPEPPARRGRGSAA 325


>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1.  Mating in
           fungi is controlled by the loci that determine the
           mating type of an individual, and only individuals with
           differing mating types can mate. Basidiomycete fungi
           have evolved a unique mating system, termed tetrapolar
           or bifactorial incompatibility, in which mating type is
           determined by two unlinked loci; compatibility at both
           loci is required for mating to occur. The multi-allelic
           tetrapolar mating system is considered to be a novel
           innovation that could have only evolved once, and is
           thus unique to the mushroom fungi. This domain is
           C-terminal to the homeodomain transcription factor
           region.
          Length = 418

 Score = 33.2 bits (76), Expect = 0.71
 Identities = 16/59 (27%), Positives = 16/59 (27%), Gaps = 5/59 (8%)

Query: 551 EPSSPRKPEAPSSCIPPP--PPPASLPPPPPPRRSSDLP---YLTCPRPLVRCRQPPRS 604
           E S     E   S  P      P    P    RR S  P        RP  R R    S
Sbjct: 75  ERSPALSSERLLSPSPSVLDLSPVLASPQTGKRRRSSSPSDDEDEAERPSKRPRSDSIS 133



 Score = 30.2 bits (68), Expect = 7.2
 Identities = 23/114 (20%), Positives = 35/114 (30%), Gaps = 24/114 (21%)

Query: 511 LKHRADFEQDE-------RKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSS 563
            + R+    D+        KR  +   S S                  P++  + E   +
Sbjct: 106 KRRRSSSPSDDEDEAERPSKRPRSDSISSSSSPAKPP-----EACLPSPAASTQDELSEA 160

Query: 564 CIPPPPPPASLPPPPPP---------RRSSDLPYLTCPRPLVRCRQPPRSKLKS 608
              P P P+  PP  P          RR SD   L  P+   R +   R +  S
Sbjct: 161 SAAPLPTPSLSPPHTPTDTAPSGKRKRRLSDGFQLPAPK---RPQTSSRPQTVS 211


>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
           modification].
          Length = 2365

 Score = 33.8 bits (77), Expect = 0.73
 Identities = 14/26 (53%), Positives = 15/26 (57%), Gaps = 4/26 (15%)

Query: 561 PSSCIPPPPPPA----SLPPPPPPRR 582
           P +  PPPPPP     S PPPPPP  
Sbjct: 6   PGNPPPPPPPPGFEPPSQPPPPPPPG 31



 Score = 29.9 bits (67), Expect = 9.2
 Identities = 13/30 (43%), Positives = 14/30 (46%), Gaps = 4/30 (13%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPR 581
           P +P  P  P    PP  PP    PPPPP 
Sbjct: 6   PGNPPPPPPPPGFEPPSQPP----PPPPPG 31


>gnl|CDD|220260 pfam09483, HpaP, Type III secretion protein (HpaP).  This entry
           represents proteins encoded by genes which are always
           found in type III secretion operons, although their
           function in the processes of secretion and virulence is
           unclear. Hpa stands for Hrp-associated gene, where Hrp
           stands for hypersensitivity response and virulence. see
           also PMID:18584024.
          Length = 185

 Score = 32.4 bits (74), Expect = 0.73
 Identities = 13/70 (18%), Positives = 15/70 (21%), Gaps = 1/70 (1%)

Query: 534 VPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPR 593
            P         RS              P     PP P     P  P R     P    P 
Sbjct: 1   APAGAAPRAARRSFDYARLMRRAARAGPPGTPAPPGPAEDAHPEFPERPRDA-PAPPAPP 59

Query: 594 PLVRCRQPPR 603
                 + P+
Sbjct: 60  RATDGDRDPQ 69


>gnl|CDD|130529 TIGR01462, greA, transcription elongation factor GreA.  The GreA
           and GreB transcription elongation factors enable to
           continuation of RNA transcription past template-encoded
           arresting sites. Among the Proteobacteria, distinct
           clades of GreA and GreB are found. GreA differs
           functionally in that it releases smaller
           oligonucleotides. Because members of the family outside
           the Proteobacteria resemble GreA more closely than GreB,
           the GreB clade (TIGR01461) forms a plausible outgroup
           and the remainder of the GreA/B family, included in this
           model, is designated GreA. In the Chlamydias and some
           spirochetes, the region described by This model is found
           as the C-terminal region of a much larger protein
           [Transcription, Transcription factors].
          Length = 151

 Score = 31.8 bits (73), Expect = 0.74
 Identities = 23/87 (26%), Positives = 39/87 (44%), Gaps = 22/87 (25%)

Query: 461 KKEQELDQ-KTQEKEDLDESLSRIKDRLE----KETAGHLEAKQKVAELEDRISELKHRA 515
           K ++EL+  KT ++ ++ E    I +  E     E A +  AK++    E RI+E     
Sbjct: 9   KLKEELEYLKTVKRPEISE---EIAEAREHGDLSENAEYHAAKEEQGFNEGRIAE----- 60

Query: 516 DFEQDERKRLENLINSGSVPDDTKAVG 542
                    LE+L+ +  V DD+K   
Sbjct: 61  ---------LEDLLANAQVIDDSKLST 78


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 33.5 bits (77), Expect = 0.77
 Identities = 13/57 (22%), Positives = 22/57 (38%), Gaps = 3/57 (5%)

Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSD---LPYLTCPRPLVRCRQPPRS 604
            P +     AP++     P   +L P PP + + +   +P    P P V    P  +
Sbjct: 424 APVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPA 480



 Score = 30.5 bits (69), Expect = 6.2
 Identities = 7/34 (20%), Positives = 13/34 (38%)

Query: 547 GGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPP 580
                P++    +A ++  P   P A+   P  P
Sbjct: 384 PEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAP 417



 Score = 30.1 bits (68), Expect = 8.7
 Identities = 10/51 (19%), Positives = 16/51 (31%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
            ++P    A  +  P PP  A+      P R +  P +    P        
Sbjct: 435 AAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAA 485


>gnl|CDD|223043 PHA03323, PHA03323, nuclear egress membrane protein UL34;
           Provisional.
          Length = 272

 Score = 32.7 bits (75), Expect = 0.79
 Identities = 17/64 (26%), Positives = 25/64 (39%), Gaps = 5/64 (7%)

Query: 549 REEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKS 608
               +  RK    +S   PP PP +  PP P   ++   + +   PL    + PRS    
Sbjct: 193 DGSIAGDRKNGDAASPGCPPIPPRAASPPLPAAGAAR-GWASLASPL----RAPRSVPTV 247

Query: 609 RIPR 612
           RI  
Sbjct: 248 RISA 251


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 33.5 bits (77), Expect = 0.81
 Identities = 30/121 (24%), Positives = 48/121 (39%), Gaps = 9/121 (7%)

Query: 431 LLAKEEEIVAAKKKADELEME----NVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDR 486
           L  +  E+   +  A+EL+ E      D   R  K EQE     ++ + LDE L  ++D 
Sbjct: 821 LAIQLREL---ESSAEELQQELTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRGLRDE 877

Query: 487 LEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRS 546
           + +       A     + E  ISE   + +  + +RKRL   +       D      S S
Sbjct: 878 MRQLAELKEPA--NANQAEGSISERLDQLEEFKRKRKRLSGDLKKFLERFDGVIKDHSGS 935

Query: 547 G 547
           G
Sbjct: 936 G 936



 Score = 32.7 bits (75), Expect = 1.3
 Identities = 29/84 (34%), Positives = 45/84 (53%), Gaps = 4/84 (4%)

Query: 446 DELEMENVDLSNRLAK-KEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAEL 504
           ++LE +    + RLA  +E++  QK   +EDL    S+++ +LE   AG LE  ++  EL
Sbjct: 391 EQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLE---AGKLEFNEEEYEL 447

Query: 505 EDRISELKHRADFEQDERKRLENL 528
           E R+  LK R D      + LE L
Sbjct: 448 ELRLGRLKQRLDSATATPEELEQL 471



 Score = 31.2 bits (71), Expect = 4.1
 Identities = 31/126 (24%), Positives = 52/126 (41%), Gaps = 15/126 (11%)

Query: 408 LQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELD 467
           LQ +  E    + L++       L+A EE +  A+++ +  E +N  L  +L   E +L 
Sbjct: 251 LQEDF-EQLLSLELRLQHLHG-ELVADEERL--AEEQEERQEAKN-RLRQQLRTLEDQLK 305

Query: 468 QK----TQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERK 523
           +      QE    +  L+  +  LE      LE  QK A  +  I +L+   D     R 
Sbjct: 306 EARDELNQELSAANAKLAADRSELE-----LLED-QKGAFEDADIEQLQADLDQLPSIRS 359

Query: 524 RLENLI 529
            LE + 
Sbjct: 360 ELEEVE 365


>gnl|CDD|215496 PLN02918, PLN02918, pyridoxine (pyridoxamine) 5'-phosphate oxidase.
          Length = 544

 Score = 33.4 bits (76), Expect = 0.82
 Identities = 11/57 (19%), Positives = 16/57 (28%), Gaps = 6/57 (10%)

Query: 563 SCIPPPPPPASLPPPPP-----PRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPY 614
           + +     P  + PPPP         S       P        P R  L ++   P 
Sbjct: 11  TFLLQSLLPLPISPPPPHSSSLSSSPSPTQRFLTPSQG-SRLPPRRRALCTKSQDPR 66


>gnl|CDD|234398 TIGR03921, T7SS_mycosin, type VII secretion-associated serine
           protease mycosin.  Members of this family are
           subtilisin-related serine proteases, found strictly in
           the Actinobacteria and associated with type VII
           secretion operons. The designation mycosin is used for
           members from Mycobacterium [Protein fate, Protein and
           peptide secretion and trafficking, Protein fate, Protein
           modification and repair].
          Length = 350

 Score = 33.1 bits (76), Expect = 0.82
 Identities = 10/32 (31%), Positives = 13/32 (40%)

Query: 555 PRKPEAPSSCIPPPPPPASLPPPPPPRRSSDL 586
           P +   P    P P  P + P PPPP   +  
Sbjct: 287 PPEDGRPLRPAPAPARPVAAPAPPPPPDDTPR 318


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 33.2 bits (76), Expect = 0.83
 Identities = 25/118 (21%), Positives = 54/118 (45%), Gaps = 15/118 (12%)

Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
           KK    L  ++++   KK+ + ++ +++     L K ++E D+  +E +   + +   K+
Sbjct: 181 KKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQKIKSAKN 240

Query: 486 RLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENL--INSGSVPDDTKAV 541
            L+ +     + K  +AEL+             Q++ KRLEN   I SG + +    +
Sbjct: 241 DLQNDKK---QLKADLAELKKAP----------QNDLKRLENKYAIKSGDLKNFAVDL 285


>gnl|CDD|223202 COG0124, HisS, Histidyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 33.0 bits (76), Expect = 0.90
 Identities = 15/85 (17%), Positives = 33/85 (38%), Gaps = 10/85 (11%)

Query: 415 DYDHILLKINV---KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQ 471
             +  L  + +   + ++  L K ++I   +   D       +           LD K  
Sbjct: 168 ILEGRLEYLGIDQREALLRYLDKLDKIGKLELDEDSKRRLKTNPLR-------VLDSKKD 220

Query: 472 EKEDLDESLSRIKDRLEKETAGHLE 496
             ++L ++   + D L++E+  HLE
Sbjct: 221 SDQELLKNAPELLDYLDEESLEHLE 245


>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein. 
          Length = 312

 Score = 32.6 bits (75), Expect = 0.95
 Identities = 29/102 (28%), Positives = 55/102 (53%), Gaps = 5/102 (4%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELD-QKTQEKEDLDESLSRIKDR--LEKET 491
           E +IV  +K  DE  +  VDL  ++   ++EL   K   +E++ E  S+I+ +  +E + 
Sbjct: 102 ENDIVGLRKDLDEATLARVDLEMKIESLQEELAFLKKNHEEEVRELQSQIQGQVNVEMDA 161

Query: 492 AGHLEAKQKVAELEDRISEL--KHRADFEQDERKRLENLINS 531
           A  L+  + +AE+  +  EL  K+R + E+  + +LE L  +
Sbjct: 162 ARKLDLTKALAEIRAQYEELAEKNRQEAEEWYKSKLEELQQA 203


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 32.7 bits (75), Expect = 0.97
 Identities = 21/97 (21%), Positives = 43/97 (44%), Gaps = 7/97 (7%)

Query: 435 EEEIVAAKKKADELEMENVD----LSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
           +EEI + ++ ADEL + +          L     ++ +K ++ E+L + L  +   +E  
Sbjct: 188 KEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEAL 247

Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKRLEN 527
           T    E  +++AE E    E +    +   E  +L+ 
Sbjct: 248 TNKKSELLEEIAEAEKIREECRG---WSAKEISKLKA 281



 Score = 31.2 bits (71), Expect = 3.2
 Identities = 20/99 (20%), Positives = 37/99 (37%), Gaps = 13/99 (13%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           EE +       + ++ +       L ++   L Q   E    D         LEK     
Sbjct: 163 EELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCD------PLELEK----- 211

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGS 533
             A+Q++  L  +ISE + + +  Q E + L   I + +
Sbjct: 212 --ARQELRSLSVKISEKRKQLEELQQELQELTIAIEALT 248


>gnl|CDD|147144 pfam04834, Adeno_E3_14_5, Early E3 14.5 kDa protein.  The E3B 14.5
           kDa was first identified in Human adenovirus type 5. It
           is an integral membrane protein oriented with its C
           terminus in the cytoplasm. It functions to down-regulate
           the epidermal growth factor receptor and prevent tumour
           necrosis factor cytolysis. It achieves this through the
           interaction with E3 10.4 kDa protein.
          Length = 97

 Score = 30.5 bits (69), Expect = 1.00
 Identities = 9/26 (34%), Positives = 11/26 (42%)

Query: 555 PRKPEAPSSCIPPPPPPASLPPPPPP 580
           PR P+     +  PPPP    P  P 
Sbjct: 60  PRYPDPQHIPLQQPPPPPEPQPRAPS 85


>gnl|CDD|193407 pfam12934, FTO_CTD, FTO C-terminal domain.  This domain is found at
           the C-terminus of the FTO protein which was shown to be
           associated with increased BMI and obesity risk in
           humans. The N-terminal domain of this protein is a DNA
           demethylase and this domain is found to associate with
           the N-terminal domain in the crystal structure. This
           domain is alpha helical with three helices that form a
           bundle.
          Length = 172

 Score = 32.0 bits (73), Expect = 1.0
 Identities = 31/133 (23%), Positives = 55/133 (41%), Gaps = 19/133 (14%)

Query: 868 LCHVKEASKVSMKELENDMTQLRNNLKEVSREIEFQRV-QPLVPGDRFLPVMKEFLTSAT 926
           L  +K+++ VS+K LE  + Q    L+E+  E+EF+ + Q  + G R       +    T
Sbjct: 17  LPDLKDSAAVSLKSLEAAVIQQ---LEEIHNEVEFEWLRQFWLQGSRHAKCHDYW----T 69

Query: 927 CRLSEIEDLFQDMKTRFDRA-VRLFGEDNSSIQPEDFFVIFDTFLTTLNEAKLDNENMKR 985
             ++E+ + +  M+         L     ++ QP      +D  L  L E        + 
Sbjct: 70  QPIAELTEAWDKMELMTRLVLDALKRASLTTEQPNR---AYDMLLYLLTE-------RQE 119

Query: 986 RREEEEKRARQDA 998
            R+E   R R  A
Sbjct: 120 LRDEWTARCRSGA 132


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 33.0 bits (76), Expect = 1.0
 Identities = 34/136 (25%), Positives = 58/136 (42%), Gaps = 37/136 (27%)

Query: 773 FHFWISDIEPKIKSVMEASREVTRSRRLRKLLELVLALGNYMNKGARGNASGFRLTSLNK 832
            H      + K +   E  R++     L KL E +  L +Y+ K          L  L +
Sbjct: 31  VHI----EDLKEELSNERLRKL--RSLLTKLSEALDKLRSYLPK----------LNPLRE 74

Query: 833 LVDTKSSSSRGVTLLHYLVELAEKRFKEVLKLEEDLCHVKEASKVSMKELENDMTQLRNN 892
                S  S        L EL +   +E+ K+E+++           KELE ++++L N 
Sbjct: 75  EKKKVSVKS--------LEELIKDVEEELEKIEKEI-----------KELEEEISELENE 115

Query: 893 LKEVSREIEFQRVQPL 908
           +KE+ +EIE  R++P 
Sbjct: 116 IKELEQEIE--RLEPW 129



 Score = 32.6 bits (75), Expect = 1.2
 Identities = 25/117 (21%), Positives = 51/117 (43%), Gaps = 20/117 (17%)

Query: 438 IVAAKKKADE-------LEMENVDLSN------RLAKKEQELDQKTQEKEDLDESLSRIK 484
           +V  K+ +DE       L  E ++L         + + ++EL++  +E+E L E L  + 
Sbjct: 183 VVVLKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELA 242

Query: 485 DRLEKE---TAGHLEAKQKVAELEDRISELKHRADFE----QDERKRLENLINSGSV 534
            +  +E      +LE + + AE   +  +       E    +D  K+L+ LI+  + 
Sbjct: 243 KKYLEELLALYEYLEIELERAEALSKFLKTDKTFAIEGWVPEDRVKKLKELIDKATG 299



 Score = 32.2 bits (74), Expect = 1.9
 Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 19/112 (16%)

Query: 428 VVHLLAKEEEIVAAK-----KKADELEMENVDLSNRLAKKEQELD--QKTQEKEDLDESL 480
           VVH+   +EE+   +         +L  E +D       K   L   +K    + L+E +
Sbjct: 30  VVHIEDLKEELSNERLRKLRSLLTKL-SEALDKLRSYLPKLNPLREEKKKVSVKSLEELI 88

Query: 481 SRIKDRLEK--ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
             +++ LEK  +    LE  ++++ELE+ I EL+        E +RLE   N
Sbjct: 89  KDVEEELEKIEKEIKELE--EEISELENEIKELEQ-------EIERLEPWGN 131



 Score = 30.3 bits (69), Expect = 7.5
 Identities = 20/105 (19%), Positives = 41/105 (39%), Gaps = 13/105 (12%)

Query: 435 EEEIVAAKKKADEL----EMENVDLSNRLAKKEQELDQ-KTQEKEDLDESLSRIKDRLEK 489
              +    +  D+L       N     +     + L++     +E+L++    IK+  E+
Sbjct: 49  RSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEE 108

Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSV 534
            +    E + ++ ELE  I  L+   +F+ D    L  L+    V
Sbjct: 109 IS----ELENEIKELEQEIERLEPWGNFDLD----LSLLLGFKYV 145


>gnl|CDD|218762 pfam05816, TelA, Toxic anion resistance protein (TelA).  This
           family consists of several prokaryotic TelA like
           proteins. TelA and KlA are associated with tellurite
           resistance and plasmid fertility inhibition.
          Length = 333

 Score = 32.6 bits (75), Expect = 1.1
 Identities = 20/67 (29%), Positives = 31/67 (46%), Gaps = 5/67 (7%)

Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
           L     +    L  E+ I A K K  ELE +   L   L +K +      QE  DL+++L
Sbjct: 124 LYDKNLEYYEEL--EKYIAAGKLKLAELEAK---LLPELEEKAEGDQMAAQELNDLEQAL 178

Query: 481 SRIKDRL 487
            R++ R+
Sbjct: 179 DRLEQRV 185


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 32.0 bits (73), Expect = 1.1
 Identities = 16/86 (18%), Positives = 31/86 (36%), Gaps = 3/86 (3%)

Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGRE 550
            A + EAK+K  EL + I ++K   ++E+ ++K       S    D  K     +   + 
Sbjct: 54  DAEYTEAKKKKKELAEEIEKVK--KEYEE-KQKWKWKKKKSKKKKDKDKDKKDDKKDDKS 110

Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPP 576
           E    ++ E     +           
Sbjct: 111 EKKDEKEAEDKLEDLTKSYSETLSTL 136



 Score = 32.0 bits (73), Expect = 1.1
 Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 5/77 (6%)

Query: 436 EEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHL 495
            E   AKKK  EL  E   +     +K++   +K + K+  D+     KD+ + +     
Sbjct: 55  AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKD----KDK-KDDKKDDK 109

Query: 496 EAKQKVAELEDRISELK 512
             K+   E ED++ +L 
Sbjct: 110 SEKKDEKEAEDKLEDLT 126


>gnl|CDD|220403 pfam09788, Tmemb_55A, Transmembrane protein 55A.  Members of this
           family catalyze the hydrolysis of the 4-position
           phosphate of phosphatidylinositol 4,5-bisphosphate, in
           the reaction: 1-phosphatidyl-myo-inositol
           4,5-bisphosphate + H(2)O =
           1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.
          Length = 258

 Score = 32.5 bits (74), Expect = 1.1
 Identities = 20/64 (31%), Positives = 25/64 (39%), Gaps = 4/64 (6%)

Query: 536 DDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPL 595
            +    G   S G E    P KP    +  P P   + LPPP  P +S D        P+
Sbjct: 12  SEDIDGGTGVSPGDEGAGPPSKPPYDQNFAPFPVLLSELPPPYSPAQSPDSG----SAPV 67

Query: 596 VRCR 599
           V CR
Sbjct: 68  VNCR 71


>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
           Nexins.  BAR domains are dimerization, lipid binding and
           curvature sensing modules found in many different
           proteins with diverse functions. Sorting nexins (SNXs)
           are Phox homology (PX) domain containing proteins that
           are involved in regulating membrane traffic and protein
           sorting in the endosomal system. SNXs differ from each
           other in their lipid-binding specificity, subcellular
           localization and specific function in the endocytic
           pathway. A subset of SNXs also contain BAR domains. The
           PX-BAR structural unit determines the specific membrane
           targeting of SNXs. BAR domains form dimers that bind to
           membranes, induce membrane bending and curvature, and
           may also be involved in protein-protein interactions.
          Length = 218

 Score = 31.9 bits (73), Expect = 1.2
 Identities = 17/100 (17%), Positives = 45/100 (45%), Gaps = 7/100 (7%)

Query: 398 LLDRLAQQLTLQ-TETGEDYDHIL------LKINVKKVVHLLAKEEEIVAAKKKADELEM 450
           L +  A Q  ++  E  ++Y          L      ++ L + ++++ + K + ++L+ 
Sbjct: 79  LSEAQANQELVKLLEPLKEYLRYCQAVKETLDDRADALLTLQSLKKDLASKKAQLEKLKA 138

Query: 451 ENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
                  ++ + E+EL++     E+  +    I +RL++E
Sbjct: 139 APGIKPAKVEELEEELEEAESALEEARKRYEEISERLKEE 178


>gnl|CDD|219780 pfam08286, Spc24, Spc24 subunit of Ndc80.  Spc24 is a component of
           the evolutionarily conserved kinetochore-associated
           Ndc80 complex and is involved in chromosome segregation.
          Length = 117

 Score = 30.8 bits (70), Expect = 1.2
 Identities = 18/61 (29%), Positives = 27/61 (44%), Gaps = 4/61 (6%)

Query: 447 ELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELED 506
           ELE E   L+  L   E EL+    E E+L E L    + LE++     E  +    ++D
Sbjct: 3   ELENEKFRLAKELNDLESELNNLQSELEELKERL----EELEEQEVEGDEVDEDETPVDD 58

Query: 507 R 507
            
Sbjct: 59  S 59


>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
           component of pyruvate dehydrogenase complex.
          Length = 539

 Score = 32.5 bits (74), Expect = 1.3
 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 7/41 (17%)

Query: 551 EPSSPRKPEAPSSCIPPPPP-------PASLPPPPPPRRSS 584
           +PSS   P AP +   PPPP       PAS P P   + S+
Sbjct: 201 KPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSA 241



 Score = 30.2 bits (68), Expect = 7.8
 Identities = 14/44 (31%), Positives = 17/44 (38%)

Query: 561 PSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRS 604
           PSS   P  P A   PPPP     + P  +      +   PP S
Sbjct: 202 PSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSS 245


>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 584

 Score = 32.5 bits (74), Expect = 1.4
 Identities = 21/104 (20%), Positives = 33/104 (31%), Gaps = 10/104 (9%)

Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPR-PLVRCRQPP-RSK 605
                ++P+   APS      P P   P P P    +  P    P    VR      R K
Sbjct: 389 ANLLHNAPQAAPAPS---AAAPEPKHQPAPEPRPVLAPTPASGEPNAAAVRSMWSTVRDK 445

Query: 606 LKSRIPRPYFPNVSVQVNGTIWTELDESKLYASMELEVVDKLFC 649
           ++ R          V + G     L+ + L  + E   + +   
Sbjct: 446 VRQRS-----RTTEVMLAGATVRALEGNTLVLTHESAPLARRLS 484


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 32.6 bits (74), Expect = 1.4
 Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 18/169 (10%)

Query: 458 RLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADF 517
              K+E+   +   +K    +  ++ K+  E+E       +    + E+     +   + 
Sbjct: 162 EEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEE--REK 219

Query: 518 EQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPP 577
           E+D+ K  E   +     +  +    S    R   SS +KP+   S   P    +S    
Sbjct: 220 EEDDGKDRETTTSPMEEDESRQ----SSEISRRSSSSLKKPDPSPSMASPETRESSKRTE 275

Query: 578 PPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPYFPNVSVQVNGTI 626
             PR S   P     RP      PPR K K          +   +    
Sbjct: 276 TRPRTSLRPP---SARPASARPAPPRVKRK---------EIVTVLQDAQ 312



 Score = 31.8 bits (72), Expect = 2.5
 Identities = 30/152 (19%), Positives = 56/152 (36%), Gaps = 3/152 (1%)

Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK--E 490
            KE+++   + + +E + E V   +R  KK  +     ++KE  +E   R   R     +
Sbjct: 149 EKEKKVEEPRDREEEKKRERVRAKSR-PKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGK 207

Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGRE 550
                  +++  E +D        +  E+DE ++   +    S          S +    
Sbjct: 208 PEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPET 267

Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRR 582
             SS R    P + + PP    +   P PPR 
Sbjct: 268 RESSKRTETRPRTSLRPPSARPASARPAPPRV 299


>gnl|CDD|223037 PHA03301, PHA03301, envelope glycoprotein L; Provisional.
          Length = 226

 Score = 31.8 bits (72), Expect = 1.5
 Identities = 13/59 (22%), Positives = 19/59 (32%), Gaps = 8/59 (13%)

Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKL 606
            R   S+P    +P       PPP + P    PR           +P     +P R + 
Sbjct: 176 TRRPLSAPDDEASPQPKSLATPPPVAAPSRRTPRPR--------RKPRGNRTRPSRPRN 226


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 32.3 bits (74), Expect = 1.5
 Identities = 14/58 (24%), Positives = 34/58 (58%), Gaps = 1/58 (1%)

Query: 451 ENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLE-AKQKVAELEDR 507
           EN+++  ++ + ++EL +  ++ E L+  + + ++ LEK     L+ A +K+A+  D 
Sbjct: 89  ENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDS 146



 Score = 32.3 bits (74), Expect = 1.6
 Identities = 25/108 (23%), Positives = 43/108 (39%), Gaps = 8/108 (7%)

Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQE-KEDLDESLSRIKDRLEKETA 492
            E +I   KK+  +LE +   L   + KKE+EL++   +  +   + L++  D    E  
Sbjct: 93  IEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEAL 152

Query: 493 GHLEAKQK-----VAELEDRISELKHRADFEQDERKRLENLINSGSVP 535
             L  K+      + EL+  I         E  E K     + S + P
Sbjct: 153 KGLNYKKNFKEKLLKELKSVILNASSLLSLE--ELKAKIKTLFSSNKP 198


>gnl|CDD|236692 PRK10431, PRK10431, N-acetylmuramoyl-l-alanine amidase II;
           Provisional.
          Length = 445

 Score = 32.1 bits (73), Expect = 1.6
 Identities = 13/37 (35%), Positives = 18/37 (48%), Gaps = 3/37 (8%)

Query: 568 PPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRS 604
           PPPP   PPPP   +  + P +  PR     R P ++
Sbjct: 130 PPPP---PPPPVVAKRVETPAVVAPRVSEPARNPFKT 163


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 32.4 bits (74), Expect = 1.6
 Identities = 18/74 (24%), Positives = 32/74 (43%), Gaps = 3/74 (4%)

Query: 459 LAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFE 518
           LA    E+     E+ +L   LS  + +  K      E K+ +A+L   +S  + + +  
Sbjct: 173 LAAVRAEI---AAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEEL 229

Query: 519 QDERKRLENLINSG 532
           +    RL+N I S 
Sbjct: 230 RANESRLKNEIASA 243



 Score = 31.6 bits (72), Expect = 2.6
 Identities = 32/172 (18%), Positives = 50/172 (29%), Gaps = 13/172 (7%)

Query: 386 PLEFGCHPQHWLLLDRLAQQL-TLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKK 444
           P      P+      RLA     L     E  D +        +  L A   EI A + +
Sbjct: 132 PPALLVSPEDAQRSVRLAIYYGALNPARAERIDAL-----KATLKQLAAVRAEIAAEQAE 186

Query: 445 ADELEMENVDLSNRLAKK-------EQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEA 497
              L  E      +LA+          +L+ +    +   E L   + RL+ E A    A
Sbjct: 187 LTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAA 246

Query: 498 KQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGR 549
             K  E          RA   + +R        +      +   G     G+
Sbjct: 247 AAKAREAAAAAEAAAARARAAEAKRTGETYKPTAPEKMLISSTGGFGALRGQ 298



 Score = 30.8 bits (70), Expect = 4.0
 Identities = 15/72 (20%), Positives = 34/72 (47%)

Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
           A E++I   + +  +LE +   L   +A  E +L +   + + L + ++ +  RL     
Sbjct: 49  ALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALEV 108

Query: 493 GHLEAKQKVAEL 504
              E ++++AE 
Sbjct: 109 QEREQRRRLAEQ 120


>gnl|CDD|117486 pfam08919, F_actin_bind, F-actin binding.  The F-actin binding
           domain forms a compact bundle of four antiparallel
           alpha-helices, which are arranged in a left-handed
           topology. Binding of F-actin to the F-actin binding
           domain may result in cytoplasmic retention and
           subcellular distribution of the protein, as well as
           possible inhibition of protein function.
          Length = 179

 Score = 31.2 bits (70), Expect = 1.6
 Identities = 20/67 (29%), Positives = 27/67 (40%), Gaps = 12/67 (17%)

Query: 548 GREEPSSPRKPEAPSSCIPP--PPPPASLP---PPPPPRRSSDLPYLTCPRPLV------ 596
           G ++P  P  P+  S+  P   PP P  LP   P P    +   P      PL+      
Sbjct: 2   GLKKPVPPAVPKPQSTAKPVGTPPSPVPLPSTSPSPSKMANGTQPSSAAFIPLISTRVSL 61

Query: 597 -RCRQPP 602
            + RQPP
Sbjct: 62  RKTRQPP 68


>gnl|CDD|191840 pfam07749, ERp29, Endoplasmic reticulum protein ERp29, C-terminal
           domain.  ERp29 is a ubiquitously expressed endoplasmic
           reticulum protein found in mammals. ERp29 is comprised
           of two domains. This domain, the C-terminal domain, has
           an all helical fold. ERp29 is thought to form part of
           the thyroglobulin folding complex.
          Length = 95

 Score = 29.9 bits (68), Expect = 1.6
 Identities = 14/62 (22%), Positives = 27/62 (43%), Gaps = 2/62 (3%)

Query: 477 DESLSRIKDRLEKETAGHLEAKQKVAELEDRISE--LKHRADFEQDERKRLENLINSGSV 534
           D+    +   +E+  A   ++  K A+   ++ E  L+   D+   E  RLE L+     
Sbjct: 18  DDERKELLSEVEEAVAKLKDSAAKYAKYYVKVMEKILEKGEDYVAKELARLERLLEGSLA 77

Query: 535 PD 536
           P+
Sbjct: 78  PE 79


>gnl|CDD|147296 pfam05038, Cytochrom_B558a, Cytochrome Cytochrome b558
           alpha-subunit.  Cytochrome b-245 light chain (p22-phox)
           is one of the key electron transfer elements of the
           NADPH oxidase in phagocytes.
          Length = 186

 Score = 31.4 bits (71), Expect = 1.7
 Identities = 15/35 (42%), Positives = 16/35 (45%), Gaps = 1/35 (2%)

Query: 546 SGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPP 580
            G +  P  PRK E P        PP S PPP PP
Sbjct: 126 RGEQWTPILPRKKERPPVGGTIKQPP-SNPPPRPP 159


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 32.5 bits (74), Expect = 1.7
 Identities = 13/56 (23%), Positives = 16/56 (28%), Gaps = 3/56 (5%)

Query: 552 PSSPRKPEAPS---SCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRS 604
           P +PR    P       PPP  PA+  P P        P            Q   +
Sbjct: 695 PQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGAEQADGA 750


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 31.8 bits (73), Expect = 1.7
 Identities = 23/110 (20%), Positives = 58/110 (52%), Gaps = 2/110 (1%)

Query: 422 KINVKKVVHLLAKEEEIVAAKKKADELE-MENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
           +  + +++  + +E+E  A +K+  + +  E +D  N    + +E  +K +E+E+  + L
Sbjct: 98  REQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEE-EKEREREEELKIL 156

Query: 481 SRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
              +++ E+E     E +++  E E  ++ L+ + +  +DER+ L+ L  
Sbjct: 157 EYQREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELRA 206



 Score = 30.6 bits (70), Expect = 4.3
 Identities = 26/133 (19%), Positives = 56/133 (42%), Gaps = 5/133 (3%)

Query: 398 LLDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSN 457
            + RL  Q     +  E+ D +   +  ++      ++E+  A K++  + E++      
Sbjct: 183 EVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREEQ 242

Query: 458 RLAKKEQELDQKTQEKEDLDESLS--RIKDRLEKETAGHLEAKQKV--AELEDRISELKH 513
              K+E+  +++ +E+ + +  L      + LE+E A     K+     ELE +I E K 
Sbjct: 243 IEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQI-EEKE 301

Query: 514 RADFEQDERKRLE 526
                + E +  E
Sbjct: 302 ERRAAEREEELEE 314



 Score = 30.3 bits (69), Expect = 5.2
 Identities = 27/106 (25%), Positives = 50/106 (47%), Gaps = 12/106 (11%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAK------------KEQELDQKTQEKEDLDESLSR 482
           EEE + A  + +E E +  +                   +++E +++ QE+E +DE + R
Sbjct: 48  EEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIER 107

Query: 483 IKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
           I++  E E     E ++K+ E  D  +E +     E+ ER+R E L
Sbjct: 108 IQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEEL 153


>gnl|CDD|193253 pfam12777, MT, Microtubule-binding stalk of dynein motor.  the 380
           kDa motor unit of dynein belongs to the AAA class of
           chaperone-like ATPases. The core of the 380 kDa motor
           unit contains a concatenated chain of six AAA modules,
           of which four correspond to the ATP binding sites with
           P-loop signatures described previously, and two are
           modules in which the P loop has been lost in evolution.
           This family is the region between D4 and D5 and is the
           two predicted alpha-helical coiled coil segments that
           form the stalk supporting the ATP-sensitive microtubule
           binding component.
          Length = 344

 Score = 31.9 bits (72), Expect = 1.9
 Identities = 17/57 (29%), Positives = 32/57 (56%)

Query: 454 DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISE 510
           DL  +LA +E EL QK ++ + L + +    +++ KE A   + + KVA +   ++E
Sbjct: 19  DLKAKLAIQEVELKQKNEDADKLIQVVGVETEKVSKEKAIADQEEVKVAVINKEVTE 75


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 32.2 bits (74), Expect = 1.9
 Identities = 22/99 (22%), Positives = 39/99 (39%), Gaps = 25/99 (25%)

Query: 399 LDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNR 458
           +  L QQL LQ                       A+++E+VA +  A ELE +  +L  +
Sbjct: 151 VLTLKQQLELQAREKAQSQ-----------ALAEAQQQELVALEGLAAELEEKQQELEAQ 199

Query: 459 LAK-KEQELDQKTQEKE-------------DLDESLSRI 483
           L + +E+  +   + K+             +L E  +RI
Sbjct: 200 LEQLQEKAAETSQERKQKRKEITDQAAKRLELSEEETRI 238



 Score = 29.9 bits (68), Expect = 8.7
 Identities = 16/70 (22%), Positives = 30/70 (42%)

Query: 454 DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKH 513
            L   +   +Q+L+ + +EK          +  L        E ++K  ELE ++ +L+ 
Sbjct: 146 ALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQE 205

Query: 514 RADFEQDERK 523
           +A     ERK
Sbjct: 206 KAAETSQERK 215


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 31.9 bits (72), Expect = 2.0
 Identities = 23/110 (20%), Positives = 52/110 (47%), Gaps = 7/110 (6%)

Query: 429 VHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLE 488
             L  K+ E++  + + + L  ++ D+   L K +++L +  QEK  L   +  ++  LE
Sbjct: 294 QELSRKKTELLGLQTELETLANQDSDMRQHLDKLKEDLTRAEQEKAILQTEVDALRYELE 353

Query: 489 -------KETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
                  K+TA    A+++ A     I +++ R +  + + + L+  I +
Sbjct: 354 RKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIEN 403



 Score = 30.8 bits (69), Expect = 5.3
 Identities = 29/137 (21%), Positives = 55/137 (40%), Gaps = 16/137 (11%)

Query: 402 LAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAK 461
           L  Q  L+T   +D D            HL   +E++  A+++   L+ E   L   L +
Sbjct: 304 LGLQTELETLANQDSDM---------RQHLDKLKEDLTRAEQEKAILQTEVDALRYELER 354

Query: 462 KEQELDQKTQEKEDLDESLSR-------IKDRLEKETAGHLEAKQKVAELEDRISELKHR 514
           K   L +KT   +   E  +        ++DR EK        ++K+  L++     + R
Sbjct: 355 KHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQETFRRKERR 414

Query: 515 ADFEQDERKRLENLINS 531
              E++  + L+   N+
Sbjct: 415 LKEEKERLRSLQTDTNT 431


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 31.7 bits (72), Expect = 2.0
 Identities = 20/66 (30%), Positives = 35/66 (53%), Gaps = 2/66 (3%)

Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKT-QEKEDLDESLSRIKDRLEKET 491
           A EEEI+  +++  E E E ++  N  AK E E   +  +E+E+ D +   +K +  +E 
Sbjct: 151 ATEEEILEMRRETIEEEAE-LERENIRAKIEAEARGRAKEERENEDINREMLKLKANEER 209

Query: 492 AGHLEA 497
              LE+
Sbjct: 210 ETVLES 215


>gnl|CDD|172947 PRK14472, PRK14472, F0F1 ATP synthase subunit B; Provisional.
          Length = 175

 Score = 30.9 bits (70), Expect = 2.1
 Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 8/75 (10%)

Query: 441 AKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQK 500
           AK +A+ +  +N +L   LAK + E D+  +E ++  E L     R E     H EAK+ 
Sbjct: 64  AKDEAEAILRKNREL---LAKADAEADKIIREGKEYAEKL-----RAEITEKAHTEAKKM 115

Query: 501 VAELEDRISELKHRA 515
           +A  ++ I + K RA
Sbjct: 116 IASAKEEIEQEKRRA 130


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 32.2 bits (73), Expect = 2.1
 Identities = 24/126 (19%), Positives = 44/126 (34%), Gaps = 6/126 (4%)

Query: 405 QLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQ 464
           +L  +    E+      K   K    L  ++EEI   +K   EL+   +       ++EQ
Sbjct: 305 KLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEK---ELKELEIKREAEEEEEEQ 361

Query: 465 ELDQKTQE---KEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDE 521
               + +    +E+L        +RL        E  +   E E     L   ++ E+D 
Sbjct: 362 LEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDL 421

Query: 522 RKRLEN 527
            K  + 
Sbjct: 422 LKEEKK 427


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 31.6 bits (72), Expect = 2.2
 Identities = 35/165 (21%), Positives = 62/165 (37%), Gaps = 24/165 (14%)

Query: 430 HLLAKEEEIVAAKKKADELE---MENVDLSNRLAKKEQELDQKTQEKEDLDE--SLSRIK 484
             L +E+E    K++ +EL    M+   L NR A  ++E++      E L+E   L   +
Sbjct: 111 DKLDEEQEERVEKEREEELAGDAMKK--LENRTADSKREME----VLERLEELKELQSRR 164

Query: 485 DRLEKET---AGHLEAKQKVAELEDR----ISELKHRADFEQDERKRLENLINSGSVPDD 537
             ++  +   A     K++  E E+     I  L    + E+D R+  +         +D
Sbjct: 165 ADVDVNSMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDND 224

Query: 538 TKAVGLSRSGGR------EEPSSPRKPEAPSSCIPPPPPPASLPP 576
                 S S          + S+ ++ EAPSS            P
Sbjct: 225 NTPSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKP 269


>gnl|CDD|236795 PRK10920, PRK10920, putative uroporphyrinogen III
           C-methyltransferase; Provisional.
          Length = 390

 Score = 31.6 bits (72), Expect = 2.2
 Identities = 15/74 (20%), Positives = 35/74 (47%), Gaps = 10/74 (13%)

Query: 441 AKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQK 500
            K++A      N  L+N+L   ++    +  +K++L+  L +    L++       A ++
Sbjct: 58  GKQQAQNQTATNDALANQLTALQK---AQESQKQELEGILKQQAKALDQ-------ANRQ 107

Query: 501 VAELEDRISELKHR 514
            A L  ++ EL+ +
Sbjct: 108 QAALAKQLDELQQK 121


>gnl|CDD|237046 PRK12297, obgE, GTPase CgtA; Reviewed.
          Length = 424

 Score = 31.6 bits (73), Expect = 2.2
 Identities = 30/131 (22%), Positives = 53/131 (40%), Gaps = 24/131 (18%)

Query: 414 EDYDHILLKIN--VKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQK-- 469
           EDY+    KIN  +K     L +  +IV A K      M+  +    L + +++L  K  
Sbjct: 256 EDYE----KINKELKLYNPRLLERPQIVVANK------MDLPEAEENLEEFKEKLGPKVF 305

Query: 470 ---TQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDER---- 522
                  + LDE L  + + LE+     LE ++   E+  +  E +      +DE     
Sbjct: 306 PISALTGQGLDELLYAVAELLEETPEFPLEEEEVEEEVYYKFEEEEKDFTITRDEDGVFV 365

Query: 523 ---KRLENLIN 530
              +++E L  
Sbjct: 366 VSGEKIERLFK 376


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 31.2 bits (71), Expect = 2.3
 Identities = 19/67 (28%), Positives = 29/67 (43%), Gaps = 1/67 (1%)

Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRI 610
            P    +PE     IP PP  A +  P P  +    P    P+P+ +  + P+ ++K   
Sbjct: 67  PPEPVVEPEPEPEPIPEPPKEAPVVIPKPEPKPKPKPKPK-PKPVKKVEEQPKREVKPVE 125

Query: 611 PRPYFPN 617
           PRP  P 
Sbjct: 126 PRPASPF 132


>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824).  This is
           a repeating domain found in fungal proteins. It is
           proline-rich, and the function is not known.
          Length = 135

 Score = 30.2 bits (68), Expect = 2.3
 Identities = 18/73 (24%), Positives = 20/73 (27%), Gaps = 1/73 (1%)

Query: 545 RSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTC-PRPLVRCRQPPR 603
           +   RE        +A S     P  P   PP PP       P     P P      PP 
Sbjct: 30  KKAERERERYRHDHDAYSDSYEEPYDPTPYPPSPPVSDPRYYPNSNYFPPPPGSTPVPPP 89

Query: 604 SKLKSRIPRPYFP 616
                  P  Y P
Sbjct: 90  GPQPGYNPADYPP 102



 Score = 28.7 bits (64), Expect = 8.7
 Identities = 15/46 (32%), Positives = 16/46 (34%), Gaps = 4/46 (8%)

Query: 553 SSPRKPEAPSSC----IPPPPPPASLPPPPPPRRSSDLPYLTCPRP 594
           S+P  P  P         PPPP A  PP   P           PRP
Sbjct: 83  STPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGPGQDPYAPRP 128


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 31.3 bits (71), Expect = 2.4
 Identities = 31/156 (19%), Positives = 63/156 (40%), Gaps = 21/156 (13%)

Query: 425 VKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIK 484
           V+ +V+ L    ++   KKK +  + EN D+  +        ++ T+E+E+L+E      
Sbjct: 102 VEDIVYNLTNNIDLENTKKKIETYQKENKDVIQK------NKEKSTREQEELEE------ 149

Query: 485 DRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLS 544
             LE E     + +  + + E+     + + +  ++++  L+ L  S     +  A    
Sbjct: 150 -ALEFEKEEEEQRRLLLQKEEE-----EQQMNKRKNKQALLDELETSTLPAAELIAQHKK 203

Query: 545 RSGGREEPSSPRKPEAP---SSCIPPPPPPASLPPP 577
            S   E      KPE P   S+ I      + +P  
Sbjct: 204 NSVKLEMQVEKPKPEKPNTFSTGIKMGYQISLVPVQ 239


>gnl|CDD|220777 pfam10486, PI3K_1B_p101, Phosphoinositide 3-kinase gamma adapter
           protein p101 subunit.  Class I PI3Ks are dual-specific
           lipid and protein kinases involved in numerous
           intracellular signaling pathways. Class IB PI3K,
           p110gamma, is mainly activated by seven-transmembrane
           G-protein-coupled receptors (GPCRs), through its
           regulatory subunit p101 and G-protein beta-gamma
           subunits.
          Length = 856

 Score = 31.9 bits (72), Expect = 2.4
 Identities = 29/115 (25%), Positives = 42/115 (36%), Gaps = 10/115 (8%)

Query: 474 EDLDESLSRIKDRLE-KETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSG 532
           ED DE    +  R E +E   H + KQK+ +L     +L  R D        L  L  +G
Sbjct: 384 EDSDEGSDELPARGEPQEERAHNKLKQKIYKLFKSKGQLVKRRDLSPCIDWPLP-LRRAG 442

Query: 533 SV--PDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSD 585
           S+  P  ++  G SR       S      A  + +     P  +     P  S D
Sbjct: 443 SLCGPQLSRIPGRSRR------SKSLPQPALGTQLLDLQLPQKVAFKRRPFLSGD 491


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
           the bacterial cell cycle, the tubulin-like cell-division
           protein FtsZ polymerises into a ring structure that
           establishes the location of the nascent division site.
           EzrA modulates the frequency and position of FtsZ ring
           formation.
          Length = 559

 Score = 31.8 bits (73), Expect = 2.4
 Identities = 32/117 (27%), Positives = 56/117 (47%), Gaps = 17/117 (14%)

Query: 399 LDRLAQQLTLQTETGEDYDHILLKIN-VKKVVHLLAKEEEIVAAKKKADELEMENVDLSN 457
           LDRL Q  TL  +  E    +  ++  ++     L +    +A KK A        +L  
Sbjct: 329 LDRLQQSYTLNEDELETVRELEKQLEELEAQYDQLVER---IAEKKVAYS------ELQE 379

Query: 458 RLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHR 514
            L + E++L++  +E+E+L ESL      L K+    LEA++K+ E   ++ E+K  
Sbjct: 380 ELEEIEKQLEEIEKEQEELSESLQG----LRKD---ELEAREKLQEYRQKLHEIKRY 429



 Score = 31.0 bits (71), Expect = 3.9
 Identities = 23/83 (27%), Positives = 37/83 (44%), Gaps = 15/83 (18%)

Query: 421 LKINVKKVVHLLAKEEEIVAA-KKKADELEMENVDL-------SNRLAKKEQELDQKTQE 472
           + IN+  V   L +  + V   K+K +EL ++N  L       +NR     +E+ +  +E
Sbjct: 463 VPINMDAVNRQLEEATDDVETLKEKTEEL-VDNATLAEQLIQYANRYRSDNEEVAEALKE 521

Query: 473 KEDL------DESLSRIKDRLEK 489
            E L       E+L  I   LEK
Sbjct: 522 AERLFENYDYKEALEIIATALEK 544


>gnl|CDD|224617 COG1703, ArgK, Putative periplasmic protein kinase ArgK and related
            GTPases of G3E family [Amino acid transport and
            metabolism].
          Length = 323

 Score = 31.5 bits (72), Expect = 2.5
 Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 3/70 (4%)

Query: 970  LTTLNEAKLDNENMKRRREEEEKRARQDAEMKKRTTDRKQGAKENGVVISNGSVKGEFDD 1029
            +  L +A  D+           ++ R       RT  R +       + +N +VK  + D
Sbjct: 242  IDELWDAIEDHRKFLTESGLFTEKRRTQYVEWIRTLVRDEVLDR---LEANPAVKKVYAD 298

Query: 1030 LISALRTGDV 1039
            L +A+R G++
Sbjct: 299  LEAAVRKGEL 308


>gnl|CDD|203371 pfam06005, DUF904, Protein of unknown function (DUF904).  This
           family consists of several bacterial and archaeal
           hypothetical proteins of unknown function.
          Length = 72

 Score = 28.8 bits (65), Expect = 2.5
 Identities = 13/55 (23%), Positives = 26/55 (47%)

Query: 436 EEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
           E I   + + +EL+ EN  LS    + E+E  +  +E+      L  +  +L++ 
Sbjct: 18  ETIALLQMEIEELKEENEQLSEENEELEEENKKLKEERNAWQSRLRALLGKLDEV 72


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 31.8 bits (72), Expect = 2.6
 Identities = 36/154 (23%), Positives = 62/154 (40%), Gaps = 13/154 (8%)

Query: 433  AKEEEIVAAKKKADELEMENVD---LSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
            A E E       +  +E ENV         A  +++     +EKE+ DE L  +KDRL  
Sbjct: 1214 ASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILD-LKDRLAA 1272

Query: 490  ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLS----- 544
                   A  + A++E+ +  +  R    + +     ++I+     DD  AV +S     
Sbjct: 1273 YNLD--SAPAQSAKMEETVKAVPARRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERL 1330

Query: 545  -RSGGREEPSSPRKPEAPSSCIPPPPPPASLPPP 577
             + GGR+ P++  K  A          PA++   
Sbjct: 1331 KKKGGRK-PAAANKKAAKPPAAAKKRGPATVQSG 1363


>gnl|CDD|223163 COG0085, RpoB, DNA-directed RNA polymerase, beta subunit/140 kD
           subunit [Transcription].
          Length = 1060

 Score = 31.8 bits (73), Expect = 2.6
 Identities = 35/160 (21%), Positives = 55/160 (34%), Gaps = 21/160 (13%)

Query: 672 SKSKILSVIDGRRAQNCTILLSKLKM-TDEEISKAI-------LSMDSADQLPLDMLEQL 723
            K  +   ID +R    TILL  L + TDEEI +A        L     ++  L++ E+ 
Sbjct: 177 PKDNLYVRIDRKRKIPVTILLRALGLETDEEIIEAFGGDELTDLVPPEGEEALLEIYEEA 236

Query: 724 LKFTPNTEEAALLEEHSTETLARADRFL--YEISKIPHYEQRLRCLHYKKRFHFWISDIE 781
                    A  L       + R D     Y+ +K    ++ L  L           D+ 
Sbjct: 237 KGEKITARNALELIGSRVFVVKRYDAKEGRYKRAKYVLDKELLPHLGEAGE----RYDLS 292

Query: 782 PKIKSVMEASREVTRSRRLRKLLELVLALGNYMNKGARGN 821
              K+             ++ L+EL L  G   +    GN
Sbjct: 293 RVGKAKDII-------AMIKYLIELRLGKGEEDDIDHLGN 325


>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566).  Family of
           related proteins that is plant specific.
          Length = 313

 Score = 31.5 bits (71), Expect = 2.7
 Identities = 20/88 (22%), Positives = 29/88 (32%)

Query: 544 SRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPR 603
            RS  R   SS     + SS       PAS P   P R +S        +         R
Sbjct: 17  PRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSSSFGLSKQRPSSLSRGR 76

Query: 604 SKLKSRIPRPYFPNVSVQVNGTIWTELD 631
              +   P    P+ +  +NG++ T   
Sbjct: 77  LSSRFVSPSRGSPSAAASLNGSLATAST 104


>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
          Length = 869

 Score = 31.5 bits (72), Expect = 2.7
 Identities = 28/121 (23%), Positives = 47/121 (38%), Gaps = 34/121 (28%)

Query: 420 LLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKK--EQELDQKTQEKEDLD 477
           +L+ +    V LL +E EI     +  ELE E       L K   E+ + ++ +E E  +
Sbjct: 333 ILRRSTDHTVSLLKQELEI-----RLSELE-EQW-HFASLEKIFIEERIYKEIEEAETWE 385

Query: 478 ESLSRIKDRLE-----------KETAGHL--------------EAKQKVAELEDRISELK 512
             +  I   LE           ++    L              +A + +A LE  I+E+K
Sbjct: 386 AVIEAIDKGLEPFKKQFIREVTEDDIVRLTEIKIKRISKFDSDKADELIARLEAEIAEVK 445

Query: 513 H 513
           H
Sbjct: 446 H 446


>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein.  Tymoviruses
           are single stranded RNA viruses. This family includes a
           protein of unknown function that has been named based on
           its molecular weight. Tymoviruses such as the ononis
           yellow mosaic tymovirus encode only three proteins. Of
           these two are overlapping this protein overlaps a larger
           ORF that is thought to be the polymerase.
          Length = 458

 Score = 31.6 bits (72), Expect = 2.7
 Identities = 16/71 (22%), Positives = 19/71 (26%), Gaps = 8/71 (11%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIP 611
                +P  P       P     P P P  R   LP    PR         R  L     
Sbjct: 247 HPHTTRPSPPRPAFSRSPSSPLSPLPRPSTRRGLLPNPRLPR-------ASRGHLPPPTS 299

Query: 612 -RPYFPNVSVQ 621
             P  PN  ++
Sbjct: 300 SAPPRPNGGLR 310



 Score = 30.4 bits (69), Expect = 6.1
 Identities = 19/53 (35%), Positives = 20/53 (37%), Gaps = 7/53 (13%)

Query: 549 REEPSSPRKPEAPSSCIPP-PPPPAS-----LPPPPPPRRS-SDLPYLTCPRP 594
              PS PR   + S   P  P P  S     LP P  PR S   LP  T   P
Sbjct: 250 TTRPSPPRPAFSRSPSSPLSPLPRPSTRRGLLPNPRLPRASRGHLPPPTSSAP 302



 Score = 30.1 bits (68), Expect = 7.0
 Identities = 20/68 (29%), Positives = 26/68 (38%), Gaps = 6/68 (8%)

Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPPP------RRSSDLPYLTCPRPLVRCRQP 601
             E+P  PR P+ PS        P  L            RRS  LP L+  RP+    +P
Sbjct: 106 PGEQPKLPRAPQLPSHPRRHRSLPHHLHHSSKHRNRLHARRSDVLPSLSNSRPVHELPKP 165

Query: 602 PRSKLKSR 609
               L+ R
Sbjct: 166 STPVLQPR 173


>gnl|CDD|217565 pfam03449, GreA_GreB_N, Transcription elongation factor,
           N-terminal.  This domain adopts a long alpha-hairpin
           structure.
          Length = 71

 Score = 28.6 bits (65), Expect = 2.7
 Identities = 18/61 (29%), Positives = 33/61 (54%), Gaps = 8/61 (13%)

Query: 459 LAKKEQELDQ-KTQEKEDLDESLSRIKDRLE----KETAGHLEAKQKVAELEDRISELKH 513
             K ++EL++ K  E+ ++ ++   I +  E     E A +  AK++  E+E RI EL+ 
Sbjct: 8   YEKLQEELERLKNVERPEIIKA---IAEAREHGDLSENAEYHAAKEEQREIEARIRELED 64

Query: 514 R 514
           R
Sbjct: 65  R 65


>gnl|CDD|153364 cd07680, F-BAR_PACSIN1, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Protein kinase C and
           Casein kinase Substrate in Neurons 1 (PACSIN1).  F-BAR
           domains are dimerization modules that bind and bend
           membranes and are found in proteins involved in membrane
           dynamics and actin reorganization. Protein kinase C and
           Casein kinase Substrate in Neurons (PACSIN) proteins,
           also called Synaptic dynamin-associated proteins
           (Syndapins), act as regulators of cytoskeletal and
           membrane dynamics. Vetebrates harbor three isoforms with
           distinct expression patterns and specific functions.
           PACSIN 1 or Syndapin I is expressed specifically in the
           brain and is localized in neurites and synaptic boutons.
           It binds the brain-specific proteins dynamin I,
           synaptojanin, synapsin I, and neural Wiskott-Aldrich
           syndrome protein (nWASP), and functions as a link
           between the cytoskeletal machinery and synaptic vesicle
           endocytosis. PACSIN 1 interacts with huntingtin and may
           be implicated in the neuropathology of Huntington's
           disease. It contains an N-terminal F-BAR domain and a
           C-terminal SH3 domain. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
          Length = 258

 Score = 31.2 bits (70), Expect = 2.7
 Identities = 18/78 (23%), Positives = 37/78 (47%), Gaps = 2/78 (2%)

Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
           KK  HL  KEE++   ++   + E        +  K + ++D+  Q+ +   E   ++ D
Sbjct: 139 KKAYHLACKEEKLAMTREANSKAEQSVT--PEQQKKLQDKVDKCKQDVQKTQEKYEKVLD 196

Query: 486 RLEKETAGHLEAKQKVAE 503
            + K T  ++E  ++V E
Sbjct: 197 DVGKTTPQYMENMEQVFE 214


>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
           role in the biomineralisation of teeth.  They seem to
           regulate formation of crystallites during the secretory
           stage of tooth enamel development and are thought to
           play a major role in the structural organisation and
           mineralisation of developing enamel. The extracellular
           matrix of the developing enamel comprises two major
           classes of protein: the hydrophobic amelogenins and the
           acidic enamelins. Circular dichroism studies of porcine
           amelogenin have shown that the protein consists of 3
           discrete folding units: the N-terminal region appears to
           contain beta-strand structures, while the C-terminal
           region displays characteristics of a random coil
           conformation. Subsequent studies on the bovine protein
           have indicated the amelogenin structure to contain a
           repetitive beta-turn segment and a "beta-spiral" between
           Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
           rich region. The beta-spiral offers a probable site for
           interactions with Ca2+ ions. Muatations in the human
           amelogenin gene (AMGX) cause X-linked hypoplastic
           amelogenesis imperfecta, a disease characterised by
           defective enamel. A 9bp deletion in exon 2 of AMGX
           results in the loss of codons for Ile5, Leu6, Phe7 and
           Ala8, and replacement by a new threonine codon,
           disrupting the 16-residue (Met1-Ala16) amelogenin signal
           peptide.
          Length = 165

 Score = 30.5 bits (69), Expect = 2.7
 Identities = 16/45 (35%), Positives = 20/45 (44%)

Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPL 595
           +P  P++P  P   + P PP    PP PP      LP L    PL
Sbjct: 106 QPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPLLPDLPL 150


>gnl|CDD|222689 pfam14335, DUF4391, Domain of unknown function (DUF4391).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 220 and
           257 amino acids in length.
          Length = 221

 Score = 30.7 bits (70), Expect = 2.8
 Identities = 20/89 (22%), Positives = 37/89 (41%), Gaps = 10/89 (11%)

Query: 396 WLLLDRLAQQLTLQTETGEDYDHILLKINVKKV-----VHLLAKEEEIVAAKKKADELEM 450
           W  LD L+  L         Y+ I+ +I          V+     +E V   ++ +ELE 
Sbjct: 130 WAFLDSLSLSLFSGLNLDALYESIVNQIIKLNASSNTGVYEKEDLKERVERLEQIEELEK 189

Query: 451 ENVDLSNRLAK-----KEQELDQKTQEKE 474
           E   L  +L K     ++ EL+ + ++ +
Sbjct: 190 EIAKLKKKLKKEKQFNRKVELNLEIKKLK 218


>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein.  This is a
           family of proteins which appears to have a highly
           conserved zinc finger domain at the C terminal end,
           described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
           predicted to contain a coiled coil. Members are
           annotated as being tumour-associated antigen HCA127 in
           humans but this could not confirmed.
          Length = 215

 Score = 31.0 bits (70), Expect = 2.9
 Identities = 31/151 (20%), Positives = 55/151 (36%), Gaps = 20/151 (13%)

Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
           K    + A+EE + + +K   E + E  +L     +  +EL Q   +  D++  + + K 
Sbjct: 15  KLKDEIKAEEEALESEEKHLKEYDKEMEELLEEKMQHVEELRQIHADINDMETEIKQSKS 74

Query: 486 RLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSR 545
            LE+          K+A +    + LK   +  +     LE L      P   +      
Sbjct: 75  ELER-------RMGKIARMHGEYNPLKESINEMRKLELGLEEL------PQLHEEERQKI 121

Query: 546 SGGREEPSSPRKPEAPSSCIPPPPPPASLPP 576
                  +SP  P        PP  P++LP 
Sbjct: 122 ----SPATSPVPPVPLP---DPPAFPSTLPA 145


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
            This is a family of fungal proteins whose function is
            unknown.
          Length = 130

 Score = 29.9 bits (68), Expect = 2.9
 Identities = 6/29 (20%), Positives = 16/29 (55%)

Query: 986  RREEEEKRARQDAEMKKRTTDRKQGAKEN 1014
            R +E  ++ ++  + +++   R+Q  KE 
Sbjct: 102  RTKERAEKEKRTRKNREKKFKRRQKEKEK 130


>gnl|CDD|216989 pfam02354, Latrophilin, Latrophilin Cytoplasmic C-terminal region. 
           This family consists of the cytoplasmic C-terminal
           region in latrophilin. Latrophilin is a synaptic Ca2+
           independent alpha- latrotoxin (LTX) receptor and is a
           novel member of the secretin family of G-protein coupled
           receptors that are involved in secretion. Latrophilin
           mRNA is present only in neuronal tissue. Lactrophillin
           interacts with G-alpha O.
          Length = 366

 Score = 31.2 bits (70), Expect = 3.0
 Identities = 14/56 (25%), Positives = 19/56 (33%), Gaps = 4/56 (7%)

Query: 549 REEPSSPR-KPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPR 603
           R+ P+ P   P   +  +PPPPP               +P L     L    Q  R
Sbjct: 285 RDSPTYPSSSPSTQTEALPPPPPA---KCGDAEIYYKSMPNLGARNQLQTYYQLGR 337


>gnl|CDD|236658 PRK10153, PRK10153, DNA-binding transcriptional activator CadC;
           Provisional.
          Length = 517

 Score = 31.6 bits (72), Expect = 3.0
 Identities = 12/46 (26%), Positives = 20/46 (43%)

Query: 567 PPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPR 612
           PPP P ++P    P  S++  +   P     C+ P +SK  +    
Sbjct: 120 PPPIPEAVPATAEPSESANAQFPAPPLTRAVCQSPAKSKRFTTFWV 165


>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
           binding.  This is the middle region of a family of TATA
           element modulatory factor 1 proteins conserved in
           eukaryotes that contains at its N-terminal section a
           number of leucine zippers that could potentially form
           coiled coil structures. The whole proteins bind to the
           TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells.
          Length = 74

 Score = 28.8 bits (65), Expect = 3.1
 Identities = 13/46 (28%), Positives = 21/46 (45%)

Query: 432 LAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLD 477
           L     I   + K  ELE E  +L  +L K E+EL+   +  +  +
Sbjct: 29  LKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELENLEERLKRAE 74



 Score = 28.4 bits (64), Expect = 3.3
 Identities = 17/83 (20%), Positives = 40/83 (48%), Gaps = 13/83 (15%)

Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
             LA+++E +A   +    E E   LS +  K    + +   + ++L++ ++ +K +LEK
Sbjct: 5   KKLAEKDEQIAQLME----EGEK--LSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEK 58

Query: 490 ETAGHLEAKQKVAELEDRISELK 512
                    +++  LE+R+   +
Sbjct: 59  LE-------KELENLEERLKRAE 74


>gnl|CDD|234749 PRK00405, rpoB, DNA-directed RNA polymerase subunit beta; Reviewed.
          Length = 1112

 Score = 31.6 bits (73), Expect = 3.1
 Identities = 15/33 (45%), Positives = 17/33 (51%), Gaps = 1/33 (3%)

Query: 671 ISKSKILSV-IDGRRAQNCTILLSKLKMTDEEI 702
                IL V ID RR    T+LL  L  +DEEI
Sbjct: 181 FDPKDILYVRIDRRRKLPVTVLLRALGYSDEEI 213


>gnl|CDD|193472 pfam12999, PRKCSH-like, Glucosidase II beta subunit-like.  The
           sequences found in this family are similar to a region
           found in the beta-subunit of glucosidase II, which is
           also known as protein kinase C substrate 80K-H (PRKCSH).
           The enzyme catalyzes the sequential removal of two
           alpha-1,3-linked glucose residues in the second step of
           N-linked oligosaccharide processing. The beta subunit is
           required for the solubility and stability of the
           heterodimeric enzyme, and is involved in retaining the
           enzyme within the endoplasmic reticulum.
          Length = 176

 Score = 30.5 bits (69), Expect = 3.1
 Identities = 19/63 (30%), Positives = 26/63 (41%), Gaps = 7/63 (11%)

Query: 411 ETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKT 470
           E    ++  L + N      L  KE  ++AA+KK DE       L  RL + E  L  K 
Sbjct: 121 EIARQFEEYLTEHNNSVKNGLKIKEGLLLAAQKKRDE-------LKKRLKELEDSLPAKQ 173

Query: 471 QEK 473
           Q  
Sbjct: 174 QRL 176


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 31.5 bits (72), Expect = 3.1
 Identities = 20/89 (22%), Positives = 40/89 (44%), Gaps = 12/89 (13%)

Query: 447 ELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDR------------LEKETAGH 494
           E+ ++  +L   L + E+EL +  +E E + + L ++  +            LEK     
Sbjct: 324 EVGVDFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETK 383

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERK 523
            +  +++ ELE+ + ELK   +    E K
Sbjct: 384 EKLSEELEELEEELKELKEELESLYSEGK 412


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 31.0 bits (70), Expect = 3.1
 Identities = 25/101 (24%), Positives = 36/101 (35%), Gaps = 18/101 (17%)

Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
           KK      K+ E  A  K A E            AKK+    +K  E E   ++ ++ K 
Sbjct: 142 KKAKEEAKKQAEEEAKAKAAAE------------AKKKAAEAKKKAEAEAKAKAEAKAKA 189

Query: 486 RLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
           + E+  A    AK K A       E   +A+ E       E
Sbjct: 190 KAEEAKAKAEAAKAKAAA------EAAAKAEAEAAAAAAAE 224


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 30.9 bits (70), Expect = 3.2
 Identities = 22/69 (31%), Positives = 39/69 (56%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           EEE    +KKADELE EN  L    A  E+E ++   E ++    ++++++  EK+ A  
Sbjct: 46  EEEAQLLEKKADELEEENRRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAET 105

Query: 495 LEAKQKVAE 503
            + +Q++ E
Sbjct: 106 RQLQQELRE 114



 Score = 30.1 bits (68), Expect = 5.9
 Identities = 25/86 (29%), Positives = 45/86 (52%), Gaps = 7/86 (8%)

Query: 441 AKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQK 500
           A+++  ELE     +   + + ++EL++  +   +L+E L +     E+E A  LE K  
Sbjct: 3   AEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQ-----EEEEAQLLEKKAD 57

Query: 501 VAELEDRISELKHRADFEQDERKRLE 526
             ELE+    L+  A   ++ER+RLE
Sbjct: 58  --ELEEENRRLEEEAAASEEERERLE 81


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 31.2 bits (71), Expect = 3.2
 Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 7/87 (8%)

Query: 523 KRLENLINSGSVPDDTKAVGLSR-SGGREEPSSPRKPEA--PSSCIPPPPPPASLPPPPP 579
           KR ++  +     D+   VG  R       P++ ++ EA  PS      PP AS  P  P
Sbjct: 47  KRDDDSYDEDVEDDE--GVGEVRVHRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQP 104

Query: 580 PRRSSD--LPYLTCPRPLVRCRQPPRS 604
            ++  +  +P    PRP     QP + 
Sbjct: 105 VQQPPEAQVPPQHAPRPAQPAPQPVQQ 131


>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain.  This region of the APC
           family of proteins is known as the basic domain. It
           contains a high proportion of positively charged amino
           acids and interacts with microtubules.
          Length = 359

 Score = 31.2 bits (70), Expect = 3.3
 Identities = 19/56 (33%), Positives = 24/56 (42%), Gaps = 3/56 (5%)

Query: 549 REEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRS 604
           R + ++P K + P     P  PP S  P P  +RS  L     P  L     PPRS
Sbjct: 17  RSQSTTPSK-KGPPLKTQPSDPPKS--PSPGQQRSRSLHRPAKPSELAELSPPPRS 69


>gnl|CDD|225499 COG2948, VirB10, Type IV secretory pathway, VirB10 components
           [Intracellular trafficking and secretion].
          Length = 360

 Score = 30.9 bits (70), Expect = 3.3
 Identities = 12/44 (27%), Positives = 18/44 (40%), Gaps = 4/44 (9%)

Query: 540 AVGLSRSGGREEPSSPRKP----EAPSSCIPPPPPPASLPPPPP 579
              L    G ++  +  +P    E  +  +PP P    LPPP P
Sbjct: 40  GFALIALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPPPLP 83


>gnl|CDD|216319 pfam01134, GIDA, Glucose inhibited division protein A. 
          Length = 391

 Score = 31.0 bits (71), Expect = 3.3
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query: 630 LDESKLYASMELEVVDKLFCAYQKNGV 656
           +D  +L  ++E + +  LF A Q NG 
Sbjct: 340 IDPPQLLPTLETKKIPGLFFAGQINGT 366


>gnl|CDD|218545 pfam05300, DUF737, Protein of unknown function (DUF737).  This
           family consists of several uncharacterized mammalian
           proteins of unknown function.
          Length = 187

 Score = 30.5 bits (69), Expect = 3.3
 Identities = 20/69 (28%), Positives = 36/69 (52%), Gaps = 8/69 (11%)

Query: 458 RLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADF 517
           R+AK+E+E  ++   +  L E  S  ++R +        AK    +LE++ +ELK    F
Sbjct: 101 RIAKREREAAEEQLSRAVLREKASAEQERQK--------AKHLARQLEEKEAELKRLDAF 152

Query: 518 EQDERKRLE 526
            +++  RLE
Sbjct: 153 YKEQLARLE 161


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 31.1 bits (70), Expect = 3.3
 Identities = 12/59 (20%), Positives = 19/59 (32%), Gaps = 4/59 (6%)

Query: 550 EEPSSPRKPEAPSSCIPPP----PPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRS 604
              +SP     PS+   PP    P P++    P    + +      P        PP +
Sbjct: 213 RTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPPAN 271



 Score = 30.3 bits (68), Expect = 6.8
 Identities = 8/36 (22%), Positives = 11/36 (30%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLP 587
           P    +  A     P P    S P    P  S+ + 
Sbjct: 208 PRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIA 243


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 30.5 bits (69), Expect = 3.3
 Identities = 26/108 (24%), Positives = 49/108 (45%), Gaps = 6/108 (5%)

Query: 440 AAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQ 499
           A  KK+ + + E         KK +  D KT+E E+  ++     D  E  +A  L    
Sbjct: 63  ATTKKSKKKDKE---KLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSDID 119

Query: 500 KVAELEDRISELKHRADFEQDERKRLENLINS-GSVPDDTKAVGLSRS 546
              +++D  S+LK   +    + K+L   +++  +V +D +A  +SR 
Sbjct: 120 NEDDMDD--SDLKLATENIIKDLKKLNARVSAVSTVLEDVQAASISRQ 165


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 30.8 bits (70), Expect = 3.4
 Identities = 16/78 (20%), Positives = 37/78 (47%), Gaps = 7/78 (8%)

Query: 454 DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKH 513
            LS+++  ++ +L +  +EK+++   +  + +++E       E + K+ EL+  I + K 
Sbjct: 28  LLSDKIQNQDSKLSELQKEKKNIQNEIESLDNQIE-------EIQSKIDELQKEIDQSKA 80

Query: 514 RADFEQDERKRLENLINS 531
                Q E   L+  I  
Sbjct: 81  EIKKLQKEIAELKENIVE 98



 Score = 30.1 bits (68), Expect = 6.0
 Identities = 22/101 (21%), Positives = 42/101 (41%), Gaps = 1/101 (0%)

Query: 426 KKVVHLLAKEEEIVAAKKKADELEMEN-VDLSNRLAKKEQELDQKTQEKEDLDESLSRIK 484
           KK++    ++++ +  K+ A E ++E  V L N L  +   L+ +  EK  L  +L+  +
Sbjct: 147 KKILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKE 206

Query: 485 DRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRL 525
                E A   E K          ++ +  A     E+  L
Sbjct: 207 ASALGEKAALEEQKALAEAAAAEAAKQEAAAKAAAQEQAAL 247


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 31.0 bits (70), Expect = 3.5
 Identities = 13/51 (25%), Positives = 16/51 (31%), Gaps = 3/51 (5%)

Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQP 601
            P SP    AP        P      P      ++ P    PR   R R+P
Sbjct: 20  APPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPR---RERKP 67



 Score = 31.0 bits (70), Expect = 4.0
 Identities = 18/60 (30%), Positives = 24/60 (40%), Gaps = 1/60 (1%)

Query: 557 KPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPYFP 616
           K EA      PP P A+  PPPP + ++       P       + P+ K K R  R   P
Sbjct: 11  KGEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPK-KDKPRRERKPKP 69



 Score = 30.7 bits (69), Expect = 4.6
 Identities = 12/55 (21%), Positives = 20/55 (36%)

Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
           G  E  +P  P   ++  PPPP   + P       ++  P    P+     R+  
Sbjct: 12  GEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERK 66



 Score = 30.7 bits (69), Expect = 4.9
 Identities = 14/55 (25%), Positives = 17/55 (30%), Gaps = 3/55 (5%)

Query: 532 GSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPP---PPPASLPPPPPPRRS 583
           G    +  A           P  P K  AP++    P    P A  P    PRR 
Sbjct: 10  GKGEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRE 64


>gnl|CDD|221518 pfam12301, CD99L2, CD99 antigen like protein 2.  This family of
           proteins is found in eukaryotes. Proteins in this family
           are typically between 165 and 237 amino acids in length.
           CD99L2 and CD99 are involved in trans-endothelial
           migration of neutrophils in vitro and in the recruitment
           of neutrophils into inflamed peritoneum.
          Length = 154

 Score = 30.1 bits (68), Expect = 3.5
 Identities = 10/47 (21%), Positives = 15/47 (31%), Gaps = 12/47 (25%)

Query: 551 EPSSPRKPEA------------PSSCIPPPPPPASLPPPPPPRRSSD 585
           +P++P+KP                +     P P   P P P     D
Sbjct: 18  KPATPKKPSKGDDDFDLEDALDGGNDGGAGPKPKPRPQPDPGNPGGD 64


>gnl|CDD|149849 pfam08912, Rho_Binding, Rho Binding.  Rho Binding Domain is
           responsible for the recognition and binding of Rho
           binding domain-containing proteins (such as ROCK) to
           Rho, resulting in activation of the GTPase which in turn
           modulates the phosphorylation of various signalling
           proteins. This domain is within an amphipathic
           alpha-helical coiled-coil and interacts with Rho through
           predominantly hydrophobic interactions.
          Length = 68

 Score = 28.2 bits (63), Expect = 3.5
 Identities = 21/70 (30%), Positives = 36/70 (51%), Gaps = 8/70 (11%)

Query: 442 KKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK----ETAGHLEA 497
            K    L  E  +L+N+L K ++EL +  +E+    +S++ IK   EK    E     +A
Sbjct: 1   TKDVANLANEKEELNNKLKKAQEELQKLKEEE----QSINNIKAAFEKQLQTERTLKTQA 56

Query: 498 KQKVAELEDR 507
             K+AE+ +R
Sbjct: 57  VNKLAEIMNR 66


>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
          Length = 1000

 Score = 31.2 bits (70), Expect = 3.6
 Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 9/48 (18%)

Query: 544 SRSGGREEPSSPRKPEAP-----SSCI----PPPPPPASLPPPPPPRR 582
            R G  ++PS P +P        ++ I    P  PP  ++ P PPP R
Sbjct: 451 ERPGPSDQPSVPVEPAHLTPVEHTTVILHQPPQSPPTVAIKPAPPPSR 498


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 31.4 bits (72), Expect = 3.6
 Identities = 16/72 (22%), Positives = 27/72 (37%), Gaps = 2/72 (2%)

Query: 536 DDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPP--PPPPRRSSDLPYLTCPR 593
            +      + +   +  ++P    AP      PPPPAS P   P  P   +    L   +
Sbjct: 368 PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQ 427

Query: 594 PLVRCRQPPRSK 605
            L R +   ++K
Sbjct: 428 QLQRAQGATKAK 439


>gnl|CDD|184538 PRK14149, PRK14149, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 30.2 bits (68), Expect = 3.7
 Identities = 24/81 (29%), Positives = 35/81 (43%), Gaps = 9/81 (11%)

Query: 446 DELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELE 505
           DE   E+  LS     KE E ++K    ++      +     E E     E K K  E+ 
Sbjct: 3   DEHNQEHDHLSQ----KEPESEEKACACKEQQGEEKQEASEKEGEIKEDFELKYK--EMH 56

Query: 506 DRISELKHRADFEQDERKRLE 526
           +    L+  ADFE + +KRLE
Sbjct: 57  E--KYLRVHADFE-NVKKRLE 74


>gnl|CDD|148061 pfam06229, FRG1, FRG1-like family.  The human FRG1 gene maps to
           human chromosome 4q35 and has been identified as a
           candidate for facioscapulohumeral muscular dystrophy.
           Currently, the function of FRG1 is unknown.
          Length = 189

 Score = 30.5 bits (69), Expect = 3.7
 Identities = 19/93 (20%), Positives = 34/93 (36%), Gaps = 15/93 (16%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLD----------ESLSRIK 484
             +IVA  K A E EM  +  S+   + +     + + +              + +   K
Sbjct: 95  AGDIVAKSKTAGEGEMVEI-RSDASREFKTTDRIRMEARFKPRIKESKYVKKFQKIQDKK 153

Query: 485 DRLEKETAGHLEAKQKV----AELEDRISELKH 513
            RL  +    L+  +       EL DR  ++KH
Sbjct: 154 LRLSDDEVKRLKKARAEGNLHEELLDRRVKMKH 186


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 31.1 bits (71), Expect = 3.7
 Identities = 15/60 (25%), Positives = 18/60 (30%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIP 611
            S P     P    P    P   PPPPPP  +          P      PP  +  +  P
Sbjct: 529 KSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSP 588


>gnl|CDD|192111 pfam08647, BRE1, BRE1 E3 ubiquitin ligase.  BRE1 is an E3 ubiquitin
           ligase that has been shown to act as a transcriptional
           activator through direct activator interactions.
          Length = 96

 Score = 28.8 bits (65), Expect = 3.7
 Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 4/57 (7%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKET 491
           +++  AA +  D L+ EN  L N    K  EL ++ ++ E  +E   RIK+ LEK+ 
Sbjct: 44  DQKYFAAMRSKDALDAENKKL-NTQLNKSSELIEQLKDLE--NELRRRIKE-LEKQL 96


>gnl|CDD|239122 cd02657, Peptidase_C19A, A subfamily of Peptidase C19. Peptidase
           C19 contains ubiquitinyl hydrolases. They are
           intracellular peptidases that remove ubiquitin molecules
           from polyubiquinated peptides by cleavage of isopeptide
           bonds. They hydrolyse bonds involving the carboxyl group
           of the C-terminal Gly residue of ubiquitin. The purpose
           of the de-ubiquitination is thought to be editing of the
           ubiquitin conjugates, which could rescue them from
           degradation, as well as recycling of the ubiquitin. The
           ubiquitin/proteasome system is responsible for most
           protein turnover in the mammalian cell, and with over 50
           members, family C19 is one of the largest families of
           peptidases in the human genome.
          Length = 305

 Score = 30.8 bits (70), Expect = 3.8
 Identities = 17/47 (36%), Positives = 19/47 (40%), Gaps = 1/47 (2%)

Query: 800 LRKLLELVLALGNYMNKGARGNASGFRLT-SLNKLVDTKSSSSRGVT 845
           LR + EL  AL NY       N S   LT +L  L DT       V 
Sbjct: 18  LRSVPELRDALKNYNPARRGANQSSDNLTNALRDLFDTMDKKQEPVP 64


>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
           (TBPIP).  This family consists of several eukaryotic
           TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
           been demonstrated to interact with the human
           immunodeficiency virus type 1 (HIV-1) viral protein Tat,
           then modulate the essential replication process of HIV.
           In addition, TBP-1 has been shown to be a component of
           the 26S proteasome, a basic multiprotein complex that
           degrades ubiquitinated proteins in an ATP-dependent
           fashion. Human TBPIP interacts with human TBP-1 then
           modulates the inhibitory action of human TBP-1 on
           HIV-Tat-mediated transactivation.
          Length = 169

 Score = 30.0 bits (68), Expect = 3.8
 Identities = 17/82 (20%), Positives = 40/82 (48%), Gaps = 6/82 (7%)

Query: 447 ELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELED 506
           +L+ME  +L   +   +Q+      E + L   L+   + L++E     E K++V E+E+
Sbjct: 76  KLDMEIEELREEVQLLKQDCSTLEIELKSLTSDLT--TEELQEEIQ---ELKKEVREIEE 130

Query: 507 RISELKH-RADFEQDERKRLEN 527
           ++  L+        +E ++++ 
Sbjct: 131 KLESLEEGWKPVTPEEMEKVKK 152


>gnl|CDD|225950 COG3416, COG3416, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 233

 Score = 30.6 bits (69), Expect = 3.8
 Identities = 35/141 (24%), Positives = 52/141 (36%), Gaps = 43/141 (30%)

Query: 468 QKTQEKEDLDESLSRIKDRLEKE-TAGHL-------------EAKQKVAELEDRISELKH 513
           +K +  E   ++ + I + + K+  A +              +A  ++ ELE RI+ L  
Sbjct: 17  KKAEANERDPQAEALIAEAVAKQPDAAYYLAQRVLIQEQALKKASTQIKELEKRIAIL-- 74

Query: 514 RADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPAS 573
               +  E         SGS         LS +     P  P    AP +  PPP  PA+
Sbjct: 75  ----QAGEAG-------SGSF--------LSNAFKWGTPQEP----APPANAPPPKEPAA 111

Query: 574 LPPPPPPRRSSDLPYLTCPRP 594
               PP  RSS     T P P
Sbjct: 112 ----PPSWRSSPAGPTTQPSP 128


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 31.1 bits (71), Expect = 3.9
 Identities = 24/98 (24%), Positives = 45/98 (45%), Gaps = 20/98 (20%)

Query: 434 KEEEIVAAKKKADELEMENVDLS--------NRLAKKEQELDQKTQEKEDLDE------- 478
           K EE+    ++  +LE+E   L          RL   E+EL +  +E  DL+E       
Sbjct: 403 KPEELDELDRRIIQLEIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKA 462

Query: 479 ---SLSRIKDRLEKETAGHLEAKQKVAELEDRISELKH 513
               + +IK+ +E+     LE  ++  +L  + +EL++
Sbjct: 463 AIQGIQQIKEEIEQ-VRLELEQAEREGDLA-KAAELQY 498


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 31.1 bits (70), Expect = 4.0
 Identities = 20/102 (19%), Positives = 42/102 (41%), Gaps = 8/102 (7%)

Query: 421 LKINVKKVVHLLAKEEEIVAAKKKA----DELEMENVDLSNRLAKKEQE---LDQKTQEK 473
           L+ + +K   L  + +E  ++ +      + L  +  +   R+ + EQ+   L QK QE 
Sbjct: 180 LRHSTEKTEELEEQHKEAQSSSESMSAERNALLAQRAENQQRILELEQDIQTLTQKKQEN 239

Query: 474 EDLDESLSRIKDRLEKETAGHLE-AKQKVAELEDRISELKHR 514
           + + E    I+  LE+      +  K+   +  D  +E    
Sbjct: 240 DRVLEGTQDIEAELERMKGELKQRLKKMTIQRRDEETERIDL 281


>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI.  This minor capsid
           protein may act as a link between the external capsid
           and the internal DNA-protein core. The C-terminal 11
           residues may function as a protease cofactor leading to
           enzyme activation.
          Length = 238

 Score = 30.5 bits (69), Expect = 4.0
 Identities = 15/74 (20%), Positives = 23/74 (31%), Gaps = 3/74 (4%)

Query: 514 RADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPAS 573
           R   E+D  K LE ++         + V  +      +P  PR           P PP+ 
Sbjct: 95  RQKVEKDLEKLLEKVLGEEEPAPQEETV--ADPIQALQPR-PRPDVEEVLVPAAPEPPSY 151

Query: 574 LPPPPPPRRSSDLP 587
                P     + P
Sbjct: 152 EETIKPGPAPVEEP 165


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 30.3 bits (69), Expect = 4.1
 Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 17/103 (16%)

Query: 435 EEEIVAAKKKAD-------ELEMENVDLSNRLAKKEQE---LDQKTQEKEDLDESLSRIK 484
           +EEI   KK  +       E+  EN  L   L K E+E   L +K ++ E   +SL  +K
Sbjct: 33  KEEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLK 92

Query: 485 DRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLEN 527
            RL+       E ++++  L+     L+ R +  + ER  L +
Sbjct: 93  ARLK-------ELEKELKNLKWESEVLEQRFEKVERERDELYD 128


>gnl|CDD|223831 COG0760, SurA, Parvulin-like peptidyl-prolyl isomerase
            [Posttranslational modification, protein turnover,
            chaperones].
          Length = 320

 Score = 30.8 bits (69), Expect = 4.2
 Identities = 22/170 (12%), Positives = 48/170 (28%), Gaps = 8/170 (4%)

Query: 876  KVSMKELENDMTQLRNNLKEVSREIEF----QRVQPLVPGDRFLPVMKEFLTSATCRLSE 931
            K+S + +   + ++    K     +      Q  Q +     F  V  +F   A  +   
Sbjct: 81   KISQQVVAQLIAEIAVLDKLAKELLLGLSDEQGKQLIAEKPAFQDVEGKFDARAYLKRLR 140

Query: 932  IEDLFQDMKTRFDRAVRLFGEDNSSIQPEDFFVIFDTFLTTLNEAKLDNENMKRRREEEE 991
            I+ L ++ +    R            +  +        L        +   + ++   E 
Sbjct: 141  IKGLTEEQEAEALR--DKLQNKQQGKKVTEVQARH--ILVKAEAKAKEALALLKKGVREA 196

Query: 992  KRARQDAEMKKRTTDRKQGAKENGVVISNGSVKGEFDDLISALRTGDVFG 1041
            K    +   K+      +           G +  EF      L+ G+V  
Sbjct: 197  KADFAELAKKQSEDPSSKNGGGLLGWNKKGQLVPEFRKAAFILKVGEVSA 246


>gnl|CDD|203593 pfam07200, Mod_r, Modifier of rudimentary (Mod(r)) protein.  This
           family represents a conserved region approximately 150
           residues long within a number of eukaryotic proteins
           that show homology with Drosophila melanogaster Modifier
           of rudimentary (Mod(r)) proteins. The N-terminal half of
           Mod(r) proteins is acidic, whereas the C-terminal half
           is basic, and both of these regions are represented in
           this family. Members of this family include the Vps37
           subunit of the endosomal sorting complex ESCRT-I, a
           complex involved in recruiting transport machinery for
           protein sorting at the multivesicular body (MVB). The
           yeast ESCRT-I complex consists of three proteins (Vps23,
           Vps28 and Vps37). The mammalian homologue of Vps37
           interacts with Tsg101 (Pfam: PF05743) through its mod(r)
           domain and its function is essential for lysosomal
           sorting of EGF receptors.
          Length = 150

 Score = 29.6 bits (67), Expect = 4.2
 Identities = 14/77 (18%), Positives = 34/77 (44%), Gaps = 5/77 (6%)

Query: 414 EDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEK 473
           +     +L++   + +       E  A   + + L  EN++   +L + + +L  K +E 
Sbjct: 17  DALLEFVLELPQVQDL-----NLEKDALLAQNESLAEENLEKEPQLEELKAQLRDKYEEL 71

Query: 474 EDLDESLSRIKDRLEKE 490
           + L+E     K +L++ 
Sbjct: 72  KKLEEKYEEKKQQLDEL 88


>gnl|CDD|192662 pfam10738, Lpp-LpqN, Probable lipoprotein LpqN.  This family is
           conserved in Mycobacteriaceae and is likely to be a
           lipoprotein.
          Length = 175

 Score = 30.0 bits (68), Expect = 4.2
 Identities = 16/50 (32%), Positives = 20/50 (40%)

Query: 531 SGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPP 580
              VP  T A  L  +G   EP +P    A +  +P PP     P P  P
Sbjct: 1   PVPVPSGTLAQYLEENGVTGEPVAPGDLGALTISLPTPPGWEPYPDPNVP 50


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 31.0 bits (71), Expect = 4.3
 Identities = 28/100 (28%), Positives = 49/100 (49%), Gaps = 18/100 (18%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           E+ +   +K+ D L++E      R+ KK +E  +K+Q +  L E L  I+  L ++    
Sbjct: 195 EKLLDLLEKEIDLLQLEK-----RIRKKVKEQMEKSQREYYLREQLKAIQKELGEDDDDK 249

Query: 495 LEAKQKVAELEDRISEL------KHRADFEQDERKRLENL 528
            E    V EL ++I +L      K +A+    E K+LE +
Sbjct: 250 DE----VEELREKIEKLKLPKEAKEKAE---KELKKLETM 282


>gnl|CDD|237000 PRK11855, PRK11855, dihydrolipoamide acetyltransferase; Reviewed.
          Length = 547

 Score = 30.9 bits (71), Expect = 4.3
 Identities = 7/47 (14%), Positives = 12/47 (25%), Gaps = 3/47 (6%)

Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVR 597
             +      A  +  P      +   P         P+ +   P VR
Sbjct: 207 AAAPAAAAAAAPAPAPAAAAAPAAAAPAAAAAPGKAPHAS---PAVR 250


>gnl|CDD|219490 pfam07627, PSCyt3, Protein of unknown function (DUF1588).  A region
           of similarity shared by several Rhodopirellula baltica
           cytochrome-like proteins that are predicted to be
           secreted. These proteins also match pfam07626 and
           pfam07624.
          Length = 102

 Score = 29.0 bits (66), Expect = 4.4
 Identities = 8/19 (42%), Positives = 10/19 (52%)

Query: 564 CIPPPPPPASLPPPPPPRR 582
             PPPPPP ++P   P   
Sbjct: 36  GDPPPPPPPNVPALEPDDP 54


>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
           (DUF1750).  This is a fungal domain of unknown function.
          Length = 669

 Score = 30.8 bits (69), Expect = 4.5
 Identities = 29/131 (22%), Positives = 47/131 (35%), Gaps = 13/131 (9%)

Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
            +V   +A     +   KK     ME  + ++ L   EQ L     +  D    + R++ 
Sbjct: 367 DRVAKKVADMTAEIEKLKKRHARRMEKFNRTSALKDAEQRLRDAAADPADTGSEIWRLEG 426

Query: 486 RLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKR-LENLINSGSVPDDTKAVGLS 544
           RLE +T               R    +H+A ++ D+  R +E       VP+    V   
Sbjct: 427 RLEIDTEDD----------SSRQGPREHKAKYKVDDIVREVEKSWGKPIVPE--PDVSCV 474

Query: 545 RSGGREEPSSP 555
             GG  E   P
Sbjct: 475 EKGGLLEKIEP 485


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 30.8 bits (69), Expect = 4.5
 Identities = 20/80 (25%), Positives = 28/80 (35%), Gaps = 3/80 (3%)

Query: 540 AVGLSRSGG---REEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLV 596
           A G++ S        PS+   P  P    PP PP + +PP P PR     P    P  + 
Sbjct: 411 AAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVA 470

Query: 597 RCRQPPRSKLKSRIPRPYFP 616
                P +         + P
Sbjct: 471 SASDAPPTLGDPSDTAEHTP 490


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 30.6 bits (70), Expect = 4.5
 Identities = 16/56 (28%), Positives = 27/56 (48%), Gaps = 5/56 (8%)

Query: 432 LAKE--EEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
           L+KE    +   +  A+EL  E  +L  +L + E  LD+    + +LD  L  I +
Sbjct: 55  LSKEIGRALKRGEDDAEELIAEVKELKEKLKELEAALDEL---EAELDTLLLTIPN 107


>gnl|CDD|114474 pfam05750, Rubella_Capsid, Rubella capsid protein.  Rubella virus
           is an enveloped positive-strand RNA virus of the family
           Togaviridae. Virions are composed of three structural
           proteins: a capsid and two membrane-spanning
           glycoproteins, E2 and E1. During virus assembly, the
           capsid interacts with genomic RNA to form nucleocapsids.
           It has been discovered that capsid phosphorylation
           serves to negatively regulate binding of viral genomic
           RNA. This may delay the initiation of nucleocapsid
           assembly until sufficient amounts of virus glycoproteins
           accumulate at the budding site and/or prevent
           non-specific binding to cellular RNA when levels of
           genomic RNA are low. It follows that at a late stage in
           replication, the capsid may undergo dephosphorylation
           before nucleocapsid assembly occurs.
          Length = 300

 Score = 30.6 bits (68), Expect = 4.5
 Identities = 32/134 (23%), Positives = 50/134 (37%), Gaps = 15/134 (11%)

Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDT--------KAVGLSRS 546
           +E  QK  E + R    +  A   Q  R R     +S +  DD+        +  G    
Sbjct: 9   MEDLQKALEAQSRALRAELAAGASQSRRPRPPRQRDSSTSGDDSGRDSGGPRRRRGNRGR 68

Query: 547 GGREE----PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
           G R++    P  P + +   S  P P P  + P  P P R       + PRP +    PP
Sbjct: 69  GQRKDWSRAPPPPEERQESRSQTPAPKPSRAPPQQPQPPRMQTGRGGSAPRPEL---GPP 125

Query: 603 RSKLKSRIPRPYFP 616
            +  ++ + R   P
Sbjct: 126 TNPFQAAVARGLRP 139


>gnl|CDD|193414 pfam12941, HCV_NS5a_C, HCV NS5a protein C-terminal region.  This is
           a family of proteins found in the hepatitis C virus.
           This family contains the C-terminal region of the NS5A
           protein. CC The molecular function of the non-structural
           5a protein is uncertain. The NS5a protein is
           phosphorylated when expressed in mammalian cells. It is
           thought to interact with the ds RNA dependent
           (interferon inducible) kinase PKR.
          Length = 244

 Score = 30.5 bits (69), Expect = 4.5
 Identities = 11/13 (84%), Positives = 11/13 (84%)

Query: 570 PPASLPPPPPPRR 582
           PPA LPP PPPRR
Sbjct: 140 PPAGLPPVPPPRR 152


>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
           Provisional.
          Length = 418

 Score = 30.8 bits (70), Expect = 4.6
 Identities = 18/72 (25%), Positives = 30/72 (41%), Gaps = 3/72 (4%)

Query: 539 KAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRC 598
            A    ++   +  ++   PE P++  P PP  A  P P P  +    P     R   R 
Sbjct: 133 AAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKP---PPTPVARADPRE 189

Query: 599 RQPPRSKLKSRI 610
            + P S+++ RI
Sbjct: 190 TRVPMSRMRQRI 201


>gnl|CDD|223448 COG0371, GldA, Glycerol dehydrogenase and related enzymes [Energy
           production and conversion].
          Length = 360

 Score = 30.7 bits (70), Expect = 4.7
 Identities = 15/65 (23%), Positives = 27/65 (41%), Gaps = 3/65 (4%)

Query: 650 AYQKNGVPADGSVEDLTKLGKISKSKILSVIDGRRAQNCTILLSKLKMTDEEISKAILSM 709
             +K G+P   ++ +L  L      + L+V    R +  TIL     +T E    A+ + 
Sbjct: 298 FLKKVGLPT--TLAELG-LDDDEVIEALTVAHAIRPERETILGMPFGLTPEAARAALEAT 354

Query: 710 DSADQ 714
           +   Q
Sbjct: 355 EVIGQ 359


>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are about 260 amino acids in
           length.
          Length = 250

 Score = 30.3 bits (69), Expect = 4.8
 Identities = 20/100 (20%), Positives = 39/100 (39%), Gaps = 18/100 (18%)

Query: 432 LAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKET 491
                EI    K+ + L + N  L   +A ++QE+         L + + +I ++  +  
Sbjct: 53  QELLAEIRQLLKEIENLRVYNDQLQRLVANQQQEIA-------SLQQQIEQI-EKTRQGL 104

Query: 492 AGHLEAKQKVAELEDRISELKHRAD--FEQDER-KRLENL 528
              +   + +  LE  +       D  F ++ER  RL  L
Sbjct: 105 VPLMY--RMIDGLEQFV-----ALDLPFLKEERLARLARL 137


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 30.7 bits (70), Expect = 5.0
 Identities = 18/69 (26%), Positives = 31/69 (44%), Gaps = 8/69 (11%)

Query: 443 KKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVA 502
            K D  + E   L  +LA+   +L Q   E       L  +KD  ++ET   L +   + 
Sbjct: 73  DKIDRQKEETEQLKQQLAQAPAKLRQAQAE-------LEALKDDNDEETRETL-STLSLR 124

Query: 503 ELEDRISEL 511
           +LE R+++ 
Sbjct: 125 QLESRLAQT 133


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 30.9 bits (70), Expect = 5.0
 Identities = 35/154 (22%), Positives = 70/154 (45%), Gaps = 26/154 (16%)

Query: 397 LLLDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDL- 455
             L++L Q+   + E   D   I  +   K++  L  + +E+ +   ++  L+ E   L 
Sbjct: 246 SQLEQL-QEENFRLEAARDDYRIKCEELEKELAELQHRNDELTSLAAESQALKDEIDVLR 304

Query: 456 --SNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEA---------------- 497
             S++  K E +++   ++ EDL++   ++K  LE+  A +++                 
Sbjct: 305 ESSDKAKKLEAQVETYKKKLEDLNDLRRQVK-LLEERNAMYMQNTVQLEEELKKANAARG 363

Query: 498 -----KQKVAELEDRISELKHRADFEQDERKRLE 526
                K++V EL  ++SE   +AD  + E KRLE
Sbjct: 364 QLETYKRQVQELHAKLSEESKKADKLEFEYKRLE 397


>gnl|CDD|218307 pfam04880, NUDE_C, NUDE protein, C-terminal conserved region.  This
           family represents the C-terminal conserved region of the
           NUDE proteins. NUDE proteins are involved in nuclear
           migration.
          Length = 166

 Score = 29.8 bits (66), Expect = 5.1
 Identities = 30/128 (23%), Positives = 46/128 (35%), Gaps = 20/128 (15%)

Query: 457 NRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRAD 516
           N+  ++   L+ +  EKE L     R++D                 EL D   EL  +  
Sbjct: 10  NQALERNALLEMEYDEKESLRIENQRLRD-----------------ELRDLKQELIVQER 52

Query: 517 FEQDERK-RLENLINSGSVPDDTKAVGLSRSGGREEPSSP-RKPEAPSSCIPPPPPPASL 574
              + RK R   ++N GS P  T     S +  R  P+     P  P + +      A+ 
Sbjct: 53  LRNNNRKSRPAPVVNLGSSP-STPHTNSSMNSPRSPPNGTVSSPLTPPTKLSLTLASATA 111

Query: 575 PPPPPPRR 582
             P PP  
Sbjct: 112 TDPAPPMS 119


>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
          Length = 306

 Score = 30.4 bits (68), Expect = 5.1
 Identities = 10/30 (33%), Positives = 13/30 (43%), Gaps = 3/30 (10%)

Query: 554 SPRKPEAPSSCIPPPP---PPASLPPPPPP 580
             + P  P+   PP      P +L  PPPP
Sbjct: 70  VQQAPAQPAPAAPPAAGAALPEALEVPPPP 99


>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein.  The Bacillus subtilis divIVA1
           mutation causes misplacement of the septum during cell
           division, resulting in the formation of small, circular,
           anucleate mini-cells. Inactivation of divIVA produces a
           mini-cell phenotype, whereas overproduction of DivIVA
           results in a filamentation phenotype. These proteins
           appear to contain coiled-coils.
          Length = 131

 Score = 29.2 bits (66), Expect = 5.3
 Identities = 20/86 (23%), Positives = 40/86 (46%), Gaps = 8/86 (9%)

Query: 451 ENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE-TAGHLEAKQKVAELEDRIS 509
           EN +L   + + E++L++  + +E L ++L   ++  E+       EA+  + E E +  
Sbjct: 40  ENEELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVKANAQKEAELIIKEAEAKAE 99

Query: 510 ELKHRAD-------FEQDERKRLENL 528
            + + A+        E DE KR   L
Sbjct: 100 RIVNDANEEAKKLATEYDELKRQSRL 125


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
           synthetase class II domain (pfam00587) and represents
           the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 28.7 bits (65), Expect = 5.3
 Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 11/81 (13%)

Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
           LL  +EE    + + +EL+ E     N L+K+  +  +K ++ E L   +  +KD L+  
Sbjct: 31  LLELDEERRELQVELEELQAE----RNELSKEIGKAKKKKEDAEALIAEVKELKDELK-- 84

Query: 491 TAGHLEAKQKVAELEDRISEL 511
                  + ++ ELE  + +L
Sbjct: 85  -----ALEAELRELEAELDKL 100


>gnl|CDD|225986 COG3455, COG3455, Type VI protein secretion system component VasF
           [Intracellular    trafficking, secretion, and vesicular
           transport].
          Length = 262

 Score = 30.0 bits (68), Expect = 5.4
 Identities = 12/37 (32%), Positives = 17/37 (45%)

Query: 349 HIETKSATSMFNILRRKLGHTAAYPHFLSLLHHCILL 385
           H E      +F +L R L   A Y   L L++ C+ L
Sbjct: 128 HNEAWGGEKVFELLERLLREPARYRDLLELIYLCLSL 164


>gnl|CDD|215918 pfam00435, Spectrin, Spectrin repeat.  Spectrin repeat-domains are
           found in several proteins involved in cytoskeletal
           structure. These include spectrin, alpha-actinin and
           dystrophin. The sequence repeat used in this family is
           taken from the structural repeat in reference. The
           spectrin domain- repeat forms a three helix bundle. The
           second helix is interrupted by proline in some
           sequences. The repeats are defined by a characteristic
           tryptophan (W) residue at position 17 in helix A and a
           leucine (L) at 2 residues from the carboxyl end of helix
           C. Although the domain occurs in ultiple repeats along
           sequences, the domains are actually stable on their own
           - ie they act, biophysically, like domains rather than
           repeats that along function when aggregated.
          Length = 105

 Score = 28.8 bits (65), Expect = 5.5
 Identities = 21/102 (20%), Positives = 44/102 (43%), Gaps = 19/102 (18%)

Query: 442 KKKADEL-----EMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK------- 489
            + AD+L     E E +  S    K  + +    ++ + L+  L+  +DR+E        
Sbjct: 7   FRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALNELAEK 66

Query: 490 -ETAGHLEAKQ---KVAELEDRISELKHRADFEQDERKRLEN 527
               GH  +++   ++ EL +R  +L   A    + +++LE 
Sbjct: 67  LIAEGHYASEEIQERLEELNERWEQLLELA---AERKQKLEE 105


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 30.4 bits (69), Expect = 5.6
 Identities = 12/36 (33%), Positives = 13/36 (36%), Gaps = 3/36 (8%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLP 587
           P + R P  P      PPPP   PPPP         
Sbjct: 253 PRATRLPAPPQPPEEEPPPP---PPPPEDDDDPPED 285



 Score = 30.0 bits (68), Expect = 9.3
 Identities = 10/36 (27%), Positives = 13/36 (36%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLP 587
           P+ P+ PE      PPPP     PP     +     
Sbjct: 259 PAPPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAED 294


>gnl|CDD|221459 pfam12200, DUF3597, Domain of unknown function (DUF3597).  This
           family of proteins is found in bacteria, eukaryotes and
           viruses. Proteins in this family are typically between
           126 and 281 amino acids in length. The function of this
           domain is unknown. The structure of this domain has been
           found to contain five helices with a long flexible loop
           between helices one and two.
          Length = 124

 Score = 28.8 bits (65), Expect = 5.6
 Identities = 10/40 (25%), Positives = 15/40 (37%), Gaps = 1/40 (2%)

Query: 547 GGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDL 586
            G    ++P  P A  +  P     A+    PP   S D+
Sbjct: 10  FGEAAAAAP-APAAAPATAPAAAAAAAPAATPPAAASVDV 48


>gnl|CDD|227751 COG5464, COG5464, Uncharacterized conserved protein [Function
           unknown].
          Length = 289

 Score = 30.1 bits (68), Expect = 5.6
 Identities = 33/169 (19%), Positives = 59/169 (34%), Gaps = 21/169 (12%)

Query: 366 LGHTAAYPHFLSLLHHCILLPLEFGCHPQHWLLLDRLAQQLTLQTETGEDYDHILLKINV 425
                 Y     L+   ++   E   H +  L L  L Q+   Q +     +  L  + V
Sbjct: 133 FDDRKLYSSAFPLVDLTVVPDDEIMQHRR--LALLELLQKHIRQRDLMGLVE-QLATLLV 189

Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
               +L   + EI           + N  L     ++ +   +  +E     E+L  I  
Sbjct: 190 YG--YLNDSQLEI-----------LFNYGLQTDTLRETRFYQEIAEEGPQHGEALMTIAG 236

Query: 486 RLEKETAGH---LEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
           RLE+E  G    LE    + EL  RI+++      +++   R+  L   
Sbjct: 237 RLEQE--GKQEGLEIVPGLLELGLRIAQIMLELGLDRELVLRVTGLSEE 283


>gnl|CDD|217197 pfam02711, Pap_E4, E4 protein.  This is is a family of
           Papillomavirus proteins, E4, coded for by ORF4. A splice
           variant, E1--E4, exists but neither the function of E4
           or E1--E4 is known.
          Length = 95

 Score = 28.4 bits (64), Expect = 5.6
 Identities = 13/50 (26%), Positives = 16/50 (32%), Gaps = 11/50 (22%)

Query: 563 SCIPPPPPPASLPPPPPPRR-----------SSDLPYLTCPRPLVRCRQP 601
           +   PP PP   PP  P +R           SS     + P P      P
Sbjct: 18  TYTTPPRPPPPPPPWAPKKRCRRRLESDSDSSSTTSPSSPPTPQPTTECP 67


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 30.4 bits (68), Expect = 5.7
 Identities = 22/108 (20%), Positives = 49/108 (45%), Gaps = 4/108 (3%)

Query: 423 INVKKVVHLLAKEEE--IVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
           I+ KKVV  L ++ E  +   +   D  E E+ + + R  + ++ELD+K  + +   +  
Sbjct: 177 ISDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKA 236

Query: 481 SRIKDRLEKETAGHLEAKQKVAELED--RISELKHRADFEQDERKRLE 526
              +D  +K+     + +Q+   L      S  K      +++++ +E
Sbjct: 237 DFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIE 284


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 30.7 bits (69), Expect = 5.8
 Identities = 26/83 (31%), Positives = 37/83 (44%), Gaps = 13/83 (15%)

Query: 440 AAKKKADELEMENVDLSNRLAKKEQELDQKT----------QEKEDLDESLSRIKDRLEK 489
           AA +K  E E    +L  +L + ++EL +            QEK  +   LS  K R E 
Sbjct: 75  AATEKTKEWEKVKAELEKKLLELQKELARAAAENDALSRSLQEKSKMIMELSEEKSRAES 134

Query: 490 ETAGHLEAKQKVAELEDRISELK 512
           E    L+A+ + AE E   S LK
Sbjct: 135 EIE-DLKARLESAEKE--NSSLK 154



 Score = 30.3 bits (68), Expect = 6.9
 Identities = 31/112 (27%), Positives = 50/112 (44%), Gaps = 9/112 (8%)

Query: 417 DHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKE--QELDQKTQEKE 474
           + ++ +    + V + A E +      K+ E E+ + D           +EL+    EKE
Sbjct: 545 EWVVNQEKSLQDVSVEASEIKKNFLGLKSSEKEINSPDEVKGAVCISTLEELETLKSEKE 604

Query: 475 DLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
           +LD  LS+ KD LE       E+K K+ E E ++ ELK   D  Q+     E
Sbjct: 605 NLDGELSKCKDDLE-------ESKNKLQETEKKLEELKSELDASQESNSLAE 649


>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
           family). 
          Length = 619

 Score = 30.6 bits (69), Expect = 6.0
 Identities = 22/104 (21%), Positives = 41/104 (39%), Gaps = 9/104 (8%)

Query: 430 HLLAKEEEIVAAKKKADELE------MENVDLSNRLAKKEQELDQKTQEKEDLD---ESL 480
            L    EE V   ++ D L+         +         EQ++    +   D     E++
Sbjct: 150 RLGELREEKVRRLEEVDSLKQSIKSLCSLLGTPPARTDFEQDVLSYGEIPNDHCVTRETI 209

Query: 481 SRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKR 524
            ++   LE   A   +   K+ +L ++I EL +R     +E+KR
Sbjct: 210 DKLDKMLENLAAQKKQRADKIDDLREQIQELWNRLQISDEEQKR 253


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 30.6 bits (69), Expect = 6.1
 Identities = 8/44 (18%), Positives = 14/44 (31%), Gaps = 3/44 (6%)

Query: 544 SRSGGREEPSSPRKPEAPSSCIPPPPPPASLPP---PPPPRRSS 584
           +R  G     +P    AP++   P               P R++
Sbjct: 441 ARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAA 484


>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN).  This family
           consists of several eukaryotic survival motor neuron
           (SMN) proteins. The Survival of Motor Neurons (SMN)
           protein, the product of the spinal muscular
           atrophy-determining gene, is part of a large
           macromolecular complex (SMN complex) that functions in
           the assembly of spliceosomal small nuclear
           ribonucleoproteins (snRNPs). The SMN complex functions
           as a specificity factor essential for the efficient
           assembly of Sm proteins on U snRNAs and likely protects
           cells from illicit, and potentially deleterious,
           non-specific binding of Sm proteins to RNAs.
          Length = 264

 Score = 30.0 bits (67), Expect = 6.2
 Identities = 24/116 (20%), Positives = 28/116 (24%), Gaps = 16/116 (13%)

Query: 474 EDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGS 533
              D     +K+    ET       +   +  DR S                        
Sbjct: 121 PPPDVDEDALKEANVNET-------ESSTDESDRSSHSHEVRSKSNFPMGPPSPWNPRFP 173

Query: 534 VPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPP--PPPPRRSSDLP 587
                   G  R G       P       S  PPP P    PP  PPPP  S D  
Sbjct: 174 PGPPPPPPGFGRHG-----EKPSGWPPFLSGWPPPFPLG--PPMIPPPPPMSPDFG 222


>gnl|CDD|214775 smart00689, DM6, Cysteine-rich domain currently specific to
           Drosophila. 
          Length = 157

 Score = 29.4 bits (66), Expect = 6.2
 Identities = 18/68 (26%), Positives = 21/68 (30%), Gaps = 11/68 (16%)

Query: 550 EEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLT---CPRP-LVRCRQPPRSK 605
           E P    KP+    C P   P       P PRR    P      CP    + C+      
Sbjct: 35  ECPPIMIKPKR--ICCPDKEPR-----IPRPRRRRRKPRCPSAACPTGNPMPCKPEAGKS 87

Query: 606 LKSRIPRP 613
              RI  P
Sbjct: 88  PCPRIRMP 95



 Score = 29.0 bits (65), Expect = 7.4
 Identities = 17/70 (24%), Positives = 21/70 (30%), Gaps = 10/70 (14%)

Query: 549 REEPSSPRKPEAPSSCIPPP-PPPASLPPPPPPRRSSDLPYLTCPR-PLVRCRQ---PPR 603
            +EP  PR          P    P   P P  P          CPR  +  C+    PP 
Sbjct: 50  DKEPRIPRPRRRRRKPRCPSAACPTGNPMPCKPEAGKSP----CPRIRMPCCKPARRPPS 105

Query: 604 SKLKSRIPRP 613
            +   R P  
Sbjct: 106 CR-AVRRPSD 114


>gnl|CDD|215379 PLN02708, PLN02708, Probable pectinesterase/pectinesterase
           inhibitor.
          Length = 553

 Score = 30.2 bits (68), Expect = 6.2
 Identities = 9/24 (37%), Positives = 10/24 (41%)

Query: 553 SSPRKPEAPSSCIPPPPPPASLPP 576
           S+        S  PPPP   S PP
Sbjct: 22  SNRHHHHHTPSPSPPPPSSPSTPP 45


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 29.6 bits (67), Expect = 6.3
 Identities = 22/97 (22%), Positives = 40/97 (41%), Gaps = 18/97 (18%)

Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
           EE      ++A++ E E  +      + E E +Q  QE  +L+  L  +KD+        
Sbjct: 9   EEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQ--QEIAELEAQLEELKDKY------- 59

Query: 495 LEAKQKVAELEDRISELKHRADFEQDE--RKRLENLI 529
               +  AE E     L+ R + E++E  +  +E   
Sbjct: 60  ---LRAQAEFE----NLRKRTEREREEAKKYAIEKFA 89


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 30.6 bits (69), Expect = 6.3
 Identities = 12/59 (20%), Positives = 20/59 (33%), Gaps = 4/59 (6%)

Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPP--PRRSSDLPYLTCPR--PLVRCRQPP 602
           G    ++ +   AP++   P  P  ++  P P     + D P     R       R  P
Sbjct: 568 GARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQAAESRGAP 626


>gnl|CDD|216840 pfam02005, TRM, N2,N2-dimethylguanosine tRNA methyltransferase.
           This enzyme EC:2.1.1.32 used S-AdoMet to methylate tRNA.
           The TRM1 gene of Saccharomyces cerevisiae is necessary
           for the N2,N2-dimethylguanosine modification of both
           mitochondrial and cytoplasmic tRNAs. The enzyme is found
           in both eukaryotes and archaebacteria.
          Length = 375

 Score = 30.0 bits (68), Expect = 6.7
 Identities = 13/58 (22%), Positives = 26/58 (44%), Gaps = 8/58 (13%)

Query: 760 YEQRLRCLHYKKRFHF----WISDI--EPKIKSVMEA--SREVTRSRRLRKLLELVLA 809
            + +  C H   RFH     W+  +  +  ++ ++E   S     S+R+  LL+L+  
Sbjct: 255 AKFKAECPHCGGRFHLAGPLWLGPLHDKEFVEEMLEIAESPPEEFSKRILGLLKLIKE 312


>gnl|CDD|221130 pfam11527, ARL2_Bind_BART, The ARF-like 2 binding protein BART.
           BART binds specifically to ARL2.GTP with a high affinity
           however it does not bind to ARL2.GDP. It is thought that
           this specific interaction is due to BART being the first
           identified ARL2-specific effector. The function is not
           completely characterized. BART is predominantly
           cytosolic but can also be found to be associated with
           mitochondria. BART is also involved in binding to the
           adenine nucleotide transporter ANT1.
          Length = 120

 Score = 28.8 bits (65), Expect = 6.7
 Identities = 15/43 (34%), Positives = 21/43 (48%), Gaps = 3/43 (6%)

Query: 688 CTILLSKLKM---TDEEISKAILSMDSADQLPLDMLEQLLKFT 727
             +L S LK    + E+  +A LS  S +Q    + EQLL  T
Sbjct: 58  EKLLESFLKEVGISPEQFEEACLSQLSKNQQAKVIFEQLLAAT 100


>gnl|CDD|220610 pfam10165, Ric8, Guanine nucleotide exchange factor synembryn.
           Ric8 is involved in the EGL-30 neurotransmitter
           signalling pathway. It is a guanine nucleotide exchange
           factor that regulates neurotransmitter secretion.
          Length = 439

 Score = 30.4 bits (69), Expect = 6.7
 Identities = 18/86 (20%), Positives = 33/86 (38%), Gaps = 13/86 (15%)

Query: 309 QPIIDKLRTHEN--ETLDRHLDFFEMLRHEDESELARKYETDHI-ETKSATSMFNILRRK 365
           + +ID+          L+RHL        + +  LA   E   + ET     +FN+    
Sbjct: 110 KQLIDEHHGLSLLLAALERHLKQASK---DKQEPLAPPMELLALSETLKL--LFNVTFFY 164

Query: 366 LGH-----TAAYPHFLSLLHHCILLP 386
                   +A+ PH +++L     +P
Sbjct: 165 PRDVEEEFSASLPHIVAILRRLPTIP 190


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 30.2 bits (68), Expect = 6.9
 Identities = 19/51 (37%), Positives = 20/51 (39%), Gaps = 4/51 (7%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
           P + R PE      P PPPP   P PP P    D P  T P       Q P
Sbjct: 261 PRATRLPEPE----PQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIP 307


>gnl|CDD|222604 pfam14222, MOR2-PAG1_N, Cell morphogenesis N-terminal.  This family
           is the conserved N-terminal region of proteins that are
           involved in cell morphogenesis.
          Length = 538

 Score = 30.3 bits (69), Expect = 6.9
 Identities = 16/66 (24%), Positives = 19/66 (28%), Gaps = 11/66 (16%)

Query: 566 PPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPYFPNVSVQVNGT 625
           PPPPPP S P     R                        L   I + Y+P     + G 
Sbjct: 351 PPPPPPTSPPSSNNLRVKKTPSSEYLTNE---------VALSIGI-KEYYPEFCKAL-GK 399

Query: 626 IWTELD 631
           I    D
Sbjct: 400 ILILCD 405


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 30.0 bits (68), Expect = 7.1
 Identities = 18/68 (26%), Positives = 24/68 (35%), Gaps = 7/68 (10%)

Query: 553 SSPRKPEAPSSCIPPPPPPASLPPP---PPPRRSSDLPYLTCPRPL----VRCRQPPRSK 605
            S R     S   P PP  +   P    PPP   S  P     RPL     R R+  + +
Sbjct: 306 VSDRPRHLSSDSSPSPPDTSDSDPSTETPPPASLSHSPPAAFERPLALSPKRKREGDKKQ 365

Query: 606 LKSRIPRP 613
            K +  + 
Sbjct: 366 KKKKSKKL 373


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 30.2 bits (68), Expect = 7.1
 Identities = 21/65 (32%), Positives = 27/65 (41%), Gaps = 13/65 (20%)

Query: 961  DFFVI---------FDTFLT--TLNEA-KLDNENMKRRREEEEKRARQDAEMKKRTTDRK 1008
            DF V          F+ FL      E  KL  E  +R R+ EE+R R++ E      DR 
Sbjct: 232  DFCVEIEGGMDEHSFEDFLLEEKRRELEKLAKEEAERERQAEEQR-RREEEKAAMEADRA 290

Query: 1009 QGAKE 1013
            Q   E
Sbjct: 291  QAKAE 295


>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
          Length = 540

 Score = 30.3 bits (68), Expect = 7.1
 Identities = 9/30 (30%), Positives = 10/30 (33%)

Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPR 581
              P   E   +  P  P P   PPP P  
Sbjct: 172 TPPPDDDEGDEAGAPATPAPPLHPPPAPHP 201


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 30.0 bits (68), Expect = 7.2
 Identities = 23/93 (24%), Positives = 48/93 (51%), Gaps = 7/93 (7%)

Query: 436 EEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHL 495
           EE    +K+ADEL  E V+LS ++ +  +E      E  +L++ +  ++ + +       
Sbjct: 200 EEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRAKEKA-----A 254

Query: 496 EAKQKVAELEDRISELKHRADFEQDERKRLENL 528
           + ++K  EL++R  E+  +  F++ E+   E L
Sbjct: 255 KRREKREELKERAEEIYEK--FKRGEKLTTEEL 285


>gnl|CDD|177618 PHA03381, PHA03381, tegument protein VP22; Provisional.
          Length = 290

 Score = 30.0 bits (67), Expect = 7.2
 Identities = 14/60 (23%), Positives = 18/60 (30%), Gaps = 4/60 (6%)

Query: 550 EEPSSPRKPEAPSSC--IPPPPPPASLPPPPPPRRSSDLPYLTCPR--PLVRCRQPPRSK 605
           E P+ PR    P +      P P      P   R          PR  P  +    PR +
Sbjct: 82  ERPADPRPSRRPHAQPEASGPGPARGARGPAGSRGRGRRAESPSPRDPPNPKGASAPRGR 141


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 30.0 bits (68), Expect = 7.3
 Identities = 17/81 (20%), Positives = 40/81 (49%), Gaps = 3/81 (3%)

Query: 432 LAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKET 491
           + K +E + A K   +   + + L   ++  +++L +   E+++  E L       +KE 
Sbjct: 279 MEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKL-KSKFERDN--EKLDAEVKEKKKEK 335

Query: 492 AGHLEAKQKVAELEDRISELK 512
               + K+++  LE+RI +L+
Sbjct: 336 KKEEKKKKQIERLEERIEKLE 356


>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
           Provisional.
          Length = 693

 Score = 30.0 bits (69), Expect = 7.9
 Identities = 15/84 (17%), Positives = 40/84 (47%), Gaps = 5/84 (5%)

Query: 436 EEIV--AAKKKADELEMENVDLSNRLAKK-EQELDQ--KTQEKEDLDESLSRIKDRLEKE 490
           EE+   A  K   +    + +L   + +  E +L +  +  +K++ + +L  IK+ + + 
Sbjct: 220 EELAAEAGPKWEWQPPEVDEELKAAVKELAEAKLKEAYQITDKQEREAALDAIKEEVLEA 279

Query: 491 TAGHLEAKQKVAELEDRISELKHR 514
            A   E ++   E+++   +L+ +
Sbjct: 280 LAAEEEEEEDEKEIKEAFKKLEKK 303


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 29.9 bits (67), Expect = 8.0
 Identities = 21/84 (25%), Positives = 35/84 (41%), Gaps = 2/84 (2%)

Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
             EEE  AAK KA            +  +K +  ++ T+E+E+ D+  ++ K     +  
Sbjct: 119 VTEEEKAAAKAKAAAAAKAKAAALAK--QKREGTEEVTEEEEETDKEKAKAKAAAAAKAK 176

Query: 493 GHLEAKQKVAELEDRISELKHRAD 516
               AKQK AE  +   E+     
Sbjct: 177 AAALAKQKAAEAGEGTEEVTEEEK 200


>gnl|CDD|226447 COG3937, COG3937, Uncharacterized conserved protein [Function
           unknown].
          Length = 108

 Score = 28.2 bits (63), Expect = 8.2
 Identities = 20/87 (22%), Positives = 45/87 (51%), Gaps = 7/87 (8%)

Query: 441 AKKKADELEMENVDLSNRLAKK--EQELDQKTQEKEDLDESLSR-IKDRLEKETAGHLEA 497
            +K  DEL ++  +L+   AK+  +  L Q  + + +L+E + R I++ L       +  
Sbjct: 26  VQKLVDEL-VKKGELNAEEAKRFVDDLLRQAKEAQGELEEKIPRKIEEMLSDLE---VAR 81

Query: 498 KQKVAELEDRISELKHRADFEQDERKR 524
           + ++ EL +R+  L+ +    +++ KR
Sbjct: 82  QSEMDELTERVDALERQVADLENKLKR 108


>gnl|CDD|182002 PRK09627, oorA, 2-oxoglutarate-acceptor oxidoreductase subunit
           OorA; Reviewed.
          Length = 375

 Score = 29.7 bits (67), Expect = 8.2
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQD 520
           ET GH+  K  + +LE+    + +R +F+ D
Sbjct: 170 ETVGHMYGKAVIPDLEEVQKMIINRKEFDGD 200


>gnl|CDD|227556 COG5231, VMA13, Vacuolar H+-ATPase V1 sector, subunit H [Energy
           production and conversion].
          Length = 432

 Score = 29.9 bits (67), Expect = 8.3
 Identities = 31/132 (23%), Positives = 52/132 (39%), Gaps = 11/132 (8%)

Query: 405 QLTLQTETGEDYDHILLKIN-VKKVVHLLAKEEE-------IVAAKKKADELEMENVDLS 456
            LT   E  +D D +   IN +  +V   AKE+        +     K+ +  + +  L 
Sbjct: 219 ILTFSKECAQDIDKMDDLINDLIAIVKERAKEKVLRLCCGIVANVLDKSPKGYIFSPLLL 278

Query: 457 NRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRI---SELKH 513
           N ++K  Q L ++    E+L   + RI+ RL + T         + EL+      S   H
Sbjct: 279 NDISKCVQVLLERKYSDEELVIDIERIRSRLVQNTKKLCIFDNYLNELDSGRLEWSPYHH 338

Query: 514 RADFEQDERKRL 525
           + DF       L
Sbjct: 339 KKDFWSTNLDML 350


>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
           (DUF2360).  This is the conserved 140 amino acid region
           of a family of proteins conserved from nematodes to
           humans. One C. elegans member is annotated as a
           Daf-16-dependent longevity protein 1 but this could not
           be confirmed. The function is unknown.
          Length = 147

 Score = 28.9 bits (65), Expect = 8.3
 Identities = 20/88 (22%), Positives = 28/88 (31%), Gaps = 15/88 (17%)

Query: 542 GLSRSGGREEPSSPRKPEAPSS-CIPPPPPPASLPPPPPPRRSS--DLPYLTCPRPL--- 595
           GL     +  P  P    A ++   PPPPP  +    PPP  +            P    
Sbjct: 49  GLEDVTVQTTPPPP--ASAITNGGPPPPPPARAEAASPPPPEAPAEPPAEPEPEAPAENT 106

Query: 596 VRCRQPPRSKLKSRIPRPYFPNVSVQVN 623
           V   + PR          YF  + + V 
Sbjct: 107 VTVAKDPRY-------AKYFKMLKLGVP 127


>gnl|CDD|224644 COG1730, GIM5, Predicted prefoldin, molecular chaperone implicated
           in de novo protein folding [Posttranslational
           modification, protein turnover, chaperones].
          Length = 145

 Score = 28.9 bits (65), Expect = 8.4
 Identities = 12/39 (30%), Positives = 23/39 (58%)

Query: 473 KEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISEL 511
           ++  DE++  +K R+E+      + +Q +AEL  RI +L
Sbjct: 89  EKSADEAIEFLKKRIEELEKAIEKLQQALAELAQRIEQL 127


>gnl|CDD|220261 pfam09484, Cas_TM1802, CRISPR-associated protein TM1802
           (cas_TM1802).  Clusters of short DNA repeats with
           non-homologous spacers, which are found at regular
           intervals in the genomes of phylogenetically distinct
           prokaryotic species, comprise a family with recognisable
           features. This family is known as CRISPR (short for
           Clustered, Regularly Interspaced Short Palindromic
           Repeats). A number of protein families appear only in
           association with these repeats and are designated Cas
           (CRISPR-Associated) proteins. This minor cas protein is
           found in at least five prokaryotic genomes:
           Methanosarcina mazei, Sulfurihydrogenibium azorense,
           Thermotoga maritima, Carboxydothermus hydrogenoformans,
           and Dictyoglomus thermophilum, the first of which is
           archaeal while the rest are bacterial.
          Length = 584

 Score = 30.0 bits (68), Expect = 8.4
 Identities = 10/44 (22%), Positives = 22/44 (50%)

Query: 473 KEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRAD 516
           K++L++ L  I + L    +    +K  +A LE+ I ++    +
Sbjct: 272 KKNLEKELEEILESLNDIKSLSKGSKNSLAGLEEEILDILSDEN 315


>gnl|CDD|220716 pfam10359, Fmp27_WPPW, RNA pol II promoter Fmp27 protein domain.
           Fmp27_WPPW is a conserved domain of a family of proteins
           involved in RNA polymerase II transcription initiation.
           It contains characteristic HQR and WPPW sequence motifs.
           and is towards the C-terminal in members which contain
           Fmp27_SW pfam10305.
          Length = 470

 Score = 30.0 bits (68), Expect = 8.4
 Identities = 17/69 (24%), Positives = 33/69 (47%), Gaps = 1/69 (1%)

Query: 473 KEDLDESLSRIKDRLEK-ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
            + L+E + +++++L+  E     E +  + ELE+ +S LK R +F +   + LE    S
Sbjct: 170 LKQLEEQIKKLEEKLDDLELNDTEELQSDLEELEEELSVLKERLEFLEKLLEDLERSEES 229

Query: 532 GSVPDDTKA 540
                 T  
Sbjct: 230 SDRSSSTDT 238


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 30.1 bits (69), Expect = 8.5
 Identities = 20/80 (25%), Positives = 37/80 (46%), Gaps = 1/80 (1%)

Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEK-EDLDESLSRIKDRLEKET 491
           A     +      ++ E E  +  + LA  E EL +K  EK + L +   +++   EK+ 
Sbjct: 183 AHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKV 242

Query: 492 AGHLEAKQKVAELEDRISEL 511
            G L   +K A+L +++ E 
Sbjct: 243 EGRLAQHKKYAKLREKLKEE 262


>gnl|CDD|235309 PRK04596, minC, septum formation inhibitor; Reviewed.
          Length = 248

 Score = 29.5 bits (66), Expect = 8.5
 Identities = 13/27 (48%), Positives = 13/27 (48%), Gaps = 6/27 (22%)

Query: 568 PPPPASLPPPPPPRRSSDLPYLTCPRP 594
           PPPP   PPPPP R     P     RP
Sbjct: 119 PPPP---PPPPPARAEPAPP---VARP 139


>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
           binding.  This is the C-terminal conserved coiled coil
           region of a family of TATA element modulatory factor 1
           proteins conserved in eukaryotes. The proteins bind to
           the TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMF1_TATA_bd is the most conserved
           part of the TMFs. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells. The Rab6-binding
           domain appears to be the same region as this C-terminal
           family.
          Length = 121

 Score = 28.3 bits (64), Expect = 8.5
 Identities = 14/58 (24%), Positives = 30/58 (51%), Gaps = 4/58 (6%)

Query: 459 LAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRAD 516
           LA  + EL +   E+++  + + +    L +E     E K+++ ELE  + +L+ R +
Sbjct: 33  LASLKDELARLEAERDEARQEIVK----LTEENEELRELKKEIEELEKELEDLEQRYE 86


>gnl|CDD|236012 PRK07406, PRK07406, RNA polymerase sigma factor RpoD; Validated.
          Length = 373

 Score = 29.8 bits (67), Expect = 8.5
 Identities = 22/63 (34%), Positives = 34/63 (53%), Gaps = 10/63 (15%)

Query: 429 VHLLAKEEEIVAAKKKADELEMENVDLSNRLAKK-EQELDQKTQEKE---DLDESLSRIK 484
           + LL  +EEI  A+K AD LE+E       L ++ E EL ++  +KE    +D  L + +
Sbjct: 75  IRLLRPDEEIELARKIADLLELE------ELREQFESELGREPSDKEWAELVDMPLPKFR 128

Query: 485 DRL 487
            RL
Sbjct: 129 RRL 131


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 30.0 bits (68), Expect = 8.7
 Identities = 28/133 (21%), Positives = 53/133 (39%), Gaps = 34/133 (25%)

Query: 426 KKVVHLLAKEEEIVAAKKKAD----ELEMENVDLSNRLAK----------------KEQE 465
            K   L+ + + I A  ++       L M+  D S  L K                K  +
Sbjct: 220 NKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIK 279

Query: 466 LDQK-------TQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFE 518
           + +K       TQ+  +  + +++IKD+L+       E +  + +L+  I EL+   D  
Sbjct: 280 MYEKGGVCPTCTQQISEGPDRITKIKDKLK-------ELQHSLEKLDTAIDELEEIMDEF 332

Query: 519 QDERKRLENLINS 531
            ++ K+L  L N 
Sbjct: 333 NEQSKKLLELKNK 345


>gnl|CDD|133382 cd01107, HTH_BmrR, Helix-Turn-Helix DNA binding domain of the BmrR
           transcription regulator.  Helix-turn-helix (HTH)
           multidrug-efflux transporter transcription regulator,
           BmrR and YdfL of Bacillus subtilis, and related
           proteins; N-terminal domain. Bmr is a membrane protein
           which causes the efflux of a variety of toxic substances
           and antibiotics. BmrR is comprised of two distinct
           domains that harbor a regulatory (effector-binding) site
           and an active (DNA-binding) site. The conserved
           N-terminal domain contains a winged HTH motif  that
           mediates DNA binding, while the C-terminal domain binds
           coactivating, toxic compounds. BmrR shares the
           N-terminal DNA binding domain with other transcription
           regulators of the MerR superfamily that promote
           transcription by reconfiguring the spacer between the
           -35 and -10 promoter elements.
          Length = 108

 Score = 28.3 bits (64), Expect = 8.8
 Identities = 18/58 (31%), Positives = 27/58 (46%), Gaps = 9/58 (15%)

Query: 475 DLDESLSRIKDRLEKETAGHLEA--KQKVAELEDRISELKHRADFEQDERKRLENLIN 530
           DL   L  IK+ L+ +    L    ++K+AELE  I EL       Q   + LE+ + 
Sbjct: 56  DLGFPLEEIKEILDADNDDELRKLLREKLAELEAEIEEL-------QRILRLLEDRLK 106


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 29.1 bits (65), Expect = 9.1
 Identities = 13/45 (28%), Positives = 16/45 (35%)

Query: 561 PSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSK 605
              C P  P P+S PPP PP               +  + PP  K
Sbjct: 49  YPFCPPTTPHPSSQPPPCPPSPGHPPQTNDTHEKDLALQPPPGGK 93


>gnl|CDD|182123 PRK09866, PRK09866, hypothetical protein; Provisional.
          Length = 741

 Score = 29.8 bits (67), Expect = 9.4
 Identities = 11/43 (25%), Positives = 22/43 (51%)

Query: 873 EASKVSMKELENDMTQLRNNLKEVSREIEFQRVQPLVPGDRFL 915
           E  + ++ ++E  +  L+ N  +VS EI+ +    L   + FL
Sbjct: 449 EQLRQNIHQVEESLQLLQLNQAQVSGEIKHEIELALTSANHFL 491


>gnl|CDD|200432 TIGR04181, NHT_00031, aminotransferase, LLPSF_NHT_00031 family.
           This clade of aminotransferases is a member of the
           pfam01041 (DegT/DnrJ/EryC1/StrS) superfamily. The family
           is named after the instance in Leptospira interrogans
           serovar Lai, str. 56601, where it is the 31st gene in
           the 91-gene lipopolysaccharide biosynthesis locus.
           Members of this family are generally found within a
           subcluster of seven or more genes including an
           epimerase/dehydratase, four genes homologous to the
           elements of the neuraminic (sialic) acid biosynthesis
           cluster (NeuABCD) and a nucleotidyl transferase.
           Together it is very likely that these enzymes direct the
           biosynthesis of a nine-carbon sugar analogous to
           CMP-neuraminic acid. These seven genes form the core of
           the cassette, although they are often accompanied by
           additional genes that may further modify the product
           sugar.
          Length = 359

 Score = 29.4 bits (67), Expect = 9.5
 Identities = 15/56 (26%), Positives = 18/56 (32%), Gaps = 3/56 (5%)

Query: 360 NILRRKLGHTAAYPHFLSLLHHCILLPLEFGCHPQHWL---LLDRLAQQLTLQTET 412
             L RK      Y  F S +     LP   G    +WL   LLD    +  L    
Sbjct: 260 EFLARKRELAEIYKEFFSGIPGVEFLPEPAGARSNYWLNALLLDSKLDRDELLEAL 315


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 29.6 bits (67), Expect = 9.6
 Identities = 28/98 (28%), Positives = 46/98 (46%), Gaps = 15/98 (15%)

Query: 437 EIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRL-EKETAGHL 495
           E ++  K + E  ++ +    RL ++E+EL +K  EK    E       RL E E     
Sbjct: 290 EKLSGLKNSGEPSLKEIH-EARLNEEERELKKKFTEKIREKE------KRLEELEQNLIE 342

Query: 496 EAKQKVAELEDRISELKHRADFEQDERKRLENLINSGS 533
           E K+  ++LE+   +L+       D  KRLE L ++ S
Sbjct: 343 ERKELNSKLEEIQKKLE-------DLEKRLEKLKSNKS 373


>gnl|CDD|151656 pfam11214, Med2, Mediator complex subunit 2.  This family of mediator
            complex subunit 2 proteins is conserved in fungi.
            Cyclin-dependent kinase CDK8 or Srb10 interacts with and
            phosphorylates Med2. Post-translational modifications of
            Mediator subunits are important for regulation of gene
            expression.
          Length = 99

 Score = 27.8 bits (62), Expect = 9.6
 Identities = 13/43 (30%), Positives = 25/43 (58%), Gaps = 3/43 (6%)

Query: 962  FFVIFDTFLTTLNEAKLDNENM---KRRREEEEKRARQDAEMK 1001
            F  I D   + LN+ K   + +   K+++EEEE++ +++ E K
Sbjct: 57   FHSILDDTESKLNDTKWYVDTLVENKKKQEEEERKKKEEEEKK 99


>gnl|CDD|148701 pfam07246, Phlebovirus_NSM, Phlebovirus nonstructural protein NS-M.
            This family consists of several Phlebovirus
           nonstructural NS-M proteins which represent the
           N-terminal region of the M polyprotein precursor. The
           function of this family is unknown.
          Length = 264

 Score = 29.5 bits (65), Expect = 9.7
 Identities = 19/89 (21%), Positives = 37/89 (41%), Gaps = 12/89 (13%)

Query: 455 LSNRLAKKEQELDQKTQEKED------LDESLSRIKDRLEKETAG------HLEAKQKVA 502
           L     +K+  + +KT+E E+          + + K+ ++K          H +AK +V 
Sbjct: 146 LKKNAEEKDATIQRKTKEMEEDSRNQIAHHEIQQKKNEIQKLRNDLKRGQEHRDAKLRVE 205

Query: 503 ELEDRISELKHRADFEQDERKRLENLINS 531
           E   R+  LK       DE +  ++ + S
Sbjct: 206 ESMARVKGLKEELKQLTDELQHAKSEMQS 234


>gnl|CDD|183876 PRK13169, PRK13169, DNA replication intiation control protein YabA;
           Reviewed.
          Length = 110

 Score = 27.9 bits (63), Expect = 9.8
 Identities = 25/88 (28%), Positives = 40/88 (45%), Gaps = 22/88 (25%)

Query: 399 LDRLAQQL-TLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSN 457
           LD L Q L  L  E G       LK  + +++      EE  A       L +EN  L  
Sbjct: 10  LDDLEQNLGVLLKELGA------LKKQLAELL------EENTA-------LRLENDKLRE 50

Query: 458 RLAKKEQELDQKTQEKEDLD--ESLSRI 483
           RL + E E   K ++K++ +  ++L+R+
Sbjct: 51  RLEELEAEEPAKEKKKKEGEGKDNLARL 78


>gnl|CDD|236839 PRK11086, PRK11086, sensory histidine kinase DcuS; Provisional.
          Length = 542

 Score = 29.9 bits (68), Expect = 9.8
 Identities = 23/72 (31%), Positives = 35/72 (48%), Gaps = 9/72 (12%)

Query: 118 DDTEIKSRTRILDSLRT---ALRTQPHSFVLRFIECEGLLTLLEYLKNLDHYSGQTSLH- 173
           D TE++   + LD +     ALR Q H F+ +     GLL L  Y   L+ Y  +T+ + 
Sbjct: 322 DKTEVRQLAQRLDGMVNYADALRAQSHEFMNKLHVILGLLHLKSY-DQLEDYILKTANNY 380

Query: 174 ----SSIIGCIK 181
                S++G IK
Sbjct: 381 QEEIGSLLGKIK 392


>gnl|CDD|238574 cd01169, HMPP_kinase, 4-amino-5-hydroxymethyl-2-methyl-pyrimidine
           phosphate kinase (HMPP-kinase) catalyzes two consecutive
           phosphorylation steps in the thiamine phosphate
           biosynthesis pathway, leading to the synthesis of
           vitamin B1. The first step is the phosphorylation of the
           hydroxyl group of HMP to form
           4-amino-5-hydroxymethyl-2-methyl-pyrimidine phosphate
           (HMP-P) and then the phophorylation of HMP-P to form
           4-amino-5-hydroxymethyl-2-methyl-pyrimidine
           pyrophosphate (HMP-PP), which is the substrate for the
           thiamine synthase coupling reaction.
          Length = 242

 Score = 29.4 bits (67), Expect = 9.8
 Identities = 17/54 (31%), Positives = 23/54 (42%), Gaps = 10/54 (18%)

Query: 707 LSMDSADQLPLDMLEQLLKF----TPNTEEAALL---EEHSTETLARADRFLYE 753
           L  D A +    + E LL      TPN  EA LL   E  + E + +A + L  
Sbjct: 112 LLDDDAIEA---LRELLLPLATLITPNLPEAELLTGLEIATEEDMMKAAKALLA 162


>gnl|CDD|191511 pfam06409, NPIP, Nuclear pore complex interacting protein (NPIP).
           This family consists of a series of primate specific
           nuclear pore complex interacting protein (NPIP)
           sequences. The function of this family is unknown but is
           well conserved from African apes to humans.
          Length = 350

 Score = 29.4 bits (65), Expect = 9.8
 Identities = 53/238 (22%), Positives = 85/238 (35%), Gaps = 32/238 (13%)

Query: 422 KINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLS 481
           KI   KV   +   ++I   +K A E          +L+ KE E  +K ++  + +E+  
Sbjct: 120 KICKMKVTTKINHHDKINGKRKTAKEH-------LRKLSMKECEHAEKERQVSEAEEN-- 170

Query: 482 RIKDRLE-KETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSV------ 534
               +L+ KE   + E  Q+   L  R +E  H+       RK L N +   +       
Sbjct: 171 ---GKLDMKEIHTYKEMFQRAQALRRR-AEDYHKCKIPPSARKPLCNRVRMAAAEHRHSS 226

Query: 535 -----PDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYL 589
                P  T     +R G +  P + +     +S     PP   L P PP    S    L
Sbjct: 227 GLPYWPYLTAETLKNRMGHQPPPPTQQHSIIDNSLSLKTPPECLLTPLPPSALPSADDNL 286

Query: 590 TCPRPLVRCRQPP-------RSKLKSRIPRPYFPNVSVQVNGTIWTELDESKLYASME 640
             P   +    PP          L  + P P  P+V   +   +   L  S L  S++
Sbjct: 287 KTPAECLLRPLPPSADDNIKECPLALKPPPPLPPSVDDNLKECLLAPLPPSALPPSVD 344


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.133    0.380 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 55,221,044
Number of extensions: 5637388
Number of successful extensions: 13914
Number of sequences better than 10.0: 1
Number of HSP's gapped: 11878
Number of HSP's successfully gapped: 843
Length of query: 1074
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 967
Effective length of database: 6,191,724
Effective search space: 5987397108
Effective search space used: 5987397108
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 64 (28.5 bits)