RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7893
         (691 letters)



>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
          Length = 444

 Score =  131 bits (330), Expect = 1e-32
 Identities = 103/433 (23%), Positives = 169/433 (39%), Gaps = 71/433 (16%)

Query: 29  PIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKERGKKDGETQ----VGNDISNIEAV- 83
            IV DNG+   + G+A  + P  +F +++ + R E    +   +    VGN+  N     
Sbjct: 8   TIVIDNGSGTTKAGFAGNDTPTTVFPSIVGRRRDEDSVMEDTEEKDTYVGNEAQNDRDNS 67

Query: 84  RFQLKTQFDKNVVTHYDIQEQIFDYAFS---HLSINTEGNVNHPIVLTEPFLNPNYSRSL 140
             +L+   +  ++ ++D  EQI+DY F     L  + E    HP++LTEP LNP  +R  
Sbjct: 68  LLELRYPIENGIILNWDAMEQIWDYTFFNKGDLLPSPEE---HPLLLTEPPLNPPSNREK 124

Query: 141 MSELLFECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQSGVIISCG 200
           ++ELLFE   VP++   I ++ S  Y                          +G++I  G
Sbjct: 125 ITELLFETLNVPALYLAIQAVLS-LYASGSSDE-------------------TGLVIDSG 164

Query: 201 YQCTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPSHINS------------I 248
              THVIPV++G +     KRID+GG  +  +L KLL+ KYP                 I
Sbjct: 165 DSVTHVIPVVDGIVLPKAVKRIDIGGRDITDYLKKLLREKYPPSRGYNLKSELVEYSSEI 224

Query: 249 TPSRSEELL---WDYGFVATDYREHLRKWLDAEFYDSNVVKVQL-PYAVPVPNLTTEQQK 304
                EE+     +  +V+ D  E   +  +     S     QL             ++ 
Sbjct: 225 VNEIKEEVCETDDESAYVSLDAEEEFEEEEEKPAEKSTESTFQLSKETSIAKE---SKEL 281

Query: 305 DRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINN 364
              +E+  +       K  E L   E  ++ L E  +I E      +   E   +   N+
Sbjct: 282 PDGEEI--EFGNEERFKAPEILFKPELPISGLEEAGKIDESKQELVAENYEISPTNLGND 339

Query: 365 IQDLNKSINQ-LQQKIEKTKAKIIAYNN-----GEDLTEEPKAKLSKEI----------- 407
           I  L + + Q +Q   E  +     Y+N     G         +L KE+           
Sbjct: 340 IAGLPELVYQSIQICDEDVRKS--LYSNIVLTGGTSKIPGFAERLQKELTSLAPSIWKVS 397

Query: 408 AVPESEAEFKAWL 420
            +P  +    AWL
Sbjct: 398 VIPPPDPSLDAWL 410



 Score = 76.7 bits (189), Expect = 1e-14
 Identities = 46/190 (24%), Positives = 82/190 (43%), Gaps = 2/190 (1%)

Query: 461 ELARKEKRDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQ 520
           E   +E+        +  + + K  +  A ++    + E +    E   +  PE     +
Sbjct: 248 EEFEEEEEKPAEKSTESTFQLSKETS-IAKESKELPDGEEIEFGNEERFKA-PEILFKPE 305

Query: 521 EQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQSLANN 580
                 +EA ++    + +        P+ LG+  AG+ E +   +    + + +SL +N
Sbjct: 306 LPISGLEEAGKIDESKQELVAENYEISPTNLGNDIAGLPELVYQSIQICDEDVRKSLYSN 365

Query: 581 IFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAWSGARQFSLSENFHDFA 640
           I +TG   ++PGF ERL K+L    P     SV    +P LDAW GA   +  E F    
Sbjct: 366 IVLTGGTSKIPGFAERLQKELTSLAPSIWKVSVIPPPDPSLDAWLGASILASLETFQQLW 425

Query: 641 VTQSDYQEKG 650
           +T+ +Y+E G
Sbjct: 426 ITKEEYEEHG 435


>gnl|CDD|214592 smart00268, ACTIN, Actin.  ACTIN subfamily of
           ACTIN/mreB/sugarkinase/Hsp70 superfamily.
          Length = 373

 Score =  123 bits (311), Expect = 1e-30
 Identities = 70/291 (24%), Positives = 125/291 (42%), Gaps = 60/291 (20%)

Query: 29  PIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKERGKK--DGETQVGNDISNIEAVRFQ 86
            IV DNG+   + G+A  + P ++F +++ +P+  +G      +  VG++         +
Sbjct: 3   AIVIDNGSGTIKAGFAGEDFPQVVFPSIVGRPKDGKGMVGDAKDIFVGDEAQEKRGG-LE 61

Query: 87  LKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSELL 145
           LK   +  +V ++D  E+I+DY F + L +  E    HP++LTEP +NP  +R  + E++
Sbjct: 62  LKYPIENGIVENWDDMEKIWDYTFFNELRVEPE---EHPVLLTEPPMNPKSNREKILEIM 118

Query: 146 FECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQSGVIISCGYQCTH 205
           FE +  P++   I ++ S   +G                        +G++I  G   TH
Sbjct: 119 FETFNFPALYIAIQAVLSLYASG----------------------RTTGLVIDSGDGVTH 156

Query: 206 VIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPSHINSITPSRSEELLWDYGFVAT 265
           V+PV++G +     KRID+ G  +  +L +LL  +     NS                  
Sbjct: 157 VVPVVDGYVLPHAIKRIDIAGRDITDYLKELLS-ERGYQFNSS----------------- 198

Query: 266 DYREHLRKWLDAEFYDSNVVKVQLPYAVPVPNLTTEQQKDRRKELAKKLVE 316
                      AEF     +K +L Y V   +   E +  R    + KL +
Sbjct: 199 -----------AEFEIVREIKEKLCY-VA-EDFEKEMKLARESSESSKLEK 236



 Score = 86.5 bits (215), Expect = 3e-18
 Identities = 39/146 (26%), Positives = 64/146 (43%)

Query: 505 EEILRQHDPEFTSLNQEQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNF 564
           E+ ++       S   E+     + N + +G ER   PE LF P ++G  Q GI E +  
Sbjct: 219 EKEMKLARESSESSKLEKTYELPDGNTIKVGNERFRIPEILFSPELIGLEQKGIHELVYE 278

Query: 565 VLNSYPQHIAQSLANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAW 624
            +      + + L  NI ++G    +PGF ERL K+L +  P +    V      +   W
Sbjct: 279 SIQKCDIDVRKDLYENIVLSGGSTLIPGFGERLEKELKQLAPKKLKVKVIAPPERKYSVW 338

Query: 625 SGARQFSLSENFHDFAVTQSDYQEKG 650
            G    +    F D  +T+ +Y+E G
Sbjct: 339 LGGSILASLSTFEDMWITKKEYEESG 364


>gnl|CDD|200935 pfam00022, Actin, Actin. 
          Length = 367

 Score = 99.2 bits (248), Expect = 2e-22
 Identities = 51/222 (22%), Positives = 100/222 (45%), Gaps = 41/222 (18%)

Query: 29  PIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKERGKKD---GETQVGNDISNIEAVRF 85
            +V DNG+   + G+A  + P  +  +++ +PR          G+  + +    +E    
Sbjct: 5   ALVIDNGSGTTKAGFAGEDAPRAVIPSVVGRPRGRGVMVKYYVGDEAL-SKRPGLE---- 59

Query: 86  QLKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSEL 144
            ++   +  +V ++D  E+I+++ F   L ++ E    HP++LTEP LNP  +R   +E+
Sbjct: 60  -VRYPIEDGIVENWDAMEKIWEHTFFEELRVDPE---EHPLLLTEPPLNPPANREKATEI 115

Query: 145 LFECYQVPSVCYGID---SLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQSGVIISCGY 201
           +FE + VP++        S ++       G++                   +G+++  G 
Sbjct: 116 MFETFGVPALYLAKQAVLSAYAS------GRT-------------------TGLVVDSGA 150

Query: 202 QCTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPS 243
             T V+PV  G +     +R DL G  +  +L KLL  +  S
Sbjct: 151 GVTSVVPVYEGYVLQKAIRRSDLAGDDLTDYLRKLLSSRTYS 192



 Score = 84.2 bits (209), Expect = 2e-17
 Identities = 41/118 (34%), Positives = 57/118 (48%), Gaps = 1/118 (0%)

Query: 534 IGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQSLANNIFVTGSLCQLPGF 593
           +G ER   PE LF PS++GS  AGI E +   +N+    +  SL  NI VTG     PGF
Sbjct: 241 LGNERFRVPEILFNPSLIGSESAGIPELIYDSINACDVDLRPSLLANIVVTGGTTLFPGF 300

Query: 594 VERLNKDLLENRPFQSHFSVSLAEN-PELDAWSGARQFSLSENFHDFAVTQSDYQEKG 650
            ERL K+L +  P      +    N  +  AW G    +    F    V++ +Y+E G
Sbjct: 301 TERLEKELAQLAPSGVKVKIIAPPNERKYSAWIGGSILASLGTFQQMWVSKQEYEEHG 358


>gnl|CDD|240225 PTZ00004, PTZ00004, actin-2; Provisional.
          Length = 378

 Score = 93.7 bits (233), Expect = 2e-20
 Identities = 61/272 (22%), Positives = 122/272 (44%), Gaps = 46/272 (16%)

Query: 30  IVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKER---GKKDGETQVGNDISNIEAVRFQ 86
            V DNG+   + G+A  + P  +F +++ +P+      G ++ +  VG++  +   +   
Sbjct: 9   AVVDNGSGMVKAGFAGDDAPRCVFPSIVGRPKNPGIMVGMEEKDCYVGDEAQDKRGI-LT 67

Query: 87  LKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSELL 145
           LK   +  +VT++D  E+I+ + F + L +  E    HP++LTE  LNP  +R  M++++
Sbjct: 68  LKYPIEHGIVTNWDDMEKIWHHTFYNELRVAPE---EHPVLLTEAPLNPKANREKMTQIM 124

Query: 146 FECYQVPSVCYGID---SLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQSGVIISCGYQ 202
           FE + VP++   I    SL++         SG                  +G+++  G  
Sbjct: 125 FETHNVPAMYVAIQAVLSLYA---------SGRT----------------TGIVLDSGDG 159

Query: 203 CTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPSHINSITPSRSEELLWD--- 259
            +H +P+  G        R+D+ G  +  ++ K+L         + T +  +E++ D   
Sbjct: 160 VSHTVPIYEGYSLPHAIHRLDVAGRDLTEYMMKIL----HERGTTFTTTAEKEIVRDIKE 215

Query: 260 -YGFVATDYREHLRKWLDAEFYDSNVVKVQLP 290
              ++A D+ E +     A   D      +LP
Sbjct: 216 KLCYIALDFDEEMGN--SAGSSDKYEESYELP 245



 Score = 60.9 bits (148), Expect = 8e-10
 Identities = 37/122 (30%), Positives = 50/122 (40%), Gaps = 1/122 (0%)

Query: 530 NQLHIGVERMCGPECLFQPSMLGSIQA-GISETLNFVLNSYPQHIAQSLANNIFVTGSLC 588
             + +G ER   PE LFQPS++G  +  GI E     +N     I + L  NI ++G   
Sbjct: 248 TIITVGSERFRCPEALFQPSLIGKEEPPGIHELTFQSINKCDIDIRKDLYGNIVLSGGTT 307

Query: 589 QLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAWSGARQFSLSENFHDFAVTQSDYQE 648
              G  ERL K+L    P      V      +   W G    S    F    VT+ +Y E
Sbjct: 308 MYRGLPERLTKELTTLAPSTMKIKVVAPPERKYSVWIGGSILSSLPTFQQMWVTKEEYDE 367

Query: 649 KG 650
            G
Sbjct: 368 SG 369


>gnl|CDD|240426 PTZ00466, PTZ00466, actin-like protein; Provisional.
          Length = 380

 Score = 86.9 bits (215), Expect = 3e-18
 Identities = 61/226 (26%), Positives = 109/226 (48%), Gaps = 34/226 (15%)

Query: 18  EYLDELRDSSLPIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKER---GKKDGETQVG 74
           EY  +L  S+ PI+ DNG    + G+A  + PNL+F + + +P+ +R   G  +G   VG
Sbjct: 4   EYAKQLY-SNQPIIIDNGTGYIKAGFAGEDVPNLVFPSYVGRPKYKRVMAGAVEGNIFVG 62

Query: 75  NDISNIEAVRFQLKTQFDKN--VVTHYDIQEQIFDYAFSHLSINTEGNVNHPIVLTEPFL 132
           N     E  R  LK  +  N  ++ +++  E I+ + ++ + IN+E    HP++LTE  L
Sbjct: 63  N---KAEEYRGLLKVTYPINHGIIENWNDMENIWIHVYNSMKINSE---EHPVLLTEAPL 116

Query: 133 NPNYSRSLMSELLFECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQ 192
           NP  ++  ++E+ FE + VP++   I ++ S            + SCG            
Sbjct: 117 NPQKNKEKIAEVFFETFNVPALFISIQAILS------------LYSCGK----------T 154

Query: 193 SGVIISCGYQCTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQ 238
           +G ++ CG    H + +  G    +   R D+ G  +  +L  LL+
Sbjct: 155 NGTVLDCGDGVCHCVSIYEGYSITNTITRTDVAGRDITTYLGYLLR 200



 Score = 46.9 bits (111), Expect = 2e-05
 Identities = 29/123 (23%), Positives = 56/123 (45%)

Query: 528 EANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQSLANNIFVTGSL 587
           + +Q+ IG ER   PE LF PS+LG    G+SE +   +      + ++L ++I ++G  
Sbjct: 249 DGSQILIGSERYRAPEVLFNPSILGLEYLGLSELIVTSITRADMDLRRTLYSHIVLSGGT 308

Query: 588 CQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAWSGARQFSLSENFHDFAVTQSDYQ 647
               GF +RL  ++ +  P      +S     +   + G    +    F    +++ ++ 
Sbjct: 309 TMFHGFGDRLLNEIRKFAPKDITIRISAPPERKFSTFIGGSILASLATFKKIWISKQEFD 368

Query: 648 EKG 650
           E G
Sbjct: 369 EYG 371


>gnl|CDD|173506 PTZ00281, PTZ00281, actin; Provisional.
          Length = 376

 Score = 84.0 bits (207), Expect = 3e-17
 Identities = 57/249 (22%), Positives = 115/249 (46%), Gaps = 34/249 (13%)

Query: 30  IVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKE-----RGKKDGETQVGNDISNIEAVR 84
           +V DNG+  C+ G+A  + P  +F +++ +PR        G+KD  + VG++  +   + 
Sbjct: 9   LVIDNGSGMCKAGFAGDDAPRAVFPSIVGRPRHTGVMVGMGQKD--SYVGDEAQSKRGI- 65

Query: 85  FQLKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSE 143
             LK   +  +VT++D  E+I+ + F + L +  E    HP++LTE  LNP  +R  M++
Sbjct: 66  LTLKYPIEHGIVTNWDDMEKIWHHTFYNELRVAPE---EHPVLLTEAPLNPKANREKMTQ 122

Query: 144 LLFECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQSGVIISCGYQC 203
           ++FE +  P++   I ++ S   +G                        +G+++  G   
Sbjct: 123 IMFETFNTPAMYVAIQAVLSLYASG----------------------RTTGIVMDSGDGV 160

Query: 204 THVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPSHINSITPSRSEELLWDYGFV 263
           +H +P+  G        R+DL G  +  ++ K+L  +  S   +       ++     +V
Sbjct: 161 SHTVPIYEGYALPHAILRLDLAGRDLTDYMMKILTERGYSFTTTAEREIVRDIKEKLAYV 220

Query: 264 ATDYREHLR 272
           A D+   ++
Sbjct: 221 ALDFEAEMQ 229



 Score = 61.3 bits (148), Expect = 7e-10
 Identities = 41/137 (29%), Positives = 61/137 (44%), Gaps = 6/137 (4%)

Query: 516 TSLNQEQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQ 575
           ++L +  EL   +   + IG ER   PE LFQPS LG   AGI ET    +      I +
Sbjct: 235 SALEKSYELP--DGQVITIGNERFRCPEALFQPSFLGMESAGIHETTYNSIMKCDVDIRK 292

Query: 576 SLANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPE--LDAWSGARQFSLS 633
            L  N+ ++G     PG  +R+NK+L    P  S   + +   PE     W G    +  
Sbjct: 293 DLYGNVVLSGGTTMFPGIADRMNKELTALAP--STMKIKIIAPPERKYSVWIGGSILASL 350

Query: 634 ENFHDFAVTQSDYQEKG 650
             F    +++ +Y E G
Sbjct: 351 STFQQMWISKEEYDESG 367


>gnl|CDD|240343 PTZ00280, PTZ00280, Actin-related protein 3; Provisional.
          Length = 414

 Score = 66.7 bits (163), Expect = 1e-11
 Identities = 65/295 (22%), Positives = 121/295 (41%), Gaps = 50/295 (16%)

Query: 26  SSLPI-VFDNGAWCCRVGWASCEKPNLIFKNLIA------KPRKERGKKDGETQVGNDIS 78
           S+LP+ V DNG    ++G+A   +P  I   LIA      + R ++G +D +  +G++  
Sbjct: 2   STLPVVVIDNGTGYTKMGYAGNTEPTYIIPTLIADNSKQSRRRSKKGFEDLDFYIGDEAL 61

Query: 79  NIEAVRFQLKTQFDKNVVTHYDIQEQIFDYA-FSHLSINTEGNVNHPIVLTEPFLNPNYS 137
              +  + L       +V  +D+ E+ ++   F +L    E    H  +LTEP +NP  +
Sbjct: 62  A-ASKSYTLTYPMKHGIVEDWDLMEKFWEQCIFKYLRCEPE---EHYFILTEPPMNPPEN 117

Query: 138 RSLMSELLFECYQVPSVCYGID---SLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQSG 194
           R   +E++FE + V  +   +    +L +   +    + G                  +G
Sbjct: 118 REYTAEIMFETFNVKGLYIAVQAVLALRASWTSKKAKELG---------------GTLTG 162

Query: 195 VIISCGYQCTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPSHINSITPSRSE 254
            +I  G   THVIPV++G +  S  K I L G  + + + ++L+ +            ++
Sbjct: 163 TVIDSGDGVTHVIPVVDGYVIGSSIKHIPLAGRDITNFIQQMLRERGEPIPAEDILLLAQ 222

Query: 255 ELLWDYGFVATDYREHLRKWLDAEF--YDSNVVKVQLPYAVPVPNLTTEQQKDRR 307
            +   Y +VA D        +  EF  YDS+          P  +       +  
Sbjct: 223 RIKEKYCYVAPD--------IAKEFEKYDSD----------PKNHFKKYTAVNSV 259



 Score = 42.0 bits (99), Expect = 7e-04
 Identities = 33/153 (21%), Positives = 56/153 (36%), Gaps = 18/153 (11%)

Query: 515 FTSLNQEQELSPKEANQLHIGVERMCGPECLFQPSML-GSIQAGISETLNFVLNSYPQHI 573
           F        ++ K+   + +G ER  GPE  F P +        + E ++  + S P   
Sbjct: 250 FKKYTAVNSVT-KKPYTVDVGYERFLGPEMFFHPEIFSSEWTTPLPEVVDDAIQSCPIDC 308

Query: 574 AQSLANNIFVTGSLCQLPGFVERLNKD--------------LLENRPFQSHFSVSLAENP 619
            + L  NI ++G      GF +RL +D              L   +       V++  +P
Sbjct: 309 RRPLYKNIVLSGGSTMFKGFDKRLQRDVRKRVDRRLKKAEELSGGKLKPIPIDVNVVSHP 368

Query: 620 --ELDAWSGARQFSLSENFHDFAVTQSDYQEKG 650
                 W G    + S  F     T+++Y E G
Sbjct: 369 RQRYAVWYGGSMLASSPEFEKVCHTKAEYDEYG 401


>gnl|CDD|185631 PTZ00452, PTZ00452, actin; Provisional.
          Length = 375

 Score = 63.2 bits (153), Expect = 1e-10
 Identities = 65/290 (22%), Positives = 122/290 (42%), Gaps = 46/290 (15%)

Query: 30  IVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRK---ERGKKDGETQVGNDISNIEAVRFQ 86
           +V DNG+  C++G A  + P   F  ++ + ++        + E  VG +      V   
Sbjct: 8   VVIDNGSGYCKIGIAGDDAPTSCFPAIVGRSKQNDGIFSTFNKEYYVGEEAQAKRGV-LA 66

Query: 87  LKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSELL 145
           +K      ++  +D  E I+ +AF + L ++ E   + P+ +T+  +N  ++R  M++++
Sbjct: 67  IKEPIQNGIINSWDDIEIIWHHAFYNELCMSPE---DQPVFMTDAPMNSKFNRERMTQIM 123

Query: 146 FECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQVYQYNGWEGQSGVIISCGYQCTH 205
           FE +  P +    +++ S   +G                         G+++  G   TH
Sbjct: 124 FETFNTPCLYISNEAVLSLYTSG----------------------KTIGLVVDSGEGVTH 161

Query: 206 VIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQ-LKYP---SHINSITPSRSEELLWDYG 261
            +PV  G        +I+L G     +L ++LQ L Y     H   I  +  E L     
Sbjct: 162 CVPVFEGHQIPQAITKINLAGRLCTDYLTQILQELGYSLTEPHQRIIVKNIKERLC---- 217

Query: 262 FVATDYREHLRKWLDAEFYDSNVVKVQLPYAVPVPN-LTTEQQKDRRKEL 310
           + A D ++  R + ++   DS       PY +P  N LT + QK R  E+
Sbjct: 218 YTALDPQDEKRIYKESNSQDS-------PYKLPDGNILTIKSQKFRCSEI 260



 Score = 46.7 bits (110), Expect = 3e-05
 Identities = 36/135 (26%), Positives = 58/135 (42%), Gaps = 2/135 (1%)

Query: 517 SLNQEQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQS 576
           S +Q+      + N L I  ++    E LFQP ++G   AGI       +      + Q 
Sbjct: 233 SNSQDSPYKLPDGNILTIKSQKFRCSEILFQPKLIGLEVAGIHHLAYSSIKKCDLDLRQE 292

Query: 577 LANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAW-SGARQFSLSEN 635
           L  NI ++G     PG   RL+ +L    P Q    V+   +    AW  G+ Q +LS  
Sbjct: 293 LCRNIVLSGGTTLFPGIANRLSNELTNLVPSQLKIQVAAPPDRRFSAWIGGSIQCTLSTQ 352

Query: 636 FHDFAVTQSDYQEKG 650
              + + + +Y E+G
Sbjct: 353 QPQW-IKRQEYDEQG 366


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 46.2 bits (110), Expect = 5e-05
 Identities = 43/263 (16%), Positives = 95/263 (36%), Gaps = 34/263 (12%)

Query: 305 DRRKELAK-------------KLVEMNAKKREERLVDDERHLNELLELREIV-ELTPSDH 350
           DR+KE A               L+    +++ ERL  +         L +   E    + 
Sbjct: 169 DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYEL 228

Query: 351 SHAREAF---KSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNG-EDLTEEPKAKLSKE 406
              +EA    K      +  L + + +L ++I + + ++       E+L ++ K    +E
Sbjct: 229 LKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEE 288

Query: 407 IAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISE----- 461
                     K  + E + + A +      +++  +D  +R      E  +L++E     
Sbjct: 289 QL------RVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELE 342

Query: 462 --LARKEKRDDDFGMR-DEDWDVYKVINKDAG--DTDSEEEQERLIELEEILRQHDPEFT 516
             +  + KR D       E  +  + +  +    D +  E ++ L +  E L +   E  
Sbjct: 343 REIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREIN 402

Query: 517 SLNQEQELSPKEANQLHIGVERM 539
            L +E +   +E  +L   +  +
Sbjct: 403 ELKRELDRLQEELQRLSEELADL 425



 Score = 42.0 bits (99), Expect = 0.001
 Identities = 49/253 (19%), Positives = 99/253 (39%), Gaps = 29/253 (11%)

Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
           E+  +   EL K+L E+     E      +    E L ++E +    ++ +    +    
Sbjct: 254 EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEK 313

Query: 361 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEP--KAKLSKEIA---------- 408
               ++D  + + +L+ +I+K  A+I       ++ EE   + KL++E A          
Sbjct: 314 E-RELEDAEERLAKLEAEIDKLLAEIEELE--REIEEERKRRDKLTEEYAELKEELEDLR 370

Query: 409 --VPESEAEFKAWLIETKKKRAYI---IDKKNARKQRRQDLAKRRTAAAQERMRLISELA 463
             + E + EF     E K  R  +     + N  K+    L +     ++E   L + +A
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIA 430

Query: 464 RKEKRDDDFGMRDEDWDVYKVINKDAGD-----TDSEEEQERLIELEEILRQHDPEFTSL 518
             E + ++     E  D    I K          D  + ++ L +L+E   + + E + L
Sbjct: 431 GIEAKINELEEEKE--DKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKL 488

Query: 519 NQEQELSPKEANQ 531
             ++EL+  EA  
Sbjct: 489 --QRELAEAEAQA 499



 Score = 36.2 bits (84), Expect = 0.065
 Identities = 45/248 (18%), Positives = 92/248 (37%), Gaps = 24/248 (9%)

Query: 302 QQKDRRKELAKKLVEMNAKKREERLVDD----ERHLNELLEL-----REIVELTPSDHSH 352
            +    + L ++L  +  K+    L  +    E  L+EL +      R+I E+       
Sbjct: 671 SEPAELQRLRERLEGL--KRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQL 728

Query: 353 AREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPES 412
            +E  K      +++L + ++ L+Q+IE  K+++       +  EE   KL + +   E+
Sbjct: 729 EQEEEKLKER--LEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEA 786

Query: 413 EAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDF 472
                       +     +++    +  R +   R       R+ L  E   KE ++   
Sbjct: 787 RLSHSRIPEIQAELSK--LEE----EVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQE 840

Query: 473 GMRDEDWDVYKVINK---DAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEA 529
                D        +   +  +   EE +E L ELE  LR  +     L +E++    + 
Sbjct: 841 QRI--DLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQL 898

Query: 530 NQLHIGVE 537
            +L   +E
Sbjct: 899 RELERKIE 906



 Score = 34.3 bits (79), Expect = 0.31
 Identities = 22/105 (20%), Positives = 45/105 (42%), Gaps = 6/105 (5%)

Query: 304 KDRRKELAKKLVEMNAKKRE--ERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMG 361
           K++ K + K++  +N KK E  E L   E     L +L   +     +        + + 
Sbjct: 846 KEQIKSIEKEIENLNGKKEELEEEL---EELEAALRDLESRLGDLKKERDELEAQLRELE 902

Query: 362 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKE 406
              I++L   I + ++++ + KAK+ A        E+PK +  + 
Sbjct: 903 -RKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEI 946



 Score = 31.2 bits (71), Expect = 2.2
 Identities = 32/123 (26%), Positives = 60/123 (48%), Gaps = 5/123 (4%)

Query: 299 TTEQQKDRRKELAKKLVEMNAKKREE-RLVDDERHLN-ELLELREIVELTPSDHSHAREA 356
           T ++ KD R++L K   E+N  KRE  RL ++ + L+ EL +L   +    +  +   E 
Sbjct: 383 TRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEE 442

Query: 357 FKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEF 416
            +   +  I+     + QL   + K + ++       D  E+  +KL +E+A  E+EA+ 
Sbjct: 443 KEDKAL-EIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELA--EAEAQA 499

Query: 417 KAW 419
           +A 
Sbjct: 500 RAS 502


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 43.2 bits (102), Expect = 5e-04
 Identities = 39/237 (16%), Positives = 87/237 (36%), Gaps = 26/237 (10%)

Query: 297 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHL-NELLELREIVELTPSDHSHARE 355
            L  + ++  R+    K      ++  E+L      L  EL EL E +E          E
Sbjct: 706 ELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEE 765

Query: 356 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 415
                           +  L++ + K K +I          +E   +L +E+   E   +
Sbjct: 766 E---------------LESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLD 810

Query: 416 FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMR 475
                +E+ ++R   ++++    +   +  + +    +E +  + +   + K + +  + 
Sbjct: 811 ALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELE-ELE 869

Query: 476 DEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQL 532
            E  ++   +         +E +E   ELEE LR+ + E   L +E E   +   +L
Sbjct: 870 AEKEELEDEL---------KELEEEKEELEEELRELESELAELKEEIEKLRERLEEL 917



 Score = 38.9 bits (91), Expect = 0.009
 Identities = 39/212 (18%), Positives = 87/212 (41%), Gaps = 12/212 (5%)

Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSD---HSHAREAF 357
           +++ +  +E  + L E    K +E + + E     L E  E +E    +      A E  
Sbjct: 757 QERLEELEEELESLEEA-LAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERE 815

Query: 358 KSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFK 417
                   + L + I +L+++IE+ + K+       +  E+   +L +E+   E EAE +
Sbjct: 816 LESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELE--ELEAEKE 873

Query: 418 AWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDE 477
              +E + K     ++K   ++  ++L        +E  +L   L   E + +   +   
Sbjct: 874 E--LEDELKELE--EEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELP 929

Query: 478 DWDVYKVINKDAGDTDSEEEQERLIELEEILR 509
             ++ + + ++  DT   E +  +  LEE + 
Sbjct: 930 --ELEEELEEEYEDTLETELEREIERLEEEIE 959



 Score = 37.0 bits (86), Expect = 0.042
 Identities = 22/136 (16%), Positives = 51/136 (37%), Gaps = 9/136 (6%)

Query: 301 EQQKDRRKELAKKLVEMNAKKRE-----ERLVDDERHLNELLELREIVELTPSDHSHARE 355
           E+  +   ELA+   E+   KRE     ERL      L +L E  + +E    +     E
Sbjct: 383 EELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELE 442

Query: 356 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 415
                    +++L + + +L+ ++++ + ++          E+  + L   +   E+E  
Sbjct: 443 ELNE----ELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQR 498

Query: 416 FKAWLIETKKKRAYII 431
               +    +     +
Sbjct: 499 ASQGVRAVLEALESGL 514



 Score = 33.1 bits (76), Expect = 0.63
 Identities = 42/229 (18%), Positives = 84/229 (36%), Gaps = 48/229 (20%)

Query: 304 KDRRKELAKKL--VEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMG 361
           K+R++E  +KL   E N ++ E+ L + E+ L +L    E  E      +  RE   ++ 
Sbjct: 171 KERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALL 230

Query: 362 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLI 421
           +  +++L K + +L++++ + + ++          +E   +  KEI              
Sbjct: 231 LAKLKELRKELEELEEELSRLEEELEEL-------QEELEEAEKEI-------------- 269

Query: 422 ETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDV 481
                           K   ++L +      +E + L  E+   E        R      
Sbjct: 270 -------------EELKSELEELREELEELQEELLELKEEIEELEGEISLLRER------ 310

Query: 482 YKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSL-NQEQELSPKEA 529
                 +  + + EE +ERL EL+E +     E        +EL    A
Sbjct: 311 -----LEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLA 354



 Score = 30.1 bits (68), Expect = 5.6
 Identities = 27/172 (15%), Positives = 69/172 (40%), Gaps = 23/172 (13%)

Query: 365 IQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETK 424
           + +L   + +L+++++  K ++ +    + L E  +     E  + E + E  A   E +
Sbjct: 676 LAELEAQLEKLEEELKSLKNELRSLE--DLLEELRRQLEELERQLEELKRELAALEEELE 733

Query: 425 KKRAYI--IDKKNARKQRRQDLAKRRTAAAQERMR-LISELARKEKRDDDFGMRDEDWDV 481
           + ++ +  ++++    +   +  + R    +E +  L   LA+ ++  ++          
Sbjct: 734 QLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELE-------- 785

Query: 482 YKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQLH 533
                        +  QE L ELEE L + +    +L +E E   +   +L 
Sbjct: 786 ----------EKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLE 827


>gnl|CDD|212657 cd00012, NBD_sugar-kinase_HSP70_actin, Nucleotide-Binding Domain of
           the sugar kinase/HSP70/actin superfamily.  This
           superfamily includes the actin family, the HSP70 family
           of molecular chaperones and nucleotide exchange factors,
           the ROK (repressor, ORF, kinase) family, the hexokinase
           family, the FGGY family (which includes glycerol kinase
           and similar carbohydrate kinases such as rhamnulokinase
           and xylulokinase), the exopolyphosphatase/guanosine
           pentaphosphate phosphohydrolase/nucleoside triphosphate
           diphosphohydrolase family, propionate kinase/acetate
           kinase family, glycerol dehydratase reactivase,
           2-hydroxyglutaryl-CoA dehydratase component A,
           N-acetylglucosamine kinase, butyrate kinase 2,
           Escherichia coli YeaZ and similar glycoproteases, the
           cell shape-determining protein MreB, the plasmid DNA
           segregation factor ParM, cell cycle proteins FtsA, Pili
           assembly protein PilM, ethanolamine utilization protein
           EutJ, and similar proteins. The nucleotide-binding site
           residues are conserved; the nucleotide sits in a deep
           cleft formed between the two lobes of the
           nucleotide-binding domain (NBD). Substrate binding to
           superfamily members is associated with closure of this
           catalytic site cleft. The functional activities of
           several members of the superfamily, including
           hexokinases, actin, and HSP70s, are modulated by
           allosteric effectors, which may act on the cleft
           closure.
          Length = 185

 Score = 40.6 bits (95), Expect = 9e-04
 Identities = 28/185 (15%), Positives = 50/185 (27%), Gaps = 34/185 (18%)

Query: 72  QVGNDISNIEAVRFQLKTQFDKNVVTHYDIQEQIFDYAFSHLSINTEGNVNHPIVLTEPF 131
            V +    I               VT  D  E+             +  +   + +TEP 
Sbjct: 13  GVADLDGEILPEEIVPTPVGRPGAVTDLDELEEALRELLKEALRQLKSEI-DAVGITEPG 71

Query: 132 LNPNYSRSLMS---------ELLFECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQ 182
             P  +R ++           L  E      V    D++ +    G  G+          
Sbjct: 72  GVPKENREVIILPNLLLIPLALALEDLGGVPVAVVNDAVAAALAEGLFGK---------- 121

Query: 183 VYQYNGWEGQSGVIISCGYQCTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQL--K 240
                  E  + +++  G   T +  V +G     K      G   +   L  +L L   
Sbjct: 122 -------EEDTVLVVDLGTGTTGIAIVEDG-----KGGVGAAGELGIAEALAAVLNLLDG 169

Query: 241 YPSHI 245
            P  +
Sbjct: 170 LPDAV 174


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 41.3 bits (97), Expect = 0.002
 Identities = 37/188 (19%), Positives = 74/188 (39%), Gaps = 23/188 (12%)

Query: 297 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDD---------------ERHLNELLE-LR 340
            L  +Q K+  ++L  +L E+     E RL+                 ++ L EL E L 
Sbjct: 551 KLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLS 610

Query: 341 EIVELTPSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPK 400
           ++ EL  S      E          ++L   + +L  + E  +    A    E+  EE +
Sbjct: 611 QLEELLQSLELSEAENEL---EEAEEELESELEKLNLQAELEELLQAALEELEEKVEELE 667

Query: 401 AKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLIS 460
           A++ +E+   E+E + +    E  ++   + ++    ++  ++L K+     Q    L S
Sbjct: 668 AEIRRELQRIENEEQLE----EKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELES 723

Query: 461 ELARKEKR 468
             A  E+ 
Sbjct: 724 RKAELEEL 731



 Score = 29.3 bits (66), Expect = 9.7
 Identities = 47/265 (17%), Positives = 103/265 (38%), Gaps = 42/265 (15%)

Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
           + +++  +EL + L E   +++ ERL + ER + EL E  E +         A       
Sbjct: 270 KIREEELRELERLLEE--LEEKIERLEELEREIEELEEELEGLR--------ALLEELEE 319

Query: 361 GINNIQDLNKSINQLQQKIEKTKAKI----IAYNNGEDLTEEPKAKLSKEIAVPESEAE- 415
            +  ++ L + + +L++K+EK ++++       N    L EE   +L + +   E E E 
Sbjct: 320 LLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEK 379

Query: 416 -------FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKR 468
                   +  + E K++ A +       ++  ++L K      +E   L  E+ + E++
Sbjct: 380 ALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQ 439

Query: 469 DDDFGMRDEDW------------------DVYKVINKDAGDTDSE--EEQERLIELEEIL 508
            +    ++                     + ++    +  + + E  EE+    + E  L
Sbjct: 440 INQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAEL 499

Query: 509 RQHDPEFTSLNQEQELSPKEANQLH 533
           R+   E     +E E    E  +L 
Sbjct: 500 REEIEELEKELRELEEELIELLELE 524


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 40.0 bits (94), Expect = 0.005
 Identities = 44/234 (18%), Positives = 93/234 (39%), Gaps = 29/234 (12%)

Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERH--LNELLELREIVELTPSDHSHAREAFK 358
           E+ ++ +KEL     E++  +++++++ +       +L EL   +E   S      E   
Sbjct: 281 EEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELA 340

Query: 359 SMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKA 418
            +    +++L + +  L+ ++E+ +A++    +  +  EE    L  ++A  E +     
Sbjct: 341 ELE-EKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIA--- 396

Query: 419 WLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDED 478
                  +   +       + R + L  RR    QE   L+ +L   E    +     E+
Sbjct: 397 ---SLNNEIERL-------EARLERLEDRRERLQQEIEELLKKLEEAEL--KELQAELEE 444

Query: 479 WDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQL 532
                    +    + +EE ERL E  E LR+   E     Q  + + +E  QL
Sbjct: 445 --------LEEELEELQEELERLEEALEELRE---ELEEAEQALDAAERELAQL 487



 Score = 39.7 bits (93), Expect = 0.007
 Identities = 48/243 (19%), Positives = 107/243 (44%), Gaps = 15/243 (6%)

Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELR-EIVELTPSDHSHAREAFKS 359
           E+ ++  +EL ++L E   +  E      E    +L ELR E+ EL   +    ++   +
Sbjct: 235 EELREELEELQEELKEAEEELEELTAELQELEE-KLEELRLEVSELE-EEIEELQKELYA 292

Query: 360 MGI------NNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE 413
           +           Q L + +  L++++E+ +A++    +  D   E  A+L +++   E +
Sbjct: 293 LANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLE--ELK 350

Query: 414 AEFKAWLIETKKKRAYIIDKKNARKQRRQDL--AKRRTAAAQERMRLI-SELARKEKRDD 470
            E ++   E ++  A + + ++  ++  + L   + + A  + ++  + +E+ R E R +
Sbjct: 351 EELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLE 410

Query: 471 DFGMRDEDWD-VYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEA 529
               R E      + + K   + + +E Q  L ELEE L +   E   L +  E   +E 
Sbjct: 411 RLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREEL 470

Query: 530 NQL 532
            + 
Sbjct: 471 EEA 473



 Score = 38.5 bits (90), Expect = 0.015
 Identities = 44/236 (18%), Positives = 84/236 (35%), Gaps = 36/236 (15%)

Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
            Q      EL  ++ E+  +  E      E    E+ EL   +E    +    REA   +
Sbjct: 750 AQLSKELTELEAEIEELEERLEEAEEELAE-AEAEIEELEAQIEQLKEELKALREALDEL 808

Query: 361 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIA------------ 408
               +  LN+    L++++E  + +I A     +  EE   +LS++I             
Sbjct: 809 R-AELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEEL 867

Query: 409 VPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKR 468
           + E E+E +A L E       +       +   ++L++       +R  L  EL    ++
Sbjct: 868 IEELESELEALLNERASLEEAL----ALLRSELEELSEELRELESKRSELRRELEELREK 923

Query: 469 DDDFGMR------------DEDWDVYKVI------NKDAGDTDSEEEQERLIELEE 506
                +R            +   + Y +        ++  + D EE + RL  LE 
Sbjct: 924 LAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLEN 979



 Score = 34.3 bits (79), Expect = 0.27
 Identities = 43/242 (17%), Positives = 92/242 (38%), Gaps = 60/242 (24%)

Query: 304 KDRRKELAKKLV--EMNAKKREERLVDDERHLNEL----------LELREIVELTPSDHS 351
           K+RRKE  +KL     N  + E+ L + ER L  L           EL+   EL      
Sbjct: 171 KERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKA--EL------ 222

Query: 352 HAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPE 411
             RE   ++ +  +++L + + +LQ+++++               EE   +L+ E+   E
Sbjct: 223 --RELELALLVLRLEELREELEELQEELKE--------------AEEELEELTAELQELE 266

Query: 412 SEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDD 471
            +       +E  +     ++++    Q+         +  +++ +++ E   +    + 
Sbjct: 267 EK-------LEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRE---RLANLER 316

Query: 472 FGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQ 531
                           +  +   EE + +L EL E L + + +   L +E E    E  +
Sbjct: 317 --------------QLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEE 362

Query: 532 LH 533
           L 
Sbjct: 363 LE 364



 Score = 33.9 bits (78), Expect = 0.36
 Identities = 49/247 (19%), Positives = 89/247 (36%), Gaps = 18/247 (7%)

Query: 296 PNLTTEQQKDRRKELAKKLVEMNAKKRE-----ERLVDDERHLNELLELREIVELTPSDH 350
            N +  +++   +EL +K+ E+  K  E       L  +   L E LE         S  
Sbjct: 668 TNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQ 727

Query: 351 SHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVP 410
             A     +     ++ L + I QL +++ + +A+I       +  EE  A+   EI   
Sbjct: 728 ISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE-- 785

Query: 411 ESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDD 470
           E EA+ +    E K  R   +D+  A       L +      +    L   +A  E+R +
Sbjct: 786 ELEAQIEQLKEELKALRE-ALDELRAELTL---LNEEAANLRERLESLERRIAATERRLE 841

Query: 471 DF-----GMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELS 525
           D       + ++   +   I +   +   EE +  L  L       +     L  E E  
Sbjct: 842 DLEEQIEELSEDIESLAAEIEE--LEELIEELESELEALLNERASLEEALALLRSELEEL 899

Query: 526 PKEANQL 532
            +E  +L
Sbjct: 900 SEELREL 906



 Score = 33.5 bits (77), Expect = 0.49
 Identities = 32/213 (15%), Positives = 77/213 (36%), Gaps = 22/213 (10%)

Query: 334 NELLELREIVELTPSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGE 393
            E+ EL E +E      +   +A        + +L K + +L++++E+ + ++   +   
Sbjct: 677 REIEELEEKIEELEEKIAELEKA--------LAELRKELEELEEELEQLRKELEELSRQI 728

Query: 394 DLTEEPKAKLSKEIAVPESE--------AEFKAWLIETKKKRAYIIDKKNARKQRRQDLA 445
               +  A+L  E+   E           E +A + E +++     ++    +   ++L 
Sbjct: 729 SALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788

Query: 446 KRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAGDT-DSEEEQERLIEL 504
            +     +E   L   L           +   + +   +  +           + RL +L
Sbjct: 789 AQIEQLKEELKALREALDELRAE-----LTLLNEEAANLRERLESLERRIAATERRLEDL 843

Query: 505 EEILRQHDPEFTSLNQEQELSPKEANQLHIGVE 537
           EE + +   +  SL  E E   +   +L   +E
Sbjct: 844 EEQIEELSEDIESLAAEIEELEELIEELESELE 876


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 39.0 bits (92), Expect = 0.008
 Identities = 40/143 (27%), Positives = 62/143 (43%), Gaps = 25/143 (17%)

Query: 301 EQQKDRRKELAKKLVEMNAK--KREERLVDDERHLN---ELLELRE-IVELTPSDHSHAR 354
           E+    R E  K+L E   +  K E+RL+  E +L+   ELLE RE  +E    +    +
Sbjct: 64  EEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQ 123

Query: 355 EAFKSMGINNIQDLNKSINQLQQKIEK----TK--AKIIAYNNGEDLTEEPKAKLSKEIA 408
           +  +       ++L + I +  Q++E+    T   AK I     E+      A L KEI 
Sbjct: 124 QELEKK----EEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEARHEAAVLIKEI- 178

Query: 409 VPESEAEFKAWLIETKKKRAYII 431
             E EA+      E   K+A  I
Sbjct: 179 --EEEAK------EEADKKAKEI 193



 Score = 31.3 bits (72), Expect = 1.7
 Identities = 27/143 (18%), Positives = 58/143 (40%), Gaps = 22/143 (15%)

Query: 368 LNKSINQLQQKIEKTKAKII---AYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETK 424
           + K I + + K  + +AK I   A    E + +E   +  +EI    +E E      E +
Sbjct: 24  VRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEK-----ELR 78

Query: 425 KKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYKV 484
           ++R  +   +    Q+ ++L ++     +    L  +    E++  +   ++E+ +    
Sbjct: 79  ERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELE---- 134

Query: 485 INKDAGDTDSEEEQERLIELEEI 507
                     E  +E+L ELE I
Sbjct: 135 ----------ELIEEQLQELERI 147


>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain.  This presumed domain is
           found at the C-terminus of a family of FtsJ-like
           methyltransferases. Members of this family are involved
           in 60S ribosomal biogenesis.
          Length = 212

 Score = 36.9 bits (86), Expect = 0.017
 Identities = 27/94 (28%), Positives = 42/94 (44%), Gaps = 15/94 (15%)

Query: 376 QQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKK- 434
            +++EK K K  A    ED++E  KAK  K++         KA   + K+K+ Y++ KK 
Sbjct: 125 LKRLEKAKKKAEAIAEDEDISEREKAKQIKKLYK-------KAKKKKKKRKKTYVVAKKG 177

Query: 435 NARKQRRQDLAKRRTAAAQERMRLISELARKEKR 468
           N  K  R    K +      RM       +K+ R
Sbjct: 178 NKGKAGRPKGVKGKYKVVDPRM-------KKDLR 204


>gnl|CDD|193205 pfam12729, 4HB_MCP_1, Four helix bundle sensory module for signal
           transduction.  This family is a four helix bundle that
           operates as a ubiquitous sensory module in prokaryotic
           signal-transduction. The 4HB_MCP is always found between
           two predicted transmembrane helices indicating that it
           detects only extracellular signals. In many cases the
           domain is associated with a cytoplasmic HAMP domain
           suggesting that most proteins carrying the bundle might
           share the mechanism of transmembrane signalling which is
           well-characterized in E coli chemoreceptors.
          Length = 181

 Score = 36.4 bits (85), Expect = 0.023
 Identities = 27/104 (25%), Positives = 39/104 (37%), Gaps = 27/104 (25%)

Query: 362 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLI 421
           + +I++L   I++L +K EKT            LTEE K              EFK  L 
Sbjct: 81  LKDIEELRAEIDKLLKKYEKTI-----------LTEEEKKLF----------NEFKEQLK 119

Query: 422 ETKKKRAYIID--KKN----ARKQRRQDLAKRRTAAAQERMRLI 459
             +K R  ++D  K      A      +L   R A  +    LI
Sbjct: 120 AYRKVRNKVLDLAKAGKNDEAYALYLTELEPARDAVIEALDELI 163


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 36.9 bits (85), Expect = 0.039
 Identities = 30/249 (12%), Positives = 85/249 (34%), Gaps = 21/249 (8%)

Query: 300 TEQQKDRRKELAKK------LVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHA 353
           + +++ +++ L K       L E+     E +L + +        L         +    
Sbjct: 166 SREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEE 225

Query: 354 REAFKSMG---INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVP 410
              +          I  L + +   Q++IE +K ++           +   +  KE  + 
Sbjct: 226 NLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQ 285

Query: 411 ESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQ---ERMRLISELARKEK 467
           E E +  A   E  K     ++++    + +   +++     +   ++ +   E   KE 
Sbjct: 286 EEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKEL 345

Query: 468 RDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPK 527
                    ++ ++ +   ++  +   + +++     EE+L +   E   L+   +L  +
Sbjct: 346 ---------KELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEE 396

Query: 528 EANQLHIGV 536
           E    +   
Sbjct: 397 ELELKNEEE 405


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 35.4 bits (82), Expect = 0.082
 Identities = 21/113 (18%), Positives = 42/113 (37%), Gaps = 23/113 (20%)

Query: 302 QQKDRRKELAKKLVEMNA-----KKREERLVDD-----------ERHLNELLEL-REIVE 344
             +  ++ L + L  +        K  E L              E  L +L +L  E+ +
Sbjct: 141 LLEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELED 200

Query: 345 LTPSDHSHAREAFKSM------GINNIQDLNKSINQLQQKIEKTKAKIIAYNN 391
             P++   A+E  K +       +  +++L + + +L+ KIE    K    N 
Sbjct: 201 CDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNT 253


>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
          Length = 156

 Score = 34.0 bits (79), Expect = 0.089
 Identities = 23/93 (24%), Positives = 43/93 (46%), Gaps = 12/93 (12%)

Query: 374 QLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDK 433
            + + +E+ + KI      + L    +AK   E+A  + EA+    L E + + A II+ 
Sbjct: 28  PIMKALEERQKKIA-----DGLAAAERAKKELELAQAKYEAQ----LAEARAEAAEIIE- 77

Query: 434 KNARKQRRQDLAKRRTAAAQERMRLISELARKE 466
             A+K+  Q + + +  A  E  R+    A+ E
Sbjct: 78  -QAKKRAAQIIEEAKAEAEAEAARIK-AQAQAE 108


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 35.4 bits (82), Expect = 0.11
 Identities = 55/216 (25%), Positives = 99/216 (45%), Gaps = 33/216 (15%)

Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
           E++++R +EL KKL E+  +KR E L +      E    +E                K +
Sbjct: 334 EEKEERLEELKKKLKEL--EKRLEELEERHELYEEAKAKKE----------ELERLKKRL 381

Query: 361 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE-AEFKAW 419
                + L K +    +++EK K +I          EE  +K++  I   + E  E K  
Sbjct: 382 TGLTPEKLEKEL----EELEKAKEEI----------EEEISKITARIGELKKEIKELKKA 427

Query: 420 LIETKK-KRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDED 478
           + E KK K    +  +   ++ R++L +  TA   E  R+  EL   E+++    +R E 
Sbjct: 428 IEELKKAKGKCPVCGRELTEEHRKELLEEYTA---ELKRIEKELKEIEEKERK--LRKEL 482

Query: 479 WDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPE 514
            ++ KV+ K++     +E  E+L ELEE L++++ E
Sbjct: 483 RELEKVLKKESELIKLKELAEQLKELEEKLKKYNLE 518



 Score = 33.9 bits (78), Expect = 0.39
 Identities = 38/169 (22%), Positives = 79/169 (46%), Gaps = 20/169 (11%)

Query: 308 KELAKKLVEMNAKKREERLVDDERHLNELL----ELREIVELTPSDHSHARE-----AFK 358
            E   + +E + +K EE++ + E  + EL     EL E V+        A E      F 
Sbjct: 243 LEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFY 302

Query: 359 SMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKA 418
              ++ ++++ K +++L+++I   + +I      ++L EE + +L +E+     E E + 
Sbjct: 303 EEYLDELREIEKRLSRLEEEINGIEERI------KEL-EEKEERL-EELKKKLKELEKR- 353

Query: 419 WLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEK 467
             +E  ++R  + ++  A+K+  + L KR T    E++    E   K K
Sbjct: 354 --LEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAK 400



 Score = 32.7 bits (75), Expect = 0.75
 Identities = 52/227 (22%), Positives = 95/227 (41%), Gaps = 36/227 (15%)

Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDE-----RHLNELLELREIVELTPSDHSHARE 355
           E  K++ KEL + L E+N    E   + +E     + + EL EL+E +E         +E
Sbjct: 193 ELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIE------ELEKE 246

Query: 356 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 415
                G  + + L + I +L+++IE+ K +I      E+L  E K K  KE+   + +AE
Sbjct: 247 LESLEG--SKRKLEEKIRELEERIEELKKEI------EEL--EEKVKELKEL---KEKAE 293

Query: 416 FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMR 475
               L E  ++    +D+    ++R   L +            I EL  KE+R ++   +
Sbjct: 294 EYIKLSEFYEE---YLDELREIEKRLSRLEEE----INGIEERIKELEEKEERLEELKKK 346

Query: 476 DEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQ 522
            +     ++  +     +  E  E     +E L +     T L  E+
Sbjct: 347 LK-----ELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEK 388


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 35.4 bits (82), Expect = 0.12
 Identities = 53/269 (19%), Positives = 97/269 (36%), Gaps = 49/269 (18%)

Query: 298 LTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLE-LREIVELTPSDHSHAREA 356
           L  EQQ  + K   +  +    ++ E +L   +  L +LLE  +  +E    D    R  
Sbjct: 661 LQNEQQSLKDK--LELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTE 718

Query: 357 FKSMGINNIQDLNKSINQLQQKIEK--------TKAKIIAYNN---GEDLTEEPKAKLSK 405
             +       +L+  + QL   IE          K     Y+      D+      +L +
Sbjct: 719 RLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKELKR 778

Query: 406 EI----------AVPESEA-EFKAWLIETKKKRAYIIDKKNARKQR-----------RQD 443
           +I          AV   E  E++A++ ET   R  + +++     +           +Q+
Sbjct: 779 QIEELETTIERIAVRRPEVREYRAFMQETWLHRDSLREERPNLAIQLRELESSAEELQQE 838

Query: 444 LAKRRTAAAQERMRLISELARKEKRDDDF-----GMRDEDWDVYKVINKDAGDTDSEEEQ 498
           L +        R +L  E    EK+ D       G+RDE     + + +     ++ + +
Sbjct: 839 LTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRGLRDE----MRQLAELKEPANANQAE 894

Query: 499 ----ERLIELEEILRQHDPEFTSLNQEQE 523
               ERL +LEE  R+       L +  E
Sbjct: 895 GSISERLDQLEEFKRKRKRLSGDLKKFLE 923



 Score = 29.7 bits (67), Expect = 7.8
 Identities = 51/257 (19%), Positives = 95/257 (36%), Gaps = 37/257 (14%)

Query: 300 TEQQKDRRKELAKKLVEMNAK----KREERLVDD--ERHLNELLELR----EIVELTPSD 349
            +Q K+ R EL ++L   NAK    + E  L++D      +  +E      + +    S+
Sbjct: 301 EDQLKEARDELNQELSAANAKLAADRSELELLEDQKGAFEDADIEQLQADLDQLPSIRSE 360

Query: 350 HSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNN-GEDLTEEPKAKLSKEIA 408
                    ++     QD+ +   +L+QKI++   + +  NN       E K +    I 
Sbjct: 361 LEEVEARLDAL-TGKHQDVQRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIE 419

Query: 409 VPES--EAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKE 466
                 E++ +  L   K +     +++   + R   L +R  +A         EL + E
Sbjct: 420 EDLQALESQLRQQLEAGKLEFN---EEEYELELRLGRLKQRLDSATATP----EELEQLE 472

Query: 467 KRDDDFGMRDEDWDVYKVINKDAG------DTDSEEEQERL----IELEEILRQHDPEFT 516
             D+      E+ +  +   +             +E  E L      L + LRQ   E  
Sbjct: 473 INDEALEKAQEEQEQAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQ-LRQALDEL- 530

Query: 517 SLNQEQELSPKEANQLH 533
               E +LSP+  + LH
Sbjct: 531 ----ELQLSPQAGSLLH 543


>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein.  The Bacillus subtilis divIVA1
           mutation causes misplacement of the septum during cell
           division, resulting in the formation of small, circular,
           anucleate mini-cells. Inactivation of divIVA produces a
           mini-cell phenotype, whereas overproduction of DivIVA
           results in a filamentation phenotype. These proteins
           appear to contain coiled-coils.
          Length = 131

 Score = 33.4 bits (77), Expect = 0.14
 Identities = 23/107 (21%), Positives = 44/107 (41%), Gaps = 11/107 (10%)

Query: 362 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE-FKAWL 420
           I + + L K   +L+++IE+ + K+  Y   E+        L K + V +  AE  KA  
Sbjct: 31  IKDYEALYKENEELKEEIERLEEKLEEYKELEE-------TLQKTLVVAQETAEEVKA-- 81

Query: 421 IETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEK 467
              +K+   II +  A+ +R  + A             +   +R  +
Sbjct: 82  -NAQKEAELIIKEAEAKAERIVNDANEEAKKLATEYDELKRQSRLFR 127


>gnl|CDD|234792 PRK00558, uvrC, excinuclease ABC subunit C; Validated.
          Length = 598

 Score = 34.7 bits (81), Expect = 0.16
 Identities = 22/96 (22%), Positives = 39/96 (40%), Gaps = 20/96 (20%)

Query: 277 AEFYDSNVV--KVQLPYAVPVPNLTTEQQKDRRKELA-------KKLVEM---NAK-KRE 323
            +FY    +  ++ +P+ +    L       R+ ++        K+L+E+   NA+   E
Sbjct: 298 GQFYLQKSIPKEILVPHELDDEELAEALLAGRKVKIRVPQRGDKKELLELAEKNAREALE 357

Query: 324 ERLVDDERHLNELLELREIVEL--TPS-----DHSH 352
            +L D       L EL E++ L   P      D SH
Sbjct: 358 RKLSDQSTQQQALEELAELLGLPEPPYRIECFDISH 393


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 34.6 bits (80), Expect = 0.19
 Identities = 22/133 (16%), Positives = 47/133 (35%), Gaps = 3/133 (2%)

Query: 378 KIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNAR 437
           K    +   +       + +   A  S         A   A      KK+       + +
Sbjct: 46  KKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKK 105

Query: 438 KQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYK---VINKDAGDTDS 494
            +++  L K       + + ++++    +  DDD  + D+D D        ++D  D D 
Sbjct: 106 AEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDV 165

Query: 495 EEEQERLIELEEI 507
           ++E E   E +E+
Sbjct: 166 DDEDEEKKEAKEL 178


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 34.3 bits (79), Expect = 0.21
 Identities = 30/117 (25%), Positives = 54/117 (46%), Gaps = 21/117 (17%)

Query: 366 QDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE-AEFKAWLIET- 423
              +K + Q+Q++I   + KI          ++ +AKL K++   E+E A  +A LIET 
Sbjct: 34  AADDKQLKQIQKEIAALEKKI-------REQQDQRAKLEKQLKSLETEIASLEAQLIETA 86

Query: 424 ------KKKRAYIIDKKNA----RKQRRQDLAKRRTAAAQERMRLISELARKEKRDD 470
                 +K+ A +  + NA     +++R+ LA+    AA +R       A     +D
Sbjct: 87  DDLKKLRKQIADLNARLNALEVQEREQRRRLAE--QLAALQRSGRNPPPALLVSPED 141



 Score = 32.4 bits (74), Expect = 0.73
 Identities = 29/175 (16%), Positives = 62/175 (35%), Gaps = 10/175 (5%)

Query: 302 QQKDRRKELAKKLVEMNA--KKREERLVDDERHLNELLELREIVELTPS----DHSHARE 355
           +  D  K+L K++ ++NA     E +  +  R L E L   +     P           +
Sbjct: 84  ETADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQ 143

Query: 356 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 415
               + I     LN +  +    ++ T  ++ A     ++  E     +        +A+
Sbjct: 144 RSVRLAIY-YGALNPARAERIDALKATLKQLAAV--RAEIAAEQAELTTLLSEQRAQQAK 200

Query: 416 FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARK-EKRD 469
               L E KK  A +  + +A +++ ++L    +    E     +  A+  E   
Sbjct: 201 LAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAA 255



 Score = 30.5 bits (69), Expect = 3.0
 Identities = 33/189 (17%), Positives = 66/189 (34%), Gaps = 21/189 (11%)

Query: 292 AVPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHS 351
           AV     +      + K++ K++    A +++ R   D+R   +L +  + +E   +   
Sbjct: 25  AVLAAAFSAAADDKQLKQIQKEI---AALEKKIREQQDQR--AKLEKQLKSLETEIASLE 79

Query: 352 HAREAFKSMGIN----NIQDLNKSINQLQQKIEKTK---AKIIAYNNGEDLTEEPKAKLS 404
                     +      I DLN  +N L+ +  + +   A+ +A          P   +S
Sbjct: 80  AQLIETA-DDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVS 138

Query: 405 KEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRL---ISE 461
                PE         I          ++ +A K   + LA  R   A E+  L   +SE
Sbjct: 139 -----PEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSE 193

Query: 462 LARKEKRDD 470
              ++ +  
Sbjct: 194 QRAQQAKLA 202


>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A. 
          Length = 427

 Score = 34.4 bits (80), Expect = 0.22
 Identities = 31/132 (23%), Positives = 51/132 (38%), Gaps = 26/132 (19%)

Query: 305 DRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINN 364
           + R E+ K+  E   +K EERL   E  L  L ++  ++E+                I  
Sbjct: 315 EHRLEVYKRRKEYLLEKLEERLHILEGLLKALNKIDFVIEV----------------IRG 358

Query: 365 IQDLNKSINQLQQKIEKTKAKIIA----YNNGEDLTEEPKAKLSKEIAVPESEAEFKAWL 420
             DL K+  +L +++ + +A  +           LT+E   KL KEI   E E E     
Sbjct: 359 SIDLKKAKKELIEELSEIQADYLLDMRLRR----LTKEEIEKLEKEIE--ELEKEIAELE 412

Query: 421 IETKKKRAYIID 432
                ++   I 
Sbjct: 413 KILASEKKLWIK 424


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 34.2 bits (79), Expect = 0.24
 Identities = 30/177 (16%), Positives = 69/177 (38%), Gaps = 20/177 (11%)

Query: 293 VPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHL-NELLEL-------REIVE 344
           +P  +  +E+  +   +      E    K  + LV+    L  ELL+        +E+V 
Sbjct: 84  IPRQSGLSEEVAEAEAKATSVAAEATTPKSIQELVEALEELLEELLKETASDPVVQELVS 143

Query: 345 LTPS--DHSHAREAFKSMG------INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLT 396
           +     D          +          +  L+K + +L+ + E+   + +     E L 
Sbjct: 144 IFNDLIDSIKEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELL- 202

Query: 397 EEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQ 453
            + + +L   +   E+  E K   +E ++++  +  +K   ++ RQ+L ++  A  Q
Sbjct: 203 SKLEEELLARLESKEAALE-KQLRLEFEREKEEL--RKKYEEKLRQELERQAEAHEQ 256


>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
           factor dissociates the posttermination complex, composed
           of the ribosome, deacylated tRNA, and mRNA, after
           termination of translation.  Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.  RRF is believed to bind the ribosome at the
           A-site in a manner that mimics tRNA, but the specific
           mechanisms remain unclear.  RRF is essential for
           bacterial growth.  It is not necessary for cell growth
           in archaea or eukaryotes, but is found in mitochondria
           or chloroplasts of some eukaryotic species.
          Length = 179

 Score = 33.0 bits (76), Expect = 0.29
 Identities = 18/96 (18%), Positives = 41/96 (42%), Gaps = 13/96 (13%)

Query: 293 VPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSH 352
           V +P LT E    RRKEL K   ++  + +           +++ +L +  E++  +   
Sbjct: 96  VNLPPLTEE----RRKELVKDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKK 151

Query: 353 AREAFKSMGINNIQDLNK-SINQLQQKIEKTKAKII 387
           A E         +Q L    I ++ + ++  + +++
Sbjct: 152 AEED--------LQKLTDEYIKKIDELLKSKEKELL 179


>gnl|CDD|151681 pfam11239, DUF3040, Protein of unknown function (DUF3040).  Some
           members in this family of proteins with unknown function
           are annotated as membrane proteins however this cannot
           be confirmed.
          Length = 82

 Score = 31.0 bits (71), Expect = 0.34
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 494 SEEEQERLIELEEILRQHDPEFTS 517
           SE EQ RL E+E  L   DP+F +
Sbjct: 4   SEHEQRRLEEIERALYAEDPKFAA 27


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 32.3 bits (74), Expect = 0.37
 Identities = 24/99 (24%), Positives = 44/99 (44%), Gaps = 19/99 (19%)

Query: 368 LNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKR 427
           + K++++ Q KI             +DL E  + K   +  + E E E    L E +++ 
Sbjct: 31  ILKALDERQAKIA------------DDLAEAERLKEEAQALLAEYEQE----LEEAREQA 74

Query: 428 AYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKE 466
           + II+   A+K+  Q   + +  A +E  R+  E A  E
Sbjct: 75  SEIIE--QAKKEAEQIAEEIKAEAEEELERIK-EAAEAE 110


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 33.5 bits (76), Expect = 0.38
 Identities = 46/228 (20%), Positives = 80/228 (35%), Gaps = 32/228 (14%)

Query: 302 QQKDRRKELAKKLVEMNAKKREERLVDDERHL--NELLELREIVELTPSDHSHAREAFKS 359
           Q  D R +L     E+   + E+R  + ER    +EL + R+  E    + + AR+    
Sbjct: 75  QLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAK 134

Query: 360 MGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAW 419
                 Q+L +   Q Q    + K           L  + ++  + +  +  S  + K+ 
Sbjct: 135 A----QQELARLTKQAQDLQTRLKTLAEQR---RQLEAQAQSLQASQKQLQASATQLKSQ 187

Query: 420 LIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDW 479
           +++ K + A I       +Q  Q+LA R  AA      L    A  ++       RD   
Sbjct: 188 VLDLKLRSAQI-------EQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDA-- 238

Query: 480 DVYKVINKD----AGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQE 523
              ++  K     A      E + +L  LE            L QE  
Sbjct: 239 ---QISQKAQQIAARAEQIRERERQLQRLET-------AQARLEQEVA 276


>gnl|CDD|218852 pfam06008, Laminin_I, Laminin Domain I.  coiled-coil structure. It
           has been suggested that the domains I and II from
           laminin A, B1 and B2 may come together to form a triple
           helical coiled-coil structure.
          Length = 263

 Score = 33.1 bits (76), Expect = 0.41
 Identities = 34/167 (20%), Positives = 67/167 (40%), Gaps = 31/167 (18%)

Query: 292 AVPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELL----ELREIVELTP 347
           A+P P       ++  ++L+ +L  ++A KR  +L D E+ L++L      L++      
Sbjct: 14  ALPAPYKILYNIENLTQQLSGRLSPISATKR--QLEDLEKALSKLAQDAENLQKKANQAL 71

Query: 348 SDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEI 407
           +       A +   + + + L ++I  L+  I++   K+     GE+    P + LS+ +
Sbjct: 72  AKAQKVNAAIERT-LGHAKTLAEAIKNLRDNIKEILEKLAGL--GENAFALPGSALSRML 128

Query: 408 AVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQE 454
           A    EA+     I                  R +D  K+   A  E
Sbjct: 129 A----EAQRMLGEI------------------RERDFQKQLQNAEAE 153


>gnl|CDD|217630 pfam03592, Terminase_2, Terminase small subunit.  Packaging of
           double-stranded viral DNA concatemers requires
           interaction of the prohead with virus DNA. This process
           is mediated by a phage-encoded DNA recognition and
           terminase protein. The terminase enzymes described so
           far, which are hetero-oligomers composed of a small and
           a large subunit, do not have a significant level of
           sequence homology. The small terminase subunit is
           thought to form a nucleoprotein structure that helps to
           position the terminase large subunit at the packaging
           initiation site.
          Length = 141

 Score = 31.9 bits (73), Expect = 0.43
 Identities = 26/115 (22%), Positives = 40/115 (34%), Gaps = 24/115 (20%)

Query: 424 KKKRAYIIDKKNARK--------QRRQDLAKRRTAAAQERMRLISELAR---KEKRDDDF 472
            KK A  I  +   K        +  ++  KR    A E +  ++E+AR    E      
Sbjct: 28  SKKTARSIGSELLTKPKIKAYIDELMKEREKRTIITADEVLEELTEIARGDETELVLVGT 87

Query: 473 GMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFT-SLNQEQELSP 526
              + +    KV  KD           RL  LE + + H   FT  +  E E+  
Sbjct: 88  KEGEVEKIEKKVKLKD-----------RLKALELLGK-HYGLFTDKVEVEGEVKK 130


>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
           chaperones [Posttranslational modification, protein
           turnover, chaperones].
          Length = 786

 Score = 33.4 bits (77), Expect = 0.43
 Identities = 23/123 (18%), Positives = 51/123 (41%), Gaps = 19/123 (15%)

Query: 394 DLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKK--NARKQRRQDLAKRRTAA 451
           DL +E  A++  EI  PE   E +  L + + ++  +  ++    +K   + +  +    
Sbjct: 384 DLLDEAGARVRLEIDKPEELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRI 443

Query: 452 AQERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAG---DTDSEEEQERLIELEEIL 508
            +    L +E+                 D+ +V+ +  G       E+E+E+L+ LE  L
Sbjct: 444 PELEKELEAEVDED--------------DIAEVVARWTGIPVAKLLEDEKEKLLNLERRL 489

Query: 509 RQH 511
           ++ 
Sbjct: 490 KKR 492


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 33.0 bits (75), Expect = 0.44
 Identities = 36/132 (27%), Positives = 63/132 (47%), Gaps = 9/132 (6%)

Query: 276 DAEFYDSNVVKVQLPYAVPVPNLTTEQQ-------KDRRKELAKKLVEMNAKKREERLVD 328
           D E  D  +V+ +L  A+    L T Q+       K+  +EL +KL E+  +K EE L +
Sbjct: 99  DVEHIDVELVRKELKNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEK-EELLKE 157

Query: 329 DERHLNELLELREIVELTPSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIA 388
            E    E  E++E ++    ++S   E  K +    + DL K  ++L+  +E  + ++I+
Sbjct: 158 LEELEAEYEEVQERLKRLEVENSRLEEMLKKL-PGEVYDLKKRWDELEPGVELPEEELIS 216

Query: 389 YNNGEDLTEEPK 400
               E L   PK
Sbjct: 217 DLVKETLNLAPK 228


>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator.  DivIC from B.
           subtilis is necessary for both vegetative and
           sporulation septum formation. These proteins are mainly
           composed of an amino terminal coiled-coil.
          Length = 80

 Score = 30.6 bits (70), Expect = 0.49
 Identities = 8/36 (22%), Positives = 17/36 (47%)

Query: 363 NNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEE 398
             I  L   + +L+ + E+ +A++    +  D  EE
Sbjct: 24  QEIAALQAELAKLKAENEELEAEVKDLKSDPDYIEE 59


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 31.1 bits (71), Expect = 0.63
 Identities = 22/103 (21%), Positives = 42/103 (40%), Gaps = 22/103 (21%)

Query: 301 EQQKDRRKELAKKLVEMNAK-KREERLVDDERHLNELLELREIVELTPSDHSHAREAFKS 359
           E++++ +K  A+KL +M+A+ K E  L   E+ L EL       EL     + A++    
Sbjct: 25  EKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELK----AEAKKMLSE 80

Query: 360 MGIN-----------------NIQDLNKSINQLQQKIEKTKAK 385
            G+                  N++   K  ++  +   K + K
Sbjct: 81  KGLPVELLDLVVGEDAEETKANVKAFKKLFDKAVEAGVKERLK 123


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 33.2 bits (75), Expect = 0.66
 Identities = 48/225 (21%), Positives = 85/225 (37%), Gaps = 19/225 (8%)

Query: 298  LTTEQQKDRRKELAKKLVEM----NAKKREERLVDDERHLNELLELREIVELTPSDHSHA 353
               E+ K +     KK  E      AKK+ E   +D++  +EL +     +        A
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAE---EDKKKADELKKAAAAKKKADEAKKKA 1427

Query: 354  REAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE 413
             E  K+       +  K  ++ ++K E+ K    A    E+  +  +AK   E A    E
Sbjct: 1428 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADE 1487

Query: 414  AEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARK-------- 465
            A+ KA   E  KK+A    K    K++  +  K   A   +  +   E A+K        
Sbjct: 1488 AKKKA---EEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEE-AKKADEAKKAE 1543

Query: 466  EKRDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQ 510
            EK+  D   + E+    +   K      +EE++   +   E  ++
Sbjct: 1544 EKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588



 Score = 30.5 bits (68), Expect = 4.2
 Identities = 44/218 (20%), Positives = 87/218 (39%), Gaps = 32/218 (14%)

Query: 301  EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
            E++K  + E AKK  E   K  E +  ++E+   E L+ +E  E        A E  K+ 
Sbjct: 1602 EEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEK-----KKAEELKKAE 1656

Query: 361  GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWL 420
              N I+       +  +K E+ K K       E+  +  + +     A+ +   E     
Sbjct: 1657 EENKIKA-----AEEAKKAEEDKKK------AEEAKKAEEDEKKAAEALKKEAEE----- 1700

Query: 421  IETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRD----DDFGMRD 476
                 K+A  + KK A ++++ +  K+   A +E      E  ++ + D    ++    +
Sbjct: 1701 ----AKKAEELKKKEAEEKKKAEELKK---AEEENKIKAEEAKKEAEEDKKKAEEAKKDE 1753

Query: 477  EDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPE 514
            E+      + K+      E  +E+   +EE L + D +
Sbjct: 1754 EEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEK 1791



 Score = 30.5 bits (68), Expect = 4.3
 Identities = 44/202 (21%), Positives = 82/202 (40%), Gaps = 15/202 (7%)

Query: 302  QQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMG 361
            +Q   + E A+K  E+  KK EE+   DE    E  E ++  E        A EA K+  
Sbjct: 1268 RQAAIKAEEARKADEL--KKAEEKKKADEAKKAE--EKKKADEAK----KKAEEAKKA-- 1317

Query: 362  INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLI 421
                 +  K   + ++K +  K K        +  +      + E    E +AE      
Sbjct: 1318 ----DEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373

Query: 422  ETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDV 481
            E  KK+A    KK A ++++ D AK++    +++   + + A  +K+ D+   + E+   
Sbjct: 1374 EEAKKKADAA-KKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432

Query: 482  YKVINKDAGDTDSEEEQERLIE 503
                 K A +    +E ++  E
Sbjct: 1433 ADEAKKKAEEAKKADEAKKKAE 1454


>gnl|CDD|225123 COG2213, MtlA, Phosphotransferase system, mannitol-specific IIBC
           component [Carbohydrate transport and metabolism].
          Length = 472

 Score = 32.7 bits (75), Expect = 0.67
 Identities = 24/114 (21%), Positives = 49/114 (42%), Gaps = 14/114 (12%)

Query: 298 LTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIV-----ELTPSDH-- 350
            +  + +D  +    K+ EM  + + +         N+L  +++I+      +  S    
Sbjct: 337 TSKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKKIIFACDAGMGSSAMGA 396

Query: 351 SHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLS 404
           S  R+  K+ G+N+I   N +IN L        A ++  +  +DLT+  K K+ 
Sbjct: 397 SILRKKLKNAGLNDISVTNYAINNL-----PADADLVITH--KDLTDRAKRKVP 443


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 32.7 bits (75), Expect = 0.71
 Identities = 37/181 (20%), Positives = 78/181 (43%), Gaps = 22/181 (12%)

Query: 318 NAKKRE-ERLVDDERHLNELLELRE-------IVELTPSDHSHAREAFKSMGINNI--QD 367
           NA   + + ++DD   L +L E RE        VE   SD   + +  K+  I     +D
Sbjct: 145 NATPSDRQDMIDDLLQLGKLEEYRERASDARLGVERVLSDQRGSLDQLKAQ-IEEKEEKD 203

Query: 368 LNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLS----------KEIAVPESE-AEF 416
           L++ +N L+ ++ +   +I  Y    +   E + +            +E+   E+E  + 
Sbjct: 204 LHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDL 263

Query: 417 KAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRD 476
           +  + ET+++R  + ++    ++R ++L + R     E     ++    E R ++   RD
Sbjct: 264 RETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRD 323

Query: 477 E 477
           E
Sbjct: 324 E 324


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 32.6 bits (75), Expect = 0.75
 Identities = 31/147 (21%), Positives = 63/147 (42%), Gaps = 24/147 (16%)

Query: 297 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREA 356
           NL  E        L  K+V + +++ E+RL  +E  +  L  L E+  L+P       E 
Sbjct: 736 NLRAEMNLSPSAPL--KVVLVGSEELEDRLEANEDDIKGLANLEELEILSPDPEEPPVEV 793

Query: 357 FKSMGI--------------NNIQDLNKSINQLQQKIEKTKAKIIAYNNGED-LTEEPKA 401
              +G                 +  L K + +L+++I++ + K+    + E  + + P+ 
Sbjct: 794 TAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKL----SNEGFVAKAPEE 849

Query: 402 KLSKEIAVPESEAEFKAWLIETKKKRA 428
            + KE    E  AE++  L + +++ A
Sbjct: 850 VVEKEK---EKLAEYQVKLAKLEERLA 873


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
           sigma54-dependent transcription [Transcription / Signal
           transduction mechanisms].
          Length = 225

 Score = 31.9 bits (73), Expect = 0.76
 Identities = 25/98 (25%), Positives = 42/98 (42%), Gaps = 11/98 (11%)

Query: 299 TTEQQKDRRKELAKKLVEM--NAKKREER-----LVDDERHLNELLELREIVELTPSDHS 351
              Q   R+K+L +KL E    A+K EE+        +E    E LE ++ +E       
Sbjct: 46  ALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGNEDLAREALEEKQSLEDLAKALE 105

Query: 352 HAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAY 389
              +  +      ++ L K +  L+QKI + +AK  A 
Sbjct: 106 AELQQAEEQ----VEKLKKQLAALEQKIAELRAKKEAL 139


>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH.  This
           bacillus clade of FliH proteins is not found by the Pfam
           FliH model pfam02108, but is closely related to the
           sequences identified by that model. Sequences identified
           by this model are observed in flagellar operons in an
           analogous position relative to other flagellar operon
           genes.
          Length = 255

 Score = 31.7 bits (72), Expect = 0.93
 Identities = 33/149 (22%), Positives = 58/149 (38%), Gaps = 20/149 (13%)

Query: 368 LNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKR 427
           ++  I   +  I   +  II      D+ E  + +L+ E    E   E      + + + 
Sbjct: 1   MSNIIKSTKSVIPAQERGIIPLRQVTDVPET-EQELADEEQEFEQILE------KAEAEA 53

Query: 428 AYIIDKKNAR-KQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGM-RDEDWDVYKVI 485
           A II++  A+    R+ + + R    +ER RLI E A++E  +  F     E   +Y+  
Sbjct: 54  AQIIEQAEAQAAAIREQIEQERAQWEEERERLIQE-AKQEGYEAGFQAGESEALSIYQST 112

Query: 486 NKDAGDTDS----------EEEQERLIEL 504
             +A               E  Q  +IEL
Sbjct: 113 IDEANAIVEEAKDDYEEKIESAQPLIIEL 141


>gnl|CDD|237358 PRK13351, PRK13351, elongation factor G; Reviewed.
          Length = 687

 Score = 32.2 bits (74), Expect = 1.1
 Identities = 22/86 (25%), Positives = 30/86 (34%), Gaps = 5/86 (5%)

Query: 483 KVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQLHIGVERMCGP 542
            V      +   EE +E   +L E L + D E   L  E E    E  QL   +      
Sbjct: 192 TVEEGPIPEELLEEVEEAREKLIEALAEFDDELLELYLEGEELSAE--QLRAPLREGTR- 248

Query: 543 ECLFQPSMLGSIQ--AGISETLNFVL 566
                P + GS     GI   L+ V+
Sbjct: 249 SGHLVPVLFGSALKNIGIEPLLDAVV 274


>gnl|CDD|239776 cd04243, AAK_AK-HSDH-like, AAK_AK-HSDH-like: Amino Acid Kinase
           Superfamily (AAK), AK-HSDH-like; this family includes
           the N-terminal catalytic domain of aspartokinase (AK) of
           the bifunctional enzyme AK- homoserine dehydrogenase
           (HSDH). These aspartokinases are found in such bacteria
           as E. coli (AKI-HSDHI, ThrA  and  AKII-HSDHII, MetL) and
           in higher plants (Z. mays AK-HSDH). AK and HSDH are the
           first and third enzymes in the biosynthetic pathway of
           the aspartate family of amino acids. AK catalyzes the
           phosphorylation of Asp to P-aspartyl phosphate. HSDH
           catalyzes the NADPH-dependent conversion of Asp
           3-semialdehyde to homoserine. ThrA and MetL are involved
           in threonine and methionine biosynthesis, respectively.
           In E. coli, ThrA is subject to allosteric regulation by
           the end product L-threonine and the native enzyme is
           reported to be tetrameric. As with bacteria, plant AK
           and HSDH are feedback inhibited by pathway end products.
           Maize AK-HSDH is a Thr-sensitive 180-kD enzyme.
           Arabidopsis AK-HSDH is an alanine-activated,
           threonine-sensitive enzyme whose ACT domains, located
           C-terminal to the AK catalytic domain, were shown to be
           involved in allosteric activation. Also included in this
           CD is the catalytic domain of the aspartokinase (AK) of
           the lysine-sensitive aspartokinase isoenzyme AKIII, a
           monofunctional class enzyme (LysC) found in some
           bacteria such as E. coli. In E. coli, LysC is reported
           to be a homodimer of 50 kD subunits. Also included in
           this CD is  the catalytic domain of aspartokinase (AK)
           of the bifunctional enzyme AK - DAP decarboxylase
           (DapDC) found in some bacteria. DapDC, which is the lysA
           gene product, catalyzes the decarboxylation of DAP to
           lysine.
          Length = 293

 Score = 31.8 bits (73), Expect = 1.1
 Identities = 21/87 (24%), Positives = 41/87 (47%), Gaps = 8/87 (9%)

Query: 309 ELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINNIQDL 368
            LA+     +  +        ERHL+ + EL     L+    +    A  S+ +  ++DL
Sbjct: 48  ALAELAASGDDAQAIVLQEIRERHLDLIKEL-----LSGESAAELLAALDSL-LERLKDL 101

Query: 369 NKSINQLQQKIEKTKAKIIAYNNGEDL 395
            + I  L +  +KT+A+++++  GE L
Sbjct: 102 LEGIRLLGELSDKTRAEVLSF--GELL 126


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 31.9 bits (73), Expect = 1.2
 Identities = 22/147 (14%), Positives = 47/147 (31%), Gaps = 15/147 (10%)

Query: 297 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREA 356
            +      DR +EL  +++E   ++ +   +  E  L    EL  I E          + 
Sbjct: 794 GINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQ--------KN 845

Query: 357 FKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEF 416
            K      + ++ ++  ++Q   E         N   +   +PK      IA        
Sbjct: 846 LKQEIKLELSEIQEAEEEIQNINEN-------KNEFVEFKNDPKKLNKLIIAKDVLIKLV 898

Query: 417 KAWLIETKKKRAYIIDKKNARKQRRQD 443
            +     + ++     KK+  K   + 
Sbjct: 899 ISSDEIKQDEKTTKKKKKDLEKTDEEA 925


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 31.8 bits (73), Expect = 1.4
 Identities = 34/130 (26%), Positives = 61/130 (46%), Gaps = 14/130 (10%)

Query: 411 ESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDD 470
            +E   K  +IE  KK A  + KK A  + ++++ K R    +E     +EL R E+R  
Sbjct: 29  SAEELAKR-IIEEAKKEAETL-KKEALLEAKEEVHKLRAELERELKERRNELQRLERRLL 86

Query: 471 DFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQE------- 523
               R+E  D  K+ + D  + + E++++ L   E+ L + + E   L  EQ        
Sbjct: 87  Q---REETLD-RKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERIS 142

Query: 524 -LSPKEANQL 532
            L+ +EA ++
Sbjct: 143 GLTQEEAKEI 152


>gnl|CDD|234275 TIGR03599, YloV, DAK2 domain fusion protein YloV.  This model
           describes a protein family that contains an N-terminal
           DAK2 domain (pfam02734), so named because of similarity
           to the dihydroxyacetone kinase family family. The
           GTP-binding protein CgtA (a member of the obg family) is
           a bacterial GTPase associated with ribosome biogenesis,
           and it has a characteristic extension (TIGR03595) in
           certain lineages. This protein family described here was
           found, by the method of partial phylognetic profiling,
           to have a phylogenetic distribution strongly correlated
           to that of TIGR03595. This correlation implies some form
           of functional coupling.
          Length = 530

 Score = 31.8 bits (73), Expect = 1.5
 Identities = 16/47 (34%), Positives = 23/47 (48%), Gaps = 6/47 (12%)

Query: 353 AREAFKSMGINNI----QDLNKSINQLQQKIEKTKAK--IIAYNNGE 393
             E FKS+G + +    Q +N S   + + IEK  AK  I+  NN  
Sbjct: 326 IAELFKSLGADVVIEGGQTMNPSTEDILKAIEKVNAKNVIVLPNNKN 372


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 31.4 bits (72), Expect = 1.6
 Identities = 28/174 (16%), Positives = 70/174 (40%), Gaps = 13/174 (7%)

Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
            ++K+ R+E    L E   ++ + R  + E  L E  ++ EI+E    +     +  +  
Sbjct: 63  RKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREK 122

Query: 361 GINNIQDLNKSINQLQQKIEKTKA-------KIIAYNNGEDLTEEPKAKLSKEIAVPESE 413
                +++++   +  ++ E+ K        KI+ Y   +   EE +    +E      +
Sbjct: 123 QKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREAERRER-----K 177

Query: 414 AEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEK 467
            E +  +   + ++    D++    + R DL +      +ER +   E  ++ +
Sbjct: 178 EEKEREVARLRAQQEEAEDEREELDELRADLYQEE-YERKERQKEKEEAEKRRR 230


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 31.2 bits (71), Expect = 1.6
 Identities = 15/47 (31%), Positives = 23/47 (48%)

Query: 422 ETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKR 468
           E ++KR  I + K A  QRR DL+       + R   + +L  + KR
Sbjct: 86  EIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKR 132



 Score = 29.3 bits (66), Expect = 7.7
 Identities = 30/135 (22%), Positives = 61/135 (45%), Gaps = 13/135 (9%)

Query: 333 LNELLELREIVE--LTPSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYN 390
           L E  EL++ VE  L  + +   + A   +    +    + +NQ++ +I + K +I    
Sbjct: 33  LLENEELKQKVEEALEGATNEDGKLA-ADLLKLEVARKKERLNQIRARISQLKEEIEQ-- 89

Query: 391 NGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKK-NARKQRRQDLAKRRT 449
                  E   +L + +A   S+    ++  + +K+RA  ++K  +  K+ R  L    +
Sbjct: 90  -----KRERIEELKRALAQRRSDLSSASY--QLEKRRASQLEKLQDEIKRTRSKLNALHS 142

Query: 450 AAAQERMRLISELAR 464
             A++R  L  ELA+
Sbjct: 143 LLAEKRSFLCRELAK 157


>gnl|CDD|237301 PRK13200, psaA, photosystem I P700 chlorophyll a apoprotein A1;
           Provisional.
          Length = 766

 Score = 31.3 bits (71), Expect = 1.9
 Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 4/48 (8%)

Query: 610 HFSVSLAENPELDAWSGARQFSLSENFHDFAVTQSDYQEKGGEFFRVH 657
           HF  +LA+ P+   W     ++L  + HDF    SD +E   + F  H
Sbjct: 33  HFDRTLAKGPKTTTWI----WNLHADAHDFDTHTSDLEEVSRKIFSAH 76


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 30.8 bits (70), Expect = 2.2
 Identities = 22/90 (24%), Positives = 32/90 (35%), Gaps = 10/90 (11%)

Query: 364 NIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIET 423
              +L   +N L  +  +  A I A    E      KA L ++ A+ E+ A       E 
Sbjct: 177 LQNELETQLNSLNSQKAEKNALIAALAAKEASALGEKAALEEQKALAEAAAA------EA 230

Query: 424 KKKRAYIIDKKNARKQRRQDLAKRRTAAAQ 453
            K+ A       A  Q +  L    TAA  
Sbjct: 231 AKQEA----AAKAAAQEQAALQAAATAAQP 256



 Score = 29.7 bits (67), Expect = 5.1
 Identities = 15/50 (30%), Positives = 26/50 (52%), Gaps = 7/50 (14%)

Query: 363 NNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPES 412
           N I+ L+  I ++Q KI++ + +I       D ++    KL KEIA  + 
Sbjct: 52  NEIESLDNQIEEIQSKIDELQKEI-------DQSKAEIKKLQKEIAELKE 94


>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
          Length = 1320

 Score = 31.4 bits (71), Expect = 2.3
 Identities = 57/234 (24%), Positives = 95/234 (40%), Gaps = 54/234 (23%)

Query: 307  RKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINNIQ 366
            R EL         +K E   +D E+   E  EL+E +++    H+   E +  +   +IQ
Sbjct: 1071 RTELDASRALAEKQKHE---LDTEKRCAE--ELKEAMQMAMEGHARMLEQYADLEEKHIQ 1125

Query: 367  DLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE------AEFKAWL 420
             L +   ++Q+ I+  K                KA     +   ES+      AE  A  
Sbjct: 1126 LLARH-RRIQEGIDDVK----------------KAAARAGVRGAESKFINALAAEISALK 1168

Query: 421  IETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISE---LARKEKRDDDFGMRDE 477
            +E +K+R Y+ D+  + + + +D A+   AA +  +RL      L   +KR  D     E
Sbjct: 1169 VEREKERRYLRDENKSLQAQLRDTAEAVQAAGELLVRLKEAEEALTVAQKRAMD--AEQE 1226

Query: 478  DWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELS--PKEA 529
              + YK I+K                   + R+H+ E ++LNQ    S  PKEA
Sbjct: 1227 AAEAYKQIDK-------------------LKRKHENEISTLNQLVAESRLPKEA 1261


>gnl|CDD|212668 cd10225, MreB_like, MreB and similar proteins.  MreB is a bacterial
           protein which assembles into filaments resembling those
           of eukaryotic F-actin. It is involved in determining the
           shape of rod-like bacterial cells, by assembling into
           large fibrous spirals beneath the cell membrane. MreB
           has also been implicated in chromosome segregation;
           specifically MreB is thought to bind to and segregate
           the replication origin of bacterial chromosomes.
          Length = 320

 Score = 30.9 bits (71), Expect = 2.4
 Identities = 11/47 (23%), Positives = 18/47 (38%), Gaps = 8/47 (17%)

Query: 577 LANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDA 623
           L   I +TG    L G  E ++++            V +AE+P    
Sbjct: 270 LDRGIVLTGGGALLRGLDELISEET--------GLPVRVAEDPLTCV 308


>gnl|CDD|216689 pfam01765, RRF, Ribosome recycling factor.  The ribosome recycling
           factor (RRF / ribosome release factor) dissociates the
           ribosome from the mRNA after termination of translation,
           and is essential bacterial growth. Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.
          Length = 165

 Score = 30.2 bits (69), Expect = 2.4
 Identities = 27/94 (28%), Positives = 41/94 (43%), Gaps = 21/94 (22%)

Query: 293 VPVPNLTTEQQKDRRKELAKKLVEM--NAKK--REERLVDDERHLNELLELREIVELTPS 348
           +P+P LT E    RRKEL K+  ++   AK   R  R     R  N+ L+  E  +    
Sbjct: 82  IPIPPLTEE----RRKELVKQAKKLAEEAKVAIRNIR-----RDANDKLKKLEKDKEISE 132

Query: 349 DHSHAREAFKSMGINNIQDL-NKSINQLQQKIEK 381
           D     E         IQ L +K I ++ + ++K
Sbjct: 133 DEVKRAEK-------EIQKLTDKYIKKIDELLKK 159


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 31.0 bits (70), Expect = 2.4
 Identities = 43/184 (23%), Positives = 70/184 (38%), Gaps = 36/184 (19%)

Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
           +    + +E  KKL +   +  ++R  +  R      EL +           A +A K  
Sbjct: 60  KPAAKKEQERQKKLEQQAEEAEKQRAAEQAR----QKELEQRAA--------AEKAAK-- 105

Query: 361 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKL-------SKEIAVPESE 413
                +   K   + Q++ E+ KAK  A         E KAK        +KE A  ++E
Sbjct: 106 ---QAEQAAKQAEEKQKQAEEAKAKQAA---------EAKAKAEAEAEKKAKEEAKKQAE 153

Query: 414 AEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFG 473
            E KA      KK+A    KK   + + +  AK +  A  E  +  +E A K K   +  
Sbjct: 154 EEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK--AKAEEAKAKAEAA-KAKAAAEAA 210

Query: 474 MRDE 477
            + E
Sbjct: 211 AKAE 214


>gnl|CDD|143435 cd07117, ALDH_StaphAldA1, Uncharacterized Staphylococcus aureus
           AldA1 (SACOL0154) aldehyde dehydrogenase-like.
           Uncharacterized aldehyde dehydrogenase from
           Staphylococcus aureus (AldA1, locus SACOL0154) and other
           similar sequences are present in this CD.
          Length = 475

 Score = 30.5 bits (69), Expect = 3.0
 Identities = 24/105 (22%), Positives = 50/105 (47%), Gaps = 15/105 (14%)

Query: 339 LREIVELTPSDHSHA----REAFKSMGINNIQDLNKSINQLQQKIEKTK---AKIIAYNN 391
           L EI + T +D   A    +EAFK+     + +    +N++   I++ K   A +   +N
Sbjct: 29  LSEITDATDADVDRAVKAAQEAFKTWRKTTVAERANILNKIADIIDENKELLAMVETLDN 88

Query: 392 GEDLTEEPKAKLSKEIAVPESEAEFK--AWLIETKKKRAYIIDKK 434
           G+ + E      ++ + +P +   F+  A +I  ++  A +ID+ 
Sbjct: 89  GKPIRE------TRAVDIPLAADHFRYFAGVIRAEEGSANMIDED 127


>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
           (DUF1750).  This is a fungal domain of unknown function.
          Length = 669

 Score = 30.8 bits (69), Expect = 3.2
 Identities = 26/122 (21%), Positives = 45/122 (36%), Gaps = 10/122 (8%)

Query: 392 GEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAA 451
             D    P+   + ++  PE   EF   + +        I+K   R  RR +   R +A 
Sbjct: 342 TGDSDHGPEKPQATKLE-PEVADEFADRVAKKVADMTAEIEKLKKRHARRMEKFNRTSAL 400

Query: 452 AQERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAGDTDSEE---EQERLIELEEIL 508
                RL    A       D G   E W +   +  D  D  S +   E +   ++++I+
Sbjct: 401 KDAEQRLRDAAADPA----DTG--SEIWRLEGRLEIDTEDDSSRQGPREHKAKYKVDDIV 454

Query: 509 RQ 510
           R+
Sbjct: 455 RE 456


>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
          Length = 417

 Score = 30.6 bits (69), Expect = 3.2
 Identities = 21/81 (25%), Positives = 38/81 (46%), Gaps = 5/81 (6%)

Query: 298 LTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAF 357
           L +E    R+ +L +    M  + RE+ L   ER + +    R  +E+       A E  
Sbjct: 182 LESELVSRRKNDLKQ----MYEEDREDYLGKLERDITKFFVDRGFLEIKSPILIPA-EYI 236

Query: 358 KSMGINNIQDLNKSINQLQQK 378
           + MGI+N  +L+K I ++ + 
Sbjct: 237 ERMGIDNDTELSKQIFRVDKN 257


>gnl|CDD|222317 pfam13684, Dak1_2, Dihydroxyacetone kinase family.  This is the
           kinase domain of the dihydroxyacetone kinase family.
          Length = 309

 Score = 30.2 bits (69), Expect = 3.4
 Identities = 29/94 (30%), Positives = 45/94 (47%), Gaps = 9/94 (9%)

Query: 323 EERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINNI----QDLNKSINQLQQK 378
           EE L  +E    + L+   +V + P +     E FKS+G++ +    Q +N S   +   
Sbjct: 76  EEILEKEEEDEPKELKDYGVVAVAPGEG--IAELFKSLGVDVVIEGGQTMNPSTEDILAA 133

Query: 379 IEKTKAK--IIAYNNGED-LTEEPKAKLSKEIAV 409
           IEK  AK  I+  NN    L  E  A+L+K+  V
Sbjct: 134 IEKVNAKNVIVLPNNKNIILAAEQAAELAKDKNV 167


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 29.9 bits (67), Expect = 3.5
 Identities = 24/131 (18%), Positives = 39/131 (29%), Gaps = 7/131 (5%)

Query: 398 EPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMR 457
             K K + E  VP  EAE  A        R      +   K++ Q L ++   + QE   
Sbjct: 66  GLKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDG 125

Query: 458 LISELARKEKRDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTS 517
              E   +   D      D D        +        EE E    +EE   +   +   
Sbjct: 126 PGKEP--QLDEDKFLLAEDSD-----DRQETLEAGKVHEETEDSYHVEETASEQYKQDMK 178

Query: 518 LNQEQELSPKE 528
               ++ +   
Sbjct: 179 EKASEQENEDS 189


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 30.1 bits (68), Expect = 3.6
 Identities = 11/65 (16%), Positives = 24/65 (36%), Gaps = 5/65 (7%)

Query: 473 GMRDEDWDVYKVIN-----KDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPK 527
            + D D  +  +       K   D   ++ QE L E  + L+ +  +   +  E + S +
Sbjct: 338 NLDDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEE 397

Query: 528 EANQL 532
              + 
Sbjct: 398 AEAKA 402


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 30.0 bits (68), Expect = 3.7
 Identities = 29/159 (18%), Positives = 66/159 (41%), Gaps = 8/159 (5%)

Query: 300 TEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKS 359
            +Q      E  ++LV+   K+  + L D ++ L E  +L+E+          ARE  + 
Sbjct: 122 KQQTSVLTPEEERELVQ-KIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEK 180

Query: 360 MG--INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFK 417
           +    N  Q+ ++ + +L ++ ++ + +    +  E+  E  K            + E +
Sbjct: 181 IQELANEAQEYHEEMIKLFEEADELRKEADELH--EEFVELSKKIDELHEEFRNLQNELR 238

Query: 418 AWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERM 456
               + K  RA    +K A+++ +++  K R     E+ 
Sbjct: 239 ELEKKIKALRAK---EKAAKRREKREELKERAEEIYEKF 274



 Score = 29.6 bits (67), Expect = 4.6
 Identities = 32/178 (17%), Positives = 68/178 (38%), Gaps = 21/178 (11%)

Query: 300 TEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKS 359
            E+ K++R EL K+  E+  K+ E         LN   ++RE+ E          E  + 
Sbjct: 22  IEELKEKRDELRKEASELAEKRDE---------LNA--KVRELREKAQELREERDEINEE 70

Query: 360 MGI--NNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFK 417
           +        ++N  + +L+++  + K K   +N G          L +EI       E K
Sbjct: 71  VQELKEKRDEINAKLQELRKEYRELKEKRNEFNLG----GRSIKSLEREI----ERLEKK 122

Query: 418 AWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMR 475
                   +    + +K    ++  + AK+     ++   L +E+   +K+  +   +
Sbjct: 123 QQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEK 180


>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein.  This family
           consists of the p50 and variable adherence-associated
           antigen (Vaa) adhesins from Mycoplasma hominis. M.
           hominis is a mycoplasma associated with human urogenital
           diseases, pneumonia, and septic arthritis. An adhesin is
           a cell surface molecule that mediates adhesion to other
           cells or to the surrounding surface or substrate. The
           Vaa antigen is a 50-kDa surface lipoprotein that has
           four tandem repetitive DNA sequences encoding a periodic
           peptide structure, and is highly immunogenic in the
           human host. p50 is also a 50-kDa lipoprotein, having
           three repeats A,B and C, that may be a tetramer of
           191-kDa in its native environment.
          Length = 353

 Score = 30.1 bits (67), Expect = 3.8
 Identities = 24/96 (25%), Positives = 42/96 (43%), Gaps = 6/96 (6%)

Query: 295 VPNLTTEQQK--DRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSH 352
           V N   E++   D+  ++AK+L E NAK  +  + + ++  NE  EL + V  T ++   
Sbjct: 99  VENAKNEKKAIDDKNAQIAKELAERNAKI-QSNIEELKKINNEAFELSKTVNKTIAEVEK 157

Query: 353 A---REAFKSMGINNIQDLNKSINQLQQKIEKTKAK 385
                + FK    N   DL     Q+ +    T  +
Sbjct: 158 KFKIPKDFKEQLENFADDLLDKSRQIDEFTTVTSTQ 193


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 29.2 bits (66), Expect = 3.8
 Identities = 16/79 (20%), Positives = 35/79 (44%), Gaps = 4/79 (5%)

Query: 421 IETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWD 480
            + K++   I ++K  R++R+Q+L K+     +ER   +  L  +   ++D    D +  
Sbjct: 42  AKEKEREERIEERKRIREERKQELEKQL----KERKEALKLLEEENDDEEDAETEDTEDV 97

Query: 481 VYKVINKDAGDTDSEEEQE 499
                      T ++ E+E
Sbjct: 98  EDDEWEGFPEPTVTDYEEE 116


>gnl|CDD|130402 TIGR01335, psaA, photosystem I core protein PsaA.  The core
           proteins of photosystem I are PsaA and PsaB, homologous
           integral membrane proteins that form a heterodimer. The
           heterodimer binds the electron-donating chlorophyll
           dimer P700, as well as chlorophyll, phylloquinone, and
           4FE-4S electron acceptors. This model describes PsaA
           only [Energy metabolism, Photosynthesis].
          Length = 752

 Score = 30.3 bits (68), Expect = 4.2
 Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 4/48 (8%)

Query: 610 HFSVSLAENPELDAWSGARQFSLSENFHDFAVTQSDYQEKGGEFFRVH 657
           HFS +LA+ P+   W     ++L  + HDF    SD ++   + F  H
Sbjct: 34  HFSRTLAKGPKTTTWI----WNLHADAHDFDSHTSDLEDISRKIFSAH 77


>gnl|CDD|176459 cd03586, PolY_Pol_IV_kappa, DNA Polymerase IV/Kappa.  Pol IV, also
           known as Pol kappa, DinB, and Dpo4, is a translesion
           synthesis (TLS) polymerase.  Translesion synthesis is a
           process that allows the bypass of a variety of DNA
           lesions.  TLS polymerases lack proofreading activity and
           have low fidelity and low processivity.  They use
           damaged DNA as templates and insert nucleotides opposite
           the lesions.  Known primarily as Pol IV in prokaryotes
           and Pol kappa in eukaryotes, this polymerase has a
           propensity for generating frameshift mutations.  The
           eukaryotic Pol kappa differs from Pol IV and Dpo4 by an
           N-terminal extension of ~75 residues known as the
           "N-clasp" region.  The structure of Pol kappa shows DNA
           that is almost totally encircled by Pol kappa, with the
           N-clasp region augmenting the interactions between DNA
           and the polymerase. Pol kappa is more resistant than Pol
           eta and Pol iota to bulky guanine adducts and is
           efficient at catalyzing the incorporation of dCTP.
           Bacterial pol IV has a higher error rate than other
           Y-family polymerases.
          Length = 334

 Score = 30.2 bits (69), Expect = 4.2
 Identities = 19/89 (21%), Positives = 35/89 (39%), Gaps = 8/89 (8%)

Query: 355 EAFKSMGINNIQDL-NKSINQLQQKIEKTKAKIIAYNNGED---LTEEPKAK-LSKEIAV 409
           E  K +GI  I DL    +  L++   K+  ++     G D   +  + + K +  E   
Sbjct: 185 EKLKELGIKTIGDLAKLDVELLKKLFGKSGRRLYELARGIDNRPVEPDRERKSIGVERTF 244

Query: 410 PE---SEAEFKAWLIETKKKRAYIIDKKN 435
            E      E    L+E  ++ A  + K+ 
Sbjct: 245 SEDLTDPEELLEELLELAEELAERLRKRG 273


>gnl|CDD|233252 TIGR01056, topB, DNA topoisomerase III, bacteria and conjugative
           plasmid.  This model describes topoisomerase III from
           bacteria and its equivalents encoded on plasmids. The
           gene is designated topB if found in the bacterial
           chromosome, traE on conjugative plasmid RP4, etc. These
           enzymes are involved in the control of DNA topology. DNA
           topoisomerase III belongs to the type I topoisomerases,
           which are ATP-independent [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 660

 Score = 30.2 bits (68), Expect = 4.3
 Identities = 27/108 (25%), Positives = 46/108 (42%), Gaps = 14/108 (12%)

Query: 390 NNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRR-------- 441
           N    + ++   K  KE     +EA  +A +IE   KR +I  KKN     +        
Sbjct: 495 NPAAFVQDKGLKKTLKETKGLGTEAT-RADIIENLFKRGFIQKKKNKIYITKNGKLLCLL 553

Query: 442 --QDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYKVINK 487
             + L K    A  E+   ++ ++  EK DDDF +   +  + + IN+
Sbjct: 554 LPELLTKPDLTAQWEQY--LNGISAGEKDDDDF-INTINEMIKQTINE 598


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 29.5 bits (67), Expect = 4.3
 Identities = 21/98 (21%), Positives = 42/98 (42%), Gaps = 15/98 (15%)

Query: 347 PSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEP------K 400
           PS   +  +         ++ L K + +L+Q+I + +A+I     G + TEE        
Sbjct: 60  PSQALNKLK-------TRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEEL 112

Query: 401 AKLSKEIAVPESE-AEFKAWLIET-KKKRAYIIDKKNA 436
            +L KE+   ++E  +++    E  +K +      K A
Sbjct: 113 KQLEKELKKLKAELEKYEKNDPERIEKLKEETKVAKEA 150


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
            functions are known are involvedin recombination,
            recombinational repair, and/or non-homologous end
            joining.They are components of an exonuclease complex
            with MRE11 homologs. This family is distantly related to
            the SbcC family of bacterial proteins.This family is
            based on the phylogenomic analysis of JA Eisen (1999,
            Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 30.4 bits (68), Expect = 4.3
 Identities = 44/239 (18%), Positives = 95/239 (39%), Gaps = 27/239 (11%)

Query: 302  QQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMG 361
            Q+K    +     +E+N K  +++  +  +HL       +  +L    +   R+ F+   
Sbjct: 832  QEKQHELDTVVSKIELNRKLIQDQ-QEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQL 890

Query: 362  ---INNIQDLNKSINQLQQKI---EKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 415
                  +Q L + I   +++    E    K        D  E+ +   SKE +  +++ +
Sbjct: 891  VELSTEVQSLIREIKDAKEQDSPLETFLEK--------DQQEKEELISSKETSNKKAQDK 942

Query: 416  FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMR 475
                  + K    Y+ D +N  +  + D  K++     E   + ++L   EK  +     
Sbjct: 943  VNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKE---TELNTVNAQLEECEKHQEKI--- 996

Query: 476  DEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQLHI 534
            +ED      +     D D+++ QER ++    LR+ + E   + +E +   KE  Q+ +
Sbjct: 997  NED------MRLMRQDIDTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQV 1049


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 30.4 bits (69), Expect = 4.4
 Identities = 31/99 (31%), Positives = 47/99 (47%), Gaps = 20/99 (20%)

Query: 315 VEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINNIQDLNKSINQ 374
           +E+N KKRE           EL +LR+ +E     H  A    +    + I +L++ I Q
Sbjct: 94  IELN-KKREA----------ELAKLRKDLEEANLQHEEALATLRKKHQDAINELSEQIEQ 142

Query: 375 LQ---QKIEKTKAKIIAYNNGEDLT----EEPKAKLSKE 406
           LQ    K EK K+++ A    +DL     +  KAKL+ E
Sbjct: 143 LQKQKAKAEKEKSQLQAEV--DDLLAQLDQIAKAKLNAE 179


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 29.6 bits (67), Expect = 4.4
 Identities = 19/105 (18%), Positives = 45/105 (42%), Gaps = 4/105 (3%)

Query: 363 NNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIE 422
             ++ LNK++  L+ ++E  + ++    +      E   +  ++++  + E E +A  IE
Sbjct: 38  AELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIE 97

Query: 423 TKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEK 467
            +       ++ N+ +    +L +      +E   L   L R EK
Sbjct: 98  IQI----AKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEK 138


>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
          Length = 212

 Score = 29.3 bits (66), Expect = 5.1
 Identities = 19/102 (18%), Positives = 41/102 (40%), Gaps = 10/102 (9%)

Query: 363 NNIQDLNKSINQL---QQKIEKTK---AKIIAYNNGEDLTEEPKAKLSKEIAVPESEA-- 414
            N++ L   +++L    +K +K K   A I A  + E++   P  +      +P      
Sbjct: 109 LNVKRLKAYLSKLVLFPRKGKKPKKGFAGIPADTSKEEVVALPVKQNKNSEVIPFQRTPK 168

Query: 415 EFKAWLI--ETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQE 454
             KA +I  E +   AY   ++     +     +++  A ++
Sbjct: 169 REKARVITKEERAFNAYRTLRQAKLNAKFVGKKEKKKQAKEK 210


>gnl|CDD|222647 pfam14282, FlxA, FlxA-like protein.  This family includes FlxA from
           E. coli. The expression of FlxA is regulated by the FliA
           sigma factor, a transcription factor specific for class
           3 flagellar operons. However FlxA is not required for
           flagellar function or formation.
          Length = 106

 Score = 28.2 bits (63), Expect = 5.2
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 372 INQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEI 407
           I  LQ++I+    ++   ++ EDLT E K K  + I
Sbjct: 21  IKSLQKQIQSLTKQLKELSSSEDLTAEEKQKQQQLI 56


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 29.9 bits (67), Expect = 5.6
 Identities = 17/122 (13%), Positives = 39/122 (31%), Gaps = 9/122 (7%)

Query: 405 KEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELAR 464
           +E   P    E      ++ +K    +     +   R     +      + +++ S    
Sbjct: 763 EEFEYPRMYEEASG---KSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVE 819

Query: 465 KEKRDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQEL 524
            E        R+E     K I+  A     +++  ++  +  IL Q    + S  ++   
Sbjct: 820 TEGERCTIKQREE-----KGIDAPAILNVKKKKPYKVDNMTRILPQQSR-YISFIKDDRF 873

Query: 525 SP 526
            P
Sbjct: 874 VP 875


>gnl|CDD|180481 PRK06231, PRK06231, F0F1 ATP synthase subunit B; Validated.
          Length = 205

 Score = 29.0 bits (65), Expect = 5.7
 Identities = 20/88 (22%), Positives = 45/88 (51%), Gaps = 7/88 (7%)

Query: 300 TEQQKDRRKELAKKLVEM--NAKKREERLVDD--ERHLNELLELREIVELTPSDHSHARE 355
           T++  ++RKEL +  +      K++ ++L+++  +RH N L + +EI++    +    + 
Sbjct: 73  TQRFLNKRKELIEAEINQANELKQQAQQLLENAKQRHENALAQAKEIIDQANYEALQLKS 132

Query: 356 AFKSMGINNIQDLNKSINQLQQKIEKTK 383
             +       +  N  I Q +Q+IEK +
Sbjct: 133 ELEK---EANRQANLIIFQARQEIEKER 157


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 29.7 bits (67), Expect = 6.0
 Identities = 22/146 (15%), Positives = 48/146 (32%), Gaps = 14/146 (9%)

Query: 363 NNIQDLNKSINQLQQKIEKTKAKIIAYN-NGEDLTEEPKAKLSKEIAVPESEAEFKAWLI 421
             ++ L   + + + ++E+ +         GE L     A L     + E     +    
Sbjct: 293 KELKKLENKLEKQEDELEELEKAAEELRQKGELLY----ANLQL---IEEGLKSVRLADF 345

Query: 422 ETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDV 481
              ++    +DK     +  Q   K+       ++ L  +L+  ++    +         
Sbjct: 346 YGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAK----- 400

Query: 482 YKVINKDAGDTDSEEEQERLIELEEI 507
              + K  G    EE +E LIE   +
Sbjct: 401 -TALEKAEGKKAIEEIREELIEEGLL 425


>gnl|CDD|199850 cd03868, M14_CPD_I, Peptidase M14 carboxypeptidase subfamily
           N/E-like; Carboxypeptidase D, domain I subgroup.  The
           first carboxypeptidase (CP)-like domain of
           Carboxypeptidase D (CPD; EC 3.4.17.22), domain I. CPD
           differs from all other metallocarboxypeptidases in that
           it contains multiple CP-like domains. CPD belongs to the
           N/E-like subfamily of the M14 family of
           metallocarboxypeptidases (MCPs).The M14 family are
           zinc-binding CPs which hydrolyze single, C-terminal
           amino acids from polypeptide chains, and have a
           recognition site for the free C-terminal carboxyl group,
           which is a key determinant of specificity. CPD is a
           single-chain protein containing a signal peptide, three
           tandem repeats of CP-like domains separated by short
           bridge regions, followed by a transmembrane domain, and
           a C-terminal cytosolic tail. The first two CP-like
           domains of CPD contain all of the essential active site
           and substrate-binding residues, the third CP-like domain
           lacks critical residues necessary for enzymatic activity
           and is inactive towards standard CP substrates. Domain I
           is optimally active at pH 6.3-7.5 and prefers substrates
           with C-terminal Arg, whereas domain II is active at pH
           5.0-6.5 and prefers substrates with C-terminal Lys. This
           Domain I family contains two contiguous surface
           cysteines that may become palmitoylated and target the
           enzyme to membranes, thus regulating intracellular
           trafficking. CPD functions in the processing of proteins
           that transit the secretory pathway, and is present in
           all vertebrates as well as Drosophila. It is broadly
           distributed in all tissue types. Within cells, CPD is
           present in the trans Golgi network and immature
           secretory vesicles, but is excluded from mature
           vesicles. It is thought to play a role in the processing
           of proteins that are initially processed by furin or
           related endopeptidases present in the trans Golgi
           network, such as growth factors and receptors. CPD is
           implicated in the pathogenesis of lupus erythematosus
           (LE), it is regulated by TGF-beta in various cell types
           of murine and human origin and is significantly
           down-regulated in CD14 positive cells isolated from
           patients with LE. As down-regulation of CPD leads to
           down-modulation of TGF-beta, CPD may have a role in a
           positive feedback loop. In D. melanogaster, the CPD
           variant 1B short (DmCPD1Bs) is necessary and sufficient
           for viability of the fruit fly.
          Length = 293

 Score = 29.2 bits (66), Expect = 6.1
 Identities = 21/68 (30%), Positives = 24/68 (35%), Gaps = 12/68 (17%)

Query: 587 LCQLPGFVER---LNKDLLENRPFQSHFSVSLAEN----PELDA---WSGARQFSLSENF 636
            C   G+  R    N DL  NR F   F           PE  A   W  +  F LS N 
Sbjct: 116 DCSCGGYGGRGNANNVDL--NRNFPDQFEGKHVRAQERQPETLAMINWIRSNPFVLSANL 173

Query: 637 HDFAVTQS 644
           H  +V  S
Sbjct: 174 HGGSVVAS 181


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 29.7 bits (67), Expect = 6.2
 Identities = 21/106 (19%), Positives = 39/106 (36%), Gaps = 21/106 (19%)

Query: 298 LTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELRE------IVELTPSDHS 351
           L   ++ D  +EL K++  +      ER +        L  L          EL      
Sbjct: 854 LAAVERSDTYRELRKRIAAL------ERTLVRAGGGLSLEALVAEAAALDPDELPARLEE 907

Query: 352 HAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTE 397
            AR+         I++L + +N+L Q++   K ++   + G    E
Sbjct: 908 LARD---------IEELEEELNELAQEVGAAKQELARMDGGSTAAE 944


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 29.3 bits (66), Expect = 6.4
 Identities = 30/164 (18%), Positives = 64/164 (39%), Gaps = 15/164 (9%)

Query: 290 PYAVPVPNLTTEQQKDRRKELAKKLVEMNAKKRE-----ERLVDDERHLNELLELREIVE 344
           P+A     L+  Q+K R+ EL  K+ E  A++ +      R+  +ER      + ++  +
Sbjct: 48  PHAKKAFELSKMQEKTRQAELEAKIKEYEAQQAQAKLERARVEAEERRKTLQEQTQQEQQ 107

Query: 345 LTPSDHSHAREAF----KSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPK 400
                   AR+ +    +     N + L      + ++    +A           T E +
Sbjct: 108 RAQYQDELARKRYQKELEQQRRQNEELLKMQEESVLRQEAMRRATEEEILEMRRETIEEE 167

Query: 401 AKLSKEIAVPESEAEFKAWLIETKKK-----RAYIIDKKNARKQ 439
           A+L +E    + EAE +    + +++     R  +  K N  ++
Sbjct: 168 AELERENIRAKIEAEARG-RAKEERENEDINREMLKLKANEERE 210


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
           coiled-coil proteins that are involved in pre-rRNA
           processing.
          Length = 105

 Score = 27.8 bits (62), Expect = 6.5
 Identities = 26/84 (30%), Positives = 36/84 (42%), Gaps = 21/84 (25%)

Query: 399 PKAKLS---KEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQER 455
           PK+KL+   K +     +   KA   E K       D+K A +QRR    K R AA +E+
Sbjct: 23  PKSKLTSWEKRMEKRLEQQAIKAREKELK-------DEKEAERQRRIQAIKERRAAKEEK 75

Query: 456 MRL-----------ISELARKEKR 468
            R            +  L R+EKR
Sbjct: 76  ERYEKMAAKMHAKKVERLKRREKR 99


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 29.5 bits (67), Expect = 6.6
 Identities = 17/92 (18%), Positives = 43/92 (46%), Gaps = 5/92 (5%)

Query: 318 NAKKREERLVDDERHLNE-LLELREIVELTPSDHSHAREAFKSMGINNIQDLNKSINQLQ 376
             K+  + L ++ + L E L +++++++  P     AR          ++ L ++  +L 
Sbjct: 331 ELKEELKELEEELKELEEELEKIKKLLKKLPK---KARGQLPPEKREQLEKLLETKEKLS 387

Query: 377 QKIEKTKAKI-IAYNNGEDLTEEPKAKLSKEI 407
           +++E+ + ++       E L  E K  ++K I
Sbjct: 388 EELEELEEELKELKEELESLYSEGKISVNKTI 419


>gnl|CDD|178850 PRK00083, frr, ribosome recycling factor; Reviewed.
          Length = 185

 Score = 28.9 bits (66), Expect = 6.6
 Identities = 11/22 (50%), Positives = 15/22 (68%), Gaps = 4/22 (18%)

Query: 293 VPVPNLTTEQQKDRRKELAKKL 314
           +P+P LT E    RRKEL K++
Sbjct: 100 LPIPPLTEE----RRKELVKQV 117


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 29.7 bits (67), Expect = 6.8
 Identities = 25/130 (19%), Positives = 52/130 (40%), Gaps = 13/130 (10%)

Query: 419 WLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDED 478
            L+   KKR  +  +     Q  Q  A+ +  AA    + +    R ++R +    R+E 
Sbjct: 20  VLVVLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREEL 79

Query: 479 WDVYKVINKDAGDTDSEEE-----QERLIELEEILRQHDPEFTSLNQEQE--------LS 525
               + + +     D+  E     + +L E E+ L   + E   L ++ +        L+
Sbjct: 80  QREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLT 139

Query: 526 PKEANQLHIG 535
           P++A +L + 
Sbjct: 140 PEQARKLLLK 149


>gnl|CDD|212669 cd10227, ParM_like, Plasmid segregation protein ParM and similar
           proteins.  ParM is a plasmid-encoded bacterial homolog
           of actin, which polymerizes into filaments similar to
           F-actin, and plays a vital role in plasmid segregation.
           ParM filaments segregate plasmids paired at midcell into
           the individual daughter cells. This subfamily also
           contains Thermoplasma acidophilum Ta0583, an active
           ATPase at physiological temperatures, which has a
           propensity to form filaments.
          Length = 312

 Score = 29.3 bits (66), Expect = 7.1
 Identities = 16/92 (17%), Positives = 36/92 (39%), Gaps = 3/92 (3%)

Query: 190 EGQSGVIISCGYQCTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPSHINSIT 249
           + +  ++I  G   T V+   NG    S +  ++LG   +   + K L  +Y      ++
Sbjct: 165 KDKKVLVIDIGGGTTDVVVFDNGKPVESSSGSLELGVSDLYEAIAKELNKEYGI---DLS 221

Query: 250 PSRSEELLWDYGFVATDYREHLRKWLDAEFYD 281
               EE+L +         E + + ++    +
Sbjct: 222 DEEIEEILRNGKIKNYGKEEDITEIIEEAAEE 253


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 28.9 bits (65), Expect = 7.3
 Identities = 26/149 (17%), Positives = 43/149 (28%), Gaps = 18/149 (12%)

Query: 389 YNNGEDLTEEPKAKL-SKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKR 447
               E    + +A+L   E A    +    A + +  K        +       +D  KR
Sbjct: 49  GTEDELRKRDLRAELEEAERAHKSKKENKLA-IEDADKSTNLDASNEGDEDDDEEDEIKR 107

Query: 448 RTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEI 507
           +      R    +  A       D    D+D            D   +E    L ELE+I
Sbjct: 108 KRIEEDAR----NSDADDSDSSSDSDSSDDD---------SDDDDSEDETAALLRELEKI 154

Query: 508 LRQHDPEFTSL---NQEQELSPKEANQLH 533
            ++   E          +E   +E   L 
Sbjct: 155 KKERAEEKEREEEEKAAEEEKAREEEILT 183


>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b.  This model
           describes the F1/F0 ATP synthase b subunit in bacteria
           only. Scoring just below the trusted cutoff are the
           N-terminal domains of Mycobacterial b/delta fusion
           proteins and a subunit from an archaeon, Methanosarcina
           barkeri, in which the ATP synthase homolog differs in
           architecture and is not experimentally confirmed. This
           model helps resolve b from the related b' subunit.
           Within the family is an example from a
           sodium-translocating rather than proton-translocating
           ATP synthase [Energy metabolism, ATP-proton motive force
           interconversion].
          Length = 147

 Score = 28.5 bits (64), Expect = 7.5
 Identities = 26/99 (26%), Positives = 49/99 (49%), Gaps = 16/99 (16%)

Query: 378 KIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNAR 437
           K  +T+ K IA    + L    +AK  KE A+ + +A+    L E K +   II+  NA 
Sbjct: 22  KAIETRQKKIA----DGLASAERAK--KEAALAQKKAQVI--LKEAKDEAQEIIE--NAN 71

Query: 438 KQRRQDLAKRRTAAAQERMRLIS------ELARKEKRDD 470
           K+  + L + +  A +ER ++ +      E  +++ R++
Sbjct: 72  KRGSEILEEAKAEAREEREKIKAQARAEIEAEKEQAREE 110


>gnl|CDD|219911 pfam08572, PRP3, pre-mRNA processing factor 3 (PRP3).  Pre-mRNA
           processing factor 3 (PRP3) is a U4/U6-associated
           splicing factor. The human PRP3 has been implicated in
           autosomal retinitis pigmentosa.
          Length = 222

 Score = 28.9 bits (65), Expect = 7.6
 Identities = 16/81 (19%), Positives = 28/81 (34%), Gaps = 15/81 (18%)

Query: 442 QDLAKRRTAAAQERMR-LISELARKEKRDDDFGMRD--------------EDWDVYKVIN 486
           Q   + R  A  E     ISE A+         + +              E WD+  + +
Sbjct: 32  QQAEELRREAKLEEEEARISEEAQNAGLKSATDLAEKIPKKELPDDVPDIEWWDMPILDD 91

Query: 487 KDAGDTDSEEEQERLIELEEI 507
            D  D + E +++   E+  I
Sbjct: 92  PDYNDLEDESDEDIDEEMSSI 112


>gnl|CDD|189049 cd09879, PIN_AF0591, PIN domain of Archaeoglobus fulgidus AF0591
           protein and other similar archaeal homologs.  PIN (PilT
           N terminus) domain of Archaeoglobus fulgidus AF0591
           protein and other similar uncharacterized archaeal
           homologs are included in this family. These PIN domains
           are structural homologs of flap endonuclease-1
           (FEN1)-like PIN domains, but lack the extensive
           arch/clamp region  and the H3TH (helix-3-turn-helix)
           domain, an atypical helix-hairpin-helix-2-like region,
           seen in FEN1-like PIN domains. PIN domains within this
           subgroup contain four highly conserved acidic residues
           (putative metal-binding, active site residues) which
           cluster at the C-terminal end of the beta-sheet and form
           a negatively charged pocket near the center of the
           molecule.
          Length = 117

 Score = 27.9 bits (63), Expect = 7.9
 Identities = 9/31 (29%), Positives = 13/31 (41%)

Query: 382 TKAKIIAYNNGEDLTEEPKAKLSKEIAVPES 412
           T A +     G D+ EE +     E  VP +
Sbjct: 5   TNALMYPVQFGVDIFEELERLGYPEFVVPSA 35


>gnl|CDD|153336 cd07652, F-BAR_Rgd1, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Saccharomyces cerevisiae 
           Rho GTPase activating protein Rgd1 and similar proteins.
            F-BAR domains are dimerization modules that bind and
           bend membranes and are found in proteins involved in
           membrane dynamics and actin reorganization.
           Saccharomyces cerevisiae Rgd1 is a GTPase activating
           protein (GAP) with activity towards Rho3p and Rho4p,
           which are involved in bud growth and cytokinesis,
           respectively. At low pH, S. cerevisiae Rgd1 is required
           for cell survival and the activation of the protein
           kinase C pathway, which is important in cell integrity
           and the maintenance of cell shape. It contains an
           N-terminal F-BAR domain and a C-terminal Rho GAP domain.
           The F-BAR domain of S. cerevisiae Rgd1 binds to
           phosphoinositides and plays an important role in the
           localization of the protein to the bud tip/neck during
           the cell cycle. F-BAR domains form banana-shaped dimers
           with a positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 234

 Score = 28.9 bits (65), Expect = 8.0
 Identities = 41/177 (23%), Positives = 63/177 (35%), Gaps = 54/177 (30%)

Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
           E+  D     AK L EM+               +EL  L + VE        +R++ K  
Sbjct: 75  EKLADNGLRFAKALNEMS---------------DELSSLAKTVE-------KSRKSIKET 112

Query: 361 GI---NNIQDLNKSINQLQQKIEKTKAKIIAYNNGED-----LTEEPKAKL------SKE 406
           G      +QD        +   EK KA+   Y++  D      T +P  KL      +K 
Sbjct: 113 GKRAEKKVQDA-------EAAAEKAKAR---YDSLADDLERVKTGDPGKKLKFGLKGNKS 162

Query: 407 IAVPESEAEFKAWLIET--KKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISE 461
            A  E E   K    +     K         A+  R++ L++ R  A ++   LI E
Sbjct: 163 AAQHEDELLRKVQAADQDYASK------VNAAQALRQELLSRHRPEAVKDLFDLILE 213


>gnl|CDD|221268 pfam11853, DUF3373, Protein of unknown function (DUF3373).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria. Proteins in this
           family are typically between 472 to 574 amino acids in
           length.
          Length = 485

 Score = 29.4 bits (66), Expect = 8.2
 Identities = 9/43 (20%), Positives = 20/43 (46%)

Query: 356 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEE 398
           +  +       DL + I  L++++ + KA++       D TE+
Sbjct: 17  SLPAAAAAADIDLLQKIEALKKELAELKAQLKDLQKRVDKTEK 59


>gnl|CDD|151391 pfam10944, DUF2630, Protein of unknown function (DUF2630).  This
           bacterial family of proteins have no known function.
          Length = 81

 Score = 26.9 bits (60), Expect = 8.7
 Identities = 16/58 (27%), Positives = 23/58 (39%), Gaps = 14/58 (24%)

Query: 453 QERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQ 510
           Q+ +  I+EL  +E       +R +            G+ D  EE  RL  LE  L Q
Sbjct: 4   QDILARINELVAEEHE-----LRAK---------LQQGEIDESEEHARLARLEVALDQ 47


>gnl|CDD|221178 pfam11709, Mit_ribos_Mrp51, Mitochondrial ribosomal protein
           subunit.  This family is the mitochondrial ribosomal
           small-subunit protein Mrp51. Its function is not
           entirely clear, but deletion of the MRP51 gene
           completely blocked mitochondrial gene expression.
          Length = 302

 Score = 28.9 bits (65), Expect = 8.8
 Identities = 21/103 (20%), Positives = 36/103 (34%), Gaps = 13/103 (12%)

Query: 331 RHL--NELLELREIVELTP-SDHSHAREAFKSMGI------NNIQDLNKSINQLQQKIEK 381
           RH+  N+L  L  + +      H   R  F+ +G+           L +S +        
Sbjct: 64  RHISVNDLDTLEHMTDFESAGGHQWTRLRFQELGLPVSLPPPESNPLFESASDKTDNSPP 123

Query: 382 TKAKIIAYNNGEDLT-EEPKAKLSKEIAVPESEAEFKAWLIET 423
             + + +      +T  E    L K   V     EF+ WL+E 
Sbjct: 124 LSSLLNSGPWLAGMTPGEFSEYLKK---VSPLRPEFRKWLLEK 163


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 29.2 bits (65), Expect = 9.2
 Identities = 48/226 (21%), Positives = 85/226 (37%), Gaps = 28/226 (12%)

Query: 269 EHLRKWLDAEFYDSNVVKVQLPYAVPVPNLTTEQQKDRRKELAKKLVEM------NAKKR 322
           E  R+ ++       + K+QL           +Q  D R E  KKL+EM      +AK  
Sbjct: 122 ERERQAMELFLLRDTLEKLQLRME------RLQQTLDARDEEIKKLLEMLQSKGLSAKAE 175

Query: 323 EERLVDDER---------HLNELLELREIVELTPSDHSH-----AREAFKSMGINN-IQD 367
           EE      R         HL  LLE +E   +   +  H      R+  K+      I +
Sbjct: 176 EEDSERLGRIREAEDQQSHLEVLLEQKEKEHMMLREEIHQKLQMERDDAKTEASQKLIDE 235

Query: 368 LNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEF-KAWLIETKKK 426
            +  I + ++ +EK + +I    +  D +E  + +L KE+    S     KA      ++
Sbjct: 236 KDTKIKEFEKMLEKAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKAKCDRAAQE 295

Query: 427 RAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDF 472
            +    +    +   + LA + +   Q   +L  +L R E+     
Sbjct: 296 LSRKKTELLGLQTELETLANQDSDMRQHLDKLKEDLTRAEQEKAIL 341


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 28.4 bits (64), Expect = 9.3
 Identities = 39/206 (18%), Positives = 82/206 (39%), Gaps = 55/206 (26%)

Query: 301 EQQKDRRKELAKKLVEM-NAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKS 359
           EQ+K R+K+  K+  +  +A+K E   V +E + +         +  P +++     F  
Sbjct: 16  EQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKS-------KKKAAPIENAEGNIVFSK 68

Query: 360 MGINNIQDLNKS-------------INQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKE 406
           +   + +   K                QL +K+E  K K+      E+L E+  A++ ++
Sbjct: 69  VEFADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKKL------EELDEDKAAEIEEK 122

Query: 407 I-----------------------AVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQD 443
                                   A+   E + K    E K+++  +  KK  R+++R++
Sbjct: 123 EKWTKALAKAEGVKVKDDEKLLKKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREE 182

Query: 444 -LAKRRTAAAQERMRLISELARKEKR 468
            L KR+     ++ +     A+K+ R
Sbjct: 183 NLKKRKDDKKNKKKK----KAKKKGR 204


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 28.8 bits (65), Expect = 9.6
 Identities = 19/76 (25%), Positives = 31/76 (40%), Gaps = 13/76 (17%)

Query: 301 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 360
           E Q+   +E  K+L+E    +RE+ L + ER    +LE +   +          EA    
Sbjct: 235 EAQERSYQEHVKQLIEKMEAEREKLLAEQER----MLEHKLQEQEELLKEGFKTEA---- 286

Query: 361 GINNIQDLNKSINQLQ 376
                + L K I  L+
Sbjct: 287 -----ESLQKEIQDLK 297


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 29.4 bits (66), Expect = 9.7
 Identities = 29/156 (18%), Positives = 57/156 (36%), Gaps = 16/156 (10%)

Query: 299  TTEQQKDRRKELAKKLVEMNAKKR--EERLVDDERHLNELLELREIVELTPSDHSHAREA 356
               Q  D  K      +         E     +E+  +ELL + + +           + 
Sbjct: 2452 KILQDIDNEKNKENNNMNAEKIDDLIENVTSHNEKIKSELLIINDALRRVKEKKDEMNKL 2511

Query: 357  FKSMGINN----------IQDLNKSINQLQQKIEKTKAKIIAYNNGE--DLTEEPKAKLS 404
            F S+  NN          + +    IN+L+  + K    +++Y + E  +L  E K KL 
Sbjct: 2512 FNSLTENNNNNNNSAKNIVDNSTYIINELESHVSKLNE-LLSYIDNEIKELENE-KLKLL 2569

Query: 405  KEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQR 440
            ++  + ES  E +    ET++           +++R
Sbjct: 2570 EKAKIEESRKERERIESETQEDNTDEEQINRQQQER 2605


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 28.0 bits (63), Expect = 9.7
 Identities = 8/33 (24%), Positives = 16/33 (48%)

Query: 436 ARKQRRQDLAKRRTAAAQERMRLISELARKEKR 468
             KQ ++  +K +    +ER R + +  +K K 
Sbjct: 97  RNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKH 129


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.134    0.396 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 35,836,871
Number of extensions: 3596620
Number of successful extensions: 5299
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5022
Number of HSP's successfully gapped: 289
Length of query: 691
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 587
Effective length of database: 6,324,786
Effective search space: 3712649382
Effective search space used: 3712649382
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (27.5 bits)