RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1907
         (1320 letters)



>gnl|CDD|224557 COG1643, HrpA, HrpA-like helicases [DNA replication, recombination,
           and repair].
          Length = 845

 Score =  332 bits (853), Expect = 1e-97
 Identities = 125/231 (54%), Positives = 156/231 (67%), Gaps = 7/231 (3%)

Query: 1   MTDGTLHREFLSEPDLASYSVMIIDEAHERTLHTDILFGLVKDIVRF-RSDLKLLISSAT 59
           MTDG L RE  ++P L+ YSV+IIDEAHER+L+TDIL GL+KD++   R DLKL+I SAT
Sbjct: 145 MTDGILLREIQNDPLLSGYSVVIIDEAHERSLNTDILLGLLKDLLARRRDDLKLIIMSAT 204

Query: 60  LDAQKFSEFFDDAPVFRIPGRRFPVDIYYTKAPEADYI-DACVVSILQIHATQPLGDILV 118
           LDA++FS +F +APV  I GR +PV+I Y    EADYI    +V+ + IH  +  G ILV
Sbjct: 205 LDAERFSAYFGNAPVIEIEGRTYPVEIRYLPEAEADYILLDAIVAAVDIHLREGSGSILV 264

Query: 119 FLTGQEEIETCNEILQDRVRRLGSKIKELIILPVYSNLPTDMQTKIFLPTPPNARKVVLA 178
           FL GQ EIE   E L+     LG    +L ILP+Y  L  + Q ++F P P   RKVVLA
Sbjct: 265 FLPGQREIERTAEWLEKA--ELGD---DLEILPLYGALSAEEQVRVFEPAPGGKRKVVLA 319

Query: 179 TNIAETSLTIDNIIYVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAG 229
           TNIAETSLTI  I YVID G  K+  ++ RTG+  L   PISKASA+QRAG
Sbjct: 320 TNIAETSLTIPGIRYVIDSGLAKEKRYDPRTGLTRLETEPISKASADQRAG 370



 Score =  303 bits (778), Expect = 2e-87
 Identities = 126/406 (31%), Positives = 178/406 (43%), Gaps = 54/406 (13%)

Query: 796  VYVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAGRAGRVAAGKCFRLYTAWAYKHE 855
             YVID G  K+  ++ RTG+  L   PISKASA+QRAGRAGR   G C+RLY+   +   
Sbjct: 333  RYVIDSGLAKEKRYDPRTGLTRLETEPISKASADQRAGRAGRTGPGICYRLYSEEDFL-A 391

Query: 856  LEDNTVPEIQRINLGNAVLMLKALGI-NDLIHFDFLDPPPHETLVLALEQLYALGALNHH 914
              + T+PEI R +L   VL LK+LGI  D+  F FLDPPP   +  AL  L  LGAL+  
Sbjct: 392  FPEFTLPEILRTDLSGLVLQLKSLGIGQDIAPFPFLDPPPEAAIQAALTLLQELGALDDS 451

Query: 915  GDLTKLGRRMAEFPLDPMMGKMLLASEKYKCVEEIVTIASMLSVNGAI--FYRP-KDKII 971
            G LT LG++M+  PLDP + +MLL + +  C+ E  TIASMLS       F R  K +  
Sbjct: 452  GKLTPLGKQMSLLPLDPRLARMLLTAPEGGCLGEAATIASMLSEQDRESDFSRDVKLRKQ 511

Query: 972  HADTARKNFF------MLGGDHLVLLNIYNQW-----ADADFST-QWCYENFIQHRSMKR 1019
                               GDHL+LL  +        A  ++     C       +++ R
Sbjct: 512  RTAQDLLKRLKRRNAADPRGDHLLLLEAFPDRIARKRAKGEYLRANGCRAMLFPTKALSR 571

Query: 1020 ARDVREQLIGLVQRVE-------------------MELVSNPTETVNIRKAITAGYFYHI 1060
            A  +   L+     +                     E   +     +IR A+ AG   +I
Sbjct: 572  APWIIAALLVQTSALAGRILAAAEIDEDEWAAQHLPEHCYSEPIWDDIRGALAAGRKLNI 631

Query: 1061 AKLSKSGS-YKTAKHNQTVMIHPNS-SLFEELPRWVLYHELVFTTKEFMRQ--------- 1109
            A+L   G  Y T   N  V  HP+S  L   L  W+ Y E + T K ++R+         
Sbjct: 632  AQLQLDGRPYVTLSDNTPVFAHPSSVRLGLVLLEWIKYAEFLRTRKGYLREGRGERWPDV 691

Query: 1110 --VTEIESKWLLEVAP---HYYKPRELEDSTNKKMPKTLGKSRAEL 1150
              + E+   WL E              + +  + +P  L     +L
Sbjct: 692  QTLIELLKLWLKEQVKGLRGLDGLT--KAAMKQALPDLLPWDLTQL 735



 Score =  207 bits (529), Expect = 4e-55
 Identities = 76/137 (55%), Positives = 96/137 (70%), Gaps = 1/137 (0%)

Query: 657 KKKLNIEETKKSLPIYPFKQDLIDAVRDHQVLIIEGETGSGKTTQIPQYLHEAGFTKDKM 716
               +I E +  LP+   + +++ A+  +QV+II GETGSGKTTQ+PQ+L E G      
Sbjct: 37  ANVPDILEYRSGLPVTAVRDEILKAIEQNQVVIIVGETGSGKTTQLPQFLLEEGL-GIAG 95

Query: 717 KIGCTQPRRVAAMSVSARVAEEMGVKLGCEVGYAIRFEDCTSERTIIKYMTDGTLHREFL 776
           KIGCTQPRR+AA SV+ RVAEE+G KLG  VGY+IRFE   S RT IK MTDG L RE  
Sbjct: 96  KIGCTQPRRLAARSVAERVAEELGEKLGETVGYSIRFESKVSPRTRIKVMTDGILLREIQ 155

Query: 777 SEPDLASYSVMIIDEAH 793
           ++P L+ YSV+IIDEAH
Sbjct: 156 NDPLLSGYSVVIIDEAH 172



 Score = 89.7 bits (223), Expect = 5e-18
 Identities = 44/214 (20%), Positives = 75/214 (35%), Gaps = 43/214 (20%)

Query: 1145 KSRAELGGDHLVLLNIYNQW-----ADADFST-QWCYENFIQHRSMKRARDVREQLIGLV 1198
            ++ A+  GDHL+LL  +        A  ++     C       +++ RA  +   L+   
Sbjct: 524  RNAADPRGDHLLLLEAFPDRIARKRAKGEYLRANGCRAMLFPTKALSRAPWIIAALLVQT 583

Query: 1199 QRVE-------------------MELVSNPTETVNIRKAITAGYFYHIAKLSKSGS-YKT 1238
              +                     E   +     +IR A+ AG   +IA+L   G  Y T
Sbjct: 584  SALAGRILAAAEIDEDEWAAQHLPEHCYSEPIWDDIRGALAAGRKLNIAQLQLDGRPYVT 643

Query: 1239 AKHNQTVMIHPNS-SLFEELPRWVLYHELVFTTKEFMRQ-----------VTEIESKWLL 1286
               N  V  HP+S  L   L  W+ Y E + T K ++R+           + E+   WL 
Sbjct: 644  LSDNTPVFAHPSSVRLGLVLLEWIKYAEFLRTRKGYLREGRGERWPDVQTLIELLKLWLK 703

Query: 1287 EVAP---HYYKPRELEDSTNKKMPKTLGKSRAEL 1317
            E              + +  + +P  L     +L
Sbjct: 704  EQVKGLRGLDGLT--KAAMKQALPDLLPWDLTQL 735


>gnl|CDD|233659 TIGR01967, DEAH_box_HrpA, ATP-dependent helicase HrpA.  This model
            represents HrpA, one of two related but uncharacterized
            DEAH-box ATP-dependent helicases in many Proteobacteria
            and a few high-GC Gram-positive bacteria. HrpA is about
            1300 amino acids long, while its paralog HrpB, also
            uncharacterized, is about 800 amino acids long. Related
            characterized eukarotic proteins are RNA helicases
            associated with pre-mRNA processing [Unknown function,
            Enzymes of unknown specificity].
          Length = 1283

 Score =  209 bits (534), Expect = 9e-55
 Identities = 121/369 (32%), Positives = 191/369 (51%), Gaps = 12/369 (3%)

Query: 797  YVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAGRAGRVAAGKCFRLYTAWAYKHEL 856
            YVID G  + + ++ RT ++ L + PIS+ASANQR GR GRVA G C RLY+   +    
Sbjct: 351  YVIDTGTARISRYSYRTKVQRLPIEPISQASANQRKGRCGRVAPGICIRLYSEEDFNSRP 410

Query: 857  EDNTVPEIQRINLGNAVLMLKALGINDLIHFDFLDPPPHETL---VLALEQLYALGALNH 913
            E  T PEI R NL + +L + AL + D+  F F++ P    +      LE+L AL     
Sbjct: 411  E-FTDPEILRTNLASVILQMLALRLGDIAAFPFIEAPDPRAIRDGFRLLEELGALDDDEA 469

Query: 914  HGDLTKLGRRMAEFPLDPMMGKMLLASEKYKCVEEIVTIASMLSVNGAIFYRPKDKIIHA 973
               LT +GR++A+ P+DP + +MLL + +  C++E++ IAS LS+      RP +K   A
Sbjct: 470  EPQLTPIGRQLAQLPVDPRLARMLLEAHRLGCLQEVLIIASALSIQD-PRERPMEKQQAA 528

Query: 974  DTARKNFFMLGGDHLVLLNIYNQWAD---ADFSTQ---WCYENFIQHRSMKRARDVREQL 1027
            D A   F     D L  +N++    +   A  + Q    C + ++ +  ++  +D+  QL
Sbjct: 529  DQAHARFKDPRSDFLSRVNLWRHIEEQRQALSANQFRNACRKQYLNYLRVREWQDIYRQL 588

Query: 1028 IGLVQRVEMELVSNPTETVNIRKAITAGYFYHIAKLSKSGSYKTAKHNQTVMIHPNSSLF 1087
              +V+ + ++L   P +   I KA+ +G    I    +   Y  A+  +   I P S LF
Sbjct: 589  TQVVKELGLKLNEEPADYDAIHKALLSGLLSQIGMKDEKHEYDGAR-GRKFHIFPGSPLF 647

Query: 1088 EELPRWVLYHELVFTTKEFMRQVTEIESKWLLEVAPHYYKPRELEDSTNKKMPKTLGKSR 1147
            ++ P+WV+  ELV T+K + R V +IE +W+  VA H  K    E    KK  + +   +
Sbjct: 648  KKPPKWVMAAELVETSKLYARLVAKIEPEWVEPVAGHLIKKNYFEPHWEKKRGQVMAYEK 707

Query: 1148 AELGGDHLV 1156
              L G  LV
Sbjct: 708  VTLYGLTLV 716



 Score =  195 bits (498), Expect = 2e-50
 Identities = 96/235 (40%), Positives = 148/235 (62%), Gaps = 15/235 (6%)

Query: 1   MTDGTLHREFLSEPDLASYSVMIIDEAHERTLHTDILFGLVKDIVRFRSDLKLLISSATL 60
           MTDG L  E   +  L+ Y  +IIDEAHER+L+ D L G +K ++  R DLK++I+SAT+
Sbjct: 162 MTDGILLAETQQDRFLSRYDTIIIDEAHERSLNIDFLLGYLKQLLPRRPDLKIIITSATI 221

Query: 61  DAQKFSEFFDDAPVFRIPGRRFPVDIYY------TKAPEADYIDACVVSILQIHATQPLG 114
           D ++FS  F++AP+  + GR +PV++ Y       +  + D ++A + ++ ++ A  P G
Sbjct: 222 DPERFSRHFNNAPIIEVSGRTYPVEVRYRPLVEEQEDDDLDQLEAILDAVDELFAEGP-G 280

Query: 115 DILVFLTGQEEIETCNEILQDRVRRLGSKIKELIILPVYSNLPTDMQTKIFLPTPPNARK 174
           DIL+FL G+ EI    EIL+ R       ++   ILP+Y+ L    Q ++F   P + R+
Sbjct: 281 DILIFLPGEREIRDAAEILRKR------NLRHTEILPLYARLSNKEQQRVF--QPHSGRR 332

Query: 175 VVLATNIAETSLTIDNIIYVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAG 229
           +VLATN+AETSLT+  I YVID G  + + ++ RT ++ L + PIS+ASANQR G
Sbjct: 333 IVLATNVAETSLTVPGIHYVIDTGTARISRYSYRTKVQRLPIEPISQASANQRKG 387



 Score =  140 bits (355), Expect = 2e-33
 Identities = 63/133 (47%), Positives = 89/133 (66%), Gaps = 1/133 (0%)

Query: 668 SLPIYPFKQDLIDAVRDHQVLIIEGETGSGKTTQIPQYLHEAGFTKDKMKIGCTQPRRVA 727
           +LP+   ++D+ +A+ ++QV+II GETGSGKTTQ+P+   E G     + IG TQPRR+A
Sbjct: 65  NLPVSAKREDIAEAIAENQVVIIAGETGSGKTTQLPKICLELGRGSHGL-IGHTQPRRLA 123

Query: 728 AMSVSARVAEEMGVKLGCEVGYAIRFEDCTSERTIIKYMTDGTLHREFLSEPDLASYSVM 787
           A +V+ R+AEE+G  LG +VGY +RF D  S  T++K MTDG L  E   +  L+ Y  +
Sbjct: 124 ARTVAQRIAEELGTPLGEKVGYKVRFHDQVSSNTLVKLMTDGILLAETQQDRFLSRYDTI 183

Query: 788 IIDEAHPKVYVID 800
           IIDEAH +   ID
Sbjct: 184 IIDEAHERSLNID 196



 Score = 76.7 bits (189), Expect = 6e-14
 Identities = 39/133 (29%), Positives = 67/133 (50%), Gaps = 1/133 (0%)

Query: 1173 WCYENFIQHRSMKRARDVREQLIGLVQRVEMELVSNPTETVNIRKAITAGYFYHIAKLSK 1232
             C + ++ +  ++  +D+  QL  +V+ + ++L   P +   I KA+ +G    I    +
Sbjct: 567  ACRKQYLNYLRVREWQDIYRQLTQVVKELGLKLNEEPADYDAIHKALLSGLLSQIGMKDE 626

Query: 1233 SGSYKTAKHNQTVMIHPNSSLFEELPRWVLYHELVFTTKEFMRQVTEIESKWLLEVAPHY 1292
               Y  A+  +   I P S LF++ P+WV+  ELV T+K + R V +IE +W+  VA H 
Sbjct: 627  KHEYDGAR-GRKFHIFPGSPLFKKPPKWVMAAELVETSKLYARLVAKIEPEWVEPVAGHL 685

Query: 1293 YKPRELEDSTNKK 1305
             K    E    KK
Sbjct: 686  IKKNYFEPHWEKK 698


>gnl|CDD|233660 TIGR01970, DEAH_box_HrpB, ATP-dependent helicase HrpB.  This model
           represents HrpB, one of two related but uncharacterized
           DEAH-box ATP-dependent helicases in many Proteobacteria,
           but also in a few species of other lineages. The member
           from Rhizobium meliloti has been designated HelO. HrpB
           is typically about 800 residues in length, while its
           paralog HrpA (TIGR01967), also uncharacterized, is about
           1300 amino acids long. Related characterized eukarotic
           proteins are RNA helicases associated with pre-mRNA
           processing [Unknown function, Enzymes of unknown
           specificity].
          Length = 819

 Score =  196 bits (500), Expect = 1e-51
 Identities = 98/230 (42%), Positives = 126/230 (54%), Gaps = 9/230 (3%)

Query: 2   TDGTLHREFLSEPDLASYSVMIIDEAHERTLHTDILFGLVKDIVR-FRSDLKLLISSATL 60
           T+G L R    +P+L     +I DE HER+L  D+   L  D+    R DLK+L  SATL
Sbjct: 97  TEGILTRMIQDDPELDGVGALIFDEFHERSLDADLGLALALDVQSSLREDLKILAMSATL 156

Query: 61  DAQKFSEFFDDAPVFRIPGRRFPVDIYYTKAPEADYI-DACVVSILQIHATQPLGDILVF 119
           D ++ S    DAPV    GR FPV+I Y        + DA   ++    A++  G ILVF
Sbjct: 157 DGERLSSLLPDAPVVESEGRSFPVEIRYLPLRGDQRLEDAVSRAVEHALASET-GSILVF 215

Query: 120 LTGQEEIETCNEILQDRVRRLGSKIKELIILPVYSNLPTDMQTKIFLPTPPNARKVVLAT 179
           L GQ EI    E L +R   L S   +++I P+Y  L    Q +   P P   RKVVLAT
Sbjct: 216 LPGQAEIRRVQEQLAER---LDS---DVLICPLYGELSLAAQDRAIKPDPQGRRKVVLAT 269

Query: 180 NIAETSLTIDNIIYVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAG 229
           NIAETSLTI+ I  VID G  +   F+ +TG+  L  V IS+ASA QRAG
Sbjct: 270 NIAETSLTIEGIRVVIDSGLARVARFDPKTGITRLETVRISQASATQRAG 319



 Score =  113 bits (285), Expect = 2e-25
 Identities = 56/142 (39%), Positives = 73/142 (51%), Gaps = 1/142 (0%)

Query: 798 VIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAGRAGRVAAGKCFRLYTAWAYKHELE 857
           VID G  +   F+ +TG+  L  V IS+ASA QRAGRAGR+  G C+RL+ +      L 
Sbjct: 284 VIDSGLARVARFDPKTGITRLETVRISQASATQRAGRAGRLEPGVCYRLW-SEEQHQRLP 342

Query: 858 DNTVPEIQRINLGNAVLMLKALGINDLIHFDFLDPPPHETLVLALEQLYALGALNHHGDL 917
               PEI + +L    L L   G  D     +LD PP   L  A + L  LGAL+  G L
Sbjct: 343 AQDEPEILQADLSGLALELAQWGAKDPSDLRWLDAPPSVALAAARQLLQRLGALDAQGRL 402

Query: 918 TKLGRRMAEFPLDPMMGKMLLA 939
           T  G+ MA     P +  MLL+
Sbjct: 403 TAHGKAMAALGCHPRLAAMLLS 424



 Score =  100 bits (251), Expect = 3e-21
 Identities = 45/126 (35%), Positives = 69/126 (54%), Gaps = 4/126 (3%)

Query: 669 LPIYPFKQDLIDAVRDHQVLIIEGETGSGKTTQIP-QYLHEAGFTKDKMKIGCTQPRRVA 727
           LPI+     L DA+  H  +++E   G+GK+T +P   L   G      KI   +PRR+A
Sbjct: 1   LPIHAVLPALRDALAAHPQVVLEAPPGAGKSTAVPLALLDAPGIGG---KIIMLEPRRLA 57

Query: 728 AMSVSARVAEEMGVKLGCEVGYAIRFEDCTSERTIIKYMTDGTLHREFLSEPDLASYSVM 787
           A S + R+A ++G  +G  VGY +R E+  S RT ++ +T+G L R    +P+L     +
Sbjct: 58  ARSAAQRLASQLGEAVGQTVGYRVRGENKVSRRTRLEVVTEGILTRMIQDDPELDGVGAL 117

Query: 788 IIDEAH 793
           I DE H
Sbjct: 118 IFDEFH 123


>gnl|CDD|182986 PRK11131, PRK11131, ATP-dependent RNA helicase HrpA; Provisional.
          Length = 1294

 Score =  189 bits (482), Expect = 1e-48
 Identities = 117/350 (33%), Positives = 185/350 (52%), Gaps = 29/350 (8%)

Query: 797  YVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAGRAGRVAAGKCFRLYTAWAYKHEL 856
            YVIDPG  + + ++ RT ++ L + PIS+ASANQR GR GRV+ G C RLY+        
Sbjct: 358  YVIDPGTARISRYSYRTKVQRLPIEPISQASANQRKGRCGRVSEGICIRLYS-------- 409

Query: 857  EDN-------TVPEIQRINLGNAVLMLKALGINDLIHFDFLDPPPHETLVLALEQLYALG 909
            ED+       T PEI R NL + +L + ALG+ D+  F F++ P    +   +  L  LG
Sbjct: 410  EDDFLSRPEFTDPEILRTNLASVILQMTALGLGDIAAFPFVEAPDKRNIQDGVRLLEELG 469

Query: 910  ALN-----HHGDLTKLGRRMAEFPLDPMMGKMLLASEKYKCVEEIVTIASMLSVNGAIFY 964
            A+          LT LGR++A+ P+DP + +M+L ++K+ CV E++ I S LS+      
Sbjct: 470  AITTDEQASAYKLTPLGRQLAQLPVDPRLARMVLEAQKHGCVREVMIITSALSIQDP-RE 528

Query: 965  RPKDKIIHADTARKNFFMLGGDHLVLLNIYN---QWADADFSTQW---CYENFIQHRSMK 1018
            RP DK   +D   + F     D L  +N++N   +   A  S Q+   C  +++ +  ++
Sbjct: 529  RPMDKQQASDEKHRRFADKESDFLAFVNLWNYLQEQQKALSSNQFRRLCRTDYLNYLRVR 588

Query: 1019 RARDVREQLIGLVQRVEMELVSNPTETVNIRKAITAGYFYHIA-KLSKSGSYKTAKHNQT 1077
              +D+  QL  +V+ + + + S P E   I  A+  G   HI  K ++   Y  A+ N  
Sbjct: 589  EWQDIYTQLRQVVKELGIPVNSEPAEYREIHTALLTGLLSHIGMKDAEKQEYTGAR-NAR 647

Query: 1078 VMIHPNSSLFEELPRWVLYHELVFTTKEFMRQVTEIESKWLLEVAPHYYK 1127
              I P S LF++ P+WV+  ELV T++ + R    IE +W+  +A H  K
Sbjct: 648  FSIFPGSGLFKKPPKWVMVAELVETSRLWGRIAARIEPEWIEPLAQHLIK 697



 Score =  186 bits (474), Expect = 1e-47
 Identities = 93/236 (39%), Positives = 140/236 (59%), Gaps = 17/236 (7%)

Query: 1   MTDGTLHREFLSEPDLASYSVMIIDEAHERTLHTDILFGLVKDIVRFRSDLKLLISSATL 60
           MTDG L  E   +  L  Y  +IIDEAHER+L+ D + G +K+++  R DLK++I+SAT+
Sbjct: 169 MTDGILLAEIQQDRLLMQYDTIIIDEAHERSLNIDFILGYLKELLPRRPDLKVIITSATI 228

Query: 61  DAQKFSEFFDDAPVFRIPGRRFPVDIYYTKAPEADYIDACVVSILQ--IHATQPL----- 113
           D ++FS  F++AP+  + GR +PV++ Y   P  +  D      LQ    A   L     
Sbjct: 229 DPERFSRHFNNAPIIEVSGRTYPVEVRYR--PIVEEADDTERDQLQAIFDAVDELGREGP 286

Query: 114 GDILVFLTGQEEIETCNEILQDRVRRLGSKIKELIILPVYSNLPTDMQTKIFLPTPPNAR 173
           GDIL+F++G+ EI    + L          ++   ILP+Y+ L    Q ++F     + R
Sbjct: 287 GDILIFMSGEREIRDTADALNKL------NLRHTEILPLYARLSNSEQNRVF--QSHSGR 338

Query: 174 KVVLATNIAETSLTIDNIIYVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAG 229
           ++VLATN+AETSLT+  I YVIDPG  + + ++ RT ++ L + PIS+ASANQR G
Sbjct: 339 RIVLATNVAETSLTVPGIKYVIDPGTARISRYSYRTKVQRLPIEPISQASANQRKG 394



 Score =  148 bits (375), Expect = 7e-36
 Identities = 68/141 (48%), Positives = 94/141 (66%), Gaps = 3/141 (2%)

Query: 668 SLPIYPFKQDLIDAVRDHQVLIIEGETGSGKTTQIPQYLHEAGFTKDKMKIGCTQPRRVA 727
           +LP+   KQD+++A+RDHQV+I+ GETGSGKTTQ+P+   E G     + IG TQPRR+A
Sbjct: 72  NLPVSQKKQDILEAIRDHQVVIVAGETGSGKTTQLPKICLELGRGVKGL-IGHTQPRRLA 130

Query: 728 AMSVSARVAEEMGVKLGCEVGYAIRFEDCTSERTIIKYMTDGTLHREFLSEPDLASYSVM 787
           A +V+ R+AEE+  +LG  VGY +RF D  S+ T++K MTDG L  E   +  L  Y  +
Sbjct: 131 ARTVANRIAEELETELGGCVGYKVRFNDQVSDNTMVKLMTDGILLAEIQQDRLLMQYDTI 190

Query: 788 IIDEAHPKVYVID--PGFCKQ 806
           IIDEAH +   ID   G+ K+
Sbjct: 191 IIDEAHERSLNIDFILGYLKE 211



 Score = 59.7 bits (145), Expect = 1e-08
 Identities = 43/149 (28%), Positives = 75/149 (50%), Gaps = 8/149 (5%)

Query: 1153 DHLVLLNIYN---QWADADFSTQW---CYENFIQHRSMKRARDVREQLIGLVQRVEMELV 1206
            D L  +N++N   +   A  S Q+   C  +++ +  ++  +D+  QL  +V+ + + + 
Sbjct: 550  DFLAFVNLWNYLQEQQKALSSNQFRRLCRTDYLNYLRVREWQDIYTQLRQVVKELGIPVN 609

Query: 1207 SNPTETVNIRKAITAGYFYHIA-KLSKSGSYKTAKHNQTVMIHPNSSLFEELPRWVLYHE 1265
            S P E   I  A+  G   HI  K ++   Y  A+ N    I P S LF++ P+WV+  E
Sbjct: 610  SEPAEYREIHTALLTGLLSHIGMKDAEKQEYTGAR-NARFSIFPGSGLFKKPPKWVMVAE 668

Query: 1266 LVFTTKEFMRQVTEIESKWLLEVAPHYYK 1294
            LV T++ + R    IE +W+  +A H  K
Sbjct: 669  LVETSRLWGRIAARIEPEWIEPLAQHLIK 697


>gnl|CDD|236950 PRK11664, PRK11664, ATP-dependent RNA helicase HrpB; Provisional.
          Length = 812

 Score =  175 bits (445), Expect = 1e-44
 Identities = 92/230 (40%), Positives = 126/230 (54%), Gaps = 9/230 (3%)

Query: 2   TDGTLHREFLSEPDLASYSVMIIDEAHERTLHTDILFGLVKDIVR-FRSDLKLLISSATL 60
           T+G L R    +P+L+   ++I+DE HER+L  D+   L+ D+ +  R DLKLLI SATL
Sbjct: 100 TEGILTRMIQRDPELSGVGLVILDEFHERSLQADLALALLLDVQQGLRDDLKLLIMSATL 159

Query: 61  DAQKFSEFFDDAPVFRIPGRRFPVDIYYTKAPEADYID-ACVVSILQIHATQPLGDILVF 119
           D  +  +   DAPV    GR FPV+  Y   P     D A   +  ++   +  G +L+F
Sbjct: 160 DNDRLQQLLPDAPVIVSEGRSFPVERRYQPLPAHQRFDEAVARATAELLRQES-GSLLLF 218

Query: 120 LTGQEEIETCNEILQDRVRRLGSKIKELIILPVYSNLPTDMQTKIFLPTPPNARKVVLAT 179
           L G  EI+   E L  RV        ++++ P+Y  L    Q K  LP P   RKVVLAT
Sbjct: 219 LPGVGEIQRVQEQLASRVA------SDVLLCPLYGALSLAEQQKAILPAPAGRRKVVLAT 272

Query: 180 NIAETSLTIDNIIYVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAG 229
           NIAETSLTI+ I  V+D G  +   F+ +TG+  L    IS+AS  QRAG
Sbjct: 273 NIAETSLTIEGIRLVVDSGLERVARFDPKTGLTRLVTQRISQASMTQRAG 322



 Score = 97.7 bits (244), Expect = 2e-20
 Identities = 54/143 (37%), Positives = 73/143 (51%), Gaps = 1/143 (0%)

Query: 798 VIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAGRAGRVAAGKCFRLYTAWAYKHELE 857
           V+D G  +   F+ +TG+  L    IS+AS  QRAGRAGR+  G C  LY+    +    
Sbjct: 287 VVDSGLERVARFDPKTGLTRLVTQRISQASMTQRAGRAGRLEPGICLHLYSKEQAERAAA 346

Query: 858 DNTVPEIQRINLGNAVLMLKALGINDLIHFDFLDPPPHETLVLALEQLYALGALNHHGDL 917
            +  PEI   +L   +L L   G +D     +LD PP   L  A   L  LGAL+  G L
Sbjct: 347 QSE-PEILHSDLSGLLLELLQWGCHDPAQLSWLDQPPAAALAAAKRLLQQLGALDGQGRL 405

Query: 918 TKLGRRMAEFPLDPMMGKMLLAS 940
           T  GR+MA    DP +  ML+A+
Sbjct: 406 TARGRKMAALGNDPRLAAMLVAA 428



 Score = 88.4 bits (220), Expect = 1e-17
 Identities = 44/127 (34%), Positives = 74/127 (58%), Gaps = 4/127 (3%)

Query: 668 SLPIYPFKQDLIDAVRDHQVLIIEGETGSGKTTQIP-QYLHEAGFTKDKMKIGCTQPRRV 726
           SLP+     +L+ A++    ++++  TG+GK+T +P Q L   G      KI   +PRR+
Sbjct: 3   SLPVAAVLPELLTALKTAPQVLLKAPTGAGKSTWLPLQLLQHGGING---KIIMLEPRRL 59

Query: 727 AAMSVSARVAEEMGVKLGCEVGYAIRFEDCTSERTIIKYMTDGTLHREFLSEPDLASYSV 786
           AA +V+ R+AE++G K G  VGY +R E      T ++ +T+G L R    +P+L+   +
Sbjct: 60  AARNVAQRLAEQLGEKPGETVGYRMRAESKVGPNTRLEVVTEGILTRMIQRDPELSGVGL 119

Query: 787 MIIDEAH 793
           +I+DE H
Sbjct: 120 VILDEFH 126


>gnl|CDD|218070 pfam04408, HA2, Helicase associated domain (HA2).  This presumed
           domain is about 90 amino acid residues in length. It is
           found is a diverse set of RNA helicases. Its function is
           unknown, however it seems likely to be involved in
           nucleic acid binding.
          Length = 91

 Score =  136 bits (344), Expect = 9e-38
 Identities = 54/91 (59%), Positives = 67/91 (73%), Gaps = 1/91 (1%)

Query: 901 ALEQLYALGALNHHGDLTKLGRRMAEFPLDPMMGKMLLASEKYKCVEEIVTIASMLSVNG 960
           ALE LY LGAL+  G+LT LGR+MAE PLDP +GKMLL S ++ C++EI+TIA+MLSV  
Sbjct: 2   ALELLYELGALDDDGELTPLGRQMAELPLDPRLGKMLLLSAEFGCLDEILTIAAMLSV-P 60

Query: 961 AIFYRPKDKIIHADTARKNFFMLGGDHLVLL 991
           + FYRPK+K   AD AR+ F     DHL LL
Sbjct: 61  SPFYRPKEKEEEADAARRKFASAESDHLTLL 91


>gnl|CDD|219532 pfam07717, OB_NTP_bind, Oligonucleotide/oligosaccharide-binding
            (OB)-fold.  This family is found towards the C-terminus
            of the DEAD-box helicases (pfam00270). In these helicases
            it is apparently always found in association with
            pfam04408. There do seem to be a couple of instances
            where it occurs by itself - . The structure PDB:3i4u
            adopts an OB-fold. helicases (pfam00270). In these
            helicases it is apparently always found in association
            with pfam04408. This C-terminal domain of the yeast
            helicase contains an
            oligonucleotide/oligosaccharide-binding (OB)-fold which
            seems to be placed at the entrance of the putative
            nucleic acid cavity. It also constitutes the binding site
            for the G-patch-containing domain of Pfa1p. When found on
            DEAH/RHA helicases, this domain is central to the
            regulation of the helicase activity through its binding
            of both RNA and G-patch domain proteins.
          Length = 109

 Score =  128 bits (325), Expect = 5e-35
 Identities = 49/108 (45%), Positives = 68/108 (62%), Gaps = 8/108 (7%)

Query: 1026 QLIGLVQRVEME------LVSNPTETVNIRKAITAGYFYHIAKLSKSGSYK-TAKHNQTV 1078
            QL+ +++R+ +       L S+  +   IRKA+ AG F ++A+L        T K  Q V
Sbjct: 2    QLLEILERLGLPEEPSSLLNSSSNDYELIRKALCAGLFPNVARLVDEKGEYKTLKEGQPV 61

Query: 1079 MIHPNSSLFEELP-RWVLYHELVFTTKEFMRQVTEIESKWLLEVAPHY 1125
             IHP+S LF + P  WV+YHELV TTKE+MR VT I+ +WLLE+APHY
Sbjct: 62   FIHPSSVLFGKKPPEWVVYHELVETTKEYMRDVTAIDPEWLLELAPHY 109



 Score =  128 bits (325), Expect = 5e-35
 Identities = 49/108 (45%), Positives = 68/108 (62%), Gaps = 8/108 (7%)

Query: 1193 QLIGLVQRVEME------LVSNPTETVNIRKAITAGYFYHIAKLSKSGSYK-TAKHNQTV 1245
            QL+ +++R+ +       L S+  +   IRKA+ AG F ++A+L        T K  Q V
Sbjct: 2    QLLEILERLGLPEEPSSLLNSSSNDYELIRKALCAGLFPNVARLVDEKGEYKTLKEGQPV 61

Query: 1246 MIHPNSSLFEELP-RWVLYHELVFTTKEFMRQVTEIESKWLLEVAPHY 1292
             IHP+S LF + P  WV+YHELV TTKE+MR VT I+ +WLLE+APHY
Sbjct: 62   FIHPSSVLFGKKPPEWVVYHELVETTKEYMRDVTAIDPEWLLELAPHY 109


>gnl|CDD|214852 smart00847, HA2, Helicase associated domain (HA2) Add an
           annotation.  This presumed domain is about 90 amino acid
           residues in length. It is found is a diverse set of RNA
           helicases. Its function is unknown, however it seems
           likely to be involved in nucleic acid binding.
          Length = 82

 Score =  108 bits (272), Expect = 3e-28
 Identities = 45/85 (52%), Positives = 57/85 (67%), Gaps = 3/85 (3%)

Query: 907 ALGALNHHGDLTKLGRRMAEFPLDPMMGKMLLASEKYKCVEEIVTIASMLSVNGAIFYRP 966
            LGAL+  G LT LGR+MAE PLDP + KMLLA+ ++ C++EI+TI +MLSV      RP
Sbjct: 1   ELGALDDDGRLTPLGRKMAELPLDPRLAKMLLAAAEFGCLDEILTIVAMLSVGD---PRP 57

Query: 967 KDKIIHADTARKNFFMLGGDHLVLL 991
           K+K   AD AR+ F     DHL LL
Sbjct: 58  KEKREDADAARRRFADPESDHLTLL 82


>gnl|CDD|238005 cd00046, DEXDc, DEAD-like helicases superfamily. A diverse family
           of proteins involved in ATP-dependent RNA or DNA
           unwinding. This domain contains the ATP-binding region.
          Length = 144

 Score =  101 bits (254), Expect = 6e-25
 Identities = 31/115 (26%), Positives = 47/115 (40%), Gaps = 9/115 (7%)

Query: 686 QVLIIEGETGSGKTTQIPQYLHEAGFTKDKMKIGCTQPRRVAAMSVSARVAEEMGVKLGC 745
           + +++   TGSGKT      + E   +    ++    P R  A  V+ R+ E  G  +  
Sbjct: 1   RDVLLAAPTGSGKTLAALLPILELLDSLKGGQVLVLAPTRELANQVAERLKELFGEGI-- 58

Query: 746 EVGYAIRFEDCT------SERTIIKYMTDGTLHREFLSEP-DLASYSVMIIDEAH 793
           +VGY I            S +T I   T G L  E       L    ++I+DEAH
Sbjct: 59  KVGYLIGGTSIKQQEKLLSGKTDIVVGTPGRLLDELERLKLSLKKLDLLILDEAH 113



 Score = 52.7 bits (127), Expect = 6e-08
 Identities = 19/61 (31%), Positives = 28/61 (45%), Gaps = 2/61 (3%)

Query: 1   MTDGTLHREFLSEP-DLASYSVMIIDEAHERTLHTDILFGLVKDIVRFRSDLKLLISSAT 59
            T G L  E       L    ++I+DEAH        L GL K +++   D ++L+ SAT
Sbjct: 85  GTPGRLLDELERLKLSLKKLDLLILDEAHRLLNQGFGLLGL-KILLKLPKDRQVLLLSAT 143

Query: 60  L 60
            
Sbjct: 144 P 144


>gnl|CDD|214692 smart00487, DEXDc, DEAD-like helicases superfamily. 
          Length = 201

 Score = 87.2 bits (216), Expect = 3e-19
 Identities = 32/138 (23%), Positives = 57/138 (41%), Gaps = 8/138 (5%)

Query: 664 ETKKSLPIYPFKQDLIDAV-RDHQVLIIEGETGSGKTTQIPQYLHEAGFTKDKMKIGCTQ 722
           E     P+ P++++ I+A+    + +I+   TGSGKT        EA       ++    
Sbjct: 2   EKFGFEPLRPYQKEAIEALLSGLRDVILAAPTGSGKTLAALLPALEALKRGKGGRVLVLV 61

Query: 723 PRRVAAMSVSARVAEEM----GVKLGCEVGYAIR--FEDCTSERTIIKYMTDGTLHREFL 776
           P R  A   +  + +         +G   G + R       S +T I   T G L     
Sbjct: 62  PTRELAEQWAEELKKLGPSLGLKVVGLYGGDSKREQLRKLESGKTDILVTTPGRLLDLLE 121

Query: 777 SEP-DLASYSVMIIDEAH 793
           ++   L++  ++I+DEAH
Sbjct: 122 NDKLSLSNVDLVILDEAH 139



 Score = 53.6 bits (129), Expect = 8e-08
 Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 8/91 (8%)

Query: 1   MTDGTLHREFLSEP-DLASYSVMIIDEAHERT--LHTDILFGLVKDIVRFRSDLKLLISS 57
            T G L     ++   L++  ++I+DEAH        D L  L+K       +++LL+ S
Sbjct: 111 TTPGRLLDLLENDKLSLSNVDLVILDEAHRLLDGGFGDQLEKLLK---LLPKNVQLLLLS 167

Query: 58  ATL--DAQKFSEFFDDAPVFRIPGRRFPVDI 86
           AT   + +   E F + PVF   G      I
Sbjct: 168 ATPPEEIENLLELFLNDPVFIDVGFTPLEPI 198


>gnl|CDD|197757 smart00490, HELICc, helicase superfamily c-terminal domain. 
          Length = 82

 Score = 49.5 bits (119), Expect = 2e-07
 Identities = 23/99 (23%), Positives = 38/99 (38%), Gaps = 23/99 (23%)

Query: 131 EILQDRVRRLGSKIKELIILPVYSNLPTDMQTKIFLPTPPNARKVVLATNIAETSLTIDN 190
           E L + ++ LG K+       ++  L  + + +I         KV++AT++AE  L +  
Sbjct: 1   EELAELLKELGIKV-----ARLHGGLSQEEREEILDKFNNGKIKVLVATDVAERGLDLPG 55

Query: 191 IIYVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAG 229
           +  VI                      P S AS  QR G
Sbjct: 56  VDLVIIYDL------------------PWSPASYIQRIG 76



 Score = 29.5 bits (67), Expect = 2.1
 Identities = 11/17 (64%), Positives = 12/17 (70%)

Query: 821 VPISKASANQRAGRAGR 837
           +P S AS  QR GRAGR
Sbjct: 64  LPWSPASYIQRIGRAGR 80


>gnl|CDD|177443 PHA02653, PHA02653, RNA helicase NPH-II; Provisional.
          Length = 675

 Score = 54.2 bits (131), Expect = 4e-07
 Identities = 53/201 (26%), Positives = 91/201 (45%), Gaps = 35/201 (17%)

Query: 16  LASYSVMIIDEAHERTLHTDILFGLV-KDIVRFRSDLKLLISSATL--DAQKFSEFFDDA 72
           L  Y  +IIDE HE     DI+  +  K I + RS   L + +ATL  D  +  EFF + 
Sbjct: 289 LFDYGTVIIDEVHEHDQIGDIIIAVARKHIDKIRS---LFLMTATLEDDRDRIKEFFPNP 345

Query: 73  PVFRIPG-RRFPV-DIY----YTKAPEADYIDA---CVVSILQIHATQPLG-DILVFLTG 122
               IPG   FP+ ++Y    Y    +  YI+     +V+ L+ + T P G   +VF+  
Sbjct: 346 AFVHIPGGTLFPISEVYVKNKYNPKNKRAYIEEEKKNIVTALKKY-TPPKGSSGIVFV-- 402

Query: 123 QEEIETCNEILQDRVRRLGSKIKELIILPVYSNLP--TDMQTKIFLPTPPNARK---VVL 177
              +  C E      + L  ++       ++  +P   ++  K++      + K   +++
Sbjct: 403 -ASVSQCEEY----KKYLEKRLPIYDFYIIHGKVPNIDEILEKVY------SSKNPSIII 451

Query: 178 ATNIAETSLTIDNIIYVIDPG 198
           +T   E+S+TI N  +V D G
Sbjct: 452 STPYLESSVTIRNATHVYDTG 472



 Score = 32.3 bits (74), Expect = 2.2
 Identities = 13/61 (21%), Positives = 25/61 (40%), Gaps = 7/61 (11%)

Query: 653 TEKQKKKLNIEETKKSLPIYPFK-------QDLIDAVRDHQVLIIEGETGSGKTTQIPQY 705
              +K  + I    +     P           + +A    + +++ G TG GKT+Q+P+ 
Sbjct: 140 VGSKKDTIGILGNPEPFSKIPLASLQPDVQLKIFEAWISRKPVVLTGGTGVGKTSQVPKL 199

Query: 706 L 706
           L
Sbjct: 200 L 200



 Score = 30.3 bits (69), Expect = 8.4
 Identities = 12/26 (46%), Positives = 14/26 (53%)

Query: 823 ISKASANQRAGRAGRVAAGKCFRLYT 848
           ISK+   QR GR GRV+ G     Y 
Sbjct: 487 ISKSMRTQRKGRVGRVSPGTYVYFYD 512


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 50.9 bits (121), Expect = 5e-06
 Identities = 45/219 (20%), Positives = 92/219 (42%), Gaps = 8/219 (3%)

Query: 450  DDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESR 509
            ++ K+ DE  ++ +   K   +    +   K+A E        +A+   K     + +  
Sbjct: 1480 EEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEA 1539

Query: 510  RKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEK 569
            +K  EK+K D++ + E     +E   +E+     E ++   ++   E AK+ ++AR  E 
Sbjct: 1540 KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKA--EEAKKAEEARIEEV 1597

Query: 570  VQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLL 629
            ++ Y   K  K  E  +  + K K    E+ K  +E+          +  E ++  EL  
Sbjct: 1598 MKLYEEEKKMKAEEAKKAEEAKIKA---EELKKAEEEKKKVEQLKKKEAEEKKKAEELKK 1654

Query: 630  DNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKS 668
              + + I+A        K EE+  + ++ K   E+ KK+
Sbjct: 1655 AEEENKIKAAEE---AKKAEEDKKKAEEAKKAEEDEKKA 1690



 Score = 45.5 bits (107), Expect = 2e-04
 Identities = 45/230 (19%), Positives = 99/230 (43%), Gaps = 23/230 (10%)

Query: 445  LTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKI--IP 502
            L  ++++ K +     +    DK   +    + +D++   EA K+   EAE   K   + 
Sbjct: 1652 LKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK---EAEEAKKAEELK 1708

Query: 503  KLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHD 562
            K   E ++K  E +K ++  +++A+              E ++   E K+K  E  K+ +
Sbjct: 1709 KKEAEEKKKAEELKKAEEENKIKAE--------------EAKKEAEEDKKKAEEAKKDEE 1754

Query: 563  KARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQ 622
            + +++  +++    K  +  ++ E V E+E     E+R+ E ++    IF   A   E  
Sbjct: 1755 EKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGG 1814

Query: 623  EQYELLLDN----QIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKS 668
            ++  L++++    +   I+ ++   N    E +  EK K   N E  +  
Sbjct: 1815 KEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDG 1864



 Score = 45.5 bits (107), Expect = 2e-04
 Identities = 46/235 (19%), Positives = 98/235 (41%), Gaps = 14/235 (5%)

Query: 439  AFMGHFLTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKL-----E 493
            A M HF       + E    ++ LK  ++    + A   ++K+  +EA K+ +      E
Sbjct: 1260 ARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADE 1319

Query: 494  AESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRK 553
            A+ + +   K    +++K  E +K  + A+ EA+   DE   +E++    E++  E K+K
Sbjct: 1320 AKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379

Query: 554  LLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFR 613
                 K+ ++ ++ ++ ++    K  +  +K +E+ +         +K  DE    A  +
Sbjct: 1380 ADAAKKKAEEKKKADEAKK----KAEEDKKKADELKKAA-----AAKKKADEAKKKAEEK 1430

Query: 614  VGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKS 668
              A + + + +     D      +         K  EE  +  + K   EE KK+
Sbjct: 1431 KKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKA 1485



 Score = 44.4 bits (104), Expect = 5e-04
 Identities = 48/233 (20%), Positives = 95/233 (40%), Gaps = 18/233 (7%)

Query: 451  DRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAE-SRDKIIPKLRIESR 509
            D   +  + +   +  ++N  + AT  +     EEA K    +AE +R     K + E  
Sbjct: 1069 DEGLKPSYKDFDFDAKEDNRADEATE-EAFGKAEEAKKTETGKAEEARKAEEAKKKAEDA 1127

Query: 510  RKYLEKRKEDKVAEL-EADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELE 568
            RK  E RK +   +  EA   +D          E  R+  E ++   E AK+ + AR+ E
Sbjct: 1128 RKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKA--EDAKKAEAARKAE 1185

Query: 569  KVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQ-RKWEDEQMSSAIFRV------------G 615
            +V++    +  +   K E   + E+    E+ RK ED + + A+ +              
Sbjct: 1186 EVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKA 1245

Query: 616  AKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKS 668
             ++R ++E  +        F +  +     +  + +  +K ++K   +E KK+
Sbjct: 1246 EEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKA 1298



 Score = 43.2 bits (101), Expect = 0.001
 Identities = 43/226 (19%), Positives = 100/226 (44%), Gaps = 6/226 (2%)

Query: 448  SDDDRK-ERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKL-EAESRDKIIPKLR 505
            +D+ +K E  + ++  K  ++    + A   ++K+  +E  K  +L +AE + K     +
Sbjct: 1512 ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKK 1571

Query: 506  IESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERE--RRDREHKRKLLELAKEHDK 563
             E  +    ++ E+     EA I +   L+ E++  + E  ++  E K K  EL K  ++
Sbjct: 1572 AEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEE 1631

Query: 564  ARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQE 623
             +++E++++    +  K  E  +  +E +     E +K E+++  +   +   +D +   
Sbjct: 1632 KKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAA 1691

Query: 624  QYELLLDNQIDFIQALSLPGNRD--KGEEEMTEKQKKKLNIEETKK 667
            +       +    + L      +  K EE    +++ K+  EE KK
Sbjct: 1692 EALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKK 1737



 Score = 42.1 bits (98), Expect = 0.002
 Identities = 43/219 (19%), Positives = 89/219 (40%), Gaps = 13/219 (5%)

Query: 463  KNRDKNNTKNVATSRQDKRAYEEAAKRL----KLEAESRDKIIPKLRIESRRKYLEKRKE 518
            K  +           ++ +  +EA K+     K   E++     K + +  +K  E +K 
Sbjct: 1465 KAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKA 1524

Query: 519  DKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAK-------EHDKARELEKVQ 571
            D+  + E     DE   +E++    E +  E  +K  E  K       E DK   L K +
Sbjct: 1525 DEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAE 1584

Query: 572  RYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDN 631
                 +  +  E  +  +E++K+   E +K E+ ++ +   +   ++++  EQ +     
Sbjct: 1585 EAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAE 1644

Query: 632  QIDFIQAL--SLPGNRDKGEEEMTEKQKKKLNIEETKKS 668
            +    + L  +   N+ K  EE  + ++ K   EE KK+
Sbjct: 1645 EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKA 1683



 Score = 38.6 bits (89), Expect = 0.029
 Identities = 34/196 (17%), Positives = 91/196 (46%), Gaps = 7/196 (3%)

Query: 476  SRQDKRAYEE--AAKRLKLEAESRDKI--IPKLRIESRRKYLEKRKEDKVAELEADILDD 531
            + + K+A E    A+ LK   E + K+  + K   E ++K  E +K ++  +++A     
Sbjct: 1609 AEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAK 1668

Query: 532  EYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEK 591
            +    +++  E ++ + + K+    L KE ++A++ E++++    +  K  E  +  +E 
Sbjct: 1669 KAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEN 1728

Query: 592  EKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEE 651
            +      +++ E+++  +   +   KD E +++   L   +    + +         EE 
Sbjct: 1729 KIKAEEAKKEAEEDKKKAEEAK---KDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEEL 1785

Query: 652  MTEKQKKKLNIEETKK 667
              E +K+++ +++  K
Sbjct: 1786 DEEDEKRRMEVDKKIK 1801



 Score = 36.3 bits (83), Expect = 0.16
 Identities = 37/164 (22%), Positives = 69/164 (42%), Gaps = 11/164 (6%)

Query: 453  KERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKI-----IPKLRIE 507
            +E+ E +E+ K   K          ++K+  +EA K+ + + +  D++       K   E
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADE 1422

Query: 508  SRRKYLEKRKED---KVAELEADILDDEYLFSEQELTERERRDREHKRKLLEL---AKEH 561
            +++K  EK+K D   K AE      + +    E +  E  ++  E  +K  E    A+E 
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEA 1482

Query: 562  DKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDE 605
             KA E +K       K  +  +  E   + ++    E+ K  DE
Sbjct: 1483 KKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526



 Score = 35.9 bits (82), Expect = 0.21
 Identities = 46/196 (23%), Positives = 81/196 (41%), Gaps = 16/196 (8%)

Query: 478  QDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSE 537
            +D R  EEA K     AE   ++    + E  RK  E RK +   + EA    +E   +E
Sbjct: 1137 EDARKAEEARK-----AEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAE 1191

Query: 538  QELTERERRDREHKRKL-----LELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKE 592
            +     + R  E  RK       E A++ + A++ E V++    K     +  EE  + E
Sbjct: 1192 ELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAK-----KDAEEAKKAE 1246

Query: 593  KVPHYEQ-RKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEE 651
            +  + E+ RK+E+ +M+    R  A   E   + + L   +       +      K  +E
Sbjct: 1247 EERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADE 1306

Query: 652  MTEKQKKKLNIEETKK 667
              +K ++    +E KK
Sbjct: 1307 AKKKAEEAKKADEAKK 1322



 Score = 35.5 bits (81), Expect = 0.23
 Identities = 43/218 (19%), Positives = 83/218 (38%), Gaps = 9/218 (4%)

Query: 452  RKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRK 511
            + E    +E  +  +      +A   +D R  EEA K     AE   K     + E  RK
Sbjct: 1135 KAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARK-----AEDAKKAEAARKAEEVRK 1189

Query: 512  YLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQ 571
              E RK +   + EA    +E   +E+     + +  E  +K  E  K+ ++A++ E+ +
Sbjct: 1190 AEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEER 1249

Query: 572  RYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVG-AKDREHQEQYELLLD 630
                 +  +          +  +   E RK ++ + +    +   AK  E +++ +    
Sbjct: 1250 NNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKK 1309

Query: 631  NQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKS 668
               +  +A      + K EE   +    K   EE KK+
Sbjct: 1310 KAEEAKKADEA---KKKAEEAKKKADAAKKKAEEAKKA 1344



 Score = 32.4 bits (73), Expect = 2.3
 Identities = 39/221 (17%), Positives = 84/221 (38%), Gaps = 20/221 (9%)

Query: 447  DSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRI 506
            D+++ +K  +E +     + +         RQ     EEA K     A+   K   K + 
Sbjct: 1238 DAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARK-----ADELKKAEEKKKA 1292

Query: 507  ESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARE 566
            +  +K  EK+K D+  +   +          ++  E +++  E K+K     K+ ++A++
Sbjct: 1293 DEAKKAEEKKKADEAKKKAEE---------AKKADEAKKKAEEAKKKADAAKKKAEEAKK 1343

Query: 567  LEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYE 626
              +  +          E  EE  E  +    E +K  D        +  A++++  ++ +
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA------AKKKAEEKKKADEAK 1397

Query: 627  LLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKK 667
               +        L       K  +E  +K ++K   +E KK
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK 1438


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 48.2 bits (115), Expect = 3e-05
 Identities = 48/223 (21%), Positives = 88/223 (39%), Gaps = 23/223 (10%)

Query: 451 DRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRR 510
             KER E  + L   +K               YE+ ++ LK   +     I +L  +   
Sbjct: 152 KPKERKEILDELFGLEK---------------YEKLSELLKEVIKEAKAKIEELEGQL-- 194

Query: 511 KYLEKRKEDKVAELEADI---LDDEYLFSEQELTERERRDREHKRKLLELAKEHDKAREL 567
             L +  ED +  LE ++      E +  EQE  E E+     + +L EL +E ++  EL
Sbjct: 195 SELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEEL 254

Query: 568 EKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYEL 627
           +           +  +  EE    E     E+ + + E++   + R   +  E  E    
Sbjct: 255 KARLLEIESLELEALKIREEELR-ELERLLEELEEKIERL-EELEREIEELEEELEGLRA 312

Query: 628 LLDNQIDFIQAL-SLPGNRDKGEEEMTEKQKKKLNIEETKKSL 669
           LL+   + ++ L SL    +K EE++ + + +   + E K  L
Sbjct: 313 LLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNEL 355



 Score = 41.3 bits (97), Expect = 0.004
 Identities = 40/227 (17%), Positives = 81/227 (35%), Gaps = 16/227 (7%)

Query: 449 DDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIES 508
           +D     +   E LK   K          ++     EA +    E E   + + +L+   
Sbjct: 199 EDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARL 258

Query: 509 RRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELE 568
                 + +  K+ E E   L+      E+++   E  +RE +    EL        ELE
Sbjct: 259 LEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELE 318

Query: 569 ------KVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQ 622
                 K     + K     EK E+++ + +     + K    +++  +     +  E  
Sbjct: 319 ELLEKLKSLEERLEKLE---EKLEKLESELE--ELAEEK---NELAKLLEERLKELEERL 370

Query: 623 EQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSL 669
           E+ E  L+  ++ ++ L       + +EE+ E       I+E  + L
Sbjct: 371 EELEKELEKALERLKQLEE--AIQELKEELAELSAALEEIQEELEEL 415



 Score = 39.7 bits (93), Expect = 0.013
 Identities = 41/264 (15%), Positives = 93/264 (35%), Gaps = 19/264 (7%)

Query: 419 RRTQSPCPSCLKVFKTPKVRAFMGHFLTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQ 478
                 CP C +       +  +  +  + ++  +E     E  + R++           
Sbjct: 454 AGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELREL 513

Query: 479 DKRAYEEAAKRLKLEAESRDKI--IPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFS 536
           ++   E       L+ E  +K+  +  L  E      + + +    EL       +    
Sbjct: 514 EEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEEL------RQLEDR 567

Query: 537 EQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYM---------PKGTKGPEKYEE 587
            QEL E     R  + +  EL +  ++ +EL+K  +             +  +  E   E
Sbjct: 568 LQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENE 627

Query: 588 VDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQ-EQYELLLDNQIDFIQAL-SLPGNR 645
           ++E E+    E  K   +     + +   ++ E + E+ E  +  ++  I+    L    
Sbjct: 628 LEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKL 687

Query: 646 DKGEEEMTEKQKKKLNIEETKKSL 669
           ++ E+   E ++ +  +EE  K L
Sbjct: 688 EELEQLEEELEQLREELEELLKKL 711



 Score = 34.4 bits (79), Expect = 0.57
 Identities = 43/228 (18%), Positives = 95/228 (41%), Gaps = 18/228 (7%)

Query: 444 FLTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPK 503
            L  + +  K+ +E  + LK      +  +   ++     EE  K L+      +++  +
Sbjct: 376 ELEKALERLKQLEEAIQELKEELAELSAALEEIQE---ELEELEKELEELERELEELEEE 432

Query: 504 LRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDK 563
           ++    +    + KE  +AEL      ++     QEL E      EH+++LLEL +   +
Sbjct: 433 IKKLEEQINQLESKELMIAELAGA--GEKCPVCGQELPE------EHEKELLELYEL--E 482

Query: 564 ARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQE 623
             ELE+ +     +  +  E+ EE++++ +    E+   E  ++  A+     +  E  E
Sbjct: 483 LEELEE-ELSREKEEAELREEIEELEKELRE--LEEELIELLELEEALKEELEEKLEKLE 539

Query: 624 QYELLLDNQIDFIQALSLPGNRDKGEEEMTE--KQKKKLNIEETKKSL 669
                L+   + +Q   L     + E+ + E  +  ++L +  T+K  
Sbjct: 540 NLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEE 587


>gnl|CDD|238034 cd00079, HELICc, Helicase superfamily c-terminal domain; associated
           with DEXDc-, DEAD-, and DEAH-box proteins, yeast
           initiation factor 4A, Ski2p, and Hepatitis C virus NS3
           helicases; this domain is found in a wide variety of
           helicases and helicase related proteins; may not be an
           autonomously folding unit, but an integral part of the
           helicase; 4 helicase superfamilies at present according
           to the organization of their signature motifs; all
           helicases share the ability to unwind nucleic acid
           duplexes with a distinct directional polarity; they
           utilize the free energy from nucleoside triphosphate
           hydrolysis to fuel their translocation along DNA,
           unwinding the duplex in the process.
          Length = 131

 Score = 43.8 bits (104), Expect = 6e-05
 Identities = 26/148 (17%), Positives = 58/148 (39%), Gaps = 34/148 (22%)

Query: 85  DIYYTKAPEADYIDACVVSILQIHATQPLGDILVFLTGQEEIETCNEILQDRVRRLGSKI 144
            I     P  D     ++ +L+ H  +  G +L+F   ++ ++     L + +R+ G K+
Sbjct: 1   PIKQYVLPVEDEKLEALLELLKEHLKKG-GKVLIFCPSKKMLD----ELAELLRKPGIKV 55

Query: 145 KELIILPVYSNLPTDMQTKI---FLPTPPNARKVVLATNIAETSLTIDNIIYVIDPGFCK 201
                  ++ +   + + ++   F         V++AT++    + + N+  VI      
Sbjct: 56  A-----ALHGDGSQEEREEVLKDFREGEIV---VLVATDVIARGIDLPNVSVVI------ 101

Query: 202 QNNFNSRTGMESLTVVPISKASANQRAG 229
             N++           P S +S  QR G
Sbjct: 102 --NYDL----------PWSPSSYLQRIG 117



 Score = 33.0 bits (76), Expect = 0.34
 Identities = 12/26 (46%), Positives = 13/26 (50%), Gaps = 1/26 (3%)

Query: 822 PISKASANQRAGRAGRVA-AGKCFRL 846
           P S +S  QR GRAGR    G    L
Sbjct: 106 PWSPSSYLQRIGRAGRAGQKGTAILL 131


>gnl|CDD|201125 pfam00271, Helicase_C, Helicase conserved C-terminal domain.  The
           Prosite family is restricted to DEAD/H helicases,
           whereas this domain family is found in a wide variety of
           helicases and helicase related proteins. It may be that
           this is not an autonomously folding unit, but an
           integral part of the helicase.
          Length = 78

 Score = 40.6 bits (96), Expect = 2e-04
 Identities = 16/87 (18%), Positives = 36/87 (41%), Gaps = 18/87 (20%)

Query: 143 KIKELIILPVYSNLPTDMQTKIFLPTPPNARKVVLATNIAETSLTIDNIIYVIDPGFCKQ 202
           +   + +  ++  L  + + +I         KV++AT++A   + + ++  VI       
Sbjct: 4   RKPGIKVARLHGGLSQEEREEILEDFRNGKSKVLVATDVAGRGIDLPDVNLVI------- 56

Query: 203 NNFNSRTGMESLTVVPISKASANQRAG 229
            N++          +P + AS  QR G
Sbjct: 57  -NYD----------LPWNPASYIQRIG 72



 Score = 28.6 bits (65), Expect = 4.0
 Identities = 10/18 (55%), Positives = 12/18 (66%)

Query: 821 VPISKASANQRAGRAGRV 838
           +P + AS  QR GRAGR 
Sbjct: 60  LPWNPASYIQRIGRAGRA 77


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 43.4 bits (103), Expect = 8e-04
 Identities = 39/152 (25%), Positives = 81/152 (53%), Gaps = 26/152 (17%)

Query: 484 EEAAKRLKLEAESR----DKI---IPKLRIESRRKYLEKRKED----KVAELEADI--LD 530
           +EAA R+++E +S+    D++   I +L IE  R+ L+K K++    ++ +LE ++  L+
Sbjct: 390 DEAAARIRMEIDSKPEELDELDRRIIQLEIE--REALKKEKDEASKERLEDLEKELAELE 447

Query: 531 DEYLFSEQELTERERRDREHKRKLLELAKEHDKAR-ELEKVQRYYMPKGTKGPE-KYEEV 588
           +EY     +L E+ + ++   + + ++ +E ++ R ELE+ +R       K  E +Y ++
Sbjct: 448 EEY----ADLEEQWKAEKAAIQGIQQIKEEIEQVRLELEQAEREG--DLAKAAELQYGKL 501

Query: 589 DEKEKVPHYEQRKWEDEQMSSAIFR--VGAKD 618
            E EK     + K   E+    + R  V A++
Sbjct: 502 PELEKRLQAAEAK-LGEETKPRLLREEVTAEE 532


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 42.4 bits (100), Expect = 0.002
 Identities = 48/237 (20%), Positives = 98/237 (41%), Gaps = 17/237 (7%)

Query: 445 LTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKL 504
           L + ++D    ++  E +K+  K     +    +D    EEA     LEA      IP++
Sbjct: 739 LEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALN--DLEARLSHSRIPEI 796

Query: 505 R-----IESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAK 559
           +     +E     +E R  +   +L    L+ EYL  E +  + +R D + + K +E   
Sbjct: 797 QAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEI 856

Query: 560 EHDKAR------ELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFR 613
           E+   +      ELE+++       ++  +  +E DE E      + + + E++ + I +
Sbjct: 857 ENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEA--QLRELERKIEELEAQIEK 914

Query: 614 VGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEK-QKKKLNIEETKKSL 669
              +  E + + E L + ++  I+           EE   E  Q +   +EE  ++L
Sbjct: 915 KRKRLSELKAKLEALEE-ELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRAL 970



 Score = 42.4 bits (100), Expect = 0.002
 Identities = 60/269 (22%), Positives = 100/269 (37%), Gaps = 49/269 (18%)

Query: 451 DRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRR 510
             K   E  E LK ++    +  A  RQ     EE  K  +  +E   ++     IE   
Sbjct: 218 KEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLE---EIEQLL 274

Query: 511 KYLEKRKED-----------KVAELEADI--LDDEYLFSEQELTERERRDREHKRKLLEL 557
           + L K+ +D           K+ ELEA+I  L+      E+EL + E R  + + ++ +L
Sbjct: 275 EELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKL 334

Query: 558 AKEHDK-ARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGA 616
             E ++  RE+E+ ++             EE  E ++     + + E+     A  R   
Sbjct: 335 LAEIEELEREIEEERKRRDKLT-------EEYAELKEELEDLRAELEEVDKEFAETRDEL 387

Query: 617 KDREHQ------EQYELL--LDNQIDFIQALS---------LPGNRDKG---EEEMTEKQ 656
           KD   +      E  EL   LD   + +Q LS         + G   K    EEE  +K 
Sbjct: 388 KDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKA 447

Query: 657 KKKLNIEETKKSLPIYPFKQDLIDAVRDH 685
            +    E   + L       DL    ++ 
Sbjct: 448 LEIKKQEWKLEQL-----AADLSKYEQEL 471



 Score = 37.4 bits (87), Expect = 0.064
 Identities = 37/193 (19%), Positives = 83/193 (43%), Gaps = 11/193 (5%)

Query: 480 KRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLE--KRKEDKVAELEADILDDEYLFSE 537
           +R  E+A +RL       DK+     IE   + +E  +++ DK+ E E   L +E     
Sbjct: 314 ERELEDAEERLAKLEAEIDKL--LAEIEELEREIEEERKRRDKLTE-EYAELKEELEDLR 370

Query: 538 QELTERERRDREHKRKLLELAKEHDKA-RELEKVQRYYMPKGTKGPEKYEEVDE-KEKVP 595
            EL E ++   E + +L +  ++ +K  RE+ +++R       +     EE+ +    + 
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIA 430

Query: 596 HYEQRKWE-DEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTE 654
             E +  E +E+       +  ++ + ++    L   +    +   L    D+ E+E+++
Sbjct: 431 GIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYE---QELYDLKEEYDRVEKELSK 487

Query: 655 KQKKKLNIEETKK 667
            Q++    E   +
Sbjct: 488 LQRELAEAEAQAR 500



 Score = 37.4 bits (87), Expect = 0.066
 Identities = 42/224 (18%), Positives = 81/224 (36%), Gaps = 35/224 (15%)

Query: 476 SRQDKRAYEEAAKRL-KLEAESRDKIIPKLRIESRRKYLEKRKED---KVAELEADI--L 529
           SR +    +   +RL  L+ E         RIE+R   L +   D   K+ E+E +I  L
Sbjct: 669 SRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQL 728

Query: 530 DDEYLFSEQELTERERRDREHKRKLLELAKEHD--------KARELEKVQR-----YYMP 576
           + E    ++ L E E      ++++  +  E             +L K++          
Sbjct: 729 EQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARL 788

Query: 577 KGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKD---------REHQEQYEL 627
             ++ PE   E+ + E+    E  + E  ++     ++              +  ++  +
Sbjct: 789 SHSRIPEIQAELSKLEE----EVSRIE-ARLREIEQKLNRLTLEKEYLEKEIQELQEQRI 843

Query: 628 LLDNQIDFIQALSLPGNRDKGE--EEMTEKQKKKLNIEETKKSL 669
            L  QI  I+      N  K E  EE+ E +    ++E     L
Sbjct: 844 DLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDL 887



 Score = 31.2 bits (71), Expect = 5.1
 Identities = 31/182 (17%), Positives = 67/182 (36%), Gaps = 29/182 (15%)

Query: 452 RKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRK 511
             E +E    +   ++   ++  T    +   E    R +LE   ++    +  ++  R+
Sbjct: 335 LAEIEELEREI--EEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYRE 392

Query: 512 YLEK--------------------RKEDKVAELEADI--LDDEYLFSEQELTERERRDRE 549
            LEK                    R  +++A+L A I  ++ +    E+E  ++    ++
Sbjct: 393 KLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKK 452

Query: 550 HKRKLLELAKEHDKAR-ELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMS 608
            + KL +LA +  K   EL  ++  Y     +  +   E+ E E     + R  E+    
Sbjct: 453 QEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEA----QARASEERVRG 508

Query: 609 SA 610
             
Sbjct: 509 GR 510


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 41.8 bits (99), Expect = 0.002
 Identities = 44/188 (23%), Positives = 76/188 (40%), Gaps = 44/188 (23%)

Query: 480 KRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQE 539
           KR  EE  +RL+ E E  D+II +++ E   +  EKR+  K  +L  +I        ++ 
Sbjct: 85  KRRQEEYEERLQ-EREQMDEIIERIQEEDEAEAQEKRE--KQKKLREEI--------DEF 133

Query: 540 LTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQ 599
             ER  R  E K +  E         E  K+  Y   K     E+ EE + + +     +
Sbjct: 134 NEERIERKEEEKERERE---------EELKILEYQREKA----EREEEREAERR-----E 175

Query: 600 RKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKK 659
           RK E E+  + +     +  + +E+ + L                 D  +EE   K+++K
Sbjct: 176 RKEEKEREVARLRAQQEEAEDEREELDELRA---------------DLYQEEYERKERQK 220

Query: 660 LNIEETKK 667
              E  K+
Sbjct: 221 EKEEAEKR 228



 Score = 33.7 bits (78), Expect = 0.54
 Identities = 23/107 (21%), Positives = 48/107 (44%), Gaps = 18/107 (16%)

Query: 480 KRAYEEAAK--RLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFS- 536
               EE  +  R K + E+  +   + + E  ++  E++ E+K   L+ +  ++E     
Sbjct: 207 DLYQEEYERKERQKEKEEAEKRR--RQKQE-LQRAREEQIEEKEERLQEERAEEEAERER 263

Query: 537 -----------EQELTERER-RDREHKRKLLELAKEHDKARELEKVQ 571
                      EQE  E+ R +  EH+R+L +  +E ++ R  E+ +
Sbjct: 264 MLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREE 310



 Score = 33.7 bits (78), Expect = 0.61
 Identities = 40/203 (19%), Positives = 83/203 (40%), Gaps = 22/203 (10%)

Query: 477 RQDKRAYEE----AAKRLKLEAESRDKIIPKLRIESRRKYLEKRKE----DKVAELEADI 528
              +R  EE    A ++ + + + R++I         RK  EK +E     K+ E + + 
Sbjct: 103 EIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREK 162

Query: 529 LDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKV--QRYYMPKGTKGPEKYE 586
            + E     +    +E ++RE  R   +  +  D+  EL+++    Y      K  +K +
Sbjct: 163 AEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEK 222

Query: 587 EVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRD 646
           E  EK +    E ++  +EQ+     R+  +  E + + E +L+ Q             +
Sbjct: 223 EEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAERERMLEKQ------------AE 270

Query: 647 KGEEEMTEKQKKKLNIEETKKSL 669
             E E    +K+++   E ++ L
Sbjct: 271 DEELEQENAEKRRMKRLEHRREL 293


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 42.0 bits (99), Expect = 0.002
 Identities = 61/258 (23%), Positives = 117/258 (45%), Gaps = 39/258 (15%)

Query: 457 EFSERLKN-RDKNNTKNVATSRQDKRAYEEAAKRL-KLEAESR--DKIIPKLR-IESRRK 511
           E +E+LK   +K    N+    +    YE+  ++L KL+ E +   K + KL  ++ +  
Sbjct: 500 ELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLA 559

Query: 512 YLEKRK---EDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAK-EHDK---- 563
            LEK+    E+++AEL    L++    S +EL ER +       + LEL   E +     
Sbjct: 560 ELEKKLDELEEELAELL-KELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREE 618

Query: 564 ------ARELEKVQRYYMPKGTKGPEKYEEVDEKEK---VPHYEQRKWEDEQMSSAIFRV 614
                   EL+K          +  E  +E++E EK      YE+ + E  ++S  +  +
Sbjct: 619 KELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGL 678

Query: 615 GAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSLPIYPF 674
            A+  E        L+ + + I+        +K +EE+ E++K K  +E+ +K+L     
Sbjct: 679 RAELEE--------LEKRREEIKK-----TLEKLKEELEEREKAKKELEKLEKAL---ER 722

Query: 675 KQDLIDAVRDHQVLIIEG 692
            ++L + V+ ++ L+ E 
Sbjct: 723 VEELREKVKKYKALLKER 740



 Score = 35.8 bits (83), Expect = 0.21
 Identities = 46/226 (20%), Positives = 100/226 (44%), Gaps = 21/226 (9%)

Query: 462 LKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLR--------IESRRKYL 513
           +K R +   K +  +   +   +E  K L+      ++I  +L         +E   K L
Sbjct: 174 IKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKEL 233

Query: 514 EKRKEDKVAELEADILDDEYLFS--EQELTERERRDREHKRKLLELAKEHDKARELEKVQ 571
           E+ KE ++ ELE ++   E      E+++ E E R  E K+++ EL ++  + +EL++  
Sbjct: 234 EELKE-EIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKA 292

Query: 572 RYY------MPKGTKGPEKYEEVDEK--EKVPHYEQRKWEDEQMSSAIFRVGAKDREHQE 623
             Y        +      + E+   +  E++   E+R  E E+    +  +  K +E ++
Sbjct: 293 EEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEK 352

Query: 624 QYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLN--IEETKK 667
           + E L +    + +A +     ++ ++ +T    +KL   +EE +K
Sbjct: 353 RLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEK 398



 Score = 35.4 bits (82), Expect = 0.22
 Identities = 46/236 (19%), Positives = 96/236 (40%), Gaps = 30/236 (12%)

Query: 453 KERDEFSERLKNRDK------NNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRI 506
           K  +   E +K ++K           +++   + R   E  ++   E E   + I +L  
Sbjct: 186 KRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEK 245

Query: 507 ESRRKYLEKRK-EDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKAR 565
           E       KRK E+K+ ELE  I         +EL +      E  ++L EL ++ ++  
Sbjct: 246 ELESLEGSKRKLEEKIRELEERI---------EELKKEIEELEEKVKELKELKEKAEEYI 296

Query: 566 ELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQY 625
           +L +    Y+ +  +  ++   ++E+      E+R  E E+    +  +  K +E +++ 
Sbjct: 297 KLSEFYEEYLDELREIEKRLSRLEEEING--IEERIKELEEKEERLEELKKKLKELEKRL 354

Query: 626 ELL------------LDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSL 669
           E L               +++ ++        +K E+E+ E +K K  IEE    +
Sbjct: 355 EELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKI 410



 Score = 32.0 bits (73), Expect = 2.8
 Identities = 48/218 (22%), Positives = 89/218 (40%), Gaps = 36/218 (16%)

Query: 460 ERLKNRDKNNTKNVATSRQDKRAYEEAAKRL----KLEAESRDKIIPKLRIESRRKYLEK 515
           +  +N  KN  + +   ++     E+  KR     +L  E   ++   LR  +       
Sbjct: 158 DDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELP 217

Query: 516 RKEDKVAELEADILDDEYLFSEQELTERERRDRE-HKRKLLELAKEHDKARELEKVQRYY 574
              +++ +LE ++ + E L  E E  E+E    E  KRKL E  +E ++  E        
Sbjct: 218 ELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIE-------- 269

Query: 575 MPKGTKGPEKYEEVDE-KEKVPHYEQRKWEDEQMSSAI-FRVGAKDREHQEQYEL-LLDN 631
                   E  +E++E +EKV   ++ K + E+      F     D   + +  L  L+ 
Sbjct: 270 --------ELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEE 321

Query: 632 QIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSL 669
           +I+ I+            E + E ++K+  +EE KK L
Sbjct: 322 EINGIE------------ERIKELEEKEERLEELKKKL 347



 Score = 32.0 bits (73), Expect = 2.9
 Identities = 42/164 (25%), Positives = 69/164 (42%), Gaps = 37/164 (22%)

Query: 460 ERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKL-----RIESRRKYLE 514
           ERLK R    T         ++  EE  K  +   E   KI  ++      I+  +K +E
Sbjct: 375 ERLKKRLTGLTPEKL-----EKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIE 429

Query: 515 KRKEDK----VAELEADILDDEYLFSE------------QELTERERRDREHKRKL---- 554
           + K+ K    V   E      + L  E            +E+ E+ER+ R+  R+L    
Sbjct: 430 ELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVL 489

Query: 555 ---LELAKEHDKAREL----EKVQRYYMPKGTKGPEKYEEVDEK 591
               EL K  + A +L    EK+++Y + +  K  E+YE++ EK
Sbjct: 490 KKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEK 533


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 41.4 bits (97), Expect = 0.004
 Identities = 49/241 (20%), Positives = 79/241 (32%), Gaps = 34/241 (14%)

Query: 452 RKERDEFSERLKNRDK-----NNTKNVATSRQDKRAY-EEAAKRLKLEAESRDKIIPKLR 505
           R+E+  F ER++             +     Q  R   E   +  +L  E   +   +L 
Sbjct: 640 REEQAAFEERVEGLLAVLEAQFIDLSTLFCVQRLRVAAELQKEEARLALEGNIERTKELN 699

Query: 506 IESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKAR 565
            E R +    RKE  +   +    D E  F E    E++   RE + + LE   E   A 
Sbjct: 700 DELRAELELHRKE-ILDLFDCGTADTEDAFREAAREEQQLTQRESRLESLEAQLEGVAAE 758

Query: 566 ELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQY 625
             E                  E+ E+E     E     DE++     +V A  R+   Q 
Sbjct: 759 AYELSAS----------LDQRELKEEELALLEEAIDALDEEVEELHAQVAALSRQ-IAQL 807

Query: 626 ELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSLPIYPFKQDLIDAVRDH 685
           E                G       +  E  K+ L  E+ +K   +    Q L +A+R  
Sbjct: 808 E---------------GGGTVAELRQRRESLKEDLE-EKARKWASLRLAVQVLEEALRLF 851

Query: 686 Q 686
           +
Sbjct: 852 K 852



 Score = 31.7 bits (72), Expect = 3.5
 Identities = 54/265 (20%), Positives = 96/265 (36%), Gaps = 49/265 (18%)

Query: 443 HFLTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLE-AESRDKII 501
           H L +S      R          D+++ + +  + +     +E  K+L+ E    R+++ 
Sbjct: 208 HKLLESRRAEHARLAELRSELRADRDHIRALRDAVELWPRLQEW-KQLEQELTRRREELA 266

Query: 502 --PK---LRIESRRKYLEKRKED------KVAEL-----------EADILDDEYLF---- 535
             P+   LR+E R  +L+K + +      ++AEL           EA +     L     
Sbjct: 267 TFPRDGVLRLEKREAHLQKTEAEIDALLVRLAELKDLASQLIPAKEAVLQALVRLHQQLS 326

Query: 536 ----SEQELTE------RERRDREHKRKLLELAKEHDKARELEKVQRYYMPKG--TKGPE 583
               S  ELTE       + RD+E        A+     R LE + RY   +    +   
Sbjct: 327 EIKASAFELTETLAGIEADLRDKEEAAGNGFEAERVHDLRSLECMLRYQSSQRELKQTEA 386

Query: 584 KY-EEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLP 642
            Y + +DEK        R +EDE    A  R+   + E +   E   +      Q +   
Sbjct: 387 AYCKRLDEK--------RLFEDEAEEEARQRLADDEEEVRAGDEAREEKIAANSQVIDKE 438

Query: 643 GNRDKGEEEMTEKQKKKLNIEETKK 667
              +  +   T  QK++   E+   
Sbjct: 439 EVCNLYDRRDTAWQKQRFLREKQTA 463


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 41.2 bits (97), Expect = 0.004
 Identities = 39/248 (15%), Positives = 82/248 (33%), Gaps = 57/248 (22%)

Query: 478 QDKRA-YEEAA-----KRLKLEAESR------------------DKIIPKLRIESRR--K 511
           +++RA +EEAA     K  + E E +                  ++ +  L  ++ +  +
Sbjct: 155 EERRAIFEEAAGISKYKERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAER 214

Query: 512 YLEKRKEDKVAEL-----EADILDDEYLFSEQELTERERRDREHKRKL---------LEL 557
           Y E + E +  EL       + L +E    ++EL E E    E   +L         L L
Sbjct: 215 YKELKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRL 274

Query: 558 AKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEK-VPHYEQRKWEDEQMSSAIFRVGA 616
                +    E  +  Y            E+   E+      +R    E+    +     
Sbjct: 275 EVSELEEEIEELQKELY--------ALANEISRLEQQKQILRERLANLERQLEELEAQLE 326

Query: 617 KDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSLPIYPFKQ 676
           +     ++    L    +  +   L    +  E E+ E + +   +E   + L      +
Sbjct: 327 ELESKLDELAEELAELEE--KLEELKEELESLEAELEELEAELEELESRLEEL------E 378

Query: 677 DLIDAVRD 684
           + ++ +R 
Sbjct: 379 EQLETLRS 386



 Score = 35.0 bits (81), Expect = 0.35
 Identities = 28/146 (19%), Positives = 52/146 (35%), Gaps = 20/146 (13%)

Query: 446 TDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAES--------- 496
              ++   + DE +E L    +   + +    +   A  E  +    E ES         
Sbjct: 323 AQLEELESKLDELAEEL-AELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQL 381

Query: 497 ---RDKIIPKLR----IESRRKYLEKRKED---KVAELEADILDDEYLFSEQELTERERR 546
              R K+         + +  + LE R E    +   L+ +I +      E EL E +  
Sbjct: 382 ETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAE 441

Query: 547 DREHKRKLLELAKEHDKARELEKVQR 572
             E + +L EL +E ++  E  +  R
Sbjct: 442 LEELEEELEELQEELERLEEALEELR 467



 Score = 31.6 bits (72), Expect = 3.4
 Identities = 43/224 (19%), Positives = 84/224 (37%), Gaps = 23/224 (10%)

Query: 455 RDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLE 514
           R E  E  +  ++   +    SRQ     ++ A RL+ E E  ++ I +L  E       
Sbjct: 704 RKELEELEEELEQLRKELEELSRQISALRKDLA-RLEAEVEQLEERIAQLSKEL------ 756

Query: 515 KRKEDKVAELEADILDDEYLFSEQELTERERRDRE---HKRKLLELAKEHDKAR-ELEKV 570
              E ++ ELE  + + E   +E E  E E  + +    K +L  L +  D+ R EL  +
Sbjct: 757 TELEAEIEELEERLEEAEEELAEAE-AEIEELEAQIEQLKEELKALREALDELRAELTLL 815

Query: 571 QRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWED-----EQMSSAIFRVGAKDREHQEQY 625
                    +       +   E+     + + E+     E +++ I  +     E + + 
Sbjct: 816 NEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESEL 875

Query: 626 ELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSL 669
           E LL N+   ++             E+ E  ++   +E  +  L
Sbjct: 876 EALL-NERASLEE-----ALALLRSELEELSEELRELESKRSEL 913


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 40.0 bits (93), Expect = 0.011
 Identities = 35/196 (17%), Positives = 73/196 (37%), Gaps = 19/196 (9%)

Query: 473 VATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDE 532
            A SR+ ++  +E  K+L  E E+  ++I  L     ++   K +  K  E    + +  
Sbjct: 163 AAGSREKRKK-KERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYY-QLKEKL 220

Query: 533 YLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKE 592
            L  E  L     +  E +  LL+     D+  E+E  ++             E   E+E
Sbjct: 221 ELEEENLLYLDYLKLNEERIDLLQ-ELLRDEQEEIESSKQ-------------ELEKEEE 266

Query: 593 KVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEM 652
            +    Q   E+++          + +   ++ E L    +   +       + K  E+ 
Sbjct: 267 IL---AQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKE 323

Query: 653 TEKQKKKLNIEETKKS 668
            +K +K+L  E+ +  
Sbjct: 324 LKKLEKELKKEKEEIE 339



 Score = 32.2 bits (73), Expect = 2.6
 Identities = 28/206 (13%), Positives = 65/206 (31%), Gaps = 11/206 (5%)

Query: 448  SDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRI- 506
                 K  +E  ERL+              +++   E+  K      E ++K   K    
Sbjct: 846  EQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEE 905

Query: 507  ESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKA-- 564
            ES++  L + KE+++ E  A+       +  +            +    E  +++ +   
Sbjct: 906  ESQKDNLLEEKENEIEERIAEEAIILLKYESEP-----EELLLEEADEKEKEEDNKEEEE 960

Query: 565  RELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQ 624
               +++       G        E +EKE+   Y + + + E++      +  +  E +  
Sbjct: 961  ERNKRLLLAKEELGNVNLMAIAEFEEKEE--RYNKDELKKERLEEEKKELLREIIE-ETC 1017

Query: 625  YELLLDNQIDFIQALSLPGNRDKGEE 650
                   ++       L       E 
Sbjct: 1018 QRFKEFLELFVSINRGLNKVFFYLEL 1043


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 38.6 bits (90), Expect = 0.016
 Identities = 49/197 (24%), Positives = 79/197 (40%), Gaps = 30/197 (15%)

Query: 464 NRDKNNTKNVATSRQDKRAYEEAAKRLK-LEAESRDKI---IPKLRIESRRKYLEKRKED 519
            R +NN      S  D  A E  AK L+ LE+    K    + K++ ++R+  LE + + 
Sbjct: 15  KRTRNNGPATIASGFDPTALERGAKALRELESSPHAKKAFELSKMQEKTRQAELEAKIK- 73

Query: 520 KVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKAR-----ELEKVQRYY 574
              E EA           Q   ER R + E +RK L+   + ++ R     EL + +RY 
Sbjct: 74  ---EYEAQQ--------AQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELAR-KRYQ 121

Query: 575 MPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQID 634
                K  E+    +E E +   E+     E M  A      + R    + E  L+ +  
Sbjct: 122 -----KELEQQRRQNE-ELLKMQEESVLRQEAMRRATEEEILEMRRETIEEEAELEREN- 174

Query: 635 FIQALSLPGNRDKGEEE 651
            I+A      R + +EE
Sbjct: 175 -IRAKIEAEARGRAKEE 190


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 39.0 bits (90), Expect = 0.019
 Identities = 49/222 (22%), Positives = 88/222 (39%), Gaps = 21/222 (9%)

Query: 367 HPVIQLFTILPSSSTSSRLFTTLPSSLAAVPHHIQRQSPAQSYHRMTPGCTDRRTQSPCP 426
           HP +QL  +  +    +  +   P++L         Q   + + R +  C  R++ +   
Sbjct: 331 HPGLQLAAMEKAKHAEAANYGISPNTLI-------NQRNEEQHGRRSSSCASRQSANNVT 383

Query: 427 SCLKVFKTPKVRAFMGHFLTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDK-RAYEE 485
           +   +     V +        S DDRK   + +      + +    V     ++ R  +E
Sbjct: 384 NITSITSVTSVASVASVASVPSKDDRKYPQDGATHCHAVNGHYGGRVDKDHAERARIEKE 443

Query: 486 AAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERER 545
            A R  LE     KI+ K RIE     LE+ + +++     + ++ E L  E+   ER  
Sbjct: 444 NAHRKALEM----KILEKKRIER----LEREERERLERERMERIERERLERERLERERLE 495

Query: 546 RDREHKRKLLELAKEHDKARE---LEKVQR--YYMPKGTKGP 582
           RDR  + +L  L +E     E   LEK +R  Y++     G 
Sbjct: 496 RDRLERDRLDRLERERVDRLERDRLEKARRNSYFLKGMENGL 537


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 38.5 bits (90), Expect = 0.026
 Identities = 45/208 (21%), Positives = 93/208 (44%), Gaps = 24/208 (11%)

Query: 470 TKNVATSRQDKRAYEEAAKRLK-LEAESRDKIIPKLRIESRRKYL-----EKRKEDKVAE 523
           ++N  +S   KR  +E  + L  LEA+          +++  + L     E R++ +  E
Sbjct: 656 SRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELE 715

Query: 524 LEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPE 583
            + + L  E    E+EL + + R  E + +L EL +E ++ +E             +  E
Sbjct: 716 RQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQE-------------RLEE 762

Query: 584 KYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFI--QALSL 641
             EE++  E+     + K E E++      +  +  E +E+ E   + ++D +  +  SL
Sbjct: 763 LEEELESLEE--ALAKLKEEIEELEEKRQALQEELEELEEELE-EAERRLDALERELESL 819

Query: 642 PGNRDKGEEEMTEKQKKKLNIEETKKSL 669
              R++ E+E+ E +++   +EE    L
Sbjct: 820 EQRRERLEQEIEELEEEIEELEEKLDEL 847



 Score = 37.8 bits (88), Expect = 0.047
 Identities = 50/184 (27%), Positives = 89/184 (48%), Gaps = 14/184 (7%)

Query: 452 RKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLR--IESR 509
            +E +E  E  + R     + + +  Q +   E+  + L+ E E  ++ + +L   +E  
Sbjct: 795 LEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEEL 854

Query: 510 RKYLEKRKEDKVAELEADI--LDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARE- 566
            K LE+ KE ++ ELEA+   L+DE    E+E  E E   RE + +L EL +E +K RE 
Sbjct: 855 EKELEELKE-ELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRER 913

Query: 567 ----LEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQ 622
                 K++R       + PE  EE++E+ +     + + E E++   I  +G  +    
Sbjct: 914 LEELEAKLERL----EVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAI 969

Query: 623 EQYE 626
           E+YE
Sbjct: 970 EEYE 973



 Score = 34.3 bits (79), Expect = 0.52
 Identities = 46/241 (19%), Positives = 94/241 (39%), Gaps = 24/241 (9%)

Query: 445 LTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKL 504
           L       +E +E  E L+   +   + +    ++  + EEA  +LK E E  ++    L
Sbjct: 732 LEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQAL 791

Query: 505 RIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKA 564
           + E   +  E+ +E   AE   D L+ E    E     RER ++E +    E+       
Sbjct: 792 Q-EELEELEEELEE---AERRLDALEREL---ESLEQRRERLEQEIEELEEEI------- 837

Query: 565 RELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQ 624
            ELE+       +  +  ++ EE+ E+      E+ + E E++   +     +  E +E+
Sbjct: 838 EELEEKLDELEEELEELEKELEELKEEL-----EELEAEKEELEDEL----KELEEEKEE 888

Query: 625 YELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSLPIY-PFKQDLIDAVR 683
            E  L      +  L     + +   E  E + ++L +E  +    +   ++  L   + 
Sbjct: 889 LEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELE 948

Query: 684 D 684
            
Sbjct: 949 R 949



 Score = 33.1 bits (76), Expect = 1.1
 Identities = 41/190 (21%), Positives = 83/190 (43%), Gaps = 22/190 (11%)

Query: 484 EEAAKRLKLEAESRDKIIPKLRIESR--RKYLEKRKEDKVAELEADILDDEYLFSEQELT 541
           EE  +RL+   E  +K + KL  ++    +Y E + E +  ELE  +L  +     +EL 
Sbjct: 185 EENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELR--ELELALLLAKLKELRKELE 242

Query: 542 ERERRDREHKRKLLELAKE-HDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQR 600
           E E      + +L EL +E  +  +E+E+++            + EE+ E+      E+ 
Sbjct: 243 ELEEELSRLEEELEELQEELEEAEKEIEELK-----------SELEELREE-----LEEL 286

Query: 601 KWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQAL-SLPGNRDKGEEEMTEKQKKK 659
           + E  ++   I  +  +    +E+ E L +   +  + L  L    +  +EE+ E++   
Sbjct: 287 QEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLL 346

Query: 660 LNIEETKKSL 669
             +E+    L
Sbjct: 347 EELEQLLAEL 356


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 37.2 bits (86), Expect = 0.040
 Identities = 36/139 (25%), Positives = 61/139 (43%), Gaps = 15/139 (10%)

Query: 446 TDSDDDRKERDEFSERLK-----NRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKI 500
           ++ +++    D+   RLK      +D+   +        ++A EE AKR    AE R + 
Sbjct: 21  SEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEEEAKRK---AEERKRE 77

Query: 501 IPKLRIESRRKYLE--KRKEDKVAELEADILDDEYLFSEQEL-----TERERRDREHKRK 553
             K+  E  +K LE  KR     A ++    DDE    E E       +R +RDRE + +
Sbjct: 78  TLKIVEEEVKKELELKKRNTLLEANIDDVDTDDENEEEEYEAWKLRELKRIKRDREEREE 137

Query: 554 LLELAKEHDKARELEKVQR 572
           +     E +K R + + +R
Sbjct: 138 MEREKAEIEKMRNMTEEER 156


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 38.0 bits (88), Expect = 0.041
 Identities = 36/124 (29%), Positives = 57/124 (45%), Gaps = 11/124 (8%)

Query: 452 RKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRK 511
           +KE D    +L +        V+  ++DK++ +   KRLK EA+SR     +L  E +RK
Sbjct: 454 KKENDMLQTKLNSM-------VSAKQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRK 506

Query: 512 YLEKRKEDKVAELEADILDD--EYLFSEQELTERERRDREHKRKLLELAKE--HDKAREL 567
             E+    + A   A   ++  E L   ++  E E +  EH  KL E        +A+EL
Sbjct: 507 KEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQEL 566

Query: 568 EKVQ 571
            K Q
Sbjct: 567 RKYQ 570


>gnl|CDD|227545 COG5220, TFB3, Cdk activating kinase (CAK)/RNA polymerase II
           transcription initiation/nucleotide excision repair
           factor TFIIH, subunit TFB3 [Cell division and chromosome
           partitioning / Transcription / DNA replication,
           recombination, and repair].
          Length = 314

 Score = 36.9 bits (85), Expect = 0.057
 Identities = 33/157 (21%), Positives = 59/157 (37%), Gaps = 29/157 (18%)

Query: 409 YHRMTPGCTDR---RTQSPCPS--CLKV-----FKTP-----------KVRAFM----GH 443
           YHRM   C DR   R  + CP   C K+     F               VR  +      
Sbjct: 35  YHRMCESCVDRIFSRGPAQCPYKGCGKILRKIKFIKQTFEDITVEKEVDVRKRLLRAFNK 94

Query: 444 FLTDSDDDRKERDEFSERLKNRDKN--NTKNVATSRQDKRAYEEAAKRLKLEAESRDKII 501
              +   D  + +++ E ++          +V+ + +  + YEE  +   L    R    
Sbjct: 95  EEEEFGGDLAKYNDYLEEVEELVFELLELIDVSLTEEKVKKYEEMNQDSILNNLERPT-- 152

Query: 502 PKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQ 538
           P++    ++  L+KR + K  +LE  I ++E +  E 
Sbjct: 153 PEVMPGKQKNVLQKRMKLKKRQLERQIEEEERMNKEW 189


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 36.7 bits (85), Expect = 0.12
 Identities = 29/172 (16%), Positives = 53/172 (30%), Gaps = 45/172 (26%)

Query: 450 DDRKERDEFSERL----KNRDKNNT--------KNVATSRQDKRAYEE------------ 485
           D  K  DE  E L    K   +           + +     +++A E+            
Sbjct: 178 DLVKALDEAEEELANLRKELRQLEKEKQRLERLRRLLPLLAERKALEQQLAALGEVIDLP 237

Query: 486 ---AAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEAD--ILDD--------- 531
                +  +  AE R     +  +E   + LE  +  ++ E+  D  +L           
Sbjct: 238 PDAVERYEEARAELRAA---RRNLELLTERLEALQA-ELDEISLDEELLAQAAAIEALHQ 293

Query: 532 ---EYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTK 580
              EY  +EQ+L +RE      +     L  +     + E V+        K
Sbjct: 294 QRGEYRNAEQDLPDREGEIANAREAAAALLAQIGPDADEEAVESLRPSLAAK 345


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 35.5 bits (82), Expect = 0.13
 Identities = 32/149 (21%), Positives = 65/149 (43%), Gaps = 9/149 (6%)

Query: 453 KERDEFSERLKNRDKNNTK-NVATSRQDKRA--YEEAAKRLKLEAESRDKIIPKLRIESR 509
           +E+ E  ER++  +++  +        ++ A   EE  K+ + EA+  +K   +L  E+R
Sbjct: 5   REQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENR 64

Query: 510 RKYLEKRKEDKVAE---LEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARE 566
           R  LE+       E   LEA++ +     ++ E  ERE+++ E ++   EL +  +    
Sbjct: 65  R--LEEEAAASEEERERLEAEVDEATAEVAKLE-EEREKKEAETRQLQQELREAQEAHER 121

Query: 567 LEKVQRYYMPKGTKGPEKYEEVDEKEKVP 595
             +         T  P     V+ ++  P
Sbjct: 122 ARQELLEAAAAPTAPPHVAAPVNGEQLEP 150


>gnl|CDD|238550 cd01130, VirB11-like_ATPase, Type IV secretory pathway component
           VirB11, and related ATPases. The homohexamer, VirB11 is
           one of eleven Vir proteins, which are required for
           T-pilus biogenesis and virulence in the transfer of
           T-DNA from the Ti (tumor-inducing) plasmid of bacterial
           to plant cells. The pilus is a fibrous cell surface
           organelle, which mediates adhesion between bacteria
           during conjugative transfer or between bacteria and host
           eukaryotic cells during infection. VirB11- related
           ATPases include the archaeal flagella biosynthesis
           protein and the pilus assembly proteins CpaF/TadA and
           TrbB.  This alignment contains the C-terminal domain,
           which is the ATPase.
          Length = 186

 Score = 35.0 bits (81), Expect = 0.14
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 678 LIDAVRDHQVLIIEGETGSGKTT 700
           L  AV   + ++I G TGSGKTT
Sbjct: 18  LWLAVEARKNILISGGTGSGKTT 40


>gnl|CDD|222258 pfam13604, AAA_30, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. There is a Walker A and
           Walker B.
          Length = 195

 Score = 34.9 bits (81), Expect = 0.17
 Identities = 25/121 (20%), Positives = 49/121 (40%), Gaps = 35/121 (28%)

Query: 680 DAVRD-----HQVLIIEGETGSGKTT---QIPQYLHEAGFTKDKMKIGCTQPRRVAAMSV 731
           +AVR       +V +++G  G+GKTT      +    AG+             RV  ++ 
Sbjct: 8   EAVRHVLTSGDRVAVVQGPAGTGKTTSLKAAREAWEAAGY-------------RVIGLAP 54

Query: 732 SARVAEEMGVKLGCEVGYAIRFEDCTSERTIIKYMTDGTLHREFLSEPDLASYSVMIIDE 791
           + + A+ +G +LG E             RT+   +      +       L + +++++DE
Sbjct: 55  TGKAAKVLGEELGIEA------------RTLASLL--HRWDKGEDPGRVLDAGTLLVVDE 100

Query: 792 A 792
           A
Sbjct: 101 A 101


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 34.1 bits (79), Expect = 0.28
 Identities = 19/92 (20%), Positives = 44/92 (47%), Gaps = 2/92 (2%)

Query: 515 KRKEDKVAELEADILD-DEYLFSEQELTERERRDREHKRKLLELAKEHDKAR-ELEKVQR 572
            + + ++ +L+ ++ +  + +   Q   E+ ++ RE   +  EL +E  +   EL+K++ 
Sbjct: 65  NKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKA 124

Query: 573 YYMPKGTKGPEKYEEVDEKEKVPHYEQRKWED 604
                    PE+ E++ E+ KV      +W D
Sbjct: 125 ELEKYEKNDPERIEKLKEETKVAKEAANRWTD 156


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 34.5 bits (80), Expect = 0.29
 Identities = 16/69 (23%), Positives = 30/69 (43%), Gaps = 11/69 (15%)

Query: 452 RKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRK 511
            K R+E  E++    +            +R  E   K+ + + E R+  + KL  E +RK
Sbjct: 262 DKTREEEEEKILKAAEE-----------ERQEEAQEKKEEKKKEEREAKLAKLSPEEQRK 310

Query: 512 YLEKRKEDK 520
             EK ++ +
Sbjct: 311 LEEKERKKQ 319


>gnl|CDD|183704 PRK12726, PRK12726, flagellar biosynthesis regulator FlhF;
           Provisional.
          Length = 407

 Score = 34.7 bits (79), Expect = 0.29
 Identities = 28/120 (23%), Positives = 56/120 (46%), Gaps = 9/120 (7%)

Query: 592 EKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEE 651
           +  P  E+RK ++E++S+    + A +RE   +     +   DF++ L   G  D    +
Sbjct: 104 QSTPLLEERKTQEEELSAMRLELAALNRELAVKMREEREQNSDFVKFLKGRGISDTYVAD 163

Query: 652 MTEKQKKKLNIEETKK-------SLPIYPFKQDLIDA--VRDHQVLIIEGETGSGKTTQI 702
             +  +K+    ET          +P    K  + D+  + +H+++ + G+TG GKTT +
Sbjct: 164 FMQAGRKQFKQVETAHLDDITDWFVPYLSGKLAVEDSFDLSNHRIISLIGQTGVGKTTTL 223


>gnl|CDD|220245 pfam09444, MRC1, MRC1-like domain.  This putative domain is found
           to be the most conserved region in mediator of
           replication checkpoint protein 1.
          Length = 145

 Score = 33.5 bits (77), Expect = 0.30
 Identities = 28/119 (23%), Positives = 51/119 (42%), Gaps = 4/119 (3%)

Query: 456 DEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEK 515
           DE S+   + +     +   S++     E AAKR + E E  +K + KL  +     L +
Sbjct: 29  DEDSDDDDDSELEEMIDDEYSKEKAENDELAAKRAEEEREDDEKEVEKLLDDITTGGLRR 88

Query: 516 RKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQRYY 574
           R+     +LE    +DE     Q    + R     +++LLE   + DK  +  K + ++
Sbjct: 89  RRGKNGLDLELSDSEDE---LLQRRRLKRRELALMRQRLLE-DGDLDKLAKNPKSKAFF 143


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 34.7 bits (80), Expect = 0.32
 Identities = 42/220 (19%), Positives = 89/220 (40%), Gaps = 26/220 (11%)

Query: 452 RKERDEFSERLKN-RDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRR 510
           R+ERDE +E ++  ++K +  N           +E  +  +   E          +E   
Sbjct: 61  REERDEINEEVQELKEKRDEINAKLQEL----RKEYRELKEKRNEFNLGGRSIKSLEREI 116

Query: 511 KYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKE----HDKARE 566
           + LEK+++  V   E +    + +   ++  E  ++  E   KL EL  E      KARE
Sbjct: 117 ERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKARE 176

Query: 567 L-EKVQRY------YMPKGTKGPEKYEEVDEKEKVPH--YEQRKWEDEQMSSAIFRVGAK 617
           + EK+Q        Y  +  K  E+ +E+ ++    H  + +   + +++      +  +
Sbjct: 177 IHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNE 236

Query: 618 DREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQK 657
            RE +++ + L   +            R +  EE+ E+ +
Sbjct: 237 LRELEKKIKALRAKEKA--------AKRREKREELKERAE 268


>gnl|CDD|227095 COG4753, COG4753, Response regulator containing CheY-like receiver
           domain and AraC-type DNA-binding domain [Signal
           transduction mechanisms].
          Length = 475

 Score = 34.6 bits (80), Expect = 0.34
 Identities = 31/160 (19%), Positives = 52/160 (32%), Gaps = 16/160 (10%)

Query: 481 RAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQE- 539
           + Y E A++L  E         K  +       +  K+ K    E +      +  E   
Sbjct: 239 KVYREIAEKLGFEEADYFDYRFKKYLGMTPD--QYNKQLKQNITENNARGITAILEELFV 296

Query: 540 -LTERERRDREHK--------RKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDE 590
              E      +HK          L      +D+    E VQR +     +  E+ EE+  
Sbjct: 297 MGVEHLLESSKHKDALEIWYLLSLYVQMILNDEGLLEEIVQRIH---DIQSFEELEEITS 353

Query: 591 KEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLD 630
           +     Y  +  E+ + SS I +V     +H     L L 
Sbjct: 354 QILHVEYVIQSVENIEGSSLIQKVLDYIHKH-FSENLTLK 392


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 35.0 bits (81), Expect = 0.36
 Identities = 27/113 (23%), Positives = 50/113 (44%), Gaps = 14/113 (12%)

Query: 447 DSDDDRKERDEFSERLKNR-DKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLR 505
           D+ D+R  R+    R +NR ++   +      ++ +  E   K     A ++D+     R
Sbjct: 622 DTRDNRTRREGRENREENRRNRRQAQQQTAETRESQQAEVTEK-----ARTQDEQQQAPR 676

Query: 506 IE-SRRKYLEKRK---EDKVAELEADILDDEYLFSEQELTERERRDREHKRKL 554
            E  RR+  EKR+   E K   +E   + +     EQE   ++ + R  +R+L
Sbjct: 677 RERQRRRNDEKRQAQQEAKALNVEEQSVQET----EQEERVQQVQPRRKQRQL 725


>gnl|CDD|203709 pfam07652, Flavi_DEAD, Flavivirus DEAD domain. 
          Length = 146

 Score = 33.1 bits (76), Expect = 0.38
 Identities = 30/104 (28%), Positives = 46/104 (44%), Gaps = 17/104 (16%)

Query: 695 GSGKTTQ-IPQYLHEAGFTKDKMKIGCTQPRRVAAMSVSARVAEEMGVKL-GCEVGY--- 749
           G+GKT + +P+ + E      +++     P RV        V  EM   L G  + Y   
Sbjct: 12  GAGKTRKVLPELVREC--IDRRLRTLVLAPTRV--------VLAEMEEALRGLPIRYHTP 61

Query: 750 AIRFEDCTSERTIIKYMTDGTLHREFLSEPDLASYSVMIIDEAH 793
           A+  E    E  I+  M   T  +  LS   + +Y V+I+DEAH
Sbjct: 62  AVSSEHTGRE--IVDVMCHATFTQRLLSPVRVPNYEVIIMDEAH 103


>gnl|CDD|215832 pfam00270, DEAD, DEAD/DEAH box helicase.  Members of this family
           include the DEAD and DEAH box helicases. Helicases are
           involved in unwinding nucleic acids. The DEAD box
           helicases are involved in various aspects of RNA
           metabolism, including nuclear transcription, pre mRNA
           splicing, ribosome biogenesis, nucleocytoplasmic
           transport, translation, RNA decay and organellar gene
           expression.
          Length = 169

 Score = 33.4 bits (77), Expect = 0.42
 Identities = 28/142 (19%), Positives = 50/142 (35%), Gaps = 14/142 (9%)

Query: 672 YPFKQDLIDAVRDHQVLIIEGETGSGKTTQIPQYLHEAGFTKDKMKIG-CTQPRRVAAMS 730
            P +   I A+   + ++++  TGSGKT      + +A   K          P R  A  
Sbjct: 1   TPIQAQAIPAILSGKDVLVQAPTGSGKTLAFLLPILQALLPKKGGPQALVLAPTRELAEQ 60

Query: 731 VSARVAEEMGVKLGCEVGYAI-------RFEDCTSERTIIKYMTDGTLHREFLSEPDLAS 783
           +     +++   LG  V           +       +  I   T G L            
Sbjct: 61  I-YEELKKLFKILGLRVALLTGGTSLKEQARKLKKGKADILVGTPGRLLDLLRRGKLKLL 119

Query: 784 YSV--MIIDEAHPKVYVIDPGF 803
            ++  +++DEAH    ++D GF
Sbjct: 120 KNLKLLVLDEAH---RLLDMGF 138


>gnl|CDD|150406 pfam09727, CortBP2, Cortactin-binding protein-2.  This entry is the
           first approximately 250 residues of cortactin-binding
           protein 2. In addition to being a positional candidate
           for autism this protein is expressed at highest levels
           in the brain in humans. The human protein has six
           associated ankyrin repeat domains pfam00023 towards the
           C-terminus which act as protein-protein interaction
           domains.
          Length = 193

 Score = 33.3 bits (76), Expect = 0.45
 Identities = 18/84 (21%), Positives = 36/84 (42%)

Query: 480 KRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQE 539
           K        +L    + + K + +L  E R+     +K D    L     +      EQE
Sbjct: 95  KETQRRMLAQLLAAEKRQRKTVLELEEEKRKHIRYMKKSDDFTNLLEQERERLKKLLEQE 154

Query: 540 LTERERRDREHKRKLLELAKEHDK 563
            +++ ++++EH++ L  L +E  K
Sbjct: 155 KSQQAKKEQEHRKLLATLEEELGK 178


>gnl|CDD|238071 cd00124, MYSc, Myosin motor domain. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Myosins are actin-dependent molecular
           motors that play important roles in muscle contraction,
           cell motility, and organelle transport. The head domain
           is a molecular motor, which utilizes ATP hydrolysis to
           generate directed movement toward the plus end along
           actin filaments. A cyclical interaction between myosin
           and actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 679

 Score = 34.2 bits (79), Expect = 0.49
 Identities = 14/44 (31%), Positives = 24/44 (54%), Gaps = 4/44 (9%)

Query: 676 QDLIDAVRDHQVLIIEGETGSGKTT---QIPQYLHEAGFTKDKM 716
           ++++   R+ Q +II GE+G+GKT     I +YL     + D  
Sbjct: 78  RNMLRDRRN-QSIIISGESGAGKTENTKLIMKYLASLAGSNDTG 120


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 34.2 bits (79), Expect = 0.59
 Identities = 42/159 (26%), Positives = 72/159 (45%), Gaps = 33/159 (20%)

Query: 450 DDRKER-DEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIES 508
            +R+E  +E  ER +   +   + +    ++KR   EAA   + EAE   + + +L   S
Sbjct: 526 AERRETIEEKRERAEELRERAAE-LEAEAEEKR---EAAAEAEEEAEEAREEVAEL--NS 579

Query: 509 RRKYLEKRKE--DKVAELEADILDDEYLFSEQELTER---------ERRD-----REHKR 552
           +   L++R E  +++  L A I   +     + L E+         ERR+     RE KR
Sbjct: 580 KLAELKERIESLERIRTLLAAI--ADAEDEIERLREKREALAELNDERRERLAEKRERKR 637

Query: 553 KLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEK 591
              EL  E D+AR +E+ +        +  E  E+V+EK
Sbjct: 638 ---ELEAEFDEAR-IEEARE----DKERAEEYLEQVEEK 668



 Score = 33.5 bits (77), Expect = 0.85
 Identities = 31/121 (25%), Positives = 52/121 (42%), Gaps = 22/121 (18%)

Query: 451 DRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRR 510
           + KE  +  ERL N  ++    +    +      E A+  + EA+          +E   
Sbjct: 197 EEKEEKDLHERL-NGLESELAELDEEIERYEEQREQARETRDEADEV--------LEE-- 245

Query: 511 KYLEKRKEDKVAELEADILDDEYLFSEQELTERERRD-----REHKRKLLELAKEHDKAR 565
            + E+R+E  +  LEA+I D     +E   TERER +     R+ + +L EL +E D   
Sbjct: 246 -HEERREE--LETLEAEIEDLRETIAE---TEREREELAEEVRDLRERLEELEEERDDLL 299

Query: 566 E 566
            
Sbjct: 300 A 300



 Score = 30.8 bits (70), Expect = 6.0
 Identities = 27/118 (22%), Positives = 48/118 (40%), Gaps = 30/118 (25%)

Query: 449 DDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIES 508
           ++DR+  +E    L++        +    ++     E A+ L  EAE R +     R+E 
Sbjct: 471 EEDRERVEELEAELED--------LEEEVEEVEERLERAEDLV-EAEDRIE-----RLEE 516

Query: 509 RRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARE 566
           RR+ LE+   ++   +E                 RER +   +R   EL  E ++ RE
Sbjct: 517 RREDLEELIAERRETIEEK---------------RERAEELRERA-AELEAEAEEKRE 558



 Score = 30.4 bits (69), Expect = 7.5
 Identities = 29/130 (22%), Positives = 49/130 (37%), Gaps = 34/130 (26%)

Query: 452 RKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKI------IPKLR 505
             E  E  E ++  ++   +   T        +EA + L+   E R+++      I  LR
Sbjct: 212 ESELAELDEEIERYEEQREQARETR-------DEADEVLEEHEERREELETLEAEIEDLR 264

Query: 506 -------------------IESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERR 546
                              +  R + LE+ ++D +AE   D  D E +  E    E E R
Sbjct: 265 ETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAV--EARREELEDR 322

Query: 547 DREHKRKLLE 556
           D E + +L E
Sbjct: 323 DEELRDRLEE 332


>gnl|CDD|240439 PTZ00491, PTZ00491, major vault protein; Provisional.
          Length = 850

 Score = 34.2 bits (79), Expect = 0.62
 Identities = 32/115 (27%), Positives = 49/115 (42%), Gaps = 15/115 (13%)

Query: 460 ERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKED 519
           ER K  DK          Q  +  E  A+   +E+  + +       E+R   +E   E 
Sbjct: 684 ERQKMHDK-----AKAEEQRTKLLELQAESAAVESSGQSRAEALAEAEARL--IEAEAEV 736

Query: 520 KVAELEADILDDEYLFSEQELT-ERERRDREHK--RKLLELAKEHDKARELEKVQ 571
           + AEL A  L  E    E EL   R+R++ E +  +   EL  E  KA+EL  ++
Sbjct: 737 EQAELRAKALRIEA---EAELEKLRKRQELELEYEQAQNEL--EIAKAKELADIE 786


>gnl|CDD|173607 PTZ00417, PTZ00417, lysine-tRNA ligase; Provisional.
          Length = 585

 Score = 33.8 bits (77), Expect = 0.65
 Identities = 28/107 (26%), Positives = 49/107 (45%), Gaps = 25/107 (23%)

Query: 376 LPSSSTSSRLFTTLPSSLAA---------VPHHIQRQSPAQSYHRMTPGCTDRRTQSPCP 426
           +P+  T+++L   L S             +  H Q  SP   YHR  PG T+R       
Sbjct: 441 MPNPPTAAKLLDQLASHFIENKYPNKPFFIIEHPQIMSPLAKYHRSKPGLTER------- 493

Query: 427 SCLKVFKTPK--VRAFMGHFLTDSDDDRKERDEFSERLKNRDKNNTK 471
             L++F   K  + A+     T+ +D  K+++ FS + K+R+K + +
Sbjct: 494 --LEMFICGKEVLNAY-----TELNDPFKQKECFSAQQKDREKGDAE 533


>gnl|CDD|184377 PRK13894, PRK13894, conjugal transfer ATPase TrbB; Provisional.
          Length = 319

 Score = 33.6 bits (77), Expect = 0.67
 Identities = 14/34 (41%), Positives = 24/34 (70%)

Query: 675 KQDLIDAVRDHQVLIIEGETGSGKTTQIPQYLHE 708
           ++ +I AVR H+ +++ G TGSGKTT +   ++E
Sbjct: 138 REAIIAAVRAHRNILVIGGTGSGKTTLVNAIINE 171


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 33.0 bits (76), Expect = 0.71
 Identities = 31/144 (21%), Positives = 59/144 (40%), Gaps = 24/144 (16%)

Query: 447 DSDDDRKERDEFSERLKNRDKNNTKNVATSRQ--DKRAYEEAAKRLKLEAESRDKIIPKL 504
              ++ K  +  S++     +N   N+  S+         +   +LK + + +     +L
Sbjct: 38  SEAEEVKNEENKSKKKAAPIENAEGNIVFSKVEFADGEQAKKDLKLKKKKKKKKTDYKQL 97

Query: 505 --RIESRRKYLEKRKEDKVAELEADIL--------------DDEYLFS------EQELTE 542
             ++E+R+K LE+  EDK AE+E                  DDE L        E++  +
Sbjct: 98  LKKLEARKKKLEELDEDKAAEIEEKEKWTKALAKAEGVKVKDDEKLLKKALKRKEKQKKK 157

Query: 543 RERRDREHKRKLLELAKEHDKARE 566
            ++  +E K K+ +   E  K RE
Sbjct: 158 SKKEWKERKEKVEKKKAERQKKRE 181


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 33.7 bits (77), Expect = 0.78
 Identities = 26/114 (22%), Positives = 50/114 (43%), Gaps = 2/114 (1%)

Query: 453 KERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKY 512
           KERD     L + DK  T+    S Q  +  EEA   ++       K+  +L        
Sbjct: 413 KERDGLRAILNSYDKELTETSV-SGQLMKRLEEAEDLVQKVQSHLAKMENQLSELEEDVG 471

Query: 513 LEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARE 566
            +K + + + E E  +L ++ + +E+ L+  +      + K+  L +E D+ R+
Sbjct: 472 QQKDRNNTL-ETEIKLLKEQLVSNERLLSFVKEATNALRLKIETLERERDRLRQ 524


>gnl|CDD|234011 TIGR02782, TrbB_P, P-type conjugative transfer ATPase TrbB.  The
           TrbB protein is found in the trb locus of Agrobacterium
           Ti plasmids where it is involved in the type IV
           secretion system for plasmid conjugative transfer. TrbB
           is a homolog of the vir system VirB11 ATPase , and the
           Flp pilus sytem ATPase TadA [Cellular processes,
           Conjugation].
          Length = 299

 Score = 33.2 bits (76), Expect = 0.78
 Identities = 11/23 (47%), Positives = 16/23 (69%)

Query: 678 LIDAVRDHQVLIIEGETGSGKTT 700
           L +AV   + +++ G TGSGKTT
Sbjct: 125 LREAVLARKNILVVGGTGSGKTT 147


>gnl|CDD|223363 COG0286, HsdM, Type I restriction-modification system
           methyltransferase subunit [Defense mechanisms].
          Length = 489

 Score = 33.2 bits (76), Expect = 0.93
 Identities = 20/136 (14%), Positives = 49/136 (36%), Gaps = 15/136 (11%)

Query: 426 PSCLKVFKTPKVRAFMGHFLTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEE 485
           P+ +      K          D+  +  E+    +RL   +     +     ++   + +
Sbjct: 367 PTNILFLTKNKPAERNDVLFIDASKEHFEKPLNKKRLTEENIEKIVDTYREFKEIEGFSK 426

Query: 486 AAKRLKLEAESRDKIIPKLRIESRRKYLEKRKED--KVAELEADILDDEYLFSEQELTER 543
           +         S ++I       +  +Y+ + +E+   + EL  ++ + +      EL E 
Sbjct: 427 SV--------SLEEIRENDYNLNPGRYVAEEEEEDIDLDELLQELSEIDA-----ELAEL 473

Query: 544 ERRDREHKRKLLELAK 559
           E+   E  ++LL   K
Sbjct: 474 EKELEEILKELLLNLK 489


>gnl|CDD|218380 pfam05010, TACC, Transforming acidic coiled-coil-containing protein
           (TACC).  This family contains the proteins TACC 1, 2 and
           3 the genes for which are found concentrated in the
           centrosomes of eukaryotic and may play a conserved role
           in organising centrosomal microtubules. The human TACC
           proteins have been linked to cancer and TACC2 has been
           identified as a possible tumour suppressor (AZU-1). The
           functional homologue (Alp7) in Schizosaccharomyces pombe
           has been shown to be required for organisation of
           bipolar spindles.
          Length = 207

 Score = 32.4 bits (74), Expect = 0.93
 Identities = 31/123 (25%), Positives = 50/123 (40%), Gaps = 26/123 (21%)

Query: 476 SRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLF 535
           S++D  A  +  KR  + +E   + + K   + R + LE  K   V E E  I       
Sbjct: 3   SQKDMDAALKKIKREVISSELESEELKKKYEKLRSENLEMGK--IVDEFEKTI------- 53

Query: 536 SEQELTERERRDREHKRKLLELAKEHDKA--------RELEKVQRYYMPKGTKGPEKYEE 587
             Q + E +++    KR+L E+  E D+A             + + Y        EKY+E
Sbjct: 54  -AQMIEESQKQKELSKRELQEVLAEKDQAYADLNSLETSFSDLFKRY--------EKYKE 104

Query: 588 VDE 590
           V E
Sbjct: 105 VIE 107


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 33.1 bits (75), Expect = 1.1
 Identities = 32/177 (18%), Positives = 65/177 (36%), Gaps = 15/177 (8%)

Query: 460 ERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKED 519
           E L+   +   + V   + + +  ++  ++     E RD+I  K       + + K  E+
Sbjct: 189 ETLRQVRQTQGQKVQEHQMELKYLKQYKEK---ACEIRDQITSKEAQLESSREIVKSYEN 245

Query: 520 KVAELEADILDDEYLFS-----EQELTERERR--DREHKRKLLELAKEHDKARELEKVQR 572
           ++  L+  + + E+  S     + E+   + R    E     LEL  E       E++  
Sbjct: 246 ELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLND 305

Query: 573 YYMPKGTKGPEKYEE-VDEKEKVPHYEQRKWEDEQMSSA----IFRVGAKDREHQEQ 624
            Y        EK  E VD + ++    + +    Q  +       R+  +   HQE 
Sbjct: 306 LYHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQEH 362


>gnl|CDD|219890 pfam08537, NBP1, Fungal Nap binding protein NBP1.  NBP1 is a
           nuclear protein which has been shown in Saccharomyces
           cerevisiae to be essential for the G2/M transition of
           the cell cycle.
          Length = 313

 Score = 32.9 bits (75), Expect = 1.1
 Identities = 29/162 (17%), Positives = 55/162 (33%), Gaps = 11/162 (6%)

Query: 437 VRAFMGHFLTDSDDDRKER---DEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLE 493
           ++     F   +D  ++E    DE  ER K R K   K   +  +  +   + A  LK  
Sbjct: 5   LKGLWKDFWIKTDGRKREYGSLDEPRERSKVRRKRARKTSKSYLKVSKPNTDRASELKRR 64

Query: 494 AESRDKIIPKLR----IESRRKYLEKRKED----KVAELEADILDDEYLFSEQELTERER 545
            E R     + R      + +        D    + A +   + +      +++  +R  
Sbjct: 65  TERRGPPTKRRRQRTFWNNIKSVFSNDNRDLSKMQNASINYRLPNSPADEEQEDAKDRIL 124

Query: 546 RDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEE 587
           R    K+K+LEL        EL +  +       + P    +
Sbjct: 125 RSEAFKKKVLELKYNKRMLEELRRGSQDGRSNRDRNPSLNLD 166


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 33.3 bits (77), Expect = 1.2
 Identities = 28/116 (24%), Positives = 54/116 (46%), Gaps = 12/116 (10%)

Query: 484 EEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTER 543
           EEA K +  + E  +++I  L  E  R+  +K +E +    EA+ L +E    +++L E 
Sbjct: 505 EEAKKLIGEDKEKLNELIASL-EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEE 563

Query: 544 ERRDREHK----RKLLELAKEHDKARELEKVQR--YYMPKGTKGPEKYEEVDEKEK 593
           E +  E      ++ ++ AK+     E +++ +    + KG     K  E+ E  K
Sbjct: 564 EDKLLEEAEKEAQQAIKEAKK-----EADEIIKELRQLQKGGYASVKAHELIEARK 614



 Score = 31.0 bits (71), Expect = 5.4
 Identities = 19/99 (19%), Positives = 42/99 (42%), Gaps = 4/99 (4%)

Query: 480 KRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEAD-ILDDEYLFSEQ 538
            +  E+  + L+ + E   +   KL  E+ ++  +  KE   A+ EAD I+ +     + 
Sbjct: 543 LKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE---AKKEADEIIKELRQLQKG 599

Query: 539 ELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPK 577
                +  +    RK L  A E  + ++ ++ ++    K
Sbjct: 600 GYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 32.8 bits (76), Expect = 1.4
 Identities = 27/120 (22%), Positives = 56/120 (46%), Gaps = 20/120 (16%)

Query: 471 KNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRI-ESRRKYLEKR---------KEDK 520
           K +A ++   +  EE AKR+  EA+   + I K  + E++ +  + R         + ++
Sbjct: 26  KKIAEAKI--KEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNE 83

Query: 521 VAELEADILD--------DEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQR 572
           + +LE  +L          E L   +E  E++ ++ E K++ LE  +E  +    E++Q 
Sbjct: 84  LQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 31.6 bits (71), Expect = 1.8
 Identities = 34/134 (25%), Positives = 58/134 (43%), Gaps = 7/134 (5%)

Query: 537 EQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPH 596
           EQE  E+E +DR  + +L   A E    RE E  ++       K  +   + +E+EK   
Sbjct: 43  EQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQ 102

Query: 597 YEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQ 656
            EQ + + ++  +      A+ RE  E+    L+ +  F Q       R K  EE+ ++ 
Sbjct: 103 EEQERIQKQKEEAE-----ARAREEAERMR--LEREKHFQQIEQERLERKKRLEEIMKRT 155

Query: 657 KKKLNIEETKKSLP 670
           +K     + KK  P
Sbjct: 156 RKSEVSPQVKKEDP 169



 Score = 30.4 bits (68), Expect = 3.8
 Identities = 26/125 (20%), Positives = 56/125 (44%), Gaps = 2/125 (1%)

Query: 449 DDDRKERDEFSERLKNRDKNNTKNVATSRQDK-RAYEEAAKRLKLEAESRDKIIPKLRIE 507
           + +R+E++E     +   K         R+++ R  EE   R K E   R     + + +
Sbjct: 43  EQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQ 102

Query: 508 SRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKAREL 567
             ++ ++K+KE+  A    +  +   L  E+   + E+   E K++L E+ K   K+   
Sbjct: 103 EEQERIQKQKEEAEARAR-EEAERMRLEREKHFQQIEQERLERKKRLEEIMKRTRKSEVS 161

Query: 568 EKVQR 572
            +V++
Sbjct: 162 PQVKK 166


>gnl|CDD|224124 COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense
           mechanisms].
          Length = 733

 Score = 32.4 bits (74), Expect = 1.9
 Identities = 16/60 (26%), Positives = 26/60 (43%), Gaps = 11/60 (18%)

Query: 18  SYSVMIIDEAHERTLHTD-----ILFGLVKDIVRFRSDLKLLISSATLDAQKFSEFFDDA 72
             S++I+DE H   L+ D      L  L++ +      + +L+ SATL      E    A
Sbjct: 338 LTSLVILDEVH---LYADETMLAALLALLEALAEA--GVPVLLMSATLPPF-LKEKLKKA 391


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 32.2 bits (74), Expect = 1.9
 Identities = 36/128 (28%), Positives = 63/128 (49%), Gaps = 27/128 (21%)

Query: 465 RDKNNTKNVATSRQDKRAYEEAAKR----LKLEA--ESRDKIIPKLRIE------SRRKY 512
           R +   K + ++ +  +   E AK+    LK EA  E+++++  KLR E       RR  
Sbjct: 19  RKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVH-KLRAELERELKERRNE 77

Query: 513 LEKRKEDKVAELEADILD---------DEYLFS-EQELTERERRDREHKRKLLELAKEHD 562
           L +R E ++ + E + LD         +E L   E+EL+ +E+   E + +L EL  E  
Sbjct: 78  L-QRLERRLLQRE-ETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQR 135

Query: 563 KARELEKV 570
           +  ELE++
Sbjct: 136 E--ELERI 141


>gnl|CDD|224308 COG1390, NtpE, Archaeal/vacuolar-type H+-ATPase subunit E [Energy
           production and conversion].
          Length = 194

 Score = 31.6 bits (72), Expect = 2.0
 Identities = 28/99 (28%), Positives = 45/99 (45%), Gaps = 9/99 (9%)

Query: 449 DDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAE-SRDKIIPKLRIE 507
            +  +E +E  E  +  +    K  A    ++   EE  ++ + EAE  R +II    +E
Sbjct: 13  REAEEEAEEILEEAR-EEAEKIKEEAKREAEEAI-EEILRKAEKEAERERQRIISSALLE 70

Query: 508 SRRKYLEKRKE--DKVAELEADILD----DEYLFSEQEL 540
           +RRK LE ++E  + V E   + L     D    S QEL
Sbjct: 71  ARRKLLEAKEEILESVFEAVEEKLRNIASDPEYESLQEL 109


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 31.9 bits (73), Expect = 2.1
 Identities = 25/160 (15%), Positives = 53/160 (33%), Gaps = 28/160 (17%)

Query: 445 LTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKI---- 500
            +DS+D+  E  + + +    D +   +     +     E  A++ K    +  +I    
Sbjct: 129 SSDSEDE--EEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTPA 186

Query: 501 ----IPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLE 556
               I +LR+E         K  +  +   +   DE + ++         D E   K   
Sbjct: 187 DFAKIQELRLEKGVDKALGGKLKRRDKDAPERHSDELVDAD---------DIEGPAKK-- 235

Query: 557 LAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPH 596
             K+  K   +   +     +G +  EK+     K+    
Sbjct: 236 --KKQTKEERIATAK-----EGREDREKFGSRKGKKDKEG 268


>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit.  This
           family includes the vacuolar ATP synthase E subunit, as
           well as the archaebacterial ATP synthase E subunit.
          Length = 195

 Score = 31.5 bits (72), Expect = 2.2
 Identities = 24/91 (26%), Positives = 40/91 (43%), Gaps = 11/91 (12%)

Query: 471 KNVATSRQDKRAYEEAAKRLKLEAESRDKI-IPKLRIESRRKYLEKRKEDKVAELEADIL 529
           K  A    +K+  E   K+ K +AE   +I I   + E+R K L  R+E         +L
Sbjct: 25  KAEAVEEAEKKIEEIYEKKEK-QAEMEKQIIISNAKNEARLKVLNAREE---------LL 74

Query: 530 DDEYLFSEQELTERERRDREHKRKLLELAKE 560
           D  +  +++ L        E+K  L +L K+
Sbjct: 75  DSVFEEAKERLANLSEDKDEYKDLLKDLIKQ 105


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 31.6 bits (72), Expect = 2.2
 Identities = 16/77 (20%), Positives = 35/77 (45%), Gaps = 5/77 (6%)

Query: 484 EEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTER 543
           +E  +  K +  ++ K   K  ++++ +  EK K +K  +   ++ +D     E EL E+
Sbjct: 42  DEEKEEEKAKVAAKAK--AKKALKAKIEEKEKAKREKEEKGLRELEEDT---PEDELAEK 96

Query: 544 ERRDREHKRKLLELAKE 560
            R  +  +   L  A +
Sbjct: 97  LRLRKLQEESDLNNAAD 113


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 32.0 bits (73), Expect = 2.3
 Identities = 16/97 (16%), Positives = 38/97 (39%), Gaps = 3/97 (3%)

Query: 476 SRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRR-KYLEKRKEDKVAELEA--DILDDE 532
               +          +   +  ++ + +L  E+   K   +  + ++ +LE+  +    E
Sbjct: 406 KEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRRE 465

Query: 533 YLFSEQELTERERRDREHKRKLLELAKEHDKARELEK 569
                ++  E   RDR  +R   EL ++  +  ELE+
Sbjct: 466 VRDKVRKDREIRARDRRIERLEKELEEKKKRVEELER 502


>gnl|CDD|234232 TIGR03499, FlhF, flagellar biosynthetic protein FlhF.  [Cellular
           processes, Chemotaxis and motility].
          Length = 283

 Score = 31.9 bits (73), Expect = 2.3
 Identities = 28/164 (17%), Positives = 53/164 (32%), Gaps = 16/164 (9%)

Query: 543 RERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKW 602
                 E  +K LE A         E ++     +         +  E        +++ 
Sbjct: 57  AAASAEEEAKKALEPADPAKLPATAEPLELPAPLEEP--AAPAAQAAEPLLPEAELRKEL 114

Query: 603 ED-EQMSSAIFRVGAKDREHQEQ---YELLLDNQI--DFIQALSLPGNRDKGEEEMTEKQ 656
               ++   +    A  R   E     E LL+  +  +  + L      D   E+     
Sbjct: 115 SALRELLERLLAGLALQRRDPEGAKLLERLLEAGVSEELARELLEKLPEDLDAEDAWRWL 174

Query: 657 KKKLNIEETKKSLPIYPFKQDLIDAVRDHQVLIIEGETGSGKTT 700
           ++ L     +  LP+ P +  ++       V+ + G TG GKTT
Sbjct: 175 REAL-----EGMLPVKPEEDPIL---ERGGVIALVGPTGVGKTT 210


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 32.1 bits (73), Expect = 2.3
 Identities = 26/96 (27%), Positives = 42/96 (43%), Gaps = 6/96 (6%)

Query: 484 EEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTER 543
           E+A        E  + +I KL     ++  +K +  +    E + L  E     +EL ER
Sbjct: 500 EQAKTFYGEFKEEINVLIEKL-SALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKER 558

Query: 544 ERRDREHKRKLLELAKEHDKA--RELEKVQRYYMPK 577
           E   R  K +L + A+E  KA  +E+E + R    K
Sbjct: 559 E---RNKKLELEKEAQEALKALKKEVESIIRELKEK 591


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 31.7 bits (72), Expect = 2.7
 Identities = 36/228 (15%), Positives = 65/228 (28%), Gaps = 37/228 (16%)

Query: 453 KERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRL----------KLEAESRDKIIP 502
              +EF +    +                      K+           + E   +D+I  
Sbjct: 42  NTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAK 101

Query: 503 KLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHD 562
                     LEK KE ++ E E D L  E     Q   E   + RE+ +    L  E++
Sbjct: 102 L--EALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENE 159

Query: 563 KARE--LEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDRE 620
           K  E  LE  +  +  +  +     E                E+E+   + + +  K + 
Sbjct: 160 KKLEESLELEREKFEEQLHEANLDLE--------------FKENEEQRESKWAILKKLKR 205

Query: 621 HQEQYELLLDNQIDFIQALSLPGNR---------DKGEEEMTEKQKKK 659
             E     +  +   +   S   ++         DKG        K  
Sbjct: 206 RAELGSQQVQGEALELPNESFIRSKFPSDINEKADKGFRGFDCLNKVP 253


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 30.7 bits (70), Expect = 2.9
 Identities = 20/67 (29%), Positives = 36/67 (53%), Gaps = 1/67 (1%)

Query: 497 RDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLE 556
           R K+   L ++ + K  E+ +E +V EL+ +   DE L  ++    +  + RE++RK  E
Sbjct: 87  RKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELL-EKELAKLKREKRRENERKQKE 145

Query: 557 LAKEHDK 563
           + KE  K
Sbjct: 146 ILKEQMK 152


>gnl|CDD|223813 COG0742, COG0742, N6-adenine-specific methylase [DNA replication,
           recombination, and repair].
          Length = 187

 Score = 31.0 bits (71), Expect = 2.9
 Identities = 18/73 (24%), Positives = 29/73 (39%), Gaps = 3/73 (4%)

Query: 865 QRINLGNAVLMLKALGINDLIHFDFLDPPPHETLVLALEQLYALGALNHHGDLTKLGRRM 924
            R+   +A+  LK LG  +     FLDPP  + L+     L  L     +G L      +
Sbjct: 95  ARVLRNDALRALKQLGTREPFDLVFLDPPYAKGLLDKELALLLLE---ENGWLKPGALIV 151

Query: 925 AEFPLDPMMGKML 937
            E   D  + ++ 
Sbjct: 152 VEHDKDVELPELP 164


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 31.6 bits (70), Expect = 2.9
 Identities = 26/108 (24%), Positives = 50/108 (46%), Gaps = 4/108 (3%)

Query: 480 KRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQE 539
           K   ++A K++  + E+  K+  K   E   +  E   + + A+ E    + +   +EQE
Sbjct: 72  KSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEE----EHKQWKAEQE 127

Query: 540 LTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEE 587
             ++E+  +E   K L+  K+ +KA + EK ++    K    P   EE
Sbjct: 128 RIQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKKASTPAPVEE 175


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 31.7 bits (72), Expect = 3.1
 Identities = 20/82 (24%), Positives = 38/82 (46%), Gaps = 6/82 (7%)

Query: 513 LEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQR 572
           L +R E  V E E  I+  E    E+EL +   +  + +  L +     +  +ELE+ + 
Sbjct: 362 LLERLEKDVEEGEKTIVK-EARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEE 420

Query: 573 YYMPKGTKGPEKYEEVDEKEKV 594
            ++ + +K     EE +  EK+
Sbjct: 421 NFLKELSK-----EEKELLEKL 437


>gnl|CDD|220178 pfam09321, DUF1978, Domain of unknown function (DUF1978).  Members
           of this family are found in various hypothetical
           proteins produced by the bacterium Chlamydia pneumoniae.
           Their exact function has not, as yet, been identified.
          Length = 241

 Score = 31.0 bits (70), Expect = 3.1
 Identities = 16/95 (16%), Positives = 39/95 (41%), Gaps = 7/95 (7%)

Query: 445 LTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRL-KLEAESRDKIIPK 503
           +++ +D+ ++++  S   ++ +           Q  R  E+  K     E E R++   +
Sbjct: 132 VSEREDETEKQETVSSYARDLETALDHIEVEYVQCVREQEDYWKEEDVKEQELREEGGER 191

Query: 504 LRIESRRKYLE------KRKEDKVAELEADILDDE 532
            R+E  ++ LE      K  + ++   E  +   E
Sbjct: 192 KRLEEIQEGLESLEDLLKATKKRLDFAEEHLEKAE 226


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 31.0 bits (71), Expect = 3.3
 Identities = 25/98 (25%), Positives = 48/98 (48%), Gaps = 12/98 (12%)

Query: 460 ERLKNRDKNNTKNVATSRQD--------KRAYEEAAK-RLKLEAESRDKIIPKLRIESRR 510
             +K  +++N K +A   Q+        K+A EE  + R KL+   +DK   K  +++R 
Sbjct: 37  AEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLK-NLKARL 95

Query: 511 KYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDR 548
           K LEK  ++   + E+++L+  +   E+E  E   +  
Sbjct: 96  KELEKELKN--LKWESEVLEQRFEKVERERDELYDKFE 131


>gnl|CDD|223435 COG0358, DnaG, DNA primase (bacterial type) [DNA replication,
           recombination, and repair].
          Length = 568

 Score = 31.7 bits (72), Expect = 3.3
 Identities = 12/49 (24%), Positives = 21/49 (42%), Gaps = 3/49 (6%)

Query: 648 GEEEMTEKQKKKLNIEET---KKSLPIYPFKQDLIDAVRDHQVLIIEGE 693
           G   + + + K LN  ET   KK   +Y          +  Q++++EG 
Sbjct: 208 GGRVLGDDKPKYLNSPETELFKKGEELYGLDPARKKIAKADQIIVVEGY 256


>gnl|CDD|237260 PRK12906, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 796

 Score = 31.6 bits (72), Expect = 3.5
 Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 2/47 (4%)

Query: 663 EETKKSLPIYPFKQDLIDAVRDHQVLIIEGETGSGKT--TQIPQYLH 707
           E  K+ L + PF   +I  +  H+  I E +TG GKT    +P YL+
Sbjct: 71  EGAKRVLGLRPFDVQIIGGIVLHEGNIAEMKTGEGKTLTATLPVYLN 117


>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
           splicing factor.  These splicing factors consist of an
           N-terminal arginine-rich low complexity domain followed
           by three tandem RNA recognition motifs (pfam00076). The
           well-characterized members of this family are auxilliary
           components of the U2 small nuclear ribonuclearprotein
           splicing factor (U2AF). These proteins are closely
           related to the CC1-like subfamily of splicing factors
           (TIGR01622). Members of this subfamily are found in
           plants, metazoa and fungi.
          Length = 509

 Score = 31.4 bits (71), Expect = 3.7
 Identities = 24/142 (16%), Positives = 51/142 (35%), Gaps = 22/142 (15%)

Query: 455 RDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLE 514
           RDE  +R + + +   ++ ++ R  +R+ + +  R       R +     R   R    +
Sbjct: 1   RDEEPDREREKSRGRDRDRSSERPRRRSRDRSRFR------DRHR-----RSRERSYRED 49

Query: 515 KRKEDKVAELEADILDDEYLFSEQELTERERRDR------EHKRKLLELAKEHDKARELE 568
            R  D+            Y    +      RR R      +H+R+L + +  +   ++ +
Sbjct: 50  SRPRDRRRYDSRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDK 109

Query: 569 KVQRYYMPKGTKGPEKYEEVDE 590
           K   + +      P  YE V  
Sbjct: 110 KRSLWDIK-----PPGYELVTA 126


>gnl|CDD|240364 PTZ00332, PTZ00332, paraflagellar rod protein; Provisional.
          Length = 589

 Score = 31.5 bits (71), Expect = 4.0
 Identities = 34/118 (28%), Positives = 59/118 (50%), Gaps = 22/118 (18%)

Query: 461 RLKNRDKNNTKNVATSRQDKRAYE-EAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKE- 518
           RLK R + + K++  + Q     + EA KR     E  ++ I        R+  ++++E 
Sbjct: 271 RLKQRCETDLKHIHDAIQKADLEDAEAMKRYATNKEKSERFI--------RENEDRQEEA 322

Query: 519 -DKVAELEADILDDEYLFSE--QELTER-ERRDREHKRKL-----LELAKEHDKAREL 567
            +K+ +LE  +   + L +E  +E+  R E  DRE KR++     LE+A +H K  EL
Sbjct: 323 WNKIQDLERQL---QRLGTERFEEVKRRIEENDREEKRRVEYQQFLEVAGQHKKLLEL 377


>gnl|CDD|215920 pfam00437, T2SE, Type II/IV secretion system protein.  This family
           contains both type II and type IV pathway secretion
           proteins from bacteria. VirB11 ATPase is a subunit of
           the Agrobacterium tumefaciens transfer DNA (T-DNA)
           transfer system, a type IV secretion pathway required
           for delivery of T-DNA and effector proteins to plant
           cells during infection.
          Length = 273

 Score = 30.7 bits (70), Expect = 4.2
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 678 LIDAVRDHQVLIIEGETGSGKTT 700
           L  AV+    +++ G TGSGKTT
Sbjct: 122 LRQAVQARGNILVSGGTGSGKTT 144


>gnl|CDD|107037 PHA01514, PHA01514, O-antigen conversion protein C.
          Length = 485

 Score = 31.1 bits (70), Expect = 4.4
 Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 7/74 (9%)

Query: 1034 VEMELVSNPTE-TVNIRKAITAGYF---YHIAKLSKSGSYKTAKHNQTVMIHPNSSLFEE 1089
            V+   +SN T+ T +    +  GYF   Y IAK   +GSY   +H++ + I  NSSLFE 
Sbjct: 183  VKKNSISNITKNTASSVAGLIVGYFAYSYFIAKRLVTGSYNI-EHSKIIEI--NSSLFEG 239

Query: 1090 LPRWVLYHELVFTT 1103
            +   VL    +F+T
Sbjct: 240  IISNVLSFYRMFST 253



 Score = 31.1 bits (70), Expect = 4.4
 Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 7/74 (9%)

Query: 1201 VEMELVSNPTE-TVNIRKAITAGYF---YHIAKLSKSGSYKTAKHNQTVMIHPNSSLFEE 1256
            V+   +SN T+ T +    +  GYF   Y IAK   +GSY   +H++ + I  NSSLFE 
Sbjct: 183  VKKNSISNITKNTASSVAGLIVGYFAYSYFIAKRLVTGSYNI-EHSKIIEI--NSSLFEG 239

Query: 1257 LPRWVLYHELVFTT 1270
            +   VL    +F+T
Sbjct: 240  IISNVLSFYRMFST 253


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 31.3 bits (71), Expect = 4.5
 Identities = 43/245 (17%), Positives = 84/245 (34%), Gaps = 45/245 (18%)

Query: 447 DSDDDRKERDEFSERLKN-RDK---------------------------NNTKNVATSRQ 478
           +  + +++R+E  E++   R+K                             +K     ++
Sbjct: 47  EEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQKKKE 106

Query: 479 DKRAYEEAAKRLKLE--AESRDKIIPKLRIESRRKYLEKRKEDKVAEL-EADILDDEYLF 535
            +R         + E  AE   + +  L++  + +  E+  ED +  L +  +L+DE   
Sbjct: 107 AERKKALLLDEKEKERAAEYTSEDLAGLKVGHKVEEFEE-GEDVILTLKDTGVLEDED-- 163

Query: 536 SEQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKY-EEVDEKEKV 594
              EL   E  ++E  +K LEL K+    +                  KY EE++ K+K 
Sbjct: 164 EGDELENVELVEKEKDKKNLELKKK----KPDYDPDDDDKFNKRSILSKYDEEIEGKKKK 219

Query: 595 PHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTE 654
                    D   S+        +++ QE  + L  N +      +     D  +     
Sbjct: 220 SDNLFTL--DSGGST----DDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMV 273

Query: 655 KQKKK 659
           K KK 
Sbjct: 274 KFKKP 278


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 31.2 bits (70), Expect = 4.5
 Identities = 38/161 (23%), Positives = 74/161 (45%), Gaps = 13/161 (8%)

Query: 447 DSDDDRKERDEFSER---LKNRDKNNTKNVATSRQDKRAYE----EAAKRLKLEAESRDK 499
           + ++ R+ER+E  E     K+  KN+ ++    +++++  E    E  KR  LE  + + 
Sbjct: 115 EKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEF 174

Query: 500 IIPKLRIESRRKYLEKRKEDKV-AELEADILDDEYLFSEQELTERERRDREHKRKLLELA 558
           +  KL+           +  +V A  E + L  +   +  EL E  ++ RE +RK+LE  
Sbjct: 175 MTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEE-LKKKREERRKVLEEE 233

Query: 559 KEHDKARELEKVQRYYMPKGTKGPE----KYEEVDEKEKVP 595
           ++  K  E ++  R    K     E    + E  ++++KVP
Sbjct: 234 EQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKRQKVP 274



 Score = 30.8 bits (69), Expect = 5.5
 Identities = 33/158 (20%), Positives = 66/158 (41%), Gaps = 13/158 (8%)

Query: 445 LTDSDDDRKER-DEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPK 503
           L   ++ R ER  E  ER K     +T          ++  E ++R++ ++ + ++ + +
Sbjct: 63  LARREERRDERFSEALERQKEFKPTSTD---------QSLSEPSRRMQEDSGAENETVEE 113

Query: 504 LRIESRRKYLEKRKEDK-VAELEADILDDEYLFSEQELTERERRDREH-KRKLLELAKEH 561
              E  R+  E+ +E + V + E      +    ++E  E E  + E  KR  LE     
Sbjct: 114 EEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGE 173

Query: 562 DKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQ 599
               +L+  +  +   G +G +  E   E EK+   +Q
Sbjct: 174 FMTHKLKHTENTFSRGGAEGAQ-VEAGKEFEKLKQKQQ 210


>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein.  YkyA is a
           family of proteins containing a lipoprotein signal and a
           hydrolase domain. It is similar to cell wall binding
           proteins and might also be recognisable by a host immune
           defence system. It is thus likely to belong to pathways
           important for pathogenicity.
          Length = 205

 Score = 30.4 bits (69), Expect = 4.8
 Identities = 35/143 (24%), Positives = 64/143 (44%), Gaps = 20/143 (13%)

Query: 451 DRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRR 510
           ++KE++ + + ++    +N      S Q     ++  K LK E ES +K   +   +S +
Sbjct: 46  EKKEQELYDQIIQLGKDDNEAIKKLSDQALANVDKREKLLKKEKESIEK--SEEEFKSAK 103

Query: 511 KYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKV 570
           KY+EK ++ K+ + +A  L              + R + +     +L K + KA  LEK 
Sbjct: 104 KYIEKIEDKKLKK-KAKQLVKVM----------KERYKSYD----KLYKAYKKALNLEK- 147

Query: 571 QRYYMPKGTKGPEKYEEVDEKEK 593
           + Y   K        +E+DEK K
Sbjct: 148 ELYEYLK--DKDLTLKELDEKIK 168


>gnl|CDD|236148 PRK08074, PRK08074, bifunctional ATP-dependent DNA helicase/DNA
           polymerase III subunit epsilon; Validated.
          Length = 928

 Score = 31.1 bits (71), Expect = 5.0
 Identities = 17/62 (27%), Positives = 21/62 (33%), Gaps = 6/62 (9%)

Query: 6   LHREFLSEPDLASYSVMIIDEAH---ERTLHTDILFGLVKDIVRFRSDLKLLISSATLDA 62
           L      EP L SY  +IIDEAH   E         G     + F+  L  L +      
Sbjct: 443 LTDLTSEEPLLPSYEHIIIDEAHHFEEAASRH---LGEQFSYMSFQLLLSRLGTLEEDGL 499

Query: 63  QK 64
             
Sbjct: 500 LS 501



 Score = 31.1 bits (71), Expect = 5.7
 Identities = 12/23 (52%), Positives = 13/23 (56%)

Query: 771 LHREFLSEPDLASYSVMIIDEAH 793
           L      EP L SY  +IIDEAH
Sbjct: 443 LTDLTSEEPLLPSYEHIIIDEAH 465


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 31.0 bits (71), Expect = 5.1
 Identities = 30/135 (22%), Positives = 59/135 (43%), Gaps = 20/135 (14%)

Query: 489 RLKLEAESRDKIIPKLRIE---SRRK-------YLE--KRKEDKVAE-----LEADILDD 531
           + K + E+RD I  K+  +   S+RK       +L    +   ++A      LE    + 
Sbjct: 340 KTKEKRETRDSIAKKIAEKLHTSKRKVRREVLPFLSIIFKHNPELAARLAAFLELTEEEI 399

Query: 532 EYLFSEQELTERERRDREHKRKLLEL-AKEHDKARELEKVQRYYMPKGTKGPEKYEEVDE 590
           E+L   ++ T++ ++  E   K  E   KE  K     K +     +  +  +K EE +E
Sbjct: 400 EFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEE--EEEEEKEKKEEEKEE 457

Query: 591 KEKVPHYEQRKWEDE 605
           +E+    E+ + E++
Sbjct: 458 EEEEAEEEKEEEEEK 472


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 31.1 bits (70), Expect = 5.1
 Identities = 40/205 (19%), Positives = 75/205 (36%), Gaps = 22/205 (10%)

Query: 480 KRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQE 539
           K         +  ++ S  K++  LR E   K +  R+         D+ D      E+E
Sbjct: 162 KNILSGNVSDVDTDSISDKKVVEALR-EDNEKGVNFRR---------DMTD----LKERE 207

Query: 540 LTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQ 599
             E  +R ++ K +L +   + DKA++     +    K      + ++  +    P    
Sbjct: 208 SQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTS 267

Query: 600 RKWEDEQMSSAIFRVGAKDR-EHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKK 658
              ED+Q++    R   K + E ++  E  L  +      L       K E + +EK+ +
Sbjct: 268 SPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDL-------KQESKASEKEAE 320

Query: 659 KLNIEETKKSLPIYPFKQDLIDAVR 683
              +E  KK  P+    Q     V 
Sbjct: 321 DKELEAQKKREPVAEDLQKTKPQVE 345


>gnl|CDD|176500 cd08557, PI-PLCc_bacteria_like, Catalytic domain of bacterial
           phosphatidylinositol-specific phospholipase C and
           similar proteins.  This subfamily corresponds to the
           catalytic domain present in bacterial
           phosphatidylinositol-specific phospholipase C (PI-PLC,
           EC 4.6.1.13) and their sequence homologs found in
           eukaryota. Bacterial PI-PLCs participate in
           Ca2+-independent PI metabolism, hydrolyzing the membrane
           lipid phosphatidylinositol (PI) to produce
           phosphorylated myo-inositol and diacylglycerol (DAG).
           Although their precise physiological function remains
           unclear, bacterial PI-PLCs may function as virulence
           factors in some pathogenic bacteria. Bacterial PI-PLCs
           contain a single TIM-barrel type catalytic domain. Its
           catalytic mechanism is based on general base and acid
           catalysis utilizing two well conserved histidines, and
           consists of two steps, a phosphotransfer and a
           phosphodiesterase reaction. Eukaryotic homologs in this
           family are named as phosphatidylinositol-specific
           phospholipase C X domain containing proteins (PI-PLCXD).
           They are distinct from the typical eukaryotic
           phosphoinositide-specific phospholipases C (PI-PLC, EC
           3.1.4.11), which have a multidomain organization that
           consists of a PLC catalytic core domain, and various
           regulatory domains. The catalytic core domain is
           assembled from two highly conserved X- and Y-regions
           split by a divergent linker sequence. In contrast,
           eukaryotic PI-PLCXDs contain a single TIM-barrel type
           catalytic domain, X domain, which is closely related to
           that of bacterial PI-PLCs. Although the biological
           function of eukaryotic PI-PLCXDs still remains unclear,
           it may be distinct from that of typical eukaryotic
           PI-PLCs. This family also includes a distinctly
           different type of eukaryotic PLC,
           glycosylphosphatidylinositol-specific phospholipase C
           (GPI-PLC), an integral membrane protein characterized in
           the protozoan parasite Trypanosoma brucei. T. brucei
           GPI-PLC hydrolyzes the GPI-anchor on the variant
           specific glycoprotein (VSG), releasing dimyristyl
           glycerol (DMG), which may facilitate the evasion of the
           protozoan to the host's immune system. It does not
           require Ca2+ for its activity and is more closely
           related to bacterial PI-PLCs, but not mammalian PI-PLCs.
          Length = 271

 Score = 30.5 bits (69), Expect = 5.3
 Identities = 16/58 (27%), Positives = 22/58 (37%), Gaps = 9/58 (15%)

Query: 888 DFLDPPPHETLVLALEQLYALGALNHHGDL---------TKLGRRMAEFPLDPMMGKM 936
           DFLD  P E ++L LE  Y       H +L           L R        P +G++
Sbjct: 90  DFLDAHPSEVVILDLEHEYGGDNGEDHDELDALLRDVLGDPLYRPPVRAGGWPTLGEL 147


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 30.8 bits (70), Expect = 5.4
 Identities = 26/156 (16%), Positives = 52/156 (33%), Gaps = 19/156 (12%)

Query: 475 TSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYL 534
              + +R  E   K  KLE++ +   IP      +      ++   + E   D+      
Sbjct: 26  FVNEVRRCDEVERKLRKLESKIKKLGIPLKDTGGKPDVPPSKEFLDLEEEILDL------ 79

Query: 535 FSEQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKV 594
             E E+ E E      ++++ EL    +    L++ + +      +   +   +D   K 
Sbjct: 80  --EAEIKEVEENLESLEKEINELE---EWLNVLDEEKSFLDE-NLEELSELSNLDIDFKY 133

Query: 595 PHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLD 630
                      +     F  G  +RE  E +E  L 
Sbjct: 134 LR-------GAEGLKLGFVAGVINREKLEAFERELW 162


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
           N-terminal.  This is the very acidic N-terminal region
           of the early transcription elongation factor Spt5. The
           Spt5-Spt4 complex regulates early transcription
           elongation by RNA polymerase II and has an imputed role
           in pre-mRNA processing via its physical association with
           mRNA capping enzymes. The actual function of this
           N-terminal domain is not known although it is
           dispensable for binding to Spt4.
          Length = 92

 Score = 28.6 bits (64), Expect = 5.5
 Identities = 17/58 (29%), Positives = 29/58 (50%), Gaps = 4/58 (6%)

Query: 518 EDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKV--QRY 573
           E++  E + + L DE  F ++   E +RR R   R+  +  +E   A EL +   +RY
Sbjct: 14  EEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEE--DAEELAEYLRKRY 69


>gnl|CDD|219665 pfam07959, Fucokinase, L-fucokinase.  In the salvage pathway of
           GDP-L-fucose, free cytosolic fucose is phosphorylated by
           L-fucokinase to form L-fucose-L-phosphate, which is then
           further converted to GDP-L-fucose in the reaction
           catalyzed by GDP-L-fucose pyrophosphorylase.
          Length = 414

 Score = 30.6 bits (69), Expect = 5.5
 Identities = 21/108 (19%), Positives = 35/108 (32%), Gaps = 26/108 (24%)

Query: 376 LPSSSTSSRLFTTLPSSLAAVP------HHIQRQSPAQSYHR----MTPG----CTDR-R 420
           LP++S   ++FT LP      P      + ++ +     Y R    M PG      D   
Sbjct: 8   LPNASPLGKIFTPLPVLAPDGPVPLLGDNLLELKLAM--YQRFPPQMPPGVLVTSGDELL 65

Query: 421 TQSPCP---------SCLKVFKTPKVRAFMGHFLTDSDDDRKERDEFS 459
           +    P         + L    +  +    G F+TDS         + 
Sbjct: 66  SVGDAPGISFDEPGATALAHPSSLAIATNHGVFVTDSQGSLAHDLTYR 113


>gnl|CDD|238544 cd01124, KaiC, KaiC is a circadian clock protein primarily found in
           cyanobacteria KaiC is a RecA-like ATPase, having both
           Walker A and Walker B motifs. A related protein is found
           in archaea.
          Length = 187

 Score = 30.0 bits (68), Expect = 5.5
 Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 687 VLIIEGETGSGKTTQIPQYLHEA 709
             ++ G  G+GKTT   Q+L+  
Sbjct: 1   STLLSGGPGTGKTTFALQFLYAG 23


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 30.8 bits (69), Expect = 5.8
 Identities = 52/249 (20%), Positives = 94/249 (37%), Gaps = 17/249 (6%)

Query: 445 LTDSDDDRKER-DEFSERLKNRDKNNTK-----NVATSRQDKRAYEEAAKRLKLEAESRD 498
           L + D D ++  D+  E L   ++         +      +++      K   L+A   +
Sbjct: 313 LANQDSDMRQHLDKLKEDLTRAEQEKAILQTEVDALRYELERKHNTLTKKTASLQAAQEE 372

Query: 499 KIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELA 558
           K      IE  R   EK       E +  +L  +    ++    +ERR +E K +L  L 
Sbjct: 373 KATYAGEIEDMRDRYEKT------ERKLRVLQKKIENLQETFRRKERRLKEEKERLRSLQ 426

Query: 559 KEHDKARELEKVQRYYMPKGTKGPEKYEEV---DEKEKVPHYEQRKWEDEQMSSAIFRVG 615
            + +    LEK+++    K  +  E+ +E    DE+ +   +E  K E E +   +  + 
Sbjct: 427 TDTNTDTALEKLEKALAEK-ERIIERLKEQRDRDERYEQEEFETYKKEFEDLKEEVQNLQ 485

Query: 616 AKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLN-IEETKKSLPIYPF 674
            K  E + Q ELL +             +  +      EK ++K   +E+  K L   P 
Sbjct: 486 LKLSERELQLELLKEEVSKLASNQLKQRSDLERAHIELEKIREKHEKLEKELKRLRANPE 545

Query: 675 KQDLIDAVR 683
             D   AV 
Sbjct: 546 SADRGSAVD 554


>gnl|CDD|130385 TIGR01318, gltD_gamma_fam, glutamate synthase small subunit family
           protein, proteobacterial.  This model represents one of
           three built for the NADPH-dependent or NADH-dependent
           glutamate synthase (EC 1.4.1.13 and 1.4.1.14,
           respectively) small subunit and homologs. TIGR01317
           describes the small subunit (or equivalent region from
           longer forms) in eukaryotes, Gram-positive bacteria, and
           some other lineages, both NADH and NADPH-dependent.
           TIGR01316 describes a protein of similar length, from
           Archaea and a number of bacterial lineages, that forms
           glutamate synthase homotetramers without a large
           subunit. This model describes both glutatate synthase
           small subunit and closely related paralogs of unknown
           function from a number of gamma and alpha subdivision
           Proteobacteria, including E. coli.
          Length = 467

 Score = 30.5 bits (69), Expect = 5.9
 Identities = 11/35 (31%), Positives = 16/35 (45%), Gaps = 2/35 (5%)

Query: 727 AAMSVSARVAEEMGV--KLGCEVGYAIRFEDCTSE 759
           A +S    +   MG+   L CEVG  I  +D   +
Sbjct: 192 AVLSRRREIFTAMGIEFHLNCEVGRDISLDDLLED 226


>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 557

 Score = 30.6 bits (70), Expect = 6.5
 Identities = 24/100 (24%), Positives = 37/100 (37%), Gaps = 32/100 (32%)

Query: 505 RIESR--------RKY-------LEKRKEDKVAELEADILDDEYLFSEQELTERERRDRE 549
            +E R        RKY       LE   +    EL   + + E   S + L +  +   +
Sbjct: 301 EVEERLFALKSLARKYGVTIEDLLEYLDK-IKEELA-QLDNSE--ESLEALEKEVK---K 353

Query: 550 HKRKLLELAKE-----HDKARELEK-----VQRYYMPKGT 579
            K +LLE A+         A+ELEK     ++   M K  
Sbjct: 354 LKAELLEAAEALSAIRKKAAKELEKEVTAELKALAMEKAR 393


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 29.7 bits (67), Expect = 6.5
 Identities = 20/88 (22%), Positives = 35/88 (39%), Gaps = 14/88 (15%)

Query: 485 EAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERE 544
            A KR KLE +         R + R    E+R+E K        L+++     +E  E E
Sbjct: 4   GAKKRAKLEEKQA-------RRQQREAEEEEREERKK-------LEEKREGERKEEEELE 49

Query: 545 RRDREHKRKLLELAKEHDKARELEKVQR 572
               + K +     +E    +E E+ ++
Sbjct: 50  EEREKKKEEEERKEREEQARKEQEEYEK 77


>gnl|CDD|222104 pfam13401, AAA_22, AAA domain. 
          Length = 124

 Score = 29.2 bits (66), Expect = 6.5
 Identities = 22/116 (18%), Positives = 40/116 (34%), Gaps = 31/116 (26%)

Query: 682 VRDHQVLIIEGETGSGKTTQIPQYLHEAGFTKDKMKIGCTQPRRVAAMSVSARV-AEEMG 740
            R   + ++ GE+GSGKTT + +   +               RRV  +   +    +++ 
Sbjct: 1   RRGAGIGVLTGESGSGKTTLLRRLARQL------------PNRRVVYVEAPSLGTPKDLL 48

Query: 741 VKLGCEVGYAIRF---EDCTSERTIIKYMTDGTLHREFLSEPDLASYSVMIIDEAH 793
            K+   +G  +      +      I+  +      R  L           IIDEA 
Sbjct: 49  RKILRALGLPLSGGTTAELL--EAILDALKR--RGRPLL-----------IIDEAQ 89


>gnl|CDD|145707 pfam02697, DUF217, Uncharacterized ACR, COG1753.  Structural
           modelling suggests this domain may bind nucleic acids.
          Length = 71

 Score = 27.7 bits (62), Expect = 6.8
 Identities = 14/61 (22%), Positives = 25/61 (40%), Gaps = 6/61 (9%)

Query: 482 AYEEAAKRLKLEAESRDKIIPKL-----RIESRRKYLEKRKEDKVAELEADILDDEYLFS 536
            YE+   +LK   +S  ++I +L            Y     E++  ELE ++ +    F 
Sbjct: 10  VYEKL-LKLKRGGKSFSEVIRELIEKKGNRRKLMDYFGILSEEEADELEKELKEVRKKFR 68

Query: 537 E 537
           E
Sbjct: 69  E 69


>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  Exonuclease-1 (EXO1) is involved
           in multiple, eukaryotic DNA metabolic pathways,
           including DNA replication processes (5' flap DNA
           endonuclease activity and double stranded DNA
           5'-exonuclease activity), DNA repair processes (DNA
           mismatch repair (MMR) and post-replication repair
           (PRR)), recombination, and telomere integrity. EXO1
           functions in the MMS2 error-free branch of the PRR
           pathway in the maintenance and repair of stalled
           replication forks. Studies also suggest that EXO1 plays
           both structural and catalytic roles during MMR-mediated
           mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
           family of structure-specific, 5' nucleases. These
           nucleases contain a PIN (PilT N terminus) domain with a
           helical arch/clamp region (I domain) of variable length
           (approximately 43 residues in EXO1 PIN domains) and a
           H3TH (helix-3-turn-helix) domain, an atypical
           helix-hairpin-helix-2-like region. Both the H3TH domain
           (not included here) and the helical arch/clamp region
           are involved in DNA binding. Nucleases within this group
           also have a carboxylate-rich active site that is
           involved in binding essential divalent metal ion
           cofactors (Mg2+/Mn2+). EXO1 nucleases also have
           C-terminal Mlh1- and Msh2-binding domains which allow
           interaction with MMR and PRR proteins, respectively.
          Length = 210

 Score = 29.8 bits (68), Expect = 7.1
 Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 2/35 (5%)

Query: 541 TERERRDREHKRKLLELAKEHDKARELEKVQRYYM 575
           TE ERR+R  + + LE A E  +     + +  + 
Sbjct: 88  TEEERRER--REENLEKALELLREGNRSEARECFQ 120


>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
           production and conversion].
          Length = 108

 Score = 28.5 bits (64), Expect = 7.5
 Identities = 19/66 (28%), Positives = 30/66 (45%), Gaps = 4/66 (6%)

Query: 484 EEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKV----AELEADILDDEYLFSEQE 539
            E  + +K    S D+ I + + E+ +   E R+E +     AE EA+ L  E L   +E
Sbjct: 6   SEVLREIKKAEISADEEIEEAKEEAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEARE 65

Query: 540 LTERER 545
             E E 
Sbjct: 66  EAEEEA 71


>gnl|CDD|217188 pfam02689, Herpes_Helicase, Helicase.  This family consists of
           Helicases from the Herpes viruses. Helicases are
           responsible for the unwinding of DNA and are essential
           for replication and completion of the viral life cycle.
          Length = 801

 Score = 30.4 bits (69), Expect = 7.6
 Identities = 19/47 (40%), Positives = 26/47 (55%), Gaps = 7/47 (14%)

Query: 687 VLIIEGETGSGKTTQIPQYLHEAGFTKDKMKIGCTQPRRVAAMSVSA 733
             +I G  G+GK+T I Q L+E     D +  G T   RVAA ++SA
Sbjct: 61  AYLITGTAGAGKSTSI-QTLNE---NLDCLITGAT---RVAAQNLSA 100


>gnl|CDD|214640 smart00382, AAA, ATPases associated with a variety of cellular
           activities.  AAA - ATPases associated with a variety of
           cellular activities. This profile/alignment only detects
           a fraction of this vast family. The poorly conserved
           N-terminal helix is missing from the alignment.
          Length = 148

 Score = 29.3 bits (65), Expect = 7.8
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query: 685 HQVLIIEGETGSGKTT 700
            +V++I G  GSGKTT
Sbjct: 2   GEVILIVGPPGSGKTT 17


>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16).  Autophagy is a
           ubiquitous intracellular degradation system for
           eukaryotic cells. During autophagy, cytoplasmic
           components are enclosed in autophagosomes and delivered
           to lysosomes/vacuoles. ATG16 (also known as Apg16) has
           been shown to be bind to Apg5 and is required for the
           function of the Apg12p-Apg5p conjugate in the yeast
           autophagy pathway.
          Length = 194

 Score = 29.4 bits (66), Expect = 8.2
 Identities = 31/148 (20%), Positives = 57/148 (38%), Gaps = 12/148 (8%)

Query: 455 RDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLE 514
           RDE   RLK+RD+          Q ++  +       L+ E+      K   +S      
Sbjct: 2   RDEILRRLKDRDRREKAFSELFEQYRQLADRTR---LLKVENDALQAEKYEQQSSHSSSP 58

Query: 515 KRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKAR--------E 566
                      A+ ++ +     +ELTE  ++  E  ++LL L  E ++ R         
Sbjct: 59  SADGPGSDAAIAE-MEQKLAKLREELTELHKKRGELAQRLLLLNDELEQLRREIQQLEKT 117

Query: 567 LEKVQRYYMPKGTKGPEKYEEVDEKEKV 594
           + +++       T+  +  EE+ EKEK 
Sbjct: 118 IAELRSEITSLETEIRDLREELQEKEKD 145


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 29.9 bits (68), Expect = 8.5
 Identities = 17/59 (28%), Positives = 34/59 (57%), Gaps = 2/59 (3%)

Query: 514 EKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLE-LAKEHDK-ARELEKV 570
            +R + + AE E ++L ++    EQ +  +ER  +EH ++L+E +  E +K   E E++
Sbjct: 208 AERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERM 266


>gnl|CDD|234015 TIGR02788, VirB11, P-type DNA transfer ATPase VirB11.  The VirB11
           protein is found in the vir locus of Agrobacterium Ti
           plasmids where it is involved in the type IV secretion
           system for DNA transfer. VirB11 is believed to be an
           ATPase. VirB11 is a homolog of the P-like conjugation
           system TrbB protein and the Flp pilus sytem protein
           TadA.
          Length = 308

 Score = 30.0 bits (68), Expect = 8.6
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 674 FKQDLIDAVRDHQVLIIEGETGSGKTT 700
            K+ L  A+   + +II G TGSGKTT
Sbjct: 133 IKEFLRLAIASRKNIIISGGTGSGKTT 159


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 28.8 bits (65), Expect = 8.6
 Identities = 21/73 (28%), Positives = 35/73 (47%), Gaps = 5/73 (6%)

Query: 480 KRAYEEAAKRLKLE-AESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQ 538
           K+A EEA ++ + E  E R +I    R E +++  ++ KE K A    +  +D+   +E 
Sbjct: 36  KKAQEEAKEKEREERIEERKRI----REERKQELEKQLKERKEALKLLEEENDDEEDAET 91

Query: 539 ELTERERRDREHK 551
           E TE    D    
Sbjct: 92  EDTEDVEDDEWEG 104


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 30.1 bits (68), Expect = 8.8
 Identities = 30/175 (17%), Positives = 62/175 (35%), Gaps = 34/175 (19%)

Query: 498 DKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLEL 557
           D+   + +I+     LEK+ E ++ +LE  +        E EL E E+   E ++K    
Sbjct: 272 DEKFERDKIKQLASELEKKLEKELKKLENKLEK-----QEDELEELEKAAEELRQK---- 322

Query: 558 AKEHDKARELEKVQRYY--MPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVG 615
                     E +      + +G K     +    +E     ++ K    + +   F+  
Sbjct: 323 ---------GELLYANLQLIEEGLKSVRLADFYGNEEIKIELDKSK-TPSENAQRYFKKY 372

Query: 616 AKDREHQEQYELLLD---NQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKK 667
            K +  +   +  L      I + ++           +   EK + K  IEE ++
Sbjct: 373 KKLKGAKVNLDRQLSELKEAIAYYESA----------KTALEKAEGKKAIEEIRE 417


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 29.8 bits (67), Expect = 9.7
 Identities = 23/107 (21%), Positives = 43/107 (40%), Gaps = 5/107 (4%)

Query: 456 DEFSERLKNRDKNN----TKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRK 511
           +   ++++   K N     KN   S +++   EEA +  K E E R  ++ K   E +  
Sbjct: 116 ENTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMN 175

Query: 512 YLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELA 558
              K K+  + ELE   L    L ++ +    +   +  K K  +  
Sbjct: 176 K-RKNKQALLDELETSTLPAAELIAQHKKNSVKLEMQVEKPKPEKPN 221


>gnl|CDD|219111 pfam06625, DUF1151, Protein of unknown function (DUF1151).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 122

 Score = 28.6 bits (64), Expect = 9.8
 Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 5/59 (8%)

Query: 510 RKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELE 568
           ++ LEKRK D+V + +      E   +++  +E ER   +  +KL +L  E  K  E +
Sbjct: 54  QRVLEKRKRDQVLKEQ-----KEEEEAKRLQSELERELMKRAQKLEQLELEKAKEDEEQ 107


>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family.  This model
           represents a subfamily of RNA splicing factors including
           the Pad-1 protein (N. crassa), CAPER (M. musculus) and
           CC1.3 (H.sapiens). These proteins are characterized by
           an N-terminal arginine-rich, low complexity domain
           followed by three (or in the case of 4 H. sapiens
           paralogs, two) RNA recognition domains (rrm: pfam00706).
           These splicing factors are closely related to the U2AF
           splicing factor family (TIGR01642). A homologous gene
           from Plasmodium falciparum was identified in the course
           of the analysis of that genome at TIGR and was included
           in the seed.
          Length = 457

 Score = 29.9 bits (67), Expect = 9.8
 Identities = 23/98 (23%), Positives = 40/98 (40%), Gaps = 10/98 (10%)

Query: 453 KERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKII-PKLRIESRRK 511
           + RD    RL+N  + + K    SR+  R+ + + +R       RD     + R  SR  
Sbjct: 1   RYRDRERGRLRNDTRRSDKGRERSRRRSRSRDRSRRR-----RDRDYYRGRRGRSRSRSP 55

Query: 512 Y-LEKRKEDKVAELEADILDDEYLFSEQELTERERRDR 548
               + + D+    +          +++ LTE ER DR
Sbjct: 56  NRYYRPRGDRSYRRDDRRSGRN---TKEPLTEAERDDR 90


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.134    0.394 

Gapped
Lambda     K      H
   0.267   0.0723    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 67,747,927
Number of extensions: 6840708
Number of successful extensions: 8057
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7757
Number of HSP's successfully gapped: 313
Length of query: 1320
Length of database: 10,937,602
Length adjustment: 108
Effective length of query: 1212
Effective length of database: 6,147,370
Effective search space: 7450612440
Effective search space used: 7450612440
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (28.8 bits)