RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11229
         (705 letters)



>gnl|CDD|238667 cd01371, KISc_KIF3, Kinesin motor domain, kinesins II or KIF3_like
           proteins. Subgroup of kinesins, which form heterotrimers
           composed of 2 kinesins and one non-motor accessory
           subunit. Kinesins II play important roles in ciliary
           transport, and have been implicated in neuronal
           transport, melanosome transport, the secretory pathway,
           and mitosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In this group the motor
           domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 333

 Score =  545 bits (1407), Expect = 0.0
 Identities = 239/394 (60%), Positives = 280/394 (71%), Gaps = 61/394 (15%)

Query: 16  ENVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYST 75
           ENV+V VR RPL+K+E   G  +I  VD   G +TV NP   A EPPK FTFD V     
Sbjct: 1   ENVKVVVRCRPLNKREKSEGAPEIVGVDENRGQVTVHNPKADAKEPPKVFTFDAV----- 55

Query: 76  VDLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIF 135
                                        +D NS Q D+YNETARP+VD VL G+NGTIF
Sbjct: 56  -----------------------------YDPNSTQEDVYNETARPLVDSVLEGYNGTIF 86

Query: 136 AYGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEEI 195
           AYGQTGTGKTFTMEGV   PEL+GIIPNSFAHIFGHIAKA E+V+FLVRVSY EIYNEE+
Sbjct: 87  AYGQTGTGKTFTMEGVREPPELRGIIPNSFAHIFGHIAKA-ENVQFLVRVSYLEIYNEEV 145

Query: 196 RDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNK 255
           RDLL K++ + LE+KERPD GVY             VKDLS ++  NA++M+K+M+LGNK
Sbjct: 146 RDLLGKDQKKKLELKERPDRGVY-------------VKDLSMFVVKNAEEMDKLMTLGNK 192

Query: 256 NRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDL 315
           NRSVGAT MNE SSRSH+IF+I IE S         GE+          +++G+L+LVDL
Sbjct: 193 NRSVGATNMNEDSSRSHSIFTITIECS-------EKGEDGE------NHIRVGKLNLVDL 239

Query: 316 AGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRILQDSLG 375
           AGSERQ+KTGA+G RLKEA+KINLSLS LGNVISALVDGK THIPYR+SKLTR+LQDSLG
Sbjct: 240 AGSERQSKTGATGDRLKEATKINLSLSALGNVISALVDGKSTHIPYRDSKLTRLLQDSLG 299

Query: 376 GNSKTVMCATVGPASYNYEETISTLRYASRVKKI 409
           GNSKTVMCA +GPA YNY+ET+STLRYA+R K I
Sbjct: 300 GNSKTVMCANIGPADYNYDETLSTLRYANRAKNI 333


>gnl|CDD|215803 pfam00225, Kinesin, Kinesin motor domain. 
          Length = 326

 Score =  445 bits (1147), Expect = e-153
 Identities = 172/389 (44%), Positives = 224/389 (57%), Gaps = 65/389 (16%)

Query: 23  RVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTVDLLNGI 82
           RVRPL+++E   G  DI  VD  +                                    
Sbjct: 1   RVRPLNEREKSRGSSDIVNVDETDSEDKESVV---------------------------- 32

Query: 83  ITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFAYGQTGT 142
                   +     KTFTFD VFD  + Q  +Y ETA+P+V+ VL G+N TIFAYGQTG+
Sbjct: 33  -------ITNKGREKTFTFDRVFDPEATQEFVYEETAKPLVESVLEGYNVTIFAYGQTGS 85

Query: 143 GKTFTMEGVNNVPELKGIIPNSFAHIFGHIAK-ADESVKFLVRVSYFEIYNEEIRDLLSK 201
           GKT+TMEG +   E  GIIP +   +F  I +  +    F VRVSY EIYNE+I DLLS 
Sbjct: 86  GKTYTMEG-DPPEEEPGIIPRALEDLFERIEERKERKWSFSVRVSYLEIYNEKIYDLLSP 144

Query: 202 NKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNKNRSVGA 261
           +  +              L ++E P  GVYVK L+     +A+++ +++ LGNKNR+V +
Sbjct: 145 SSKKKK------------LRIREDPKKGVYVKGLTEVEVTSAEEVLELLELGNKNRTVAS 192

Query: 262 TAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDLAGSERQ 321
           T MNE SSRSHAIF+I +E  +                        G+L+LVDLAGSER 
Sbjct: 193 TNMNEESSRSHAIFTITVEQRNRDTDGSVK---------------TGKLNLVDLAGSERA 237

Query: 322 AKTG-ASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRILQDSLGGNSKT 380
           +KTG A G+RLKEA+ IN SLS LGNVI+AL + + +HIPYR+SKLTR+LQDSLGGNSKT
Sbjct: 238 SKTGAAGGRRLKEAANINKSLSALGNVINALAENQSSHIPYRDSKLTRLLQDSLGGNSKT 297

Query: 381 VMCATVGPASYNYEETISTLRYASRVKKI 409
           +M A + P+  NYEET+STLR+ASR K I
Sbjct: 298 LMIANISPSDSNYEETLSTLRFASRAKNI 326


>gnl|CDD|214526 smart00129, KISc, Kinesin motor, catalytic domain. ATPase.
           Microtubule-dependent molecular motors that play
           important roles in intracellular transport of organelles
           and in cell division.
          Length = 335

 Score =  444 bits (1144), Expect = e-152
 Identities = 182/401 (45%), Positives = 230/401 (57%), Gaps = 67/401 (16%)

Query: 17  NVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTV 76
           N+RV VRVRPL+K+E                              P    F         
Sbjct: 1   NIRVVVRVRPLNKREKSRKS-------------------------PSVVPFPD------- 28

Query: 77  DLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFA 136
                  T+   +    +  K FTFD VFDA + Q D++ ETA P+VD VL G+N TIFA
Sbjct: 29  ---KVGKTLTVRSPKNRQGEKKFTFDKVFDATASQEDVFEETAAPLVDSVLEGYNATIFA 85

Query: 137 YGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEEIR 196
           YGQTG+GKT+TM G    P+  GIIP +   +F  I K +E  +F V+VSY EIYNE+IR
Sbjct: 86  YGQTGSGKTYTMIGT---PDSPGIIPRALKDLFEKIDKREEGWQFSVKVSYLEIYNEKIR 142

Query: 197 DLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNKN 256
           DLL+ +                 LE++E    GVYVK L+    ++ +++  ++  GNKN
Sbjct: 143 DLLNPSSK--------------KLEIREDEKGGVYVKGLTEISVSSFEEVYNLLEKGNKN 188

Query: 257 RSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDLA 316
           R+V AT MNE SSRSHA+F+I +E       K     +        GS +  +L+LVDLA
Sbjct: 189 RTVAATKMNEESSRSHAVFTITVE------QKIKNSSS--------GSGKASKLNLVDLA 234

Query: 317 GSERQAKTGASGQRLKEASKINLSLSTLGNVISALVD-GKCTHIPYRNSKLTRILQDSLG 375
           GSER  KTGA G RLKEA  IN SLS LGNVI+AL    K  HIPYR+SKLTR+LQDSLG
Sbjct: 235 GSERAKKTGAEGDRLKEAGNINKSLSALGNVINALAQHSKSRHIPYRDSKLTRLLQDSLG 294

Query: 376 GNSKTVMCATVGPASYNYEETISTLRYASRVKKICNKARVN 416
           GNSKT+M A V P+S N EET+STLR+ASR K+I NK  VN
Sbjct: 295 GNSKTLMIANVSPSSSNLEETLSTLRFASRAKEIKNKPIVN 335


>gnl|CDD|238054 cd00106, KISc, Kinesin motor domain. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Kinesins are microtubule-dependent
           molecular motors that play important roles in
           intracellular transport and in cell division. In most
           kinesins, the motor domain is found at the N-terminus
           (N-type), in some its is found in the middle (M-type),
           or C-terminal (C-type). N-type and M-type kinesins are
           (+) end-directed motors, while C-type kinesins are (-)
           end-directed motors, i.e. they transport cargo towards
           the (-) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 328

 Score =  420 bits (1081), Expect = e-143
 Identities = 180/393 (45%), Positives = 225/393 (57%), Gaps = 67/393 (17%)

Query: 17  NVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTV 76
           N+RV VR+RPL+ +ES                                 T D      TV
Sbjct: 1   NIRVVVRIRPLNGRESK--------------------------SEESCITVD---DNKTV 31

Query: 77  DLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFA 136
                  T+  P       PK+FTFD VFD NS Q D+Y  TA+P+V+ VL G+NGTIFA
Sbjct: 32  -------TLTPPKDGRKAGPKSFTFDHVFDPNSTQEDVYETTAKPLVESVLEGYNGTIFA 84

Query: 137 YGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADES-VKFLVRVSYFEIYNEEI 195
           YGQTG+GKT+TM G    P+  GIIP +   +F  I +  E    F V VSY EIYNE++
Sbjct: 85  YGQTGSGKTYTMFG---SPKDPGIIPRALEDLFNLIDERKEKNKSFSVSVSYLEIYNEKV 141

Query: 196 RDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNK 255
            DLLS       E   +P      L ++E P  GVYVK L+     +A+D   ++  G K
Sbjct: 142 YDLLSP------EPPSKP------LSLREDPKGGVYVKGLTEVEVGSAEDALSLLQKGLK 189

Query: 256 NRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDL 315
           NR+  +TAMNE SSRSHAIF+I +E  +     R+               +  +L+LVDL
Sbjct: 190 NRTTASTAMNERSSRSHAIFTIHVEQRNTTNDGRSI--------------KSSKLNLVDL 235

Query: 316 AGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKC-THIPYRNSKLTRILQDSL 374
           AGSER  KTGA G RLKEA  IN SLS LGNVISAL  G+   HIPYR+SKLTR+LQDSL
Sbjct: 236 AGSERAKKTGAEGDRLKEAKNINKSLSALGNVISALSSGQKKKHIPYRDSKLTRLLQDSL 295

Query: 375 GGNSKTVMCATVGPASYNYEETISTLRYASRVK 407
           GGNSKT+M A + P+S NY+ET+STLR+ASR K
Sbjct: 296 GGNSKTLMIANISPSSENYDETLSTLRFASRAK 328


>gnl|CDD|238665 cd01369, KISc_KHC_KIF5, Kinesin motor domain, kinesin heavy chain
           (KHC) or KIF5-like subgroup. Members of this group have
           been associated with organelle transport. This catalytic
           (head) domain has ATPase activity and belongs to the
           larger group of P-loop NTPases. Kinesins are
           microtubule-dependent molecular motors that play
           important roles in intracellular transport and in cell
           division. In most kinesins, the motor domain is found at
           the N-terminus (N-type). N-type kinesins are (+)
           end-directed motors, i.e. they transport cargo towards
           the (+) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 325

 Score =  389 bits (1001), Expect = e-131
 Identities = 175/395 (44%), Positives = 228/395 (57%), Gaps = 70/395 (17%)

Query: 15  IENVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYS 74
             N++V  R RPL++KE   G   I                        T +        
Sbjct: 1   ECNIKVVCRFRPLNEKEELRGSKSIVKFP-----------------GEDTVSI------- 36

Query: 75  TVDLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTI 134
                           +  +  KTF+FD VF  N+ Q D+YN  A+PIVD VL+G+NGTI
Sbjct: 37  ----------------AGSDDGKTFSFDRVFPPNTTQEDVYNFVAKPIVDDVLNGYNGTI 80

Query: 135 FAYGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEE 194
           FAYGQTG+GKT+TMEG    PELKGIIP     IF HI+  DE+++F V+VSY EIY E+
Sbjct: 81  FAYGQTGSGKTYTMEGPPGDPELKGIIPRIVHDIFEHISSMDENLEFHVKVSYLEIYMEK 140

Query: 195 IRDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGN 254
           IRDLL  +KD               L+V E  + GVYVK L+    ++ +++ ++++ G 
Sbjct: 141 IRDLLDVSKDN--------------LQVHEDKNRGVYVKGLTERFVSSPEEVLEVINEGK 186

Query: 255 KNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVD 314
            NR+V +T MNE SSRSH+IF I ++  +   G +  G                +L LVD
Sbjct: 187 SNRAVASTNMNEESSRSHSIFLITLKQENVETGSKKRG----------------KLFLVD 230

Query: 315 LAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRILQDSL 374
           LAGSE+ +KTGA GQ L+EA KIN SLS LGNVI+AL DGK THIPYR+SKLTRILQDSL
Sbjct: 231 LAGSEKVSKTGAEGQTLEEAKKINKSLSALGNVINALTDGKSTHIPYRDSKLTRILQDSL 290

Query: 375 GGNSKTVMCATVGPASYNYEETISTLRYASRVKKI 409
           GGNS+T +     P+SYN  ET+STLR+ +R K I
Sbjct: 291 GGNSRTTLIICCSPSSYNESETLSTLRFGARAKTI 325


>gnl|CDD|238662 cd01366, KISc_C_terminal, Kinesin motor domain,
           KIFC2/KIFC3/ncd-like carboxy-terminal kinesins. Ncd is a
           spindle motor protein necessary for chromosome
           segregation in meiosis. KIFC2/KIFC3-like kinesins have
           been implicated in motility of the Golgi apparatus as
           well as dentritic and axonal transport in neurons. This
           catalytic (head) domain has ATPase activity and belongs
           to the larger group of P-loop NTPases. Kinesins are
           microtubule-dependent molecular motors that play
           important roles in intracellular transport and in cell
           division. In this subgroup the motor domain is found at
           the C-terminus (C-type). C-type kinesins are (-)
           end-directed motors, i.e. they transport cargo towards
           the (-) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 329

 Score =  367 bits (945), Expect = e-122
 Identities = 155/395 (39%), Positives = 213/395 (53%), Gaps = 73/395 (18%)

Query: 17  NVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTV 76
           N+RVF RVRPL   ES                           E     +F         
Sbjct: 3   NIRVFCRVRPLLPSEST--------------------------EYSSVISFPDED----- 31

Query: 77  DLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFA 136
               G I +    G      K+F+FD VFD ++ Q D++ E + P+V   L G+N  IFA
Sbjct: 32  ---GGTIELSKGTGK----KKSFSFDRVFDPDASQEDVFEEVS-PLVQSALDGYNVCIFA 83

Query: 137 YGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESV--KFLVRVSYFEIYNEE 194
           YGQTG+GKT+TMEG    PE  GIIP +   +F + A+  +     + +  S  EIYNE 
Sbjct: 84  YGQTGSGKTYTMEGP---PENPGIIPRALEQLF-NTAEELKEKGWSYTITASMLEIYNET 139

Query: 195 IRDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGN 254
           IRDLL+       +           LE+K       YV +L+    ++ +++ ++++LG+
Sbjct: 140 IRDLLATKPAPKKK-----------LEIKHDSKGETYVTNLTEVPVSSPEEVTRLLNLGS 188

Query: 255 KNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVD 314
           KNRSV +T MNE SSRSHA+F + I  ++   G++T G                +L+LVD
Sbjct: 189 KNRSVASTNMNEHSSRSHAVFQLKIRGTNLQTGEQTRG----------------KLNLVD 232

Query: 315 LAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRILQDSL 374
           LAGSER  K+GA+G RLKEA  IN SLS LG+VISAL     +H+PYRNSKLT +LQDSL
Sbjct: 233 LAGSERLKKSGATGDRLKEAQAINKSLSALGDVISALRSKD-SHVPYRNSKLTYLLQDSL 291

Query: 375 GGNSKTVMCATVGPASYNYEETISTLRYASRVKKI 409
           GGNSKT+M   + P   N  ET+ +LR+ASRV+ +
Sbjct: 292 GGNSKTLMFVNISPLESNLSETLCSLRFASRVRSV 326


>gnl|CDD|238661 cd01365, KISc_KIF1A_KIF1B, Kinesin motor domain, KIF1_like
           proteins. KIF1A (Unc104) transports synaptic vesicles to
           the nerve  terminal, KIF1B has been implicated in
           transport of mitochondria. Both proteins are expressed
           in neurons. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           In contrast to the majority of dimeric kinesins, most
           KIF1A/Unc104 kinesins are monomeric motors. A
           lysine-rich loop in KIF1A binds to the negatively
           charged C-terminus of tubulin and compensates for the
           lack of a second motor domain, allowing KIF1A to move
           processively.
          Length = 356

 Score =  359 bits (923), Expect = e-118
 Identities = 162/416 (38%), Positives = 231/416 (55%), Gaps = 75/416 (18%)

Query: 16  ENVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNP--AGSAHEPPKTFTFDTVTPY 73
            NV+V VRVRP + +E + G   I  V +   + T+KNP  A +  + PK+F+FD    Y
Sbjct: 1   ANVKVAVRVRPFNSREKNRGSKCI--VQMPGKVTTLKNPKAADATRKKPKSFSFDHS--Y 56

Query: 74  STVDLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGT 133
            + D                   +   + +       Q D++ +  R ++D    G+N  
Sbjct: 57  WSHD------------------SEDPHYAS-------QEDVFEDLGRELLDHAFEGYNVC 91

Query: 134 IFAYGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKA-DESVKFLVRVSYFEIYN 192
           +FAYGQTG+GK++TM G     E KGIIP     +F  I    ++++ + V VSY EIYN
Sbjct: 92  LFAYGQTGSGKSYTMMG---YKEEKGIIPRLCEELFQRIESKKEQNLSYEVEVSYMEIYN 148

Query: 193 EEIRDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSL 252
           E++RDLL+  K                L+V+E P +G YV+DLS     + +D++ ++  
Sbjct: 149 EKVRDLLNPKKKN-----------KGNLKVREHPVLGPYVEDLSKVAVTSYEDIQNLLEE 197

Query: 253 GNKNRSVGATAMNEVSSRSHAIFSII-----IETSSGLGGKRTTGENWRLWAEAQGSVQM 307
           GNK+R+  +T MN+ SSRSHA+F+I+     ++  + L  ++ +                
Sbjct: 198 GNKSRTTASTNMNDTSSRSHAVFTIVLTQKKLDKETDLTTEKVS---------------- 241

Query: 308 GRLHLVDLAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVD-------GKCTHIP 360
            ++ LVDLAGSER + TGA G RLKE S IN SL+TLG VISAL D        K + IP
Sbjct: 242 -KISLVDLAGSERASSTGAEGDRLKEGSNINKSLTTLGKVISALADNSSAKSKKKSSFIP 300

Query: 361 YRNSKLTRILQDSLGGNSKTVMCATVGPASYNYEETISTLRYASRVKKICNKARVN 416
           YR+S LT +L+++LGGNSKT M AT+ PA  NYEET+STLRYA R KKI N A VN
Sbjct: 301 YRDSVLTWLLKENLGGNSKTAMIATISPADINYEETLSTLRYADRAKKIVNVAVVN 356


>gnl|CDD|238668 cd01372, KISc_KIF4, Kinesin motor domain, KIF4-like subfamily.
           Members of this group seem to perform a variety of
           functions, and have been implicated in neuronal
           organelle transport and chromosome segregation during
           mitosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 341

 Score =  357 bits (918), Expect = e-118
 Identities = 155/397 (39%), Positives = 210/397 (52%), Gaps = 64/397 (16%)

Query: 18  VRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTVD 77
           VRV VRVRPL  KE   G   +                          +     P  TV 
Sbjct: 3   VRVAVRVRPLLPKELLEGC-QVCV------------------------SVVPGEPQVTV- 36

Query: 78  LLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFAY 137
                              K+FTFD VFD ++ Q ++YN    P+VD +  G+N T+ AY
Sbjct: 37  ----------------GTDKSFTFDYVFDPSTSQEEVYNTCVAPLVDGLFEGYNATVLAY 80

Query: 138 GQTGTGKTFTMEGVNNVPEL---KGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEE 194
           GQTG+GKT+TM       E     GIIP +  HIF  I +  +   F ++VS+ E+YNEE
Sbjct: 81  GQTGSGKTYTMGTAFTASEDEEEVGIIPRAIQHIFKKIDEKKDEPDFQLKVSFLELYNEE 140

Query: 195 IRDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGN 254
           +RDLLS +        E+  I     +++E     + +  L+    N+A ++   +  G+
Sbjct: 141 VRDLLSPSTS------EKSPI-----QIREDSKGNIIIVGLTEVTVNSAQEVMSCLEQGS 189

Query: 255 KNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVD 314
            +R+  +TAMN  SSRSHAIF+I +E        R  G    +  + + S    + H VD
Sbjct: 190 LSRTTASTAMNSQSSRSHAIFTITLEQ------TRKNGPIAPMSGDDKNSTLTSKFHFVD 243

Query: 315 LAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVD--GKCTHIPYRNSKLTRILQD 372
           LAGSER  KTGA+G RLKE   IN  L  LGNVISAL D   K +H+PYR+SKLTR+LQD
Sbjct: 244 LAGSERLKKTGATGDRLKEGISINSGLLALGNVISALGDESKKGSHVPYRDSKLTRLLQD 303

Query: 373 SLGGNSKTVMCATVGPASYNYEETISTLRYASRVKKI 409
           SLGGNS T+M A V PA  N+EET++TL+YA+R + I
Sbjct: 304 SLGGNSHTLMIACVSPADSNFEETLNTLKYANRARNI 340


>gnl|CDD|238666 cd01370, KISc_KIP3_like, Kinesin motor domain, KIP3-like subgroup.
           The yeast kinesin KIP3 plays a role in positioning the
           mitotic spindle. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 338

 Score =  346 bits (889), Expect = e-114
 Identities = 155/395 (39%), Positives = 219/395 (55%), Gaps = 59/395 (14%)

Query: 17  NVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTV 76
           ++ V VRVRP ++KE   G   +  V +++  + V               FD   P    
Sbjct: 1   SLTVAVRVRPFNEKEKQEGTRRV--VKVVDDRMLV---------------FD---PKDEE 40

Query: 77  DLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFA 136
           D    +   +N           ++FD VFD  S Q ++Y  T +P+VD VL+G+N T+FA
Sbjct: 41  DAFRNLRARRNK-------ELKYSFDRVFDETSTQEEVYENTTKPLVDGVLNGYNATVFA 93

Query: 137 YGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEEIR 196
           YG TG GKT TM G ++ P   G++  +   +F  I +  +  +F V +SY EIYNE IR
Sbjct: 94  YGATGAGKTHTMLGTDSDP---GLMVLTMKDLFDKIEERKDDKEFEVSLSYLEIYNETIR 150

Query: 197 DLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNKN 256
           DLLS +                 LE++E P+ G+ V  L+ +   +A+++ +++  GN+N
Sbjct: 151 DLLSPSSGP--------------LELREDPNQGIVVAGLTEHQPKSAEEILELLMKGNRN 196

Query: 257 RSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDLA 316
           R+   T  N  SSRSHA+  I +         RT   N          V++G+L L+DLA
Sbjct: 197 RTQEPTEANATSSRSHAVLQITVRQKD-----RTASIN--------QQVRIGKLSLIDLA 243

Query: 317 GSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKC--THIPYRNSKLTRILQDSL 374
           GSER + T   GQRLKE + IN SL  LGN I+ALVDGK    HIPYR+SKLTR+L+DSL
Sbjct: 244 GSERASATNNRGQRLKEGANINRSLLALGNCINALVDGKKKNKHIPYRDSKLTRLLKDSL 303

Query: 375 GGNSKTVMCATVGPASYNYEETISTLRYASRVKKI 409
           GGN KTVM A + P+S +YEET +TL+YA+R K I
Sbjct: 304 GGNCKTVMIANISPSSSHYEETHNTLKYANRAKNI 338


>gnl|CDD|238660 cd01364, KISc_BimC_Eg5, Kinesin motor domain, BimC/Eg5 spindle pole
           proteins, participate in spindle assembly and chromosome
           segregation during cell division. This catalytic (head)
           domain has ATPase activity and belongs to the larger
           group of P-loop NTPases. Kinesins are
           microtubule-dependent molecular motors that play
           important roles in intracellular transport and in cell
           division. In most kinesins, the motor domain is found at
           the N-terminus (N-type), N-type kinesins are (+)
           end-directed motors, i.e. they transport cargo towards
           the (+) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 352

 Score =  344 bits (885), Expect = e-113
 Identities = 162/411 (39%), Positives = 216/411 (52%), Gaps = 70/411 (17%)

Query: 17  NVRVFVRVRPLSKKESDAG-HVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYST 75
           N++V VR RP + +E      V +        II     A                  ST
Sbjct: 3   NIQVVVRCRPRNSRERKEKSSVVVEVSGSSKEIIVSTGGADK---------------QST 47

Query: 76  VDLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIF 135
                                KT+TFD VF   + Q+++Y++   PI+D+VL G+N TIF
Sbjct: 48  ---------------------KTYTFDKVFGPEADQIEVYSQVVSPILDEVLMGYNCTIF 86

Query: 136 AYGQTGTGKTFTMEGVNN--------VPELKGIIPNSFAHIFGHIAKADESVKFLVRVSY 187
           AYGQTGTGKT+TMEG           +    GIIP +   +F  +   +   ++ V+VSY
Sbjct: 87  AYGQTGTGKTYTMEGDRTDNKGSTWELSPHAGIIPRALYQLFEKLESQNT--EYSVKVSY 144

Query: 188 FEIYNEEIRDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDME 247
            E+YNEE+ DLLS   D          + ++     +    GV ++ L     NNA++  
Sbjct: 145 LELYNEELFDLLSSESD------LNKPLRIFDDTNNKG---GVVIQGLEEITVNNANEGL 195

Query: 248 KIMSLGNKNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQM 307
           K++  G+  R   AT MN+ SSRSH+IFSI I        K TT     L       V++
Sbjct: 196 KLLEKGSAKRKTAATLMNDQSSRSHSIFSITIHI------KETTISGEEL-------VKI 242

Query: 308 GRLHLVDLAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLT 367
           G+L+LVDLAGSE   ++GA  +R +EA  IN SL TLG VI+ALV+ K  HIPYR SKLT
Sbjct: 243 GKLNLVDLAGSENIGRSGAENKRAREAGNINQSLLTLGRVINALVE-KSPHIPYRESKLT 301

Query: 368 RILQDSLGGNSKTVMCATVGPASYNYEETISTLRYASRVKKICNKARVNQD 418
           R+LQDSLGG +KT + AT+ PAS N EET+STL YA R K I NK  VNQ 
Sbjct: 302 RLLQDSLGGRTKTSIIATISPASINLEETLSTLEYAHRAKNIKNKPEVNQK 352


>gnl|CDD|238670 cd01374, KISc_CENP_E, Kinesin motor domain, CENP-E/KIP2-like
           subgroup, involved in chromosome movement and/or spindle
           elongation during mitosis. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Kinesins are microtubule-dependent
           molecular motors that play important roles in
           intracellular transport and in cell division. In most
           kinesins, the motor domain is found at the N-terminus
           (N-type). N-type kinesins are (+) end-directed motors,
           i.e. they transport cargo towards the (+) end of the
           microtubule. Kinesin motor domains hydrolyze ATP at a
           rate of about 80 per second, and move along the
           microtubule at a speed of about 6400 Angstroms per
           second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 321

 Score =  331 bits (850), Expect = e-108
 Identities = 157/394 (39%), Positives = 215/394 (54%), Gaps = 74/394 (18%)

Query: 17  NVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTV 76
            ++V VRVRPL+ +ESD   V  +  +                        +T++     
Sbjct: 1   KIKVSVRVRPLNPRESDNEQVAWSIDN-----------------------DNTISL---- 33

Query: 77  DLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFA 136
                             P ++FTFD VF   S   ++Y   A+P+V   L G+NGTIFA
Sbjct: 34  --------------EESTPGQSFTFDRVFGGESTNREVYERIAKPVVRSALEGYNGTIFA 79

Query: 137 YGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEEIR 196
           YGQT +GKTFTM G    P   GIIP +   IF  I    +  +FL+RVSY EIYNE+I+
Sbjct: 80  YGQTSSGKTFTMSGDEQEP---GIIPLAVRDIFQRIQDTPDR-EFLLRVSYLEIYNEKIK 135

Query: 197 DLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNKN 256
           DLLS +  +              L ++E P+ GV V  L+  I  + + + ++++ G KN
Sbjct: 136 DLLSPSPQE--------------LRIREDPNKGVVVAGLTEEIVTSPEHLLQLIARGEKN 181

Query: 257 RSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDLA 316
           R VG T  NE SSRSH IF + IE+                     G+V++  L+L+DLA
Sbjct: 182 RHVGETDFNERSSRSHTIFQLTIESRERG-------------DSESGTVRVSTLNLIDLA 228

Query: 317 GSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKC-THIPYRNSKLTRILQDSLG 375
           GSER ++TGA G+R KE S IN SL TLG VIS L +GK   HIPYR+SKLTRILQ SL 
Sbjct: 229 GSERASQTGA-GERRKEGSFINKSLLTLGTVISKLSEGKNSGHIPYRDSKLTRILQPSLS 287

Query: 376 GNSKTVMCATVGPASYNYEETISTLRYASRVKKI 409
           GN++T +  T+ PAS + EET++TL++ASR KK+
Sbjct: 288 GNARTAIICTISPASSHVEETLNTLKFASRAKKV 321


>gnl|CDD|238669 cd01373, KISc_KLP2_like, Kinesin motor domain, KLP2-like subgroup.
           Members of this subgroup seem to play a role in mitosis
           and meiosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 337

 Score =  295 bits (756), Expect = 3e-94
 Identities = 157/410 (38%), Positives = 221/410 (53%), Gaps = 91/410 (22%)

Query: 17  NVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTV 76
            V+V VR+RP ++ E+D G          + ++   +P                      
Sbjct: 2   AVKVVVRIRPPNEIEADGGQGQCLKKLSSDTLVWHSHP---------------------- 39

Query: 77  DLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFA 136
                              P+ FTFD V D+N+ Q D++    +P+V+  LSG+NG+IFA
Sbjct: 40  -------------------PRMFTFDHVADSNTNQEDVFQSVGKPLVEDCLSGYNGSIFA 80

Query: 137 YGQTGTGKTFTMEG-----VNNVPELKGIIPNSFAHIFGHI----AKADESVKFLVRVSY 187
           YGQTG+GKT+TM G       +   L+G+IP  F ++F  I     K  + +KFL + S+
Sbjct: 81  YGQTGSGKTYTMMGPSSSDDESPHGLQGVIPRIFEYLFSLIQREEEKRGDGLKFLCKCSF 140

Query: 188 FEIYNEEIRDLL---SKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNAD 244
            EIYNE+I DLL   S+N                 L+++E    GVYV++L+    ++ +
Sbjct: 141 LEIYNEQITDLLDPTSRN-----------------LKIREDIKKGVYVENLTEEYVSSYE 183

Query: 245 DMEKIMSLGNKNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGS 304
           D+ +++  G  NR V AT+MN  SSRSHA+F+  IE+                W +   S
Sbjct: 184 DVYQVLLKGLSNRKVAATSMNSESSRSHAVFTCTIES----------------WEKKASS 227

Query: 305 V--QMGRLHLVDLAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVD---GKCTHI 359
              +  RL+LVDLAGSERQ   GA G RLKEA  IN SLSTLG+VI ALVD   GK  H+
Sbjct: 228 TNIRTSRLNLVDLAGSERQKDDGAEGVRLKEAKNINKSLSTLGHVIMALVDVAHGKQRHV 287

Query: 360 PYRNSKLTRILQDSLGGNSKTVMCATVGPASYNYEETISTLRYASRVKKI 409
           PYR+SKLT +L+DSLGGN+KT + A V P+S  + ET+STL++A R K I
Sbjct: 288 PYRDSKLTFLLRDSLGGNAKTTIIANVSPSSKCFGETLSTLKFAQRAKLI 337


>gnl|CDD|238671 cd01375, KISc_KIF9_like, Kinesin motor domain, KIF9-like subgroup;
           might play a role in cell shape remodeling. This
           catalytic (head) domain has ATPase activity and belongs
           to the larger group of P-loop NTPases. Kinesins are
           microtubule-dependent molecular motors that play
           important roles in intracellular transport and in cell
           division. In most kinesins, the motor domain is found at
           the N-terminus (N-type). N-type kinesins are (+)
           end-directed motors, i.e. they transport cargo towards
           the (+) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 334

 Score =  287 bits (735), Expect = 4e-91
 Identities = 151/390 (38%), Positives = 205/390 (52%), Gaps = 57/390 (14%)

Query: 18  VRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTVD 77
           ++VFVRVRP              T       I +     S           +  P   V 
Sbjct: 2   IQVFVRVRP--------------TPTKQGSSIKLGPDGKSV---------SSNLPKDLVR 38

Query: 78  LLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFAY 137
                  V N          +F FD VF  N+ Q ++Y   A+P+VD  L G+NGTIFAY
Sbjct: 39  ------GVVNNQQEDF----SFKFDGVFH-NASQEEVYETVAKPVVDSALDGYNGTIFAY 87

Query: 138 GQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEEIRD 197
           GQTG GKTFTM G     + +G+IP +   +F  +A    +  + V VSY EIYNE++ D
Sbjct: 88  GQTGAGKTFTMTGGTESYKDRGLIPRALEQVFREVAM-RATKTYTVHVSYLEIYNEQLYD 146

Query: 198 LLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNKNR 257
           LL    +    +          + + E  +  ++VK LS + A   ++   ++ LG  NR
Sbjct: 147 LLGDTPEALESLPA--------VTILEDSEQNIHVKGLSLHSATTEEEALNLLFLGETNR 198

Query: 258 SVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDLAG 317
           ++  T+MN+ SSRSH IF+I +E+ S   G                 V++ +L+LVDLAG
Sbjct: 199 TIAETSMNQASSRSHCIFTIHLESRSREAGSEV--------------VRLSKLNLVDLAG 244

Query: 318 SERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRILQDSLGGN 377
           SER +KTG SGQ LKEA  IN SLS L  VI+AL +   TH+PYRNSKLT +L+DSLGGN
Sbjct: 245 SERVSKTGVSGQVLKEAKYINKSLSFLEQVINALSEKARTHVPYRNSKLTHVLRDSLGGN 304

Query: 378 SKTVMCATVGPASYNYEETISTLRYASRVK 407
            KTVM AT+     N +ET+STLR+A RV 
Sbjct: 305 CKTVMLATIWVEPSNLDETLSTLRFAQRVA 334


>gnl|CDD|227392 COG5059, KIP1, Kinesin-like protein [Cytoskeleton].
          Length = 568

 Score =  281 bits (720), Expect = 5e-86
 Identities = 168/508 (33%), Positives = 257/508 (50%), Gaps = 64/508 (12%)

Query: 68  DTVTPYSTVDLLNGIITVKNPAGSAHE-----PPKTFTFDTVFDANSKQLDIYNETARPI 122
             +     +        + N +  +H         T+ FD VF  ++ Q D+Y ET +P+
Sbjct: 22  SDIKSTIRIIPGELGERLINTSKKSHVSLEKSKEGTYAFDKVFGPSATQEDVYEETIKPL 81

Query: 123 VDKVLSGFNGTIFAYGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFL 182
           +D +L G+N T+FAYGQTG+GKT+TM G     E  GIIP S   +F  +     +  F 
Sbjct: 82  IDSLLLGYNCTVFAYGQTGSGKTYTMSGT---EEEPGIIPLSLKELFSKLEDLSMTKDFA 138

Query: 183 VRVSYFEIYNEEIRDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANN 242
           V +SY EIYNE+I DLLS N++               L ++E   +GV V  L+    ++
Sbjct: 139 VSISYLEIYNEKIYDLLSPNEES--------------LNIREDSLLGVKVAGLTEKHVSS 184

Query: 243 ADDMEKIMSLGNKNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQ 302
            +++  ++  G KNR+  +T +N+ SSRSH+IF I + + + + G   T           
Sbjct: 185 KEEILDLLRKGEKNRTTASTEINDESSRSHSIFQIELASKNKVSGTSETS---------- 234

Query: 303 GSVQMGRLHLVDLAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVD-GKCTHIPY 361
                 +L LVDLAGSER A+TG  G RLKE + IN SL TLGNVI+AL D  K  HIPY
Sbjct: 235 ------KLSLVDLAGSERAARTGNRGTRLKEGASINKSLLTLGNVINALGDKKKSGHIPY 288

Query: 362 RNSKLTRILQDSLGGNSKTVMCATVGPASYNYEETISTLRYASRVKKICNKARVNQDPKD 421
           R SKLTR+LQDSLGGN  T +  T+ P+S ++EETI+TL++ASR K I NK +VN     
Sbjct: 289 RESKLTRLLQDSLGGNCNTRVICTISPSSNSFEETINTLKFASRAKSIKNKIQVNS---- 344

Query: 422 ALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKR 481
                  +EI E+K  L E       E        E+       + S+       A ++ 
Sbjct: 345 --SSDSSREIEEIKFDLSED----RSEI-------EILVFREQSQLSQSSLSGIFAYMQS 391

Query: 482 AQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEH--------RKQKEE 533
            + + +++  +++L+ K ++ G     +  + +     +  Q L          R+++  
Sbjct: 392 LKKETETLKSRIDLIMKSIISGTFERKKLLKEEGWKYKSTLQFLRIEIDRLLLLREEELS 451

Query: 534 ELKKNIQEKDAERIDMEEKYSSLQEESE 561
           + K  I + +  R D+    SS+ EE+ 
Sbjct: 452 KKKTKIHKLNKLRHDLSSLLSSIPEETS 479



 Score = 31.2 bits (71), Expect = 2.0
 Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 20/101 (19%)

Query: 254 NKNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLV 313
           +K RS  +T +N  SSRSH+ F   +  S+      +T E                L+ V
Sbjct: 488 SKLRSSASTKLNLRSSRSHSKFRDHLNGSN-----SSTKEL--------------SLNQV 528

Query: 314 DLAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDG 354
           DLAGSER+    + G+ L+E   +N SLS+LG+VI AL   
Sbjct: 529 DLAGSERKVSQ-SVGELLRETQSLNKSLSSLGDVIHALGSK 568


>gnl|CDD|238663 cd01367, KISc_KIF2_like, Kinesin motor domain, KIF2-like group.
           KIF2 is a protein expressed in neurons, which has been
           associated with axonal transport and neuron development;
           alternative splice forms have been implicated in
           lysosomal translocation. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Kinesins are microtubule-dependent
           molecular motors that play important roles in
           intracellular transport and in cell division. In this
           subgroup the motor domain is found in the middle
           (M-type) of the protein chain. M-type kinesins are (+)
           end-directed motors, i.e. they transport cargo towards
           the (+) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second (KIF2 may be slower). To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 322

 Score =  272 bits (699), Expect = 6e-86
 Identities = 136/395 (34%), Positives = 191/395 (48%), Gaps = 76/395 (19%)

Query: 16  ENVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEP-PKTFTFDTVTPYS 74
             + V VR RPL+ KE   G  D+ + +  N  +TV       HEP  K           
Sbjct: 1   MKITVAVRKRPLNDKELSKGETDVVSCES-NPTVTV-------HEPKTK----------- 41

Query: 75  TVDLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTI 134
            VDL    I              TF FD VFD      ++Y  T +P++  V  G   T 
Sbjct: 42  -VDL-TKYIE-----------KHTFRFDYVFDEAVTNEEVYRSTVKPLIPHVFEGGVATC 88

Query: 135 FAYGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEE 194
           FAYGQTG+GKT+TM G  N     G+   +   IF  +A+ ++     V VS+FEIY  +
Sbjct: 89  FAYGQTGSGKTYTMLGDENQE---GLYALAARDIFRLLAQPND--DLGVTVSFFEIYGGK 143

Query: 195 IRDLLSKNKD-QCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLG 253
           + DLL+  K    LE  +                  V +  L+     + D++ +++  G
Sbjct: 144 LFDLLNDRKRLSVLEDGKG----------------NVQIVGLTEKPVTSVDELLELIESG 187

Query: 254 NKNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLV 313
           N  R+ G+T  N+ SSRSHAI  II++                     + +  +G+L  +
Sbjct: 188 NSLRTTGSTGANDQSSRSHAILQIILKN-------------------KKLNKLLGKLSFI 228

Query: 314 DLAGSERQAKTGASG-QRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRILQD 372
           DLAGSER A T     Q  KE ++IN SL  L   I AL   K  H+P+R SKLT++L+D
Sbjct: 229 DLAGSERGADTSEHDRQTRKEGAEINKSLLALKECIRALASNK-AHVPFRGSKLTQVLRD 287

Query: 373 SLGGNSKTVMCATVGPASYNYEETISTLRYASRVK 407
           S  GNSKTVM AT+ P++ + E T++TLRYA RVK
Sbjct: 288 SFIGNSKTVMIATISPSASSCEHTLNTLRYADRVK 322


>gnl|CDD|238672 cd01376, KISc_KID_like, Kinesin motor domain, KIF22/Kid-like
           subgroup. Members of this group might play a role in
           regulating chromosomal movement along microtubules in
           mitosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 319

 Score =  238 bits (610), Expect = 5e-73
 Identities = 134/392 (34%), Positives = 189/392 (48%), Gaps = 74/392 (18%)

Query: 17  NVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTV 76
           NVRV VRVRP    E D       +   + GI + +  A S                   
Sbjct: 1   NVRVVVRVRPFLDCEED-------SSSCVRGIDSDQGQAKS------------------- 34

Query: 77  DLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFA 136
                 + ++NP        K + FD  +     Q DI++   +PIV  +LSG N T+FA
Sbjct: 35  ------VEIENP--RNRGETKKYQFDAFYGTECTQEDIFSREVKPIVPHLLSGQNATVFA 86

Query: 137 YGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEEIR 196
           YG TG GKT TM G    P   G+IP + + +     K   +  F   +SY+EIYNE++ 
Sbjct: 87  YGSTGAGKTHTMLG---DPNEPGLIPRTLSDLLRMGRKQAWTGAFS--MSYYEIYNEKVY 141

Query: 197 DLLS-KNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNK 255
           DLL    K+               L ++E  D  + +  L++    +  + E+     +K
Sbjct: 142 DLLEPAKKE---------------LPIREDKDGNILIVGLTSKPIKSMAEFEEAYIPASK 186

Query: 256 NRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDL 315
           NR+V AT +N+ SSRSHA+  I + T      +                   G+L+L+DL
Sbjct: 187 NRTVAATKLNDNSSRSHAVLRIKV-TQPASNIQLE-----------------GKLNLIDL 228

Query: 316 AGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRILQDSLG 375
           AGSE   +TG  G RLKE++ IN SL  L  V+ AL  G    IPYR SKLTR+LQDSLG
Sbjct: 229 AGSEDNRRTGNEGIRLKESAAINSSLFVLSKVVDALNKGL-PRIPYRESKLTRLLQDSLG 287

Query: 376 GNSKTVMCATVGPASYNYEETISTLRYASRVK 407
           G S+ +M A + P    Y++T+STL +ASR K
Sbjct: 288 GGSRCIMVANIAPERSFYQDTLSTLNFASRSK 319


>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
          Length = 1320

 Score =  249 bits (636), Expect = 4e-70
 Identities = 140/341 (41%), Positives = 197/341 (57%), Gaps = 42/341 (12%)

Query: 97  KTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFAYGQTGTGKTFTMEGVNNV-- 154
           +TFTFD++ D  S Q DI+     P+V+  L+GFN ++FAYGQTG+GKT+TM G  N   
Sbjct: 132 QTFTFDSIADPESTQEDIFQLVGAPLVENCLAGFNSSVFAYGQTGSGKTYTMWGPANGLL 191

Query: 155 -----PELKGIIPNSFAHIFGHIAK-----ADESVKFLVRVSYFEIYNEEIRDLLSKNKD 204
                 + +G+ P  F  +F  I +     AD  +K+  R S+ EIYNE+I DLL     
Sbjct: 192 EEHLSGDQQGLTPRVFERLFARINEEQIKHADRQLKYQCRCSFLEIYNEQITDLL----- 246

Query: 205 QCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNKNRSVGATAM 264
                    D     L+++E    GVYV++L+        D+ +++  G  NR  GAT++
Sbjct: 247 ---------DPSQKNLQIREDVKSGVYVENLTEEYVKTMKDVTQLLIKGLSNRRTGATSI 297

Query: 265 NEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDLAGSERQAKT 324
           N  SSRSH++F+ ++E+              +  A+   S +  R++LVDLAGSERQ  T
Sbjct: 298 NAESSRSHSVFTCVVESRC------------KSVADGLSSFKTSRINLVDLAGSERQKLT 345

Query: 325 GASGQRLKEASKINLSLSTLGNVISALVD----GKCTHIPYRNSKLTRILQDSLGGNSKT 380
           GA+G RLKEA  IN SLS LGN+I+ L +    GK  HIPYR+S+LT +LQ+SLGGN+K 
Sbjct: 346 GAAGDRLKEAGNINRSLSQLGNLINILAEISQTGKQRHIPYRDSRLTFLLQESLGGNAKL 405

Query: 381 VMCATVGPASYNYEETISTLRYASRVKKICNKARVNQDPKD 421
            M   + P+     ET STLR+A R K I NKA VN+  +D
Sbjct: 406 AMVCAISPSQSCKSETFSTLRFAQRAKAIKNKAVVNEVMQD 446


>gnl|CDD|238664 cd01368, KISc_KIF23_like, Kinesin motor domain, KIF23-like
           subgroup. Members of this group may play a role in
           mitosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 345

 Score =  220 bits (563), Expect = 7e-66
 Identities = 119/393 (30%), Positives = 185/393 (47%), Gaps = 57/393 (14%)

Query: 16  ENVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYST 75
           + V+V++RVRPLSK E ++        +    I  + +     H P  +           
Sbjct: 1   DPVKVYLRVRPLSKDELES--------EDEGCIEVINSTTIQLHPPKGSAAR-------- 44

Query: 76  VDLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIF 135
                     K+      +    F+F  VF  N+ Q + +  TA P+V  +L G N  +F
Sbjct: 45  ----------KSERNGG-QKETKFSFSKVFGPNTTQKEFFEGTALPLVQDLLKGKNSLLF 93

Query: 136 AYGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEEI 195
            YG T +GKT+TM+G    P   GI+P S   IF  I        + V VSY EIYN  I
Sbjct: 94  TYGVTNSGKTYTMQGS---PGDGGILPRSLDVIFNSIGG------YSVFVSYVEIYNNYI 144

Query: 196 RDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNK 255
            DLL  +     + +         L ++E  +  +YV  L+    ++ ++  ++   G K
Sbjct: 145 YDLLEDSPSSTKKRQS--------LRLREDHNGNMYVAGLTEVEVSSTEEAREVFKRGQK 196

Query: 256 NRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDL 315
           NR V  T +N  SSRSH++F+I +  + G        +        +  + + +L LVDL
Sbjct: 197 NRRVAGTKLNRESSRSHSVFTIKLVQAPGDSDGDVDQD--------KDQITVSQLSLVDL 248

Query: 316 AGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVD-----GKCTHIPYRNSKLTRIL 370
           AGSER ++T  +G+RLKEA  IN SL TLG  I  L +          +PYR+SKLT + 
Sbjct: 249 AGSERTSRTQNTGERLKEAGNINTSLMTLGKCIEVLRENQLSGSTNKMVPYRDSKLTHLF 308

Query: 371 QDSLGGNSKTVMCATVGPASYNYEETISTLRYA 403
           Q+   G  K  M   V P + +Y+ET+  ++++
Sbjct: 309 QNYFDGEGKARMIVNVNPCASDYDETLHVMKFS 341


>gnl|CDD|238659 cd01363, Motor_domain, Myosin and Kinesin motor domain. These
           ATPases belong to the P-loop NTPase family and provide
           the driving force in myosin and kinesin mediated
           processes.
          Length = 186

 Score =  151 bits (383), Expect = 2e-42
 Identities = 77/157 (49%), Positives = 99/157 (63%), Gaps = 14/157 (8%)

Query: 232 VKDLSAYIANNADDMEKIMSLGNKNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTT 291
            ++ +  I     D+  +M  GN NR+  ATAMNE SSRSH++F I        GGK   
Sbjct: 44  KREGAGIIPRTVTDVIDLMDKGNANRTTAATAMNEHSSRSHSVFRIH------FGGKNAL 97

Query: 292 GENWRLWAEAQGSVQMGRLHLVDLAGSERQAKTGASGQRLKEASKINLSLSTLGNVISAL 351
                  A A    ++G+++LVDLAGSER   +GA G RL E + IN SLSTLGNVISAL
Sbjct: 98  -------ASATEQPKVGKINLVDLAGSERIDFSGAEGSRLTETANINKSLSTLGNVISAL 150

Query: 352 VDGKCTHIPYRNSKLTRILQDSLGGNSKTVMCATVGP 388
            +   +H+PYR SKLTR+LQDSLGGNS+T+M A + P
Sbjct: 151 AERD-SHVPYRESKLTRLLQDSLGGNSRTLMVACISP 186



 Score = 67.9 bits (166), Expect = 5e-13
 Identities = 24/59 (40%), Positives = 33/59 (55%), Gaps = 5/59 (8%)

Query: 114 IYNETARPIVDKVLSGFNGTIFAYGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHI 172
           ++ +   P++   L G+N  IFAYGQTG+GKT+TMEG     E  GIIP         +
Sbjct: 8   VFRDVG-PLLQSALDGYNVCIFAYGQTGSGKTYTMEG---KREGAGIIP-RTVTDVIDL 61


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 52.8 bits (127), Expect = 5e-07
 Identities = 44/215 (20%), Positives = 98/215 (45%), Gaps = 27/215 (12%)

Query: 431 ISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSD----- 485
           +S+ K++ EEA             E +L+ TE  +E  E + +E + +L++ +       
Sbjct: 167 VSKYKERKEEA-------------ERKLERTEENLERLEDLLEELEKQLEKLERQAEKAE 213

Query: 486 -HQSMVEKLELLQKKVLVGG--ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEK 542
            +Q +  +L  L+  +L+    E   E  E++++L     +  E +++ EE  K+ I+E 
Sbjct: 214 RYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKE-IEEL 272

Query: 543 DAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQI 602
            +E  ++ E+   LQEE   L ++++++   +   +E  +    E   E+E   + + ++
Sbjct: 273 KSELEELREELEELQEELLELKEEIEELEGEISLLRERLE----ELENELEELEERLEEL 328

Query: 603 TKEYSYQQAIIQKFIPTKYLELMTQNATWNEDIGE 637
            ++    +  +++   T   EL    A   E   E
Sbjct: 329 KEKIEALKEELEE-RETLLEELEQLLAELEEAKEE 362



 Score = 48.6 bits (116), Expect = 1e-05
 Identities = 50/203 (24%), Positives = 101/203 (49%), Gaps = 17/203 (8%)

Query: 394 EETISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPN 453
           +  +  L  A  + K+  + R   +  +  L + ++E+ EL+++LEEA            
Sbjct: 219 KAELRELELALLLAKL-KELRKELEELEEELSRLEEELEELQEELEEAEK---------- 267

Query: 454 PEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEI 513
              E++E +  +E      +E   EL   + + + +  ++ LL++++      L E  E 
Sbjct: 268 ---EIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEER 324

Query: 514 QQQL---LDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV 570
            ++L   ++A  +ELE R+   EEL++ + E +  + ++EEK S+L EE E L + L++ 
Sbjct: 325 LEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREE 384

Query: 571 SSMLLAAKEERKDVQRENRKEIE 593
            + L A   E ++   E ++EIE
Sbjct: 385 LAELEAELAEIRNELEELKREIE 407



 Score = 48.2 bits (115), Expect = 2e-05
 Identities = 43/156 (27%), Positives = 84/156 (53%), Gaps = 10/156 (6%)

Query: 428 QKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQ 487
           Q+E+ EL+++LEEA      E      E EL+  E   E  E   +E + E++  +    
Sbjct: 792 QEELEELEEELEEA------ERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLD 845

Query: 488 SMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERI 547
            + E+LE L+K++    E L E+ E  +   +    EL+  ++++EEL++ ++E ++E  
Sbjct: 846 ELEEELEELEKEL----EELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELA 901

Query: 548 DMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKD 583
           +++E+   L+E  E L  KL+++   L   +EE ++
Sbjct: 902 ELKEEIEKLRERLEELEAKLERLEVELPELEEELEE 937



 Score = 45.1 bits (107), Expect = 1e-04
 Identities = 28/144 (19%), Positives = 72/144 (50%), Gaps = 7/144 (4%)

Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQK 531
           E E   E ++A  +    +E+     ++ L   + L  + E  +Q  +   QE+E  +++
Sbjct: 780 EIEELEEKRQALQEELEELEEELEEAERRL---DALERELESLEQRRERLEQEIEELEEE 836

Query: 532 EEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKE 591
            EEL++ + E + E  ++E++   L+EE E L  + +++   L   +EE+++++    +E
Sbjct: 837 IEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELE----EE 892

Query: 592 IEGFLDNVRQITKEYSYQQAIIQK 615
           +      + ++ +E    +  +++
Sbjct: 893 LRELESELAELKEEIEKLRERLEE 916



 Score = 44.7 bits (106), Expect = 2e-04
 Identities = 31/124 (25%), Positives = 67/124 (54%), Gaps = 11/124 (8%)

Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEE 533
           + + ELK  +++ +S+ + LE L+++        LE+ E Q + L    +EL   +++ E
Sbjct: 685 KLEEELKSLKNELRSLEDLLEELRRQ--------LEELERQLEELK---RELAALEEELE 733

Query: 534 ELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIE 593
           +L+  ++E + E  ++EE+   LQE  E L ++L+ +   L   KEE ++++ + +   E
Sbjct: 734 QLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQE 793

Query: 594 GFLD 597
              +
Sbjct: 794 ELEE 797



 Score = 43.9 bits (104), Expect = 3e-04
 Identities = 35/161 (21%), Positives = 80/161 (49%), Gaps = 5/161 (3%)

Query: 457 ELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQ 516
           EL E E  +E  E        EL+  +   + +  +LE L++++    E L  +    ++
Sbjct: 675 ELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQL----EELKRELAALEE 730

Query: 517 LLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLA 576
            L+     LE  +++ EEL++ ++E      ++EE+  SL+E    L ++++++     A
Sbjct: 731 ELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQA 790

Query: 577 AKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFI 617
            +EE ++++ E  +E E  LD + +  +    ++  +++ I
Sbjct: 791 LQEELEELEEE-LEEAERRLDALERELESLEQRRERLEQEI 830



 Score = 43.9 bits (104), Expect = 3e-04
 Identities = 33/145 (22%), Positives = 71/145 (48%), Gaps = 3/145 (2%)

Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGG---ENLLEKAEIQQQLLDAAAQELEHRK 529
           +E + E++  +S+ + + E+LE LQ+++L      E L  +  + ++ L+    ELE  +
Sbjct: 263 EEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELE 322

Query: 530 QKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR 589
           ++ EELK+ I+    E  + E     L++    L +  +++   L A  EE +++    R
Sbjct: 323 ERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALR 382

Query: 590 KEIEGFLDNVRQITKEYSYQQAIIQ 614
           +E+      + +I  E    +  I+
Sbjct: 383 EELAELEAELAEIRNELEELKREIE 407



 Score = 43.2 bits (102), Expect = 6e-04
 Identities = 50/229 (21%), Positives = 100/229 (43%), Gaps = 21/229 (9%)

Query: 394 EETISTLRYASRVKKICNKARVNQDPKDALLIKYQK---EISELKKKLEEASVPPPREEP 450
           E+ I  L+  S ++++  +    Q+    L  + ++   EIS L+++LEE        E 
Sbjct: 266 EKEIEELK--SELEELREELEELQEELLELKEEIEELEGEISLLRERLEELEN---ELEE 320

Query: 451 TPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK 510
                 EL E    ++      +    EL++  ++ +   E+LE     +L   E L E 
Sbjct: 321 LEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEA 380

Query: 511 AEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV 570
                + L     EL   + + EELK+ I+         EE+   L E  E L ++LK++
Sbjct: 381 LR---EELAELEAELAEIRNELEELKREIESL-------EERLERLSERLEDLKEELKEL 430

Query: 571 SSMLLAAKEERKDVQRENRK---EIEGFLDNVRQITKEYSYQQAIIQKF 616
            + L   + E +++  E  +   ++E   D ++++ +E +  Q  +Q+ 
Sbjct: 431 EAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRL 479



 Score = 39.7 bits (93), Expect = 0.006
 Identities = 38/179 (21%), Positives = 89/179 (49%), Gaps = 4/179 (2%)

Query: 428 QKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQ 487
           ++++ ELK++L          E   +   EL+E    +E       E   EL+      +
Sbjct: 715 ERQLEELKRELAALEE---ELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLE 771

Query: 488 SMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERI 547
             + KL+   +++    + L E+ E  ++ L+ A + L+  +++ E L++  +  + E  
Sbjct: 772 EALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIE 831

Query: 548 DMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEY 606
           ++EE+   L+E+ + L ++L+++   L   KEE ++++ E ++E+E  L  + +  +E 
Sbjct: 832 ELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAE-KEELEDELKELEEEKEEL 889



 Score = 38.9 bits (91), Expect = 0.010
 Identities = 39/182 (21%), Positives = 92/182 (50%), Gaps = 10/182 (5%)

Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSD 485
             ++ +++LK+++EE        +       EL+E E  +E +E   D  + EL+  +  
Sbjct: 769 SLEEALAKLKEEIEELEEKRQALQE------ELEELEEELEEAERRLDALERELESLEQR 822

Query: 486 HQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAE 545
            + + +++E L++++    E L EK +  ++ L+   +ELE  K++ EEL+   +E + E
Sbjct: 823 RERLEQEIEELEEEI----EELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDE 878

Query: 546 RIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKE 605
             ++EE+   L+EE   L  +L ++   +   +E  ++++ +  +      +   ++ +E
Sbjct: 879 LKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEE 938

Query: 606 YS 607
           Y 
Sbjct: 939 YE 940



 Score = 37.8 bits (88), Expect = 0.026
 Identities = 28/144 (19%), Positives = 76/144 (52%), Gaps = 7/144 (4%)

Query: 476 DAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEEL 535
           + EL++ QS  + + E+LE L+++ L   +  LE+ E + + L+ A  +L+   ++ EE 
Sbjct: 729 EEELEQLQSRLEELEEELEELEEE-LEELQERLEELEEELESLEEALAKLKEEIEELEEK 787

Query: 536 KKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRK---EI 592
           ++ +QE   E  ++EE+    +   + L ++L+ +       ++E ++++ E  +   ++
Sbjct: 788 RQALQE---ELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKL 844

Query: 593 EGFLDNVRQITKEYSYQQAIIQKF 616
           +   + + ++ KE    +  +++ 
Sbjct: 845 DELEEELEELEKELEELKEELEEL 868



 Score = 37.4 bits (87), Expect = 0.030
 Identities = 19/106 (17%), Positives = 52/106 (49%), Gaps = 3/106 (2%)

Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
           + L E+    +  L+   +EL+  K +   L+  ++E   +  ++E +   L+ E   L 
Sbjct: 670 KELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALE 729

Query: 565 KKLKKVSSMLLAAKEERKDVQR---ENRKEIEGFLDNVRQITKEYS 607
           ++L+++ S L   +EE ++++    E ++ +E   + +  + +  +
Sbjct: 730 EELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALA 775



 Score = 34.3 bits (79), Expect = 0.28
 Identities = 40/186 (21%), Positives = 80/186 (43%), Gaps = 28/186 (15%)

Query: 428  QKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQ 487
            ++E+ EL+ + EE                EL E E   E  E    E ++EL   + + +
Sbjct: 862  KEELEELEAEKEELED-------------ELKELEEEKEELEEELRELESELAELKEEIE 908

Query: 488  SMVEKLELLQKKVLVGGENLLEKAE-IQQQLLDAAAQELEHRKQKEEELKKNIQEKDAER 546
             + E+LE L+ K+      L E  E ++++  D    ELE   ++ EE  + +   +   
Sbjct: 909  KLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRA 968

Query: 547  IDM----EEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQI 602
            I+     EE+Y  L+ + E L +  +K+  ++    +E+++           F +   +I
Sbjct: 969  IEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKEKRER----------FKETFDKI 1018

Query: 603  TKEYSY 608
             + +S 
Sbjct: 1019 NENFSE 1024



 Score = 33.9 bits (78), Expect = 0.32
 Identities = 32/141 (22%), Positives = 64/141 (45%), Gaps = 18/141 (12%)

Query: 424  LIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQ 483
            L + ++E  EL+++L E              E EL E +  +E      +E +A+L+R +
Sbjct: 879  LKELEEEKEELEEELREL-------------ESELAELKEEIEKLRERLEELEAKLERLE 925

Query: 484  SDHQSMVEKL-ELLQKKVLVGGENLLEKAEIQQQLLD----AAAQELEHRKQKEEELKKN 538
             +   + E+L E  +  +    E  +E+ E + + L      A +E E  +++ EELK  
Sbjct: 926  VELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQ 985

Query: 539  IQEKDAERIDMEEKYSSLQEE 559
             ++ +  +  + E    L +E
Sbjct: 986  REDLEEAKEKLLEVIEELDKE 1006



 Score = 29.7 bits (67), Expect = 6.2
 Identities = 17/101 (16%), Positives = 46/101 (45%), Gaps = 11/101 (10%)

Query: 508 LEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKL 567
            E  E++++L      +LE  +++ + LK  ++  +    ++  +   L+ + E L ++L
Sbjct: 667 RELKELEEEL-AELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKREL 725

Query: 568 KKVSSMLLAAKEERKDVQRENR---KEIEGFLDNVRQITKE 605
                   A +EE + +Q       +E+E   + + ++ + 
Sbjct: 726 A-------ALEEELEQLQSRLEELEEELEELEEELEELQER 759


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 48.1 bits (115), Expect = 2e-05
 Identities = 33/156 (21%), Positives = 68/156 (43%), Gaps = 10/156 (6%)

Query: 457 ELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKV------LVGGENLLEK 510
           E++E E  +E  E    E +  L   + + + + E+LE L+K++      +      L +
Sbjct: 678 EIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLAR 737

Query: 511 AEIQQQLLDAAAQELEHR----KQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKK 566
            E + + L+    +L       + + EEL++ ++E + E  + E +   L+ + E L ++
Sbjct: 738 LEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEE 797

Query: 567 LKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQI 602
           LK +   L   + E   +  E     E      R+I
Sbjct: 798 LKALREALDELRAELTLLNEEAANLRERLESLERRI 833



 Score = 42.7 bits (101), Expect = 7e-04
 Identities = 50/254 (19%), Positives = 107/254 (42%), Gaps = 42/254 (16%)

Query: 396 TISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPE 455
            +  LR     +++        +     L + ++E+ EL  +L+E              E
Sbjct: 228 ALLVLRLEELREEL--------EELQEELKEAEEELEELTAELQEL-------------E 266

Query: 456 PELDETEPCVEPSEVIEDETDA---------ELKRAQSDHQSMVEKLELLQKKVLVGGEN 506
            +L+E    +E SE+ E+  +          E+ R +   Q + E+L  L++++      
Sbjct: 267 EKLEELR--LEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQ 324

Query: 507 LLE---KAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGL 563
           L E   K +   + L    ++LE  K++ E L+  ++E +AE  ++E +   L+E+ E L
Sbjct: 325 LEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETL 384

Query: 564 SKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFIPTKYLE 623
                KV+ + L       +++R     +E   D   ++ +E    + +++K    +  E
Sbjct: 385 R---SKVAQLELQIASLNNEIER-LEARLERLEDRRERLQQE---IEELLKKLEEAELKE 437

Query: 624 LMTQNATWNEDIGE 637
           L  +     E++ E
Sbjct: 438 LQAELEELEEELEE 451



 Score = 41.6 bits (98), Expect = 0.002
 Identities = 44/235 (18%), Positives = 90/235 (38%), Gaps = 22/235 (9%)

Query: 392 NYEETISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPT 451
             EE +  LR      +      + +      L     EIS L+++ +         E  
Sbjct: 264 ELEEKLEELRLEVSELE----EEIEE--LQKELYALANEISRLEQQKQILRERLANLERQ 317

Query: 452 PNP-EPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK 510
               E +L+E E  ++       E + +L+  + + +S+  +LE L+ ++    E L  +
Sbjct: 318 LEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAEL----EELESR 373

Query: 511 AEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV 570
            E  ++ L+    ++   + +   L   I+  +A    +E++   LQ+E E L KKL++ 
Sbjct: 374 LEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEA 433

Query: 571 -----------SSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQ 614
                          L   +E  +   E  +E+   L+   Q       + A +Q
Sbjct: 434 ELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQ 488



 Score = 40.8 bits (96), Expect = 0.002
 Identities = 46/240 (19%), Positives = 107/240 (44%), Gaps = 7/240 (2%)

Query: 403 ASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPR-EEPTPNPEPELDET 461
            +R++    +           L + + EI EL+++LEEA       E      E ++++ 
Sbjct: 735 LARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQL 794

Query: 462 EPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGG---ENLLEKAEIQQQLL 518
           +  ++      DE  AEL     +  ++ E+LE L++++       E+L E+ E   + +
Sbjct: 795 KEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDI 854

Query: 519 DAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAK 578
           ++ A E+E  ++  EEL+  ++    ER  +EE  + L+ E E LS++L+++ S     +
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELR 914

Query: 579 EERKDVQREN---RKEIEGFLDNVRQITKEYSYQQAIIQKFIPTKYLELMTQNATWNEDI 635
            E ++++ +       +EG    +  + +  S + ++  +       ++          +
Sbjct: 915 RELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRL 974



 Score = 32.0 bits (73), Expect = 1.2
 Identities = 25/102 (24%), Positives = 55/102 (53%), Gaps = 11/102 (10%)

Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
           +++ ++   +++E  ++K  EL+K + E   E  ++EE+   L++E E LS+++      
Sbjct: 675 RRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQI------ 728

Query: 574 LLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQK 615
                  RKD+ R    E+E   + + Q++KE +  +A I++
Sbjct: 729 ----SALRKDLARL-EAEVEQLEERIAQLSKELTELEAEIEE 765


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 46.6 bits (111), Expect = 4e-05
 Identities = 34/186 (18%), Positives = 80/186 (43%), Gaps = 9/186 (4%)

Query: 421 DALLIKYQKEISELKKKLEEAS--VPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAE 478
              + + +K + E+++ LEE +  +    EE     + ++ E E  +   E    E + E
Sbjct: 257 TEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERE 316

Query: 479 LKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKN 538
           L+ A+     +  +++ L  ++    E L  + E +++  D   +E    K++ E+L+  
Sbjct: 317 LEDAEERLAKLEAEIDKLLAEI----EELEREIEEERKRRDKLTEEYAELKEELEDLRAE 372

Query: 539 IQEKDAERIDMEEKYSSLQEESEGLSKK---LKKVSSMLLAAKEERKDVQRENRKEIEGF 595
           ++E D E  +  ++    +E+ E L ++   LK+    L    +   +   +    I G 
Sbjct: 373 LEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGI 432

Query: 596 LDNVRQ 601
              + +
Sbjct: 433 EAKINE 438



 Score = 46.6 bits (111), Expect = 4e-05
 Identities = 42/197 (21%), Positives = 83/197 (42%), Gaps = 34/197 (17%)

Query: 418 DPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDA 477
           +  +  L K ++ +++L+ +L             P  + EL + E  V   E    E + 
Sbjct: 768 EELEEDLHKLEEALNDLEARL--------SHSRIPEIQAELSKLEEEVSRIEARLREIEQ 819

Query: 478 ELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKE----- 532
           +L R   +        E L+K          E  E+Q+Q +D   Q     K+ E     
Sbjct: 820 KLNRLTLE-------KEYLEK----------EIQELQEQRIDLKEQIKSIEKEIENLNGK 862

Query: 533 -EELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSML--LAAKEERKDVQRENR 589
            EEL++ ++E +A   D+E +   L++E + L  +L+++   +  L A+ E+K  +    
Sbjct: 863 KEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSEL 922

Query: 590 KE-IEGFLDNVRQITKE 605
           K  +E   + + +I   
Sbjct: 923 KAKLEALEEELSEIEDP 939



 Score = 46.6 bits (111), Expect = 5e-05
 Identities = 46/190 (24%), Positives = 91/190 (47%), Gaps = 33/190 (17%)

Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSD 485
           K  +EISEL+K+LEE              E  L+E       ++ I+D  + E  R +  
Sbjct: 255 KLTEEISELEKRLEEI-------------EQLLEEL------NKKIKDLGEEEQLRVKEK 295

Query: 486 HQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAE 545
              +  ++  L++ +      L E AE +   L+A   E++    + EEL++ I+E+   
Sbjct: 296 IGELEAEIASLERSIAEKEREL-EDAEERLAKLEA---EIDKLLAEIEELEREIEEERKR 351

Query: 546 RIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQR----------ENRKEIEGF 595
           R  + E+Y+ L+EE E L  +L++V       ++E KD +           E ++E++  
Sbjct: 352 RDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRL 411

Query: 596 LDNVRQITKE 605
            + ++++++E
Sbjct: 412 QEELQRLSEE 421



 Score = 44.3 bits (105), Expect = 2e-04
 Identities = 51/220 (23%), Positives = 92/220 (41%), Gaps = 34/220 (15%)

Query: 421 DALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELK 480
           +A + K   EI EL++++EE      R +       E  E +  +E      +E D E  
Sbjct: 328 EAEIDKLLAEIEELEREIEEERK---RRDKLTE---EYAELKEELEDLRAELEEVDKEFA 381

Query: 481 RAQSDHQSMVEKLELLQKKV--LVGGENLL---------EKAEIQQQLLDAAAQ--ELEH 527
             + + +   EKLE L++++  L    + L         E A++   +    A+  ELE 
Sbjct: 382 ETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEE 441

Query: 528 RKQ-KEEELKK---NIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKD 583
            K+ K  E+KK    +++  A+    E++   L+EE + + K+L K+   L  A+ + + 
Sbjct: 442 EKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARA 501

Query: 584 VQRENR----------KEIEGFLDNVRQITK-EYSYQQAI 612
            +   R            I+G    V Q+      Y  AI
Sbjct: 502 SEERVRGGRAVEEVLKASIQGVHGTVAQLGSVGERYATAI 541



 Score = 41.6 bits (98), Expect = 0.002
 Identities = 41/197 (20%), Positives = 87/197 (44%), Gaps = 20/197 (10%)

Query: 428 QKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQ 487
           +KEI +L+++ E+        E     E +L   E  +E  +    E +A ++  + D  
Sbjct: 722 EKEIEQLEQEEEKLK------ERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLH 775

Query: 488 SMVEKLELLQKKVLVGG----ENLLEKAEIQQQLLDAAAQELE----HRKQKEEELKKNI 539
            + E L  L+ ++        +  L K E +   ++A  +E+E        ++E L+K I
Sbjct: 776 KLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEI 835

Query: 540 QEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQ-RENRKEIEGFLDN 598
           QE   +RID++E+  S+++E E L+ K ++     L  + E  +   R+    +      
Sbjct: 836 QELQEQRIDLKEQIKSIEKEIENLNGKKEE-----LEEELEELEAALRDLESRLGDLKKE 890

Query: 599 VRQITKEYSYQQAIIQK 615
             ++  +    +  I++
Sbjct: 891 RDELEAQLRELERKIEE 907



 Score = 38.5 bits (90), Expect = 0.013
 Identities = 38/169 (22%), Positives = 80/169 (47%), Gaps = 14/169 (8%)

Query: 457 ELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLL-------- 508
           EL+E E  +E  ++I DE   +L+R + + +       LL++K    G  LL        
Sbjct: 178 ELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALER 237

Query: 509 EKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSL-QEESEGLSKKL 567
           +K  I++QL  +  +ELE   ++  EL+K ++E +    ++ +K   L +EE   + +K+
Sbjct: 238 QKEAIERQL-ASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKI 296

Query: 568 KKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKF 616
            ++ + +       +    E  +E+E   + + ++  E     A I++ 
Sbjct: 297 GELEAEI----ASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEEL 341



 Score = 35.4 bits (82), Expect = 0.13
 Identities = 38/171 (22%), Positives = 78/171 (45%), Gaps = 17/171 (9%)

Query: 465 VEPSEVIED-----ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENL------LEKAEI 513
           VE  ++I++     E D + ++A  + + + E +E L   +    + L       EKAE 
Sbjct: 153 VERRKIIDEIAGVAEFDRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAER 212

Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
            Q LL     E    +  E   +K   E+  E I+ +   +SL+EE E L++++ ++   
Sbjct: 213 YQALLK----EKREYEGYELLKEKEALERQKEAIERQ--LASLEEELEKLTEEISELEKR 266

Query: 574 LLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFIPTKYLEL 624
           L   ++  +++ ++ +   E     V++   E   + A +++ I  K  EL
Sbjct: 267 LEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKEREL 317



 Score = 33.9 bits (78), Expect = 0.40
 Identities = 25/132 (18%), Positives = 61/132 (46%), Gaps = 15/132 (11%)

Query: 465 VEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQE 524
            E S  +   + A  +      +S   +L+ L+++        LE  + +   L +   E
Sbjct: 649 FEKSGAMTGGSRAP-RGGILFSRSEPAELQRLRER--------LEGLKRE---LSSLQSE 696

Query: 525 LEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDV 584
           L   + + +EL + + +   +  ++E++   L++E E L ++L+++   L + ++E ++V
Sbjct: 697 LRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENV 756

Query: 585 QRE---NRKEIE 593
           + E       IE
Sbjct: 757 KSELKELEARIE 768



 Score = 30.8 bits (70), Expect = 2.9
 Identities = 24/142 (16%), Positives = 70/142 (49%), Gaps = 6/142 (4%)

Query: 474  ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEE 533
            E + EL+  ++  + +  +L  L+K+     + L  +    ++ ++    ++E ++++  
Sbjct: 865  ELEEELEELEAALRDLESRLGDLKKER----DELEAQLRELERKIEELEAQIEKKRKRLS 920

Query: 534  ELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIE 593
            ELK  ++  + E  ++E+     +E  E     L+ V + L   +EE + ++  N   I+
Sbjct: 921  ELKAKLEALEEELSEIEDPKGEDEEIPEEEL-SLEDVQAELQRVEEEIRALEPVNMLAIQ 979

Query: 594  GFLDNVRQITKEYSYQQAIIQK 615
             + + ++++  E   ++A +++
Sbjct: 980  EYEEVLKRL-DELKEKRAKLEE 1000


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 45.8 bits (109), Expect = 8e-05
 Identities = 31/132 (23%), Positives = 67/132 (50%), Gaps = 2/132 (1%)

Query: 455 EPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQ 514
            P ++  E   E  E +E E   +L+    + +  +E+ E L  +     E L E+ E  
Sbjct: 464 SPHVETIEEDRERVEELEAEL-EDLEEEVEEVEERLERAEDL-VEAEDRIERLEERREDL 521

Query: 515 QQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSML 574
           ++L+    + +E ++++ EEL++   E +AE  +  E  +  +EE+E   +++ +++S L
Sbjct: 522 EELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKL 581

Query: 575 LAAKEERKDVQR 586
              KE  + ++R
Sbjct: 582 AELKERIESLER 593



 Score = 31.2 bits (71), Expect = 2.3
 Identities = 29/146 (19%), Positives = 56/146 (38%), Gaps = 23/146 (15%)

Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQL--LDAAAQELEHRK- 529
            E D E++R +   +   E  +          E L E  E +++L  L+A  ++L     
Sbjct: 216 AELDEEIERYEEQREQARETRDEAD-------EVLEEHEERREELETLEAEIEDLRETIA 268

Query: 530 ---QKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVS---SMLLAAKEERKD 583
              ++ EEL + +++      ++EE+   L  E+       + V      L    EE +D
Sbjct: 269 ETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRD 328

Query: 584 VQRENR-------KEIEGFLDNVRQI 602
              E R       +E E   ++   +
Sbjct: 329 RLEECRVAAQAHNEEAESLREDADDL 354



 Score = 31.2 bits (71), Expect = 2.6
 Identities = 33/152 (21%), Positives = 61/152 (40%), Gaps = 8/152 (5%)

Query: 472 EDETDAELKRAQSDH--QSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRK 529
           E    AE  R ++        EK E   +      E   E AE+  +L     + +E   
Sbjct: 534 EKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKL-AELKERIE-SL 591

Query: 530 QKEEELKKNIQEKDAERIDMEEKYSSLQE----ESEGLSKKLKKVSSMLLAAKEERKDVQ 585
           ++   L   I + + E   + EK  +L E      E L++K ++   +     E R +  
Sbjct: 592 ERIRTLLAAIADAEDEIERLREKREALAELNDERRERLAEKRERKRELEAEFDEARIEEA 651

Query: 586 RENRKEIEGFLDNVRQITKEYSYQQAIIQKFI 617
           RE+++  E +L+ V +   E   ++  +Q  I
Sbjct: 652 REDKERAEEYLEQVEEKLDELREERDDLQAEI 683



 Score = 29.2 bits (66), Expect = 8.3
 Identities = 36/164 (21%), Positives = 69/164 (42%), Gaps = 25/164 (15%)

Query: 457 ELDETEPCVEPSE--VIEDETD-AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEI 513
           EL+  E  +E     + E E +  EL     D +  +E+LE          ++LL +A +
Sbjct: 252 ELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELE-------EERDDLLAEAGL 304

Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
                +A     E  + ++EEL+  ++E    R+  +       EE+E L +    +   
Sbjct: 305 DDADAEAVEARREELEDRDEELRDRLEE---CRVAAQAH----NEEAESLREDADDL--- 354

Query: 574 LLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFI 617
                EER +  RE   E+E  L+  R+  ++   +   +++ I
Sbjct: 355 -----EERAEELREEAAELESELEEAREAVEDRREEIEELEEEI 393



 Score = 29.2 bits (66), Expect = 8.5
 Identities = 30/108 (27%), Positives = 50/108 (46%), Gaps = 15/108 (13%)

Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
           E+L E A+     L+  A+EL   +++  EL+  ++E      D  E+   L+EE E L 
Sbjct: 345 ESLREDADD----LEERAEEL---REEAAELESELEEAREAVEDRREEIEELEEEIEELR 397

Query: 565 KK-------LKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKE 605
           ++       L      L   +EER ++ RE   E+E  L   R+  +E
Sbjct: 398 ERFGDAPVDLGNAEDFLEELREERDEL-REREAELEATLRTARERVEE 444


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 44.7 bits (105), Expect = 2e-04
 Identities = 44/221 (19%), Positives = 92/221 (41%), Gaps = 7/221 (3%)

Query: 400  LRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELD 459
            L+ A   KK     +  +D   AL  +  +E  + ++   E  +    EE     E    
Sbjct: 1557 LKKAEEKKKAEEAKKAEEDKNMAL--RKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKK 1614

Query: 460  ETEPCVEPSEVIEDETD----AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQ 515
              E  ++  E+ + E +     +LK+ +++ +   E+L+  +++  +      +KAE  +
Sbjct: 1615 AEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674

Query: 516  QLLDAAAQELEHRKQKEEELKKNIQE-KDAERIDMEEKYSSLQEESEGLSKKLKKVSSML 574
            +  + A +  E  K+  E LKK  +E K AE +  +E     + E    +++  K+ +  
Sbjct: 1675 KKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE 1734

Query: 575  LAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQK 615
               + E    + E  K+ E     +  + KE   +   I+K
Sbjct: 1735 AKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRK 1775



 Score = 36.7 bits (84), Expect = 0.059
 Identities = 46/201 (22%), Positives = 85/201 (42%), Gaps = 11/201 (5%)

Query: 400  LRYASRVKKICNKARVNQDPKDALLIK-----YQKEISELKKKLEEASVPPPREEPTPNP 454
             + A   KK     +  ++ K A   K      +K+    KKK EEA      E      
Sbjct: 1295 AKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAK--KAAEAAKAEA 1352

Query: 455  EPELDETEPCVEPSEVIEDETDAELKRAQSDHQSM--VEKLELLQKKVLVGGENL--LEK 510
            E   DE E   E +E  E + +   K+A +  +     +K +  +KK     +    L+K
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK 1412

Query: 511  AEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV 570
            A   ++  D A ++ E +K+ +E  KK  + K A+    + + +   EE++  +++ KK 
Sbjct: 1413 AAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKA 1472

Query: 571  SSMLLAAKEERKDVQRENRKE 591
                  A+E +K  + + + E
Sbjct: 1473 DEAKKKAEEAKKADEAKKKAE 1493



 Score = 34.0 bits (77), Expect = 0.35
 Identities = 47/203 (23%), Positives = 88/203 (43%), Gaps = 21/203 (10%)

Query: 400  LRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELD 459
             + A+  KK  ++A+  ++ K A   K  +E     KK +EA     ++        EL 
Sbjct: 1502 AKKAAEAKKKADEAKKAEEAKKADEAKKAEE----AKKADEA-----KKAEEKKKADELK 1552

Query: 460  ETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLD 519
            + E   +  E  + E   E K+A+ D    + K E  +K      E +++  E ++++  
Sbjct: 1553 KAEELKKAEEKKKAE---EAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKA 1609

Query: 520  AAAQELEHRKQKEEELKK---------NIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV 570
              A++ E  K K EELKK          +++K+AE     E+    +EE++  + +  K 
Sbjct: 1610 EEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKK 1669

Query: 571  SSMLLAAKEERKDVQRENRKEIE 593
            +       EE K  + + +K  E
Sbjct: 1670 AEEDKKKAEEAKKAEEDEKKAAE 1692



 Score = 33.2 bits (75), Expect = 0.64
 Identities = 41/214 (19%), Positives = 83/214 (38%), Gaps = 15/214 (7%)

Query: 391  YNYEETISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPR--- 447
                +    ++ A   KK   +A+  ++ ++   I+  +E         +A++       
Sbjct: 1220 AEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARK 1279

Query: 448  --EEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVE----KLELLQKKVL 501
              E      + + DE +   E  +  E +  AE  +   + +   E    K +  +KK  
Sbjct: 1280 ADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKK-- 1337

Query: 502  VGGENLLEKAEIQQQLLDAAAQELEHRKQKEE--ELKKNIQEKDAERIDMEEKYSSLQEE 559
               E   + AE  +   +AAA E E  ++K E  E KK   +K A+    + +     +E
Sbjct: 1338 --AEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADE 1395

Query: 560  SEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIE 593
            ++  +++ KK +  L  A   +K      +K  E
Sbjct: 1396 AKKKAEEDKKKADELKKAAAAKKKADEAKKKAEE 1429



 Score = 32.8 bits (74), Expect = 0.90
 Identities = 48/228 (21%), Positives = 97/228 (42%), Gaps = 23/228 (10%)

Query: 401  RYASRVKKI--CNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPEL 458
            + A  +KK    NK +  ++ K A   K + E  E KK  E+        E       E 
Sbjct: 1647 KKAEELKKAEEENKIKAAEEAKKAEEDKKKAE--EAKKAEEDEK---KAAEALKKEAEEA 1701

Query: 459  DETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKL----ELLQKKVLVGGENLLEKAEIQ 514
             + E  ++  E  E +   ELK+A+ +++   E+     E  +KK     ++  EK +I 
Sbjct: 1702 KKAEE-LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIA 1760

Query: 515  QQLLDAAAQELEHRKQKEEELKKNIQEKDAE-RIDMEEKYSSLQEESEGLSKKLKKVSSM 573
                +   +  E RK+KE  +++ + E+D + R+++++K   + +    + +  K+ + +
Sbjct: 1761 HLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLV 1820

Query: 574  LLAAKE----------ERKDVQRENRKEIEGFLDNVRQITKEYSYQQA 611
            +  +KE          + K++Q E     E    N      E   ++A
Sbjct: 1821 INDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEA 1868



 Score = 29.7 bits (66), Expect = 7.0
 Identities = 32/184 (17%), Positives = 74/184 (40%), Gaps = 5/184 (2%)

Query: 412  KARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVI 471
            KA   +   +A   + +    E +   E+A     ++E          +     + ++  
Sbjct: 1337 KAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEA 1396

Query: 472  EDETDAELKRAQSDHQSMVEK--LELLQKKV--LVGGENLLEKAEIQQQLLDAAAQELEH 527
            + + + + K+A    ++   K   +  +KK       +   +KAE + +  D A ++ E 
Sbjct: 1397 KKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAE-EAKKADEAKKKAEE 1455

Query: 528  RKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRE 587
             K+ EE  KK  + K A+    + + +   +E++  +++ KK +     A E +K     
Sbjct: 1456 AKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA 1515

Query: 588  NRKE 591
             + E
Sbjct: 1516 KKAE 1519


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 43.2 bits (102), Expect = 5e-04
 Identities = 47/213 (22%), Positives = 92/213 (43%), Gaps = 33/213 (15%)

Query: 422 ALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKR 481
            LL   ++E+ EL+++L+E                EL+E       S++ E     EL  
Sbjct: 579 RLLRTRKEELEELRERLKELK----------KKLKELEERL-----SQLEELLQSLELSE 623

Query: 482 AQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQE 541
           A+++ +   E+LE           + LEK  +Q +L +     LE  ++K EEL+  I+ 
Sbjct: 624 AENELEEAEEELE-----------SELEKLNLQAELEELLQAALEELEEKVEELEAEIRR 672

Query: 542 KDAERIDMEEKYSSLQEESEGLSKKLKKVSSML--LAAKEERKDVQRENRKEIEGFLDNV 599
           +  +RI+ EE+     EE E L ++L+++   L  L  K    +   E  +  +  L+ +
Sbjct: 673 EL-QRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEEL 731

Query: 600 RQ----ITKEYSYQQAIIQKFIPTKYLELMTQN 628
           ++    + K     + + +K         + +N
Sbjct: 732 KKELEKLEKALELLEELREKLGKAGLRADILRN 764



 Score = 40.5 bits (95), Expect = 0.003
 Identities = 29/123 (23%), Positives = 58/123 (47%), Gaps = 3/123 (2%)

Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLE---KAEIQQQLLDAAAQELEHRKQ 530
           E   EL+R   + +  +E+LE L++++    E L       E  ++LL+      E  ++
Sbjct: 274 EELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEK 333

Query: 531 KEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRK 590
            EE+L+K   E +    +  E    L+E  + L ++L+++   L  A E  K ++   ++
Sbjct: 334 LEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQE 393

Query: 591 EIE 593
             E
Sbjct: 394 LKE 396



 Score = 38.2 bits (89), Expect = 0.018
 Identities = 29/113 (25%), Positives = 55/113 (48%), Gaps = 13/113 (11%)

Query: 478 ELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQL---LDAAAQELEHRKQKEEE 534
           +LK  +   + + EKLE L+ ++    E   E A++ ++    L+   +ELE   +K  E
Sbjct: 323 KLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALE 382

Query: 535 LKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRE 587
             K ++E   E          L+EE   LS  L+++   L   ++E ++++RE
Sbjct: 383 RLKQLEEAIQE----------LKEELAELSAALEEIQEELEELEKELEELERE 425



 Score = 36.7 bits (85), Expect = 0.043
 Identities = 39/181 (21%), Positives = 78/181 (43%), Gaps = 22/181 (12%)

Query: 418 DPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETD- 476
           +     L + ++ I ELK++L E                 L+E +      E+ E E + 
Sbjct: 378 EKALERLKQLEEAIQELKEELAEL-------------SAALEEIQ-----EELEELEKEL 419

Query: 477 AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK 536
            EL+R   + +  ++KLE    ++      + E A   ++      +  E  +++  EL 
Sbjct: 420 EELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELY 479

Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKL---KKVSSMLLAAKEERKDVQRENRKEIE 593
           +   E+  E +  E++ + L+EE E L K+L   ++    LL  +E  K+   E  +++E
Sbjct: 480 ELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLE 539

Query: 594 G 594
            
Sbjct: 540 N 540



 Score = 34.7 bits (80), Expect = 0.19
 Identities = 31/176 (17%), Positives = 76/176 (43%), Gaps = 15/176 (8%)

Query: 474 ETDAELKRAQSDHQSMVEKL---ELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQ 530
           E DA LK    + + ++++L   E  +K   +  E ++++A+ + + L+    EL   + 
Sbjct: 144 EFDAFLKSKPKERKEILDELFGLEKYEKLSELLKE-VIKEAKAKIEELEGQLSELL--ED 200

Query: 531 KEEELKKNIQEKDAER----IDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQR 586
            E+ L+   +E    +    I  E++   L++E E L ++L ++       +E +  +  
Sbjct: 201 IEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLE 260

Query: 587 ENRKEIE---GFLDNVRQITKEYSYQQAIIQKF--IPTKYLELMTQNATWNEDIGE 637
               E+E      + +R++ +     +  I++   +  +  EL  +       + E
Sbjct: 261 IESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEE 316



 Score = 30.5 bits (69), Expect = 3.8
 Identities = 47/238 (19%), Positives = 100/238 (42%), Gaps = 27/238 (11%)

Query: 392 NYEETISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPT 451
           + E  I+ L  A     +C +    ++ +  LL  Y+ E+ EL+++L          E  
Sbjct: 445 SKELMIAELAGAGEKCPVCGQEL-PEEHEKELLELYELELEELEEELSREKEEAELREEI 503

Query: 452 PNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKA 511
              E EL E E  +     +E+    EL+      ++++E+LE L++K+ +  + L E+ 
Sbjct: 504 EELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQL--QQLKEEL 561

Query: 512 EIQQQL------LDAAAQELEHRKQKEEELKKNIQEKDA----------------ERIDM 549
              +        L    + L  RK++ EEL++ ++E                   + +++
Sbjct: 562 RQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLEL 621

Query: 550 EEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQ--RENRKEIEGFLDNVRQITKE 605
            E  + L+E  E L  +L+K++      +  +  ++   E  +E+E  +    Q  + 
Sbjct: 622 SEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIEN 679



 Score = 30.1 bits (68), Expect = 4.9
 Identities = 35/189 (18%), Positives = 80/189 (42%), Gaps = 29/189 (15%)

Query: 423 LLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRA 482
            + K +++I++L+ K    +      E  P    EL E     E  + + +  + EL+  
Sbjct: 432 EIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPE-----EHEKELLELYELELEEL 486

Query: 483 Q------SDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK 536
           +       +   + E++E L+K++    E L+E  E+++ L +   ++LE  +   EEL+
Sbjct: 487 EEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELE 546

Query: 537 KNIQEKDAERIDMEEKYSS------------------LQEESEGLSKKLKKVSSMLLAAK 578
           +  ++   +++  E +                      +EE E L ++LK++   L   +
Sbjct: 547 ELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELE 606

Query: 579 EERKDVQRE 587
           E    ++  
Sbjct: 607 ERLSQLEEL 615


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 41.7 bits (99), Expect = 0.001
 Identities = 23/86 (26%), Positives = 50/86 (58%), Gaps = 4/86 (4%)

Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEE 533
           E + EL+  +++ Q + ++L  LQK+  +  +  LE  E +++ L+   +ELE ++Q+ E
Sbjct: 72  EFEKELRERRNELQKLEKRL--LQKEENL--DRKLELLEKREEELEKKEKELEQKQQELE 127

Query: 534 ELKKNIQEKDAERIDMEEKYSSLQEE 559
           + ++ ++E   E++   E+ S L  E
Sbjct: 128 KKEEELEELIEEQLQELERISGLTAE 153



 Score = 41.3 bits (98), Expect = 0.002
 Identities = 30/150 (20%), Positives = 74/150 (49%), Gaps = 18/150 (12%)

Query: 471 IEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAA--QELEHR 528
            E+E    L+ A+ + +++ ++  L  K+     E    + E +++L +     Q+LE R
Sbjct: 36  AEEEAKRILEEAKKEAEAIKKEALLEAKE-----EIHKLRNEFEKELRERRNELQKLEKR 90

Query: 529 -KQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDV--- 584
             QKEE L + ++  +    ++E+K   L+++ + L KK +++  ++    +E + +   
Sbjct: 91  LLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGL 150

Query: 585 -QRENRKEIEGFLDNVRQITKEYSYQQAII 613
              E ++ +   L+ V    +E  ++ A++
Sbjct: 151 TAEEAKEIL---LEKVE---EEARHEAAVL 174


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 40.1 bits (94), Expect = 0.001
 Identities = 27/126 (21%), Positives = 61/126 (48%), Gaps = 8/126 (6%)

Query: 490 VEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDM 549
           + K  +   +     E   E +E++QQL      EL+   +  EE    I E ++E  D+
Sbjct: 61  LLKAPVQSVRPQKLEELQGELSELKQQL-----SELQEELEDLEER---IAELESELEDL 112

Query: 550 EEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQ 609
           +E    L+E  + L ++L+ +   +    +E ++++++ R+E+E   + + ++ +     
Sbjct: 113 KEDLQLLRELLKSLEERLESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRL 172

Query: 610 QAIIQK 615
           Q  IQ+
Sbjct: 173 QEAIQE 178



 Score = 32.8 bits (75), Expect = 0.32
 Identities = 24/110 (21%), Positives = 49/110 (44%), Gaps = 7/110 (6%)

Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEH---- 527
            +E   EL   +     + E+LE L++++      L +  E  Q L +      E     
Sbjct: 74  LEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESL 133

Query: 528 ---RKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSML 574
               K+  +EL++  Q+   E  ++ E+   LQE  + L + ++++ S+L
Sbjct: 134 EESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLL 183



 Score = 29.7 bits (67), Expect = 3.4
 Identities = 25/140 (17%), Positives = 59/140 (42%), Gaps = 13/140 (9%)

Query: 397 ISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEP 456
              L        + +      +     L + ++++SEL+++LE+       EE       
Sbjct: 54  ALGLLCLLLKAPVQSVRPQKLEELQGELSELKQQLSELQEELED------LEE------- 100

Query: 457 ELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQ 516
            + E E  +E  +         LK  +   +S+ E ++ L K++    ++L E+ E  ++
Sbjct: 101 RIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKELAKELRELRQDLREEVEELRE 160

Query: 517 LLDAAAQELEHRKQKEEELK 536
            L+   + L+  ++  +EL+
Sbjct: 161 ELERLQENLQRLQEAIQELQ 180


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 41.1 bits (96), Expect = 0.002
 Identities = 34/165 (20%), Positives = 68/165 (41%), Gaps = 2/165 (1%)

Query: 431 ISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMV 490
            S  K+K +E       E           E     E     + +   E  + +   +   
Sbjct: 165 GSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEE 224

Query: 491 EKLELLQKKVLVGG--ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERID 548
           E L  L    L     + L E    +Q+ ++++ QELE  ++   ++ K  +E++ E+  
Sbjct: 225 ENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKL 284

Query: 549 MEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIE 593
            EE+   L +E E L  +L K+    +  +E+ K+ ++E +K  +
Sbjct: 285 QEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEK 329



 Score = 33.4 bits (76), Expect = 0.58
 Identities = 33/191 (17%), Positives = 74/191 (38%), Gaps = 17/191 (8%)

Query: 391 YNYEETISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEP 450
              EE +  L Y    ++  +  +     +   +   ++E+ + ++ L +          
Sbjct: 221 ELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLK------- 273

Query: 451 TPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK 510
                 E  E E   +  E        E +  +S+   +  +    ++K L   E  L+K
Sbjct: 274 ------ENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEK-LKESEKELKK 326

Query: 511 AEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV 570
            E +   L    +E+E  +++ +EL+   + ++ E   +E+    L++  E L  K K  
Sbjct: 327 LEKE---LKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLE 383

Query: 571 SSMLLAAKEER 581
           S  L +A + +
Sbjct: 384 SERLSSAAKLK 394



 Score = 31.1 bits (70), Expect = 2.6
 Identities = 26/145 (17%), Positives = 65/145 (44%), Gaps = 3/145 (2%)

Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQK 531
             +   EL R + +     ++    ++++L       ++ E +++L +   + L   K++
Sbjct: 239 RIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLA--KEE 296

Query: 532 EEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKE 591
           EE   + ++ +  +  D EEK    ++E + L K+LKK    +   ++E K+++ +   E
Sbjct: 297 EELKSELLKLERRKVDD-EEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAE 355

Query: 592 IEGFLDNVRQITKEYSYQQAIIQKF 616
            E      +   K    ++ ++ K 
Sbjct: 356 EEEEEQLEKLQEKLEQLEEELLAKK 380


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 40.4 bits (95), Expect = 0.003
 Identities = 53/222 (23%), Positives = 105/222 (47%), Gaps = 30/222 (13%)

Query: 424 LIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVI----------ED 473
           LIK + EI  LKK+LE+       ++     E +LDE E   E +E++           +
Sbjct: 534 LIKLKGEIKSLKKELEKLE---ELKKKLAELEKKLDELEE--ELAELLKELEELGFESVE 588

Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEE 533
           E +  LK  +  +   +E  +  ++      E   ++ +  ++ LD A +EL   +++ E
Sbjct: 589 ELEERLKELEPFYNEYLELKDAEKEL-----EREEKELKKLEEELDKAFEELAETEKRLE 643

Query: 534 ELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQR------E 587
           EL+K ++E   E+   EE+Y  L+EE   LS++L  + + L   ++ R+++++      E
Sbjct: 644 ELRKELEE--LEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKE 701

Query: 588 NRKEIEGFLDNVRQITKEYSYQQAIIQKFIPTKYLELMTQNA 629
             +E E     + ++ K     + + +K    KY  L+ + A
Sbjct: 702 ELEEREKAKKELEKLEKALERVEELREKV--KKYKALLKERA 741



 Score = 37.0 bits (86), Expect = 0.036
 Identities = 41/198 (20%), Positives = 89/198 (44%), Gaps = 25/198 (12%)

Query: 423 LLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRA 482
            L + ++EI+ ++++++E      R E       EL++    +E       E   E K  
Sbjct: 315 RLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERH----ELYEEAKAK 370

Query: 483 QSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR----KQKEEELKKN 538
           + + + + ++L  L  + L   E  LE+ E  ++ ++    ++  R    K++ +ELKK 
Sbjct: 371 KEELERLKKRLTGLTPEKL---EKELEELEKAKEEIEEEISKITARIGELKKEIKELKKA 427

Query: 539 IQE-KDAER--------IDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR 589
           I+E K A+         +  E +   L+E     + +LK++   L   +E+ + + R+  
Sbjct: 428 IEELKKAKGKCPVCGRELTEEHRKELLEE----YTAELKRIEKELKEIEEKERKL-RKEL 482

Query: 590 KEIEGFLDNVRQITKEYS 607
           +E+E  L    ++ K   
Sbjct: 483 RELEKVLKKESELIKLKE 500



 Score = 35.0 bits (81), Expect = 0.17
 Identities = 42/220 (19%), Positives = 92/220 (41%), Gaps = 47/220 (21%)

Query: 394 EETISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPN 453
           ++ I  L+ A     +C +  + ++ +  LL +Y  E+  ++K+L+E             
Sbjct: 425 KKAIEELKKAKGKCPVCGR-ELTEEHRKELLEEYTAELKRIEKELKEI------------ 471

Query: 454 PEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEI 513
                          E  E +   EL+         +EK+   + +++   E   +  E+
Sbjct: 472 ---------------EEKERKLRKELRE--------LEKVLKKESELIKLKELAEQLKEL 508

Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
           +++L     +ELE + ++ E+LK+ + +   E   ++++   L+E    L KKL ++   
Sbjct: 509 EEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEE----LKKKLAELEKK 564

Query: 574 LLAAKEERKDVQRENR-------KEIEGFLDNVRQITKEY 606
           L   +EE  ++ +E         +E+E  L  +     EY
Sbjct: 565 LDELEEELAELLKELEELGFESVEELEERLKELEPFYNEY 604



 Score = 33.5 bits (77), Expect = 0.48
 Identities = 25/100 (25%), Positives = 57/100 (57%), Gaps = 5/100 (5%)

Query: 508 LEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKL 567
           LEK E + + L+   +E+E  +++ E L+ + ++ + +  ++EE+   L++E E L +K+
Sbjct: 223 LEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKV 282

Query: 568 KKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYS 607
           K++  +     +E+ +   +  +  E +LD +R+I K  S
Sbjct: 283 KELKEL-----KEKAEEYIKLSEFYEEYLDELREIEKRLS 317



 Score = 30.8 bits (70), Expect = 3.1
 Identities = 25/93 (26%), Positives = 52/93 (55%), Gaps = 5/93 (5%)

Query: 477 AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK 536
            ELK+   + +  V++L+ L++K     E  ++ +E  ++ LD   +E+E R  + EE  
Sbjct: 269 EELKKEIEELEEKVKELKELKEKA----EEYIKLSEFYEEYLDEL-REIEKRLSRLEEEI 323

Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
             I+E+  E  + EE+   L+++ + L K+L++
Sbjct: 324 NGIEERIKELEEKEERLEELKKKLKELEKRLEE 356



 Score = 30.4 bits (69), Expect = 4.0
 Identities = 21/98 (21%), Positives = 47/98 (47%), Gaps = 1/98 (1%)

Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
           EN  +      + +    + LE   ++ E +++ I+EK+ E  ++  + + +  E   L 
Sbjct: 161 ENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELR 220

Query: 565 KKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQI 602
           ++L+K+   +    EE K+   E  KE+E    + R++
Sbjct: 221 EELEKLEKEVKEL-EELKEEIEELEKELESLEGSKRKL 257


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 39.8 bits (94), Expect = 0.005
 Identities = 25/103 (24%), Positives = 47/103 (45%), Gaps = 12/103 (11%)

Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
             L+   E  ++ L+  A+E E   ++ E+LK+          ++EEK   LQEE + L 
Sbjct: 519 NELIASLEELERELEQKAEEAEALLKEAEKLKE----------ELEEKKEKLQEEEDKLL 568

Query: 565 KKL-KKVSSMLLAAKEERKDVQRENR-KEIEGFLDNVRQITKE 605
           ++  K+    +  AK+E  ++ +E R  +  G+         E
Sbjct: 569 EEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIE 611



 Score = 30.6 bits (70), Expect = 3.8
 Identities = 34/128 (26%), Positives = 57/128 (44%), Gaps = 36/128 (28%)

Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQ-QQLLDAA----------- 521
           E +A LK A+   + + EK E LQ++     + LLE+AE + QQ +  A           
Sbjct: 538 EAEALLKEAEKLKEELEEKKEKLQEEE----DKLLEEAEKEAQQAIKEAKKEADEIIKEL 593

Query: 522 ------------AQELE--HRK------QKEEELKKNIQEKDAERIDMEEKYSSLQEESE 561
                       A EL    ++      +KE++ KK  ++++  ++  E KY SL ++ E
Sbjct: 594 RQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGE 653

Query: 562 GLSKKLKK 569
            LS    K
Sbjct: 654 VLSIPDDK 661


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 39.9 bits (94), Expect = 0.005
 Identities = 39/166 (23%), Positives = 72/166 (43%), Gaps = 20/166 (12%)

Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSD 485
           +YQ ++ EL+++LEE +      E     + + +E E     +E  E+E D ELK   +D
Sbjct: 352 RYQADLEELEERLEEQN------EVVEEADEQQEENE---ARAEAAEEEVD-ELKSQLAD 401

Query: 486 HQSMVEKLELLQKKVLV--GGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKD 543
           +Q   + L++ Q + +        LE+A   +QL        ++ +   EE +   QE  
Sbjct: 402 YQ---QALDVQQTRAIQYQQAVQALERA---KQLCGLPDLTADNAEDWLEEFQAKEQEAT 455

Query: 544 AERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR 589
            E + +E+K S  Q       +  + V    +A +  R +     R
Sbjct: 456 EELLSLEQKLSVAQAAHSQFEQAYQLVRK--IAGEVSRSEAWDVAR 499



 Score = 36.1 bits (84), Expect = 0.074
 Identities = 24/114 (21%), Positives = 48/114 (42%), Gaps = 1/114 (0%)

Query: 469  EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR 528
            E        +L++AQ+      + L  L+       + L E  +  Q L   A    E R
Sbjct: 998  EQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDLGVPADSGAEER 1057

Query: 529  -KQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEER 581
             + + +EL   +    + R  +E++ +  + E + L+KKL+K+       +E+ 
Sbjct: 1058 ARARRDELHARLSANRSRRNQLEKQLTFCEAEMDNLTKKLRKLERDYHEMREQV 1111



 Score = 31.1 bits (71), Expect = 2.3
 Identities = 30/149 (20%), Positives = 57/149 (38%), Gaps = 43/149 (28%)

Query: 478 ELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK--------------AEIQQQLLDAAAQ 523
           +L   Q     M  +L  L        E+ LE+              A  QQ+ ++    
Sbjct: 301 QLAAEQYRLVEMARELAEL-----NEAESDLEQDYQAASDHLNLVQTALRQQEKIERYQA 355

Query: 524 ELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKD 583
           +LE  +++ EE  + ++E D ++ + E +  + +EE + L  +L           ++  D
Sbjct: 356 DLEELEERLEEQNEVVEEADEQQEENEARAEAAEEEVDELKSQLADY--------QQALD 407

Query: 584 VQRENRKEIEGFLDNVRQITKEYSYQQAI 612
           VQ+                T+   YQQA+
Sbjct: 408 VQQ----------------TRAIQYQQAV 420


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 38.6 bits (90), Expect = 0.007
 Identities = 47/209 (22%), Positives = 84/209 (40%), Gaps = 12/209 (5%)

Query: 401 RYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDE 460
            Y     ++  K +  ++    L  K   E+ E  ++LEE +     EE     E E+DE
Sbjct: 30  EYEETALELEEKLKQEEEEAQLLEKK-ADELEEENRRLEEEAAA--SEEERERLEAEVDE 86

Query: 461 TEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDA 520
               V   E   ++ +AE ++ Q + +   E  E  ++++L   E               
Sbjct: 87  ATAEVAKLEEEREKKEAETRQLQQELREAQEAHERARQELL---EAAAAPTAPPHVAAPV 143

Query: 521 AAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEE 580
             ++LE     EE       + D +    EE+ +   E++E L  +L+ + S L AA++E
Sbjct: 144 NGEQLEPDDNGEEASADLETDPDMKDRSEEERVTYA-EKNERLQTQLQALKSELAAARDE 202

Query: 581 RK----DVQ-RENRKEIEGFLDNVRQITK 604
            K    D+   EN +        +RQI  
Sbjct: 203 SKETANDILHEENVRAGRDKYKTLRQIRS 231



 Score = 28.9 bits (65), Expect = 8.5
 Identities = 30/189 (15%), Positives = 79/189 (41%), Gaps = 26/189 (13%)

Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSD 485
           + ++E  EL++++E+              E ++   +  +E  E    E + +LK+ + +
Sbjct: 2   EAEREQQELEERMEQM-------------EEDMRRAQKELEEYEETALELEEKLKQEEEE 48

Query: 486 HQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAE 545
            Q     LE    ++      L E+A   ++  +    E++    +  +L++  ++K+AE
Sbjct: 49  AQ----LLEKKADELEEENRRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAE 104

Query: 546 RIDMEEKYSSLQEESEGLSKKLKKV------SSMLLAAKEERKDVQRENRKEIEGFLD-- 597
              ++++    QE  E   ++L +          + A     +    +N +E    L+  
Sbjct: 105 TRQLQQELREAQEAHERARQELLEAAAAPTAPPHVAAPVNGEQLEPDDNGEEASADLETD 164

Query: 598 -NVRQITKE 605
            +++  ++E
Sbjct: 165 PDMKDRSEE 173


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 38.1 bits (89), Expect = 0.007
 Identities = 19/76 (25%), Positives = 43/76 (56%), Gaps = 3/76 (3%)

Query: 516 QLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSK---KLKKVSS 572
           + L    QEL   +++  EL++ + E   E  +++++ S+L+ E E L K   ++K++S+
Sbjct: 66  ERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSA 125

Query: 573 MLLAAKEERKDVQREN 588
             +   EE ++++ E 
Sbjct: 126 NAIELDEENRELREEL 141



 Score = 36.1 bits (84), Expect = 0.030
 Identities = 24/91 (26%), Positives = 47/91 (51%), Gaps = 5/91 (5%)

Query: 479 LKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQL--LDAAAQELEHRKQKEEELK 536
           L   Q +   + E+L  LQ+++    E   E  E++Q+L  L+A  + L+    + ++L 
Sbjct: 68  LPELQQELAELQEELAELQEQL---AELQQENQELKQELSTLEAELERLQKELARIKQLS 124

Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKL 567
            N  E D E  ++ E+ + L++E+E L  + 
Sbjct: 125 ANAIELDEENRELREELAELKQENEALEAEN 155



 Score = 29.6 bits (67), Expect = 3.9
 Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 8/89 (8%)

Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKE 532
            E   +L   Q ++Q + ++L  L+ ++        E A I+Q  L A A EL+   +  
Sbjct: 83  AELQEQLAELQQENQELKQELSTLEAEL---ERLQKELARIKQ--LSANAIELDEENR-- 135

Query: 533 EELKKNIQEKDAERIDMEEKYSSLQEESE 561
            EL++ + E   E   +E +   LQE  +
Sbjct: 136 -ELREELAELKQENEALEAENERLQENEQ 163


>gnl|CDD|234013 TIGR02785, addA_Gpos, helicase-exonuclease AddAB, AddA subunit,
           Firmicutes type.  AddAB, also called RexAB, substitutes
           for RecBCD in several bacterial lineages. These DNA
           recombination proteins act before synapse and are
           particularly important for DNA repair of double-stranded
           breaks by homologous recombination. The term AddAB is
           used broadly, with AddA homologous between the
           Firmicutes (as modeled here) and the
           alphaproteobacteria, while the partner AddB proteins
           show no strong homology across the two groups of species
           [DNA metabolism, DNA replication, recombination, and
           repair].
          Length = 1230

 Score = 39.3 bits (92), Expect = 0.008
 Identities = 37/173 (21%), Positives = 69/173 (39%), Gaps = 22/173 (12%)

Query: 451 TPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK 510
           TPNPE  L+      E  EV E  T   LK  Q   + +  +LE LQ+K        L++
Sbjct: 180 TPNPEKWLNNL---AEAYEVKEKFTIESLKLQQQIKELLKNELEGLQEK--------LQR 228

Query: 511 AEIQQQLLDAAAQELEHRKQKEEELKKNIQEKD--AERIDMEEKYSSLQEESEGLSKKLK 568
           A       D  A  LE+ +   + + + IQE    A+  ++ +  ++ + +    + K  
Sbjct: 229 ALELFMAEDGLAPRLENFQLDLQNIDELIQESLAQADWNELRKAVAAFKFK----NLKAA 284

Query: 569 KVSSMLLAAKEERKDVQRENRKEIEG-----FLDNVRQITKEYSYQQAIIQKF 616
           K         EE   ++ E +K++E      F  +     +     + +++  
Sbjct: 285 KGDEEDADLLEEADKLREEAKKQLEKLKTDYFTRSEEDHLRIMQEMKPVVKTL 337


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 38.9 bits (91), Expect = 0.009
 Identities = 22/97 (22%), Positives = 44/97 (45%), Gaps = 6/97 (6%)

Query: 491 EKLELLQKKVLVGG---ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERI 547
              +L+++ +   G   +  LEK             E+E  + K  EL K I +   ++ 
Sbjct: 9   NNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKK 68

Query: 548 DMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDV 584
           D  E    +++E + L ++L ++S+ L A + E +D 
Sbjct: 69  DKIE---EIKKELKELKEELTELSAALKALEAELQDK 102



 Score = 29.3 bits (66), Expect = 8.2
 Identities = 20/93 (21%), Positives = 43/93 (46%), Gaps = 4/93 (4%)

Query: 524 ELEHRKQKEEELKKNIQEKDAER-IDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERK 582
           +L     K    +    + D E+ I ++++   L  E E L  K  ++S  +  AK ++K
Sbjct: 12  DLVKESLKA---RGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKK 68

Query: 583 DVQRENRKEIEGFLDNVRQITKEYSYQQAIIQK 615
           D   E +KE++   + + +++      +A +Q 
Sbjct: 69  DKIEEIKKELKELKEELTELSAALKALEAELQD 101


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 36.8 bits (86), Expect = 0.011
 Identities = 16/97 (16%), Positives = 50/97 (51%), Gaps = 12/97 (12%)

Query: 519 DAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAK 578
            A     +  +++ ++L+  +Q+K+ E   ++++   LQ+++  LS++ +K     L  K
Sbjct: 31  PAGKAAQKQLEKEFKKLQAELQKKEKE---LQKEEQKLQKQAATLSEEARKAKQQELQQK 87

Query: 579 EE---------RKDVQRENRKEIEGFLDNVRQITKEY 606
           ++         ++++Q++ ++ ++   D + +  KE 
Sbjct: 88  QQELQQKQQAAQQELQQKQQELLQPIYDKIDKAIKEV 124


>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 1104

 Score = 38.5 bits (89), Expect = 0.013
 Identities = 30/183 (16%), Positives = 62/183 (33%), Gaps = 15/183 (8%)

Query: 399 TLR-YASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPE 457
            L    +R+KK   +  V Q   +    K + E ++++            +        +
Sbjct: 287 ALPTVGNRIKKEEQETLVRQFTVEQTQAKSKVESAKIETDRAREMETLAHDNVKQIVGAQ 346

Query: 458 LDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQL 517
                   E +     +    +  A++   ++V+ L          G    E AE   +L
Sbjct: 347 HGILSAKREGAV----DKRRTISTARAGLDALVKGL----------GGAAPESAEELLEL 392

Query: 518 LDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAA 577
            +AA   ++      E L+   Q    +R    +++ +  +E   LSK    +   LL  
Sbjct: 393 NNAARLTVDEYPAAREALESAGQRNVEDRTRAVDEFKAADQELSSLSKGSSNIEYRLLQV 452

Query: 578 KEE 580
           +E 
Sbjct: 453 REN 455



 Score = 34.2 bits (78), Expect = 0.29
 Identities = 37/207 (17%), Positives = 74/207 (35%), Gaps = 20/207 (9%)

Query: 428 QKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVI---EDETDAELKRAQS 484
           Q++I+EL+ +LE  +            +  LD  +   +  E     E    A LK+   
Sbjct: 673 QRQIAELQARLERLTHTQ---SDIAIAKAALDAAQTRQKVLERQYQQEVTECAGLKKDLK 729

Query: 485 DHQSMVEKLELLQKKVLVGG-ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKD 543
               +  K+  + K+ + G  + L      Q             R +   +L+K I   +
Sbjct: 730 RAAMLSRKVHSIAKQGMTGALQALGAAHFPQVAPEQHDDIVDIERIEHRRQLQKRIDAVN 789

Query: 544 AERIDMEEK----YSSLQEESEG----LSKKLKKVSSMLLAAKEERKDVQRENRKEIEGF 595
           A    + E+     S  ++E       +  +L  +   L   +   +D   E     +  
Sbjct: 790 ARLRRLREEIIGRMSDAKKEDTAALSEVGAELDDIPEYLARLQTLTEDALPEFLARFQEL 849

Query: 596 L-----DNVRQITKEYSYQQAIIQKFI 617
           L     D V Q+     +++A+I++ I
Sbjct: 850 LNRSSDDGVTQLLSHLDHERALIEERI 876



 Score = 32.3 bits (73), Expect = 1.3
 Identities = 23/111 (20%), Positives = 43/111 (38%), Gaps = 1/111 (0%)

Query: 465 VEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENL-LEKAEIQQQLLDAAAQ 523
           V+     ED    ++ R Q ++  + E+   L     +   ++ L  A+ Q   L A  +
Sbjct: 625 VKAMLSREDFYMIKIMRQQGEYIKLQEQANALAHIQALNFASIDLPSAQRQIAELQARLE 684

Query: 524 ELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSML 574
            L H +      K  +      +  +E +Y     E  GL K LK+ + + 
Sbjct: 685 RLTHTQSDIAIAKAALDAAQTRQKVLERQYQQEVTECAGLKKDLKRAAMLS 735


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 36.4 bits (85), Expect = 0.013
 Identities = 21/96 (21%), Positives = 44/96 (45%), Gaps = 7/96 (7%)

Query: 512 EIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKL-KKV 570
             Q+QL     +     ++ E+EL+K  ++   +   + E         E   K+L KKV
Sbjct: 18  AAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSE------AAREKKEKELQKKV 71

Query: 571 SSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEY 606
                  ++ ++D+Q+  ++E++  LD + +  KE 
Sbjct: 72  QEFQRKQQKLQQDLQKRQQEELQKILDKINKAIKEV 107


>gnl|CDD|222469 pfam13949, ALIX_LYPXL_bnd, ALIX V-shaped domain binding to HIV.
           The binding of the LYPxL motif of late HIV p6Gag and
           EIAV p9Gag to this domain is necessary for viral
           budding.This domain is generally central between an
           N-terminal Bro1 domain, pfam03097 and a C-terminal
           proline-rich domain. The retroviruses thus used this
           domain to hijack the ESCRT system of the cell.
          Length = 292

 Score = 37.7 bits (88), Expect = 0.014
 Identities = 46/227 (20%), Positives = 87/227 (38%), Gaps = 45/227 (19%)

Query: 412 KARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVI 471
             R  + P   L  + ++E+ + +  LE+AS                             
Sbjct: 67  GERWTRPPSSELTAELREELRKYRGYLEQAS----------------------------- 97

Query: 472 EDETDAELKRAQSDHQSMVEKL----ELLQKKVLVGGENLLEKAEIQQQLLDAAAQELE- 526
             E+DA+L+    +H+  +E L    E L+  +     +   +   Q   L     +L  
Sbjct: 98  --ESDAQLRSKLEEHEENLELLSGPEEELEALLPSSSPSKTPEVSEQISRLRELLNKLNE 155

Query: 527 ---HRKQKEEELKKNIQEKDAERIDME-EKYSSLQEE--SEGLSKKLKKVSSMLLAAKEE 580
               R++  EELK+  Q+ D  ++ +   K  S  EE   E   KK   +   +     +
Sbjct: 156 LKAEREKLLEELKEKAQDDDISKLLIALNKLGSSNEEQLFEEELKKFDPLQERVEQNLSK 215

Query: 581 RKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKF--IPTKYLELM 625
           ++++ +E ++  E FL   R+   E   ++ ++QK      KY EL 
Sbjct: 216 QEELLKEIQEANEEFL-QERKSDSEQKEREKVLQKLENAYDKYKELS 261


>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
           This family consists of several plant proteins of
           unknown function. Several sequences in this family are
           described as being "myosin heavy chain-like".
          Length = 484

 Score = 37.7 bits (87), Expect = 0.023
 Identities = 51/229 (22%), Positives = 105/229 (45%), Gaps = 25/229 (10%)

Query: 419 PKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAE 478
            K+  L+  Q+E ++LKK+LE A      E+       EL+ T+  VE  ++  ++ + E
Sbjct: 33  VKETELMLAQEEANKLKKELEVA------EKEKLQVLKELESTKRTVEDLKLKLEKAEKE 86

Query: 479 LKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKN 538
            ++A+ D +    + E L++ +    E  +E+       LD+  +EL   +Q+ + L   
Sbjct: 87  EQQAKQDSELAKLRAEELEQGI---QELEVERYITATAELDSVKEELRKIRQEYDAL--- 140

Query: 539 IQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM----------LLAAKEERKDVQREN 588
           ++E+DA     EE   + +   + + +  K++ +M           L A+EER     E 
Sbjct: 141 VEERDAALKRAEEAICASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEEERIGAALEK 200

Query: 589 RKEIEGFLDNVRQITKEYSYQQAIIQKFIPTKYLELMTQNATWNEDIGE 637
            ++ E +   +++  KE    + + Q   P K LE + + +   E + +
Sbjct: 201 DQDRETYEKELKEAEKE---LERLKQDLDPEKDLEKLAEASAELESLQK 246



 Score = 29.6 bits (66), Expect = 6.2
 Identities = 44/176 (25%), Positives = 79/176 (44%), Gaps = 14/176 (7%)

Query: 428 QKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIE--DETDAELKRAQSD 485
           + E   +   LE+       E+     E EL+  +  ++P + +E   E  AEL+  Q +
Sbjct: 188 EAEEERIGAALEKDQDRETYEKELKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKE 247

Query: 486 HQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAA---QELEHRKQKEEELKKNIQEK 542
              M   +  + K++     NL + AE ++ L +      QELE  K++ EEL++   E 
Sbjct: 248 ISIM-ASVASVLKELEEAKANLEKAAEEEKSLRNLVESLKQELEEEKKELEELREKEGEA 306

Query: 543 DAERIDME--------EKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRK 590
           +     +E        EK S  +E+   +  KL++ SS    A++E +  + E RK
Sbjct: 307 EEAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRK 362



 Score = 29.2 bits (65), Expect = 8.9
 Identities = 40/155 (25%), Positives = 71/155 (45%), Gaps = 15/155 (9%)

Query: 444 PPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVG 503
            P R E     E EL   +   E +++ ++   AE ++ Q      +++LE  ++ V   
Sbjct: 24  LPQRSERVLVKETELMLAQ--EEANKLKKELEVAEKEKLQV-----LKELESTKRTV--- 73

Query: 504 GENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAER-IDMEEKYSSLQEESEG 562
            E+L  K E  ++    A Q+ E  K + EEL++ IQE + ER I    +  S++EE   
Sbjct: 74  -EDLKLKLEKAEKEEQQAKQDSELAKLRAEELEQGIQELEVERYITATAELDSVKEELRK 132

Query: 563 LSKKLKKVSSMLLAA---KEERKDVQRENRKEIEG 594
           + ++   +     AA    EE     + N K++E 
Sbjct: 133 IRQEYDALVEERDAALKRAEEAICASKVNEKKVEE 167


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 36.4 bits (85), Expect = 0.028
 Identities = 21/67 (31%), Positives = 31/67 (46%), Gaps = 3/67 (4%)

Query: 524 ELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLK---KVSSMLLAAKEE 580
           E+   K+ EE  +K + E   E   + E     +EE E L KKLK   K    L   K  
Sbjct: 35  EIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKAR 94

Query: 581 RKDVQRE 587
            K++++E
Sbjct: 95  LKELEKE 101



 Score = 34.1 bits (79), Expect = 0.13
 Identities = 28/116 (24%), Positives = 55/116 (47%), Gaps = 21/116 (18%)

Query: 487 QSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQ--EKD- 543
           +S+ E++  ++K       N    AEI Q+      + L+  +++ EEL+K ++  EKD 
Sbjct: 30  KSLKEEIAEMKKNEE---HNEKLMAEIAQENKRLV-EPLKKAEEEVEELRKKLKDYEKDK 85

Query: 544 -------AERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEI 592
                  A   ++E++  +L+ ESE L ++ +KV         ER ++  +    I
Sbjct: 86  QSLKNLKARLKELEKELKNLKWESEVLEQRFEKVE-------RERDELYDKFEAAI 134


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 35.3 bits (82), Expect = 0.028
 Identities = 30/112 (26%), Positives = 57/112 (50%), Gaps = 20/112 (17%)

Query: 515 QQLLDAAAQELEHR----KQKEEELKKN--------------IQEKDAERIDMEEKYSSL 556
           Q  LDA  +E E R    KQ+EEEL+K               ++E +A+R   E+K    
Sbjct: 13  QLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEE 72

Query: 557 QEESEGLSKKLKKVSSML--LAAKEERKDVQRENRKEIEGFLDNVRQITKEY 606
           ++  +   +++K++ + L  L A+ E+ + + E  +  E FL++V + + E+
Sbjct: 73  KKLRKEKEEEIKELKAELEELKAEIEKLEEKLEEYQPYEEFLESVVERSPEF 124


>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
           (TBPIP).  This family consists of several eukaryotic
           TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
           been demonstrated to interact with the human
           immunodeficiency virus type 1 (HIV-1) viral protein Tat,
           then modulate the essential replication process of HIV.
           In addition, TBP-1 has been shown to be a component of
           the 26S proteasome, a basic multiprotein complex that
           degrades ubiquitinated proteins in an ATP-dependent
           fashion. Human TBPIP interacts with human TBP-1 then
           modulates the inhibitory action of human TBP-1 on
           HIV-Tat-mediated transactivation.
          Length = 169

 Score = 35.7 bits (83), Expect = 0.029
 Identities = 19/84 (22%), Positives = 39/84 (46%), Gaps = 7/84 (8%)

Query: 529 KQKEEELKKNIQEK--DAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAK-----EER 581
            Q + EL  + +    D E  ++ E+   L+++   L  +LK ++S L   +     +E 
Sbjct: 62  NQDQFELPSDEELNKLDMEIEELREEVQLLKQDCSTLEIELKSLTSDLTTEELQEEIQEL 121

Query: 582 KDVQRENRKEIEGFLDNVRQITKE 605
           K   RE  +++E   +  + +T E
Sbjct: 122 KKEVREIEEKLESLEEGWKPVTPE 145


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 36.4 bits (85), Expect = 0.047
 Identities = 29/122 (23%), Positives = 56/122 (45%), Gaps = 16/122 (13%)

Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKE 532
           +E        + + +    +L+ L++++L   E L  K E     LD   + LE ++++ 
Sbjct: 58  EEVHKLRAELERELKERRNELQRLERRLLQREETLDRKME----SLDKKEENLEKKEKEL 113

Query: 533 EELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLA-AKEE-RKDVQRENRK 590
              +KN+ EK       EE+   L  E     ++L+++S +    AKE   ++V+ E R 
Sbjct: 114 SNKEKNLDEK-------EEELEELIAEQR---EELERISGLTQEEAKEILLEEVEEEARH 163

Query: 591 EI 592
           E 
Sbjct: 164 EA 165


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 36.1 bits (84), Expect = 0.060
 Identities = 32/182 (17%), Positives = 79/182 (43%), Gaps = 28/182 (15%)

Query: 393 YEETISTLRYASRVKKICNKARVNQDPKD------ALLIKYQKEISELKKKLEEASVPPP 446
           YE  I   +    ++ I +  + N + K       ALL   ++ + +  +KLEE      
Sbjct: 294 YESHIE--KALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNLQKLEE-----K 346

Query: 447 REEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGEN 506
            ++P+ + E E   T+     +++I+   +        +H   ++ L+  + K     + 
Sbjct: 347 LKDPSTSIELE-SITDLIESINDIIDAINELI-----REHNEKIDNLKKEKNKA----KK 396

Query: 507 LLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKK 566
            L    + +       ++++  +++++ L+K I   + E   +E +  +L++E + L K+
Sbjct: 397 KLWLHLVAE-----LKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEKQ 451

Query: 567 LK 568
           L 
Sbjct: 452 LT 453


>gnl|CDD|191413 pfam05970, PIF1, PIF1-like helicase.  This family includes
           homologues of the PIF1 helicase, which inhibits
           telomerase activity and is cell cycle regulated. This
           family includes a large number of largely
           uncharacterized plant proteins. This family includes a
           P-loop motif that is involved in nucleotide binding.
          Length = 364

 Score = 35.8 bits (83), Expect = 0.062
 Identities = 13/39 (33%), Positives = 22/39 (56%), Gaps = 4/39 (10%)

Query: 108 NSKQLDIYNETARPIVDKVLSGFNGTIFAYGQTGTGKTF 146
           N +Q  ++++    I++ V +   G  F YG  GTGKT+
Sbjct: 3   NDEQKSVFDQ----IIESVDNNLGGVFFVYGYGGTGKTY 37


>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein. 
          Length = 122

 Score = 34.2 bits (79), Expect = 0.063
 Identities = 31/122 (25%), Positives = 59/122 (48%), Gaps = 24/122 (19%)

Query: 478 ELKRAQSDHQSMVEKLELLQK------KVLVGGENLLEKAEIQ--QQLLDAAAQELEHRK 529
            L  AQ + +   EKLE LQ+      + L G    +  AE++  Q  + A  + +  ++
Sbjct: 6   NLAEAQQELEQAEEKLEELQQYRLEYRQQLSGSGQGISAAELRNYQAFISALDEAIAQQQ 65

Query: 530 QKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR 589
           Q+ E+ +K +++         E++    +E     KKL+K     L  +E++K+ + ENR
Sbjct: 66  QELEQAEKQVEQA-------REQWQEANQE----RKKLEK-----LLEREKKKEQKEENR 109

Query: 590 KE 591
           +E
Sbjct: 110 RE 111


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 34.9 bits (81), Expect = 0.068
 Identities = 22/91 (24%), Positives = 44/91 (48%), Gaps = 6/91 (6%)

Query: 515 QQLLDAAAQELEHRKQKEEELKKNIQ------EKDAERIDMEEKYSSLQEESEGLSKKLK 568
           +  L+   +ELE  KQ+  EL+  I+      E+  ER ++ E+   L++E + L  +L+
Sbjct: 68  KTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAELE 127

Query: 569 KVSSMLLAAKEERKDVQRENRKEIEGFLDNV 599
           K         E+ K+  +  ++    + DN+
Sbjct: 128 KYEKNDPERIEKLKEETKVAKEAANRWTDNI 158


>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region.  This family
           represents an N-terminal conserved region found in
           several huntingtin-associated protein 1 (HAP1)
           homologues. HAP1 binds to huntingtin in a polyglutamine
           repeat-length-dependent manner. However, its possible
           role in the pathogenesis of Huntington's disease is
           unclear. This family also includes a similar N-terminal
           conserved region from hypothetical protein products of
           ALS2CR3 genes found in the human juvenile amyotrophic
           lateral sclerosis critical region 2q33-2q34.
          Length = 307

 Score = 35.6 bits (82), Expect = 0.073
 Identities = 34/143 (23%), Positives = 61/143 (42%), Gaps = 21/143 (14%)

Query: 466 EPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKV-LVGGEN--LLEKA----------- 511
           E SE    E+     +  S       +LE LQ+K+ L+  EN  L  +A           
Sbjct: 136 EESEDESSESTPLRPQESSSSSHGCFQLEALQEKLKLLEEENEHLRSEASHLKTETVTYE 195

Query: 512 EIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLK--- 568
           E +QQL++   ++L     +   L + + +K  +    +E+ + L  +   L KK K   
Sbjct: 196 EKEQQLVNDCVKQLREANDQIASLSEELAKKTEDLERQQEEITHLLSQIVDLQKKCKSYA 255

Query: 569 ----KVSSMLLAAKEERKDVQRE 587
               ++   L AAK+ ++ +Q E
Sbjct: 256 LENEELQQHLAAAKDAQRQLQAE 278


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 35.4 bits (82), Expect = 0.079
 Identities = 23/130 (17%), Positives = 58/130 (44%), Gaps = 7/130 (5%)

Query: 476 DAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEEL 535
             EL+   S    + ++ + L++++    +   E  +     LD A ++L+   Q+    
Sbjct: 164 MKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIK 223

Query: 536 KKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGF 595
            K ++E        EE+   L+ + E L+ K  ++++ +  A+++ +  +    KEIE  
Sbjct: 224 VKKLEEL-------EEELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTFKEIEKL 276

Query: 596 LDNVRQITKE 605
            + ++ +   
Sbjct: 277 KEQLKLLQSL 286



 Score = 34.6 bits (80), Expect = 0.13
 Identities = 19/92 (20%), Positives = 46/92 (50%), Gaps = 9/92 (9%)

Query: 522 AQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV-----SSMLLA 576
            + LE  K+  + L K ++  ++ +  + ++  +L+EE   L K+L+        + L  
Sbjct: 150 DENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQL-KQLEDELEDCDPTELDR 208

Query: 577 AKEERKDVQREN---RKEIEGFLDNVRQITKE 605
           AKE+ K + +E     K++E   + ++++  +
Sbjct: 209 AKEKLKKLLQEIMIKVKKLEELEEELQELESK 240


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 35.0 bits (81), Expect = 0.096
 Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 9/108 (8%)

Query: 485 DHQSMVEKLELLQKKV---LVGGENLLEKAEIQQQLLDAAAQE--LEHRKQKEEELKKNI 539
           D   ++ + E L++KV   L G  N   K       L+ A ++  L   + +  +LK+ I
Sbjct: 28  DLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKEEI 87

Query: 540 QEK----DAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKD 583
           ++K    +  +  + ++ S L   S  L K+       L    +  + 
Sbjct: 88  EQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRTRS 135


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 35.0 bits (81), Expect = 0.10
 Identities = 53/223 (23%), Positives = 91/223 (40%), Gaps = 58/223 (26%)

Query: 405 RVKKICNKARVNQDPKDALLIKYQKEISEL---KKKLEEASVPPPREEPTPNPEPELD-- 459
           RV +    A+     +D   I+ Q+E  +    KK+ +   +  P+   TP P       
Sbjct: 11  RVVRGNTYAQEIIPIRDGDEIQRQQEARQRALMKKRTKR--IQNPKIMGTPPPVEGRKHI 68

Query: 460 --ETEPCVE------PSEVIEDETDAELKRAQS----------DHQSMVEKLEL------ 495
             +TEP +E      P    E +TD  L R  +          D ++ +E+ EL      
Sbjct: 69  DVQTEPYLEELTDKIPEADAETQTDFFLDRPPTPLYVPAKTGIDVETQIEEGELFDFDLE 128

Query: 496 ----LQKKVLVGGENLLEKA--EIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDM 549
               L+  VLVG    LE+A  E+ ++      +EL   +Q++ + ++    + AE   +
Sbjct: 129 VEPILE--VLVG--KTLEQALLEVLEE------EELAELRQQQRQFEQRRNAELAETQRL 178

Query: 550 EEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEI 592
           EE     +EE E   K+ K           ERK  ++E  ++I
Sbjct: 179 EEAERRRREEKERRKKQDK-----------ERKQREKETAEKI 210


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 35.2 bits (81), Expect = 0.11
 Identities = 32/140 (22%), Positives = 61/140 (43%), Gaps = 12/140 (8%)

Query: 468 SEVIEDETDAELKRAQSDHQSMV-EKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELE 526
           S ++    + E +  +   Q    +K EL Q +     E L+ + + Q+ L +   ++ E
Sbjct: 38  SRILNTLEEFEKEANEKRAQYRSAKKKELSQLE-----EQLINQKKEQKNLFNEQIKQFE 92

Query: 527 HRKQKE----EELKKNIQEKDAERIDMEEKYSSLQEE-SEGLSKKLKKVSSMLLAAKEER 581
              Q E    E L+    EKD E   +E++   L +E  + L    + +       K E 
Sbjct: 93  LALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEE 152

Query: 582 KDVQRENRKEIEGFLDNVRQ 601
           + ++ EN K++E  L+  R+
Sbjct: 153 R-LKFENEKKLEESLELERE 171


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 35.4 bits (82), Expect = 0.11
 Identities = 25/128 (19%), Positives = 50/128 (39%), Gaps = 4/128 (3%)

Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR 528
             I D     + +     +  ++ L + ++++L+  E L    E Q+ L      EL   
Sbjct: 798 SDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSE- 856

Query: 529 KQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEE---RKDVQ 585
            Q+ EE  +NI E   E ++ +     L +        +K V S     ++E   +K  +
Sbjct: 857 IQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKK 916

Query: 586 RENRKEIE 593
              + + E
Sbjct: 917 DLEKTDEE 924


>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 433 to 538 amino acids in length. This domain is
           found associated with pfam00616, pfam00168. This domain
           has two conserved sequence motifs: DLQ and PLSFQNP.
          Length = 489

 Score = 35.5 bits (81), Expect = 0.11
 Identities = 35/144 (24%), Positives = 64/144 (44%), Gaps = 22/144 (15%)

Query: 444 PPPREEPTPNPEPELDETEPCV-EPSEVIEDETDAELKRAQSDHQSMVE-KLELLQKKVL 501
               ++P+P     L+ T   V   S+  + E + E  R +       E ++  L++++ 
Sbjct: 321 AGHLQQPSPVNMSALERTAAWVLMNSQYEDIEYETEENREEGTQAEKYEQEIARLKERLR 380

Query: 502 VGGENLLEKAEIQQQLLDAAAQELEHR-KQKEEELKKNIQEKDAERIDMEEKYSSLQEES 560
           V    L               +E E R   +E++++K +QE  A   D EE+    QEE 
Sbjct: 381 VSVRRL---------------EEYERRLLGQEQQMQKLLQEYQARLEDSEERLRRQQEEK 425

Query: 561 EGLSKKLKKVSSMLLAAKEE-RKD 583
           +    ++K + S L+A +EE +KD
Sbjct: 426 DS---QMKSIISRLMAVEEELKKD 446


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 35.3 bits (82), Expect = 0.12
 Identities = 17/89 (19%), Positives = 42/89 (47%), Gaps = 10/89 (11%)

Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
             L ++ E+Q +    +    E ++Q+   L+    E + ++ ++E +   LQE++   S
Sbjct: 152 LTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETS 211

Query: 565 KKLKKVSSMLLAAKEERKDVQRENRKEIE 593
           ++           K++RK++  +  K +E
Sbjct: 212 QE----------RKQKRKEITDQAAKRLE 230



 Score = 29.9 bits (68), Expect = 5.5
 Identities = 20/109 (18%), Positives = 51/109 (46%), Gaps = 4/109 (3%)

Query: 444 PPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVG 503
            P    P  +PE  L   +  V   +   +    E  ++Q+  ++  ++L  L+      
Sbjct: 130 KPGPFVPPEDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAEL 189

Query: 504 GENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEK 552
            E   ++ E++ QL     +  E  ++++++ K+ I ++ A+R+++ E+
Sbjct: 190 EE---KQQELEAQLEQLQEKAAETSQERKQKRKE-ITDQAAKRLELSEE 234


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 34.6 bits (80), Expect = 0.13
 Identities = 23/120 (19%), Positives = 51/120 (42%), Gaps = 14/120 (11%)

Query: 505 ENLLEKAEIQQQLLDAAAQELEHRK------QKEEELKKNIQEKDAERIDMEEKYSSLQE 558
           ++LL   ++  +      +  E RK       + E L K ++  + E  D+E + S L+ 
Sbjct: 7   KSLLAIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLES 66

Query: 559 ESEGLSKKLKKVSSMLLAAKEER--------KDVQRENRKEIEGFLDNVRQITKEYSYQQ 610
           E + + +++K+    L A K+ER          + +E    +E  L  + +  ++   + 
Sbjct: 67  EIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEI 126



 Score = 31.2 bits (71), Expect = 1.4
 Identities = 38/210 (18%), Positives = 93/210 (44%), Gaps = 28/210 (13%)

Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVL-VGGENLLEKAEIQ---------QQLLDAA 521
             E    LK+A+++ +++ + LE L+ ++  +  +    ++EIQ         ++ L A 
Sbjct: 26  IKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAV 85

Query: 522 --AQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKE 579
              +EL     + +  K+ I   + E  ++ E+   L++E E L ++L++          
Sbjct: 86  KDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLER---------- 135

Query: 580 ERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFIPTKYLELMTQNATWNEDIGEWE 639
             K++  E    +E  +  +R+  +E S ++  +++ +  + L    +     + +G   
Sbjct: 136 LEKNLA-EAEARLEEEVAEIREEGQELSSKREELKEKLDPELLSEYERIRKNKKGVGVVP 194

Query: 640 LKCVAYTGNNMRKIVPLPEDTTATQIAQPD 669
           L+     G +M+    LP   T +++ + D
Sbjct: 195 LEGRVCGGCHMK----LP-SQTLSKVRKKD 219


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 34.3 bits (79), Expect = 0.15
 Identities = 27/153 (17%), Positives = 61/153 (39%), Gaps = 16/153 (10%)

Query: 460 ETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLL------EKAEI 513
           ++E        +   + A      S +   +++++L +++     EN L       + ++
Sbjct: 46  DSESSSNSVPSLSLSSTASSLSDSSTYSRSLKEVKL-ERQAQEAYENWLSAKQAQRQKKL 104

Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKK-----LK 568
           Q+ L +   QE E  +++ E  ++  +EK  E    + +    Q   +   K        
Sbjct: 105 QKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQ----QAAKQRTPKHKKEAAES 160

Query: 569 KVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQ 601
             SS+  +AK ER   Q E +K ++ +     +
Sbjct: 161 ASSSLSGSAKPERNVSQEEAKKRLQEWELKKLK 193


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 34.5 bits (80), Expect = 0.15
 Identities = 25/131 (19%), Positives = 68/131 (51%), Gaps = 6/131 (4%)

Query: 469 EVIEDETDAELKRAQSDHQSMVEK--LELLQKKVLVGGENLLEKA---EIQQQLLDAAAQ 523
              E+E + + K  + + ++++++   E  +++     E L E+    EI +++ +    
Sbjct: 55  LAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEA 114

Query: 524 ELEHRKQKEEELKKNIQEKDAERIDM-EEKYSSLQEESEGLSKKLKKVSSMLLAAKEERK 582
           E + +++K+++L++ I E + ERI+  EE+    +EE   + +  ++ +      + ER+
Sbjct: 115 EAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREAERR 174

Query: 583 DVQRENRKEIE 593
           + + E  +E+ 
Sbjct: 175 ERKEEKEREVA 185



 Score = 33.0 bits (76), Expect = 0.57
 Identities = 28/137 (20%), Positives = 58/137 (42%), Gaps = 17/137 (12%)

Query: 477 AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQE---LEHRKQKEE 533
           AE K  +     M+E+  L         E   E+   +++    A  +    E  K+++E
Sbjct: 34  AEEKEEERRIDEMMEEERLKALAE----EEERERKRKEERREGRAVLQEQIEEREKRRQE 89

Query: 534 ELKKNIQEKDA-----ERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQREN 588
           E ++ +QE++      ERI  E+     + E++   +K KK+   +    EER + + E 
Sbjct: 90  EYEERLQEREQMDEIIERIQEED-----EAEAQEKREKQKKLREEIDEFNEERIERKEEE 144

Query: 589 RKEIEGFLDNVRQITKE 605
           ++        + +  +E
Sbjct: 145 KEREREEELKILEYQRE 161



 Score = 29.1 bits (66), Expect = 7.9
 Identities = 24/120 (20%), Positives = 50/120 (41%), Gaps = 20/120 (16%)

Query: 476 DAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEEL 535
           D  ++R Q + +   E  E  +K+          + EI +   +   ++ E ++++ EE 
Sbjct: 102 DEIIERIQEEDE--AEAQEKREKQ-------KKLREEIDEFNEERIERKEEEKEREREEE 152

Query: 536 KKNIQ---EKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEI 592
            K ++   EK     + E +    +EE E    +        L A++E  + +RE   E+
Sbjct: 153 LKILEYQREKAEREEEREAERRERKEEKEREVAR--------LRAQQEEAEDEREELDEL 204


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 605

 Score = 34.8 bits (80), Expect = 0.15
 Identities = 36/207 (17%), Positives = 84/207 (40%), Gaps = 24/207 (11%)

Query: 434 LKKKLEEASVPPPREEPTPNPEP---ELDETEPCVEPSEVIEDETDAEL-----KRAQSD 485
           L+ K  E S+ P ++    +      +++  E   + S   +     ++     K  Q++
Sbjct: 368 LENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNN 427

Query: 486 HQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAE 545
            +  ++K ELL+K+ +   E + +  E+   L + A Q+      K+ EL KN+ +K  +
Sbjct: 428 KEQDIKKEELLEKEFVKKSEKIPKNDELLDNL-ELAKQKF---FNKDIELSKNMLQKFNK 483

Query: 546 RIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKE 605
             + E   +S   +    +  +  VS   L    E             G ++ + ++ + 
Sbjct: 484 FKNEESAENSF-IDIIKQADLVLWVSPNALILGVED-----------LGLIELINKVNES 531

Query: 606 YSYQQAIIQKFIPTKYLELMTQNATWN 632
           + + + I +KF   K +  +++     
Sbjct: 532 FEFIKTIFKKFNKNKLVIAISKKQATK 558


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 35.1 bits (81), Expect = 0.16
 Identities = 42/229 (18%), Positives = 84/229 (36%), Gaps = 30/229 (13%)

Query: 392 NYEETISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPT 451
              E   TL    +  +   KAR+ +D +   L      +++ ++  ++ S     ++  
Sbjct: 744 KLYEFAYTLEEEIQYLEDLLKARLEKDYQSKYLDLI-DNLNDAREIKDKESKAIDLDDID 802

Query: 452 PNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKL-ELLQKKVLVGGENLLEK 510
                   E E   +    I D     L+    D     E L EL+++ +    E+L +K
Sbjct: 803 -------FELELIGKQEINI-DYILELLQTFN-DKNGAYESLKELIERIIKEWIEDLRQK 853

Query: 511 AEIQQQLLDAAAQ-------------ELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQ 557
            ++ ++L++A  Q             EL    +KEEE K+  +E   E ++ EE      
Sbjct: 854 KKLIERLIEAINQYRAKKLDTAEKLEELYILAKKEEEFKQFAEE---EGLNEEELAFYDD 910

Query: 558 EESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEY 606
               G          +   AKE+    + ++  + +   + V    K +
Sbjct: 911 LALNGGKLPENGTELVEKLAKEKSLREKNKDDWKAK---EEVEAKLKAF 956


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
           the bacterial cell cycle, the tubulin-like cell-division
           protein FtsZ polymerises into a ring structure that
           establishes the location of the nascent division site.
           EzrA modulates the frequency and position of FtsZ ring
           formation.
          Length = 559

 Score = 34.8 bits (81), Expect = 0.16
 Identities = 37/158 (23%), Positives = 71/158 (44%), Gaps = 31/158 (19%)

Query: 424 LIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEP--------SEVIEDET 475
           L   +++I +    LEE                +LDE E   E          +++E E 
Sbjct: 254 LQDLKEQIDQNLALLEEL---------------DLDEAEEENEEIEERIDTLYDILEKEV 298

Query: 476 DAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEEL 535
            A+ K  + +   + + LE  +++     + LL + +  QQ       ELE  ++ E++L
Sbjct: 299 KAK-KFVEKNIDKLTDFLEHAREQN----KQLLLELDRLQQSYTLNEDELETVRELEKQL 353

Query: 536 KK--NIQEKDAERIDMEEK-YSSLQEESEGLSKKLKKV 570
           ++     ++  ERI  ++  YS LQEE E + K+L+++
Sbjct: 354 EELEAQYDQLVERIAEKKVAYSELQEELEEIEKQLEEI 391


>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain
           pfam08565. Expression of a construct consisting of only
           the N-terminal domain of Saccharomyces pombe Cdc37
           results in cellular viability. This indicates that
           interactions with the cochaperone Hsp90 may not be
           essential for Cdc37 function.
          Length = 172

 Score = 33.6 bits (77), Expect = 0.18
 Identities = 34/129 (26%), Positives = 51/129 (39%), Gaps = 18/129 (13%)

Query: 523 QELEHRKQKEEELKKNIQEKDA--ERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEE 580
           +  E RKQ+ EEL+K     D   ER+D  +  S L+EES   S+ + K  +     KE 
Sbjct: 39  ERREQRKQEIEELEKERIMNDRLLERVD--KLLSELKEESLDSSQAVMKSLNENFTDKEN 96

Query: 581 RKDVQRENRKEIEGFLDNVRQITKEYSYQQAI------------IQKFIPTKYLELMTQN 628
            +  Q    + +E   D V+    E +    I             QK +  K  EL  + 
Sbjct: 97  VEPEQPTYNEMVEDLFDQVKDEVDEKNGAALIEELQKHRDKLKKEQKELLKKLDELEKEE 156

Query: 629 AT--WNEDI 635
               W+ D 
Sbjct: 157 KKKIWSVDT 165


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 35.0 bits (80), Expect = 0.19
 Identities = 29/144 (20%), Positives = 64/144 (44%), Gaps = 12/144 (8%)

Query: 474 ETDAELKRAQSDHQSMV-------EKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELE 526
           +T+AEL+   SD QS +       +  E   KK     + +L  A  +Q ++D   +E+ 
Sbjct: 688 QTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIP 747

Query: 527 HRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQR 586
             + K +++ ++IQ     + D+EE+ + L   +    ++  KV    +   E  +   +
Sbjct: 748 ELRNKLQKVNRDIQR---LKNDIEEQETLL--GTIMPEEESAKVCLTDVTIMERFQMELK 802

Query: 587 ENRKEIEGFLDNVRQITKEYSYQQ 610
           +  ++I      ++    + + QQ
Sbjct: 803 DVERKIAQQAAKLQGSDLDRTVQQ 826



 Score = 30.0 bits (67), Expect = 5.1
 Identities = 36/163 (22%), Positives = 64/163 (39%), Gaps = 4/163 (2%)

Query: 428 QKEISELKKKLEEASVPPPREEPT-PNPEPELDETEPCVEPSEVIEDETDAELKRAQSDH 486
           +KEI EL+ KL++ +    R +      E  L    P  E ++V    TD  +       
Sbjct: 743 EKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCL--TDVTIMERFQME 800

Query: 487 QSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAER 546
              VE+    Q   L G +      ++ Q+  +    EL+    K E  +K IQ++  + 
Sbjct: 801 LKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQ-HELDTVVSKIELNRKLIQDQQEQI 859

Query: 547 IDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR 589
             ++ K + L+ E   +   L++         E   +VQ   R
Sbjct: 860 QHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIR 902


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 34.6 bits (79), Expect = 0.19
 Identities = 22/131 (16%), Positives = 53/131 (40%), Gaps = 4/131 (3%)

Query: 510 KAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
           KA I Q  L+   +E     +K   L+ ++ +  +    +E       E++E L ++ K+
Sbjct: 137 KATILQNQLEGCQRERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKE 196

Query: 570 VSSMLLAAKEERKDVQR---ENRKEIEGFLDNVRQITKEYSYQQAIIQKFIPTKYLELMT 626
             S   +   ER  +     EN++ I     +++ +T++      +++     +  EL  
Sbjct: 197 AQSSSESMSAERNALLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEA-ELER 255

Query: 627 QNATWNEDIGE 637
                 + + +
Sbjct: 256 MKGELKQRLKK 266



 Score = 34.2 bits (78), Expect = 0.25
 Identities = 19/129 (14%), Positives = 61/129 (47%), Gaps = 9/129 (6%)

Query: 491 EKLELLQKKVLVGGENLLEKAEIQQ--QLLDAAAQELEHRKQKEEELKKNIQEKDAER-- 546
           E+ +L++K + + G+    ++ I+Q    L  + ++ E  +++ +E + + +   AER  
Sbjct: 151 ERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSSESMSAERNA 210

Query: 547 -----IDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQ 601
                 + +++   L+++ + L++K ++   +L   ++   +++R   +  +       Q
Sbjct: 211 LLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTIQ 270

Query: 602 ITKEYSYQQ 610
              E + + 
Sbjct: 271 RRDEETERI 279


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 34.3 bits (79), Expect = 0.19
 Identities = 36/191 (18%), Positives = 86/191 (45%), Gaps = 22/191 (11%)

Query: 402 YASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDET 461
           Y    +K  N+  +      +L    ++EI  L+KK + + + P  E        ++ E 
Sbjct: 92  YRELKEKR-NEFNLGGRSIKSL----EREIERLEKKQQTSVLTPEEEREL---VQKIKEL 143

Query: 462 EPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKA-EIQQQLLDA 520
              +E ++   +E + +LK  +++   + +K   + +K+    + L  +A E  ++++  
Sbjct: 144 RKELEDAKKALEE-NEKLKELKAEIDELKKKAREIHEKI----QELANEAQEYHEEMIKL 198

Query: 521 AAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEE 580
                    ++ +EL+K   E   E +++ +K   L EE   L  +L+++   + A + +
Sbjct: 199 --------FEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRAK 250

Query: 581 RKDVQRENRKE 591
            K  +R  ++E
Sbjct: 251 EKAAKRREKRE 261


>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN.  All proteins in this
           family for which functions are known are ATP binding
           proteins involved in the initiation of recombination and
           recombinational repair [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 563

 Score = 34.3 bits (79), Expect = 0.21
 Identities = 32/114 (28%), Positives = 50/114 (43%), Gaps = 11/114 (9%)

Query: 482 AQSDHQSMVEKLELLQKKVLVGGEN-LLEKAEIQQQLLDAAAQELEHRKQKEEE------ 534
            Q D Q +    E  Q      G N  ++      Q    A Q+L+ R+QKE+E      
Sbjct: 133 GQHDQQLLFRPDEQRQLLDTFAGANEKVKAYRELYQAWLKARQQLKDRQQKEQELAQRLD 192

Query: 535 -LKKNIQEKDAERIDMEEKYSSLQEESEGLSK--KLKKVSSMLLAAKEERKDVQ 585
            L+  ++E +   +   E   +L+ E + LS   KL+++S   LAA     DVQ
Sbjct: 193 FLQFQLEELEEADLQPGE-DEALEAEQQRLSNLEKLRELSQNALAALRGDVDVQ 245


>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
           outer membrane].
          Length = 170

 Score = 33.1 bits (76), Expect = 0.23
 Identities = 18/99 (18%), Positives = 45/99 (45%), Gaps = 17/99 (17%)

Query: 520 AAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEE-----------SEGLSKKLK 568
             + +LE   +K ++  + +Q+      +++ K + LQ++           +E   KK K
Sbjct: 44  KVSADLESEFKKRQKELQKMQK------ELKAKEAKLQDDGKMEALSDRAKAEAEIKKEK 97

Query: 569 KVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYS 607
            V++     +E  KD+ R   +E +  L+ +++  +  +
Sbjct: 98  LVNAFNKKQQEYEKDLNRREAEEEQKLLEKIQRAIESVA 136


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 34.3 bits (79), Expect = 0.28
 Identities = 21/98 (21%), Positives = 37/98 (37%), Gaps = 14/98 (14%)

Query: 518 LDAAAQELEHRKQKE--------EELKKNIQE--KDAERIDMEEKYSSLQEESEGLSKKL 567
           L+   Q+++   Q           EL++ + E  K    I  EE+ + L EE      +L
Sbjct: 378 LEVLIQQVKRELQDAKSQLLKELRELEEELAEVDKKISTIPSEEQIAQLLEELGEAQNEL 437

Query: 568 KKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKE 605
            +      A  EE        ++ IE     + + TK+
Sbjct: 438 FRSE----AEIEELLRQLETLKEAIEALRKTLDEKTKQ 471


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 33.8 bits (78), Expect = 0.29
 Identities = 29/143 (20%), Positives = 52/143 (36%), Gaps = 25/143 (17%)

Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK-----AEIQQQLLDAAAQELE 526
           E+  + +L    S    + +KL+ L++++       L +            L+ A QEL 
Sbjct: 163 EELLNKDLNLINSIKPKLRKKLQALKEEI-----ASLRQLADELNLCDPLELEKARQELR 217

Query: 527 HRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQR 586
               K  E +K ++E   E  ++     +L  +            S LL    E + ++ 
Sbjct: 218 SLSVKISEKRKQLEELQQELQELTIAIEALTNK-----------KSELLEEIAEAEKIRE 266

Query: 587 ENR----KEIEGFLDNVRQITKE 605
           E R    KEI      V  + K 
Sbjct: 267 ECRGWSAKEISKLKAKVSLLQKL 289



 Score = 30.0 bits (68), Expect = 4.1
 Identities = 21/99 (21%), Positives = 47/99 (47%), Gaps = 9/99 (9%)

Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV--- 570
            + L +   + LE  K+ EE L K++   ++ +  + +K  +L+EE   L ++L      
Sbjct: 145 LEGLKEGLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASL-RQLADELNL 203

Query: 571 --SSMLLAAKEERKDVQRE---NRKEIEGFLDNVRQITK 604
                L  A++E + +  +    RK++E     ++++T 
Sbjct: 204 CDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTI 242


>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
           unknown].
          Length = 833

 Score = 33.8 bits (77), Expect = 0.32
 Identities = 31/179 (17%), Positives = 58/179 (32%), Gaps = 28/179 (15%)

Query: 413 ARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIE 472
           A+      D     YQ++I + + +    +    R +           T           
Sbjct: 448 AKAKILQADKAAKAYQEDILQREAQSRGKTAAAERSQEQ--------MTA---------A 490

Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVL---VGGENLLEKAEIQQQLLDAAAQELEHRK 529
            +     ++  +D     EK     +K L      +  L K E +Q+ L      LEH K
Sbjct: 491 LKALLAFQQQIADLSGAKEKAS--DQKSLLWKAEEQYALLKEEAKQRQLQEQKALLEH-K 547

Query: 530 QKEEELKKNIQEKDAERIDMEEK-----YSSLQEESEGLSKKLKKVSSMLLAAKEERKD 583
           ++  E    + E   ++ D  E       S  +  S+   + L + ++ L  A  E   
Sbjct: 548 KETLEYTSQLAELLDQQADRFELSAQAAGSQKERGSDLYREALAQNAAALNKALNELAA 606


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 34.2 bits (78), Expect = 0.33
 Identities = 33/211 (15%), Positives = 79/211 (37%), Gaps = 7/211 (3%)

Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELD-ETEPCVEPSEVIEDETDAELKRAQS 484
           K Q+ + +L+ ++EE        E T         +  P     + +  + + + +R  +
Sbjct: 257 KKQQLLKQLRARIEELRAQEAVLEET-QERINRARKAAPLAAHIKAVT-QIEQQAQRIHT 314

Query: 485 DHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDA 544
           + QS +     L  K     +      E ++ L    +QE+  R   E     +I+E   
Sbjct: 315 ELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVA--TSIREISC 372

Query: 545 ERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEE--RKDVQRENRKEIEGFLDNVRQI 602
           ++  + +   +LQ++   L++KL+ +   L   + E    D +    ++++G L + ++ 
Sbjct: 373 QQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQ 432

Query: 603 TKEYSYQQAIIQKFIPTKYLELMTQNATWNE 633
            +       +    I         +     E
Sbjct: 433 QELQQRYAELCAAAITCTAQCEKLEKIHLQE 463



 Score = 32.6 bits (74), Expect = 0.76
 Identities = 23/166 (13%), Positives = 48/166 (28%), Gaps = 27/166 (16%)

Query: 467 PSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELE 526
            +  ++D T+   +            L  LQ +      +L +     QQ     A +L 
Sbjct: 592 ITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQ-----DLQDVRLHLQQCSQELALKLT 646

Query: 527 HRKQKEEELKKNIQEKDAERI----------------DMEEKYSSLQEESEGLSKKLKKV 570
                +  L +    + A  I                 M+ +   L    E L++    +
Sbjct: 647 ALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKEMLAQCQTLL 706

Query: 571 SSMLLAAKEERKDVQREN------RKEIEGFLDNVRQITKEYSYQQ 610
             +    +E  ++             ++    D + Q  KE  +Q 
Sbjct: 707 RELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMHQA 752



 Score = 31.5 bits (71), Expect = 2.0
 Identities = 25/131 (19%), Positives = 53/131 (40%), Gaps = 3/131 (2%)

Query: 485 DHQSMVEKLELLQKKVLVGGENLL-EKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKD 543
           D  + +  +E  +KK L G   LL  ++++              RKQ  E+  K+++E  
Sbjct: 176 DQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREAL 235

Query: 544 AERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQIT 603
            +        +  +E  E   KK + +  +   A+ E    Q    +E +  ++  R+  
Sbjct: 236 QQTQQSHAYLTQKREAQEEQLKKQQLLKQLR--ARIEELRAQEAVLEETQERINRARKAA 293

Query: 604 KEYSYQQAIIQ 614
              ++ +A+ Q
Sbjct: 294 PLAAHIKAVTQ 304


>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
           division and chromosome partitioning].
          Length = 570

 Score = 33.9 bits (78), Expect = 0.33
 Identities = 41/204 (20%), Positives = 82/204 (40%), Gaps = 10/204 (4%)

Query: 425 IKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEP-SEVIEDETDAELKRAQ 483
           +     +  LK++L E S    + E     E EL   +  +E   +++E E +A+    +
Sbjct: 251 VNIDSRLERLKEQLVENSELLTQLE-LDEAEEELGLIQEKIESLYDLLEREVEAK-NVVE 308

Query: 484 SDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK---KNIQ 540
            +   + + LE  ++      E+L E+ E  ++    A  EL   ++ E+ELK     + 
Sbjct: 309 ENLPILPDYLEKAKEN----NEHLKEEIERVKESYRLAETELGSVRKFEKELKELESVLD 364

Query: 541 EKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVR 600
           E        E  YS LQ+  E + K L  +       +E    ++++  +  E       
Sbjct: 365 EILENIEAQEVAYSELQDNLEEIEKALTDIEDEQEKVQEHLTSLRKDELEARENLERLKS 424

Query: 601 QITKEYSYQQAIIQKFIPTKYLEL 624
           ++ +   Y +      +P  +L L
Sbjct: 425 KLHEIKRYMEKSNLPGLPETFLSL 448


>gnl|CDD|206243 pfam14073, Cep57_CLD, Centrosome localisation domain of Cep57.  The
           CLD or centrosome localisation domain of Cep57 is found
           at the N-terminus, and lies approximately between
           residues 58 and 239. This region lies within the first
           alpha-helical coiled-coil segment of Cep57, and
           localises to the centrosome internally to gamma-tubulin,
           suggesting that it is either on both centrioles or on a
           centromatrix component. This N-terminal region can also
           multimerise with the N-terminus of other Cep57
           molecules. The C-terminal part, Family Cep57_MT_bd,
           pfam06657, is the microtubule-binding region of Cep57.
          Length = 178

 Score = 32.9 bits (75), Expect = 0.35
 Identities = 24/74 (32%), Positives = 39/74 (52%), Gaps = 8/74 (10%)

Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQK 531
           +     E +  Q D Q+ +EKLE+L+K+ L        +    Q L +   Q+LE + Q+
Sbjct: 101 QVSLQREKEHDQMDVQAQLEKLEVLEKEYL--------RLTRTQSLAETKIQQLEEKLQE 152

Query: 532 EEELKKNIQEKDAE 545
           EE  +K +Q+K AE
Sbjct: 153 EEHQRKLVQDKAAE 166


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 33.0 bits (75), Expect = 0.38
 Identities = 34/169 (20%), Positives = 60/169 (35%), Gaps = 15/169 (8%)

Query: 433 ELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEK 492
            LK+K       PP E        E +      +  + +E+E   +L   QS  + +V  
Sbjct: 66  GLKEKSTSEPTVPPEEAEPH---AEEEGQLAVRKTKQKVEEEVKEQL---QSLLEKIVVS 119

Query: 493 LELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAE---RIDM 549
            +  ++        L E   +  +  D   + LE  K  EE       E+ A    + DM
Sbjct: 120 KQ--EEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDM 177

Query: 550 EEKYSSLQEESEGLSKK----LKKVSSMLLAAKEERKDVQRENRKEIEG 594
           +EK S  + E      +     K  +  +     + +D   E+ K I+ 
Sbjct: 178 KEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKE 226



 Score = 32.6 bits (74), Expect = 0.58
 Identities = 30/150 (20%), Positives = 56/150 (37%), Gaps = 16/150 (10%)

Query: 417 QDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELD--------------ETE 462
           Q        K ++E+ E  + L E  V   +EE  P  EP+LD              ET 
Sbjct: 91  QLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETL 150

Query: 463 PCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAA 522
              +  E  ED    E   ++   Q M EK    +++     E + +    + +  D   
Sbjct: 151 EAGKVHEETEDSYHVEETASEQYKQDMKEKAS--EQENEDSKEPVEKAERTKAETDDVTE 208

Query: 523 QELEHRKQKEEELKKNIQEKDAERIDMEEK 552
           ++ +      E+ K   +E   E ++ +++
Sbjct: 209 EDYDEEDNPVEDSKAIKEELAKEPVEEQQE 238


>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16).  Autophagy is a
           ubiquitous intracellular degradation system for
           eukaryotic cells. During autophagy, cytoplasmic
           components are enclosed in autophagosomes and delivered
           to lysosomes/vacuoles. ATG16 (also known as Apg16) has
           been shown to be bind to Apg5 and is required for the
           function of the Apg12p-Apg5p conjugate in the yeast
           autophagy pathway.
          Length = 194

 Score = 32.9 bits (75), Expect = 0.40
 Identities = 31/128 (24%), Positives = 65/128 (50%), Gaps = 11/128 (8%)

Query: 481 RAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQ 540
            + +    M +KL  L++++    E   ++ E+ Q+LL     ELE  +++ ++L+K I 
Sbjct: 64  GSDAAIAEMEQKLAKLREEL---TELHKKRGELAQRLLLLN-DELEQLRREIQQLEKTIA 119

Query: 541 EKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLA------AKEERK-DVQRENRKEIE 593
           E  +E   +E +   L+EE +   K  + +   L++      A EE+   +Q+EN++ +E
Sbjct: 120 ELRSEITSLETEIRDLREELQEKEKDNETLQDELISLNIELNALEEKLRKLQKENQELVE 179

Query: 594 GFLDNVRQ 601
            ++    Q
Sbjct: 180 RWMAKKGQ 187


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 33.6 bits (77), Expect = 0.40
 Identities = 26/102 (25%), Positives = 54/102 (52%), Gaps = 7/102 (6%)

Query: 467 PSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELE 526
           P  +IE          + +   ++EKL  L+K++    E+ LEK   +Q+ L    +ELE
Sbjct: 495 PHFIIEQAKT-FYGEFKEEINVLIEKLSALEKELEQKNEH-LEKLLKEQEKLK---KELE 549

Query: 527 HRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLK 568
              ++ +E ++N + +  E+ + +E   +L++E E + ++LK
Sbjct: 550 QEMEELKERERNKKLE-LEK-EAQEALKALKKEVESIIRELK 589



 Score = 32.1 bits (73), Expect = 1.1
 Identities = 25/87 (28%), Positives = 45/87 (51%), Gaps = 3/87 (3%)

Query: 509 EKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQE-ESEGLSKKL 567
            K EI   +   +A E E  +QK E L+K ++E++  + ++E++   L+E E     +  
Sbjct: 509 FKEEINVLIEKLSALEKEL-EQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELE 567

Query: 568 KKVSSMLLAAKEERKDVQRE-NRKEIE 593
           K+    L A K+E + + RE   K+I 
Sbjct: 568 KEAQEALKALKKEVESIIRELKEKKIH 594


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 33.0 bits (76), Expect = 0.43
 Identities = 23/107 (21%), Positives = 39/107 (36%), Gaps = 11/107 (10%)

Query: 499 KVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQE 558
           + L   E  +E    + +  +A  + L       E+ K+  Q  +A+    +E    L E
Sbjct: 197 QALTAKEKAIEAERAKAEAAEAEQELLR------EKQKEEEQMMEAQERSYQEHVKQLIE 250

Query: 559 ESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKE 605
           + E   +KL      +L  K      Q +     EGF      + KE
Sbjct: 251 KMEAEREKLLAEQERMLEHKL-----QEQEELLKEGFKTEAESLQKE 292


>gnl|CDD|235206 PRK04031, PRK04031, DNA primase; Provisional.
          Length = 408

 Score = 33.2 bits (77), Expect = 0.44
 Identities = 27/120 (22%), Positives = 45/120 (37%), Gaps = 21/120 (17%)

Query: 419 PKDALLIKYQKEISELKKKLEEASV-PPPREEPTPNPEPELDETEPCVEPSEVIE----- 472
           P +  L +  K+  +  +K++E    P       P PE +     P  +   V E     
Sbjct: 265 PVEQYLEELGKKAQKAAEKVKEEEEKPEKEPAEQPEPEEKEPAPVPAEKEETVREHIKEL 324

Query: 473 ---------DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQ 523
                    DE    +K        +V+KLE  + KV      ++    I Q+LLD A++
Sbjct: 325 KGTLEARLLDENWNVIKEVPVR--DLVDKLEEAEDKV----YAIVFDGIITQRLLDLASE 378


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 33.4 bits (77), Expect = 0.44
 Identities = 16/108 (14%), Positives = 39/108 (36%), Gaps = 12/108 (11%)

Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
           Q+            ++QKE         K+ E I   +  +S  ++++   K+L+K+ + 
Sbjct: 241 QKAERLRQEAAAYEKQQKE-------LAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEAR 293

Query: 574 LLAAKEERKDVQRENRKEIEG-----FLDNVRQITKEYSYQQAIIQKF 616
           L   +   +      R    G      +     ++K Y   + +++  
Sbjct: 294 LAEERPVEEGKPLAFRFPPPGKRLGKLVLEFENVSKGYDGGRLLLKDL 341


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 32.9 bits (75), Expect = 0.48
 Identities = 30/152 (19%), Positives = 62/152 (40%), Gaps = 23/152 (15%)

Query: 464 CVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQ 523
           C E ++++ +  D+EL+ A+ +  +  + L  L+ +                + LD+   
Sbjct: 23  CEECADILLERLDSELRDAEKERDTYKQYLSKLESQ---------NVEISNYEALDS--- 70

Query: 524 ELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKD 583
           EL+  K++EE L   ++E + E  D++ +   LQEE E L              + +   
Sbjct: 71  ELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLE-----------NEELQYLR 119

Query: 584 VQRENRKEIEGFLDNVRQITKEYSYQQAIIQK 615
                 +      DN++ +  +Y Y    + K
Sbjct: 120 EYNLFDRNNLQLEDNLQSLELQYEYSLNQLDK 151


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 33.4 bits (76), Expect = 0.51
 Identities = 23/125 (18%), Positives = 41/125 (32%), Gaps = 4/125 (3%)

Query: 445 PPREEPTPNPEPELDETEPCVEPSEVIEDE----TDAELKRAQSDHQSMVEKLELLQKKV 500
            P E    N E    E E   E     E+E      AE K  Q     +  K    + + 
Sbjct: 651 RPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGET 710

Query: 501 LVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEES 560
                    + E +    +   +  E  ++ E+E +   + K     + + K +  + E+
Sbjct: 711 EAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGET 770

Query: 561 EGLSK 565
           E   K
Sbjct: 771 EAEGK 775


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 33.4 bits (77), Expect = 0.53
 Identities = 33/165 (20%), Positives = 64/165 (38%), Gaps = 28/165 (16%)

Query: 452 PNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQ-SMVEKLELLQKKVLVGGENLLEK 510
           P  + EL + E   E   + E    + ++  +++   S    L++    VLVG E L ++
Sbjct: 709 PEVDEELIDEEAEKEFELLKE--IISAIRNLRAEMNLSPSAPLKV----VLVGSEELEDR 762

Query: 511 AEIQQQLLD--AAAQELEHRKQKEEELKKNIQE---------KDAERIDMEEKYSSLQEE 559
            E  +  +   A  +ELE      EE    +             A  ID+  + + L++E
Sbjct: 763 LEANEDDIKGLANLEELEILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKE 822

Query: 560 SEGLSKKLKKVSSMLLAAK----------EERKDVQRENRKEIEG 594
            E L K++ ++   L              E+ K+   E + ++  
Sbjct: 823 LEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAK 867


>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
           envelope biogenesis, outer membrane].
          Length = 835

 Score = 33.1 bits (76), Expect = 0.56
 Identities = 33/182 (18%), Positives = 69/182 (37%), Gaps = 18/182 (9%)

Query: 443 VPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQ-------SDHQSMVE-KLE 494
           V   R+E       E +     ++    I D+    L +         SD + +    L+
Sbjct: 38  VNSKRQEEAEPAAEEAELQAELIQQELAINDQLSQALNQQTERLNALASDDRQLANLLLQ 97

Query: 495 LLQ-------KKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERI 547
           LLQ       +  ++ G  LL +  +QQ      A + + + +  +E      EK A   
Sbjct: 98  LLQSSRTIREQIAVLRGSLLLSRILLQQLGPLPEAGQPQEQFEVTQERDALQAEK-AYIN 156

Query: 548 DMEEKYSSLQEESEGLSKKLKKVSSMLLAAK-EERKDVQRE-NRKEIEGFLDNVRQITKE 605
            +E +   L  E   +  ++      LL +   +R+ +  + N++++    D +R +  +
Sbjct: 157 ALEGQAEQLTAEVRDILDQILDTRRELLNSLLSQREAISLQLNQQQLSAASDELRSLLHQ 216

Query: 606 YS 607
            S
Sbjct: 217 QS 218


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
           biogenesis [Translation, ribosomal structure and
           biogenesis].
          Length = 1077

 Score = 33.2 bits (75), Expect = 0.57
 Identities = 25/127 (19%), Positives = 51/127 (40%), Gaps = 15/127 (11%)

Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR 528
           E+I+D+     +    D ++  +      +   V  EN     E+  +       E E  
Sbjct: 594 ELIQDDEKGNFED-LEDEENSSDNEMEESRGSSVTAENEESADEVDYE------TEREEN 646

Query: 529 KQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQREN 588
            +K+EEL+ N + ++      ++      EE   + ++LK         + E + +  E+
Sbjct: 647 ARKKEELRGNFELEERGD-PEKKDVDWYTEEKRKIEEQLK-------INRSEFETMVPES 698

Query: 589 RKEIEGF 595
           R  IEG+
Sbjct: 699 RVVIEGY 705


>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
          Length = 278

 Score = 32.5 bits (74), Expect = 0.61
 Identities = 25/72 (34%), Positives = 34/72 (47%), Gaps = 15/72 (20%)

Query: 522 AQELEHRKQKEEELKKNIQE--------KDAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
           AQ LE  KQK EE +K I +        K  E+ + E K    ++    LSKK KK    
Sbjct: 17  AQSLEELKQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKKKKF--- 73

Query: 574 LLAAKEERKDVQ 585
               K+E+ DV+
Sbjct: 74  ----KKEKVDVR 81


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 32.2 bits (74), Expect = 0.64
 Identities = 15/80 (18%), Positives = 39/80 (48%), Gaps = 10/80 (12%)

Query: 527 HRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQR 586
             +Q + EL++ I++ + E+ ++E++ + L+ + E + K            +EER+  ++
Sbjct: 117 QAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEK----------REEEERQIEEK 166

Query: 587 ENRKEIEGFLDNVRQITKEY 606
            +  EI       +Q+  + 
Sbjct: 167 RHADEIAFLKKQNQQLKSQL 186



 Score = 30.3 bits (69), Expect = 2.2
 Identities = 16/69 (23%), Positives = 40/69 (57%), Gaps = 7/69 (10%)

Query: 509 EKAEIQQQL--LDAAAQELEHR----KQKEEELKKNIQE-KDAERIDMEEKYSSLQEESE 561
            K+E++Q++  L+   +ELE R    + K E ++K  +E +  E     ++ + L+++++
Sbjct: 121 GKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQ 180

Query: 562 GLSKKLKKV 570
            L  +L+++
Sbjct: 181 QLKSQLEQI 189


>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
           The small mechanosensitive channel, MscS, is a part of
           the turgor-driven solute efflux system that protects
           bacteria from lysis in the event of osmotic shock. The
           MscS protein alone is sufficient to form a functional
           mechanosensitive channel gated directly by tension in
           the lipid bilayer. The MscS proteins are heptamers of
           three transmembrane subunits with seven converging M3
           domains, and this MscS_porin is towards the N-terminal
           of the molecules. The high concentration of negative
           charges at the extracellular entrance of the pore helps
           select the cations for efflux.
          Length = 239

 Score = 32.2 bits (74), Expect = 0.65
 Identities = 19/125 (15%), Positives = 52/125 (41%), Gaps = 5/125 (4%)

Query: 477 AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK 536
            +L +         E+  L+Q   L    N L++ +  +Q  D   ++++   ++  EL+
Sbjct: 1   NQLDQLNKRKLPTPEQKALIQ--DLEQALNFLDEIDKSKQKADQYQKQIDDAPKELRELR 58

Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFL 596
           + ++        +  + ++L      L ++L +  S L   +E+ +    +   E++   
Sbjct: 59  QELEALKKTDAPVFPELANLSLSQ--LEQRLAQTLSQLQELQEQLQQENSQLI-ELQTRP 115

Query: 597 DNVRQ 601
           +  +Q
Sbjct: 116 ERAQQ 120


>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein. 
          Length = 312

 Score = 32.6 bits (75), Expect = 0.67
 Identities = 53/206 (25%), Positives = 87/206 (42%), Gaps = 51/206 (24%)

Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEP-SEVIEDE----TDAE-- 478
            Y++EI EL+K+L+E +    R       + E+D      E   E  EDE      AE  
Sbjct: 51  LYEQEIRELRKQLDELTNERARL------QLEIDNLRLAAEDFREKYEDELNLRQSAEND 104

Query: 479 ---LKR-------AQSDHQSMVEKL-------------ELLQKKVLVGGENLLEKAEIQQ 515
              L++       A+ D +  +E L             E+ + +  + G+  +E    ++
Sbjct: 105 IVGLRKDLDEATLARVDLEMKIESLQEELAFLKKNHEEEVRELQSQIQGQVNVEMDAARK 164

Query: 516 QLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLL 575
             L  A  E+  R Q EE  +KN QE        EE Y S   + E L +   +    L 
Sbjct: 165 LDLTKALAEI--RAQYEELAEKNRQE-------AEEWYKS---KLEELQQAAARNGDALR 212

Query: 576 AAKEERKDVQRE-NRKEIEGFLDNVR 600
           +AKEE  +++R+    EIE  L +++
Sbjct: 213 SAKEEITELRRQIQSLEIE--LQSLK 236


>gnl|CDD|237868 PRK14960, PRK14960, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 702

 Score = 32.7 bits (74), Expect = 0.71
 Identities = 46/205 (22%), Positives = 76/205 (37%), Gaps = 34/205 (16%)

Query: 440 EASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKK 499
               P P  EP P PEPE  E EP  EP    E + + +L     +H  ++         
Sbjct: 410 VEPEPEPEPEPEPEPEPE-PEPEPEPEPEPEPEPQPNQDLMVFDPNHHELIG-------- 460

Query: 500 VLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEE 559
                   LE A +Q+ +   +  E +     E++L +   E   ++I+ E   ++   E
Sbjct: 461 --------LESAVVQETV---SVLEEDFIPVPEQKLVQVQAETQVKQIEPEPASTA---E 506

Query: 560 SEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFIPT 619
             GL +      S  LA      D+  E   E +  +        E     A++++   T
Sbjct: 507 PIGLFEASSAEFS--LAQDTSAYDLVSEPVIEQQSLVQ------AEIVETVAVVKEPNAT 558

Query: 620 KYLELMTQNA---TWNEDIGEWELK 641
              +LM Q+          GEW L+
Sbjct: 559 DNSQLMPQDILKLPSQTLEGEWTLE 583


>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are about 260 amino acids in
           length.
          Length = 250

 Score = 32.2 bits (74), Expect = 0.75
 Identities = 14/90 (15%), Positives = 36/90 (40%), Gaps = 12/90 (13%)

Query: 516 QLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLL 575
           Q L AA Q +    Q     +K I +   +  ++  +   L +E E L     ++  ++ 
Sbjct: 26  QALAAAQQTVAAAAQ----SQKKIDKWADQTQELLAEIRQLLKEIENLRVYNDQLQRLVA 81

Query: 576 AAKEERKDVQRENRKEIEGFLDNVRQITKE 605
             ++E   +Q++        ++ + +  + 
Sbjct: 82  NQQQEIASLQQQ--------IEQIEKTRQG 103


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 32.7 bits (75), Expect = 0.75
 Identities = 33/105 (31%), Positives = 52/105 (49%), Gaps = 13/105 (12%)

Query: 515 QQLLDAAAQELEHRKQKEEELK----KNIQEKDAERID-----MEEKYSSLQEESEGLSK 565
           +Q L    +ELE R+QKEEE++    K I EK  E ++     M+ +    +EE + +  
Sbjct: 86  EQGLQRKLKELE-REQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQMEI 144

Query: 566 KLKKVSSMLLAAKEERKDVQRENR---KEIEGFLDNVRQITKEYS 607
             K + S L   KEE K +QR  +   KE E    +  ++ +EY 
Sbjct: 145 LEKALKSYLKIVKEENKSLQRLAKALQKESEERTQDETKMIEEYR 189


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 32.8 bits (75), Expect = 0.77
 Identities = 27/116 (23%), Positives = 50/116 (43%), Gaps = 3/116 (2%)

Query: 503 GGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEG 562
           GGE +LE      Q L +AA  +       +EL+K   E    R    E   +L+E  E 
Sbjct: 103 GGEEILEAQGELGQYLFSAAAGVGSLGSVRDELEKEADELWKPRGRKPEINVALKELKE- 161

Query: 563 LSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFIP 618
           L  ++++V  +     ++      E  +E+      +RQ+ KE   +   +++ +P
Sbjct: 162 LEAEIREV-QLKTRTWKDLVKALDEAEEELANLRKELRQLEKEKQ-RLERLRRLLP 215



 Score = 29.3 bits (66), Expect = 8.0
 Identities = 21/136 (15%), Positives = 49/136 (36%), Gaps = 15/136 (11%)

Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKE 532
           DE   + +RA  + +   ++ +   +       + L +A +  +   A A +     Q  
Sbjct: 686 DERLRDAERALEEAEERHDEAQSALEAWQEQWYDALLEAGLGGRASPAGALDALELLQNI 745

Query: 533 EELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKL-------------KKVSSMLLAAKE 579
           +E  +   +       ME   +  +EE E L++ +             + +   L  A++
Sbjct: 746 KEKLQAADDLRQRIAAMERDLARFEEEVEALAEAVAPEMLGTPADETARALKQRLKRARD 805

Query: 580 ERKDVQR--ENRKEIE 593
                ++  E  +E E
Sbjct: 806 TAAAAEKLAEEIEEAE 821


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 32.2 bits (73), Expect = 0.77
 Identities = 26/113 (23%), Positives = 49/113 (43%), Gaps = 4/113 (3%)

Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR 528
           E++  E    L RA       V +   L++      E L E  + +++LL     ELE  
Sbjct: 106 ELVRKELKNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLK----ELEEL 161

Query: 529 KQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEER 581
           + + EE+++ ++  + E   +EE    L  E   L K+  ++   +   +EE 
Sbjct: 162 EAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVELPEEEL 214


>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ.  Members of
           this family are the FliJ protein found, in nearly every
           case, in the midst of other flagellar biosynthesis genes
           in bacgterial genomes. Typically the fliJ gene is found
           adjacent to the gene for the flagellum-specific ATPase
           FliI. Sequence scoring in the gray zone between trusted
           and noise cutoffs include both probable FliJ proteins
           and components of bacterial type III secretion systems.
          Length = 141

 Score = 31.1 bits (71), Expect = 0.78
 Identities = 19/91 (20%), Positives = 35/91 (38%), Gaps = 16/91 (17%)

Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKA----------------EIQQ 515
           E++   EL +AQ++ + +  +L+ L K      +  LEK                     
Sbjct: 15  EEQAKLELAKAQAEFERLETQLQQLIKYREEYEQQALEKVGAGTSALELSNYQRFIRQLD 74

Query: 516 QLLDAAAQELEHRKQKEEELKKNIQEKDAER 546
           Q +    QEL   +Q+ E  ++ + E   E 
Sbjct: 75  QRIQQQQQELALLQQEVEAKRERLLEARREL 105


>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1.  This is
           the N-terminal 200 residues of a set of proteins
           conserved from yeasts to humans. Most of the proteins in
           this entry have an RhoGEF pfam00621 domain at their
           C-terminal end.
          Length = 158

 Score = 31.2 bits (71), Expect = 0.85
 Identities = 25/106 (23%), Positives = 51/106 (48%), Gaps = 2/106 (1%)

Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
           E  L + + ++ L     +E E RKQ E++L +   E + E+ ++++K   L+E    L 
Sbjct: 51  ELELLREDNER-LSTQYEREKELRKQAEQKLLEFEDELEQEKKELQKKIEDLEENVRQLE 109

Query: 565 KKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQ 610
            K K +S  +   +E   ++++E     E +   + +   EY  +Q
Sbjct: 110 LKAKNLSDQVSRLEERETELKKEYNSLHERYT-KLLKNYVEYVERQ 154


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 32.3 bits (74), Expect = 0.94
 Identities = 19/83 (22%), Positives = 36/83 (43%), Gaps = 15/83 (18%)

Query: 510 KAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEE-------KYSSLQE---- 558
           + E +++      + L   K++EE+ KK +++ +  +I   E       KYS+  E    
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGLP 613

Query: 559 ----ESEGLSKKLKKVSSMLLAA 577
               + E +SKK +K  S     
Sbjct: 614 THDADGEEISKKERKKLSKEYDK 636


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 31.2 bits (71), Expect = 0.96
 Identities = 15/44 (34%), Positives = 26/44 (59%)

Query: 509 EKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEK 552
            K ++++  L A  +E E  K+KEE   K + E + ER ++E+K
Sbjct: 68  AKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKK 111


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 32.2 bits (73), Expect = 1.00
 Identities = 21/105 (20%), Positives = 41/105 (39%), Gaps = 3/105 (2%)

Query: 478 ELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKK 537
           EL + + +H+        L++K     EN   + +   +L        E  K+ E E  +
Sbjct: 76  ELMQKELEHK---RAQIELERKASTLAENYERELDRNLELEVRLKALEELEKKAENEAAE 132

Query: 538 NIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERK 582
             +E    +  ++ +   LQ E E   K+ K+  S +     E +
Sbjct: 133 AEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQ 177


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 32.2 bits (73), Expect = 1.1
 Identities = 20/68 (29%), Positives = 35/68 (51%), Gaps = 10/68 (14%)

Query: 523 QELEHRKQKEEELKKNIQEKDAERIDMEEK--YSSLQEESEGLSKKLKKVSSMLLAAKEE 580
           +ELE +K+KEE+ K    EK+ +++   +K   + LQ +       + K S      K  
Sbjct: 16  EELERKKKKEEKAK----EKELKKLKAAQKEAKAKLQAQQASDGTNVPKKS----EKKSR 67

Query: 581 RKDVQREN 588
           ++DV+ EN
Sbjct: 68  KRDVEDEN 75


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 32.5 bits (74), Expect = 1.1
 Identities = 21/92 (22%), Positives = 39/92 (42%), Gaps = 4/92 (4%)

Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKA----EIQQQLLDAAAQELEH 527
           E E DA  +  + +  ++  +LE LQ          LE+A    E  Q     A Q +  
Sbjct: 299 ERELDARTEALEREADALRTRLEALQGSPAYQDAEELERARADAEALQAAAADARQAIRE 358

Query: 528 RKQKEEELKKNIQEKDAERIDMEEKYSSLQEE 559
            + + EE ++ + E+     D E +  + +E+
Sbjct: 359 AESRLEEERRRLDEEAGRLDDAERELRAAREQ 390


>gnl|CDD|226025 COG3494, COG3494, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 279

 Score = 31.6 bits (72), Expect = 1.1
 Identities = 17/87 (19%), Positives = 30/87 (34%), Gaps = 16/87 (18%)

Query: 120 RPIVDKVLSGFNGTIFAYGQTGT-GKTFTMEGVNNV---------PELKGIIPNSFA--- 166
           R   D  L  F     + G+ G   K    EGV+ V         P  + + P+      
Sbjct: 37  RGEADPELKEFEYKEVSIGEVGKLIKLLKTEGVDRVVLAGGVERRPNFRDLRPDKIGLAV 96

Query: 167 --HIFGHIAKADESVKFLVRVSYFEIY 191
              I   + + D+++   V + + E  
Sbjct: 97  LPKIVEALIRGDDALLKAV-IDFIESR 122


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 32.0 bits (73), Expect = 1.2
 Identities = 23/140 (16%), Positives = 60/140 (42%), Gaps = 6/140 (4%)

Query: 468 SEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEH 527
           +E +    + E  R +   +    +   + +K +   E  +E+ E +   L     ELE 
Sbjct: 391 AEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKR---ELEE 447

Query: 528 RKQKEEELKKNIQE-KDAERIDM--EEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDV 584
            K++ E+L+  ++  +   R  +  + +  +     E L K+L++    +   + +  ++
Sbjct: 448 LKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAEL 507

Query: 585 QRENRKEIEGFLDNVRQITK 604
           ++  + E+ G    V+ + K
Sbjct: 508 RKMRKLELSGKGTPVKVVEK 527


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 31.9 bits (73), Expect = 1.2
 Identities = 23/93 (24%), Positives = 45/93 (48%), Gaps = 3/93 (3%)

Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQL--LDAAAQELE 526
           EVI +E +A  K  +    S +E LEL +++    GE    +AE++     +D    E +
Sbjct: 175 EVISEELEARRKLKEKGLVSRLELLELERERAEAQGELGRLEAELEVLKRQIDELQLERQ 234

Query: 527 HRKQK-EEELKKNIQEKDAERIDMEEKYSSLQE 558
             +Q   EE+ + + E  A   ++ E+ +  ++
Sbjct: 235 QIEQTFREEVLEELTEAQARLAELRERLNKARD 267


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 32.2 bits (74), Expect = 1.2
 Identities = 49/344 (14%), Positives = 113/344 (32%), Gaps = 80/344 (23%)

Query: 307 MGRLHLVDLAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKL 366
           +G +H+ DL       +       L + S+    L  L + +  L   +          L
Sbjct: 28  LGVVHIEDLKEELSNERLRKLRSLLTKLSEA---LDKLRSYLPKLNPLREEKKKVSVKSL 84

Query: 367 TRILQDSLGGNSKTVMCATVGPASYNYEETISTLRYASRVKKICNKARVNQDPKDALLIK 426
             +++D                     EE +  +     +K++  +    ++    L   
Sbjct: 85  EELIKDV--------------------EEELEKI--EKEIKELEEEISELENEIKEL--- 119

Query: 427 YQKEISELKKKLEEASVPPPREEPTPN--------PEPELDETEPCVEPSEVIEDETDAE 478
            ++EI  L+       +                  PE +L+E +   +   V    TD  
Sbjct: 120 -EQEIERLEP-WGNFDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKG 177

Query: 479 --------LKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQ 530
                   LK    + +  ++KL   + ++   G           +L+    +ELE  ++
Sbjct: 178 YVYVVVVVLKELSDEVEEELKKLGFERLELEEEG--------TPSELIREIKEELEEIEK 229

Query: 531 KEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRK 590
           + E L + ++E        ++    L    E L  +L++  ++    K ++         
Sbjct: 230 ERESLLEELKELA------KKYLEELLALYEYLEIELERAEALSKFLKTDKTFA------ 277

Query: 591 EIEGFL-----DNVRQITKEYSYQQAIIQK--------FIPTKY 621
            IEG++       ++++  + +   A ++          +PTK 
Sbjct: 278 -IEGWVPEDRVKKLKELIDKATGGSAYVEFVEPDEEEEEVPTKL 320


>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR).  This
           family consists of several mammalian alpha helical
           coiled-coil rod HCR proteins. The function of HCR is
           unknown but it has been implicated in psoriasis in
           humans and is thought to affect keratinocyte
           proliferation.
          Length = 739

 Score = 31.9 bits (72), Expect = 1.2
 Identities = 28/110 (25%), Positives = 50/110 (45%), Gaps = 12/110 (10%)

Query: 479 LKRAQSDHQSMVEKL-------ELLQKKVLVGGENLLEKAEIQQQ-----LLDAAAQELE 526
           L  A+   ++  E L       E ++K +  GG+  LE+A+   Q     L  A  + L 
Sbjct: 106 LAVAEKAGRAEAEGLRAALAGAEEVRKNLEEGGQQELEEAQRLHQEQLSSLTQAHLEALS 165

Query: 527 HRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLA 576
             + K E L+K++Q  +  R    +  ++ Q E++ L ++L K    L A
Sbjct: 166 SLRSKAEGLEKSLQSLETRRAGEAKALAAAQAEADTLREQLSKTQEELEA 215


>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
          Length = 793

 Score = 32.0 bits (72), Expect = 1.2
 Identities = 33/148 (22%), Positives = 64/148 (43%), Gaps = 18/148 (12%)

Query: 449 EPTPNPE-PELDETEPCVEPSEVIED-ETDAELKRAQSDHQSMVEKLELLQKKVLVGGEN 506
           EP+  P   EL ++E   EP E +E+ E   EL+    + + ++E L L+Q++     + 
Sbjct: 252 EPSCRPSMSELLQSEFINEPRENLEEREAAMELRDRIEEQELLLEFLFLIQQRKQEAADK 311

Query: 507 L----------LEKAEIQQQLLDAAAQEL------EHRKQKEEELKKNIQEKDAERIDME 550
           L          +++   +Q +L     ++        R ++  E     +E D     ++
Sbjct: 312 LQDTISLLSSDIDQVVKRQLVLQQKGSDVRSFLASRKRIRQGAETLAAEEENDDNSSKLD 371

Query: 551 EKYSSLQEESEGLSKKLKKVSSMLLAAK 578
           +   S   ES  L + LKK+ S+  A +
Sbjct: 372 DTLESTLLESSRLMRNLKKLESVYFATR 399


>gnl|CDD|115071 pfam06390, NESP55, Neuroendocrine-specific golgi protein P55
           (NESP55).  This family consists of several mammalian
           neuroendocrine-specific golgi protein P55 (NESP55)
           sequences. NESP55 is a novel member of the chromogranin
           family and is a soluble, acidic, heat-stable secretory
           protein that is expressed exclusively in endocrine and
           nervous tissues, although less widely than
           chromogranins.
          Length = 261

 Score = 31.4 bits (70), Expect = 1.3
 Identities = 32/137 (23%), Positives = 57/137 (41%), Gaps = 14/137 (10%)

Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELD-ETEPCVEPSEVIEDETDAELKRAQS 484
           +Y ++  + +   E         E     EPE + +T P  EP    EDE    + +  +
Sbjct: 106 EYDEDDFDTETDSETEPESDIESETEFETEPETEPDTAPTTEPETEPEDEPGPVVPKGAT 165

Query: 485 DHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDA 544
            HQS+ E+L  L+          L+ A+   +    + QE E  ++ EE  +  +   D 
Sbjct: 166 FHQSLTERLHALK----------LQSADASPRRAPPSTQEPESAREGEEPERGPL---DK 212

Query: 545 ERIDMEEKYSSLQEESE 561
           +  D EE+    +EE +
Sbjct: 213 DPRDPEEEEEEKEEEKQ 229


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 32.0 bits (73), Expect = 1.3
 Identities = 34/212 (16%), Positives = 80/212 (37%), Gaps = 33/212 (15%)

Query: 420  KDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAEL 479
            K+ + ++  + +   +++L+E                 L      +   +++  E ++E+
Sbjct: 866  KETIYLQSAQRVELAERQLQE-----------------LKIDVKSISSLKLVNLELESEI 908

Query: 480  KRAQSDHQS-MVEKLELLQK-----KVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEE 533
               +    S ++E LE   +     K L+   +L E   I+   L            K  
Sbjct: 909  IELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLP--------ELNKLH 960

Query: 534  ELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR--KE 591
            E++  ++E   E  D+ +K + L  E    + +LK     L    ++   +Q   +  KE
Sbjct: 961  EVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKE 1020

Query: 592  IEGFLDNVRQITKEYSYQQAIIQKFIPTKYLE 623
            +   +  ++  +K  S +   +    P + L+
Sbjct: 1021 LPVEVAELQSASKIISSESTELSILKPLQKLK 1052



 Score = 30.0 bits (68), Expect = 5.5
 Identities = 47/299 (15%), Positives = 103/299 (34%), Gaps = 58/299 (19%)

Query: 414  RVNQDPKDALLIKYQ---KEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEV 470
             + +     L+   +   + I+ LKK L    +             E   +   V+  E+
Sbjct: 910  ELKKSLSSDLIENLEFKTELIARLKKLLNNIDL-------------EEGPSIEYVKLPEL 956

Query: 471  IED-ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLL----EKAEIQQQL--LDAAAQ 523
             +  E +++LK    +++ +++K  +L ++       L     E AE+ +Q   L  + +
Sbjct: 957  NKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTK 1016

Query: 524  ELEHRKQKEEELKKNIQEKDAE-------------RIDMEEKYSSLQEESEGLS------ 564
            +L+    +  EL+   +   +E             +  +  + + LQ   + L       
Sbjct: 1017 QLKELPVEVAELQSASKIISSESTELSILKPLQKLKGLLLLENNQLQARYKALKLRRENS 1076

Query: 565  -------KKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFI 617
                    +L+   ++L     +  +V   N  +    L  +     + +  Q  I KF+
Sbjct: 1077 LLDDKQLYQLESTENLLKTINVKDLEVTNRNLVKPANVLQFIVAQMIKLNLLQ-EISKFL 1135

Query: 618  PTK--YLELMTQNATWNEDIGEWELKCVAYTGNNMRKIVPLPEDTTATQIAQPDFSHLY 674
                  LE + Q  +  +     EL  + +  N      P P    + +      S LY
Sbjct: 1136 SQLVNTLEPVFQKLSVLQL----ELDGLFWEANLEALPSPPPFAALSEKR--LYQSALY 1188


>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like.  This family is a set
           of eukaryotic tropomyosins. Within the yeast Tmp1 and
           Tmp2, biochemical and sequence analyses indicate that
           Tpm2p spans four actin monomers along a filament,
           whereas Tpmlp spans five. Despite its shorter length,
           Tpm2p can compete with Tpm1p for binding to F-actin.
           Over-expression of Tpm2p in vivo alters the axial
           budding of haploids to a bipolar pattern, and this can
           be partially suppressed by co-over-expression of Tpm1p.
           This suggests distinct functions for the two
           tropomyosins, and indicates that the ratio between them
           is important for correct morphogenesis. The family also
           contains higher eukaryote Tmp3 members.
          Length = 143

 Score = 30.6 bits (70), Expect = 1.3
 Identities = 31/141 (21%), Positives = 67/141 (47%), Gaps = 35/141 (24%)

Query: 430 EISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSM 489
           EI+ L+KK ++              E E+++ E  ++ ++   +E++    +  ++ +++
Sbjct: 36  EITSLQKKNQQL-------------EEEVEKLEEQLKEAKEKLEESE----KLATNAEAL 78

Query: 490 VEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDM 549
             +++LL        E  LE++E   + L    ++L    +K EE ++ ++  + ER + 
Sbjct: 79  TRRIQLL--------EEELEESE---KRLKETTEKLREADKKAEESERKVKALENERDEW 127

Query: 550 EEKYSSLQEESEGLSKKLKKV 570
           EEKY       E L KK K+ 
Sbjct: 128 EEKY-------EELEKKYKEA 141


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 32.1 bits (74), Expect = 1.3
 Identities = 20/90 (22%), Positives = 38/90 (42%), Gaps = 7/90 (7%)

Query: 452 PNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKA 511
           PN E +       +E  +  EDE + E      +  S+      L +KVL   E     A
Sbjct: 176 PNAEEDPAHVGSELEELDDDEDEEEEE----DENDDSLAADESELPEKVL---EKFKALA 228

Query: 512 EIQQQLLDAAAQELEHRKQKEEELKKNIQE 541
           +  ++L  A  +++E R  + ++  K  ++
Sbjct: 229 KQYKKLRKAQEKKVEGRLAQHKKYAKLREK 258


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 31.3 bits (71), Expect = 1.3
 Identities = 21/84 (25%), Positives = 40/84 (47%), Gaps = 7/84 (8%)

Query: 523 QELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERK 582
           ++     + +EE K   ++ + E I+ EE     +E +E   ++LK  ++ L   KEE K
Sbjct: 15  EDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKL---KEENK 71

Query: 583 DVQRENRKEIEGFLDNVRQITKEY 606
            ++     E+E   D + +   EY
Sbjct: 72  KLE----NELEALKDRLLRTVAEY 91


>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN.  FtsN is a poorly
           conserved protein active in cell division in a number of
           Proteobacteria. The N-terminal 30 residue region tends
           to by Lys/Arg-rich, and is followed by a
           membrane-spanning region. This is followed by an acidic
           low-complexity region of variable length and a
           well-conserved C-terminal domain of two tandem regions
           matched by pfam05036 (Sporulation related repeat), found
           in several cell division and sporulation proteins. The
           role of FtsN as a suppressor for other cell division
           mutations is poorly understood; it may involve cell wall
           hydrolysis [Cellular processes, Cell division].
          Length = 298

 Score = 31.6 bits (71), Expect = 1.3
 Identities = 26/170 (15%), Positives = 52/170 (30%), Gaps = 6/170 (3%)

Query: 428 QKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQ 487
           Q    E  +   +        +  P PE      E       +I D  +        +  
Sbjct: 53  QANEPETLQPKNQTENGETAADLPPKPEERWSYIEELEAREVLINDPEEPSNGGGVEESA 112

Query: 488 SMVE----KLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKD 543
            +       LE +Q  +    + L      Q   ++A  Q  E + QK    +      +
Sbjct: 113 QLTAEQRQLLEQMQADMRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVE 172

Query: 544 AERID--MEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKE 591
            E+I   ++E     +   +  ++       +  A K ++ D  +   KE
Sbjct: 173 TEKIASKVKEAKQKQKALPKQTAETQSNSKPIETAPKADKADKTKPKPKE 222


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 31.5 bits (72), Expect = 1.3
 Identities = 18/128 (14%), Positives = 34/128 (26%), Gaps = 10/128 (7%)

Query: 414 RVNQDPKDALLIKYQKEISE------LKKKLEEASVPPPREEP---TPNPEPELDETEPC 464
           RVN  P +A   +  +   +               V  P E        P P     +P 
Sbjct: 70  RVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQP- 128

Query: 465 VEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQE 524
           V+             +          + +    +      +     A  Q + +   A  
Sbjct: 129 VQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPV 188

Query: 525 LEHRKQKE 532
           ++  K+KE
Sbjct: 189 MDKPKRKE 196


>gnl|CDD|178770 PLN03231, PLN03231, putative alpha-galactosidase; Provisional.
          Length = 357

 Score = 31.5 bits (71), Expect = 1.3
 Identities = 21/79 (26%), Positives = 32/79 (40%), Gaps = 12/79 (15%)

Query: 289 RTTGENWRLWAEAQGSVQMGRLHLVDLAGSERQAKTGASGQRLKEASKINLSLSTLGNVI 348
           R TG++W  W        + R    D A +   A     G +    S ++L +   G   
Sbjct: 236 RVTGDDWDDWKYLVKHFDVAR----DFAAAGLIAIPSVVGGK----SWVDLDMLPFGR-- 285

Query: 349 SALVDGKCTHIPYRNSKLT 367
             L D    + PYRNS+L+
Sbjct: 286 --LTDPAAAYGPYRNSRLS 302


>gnl|CDD|153311 cd07627, BAR_Vps5p, The Bin/Amphiphysin/Rvs (BAR) domain of yeast
           Sorting Nexin Vps5p.  BAR domains are dimerization,
           lipid binding and curvature sensing modules found in
           many different proteins with diverse functions. Sorting
           nexins (SNXs) are Phox homology (PX) domain containing
           proteins that are involved in regulating membrane
           traffic and protein sorting in the endosomal system.
           SNXs differ from each other in their lipid-binding
           specificity, subcellular localization and specific
           function in the endocytic pathway. A subset of SNXs also
           contain BAR domains. The PX-BAR structural unit
           determines the specific membrane targeting of SNXs.
           Vsp5p is the yeast counterpart of human SNX1 and is part
           of the retromer complex, which functions in the
           endosome-to-Golgi retrieval of vacuolar protein sorting
           receptor Vps10p, the Golgi-resident membrane protein
           A-ALP, and endopeptidase Kex2. BAR domains form dimers
           that bind to membranes, induce membrane bending and
           curvature, and may also be involved in protein-protein
           interactions.
          Length = 216

 Score = 31.1 bits (71), Expect = 1.5
 Identities = 23/97 (23%), Positives = 45/97 (46%), Gaps = 13/97 (13%)

Query: 516 QLLDAAAQELEHRKQKEEELKKN-------IQEKDAERIDMEEKYSSLQEESEGLSKKLK 568
           Q   +A  EL  +K + E+LK+        +    +E  + E + S L++E E +S+ +K
Sbjct: 115 QYWQSAESELSKKKAQLEKLKRQGKTQQEKLNSLLSELEEAERRASELKKEFEEVSELIK 174

Query: 569 KVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKE 605
                    + ER +   + R  +E +L++  +  KE
Sbjct: 175 SELERF---ERERVE---DFRNSVEIYLESAIESQKE 205


>gnl|CDD|224477 COG1561, COG1561, Uncharacterized stress-induced protein [Function
           unknown].
          Length = 290

 Score = 31.4 bits (72), Expect = 1.6
 Identities = 19/105 (18%), Positives = 41/105 (39%), Gaps = 21/105 (20%)

Query: 500 VLVGGENLLEKAEIQQQLLDAAAQELEH----RKQKEEELKKNIQEKDAERIDMEEKYSS 555
           VL   E   +   ++  +L A  + L+     R+++   LK ++     +R+D       
Sbjct: 116 VLAIEEEEEDLEALEALILAALEEALDDLIEMREREGAALKADLL----QRLD------- 164

Query: 556 LQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVR 600
                  + + ++KV S++    E  ++       E +  LD  R
Sbjct: 165 ------AIEELVEKVESLMPEILEWYRERLVARLNEAQDQLDEDR 203


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 31.7 bits (72), Expect = 1.6
 Identities = 21/80 (26%), Positives = 41/80 (51%), Gaps = 4/80 (5%)

Query: 478 ELKRAQSDHQSMVEKLELLQKKVLVGGENLLE----KAEIQQQLLDAAAQELEHRKQKEE 533
           +L++A  +  S  E   LL+ K+    E L E      + ++Q+ +    + ++  +K  
Sbjct: 520 DLQKALQEQGSKSEDSSLLKSKLEEHLEQLHEANEELQKKREQIEELEPDQDQNLSRKIA 579

Query: 534 ELKKNIQEKDAERIDMEEKY 553
           EL+  +Q+KD +   MEE+Y
Sbjct: 580 ELEAALQKKDEDMRAMEERY 599



 Score = 29.8 bits (67), Expect = 6.3
 Identities = 20/86 (23%), Positives = 42/86 (48%)

Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
           + ++   E  Q ++  A QEL  ++Q     +++    D +     E     QEE + L+
Sbjct: 133 QQIMTLEESVQHVVMTAIQELMSKEQGSSPSRESAGNLDQQLKKALEDLKEAQEEKDELA 192

Query: 565 KKLKKVSSMLLAAKEERKDVQRENRK 590
           ++  ++   +L  +EE+  +Q+EN K
Sbjct: 193 QRCHELDKQVLLLQEEKNSLQQENEK 218


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 30.3 bits (69), Expect = 1.7
 Identities = 21/81 (25%), Positives = 38/81 (46%), Gaps = 1/81 (1%)

Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKA-EIQQQLLDAAAQELEHRKQK 531
           DE  A++    ++ + + E+ + L  +     E   E+A EI +Q    A Q  E  K +
Sbjct: 36  DERQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAE 95

Query: 532 EEELKKNIQEKDAERIDMEEK 552
            EE  + I+E     I+ E++
Sbjct: 96  AEEELERIKEAAEAEIEAEKE 116


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 31.4 bits (71), Expect = 1.7
 Identities = 24/100 (24%), Positives = 47/100 (47%), Gaps = 12/100 (12%)

Query: 493 LELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEK 552
           LE L+K V  G + ++++A            ++E   +KE E     +E   +R+ +EE 
Sbjct: 363 LERLEKDVEEGEKTIVKEAR-----------QIEEELEKEVEKLGKEEESLFKRVALEEG 411

Query: 553 YSSLQEESEGLSKKL-KKVSSMLLAAKEERKDVQRENRKE 591
              L+++ E   K+L K+   +L   K E  +V++   + 
Sbjct: 412 LKELEQDEENFLKELSKEEKELLEKLKMEASEVEKLFGRA 451


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 31.5 bits (72), Expect = 1.8
 Identities = 13/34 (38%), Positives = 17/34 (50%)

Query: 441 ASVPPPREEPTPNPEPELDETEPCVEPSEVIEDE 474
           A   PP EEP P P P  D+ +P  +  E  + E
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAE 293


>gnl|CDD|223182 COG0104, PurA, Adenylosuccinate synthase [Nucleotide transport and
           metabolism].
          Length = 430

 Score = 31.4 bits (72), Expect = 1.9
 Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 8/71 (11%)

Query: 522 AQELEHRKQKEEELKKNIQEK--------DAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
             +L   +   E+L++ ++ K         AE +D E+      E +E L   +  VS +
Sbjct: 149 VGDLLDPETLREKLERLLEYKNFQLVKYYGAEAVDFEDILDEYYEYAERLKPYVTDVSVL 208

Query: 574 LLAAKEERKDV 584
           L  A +  K V
Sbjct: 209 LNDALDAGKRV 219


>gnl|CDD|218380 pfam05010, TACC, Transforming acidic coiled-coil-containing protein
           (TACC).  This family contains the proteins TACC 1, 2 and
           3 the genes for which are found concentrated in the
           centrosomes of eukaryotic and may play a conserved role
           in organising centrosomal microtubules. The human TACC
           proteins have been linked to cancer and TACC2 has been
           identified as a possible tumour suppressor (AZU-1). The
           functional homologue (Alp7) in Schizosaccharomyces pombe
           has been shown to be required for organisation of
           bipolar spindles.
          Length = 207

 Score = 30.5 bits (69), Expect = 2.1
 Identities = 32/135 (23%), Positives = 63/135 (46%), Gaps = 4/135 (2%)

Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQK 531
           + + DA LK+ + +  S   + E L+KK        LE  +I  +     AQ +E  +++
Sbjct: 4   QKDMDAALKKIKREVISSELESEELKKKYEKLRSENLEMGKIVDEFEKTIAQMIEESQKQ 63

Query: 532 EEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKE 591
           +E  K+ +QE  AE+       +SL+     L K+ +K   ++    E  K  +   +K 
Sbjct: 64  KELSKRELQEVLAEKDQAYADLNSLETSFSDLFKRYEKYKEVI----EGYKKNEETLKKC 119

Query: 592 IEGFLDNVRQITKEY 606
            + +LD +++  + Y
Sbjct: 120 AQEYLDRLKKEEQRY 134



 Score = 29.7 bits (67), Expect = 3.7
 Identities = 44/200 (22%), Positives = 91/200 (45%), Gaps = 22/200 (11%)

Query: 416 NQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPEL-DETEPCVEPSEVIEDE 474
           +Q   DA L K ++E+   + + EE      +         ++ DE E  +  +++IE+ 
Sbjct: 3   SQKDMDAALKKIKREVISSELESEELKKKYEKLRSENLEMGKIVDEFEKTI--AQMIEES 60

Query: 475 TDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK--AEIQQQLLDAAAQELEHRKQKE 532
                ++ +   +  ++++ L +K       N LE   +++ ++  +   + +E  K+ E
Sbjct: 61  -----QKQKELSKRELQEV-LAEKDQAYADLNSLETSFSDLFKRY-EKYKEVIEGYKKNE 113

Query: 533 EELKKNIQEKDAERIDMEEKYSSLQ----EESEGLSKKLKKVSSMLLAAKEERKDVQREN 588
           E LKK  QE        E++Y +L+    E+ E  ++++ +V S    AK E   +Q   
Sbjct: 114 ETLKKCAQEYLDRLKKEEQRYQALKAHAEEKLEIANEEIAQVRS---KAKAETAALQASL 170

Query: 589 RKE---IEGFLDNVRQITKE 605
           RKE   ++   + + Q  KE
Sbjct: 171 RKEQMKVQSLEETLEQKNKE 190


>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH.  This
           bacillus clade of FliH proteins is not found by the Pfam
           FliH model pfam02108, but is closely related to the
           sequences identified by that model. Sequences identified
           by this model are observed in flagellar operons in an
           analogous position relative to other flagellar operon
           genes.
          Length = 255

 Score = 30.9 bits (70), Expect = 2.1
 Identities = 15/76 (19%), Positives = 38/76 (50%), Gaps = 4/76 (5%)

Query: 497 QKKVLVGGENLLEKAEIQQQLLDAAAQE---LEHRKQKEEEL-KKNIQEKDAERIDMEEK 552
           Q++ ++    + +  E +Q+L D   +    LE  + +  ++ ++   +  A R  +E++
Sbjct: 15  QERGIIPLRQVTDVPETEQELADEEQEFEQILEKAEAEAAQIIEQAEAQAAAIREQIEQE 74

Query: 553 YSSLQEESEGLSKKLK 568
            +  +EE E L ++ K
Sbjct: 75  RAQWEEERERLIQEAK 90


>gnl|CDD|237965 PRK15429, PRK15429, formate hydrogenlyase transcriptional activator
           FhlA; Provisional.
          Length = 686

 Score = 31.3 bits (71), Expect = 2.1
 Identities = 26/71 (36%), Positives = 33/71 (46%), Gaps = 18/71 (25%)

Query: 314 DLAGSERQAKTGASGQRLKE---ASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRIL 370
           DL G ER A TGAS QR+     A K +L L  +G++   L             KL R+L
Sbjct: 446 DLFGHERGAFTGASAQRIGRFELADKSSLFLDEVGDMPLEL-----------QPKLLRVL 494

Query: 371 QDS----LGGN 377
           Q+     LG N
Sbjct: 495 QEQEFERLGSN 505


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 31.0 bits (70), Expect = 2.2
 Identities = 27/147 (18%), Positives = 57/147 (38%), Gaps = 5/147 (3%)

Query: 436 KKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLEL 495
                      +++  P  + E +  +   + +E  E +  AE  R +   Q    +   
Sbjct: 45  PGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAA 104

Query: 496 LQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSS 555
            Q +     +   +  E Q+Q  +A A++    K K E   +   +++A++   EE  + 
Sbjct: 105 KQAE-----QAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAK 159

Query: 556 LQEESEGLSKKLKKVSSMLLAAKEERK 582
              E++  + + KK +     AK E K
Sbjct: 160 AAAEAKKKAAEAKKKAEAEAKAKAEAK 186


>gnl|CDD|216249 pfam01025, GrpE, GrpE. 
          Length = 165

 Score = 30.3 bits (69), Expect = 2.2
 Identities = 13/61 (21%), Positives = 32/61 (52%)

Query: 541 EKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVR 600
           E+  E  ++E++  +L+EE E L ++++++   LL    E ++ ++   +E E       
Sbjct: 1   EEKEEEEELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTEREREEAKKYAI 60

Query: 601 Q 601
           +
Sbjct: 61  E 61



 Score = 28.3 bits (64), Expect = 8.1
 Identities = 9/41 (21%), Positives = 24/41 (58%)

Query: 529 KQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
           + +EE L++ ++E + E  +++++   L  E E   K+ ++
Sbjct: 10  EDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTER 50


>gnl|CDD|234605 PRK00059, prsA, peptidylprolyl isomerase; Provisional.
          Length = 336

 Score = 30.8 bits (70), Expect = 2.3
 Identities = 14/92 (15%), Positives = 41/92 (44%), Gaps = 9/92 (9%)

Query: 504 GENLLEKAEIQQQLLDAAAQELEHR------KQKEEELKKNIQEKDAE-RIDMEEKYSSL 556
           G+N  +  ++++Q+     Q L+         QK +ELK  +   + E   ++++K + +
Sbjct: 69  GDNYEKNEQVKEQIKQQKEQILDSLITEKVLLQKAKELK--LIPSEEELNKEVDKKINEI 126

Query: 557 QEESEGLSKKLKKVSSMLLAAKEERKDVQREN 588
           +++     ++ ++        +E  K+  +  
Sbjct: 127 KKQFNNDEEQFEEALKATGFTEETFKEYLKNQ 158


>gnl|CDD|227367 COG5034, TNG2, Chromatin remodeling protein, contains PhD zinc
           finger [Chromatin structure and dynamics].
          Length = 271

 Score = 30.7 bits (69), Expect = 2.3
 Identities = 24/100 (24%), Positives = 36/100 (36%), Gaps = 9/100 (9%)

Query: 464 CVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQ 523
             +  + +        K    D Q+  E  + L K++      LL KA   Q+     A 
Sbjct: 35  VCDIIKNLRQMISILKKIIDLDSQTYEEVEDGLLKEI----RELLLKAIYIQKEKSDLAD 90

Query: 524 ELE-----HRKQKEEELKKNIQEKDAERIDMEEKYSSLQE 558
             E     HRK  ++ + K   EK A RI+      S  E
Sbjct: 91  RAEKLLRRHRKLLDDRIAKRPHEKVAARIENCHDAVSRLE 130


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 31.2 bits (70), Expect = 2.3
 Identities = 31/114 (27%), Positives = 45/114 (39%), Gaps = 7/114 (6%)

Query: 446 PREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGE 505
             E      EPE +E E     S    +      K   +++       E  Q    V   
Sbjct: 144 AEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQ----VEAG 199

Query: 506 NLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEE 559
              EK + +QQ    AA ELE  K+K EE +K ++E++  R   E    S +EE
Sbjct: 200 KEFEKLKQKQQ---EAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEE 250


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 29.6 bits (67), Expect = 2.4
 Identities = 26/118 (22%), Positives = 55/118 (46%), Gaps = 4/118 (3%)

Query: 476 DAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEEL 535
           + E    Q++ + ++EK E  ++K+    + L E  E Q ++ + A Q+ E    K  E 
Sbjct: 2   EEEKSSLQAELKRLLEKKEDAEEKI----QKLQEDLEEQAEIANEAQQKYERELVKHAED 57

Query: 536 KKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIE 593
            + +Q    +  +++++ + L+ E+E    +L +        K+  +D   E  K IE
Sbjct: 58  IEELQALRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDELSELEKRIE 115



 Score = 29.2 bits (66), Expect = 3.3
 Identities = 25/125 (20%), Positives = 52/125 (41%), Gaps = 22/125 (17%)

Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKE 532
               AELKR     +   EK++ LQ+ +           E Q ++ + A Q+ E    K 
Sbjct: 6   SSLQAELKRLLEKKEDAEEKIQKLQEDL-----------EEQAEIANEAQQKYERELVKH 54

Query: 533 EELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAK----EERKDVQREN 588
            E  + +Q        + ++ + L++E   L  + +   + L  A+    E++K ++ E 
Sbjct: 55  AEDIEELQA-------LRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDEL 107

Query: 589 RKEIE 593
            +  +
Sbjct: 108 SELEK 112


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 31.2 bits (71), Expect = 2.5
 Identities = 24/87 (27%), Positives = 46/87 (52%), Gaps = 11/87 (12%)

Query: 508 LEKAEIQQQLLDAAAQELEHRKQKEE----ELKKNIQEKDAERIDMEEKYSSLQEESEGL 563
           LE  +I+ +  +AAA ELE +++  +    E K+ + E  AE         + Q E+  L
Sbjct: 358 LEDLQIELERANAAASELEKKQKNFDKILAEWKRKVDELQAEL-------DTAQREARNL 410

Query: 564 SKKLKKVSSMLLAAKEERKDVQRENRK 590
           S +L ++ + L   K++ + ++REN+ 
Sbjct: 411 STELFRLKNELEELKDQVEALRRENKN 437



 Score = 30.0 bits (68), Expect = 5.5
 Identities = 29/114 (25%), Positives = 54/114 (47%), Gaps = 10/114 (8%)

Query: 477 AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK 536
           A+L++ + D  S+ E+LE          E    KAE+++QL  A A+  + R + E E  
Sbjct: 264 AQLRQLEHDLDSLREQLE----------EESEAKAELERQLSKANAEIQQWRSKFESEGA 313

Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRK 590
              +E +  +  + +K S L+E +E  + K   +       + E +D+Q E  +
Sbjct: 314 LRAEELEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELER 367


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 31.0 bits (70), Expect = 2.5
 Identities = 25/86 (29%), Positives = 44/86 (51%), Gaps = 2/86 (2%)

Query: 510 KAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
           KA  ++ LL+A  +  ++R++ E+E K   QE       + E+ +SL  + E LS K K 
Sbjct: 67  KALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTERATSLDRKDENLSSKEKT 126

Query: 570 VSS--MLLAAKEERKDVQRENRKEIE 593
           + S    L  K +  D + E  +++E
Sbjct: 127 LESKEQSLTDKSKHIDEREEQVEKLE 152


>gnl|CDD|163611 pfam10046, BLOC1_2, Biogenesis of lysosome-related organelles
           complex-1 subunit 2.  Members of this family of proteins
           play a role in cellular proliferation, as well as in the
           biogenesis of specialized organelles of the
           endosomal-lysosomal system.
          Length = 99

 Score = 28.8 bits (65), Expect = 2.7
 Identities = 29/105 (27%), Positives = 49/105 (46%), Gaps = 11/105 (10%)

Query: 471 IEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQ--QLLDAAAQELEHR 528
            E   +   K  +++ +  VE L+LL+K       N     + ++  Q       E+E  
Sbjct: 1   AEKMFNKISKYVEAELEISVEDLKLLEK------MNENTALKYKKMRQSAKGLRVEIEAL 54

Query: 529 KQKEEELKKNIQEKDA--ERID-MEEKYSSLQEESEGLSKKLKKV 570
            QK EELK  +Q+ DA  +++  +EE    L E S+ L  K+K +
Sbjct: 55  NQKYEELKPYLQQIDAIEQQVTTLEEAVYELDEYSKELESKVKSL 99


>gnl|CDD|203446 pfam06428, Sec2p, GDP/GTP exchange factor Sec2p.  In Saccharomyces
           cerevisiae, Sec2p is a GDP/GTP exchange factor for
           Sec4p, which is required for vesicular transport at the
           post-Golgi stage of yeast secretion.
          Length = 90

 Score = 28.8 bits (65), Expect = 2.7
 Identities = 15/55 (27%), Positives = 32/55 (58%), Gaps = 1/55 (1%)

Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKL-KKVSSMLLAAKEERKDVQRENRK 590
            ++ E+  +R+  E++   L++E E L+  L  + + M+  A+ ER+ V+ +N K
Sbjct: 1   ADLAEEKKKRLRAEKEKKKLEQEVEDLTASLFDEANKMVADARREREAVEIKNEK 55


>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 557

 Score = 31.0 bits (71), Expect = 2.7
 Identities = 28/109 (25%), Positives = 48/109 (44%), Gaps = 16/109 (14%)

Query: 482 AQSDHQSMVEKLELLQKKVL--VGGENLLEKAEIQQ--QLLDAAAQELEHRKQKEEELKK 537
            Q +HQS+++    LQ+++L    G   L +   Q+  Q    A +ELE  ++KE E  +
Sbjct: 128 GQHEHQSLLK--PELQRQLLDAFAGLEELAQEAYQEAYQAWKQARRELEDLQEKERERAQ 185

Query: 538 -------NIQEKDAERIDMEEKYSSLQEESEGLSK--KLKKVSSMLLAA 577
                   ++E +   +   E    L+EE + LS   KL +     L  
Sbjct: 186 RADLLQFQLEELEELNLQPGE-DEELEEERKRLSNSEKLAEAIQNALEL 233


>gnl|CDD|112890 pfam04094, DUF390, Protein of unknown function (DUF390).  This is a
           family of long proteins currently only found in the rice
           genome. They have no known function. However they may be
           some kind of transposable element.
          Length = 843

 Score = 30.9 bits (69), Expect = 2.7
 Identities = 40/174 (22%), Positives = 75/174 (43%), Gaps = 25/174 (14%)

Query: 415 VNQDPKDALLIKYQKEISELKKKLEEA----SVPPPREEPTPNPEPELDETEPCVEPSEV 470
           V  + +D L ++Y    +EL+KKL+ A         RE+     E    + E  +E   +
Sbjct: 475 VLNEAQDDLRLQYGSRAAELEKKLDAAQGVLDAAAAREQRAAENEAASRQREEALEARAM 534

Query: 471 IEDETDAELKRAQSDHQSMVEKLEL-----------------LQKKVLVGGENLLEKAEI 513
             +E     +R  +D ++ V   E                  L++  L   E  LE+AE 
Sbjct: 535 ALEERACAKERDLADREAAVAIREATLAAHEAACAEEEFALRLREDALTERERALEEAEA 594

Query: 514 QQQLLDAAAQELEHRKQ-KEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKK 566
             Q L   A  L  R+  +EE+ ++N++   AER  ++++ + L+  ++ L  +
Sbjct: 595 AAQQL---ADSLFLREAAQEEQARRNLEGARAERAALDQRAAELEARAKELDAR 645


>gnl|CDD|227318 COG4985, COG4985, ABC-type phosphate transport system, auxiliary
           component [Inorganic ion transport and metabolism].
          Length = 289

 Score = 30.7 bits (69), Expect = 2.7
 Identities = 18/107 (16%), Positives = 52/107 (48%), Gaps = 12/107 (11%)

Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR 528
           E+  D  + EL++   + +++ ++++ + ++ +    + LE+  ++           + R
Sbjct: 156 ELDGDPLERELQKRLLEVETLRDQVDKMVEQQVRVINSQLERLRLE-----------KRR 204

Query: 529 KQKEEELKKNIQEK-DAERIDMEEKYSSLQEESEGLSKKLKKVSSML 574
            Q   +L    Q+   AE+ +++++ + LQ E + L  +L++    L
Sbjct: 205 LQLNGQLDDEFQQHYVAEKSELQKRLAQLQTELDALRAELERQFLYL 251


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 30.6 bits (70), Expect = 2.8
 Identities = 14/88 (15%), Positives = 40/88 (45%), Gaps = 1/88 (1%)

Query: 531 KEEELKKNIQEKDAERI-DMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR 589
           +E+  K+     D +++ +++E+   L  E E L  +  ++S  +  A +  +D   E  
Sbjct: 15  REKLKKRGGDALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELI 74

Query: 590 KEIEGFLDNVRQITKEYSYQQAIIQKFI 617
            E++   + ++++       +A +   +
Sbjct: 75  AEVKELKEKLKELEAALDELEAELDTLL 102


>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
           Provisional.
          Length = 569

 Score = 31.0 bits (71), Expect = 2.8
 Identities = 28/103 (27%), Positives = 54/103 (52%), Gaps = 8/103 (7%)

Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR 528
           +++E E  A  K  + +  ++ + LE  +++     + L E+ +  +Q       ELE  
Sbjct: 296 DILEREVKAR-KYVEKNSDTLPDFLEHAKEQN----KELKEEIDRVKQSYTLNESELESV 350

Query: 529 KQKEEELK--KNIQEKDAERIDMEEK-YSSLQEESEGLSKKLK 568
           +Q E++L+  +   ++  ERI  +E  YS LQEE E + K+L+
Sbjct: 351 RQLEKQLESLEKQYDEITERIAEQEIAYSELQEELEEILKQLE 393


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
           coiled-coil proteins that are involved in pre-rRNA
           processing.
          Length = 105

 Score = 28.9 bits (65), Expect = 2.9
 Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 9/69 (13%)

Query: 523 QELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERK 582
            E +  K +E+ELK    EK+AER   + +  +++E       K +K     +AAK   K
Sbjct: 38  LEQQAIKAREKELK---DEKEAER---QRRIQAIKERRA---AKEEKERYEKMAAKMHAK 88

Query: 583 DVQRENRKE 591
            V+R  R+E
Sbjct: 89  KVERLKRRE 97


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 29.5 bits (65), Expect = 2.9
 Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 1/40 (2%)

Query: 439 EEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAE 478
           EE   P P EE    PEPE +E E   EP E  E+E + E
Sbjct: 65  EEEEEPEPEEEGEEEPEPE-EEGEEEPEPEETGEEEPEPE 103



 Score = 28.0 bits (61), Expect = 9.7
 Identities = 12/40 (30%), Positives = 13/40 (32%), Gaps = 3/40 (7%)

Query: 439 EEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAE 478
           EE     P  E     EP   E E   E     E E + E
Sbjct: 73  EEEGEEEPEPEEEGEEEP---EPEETGEEEPEPEPEPEPE 109


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 30.5 bits (69), Expect = 3.0
 Identities = 16/79 (20%), Positives = 38/79 (48%), Gaps = 4/79 (5%)

Query: 466 EPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQEL 525
                     D++L   Q + +++  ++E L  ++    E +  K +  Q+ +D +  E+
Sbjct: 27  ALLSDKIQNQDSKLSELQKEKKNIQNEIESLDNQI----EEIQSKIDELQKEIDQSKAEI 82

Query: 526 EHRKQKEEELKKNIQEKDA 544
           +  +++  ELK+NI E+  
Sbjct: 83  KKLQKEIAELKENIVERQE 101



 Score = 29.3 bits (66), Expect = 6.7
 Identities = 14/78 (17%), Positives = 38/78 (48%), Gaps = 3/78 (3%)

Query: 510 KAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
            A +    +     +L   +++++ ++  I+  D +  +++ K   LQ+E +    ++KK
Sbjct: 25  FAALLSDKIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKK 84

Query: 570 VSSMLLAAKE---ERKDV 584
           +   +   KE   ER+++
Sbjct: 85  LQKEIAELKENIVERQEL 102


>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
           in cytoskeletal structure; family members include
           spectrin, alpha-actinin and dystrophin; the spectrin
           repeat forms a three helix bundle with the second helix
           interrupted by proline in some sequences; the repeats
           are independent folding units; tandem repeats are found
           in differing numbers and arrange in an antiparallel
           manner to form dimers; the repeats are defined by a
           characteristic tryptophan (W) residue in helix A and a
           leucine (L) at the carboxyl end of helix C and separated
           by a linker of 5 residues; two copies of the repeat are
           present here.
          Length = 213

 Score = 30.1 bits (68), Expect = 3.1
 Identities = 26/139 (18%), Positives = 62/139 (44%), Gaps = 5/139 (3%)

Query: 458 LDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQ---KKVLVGGENLLEKAEIQ 514
           L E E  +  ++  +D    E    +  H+++  +L   +   + +   GE L+E+    
Sbjct: 16  LSEKEELLSSTDYGDDLESVEALLKK--HEALEAELAAHEERVEALNELGEQLIEEGHPD 73

Query: 515 QQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSML 574
            + +    +EL  R ++  EL +  +++  E +D+++ +    +  + L +K   ++S  
Sbjct: 74  AEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASED 133

Query: 575 LAAKEERKDVQRENRKEIE 593
           L    E  +   +  KE+E
Sbjct: 134 LGKDLESVEELLKKHKELE 152


>gnl|CDD|220381 pfam09753, Use1, Membrane fusion protein Use1.  This entry is of a
           family of proteins all approximately 300 residues in
           length. The proteins have a single C-terminal
           trans-membrane domain and a SNARE [soluble NSF
           (N-ethylmaleimide-sensitive fusion protein) attachment
           protein receptor] domain of approximately 60 residues.
           The SNARE domains are essential for membrane fusion and
           are conserved from yeasts to humans. Use1 is one of the
           three protein subunits that make up the SNARE complex
           and it is specifically required for Golgi-endoplasmic
           reticulum retrograde transport.
          Length = 251

 Score = 30.1 bits (68), Expect = 3.2
 Identities = 32/155 (20%), Positives = 61/155 (39%), Gaps = 16/155 (10%)

Query: 419 PKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAE 478
           P   +L +Y++ +  LK  LE      P E      +   ++         V E  + A 
Sbjct: 50  PSVDVLTEYRRRVDFLKGLLEAEKPSSPSE------KALANQKLSRGRAPTVDEPRSPAS 103

Query: 479 LK-RAQSDHQSMVE-KLELLQKKVLVGGENLLEKAEIQQQLLD--AAAQELEHRKQKEEE 534
            +   Q+  +   E + ELL     +      +  +    + D  A+A  ++   Q  + 
Sbjct: 104 KEIHQQTKSRYTSELRKELLSGSTSLRRSTDAKDRDSSGPINDDEASAASIDQVLQYHQ- 162

Query: 535 LKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
              N+QEK AE  +M     +L+E+S    + +K+
Sbjct: 163 ---NLQEKLAE--EMLLLARNLKEQSLAAQQIIKE 192


>gnl|CDD|151039 pfam10473, Cenp-F_leu_zip, Leucine-rich repeats of kinetochore
           protein Cenp-F/LEK1.  Cenp-F, a centromeric kinetochore,
           microtubule-binding protein consisting of two
           1,600-amino acid-long coils, is essential for the full
           functioning of the mitotic checkpoint pathway. There are
           several leucine-rich repeats along the sequence of LEK1
           that are considered to be zippers, though they do not
           appear to be binding DNA directly in this instance.
          Length = 140

 Score = 29.4 bits (66), Expect = 3.3
 Identities = 25/93 (26%), Positives = 49/93 (52%), Gaps = 11/93 (11%)

Query: 477 AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK 536
            +LK ++ +   + +K+E L+++        LE +E  Q+L+     E E+ K + E LK
Sbjct: 10  EKLKESEREADLLKDKVENLERE--------LEMSEENQELVIL---EAENSKAEVETLK 58

Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
             I+E      D+E    +++ E E L+K+L++
Sbjct: 59  TKIEEMAESLKDLELDLVTVRSEKENLTKQLQE 91


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 30.2 bits (69), Expect = 3.3
 Identities = 20/82 (24%), Positives = 34/82 (41%), Gaps = 22/82 (26%)

Query: 510 KAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAER-----------IDMEEKYSSLQE 558
           K E++  LLD    +  H+ QK++E ++    K   +           ID  E   SL+E
Sbjct: 53  KEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPRLFVLDFKGDIDASE-VESLRE 111

Query: 559 ESEGLSKKLKKVSSMLLAAKEE 580
           E          ++++L  AK  
Sbjct: 112 E----------ITAILAVAKPG 123


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 30.5 bits (69), Expect = 3.4
 Identities = 27/122 (22%), Positives = 55/122 (45%), Gaps = 10/122 (8%)

Query: 466 EPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQEL 525
           E  +  E +   EL++ Q+  Q  +++LE  +++        L   E ++Q  +AA Q  
Sbjct: 79  EQRKKKEQQQAEELQQKQAAEQERLKQLE--KER--------LAAQEQKKQAEEAAKQAA 128

Query: 526 EHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQ 585
             +KQ EE   K      A+     ++ ++  +++   +KK  +  +   AA E +K  +
Sbjct: 129 LKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAE 188

Query: 586 RE 587
            E
Sbjct: 189 AE 190



 Score = 29.8 bits (67), Expect = 5.2
 Identities = 29/126 (23%), Positives = 52/126 (41%), Gaps = 18/126 (14%)

Query: 465 VEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQE 524
           V+P  V+E     +  R Q   +S     E  +KK     E L +K   +Q+ L    ++
Sbjct: 55  VDPGAVVE-----QYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERL----KQ 105

Query: 525 LEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDV 584
           LE  +   +E KK  Q ++A +    +     Q+++E  + K    +     A+ E K  
Sbjct: 106 LEKERLAAQEQKK--QAEEAAKQAALK-----QKQAEEAAAKAAAAAK--AKAEAEAKRA 156

Query: 585 QRENRK 590
               +K
Sbjct: 157 AAAAKK 162


>gnl|CDD|227780 COG5493, COG5493, Uncharacterized conserved protein containing a
           coiled-coil domain [Function unknown].
          Length = 231

 Score = 30.2 bits (68), Expect = 3.4
 Identities = 20/96 (20%), Positives = 41/96 (42%), Gaps = 6/96 (6%)

Query: 507 LLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKK 566
           LLE  EI   +L    + L     K    ++   ++D E +  E +    +   E L  +
Sbjct: 11  LLENPEILVDVLTQRPEILYEVLAKLTPWQQLATKQDVEELRKETEQRQKELADEKLEVR 70

Query: 567 LKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQI 602
            +K +      KE+ K +QR   +E     ++++++
Sbjct: 71  KQKAT------KEDLKLLQRFQEEEFRATKEDIKRL 100


>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
           binding.  This is the middle region of a family of TATA
           element modulatory factor 1 proteins conserved in
           eukaryotes that contains at its N-terminal section a
           number of leucine zippers that could potentially form
           coiled coil structures. The whole proteins bind to the
           TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells.
          Length = 74

 Score = 28.0 bits (63), Expect = 3.6
 Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 10/63 (15%)

Query: 507 LLEKAEIQQQLL----DAAAQELEHR------KQKEEELKKNIQEKDAERIDMEEKYSSL 556
           L EK E   QL+      + +EL+H       + K +EL+K I E   +   +E++  +L
Sbjct: 7   LAEKDEQIAQLMEEGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELENL 66

Query: 557 QEE 559
           +E 
Sbjct: 67  EER 69


>gnl|CDD|235395 PRK05298, PRK05298, excinuclease ABC subunit B; Provisional.
          Length = 652

 Score = 30.4 bits (70), Expect = 3.9
 Identities = 9/11 (81%), Positives = 10/11 (90%)

Query: 138 GQTGTGKTFTM 148
           G TG+GKTFTM
Sbjct: 39  GVTGSGKTFTM 49


>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
           exopolysaccharide biosynthesis [Cell envelope
           biogenesis, outer membrane].
          Length = 458

 Score = 30.1 bits (68), Expect = 3.9
 Identities = 30/155 (19%), Positives = 63/155 (40%), Gaps = 16/155 (10%)

Query: 468 SEVIEDETDAELKRAQSDHQSMVE-KLELLQKKVLVGGENLLEKAEIQQ-QLLDAAAQEL 525
            +++    +A   R   +  ++ + + +  Q +  +   +    A+  Q   L+A   EL
Sbjct: 266 LQLLPLGREAAALREVLESPTIQDLRQQYAQVRQQIADLSTELGAKHPQLVALEAQLAEL 325

Query: 526 EHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQ 585
             R+Q   EL++ +     E   +E++ ++L++E   L  +L K+    L  +    + +
Sbjct: 326 --RQQIAAELRQILASLPNELALLEQQEAALEKELAQLKGRLSKLPK--LQVQLRELERE 381

Query: 586 RE-NRKEIEGFLDNVRQITKEYSYQQAIIQKFIPT 619
            E  R   E  L           YQ+  IQ+  P 
Sbjct: 382 AEAARSLYETLLQ---------RYQELSIQEASPI 407


>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 29.8 bits (67), Expect = 4.0
 Identities = 19/69 (27%), Positives = 32/69 (46%), Gaps = 5/69 (7%)

Query: 494 ELLQKKVLVGGENLLEKAEIQQQLLDAA---AQELEHRKQKEEELKKNIQEKDAERIDME 550
           E+L++    GG+    +A+     L A    AQ L    QK +E    + +K  E  +  
Sbjct: 126 EMLEENAKKGGKIDKMEADSTD--LKARLRKAQILLEGLQKNQEELFKLLDKYNELREQV 183

Query: 551 EKYSSLQEE 559
           +K SS ++E
Sbjct: 184 QKESSKKKE 192


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 30.1 bits (68), Expect = 4.1
 Identities = 28/142 (19%), Positives = 59/142 (41%), Gaps = 20/142 (14%)

Query: 473 DETDAELKRAQSDHQSMVEKLELLQ------KKVLVGGENLLEKAEIQQQLLDAAAQELE 526
            + +  + + Q+   +  E+L  L       +  L+  +      E  +   D   ++L 
Sbjct: 332 PDAENNVAKLQALVVASSERLLELAQQWEAHRTPLI--DEYRSLKEKNRNKEDETQRQL- 388

Query: 527 HRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK---VSSMLLAAKEERKD 583
               + ++L+  I+E ++E    E+ Y  L +E E   K + +      +L   K  +K 
Sbjct: 389 ---DEIKKLRNKIEELESELQTKEQLYKQLLDEYENAPKSVSRSAYTRRILEIIKNIKK- 444

Query: 584 VQRENRKEIEGFLDNVRQITKE 605
            Q+E   +I+  L + R + KE
Sbjct: 445 -QKE---DIDKILSDTRSLQKE 462


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 30.4 bits (68), Expect = 4.2
 Identities = 23/113 (20%), Positives = 52/113 (46%), Gaps = 7/113 (6%)

Query: 509 EKAEIQQQLLDA------AAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEG 562
           E+  ++Q+L  A      A QEL    ++ ++L+  ++    +R  +E +  SLQ   + 
Sbjct: 117 EREAVRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQ 176

Query: 563 LSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQK 615
           L     ++ S +L  K     +++E +  +    +  +  T+E + + A  Q+
Sbjct: 177 LQASATQLKSQVLDLKLRSAQIEQEAQ-NLATRANAAQARTEELARRAAAAQQ 228


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 30.1 bits (68), Expect = 4.3
 Identities = 30/172 (17%), Positives = 59/172 (34%), Gaps = 35/172 (20%)

Query: 435 KKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEK-- 492
            + +EE S     +E   + + + D     V+  E  E+E    +     + +  V    
Sbjct: 416 DESVEEISDAEEDDEDEEDEDGDGDVEMSAVDNDEEKEEEDKEAIPSTILEEEPTVGGGL 475

Query: 493 ---LELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRK-QKEEELKKNIQEKDAERID 548
              L+LL+                 + +L     E E R+  KE+E  K + E       
Sbjct: 476 AAALKLLK----------------SRGILKKNQLERERREFLKEKERLKLLAEIRE---- 515

Query: 549 MEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVR 600
                  ++ E +    K  +   M    +EE    + + R + E +  +V+
Sbjct: 516 ------RIERERDRNDGKYSR---MSAREREEYARPENDQRDKEEAYKPDVK 558


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 30.3 bits (68), Expect = 4.3
 Identities = 36/204 (17%), Positives = 79/204 (38%), Gaps = 23/204 (11%)

Query: 412 KARVNQDPKDALLIKYQKEISELKKKLEEASVPPP--------REEPTPNPE---PELDE 460
           K +   + K+  +   Q  I EL K+L +  +           RE+ T   +    + D+
Sbjct: 334 KLKSEIELKEEEIKALQSNIDELHKQLRKQGISTEQFELMNQEREKLTRELDKINIQSDK 393

Query: 461 TEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQ----- 515
               V+  ++        L++    + S+++ +   + ++     +   K  I+Q     
Sbjct: 394 LTKSVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLKINIEQLFPKG 453

Query: 516 ----QLLDAAAQELEHRKQKE--EELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
               + +  +  EL    Q+    E  K+I  ++    +++   + L +  E L  +L +
Sbjct: 454 SGINESIKKSILELNDEIQERIKTEENKSITLEED-IKNLKHDINELTQILEKLELELSE 512

Query: 570 VSSMLLAAKEERKDVQRENRKEIE 593
            +S    +KEE +      R EIE
Sbjct: 513 ANSKFELSKEENERELVAQRIEIE 536


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 29.2 bits (66), Expect = 4.4
 Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 8/67 (11%)

Query: 476 DAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEE-- 533
           +AE++R Q+  + +  +LE L++      E  L +A+ +Q        E + + +KEE  
Sbjct: 63  EAEIERLQNTIERLKTQLEDLER------ELALLQAKERQLEKKLKTLEQKLKNEKEEVQ 116

Query: 534 ELKKNIQ 540
            LK  IQ
Sbjct: 117 RLKNIIQ 123


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 30.2 bits (68), Expect = 4.4
 Identities = 13/62 (20%), Positives = 25/62 (40%), Gaps = 4/62 (6%)

Query: 416 NQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTP----NPEPELDETEPCVEPSEVI 471
            +D K A+ +      + L +   +   PPP  EP        EP+  + +   E  ++ 
Sbjct: 248 EEDLKLAVELVLLPRATRLPEPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIP 307

Query: 472 ED 473
           E+
Sbjct: 308 EE 309



 Score = 29.4 bits (66), Expect = 7.5
 Identities = 22/86 (25%), Positives = 35/86 (40%), Gaps = 6/86 (6%)

Query: 422 ALLIKYQKEISELKKKLE-----EASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETD 476
           AL  + +    +LK  +E      A+  P  E   P P P  +  EP  EP E  + + D
Sbjct: 239 ALHGRTEVTEEDLKLAVELVLLPRATRLPEPEPQPPPPPPPPEPPEPEEEPDEPDQTDPD 298

Query: 477 AELKRAQSDHQSMVEKLE-LLQKKVL 501
              +  Q   + M + +E  L   +L
Sbjct: 299 DGEETDQIPEELMFDAVEADLPDNIL 324


>gnl|CDD|215918 pfam00435, Spectrin, Spectrin repeat.  Spectrin repeat-domains are
           found in several proteins involved in cytoskeletal
           structure. These include spectrin, alpha-actinin and
           dystrophin. The sequence repeat used in this family is
           taken from the structural repeat in reference. The
           spectrin domain- repeat forms a three helix bundle. The
           second helix is interrupted by proline in some
           sequences. The repeats are defined by a characteristic
           tryptophan (W) residue at position 17 in helix A and a
           leucine (L) at 2 residues from the carboxyl end of helix
           C. Although the domain occurs in ultiple repeats along
           sequences, the domains are actually stable on their own
           - ie they act, biophysically, like domains rather than
           repeats that along function when aggregated.
          Length = 105

 Score = 28.4 bits (64), Expect = 4.4
 Identities = 21/127 (16%), Positives = 49/127 (38%), Gaps = 32/127 (25%)

Query: 423 LLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRA 482
           LL ++ ++  +L+  +EE              E  L   +   +   V            
Sbjct: 2   LLQQFFRDADDLESWIEEK-------------EALLSSEDYGKDLESV------------ 36

Query: 483 QSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK---KNI 539
               Q++++K + L+ ++    + +    E+ ++L+       E  +++ EEL    + +
Sbjct: 37  ----QALLKKHKALEAELAAHQDRVEALNELAEKLIAEGHYASEEIQERLEELNERWEQL 92

Query: 540 QEKDAER 546
            E  AER
Sbjct: 93  LELAAER 99


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 30.0 bits (68), Expect = 4.5
 Identities = 38/182 (20%), Positives = 73/182 (40%), Gaps = 39/182 (21%)

Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQ-- 530
            E    + + +   + +   LE L   +    E + E  E  ++LL+   + +   KQ  
Sbjct: 295 SEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNK-ISTNKQSL 353

Query: 531 -----KEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQ 585
                K +++K  I+E  AE +D  E+ + LQ+E + + K   ++         + K   
Sbjct: 354 ITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTKSEL--------VKEKY-- 403

Query: 586 RENRKEIEGFLDNVRQITKEYSYQQAIIQKFIP------TKYLELM------TQNATWNE 633
                        V  + K+   + +II+K+IP        YL++M      T +  +NE
Sbjct: 404 ---------HRGIVTDLLKDSGIKASIIKKYIPYFNKQINHYLQIMEADYNFTLDEEFNE 454

Query: 634 DI 635
            I
Sbjct: 455 TI 456


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 30.4 bits (68), Expect = 4.5
 Identities = 17/51 (33%), Positives = 22/51 (43%), Gaps = 5/51 (9%)

Query: 439 EEASVPPPREEPTP--NPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQ 487
           E     P   E  P   PEP ++ET+P   P    E   + E KRA+   Q
Sbjct: 475 EPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFE---EVEEKRAREREQ 522


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
           sigma54-dependent transcription [Transcription / Signal
           transduction mechanisms].
          Length = 225

 Score = 29.6 bits (67), Expect = 4.5
 Identities = 28/129 (21%), Positives = 63/129 (48%), Gaps = 12/129 (9%)

Query: 474 ETDAELKRAQSDHQSMVEKLEL--------LQKKVLVGGENLLEKAEIQQQLLDAAAQEL 525
           + + +L+ AQ+  + + EK EL        L ++ L   ++L + A+  +  L  A +++
Sbjct: 56  QLERKLEEAQARAEKLEEKAELALQAGNEDLAREALEEKQSLEDLAKALEAELQQAEEQV 115

Query: 526 EHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSK---KLKKVSSMLLAAKEERK 582
           E  K++   L++ I E  A++  ++ + ++  +  E +++        S+M    + E K
Sbjct: 116 EKLKKQLAALEQKIAELRAKKEALKARKAA-AKAQEKVNRSLGGGSSSSAMAAFERMEEK 174

Query: 583 DVQRENRKE 591
             +RE R E
Sbjct: 175 IEEREARAE 183


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 30.0 bits (68), Expect = 4.6
 Identities = 23/94 (24%), Positives = 48/94 (51%), Gaps = 7/94 (7%)

Query: 508 LEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKL 567
           L + EI++ + +A A   E +K+KEE   +N    D+     E+   +L+E  + L  + 
Sbjct: 498 LSEEEIERMVKEAEANAEEDKKRKEEIEARN--NADSLAYQAEK---TLKEAGDKLPAEE 552

Query: 568 KKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQ 601
           K+      A  E ++ ++ E+ +EI+   + ++Q
Sbjct: 553 KEKIEK--AVAELKEALKGEDVEEIKAKTEELQQ 584


>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin. 
          Length = 237

 Score = 29.5 bits (67), Expect = 4.7
 Identities = 42/139 (30%), Positives = 64/139 (46%), Gaps = 31/139 (22%)

Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKE 532
           ++ +A+LK A+   +    K E + +K LV  E  LE+AE + +  ++   EL      E
Sbjct: 95  EQLEAQLKEAKEIAEEADRKYEEVARK-LVVVEGDLERAEERAEAAESKIVEL------E 147

Query: 533 EELK---KNIQ------EKDAERID-MEEKYSSLQ----------EESEGLSKKLKKVSS 572
           EELK    N++      EK ++R D  EEK   L           E +E   +KL+K   
Sbjct: 148 EELKVVGNNLKSLEVSEEKASQREDSYEEKIRDLTEKLKEAETRAEFAERSVQKLEKEVD 207

Query: 573 ML----LAAKEERKDVQRE 587
            L    LA KE+ K +  E
Sbjct: 208 RLEDELLAEKEKYKAISEE 226


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 30.1 bits (68), Expect = 4.8
 Identities = 21/110 (19%), Positives = 40/110 (36%), Gaps = 18/110 (16%)

Query: 477 AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK 536
           AEL    S+ ++   KL  L ++              +++ L     EL   ++K EEL+
Sbjct: 185 AELTTLLSEQRAQQAKLAQLLEE--------------RKKTLAQLNSELSADQKKLEELR 230

Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQR 586
            N      E    E   +  +E +       +  ++   AA+ +R     
Sbjct: 231 ANESRLKNEIASAEAAAAKAREAAA----AAEAAAARARAAEAKRTGETY 276



 Score = 28.9 bits (65), Expect = 9.8
 Identities = 13/85 (15%), Positives = 35/85 (41%), Gaps = 6/85 (7%)

Query: 512 EIQQQL--LDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
              +++  L A  ++L   + +    +  +    +E+   + K + L EE +    +L  
Sbjct: 158 ARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNS 217

Query: 570 VSSMLLAAKEERKDVQRENRKEIEG 594
                L+A +++ +  R N   ++ 
Sbjct: 218 ----ELSADQKKLEELRANESRLKN 238


>gnl|CDD|180481 PRK06231, PRK06231, F0F1 ATP synthase subunit B; Validated.
          Length = 205

 Score = 29.4 bits (66), Expect = 4.8
 Identities = 27/105 (25%), Positives = 51/105 (48%), Gaps = 15/105 (14%)

Query: 506 NLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAER----------IDMEEKYSS 555
            LLE A+ + +   A A+E+  +   E    K+  EK+A R           ++E++   
Sbjct: 100 QLLENAKQRHENALAQAKEIIDQANYEALQLKSELEKEANRQANLIIFQARQEIEKERRE 159

Query: 556 LQEESEGLSKKLKKVSSMLLAAKEERKDVQRE-NRKEIEGFLDNV 599
           L+E+ +  S +L    +ML A +  +K V RE + K ++ F+  +
Sbjct: 160 LKEQLQKESVEL----AMLAAEELIKKKVDREDDDKLVDEFIREL 200


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 30.0 bits (68), Expect = 4.8
 Identities = 29/139 (20%), Positives = 61/139 (43%), Gaps = 9/139 (6%)

Query: 475 TDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLD----AAAQEL----- 525
              +L+R Q++ QS+ +KLEL   +     E  L + + Q + L     A  + L     
Sbjct: 654 ARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFR 713

Query: 526 EHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQ 585
           E R ++  + +    E D +   +     + + +++   K+LKK     LA+ +   +  
Sbjct: 714 ELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTV 773

Query: 586 RENRKEIEGFLDNVRQITK 604
           +E +++IE     + +I  
Sbjct: 774 KELKRQIEELETTIERIAV 792


>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase.  The model describes
           alanine--tRNA ligase. This enzyme catalyzes the reaction
           (tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
           pyrophosphate + AMP) [Protein synthesis, tRNA
           aminoacylation].
          Length = 851

 Score = 30.0 bits (68), Expect = 4.9
 Identities = 26/106 (24%), Positives = 43/106 (40%), Gaps = 12/106 (11%)

Query: 487 QSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAER 546
               +KL+ L   + V    L +K E   +   A  +ELE  K K  + K +  +  AE 
Sbjct: 704 NEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKELESLKLKIADQKIDELKDKAET 763

Query: 547 I----------DMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERK 582
           I          D ++   SL+  ++ L  KL   +  +LA+    K
Sbjct: 764 INGVKVLVEVVDAKDM-KSLKTMADRLKSKLGS-AIYVLASFANGK 807


>gnl|CDD|204444 pfam10319, 7TM_GPCR_Srj, Serpentine type 7TM GPCR chemoreceptor
           Srj.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srj is part of the Str
           superfamily of chemoreceptors. The srj family is
           designated as the out-group based on its location in
           preliminary phylogenetic analyses of the entire
           superfamily. Chemoperception is one of the central
           senses of soil nematodes like C. elegans which are
           otherwise 'blind' and 'deaf'.
          Length = 310

 Score = 29.8 bits (68), Expect = 4.9
 Identities = 8/45 (17%), Positives = 16/45 (35%), Gaps = 10/45 (22%)

Query: 164 SFAHIFGHIA----------KADESVKFLVRVSYFEIYNEEIRDL 198
           S      H +           AD+ ++  +R  + E Y  +  D+
Sbjct: 135 SILFFLFHGSVWTAICYFCLYADDEIRDYIREDFRETYGVDSMDI 179


>gnl|CDD|221623 pfam12531, DUF3731, DNA-K related protein.  This domain family is
           found in bacteria, and is approximately 250 amino acids
           in length. There are two conserved sequence motifs: RPG
           and WRR. The proteins in this family are frequently
           annotated as DNA-K related proteins however there is
           little accompanying literature to confirm this.
          Length = 249

 Score = 29.5 bits (67), Expect = 5.0
 Identities = 20/69 (28%), Positives = 30/69 (43%), Gaps = 14/69 (20%)

Query: 511 AEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDA----------ERIDMEEKYSSLQEES 560
              QQQLLD  A  L+   Q+  +L K    +            ER+ +E+K     E  
Sbjct: 65  EGQQQQLLDDLAPYLQPGAQRNLKLPKGPTPQSYDEMVRLAASLERLPVEDK----VELG 120

Query: 561 EGLSKKLKK 569
           E L ++L+K
Sbjct: 121 EWLLERLQK 129


>gnl|CDD|221232 pfam11802, CENP-K, Centromere-associated protein K.  CENP-K is one
           of seven new CENP-A-nucleosome distal (CAD) centromere
           components (the others being CENP-L, CENP-O, CENP-P,
           CENP-Q, CENP-R and CENP-S) that are identified as
           assembling on the CENP-A nucleosome associated complex,
           NAC. The CENP-A NAC is essential, as disruption of the
           complex causes errors of chromosome alignment and
           segregation that preclude cell survival despite
           continued centromere-derived mitotic checkpoint
           signalling. CENP-K is centromere-associated through its
           interaction with one or more components of the CENP-A
           NAC.
          Length = 273

 Score = 29.5 bits (66), Expect = 5.1
 Identities = 36/191 (18%), Positives = 69/191 (36%), Gaps = 37/191 (19%)

Query: 439 EEASVPPPREEPTPNP----EPELDETEPCVEPSEVIED-----------ETDAELKRAQ 483
           E+A + P     +P P    E  L E E   +  E  +            E+DA+L    
Sbjct: 4   EQAELFPDISTDSPAPIDAEEELLRECENIWKDMEECQSKLTLIGTETLPESDAQLSLLI 63

Query: 484 SDHQSMVEKLELLQKKV-----------LVGGENLLEKAEIQQQLLDAAAQELEHRKQKE 532
              +++  + E  QK+            L  G+  L+K  ++Q L       L   + K 
Sbjct: 64  MQMKALTAEYEQWQKRTPEIISLNEDVLLTLGKEELQK--LRQDL----EMVLSSVQSKN 117

Query: 533 EELKKN---IQEKDAERIDMEEKYSSLQEESEGLSKKLKK--VSSMLLAAKEERKDVQRE 587
           E+LK++    Q+   E+  + +  +  Q E +       +  +   L       K+ + +
Sbjct: 118 EKLKEDLEREQQWLDEQQQILDSLNERQSELKNQVVTFSESRIFQELKTKMLRIKEYKEK 177

Query: 588 NRKEIEGFLDN 598
               +  FL+ 
Sbjct: 178 LLSALGEFLEE 188


>gnl|CDD|153289 cd07605, I-BAR_IMD, Inverse (I)-BAR, also known as the IRSp53/MIM
           homology Domain (IMD), a dimerization module that binds
           and bends membranes.  Inverse (I)-BAR (or IMD) is a
           member of the Bin/Amphiphysin/Rvs (BAR) domain family.
           It is a dimerization and lipid-binding module that bends
           membranes and induces membrane protrusions in the
           opposite direction compared to classical BAR and F-BAR
           domains, which produce membrane invaginations. IMD
           domains are found in Insulin Receptor tyrosine kinase
           Substrate p53 (IRSp53), Missing in Metastasis (MIM), and
           Brain-specific Angiogenesis Inhibitor 1-Associated
           Protein 2-like (BAIAP2L) proteins. These are
           multi-domain proteins that act as scaffolding proteins
           and transducers of a variety of signaling pathways that
           link membrane dynamics and the underlying actin
           cytoskeleton. Most members contain an N-terminal IMD, an
           SH3 domain, and a WASP homology 2 (WH2) actin-binding
           motif at the C-terminus, exccept for MIM which does not
           carry an SH3 domain. Some members contain additional
           domains and motifs. The IMD domain binds and bundles
           actin filaments, binds membranes and produces membrane
           protrusions, and interacts with the small GTPase Rac.
          Length = 223

 Score = 29.3 bits (66), Expect = 5.2
 Identities = 24/128 (18%), Positives = 53/128 (41%), Gaps = 16/128 (12%)

Query: 504 GENLLEKAEIQQQL---LDAAAQELEHRKQKEEELKKNIQEKDAERI-DMEEKYSSLQEE 559
           GE L +  +  + +   L+  A+      +    L+K + E D + I   E+ Y     E
Sbjct: 67  GEALKQIVDTHKSIEASLEQVAKAF--HGELILPLEKKL-ELDQKVINKFEKDYKK---E 120

Query: 560 SEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFIPT 619
            +   + L K  S L   K+ +K  Q+    + +   + + Q  +E + +Q  ++ F+  
Sbjct: 121 YKQKREDLDKARSEL---KKLQKKSQKSGTGKYQ---EKLDQALEELNDKQKELEAFVSQ 174

Query: 620 KYLELMTQ 627
              + + +
Sbjct: 175 GLRDALLE 182


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 30.2 bits (68), Expect = 5.2
 Identities = 19/68 (27%), Positives = 36/68 (52%), Gaps = 2/68 (2%)

Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
           E+ L   E +++ L+  A+E   R+++ EE ++  +EK A   D  +  + +++  E L 
Sbjct: 247 EDFL--LEEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQ 304

Query: 565 KKLKKVSS 572
             LKK S 
Sbjct: 305 NLLKKASR 312


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 30.3 bits (68), Expect = 5.2
 Identities = 12/53 (22%), Positives = 19/53 (35%)

Query: 441 ASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKL 493
           AS PPP   P P+ EP  D+        +     T+          +  ++ L
Sbjct: 425 ASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRKALDAL 477


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 29.9 bits (68), Expect = 5.3
 Identities = 30/128 (23%), Positives = 57/128 (44%), Gaps = 21/128 (16%)

Query: 487 QSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEH-------------RKQKEE 533
           Q+++     LQ+K L G   L +  + Q+  L A  Q LEH                 E+
Sbjct: 201 QALLNAQNDLQRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQLQLLQEAINSKRLTLSEK 260

Query: 534 ELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIE 593
            +++   + +A RI        + +E E ++ +L   S  LL A E+   + ++N + ++
Sbjct: 261 TVQEAQSQDEAARIQAN---PLVAQELE-INLQL---SQRLLKATEKLNTLTQQNLR-VK 312

Query: 594 GFLDNVRQ 601
            +LD + Q
Sbjct: 313 NWLDRLTQ 320



 Score = 29.5 bits (67), Expect = 7.4
 Identities = 23/106 (21%), Positives = 53/106 (50%), Gaps = 16/106 (15%)

Query: 492 KLELLQKKVLVGGENLLEKAEIQQ--QLLDAAAQELEHRKQKEEELKKNIQEKDAERIDM 549
           +L+ L K+ L+  E+ L + +++Q   LLD   ++    K++ E+LK+ + +  A+    
Sbjct: 44  QLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQ----KEETEQLKQQLAQAPAKLRQA 99

Query: 550 EEKYSSLQ--------EESEGLSKKLKKVSSMLLAAKEERKDVQRE 587
           + +  +L+        E    LS  L+++ S L    ++ ++ Q +
Sbjct: 100 QAELEALKDDNDEETRETLSTLS--LRQLESRLAQTLDQLQNAQND 143


>gnl|CDD|235871 PRK06835, PRK06835, DNA replication protein DnaC; Validated.
          Length = 329

 Score = 29.9 bits (68), Expect = 5.3
 Identities = 10/16 (62%), Positives = 11/16 (68%)

Query: 131 NGTIFAYGQTGTGKTF 146
           N  +  YG TGTGKTF
Sbjct: 183 NENLLFYGNTGTGKTF 198


>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
           PREFRAN.  This family is the long protein of a novel
           CRISPR subtype, PREFRAN, which is most common in
           Prevotella and Francisella, although widely distributed.
           The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
           helicase Cas3 and endonuclease Cas3-HD.
          Length = 1287

 Score = 30.2 bits (68), Expect = 5.6
 Identities = 25/106 (23%), Positives = 51/106 (48%), Gaps = 7/106 (6%)

Query: 513 IQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSS 572
           I+  L +   +++E RK K +  K+  +EK  +++  ++ Y S+Q  ++ L K +K+  S
Sbjct: 373 IRTALEEYIKKQIEPRKGKSK--KEKYEEK-VKKVVKKQDYLSIQFINDLLEKYIKEFDS 429

Query: 573 ---MLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQK 615
              +    ++   ++    R E     DN+  I   Y   + I+QK
Sbjct: 430 KDRLFENIRDYFAEISLM-RNEDFQTNDNIANIKTAYQTFERILQK 474


>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
           protein.  This domain family is found in eukaryotes, and
           is typically between 212 and 238 amino acids in length.
           The family is found in association with pfam01805. There
           are two completely conserved residues (W and H) that may
           be functionally important. PRP21 is required for
           assembly of the prespliceosome and it interacts with U2
           snRNP and/or pre-mRNA in the prespliceosome. This family
           also contains proteins similar to PRP21, such as the
           mammalian SF3a. SF3a also interacts with U2 snRNP from
           the prespliceosome, converting it to its active form.
          Length = 230

 Score = 29.3 bits (66), Expect = 5.7
 Identities = 10/38 (26%), Positives = 21/38 (55%)

Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEE 551
           +  +L+ A +  E  K +++E +K  +E++ ERI    
Sbjct: 16  KYYILERAYKRAEWEKHQKKEEQKKEEEEEEERIAFAS 53


>gnl|CDD|151608 pfam11166, DUF2951, Protein of unknown function (DUF2951).  This
           family of proteins has no known function. It has a
           highly conserved sequence.
          Length = 97

 Score = 27.8 bits (62), Expect = 6.2
 Identities = 18/69 (26%), Positives = 39/69 (56%), Gaps = 6/69 (8%)

Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQL-LDAAAQELEHRKQKE 532
           ET+  ++R + + +++ E L+ ++      G++  E    +  L LD+  +E E  K+ +
Sbjct: 7   ETERRIQRLEENDKTIFESLDEIKD-----GQHTQELVNQKMDLTLDSINRERELDKENK 61

Query: 533 EELKKNIQE 541
           EE +KNI++
Sbjct: 62  EENRKNIKD 70


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 29.3 bits (66), Expect = 6.3
 Identities = 27/146 (18%), Positives = 61/146 (41%), Gaps = 21/146 (14%)

Query: 460 ETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLD 519
           E E  ++  E  + +   E  R +++ +    + +  Q++     ++ L +   Q++L  
Sbjct: 67  ELEAKIKEYEAQQAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQ 126

Query: 520 AAAQELEH--------------RKQKEEELKKNIQEKDAERIDMEEKY--SSLQEESEGL 563
              Q  E               R+  EEE+ +  +E   E  ++E +   + ++ E+ G 
Sbjct: 127 QRRQNEELLKMQEESVLRQEAMRRATEEEILEMRRETIEEEAELERENIRAKIEAEARGR 186

Query: 564 SKKLKK-----VSSMLLAAKEERKDV 584
           +K+ ++        + L A EER+ V
Sbjct: 187 AKEERENEDINREMLKLKANEERETV 212


>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 548

 Score = 29.8 bits (67), Expect = 6.3
 Identities = 27/166 (16%), Positives = 61/166 (36%), Gaps = 23/166 (13%)

Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK----AEIQQQLLDAAAQELEH 527
           E E + E+  A+++  + + +LE+ Q+      E   E     AE + ++    A+    
Sbjct: 229 EAEKETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKIILAETEAEVAAWKAETRRE 288

Query: 528 RKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLK---------KVSSMLLAAK 578
            +Q E   ++ IQE+ A+  + E +++   E  E     ++         +  S  + A 
Sbjct: 289 AEQAEILAEQAIQEEKAQA-EQEVQHAKALEAREMRVGLIERQKETELEPQERSYFINAA 347

Query: 579 EERKDVQRENRKEIE---------GFLDNVRQITKEYSYQQAIIQK 615
           + +   + +    I                     E + Q A++  
Sbjct: 348 QRQAQEEAKAAANIAEAIGAQAEAAVETARETEEAERAEQAALVAA 393


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 29.6 bits (66), Expect = 6.3
 Identities = 11/44 (25%), Positives = 20/44 (45%)

Query: 440 EASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQ 483
           +  VP    E     E E DE     EP +V++++++ E +   
Sbjct: 132 KPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEEL 175


>gnl|CDD|218967 pfam06273, eIF-4B, Plant specific eukaryotic initiation factor 4B. 
           This family consists of several plant specific
           eukaryotic initiation factor 4B proteins.
          Length = 496

 Score = 29.7 bits (66), Expect = 6.6
 Identities = 32/153 (20%), Positives = 58/153 (37%), Gaps = 11/153 (7%)

Query: 442 SVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVL 501
               PREE     E  LD  +   E        T ++  R  S   S  E          
Sbjct: 265 GAARPREEVLA--EKGLDWKKLDSEIEAKTSRPTSSQSSRPSSAQSSRSES------PGS 316

Query: 502 VGGENLLEKAEIQQQLLDAAAQE--LEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEE 559
            G E +++         DA  +E  LE + +   ++   ++ +  +R + EE+  +L+EE
Sbjct: 317 QGSEGVVKPRPKVNPFGDAKPREVVLEEKGKDWRKIDLELEHRRVDRPETEEE-KNLKEE 375

Query: 560 SEGLSKKLKKVSSMLLAAKEERKDVQRENRKEI 592
              L   LKK  ++   +K   ++   +   E+
Sbjct: 376 INLLRVDLKKEEAIAPESKGSGQEQSHKGLSEL 408


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 28.1 bits (63), Expect = 6.7
 Identities = 25/106 (23%), Positives = 47/106 (44%), Gaps = 19/106 (17%)

Query: 529 KQKEEELKKNIQEKDAERIDMEEKYSSLQ----------EESEGLSKKLKKVSSMLLAAK 578
           K+  +E +K ++E D    D +E Y S++                ++K K + S+L AA+
Sbjct: 1   KKARKEAQKALEE-DPSIYDYDEVYDSIKAAKEEKKQAKLSERKENRKPKYIGSLLEAAE 59

Query: 579 EERKD--------VQRENRKEIEGFLDNVRQITKEYSYQQAIIQKF 616
             +++        +Q+E  KE + F D  + +T  Y  Q    +K 
Sbjct: 60  RRKREREIAEERKLQKEREKEGDEFADKEKFVTSAYKKQLEENRKL 105


>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
           subunit.  CENP-Q is one of the components that assembles
           onto the CENP-A-nucleosome distal (CAD) centromere. The
           centromere, which is the basic element of chromosome
           inheritance, is epigenetically determined in mammals.
           CENP-A, the centromere-specific histone H3 variant,
           assembles an array of nucleosomes and it is this that
           seems to be the prime candidate for specifying
           centromere identity. CENP-A nucleosomes directly recruit
           a proximal CENP-A nucleosome associated complex (NAC)
           comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
           CENP-C and CENP-H. Assembly of the CENP-A NAC at
           centromeres is dependent on CENP-M, CENP-N and CENP-T.
           Additionally, there are seven other subunits which make
           up the CENP-A-nucleosome distal (CAD) centromere,
           CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
           CENP-S, also assembling on the CENP-A NAC. Fta7 is the
           equivalent component of the fission yeast Sim4 complex.
          Length = 159

 Score = 28.5 bits (64), Expect = 6.8
 Identities = 30/135 (22%), Positives = 56/135 (41%), Gaps = 18/135 (13%)

Query: 484 SDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDA----AAQELEHRKQKEEELKKNI 539
            +     ++L  L++K  +  E  L     Q +LL+        ELE   +  +EL+KN 
Sbjct: 16  KEDSLDYDQL--LRRKRAL--ERQLAAELKQLELLEEEIRREEAELEKDLEYLQELEKNA 71

Query: 540 QEKDAERIDMEEKY--------SSLQEESEGLSKKLKKVSSMLLAAK--EERKDVQRENR 589
           +  + ER +  +          S + EE+E L   L ++S    +     E  D+ ++  
Sbjct: 72  KALEREREEESKNLHPVLRLLESEVLEENELLQDSLLELSERNFSPNLDPELLDLLKQLN 131

Query: 590 KEIEGFLDNVRQITK 604
           K +E    N+ Q+  
Sbjct: 132 KHLESLQGNLEQLAG 146


>gnl|CDD|219083 pfam06548, Kinesin-related, Kinesin-related.  This family
           represents a region within kinesin-related proteins from
           higher plants. Many family members also contain the
           pfam00225 domain. Kinesins are ATP-driven microtubule
           motor proteins that produce directed force. Some family
           members are associated with the phragmoplast, a
           structure composed mainly of microtubules that executes
           cytokinesis in higher plants.
          Length = 488

 Score = 29.5 bits (66), Expect = 6.9
 Identities = 35/161 (21%), Positives = 63/161 (39%), Gaps = 11/161 (6%)

Query: 410 CNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSE 469
                 +   K+ LL    +EI +LK +L+         + T      L      +  S 
Sbjct: 214 FKNFYFDMGEKEVLL----EEIQDLKSQLQCMLSS---SKSTARTRSSLLARSFQLRKSA 266

Query: 470 VIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRK 529
           + E   +   K  + + +   E     + K +   E L  + E  + L +    ELE  K
Sbjct: 267 IPESGDENAEKTLEKERERWTE----AESKWISLTEELRVELEANRSLAEKLEMELESEK 322

Query: 530 QKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV 570
           +  EELK  +Q        + E+Y+ LQE+   L +K +++
Sbjct: 323 KCTEELKDALQRAMQGHARIIEQYAELQEKYNDLLEKHRRI 363


>gnl|CDD|180700 PRK06800, fliH, flagellar assembly protein H; Validated.
          Length = 228

 Score = 29.1 bits (65), Expect = 7.0
 Identities = 15/72 (20%), Positives = 34/72 (47%), Gaps = 5/72 (6%)

Query: 482 AQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKN-IQ 540
            + + Q   E+L   QK +       L +   +QQ L+   Q+L   +++ +E  +  ++
Sbjct: 32  VEEEIQKDHEELLAQQKSL----HKELNQLRQEQQKLERERQQLLADREQFQEHVQQQMK 87

Query: 541 EKDAERIDMEEK 552
           E +A R   +++
Sbjct: 88  EIEAARQQFQKE 99


>gnl|CDD|220716 pfam10359, Fmp27_WPPW, RNA pol II promoter Fmp27 protein domain.
           Fmp27_WPPW is a conserved domain of a family of proteins
           involved in RNA polymerase II transcription initiation.
           It contains characteristic HQR and WPPW sequence motifs.
           and is towards the C-terminal in members which contain
           Fmp27_SW pfam10305.
          Length = 470

 Score = 29.6 bits (67), Expect = 7.1
 Identities = 16/63 (25%), Positives = 36/63 (57%), Gaps = 2/63 (3%)

Query: 512 EIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVS 571
            +Q++LL+   ++LE + +K EE   +++  D E  +++     L+EE   L ++L+ + 
Sbjct: 160 RVQRELLEERLKQLEEQIKKLEEKLDDLELNDTE--ELQSDLEELEEELSVLKERLEFLE 217

Query: 572 SML 574
            +L
Sbjct: 218 KLL 220


>gnl|CDD|234063 TIGR02928, TIGR02928, orc1/cdc6 family replication initiation
           protein.  Members of this protein family are found
           exclusively in the archaea. This set of DNA binding
           proteins shows homology to the origin recognition
           complex subunit 1/cell division control protein 6 family
           in eukaryotes. Several members may be found in genome
           and interact with each other [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 365

 Score = 29.5 bits (67), Expect = 7.1
 Identities = 9/12 (75%), Positives = 11/12 (91%)

Query: 134 IFAYGQTGTGKT 145
           +F YG+TGTGKT
Sbjct: 43  VFIYGKTGTGKT 54


>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
           endonuclease-like fold, RmuC family [General function
           prediction only].
          Length = 448

 Score = 29.3 bits (66), Expect = 7.1
 Identities = 20/132 (15%), Positives = 46/132 (34%), Gaps = 7/132 (5%)

Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEE 533
               +L       + M+ +L  + +++L     LL     + + L   A+ L  +    E
Sbjct: 18  AFIRQLLLRLGRLEQMLGELAAVLEQLL-----LLLAFRAEAEQLRTFARSL--QALNLE 70

Query: 534 ELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIE 593
            +++  + K   +  + +    LQ   E L++   +   +     EE      E  ++  
Sbjct: 71  LIQELNELKARLQQQLLQSREQLQLLIESLAQLSSEFQELANEIFEELNRRLAELNQQNL 130

Query: 594 GFLDNVRQITKE 605
             L    +   E
Sbjct: 131 KQLLKPLREVLE 142


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 29.3 bits (66), Expect = 7.2
 Identities = 32/157 (20%), Positives = 66/157 (42%), Gaps = 11/157 (7%)

Query: 421 DALLIK-YQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAEL 479
             LL + Y+KE+   KK+ E   V   + E        LDE    +      + E +++ 
Sbjct: 196 QELLQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDD 255

Query: 480 KRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEI-QQQLLDAAAQELEHRKQKEEEL--- 535
              +S  +    + E + K V    +   ++ +  +++ L+  A+E +  K+K  +L   
Sbjct: 256 ---ESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARL 312

Query: 536 ---KKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
               K + +K+  R   +E+     E+ +   +KL K
Sbjct: 313 KEIAKEVAQKEKARARKKEQRKERGEKKKLKRRKLGK 349


>gnl|CDD|236272 PRK08475, PRK08475, F0F1 ATP synthase subunit B; Validated.
          Length = 167

 Score = 28.4 bits (64), Expect = 7.3
 Identities = 7/32 (21%), Positives = 19/32 (59%)

Query: 556 LQEESEGLSKKLKKVSSMLLAAKEERKDVQRE 587
            +     +SK+L+++   L  +KE+++D  ++
Sbjct: 51  YKSRINKISKRLEEIQEKLKESKEKKEDALKK 82


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 29.6 bits (67), Expect = 7.3
 Identities = 44/211 (20%), Positives = 85/211 (40%), Gaps = 45/211 (21%)

Query: 425 IKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPS---EVIEDETDAELKR 481
               K I EL + LEE      +E  +     EL      +  S   + ++D+ ++ +  
Sbjct: 108 ATTPKSIQELVEALEELLEELLKETASDPVVQELVSIFNDLIDSIKEDNLKDDLESLIAS 167

Query: 482 AQSDHQSMVEKL--------ELLQKKVLVGGENLLEKAEIQ-------------QQLLDA 520
           A+ +   + +KL        E L++ +    E LL K E +             +QL   
Sbjct: 168 AKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLE 227

Query: 521 AAQELEH-RKQKEEELKKNIQEKDAE------------RIDMEEKY-----SSLQEESEG 562
             +E E  RK+ EE+L++ ++ +                I+++ ++       ++EE  G
Sbjct: 228 FEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIELQREFNKEIKEKVEEERNG 287

Query: 563 LSKKLKKVSSMLLA---AKEERKDVQRENRK 590
              KL +++S L     A + R + + EN K
Sbjct: 288 RLAKLAELNSRLKGLEKALDSRSEAEDENHK 318


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 29.6 bits (67), Expect = 7.3
 Identities = 14/52 (26%), Positives = 23/52 (44%), Gaps = 1/52 (1%)

Query: 447 REEPTPNPEPELD-ETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQ 497
           R  P P+P    + E E    P E  E+E + E    +    ++ E + LL+
Sbjct: 295 RRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEEVRATVAEAIRLLE 346


>gnl|CDD|233050 TIGR00609, recB, exodeoxyribonuclease V, beta subunit.  The RecBCD
           holoenzyme is a multifunctional nuclease with potent
           ATP-dependent exodeoxyribonuclease activity. Ejection of
           RecD, as occurs at chi recombinational hotspots,
           cripples exonuclease activity in favor of recombinagenic
           helicase activity. All proteins in this family for which
           functions are known are DNA-DNA helicases that are used
           as part of an exonuclease-helicase complex (made up of
           RecBCD homologs) that function to generate substrates
           for the initiation of recombination and recombinational
           repair. This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 1087

 Score = 29.7 bits (67), Expect = 7.4
 Identities = 10/22 (45%), Positives = 12/22 (54%)

Query: 130 FNGTIFAYGQTGTGKTFTMEGV 151
            NGT       GTGKTFT+  +
Sbjct: 8   LNGTFLIEASAGTGKTFTIAQL 29


>gnl|CDD|237320 PRK13258, PRK13258, 7-cyano-7-deazaguanine reductase; Provisional.
          Length = 114

 Score = 27.8 bits (63), Expect = 7.5
 Identities = 7/17 (41%), Positives = 10/17 (58%), Gaps = 1/17 (5%)

Query: 667 QPDFSHLY-RYYPSEKV 682
           QPDF+ +   Y P +K 
Sbjct: 34  QPDFATIVIDYIPDKKC 50


>gnl|CDD|181465 PRK08557, PRK08557, hypothetical protein; Provisional.
          Length = 417

 Score = 29.3 bits (66), Expect = 7.7
 Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 9/77 (11%)

Query: 143 GKTFTMEGVNNVPELKGIIPNSFAHI-----FGHIAKADESVKF--LVRVSYFEIYNEEI 195
           GK    E + N  EL  I+ N +  +      G   K ++ +K   L      +   E+I
Sbjct: 91  GKYIKEELIENPEELNEILENDYVGVEIGNFLGVGVKKEDRIKIKDLSLKKELDF--EKI 148

Query: 196 RDLLSKNKDQCLEVKER 212
            D L KNK++  +++E 
Sbjct: 149 EDYLEKNKERIEKLEEN 165


>gnl|CDD|233459 TIGR01541, tape_meas_lam_C, phage tail tape measure protein, lambda
           family.  This model represents a relatively
           well-conserved region near the C-terminus of the tape
           measure protein of a lambda and related phage. This
           protein, which controls phage tail length, is typically
           about 1000 residues in length. Both low-complexity
           sequence and insertion/deletion events appear common in
           this family. Mutational studies suggest a ruler or
           template role in the determination of phage tail length.
           Similar behavior is attributed to proteins from
           distantly related or unrelated families in other phage
           [Mobile and extrachromosomal element functions, Prophage
           functions].
          Length = 332

 Score = 29.0 bits (65), Expect = 7.8
 Identities = 27/143 (18%), Positives = 49/143 (34%), Gaps = 38/143 (26%)

Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
            ++L  A  + L+ R  +               I    K   L EE+E      +K    
Sbjct: 15  LKKLNTADEKSLQSRSDE---------------IIALIKLEKLLEEAE------RKALEA 53

Query: 574 LLAAKEERKDVQRENRKEIEGFLDNVRQ---------ITKEYSYQQAIIQKFIPTKYL-- 622
           L    E    ++ +N+++++ F    +Q         I + +  QQ  + K +  K L  
Sbjct: 54  LKKLAEATASIRAQNKRQLDRFGLGDKQRERLDARLQIDRTFRKQQRDLNKAMTAKGLAG 113

Query: 623 ------ELMTQNATWNEDIGEWE 639
                 +L    A  NE + E  
Sbjct: 114 SDLYKEQLAAIKAALNEALAELH 136


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 28.9 bits (65), Expect = 7.8
 Identities = 25/133 (18%), Positives = 45/133 (33%), Gaps = 15/133 (11%)

Query: 416 NQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCV-------EPS 468
            +D +  L    +   S+ + KL              N   E D+ E  +       +  
Sbjct: 56  KRDLRAELEEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDAR 115

Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR 528
               D++D+      SD  S  +  E            L E  +I+++   A  +E E  
Sbjct: 116 NSDADDSDSSSDSDSSDDDSDDDDSEDETA------ALLRELEKIKKE--RAEEKEREEE 167

Query: 529 KQKEEELKKNIQE 541
           ++  EE K   +E
Sbjct: 168 EKAAEEEKAREEE 180


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 29.2 bits (66), Expect = 8.0
 Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 7/87 (8%)

Query: 515 QQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKL-----KK 569
           ++ L   A+EL  RK  E+E     +E+ AE   ++E+   L  E + L + L     K 
Sbjct: 5   REALAELAKEL--RKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKP 62

Query: 570 VSSMLLAAKEERKDVQRENRKEIEGFL 596
            +S      EE ++  +    E   +L
Sbjct: 63  AASGEGGGGEEEEEEAKAEAAEFRAYL 89


>gnl|CDD|153327 cd07643, I-BAR_IMD_MIM, Inverse (I)-BAR, also known as the
           IRSp53/MIM homology Domain (IMD), of Missing In
           Metastasis.  The IMD domain, also called
           Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a
           dimerization and lipid-binding module that bends
           membranes and induces membrane protrusions. Members of
           this subfamily include missing in metastasis (MIM) or
           metastasis suppressor 1 (MTSS1), metastasis suppressor
           1-like (MTSSL) or ABBA (Actin-Bundling protein with
           BAIAP2 homology), and similar proteins. They contain an
           N-terminal IMD and a WASP homology 2 (WH2) actin-binding
           motif at the C-terminus. MIM was originally identified
           as a missing transcript from metastatic bladder and
           prostate cancer cells. It is a scaffold protein that
           functions in a signaling pathway between the PDGF
           receptor, Src kinases, and actin assembly. It may also
           function as a cofactor of the Sonic hedgehog (Shh)
           transcriptional pathway and may participate in tumor
           development and progression via this pathway. ABBA
           regulates actin and plasma membrane dynamics to promote
           the extension of radial glia, which is important in
           neuronal migration, axon guidance and neurogenesis. The
           IMD domain of MIM binds and bundles actin filaments,
           binds membranes, and interacts with the small GTPase
           Rac.
          Length = 231

 Score = 29.0 bits (65), Expect = 8.2
 Identities = 17/61 (27%), Positives = 27/61 (44%), Gaps = 7/61 (11%)

Query: 478 ELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKK 537
           E K+A+ + +        LQKK   G  +L       Q  LD+A Q++  +    EE +K
Sbjct: 123 EYKKARQEIKKKSSDTIRLQKKARKGKGDL-------QPQLDSAMQDVNDKYLLLEETEK 175

Query: 538 N 538
            
Sbjct: 176 K 176


>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein.  The Bacillus subtilis divIVA1
           mutation causes misplacement of the septum during cell
           division, resulting in the formation of small, circular,
           anucleate mini-cells. Inactivation of divIVA produces a
           mini-cell phenotype, whereas overproduction of DivIVA
           results in a filamentation phenotype. These proteins
           appear to contain coiled-coils.
          Length = 131

 Score = 28.0 bits (63), Expect = 8.3
 Identities = 20/83 (24%), Positives = 40/83 (48%), Gaps = 2/83 (2%)

Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
           E L ++ E  ++ ++   ++LE  K+ EE L+K +    A+    E K ++ +E    + 
Sbjct: 35  EALYKENEELKEEIERLEEKLEEYKELEETLQKTLVV--AQETAEEVKANAQKEAELIIK 92

Query: 565 KKLKKVSSMLLAAKEERKDVQRE 587
           +   K   ++  A EE K +  E
Sbjct: 93  EAEAKAERIVNDANEEAKKLATE 115


>gnl|CDD|221198 pfam11740, KfrA_N, Plasmid replication region DNA-binding N-term.
           The broad host-range plasmid RK2 is able to replicate in
           and be inherited in a stable manner in diverse
           Gram-negative bacterial species. It encodes a number of
           co-ordinately regulated operons including a central
           control korF1 operon that represses the kfrA operon. The
           KfrA polypeptide is a site-specific DNA-binding protein
           whose operator overlaps the kfrA promoter. The
           N-terminus, containing an helix-turn-helix motif, is
           essential for function. Downstream from this family is
           an extended coiled-coil domain containing a heptad
           repeat segment which is probably responsible for
           formation of multimers, and may provide an example of a
           bridge to host structures required for plasmid
           partitioning.
          Length = 120

 Score = 28.0 bits (63), Expect = 8.4
 Identities = 9/49 (18%), Positives = 19/49 (38%), Gaps = 3/49 (6%)

Query: 511 AEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEE 559
           A + +   + A + L          +    E +AE  +   +  +L+EE
Sbjct: 72  AALWEAAQEEAEERLA---AARAAAEAERAELEAELAEAAAEAEALEEE 117


>gnl|CDD|197544 smart00150, SPEC, Spectrin repeats. 
          Length = 101

 Score = 27.3 bits (61), Expect = 8.7
 Identities = 17/111 (15%), Positives = 48/111 (43%), Gaps = 23/111 (20%)

Query: 458 LDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQL 517
           L+E E  +   ++ +D             +++++K E  + ++    E +        + 
Sbjct: 14  LEEKEQLLASEDLGKDLESV---------EALLKKHEAFEAELEAHEERV--------EA 56

Query: 518 LDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLK 568
           L+   ++L    ++     + I+E+     ++ E++  L+E +E   +KL+
Sbjct: 57  LNELGEQL---IEEGHPDAEEIEER---LEELNERWEELKELAEERRQKLE 101


>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1.  This
           family consists of several acidic phosphoprotein
           precursor PCEMA1 sequences which appear to be found
           exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
           that is associated with the membrane of the infected
           erythrocyte throughout the entire intraerythrocytic
           cycle. The exact function of this family is unclear.
          Length = 286

 Score = 29.1 bits (65), Expect = 8.8
 Identities = 26/123 (21%), Positives = 43/123 (34%), Gaps = 8/123 (6%)

Query: 437 KLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELL 496
           K +E  V         +    L E +   E      +E D E+    +D ++     EL 
Sbjct: 164 KKQELPVEETLSTIHEDDASTLHEDDELDEEVTSYLNELDDEVTSYFNDGENEENDDELE 223

Query: 497 QKKV--LVGGENLLEKAEIQQQLLD------AAAQELEHRKQKEEELKKNIQEKDAERID 548
            + +  L  GEN  E  E  ++         A   E E   + E+E ++   E+      
Sbjct: 224 AEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEEEAGEEQENEDK 283

Query: 549 MEE 551
            EE
Sbjct: 284 GEE 286


>gnl|CDD|237292 PRK13182, racA, polar chromosome segregation protein; Reviewed.
          Length = 175

 Score = 28.5 bits (64), Expect = 8.8
 Identities = 29/148 (19%), Positives = 57/148 (38%), Gaps = 30/148 (20%)

Query: 433 ELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEK 492
            +K ++EE       ++P+ N   E          S V              D + +  +
Sbjct: 48  YVKSQIEEGQNMQDTQKPSSNDVEETQVNTIVQNISSV--------------DFEQLEAQ 93

Query: 493 LELLQKKVLVGGENLLEKAEIQQQLLDAA-----AQELEHRKQKEEELKKNIQEKDAERI 547
           L  + +++           E+++QL   A      Q L+HR++ EE L++ +Q+ +A   
Sbjct: 94  LNTITRRL----------DELERQLQQKADDVVSYQLLQHRREMEEMLER-LQKLEARLK 142

Query: 548 DMEEKYSSLQEESEGLSKKLKKVSSMLL 575
            +E  Y +   E     +K  K   M+ 
Sbjct: 143 KLEPIYITPDTEPTYEREKKPKRRKMIF 170


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 29.4 bits (66), Expect = 9.0
 Identities = 28/150 (18%), Positives = 59/150 (39%), Gaps = 24/150 (16%)

Query: 491 EKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR-KQKEEELKKNIQEKDAERIDM 549
                  +++ V  E   E+A +       A +    R K+  +EL+  ++    E +D+
Sbjct: 664 LSTLFCVQRLRVAAELQKEEARL-------ALEGNIERTKELNDELRAELELHRKEILDL 716

Query: 550 EEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR-KEIEGFLDNVRQITKEYSY 608
            +                    +   AA+EE++  QRE+R + +E  L+ V    + Y  
Sbjct: 717 FD------------CGTADTEDAFREAAREEQQLTQRESRLESLEAQLEGVA--AEAYEL 762

Query: 609 QQAIIQKFIPTKYL-ELMTQNATWNEDIGE 637
             ++ Q+ +  + L  L       +E++ E
Sbjct: 763 SASLDQRELKEEELALLEEAIDALDEEVEE 792


>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201).  This is
           a family of coiled-coil proteins from eukaryotes. The
           function is not known.
          Length = 177

 Score = 28.3 bits (64), Expect = 9.0
 Identities = 21/86 (24%), Positives = 37/86 (43%), Gaps = 15/86 (17%)

Query: 523 QELEHRKQKE-------EELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM-- 573
             L H K+K        E LKK ++EK+     + ++   ++ E + L K+ KK+     
Sbjct: 77  HILTHLKEKLHMLEEELERLKKELKEKEERLAKLRKELYKVKLERDKLRKQNKKLRQQGG 136

Query: 574 ------LLAAKEERKDVQRENRKEIE 593
                 LL   ++  +   E RK +E
Sbjct: 137 LLHVPALLYDYDKTVEKVEEKRKSVE 162


>gnl|CDD|177020 CHL00081, chlI, Mg-protoporyphyrin IX chelatase.
          Length = 350

 Score = 28.8 bits (65), Expect = 9.1
 Identities = 28/79 (35%), Positives = 38/79 (48%), Gaps = 8/79 (10%)

Query: 500 VLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEK----DAERIDMEEKYSS 555
           VLVG  N  E+ E++ QLLD      E R  K+ EL+  I E+    D    +  EKY  
Sbjct: 189 VLVGSGNP-EEGELRPQLLDRFGMHAEIRTVKDPELRVKIVEQRTSFDKNPQEFREKYEE 247

Query: 556 LQEESEGLSKKLKKVSSML 574
            QEE   L  K+    ++L
Sbjct: 248 SQEE---LRSKIVAAQNLL 263


>gnl|CDD|220222 pfam09403, FadA, Adhesion protein FadA.  FadA (Fusobacterium
           adhesin A) is an adhesin which forms two alpha helices.
          Length = 126

 Score = 28.0 bits (62), Expect = 9.1
 Identities = 19/79 (24%), Positives = 35/79 (44%), Gaps = 9/79 (11%)

Query: 515 QQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEES---------EGLSK 565
           Q L+       E  KQ++E  +K +QE    ++  EE Y+ L+EES         + L K
Sbjct: 37  QALMQKEEARFEEEKQEKETAEKEVQELKERQLGREELYAKLKEESKTRFHKDEYKKLLK 96

Query: 566 KLKKVSSMLLAAKEERKDV 584
           +  +  + L     +++  
Sbjct: 97  RYDEYLNKLEQKITDKEQQ 115


>gnl|CDD|223024 PHA03252, PHA03252, DNA packaging tegument protein UL25;
           Provisional.
          Length = 589

 Score = 28.9 bits (65), Expect = 9.7
 Identities = 16/85 (18%), Positives = 36/85 (42%), Gaps = 5/85 (5%)

Query: 512 EIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVS 571
           E  ++L D +A  L   ++  E+L ++   +     ++++    LQ E E L  ++ +  
Sbjct: 29  EDLRRLRDDSALRL---RRYREDLLRDRLLRRRLGEELDDLQKRLQTECEDLRSRVSEAE 85

Query: 572 SMLLAAKE--ERKDVQRENRKEIEG 594
           ++LL      E           ++G
Sbjct: 86  ALLLHDASGGEGGGATNGGEVNVDG 110


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 28.3 bits (64), Expect = 9.7
 Identities = 31/120 (25%), Positives = 56/120 (46%), Gaps = 28/120 (23%)

Query: 474 ETDAELKRAQSDHQSMVEKLE--LLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQK 531
           E + ELK  +++ Q    + E  LLQK      E  L++             + E  ++K
Sbjct: 68  EAERELKERRNELQ----RQEKRLLQK------EETLDR-------------KDESLEKK 104

Query: 532 EEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKE 591
           EE L++  +E  A +  +EEK   L+E  E   ++L+++S +     EE K++  E  + 
Sbjct: 105 EESLEEKEKELAARQQQLEEKEEELEELIEEQQQELERISGL---TAEEAKEILLEEVEA 161


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 29.0 bits (65), Expect = 9.8
 Identities = 32/159 (20%), Positives = 54/159 (33%), Gaps = 29/159 (18%)

Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSD 485
           K    +S   KK  +          +   E E +E        +VI ++  AE      D
Sbjct: 213 KKSNIMSSFFKKKTKEKKEKKEASESTVKE-ESEEESG---KRDVILEDESAEPTGLDED 268

Query: 486 HQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAE 545
                E  +  +                  +  D+  +  E  K+K + LKK + E + E
Sbjct: 269 -----EDEDEPKPSG---------------ERSDSEEETEEKEKEKRKRLKK-MMEDEDE 307

Query: 546 RIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDV 584
             +ME    S  EE E    +       L   +EE+++V
Sbjct: 308 DEEMEIVPESPVEEEESEEPE----PPPLPKKEEEKEEV 342


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.129    0.358 

Gapped
Lambda     K      H
   0.267   0.0724    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 35,556,769
Number of extensions: 3572205
Number of successful extensions: 6230
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5124
Number of HSP's successfully gapped: 712
Length of query: 705
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 601
Effective length of database: 6,324,786
Effective search space: 3801196386
Effective search space used: 3801196386
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 63 (28.1 bits)