RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7226
         (279 letters)



>gnl|CDD|215803 pfam00225, Kinesin, Kinesin motor domain. 
          Length = 326

 Score =  164 bits (417), Expect = 7e-49
 Identities = 78/242 (32%), Positives = 116/242 (47%), Gaps = 48/242 (19%)

Query: 24  DHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVS----- 78
             + F FD +F P  + E ++ E   PL+  +  G N T+ AYGQTG GKTYT+      
Sbjct: 38  REKTFTFDRVFDPEATQEFVYEETAKPLVESVLEGYNVTIFAYGQTGSGKTYTMEGDPPE 97

Query: 79  ---AMIMKTLQHVMQRCNKD-------DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNI 128
               +I + L+ + +R  +         V +SYL++Y+EK YDLL+ +            
Sbjct: 98  EEPGIIPRALEDLFERIEERKERKWSFSVRVSYLEIYNEKIYDLLSPSS----------- 146

Query: 129 PQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNA 188
            +   R++   DP+ G    G       LT  ++ S    ++L+  GN+N+  A T MN 
Sbjct: 147 KKKKLRIR--EDPKKGVYVKG-------LTEVEVTSAEEVLELLELGNKNRTVASTNMNE 197

Query: 189 QSSRSHTICTIYL----------GAMAKLHLVDLAGSEQLFS--LSDNYLLRNEARKINL 236
           +SSRSH I TI +              KL+LVDLAGSE+      +    L+ EA  IN 
Sbjct: 198 ESSRSHAIFTITVEQRNRDTDGSVKTGKLNLVDLAGSERASKTGAAGGRRLK-EAANINK 256

Query: 237 SL 238
           SL
Sbjct: 257 SL 258


>gnl|CDD|238054 cd00106, KISc, Kinesin motor domain. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Kinesins are microtubule-dependent
           molecular motors that play important roles in
           intracellular transport and in cell division. In most
           kinesins, the motor domain is found at the N-terminus
           (N-type), in some its is found in the middle (M-type),
           or C-terminal (C-type). N-type and M-type kinesins are
           (+) end-directed motors, while C-type kinesins are (-)
           end-directed motors, i.e. they transport cargo towards
           the (-) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 328

 Score =  164 bits (417), Expect = 7e-49
 Identities = 76/240 (31%), Positives = 111/240 (46%), Gaps = 44/240 (18%)

Query: 28  FIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVSA------MI 81
           F FD++F PN + E ++     PL+  +  G N T+ AYGQTG GKTYT+        +I
Sbjct: 47  FTFDHVFDPNSTQEDVYETTAKPLVESVLEGYNGTIFAYGQTGSGKTYTMFGSPKDPGII 106

Query: 82  MKTLQHVMQRCNKDD-------VYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQSDQR 134
            + L+ +    ++         V +SYL++Y+EK YDLL+              P     
Sbjct: 107 PRALEDLFNLIDERKEKNKSFSVSVSYLEIYNEKVYDLLSP-----------EPPSKPLS 155

Query: 135 VQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSSRSH 194
           ++           P  G     LT  ++ S   A+ L+  G +N+ TA T MN +SSRSH
Sbjct: 156 LRE---------DPKGGVYVKGLTEVEVGSAEDALSLLQKGLKNRTTASTAMNERSSRSH 206

Query: 195 TICTIYL-----------GAMAKLHLVDLAGSEQLFSLSDNYLLRNEARKINLSLHYLEQ 243
            I TI++              +KL+LVDLAGSE+            EA+ IN SL  L  
Sbjct: 207 AIFTIHVEQRNTTNDGRSIKSSKLNLVDLAGSERAKKTGAEGDRLKEAKNINKSLSALGN 266


>gnl|CDD|214526 smart00129, KISc, Kinesin motor, catalytic domain. ATPase.
           Microtubule-dependent molecular motors that play
           important roles in intracellular transport of organelles
           and in cell division.
          Length = 335

 Score =  155 bits (393), Expect = 3e-45
 Identities = 81/242 (33%), Positives = 115/242 (47%), Gaps = 47/242 (19%)

Query: 24  DHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVSA---- 79
             + F FD +F    S E +F E   PL++ +  G NAT+ AYGQTG GKTYT+      
Sbjct: 44  GEKKFTFDKVFDATASQEDVFEETAAPLVDSVLEGYNATIFAYGQTGSGKTYTMIGTPDS 103

Query: 80  --MIMKTLQHVMQRCNKD------DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQS 131
             +I + L+ + ++ +K        V +SYL++Y+EK  DLLN +    L+  I    + 
Sbjct: 104 PGIIPRALKDLFEKIDKREEGWQFSVKVSYLEIYNEKIRDLLNPSS-KKLE--IREDEKG 160

Query: 132 DQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSS 191
              V+G                   LT   + S      L+  GN+N+  A TKMN +SS
Sbjct: 161 GVYVKG-------------------LTEISVSSFEEVYNLLEKGNKNRTVAATKMNEESS 201

Query: 192 RSHTICTIYL-----------GAMAKLHLVDLAGSEQLFSLSDNYLLRN-EARKINLSLH 239
           RSH + TI +           G  +KL+LVDLAGSE+    +     R  EA  IN SL 
Sbjct: 202 RSHAVFTITVEQKIKNSSSGSGKASKLNLVDLAGSER-AKKTGAEGDRLKEAGNINKSLS 260

Query: 240 YL 241
            L
Sbjct: 261 AL 262


>gnl|CDD|238671 cd01375, KISc_KIF9_like, Kinesin motor domain, KIF9-like subgroup;
           might play a role in cell shape remodeling. This
           catalytic (head) domain has ATPase activity and belongs
           to the larger group of P-loop NTPases. Kinesins are
           microtubule-dependent molecular motors that play
           important roles in intracellular transport and in cell
           division. In most kinesins, the motor domain is found at
           the N-terminus (N-type). N-type kinesins are (+)
           end-directed motors, i.e. they transport cargo towards
           the (+) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 334

 Score =  145 bits (367), Expect = 1e-41
 Identities = 73/245 (29%), Positives = 111/245 (45%), Gaps = 43/245 (17%)

Query: 24  DHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVS----- 78
           D   F FD +F    S E ++  V  P+++   +G N T+ AYGQTG GKT+T++     
Sbjct: 47  DFS-FKFDGVFHNA-SQEEVYETVAKPVVDSALDGYNGTIFAYGQTGAGKTFTMTGGTES 104

Query: 79  ----AMIMKTLQHVMQRCNKD-----DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIP 129
                +I + L+ V +           V++SYL++Y+E+ YDLL                
Sbjct: 105 YKDRGLIPRALEQVFREVAMRATKTYTVHVSYLEIYNEQLYDLLG--------------- 149

Query: 130 QSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQ 189
                        T             L++    +   A+ L+  G  N+  A T MN  
Sbjct: 150 -DTPEALESLPAVTILEDSEQNIHVKGLSLHSATTEEEALNLLFLGETNRTIAETSMNQA 208

Query: 190 SSRSHTICTIYLGA-----------MAKLHLVDLAGSEQLFSLSDNYLLRNEARKINLSL 238
           SSRSH I TI+L +           ++KL+LVDLAGSE++     +  +  EA+ IN SL
Sbjct: 209 SSRSHCIFTIHLESRSREAGSEVVRLSKLNLVDLAGSERVSKTGVSGQVLKEAKYINKSL 268

Query: 239 HYLEQ 243
            +LEQ
Sbjct: 269 SFLEQ 273


>gnl|CDD|238663 cd01367, KISc_KIF2_like, Kinesin motor domain, KIF2-like group.
           KIF2 is a protein expressed in neurons, which has been
           associated with axonal transport and neuron development;
           alternative splice forms have been implicated in
           lysosomal translocation. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Kinesins are microtubule-dependent
           molecular motors that play important roles in
           intracellular transport and in cell division. In this
           subgroup the motor domain is found in the middle
           (M-type) of the protein chain. M-type kinesins are (+)
           end-directed motors, i.e. they transport cargo towards
           the (+) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second (KIF2 may be slower). To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 322

 Score =  139 bits (352), Expect = 2e-39
 Identities = 73/235 (31%), Positives = 109/235 (46%), Gaps = 42/235 (17%)

Query: 22  FFDHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVS--- 78
           + +   F FD +F    +NE ++   + PLI H+F G  AT  AYGQTG GKTYT+    
Sbjct: 47  YIEKHTFRFDYVFDEAVTNEEVYRSTVKPLIPHVFEGGVATCFAYGQTGSGKTYTMLGDE 106

Query: 79  ------AMIMKTLQHVMQRCNKD-DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQS 131
                 A+  + +  ++ + N D  V +S+ ++Y  K +DLLN  K +++          
Sbjct: 107 NQEGLYALAARDIFRLLAQPNDDLGVTVSFFEIYGGKLFDLLNDRKRLSVLE------DG 160

Query: 132 DQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSS 191
              VQ                    LT K + S++  ++LI +GN  + T  T  N QSS
Sbjct: 161 KGNVQ-----------------IVGLTEKPVTSVDELLELIESGNSLRTTGSTGANDQSS 203

Query: 192 RSHTICTIYL------GAMAKLHLVDLAGSEQLFS--LSDNYLLRNEARKINLSL 238
           RSH I  I L        + KL  +DLAGSE+  +     +   R E  +IN SL
Sbjct: 204 RSHAILQIILKNKKLNKLLGKLSFIDLAGSER-GADTSEHDRQTRKEGAEINKSL 257


>gnl|CDD|238666 cd01370, KISc_KIP3_like, Kinesin motor domain, KIP3-like subgroup.
           The yeast kinesin KIP3 plays a role in positioning the
           mitotic spindle. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 338

 Score =  128 bits (323), Expect = 6e-35
 Identities = 72/260 (27%), Positives = 117/260 (45%), Gaps = 56/260 (21%)

Query: 5   SEVGFQNRERQQTKIWLFFDHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLL 64
            E  F+N   ++ K         + FD +F    + E ++     PL++ + NG NAT+ 
Sbjct: 39  EEDAFRNLRARRNK------ELKYSFDRVFDETSTQEEVYENTTKPLVDGVLNGYNATVF 92

Query: 65  AYGQTGGGKTYTVSA------MIMKTLQHVMQRCNKD------DVYMSYLQLYSEKCYDL 112
           AYG TG GKT+T+        +++ T++ +  +  +       +V +SYL++Y+E   DL
Sbjct: 93  AYGATGAGKTHTMLGTDSDPGLMVLTMKDLFDKIEERKDDKEFEVSLSYLEIYNETIRDL 152

Query: 113 LN-GNKEVTLKNWIFNIPQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQL 171
           L+  +  + L+                 DP         G   A LT  +  S    ++L
Sbjct: 153 LSPSSGPLELRE----------------DPN-------QGIVVAGLTEHQPKSAEEILEL 189

Query: 172 IVTGNENKVTAVTKMNAQSSRSHTICTIYLGA------------MAKLHLVDLAGSEQLF 219
           ++ GN N+    T+ NA SSRSH +  I +              + KL L+DLAGSE+  
Sbjct: 190 LMKGNRNRTQEPTEANATSSRSHAVLQITVRQKDRTASINQQVRIGKLSLIDLAGSERA- 248

Query: 220 SLSDNYLLR-NEARKINLSL 238
           S ++N   R  E   IN SL
Sbjct: 249 SATNNRGQRLKEGANINRSL 268


>gnl|CDD|238662 cd01366, KISc_C_terminal, Kinesin motor domain,
           KIFC2/KIFC3/ncd-like carboxy-terminal kinesins. Ncd is a
           spindle motor protein necessary for chromosome
           segregation in meiosis. KIFC2/KIFC3-like kinesins have
           been implicated in motility of the Golgi apparatus as
           well as dentritic and axonal transport in neurons. This
           catalytic (head) domain has ATPase activity and belongs
           to the larger group of P-loop NTPases. Kinesins are
           microtubule-dependent molecular motors that play
           important roles in intracellular transport and in cell
           division. In this subgroup the motor domain is found at
           the C-terminus (C-type). C-type kinesins are (-)
           end-directed motors, i.e. they transport cargo towards
           the (-) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 329

 Score =  126 bits (318), Expect = 2e-34
 Identities = 72/235 (30%), Positives = 105/235 (44%), Gaps = 44/235 (18%)

Query: 27  VFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVSA------M 80
            F FD +F P+ S E +F EV  PL+    +G N  + AYGQTG GKTYT+        +
Sbjct: 46  SFSFDRVFDPDASQEDVFEEV-SPLVQSALDGYNVCIFAYGQTGSGKTYTMEGPPENPGI 104

Query: 81  IMKTLQH---VMQRCNKD----DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQSDQ 133
           I + L+      +   +      +  S L++Y+E   DLL        K  I +  + + 
Sbjct: 105 IPRALEQLFNTAEELKEKGWSYTITASMLEIYNETIRDLLATKPAPKKKLEIKHDSKGET 164

Query: 134 RVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSSRS 193
            V                     LT   + S     +L+  G++N+  A T MN  SSRS
Sbjct: 165 YVTN-------------------LTEVPVSSPEEVTRLLNLGSKNRSVASTNMNEHSSRS 205

Query: 194 HTICTIYL-------GAMA--KLHLVDLAGSEQLF-SLSDNYLLRNEARKINLSL 238
           H +  + +       G     KL+LVDLAGSE+L  S +    L+ EA+ IN SL
Sbjct: 206 HAVFQLKIRGTNLQTGEQTRGKLNLVDLAGSERLKKSGATGDRLK-EAQAINKSL 259


>gnl|CDD|238668 cd01372, KISc_KIF4, Kinesin motor domain, KIF4-like subfamily.
           Members of this group seem to perform a variety of
           functions, and have been implicated in neuronal
           organelle transport and chromosome segregation during
           mitosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 341

 Score =  126 bits (319), Expect = 3e-34
 Identities = 72/230 (31%), Positives = 110/230 (47%), Gaps = 56/230 (24%)

Query: 26  QVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYT--------- 76
           + F FD +F P+ S E ++   + PL++ +F G NAT+LAYGQTG GKTYT         
Sbjct: 40  KSFTFDYVFDPSTSQEEVYNTCVAPLVDGLFEGYNATVLAYGQTGSGKTYTMGTAFTASE 99

Query: 77  -VSAM--IMKTLQHVMQRC--NKDD----VYMSYLQLYSEKCYDLLNGNKEVTLKNWIFN 127
               +  I + +QH+ ++    KD+    + +S+L+LY+E+  DLL  +   + K+    
Sbjct: 100 DEEEVGIIPRAIQHIFKKIDEKKDEPDFQLKVSFLELYNEEVRDLL--SPSTSEKS---- 153

Query: 128 IPQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMN 187
                  +Q   D +      G       LT   ++S    +  +  G+ ++ TA T MN
Sbjct: 154 ------PIQIREDSKGNIIIVG-------LTEVTVNSAQEVMSCLEQGSLSRTTASTAMN 200

Query: 188 AQSSRSHTICTIYLGAM-------------------AKLHLVDLAGSEQL 218
           +QSSRSH I TI L                      +K H VDLAGSE+L
Sbjct: 201 SQSSRSHAIFTITLEQTRKNGPIAPMSGDDKNSTLTSKFHFVDLAGSERL 250


>gnl|CDD|238665 cd01369, KISc_KHC_KIF5, Kinesin motor domain, kinesin heavy chain
           (KHC) or KIF5-like subgroup. Members of this group have
           been associated with organelle transport. This catalytic
           (head) domain has ATPase activity and belongs to the
           larger group of P-loop NTPases. Kinesins are
           microtubule-dependent molecular motors that play
           important roles in intracellular transport and in cell
           division. In most kinesins, the motor domain is found at
           the N-terminus (N-type). N-type kinesins are (+)
           end-directed motors, i.e. they transport cargo towards
           the (+) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 325

 Score =  122 bits (309), Expect = 5e-33
 Identities = 78/239 (32%), Positives = 107/239 (44%), Gaps = 46/239 (19%)

Query: 24  DHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYT-VSAMIM 82
           D + F FD +F PN + E ++  V  P+++ + NG N T+ AYGQTG GKTYT       
Sbjct: 41  DGKTFSFDRVFPPNTTQEDVYNFVAKPIVDDVLNGYNGTIFAYGQTGSGKTYTMEGPPGD 100

Query: 83  KTLQHVMQRCNKD--------------DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNI 128
             L+ ++ R   D               V +SYL++Y EK  DLL+              
Sbjct: 101 PELKGIIPRIVHDIFEHISSMDENLEFHVKVSYLEIYMEKIRDLLD-------------- 146

Query: 129 PQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNA 188
             S   +Q   D   G    G       LT + + S    +++I  G  N+  A T MN 
Sbjct: 147 -VSKDNLQVHEDKNRGVYVKG-------LTERFVSSPEEVLEVINEGKSNRAVASTNMNE 198

Query: 189 QSSRSHTICTIYLGA---------MAKLHLVDLAGSEQLFSLSDNYLLRNEARKINLSL 238
           +SSRSH+I  I L             KL LVDLAGSE++           EA+KIN SL
Sbjct: 199 ESSRSHSIFLITLKQENVETGSKKRGKLFLVDLAGSEKVSKTGAEGQTLEEAKKINKSL 257


>gnl|CDD|238660 cd01364, KISc_BimC_Eg5, Kinesin motor domain, BimC/Eg5 spindle pole
           proteins, participate in spindle assembly and chromosome
           segregation during cell division. This catalytic (head)
           domain has ATPase activity and belongs to the larger
           group of P-loop NTPases. Kinesins are
           microtubule-dependent molecular motors that play
           important roles in intracellular transport and in cell
           division. In most kinesins, the motor domain is found at
           the N-terminus (N-type), N-type kinesins are (+)
           end-directed motors, i.e. they transport cargo towards
           the (+) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 352

 Score =  122 bits (309), Expect = 9e-33
 Identities = 73/251 (29%), Positives = 116/251 (46%), Gaps = 62/251 (24%)

Query: 27  VFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVS-------- 78
            + FD +FGP      ++++V+ P+++ +  G N T+ AYGQTG GKTYT+         
Sbjct: 49  TYTFDKVFGPEADQIEVYSQVVSPILDEVLMGYNCTIFAYGQTGTGKTYTMEGDRTDNKG 108

Query: 79  ---------AMIMKTLQHV---MQRCNKD-DVYMSYLQLYSEKCYDLLNGNKEVTLKNWI 125
                     +I + L  +   ++  N +  V +SYL+LY+E+ +DLL+   ++     I
Sbjct: 109 STWELSPHAGIIPRALYQLFEKLESQNTEYSVKVSYLELYNEELFDLLSSESDLNKPLRI 168

Query: 126 FNIPQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKL-----DSLNSAVQLIVTGNENKV 180
           F+                      N      + ++ L     ++ N  ++L+  G+  + 
Sbjct: 169 FD-------------------DTNNKG---GVVIQGLEEITVNNANEGLKLLEKGSAKRK 206

Query: 181 TAVTKMNAQSSRSHTICTIY------------LGAMAKLHLVDLAGSEQLFSLSDNYLLR 228
           TA T MN QSSRSH+I +I             L  + KL+LVDLAGSE +   S     R
Sbjct: 207 TAATLMNDQSSRSHSIFSITIHIKETTISGEELVKIGKLNLVDLAGSENI-GRSGAENKR 265

Query: 229 -NEARKINLSL 238
             EA  IN SL
Sbjct: 266 AREAGNINQSL 276


>gnl|CDD|238667 cd01371, KISc_KIF3, Kinesin motor domain, kinesins II or KIF3_like
           proteins. Subgroup of kinesins, which form heterotrimers
           composed of 2 kinesins and one non-motor accessory
           subunit. Kinesins II play important roles in ciliary
           transport, and have been implicated in neuronal
           transport, melanosome transport, the secretory pathway,
           and mitosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In this group the motor
           domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 333

 Score =  121 bits (306), Expect = 1e-32
 Identities = 75/243 (30%), Positives = 115/243 (47%), Gaps = 49/243 (20%)

Query: 26  QVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVS------- 78
           +VF FD ++ PN + E ++ E   PL++ +  G N T+ AYGQTG GKT+T+        
Sbjct: 48  KVFTFDAVYDPNSTQEDVYNETARPLVDSVLEGYNGTIFAYGQTGTGKTFTMEGVREPPE 107

Query: 79  --AMIMKTLQHVMQRCNKDD-----VYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQS 131
              +I  +  H+     K +     V +SYL++Y+E+  DLL  +               
Sbjct: 108 LRGIIPNSFAHIFGHIAKAENVQFLVRVSYLEIYNEEVRDLLGKD--------------Q 153

Query: 132 DQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSS 191
            ++++    P  G          +   VK  + ++  + L   GN+N+    T MN  SS
Sbjct: 154 KKKLELKERPDRGVYVKD----LSMFVVKNAEEMDKLMTL---GNKNRSVGATNMNEDSS 206

Query: 192 RSHTICTIYLGA------------MAKLHLVDLAGSE-QLFSLSDNYLLRNEARKINLSL 238
           RSH+I TI +              + KL+LVDLAGSE Q  + +    L+ EA KINLSL
Sbjct: 207 RSHSIFTITIECSEKGEDGENHIRVGKLNLVDLAGSERQSKTGATGDRLK-EATKINLSL 265

Query: 239 HYL 241
             L
Sbjct: 266 SAL 268


>gnl|CDD|238670 cd01374, KISc_CENP_E, Kinesin motor domain, CENP-E/KIP2-like
           subgroup, involved in chromosome movement and/or spindle
           elongation during mitosis. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Kinesins are microtubule-dependent
           molecular motors that play important roles in
           intracellular transport and in cell division. In most
           kinesins, the motor domain is found at the N-terminus
           (N-type). N-type kinesins are (+) end-directed motors,
           i.e. they transport cargo towards the (+) end of the
           microtubule. Kinesin motor domains hydrolyze ATP at a
           rate of about 80 per second, and move along the
           microtubule at a speed of about 6400 Angstroms per
           second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 321

 Score =  118 bits (298), Expect = 2e-31
 Identities = 68/234 (29%), Positives = 107/234 (45%), Gaps = 46/234 (19%)

Query: 28  FIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVSAM-----IM 82
           F FD +FG   +N  ++  +  P++     G N T+ AYGQT  GKT+T+S       I+
Sbjct: 42  FTFDRVFGGESTNREVYERIAKPVVRSALEGYNGTIFAYGQTSSGKTFTMSGDEQEPGII 101

Query: 83  K-----TLQHVMQRCNKD-DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQSDQRVQ 136
                   Q +    +++  + +SYL++Y+EK  DLL+ +                Q ++
Sbjct: 102 PLAVRDIFQRIQDTPDREFLLRVSYLEIYNEKIKDLLSPS---------------PQELR 146

Query: 137 GPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSSRSHTI 196
              DP  G          A LT + + S    +QLI  G +N+    T  N +SSRSHTI
Sbjct: 147 IREDPNKGVV-------VAGLTEEIVTSPEHLLQLIARGEKNRHVGETDFNERSSRSHTI 199

Query: 197 CTIYLGA------------MAKLHLVDLAGSEQLFSLSDNYLLRNEARKINLSL 238
             + + +            ++ L+L+DLAGSE+  S +     R E   IN SL
Sbjct: 200 FQLTIESRERGDSESGTVRVSTLNLIDLAGSERA-SQTGAGERRKEGSFINKSL 252


>gnl|CDD|238672 cd01376, KISc_KID_like, Kinesin motor domain, KIF22/Kid-like
           subgroup. Members of this group might play a role in
           regulating chromosomal movement along microtubules in
           mitosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 319

 Score =  117 bits (296), Expect = 2e-31
 Identities = 69/234 (29%), Positives = 109/234 (46%), Gaps = 41/234 (17%)

Query: 28  FIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVS------AMI 81
           + FD  +G   + E IF+  + P++ H+ +G NAT+ AYG TG GKT+T+        +I
Sbjct: 49  YQFDAFYGTECTQEDIFSREVKPIVPHLLSGQNATVFAYGSTGAGKTHTMLGDPNEPGLI 108

Query: 82  MKTLQHVMQRCNKD----DVYMSYLQLYSEKCYDLLNG-NKEVTLKNWIFNIPQSDQRVQ 136
            +TL  +++   K        MSY ++Y+EK YDLL    KE+ ++         D  + 
Sbjct: 109 PRTLSDLLRMGRKQAWTGAFSMSYYEIYNEKVYDLLEPAKKELPIRE------DKDGNIL 162

Query: 137 GPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSSRSHTI 196
                               LT K + S+    +  +  ++N+  A TK+N  SSRSH +
Sbjct: 163 IV-----------------GLTSKPIKSMAEFEEAYIPASKNRTVAATKLNDNSSRSHAV 205

Query: 197 CTIYLGAMA-------KLHLVDLAGSEQLFSLSDNYLLRNEARKINLSLHYLEQ 243
             I +   A       KL+L+DLAGSE      +  +   E+  IN SL  L +
Sbjct: 206 LRIKVTQPASNIQLEGKLNLIDLAGSEDNRRTGNEGIRLKESAAINSSLFVLSK 259


>gnl|CDD|238661 cd01365, KISc_KIF1A_KIF1B, Kinesin motor domain, KIF1_like
           proteins. KIF1A (Unc104) transports synaptic vesicles to
           the nerve  terminal, KIF1B has been implicated in
           transport of mitochondria. Both proteins are expressed
           in neurons. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           In contrast to the majority of dimeric kinesins, most
           KIF1A/Unc104 kinesins are monomeric motors. A
           lysine-rich loop in KIF1A binds to the negatively
           charged C-terminus of tubulin and compensates for the
           lack of a second motor domain, allowing KIF1A to move
           processively.
          Length = 356

 Score =  110 bits (278), Expect = 3e-28
 Identities = 73/244 (29%), Positives = 109/244 (44%), Gaps = 52/244 (21%)

Query: 28  FIFDNIF---GPNDSN----ETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTV--- 77
           F FD+ +      D +    E +F ++   L++H F G N  L AYGQTG GK+YT+   
Sbjct: 50  FSFDHSYWSHDSEDPHYASQEDVFEDLGRELLDHAFEGYNVCLFAYGQTGSGKSYTMMGY 109

Query: 78  ---SAMIMKTLQHVMQRCNKD-------DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFN 127
                +I +  + + QR           +V +SY+++Y+EK  DLLN  K+         
Sbjct: 110 KEEKGIIPRLCEELFQRIESKKEQNLSYEVEVSYMEIYNEKVRDLLNPKKK--------- 160

Query: 128 IPQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMN 187
             + + +V+    P  GP           L+   + S      L+  GN+++ TA T MN
Sbjct: 161 -NKGNLKVR--EHPVLGPYVED-------LSKVAVTSYEDIQNLLEEGNKSRTTASTNMN 210

Query: 188 AQSSRSHTICTIYL-------------GAMAKLHLVDLAGSEQLFSLSDNYLLRNEARKI 234
             SSRSH + TI L               ++K+ LVDLAGSE+  S         E   I
Sbjct: 211 DTSSRSHAVFTIVLTQKKLDKETDLTTEKVSKISLVDLAGSERASSTGAEGDRLKEGSNI 270

Query: 235 NLSL 238
           N SL
Sbjct: 271 NKSL 274


>gnl|CDD|227392 COG5059, KIP1, Kinesin-like protein [Cytoskeleton].
          Length = 568

 Score =  107 bits (268), Expect = 3e-26
 Identities = 76/242 (31%), Positives = 111/242 (45%), Gaps = 43/242 (17%)

Query: 23  FDHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVSA--- 79
                + FD +FGP+ + E ++ E + PLI+ +  G N T+ AYGQTG GKTYT+S    
Sbjct: 53  SKEGTYAFDKVFGPSATQEDVYEETIKPLIDSLLLGYNCTVFAYGQTGSGKTYTMSGTEE 112

Query: 80  ------MIMKTLQHVMQRCNKDDVY---MSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQ 130
                 + +K L   ++  +    +   +SYL++Y+EK YDLL+ N+E            
Sbjct: 113 EPGIIPLSLKELFSKLEDLSMTKDFAVSISYLEIYNEKIYDLLSPNEE------------ 160

Query: 131 SDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQS 190
                               G   A LT K + S    + L+  G +N+ TA T++N +S
Sbjct: 161 ----------SLNIREDSLLGVKVAGLTEKHVSSKEEILDLLRKGEKNRTTASTEINDES 210

Query: 191 SRSHTICTIYLG---------AMAKLHLVDLAGSEQLFSLSDNYLLRNEARKINLSLHYL 241
           SRSH+I  I L            +KL LVDLAGSE+     +      E   IN SL  L
Sbjct: 211 SRSHSIFQIELASKNKVSGTSETSKLSLVDLAGSERAARTGNRGTRLKEGASINKSLLTL 270

Query: 242 EQ 243
             
Sbjct: 271 GN 272


>gnl|CDD|238669 cd01373, KISc_KLP2_like, Kinesin motor domain, KLP2-like subgroup.
           Members of this subgroup seem to play a role in mitosis
           and meiosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 337

 Score = 99.1 bits (247), Expect = 4e-24
 Identities = 67/263 (25%), Positives = 113/263 (42%), Gaps = 57/263 (21%)

Query: 16  QTKIWLFFDHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTY 75
            T +W     ++F FD++   N + E +F  V  PL+    +G N ++ AYGQTG GKTY
Sbjct: 31  DTLVWHSHPPRMFTFDHVADSNTNQEDVFQSVGKPLVEDCLSGYNGSIFAYGQTGSGKTY 90

Query: 76  TV--------------SAMIMKTLQHVMQRCNKDD----------VYMSYLQLYSEKCYD 111
           T+                +I +  +++     +++             S+L++Y+E+  D
Sbjct: 91  TMMGPSSSDDESPHGLQGVIPRIFEYLFSLIQREEEKRGDGLKFLCKCSFLEIYNEQITD 150

Query: 112 LLNGNKEVTLKNWIFNIPQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQL 171
           LL+                + + ++   D          G    +LT + + S     Q+
Sbjct: 151 LLD---------------PTSRNLKIRED-------IKKGVYVENLTEEYVSSYEDVYQV 188

Query: 172 IVTGNENKVTAVTKMNAQSSRSHTICTIYLGAMAK-----------LHLVDLAGSEQLFS 220
           ++ G  N+  A T MN++SSRSH + T  + +  K           L+LVDLAGSE+   
Sbjct: 189 LLKGLSNRKVAATSMNSESSRSHAVFTCTIESWEKKASSTNIRTSRLNLVDLAGSERQKD 248

Query: 221 LSDNYLLRNEARKINLSLHYLEQ 243
                +   EA+ IN SL  L  
Sbjct: 249 DGAEGVRLKEAKNINKSLSTLGH 271


>gnl|CDD|238664 cd01368, KISc_KIF23_like, Kinesin motor domain, KIF23-like
           subgroup. Members of this group may play a role in
           mitosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 345

 Score = 99.0 bits (247), Expect = 5e-24
 Identities = 72/246 (29%), Positives = 110/246 (44%), Gaps = 43/246 (17%)

Query: 23  FDHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVS---- 78
                F F  +FGPN + +  F    +PL+  +  G N+ L  YG T  GKTYT+     
Sbjct: 52  QKETKFSFSKVFGPNTTQKEFFEGTALPLVQDLLKGKNSLLFTYGVTNSGKTYTMQGSPG 111

Query: 79  --AMIMKTLQHVMQRCNKDDVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQSDQRVQ 136
              ++ ++L  +        V++SY+++Y+   YDLL  +   T K     + + D    
Sbjct: 112 DGGILPRSLDVIFNSIGGYSVFVSYVEIYNNYIYDLLEDSPSSTKKRQSLRL-REDH--- 167

Query: 137 GPPDPQTGPASPGNGRS-AASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSSRSHT 195
                        NG    A LT  ++ S   A ++   G +N+  A TK+N +SSRSH+
Sbjct: 168 -------------NGNMYVAGLTEVEVSSTEEAREVFKRGQKNRRVAGTKLNRESSRSHS 214

Query: 196 ICTIYLG-----------------AMAKLHLVDLAGSEQLFSLSDNYLLR-NEARKINLS 237
           + TI L                   +++L LVDLAGSE+  S + N   R  EA  IN S
Sbjct: 215 VFTIKLVQAPGDSDGDVDQDKDQITVSQLSLVDLAGSERT-SRTQNTGERLKEAGNINTS 273

Query: 238 LHYLEQ 243
           L  L +
Sbjct: 274 LMTLGK 279


>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
          Length = 1320

 Score = 76.5 bits (188), Expect = 1e-15
 Identities = 70/256 (27%), Positives = 112/256 (43%), Gaps = 62/256 (24%)

Query: 26  QVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTV-------- 77
           Q F FD+I  P  + E IF  V  PL+ +   G N+++ AYGQTG GKTYT+        
Sbjct: 132 QTFTFDSIADPESTQEDIFQLVGAPLVENCLAGFNSSVFAYGQTGSGKTYTMWGPANGLL 191

Query: 78  --------SAMIMKTLQHVMQRCNKDDV-----------YMSYLQLYSEKCYDLLNGNKE 118
                     +  +  + +  R N++ +             S+L++Y+E+  DLL+    
Sbjct: 192 EEHLSGDQQGLTPRVFERLFARINEEQIKHADRQLKYQCRCSFLEIYNEQITDLLD---- 247

Query: 119 VTLKNWIFNIPQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNEN 178
                       S + +Q   D ++G           +LT + + ++    QL++ G  N
Sbjct: 248 -----------PSQKNLQIREDVKSGV-------YVENLTEEYVKTMKDVTQLLIKGLSN 289

Query: 179 KVTAVTKMNAQSSRSHTICTIYLGAMAK-------------LHLVDLAGSEQLFSLSDNY 225
           + T  T +NA+SSRSH++ T  + +  K             ++LVDLAGSE+        
Sbjct: 290 RRTGATSINAESSRSHSVFTCVVESRCKSVADGLSSFKTSRINLVDLAGSERQKLTGAAG 349

Query: 226 LLRNEARKINLSLHYL 241
               EA  IN SL  L
Sbjct: 350 DRLKEAGNINRSLSQL 365


>gnl|CDD|238659 cd01363, Motor_domain, Myosin and Kinesin motor domain. These
           ATPases belong to the P-loop NTPase family and provide
           the driving force in myosin and kinesin mediated
           processes.
          Length = 186

 Score = 51.8 bits (124), Expect = 4e-08
 Identities = 31/84 (36%), Positives = 41/84 (48%), Gaps = 12/84 (14%)

Query: 170 QLIVTGNENKVTAVTKMNAQSSRSHTICTIYLGA------------MAKLHLVDLAGSEQ 217
            L+  GN N+ TA T MN  SSRSH++  I+ G             + K++LVDLAGSE+
Sbjct: 60  DLMDKGNANRTTAATAMNEHSSRSHSVFRIHFGGKNALASATEQPKVGKINLVDLAGSER 119

Query: 218 LFSLSDNYLLRNEARKINLSLHYL 241
           +           E   IN SL  L
Sbjct: 120 IDFSGAEGSRLTETANINKSLSTL 143



 Score = 47.9 bits (114), Expect = 9e-07
 Identities = 20/54 (37%), Positives = 31/54 (57%), Gaps = 7/54 (12%)

Query: 43 IFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTV------SAMIMKTLQHVMQ 90
          +F +V  PL+    +G N  + AYGQTG GKTYT+      + +I +T+  V+ 
Sbjct: 8  VFRDV-GPLLQSALDGYNVCIFAYGQTGSGKTYTMEGKREGAGIIPRTVTDVID 60


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 40.7 bits (95), Expect = 7e-04
 Identities = 20/36 (55%), Positives = 29/36 (80%), Gaps = 3/36 (8%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           + EEEEE+ +EEEEEEE+   ++++E+EEEE EE L
Sbjct: 862 DSEEEEEEEEEEEEEEEE---EEEEEEEEEENEEPL 894



 Score = 36.5 bits (84), Expect = 0.013
 Identities = 14/35 (40%), Positives = 26/35 (74%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++     + G  EEEEEE+++ ++++E+EEEE+EE
Sbjct: 852 VDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEE 886



 Score = 28.4 bits (63), Expect = 4.9
 Identities = 14/62 (22%), Positives = 34/62 (54%)

Query: 214 GSEQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
              Q  + +D+  +++E  +  L+     + +++EKG +     +    ++++E+EEEE+
Sbjct: 815 HEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEE 874

Query: 274 EE 275
           EE
Sbjct: 875 EE 876



 Score = 27.7 bits (61), Expect = 8.2
 Identities = 15/37 (40%), Positives = 22/37 (59%), Gaps = 2/37 (5%)

Query: 242 EQE-EEEEKGKEEEEEEEKK-KGKKKKEKEEEEKEEN 276
           E E E  E+G+E E+E E + +GK + E E + KE  
Sbjct: 729 EGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETE 765



 Score = 27.7 bits (61), Expect = 9.0
 Identities = 11/33 (33%), Positives = 18/33 (54%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           + E E KG+ E E E   + K ++E E E + +
Sbjct: 670 EGETETKGENESEGEIPAERKGEQEGEGEIEAK 702


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 39.1 bits (92), Expect = 0.002
 Identities = 18/79 (22%), Positives = 36/79 (45%), Gaps = 9/79 (11%)

Query: 200 YLGAMAKLHLVDLAGSEQLFSLSD---NYLLRNEARKINLSLHYLEQEEEEEKGKEEEEE 256
           +L  + K +    A       L++    +L  ++            ++  E+  K+ EEE
Sbjct: 373 FLSIIFKHNPELAARLAAFLELTEEEIEFLTGSKKATK------KIKKIVEKAEKKREEE 426

Query: 257 EEKKKGKKKKEKEEEEKEE 275
           +++KK K    K++EE+EE
Sbjct: 427 KKEKKKKAFAGKKKEEEEE 445



 Score = 36.8 bits (86), Expect = 0.009
 Identities = 18/34 (52%), Positives = 29/34 (85%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E++++   GK++EEEEE++K KK++EKEEEE+E 
Sbjct: 429 EKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462



 Score = 36.1 bits (84), Expect = 0.016
 Identities = 13/35 (37%), Positives = 30/35 (85%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
            +++EEE+ +E+E++EE+K+ ++++ +EE+E+EE 
Sbjct: 437 GKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471



 Score = 35.3 bits (82), Expect = 0.026
 Identities = 19/38 (50%), Positives = 28/38 (73%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           +++    K KEEEEEEEK+K +++KE+EEEE EE   +
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468



 Score = 34.9 bits (81), Expect = 0.039
 Identities = 14/34 (41%), Positives = 27/34 (79%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++++     K+EEEEEE+K+ K+++++EEEE+ E
Sbjct: 430 KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463



 Score = 33.7 bits (78), Expect = 0.10
 Identities = 18/35 (51%), Positives = 31/35 (88%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E+EEEEEK K+EEE+EE+++  +++++EEEEK++ 
Sbjct: 441 EEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 33.0 bits (76), Expect = 0.19
 Identities = 15/34 (44%), Positives = 31/34 (91%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++EEEEE+ +++EEE+E+++ + ++EKEEEE+++
Sbjct: 440 KEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473



 Score = 32.6 bits (75), Expect = 0.23
 Identities = 15/34 (44%), Positives = 31/34 (91%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++EEEEE+ KE++EEE++++ ++ +E++EEE+E+
Sbjct: 439 KKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 36.3 bits (83), Expect = 0.003
 Identities = 12/39 (30%), Positives = 26/39 (66%)

Query: 237 SLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +LH+LE EE+ +   +EEE+++ +  +   E ++E+ +E
Sbjct: 2   TLHFLEGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 38.0 bits (89), Expect = 0.004
 Identities = 20/71 (28%), Positives = 33/71 (46%), Gaps = 9/71 (12%)

Query: 207 LHLVDLAGSEQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKK 266
           L+LVD            N  L  E  +        E+E+  +  +EE +EE ++K ++KK
Sbjct: 242 LYLVDKLA---------NLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKK 292

Query: 267 EKEEEEKEENL 277
           ++E E K   L
Sbjct: 293 KEEREAKLAKL 303


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 37.6 bits (87), Expect = 0.006
 Identities = 17/38 (44%), Positives = 27/38 (71%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            +E + E GKEEE+E+E+ K +KKK+KE+ ++E    K
Sbjct: 95  AKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132



 Score = 32.9 bits (75), Expect = 0.16
 Identities = 17/38 (44%), Positives = 26/38 (68%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           E  +EEEK KE+ +EE+KKK +K KE+ ++ K +   K
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138



 Score = 31.4 bits (71), Expect = 0.49
 Identities = 14/37 (37%), Positives = 24/37 (64%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           +EE +EK   +E+E+EK+K  ++    EEEK+   V+
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVR 170



 Score = 31.0 bits (70), Expect = 0.71
 Identities = 10/35 (28%), Positives = 24/35 (68%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             +  +E K +  +EEE++K+  K+++K+++EK +
Sbjct: 91  KTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPK 125



 Score = 30.6 bits (69), Expect = 0.83
 Identities = 14/34 (41%), Positives = 23/34 (67%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           EQ +EE+K K+E+ +EE K  K K+E +E+   +
Sbjct: 111 EQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPK 144



 Score = 30.2 bits (68), Expect = 1.4
 Identities = 12/35 (34%), Positives = 23/35 (65%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           + +EE++K KE+ +EE K +  K++ KE+   +E 
Sbjct: 112 QVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEK 146



 Score = 29.5 bits (66), Expect = 2.1
 Identities = 11/34 (32%), Positives = 25/34 (73%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +++ +EEK K++E+ +E+ K +K KE+ +E++  
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143



 Score = 29.5 bits (66), Expect = 2.1
 Identities = 11/34 (32%), Positives = 24/34 (70%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            +EE +++  +EE +E++   +K+KEKE++ +E 
Sbjct: 124 PKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157



 Score = 29.5 bits (66), Expect = 2.2
 Identities = 11/38 (28%), Positives = 22/38 (57%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           +    + K  +E + E  K+ +K+KE+ +EEK++   K
Sbjct: 86  KGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEK 123



 Score = 29.1 bits (65), Expect = 2.7
 Identities = 12/35 (34%), Positives = 25/35 (71%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           + +EE +  K +EE +EK+  K+K++++E++ EE 
Sbjct: 123 KPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157



 Score = 28.7 bits (64), Expect = 3.6
 Identities = 12/35 (34%), Positives = 28/35 (80%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           ++E +E++  +E+E+E++KK ++ +++EEE+K E 
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRER 168



 Score = 28.7 bits (64), Expect = 3.7
 Identities = 15/35 (42%), Positives = 26/35 (74%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E+ +EE K ++ +EE ++K+  K+KEKE+E+K E 
Sbjct: 122 EKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156



 Score = 28.3 bits (63), Expect = 4.8
 Identities = 12/35 (34%), Positives = 24/35 (68%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E+ +E+   KE+E+E+EKK  + +  +EE+++E  
Sbjct: 135 EEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERV 169



 Score = 27.9 bits (62), Expect = 6.2
 Identities = 11/35 (31%), Positives = 16/35 (45%)

Query: 239 HYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
              ++ E        ++  +KK   KKKE  EEEK
Sbjct: 161 EEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEK 195



 Score = 27.9 bits (62), Expect = 6.4
 Identities = 13/35 (37%), Positives = 22/35 (62%), Gaps = 1/35 (2%)

Query: 242 EQEEEEEKGKEEE-EEEEKKKGKKKKEKEEEEKEE 275
           E+E+E+EK  EE  + EE+KK ++ + K   +K  
Sbjct: 145 EKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPP 179



 Score = 27.9 bits (62), Expect = 6.6
 Identities = 11/34 (32%), Positives = 26/34 (76%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E+E+E+ K ++++++E+ K+  K ++ +EE KE+
Sbjct: 107 EKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140



 Score = 27.5 bits (61), Expect = 9.7
 Identities = 9/34 (26%), Positives = 18/34 (52%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E+++ E    +   ++  KK    K+KE  E+E+
Sbjct: 162 EEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEK 195


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 37.4 bits (87), Expect = 0.006
 Identities = 19/32 (59%), Positives = 26/32 (81%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           EEEEE+ +E++EEEEK   K+++  EEEEKEE
Sbjct: 38  EEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE 69



 Score = 37.1 bits (86), Expect = 0.008
 Identities = 15/34 (44%), Positives = 24/34 (70%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E+E+  +K +E +EEEEK++ KKK +K +E   E
Sbjct: 50  EEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTE 83



 Score = 36.3 bits (84), Expect = 0.014
 Identities = 18/38 (47%), Positives = 24/38 (63%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
             ++EEE  +EEE+EE+KKK KK KE   E +  N  K
Sbjct: 54  TTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTK 91



 Score = 35.9 bits (83), Expect = 0.021
 Identities = 18/36 (50%), Positives = 28/36 (77%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           +EEEE+ KEE++EEE+K   K++E +EEE++E   K
Sbjct: 37  DEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKK 72



 Score = 34.4 bits (79), Expect = 0.068
 Identities = 15/32 (46%), Positives = 23/32 (71%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           E+EE+    +EE +EEE+K+ KKKK K+ +E 
Sbjct: 49  EEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80



 Score = 34.0 bits (78), Expect = 0.072
 Identities = 15/38 (39%), Positives = 28/38 (73%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           E  +EEE+ ++EE++EE++K   K+E+ +EE+E+   K
Sbjct: 34  EVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKK 71



 Score = 34.0 bits (78), Expect = 0.094
 Identities = 18/36 (50%), Positives = 27/36 (75%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           YL  E+E EK   +EEEEE+K+ KK++E++  +KEE
Sbjct: 24  YLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEE 59



 Score = 32.8 bits (75), Expect = 0.17
 Identities = 13/34 (38%), Positives = 26/34 (76%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++EEEE+   +EEE +E+++ ++KK+K ++ KE 
Sbjct: 47  KKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80



 Score = 32.0 bits (73), Expect = 0.34
 Identities = 21/49 (42%), Positives = 28/49 (57%), Gaps = 11/49 (22%)

Query: 242 EQEEEEEKGKE-----------EEEEEEKKKGKKKKEKEEEEKEENLVK 279
           E+E+EE+K +E           EEEE+E+KK K KK KE   + E L K
Sbjct: 41  EEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNK 89



 Score = 32.0 bits (73), Expect = 0.35
 Identities = 18/37 (48%), Positives = 25/37 (67%), Gaps = 3/37 (8%)

Query: 242 EQEEEEEKGKEEEE---EEEKKKGKKKKEKEEEEKEE 275
           E+E E+E   EEEE   EE+K++ +K  +KEEE  EE
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEE 64


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 36.9 bits (86), Expect = 0.006
 Identities = 14/34 (41%), Positives = 23/34 (67%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++EE+EEK +E+ +   K K KK  + + EEKE+
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEK 71



 Score = 33.9 bits (78), Expect = 0.060
 Identities = 12/37 (32%), Positives = 21/37 (56%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            EEE+E+ +EE+ +   K   KK  K + E++E   +
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKR 74



 Score = 28.1 bits (63), Expect = 5.2
 Identities = 14/53 (26%), Positives = 25/53 (47%), Gaps = 15/53 (28%)

Query: 242 EQEEEEEKGKE----EEEEEE-----------KKKGKKKKEKEEEEKEENLVK 279
           + E+E++  K+    EE+EE+           K K   K + EE+EK +   +
Sbjct: 25  DDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKE 77



 Score = 27.3 bits (61), Expect = 9.3
 Identities = 12/35 (34%), Positives = 21/35 (60%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           + + EEK K + E+EEK   + +++  E+E  E L
Sbjct: 63  KAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKL 97


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 37.0 bits (86), Expect = 0.006
 Identities = 12/32 (37%), Positives = 23/32 (71%)

Query: 246 EEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           E EK K+E  EE++++ ++K  +EE+ +EE +
Sbjct: 150 ELEKIKKERAEEKEREEEEKAAEEEKAREEEI 181


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 35.7 bits (83), Expect = 0.008
 Identities = 19/46 (41%), Positives = 30/46 (65%), Gaps = 1/46 (2%)

Query: 232 RKINLSLHYLEQE-EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           R I++     E+E E+EEK ++EE+E+E  KG  +KE+ E+  EE 
Sbjct: 10  RIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEEL 55


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 36.2 bits (84), Expect = 0.010
 Identities = 18/60 (30%), Positives = 31/60 (51%), Gaps = 7/60 (11%)

Query: 221 LSDNYLLRNEARKINLSLHYLEQEEEE-------EKGKEEEEEEEKKKGKKKKEKEEEEK 273
              NY +R++ R+    L   ++EEEE       E G E+ E+   K+  K+++K E E+
Sbjct: 101 SDPNYEIRSDVRRQLRFLAQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEE 160


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 34.0 bits (78), Expect = 0.011
 Identities = 8/47 (17%), Positives = 26/47 (55%)

Query: 230 EARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E  K+ ++   + + EE E   ++E++++      K + ++++ E++
Sbjct: 33  EEEKVEIARSVMIEGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDD 79



 Score = 25.9 bits (57), Expect = 9.1
 Identities = 4/30 (13%), Positives = 19/30 (63%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEE 271
           E ++++E   +++++++K       ++++E
Sbjct: 51  EDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 35.0 bits (81), Expect = 0.024
 Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 8/88 (9%)

Query: 197 CTIYLGAMAKLHLVDLAGSEQLFSLSDN----YLLR--NEARKINLSLHYLEQEEEEEKG 250
             I   ++AK+  V   G ++ +  SD     +LL+     +K  L +   E E   +K 
Sbjct: 116 LPILESSLAKICDVKEEGDDKFYKYSDEKLLAWLLKKVERLKKKELDIK--EAEAARDKK 173

Query: 251 KEEEEEEEKKKGKKKKEKEEEEKEENLV 278
           K    EEEKKK KKK  K+++ K+   V
Sbjct: 174 KSNNAEEEKKKKKKKSAKKKKLKKVAAV 201


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 34.4 bits (79), Expect = 0.030
 Identities = 17/45 (37%), Positives = 29/45 (64%), Gaps = 3/45 (6%)

Query: 230 EARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           E   INL   + E+ E+E+K   +E EE++    KK++KE++EK+
Sbjct: 84  EGYSINL---FPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125



 Score = 34.0 bits (78), Expect = 0.032
 Identities = 13/32 (40%), Positives = 24/32 (75%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           EE E++ KE  +E E+K+  K K++++E+KE+
Sbjct: 93  EESEKEQKEVSKETEEKEAIKAKKEKKEKKEK 124



 Score = 30.5 bits (69), Expect = 0.60
 Identities = 17/38 (44%), Positives = 26/38 (68%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            ++E++E  KE EE+E  K  K+KKEK+E++  E L K
Sbjct: 95  SEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAK 132


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 34.7 bits (80), Expect = 0.031
 Identities = 12/37 (32%), Positives = 25/37 (67%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           L   E++ K  E E+ +E+ +  +++E+EEEE++E+ 
Sbjct: 153 LSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDF 189



 Score = 34.3 bits (79), Expect = 0.042
 Identities = 14/36 (38%), Positives = 24/36 (66%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
            +    E+K KE E E+  ++ +K +E+EEEE+EE+
Sbjct: 151 EKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEED 186



 Score = 27.4 bits (61), Expect = 8.0
 Identities = 12/38 (31%), Positives = 21/38 (55%), Gaps = 6/38 (15%)

Query: 244 EEEEEKGKEEEEEEEKKKG------KKKKEKEEEEKEE 275
           EEE+   K    E++ K+       ++ ++ EEEE+EE
Sbjct: 145 EEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEE 182


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 33.0 bits (76), Expect = 0.035
 Identities = 9/27 (33%), Positives = 18/27 (66%)

Query: 251 KEEEEEEEKKKGKKKKEKEEEEKEENL 277
              EE++E+++ +++KE+ EEE    L
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEEAAAGL 101



 Score = 33.0 bits (76), Expect = 0.040
 Identities = 9/24 (37%), Positives = 16/24 (66%)

Query: 252 EEEEEEEKKKGKKKKEKEEEEKEE 275
                 E+KK ++++E+E+EE EE
Sbjct: 72  AAAAAAEEKKEEEEEEEEKEESEE 95



 Score = 31.5 bits (72), Expect = 0.12
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 246 EEEKGKEEEEEEEKKKGKKKKEKEEE 271
                  EE++EE+++ ++K+E EEE
Sbjct: 71  AAAAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 31.5 bits (72), Expect = 0.13
 Identities = 9/24 (37%), Positives = 17/24 (70%)

Query: 252 EEEEEEEKKKGKKKKEKEEEEKEE 275
                EEKK+ ++++E++EE +EE
Sbjct: 73  AAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 28.8 bits (65), Expect = 1.2
 Identities = 10/17 (58%), Positives = 15/17 (88%)

Query: 242 EQEEEEEKGKEEEEEEE 258
           E++EEEE+ +E+EE EE
Sbjct: 79  EKKEEEEEEEEKEESEE 95



 Score = 28.8 bits (65), Expect = 1.3
 Identities = 11/16 (68%), Positives = 14/16 (87%)

Query: 242 EQEEEEEKGKEEEEEE 257
           ++EEEEE+ KEE EEE
Sbjct: 81  KEEEEEEEEKEESEEE 96



 Score = 28.4 bits (64), Expect = 1.4
 Identities = 10/17 (58%), Positives = 15/17 (88%)

Query: 242 EQEEEEEKGKEEEEEEE 258
           ++EEEEE+ ++EE EEE
Sbjct: 80  KKEEEEEEEEKEESEEE 96



 Score = 28.4 bits (64), Expect = 1.5
 Identities = 9/17 (52%), Positives = 15/17 (88%)

Query: 242 EQEEEEEKGKEEEEEEE 258
           E+++EEE+ +EE+EE E
Sbjct: 78  EEKKEEEEEEEEKEESE 94



 Score = 28.0 bits (63), Expect = 1.9
 Identities = 6/24 (25%), Positives = 15/24 (62%)

Query: 252 EEEEEEEKKKGKKKKEKEEEEKEE 275
                  ++K ++++E+EE+E+ E
Sbjct: 71  AAAAAAAEEKKEEEEEEEEKEESE 94



 Score = 27.6 bits (62), Expect = 3.0
 Identities = 10/17 (58%), Positives = 12/17 (70%)

Query: 242 EQEEEEEKGKEEEEEEE 258
           E+EEEEEK + EEE   
Sbjct: 83  EEEEEEEKEESEEEAAA 99



 Score = 27.2 bits (61), Expect = 3.8
 Identities = 8/17 (47%), Positives = 14/17 (82%)

Query: 242 EQEEEEEKGKEEEEEEE 258
            +E++EE+ +EEE+EE 
Sbjct: 77  AEEKKEEEEEEEEKEES 93


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 35.0 bits (81), Expect = 0.040
 Identities = 24/73 (32%), Positives = 42/73 (57%), Gaps = 4/73 (5%)

Query: 207 LHLVDLAGSEQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKK 266
           L L D    E L +L     +R E +    S+  L+ +EE ++ KEE+E  +++K  +K 
Sbjct: 517 LQLCDKLRDEWLPNLG----IRIEDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKL 572

Query: 267 EKEEEEKEENLVK 279
           +K+EE+K++ L K
Sbjct: 573 KKQEEKKKKELEK 585



 Score = 34.2 bits (79), Expect = 0.063
 Identities = 13/37 (35%), Positives = 26/37 (70%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           L++E+EE++  +E++   K K +++K+K+E EK E  
Sbjct: 553 LQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKA 589



 Score = 30.8 bits (70), Expect = 0.99
 Identities = 13/36 (36%), Positives = 23/36 (63%)

Query: 238 LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           L   ++E+E  K ++   + +K++ KKKKE E+ EK
Sbjct: 553 LQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 33.9 bits (78), Expect = 0.046
 Identities = 9/33 (27%), Positives = 23/33 (69%)

Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
            +++K K+++++++ KK  KK +K E++ E+  
Sbjct: 86  WKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118



 Score = 33.5 bits (77), Expect = 0.055
 Identities = 8/32 (25%), Positives = 24/32 (75%)

Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
            +++K K+++++++KK  K +K+ E+E +++ 
Sbjct: 91  SKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122



 Score = 32.7 bits (75), Expect = 0.10
 Identities = 12/34 (35%), Positives = 27/34 (79%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           +++++K K+++++++  K +KK EKE E+K E+L
Sbjct: 92  KKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDL 125



 Score = 32.4 bits (74), Expect = 0.15
 Identities = 17/34 (50%), Positives = 26/34 (76%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E  EE EK K+E EE++K K KKKK K++++K++
Sbjct: 66  ELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDK 99



 Score = 32.4 bits (74), Expect = 0.16
 Identities = 8/33 (24%), Positives = 23/33 (69%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            ++++ K K+++++++K   K  K ++++EKE 
Sbjct: 86  WKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118



 Score = 31.6 bits (72), Expect = 0.31
 Identities = 13/36 (36%), Positives = 26/36 (72%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           L +E E+ K + EE+++ K K KK K+K++++K++ 
Sbjct: 67  LAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKK 102



 Score = 31.2 bits (71), Expect = 0.34
 Identities = 10/35 (28%), Positives = 27/35 (77%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +++E EE++  + ++++ KKK  K K+K++++K++
Sbjct: 74  VKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDD 108



 Score = 30.4 bits (69), Expect = 0.62
 Identities = 10/31 (32%), Positives = 24/31 (77%)

Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           EE++K K ++++ +KKK K K +K++++ ++
Sbjct: 79  EEKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109



 Score = 30.4 bits (69), Expect = 0.67
 Identities = 14/34 (41%), Positives = 25/34 (73%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           +E  EE  K ++E EEK+K K KK+K +++K+++
Sbjct: 65  KELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKD 98



 Score = 29.7 bits (67), Expect = 1.1
 Identities = 8/34 (23%), Positives = 24/34 (70%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++ + ++K  +++++++K K   KK+ + E+K+E
Sbjct: 82  QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115



 Score = 29.7 bits (67), Expect = 1.3
 Identities = 8/34 (23%), Positives = 26/34 (76%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++++ ++K  +++++++ KK  K ++K+E+E E+
Sbjct: 87  KKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 34.9 bits (80), Expect = 0.052
 Identities = 17/37 (45%), Positives = 29/37 (78%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLV 278
           E+EE  E+ +EEEEEEE+++ + ++E+ E+E+EE  V
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEV 476



 Score = 33.7 bits (77), Expect = 0.11
 Identities = 15/34 (44%), Positives = 28/34 (82%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++ EEEE  +EEEEEEE+++ ++++ +EEE ++E
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDE 470



 Score = 33.0 bits (75), Expect = 0.17
 Identities = 18/34 (52%), Positives = 26/34 (76%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E EEEE   +EEEEEEE+++ +++ E+EE E EE
Sbjct: 438 ESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEE 471



 Score = 32.6 bits (74), Expect = 0.25
 Identities = 15/34 (44%), Positives = 27/34 (79%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            +EEE  + +EEEEEEE+++ ++ +E+E E++EE
Sbjct: 439 SEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEE 472



 Score = 32.2 bits (73), Expect = 0.36
 Identities = 19/41 (46%), Positives = 28/41 (68%)

Query: 235 NLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             S     QE EEE+  EEEEEEE+++ ++++E EEEE E+
Sbjct: 429 GESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGED 469



 Score = 28.7 bits (64), Expect = 3.9
 Identities = 16/34 (47%), Positives = 27/34 (79%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             EEEEE+ +EEEEEE++ + ++ +++EEEE+ E
Sbjct: 444 SVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVE 477



 Score = 28.0 bits (62), Expect = 7.0
 Identities = 13/34 (38%), Positives = 25/34 (73%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E+EEEE++ +EEE E+E+++ + + +   EE+ E
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 33.9 bits (78), Expect = 0.053
 Identities = 13/52 (25%), Positives = 31/52 (59%)

Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
               +         ++EEE E+ +E+++EEE++K ++++ ++E+E+ E L  
Sbjct: 29  EERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKS 80



 Score = 33.1 bits (76), Expect = 0.085
 Identities = 15/36 (41%), Positives = 26/36 (72%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
            E+EE EE+ K EE+ E ++K +++ E+E E+K+E 
Sbjct: 23  AEEEEREERKKLEEKREGERKEEEELEEEREKKKEE 58



 Score = 33.1 bits (76), Expect = 0.094
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             E+E EE K  EE+ E E+K+ ++ +E+ E++KEE
Sbjct: 23  AEEEEREERKKLEEKREGERKEEEELEEEREKKKEE 58



 Score = 30.1 bits (68), Expect = 1.0
 Identities = 12/34 (35%), Positives = 25/34 (73%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E EEEE + +++ EE+ + + K+++E EEE +++
Sbjct: 22  EAEEEEREERKKLEEKREGERKEEEELEEEREKK 55



 Score = 29.3 bits (66), Expect = 1.4
 Identities = 15/37 (40%), Positives = 28/37 (75%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
             ++E EEE+ +E ++ EEK++G++K+E+E EE+ E 
Sbjct: 18  RQQREAEEEEREERKKLEEKREGERKEEEELEEEREK 54



 Score = 29.3 bits (66), Expect = 1.5
 Identities = 15/35 (42%), Positives = 25/35 (71%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            +Q   +++  EEEE EE+KK ++K+E E +E+EE
Sbjct: 13  EKQARRQQREAEEEEREERKKLEEKREGERKEEEE 47



 Score = 27.7 bits (62), Expect = 4.8
 Identities = 13/37 (35%), Positives = 21/37 (56%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
             E++E EE+ ++E+EE EK K     E+E  +K   
Sbjct: 58  EEERKEREEQARKEQEEYEKLKSSFVVEEEGTDKLSA 94



 Score = 27.4 bits (61), Expect = 6.1
 Identities = 15/52 (28%), Positives = 30/52 (57%)

Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
             E R+         +EE E+K +EEE +E +++ +K++E+ E+ K   +V+
Sbjct: 34  LEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVE 85



 Score = 27.0 bits (60), Expect = 8.2
 Identities = 12/36 (33%), Positives = 26/36 (72%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             E+ EE +K +E+ E E K++ + ++E+E++++EE
Sbjct: 24  EEEEREERKKLEEKREGERKEEEELEEEREKKKEEE 59


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 32.7 bits (75), Expect = 0.055
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 251 KEEEEEEEKKKGKKKKEKEEEEKEENL 277
             EEEEEE+++ ++++E+ EEE    L
Sbjct: 74  AAEEEEEEEEEEEEEEEESEEEAMAGL 100



 Score = 32.3 bits (74), Expect = 0.065
 Identities = 9/25 (36%), Positives = 17/25 (68%)

Query: 252 EEEEEEEKKKGKKKKEKEEEEKEEN 276
                EE+++ ++++E+EEEE EE 
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 30.8 bits (70), Expect = 0.24
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 250 GKEEEEEEEKKKGKKKKEKEEEE 272
               EEEEE+++ ++++E+E EE
Sbjct: 72  AAAAEEEEEEEEEEEEEEEESEE 94



 Score = 30.4 bits (69), Expect = 0.29
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 247 EEKGKEEEEEEEKKKGKKKKEKEEE 271
                EEEEEEE+++ ++++E EEE
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 30.0 bits (68), Expect = 0.47
 Identities = 9/25 (36%), Positives = 18/25 (72%)

Query: 250 GKEEEEEEEKKKGKKKKEKEEEEKE 274
                EEEE+++ ++++E+EE E+E
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEE 95


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 33.5 bits (77), Expect = 0.059
 Identities = 18/37 (48%), Positives = 24/37 (64%), Gaps = 2/37 (5%)

Query: 241 LEQEEEEEKGKE--EEEEEEKKKGKKKKEKEEEEKEE 275
            E EEEE+K K+  +E ++EKK+ K KKEK  E K  
Sbjct: 147 AEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGS 183



 Score = 32.0 bits (73), Expect = 0.21
 Identities = 15/35 (42%), Positives = 26/35 (74%)

Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           EEEEK ++++++E KK+ K+KK+K+E+  E    K
Sbjct: 150 EEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSK 184



 Score = 31.6 bits (72), Expect = 0.30
 Identities = 14/34 (41%), Positives = 24/34 (70%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E  E+E   K E+E E +++ KK+K+K++E K+E
Sbjct: 133 ETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKE 166



 Score = 30.8 bits (70), Expect = 0.47
 Identities = 17/34 (50%), Positives = 25/34 (73%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E+E   +  KE E EEE+KK KKKK++ ++EK+E
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKE 169



 Score = 30.5 bits (69), Expect = 0.68
 Identities = 15/34 (44%), Positives = 25/34 (73%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++   + + + E EEEEKK+ KKKKE ++E+KE+
Sbjct: 137 KETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170



 Score = 30.5 bits (69), Expect = 0.68
 Identities = 15/33 (45%), Positives = 27/33 (81%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            E+E E  +EE++E++KKK  KK++KE+++K+E
Sbjct: 143 VEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKE 175



 Score = 29.3 bits (66), Expect = 1.6
 Identities = 14/35 (40%), Positives = 27/35 (77%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
            + E+E + +EEE++E+KKK + KKEK+E++ ++ 
Sbjct: 141 AKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKE 175



 Score = 28.9 bits (65), Expect = 2.2
 Identities = 12/33 (36%), Positives = 16/33 (48%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
                 E G E E  E++   K +KE E EE+E
Sbjct: 121 PDGPPSELGSESETSEKETTAKVEKEAEVEEEE 153



 Score = 28.5 bits (64), Expect = 3.2
 Identities = 13/34 (38%), Positives = 26/34 (76%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
              + E++ + EEEE+++KK KK+ +KE++EK++
Sbjct: 139 TTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKD 172



 Score = 28.1 bits (63), Expect = 4.1
 Identities = 11/32 (34%), Positives = 26/32 (81%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           E ++E+++ K+++E+  + KG KKK+K++++K
Sbjct: 162 EVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 27.8 bits (62), Expect = 5.2
 Identities = 8/34 (23%), Positives = 27/34 (79%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++++E +K K+E++++++K  + K  K++++K++
Sbjct: 158 KKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191



 Score = 27.0 bits (60), Expect = 8.2
 Identities = 10/33 (30%), Positives = 17/33 (51%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
              E   + +  E+E   K  K+ + +EEE+KE
Sbjct: 124 PPSELGSESETSEKETTAKVEKEAEVEEEEKKE 156


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 33.5 bits (77), Expect = 0.063
 Identities = 12/38 (31%), Positives = 19/38 (50%)

Query: 238 LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           L Y E ++ +   +   EE E     K++E  E EK+E
Sbjct: 154 LWYAEWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKE 191



 Score = 28.8 bits (65), Expect = 2.7
 Identities = 12/28 (42%), Positives = 15/28 (53%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKE 267
             E+ E     KEEE  E +KK  KKK+
Sbjct: 169 AAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 27.3 bits (61), Expect = 7.5
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEK 268
           +  +E E     +EEE  E +K + KK+K
Sbjct: 168 FAAEEAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 33.2 bits (76), Expect = 0.066
 Identities = 10/32 (31%), Positives = 20/32 (62%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +EEEE  +E  +E E++   + K + +E+K+ 
Sbjct: 96  DEEEETEEESTDETEQEDPPETKTESKEKKKR 127



 Score = 32.4 bits (74), Expect = 0.10
 Identities = 10/32 (31%), Positives = 17/32 (53%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           E   E +   +E+++ E  K K +KEK + E 
Sbjct: 112 EDPPETKTESKEKKKREVPKPKTEKEKPKTEP 143



 Score = 32.0 bits (73), Expect = 0.14
 Identities = 14/43 (32%), Positives = 24/43 (55%)

Query: 233 KINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           KI   L   + EEEE + +  +E E++   + K E +E++K E
Sbjct: 86  KIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKRE 128



 Score = 29.3 bits (66), Expect = 1.2
 Identities = 12/34 (35%), Positives = 22/34 (64%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++ E+E+  + + E +EKKK +  K K E+EK +
Sbjct: 107 DETEQEDPPETKTESKEKKKREVPKPKTEKEKPK 140



 Score = 29.0 bits (65), Expect = 1.9
 Identities = 10/35 (28%), Positives = 20/35 (57%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E+ EEE   + E+E+  + K + K++K+ E  +  
Sbjct: 99  EETEEESTDETEQEDPPETKTESKEKKKREVPKPK 133



 Score = 28.6 bits (64), Expect = 2.6
 Identities = 12/34 (35%), Positives = 20/34 (58%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E+E EEE   E E+E+  +   + KEK++ E  +
Sbjct: 98  EEETEEESTDETEQEDPPETKTESKEKKKREVPK 131


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 34.1 bits (79), Expect = 0.070
 Identities = 16/38 (42%), Positives = 19/38 (50%), Gaps = 1/38 (2%)

Query: 242 EQEEE-EEKGKEEEEEEEKKKGKKKKEKEEEEKEENLV 278
           EQ E  EE   E  EEEE+KK +KK        E+  V
Sbjct: 65  EQRERLEELAPELLEEEEEKKEEKKGLPPLPNAEKGKV 102


>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB. Topo IB proteins include the
           monomeric yeast and human topo I and heterodimeric topo
           I from Leishmania donvanni. Topo I enzymes are divided
           into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topos I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I have putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 217

 Score = 33.3 bits (76), Expect = 0.076
 Identities = 17/48 (35%), Positives = 31/48 (64%), Gaps = 3/48 (6%)

Query: 223 DNYLLRN-EARKINLSL--HYLEQEEEEEKGKEEEEEEEKKKGKKKKE 267
           D+  +R  +    + SL  ++LE+E+E++K   +EE+E KKK + K+E
Sbjct: 72  DHKFIRRCKLSDADFSLIKNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119



 Score = 29.1 bits (65), Expect = 1.9
 Identities = 13/22 (59%), Positives = 17/22 (77%)

Query: 254 EEEEEKKKGKKKKEKEEEEKEE 275
           EEE+EKKK   K+EKE ++KE 
Sbjct: 94  EEEKEKKKNLNKEEKEAKKKER 115


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 33.6 bits (77), Expect = 0.083
 Identities = 11/30 (36%), Positives = 21/30 (70%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEE 270
            E+EEEE++    ++EEE+++  K+K K+ 
Sbjct: 153 FEKEEEEQRRLLLQKEEEEQQMNKRKNKQA 182



 Score = 29.0 bits (65), Expect = 2.3
 Identities = 15/32 (46%), Positives = 24/32 (75%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           EQEE EE  + E+EEEE+++   +KE+EE++ 
Sbjct: 143 EQEELEEALEFEKEEEEQRRLLLQKEEEEQQM 174


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 32.6 bits (75), Expect = 0.088
 Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 6/35 (17%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E+EE E + + E+E  E+ K      KEEE +E  
Sbjct: 82  EKEERELRKRAEKEALEQAK------KEEELREAK 110



 Score = 28.8 bits (65), Expect = 1.5
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKK 264
           E+E  E+  KEEE  E K++ +K
Sbjct: 93  EKEALEQAKKEEELREAKRQQRK 115



 Score = 27.2 bits (61), Expect = 5.8
 Identities = 13/42 (30%), Positives = 22/42 (52%), Gaps = 2/42 (4%)

Query: 224 NYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKK 265
            +  +NE  K    L    ++E  E+ K+EEE  E K+ ++K
Sbjct: 76  LFWKKNE--KEERELRKRAEKEALEQAKKEEELREAKRQQRK 115


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 33.1 bits (76), Expect = 0.092
 Identities = 22/65 (33%), Positives = 34/65 (52%), Gaps = 15/65 (23%)

Query: 223 DNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEE------------EEKKKGKKKKEKEE 270
           + Y + ++ RK    L +LEQ E EE+ K +EEE            E+ +  +K KE ++
Sbjct: 104 ERYEITSDVRK---QLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQK 160

Query: 271 EEKEE 275
           EE EE
Sbjct: 161 EEDEE 165


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 32.3 bits (74), Expect = 0.100
 Identities = 13/34 (38%), Positives = 25/34 (73%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           Y   E  +EK K+E E+EE ++ +++K++++EEK
Sbjct: 49  YERLELMDEKWKKETEDEEFQQKREEKKRKDEEK 82


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 32.6 bits (75), Expect = 0.10
 Identities = 11/35 (31%), Positives = 22/35 (62%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E+EE E +  +EEE+ ++   K+  + + E++ EN
Sbjct: 105 EEEEVEVEELDEEEQIDELLEKELAKLKREKRREN 139



 Score = 31.9 bits (73), Expect = 0.20
 Identities = 14/39 (35%), Positives = 26/39 (66%), Gaps = 1/39 (2%)

Query: 242 EQEEEEEKGKEEEE-EEEKKKGKKKKEKEEEEKEENLVK 279
           E EE +E+ + +E  E+E  K K++K +E E K++ ++K
Sbjct: 110 EVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILK 148



 Score = 30.7 bits (70), Expect = 0.47
 Identities = 11/33 (33%), Positives = 18/33 (54%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           E E EE   +E+ +E  +K+  K K ++  E E
Sbjct: 108 EVEVEELDEEEQIDELLEKELAKLKREKRRENE 140



 Score = 30.3 bits (69), Expect = 0.56
 Identities = 15/42 (35%), Positives = 24/42 (57%), Gaps = 7/42 (16%)

Query: 242 EQEEEEEKGKEEEEEEE-------KKKGKKKKEKEEEEKEEN 276
           E+E E E+  EEE+ +E       K K +K++E E ++KE  
Sbjct: 106 EEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEIL 147



 Score = 29.9 bits (68), Expect = 0.71
 Identities = 12/52 (23%), Positives = 27/52 (51%), Gaps = 5/52 (9%)

Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           R + RK          ++E+E+ +EEE E E+   +++ ++  E++   L +
Sbjct: 87  RKKVRK-----LLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKR 133


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 34.0 bits (77), Expect = 0.11
 Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 5/44 (11%)

Query: 241  LEQEEEEEKGKEEE---EEEEKKKGK--KKKEKEEEEKEENLVK 279
             ++ EE+EK   E    E EE KK +  KKKE EE++K E L K
Sbjct: 1680 AKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723



 Score = 33.2 bits (75), Expect = 0.17
 Identities = 17/51 (33%), Positives = 28/51 (54%)

Query: 229  NEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            N+ +    +    E +++ E+ K+ EE+E+K     KKE EE +K E L K
Sbjct: 1659 NKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709



 Score = 32.8 bits (74), Expect = 0.22
 Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 230  EARKINLSLHYLEQEEE---EEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            E +K    L   E EE+   EE  K EEE + K   + KK +E+++K E   K
Sbjct: 1630 EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682



 Score = 32.4 bits (73), Expect = 0.27
 Identities = 18/36 (50%), Positives = 25/36 (69%), Gaps = 3/36 (8%)

Query: 243  QEEEEEKGKEEE---EEEEKKKGKKKKEKEEEEKEE 275
            ++ EE K K EE    EEEKKK ++ K+KE EEK++
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK 1648



 Score = 32.4 bits (73), Expect = 0.32
 Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 3/52 (5%)

Query: 227  LRNEARKINLSLHYLEQEEEEEKGKEEE---EEEEKKKGKKKKEKEEEEKEE 275
            L+    +  +     ++E EE+K K EE   +EEEKKK    K++EE++ EE
Sbjct: 1721 LKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEE 1772



 Score = 31.3 bits (70), Expect = 0.66
 Identities = 12/37 (32%), Positives = 20/37 (54%)

Query: 243  QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            + E E    E E  EEK +  +KK++E ++K +   K
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385



 Score = 31.3 bits (70), Expect = 0.80
 Identities = 19/50 (38%), Positives = 30/50 (60%)

Query: 230  EARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            E  KI       E EE+++K +E +++EE+KK     +KEEE+K E + K
Sbjct: 1726 EENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRK 1775



 Score = 29.7 bits (66), Expect = 2.2
 Identities = 15/39 (38%), Positives = 24/39 (61%)

Query: 241  LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
             E  EE+ +  E+++EE KKK    K+K EE+K+ +  K
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAK 1397



 Score = 29.3 bits (65), Expect = 2.7
 Identities = 12/38 (31%), Positives = 21/38 (55%)

Query: 242  EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            +  E ++K  E ++ EE KK  + K+ EE +K +   K
Sbjct: 1504 KAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541



 Score = 29.3 bits (65), Expect = 2.8
 Identities = 20/59 (33%), Positives = 31/59 (52%), Gaps = 5/59 (8%)

Query: 226  LLRNEARKINLSLHYLEQEEEEEKGKEEE---EEEEKKKGK--KKKEKEEEEKEENLVK 279
            L + EA +   +    + EEE +    EE    EE+KKK +  KK E++E++  E L K
Sbjct: 1638 LKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696



 Score = 29.3 bits (65), Expect = 3.0
 Identities = 14/38 (36%), Positives = 24/38 (63%)

Query: 242  EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            E+++ EE  K EEE + K +  KK+ +E+++K E   K
Sbjct: 1714 EKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751



 Score = 29.0 bits (64), Expect = 3.6
 Identities = 14/39 (35%), Positives = 24/39 (61%)

Query: 241  LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
             ++ EE +K  E ++ EE KK  + K+ EE++K + L K
Sbjct: 1515 AKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553



 Score = 28.6 bits (63), Expect = 4.5
 Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 3/42 (7%)

Query: 241  LEQEEEEEKGKEEE---EEEEKKKGKKKKEKEEEEKEENLVK 279
             +++ EE K K +E     E KKK  + K+ EE +K +   K
Sbjct: 1488 AKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKK 1529



 Score = 28.6 bits (63), Expect = 4.9
 Identities = 18/36 (50%), Positives = 24/36 (66%), Gaps = 3/36 (8%)

Query: 241  LEQEEEEEKGKEE---EEEEEKKKGKKKKEKEEEEK 273
            L++E EE K  EE   +E EEKKK ++ K+ EEE K
Sbjct: 1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729



 Score = 28.6 bits (63), Expect = 5.0
 Identities = 14/50 (28%), Positives = 29/50 (58%)

Query: 230  EARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            EA+K + +    E+ ++ ++ K++ EE +KK  + KK  E ++K +   K
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKK 1517



 Score = 28.6 bits (63), Expect = 5.6
 Identities = 16/39 (41%), Positives = 26/39 (66%)

Query: 241  LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            L++ EE++K  E ++ EEKKK  + K+K EE K+ +  K
Sbjct: 1283 LKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAK 1321



 Score = 28.2 bits (62), Expect = 5.7
 Identities = 12/37 (32%), Positives = 20/37 (54%)

Query: 243  QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
             + E E   +E E  E+K    +K+KEE +K+ +  K
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384



 Score = 28.2 bits (62), Expect = 6.5
 Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 3/36 (8%)

Query: 241  LEQEEEEEKGKEE---EEEEEKKKGKKKKEKEEEEK 273
            L++ EEE+K  E+   +E EEKKK ++ K+ EEE K
Sbjct: 1625 LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENK 1660



 Score = 28.2 bits (62), Expect = 7.3
 Identities = 14/37 (37%), Positives = 27/37 (72%)

Query: 243  QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            ++ EEEK K E+ ++++ + KKK E+ ++ +EEN +K
Sbjct: 1626 KKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIK 1662



 Score = 27.8 bits (61), Expect = 8.2
 Identities = 10/39 (25%), Positives = 17/39 (43%)

Query: 241  LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
             +++ EE K   E  + E +    + E  EE+ E    K
Sbjct: 1334 AKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKK 1372



 Score = 27.8 bits (61), Expect = 8.3
 Identities = 14/35 (40%), Positives = 24/35 (68%)

Query: 241  LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             ++ EE++K  E ++ EE KK ++KK+ EE +K E
Sbjct: 1539 AKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAE 1573



 Score = 27.8 bits (61), Expect = 9.3
 Identities = 13/43 (30%), Positives = 24/43 (55%), Gaps = 4/43 (9%)

Query: 241  LEQEEEEEKGKEEE----EEEEKKKGKKKKEKEEEEKEENLVK 279
             +++ EE+K K +E       +KK  + KK+ EE++K +   K
Sbjct: 1396 AKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK 1438


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 31.7 bits (72), Expect = 0.11
 Identities = 14/27 (51%), Positives = 20/27 (74%)

Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEE 271
               + K EE+EEEKKK ++K+E+EEE
Sbjct: 71  AAAAEEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 31.7 bits (72), Expect = 0.11
 Identities = 15/27 (55%), Positives = 22/27 (81%)

Query: 251 KEEEEEEEKKKGKKKKEKEEEEKEENL 277
             EE+ EEK++ KKK+E++EEE+EE L
Sbjct: 73  AAEEKAEEKEEEKKKEEEKEEEEEEAL 99



 Score = 31.3 bits (71), Expect = 0.16
 Identities = 14/36 (38%), Positives = 22/36 (61%), Gaps = 6/36 (16%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
             EE+ E+ +EE+++EE      +KE+EEEE    L
Sbjct: 73  AAEEKAEEKEEEKKKEE------EKEEEEEEALAGL 102



 Score = 27.8 bits (62), Expect = 2.4
 Identities = 10/24 (41%), Positives = 19/24 (79%)

Query: 252 EEEEEEEKKKGKKKKEKEEEEKEE 275
              EE+ ++K ++KK++EE+E+EE
Sbjct: 72  AAAEEKAEEKEEEKKKEEEKEEEE 95



 Score = 27.5 bits (61), Expect = 3.4
 Identities = 9/24 (37%), Positives = 18/24 (75%)

Query: 252 EEEEEEEKKKGKKKKEKEEEEKEE 275
               EE+ ++ +++K+KEEE++EE
Sbjct: 71  AAAAEEKAEEKEEEKKKEEEKEEE 94


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 32.9 bits (75), Expect = 0.17
 Identities = 16/31 (51%), Positives = 21/31 (67%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           E E EK    EEE E+KK K++K KE+E K+
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKK 35



 Score = 32.6 bits (74), Expect = 0.29
 Identities = 15/31 (48%), Positives = 21/31 (67%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
           E E E++   EEE E +KKK +K KEKE ++
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKK 35



 Score = 28.7 bits (64), Expect = 4.2
 Identities = 14/38 (36%), Positives = 22/38 (57%), Gaps = 3/38 (7%)

Query: 241 LEQEEEEEKGKEEE---EEEEKKKGKKKKEKEEEEKEE 275
           L +EE E K K+EE   E+E KK    +KE + + + +
Sbjct: 13  LTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQ 50


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 31.9 bits (73), Expect = 0.23
 Identities = 15/34 (44%), Positives = 22/34 (64%), Gaps = 3/34 (8%)

Query: 239 HYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
            Y E+E+E++K      +EEKK  K++KEK EE 
Sbjct: 92  QYFEEEKEKKKAM---SKEEKKAIKEEKEKLEEP 122



 Score = 29.5 bits (67), Expect = 1.4
 Identities = 15/26 (57%), Positives = 19/26 (73%), Gaps = 3/26 (11%)

Query: 253 EEEEEEKK---KGKKKKEKEEEEKEE 275
           EEE+E+KK   K +KK  KEE+EK E
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLE 120


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 32.5 bits (74), Expect = 0.24
 Identities = 14/34 (41%), Positives = 20/34 (58%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            ++  E K K E E E+K K + KK+ EEE K +
Sbjct: 126 AKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAK 159



 Score = 31.7 bits (72), Expect = 0.34
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            +++ EEE   +   E +KK  + KK+ E E K +   K
Sbjct: 148 AKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAK 186



 Score = 29.0 bits (65), Expect = 2.7
 Identities = 9/35 (25%), Positives = 20/35 (57%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           LEQ+ EE + +   E+  +K+ +++   E+  K+ 
Sbjct: 73  LEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQA 107



 Score = 28.6 bits (64), Expect = 3.2
 Identities = 9/35 (25%), Positives = 19/35 (54%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           ++  E+ + KE E+    +K  K+ E+  ++ EE 
Sbjct: 83  QRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEK 117



 Score = 27.9 bits (62), Expect = 5.6
 Identities = 9/32 (28%), Positives = 17/32 (53%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           E E+     K  ++ E+  K  ++K+K+ EE 
Sbjct: 93  ELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEA 124



 Score = 27.9 bits (62), Expect = 5.8
 Identities = 7/34 (20%), Positives = 19/34 (55%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +  ++E + +   E+  K+  +  K+ EE++K+ 
Sbjct: 88  QARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQA 121



 Score = 27.9 bits (62), Expect = 6.2
 Identities = 9/34 (26%), Positives = 19/34 (55%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           EQ  ++ + K+++ EE K K   + + + E + E
Sbjct: 108 EQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAE 141



 Score = 27.9 bits (62), Expect = 6.5
 Identities = 12/34 (35%), Positives = 21/34 (61%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +Q EE++K  EE + ++  + K K E E E+K +
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAK 145


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region covers
           both the Brf homology II and III regions. This region is
           involved in binding TATA binding protein.
          Length = 95

 Score = 30.3 bits (69), Expect = 0.24
 Identities = 18/59 (30%), Positives = 33/59 (55%), Gaps = 6/59 (10%)

Query: 223 DNYLLRNEARKI------NLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           D+Y+L  E  K+       L+  YLE++EE+E  ++ +E     K KKK++ +++  E 
Sbjct: 5   DSYILSEEEVKLKERIWDELNKDYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEA 63


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 32.1 bits (73), Expect = 0.29
 Identities = 10/34 (29%), Positives = 23/34 (67%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +Q+ EE + K+  E+E  K+ +K++   +E+K++
Sbjct: 86  QQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQ 119



 Score = 30.2 bits (68), Expect = 1.1
 Identities = 11/34 (32%), Positives = 21/34 (61%), Gaps = 1/34 (2%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +Q  E+E+ K+ E+E    + ++KK+ EE  K+ 
Sbjct: 95  KQAAEQERLKQLEKERLAAQ-EQKKQAEEAAKQA 127



 Score = 29.4 bits (66), Expect = 2.3
 Identities = 10/32 (31%), Positives = 14/32 (43%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           +   E K K E E  +K   + KK+ E E   
Sbjct: 162 KAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAA 193



 Score = 27.8 bits (62), Expect = 6.6
 Identities = 9/33 (27%), Positives = 19/33 (57%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           LE+E    + ++++ EE  K+   K+++ EE  
Sbjct: 106 LEKERLAAQEQKKQAEEAAKQAALKQKQAEEAA 138



 Score = 27.8 bits (62), Expect = 7.5
 Identities = 10/35 (28%), Positives = 19/35 (54%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           EQE  ++  KE    +E+KK  ++  K+   K++ 
Sbjct: 99  EQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQ 133


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 30.7 bits (70), Expect = 0.30
 Identities = 19/40 (47%), Positives = 25/40 (62%), Gaps = 8/40 (20%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKE--------EEEKEE 275
            EE  K  EEE++E++KK KKKKE E        E++KEE
Sbjct: 67  GEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEE 106


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast) and
           GNL3L (human) are putative GTPases which are required
           for growth and play a role in processing of nucleolar
           pre-rRNA. This family contains a potential nuclear
           localisation signal.
          Length = 80

 Score = 29.9 bits (68), Expect = 0.31
 Identities = 14/33 (42%), Positives = 22/33 (66%), Gaps = 1/33 (3%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +E EE+K K+EEE+E +K+  +K E+ E  K  
Sbjct: 49  EEIEEKKRKQEEEKERRKE-ARKAERAEARKRG 80


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 31.0 bits (71), Expect = 0.31
 Identities = 18/55 (32%), Positives = 27/55 (49%), Gaps = 5/55 (9%)

Query: 220 SLSDNY-LLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
            L++ Y  L+       L    L+  E+ EK    + E+ K K +KKK K+EE K
Sbjct: 46  DLNEEYKDLKESLEAALLDKKELKAWEKAEK----KAEKAKAKAEKKKAKKEEPK 96


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 30.3 bits (68), Expect = 0.33
 Identities = 15/38 (39%), Positives = 27/38 (71%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           + E EEE+  +E++EEE+ +G++++ +EEEE E    K
Sbjct: 48  DDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGK 85



 Score = 27.6 bits (61), Expect = 2.7
 Identities = 13/34 (38%), Positives = 24/34 (70%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            QE ++E  +EEE +E+ ++ + + E+EE E+EE
Sbjct: 44  AQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEE 77


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 31.6 bits (72), Expect = 0.37
 Identities = 11/34 (32%), Positives = 21/34 (61%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           + E+E+E      E E+ KK ++ + K+++E EE
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEE 106



 Score = 27.4 bits (61), Expect = 7.8
 Identities = 9/32 (28%), Positives = 18/32 (56%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           ++        E+ +++E+ + KKKKE EE + 
Sbjct: 78  KELSASSLEAEQAKKKEEAEAKKKKEMEELKA 109


>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6.  The VP6 protein a
           minor protein in the core of the virion is probably the
           viral helicase.
          Length = 322

 Score = 31.6 bits (71), Expect = 0.37
 Identities = 16/64 (25%), Positives = 35/64 (54%), Gaps = 2/64 (3%)

Query: 212 LAGSEQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEE 271
           LA  + +   S+   L+    +INL   +   E +E++ K+E+ +E++    + ++K++ 
Sbjct: 2   LAPGDVIKRSSEE--LKQRQIQINLVDWFESGESKEKEPKDEDGQEQRISDGEGEQKQKG 59

Query: 272 EKEE 275
            KEE
Sbjct: 60  GKEE 63


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 31.5 bits (72), Expect = 0.40
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E+EEEK +  ++ +E    +  +E EEE  EE 
Sbjct: 133 EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEE 165



 Score = 30.4 bits (69), Expect = 0.89
 Identities = 10/38 (26%), Positives = 24/38 (63%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           E EEE+++  ++ +E+  ++  ++ E+E  E+EE   +
Sbjct: 133 EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAE 170



 Score = 29.2 bits (66), Expect = 2.6
 Identities = 14/35 (40%), Positives = 20/35 (57%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E ++E E    E+EEE+ +  KK KE  +EE  E 
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEE 156



 Score = 28.1 bits (63), Expect = 5.9
 Identities = 11/35 (31%), Positives = 20/35 (57%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           ++E E    ++EEE++E  K  K+   EE  +E+ 
Sbjct: 124 DKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDE 158



 Score = 27.7 bits (62), Expect = 6.8
 Identities = 7/29 (24%), Positives = 19/29 (65%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEE 270
           + +E+ ++   EE+EEE  + ++ + ++E
Sbjct: 144 KAKEDSDEELSEEDEEEAAEEEEAEAEKE 172



 Score = 27.7 bits (62), Expect = 7.7
 Identities = 12/34 (35%), Positives = 19/34 (55%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E E  + + +EE++E  KK  +   E+  EE EE
Sbjct: 126 EIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEE 159


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 31.7 bits (72), Expect = 0.41
 Identities = 16/38 (42%), Positives = 24/38 (63%), Gaps = 4/38 (10%)

Query: 242 EQEEEEEKGKEE----EEEEEKKKGKKKKEKEEEEKEE 275
             EE E K KEE    EE E+K+K +  ++KE+ +K+E
Sbjct: 394 ASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431



 Score = 31.7 bits (72), Expect = 0.42
 Identities = 12/34 (35%), Positives = 24/34 (70%), Gaps = 1/34 (2%)

Query: 242 EQEEEEEKGK-EEEEEEEKKKGKKKKEKEEEEKE 274
           E + +EEK K EE E+++K++  + KEK ++++ 
Sbjct: 399 EAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432



 Score = 30.9 bits (70), Expect = 0.69
 Identities = 12/47 (25%), Positives = 25/47 (53%)

Query: 230 EARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
             ++ N  L    ++  E K + +  EE + K K++K K+EE +++ 
Sbjct: 370 LLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQ 416



 Score = 29.0 bits (65), Expect = 3.1
 Identities = 9/36 (25%), Positives = 24/36 (66%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           ++ E +  +  E + +EEK K ++ ++K++E+ +E+
Sbjct: 388 VKDETDASEEAEAKAKEEKLKQEENEKKQKEQADED 423



 Score = 28.6 bits (64), Expect = 4.4
 Identities = 11/34 (32%), Positives = 21/34 (61%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             E ++E    EE E + K+ K K+E+ E++++E
Sbjct: 385 LGEVKDETDASEEAEAKAKEEKLKQEENEKKQKE 418


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 31.9 bits (72), Expect = 0.42
 Identities = 10/34 (29%), Positives = 19/34 (55%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            +E E++   E   + E ++ +  +E EEE+ EE
Sbjct: 297 PEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEE 330



 Score = 30.3 bits (68), Expect = 1.2
 Identities = 13/28 (46%), Positives = 15/28 (53%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
           + EE EE+  EEE    KK  K KK K 
Sbjct: 319 DSEESEEEKNEEEGGLSKKGKKLKKLKG 346



 Score = 29.9 bits (67), Expect = 1.6
 Identities = 10/35 (28%), Positives = 19/35 (54%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
             + EE + K   E   K + ++ ++ EE E+E+N
Sbjct: 294 GNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKN 328



 Score = 29.2 bits (65), Expect = 2.7
 Identities = 7/33 (21%), Positives = 18/33 (54%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             E   K + E++E+ ++  ++K E+E    ++
Sbjct: 305 SPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKK 337



 Score = 28.8 bits (64), Expect = 3.9
 Identities = 14/38 (36%), Positives = 19/38 (50%), Gaps = 5/38 (13%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKK-----KKEKEEEEKE 274
           E  EE E+ K EEE    KKGKK      K+   ++ +
Sbjct: 318 EDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDD 355



 Score = 28.4 bits (63), Expect = 5.0
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKE 267
           EQ+E+ E+ +EE+ EEE    KK K+
Sbjct: 315 EQDEDSEESEEEKNEEEGGLSKKGKK 340



 Score = 28.4 bits (63), Expect = 5.1
 Identities = 10/38 (26%), Positives = 19/38 (50%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            + E E+    EE EEEK + +    K+ ++ ++   K
Sbjct: 310 AKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGK 347



 Score = 28.0 bits (62), Expect = 6.2
 Identities = 5/34 (14%), Positives = 19/34 (55%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E   + E  ++E+ EE +++  +++    ++ ++
Sbjct: 307 EIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKK 340


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 31.2 bits (71), Expect = 0.43
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKK 266
           + EEE EK  + EE  +KKK  K K
Sbjct: 69  DDEEEGEKELQREERLKKKKRVKTK 93


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 31.2 bits (71), Expect = 0.46
 Identities = 16/39 (41%), Positives = 20/39 (51%), Gaps = 3/39 (7%)

Query: 244 EEEEEKGK---EEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           EEE E  K   EEE  ++KKK KKKK K+      +   
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEA 204


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 30.6 bits (69), Expect = 0.47
 Identities = 14/35 (40%), Positives = 21/35 (60%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E  E+EE+  E+EEEEE+ +      K+ E+K  N
Sbjct: 66  EDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNIN 100



 Score = 29.1 bits (65), Expect = 1.5
 Identities = 14/35 (40%), Positives = 29/35 (82%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E+EE+EE+ +E E+ E++++  + +E+EEE++E+N
Sbjct: 54  EEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDN 88


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 30.6 bits (69), Expect = 0.47
 Identities = 13/32 (40%), Positives = 20/32 (62%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
            +EE+EE  +   E EE++K K +K +EE E 
Sbjct: 32  AEEEDEEIQEALREAEEERKAKHRKMEEEREV 63


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 31.2 bits (71), Expect = 0.47
 Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 3/34 (8%)

Query: 239 HYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
            Y + ++EE+K      +EEKK  K +KEK EEE
Sbjct: 92  AYFKAQKEEKKAM---SKEEKKAIKAEKEKLEEE 122



 Score = 28.8 bits (65), Expect = 2.4
 Identities = 12/26 (46%), Positives = 17/26 (65%), Gaps = 3/26 (11%)

Query: 253 EEEEEEKK---KGKKKKEKEEEEKEE 275
           + ++EEKK   K +KK  K E+EK E
Sbjct: 95  KAQKEEKKAMSKEEKKAIKAEKEKLE 120


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 31.6 bits (72), Expect = 0.47
 Identities = 16/33 (48%), Positives = 22/33 (66%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            ++ E K  +  EE+ KKK KK+K+KEEE K E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKRE 762



 Score = 30.1 bits (68), Expect = 1.8
 Identities = 13/37 (35%), Positives = 24/37 (64%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           +  ++ E +     EE+ +KKK K+KK++EE ++EE 
Sbjct: 728 WNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEK 764



 Score = 28.5 bits (64), Expect = 5.1
 Identities = 10/33 (30%), Positives = 16/33 (48%)

Query: 245  EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
              +EK  EE+  E   K +K K++ +   E  L
Sbjct: 1551 SNQEKNIEEDYAESDIKKRKNKKQYKSNTEAEL 1583



 Score = 28.1 bits (63), Expect = 6.6
 Identities = 12/24 (50%), Positives = 19/24 (79%), Gaps = 1/24 (4%)

Query: 252 EEEEEEEKKKGKKKKEKEE-EEKE 274
           EE+ +++KKK KKK+E+ + EEK 
Sbjct: 742 EEKTKKKKKKEKKKEEEYKREEKA 765



 Score = 27.8 bits (62), Expect = 7.6
 Identities = 14/49 (28%), Positives = 27/49 (55%), Gaps = 2/49 (4%)

Query: 233  KINLSLHYLEQEEEEEKGKEEEEEEEKKKG--KKKKEKEEEEKEENLVK 279
            K ++ +    QEE+E  G+ E E +++KKG  +     +E+  EE+  +
Sbjct: 1515 KKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAE 1563



 Score = 27.8 bits (62), Expect = 7.7
 Identities = 13/48 (27%), Positives = 28/48 (58%), Gaps = 2/48 (4%)

Query: 223 DNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEE 270
            N+  ++   KI  S    E+ ++++K ++++EEE K++ K + E  E
Sbjct: 726 RNWNGKDAEFKI--SDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771


>gnl|CDD|197773 smart00527, HMG17, domain in high mobilty group proteins HMG14 and
           HMG 17. 
          Length = 88

 Score = 29.5 bits (66), Expect = 0.47
 Identities = 14/29 (48%), Positives = 16/29 (55%)

Query: 249 KGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           K K E    + KKG K K+ EE  KEEN 
Sbjct: 42  KKKSENVAAKGKKGAKGKQTEEAGKEENN 70


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 31.4 bits (72), Expect = 0.51
 Identities = 13/50 (26%), Positives = 21/50 (42%), Gaps = 9/50 (18%)

Query: 227 LRNEARKINLSLHYLEQ--EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           LR + RK        E+  E+EE +    +++ E    K K    E +K 
Sbjct: 414 LRKKQRK-------AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456



 Score = 29.9 bits (68), Expect = 1.8
 Identities = 22/51 (43%), Positives = 28/51 (54%), Gaps = 5/51 (9%)

Query: 230 EARKINLSLH--YLEQEEEEEKGK-EEEEEEEKKKGKKKKEKEE--EEKEE 275
            A +I L LH   L  E EEE+G+       E+KK +KK+ K E   EKEE
Sbjct: 379 GAIEIYLKLHDKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEE 429


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.  This
           family includes the HABP4 family of hyaluronan-binding
           proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
           HABP4 has been observed to bind hyaluronan (a
           glucosaminoglycan), but it is not known whether this is
           its primary role in vivo. It has also been observed to
           bind RNA, but with a lower affinity than that for
           hyaluronan. PAI-1 mRNA-binding protein specifically
           binds the mRNA of type-1 plasminogen activator inhibitor
           (PAI-1), and is thought to be involved in regulation of
           mRNA stability. However, in both cases, the sequence
           motifs predicted to be important for ligand binding are
           not conserved throughout the family, so it is not known
           whether members of this family share a common function.
          Length = 106

 Score = 29.7 bits (67), Expect = 0.51
 Identities = 13/33 (39%), Positives = 22/33 (66%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           E+ E EEK   EEE  ++   ++++ +EEE+KE
Sbjct: 46  EKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKE 78



 Score = 28.9 bits (65), Expect = 1.1
 Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 3/37 (8%)

Query: 242 EQEEEEEKGKEEE---EEEEKKKGKKKKEKEEEEKEE 275
           EQ EEEEK + EE    EEE  K    +E+E EE+E+
Sbjct: 40  EQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEED 76


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 31.6 bits (72), Expect = 0.52
 Identities = 16/38 (42%), Positives = 27/38 (71%), Gaps = 2/38 (5%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE--KEEN 276
           L + EEEE+ +E  E +E+ +G++++E+E EE  KEE 
Sbjct: 167 LREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEE 204



 Score = 30.8 bits (70), Expect = 0.71
 Identities = 14/37 (37%), Positives = 24/37 (64%), Gaps = 5/37 (13%)

Query: 239 HYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
               ++++EE+ +E+EEEEE     +++E EE E EE
Sbjct: 341 WVNPEQKDEEEEQEDEEEEE-----EEEEPEEPEPEE 372



 Score = 27.3 bits (61), Expect = 9.2
 Identities = 10/21 (47%), Positives = 17/21 (80%)

Query: 242 EQEEEEEKGKEEEEEEEKKKG 262
           +++EEEE+ +EE EE E ++G
Sbjct: 353 QEDEEEEEEEEEPEEPEPEEG 373



 Score = 27.3 bits (61), Expect = 9.6
 Identities = 13/32 (40%), Positives = 19/32 (59%), Gaps = 8/32 (25%)

Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
             E+K +EEE+E+E        E+EEEE+E  
Sbjct: 343 NPEQKDEEEEQEDE--------EEEEEEEEPE 366


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 29.1 bits (66), Expect = 0.56
 Identities = 7/21 (33%), Positives = 14/21 (66%)

Query: 255 EEEEKKKGKKKKEKEEEEKEE 275
                 + +KK+E+EEEE+++
Sbjct: 61  AAAAAAEEEKKEEEEEEEEDD 81



 Score = 28.4 bits (64), Expect = 1.1
 Identities = 10/17 (58%), Positives = 11/17 (64%)

Query: 242 EQEEEEEKGKEEEEEEE 258
                EE+ KEEEEEEE
Sbjct: 62  AAAAAEEEKKEEEEEEE 78



 Score = 28.4 bits (64), Expect = 1.4
 Identities = 5/20 (25%), Positives = 13/20 (65%)

Query: 253 EEEEEEKKKGKKKKEKEEEE 272
                 +++ K+++E+EEE+
Sbjct: 61  AAAAAAEEEKKEEEEEEEED 80



 Score = 27.6 bits (62), Expect = 2.3
 Identities = 11/17 (64%), Positives = 12/17 (70%)

Query: 242 EQEEEEEKGKEEEEEEE 258
               EEEK +EEEEEEE
Sbjct: 63  AAAAEEEKKEEEEEEEE 79



 Score = 27.2 bits (61), Expect = 3.0
 Identities = 7/25 (28%), Positives = 13/25 (52%)

Query: 250 GKEEEEEEEKKKGKKKKEKEEEEKE 274
                     ++ KK++E+EEEE +
Sbjct: 57  AAAAAAAAAAEEEKKEEEEEEEEDD 81



 Score = 27.2 bits (61), Expect = 3.5
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query: 244 EEEEEKGKEEEEEEE 258
            EEE+K +EEEEEE+
Sbjct: 66  AEEEKKEEEEEEEED 80



 Score = 26.4 bits (59), Expect = 5.0
 Identities = 8/17 (47%), Positives = 14/17 (82%)

Query: 242 EQEEEEEKGKEEEEEEE 258
             EEE+++ +EEEEE++
Sbjct: 65  AAEEEKKEEEEEEEEDD 81


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 31.6 bits (71), Expect = 0.58
 Identities = 13/34 (38%), Positives = 25/34 (73%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++EE  EK K E E++ +++ +++KEKE+E + E
Sbjct: 580 KREEAVEKAKREAEQKAREEREREKEKEKERERE 613



 Score = 30.8 bits (69), Expect = 0.98
 Identities = 11/29 (37%), Positives = 22/29 (75%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEE 271
           + E E+K +EE E E++K+ ++++E+E E
Sbjct: 589 KREAEQKAREEREREKEKEKERERERERE 617



 Score = 28.9 bits (64), Expect = 3.4
 Identities = 13/40 (32%), Positives = 26/40 (65%)

Query: 236 LSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           L+   L ++ EE   K + E E+K + ++++EKE+E++ E
Sbjct: 572 LASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKERE 611


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 28.1 bits (63), Expect = 0.60
 Identities = 13/31 (41%), Positives = 23/31 (74%)

Query: 234 INLSLHYLEQEEEEEKGKEEEEEEEKKKGKK 264
           ++LSL   E+ E+EE+ ++EEE EE+ + K+
Sbjct: 16  VSLSLCEEEKREDEEENEDEEEGEEQSEVKR 46


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 31.5 bits (71), Expect = 0.60
 Identities = 13/54 (24%), Positives = 24/54 (44%), Gaps = 2/54 (3%)

Query: 226 LLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           L+        L +   E + +E K   +E+ ++  +  + KEK E E+E  L  
Sbjct: 179 LIEETENLAELIIDLEELKLQELK--LKEQAKKALEYYQLKEKLELEEENLLYL 230



 Score = 30.3 bits (68), Expect = 1.2
 Identities = 16/64 (25%), Positives = 32/64 (50%)

Query: 216 EQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E++  L +      E  + +      E+E   +  KE +EEE++KK ++++ K   ++EE
Sbjct: 238 ERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEE 297

Query: 276 NLVK 279
            L  
Sbjct: 298 ELKS 301



 Score = 30.3 bits (68), Expect = 1.5
 Identities = 24/64 (37%), Positives = 36/64 (56%)

Query: 216 EQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E+   L  +YL  NE R   L     +++EE E  K+E E+EE+   +  KE +EEEKE+
Sbjct: 223 EEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEK 282

Query: 276 NLVK 279
            L +
Sbjct: 283 KLQE 286



 Score = 29.2 bits (65), Expect = 2.9
 Identities = 17/35 (48%), Positives = 27/35 (77%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           LE+E ++EK + EE E+E K+ + K+E EEEE+E+
Sbjct: 327 LEKELKKEKEEIEELEKELKELEIKREAEEEEEEQ 361



 Score = 28.4 bits (63), Expect = 5.3
 Identities = 18/39 (46%), Positives = 28/39 (71%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           LE+E +E + K E EEEE+++ +K +EK E+ +EE L K
Sbjct: 341 LEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAK 379



 Score = 28.4 bits (63), Expect = 5.3
 Identities = 20/49 (40%), Positives = 27/49 (55%)

Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
              E  K  LSL   E  EEEEK ++ + EEEK++  K +E+E    EE
Sbjct: 761 EEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEE 809



 Score = 28.4 bits (63), Expect = 5.8
 Identities = 13/55 (23%), Positives = 21/55 (38%)

Query: 225 YLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            LL     K     +  E+EE  ++    +EE          E EE+E+  N  +
Sbjct: 941 LLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDE 995


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 30.7 bits (70), Expect = 0.61
 Identities = 8/33 (24%), Positives = 23/33 (69%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +   E++ ++ E+ + +KK K+K+ K++E+ ++
Sbjct: 5   EALLEQRRRKREQRKARKKQKRKEAKKKEDAQK 37



 Score = 27.7 bits (62), Expect = 5.6
 Identities = 11/34 (32%), Positives = 20/34 (58%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            + E+ +  K+++ +E KKK   +K + EE K E
Sbjct: 13  RKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNE 46



 Score = 26.9 bits (60), Expect = 9.4
 Identities = 10/37 (27%), Positives = 24/37 (64%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
            LEQ   + + ++  +++++K+ KKK++ ++ E EE 
Sbjct: 7   LLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEV 43


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 31.3 bits (72), Expect = 0.62
 Identities = 8/32 (25%), Positives = 18/32 (56%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           + EEE    +E+ E     + ++++++ EK E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 28.2 bits (64), Expect = 5.2
 Identities = 10/32 (31%), Positives = 19/32 (59%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
           ++ EEE    KE+ E    K+ ++K++K E+ 
Sbjct: 510 IDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541



 Score = 27.5 bits (62), Expect = 9.7
 Identities = 8/31 (25%), Positives = 17/31 (54%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEE 271
           +E+E    K + E    ++ + K+KK ++ E
Sbjct: 512 VEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 31.0 bits (71), Expect = 0.64
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 246 EEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           EE K     + +     ++++E+EEEE+EE
Sbjct: 286 EELKEVLSAQAQAAAAEEEEEEEEEEEEEE 315



 Score = 31.0 bits (71), Expect = 0.71
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
           E    + +    EEEEE+++ ++++E  EEE
Sbjct: 290 EVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 30.2 bits (69), Expect = 1.2
 Identities = 9/34 (26%), Positives = 21/34 (61%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E+ +E    + +    E+++ ++++E+EEE  EE
Sbjct: 286 EELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319



 Score = 29.4 bits (67), Expect = 1.7
 Identities = 8/32 (25%), Positives = 19/32 (59%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
           L++    +      EEEE+++ ++++E+  EE
Sbjct: 288 LKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319



 Score = 28.7 bits (65), Expect = 3.1
 Identities = 9/35 (25%), Positives = 20/35 (57%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            E +E      +    EE+++ ++++E+EE  +EE
Sbjct: 286 EELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 30.5 bits (69), Expect = 0.68
 Identities = 16/38 (42%), Positives = 24/38 (63%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
              E+E++  EEE EEE ++   + EKE EEK E+ V+
Sbjct: 82  GDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVE 119



 Score = 30.1 bits (68), Expect = 0.84
 Identities = 13/33 (39%), Positives = 22/33 (66%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +EEE+E    E++E++    K+ +E EEE +EE
Sbjct: 66  KEEEKEAANSEDKEDKGDAEKEDEESEEENEEE 98



 Score = 30.1 bits (68), Expect = 1.1
 Identities = 13/35 (37%), Positives = 22/35 (62%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E EE +E+ KE    E+K+     ++++EE +EEN
Sbjct: 61  EIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEEN 95



 Score = 28.9 bits (65), Expect = 2.2
 Identities = 11/35 (31%), Positives = 23/35 (65%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           ++E++ +  KE+EE EE+ + + ++  +E EKE  
Sbjct: 77  DKEDKGDAEKEDEESEEENEEEDEESSDENEKETE 111



 Score = 28.9 bits (65), Expect = 2.6
 Identities = 11/35 (31%), Positives = 23/35 (65%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E+E    + KE++ + EK+  + ++E EEE++E +
Sbjct: 69  EKEAANSEDKEDKGDAEKEDEESEEENEEEDEESS 103



 Score = 28.5 bits (64), Expect = 3.3
 Identities = 13/35 (37%), Positives = 21/35 (60%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           ++  E E+ KEEE+E    + K+ K   E+E EE+
Sbjct: 57  QETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEES 91


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 31.2 bits (70), Expect = 0.70
 Identities = 17/35 (48%), Positives = 23/35 (65%), Gaps = 2/35 (5%)

Query: 244 EEEEEKGKEEEEEEEKKKGKK--KKEKEEEEKEEN 276
           EEEEE   EE+EE  +K+  K   ++ EEEE+EE 
Sbjct: 141 EEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEEL 175



 Score = 30.4 bits (68), Expect = 1.2
 Identities = 19/36 (52%), Positives = 25/36 (69%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
            +EEEE + +E+EE  EK+ GK   EK EEE+EE L
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEEL 175


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 30.8 bits (70), Expect = 0.73
 Identities = 11/36 (30%), Positives = 20/36 (55%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            L++E E+E   E++ +E ++  +KK EK   E   
Sbjct: 198 LLQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASR 233



 Score = 28.1 bits (63), Expect = 5.6
 Identities = 10/33 (30%), Positives = 14/33 (42%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           + E   E  + E E   K    K+K K +  KE
Sbjct: 254 DDESAWEGFESEYEPINKPVRPKRKTKAQRNKE 286


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 28.5 bits (64), Expect = 0.74
 Identities = 12/28 (42%), Positives = 18/28 (64%)

Query: 248 EKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            K KE + EEEK++ +++  K EE KE 
Sbjct: 24  AKKKELKAEEEKREEEEEARKREERKER 51



 Score = 28.1 bits (63), Expect = 1.1
 Identities = 17/37 (45%), Positives = 24/37 (64%), Gaps = 4/37 (10%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKE---EEEKEE 275
           E + EEEK +EEEEE  K++ +K++EK    EE   E
Sbjct: 28  ELKAEEEK-REEEEEARKREERKEREKNKSFEELLNE 63



 Score = 26.1 bits (58), Expect = 5.5
 Identities = 13/35 (37%), Positives = 25/35 (71%)

Query: 238 LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
           L  L+ +++E K +EE+ EEE++  K+++ KE E+
Sbjct: 19  LAKLKAKKKELKAEEEKREEEEEARKREERKEREK 53


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 30.4 bits (69), Expect = 0.77
 Identities = 19/109 (17%), Positives = 38/109 (34%), Gaps = 28/109 (25%)

Query: 168 AVQLIVTGNENKVTAVTKMNAQSSRSHTICTIYLGAMAKLHLVDLAGSEQLFSLSDNYLL 227
            ++    GN  +V     +   ++ S+TI            L  L G           L+
Sbjct: 112 GLRSASAGNWGEVDRNQGLAVLAA-SNTIL-----------LFVLVG----------VLV 149

Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
                 +   L Y E+++ ++K +   EE ++      K+    +K E 
Sbjct: 150 ------LQAGLWYAERKDAKQKEEFAAEERKEALAAAAKKSATPQKVET 192


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 30.0 bits (68), Expect = 0.80
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKK 266
           E+E++EE   E+E EEE+   KK K
Sbjct: 122 EEEDDEEDDDEDESEEEESPVKKVK 146



 Score = 29.6 bits (67), Expect = 0.99
 Identities = 9/27 (33%), Positives = 20/27 (74%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEK 268
           E++EEEE  +E+++E+E ++ +   +K
Sbjct: 118 EEDEEEEDDEEDDDEDESEEEESPVKK 144



 Score = 27.3 bits (61), Expect = 5.3
 Identities = 16/42 (38%), Positives = 26/42 (61%), Gaps = 2/42 (4%)

Query: 238 LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            H +  EE+E    EE+EEEE    ++  +++E E+EE+ VK
Sbjct: 104 QHLVASEEDESDDDEEDEEEED--DEEDDDEDESEEEESPVK 143



 Score = 26.9 bits (60), Expect = 9.1
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKK 266
           E+EE++E+  +E+E EE++   KK 
Sbjct: 121 EEEEDDEEDDDEDESEEEESPVKKV 145


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 30.2 bits (69), Expect = 0.83
 Identities = 14/33 (42%), Positives = 21/33 (63%), Gaps = 3/33 (9%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
           Y E E+E++K   +EE   KK  K++K+K EE 
Sbjct: 94  YFEAEKEKKKAMSKEE---KKAIKEEKDKLEEP 123


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 29.3 bits (66), Expect = 0.88
 Identities = 8/35 (22%), Positives = 17/35 (48%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
                   +   E +E ++ +K++E EEE  ++ L
Sbjct: 70  AAAAAAGAEAAAEADEAEEEEKEEEAEEESDDDML 104


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 30.5 bits (69), Expect = 0.91
 Identities = 10/34 (29%), Positives = 20/34 (58%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E E+EEE+ + +E EEE+ +  +++  +  E   
Sbjct: 372 EDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGS 405



 Score = 28.9 bits (65), Expect = 3.0
 Identities = 12/35 (34%), Positives = 25/35 (71%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           L+  + EE  ++E+EEEE++  + ++E+ E+ +EE
Sbjct: 362 LDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEE 396



 Score = 28.5 bits (64), Expect = 4.1
 Identities = 8/34 (23%), Positives = 23/34 (67%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             E+E+E+ ++  +E E+++G+  +E+  + +E+
Sbjct: 370 VDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSRED 403



 Score = 28.5 bits (64), Expect = 4.2
 Identities = 9/34 (26%), Positives = 20/34 (58%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E+E+  ++ +EEE E+ +++G + +E    E   
Sbjct: 377 EEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSS 410



 Score = 27.8 bits (62), Expect = 6.4
 Identities = 10/34 (29%), Positives = 19/34 (55%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++EEE+   + EEEE E  + +  + +E+   E 
Sbjct: 375 DEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSES 408



 Score = 27.8 bits (62), Expect = 8.3
 Identities = 8/36 (22%), Positives = 22/36 (61%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           E +E+E++ +E+  +E +++  +  E+E  +  E+ 
Sbjct: 369 EVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDG 404



 Score = 27.4 bits (61), Expect = 8.4
 Identities = 7/38 (18%), Positives = 15/38 (39%)

Query: 239 HYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           H  E+ E+ E+   +  E+   +       + E K + 
Sbjct: 385 HEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADK 422



 Score = 27.4 bits (61), Expect = 9.8
 Identities = 10/35 (28%), Positives = 17/35 (48%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E EEEE +  EEE  + ++ G  +   +     E+
Sbjct: 384 EHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSES 418


>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain.  This family consists
           of several ATP synthase E chain sequences which are
           components of the CF(0) subunit.
          Length = 83

 Score = 28.6 bits (64), Expect = 1.0
 Identities = 10/33 (30%), Positives = 19/33 (57%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           E++  E + +E+  E+ K    KKK+ ++E K 
Sbjct: 36  EEKIREYEAQEKLIEKAKAAYAKKKQAKQEAKS 68


>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
           N-terminus.  This is an N-terminal domain of DNA
           polymerase III polC subunit A that is found only in
           Firmicutes. DNA polymerase polC-type III enzyme
           functions as the 'replicase' in low G + C Gram-positive
           bacteria. Purine asymmetry is a characteristic of
           organisms with a heterodimeric DNA polymerase III
           alpha-subunit constituted by polC which probably plays a
           direct role in the maintenance of strand-biased gene
           distribution; since, among prokaryotic genomes, the
           distribution of genes on the leading and lagging strands
           of the replication fork is known to be biased. The
           domain is associated with DNA_pol3_alpha pfam07733.
          Length = 180

 Score = 29.6 bits (67), Expect = 1.1
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
           + EQ+EEEE    EE  E  KK + +K+K+E+E
Sbjct: 148 FEEQKEEEEAKLAEEALEALKKKEAEKKKKEKE 180


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 30.7 bits (70), Expect = 1.1
 Identities = 14/34 (41%), Positives = 25/34 (73%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++E+E+   +++EEEEE++K KKKK+  E  + E
Sbjct: 357 DEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390



 Score = 30.0 bits (68), Expect = 1.7
 Identities = 13/33 (39%), Positives = 21/33 (63%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
            +++ ++E  +EEEEEE++KK KK  E    E 
Sbjct: 359 EDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391



 Score = 28.8 bits (65), Expect = 3.7
 Identities = 15/36 (41%), Positives = 24/36 (66%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           E EE+E+   E++EEEE+++ +KKK+K  E     L
Sbjct: 356 EDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391



 Score = 27.7 bits (62), Expect = 9.2
 Identities = 12/35 (34%), Positives = 23/35 (65%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            E EEEEE G ++E+EE+     +++E++ +  +E
Sbjct: 320 EEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDE 354


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 30.8 bits (69), Expect = 1.1
 Identities = 11/33 (33%), Positives = 25/33 (75%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           +++E+++  +++EEEEE+++  K  + E+EE E
Sbjct: 159 DEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191



 Score = 29.6 bits (66), Expect = 2.4
 Identities = 12/34 (35%), Positives = 26/34 (76%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E E+E++  +E++EEEE+++ + K   +E+E++E
Sbjct: 158 EDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191



 Score = 28.5 bits (63), Expect = 5.3
 Identities = 15/34 (44%), Positives = 25/34 (73%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E +++EE  +EEEEEEE+ KG   +++E+E  E+
Sbjct: 162 EDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGED 195


>gnl|CDD|224023 COG1098, VacB, Predicted RNA binding protein (contains ribosomal
           protein S1 domain) [Translation, ribosomal structure and
           biogenesis].
          Length = 129

 Score = 29.2 bits (66), Expect = 1.1
 Identities = 15/51 (29%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKG-KKKKEKEEEEKEENL 277
            +E  KI+LS+  LE+E E++  K    +   K+G +    +  +  EE+L
Sbjct: 63  IDENGKISLSIRKLEEEPEKQHRKPRFSKSRPKEGFETLLSRLLKWIEESL 113


>gnl|CDD|217451 pfam03249, TSA, Type specific antigen.  There are several antigenic
           variants in Rickettsia tsutsugamushi, and a
           type-specific antigen (TSA) of 56-kilodaltons located on
           the rickettsial surface is responsible for the
           variation. TSA proteins are probably integral membrane
           proteins.
          Length = 502

 Score = 30.2 bits (68), Expect = 1.2
 Identities = 20/65 (30%), Positives = 35/65 (53%), Gaps = 2/65 (3%)

Query: 212 LAGSEQLFSL-SDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEE 270
           L G+EQ+  L  D   L+  A  I  ++  L  ++EE+ G +   + +KK+G  +K KE 
Sbjct: 337 LNGNEQIIQLYKDLVKLQRHA-GIKKAMEKLAAQQEEDAGNQGGGDCKKKQGASEKSKEG 395

Query: 271 EEKEE 275
            + +E
Sbjct: 396 GKGKE 400


>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
           [DNA replication, recombination, and repair].
          Length = 663

 Score = 30.3 bits (69), Expect = 1.2
 Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 20/66 (30%)

Query: 52  INHMFNGINA-----TLLAYGQTGGGKTYTVSAMIMKTLQHVMQRCNKDDVYMSY----- 101
           I  +  GI       TLL  G TG GKT+T++ +I K          +  + +++     
Sbjct: 21  IAELVEGIENGLKHQTLL--GVTGSGKTFTMANVIAKV--------QRPTLVLAHNKTLA 70

Query: 102 LQLYSE 107
            QLYSE
Sbjct: 71  AQLYSE 76


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 28.9 bits (65), Expect = 1.2
 Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGK 263
           LE  EEE   KEEE E+  +K K
Sbjct: 84  LEDREEERLEKEEEREKRARKRK 106



 Score = 28.1 bits (63), Expect = 1.8
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKK 265
             E+ EE+  E+EEE EK+  K+K
Sbjct: 83  RLEDREEERLEKEEEREKRARKRK 106


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 30.1 bits (68), Expect = 1.3
 Identities = 16/62 (25%), Positives = 25/62 (40%), Gaps = 2/62 (3%)

Query: 210 VDLAGSEQLFSLSDNYL-LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEK 268
           V+L         +  Y      A +       +E E  EE  +E   + +KKK KKK+  
Sbjct: 380 VNLDRQLSELKEAIAYYESAKTALEKAEGKKAIE-EIREELIEEGLLKSKKKKRKKKEWF 438

Query: 269 EE 270
           E+
Sbjct: 439 EK 440



 Score = 30.1 bits (68), Expect = 1.3
 Identities = 17/53 (32%), Positives = 26/53 (49%), Gaps = 3/53 (5%)

Query: 226 LLRNEARKINLSLHYLEQEEEE---EKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            L+        +   LE+ E +   E+ +EE  EE   K KKKK K++E  E+
Sbjct: 388 ELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440


>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
          Length = 194

 Score = 29.8 bits (67), Expect = 1.3
 Identities = 12/35 (34%), Positives = 22/35 (62%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E   E++  +EE  +E  KK  K+  KEE+++++N
Sbjct: 5   EFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQN 39


>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
           (DUF3340).  This presumed domain is found at the
           C-terminus of tail specific proteases. Its function is
           unknown. This family is found in bacteria and
           eukaryotes. This presumed domain is typically between 88
           to 187 amino acids in length.
          Length = 139

 Score = 29.1 bits (66), Expect = 1.3
 Identities = 9/38 (23%), Positives = 20/38 (52%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           E+EE+E+K  + E E  K  G++  +  ++  ++    
Sbjct: 83  EREEDEKKRLKRENERRKALGEEPLKSLDDLPKDKEEP 120


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 29.7 bits (67), Expect = 1.3
 Identities = 14/47 (29%), Positives = 30/47 (63%)

Query: 223 DNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
           +N+L   +A++       LE+++++E+ KE EE E +++  K+K +E
Sbjct: 90  ENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEE 136



 Score = 29.3 bits (66), Expect = 2.2
 Identities = 13/34 (38%), Positives = 29/34 (85%), Gaps = 1/34 (2%)

Query: 243 QEEEEEKGKEEEEE-EEKKKGKKKKEKEEEEKEE 275
           QE E +K K+++++ EE+++ ++KK++EEEE+++
Sbjct: 185 QEWELKKLKQQQQKREEERRKQRKKQQEEEERKQ 218



 Score = 27.8 bits (62), Expect = 6.5
 Identities = 13/40 (32%), Positives = 26/40 (65%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           +L  ++ + + K ++  EEK+K +++KE+EE E  + L K
Sbjct: 92  WLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAK 131


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 30.1 bits (69), Expect = 1.3
 Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 2/39 (5%)

Query: 241 LEQEEEEEKGKEEEEEE--EKKKGKKKKEKEEEEKEENL 277
           LE +EE  K + E E+E  E++   +K EK   +KEENL
Sbjct: 60  LEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENL 98



 Score = 29.4 bits (67), Expect = 2.3
 Identities = 11/49 (22%), Positives = 29/49 (59%)

Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           L  +   +      LE++E+E + K++E E+++++ ++  E++ +E E 
Sbjct: 98  LDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146



 Score = 28.6 bits (65), Expect = 3.7
 Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 14/68 (20%)

Query: 216 EQLFSLSDNYLLRNEARKINLSLHYLEQEEE------EEKGKEEEEEEEKKKGKKKKEKE 269
           E+      N L + E R        L Q+EE      E   K EEE E+K+K  ++K++E
Sbjct: 74  EKELRERRNELQKLEKR--------LLQKEENLDRKLELLEKREEELEKKEKELEQKQQE 125

Query: 270 EEEKEENL 277
            E+KEE L
Sbjct: 126 LEKKEEEL 133



 Score = 28.6 bits (65), Expect = 4.3
 Identities = 21/54 (38%), Positives = 34/54 (62%), Gaps = 3/54 (5%)

Query: 228 RNEARKINLS--LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           R   ++ NL   L  LE+ EEE + K+E+E E+K++  +KKE+E EE  E  ++
Sbjct: 90  RLLQKEENLDRKLELLEKREEELE-KKEKELEQKQQELEKKEEELEELIEEQLQ 142


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 30.3 bits (67), Expect = 1.3
 Identities = 17/48 (35%), Positives = 28/48 (58%)

Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           R+ A K  + L   EQ+ E+EK K E+E  E    + K E+E+++ E+
Sbjct: 141 RDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQ 188


>gnl|CDD|147685 pfam05663, DUF809, Protein of unknown function (DUF809).  This
           family consists of several proteins of unknown function
           Raphanus sativus (Radish) and Brassica napus (Rape).
          Length = 138

 Score = 29.0 bits (64), Expect = 1.3
 Identities = 18/35 (51%), Positives = 28/35 (80%), Gaps = 1/35 (2%)

Query: 241 LEQEEEEEKGK-EEEEEEEKKKGKKKKEKEEEEKE 274
           +E +EE+++GK E E +EEKK+GK + E +EE+KE
Sbjct: 97  IEGKEEKKEGKGEIEGKEEKKEGKGEIEGKEEKKE 131


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 30.1 bits (68), Expect = 1.4
 Identities = 11/33 (33%), Positives = 17/33 (51%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
             EE+EE+ ++E EEE      KK E    + +
Sbjct: 113 LDEEQEERVEKEREEELAGDAMKKLENRTADSK 145



 Score = 29.3 bits (66), Expect = 2.4
 Identities = 11/34 (32%), Positives = 19/34 (55%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
              +E++EE+ +EEE+E   K      E EE+ +
Sbjct: 176 LFRREKKEEEEEEEEDEALIKSLSFGPETEEDRR 209



 Score = 28.2 bits (63), Expect = 5.0
 Identities = 16/42 (38%), Positives = 20/42 (47%), Gaps = 6/42 (14%)

Query: 242 EQEEEEEKGKEEEEEE------EKKKGKKKKEKEEEEKEENL 277
           EQEE  EK +EEE         E +    K+E E  E+ E L
Sbjct: 116 EQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEEL 157


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 30.1 bits (68), Expect = 1.4
 Identities = 11/39 (28%), Positives = 24/39 (61%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           L + +EEE+ +E+E  EE+++ +    ++  +K E  V+
Sbjct: 394 LSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVE 432


>gnl|CDD|216991 pfam02357, NusG, Transcription termination factor nusG. 
          Length = 90

 Score = 28.0 bits (63), Expect = 1.4
 Identities = 14/45 (31%), Positives = 23/45 (51%)

Query: 225 YLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
           Y+LR ++ +       LE++  E     EE  E +K G+KKK + 
Sbjct: 4   YVLRTKSGQEKKVAENLERQGIESFLPPEEVVEVRKNGRKKKVER 48


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 29.6 bits (66), Expect = 1.5
 Identities = 12/33 (36%), Positives = 22/33 (66%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           QE+   K +E  +E + ++ + ++E+EEEE EE
Sbjct: 199 QEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEE 231



 Score = 28.9 bits (64), Expect = 2.4
 Identities = 12/35 (34%), Positives = 24/35 (68%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           YL++ + EEK  +EE+EEE+ + + K+E+ +   +
Sbjct: 209 YLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243


>gnl|CDD|235395 PRK05298, PRK05298, excinuclease ABC subunit B; Provisional.
          Length = 652

 Score = 30.0 bits (69), Expect = 1.5
 Identities = 24/64 (37%), Positives = 29/64 (45%), Gaps = 26/64 (40%)

Query: 57  NGINA-----TLLAYGQTGGGKTYTVSAMIMKT------LQHVMQRCNKDDVYMSYL--Q 103
            GI A     TLL  G TG GKT+T++ +I +       L H     NK       L  Q
Sbjct: 26  EGIEAGEKHQTLL--GVTGSGKTFTMANVIARLQRPTLVLAH-----NKT------LAAQ 72

Query: 104 LYSE 107
           LYSE
Sbjct: 73  LYSE 76


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 30.1 bits (68), Expect = 1.5
 Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 2/73 (2%)

Query: 209 LVDLAGSEQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGK--EEEEEEEKKKGKKKK 266
           L+     E+    S     ++  ++  LS+    +E+  E+ +   E +EE KK+  KK 
Sbjct: 438 LLKKFLEEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKV 497

Query: 267 EKEEEEKEENLVK 279
           +K  E     L++
Sbjct: 498 KKIPEVDTYLLLE 510


>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein.  This family includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 116

 Score = 28.7 bits (64), Expect = 1.5
 Identities = 13/24 (54%), Positives = 16/24 (66%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKK 265
             E+E E  +EEEEE+ KKK K K
Sbjct: 93  RLEDERELAREEEEEKRKKKKKNK 116



 Score = 26.8 bits (59), Expect = 6.0
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKK 266
            E+E +   EEEEE++KK KK K
Sbjct: 94  LEDERELAREEEEEKRKKKKKNK 116


>gnl|CDD|224237 COG1318, COG1318, Predicted transcriptional regulators
           [Transcription].
          Length = 182

 Score = 29.5 bits (66), Expect = 1.5
 Identities = 15/46 (32%), Positives = 24/46 (52%)

Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
           L  E  KI  ++  L++ + E    ++E  EEK KG+  K +  EE
Sbjct: 114 LEKEGLKIRWAVEVLKKIKGEHFPMDKELLEEKLKGEVIKGEPLEE 159


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 29.6 bits (67), Expect = 1.6
 Identities = 19/55 (34%), Positives = 28/55 (50%), Gaps = 6/55 (10%)

Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEE--EEKEENLVK 279
           L         S     +E  E +  EEE E +KK  +K +EKE+  EE E+NL++
Sbjct: 292 LSGLKNSGEPS----LKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIE 342


>gnl|CDD|183737 PRK12773, flhB, flagellar biosynthesis protein FlhB; Reviewed.
          Length = 646

 Score = 30.0 bits (67), Expect = 1.7
 Identities = 19/48 (39%), Positives = 22/48 (45%), Gaps = 2/48 (4%)

Query: 224 NYLLRNEAR--KINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
           N  L  EA   KI+L L   E E   E G E    EE++KG   K  E
Sbjct: 276 NICLSTEAADFKIDLQLFAAEDEGRTEPGSERRRREEREKGNVPKSPE 323


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 29.3 bits (66), Expect = 1.7
 Identities = 13/33 (39%), Positives = 21/33 (63%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
            + +EE+KGKEE+ E E+ + ++  E  EE  E
Sbjct: 21  NENKEEDKGKEEDLEFEEIEKEEIIEDSEESNE 53



 Score = 29.3 bits (66), Expect = 1.9
 Identities = 10/38 (26%), Positives = 21/38 (55%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            +EE++ K ++ E EE +K+   +  +E  E +   +K
Sbjct: 23  NKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60



 Score = 28.6 bits (64), Expect = 3.1
 Identities = 8/38 (21%), Positives = 21/38 (55%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           ++ +EE+   EE E+EE  +  ++  + + E+ ++   
Sbjct: 27  DKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENN 64



 Score = 28.6 bits (64), Expect = 3.2
 Identities = 11/47 (23%), Positives = 27/47 (57%)

Query: 229 NEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            +A+  N+     ++ E +E+ K +EE+ E ++ +K++  E+ E+  
Sbjct: 6   KDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESN 52



 Score = 28.2 bits (63), Expect = 4.1
 Identities = 12/38 (31%), Positives = 22/38 (57%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           ++E  E+  +  E + E+ K +  K KEE +K EN ++
Sbjct: 41  KEEIIEDSEESNEVKIEELKDENNKLKEENKKLENELE 78



 Score = 28.2 bits (63), Expect = 4.3
 Identities = 15/33 (45%), Positives = 19/33 (57%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
             EE+  KE E +EE K  ++  E EE EKEE 
Sbjct: 12  NMEEDCCKENENKEEDKGKEEDLEFEEIEKEEI 44



 Score = 27.8 bits (62), Expect = 5.9
 Identities = 13/34 (38%), Positives = 19/34 (55%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E  EE  + K EE ++E  K K++ +K E E E 
Sbjct: 46  EDSEESNEVKIEELKDENNKLKEENKKLENELEA 79


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 30.0 bits (68), Expect = 1.8
 Identities = 14/36 (38%), Positives = 22/36 (61%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           LE EEE +  K   +  ++ +  +K+E+EEE  EEN
Sbjct: 459 LEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEEN 494



 Score = 27.3 bits (61), Expect = 9.9
 Identities = 13/35 (37%), Positives = 22/35 (62%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             ++EEE K ++   +  K+  K +KE+EEEE +E
Sbjct: 458 ELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDE 492


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 29.6 bits (66), Expect = 1.8
 Identities = 16/40 (40%), Positives = 24/40 (60%), Gaps = 6/40 (15%)

Query: 242 EQEEEEEKGKEEE------EEEEKKKGKKKKEKEEEEKEE 275
             EEEE++ K+EE      EEEEK++ K++ E+   E  E
Sbjct: 229 VLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAE 268



 Score = 29.6 bits (66), Expect = 1.9
 Identities = 13/35 (37%), Positives = 20/35 (57%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E+EE+EE  +E EE EE +   K ++K +    E 
Sbjct: 112 EEEEKEESREEREEVEETEGVTKSEQKNDWRDAEE 146



 Score = 29.6 bits (66), Expect = 2.2
 Identities = 16/39 (41%), Positives = 21/39 (53%), Gaps = 5/39 (12%)

Query: 242 EQEEEEEKGKEE-----EEEEEKKKGKKKKEKEEEEKEE 275
           E EE E   K E      + EE +K +K+ E EEEEK +
Sbjct: 125 EVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPK 163



 Score = 27.3 bits (60), Expect = 9.8
 Identities = 13/35 (37%), Positives = 23/35 (65%)

Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            +E+ G E E  EE++K + ++E+EE E+ E + K
Sbjct: 100 MQEDSGAENETVEEEEKEESREEREEVEETEGVTK 134


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
           glycoprotein C-terminal domain.  The trypanosome
           parasite expresses these proteins to evade the immune
           response.
          Length = 98

 Score = 28.2 bits (63), Expect = 1.9
 Identities = 9/40 (22%), Positives = 20/40 (50%)

Query: 237 SLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           +     + E +EKG + +++E+  K K K+ K ++     
Sbjct: 2   NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPV 41


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 28.4 bits (64), Expect = 1.9
 Identities = 12/34 (35%), Positives = 20/34 (58%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E+ + E+K +E++ E EK      +EK E E E+
Sbjct: 20  EKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEK 53



 Score = 28.4 bits (64), Expect = 2.3
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E E+  +   EE+ E E +K +K+ E+ E E    
Sbjct: 34  EAEKLAKMSAEEKAEYELEKLEKELEELEAELARR 68



 Score = 28.0 bits (63), Expect = 2.6
 Identities = 10/37 (27%), Positives = 21/37 (56%)

Query: 239 HYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             + +E+ + + K+EE++ E +K  K   +E+ E E 
Sbjct: 15  KAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYEL 51



 Score = 27.2 bits (61), Expect = 5.8
 Identities = 10/35 (28%), Positives = 20/35 (57%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             + E++++ K+ E E+  K   ++K + E EK E
Sbjct: 21  KAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLE 55


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 29.1 bits (65), Expect = 1.9
 Identities = 15/56 (26%), Positives = 28/56 (50%), Gaps = 8/56 (14%)

Query: 228 RNEARKINLSLHYLEQEEEE--------EKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +  A+K+ +     + EEE+        E+ +EE E+ E    ++ +E EE E+E 
Sbjct: 4   KGNAKKLKVRPPPTKDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59


>gnl|CDD|200031 TIGR00586, mutt, mutator mutT protein.  All proteins in this family
           for which functions are known are involved in repairing
           oxidative damage to dGTP (they are 8-oxo-dGTPases). This
           family is based on the phylogenomic analysis of JA Eisen
           (1999, Ph.D. Thesis, Stanford University). Lowering the
           threshold picks up members of MutT superfamily well [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 128

 Score = 28.7 bits (64), Expect = 1.9
 Identities = 15/44 (34%), Positives = 18/44 (40%)

Query: 214 GSEQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEE 257
           G  Q FS  +        R I L    LE+ E    GKE + EE
Sbjct: 59  GIPQHFSEFEKLEYEFYPRHITLWFWLLERWEGGPPGKEGQPEE 102


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 29.8 bits (68), Expect = 2.0
 Identities = 12/58 (20%), Positives = 29/58 (50%)

Query: 219 FSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           F +       +++++        ++EE+E+  KE  E  +K + +KKK+ +  + +E 
Sbjct: 156 FGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEG 213


>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 155

 Score = 28.9 bits (65), Expect = 2.0
 Identities = 7/34 (20%), Positives = 9/34 (26%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
               E + G   E    KKK    +    E    
Sbjct: 108 AALAEADGGPTAEATTPKKKKAAAEAAAAEAAAP 141



 Score = 28.1 bits (63), Expect = 3.3
 Identities = 9/36 (25%), Positives = 10/36 (27%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            LE         +     E    KKKK   E    E
Sbjct: 102 KLELFNAALAEADGGPTAEATTPKKKKAAAEAAAAE 137


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 29.7 bits (67), Expect = 2.0
 Identities = 16/64 (25%), Positives = 33/64 (51%), Gaps = 2/64 (3%)

Query: 216 EQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++L  L +  +   E  +       LE++ E+ +   EE EE K+K + ++ KEE  + E
Sbjct: 508 KELRELEEELIELLELEEALKEE--LEEKLEKLENLLEELEELKEKLQLQQLKEELRQLE 565

Query: 276 NLVK 279
           + ++
Sbjct: 566 DRLQ 569



 Score = 28.6 bits (64), Expect = 4.5
 Identities = 18/49 (36%), Positives = 25/49 (51%), Gaps = 3/49 (6%)

Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           LR E  ++   L  LE+E  E     E EE  K++ ++K EK E   EE
Sbjct: 499 LREEIEELEKELRELEEELIE---LLELEEALKEELEEKLEKLENLLEE 544



 Score = 28.2 bits (63), Expect = 6.5
 Identities = 12/34 (35%), Positives = 19/34 (55%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           +E+ EE E+  EE EEE +      +E EE  ++
Sbjct: 290 IERLEELEREIEELEEELEGLRALLEELEELLEK 323


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 29.3 bits (66), Expect = 2.0
 Identities = 12/39 (30%), Positives = 22/39 (56%), Gaps = 3/39 (7%)

Query: 242 EQEEEEEKGKE---EEEEEEKKKGKKKKEKEEEEKEENL 277
           E+EEEE +  E   EE+  ++   K+  E+ +E++E   
Sbjct: 151 EEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELRR 189



 Score = 27.7 bits (62), Expect = 7.8
 Identities = 14/35 (40%), Positives = 21/35 (60%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           L +EEEEE  + EEE EE    +  +++  EE +E
Sbjct: 149 LPEEEEEEPEEMEEELEEDAADRDARKRAAEEAKE 183


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 28.4 bits (64), Expect = 2.2
 Identities = 8/33 (24%), Positives = 21/33 (63%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E +  + +E++ +EE+ K + ++ KE E +++ 
Sbjct: 94  EAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126



 Score = 26.9 bits (60), Expect = 6.6
 Identities = 10/38 (26%), Positives = 21/38 (55%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           E +E  E  +  +EE++  + + K++ EE ++ E   K
Sbjct: 88  EIDERAEAQRARDEEKKLDEEEAKRQHEEAKEREREKK 125



 Score = 26.5 bits (59), Expect = 9.3
 Identities = 11/33 (33%), Positives = 23/33 (69%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           E +   ++ K+ +EEE K++ ++ KE+E E+K+
Sbjct: 94  EAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 29.8 bits (67), Expect = 2.2
 Identities = 17/66 (25%), Positives = 34/66 (51%)

Query: 210 VDLAGSEQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
           ++++ S  +  LS    L +  ++  L L      E +E+  EE ++E KK  K+KK+  
Sbjct: 154 LEMSPSTPVEKLSGWDTLSDSDQEAVLPLVKKSPSEAKEEKAEERKQESKKGAKRKKDAS 213

Query: 270 EEEKEE 275
            ++K +
Sbjct: 214 GDDKSK 219


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 29.7 bits (67), Expect = 2.2
 Identities = 16/33 (48%), Positives = 24/33 (72%), Gaps = 3/33 (9%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +EEE+E   EEEEEE++ +G  K+  ++EE EE
Sbjct: 391 EEEEDE---EEEEEEDEDEGPSKEHSDDEEFEE 420



 Score = 29.3 bits (66), Expect = 2.5
 Identities = 13/36 (36%), Positives = 25/36 (69%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
            + EEEE++ +EEEE+E++   K+  + EE E+++ 
Sbjct: 388 RDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDV 423



 Score = 28.9 bits (65), Expect = 3.8
 Identities = 13/36 (36%), Positives = 23/36 (63%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
            E+E+EEE+ +E+E+E   K+    +E EE++ E  
Sbjct: 391 EEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESK 426


>gnl|CDD|214692 smart00487, DEXDc, DEAD-like helicases superfamily. 
          Length = 201

 Score = 29.0 bits (65), Expect = 2.2
 Identities = 10/36 (27%), Positives = 17/36 (47%), Gaps = 1/36 (2%)

Query: 52 INHMFNGINATLLAYGQTGGGKTYTVSAMIMKTLQH 87
          I  + +G+   +LA   TG GKT       ++ L+ 
Sbjct: 17 IEALLSGLRDVILA-APTGSGKTLAALLPALEALKR 51


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 29.7 bits (66), Expect = 2.3
 Identities = 11/47 (23%), Positives = 30/47 (63%), Gaps = 1/47 (2%)

Query: 228  RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
            ++E +   +    ++ +EE+E+ +  +  +E++ GKK+KE+E+  ++
Sbjct: 1014 KHEIKDR-IVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRK 1059


>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD.  This
           family consists of several putative bacterial stage IV
           sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
           is known to be essential for efficient sporulation
           although its exact function is unknown.
          Length = 383

 Score = 29.3 bits (66), Expect = 2.3
 Identities = 15/37 (40%), Positives = 20/37 (54%)

Query: 233 KINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
            I    +Y  Q++  +  KEE  E+ KK  KKK EKE
Sbjct: 312 SIVRETYYEVQDKVVKLTKEEAVEKGKKLAKKKLEKE 348


>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein.  PACS-1 is a
           cytosolic sorting protein that directs the localisation
           of membrane proteins in the trans-Golgi network
           (TGN)/endosomal system. PACS-1 connects the clathrin
           adaptor AP-1 to acidic cluster sorting motifs contained
           in the cytoplasmic domain of cargo proteins such as
           furin, the cation-independent mannose-6-phosphate
           receptor and in viral proteins such as human
           immunodeficiency virus type 1 Nef.
          Length = 413

 Score = 29.4 bits (66), Expect = 2.4
 Identities = 12/26 (46%), Positives = 15/26 (57%), Gaps = 6/26 (23%)

Query: 256 EEEKKK------GKKKKEKEEEEKEE 275
           +E+ KK      GKK KEKE E K +
Sbjct: 332 KEKNKKVPVMFLGKKPKEKEVESKSQ 357


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 29.4 bits (67), Expect = 2.4
 Identities = 11/31 (35%), Positives = 18/31 (58%)

Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           EEEE  KEE + E +++ K + E+    + E
Sbjct: 39  EEEEARKEEAKREAEEEAKAEAEEAAAAEAE 69


>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
           MW N terminal.  This domain is found in eukaryotes. This
           domain is about 90 amino acids in length. This domain is
           found associated with pfam00076. This domain is part of
           U1 snRNP, which is the pre-mRNA binding protein of the
           penta-snRNP spliceosome complex. It extends over a
           distance of 180 A from its RNA binding domain, wraps
           around the core domain of U1 snRNP consisting of the
           seven Sm proteins and finally contacts U1-C, which is
           crucial for 5'-splice-site recognition.
          Length = 94

 Score = 27.6 bits (62), Expect = 2.4
 Identities = 8/33 (24%), Positives = 22/33 (66%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++ ++E   E  E   +K+ ++K+EK+E+ +++
Sbjct: 48  KDYKDEPPPEPTETWLEKREREKREKKEKLEKK 80



 Score = 27.2 bits (61), Expect = 3.1
 Identities = 10/37 (27%), Positives = 22/37 (59%)

Query: 238 LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           L   +  ++E   +  E   EK++ +K+++KE+ EK+
Sbjct: 44  LSEFKDYKDEPPPEPTETWLEKREREKREKKEKLEKK 80


>gnl|CDD|220223 pfam09405, Btz, CASC3/Barentsz eIF4AIII binding.  This domain is
           found on CASC3 (cancer susceptibility candidate gene 3
           protein) which is also known as Barentsz (Btz). CASC3 is
           a component of the EJC (exon junction complex) which is
           a complex that is involved in post-transcriptional
           regulation of mRNA in metazoa. The complex is formed by
           the association of four proteins (eIF4AIII, Barentsz,
           Mago, and Y14), mRNA, and ATP. This domain wraps around
           eIF4AIII and stacks against the 5' nucleotide.
          Length = 116

 Score = 28.2 bits (63), Expect = 2.5
 Identities = 10/36 (27%), Positives = 22/36 (61%), Gaps = 1/36 (2%)

Query: 242 EQEEEEEKGKE-EEEEEEKKKGKKKKEKEEEEKEEN 276
           E++       E  E +E+K++ +++ EK ++E +EN
Sbjct: 5   ERQSGRTPSAEPTEPKEDKERKRREHEKYDDEDDEN 40


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 29.4 bits (66), Expect = 2.6
 Identities = 14/53 (26%), Positives = 27/53 (50%)

Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           L  +   +   L   E+ ++E + + EE +E ++  K + EKE +E  + L K
Sbjct: 527 LEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKK 579


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 29.2 bits (66), Expect = 2.7
 Identities = 12/30 (40%), Positives = 16/30 (53%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
           +E EEE   E E   E K  K+++E   EE
Sbjct: 233 EEREEETDVEIETTSETKGTKQEQEGSTEE 262



 Score = 28.8 bits (65), Expect = 4.3
 Identities = 12/37 (32%), Positives = 21/37 (56%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
            E EE EE+   E E   + KG K++++   E++ +L
Sbjct: 230 SETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSL 266



 Score = 27.7 bits (62), Expect = 7.8
 Identities = 12/30 (40%), Positives = 21/30 (70%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           +++E K  +  EE+ KK+ KK+K K EE++
Sbjct: 649 KKKEFKILDYTEEKTKKEEKKEKNKREEKE 678



 Score = 27.7 bits (62), Expect = 8.0
 Identities = 10/37 (27%), Positives = 19/37 (51%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           L++  E E+ +EE + E +   + K  K+E+E     
Sbjct: 226 LKETSETEEREEETDVEIETTSETKGTKQEQEGSTEE 262


>gnl|CDD|219014 pfam06414, Zeta_toxin, Zeta toxin.  This family consists of several
           bacterial zeta toxin proteins. Zeta toxin is thought to
           be part of a postregulational killing system in
           bacteria. It relies on antitoxin/toxin systems that
           secure stable inheritance of low and medium copy number
           plasmids during cell division and kill cells that have
           lost the plasmid.
          Length = 191

 Score = 28.4 bits (64), Expect = 2.8
 Identities = 15/47 (31%), Positives = 22/47 (46%), Gaps = 2/47 (4%)

Query: 61  ATLLAYGQTGGGKTYTVSAMIMKTLQHVMQRCNKDDVYMSYLQLYSE 107
           A LL  GQ G GKT    A++ +     + R + D+   +Y   Y E
Sbjct: 14  AVLLG-GQPGAGKTELARALLEELGGGNVVRIDPDE-LRTYHPDYDE 58


>gnl|CDD|238549 cd01129, PulE-GspE, PulE/GspE The type II secretory pathway is the
           main terminal branch of the general secretory pathway
           (GSP).  It is responsible for the export the majority of
           Gram-negative bacterial exoenzymes and toxins. PulE is a
           cytoplasmic protein of the GSP, which contains an ATP
           binding site and a tetracysteine motif. This subgroup
           also includes PillB and HofB.
          Length = 264

 Score = 28.8 bits (65), Expect = 2.9
 Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 9/62 (14%)

Query: 23  FDHQVFIFD-NIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVSAMI 81
            D +  I D    G    N  IF ++L        +GI   +L  G TG GKT T+ + +
Sbjct: 50  LDKKNQILDLEKLGLKPENLEIFRKLL-----EKPHGI---ILVTGPTGSGKTTTLYSAL 101

Query: 82  MK 83
            +
Sbjct: 102 SE 103


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 29.2 bits (65), Expect = 3.0
 Identities = 14/53 (26%), Positives = 28/53 (52%)

Query: 223 DNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           D  +  N+ R+I  +   +++ +EE    ++ +  + K+  K  EKE E+KE 
Sbjct: 272 DKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKEL 324


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 29.0 bits (66), Expect = 3.0
 Identities = 12/34 (35%), Positives = 15/34 (44%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             E EE    E+EEEEE +        E E  E+
Sbjct: 186 GSELEELDDDEDEEEEEDENDDSLAADESELPEK 219


>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
          Length = 196

 Score = 28.3 bits (63), Expect = 3.0
 Identities = 8/32 (25%), Positives = 19/32 (59%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
            + ++ + + +  +E EE K+  ++KE E +E
Sbjct: 32  EQAQQNQPQQQTVDEIEELKQKLQQKEVEAQE 63


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 29.0 bits (66), Expect = 3.0
 Identities = 15/47 (31%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 234 INLSLHYLEQEEE-EEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
            NL+  Y E +EE +    +++E +   K +KKKEK+E +  +   K
Sbjct: 43  TNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89



 Score = 27.1 bits (61), Expect = 9.7
 Identities = 11/51 (21%), Positives = 26/51 (50%), Gaps = 1/51 (1%)

Query: 217 QLFSLSDNY-LLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKK 266
           ++ +L++ Y  ++ E +   L    L+   + +K KE++E +  K   K +
Sbjct: 41  EVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 27.6 bits (61), Expect = 3.1
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 254 EEEEEKKKGKKKKEKEEEEKEEN 276
                K + KK+++KEEEE+EE+
Sbjct: 82  ATAGAKAEAKKEEKKEEEEEEED 104


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 28.1 bits (62), Expect = 3.3
 Identities = 11/34 (32%), Positives = 23/34 (67%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           EQ E+EE+ + E+EE+++ + ++ K +  EE+  
Sbjct: 37  EQREQEEQERREQEEQDRLEREELKRRAAEERLR 70


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 28.8 bits (64), Expect = 3.4
 Identities = 16/39 (41%), Positives = 24/39 (61%), Gaps = 5/39 (12%)

Query: 242 EQEEEEEKGKEEEEEEEKK-----KGKKKKEKEEEEKEE 275
           E+  EE  G+EEEE+EE       +G ++  +EEEE+ E
Sbjct: 179 EERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAE 217



 Score = 27.7 bits (61), Expect = 7.9
 Identities = 10/33 (30%), Positives = 14/33 (42%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E   E  +EEEEE E     +     E E+ + 
Sbjct: 203 EGNRELNEEEEEEAEGSDDGEDVVDYEGERIDK 235


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 28.7 bits (65), Expect = 3.4
 Identities = 11/53 (20%), Positives = 23/53 (43%), Gaps = 5/53 (9%)

Query: 226 LLRNEARKINLSLH-----YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           L +    ++ L         + + +EE +  E+E E   ++ K+  +K  EE 
Sbjct: 197 LKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEEL 249


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 28.8 bits (65), Expect = 3.6
 Identities = 11/56 (19%), Positives = 21/56 (37%), Gaps = 4/56 (7%)

Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEE----KKKGKKKKEKEEEEKEENLVK 279
           +    ++   L  + +E EEE  + E+  +E           K    E K + +V 
Sbjct: 503 KELLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVL 558


>gnl|CDD|130727 TIGR01666, YCCS, TIGR01666 family membrane protein.  This model
           represents a clade of sequences from gamma and beta
           proteobacteria. These proteins are >700 amino acids long
           and many have been annotated as putative membrane
           proteins. The gene from Salmonella has been annotated as
           a putative efflux transporter. The gene from E. coli has
           the name yccS [Cell envelope, Other].
          Length = 704

 Score = 28.7 bits (64), Expect = 3.7
 Identities = 17/109 (15%), Positives = 40/109 (36%), Gaps = 22/109 (20%)

Query: 181 TAVTKMNAQSSR--------------SHTICTIYLGAMAKLHLVDLAGSEQLFSLSDNYL 226
           T V+ MN +  +              +H++   Y+ A+        A  ++L +L     
Sbjct: 577 TTVSNMNNEPVKYKAYLQKGFRLLKLNHSL-LSYISALG-------AHRDRLKNLQQTAQ 628

Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             +    +   L Y  +  EE       +++E  +  + +++E   +E 
Sbjct: 629 FLDGFYPVAKKLIYTLEHIEEIPEAIFNQQQESIETLELRKQEMTAEER 677


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 28.2 bits (63), Expect = 3.7
 Identities = 19/50 (38%), Positives = 28/50 (56%)

Query: 226 LLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           L R EAR   L     E+  EEE+   EEE EE+++  ++    EEE+E+
Sbjct: 64  LDREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEED 113


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
           function of this eukaryotic protein family is unknown.
           The yeast orthologues have been implicated in cell cycle
           progression and biogenesis of 60S ribosomal subunits.
           The Schistosoma mansoni Mak16 has been shown to target
           protein transport to the nucleolus.
          Length = 97

 Score = 27.1 bits (60), Expect = 3.7
 Identities = 15/35 (42%), Positives = 23/35 (65%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           LE EE EE  +EEEEEEE++   + +   ++E+ E
Sbjct: 55  LEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELE 89



 Score = 26.0 bits (57), Expect = 9.2
 Identities = 14/35 (40%), Positives = 23/35 (65%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E EE +E+ +EEEEEE++ + +   + EE E+E  
Sbjct: 59  ESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIE 93


>gnl|CDD|131051 TIGR01996, PTS-II-BC-sucr, PTS system, sucrose-specific IIBC
           component.  This model represents the fused enzyme II B
           and C components of the sucrose-specific PTS sugar
           transporter system. Sucrose is converted to
           sucrose-6-phosphate in the process of translocation into
           the cell. Some of these transporters lack their own IIA
           domains and instead use the glucose IIA protein (IIAglc
           or Crr). The exceptions to this rule are Staphylococci,
           Streptococci, Lactococci, Lactobacilli, etc. which
           contain their own A domain as a C-terminal fusion. This
           family is closely related to the trehalose transporting
           PTS IIBC enzymes and the B and C domains of each are
           described by subfamily-domain level TIGRFAMs models
           (TIGR00826 and TIGR00852, respectively).
          Length = 461

 Score = 28.8 bits (65), Expect = 3.7
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 52  INHMFNGINATLLAYGQTGGGKTYTVSAM 80
           I+H F+ I   LLA  +TGG   + +++M
Sbjct: 318 IHHSFHAIETQLLANTKTGGNFLFPIASM 346


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 28.9 bits (64), Expect = 3.8
 Identities = 10/33 (30%), Positives = 22/33 (66%)

Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           EEE + ++  E+++++K K++KEK    +  +L
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHHSL 117



 Score = 28.5 bits (63), Expect = 4.2
 Identities = 12/57 (21%), Positives = 31/57 (54%), Gaps = 8/57 (14%)

Query: 221 LSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           +SD Y+   E R+         ++  E+  +E+++ E++K+G+++      E +E++
Sbjct: 77  MSDQYVKLEEERR--------HRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDI 125



 Score = 28.1 bits (62), Expect = 6.7
 Identities = 11/31 (35%), Positives = 22/31 (70%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
           E++ ++ K KE++E+E+++   KKKE E  +
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 27.7 bits (62), Expect = 3.9
 Identities = 15/38 (39%), Positives = 24/38 (63%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           YL++E E +K     EE++K   +K KEK E+E+ +N 
Sbjct: 42  YLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQ 79


>gnl|CDD|176652 cd06140, DNA_polA_I_Bacillus_like_exo, inactive DEDDy 3'-5'
           exonuclease domain of Bacillus stearothermophilus DNA
           polymerase I and similar family-A DNA polymerases.
           Bacillus stearothermophilus-like Polymerase I (Pol I), a
           subgroup of the family-A DNA polymerases, contains an
           inactive DnaQ-like 3'-5' exonuclease domain in the same
           polypeptide chain as the polymerase region. The
           exonuclease-like domain of these proteins possess the
           same fold as the Klenow fragment (KF) of Escherichia
           coli Pol I, but does not contain the four critical
           metal-binding residues necessary for activity. The
           function of this domain is unknown. It might act as a
           spacer between the polymerase and the 5'-3' exonuclease
           domains. Some members of this subgroup, such as those
           from Bacillus sphaericus and Thermus aquaticus, are
           thermostable DNA polymerases.
          Length = 178

 Score = 28.0 bits (63), Expect = 3.9
 Identities = 16/58 (27%), Positives = 23/58 (39%), Gaps = 10/58 (17%)

Query: 225 YLLRNEARKINLS---LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           YLL       +L+     YL +E   +       EE   KG K    +EE   E+L +
Sbjct: 90  YLLDPTRSSYDLADLAKRYLGRELPSD-------EEVYGKGAKFAVPDEEVLAEHLAR 140


>gnl|CDD|165339 PHA03046, PHA03046, Hypothetical protein; Provisional.
          Length = 142

 Score = 27.8 bits (62), Expect = 3.9
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           ++EE    E  E+  ++ G  KK  + EE+E NL+ 
Sbjct: 12  DDEENDRNEHREKTSEEDGHYKKRLDVEEEEPNLIN 47


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 28.3 bits (64), Expect = 3.9
 Identities = 12/34 (35%), Positives = 24/34 (70%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E+ E E + ++EE+E E  + + ++E+ E+E+EE
Sbjct: 167 EEREAERRERKEEKEREVARLRAQQEEAEDEREE 200



 Score = 28.0 bits (63), Expect = 6.7
 Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 6/40 (15%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGK------KKKEKEEEEKEE 275
            Q EE+++ K EE+EEE++  +       K   EEEE+E 
Sbjct: 24  AQIEEKKRIKAEEKEEERRIDEMMEEERLKALAEEEERER 63



 Score = 27.6 bits (62), Expect = 7.2
 Identities = 14/36 (38%), Positives = 22/36 (61%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           E++EEE +  E  EEE  K   +++E+E + KEE  
Sbjct: 35  EEKEEERRIDEMMEEERLKALAEEEERERKRKEERR 70


>gnl|CDD|215349 PLN02647, PLN02647, acyl-CoA thioesterase.
          Length = 437

 Score = 28.6 bits (64), Expect = 3.9
 Identities = 12/35 (34%), Positives = 19/35 (54%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E EEE+   +E E   + +K K+ ++K E E  E 
Sbjct: 225 ETEEEKLLFEEAEARNKLRKKKRGEQKREFENGEA 259



 Score = 27.8 bits (62), Expect = 7.0
 Identities = 15/34 (44%), Positives = 19/34 (55%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           L  E EEEK   EE E   K  KKK+ +++ E E
Sbjct: 222 LSPETEEEKLLFEEAEARNKLRKKKRGEQKREFE 255


>gnl|CDD|216584 pfam01580, FtsK_SpoIIIE, FtsK/SpoIIIE family.  FtsK has extensive
          sequence similarity to wide variety of proteins from
          prokaryotes and plasmids, termed the FtsK/SpoIIIE
          family. This domain contains a putative ATP binding
          P-loop motif. It is found in the FtsK cell division
          protein from E. coli and the stage III sporulation
          protein E SpoIIIE, which has roles in regulation of
          prespore specific gene expression in B. subtilis. A
          mutation in FtsK causes a temperature sensitive block
          in cell division and it is involved in peptidoglycan
          synthesis or modification. The SpoIIIE protein is
          implicated in intercellular chromosomal DNA transfer.
          Length = 201

 Score = 28.1 bits (63), Expect = 3.9
 Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 60 NATLLAYGQTGGGKTYTVSAMIM 82
             LL  G TG GK+  ++ +I+
Sbjct: 38 MPHLLIAGATGSGKSTFLNTLIL 60


>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
          Length = 472

 Score = 28.4 bits (64), Expect = 4.0
 Identities = 14/37 (37%), Positives = 21/37 (56%), Gaps = 5/37 (13%)

Query: 240 YLEQEEEE--EKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           Y E+EE+E  +K K+E +EE +      + K  EE E
Sbjct: 390 YKEREEKEKEKKAKKEGKEERR---IHFQNKSIEEAE 423


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 28.4 bits (64), Expect = 4.1
 Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 7/43 (16%)

Query: 240 YLEQ-------EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           Y+E+       +E   K K   E++  K  KK   K  E+KE+
Sbjct: 363 YIEEPLKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEK 405


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 28.7 bits (64), Expect = 4.2
 Identities = 14/36 (38%), Positives = 25/36 (69%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +L++ +E+++ K+EE+ EE K  +  +E EEEEK  
Sbjct: 275 WLKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGF 310



 Score = 28.7 bits (64), Expect = 4.9
 Identities = 10/29 (34%), Positives = 21/29 (72%)

Query: 251 KEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           KE++EE++++K ++ K +E +E+ E   K
Sbjct: 280 KEKKEEKDEEKSEEVKTEEVDEEFEEEEK 308


>gnl|CDD|163125 TIGR03071, couple_hipA, HipA N-terminal domain.  Although Pfam
           models pfam07805 and pfam07804 currently are called
           HipA-like N-terminal domain and HipA-like C-terminal
           domain, respectively, those models hit the central and
           C-terminal regions of E. coli HipA but not the
           N-terminal region. This model hits the N-terminal region
           of HipA and its homologs, and also identifies proteins
           that lack match regions for pfam07804 and pfam07805.
          Length = 101

 Score = 26.9 bits (60), Expect = 4.4
 Identities = 13/38 (34%), Positives = 19/38 (50%), Gaps = 1/38 (2%)

Query: 208 HLVDLAGSEQLFSLSDNYLLRNEARKINLSLHYLEQEE 245
            L +  G +  F   D+YL    A  ++LSL  L+ EE
Sbjct: 13  ILTEDEGGKLSFRYDDDYLGSPSAPPLSLSLP-LQAEE 49


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 28.1 bits (62), Expect = 4.5
 Identities = 13/38 (34%), Positives = 24/38 (63%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           ++ EE E+  E E  EEK   ++ +E++E E+EE  ++
Sbjct: 133 QEREELEERMEWERREEKIDEREDQEEQEREREEQTIE 170


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 27.9 bits (62), Expect = 4.6
 Identities = 12/27 (44%), Positives = 24/27 (88%)

Query: 247 EEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           E+K K+++ E++K++ KKKKEK++++K
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKKKK 167


>gnl|CDD|236289 PRK08563, PRK08563, DNA-directed RNA polymerase subunit E';
           Provisional.
          Length = 187

 Score = 27.9 bits (63), Expect = 4.6
 Identities = 17/46 (36%), Positives = 21/46 (45%), Gaps = 12/46 (26%)

Query: 231 ARKINLSLHYLEQEEEEEK----------GKEEEEEEEKKKGKKKK 266
           AR + +SL   E+     K          GK E  EEEKKK KK+ 
Sbjct: 144 ARIVAVSL--KERRPRGSKIGLTMRQPGLGKLEWIEEEKKKAKKEA 187


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 28.0 bits (63), Expect = 4.7
 Identities = 14/42 (33%), Positives = 23/42 (54%)

Query: 230 EARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEE 271
              K+   L  L++E EE K +  E + + +K KK +E+ EE
Sbjct: 63  ALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEE 104


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 28.5 bits (64), Expect = 4.7
 Identities = 11/42 (26%), Positives = 20/42 (47%)

Query: 234 INLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           +N     LE+E EE +    + E      KK++++ E +  E
Sbjct: 859 LNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRE 900


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 28.5 bits (64), Expect = 4.9
 Identities = 19/49 (38%), Positives = 30/49 (61%)

Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           L  +  ++   L  LE+E EE K + EE E EK++ + + ++ EEEKEE
Sbjct: 840 LEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEE 888



 Score = 28.1 bits (63), Expect = 5.9
 Identities = 12/52 (23%), Positives = 23/52 (44%)

Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
               R   L     E EE  E+ KE+ E  +++  +++   EE E+    ++
Sbjct: 306 LLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELE 357



 Score = 27.8 bits (62), Expect = 8.4
 Identities = 12/39 (30%), Positives = 19/39 (48%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           LEQ   E +  +EE EE+     ++ E+  E   E L +
Sbjct: 349 LEQLLAELEEAKEELEEKLSALLEELEELFEALREELAE 387


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 27.5 bits (61), Expect = 4.9
 Identities = 13/37 (35%), Positives = 24/37 (64%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           +E+ E++  + ++E EK +  KKK+KE E + + L K
Sbjct: 86  REQREKELAKRQKELEKIELSKKKQKERERRRKKLTK 122


>gnl|CDD|216212 pfam00955, HCO3_cotransp, HCO3- transporter family.  This family
           contains Band 3 anion exchange proteins that exchange
           CL-/HCO3-. This family also includes cotransporters of
           Na+/HCO3-.
          Length = 501

 Score = 28.4 bits (64), Expect = 4.9
 Identities = 9/52 (17%), Positives = 14/52 (26%)

Query: 99  MSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQSDQRVQGPPDPQTGPASPGN 150
             Y+  YS  C   +  N   T  +W        +   G     +    P  
Sbjct: 181 SHYITNYSCVCVPPITPNNHSTEIDWTNLTKSECENYGGLLIGTSCGPHPPP 232


>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D.  SpoVID, the
           stage VI sporulation protein D, is restricted to
           endospore-forming members of the bacteria, all of which
           are found among the Firmicutes. It is widely distributed
           but not quite universal in this group. Between
           well-conserved N-terminal and C-terminal domains is a
           poorly conserved, low-complexity region of variable
           length, rich enough in glutamic acid to cause spurious
           BLAST search results unless a filter is used. The seed
           alignment for this model was trimmed, in effect, by
           choosing member sequences in which these regions are
           relatively short. SpoVID is involved in spore coat
           assembly by the mother cell compartment late in the
           process of sporulation [Cellular processes, Sporulation
           and germination].
          Length = 338

 Score = 28.3 bits (63), Expect = 4.9
 Identities = 11/37 (29%), Positives = 20/37 (54%)

Query: 239 HYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
             LE E+EE+   E+ E+ +++       K+  E+EE
Sbjct: 230 TELEVEDEEKALDEQTEDPQQEDALAGDAKKALEEEE 266


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 28.4 bits (63), Expect = 4.9
 Identities = 12/37 (32%), Positives = 16/37 (43%)

Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           +  EE K K E    +KK   + K   E+ K E   K
Sbjct: 178 KAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAK 214



 Score = 28.0 bits (62), Expect = 6.0
 Identities = 12/35 (34%), Positives = 23/35 (65%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            EQ  EE K K+  E+E  K+ +K++ K +E++++
Sbjct: 85  EEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQ 119


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
           represents the eukaryotic large ribosomal protein P1.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P1 is located in the L12 stalk, with proteins
           P2, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers) and bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plant species have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site located in the C-terminal
           region of P2. TCS inactivates the ribosome by
           depurinating a specific adenine in the sarcin-ricin loop
           of 28S rRNA.
          Length = 103

 Score = 26.9 bits (60), Expect = 4.9
 Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 250 GKEEEEEEEKKKGKKKKEKEEEE 272
                  E KK+ KK++E+EE +
Sbjct: 73  AAAAAAAEAKKEEKKEEEEEESD 95


>gnl|CDD|234655 PRK00133, metG, methionyl-tRNA synthetase; Reviewed.
          Length = 673

 Score = 28.6 bits (65), Expect = 4.9
 Identities = 7/32 (21%), Positives = 8/32 (25%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           E   E  KE    +           EE   E 
Sbjct: 536 EALIEASKEAAAAKAAAAAAAAPLAEEPIAET 567


>gnl|CDD|241384 cd13230, PH1_SSRP1-like, Structure Specific Recognition protein 1
           (SSRP1) Pleckstrin homology (PH) domain, repeat 1.
           SSRP1 is a component of FACT (facilitator of chromatin
           transcription), an essential chromatin reorganizing
           factor. In yeast FACT (yFACT) is composed of three
           proteins: Spt16/Cdc68, Pob3, and Nhp6. In metazoans the
           Pob3 and Nhp6 orthologs are fused to form SSRP1/T160 in
           human and mouse, respectively. The middle domain of the
           Pob3 subunit (Pob3-M) has an unusual double pleckstrin
           homology (PH) architecture. yFACT interacts in a
           physiologically important way with the central
           single-strand DNA binding factor RPA to promote a step
           in DNA Replication. Coordinated function by yFACT and
           RPA is important during nucleosome deposition. These
           results support the model that the FACT family has an
           essential role in constructing nucleosomes during DNA
           replication, and suggest that RPA contributes to this
           process. Members of this cd are composed of the first
           PH-like repeat. PH domains have diverse functions, but
           in general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 137

 Score = 27.5 bits (62), Expect = 5.0
 Identities = 14/38 (36%), Positives = 20/38 (52%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLV 278
            +++EE E      EEE ++K K K EKE E     +V
Sbjct: 84  FDKDEEIELELNLTEEELEEKYKGKLEKEYEGPLYEVV 121


>gnl|CDD|199812 cd10488, MH1_R-SMAD, N-terminal Mad Homology 1 (MH1) domain of
           receptor regulated SMADs.  The MH1 is a small
           DNA-binding domain present in SMAD (small mothers
           against decapentaplegic) family of proteins, which are
           signal transducers and transcriptional modulators that
           mediate multiple signaling pathways. It binds to the
           major groove in an unusual manner via a beta hairpin
           structure.  It negatively regulates the functions of the
           MH2 domain, the C-terminal domain of SMAD. This MH1
           domain is found in all receptor regulated SMADs
           (R-SMADs) including SMAD1, SMAD2, SMAD3, SMAD5 and
           SMAD9. SMAD1 plays an essential role in bone development
           and postnatal bone formation through activation by bone
           morphogenetic protein (BMP) type 1 receptor kinase.
           SMAD2 regulates multiple cellular processes, such as
           cell proliferation, apoptosis and differentiation, while
           SMAD3 modulates signals of activin and TGF-beta. SMAD4,
           a common mediator SMAD (co-SMAD) binds R-SMADs, forming
           an oligomeric complex that binds to DNA and serves as a
           transcription factor. SMAD5 is involved in bone
           morphogenetic proteins (BMP) signal modulation, possibly
           playing a role in the pathway involving inhibition of
           hematopoietic progenitor cells by TGF-beta. SMAD9 (also
           known as SMAD8) can mediate the differentiation of
           mesenchymal stem cells (MSCs) into tendon-like cells by
           inhibiting the osteogenic pathway.
          Length = 123

 Score = 27.2 bits (60), Expect = 5.0
 Identities = 16/32 (50%), Positives = 19/32 (59%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           EQ  EEEK  E+  +   KK KKK + EE EK
Sbjct: 13  EQNGEEEKWAEKAVKSLVKKLKKKGQLEELEK 44


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 27.3 bits (61), Expect = 5.2
 Identities = 12/34 (35%), Positives = 25/34 (73%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           EQ+  ++  KE+ +++++KK KKKK+K++  K+ 
Sbjct: 97  EQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKA 130



 Score = 26.6 bits (59), Expect = 9.5
 Identities = 9/28 (32%), Positives = 22/28 (78%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEK 268
           L+++ +++K K+++++++KKK  KK  K
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAK 132


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 27.9 bits (62), Expect = 5.4
 Identities = 9/34 (26%), Positives = 15/34 (44%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            +E+E++   EEEE        +  +   EE E 
Sbjct: 230 PKEDEDDDQGEEEESGSSDSLSEDSDASSEEMES 263


>gnl|CDD|222117 pfam13422, DUF4110, Domain of unknown function (DUF4110).  This is
           a family that is found predominantly at the C-terminus
           of Kelch-containing proteins. However, the exact
           function of this region is not known.
          Length = 99

 Score = 26.9 bits (60), Expect = 5.4
 Identities = 12/28 (42%), Positives = 14/28 (50%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEK 268
           LE EEEE+ G EE     K+K     E 
Sbjct: 69  LEDEEEEQAGIEEVVSLAKEKDWGGGEG 96


>gnl|CDD|235871 PRK06835, PRK06835, DNA replication protein DnaC; Validated.
          Length = 329

 Score = 27.9 bits (63), Expect = 5.5
 Identities = 11/20 (55%), Positives = 13/20 (65%)

Query: 56  FNGINATLLAYGQTGGGKTY 75
           F+  N  LL YG TG GKT+
Sbjct: 179 FDKNNENLLFYGNTGTGKTF 198


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 28.1 bits (63), Expect = 5.6
 Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 230 EARKINLSLHYLEQE-EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           + R   L+   LE+E EE EK KEE EEE  K   +  E ++E KE  
Sbjct: 378 KKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELK 425



 Score = 27.7 bits (62), Expect = 8.8
 Identities = 23/53 (43%), Positives = 32/53 (60%), Gaps = 2/53 (3%)

Query: 227 LRNEARKINLSLHYLEQEEE--EEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
             +E R+I   L  LE+E    EE+ KE EE+EE+ +  KKK KE E++ E L
Sbjct: 305 YLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEEL 357


>gnl|CDD|216796 pfam01935, DUF87, Domain of unknown function DUF87.  The function
          of this prokaryotic domain is unknown. It contains
          several conserved aspartates and histidines that could
          be metal ligands.
          Length = 218

 Score = 27.7 bits (62), Expect = 5.6
 Identities = 7/16 (43%), Positives = 11/16 (68%)

Query: 66 YGQTGGGKTYTVSAMI 81
           G TG GK+ TV+ ++
Sbjct: 29 LGSTGSGKSNTVAVLL 44


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this family
           (Kri1p) is found to be required for 40S ribosome
           biogenesis in the nucleolus.
          Length = 99

 Score = 26.8 bits (60), Expect = 5.7
 Identities = 14/32 (43%), Positives = 20/32 (62%), Gaps = 3/32 (9%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           ++ +EEEK    + EEE K+ K  K +E EEK
Sbjct: 1   KERKEEEK---AQREEELKRLKNLKREEIEEK 29


>gnl|CDD|205495 pfam13315, DUF4085, Protein of unknown function (DUF4085).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 101 and 269 amino
           acids in length.
          Length = 208

 Score = 27.8 bits (62), Expect = 5.9
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 238 LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           L + E +EE E+   EEEE      +  KE+ EE KE+ L
Sbjct: 17  LSFPESQEEWEEELREEEESGNDYLQSLKEEIEEIKEDLL 56


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 28.0 bits (63), Expect = 5.9
 Identities = 11/49 (22%), Positives = 27/49 (55%)

Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           L  +   ++     LE++E+E   KE+  +E++++ ++   ++ EE E 
Sbjct: 92  LDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELER 140


>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal.  This is the N terminal of
           the DNA repair protein Rad54.
          Length = 191

 Score = 27.7 bits (62), Expect = 5.9
 Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 252 EEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           ++ + EE+K  K ++ +E E K  N  K
Sbjct: 143 DKPKIEEEKAEKDQEPEESETKLSNGPK 170


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 27.2 bits (61), Expect = 6.0
 Identities = 15/38 (39%), Positives = 21/38 (55%)

Query: 232 RKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
           R+I L L     +E  EK K   +  EKK  +++KEKE
Sbjct: 92  RRIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKE 129



 Score = 26.5 bits (59), Expect = 9.2
 Identities = 17/52 (32%), Positives = 28/52 (53%), Gaps = 10/52 (19%)

Query: 232 RKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKK---------KKEKEEEEKE 274
           R I+L  H  + E+E  + KE++++  K+ GKK         ++ KE  EKE
Sbjct: 60  RVIDLKKHNAKVEKELLREKEKKKKR-KRPGKKRRIALRLRRERTKERAEKE 110


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 28.1 bits (63), Expect = 6.0
 Identities = 10/32 (31%), Positives = 24/32 (75%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           E+EEE++  + E+E+EE +  +++++ +E +K
Sbjct: 288 EEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319


>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078).  This
           family is found from fungi to humans, but its exact
           function is not known.
          Length = 88

 Score = 26.5 bits (59), Expect = 6.2
 Identities = 17/38 (44%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           EQ EE EK +EE E E K++ +++KEK +   EE   K
Sbjct: 47  EQMEELEKAREETERERKER-EERKEKRKRAIEERRKK 83


>gnl|CDD|236325 PRK08654, PRK08654, pyruvate carboxylase subunit A; Validated.
          Length = 499

 Score = 28.0 bits (63), Expect = 6.3
 Identities = 12/51 (23%), Positives = 25/51 (49%), Gaps = 1/51 (1%)

Query: 226 LLRNEA-RKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           ++ NE   + NL  H++E+E    +  +    EE+++ K   EK     ++
Sbjct: 424 VMENENFVRGNLHTHFIEEETTILEEMKRYALEEEEREKTLSEKFFPGNKK 474


>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 292

 Score = 28.0 bits (62), Expect = 6.3
 Identities = 15/57 (26%), Positives = 25/57 (43%)

Query: 129 PQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTK 185
             ++ +VQ     QT    P    +AAS  +K   S +  +QL  +G+ + V    K
Sbjct: 174 TATEAKVQTASPAQTPATPPAGKGAAASGQLKSAPSSHYTLQLSASGSYDNVNGFAK 230


>gnl|CDD|225364 COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase
           PilT [Cell motility and secretion / Intracellular
           trafficking and secretion].
          Length = 353

 Score = 28.0 bits (63), Expect = 6.4
 Identities = 10/18 (55%), Positives = 13/18 (72%)

Query: 64  LAYGQTGGGKTYTVSAMI 81
           L  G TG GK+ T++AMI
Sbjct: 129 LVTGPTGSGKSTTLAAMI 146


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 27.8 bits (63), Expect = 6.4
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEE 271
             E  EE+ +E EEEEE+++ ++ +E E E
Sbjct: 229 LAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258


>gnl|CDD|148072 pfam06244, DUF1014, Protein of unknown function (DUF1014).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 122

 Score = 27.0 bits (60), Expect = 6.4
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           +E+  E+E+ K E  EE+ KK   ++   EE    N  +
Sbjct: 5   IEERLEKEQRKLEAPEEKSKKNAAREVPLEENLNVNRNR 43


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 27.9 bits (63), Expect = 6.5
 Identities = 12/37 (32%), Positives = 25/37 (67%), Gaps = 1/37 (2%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEE-EEKEENL 277
           E E++ E+ +   +E EK K + +++KE+ +E+E+ L
Sbjct: 531 ELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKL 567


>gnl|CDD|219905 pfam08564, CDC37_C, Cdc37 C terminal domain.  Cdc37 is a protein
           required for the activity of numerous eukaryotic protein
           kinases. This domains corresponds to the C terminal
           domain whose function is unclear. It is found C terminal
           to the Hsp90 chaperone (Heat shocked protein 90) binding
           domain pfam08565 and the N terminal kinase binding
           domain of Cdc37 pfam03234.
          Length = 89

 Score = 26.2 bits (58), Expect = 6.6
 Identities = 10/32 (31%), Positives = 22/32 (68%)

Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
             + +G++E +E E++  +++ EKEE E+E+ 
Sbjct: 58  NAKIEGEKEFKELEEEYNEEEAEKEEIEEEDE 89



 Score = 26.2 bits (58), Expect = 8.5
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
              + E EK  +E EEE  ++  +K+E EEE++
Sbjct: 57  PNAKIEGEKEFKELEEEYNEEEAEKEEIEEEDE 89


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 27.9 bits (62), Expect = 6.6
 Identities = 7/33 (21%), Positives = 21/33 (63%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           E+E+E++  + +++E+E +  +  + + E + E
Sbjct: 648 EEEDEDDLEETDDDEDECEAIEDSESESESDGE 680


>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit.  This
           family includes the vacuolar ATP synthase E subunit, as
           well as the archaebacterial ATP synthase E subunit.
          Length = 195

 Score = 27.3 bits (61), Expect = 6.7
 Identities = 15/33 (45%), Positives = 21/33 (63%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           E EEE E  K E  EE +KK ++  EK+E++ E
Sbjct: 16  EAEEEFEIEKAEAVEEAEKKIEEIYEKKEKQAE 48


>gnl|CDD|234340 TIGR03744, traC_PFL_4706, conjugative transfer ATPase, PFL_4706
           family.  Members of this protein family are predicted
           ATP-binding proteins apparently associated with DNA
           conjugal transfer. Members are found both in plasmids
           and in bacterial chromosomal regions that appear to
           derive from integrative elements such as conjugative
           transposons. More distant homologs, outside the scope of
           this family, include type IV secretion/conjugal transfer
           proteins such as TraC, VirB4 and TrsE. The granularity
           of this protein family definition is chosen so as to
           represent one distinctive clade and act as a marker
           through which to define and recognize the class of
           mobile element it serves [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 893

 Score = 28.1 bits (63), Expect = 6.8
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 60  NATLLAYGQTGGGKTYTVSAMIMKTL 85
           NA LL  G TG GK+ T++ ++M+ +
Sbjct: 475 NAHLLILGPTGAGKSATLTNLLMQVM 500


>gnl|CDD|131972 TIGR02926, AhaH, ATP synthase archaeal, H subunit.  he A1/A0 ATP
           synthase is homologous to the V-type (V1/V0, vacuolar)
           ATPase, but functions in the ATP synthetic direction as
           does the F1/F0 ATPase of bacteria. The hydrophilic A1
           "stalk" complex (AhaABCDEFG) is the site of ATP
           generation and is coupled to the membrane-embedded
           proton translocating A0 complex. It is unclear precisely
           where AhaH fits into these complexes.
          Length = 85

 Score = 26.3 bits (58), Expect = 6.8
 Identities = 20/49 (40%), Positives = 28/49 (57%), Gaps = 7/49 (14%)

Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEK--EEEEKE 274
           R EAR++       E EEE  K  EE  +E +++ +K+ EK  EE EKE
Sbjct: 30  REEAREL-----LEEAEEEASKLGEEIIKEAEEEIEKEAEKIREEGEKE 73


>gnl|CDD|216848 pfam02018, CBM_4_9, Carbohydrate binding domain.  This family
           includes diverse carbohydrate binding domains.
          Length = 134

 Score = 27.0 bits (60), Expect = 6.9
 Identities = 17/79 (21%), Positives = 25/79 (31%), Gaps = 16/79 (20%)

Query: 72  GKTYTVSAMIMKTLQHVMQRCNKDDVYMSYLQLYSEKCYDLLNGNKEVTLKNW-----IF 126
           G TYTVS  +               + ++     +   YD +   K V    W      F
Sbjct: 62  GTTYTVSFWVK----ASSGPQ---TISVTLQITDASGNYDTVADEKVVLTGEWTKLEGTF 114

Query: 127 NIPQSDQR----VQGPPDP 141
            IP +       V+  PD 
Sbjct: 115 TIPGTASTVELYVELGPDS 133


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 27.8 bits (62), Expect = 6.9
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query: 247 EEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           E+K  E +++ EKKK KKK++KE + + E 
Sbjct: 60  EDKNNESKKKSEKKKKKKKEKKEPKSEGET 89


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 27.8 bits (62), Expect = 7.0
 Identities = 10/39 (25%), Positives = 18/39 (46%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
                E     +E E ++  K KKKK+K++++K      
Sbjct: 37  DAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAY 75


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 27.5 bits (61), Expect = 7.0
 Identities = 12/22 (54%), Positives = 19/22 (86%)

Query: 252 EEEEEEEKKKGKKKKEKEEEEK 273
           +++E+EEKKK K+ K +E+EEK
Sbjct: 185 QQKEKEEKKKVKEAKRREKEEK 206


>gnl|CDD|225435 COG2880, COG2880, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 67

 Score = 25.9 bits (57), Expect = 7.1
 Identities = 12/47 (25%), Positives = 23/47 (48%), Gaps = 4/47 (8%)

Query: 232 RKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLV 278
           +K++L     E E+ +   + +E+  E  KG  K+ KE  E+    +
Sbjct: 20  KKVDLK----EGEKVKIVIRVKEKIYEILKGSLKEIKEILEEILEEI 62


>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
           (DUF2098).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 91

 Score = 26.1 bits (58), Expect = 7.3
 Identities = 14/39 (35%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 239 HYLE-QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
            YLE  ++ +EK +E EE++E+   + KKE+E  E  + 
Sbjct: 44  DYLEVTDKVKEKKEEREEDKEELIERIKKEEETFEDVDL 82


>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
           protein.  This domain family is found in eukaryotes, and
           is typically between 212 and 238 amino acids in length.
           The family is found in association with pfam01805. There
           are two completely conserved residues (W and H) that may
           be functionally important. PRP21 is required for
           assembly of the prespliceosome and it interacts with U2
           snRNP and/or pre-mRNA in the prespliceosome. This family
           also contains proteins similar to PRP21, such as the
           mammalian SF3a. SF3a also interacts with U2 snRNP from
           the prespliceosome, converting it to its active form.
          Length = 230

 Score = 27.4 bits (61), Expect = 7.3
 Identities = 15/41 (36%), Positives = 24/41 (58%), Gaps = 3/41 (7%)

Query: 220 SLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKK 260
           S+ D Y +   A K      + + +++EE+ KEEEEEEE+ 
Sbjct: 12  SVKDKYYILERAYKR---AEWEKHQKKEEQKKEEEEEEERI 49


>gnl|CDD|233063 TIGR00631, uvrb, excinuclease ABC, B subunit.  All proteins in this
           family for wich functions are known are DNA helicases
           that function in the nucleotide excision repair and are
           endonucleases that make the 3' incision next to DNA
           damage. They are part of a pathway requiring UvrA, UvrB,
           UvrC, and UvrD homologs. This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 655

 Score = 27.6 bits (62), Expect = 7.7
 Identities = 16/51 (31%), Positives = 28/51 (54%), Gaps = 15/51 (29%)

Query: 62  TLLAYGQTGGGKTYTVSAMIMKTLQHVMQRCNKDDVYMSY-----LQLYSE 107
           TLL  G TG GKT+T++        +V+ + N+  + +++      QLY+E
Sbjct: 33  TLL--GVTGSGKTFTMA--------NVIAQVNRPTLVIAHNKTLAAQLYNE 73


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 27.0 bits (60), Expect = 7.7
 Identities = 11/35 (31%), Positives = 22/35 (62%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
            ++E E+EK KEE   +   + +K++ + E++K E
Sbjct: 80  KKEEAEKEKEKEERFMKALAEAEKERAELEKKKAE 114


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 27.6 bits (62), Expect = 8.1
 Identities = 7/29 (24%), Positives = 22/29 (75%)

Query: 246 EEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
           + E+ K ++ +++KK+ KK K+++++ ++
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKRKKKGRK 414


>gnl|CDD|203903 pfam08312, cwf21, cwf21 domain.  The cwf21 family is involved in
           mRNA splicing. It has been isolated as a subcomplex of
           the splicosome in Schizosaccharomyces pombe. The
           function of the cwf21 domain is to bind directly to the
           spliceosomal protein Prp8. Mutations in the cwf21 domain
           prevent Prp8 from binding. The structure of this domain
           has recently been solved which shows this domain to be
           composed of two alpha helices.
          Length = 46

 Score = 24.8 bits (55), Expect = 8.3
 Identities = 16/41 (39%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 232 RKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
           R+I +      +++ EE+G  EEE EEK    +KK  EE E
Sbjct: 7   REIEVKC-LELRDKLEEQGLSEEEIEEKVDELRKKLLEELE 46


>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain.  FH proteins control
           rearrangements of the actin cytoskeleton, especially in
           the context of cytokinesis and cell polarisation.
           Members of this family have been found to interact with
           Rho-GTPases, profilin and other actin-assoziated
           proteins. These interactions are mediated by the
           proline-rich FH1 domain, usually located in front of FH2
           (but not listed in SMART). Despite this cytosolic
           function, vertebrate formins have been assigned
           functions within the nucleus. A set of Formin-Binding
           Proteins (FBPs) has been shown to bind FH1 with their WW
           domain.
          Length = 392

 Score = 27.7 bits (62), Expect = 8.3
 Identities = 13/30 (43%), Positives = 17/30 (56%)

Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
           + +  EE  K +EEEEE  KK  K+  E E
Sbjct: 330 FSKAAEENIKKEEEEEERRKKLVKETTEYE 359


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 27.6 bits (61), Expect = 8.7
 Identities = 14/34 (41%), Positives = 19/34 (55%), Gaps = 6/34 (17%)

Query: 242 EQEEEEEKGKEEEEEEE------KKKGKKKKEKE 269
           E EE+EE+ +EEEEEEE      +K+      K 
Sbjct: 227 ESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKA 260


>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region.  The
           timeless (tim) gene is essential for circadian function
           in Drosophila. Putative homologues of Drosophila tim
           have been identified in both mice and humans (mTim and
           hTIM, respectively). Mammalian TIM is not the true
           orthologue of Drosophila TIM, but is the likely
           orthologue of a fly gene, timeout (also called tim-2).
           mTim has been shown to be essential for embryonic
           development, but does not have substantiated circadian
           function. Some family members contain a SANT domain in
           this region.
          Length = 507

 Score = 27.8 bits (61), Expect = 8.7
 Identities = 12/38 (31%), Positives = 19/38 (50%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           + EEEE+  + + EE EK+    + E       ENL +
Sbjct: 251 DPEEEEQLPESKPEETEKRVSAFQVEGSTLISAENLRQ 288


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 27.2 bits (61), Expect = 8.9
 Identities = 15/36 (41%), Positives = 24/36 (66%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
             E EEE+ +E+EEE+++K G   K K  E KE+++
Sbjct: 28  APEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDI 63


>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
          Length = 470

 Score = 27.5 bits (62), Expect = 9.1
 Identities = 20/82 (24%), Positives = 32/82 (39%), Gaps = 8/82 (9%)

Query: 204 MAKLHLVDLAGSEQLFSLSDN------YLLRNEARKINLSLHYLEQEEEEEKGKEEEE-- 255
            ++    +      LF  + +       LL  E       L Y+  EE  E    +EE  
Sbjct: 343 ASEKRQAEFDAILDLFMEALDIDEEIAQLLVEEGFSSLEELAYVPVEELLEIEGFDEETV 402

Query: 256 EEEKKKGKKKKEKEEEEKEENL 277
           EE +++ K+  E E   +EE L
Sbjct: 403 EELRERAKEALETEALAQEEKL 424


>gnl|CDD|219111 pfam06625, DUF1151, Protein of unknown function (DUF1151).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 122

 Score = 26.3 bits (58), Expect = 9.1
 Identities = 12/37 (32%), Positives = 25/37 (67%)

Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
           LE+ + ++  KE++EEEE K+ + + E+E  ++ + L
Sbjct: 57  LEKRKRDQVLKEQKEEEEAKRLQSELERELMKRAQKL 93


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
           coiled-coil proteins that are involved in pre-rRNA
           processing.
          Length = 105

 Score = 26.2 bits (58), Expect = 9.2
 Identities = 15/54 (27%), Positives = 29/54 (53%), Gaps = 3/54 (5%)

Query: 225 YLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKE-KEEEEKEENL 277
           +  R E R    ++   E+E ++E  KE E +   +  K+++  KEE+E+ E +
Sbjct: 30  WEKRMEKRLEQQAIKAREKELKDE--KEAERQRRIQAIKERRAAKEEKERYEKM 81


>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
           secretion].
          Length = 407

 Score = 27.3 bits (61), Expect = 9.3
 Identities = 8/32 (25%), Positives = 14/32 (43%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
           + E       E + +E K  G++  E   +EK
Sbjct: 96  KIERSTPSLIERKTQEVKDSGEEIAEMMRDEK 127


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 27.4 bits (61), Expect = 9.3
 Identities = 10/38 (26%), Positives = 19/38 (50%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
           +    E   K +EE E K+K ++ +EK  + +E+    
Sbjct: 35  DAAAYENWKKRQEEAEAKRKREELREKIAKAREKRERN 72


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 25.8 bits (57), Expect = 9.5
 Identities = 9/17 (52%), Positives = 13/17 (76%)

Query: 259 KKKGKKKKEKEEEEKEE 275
           KKK KKKK+K + ++E 
Sbjct: 20  KKKKKKKKKKNKSKEEV 36


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 27.1 bits (60), Expect = 9.7
 Identities = 11/35 (31%), Positives = 24/35 (68%)

Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           ++ ++EE   ++ +  E+K+ K+ +E+EE E+EE 
Sbjct: 245 KKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEE 279


>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
          Length = 249

 Score = 26.9 bits (60), Expect = 9.9
 Identities = 14/50 (28%), Positives = 21/50 (42%)

Query: 226 LLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
           L R+E   + + L +    EE E+ +E   E    +       E EE EE
Sbjct: 189 LPRDEEWDVMVDLFFWRDPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEE 238


>gnl|CDD|216186 pfam00910, RNA_helicase, RNA helicase.  This family includes RNA
          helicases thought to be involved in duplex unwinding
          during viral RNA replication. Members of this family
          are found in a variety of single stranded RNA viruses.
          Length = 105

 Score = 26.0 bits (58), Expect = 9.9
 Identities = 11/36 (30%), Positives = 16/36 (44%), Gaps = 1/36 (2%)

Query: 63 LLAYGQTGGGKTYTVSAMIMKTLQHVMQRCNKDDVY 98
          +  YG  G GK+     +    L+H +    KD VY
Sbjct: 1  IWLYGPPGCGKSTLAKYLARALLKH-LGLPKKDSVY 35


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 26.5 bits (59), Expect = 10.0
 Identities = 12/33 (36%), Positives = 22/33 (66%)

Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
           +E+EE  +  E  +E+   ++KK +E++EKEE 
Sbjct: 58  DEDEEADRIYESIDERMDERRKKRREQKEKEEI 90


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.130    0.362 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,822,769
Number of extensions: 1317678
Number of successful extensions: 10795
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8134
Number of HSP's successfully gapped: 1167
Length of query: 279
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 183
Effective length of database: 6,679,618
Effective search space: 1222370094
Effective search space used: 1222370094
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 58 (26.0 bits)