RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1088
         (905 letters)



>gnl|CDD|225661 COG3119, AslA, Arylsulfatase A and related enzymes [Inorganic ion
           transport and metabolism].
          Length = 475

 Score =  207 bits (528), Expect = 3e-58
 Identities = 107/425 (25%), Positives = 168/425 (39%), Gaps = 48/425 (11%)

Query: 56  SSGPPHIIFILADDLGWNDVG-FHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
            +  P+I+ I+ADDLG+ D+G + G    PTPNID LA  G+   N YT    C PSR+A
Sbjct: 1   PAKRPNILIIMADDLGYGDLGAYGGPVVGPTPNIDRLAAEGVRFTNAYTTSPCCGPSRAA 60

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++P  TG+  N       GGLP     L + LKE GY T + GKWHLG   ++   
Sbjct: 61  LLTGRYPFRTGVGGNAEPPGYPGGLPDEVPTLAELLKEAGYYTALFGKWHLGEKDEDPAG 120

Query: 174 TFRGFESHLGYWTGHQD---------YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
              GF+   G+  G  D                E      D     + + DL  ++    
Sbjct: 121 GDHGFDEFYGFLGGLTDEWYPELVDVPPPGDVPEFDQEEGDPYVAGKDSADLADRFRRQA 180

Query: 225 FTAE------AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP---------DHYLNI 269
                      +         +P+            +  + L  P          H    
Sbjct: 181 KEDAPDKPPFLLTAPFAPPAPDPVDFEWPDEYRGFYDRAKRLDRPFFLYLAPPDPHLSRR 240

Query: 270 HRHIEDFK----------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
               +              + +AA +  LD+ +G++++AL++  +L N+I+VF SD+G  
Sbjct: 241 LPAADGLPAEEEEDGARLMTVYAACVRYLDDQIGRLLDALKELGLLDNTIVVFTSDHGAW 300

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP-LLESRGIVAEQYVHVSDWLPTLLS 378
                   A   P RG K TL+EGG R   +I  P  ++  G V +  V + D LPTLL 
Sbjct: 301 L------GAHGTPFRGYKGTLYEGGTRVPLIIRWPGGIKPGGRVVDALVSLIDLLPTLLD 354

Query: 379 AANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYE 438
           AA      +    ++   + +     +       E+ +         Y     E      
Sbjct: 355 AAGVPPPKDLDGQSLPPKLQKPGLDGVPL----LEWLAGGKAAVREEYGYF-GELFGLRA 409

Query: 439 NRYEN 443
            R ++
Sbjct: 410 IRRDD 414



 Score = 30.5 bits (69), Expect = 4.6
 Identities = 12/31 (38%), Positives = 14/31 (45%), Gaps = 1/31 (3%)

Query: 2   QHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
            +    G   GGLP     L + LKE GY T
Sbjct: 74  GNAEPPGYP-GGLPDEVPTLAELLKEAGYYT 103


>gnl|CDD|216172 pfam00884, Sulfatase, Sulfatase. 
          Length = 332

 Score =  187 bits (476), Expect = 8e-53
 Identities = 93/330 (28%), Positives = 145/330 (43%), Gaps = 41/330 (12%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY-TVQLCTPSRSAIMTGK 118
           P+++ +L + L   D+G +G  +  TP +D LA  G++  N+Y    L  PSR A++TG 
Sbjct: 1   PNVVLVLGESLRAPDLGLYGYPRPTTPFLDRLAEEGLLFSNFYSGGTLTAPSRFALLTGL 60

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
            P + G   +        GLP +E  LP  LK  GY T  +GKWHL +Y ++      GF
Sbjct: 61  PPHNFGSGVSTP-----IGLPRTEPSLPDLLKRAGYNTGAIGKWHLSWYNRQSVYKNLGF 115

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
           +   G  TG +D +                +        G  S +    EA++ + N+  
Sbjct: 116 DKFFGRNTG-EDLYKD-------------PEDVGYNCSGGGVSDEALLDEALEFLDNN-- 159

Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK-------RSKFAAILHKLDES 291
           D+P FL L    +H      P   PD Y   +   +           + +   L   D++
Sbjct: 160 DKPFFLVLHTMGSHG-----PPYYPDRYPEKYATFKPTSTCSEEQLLNSYDNSLLYTDDA 214

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +G+V+E LE    L N+++V+ SD+G +  G        +   G  +   EGG R   LI
Sbjct: 215 IGRVLEKLENGL-LDNTLVVYTSDHGPSLGG------GGYLHGGKTDNAPEGGYRVPLLI 267

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           WSP  + +G   E  V   D  PT+L  A 
Sbjct: 268 WSPGGKPKGQKNEALVSHVDLFPTILDLAG 297


>gnl|CDD|237491 PRK13759, PRK13759, arylsulfatase; Provisional.
          Length = 485

 Score =  123 bits (311), Expect = 1e-29
 Identities = 100/399 (25%), Positives = 163/399 (40%), Gaps = 93/399 (23%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
             +  P+II I+ D +  + +G +G   + TPN+D LA  G   +N Y+ V  CTP+R+A
Sbjct: 2   VQTKKPNIILIMVDQMRGDCLGCNGNKAVETPNLDMLASEGYNFENAYSAVPSCTPARAA 61

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH-------LGF 166
           ++TG    H G       G         +  LPQ  ++ GY T+ +GK H       LGF
Sbjct: 62  LLTGLSQWHHGR-----VGYGDVVPWNYKNTLPQEFRDAGYYTQCIGKMHVFPQRNLLGF 116

Query: 167 YK-----------KEYTPTFRGFESHLGYWTGHQ------DYFDHSAEEMKMWGLDMRRD 209
           +            +    +   F S    W   +      D  D         G D    
Sbjct: 117 HNVLLHDGYLHSGRNEDKSQFDFVSDYLAWLREKAPGKDPDLTDI--------GWDCNSW 168

Query: 210 LEPAWDLHGKYSTDVFTA-EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
           +   WDL  +     +   E+++ +      +P FL ++ A  HS  PY+P   P  Y +
Sbjct: 169 VARPWDLEERLHPTNWVGSESIEFLRRRDPTKPFFLKMSFARPHS--PYDP---PKRYFD 223

Query: 269 IHRHI--------------------------------EDFKRSK---FAAILHKLDESVG 293
           +++                                  E  +R++   +  I H +D  +G
Sbjct: 224 MYKDADIPDPHIGDWEYAEDQDPEGGSIDALRGNLGEEYARRARAAYYGLITH-IDHQIG 282

Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
           + ++AL++  +L N+II+FVSD+G       L        R  K   +EG      +I+ 
Sbjct: 283 RFLQALKEFGLLDNTIILFVSDHGDMLGDHYL-------FR--KGYPYEGSAHIPFIIYD 333

Query: 354 P---LLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYV 389
           P   L  +RG V +Q V + D +PTLL  A  + IP+ V
Sbjct: 334 PGGLLAGNRGTVIDQVVELRDIMPTLLDLAGGT-IPDDV 371


>gnl|CDD|234202 TIGR03417, chol_sulfatase, choline-sulfatase. 
          Length = 500

 Score = 87.1 bits (216), Expect = 1e-17
 Identities = 92/369 (24%), Positives = 143/369 (38%), Gaps = 75/369 (20%)

Query: 60  PHIIFILADDLGWNDVGFHG-LDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
           P+I+ I+AD L    +  +G    +  PN+  LA   ++  N Y    LC PSR++ M+G
Sbjct: 3   PNILIIMADQLNGTLLPDYGPARWLHAPNLKRLAARSVVFDNAYCASPLCAPSRASFMSG 62

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL-------GF---- 166
           + P  TG   N                   YL+  GYRT + GK H        GF    
Sbjct: 63  QLPSRTGAYDNA------AEFASDIPTYAHYLRRAGYRTALSGKMHFCGPDQLHGFEERL 116

Query: 167 ----YKKEY--TPTFRGFESHLGYW--------------TGHQDYFDHSA--EEMKMWGL 204
               Y  ++  TP +R     + ++              T   DY D  A     K++ L
Sbjct: 117 TTDIYPADFGWTPDWRKPGERIDWYHNMGSVTGAGPCERTNQLDYDDEVAFHARQKLYDL 176

Query: 205 DMRRDLEPAWDL-------HGKYSTDVFTAEAVDIIHNHSTDEP-LFLYLAHAATHS--- 253
              +D  P + L       H  Y   V   +  D+  +     P + +  A    HS   
Sbjct: 177 ARGKDDRP-FCLTVSFTHPHDPY---VIRRKYWDLYEDCEILMPEVAIPYAEQDPHSQRL 232

Query: 254 --ANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIV 311
             A         D        I   +R+ F AI + +D+ +G++++ LE+ R   ++IIV
Sbjct: 233 LDACDLRNFPITD------EQIRRARRAYFGAISY-IDDKIGELLDTLEETRQADDTIIV 285

Query: 312 FVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSD 371
           F SD+G       L     W     K + +EG  R   +I +P   + G V      + D
Sbjct: 286 FTSDHGDM-----LGERGLW----YKMSFFEGSARVPLMIAAPGRFAPGRVDAPVSTL-D 335

Query: 372 WLPTLLSAA 380
            LPTL+  A
Sbjct: 336 LLPTLVDLA 344



 Score = 36.2 bits (84), Expect = 0.088
 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 3/41 (7%)

Query: 658 SIQCGPVKEVPCEPQIAPC-LFDIKNDPCEKNNLADRSEDQ 697
            I+ G  K V CE    P  L+D++ DP E  NLAD     
Sbjct: 388 MIRRGRYKFVYCE--ADPDQLYDLEADPHELTNLADDPAHA 426



 Score = 34.7 bits (80), Expect = 0.23
 Identities = 16/39 (41%), Positives = 20/39 (51%), Gaps = 3/39 (7%)

Query: 764 SIQCGPVKEVPCEPQIAPC-LFDIKNDPCEKNNLADRSE 801
            I+ G  K V CE    P  L+D++ DP E  NLAD   
Sbjct: 388 MIRRGRYKFVYCE--ADPDQLYDLEADPHELTNLADDPA 424


>gnl|CDD|224287 COG1368, MdoB, Phosphoglycerol transferase and related proteins,
           alkaline phosphatase superfamily [Cell envelope
           biogenesis, outer membrane].
          Length = 650

 Score = 70.5 bits (173), Expect = 2e-12
 Identities = 55/301 (18%), Positives = 106/301 (35%), Gaps = 43/301 (14%)

Query: 145 LPQYLKELGYRTRIVGKWHLGFYK-KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWG 203
           LP  LK+ GY+T  +      F+  K +   F GF+          + FD +A+    WG
Sbjct: 345 LPAILKQQGYKTAALHGGDGSFWNRKSFYKIF-GFDDFFD-----LESFDGNADSEIGWG 398

Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
           L                S      E++ ++      +P F ++   + H   P+E  +  
Sbjct: 399 L----------------SDKDLFKESLPLLKKL--KKPFFSFVITLSNHG--PFELPEGK 438

Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
            + L           + +   +H  DE++G+ ++ L++  +  NS+IV   D+ G +   
Sbjct: 439 RNELLEEPLSASTALANYLQAVHYADEALGQFIDKLKKSGLYKNSVIVLYGDHYGISGNQ 498

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           NL         G K+   +   R   LI +P ++++  +      + D  PT+L     S
Sbjct: 499 NLAMPK---FLG-KSYDIDMLQRVPLLIHAPGIKNKKKIDTVGGQL-DIAPTILGLLGIS 553

Query: 384 ---------DIPNYVNSTVENIIPRYENSILRYENGTHEY--NSPRIENSNTRYENGTHE 432
                    D+       V      +      Y  G +         E  + + +    +
Sbjct: 554 TKSYAFFGRDLLGDEPYKVPFRNGSFGTDAKGYSVGDNRMYDTRTGNEILDLQLDKVKPK 613

Query: 433 Y 433
            
Sbjct: 614 K 614


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 56.6 bits (136), Expect = 4e-08
 Identities = 43/154 (27%), Positives = 75/154 (48%), Gaps = 15/154 (9%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
           K EEE + +++ +K K++KKK++K+K+ +++ +S   E    +  A  +     +E+P  
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDI-VTEEMP-- 139

Query: 777 PQIAPCLFDIK--NDPCEKNN------LADRSEVQRINHYTTEVGYLDPKQRFNQIAYLD 828
               P   D K  NDP    +      LAD  ++    H   E      K          
Sbjct: 140 ENALPSDEDDKDPNDPYRALDIDLDKPLADSEKLPVQKHRNAET----SKSPEKGDVPAV 195

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVL 862
           ++K KK KKK+KK+K+K++ K KK   +G+  +L
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLL 229



 Score = 54.7 bits (131), Expect = 2e-07
 Identities = 35/138 (25%), Positives = 58/138 (42%), Gaps = 22/138 (15%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
           P+K +    +KK KK KKK+KK+K+K++ K KKK     + +    D +      V E  
Sbjct: 187 PEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEAD 246

Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
                        +   ++   A+  E ++                        K KKKK
Sbjct: 247 EASLANTVSGTAPDSEPDEPKDAEAEETKKSP----------------------KHKKKK 284

Query: 835 KKKKKKKKKKKKKKKKKK 852
           ++K+K++KKKKKK    +
Sbjct: 285 QRKEKEEKKKKKKHHHHR 302



 Score = 50.1 bits (119), Expect = 4e-06
 Identities = 38/142 (26%), Positives = 63/142 (44%), Gaps = 23/142 (16%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
               E+ +    +KK KK KKK+KK+K+K++ K    E EG + L               
Sbjct: 184 SKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSL--------------- 228

Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
                   L  + + P    ++A+  E    N  +      +P +  +  A   K+  K 
Sbjct: 229 --------LLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKH 280

Query: 835 KKKKKKKKKKKKKKKKKKMMKK 856
           KKKK++K+K++KKKKKK    +
Sbjct: 281 KKKKQRKEKEEKKKKKKHHHHR 302



 Score = 38.9 bits (90), Expect = 0.013
 Identities = 30/102 (29%), Positives = 44/102 (43%), Gaps = 3/102 (2%)

Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKK 736
           L  + + P    ++A+  E    N  +            D E EE KK  K KKKK++K+
Sbjct: 229 LLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKE 288

Query: 737 KKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQ 778
           K++KKKKKK   +               +Q G V+E P  P 
Sbjct: 289 KEEKKKKKKHHHHRCHH---SDGGAEQPVQNGAVEEEPLPPM 327



 Score = 33.1 bits (75), Expect = 0.72
 Identities = 10/35 (28%), Positives = 27/35 (77%)

Query: 822 NQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           +Q   L++E++ +++ +K K++KKK++K+K+  ++
Sbjct: 79  DQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRR 113



 Score = 31.2 bits (70), Expect = 3.1
 Identities = 9/42 (21%), Positives = 32/42 (76%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELA 870
           +E+++ +++ +K K++KKK++K+K  ++ +  + ++ ++++A
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIA 126


>gnl|CDD|216635 pfam01663, Phosphodiest, Type I phosphodiesterase / nucleotide
           pyrophosphatase.  This family consists of
           phosphodiesterases, including human plasma-cell membrane
           glycoprotein PC-1 / alkaline phosphodiesterase i /
           nucleotide pyrophosphatase (nppase). These enzymes
           catalyze the cleavage of phosphodiester and
           phosphosulfate bonds in NAD, deoxynucleotides and
           nucleotide sugars. Also in this family is ATX an
           autotaxin, tumour cell motility-stimulating protein
           which exhibits type I phosphodiesterases activity. The
           alignment encompasses the active site. Also present with
           in this family is 60-kDa Ca2+-ATPase form F. odoratum.
          Length = 342

 Score = 51.6 bits (124), Expect = 9e-07
 Identities = 52/267 (19%), Positives = 95/267 (35%), Gaps = 51/267 (19%)

Query: 63  IFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV--QLCTPSRSAIMTGKHP 120
           + I  D    + +         TPN+ ALA  G+       V   L  P+   I+TG +P
Sbjct: 2   LVISLDGFRADYLDRLAGL---TPNLAALAKEGVSAPYLTPVFPTLTFPNHYTIVTGLYP 58

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE---YTPTFRG 177
              G+  N  Y              P+ LKE  +  ++      G    +    T   +G
Sbjct: 59  GSHGIVGNTWYD-------------PKRLKESTFWDQLP---ESGDGDPKPIWVTAKKQG 102

Query: 178 FESH-LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
            ++  L +   H     +     ++       +     D+  +   D    + +      
Sbjct: 103 LKAAALFWPGSHAAKPGYGPLVSELPDRLDAYNGSVPLDVITEPRVDTVLTDWLK--LLL 160

Query: 237 STDEP--LFLYLAH--AATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
             + P  L +YL       H   P  P             +ED         L ++D ++
Sbjct: 161 DAERPDLLLVYLEEPDRVGHKYGPDSP------------EVEDA--------LRRVDRAI 200

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGA 319
           G+++EAL++R +L N+ ++ VSD+G  
Sbjct: 201 GRLLEALDERGLLENTNVIVVSDHGMT 227


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 50.3 bits (121), Expect = 3e-06
 Identities = 14/38 (36%), Positives = 22/38 (57%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
             +  EKK+K+K+K K KK+ +  K   K++K S   E
Sbjct: 396 LAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSE 433



 Score = 49.9 bits (120), Expect = 4e-06
 Identities = 13/35 (37%), Positives = 21/35 (60%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
            E+K+K+K+K K KK+ +  K   K++K     EE
Sbjct: 400 AEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 49.2 bits (118), Expect = 7e-06
 Identities = 13/37 (35%), Positives = 24/37 (64%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
           K  E+++K+K+K K KK+ +  K   K++K   ++EE
Sbjct: 398 KRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 47.2 bits (113), Expect = 3e-05
 Identities = 12/34 (35%), Positives = 21/34 (61%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           P K+   ++ +KK+K+K+K K KK+ +  K   K
Sbjct: 391 PSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGK 424



 Score = 45.7 bits (109), Expect = 8e-05
 Identities = 11/33 (33%), Positives = 20/33 (60%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           K  ++   K+ +KK+K+K+K K KK+ +  K  
Sbjct: 390 KPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNI 422



 Score = 45.7 bits (109), Expect = 8e-05
 Identities = 11/32 (34%), Positives = 19/32 (59%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           K   +  +KK+K+K+K K KK+ +  K   K+
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425



 Score = 45.7 bits (109), Expect = 9e-05
 Identities = 13/36 (36%), Positives = 23/36 (63%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           A  +K+  +  KK   K+ +KK+K+K+K K KK+ +
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHR 417



 Score = 45.3 bits (108), Expect = 1e-04
 Identities = 13/33 (39%), Positives = 21/33 (63%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYP 859
           L K  +KK+K+K+K K KK+ +  K + K+  P
Sbjct: 396 LAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKP 428



 Score = 44.9 bits (107), Expect = 1e-04
 Identities = 15/52 (28%), Positives = 28/52 (53%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
           P +++  +  KK   K+ +KK+K+K+K K KK+    +  G R+     S +
Sbjct: 383 PSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 44.9 bits (107), Expect = 1e-04
 Identities = 14/34 (41%), Positives = 20/34 (58%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           P  +   EKK  K  KK   K+ +KK+K+K+K K
Sbjct: 378 PKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPK 411



 Score = 44.5 bits (106), Expect = 2e-04
 Identities = 13/32 (40%), Positives = 19/32 (59%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           K   E+K  K  KK   K+ +KK+K+K+K K 
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKV 412



 Score = 44.5 bits (106), Expect = 2e-04
 Identities = 11/37 (29%), Positives = 21/37 (56%)

Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
            +A   +++E+EK+K K KK+ +  K   K++K    
Sbjct: 395 VLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 42.6 bits (101), Expect = 7e-04
 Identities = 11/34 (32%), Positives = 19/34 (55%)

Query: 823 QIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            +A   ++K+K+K+K K KK+ +  K   K  K 
Sbjct: 395 VLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKP 428



 Score = 42.2 bits (100), Expect = 0.001
 Identities = 17/61 (27%), Positives = 27/61 (44%), Gaps = 6/61 (9%)

Query: 802 VQRINHYTTE------VGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMK 855
           + +I  Y  E      +  L PK +        K  KK   K+ +KK+K+K+K K K   
Sbjct: 357 LGKIERYIEEPLKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416

Query: 856 K 856
           +
Sbjct: 417 R 417



 Score = 41.9 bits (99), Expect = 0.001
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           K   K+ +KK+K+K+K K KK+ +  K 
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKKRHRDTKN 421



 Score = 40.7 bits (96), Expect = 0.003
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
           K++K+K+K K KK+ +  K   K+    G
Sbjct: 402 KKEKEKEKPKVKKRHRDTKNIGKRRKPSG 430



 Score = 34.9 bits (81), Expect = 0.17
 Identities = 14/42 (33%), Positives = 20/42 (47%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           +E   K K   +KK  K  KK   K+ +KK   +E+   K R
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKR 415



 Score = 34.5 bits (80), Expect = 0.22
 Identities = 11/31 (35%), Positives = 16/31 (51%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           +E   K K   +KK  K  KK   K+ +KK+
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKE 404



 Score = 33.4 bits (77), Expect = 0.52
 Identities = 6/24 (25%), Positives = 12/24 (50%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
           ++ K KK+ +  K   K++K    
Sbjct: 408 EKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 32.2 bits (74), Expect = 1.2
 Identities = 7/24 (29%), Positives = 12/24 (50%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKK 851
           +K K KK+ +  K   K++K    
Sbjct: 408 EKPKVKKRHRDTKNIGKRRKPSGT 431


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 50.4 bits (121), Expect = 3e-06
 Identities = 21/33 (63%), Positives = 28/33 (84%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
           P K++ EEKK +K+KKKKK+KKK KK+KKK +K
Sbjct: 382 PPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 46.1 bits (110), Expect = 7e-05
 Identities = 18/29 (62%), Positives = 25/29 (86%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           + E+KK +K+KKKKK+KKK KK+KKK +K
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 45.7 bits (109), Expect = 7e-05
 Identities = 18/35 (51%), Positives = 28/35 (80%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           YP   +++ ++KK +K+KKKKK+KKK KK+KKK +
Sbjct: 379 YPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 43.8 bits (104), Expect = 3e-04
 Identities = 20/46 (43%), Positives = 32/46 (69%), Gaps = 4/46 (8%)

Query: 716 DKEEEEEKKK----KKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMR 757
           +K  EE K+K     KKK+++KK +K+KKKKK+KKK    +++G +
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 43.4 bits (103), Expect = 4e-04
 Identities = 18/41 (43%), Positives = 28/41 (68%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           EE +EK  K  KKK+++KK +K+KKKKK+KK   + +   +
Sbjct: 373 EEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 41.9 bits (99), Expect = 0.001
 Identities = 20/42 (47%), Positives = 29/42 (69%), Gaps = 4/42 (9%)

Query: 721 EEKKKK----KKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           EE K+K     KKK+++KK +K+KKKKK+KKK    ++  RK
Sbjct: 373 EEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 41.5 bits (98), Expect = 0.002
 Identities = 18/29 (62%), Positives = 25/29 (86%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
           K+K+++KK +K+KKKKK+KKK KK  KKG
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKG 412



 Score = 41.1 bits (97), Expect = 0.002
 Identities = 16/24 (66%), Positives = 21/24 (87%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
           K +K+KKKKK+KKK KK+KKK +K
Sbjct: 391 KPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 40.7 bits (96), Expect = 0.003
 Identities = 17/28 (60%), Positives = 24/28 (85%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           +E+KK +K+KKKKK+KKK KK+KK  +K
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 40.3 bits (95), Expect = 0.003
 Identities = 15/28 (53%), Positives = 22/28 (78%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           K ++KK +K+KKKKK+KKK KK+K   +
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 39.2 bits (92), Expect = 0.008
 Identities = 15/29 (51%), Positives = 23/29 (79%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
           K++++KK +K+KKKKK+KKK KK+  K  
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 38.4 bits (90), Expect = 0.015
 Identities = 17/40 (42%), Positives = 26/40 (65%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
            +R  +I     +  KKK+++KK +K+KKKKK+KK  KK 
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKR 408



 Score = 36.9 bits (86), Expect = 0.046
 Identities = 14/23 (60%), Positives = 20/23 (86%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKK 850
            +++KKKKK+KKK KK+KKK +K
Sbjct: 392 PQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 34.9 bits (81), Expect = 0.16
 Identities = 20/59 (33%), Positives = 36/59 (61%), Gaps = 8/59 (13%)

Query: 707 GRFNQIAYPDKEEEEEKKK----KKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
           GR+  I   D+ +EE  K+    K+K  K  KKK+++KK +K+KKK   +++G ++ + 
Sbjct: 357 GRY--IG--DELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKK 411


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 47.9 bits (114), Expect = 4e-06
 Identities = 21/36 (58%), Positives = 29/36 (80%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
           E+K KKKK +  K++KKKKK+KKKKKK++S E  G+
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHPGV 176



 Score = 37.9 bits (88), Expect = 0.008
 Identities = 17/27 (62%), Positives = 21/27 (77%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           EKK KKKK +  K++KKKKK+KK  KK
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKKKK 167



 Score = 37.9 bits (88), Expect = 0.009
 Identities = 15/25 (60%), Positives = 21/25 (84%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
           +K+ KKKK +  K++KKKKK+KKKK
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKK 165



 Score = 37.5 bits (87), Expect = 0.012
 Identities = 16/24 (66%), Positives = 20/24 (83%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
           K KKKK +  K++KKKKK+KKKKK
Sbjct: 143 KHKKKKHEDDKERKKKKKEKKKKK 166



 Score = 36.7 bits (85), Expect = 0.019
 Identities = 16/30 (53%), Positives = 22/30 (73%)

Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKGYPD 860
           K KKKK +  K++KKKKK+KKK  K+  P+
Sbjct: 143 KHKKKKHEDDKERKKKKKEKKKKKKRHSPE 172



 Score = 36.7 bits (85), Expect = 0.023
 Identities = 17/40 (42%), Positives = 25/40 (62%)

Query: 817 PKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           P++  +    L   +KK KKKK +  K++KKKKK+K  KK
Sbjct: 127 PEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKK 166



 Score = 32.9 bits (75), Expect = 0.44
 Identities = 14/23 (60%), Positives = 21/23 (91%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKK 739
           K+ E++K++KKKKK+KKKKKK+ 
Sbjct: 147 KKHEDDKERKKKKKEKKKKKKRH 169



 Score = 32.9 bits (75), Expect = 0.45
 Identities = 14/25 (56%), Positives = 21/25 (84%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKK 741
           K+++ E  K++KKKKK+KKKKKK+ 
Sbjct: 145 KKKKHEDDKERKKKKKEKKKKKKRH 169



 Score = 32.1 bits (73), Expect = 0.79
 Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 3/43 (6%)

Query: 817 PKQRFNQIAYLDKEKKKKKKKKKKKKKK---KKKKKKKKMMKK 856
            +    +    D E  K  +KK KKKK    K++KKKKK  KK
Sbjct: 122 TQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKK 164



 Score = 31.7 bits (72), Expect = 0.94
 Identities = 14/18 (77%), Positives = 17/18 (94%)

Query: 828 DKEKKKKKKKKKKKKKKK 845
           DKE+KKKKK+KKKKKK+ 
Sbjct: 152 DKERKKKKKEKKKKKKRH 169



 Score = 31.0 bits (70), Expect = 1.6
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMM 854
           + +K++KKKKK+KKKKKK+   +   +
Sbjct: 150 EDDKERKKKKKEKKKKKKRHSPEHPGV 176



 Score = 30.6 bits (69), Expect = 2.2
 Identities = 15/54 (27%), Positives = 25/54 (46%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
            +    D   EE     +  K  +KK KKKK +  K++K   +E+  +K R + 
Sbjct: 118 KKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSP 171



 Score = 30.2 bits (68), Expect = 3.5
 Identities = 12/27 (44%), Positives = 19/27 (70%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMM 854
           D +++KKKKK+KKKKKK+   +   + 
Sbjct: 151 DDKERKKKKKEKKKKKKRHSPEHPGVG 177


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 47.0 bits (112), Expect = 4e-06
 Identities = 22/48 (45%), Positives = 33/48 (68%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           K ++EEK  KK+ K++KKK+KKKKKKK KK     E+EG +   +++ 
Sbjct: 76  KRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSD 123



 Score = 44.7 bits (106), Expect = 2e-05
 Identities = 20/40 (50%), Positives = 30/40 (75%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
           +K+ ++E+K  KK+ K++KKK+KKKKKKK KK    EE+ 
Sbjct: 74  EKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKE 113



 Score = 39.6 bits (93), Expect = 0.001
 Identities = 18/43 (41%), Positives = 29/43 (67%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
            +  K EE+++K ++K  KK+ K++KKK+KKKKKKK     + 
Sbjct: 67  EFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKK 109



 Score = 38.1 bits (89), Expect = 0.005
 Identities = 17/35 (48%), Positives = 24/35 (68%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
           K+   +    +K  KK+ K++KKK+KKKKKKK KK
Sbjct: 71  KREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKK 105



 Score = 37.7 bits (88), Expect = 0.006
 Identities = 17/39 (43%), Positives = 28/39 (71%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
           +EE++ K ++K  KK+ K++KKK+KKKKKKK     ++ 
Sbjct: 72  REEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKE 110



 Score = 35.8 bits (83), Expect = 0.027
 Identities = 17/39 (43%), Positives = 24/39 (61%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
           K  + +KKK+KKKKKKK KK  KK++K+  K      + 
Sbjct: 86  KRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDE 124



 Score = 34.3 bits (79), Expect = 0.11
 Identities = 16/27 (59%), Positives = 21/27 (77%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           E+K  KK+ K++KKK+KKKKKKK  K 
Sbjct: 80  EEKTAKKRAKRQKKKQKKKKKKKAKKG 106



 Score = 33.5 bits (77), Expect = 0.20
 Identities = 15/23 (65%), Positives = 19/23 (82%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKK 851
           + K++KKK+KKKKKKK KK  KK
Sbjct: 87  RAKRQKKKQKKKKKKKAKKGNKK 109



 Score = 32.7 bits (75), Expect = 0.35
 Identities = 14/30 (46%), Positives = 23/30 (76%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
           +K++K ++K  KK+ K++KKK+KKK  KK 
Sbjct: 74  EKKRKDEEKTAKKRAKRQKKKQKKKKKKKA 103



 Score = 32.7 bits (75), Expect = 0.37
 Identities = 19/48 (39%), Positives = 34/48 (70%), Gaps = 5/48 (10%)

Query: 716 DKEEEEEKKKKKKKKKK-----KKKKKKKKKKKKKKKKYSNEEEGMRK 758
           + E+EE ++K+++KK+K      KK+ K++KKK+KKKK    ++G +K
Sbjct: 62  ETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKK 109



 Score = 32.3 bits (74), Expect = 0.41
 Identities = 18/40 (45%), Positives = 25/40 (62%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
           K +KKK+KKKKKKK KK  KK++K   K   +   + E+ 
Sbjct: 89  KRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEG 128



 Score = 31.6 bits (72), Expect = 0.82
 Identities = 20/51 (39%), Positives = 33/51 (64%), Gaps = 13/51 (25%)

Query: 717 KEEEEEKKKKKKKKKKKKK-------------KKKKKKKKKKKKKYSNEEE 754
           K+E E+++ ++K+++KK+K             KKK+KKKKKKK K  N++E
Sbjct: 60  KKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKE 110



 Score = 30.8 bits (70), Expect = 1.3
 Identities = 17/35 (48%), Positives = 27/35 (77%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           +++K+KKKKKKK KK  KK++K+  K  + S++EE
Sbjct: 91  QKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEE 125



 Score = 28.9 bits (65), Expect = 6.3
 Identities = 17/38 (44%), Positives = 27/38 (71%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
           +++++KKKKKKK KK  KK++K+  K  ++    EEEG
Sbjct: 91  QKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEG 128



 Score = 28.9 bits (65), Expect = 7.1
 Identities = 18/77 (23%), Positives = 37/77 (48%), Gaps = 9/77 (11%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAV 877
           K+  ++  +  K ++KK+K ++K  KK+ K++KKK  KK         +K+    N+   
Sbjct: 61  KETEDEE-FQQKREEKKRKDEEKTAKKRAKRQKKKQKKK--------KKKKAKKGNKKEE 111

Query: 878 APINKPFDKGGDPKNFD 894
              +K  ++  D +   
Sbjct: 112 KEGSKSSEESSDEEEEG 128


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 47.0 bits (112), Expect = 8e-06
 Identities = 20/46 (43%), Positives = 30/46 (65%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
             E+E E ++++KK+KKKKK+ KK+KK+KK KK    E    K + 
Sbjct: 142 KVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKK 187



 Score = 44.7 bits (106), Expect = 5e-05
 Identities = 20/32 (62%), Positives = 25/32 (78%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           KE ++EKK+KK KK+K  + K  KKKKKKKKK
Sbjct: 161 KEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 44.7 bits (106), Expect = 6e-05
 Identities = 19/31 (61%), Positives = 24/31 (77%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
            K+E++EKK KK+K  + K  KKKKKKKKKK
Sbjct: 163 VKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 44.3 bits (105), Expect = 7e-05
 Identities = 20/39 (51%), Positives = 27/39 (69%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
            + E EEE+KK+KKKKK+ KK+KK+KK KK+K       
Sbjct: 145 KEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGS 183



 Score = 43.6 bits (103), Expect = 1e-04
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
             KE++E+K KK+K  + K  KKKKKKKKKK
Sbjct: 163 VKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 43.2 bits (102), Expect = 2e-04
 Identities = 22/36 (61%), Positives = 28/36 (77%), Gaps = 4/36 (11%)

Query: 717 KEEEEEKKKKKKKKKKKKK----KKKKKKKKKKKKK 748
           K+++E KK+KK+KK KK+K    K  KKKKKKKKKK
Sbjct: 158 KKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 43.2 bits (102), Expect = 2e-04
 Identities = 18/33 (54%), Positives = 25/33 (75%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            K+E +++KK+KK KK+K  + K  KKKKKKKK
Sbjct: 159 KKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191



 Score = 42.4 bits (100), Expect = 3e-04
 Identities = 20/48 (41%), Positives = 34/48 (70%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           ++E E ++++KK+KKKKK+ KK+KK+KK KK+   E +G +K +    
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191



 Score = 40.9 bits (96), Expect = 0.001
 Identities = 20/44 (45%), Positives = 33/44 (75%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
             +EEE+++KKKKK+ KK+KK+KK KK+K  + K S +++  +K
Sbjct: 148 EVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191



 Score = 38.9 bits (91), Expect = 0.005
 Identities = 17/28 (60%), Positives = 21/28 (75%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           KEKKKKK+ KK+KK+KK KK+K    K 
Sbjct: 155 KEKKKKKEVKKEKKEKKDKKEKMVEPKG 182



 Score = 38.2 bits (89), Expect = 0.008
 Identities = 17/29 (58%), Positives = 25/29 (86%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           ++EKK+KKKKK+ KK+KK+KK KK+ M +
Sbjct: 151 EEEKKEKKKKKEVKKEKKEKKDKKEKMVE 179



 Score = 37.8 bits (88), Expect = 0.012
 Identities = 15/25 (60%), Positives = 19/25 (76%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
           +K+ KK+K  + K  KKKKKKKKKK
Sbjct: 169 EKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 37.4 bits (87), Expect = 0.016
 Identities = 16/25 (64%), Positives = 23/25 (92%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
           ++E+KK+KKKKK+ KK+KK+KK KK
Sbjct: 150 EEEEKKEKKKKKEVKKEKKEKKDKK 174



 Score = 37.0 bits (86), Expect = 0.018
 Identities = 14/28 (50%), Positives = 20/28 (71%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKK 740
              +K++++EK  + K  KKKKKKKKKK
Sbjct: 166 EKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 37.0 bits (86), Expect = 0.019
 Identities = 16/29 (55%), Positives = 26/29 (89%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           +++K+KKKKK+ KK+KK+KK KK+KM++ 
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKEKMVEP 180



 Score = 36.2 bits (84), Expect = 0.033
 Identities = 14/29 (48%), Positives = 21/29 (72%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           +K+KKK+ KK+KK+KK KK+K  +    K
Sbjct: 156 EKKKKKEVKKEKKEKKDKKEKMVEPKGSK 184



 Score = 35.8 bits (83), Expect = 0.049
 Identities = 12/34 (35%), Positives = 22/34 (64%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
              E E  +K+   K +K+ + ++++KK+KKKKK
Sbjct: 128 LGSESETSEKETTAKVEKEAEVEEEEKKEKKKKK 161



 Score = 34.7 bits (80), Expect = 0.12
 Identities = 12/33 (36%), Positives = 23/33 (69%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            + E  EK+   K +K+ + ++++KK+KKKKK+
Sbjct: 130 SESETSEKETTAKVEKEAEVEEEEKKEKKKKKE 162



 Score = 33.9 bits (78), Expect = 0.19
 Identities = 14/26 (53%), Positives = 22/26 (84%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKK 852
            + E+++KK+KKKKK+ KK+KK+KK 
Sbjct: 147 AEVEEEEKKEKKKKKEVKKEKKEKKD 172



 Score = 33.9 bits (78), Expect = 0.23
 Identities = 18/31 (58%), Positives = 22/31 (70%), Gaps = 7/31 (22%)

Query: 829 KEKKKKKKKKKKKKK-------KKKKKKKKK 852
           + KK+KK+KK KK+K       KKKKKKKKK
Sbjct: 162 EVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 33.5 bits (77), Expect = 0.28
 Identities = 15/29 (51%), Positives = 24/29 (82%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           + E ++++KK+KKKKK+ KK+KK+K  KK
Sbjct: 146 EAEVEEEEKKEKKKKKEVKKEKKEKKDKK 174



 Score = 32.8 bits (75), Expect = 0.50
 Identities = 11/33 (33%), Positives = 22/33 (66%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPD 860
            +   K +K+ + ++++KK+KKKKK +KK   +
Sbjct: 137 KETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKE 169



 Score = 30.8 bits (70), Expect = 1.9
 Identities = 12/30 (40%), Positives = 22/30 (73%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
              EK+ + ++++KK+KKKKK+ KK+  +K
Sbjct: 141 AKVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170



 Score = 30.8 bits (70), Expect = 1.9
 Identities = 14/28 (50%), Positives = 22/28 (78%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           KE + ++++KK+KKKKK+ KK+KK  K 
Sbjct: 145 KEAEVEEEEKKEKKKKKEVKKEKKEKKD 172



 Score = 30.5 bits (69), Expect = 2.9
 Identities = 12/28 (42%), Positives = 23/28 (82%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           +++ + ++++KK+KKKKK+ KK+K  KK
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEKK 171



 Score = 29.7 bits (67), Expect = 4.7
 Identities = 12/29 (41%), Positives = 22/29 (75%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
           K +K+ + ++++KK+KKKKK+ KK  K+ 
Sbjct: 142 KVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170



 Score = 29.3 bits (66), Expect = 7.5
 Identities = 9/34 (26%), Positives = 21/34 (61%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           P +   E +  +K+   K +K+ + ++++KK+KK
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKK 158


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 45.1 bits (107), Expect = 2e-05
 Identities = 21/30 (70%), Positives = 25/30 (83%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           +E+ K+KK+KKKKKKKKKKK  KK  KKKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 44.3 bits (105), Expect = 3e-05
 Identities = 21/30 (70%), Positives = 25/30 (83%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
           KE+ ++KK+KKKKKKKKKKK  KK  KKKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 43.9 bits (104), Expect = 4e-05
 Identities = 21/32 (65%), Positives = 25/32 (78%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           K+  +EK K+KK+KKKKKKKKKKK  KK  KK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKK 133



 Score = 43.5 bits (103), Expect = 5e-05
 Identities = 20/31 (64%), Positives = 25/31 (80%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
            +E+ ++KK+KKKKKKKKKKK  KK  KKKK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 41.6 bits (98), Expect = 3e-04
 Identities = 21/28 (75%), Positives = 23/28 (82%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           KEK K+KK+KKKKKKKKKKK  KK  KK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKK 133



 Score = 41.2 bits (97), Expect = 4e-04
 Identities = 19/29 (65%), Positives = 22/29 (75%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           +K K+KK+KKKKKKKKKKK  KK    KK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 40.4 bits (95), Expect = 7e-04
 Identities = 20/31 (64%), Positives = 25/31 (80%), Gaps = 2/31 (6%)

Query: 720 EEEK--KKKKKKKKKKKKKKKKKKKKKKKKK 748
           EE+K  K+  K+K K+KK+KKKKKKKKKKK 
Sbjct: 96  EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKT 126



 Score = 40.1 bits (94), Expect = 8e-04
 Identities = 18/29 (62%), Positives = 24/29 (82%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
           +K +++++KKKKKKKKKKK  KK  KKKK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 39.7 bits (93), Expect = 0.001
 Identities = 19/28 (67%), Positives = 23/28 (82%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           K+  K+K K+KK+KKKKKKKKKKK  KK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKK 129



 Score = 38.5 bits (90), Expect = 0.003
 Identities = 21/34 (61%), Positives = 23/34 (67%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKK 851
           KQ   + A   K+KKKKKKKKKKK  KK  KKKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 38.1 bits (89), Expect = 0.004
 Identities = 19/30 (63%), Positives = 23/30 (76%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           L ++ K+KK+KKKKKKKKKKK  KK   KK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 37.4 bits (87), Expect = 0.007
 Identities = 17/29 (58%), Positives = 21/29 (72%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
            ++K K+KK+KKKKKKKKKKK  K   K 
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKK 133



 Score = 35.8 bits (83), Expect = 0.029
 Identities = 19/35 (54%), Positives = 26/35 (74%), Gaps = 2/35 (5%)

Query: 825 AYLDKEKK--KKKKKKKKKKKKKKKKKKKKMMKKG 857
           A+L +E+K  K+  K+K K+KK+KKKKKKK  KK 
Sbjct: 92  AFLVEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKT 126



 Score = 34.7 bits (80), Expect = 0.056
 Identities = 16/32 (50%), Positives = 22/32 (68%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
           E++K  K+  K+K K+KK+KKKKKKKK     
Sbjct: 96  EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTS 127


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 46.2 bits (110), Expect = 3e-05
 Identities = 36/140 (25%), Positives = 63/140 (45%), Gaps = 28/140 (20%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
            + E+E +++++ KKKK+ K K  K+  KKKKKK         K   AA+ +     E  
Sbjct: 72  EEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTA----AKSPKAAAPRPKKKSERI 127

Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
                      + + P  K++   RS   +    T         +R        KE++ +
Sbjct: 128 SW------APTLLDSPRRKSS---RSSTVQNKEAT--------HERL-------KEREIR 163

Query: 835 KKKKKKKKKKKKKKKKKKMM 854
           +KK + K +K+K+KKK+K +
Sbjct: 164 RKKIQAKARKRKEKKKEKEL 183



 Score = 32.7 bits (75), Expect = 0.71
 Identities = 17/47 (36%), Positives = 31/47 (65%), Gaps = 3/47 (6%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
           KE   E+ K+++ ++KK + K +K+K+KKK+K   +EE   +L +A 
Sbjct: 150 KEATHERLKEREIRRKKIQAKARKRKEKKKEKELTQEE---RLAEAK 193



 Score = 31.2 bits (71), Expect = 1.9
 Identities = 11/30 (36%), Positives = 20/30 (66%), Gaps = 1/30 (3%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKK-KKKMMKK 856
           + EK+ +++++ KKKK+ K K  K+   KK
Sbjct: 73  EGEKELQREERLKKKKRVKTKAYKEPTKKK 102



 Score = 31.2 bits (71), Expect = 2.2
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           + +++++ KKKK+ K K  K+  KK  KK
Sbjct: 77  ELQREERLKKKKRVKTKAYKEPTKKKKKK 105



 Score = 31.2 bits (71), Expect = 2.2
 Identities = 14/37 (37%), Positives = 18/37 (48%)

Query: 820 RFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           +  Q     K+KK+ K K  K+  KKKKKK     K 
Sbjct: 76  KELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKS 112



 Score = 29.7 bits (67), Expect = 6.8
 Identities = 9/29 (31%), Positives = 20/29 (68%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           D+E+ +K+ +++++ KKKK+ K K   + 
Sbjct: 70  DEEEGEKELQREERLKKKKRVKTKAYKEP 98


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 46.2 bits (110), Expect = 3e-05
 Identities = 22/49 (44%), Positives = 34/49 (69%), Gaps = 3/49 (6%)

Query: 716 DKEEEEEKKKKKKK---KKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
           D+EE+EEK+++K K   K K KK  K K ++K+K K   EE+G+R+L +
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86



 Score = 42.3 bits (100), Expect = 6e-04
 Identities = 16/45 (35%), Positives = 24/45 (53%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           EEE+E+K+++K K   K K KK  K K ++K   + E   K    
Sbjct: 39  EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRE 83



 Score = 35.8 bits (83), Expect = 0.064
 Identities = 13/44 (29%), Positives = 25/44 (56%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
             K+  +E++ ++K+++K K   K K KK  K K   +E+  R+
Sbjct: 32  DVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKRE 75



 Score = 35.4 bits (82), Expect = 0.095
 Identities = 11/36 (30%), Positives = 13/36 (36%), Gaps = 1/36 (2%)

Query: 716 DKEEEE-EKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
           +K + E   K  KKKK K K K            Y 
Sbjct: 193 EKLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDDYD 228



 Score = 35.0 bits (81), Expect = 0.11
 Identities = 11/33 (33%), Positives = 13/33 (39%), Gaps = 2/33 (6%)

Query: 826 YLDKEKKKKKKKKKKKKKKKKKKK--KKKMMKK 856
             +K K +K  K  KKKK K K K         
Sbjct: 191 INEKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223



 Score = 34.6 bits (80), Expect = 0.18
 Identities = 10/33 (30%), Positives = 13/33 (39%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
            E+ + +K  K  KKKK K K K          
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDDD 224



 Score = 34.2 bits (79), Expect = 0.23
 Identities = 13/25 (52%), Positives = 14/25 (56%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKK 851
           L  EK K +K  K  KKKK K K K
Sbjct: 190 LINEKLKAEKAAKGGKKKKGKAKAK 214



 Score = 34.2 bits (79), Expect = 0.23
 Identities = 11/42 (26%), Positives = 24/42 (57%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELA 870
           K KK  K K ++K+K K++K++K + +        ++ ++L 
Sbjct: 57  KAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLR 98



 Score = 33.1 bits (76), Expect = 0.48
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           + E+K+++K K   K K KK  K K+ +K
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAKIEEK 69



 Score = 32.3 bits (74), Expect = 0.80
 Identities = 13/36 (36%), Positives = 16/36 (44%)

Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
           K K +K  K  KKKK K K K+   G  D     + 
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDDYDG 229



 Score = 32.3 bits (74), Expect = 0.84
 Identities = 10/28 (35%), Positives = 10/28 (35%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            EK  K  KKKK K K K          
Sbjct: 197 AEKAAKGGKKKKGKAKAKLNVGGANDDD 224



 Score = 32.3 bits (74), Expect = 0.96
 Identities = 9/25 (36%), Positives = 16/25 (64%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
             + K KK  K K ++K+K K++K+
Sbjct: 53  AAKAKAKKALKAKIEEKEKAKREKE 77



 Score = 32.3 bits (74), Expect = 0.98
 Identities = 10/24 (41%), Positives = 17/24 (70%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
           K K KK  K K ++K+K K++K++
Sbjct: 55  KAKAKKALKAKIEEKEKAKREKEE 78



 Score = 31.9 bits (73), Expect = 1.2
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           ++EK K   K K KK  K K ++K+  K+
Sbjct: 46  EEEKAKVAAKAKAKKALKAKIEEKEKAKR 74



 Score = 31.9 bits (73), Expect = 1.2
 Identities = 12/40 (30%), Positives = 19/40 (47%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
           K   K K KK  K K ++K+K K+  ++     L +   E
Sbjct: 51  KVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPE 90



 Score = 31.6 bits (72), Expect = 1.6
 Identities = 11/28 (39%), Positives = 15/28 (53%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           K K   K K KK  K K ++K+K   +K
Sbjct: 49  KAKVAAKAKAKKALKAKIEEKEKAKREK 76



 Score = 31.2 bits (71), Expect = 2.0
 Identities = 12/34 (35%), Positives = 21/34 (61%), Gaps = 5/34 (14%)

Query: 828 DKEKKKKK-----KKKKKKKKKKKKKKKKKMMKK 856
           D+EK+++K     K K KK  K K ++K+K  ++
Sbjct: 42  DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKRE 75



 Score = 30.4 bits (69), Expect = 3.2
 Identities = 8/29 (27%), Positives = 19/29 (65%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           +++++K+++K K   K K KK  K  +++
Sbjct: 40  EEDEEKEEEKAKVAAKAKAKKALKAKIEE 68



 Score = 29.6 bits (67), Expect = 7.0
 Identities = 11/27 (40%), Positives = 14/27 (51%)

Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKG 857
            +K K +K  K  KKKK K K  +  G
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVG 218



 Score = 29.2 bits (66), Expect = 9.0
 Identities = 14/54 (25%), Positives = 27/54 (50%), Gaps = 5/54 (9%)

Query: 716 DKEEEEEKKK-----KKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           D E+E++  K     ++ ++K+++K K   K K KK   +  EE  +  R+   
Sbjct: 25  DDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEE 78



 Score = 29.2 bits (66), Expect = 9.5
 Identities = 12/35 (34%), Positives = 14/35 (40%)

Query: 727 KKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
            +K K +K  K  KKKK K K   N         D
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDD 226


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 45.1 bits (107), Expect = 3e-05
 Identities = 23/48 (47%), Positives = 27/48 (56%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           K+E EEK+K K KKKK KKKK K K KK  KK    E+   K  +   
Sbjct: 75  KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122



 Score = 44.3 bits (105), Expect = 6e-05
 Identities = 22/48 (45%), Positives = 29/48 (60%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           KE EE++K K KKKK KKKK K K KK  KK   +E++  ++  D   
Sbjct: 76  KEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLE 123



 Score = 42.0 bits (99), Expect = 4e-04
 Identities = 19/49 (38%), Positives = 26/49 (53%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
            K + ++KK KKKK K K KK  KK  K +KK     E+ +  L  + S
Sbjct: 82  QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYS 130



 Score = 38.5 bits (90), Expect = 0.005
 Identities = 15/34 (44%), Positives = 21/34 (61%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
             K ++++ KKKK K K KK  KK  K +KK +K
Sbjct: 83  KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116



 Score = 38.1 bits (89), Expect = 0.008
 Identities = 12/41 (29%), Positives = 22/41 (53%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
              K+++++KK  KK  K +KK +K+ + K +    S  E 
Sbjct: 92  KKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSET 132



 Score = 36.2 bits (84), Expect = 0.033
 Identities = 21/65 (32%), Positives = 26/65 (40%), Gaps = 5/65 (7%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKG 887
            K K KKKK KKKK K K KK  KK  K        + EKE  +         ++     
Sbjct: 82  QKWKWKKKKSKKKKDKDKDKKDDKKDDKSE-----KKDEKEAEDKLEDLTKSYSETLSTL 136

Query: 888 GDPKN 892
            + K 
Sbjct: 137 SELKP 141



 Score = 35.8 bits (83), Expect = 0.042
 Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 4/44 (9%)

Query: 716 DKEEEEEKK----KKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
            +E E+ KK    K+K K KKKK KKKK K K KK    +++  
Sbjct: 68  AEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSE 111



 Score = 35.1 bits (81), Expect = 0.084
 Identities = 19/44 (43%), Positives = 28/44 (63%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
             KE  EE +K KK+ ++K+K K KKKK KKKK    +++  +K
Sbjct: 63  KKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKK 106



 Score = 34.7 bits (80), Expect = 0.11
 Identities = 10/38 (26%), Positives = 21/38 (55%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
            DK+++++KK  K +KK +K+ + K +   K    +  
Sbjct: 97  KDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLS 134



 Score = 34.3 bits (79), Expect = 0.16
 Identities = 18/56 (32%), Positives = 25/56 (44%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPF 884
           K+ KKKK K K KK  KK  K +K  +K   D L  + K  +    T      + +
Sbjct: 89  KKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRKY 144



 Score = 33.9 bits (78), Expect = 0.19
 Identities = 15/72 (20%), Positives = 35/72 (48%)

Query: 690 LADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           LA+  E  +  +   +  ++ +     K+++++ KK  KK  K +KK +K+ + K +   
Sbjct: 67  LAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLT 126

Query: 750 SNEEEGMRKLRD 761
            +  E +  L +
Sbjct: 127 KSYSETLSTLSE 138



 Score = 33.9 bits (78), Expect = 0.20
 Identities = 21/60 (35%), Positives = 27/60 (45%), Gaps = 15/60 (25%)

Query: 710 NQIAYPDKEEEEEKKKKKKK---------------KKKKKKKKKKKKKKKKKKKYSNEEE 754
              A P  + E  + KKKKK               K+K K KKKK KKKK K K   +++
Sbjct: 46  RHFATPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDK 105


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 45.0 bits (107), Expect = 5e-05
 Identities = 13/55 (23%), Positives = 31/55 (56%), Gaps = 1/55 (1%)

Query: 708 RFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
              Q+   ++E+ +E+++++  +  K + ++ + K+K K+    E+E MR  R A
Sbjct: 118 FLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRH-RAA 171



 Score = 31.5 bits (72), Expect = 1.3
 Identities = 13/66 (19%), Positives = 31/66 (46%), Gaps = 14/66 (21%)

Query: 816 DPKQRFNQIAYLDKEKKKKK----------KKKKKKKKKKKKKKKKKMMKKGYPDVLSQM 865
           D +++   +  L++E+++K+            K + ++ + K+K K+M K+       +M
Sbjct: 111 DVRKQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEE----DEEM 166

Query: 866 EKELAN 871
               AN
Sbjct: 167 RHRAAN 172


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 46.4 bits (111), Expect = 7e-05
 Identities = 28/174 (16%), Positives = 60/174 (34%), Gaps = 11/174 (6%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQ 778
           EEEE +K++ K++ +++ K + ++    +     +        A                
Sbjct: 39  EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARP 98

Query: 779 IAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKK 838
                             A R + ++             K +        + K  K +++
Sbjct: 99  AED------EAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRR 152

Query: 839 KKKKKKKKKKKKKKMMKKGYPDVLSQME----KELAN-INRTAVAPINKPFDKG 887
           ++ +++++KKKKK+   +  P  +   E     ELA  +   A   I K F  G
Sbjct: 153 RRGRRRRRKKKKKQKPTEKIPREVVIPETITVAELAEKMAVKAAEVIKKLFKLG 206



 Score = 29.8 bits (68), Expect = 7.5
 Identities = 10/52 (19%), Positives = 28/52 (53%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
             K     K+ K  K +++++ +++++KKKKK+K + +      + +  ++ 
Sbjct: 134 KRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPREVVIPETITVA 185


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 45.3 bits (107), Expect = 1e-04
 Identities = 33/156 (21%), Positives = 60/156 (38%), Gaps = 34/156 (21%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
           K  +E K +  K+++K+K++ K++KKKKK+K     ++   K             +    
Sbjct: 93  KPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPK----------EEAKEKRP 142

Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
           P+        K    EK     R   +              K+R    A    +K  KKK
Sbjct: 143 PK-------EKEKEKEKKVEEPRDREE-------------EKKRERVRAKSRPKKPPKKK 182

Query: 837 KKKKKKKK----KKKKKKKKMMKKGYPDVLSQMEKE 868
              KKK+     K+++  ++ +K    +     E+E
Sbjct: 183 PPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEERE 218



 Score = 40.6 bits (95), Expect = 0.003
 Identities = 25/142 (17%), Positives = 58/142 (40%), Gaps = 17/142 (11%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS--------IQ 766
             KEE +++K K++ K+K+  K+K+K+K+KK ++  + EE  ++ R  A          +
Sbjct: 123 KPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKK 182

Query: 767 CGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAY 826
               K+ P E +              +    +    +            D K R    + 
Sbjct: 183 PPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREK---------EEDDGKDRETTTSP 233

Query: 827 LDKEKKKKKKKKKKKKKKKKKK 848
           +++++ ++  +  ++     KK
Sbjct: 234 MEEDESRQSSEISRRSSSSLKK 255



 Score = 34.9 bits (80), Expect = 0.20
 Identities = 23/71 (32%), Positives = 33/71 (46%), Gaps = 3/71 (4%)

Query: 704 TEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
            E  +  +    +  + EE+KK+++ + K + KK  KKK   KKK   EEE   K R AA
Sbjct: 144 KEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEE---KQRQAA 200

Query: 764 SIQCGPVKEVP 774
                   E P
Sbjct: 201 REAVKGKPEEP 211


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 44.8 bits (106), Expect = 2e-04
 Identities = 34/145 (23%), Positives = 64/145 (44%), Gaps = 10/145 (6%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEP 777
           E +E   KKKK+ +KK K KKK  K K+ K   + E+   K +   + +    +    EP
Sbjct: 171 EAQERLTKKKKRGQKKSKYKKKTFKPKRAKSIPDFEKLHEKFQKQLAEKKKSKRPTVPEP 230

Query: 778 ---QIAPCLF--DIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKK 832
              Q +         +         +    +R    +T+      K    +     +++K
Sbjct: 231 FNFQESHKSSSRTYLDQENISAGEENLKPTRRKLPPSTKKWESLVKFLRTE-----RKEK 285

Query: 833 KKKKKKKKKKKKKKKKKKKKMMKKG 857
           + K++++KK+ +++KKKKK+M  K 
Sbjct: 286 EAKEQQEKKELEQRKKKKKEMAPKV 310



 Score = 35.1 bits (81), Expect = 0.16
 Identities = 29/152 (19%), Positives = 59/152 (38%), Gaps = 27/152 (17%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAP 781
            K++KKK++KK  ++ K +++  K +K   E E  +K +            VP   ++ P
Sbjct: 12  TKREKKKREKKSIRQSKLEEELNKLEKKEEEAECKKKFK---------ANPVPASVKL-P 61

Query: 782 CLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKK- 840
              +I     E+           +                   ++ ++E++KK    +K 
Sbjct: 62  LYEEIMEQNEERREEVREKSKAIL------------LSSQKPFSFYEREEQKKAILPRKL 109

Query: 841 ----KKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
                +++ KK K K + K  Y  +L    +E
Sbjct: 110 RSSTSEREPKKFKAKPVPKSIYIPLLKDKMQE 141



 Score = 29.7 bits (67), Expect = 6.7
 Identities = 12/36 (33%), Positives = 28/36 (77%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
           +++E+E K++++KK+ +++KKKKK+   K K+++  
Sbjct: 281 ERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEA 316



 Score = 29.4 bits (66), Expect = 9.9
 Identities = 10/47 (21%), Positives = 27/47 (57%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
            +  + ++K+K+ K++++KK+ +++KKKK     +  ++       Q
Sbjct: 275 VKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQ 321


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 42.4 bits (100), Expect = 2e-04
 Identities = 15/35 (42%), Positives = 28/35 (80%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           +P++ E+E+K+  K+ ++K+  K KK+KK+KK+KK
Sbjct: 91  FPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125



 Score = 37.0 bits (86), Expect = 0.015
 Identities = 13/44 (29%), Positives = 27/44 (61%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
             +++E  K+ ++K+  K KK+KK+KK+KK  +K + ++     
Sbjct: 96  EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTT 139



 Score = 34.4 bits (79), Expect = 0.11
 Identities = 13/39 (33%), Positives = 25/39 (64%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
            E+ +K++K+  K+ ++K+  K KK+KK   E++   KL
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKL 130



 Score = 33.6 bits (77), Expect = 0.19
 Identities = 12/29 (41%), Positives = 22/29 (75%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           +  K+ ++K+  K KK+KK+KK+KK+ +K
Sbjct: 101 EVSKETEEKEAIKAKKEKKEKKEKKVAEK 129



 Score = 32.8 bits (75), Expect = 0.34
 Identities = 11/29 (37%), Positives = 20/29 (68%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           + EK++K+  K+ ++K+  K KK+K  KK
Sbjct: 94  ESEKEQKEVSKETEEKEAIKAKKEKKEKK 122



 Score = 32.0 bits (73), Expect = 0.65
 Identities = 14/44 (31%), Positives = 27/44 (61%)

Query: 814 YLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
           + +  ++  +    + E+K+  K KK+KK+KK+KK  +K+ KK 
Sbjct: 91  FPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKK 134



 Score = 32.0 bits (73), Expect = 0.65
 Identities = 15/51 (29%), Positives = 28/51 (54%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAP 879
           KE+K+  K+ ++K+  K KK+KK+  +K   + L++ +      N T  A 
Sbjct: 97  KEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKAT 147



 Score = 31.3 bits (71), Expect = 1.1
 Identities = 12/34 (35%), Positives = 19/34 (55%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
             + +++KK+KK+KK  +K  KKK  K  K    
Sbjct: 111 AIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTK 144



 Score = 31.3 bits (71), Expect = 1.1
 Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 3/44 (6%)

Query: 715 PDKEEEEEKKKK---KKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
             KE++E+K+KK   K  KKK  K  K   KK  KK  + +EEG
Sbjct: 114 AKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEG 157



 Score = 29.0 bits (65), Expect = 7.8
 Identities = 14/29 (48%), Positives = 18/29 (62%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            K KK+KK+KK+KK  +K  KKK     K
Sbjct: 112 IKAKKEKKEKKEKKVAEKLAKKKSTKTTK 140


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 45.1 bits (106), Expect = 2e-04
 Identities = 31/139 (22%), Positives = 57/139 (41%), Gaps = 18/139 (12%)

Query: 717  KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE--EEGMRKLRDAASIQCGPVKEVP 774
            K++ EE KK  +  K + +    + +  ++K  + E  +E  +K  DAA  +    K   
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKK--- 1391

Query: 775  CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDP-KQRFNQIAYLDKEKKK 833
                        K D  +K    D+ +   +          D  K++  +    D+ KKK
Sbjct: 1392 ------------KADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKK 1439

Query: 834  KKKKKKKKKKKKKKKKKKK 852
             ++ KK  + KKK ++ KK
Sbjct: 1440 AEEAKKADEAKKKAEEAKK 1458



 Score = 39.0 bits (90), Expect = 0.016
 Identities = 32/134 (23%), Positives = 61/134 (45%), Gaps = 11/134 (8%)

Query: 717  KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
            K EE+EKK  +  KK+ ++ KK ++ KKK+ +   + E ++K  +   I+    K+   E
Sbjct: 1682 KAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE 1741

Query: 777  PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
             +      + K D  EK          +I H   E      + R  + A +++E  ++ +
Sbjct: 1742 DKKKA--EEAKKDEEEKK---------KIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDE 1790

Query: 837  KKKKKKKKKKKKKK 850
            K++ +  KK K   
Sbjct: 1791 KRRMEVDKKIKDIF 1804



 Score = 37.8 bits (87), Expect = 0.033
 Identities = 38/190 (20%), Positives = 85/190 (44%), Gaps = 2/190 (1%)

Query: 681  KNDPCEKNNLADRSEDQRINHYTTEVGRFNQI-AYPDKEEEEEKKKKKKKKKKKKKKKKK 739
            KN    K   A ++E+ RI           ++ A   K+ EE K K ++ KK +++KKK 
Sbjct: 1576 KNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKV 1635

Query: 740  KKKKKKKKKYSNEEEGMRKLRDAASIQCGPV-KEVPCEPQIAPCLFDIKNDPCEKNNLAD 798
            ++ KKK+ +   + E ++K  +   I+     K+   + + A      + D  +      
Sbjct: 1636 EQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695

Query: 799  RSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
            +   +       +    + K++  ++   ++E K K ++ KK+ ++ KKK ++    +  
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755

Query: 859  PDVLSQMEKE 868
               ++ ++KE
Sbjct: 1756 KKKIAHLKKE 1765



 Score = 37.8 bits (87), Expect = 0.033
 Identities = 33/152 (21%), Positives = 63/152 (41%), Gaps = 7/152 (4%)

Query: 705  EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
            E  +    A  D+ E  E+K +  +KKK++ KKK    KKK ++    +E  +K  +   
Sbjct: 1346 EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405

Query: 765  IQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQI 824
             +   +K+     + A        +  + +    ++E  +      +      K    + 
Sbjct: 1406 -KADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAK------KADEAKKKAEEAKK 1458

Query: 825  AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            A   K+K ++ KK  + KKK ++ KK    KK
Sbjct: 1459 AEEAKKKAEEAKKADEAKKKAEEAKKADEAKK 1490



 Score = 36.7 bits (84), Expect = 0.079
 Identities = 34/144 (23%), Positives = 58/144 (40%), Gaps = 23/144 (15%)

Query: 717  KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
            K+ +E KKK ++ KK  + KKK ++ KKK      + E  +K  +AA  +     +    
Sbjct: 1302 KKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD---- 1357

Query: 777  PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
                      + +  E+   A   + +              K++  +    D+ KKK ++
Sbjct: 1358 ----------EAEAAEEKAEAAEKKKEEAKKKADAA-----KKKAEEKKKADEAKKKAEE 1402

Query: 837  KKKK----KKKKKKKKKKKKMMKK 856
             KKK    KK    KKK  +  KK
Sbjct: 1403 DKKKADELKKAAAAKKKADEAKKK 1426



 Score = 36.3 bits (83), Expect = 0.11
 Identities = 34/137 (24%), Positives = 65/137 (47%), Gaps = 9/137 (6%)

Query: 716  DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
            +K++EE KKK    KKK ++KKK  + KKK ++   + + ++K   AA  +    K+   
Sbjct: 1370 EKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK-AAAAKKKADEAKKKAE 1428

Query: 776  EPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKK 835
            E + A        +  + +    ++E  +           + K++  +    D+ KKK +
Sbjct: 1429 EKKKADEAKKKAEEAKKADEAKKKAEEAKKAE--------EAKKKAEEAKKADEAKKKAE 1480

Query: 836  KKKKKKKKKKKKKKKKK 852
            + KK  + KKK ++ KK
Sbjct: 1481 EAKKADEAKKKAEEAKK 1497



 Score = 35.5 bits (81), Expect = 0.16
 Identities = 37/164 (22%), Positives = 70/164 (42%), Gaps = 9/164 (5%)

Query: 713  AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE 772
            A  DK++ +E KK    KKK  + KKK ++KKK  +   + E  +K  +A        K+
Sbjct: 1400 AEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEA--------KK 1451

Query: 773  VPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKK 832
               E + A        +  + +    ++E  +      +    + K++ ++     + KK
Sbjct: 1452 KAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE-EAKKKADEAKKAAEAKK 1510

Query: 833  KKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTA 876
            K  + KK ++ KK  + KK    K   +     EK+ A+  + A
Sbjct: 1511 KADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKA 1554



 Score = 35.1 bits (80), Expect = 0.20
 Identities = 33/152 (21%), Positives = 63/152 (41%), Gaps = 15/152 (9%)

Query: 716  DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
            + ++ EE KKK ++ KK  + KKK ++ KK  +   + E  +K  D A       K    
Sbjct: 1455 EAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKK---- 1510

Query: 776  EPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKK 835
                       K D  +K   A +++  +      +       +   +   L K ++ KK
Sbjct: 1511 -----------KADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKK 1559

Query: 836  KKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
             ++KKK ++ KK ++ K M     +   + E+
Sbjct: 1560 AEEKKKAEEAKKAEEDKNMALRKAEEAKKAEE 1591



 Score = 35.1 bits (80), Expect = 0.21
 Identities = 36/152 (23%), Positives = 60/152 (39%), Gaps = 18/152 (11%)

Query: 717  KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
            K+ EE KK  + KKK ++ KKK    KKK ++     E  +   +AA+ +    +E    
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEE---- 1364

Query: 777  PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
                      K +  EK     + +         E    D  ++  +    + +KK  + 
Sbjct: 1365 ----------KAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAE----EDKKKADEL 1410

Query: 837  KKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
            KK    KKK  + KKK  +K   D   +  +E
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKKAEE 1442



 Score = 33.6 bits (76), Expect = 0.67
 Identities = 33/139 (23%), Positives = 64/139 (46%), Gaps = 2/139 (1%)

Query: 716  DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE--EEGMRKLRDAASIQCGPVKEV 773
            +K++ +E KKK ++ KK  + KKK ++ KKK   + +  EE  +    A +       E 
Sbjct: 1300 EKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEA 1359

Query: 774  PCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKK 833
                + A      K +  +K + A +   ++      +    + K++ +++      KKK
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419

Query: 834  KKKKKKKKKKKKKKKKKKK 852
              + KKK ++KKK  + KK
Sbjct: 1420 ADEAKKKAEEKKKADEAKK 1438



 Score = 32.8 bits (74), Expect = 1.1
 Identities = 36/165 (21%), Positives = 71/165 (43%), Gaps = 12/165 (7%)

Query: 716  DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASI-------QCG 768
            + ++ +E KKK ++ KKK  + KK  + KKK   + + E  +K  +A          +  
Sbjct: 1481 EAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAK 1540

Query: 769  PVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQR-INHYTTEVGYLDPKQRFNQIAYL 827
              +E     ++       K +  +K   A ++E  + +     E      + R  ++  L
Sbjct: 1541 KAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKL 1600

Query: 828  DKEKKKKKKKKKKKKKKKKKK----KKKKMMKKGYPDVLSQMEKE 868
             +E+KK K ++ KK ++ K K    KK +  KK    +  +  +E
Sbjct: 1601 YEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEE 1645



 Score = 32.4 bits (73), Expect = 1.3
 Identities = 40/157 (25%), Positives = 74/157 (47%), Gaps = 20/157 (12%)

Query: 713  AYPDKEEEEEKKKKKKKKKK----KKKKKKKKKKKKKKKKYSNEEEGMRK---------L 759
            A   K++ +E KK  + KKK    KK ++ KK  + KK + + + +  +K         L
Sbjct: 1492 AEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADEL 1551

Query: 760  RDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQ 819
            + A  ++    K+   E + A    + KN    K   A ++E  RI     EV  L  ++
Sbjct: 1552 KKAEELKKAEEKKKAEEAKKAE---EDKNMALRKAEEAKKAEEARIE----EVMKLYEEE 1604

Query: 820  RFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            +  +     K ++ K K ++ KK +++KKK +++ KK
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKK 1641



 Score = 31.3 bits (70), Expect = 3.7
 Identities = 35/183 (19%), Positives = 65/183 (35%), Gaps = 40/183 (21%)

Query: 691  ADRSEDQRINHYT-----TEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
            A ++E++R N          +  F +     K EE  K  + KK ++KKK  + KK ++K
Sbjct: 1242 AKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEK 1301

Query: 746  KKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRI 805
            KK     +E  +K  +A                        K D  +K     + +    
Sbjct: 1302 KKA----DEAKKKAEEAK-----------------------KADEAKKKAEEAKKKADAA 1334

Query: 806  NHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQM 865
                 E       ++  + A    E +    + +  ++K +  +KKK   K   D   + 
Sbjct: 1335 KKKAEE------AKKAAEAA--KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386

Query: 866  EKE 868
             +E
Sbjct: 1387 AEE 1389



 Score = 29.7 bits (66), Expect = 8.6
 Identities = 37/156 (23%), Positives = 60/156 (38%), Gaps = 41/156 (26%)

Query: 717  KEEEEEKKKKKKKKKKKKKK----KKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE 772
            K+ EE+KK  + KK ++KKK    KKK ++ KK  +   + E  +K  DAA  +    K+
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKK 1343

Query: 773  VPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKK 832
                            +  +    A   E +                         +EK 
Sbjct: 1344 --------------AAEAAKAEAEAAADEAEA-----------------------AEEKA 1366

Query: 833  KKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
            +  +KKK++ KKK    KKK  +K   D   +  +E
Sbjct: 1367 EAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEE 1402


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 40.1 bits (94), Expect = 4e-04
 Identities = 18/32 (56%), Positives = 20/32 (62%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           K KK   KKKKKKKKKK K K++      EEE
Sbjct: 13  KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEE 44



 Score = 38.6 bits (90), Expect = 0.001
 Identities = 18/32 (56%), Positives = 22/32 (68%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           K K KK   KKKKKKKKKK K K++  + +EE
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEE 42



 Score = 37.8 bits (88), Expect = 0.002
 Identities = 16/36 (44%), Positives = 21/36 (58%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
            KK   KKKKKKKKKK K K++   +   EE+   +
Sbjct: 14  GKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAE 49



 Score = 36.2 bits (84), Expect = 0.009
 Identities = 16/52 (30%), Positives = 25/52 (48%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
             KKKKKKKKKK K K++   +K+++  +  E   K  +         +E  
Sbjct: 18  DVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEED 69



 Score = 35.5 bits (82), Expect = 0.013
 Identities = 16/43 (37%), Positives = 20/43 (46%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELAN 871
           K K KK   KKKKKKKKKK K K+ +     +      +    
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLK 53



 Score = 34.7 bits (80), Expect = 0.029
 Identities = 15/33 (45%), Positives = 24/33 (72%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           + KKKKKKKKKK K K++   +K++++ S+ E 
Sbjct: 18  DVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAES 50



 Score = 33.9 bits (78), Expect = 0.045
 Identities = 16/26 (61%), Positives = 16/26 (61%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
             K K K KK   KKKKKKKKKK K 
Sbjct: 7   GGKLKLKGKKIDVKKKKKKKKKKNKS 32



 Score = 33.9 bits (78), Expect = 0.057
 Identities = 17/32 (53%), Positives = 17/32 (53%)

Query: 724 KKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
             K K K KK   KKKKKKKKKK K   E   
Sbjct: 7   GGKLKLKGKKIDVKKKKKKKKKKNKSKEEVVT 38



 Score = 33.5 bits (77), Expect = 0.059
 Identities = 13/25 (52%), Positives = 16/25 (64%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
            K   KKKKKKKKKK K K++   +
Sbjct: 15  KKIDVKKKKKKKKKKNKSKEEVVTE 39



 Score = 32.8 bits (75), Expect = 0.13
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 826 YLDKEKKKKKKKKKKKKKKKKKKKKKK 852
                KKKKKKKKKK K K++   +K+
Sbjct: 15  KKIDVKKKKKKKKKKNKSKEEVVTEKE 41



 Score = 32.8 bits (75), Expect = 0.14
 Identities = 13/26 (50%), Positives = 20/26 (76%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKK 852
           +D +KKKKKKKKK K K++   +K++
Sbjct: 17  IDVKKKKKKKKKKNKSKEEVVTEKEE 42



 Score = 32.8 bits (75), Expect = 0.14
 Identities = 15/32 (46%), Positives = 19/32 (59%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
           L  +K   KKKKKKKKKK K K++    K+  
Sbjct: 12  LKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEE 43



 Score = 32.0 bits (73), Expect = 0.21
 Identities = 15/32 (46%), Positives = 18/32 (56%)

Query: 727 KKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
             K K K KK   KKKKKKKKK +  +E +  
Sbjct: 7   GGKLKLKGKKIDVKKKKKKKKKKNKSKEEVVT 38



 Score = 31.2 bits (71), Expect = 0.41
 Identities = 15/51 (29%), Positives = 30/51 (58%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
            K + ++KKKKKKKK K K++   +K++++K    ++ +   +  D   I+
Sbjct: 15  KKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIE 65



 Score = 30.1 bits (68), Expect = 0.98
 Identities = 15/26 (57%), Positives = 15/26 (57%)

Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKK 856
             K K K KK   KKKKKKKKK  K 
Sbjct: 7   GGKLKLKGKKIDVKKKKKKKKKKNKS 32



 Score = 27.8 bits (62), Expect = 6.7
 Identities = 11/25 (44%), Positives = 19/25 (76%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
            K+KKKKKKK K K++   +K++++
Sbjct: 20  KKKKKKKKKKNKSKEEVVTEKEEEE 44


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 43.3 bits (102), Expect = 4e-04
 Identities = 35/159 (22%), Positives = 65/159 (40%), Gaps = 22/159 (13%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY----SNEEEGMRKLRDAASIQCGPV 770
            + E+E+E K   ++   +++ ++K+K+K+K+ K      +E+E M  + ++      PV
Sbjct: 266 DEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPES------PV 319

Query: 771 KEVPCEPQIAPCLFDIKNDPCEKNNLADRSEV---QRINHYTTEV---GYLDPKQRFNQI 824
           +E   E    P L   + +  E     D       +R+    T     GYL  K+ +   
Sbjct: 320 EEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRRRVMKKKTFKDEEGYLVTKKVYEWE 379

Query: 825 AYLDKE------KKKKKKKKKKKKKKKKKKKKKKMMKKG 857
           ++ + E      K K K          KK K  K  K+ 
Sbjct: 380 SFSEDEAEPPPTKPKPKVSTPAVPAAAKKPKAPKKKKQS 418



 Score = 36.7 bits (85), Expect = 0.047
 Identities = 28/139 (20%), Positives = 49/139 (35%), Gaps = 24/139 (17%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
           KE   EK + K   K    K+   KK      +  ++   +K +  AS     VKE   E
Sbjct: 189 KETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASES--TVKEESEE 246

Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
                    I  D   +    D  E                +            ++   +
Sbjct: 247 ES--GKRDVILEDESAEPTGLDEDE---------------DEDEP-----KPSGERSDSE 284

Query: 837 KKKKKKKKKKKKKKKKMMK 855
           ++ ++K+K+K+K+ KKMM+
Sbjct: 285 EETEEKEKEKRKRLKKMME 303



 Score = 34.4 bits (79), Expect = 0.26
 Identities = 13/37 (35%), Positives = 15/37 (40%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
            D+ E    K K K          KK K  KKKK S+
Sbjct: 383 EDEAEPPPTKPKPKVSTPAVPAAAKKPKAPKKKKQSS 419



 Score = 32.5 bits (74), Expect = 1.1
 Identities = 28/147 (19%), Positives = 59/147 (40%), Gaps = 15/147 (10%)

Query: 726 KKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC-EPQIAPCLF 784
            KKK K+KK+KK+  +   K++      +    L D ++   G  ++    EP+ +    
Sbjct: 222 FKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERS 281

Query: 785 DIKNDPCEKN-----------NLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKK 833
           D + +  EK               D  E   I     E    + +    +   L K++++
Sbjct: 282 DSEEETEEKEKEKRKRLKKMMEDEDEDEEMEI---VPESPVEEEESEEPEPPPLPKKEEE 338

Query: 834 KKKKKKKKKKKKKKKKKKKMMKKGYPD 860
           K++        +++ +++ M KK + D
Sbjct: 339 KEEVTVSPDGGRRRGRRRVMKKKTFKD 365



 Score = 31.0 bits (70), Expect = 2.9
 Identities = 11/35 (31%), Positives = 13/35 (37%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           + + E E    K K K          KK K  KKK
Sbjct: 381 FSEDEAEPPPTKPKPKVSTPAVPAAAKKPKAPKKK 415


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 42.3 bits (100), Expect = 4e-04
 Identities = 31/143 (21%), Positives = 61/143 (42%), Gaps = 21/143 (14%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEP 777
              E++++K++++K +KK+K+K+ KKK+  + S  EE                K  P E 
Sbjct: 6   ALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEE------VKNEENKSKKKAAPIEN 59

Query: 778 QIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK- 836
                +F        K   AD  + ++      +      K++ +    L K + +KKK 
Sbjct: 60  AEGNIVFS-------KVEFADGEQAKKDLKLKKK----KKKKKTDYKQLLKKLEARKKKL 108

Query: 837 ---KKKKKKKKKKKKKKKKMMKK 856
               + K  + ++K+K  K + K
Sbjct: 109 EELDEDKAAEIEEKEKWTKALAK 131



 Score = 37.3 bits (87), Expect = 0.017
 Identities = 20/37 (54%), Positives = 26/37 (70%), Gaps = 5/37 (13%)

Query: 717 KEEEEEKKKKKKKK-----KKKKKKKKKKKKKKKKKK 748
           KE+ E+KK +++KK     KK+K  KK KKKKK KKK
Sbjct: 166 KEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKK 202



 Score = 35.7 bits (83), Expect = 0.053
 Identities = 16/29 (55%), Positives = 18/29 (62%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
            K EE  KK+K  KK KKKKK KKK +  
Sbjct: 178 KKREENLKKRKDDKKNKKKKKAKKKGRIL 206



 Score = 35.4 bits (82), Expect = 0.082
 Identities = 44/172 (25%), Positives = 69/172 (40%), Gaps = 57/172 (33%)

Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCG 768
           F+++ + D E+ ++  K KKKKKKKK   K+  KK + +K   + E + + + A      
Sbjct: 66  FSKVEFADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARK--KKLEELDEDKAA------ 117

Query: 769 PVKEVPCEPQIAPCLFDIKNDPCEK-NNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYL 827
                           +I+    EK      ++E  ++          D K     +   
Sbjct: 118 ----------------EIEEK--EKWTKALAKAEGVKVKD--------DEKLLKKALKRK 151

Query: 828 DKEKKKKKKK----------------KK------KKKKKKKKKKKKKMMKKG 857
           +K+KKK KK+                KK      K+K  KK KKKKK  KKG
Sbjct: 152 EKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKG 203



 Score = 35.0 bits (81), Expect = 0.12
 Identities = 16/29 (55%), Positives = 19/29 (65%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
           K+ EE  KK+K  KK KKKKK KKK +  
Sbjct: 178 KKREENLKKRKDDKKNKKKKKAKKKGRIL 206



 Score = 34.2 bits (79), Expect = 0.18
 Identities = 16/37 (43%), Positives = 24/37 (64%), Gaps = 5/37 (13%)

Query: 717 KEEEEEKKKK-----KKKKKKKKKKKKKKKKKKKKKK 748
           ++++ E++KK     KK+K  KK KKKKK KKK +  
Sbjct: 170 EKKKAERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206



 Score = 33.0 bits (76), Expect = 0.39
 Identities = 14/32 (43%), Positives = 22/32 (68%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
           +++++ E+  KK+K  KK KKKKK KKK +  
Sbjct: 175 ERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206



 Score = 33.0 bits (76), Expect = 0.46
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 831 KKKKKKKKKKKKKKKKKKKKKK 852
           KK+K  KK KKKKK KKK +  
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206



 Score = 32.7 bits (75), Expect = 0.58
 Identities = 13/31 (41%), Positives = 20/31 (64%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
           +++KK+++  KK+K  KK KKKKK   K   
Sbjct: 175 ERQKKREENLKKRKDDKKNKKKKKAKKKGRI 205



 Score = 31.9 bits (73), Expect = 0.91
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKK 850
           K++K  KK KKKKK KKK +  
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206



 Score = 31.9 bits (73), Expect = 1.2
 Identities = 12/35 (34%), Positives = 28/35 (80%)

Query: 822 NQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           ++ A L++ ++K++++K +KK+K+K+ KKK+  +K
Sbjct: 3   SREALLEQRRRKREQRKARKKQKRKEAKKKEDAQK 37



 Score = 31.5 bits (72), Expect = 1.5
 Identities = 13/23 (56%), Positives = 14/23 (60%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKK 849
           L K K  KK KKKKK KKK +  
Sbjct: 184 LKKRKDDKKNKKKKKAKKKGRIL 206



 Score = 30.4 bits (69), Expect = 3.5
 Identities = 10/39 (25%), Positives = 24/39 (61%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
           E++++K++++K +KK+K+K+ KK       +      +E
Sbjct: 9   EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEE 47


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 41.4 bits (98), Expect = 4e-04
 Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 814 YLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMK 855
           Y D K+     A LDK++ K  +K +KK +K K K +KK  K
Sbjct: 51  YKDLKESLEA-ALLDKKELKAWEKAEKKAEKAKAKAEKKKAK 91



 Score = 41.4 bits (98), Expect = 5e-04
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKK 746
           ++E K  +K +KK +K K K +KKK K
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAK 91



 Score = 41.4 bits (98), Expect = 5e-04
 Identities = 16/35 (45%), Positives = 23/35 (65%), Gaps = 2/35 (5%)

Query: 716 DKEE--EEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           DK+E    EK +KK +K K K +KKK KK++ K +
Sbjct: 64  DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 39.0 bits (92), Expect = 0.003
 Identities = 14/34 (41%), Positives = 21/34 (61%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
             KE +  +K +KK +K K K +KKK KK++ K 
Sbjct: 64  DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97



 Score = 38.3 bits (90), Expect = 0.005
 Identities = 14/31 (45%), Positives = 19/31 (61%)

Query: 724 KKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           KK+ K  +K +KK +K K K +KKK   EE 
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEP 95



 Score = 37.5 bits (88), Expect = 0.010
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKK 852
            +K +KK +K K K +KKK KK++ K
Sbjct: 71  WEKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 36.4 bits (85), Expect = 0.020
 Identities = 13/28 (46%), Positives = 18/28 (64%)

Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
           A+   EKK +K K K +KKK KK++ K 
Sbjct: 70  AWEKAEKKAEKAKAKAEKKKAKKEEPKP 97



 Score = 34.8 bits (81), Expect = 0.074
 Identities = 11/33 (33%), Positives = 19/33 (57%)

Query: 726 KKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           KK+ K  +K +KK +K K K +K   ++E  + 
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97



 Score = 32.9 bits (76), Expect = 0.35
 Identities = 14/44 (31%), Positives = 21/44 (47%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           KE  E     KK+ K  +K +KK +K K K +    ++   K R
Sbjct: 55  KESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 32.1 bits (74), Expect = 0.50
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKK 856
           KK+ K  +K +KK +K K K +  K 
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKA 90



 Score = 31.3 bits (72), Expect = 0.95
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKK 851
           K+ +K K K +KKK KK++ K +
Sbjct: 76  KKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 29.4 bits (67), Expect = 5.2
 Identities = 10/30 (33%), Positives = 15/30 (50%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKK 847
           K         +K K K +KKK KK++ K +
Sbjct: 69  KAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 42.7 bits (101), Expect = 5e-04
 Identities = 16/66 (24%), Positives = 28/66 (42%), Gaps = 1/66 (1%)

Query: 694 SEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
                  +      +  +     +  E   +KK+K K++ KKKK K+ K  K  K     
Sbjct: 219 EFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVV-A 277

Query: 754 EGMRKL 759
           +GM+K+
Sbjct: 278 KGMKKI 283



 Score = 38.5 bits (90), Expect = 0.012
 Identities = 15/29 (51%), Positives = 17/29 (58%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            KE+ KKKK K+ K  K  KK   K MKK
Sbjct: 254 SKEEIKKKKPKESKGVKALKKVVAKGMKK 282



 Score = 36.2 bits (84), Expect = 0.069
 Identities = 14/30 (46%), Positives = 19/30 (63%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
            K K K++ KKKK K+ K  K  KK++ KG
Sbjct: 250 KKRKSKEEIKKKKPKESKGVKALKKVVAKG 279



 Score = 35.0 bits (81), Expect = 0.13
 Identities = 13/30 (43%), Positives = 20/30 (66%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
             + +KK+K K++ KKKK K+ K  K +KK
Sbjct: 245 ESRAEKKRKSKEEIKKKKPKESKGVKALKK 274



 Score = 33.8 bits (78), Expect = 0.32
 Identities = 13/27 (48%), Positives = 18/27 (66%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           EKK+K K++ KKKK K+ K  K + K 
Sbjct: 249 EKKRKSKEEIKKKKPKESKGVKALKKV 275



 Score = 33.5 bits (77), Expect = 0.46
 Identities = 11/25 (44%), Positives = 14/25 (56%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKM 853
           K+KK K+ K  K  KK   K  KK+
Sbjct: 259 KKKKPKESKGVKALKKVVAKGMKKI 283



 Score = 32.3 bits (74), Expect = 0.92
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 823 QIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
             +  +K++K K++ KKKK K+ K  K  K
Sbjct: 244 AESRAEKKRKSKEEIKKKKPKESKGVKALK 273



 Score = 31.9 bits (73), Expect = 1.2
 Identities = 12/44 (27%), Positives = 23/44 (52%), Gaps = 2/44 (4%)

Query: 814 YLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
             + K++  +     +  + + +KK+K K++ KKKK K    KG
Sbjct: 227 LKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPK--ESKG 268



 Score = 29.2 bits (66), Expect = 9.3
 Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 12/62 (19%)

Query: 807 HYTTEVGYLDPKQRFNQIAYLDKEKKKKKK------------KKKKKKKKKKKKKKKKMM 854
            Y   +  L   +      YL + KKKK++            +KK+K K++ KKKK K  
Sbjct: 207 LYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKES 266

Query: 855 KK 856
           K 
Sbjct: 267 KG 268


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 43.4 bits (102), Expect = 6e-04
 Identities = 28/158 (17%), Positives = 61/158 (38%), Gaps = 10/158 (6%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
             K +E+ KK  +  + K+K + +++          NEE            Q        
Sbjct: 200 ELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQ-------- 251

Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
            E + +    + + +   +    ++ E +       E+  L  ++   +   L  E++K 
Sbjct: 252 EEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKV 311

Query: 835 KKKKKKK--KKKKKKKKKKKMMKKGYPDVLSQMEKELA 870
             ++K K  +K+ KK +K+   +K   + L +  KEL 
Sbjct: 312 DDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELE 349



 Score = 40.3 bits (94), Expect = 0.005
 Identities = 29/178 (16%), Positives = 63/178 (35%), Gaps = 3/178 (1%)

Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPV 770
           +     +E EEE++++ +K ++K ++ +++   KKK +        +   +   ++    
Sbjct: 346 KELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEE 405

Query: 771 KEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKE 830
           KE     +++    D+  +  EK       E    +   T+ G L  ++   +   L   
Sbjct: 406 KEAKLLLELSEQEEDLLKE--EKKEELKIVEELEESL-ETKQGKLTEEKEELEKQALKLL 462

Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGG 888
           K K + KK +   K+ K  K     +         E             +     K G
Sbjct: 463 KDKLELKKSEDLLKETKLVKLLEQLELLLLRQKLEEASQKESKAREGLAVLLALIKDG 520



 Score = 37.3 bits (86), Expect = 0.046
 Identities = 27/149 (18%), Positives = 70/149 (46%), Gaps = 9/149 (6%)

Query: 717  KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
            +E EE+K K + + K++K+K++KK+ +++ +K +  EE   ++ +  + +   + +   E
Sbjct: 878  EELEEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESE 937

Query: 777  PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRF----NQIAYLDKEKK 832
            P+        + +  E N   +    +R+     E+G ++          +  Y   E K
Sbjct: 938  PEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELK 997

Query: 833  K-----KKKKKKKKKKKKKKKKKKKMMKK 856
            K     +KK+  ++  ++  ++ K+ ++ 
Sbjct: 998  KERLEEEKKELLREIIEETCQRFKEFLEL 1026



 Score = 36.1 bits (83), Expect = 0.11
 Identities = 30/156 (19%), Positives = 72/156 (46%), Gaps = 2/156 (1%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEP 777
            EEEEK +K K +++K++K K ++++ +  +   +EE      +   I+     +     
Sbjct: 778 AEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELE 837

Query: 778 QIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKK 837
           ++A  L + +    EK    +   ++        +  L  K+   +   L  E + K++K
Sbjct: 838 ELALELKEEQKL--EKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEK 895

Query: 838 KKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANIN 873
           +K++KK+ +++ +K  + +   + + +   E A I 
Sbjct: 896 EKEEKKELEEESQKDNLLEEKENEIEERIAEEAIIL 931



 Score = 34.2 bits (78), Expect = 0.38
 Identities = 32/150 (21%), Positives = 67/150 (44%), Gaps = 12/150 (8%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE----GMRKLRDAASIQCGP 769
              + E+EE+   +  K+ K+++K+KK ++++ K  + EEE     + KL          
Sbjct: 257 SKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEK 316

Query: 770 VKEVPCEPQIAPCLFDIKNDPCEKNNLA---DRSEVQRINHYTTEVGYLDPKQRFNQIAY 826
           +KE   E ++     ++K +  E   L       E++R      E      +++  Q   
Sbjct: 317 LKE--SEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQ--- 371

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           L++E   KKK + ++     K K++++  K
Sbjct: 372 LEEELLAKKKLESERLSSAAKLKEEELELK 401



 Score = 31.9 bits (72), Expect = 2.3
 Identities = 17/78 (21%), Positives = 39/78 (50%), Gaps = 3/78 (3%)

Query: 679  DIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKK 738
              + +  E N   +   ++R+     E+G  N +A  + EE+EE+  K + KK++ +++K
Sbjct: 946  ADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEK 1005

Query: 739  KKKKK---KKKKKYSNEE 753
            K+  +   ++  +   E 
Sbjct: 1006 KELLREIIEETCQRFKEF 1023



 Score = 29.9 bits (67), Expect = 6.9
 Identities = 9/48 (18%), Positives = 25/48 (52%)

Query: 716  DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
               E EEK+++  K + KK++ +++KK+  ++      +  ++  +  
Sbjct: 980  AIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELF 1027


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 42.8 bits (101), Expect = 7e-04
 Identities = 40/141 (28%), Positives = 55/141 (39%), Gaps = 33/141 (23%)

Query: 722 EKKKKKKKKKKKKKKKK-------KKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
           EK+K KK  + KKKK         K  K+    KY  E EG +K  D             
Sbjct: 175 EKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFT--------- 225

Query: 775 CEPQIAPCLFDIK---NDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEK 831
                     D     +D  EK     + ++ +IN+ + +    D         Y D  +
Sbjct: 226 ---------LDSGGSTDDEAEKKRQEVKKKL-KINNVSLD----DDSTETPASDYYDVSE 271

Query: 832 KKKKKKKKKKKKKKKKKKKKK 852
             K KK KKKKKKKKK++K  
Sbjct: 272 MVKFKKPKKKKKKKKKRRKDL 292



 Score = 37.0 bits (86), Expect = 0.047
 Identities = 20/90 (22%), Positives = 34/90 (37%)

Query: 698 RINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMR 757
           +IN+ + +       A    +  E  K KK KKKKKKKKK++K   + + +   E  G  
Sbjct: 248 KINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSS 307

Query: 758 KLRDAASIQCGPVKEVPCEPQIAPCLFDIK 787
                  ++    +      +      D  
Sbjct: 308 DSGSRKDVEEENARLEDSPKKRKEEQEDDD 337



 Score = 33.6 bits (77), Expect = 0.49
 Identities = 12/45 (26%), Positives = 21/45 (46%)

Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTA 876
           K+K KK  + KKKK         K     +LS+ ++E+    + +
Sbjct: 176 KEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKS 220



 Score = 32.4 bits (74), Expect = 1.1
 Identities = 22/63 (34%), Positives = 30/63 (47%), Gaps = 27/63 (42%)

Query: 719 EEEEKKKKKKKKKKK---------------------------KKKKKKKKKKKKKKKYSN 751
           +E EKK+++ KKK K                           KK KKKKKKKKK++K  +
Sbjct: 234 DEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLD 293

Query: 752 EEE 754
           E+E
Sbjct: 294 EDE 296



 Score = 30.1 bits (68), Expect = 6.1
 Identities = 9/46 (19%), Positives = 24/46 (52%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
           N     ++ E + K+++ ++K  K ++K+++  K    K   E++ 
Sbjct: 41  NWKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDD 86


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.  This
           presumed domain is found at the N-terminus of Ribosomal
           L30 proteins and has been termed RL30NT or NUC018.
          Length = 71

 Score = 38.4 bits (90), Expect = 8e-04
 Identities = 14/42 (33%), Positives = 22/42 (52%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
            E+   K+ KK+  KK  +KKK+K   K+ +KY  E     +
Sbjct: 9   NEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAER 50



 Score = 34.1 bits (79), Expect = 0.030
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            EK + K+ KK+  KK  +KKK+K++ K
Sbjct: 9   NEKLRAKRAKKRAAKKAARKKKRKLIFK 36



 Score = 32.2 bits (74), Expect = 0.15
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
           K+K+ +K + K+ KK+  KK  +KKK
Sbjct: 5   KRKRNEKLRAKRAKKRAAKKAARKKK 30



 Score = 31.8 bits (73), Expect = 0.17
 Identities = 10/29 (34%), Positives = 20/29 (68%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           +++ + K+ KK+  KK  +KKK+K + K+
Sbjct: 9   NEKLRAKRAKKRAAKKAARKKKRKLIFKR 37



 Score = 29.5 bits (67), Expect = 1.1
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
             KK+K+ +K + K+ KK+  KK  +
Sbjct: 2   LLKKRKRNEKLRAKRAKKRAAKKAAR 27



 Score = 28.3 bits (64), Expect = 3.5
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKK 856
           KK+K+ +K + K+ KK+  KK   KK
Sbjct: 4   KKRKRNEKLRAKRAKKRAAKKAARKK 29



 Score = 28.0 bits (63), Expect = 3.7
 Identities = 11/32 (34%), Positives = 18/32 (56%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
           L K K+ +K + K+ KK+  KK  +K  +K  
Sbjct: 3   LKKRKRNEKLRAKRAKKRAAKKAARKKKRKLI 34


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 42.8 bits (101), Expect = 8e-04
 Identities = 22/56 (39%), Positives = 37/56 (66%), Gaps = 10/56 (17%)

Query: 715 PDKEEEEEKKKKKKKKKKK----------KKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           PD+EEEEEK++KK++++K           ++K++KKKK KK K+ + E E + K +
Sbjct: 36  PDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTK 91



 Score = 30.5 bits (69), Expect = 4.5
 Identities = 11/32 (34%), Positives = 21/32 (65%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYP 859
             E+++K++KKKK KK K+   + +++ K  P
Sbjct: 61  VDEEEEKEEKKKKTKKVKETTTEWELLNKTKP 92


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 42.6 bits (101), Expect = 9e-04
 Identities = 18/54 (33%), Positives = 31/54 (57%)

Query: 705 EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           E+          K+ ++  +K +KK++++KK+KKKK    KKK+   EEE  +K
Sbjct: 398 EIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451



 Score = 40.3 bits (95), Expect = 0.004
 Identities = 16/38 (42%), Positives = 30/38 (78%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           +++ EE+KK+KKKK    KKK+++++++K+KK   +EE
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457



 Score = 39.9 bits (94), Expect = 0.005
 Identities = 19/38 (50%), Positives = 30/38 (78%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           K+ EEEKK+KKKK    KKK+++++++K+KK+   EEE
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458



 Score = 39.9 bits (94), Expect = 0.005
 Identities = 16/46 (34%), Positives = 34/46 (73%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           +EEE+++KKKK    KKK+++++++K+KK+++   EEE   + ++ 
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468



 Score = 39.9 bits (94), Expect = 0.006
 Identities = 18/48 (37%), Positives = 31/48 (64%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           K EEE+K+KKKK    KKK+++++++K+KK++   EEE   +      
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469



 Score = 39.9 bits (94), Expect = 0.006
 Identities = 15/40 (37%), Positives = 29/40 (72%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
             E++ E++KK+KKKK    KKK+++++++K+K   E+E 
Sbjct: 418 KAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457



 Score = 39.5 bits (93), Expect = 0.008
 Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 4/50 (8%)

Query: 716 DKEEEE----EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
            +EE E     KK  KK KK  +K +KK++++KK+KK        ++  +
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEE 444



 Score = 39.1 bits (92), Expect = 0.009
 Identities = 14/45 (31%), Positives = 31/45 (68%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
             E+ EKK++++KK+KKKK    KKK++++++   ++E  ++  +
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459



 Score = 37.6 bits (88), Expect = 0.031
 Identities = 14/39 (35%), Positives = 30/39 (76%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           +K E++ +++KK+KKKK    KKK+++++++K+   EE+
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455



 Score = 36.8 bits (86), Expect = 0.049
 Identities = 14/39 (35%), Positives = 31/39 (79%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           ++E++E+KKK    KKK+++++++K+KK+++K+   EE 
Sbjct: 424 EEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462



 Score = 36.4 bits (85), Expect = 0.062
 Identities = 15/64 (23%), Positives = 29/64 (45%), Gaps = 1/64 (1%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGG 888
           K+  +K +KK++++KK+KKKK     KK   +   + +KE             +  ++  
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE-EEKEEEEEEAEEEKEEEEE 471

Query: 889 DPKN 892
             K 
Sbjct: 472 KKKK 475



 Score = 35.7 bits (83), Expect = 0.10
 Identities = 12/40 (30%), Positives = 31/40 (77%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
            ++++E++KK    KKK+++++++K+KK+++K++   E E
Sbjct: 424 EEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463



 Score = 35.7 bits (83), Expect = 0.10
 Identities = 15/45 (33%), Positives = 28/45 (62%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
            +I +    ++  KK KK  +K +KK++++KK+KKKK     ++E
Sbjct: 397 EEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKE 441



 Score = 34.9 bits (81), Expect = 0.21
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 823 QIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
           +      EKK++++KK+KKKK    KKK++
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEE 442



 Score = 34.9 bits (81), Expect = 0.21
 Identities = 14/42 (33%), Positives = 25/42 (59%)

Query: 811 EVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
           E+ +L   ++  +      EK +KK++++KK+KKKK    KK
Sbjct: 398 EIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKK 439



 Score = 34.9 bits (81), Expect = 0.22
 Identities = 14/36 (38%), Positives = 22/36 (61%)

Query: 822 NQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
            +I +L   KK  KK KK  +K +KK++++K  KK 
Sbjct: 397 EEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKK 432



 Score = 34.1 bits (79), Expect = 0.33
 Identities = 12/33 (36%), Positives = 32/33 (96%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           ++EEE+EKK+++K++++++ +++K+++++KKKK
Sbjct: 443 EEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 33.4 bits (77), Expect = 0.65
 Identities = 11/34 (32%), Positives = 31/34 (91%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
             KEEEEE++K+KK+++K++++++ +++K+++++
Sbjct: 438 KKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471



 Score = 33.0 bits (76), Expect = 0.80
 Identities = 12/30 (40%), Positives = 23/30 (76%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            ++EKK+KKKK    KKK+++++++K  K+
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452



 Score = 33.0 bits (76), Expect = 0.82
 Identities = 12/33 (36%), Positives = 32/33 (96%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           ++EEEEEK+KK+++K++++++ +++K+++++KK
Sbjct: 441 EEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473



 Score = 32.6 bits (75), Expect = 1.1
 Identities = 15/41 (36%), Positives = 25/41 (60%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
            K+ KK  +K +KK++++KK+KKKK       +   + EKE
Sbjct: 409 TKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE 449



 Score = 32.2 bits (74), Expect = 1.2
 Identities = 11/34 (32%), Positives = 32/34 (94%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            ++EEEE++KK+++K++++++ +++K+++++KKK
Sbjct: 441 EEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKK 474



 Score = 31.4 bits (72), Expect = 2.1
 Identities = 10/30 (33%), Positives = 28/30 (93%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
            +KE++EE+K++++++ +++K+++++KKKK
Sbjct: 446 EEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 30.7 bits (70), Expect = 4.5
 Identities = 11/41 (26%), Positives = 28/41 (68%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
           +++K+KKKK    KKK+++++++K+  ++   +   + E+E
Sbjct: 425 EEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 42.6 bits (100), Expect = 0.001
 Identities = 39/186 (20%), Positives = 61/186 (32%), Gaps = 41/186 (22%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD--------------AA 763
            E+ +++K KKKKKK  K KKK    KK K+  +++       D              +A
Sbjct: 233 GEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSA 292

Query: 764 SIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQ 823
           S      +E    P+I       +++  E++      E                      
Sbjct: 293 SGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEE---------------------- 330

Query: 824 IAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKP 883
                +    KK KK KK K KK    K     G     S ++ E +    TA       
Sbjct: 331 -----EGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQKEPK 385

Query: 884 FDKGGD 889
            ++  D
Sbjct: 386 KEEPVD 391



 Score = 37.6 bits (87), Expect = 0.030
 Identities = 15/40 (37%), Positives = 21/40 (52%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
            +E EEEK +++    KK KK KK K KK      + + G
Sbjct: 320 SEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSG 359



 Score = 36.1 bits (83), Expect = 0.078
 Identities = 18/67 (26%), Positives = 32/67 (47%), Gaps = 4/67 (5%)

Query: 692 DRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKK----KKKKKKKKKKKKKKKKKK 747
               ++R +  + E+    +I   +  EE E++K +++    KK KK KK K KK    K
Sbjct: 294 GNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDK 353

Query: 748 KYSNEEE 754
             S+  +
Sbjct: 354 DDSDSGD 360


>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein.  The PcfK-like protein family
           includes the Enterococcus faecalis PcfK protein, which
           is functionally uncharacterized. This family of proteins
           is found in bacteria and viruses. Proteins in this
           family are typically between 137 and 257 amino acids in
           length. There are two completely conserved residues (D
           and L) that may be functionally important.
          Length = 136

 Score = 39.7 bits (93), Expect = 0.001
 Identities = 12/47 (25%), Positives = 25/47 (53%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
           N      +EE+ E +K+  K  ++++ +K +K+ KK KK    +  +
Sbjct: 85  NHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPVQGQL 131



 Score = 31.6 bits (72), Expect = 0.75
 Identities = 16/77 (20%), Positives = 29/77 (37%), Gaps = 13/77 (16%)

Query: 714 YPDKEEEEEKKKKK----KKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGP 769
           Y + + +  K            +  +++K + +K+  K Y  EE  +RK++         
Sbjct: 67  YDEDDIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEE--LRKIQKR------S 118

Query: 770 VKEVPCEPQIA-PCLFD 785
            K    EP      LFD
Sbjct: 119 KKSKKAEPVQGQLSLFD 135


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
           non-specific RNA binding region part 2.  This is a
           region found N terminal to the catalytic domain of
           glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
           but not in Escherichia coli. This region is thought to
           bind RNA in a non-specific manner, enhancing
           interactions between the tRNA and enzyme, but is not
           essential for enzyme function.
          Length = 83

 Score = 38.5 bits (90), Expect = 0.001
 Identities = 17/45 (37%), Positives = 20/45 (44%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANI 872
           D  KKKKKKKKKK +      K KK   +   +    ME  L   
Sbjct: 21  DLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEGFLGEK 65



 Score = 38.1 bits (89), Expect = 0.001
 Identities = 17/48 (35%), Positives = 21/48 (43%), Gaps = 4/48 (8%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK----KKYSNEEEGMRK 758
           P  E +  KKKKKKKKKK +      K KK       + +   EG   
Sbjct: 16  PKTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEGFLG 63



 Score = 31.6 bits (72), Expect = 0.25
 Identities = 11/25 (44%), Positives = 14/25 (56%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
           + +  KKKKKKKKKK +      K 
Sbjct: 19  EADLVKKKKKKKKKKAEDTAATAKA 43



 Score = 31.2 bits (71), Expect = 0.39
 Identities = 13/27 (48%), Positives = 15/27 (55%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKM 853
           L K+KKKKKKKK +      K KK   
Sbjct: 22  LVKKKKKKKKKKAEDTAATAKAKKATA 48



 Score = 30.8 bits (70), Expect = 0.47
 Identities = 11/25 (44%), Positives = 13/25 (52%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
            +    KKKKKKKKKK +      K
Sbjct: 18  TEADLVKKKKKKKKKKAEDTAATAK 42



 Score = 30.4 bits (69), Expect = 0.79
 Identities = 20/64 (31%), Positives = 27/64 (42%), Gaps = 5/64 (7%)

Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEV---PCE--PQI 779
           K +    KKKKKKKKKK +        ++     + + A    G + E    P E  PQ 
Sbjct: 17  KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEGFLGEKFHKPGENYPQT 76

Query: 780 APCL 783
            P L
Sbjct: 77  YPLL 80



 Score = 27.3 bits (61), Expect = 8.4
 Identities = 12/26 (46%), Positives = 14/26 (53%)

Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKK 856
           K +    KKKKKKKKKK +      K
Sbjct: 17  KTEADLVKKKKKKKKKKAEDTAATAK 42


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 42.3 bits (100), Expect = 0.001
 Identities = 35/176 (19%), Positives = 59/176 (33%), Gaps = 9/176 (5%)

Query: 717  KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
            KE  +E++ K K K K  K +K K KKK+KKK  +  +  +K     + +     E    
Sbjct: 1150 KEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKR-- 1207

Query: 777  PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
                    D K D  + N+     E         +   +   +     +    E   +  
Sbjct: 1208 ------KLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFS 1261

Query: 837  KKKKKKKKKKKKKKKKMMKKGY-PDVLSQMEKELANINRTAVAPINKPFDKGGDPK 891
                 K+ K K   K++    Y P   S+     +N      +P  K   K  +  
Sbjct: 1262 SDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGS 1317



 Score = 34.3 bits (79), Expect = 0.38
 Identities = 37/192 (19%), Positives = 56/192 (29%), Gaps = 27/192 (14%)

Query: 715  PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA----------- 763
              K  +  K K KKK+KKKKK    K KK      S   +   K +              
Sbjct: 1163 KGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSG 1222

Query: 764  --SIQCGPVKEVPCEPQIAPCLFDIKN--------DPCEKNNLADRSEVQRINHYTTEVG 813
                     K  P +  +        N        D    ++L+   + +      + V 
Sbjct: 1223 SDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQ 1282

Query: 814  YLDPK------QRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
            Y  P          N  +      KKK KK+ +      KKKKK   K           K
Sbjct: 1283 YSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVK 1342

Query: 868  ELANINRTAVAP 879
            + +    + +  
Sbjct: 1343 QASASQSSRLLR 1354



 Score = 31.9 bits (73), Expect = 2.1
 Identities = 17/64 (26%), Positives = 26/64 (40%)

Query: 829  KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGG 888
            KE++ K K K K  K +K K KKK  KK         +  +   ++   +   +  D   
Sbjct: 1154 KEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKP 1213

Query: 889  DPKN 892
            D K 
Sbjct: 1214 DNKK 1217



 Score = 29.6 bits (67), Expect = 9.8
 Identities = 16/46 (34%), Positives = 21/46 (45%)

Query: 715  PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
              K  E      KKKKK +KK  +KKK K + K+ S  +      R
Sbjct: 1310 VKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRR 1355


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 41.3 bits (97), Expect = 0.001
 Identities = 32/172 (18%), Positives = 51/172 (29%), Gaps = 25/172 (14%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEP 777
           E+E +K+ K  K  +  ++ KK KK          +E   K +     Q         E 
Sbjct: 98  EKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSP---ES 154

Query: 778 QIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKK 837
             +       +      +L    E                  R  + A           +
Sbjct: 155 SPSS---TPNSSRPSTPHLLKAKEGPS---------------RRAKKAAKLSSTASSGDE 196

Query: 838 KKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGD 889
           K  K K   KK  KKM K      L   E + A ++ +A    ++  D   D
Sbjct: 197 KSPKSKAAPKKAGKKMRKWD----LDGDEDDDAVLDYSAPDANDENADAPED 244



 Score = 29.7 bits (67), Expect = 5.8
 Identities = 8/39 (20%), Positives = 12/39 (30%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
                P +  ++  K         +K  K K   KK  K
Sbjct: 172 KAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGK 210


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 40.8 bits (96), Expect = 0.002
 Identities = 17/34 (50%), Positives = 23/34 (67%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           P +EE E  K + ++++ KKKKKKKKKK KK   
Sbjct: 164 PTEEEVELLKARLEEERAKKKKKKKKKKTKKNNA 197



 Score = 40.0 bits (94), Expect = 0.003
 Identities = 16/38 (42%), Positives = 23/38 (60%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
           EEE +  K + ++++ KKKKKKKKKK KK  +      
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAE 203



 Score = 38.9 bits (91), Expect = 0.007
 Identities = 14/48 (29%), Positives = 19/48 (39%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           +EE  +KKKKKKKKK KK        +         E      +   +
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEA 224



 Score = 35.4 bits (82), Expect = 0.11
 Identities = 12/40 (30%), Positives = 17/40 (42%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
           ++E  ++KKKKKKKK KK        +         E   
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSS 216



 Score = 33.9 bits (78), Expect = 0.32
 Identities = 14/30 (46%), Positives = 17/30 (56%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           L++E+ KKKKKKKKKK KK           
Sbjct: 176 LEEERAKKKKKKKKKKTKKNNATGSSAEAT 205



 Score = 32.3 bits (74), Expect = 0.88
 Identities = 10/43 (23%), Positives = 18/43 (41%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           ++  +++KKKKKKK KK        +        +    G  +
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQ 220



 Score = 31.9 bits (73), Expect = 1.1
 Identities = 15/32 (46%), Positives = 20/32 (62%), Gaps = 2/32 (6%)

Query: 828 DKEKKKKK--KKKKKKKKKKKKKKKKKMMKKG 857
           + E  K +  +++ KKKKKKKKKK KK    G
Sbjct: 168 EVELLKARLEEERAKKKKKKKKKKTKKNNATG 199



 Score = 31.2 bits (71), Expect = 2.0
 Identities = 17/57 (29%), Positives = 26/57 (45%)

Query: 823 QIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAP 879
           ++  L    ++++ KKKKKKKKKK KK           V S +  EL++        
Sbjct: 168 EVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEA 224



 Score = 31.2 bits (71), Expect = 2.3
 Identities = 18/71 (25%), Positives = 26/71 (36%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGD 889
           E+++ KKKKKKKKKK KK        +         E             + K      D
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAPD 236

Query: 890 PKNFDHAWSIF 900
            +  +   S+F
Sbjct: 237 NEKSEVYKSLF 247



 Score = 29.2 bits (66), Expect = 9.6
 Identities = 13/29 (44%), Positives = 21/29 (72%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           ++E +  K + ++++ KKKKKKKKK  KK
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKK 194


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 38.9 bits (91), Expect = 0.002
 Identities = 14/47 (29%), Positives = 25/47 (53%), Gaps = 5/47 (10%)

Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKK-----KKKKKKKKKKKYS 750
             Q+  P ++ +E++K  +  KKK+KK +       KK   KK++ S
Sbjct: 58  SQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRS 104



 Score = 29.3 bits (66), Expect = 3.4
 Identities = 8/35 (22%), Positives = 18/35 (51%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
           ++   + +   K+K+KK +    +  KK   KK++
Sbjct: 68  RKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRR 102



 Score = 28.9 bits (65), Expect = 5.3
 Identities = 11/49 (22%), Positives = 20/49 (40%), Gaps = 14/49 (28%)

Query: 818 KQRFNQIAYLDKEKK---------KKKKKKKK-----KKKKKKKKKKKK 852
                Q+    +++K         KKK+KK +       KK   KK+++
Sbjct: 55  IHLSQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRR 103


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 39.2 bits (92), Expect = 0.002
 Identities = 10/30 (33%), Positives = 23/30 (76%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
           +  +E  +K+K+ +K ++KK K+++K+K+K
Sbjct: 101 ERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 38.8 bits (91), Expect = 0.002
 Identities = 11/31 (35%), Positives = 25/31 (80%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
           +E  +E+ +K+K+ +K ++KK K+++K+K+K
Sbjct: 100 RERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 37.6 bits (88), Expect = 0.005
 Identities = 10/32 (31%), Positives = 25/32 (78%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           + E  +++ +K+K+ +K ++KK K+++K+K+K
Sbjct: 99  RRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 37.3 bits (87), Expect = 0.008
 Identities = 10/32 (31%), Positives = 23/32 (71%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           +   E  K++ +K+K+ +K ++KK K+++K+K
Sbjct: 97  RLRRERTKERAEKEKRTRKNREKKFKRRQKEK 128



 Score = 36.1 bits (84), Expect = 0.017
 Identities = 13/45 (28%), Positives = 27/45 (60%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           +  K E+E  ++K+KKKK+K+  KK++   + +++ + E     K
Sbjct: 67  HNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEK 111



 Score = 34.2 bits (79), Expect = 0.079
 Identities = 9/43 (20%), Positives = 26/43 (60%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
            +E+E++KK+K+  KK++   + ++++ K++ +        R+
Sbjct: 76  LREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNRE 118



 Score = 33.8 bits (78), Expect = 0.099
 Identities = 9/34 (26%), Positives = 22/34 (64%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKK 851
           K+R       ++ K++ +K+K+ +K ++KK K++
Sbjct: 91  KRRIALRLRRERTKERAEKEKRTRKNREKKFKRR 124



 Score = 33.8 bits (78), Expect = 0.13
 Identities = 6/29 (20%), Positives = 21/29 (72%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
           + ++++ K++ +K+K+ +K ++KK  ++ 
Sbjct: 97  RLRRERTKERAEKEKRTRKNREKKFKRRQ 125



 Score = 33.4 bits (77), Expect = 0.17
 Identities = 10/24 (41%), Positives = 20/24 (83%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
            EK+K+ +K ++KK K+++K+K+K
Sbjct: 107 AEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 33.4 bits (77), Expect = 0.18
 Identities = 6/28 (21%), Positives = 20/28 (71%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
              + ++++ K++ +K+K+ +K ++KK+
Sbjct: 94  IALRLRRERTKERAEKEKRTRKNREKKF 121



 Score = 33.0 bits (76), Expect = 0.22
 Identities = 10/23 (43%), Positives = 20/23 (86%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKK 850
           +KEK+ +K ++KK K+++K+K+K
Sbjct: 108 EKEKRTRKNREKKFKRRQKEKEK 130



 Score = 32.6 bits (75), Expect = 0.27
 Identities = 8/24 (33%), Positives = 19/24 (79%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
           +  +K+K+ +K ++KK K+++K+K
Sbjct: 105 ERAEKEKRTRKNREKKFKRRQKEK 128



 Score = 32.6 bits (75), Expect = 0.31
 Identities = 9/24 (37%), Positives = 20/24 (83%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
           KE+ +K+K+ +K ++KK K+++K+
Sbjct: 104 KERAEKEKRTRKNREKKFKRRQKE 127



 Score = 32.6 bits (75), Expect = 0.33
 Identities = 9/32 (28%), Positives = 23/32 (71%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
            ++++ K++ +K+K+ +K ++KK K+   E+E
Sbjct: 98  LRRERTKERAEKEKRTRKNREKKFKRRQKEKE 129



 Score = 32.2 bits (74), Expect = 0.38
 Identities = 8/33 (24%), Positives = 22/33 (66%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
           P K+     + ++++ K++ +K+K+ +K ++KK
Sbjct: 88  PGKKRRIALRLRRERTKERAEKEKRTRKNREKK 120



 Score = 31.5 bits (72), Expect = 0.78
 Identities = 5/25 (20%), Positives = 17/25 (68%)

Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKK 856
             + ++++ K++ +K+K+ +K  +K
Sbjct: 95  ALRLRRERTKERAEKEKRTRKNREK 119



 Score = 30.7 bits (70), Expect = 1.5
 Identities = 13/45 (28%), Positives = 27/45 (60%), Gaps = 8/45 (17%)

Query: 719 EEEEKKKKKKKKKKK---KKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           E+E+KKK+K+  KK+    + ++++ K++ +K+        RK R
Sbjct: 78  EKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRT-----RKNR 117



 Score = 30.3 bits (69), Expect = 2.1
 Identities = 8/24 (33%), Positives = 20/24 (83%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKK 851
            +++K+ +K ++KK K+++K+K+K
Sbjct: 107 AEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 28.8 bits (65), Expect = 5.5
 Identities = 6/28 (21%), Positives = 18/28 (64%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           +   + ++++ K++ +K+K+ +K   KK
Sbjct: 93  RIALRLRRERTKERAEKEKRTRKNREKK 120


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 38.7 bits (90), Expect = 0.002
 Identities = 31/131 (23%), Positives = 60/131 (45%), Gaps = 15/131 (11%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPC 782
            ++K +  K+KK   K+  + KK     +    M+ ++D A+++  P +EV  +      
Sbjct: 7   WRRKMRAIKRKKNAPKELARLKKALGLIDGNVLMKDVKDIATVK--PAEEVKEK------ 58

Query: 783 LFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQI-AYLDKEKKKKKKKKKKK 841
               K +  E     +  E   +++          + +  Q  A++++ + KK KKK++K
Sbjct: 59  ---KKKEGTES---EEDDEKMEVDNKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKREK 112

Query: 842 KKKKKKKKKKK 852
           KKKKKK  K  
Sbjct: 113 KKKKKKGAKNL 123



 Score = 32.1 bits (73), Expect = 0.36
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
            +++ KK KKK++KKKKKKK  K   
Sbjct: 99  NQRQAKKLKKKREKKKKKKKGAKNLA 124



 Score = 29.8 bits (67), Expect = 2.8
 Identities = 15/31 (48%), Positives = 19/31 (61%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
           YP    + + KK KKK++KKKKKKK  K   
Sbjct: 94  YPAWMNQRQAKKLKKKREKKKKKKKGAKNLA 124


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 38.7 bits (90), Expect = 0.002
 Identities = 25/64 (39%), Positives = 28/64 (43%), Gaps = 3/64 (4%)

Query: 683 DPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKK 742
           + C  N   D        H TT  G  +    PD    E  +K KKKKKK KK KK KK 
Sbjct: 60  ETCFYNTGNDGVCQNGECHLTTSSGEPSH---PDDHPPEPTEKPKKKKKKSKKTKKPKKS 116

Query: 743 KKKK 746
            KK 
Sbjct: 117 SKKD 120



 Score = 35.6 bits (82), Expect = 0.025
 Identities = 16/27 (59%), Positives = 17/27 (62%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKK 747
            E  +K KKKKKK KK KK KK  KK 
Sbjct: 94  PEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 35.6 bits (82), Expect = 0.026
 Identities = 16/23 (69%), Positives = 16/23 (69%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKK 852
           EK KKKKKK KK KK KK  KK 
Sbjct: 98  EKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 34.9 bits (80), Expect = 0.044
 Identities = 15/23 (65%), Positives = 16/23 (69%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKK 850
           +K KKKKKK KK KK KK  KK 
Sbjct: 98  EKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 32.2 bits (73), Expect = 0.34
 Identities = 14/27 (51%), Positives = 17/27 (62%)

Query: 826 YLDKEKKKKKKKKKKKKKKKKKKKKKK 852
           +  +  +K KKKKKK KK KK KK  K
Sbjct: 92  HPPEPTEKPKKKKKKSKKTKKPKKSSK 118



 Score = 31.4 bits (71), Expect = 0.64
 Identities = 14/20 (70%), Positives = 14/20 (70%)

Query: 729 KKKKKKKKKKKKKKKKKKKK 748
           K KKKKKK KK KK KK  K
Sbjct: 99  KPKKKKKKSKKTKKPKKSSK 118



 Score = 30.6 bits (69), Expect = 1.1
 Identities = 14/22 (63%), Positives = 16/22 (72%)

Query: 732 KKKKKKKKKKKKKKKKKYSNEE 753
           K KKKKKK KK KK KK S ++
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 29.8 bits (67), Expect = 2.3
 Identities = 14/22 (63%), Positives = 14/22 (63%)

Query: 835 KKKKKKKKKKKKKKKKKKMMKK 856
           K KKKKKK KK KK KK   K 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 28.7 bits (64), Expect = 5.5
 Identities = 13/22 (59%), Positives = 17/22 (77%)

Query: 733 KKKKKKKKKKKKKKKKYSNEEE 754
           K KKKKKK KK KK K S++++
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 41.0 bits (97), Expect = 0.002
 Identities = 11/31 (35%), Positives = 19/31 (61%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           ++E K   K +KKK+K++ K  K K K + +
Sbjct: 63  KKELKAWHKAQKKKEKQEAKAAKAKSKPRLF 93



 Score = 40.2 bits (95), Expect = 0.003
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 823 QIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           + A LDK++ K   K +KKK+K++ K  K   K 
Sbjct: 57  KAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKP 90



 Score = 37.5 bits (88), Expect = 0.027
 Identities = 12/30 (40%), Positives = 19/30 (63%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
           DK+E +   K +KKK+K++ K  K K K +
Sbjct: 62  DKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 37.1 bits (87), Expect = 0.032
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           K    +KK+ K   K +KKK+K++ K  K K 
Sbjct: 57  KAALLDKKELKAWHKAQKKKEKQEAKAAKAKS 88



 Score = 34.8 bits (81), Expect = 0.17
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKK 851
             K +KKK+K++ K  K K K +
Sbjct: 69  WHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 34.4 bits (80), Expect = 0.21
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKK 849
             K +KKK+K++ K  K K K +
Sbjct: 69  WHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 34.0 bits (79), Expect = 0.28
 Identities = 12/34 (35%), Positives = 19/34 (55%), Gaps = 3/34 (8%)

Query: 815 LDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKK 848
           LD K+     A+   +KKK+K++ K  K K K +
Sbjct: 61  LDKKE---LKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 34.0 bits (79), Expect = 0.35
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
             KE +   K +KKK+K++ K  K K K +
Sbjct: 62  DKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 32.9 bits (76), Expect = 0.64
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           KEE +     KK+ K   K +KKK+K++ K  
Sbjct: 53  KEELKAALLDKKELKAWHKAQKKKEKQEAKAA 84



 Score = 31.7 bits (73), Expect = 1.6
 Identities = 12/41 (29%), Positives = 22/41 (53%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           Y + +EE +     KK+ K   K +KKK+K++ K    + +
Sbjct: 49  YKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89



 Score = 31.0 bits (71), Expect = 2.6
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKKGYP 859
           KK+ K   K +KKK+K++ K    K  P
Sbjct: 63  KKELKAWHKAQKKKEKQEAKAAKAKSKP 90



 Score = 29.4 bits (67), Expect = 8.3
 Identities = 14/43 (32%), Positives = 24/43 (55%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
           KE +EE K     KK+ K   K +KKK+K++  + + +   +L
Sbjct: 50  KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPRL 92


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region covers
           both the Brf homology II and III regions. This region is
           involved in binding TATA binding protein.
          Length = 95

 Score = 36.9 bits (86), Expect = 0.005
 Identities = 16/31 (51%), Positives = 22/31 (70%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           EE+EEK+ K+K  +     KKKKK+K KKK+
Sbjct: 30  EEQEEKELKQKADEGNNSGKKKKKRKAKKKR 60



 Score = 36.9 bits (86), Expect = 0.005
 Identities = 15/34 (44%), Positives = 21/34 (61%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
           EE+E+K+ K+K  +     KKKKK+K KKK    
Sbjct: 30  EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEA 63



 Score = 35.3 bits (82), Expect = 0.020
 Identities = 14/37 (37%), Positives = 22/37 (59%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
           ++EE+E K+K  +     KKKKK+K KKK+ +     
Sbjct: 31  EQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPAS 67



 Score = 34.9 bits (81), Expect = 0.026
 Identities = 14/34 (41%), Positives = 22/34 (64%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            ++EE+E K+K  +     KKKKK+K KKK+ + 
Sbjct: 30  EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEA 63



 Score = 34.5 bits (80), Expect = 0.036
 Identities = 14/34 (41%), Positives = 22/34 (64%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
           EE+++K+ K+K  +     KKKKK+K KK  +E 
Sbjct: 30  EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEA 63



 Score = 33.4 bits (77), Expect = 0.081
 Identities = 15/39 (38%), Positives = 24/39 (61%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
           K+  EE+++K+ K+K  +     KKKKK+K K   +E G
Sbjct: 26  KDYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64



 Score = 30.3 bits (69), Expect = 0.92
 Identities = 13/44 (29%), Positives = 23/44 (52%), Gaps = 3/44 (6%)

Query: 826 YLDKEKKKKKKKKK---KKKKKKKKKKKKKMMKKGYPDVLSQME 866
           YL+++++K+ K+K        KKKKK+K K  +       +  E
Sbjct: 28  YLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAE 71



 Score = 29.5 bits (67), Expect = 1.8
 Identities = 16/53 (30%), Positives = 25/53 (47%), Gaps = 9/53 (16%)

Query: 813 GYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKK---------KKMMKK 856
            YL+ ++        D+     KKKKK+K KKK+ +           KKM++K
Sbjct: 27  DYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAEAVKKMLQK 79


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 37.6 bits (88), Expect = 0.005
 Identities = 16/38 (42%), Positives = 25/38 (65%), Gaps = 6/38 (15%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKK------KKKKKK 748
           K E E  K++KKK+ K+KK+++K++K      KKK K 
Sbjct: 92  KLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKH 129



 Score = 36.1 bits (84), Expect = 0.022
 Identities = 12/30 (40%), Positives = 26/30 (86%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           L+ E+ K++KKK+ K+KK+++K++K+ +K+
Sbjct: 93  LEHERNKQEKKKRSKEKKEEEKERKRQLKQ 122



 Score = 32.6 bits (75), Expect = 0.35
 Identities = 11/24 (45%), Positives = 21/24 (87%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
           KEKK+++K++K++ K++KKK K +
Sbjct: 107 KEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 31.5 bits (72), Expect = 0.70
 Identities = 10/27 (37%), Positives = 22/27 (81%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           K + ++ K++KKK+ K+KK+++K++K 
Sbjct: 92  KLEHERNKQEKKKRSKEKKEEEKERKR 118



 Score = 31.1 bits (71), Expect = 1.2
 Identities = 14/32 (43%), Positives = 25/32 (78%), Gaps = 3/32 (9%)

Query: 829 KEKKKKKKKKKKKKKKKK---KKKKKKMMKKG 857
           +EKKK+ K+KK+++K++K   K++KKK   +G
Sbjct: 100 QEKKKRSKEKKEEEKERKRQLKQQKKKAKHRG 131



 Score = 30.7 bits (70), Expect = 1.2
 Identities = 11/32 (34%), Positives = 22/32 (68%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKK 849
           KQ   + +   KE++K++K++ K++KKK K +
Sbjct: 99  KQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 30.3 bits (69), Expect = 1.7
 Identities = 11/22 (50%), Positives = 18/22 (81%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKK 737
            KEEE+E+K++ K++KKK K +
Sbjct: 109 KKEEEKERKRQLKQQKKKAKHR 130



 Score = 30.3 bits (69), Expect = 1.7
 Identities = 11/24 (45%), Positives = 21/24 (87%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKK 740
           KE++EE+K++K++ K++KKK K +
Sbjct: 107 KEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 30.3 bits (69), Expect = 1.9
 Identities = 12/28 (42%), Positives = 24/28 (85%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           K++KKK+ K+KK+++K++K++ K+  KK
Sbjct: 99  KQEKKKRSKEKKEEEKERKRQLKQQKKK 126



 Score = 30.3 bits (69), Expect = 1.9
 Identities = 11/28 (39%), Positives = 23/28 (82%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            +++KKK+ K+KK+++K++K++ K  KK
Sbjct: 98  NKQEKKKRSKEKKEEEKERKRQLKQQKK 125



 Score = 29.1 bits (66), Expect = 5.2
 Identities = 10/23 (43%), Positives = 20/23 (86%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKK 738
           +K+EEE+++K++ K++KKK K +
Sbjct: 108 EKKEEEKERKRQLKQQKKKAKHR 130


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 38.0 bits (89), Expect = 0.006
 Identities = 9/33 (27%), Positives = 22/33 (66%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
              EEE+  +  +K+  K K++K+++ ++K+K+
Sbjct: 113 ELDEEEQIDELLEKELAKLKREKRRENERKQKE 145



 Score = 36.9 bits (86), Expect = 0.014
 Identities = 10/44 (22%), Positives = 27/44 (61%)

Query: 705 EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           E     ++   + +EEE+  +  +K+  K K++K+++ ++K+K+
Sbjct: 102 EEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145



 Score = 36.5 bits (85), Expect = 0.020
 Identities = 12/47 (25%), Positives = 30/47 (63%), Gaps = 6/47 (12%)

Query: 809 TTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMK 855
             EV  LD +++      +D+  +K+  K K++K+++ ++K+K+++K
Sbjct: 108 EVEVEELDEEEQ------IDELLEKELAKLKREKRRENERKQKEILK 148



 Score = 35.3 bits (82), Expect = 0.044
 Identities = 10/32 (31%), Positives = 23/32 (71%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
           D+  E+E  K K++K+++ ++K+K+  K++ K
Sbjct: 121 DELLEKELAKLKREKRRENERKQKEILKEQMK 152



 Score = 35.3 bits (82), Expect = 0.046
 Identities = 9/33 (27%), Positives = 24/33 (72%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           ++E+ +E  +K+  K K++K+++ ++K+K+  K
Sbjct: 116 EEEQIDELLEKELAKLKREKRRENERKQKEILK 148



 Score = 33.0 bits (76), Expect = 0.25
 Identities = 8/33 (24%), Positives = 24/33 (72%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           ++ +E  +K+  K K++K+++ ++K+K+  K++
Sbjct: 118 EQIDELLEKELAKLKREKRRENERKQKEILKEQ 150



 Score = 29.6 bits (67), Expect = 4.1
 Identities = 13/51 (25%), Positives = 29/51 (56%), Gaps = 4/51 (7%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
           ++EEE E ++  ++++  +  +K+  K K++K+  NE    RK ++    Q
Sbjct: 104 EEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENE----RKQKEILKEQ 150


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 38.8 bits (91), Expect = 0.007
 Identities = 12/53 (22%), Positives = 26/53 (49%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCG 768
             ++ E   +K+K  ++++K K K  +KK+K +    E     + D + +  G
Sbjct: 118 QAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVG 170



 Score = 36.4 bits (85), Expect = 0.039
 Identities = 12/41 (29%), Positives = 23/41 (56%)

Query: 708 RFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           R  Q A   +   E++K  ++++K K K  +KK+K + +K 
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154



 Score = 34.5 bits (80), Expect = 0.16
 Identities = 8/32 (25%), Positives = 20/32 (62%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           +  E++ KK++   +K+K  ++++K K K  +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPR 144



 Score = 34.1 bits (79), Expect = 0.19
 Identities = 10/34 (29%), Positives = 21/34 (61%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            +   E+EK  ++++K K K  +KK+K + +K +
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155



 Score = 33.7 bits (78), Expect = 0.29
 Identities = 11/43 (25%), Positives = 20/43 (46%)

Query: 819 QRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDV 861
           QR  Q A   +   +K+K  ++++K K K  +KK   +     
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155



 Score = 33.4 bits (77), Expect = 0.37
 Identities = 9/50 (18%), Positives = 27/50 (54%), Gaps = 3/50 (6%)

Query: 717 KEEEEEKKKKKKKKKKK---KKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
           K + + ++ +++ KK++   +K+K  ++++K K K   ++   R  +   
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEP 156



 Score = 33.0 bits (76), Expect = 0.47
 Identities = 11/38 (28%), Positives = 20/38 (52%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQME 866
            EK+K  ++++K K K  +KK+K   +K  P      +
Sbjct: 126 GEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSD 163



 Score = 33.0 bits (76), Expect = 0.53
 Identities = 9/51 (17%), Positives = 22/51 (43%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVA 878
             +K++   +K+K  ++++K K K   KK  P       +     + + + 
Sbjct: 118 QAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELT 168



 Score = 31.8 bits (73), Expect = 0.98
 Identities = 9/34 (26%), Positives = 20/34 (58%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
               E+E+  ++++K K K  +KK+K + +K + 
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEP 156



 Score = 31.8 bits (73), Expect = 1.0
 Identities = 7/31 (22%), Positives = 20/31 (64%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           EE + K + ++ +++ KK++   +K+K  ++
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRR 134



 Score = 31.0 bits (71), Expect = 2.3
 Identities = 12/62 (19%), Positives = 26/62 (41%), Gaps = 2/62 (3%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK--LRDAASIQCGPVKEVPCEPQ 778
           EE K K + ++ +++ KK++   +K+K    E +   K   +        P  +      
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSD 163

Query: 779 IA 780
           I+
Sbjct: 164 IS 165



 Score = 31.0 bits (71), Expect = 2.3
 Identities = 8/35 (22%), Positives = 21/35 (60%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
           +Q+  +     +++K  ++++K K K  +KK+K +
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPR 150



 Score = 30.3 bits (69), Expect = 3.1
 Identities = 10/36 (27%), Positives = 21/36 (58%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           A  +KE+   +++K K K  +KK+K + +K + +  
Sbjct: 124 AAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHT 159



 Score = 30.3 bits (69), Expect = 3.6
 Identities = 9/35 (25%), Positives = 18/35 (51%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
           K+R          ++++K K K  +KK+K + +K 
Sbjct: 120 KKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154



 Score = 29.5 bits (67), Expect = 5.7
 Identities = 10/42 (23%), Positives = 21/42 (50%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           A   ++    ++K K K  +KK+K + +K + +    S+  E
Sbjct: 125 AGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation of
           60S ribosomal subunits.
          Length = 132

 Score = 37.3 bits (87), Expect = 0.007
 Identities = 33/153 (21%), Positives = 60/153 (39%), Gaps = 39/153 (25%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE 772
           A   K+    + KK  K KKK K +K +KK K++ +    +   +          G VK 
Sbjct: 16  ASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEK----------GRVKP 65

Query: 773 VPCEPQIAPCLFDIKNDPCEKN--NLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKE 830
           V   P+             E+    +A +  V                + FN +    K+
Sbjct: 66  VL--PED---------LEKERRLRKVAQKGVV----------------KLFNAVRAAQKK 98

Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLS 863
            +K  K+  KK + K+++  K++ K+ + D+L 
Sbjct: 99  TEKAVKEAGKKARVKREELLKEVSKEKFLDLLR 131



 Score = 34.3 bits (79), Expect = 0.086
 Identities = 13/48 (27%), Positives = 26/48 (54%), Gaps = 3/48 (6%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQM---EKELANI 872
            K+K K +K +KK K++ + +K++ + K     VL +    E+ L  +
Sbjct: 33  AKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDLEKERRLRKV 80



 Score = 32.3 bits (74), Expect = 0.35
 Identities = 15/44 (34%), Positives = 28/44 (63%), Gaps = 2/44 (4%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY--PDVLSQMEKE 868
           L  +KK K +K +KK K++ + +K++ ++KG   P +   +EKE
Sbjct: 31  LKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDLEKE 74


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 39.2 bits (92), Expect = 0.010
 Identities = 14/33 (42%), Positives = 26/33 (78%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           + E+EEK+  K++K+ +K KK+++KKKK+ +K 
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKL 586



 Score = 38.9 bits (91), Expect = 0.012
 Identities = 15/34 (44%), Positives = 26/34 (76%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
           KEE+E  K++K+ +K KK+++KKKK+ +K +K  
Sbjct: 557 KEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590



 Score = 35.8 bits (83), Expect = 0.11
 Identities = 18/45 (40%), Positives = 32/45 (71%), Gaps = 3/45 (6%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           DKEE + +K++K+  K++K+ +K KK+++KKKK   E E + K +
Sbjct: 549 DKEELQREKEEKEALKEQKRLRKLKKQEEKKKK---ELEKLEKAK 590



 Score = 35.0 bits (81), Expect = 0.18
 Identities = 13/37 (35%), Positives = 26/37 (70%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
           ++E  +++K+ +K KK+++KKKK+ +K +K K    E
Sbjct: 559 EKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAE 595



 Score = 34.2 bits (79), Expect = 0.36
 Identities = 14/37 (37%), Positives = 25/37 (67%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           E+E  K++K+ +K KK+++KKKK+ +K +K      E
Sbjct: 559 EKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAE 595



 Score = 33.8 bits (78), Expect = 0.47
 Identities = 14/39 (35%), Positives = 29/39 (74%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
            D +EE +++K++K+  K++K+ +K KK+++KKK   E+
Sbjct: 547 LDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 32.3 bits (74), Expect = 1.4
 Identities = 13/48 (27%), Positives = 27/48 (56%), Gaps = 4/48 (8%)

Query: 809 TTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            +     D ++   +      EK+  K++K+ +K KK+++KKKK ++K
Sbjct: 542 PSVWKLDDKEELQREKE----EKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 31.5 bits (72), Expect = 2.2
 Identities = 11/44 (25%), Positives = 30/44 (68%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           ++K++ +++K++K+  K++K+ +K KK+   +++ + KL  A  
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKI 591



 Score = 30.0 bits (68), Expect = 6.1
 Identities = 10/38 (26%), Positives = 24/38 (63%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           K   K++ +++K++K+  K++K+ +K   +EE  +K  
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKEL 583


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 38.8 bits (91), Expect = 0.011
 Identities = 13/40 (32%), Positives = 22/40 (55%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
             E KK +KK++K +KK +K++ +K   KK +       K
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAK 447



 Score = 38.4 bits (90), Expect = 0.019
 Identities = 12/38 (31%), Positives = 21/38 (55%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
                 E+KK +KK++K +KK +K++ +K   K   E 
Sbjct: 404 GNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEA 441



 Score = 37.2 bits (87), Expect = 0.043
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           K E+EE +K   KKK +   KK K    + KK 
Sbjct: 424 KAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456



 Score = 36.1 bits (84), Expect = 0.085
 Identities = 16/65 (24%), Positives = 27/65 (41%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEV 773
            P   E EE++ +       ++KK +KK++K +KK   EE      +  A       K  
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGP 449

Query: 774 PCEPQ 778
             E +
Sbjct: 450 DGETK 454



 Score = 34.1 bits (79), Expect = 0.39
 Identities = 10/25 (40%), Positives = 19/25 (76%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
             E+KK +KK++K +KK +K++ +K
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEK 432



 Score = 33.8 bits (78), Expect = 0.40
 Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 7/47 (14%)

Query: 811 EVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
           E G L P +R        K +KK++K +KK +K++ +K   K   + 
Sbjct: 402 ENGNLSPAER-------KKLRKKQRKAEKKAEKEEAEKAAAKKKAEA 441



 Score = 31.8 bits (73), Expect = 1.7
 Identities = 12/31 (38%), Positives = 14/31 (45%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKKGYPD 860
           E +K   KKK +   KK K      KK  PD
Sbjct: 429 EAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459



 Score = 30.7 bits (70), Expect = 3.9
 Identities = 11/36 (30%), Positives = 19/36 (52%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           K +KK +K++ +K   KKK +   KK    +   +K
Sbjct: 420 KAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
           Williams-Beuren syndrome.  This domain family is found
           in eukaryotes, and is typically between 72 and 83 amino
           acids in length. The family is found in association with
           pfam08241. This family is made up of
           S-adenosylmethionine-dependent methyltransferases. The
           proteins are deleted in Williams-Beuren syndrome (WBS),
           a complex developmental disorder with multisystemic
           manifestations including supravalvular aortic stenosis
           (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 35.4 bits (82), Expect = 0.013
 Identities = 17/52 (32%), Positives = 30/52 (57%), Gaps = 3/52 (5%)

Query: 809 TTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKK---KKKKKKMMKKG 857
             E    D +   +++  + +  ++KKKKKKK KKK K    +KK++M ++G
Sbjct: 17  GEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRG 68



 Score = 33.0 bits (76), Expect = 0.079
 Identities = 14/32 (43%), Positives = 22/32 (68%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
           D E+ +  K ++  ++ ++KKKKKKK KKK K
Sbjct: 23  DDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54



 Score = 32.7 bits (75), Expect = 0.11
 Identities = 13/34 (38%), Positives = 23/34 (67%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            + +E+ +  K ++  ++ ++KKKKKKK KKK K
Sbjct: 21  EEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54



 Score = 31.1 bits (71), Expect = 0.38
 Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 4/50 (8%)

Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKK----KKKKKKKKK 748
                   QI         ++ ++KKKKKKK KKK K    +KK++ +++
Sbjct: 18  EEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRR 67



 Score = 28.0 bits (63), Expect = 5.0
 Identities = 14/35 (40%), Positives = 24/35 (68%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
           + EE++E+    K ++  ++ ++KKKKKKK KK S
Sbjct: 19  EGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKS 53


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 38.9 bits (91), Expect = 0.014
 Identities = 37/150 (24%), Positives = 53/150 (35%), Gaps = 25/150 (16%)

Query: 716 DKEEEEEKKKKKKKKKK-----------KKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           D+EE EE  KK   ++K            KK KK+ K + K+KK S+EEE +    +A  
Sbjct: 408 DEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKV 467

Query: 765 IQCGPVKEVPCEPQI--APCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFN 822
            +         E            + +P  K           +              +  
Sbjct: 468 EKVANKLLKRSEKAQKEEEEEELDEENPWLKTT-------SSVGKSAK-----KQDSKKK 515

Query: 823 QIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
             + LDK   K  K   K KKKKKK+K   
Sbjct: 516 SSSKLDKAANKISKAAVKVKKKKKKEKSID 545



 Score = 32.0 bits (73), Expect = 1.7
 Identities = 12/26 (46%), Positives = 20/26 (76%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
           KK+KKK+K+K++   K +  KK+K+K
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRK 652



 Score = 31.6 bits (72), Expect = 2.3
 Identities = 13/32 (40%), Positives = 22/32 (68%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
           ++ KKK+K+K++   K +  KK+K+K KK  N
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKN 658



 Score = 31.6 bits (72), Expect = 2.5
 Identities = 17/43 (39%), Positives = 25/43 (58%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
           EE K K+ KK K KK  +  KK+K K++    EE ++   +AA
Sbjct: 216 EEAKAKRIKKIKSKKYHRVHKKEKLKEELKEFEELVKADPEAA 258



 Score = 30.8 bits (70), Expect = 4.1
 Identities = 11/31 (35%), Positives = 22/31 (70%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
           K+ ++++K+K++   K +  KK+K+K KK K
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLK 657



 Score = 30.4 bits (69), Expect = 5.9
 Identities = 11/30 (36%), Positives = 21/30 (70%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           ++ +KK+K+K++   K +  KK+K+K KK 
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKL 656



 Score = 30.0 bits (68), Expect = 7.8
 Identities = 11/31 (35%), Positives = 22/31 (70%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           ++ ++K+K+K++   K +  KK+K+K KK K
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLK 657



 Score = 29.6 bits (67), Expect = 8.8
 Identities = 15/34 (44%), Positives = 20/34 (58%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
           EE K K+ KK K KK  +  KK+K K++    EE
Sbjct: 216 EEAKAKRIKKIKSKKYHRVHKKEKLKEELKEFEE 249


>gnl|CDD|225625 COG3083, COG3083, Predicted hydrolase of alkaline phosphatase
           superfamily [General function prediction only].
          Length = 600

 Score = 38.6 bits (90), Expect = 0.018
 Identities = 21/94 (22%), Positives = 43/94 (45%), Gaps = 12/94 (12%)

Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
            D P F YL+  ++H+ +     QA               ++++   L ++D  +G+V+E
Sbjct: 391 EDNPWFSYLSLNSSHANDDPSSNQAK---------ARPPFKNRYQNALREVDSQIGRVLE 441

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
            L    +L N++++  +D+G     FN    + W
Sbjct: 442 QLRNSGLLDNTVVIITADHG---EEFNEEEQNYW 472


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 37.2 bits (87), Expect = 0.018
 Identities = 16/39 (41%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 821 FNQI-AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
           F  I  Y ++EK+KKK   K++KK  K++K+K     GY
Sbjct: 87  FTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEPYGY 125



 Score = 33.4 bits (77), Expect = 0.31
 Identities = 13/29 (44%), Positives = 22/29 (75%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           EE+K+KKK   K++KK  K++K+K ++ Y
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEPY 123



 Score = 31.1 bits (71), Expect = 1.7
 Identities = 13/31 (41%), Positives = 22/31 (70%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           EEE++KKK   K++KK  K++K+K ++   Y
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEPYGY 125


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 37.5 bits (87), Expect = 0.018
 Identities = 18/52 (34%), Positives = 23/52 (44%), Gaps = 1/52 (1%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
           P ++ + EKK KK K K K K K K K K + K      +       AA  Q
Sbjct: 84  PKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSK-TAAKAPAAPNQ 134



 Score = 37.1 bits (86), Expect = 0.029
 Identities = 15/35 (42%), Positives = 19/35 (54%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           P+    + K+K K +KK KK K K K K K K K 
Sbjct: 77  PEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKV 111



 Score = 36.7 bits (85), Expect = 0.031
 Identities = 14/39 (35%), Positives = 19/39 (48%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            +   P +    + K+K K +KK KK K K K K K K 
Sbjct: 71  TEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKP 109



 Score = 36.3 bits (84), Expect = 0.053
 Identities = 16/41 (39%), Positives = 21/41 (51%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
           P   + +EK K +KK KK K K K K K K K K   + + 
Sbjct: 79  PTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKK 119



 Score = 35.9 bits (83), Expect = 0.060
 Identities = 14/35 (40%), Positives = 18/35 (51%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            P  E++ +K K K K K K K K K + K KK  
Sbjct: 87  KPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPP 121



 Score = 33.6 bits (77), Expect = 0.34
 Identities = 14/40 (35%), Positives = 17/40 (42%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
               E    K K+K K +KK KK K K K K K     + 
Sbjct: 74  ETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKP 113



 Score = 31.7 bits (72), Expect = 1.5
 Identities = 13/34 (38%), Positives = 16/34 (47%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           P    E E   +    K K+K K +KK KK K K
Sbjct: 67  PKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPK 100



 Score = 30.5 bits (69), Expect = 3.2
 Identities = 14/24 (58%), Positives = 15/24 (62%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
           KEK K +KK KK K K K K K K
Sbjct: 85  KEKPKPEKKPKKPKPKPKPKPKPK 108



 Score = 30.1 bits (68), Expect = 4.8
 Identities = 14/34 (41%), Positives = 17/34 (50%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           P  E E   +    K K+K K +KK KK K K K
Sbjct: 69  PPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPK 102



 Score = 29.8 bits (67), Expect = 6.1
 Identities = 13/24 (54%), Positives = 13/24 (54%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
            EKK KK K K K K K K K K 
Sbjct: 90  PEKKPKKPKPKPKPKPKPKPKVKP 113



 Score = 29.8 bits (67), Expect = 6.6
 Identities = 13/24 (54%), Positives = 14/24 (58%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
           K K +KK KK K K K K K K K
Sbjct: 87  KPKPEKKPKKPKPKPKPKPKPKPK 110



 Score = 29.4 bits (66), Expect = 6.9
 Identities = 13/34 (38%), Positives = 17/34 (50%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           P + E   +    K K+K K +KK KK K K K 
Sbjct: 70  PTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKP 103



 Score = 29.4 bits (66), Expect = 7.5
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           ++ K +KK KK K K K K K K  +K 
Sbjct: 86  EKPKPEKKPKKPKPKPKPKPKPKPKVKP 113


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 36.7 bits (85), Expect = 0.020
 Identities = 9/43 (20%), Positives = 23/43 (53%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           ++  +E +++   + K + K+KKK++  K K +    +   +K
Sbjct: 103 EESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKK 145



 Score = 34.7 bits (80), Expect = 0.080
 Identities = 9/39 (23%), Positives = 22/39 (56%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           ++E  +E +++   + K + K+KKK++  K K    + +
Sbjct: 102 EEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPK 140



 Score = 34.7 bits (80), Expect = 0.084
 Identities = 12/42 (28%), Positives = 22/42 (52%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
            +E E++   + K + K+KKK++  K K +K+    E    K
Sbjct: 106 TDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPK 147



 Score = 34.0 bits (78), Expect = 0.16
 Identities = 10/39 (25%), Positives = 22/39 (56%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
            + EEE   + +++   + K + K+KKK++  K  + +E
Sbjct: 99  EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKE 137



 Score = 33.6 bits (77), Expect = 0.22
 Identities = 12/54 (22%), Positives = 24/54 (44%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGP 769
           ++E EEE   + +++   + K + K+KKK++      E+   K          P
Sbjct: 98  EEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKP 151



 Score = 33.2 bits (76), Expect = 0.27
 Identities = 15/36 (41%), Positives = 23/36 (63%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
           P + + E K+KKK++  K K +K+K K + KK K S
Sbjct: 114 PPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPS 149



 Score = 32.8 bits (75), Expect = 0.32
 Identities = 12/34 (35%), Positives = 22/34 (64%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            + E+E+  + K + K+KKK++  K K +K+K K
Sbjct: 107 DETEQEDPPETKTESKEKKKREVPKPKTEKEKPK 140



 Score = 32.4 bits (74), Expect = 0.48
 Identities = 13/39 (33%), Positives = 20/39 (51%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
                   E +E+KK++  K K +K+K K + KK K  K
Sbjct: 112 EDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150



 Score = 30.1 bits (68), Expect = 2.8
 Identities = 13/33 (39%), Positives = 20/33 (60%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           KE+++ +  K K +K+K K + KK K  K K Y
Sbjct: 122 KEKKKREVPKPKTEKEKPKTEPKKPKPSKPKLY 154



 Score = 29.7 bits (67), Expect = 3.3
 Identities = 11/40 (27%), Positives = 25/40 (62%)

Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           F +++  D EEEE +++   + +++   + K + K+KKK+
Sbjct: 88  FRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKR 127


>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
          Length = 405

 Score = 37.9 bits (88), Expect = 0.020
 Identities = 22/83 (26%), Positives = 35/83 (42%), Gaps = 5/83 (6%)

Query: 679 DIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKK 738
           D+K  P +KN L + +   ++N Y     R   +A     E    K KK+K  KK+K+  
Sbjct: 313 DVKRAPLKKNPLKNLNAMLKLNPYAKTARRMALLA-----EAARVKAKKEKLAKKRKQVS 367

Query: 739 KKKKKKKKKKYSNEEEGMRKLRD 761
           K++    K       + M    D
Sbjct: 368 KEEAAAIKAAGKAWYKTMISDSD 390



 Score = 34.0 bits (78), Expect = 0.40
 Identities = 22/81 (27%), Positives = 39/81 (48%), Gaps = 8/81 (9%)

Query: 785 DIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK--KKKKKK 842
           D+K  P +KN L + + + ++N Y          +R   +A   + K KK+K  KK+K+ 
Sbjct: 313 DVKRAPLKKNPLKNLNAMLKLNPYAKT------ARRMALLAEAARVKAKKEKLAKKRKQV 366

Query: 843 KKKKKKKKKKMMKKGYPDVLS 863
            K++    K   K  Y  ++S
Sbjct: 367 SKEEAAAIKAAGKAWYKTMIS 387


>gnl|CDD|224441 COG1524, COG1524, Uncharacterized proteins of the AP superfamily
           [General function prediction only].
          Length = 450

 Score = 37.9 bits (88), Expect = 0.021
 Identities = 46/303 (15%), Positives = 99/303 (32%), Gaps = 52/303 (17%)

Query: 34  IMAFAVLPLAF----TLSMVFVDLVASSGPPHIIFILADDLG---WNDVGFHGLDQIPTP 86
           +  F++L L F     +          +    ++ I  D L     +            P
Sbjct: 9   VFLFSILDLFFAADVPILSQSYQAATPAPKKKLVLISIDGLRADVLDRKAGI------LP 62

Query: 87  NIDALAYSG----IILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLY----GCERGGL 138
            + +LA +G     ++  + T     P  + ++TG +P   G+  N+LY    G      
Sbjct: 63  FLSSLAENGVHVAELISVFPTTTR--PRHTTLITGSYPDEHGIVGNILYDPETGDSVLQF 120

Query: 139 PLSEKILPQYLKELGYRTRI-VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE 197
            L    +     +          +      K             L Y             
Sbjct: 121 LLDNPTILFPGDDEAGMDVAAPFESLTVSDKLRVAVDLVWDVPILHY----LHIGGPDHI 176

Query: 198 EM--KMWGLDMRRDLEPAWDLHGKYSTDVF-TAEAVDIIHNHSTDEPLFLYLAHAATHSA 254
            M   +   D   +++P +D   ++  D     ++V ++     D  L +YL +      
Sbjct: 177 TMRRFLI--DEDDNIKPGYDYISEHFLDSLLFLDSVLLLDRADPD-LLLVYLPN------ 227

Query: 255 NPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
                + A  H              ++A  + ++D  +G+++E L++R +    +++  S
Sbjct: 228 -----IDAIGHKYGPDSP-------EYAEAVREVDSLLGELLELLKKRGLYEEYLVIITS 275

Query: 315 DNG 317
           D+G
Sbjct: 276 DHG 278


>gnl|CDD|221569 pfam12420, DUF3671, Protein of unknown function.  This domain
           family is found in eukaryotes, and is typically between
           96 and 116 amino acids in length.
          Length = 106

 Score = 35.4 bits (82), Expect = 0.021
 Identities = 14/33 (42%), Positives = 17/33 (51%)

Query: 821 FNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKM 853
           F++I Y+DK KK     KKK KK   KK     
Sbjct: 17  FDKIDYIDKLKKDPNIDKKKFKKIIFKKYGLIF 49


>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal.  The yeast member
           of this family (Kri1p) is found to be required for 40S
           ribosome biogenesis in the nucleolus. This is the
           C-terminal domain of the family.
          Length = 93

 Score = 34.9 bits (81), Expect = 0.025
 Identities = 13/27 (48%), Positives = 19/27 (70%)

Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKK 851
            Y DKE+++K KKK  KK + ++ KKK
Sbjct: 65  PYRDKEERRKDKKKYGKKARLREWKKK 91



 Score = 29.9 bits (68), Expect = 1.6
 Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKK 745
           ++E+++K KKK  KK + ++ KKK  
Sbjct: 68  DKEERRKDKKKYGKKARLREWKKKVF 93



 Score = 29.5 bits (67), Expect = 1.7
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKK 744
           ++EE++K KKK  KK + ++ KKK  
Sbjct: 68  DKEERRKDKKKYGKKARLREWKKKVF 93



 Score = 29.5 bits (67), Expect = 2.0
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKK 743
           ++EE +K KKK  KK + ++ KKK  
Sbjct: 68  DKEERRKDKKKYGKKARLREWKKKVF 93



 Score = 29.1 bits (66), Expect = 2.7
 Identities = 13/29 (44%), Positives = 17/29 (58%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKK 741
            Y DKEE  + KKK  KK + ++ KKK  
Sbjct: 65  PYRDKEERRKDKKKYGKKARLREWKKKVF 93



 Score = 28.7 bits (65), Expect = 3.8
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKY 749
           K++++K KKK  KK + ++ KKK +
Sbjct: 69  KEERRKDKKKYGKKARLREWKKKVF 93



 Score = 28.7 bits (65), Expect = 3.9
 Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKK 747
           +K++++K KKK  KK + ++ KKK  
Sbjct: 68  DKEERRKDKKKYGKKARLREWKKKVF 93



 Score = 28.3 bits (64), Expect = 5.4
 Identities = 9/26 (34%), Positives = 18/26 (69%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKK 746
           ++++++K KKK  KK + ++ KKK  
Sbjct: 68  DKEERRKDKKKYGKKARLREWKKKVF 93



 Score = 27.9 bits (63), Expect = 6.8
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 724 KKKKKKKKKKKKKKKKKKKKKKKKK 748
           K++++K KKK  KK + ++ KKK  
Sbjct: 69  KEERRKDKKKYGKKARLREWKKKVF 93


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 37.9 bits (89), Expect = 0.027
 Identities = 9/34 (26%), Positives = 14/34 (41%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           +K E+         K    KK   K  +K K++Y
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQY 849



 Score = 37.1 bits (87), Expect = 0.053
 Identities = 8/32 (25%), Positives = 11/32 (34%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
             E+ EK +       K    KK   K  +K 
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSRKT 845



 Score = 36.7 bits (86), Expect = 0.063
 Identities = 8/32 (25%), Positives = 13/32 (40%)

Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            YL     K    KK   K  +K K++ +  +
Sbjct: 822 RYLADAPAKDPAGKKAAVKFSRKTKQQYVASE 853



 Score = 35.9 bits (84), Expect = 0.11
 Identities = 8/33 (24%), Positives = 15/33 (45%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
                 K    KK   K  +K K++  ++E++G
Sbjct: 824 LADAPAKDPAGKKAAVKFSRKTKQQYVASEKDG 856



 Score = 35.5 bits (83), Expect = 0.13
 Identities = 7/26 (26%), Positives = 11/26 (42%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKK 747
             KK   K  +K K++    +K  K 
Sbjct: 833 AGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 35.5 bits (83), Expect = 0.14
 Identities = 8/36 (22%), Positives = 14/36 (38%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            Y      ++   KK   K  +K K++    +K  K
Sbjct: 822 RYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGK 857



 Score = 35.5 bits (83), Expect = 0.14
 Identities = 7/26 (26%), Positives = 11/26 (42%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
             KK   K  +K K++    +K  K 
Sbjct: 833 AGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 34.8 bits (81), Expect = 0.22
 Identities = 8/34 (23%), Positives = 12/34 (35%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           P+K         K    KK   K  +K K++   
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVA 851



 Score = 34.8 bits (81), Expect = 0.23
 Identities = 8/31 (25%), Positives = 14/31 (45%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
           P K+   +K   K  +K K++    +K  K 
Sbjct: 828 PAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 34.4 bits (80), Expect = 0.30
 Identities = 7/30 (23%), Positives = 13/30 (43%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
            ++   KK   K  +K K++    +K  K 
Sbjct: 829 AKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 34.4 bits (80), Expect = 0.32
 Identities = 7/37 (18%), Positives = 14/37 (37%)

Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           +    ++ E+ +       K    KK   K  +K K+
Sbjct: 811 LELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQ 847



 Score = 34.4 bits (80), Expect = 0.36
 Identities = 8/29 (27%), Positives = 12/29 (41%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           D   K    KK   K  +K K++    +K
Sbjct: 826 DAPAKDPAGKKAAVKFSRKTKQQYVASEK 854



 Score = 34.0 bits (79), Expect = 0.40
 Identities = 7/25 (28%), Positives = 11/25 (44%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
           D   KK   K  +K K++    +K 
Sbjct: 831 DPAGKKAAVKFSRKTKQQYVASEKD 855



 Score = 32.5 bits (75), Expect = 1.3
 Identities = 8/27 (29%), Positives = 11/27 (40%)

Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKG 857
           K    KK   K  +K K++     K G
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKDG 856



 Score = 32.5 bits (75), Expect = 1.3
 Identities = 9/38 (23%), Positives = 14/38 (36%)

Query: 815 LDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
              K R+   A       KK   K  +K K++    +K
Sbjct: 817 KPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEK 854



 Score = 32.1 bits (74), Expect = 1.5
 Identities = 7/26 (26%), Positives = 10/26 (38%)

Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKYS 750
           K +K +       K    KK   K+S
Sbjct: 817 KPEKLRYLADAPAKDPAGKKAAVKFS 842



 Score = 31.7 bits (73), Expect = 2.2
 Identities = 8/34 (23%), Positives = 12/34 (35%), Gaps = 1/34 (2%)

Query: 824 IAYLD-KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           +  L  K +K +       K    KK   K  +K
Sbjct: 811 LELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRK 844



 Score = 31.3 bits (72), Expect = 2.6
 Identities = 7/28 (25%), Positives = 12/28 (42%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           K+   KK   K  +K K++    +   K
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKDGK 857



 Score = 30.5 bits (70), Expect = 5.2
 Identities = 6/25 (24%), Positives = 10/25 (40%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
              KK   K  +K K++    +K  
Sbjct: 832 PAGKKAAVKFSRKTKQQYVASEKDG 856



 Score = 30.5 bits (70), Expect = 5.6
 Identities = 6/23 (26%), Positives = 10/23 (43%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKK 851
           K   K  +K K++    +K  K 
Sbjct: 836 KAAVKFSRKTKQQYVASEKDGKA 858


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 37.2 bits (87), Expect = 0.029
 Identities = 14/43 (32%), Positives = 29/43 (67%), Gaps = 3/43 (6%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           ++ +EE ++KK++KKK++++ K  K   ++++K    EE  RK
Sbjct: 278 EERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKL---EEKERK 317



 Score = 37.2 bits (87), Expect = 0.030
 Identities = 14/37 (37%), Positives = 26/37 (70%), Gaps = 6/37 (16%)

Query: 718 EEEEEKKKKKKKKKKKKKKKK------KKKKKKKKKK 748
           EE +EKK++KKK++++ K  K      +K ++K++KK
Sbjct: 282 EEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKK 318



 Score = 36.8 bits (86), Expect = 0.042
 Identities = 13/38 (34%), Positives = 27/38 (71%), Gaps = 6/38 (15%)

Query: 717 KEEEEEKKKKKKKKKKKK------KKKKKKKKKKKKKK 748
           +E++EEKKK++++ K  K      +K ++K++KK+ +K
Sbjct: 285 QEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322



 Score = 36.5 bits (85), Expect = 0.054
 Identities = 18/48 (37%), Positives = 32/48 (66%), Gaps = 7/48 (14%)

Query: 717 KEEEEEKKKKKKKKK-----KKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
           +EEEE+  K  ++++     +KK++KKK++++ K  K S EE+  RKL
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQ--RKL 311



 Score = 34.9 bits (81), Expect = 0.15
 Identities = 11/38 (28%), Positives = 24/38 (63%), Gaps = 10/38 (26%)

Query: 829 KEKKKKKKKKKKKK----------KKKKKKKKKKMMKK 856
           +EKK++KKK++++           +K ++K++KK  +K
Sbjct: 285 QEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322



 Score = 33.0 bits (76), Expect = 0.57
 Identities = 15/44 (34%), Positives = 30/44 (68%), Gaps = 2/44 (4%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
           DK  EEE++K  K  +++++++ ++KK++KKK+    E  + KL
Sbjct: 262 DKTREEEEEKILKAAEEERQEEAQEKKEEKKKE--EREAKLAKL 303


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 36.5 bits (85), Expect = 0.029
 Identities = 19/44 (43%), Positives = 27/44 (61%), Gaps = 2/44 (4%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
           + E   +KKK    +++KKKKKKK  KKKK KK +    GM+ +
Sbjct: 165 EAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVA--AVGMKAI 206



 Score = 34.6 bits (80), Expect = 0.15
 Identities = 16/35 (45%), Positives = 18/35 (51%)

Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
                +  EEE+KKKKKK  KKKK KK      K 
Sbjct: 171 DKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205



 Score = 34.6 bits (80), Expect = 0.16
 Identities = 16/28 (57%), Positives = 18/28 (64%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            E++KKKKKKK  KKKK KK     MK 
Sbjct: 178 AEEEKKKKKKKSAKKKKLKKVAAVGMKA 205



 Score = 32.7 bits (75), Expect = 0.61
 Identities = 16/36 (44%), Positives = 20/36 (55%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           A   K+    +++KKKKKKK  KKKK KK      K
Sbjct: 169 ARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMK 204



 Score = 30.7 bits (70), Expect = 2.2
 Identities = 16/37 (43%), Positives = 21/37 (56%), Gaps = 1/37 (2%)

Query: 817 PKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKM 853
            K++ N  A  +K+KKKKK  KKKK KK      K +
Sbjct: 171 DKKKSNN-AEEEKKKKKKKSAKKKKLKKVAAVGMKAI 206



 Score = 29.6 bits (67), Expect = 6.7
 Identities = 14/36 (38%), Positives = 20/36 (55%)

Query: 708 RFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKK 743
           R  + +   +EE+++KKKK  KKKK KK      K 
Sbjct: 170 RDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 37.5 bits (87), Expect = 0.037
 Identities = 20/49 (40%), Positives = 29/49 (59%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVK 771
           +KKKKKKK K+KKKKK ++  K+     N EEG+    +  S +   +K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLK 732



 Score = 29.8 bits (67), Expect = 7.7
 Identities = 14/22 (63%), Positives = 18/22 (81%)

Query: 831 KKKKKKKKKKKKKKKKKKKKKK 852
           +KKKKKKK K+KKKKK ++  K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASK 705



 Score = 29.8 bits (67), Expect = 8.6
 Identities = 17/40 (42%), Positives = 26/40 (65%), Gaps = 2/40 (5%)

Query: 834 KKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANIN 873
           +KKKKKKK K+KKKKK ++  K+ +   L  +E+ +   N
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIH--ALENIEEGIEGSN 721



 Score = 29.5 bits (66), Expect = 9.6
 Identities = 14/23 (60%), Positives = 19/23 (82%)

Query: 831 KKKKKKKKKKKKKKKKKKKKKKM 853
           KKKKKKK K+KKKKK ++  K++
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQI 707


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 37.0 bits (86), Expect = 0.039
 Identities = 13/42 (30%), Positives = 26/42 (61%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           KE+  +   K K ++ KK++  + ++KKK+KK  +E  G++ 
Sbjct: 354 KEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 31.2 bits (71), Expect = 2.9
 Identities = 11/55 (20%), Positives = 27/55 (49%), Gaps = 6/55 (10%)

Query: 804 RINHYTTEVG----YLDPKQRFNQI--AYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
           RI+ ++ E        + ++R  ++         K K ++ KK++  + ++KKK+
Sbjct: 329 RIDAFSGEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKE 383



 Score = 30.1 bits (68), Expect = 5.7
 Identities = 9/28 (32%), Positives = 17/28 (60%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            E+ KK++  + ++KKK+KK K +    
Sbjct: 366 PERDKKERPGRYRRKKKEKKAKSERRGL 393



 Score = 30.1 bits (68), Expect = 6.5
 Identities = 8/25 (32%), Positives = 18/25 (72%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
           DK+++  + ++KKK+KK K +++  
Sbjct: 369 DKKERPGRYRRKKKEKKAKSERRGL 393


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 37.0 bits (86), Expect = 0.041
 Identities = 14/36 (38%), Positives = 22/36 (61%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
             K E+++KKKK+KK+ K + + K   K  KK K +
Sbjct: 67  KKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKT 102



 Score = 37.0 bits (86), Expect = 0.047
 Identities = 18/53 (33%), Positives = 25/53 (47%), Gaps = 2/53 (3%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE--EGMRKLRDAASI 765
             KE++E K + + K   K  KK KK KKK  K   NE+      K+ + A  
Sbjct: 76  KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEK 128



 Score = 35.5 bits (82), Expect = 0.13
 Identities = 22/52 (42%), Positives = 25/52 (48%), Gaps = 8/52 (15%)

Query: 716 DKEEEEEKKKKKKKKKKKK--------KKKKKKKKKKKKKKYSNEEEGMRKL 759
               EEE K      KK K        KKK +KKKKKKK+K   + EG  KL
Sbjct: 40  STFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91



 Score = 32.4 bits (74), Expect = 1.3
 Identities = 16/38 (42%), Positives = 26/38 (68%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           K+++E+K  + KKK +KKKKKKK+KK+ K +  +    
Sbjct: 56  KDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93



 Score = 32.0 bits (73), Expect = 1.6
 Identities = 16/31 (51%), Positives = 21/31 (67%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
            + E KKK +KKKKKKK+KK+ K +   K G
Sbjct: 62  KNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92



 Score = 30.8 bits (70), Expect = 3.7
 Identities = 14/40 (35%), Positives = 24/40 (60%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
               ++E++  + KKK +KKKKKKK+KK+ K + +     
Sbjct: 54  TKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93



 Score = 29.7 bits (67), Expect = 8.4
 Identities = 12/39 (30%), Positives = 17/39 (43%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
           K+K+KK+ K + + K   K  KK    K  P      E 
Sbjct: 76  KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 36.7 bits (85), Expect = 0.043
 Identities = 26/162 (16%), Positives = 58/162 (35%), Gaps = 19/162 (11%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
             +  E+ ++K+ +++   +K  K+ ++  K+ +   ++    K + AA  +     E  
Sbjct: 82  KQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAK--AKAEAE 139

Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
            E +      + K    E+      +E +              K+        + E K K
Sbjct: 140 AEKKAKE---EAKKQAEEEAKAKAAAEAK--------------KKAAEAKKKAEAEAKAK 182

Query: 835 KKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTA 876
            + K K K ++ K K +    K   +  ++ E E A      
Sbjct: 183 AEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAE 224



 Score = 36.0 bits (83), Expect = 0.071
 Identities = 18/53 (33%), Positives = 26/53 (49%), Gaps = 1/53 (1%)

Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
           Q     K+ EE K K+  + K K + + +KK K++ KK   EEE   K    A
Sbjct: 113 QAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK-QAEEEAKAKAAAEA 164



 Score = 32.9 bits (75), Expect = 0.79
 Identities = 13/54 (24%), Positives = 26/54 (48%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
            Q      ++  E K K + + +KK K++ KK+ +++ K     E  +K  +A 
Sbjct: 119 KQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAK 172



 Score = 32.1 bits (73), Expect = 1.5
 Identities = 14/41 (34%), Positives = 24/41 (58%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
           A  +K+ +EE KK+ +++ K K   + KKK  + KK +  E
Sbjct: 138 AEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAE 178



 Score = 30.2 bits (68), Expect = 4.7
 Identities = 22/138 (15%), Positives = 47/138 (34%), Gaps = 37/138 (26%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
             EE E+++  ++ ++K+ +++   +K  K+       E   K  +    Q    K    
Sbjct: 76  QAEEAEKQRAAEQARQKELEQRAAAEKAAKQ------AEQAAKQAEEKQKQAEEAKAKQA 129

Query: 776 EPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKK 835
               A                  ++E +              +++  + A    E++ K 
Sbjct: 130 AEAKA------------------KAEAEA-------------EKKAKEEAKKQAEEEAKA 158

Query: 836 KKKKKKKKKKKKKKKKKM 853
           K   + KKK  + KKK  
Sbjct: 159 KAAAEAKKKAAEAKKKAE 176


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 37.1 bits (87), Expect = 0.044
 Identities = 35/132 (26%), Positives = 51/132 (38%), Gaps = 37/132 (28%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMR-KLRDAASIQCGPVKEVP 774
             E  E +K+  K  +KK+KKKKK+K+K+++ K      G   K       Q G V  +P
Sbjct: 606 AHELIEARKRLNKANEKKEKKKKKQKEKQEELK-----VGDEVKYLSLG--QKGEVLSIP 658

Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
                                  D  E         + G +  K   + +  + K KKKK
Sbjct: 659 -----------------------DDKEAI------VQAGIMKMKVPLSDLEKIQKPKKKK 689

Query: 835 KKKKKKKKKKKK 846
           KKK K  K K +
Sbjct: 690 KKKPKTVKPKPR 701



 Score = 36.0 bits (84), Expect = 0.12
 Identities = 26/135 (19%), Positives = 49/135 (36%), Gaps = 38/135 (28%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEP 777
           EE E   K+ +K K++ ++KK+K ++++ K     E+  ++    A              
Sbjct: 537 EEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEA-------------- 582

Query: 778 QIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKK 837
                                + E   I     ++                 E +K+  K
Sbjct: 583 ---------------------KKEADEIIKELRQLQKGGYASVKAHEL---IEARKRLNK 618

Query: 838 KKKKKKKKKKKKKKK 852
             +KK+KKKKK+K+K
Sbjct: 619 ANEKKEKKKKKQKEK 633


>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
          Length = 145

 Score = 35.4 bits (82), Expect = 0.045
 Identities = 14/29 (48%), Positives = 17/29 (58%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
            KK KKK KK  K  KK  K KK+K  ++
Sbjct: 4   AKKAKKKAKKAAKAAKKGVKVKKRKVRTS 32



 Score = 34.7 bits (80), Expect = 0.069
 Identities = 15/30 (50%), Positives = 17/30 (56%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
            KK KKK KK  K  KK  K KK+K + S 
Sbjct: 4   AKKAKKKAKKAAKAAKKGVKVKKRKVRTSV 33



 Score = 30.4 bits (69), Expect = 1.7
 Identities = 13/27 (48%), Positives = 16/27 (59%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKM 853
             K+ KKK KK  K  KK  K KK+K+
Sbjct: 3   PAKKAKKKAKKAAKAAKKGVKVKKRKV 29



 Score = 30.4 bits (69), Expect = 2.1
 Identities = 14/27 (51%), Positives = 14/27 (51%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           K   KK KKK KK  K  KK  K KK 
Sbjct: 1   KAPAKKAKKKAKKAAKAAKKGVKVKKR 27



 Score = 30.1 bits (68), Expect = 2.8
 Identities = 13/30 (43%), Positives = 14/30 (46%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
           K   KK KKK KK  K  KK  K   +K  
Sbjct: 1   KAPAKKAKKKAKKAAKAAKKGVKVKKRKVR 30



 Score = 29.7 bits (67), Expect = 3.7
 Identities = 13/31 (41%), Positives = 17/31 (54%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
           K   ++ KKK KK  K  KK  K KK+K + 
Sbjct: 1   KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRT 31



 Score = 29.7 bits (67), Expect = 3.9
 Identities = 11/25 (44%), Positives = 14/25 (56%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
             +KK KK  K  KK  K KK+K +
Sbjct: 6   KAKKKAKKAAKAAKKGVKVKKRKVR 30


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 36.7 bits (85), Expect = 0.054
 Identities = 19/61 (31%), Positives = 26/61 (42%)

Query: 694 SEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
             D   +++        Q A P       K K K +K KKK    K   KKKKK  S+++
Sbjct: 88  HIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKD 147

Query: 754 E 754
           E
Sbjct: 148 E 148



 Score = 34.0 bits (78), Expect = 0.41
 Identities = 15/37 (40%), Positives = 18/37 (48%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
           P  + E+ KKK    K   KKKKK    K +  KY N
Sbjct: 118 PKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDN 154



 Score = 32.9 bits (75), Expect = 1.0
 Identities = 33/148 (22%), Positives = 46/148 (31%), Gaps = 52/148 (35%)

Query: 716 DKEEEEEKKKKKKKKKK-------KKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCG 768
           +K+  E+  KK KK  K       KK K K+KKK KKK    ++ +     ++ +    G
Sbjct: 49  NKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAG 108

Query: 769 PVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLD 828
           P+               +   P  K                                   
Sbjct: 109 PLA------------ISLMRKPKPKTE--------------------------------- 123

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           K KKK    K   KKKKK    K  + K
Sbjct: 124 KLKKKITVNKSTNKKKKKVLSSKDELIK 151



 Score = 31.3 bits (71), Expect = 2.6
 Identities = 14/60 (23%), Positives = 19/60 (31%)

Query: 686 EKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
            K  L    +            +F          + + K +K KKK    K   KKKKK 
Sbjct: 83  IKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 35.3 bits (82), Expect = 0.060
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 727 KKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           KKK KK KK+K+K++K+ +K     ++E
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGADDGDDE 41



 Score = 33.0 bits (76), Expect = 0.33
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           KKK KK KK+K+K++K+ +K      +E +
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGADDGDDELK 43



 Score = 33.0 bits (76), Expect = 0.43
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKK 744
           KKK KK KK+K+K++K+ +K  
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGA 35



 Score = 33.0 bits (76), Expect = 0.43
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 724 KKKKKKKKKKKKKKKKKKKKKK 745
           KKK KK KK+K+K++K+ +K  
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGA 35



 Score = 33.0 bits (76), Expect = 0.43
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 831 KKKKKKKKKKKKKKKKKKKKKK 852
           KKK KK KK+K+K++K+ +K  
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGA 35



 Score = 32.6 bits (75), Expect = 0.52
 Identities = 11/31 (35%), Positives = 23/31 (74%)

Query: 728 KKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           KKK KK KK+K+K++K+ +K   + ++ +++
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGADDGDDELKQ 44



 Score = 32.6 bits (75), Expect = 0.53
 Identities = 11/23 (47%), Positives = 18/23 (78%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKK 850
           DK+K KK KK+K+K++K+ +K  
Sbjct: 13  DKKKAKKAKKEKRKQRKQARKGA 35



 Score = 32.6 bits (75), Expect = 0.53
 Identities = 13/34 (38%), Positives = 22/34 (64%)

Query: 837 KKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELA 870
           KKK KK KK+K+K++K  +KG  D   ++++   
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAE 47



 Score = 32.2 bits (74), Expect = 0.62
 Identities = 12/27 (44%), Positives = 19/27 (70%)

Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKK 851
           A L  +KK KK KK+K+K++K+ +K  
Sbjct: 9   AGLVDKKKAKKAKKEKRKQRKQARKGA 35



 Score = 32.2 bits (74), Expect = 0.76
 Identities = 10/22 (45%), Positives = 17/22 (77%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKK 743
           +KK KK KK+K+K++K+ +K  
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGA 35



 Score = 31.8 bits (73), Expect = 0.83
 Identities = 14/33 (42%), Positives = 23/33 (69%)

Query: 731 KKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
           KKK KK KK+K+K++K+     ++G  +L+ AA
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGADDGDDELKQAA 46



 Score = 29.5 bits (67), Expect = 4.8
 Identities = 9/22 (40%), Positives = 17/22 (77%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKK 742
           ++K KK KK+K+K++K+ +K  
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGA 35


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 36.8 bits (85), Expect = 0.060
 Identities = 14/48 (29%), Positives = 21/48 (43%), Gaps = 9/48 (18%)

Query: 716  DKEEEEEKKKKKKK---------KKKKKKKKKKKKKKKKKKKYSNEEE 754
            D + EE ++K ++          KK  ++  KK   KK  KK S  E 
Sbjct: 1172 DAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESET 1219


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 35.6 bits (82), Expect = 0.061
 Identities = 22/52 (42%), Positives = 27/52 (51%)

Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           T  V + N   +P+  E   KK  KKKK   +KKK  KK KKKKK+     E
Sbjct: 28  TESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPE 79



 Score = 35.2 bits (81), Expect = 0.077
 Identities = 21/46 (45%), Positives = 25/46 (54%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
           D E   +K  KKKK   +KKK  KK KKKKK+K    E    +L D
Sbjct: 42  DVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSD 87



 Score = 32.5 bits (74), Expect = 0.62
 Identities = 18/45 (40%), Positives = 24/45 (53%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
            D   E+ +   KK  KKKK   +KKK  KK KK   E+E + +L
Sbjct: 36  DDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPEL 80



 Score = 30.9 bits (70), Expect = 1.9
 Identities = 17/40 (42%), Positives = 26/40 (65%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           P K+  ++KK   +KKK  KK KKKKK+K++  + + EE 
Sbjct: 46  PSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEEL 85



 Score = 30.9 bits (70), Expect = 2.2
 Identities = 17/44 (38%), Positives = 22/44 (50%)

Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRT 875
           KK  KKKK   +KKK  KK K  KK   +V     +EL++    
Sbjct: 48  KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEEN 91



 Score = 30.5 bits (69), Expect = 2.5
 Identities = 15/24 (62%), Positives = 16/24 (66%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
           K  KKKK   +KKK  KK KKKKK
Sbjct: 49  KTSKKKKTTPRKKKTTKKTKKKKK 72



 Score = 29.8 bits (67), Expect = 4.3
 Identities = 16/35 (45%), Positives = 22/35 (62%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLS 863
           K+  KKKK   +KKK  KK KKKK  K+  P++ +
Sbjct: 48  KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAA 82


>gnl|CDD|197876 smart00792, Agouti, Agouti protein.  The agouti protein regulates
           pigmentation in the mouse hair follicle producing a
           black hair with a subapical yellow band. A highly
           homologous protein agouti signal protein (ASIP) is
           present in humans and is expressed at highest levels in
           adipose tissue where it may play a role in energy
           homeostasis and possibly human pigmentation.
          Length = 124

 Score = 34.5 bits (79), Expect = 0.062
 Identities = 17/69 (24%), Positives = 30/69 (43%), Gaps = 13/69 (18%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPC 782
           KK KK   ++ +KK  +KK+KK           + +    +  +C  +++  C+    PC
Sbjct: 52  KKPKKISAEEAEKKLLQKKEKKALTN-------VLRPEPRSPRRCVRLRD-SCKGPQPPC 103

Query: 783 LFDIKNDPC 791
                 DPC
Sbjct: 104 C-----DPC 107


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.  This
           domain is the C-terminal end of ribosomal L18/L5
           proteins.
          Length = 93

 Score = 33.7 bits (78), Expect = 0.062
 Identities = 13/32 (40%), Positives = 20/32 (62%), Gaps = 3/32 (9%)

Query: 729 KKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           +KKK+KK+ K + K+   KK + E+   RK R
Sbjct: 61  RKKKEKKEVKAESKRYNAKKLTLEQ---RKAR 89



 Score = 31.0 bits (71), Expect = 0.50
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKK 850
           D  +KKK+KK+ K + K+   KK
Sbjct: 58  DPSRKKKEKKEVKAESKRYNAKK 80



 Score = 31.0 bits (71), Expect = 0.66
 Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 833 KKKKKKKKKKKKKKKKKKKKM 853
           +KKK+KK+ K + K+   KK+
Sbjct: 61  RKKKEKKEVKAESKRYNAKKL 81



 Score = 30.2 bits (69), Expect = 1.2
 Identities = 9/21 (42%), Positives = 14/21 (66%)

Query: 722 EKKKKKKKKKKKKKKKKKKKK 742
            +KKK+KK+ K + K+   KK
Sbjct: 60  SRKKKEKKEVKAESKRYNAKK 80



 Score = 29.8 bits (68), Expect = 1.3
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
           KKK+KK+ K + K+   KK   +++K
Sbjct: 62  KKKEKKEVKAESKRYNAKKLTLEQRK 87



 Score = 29.4 bits (67), Expect = 1.9
 Identities = 9/20 (45%), Positives = 14/20 (70%)

Query: 724 KKKKKKKKKKKKKKKKKKKK 743
           +KKK+KK+ K + K+   KK
Sbjct: 61  RKKKEKKEVKAESKRYNAKK 80



 Score = 29.4 bits (67), Expect = 1.9
 Identities = 9/20 (45%), Positives = 14/20 (70%)

Query: 725 KKKKKKKKKKKKKKKKKKKK 744
           +KKK+KK+ K + K+   KK
Sbjct: 61  RKKKEKKEVKAESKRYNAKK 80



 Score = 29.4 bits (67), Expect = 1.9
 Identities = 9/20 (45%), Positives = 14/20 (70%)

Query: 726 KKKKKKKKKKKKKKKKKKKK 745
           +KKK+KK+ K + K+   KK
Sbjct: 61  RKKKEKKEVKAESKRYNAKK 80



 Score = 29.4 bits (67), Expect = 1.9
 Identities = 9/20 (45%), Positives = 14/20 (70%)

Query: 832 KKKKKKKKKKKKKKKKKKKK 851
           +KKK+KK+ K + K+   KK
Sbjct: 61  RKKKEKKEVKAESKRYNAKK 80



 Score = 29.1 bits (66), Expect = 3.1
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           +KK+KK+ K + K+   KK   +++K 
Sbjct: 62  KKKEKKEVKAESKRYNAKKLTLEQRKA 88



 Score = 27.5 bits (62), Expect = 8.2
 Identities = 9/33 (27%), Positives = 19/33 (57%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
             K+E++E K + K+   KK   +++K +  +K
Sbjct: 61  RKKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93



 Score = 27.5 bits (62), Expect = 9.0
 Identities = 8/28 (28%), Positives = 18/28 (64%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           ++K+KK+ K + K+   KK   +++K +
Sbjct: 62  KKKEKKEVKAESKRYNAKKLTLEQRKAR 89


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 35.6 bits (82), Expect = 0.066
 Identities = 16/30 (53%), Positives = 21/30 (70%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
           EKKKK K KK K K KK + KKK ++ ++N
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119



 Score = 33.7 bits (77), Expect = 0.24
 Identities = 18/50 (36%), Positives = 24/50 (48%)

Query: 699 INHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            + Y   V     I++  K      +KKKK K KK K K KK + KKK +
Sbjct: 65  SHDYEVLVTGNTGISWRRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 32.5 bits (74), Expect = 0.61
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query: 728 KKKKKKKKKKKKKKKKKKKKKYSNEE 753
           KKKK K KK K K KK + KK + E 
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAREG 116



 Score = 32.5 bits (74), Expect = 0.64
 Identities = 15/23 (65%), Positives = 16/23 (69%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKK 852
           EKKKK K KK K K KK + KKK
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDKKK 112



 Score = 32.1 bits (73), Expect = 0.80
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKK 852
           ++K+KK K KK K K KK + KKK +
Sbjct: 89  VEKKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 30.6 bits (69), Expect = 2.9
 Identities = 13/26 (50%), Positives = 14/26 (53%)

Query: 729 KKKKKKKKKKKKKKKKKKKKYSNEEE 754
           KKKK K KK K K KK + K    E 
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAREG 116



 Score = 30.2 bits (68), Expect = 3.2
 Identities = 14/27 (51%), Positives = 18/27 (66%)

Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKKGY 858
           KKKK K KK K K KK + KK  ++G+
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAREGW 117


>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated.  This family of
           proteins is associated with U3 snoRNA. U3 snoRNA is
           required for nucleolar processing of pre-18S ribosomal
           RNA.
          Length = 87

 Score = 33.5 bits (77), Expect = 0.068
 Identities = 16/35 (45%), Positives = 21/35 (60%)

Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
              P+  EEE KK+KKK K +  KK +K+ K  KK
Sbjct: 18  TPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52



 Score = 32.0 bits (73), Expect = 0.24
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
           P +  EEE KK+KKK K +  KK +K+ K  KK
Sbjct: 20  PPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52



 Score = 31.6 bits (72), Expect = 0.29
 Identities = 16/34 (47%), Positives = 21/34 (61%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           P  E  EE+ KK+KKK K +  KK +K+ K  KK
Sbjct: 19  PPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52



 Score = 30.1 bits (68), Expect = 0.98
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
            +E+ KK+KKK K +  KK +K+ K +KKG
Sbjct: 24  YEEELKKQKKKNKLRFLKKLEKRPKDVKKG 53



 Score = 29.7 bits (67), Expect = 1.7
 Identities = 12/36 (33%), Positives = 17/36 (47%)

Query: 817 PKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
           P           K++KKK K +  KK +K+ K  KK
Sbjct: 17  PTPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
           (DUF1764).  This is a family of eukaryotic proteins of
           unknown function. This family contains many hypothetical
           proteins.
          Length = 98

 Score = 33.6 bits (77), Expect = 0.069
 Identities = 16/39 (41%), Positives = 20/39 (51%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           +KKKK+  K  + K  KK +KK KKK    E     K R
Sbjct: 25  KKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRR 63



 Score = 33.6 bits (77), Expect = 0.083
 Identities = 16/49 (32%), Positives = 26/49 (53%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           + I    K+ +++KK+  K  + K  KK +KK KKK +     EE  R+
Sbjct: 15  DDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRR 63



 Score = 31.3 bits (71), Expect = 0.54
 Identities = 14/53 (26%), Positives = 26/53 (49%), Gaps = 4/53 (7%)

Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE----EEGMR 757
           F+ I    K+++   K  + K  KK +KK KKK +  +    ++    E+G+ 
Sbjct: 18  FSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRRRTEDGLP 70



 Score = 29.7 bits (67), Expect = 1.5
 Identities = 15/45 (33%), Positives = 24/45 (53%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
           K ++++K+  K  + K  KK +KK KKK +     EE   R+  D
Sbjct: 23  KRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRRRTED 67



 Score = 28.6 bits (64), Expect = 4.0
 Identities = 12/32 (37%), Positives = 15/32 (46%)

Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
                 KK+KKKKK+  K  + K  KK   K 
Sbjct: 16  DIFSNIKKRKKKKKRTAKTARPKATKKGQKKD 47



 Score = 28.2 bits (63), Expect = 6.2
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDV 861
           ++KKKK+  K  + K  KK +KK   K  +P+ 
Sbjct: 24  RKKKKKRTAKTARPKATKKGQKKDKKKDEFPEF 56



 Score = 28.2 bits (63), Expect = 6.8
 Identities = 13/46 (28%), Positives = 23/46 (50%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           K E+ +K+         KK+KKKKK+  K  +    ++G +K +  
Sbjct: 5   KNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKK 50


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 34.2 bits (79), Expect = 0.071
 Identities = 11/45 (24%), Positives = 26/45 (57%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
            KE+ + +KK+++KK + +K  K   ++K + +    E+ + +L 
Sbjct: 18  AKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELE 62



 Score = 33.8 bits (78), Expect = 0.10
 Identities = 14/37 (37%), Positives = 25/37 (67%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           ++E +K   K+K K +KK+++KK + +K  K S EE+
Sbjct: 10  DKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEK 46



 Score = 32.2 bits (74), Expect = 0.37
 Identities = 14/44 (31%), Positives = 24/44 (54%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           E EE+K    K+  K   K+K K +KK+++  +E E + K+   
Sbjct: 1   EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAE 44



 Score = 32.2 bits (74), Expect = 0.38
 Identities = 8/43 (18%), Positives = 22/43 (51%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           +K+ E EK  K   ++K + + +K +K+ ++ +       ++ 
Sbjct: 30  EKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKA 72



 Score = 32.2 bits (74), Expect = 0.43
 Identities = 10/39 (25%), Positives = 23/39 (58%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           +K++EE+K + +K  K   ++K + + +K +K+    E 
Sbjct: 25  EKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEA 63



 Score = 31.8 bits (73), Expect = 0.43
 Identities = 11/47 (23%), Positives = 26/47 (55%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
           +EE++ + +K  K   ++K + + +K +K+ +    E   R+L+  A
Sbjct: 28  QEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEA 74



 Score = 31.5 bits (72), Expect = 0.70
 Identities = 9/42 (21%), Positives = 23/42 (54%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
            + E+++++KK + +K  K   ++K + + +K   E E +  
Sbjct: 22  AKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEA 63



 Score = 30.3 bits (69), Expect = 1.5
 Identities = 15/43 (34%), Positives = 25/43 (58%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           DK   +EK K +KK+++KK + +K  K   ++K   E E + K
Sbjct: 14  DKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEK 56



 Score = 29.9 bits (68), Expect = 2.2
 Identities = 13/63 (20%), Positives = 30/63 (47%), Gaps = 10/63 (15%)

Query: 686 EKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
           EK     +SE +++   + E          +K E E +K +K+ ++ + +  +++ K + 
Sbjct: 25  EKKQEEKKSEAEKLAKMSAE----------EKAEYELEKLEKELEELEAELARRELKAEA 74

Query: 746 KKK 748
           KK 
Sbjct: 75  KKM 77



 Score = 29.5 bits (67), Expect = 3.0
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 1/50 (2%)

Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINR 874
             + KEK K +KK+++KK + +K  K    +K   + L ++EKEL  +  
Sbjct: 15  KAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYE-LEKLEKELEELEA 63



 Score = 28.4 bits (64), Expect = 6.9
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
            K+K K +KK+++KK + +K  K       E  + KL   
Sbjct: 18  AKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKE 57


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this family
           (Kri1p) is found to be required for 40S ribosome
           biogenesis in the nucleolus.
          Length = 99

 Score = 33.4 bits (77), Expect = 0.083
 Identities = 12/56 (21%), Positives = 31/56 (55%), Gaps = 10/56 (17%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE 772
           KE +EE+K +++++ K+ K  K+++ ++K          + K++  A ++   + E
Sbjct: 1   KERKEEEKAQREEELKRLKNLKREEIEEK----------LEKIKKVAGLRGADLSE 46



 Score = 31.1 bits (71), Expect = 0.55
 Identities = 7/28 (25%), Positives = 21/28 (75%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMK 855
           ++EK +++++ K+ K  K+++ ++K+ K
Sbjct: 5   EEEKAQREEELKRLKNLKREEIEEKLEK 32



 Score = 31.1 bits (71), Expect = 0.63
 Identities = 8/29 (27%), Positives = 22/29 (75%), Gaps = 1/29 (3%)

Query: 829 KEKKKK-KKKKKKKKKKKKKKKKKKMMKK 856
           KE+K++ K +++++ K+ K  K++++ +K
Sbjct: 1   KERKEEEKAQREEELKRLKNLKREEIEEK 29


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 35.5 bits (82), Expect = 0.090
 Identities = 13/42 (30%), Positives = 24/42 (57%)

Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
             +KK+K+KK++ K +KK+   +KY  EE   +  ++    Q
Sbjct: 160 LLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQ 201


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 35.8 bits (83), Expect = 0.10
 Identities = 32/171 (18%), Positives = 68/171 (39%), Gaps = 25/171 (14%)

Query: 705 EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK----------KYSNEEE 754
           E  +  Q+A   +++ E++ KK + K +K++ + ++ +K  ++               EE
Sbjct: 276 ERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQLIEE 335

Query: 755 GMRKLRDAASIQCGPVKEVPCEPQIAPCLFD-------IKNDPCEKNNLADRSEVQRINH 807
           G++ +R A       +K +  +    P            K    + N     SE++    
Sbjct: 336 GLKSVRLADFYGNEEIK-IELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIA 394

Query: 808 YTTEVGYLDPKQRFNQIAYLD--KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
           Y         K    +       +E +++  ++   K KKKK+KKK+  +K
Sbjct: 395 YYESA-----KTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440



 Score = 33.5 bits (77), Expect = 0.49
 Identities = 15/65 (23%), Positives = 27/65 (41%), Gaps = 1/65 (1%)

Query: 692 DRSEDQRINHYTTEVGRF-NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
             SE +    Y         +       EE  ++  ++   K KKKK+KKK+  +K ++ 
Sbjct: 385 QLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEKFRWF 444

Query: 751 NEEEG 755
              +G
Sbjct: 445 VSSDG 449



 Score = 32.0 bits (73), Expect = 1.9
 Identities = 18/90 (20%), Positives = 28/90 (31%), Gaps = 24/90 (26%)

Query: 680 IKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEE------EKKKKKKKKKKK 733
           I+ D     +       QR      ++            E +      E  K   +K + 
Sbjct: 353 IELDK----SKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEG 408

Query: 734 KK--------------KKKKKKKKKKKKKY 749
           KK               K KKKK+KKK+ +
Sbjct: 409 KKAIEEIREELIEEGLLKSKKKKRKKKEWF 438


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 34.9 bits (81), Expect = 0.11
 Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 1/39 (2%)

Query: 821 FNQI-AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
           F  I  Y + EK+KKK   K++KK  K++K K     GY
Sbjct: 88  FTPIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEPYGY 126



 Score = 32.9 bits (76), Expect = 0.53
 Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           E +K+KKK   K++KK  K++K K ++ Y
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEPY 124



 Score = 30.2 bits (69), Expect = 3.3
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
           E EK+KKK   K++KK  K++K K ++ 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123



 Score = 30.2 bits (69), Expect = 3.4
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
           E E+EKKK   K++KK  K++K K ++ 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123



 Score = 29.9 bits (68), Expect = 4.9
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           E E++KKK   K++KK  K++K K ++   Y
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEPYGY 126



 Score = 29.1 bits (66), Expect = 8.6
 Identities = 10/28 (35%), Positives = 19/28 (67%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
           + E+E+KK   K++KK  K++K K ++ 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 36.0 bits (83), Expect = 0.11
 Identities = 14/35 (40%), Positives = 16/35 (45%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           Y    +   +KK K KKK    K K  K  KKK K
Sbjct: 739 YAGTPKTPYEKKTKAKKKSASTKGKAAKTVKKKSK 773



 Score = 35.6 bits (82), Expect = 0.16
 Identities = 14/27 (51%), Positives = 14/27 (51%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           EKK K KKK    K K  K  KKK K 
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKA 774



 Score = 34.8 bits (80), Expect = 0.25
 Identities = 14/38 (36%), Positives = 19/38 (50%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
           E++ K KKK    K K  K  KKK K K K + ++   
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAG 785



 Score = 34.8 bits (80), Expect = 0.25
 Identities = 15/45 (33%), Positives = 18/45 (40%)

Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGP 769
           +KK K KKK    K K  K  KKK   + +   K R        P
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSP 792



 Score = 34.0 bits (78), Expect = 0.48
 Identities = 17/53 (32%), Positives = 21/53 (39%), Gaps = 4/53 (7%)

Query: 696 DQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           D  I  Y        +  Y  K + ++K    K K  K  KKK K K KK  K
Sbjct: 733 DAVITKYAGT----PKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTK 781



 Score = 33.6 bits (77), Expect = 0.62
 Identities = 18/54 (33%), Positives = 20/54 (37%), Gaps = 4/54 (7%)

Query: 823 QIAYLDKEKKKK----KKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANI 872
           +  Y  K K KK     K K  K  KKK K K K   K     L      LA +
Sbjct: 744 KTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPALAAM 797



 Score = 33.6 bits (77), Expect = 0.62
 Identities = 15/38 (39%), Positives = 20/38 (52%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
           E++ + KKK    K K  K  KKK K K KK + +  G
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAG 785



 Score = 32.1 bits (73), Expect = 1.5
 Identities = 13/33 (39%), Positives = 16/33 (48%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
             K++    K K  K  KKK K K KK  KK+ 
Sbjct: 752 KAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 35.1 bits (81), Expect = 0.12
 Identities = 6/35 (17%), Positives = 30/35 (85%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
            +++++K++++++K++KK+++++++K+K ++ +  
Sbjct: 192 LKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQK 226



 Score = 32.8 bits (75), Expect = 0.67
 Identities = 24/142 (16%), Positives = 74/142 (52%), Gaps = 25/142 (17%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE--EEGMR-KLRDAASIQCGPVKEVP 774
            + ++K +K  ++K+K++++K++++ + +++ + E  EE  R K + AA  +        
Sbjct: 98  AQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPK----- 152

Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
                       K +  E  + +     +   + + E    + K+R  +  +  K+ K++
Sbjct: 153 -----------HKKEAAESASSSLSGSAKPERNVSQE----EAKKRLQE--WELKKLKQQ 195

Query: 835 KKKKKKKKKKKKKKKKKKMMKK 856
           ++K++++++K++KK++++  +K
Sbjct: 196 QQKREEERRKQRKKQQEEEERK 217



 Score = 30.9 bits (70), Expect = 2.4
 Identities = 24/141 (17%), Positives = 58/141 (41%), Gaps = 42/141 (29%)

Query: 716 DKEEEEEKKKKKKKKKKKK----KKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVK 771
           +KE EE + +++  K+K +    +K ++  K++  K      E       A+S   G  K
Sbjct: 117 EKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAES------ASSSLSGSAK 170

Query: 772 EVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEK 831
                 Q        K              +Q          +   K +        ++K
Sbjct: 171 PERNVSQEE-----AKK------------RLQ---------EWELKKLK------QQQQK 198

Query: 832 KKKKKKKKKKKKKKKKKKKKK 852
           ++++++K++KK+++++++K+K
Sbjct: 199 REEERRKQRKKQQEEEERKQK 219



 Score = 29.7 bits (67), Expect = 7.2
 Identities = 13/47 (27%), Positives = 33/47 (70%), Gaps = 4/47 (8%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
           K+  +E + KK K++++K++++++K++KK++    EEE  ++  + A
Sbjct: 181 KKRLQEWELKKLKQQQQKREEERRKQRKKQQ----EEEERKQKAEEA 223


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 35.5 bits (82), Expect = 0.12
 Identities = 14/51 (27%), Positives = 30/51 (58%), Gaps = 1/51 (1%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMR-KLRDAASI 765
            + E E   K  + K+K K ++ K+K++K+ ++ + EE+ ++ KL   A +
Sbjct: 262 FESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARL 312



 Score = 33.5 bits (77), Expect = 0.54
 Identities = 11/45 (24%), Positives = 25/45 (55%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
            +  E E +   K  + K+K K ++ K+K++K+   E +  ++L+
Sbjct: 259 WEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLK 303



 Score = 32.4 bits (74), Expect = 1.00
 Identities = 12/48 (25%), Positives = 25/48 (52%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRT 875
           + E   K  + K+K K ++ K+K++   +       Q++K+LA + R 
Sbjct: 265 EYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARL 312



 Score = 32.4 bits (74), Expect = 1.3
 Identities = 11/51 (21%), Positives = 25/51 (49%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASI 765
            +  E E +   K  + K+K K ++ K+K++K+     +E  +  +  A +
Sbjct: 259 WEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQL 309



 Score = 32.0 bits (73), Expect = 1.6
 Identities = 8/35 (22%), Positives = 23/35 (65%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
             +K    ++K K ++ K+K++K+ +++ K++K+ 
Sbjct: 268 PINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQL 302



 Score = 31.6 bits (72), Expect = 2.1
 Identities = 34/161 (21%), Positives = 75/161 (46%), Gaps = 22/161 (13%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
            +EE EK+ K +KK+++ ++ ++KK +K   + S  +E    L + +        +    
Sbjct: 199 LQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESA 258

Query: 777 PQIAPCLFDIKNDPCE--------KNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLD 828
            +     ++  N P          +N    R E++R      ++     K++  Q+A L 
Sbjct: 259 WEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQL-----KKKLAQLARLK 313

Query: 829 K---------EKKKKKKKKKKKKKKKKKKKKKKMMKKGYPD 860
           +         + + +KK+++K++ +KKK K++K+ K  YP+
Sbjct: 314 EIAKEVAQKEKARARKKEQRKERGEKKKLKRRKLGKHKYPE 354



 Score = 31.2 bits (71), Expect = 2.3
 Identities = 12/66 (18%), Positives = 35/66 (53%), Gaps = 10/66 (15%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK----------KKKYSNEEEGMRKL 759
           N+   P ++ + ++ K+K++K+ +++ K++K+ KKK           K+ + +E+   + 
Sbjct: 270 NKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARK 329

Query: 760 RDAASI 765
           ++    
Sbjct: 330 KEQRKE 335



 Score = 31.2 bits (71), Expect = 2.5
 Identities = 26/136 (19%), Positives = 51/136 (37%), Gaps = 47/136 (34%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
             E  E + +   K  + K+K K ++ K+K++K   E E   K                 
Sbjct: 258 AWEGFESEYEPINKPVRPKRKTKAQRNKEKRRK---ELEREAKEEKQ------------- 301

Query: 776 EPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKK 835
                            K  LA  + ++ I                 ++A  +K + +KK
Sbjct: 302 ----------------LKKKLAQLARLKEIAK---------------EVAQKEKARARKK 330

Query: 836 KKKKKKKKKKKKKKKK 851
           +++K++ +KKK K++K
Sbjct: 331 EQRKERGEKKKLKRRK 346



 Score = 30.8 bits (70), Expect = 3.8
 Identities = 18/59 (30%), Positives = 27/59 (45%), Gaps = 16/59 (27%)

Query: 715 PDKEEEEEKKKKKKKKK----------------KKKKKKKKKKKKKKKKKYSNEEEGMR 757
            D+   ++K++KK KK                   KK+  KKK KKKK K  + +E +R
Sbjct: 49  GDEATIKKKRRKKLKKPLKSDEILENRSKVPAPSVKKRHVKKKSKKKKSKQVSRKELLR 107



 Score = 29.7 bits (67), Expect = 6.7
 Identities = 15/45 (33%), Positives = 31/45 (68%)

Query: 705 EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           ++ R  +IA    ++E+ + +KK+++K++ +KKK K++K  K KY
Sbjct: 308 QLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLKRRKLGKHKY 352



 Score = 29.7 bits (67), Expect = 8.3
 Identities = 9/36 (25%), Positives = 22/36 (61%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            Y    +    K+K K ++ K+K++K+ +++ K++K
Sbjct: 265 EYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEK 300


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 35.0 bits (81), Expect = 0.12
 Identities = 14/39 (35%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 821 FNQI-AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
           F Q+ AY   +K++KK   K++KK  K +K+K   + G+
Sbjct: 87  FTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEEYGF 125



 Score = 30.8 bits (70), Expect = 2.2
 Identities = 10/29 (34%), Positives = 21/29 (72%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           + +K++KK   K++KK  K +K+K +++Y
Sbjct: 95  KAQKEEKKAMSKEEKKAIKAEKEKLEEEY 123



 Score = 29.6 bits (67), Expect = 5.8
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
           F Q+    K ++EEKK   K++KK  K +K+K +++  
Sbjct: 87  FTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 35.2 bits (81), Expect = 0.12
 Identities = 14/36 (38%), Positives = 16/36 (44%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
             P      E K+ KK KK +  K  K  K KK KK
Sbjct: 134 VPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKK 169



 Score = 34.9 bits (80), Expect = 0.15
 Identities = 17/38 (44%), Positives = 19/38 (50%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
           E +E KK KK +  K  K  K KK KKK   SN    M
Sbjct: 143 EAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKM 180



 Score = 32.5 bits (74), Expect = 0.86
 Identities = 15/33 (45%), Positives = 17/33 (51%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
              E K+ KK KK +  K  K  K KK KKK S
Sbjct: 140 APPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGS 172



 Score = 31.0 bits (70), Expect = 2.4
 Identities = 13/51 (25%), Positives = 21/51 (41%), Gaps = 4/51 (7%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKK----KKKKKYSNEEEGMRKLRD 761
           P+ +E ++ KK +  K  K  K KK KKK     +  K    +   +    
Sbjct: 142 PEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWK 192


>gnl|CDD|153327 cd07643, I-BAR_IMD_MIM, Inverse (I)-BAR, also known as the
           IRSp53/MIM homology Domain (IMD), of Missing In
           Metastasis.  The IMD domain, also called
           Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a
           dimerization and lipid-binding module that bends
           membranes and induces membrane protrusions. Members of
           this subfamily include missing in metastasis (MIM) or
           metastasis suppressor 1 (MTSS1), metastasis suppressor
           1-like (MTSSL) or ABBA (Actin-Bundling protein with
           BAIAP2 homology), and similar proteins. They contain an
           N-terminal IMD and a WASP homology 2 (WH2) actin-binding
           motif at the C-terminus. MIM was originally identified
           as a missing transcript from metastatic bladder and
           prostate cancer cells. It is a scaffold protein that
           functions in a signaling pathway between the PDGF
           receptor, Src kinases, and actin assembly. It may also
           function as a cofactor of the Sonic hedgehog (Shh)
           transcriptional pathway and may participate in tumor
           development and progression via this pathway. ABBA
           regulates actin and plasma membrane dynamics to promote
           the extension of radial glia, which is important in
           neuronal migration, axon guidance and neurogenesis. The
           IMD domain of MIM binds and bundles actin filaments,
           binds membranes, and interacts with the small GTPase
           Rac.
          Length = 231

 Score = 34.7 bits (80), Expect = 0.12
 Identities = 15/51 (29%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 825 AYLDKEKKKKKKKKKKKKKKKKKK--KKKKMMKKGYPDVLSQMEKELANIN 873
             LDK+  K+ KK +++ KKK     + +K  +KG  D+  Q++  + ++N
Sbjct: 114 NQLDKDHAKEYKKARQEIKKKSSDTIRLQKKARKGKGDLQPQLDSAMQDVN 164


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 35.4 bits (82), Expect = 0.13
 Identities = 22/153 (14%), Positives = 48/153 (31%), Gaps = 25/153 (16%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQ 778
                 K +   +++ KKK KK   K K K    +E +++  ++       + +V     
Sbjct: 1   MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIF-- 58

Query: 779 IAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKK 838
                  +     + ++  +    ++      +                    K   KKK
Sbjct: 59  -------LSGMVKDTDDATESDIPKKKTKTAAK----------------AAAAKAPAKKK 95

Query: 839 KKKKKKKKKKKKKKMMKKGYPDVLSQMEKELAN 871
            K +    KK +KK       D+    + ++ N
Sbjct: 96  LKDELDSSKKAEKKNALDKDDDLNYVKDIDVLN 128



 Score = 35.4 bits (82), Expect = 0.15
 Identities = 14/68 (20%), Positives = 22/68 (32%)

Query: 681 KNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKK 740
             +  E         DQ +   +  V   +     D  +++ K   K    K   KKK K
Sbjct: 38  IKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLK 97

Query: 741 KKKKKKKK 748
            +    KK
Sbjct: 98  DELDSSKK 105



 Score = 33.4 bits (77), Expect = 0.55
 Identities = 11/40 (27%), Positives = 18/40 (45%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           P K+ +   K    K   KKK K +    KK +K +  ++
Sbjct: 75  PKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDK 114



 Score = 32.3 bits (74), Expect = 1.4
 Identities = 9/36 (25%), Positives = 13/36 (36%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
              +   +KK K +    KK +KK    K     Y 
Sbjct: 86  AAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYV 121



 Score = 31.5 bits (72), Expect = 2.2
 Identities = 11/44 (25%), Positives = 19/44 (43%)

Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
           ++   K+ ++  +    KKK K   K    K   KKK  +E + 
Sbjct: 59  LSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDS 102



 Score = 31.5 bits (72), Expect = 2.4
 Identities = 17/78 (21%), Positives = 22/78 (28%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKG 887
             +  +    KKK K   K    K   KK   D L   +K            +N   D  
Sbjct: 66  TDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDID 125

Query: 888 GDPKNFDHAWSIFGDDLK 905
              +  D       DDL 
Sbjct: 126 VLNQADDDDDDDDDDDLD 143



 Score = 31.1 bits (71), Expect = 2.6
 Identities = 10/38 (26%), Positives = 15/38 (39%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
            K   +    K   KKK K +    KK +KK     ++
Sbjct: 79  TKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDD 116



 Score = 31.1 bits (71), Expect = 3.3
 Identities = 10/49 (20%), Positives = 21/49 (42%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
             +   +    K   KKK K +    KK +KK   ++++ +  ++D   
Sbjct: 78  KTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDV 126



 Score = 30.7 bits (70), Expect = 3.7
 Identities = 12/50 (24%), Positives = 17/50 (34%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
                 +   KKK K +    KK +KK    K    N  + +  L  A  
Sbjct: 83  AKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADD 132


>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 304

 Score = 35.0 bits (81), Expect = 0.13
 Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 3/40 (7%)

Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           F +I   DKEEE +K+ K +++     K   K K+ KKK 
Sbjct: 267 FGKI---DKEEEAKKQAKAERRAANAPKPGAKPKRSKKKA 303


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 35.6 bits (82), Expect = 0.13
 Identities = 19/39 (48%), Positives = 21/39 (53%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           DK    E K KK K KKK KK KKK+ K+   K   EE 
Sbjct: 137 DKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEES 175



 Score = 34.8 bits (80), Expect = 0.19
 Identities = 17/35 (48%), Positives = 19/35 (54%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
           E K KK K KKK KK KKK+ K+   K    E E 
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESES 177



 Score = 33.7 bits (77), Expect = 0.51
 Identities = 15/40 (37%), Positives = 22/40 (55%)

Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           + K KK K KKK KK KKK+ K+    ++E   +  D + 
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESK 182



 Score = 32.5 bits (74), Expect = 0.99
 Identities = 32/156 (20%), Positives = 56/156 (35%), Gaps = 28/156 (17%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD-----------AASIQCGPVK 771
           KK K  +++K  ++ K  +KK K  K +  EE +  L+             + +      
Sbjct: 14  KKAKGFERQKLIRRIKNAEKKSKNDKLARIEEEIEALKTLDLDQLAKKYLFSKLIKVITS 73

Query: 772 EVPCEPQIAPCLFDIKND-------------PCEKNNLADR--SEVQRINHYTTEVGYLD 816
           +       +P    I ND               EK    +   S +         +  + 
Sbjct: 74  KPTKAWAESPEFRQIINDKSHKANPSRSEAEVVEKLKSLNNVVSRLLNSKPVKNVLESIM 133

Query: 817 PKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
                ++I  ++ + KK K KKK KK KKK+ K+  
Sbjct: 134 E--GLDKILGIETKAKKGKAKKKTKKSKKKEAKESS 167



 Score = 32.5 bits (74), Expect = 1.1
 Identities = 15/38 (39%), Positives = 22/38 (57%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
           E + +K K KKK KK KKK+ K+   K  ++ S  E+ 
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDE 180



 Score = 32.5 bits (74), Expect = 1.1
 Identities = 14/46 (30%), Positives = 26/46 (56%)

Query: 708 RFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
             ++I   + + ++ K KKK KK KKK+ K+   K  +++  S +E
Sbjct: 135 GLDKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDE 180



 Score = 31.0 bits (70), Expect = 3.4
 Identities = 18/46 (39%), Positives = 25/46 (54%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           E + KK K KKK KK KKK+ K+   K  +  +E E   K  ++A 
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAE 188



 Score = 30.2 bits (68), Expect = 5.2
 Identities = 13/39 (33%), Positives = 24/39 (61%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
            + + ++ K KKK KK KKK+ K+   K  +++  +E+E
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDE 180



 Score = 30.2 bits (68), Expect = 5.3
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
             +K K KKK KK KKK+ K+   K
Sbjct: 145 KAKKGKAKKKTKKSKKKEAKESSDK 169


>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
          Length = 381

 Score = 35.4 bits (82), Expect = 0.14
 Identities = 17/66 (25%), Positives = 30/66 (45%), Gaps = 3/66 (4%)

Query: 687 KNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
           KN L +R    R+N    ++ +   +A   +E+   K  +K K ++ +KK KK +  K  
Sbjct: 316 KNPLKNRRVRLRLNPAAKKLRKLAVLA---QEKATAKGAQKVKNRRARKKAKKARLAKVA 372

Query: 747 KKYSNE 752
           K     
Sbjct: 373 KALYKS 378



 Score = 35.0 bits (81), Expect = 0.18
 Identities = 16/60 (26%), Positives = 25/60 (41%), Gaps = 7/60 (11%)

Query: 793 KNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
           KN L +R    R+N           K R   +   +K   K  +K K ++ +KK KK + 
Sbjct: 316 KNPLKNRRVRLRLNPAA-------KKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARL 368


>gnl|CDD|236275 PRK08486, PRK08486, single-stranded DNA-binding protein;
           Provisional.
          Length = 182

 Score = 34.4 bits (79), Expect = 0.14
 Identities = 13/67 (19%), Positives = 27/67 (40%)

Query: 419 IENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENG 478
           +++ +   +    + N  +EN   N    YN   ++ Y N    YN       N + +  
Sbjct: 105 LDSKSDNPQANAMQDNSFHENFNNNYPGNYNNPSQDPYMNQAQSYNQNAYAKENQQAQPP 164

Query: 479 THEYNIP 485
            +E ++P
Sbjct: 165 KYENSVP 171


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
           structure and biogenesis].
          Length = 107

 Score = 33.0 bits (75), Expect = 0.14
 Identities = 22/67 (32%), Positives = 30/67 (44%), Gaps = 6/67 (8%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
             K +  ++KK +K K    K KKK  KKKKK++         +L D         KEVP
Sbjct: 2   APKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDKIR------KEVP 55

Query: 775 CEPQIAP 781
            E  + P
Sbjct: 56  RERVVTP 62



 Score = 29.5 bits (66), Expect = 2.6
 Identities = 13/35 (37%), Positives = 20/35 (57%)

Query: 817 PKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKK 851
           PK + ++    +K K    K KKK  KKKKK++ +
Sbjct: 3   PKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEAR 37



 Score = 28.7 bits (64), Expect = 5.0
 Identities = 15/38 (39%), Positives = 19/38 (50%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQ 864
           L KEKK +K K    K KKK  KKKK  +      + +
Sbjct: 7   LSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDE 44



 Score = 28.3 bits (63), Expect = 6.3
 Identities = 12/38 (31%), Positives = 19/38 (50%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
              K +  K+KK +K K    K KKK+S +++     R
Sbjct: 1   MAPKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARR 38


>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
           (ETRAMP).  These sequences represent a family of
           proteins from the malaria parasite Plasmodium
           falciparum, several of which have been shown to be
           expressed specifically in the ring stage as well as the
           rodent parasite Plasmodium yoelii. A homologue from
           Plasmodium chabaudi was localized to the parasitophorous
           vacuole membrane. Members have an initial hydrophobic,
           Phe/Tyr-rich, stretch long enough to span the membrane,
           a highly charged region rich in Lys, a second putative
           transmembrane region and a second highly charged, low
           complexity sequence region. Some members have up to 100
           residues of additional C-terminal sequence. These genes
           have been shown to be found in the sub-telomeric regions
           of both Plasmodium falciparum and P. yoelii chromosomes.
          Length = 82

 Score = 32.2 bits (74), Expect = 0.16
 Identities = 16/30 (53%), Positives = 17/30 (56%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
             KKKK KK KK     +KKKK KKK   S
Sbjct: 28  VAKKKKLKKLKKIDDDLEKKKKNKKKIIIS 57



 Score = 31.8 bits (73), Expect = 0.26
 Identities = 14/26 (53%), Positives = 18/26 (69%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMM 854
           K+KK KK KK     +KKKK KKK++
Sbjct: 30  KKKKLKKLKKIDDDLEKKKKNKKKII 55



 Score = 30.3 bits (69), Expect = 0.70
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKK 745
             +KKK KK KK     +KKKK KKK
Sbjct: 28  VAKKKKLKKLKKIDDDLEKKKKNKKK 53



 Score = 29.1 bits (66), Expect = 1.8
 Identities = 13/29 (44%), Positives = 17/29 (58%)

Query: 826 YLDKEKKKKKKKKKKKKKKKKKKKKKKMM 854
            +  +KKK KK KK     +KKKK KK +
Sbjct: 26  NVVAKKKKLKKLKKIDDDLEKKKKNKKKI 54



 Score = 28.8 bits (65), Expect = 2.6
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKK 744
             ++KK KK KK     +KKKK KKK
Sbjct: 28  VAKKKKLKKLKKIDDDLEKKKKNKKK 53



 Score = 28.8 bits (65), Expect = 3.0
 Identities = 14/34 (41%), Positives = 18/34 (52%)

Query: 821 FNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMM 854
            N      K+K KK KK     +KKKK KKK ++
Sbjct: 23  LNNNVVAKKKKLKKLKKIDDDLEKKKKNKKKIII 56



 Score = 27.6 bits (62), Expect = 6.7
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKK 743
             +++K KK KK     +KKKK KKK
Sbjct: 28  VAKKKKLKKLKKIDDDLEKKKKNKKK 53


>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
          Length = 177

 Score = 34.1 bits (79), Expect = 0.16
 Identities = 19/52 (36%), Positives = 23/52 (44%), Gaps = 15/52 (28%)

Query: 718 EEEEEKKKKKKKKKK------KKK---------KKKKKKKKKKKKKYSNEEE 754
           E   + K K + +KK      KK          K KKKK K KK K S E+E
Sbjct: 48  EGNTQSKGKGQAQKKDPRIGSKKPIPLGVEEKVKPKKKKPKSKKPKLSPEQE 99


>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain.  This family
           contains a C-terminal presumed domain in Sas10 which
           hash been identified as a regulator of chromatin
           silencing.
          Length = 76

 Score = 31.8 bits (73), Expect = 0.18
 Identities = 10/36 (27%), Positives = 21/36 (58%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           KK+KK +K  K+ K +++  +++   Y  E  G++ 
Sbjct: 33  KKRKKYEKAVKRLKSQRRVVREEDGGYGGETTGIKT 68


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 35.2 bits (81), Expect = 0.18
 Identities = 17/69 (24%), Positives = 36/69 (52%), Gaps = 3/69 (4%)

Query: 686 EKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
           ++  L      + +  YT ++G        D  EE E K K++K K+++ +KK+K++  +
Sbjct: 366 KRQELLKEYNKK-LQDYTKKLGEVKD--ETDASEEAEAKAKEEKLKQEENEKKQKEQADE 422

Query: 746 KKKYSNEEE 754
            K+   ++E
Sbjct: 423 DKEKRQKDE 431



 Score = 32.5 bits (74), Expect = 1.2
 Identities = 12/27 (44%), Positives = 22/27 (81%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKK 743
           K+EE EKK+K++  + K+K++K ++KK
Sbjct: 408 KQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 32.5 bits (74), Expect = 1.2
 Identities = 10/27 (37%), Positives = 21/27 (77%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKK 742
            +EE E+K+K++  + K+K++K ++KK
Sbjct: 408 KQEENEKKQKEQADEDKEKRQKDERKK 434


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
           homolog A (Rrp7A) and similar proteins.  The family
           corresponds to the RRP7 domain of Rrp7A, also termed
           gastric cancer antigen Zg14, and similar proteins which
           are yeast ribosomal RNA-processing protein 7 (Rrp7p)
           homologs mainly found in Metazoans. The cellular
           function of Rrp7A remains unclear currently. Rrp7A
           harbors an N-terminal RNA recognition motif (RRM), also
           termed RBD (RNA binding domain) or RNP
           (ribonucleoprotein domain), and a C-terminal RRP7
           domain.
          Length = 129

 Score = 33.0 bits (76), Expect = 0.19
 Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 22/55 (40%)

Query: 716 DKEEEEEKKKKKK-------------KKKKKKK---------KKKKKKKKKKKKK 748
           DKEEEEEK++K+K             KK ++ K         K  +K+KKKKKKK
Sbjct: 31  DKEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKK 85



 Score = 30.3 bits (69), Expect = 1.8
 Identities = 20/61 (32%), Positives = 30/61 (49%), Gaps = 22/61 (36%)

Query: 713 AYPDKEEEEEKKKKK-------------KKKKKKK---------KKKKKKKKKKKKKKYS 750
            Y  +EEEE+++K+K             KK ++ K         K  +K+KKKKKKK+  
Sbjct: 29  EYDKEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKELK 88

Query: 751 N 751
           N
Sbjct: 89  N 89


>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain.  This presumed domain is
           found at the C-terminus of a family of FtsJ-like
           methyltransferases. Members of this family are involved
           in 60S ribosomal biogenesis.
          Length = 212

 Score = 34.2 bits (79), Expect = 0.19
 Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 4/47 (8%)

Query: 827 LDKEKKKKKKKKKK----KKKKKKKKKKKKMMKKGYPDVLSQMEKEL 869
             K KKKKKK+KK     KK  K K  + K +K  Y  V  +M+K+L
Sbjct: 157 YKKAKKKKKKRKKTYVVAKKGNKGKAGRPKGVKGKYKVVDPRMKKDL 203



 Score = 28.8 bits (65), Expect = 9.2
 Identities = 15/31 (48%), Positives = 17/31 (54%), Gaps = 4/31 (12%)

Query: 723 KKKKKKKKK----KKKKKKKKKKKKKKKKKY 749
           KKKKK+KK     KK  K K  + K  K KY
Sbjct: 162 KKKKKRKKTYVVAKKGNKGKAGRPKGVKGKY 192


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 35.3 bits (81), Expect = 0.21
 Identities = 16/30 (53%), Positives = 23/30 (76%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
           EEE E+KKKK++K K+K+ KK K  +K+ K
Sbjct: 15  EEELERKKKKEEKAKEKELKKLKAAQKEAK 44



 Score = 34.9 bits (80), Expect = 0.23
 Identities = 13/34 (38%), Positives = 24/34 (70%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            +K+   E++ ++KKKK++K K+K+ KK K  +K
Sbjct: 8   AEKKILTEEELERKKKKEEKAKEKELKKLKAAQK 41



 Score = 34.9 bits (80), Expect = 0.24
 Identities = 16/33 (48%), Positives = 23/33 (69%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            +EE E KKKK++K K+K+ KK K  +K+ K K
Sbjct: 14  TEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46



 Score = 32.9 bits (75), Expect = 0.81
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           EEE ++KKKK++K K+K+ KK K  +K+ K   + +
Sbjct: 15  EEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQ 50



 Score = 31.4 bits (71), Expect = 3.1
 Identities = 15/47 (31%), Positives = 30/47 (63%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
            E E+K   +++ ++KKKK++K K+K+ KK  + ++E   KL+   +
Sbjct: 6   SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQA 52



 Score = 29.9 bits (67), Expect = 7.4
 Identities = 15/48 (31%), Positives = 29/48 (60%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINR 874
           L +E+ ++KKKK++K K+K+ KK K   K+    + +Q   +  N+ +
Sbjct: 13  LTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPK 60


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
           describes the eukaryotic 60S (cytosolic) ribosomal
           protein L7 and paralogs that may or may not also be L7.
           Human, Drosophila, and Arabidopsis all have both a
           typical L7 and an L7-related paralog. This family is
           designated subfamily rather than equivalog to reflect
           these uncharacterized paralogs. Members of this family
           average ~ 250 residues in length, somewhat longer than
           the archaeal L30P/L7E homolog (~ 155 residues) and much
           longer than the related bacterial/organellar form (~ 60
           residues).
          Length = 235

 Score = 33.9 bits (78), Expect = 0.24
 Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 4/68 (5%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE---- 772
           K  +E   +  K+ K KKK  KKK+K   K+  S  +E  +  R+   ++    K     
Sbjct: 8   KARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPGKFY 67

Query: 773 VPCEPQIA 780
           VP E ++ 
Sbjct: 68  VPAEHKLV 75



 Score = 30.4 bits (69), Expect = 3.7
 Identities = 19/62 (30%), Positives = 30/62 (48%), Gaps = 4/62 (6%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAV 877
           K++  Q   +   K+ K KKK  KKK+K   K+ +   K Y     + E+E+  + R A 
Sbjct: 6   KRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEY----RKAEREIIRLKRLAK 61

Query: 878 AP 879
            P
Sbjct: 62  KP 63



 Score = 29.6 bits (67), Expect = 5.5
 Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           K +  ++   +  K+ K KKK  KKK+K   K
Sbjct: 6   KRKARQELAVQVAKQAKAKKKANKKKRKIYFK 37


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 34.7 bits (80), Expect = 0.25
 Identities = 14/32 (43%), Positives = 16/32 (50%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
           PD     EKK K K  KK  +K KK   K +K
Sbjct: 765 PDSAFRGEKKAKPKAAKKDARKAKKPSAKTQK 796



 Score = 34.3 bits (79), Expect = 0.30
 Identities = 17/68 (25%), Positives = 27/68 (39%)

Query: 681 KNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKK 740
           K     K     R   +++N       R  + A P   +++ +K KK   K +K     K
Sbjct: 743 KKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATK 802

Query: 741 KKKKKKKK 748
            K+  KKK
Sbjct: 803 AKRAAKKK 810



 Score = 31.3 bits (71), Expect = 3.1
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKM 853
           +K+ K K  KK  +K KK   K +K+
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQKI 797



 Score = 30.5 bits (69), Expect = 5.5
 Identities = 10/38 (26%), Positives = 15/38 (39%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           K +  +K  +K KK   K +K     K K+       E
Sbjct: 776 KPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813



 Score = 30.5 bits (69), Expect = 5.6
 Identities = 13/34 (38%), Positives = 15/34 (44%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKK 851
           K  F   +    EKK K K  KK  +K KK   K
Sbjct: 760 KVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAK 793



 Score = 30.1 bits (68), Expect = 7.6
 Identities = 11/38 (28%), Positives = 17/38 (44%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
             K  +++ +K KK   K +K     K K+  KK   E
Sbjct: 776 KPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 33.4 bits (77), Expect = 0.25
 Identities = 27/141 (19%), Positives = 51/141 (36%), Gaps = 41/141 (29%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
           D   E+E ++ +K  KK K  ++K++ K+  +   +  + ++       I     K+   
Sbjct: 58  DDYREKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQ--- 114

Query: 776 EPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKK 835
                                    E + I          + K+ +    YL K + KK 
Sbjct: 115 -------------------------EKELIK---------EGKKPY----YLKKSEIKKL 136

Query: 836 KKKKKKKKKKKKKKKKKMMKK 856
             KKK  + KK K+  K ++K
Sbjct: 137 VLKKKFDELKKSKQLDKALEK 157



 Score = 30.3 bits (69), Expect = 2.4
 Identities = 12/35 (34%), Positives = 20/35 (57%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
            ++    K +E +K K+  K  +KK+KK   K+KK
Sbjct: 134 KKLVLKKKFDELKKSKQLDKALEKKRKKNAGKEKK 168



 Score = 29.2 bits (66), Expect = 6.0
 Identities = 13/34 (38%), Positives = 19/34 (55%), Gaps = 2/34 (5%)

Query: 717 KEEEEEKKK--KKKKKKKKKKKKKKKKKKKKKKK 748
           K   ++K    KK K+  K  +KK+KK   K+KK
Sbjct: 135 KLVLKKKFDELKKSKQLDKALEKKRKKNAGKEKK 168



 Score = 28.8 bits (65), Expect = 7.3
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           K+   +KK  + KK K+  K  +KK+KK   K
Sbjct: 134 KKLVLKKKFDELKKSKQLDKALEKKRKKNAGK 165



 Score = 28.8 bits (65), Expect = 8.7
 Identities = 13/40 (32%), Positives = 23/40 (57%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
           D+E++  K+ KK++K+  K+ KK   +KK     L   +K
Sbjct: 102 DREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKK 141


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 34.4 bits (79), Expect = 0.26
 Identities = 28/151 (18%), Positives = 46/151 (30%), Gaps = 2/151 (1%)

Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPV 770
             A   K++ EE  K+   K+K+ ++   K     K K   E E  R    A        
Sbjct: 111 LAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAK--AEAEAKRAAAAAKKAAAEAK 168

Query: 771 KEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKE 830
           K+   E          K    E    A     ++      +    + K++    A     
Sbjct: 169 KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAA 228

Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKGYPDV 861
           K   + K   +K    K  +K    K   +V
Sbjct: 229 KAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259



 Score = 33.6 bits (77), Expect = 0.42
 Identities = 8/46 (17%), Positives = 16/46 (34%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
            K+   E K    K   + K   +K    K  + +   +   ++ D
Sbjct: 216 KKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVDD 261


>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
          Length = 635

 Score = 34.5 bits (80), Expect = 0.29
 Identities = 20/82 (24%), Positives = 33/82 (40%), Gaps = 8/82 (9%)

Query: 804 RINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK------G 857
           +I  Y    GY D +Q+  Q   L +   KKK++    K +  K+  KK+  K       
Sbjct: 515 KIGRYVG--GYHDARQQQAQYLALKQPAVKKKEEAAAPKAETVKRSSKKLSYKLQRELEQ 572

Query: 858 YPDVLSQMEKELANINRTAVAP 879
            P +L  +E E+  +       
Sbjct: 573 LPQLLEDLEAEIEALQAQVADA 594


>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16.  Nop16 is a
           protein involved in ribosome biogenesis.
          Length = 173

 Score = 33.2 bits (76), Expect = 0.30
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            +KKKK +    K  +K+ K+K +KKK
Sbjct: 1   VRKKKKNRSSNYKVNRKRLKRKDRKKK 27



 Score = 32.4 bits (74), Expect = 0.58
 Identities = 10/31 (32%), Positives = 15/31 (48%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
           KK +    K  +K+ K+K +KKK     N  
Sbjct: 5   KKNRSSNYKVNRKRLKRKDRKKKINIIGNPI 35



 Score = 32.0 bits (73), Expect = 0.80
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
           KKKK +    K  +K+ K+K +KKK 
Sbjct: 3   KKKKNRSSNYKVNRKRLKRKDRKKKI 28



 Score = 29.7 bits (67), Expect = 4.3
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKK 856
           +KKKK +    K  +K+ K+K  KK
Sbjct: 2   RKKKKNRSSNYKVNRKRLKRKDRKK 26


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 34.2 bits (79), Expect = 0.30
 Identities = 12/44 (27%), Positives = 22/44 (50%)

Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           +      E   + K+K K K ++  +K   + K+KKK+   EE+
Sbjct: 297 KKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEK 340



 Score = 33.1 bits (76), Expect = 0.61
 Identities = 13/43 (30%), Positives = 21/43 (48%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
              E     K+K K K ++  +K   + K+KKK   +EE  +K
Sbjct: 301 LLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKK 343



 Score = 32.3 bits (74), Expect = 1.1
 Identities = 13/60 (21%), Positives = 30/60 (50%)

Query: 696 DQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
            ++I     ++ R  ++    +   + K+K K K ++  +K   + K+KKK+K   E++ 
Sbjct: 283 QEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKK 342



 Score = 30.4 bits (69), Expect = 4.8
 Identities = 14/25 (56%), Positives = 20/25 (80%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
           D EK   + K+KKK+KKK++KKKK+
Sbjct: 320 DNEKLDAEVKEKKKEKKKEEKKKKQ 344



 Score = 30.4 bits (69), Expect = 5.1
 Identities = 21/102 (20%), Positives = 40/102 (39%), Gaps = 7/102 (6%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
           K  + + K+ KK     +     K+K K K +  NE+            +    KE   +
Sbjct: 287 KALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKK----KEEKKK 342

Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPK 818
            QI      I+     +    D+ E + +   T+++ Y+DP+
Sbjct: 343 KQIERLEERIEK---LEVQATDKEENKTVALGTSKINYIDPR 381


>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
           modification].
          Length = 429

 Score = 34.2 bits (78), Expect = 0.31
 Identities = 25/153 (16%), Positives = 53/153 (34%), Gaps = 17/153 (11%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
               + +       K +       ++ ++KKK   +N +    K+ DA        K+V 
Sbjct: 10  SGNNQNKNASVVNNKAEIAAMIDARRLEQKKKGGVTNSKGKTNKVVDAKL--EKEFKDVL 67

Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
              Q+        ND   K    D   + R+           P    +  A +D E   +
Sbjct: 68  QRFQVQE------NDT-PKEITKDEKNMSRM------KSRKAPTI--HMEAEIDDEYGFE 112

Query: 835 KKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
            + K  +++   ++KK+K ++  Y  +   +  
Sbjct: 113 DRMKPYREESLSRQKKRKALQHRYEQLKLVVPY 145


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p), ribosomal RNA-processing protein 7 homolog A
           (Rrp7A), and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
           YCL031C gene from Saccharomyces cerevisiae. It is an
           essential yeast protein involved in pre-rRNA processing
           and ribosome assembly, and is speculated to be required
           for correct assembly of rpS27 into the pre-ribosomal
           particle. Rrp7A, also termed gastric cancer antigen
           Zg14, is the Rrp7p homolog mainly found in Metazoans.
           The cellular function of Rrp7A remains unclear
           currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 118

 Score = 32.2 bits (74), Expect = 0.31
 Identities = 15/38 (39%), Positives = 24/38 (63%), Gaps = 7/38 (18%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKK-------KKKKKKK 745
             +E  E K K+K+KKKKKKK+ +       ++KKK++
Sbjct: 66  AREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEE 103



 Score = 32.2 bits (74), Expect = 0.35
 Identities = 15/42 (35%), Positives = 24/42 (57%), Gaps = 7/42 (16%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKK-------KKKKKKK 747
                 EE  + K K+K+KKKKKKK+ +       ++KKK++
Sbjct: 62  KGKTAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEE 103



 Score = 31.1 bits (71), Expect = 0.91
 Identities = 19/60 (31%), Positives = 28/60 (46%), Gaps = 24/60 (40%)

Query: 716 DKEEEEEKKKKKKK-------------KKKKK-----------KKKKKKKKKKKKKKYSN 751
           DK EEEEK++ K+              +  +K           K K+K+KKKKKKK+  +
Sbjct: 31  DKREEEEKEEAKEARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKELED 90



 Score = 30.3 bits (69), Expect = 1.7
 Identities = 16/44 (36%), Positives = 27/44 (61%), Gaps = 7/44 (15%)

Query: 707 GRFNQIAYPDKEEEEEKKKKKKKKKKKKKK-------KKKKKKK 743
           GR  + A  +  E + K+K+KKKKKKK+ +       ++KKK++
Sbjct: 60  GRKGKTAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEE 103



 Score = 29.1 bits (66), Expect = 4.0
 Identities = 11/23 (47%), Positives = 17/23 (73%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKK 850
            +E  + K K+K+KKKKKKK+ +
Sbjct: 67  REEAVEAKAKEKEKKKKKKKELE 89


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 34.0 bits (78), Expect = 0.32
 Identities = 14/36 (38%), Positives = 26/36 (72%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
           D+++E+E  ++ K++ +KK+K KKK +K+K    SN
Sbjct: 339 DEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSN 374



 Score = 32.8 bits (75), Expect = 0.70
 Identities = 11/38 (28%), Positives = 25/38 (65%)

Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           + Y    +E+++ +  ++ K++ +KK+K KKK +K+K 
Sbjct: 332 LGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKV 369



 Score = 31.3 bits (71), Expect = 2.4
 Identities = 14/39 (35%), Positives = 26/39 (66%)

Query: 706 VGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
           +G    I   D++E++E+ K++ +KK+K KKK +K+K  
Sbjct: 332 LGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370



 Score = 29.4 bits (66), Expect = 8.4
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
           D E+ K++ +KK+K KKK +K+K  
Sbjct: 346 DDEESKEEVEKKQKVKKKPRKRKVN 370


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast) and
           GNL3L (human) are putative GTPases which are required
           for growth and play a role in processing of nucleolar
           pre-rRNA. This family contains a potential nuclear
           localisation signal.
          Length = 80

 Score = 31.1 bits (71), Expect = 0.35
 Identities = 12/27 (44%), Positives = 16/27 (59%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           EKK  +  +K +K+ KK    K KKKK
Sbjct: 7   EKKVAEHHRKLRKEAKKNPTWKSKKKK 33



 Score = 29.5 bits (67), Expect = 1.6
 Identities = 11/31 (35%), Positives = 26/31 (83%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           EE EEKK+K++++K+++K+ +K ++ + +K+
Sbjct: 49  EEIEEKKRKQEEEKERRKEARKAERAEARKR 79



 Score = 29.1 bits (66), Expect = 2.1
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKK 851
            E  +K +K+ KK    K KKKK
Sbjct: 11  AEHHRKLRKEAKKNPTWKSKKKK 33


>gnl|CDD|220628 pfam10198, Ada3, Histone acetyltransferases subunit 3.  Ada3 is a
           family of proteins conserved from yeasts to humans. It
           is an essential component of the Ada transcriptional
           coactivator (alteration/deficiency in activation)
           complex. Ada3 plays a key role in linking histone
           acetyltransferase-containing complexes to p53 (tumour
           suppressor protein) thereby regulating p53 acetylation,
           stability and transcriptional activation following DNA
           damage.
          Length = 127

 Score = 32.3 bits (74), Expect = 0.36
 Identities = 17/61 (27%), Positives = 29/61 (47%), Gaps = 11/61 (18%)

Query: 818 KQRFNQIAY------LDKE-----KKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQME 866
           K+   +  Y      LDK+      K+ + + KKKKK+K+    +K   +G   + S +E
Sbjct: 67  KEEMARQEYKTILDDLDKQVDQAYLKRIRARGKKKKKEKRPGAAQKAANEGQESLKSLLE 126

Query: 867 K 867
           K
Sbjct: 127 K 127



 Score = 29.2 bits (66), Expect = 4.8
 Identities = 10/33 (30%), Positives = 16/33 (48%)

Query: 729 KKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
            K+ + + KKKKK+K+         EG   L+ 
Sbjct: 91  LKRIRARGKKKKKEKRPGAAQKAANEGQESLKS 123


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 33.2 bits (76), Expect = 0.38
 Identities = 30/141 (21%), Positives = 64/141 (45%), Gaps = 20/141 (14%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEP 777
           EEE E      +  ++ ++++K+K+ +++ + +   + M K+                 P
Sbjct: 92  EEEREWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKM-----------------P 134

Query: 778 QIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGY-LDPK-QRFNQIAYLDKEKKKKK 835
           Q+       K    +K   A   + + +       GY +DP+  RF Q     KEK++KK
Sbjct: 135 QMIADWRAQKRKREQKARAAKERKERLVAEAREHFGYWVDPRDPRF-QEMLQQKEKEEKK 193

Query: 836 KKKKKKKKKKKKKKKKKMMKK 856
           K K+ K+++K++K+   ++  
Sbjct: 194 KVKEAKRREKEEKRMAALVAA 214



 Score = 29.4 bits (66), Expect = 6.3
 Identities = 10/29 (34%), Positives = 19/29 (65%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
            KE+EE+KK K+ K+++K++K+       
Sbjct: 186 QKEKEEKKKVKEAKRREKEEKRMAALVAA 214


>gnl|CDD|218396 pfam05039, Agouti, Agouti protein.  The agouti protein regulates
           pigmentation in the mouse hair follicle producing a
           black hair with a subapical yellow band. A highly
           homologous protein agouti signal protein (ASIP)is
           present in humans and is expressed at highest levels in
           adipose tissue where it may play a role in energy
           homeostasis and possibly human pigmentation.
          Length = 90

 Score = 31.4 bits (71), Expect = 0.38
 Identities = 18/64 (28%), Positives = 24/64 (37%), Gaps = 6/64 (9%)

Query: 728 KKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIK 787
             K  KK  K + ++KKK  K   E     + R     +C  + E  C     PC     
Sbjct: 16  LNKGLKKFNKNEAEEKKKSSKKEAEALTEAQGRPPPPRRCVRLHE-SCLGPQTPCC---- 70

Query: 788 NDPC 791
            DPC
Sbjct: 71  -DPC 73


>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
          Length = 64

 Score = 30.6 bits (70), Expect = 0.39
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMK 855
           EK  +K+K+KK   +K+++KK    +
Sbjct: 39  EKPSEKRKRKKAAARKRRRKKLAREE 64



 Score = 29.8 bits (68), Expect = 0.76
 Identities = 9/24 (37%), Positives = 17/24 (70%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKK 745
           EK  +K+K+KK   +K+++KK  +
Sbjct: 39  EKPSEKRKRKKAAARKRRRKKLAR 62



 Score = 27.5 bits (62), Expect = 4.9
 Identities = 8/22 (36%), Positives = 15/22 (68%)

Query: 728 KKKKKKKKKKKKKKKKKKKKKY 749
           K  +K+K+KK   +K+++KK  
Sbjct: 40  KPSEKRKRKKAAARKRRRKKLA 61



 Score = 27.5 bits (62), Expect = 5.2
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 729 KKKKKKKKKKKKKKKKKKKKYSNEE 753
           K  +K+K+KK   +K+++KK + EE
Sbjct: 40  KPSEKRKRKKAAARKRRRKKLAREE 64



 Score = 26.8 bits (60), Expect = 9.8
 Identities = 7/20 (35%), Positives = 15/20 (75%)

Query: 721 EEKKKKKKKKKKKKKKKKKK 740
           E++K+KK   +K+++KK  +
Sbjct: 43  EKRKRKKAAARKRRRKKLAR 62


>gnl|CDD|223625 COG0551, TopA, Zn-finger domain associated with topoisomerase type
           I [DNA replication, recombination, and repair].
          Length = 140

 Score = 32.1 bits (73), Expect = 0.46
 Identities = 28/149 (18%), Positives = 43/149 (28%), Gaps = 21/149 (14%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEV 773
             ++E +   K  K  +   K  K   KK  K   +                   P  + 
Sbjct: 2   IAEEEVKVVLKDLKTGQICPKCGKNMVKKFGKYGIF-------------LGCSNYPKCDY 48

Query: 774 PCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVG--YLDPKQRFNQIAYLDKEK 831
               +    + +     C K        V +   +          PK RF       KEK
Sbjct: 49  YEPEKA---IAEKTGVKCPKCGKGL--LVLKKGRFGKNFLGCSNYPKCRFT-EKPKPKEK 102

Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKKGYPD 860
           K  K   +K  +KK+KKK   +     P 
Sbjct: 103 KCPKCGSRKLVEKKEKKKGVFLECSNKPC 131



 Score = 31.4 bits (71), Expect = 0.99
 Identities = 18/79 (22%), Positives = 30/79 (37%), Gaps = 7/79 (8%)

Query: 684 PCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKK 743
            C K          +   +       +   YP     E+ K K+KK  K   +K  +KK+
Sbjct: 62  KCPKCGKGL--LVLKKGRFGKNFLGCSN--YPKCRFTEKPKPKEKKCPKCGSRKLVEKKE 117

Query: 744 KKKKKY---SNEEEGMRKL 759
           KKK  +   SN+   ++  
Sbjct: 118 KKKGVFLECSNKPCCLKVE 136


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
           region.  This highly conserved region is found towards
           the C-terminus of the transmembrane domain. The function
           is unclear.
          Length = 151

 Score = 32.3 bits (73), Expect = 0.47
 Identities = 15/32 (46%), Positives = 22/32 (68%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           KE  + + +K  KKKK+K  +KK K KKK++K
Sbjct: 15  KESGKTQSQKSDKKKKEKVSEKKGKSKKKEEK 46



 Score = 31.5 bits (71), Expect = 0.84
 Identities = 23/72 (31%), Positives = 32/72 (44%), Gaps = 13/72 (18%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ---------- 766
           K+ +E  K + +K  KKKK+K  +KK K KKK   EE+   K+ +    Q          
Sbjct: 12  KQRKESGKTQSQKSDKKKKEKVSEKKGKSKKK---EEKPNGKIPEHEPNQEVTEVEVIIE 68

Query: 767 CGPVKEVPCEPQ 778
             PV  V   P 
Sbjct: 69  KEPVPAVAVAPV 80



 Score = 29.6 bits (66), Expect = 4.7
 Identities = 14/32 (43%), Positives = 20/32 (62%)

Query: 822 NQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKM 853
            Q    DK+KK+K  +KK K KKK++K   K+
Sbjct: 20  TQSQKSDKKKKEKVSEKKGKSKKKEEKPNGKI 51


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 32.1 bits (74), Expect = 0.49
 Identities = 21/51 (41%), Positives = 32/51 (62%), Gaps = 3/51 (5%)

Query: 724 KKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
           K+KKKKKKKKK+ +  ++KKKK       +E  + + DAA +    +K+VP
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAV-DAALLAA--IKKVP 129



 Score = 31.0 bits (71), Expect = 1.3
 Identities = 16/38 (42%), Positives = 23/38 (60%), Gaps = 6/38 (15%)

Query: 825 AYLDKEKKKKKKKK------KKKKKKKKKKKKKKMMKK 856
           AY  ++KKKKKKKK      +KKKK    +++K+  K 
Sbjct: 79  AYFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116



 Score = 28.3 bits (64), Expect = 9.7
 Identities = 11/32 (34%), Positives = 23/32 (71%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           K ++++KKKKK+ +  ++KKKK    +++K+ 
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKED 113


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 33.1 bits (76), Expect = 0.49
 Identities = 19/73 (26%), Positives = 36/73 (49%), Gaps = 6/73 (8%)

Query: 703 TTEVGR-FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
            ++V R    +A   KEEEE ++ +++++   +  ++ + K+K K++   E E  R  R 
Sbjct: 108 RSDVRRQLRFLAQKQKEEEE-RRVERRRELGLEDPEQLRLKQKAKEEQKAESEETRH-RA 165

Query: 762 A---ASIQCGPVK 771
           A   A    G  K
Sbjct: 166 ANATALAAIGGRK 178


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 31.9 bits (73), Expect = 0.50
 Identities = 17/45 (37%), Positives = 26/45 (57%), Gaps = 9/45 (20%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKK---------KKKKK 748
             P   EE  K  +++KK+K+KKKKKKK+ +         KKK++
Sbjct: 62  KGPAAGEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEE 106


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 33.1 bits (76), Expect = 0.51
 Identities = 11/44 (25%), Positives = 22/44 (50%)

Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
            I  P+ + + + K K K  KK +++ K++ K  + +  S  E 
Sbjct: 91  VIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFEN 134



 Score = 32.3 bits (74), Expect = 0.93
 Identities = 13/54 (24%), Positives = 19/54 (35%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKP 883
           E K K K K K K  KK +++ K   K      +   +  A    T+       
Sbjct: 96  EPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTAPARPTSSTATAAA 149



 Score = 32.0 bits (73), Expect = 1.0
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           P    + E K K K K K K  KK +++ K++ K
Sbjct: 89  PVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVK 122



 Score = 31.2 bits (71), Expect = 2.2
 Identities = 10/37 (27%), Positives = 19/37 (51%)

Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           +  P  E + + K K K K  KK +++ K++ K  + 
Sbjct: 90  VVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEP 126



 Score = 30.4 bits (69), Expect = 3.6
 Identities = 11/44 (25%), Positives = 22/44 (50%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           + K K K K  KK +++ K++ K  + + ++  E     R  +S
Sbjct: 100 KPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTAPARPTSS 143



 Score = 29.7 bits (67), Expect = 7.1
 Identities = 11/34 (32%), Positives = 16/34 (47%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           P +      K + K K K K K K  KK +++ K
Sbjct: 85  PKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPK 118


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 32.7 bits (75), Expect = 0.52
 Identities = 16/54 (29%), Positives = 34/54 (62%), Gaps = 3/54 (5%)

Query: 708 RFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY---SNEEEGMRK 758
           R  Q    ++E EE KK ++K++ ++K++++ +++++KKK+       EE  RK
Sbjct: 17  RRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARK 70



 Score = 31.2 bits (71), Expect = 1.5
 Identities = 13/44 (29%), Positives = 34/44 (77%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           +E EEE+++++KK ++K++ ++K++++ ++++   +EE  RK R
Sbjct: 21  REAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKER 64



 Score = 30.8 bits (70), Expect = 2.1
 Identities = 13/46 (28%), Positives = 32/46 (69%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
           EEEE +++KK ++K++ ++K++++ +++++K   EEE   +   A 
Sbjct: 24  EEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQAR 69



 Score = 30.4 bits (69), Expect = 2.5
 Identities = 17/46 (36%), Positives = 33/46 (71%), Gaps = 2/46 (4%)

Query: 716 DKEEEEEKKKK--KKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
           + EEE EKKK+  ++K+++++ +K++++ +K K  +  EEEG  KL
Sbjct: 47  ELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGTDKL 92


>gnl|CDD|220121 pfam09135, Alb1, Alb1.  Alb1 is a nuclear shuttling factor involved
           in ribosome biogenesis.
          Length = 112

 Score = 31.7 bits (72), Expect = 0.52
 Identities = 12/40 (30%), Positives = 24/40 (60%), Gaps = 2/40 (5%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
             +KK++KKK K+  +K++K++KK + +     V   + K
Sbjct: 46  VSKKKQRKKKPKRLTRKQRKRQKKGLERAE--IVTDTLSK 83



 Score = 30.9 bits (70), Expect = 0.77
 Identities = 14/38 (36%), Positives = 22/38 (57%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
           E   KKK++KKK K+  +K++K++KK     E     L
Sbjct: 44  EGVSKKKQRKKKPKRLTRKQRKRQKKGLERAEIVTDTL 81



 Score = 30.5 bits (69), Expect = 1.1
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKK 747
           E   KKK++KKK K+  +K++K++KK 
Sbjct: 44  EGVSKKKQRKKKPKRLTRKQRKRQKKG 70



 Score = 30.2 bits (68), Expect = 1.6
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKK 746
           E   KKK++KKK K+  +K++K++KK 
Sbjct: 44  EGVSKKKQRKKKPKRLTRKQRKRQKKG 70



 Score = 30.2 bits (68), Expect = 1.7
 Identities = 13/29 (44%), Positives = 21/29 (72%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
           K E   KKK++KKK K+  +K++K++KK 
Sbjct: 42  KNEGVSKKKQRKKKPKRLTRKQRKRQKKG 70



 Score = 28.6 bits (64), Expect = 5.2
 Identities = 10/25 (40%), Positives = 20/25 (80%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKK 851
           + K+K++KKK K+  +K++K++KK 
Sbjct: 46  VSKKKQRKKKPKRLTRKQRKRQKKG 70


>gnl|CDD|223831 COG0760, SurA, Parvulin-like peptidyl-prolyl isomerase
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 320

 Score = 33.1 bits (75), Expect = 0.53
 Identities = 23/149 (15%), Positives = 43/149 (28%), Gaps = 25/149 (16%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQ 778
           + E K K+     KK  ++ K    +  KK S +               G +     + Q
Sbjct: 177 KAEAKAKEALALLKKGVREAKADFAELAKKQSEDPSSKNG---------GGLLGWNKKGQ 227

Query: 779 IAPCLFDIKNDPCEKNNLA---------------DRSEVQRINHYTTEVGYLDPKQRFNQ 823
           + P          +   ++                + + +        V  +  KQ   Q
Sbjct: 228 LVPEFRKAAFI-LKVGEVSAPVKTSFGYHIIKVEKKRDAKLDFPEVKAVKEISLKQELLQ 286

Query: 824 IAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
            A L+K  K ++K   K  K+        
Sbjct: 287 KAALEKLLKGEQKSFDKLLKEAAIDILSA 315


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 33.5 bits (76), Expect = 0.53
 Identities = 27/171 (15%), Positives = 56/171 (32%), Gaps = 43/171 (25%)

Query: 700 NHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
           N +    G   +I      +EE  +     KKK  +  K+  KK   +     EEG   +
Sbjct: 40  NKFLLAAGSRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELV 99

Query: 760 RDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQ 819
            +  +                                 D+   +             P++
Sbjct: 100 VNEDAA-------------------------------LDKESKKT------------PRR 116

Query: 820 RFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELA 870
              + A    + +++K +KK +K++K KK  + +  +G    +S +E+   
Sbjct: 117 TRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEF 167


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 33.1 bits (76), Expect = 0.56
 Identities = 13/60 (21%), Positives = 24/60 (40%)

Query: 690 LADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
            + + + +          R  ++   ++  E +++  KK K KKKK  K K  K   K  
Sbjct: 175 KSAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKWK 234



 Score = 32.3 bits (74), Expect = 0.89
 Identities = 13/35 (37%), Positives = 18/35 (51%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
           +  E +++  KK K KKKK  K K  K   K+  E
Sbjct: 202 QRLELQRELMKKGKGKKKKIVKDKDGKVVYKWKKE 236



 Score = 29.6 bits (67), Expect = 6.0
 Identities = 13/53 (24%), Positives = 25/53 (47%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
             D+ E   +  + +K     +K+KKK  KKK+K Y   +E   + +    ++
Sbjct: 149 LLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVE 201



 Score = 29.6 bits (67), Expect = 7.0
 Identities = 31/171 (18%), Positives = 64/171 (37%), Gaps = 30/171 (17%)

Query: 712 IAYPDKEEEEEKKKKKKK------------KKKKKKKKKKKKKK-----------KKKKK 748
           +    +EE+E   K++ K            K++ + KK +K K+            K   
Sbjct: 67  VHILKREEKEVLSKEQVKLLKTQDLNYVRTKRQAEAKKIEKLKEELHLTDSGSAGPKHIV 126

Query: 749 YSNEEEGMRKLRDAASIQCGP--VKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRIN 806
           + ++EE  +    A      P  +      P+I+     ++           +S  ++  
Sbjct: 127 FVDDEEEQKSFDPAEYFDTTPELLDRRENRPRIS----QLEKTSLVDEKQKKKSAKKKRK 182

Query: 807 HYTTEVGYLDPKQRFNQI-AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            Y       + +++  ++   L+ +++  KK K KKKK  K K  K + K 
Sbjct: 183 LYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKW 233


>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs, all of which are expressed
           during embryonic and early development in the nervous
           system but with different localization and timing.
           srGAPs contain an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 241

 Score = 32.7 bits (75), Expect = 0.62
 Identities = 12/36 (33%), Positives = 25/36 (69%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
            +EE++E+  +KK ++ +  KK +K+ +K++ KYS 
Sbjct: 159 KQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSE 194


>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2.  This
           family consists of several Eukaryotic rRNA processing
           protein EBP2 sequences. Ebp2p is required for the
           maturation of 25S rRNA and 60S subunit assembly. Ebp2p
           may be one of the target proteins of Rrs1p for executing
           the signal to regulate ribosome biogenesis. This family
           also plays a role in chromosome segregation.
          Length = 270

 Score = 32.7 bits (75), Expect = 0.63
 Identities = 10/33 (30%), Positives = 15/33 (45%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           D   ++  K   KKK +K    + K   K+K K
Sbjct: 187 DFLVDDAAKGGSKKKGRKGGAARGKPNAKRKAK 219



 Score = 32.7 bits (75), Expect = 0.69
 Identities = 9/32 (28%), Positives = 15/32 (46%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           +  +   KKK +K    + K   K+K K  K+
Sbjct: 192 DAAKGGSKKKGRKGGAARGKPNAKRKAKDAKF 223



 Score = 30.0 bits (68), Expect = 4.5
 Identities = 10/32 (31%), Positives = 14/32 (43%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
           D   +   KKK +K    + K   K+K K  K
Sbjct: 191 DDAAKGGSKKKGRKGGAARGKPNAKRKAKDAK 222



 Score = 30.0 bits (68), Expect = 5.1
 Identities = 9/46 (19%), Positives = 16/46 (34%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
                   +   +   K   KKK +K    + K   K+K  + + G
Sbjct: 179 GGDEGDLFDFLVDDAAKGGSKKKGRKGGAARGKPNAKRKAKDAKFG 224


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 33.7 bits (78), Expect = 0.64
 Identities = 14/50 (28%), Positives = 23/50 (46%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
           KEEE+EK  K+  +  KK + +KKK+ K         +  +K+       
Sbjct: 179 KEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEITP 228



 Score = 32.5 bits (75), Expect = 1.4
 Identities = 13/70 (18%), Positives = 28/70 (40%), Gaps = 1/70 (1%)

Query: 805 INHYTTEVGYLDPKQRFNQ-IAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLS 863
                 +    D K+   +  A  ++E +K  K+  +  KK + +KKK+       +   
Sbjct: 156 FGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPV 215

Query: 864 QMEKELANIN 873
           Q+ K++    
Sbjct: 216 QIGKKIDKEE 225



 Score = 32.1 bits (74), Expect = 2.0
 Identities = 14/53 (26%), Positives = 25/53 (47%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASI 765
            +  ++EEE++K  K+  +  KK + +KKK+ K          + K  D   I
Sbjct: 174 KFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEI 226



 Score = 29.8 bits (68), Expect = 8.9
 Identities = 11/39 (28%), Positives = 23/39 (58%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           + E+ E +K+++ +K  K+  +  KK + +KKK S   +
Sbjct: 171 ELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFD 209


>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
          Length = 132

 Score = 31.5 bits (72), Expect = 0.66
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKK 856
           K +K+ K ++KK + K+K K    KK
Sbjct: 107 KARKELKNRRKKVRGKEKTKVSGKKK 132



 Score = 30.4 bits (69), Expect = 1.9
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKK 851
           KE K ++KK + K+K K   KKK
Sbjct: 110 KELKNRRKKVRGKEKTKVSGKKK 132



 Score = 29.6 bits (67), Expect = 3.0
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
           K +K+ K ++KK + K+K K   KKK
Sbjct: 107 KARKELKNRRKKVRGKEKTKVSGKKK 132



 Score = 29.6 bits (67), Expect = 3.5
 Identities = 10/29 (34%), Positives = 18/29 (62%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
           +KK  +K +K+ K ++KK + K+K K   
Sbjct: 101 KKKAGRKARKELKNRRKKVRGKEKTKVSG 129



 Score = 28.5 bits (64), Expect = 9.0
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 724 KKKKKKKKKKKKKKKKKKKKKKKKKY 749
           KKKK  +K +K+ K ++KK + K+K 
Sbjct: 100 KKKKAGRKARKELKNRRKKVRGKEKT 125


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 33.6 bits (76), Expect = 0.68
 Identities = 11/47 (23%), Positives = 31/47 (65%), Gaps = 2/47 (4%)

Query: 716  DKEEEEEKKKKKKKKKKK--KKKKKKKKKKKKKKKYSNEEEGMRKLR 760
            D+EE+E  +  ++ K+++  KK+K+++++ +K    + +E   ++L+
Sbjct: 1028 DQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLK 1074


>gnl|CDD|221012 pfam11169, DUF2956, Protein of unknown function (DUF2956).  This
           family of proteins with unknown function appears to be
           restricted to Gammaproteobacteria.
          Length = 103

 Score = 31.1 bits (71), Expect = 0.69
 Identities = 11/34 (32%), Positives = 24/34 (70%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
            E KK++K K ++  K +K++ K K+++ +N++E
Sbjct: 37  AEYKKQQKAKAREADKARKQQLKAKQRQAANDDE 70



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 11/36 (30%), Positives = 24/36 (66%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
            E KK++K K ++  K +K++ K K+++  +++EE 
Sbjct: 37  AEYKKQQKAKAREADKARKQQLKAKQRQAANDDEED 72


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 32.3 bits (74), Expect = 0.72
 Identities = 10/28 (35%), Positives = 16/28 (57%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKK 743
             EE E     K+++  + +KK+ KKKK
Sbjct: 169 AAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 32.3 bits (74), Expect = 0.80
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKK 744
            EE E     K+++  + +KK+ KKKK
Sbjct: 170 AEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 31.5 bits (72), Expect = 1.4
 Identities = 9/30 (30%), Positives = 17/30 (56%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKK 742
            +  +E E     K+++  + +KK+ KKKK
Sbjct: 167 EFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 31.1 bits (71), Expect = 1.7
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKK 745
            EE +     K+++  + +KK+ KKKK
Sbjct: 170 AEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 31.1 bits (71), Expect = 1.9
 Identities = 9/33 (27%), Positives = 18/33 (54%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
             EE   ++ +     K+++  + +KK+ KKKK
Sbjct: 164 LLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|198301 cd03192, GST_C_Sigma_like, C-terminal, alpha helical domain of
           Class Sigma-like Glutathione S-transferases.
           Glutathione S-transferase (GST) C-terminal domain
           family, Class Sigma_like; composed of GSTs belonging to
           class Sigma and similar proteins, including GSTs from
           class Mu, Pi, and Alpha. GSTs are cytosolic dimeric
           proteins involved in cellular detoxification by
           catalyzing the conjugation of glutathione (GSH) with a
           wide range of endogenous and xenobiotic alkylating
           agents, including carcinogens, therapeutic drugs,
           environmental toxins, and products of oxidative stress.
           The GST fold contains an N-terminal thioredoxin-fold
           domain and a C-terminal alpha helical domain, with an
           active site located in a cleft between the two domains.
           GSH binds to the N-terminal domain while the hydrophobic
           substrate occupies a pocket in the C-terminal domain.
           Vertebrate class Sigma GSTs are characterized as
           GSH-dependent hematopoietic prostaglandin (PG) D
           synthases and are responsible for the production of PGD2
           by catalyzing the isomerization of PGH2. The functions
           of PGD2 include the maintenance of body temperature,
           inhibition of platelet aggregation, bronchoconstriction,
           vasodilation, and mediation of allergy and inflammation.
           Other class Sigma-like members include the class II
           insect GSTs, S-crystallins from cephalopods,
           nematode-specific GSTs, and 28-kDa GSTs from parasitic
           flatworms. Drosophila GST2 is associated with indirect
           flight muscle and exhibits preference for catalyzing GSH
           conjugation to lipid peroxidation products, indicating
           an anti-oxidant role. S-crystallin constitutes the major
           lens protein in cephalopod eyes and is responsible for
           lens transparency and proper refractive index. The
           28-kDa GST from Schistosoma is a multifunctional enzyme,
           exhibiting GSH transferase, GSH peroxidase, and PGD2
           synthase activities, and may play an important role in
           host-parasite interactions. Members also include novel
           GSTs from the fungus Cunninghamella elegans, designated
           as class Gamma, and from the protozoan Blepharisma
           japonicum, described as a light-inducible GST.
          Length = 104

 Score = 31.1 bits (71), Expect = 0.73
 Identities = 10/42 (23%), Positives = 20/42 (47%)

Query: 833 KKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINR 874
           +      +   ++KK+KKK+ +++  P  L + EK L     
Sbjct: 19  EFAPYFYEPDGEEKKEKKKEFLEEALPKFLGKFEKILKKSGG 60


>gnl|CDD|221161 pfam11661, DUF2986, Protein of unknown function (DUF2986).  This
           family of proteins has no known function.
          Length = 44

 Score = 29.2 bits (66), Expect = 0.75
 Identities = 13/27 (48%), Positives = 14/27 (51%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           KKK  +K KKK KK   K     K KY
Sbjct: 4   KKKINQKLKKKAKKANAKLHPSNKPKY 30


>gnl|CDD|225816 COG3277, GAR1, RNA-binding protein involved in rRNA processing
           [Translation, ribosomal structure and biogenesis].
          Length = 98

 Score = 30.8 bits (70), Expect = 0.78
 Identities = 9/40 (22%), Positives = 18/40 (45%)

Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            ++    +    +       K  +KK+K  +KK++ KK K
Sbjct: 59  DDRDVKLESLVGDTLYIPPDKLIRKKRKLPRKKRRPKKPK 98



 Score = 30.0 bits (68), Expect = 1.6
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query: 822 NQIAYLDKEKKKKKKKKKKKKKKKKKKKK 850
               Y+  +K  +KK+K  +KK++ KK K
Sbjct: 70  GDTLYIPPDKLIRKKRKLPRKKRRPKKPK 98


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 31.6 bits (72), Expect = 0.78
 Identities = 10/36 (27%), Positives = 21/36 (58%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
            +EE E  K+    +++KK   +K K+K ++++  N
Sbjct: 43  LQEETELDKELFTPEEQKKITFQKHKEKPEQEELKN 78



 Score = 30.0 bits (68), Expect = 2.9
 Identities = 11/57 (19%), Positives = 25/57 (43%), Gaps = 1/57 (1%)

Query: 699 INHY-TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
            N Y   ++         + E ++E    +++KK   +K K+K ++++ K     E 
Sbjct: 28  PNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSEN 84



 Score = 29.3 bits (66), Expect = 5.2
 Identities = 18/98 (18%), Positives = 41/98 (41%), Gaps = 20/98 (20%)

Query: 679 DIKNDPCEKNNLADRSEDQRI-NHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKK 737
            +K +  EK ++   ++  +       E+  F        EE+++   +K K+K ++++ 
Sbjct: 25  QVKPNRYEKKDIEINTDYLQEETELDKEL--FT------PEEQKKITFQKHKEKPEQEEL 76

Query: 738 KKK-----------KKKKKKKKYSNEEEGMRKLRDAAS 764
           K +            K  KK+ +S+E E      ++ S
Sbjct: 77  KNQLFSENATENNTVKATKKQLFSSEYEQTSSSSESTS 114


>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein. 
          Length = 105

 Score = 30.8 bits (70), Expect = 0.78
 Identities = 11/25 (44%), Positives = 14/25 (56%)

Query: 726 KKKKKKKKKKKKKKKKKKKKKKKYS 750
           KK++KK  K        K KKKK+S
Sbjct: 4   KKQQKKAAKAAAASAGGKAKKKKWS 28



 Score = 30.0 bits (68), Expect = 1.6
 Identities = 13/37 (35%), Positives = 19/37 (51%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
           P K+++++  K        K KKKK  K K K K +N
Sbjct: 2   PPKKQQKKAAKAAAASAGGKAKKKKWSKGKVKDKLNN 38



 Score = 28.5 bits (64), Expect = 6.2
 Identities = 11/27 (40%), Positives = 13/27 (48%)

Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKG 857
           KK++KK  K        K KKK   KG
Sbjct: 4   KKQQKKAAKAAAASAGGKAKKKKWSKG 30


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 33.3 bits (76), Expect = 0.79
 Identities = 40/190 (21%), Positives = 75/190 (39%), Gaps = 32/190 (16%)

Query: 689 NLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            L +   +  INHY  +  R  +       E E + K   +K    KK+K+  + ++  K
Sbjct: 461 TLGEEKSNHIINHYNEKKSRLEE----KIREIEIEVKDIDEKIVDLKKRKEYLESEEINK 516

Query: 749 YSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHY 808
             NE   +   R                      L DIK    E  +  D+ E +  N Y
Sbjct: 517 SINEYNKIESARAD--------------------LEDIKIKINELKDKHDKYE-EIKNRY 555

Query: 809 TT-EVGYLDPKQR-----FNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVL 862
            + ++  LD K+         I+ +D E  + +  + KK+    + + +++ + G+PD  
Sbjct: 556 KSLKLEDLDSKRTSWLNALAVISLIDIETNRSRSNEIKKQLNDLESRLQEI-EIGFPDDK 614

Query: 863 SQMEKELANI 872
           S ++K +  I
Sbjct: 615 SYIDKSIREI 624


>gnl|CDD|190559 pfam03195, DUF260, Protein of unknown function DUF260. 
          Length = 101

 Score = 30.6 bits (70), Expect = 0.82
 Identities = 14/55 (25%), Positives = 21/55 (38%), Gaps = 12/55 (21%)

Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
           PY P + P  + N+H+    F  S    +L  L           EQR     S++
Sbjct: 20  PYFPAEQPARFANVHKL---FGASNVTKLLKALPP---------EQRDDAMRSLL 62


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 33.1 bits (75), Expect = 0.85
 Identities = 29/151 (19%), Positives = 63/151 (41%), Gaps = 15/151 (9%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
           KE E ++  K+ ++ K++  KK+    K ++K    +       D A  Q   V++   E
Sbjct: 204 KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQ-------DNADKQRDEVRQKQQE 256

Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
            +  P        P + ++  +  +V        E   ++ K+  ++ A   K+ K    
Sbjct: 257 AKNLP-------KPADTSSPKEDKQVAENQKREIEKAQIEIKKN-DEEALKAKDHKAFDL 308

Query: 837 KKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
           K++ K  +K+ + K+   +K    V   ++K
Sbjct: 309 KQESKASEKEAEDKELEAQKKREPVAEDLQK 339


>gnl|CDD|216298 pfam01101, HMG14_17, HMG14 and HMG17. 
          Length = 93

 Score = 30.6 bits (69), Expect = 0.86
 Identities = 18/40 (45%), Positives = 22/40 (55%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           P K E + KK   KKKK   K KK  K KK+ K+   +EE
Sbjct: 31  PPKPEPKPKKAAAKKKKPAVKGKKGAKGKKETKQEEAKEE 70


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 32.0 bits (73), Expect = 0.89
 Identities = 14/33 (42%), Positives = 18/33 (54%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
            K+ KKK+    +K    KK KKK K+   EEE
Sbjct: 48  PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEE 80



 Score = 31.7 bits (72), Expect = 1.3
 Identities = 12/45 (26%), Positives = 21/45 (46%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
             I  P  ++ ++K+    +K    KK KKK K+K  ++    E 
Sbjct: 41  EDIEIPSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPES 85



 Score = 30.1 bits (68), Expect = 4.0
 Identities = 15/41 (36%), Positives = 20/41 (48%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           +P+  E    K+ KKK+    +K    KK KKK K    EE
Sbjct: 39  FPEDIEIPSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEE 79



 Score = 29.3 bits (66), Expect = 7.2
 Identities = 13/44 (29%), Positives = 20/44 (45%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANIN 873
           +KK+    +K    KK KKK K+ + +      S  +K   N N
Sbjct: 52  KKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENEN 95


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 32.8 bits (75), Expect = 0.91
 Identities = 19/66 (28%), Positives = 30/66 (45%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEP 777
             E ++ + K+  KKKKKKKKKKKKK   + Y    +       AA      ++++   P
Sbjct: 44  ISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWP 103

Query: 778 QIAPCL 783
           +     
Sbjct: 104 EQEWKQ 109



 Score = 32.4 bits (74), Expect = 1.1
 Identities = 17/35 (48%), Positives = 21/35 (60%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
           + +E E K+  KKKKKKKKKKKKK   +     Y 
Sbjct: 46  ENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYD 80



 Score = 32.4 bits (74), Expect = 1.2
 Identities = 16/32 (50%), Positives = 20/32 (62%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            E +E + K+  KKKKKKKKKKKKK   +   
Sbjct: 45  SENQEAENKQNNKKKKKKKKKKKKKNLGEAYD 76



 Score = 31.6 bits (72), Expect = 1.9
 Identities = 17/48 (35%), Positives = 23/48 (47%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
           D    E   + ++ + K+  KKKKKKKKKKKKK   E   +       
Sbjct: 37  DAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVV 84



 Score = 31.2 bits (71), Expect = 2.7
 Identities = 19/66 (28%), Positives = 27/66 (40%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
           ++E E ++  KKKKKKKKKKKKK   +                 +D + I+         
Sbjct: 47  NQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQE 106

Query: 776 EPQIAP 781
             Q  P
Sbjct: 107 WKQTQP 112



 Score = 30.8 bits (70), Expect = 3.3
 Identities = 24/103 (23%), Positives = 35/103 (33%), Gaps = 25/103 (24%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKK----------------------KKKKKKKKK 747
                  K+++++ K+K  KKK KK KK                         KKKKKKK
Sbjct: 4   ENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKK 63

Query: 748 KYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDP 790
           K   ++        A  +   PV       Q    +  + N P
Sbjct: 64  KKKKKKNLGEAYDLAYDL---PVVWSSAAFQDNSHIRKLGNWP 103



 Score = 30.8 bits (70), Expect = 3.7
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
              E  + + K++K++ K+K  KKK KK KK 
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKKD 32



 Score = 30.1 bits (68), Expect = 5.5
 Identities = 16/42 (38%), Positives = 22/42 (52%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
            + ++  KKKKKKKKKKKKK   +   +    P V S    +
Sbjct: 50  AENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQ 91



 Score = 29.7 bits (67), Expect = 8.7
 Identities = 12/39 (30%), Positives = 23/39 (58%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
              E  + + K++K++ K+K  KKK KK KK  ++++  
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAF 39


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
           replication regulator is required for loading and
           maintenance of Cdc45 on chromatin during DNA
           replication.
          Length = 437

 Score = 32.5 bits (74), Expect = 0.96
 Identities = 14/40 (35%), Positives = 24/40 (60%), Gaps = 2/40 (5%)

Query: 715 PDKEEEEEKKKKKKKKK--KKKKKKKKKKKKKKKKKYSNE 752
             K E+E KKK  K+ +   + K++K++K+KK KK+    
Sbjct: 123 FTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPV 162



 Score = 29.4 bits (66), Expect = 9.0
 Identities = 13/37 (35%), Positives = 22/37 (59%), Gaps = 6/37 (16%)

Query: 822 NQIAYLDKEKKKKKKK------KKKKKKKKKKKKKKK 852
           ++    +KE KKK  K      + K++K++K+KK KK
Sbjct: 121 DRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKK 157


>gnl|CDD|216134 pfam00816, Histone_HNS, H-NS histone family. 
          Length = 92

 Score = 30.3 bits (69), Expect = 0.96
 Identities = 13/59 (22%), Positives = 21/59 (35%), Gaps = 19/59 (32%)

Query: 716 DKEEEEEKKKKKKKKKKK-------------------KKKKKKKKKKKKKKKYSNEEEG 755
           +KE E  +K ++     +                    KKK KKK+     KY + + G
Sbjct: 11  EKEIEAARKAERAAALAEIRELAAEYGLTLEELLGGAAKKKAKKKRAPVPAKYRDPDTG 69


>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106.  This
           archaebacterial protein family has no known function.
           Members are predicted to be integral membrane proteins.
          Length = 168

 Score = 31.5 bits (72), Expect = 1.1
 Identities = 12/64 (18%), Positives = 34/64 (53%), Gaps = 9/64 (14%)

Query: 699 INHYTTEVGRFNQIAYPDKEEEEEKKKKK------KKKKKKKKKKKKKKKKKKKKKYSNE 752
           +    +E+ ++  I   D++ E+ +K++K      ++ +K   K   KK +K++++   +
Sbjct: 25  LRGLLSELLQWLLI---DRKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMED 81

Query: 753 EEGM 756
           ++ M
Sbjct: 82  QKEM 85


>gnl|CDD|201540 pfam00992, Troponin, Troponin.  Troponin (Tn) contains three
           subunits, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). this Pfam contains members of
           the TnT subunit. Troponin is a complex of three
           proteins, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). The troponin complex
           regulates Ca++ induced muscle contraction. This family
           includes troponin T and troponin I. Troponin I binds to
           actin and troponin T binds to tropomyosin.
          Length = 131

 Score = 31.0 bits (71), Expect = 1.1
 Identities = 16/48 (33%), Positives = 25/48 (52%), Gaps = 6/48 (12%)

Query: 716 DKEEEEEKKKKKKKKKKKK----KKKKKKKKKKKKKKYSNEEEGMRKL 759
           D EE+  KK K+ +  KKK    + K KK   KK +K ++    ++ L
Sbjct: 67  DIEEKVAKKDKEIEDLKKKVNDLRGKFKKPTLKKVRKSADAM--LKAL 112


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 29.6 bits (67), Expect = 1.1
 Identities = 8/26 (30%), Positives = 23/26 (88%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKK 742
           K EEE+++++++ +K++++K+++K K
Sbjct: 30  KAEEEKREEEEEARKREERKEREKNK 55



 Score = 29.2 bits (66), Expect = 1.4
 Identities = 8/26 (30%), Positives = 23/26 (88%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKK 743
           + EEEK++++++ +K++++K+++K K
Sbjct: 30  KAEEEKREEEEEARKREERKEREKNK 55



 Score = 27.7 bits (62), Expect = 4.4
 Identities = 10/38 (26%), Positives = 30/38 (78%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           K +++E K +++K++++++ +K++++K+++K  S EE 
Sbjct: 23  KAKKKELKAEEEKREEEEEARKREERKEREKNKSFEEL 60


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 31.0 bits (70), Expect = 1.2
 Identities = 20/65 (30%), Positives = 37/65 (56%), Gaps = 3/65 (4%)

Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKK---KKKKKKKKKKYSNEEEGMRKLRDAASIQCG 768
            A P+KE  E++ K  K+ +K +KKKK   KK+  K++K+   E+E  ++ ++   I+  
Sbjct: 47  YAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELS 106

Query: 769 PVKEV 773
             K+ 
Sbjct: 107 KKKQK 111



 Score = 28.7 bits (64), Expect = 7.0
 Identities = 10/40 (25%), Positives = 27/40 (67%)

Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
                +  K++ +EKK+  K++K+++++K+  K++K+ +K
Sbjct: 63  KEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEK 102


>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11.  The WW domain is a
           small protein module with a triple-stranded beta-sheet
           fold. This is a family of WW domain binding proteins.
          Length = 78

 Score = 29.5 bits (67), Expect = 1.2
 Identities = 13/41 (31%), Positives = 27/41 (65%), Gaps = 1/41 (2%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVL-SQMEK 867
           D  +K++KKK+ KK K +++ +++  + K  PD L  +++K
Sbjct: 8   DAYRKEQKKKELKKNKAERQARREAKLAKKNPDRLQRELDK 48



 Score = 28.8 bits (65), Expect = 2.7
 Identities = 11/41 (26%), Positives = 24/41 (58%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
           +  +K++KKK+ KK K +++ +++ K    N +   R+L  
Sbjct: 8   DAYRKEQKKKELKKNKAERQARREAKLAKKNPDRLQRELDK 48



 Score = 28.4 bits (64), Expect = 3.2
 Identities = 12/49 (24%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE-EEGMRKLRDAASIQ 766
           +   K++KKK+ KK K +++ +++ K  KK  +  +  + KL+      
Sbjct: 8   DAYRKEQKKKELKKNKAERQARREAKLAKKNPDRLQRELDKLKAQEESG 56



 Score = 27.2 bits (61), Expect = 8.9
 Identities = 11/30 (36%), Positives = 20/30 (66%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
           D   +E+KKK+ KK K +++ +++ K  KK
Sbjct: 8   DAYRKEQKKKELKKNKAERQARREAKLAKK 37


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 31.2 bits (71), Expect = 1.2
 Identities = 12/32 (37%), Positives = 21/32 (65%), Gaps = 3/32 (9%)

Query: 828 DKEKKKKKKKKKKKKKK---KKKKKKKKMMKK 856
            K K+K ++ K K KK+   K+K+K+++ MK 
Sbjct: 66  KKAKEKLRRDKLKAKKEEAEKEKEKEERFMKA 97



 Score = 30.0 bits (68), Expect = 2.7
 Identities = 15/38 (39%), Positives = 24/38 (63%), Gaps = 4/38 (10%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           E +KK K+K ++ K K KK++ +K+K+K    EE   K
Sbjct: 63  EGRKKAKEKLRRDKLKAKKEEAEKEKEK----EERFMK 96



 Score = 29.6 bits (67), Expect = 3.5
 Identities = 11/32 (34%), Positives = 21/32 (65%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           KK K+K ++ K K KK++ +K+K+K+    + 
Sbjct: 66  KKAKEKLRRDKLKAKKEEAEKEKEKEERFMKA 97



 Score = 29.3 bits (66), Expect = 5.3
 Identities = 12/44 (27%), Positives = 28/44 (63%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           K+ +E+ ++ K K KK++ +K+K+K+++  K  +  E+   +L 
Sbjct: 66  KKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELE 109



 Score = 28.9 bits (65), Expect = 6.2
 Identities = 8/38 (21%), Positives = 26/38 (68%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           +++ + KK++ +K+K+K+++  K   + +K++   E++
Sbjct: 74  RDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKK 111



 Score = 28.9 bits (65), Expect = 6.3
 Identities = 11/37 (29%), Positives = 24/37 (64%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
           KEE E++K+K+++  K   + +K++ + +KKK   + 
Sbjct: 81  KEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKL 117


>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
           CCCH-type Zn-finger protein [General function prediction
           only].
          Length = 299

 Score = 32.0 bits (72), Expect = 1.2
 Identities = 14/71 (19%), Positives = 29/71 (40%), Gaps = 10/71 (14%)

Query: 723 KKKKKKKKKK---------KKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEV 773
           K +  K +           K++ +KK+K + ++K+   E++ +R   D  ++ C      
Sbjct: 35  KNRSTKVQAIIKQIETLNLKEQLEKKEKMRMEEKRREPEKQVIRAGVDPKTVVCALFLNK 94

Query: 774 PCEPQIAPCLF 784
            C      C F
Sbjct: 95  TCAKG-DACKF 104


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 32.8 bits (75), Expect = 1.2
 Identities = 15/35 (42%), Positives = 25/35 (71%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
           K    +I+   +EK KKKKKK+KKK+++ K+++K 
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKA 765



 Score = 32.4 bits (74), Expect = 1.4
 Identities = 14/28 (50%), Positives = 23/28 (82%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
           +  EEK KKKKKK+KKK+++ K+++K +
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREEKAR 766



 Score = 32.4 bits (74), Expect = 1.5
 Identities = 10/35 (28%), Positives = 17/35 (48%)

Query: 717  KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
            +E+  E   KK+K KK+ K   + +     K+Y  
Sbjct: 1558 EEDYAESDIKKRKNKKQYKSNTEAELDFFLKRYLR 1592



 Score = 32.0 bits (73), Expect = 2.1
 Identities = 12/36 (33%), Positives = 25/36 (69%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKM 853
              F     ++++ KKKKKK+KKK+++ K+++K ++
Sbjct: 732 DAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARI 767



 Score = 30.8 bits (70), Expect = 4.2
 Identities = 14/28 (50%), Positives = 23/28 (82%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKK 743
           D  EE+ KKKKKK+KKK+++ K+++K +
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREEKAR 766



 Score = 30.8 bits (70), Expect = 4.2
 Identities = 13/28 (46%), Positives = 23/28 (82%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
           +  E+K KKKKKK+KKK+++ K+++K +
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREEKAR 766



 Score = 30.1 bits (68), Expect = 7.3
 Identities = 12/34 (35%), Positives = 25/34 (73%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            + +  +  ++K KKKKKK+KKK+++ K+++K +
Sbjct: 733 AEFKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766



 Score = 30.1 bits (68), Expect = 7.5
 Identities = 13/33 (39%), Positives = 23/33 (69%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            K+ E +     ++K KKKKKK+KKK+++ K++
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKRE 762



 Score = 30.1 bits (68), Expect = 8.0
 Identities = 12/29 (41%), Positives = 22/29 (75%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKK 742
               EE+ +KKKKK+KKK+++ K+++K +
Sbjct: 738 SDSVEEKTKKKKKKEKKKEEEYKREEKAR 766


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 32.2 bits (74), Expect = 1.3
 Identities = 13/56 (23%), Positives = 34/56 (60%), Gaps = 9/56 (16%)

Query: 718 EEEEEKKKKKKKKKKKKKKK-------KKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
           +EE+E + ++K++K+KK ++       ++ ++K+++K+   EEE   K+ +    +
Sbjct: 109 QEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEE--LKILEYQREK 162



 Score = 31.4 bits (72), Expect = 2.1
 Identities = 16/75 (21%), Positives = 42/75 (56%), Gaps = 11/75 (14%)

Query: 686 EKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
           E+    +R E+ +I  Y  E           K E EE+++ +++++K++K+++  + + +
Sbjct: 142 EEEKEREREEELKILEYQRE-----------KAEREEEREAERRERKEEKEREVARLRAQ 190

Query: 746 KKKYSNEEEGMRKLR 760
           +++  +E E + +LR
Sbjct: 191 QEEAEDEREELDELR 205


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 32.5 bits (74), Expect = 1.3
 Identities = 17/44 (38%), Positives = 22/44 (50%), Gaps = 4/44 (9%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKK----KKKKKKKKKKYSNEEE 754
           PD EE+  KK KK +KK + K  K     K   KK+   S E+ 
Sbjct: 439 PDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKSQEDN 482



 Score = 31.3 bits (71), Expect = 2.5
 Identities = 13/51 (25%), Positives = 18/51 (35%)

Query: 706 VGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
                  A    ++EE+  KK KK +KK + K  K     K     E    
Sbjct: 428 TSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKS 478


>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
          Length = 202

 Score = 31.6 bits (72), Expect = 1.3
 Identities = 11/35 (31%), Positives = 20/35 (57%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           EE++K++     KKK K +K  K+ +KK  +   +
Sbjct: 156 EEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPK 190



 Score = 30.0 bits (68), Expect = 3.8
 Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 5/42 (11%)

Query: 717 KEEEEEKKK-----KKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
           K EE+ K++     KKK K +K  K+ +KK   K  K     
Sbjct: 154 KLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAV 195



 Score = 28.9 bits (65), Expect = 9.2
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
           KE+     KKK K +K  K+ +KK + K    +++ ++K
Sbjct: 160 KERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198


>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1.  Rec10 / Red1 is involved in meiotic
           recombination and chromosome segregation during
           homologous chromosome formation. This protein localises
           to the synaptonemal complex in S. cerevisiae and the
           analogous structures (linear elements) in S. pombe. This
           family is currently only found in fungi.
          Length = 706

 Score = 32.6 bits (74), Expect = 1.3
 Identities = 40/203 (19%), Positives = 77/203 (37%), Gaps = 30/203 (14%)

Query: 689 NLADRSEDQRINHYTTEVGRFNQIAYPDKEEE---EEKKKKKKKKKKKKKKKKKKKKKKK 745
           +L +  ++ +I   T       + + P+       EE++    + +K+K  +   K  + 
Sbjct: 306 DLKEDQKESKIKSGTNLKLSSEKESIPETSYVNVLEEEQSPLVRLQKRKLARSTSKTLES 365

Query: 746 KKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRI 805
            K+   ++    K + A S +   + E P      P    +  DP  K+      +    
Sbjct: 366 LKEVFEDQASSVKNKQAQSEE--NLNESP----KTPIA--VTGDPHLKDPTIIAGKKLM- 416

Query: 806 NHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKK-------------KKKKKKKKKKKK 852
           N  T+E      K     +  + K K  K +KK+              K+ +KK  K K+
Sbjct: 417 NKLTSEKINNPVK-----VVKVSKYKGNKSEKKRDINVLDTIFASPVSKELRKKVGKSKQ 471

Query: 853 MMKKGYPDVLSQMEKELANINRT 875
              K +  V ++ +K+LAN N  
Sbjct: 472 TKLKNFKPVPNKSKKQLANNNSQ 494


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 32.5 bits (75), Expect = 1.3
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 822 NQIAYLDKEKKKKKKKKKKKKKKKKKKK 849
             +    ++++ KKKKK+KK KK + +K
Sbjct: 726 APLWLDLEKRRLKKKKKRKKVKKWEVEK 753



 Score = 30.6 bits (70), Expect = 4.4
 Identities = 11/23 (47%), Positives = 17/23 (73%)

Query: 727 KKKKKKKKKKKKKKKKKKKKKKY 749
           +K++ KKKKK+KK KK + +K  
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755



 Score = 30.2 bits (69), Expect = 5.6
 Identities = 12/21 (57%), Positives = 17/21 (80%)

Query: 722 EKKKKKKKKKKKKKKKKKKKK 742
           EK++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 30.2 bits (69), Expect = 6.8
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 723 KKKKKKKKKKKKKKKKKKKKK 743
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 30.2 bits (69), Expect = 6.8
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 724 KKKKKKKKKKKKKKKKKKKKK 744
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 30.2 bits (69), Expect = 6.8
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 725 KKKKKKKKKKKKKKKKKKKKK 745
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 30.2 bits (69), Expect = 6.8
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 726 KKKKKKKKKKKKKKKKKKKKK 746
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 30.2 bits (69), Expect = 6.8
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 831 KKKKKKKKKKKKKKKKKKKKK 851
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 30.2 bits (69), Expect = 6.8
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 832 KKKKKKKKKKKKKKKKKKKKK 852
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 30.7 bits (70), Expect = 1.3
 Identities = 16/65 (24%), Positives = 35/65 (53%), Gaps = 2/65 (3%)

Query: 692 DRSEDQRINHYTT--EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           D  ED+    Y      G F+   Y D++EE ++  +   ++  +++KK++++K+K++  
Sbjct: 32  DDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQKEKEEIE 91

Query: 750 SNEEE 754
              EE
Sbjct: 92  KYREE 96


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 31.9 bits (73), Expect = 1.4
 Identities = 10/44 (22%), Positives = 27/44 (61%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
           D+EE++E  KK K+   ++  ++ +++  ++++   E+E   +L
Sbjct: 134 DEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASEL 177


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
           is an essential protein in yeast that is involved in
           pre-rRNA processing and ribosome assembly. It is
           speculated to be required for correct assembly of rpS27
           into the pre-ribosomal particle.
          Length = 131

 Score = 30.7 bits (70), Expect = 1.4
 Identities = 19/65 (29%), Positives = 38/65 (58%), Gaps = 11/65 (16%)

Query: 707 GRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKK-------KKKKKKK----KKKYSNEEEG 755
           GR  +      +  EE++K K+KKKKKKK+ +       ++KKK++    +KK+  +++ 
Sbjct: 59  GRKRKAGASRNKAAEERRKLKEKKKKKKKELENFYRFQIREKKKEELAELRKKFEEDKKR 118

Query: 756 MRKLR 760
           + +L+
Sbjct: 119 IEQLK 123



 Score = 29.5 bits (67), Expect = 3.7
 Identities = 17/57 (29%), Positives = 25/57 (43%), Gaps = 24/57 (42%)

Query: 716 DKEEEEEKKKKKK------------------------KKKKKKKKKKKKKKKKKKKK 748
           DK EEE K++ K                         + K  ++++K K+KKKKKKK
Sbjct: 31  DKREEEAKEEAKARSEPDEDGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKK 87



 Score = 28.3 bits (64), Expect = 8.6
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKK 851
           A   + K  ++++K K+KKKKKKK+ +
Sbjct: 64  AGASRNKAAEERRKLKEKKKKKKKELE 90


>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
           function prediction only].
          Length = 386

 Score = 32.1 bits (73), Expect = 1.4
 Identities = 16/41 (39%), Positives = 21/41 (51%)

Query: 705 EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
           E G F+  AYP K E   K+K  +K    K KK  KK + +
Sbjct: 341 EDGAFDVDAYPKKAEAARKRKGDRKGVSHKAKKGGKKNQAE 381



 Score = 31.0 bits (70), Expect = 3.5
 Identities = 12/35 (34%), Positives = 16/35 (45%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
             KK +  +K+K  +K    K KK  KK   E  G
Sbjct: 350 YPKKAEAARKRKGDRKGVSHKAKKGGKKNQAETAG 384


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 30.7 bits (70), Expect = 1.4
 Identities = 14/44 (31%), Positives = 32/44 (72%), Gaps = 5/44 (11%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKK-----KKKKKKKKYSNEEE 754
           D EE+ E+K+++K++K++K++K+K+      +K++++K   E E
Sbjct: 13  DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELE 56



 Score = 30.4 bits (69), Expect = 1.9
 Identities = 12/41 (29%), Positives = 30/41 (73%), Gaps = 3/41 (7%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
           D E+K+++K+++K++K++K++K+K+   KG   V  +  ++
Sbjct: 13  DIEEKREEKEREKEEKERKEEKEKE-WGKG--LVQKEEREK 50



 Score = 28.8 bits (65), Expect = 5.8
 Identities = 11/43 (25%), Positives = 32/43 (74%), Gaps = 4/43 (9%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKK----KKKKKKKYSNEEE 754
           +++ EE++++K++K++K++K+K+  K    K++++K+    E+
Sbjct: 15  EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEK 57


>gnl|CDD|187861 cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Large proteins, some contain Zn-finger domain; signature
           gene for I-A subtype; also known as TM1802 family.
          Length = 579

 Score = 32.3 bits (74), Expect = 1.4
 Identities = 17/57 (29%), Positives = 25/57 (43%), Gaps = 5/57 (8%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKK-----KKKKKKKYSNEEEGMRKLRD 761
           N+I Y  K  E   KK KK  +K     + KK     K  K     N E+ ++ ++D
Sbjct: 84  NKITYLLKTTEPLNKKIKKWFEKSYLLLESKKEILKLKNIKDVLEKNIEKILKDIQD 140


>gnl|CDD|216723 pfam01823, MACPF, MAC/Perforin domain.  The membrane-attack complex
           (MAC) of the complement system forms transmembrane
           channels. These channels disrupt the phospholipid
           bilayer of target cells, leading to cell lysis and
           death. A number of proteins participate in the assembly
           of the MAC. Freshly activated C5b binds to C6 to form a
           C5b-6 complex, then to C7 forming the C5b-7 complex. The
           C5b-7 complex binds to C8, which is composed of three
           chains (alpha, beta, and gamma), thus forming the C5b-8
           complex. C5b-8 subsequently binds to C9 and acts as a
           catalyst in the polymerisation of C9. Active MAC has a
           subunit composition of C5b-C6-C7-C8-C9{n}. Perforin is a
           protein found in cytolytic T-cell and killer cells. In
           the presence of calcium, perforin polymerises into
           transmembrane tubules and is capable of lysing,
           non-specifically, a variety of target cells. There are a
           number of regions of similarity in the sequences of
           complement components C6, C7, C8-alpha, C8-beta, C9 and
           perforin. The X-ray crystal structure of a MACPF domain
           reveals that it shares a common fold with bacterial
           cholesterol dependent cytolysins (pfam01289) such as
           perfringolysin O. Three key pieces of evidence suggests
           that MACPF domains and CDCs are homologous: Functional
           similarity (pore formation), conservation of three
           glycine residues at a hinge in both families and
           conservation of a complex core fold.
          Length = 209

 Score = 31.6 bits (72), Expect = 1.4
 Identities = 15/68 (22%), Positives = 23/68 (33%), Gaps = 14/68 (20%)

Query: 701 HYTTEV---GRFNQIAYPDKEEEEEKKKKKKK-----------KKKKKKKKKKKKKKKKK 746
           HY T V   G+   +   D  + ++ K                K   K  K     +K+K
Sbjct: 78  HYITSVTLGGKIVYVLKLDSSQVKKLKLTGSDCVSAEAGASFGKVSSKGCKSTSSSQKEK 137

Query: 747 KKYSNEEE 754
           K  S+  E
Sbjct: 138 KSSSSSIE 145


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 31.8 bits (72), Expect = 1.4
 Identities = 14/47 (29%), Positives = 22/47 (46%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
           K + E+K +   ++K   +KKK   K K  K  +  +   RK   AA
Sbjct: 214 KAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAA 260


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 32.0 bits (73), Expect = 1.4
 Identities = 15/39 (38%), Positives = 23/39 (58%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
           P K ++++KKKKK   KK K KK   K++K + K    +
Sbjct: 252 PSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQ 290



 Score = 31.2 bits (71), Expect = 2.4
 Identities = 16/32 (50%), Positives = 19/32 (59%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
               + KKKKKKKKK   KK K KK   K++K
Sbjct: 250 ANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRK 281



 Score = 29.3 bits (66), Expect = 8.5
 Identities = 16/27 (59%), Positives = 17/27 (62%)

Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKKGY 858
           K KKKKKKKKK   KK K KK+  K  
Sbjct: 254 KVKKKKKKKKKVVHKKYKTKKLTGKQR 280



 Score = 29.3 bits (66), Expect = 9.2
 Identities = 15/24 (62%), Positives = 16/24 (66%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKK 851
           +  K KKKKKKKKK   KK K KK
Sbjct: 251 NPSKVKKKKKKKKKVVHKKYKTKK 274


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 32.1 bits (74), Expect = 1.5
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
           D EEE    K++ +    K+ ++K+KK +K 
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541



 Score = 31.7 bits (73), Expect = 2.0
 Identities = 10/35 (28%), Positives = 20/35 (57%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           +P  + EEE    K++ +    K+ ++K+KK +K 
Sbjct: 507 FPRIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
           glycoprotein C-terminal domain.  The trypanosome
           parasite expresses these proteins to evade the immune
           response.
          Length = 98

 Score = 29.7 bits (67), Expect = 1.6
 Identities = 7/34 (20%), Positives = 14/34 (41%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           P + + ++      +    +   +K K KK KK 
Sbjct: 29  PKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKD 62



 Score = 29.3 bits (66), Expect = 2.6
 Identities = 10/41 (24%), Positives = 16/41 (39%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
             K ++      +    +   +K K KK KK  K   + EG
Sbjct: 31  EGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEG 71



 Score = 28.6 bits (64), Expect = 4.7
 Identities = 15/43 (34%), Positives = 20/43 (46%), Gaps = 10/43 (23%)

Query: 716 DKEEEEEK-KKKKKKKKKKKK---------KKKKKKKKKKKKK 748
           DK+E++ K K K+ K KK               +K K KK KK
Sbjct: 19  DKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61


>gnl|CDD|146145 pfam03357, Snf7, Snf7.  This family of proteins are involved in
           protein sorting and transport from the endosome to the
           vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes
           play an important role in the degradation of both lipids
           and cellular proteins. In order to perform this
           degradative function, vacuoles/lysosomes contain
           numerous hydrolases which have been transported in the
           form of inactive precursors via the biosynthetic pathway
           and are proteolytically activated upon delivery to the
           vacuole/lysosome. The delivery of transmembrane
           proteins, such as activated cell surface receptors to
           the lumen of the vacuole/lysosome, either for
           degradation/downregulation, or in the case of
           hydrolases, for proper localisation, requires the
           formation of multivesicular bodies (MVBs). These late
           endosomal structures are formed by invaginating and
           budding of the limiting membrane into the lumen of the
           compartment. During this process, a subset of the
           endosomal membrane proteins is sorted into the forming
           vesicles. Mature MVBs fuse with the vacuole/lysosome,
           thereby releasing cargo containing vesicles into its
           hydrolytic lumen for degradation. Endosomal proteins
           that are not sorted into the intralumenal MVB vesicles
           are either recycled back to the plasma membrane or Golgi
           complex, or remain in the limiting membrane of the MVB
           and are thereby transported to the limiting membrane of
           the vacuole/lysosome as a consequence of fusion.
           Therefore, the MVB sorting pathway plays a critical role
           in the decision between recycling and degradation of
           membrane proteins. A few archaeal sequences are also
           present within this family.
          Length = 169

 Score = 31.1 bits (71), Expect = 1.6
 Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 10/57 (17%)

Query: 823 QIAYLDKEKKKKKKKKKKKKKKKKKK------KKKKMMKKGYPDVLSQMEKELANIN 873
           +   L+K+ KK + + KK  KK  K       K+KK  +K     L Q++ +LAN+ 
Sbjct: 16  KQESLEKKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEK----QLDQLDGQLANLE 68



 Score = 29.9 bits (68), Expect = 3.7
 Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 6/57 (10%)

Query: 815 LDPKQRF--NQIAYLDKEKKKKKKKKKKKKKK---KKKKKKKKMMKKGYPDVLSQME 866
           LD KQ     +I  L+ E KK  KK  K       K+KK+ +K + +     L+ +E
Sbjct: 13  LDKKQESLEKKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEKQLDQLD-GQLANLE 68



 Score = 28.8 bits (65), Expect = 8.1
 Identities = 10/36 (27%), Positives = 18/36 (50%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
           E++ +K + + KK  KK  K       K+KK   ++
Sbjct: 21  EKKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEKQ 56


>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain.  This region has been
           identified as the binding site of the SRP72 protein to
           SRP RNA.
          Length = 57

 Score = 28.6 bits (64), Expect = 1.6
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 724 KKKKKKKKKKKKKKKKKKKKKKKKKY 749
           KK  KKK   ++KKKK  KKK+K K 
Sbjct: 6   KKIAKKKPGTEQKKKKSAKKKRKPKL 31



 Score = 27.8 bits (62), Expect = 3.5
 Identities = 16/28 (57%), Positives = 19/28 (67%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           E KK KK  KKK   ++KKKK  KKK+K
Sbjct: 1   EAKKPKKIAKKKPGTEQKKKKSAKKKRK 28



 Score = 27.4 bits (61), Expect = 4.5
 Identities = 14/25 (56%), Positives = 17/25 (68%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKK 747
           KK  KKK   ++KKKK  KKK+K K
Sbjct: 6   KKIAKKKPGTEQKKKKSAKKKRKPK 30



 Score = 27.0 bits (60), Expect = 7.3
 Identities = 15/29 (51%), Positives = 19/29 (65%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           E +K KK  KKK   ++KKKK  KKK+K 
Sbjct: 1   EAKKPKKIAKKKPGTEQKKKKSAKKKRKP 29


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 32.4 bits (73), Expect = 1.7
 Identities = 27/110 (24%), Positives = 42/110 (38%), Gaps = 12/110 (10%)

Query: 422 SNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENT---NPRYENG 478
           SN  Y N  +  NP Y N   + T   NP   N        Y+ P N N    N  Y N 
Sbjct: 410 SNAGYSNPGNS-NPGYNNAPNSNTPYNNPPNSNT------PYSNPPNSNPPYSNLPYSNT 462

Query: 479 THEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRN 528
              Y+   L N+   +    + +   +YQ+        ++ + N+P+  N
Sbjct: 463 P--YSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAANLPTANQPAANN 510


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 32.0 bits (73), Expect = 1.7
 Identities = 15/49 (30%), Positives = 30/49 (61%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           + EE E+ KK+ +K +K  ++ ++ ++K KK K   +E  + K+ + AS
Sbjct: 702 ELEEREKAKKELEKLEKALERVEELREKVKKYKALLKERALSKVGEIAS 750



 Score = 32.0 bits (73), Expect = 1.8
 Identities = 30/158 (18%), Positives = 71/158 (44%), Gaps = 18/158 (11%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
           KEE EE +K+ +  +  K+K ++K ++ +++    ++E + +L +        VKE+   
Sbjct: 237 KEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKE-IEELEEK-------VKELK-- 286

Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
                   ++K    E   L++  E         E      ++  N I    KE ++K++
Sbjct: 287 --------ELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEE 338

Query: 837 KKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINR 874
           + ++ KKK K+ +K+    +   ++  + + +   + R
Sbjct: 339 RLEELKKKLKELEKRLEELEERHELYEEAKAKKEELER 376



 Score = 29.6 bits (67), Expect = 8.6
 Identities = 28/142 (19%), Positives = 67/142 (47%), Gaps = 13/142 (9%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQI 779
            E K+ +K+ K+ ++K++K +K+ ++ +K   +E  + KL++ A      +KE+  +   
Sbjct: 459 AELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAE----QLKELEEK--- 511

Query: 780 APCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKK 839
              L     +  EK         +++     E+  L  K+   ++  L K K  + +KK 
Sbjct: 512 ---LKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSL--KKELEKLEELKK-KLAELEKKL 565

Query: 840 KKKKKKKKKKKKKMMKKGYPDV 861
            + +++  +  K++ + G+  V
Sbjct: 566 DELEEELAELLKELEELGFESV 587



 Score = 29.6 bits (67), Expect = 8.7
 Identities = 24/144 (16%), Positives = 65/144 (45%), Gaps = 8/144 (5%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQ 778
           EE  ++ KK+ ++ ++K K+ K+ K+K ++Y    E   +  D   ++    +    E +
Sbjct: 265 EERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDE--LREIEKRLSRLEEE 322

Query: 779 IAPC---LFDIKNDPCEKNNLADR-SEVQR-INHYTTEV-GYLDPKQRFNQIAYLDKEKK 832
           I      + +++        L  +  E+++ +         Y + K +  ++  L K   
Sbjct: 323 INGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLT 382

Query: 833 KKKKKKKKKKKKKKKKKKKKMMKK 856
               +K +K+ ++ +K K+++ ++
Sbjct: 383 GLTPEKLEKELEELEKAKEEIEEE 406



 Score = 29.6 bits (67), Expect = 9.8
 Identities = 32/162 (19%), Positives = 67/162 (41%), Gaps = 31/162 (19%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
           + E  E+  K+K+K+ ++  ++  +   +  +   E E + K           VKE    
Sbjct: 187 RTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKE----------VKE---- 232

Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
                 L ++K    E   L    E    +    E      ++   +I  L KE ++ ++
Sbjct: 233 ------LEELKE---EIEELEKELESLEGSKRKLEE---KIRELEERIEELKKEIEELEE 280

Query: 837 KKKKKKKKKKKKKK----KKMMKKGYPDVLSQMEKELANINR 874
           K K+ K+ K+K ++     +  ++ Y D L ++EK L+ +  
Sbjct: 281 KVKELKELKEKAEEYIKLSEFYEE-YLDELREIEKRLSRLEE 321


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 31.9 bits (73), Expect = 1.7
 Identities = 13/74 (17%), Positives = 34/74 (45%), Gaps = 2/74 (2%)

Query: 687 KNNLADRSEDQRINHYTTEVGRFNQ--IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
           K +      +  I     E+ + ++       +EEEE ++  K+K+++   K +++   +
Sbjct: 153 KEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLAR 212

Query: 745 KKKKYSNEEEGMRK 758
            + K +  E+ +R 
Sbjct: 213 LESKEAALEKQLRL 226



 Score = 31.5 bits (72), Expect = 2.4
 Identities = 24/187 (12%), Positives = 68/187 (36%), Gaps = 17/187 (9%)

Query: 692 DRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
           D +         + V + ++    + ++ +  ++    ++  + + K      +     +
Sbjct: 54  DPTLRDPQISVVSSVTKLSKGPKDEPKQVKIPRQSGLSEEVAEAEAKATSVAAEATTPKS 113

Query: 752 EEEGMRKLRDAASIQCGPVKEVPCEPQIA------PCLFDIKNDPCEKNNLADRSEVQRI 805
            +E +  L +        +KE   +P +         L D   +     NL D  E   I
Sbjct: 114 IQELVEALEELLEE---LLKETASDPVVQELVSIFNDLIDSIKED----NLKDDLE-SLI 165

Query: 806 NHYTTEVGYLDPKQRFNQI-AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQ 864
                E+  L   ++  ++ A  ++E ++  K+K+++   K +++    ++     +  Q
Sbjct: 166 ASAKEELDQL--SKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQ 223

Query: 865 MEKELAN 871
           +  E   
Sbjct: 224 LRLEFER 230


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 31.9 bits (73), Expect = 1.8
 Identities = 24/99 (24%), Positives = 46/99 (46%), Gaps = 12/99 (12%)

Query: 788 NDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKE----KKKKKKKKKKKKK 843
           N+   ++N       ++I++   E      K   + +A L ++    +K+KK  +K    
Sbjct: 374 NELIREHN-------EKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINS 426

Query: 844 KKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINK 882
            +K+ K+ +   K     + ++EK+L NI  TA   INK
Sbjct: 427 LEKEIKQLEAEIKALEKEIKELEKQLTNIEPTADE-INK 464



 Score = 31.1 bits (71), Expect = 3.3
 Identities = 13/45 (28%), Positives = 25/45 (55%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
           E E + ++ KK+ KK ++K ++ + + +KK    E+   K  D A
Sbjct: 92  EIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKA 136



 Score = 30.7 bits (70), Expect = 4.4
 Identities = 13/73 (17%), Positives = 33/73 (45%), Gaps = 2/73 (2%)

Query: 682 NDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKK 741
           N+   ++N    +  +  N    ++     +    KE+ +  +K+KK  +K     +K+ 
Sbjct: 374 NELIREHNEKIDNLKKEKNKAKKKLWLH--LVAELKEDIDAYQKEKKGLEKAINSLEKEI 431

Query: 742 KKKKKKKYSNEEE 754
           K+ + +  + E+E
Sbjct: 432 KQLEAEIKALEKE 444


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 31.6 bits (72), Expect = 1.9
 Identities = 18/92 (19%), Positives = 27/92 (29%), Gaps = 18/92 (19%)

Query: 657 ASIQCGPVKEVPCEPQIAPCLFDIKNDPC--EKNNLADRSEDQRINHYTTEVGRFNQIAY 714
                G V + P          D    P     ++ +  +                    
Sbjct: 299 EPEPTGSVSDRPRHLSS-----DSSPSPPDTSDSDPSTETPPPASL-----------SHS 342

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
           P    E       K+K++  KK+KKKK KK K
Sbjct: 343 PPAAFERPLALSPKRKREGDKKQKKKKSKKLK 374



 Score = 29.3 bits (66), Expect = 9.0
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 729 KKKKKKKKKKKKKKKKKKKKYSN 751
           K+K++  KK+KKKK KK K   N
Sbjct: 356 KRKREGDKKQKKKKSKKLKLTFN 378


>gnl|CDD|214998 smart01054, CaM_binding, Plant calmodulin-binding domain.  The
           sequences featured in this family are found repeated in
           a number of plant calmodulin-binding proteins, and are
           thought to constitute the calmodulin-binding domains..
           Binding of the proteins to calmodulin depends on the
           presence of calcium ions.. These proteins are thought to
           be involved in various processes, such as plant defence
           responses.and stolonisation or tuberization.
          Length = 115

 Score = 30.1 bits (68), Expect = 2.0
 Identities = 12/55 (21%), Positives = 22/55 (40%)

Query: 708 RFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
            F +    D + E E+ +K K  KK    K+  K  + K++ +  E  +      
Sbjct: 6   EFRRGKIEDPKPEGERPRKWKFLKKVILLKRFVKSLEDKRRLNPREPNVLPTEVE 60


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 32.0 bits (73), Expect = 2.0
 Identities = 15/70 (21%), Positives = 28/70 (40%), Gaps = 3/70 (4%)

Query: 679 DIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKK 738
           D +N    + N       +  NH+ TE    N   Y   EE+  +++       ++  K+
Sbjct: 876 DTQNSIRGEENTVKGQYSELQNHHKTEALSQN-NKY--NEEKSAQERMPGADSPEELMKR 932

Query: 739 KKKKKKKKKK 748
            K+ + K K 
Sbjct: 933 AKEYQDKHKG 942


>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
           pleckstrin homology (PH) domain.  The BCR gene is one of
           the two genes in the BCR-ABL complex, which is
           associated with the Philadelphia chromosome, a product
           of a reciprocal translocation between chromosomes 22 and
           9. BCR is a GTPase-activating protein (GAP) for RAC1
           (primarily) and CDC42. The Dbl region of BCR has the
           most RhoGEF activity for Cdc42, and less activity
           towards Rac and Rho. Since BCR possesses both GAP and
           GEF activities, it may function to temporally regulate
           the activity of these GTPases. It also displays
           serine/threonine kinase activity. The BCR protein
           contains multiple domains including an N-terminal kinase
           domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
           domain, and a C-terminal RhoGAP domain. ABR, a related
           smaller protein, is structurally similar to BCR, but
           lacks the N-terminal kinase domain and has GAP activity
           for both Rac and Cdc42. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 179

 Score = 30.8 bits (70), Expect = 2.0
 Identities = 8/45 (17%), Positives = 19/45 (42%)

Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
                    ++K K+   ++KK      +  +K +KK + +E  +
Sbjct: 93  QMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKKLAEQEAAL 137



 Score = 30.4 bits (69), Expect = 3.1
 Identities = 14/44 (31%), Positives = 20/44 (45%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           P   EE  + KK     K+K K+   ++KK     S   E +RK
Sbjct: 85  PKSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRK 128



 Score = 29.2 bits (66), Expect = 6.6
 Identities = 7/37 (18%), Positives = 16/37 (43%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMM 854
           K+  + +    K    ++KK      +  +K +KK+ 
Sbjct: 95  KKAISALKQKIKRDSAERKKAASSGSRAIEKLRKKLA 131



 Score = 29.2 bits (66), Expect = 7.0
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
            E+  + KK     K+K K+   ++KK  S+    + KLR
Sbjct: 88  SEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLR 127


>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
           N-terminus.  This is an N-terminal domain of DNA
           polymerase III polC subunit A that is found only in
           Firmicutes. DNA polymerase polC-type III enzyme
           functions as the 'replicase' in low G + C Gram-positive
           bacteria. Purine asymmetry is a characteristic of
           organisms with a heterodimeric DNA polymerase III
           alpha-subunit constituted by polC which probably plays a
           direct role in the maintenance of strand-biased gene
           distribution; since, among prokaryotic genomes, the
           distribution of genes on the leading and lagging strands
           of the replication fork is known to be biased. The
           domain is associated with DNA_pol3_alpha pfam07733.
          Length = 180

 Score = 30.8 bits (70), Expect = 2.1
 Identities = 14/29 (48%), Positives = 20/29 (68%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
            KEEEE K  ++  +  KKK+ +KKKK+K
Sbjct: 151 QKEEEEAKLAEEALEALKKKEAEKKKKEK 179


>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
          Length = 822

 Score = 31.9 bits (73), Expect = 2.1
 Identities = 10/35 (28%), Positives = 17/35 (48%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
            P     E++KK+K+ K + +  K    K  +K K
Sbjct: 759 NPKCNYTEKQKKEKESKSELEALKGVGAKTAEKLK 793



 Score = 31.5 bits (72), Expect = 2.6
 Identities = 12/44 (27%), Positives = 19/44 (43%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
              EK+KK+K+ K + +  K    K  +K      E +  L  A
Sbjct: 763 NYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAA 806


>gnl|CDD|225054 COG2143, COG2143, Thioredoxin-related protein [Posttranslational
           modification, protein turnover, chaperones].
          Length = 182

 Score = 30.6 bits (69), Expect = 2.2
 Identities = 13/40 (32%), Positives = 18/40 (45%)

Query: 813 GYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
           GY+ P+Q    + Y+   K K  K  +   KK K   K K
Sbjct: 134 GYMPPEQFLAVLKYVADGKYKDTKTDEDFTKKLKAYIKYK 173


>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
           domain [Chromatin structure and dynamics].
          Length = 211

 Score = 31.0 bits (70), Expect = 2.2
 Identities = 24/130 (18%), Positives = 51/130 (39%), Gaps = 21/130 (16%)

Query: 727 KKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDI 786
            K   +K K+   ++K+   K+ +++ E  ++ ++         +     P  AP    I
Sbjct: 102 GKLLSEKWKELTDEEKEPYYKEANSDRERYQREKE---------EYNKKLPNKAPIGPFI 152

Query: 787 KNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKK 846
           +N+P        R +V+  +    +   ++       I+    E  + KKKK   K KK 
Sbjct: 153 ENEP------KIRPKVEGPS---PDKALVEE---TKIISKAWSELDESKKKKYIDKYKKL 200

Query: 847 KKKKKKMMKK 856
           K++      +
Sbjct: 201 KEEYDSFYPE 210



 Score = 30.6 bits (69), Expect = 3.0
 Identities = 15/62 (24%), Positives = 24/62 (38%)

Query: 683 DPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKK 742
            P  +N    R + +  +     V     I+    E +E KKKK   K KK K++     
Sbjct: 149 GPFIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFY 208

Query: 743 KK 744
            +
Sbjct: 209 PE 210



 Score = 29.1 bits (65), Expect = 8.1
 Identities = 14/44 (31%), Positives = 18/44 (40%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           PDK   EE K   K   +  + KKKK   K KK     +    +
Sbjct: 167 PDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFYPE 210


>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit.  This family includes
           prefoldin subunits that are not detected by pfam02996.
          Length = 106

 Score = 29.5 bits (67), Expect = 2.2
 Identities = 14/44 (31%), Positives = 29/44 (65%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
           DKEE +E+ +++K+  +K+ K  +K+ +K +K+    +E + KL
Sbjct: 59  DKEEVKEELEERKETLEKEIKTLEKQLEKLEKELEELKEELYKL 102


>gnl|CDD|180161 PRK05609, nusG, transcription antitermination protein NusG;
           Validated.
          Length = 181

 Score = 30.5 bits (70), Expect = 2.3
 Identities = 15/52 (28%), Positives = 22/52 (42%), Gaps = 9/52 (17%)

Query: 685 CEKN---NLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKK 733
            EK    NL +R E   +           ++  P +E  E K  KKKK ++K
Sbjct: 17  YEKKVKENLENRIETLGMED------LIGEVLVPTEEVVEVKNGKKKKVERK 62


>gnl|CDD|227474 COG5145, RAD14, DNA excision repair protein [DNA replication,
           recombination, and repair].
          Length = 292

 Score = 31.1 bits (70), Expect = 2.3
 Identities = 15/40 (37%), Positives = 28/40 (70%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQME 866
           LD+EK++++K K  +K+KK +KK K++ +K      S+M+
Sbjct: 214 LDREKQRREKMKDDRKEKKLEKKIKELRRKTRTSNYSRMD 253



 Score = 30.7 bits (69), Expect = 3.2
 Identities = 12/42 (28%), Positives = 25/42 (59%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
           + EK++++K K  +K+KK +KK K+ ++K          +R+
Sbjct: 215 DREKQRREKMKDDRKEKKLEKKIKELRRKTRTSNYSRMDVRE 256


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 29.5 bits (67), Expect = 2.3
 Identities = 15/38 (39%), Positives = 22/38 (57%), Gaps = 6/38 (15%)

Query: 825 AYLDKEKKKKKKKKK------KKKKKKKKKKKKKMMKK 856
            Y  +EKKKKKKK +      KK+KK+  +++K   K 
Sbjct: 35  EYFKREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72



 Score = 29.1 bits (66), Expect = 3.6
 Identities = 16/40 (40%), Positives = 26/40 (65%), Gaps = 6/40 (15%)

Query: 715 PDKEEEEEKKKKKKKKKKKKK------KKKKKKKKKKKKK 748
           P+K  +E  K++KKKKKKK +      KK+KK+  +++K 
Sbjct: 29  PEKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQRKA 68



 Score = 28.0 bits (63), Expect = 9.5
 Identities = 14/34 (41%), Positives = 22/34 (64%), Gaps = 8/34 (23%)

Query: 723 KKKKKKKKKK--------KKKKKKKKKKKKKKKK 748
           K++KKKKKKK        KK+KK+  +++K  +K
Sbjct: 38  KREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQK 71


>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
          Length = 212

 Score = 30.8 bits (70), Expect = 2.3
 Identities = 14/33 (42%), Positives = 19/33 (57%)

Query: 819 QRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKK 851
           + FN    L + K   K   KK+KKK+ K+KKK
Sbjct: 180 RAFNAYRTLRQAKLNAKFVGKKEKKKQAKEKKK 212


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 30.9 bits (70), Expect = 2.4
 Identities = 14/36 (38%), Positives = 23/36 (63%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
            E E+ KKK++ + KKKK+ ++ K  +KK  +Y  E
Sbjct: 85  LEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKE 120



 Score = 30.9 bits (70), Expect = 2.7
 Identities = 10/32 (31%), Positives = 20/32 (62%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
           P+ E+E      + ++ KKK++ + KKKK+ +
Sbjct: 74  PEDEKELSASSLEAEQAKKKEEAEAKKKKEME 105



 Score = 30.5 bits (69), Expect = 3.5
 Identities = 11/34 (32%), Positives = 20/34 (58%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
                 E ++ KKK++ + KKKK+ ++ K  +KK
Sbjct: 80  LSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113



 Score = 29.7 bits (67), Expect = 7.0
 Identities = 9/33 (27%), Positives = 20/33 (60%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
             ++E+E      + ++ KKK++ + KKKK+ +
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEME 105



 Score = 29.3 bits (66), Expect = 9.4
 Identities = 11/45 (24%), Positives = 26/45 (57%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           +   ++ E++K+      + ++ KKK++ + KKKK   E + ++K
Sbjct: 68  FLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQK 112


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 31.6 bits (72), Expect = 2.5
 Identities = 25/157 (15%), Positives = 59/157 (37%), Gaps = 23/157 (14%)

Query: 694 SEDQRINHYTTEVG--RFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
             +   +         R  ++     E EEE+ K  + ++++   + ++ +   +++ + 
Sbjct: 787 ITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNL 846

Query: 752 EEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTE 811
           ++E   +L +    +    +E+    +      + KNDP + N L    +V         
Sbjct: 847 KQEIKLELSEIQEAE----EEIQNINENKNEFVEFKNDPKKLNKLIIAKDVL-------- 894

Query: 812 VGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKK 848
                       +   D+ K+ +K  KKKKK  +K  
Sbjct: 895 ---------IKLVISSDEIKQDEKTTKKKKKDLEKTD 922


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 30.8 bits (70), Expect = 2.5
 Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMK 855
             +KKK + KKK  K  KK +  ++ 
Sbjct: 152 LAEKKKPRSKKKSSKNSKKLRIDLVG 177



 Score = 30.8 bits (70), Expect = 2.9
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKK 743
           ++ E   +KKK + KKK  K  KK +  
Sbjct: 147 ERTEILAEKKKPRSKKKSSKNSKKLRID 174



 Score = 30.0 bits (68), Expect = 4.4
 Identities = 11/28 (39%), Positives = 14/28 (50%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
           E  E   +KKK + KKK  K  KK +  
Sbjct: 147 ERTEILAEKKKPRSKKKSSKNSKKLRID 174



 Score = 30.0 bits (68), Expect = 4.6
 Identities = 11/28 (39%), Positives = 14/28 (50%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
           E  E   +KKK + KKK  K  KK +  
Sbjct: 147 ERTEILAEKKKPRSKKKSSKNSKKLRID 174


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap is
           involved in recombination. Rap (recombination adept with
           plasmid) increases lambda-by-plasmid recombination
           catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 30.4 bits (69), Expect = 2.6
 Identities = 17/59 (28%), Positives = 32/59 (54%), Gaps = 7/59 (11%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE-----EGMRKLRDAAS--IQCG 768
           K E+ ++KK+K + + ++++ K +K+K K +    +E         +LRDA    I CG
Sbjct: 35  KREKAQEKKRKAEAQAERRELKARKEKLKTRSDWLKEAQAAVNKYIRLRDAGLPCISCG 93


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 30.6 bits (69), Expect = 2.6
 Identities = 15/39 (38%), Positives = 20/39 (51%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
            Y   ++E E      ++ K+KKKKK KKKK  K    N
Sbjct: 71  GYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPKSLLLN 109


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 31.0 bits (70), Expect = 2.6
 Identities = 31/164 (18%), Positives = 48/164 (29%), Gaps = 18/164 (10%)

Query: 699 INHYTTEVGR-FNQIAYPD-----KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
           I+  T E+GR    +   +     K    +   KK   K K   KK  KKK  K   S +
Sbjct: 180 IDEVTREIGRVLAAVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAK 239

Query: 753 EEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEV 812
           +      + A   +    K +    +              K      ++           
Sbjct: 240 KAAKTAAKAAKKAKKTAKKALKKAAKAV------------KKAAKKAAKAAAKAAKGAAK 287

Query: 813 GYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
                 +   +         K K   K  K+  K KK KK+ KK
Sbjct: 288 ATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 30.8 bits (70), Expect = 2.6
 Identities = 10/31 (32%), Positives = 16/31 (51%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
           +     +K+K  KK  KK    KK+K+K+  
Sbjct: 219 QSSSPSRKRKAPKKVAKKVAAAKKRKQKRSP 249



 Score = 30.4 bits (69), Expect = 3.6
 Identities = 11/33 (33%), Positives = 16/33 (48%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
              +     +K+K  KK  KK    KK+K+K S
Sbjct: 216 PTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRS 248



 Score = 30.0 bits (68), Expect = 5.7
 Identities = 10/34 (29%), Positives = 16/34 (47%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
           +     +K+K  KK  KK    KK+K+K+     
Sbjct: 219 QSSSPSRKRKAPKKVAKKVAAAKKRKQKRSPPPP 252



 Score = 29.7 bits (67), Expect = 6.1
 Identities = 10/38 (26%), Positives = 16/38 (42%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
             +     +K+K  KK  KK    KK+K+K+       
Sbjct: 217 TPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRSPPPPFN 254


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 30.5 bits (69), Expect = 2.6
 Identities = 11/61 (18%), Positives = 24/61 (39%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
            K ++  KK   + +KK  + + +   +KK  + +          +    +  P +E P 
Sbjct: 122 SKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEAPA 181

Query: 776 E 776
           E
Sbjct: 182 E 182



 Score = 29.0 bits (65), Expect = 7.5
 Identities = 15/32 (46%), Positives = 18/32 (56%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           KEE+E K   KK K  K KK  KK   + +KK
Sbjct: 107 KEEKEGKVNAKKDKLSKAKKAAKKAALEAEKK 138


>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
          Length = 160

 Score = 30.4 bits (69), Expect = 2.7
 Identities = 12/63 (19%), Positives = 23/63 (36%), Gaps = 7/63 (11%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
             +  E+  K+       K + KKK +    K++ +         R A  ++  P + V 
Sbjct: 98  KSRCREDFLKRVAANDAIKAEAKKKGELPSTKRQPAQP-------RPAGFVKPKPTEVVT 150

Query: 775 CEP 777
             P
Sbjct: 151 VRP 153


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 31.2 bits (70), Expect = 2.8
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMM 854
           DK+  K K KK+K  +++++KK K +M
Sbjct: 522 DKDVNKSKNKKRKVDEEEEEKKLKMIM 548



 Score = 30.0 bits (67), Expect = 6.6
 Identities = 19/47 (40%), Positives = 22/47 (46%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
            EEEEEKK K      K+KK  KK K    KK    E   +K +  A
Sbjct: 536 DEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIA 582



 Score = 29.7 bits (66), Expect = 9.1
 Identities = 13/39 (33%), Positives = 24/39 (61%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           E +K   K K KK+K  +++++KK K    SN+++ + K
Sbjct: 520 EADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYK 558


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 31.4 bits (72), Expect = 2.8
 Identities = 33/176 (18%), Positives = 71/176 (40%), Gaps = 24/176 (13%)

Query: 697 QRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE--- 753
           QR+     E+ +          E++EKKK+KKK KKKK K  + K  K  K+ + ++   
Sbjct: 376 QRVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQDLEL 435

Query: 754 ---------EGMRKLR--DAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADR--- 799
                    E ++  R    A +     + V          F+      EK  L      
Sbjct: 436 LKNVWGEILESLKAQRKSLRALLVNS--EPVAASEDTVVLAFE-YEIHFEKAMLDKELND 492

Query: 800 ---SEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
              + + ++   + E+  + P+  + +I     +K K ++++++++++     + K
Sbjct: 493 TIENILSKLLGKSVEIVAV-PEDEWQKIREEFLQKHKNEEEEEEEEEELPLIPEAK 547



 Score = 30.2 bits (69), Expect = 6.2
 Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
           +Q   Q+         ++K+KKK+KKK KKKK K    K Y  VL +  ++
Sbjct: 382 EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYK-VLKEATRQ 431


>gnl|CDD|183703 PRK12724, PRK12724, flagellar biosynthesis regulator FlhF;
           Provisional.
          Length = 432

 Score = 31.1 bits (70), Expect = 2.9
 Identities = 31/166 (18%), Positives = 72/166 (43%), Gaps = 28/166 (16%)

Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPV 770
           QI  P+K+   EK ++K +  K+  K+K   + ++KK     +    +L +         
Sbjct: 53  QIGIPEKQASREKIERKLQDLKELLKQKSYTEPERKKTLQTLKPLSERLEE--------- 103

Query: 771 KEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKE 830
           KE       +  L ++  +P       +R            VG    K+ F + ++L+ E
Sbjct: 104 KESAIYEIESFELEEVITEP-------ERP-----------VGLSFEKELFEKNSFLESE 145

Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKGY-PDVLSQMEKELANINRT 875
               +K+K    ++  ++  ++ M + Y  ++ S++E+ L+ +++ 
Sbjct: 146 TTIVRKEKDSPLQRLGERLVREGMSQSYVEEMASKLEERLSPVDQG 191


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 30.5 bits (69), Expect = 2.9
 Identities = 10/42 (23%), Positives = 15/42 (35%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
           E EK K  K K   +   K   + K  +K+    +     R 
Sbjct: 17  ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERV 58



 Score = 30.5 bits (69), Expect = 3.3
 Identities = 9/44 (20%), Positives = 18/44 (40%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
            ++ +  K K   +   K   + K  +K K    +   E ++KL
Sbjct: 18  SEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKL 61



 Score = 29.8 bits (67), Expect = 4.8
 Identities = 13/24 (54%), Positives = 14/24 (58%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKK 744
             K  K   KK KK KKKKKKK+ 
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKRS 217



 Score = 29.4 bits (66), Expect = 6.6
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKK 741
             + +K   KK KK KKKKKKK+ 
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKRS 217



 Score = 29.0 bits (65), Expect = 9.4
 Identities = 14/44 (31%), Positives = 22/44 (50%), Gaps = 11/44 (25%)

Query: 716 DKEEEEEKKKKKKKKKKK-----------KKKKKKKKKKKKKKK 748
           +K   ++K  +K K   K            ++KK++KKK KKKK
Sbjct: 34  NKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77



 Score = 29.0 bits (65), Expect = 9.9
 Identities = 19/44 (43%), Positives = 23/44 (52%), Gaps = 4/44 (9%)

Query: 809 TTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
           T +VG+LD  +      Y    K  K   KK KK KKKKKKK+ 
Sbjct: 178 TEDVGFLDSFKPL----YTWHYKGDKSSAKKDKKSKKKKKKKRS 217


>gnl|CDD|99975 cd03802, GT1_AviGT4_like, This family is most closely related to
           the GT1 family of glycosyltransferases. aviGT4 in
           Streptomyces viridochromogenes has been shown to be
           involved in biosynthesis of oligosaccharide antibiotic
           avilamycin A. Inactivation of aviGT4 resulted in a
           mutant that accumulated a novel avilamycin derivative
           lacking the terminal eurekanate residue.
          Length = 335

 Score = 31.0 bits (71), Expect = 2.9
 Identities = 12/41 (29%), Positives = 13/41 (31%), Gaps = 4/41 (9%)

Query: 227 AEAVDIIHNHSTDEPLFLYLAHA----ATHSANPYEPLQAP 263
           A   DI+HNHS   PL            T    P   L   
Sbjct: 85  AGDFDIVHNHSLHLPLPFARPLPVPVVTTLHGPPDPELLKL 125


>gnl|CDD|191968 pfam08213, DUF1713, Mitochondrial domain of unknown function
           (DUF1713).  This domain is found at the C terminal end
           of mitochondrial proteins of unknown function.
          Length = 34

 Score = 27.3 bits (61), Expect = 3.0
 Identities = 9/29 (31%), Positives = 18/29 (62%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
            +K+++K KK K KK +K+ +  ++K   
Sbjct: 6   LRKRRRKMKKHKYKKLRKRTRALRRKLDK 34



 Score = 26.2 bits (58), Expect = 8.4
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMK 855
           L K ++K KK K KK +K+ +  ++K+ K
Sbjct: 6   LRKRRRKMKKHKYKKLRKRTRALRRKLDK 34


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 31.1 bits (70), Expect = 3.0
 Identities = 14/55 (25%), Positives = 29/55 (52%), Gaps = 2/55 (3%)

Query: 705 EVGRFNQIAYPDKEE--EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMR 757
           E+ +  +     K E    +K ++++KKK+K++  KK   K KK++   +E+   
Sbjct: 539 EIYKKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDG 593



 Score = 30.0 bits (67), Expect = 8.0
 Identities = 13/48 (27%), Positives = 26/48 (54%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASI 765
           E +   K ++++KKK+K++  KK   K KK+    +E +  +   +S 
Sbjct: 553 ELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSS 600


>gnl|CDD|171438 PRK12363, PRK12363, phosphoglycerol transferase I; Provisional.
          Length = 703

 Score = 31.0 bits (70), Expect = 3.0
 Identities = 64/272 (23%), Positives = 98/272 (36%), Gaps = 49/272 (18%)

Query: 59  PPHIIFILADDLG---WNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
             +I++I  + L    +++  F GL     PN+  LA   + ++N  +    T      +
Sbjct: 156 RKNIVWIYGESLERTYFDEDVFPGL----MPNLTRLATEAVDVRNLAS----TEGSGWTI 207

Query: 116 TGKHPIHTGMQHNVLYGCE----RGGLPLSE-KILPQYLKELGYRTRIVGKWHLGFYKKE 170
            G      G+      G E    R G  L E + L  YLK+ GY    VG     F  K 
Sbjct: 208 AGMVASMCGVPLTTAQGDENSMDRMGHFLPEARCLGDYLKDQGYTNHYVGGADASFAGKG 267

Query: 171 YTPTFRGFESHLGYWTGHQ-DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
                +   SH G+   H  +YF H             +    AW +H     DV   +A
Sbjct: 268 -----KFLSSH-GFDEVHDVNYFLHDK-------GVAPKHFS-AWGVH----DDVLLDDA 309

Query: 230 VDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDH-YLNIHRHIEDFKRSKFAAILHK 287
            D     S   +P  L      TH    + P       Y +    I          +LH 
Sbjct: 310 YDEFETLSRAGQPFMLTTLTMDTHHPAGHLPSACKGQRYDSPLGDI---------GMLHA 360

Query: 288 L---DESVGKVVEALEQRRMLSNSIIVFVSDN 316
           +   D  +G++V+ +   R   N+IIV  SD+
Sbjct: 361 IKCSDRLIGQLVDRIRNSRYGKNTIIVIASDH 392


>gnl|CDD|191868 pfam07839, CaM_binding, Plant calmodulin-binding domain.  The
           sequences featured in this family are found repeated in
           a number of plant calmodulin-binding proteins, and are
           thought to constitute the calmodulin-binding domains.
           Binding of the proteins to calmodulin depends on the
           presence of calcium ions. These proteins are thought to
           be involved in various processes, such as plant defence
           responses and stolonisation or tuberization.
          Length = 115

 Score = 29.3 bits (66), Expect = 3.0
 Identities = 15/44 (34%), Positives = 21/44 (47%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           KEE   KK+  K +K+K K     KK    K++    E +RK  
Sbjct: 5   KEERRSKKEDPKPEKEKPKSWSNLKKVILLKRFVKSLEKVRKFN 48


>gnl|CDD|219748 pfam08214, KAT11, Histone acetylation protein.  Histone acetylation
           is required in many cellular processes including
           transcription, DNA repair, and chromatin assembly. This
           family contains the fungal KAT11 protein (previously
           known as RTT109) which is required for H3K56
           acetylation. Loss of KAT11 results in the loss of H3K56
           acetylation, both on bulk histone and on chromatin.
           KAT11 and H3K56 acetylation appear to correlate with
           actively transcribed genes and associate with the
           elongating form of Pol II in yeast. This family also
           incorporates the p300/CBP histone acetyltransferase
           domain which has different catalytic properties and
           cofactor regulation to KAT11.
          Length = 341

 Score = 30.8 bits (70), Expect = 3.1
 Identities = 11/53 (20%), Positives = 17/53 (32%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEV 773
                K    KK+ K+ KK    ++    S     ++ L D         KE 
Sbjct: 271 VINPPKLVVSKKQYKRAKKAITGEEYDTLSGAASLVQNLMDKLYETMLKKKED 323


>gnl|CDD|233955 TIGR02627, rhamnulo_kin, rhamnulokinase.  This model describes
           rhamnulokinase, an enzyme that catalyzes the second step
           in rhamnose catabolism.
          Length = 454

 Score = 30.9 bits (70), Expect = 3.1
 Identities = 18/79 (22%), Positives = 29/79 (36%), Gaps = 16/79 (20%)

Query: 440 RYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSEN 499
            YEN   +   +  +R++NG                +NG   ++I  LE  I       N
Sbjct: 16  SYENECQKLTLEEIHRFKNGLV-------------SQNGHECWDIDALEQEIR---LGLN 59

Query: 500 RSNDNSYQNEIDGIDVWSV 518
           + +      +  GID W V
Sbjct: 60  KVDAEGIAPDSIGIDTWGV 78


>gnl|CDD|240709 cd12263, RRM_ABT1_like, RNA recognition motif found in activator of
           basal transcription 1 (ABT1) and similar proteins.  This
           subfamily corresponds to the RRM of novel nuclear
           proteins termed ABT1 and its homologous counterpart,
           pre-rRNA-processing protein ESF2 (eighteen S factor 2),
           from yeast. ABT1 associates with the TATA-binding
           protein (TBP) and enhances basal transcription activity
           of class II promoters. Meanwhile, ABT1 could be a
           transcription cofactor that can bind to DNA in a
           sequence-independent manner. The yeast ABT1 homolog,
           ESF2, is a component of 90S preribosomes and 5'
           ETS-based RNPs. It is previously identified as a
           putative partner of the TATA-element binding protein.
           However, it is primarily localized to the nucleolus and
           physically associates with pre-rRNA processing factors.
           ESF2 may play a role in ribosome biogenesis. It is
           required for normal pre-rRNA processing, as well as for
           SSU processome assembly and function. Both ABT1 and ESF2
           contain an RNA recognition motif (RRM), also termed RBD
           (RNA binding domain) or RNP (ribonucleoprotein domain).
           .
          Length = 98

 Score = 29.1 bits (66), Expect = 3.2
 Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 8/39 (20%)

Query: 826 YLDKEKKKKKKKKKKKKKKKKKK--------KKKKMMKK 856
           YL  E   K+K++KKK   KKKK        + KK+ K+
Sbjct: 31  YLQPEDPAKRKRRKKKGGNKKKKFTEGWVEFEDKKVAKR 69



 Score = 27.6 bits (62), Expect = 8.8
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 833 KKKKKKKKKKKKKKKKKKKKMMKKGY 858
           + +   K+K++KKK   KKK   +G+
Sbjct: 33  QPEDPAKRKRRKKKGGNKKKKFTEGW 58


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 30.7 bits (70), Expect = 3.4
 Identities = 15/64 (23%), Positives = 30/64 (46%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
           +  EE E++ K+ + +K+  + +KK  +KK KK   + +    L +          E P 
Sbjct: 69  ELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEEPEPPL 128

Query: 776 EPQI 779
            P++
Sbjct: 129 RPRL 132


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 31.1 bits (71), Expect = 3.4
 Identities = 17/62 (27%), Positives = 31/62 (50%), Gaps = 12/62 (19%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKK----------KKKKKKKKKKYSNEEEGMRKLRDAAS 764
           PD+E  E  K K + KK++K   +          ++K K+KK+K +   + M+++     
Sbjct: 744 PDRERAELNKLKAQLKKERKGAMRELRKDNRFIAREKLKEKKEKDAEYHKKMKRI--VNE 801

Query: 765 IQ 766
           IQ
Sbjct: 802 IQ 803


>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
           Members of the NSCC2 family have been sequenced from
           various yeast, fungal and animals species including
           Saccharomyces cerevisiae, Drosophila melanogaster and
           Homo sapiens. These proteins are the Sec62 proteins,
           believed to be associated with the Sec61 and Sec63
           constituents of the general protein secretary systems of
           yeast microsomes. They are also the non-selective cation
           (NS) channels of the mammalian cytoplasmic membrane. The
           yeast Sec62 protein has been shown to be essential for
           cell growth. The mammalian NS channel proteins has been
           implicated in platelet derived growth factor(PGDF)
           dependent single channel current in fibroblasts. These
           channels are essentially closed in serum deprived
           tissue-culture cells and are specifically opened by
           exposure to PDGF. These channels are reported to exhibit
           equal selectivity for Na+, K+ and Cs+ with low
           permeability to Ca2+, and no permeability to anions
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 232

 Score = 30.3 bits (68), Expect = 3.5
 Identities = 13/25 (52%), Positives = 16/25 (64%)

Query: 727 KKKKKKKKKKKKKKKKKKKKKKYSN 751
           K K   KKK K KK KKK+ K+ +N
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKREAN 232



 Score = 29.1 bits (65), Expect = 8.2
 Identities = 13/23 (56%), Positives = 15/23 (65%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKK 744
           EK K   KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229



 Score = 29.1 bits (65), Expect = 8.2
 Identities = 13/23 (56%), Positives = 15/23 (65%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKK 852
           EK K   KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229


>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 158 and 187 amino
           acids in length. This family includes the PcfB protein.
          Length = 137

 Score = 29.5 bits (67), Expect = 3.5
 Identities = 14/26 (53%), Positives = 19/26 (73%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
           KK KK K+++  KKK++K K K KKK
Sbjct: 112 KKLKKFKEEQAAKKKERKDKVKNKKK 137



 Score = 29.1 bits (66), Expect = 4.6
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 827 LDKEKKKKKKKKKKKKKKKKKKKK 850
           L K K+++  KKK++K K K KKK
Sbjct: 114 LKKFKEEQAAKKKERKDKVKNKKK 137


>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein.  This model
           describes nucleoid occlusion protein, a close homolog to
           ParB chromosome partitioning proteins including Spo0J in
           Bacillus subtilis. Its gene often is located near the
           gene for the Spo0J ortholog. This protein bind a
           specific DNA sequence and blocks cytokinesis from
           happening until chromosome segregation is complete.
          Length = 255

 Score = 30.6 bits (70), Expect = 3.5
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKK 742
           K+ EE  KK  +K +K KKKKK++K 
Sbjct: 185 KQTEELIKKLLEKPEKPKKKKKRRKG 210



 Score = 30.2 bits (69), Expect = 4.4
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMK 855
           E+  KK  +K +K KKKKK++K   K
Sbjct: 188 EELIKKLLEKPEKPKKKKKRRKGFSK 213



 Score = 29.4 bits (67), Expect = 8.5
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKK 743
           + EE  KK  +K +K KKKKK++K 
Sbjct: 186 QTEELIKKLLEKPEKPKKKKKRRKG 210


>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
           beta and delta phosphatidylinositol 3-kinases (PI3Ks).
           PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
           regulate cell processes such as cell growth,
           differentiation, proliferation, and motility.  PI3Ks
           work on phosphorylation of phosphatidylinositol,
           phosphatidylinositide (4)P (PtdIns (4)P),2 or
           PtdIns(4,5)P2. Specifically they phosphorylate the D3
           hydroxyl group of phosphoinositol lipids on the inositol
           ring. There are 3 classes of PI3Ks based on structure,
           regulation, and specificity. All classes contain a C2
           domain, a PIK domain, and a kinase catalytic domain.
           The members here are class I, beta and delta isoforms of
           PI3Ks and contain both a Ras-binding domain and a
           p85-binding domain.  Class II PI3Ks contain both of
           these as well as a PX domain, and a C-terminal C2 domain
           containing a nuclear localization signal.  C2 domains
           fold into an 8-standed beta-sandwich that can adopt 2
           structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions.  Members have a type-I
           topology.
          Length = 173

 Score = 30.0 bits (68), Expect = 3.5
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 830 EKKKKKKKKKKKKKKKKKKKKKKM 853
           E  KK K K+ +K + KKKKKK  
Sbjct: 84  EVSKKAKGKRSRKNQTKKKKKKDD 107



 Score = 30.0 bits (68), Expect = 3.5
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 832 KKKKKKKKKKKKKKKKKKKKKMM 854
             KK K K+ +K + KKKKKK  
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKKDD 107



 Score = 30.0 bits (68), Expect = 3.6
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKK 743
           E  KK K K+ +K + KKKKKK
Sbjct: 84  EVSKKAKGKRSRKNQTKKKKKK 105



 Score = 29.2 bits (66), Expect = 6.0
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 728 KKKKKKKKKKKKKKKKKKKKKYSN 751
             KK K K+ +K + KKKKKK  N
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKKDDN 108


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
           appears in eukaryotes as well as bacteria and tends to
           be found near the C-terminus of the metalloprotease M16C
           (pfam05193).
          Length = 248

 Score = 30.3 bits (69), Expect = 3.6
 Identities = 12/48 (25%), Positives = 26/48 (54%), Gaps = 2/48 (4%)

Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKK--KKKKKKKKYSNEEEGMRKL 759
             +K  EEE++K +KKK    ++ K+K  ++  + ++  +  E +  L
Sbjct: 7   LEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQSTPEDLSCL 54


>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
           MW N terminal.  This domain is found in eukaryotes. This
           domain is about 90 amino acids in length. This domain is
           found associated with pfam00076. This domain is part of
           U1 snRNP, which is the pre-mRNA binding protein of the
           penta-snRNP spliceosome complex. It extends over a
           distance of 180 A from its RNA binding domain, wraps
           around the core domain of U1 snRNP consisting of the
           seven Sm proteins and finally contacts U1-C, which is
           crucial for 5'-splice-site recognition.
          Length = 94

 Score = 28.8 bits (65), Expect = 3.6
 Identities = 11/35 (31%), Positives = 25/35 (71%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           P  E  E   +K++++K++KK+K +KK +++ K++
Sbjct: 54  PPPEPTETWLEKREREKREKKEKLEKKLEEELKEW 88


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 31.1 bits (70), Expect = 3.6
 Identities = 17/60 (28%), Positives = 22/60 (36%), Gaps = 14/60 (23%)

Query: 712 IAYPDKEEEEEKK--------------KKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMR 757
             YP   EE   K              K   + K+K K+K    K+ K +  S E EG R
Sbjct: 765 FEYPRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGER 824


>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
           II/NMEMI.  CRISPR loci appear to be mobile elements with
           a wide host range. This model represents a protein found
           only in CRISPR-containing species, near other
           CRISPR-associated proteins (cas), as part of the NMENI
           subtype of CRISPR/Cas locus. The species range so far
           for this protein is animal pathogens and commensals only
           [Mobile and extrachromosomal element functions, Other].
          Length = 805

 Score = 30.9 bits (70), Expect = 3.7
 Identities = 9/35 (25%), Positives = 16/35 (45%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
           EE+     K+  K++ KK + K K+      +E  
Sbjct: 525 EEQGTNFGKRNSKERYKKNEDKIKEFASALGKEIL 559


>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694).  Family of
           hypothetical bacterial proteins.
          Length = 278

 Score = 30.3 bits (69), Expect = 3.8
 Identities = 17/33 (51%), Positives = 24/33 (72%), Gaps = 1/33 (3%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDV 861
            E+  K+KKKKKKKKKKK++ ++ M +K   DV
Sbjct: 82  AEELLKEKKKKKKKKKKKEELREWMCEKFI-DV 113


>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family.  Members of this family are
           short proteins that are rich in aspartate, glutamate,
           lysine and arginine. Although the function of these
           proteins is unknown, they are found to be ubiquitously
           expressed.
          Length = 38

 Score = 27.2 bits (61), Expect = 4.0
 Identities = 11/26 (42%), Positives = 13/26 (50%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKK 747
           E+  KK  KKKKK  K + K   K  
Sbjct: 12  ERNAKKAAKKKKKGAKSQLKAAAKAL 37



 Score = 26.4 bits (59), Expect = 6.6
 Identities = 11/29 (37%), Positives = 15/29 (51%), Gaps = 2/29 (6%)

Query: 827 LDKEK--KKKKKKKKKKKKKKKKKKKKKM 853
             +E+  KK  KKKKK  K + K   K +
Sbjct: 9   QARERNAKKAAKKKKKGAKSQLKAAAKAL 37


>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
           structure and biogenesis].
          Length = 67

 Score = 28.1 bits (63), Expect = 4.0
 Identities = 10/28 (35%), Positives = 20/28 (71%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           EK  +K+K+KK   +K+K K+ +K+++ 
Sbjct: 39  EKPSEKRKRKKAAARKRKFKRLRKEQQR 66



 Score = 27.3 bits (61), Expect = 7.0
 Identities = 9/30 (30%), Positives = 20/30 (66%)

Query: 826 YLDKEKKKKKKKKKKKKKKKKKKKKKKMMK 855
           + +K  +K+K+KK   +K+K K+ +K+  +
Sbjct: 37  FYEKPSEKRKRKKAAARKRKFKRLRKEQQR 66


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 30.2 bits (68), Expect = 4.2
 Identities = 10/55 (18%), Positives = 27/55 (49%)

Query: 707 GRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
            R  +     + E+ +K K+ ++KK+++K++  + K+      S+  +     +D
Sbjct: 161 KRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLSTKD 215


>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020).  This
           family of fungal proteins is conserved towards the
           C-terminus of HMG domains. The function is not known.
          Length = 49

 Score = 27.2 bits (60), Expect = 4.3
 Identities = 12/38 (31%), Positives = 18/38 (47%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
              E  K    + + KKK KKK  K+  K+K    ++E
Sbjct: 11  ANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDAYIDDE 48


>gnl|CDD|215856 pfam00318, Ribosomal_S2, Ribosomal protein S2. 
          Length = 205

 Score = 29.8 bits (68), Expect = 4.3
 Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 1/50 (2%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE 772
           K  K+ ++  KK+  K K++ +K +KY    + M+KL D   +   P KE
Sbjct: 101 KSIKRLEELSKKEALKLKRELEKLEKYLGGIKNMKKLPDLVIV-VDPNKE 149


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 30.8 bits (69), Expect = 4.4
 Identities = 13/48 (27%), Positives = 33/48 (68%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           K EE  +K K++ ++K +++++++K+K+K+++   E E  R  + ++S
Sbjct: 580 KREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAKASSS 627


>gnl|CDD|197773 smart00527, HMG17, domain in high mobilty group proteins HMG14 and
           HMG 17. 
          Length = 88

 Score = 28.4 bits (63), Expect = 4.5
 Identities = 15/40 (37%), Positives = 21/40 (52%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           P K E + KK   KKK +    K KK  K K+ + + +EE
Sbjct: 29  PPKPEPKPKKAAAKKKSENVAAKGKKGAKGKQTEEAGKEE 68


>gnl|CDD|218660 pfam05620, DUF788, Protein of unknown function (DUF788).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 166

 Score = 29.6 bits (67), Expect = 4.5
 Identities = 6/26 (23%), Positives = 14/26 (53%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKK 740
               +  E  + K K+++K +K+ +K
Sbjct: 141 LPSSQGAETNETKSKRQEKLEKRGEK 166


>gnl|CDD|237619 PRK14135, recX, recombination regulator RecX; Provisional.
          Length = 263

 Score = 30.2 bits (69), Expect = 4.5
 Identities = 9/35 (25%), Positives = 18/35 (51%)

Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQM 865
           +K  KK +K   K  K+K  + ++ KG+   + + 
Sbjct: 163 EKLLKKYQKLPFKALKQKIIQSLLTKGFSYEVIKA 197


>gnl|CDD|178262 PLN02656, PLN02656, tyrosine transaminase.
          Length = 409

 Score = 30.3 bits (68), Expect = 4.6
 Identities = 17/73 (23%), Positives = 36/73 (49%), Gaps = 6/73 (8%)

Query: 603 WQISALTKGKWKLVKVVKVMRYQVDLTGGPD---QVYLSGL---SDREWLALAMRKLRDA 656
           W ++    G ++  K+V+ ++   D+ GGP    Q  +  +   +D  +    +  L+ +
Sbjct: 252 WFVTTDPSGSFRDPKIVERIKKYFDILGGPATFIQAAVPTILEQTDESFFKKTINILKQS 311

Query: 657 ASIQCGPVKEVPC 669
           + I C  +KE+PC
Sbjct: 312 SDICCDRIKEIPC 324


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 29.7 bits (66), Expect = 4.7
 Identities = 12/47 (25%), Positives = 34/47 (72%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
           ++EEE  ++K++K K+K ++++K++++++++    +EE   + R+ A
Sbjct: 77  RQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEA 123


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 30.0 bits (68), Expect = 4.7
 Identities = 20/74 (27%), Positives = 31/74 (41%), Gaps = 12/74 (16%)

Query: 700 NHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
            HY  ++G         K+ ++E++ KK KKK +    K + +KK  K      E  +  
Sbjct: 115 KHYGIDLG-------KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALLEQFQSG 167

Query: 760 RDAASI-----QCG 768
           R  A I     Q G
Sbjct: 168 RLLARISSRPGQSG 181


>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein.  The PcfJ-like protein family
           includes the E. faecalis PcfJ protein, which is
           functionally uncharacterized. It is found in bacteria
           and viruses, and is typically between 159 and 170 amino
           acids in length. There is a conserved HCV sequence
           motif.
          Length = 169

 Score = 29.6 bits (67), Expect = 4.7
 Identities = 8/38 (21%), Positives = 24/38 (63%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
             + ++EK+K +K +K+  + +K  ++ K K ++++ +
Sbjct: 37  KYQAKKEKEKAEKMRKRYPELEKIFRELKSKFEFTDGD 74


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 30.4 bits (69), Expect = 4.8
 Identities = 9/35 (25%), Positives = 21/35 (60%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
                 +KKK++KK+++K+ K+ ++    +E+E  
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDD 210


>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
           This family consists of several eukaryotic ribosome
           biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
           protein that is essential for the maturation of 25 S
           rRNA and the 60 S ribosomal subunit assembly in
           Saccharomyces cerevisiae.
          Length = 164

 Score = 29.6 bits (67), Expect = 4.8
 Identities = 11/37 (29%), Positives = 22/37 (59%)

Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           I     ++ +E    KK+++KK++  K +K++ K KK
Sbjct: 128 IEVKPNDDPKEDPFAKKRREKKERVAKNEKRELKNKK 164


>gnl|CDD|223945 COG1013, PorB, Pyruvate:ferredoxin oxidoreductase and related
           2-oxoacid:ferredoxin oxidoreductases, beta subunit
           [Energy production and conversion].
          Length = 294

 Score = 30.0 bits (68), Expect = 4.8
 Identities = 15/77 (19%), Positives = 29/77 (37%)

Query: 684 PCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKK 743
           PC   N  +  +         E G +    Y   + EEEK K          ++  + ++
Sbjct: 204 PCPTGNGRNTMKTIEEAKLAVETGYWPLYRYEPGKAEEEKGKDIPIGIFYPVEEYLEYEE 263

Query: 744 KKKKKYSNEEEGMRKLR 760
           + K    +  E ++KL+
Sbjct: 264 RFKHLTKSNPELIQKLQ 280


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 30.7 bits (70), Expect = 4.8
 Identities = 11/34 (32%), Positives = 18/34 (52%), Gaps = 1/34 (2%)

Query: 818 KQRFNQ-IAYLDKEKKKKKKKKKKKKKKKKKKKK 850
           K RF    A L++EK  ++ + KK  + +  K K
Sbjct: 452 KARFEARQARLEREKAAREARHKKAAEARAAKDK 485


>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090).  This
           family consists of several bacterial proteins of unknown
           function and is known as YqjC in E. coli.
          Length = 115

 Score = 28.8 bits (65), Expect = 4.8
 Identities = 12/54 (22%), Positives = 27/54 (50%), Gaps = 11/54 (20%)

Query: 717 KEEEEEKKKKKKKK--------KKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           + E + K ++K+++        K+ + K    K +KK++K     E   +L++A
Sbjct: 65  RAERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKL---AEAQAELQEA 115


>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7.  TMA7 plays
           a role in protein translation. Deletions of the TMA7
           gene results in altered protein synthesis rates.
          Length = 63

 Score = 27.5 bits (61), Expect = 5.0
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
           KE +EE    K+K+K++ K  K+   K K K
Sbjct: 19  KELDEEDLAFKQKQKEEAKALKELAAKAKGK 49


>gnl|CDD|235620 PRK05841, flgE, flagellar hook protein FlgE; Validated.
          Length = 603

 Score = 30.6 bits (69), Expect = 5.1
 Identities = 42/186 (22%), Positives = 74/186 (39%), Gaps = 43/186 (23%)

Query: 479 THEYNIPRLENSINGNGTSENRSNDNSYQ----NEIDGIDVWSVL----------SRNEP 524
           TH++ +  L N+I    T   RSN   ++    + +D ++  SV+          S N  
Sbjct: 14  THQFGLDSLSNNIANVNTLGYRSNIPEFKTLFSSHLDALNAKSVVANDRNYGVTGSSNAI 73

Query: 525 SKRNTILHNIDDEWQISALTRGKWKLVKEN-----SINGNGTSENRSN----DNSYQNEI 575
           S ++      D E+ ++   +G W +V  N     +IN +G S+ + N    D S+  + 
Sbjct: 74  SNKDGEYKPSDGEFHMAYQGKG-WFVVGPNKNGSMTINKDGFSKKQDNYFTRDGSFARDA 132

Query: 576 DG------------IDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMR 623
           DG            ID+  +       K  T+     DE  I  L       +++ + + 
Sbjct: 133 DGYLVNPEGYYVYGIDLGKI-------KDGTLNSTARDEADIEKLHGNTLSPLQIPQDLT 185

Query: 624 YQVDLT 629
           YQ  LT
Sbjct: 186 YQPVLT 191


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 30.4 bits (68), Expect = 5.2
 Identities = 15/47 (31%), Positives = 18/47 (38%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
           E E E        KKK K+  KK  K +KK   N  +      DA  
Sbjct: 273 EIEAEALASATAVKKKAKEVMKKALKMEKKAIKNAAKDADYFGDADK 319



 Score = 30.0 bits (67), Expect = 6.3
 Identities = 12/32 (37%), Positives = 15/32 (46%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           K E E +        KKK K+  KK  K +KK
Sbjct: 271 KAEIEAEALASATAVKKKAKEVMKKALKMEKK 302


>gnl|CDD|241400 cd13246, PH_Scd1, Shape and Conjugation Deficiency 1 Pleckstrin
           homology (PH) domain.  Fission yeast Scd1 is an exchange
           factor for Cdc42 and an effector of Ras1, the homolog of
           the human H-Ras. Scd2/Bem1 mediates Cdc42 activation by
           binding to Scd1/Cdc24 and to Cdc42. Ras1 regulates
           Scd1/Cdc24/Ral1, which is a putative guanine nucleotide
           exchange factor for Cdc42, a member of the Rho family of
           Ras-like proteins. Cdc42 then activates the Shk1/Orb2
           protein kinase. Scd1 interacts with Klp5 and Klp6
           kinesins to mediate cytokinesis. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 148

 Score = 29.2 bits (66), Expect = 5.2
 Identities = 15/25 (60%), Positives = 15/25 (60%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKK 745
           E KKKKKK   K K     KKKKKK
Sbjct: 62  EVKKKKKKSSLKSKSSSSSKKKKKK 86



 Score = 28.4 bits (64), Expect = 8.4
 Identities = 15/29 (51%), Positives = 15/29 (51%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
           E KKKKKK   K K     KKKKKK    
Sbjct: 62  EVKKKKKKSSLKSKSSSSSKKKKKKGPLQ 90


>gnl|CDD|227818 COG5531, COG5531, SWIB-domain-containing proteins implicated in
           chromatin remodeling [Chromatin structure and dynamics].
          Length = 237

 Score = 29.7 bits (67), Expect = 5.3
 Identities = 10/43 (23%), Positives = 18/43 (41%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           +EE+      K+ +    + K+  KKKK    S       K++
Sbjct: 84  KEEDSSLPSSKETENGDTEGKETDKKKKSSTISKNSPSGEKVK 126



 Score = 29.3 bits (66), Expect = 8.0
 Identities = 11/39 (28%), Positives = 16/39 (41%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
           KEE+      K+ +    + K+  KKKK      N   G
Sbjct: 84  KEEDSSLPSSKETENGDTEGKETDKKKKSSTISKNSPSG 122


>gnl|CDD|216985 pfam02349, MSG, Major surface glycoprotein.  This is a novel repeat
           in Pneumocystis carinii Major surface glycoprotein (MSG)
           some members of the alignment have up to nine repeats of
           this family, the repeats containing several conserved
           cysteines. The MSG of P. carinii is an important protein
           in host-pathogen interactions. Surface glycoprotein A
           from Pneumocystis carinii is a main target for the host
           immune system, this protein is implicated in the
           attachment of Pneumocystis carinii to the host alveolar
           epithelial cells, alveolar macrophages, host surfactant
           and possibly accounts in part for the hypoxia seen in
           Pneumocystis carinii pneumonia (PCP).
          Length = 81

 Score = 27.8 bits (62), Expect = 5.3
 Identities = 19/35 (54%), Positives = 22/35 (62%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
           EE   K+ K + KKKKK KK  K+K  K KK SNE
Sbjct: 47  EELLLKELKGELKKKKKCKKALKEKCTKLKKESNE 81



 Score = 27.4 bits (61), Expect = 7.9
 Identities = 14/47 (29%), Positives = 25/47 (53%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
           A P  +E+ EK K+K  K KK  K ++   K+ K +   +++  + L
Sbjct: 22  ACPLLKEKCEKLKEKCYKLKKDLKLEELLLKELKGELKKKKKCKKAL 68


>gnl|CDD|220714 pfam10357, Kin17_mid, Domain of Kin17 curved DNA-binding protein.
           Kin17_mid is the conserved central 169 residue region of
           a family of Kin17 proteins. Towards the N-terminal end
           there is a zinc-finger domain, and in human and mouse
           members there is a RecA-like domain further downstream.
           The Kin17 protein in humans forms intra-nuclear foci
           during cell proliferation and is re-distributed in the
           nucleoplasm during the cell cycle.
          Length = 127

 Score = 28.8 bits (65), Expect = 5.4
 Identities = 11/41 (26%), Positives = 25/41 (60%), Gaps = 2/41 (4%)

Query: 816 DPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
             K  F  I Y+D+  +  K++++ +KK+K++K  ++  +K
Sbjct: 82  TEKGWF--ITYIDRSPEALKRQEELRKKEKQEKTDEEREQK 120



 Score = 28.4 bits (64), Expect = 8.7
 Identities = 14/47 (29%), Positives = 30/47 (63%), Gaps = 2/47 (4%)

Query: 702 YTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
             TE G F  I Y D+  E  K++++ +KK+K++K  +++++K  ++
Sbjct: 80  EETEKGWF--ITYIDRSPEALKRQEELRKKEKQEKTDEEREQKLLEE 124


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 29.9 bits (67), Expect = 5.5
 Identities = 32/154 (20%), Positives = 65/154 (42%), Gaps = 23/154 (14%)

Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQC-- 767
             I   ++E  +EK  +++ K+K +++K++  K  +++     E   R     A+I    
Sbjct: 48  ITIQEREREAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVD 107

Query: 768 --GPVKEVPCEPQIAPCLFDIKNDPCEKNNLA-DRSEVQRINHYTTEVGYLDPKQRFNQI 824
                +E   E      L  IK D  E+  +  +++E++++ + T E             
Sbjct: 108 TDDENEEEEYEAWKLRELKRIKRDREEREEMEREKAEIEKMRNMTEE------------- 154

Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
                E++ + +K  K    K KK K K ++K Y
Sbjct: 155 -----ERRAELRKNPKVSTNKAKKGKYKFLQKYY 183


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 30.3 bits (68), Expect = 5.7
 Identities = 12/51 (23%), Positives = 21/51 (41%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
              ++EE +++K K K     K K     K+K  +     EE   + +  A
Sbjct: 153 VTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKA 203



 Score = 29.6 bits (66), Expect = 9.6
 Identities = 13/50 (26%), Positives = 22/50 (44%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
            ++EEE +K+K K K     K K     K+K  +     E + +   A +
Sbjct: 154 TEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKA 203


>gnl|CDD|218845 pfam05991, NYN_YacP, YacP-like NYN domain.  This family consists of
           bacterial proteins related to YacP. This family is
           uncharacterized functionally, but it has been suggested
           that these proteins are nucleases due to them containing
           a NYN domain. NYN (for N4BP1, YacP-like Nuclease)
           domains were discovered by Anantharaman and Aravind.
           Based on gene neighborhoods it was suggested that the
           bacterial YacP proteins interact with the Ribonuclease
           III and TrmH methylase in a processome complex that
           catalyzes the maturation of rRNA and tRNA.
          Length = 165

 Score = 29.1 bits (66), Expect = 5.7
 Identities = 16/45 (35%), Positives = 26/45 (57%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
           +E  EE K+ +KK +KK +K++K +KK   +      E + KLR 
Sbjct: 121 RELAEEVKRAEKKIRKKAEKRRKSRKKYLDRLSDEVLEKLEKLRR 165



 Score = 29.1 bits (66), Expect = 5.7
 Identities = 13/35 (37%), Positives = 25/35 (71%)

Query: 833 KKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
           K+ +KK +KK +K++K +KK + +   +VL ++EK
Sbjct: 128 KRAEKKIRKKAEKRRKSRKKYLDRLSDEVLEKLEK 162


>gnl|CDD|153341 cd07657, F-BAR_Fes_Fer, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Fes (feline sarcoma) and
           Fer (Fes related) tyrosine kinases.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Fes (feline sarcoma), also called
           Fps (Fujinami poultry sarcoma), and Fer (Fes related)
           are cytoplasmic (or nonreceptor) tyrosine kinases that
           play roles in haematopoiesis, inflammation and immunity,
           growth factor signaling, cytoskeletal regulation, cell
           migration and adhesion, and the regulation of cell-cell
           interactions. Although Fes and Fer show redundancy in
           their biological functions, they show differences in
           their expression patterns. Fer is ubiquitously expressed
           while Fes is expressed predominantly in myeloid and
           endothelial cells. Fes and Fer contain an N-terminal
           F-BAR domain, an SH2 domain, and a C-terminal catalytic
           kinase domain. F-BAR domains form banana-shaped dimers
           with a positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules. The
           F-BAR domain of Fes is critical in its role in
           microtubule nucleation and bundling.
          Length = 237

 Score = 29.7 bits (67), Expect = 5.8
 Identities = 13/54 (24%), Positives = 24/54 (44%), Gaps = 1/54 (1%)

Query: 696 DQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
           D++    T EV +  +  Y    E+ +  K K ++   K  +  +K  K + KY
Sbjct: 118 DEQYKKLTDEVEKL-KSEYQKLLEDYKAAKSKFEEAVVKGGRGGRKLDKARDKY 170


>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are typically between 137 to 196
           amino acids in length.
          Length = 149

 Score = 29.2 bits (66), Expect = 5.8
 Identities = 11/37 (29%), Positives = 16/37 (43%)

Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
           E ++ +KK  K  +K    K K K+KK K        
Sbjct: 11  EPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASES 47


>gnl|CDD|217754 pfam03836, RasGAP_C, RasGAP C-terminus. 
          Length = 139

 Score = 28.8 bits (65), Expect = 5.8
 Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 6/41 (14%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKK---KYSN---EEEGM 756
           +KKKK  K+K   KK +KK  KK K    KYS     E+G+
Sbjct: 89  QKKKKFSKQKFSVKKLQKKHGKKPKFGSYKYSAKQLYEKGV 129


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 29.9 bits (67), Expect = 5.8
 Identities = 15/40 (37%), Positives = 17/40 (42%)

Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
            K KK  K    K  KK  K KK K +K     V  + EK
Sbjct: 30  AKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEK 69



 Score = 29.5 bits (66), Expect = 8.4
 Identities = 13/42 (30%), Positives = 17/42 (40%)

Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
               K ++  K    K  KK  K KK K  K   KK + + E
Sbjct: 27  KETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFE 68



 Score = 29.5 bits (66), Expect = 8.7
 Identities = 15/59 (25%), Positives = 22/59 (37%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEV 773
               +E  K KK  K    K  KK  K KK K   +  ++   K     S++   V + 
Sbjct: 23  ATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKK 81



 Score = 29.1 bits (65), Expect = 9.7
 Identities = 11/50 (22%), Positives = 19/50 (38%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
               ++  K KK K  K   KK   K +K +  K  +  +   K    ++
Sbjct: 41  TKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSA 90


>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD.  This
           family consists of several putative bacterial stage IV
           sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
           is known to be essential for efficient sporulation
           although its exact function is unknown.
          Length = 383

 Score = 30.0 bits (68), Expect = 5.9
 Identities = 19/45 (42%), Positives = 24/45 (53%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
            KEE  EK KK  KKK +K+  K  K K +K  +   E G  +LR
Sbjct: 329 TKEEAVEKGKKLAKKKLEKEIDKDAKIKDEKVLHERVENGKVRLR 373


>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
           topoisomerase IA [General function prediction only].
          Length = 298

 Score = 30.1 bits (68), Expect = 6.0
 Identities = 11/33 (33%), Positives = 14/33 (42%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           ++  +    KK  KK    K KK   KK   KK
Sbjct: 254 ERRAKGGPGKKPAKKATAAKAKKTTAKKAAAKK 286


>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease.  The potyviridae
           family positive stand RNA viruses with genome encoding a
           polyprotein. members include zucchini yellow mosaic
           virus, and turnip mosaic viruses which cause
           considerable losses of crops worldwide. This family
           consists of a C terminus region from various plant
           potyvirus P1 proteins (found at the N terminus of the
           polyprotein). The C terminus of P1 is a serine-type
           protease responsible for autocatalytic cleavage between
           P1 and the helper component protease pfam00851. The
           entire P1 protein may be involved in virus-host
           interactions.
          Length = 245

 Score = 29.6 bits (67), Expect = 6.0
 Identities = 18/56 (32%), Positives = 24/56 (42%), Gaps = 2/56 (3%)

Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPV 770
             K E    +    K+  KK KKKKKKKK       N    +R++   A  +  PV
Sbjct: 74  VSKTESVSFRTPYYKRTTKKMKKKKKKKKVVMSDKINYL--IRQVLKIAKKKGKPV 127


>gnl|CDD|178645 PLN03097, FHY3, Protein FAR-RED ELONGATED HYPOCOTYL 3; Provisional.
          Length = 846

 Score = 30.3 bits (68), Expect = 6.0
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query: 834 KKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
           K  KKK   KK+K   ++++      D L QM+K
Sbjct: 711 KTNKKKNPTKKRKVNSEQEVTTVAAQDSLQQMDK 744


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 30.4 bits (68), Expect = 6.0
 Identities = 45/216 (20%), Positives = 89/216 (41%), Gaps = 30/216 (13%)

Query: 381  NKSDIPNYVNS--TVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYE 438
            N +D+ N+ +   T+EN + + +  +   +N  HE  S +I    T+Y N  H +  +  
Sbjct: 2469 NANDVKNHFSKDLTIENELIQIQKRLEDIKNAAHEIRSEQI----TKYTNAIHNHIEEQF 2524

Query: 439  NRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYE-----NGTHEYN--IPRLENSI 491
             + EN +++      N  +N   +     N N  P  +     +  +E+   IP ++N I
Sbjct: 2525 KKIENNSNKDEVYKINEIDNIIEKI---INYNKEPEVKLHAIIDNKNEFASIIPDIKNLI 2581

Query: 492  NGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQ-ISALTRGKWKL 550
                +    +N+ SY+  I   +  + +  +    +N + H I    + I  L   K  +
Sbjct: 2582 ALIESEYGNNNNISYKVAIKHEEDANNIILDLNKSQNILNHLIHKNKKIIEDLGNIKHGI 2641

Query: 551  VKENSING-------------NGTSENRSNDNSYQN 573
               N+++              N TS N ++D  Y+N
Sbjct: 2642 HNNNNLHTINKHEEISKIKHPNNTSHNNNDDAKYKN 2677


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 29.6 bits (67), Expect = 6.1
 Identities = 35/155 (22%), Positives = 68/155 (43%), Gaps = 16/155 (10%)

Query: 717 KEEEEEKKKKKKK--KKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
           KEE EE K+K+ +  K+  +  +K+ +   K ++   + + +R+ RD  + +   +KE  
Sbjct: 19  KEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEEVQELKEKR 78

Query: 775 CEP-QIAPCLFDIKNDPCEKNNLAD---------RSEVQRINHYTTEVGYLDP---KQRF 821
            E       L     +  EK N  +           E++R+     +   L P   ++  
Sbjct: 79  DEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLE-KKQQTSVLTPEEERELV 137

Query: 822 NQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
            +I  L KE +  KK  ++ +K K+ K +   +KK
Sbjct: 138 QKIKELRKELEDAKKALEENEKLKELKAEIDELKK 172


>gnl|CDD|225208 COG2327, WcaK, Polysaccharide pyruvyl transferase family protein
           [Cell wall/membrane/envelope biogenesis].
          Length = 385

 Score = 30.1 bits (68), Expect = 6.2
 Identities = 8/22 (36%), Positives = 13/22 (59%)

Query: 195 SAEEMKMWGLDMRRDLEPAWDL 216
           S E +K  G++ R   +PA+ L
Sbjct: 163 SYELLKQLGINARLVTDPAFLL 184


>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV. 
          Length = 336

 Score = 29.8 bits (67), Expect = 6.2
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 727 KKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
           +  K + K +K K+ KK+KK++  +E +  
Sbjct: 22  RPVKDEAKPRKIKRVKKRKKREEKDELDDE 51


>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2).  This family
           represents a conserved region approximately 60 residues
           long within the eukaryotic targeting protein for Xklp2
           (TPX2). Xklp2 is a kinesin-like protein localised on
           centrosomes throughout the cell cycle and on spindle
           pole microtubules during metaphase. In Xenopus, it has
           been shown that Xklp2 protein is required for centrosome
           separation and maintenance of spindle bi-polarity. TPX2
           is a microtubule-associated protein that mediates the
           binding of the C-terminal domain of Xklp2 to
           microtubules. It is phosphorylated during mitosis in a
           microtubule-dependent way.
          Length = 57

 Score = 27.0 bits (60), Expect = 6.5
 Identities = 11/40 (27%), Positives = 27/40 (67%)

Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           E++ +  KK ++K+K  + +K++ + ++   EEE +++LR
Sbjct: 8   EKRAEFDKKLEEKEKALEAEKEEAEARQKEEEEEAIKQLR 47


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 29.3 bits (66), Expect = 6.7
 Identities = 9/39 (23%), Positives = 23/39 (58%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
            +E +EE+K+    + K+ K   +K+ ++ +++   E+E
Sbjct: 62  IEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDE 100


>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
           module that binds membranes and detects membrane
           curvature.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions including organelle
           biogenesis, membrane trafficking or remodeling, and cell
           division and migration. Mutations in BAR containing
           proteins have been linked to diseases and their
           inactivation in cells leads to altered membrane
           dynamics. A BAR domain with an additional N-terminal
           amphipathic helix (an N-BAR) can drive membrane
           curvature. These N-BAR domains are found in amphiphysins
           and endophilins, among others. BAR domains are also
           frequently found alongside domains that determine lipid
           specificity, such as the Pleckstrin Homology (PH) and
           Phox Homology (PX) domains which are present in beta
           centaurins (ACAPs and ASAPs) and sorting nexins,
           respectively. A FES-CIP4 Homology (FCH) domain together
           with a coiled coil region is called the F-BAR domain and
           is present in Pombe/Cdc15 homology (PCH) family
           proteins, which include Fes/Fes tyrosine kinases, PACSIN
           or syndapin, CIP4-like proteins, and srGAPs, among
           others. The Inverse (I)-BAR or IRSp53/MIM homology
           Domain (IMD) is found in multi-domain proteins, such as
           IRSp53 and MIM, that act as scaffolding proteins and
           transducers of a variety of signaling pathways that link
           membrane dynamics and the underlying actin cytoskeleton.
           BAR domains form dimers that bind to membranes, induce
           membrane bending and curvature, and may also be involved
           in protein-protein interactions. The I-BAR domain
           induces membrane protrusions in the opposite direction
           compared to classical BAR and F-BAR domains, which
           produce membrane invaginations. BAR domains that also
           serve as protein interaction domains include those of
           arfaptin and OPHN1-like proteins, among others, which
           bind to Rac and Rho GAP domains, respectively.
          Length = 194

 Score = 29.3 bits (66), Expect = 6.7
 Identities = 15/51 (29%), Positives = 30/51 (58%), Gaps = 5/51 (9%)

Query: 716 DKEEEEEKKKKKKKKKK-----KKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
           D +E ++++KK  K +      ++K KK +KKKK   K +  EE +++ ++
Sbjct: 92  DLKEIKKRRKKLDKARLDYDAAREKLKKLRKKKKDSSKLAEAEEELQEAKE 142


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 29.6 bits (67), Expect = 6.8
 Identities = 12/44 (27%), Positives = 32/44 (72%)

Query: 705 EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           E  R  ++A   + EE E++++++K+++KK+ K++K+++K+  +
Sbjct: 165 EQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAE 208



 Score = 29.3 bits (66), Expect = 8.6
 Identities = 10/34 (29%), Positives = 27/34 (79%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
           EE E +++++K+++KK+ K++K+++K+  +K + 
Sbjct: 179 EEAERRRREEKERRKKQDKERKQREKETAEKIAA 212


>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
          Length = 763

 Score = 29.9 bits (67), Expect = 7.0
 Identities = 14/26 (53%), Positives = 14/26 (53%)

Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKK 737
           I  P  E E E  K  KK KK KKKK
Sbjct: 233 IEKPQAEAENEAGKSDKKDKKSKKKK 258


>gnl|CDD|214444 MTH00165, ND5, NADH dehydrogenase subunit 5; Provisional.
          Length = 573

 Score = 29.7 bits (68), Expect = 7.2
 Identities = 16/36 (44%), Positives = 20/36 (55%), Gaps = 8/36 (22%)

Query: 37  FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGW 72
           F  L L F LSM+   L+ S   P++I IL   LGW
Sbjct: 87  FIYLVLLFVLSMML--LIIS---PNLISIL---LGW 114


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 28.3 bits (64), Expect = 7.3
 Identities = 12/44 (27%), Positives = 29/44 (65%)

Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
           +E E K+++ +KK +++KK +K+K+++ K+   E E ++   + 
Sbjct: 56  KENEAKRRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEK 99


>gnl|CDD|236413 PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated.
          Length = 367

 Score = 29.6 bits (67), Expect = 7.3
 Identities = 12/37 (32%), Positives = 19/37 (51%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
           KK KKK+    ++  K+  +K KK+     +E   KL
Sbjct: 1   KKTKKKETTTLEQVVKELIEKGKKRGTLTYDEIAEKL 37


>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease.  The Slx4 protein is a
           heteromeric structure-specific endonuclease found in
           fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
           substrates, particularly simple-Y, 5'-flap, or
           replication fork structures by cleaving the strand
           bearing the 5' non-homologous arm at the branch junction
           and thus generating ligatable nicked products from
           5'-flap or replication fork substrates.
          Length = 627

 Score = 29.9 bits (67), Expect = 7.3
 Identities = 14/30 (46%), Positives = 18/30 (60%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
           K+  KKKK KK K +K+ K+K KK K    
Sbjct: 69  KRVPKKKKIKKPKLRKRTKRKNKKIKSLTA 98


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 30.0 bits (66), Expect = 7.5
 Identities = 36/144 (25%), Positives = 63/144 (43%), Gaps = 15/144 (10%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN-----EEEGMRKLRDAASIQCGPV 770
           +K+E + KK + +KKK+K  K +K +K +K KK S      EE  ++K+ +  S    P 
Sbjct: 134 EKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKKASTPAPVEEEIVVKKVANDRSAAPAPE 193

Query: 771 KEVPCEPQIAPC--LFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLD 828
            + P      P   + +I     +KN     SE         +V          Q   + 
Sbjct: 194 PKTPTNTPAEPAEQVQEITGKKNKKNKKKSESEATAAPASVEQV--------VEQPKVVT 245

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
           +E  ++   ++KK KK K+K + +
Sbjct: 246 EEPHQQAAPQEKKNKKNKRKSESE 269


>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
           structure and dynamics].
          Length = 395

 Score = 29.7 bits (67), Expect = 7.6
 Identities = 18/56 (32%), Positives = 30/56 (53%), Gaps = 3/56 (5%)

Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY--SNEEEGMRKLRDAAS 764
            I+ P ++++ EK KK+KK K   +     +  + +  Y  SNE E +R+LR   S
Sbjct: 62  AISIPKRKKQTEKGKKEKKPKVSDRMDLDNENVQLEMLYSISNERE-IRQLRFGGS 116


>gnl|CDD|184389 PRK13912, PRK13912, nuclease NucT; Provisional.
          Length = 177

 Score = 29.0 bits (65), Expect = 7.6
 Identities = 11/38 (28%), Positives = 12/38 (31%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMK 855
               + I YLDK    K    K  K K  K       K
Sbjct: 86  NNDQSTIGYLDKYPNIKVCLLKGLKAKNGKYYGIMHQK 123


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 28.8 bits (65), Expect = 7.6
 Identities = 9/47 (19%), Positives = 32/47 (68%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
           +++E+E +K+++K +K+     ++ +K K+++   +++ +++ + AA
Sbjct: 52  QKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAA 98


>gnl|CDD|184543 PRK14157, PRK14157, heat shock protein GrpE; Provisional.
          Length = 227

 Score = 29.2 bits (65), Expect = 7.6
 Identities = 16/30 (53%), Positives = 18/30 (60%), Gaps = 5/30 (16%)

Query: 274 EDFKRSKFAAILHKLD-----ESVGKVVEA 298
           EDF  +K  AILHK D     E+V  VVEA
Sbjct: 177 EDFDPTKHDAILHKPDPDAEKETVDTVVEA 206


>gnl|CDD|180505 PRK06278, PRK06278, cobyrinic acid a,c-diamide synthase; Validated.
          Length = 476

 Score = 29.6 bits (67), Expect = 7.9
 Identities = 14/40 (35%), Positives = 23/40 (57%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
           KE+E+E+  KK K  K+K K +       KK+  N+ +G+
Sbjct: 202 KEDEKEEIFKKNKILKEKLKSRSGISTNNKKEERNKPKGI 241


>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain
           pfam08565. Expression of a construct consisting of only
           the N-terminal domain of Saccharomyces pombe Cdc37
           results in cellular viability. This indicates that
           interactions with the cochaperone Hsp90 may not be
           essential for Cdc37 function.
          Length = 172

 Score = 29.0 bits (65), Expect = 8.0
 Identities = 30/140 (21%), Positives = 60/140 (42%), Gaps = 27/140 (19%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
           K+E EE +K++    +  ++  K   + K++   + +  M+ L +  + +     E P  
Sbjct: 45  KQEIEELEKERIMNDRLLERVDKLLSELKEESLDSSQAVMKSLNENFTDKENVEPEQPTY 104

Query: 777 PQIAPCLFD-IKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKK-K 834
            ++   LFD +K++  EKN  A   E+Q                         K + K K
Sbjct: 105 NEMVEDLFDQVKDEVDEKNGAALIEELQ-------------------------KHRDKLK 139

Query: 835 KKKKKKKKKKKKKKKKKKMM 854
           K++K+  KK  + +K++K  
Sbjct: 140 KEQKELLKKLDELEKEEKKK 159


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 27.4 bits (61), Expect = 8.1
 Identities = 5/31 (16%), Positives = 8/31 (25%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           EE E+    +       K  K        + 
Sbjct: 48  EEPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78



 Score = 27.4 bits (61), Expect = 9.2
 Identities = 6/35 (17%), Positives = 9/35 (25%)

Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
           E EE +     +       K  K          +E
Sbjct: 46  EGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80


>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
           insulinase-like [General function prediction only].
          Length = 978

 Score = 30.0 bits (68), Expect = 8.2
 Identities = 22/121 (18%), Positives = 44/121 (36%), Gaps = 20/121 (16%)

Query: 700 NHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK-KKKYSNEEEGMRK 758
            HY T       I  P  E EE+ +K++++  +K+  +   +  +K  K     +E   +
Sbjct: 448 PHYVT------VIVLPSPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQ 501

Query: 759 LRDAASIQCGP---VKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYL 815
                 +   P   + +VP           I+    E       ++V   + +T  + YL
Sbjct: 502 PDSEEDLATLPTLKLGDVPD---------PIEKTSLETEVSN-EAKVLHHDLFTNGITYL 551

Query: 816 D 816
            
Sbjct: 552 R 552


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 29.8 bits (67), Expect = 8.4
 Identities = 14/59 (23%), Positives = 27/59 (45%), Gaps = 4/59 (6%)

Query: 696 DQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           D+       E    N     DKE E  +K+  +  K+ +K+ +   +  +KK+ +N+ E
Sbjct: 97  DEIAKLEALE--LLNL--EKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNE 151


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
           protein 2.  THO and TREX form a eukaryotic complex which
           functions in messenger ribonucleoprotein metabolism and
           plays a role in preventing the transcription-associated
           genetic instability. Tho2, along with four other
           subunits forms THO.
          Length = 296

 Score = 29.2 bits (66), Expect = 8.5
 Identities = 11/37 (29%), Positives = 20/37 (54%)

Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           I  P +  ++E ++ +K+ K+         KKKK+KK
Sbjct: 22  IYVPKELYDKEIERLEKQIKELDSSSSGIDKKKKEKK 58


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 30.0 bits (68), Expect = 8.6
 Identities = 11/36 (30%), Positives = 18/36 (50%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
           +KKK KK  +K+ +  +    KK+K+     E  R 
Sbjct: 85  EKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120



 Score = 29.6 bits (67), Expect = 9.6
 Identities = 11/33 (33%), Positives = 19/33 (57%)

Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
           EKKK KK  +K+ +  +    KK+K+  +  E+
Sbjct: 85  EKKKYKKDVEKRNRSYRLYLDKKRKELQALSEK 117


>gnl|CDD|225651 COG3109, ProQ, Activator of osmoprotectant transporter ProP [Signal
           transduction mechanisms].
          Length = 208

 Score = 29.1 bits (65), Expect = 8.7
 Identities = 7/64 (10%), Positives = 23/64 (35%), Gaps = 6/64 (9%)

Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKK----KKKKKKKKKKKKKYSNEEEGMRKLR--DAAS 764
           Q+A      + ++ +++ KK+++      K   ++     +               D ++
Sbjct: 102 QLAEAKARVQAQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPREEVHTPVSDISA 161

Query: 765 IQCG 768
           +  G
Sbjct: 162 LTVG 165


>gnl|CDD|115185 pfam06513, DUF1103, Repeat of unknown function (DUF1103).  This
           family consists of several repeats of around 30 residues
           in length which are found specifically in
           mature-parasite-infected erythrocyte surface antigen
           proteins from Plasmodium falciparum. This family often
           found in conjunction with pfam00226.
          Length = 215

 Score = 29.1 bits (64), Expect = 8.7
 Identities = 19/45 (42%), Positives = 28/45 (62%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           DK    EK +K+ K+K +K+ K+K KKK KK    N+ EG  K++
Sbjct: 20  DKVIGSEKTQKEIKEKVEKRVKEKCKKKVKKGIKENDTEGNDKVK 64


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 29.4 bits (66), Expect = 8.8
 Identities = 10/50 (20%), Positives = 23/50 (46%)

Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
           + A   K    EK + K +KK+  +K++  +++  ++        M + R
Sbjct: 328 ENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARAR 377


>gnl|CDD|217372 pfam03105, SPX, SPX domain.  We have named this region the SPX
           domain after (SYG1, Pho81 and XPR1). This 180 residue
           length domain is found at the amino terminus of a
           variety of proteins. In the yeast protein SYG1, the
           N-terminus directly binds to the G- protein beta subunit
           and inhibits transduction of the mating pheromone
           signal. This finding suggests that all the members of
           this family are involved in G-protein associated signal
           transduction. The N-termini of several proteins involved
           in the regulation of phosphate transport, including the
           putative phosphate level sensors PHO81 from
           Saccharomyces cerevisiae and NUC-2 from Neurospora
           crassa, are also members of this family. The SPX domain
           of S. cerevisiae low-affinity phosphate transporters
           Pho87 and Pho90 auto-regulates uptake and prevents
           efflux. This SPX dependent inhibition is mediated by the
           physical interaction with Spl2 NUC-2 contains several
           ankyrin repeats pfam00023. Several members of this
           family are annotated as XPR1 proteins: the xenotropic
           and polytropic retrovirus receptor confers
           susceptibility to infection with murine leukaemia
           viruses (MLV). The similarity between SYG1, phosphate
           regulators and XPR1 sequences has been previously noted,
           as has the additional similarity to several predicted
           proteins, of unknown function, from Drosophila
           melanogaster, Arabidopsis thaliana, Caenorhabditis
           elegans, Schizosaccharomyces pombe, and Saccharomyces
           cerevisiae. In addition, given the similarities between
           XPR1 and SYG1 and phosphate regulatory proteins, it has
           been proposed that XPR1 might be involved in G-protein
           associated signal transduction and may itself function
           as a phosphate sensor.
          Length = 176

 Score = 28.9 bits (65), Expect = 9.0
 Identities = 14/60 (23%), Positives = 27/60 (45%), Gaps = 2/60 (3%)

Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQME--KELANINRTAVAPINKPFDK 886
           +E++ +    K     K   K  + +KK   ++  +++  K    +NRT    I K +DK
Sbjct: 102 EERRDETSSAKSSPSDKNLLKAFEELKKALLELYRELQLLKSYIELNRTGFRKILKKYDK 161


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 27.7 bits (62), Expect = 9.2
 Identities = 8/21 (38%), Positives = 18/21 (85%)

Query: 716 DKEEEEEKKKKKKKKKKKKKK 736
           D+EEE  +K+++++K+ +K+K
Sbjct: 86  DREEERLEKEEEREKRARKRK 106


>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 594

 Score = 29.6 bits (67), Expect = 9.3
 Identities = 21/135 (15%), Positives = 33/135 (24%), Gaps = 24/135 (17%)

Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP-------- 774
           +K   K K K   K        ++K      + ++K  D          ++         
Sbjct: 449 EKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDALKLQEILEKVDS 508

Query: 775 ----CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKE 830
                E         I         L D S+         E+           IA + K 
Sbjct: 509 KSEKLE--------KISAKIDNIKELFDESK----MSILQEILQPALHHIELNIARIYKL 556

Query: 831 KKKKKKKKKKKKKKK 845
             K K+    K   K
Sbjct: 557 NIKDKEDFLNKLYIK 571


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 28.9 bits (65), Expect = 9.3
 Identities = 10/28 (35%), Positives = 13/28 (46%)

Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKK 743
             EE +E      KK    +K + KKKK
Sbjct: 169 AAEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|239466 cd03368, Ribosomal_S12, S12-like family, 30S ribosomal protein S12
           subfamily; S12 is located at the interface of the large
           and small ribosomal subunits of prokaryotes,
           chloroplasts and mitochondria, where it plays an
           important role in both tRNA and ribosomal subunit
           interactions. S12 is essential for maintenance of a
           pretranslocation state and, together with S13, functions
           as a control element for the rRNA- and tRNA-driven
           movements of translocation. Antibiotics such as
           streptomycin bind S12 and cause the ribosome to misread
           the genetic code.
          Length = 108

 Score = 27.9 bits (63), Expect = 9.4
 Identities = 11/26 (42%), Positives = 14/26 (53%)

Query: 833 KKKKKKKKKKKKKKKKKKKKMMKKGY 858
            +K +KKKKKK K    +    KKG 
Sbjct: 6   IRKGRKKKKKKSKSPALEGCPQKKGV 31


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 29.3 bits (66), Expect = 9.6
 Identities = 18/87 (20%), Positives = 42/87 (48%), Gaps = 10/87 (11%)

Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKM--MKKGYPDVLSQMEKELANI-NR 874
           + + N+++    + K +KK K ++ KK+ K+ K+++  +      + ++++ +L +I N 
Sbjct: 50  QAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKLLSIPNI 109

Query: 875 TA-VAPI------NKPFDKGGDPKNFD 894
                P+      N    + G P  FD
Sbjct: 110 PHESVPVGKDEEDNLEVKRWGTPPVFD 136


>gnl|CDD|206363 pfam14195, DUF4316, Domain of unknown function (DUF4316).  This
           domain is functionally uncharacterized. This domain is
           found in bacteria, and is typically between 56 and 95
           amino acids in length.
          Length = 65

 Score = 26.7 bits (59), Expect = 9.9
 Identities = 13/42 (30%), Positives = 18/42 (42%)

Query: 707 GRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
           G  N          EEK     K K+ KK ++ +K  K+KK 
Sbjct: 21  GIINNTPEAPTVPPEEKPSILDKLKEDKKARRSRKPVKQKKH 62


>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
          Length = 815

 Score = 29.4 bits (66), Expect = 10.0
 Identities = 15/50 (30%), Positives = 23/50 (46%)

Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
           KE E E+  KK KK+KK+        K  +      EE  + + +  SI+
Sbjct: 29  KEHEGEQSPKKAKKEKKQDDSGNGNGKSAEDAVKEFEEFCKAIEEHLSIE 78


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.133    0.403 

Gapped
Lambda     K      H
   0.267   0.0685    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 47,158,537
Number of extensions: 4789846
Number of successful extensions: 34216
Number of sequences better than 10.0: 1
Number of HSP's gapped: 23710
Number of HSP's successfully gapped: 1950
Length of query: 905
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 799
Effective length of database: 6,236,078
Effective search space: 4982626322
Effective search space used: 4982626322
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 64 (28.5 bits)