RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5893
         (1182 letters)



>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score =  360 bits (925), Expect = e-106
 Identities = 168/525 (32%), Positives = 272/525 (51%), Gaps = 30/525 (5%)

Query: 478  DSIRDCFVTGKWKASEDASELLRLDDMDDDEELFGDFEDLETGEKHSGDKSGGGSGGVSG 537
              ++  F       S +  E L  DD        G+FEDLE  E  S ++     G    
Sbjct: 574  AQLKSRFQKDATLDSIEGEEELIQDDEK------GNFEDLEDEENSSDNEMEESRGSSVT 627

Query: 538  GGSGDDKPKT-----RAELMEKKRKLKEQFD-AEYDDKDGGGNTYYDDLKTQATRQAELN 591
              + +   +      R E   KK +L+  F+  E  D +     +Y + K +   Q ++N
Sbjct: 628  AENEESADEVDYETEREENARKKEELRGNFELEERGDPEKKDVDWYTEEKRKIEEQLKIN 687

Query: 592  RQQFHDLDDNARVELEGFRAGLYIRVELDGMPCELIENFDPTYPLIVGGLQPGEETIGCV 651
            R +F  +   +RV +EG+RAG Y+R+ L  +P E ++ F+  YP+++GGL P E+ +G V
Sbjct: 688  RSEFETMVPESRVVIEGYRAGRYVRIVLSHVPLEFVDEFNSRYPIVLGGLLPAEKEMGIV 747

Query: 652  RARVKKHRWYGKILKSGNPVIMSVGWRRFQTLPIYSKQEDNMRYRMLKYTPQHVACMAHF 711
            + R+K+HRW+ KILK+ +P+I SVGWRRFQ++P+YS ++   R RMLKYTP+H+ C   F
Sbjct: 748  QGRIKRHRWHKKILKTNDPLIFSVGWRRFQSIPVYSMKDSRTRNRMLKYTPEHMHCNVSF 807

Query: 712  WGPITRSGTGFLAVQDVAKREPGFRVIATGTILDANQTAEVTKKLKLTGVPMKIYKKTAF 771
            +GP+    TGF AVQ     +  FRV+A GTI D N  A++ KKLKL G P +I + T F
Sbjct: 808  YGPVVPPNTGFCAVQ---SEKGDFRVLALGTITDVNGDAKLVKKLKLVGYPKQIVQNTVF 864

Query: 772  IKDMFNSTLEVAKFEGAKIRTVSGIRGQIKKALNKPQGAFRATFEDKIMLSDIVFCRTWY 831
            ++DMF S LEV KFEGA ++ VSG+RGQ+K    K  G +RA FE K+++SDI+  R + 
Sbjct: 865  VRDMFTSDLEVLKFEGASLKAVSGLRGQVKGPHGK-NGEYRAVFEGKMLMSDIITLRCFV 923

Query: 832  KVDIPKLYNPVTSLLLPPEQKDSWTGMKTTGQLKRERGLHNDPQFDSMYTPIVRKPKTMT 891
             V++ +++ PV +LL        W G++   +++   GL +     +  +          
Sbjct: 924  PVEVHRIFIPVDNLLG------KWRGLRRLHEIRESLGLTHSYAPQNDSSSEEMGYGAEE 977

Query: 892  KLKIPKALQKELPYHMKPKYKSKETPKPQRVAVIHSEREQKVASLMKMLRTNYSEKNSKE 951
               +P+ ++ +LP         +      R   +    E +    +K        K+ +E
Sbjct: 978  DYSLPREIESKLP------LDKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEE 1031

Query: 952  KQAMKARMVALKLRAKAAEEAKQQRQRVMKKDIFRTL--RMPKVM 994
            K+ M++   A +      E+ ++QR R    D ++ +  +  K  
Sbjct: 1032 KERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKK 1076



 Score =  117 bits (294), Expect = 2e-26
 Identities = 52/83 (62%), Positives = 65/83 (78%)

Query: 990  MPKVMGVLTHLDMLKNNKTLKTTKKMLKHRFWTEVYAGAKLFYLSGIVHGEYMKNEVKNL 1049
            MP+V+GV+THLD+ KN  TL++ KK LKHRFWTE+Y GAKLFYLSG+ +G Y   E+ NL
Sbjct: 163  MPRVLGVVTHLDLFKNPSTLRSIKKRLKHRFWTEIYQGAKLFYLSGVENGRYPDREILNL 222

Query: 1050 GRFIAVMKFRPLIWQTTHSYMLV 1072
             RFI+VMKFRPL W+  H Y+L 
Sbjct: 223  SRFISVMKFRPLEWRNMHPYVLA 245



 Score =  105 bits (263), Expect = 8e-23
 Identities = 42/75 (56%), Positives = 57/75 (76%)

Query: 1096 QARVKKHRWYGKILKSGNPVIMSVGWRRFQTLPIYSKQEDNMRYRMLKYTPQHVACMAHF 1155
            Q R+K+HRW+ KILK+ +P+I SVGWRRFQ++P+YS ++   R RMLKYTP+H+ C   F
Sbjct: 748  QGRIKRHRWHKKILKTNDPLIFSVGWRRFQSIPVYSMKDSRTRNRMLKYTPEHMHCNVSF 807

Query: 1156 WGPITRSGTGFLAVQ 1170
            +GP+    TGF AVQ
Sbjct: 808  YGPVVPPNTGFCAVQ 822



 Score = 89.8 bits (222), Expect = 5e-18
 Identities = 47/104 (45%), Positives = 63/104 (60%), Gaps = 7/104 (6%)

Query: 379 KKVHRKRQAELTAKQK------NPKAFTFQSVIKGERKFRRKEDIQAKKHHVPQVDRTPL 432
           K  H K + +   K        N KAF   ++ +  R+  R  DI+ KK HVP VDRTP 
Sbjct: 5   KAKHSKAEKKKLKKVMDGKVGNNAKAFAVAAIGQMARQAMRTADIEEKKLHVPMVDRTPK 64

Query: 433 E-PPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLI 475
           + PPP +VAVVGPP  GKSTLIR L++ FTK  +  I+GP+T++
Sbjct: 65  DLPPPFIVAVVGPPGTGKSTLIRSLVRRFTKQTIDEIRGPITVV 108



 Score = 79.8 bits (196), Expect = 6e-15
 Identities = 32/48 (66%), Positives = 40/48 (83%)

Query: 55  ITFIECNNDINSMIDISKVADLVLLLIDASFGFEMEIFEFLNICQVHG 102
           ITF+EC +D++ MID++K+ADLVLLLID +FGFEME  EFLNI   HG
Sbjct: 115 ITFLECPSDLHQMIDVAKIADLVLLLIDGNFGFEMETMEFLNILISHG 162



 Score = 33.6 bits (76), Expect = 0.78
 Identities = 37/176 (21%), Positives = 69/176 (39%), Gaps = 32/176 (18%)

Query: 122 AISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVV-----------SSQDGRKRR---- 166
           A SS+      DE +   D ++    S+    E+++           SS    K R    
Sbjct: 524 AKSSESDLVVQDEPEDFFDVSKVANESISSNHEKLMESEFEELKKKWSSLAQLKSRFQKD 583

Query: 167 ----------KVMFDD---DLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
                     +++ DD   + ED  D+++  D++ E    +    E E   +E + E + 
Sbjct: 584 ATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETER 643

Query: 214 EERKAEKKNSK-NEHVQELTAKQKNPKAFTFQSVIKGERKF---RSTKPTVVPQPR 265
           EE   +K+  + N  ++E    +K    +  +   K E +    RS   T+VP+ R
Sbjct: 644 EENARKKEELRGNFELEERGDPEKKDVDWYTEEKRKIEEQLKINRSEFETMVPESR 699



 Score = 32.8 bits (74), Expect = 1.3
 Identities = 46/294 (15%), Positives = 111/294 (37%), Gaps = 25/294 (8%)

Query: 127 DSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDD 186
           +SD  +  D  S +    G+ + ++   + ++ +  R+  ++ FDD         D+++D
Sbjct: 388 NSDAIDTVDRESSEIDNVGRKTRRQPTGKAIAEETSRED-ELSFDDSDVST---SDENED 443

Query: 187 DDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSV 246
            D T      +NE+ES+ EE   + D +  ++E      E +    A  ++         
Sbjct: 444 VDFTGKKGAINNEDESDNEEVAFDSDSQFDESEGNLRWKEGLASKLAYSQS--------- 494

Query: 247 IKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADD 306
                K       +     +S ++    +K  SA +  +    ++E ++    S  + + 
Sbjct: 495 ----GKRGRNIQKIFYDESLSPEECIEEYKGESAKSSESDLVVQDEPEDFFDVSKVANES 550

Query: 307 NSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAASAFLERQANIVNLAKYVYGDMED 366
            SS+ ++ +E +  +  K       L  KS   + A    +E +  ++   +   G+ ED
Sbjct: 551 ISSNHEKLMESEFEELKKKWSSLAQL--KSRFQKDATLDSIEGEEELIQDDE--KGNFED 606

Query: 367 VSVTMEGDEAPNKKVHRKRQAELTAKQKNPKAFTFQSVIKGERKFRRKEDIQAK 420
           +       +   ++           +  +   +  +     E   R+KE+++  
Sbjct: 607 LEDEENSSDNEMEESRGSSVTAENEESADEVDYETER----EENARKKEELRGN 656



 Score = 32.8 bits (74), Expect = 1.5
 Identities = 44/242 (18%), Positives = 90/242 (37%), Gaps = 30/242 (12%)

Query: 122 AISSDDSDDEN----DEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDN 177
           AI+ + S ++     D D ++ D+ E    + KK     ++++D     +V FD D +  
Sbjct: 417 AIAEETSREDELSFDDSDVSTSDENEDVDFTGKKGA---INNEDESDNEEVAFDSDSQ-- 471

Query: 178 MDDDDDDDDDDETDDDNDEDNEEESE---------EEEENEEDDDEERKAEKKNSKNEHV 228
            D+ + +    E        ++             +E  + E+  EE K E   S    +
Sbjct: 472 FDESEGNLRWKEGLASKLAYSQSGKRGRNIQKIFYDESLSPEECIEEYKGESAKSSESDL 531

Query: 229 QELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQP--RMSDK-----DLAIRHKITSAL 281
                 Q  P+ F F           S    ++      +  K      L  R +  + L
Sbjct: 532 VV----QDEPEDF-FDVSKVANESISSNHEKLMESEFEELKKKWSSLAQLKSRFQKDATL 586

Query: 282 AGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQK 341
             +  +EE  ++ E   F D   ++NSSD +      +   ++N E    ++++++  + 
Sbjct: 587 DSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREEN 646

Query: 342 AA 343
           A 
Sbjct: 647 AR 648


>gnl|CDD|218343 pfam04950, DUF663, Protein of unknown function (DUF663).  This
           family contains several uncharacterized eukaryotic
           proteins.
          Length = 293

 Score =  297 bits (762), Expect = 4e-92
 Identities = 110/292 (37%), Positives = 160/292 (54%), Gaps = 14/292 (4%)

Query: 548 RAELMEKKRKLKEQFDAEYDDKDGGGN------TYYDDLKTQATRQAELNRQQFHDLDDN 601
             E   K R LK    + +D  +   N        +     + T++  LN  +       
Sbjct: 10  ARERFAKYRGLKSFRTSPWDPNEKDPNLPNDYARIFQFQNYKNTKKRILNEYEE------ 63

Query: 602 ARVELEGFRAGLYIRVELDGMPCELIENFDPTYPLIVGGLQPGEETIGCVRARVKKHRWY 661
               +EG + G Y+R+E+  +P EL ENF+    +IV GL P E  +  V  ++KKHRWY
Sbjct: 64  -EKRIEGVKPGWYVRLEIKNVPKELAENFNSKQLIIVFGLLPHEHKMSVVNFKIKKHRWY 122

Query: 662 GKILKSGNPVIMSVGWRRFQTLPIYSKQEDNMRYRMLKYTPQHVACMAHFWGPITRSGTG 721
            K LKS  P+I+ VG+RRFQ+ PIYS+   N +++  +Y P H  C A F+GPIT   + 
Sbjct: 123 KKPLKSKEPLIIQVGFRRFQSQPIYSQNTSNDKHKYERYLPPHKTCNATFYGPITFPNSP 182

Query: 722 FLAVQDVAKREPGFRVIATGTILDANQTAEVTKKLKLTGVPMKIYKKTAFIKDMFNSTLE 781
            LA +       GFR+ ATG IL+ +    + KKLKLTG P KI+KKTAF++ MF +  +
Sbjct: 183 LLAFKKSFSDTNGFRIAATGCILECDPNRIIAKKLKLTGHPYKIFKKTAFVRYMFFNPED 242

Query: 782 VAKFEGAKIRTVSGIRGQIKKALNKPQGAFRATFEDKIMLSDIVFCRTWYKV 833
           V  F+   + T SG RG IK+AL    G F+ATF+ K+ + DIV  + + +V
Sbjct: 243 VEWFKPVPLFTKSGRRGHIKEALG-THGYFKATFDGKLNMQDIVIMKLYKRV 293



 Score =  103 bits (259), Expect = 4e-24
 Identities = 35/75 (46%), Positives = 49/75 (65%)

Query: 1096 QARVKKHRWYGKILKSGNPVIMSVGWRRFQTLPIYSKQEDNMRYRMLKYTPQHVACMAHF 1155
              ++KKHRWY K LKS  P+I+ VG+RRFQ+ PIYS+   N +++  +Y P H  C A F
Sbjct: 113  NFKIKKHRWYKKPLKSKEPLIIQVGFRRFQSQPIYSQNTSNDKHKYERYLPPHKTCNATF 172

Query: 1156 WGPITRSGTGFLAVQ 1170
            +GPIT   +  LA +
Sbjct: 173  YGPITFPNSPLLAFK 187


>gnl|CDD|206669 cd01882, BMS1, Bms1, an essential GTPase, promotes assembly of
            preribosomal RNA processing complexes.  Bms1 is an
            essential, evolutionarily conserved, nucleolar protein.
            Its depletion interferes with processing of the 35S
            pre-rRNA at sites A0, A1, and A2, and the formation of
            40S subunits. Bms1, the putative endonuclease Rc11, and
            the essential U3 small nucleolar RNA form a stable
            subcomplex that is believed to control an early step in
            the formation of the 40S subumit. The C-terminal domain
            of Bms1 contains a GTPase-activating protein (GAP) that
            functions intramolecularly. It is believed that Rc11
            activates Bms1 by acting as a guanine-nucleotide exchange
            factor (GEF) to promote GDP/GTP exchange, and that
            activated (GTP-bound) Bms1 delivers Rc11 to the
            preribosomes.
          Length = 231

 Score =  169 bits (431), Expect = 1e-47
 Identities = 62/83 (74%), Positives = 68/83 (81%)

Query: 990  MPKVMGVLTHLDMLKNNKTLKTTKKMLKHRFWTEVYAGAKLFYLSGIVHGEYMKNEVKNL 1049
             PKVMGVLTHLD  KNNKTL+ TKK LKHRFWTEVY GAKLFYLSGIVHG Y K E+ NL
Sbjct: 133  FPKVMGVLTHLDKFKNNKTLRKTKKRLKHRFWTEVYDGAKLFYLSGIVHGRYPKTEILNL 192

Query: 1050 GRFIAVMKFRPLIWQTTHSYMLV 1072
             RFI+VMKFRPL W+ +H Y+L 
Sbjct: 193  ARFISVMKFRPLNWRNSHPYVLA 215



 Score =  130 bits (329), Expect = 5e-34
 Identities = 51/88 (57%), Positives = 64/88 (72%)

Query: 398 AFTFQSVIKGERKFRRKEDIQAKKHHVPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCLI 457
           AF  QS ++  R+F+R +D++ KK HVP VDRTP EPPP+VV VVGPP VGKSTLIR LI
Sbjct: 1   AFAVQSAVRAARQFQRTQDLEEKKLHVPVVDRTPEEPPPLVVVVVGPPGVGKSTLIRSLI 60

Query: 458 KNFTKTPLSVIKGPVTLIIKDSIRDCFV 485
           K +TK  LS IKGP+T++     R  F+
Sbjct: 61  KRYTKQNLSDIKGPITIVTGKKRRLTFI 88



 Score =  111 bits (280), Expect = 2e-27
 Identities = 39/48 (81%), Positives = 44/48 (91%)

Query: 55  ITFIECNNDINSMIDISKVADLVLLLIDASFGFEMEIFEFLNICQVHG 102
           +TFIEC NDINSMID++K+ADLVLLLID S+GFEME FEFLNI QVHG
Sbjct: 85  LTFIECPNDINSMIDVAKIADLVLLLIDGSYGFEMETFEFLNILQVHG 132


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 90.9 bits (225), Expect = 2e-18
 Identities = 60/218 (27%), Positives = 101/218 (46%), Gaps = 5/218 (2%)

Query: 611 AGLYIRVELDGMPCELIENF-DPTYPLIVGGLQPGEETIGCVRARVKKHRWYGKILKSGN 669
            G  +R++L   P  L E   +P   L+V GL   E+        +++H  Y   LKS  
Sbjct: 535 DGQMVRIKLR-FPKFLYEGLIEPQILLVVYGLLEYEDKKTVHNFSLQRHFEYEVPLKSEE 593

Query: 670 PVIMSVGWRRFQTLPIYSKQED--NMRYRMLKYTPQHVACMAHFWGPITRSGTGFLAVQD 727
            +++ +G RR    P+ SK  +  N   +  +      + +A F GPI+   +  +  + 
Sbjct: 594 SMVVQLGHRRVDICPLISKGSNSPNNNQKYFRRLKPLESGVASFIGPISFGLSPVIIFKK 653

Query: 728 VAKREPGFRVIATGTILDANQTAEVTKKLKLTGVPMKIYKKTAFIKDMFNSTLEVAKFEG 787
            A  E    ++A+G + + +    + K+  LTG P K +K+   ++ MF S  +V  F+ 
Sbjct: 654 SALDELSATLLASGGMNNFDGDRVIAKRAVLTGHPFKNHKRYVTVRYMFFSPEDVMWFKN 713

Query: 788 AKIRTVSGIRGQIKKALNKPQGAFRATFEDKIMLSDIV 825
            ++ T  G  G IK+ L    G F+ATF  KI   D V
Sbjct: 714 IQLFTKRGRTGFIKEPLGT-HGYFKATFSGKIKSQDKV 750



 Score = 33.9 bits (77), Expect = 0.56
 Identities = 18/71 (25%), Positives = 34/71 (47%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
           D++ ED   +D++       DDD  E++ EE   +EE+  DD+E  +      +   ++E
Sbjct: 395 DEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLRE 454

Query: 231 LTAKQKNPKAF 241
               +K  + F
Sbjct: 455 FRDMEKEDREF 465



 Score = 33.9 bits (77), Expect = 0.59
 Identities = 27/116 (23%), Positives = 46/116 (39%), Gaps = 6/116 (5%)

Query: 120 SGAISSDDSDDENDEDDTSDDDTESGQGS---MKKYKEQVVSSQDGRKRRKVMFDDDLED 176
              +  DD       + TS         S    K  +++             M   D +D
Sbjct: 358 LTTVRIDDHGFLPGREQTSKKAAVPKGTSFYQAKWAEDEEEEDGQCNDEESTMSAIDDDD 417

Query: 177 NMDDDDDD-DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
             ++D+++   D+E+  D++E  EE S EEEE +    E R  EK++ +     EL
Sbjct: 418 PKENDNEEVAGDEESAIDDNEGFEELSPEEEERQL--REFRDMEKEDREFPDEAEL 471



 Score = 32.0 bits (72), Expect = 2.3
 Identities = 17/61 (27%), Positives = 28/61 (45%), Gaps = 2/61 (3%)

Query: 1101 KHRWYGKILKSGNPVIMSVGWRRFQTLPIYSKQED--NMRYRMLKYTPQHVACMAHFWGP 1158
            +H  Y   LKS   +++ +G RR    P+ SK  +  N   +  +      + +A F GP
Sbjct: 581  RHFEYEVPLKSEESMVVQLGHRRVDICPLISKGSNSPNNNQKYFRRLKPLESGVASFIGP 640

Query: 1159 I 1159
            I
Sbjct: 641  I 641



 Score = 30.8 bits (69), Expect = 5.9
 Identities = 14/81 (17%), Positives = 36/81 (44%)

Query: 139 DDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDN 198
           D++ E GQ + ++     +   D ++        D E  +DD++  ++    +++     
Sbjct: 395 DEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLRE 454

Query: 199 EEESEEEEENEEDDDEERKAE 219
             + E+E+    D+ E + +E
Sbjct: 455 FRDMEKEDREFPDEAELQPSE 475


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 60.4 bits (147), Expect = 5e-09
 Identities = 30/118 (25%), Positives = 57/118 (48%), Gaps = 8/118 (6%)

Query: 126 DDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDD 185
           ++ D+E ++   S DD +          +     Q      +     D ED  DDDDD +
Sbjct: 286 EEDDEEEEDSKESADDLDDE--FEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLE 343

Query: 186 DDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTF 243
           +++E  D +DE+ +EE E+ ++ +++++EE + EKK  K+          ++   FTF
Sbjct: 344 EEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKS------AESTRSELPFTF 395



 Score = 48.8 bits (117), Expect = 2e-05
 Identities = 28/93 (30%), Positives = 47/93 (50%), Gaps = 17/93 (18%)

Query: 165 RRKVMF---DDDLEDNM--------------DDDDDDDDDDETDDDNDEDNEEESEEEEE 207
           ++K MF   DDD ++ +              DD  DDDD D+ D  +   ++  +     
Sbjct: 99  KKKSMFNLEDDDEDEFVLTHLGQSLSEIDKDDDVRDDDDFDDDDLGDLASDDRAAHFGGG 158

Query: 208 NEEDDDEERKAEKKNSKNEHVQELTAKQKNPKA 240
            ++++DEE + E+K SK E ++E+ AK K  KA
Sbjct: 159 EDDEEDEEEQPERKKSKKEVMKEVIAKSKFYKA 191



 Score = 40.8 bits (96), Expect = 0.005
 Identities = 21/66 (31%), Positives = 36/66 (54%), Gaps = 10/66 (15%)

Query: 164 KRRKVMFDDDLEDNMDDDDDDDDDDETDDDND----------EDNEEESEEEEENEEDDD 213
           +R +   +DD E+    +  DD DDE + D+D          ++ EEE   ++E+EEDDD
Sbjct: 280 RRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDD 339

Query: 214 EERKAE 219
           ++ + E
Sbjct: 340 DDLEEE 345



 Score = 38.1 bits (89), Expect = 0.034
 Identities = 16/73 (21%), Positives = 30/73 (41%), Gaps = 6/73 (8%)

Query: 502 DDMDDDEELFGDFEDLETGEKHSGDKSGGGSGGVSGGGSGDDKPKTRAELMEK---KRKL 558
           D  DDD+    D  DL + +       GGG           ++ K++ E+M++   K K 
Sbjct: 131 DVRDDDDFDDDDLGDLASDD--RAAHFGGGEDDEEDEEEQPERKKSKKEVMKEVIAKSKF 188

Query: 559 -KEQFDAEYDDKD 570
            K +     ++ +
Sbjct: 189 YKAERQKAKEEDE 201



 Score = 35.0 bits (81), Expect = 0.29
 Identities = 53/283 (18%), Positives = 96/283 (33%), Gaps = 76/283 (26%)

Query: 107 KSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRR 166
           +S+ EID            DD D ++D+      D  +      +  E+    Q  RK+ 
Sbjct: 121 QSLSEIDKDDDVR------DDDDFDDDDLGDLASDDRAAHFGGGEDDEEDEEEQPERKKS 174

Query: 167 K--VM--------F-----------DDDLEDNMDDDDDD---------------DDDDET 190
           K  VM        F           D+DL + +DDD  D                  +E 
Sbjct: 175 KKEVMKEVIAKSKFYKAERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMTPEEK 234

Query: 191 DDDNDE------------------DNEEESEEEEENEEDDDEERKA----EKKNSKNEHV 228
           DD+ D+                    EE ++EE E  +  + ER      E+++ + E  
Sbjct: 235 DDEYDQRVRELTFDRRAQPTDRTKTEEELAKEEAERLKKLEAERLRRMRGEEEDDEEEED 294

Query: 229 QELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKE 288
            + +A   + +         G  +    +     +  + D+D              +   
Sbjct: 295 SKESADDLDDEFEPDDDDNFGLGQ--GEEDEEEEEDGVDDEDEED----------DDDDL 342

Query: 289 EEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTG 331
           EEEEE  D +  +E  +D  SD+++  E +  +  K  ++   
Sbjct: 343 EEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAE 385



 Score = 33.4 bits (77), Expect = 0.90
 Identities = 23/155 (14%), Positives = 52/155 (33%), Gaps = 45/155 (29%)

Query: 178 MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKN 237
           M  +++DD+++E   ++ +D ++E E ++++     +  + E++                
Sbjct: 282 MRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVD---------- 331

Query: 238 PKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDS 297
                     + E                 D DL               +E+ +   E+ 
Sbjct: 332 ---------DEDEED--------------DDDDLE------------EEEEDVDLSDEEE 356

Query: 298 AFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGL 332
              DE +DD   +E+E  E +  K        + L
Sbjct: 357 DEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391


>gnl|CDD|219725 pfam08142, AARP2CN, AARP2CN (NUC121) domain.  This domain is the
            central domain of AARP2. It is weakly similar to the
            GTP-binding domain of elongation factor TU.
          Length = 81

 Score = 52.2 bits (126), Expect = 2e-08
 Identities = 16/28 (57%), Positives = 19/28 (67%)

Query: 1045 EVKNLGRFIAVMKFRPLIWQTTHSYMLV 1072
            E+ NL RFI+VMK RPL W+    YML 
Sbjct: 1    EILNLLRFISVMKPRPLSWRDQRPYMLA 28


>gnl|CDD|129021 smart00785, AARP2CN, AARP2CN (NUC121) domain.  This domain is the
            central domain of AARP2. It is weakly similar to the
            GTP-binding domain of elongation factor TU.
          Length = 83

 Score = 52.2 bits (126), Expect = 2e-08
 Identities = 16/28 (57%), Positives = 20/28 (71%)

Query: 1045 EVKNLGRFIAVMKFRPLIWQTTHSYMLV 1072
            E+ NL RF++VMK RPL W+  H YML 
Sbjct: 1    EILNLLRFLSVMKPRPLSWRDQHPYMLA 28


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 54.7 bits (132), Expect = 1e-07
 Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 3/71 (4%)

Query: 144 SGQGSMKKYKEQVVSSQDGRKRRKVMFD---DDLEDNMDDDDDDDDDDETDDDNDEDNEE 200
            G G+MK  K+    ++  ++R     D    D E +  DDDDD+D  E+D D+ +D+  
Sbjct: 218 EGGGAMKVLKQPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSDDDVS 277

Query: 201 ESEEEEENEED 211
           + + E+  + D
Sbjct: 278 DEDGEDLFDTD 288



 Score = 37.4 bits (87), Expect = 0.038
 Identities = 23/96 (23%), Positives = 39/96 (40%), Gaps = 10/96 (10%)

Query: 130 DENDEDDTSDDDTESGQGSM--KKYKEQVVSSQDGR------KRRKVMFDDDLEDNMDDD 181
                D   DD++E+           EQ +   +G       K+ K          +   
Sbjct: 185 PPQQTDGAGDDESEALVRLREADGTLEQRIKGAEGGGAMKVLKQPKKQAKSSKRRTIAQI 244

Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDD--DEE 215
           D  D DDE D  +D+D+E+  E + ++ +DD  DE+
Sbjct: 245 DGIDSDDEGDGSDDDDDEDAIESDLDDSDDDVSDED 280



 Score = 37.0 bits (86), Expect = 0.046
 Identities = 16/96 (16%), Positives = 32/96 (33%), Gaps = 14/96 (14%)

Query: 103 ESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDG 162
            + E+ +   +G  G     +       ++ +  T                +        
Sbjct: 208 GTLEQRIKGAEG--GGAMKVLKQPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDD---- 261

Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDN 198
                   +D +E ++DD DDD  D++ +D  D DN
Sbjct: 262 --------EDAIESDLDDSDDDVSDEDGEDLFDTDN 289


>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
           Provisional.
          Length = 337

 Score = 54.4 bits (131), Expect = 2e-07
 Identities = 17/56 (30%), Positives = 28/56 (50%), Gaps = 1/56 (1%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEH 227
            D + +   D+DDDD D  D  +   ++  S + + NEEDD  E+++    S   H
Sbjct: 280 IDEDSDYSSDEDDDDYDSYDSSDSASSDSNS-DVDTNEEDDRGEKESNGAKSNELH 334



 Score = 36.7 bits (85), Expect = 0.064
 Identities = 14/85 (16%), Positives = 24/85 (28%), Gaps = 29/85 (34%)

Query: 123 ISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDD 182
            + D+  D + ++D  D D                               D  D+   D 
Sbjct: 278 EAIDEDSDYSSDEDDDDYD-----------------------------SYDSSDSASSDS 308

Query: 183 DDDDDDETDDDNDEDNEEESEEEEE 207
           + D D   +DD  E     ++  E 
Sbjct: 309 NSDVDTNEEDDRGEKESNGAKSNEL 333


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 54.2 bits (130), Expect = 3e-07
 Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 9/125 (7%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQV--VSSQDGRKRRKVMFDDDLEDNMDDD 181
              D +D+ DE D   +D +  +   KK K         D +K ++   DD  E + DD 
Sbjct: 218 LEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDG 277

Query: 182 DDD-------DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAK 234
           DD+        D   + +D +E  ++ S E     E + +E   E +  KNE    L+ K
Sbjct: 278 DDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKK 337

Query: 235 QKNPK 239
            K  K
Sbjct: 338 GKKLK 342



 Score = 39.2 bits (91), Expect = 0.012
 Identities = 21/102 (20%), Positives = 48/102 (47%), Gaps = 16/102 (15%)

Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDD 184
           SDD DDE  E+D   D + SG    ++  E  +S +   K               + + D
Sbjct: 274 SDDGDDEGREEDYISDSSASGNDPEER--EDKLSPEIPAKP--------------EIEQD 317

Query: 185 DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           +D +E++++ +E+    S++ ++ ++   ++   +K +S + 
Sbjct: 318 EDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSG 359



 Score = 36.1 bits (83), Expect = 0.12
 Identities = 34/186 (18%), Positives = 64/186 (34%), Gaps = 28/186 (15%)

Query: 169 MFDDDLEDNMDDDDDDDDDDETDD------------DNDEDNEEESEEEEENEEDDDEER 216
           M     ++      D+D++ E +             D + D+E++ +E ++  ED DEE+
Sbjct: 181 MMMKAAKNGPAAFGDEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEK 240

Query: 217 KAEKKNSKNEHVQELTAKQKNPK-----AFTFQSVIKGERKFRSTKPTVVPQPRMSDKDL 271
             +KK    ++ ++L   +K  +     A  + S   G+      +   +     S  D 
Sbjct: 241 SKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSD-DGD--DEGREEDYISDSSASGNDP 297

Query: 272 AIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTG 331
             R    S         E   + E     D    +   +E+E    K  K  K  +    
Sbjct: 298 EEREDKLS--------PEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKN 349

Query: 332 LNWKSD 337
              K D
Sbjct: 350 GLDKDD 355



 Score = 35.3 bits (81), Expect = 0.18
 Identities = 31/146 (21%), Positives = 55/146 (37%), Gaps = 9/146 (6%)

Query: 121 GAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDD 180
             I  D+  +E++E+   ++   S +G  K  K +      G+K      D D  D+ DD
Sbjct: 312 PEIEQDEDSEESEEEKNEEEGGLSKKG-KKLKKLK------GKKNGLDKDDSDSGDDSDD 364

Query: 181 DDDDDDDDETDDDNDEDNEEESEEE-EENEEDDDEERKAE-KKNSKNEHVQELTAKQKNP 238
            D D +D  +     +  E + EE  + N         A     SK++  ++   +    
Sbjct: 365 SDIDGEDSVSLVTAKKQKEPKKEEPVDSNPSSPGNSGPARPSPESKDKGKRKAANEVSKS 424

Query: 239 KAFTFQSVIKGERKFRSTKPTVVPQP 264
            A      +K E   +S+     PQ 
Sbjct: 425 PASVPAKKLKTENAPKSSSGKSTPQT 450



 Score = 33.0 bits (75), Expect = 0.94
 Identities = 19/82 (23%), Positives = 34/82 (41%), Gaps = 3/82 (3%)

Query: 124 SSDDSDDENDEDDTSDDDTESG---QGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDD 180
           S+  +D E  ED  S +        Q    +  E+  + ++G   +K      L+   + 
Sbjct: 291 SASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNG 350

Query: 181 DDDDDDDDETDDDNDEDNEEES 202
            D DD D   D D+ + + E+S
Sbjct: 351 LDKDDSDSGDDSDDSDIDGEDS 372



 Score = 31.1 bits (70), Expect = 4.6
 Identities = 31/221 (14%), Positives = 70/221 (31%), Gaps = 43/221 (19%)

Query: 129 DDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDD 188
                 D+  + + E G G   K            K + +  DD+ + +  D   +D D+
Sbjct: 189 GPAAFGDEDEETEGEKGGGGRGK----------DLKIKDLEGDDEDDGDESDKGGEDGDE 238

Query: 189 ETD----------------------DDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           E                          +D+ +E +S++ ++   ++D    +    +  E
Sbjct: 239 EKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPE 298

Query: 227 HVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNS 286
             ++  + +   K    Q     E +              ++++  +  K          
Sbjct: 299 EREDKLSPEIPAKPEIEQDEDSEESEEEK-----------NEEEGGLSKKGKKLKKLKGK 347

Query: 287 KEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPE 327
           K   +++  DS    + +D +  D    V  K  K+ K  E
Sbjct: 348 KNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQKEPKKEE 388


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 53.8 bits (130), Expect = 4e-07
 Identities = 30/107 (28%), Positives = 48/107 (44%)

Query: 120 SGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMD 179
           + A  +       DE D+S    +            V       +      DDD +D  D
Sbjct: 85  AAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDD 144

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           DD DDDDDDE DD++D+D++ + E+EE+ E  + E+   +     +E
Sbjct: 145 DDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDE 191



 Score = 53.1 bits (128), Expect = 7e-07
 Identities = 38/147 (25%), Positives = 73/147 (49%), Gaps = 4/147 (2%)

Query: 103 ESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSD----DDTESGQGSMKKYKEQVVS 158
           ES +K+  +ID +   +SG +   D   E+D              +   + KK K+++ S
Sbjct: 43  ESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDS 102

Query: 159 SQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
           S+   K+  +  DDDL    D D  +  DD+ DDD+D+D +++  ++++++EDDDE+   
Sbjct: 103 SKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDD 162

Query: 219 EKKNSKNEHVQELTAKQKNPKAFTFQS 245
           +  + ++E  +E    +K      F  
Sbjct: 163 DDVDDEDEEKKEAKELEKLSDDDDFVW 189



 Score = 46.5 bits (111), Expect = 6e-05
 Identities = 29/112 (25%), Positives = 56/112 (50%), Gaps = 7/112 (6%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
           SS  ++ +N  D   D +       + +  +      D       + DDD++D+ DD+DD
Sbjct: 102 SSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDD-----DLDDDDIDDDDDDEDD 156

Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK--KNSKNEHVQELTA 233
           D+DDD+ D D++++ ++E++E E+  +DDD     +      +     +LTA
Sbjct: 157 DEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLTA 208



 Score = 43.8 bits (104), Expect = 5e-04
 Identities = 15/99 (15%), Positives = 37/99 (37%), Gaps = 1/99 (1%)

Query: 122 AISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
              +    D++       D             +  +   D         DD+ +D+ D D
Sbjct: 107 EKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVD 166

Query: 182 DDDDDDDETDD-DNDEDNEEESEEEEENEEDDDEERKAE 219
           D+D++  E  + +   D+++   +E+++E      + A+
Sbjct: 167 DEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAK 205



 Score = 32.7 bits (75), Expect = 1.3
 Identities = 27/180 (15%), Positives = 55/180 (30%), Gaps = 12/180 (6%)

Query: 166 RKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
           ++ +          D           D +D    +  +++ +        +   KK  K+
Sbjct: 39  KEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKD 98

Query: 226 EHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLN 285
           E      A++KN            +      K   V      D D          L   +
Sbjct: 99  ELDSSKKAEKKNALDK--------DDDLNYVKDIDVLNQADDDDDD----DDDDDLDDDD 146

Query: 286 SKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAASA 345
             +++++E +D    D+  DD   ++ E  E +   D  +   D   +     A+K A  
Sbjct: 147 IDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKL 206


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 53.5 bits (128), Expect = 8e-07
 Identities = 43/222 (19%), Positives = 85/222 (38%), Gaps = 43/222 (19%)

Query: 127  DSDDENDEDDTSDDDTESGQGSMK-KYKEQVVSSQDGRKRRKVMFDDD------------ 173
            + +D  ++ D  + +    +  M    K+ VVS  +     +   D D            
Sbjct: 3842 NEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSN 3901

Query: 174  -LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD---EERKAEKKNSKNEHVQ 229
             L + + D+ +++D  ET+  ++E +   +E +  ++EDD+   E++  ++K  + E   
Sbjct: 3902 SLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSD 3961

Query: 230  ELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEE 289
            ++               I  E +    +    P P   D DL            L   E+
Sbjct: 3962 DVG--------------IDDEIQPDIQENNSQPPPENEDLDLP---------EDLKLDEK 3998

Query: 290  EEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTG 331
            E +  +DS   D   D  ++DE++  E    KD    +ED  
Sbjct: 3999 EGDVSKDSDLED--MDMEAADENKE-EADAEKDEPMQDEDPL 4037



 Score = 48.1 bits (114), Expect = 3e-05
 Identities = 41/243 (16%), Positives = 93/243 (38%), Gaps = 48/243 (19%)

Query: 115  IHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDL 174
            +   M+G     DS    +E+  S+++ +     +    E + +S + +     ++D+  
Sbjct: 3860 LESDMNGVT--KDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEK-----LWDEPN 3912

Query: 175  EDNMDDDD----------DDDDDDETDDDND--EDNEEESEEEEENEEDD---DEERKAE 219
            E+++ + +          ++ D    +DDN   ED + + +E+EE   DD   D+E + +
Sbjct: 3913 EEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPD 3972

Query: 220  KKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITS 279
             + + ++   E                +  + K    +  V     + D D+        
Sbjct: 3973 IQENNSQPPPENEDLD-----------LPEDLKLDEKEGDVSKDSDLEDMDMEAA----- 4016

Query: 280  ALAGLNSKEEEEEEQEDSAFSDESA--DDNSSDEDETVEPK---THKDSKNPEEDTGLNW 334
                 +  +EE + ++D    DE    ++N+ DED   +        D K  E+    N 
Sbjct: 4017 -----DENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENV 4071

Query: 335  KSD 337
            + +
Sbjct: 4072 QEN 4074



 Score = 46.9 bits (111), Expect = 7e-05
 Identities = 46/214 (21%), Positives = 80/214 (37%), Gaps = 28/214 (13%)

Query: 122  AISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
            A+   D  ++ DE++ SDD    G     +   Q  +SQ   +   +   +DL+ +  + 
Sbjct: 3944 ALEDKDRQEKEDEEEMSDD---VGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEG 4000

Query: 182  DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAF 241
            D   D D  D D +  +E + E + E +E   +E   E+ N+ +E +Q            
Sbjct: 4001 DVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQ----------QD 4050

Query: 242  TFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEE--EEQEDSAF 299
             F  + + + K          +    +++        S   G+ S EE E  E  ED A 
Sbjct: 4051 DFSDLAEDDEKMNEDGFE---ENVQENEE--------STEDGVKSDEELEQGEVPEDQAI 4099

Query: 300  -SDESADDNSSDEDETV-EPKTHKDSKNPEEDTG 331
             +    D  S+       E  T K      E+ G
Sbjct: 4100 DNHPKMDAKSTFASAEADEENTDKGIVGENEELG 4133



 Score = 45.4 bits (107), Expect = 2e-04
 Identities = 44/185 (23%), Positives = 81/185 (43%), Gaps = 20/185 (10%)

Query: 126  DDSDDENDED-DTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDN--MDDDD 182
            D+ + +  +D D  D D E+      + KE+     D  K   +  +D LE+N  +D+D 
Sbjct: 3996 DEKEGDVSKDSDLEDMDMEAAD----ENKEE----ADAEKDEPMQDEDPLEENNTLDEDI 4047

Query: 183  DDDDD-DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAF 241
              DD  D  +DD   + +   E  +ENEE  ++  K++++  + E V E  A   +PK  
Sbjct: 4048 QQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGE-VPEDQAIDNHPKMD 4106

Query: 242  TFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSD 301
               +    E    +T   +V +     ++  +R   T       +  E E+ QED++   
Sbjct: 4107 AKSTFASAEADEENTDKGIVGENEELGEEDGVRGNGT-------ADGEFEQVQEDTSTPK 4159

Query: 302  ESADD 306
            E+  +
Sbjct: 4160 EAMSE 4164



 Score = 38.1 bits (88), Expect = 0.035
 Identities = 53/261 (20%), Positives = 98/261 (37%), Gaps = 35/261 (13%)

Query: 88   EMEIFEFLNICQVHGESTEKSMMEIDGIHGAMSGAISSDDSDDEN-------------DE 134
            ++++ E L + +  G+ ++ S +E   +  A      +D   DE              DE
Sbjct: 3986 DLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDE 4045

Query: 135  DDTSDDDTESGQGSMKK----YKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDET 190
            D   DD ++  +   K     ++E V  +++  +      D++LE     +D   D+   
Sbjct: 4046 DIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKS-DEELEQGEVPEDQAIDNHPK 4104

Query: 191  DDDNDEDNEEESEEEE-------ENEE--DDDEERKAEKKNSKNEHVQELTAKQKNPKA- 240
             D        E++EE        ENEE  ++D  R     + + E VQE T+  K   + 
Sbjct: 4105 MDAKSTFASAEADEENTDKGIVGENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSE 4164

Query: 241  --FTFQSVIKGERKFRSTK-----PTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEE 293
                +QS+    R+++          +      +  D    H        L +    E++
Sbjct: 4165 ADRQYQSLGDHLREWQQANRIHEWEDLTESQSQAFDDSEFMHVKEDEEEDLQALGNAEKD 4224

Query: 294  QEDSAFSDESADDNSSDEDET 314
            Q  S   DESA+ N    + T
Sbjct: 4225 QIKSIDRDESANQNPDSMNST 4245



 Score = 34.6 bits (79), Expect = 0.39
 Identities = 43/175 (24%), Positives = 70/175 (40%), Gaps = 34/175 (19%)

Query: 64   INSMIDISKVA-----DLVLLLI------DASFGFEMEIFEFLNICQVHGESTEKSMMEI 112
             NS + +SKV      D   + +      D S  F    F FL +C+ HG  TE +    
Sbjct: 2854 SNSEVCLSKVVNQLSEDTTSMNLTRMCIEDPSIPFM--YFVFLFLCKQHGRDTELAGAFE 2911

Query: 113  DGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDD 172
            DG  G   G  + + S+   +ED+  D             +E+V    D  K   + +DD
Sbjct: 2912 DGT-GMKYGTGNINVSEQIEEEDEKDD---------EYGEQEKVSDKNDYLKD-FIDYDD 2960

Query: 173  D----------LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERK 217
            D           E    +   D  +D  ++++D++  E+  E E+N E   E+ K
Sbjct: 2961 DGSISTESEPEEEKRECESGFDGWNDYEEEEDDDEKNEKECELEQNAEILIEDEK 3015



 Score = 34.6 bits (79), Expect = 0.50
 Identities = 36/207 (17%), Positives = 82/207 (39%), Gaps = 30/207 (14%)

Query: 102  GESTEKSMMEIDGI--HGAMSGAISSDDSDDENDEDDTSDDDTESGQ--GSMKKYKE--Q 155
            GE+ E  + E DG+  +G   G       D    ++  S+ D +       ++++++  +
Sbjct: 4127 GENEE--LGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSEADRQYQSLGDHLREWQQANR 4184

Query: 156  VVSSQDGRKRRKVMFDDDLEDNMDDDDDDD---------------DDDET-DDDNDEDNE 199
            +   +D  + +   FDD    ++ +D+++D               D DE+ + + D  N 
Sbjct: 4185 IHEWEDLTESQSQAFDDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRDESANQNPDSMNS 4244

Query: 200  EESEEEEENEEDDDEERKAE-----KKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFR 254
                E+E +E  D + +  +     K+  ++    E  +  ++ K F        E +  
Sbjct: 4245 TNIAEDEADEVGDKQLQDGQDISDIKQTGEDTLPTEFGSINQSEKVFELSEDEDIEDELP 4304

Query: 255  STKPTVVPQ-PRMSDKDLAIRHKITSA 280
                 + P  P    +DL  +H+ ++ 
Sbjct: 4305 DYNVKITPAMPIDEARDLWNKHEDSTK 4331


>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
           Reviewed.
          Length = 129

 Score = 48.5 bits (116), Expect = 1e-06
 Identities = 18/52 (34%), Positives = 34/52 (65%)

Query: 154 EQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEE 205
           E+++  ++   ++K  F D  +D +DDD   DDD + +D ++ED+E+E +EE
Sbjct: 78  EEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129



 Score = 43.8 bits (104), Expect = 5e-05
 Identities = 15/43 (34%), Positives = 27/43 (62%)

Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
           FD   +  MD DDD  DDD   DD+ ++ + + E++E+ E+++
Sbjct: 87  FDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129



 Score = 42.7 bits (101), Expect = 1e-04
 Identities = 15/45 (33%), Positives = 27/45 (60%), Gaps = 1/45 (2%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNE-EESEEEEENEEDDDEE 215
           ++  D       D DDD  DDD   D++ +E + +EE++ED+++E
Sbjct: 84  EEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDE 128



 Score = 35.0 bits (81), Expect = 0.064
 Identities = 19/95 (20%), Positives = 32/95 (33%), Gaps = 34/95 (35%)

Query: 107 KSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRR 166
           +S   +D I   +       D   +   D                               
Sbjct: 68  RSWYPVDEIDEEIIPLEEKFDKKKKKFMDG------------------------------ 97

Query: 167 KVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEE 201
               DDD+ D+    DDD D+++ D+++DED E+E
Sbjct: 98  ----DDDIIDDDILPDDDFDEEDLDEEDDEDEEDE 128



 Score = 33.1 bits (76), Expect = 0.27
 Identities = 11/47 (23%), Positives = 26/47 (55%), Gaps = 3/47 (6%)

Query: 172 DDLEDNMDDDDDDDDDD---ETDDDNDEDNEEESEEEEENEEDDDEE 215
           D++++ +   ++  D       D D+D  +++   +++ +EED DEE
Sbjct: 74  DEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEE 120


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 52.3 bits (126), Expect = 1e-06
 Identities = 20/81 (24%), Positives = 46/81 (56%), Gaps = 11/81 (13%)

Query: 167 KVMFDDDLEDNMDD-----------DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
            +  D D+E+ + D           ++DDDDD+E+D++++E ++ E +E++++++DDD+ 
Sbjct: 99  VIFDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDI 158

Query: 216 RKAEKKNSKNEHVQELTAKQK 236
              E+   +    +E   K+ 
Sbjct: 159 ATRERSLERRRRRREWEEKRA 179



 Score = 46.1 bits (110), Expect = 9e-05
 Identities = 13/69 (18%), Positives = 36/69 (52%)

Query: 165 RRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
           R    + D  ED+ DD++ D++D+E+    D++++++ +++++    +    +  ++   
Sbjct: 115 RYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREW 174

Query: 225 NEHVQELTA 233
            E   EL  
Sbjct: 175 EEKRAELEF 183



 Score = 31.1 bits (71), Expect = 4.3
 Identities = 36/166 (21%), Positives = 55/166 (33%), Gaps = 50/166 (30%)

Query: 59  ECNNDINSMIDISKVADLVLLLIDASFGFEMEIFEF-----LNICQVHGES--------- 104
           E + DI S+I I+    +V L        ++ ++        N+  V G           
Sbjct: 47  ELDEDIKSVILIN-CGGMVDLEEFLQLDEDVIVYVIDSHRPWNLDNVFGSDQVVIFDDGD 105

Query: 105 TEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRK 164
            E+ + +      A       DD D+E+DE+D     +E                     
Sbjct: 106 IEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSE--------------------- 144

Query: 165 RRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEE 210
                         DD+DDDDDDD+ D    E + E      E EE
Sbjct: 145 --------------DDEDDDDDDDDDDIATRERSLERRRRRREWEE 176


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 50.8 bits (122), Expect = 2e-06
 Identities = 21/70 (30%), Positives = 38/70 (54%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
           DDD E+  + ++D+D DDE +  + E ++E    + E+EE+ DE  K  K++S  E  +E
Sbjct: 97  DDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEE 156

Query: 231 LTAKQKNPKA 240
              +    + 
Sbjct: 157 DEEEAAEEEE 166



 Score = 46.6 bits (111), Expect = 5e-05
 Identities = 25/98 (25%), Positives = 43/98 (43%), Gaps = 21/98 (21%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
             DD ++E + ++  D D E     +                     + D E    D +D
Sbjct: 96  DDDDEEEEWEVEEDEDSDDEGEWIDV---------------------ESDKEIESSDSED 134

Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
           +++ DE      ED++EE  EE+E E  ++EE +AEK+
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172



 Score = 43.5 bits (103), Expect = 4e-04
 Identities = 19/102 (18%), Positives = 41/102 (40%), Gaps = 13/102 (12%)

Query: 129 DDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDD 188
             E +      +          + +E  V   +         D D E    D + D + +
Sbjct: 78  WKEEERKKKEAEQGLESDDDDDEEEEWEVEEDE---------DSDDEGEWIDVESDKEIE 128

Query: 189 ETDDDNDEDNEE----ESEEEEENEEDDDEERKAEKKNSKNE 226
            +D +++E+ +E      E+ +E   ++DEE  AE++ ++ E
Sbjct: 129 SSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAE 170



 Score = 43.5 bits (103), Expect = 5e-04
 Identities = 18/72 (25%), Positives = 40/72 (55%), Gaps = 6/72 (8%)

Query: 173 DLEDNMDDDDD----DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHV 228
           ++E++ D DD+    D + D+  + +D ++EEE +E  +  ++D +E  +E+   +    
Sbjct: 105 EVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAA-- 162

Query: 229 QELTAKQKNPKA 240
           +E  A+ +  KA
Sbjct: 163 EEEEAEAEKEKA 174



 Score = 36.9 bits (86), Expect = 0.050
 Identities = 14/42 (33%), Positives = 26/42 (61%)

Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
           LE   +++    + ++  + +D+D+EEE  E EE+E+ DDE 
Sbjct: 75  LEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEG 116


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 51.9 bits (124), Expect = 2e-06
 Identities = 49/277 (17%), Positives = 100/277 (36%), Gaps = 30/277 (10%)

Query: 123 ISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDD 182
            S +D  D +D D     D E  +       E+  S +D       M DD+ E ++ ++ 
Sbjct: 98  ESHEDGSDGSDMDSEDSADDEEEE-------EEDESLEDE------MIDDEDEADLFNES 144

Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFT 242
           +   +D +DD+ ++D E++ EEEE  EE +  E+   +K      V +   K      F 
Sbjct: 145 ESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNEFL 204

Query: 243 FQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKI-TSALAGLNSKEEEEEEQEDSAFSD 301
             +  + E                  +D                S E+EE+++E +   +
Sbjct: 205 EATEAEEEAALGD---------EDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYE 255

Query: 302 ESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAASAFLERQANIVNLAKYVY 361
           +  D    D+ +        +   P+++  +  ++D   +      + Q +  +  +   
Sbjct: 256 DFFDPKEKDKKKDAGDDAELEDDEPDKEA-VKKEADSKPEEEDEEDDEQEDDQDEEEPPE 314

Query: 362 GDMEDVSVTMEG------DEAPNKKVHRKRQAELTAK 392
             M+ V +          +         KRQA+L  +
Sbjct: 315 AAMDKVKLDEPVLEGVDLESPKELSSFEKRQAKLKQQ 351



 Score = 45.4 bits (107), Expect = 2e-04
 Identities = 31/148 (20%), Positives = 55/148 (37%), Gaps = 15/148 (10%)

Query: 103 ESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDG 162
           E+TE       G           D  D ++DED  S +D E          E     ++ 
Sbjct: 205 EATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSGED-EEDDEEGNIEYEDFFDPKEK 263

Query: 163 RKRRKVMFDDDLEDNMDD--------------DDDDDDDDETDDDNDEDNEEESEEEEEN 208
            K++    D +LED+  D              +D++DD+ E D D +E  E   ++ + +
Sbjct: 264 DKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLD 323

Query: 209 EEDDDEERKAEKKNSKNEHVQELTAKQK 236
           E   +       K   +   ++   KQ+
Sbjct: 324 EPVLEGVDLESPKELSSFEKRQAKLKQQ 351



 Score = 40.7 bits (95), Expect = 0.005
 Identities = 42/258 (16%), Positives = 86/258 (33%), Gaps = 44/258 (17%)

Query: 103 ESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDG 162
           ES E  M++ +      +   S    +D +D++   D++ +  +    + KE V  +   
Sbjct: 125 ESLEDEMIDDEDEADLFN--ESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATRE 182

Query: 163 RKRRKVMFDD---DLEDNMDDD------------DDDDDDDETDDDNDEDNEEESEEEEE 207
           +K  K   DD    L D M++              D+DD ++   D+ ED +++ +    
Sbjct: 183 KKFDKSGVDDKFFKL-DEMNEFLEATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSG 241

Query: 208 NEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFT-FQSVIKGERKFRSTKPTVVPQPRM 266
            +E+DDEE   E ++       +   K K   A    +       K    K         
Sbjct: 242 EDEEDDEEGNIEYED-----FFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEE 296

Query: 267 SDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNP 326
                                EE++E+++D    +         + +    +        
Sbjct: 297 D--------------------EEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPK 336

Query: 327 EEDTGLNWKSDLAQKAAS 344
           E  +    ++ L Q+   
Sbjct: 337 ELSSFEKRQAKLKQQIEQ 354



 Score = 40.0 bits (93), Expect = 0.009
 Identities = 41/192 (21%), Positives = 73/192 (38%), Gaps = 17/192 (8%)

Query: 146 QGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEE 205
               +KY E +++ +  R   K+       ++ +D  D  D D  D  +DE+ EEE E  
Sbjct: 72  DDLNQKYVEFLINKEHIRVLAKL----QDSESHEDGSDGSDMDSEDSADDEEEEEEDESL 127

Query: 206 EENE-EDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQP 264
           E+   +D+DE     +  S  E + +   +    K    +            K +V    
Sbjct: 128 EDEMIDDEDEADLFNESESSLEDLSDDETEDDEEKKMEEEEA-------GEEKESVEQAT 180

Query: 265 RMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDES-----ADDNSSDEDETVEPKT 319
           R    D +        L  +N   E  E +E++A  DE        D+S D  +  +  +
Sbjct: 181 REKKFDKSGVDDKFFKLDEMNEFLEATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGS 240

Query: 320 HKDSKNPEEDTG 331
            +D ++ EE   
Sbjct: 241 GEDEEDDEEGNI 252


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 51.8 bits (124), Expect = 2e-06
 Identities = 20/78 (25%), Positives = 46/78 (58%), Gaps = 2/78 (2%)

Query: 156 VVSSQDGRKRRKVMFD--DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
           V+ + + + R + +F+  ++ ED++++ DDD+D+ E  +D++ ++E + E+ EE+E++DD
Sbjct: 631 VLKADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDD 690

Query: 214 EERKAEKKNSKNEHVQEL 231
            E             + L
Sbjct: 691 AEANEGVVPIDKAVRRAL 708



 Score = 40.6 bits (95), Expect = 0.005
 Identities = 15/61 (24%), Positives = 29/61 (47%), Gaps = 5/61 (8%)

Query: 170 FDDDLEDNMDDDDD-----DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
            +DDLE+  DD+D+     D + +   D  D + +E+ ++ E NE     ++   +   K
Sbjct: 651 DEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPK 710

Query: 225 N 225
            
Sbjct: 711 V 711



 Score = 35.6 bits (82), Expect = 0.16
 Identities = 23/111 (20%), Positives = 47/111 (42%), Gaps = 9/111 (8%)

Query: 124 SSDDSDDE---NDEDDTSDDDTESGQGSMKKYKEQ---VVSSQDGRKRRKVMFDDDLEDN 177
           + DD D+     D +  S+ D E G+   ++   +    V   D   RR +    +L D 
Sbjct: 658 TDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDA 717

Query: 178 MDDDDDDDDDDETDDDNDEDNEEESE---EEEENEEDDDEERKAEKKNSKN 225
           +D  D +D++   D+     +   ++   E++E  +   E +K  +   +N
Sbjct: 718 LDGGDSEDEEGMDDEQMMRLDTYLAQIFKEKKERIQAGGETKKEAQSQKQN 768



 Score = 34.8 bits (80), Expect = 0.28
 Identities = 13/66 (19%), Positives = 29/66 (43%)

Query: 150 KKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
           K   +Q+   ++  +      DDD ++    +D + + +   +D +ED +E+  E  E  
Sbjct: 638 KSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGV 697

Query: 210 EDDDEE 215
              D+ 
Sbjct: 698 VPIDKA 703



 Score = 32.9 bits (75), Expect = 1.2
 Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 13/79 (16%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEE----EEENEED--DDEERKAEKKNSKNEHVQEL-- 231
           +++D+DD +ETDDD     E+E E     E E+E D  D EE + E     NE V  +  
Sbjct: 648 EEEDEDDLEETDDD-----EDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDK 702

Query: 232 TAKQKNPKAFTFQSVIKGE 250
             ++  PK       + G 
Sbjct: 703 AVRRALPKVLNLPDALDGG 721



 Score = 30.2 bits (68), Expect = 7.2
 Identities = 16/66 (24%), Positives = 29/66 (43%), Gaps = 4/66 (6%)

Query: 120 SGAISSDDS--DDENDEDDTSDDDTESGQGSMKKYKE--QVVSSQDGRKRRKVMFDDDLE 175
           +   S  +S  D E+ E+D  +DD E+ +G +   K   + +             D + E
Sbjct: 667 AIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDE 726

Query: 176 DNMDDD 181
           + MDD+
Sbjct: 727 EGMDDE 732


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 49.7 bits (119), Expect = 3e-06
 Identities = 30/95 (31%), Positives = 52/95 (54%), Gaps = 4/95 (4%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE-EDDDEERKAEKKNSKNEHVQ 229
           +DD E  +++++++++ D  D D+ ED+E ES++EEE E E   EER  +KK  K +  +
Sbjct: 38  EDDEEFEIEEEEEEEEVDS-DFDDSEDDEPESDDEEEGEKELQREERLKKKKRVKTKAYK 96

Query: 230 ELTAKQKNPKAFTFQSVI--KGERKFRSTKPTVVP 262
           E T K+K       +S        K +S + +  P
Sbjct: 97  EPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAP 131



 Score = 30.0 bits (68), Expect = 5.9
 Identities = 20/115 (17%), Positives = 44/115 (38%), Gaps = 2/115 (1%)

Query: 123 ISSDDSDDENDED-DTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
           I  ++ ++E D D D S+DD        +  KE +   +  +K+++V      E      
Sbjct: 45  IEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKE-LQREERLKKKKRVKTKAYKEPTKKKK 103

Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK 236
             D    ++        +++SE         D  R+   ++S  ++ +    + K
Sbjct: 104 KKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKSSRSSTVQNKEATHERLK 158


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 49.6 bits (118), Expect = 4e-06
 Identities = 30/111 (27%), Positives = 48/111 (43%), Gaps = 9/111 (8%)

Query: 127 DSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDD 186
           D D+E +E     D  E      ++  E+V S   G   R++  +++ E    DD +D  
Sbjct: 173 DVDNEEEERLEESDGRE------EEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVV 226

Query: 187 DDETDDDNDEDNEEESEEEE---ENEEDDDEERKAEKKNSKNEHVQELTAK 234
           D E +  + +  EEE  EEE     E + +EE  +E+    NE       K
Sbjct: 227 DYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVPRNNEESPAKKQK 277



 Score = 40.4 bits (94), Expect = 0.004
 Identities = 24/102 (23%), Positives = 44/102 (43%), Gaps = 11/102 (10%)

Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDD 184
           SD  ++E DE+  SD   E  +   ++ +E+   S DG        +D ++   +  D  
Sbjct: 185 SDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDG--------EDVVDYEGERIDKK 236

Query: 185 DDDDETDD---DNDEDNEEESEEEEENEEDDDEERKAEKKNS 223
             ++E  +    N  + E E E   E    ++EE  A+K+  
Sbjct: 237 QGEEEEMEEEVINLFEIEWEEESPSEEVPRNNEESPAKKQKV 278



 Score = 36.5 bits (84), Expect = 0.061
 Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 6/60 (10%)

Query: 177 NMDDDDDDDDDD------ETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
           N+  D+D+D+D+      + D++ +E  EE    EEE +E+   +   E     NE  +E
Sbjct: 155 NIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEE 214



 Score = 33.4 bits (76), Expect = 0.58
 Identities = 17/57 (29%), Positives = 27/57 (47%), Gaps = 5/57 (8%)

Query: 175 EDNMDDDDD-----DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
            DN +D+D+      D D+E ++  +E +  E EE+EE   D   E   E    + E
Sbjct: 158 WDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEE 214



 Score = 30.3 bits (68), Expect = 6.2
 Identities = 14/68 (20%), Positives = 31/68 (45%), Gaps = 2/68 (2%)

Query: 172 DDLEDNMDD--DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQ 229
           D  E+  D+    D   +   + + +E+ E E  ++ E+  D + ER  +K+  + E  +
Sbjct: 186 DGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEE 245

Query: 230 ELTAKQKN 237
           E+    + 
Sbjct: 246 EVINLFEI 253


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 50.4 bits (120), Expect = 5e-06
 Identities = 26/124 (20%), Positives = 52/124 (41%), Gaps = 1/124 (0%)

Query: 102 GESTEKSMMEIDG-IHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQ 160
               E    E +G I     G +  D+  +   E +   +  E  +   +   +   +  
Sbjct: 769 ETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEV 828

Query: 161 DGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
                 + +  ++  +   D+   D    +D  + E+ EEE EEEEE EE+++EE + E+
Sbjct: 829 KDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEE 888

Query: 221 KNSK 224
           +N +
Sbjct: 889 ENEE 892



 Score = 47.3 bits (112), Expect = 5e-05
 Identities = 31/140 (22%), Positives = 58/140 (41%), Gaps = 15/140 (10%)

Query: 102 GESTEKSMMEIDGIHGAMSGAISSDDSDD---ENDED-DTSDDDTESGQGSMKKYKEQVV 157
               E      +  H   + A   +D D+   +  ED +   D+   G+   +   E   
Sbjct: 752 KHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGE 811

Query: 158 SSQDGRKRRKVMFDDDLED-------NMDDDDDDDDDDETDDDNDE----DNEEESEEEE 206
             +   +      D +++D       N ++  +   D++  D        D+EEE EEEE
Sbjct: 812 KDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEE 871

Query: 207 ENEEDDDEERKAEKKNSKNE 226
           E EE+++EE + E++  +NE
Sbjct: 872 EEEEEEEEEEEEEEEEEENE 891



 Score = 43.4 bits (102), Expect = 9e-04
 Identities = 23/99 (23%), Positives = 46/99 (46%), Gaps = 3/99 (3%)

Query: 131 ENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLE-DNMDDDDDDDD--D 187
           +   +   + + E+  G   +++ Q  +  D  + +    + +L  +N  +   D+   D
Sbjct: 794 DEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVD 853

Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
                D  +  EEE EEEEE EE+++EE + E++    E
Sbjct: 854 GGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 39.2 bits (91), Expect = 0.014
 Identities = 45/253 (17%), Positives = 95/253 (37%), Gaps = 27/253 (10%)

Query: 121 GAISSDDSDDENDEDDTSDDDTESG-QGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMD 179
           G  S  +++ E + +   ++++E       K  +E     +      K   + +  ++  
Sbjct: 660 GEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEG 719

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEE-ENEEDDDEERKAEKKNSKNEHVQELTAKQKNP 238
           + + +  +DE + +  E+ EE  +E E E E   + E + ++K +++E   E   K+   
Sbjct: 720 ETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDED 779

Query: 239 KAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSA-LAGLNSKEEEEEEQEDS 297
           +    Q+   GE K        V     ++      H+  S   A     ++E  EQE +
Sbjct: 780 EGEI-QAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELN 838

Query: 298 A-----------------------FSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNW 334
           A                         +E  ++   +E+E  E +  ++ +  EE   L W
Sbjct: 839 AENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEW 898

Query: 335 KSDLAQKAASAFL 347
                ++A   FL
Sbjct: 899 PETRQKQAIYLFL 911



 Score = 38.8 bits (90), Expect = 0.021
 Identities = 22/121 (18%), Positives = 53/121 (43%), Gaps = 5/121 (4%)

Query: 99  QVHGESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVS 158
           ++ G+   +  +E +G   A        D  +   E    D + +   G  +   E    
Sbjct: 790 EMKGDEGAEGKVEHEGETEA-----GEKDEHEGQSETQADDTEVKDETGEQELNAENQGE 844

Query: 159 SQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
           ++   K        D  D+ ++++++++++E +++ +E+ EEE E EE    +  E R+ 
Sbjct: 845 AKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQK 904

Query: 219 E 219
           +
Sbjct: 905 Q 905


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 50.4 bits (120), Expect = 6e-06
 Identities = 30/106 (28%), Positives = 50/106 (47%), Gaps = 14/106 (13%)

Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDD---- 180
           SDDS+ +  EDD SD  ++         K   +  +D +       D   E+ +D+    
Sbjct: 705 SDDSELDFAEDDFSDSTSDDEP------KLDAIDDEDAKSEGSQESDQ--EEGLDEIFYS 756

Query: 181 -DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
            D + D+ D   + ++ED   E E+EEE E  +   ++A+KK  KN
Sbjct: 757 FDGEQDNSDSFAESSEEDESSEEEKEEE-ENKEVSAKRAKKKQRKN 801



 Score = 38.9 bits (90), Expect = 0.017
 Identities = 20/74 (27%), Positives = 36/74 (48%), Gaps = 13/74 (17%)

Query: 161 DGRKRRKVMFDDDLEDNMDDDD-------------DDDDDDETDDDNDEDNEEESEEEEE 207
           DG+ ++      D +D MD+++             DD DD E D   D+ ++  S++E +
Sbjct: 668 DGKGKKSNKASFDSDDEMDENEIWSALVKSRPDVEDDSDDSELDFAEDDFSDSTSDDEPK 727

Query: 208 NEEDDDEERKAEKK 221
            +  DDE+ K+E  
Sbjct: 728 LDAIDDEDAKSEGS 741


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 49.3 bits (118), Expect = 1e-05
 Identities = 36/237 (15%), Positives = 86/237 (36%), Gaps = 44/237 (18%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSS-----------QDGRKRRKVMFDD 172
            S++ D+++DEDD   ++    +  + K KE                Q    R+K   D 
Sbjct: 333 DSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDA 392

Query: 173 DLEDNMDDDDDDDDDDETDDDNDEDNEE------------ESEEEEENEEDDDEERKAEK 220
           ++E+   + + +++ DE +++                   E+E ++  +E+ +E ++ ++
Sbjct: 393 EIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKE 452

Query: 221 KNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPT------VVPQPR-MSDKDLAI 273
            + + E   E  AK +       +   K +++    +                S K    
Sbjct: 453 SDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDS 512

Query: 274 RHKITSALAGLN--------------SKEEEEEEQEDSAFSDESADDNSSDEDETVE 316
           + K +S L                   KE+  +  +D    ++S   +  DE++  +
Sbjct: 513 KKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDD 569



 Score = 46.2 bits (110), Expect = 1e-04
 Identities = 38/269 (14%), Positives = 82/269 (30%), Gaps = 35/269 (13%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
               S+++ DED  S+++ E                 D       M    L    + +DD
Sbjct: 320 GKSVSEEDEDEDSDSEEEDEDDDED------------DDDGENPWMLRKKLGKLKEGEDD 367

Query: 184 DDDDDET------DDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE--LTAKQ 235
           +++            +  +  E ++E EE   E + EE   E++N +             
Sbjct: 368 EENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGP 427

Query: 236 KNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEE-EEEQ 294
           +N +       +K E K    +     +    + +     K+      L  + E+ ++E+
Sbjct: 428 ENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEE--EAKVEKVANKLLKRSEKAQKEE 485

Query: 295 EDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKA------------ 342
           E+    +E+    ++        K     K+  +      K   A               
Sbjct: 486 EEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSID 545

Query: 343 ASAFLERQANIVNLAKYVYGDMEDVSVTM 371
               L  + + + L      D +D  +  
Sbjct: 546 LDDDLIDEEDSIKLDVDDEEDEDDEELPF 574


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 48.8 bits (116), Expect = 1e-05
 Identities = 28/96 (29%), Positives = 47/96 (48%), Gaps = 8/96 (8%)

Query: 135 DDTSDDDTESGQGSMKKY--------KEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDD 186
            +  DDD   G    K+         KE  +         + M ++D ++  DDDDDD+D
Sbjct: 8   SELEDDDWVRGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDED 67

Query: 187 DDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKN 222
           +D+ DDD+DED+E+E +++    +D   +   E  N
Sbjct: 68  EDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDN 103



 Score = 45.0 bits (106), Expect = 2e-04
 Identities = 19/52 (36%), Positives = 37/52 (71%)

Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           ED+ ++DDDDDDD++ DDD+D+D+E++ +E++++    D+    +   + NE
Sbjct: 53  EDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNE 104



 Score = 43.1 bits (101), Expect = 7e-04
 Identities = 27/89 (30%), Positives = 41/89 (46%), Gaps = 27/89 (30%)

Query: 126 DDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDD 185
           ++ DDE D+DD  D+D +                           DDD +D+ DD+D+DD
Sbjct: 52  EEDDDEEDDDDDDDEDED---------------------------DDDDDDDEDDEDEDD 84

Query: 186 DDDETDDDNDEDNEEESEEEEENEEDDDE 214
           DD    DD+  D+  E++ E    + DDE
Sbjct: 85  DDSTLHDDSSADDGNETDNEAGFADSDDE 113



 Score = 41.9 bits (98), Expect = 0.002
 Identities = 17/49 (34%), Positives = 36/49 (73%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
           D++ +D+ DD+D+DDDDD+ D+D++++++++S   +++  DD  E   E
Sbjct: 56  DEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNE 104



 Score = 40.0 bits (93), Expect = 0.008
 Identities = 19/49 (38%), Positives = 32/49 (65%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
           DDD +++ DDDDDDDD+D+ D+D+D+    +    ++  E D+E   A+
Sbjct: 61  DDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNEAGFAD 109



 Score = 39.2 bits (91), Expect = 0.011
 Identities = 26/97 (26%), Positives = 41/97 (42%), Gaps = 29/97 (29%)

Query: 118 AMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDN 177
           AM      ++ DD++D++D  DDD +                           D+D ED 
Sbjct: 49  AMEEEDDDEEDDDDDDDEDEDDDDDDD--------------------------DEDDEDE 82

Query: 178 MDDDDDDDDDDETDDDNDEDNE---EESEEEEENEED 211
            DDD    DD   DD N+ DNE    +S++E ++  +
Sbjct: 83  DDDDSTLHDDSSADDGNETDNEAGFADSDDESDDGSE 119



 Score = 32.3 bits (73), Expect = 1.7
 Identities = 13/40 (32%), Positives = 29/40 (72%)

Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEH 227
           +E DDD ++D++++ E+E+++++DDDE+ + E  +    H
Sbjct: 51  EEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLH 90



 Score = 31.9 bits (72), Expect = 2.5
 Identities = 18/69 (26%), Positives = 31/69 (44%), Gaps = 8/69 (11%)

Query: 269 KDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEE 328
           K+ AIR         L  + EEE  +E+    ++  DD+  DED+  +     D    ++
Sbjct: 34  KENAIRK--------LGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDD 85

Query: 329 DTGLNWKSD 337
           D+ L+  S 
Sbjct: 86  DSTLHDDSS 94


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 48.1 bits (115), Expect = 1e-05
 Identities = 25/88 (28%), Positives = 48/88 (54%), Gaps = 8/88 (9%)

Query: 137 TSDDDTESG----QGSMKKYKEQVVSSQDGRKRR---KVMFDDDLEDNMDDDDDDDDDDE 189
           T+  D + G      +++  KE +       K +   +V+  D+ ED  +  +  ++++E
Sbjct: 233 TTCSDKDKGMEIIGAALEAIKEVIKKKGGDFKVKGEPEVVGGDE-EDLEELLEKAEEEEE 291

Query: 190 TDDDNDEDNEEESEEEEENEEDDDEERK 217
            DD ++ ++E+E +E+EE EEDDDE  K
Sbjct: 292 EDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 38.1 bits (89), Expect = 0.024
 Identities = 10/46 (21%), Positives = 28/46 (60%)

Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
            D++D ++  +   +E+ E++  E E+ +E+D++E + E  +  ++
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 46.4 bits (110), Expect = 1e-05
 Identities = 17/53 (32%), Positives = 34/53 (64%)

Query: 173 DLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
           D +   D++++++D++E ++  D ++EEE  E+EE EE+D+E+    K   K 
Sbjct: 45  DEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKK 97



 Score = 37.5 bits (87), Expect = 0.013
 Identities = 19/65 (29%), Positives = 34/65 (52%), Gaps = 4/65 (6%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEE----EENEEDDDEERKAEKKNSKNEHVQELTAK 234
           D +D   ++++  D+  ED+EEE EE+    EE E+ +DEE   E +  + E  ++    
Sbjct: 32  DKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDL 91

Query: 235 QKNPK 239
           +   K
Sbjct: 92  KDIEK 96


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 47.1 bits (112), Expect = 2e-05
 Identities = 35/136 (25%), Positives = 53/136 (38%), Gaps = 18/136 (13%)

Query: 97  ICQVHGESTEKSMMEIDGIHGAMSGAISSDDS-------DDENDEDD--------TSDDD 141
              ++G S  K     +     +S  I   DS       + E+D  D            D
Sbjct: 150 DGVMNGVSLSKIEWLNEDFPSELSPFIERIDSVNGPGEPEPEDDPKDSLGNGSSTNGLPD 209

Query: 142 TESGQGSM-KKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDET--DDDNDEDN 198
           +   +    ++Y E+  S     +       D      DD++DDD D +   + +  ED 
Sbjct: 210 SSQDKNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDE 269

Query: 199 EEESEEEEENEEDDDE 214
           EEE  EEEE EED+DE
Sbjct: 270 EEEETEEEEEEEDEDE 285



 Score = 32.1 bits (73), Expect = 1.6
 Identities = 26/165 (15%), Positives = 52/165 (31%), Gaps = 22/165 (13%)

Query: 156 VVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEE----D 211
           VVSS D    R    D +  +++D   +     + +  N++   E S   E  +      
Sbjct: 130 VVSSWDPGGERS---DSENGNSLDGVMNGVSLSKIEWLNEDFPSELSPFIERIDSVNGPG 186

Query: 212 DDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDL 271
           + E     K +  N             K  + +   + E                +    
Sbjct: 187 EPEPEDDPKDSLGNGSSTNGLPDSSQDKNKSLEEYYEKE-------------SSDAAASQ 233

Query: 272 AIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVE 316
               K +         ++EE++ +D  + +E       +E+ET E
Sbjct: 234 DDGPKGSDVKN--KKSDDEEDDDQDGDYVEEKELKEDEEEEETEE 276


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 47.9 bits (114), Expect = 3e-05
 Identities = 27/208 (12%), Positives = 65/208 (31%), Gaps = 35/208 (16%)

Query: 40  PNNEGIHRQEFEIEPITFIEC----NNDINSMIDISKVADLVLLLIDASFGFEMEIFEFL 95
                I      +  I F+               +  +     +    S G   +  +  
Sbjct: 4   NALLAIWCILSYLYEIPFVGSLRLGLRSNYLQSTVEDIEGPSAVDERTS-GVLRDEGKHA 62

Query: 96  NICQVHGESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQ 155
           NI           +         ++   S DD DD+  +    DD+ ++G  +       
Sbjct: 63  NI------LYNSILCNQKKHASFLNQRKSLDDDDDDEFDFLYEDDEDDAGNAT------- 109

Query: 156 VVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
                              E + DDD   +  D  +D + + N +++ + ++++  + + 
Sbjct: 110 -----------------SGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQT 152

Query: 216 RKAEKKNSKNEHVQELTAKQKNPKAFTF 243
            +  K++      ++L  ++K     TF
Sbjct: 153 DQGLKQSVNLSSAEKLIEEKKGQTENTF 180



 Score = 46.0 bits (109), Expect = 1e-04
 Identities = 36/169 (21%), Positives = 62/169 (36%), Gaps = 17/169 (10%)

Query: 176 DNMDDDDDDD--DDDETDDDNDEDNEEESEEEEENEEDDDEERKAE--KKNSKNEHVQEL 231
           D+ DDD+ D   +DDE D  N    E  ++++   E  D +E        +  N+  Q+ 
Sbjct: 87  DDDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDD 146

Query: 232 ------TAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLN 285
                     K     +    +  E+K + T+ T       +D + A      +A  G  
Sbjct: 147 SSNSQTDQGLKQSVNLSSAEKLIEEKKGQ-TENTFKFYNFGNDGEEA------AAKDGGK 199

Query: 286 SKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNW 334
           SK  +     DS    +  D  S++ED+         +K      GL +
Sbjct: 200 SKSSDPGPLNDSDGQGDDGDPESAEEDKAASNTRAAYTKATSVFPGLYF 248



 Score = 32.9 bits (75), Expect = 1.4
 Identities = 21/125 (16%), Positives = 37/125 (29%), Gaps = 34/125 (27%)

Query: 118 AMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVS-SQDGRKRRKVMFDDDLED 176
           A SG  S+DD       D   D DT++       + +   S SQ  +  ++ +     E 
Sbjct: 108 ATSGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEK 167

Query: 177 NMDDDDDD---------------------------------DDDDETDDDNDEDNEEESE 203
            +++                                     +D D   DD D ++ EE +
Sbjct: 168 LIEEKKGQTENTFKFYNFGNDGEEAAAKDGGKSKSSDPGPLNDSDGQGDDGDPESAEEDK 227

Query: 204 EEEEN 208
                
Sbjct: 228 AASNT 232


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 46.3 bits (110), Expect = 3e-05
 Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 6/84 (7%)

Query: 161 DGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEE---EEEN---EEDDDE 214
           D  KR+++  D    D  D D   D D   DD +D+D+E+E+     E E    E  +++
Sbjct: 103 DEIKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEK 162

Query: 215 ERKAEKKNSKNEHVQELTAKQKNP 238
           ER+ E+K ++ E  +E      NP
Sbjct: 163 EREEEEKAAEEEKAREEEILTGNP 186



 Score = 38.9 bits (91), Expect = 0.008
 Identities = 20/103 (19%), Positives = 40/103 (38%), Gaps = 15/103 (14%)

Query: 150 KKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
           K   E    S +     +   DDD ED +     ++D   +D D+ + + +    +++++
Sbjct: 77  KLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSDSSSDSDSSDDDSD 136

Query: 210 EDDDE---------------ERKAEKKNSKNEHVQELTAKQKN 237
           +DD E               ER  EK+  + E   E    ++ 
Sbjct: 137 DDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREE 179



 Score = 33.2 bits (76), Expect = 0.67
 Identities = 23/100 (23%), Positives = 34/100 (34%), Gaps = 44/100 (44%)

Query: 127 DSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDD 186
           D+DD +   D+   D                                     DD DDDD 
Sbjct: 118 DADDSDSSSDSDSSD-------------------------------------DDSDDDDS 140

Query: 187 DDET-------DDDNDEDNEEESEEEEENEEDDDEERKAE 219
           +DET       +    E  EE+  EEEE   ++++ R+ E
Sbjct: 141 EDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180



 Score = 31.2 bits (71), Expect = 2.8
 Identities = 10/24 (41%), Positives = 16/24 (66%)

Query: 120 SGAISSDDSDDENDEDDTSDDDTE 143
           S + S  DS D++ +DD S+D+T 
Sbjct: 122 SDSSSDSDSSDDDSDDDDSEDETA 145


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 45.9 bits (109), Expect = 4e-05
 Identities = 20/79 (25%), Positives = 39/79 (49%), Gaps = 11/79 (13%)

Query: 148 SMKKYKEQVVSSQDGRKRRKVMFDDD-----------LEDNMDDDDDDDDDDETDDDNDE 196
            M   K+        + +RKV    +           LE  + + + +D D+E + D +E
Sbjct: 119 VMGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEE 178

Query: 197 DNEEESEEEEENEEDDDEE 215
           + EEE E+E+ +++DDD++
Sbjct: 179 EEEEEEEDEDFDDDDDDDD 197



 Score = 44.4 bits (105), Expect = 1e-04
 Identities = 20/77 (25%), Positives = 35/77 (45%), Gaps = 7/77 (9%)

Query: 134 EDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETD-- 191
             +  +D  E      KK KE      D    +    +++ E+  +D DDDDDDD+ D  
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDDYN 201

Query: 192 -----DDNDEDNEEESE 203
                D+ ++D+ ++ E
Sbjct: 202 AENYFDNGEDDDYDDDE 218


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 45.1 bits (107), Expect = 4e-05
 Identities = 18/51 (35%), Positives = 29/51 (56%), Gaps = 1/51 (1%)

Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEE-ESEEEEENEEDDDEERKAE 219
            +D+     D+DD+  D++E D  +DE +EE +SEEE+      +EE   E
Sbjct: 11  EEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPE 61



 Score = 41.6 bits (98), Expect = 6e-04
 Identities = 19/97 (19%), Positives = 39/97 (40%), Gaps = 26/97 (26%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
            S+   D+ +ED+   ++ E                           D+  ++   D  D
Sbjct: 2   GSESESDDGEEDEELPEEDED--------------------------DESSDEEEVDLPD 35

Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
           D+ D+E+D + ++      EEE + E + + +R+ EK
Sbjct: 36  DEQDEESDSEEEQIFVTRQEEEVDPEAEAEFDREFEK 72



 Score = 31.6 bits (72), Expect = 1.5
 Identities = 10/42 (23%), Positives = 21/42 (50%), Gaps = 1/42 (2%)

Query: 288 EEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEED 329
           +E  +E+E     D+  D+ S  E+E +     ++  +PE +
Sbjct: 23  DESSDEEEVDL-PDDEQDEESDSEEEQIFVTRQEEEVDPEAE 63


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
           Members of this family are bacterial proteins with a
           conserved motif [KR]FYDLN, sometimes flanked by a pair
           of CXXC motifs, followed by a long region of low
           complexity sequence in which roughly half the residues
           are Asp and Glu, including multiple runs of five or more
           acidic residues. The function of members of this family
           is unknown.
          Length = 104

 Score = 43.5 bits (103), Expect = 4e-05
 Identities = 19/44 (43%), Positives = 33/44 (75%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
           +D+ +  +DDDDDDDDDD+  D +D+D + + ++++  E+DDDE
Sbjct: 61  EDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDDE 104



 Score = 42.3 bits (100), Expect = 1e-04
 Identities = 19/45 (42%), Positives = 33/45 (73%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
           ++D +D + DDDDDDDDD+   D D+D+ +  +++++  EDDD+E
Sbjct: 60  EEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDDE 104



 Score = 41.5 bits (98), Expect = 2e-04
 Identities = 14/57 (24%), Positives = 33/57 (57%), Gaps = 3/57 (5%)

Query: 162 GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEEN---EEDDDEE 215
            + R      +D     +D++D+DD    DDD+D+D+++  + ++++   ++DDD+ 
Sbjct: 41  AKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDF 97



 Score = 35.4 bits (82), Expect = 0.024
 Identities = 13/40 (32%), Positives = 26/40 (65%)

Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
                +D+ DDDDDD  D + DD + +D++++  E++++E
Sbjct: 65  DVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDDE 104



 Score = 33.8 bits (78), Expect = 0.089
 Identities = 26/84 (30%), Positives = 37/84 (44%), Gaps = 27/84 (32%)

Query: 118 AMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDN 177
           A +   +  D D+E+++D   DDD +                           DDDL D 
Sbjct: 48  ADAEDAAKKDEDEEDEDDVVLDDDDDDDD------------------------DDDLPD- 82

Query: 178 MDDDDDDDDDDETDDDNDEDNEEE 201
             DDDD D DD+ DDD  ED+++E
Sbjct: 83  -LDDDDVDLDDD-DDDFLEDDDDE 104


>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal.  The family represents
           the N-terminal region of RXT2-like proteins. In S.
           cerevisiae, RXT2 has been demonstrated to be involved in
           conjugation with cellular fusion (mating) and invasive
           growth. A high throughput localisation study has
           localised RXT2 to the nucleus.
          Length = 141

 Score = 43.5 bits (103), Expect = 7e-05
 Identities = 17/47 (36%), Positives = 29/47 (61%)

Query: 152 YKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDN 198
           Y+  V+S +  +       D+D  D  DDD+DD+DD+E D ++D++N
Sbjct: 29  YERDVLSRKRIKFNNPPRIDEDGGDIDDDDEDDEDDEEADAEDDDEN 75



 Score = 35.1 bits (81), Expect = 0.059
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEE 205
           D+D  D DDD+ DD++DE+ + E ++E
Sbjct: 48  DEDGGDIDDDDEDDEDDEEADAEDDDE 74



 Score = 33.9 bits (78), Expect = 0.18
 Identities = 14/45 (31%), Positives = 27/45 (60%)

Query: 165 RRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
            R V+    ++ N     D+D  D  DDD D++++EE++ E+++E
Sbjct: 30  ERDVLSRKRIKFNNPPRIDEDGGDIDDDDEDDEDDEEADAEDDDE 74



 Score = 33.5 bits (77), Expect = 0.21
 Identities = 13/27 (48%), Positives = 20/27 (74%)

Query: 188 DETDDDNDEDNEEESEEEEENEEDDDE 214
           DE   D D+D+E++ ++EE + EDDDE
Sbjct: 48  DEDGGDIDDDDEDDEDDEEADAEDDDE 74



 Score = 33.1 bits (76), Expect = 0.30
 Identities = 11/28 (39%), Positives = 22/28 (78%)

Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEEN 208
           D+D  D D+ D+D+++D E ++E+++EN
Sbjct: 48  DEDGGDIDDDDEDDEDDEEADAEDDDEN 75



 Score = 33.1 bits (76), Expect = 0.31
 Identities = 9/28 (32%), Positives = 21/28 (75%)

Query: 184 DDDDDETDDDNDEDNEEESEEEEENEED 211
           D+D  + DDD+++D ++E  + E+++E+
Sbjct: 48  DEDGGDIDDDDEDDEDDEEADAEDDDEN 75



 Score = 31.2 bits (71), Expect = 1.3
 Identities = 11/47 (23%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPK 239
           D D  + ++ +E++E++E+ D E   ++   K   ++E+ A   +P 
Sbjct: 48  DEDGGDIDDDDEDDEDDEEADAE-DDDENPYKLIRLEEILAPLTHPS 93


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 46.4 bits (110), Expect = 8e-05
 Identities = 42/217 (19%), Positives = 86/217 (39%), Gaps = 13/217 (5%)

Query: 140 DDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNM--DDDDDDDDDDETDDDNDED 197
           DDTE  +   ++ +E+  +S       K         +M   + ++++  +E +++ +E+
Sbjct: 396 DDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEE 455

Query: 198 NEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTK 257
            EEE E EEE  ED++EE + E  N   E ++  +    + +        + E       
Sbjct: 456 EEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGE--------EPEEDAERRN 507

Query: 258 PTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEP 317
             +    RMS+     + + +S       +E  + E  D+    E +D+    E+  +  
Sbjct: 508 SEMAGISRMSEGQ---QPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSS 564

Query: 318 KTHKDSKNPEEDTGLNWKSDLAQKAASAFLERQANIV 354
            T  +      +T ++    L     S  LE  +  V
Sbjct: 565 HTELEGVATPVETKISSSRKLPPPPVSTSLENDSATV 601



 Score = 44.1 bits (104), Expect = 5e-04
 Identities = 17/94 (18%), Positives = 47/94 (50%)

Query: 130 DENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDE 189
           D+ +E++         QG+  +  +   +S    +   +   +  E+   +++++++++E
Sbjct: 396 DDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEE 455

Query: 190 TDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS 223
            +++ + + EE  +EEEE E + D   + E + S
Sbjct: 456 EEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGS 489



 Score = 43.0 bits (101), Expect = 0.001
 Identities = 17/116 (14%), Positives = 51/116 (43%), Gaps = 18/116 (15%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
           S +  ++E+ E++  +++ E  +    + +E                +D+ E+    + +
Sbjct: 436 SQESEEEESVEEEEEEEEEEEEEEQESEEEEG---------------EDEEEEE---EVE 477

Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPK 239
            D+  E + +   + + + EE EE+ E  + E     + S+ +  +  + + ++P+
Sbjct: 478 ADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQ 533



 Score = 38.7 bits (90), Expect = 0.018
 Identities = 19/122 (15%), Positives = 56/122 (45%), Gaps = 9/122 (7%)

Query: 131 ENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDET 190
           E +   TS   ++  + S    +   ++SQ+  +   V  +++ E+  ++++ + +++E 
Sbjct: 408 ERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEG 467

Query: 191 DDDNDEDNEEESEEEEENE--------EDDDEERKAEKKNSKNEHVQELTAKQKNPKAFT 242
           +D+ +E+ E E++   E E         D +E  +  ++ +         ++ + P+  +
Sbjct: 468 EDE-EEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSS 526

Query: 243 FQ 244
            Q
Sbjct: 527 VQ 528



 Score = 30.3 bits (68), Expect = 8.4
 Identities = 19/111 (17%), Positives = 46/111 (41%), Gaps = 11/111 (9%)

Query: 128 SDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDD 187
           S++E  ED+  +++ E+  GS    +E++  S +G         D  E   D +  + + 
Sbjct: 462 SEEEEGEDEEEEEEVEADNGS----EEEMEGSSEGDG-------DGEEPEEDAERRNSEM 510

Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNP 238
                 ++      S  + E+ +++  + ++    S  E   E    +++P
Sbjct: 511 AGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESP 561


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 43.5 bits (103), Expect = 9e-05
 Identities = 16/35 (45%), Positives = 26/35 (74%), Gaps = 1/35 (2%)

Query: 168 VMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEES 202
           V  ++D  D+ ++D++++DD+E DDD DE  EEES
Sbjct: 107 VASEEDESDDDEEDEEEEDDEE-DDDEDESEEEES 140



 Score = 43.1 bits (102), Expect = 1e-04
 Identities = 14/34 (41%), Positives = 25/34 (73%)

Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
           ++  DD++ED EEE +EE+++E++ +EE    KK
Sbjct: 111 EDESDDDEEDEEEEDDEEDDDEDESEEEESPVKK 144



 Score = 42.3 bits (100), Expect = 2e-04
 Identities = 14/30 (46%), Positives = 26/30 (86%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEEN 208
           +D+ DDD++DE ++D++ED++E+  EEEE+
Sbjct: 111 EDESDDDEEDEEEEDDEEDDDEDESEEEES 140



 Score = 41.2 bits (97), Expect = 7e-04
 Identities = 11/31 (35%), Positives = 25/31 (80%)

Query: 177 NMDDDDDDDDDDETDDDNDEDNEEESEEEEE 207
             ++D+ DDD+++ ++++DE++++E E EEE
Sbjct: 108 ASEEDESDDDEEDEEEEDDEEDDDEDESEEE 138



 Score = 40.8 bits (96), Expect = 7e-04
 Identities = 16/33 (48%), Positives = 27/33 (81%), Gaps = 2/33 (6%)

Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEE 206
            E++  DDD++D+++E  DD ++D+E+ESEEEE
Sbjct: 109 SEEDESDDDEEDEEEE--DDEEDDDEDESEEEE 139



 Score = 38.5 bits (90), Expect = 0.005
 Identities = 13/34 (38%), Positives = 24/34 (70%)

Query: 192 DDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
           +D  +D+EE+ EEE++ E+DD++E + E+   K 
Sbjct: 111 EDESDDDEEDEEEEDDEEDDDEDESEEEESPVKK 144



 Score = 38.5 bits (90), Expect = 0.005
 Identities = 14/36 (38%), Positives = 25/36 (69%)

Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERK 217
           +D+ DDDE D++ ++D E++ E+E E EE   ++ K
Sbjct: 111 EDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146



 Score = 36.9 bits (86), Expect = 0.016
 Identities = 12/33 (36%), Positives = 23/33 (69%)

Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
             +D++D+D E+E EE++E ++D+DE  + E  
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESP 141



 Score = 28.8 bits (65), Expect = 8.3
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 112 IDGIHGAMSGAISSDDSDDENDEDDTSDDDTE 143
           I G H   S    SDD +++ +E+D  +DD E
Sbjct: 101 ISGQHLVASEEDESDDDEEDEEEEDDEEDDDE 132


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 43.0 bits (101), Expect = 9e-05
 Identities = 19/48 (39%), Positives = 29/48 (60%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
           D D     D  D+ D+++ETD+  DE  +E   E EE +E++DEE +A
Sbjct: 84  DADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEAEA 131



 Score = 38.0 bits (88), Expect = 0.005
 Identities = 15/61 (24%), Positives = 28/61 (45%), Gaps = 1/61 (1%)

Query: 171 DDDLEDNMDD-DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQ 229
           +   E    D D+D D+  E D  ++ D EEE++E  +   D+ +    E    ++E  +
Sbjct: 71  EAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEAE 130

Query: 230 E 230
            
Sbjct: 131 A 131



 Score = 33.8 bits (77), Expect = 0.18
 Identities = 16/62 (25%), Positives = 30/62 (48%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
            +  +  +  D D+D DE  + +  D  +E EE +E  ++  +E  AE + +  E  +E 
Sbjct: 70  AEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEA 129

Query: 232 TA 233
            A
Sbjct: 130 EA 131



 Score = 33.4 bits (76), Expect = 0.24
 Identities = 11/36 (30%), Positives = 20/36 (55%), Gaps = 1/36 (2%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
              +  ++ E  D  DED +E +E +  +E D++EE
Sbjct: 68  AAAEAAEEAEAADA-DEDADEAAEADAADEADEEEE 102



 Score = 31.5 bits (71), Expect = 0.89
 Identities = 8/45 (17%), Positives = 20/45 (44%)

Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
                     +  +E +  + +++ +E+ E +  +E D+EE   E
Sbjct: 61  GPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDE 105



 Score = 30.7 bits (69), Expect = 1.7
 Identities = 8/48 (16%), Positives = 21/48 (43%)

Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
                +  ++    D +E+++E  E +  D+ + + E   + +E   E
Sbjct: 66  ATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADE 113



 Score = 30.3 bits (68), Expect = 2.7
 Identities = 7/45 (15%), Positives = 20/45 (44%)

Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
               +  ++ +  D  +D ++    + ++E +E EE D+   +  
Sbjct: 67  TAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETA 111


>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D.  SpoVID, the
           stage VI sporulation protein D, is restricted to
           endospore-forming members of the bacteria, all of which
           are found among the Firmicutes. It is widely distributed
           but not quite universal in this group. Between
           well-conserved N-terminal and C-terminal domains is a
           poorly conserved, low-complexity region of variable
           length, rich enough in glutamic acid to cause spurious
           BLAST search results unless a filter is used. The seed
           alignment for this model was trimmed, in effect, by
           choosing member sequences in which these regions are
           relatively short. SpoVID is involved in spore coat
           assembly by the mother cell compartment late in the
           process of sporulation [Cellular processes, Sporulation
           and germination].
          Length = 338

 Score = 45.6 bits (108), Expect = 9e-05
 Identities = 25/120 (20%), Positives = 49/120 (40%), Gaps = 4/120 (3%)

Query: 119 MSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNM 178
           +SG    ++ D E   +D  ++++ S +      +E    +    +R      +      
Sbjct: 133 ISGIQQENNLDAEPAREDEEEEESFSAEFEHPAQEE----TAGEEERTDEPKVEHEAHEQ 188

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNP 238
            +   DDD DE      E  + ESE E   EE++ EE + E +    +  + L  + ++P
Sbjct: 189 HEQPADDDPDEWKISASEPFQLESEVEASPEEENYEEYEDETELEVEDEEKALDEQTEDP 248


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 45.8 bits (108), Expect = 1e-04
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 13/101 (12%)

Query: 111 EIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMF 170
           E D   GA + A  +D  +D+ DE ++ D+  +    S   Y     SS+D         
Sbjct: 436 EFDQDPGAPAHADRADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSED--------- 486

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEED 211
               E +  DD   D+ D   + +D    +  EEEEE E  
Sbjct: 487 ----ESDSADDSWGDESDSGIEHDDGGVGQAIEEEEEEERA 523



 Score = 35.4 bits (81), Expect = 0.18
 Identities = 17/61 (27%), Positives = 27/61 (44%), Gaps = 5/61 (8%)

Query: 171 DDDLEDNMDDDDDD-----DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
           D D+E    D+ D         D  D D D+ +E+ES +E  + EDD +   +    S  
Sbjct: 426 DQDVEAEAHDEFDQDPGAPAHADRADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSE 485

Query: 226 E 226
           +
Sbjct: 486 D 486



 Score = 32.3 bits (73), Expect = 1.7
 Identities = 23/131 (17%), Positives = 47/131 (35%), Gaps = 30/131 (22%)

Query: 118 AMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDN 177
           A + A ++  +D + + +   + D + G             +     R     DD  E  
Sbjct: 415 AAAAAAANAPADQDVEAEAHDEFDQDPG-------------APAHADRADSDEDDMDEQE 461

Query: 178 MDDDDDDDDDD----------ETDDDNDEDNEEESEEEEENEEDDD-------EERKAEK 220
             D+  D +DD           ++D++D  ++   +E +   E DD       EE + E+
Sbjct: 462 SGDERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGGVGQAIEEEEEEE 521

Query: 221 KNSKNEHVQEL 231
           +       + L
Sbjct: 522 RAVLGAVAEML 532



 Score = 31.5 bits (71), Expect = 3.2
 Identities = 24/97 (24%), Positives = 40/97 (41%), Gaps = 13/97 (13%)

Query: 143 ESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLE-----DNMDDDDDDDDDDETDDDNDED 197
           E G+ +         + QD        FD D       D  D D+DD D+ E+ D+   D
Sbjct: 410 EEGRAAAAAAAANAPADQDVEAEAHDEFDQDPGAPAHADRADSDEDDMDEQESGDER-AD 468

Query: 198 NEEESEEEE-------ENEEDDDEERKAEKKNSKNEH 227
            E++S+          E+E D  ++   ++ +S  EH
Sbjct: 469 GEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEH 505


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 45.5 bits (108), Expect = 1e-04
 Identities = 25/160 (15%), Positives = 56/160 (35%), Gaps = 19/160 (11%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
           D   +    D    +DD+ + D + +D ++       N++++D++      ++ +     
Sbjct: 33  DSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDS---- 88

Query: 231 LTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEE 290
                 N   F ++           T    +      D + ++   I + L  LNS   +
Sbjct: 89  -----NNIIDFIYK-------NLPQTNINQLLTKNKYDDNYSLTTLIQN-LFNLNSDISD 135

Query: 291 EEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDT 330
            E+  +S  S    D N + +         + SK  + D 
Sbjct: 136 YEQPRNSEKSTN--DSNKNSDSSIKNDTDTQSSKQDKADN 173



 Score = 37.1 bits (86), Expect = 0.065
 Identities = 21/164 (12%), Positives = 57/164 (34%), Gaps = 30/164 (18%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
             +D+D ++ + D +D+                  S   ++       +  +    +  +
Sbjct: 157 IKNDTDTQSSKQDKADNQKAPS-------SNNTKPSTSNKQ------PNSPKPTQPNQSN 203

Query: 184 DD---DDDETDDDNDEDNE------------EESEEEEENEEDDDEERKAEKKNSKNEHV 228
                DD      + +DN+            + SE+ ++ ++D   + K +K  + N   
Sbjct: 204 SQPASDDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQKDYASQSKKDKTETSNTKN 263

Query: 229 QELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQ--PRMSDKD 270
            +L  + +           + +    +T+ T + +  P +S+ D
Sbjct: 264 PQLPTQDELKHKSKPAQSFENDVNQSNTRSTSLFETGPSLSNND 307



 Score = 32.8 bits (75), Expect = 1.2
 Identities = 45/256 (17%), Positives = 84/256 (32%), Gaps = 48/256 (18%)

Query: 120 SGAISSDDSDDENDEDDTSD---DDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLED 176
           + +  S  S+D+    DTS    D  ++   S +   ++  S+ D          D +  
Sbjct: 38  TTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIYK 97

Query: 177 NMDDDDDDDD----------------------DDETDDDNDEDNEEESEEEEENEEDDDE 214
           N+   + +                        + +  D     N E+S  +     D   
Sbjct: 98  NLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSI 157

Query: 215 ERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIR 274
           +   + ++SK +       K  N KA +  +         S K    P+P   ++  +  
Sbjct: 158 KNDTDTQSSKQD-------KADNQKAPSSNNTKPST----SNKQPNSPKPTQPNQSNS-- 204

Query: 275 HKITSALAGLNSKEEEEEEQEDSAFS---DESADD---NSSDEDETVEPKTHKDS--KNP 326
              +   A   S  ++ +   DSA     D+ ++D      D     +    + S  KNP
Sbjct: 205 QPASDDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQKDYASQSKKDKTETSNTKNP 264

Query: 327 EEDT--GLNWKSDLAQ 340
           +  T   L  KS  AQ
Sbjct: 265 QLPTQDELKHKSKPAQ 280


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 44.1 bits (105), Expect = 1e-04
 Identities = 14/41 (34%), Positives = 22/41 (53%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEED 211
                    + + D++D E DDD+DED +E+ EEE +   D
Sbjct: 36  AAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPD 76



 Score = 42.2 bits (100), Expect = 7e-04
 Identities = 14/45 (31%), Positives = 25/45 (55%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
                     + + D++D  DDD+D+++E+E +EEE +   D EE
Sbjct: 35  AAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79



 Score = 41.8 bits (99), Expect = 0.001
 Identities = 13/53 (24%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 171 DDDLEDNMDD---DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
           D +           + + D+++ +DD+D+D +E+ ++EEE +   D E   E+
Sbjct: 31  DPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEEARER 83



 Score = 38.7 bits (91), Expect = 0.008
 Identities = 11/60 (18%), Positives = 27/60 (45%)

Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTA 233
           ++ N           E++ D ++  +++ ++E+E+E+D++E           E   EL  
Sbjct: 30  IDPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEEARERFAELRE 89



 Score = 36.4 bits (85), Expect = 0.044
 Identities = 12/49 (24%), Positives = 24/49 (48%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
            DL     D +             E +EE+ E++++++ED+DE+ + E 
Sbjct: 23  SDLISGFIDPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 45.8 bits (108), Expect = 2e-04
 Identities = 25/57 (43%), Positives = 38/57 (66%)

Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
           R+ R +  +D    +    DDDD+D++ DDD++ED+EEE EEEEE +  DDE+ + E
Sbjct: 135 RRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191



 Score = 37.7 bits (87), Expect = 0.046
 Identities = 14/50 (28%), Positives = 32/50 (64%), Gaps = 2/50 (4%)

Query: 177 NMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           ++ ++D    D+   DD+DED +E+  +++E +++++EE + E K   +E
Sbjct: 139 HLAEEDMSPRDNFVIDDDDEDEDED--DDDEEDDEEEEEEEEEIKGFDDE 186


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 43.2 bits (102), Expect = 2e-04
 Identities = 29/111 (26%), Positives = 53/111 (47%), Gaps = 21/111 (18%)

Query: 108 SMMEIDGIHGAM--SGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKR 165
            + EID    AM     I + D + +  E+   D+           Y E     +D    
Sbjct: 83  PLDEIDEEIQAMTEKKDIKAKDKEVDAFEEGDEDELD---------YDEDKEEEED---- 129

Query: 166 RKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNE-EESEEEEENEEDDDEE 215
                D+    + ++DD+D+DDDE  +   ED+E +E E+++E+EED++++
Sbjct: 130 -----DEVDSLDDENDDEDEDDDEIVEILIEDDEVDEDEDDDEDEEDEEDK 175



 Score = 40.5 bits (95), Expect = 0.001
 Identities = 23/91 (25%), Positives = 48/91 (52%)

Query: 130 DENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDE 189
           DE DE+  +  + +  +   K+        +D     +   +++ ++    DD++DD+DE
Sbjct: 85  DEIDEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDE 144

Query: 190 TDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
            DD+  E   E+ E +E+ ++D+DEE + +K
Sbjct: 145 DDDEIVEILIEDDEVDEDEDDDEDEEDEEDK 175



 Score = 35.9 bits (83), Expect = 0.051
 Identities = 20/71 (28%), Positives = 36/71 (50%), Gaps = 8/71 (11%)

Query: 164 KRRKVMFDDDLEDNMDDDDD--------DDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
            +    F++  ED +D D+D         D  D+ +DD DED++E  E   E++E D++E
Sbjct: 104 DKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIEDDEVDEDE 163

Query: 216 RKAEKKNSKNE 226
              E +  + +
Sbjct: 164 DDDEDEEDEED 174



 Score = 33.2 bits (76), Expect = 0.42
 Identities = 15/53 (28%), Positives = 29/53 (54%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELT 232
             D + D  E  D+++ D +E+ EEEE++E D  ++   ++    +E V+ L 
Sbjct: 102 AKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILI 154



 Score = 32.8 bits (75), Expect = 0.55
 Identities = 16/74 (21%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 160 QDGRKRRKVMFDDDLEDNMDDD-DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
           +    + K    D  E+  +D+ D D+D +E +DD  +  ++E+++E+E++++  E    
Sbjct: 96  EKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIE 155

Query: 219 EKKNSKNEHVQELT 232
           + +  ++E   E  
Sbjct: 156 DDEVDEDEDDDEDE 169


>gnl|CDD|221603 pfam12496, BNIP2, Bcl2-/adenovirus E1B nineteen kDa-interacting
           protein 2.  This domain family is found in eukaryotes,
           and is typically between 119 and 133 amino acids in
           length. There is a conserved HGGY sequence motif. This
           family is Bcl2-/adenovirus E1B nineteen kDa-interacting
           protein 2. It interacts with pro- and anti- apoptotic
           molecules in the cell.
          Length = 126

 Score = 42.0 bits (99), Expect = 2e-04
 Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 4/83 (4%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
           DDL +  DD D + DD +T DD+D  +   +E E E    DD  +    ++     + + 
Sbjct: 26  DDLLETPDDLDINVDDLDTPDDSDSLDFPGNELEWE----DDTPKAKAAESPPAGSIPQY 81

Query: 232 TAKQKNPKAFTFQSVIKGERKFR 254
           TA+++      +++V  GE++ R
Sbjct: 82  TAEEERRDGRLWRTVRIGEQEHR 104


>gnl|CDD|235030 PRK02315, PRK02315, adaptor protein; Provisional.
          Length = 233

 Score = 43.3 bits (103), Expect = 3e-04
 Identities = 19/62 (30%), Positives = 31/62 (50%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
             DL  N +D  D   D++ +D   ED  +  E+  E+++D D   K EK+  + E   +
Sbjct: 79  KLDLPLNFEDLIDLPSDEDIEDLFPEDFFKLLEQSLEDKDDLDATAKQEKEEKEAEDEVQ 138

Query: 231 LT 232
           LT
Sbjct: 139 LT 140



 Score = 39.8 bits (94), Expect = 0.004
 Identities = 14/63 (22%), Positives = 30/63 (47%), Gaps = 1/63 (1%)

Query: 167 KVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           K+    + ED +D   D+D +D   +D      E+S E++++ +   ++ K EK+     
Sbjct: 79  KLDLPLNFEDLIDLPSDEDIEDLFPED-FFKLLEQSLEDKDDLDATAKQEKEEKEAEDEV 137

Query: 227 HVQ 229
            + 
Sbjct: 138 QLT 140



 Score = 36.0 bits (84), Expect = 0.081
 Identities = 13/51 (25%), Positives = 24/51 (47%)

Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
                +  D+D+ED   +D     +   +D +D D   + E+EE+  ED+ 
Sbjct: 87  EDLIDLPSDEDIEDLFPEDFFKLLEQSLEDKDDLDATAKQEKEEKEAEDEV 137


>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
           factor RRN3.  This family consists of several eukaryotic
           proteins which are homologous to the yeast RRN3 protein.
           RRN3 is one of the RRN genes specifically required for
           the transcription of rDNA by RNA polymerase I (Pol I) in
           Saccharomyces cerevisiae.
          Length = 554

 Score = 44.6 bits (106), Expect = 3e-04
 Identities = 12/57 (21%), Positives = 30/57 (52%), Gaps = 1/57 (1%)

Query: 169 MFDDDLEDNMDDDDDDDDD-DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
           +   D+E   + DD DD++ +    D D+D+E++  + ++++E++ +         K
Sbjct: 211 LLKLDVEIQNELDDIDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIK 267



 Score = 38.1 bits (89), Expect = 0.028
 Identities = 12/50 (24%), Positives = 28/50 (56%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
           DD++D  ++    D+DD+ +DD  + ++++ EE +   E     ++  +K
Sbjct: 223 DDIDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKEVSEK 272



 Score = 35.0 bits (81), Expect = 0.25
 Identities = 16/50 (32%), Positives = 27/50 (54%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
           DD  ++  +    D+DDD+ DD  D D+++E E + E E     +  +EK
Sbjct: 223 DDIDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKEVSEK 272



 Score = 31.1 bits (71), Expect = 3.8
 Identities = 17/68 (25%), Positives = 27/68 (39%), Gaps = 27/68 (39%)

Query: 126 DDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDD 185
           DD DDE +E   +D+D +                           D+D   +MDDDD+++
Sbjct: 223 DDIDDEEEERVLADEDDD---------------------------DEDDMFDMDDDDEEE 255

Query: 186 DDDETDDD 193
            D E +  
Sbjct: 256 SDPEVERT 263



 Score = 31.1 bits (71), Expect = 4.2
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEE 204
           +  D+ +D+ DD  D DDDDE + D + +     +E
Sbjct: 233 VLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKE 268



 Score = 30.7 bits (70), Expect = 5.0
 Identities = 11/38 (28%), Positives = 22/38 (57%)

Query: 160 QDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDED 197
            D  + R +  +DD +++   D DDDD++E+D + +  
Sbjct: 226 DDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERT 263


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 41.1 bits (96), Expect = 3e-04
 Identities = 27/86 (31%), Positives = 49/86 (56%), Gaps = 1/86 (1%)

Query: 151 KYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEE 210
           K K++VV  ++  K      +++ E+   + DD+ +++E  D++DE+ E E EEEE  EE
Sbjct: 17  KEKKEVVEEKENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEE 76

Query: 211 DDDEERKAEKKNSKNEHVQELTAKQK 236
           ++ E     K+ +++E     T KQK
Sbjct: 77  EETEG-ATGKRAAEDEEDDAETKKQK 101


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 44.1 bits (104), Expect = 4e-04
 Identities = 26/107 (24%), Positives = 49/107 (45%), Gaps = 9/107 (8%)

Query: 123 ISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDD 182
           I     +D +      DD     Q +  +    ++ S D  +      DD +E++  D++
Sbjct: 184 IEDKAGEDLDGLAAEIDD-----QQAFARVVRDMLGSMDMAEE---TGDDGIEED-ADEE 234

Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQ 229
           D DDD   ++++ E   EESE  +E+EED+ E    E +  + +  +
Sbjct: 235 DGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAE 281



 Score = 42.9 bits (101), Expect = 0.001
 Identities = 21/113 (18%), Positives = 43/113 (38%), Gaps = 6/113 (5%)

Query: 109 MMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKV 168
           M+    +          +D+D+E+ +DD  D++ +S  G  +          +       
Sbjct: 212 MLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEAT--- 268

Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
             D + E+   D  +  +D E  D++DED E   E+        +   + + K
Sbjct: 269 --DGEGEEGEMDAAEASEDSE-SDESDEDTETPGEDARPATPFTELMEEVDYK 318



 Score = 32.9 bits (75), Expect = 1.2
 Identities = 11/55 (20%), Positives = 24/55 (43%)

Query: 176 DNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
            +M ++  DD  +E  D+ D D+++    E+     ++ E   E +  + E    
Sbjct: 216 MDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDG 270



 Score = 32.5 bits (74), Expect = 1.5
 Identities = 18/121 (14%), Positives = 43/121 (35%), Gaps = 18/121 (14%)

Query: 135 DDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDD----DD--- 187
            D  D   E     + + K    +  D   +   ++ D++ED   +D D      DD   
Sbjct: 145 TDKEDAPLEEAVALLVREKLTGDAPPDSAGKVLDLWRDEIEDKAGEDLDGLAAEIDDQQA 204

Query: 188 -----------DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK 236
                       +  ++  +D  EE  +EE+ ++D  +  +  +   +     + + + +
Sbjct: 205 FARVVRDMLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDE 264

Query: 237 N 237
            
Sbjct: 265 A 265


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 44.1 bits (104), Expect = 4e-04
 Identities = 24/112 (21%), Positives = 45/112 (40%), Gaps = 6/112 (5%)

Query: 120 SGAISSDDSDDEN--DEDDTSDDDTESGQGSMKKYKEQVVSSQ----DGRKRRKVMFDDD 173
             A  S D DDE   + +D S  +  +   S  + +E   S      DG        ++ 
Sbjct: 161 KEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEG 220

Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
            E    + ++D  + E+D+ + E +E  S  + E      ++ K +K +S  
Sbjct: 221 EEAPSINYNEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTF 272



 Score = 38.3 bits (89), Expect = 0.023
 Identities = 21/97 (21%), Positives = 36/97 (37%), Gaps = 3/97 (3%)

Query: 131 ENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRR---KVMFDDDLEDNMDDDDDDDDD 187
              +  T     +  + S  K  E+   S+D  K     +   DD+ E++ D +D    D
Sbjct: 149 GKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYD 208

Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
               D +DE+  EE+     NE+  + E         
Sbjct: 209 GMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEIS 245



 Score = 34.0 bits (78), Expect = 0.48
 Identities = 26/137 (18%), Positives = 50/137 (36%), Gaps = 43/137 (31%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGS--MKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
           S D+S  E   +D SDD+ E    S    +Y   +V S D  +      ++    N ++D
Sbjct: 177 SEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEE-----GEEAPSINYNED 231

Query: 182 DDDDDDDETDDDNDEDN------------------------------------EEESEEE 205
             + + DE+D +  E                                      E+E +++
Sbjct: 232 TSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPSLMGGYFSGSEDEDDDD 291

Query: 206 EENEEDDDEERKAEKKN 222
           E+ + D   ++  ++KN
Sbjct: 292 EDIDPDQVVKKPVKRKN 308



 Score = 33.3 bits (76), Expect = 0.92
 Identities = 12/58 (20%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 171 DDDLEDNMDDDDDD--DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
             + +++ D DD++  + +DE+  +   +++ + EEEE+++ +D  +      +S +E
Sbjct: 160 KKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDE 217



 Score = 30.6 bits (69), Expect = 5.1
 Identities = 21/152 (13%), Positives = 47/152 (30%), Gaps = 28/152 (18%)

Query: 177 NMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK 236
                       +T     ++ +E S++++E E + ++E K+E      E   +   ++ 
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSE---ESAEDDSDDEEEED 198

Query: 237 NPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQED 296
           +               +      +V      + + A                    E   
Sbjct: 199 S-----------DSEDYSQYDGMLVDSSDEEEGEEAP--------------SINYNEDTS 233

Query: 297 SAFSDESADDNSSDEDETVEPKTHKDSKNPEE 328
            + SDES  + S     +   ++   SK P+E
Sbjct: 234 ESESDESDSEISESRSVSDSEESSPPSKKPKE 265


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 44.0 bits (104), Expect = 4e-04
 Identities = 19/66 (28%), Positives = 36/66 (54%), Gaps = 5/66 (7%)

Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK----- 236
           D+++++++ +   +E+   + EEE + EE+ +E++K  KK  +     EL  K K     
Sbjct: 37  DEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTKPIWTR 96

Query: 237 NPKAFT 242
           NPK  T
Sbjct: 97  NPKDVT 102



 Score = 33.6 bits (77), Expect = 0.76
 Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 6/51 (11%)

Query: 187 DDETDDDNDEDNEEESEEEEEN-----EEDDDEERKAEKKNSKNEHVQELT 232
           + E  D+ +E+ +EE +EEEE      EE D+EE K EKK  K + V+E T
Sbjct: 32  EKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKK-KKTKKVKETT 81



 Score = 30.5 bits (69), Expect = 6.2
 Identities = 15/63 (23%), Positives = 29/63 (46%)

Query: 196 EDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRS 255
           E  +E  +EEEE E+++ +E + +  + + E  +E   ++K  K    +           
Sbjct: 30  EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNK 89

Query: 256 TKP 258
           TKP
Sbjct: 90  TKP 92


>gnl|CDD|214472 smart00017, OSTEO, Osteopontin.  Osteopontin is an acidic
           phosphorylated glycoprotein of about 40 Kd which is
           abundant in the mineral matrix of bones and which binds
           tightly to hydroxyapatite. It is suggested that
           osteopontin might function as a cell attachment factor
           and could play a key role in the adhesion of osteoclasts
           to the mineral matrix of bone.
          Length = 287

 Score = 43.4 bits (102), Expect = 4e-04
 Identities = 19/41 (46%), Positives = 29/41 (70%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
           DDL+D+ DDD  DD D+++DD  D D+ +ES+E   ++E D
Sbjct: 70  DDLDDDDDDDHVDDRDNDSDDAEDSDDSDESDESHHSDESD 110



 Score = 37.3 bits (86), Expect = 0.040
 Identities = 17/35 (48%), Positives = 24/35 (68%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEE 205
           DDD +D++DD D+D DD E  DD+DE +E    +E
Sbjct: 74  DDDDDDHVDDRDNDSDDAEDSDDSDESDESHHSDE 108



 Score = 35.7 bits (82), Expect = 0.100
 Identities = 16/42 (38%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
           DDD +D+   DD D+D D+ +D +D D  +ES   +E++  D
Sbjct: 73  DDDDDDD-HVDDRDNDSDDAEDSDDSDESDESHHSDESDVTD 113


>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein.  This family consists of several
           eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
           to suppress GCR2 and is highly expressed in the muscle
           and heart. The function of this family is unknown
           although it has been speculated that SGT1 may be
           functionally analogous to the Gcr2p protein of
           Saccharomyces cerevisiae which is known to be a
           regulatory factor of glycolytic gene expression.
          Length = 557

 Score = 43.6 bits (103), Expect = 6e-04
 Identities = 30/114 (26%), Positives = 53/114 (46%), Gaps = 24/114 (21%)

Query: 107 KSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDD-----DTESGQGSMKKYKEQVVSSQD 161
            SM E      +  GA  +DD D+++DE D S+D     D +     +K           
Sbjct: 419 SSMEEFLNKVSSFEGAEFADDEDEDDDEPDDSEDKDVSFDEDEFFEFLKNM--------- 469

Query: 162 GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
                  +    L+D+  D+D  DD D+ D+D+DED++E+ +   ++  ++ EE
Sbjct: 470 -------L---GLKDDEIDNDLPDDSDDADEDDDEDDDEDEDSSSDSTLEELEE 513



 Score = 40.9 bits (96), Expect = 0.005
 Identities = 27/134 (20%), Positives = 56/134 (41%), Gaps = 11/134 (8%)

Query: 90  EIFEFLNICQVHGESTEKSMM-EIDGIHGAMSGAISSDDSDDENDEDD----TSDDDTES 144
           E  E L   + +  ST  S   EI  +       +  ++ D   ++DD     S D+ +S
Sbjct: 330 EYKELLEKAEAYFSSTASSPGEEILDLLQLDLEELRDEEQDLPPEDDDSWLNISPDELDS 389

Query: 145 ----GQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEE 200
                QG  K  K     + +     +      +E+ ++     +  +  DD++++D+E 
Sbjct: 390 ELQERQGDKKDLKSNKEDANEVDDLEE--VVSSMEEFLNKVSSFEGAEFADDEDEDDDEP 447

Query: 201 ESEEEEENEEDDDE 214
           +  E+++   D+DE
Sbjct: 448 DDSEDKDVSFDEDE 461



 Score = 40.1 bits (94), Expect = 0.007
 Identities = 19/73 (26%), Positives = 28/73 (38%), Gaps = 6/73 (8%)

Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDD 184
            DD DD+ DED +SD   E  +  M +   ++  +                      +DD
Sbjct: 491 DDDEDDDEDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADIS------NSGSSGAEDD 544

Query: 185 DDDDETDDDNDED 197
           DDD E  +  D D
Sbjct: 545 DDDIEGVEPVDID 557



 Score = 34.3 bits (79), Expect = 0.39
 Identities = 33/204 (16%), Positives = 71/204 (34%), Gaps = 22/204 (10%)

Query: 127 DSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDD--DLEDNMDDDDDD 184
           +     D  + S++  E      + Y     SS  G +   ++  D  +L D   D   +
Sbjct: 317 EKGYFKDNLEGSEEYKE-LLEKAEAYFSSTASS-PGEEILDLLQLDLEELRDEEQDLPPE 374

Query: 185 DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQ 244
           DDD   +   DE + E    E + ++ D +  K +     +   +E+ +  +  +     
Sbjct: 375 DDDSWLNISPDELDSELQ--ERQGDKKDLKSNKEDANEVDDL--EEVVSSME--EFLNKV 428

Query: 245 SVIKGERKFRSTKPTVVPQPRMSDKDLAI---------RHKITSA---LAGLNSKEEEEE 292
           S  +G                  DKD++          ++ +      +      + ++ 
Sbjct: 429 SSFEGAEFADDEDEDDDEPDDSEDKDVSFDEDEFFEFLKNMLGLKDDEIDNDLPDDSDDA 488

Query: 293 EQEDSAFSDESADDNSSDEDETVE 316
           +++D    DE  D +S    E +E
Sbjct: 489 DEDDDEDDDEDEDSSSDSTLEELE 512


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
           family is conserved in fungi but the function is not
           known.
          Length = 509

 Score = 43.4 bits (102), Expect = 6e-04
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 11/84 (13%)

Query: 138 SDDDTESGQGSM---KK---YKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETD 191
           SD+  +S    M   KK   +KEQVV+    R   +   DDD ED ++ ++DD D+   +
Sbjct: 250 SDELGKSLPSLMSPRKKTASFKEQVVT----RTFPERTSDDD-EDAIETEEDDVDESAIE 304

Query: 192 DDNDEDNEEESEEEEENEEDDDEE 215
           DD+D+ + E+S EE      D++ 
Sbjct: 305 DDDDDSDWEDSVEESGRSSVDEKT 328



 Score = 34.1 bits (78), Expect = 0.45
 Identities = 23/119 (19%), Positives = 46/119 (38%), Gaps = 14/119 (11%)

Query: 118 AMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGR--------KRRKVM 169
            + G+   DD D   D   + D    S      K  +   S + G+        +++   
Sbjct: 210 TLGGSSGDDDEDSFEDRMSSQDPKRSSLPKPKPKMFQLGGSDELGKSLPSLMSPRKKTAS 269

Query: 170 FDD-----DLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS 223
           F +        +   DDD+D  + E +DD DE   E+ +++ + E+  +E  ++     
Sbjct: 270 FKEQVVTRTFPERTSDDDEDAIETE-EDDVDESAIEDDDDDSDWEDSVEESGRSSVDEK 327


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 42.4 bits (100), Expect = 0.001
 Identities = 16/71 (22%), Positives = 33/71 (46%), Gaps = 5/71 (7%)

Query: 160 QDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
           +D R+ R    D +     + D+D+D+++E   D  E+ E E  EEE ++  +D   ++ 
Sbjct: 355 RDKRRARLDPIDFE-----EVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESS 409

Query: 220 KKNSKNEHVQE 230
                +   + 
Sbjct: 410 SDVGSDSESKA 420



 Score = 37.0 bits (86), Expect = 0.061
 Identities = 17/65 (26%), Positives = 34/65 (52%), Gaps = 2/65 (3%)

Query: 176 DNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQ 235
           D +D ++ D+D+DE  ++    +E E EE E++EE+  + R+     S ++   +  +K 
Sbjct: 363 DPIDFEEVDEDEDE--EEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKA 420

Query: 236 KNPKA 240
               A
Sbjct: 421 DKESA 425


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 43.1 bits (101), Expect = 0.001
 Identities = 16/45 (35%), Positives = 33/45 (73%), Gaps = 1/45 (2%)

Query: 175 EDNMDDDDDDDDDDETDDD-NDEDNEEESEEEEENEEDDDEERKA 218
           +D  D+ D+D++ DE+ +D +++++E +S +EE+ E+ D+ E KA
Sbjct: 946 DDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESKA 990



 Score = 36.1 bits (83), Expect = 0.12
 Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 25/99 (25%)

Query: 121 GAISSDDSDDENDE--DDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNM 178
              S D+SD+  +E  +  +  D ES +                        D+D E + 
Sbjct: 924 MVGSDDESDESEEEVSEYEASSDDESDET-----------------------DEDEESDE 960

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERK 217
             +D  +D+ E D  ++ED E+  E E +   D    ++
Sbjct: 961 SSEDLSEDESENDSSDEEDGEDWDELESKAAYDSRPGKR 999



 Score = 31.9 bits (72), Expect = 2.5
 Identities = 25/73 (34%), Positives = 36/73 (49%), Gaps = 11/73 (15%)

Query: 286 SKEEEEEEQEDSA----FSDESADDNSSDE--DETVEPKTHKDSKNPEEDTGLNWKSDLA 339
           S E EEE  E  A     SDE+ +D  SDE  ++  E ++  DS + E+  G +W  +L 
Sbjct: 931 SDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEED--GEDWD-ELE 987

Query: 340 QKAASAFLERQAN 352
            KA  A+  R   
Sbjct: 988 SKA--AYDSRPGK 998



 Score = 31.5 bits (71), Expect = 3.1
 Identities = 12/34 (35%), Positives = 23/34 (67%)

Query: 186 DDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
            DDE+D+  +E +E E+  ++E++E D++E   E
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDE 960


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 42.5 bits (101), Expect = 0.001
 Identities = 16/69 (23%), Positives = 33/69 (47%), Gaps = 8/69 (11%)

Query: 152 YKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEED 211
           + +++ +     +RR      +L D   D + ++D      + +E +++E EEEEE+E D
Sbjct: 155 WYDRLEN----GERRL----RELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDEND 206

Query: 212 DDEERKAEK 220
           D       +
Sbjct: 207 DSLAADESE 215



 Score = 34.8 bits (81), Expect = 0.28
 Identities = 11/40 (27%), Positives = 20/40 (50%)

Query: 187 DDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           D   D + +ED      E EE ++D+DEE + ++ +    
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLA 210



 Score = 33.6 bits (78), Expect = 0.79
 Identities = 13/66 (19%), Positives = 32/66 (48%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
           ++  +D  +++++D++DD    D  E  E+  E+ +   +   + RKA++K  +    Q 
Sbjct: 190 EELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQH 249

Query: 231 LTAKQK 236
               + 
Sbjct: 250 KKYAKL 255



 Score = 32.5 bits (75), Expect = 1.6
 Identities = 13/39 (33%), Positives = 20/39 (51%)

Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
           +E+  D DD    ++E D   DED EEE+     + E +
Sbjct: 60  VEEAPDADDLLLAENEADAQTDEDAEEEAAAALSSVESE 98



 Score = 31.3 bits (72), Expect = 3.6
 Identities = 11/38 (28%), Positives = 20/38 (52%)

Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           E   D D+    E+E + + +ED +EE  A   + ++E
Sbjct: 61  EEAPDADDLLLAENEADAQTDEDAEEEAAAALSSVESE 98


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 38.9 bits (91), Expect = 0.002
 Identities = 27/106 (25%), Positives = 42/106 (39%), Gaps = 15/106 (14%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
           S    DD ++E+D   DD          Y    + +   + R++  + DD        DD
Sbjct: 8   SRRSRDDYDEEEDEDMDDFIEDDDEEDDYDRDEIWAMFNKGRKRYAYRDD--------DD 59

Query: 184 DDDDDETDDDNDEDNEEESE-------EEEENEEDDDEERKAEKKN 222
           DDDD E    + ++ E  S         EEE  E ++E  K  +K 
Sbjct: 60  DDDDMEAGGADIQEEERRSARLARLEDREEERLEKEEEREKRARKR 105



 Score = 29.7 bits (67), Expect = 3.0
 Identities = 23/98 (23%), Positives = 42/98 (42%), Gaps = 35/98 (35%)

Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDE-------------------TDDDNDED-----N 198
            +R +  +D++ +++MDD  +DDD+++                     DD+D+D      
Sbjct: 8   SRRSRDDYDEEEDEDMDDFIEDDDEEDDYDRDEIWAMFNKGRKRYAYRDDDDDDDDMEAG 67

Query: 199 EEESEEEEE------NEEDDDEER-----KAEKKNSKN 225
             + +EEE         ED +EER     + EK+  K 
Sbjct: 68  GADIQEEERRSARLARLEDREEERLEKEEEREKRARKR 105


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 42.2 bits (99), Expect = 0.002
 Identities = 19/76 (25%), Positives = 37/76 (48%), Gaps = 5/76 (6%)

Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD-----DEERK 217
           R  R+++   +L + M DD + +D+++ DDD   +NE+E + E E E  +     + E  
Sbjct: 188 RVVREMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEAT 247

Query: 218 AEKKNSKNEHVQELTA 233
             +  S  E + +   
Sbjct: 248 DRESESGEEEMVQSDQ 263



 Score = 37.6 bits (87), Expect = 0.038
 Identities = 19/113 (16%), Positives = 46/113 (40%)

Query: 123 ISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDD 182
           ++ +  DD   ED+   DD +  +   ++  E     Q+G   ++    D   ++ +++ 
Sbjct: 199 LAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEM 258

Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQ 235
              D D+  D++D+D+E   E             + + K       + + A++
Sbjct: 259 VQSDQDDLPDESDDDSETPGEGARPARPFTSTGGEPDYKVFTTAFDETVDAEE 311



 Score = 32.6 bits (74), Expect = 1.3
 Identities = 19/125 (15%), Positives = 47/125 (37%), Gaps = 28/125 (22%)

Query: 119 MSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNM 178
           M     S+D +D +D+  T ++  E G+G                           E   
Sbjct: 203 MGDDTESEDEEDGDDDQPTENEQEEQGEG---------------------------EGEG 235

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENE-EDDDEERKAEKKNSKNEHVQELTAKQKN 237
            +     + + TD +++   EE  + ++++  ++ D++ +   + ++       T  + +
Sbjct: 236 QEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDDSETPGEGARPARPFTSTGGEPD 295

Query: 238 PKAFT 242
            K FT
Sbjct: 296 YKVFT 300


>gnl|CDD|206749 cd01856, YlqF, Circularly permuted YlqF GTPase.  Proteins of the
           YlqF family contain all sequence motifs typical of the
           vast class of P-loop-containing GTPases, but show a
           circular permutation, with a G4-G1-G3 pattern of motifs
           as opposed to the regular G1-G3-G4 pattern seen in most
           GTPases. The YlqF subfamily is represented in all
           eukaryotes as well as a phylogenetically diverse array
           of bacteria (including gram-positive bacteria,
           proteobacteria, Synechocystis, Borrelia, and
           Thermotoga).
          Length = 171

 Score = 39.8 bits (94), Expect = 0.002
 Identities = 19/72 (26%), Positives = 29/72 (40%), Gaps = 1/72 (1%)

Query: 393 QKNPKAFTFQSVIKGERKFRRKEDIQAKKHHVPQVDRTPLEPPPIVVAVVGPPQVGKSTL 452
           +   +   F +   G+   +  +  +       ++    L P P+   VVG P VGKSTL
Sbjct: 72  KSQGEPVLFVNAKNGKGVKKLLKKAKKLLKENEKLKAKGLLPRPLRAMVVGIPNVGKSTL 131

Query: 453 IRCLI-KNFTKT 463
           I  L  K   K 
Sbjct: 132 INRLRGKKVAKV 143


>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
           N-terminal domain.  Autophagocytosis is a
           starvation-induced process responsible for transport of
           cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
           ubiquitin like modifier that is topologically similar to
           the canonical E2 enzyme. It catalyzes the conjugation of
           Atg8 and phosphatidylethanolamine.
          Length = 146

 Score = 39.6 bits (93), Expect = 0.002
 Identities = 14/52 (26%), Positives = 28/52 (53%)

Query: 164 KRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
           +R K M   D  + + +D+D+DD   T   N +  +++  +EE+  E  D++
Sbjct: 77  RRAKQMEYGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDD 128



 Score = 33.8 bits (78), Expect = 0.20
 Identities = 10/45 (22%), Positives = 23/45 (51%)

Query: 168 VMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
           V    + +   DD  D++D  E  DD+D+  +    +E+++++  
Sbjct: 102 VTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADEDDDDDIP 146



 Score = 33.1 bits (76), Expect = 0.34
 Identities = 11/55 (20%), Positives = 27/55 (49%), Gaps = 8/55 (14%)

Query: 167 KVMFDDDLED--------NMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
           +++ D+D +D             DD  D+++  +  D+D++     + + ++DDD
Sbjct: 90  EIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADEDDDDD 144


>gnl|CDD|214640 smart00382, AAA, ATPases associated with a variety of cellular
           activities.  AAA - ATPases associated with a variety of
           cellular activities. This profile/alignment only detects
           a fraction of this vast family. The poorly conserved
           N-terminal helix is missing from the alignment.
          Length = 148

 Score = 39.7 bits (92), Expect = 0.002
 Identities = 14/34 (41%), Positives = 18/34 (52%)

Query: 435 PPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVI 468
           P  V+ +VGPP  GK+TL R L +        VI
Sbjct: 1   PGEVILIVGPPGSGKTTLARALARELGPPGGGVI 34


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 41.0 bits (96), Expect = 0.003
 Identities = 13/65 (20%), Positives = 31/65 (47%)

Query: 162 GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
            R  R ++   D+ + + D+ +  D ++ +D++D   +E+ ++ EE E    +    +  
Sbjct: 196 ARVVRDMLSSMDMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSD 255

Query: 222 NSKNE 226
            S  E
Sbjct: 256 ASSEE 260



 Score = 39.4 bits (92), Expect = 0.007
 Identities = 13/60 (21%), Positives = 29/60 (48%), Gaps = 1/60 (1%)

Query: 172 DDLEDNMDDDDDD-DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
            D EDN D+DD   D+DD+  ++ +  + +   E+ +   ++ E  + E   +  +   +
Sbjct: 219 ADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPD 278



 Score = 39.4 bits (92), Expect = 0.008
 Identities = 16/59 (27%), Positives = 29/59 (49%)

Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTA 233
           E    D +D++D+D+  +D D+D  EE E    +   +D +  +E+  S      E +A
Sbjct: 215 EPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASA 273



 Score = 39.0 bits (91), Expect = 0.011
 Identities = 15/64 (23%), Positives = 32/64 (50%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
           D  E+  D+ +  D +D  D+D+ +++E++ + EEE     D   +    +S+     E+
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEM 266

Query: 232 TAKQ 235
            A +
Sbjct: 267 EAAE 270



 Score = 38.6 bits (90), Expect = 0.014
 Identities = 13/53 (24%), Positives = 27/53 (50%)

Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
           + D+    + +D++D+DD  E +DD+  + EE    +  +E+ D    + E  
Sbjct: 212 LGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESG 264



 Score = 34.4 bits (79), Expect = 0.28
 Identities = 17/98 (17%), Positives = 30/98 (30%), Gaps = 26/98 (26%)

Query: 118 AMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDN 177
           +M  A    D  +  D +D  D+D                                 +D 
Sbjct: 205 SMDMAEELGDEPESADSEDNEDEDDPKEDED--------------------------DDQ 238

Query: 178 MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
            ++++    D  ++D +    E ES E E  E   D+ 
Sbjct: 239 GEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDT 276



 Score = 33.6 bits (77), Expect = 0.56
 Identities = 16/92 (17%), Positives = 35/92 (38%), Gaps = 27/92 (29%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
           S+D  D+E+++D   D+D + G+                            E++   D  
Sbjct: 218 SADSEDNEDEDDPKEDEDDDQGEE---------------------------EESGSSDSL 250

Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
            +D D + ++ +    E +E   ++  D D+ 
Sbjct: 251 SEDSDASSEEMESGEMEAAEASADDTPDSDDA 282



 Score = 30.6 bits (69), Expect = 4.3
 Identities = 14/71 (19%), Positives = 32/71 (45%), Gaps = 12/71 (16%)

Query: 172 DDLEDNMDD------------DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
           D L   +DD               D  ++  D+    D+E+  +E++  E++DD++ + E
Sbjct: 183 DRLAACVDDQQAFARVVRDMLSSMDMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEE 242

Query: 220 KKNSKNEHVQE 230
           +  S +   ++
Sbjct: 243 ESGSSDSLSED 253


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 37.5 bits (87), Expect = 0.003
 Identities = 19/57 (33%), Positives = 37/57 (64%)

Query: 149 MKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEE 205
           M+   E ++  +     R VM + +  ++ DDD+DDDDDD+ DD +D+D++++ ++E
Sbjct: 24  MEARTEALIEEEKVEIARSVMIEGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80



 Score = 34.8 bits (80), Expect = 0.026
 Identities = 12/33 (36%), Positives = 29/33 (87%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
           ++ +DDDDDE DDD+D+ ++++ ++++++E+D+
Sbjct: 48  EEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80



 Score = 32.1 bits (73), Expect = 0.21
 Identities = 12/33 (36%), Positives = 29/33 (87%)

Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
           ++ +DDDD+ DDD+D+D +++ +++++++EDD+
Sbjct: 48  EEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80



 Score = 31.7 bits (72), Expect = 0.32
 Identities = 10/33 (30%), Positives = 30/33 (90%)

Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
           ++ +DDD+ +DD+D+D++++ +++++++++DDE
Sbjct: 48  EEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80



 Score = 28.6 bits (64), Expect = 4.4
 Identities = 9/31 (29%), Positives = 27/31 (87%)

Query: 185 DDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
           ++ ++ DDD D+D++++ ++++++++DDDE+
Sbjct: 48  EEPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78



 Score = 28.2 bits (63), Expect = 5.3
 Identities = 7/32 (21%), Positives = 29/32 (90%)

Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
           ++ +D+ DD++D+D++++ ++++++++DD+++
Sbjct: 48  EEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDD 79



 Score = 27.8 bits (62), Expect = 7.3
 Identities = 7/30 (23%), Positives = 26/30 (86%)

Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERK 217
           +E +DD+D++++++ +++++ ++DDD++ +
Sbjct: 48  EEPEDDDDDEDDDDDDDKDDKDDDDDDDDE 77


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 41.3 bits (97), Expect = 0.003
 Identities = 38/179 (21%), Positives = 68/179 (37%), Gaps = 17/179 (9%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE---------------E 215
             + + N ++D+++ +       +D D++E   E+E+ +E+DDE                
Sbjct: 4   QVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKA 63

Query: 216 RKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRH 275
           R  E K  K + +Q++  +Q         +  KG  K+   +  +       D+  +   
Sbjct: 64  RLKELKKQKKQEIQKILEQQNAAIDADMNNKGKGRLKYLLQQTEIFAHFAKGDQ--SASA 121

Query: 276 KITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNW 334
           K         SK  EEEE E+    +E     S      V+P   K      +  GLNW
Sbjct: 122 KKAKGRGRHASKLTEEEEDEEYLKEEEDGLGGSGGTRLLVQPSCIKGKMRDYQLAGLNW 180


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 39.0 bits (91), Expect = 0.004
 Identities = 14/52 (26%), Positives = 26/52 (50%)

Query: 185 DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK 236
            DD+E + + +  +E E E+  E + +  E++K E    K E  +  T  +K
Sbjct: 94  SDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKK 145



 Score = 35.1 bits (81), Expect = 0.075
 Identities = 19/50 (38%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query: 190 TDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPK 239
           + DD +E+ EEES +E E E  D  E K E K  K   V +   +++ PK
Sbjct: 93  SSDDEEEETEEESTDETEQE--DPPETKTESKEKKKREVPKPKTEKEKPK 140



 Score = 30.1 bits (68), Expect = 4.2
 Identities = 13/42 (30%), Positives = 20/42 (47%)

Query: 199 EEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKA 240
           +EE E EEE+ ++ ++E   E K    E  +    K K  K 
Sbjct: 96  DEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKE 137



 Score = 29.3 bits (66), Expect = 7.3
 Identities = 12/65 (18%), Positives = 25/65 (38%), Gaps = 3/65 (4%)

Query: 202 SEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVV 261
           S  ++E EE ++E     ++    E   E   K+K       +   + E+     K    
Sbjct: 92  SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVP---KPKTEKEKPKTEPKKPKP 148

Query: 262 PQPRM 266
            +P++
Sbjct: 149 SKPKL 153


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
           N-terminal.  This is the very acidic N-terminal region
           of the early transcription elongation factor Spt5. The
           Spt5-Spt4 complex regulates early transcription
           elongation by RNA polymerase II and has an imputed role
           in pre-mRNA processing via its physical association with
           mRNA capping enzymes. The actual function of this
           N-terminal domain is not known although it is
           dispensable for binding to Spt4.
          Length = 92

 Score = 37.4 bits (87), Expect = 0.004
 Identities = 12/67 (17%), Positives = 37/67 (55%)

Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQ 229
            D + E + +++++++++D+ +D +DED   +  E E++      +R+ EK+  ++    
Sbjct: 2   LDTEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEEL 61

Query: 230 ELTAKQK 236
               +++
Sbjct: 62  AEYLRKR 68



 Score = 28.6 bits (64), Expect = 5.3
 Identities = 8/56 (14%), Positives = 32/56 (57%)

Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKN 237
           D + + D+ +++ +E+ ++  +  +E+E  D+ E + ++++ + +  +E   ++  
Sbjct: 3   DTEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDA 58



 Score = 27.8 bits (62), Expect = 9.1
 Identities = 14/35 (40%), Positives = 20/35 (57%)

Query: 288 EEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKD 322
           EEEEEE+E+    D S +D   DE E  + + H+ 
Sbjct: 11  EEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRR 45


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 39.7 bits (93), Expect = 0.004
 Identities = 14/55 (25%), Positives = 33/55 (60%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           ++ E++   ++D + ++   ++  ED+EE +E + E  +D++ + K E K  +NE
Sbjct: 22  ENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKKLENE 76



 Score = 38.6 bits (90), Expect = 0.009
 Identities = 17/68 (25%), Positives = 30/68 (44%), Gaps = 3/68 (4%)

Query: 176 DNMDDDDDDDDDDETDDDNDEDNEEES---EEEEENEEDDDEERKAEKKNSKNEHVQELT 232
              +  D   ++ E D   + +N+EE    EE+ E EE + EE   + + S    ++EL 
Sbjct: 1   MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60

Query: 233 AKQKNPKA 240
            +    K 
Sbjct: 61  DENNKLKE 68



 Score = 38.2 bits (89), Expect = 0.014
 Identities = 15/65 (23%), Positives = 32/65 (49%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
            + +D   ++ ++D   E ++  ++  +EE  E EE E+++  E   E    K E +++ 
Sbjct: 3   KECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDE 62

Query: 232 TAKQK 236
             K K
Sbjct: 63  NNKLK 67



 Score = 37.8 bits (88), Expect = 0.018
 Identities = 15/56 (26%), Positives = 27/56 (48%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           ++D     ++ ++D   +E  +  + + EE  E+ EE+ E   EE K E    K E
Sbjct: 14  EEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEE 69



 Score = 36.7 bits (85), Expect = 0.045
 Identities = 11/71 (15%), Positives = 31/71 (43%)

Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHV 228
             +     + + ++D   ++E  +++    E+   EE E EE  ++  ++ +   +    
Sbjct: 2   EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKD 61

Query: 229 QELTAKQKNPK 239
           +    K++N K
Sbjct: 62  ENNKLKEENKK 72


>gnl|CDD|221970 pfam13191, AAA_16, AAA ATPase domain.  This family of domains
           contain a P-loop motif that is characteristic of the AAA
           superfamily.
          Length = 154

 Score = 38.7 bits (90), Expect = 0.005
 Identities = 12/52 (23%), Positives = 23/52 (44%), Gaps = 4/52 (7%)

Query: 411 FRRKEDIQAKKHHVPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCLIKNFTK 462
             R+E+++     +    R      P  V + GP   GK++L+R L++    
Sbjct: 3   VGREEELERLLDAL----RRARSGGPPSVLLTGPSGTGKTSLLRELLEGLLV 50


>gnl|CDD|216161 pfam00865, Osteopontin, Osteopontin. 
          Length = 293

 Score = 40.1 bits (93), Expect = 0.005
 Identities = 36/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%)

Query: 176 DNMDDDDDDDDDD----------ETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
           ++MDDDDDDDDDD          E+DD +  D+ + S+E   ++E D+       +    
Sbjct: 64  EHMDDDDDDDDDDHVDSQDSNSNESDDADHTDDSDHSDESHHSDESDESVTDFPTETPAT 123

Query: 226 EHVQELTAKQKNPK------AFTFQSVIKGERKFRSTKPTVVPQ---PRMSDKDLAIRHK 276
           +    +              A+  +S  K  R+  +  P    +     +  ++L    K
Sbjct: 124 DVFTPVVPTVDTNDGRGDSVAYGLRSKSKKFRRSDAQYPDATEEDLTSHVESEELDGAPK 183

Query: 277 ITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTG 331
                  LN   + +  ++DS        + S  +D++VE  + + SK  +    
Sbjct: 184 AIPVAQRLNVPSDWDSREKDS-------HETSQLDDQSVETHSREQSKEYKRKAN 231



 Score = 31.2 bits (70), Expect = 2.7
 Identities = 15/61 (24%), Positives = 31/61 (50%), Gaps = 2/61 (3%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEE--ESEEEEENEEDDDEERKAEKKNSKNEHVQ 229
           DDL+        ++  +  DDD+D+D+++  +S++   NE DD +       + ++ H  
Sbjct: 48  DDLKQETLPSKSNESHEHMDDDDDDDDDDHVDSQDSNSNESDDADHTDDSDHSDESHHSD 107

Query: 230 E 230
           E
Sbjct: 108 E 108



 Score = 30.1 bits (67), Expect = 7.0
 Identities = 22/78 (28%), Positives = 31/78 (39%), Gaps = 24/78 (30%)

Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDD 184
           S++S +  D+DD  DDD               V SQD              +  DD D  
Sbjct: 59  SNESHEHMDDDDDDDDDDH-------------VDSQDSN-----------SNESDDADHT 94

Query: 185 DDDDETDDDNDEDNEEES 202
           DD D +D+ +  D  +ES
Sbjct: 95  DDSDHSDESHHSDESDES 112


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
           kDa subunit.  This family contains several DNA-directed
           RNA polymerase 19 kDa polypeptides. The Poxvirus
           DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
           DNA-template-directed extension of the 3'-end of an RNA
           strand by one nucleotide at a time.
          Length = 167

 Score = 38.5 bits (90), Expect = 0.005
 Identities = 17/64 (26%), Positives = 37/64 (57%)

Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTA 233
           +ED+ D  D + DDD++++  +E+ +EE  E  E+ +    ++   K  S +E++++ T+
Sbjct: 1   MEDSDDIIDYESDDDDSEEYEEEEEDEEDAESLESSDVSSLKQSNYKIESASENIEDATS 60

Query: 234 KQKN 237
             K 
Sbjct: 61  NPKQ 64



 Score = 31.2 bits (71), Expect = 1.8
 Identities = 12/64 (18%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
           DDD  +  +++++D++D E+ + +D  + ++S  + E+  ++ E+  +  K   +E +  
Sbjct: 13  DDDDSEEYEEEEEDEEDAESLESSDVSSLKQSNYKIESASENIEDATSNPKQV-SEKISA 71

Query: 231 LTAK 234
           +  +
Sbjct: 72  IKRR 75


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 40.8 bits (96), Expect = 0.006
 Identities = 19/74 (25%), Positives = 31/74 (41%), Gaps = 8/74 (10%)

Query: 124  SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
                S+ +      S         +  K      SS+  R+ RK   D   ED+ D + D
Sbjct: 1323 KKKKSEKKTARKKKSK--------TRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVD 1374

Query: 184  DDDDDETDDDNDED 197
            D +D++ +DD D+D
Sbjct: 1375 DSEDEDDEDDEDDD 1388



 Score = 40.0 bits (94), Expect = 0.010
 Identities = 29/211 (13%), Positives = 65/211 (30%), Gaps = 8/211 (3%)

Query: 102  GESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQD 161
             + ++K+ +  +            DD  D    + +  D  +      ++  +   SS  
Sbjct: 1186 ADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDD----EEQKTKPKKSSVK 1241

Query: 162  GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
              K +K       EDN +   DD   +    +  +             +  D E     K
Sbjct: 1242 RLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSK 1301

Query: 222  NSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSAL 281
             S     +     + +  A   +   + +   +    T V Q   S     +R       
Sbjct: 1302 PSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRP----R 1357

Query: 282  AGLNSKEEEEEEQEDSAFSDESADDNSSDED 312
               +    E+++  +   S++  D++  D+D
Sbjct: 1358 KKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388


>gnl|CDD|220888 pfam10846, DUF2722, Protein of unknown function (DUF2722).  This
           eukaryotic family of proteins has no known function.
          Length = 373

 Score = 40.2 bits (94), Expect = 0.006
 Identities = 20/99 (20%), Positives = 38/99 (38%)

Query: 138 SDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDED 197
           S +   S Q  ++   E         K        DL    +    ++DDD  D D+ E 
Sbjct: 264 SQESMTSFQHIIQWKPESQQKKHRRTKSSSSFGVIDLNSISEASQVNEDDDPPDSDSKER 323

Query: 198 NEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK 236
             EE+ + E    DD++++   + +S++E        + 
Sbjct: 324 KNEENSDPESTPSDDNDDKTCSESSSRSESPNRTNTGRY 362


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 40.0 bits (93), Expect = 0.006
 Identities = 34/113 (30%), Positives = 54/113 (47%), Gaps = 7/113 (6%)

Query: 117 GAMSGAISSDDSDDEND---EDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDD 173
                +I++D SD+END   E        E  +G+ K+ +E     +D  +  +V   + 
Sbjct: 71  CGNVASIANDKSDNENDVELEGLNIIVKNEEERGTQKEEEED----EDVEEIEEVEEVEV 126

Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           +E+  DDD+D + DDE + D + D  E + E    E DDD     E  NS +E
Sbjct: 127 VEEEYDDDEDSEKDDEKESDAEGDENELAGEYIIEEVDDDVAILNENINSIDE 179


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 40.2 bits (94), Expect = 0.006
 Identities = 18/91 (19%), Positives = 45/91 (49%), Gaps = 1/91 (1%)

Query: 140 DDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNE 199
           DD +    ++ K  E+V S+ D    ++     +    + D      + + + D  E+ E
Sbjct: 340 DDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAE 399

Query: 200 EESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
            +++EE+  +E++++++K E+ +   E  Q+
Sbjct: 400 AKAKEEKLKQEENEKKQK-EQADEDKEKRQK 429


>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
          Length = 325

 Score = 39.9 bits (94), Expect = 0.007
 Identities = 9/47 (19%), Positives = 19/47 (40%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
            DL ++ D D++  +  +     +E+  +    EEE       +R  
Sbjct: 6   HDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVL 52



 Score = 37.6 bits (88), Expect = 0.034
 Identities = 10/48 (20%), Positives = 23/48 (47%), Gaps = 1/48 (2%)

Query: 174 LEDNMDDDDDDDDDDETDD-DNDEDNEEESEEEEENEEDDDEERKAEK 220
           L+ +  ++D D D++  +  D     EEE  + +  EE+   +   ++
Sbjct: 3   LKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQR 50



 Score = 33.8 bits (78), Expect = 0.56
 Identities = 5/36 (13%), Positives = 17/36 (47%)

Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEE 205
           FD++  +  D+    +++   +D  +E+   +   +
Sbjct: 14  FDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQ 49



 Score = 33.4 bits (77), Expect = 0.66
 Identities = 8/39 (20%), Positives = 18/39 (46%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
           D D ++N  +  D+    E +  +++  EEE   +   +
Sbjct: 11  DADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQ 49



 Score = 33.0 bits (76), Expect = 0.98
 Identities = 10/46 (21%), Positives = 22/46 (47%)

Query: 178 MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS 223
           +   D ++D D  ++  +  +E+   EEE ++ D  EE    +  +
Sbjct: 3   LKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGAT 48



 Score = 32.6 bits (75), Expect = 1.1
 Identities = 10/49 (20%), Positives = 18/49 (36%)

Query: 178 MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           M     D ++D   D+N  +  +E    EE   D+D   +        +
Sbjct: 1   MTLKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQ 49



 Score = 31.5 bits (72), Expect = 2.9
 Identities = 10/45 (22%), Positives = 22/45 (48%)

Query: 177 NMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
                 D ++D + D++  E  +E++  EEE  ++D  E +   +
Sbjct: 1   MTLKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQ 45



 Score = 31.5 bits (72), Expect = 3.0
 Identities = 8/42 (19%), Positives = 16/42 (38%)

Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEED 211
            D D       D+    ++E  D++  + E  S+   +   D
Sbjct: 12  ADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLD 53


>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2.  This
           domain family is found in eukaryotes, and is typically
           between 138 and 153 amino acids in length. The family is
           found in association with pfam00493. Mini-chromosome
           maintenance (MCM) proteins are essential for DNA
           replication. These proteins use ATPase activity to
           perform this function.
          Length = 145

 Score = 38.1 bits (89), Expect = 0.008
 Identities = 14/37 (37%), Positives = 24/37 (64%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEER 216
           +D+ D++++  DD +D D E E EE+ E+   D+ ER
Sbjct: 1   EDEPDEEEELEDDIDDLDEEAEEEEDGEDLFGDNMER 37



 Score = 37.7 bits (88), Expect = 0.009
 Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 11/104 (10%)

Query: 128 SDDENDEDDTS--DDDTESGQGSMKKYK--EQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
           ++ E D  D    DD+++  + S    +  E  ++ +D   RR+ M    L+D+ DDD D
Sbjct: 41  ANPELDRYDEDGLDDESDYEELSAAARRAAEARLNRRDRLLRRRRMPAAFLDDDDDDDGD 100

Query: 184 DDDDDET-------DDDNDEDNEEESEEEEENEEDDDEERKAEK 220
            D   +        D+D D D+E+    EEE   +D  + KA  
Sbjct: 101 FDLTAQPRRRRRQYDEDRDADDEDIDPLEEELSLEDLSDVKAAS 144



 Score = 33.5 bits (77), Expect = 0.22
 Identities = 16/41 (39%), Positives = 25/41 (60%), Gaps = 6/41 (14%)

Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEE---ENEEDD 212
           ED  D++++ +DD    DD DE+ EEE + E+   +N E D
Sbjct: 1   EDEPDEEEELEDD---IDDLDEEAEEEEDGEDLFGDNMERD 38



 Score = 29.2 bits (66), Expect = 7.4
 Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 25/85 (29%)

Query: 119 MSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNM 178
           M  A   DD DD+ D D T+                         +RR+  +D+D +   
Sbjct: 86  MPAAFLDDDDDDDGDFDLTAQ-----------------------PRRRRRQYDEDRDA-- 120

Query: 179 DDDDDDDDDDETDDDNDEDNEEESE 203
           DD+D D  ++E   ++  D +  S 
Sbjct: 121 DDEDIDPLEEELSLEDLSDVKAASI 145



 Score = 28.8 bits (65), Expect = 8.2
 Identities = 20/81 (24%), Positives = 34/81 (41%), Gaps = 15/81 (18%)

Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDD 184
            D+ D+E + +D  DD  E  +              DG    + +F D++E +   + + 
Sbjct: 1   EDEPDEEEELEDDIDDLDEEAEEEE-----------DG----EDLFGDNMERDYRANPEL 45

Query: 185 DDDDETDDDNDEDNEEESEEE 205
           D  DE   D++ D EE S   
Sbjct: 46  DRYDEDGLDDESDYEELSAAA 66


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 38.9 bits (91), Expect = 0.008
 Identities = 23/71 (32%), Positives = 36/71 (50%), Gaps = 6/71 (8%)

Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTA------K 234
           + +D +D    +  DE++EEE+EEE+E   D++E+   EK  S  E      +      +
Sbjct: 74  NSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPSWKPVGTE 133

Query: 235 QKNPKAFTFQS 245
           Q  P A TF S
Sbjct: 134 QTGPHAMTFDS 144



 Score = 30.9 bits (70), Expect = 3.5
 Identities = 10/73 (13%), Positives = 27/73 (36%)

Query: 168 VMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEH 227
                  +    D+ +    D+ +    E+ +EE +E   +E+ +D+    ++     E 
Sbjct: 35  FFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEE 94

Query: 228 VQELTAKQKNPKA 240
            +E   +  +   
Sbjct: 95  NEEEDEESSDENE 107



 Score = 29.7 bits (67), Expect = 6.4
 Identities = 14/70 (20%), Positives = 33/70 (47%)

Query: 168 VMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEH 227
           + F     D    D+ +    +  +  + +  +E E+E  N ED +++  AEK++ ++E 
Sbjct: 34  LFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEE 93

Query: 228 VQELTAKQKN 237
             E   ++ +
Sbjct: 94  ENEEEDEESS 103


>gnl|CDD|224292 COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General
           function prediction only].
          Length = 398

 Score = 39.7 bits (93), Expect = 0.008
 Identities = 19/62 (30%), Positives = 33/62 (53%), Gaps = 7/62 (11%)

Query: 408 ERKFRRKEDIQAKK------HHVPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCLIKNFT 461
             K   +ED   KK        +P++ +  L+  P ++ ++GP QVGK+TL++ LIK   
Sbjct: 4   FGKEIIEEDSDIKKWEIERRKLLPRLIK-KLDLRPFIILILGPRQVGKTTLLKLLIKGLL 62

Query: 462 KT 463
           + 
Sbjct: 63  EE 64


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 36.7 bits (84), Expect = 0.009
 Identities = 17/35 (48%), Positives = 27/35 (77%), Gaps = 3/35 (8%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
           ++D D D DE +DD+DED   E +++E+++EDDDE
Sbjct: 9   EEDSDSDSDEEEDDDDED---EEDDDEDDDEDDDE 40



 Score = 36.7 bits (84), Expect = 0.009
 Identities = 13/31 (41%), Positives = 25/31 (80%)

Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
           E ++D+D D++EE ++++E+EEDDDE+   +
Sbjct: 7   EGEEDSDSDSDEEEDDDDEDEEDDDEDDDED 37



 Score = 35.9 bits (82), Expect = 0.020
 Identities = 13/31 (41%), Positives = 25/31 (80%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
           +D D D D++E DDD DE++++E ++E+++E
Sbjct: 10  EDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40



 Score = 34.4 bits (78), Expect = 0.071
 Identities = 12/31 (38%), Positives = 27/31 (87%)

Query: 185 DDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
           + ++++D D+DE+ +++ E+EE+++EDDDE+
Sbjct: 7   EGEEDSDSDSDEEEDDDDEDEEDDDEDDDED 37



 Score = 33.6 bits (76), Expect = 0.098
 Identities = 13/30 (43%), Positives = 25/30 (83%)

Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEE 204
           + + D D+++DDDDE ++D+DED++E+ +E
Sbjct: 11  DSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40



 Score = 33.2 bits (75), Expect = 0.17
 Identities = 11/34 (32%), Positives = 24/34 (70%)

Query: 187 DDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
           + E D D+D D EE+ ++E+E ++D+D++   ++
Sbjct: 7   EGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40



 Score = 32.8 bits (74), Expect = 0.22
 Identities = 12/34 (35%), Positives = 26/34 (76%)

Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEE 207
           LE   D D D D++++ DD+++ED++E+ +E+++
Sbjct: 6   LEGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDD 39



 Score = 31.3 bits (70), Expect = 0.69
 Identities = 12/31 (38%), Positives = 26/31 (83%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEE 201
           +D   D+ +++DDDD+D+E DD++D+++++E
Sbjct: 10  EDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40



 Score = 30.5 bits (68), Expect = 1.2
 Identities = 9/34 (26%), Positives = 24/34 (70%)

Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           + +ED++ +S+EEE+++++D+E+   +     +E
Sbjct: 7   EGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 37.3 bits (87), Expect = 0.010
 Identities = 15/53 (28%), Positives = 27/53 (50%)

Query: 164 KRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEER 216
           K RK       E+N D++D + +D E  +D++ +   E    +  EE  DE++
Sbjct: 70  KERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDEDK 122


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 39.6 bits (92), Expect = 0.011
 Identities = 29/118 (24%), Positives = 54/118 (45%), Gaps = 15/118 (12%)

Query: 104 STEKSMMEIDGIHGAMSGAISSDDSDDENDEDD------TSDDDTESGQGSMKKYKEQVV 157
           +T+K+++EI       S  +   + D   D++       T      +     ++  E+ V
Sbjct: 80  ATKKTVVEISEPLEEGSELVV--NEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKV 137

Query: 158 SSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
                RKRRKV   D+  ++   + +  D +E++     +N  ESEEE + E+DD E+
Sbjct: 138 -----RKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLEN--ESEEELDLEKDDGED 188


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 38.9 bits (91), Expect = 0.011
 Identities = 19/53 (35%), Positives = 29/53 (54%)

Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQ 235
           ++D+++  D+D+ E   EE EEE E EE DD+E   E K  + +      AK 
Sbjct: 129 EEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQETKYKRGDTSLTPQAKD 181



 Score = 35.8 bits (83), Expect = 0.11
 Identities = 11/61 (18%), Positives = 26/61 (42%)

Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAK 234
           ED  ++ D+DD +   ++  +E   EE +++E  +E   +           + ++ L   
Sbjct: 130 EDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQETKYKRGDTSLTPQAKDVLESLIDA 189

Query: 235 Q 235
            
Sbjct: 190 A 190



 Score = 33.9 bits (78), Expect = 0.44
 Identities = 11/45 (24%), Positives = 30/45 (66%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS 223
           ++D++++ D++  +   E+ EEE E EE ++++  +E K ++ ++
Sbjct: 129 EEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQETKYKRGDT 173


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 39.5 bits (93), Expect = 0.013
 Identities = 20/101 (19%), Positives = 29/101 (28%), Gaps = 15/101 (14%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
              D DD +  D     D    +G     ++        R+R +     D       D D
Sbjct: 196 RGRDGDDRDRRDRREQGDRREERGR----RDGGDRRGRRRRRDRRDARGDDNREDRGDRD 251

Query: 184 DDDDDET-----------DDDNDEDNEEESEEEEENEEDDD 213
            DD +             D       +  +E E E  EDD 
Sbjct: 252 GDDGEGRGGRRGRRFRDRDRRGRRGGDGGNEREPELREDDV 292



 Score = 34.9 bits (81), Expect = 0.31
 Identities = 18/97 (18%), Positives = 37/97 (38%), Gaps = 1/97 (1%)

Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDD 184
            ++ D+     D  D   E+ +G   + +E+     D  +R +    D  E+    D  D
Sbjct: 166 EEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGD 225

Query: 185 DDDDETDDDNDEDNEEESEEEE-ENEEDDDEERKAEK 220
                   D  +   +++ E+  + + DD E R   +
Sbjct: 226 RRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRR 262



 Score = 33.0 bits (76), Expect = 1.2
 Identities = 12/95 (12%), Positives = 35/95 (36%), Gaps = 2/95 (2%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
           +++   D  +  +  + D    +G  +  + +    + GR+  +    DD +     +  
Sbjct: 153 ATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQG 212

Query: 184 D--DDDDETDDDNDEDNEEESEEEEENEEDDDEER 216
           D  ++    D  +        +  +   +D+ E+R
Sbjct: 213 DRREERGRRDGGDRRGRRRRRDRRDARGDDNREDR 247


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 39.0 bits (91), Expect = 0.013
 Identities = 20/124 (16%), Positives = 49/124 (39%), Gaps = 17/124 (13%)

Query: 131 ENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKR-------RKVMFDDDLEDNMDDDDD 183
               ++   D   +     + +K      + G+KR       R V      E ++ + D 
Sbjct: 263 IGPIEEEVVDSQAANDEPRRVFK------KKGQKRTTRRVKMRPVRAKPSDEPSLPESDI 316

Query: 184 DDDDDETDDD--NDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAF 241
            ++  + D+   ++        +E++ +EDD+E ++  +K  K +       ++ NP + 
Sbjct: 317 HEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKP--RKRKVNPVSN 374

Query: 242 TFQS 245
            F+ 
Sbjct: 375 NFRR 378



 Score = 35.9 bits (83), Expect = 0.13
 Identities = 31/151 (20%), Positives = 59/151 (39%), Gaps = 26/151 (17%)

Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAK 234
           E+++DD+ D   + E ++      E   EE  +++  +DE R+  KK             
Sbjct: 243 EESIDDELDVLREIEAEEAGIGPIE---EEVVDSQAANDEPRRVFKK------------- 286

Query: 235 QKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQ 294
            K  K  T + V     + + +    +P+  + ++   +  K  S   G     +E++E 
Sbjct: 287 -KGQKRTT-RRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDED 344

Query: 295 EDSAFSDESADDNSSDEDETVEPKTHKDSKN 325
           ED        D+ S +E E  +    K  K 
Sbjct: 345 ED--------DEESKEEVEKKQKVKKKPRKR 367


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 39.3 bits (92), Expect = 0.014
 Identities = 18/82 (21%), Positives = 51/82 (62%), Gaps = 8/82 (9%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS-KNEHVQ 229
             ++ED   + DD++++DE +++ ++++E  S+E  ++EE ++++ +++ ++S  N    
Sbjct: 378 ALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGNSS-- 435

Query: 230 ELTAKQKNPKAFTFQSVIKGER 251
            L    KN +++    V++G++
Sbjct: 436 -LAVGYKNDRSY----VVRGDK 452



 Score = 32.4 bits (74), Expect = 1.7
 Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 1/69 (1%)

Query: 187 DDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSV 246
             E +D N E ++EE E+EEE EE++DE+    K++S +E  +E   + K   +    S+
Sbjct: 378 ALEIEDANTERDDEEEEDEEE-EEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGNSSL 436

Query: 247 IKGERKFRS 255
             G +  RS
Sbjct: 437 AVGYKNDRS 445



 Score = 30.1 bits (68), Expect = 9.3
 Identities = 14/29 (48%), Positives = 18/29 (62%)

Query: 286 SKEEEEEEQEDSAFSDESADDNSSDEDET 314
            +EEEEEE ED   S E +DD   +ED+ 
Sbjct: 395 DEEEEEEEDEDEGPSKEHSDDEEFEEDDV 423


>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
          Length = 852

 Score = 39.0 bits (91), Expect = 0.017
 Identities = 17/64 (26%), Positives = 27/64 (42%), Gaps = 2/64 (3%)

Query: 158 SSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERK 217
           SS +G         D  E    D  +DD +D    D+D+D+ +      E EED +  R+
Sbjct: 11  SSSNGSGGLVGT--DSGESRWVDGSEDDSEDPASLDDDDDDRDSGGGMLEEEEDGNMRRR 68

Query: 218 AEKK 221
             + 
Sbjct: 69  LIRT 72


>gnl|CDD|213833 TIGR03596, GTPase_YlqF, ribosome biogenesis GTP-binding protein
           YlqF.  Members of this protein family are GTP-binding
           proteins involved in ribosome biogenesis, including the
           essential YlqF protein of Bacillus subtilis, which is an
           essential protein. They are related to Era, EngA, and
           other GTPases of ribosome biogenesis, but are circularly
           permuted. This family is not universal, and is not
           present in Escherichia coli, and so is not as well
           studied as some other GTPases. This model is built for
           bacterial members [Protein synthesis, Other].
          Length = 276

 Score = 38.3 bits (90), Expect = 0.018
 Identities = 16/33 (48%), Positives = 17/33 (51%), Gaps = 1/33 (3%)

Query: 432 LEPPPIVVAVVGPPQVGKSTLIRCLI-KNFTKT 463
           L   PI   +VG P VGKSTLI  L  K   K 
Sbjct: 114 LLNRPIRAMIVGIPNVGKSTLINRLAGKKVAKV 146


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
           unknown].
          Length = 869

 Score = 38.9 bits (90), Expect = 0.020
 Identities = 27/119 (22%), Positives = 50/119 (42%), Gaps = 16/119 (13%)

Query: 132 NDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD-------- 183
           ND DD+ + D    +GS+    E  +   D      ++ +D  E N   ++D        
Sbjct: 99  NDSDDSVNSDKLENEGSVSSIDENELVDLD-----TLLDNDQPEKNESGNNDHATDKENL 153

Query: 184 -DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELT--AKQKNPK 239
            + D   ++D   E+++ ESE E  + + DDE   ++  N +N  V      A  ++  
Sbjct: 154 LESDASSSNDSESEESDSESEIESSDSDHDDENSDSKLDNLRNYIVSLKKDEADAESVL 212


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 38.1 bits (88), Expect = 0.022
 Identities = 16/87 (18%), Positives = 40/87 (45%), Gaps = 13/87 (14%)

Query: 164 KRRKVMFDDDLEDNMDDDD-------------DDDDDDETDDDNDEDNEEESEEEEENEE 210
           K+ K   +++ E+ M ++D              + D ++  ++ +     + E + +  +
Sbjct: 258 KKEKQGAEEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPD 317

Query: 211 DDDEERKAEKKNSKNEHVQELTAKQKN 237
           +  EE++ + +N +NE   EL A + N
Sbjct: 318 EIGEEKEEDDENEENERHTELLADELN 344



 Score = 34.7 bits (79), Expect = 0.30
 Identities = 21/116 (18%), Positives = 49/116 (42%), Gaps = 6/116 (5%)

Query: 113 DGIHGAMSGAISSDDSDDEND-EDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFD 171
           +        A+  +    E + E+  S++D + G        E    S+  +++++   +
Sbjct: 247 EPGEPLSRPALKKEKQGAEEEGEEGMSEEDLDVGAA----EIENKEVSEGDKEQQQEEVE 302

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEE-ESEEEEENEEDDDEERKAEKKNSKNE 226
           +      +   D  D+   + + D++NEE E   E   +E ++ E+  E+K  + E
Sbjct: 303 NAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKGIEEKRRQME 358


>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22.  The
           mitochondrial protein translocase family, which is
           responsible for movement of nuclear encoded pre-proteins
           into mitochondria, is very complex with at least 19
           components. These proteins include several chaperone
           proteins, four proteins of the outer membrane
           translocase (Tom) import receptor, five proteins of the
           Tom channel complex, five proteins of the inner membrane
           translocase (Tim) and three "motor" proteins. This
           family represents the Tom22 proteins. The N terminal
           region of Tom22 has been shown to have chaperone-like
           activity, and the C terminal region faces the
           intermembrane face.
          Length = 136

 Score = 36.1 bits (84), Expect = 0.024
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEES 202
           DDDD+DD D ++D  +D D E E+
Sbjct: 28  DDDDEDDTDTDSDISDDSDFENET 51



 Score = 33.8 bits (78), Expect = 0.18
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEE 205
            ++ DDDD+D+TD D+D  ++ + E E
Sbjct: 24  QEESDDDDEDDTDTDSDISDDSDFENE 50



 Score = 31.8 bits (73), Expect = 0.85
 Identities = 12/27 (44%), Positives = 17/27 (62%)

Query: 173 DLEDNMDDDDDDDDDDETDDDNDEDNE 199
             E + DD+DD D D +  DD+D +NE
Sbjct: 24  QEESDDDDEDDTDTDSDISDDSDFENE 50



 Score = 31.1 bits (71), Expect = 1.4
 Identities = 12/27 (44%), Positives = 17/27 (62%)

Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEE 207
            ++ DDDDE D D D D  ++S+ E E
Sbjct: 24  QEESDDDDEDDTDTDSDISDDSDFENE 50



 Score = 30.3 bits (69), Expect = 2.4
 Identities = 10/34 (29%), Positives = 14/34 (41%)

Query: 111 EIDGIHGAMSGAISSDDSDDENDEDDTSDDDTES 144
             +      + A    D DDE+D D  SD   +S
Sbjct: 12  FQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDS 45



 Score = 29.1 bits (66), Expect = 7.0
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 183 DDDDDDETDDDNDEDNEEESEEEEENE 209
            ++ DD+ +DD D D++   + + ENE
Sbjct: 24  QEESDDDDEDDTDTDSDISDDSDFENE 50



 Score = 29.1 bits (66), Expect = 7.4
 Identities = 4/25 (16%), Positives = 17/25 (68%)

Query: 191 DDDNDEDNEEESEEEEENEEDDDEE 215
           + D+D++++ +++ +  ++ D + E
Sbjct: 26  ESDDDDEDDTDTDSDISDDSDFENE 50


>gnl|CDD|223052 PHA03346, PHA03346, US22 family homolog; Provisional.
          Length = 520

 Score = 38.2 bits (89), Expect = 0.025
 Identities = 17/66 (25%), Positives = 31/66 (46%)

Query: 161 DGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
           DG        +          DDDD+ D+ DDD+ ED   + +E++  +  D +E + E+
Sbjct: 406 DGGGGLAPPPEPPRPPPPPFPDDDDEGDDEDDDDWEDLGFDLDEDDVYDLKDVDEWRLER 465

Query: 221 KNSKNE 226
           + +  E
Sbjct: 466 RIATAE 471



 Score = 35.9 bits (83), Expect = 0.14
 Identities = 20/92 (21%), Positives = 28/92 (30%), Gaps = 22/92 (23%)

Query: 129 DDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDD--- 185
            +  DE D                     +      R       D +D  DD+DDDD   
Sbjct: 399 PENEDEADGGGGL----------------APPPEPPRPPPPPFPDDDDEGDDEDDDDWED 442

Query: 186 ---DDDETDDDNDEDNEEESEEEEENEEDDDE 214
              D DE D  + +D +E   E      +  E
Sbjct: 443 LGFDLDEDDVYDLKDVDEWRLERRIATAERWE 474



 Score = 32.4 bits (74), Expect = 1.4
 Identities = 16/85 (18%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 156 VVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDE-----------TDDDNDEDNEEESEE 204
            V   +  +RR+        +N D+ D                    DD+DE ++E+ ++
Sbjct: 383 RVLRGEFAERRR---PRGPPENEDEADGGGGLAPPPEPPRPPPPPFPDDDDEGDDEDDDD 439

Query: 205 EEENEEDDDEERKAEKKNSKNEHVQ 229
            E+   D DE+   + K+     ++
Sbjct: 440 WEDLGFDLDEDDVYDLKDVDEWRLE 464


>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2).  All proteins
           in this family for which functions are known are flap
           endonucleases that generate the 3' incision next to DNA
           damage as part of nucleotide excision repair. This
           family is related to many other flap endonuclease
           families including the fen1 family. This family is based
           on the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1034

 Score = 38.3 bits (89), Expect = 0.026
 Identities = 31/134 (23%), Positives = 56/134 (41%), Gaps = 11/134 (8%)

Query: 102 GESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDD----TSDDDTESGQGSMKKYKE--- 154
           GES +  ++        M       +SD    E D    T +    S      + +E   
Sbjct: 630 GESADLLLISNPMEVEPMESEKEESESDGSFIEVDSVSSTLELQVPSKSQPTDESEENAE 689

Query: 155 QVVSSQDGRKRRKV---MFDDDLEDNMDDDDDDDDDDETDDDNDED-NEEESEEEEENEE 210
             V+S +G  R+++   +FD+  EDN+    +++ D +   +  +D + EE E  E N  
Sbjct: 690 NKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNEWQDISLEELEALEANLL 749

Query: 211 DDDEERKAEKKNSK 224
            +    KA+K+  K
Sbjct: 750 AEQNSLKAQKQQQK 763


>gnl|CDD|216791 pfam01926, MMR_HSR1, 50S ribosome-binding GTPase.  The full-length
           GTPase protein is required for the complete activity of
           the protein of interacting with the 50S ribosome and
           binding of both adenine and guanine nucleotides, with a
           preference for guanine nucleotide.
          Length = 117

 Score = 35.3 bits (82), Expect = 0.031
 Identities = 13/20 (65%), Positives = 14/20 (70%)

Query: 439 VAVVGPPQVGKSTLIRCLIK 458
           VA+VG P VGKSTLI  L  
Sbjct: 2   VALVGRPNVGKSTLINALTG 21


>gnl|CDD|221753 pfam12753, Nro1, Nuclear pore complex subunit Nro1.  In fission
           yeast, this protein is a positive regulator of the
           stability of Sre1N, the sterol regulatory
           element-binding protein which is an ER membrane-bound
           transcription factor that controls adaptation to low
           oxygen-growth. In addition, the fission yeast Nro1 is a
           direct inhibitor of a protein that inhibits SreN1
           degradation, Ofd1 (an oxoglutamate deoxygenase). The
           outcome of this reactivity is that Ofd1 acts as an
           oxygen sensor that regulates the binding of Nro1 to Ofd1
           to control the stability of Sre1N. Solution of the
           structure of Nro1 reveals it to be made up of a number
           of TPR coils.
          Length = 401

 Score = 37.9 bits (88), Expect = 0.032
 Identities = 13/37 (35%), Positives = 25/37 (67%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEEN 208
           DDL D +++   +++ DE  D ++E+ EEE E +E++
Sbjct: 209 DDLLDIVENFGKENEIDEGLDSDEEEEEEEVELDEKH 245


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 37.4 bits (87), Expect = 0.033
 Identities = 13/35 (37%), Positives = 24/35 (68%)

Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
           +D++ +DD  D+D +++   E +E E  EE++DEE
Sbjct: 36  EDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEE 70



 Score = 35.1 bits (81), Expect = 0.17
 Identities = 14/36 (38%), Positives = 25/36 (69%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEE 206
           D++ ED++ D+D++D+    +D+N+   EEE EE E
Sbjct: 37  DEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72



 Score = 34.7 bits (80), Expect = 0.23
 Identities = 15/44 (34%), Positives = 28/44 (63%)

Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
           +D  +D++ +DD  + D++++   EE+  E  E EED++ E KA
Sbjct: 32  QDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEVKA 75



 Score = 34.4 bits (79), Expect = 0.32
 Identities = 14/44 (31%), Positives = 24/44 (54%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
                 ++ +D++ +DD   +D+ DE   EE E E   EE+D+E
Sbjct: 27  PFASAQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEE 70



 Score = 33.2 bits (76), Expect = 0.64
 Identities = 18/51 (35%), Positives = 29/51 (56%)

Query: 162 GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
           G  +       DL ++ + +DD  D+DE D+   E++E E  EEEE+EE +
Sbjct: 22  GNGKGPFASAQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72



 Score = 32.4 bits (74), Expect = 1.1
 Identities = 17/63 (26%), Positives = 30/63 (47%), Gaps = 6/63 (9%)

Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVI 247
           D T+D+  ED+  + +EE+E   ++DE    E++        E    + +P A T    +
Sbjct: 33  DLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEE------DEEGEVKASPDADTTILFV 86

Query: 248 KGE 250
           K E
Sbjct: 87  KPE 89


>gnl|CDD|178458 PLN02867, PLN02867, Probable galacturonosyltransferase.
          Length = 535

 Score = 38.0 bits (88), Expect = 0.034
 Identities = 18/55 (32%), Positives = 32/55 (58%), Gaps = 3/55 (5%)

Query: 206 EENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQS---VIKGERKFRSTK 257
           E  E+DD          S N+ V+E+T+ +++ KAF F++   ++K ERK +S +
Sbjct: 104 EAKEQDDGGRGTKGSTESFNDLVKEMTSNRQDIKAFAFRTKAMLLKMERKVQSAR 158


>gnl|CDD|179525 PRK03003, PRK03003, GTP-binding protein Der; Reviewed.
          Length = 472

 Score = 37.6 bits (88), Expect = 0.035
 Identities = 15/26 (57%), Positives = 17/26 (65%), Gaps = 1/26 (3%)

Query: 428 DRTPLEPPPIVVAVVGPPQVGKSTLI 453
           +     P P VVAVVG P VGKSTL+
Sbjct: 31  EAAEGGPLP-VVAVVGRPNVGKSTLV 55


>gnl|CDD|227199 COG4862, MecA, Negative regulator of genetic competence,
           sporulation and motility [Posttranslational
           modification, protein turnover, chaperones / Signal
           transduction mechanisms / Cell motility and secretion].
          Length = 224

 Score = 37.0 bits (86), Expect = 0.035
 Identities = 14/53 (26%), Positives = 29/53 (54%)

Query: 176 DNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHV 228
           +    DD D+  D  D+D  E+  E+ EE+ ++ E +++  +AEK+     ++
Sbjct: 81  EGEGPDDLDEFLDMPDEDGAEEGLEQLEEKFDDVEKEEQGDEAEKEPDYLYYI 133



 Score = 31.6 bits (72), Expect = 1.7
 Identities = 12/42 (28%), Positives = 23/42 (54%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
           D D   +M D+D  ++  E  ++  +D E+E + +E  +E D
Sbjct: 87  DLDEFLDMPDEDGAEEGLEQLEEKFDDVEKEEQGDEAEKEPD 128



 Score = 31.2 bits (71), Expect = 2.2
 Identities = 17/53 (32%), Positives = 24/53 (45%), Gaps = 2/53 (3%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELT 232
           + +  DD DE  D  DED  EE  E+   EE  D+  K E+ +   +    L 
Sbjct: 81  EGEGPDDLDEFLDMPDEDGAEEGLEQ--LEEKFDDVEKEEQGDEAEKEPDYLY 131


>gnl|CDD|206647 cd00881, GTP_translation_factor, GTP translation factor family
            primarily contains translation initiation, elongation and
            release factors.  The GTP translation factor family
            consists primarily of translation initiation, elongation,
            and release factors, which play specific roles in protein
            translation. In addition, the family includes Snu114p, a
            component of the U5 small nuclear riboprotein particle
            which is a component of the spliceosome and is involved
            in excision of introns, TetM, a tetracycline resistance
            gene that protects the ribosome from tetracycline
            binding, and the unusual subfamily CysN/ATPS, which has
            an unrelated function (ATP sulfurylase) acquired through
            lateral transfer of the EF1-alpha gene and development of
            a new function.
          Length = 183

 Score = 36.5 bits (85), Expect = 0.036
 Identities = 9/67 (13%), Positives = 25/67 (37%), Gaps = 7/67 (10%)

Query: 982  KDIFRTLR--MPKVMGVLTHLDMLKNNK---TLKTTKKMLKHRFWTEV-YAGAKLFYLSG 1035
            ++           ++  +  +D +        L+  K++LK   +T +      +  +S 
Sbjct: 104  REHLNIALAGGLPIIVAVNKIDRVGEEDFDEVLREIKELLKLIGFTFLKGKDVPIIPISA 163

Query: 1036 IVHGEYM 1042
             + GE +
Sbjct: 164  -LTGEGI 169



 Score = 36.5 bits (85), Expect = 0.039
 Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 3/48 (6%)

Query: 55  ITFIECNNDINSMIDISK---VADLVLLLIDASFGFEMEIFEFLNICQ 99
           I FI+     +   +  +    AD  LL++DA+ G E +  E LNI  
Sbjct: 64  INFIDTPGHEDFSKETVRGLAQADGALLVVDANEGVEPQTREHLNIAL 111


>gnl|CDD|234628 PRK00093, PRK00093, GTP-binding protein Der; Reviewed.
          Length = 435

 Score = 37.7 bits (89), Expect = 0.037
 Identities = 14/36 (38%), Positives = 22/36 (61%)

Query: 423 HVPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCLIK 458
            +P+ +    E  PI +A++G P VGKS+LI  L+ 
Sbjct: 160 ELPEEEEEDEEDEPIKIAIIGRPNVGKSSLINALLG 195



 Score = 31.9 bits (74), Expect = 2.4
 Identities = 12/15 (80%), Positives = 13/15 (86%)

Query: 438 VVAVVGPPQVGKSTL 452
           VVA+VG P VGKSTL
Sbjct: 3   VVAIVGRPNVGKSTL 17


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 37.3 bits (86), Expect = 0.039
 Identities = 18/96 (18%), Positives = 39/96 (40%), Gaps = 11/96 (11%)

Query: 155 QVVSSQDGRKRRKVMFDDDLEDN-----MDDDDDDDDDDETDD------DNDEDNEEESE 203
           +  + ++ R+  K  + ++ E++        DD + +  +  D       +      ESE
Sbjct: 188 KAATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESE 247

Query: 204 EEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPK 239
           EEE +E + DE+   + K    +   +   K + P 
Sbjct: 248 EEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPH 283



 Score = 33.9 bits (77), Expect = 0.45
 Identities = 19/97 (19%), Positives = 43/97 (44%), Gaps = 9/97 (9%)

Query: 133 DEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDD 192
           DE +   ++ E     +    E V    +  K +K     DLE  +  D   +   E+++
Sbjct: 198 DEKERYVEEEEESDTEL----EAVTDDSEKEKTKK----KDLEKWLGSDQSMETS-ESEE 248

Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQ 229
           +   ++E + +E+E+N+    + +  + K S+  H+ 
Sbjct: 249 EESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIH 285



 Score = 33.9 bits (77), Expect = 0.47
 Identities = 15/121 (12%), Positives = 48/121 (39%), Gaps = 8/121 (6%)

Query: 115 IHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDL 174
                       +   E +E+  ++ +  +     +K K++ +         K +  D  
Sbjct: 189 AATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDL--------EKWLGSDQS 240

Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAK 234
            +  + ++++  + E+D+D DEDN+ +  + + ++     +     +  +    +++ A 
Sbjct: 241 METSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQERENEKIPAV 300

Query: 235 Q 235
           Q
Sbjct: 301 Q 301


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 37.2 bits (86), Expect = 0.040
 Identities = 32/115 (27%), Positives = 57/115 (49%), Gaps = 16/115 (13%)

Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEE--------RKAEKKNSKNEHVQELTAKQKNPKA 240
           E +D++ E+ EEESEEEEE + +DD E        RK ++       +QE   +    KA
Sbjct: 8   EEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRIT-----IQEREREAAKEKA 62

Query: 241 FTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQE 295
              ++  K E + R T   V  + +   K+L ++ + T   A ++  + ++E +E
Sbjct: 63  LEEEAKRKAEERKRETLKIVEEEVK---KELELKKRNTLLEANIDDVDTDDENEE 114


>gnl|CDD|219912 pfam08574, DUF1762, Protein of unknown function (DUF1762).  This is
           a family of proteins of unknown function. Yeast IWR1 is
           known to interact with RNA polymerase II and deletion of
           this protein results in hypersensitivity to the K1
           killer toxin.
          Length = 77

 Score = 34.3 bits (79), Expect = 0.040
 Identities = 9/34 (26%), Positives = 23/34 (67%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
            D+D++  +D  +D++D+ ++  S++E+ N E+ 
Sbjct: 35  IDEDEEYHEDLANDEDDDADQVLSDDEDSNAENY 68



 Score = 33.9 bits (78), Expect = 0.057
 Identities = 14/44 (31%), Positives = 21/44 (47%), Gaps = 2/44 (4%)

Query: 157 VSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDD--NDEDN 198
             S  G  R     ++  ED  +D+DDD D   +DD+  N E+ 
Sbjct: 25  HRSDIGYIRIIDEDEEYHEDLANDEDDDADQVLSDDEDSNAENY 68



 Score = 27.8 bits (62), Expect = 8.7
 Identities = 11/50 (22%), Positives = 24/50 (48%), Gaps = 5/50 (10%)

Query: 171 DDDLEDNMDDDDD-----DDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
            +D E      D       D+D+E  +D   D ++++++   ++ED + E
Sbjct: 17  PEDEEAANHRSDIGYIRIIDEDEEYHEDLANDEDDDADQVLSDDEDSNAE 66


>gnl|CDD|235052 PRK02597, rpoC2, DNA-directed RNA polymerase subunit beta';
            Provisional.
          Length = 1331

 Score = 38.1 bits (89), Expect = 0.041
 Identities = 16/84 (19%), Positives = 24/84 (28%), Gaps = 3/84 (3%)

Query: 130  DENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDE 189
             +N   D + D       S           +  R R  +  D    +       D DD+ 
Sbjct: 1251 MQNLRPDYTVDMPARPYAS-SLDDPSDEDLEATRSRHGI--DPSSSNFAAFARPDADDEL 1307

Query: 190  TDDDNDEDNEEESEEEEENEEDDD 213
              DD   D       +EE    D+
Sbjct: 1308 LIDDQLPDPAALEGLQEEGLLSDE 1331


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 36.6 bits (85), Expect = 0.043
 Identities = 30/119 (25%), Positives = 47/119 (39%), Gaps = 23/119 (19%)

Query: 100 VHGESTEKSMME-IDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVS 158
           +HG+  E   M  I      +S  I+S +SD EN  D+  D  +ES  G  ++ K+    
Sbjct: 83  IHGDVKESDPMSFITNPSLTVSVPIASGESDLENLNDE--DTSSESSYGFKEESKK---- 136

Query: 159 SQDGRKRRKVMFDDDLEDNMDDDDDDDDDDE--TDDDNDEDNEEESEEEEENEEDDDEE 215
                            + +  ++ D    E  T D    + EEE   EEE E  + E+
Sbjct: 137 --------------GSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEEEEVKEPED 181


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 34.1 bits (79), Expect = 0.047
 Identities = 11/32 (34%), Positives = 23/32 (71%)

Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
           D D + +E ++  D ++E+E +EEE++++D D
Sbjct: 43  DSDAEWEEEEEGEDLESEDEEDEEEDDDDDMD 74



 Score = 33.7 bits (78), Expect = 0.048
 Identities = 13/60 (21%), Positives = 29/60 (48%), Gaps = 5/60 (8%)

Query: 153 KEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
              V+  ++   +     D D     D D + ++++E +D   ED E+E E++++ + D 
Sbjct: 21  PSSVILPRNPLSQDLPGLDYDY----DSDAEWEEEEEGEDLESEDEEDEEEDDDD-DMDG 75



 Score = 33.3 bits (77), Expect = 0.074
 Identities = 18/42 (42%), Positives = 28/42 (66%), Gaps = 3/42 (7%)

Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEE-ESEEEEENEEDDDEE 215
           +D    D D D D E +++  E+ E+ ESE+EE+ EEDDD++
Sbjct: 33  QDLPGLDYDYDSDAEWEEE--EEGEDLESEDEEDEEEDDDDD 72



 Score = 29.9 bits (68), Expect = 1.3
 Identities = 14/45 (31%), Positives = 27/45 (60%), Gaps = 1/45 (2%)

Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
              DL   +D D D D + E +++ ++   E+ E+EEE+++DD +
Sbjct: 31  LSQDLPG-LDYDYDSDAEWEEEEEGEDLESEDEEDEEEDDDDDMD 74


>gnl|CDD|221955 pfam13173, AAA_14, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily.
          Length = 127

 Score = 35.2 bits (82), Expect = 0.051
 Identities = 9/23 (39%), Positives = 15/23 (65%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNF 460
           ++ + GP QVGK+TL+   +K  
Sbjct: 4   IIVITGPRQVGKTTLLLQFLKEL 26


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 37.4 bits (87), Expect = 0.053
 Identities = 51/237 (21%), Positives = 90/237 (37%), Gaps = 28/237 (11%)

Query: 122 AISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
            I++   DD++ E   SD    S     KK K++    +  RK               D+
Sbjct: 248 KINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKD-------------LDE 294

Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK----NEHVQELTAKQ-- 235
           D+ + + E    +D  + ++ EEE    ED  ++RK E+++      ++ +Q   AKQ  
Sbjct: 295 DELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDDDFVEDDDDLQASLAKQRR 354

Query: 236 ---KNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSA-LAGLNSKEEEE 291
              K  K    + + +   + RS       +    + D  I    TS  +  L  +  EE
Sbjct: 355 LAQKKRKKLRPEDIARQIAEERSED----EEEAAENNDNGIVIDETSEFVRSLQKEPLEE 410

Query: 292 EEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAASAFLE 348
           + +      +E +D    DEDE  E     D +    D     + +  +   S  LE
Sbjct: 411 KPENKDESVEEISDAEEDDEDEEDEDG-DGDVEMSAVDNDEEKEEEDKEAIPSTILE 466



 Score = 33.2 bits (76), Expect = 0.92
 Identities = 36/189 (19%), Positives = 72/189 (38%), Gaps = 18/189 (9%)

Query: 133 DEDDTSDDDTESGQGSMKKYKEQVVSSQDG--------RKRRKVMFDDDL-----EDNMD 179
            EDD  DDDT++     KK +++  + +          ++R      +DL        ++
Sbjct: 82  GEDDDDDDDTKAWLKKSKKRQKKKEAERKKALLLDEKEKERAAEYTSEDLAGLKVGHKVE 141

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPK 239
           + ++ +D   T  D     +E+  +E EN E  ++E+  +    K +             
Sbjct: 142 EFEEGEDVILTLKDTGVLEDEDEGDELENVELVEKEKDKKNLELKKKKPDYDPDDDDKFN 201

Query: 240 AFTFQSV----IKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLN-SKEEEEEEQ 294
             +  S     I+G++K      T+       D+    R ++   L   N S +++  E 
Sbjct: 202 KRSILSKYDEEIEGKKKKSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTET 261

Query: 295 EDSAFSDES 303
             S + D S
Sbjct: 262 PASDYYDVS 270


>gnl|CDD|206683 cd01896, DRG, Developmentally Regulated GTP-binding protein (DRG). 
           The developmentally regulated GTP-binding protein (DRG)
           subfamily is an uncharacterized member of the Obg
           family, an evolutionary branch of GTPase superfamily
           proteins. GTPases act as molecular switches regulating
           diverse cellular processes. DRG2 and DRG1 comprise the
           DRG subfamily in eukaryotes. In view of their widespread
           expression in various tissues and high conservation
           among distantly related species in eukaryotes and
           archaea, DRG proteins may regulate fundamental cellular
           processes. It is proposed that the DRG subfamily
           proteins play their physiological roles through RNA
           binding.
          Length = 233

 Score = 36.4 bits (85), Expect = 0.063
 Identities = 12/18 (66%), Positives = 14/18 (77%)

Query: 439 VAVVGPPQVGKSTLIRCL 456
           VA+VG P VGKSTL+  L
Sbjct: 3   VALVGFPSVGKSTLLSKL 20


>gnl|CDD|236570 PRK09563, rbgA, GTPase YlqF; Reviewed.
          Length = 287

 Score = 36.3 bits (85), Expect = 0.067
 Identities = 12/28 (42%), Positives = 15/28 (53%)

Query: 432 LEPPPIVVAVVGPPQVGKSTLIRCLIKN 459
           + P  I   ++G P VGKSTLI  L   
Sbjct: 117 MRPRAIRAMIIGIPNVGKSTLINRLAGK 144


>gnl|CDD|234274 TIGR03594, GTPase_EngA, ribosome-associated GTPase EngA.  EngA
           (YfgK, Der) is a ribosome-associated essential GTPase
           with a duplication of its GTP-binding domain. It is
           broadly to universally distributed among bacteria. It
           appears to function in ribosome biogenesis or stability
           [Protein synthesis, Other].
          Length = 429

 Score = 36.7 bits (86), Expect = 0.074
 Identities = 14/37 (37%), Positives = 22/37 (59%), Gaps = 1/37 (2%)

Query: 423 HVPQVDRT-PLEPPPIVVAVVGPPQVGKSTLIRCLIK 458
            +P+ +     E  PI +A++G P VGKSTL+  L+ 
Sbjct: 158 LLPEEEEEEEEEDGPIKIAIIGRPNVGKSTLVNALLG 194



 Score = 33.2 bits (77), Expect = 0.98
 Identities = 13/21 (61%), Positives = 14/21 (66%)

Query: 438 VVAVVGPPQVGKSTLIRCLIK 458
           VVA+VG P VGKSTL   L  
Sbjct: 1   VVAIVGRPNVGKSTLFNRLTG 21


>gnl|CDD|224009 COG1084, COG1084, Predicted GTPase [General function prediction
           only].
          Length = 346

 Score = 36.1 bits (84), Expect = 0.099
 Identities = 19/55 (34%), Positives = 27/55 (49%), Gaps = 6/55 (10%)

Query: 403 SVIKG-ERKFRRKEDIQAKKHHVPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCL 456
           S+IK  +         +     +P +D    + P IVVA  G P VGKS+L+R L
Sbjct: 139 SIIKKIDDDLEFLRKARDHLKKLPAID---PDLPTIVVA--GYPNVGKSSLVRKL 188


>gnl|CDD|221620 pfam12527, DUF3727, Protein of unknown function (DUF3727).  This
           domain family is found in bacteria and eukaryotes, and
           is approximately 100 amino acids in length.
          Length = 100

 Score = 33.7 bits (78), Expect = 0.10
 Identities = 13/36 (36%), Positives = 21/36 (58%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
           + ++D+  + E +DD+DED EEE E       +D E
Sbjct: 20  EPEEDEILELEEEDDDDEDEEEEYELLASFYHEDQE 55



 Score = 32.6 bits (75), Expect = 0.26
 Identities = 8/27 (29%), Positives = 17/27 (62%)

Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEE 207
            + ++D+    ++ D+D+E+E EE E 
Sbjct: 19  PEPEEDEILELEEEDDDDEDEEEEYEL 45


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 36.3 bits (84), Expect = 0.10
 Identities = 23/125 (18%), Positives = 48/125 (38%), Gaps = 25/125 (20%)

Query: 108 SMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRK 167
             +E   I  +++G   +D S   +D    + +DTE+   S                   
Sbjct: 284 DRLEKRPICPSLAGFQLTDPSGSLDDPFLKAPEDTENEDESDDN---------------- 327

Query: 168 VMFDDDLEDNMD------DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
               DD +DN        D + + +DD  +D  D  +  ++EE E  +E   +  + +++
Sbjct: 328 ---ADDFDDNDKKGDVGFDINAEVEDDPDEDAEDGGDGGDAEEFEAEKEPMLKVCRVDRQ 384

Query: 222 NSKNE 226
            ++  
Sbjct: 385 VTEII 389



 Score = 33.2 bits (76), Expect = 0.92
 Identities = 26/132 (19%), Positives = 47/132 (35%), Gaps = 25/132 (18%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQ---------------VVSSQDGRKRRKV 168
           SS+D ++ + EDD   DD E     +K +                   ++   G      
Sbjct: 252 SSEDPEELSSEDDEFPDDGEIDLTPLKFFPLLDRLEKRPICPSLAGFQLTDPSGSLDDPF 311

Query: 169 M------FDDDLEDNMDDDDDDDDDDET----DDDNDEDNEEESEEEEENEEDDDEERKA 218
           +       ++D  D+  DD DD+D         +   ED+ +E  E+  +  D +E    
Sbjct: 312 LKAPEDTENEDESDDNADDFDDNDKKGDVGFDINAEVEDDPDEDAEDGGDGGDAEEFEAE 371

Query: 219 EKKNSKNEHVQE 230
           ++   K   V  
Sbjct: 372 KEPMLKVCRVDR 383



 Score = 30.9 bits (70), Expect = 4.5
 Identities = 12/42 (28%), Positives = 25/42 (59%)

Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
           D D+ +  +   + D+ E+ +EE  + E+D  ++KA+KK  +
Sbjct: 151 DSDNKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQR 192



 Score = 30.9 bits (70), Expect = 5.4
 Identities = 13/42 (30%), Positives = 16/42 (38%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
           D +        D DDDDD   DD  E   + S+ E      D
Sbjct: 548 DPNFCPVGQGADGDDDDDGPFDDAAEMLSDPSDGEPGASASD 589


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 36.1 bits (83), Expect = 0.11
 Identities = 11/40 (27%), Positives = 24/40 (60%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
           DD+    +D + ++D D+D+ E++++   N  D+D +  A
Sbjct: 275 DDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDNGA 314



 Score = 35.3 bits (81), Expect = 0.15
 Identities = 11/39 (28%), Positives = 19/39 (48%)

Query: 186 DDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
           DDDE    ND D EE+ ++++  + DD      ++    
Sbjct: 274 DDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDN 312



 Score = 32.6 bits (74), Expect = 1.1
 Identities = 11/34 (32%), Positives = 18/34 (52%), Gaps = 2/34 (5%)

Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           D+DE +     +EEE+ +DDD E   +   + N 
Sbjct: 274 DDDEGSSSNDYDEEEDGDDDDNEDNDD--TNTNH 305


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 34.6 bits (80), Expect = 0.11
 Identities = 16/48 (33%), Positives = 23/48 (47%)

Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK 236
           +  +  +E+ EE   EE + EE  DE  + E    K E  +E   KQK
Sbjct: 97  DKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQK 144



 Score = 32.3 bits (74), Expect = 0.75
 Identities = 8/53 (15%), Positives = 28/53 (52%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
             L  +  + +++++++   ++ DE+ + +   E+E  +   E+R+  ++  K
Sbjct: 92  KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQK 144



 Score = 30.3 bits (69), Expect = 2.8
 Identities = 10/57 (17%), Positives = 29/57 (50%), Gaps = 3/57 (5%)

Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
           L+    +++++++ +  + D +E  +E  E+E         E++ E +  + E ++E
Sbjct: 96  LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKL---KREKRRENERKQKEILKE 149


>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
           terminus.  The C terminus of the plasma membrane Nha1
           antiporter plays an important role in the immediate cell
           response to hypo-osmotic shock which prevents an
           execessive loss of ions and water. This domain is found
           with pfam00999.
          Length = 430

 Score = 35.9 bits (83), Expect = 0.11
 Identities = 27/116 (23%), Positives = 40/116 (34%), Gaps = 11/116 (9%)

Query: 111 EIDGIHGAMSG-AISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVM 169
           E+D    A       + D  D ND    +D  +E        +      + + RK R+  
Sbjct: 315 EVDLETNAPERVKAPARDEVDTNDSSPLTDSKSEEKLEDDLAHSLLGSEASERRKNREPS 374

Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE----------EDDDEE 215
                       D +D+D E DD + +D EE   E               +DDDEE
Sbjct: 375 SSSPPSRPRSRRDSEDEDTERDDSDSDDEEETPAERRRRLAALGELPSARDDDDEE 430



 Score = 34.4 bits (79), Expect = 0.40
 Identities = 20/115 (17%), Positives = 43/115 (37%), Gaps = 30/115 (26%)

Query: 137 TSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDN-----MDDDDDDDDDDETD 191
           T  D  +  +    K + +V    +  +R K    D+++ N      D   ++  +D+  
Sbjct: 297 TKHDFIKQIRNLDPKARREVDLETNAPERVKAPARDEVDTNDSSPLTDSKSEEKLEDDLA 356

Query: 192 D-------------------------DNDEDNEEESEEEEENEEDDDEERKAEKK 221
                                      +  D+E+E  E ++++ DD+EE  AE++
Sbjct: 357 HSLLGSEASERRKNREPSSSSPPSRPRSRRDSEDEDTERDDSDSDDEEETPAERR 411


>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
           (DUF2076).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function. The domain,
           however, is found in various periplasmic ligand-binding
           sensor proteins.
          Length = 234

 Score = 35.4 bits (82), Expect = 0.12
 Identities = 19/95 (20%), Positives = 25/95 (26%), Gaps = 24/95 (25%)

Query: 101 HGESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQ 160
                    +      G      +  D    N+  D   D     QGS            
Sbjct: 163 GHSQPA--EIVDAIGEGGDGSGPAPADDTGINNYGDDDSDAAGGDQGS------------ 208

Query: 161 DGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDND 195
                     + D +D    D   DDDD  DDD+D
Sbjct: 209 ----------NGDDDDGGFADSGYDDDDMDDDDDD 233



 Score = 31.6 bits (72), Expect = 2.0
 Identities = 15/90 (16%), Positives = 28/90 (31%), Gaps = 7/90 (7%)

Query: 109 MMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKV 168
           M+  +G+     G     +  D   E         +    +  Y +    +  G      
Sbjct: 151 MLLGNGLENLFGGHSQPAEIVDAIGEGGDGSGPAPADDTGINNYGDDDSDAAGGD----- 205

Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDN 198
                  D+ D    D   D+ D D+D+D+
Sbjct: 206 --QGSNGDDDDGGFADSGYDDDDMDDDDDD 233


>gnl|CDD|206746 cd01849, YlqF_related_GTPase, Circularly permuted YlqF-related
           GTPases.  These proteins are found in bacteria,
           eukaryotes, and archaea.  They all exhibit a circular
           permutation of the GTPase signature motifs so that the
           order of the conserved G box motifs is G4-G5-G1-G2-G3,
           with G4 and G5 being permuted from the C-terminal region
           of proteins in the Ras superfamily to the N-terminus of
           YlqF-related GTPases.
          Length = 146

 Score = 34.3 bits (79), Expect = 0.12
 Identities = 24/94 (25%), Positives = 36/94 (38%), Gaps = 10/94 (10%)

Query: 374 DEAPNKKVHRKRQAELTAKQKNPKAFTFQSVIKGERKFRRKEDIQAKKHHVPQVDRTPLE 433
           D  P + + +   AEL+        F   +  +G  K + +   Q  K    +       
Sbjct: 39  DLVPKEVLRKWV-AELSELYGTKTFFISATNGQGILKLKAEITKQKLKLKYKK------- 90

Query: 434 PPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSV 467
              I V VVG P VGKS+ I  L+  F     S+
Sbjct: 91  --GIRVGVVGLPNVGKSSFINALLNKFKLKVGSI 122


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 35.4 bits (82), Expect = 0.13
 Identities = 14/55 (25%), Positives = 30/55 (54%)

Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQ 235
           DD+D+DDD  D  ++E++EE+ EE+ +       ++  + K  + E  +    ++
Sbjct: 25  DDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEK 79



 Score = 35.0 bits (81), Expect = 0.18
 Identities = 20/78 (25%), Positives = 36/78 (46%), Gaps = 15/78 (19%)

Query: 178 MDDDDDDDD---------------DDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKN 222
           M D DD++                DDE +DD+ +D+ +E E+EE+ EE      KA+ K 
Sbjct: 1   MADWDDEEFEPPAPPAKAVVKDKWDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKK 60

Query: 223 SKNEHVQELTAKQKNPKA 240
           +    ++E    ++  + 
Sbjct: 61  ALKAKIEEKEKAKREKEE 78


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 36.0 bits (83), Expect = 0.13
 Identities = 31/115 (26%), Positives = 53/115 (46%), Gaps = 23/115 (20%)

Query: 103 ESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDG 162
           ES E+S    D I    S   +  D D++ DE   S + ++S + + +K KE+       
Sbjct: 243 ESEEESGKR-DVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEK------- 294

Query: 163 RKRRKVMFDDDLEDNMDDDDDDDD-------DDETDDDNDEDNEEESEEEEENEE 210
           RKR K M        M+D+D+D++         E ++  + +     ++EEE EE
Sbjct: 295 RKRLKKM--------MEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEE 341



 Score = 34.4 bits (79), Expect = 0.35
 Identities = 17/74 (22%), Positives = 40/74 (54%), Gaps = 2/74 (2%)

Query: 159 SQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
           S++   +R V+ +D+  +    D+D+D+D+        D+EEE+EE+E+ +      +K 
Sbjct: 244 SEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKR--LKKM 301

Query: 219 EKKNSKNEHVQELT 232
            +   ++E ++ + 
Sbjct: 302 MEDEDEDEEMEIVP 315


>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein.  This family includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 116

 Score = 33.7 bits (77), Expect = 0.13
 Identities = 23/109 (21%), Positives = 40/109 (36%), Gaps = 21/109 (19%)

Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDD 184
           S D + + D DD  +DD E                         +F    + +  D  D+
Sbjct: 19  SYDDEHDEDMDDFIEDDDEEQD-----------EIPYDSDEIWAIFGKGRKRSYYDRYDE 67

Query: 185 DDDDETDDDNDEDNEEESE----------EEEENEEDDDEERKAEKKNS 223
           DD  +  +    + ++E            E E   E+++E+RK +KKN 
Sbjct: 68  DDALDNMEATFMEIQKEERRSARMARLEDERELAREEEEEKRKKKKKNK 116



 Score = 28.7 bits (64), Expect = 8.3
 Identities = 20/101 (19%), Positives = 40/101 (39%), Gaps = 19/101 (18%)

Query: 148 SMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDE----------- 196
           S     +   SS    +      D+D++D ++DD  D++ DE   D+DE           
Sbjct: 2   SSGPTLKPQTSSNKASRSYDDEHDEDMDDFIEDD--DEEQDEIPYDSDEIWAIFGKGRKR 59

Query: 197 ------DNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
                 D ++  +  E    +  +E +   + ++ E  +EL
Sbjct: 60  SYYDRYDEDDALDNMEATFMEIQKEERRSARMARLEDEREL 100


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 35.8 bits (83), Expect = 0.13
 Identities = 15/65 (23%), Positives = 32/65 (49%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
           D++ + + ++ DDD ++E+DD++  +  E   E          + KA++   K     E 
Sbjct: 235 DEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELER 294

Query: 232 TAKQK 236
            AK++
Sbjct: 295 EAKEE 299



 Score = 35.5 bits (82), Expect = 0.15
 Identities = 14/66 (21%), Positives = 28/66 (42%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
           + L +  DDD +++ DDE+  +  E   E   +    +     +R  EK+  + E   + 
Sbjct: 239 EGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKE 298

Query: 232 TAKQKN 237
             + K 
Sbjct: 299 EKQLKK 304



 Score = 33.9 bits (78), Expect = 0.47
 Identities = 19/90 (21%), Positives = 41/90 (45%), Gaps = 2/90 (2%)

Query: 153 KEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
           K Q +   + +K  K+  +    D M +   ++ DD+ ++++D+++  E  E E    + 
Sbjct: 212 KRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINK 271

Query: 213 DEERKAEKKNSKN--EHVQELTAKQKNPKA 240
               K + K  +N  +  +EL  + K  K 
Sbjct: 272 PVRPKRKTKAQRNKEKRRKELEREAKEEKQ 301


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 35.6 bits (82), Expect = 0.14
 Identities = 18/48 (37%), Positives = 26/48 (54%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
           DD   N D   +DD+       +++  EEE EE EE+EE++ EE   E
Sbjct: 54  DDPYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEE 101



 Score = 34.8 bits (80), Expect = 0.26
 Identities = 29/111 (26%), Positives = 54/111 (48%), Gaps = 29/111 (26%)

Query: 106 EKSMMEIDGIHGAMSGAISSDDSDD--ENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGR 163
           E++M+ +D         +++ D DD  E  E +TSDD      G+   + E      +GR
Sbjct: 28  EQNMLALD---------VAARDGDDAIELAEPETSDDP----YGNPDPFGED----DEGR 70

Query: 164 KRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
                     +E  + +D ++++ +E ++D +E+ EEESEE E   +  +E
Sbjct: 71  ----------IEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEE 111


>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1).  This family
           consists of several mammalian dentin matrix protein 1
           (DMP1) sequences. The dentin matrix acidic
           phosphoprotein 1 (DMP1) gene has been mapped to human
           chromosome 4q21. DMP1 is a bone and teeth specific
           protein initially identified from mineralised dentin.
           DMP1 is primarily localised in the nuclear compartment
           of undifferentiated osteoblasts. In the nucleus, DMP1
           acts as a transcriptional component for activation of
           osteoblast-specific genes like osteocalcin. During the
           early phase of osteoblast maturation, Ca(2+) surges into
           the nucleus from the cytoplasm, triggering the
           phosphorylation of DMP1 by a nuclear isoform of casein
           kinase II. This phosphorylated DMP1 is then exported out
           into the extracellular matrix, where it regulates
           nucleation of hydroxyapatite. DMP1 is a unique molecule
           that initiates osteoblast differentiation by
           transcription in the nucleus and orchestrates
           mineralised matrix formation extracellularly, at later
           stages of osteoblast maturation. The DMP1 gene has been
           found to be ectopically expressed in lung cancer
           although the reason for this is unknown.
          Length = 514

 Score = 35.8 bits (82), Expect = 0.14
 Identities = 55/239 (23%), Positives = 95/239 (39%), Gaps = 31/239 (12%)

Query: 102 GESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDD----TESGQGSM-------- 149
           G+ST+ S  E   + G   G  S  D  + +DE   SDD     +E G   M        
Sbjct: 181 GDSTQDSESEEHWVGGGSEGESSHGDGSEFDDEGMQSDDPESTRSERGNSRMSSAGLKSK 240

Query: 150 --KKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEE 207
             K   E+  S+QD  + + V +            ++D   E DD N  + + +S E   
Sbjct: 241 ESKGEDEEQASTQDSGESQSVEYPSRKFFRKSRISEEDGRGELDDSNTMEVKSDSTENAG 300

Query: 208 NEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSV-IKGERKFRSTKPTVVPQPRM 266
             +  +  R   +++S+    QE + + ++P + + Q   +  +     ++  VV + R 
Sbjct: 301 LSQSREHSRSESQEDSEENQSQEDSQEVQDPSSESSQEADLPSQENSSESQEEVVSESRG 360

Query: 267 SDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDE-SADDNSSDEDETVEPKTHKDSK 324
            + D    H                E+QEDS  S+E S D  SS E ++ E +   +S 
Sbjct: 361 DNPDNTTSHS---------------EDQEDSESSEEDSLDTPSSSESQSTEEQADSESN 404


>gnl|CDD|213227 cd03260, ABC_PstB_phosphate_transporter, ATP-binding cassette
           domain of the phosphate transport system.  Phosphate
           uptake is of fundamental importance in the cell
           physiology of bacteria because phosphate is required as
           a nutrient. The Pst system of E. coli comprises four
           distinct subunits encoded by the pstS, pstA, pstB, and
           pstC genes. The PstS protein is a phosphate-binding
           protein located in the periplasmic space. PstA and PstC
           are hydrophobic and they form the transmembrane portion
           of the Pst system. PstB is the catalytic subunit, which
           couples the energy of ATP hydrolysis to the import of
           phosphate across cellular membranes through the Pst
           system, often referred as ABC-protein. PstB belongs to
           one of the largest superfamilies of proteins
           characterized by a highly conserved adenosine
           triphosphate (ATP) binding cassette (ABC), which is also
           a nucleotide binding domain (NBD).
          Length = 227

 Score = 35.2 bits (82), Expect = 0.14
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 434 PPPIVVAVVGPPQVGKSTLIRCL 456
           P   + A++GP   GKSTL+R L
Sbjct: 24  PKGEITALIGPSGCGKSTLLRLL 46


>gnl|CDD|220779 pfam10488, PP1c_bdg, Phosphatase-1 catalytic subunit binding
           region.  This conserved C-terminus appears to be a
           protein phosphatase-1 catalytic subunit (PP1C) binding
           region, which may in some circumstances also be
           retroviral in origin since it is found in both herpes
           simplex virus and in mouse and man. This domain is found
           in Gadd-34 apoptosis-associated proteins as well as the
           constitutive repressor of eIF2-alpha
           phosphorylation/protein phosphatase 1, regulatory
           (inhibitor) subunit 15b, otherwise known as CReP.
           Diverse stressful conditions are associated with
           phosphorylation of the {alpha} subunit of eukaryotic
           translation initiation factor 2 (eIF2{alpha}) on serine
           51. This signaling event, which is conserved from yeast
           to mammals, negatively regulates the guanine nucleotide
           exchange factor, eIF2-B and inhibits the recycling of
           eIF2 to its active GTP bound form. In mammalian cells
           eIF2{alpha} phosphorylation emerges as an important
           event in stress signaling that impacts on gene
           expression at both the translational and transcriptional
           levels.
          Length = 307

 Score = 35.4 bits (81), Expect = 0.14
 Identities = 30/141 (21%), Positives = 58/141 (41%), Gaps = 27/141 (19%)

Query: 173 DLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELT 232
           DLE + D +    D++  DD  D D+   SE + E +++      +  K+    +    T
Sbjct: 25  DLESSSDVESISWDEESEDDGFDSDS-SLSESDREQDDEGLHLWNSFTKSVDPYNPLNFT 83

Query: 233 AKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEE 292
           A        T Q+             T+ P+P  S+ D        S    ++S+E    
Sbjct: 84  A--------TIQTAA-----------TIKPKPPSSESDW-------SGEENVSSQEGPLP 117

Query: 293 EQEDSAFSDESADDNSSDEDE 313
              + + S++ + ++S+DE+E
Sbjct: 118 STPEHSSSEDDSWESSADEEE 138


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 34.5 bits (79), Expect = 0.14
 Identities = 20/67 (29%), Positives = 36/67 (53%), Gaps = 1/67 (1%)

Query: 173 DLEDNMDDDDDDDDDDETDDDNDED-NEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
           D E++ D   ++ ++DE + ++ ED  +EE EE EE EE+     KA   +SK+   + +
Sbjct: 19  DEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKAPSSSSKSSSQETI 78

Query: 232 TAKQKNP 238
           +     P
Sbjct: 79  SIPPTPP 85


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
           function of this eukaryotic protein family is unknown.
           The yeast orthologues have been implicated in cell cycle
           progression and biogenesis of 60S ribosomal subunits.
           The Schistosoma mansoni Mak16 has been shown to target
           protein transport to the nucleolus.
          Length = 97

 Score = 32.9 bits (75), Expect = 0.15
 Identities = 13/46 (28%), Positives = 30/46 (65%)

Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
           F   LE    +++D+++++E +++++ + E  S++EE  EE +D E
Sbjct: 51  FKKALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLE 96



 Score = 31.8 bits (72), Expect = 0.47
 Identities = 11/45 (24%), Positives = 30/45 (66%)

Query: 167 KVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEED 211
           K +  ++ E+N +++++++++++  +     ++EE EEE E+ ED
Sbjct: 53  KALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLED 97



 Score = 30.2 bits (68), Expect = 1.5
 Identities = 10/43 (23%), Positives = 28/43 (65%)

Query: 178 MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
           ++ ++ +++D+E +++ +E++E E E   ++EE ++E    E 
Sbjct: 55  LEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLED 97


>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein.  This family consists of
           Ebola and Marburg virus nucleoproteins. These proteins
           are responsible for encapsidation of genomic RNA. It has
           been found that nucleoprotein DNA vaccines can offer
           protection from the virus.
          Length = 717

 Score = 35.7 bits (82), Expect = 0.16
 Identities = 34/172 (19%), Positives = 59/172 (34%), Gaps = 35/172 (20%)

Query: 171 DDDLEDNMDDDDDDDDD--------DETDDDNDEDNEEESEEEEE-------NEEDDDEE 215
           DD     MDDD DD  D        D  D ++ E +                N ++DD++
Sbjct: 417 DDVQPRPMDDDPDDSRDTTIPPGVVDPIDGESTEYSSYSDSFVGTNDDLVLFNLDEDDDD 476

Query: 216 RKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIK-----GERKFRSTKPTVVPQPRMSDKD 270
            K          + E  A      +   Q + +      +   R+++       +  D+ 
Sbjct: 477 SKP---------IPEQAASTYGQTSRERQGIPEPPPGSHQPGNRASQDLNNNNQKQEDES 527

Query: 271 LAIRHKITSALAGLNSKEEEEE------EQEDSAFSDESADDNSSDEDETVE 316
                K +     L   +EE+E      + + S    ES DD  SD ++ V+
Sbjct: 528 TNPIGKTSLRYQELTPVQEEDEPEDQTDDDDSSLPPLESDDDPGSDNEQGVD 579



 Score = 33.4 bits (76), Expect = 0.87
 Identities = 26/145 (17%), Positives = 47/145 (32%), Gaps = 39/145 (26%)

Query: 128 SDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDD 187
           S D N+ +   +D++ +  G      +++   Q+         +D+ ED  DDDD     
Sbjct: 513 SQDLNNNNQKQEDESTNPIGKTSLRYQELTPVQE---------EDEPEDQTDDDDSSLPP 563

Query: 188 DETDDDNDEDNEE--------------------------ESEEEEENEEDDDEERKAEKK 221
            E+DDD   DNE+                              ++      + +    + 
Sbjct: 564 LESDDDPGSDNEQGVDLTEVAPPAPVYRDEKEQDEIPHPAQNPQDPTGSIGNVDSDILRS 623

Query: 222 NSKNEHVQELT----AKQKNPKAFT 242
           NSK     E T     + + P    
Sbjct: 624 NSKPSAPLEETYMHLLRTQGPFDAI 648


>gnl|CDD|206753 cd04178, Nucleostemin_like, A circularly permuted subfamily of the
           Ras GTPases.  Nucleostemin (NS) is a nucleolar protein
           that functions as a regulator of cell growth and
           proliferation in stem cells and in several types of
           cancer cells, but is not expressed in the differentiated
           cells of most mammalian adult tissues. NS shuttles
           between the nucleolus and nucleoplasm bidirectionally at
           a rate that is fast and independent of cell type.
           Lowering GTP levels decreases the nucleolar retention of
           NS, and expression of NS is abruptly down-regulated
           during differentiation prior to terminal cell division.
           Found only in eukaryotes, NS consists of an N-terminal
           basic domain, a coiled-coil domain, a GTP-binding
           domain, an intermediate domain, and a C-terminal acidic
           domain. Experimental evidence indicates that NS uses its
           GTP-binding property as a molecular switch to control
           the transition between the nucleolus and nucleoplasm,
           and this process involves interaction between the basic,
           GTP-binding, and intermediate domains of the protein.
          Length = 171

 Score = 34.5 bits (80), Expect = 0.17
 Identities = 11/18 (61%), Positives = 13/18 (72%)

Query: 436 PIVVAVVGPPQVGKSTLI 453
            I V VVG P VGKS++I
Sbjct: 116 SITVGVVGYPNVGKSSVI 133


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
           Provisional.
          Length = 1136

 Score = 35.9 bits (82), Expect = 0.17
 Identities = 44/205 (21%), Positives = 87/205 (42%), Gaps = 23/205 (11%)

Query: 129 DDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDD 188
           D E  E    +   + G+  + +Y + + S +       + + D ++D    DDD  D  
Sbjct: 350 DIETVEPQQEEPVQDVGEHQINEYGDILPSLKASINNSAINYYDAVKDGKYLDDDSSDAL 409

Query: 189 ETDDDNDEDNEEES------EEEEENEEDDDEERKAE---KKNSKNEHVQELTA-----K 234
            TD+D   D E++         E+E+ ED++EE   +   ++ S +EHV+E  A     +
Sbjct: 410 YTDEDLLFDLEKQKYMDMLDGSEDESVEDNEEEHSGDANEEELSVDEHVEEHNADDSGEQ 469

Query: 235 QKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQ 294
           Q + ++   QSV   E     +    V +P ++D        +       + +E   +E 
Sbjct: 470 QSDDESGEHQSV--NEIVEEQSVNEHVEEPTVAD-------IVEQETVDEHVEEPAVDEN 520

Query: 295 EDSAFSDESADDNSSDEDETVEPKT 319
           E+   +DE  ++ +  E+   E  +
Sbjct: 521 EEQQTADEHVEEPTIAEEHVEEEIS 545



 Score = 35.9 bits (82), Expect = 0.18
 Identities = 23/102 (22%), Positives = 59/102 (57%), Gaps = 1/102 (0%)

Query: 130  DENDEDDTSDDDTESGQGSMKKYKEQVVSS-QDGRKRRKVMFDDDLEDNMDDDDDDDDDD 188
            +EN E++  ++  E+ + ++++Y E+ V   ++  +       +++E+N +++ +++ ++
Sbjct: 1000 EENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEE 1059

Query: 189  ETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
              ++ ++E+ EE  E  EEN E++ EE   E      E+V+E
Sbjct: 1060 NIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEE 1101



 Score = 34.8 bits (79), Expect = 0.35
 Identities = 18/60 (30%), Positives = 42/60 (70%)

Query: 171  DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
            ++++E+N++++ +++ ++  +++ +E +EE  EE EEN E+ DEE   E + +  E+V+E
Sbjct: 996  EENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEE 1055



 Score = 34.0 bits (77), Expect = 0.62
 Identities = 31/134 (23%), Positives = 69/134 (51%), Gaps = 16/134 (11%)

Query: 103  ESTEKSMMEID--GIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQ 160
            E+ E+++ E D   +          D+ + E  E++  ++  E+ + ++++Y E+ V   
Sbjct: 1013 ENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEE- 1071

Query: 161  DGRKRRKVMFDDDLEDNMDDDDDD--DDDDETDDDNDEDNEEESEEE--EENEEDDDEER 216
                      ++++E+N++++ ++  +++ E  ++N E+N EE+ EE  EEN E++ EE 
Sbjct: 1072 ---------IEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEY 1122

Query: 217  KAEKKNSKNEHVQE 230
              E     NE   E
Sbjct: 1123 DDENPEEHNEEYDE 1136



 Score = 33.6 bits (76), Expect = 0.84
 Identities = 26/118 (22%), Positives = 61/118 (51%), Gaps = 13/118 (11%)

Query: 131  ENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDD--D 188
            E + ++  +++ E  + ++++Y E+ V   +      V  ++++E+N+++ D+++ +  +
Sbjct: 1016 EENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENV--EENIEENIEEYDEENVEEIE 1073

Query: 189  ETDDDNDEDNEEESEEE---------EENEEDDDEERKAEKKNSKNEHVQELTAKQKN 237
            E  ++N E+N EE+ EE         EEN E++ EE   E      E   +   ++ N
Sbjct: 1074 ENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEEHN 1131



 Score = 33.2 bits (75), Expect = 0.97
 Identities = 26/103 (25%), Positives = 58/103 (56%), Gaps = 2/103 (1%)

Query: 130  DENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDE 189
            +EN E++  ++  E+ + ++++  E+ V         + + + D E+  + +++ ++ DE
Sbjct: 980  EENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDE 1039

Query: 190  TDDDNDEDNEEESEEE--EENEEDDDEERKAEKKNSKNEHVQE 230
             + +  E+N EE+ EE  EEN E+ DEE   E + +  E+++E
Sbjct: 1040 ENVEEIEENAEENVEENIEENIEEYDEENVEEIEENIEENIEE 1082



 Score = 33.2 bits (75), Expect = 1.1
 Identities = 23/112 (20%), Positives = 50/112 (44%), Gaps = 2/112 (1%)

Query: 122 AISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
           A   + S DE+ E+  +DD  E          + V    + +   + + +  + D ++ +
Sbjct: 447 ANEEELSVDEHVEEHNADDSGEQQSDDESGEHQSVNEIVEEQSVNEHVEEPTVADIVEQE 506

Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTA 233
             D+  +E     DE+ E+++ +E   E    EE   E+ ++  EH++E  +
Sbjct: 507 TVDEHVEE--PAVDENEEQQTADEHVEEPTIAEEHVEEEISTAEEHIEEPAS 556



 Score = 32.1 bits (72), Expect = 2.4
 Identities = 35/168 (20%), Positives = 84/168 (50%), Gaps = 6/168 (3%)

Query: 171  DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEE-EENEEDDDEERKAEK-KNSKNEHV 228
            ++D E+N+++D +++ ++  +++ +E+ EE  EE  EEN E++ EE   E  + +  E+V
Sbjct: 948  EEDAEENVEEDAEENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENV 1007

Query: 229  QELTAK--QKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNS 286
            +E   +  ++N + +  ++V + E          V +   + ++  +   I   +   + 
Sbjct: 1008 EENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEE-NVEENIEENIEEYDE 1066

Query: 287  KEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKD-SKNPEEDTGLN 333
            +  EE E+      +E+ ++N  +  E +E    ++  +N EE+   N
Sbjct: 1067 ENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEEN 1114


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 34.6 bits (80), Expect = 0.18
 Identities = 15/49 (30%), Positives = 29/49 (59%)

Query: 186 DDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAK 234
            +++ +++ +E   EE+ E+E  EE  +EE +AE  + +   + EL AK
Sbjct: 4   KNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAK 52



 Score = 33.8 bits (78), Expect = 0.26
 Identities = 21/92 (22%), Positives = 44/92 (47%), Gaps = 13/92 (14%)

Query: 149 MKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEEN 208
           M++  EQV    +         + ++E+ ++D+ +++  +E  +    D E+    E E 
Sbjct: 1   MEEKNEQVEEEVE---------ETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEA 51

Query: 209 EEDDDEER----KAEKKNSKNEHVQELTAKQK 236
           + D+ EER    +A+ +N K    +E  A +K
Sbjct: 52  KLDELEERYLRLQADFENYKRRIQKENEAAEK 83


>gnl|CDD|206682 cd01895, EngA2, EngA2 GTPase contains the second domain of EngA.
           This EngA2 subfamily CD represents the second GTPase
           domain of EngA and its orthologs, which are composed of
           two adjacent GTPase domains. Since the sequences of the
           two domains are more similar to each other than to other
           GTPases, it is likely that an ancient gene duplication,
           rather than a fusion of evolutionarily distinct GTPases,
           gave rise to this family. Although the exact function of
           these proteins has not been elucidated, studies have
           revealed that the E. coli EngA homolog, Der, and
           Neisseria gonorrhoeae EngA are essential for cell
           viability. A recent report suggests that E. coli Der
           functions in ribosome assembly and stability.
          Length = 174

 Score = 34.3 bits (80), Expect = 0.18
 Identities = 11/24 (45%), Positives = 17/24 (70%)

Query: 435 PPIVVAVVGPPQVGKSTLIRCLIK 458
            PI +A++G P VGKS+L+  L+ 
Sbjct: 1   DPIKIAIIGRPNVGKSSLLNALLG 24


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 35.7 bits (82), Expect = 0.18
 Identities = 18/88 (20%), Positives = 33/88 (37%), Gaps = 12/88 (13%)

Query: 164 KRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE--------- 214
           +R+  + D          DD    +    + +  N EESE EEE E+ +DE         
Sbjct: 347 ERKDYILDSSSVPLEKQFDDILYFE--KMEIENRNPEESEHEEEVEDYEDENDHSKRICD 404

Query: 215 -ERKAEKKNSKNEHVQELTAKQKNPKAF 241
            +       + +E    L    +N +++
Sbjct: 405 DDELENHFRAADEKNSHLVVGFRNERSY 432



 Score = 33.4 bits (76), Expect = 0.96
 Identities = 12/77 (15%), Positives = 32/77 (41%), Gaps = 2/77 (2%)

Query: 161 DGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
           +  ++  ++    +      DD    +    ++ + +  E  EE E+ E+++D  ++   
Sbjct: 345 EAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRICD 404

Query: 221 KNSKNEHVQELTAKQKN 237
            +    H +   A +KN
Sbjct: 405 DDELENHFRA--ADEKN 419


>gnl|CDD|224082 COG1160, COG1160, Predicted GTPases [General function prediction
           only].
          Length = 444

 Score = 35.2 bits (82), Expect = 0.18
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 433 EPPPIVVAVVGPPQVGKSTLIRCLIK 458
           E  PI +A++G P VGKS+LI  ++ 
Sbjct: 175 ETDPIKIAIIGRPNVGKSSLINAILG 200



 Score = 31.4 bits (72), Expect = 3.2
 Identities = 13/21 (61%), Positives = 14/21 (66%)

Query: 438 VVAVVGPPQVGKSTLIRCLIK 458
           VVA+VG P VGKSTL   L  
Sbjct: 5   VVAIVGRPNVGKSTLFNRLTG 25


>gnl|CDD|213223 cd03256, ABC_PhnC_transporter, ATP-binding cassette domain of the
           binding protein-dependent phosphonate transport system. 
           Phosphonates are a class of organophosphorus compounds
           characterized by a chemically stable
           carbon-to-phosphorus (C-P) bond. Phosphonates are
           widespread among naturally occurring compounds in all
           kingdoms of wildlife, but only prokaryotic
           microorganisms are able to cleave this bond. Certain
           bacteria such as E. coli can use alkylphosphonates as a
           phosphorus source. ABC transporters are a subset of
           nucleotide hydrolases that contain a signature motif,
           Q-loop, and H-loop/switch region, in addition to, the
           Walker A motif/P-loop and Walker B motif commonly found
           in a number of ATP- and GTP-binding and hydrolyzing
           proteins.
          Length = 241

 Score = 34.9 bits (81), Expect = 0.19
 Identities = 12/19 (63%), Positives = 15/19 (78%)

Query: 438 VVAVVGPPQVGKSTLIRCL 456
            VA++GP   GKSTL+RCL
Sbjct: 29  FVALIGPSGAGKSTLLRCL 47


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
           found in a wide range of eukaryotes. It is a nuclear
           protein and is suggested to be DNA binding. In plants,
           this family is essential for correct circadian clock
           functioning by acting as a light-quality regulator
           coordinating the activities of blue and red light
           signalling pathways during plant growth - inhibiting
           growth in red light but promoting growth in blue light.
          Length = 233

 Score = 34.6 bits (80), Expect = 0.19
 Identities = 14/43 (32%), Positives = 23/43 (53%)

Query: 186 DDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHV 228
            DD+ ++D DE  +E+   +E +E D+      +KK  KN  V
Sbjct: 14  GDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSV 56


>gnl|CDD|213229 cd03262, ABC_HisP_GlnQ, ATP-binding cassette domain of the
           histidine and glutamine transporters.  HisP and GlnQ are
           the ATP-binding components of the bacterial periplasmic
           histidine and glutamine permeases, respectively.
           Histidine permease is a multi-subunit complex containing
           the HisQ and HisM integral membrane subunits and two
           copies of HisP. HisP has properties intermediate between
           those of integral and peripheral membrane proteins and
           is accessible from both sides of the membrane,
           presumably by its interaction with HisQ and HisM. The
           two HisP subunits form a homodimer within the complex.
           The domain structure of the amino acid uptake systems is
           typical for prokaryotic extracellular solute binding
           protein-dependent uptake systems. All of the amino acid
           uptake systems also have at least one, and in a few
           cases, two extracellular solute binding proteins located
           in the periplasm of Gram-negative bacteria, or attached
           to the cell membrane of Gram-positive bacteria. The
           best-studied member of the PAAT (polar amino acid
           transport) family is the HisJQMP system of S.
           typhimurium, where HisJ is the extracellular solute
           binding proteins and HisP is the ABC protein.
          Length = 213

 Score = 34.4 bits (80), Expect = 0.20
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 438 VVAVVGPPQVGKSTLIRCL 456
           VV ++GP   GKSTL+RC+
Sbjct: 28  VVVIIGPSGSGKSTLLRCI 46


>gnl|CDD|213179 cd00267, ABC_ATPase, ATP-binding cassette transporter
           nucleotide-binding domain.  ABC transporters are a large
           family of proteins involved in the transport of a wide
           variety of different compounds, like sugars, ions,
           peptides, and more complex organic molecules. The
           nucleotide-binding domain shows the highest similarity
           between all members of the family. ABC transporters are
           a subset of nucleotide hydrolases that contain a
           signature motif, Q-loop, and H-loop/switch region, in
           addition to, the Walker A motif/P-loop and Walker B
           motif commonly found in a number of ATP- and GTP-binding
           and hydrolyzing proteins.
          Length = 157

 Score = 33.8 bits (78), Expect = 0.20
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 438 VVAVVGPPQVGKSTLIRCL 456
           +VA+VGP   GKSTL+R +
Sbjct: 27  IVALVGPNGSGKSTLLRAI 45


>gnl|CDD|224051 COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase
           component [Amino acid transport and metabolism].
          Length = 240

 Score = 34.8 bits (81), Expect = 0.21
 Identities = 12/19 (63%), Positives = 15/19 (78%)

Query: 438 VVAVVGPPQVGKSTLIRCL 456
           VV ++GP   GKSTL+RCL
Sbjct: 30  VVVIIGPSGSGKSTLLRCL 48


>gnl|CDD|118064 pfam09528, Ehrlichia_rpt, Ehrlichia tandem repeat (Ehrlichia_rpt). 
           This entry represents 77 residues of an 80 amino acid
           (240 nucleotide) tandem repeat, found in a variable
           number of copies in an immunodominant outer membrane
           protein of Ehrlichia chaffeensis, a tick-borne obligate
           intracellular pathogen.
          Length = 707

 Score = 35.4 bits (80), Expect = 0.23
 Identities = 60/265 (22%), Positives = 107/265 (40%), Gaps = 26/265 (9%)

Query: 89  MEIFEFLN--ICQVHGESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQ 146
           M I  F N    +   E +E    E++       G  SS+   +EN  +  ++D   +  
Sbjct: 25  MRILGFGNKNDEKPQDEDSEILQEELEPAVDDNVGNPSSEVGKEENAPEVKAEDLEPAVA 84

Query: 147 GSMKKYKEQV---VSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESE 203
            S++    +V   VS  +  +    +  +DL+  +D D    + +  D       EEE+ 
Sbjct: 85  ESVEHSSSEVGKEVSETEKEESNPEVKAEDLQPAVDGDIAHHESEVGDKPAKTSKEEENP 144

Query: 204 EEE----ENEEDDDEERKAEK------KNSKNEHVQELTAKQKNPK---AFTFQSVIKGE 250
           E E    E  +DD  E   ++      ++SK E   E+ A+   P    +    S   GE
Sbjct: 145 EIEAEDGEPAKDDGIEESHQEEDEIVSESSKEEFTAEVKAEDLQPAVDGSIEHSSSEVGE 204

Query: 251 RKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLN------SKEEE--EEEQEDSAFSDE 302
              ++ K    P+ +  D   A+   +    + +       SKEEE  E + ED   + +
Sbjct: 205 EVSKTEKEESNPEVKAEDLQPAVDDDVAHHESEVGDKPAETSKEEETPEVKAEDLQPAVD 264

Query: 303 SADDNSSDEDETVEPKTHKDSKNPE 327
            + ++SS E E  + +T K+   PE
Sbjct: 265 GSVEHSSSEIEEHQGETEKEEGIPE 289



 Score = 33.8 bits (76), Expect = 0.62
 Identities = 38/200 (19%), Positives = 73/200 (36%), Gaps = 16/200 (8%)

Query: 133 DEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDD 192
           D     DDD    +  +     +    ++  + +       ++ +++    + ++ + + 
Sbjct: 222 DLQPAVDDDVAHHESEVGDKPAETSKEEETPEVKAEDLQPAVDGSVEHSSSEIEEHQGET 281

Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPK--AFTFQSVIKGE 250
           + +E   E   E+ +   DD  E  + +     E V E   ++ NP+  A   Q    GE
Sbjct: 282 EKEEGIPESHAEDLQPAVDDIVEHPSSEPFVAEEEVSETEKEENNPEVLAEDLQDAADGE 341

Query: 251 RKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEE---EEEEQEDSAFSDESADDN 307
                    VV +          R        G   KEE   E   ++D   SD S +  
Sbjct: 342 SGVSDQPAQVVEE----------RESEIEEHQGETEKEEGIPESHAEDDEIASDPSIEHF 391

Query: 308 SSDEDETVEPKTHKDSKNPE 327
           S++  + V  +T K+  NPE
Sbjct: 392 SAEVGKEVS-ETEKEESNPE 410


>gnl|CDD|131353 TIGR02300, FYDLN_acid, TIGR02300 family protein.  Members of this
           family are bacterial proteins with a conserved motif
           [KR]FYDLN, sometimes flanked by a pair of CXXC motifs,
           followed by a long region of low complexity sequence in
           which roughly half the residues are Asp and Glu,
           including multiple runs of five or more acidic residues.
           The function of members of this family is unknown.
          Length = 129

 Score = 33.4 bits (76), Expect = 0.24
 Identities = 25/80 (31%), Positives = 33/80 (41%), Gaps = 4/80 (5%)

Query: 129 DDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDD--- 185
           DDE +     + D E  + +     E+     D         DD  ED    DDDDD   
Sbjct: 50  DDEKEAVAVKEVDPEIDEAAELVSLEEADEEVDPGAPTPAGGDDLAEDEDIADDDDDVTF 109

Query: 186 -DDDETDDDNDEDNEEESEE 204
            +D+E DD +DED    S E
Sbjct: 110 LEDEEDDDIDDEDIIGVSGE 129


>gnl|CDD|233037 TIGR00592, pol2, DNA polymerase (pol2).  All proteins in this
           superfamily for which functions are known are DNA
           polymerases.This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 1172

 Score = 35.4 bits (81), Expect = 0.24
 Identities = 41/275 (14%), Positives = 72/275 (26%), Gaps = 30/275 (10%)

Query: 126 DDSDDENDEDDTSDDDTESGQ-GSMKKYKEQVVSSQDGRKRRKVMFDDDLED-NMDDDDD 183
           D  +   DEDD  D D + G     KK  ++           K +          D    
Sbjct: 41  DGREFFPDEDDILDLDKDDGSAAEAKKKDKENHKKVTKPNNIKAVRIACAPKKKKDRKKS 100

Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTF 243
              D    D   E N+ E+ +            +       +    + ++    P    F
Sbjct: 101 LGKDGLLGDILQELNKTETAQ------------RKITPRLVSVPKLKFSSPADVPAINDF 148

Query: 244 QSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEE---------- 293
            +              V  +  +    + +  K      G    E + E           
Sbjct: 149 SNHHPAVVDIVKKAIPVSTRYLLEKILIPVPLKRAEFAGGDVQMEGDPELKLASFDIETY 208

Query: 294 ---QEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAAS---AFL 347
               +D    DE+  D       T       D ++  E   + WK        S   +  
Sbjct: 209 FHDGKDFFPGDENPADEEIMISTTPVIAKQWDYESEPEARVVTWKKPDKPTTGSYVESVS 268

Query: 348 ERQANIVNLAKYVYGDMEDVSVTMEGDEAPNKKVH 382
           E  + I      +  +  DV +T+ GD      + 
Sbjct: 269 EEISMIKRFWDVIDQEDTDVEITVNGDNFDLVYLA 303


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 34.8 bits (81), Expect = 0.24
 Identities = 11/31 (35%), Positives = 23/31 (74%)

Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDNE 199
           M+DD   + +++ +++DDDD  DD +++D+E
Sbjct: 352 MWDDYHREQLEEVEEEDDDDWDDDWDEDDDE 382



 Score = 34.1 bits (79), Expect = 0.46
 Identities = 11/24 (45%), Positives = 19/24 (79%)

Query: 176 DNMDDDDDDDDDDETDDDNDEDNE 199
           + ++++DDDD DD+ D+D+DE  E
Sbjct: 362 EEVEEEDDDDWDDDWDEDDDEGVE 385



 Score = 33.7 bits (78), Expect = 0.51
 Identities = 13/45 (28%), Positives = 23/45 (51%)

Query: 159 SQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESE 203
            +     +     DD      ++ +++DDD+ DDD DED++E  E
Sbjct: 341 EEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDEGVE 385



 Score = 30.2 bits (69), Expect = 7.4
 Identities = 10/20 (50%), Positives = 13/20 (65%), Gaps = 2/20 (10%)

Query: 171 DDDLEDNMDDDDDDDDDDET 190
           DDD +D  D D+DDD+  E 
Sbjct: 369 DDDWDD--DWDEDDDEGVEV 386


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 33.9 bits (78), Expect = 0.25
 Identities = 9/53 (16%), Positives = 22/53 (41%)

Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEH 227
           + +    D D D  +   D+  + ++E E E++ E+      +     S+ + 
Sbjct: 89  KKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKP 141



 Score = 30.4 bits (69), Expect = 3.6
 Identities = 10/58 (17%), Positives = 30/58 (51%), Gaps = 2/58 (3%)

Query: 185 DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFT 242
               + D D D+ ++++ ++ E+ +E + E++  +   S +E +  L+  +  P+ + 
Sbjct: 90  KSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLS--ELKPRKYA 145


>gnl|CDD|217424 pfam03205, MobB, Molybdopterin guanine dinucleotide synthesis
           protein B.  This protein contains a P-loop.
          Length = 126

 Score = 33.1 bits (76), Expect = 0.25
 Identities = 12/21 (57%), Positives = 15/21 (71%)

Query: 438 VVAVVGPPQVGKSTLIRCLIK 458
           +V VVGP   GK+TLIR L+ 
Sbjct: 2   IVLVVGPKDSGKTTLIRKLLN 22


>gnl|CDD|224085 COG1163, DRG, Predicted GTPase [General function prediction only].
          Length = 365

 Score = 34.6 bits (80), Expect = 0.27
 Identities = 12/18 (66%), Positives = 14/18 (77%)

Query: 439 VAVVGPPQVGKSTLIRCL 456
           VA+VG P VGKSTL+  L
Sbjct: 66  VALVGFPSVGKSTLLNKL 83


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 33.9 bits (77), Expect = 0.28
 Identities = 23/86 (26%), Positives = 44/86 (51%), Gaps = 3/86 (3%)

Query: 149 MKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEEN 208
           +KK K  ++  +   K R+    ++LE+ M+ +  ++  DE +D  +++ E E +  EE 
Sbjct: 115 IKKKKSLIIRQEQIEKARQER--EELEERMEWERREEKIDEREDQEEQEREREEQTIEE- 171

Query: 209 EEDDDEERKAEKKNSKNEHVQELTAK 234
           + DD E    E+  S+ E   + T K
Sbjct: 172 QSDDSEHEIIEQDESETESDDDKTEK 197


>gnl|CDD|234401 TIGR03925, T7SS_EccC_b, type VII secretion protein EccCb.  This
           model represents the C-terminal domain or EccCb subunit
           of the type VII secretion protein EccC as found in the
           Actinobacteria. Type VII secretion is defined more
           broadly as including secretion systems for ESAT-6-like
           proteins in the Firmicutes as well as in the
           Actinobacteria, but this family does not show close
           homologs in the Firmicutes [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 566

 Score = 34.9 bits (81), Expect = 0.28
 Identities = 16/28 (57%), Positives = 19/28 (67%), Gaps = 2/28 (7%)

Query: 439 VAVVGPPQVGKSTLIRCLIKNF--TKTP 464
           VA+VG PQ GKST +R LI +   T TP
Sbjct: 82  VAIVGGPQSGKSTALRTLILSLALTHTP 109


>gnl|CDD|183063 PRK11264, PRK11264, putative amino-acid ABC transporter ATP-binding
           protein YecC; Provisional.
          Length = 250

 Score = 34.3 bits (79), Expect = 0.28
 Identities = 13/29 (44%), Positives = 20/29 (68%), Gaps = 2/29 (6%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLS 466
           VVA++GP   GK+TL+RC+  N  + P +
Sbjct: 31  VVAIIGPSGSGKTTLLRCI--NLLEQPEA 57


>gnl|CDD|206726 cd04163, Era, E. coli Ras-like protein (Era) is a multifunctional
           GTPase.  Era (E. coli Ras-like protein) is a
           multifunctional GTPase found in all bacteria except some
           eubacteria. It binds to the 16S ribosomal RNA (rRNA) of
           the 30S subunit and appears to play a role in the
           assembly of the 30S subunit, possibly by chaperoning the
           16S rRNA. It also contacts several assembly elements of
           the 30S subunit. Era couples cell growth with
           cytokinesis and plays a role in cell division and energy
           metabolism. Homologs have also been found in eukaryotes.
           Era contains two domains: the N-terminal GTPase domain
           and a C-terminal domain KH domain that is critical for
           RNA binding. Both domains are important for Era
           function. Era is functionally able to compensate for
           deletion of RbfA, a cold-shock adaptation protein that
           is required for efficient processing of the 16S rRNA.
          Length = 168

 Score = 33.6 bits (78), Expect = 0.30
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query: 439 VAVVGPPQVGKSTLI 453
           VA++G P VGKSTL+
Sbjct: 6   VAIIGRPNVGKSTLL 20


>gnl|CDD|224742 COG1829, COG1829, Predicted archaeal kinase (sugar kinase
           superfamily) [General function prediction only].
          Length = 283

 Score = 34.3 bits (79), Expect = 0.31
 Identities = 18/78 (23%), Positives = 28/78 (35%), Gaps = 14/78 (17%)

Query: 700 YTPQHVACMAHFWGPITRS--------GTGFL---AVQDVAKREPGFRVIATGTILDANQ 748
           + P H+     F+ P+           G G      V    +   G  V   G  +D   
Sbjct: 6   FVPGHITG---FFVPVIGKDPLKSGSIGAGVALERGVTVEVRFGEGTGVRLNGKKIDLPI 62

Query: 749 TAEVTKKLKLTGVPMKIY 766
           T +V +KL   GV ++I 
Sbjct: 63  TRKVIEKLGPDGVGVRIE 80


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 34.1 bits (78), Expect = 0.31
 Identities = 24/97 (24%), Positives = 38/97 (39%), Gaps = 7/97 (7%)

Query: 512 GDFEDLETGEKHSGDKSGGGSGGVSGGGSGDDKPKTRAELMEKK--RKLKEQFDAEYDDK 569
           GD ++ E  EK     S   S       SG  K K R E  +K    + K+Q +   +D+
Sbjct: 135 GDVKNAEKFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDE 194

Query: 570 DG-----GGNTYYDDLKTQATRQAELNRQQFHDLDDN 601
           D      GG++      +      +  RQ+ HD +  
Sbjct: 195 DKRPGGGGGSSGGQSGLSTKDEPPKEKRQKHHDPERR 231


>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin). 
          Length = 585

 Score = 34.6 bits (79), Expect = 0.33
 Identities = 21/91 (23%), Positives = 36/91 (39%), Gaps = 7/91 (7%)

Query: 142 TESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDE------TDDDND 195
               Q ++K    + V   DG+ ++K    D+ E +  DD+DD                D
Sbjct: 166 NPQSQATLKS-VFEEVGKLDGQSKQKRERSDEREKSYQDDEDDTYRQNNIPYEDVVGGED 224

Query: 196 EDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
            +  EE EE++  EE    + +  K  S  +
Sbjct: 225 WNPIEEGEEDQTQEEVKRSKERTHKGRSLPD 255


>gnl|CDD|226907 COG4530, COG4530, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 129

 Score = 32.9 bits (75), Expect = 0.33
 Identities = 20/72 (27%), Positives = 36/72 (50%), Gaps = 2/72 (2%)

Query: 146 QGSMKKYKEQVVSSQDGRKRRK--VMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESE 203
             + ++ +E+VV   D        V  +D  +D    DD  D  D+ D D D+D+++   
Sbjct: 47  DAAAEEEEEEVVKEVDAENEEVEVVSLEDADDDPKGGDDLPDLGDDEDVDLDDDDDDTFL 106

Query: 204 EEEENEEDDDEE 215
           E+EE+++DD   
Sbjct: 107 EDEEDDDDDVSG 118



 Score = 31.8 bits (72), Expect = 0.81
 Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 2/40 (5%)

Query: 170 FDDDLEDNMDDDDDDD--DDDETDDDNDEDNEEESEEEEE 207
             DD + ++DDDDDD   +D+E DDD+         ++EE
Sbjct: 89  LGDDEDVDLDDDDDDTFLEDEEDDDDDVSGIIGVPGDDEE 128


>gnl|CDD|234624 PRK00089, era, GTPase Era; Reviewed.
          Length = 292

 Score = 34.3 bits (80), Expect = 0.37
 Identities = 12/19 (63%), Positives = 15/19 (78%)

Query: 439 VAVVGPPQVGKSTLIRCLI 457
           VA+VG P VGKSTL+  L+
Sbjct: 8   VAIVGRPNVGKSTLLNALV 26


>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex.  This Mediator
           complex subunit was formerly known as Srb4 in yeasts or
           Trap80 in Drosophila and human. The Med17 subunit is
           located within the head domain and is essential for cell
           viability to the extent that a mutant strain of
           cerevisiae lacking it shows all RNA polymerase
           II-dependent transcription ceasing at non-permissive
           temperatures.
          Length = 454

 Score = 34.3 bits (79), Expect = 0.39
 Identities = 12/54 (22%), Positives = 29/54 (53%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
           + L + +  +    D  E  D+ ++D E+++++ EEN++  +E  KA  +  + 
Sbjct: 52  ESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARDEVLEQ 105


>gnl|CDD|224617 COG1703, ArgK, Putative periplasmic protein kinase ArgK and related
           GTPases of G3E family [Amino acid transport and
           metabolism].
          Length = 323

 Score = 34.2 bits (79), Expect = 0.39
 Identities = 12/37 (32%), Positives = 16/37 (43%)

Query: 431 PLEPPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSV 467
           P      V+ + G P  GKSTLI  L +   +    V
Sbjct: 46  PRTGNAHVIGITGVPGAGKSTLIEALGRELRERGHRV 82


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 34.1 bits (78), Expect = 0.42
 Identities = 22/122 (18%), Positives = 51/122 (41%), Gaps = 16/122 (13%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
             D  +DEN  D+ + D T          ++Q    +   +RR     + +  N+++   
Sbjct: 186 GPDSDEDENPLDEEAPDMTPE-----TSKQDQKEERRAAVERRLAELVEMINWNLEERRR 240

Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEERK-----------AEKKNSKNEHVQELT 232
           D   ++  ++N E + ++ +E  E+ ED + + +           +E +  K+E +  L 
Sbjct: 241 DLRKEQELEENVERDSDDEDEHGEDSEDGETKPESYITSEYIERISEIRKMKDERLSSLA 300

Query: 233 AK 234
           + 
Sbjct: 301 SM 302


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 33.9 bits (77), Expect = 0.47
 Identities = 14/24 (58%), Positives = 18/24 (75%)

Query: 192 DDNDEDNEEESEEEEENEEDDDEE 215
           D++DED EEE EEEEE  E ++ E
Sbjct: 133 DESDEDEEEEEEEEEEEAEVEENE 156



 Score = 32.4 bits (73), Expect = 1.2
 Identities = 33/142 (23%), Positives = 53/142 (37%), Gaps = 12/142 (8%)

Query: 192 DDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIK--G 249
           D ++E+ + +S EEE  EE  +EE         NE       ++   +  T  +V    G
Sbjct: 49  DSSEENGDGDSSEEEGEEETSNEE-------ENNEDSDGNEDEEAEAENTTLSTVTLGYG 101

Query: 250 ERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDE---SADD 306
                 T    +   ++  K      K T        +EEEEEE+E+ A  +E     + 
Sbjct: 102 GDATPGTGNIGLAALQLPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNG 161

Query: 307 NSSDEDETVEPKTHKDSKNPEE 328
            S++  E           N EE
Sbjct: 162 TSTNSTEVDHGNGSSGGDNGEE 183



 Score = 31.2 bits (70), Expect = 2.9
 Identities = 14/35 (40%), Positives = 20/35 (57%)

Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS 223
           E + D DE+ EEE EEEE   E++++       NS
Sbjct: 132 EDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNS 166



 Score = 30.4 bits (68), Expect = 5.2
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 188 DETDDDNDEDNEEESEEEEENEED 211
           DE+D+D +E+ EEE EE E  E +
Sbjct: 133 DESDEDEEEEEEEEEEEAEVEENE 156


>gnl|CDD|206752 cd01859, MJ1464, An uncharacterized, circularly permuted subfamily
           of the Ras GTPases.  This family represents archaeal
           GTPase typified by the protein MJ1464 from Methanococcus
           jannaschii. The members of this family show a circular
           permutation of the GTPase signature motifs so that
           C-terminal strands 5, 6, and 7 (strands 6 contain the
           NKxD motif) are relocated to the N terminus.
          Length = 157

 Score = 32.7 bits (75), Expect = 0.47
 Identities = 12/21 (57%), Positives = 17/21 (80%)

Query: 436 PIVVAVVGPPQVGKSTLIRCL 456
           P++V VVG P+VGKS++I  L
Sbjct: 99  PVIVGVVGYPKVGKSSIINAL 119


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 34.0 bits (78), Expect = 0.52
 Identities = 21/78 (26%), Positives = 36/78 (46%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
           +++ ED  DDDD  DD+DE +D      E  +  E   +   +++ + E  +   + V E
Sbjct: 140 EEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDADLSKKDVLE 199

Query: 231 LTAKQKNPKAFTFQSVIK 248
                K P+  T+  V K
Sbjct: 200 EPKMFKKPEEITWDDVFK 217



 Score = 33.2 bits (76), Expect = 0.96
 Identities = 15/55 (27%), Positives = 27/55 (49%)

Query: 287 KEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQK 341
           +EEE+E+ +D    DE  D++     E V   +       +ED   +  +DL++K
Sbjct: 141 EEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDADLSKK 195



 Score = 30.2 bits (68), Expect = 7.3
 Identities = 28/139 (20%), Positives = 53/139 (38%), Gaps = 29/139 (20%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
           S D+ ++E+++DD    D E          E+V +S +    R                +
Sbjct: 137 SEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDR--------------VKE 182

Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDE----------ERKAEKKNSKNEHVQELT- 232
           DD++DE  D + +D  EE +  ++ EE   +            + +K   ++E +  L  
Sbjct: 183 DDEEDEDADLSKKDVLEEPKMFKKPEEITWDDVFKKLKEIMSARGKKTTDRSEQIDLLEE 242

Query: 233 ----AKQKNPKAFTFQSVI 247
               A+    K     S+I
Sbjct: 243 LLTIAETPYQKVKILFSII 261


>gnl|CDD|227508 COG5181, HSH155, U2 snRNP spliceosome subunit [RNA processing and
           modification].
          Length = 975

 Score = 34.2 bits (78), Expect = 0.54
 Identities = 25/163 (15%), Positives = 47/163 (28%), Gaps = 8/163 (4%)

Query: 168 VMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEH 227
            M D   +            ++ +D     ++ E+     NE  D ++ +      +   
Sbjct: 5   EMEDKAEDSRRSTKQASVWYEDRNDSRGNVSQWEATPRTANESKDLKQEQTPLGYIQVR- 63

Query: 228 VQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQP----RMSDKDLAIRHKITSAL-A 282
            ++L +K +    F   S  K E +  +T     P      R  +       +I   L A
Sbjct: 64  -EDLYSKTRMDPGFQPDSDKKRELELNNTWDMEYPDSKRSSRWDEMGYEPPQEINMCLPA 122

Query: 283 GLNSKEEEEEEQEDSA-FSDESADDNSSDEDETVEPKTHKDSK 324
                  +     D   F  E     + DE +   P       
Sbjct: 123 RGYKALTDFHGYADLGFFKVEDLKYFADDEKDFFMPLLEDREG 165


>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
          Length = 613

 Score = 34.1 bits (79), Expect = 0.57
 Identities = 29/170 (17%), Positives = 51/170 (30%), Gaps = 50/170 (29%)

Query: 170 FDDDLEDNMDDDDDDDDDDETDD------DNDEDNEEESEEE----EENEEDDDEERKAE 219
             +   +  D +D+ +D  +T        D+ E  +  +       +E EED+       
Sbjct: 18  LVEAEAECSDGEDELEDLFDTGSDISDLIDDAEVVQGGNSLALFHQQEAEEDE------- 70

Query: 220 KKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQ---PRM-----SDKDL 271
                 + +Q L                  +RK+ S+          PR+     S +  
Sbjct: 71  ------QQIQAL------------------KRKYLSSPEKSPVADLSPRLEAISLSPRKK 106

Query: 272 AIRHKITSALAGL-NSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTH 320
           A R       +GL NS EEE  +  +    + S       E         
Sbjct: 107 AKRRLFEEQDSGLGNSLEEESTDVVEEEGVESSGGGEGGSETGQGGGNGL 156


>gnl|CDD|238540 cd01120, RecA-like_NTPases, RecA-like NTPases. This family includes
           the NTP binding domain of F1 and V1 H+ATPases, DnaB and
           related helicases as well as bacterial RecA and related
           eukaryotic and archaeal recombinases. This group also
           includes bacterial conjugation proteins and related DNA
           transfer proteins involved in type II and type IV
           secretion.
          Length = 165

 Score = 32.5 bits (74), Expect = 0.58
 Identities = 10/31 (32%), Positives = 14/31 (45%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVI 468
           ++ V GP   GK+TL   L  N       V+
Sbjct: 1   LILVFGPTGSGKTTLALQLALNIATKGGKVV 31


>gnl|CDD|236546 PRK09518, PRK09518, bifunctional cytidylate kinase/GTPase Der;
           Reviewed.
          Length = 712

 Score = 34.0 bits (78), Expect = 0.62
 Identities = 12/20 (60%), Positives = 16/20 (80%)

Query: 438 VVAVVGPPQVGKSTLIRCLI 457
           VVA+VG P VGKSTL+  ++
Sbjct: 277 VVAIVGRPNVGKSTLVNRIL 296


>gnl|CDD|165163 PHA02811, PHA02811, putative host range protein; Provisional.
          Length = 197

 Score = 32.7 bits (74), Expect = 0.62
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEE 207
           D  +  +   DDDDD+D  D+D ++D   + E++ E
Sbjct: 162 DACDYCIISSDDDDDNDNADNDEEDDEVNDIEDDYE 197



 Score = 31.2 bits (70), Expect = 2.0
 Identities = 11/23 (47%), Positives = 18/23 (78%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDD 193
           DDD  DN D+D++DD+ ++ +DD
Sbjct: 173 DDDDNDNADNDEEDDEVNDIEDD 195



 Score = 31.2 bits (70), Expect = 2.4
 Identities = 12/33 (36%), Positives = 21/33 (63%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESE 203
           D  +  + DDDD+D+ D++ +DD   D E++ E
Sbjct: 165 DYCIISSDDDDDNDNADNDEEDDEVNDIEDDYE 197



 Score = 30.8 bits (69), Expect = 3.4
 Identities = 18/50 (36%), Positives = 28/50 (56%), Gaps = 8/50 (16%)

Query: 174 LEDNMDDDDD-------DDDDDETDDDNDEDNEEESEEEEE-NEEDDDEE 215
           L+D ++  DD       D     +DDD+D DN +  EE++E N+ +DD E
Sbjct: 148 LKDYINISDDYYLYDACDYCIISSDDDDDNDNADNDEEDDEVNDIEDDYE 197


>gnl|CDD|224083 COG1161, COG1161, Predicted GTPases [General function prediction
           only].
          Length = 322

 Score = 33.5 bits (77), Expect = 0.66
 Identities = 21/79 (26%), Positives = 30/79 (37%), Gaps = 11/79 (13%)

Query: 382 HRKRQAELTAKQKNPKA--FTFQSVIKGERKFRRKEDIQAKKHHVPQVDRTPLEPPPIVV 439
           + K++  +     + K+     +     E+    K     KK          L    I V
Sbjct: 85  YFKKEEGIKPIFVSAKSRQGGKKIRKALEKLSEEKIKRLKKKG---------LLKRKIRV 135

Query: 440 AVVGPPQVGKSTLIRCLIK 458
            VVG P VGKSTLI  L+ 
Sbjct: 136 GVVGYPNVGKSTLINRLLG 154


>gnl|CDD|217025 pfam02421, FeoB_N, Ferrous iron transport protein B.  Escherichia
           coli has an iron(II) transport system (feo) which may
           make an important contribution to the iron supply of the
           cell under anaerobic conditions. FeoB has been
           identified as part of this transport system. FeoB is a
           large 700-800 amino acid integral membrane protein. The
           N terminus contains a P-loop motif suggesting that iron
           transport may be ATP dependent.
          Length = 190

 Score = 32.8 bits (76), Expect = 0.67
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 437 IVVAVVGPPQVGKSTLIRCL 456
           I +A+VG P VGK+TL   L
Sbjct: 1   ITIALVGNPNVGKTTLFNAL 20


>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain.  This presumed domain is
           found at the C-terminus of a family of FtsJ-like
           methyltransferases. Members of this family are involved
           in 60S ribosomal biogenesis.
          Length = 212

 Score = 32.7 bits (75), Expect = 0.70
 Identities = 11/41 (26%), Positives = 27/41 (65%)

Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
           +   D+DDE +    E++E++ +++E++++DD ++  AE  
Sbjct: 1   ESKSDEDDEFEVVPAEEDEKKVDKDEDDDKDDIDDLTAEAL 41


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 33.4 bits (76), Expect = 0.73
 Identities = 11/47 (23%), Positives = 21/47 (44%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           D   ++  DE  +D  E +E   EE  E E  + +     ++ ++ E
Sbjct: 341 DVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAERLAETE 387



 Score = 32.6 bits (74), Expect = 1.2
 Identities = 9/47 (19%), Positives = 21/47 (44%)

Query: 173 DLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
           D+      D++ +D+ E D+   E+  E    E + ++  +   + E
Sbjct: 341 DVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAERLAETE 387



 Score = 31.4 bits (71), Expect = 2.7
 Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 3/49 (6%)

Query: 171 DDDLEDNMDDDDDD---DDDDETDDDNDEDNEEESEEEEENEEDDDEER 216
             D  D  +D  +     +  E D   +E  +EESE+E E +E   EE 
Sbjct: 318 LADFGDYYEDIFEVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEV 366


>gnl|CDD|213196 cd03229, ABC_Class3, ATP-binding cassette domain of the binding
           protein-dependent transport systems.  This class is
           comprised of all BPD (Binding Protein Dependent) systems
           that are largely represented in archaea and eubacteria
           and are primarily involved in scavenging solutes from
           the environment. ABC transporters are a large family of
           proteins involved in the transport of a wide variety of
           different compounds, like sugars, ions, peptides, and
           more complex organic molecules. The nucleotide binding
           domain shows the highest similarity between all members
           of the family. ABC transporters are a subset of
           nucleotide hydrolases that contain a signature motif,
           Q-loop, and H-loop/switch region, in addition to, the
           Walker A motif/P-loop and Walker B motif commonly found
           in a number of ATP- and GTP-binding and hydrolyzing
           proteins.
          Length = 178

 Score = 32.5 bits (75), Expect = 0.79
 Identities = 11/19 (57%), Positives = 16/19 (84%)

Query: 438 VVAVVGPPQVGKSTLIRCL 456
           +VA++GP   GKSTL+RC+
Sbjct: 28  IVALLGPSGSGKSTLLRCI 46


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 33.0 bits (75), Expect = 0.80
 Identities = 12/38 (31%), Positives = 22/38 (57%)

Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
            +++ + DE+ EEE EEEEE +    E+R++      +
Sbjct: 224 SDSESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKAS 261


>gnl|CDD|165245 PHA02934, PHA02934, Hypothetical protein; Provisional.
          Length = 253

 Score = 33.1 bits (75), Expect = 0.80
 Identities = 18/43 (41%), Positives = 25/43 (58%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
           D D  D++DD DD DD D+ D+  D DN    + E+E   +DD
Sbjct: 20  DIDDIDDIDDIDDIDDIDDIDNVQDIDNGIVQDIEDEASNNDD 62



 Score = 30.0 bits (67), Expect = 7.0
 Identities = 16/45 (35%), Positives = 27/45 (60%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEER 216
           DD++D  D DD DD DD  D DN +D +    ++ E+E  ++++ 
Sbjct: 19  DDIDDIDDIDDIDDIDDIDDIDNVQDIDNGIVQDIEDEASNNDDH 63


>gnl|CDD|206750 cd01857, HSR1_MMR1, A circularly permuted subfamily of the Ras
           GTPases.  Human HSR1 is localized to the human MHC class
           I region and is highly homologous to a putative
           GTP-binding protein, MMR1 from mouse. These proteins
           represent a new subfamily of GTP-binding proteins that
           has only eukaryote members. This subfamily shows a
           circular permutation of the GTPase signature motifs so
           that the C-terminal strands 5, 6, and 7 (strand 6
           contains the G4 box with sequence NKXD) are relocated to
           the N-terminus.
          Length = 140

 Score = 31.8 bits (73), Expect = 0.84
 Identities = 17/59 (28%), Positives = 24/59 (40%), Gaps = 11/59 (18%)

Query: 433 EPPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGP------VTLIIKDSIR--DC 483
                 + +VG P VGKS+LI  L        +SV   P       T+ ++  I   DC
Sbjct: 79  ALNEATIGLVGYPNVGKSSLINAL---VGSKKVSVSSTPGKTKHFQTIFLEPGITLCDC 134


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 33.4 bits (76), Expect = 0.88
 Identities = 31/245 (12%), Positives = 84/245 (34%), Gaps = 17/245 (6%)

Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERK 252
           +N  +   + EE +  E    E+ K   +  + +  ++L  +++N     +  + +    
Sbjct: 184 ENLAELIIDLEELKLQELKLKEQAKKALEYYQLK--EKLELEEENLLYLDYLKLNEERID 241

Query: 253 FRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQ---EDSAFSDESADDNSS 309
                     +   S K    + +   A     +KEEE+E++   E+     +  ++  S
Sbjct: 242 LLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKS 301

Query: 310 DEDETVEPKTHKDSK--NPEEDTGLNWKSDLAQKAASAFLERQANIVNLAKYVYGDMEDV 367
           +  +    K   + K    E++     K    +K     LE++   + + +    + E+ 
Sbjct: 302 ELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQ 361

Query: 368 SVTMEGDEAPNKKVH----------RKRQAELTAKQKNPKAFTFQSVIKGERKFRRKEDI 417
              ++      ++                A+L  ++   K    +          ++ED+
Sbjct: 362 LEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDL 421

Query: 418 QAKKH 422
             ++ 
Sbjct: 422 LKEEK 426



 Score = 30.7 bits (69), Expect = 6.5
 Identities = 24/127 (18%), Positives = 48/127 (37%), Gaps = 1/127 (0%)

Query: 127  DSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDD 186
            D+  E  E++  +   E     +K   E      +    ++   D+  E+   +      
Sbjct: 910  DNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLA 969

Query: 187  DDETD-DDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQS 245
             +E    +     E E +EE  N+++  +ER  E+K      + E T ++       F S
Sbjct: 970  KEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVS 1029

Query: 246  VIKGERK 252
            + +G  K
Sbjct: 1030 INRGLNK 1036


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 33.2 bits (76), Expect = 0.89
 Identities = 12/55 (21%), Positives = 23/55 (41%)

Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFT 242
                   +D +E+   E + + +  +++K EKK  K+E   +L  K       T
Sbjct: 48  KVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKT 102


>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
           selection and in elongation by RNA polymerase II
           [Transcription].
          Length = 521

 Score = 33.1 bits (75), Expect = 0.91
 Identities = 30/179 (16%), Positives = 58/179 (32%), Gaps = 23/179 (12%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
            DD+  +  D+D++++ D  ++   E NE+  EE+ +N    + + K E   +    +  
Sbjct: 30  LDDVLSSSSDEDNNENVDYAEESGGEGNEKSEEEKFKNPYRLEGKFKDE---ADRAKIMA 86

Query: 231 LTAKQKNPKAFTFQSVIKG--ERKF------------------RSTKPTVVPQPRMSDKD 270
           +T  ++    F  +  I    ER+                   RST+   +       K 
Sbjct: 87  MTEIERESILFEREEEISKLMERRELAIRMEQQHRSSGCTDTRRSTRYEPLTSAAEEKKK 146

Query: 271 LAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEED 329
             +  K T         E   E Q    + +    +    E+ T              D
Sbjct: 147 KLLELKKTREREERLYSERHIELQRFKDYKELEESEQGLQEEYTPSYAEEAVEDISRTD 205


>gnl|CDD|213181 cd03214, ABC_Iron-Siderophores_B12_Hemin, ATP-binding component of
           iron-siderophores, vitamin B12 and hemin transporters
           and related proteins.  ABC transporters, involved in the
           uptake of siderophores, heme, and vitamin B12, are
           widely conserved in bacteria and archaea. Only very few
           species lack representatives of the siderophore family
           transporters. The E. coli BtuCD protein is an ABC
           transporter mediating vitamin B12 uptake. The two
           ATP-binding cassettes (BtuD) are in close contact with
           each other, as are the two membrane-spanning subunits
           (BtuC); this arrangement is distinct from that observed
           for the E. coli lipid flippase MsbA. The BtuC subunits
           provide 20 transmembrane helices grouped around a
           translocation pathway that is closed to the cytoplasm by
           a gate region, whereas the dimer arrangement of the BtuD
           subunits resembles the ATP-bound form of the Rad50 DNA
           repair enzyme. A prominent cytoplasmic loop of BtuC
           forms the contact region with the ATP-binding cassette
           and represent a conserved motif among the ABC
           transporters.
          Length = 180

 Score = 32.0 bits (74), Expect = 0.96
 Identities = 12/37 (32%), Positives = 18/37 (48%), Gaps = 5/37 (13%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
           +V ++GP   GKSTL++ L        L    G + L
Sbjct: 27  IVGILGPNGAGKSTLLKTLAGL-----LKPSSGEILL 58


>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist.  LIN-8
           is a nuclear protein, present at the sites of
           transcriptional repressor complexes, which interacts
           with LIN-35 Rb.Lin35 Rb is a product of the class B
           synMuv gene lin-35 which silences genes required for
           vulval specification through chromatin modification and
           remodelling. The biological role of the interaction has
           not yet been determined however predictions have been
           made. The interaction shows that class A synMuv genes
           control vulval induction through the transcriptional
           regulation of gene expression. LIN-8 normally functions
           as part of a protein complex however when the complex is
           absent, other family members can partially replace LIN-8
           activity.
          Length = 316

 Score = 32.7 bits (75), Expect = 0.99
 Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 5/46 (10%)

Query: 173 DLEDNMDDDDDD-----DDDDETDDDNDEDNEEESEEEEENEEDDD 213
             ED   DD+DD       +D T +  D +  +E +E E  E  DD
Sbjct: 132 TGEDQEADDEDDIIYDGIFEDRTRESQDMEENDEVDEVEVEEVPDD 177



 Score = 30.8 bits (70), Expect = 4.3
 Identities = 16/36 (44%), Positives = 22/36 (61%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
           D + DD+DD   D   ED   ES++ EEN+E D+ E
Sbjct: 135 DQEADDEDDIIYDGIFEDRTRESQDMEENDEVDEVE 170


>gnl|CDD|184539 PRK14150, PRK14150, heat shock protein GrpE; Provisional.
          Length = 193

 Score = 32.2 bits (74), Expect = 1.00
 Identities = 13/46 (28%), Positives = 21/46 (45%)

Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTA 233
           +E      E   EE E E E +E+ DE    ++ +  +  + EL A
Sbjct: 3   NEEQKTPQEQVSEEIEAEVEQQEEADEAELEDELDEADARIAELEA 48


>gnl|CDD|237511 PRK13796, PRK13796, GTPase YqeH; Provisional.
          Length = 365

 Score = 32.9 bits (76), Expect = 1.0
 Identities = 14/23 (60%), Positives = 15/23 (65%)

Query: 439 VAVVGPPQVGKSTLIRCLIKNFT 461
           V VVG   VGKSTLI  +IK  T
Sbjct: 163 VYVVGVTNVGKSTLINRIIKEIT 185


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 31.5 bits (72), Expect = 1.0
 Identities = 17/70 (24%), Positives = 33/70 (47%), Gaps = 10/70 (14%)

Query: 179 DDDDDDDDDDETDDDNDEDNEE----ESEEEEENEEDDD-----EERKAEKKNSKNEHVQ 229
           D  D DD++DE      + + E    + + ++E+EE D      +ER  E++  + E  +
Sbjct: 27  DGVDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRRE-QK 85

Query: 230 ELTAKQKNPK 239
           E    +K  +
Sbjct: 86  EKEEIEKYRE 95



 Score = 30.3 bits (69), Expect = 2.7
 Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 11/75 (14%)

Query: 174 LEDNMDDDDDDDDDDETDDDN---------DEDNEEESEEEEENEEDDDEERKAEKKNSK 224
              ++DD++D+D     D DN         D+++EE     E  +E  DE RK  ++  +
Sbjct: 27  DGVDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQKE 86

Query: 225 NEHVQELTAKQKNPK 239
            E +++     +NPK
Sbjct: 87  KEEIEKYRE--ENPK 99


>gnl|CDD|112137 pfam03308, ArgK, ArgK protein.  The ArgK protein acts as an ATPase
           enzyme and as a kinase, and phosphorylates periplasmic
           binding proteins involved in the LAO (lysine, arginine,
           ornithine)/AO transport systems.
          Length = 267

 Score = 32.7 bits (75), Expect = 1.1
 Identities = 11/30 (36%), Positives = 13/30 (43%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSV 467
            V + G P  GKSTLI  L     +    V
Sbjct: 31  RVGITGVPGAGKSTLIEALGMELRRRGHRV 60


>gnl|CDD|224081 COG1159, Era, GTPase [General function prediction only].
          Length = 298

 Score = 32.5 bits (75), Expect = 1.1
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 439 VAVVGPPQVGKSTLIRCLIK 458
           VA++G P VGKSTL+  L+ 
Sbjct: 9   VAIIGRPNVGKSTLLNALVG 28


>gnl|CDD|213202 cd03235, ABC_Metallic_Cations, ATP-binding cassette domain of the
           metal-type transporters.  This family includes
           transporters involved in the uptake of various metallic
           cations such as iron, manganese, and zinc. The ATPases
           of this group of transporters are very similar to
           members of iron-siderophore uptake family suggesting
           that they share a common ancestor. The best
           characterized metal-type ABC transporters are the
           YfeABCD system of Y. pestis, the SitABCD system of
           Salmonella enterica serovar Typhimurium, and the SitABCD
           transporter of Shigella flexneri. Moreover other
           uncharacterized homologs of these metal-type
           transporters are mainly found in pathogens like
           Haemophilus or enteroinvasive E. coli isolates.
          Length = 213

 Score = 32.1 bits (74), Expect = 1.1
 Identities = 10/24 (41%), Positives = 16/24 (66%)

Query: 434 PPPIVVAVVGPPQVGKSTLIRCLI 457
            P   +A+VGP   GKSTL++ ++
Sbjct: 23  KPGEFLAIVGPNGAGKSTLLKAIL 46


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 33.1 bits (75), Expect = 1.1
 Identities = 29/136 (21%), Positives = 53/136 (38%), Gaps = 24/136 (17%)

Query: 136 DTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDND 195
            ++D   ES +   KK KE+  S     KR++   +    D  + +       +T + + 
Sbjct: 53  SSNDSKAESTKKPNKKIKEEATSPLKSTKRQR---EKPASDTEEPERVTAKKSKTQELSR 109

Query: 196 EDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRS 255
            ++  E E E E E +  + R   ++ S             +PK          ++  RS
Sbjct: 110 PNSPSEGEGEGEGEGESSDSRSVNEEGS------------SDPKDI--------DQDNRS 149

Query: 256 TKPTV-VPQPRMSDKD 270
           + P++  PQ   SD D
Sbjct: 150 SSPSIPSPQDNESDSD 165


>gnl|CDD|177557 PHA03209, PHA03209, serine/threonine kinase US3; Provisional.
          Length = 357

 Score = 32.9 bits (75), Expect = 1.1
 Identities = 12/40 (30%), Positives = 21/40 (52%), Gaps = 3/40 (7%)

Query: 169 MFDDDLEDNMDDDD---DDDDDDETDDDNDEDNEEESEEE 205
             D+DL  ++ D D    DDD     DD+D+D    ++++
Sbjct: 19  AADEDLYSDISDGDLEYSDDDSASESDDDDDDGLIPTKQK 58



 Score = 31.8 bits (72), Expect = 2.2
 Identities = 12/34 (35%), Positives = 18/34 (52%), Gaps = 2/34 (5%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEE 204
           D DLE    DDD   + D+ DDD     ++++ E
Sbjct: 30  DGDLEY--SDDDSASESDDDDDDGLIPTKQKARE 61


>gnl|CDD|224045 COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport
           systems, ATPase components [Inorganic ion transport and
           metabolism / Coenzyme metabolism].
          Length = 258

 Score = 32.5 bits (75), Expect = 1.1
 Identities = 13/37 (35%), Positives = 18/37 (48%), Gaps = 5/37 (13%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
           +  ++GP   GKSTL++CL        L    G V L
Sbjct: 30  ITGILGPNGSGKSTLLKCLAGL-----LKPKSGEVLL 61


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is
           found in bacteria. Proteins in this family are typically
           between 234 to 465 amino acids in length. There is a
           conserved GEV sequence motif.Most members are annotated
           as being SprA-related.
          Length = 261

 Score = 32.4 bits (74), Expect = 1.2
 Identities = 33/212 (15%), Positives = 70/212 (33%), Gaps = 51/212 (24%)

Query: 165 RRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
           ++  +             +        +D +   EEE E+++        +++ E++  +
Sbjct: 50  QKAKLKGQASTAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQE 109

Query: 225 NEHVQELTAKQKNPKA-----------------FTFQ------------------SVIKG 249
            + V+EL A+ +  +A                 +T++                  S + G
Sbjct: 110 EKQVRELKARDREVRAHEQAHAAVGGQYAGAPSYTYERGPDGKRYAVGGEVSIDVSPVPG 169

Query: 250 E-----RKFR-----STKPTVVPQPRMSDKDLA--IRHKITSALAGLNSKEEEEEEQEDS 297
           +      K +     +  P    +P   D+ +A     K   A A L  + +EE  +ED 
Sbjct: 170 DPQATINKMQQVRAAALAPA---EPSSQDRRVAAEATQKENEARAELAQERQEEASEEDE 226

Query: 298 AFSDESADDNSSDEDETVEPKTHKDSKNPEED 329
           A    +  +N     E + P T +     E  
Sbjct: 227 A-EALAGAENQITALEPLAPSTPQVDGQVEAT 257


>gnl|CDD|239388 cd03114, ArgK-like, The function of this protein family is unkown.
           The protein sequences are similar to the ArgK protein in
           E. coli. ArgK protein is a membrane ATPase which is
           required for transporting arginine, ornithine and lysine
           into the cells by the arginine and ornithine (AO system)
           and lysine, arginine and ornithine (LAO) transport
           systems.
          Length = 148

 Score = 31.5 bits (72), Expect = 1.2
 Identities = 11/25 (44%), Positives = 13/25 (52%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTK 462
           V+ + G P  GKSTLI  LI     
Sbjct: 1   VIGITGVPGAGKSTLIDALITALRA 25


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 32.9 bits (75), Expect = 1.2
 Identities = 20/125 (16%), Positives = 51/125 (40%), Gaps = 3/125 (2%)

Query: 127 DSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDD 186
              +  D++D+S   +     +++K        Q   +       +++E   + + + + 
Sbjct: 27  IQQNNGDKEDSSTSTSSLSVSAVEKTSNAKEEIQVDFQHNSESAVEEVEAEDEIEVEQNQ 86

Query: 187 DDETDDDNDEDNEEESEEEEENEEDD-DEERKAEKKNSKNEHVQELTAKQ--KNPKAFTF 243
            D     +    E  S + +  ++D  D + K E++N +   ++EL  +   +  K F +
Sbjct: 87  SDVLKSSSIVKEESISTDMDGIDDDSLDRKLKLERENLRKREIEELAEENFSRGNKLFVY 146

Query: 244 QSVIK 248
             V+K
Sbjct: 147 PQVVK 151


>gnl|CDD|151773 pfam11332, DUF3134, Protein of unknown function (DUF3134).  This
           family of proteins with unknown function appears to be
           restricted to Cyanobacteria.
          Length = 72

 Score = 30.0 bits (68), Expect = 1.2
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDD 193
           ++++ + + DDDD  DDD+ +DD
Sbjct: 50  EEEISELIGDDDDYYDDDDDEDD 72


>gnl|CDD|223998 COG1072, CoaA, Panthothenate kinase [Coenzyme metabolism].
          Length = 283

 Score = 32.6 bits (75), Expect = 1.2
 Identities = 10/27 (37%), Positives = 14/27 (51%)

Query: 430 TPLEPPPIVVAVVGPPQVGKSTLIRCL 456
           T  +  P ++ + G   VGKST  R L
Sbjct: 76  TNNQQRPFIIGIAGSVAVGKSTTARIL 102


>gnl|CDD|215874 pfam00350, Dynamin_N, Dynamin family. 
          Length = 168

 Score = 31.8 bits (73), Expect = 1.3
 Identities = 23/99 (23%), Positives = 37/99 (37%), Gaps = 13/99 (13%)

Query: 439 VAVVGPPQVGKSTLIRCLI-KNFTKTPLSVIKGPVTLIIKDSIRDCFVTGKWKASEDASE 497
           +AVVG    GKS+++  L+ ++    P     GP T           +  +      A  
Sbjct: 1   IAVVGDQSAGKSSVLNALLGRDI--LPRGP--GPTT--------RRPLVLRLGEEPGAIP 48

Query: 498 LLRLDDMDDDEELFGDFEDLETGEKHSGDKSGGGSGGVS 536
                +  D  + F DF +L    +   DK  G   G+S
Sbjct: 49  GAVKVEYKDGLKKFEDFSELREEIEDETDKISGTGKGIS 87


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 31.5 bits (70), Expect = 1.3
 Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 2/87 (2%)

Query: 162 GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDN--EEESEEEEENEEDDDEERKAE 219
           G K  KV  DD    + DD+ +++++ E +++ +E+   EEE EEE E EE  +EE + E
Sbjct: 44  GEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPE 103

Query: 220 KKNSKNEHVQELTAKQKNPKAFTFQSV 246
            +       +     +  P A T +SV
Sbjct: 104 PEPEPEPEPEPEPEPEPEPGAATLKSV 130



 Score = 30.3 bits (67), Expect = 3.1
 Identities = 21/77 (27%), Positives = 36/77 (46%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNP 238
           DD +  D DDE +++ + + EEE EEE E EE+ +EE + E+   +    +     +  P
Sbjct: 53  DDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEP 112

Query: 239 KAFTFQSVIKGERKFRS 255
           +         G    +S
Sbjct: 113 EPEPEPEPEPGAATLKS 129


>gnl|CDD|226164 COG3638, COG3638, ABC-type phosphate/phosphonate transport system,
           ATPase component [Inorganic ion transport and
           metabolism].
          Length = 258

 Score = 32.2 bits (74), Expect = 1.3
 Identities = 11/18 (61%), Positives = 14/18 (77%)

Query: 439 VAVVGPPQVGKSTLIRCL 456
           VA++GP   GKSTL+R L
Sbjct: 33  VAIIGPSGAGKSTLLRSL 50


>gnl|CDD|215628 PLN03196, PLN03196, MOC1-like protein; Provisional.
          Length = 487

 Score = 32.8 bits (75), Expect = 1.3
 Identities = 15/49 (30%), Positives = 23/49 (46%), Gaps = 3/49 (6%)

Query: 171 DDDLEDNMDDDDDDDDDDE---TDDDNDEDNEEESEEEEENEEDDDEER 216
           DD  E  M  D  + ++ E   +     E    ES  +EE+++DDDE  
Sbjct: 432 DDKFEQRMSGDFIEGEEMEPLFSMGGKLEMPGSESVSDEEDDDDDDEVL 480


>gnl|CDD|224042 COG1117, PstB, ABC-type phosphate transport system, ATPase
           component [Inorganic ion transport and metabolism].
          Length = 253

 Score = 32.1 bits (74), Expect = 1.4
 Identities = 16/37 (43%), Positives = 23/37 (62%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
           V A++GP   GKSTL+RCL +     P + ++G V L
Sbjct: 35  VTALIGPSGCGKSTLLRCLNRMNDLIPGARVEGEVLL 71


>gnl|CDD|181906 PRK09493, glnQ, glutamine ABC transporter ATP-binding protein;
           Reviewed.
          Length = 240

 Score = 32.0 bits (73), Expect = 1.4
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 438 VVAVVGPPQVGKSTLIRCL 456
           VV ++GP   GKSTL+RC+
Sbjct: 29  VVVIIGPSGSGKSTLLRCI 47


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 32.8 bits (74), Expect = 1.4
 Identities = 44/227 (19%), Positives = 89/227 (39%), Gaps = 9/227 (3%)

Query: 195  DEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFR 254
            D    E  ++ EE ++D +E +KAE++ +  E  +   A+  +      Q+ IK E   +
Sbjct: 1222 DAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFA--RRQAAIKAEEARK 1279

Query: 255  STKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDET 314
            + +     + + +D+  A + +          K EE ++ +++    E A   +    + 
Sbjct: 1280 ADELKKAEEKKKADE--AKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKK 1337

Query: 315  VEPKTHKDSKNPEEDTGLNWKSDLAQKAASAFLERQANIVNLAKYVYGDMEDVSVTMEGD 374
             E           E      +++ A++ A A  +++      A       E+     + D
Sbjct: 1338 AEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKK---KAD 1394

Query: 375  EAPNKKVHRKRQAELTAKQKNPKAFTFQSVIKGERKFRRKEDIQAKK 421
            EA  K    K++A+   K    K    ++  K E K  +K D   KK
Sbjct: 1395 EAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK--KKADEAKKK 1439



 Score = 31.3 bits (70), Expect = 3.9
 Identities = 41/208 (19%), Positives = 86/208 (41%), Gaps = 12/208 (5%)

Query: 133  DEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDD 192
            +ED    ++ +  +   KK  E +    +  K+ + +   + E+    ++    ++E   
Sbjct: 1671 EEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKI 1730

Query: 193  DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERK 252
              +E  +E  E++++ E    E +K E++  K  H+++   K+        ++VI+ E  
Sbjct: 1731 KAEEAKKEAEEDKKKAE----EAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELD 1786

Query: 253  FRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKE---EEEEEQEDSAFSDESADDNSS 309
                K  +    ++ D    I     + + G         + +E EDSA   E AD  + 
Sbjct: 1787 EEDEKRRMEVDKKIKD----IFDNFANIIEGGKEGNLVINDSKEMEDSA-IKEVADSKNM 1841

Query: 310  DEDETVEPKTHKDSKNPEEDTGLNWKSD 337
              +E    + HK +KN E     N ++D
Sbjct: 1842 QLEEADAFEKHKFNKNNENGEDGNKEAD 1869


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 28.8 bits (65), Expect = 1.4
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 192 DDNDEDNEEESEEEEENEEDDDEER 216
           ++   ++EEE+E+EEE EE  + +R
Sbjct: 22  EEEKREDEEENEDEEEGEEQSEVKR 46



 Score = 28.5 bits (64), Expect = 2.0
 Identities = 9/25 (36%), Positives = 18/25 (72%)

Query: 196 EDNEEESEEEEENEEDDDEERKAEK 220
           E+ + E EEE E+EE+ +E+ + ++
Sbjct: 22  EEEKREDEEENEDEEEGEEQSEVKR 46



 Score = 28.1 bits (63), Expect = 2.9
 Identities = 11/23 (47%), Positives = 19/23 (82%)

Query: 199 EEESEEEEENEEDDDEERKAEKK 221
           EE+ E+EEENE++++ E ++E K
Sbjct: 23  EEKREDEEENEDEEEGEEQSEVK 45



 Score = 26.9 bits (60), Expect = 8.0
 Identities = 8/25 (32%), Positives = 16/25 (64%)

Query: 197 DNEEESEEEEENEEDDDEERKAEKK 221
           + E+  +EEE  +E++ EE+   K+
Sbjct: 22  EEEKREDEEENEDEEEGEEQSEVKR 46


>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366).  This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 227 and
           387 amino acids in length.
          Length = 213

 Score = 31.9 bits (73), Expect = 1.4
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEE 204
              DDD D+    D DE+ ++E   
Sbjct: 189 GAPDDDLDEYDYGDEDEEEDDEPPW 213


>gnl|CDD|188098 TIGR00957, MRP_assoc_pro, multi drug resistance-associated protein
           (MRP).  This model describes multi drug
           resistance-associated protein (MRP) in eukaryotes. The
           multidrug resistance-associated protein is an integral
           membrane protein that causes multidrug resistance when
           overexpressed in mammalian cells. It belongs to ABC
           transporter superfamily. The protein topology and
           function was experimentally demonstrated by epitope
           tagging and immunofluorescence. Insertion of tags in the
           critical regions associated with drug efflux, abrogated
           its function. The C-terminal domain seem to highly
           conserved [Transport and binding proteins, Other].
          Length = 1522

 Score = 32.6 bits (74), Expect = 1.5
 Identities = 36/116 (31%), Positives = 50/116 (43%), Gaps = 20/116 (17%)

Query: 434 PPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSV-IKGPVTL------IIKDSIRDCFVT 486
           P   +VAVVG    GKS+L+  L+    K    V +KG V        I  DS+R+  + 
Sbjct: 662 PEGALVAVVGQVGCGKSSLLSALLAEMDKVEGHVHMKGSVAYVPQQAWIQNDSLRENILF 721

Query: 487 GK------WKASEDASELLRLDDMDDDEELFGDFEDLETGEKHSGDKSGGGSGGVS 536
           GK      ++   +A  LL       D E+    +  E GEK   + SGG    VS
Sbjct: 722 GKALNEKYYQQVLEACALL------PDLEILPSGDRTEIGEK-GVNLSGGQKQRVS 770


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 29.9 bits (68), Expect = 1.5
 Identities = 12/15 (80%), Positives = 13/15 (86%)

Query: 199 EEESEEEEENEEDDD 213
           EE+ EEEEE EEDDD
Sbjct: 68  EEKKEEEEEEEEDDD 82



 Score = 28.0 bits (63), Expect = 7.5
 Identities = 11/15 (73%), Positives = 13/15 (86%)

Query: 199 EEESEEEEENEEDDD 213
           EEE +EEEE EE+DD
Sbjct: 67  EEEKKEEEEEEEEDD 81


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 30.4 bits (69), Expect = 1.5
 Identities = 13/20 (65%), Positives = 16/20 (80%)

Query: 196 EDNEEESEEEEENEEDDDEE 215
           E+ EEE EEEEE EE+ +EE
Sbjct: 76  EEEEEEEEEEEEEEEESEEE 95



 Score = 28.9 bits (65), Expect = 5.4
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 196 EDNEEESEEEEENEEDDDEERKAE 219
            + EEE EEEEE EE++ EE    
Sbjct: 75  AEEEEEEEEEEEEEEEESEEEAMA 98



 Score = 28.9 bits (65), Expect = 5.6
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 196 EDNEEESEEEEENEEDDDEERKA 218
              EEE EEEEE EE+++ E +A
Sbjct: 74  AAEEEEEEEEEEEEEEEESEEEA 96


>gnl|CDD|237422 PRK13548, hmuV, hemin importer ATP-binding subunit; Provisional.
          Length = 258

 Score = 32.1 bits (74), Expect = 1.5
 Identities = 12/19 (63%), Positives = 15/19 (78%)

Query: 438 VVAVVGPPQVGKSTLIRCL 456
           VVA++GP   GKSTL+R L
Sbjct: 30  VVAILGPNGAGKSTLLRAL 48


>gnl|CDD|240521 cd13394, Syo1_like, Fungal symportin 1 (syo1) and similar proteins.
            This family of eukaryotic proteins includes
           Saccharomyces cerevisiae Ydl063c and Chaetomium
           thermophilum Syo1, which mediate the co-import of two
           ribosomal proteins, Rpl5 and Rpl11 (which both interact
           with 5S rRNA) into the nucleus. Import precedes their
           association with rRNA and subsequent ribosome assembly
           in the nucleolus. The primary structure of syo1 is a
           mixture of Armadillo- (ARM, N-terminal part of syo1) and
           HEAT-repeats (C-terminal part of syo1).
          Length = 597

 Score = 32.7 bits (75), Expect = 1.5
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
           + D +D+   +D+  D+DE +D++DE +E    E    E
Sbjct: 304 ELDKKDDEAMEDEQSDEDEEEDEDDEASETLVPELSPLE 342



 Score = 30.7 bits (70), Expect = 5.2
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
           + D  DD+  +D+  +++EEE E++E +E    E    E
Sbjct: 304 ELDKKDDEAMEDEQSDEDEEEDEDDEASETLVPELSPLE 342


>gnl|CDD|215427 PLN02796, PLN02796, D-glycerate 3-kinase.
          Length = 347

 Score = 32.4 bits (74), Expect = 1.5
 Identities = 20/70 (28%), Positives = 31/70 (44%), Gaps = 10/70 (14%)

Query: 418 QAKKHHVPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLIIK 477
           Q + H     D    E PP+V+ +  P   GK+TL+  L+  F  T     +   +L   
Sbjct: 84  QLEAHRSKFKDGD--EIPPLVIGISAPQGCGKTTLVFALVYLFNATG----RRAASL--- 134

Query: 478 DSIRDCFVTG 487
            SI D ++T 
Sbjct: 135 -SIDDFYLTA 143


>gnl|CDD|206646 cd00880, Era_like, E. coli Ras-like protein (Era)-like GTPase.  The
           Era (E. coli Ras-like protein)-like family includes
           several distinct subfamilies (TrmE/ThdF, FeoB, YihA
           (EngB), Era, and EngA/YfgK) that generally show sequence
           conservation in the region between the Walker A and B
           motifs (G1 and G3 box motifs), to the exclusion of other
           GTPases. TrmE is ubiquitous in bacteria and is a
           widespread mitochondrial protein in eukaryotes, but is
           absent from archaea. The yeast member of TrmE family,
           MSS1, is involved in mitochondrial translation;
           bacterial members are often present in
           translation-related operons. FeoB represents an unusual
           adaptation of GTPases for high-affinity iron (II)
           transport. YihA (EngB) family of GTPases is typified by
           the E. coli YihA, which is an essential protein involved
           in cell division control. Era is characterized by a
           distinct derivative of the KH domain (the pseudo-KH
           domain) which is located C-terminal to the GTPase
           domain. EngA and its orthologs are composed of two
           GTPase domains and, since the sequences of the two
           domains are more similar to each other than to other
           GTPases, it is likely that an ancient gene duplication,
           rather than a fusion of evolutionarily distinct GTPases,
           gave rise to this family.
          Length = 161

 Score = 31.4 bits (72), Expect = 1.5
 Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 15/51 (29%)

Query: 440 AVVGPPQVGKSTLIRCLIK----------NFTKTPLSV-----IKGPVTLI 475
           A+ G P VGKS+L+  L+             T+ P+         GPV LI
Sbjct: 1   AIFGRPNVGKSSLLNALLGQNVGIVSPIPGTTRDPVRKEWELLPLGPVVLI 51


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 32.3 bits (74), Expect = 1.6
 Identities = 12/28 (42%), Positives = 17/28 (60%)

Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEE 210
           + +  DE ++  DE+ EEE EE EE E 
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEP 370



 Score = 31.6 bits (72), Expect = 3.2
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 187 DDETDDDNDEDNEEESEEEEENEEDDDEE 215
           + E  D+ +E  +EE EEEEE  E+ + E
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPE 371


>gnl|CDD|206665 cd01876, YihA_EngB, YihA (EngB) GTPase family.  The YihA (EngB)
           subfamily of GTPases is typified by the E. coli YihA, an
           essential protein involved in cell division control.
           YihA and its orthologs are small proteins that typically
           contain less than 200 amino acid residues and consists
           of the GTPase domain only (some of the eukaryotic
           homologs contain an N-terminal extension of about 120
           residues that might be involved in organellar
           targeting). Homologs of yihA are found in most
           Gram-positive and Gram-negative pathogenic bacteria,
           with the exception of Mycobacterium tuberculosis. The
           broad-spectrum nature of YihA and its essentiality for
           cell viability in bacteria make it an attractive
           antibacterial target.
          Length = 170

 Score = 31.3 bits (72), Expect = 1.6
 Identities = 13/31 (41%), Positives = 15/31 (48%), Gaps = 5/31 (16%)

Query: 439 VAVVGPPQVGKSTLIRCLIKN-----FTKTP 464
           VA  G   VGKS+LI  L         +KTP
Sbjct: 2   VAFAGRSNVGKSSLINALTNRKKLARTSKTP 32


>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1.  This
           family consists of several acidic phosphoprotein
           precursor PCEMA1 sequences which appear to be found
           exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
           that is associated with the membrane of the infected
           erythrocyte throughout the entire intraerythrocytic
           cycle. The exact function of this family is unclear.
          Length = 286

 Score = 32.2 bits (73), Expect = 1.6
 Identities = 17/97 (17%), Positives = 38/97 (39%), Gaps = 7/97 (7%)

Query: 129 DDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDD 188
           DDE DE+ TS  + E        + +      D         + ++   + D ++D++  
Sbjct: 188 DDELDEEVTSYLN-ELDDEVTSYFNDGENEENDDE------LEAEVISYLKDGENDNEVK 240

Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
           E       + + +     E E +D+ E + E++  + 
Sbjct: 241 EKIRREYREWKGDKANTNETEIEDESEDEYEEEAGEE 277



 Score = 30.6 bits (69), Expect = 4.4
 Identities = 14/67 (20%), Positives = 29/67 (43%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
           +DD     +DD+ D++     ++ D++      + E  E DD+ E +        E+  E
Sbjct: 179 EDDASTLHEDDELDEEVTSYLNELDDEVTSYFNDGENEENDDELEAEVISYLKDGENDNE 238

Query: 231 LTAKQKN 237
           +  K + 
Sbjct: 239 VKEKIRR 245


>gnl|CDD|223447 COG0370, FeoB, Fe2+ transport system protein B [Inorganic ion
           transport and metabolism].
          Length = 653

 Score = 32.6 bits (75), Expect = 1.6
 Identities = 10/17 (58%), Positives = 13/17 (76%)

Query: 437 IVVAVVGPPQVGKSTLI 453
           + VA+VG P VGK+TL 
Sbjct: 4   LTVALVGNPNVGKTTLF 20


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 31.5 bits (72), Expect = 1.6
 Identities = 17/90 (18%), Positives = 35/90 (38%), Gaps = 29/90 (32%)

Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE------- 215
               K   ++ LE  + +  ++  + +  D+ DE         E +EE+DD         
Sbjct: 17  PPSPKEEEEEALELAIQEAHENALEKKLLDELDE---------ELDEEEDDRFLEEYRRK 67

Query: 216 -----RKAEKKN--------SKNEHVQELT 232
                +   +K+        SK ++V+E+T
Sbjct: 68  RLAEMKALAEKSKFGEVYEISKPDYVREVT 97


>gnl|CDD|206727 cd04164, trmE, trmE is a tRNA modification GTPase.  TrmE (MnmE,
           ThdF, MSS1) is a 3-domain protein found in bacteria and
           eukaryotes. It controls modification of the uridine at
           the wobble position (U34) of tRNAs that read codons
           ending with A or G in the mixed codon family boxes. TrmE
           contains a GTPase domain that forms a canonical Ras-like
           fold. It functions a molecular switch GTPase, and
           apparently uses a conformational change associated with
           GTP hydrolysis to promote the tRNA modification
           reaction, in which the conserved cysteine in the
           C-terminal domain is thought to function as a catalytic
           residue. In bacteria that are able to survive in
           extremely low pH conditions, TrmE regulates
           glutamate-dependent acid resistance.
          Length = 159

 Score = 31.3 bits (72), Expect = 1.7
 Identities = 10/22 (45%), Positives = 13/22 (59%)

Query: 437 IVVAVVGPPQVGKSTLIRCLIK 458
           I V + G P VGKS+L+  L  
Sbjct: 4   IKVVIAGKPNVGKSSLLNALAG 25



 Score = 29.8 bits (68), Expect = 5.5
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 74  ADLVLLLIDASFGFEMEIFEFLNICQ 99
           ADLVLL++DAS G + E  E L +  
Sbjct: 83  ADLVLLVVDASEGLDEEDLEILELPA 108


>gnl|CDD|114045 pfam05297, Herpes_LMP1, Herpesvirus latent membrane protein 1
           (LMP1).  This family consists of several latent membrane
           protein 1 or LMP1s mostly from Epstein-Barr virus. LMP1
           of EBV is a 62-65 kDa plasma membrane protein possessing
           six membrane spanning regions, a short cytoplasmic
           N-terminus and a long cytoplasmic carboxy tail of 200
           amino acids. EBV latent membrane protein 1 (LMP1) is
           essential for EBV-mediated transformation and has been
           associated with several cases of malignancies. EBV-like
           viruses in Cynomolgus monkeys (Macaca fascicularis) have
           been associated with high lymphoma rates in
           immunosuppressed monkeys.
          Length = 382

 Score = 32.3 bits (73), Expect = 1.7
 Identities = 25/99 (25%), Positives = 42/99 (42%), Gaps = 13/99 (13%)

Query: 128 SDDENDEDDTSDDDTE-----SGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDD 182
           +DD   E D++ ++       SG G       Q + +  G         D+ +DN   D 
Sbjct: 208 TDDSGHESDSNSNEGRHHLLVSGAGDGPPLCSQNLGAPGGGPDNGPQDPDNTDDNGPQDP 267

Query: 183 DDDDD------DETDDD--NDEDNEEESEEEEENEEDDD 213
           D+ DD      D TDD+   D DN  ++  ++ +  DD+
Sbjct: 268 DNTDDNGPQDPDNTDDNGPQDPDNTADNGPQDPDNTDDN 306


>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
           unknown].
          Length = 622

 Score = 32.4 bits (73), Expect = 1.7
 Identities = 24/132 (18%), Positives = 54/132 (40%), Gaps = 4/132 (3%)

Query: 122 AISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
           A + D   D++    T+   ++    S+ + K Q + S+          + +L D  DD 
Sbjct: 349 AFTDDGIRDKDFSAYTASKLSDEDDDSVMESKMQKLFSEK---EIDFGLNSELVDMSDDG 405

Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE-KKNSKNEHVQELTAKQKNPKA 240
           ++ + +D         NE ES+++ E   +  + +  E ++N K   +++          
Sbjct: 406 ENGEMEDTFTSHLPASNESESDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDLK 465

Query: 241 FTFQSVIKGERK 252
              +S+ K  +K
Sbjct: 466 DKKESINKKNKK 477


>gnl|CDD|218055 pfam04386, SspB, Stringent starvation protein B.  Escherichia coli
           stringent starvation protein B (SspB), is thought to
           enhance the specificity of degradation of tmRNA-tagged
           proteins by the ClpXP protease. The tmRNA tag, also
           known as ssrA, is an 11-aa peptide added to the C
           terminus of proteins stalled during translation, targets
           proteins for degradation by ClpXP and ClpAP. SspB a
           cytoplasmic protein that specifically binds to residues
           1-4 and 7 of the tag. Binding of SspB enhances
           degradation of tagged proteins by ClpX, and masks
           sequence elements important for ClpA interactions,
           inhibiting degradation by ClpA. However, more recent
           work has cast doubt on the importance of SspB in
           wild-type cells. SspB is encoded in an operon whose
           synthesis is stimulated by carbon, amino acid, and
           phosphate starvation. SspB may play a special role
           during nutrient stress, for example by ensuring rapid
           degradation of the products of stalled translation,
           without causing a global increase in degradation of all
           ClpXP substrates.
          Length = 153

 Score = 31.0 bits (71), Expect = 1.8
 Identities = 10/45 (22%), Positives = 21/45 (46%), Gaps = 3/45 (6%)

Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
            F+ +  D   ++++++DDD  D   DED E +   + +      
Sbjct: 109 QFEPEEAD---EEEEEEDDDADDSPEDEDPEPKDPPKTKGRPSLK 150


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 32.3 bits (73), Expect = 1.8
 Identities = 33/176 (18%), Positives = 73/176 (41%), Gaps = 7/176 (3%)

Query: 150 KKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
           +K  +   + Q   +  + M +D   +N   ++++ ++   + +  E+ E  ++ E++N+
Sbjct: 81  QKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKND 140

Query: 210 EDDDEERKAEKK--NSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMS 267
             D EE + E+K    + E   +  + ++N   F    +   E  F              
Sbjct: 141 WRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGK 200

Query: 268 DKDLAIRHKITSALAGLNSKEEEEE-----EQEDSAFSDESADDNSSDEDETVEPK 318
           + +   + +  +AL     K++ EE     E+E+     E AD  S +E+E    K
Sbjct: 201 EFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLK 256



 Score = 31.9 bits (72), Expect = 2.2
 Identities = 49/268 (18%), Positives = 94/268 (35%), Gaps = 28/268 (10%)

Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD--DEERKAEK 220
           R+RR+   ++ L      ++       T  + +  N    EE + + +D+    ER A +
Sbjct: 9   RERRRREREEQLRQ--KQEEGSLGQVTTQVEVNSQNSVPDEESKTSTDDEAALLERLARR 66

Query: 221 KNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSA 280
           +  ++E   E   +QK             E K  ST  ++    R   +D    ++    
Sbjct: 67  EERRDERFSEALERQK-------------EFKPTSTDQSLSEPSRRMQEDSGAENETVEE 113

Query: 281 LAGLNSKEEEEEEQEDSAFSDESADDNSSDE----DETVEPKTHKDSKNPEEDTGLNWKS 336
                S+EE EE +E    +     ++  D      E  EP+  ++ K        N   
Sbjct: 114 EEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGE 173

Query: 337 DLAQKAASAFLERQANIVNLAKYVYGDMEDVS------VTMEGDEAPNKKVHRKRQAELT 390
            +  K               A+   G   +          +E +E   K+  R++  E  
Sbjct: 174 FMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLE-E 232

Query: 391 AKQKNPKAFTFQSVIKGERKFRRKEDIQ 418
            +Q+  +    +   + E K R KE+I+
Sbjct: 233 EEQRRKQEEADRKSREEEEKRRLKEEIE 260


>gnl|CDD|217943 pfam04180, LTV, Low temperature viability protein.  The
           low-temperature viability protein LTV1 is involved in
           ribosome biogenesis 40S subunit production.
          Length = 426

 Score = 32.3 bits (73), Expect = 1.8
 Identities = 21/143 (14%), Positives = 51/143 (35%), Gaps = 13/143 (9%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
           D+    N ++ ++DD ++  + D++E +++ S     +E +D     + ++  K     E
Sbjct: 168 DEAYVRNDEESEEDDGEEWEEMDDEEGSDDRSSAGFLDEFEDLSAPGSPQEAMKKHLFVE 227

Query: 231 LTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITS--ALAGLNSKE 288
              + +N +      V++                 +  +      K      +   ++ E
Sbjct: 228 EETESRNTEYSETSEVMRRNE-----------DWTLHFERFKKFEKKNDDVEIGAADNSE 276

Query: 289 EEEEEQEDSAFSDESADDNSSDE 311
            E   + DS    E  +D    E
Sbjct: 277 LEGSIEVDSNRLQEVLNDYYKTE 299



 Score = 30.7 bits (69), Expect = 4.8
 Identities = 29/163 (17%), Positives = 57/163 (34%), Gaps = 26/163 (15%)

Query: 178 MDDDDDDDDD---DETDDDNDEDNEEESEEE--EENEEDDDEERKAEKKNSKNEHVQELT 232
              D D       +  +D+    N+EESEE+  EE EE DDEE   ++ ++      E  
Sbjct: 151 FQPDMDPRLREVLEALEDEAYVRNDEESEEDDGEEWEEMDDEEGSDDRSSAGFLDEFEDL 210

Query: 233 AKQKNP-KAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEE 291
           +   +P +A      ++ E + R+T                   + +     +   E+  
Sbjct: 211 SAPGSPQEAMKKHLFVEEETESRNT-------------------EYSETSEVMRRNEDWT 251

Query: 292 EEQED-SAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLN 333
              E    F  ++ D      D +    + +   N  ++   +
Sbjct: 252 LHFERFKKFEKKNDDVEIGAADNSELEGSIEVDSNRLQEVLND 294



 Score = 30.7 bits (69), Expect = 4.8
 Identities = 24/111 (21%), Positives = 38/111 (34%), Gaps = 12/111 (10%)

Query: 125 SDDSDDENDEDDTSDDDTE-SGQGSMKKY--------KEQVVSSQDGR-KRRKVMFDDDL 174
           SDD       D+  D     S Q +MKK+              S+     RR   +    
Sbjct: 195 SDDRSSAGFLDEFEDLSAPGSPQEAMKKHLFVEEETESRNTEYSETSEVMRRNEDWTLHF 254

Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
           E     +  +DD +    DN E   E S E + N   +      + + +K+
Sbjct: 255 ERFKKFEKKNDDVEIGAADNSE--LEGSIEVDSNRLQEVLNDYYKTEKAKS 303


>gnl|CDD|222046 pfam13324, GCIP, Grap2 and cyclin-D-interacting.  GCIP, or Grap2
           and cyclin-D-interacting protein, is found in
           eukaryotes, and in the human protein CCNDBP1, residues
           149-190 constitute a helix-loop-helix domain, residues
           190-240 an acidic region, and 240-261 a leucine zipper
           domain. GCIP interacts with full-length Grap2 protein
           and with the COOH-terminal unique and SH3 domains
           (designated QC domain) of Grap2. It is potentially
           involved in the regulation of cell differentiation and
           proliferation through Grap2 and cyclin D-mediated
           signalling pathways. In mice, it is involved in
           G1/S-phase progression of hepatocytes, which in older
           animals is associated with the development of liver
           tumours. In vitro it acts as an inhibitory HLH protein,
           for example, blocking transcription of the HNF-4
           promoter. In its function as a cyclin D1-binding protein
           it is able to reduce CDK4-mediated phosphorylation of
           the retinoblastoma protein and to inhibit E2F-mediated
           transcriptional activity. GCIP has also been shown to
           have interact physically with Rad (Ras associated with
           diabetes), Rad being important in regulating cellular
           senescence.
          Length = 271

 Score = 32.0 bits (73), Expect = 1.8
 Identities = 17/34 (50%), Positives = 18/34 (52%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEE 205
           DD ED   D   DDDD E DD ND+    E E E
Sbjct: 152 DDSEDPFSDGHHDDDDVEGDDLNDDRYWSEEEME 185


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 30.8 bits (70), Expect = 1.8
 Identities = 10/59 (16%), Positives = 30/59 (50%), Gaps = 1/59 (1%)

Query: 174 LEDNMDDDDDDDDDDETDDDNDED-NEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
            E++       +  ++ D + + D  +EE+E ++E    +++++   +K+ +    +EL
Sbjct: 18  AENDGKLQVKPNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEEL 76



 Score = 30.0 bits (68), Expect = 3.6
 Identities = 9/52 (17%), Positives = 20/52 (38%)

Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
            D  L+   +  +  D +  TD   +E   ++     E ++    ++  EK 
Sbjct: 20  NDGKLQVKPNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKP 71


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 32.0 bits (73), Expect = 1.9
 Identities = 41/158 (25%), Positives = 59/158 (37%), Gaps = 25/158 (15%)

Query: 195 DEDNEEESEEEEENEEDDDEERKAEK--KNSKNE-----HVQELTAKQKNPKAFTFQSVI 247
           DE+ EE  E+E E E   D  +K E    +SK E      ++EL   Q         S++
Sbjct: 114 DEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQSRRADVDVNSML 173

Query: 248 KGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDN 307
             E  FR  K     +    ++D A+   +        S   E EE    A  DE ++D+
Sbjct: 174 --EALFRREKKE---EEEEEEEDEALIKSL--------SFGPETEEDRRRA-DDEDSEDD 219

Query: 308 SSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAASA 345
             D D T  PK+   S               A K + A
Sbjct: 220 EEDNDNTPSPKSGSSSPAKPTSI----LKKSAAKRSEA 253


>gnl|CDD|217059 pfam02480, Herpes_gE, Alphaherpesvirus glycoprotein E.
           Glycoprotein E (gE) of Alphaherpesvirus forms a complex
           with glycoprotein I (gI) (pfam01688), functioning as an
           immunoglobulin G (IgG) Fc binding protein. gE is
           involved in virus spread but is not essential for
           propagation.
          Length = 437

 Score = 32.0 bits (73), Expect = 2.0
 Identities = 6/28 (21%), Positives = 14/28 (50%)

Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENE 209
            ++  +D + D   +D E +S+ + E  
Sbjct: 400 TNELFEDVSSDSESDDEEFDSDSDREPS 427


>gnl|CDD|206684 cd01897, NOG, Nucleolar GTP-binding protein (NOG).  NOG1 is a
           nucleolar GTP-binding protein present in eukaryotes
           ranging from trypanosomes to humans. NOG1 is
           functionally linked to ribosome biogenesis and found in
           association with the nuclear pore complexes and
           identified in many preribosomal complexes. Thus, defects
           in NOG1 can lead to defects in 60S biogenesis. The S.
           cerevisiae NOG1 gene is essential for cell viability,
           and mutations in the predicted G motifs abrogate
           function. It is a member of the ODN family of
           GTP-binding proteins that also includes the bacterial
           Obg and DRG proteins.
          Length = 167

 Score = 31.0 bits (71), Expect = 2.0
 Identities = 10/22 (45%), Positives = 14/22 (63%), Gaps = 2/22 (9%)

Query: 435 PPIVVAVVGPPQVGKSTLIRCL 456
             +V+A  G P VGKS+L+  L
Sbjct: 1   RTLVIA--GYPNVGKSSLVNKL 20


>gnl|CDD|237894 PRK15064, PRK15064, ABC transporter ATP-binding protein;
           Provisional.
          Length = 530

 Score = 32.2 bits (74), Expect = 2.0
 Identities = 9/19 (47%), Positives = 15/19 (78%)

Query: 439 VAVVGPPQVGKSTLIRCLI 457
           +A++G   VGK+TL+R L+
Sbjct: 348 LAIIGENGVGKTTLLRTLV 366


>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B.  A
           small number of taxonomically diverse prokaryotic
           species, including Methanosarcina barkeri, have what
           appears to be a second ATP synthase, in addition to the
           normal F1F0 ATPase in bacteria and A1A0 ATPase in
           archaea. These enzymes use ion gradients to synthesize
           ATP, CC and in principle may run in either direction.
           This model represents the F0 subunit B of this apparent
           second ATP synthase.
          Length = 246

 Score = 31.6 bits (72), Expect = 2.1
 Identities = 18/63 (28%), Positives = 29/63 (46%), Gaps = 16/63 (25%)

Query: 928 EREQKVASLM----------KMLRTNYSEKNSKEKQAMKARMVALKLRAKAAEEAKQQRQ 977
            RE+K+A  +          +  R  Y EKN  E    +  ++      KA EEA+ +RQ
Sbjct: 36  AREKKIAGELADADTKKREAEQERREYEEKNE-ELDQQREVLLT-----KAKEEAQAERQ 89

Query: 978 RVM 980
           R++
Sbjct: 90  RLL 92


>gnl|CDD|206648 cd00882, Ras_like_GTPase, Rat sarcoma (Ras)-like superfamily of
           small guanosine triphosphatases (GTPases).  Ras-like
           GTPase superfamily. The Ras-like superfamily of small
           GTPases consists of several families with an extremely
           high degree of structural and functional similarity. The
           Ras superfamily is divided into at least four families
           in eukaryotes: the Ras, Rho, Rab, and Sar1/Arf families.
           This superfamily also includes proteins like the GTP
           translation factors, Era-like GTPases, and G-alpha chain
           of the heterotrimeric G proteins. Members of the Ras
           superfamily regulate a wide variety of cellular
           functions: the Ras family regulates gene expression, the
           Rho family regulates cytoskeletal reorganization and
           gene expression, the Rab and Sar1/Arf families regulate
           vesicle trafficking, and the Ran family regulates
           nucleocytoplasmic transport and microtubule
           organization. The GTP translation factor family
           regulates initiation, elongation, termination, and
           release in translation, and the Era-like GTPase family
           regulates cell division, sporulation, and DNA
           replication. Members of the Ras superfamily are
           identified by the GTP binding site, which is made up of
           five characteristic sequence motifs, and the switch I
           and switch II regions.
          Length = 161

 Score = 30.9 bits (70), Expect = 2.1
 Identities = 9/19 (47%), Positives = 12/19 (63%)

Query: 440 AVVGPPQVGKSTLIRCLIK 458
            VVG   VGKS+L+  L+ 
Sbjct: 1   VVVGRGGVGKSSLLNALLG 19


>gnl|CDD|226371 COG3853, TelA, Uncharacterized protein involved in tellurite
           resistance [Inorganic ion transport and metabolism].
          Length = 386

 Score = 32.0 bits (73), Expect = 2.1
 Identities = 11/45 (24%), Positives = 21/45 (46%), Gaps = 1/45 (2%)

Query: 177 NMDDDDDDDDDDETDD-DNDEDNEEESEEEEENEEDDDEERKAEK 220
           N D +++   +D+  D D  E    E  ++E+  E D +    +K
Sbjct: 4   NFDIENEATINDKVTDADLPEPENSEIVQKEKKAERDLQLENTKK 48


>gnl|CDD|227388 COG5055, RAD52, Recombination DNA repair protein (RAD52 pathway)
           [DNA replication, recombination, and repair].
          Length = 375

 Score = 31.8 bits (72), Expect = 2.1
 Identities = 18/64 (28%), Positives = 26/64 (40%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
           D    D    D DD  DD +DD  D+D +   +     +E D    K     S N  +  
Sbjct: 256 DLKSLDASKQDQDDLLDDRSDDFQDDDLQNMGDTNVLTDEKDPVVAKQSPTASSNPQITF 315

Query: 231 LTAK 234
           +TA+
Sbjct: 316 VTAR 319


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 32.2 bits (73), Expect = 2.1
 Identities = 30/169 (17%), Positives = 62/169 (36%), Gaps = 11/169 (6%)

Query: 191 DDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGE 250
           +  N+   EEE E+E+  EE   +++K + K    +   +  AK+K P     +   + E
Sbjct: 97  EPKNESGKEEEKEKEQVKEEK--KKKKEKPKEEPKDRKPKEEAKEKRPPK---EKEKEKE 151

Query: 251 RKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESAD----D 306
           +K    +     + R   +  + R K        N K+E  EE++    + E+      +
Sbjct: 152 KKVEEPRDREEEKKRERVRAKS-RPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEE 210

Query: 307 NSSDEDETVEPKTHKDSKNPEEDTG-LNWKSDLAQKAASAFLERQANIV 354
              +E+   E    KD +           +        S+   ++ +  
Sbjct: 211 PDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPS 259



 Score = 31.4 bits (71), Expect = 3.6
 Identities = 17/125 (13%), Positives = 38/125 (30%), Gaps = 10/125 (8%)

Query: 199 EEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKP 258
            +E + E   EE+ ++E+  E+K  K E  +E    +K  +          E++    K 
Sbjct: 95  AKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEE-------AKEKRPPKEKE 147

Query: 259 TVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPK 318
               +     +D   R +             ++  ++      +   +         E  
Sbjct: 148 KEKEKKVEEPRD---REEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAV 204

Query: 319 THKDS 323
             K  
Sbjct: 205 KGKPE 209



 Score = 30.2 bits (68), Expect = 7.2
 Identities = 25/180 (13%), Positives = 64/180 (35%), Gaps = 10/180 (5%)

Query: 141 DTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEE 200
           + ++  G  ++ +++ V  +  +K+ K   +       ++  +     E + + ++  EE
Sbjct: 97  EPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156

Query: 201 ESEEEEENEEDDDEER---KAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTK 257
             + EEE + +    +   K   K       +E   ++K  +A         E      +
Sbjct: 157 PRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQA-------AREAVKGKPE 209

Query: 258 PTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEP 317
              V + R  ++D     + T++    +   +  E    S+ S +  D + S        
Sbjct: 210 EPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRE 269


>gnl|CDD|234719 PRK00300, gmk, guanylate kinase; Provisional.
          Length = 205

 Score = 31.2 bits (72), Expect = 2.1
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 441 VVGPPQVGKSTLIRCLIKNFTKTPLSV 467
           + GP   GKSTL++ L++      LSV
Sbjct: 10  LSGPSGAGKSTLVKALLERDPNLQLSV 36


>gnl|CDD|188099 TIGR00972, 3a0107s01c2, phosphate ABC transporter, ATP-binding
           protein.  This model represents the ATP-binding protein
           of a family of ABC transporters for inorganic phosphate.
           In the model species Escherichia coli, a constitutive
           transporter for inorganic phosphate, with low affinity,
           is also present. The high affinity transporter that
           includes this polypeptide is induced when extracellular
           phosphate concentrations are low. The proteins most
           similar to the members of this family but not included
           appear to be amino acid transporters [Transport and
           binding proteins, Anions].
          Length = 247

 Score = 31.5 bits (72), Expect = 2.2
 Identities = 15/37 (40%), Positives = 20/37 (54%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
           V A++GP   GKSTL+R L +     P   I+G V  
Sbjct: 29  VTALIGPSGCGKSTLLRSLNRMNDLVPGVRIEGKVLF 65


>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein.  This
           family consists of REP proteins from Dictyostelium
           (Slime molds). REP protein is likely involved in
           transcription regulation and control of DNA replication,
           specifically amplification of plasmid at low copy
           numbers. The formation of homomultimers may be required
           for their regulatory activity.
          Length = 910

 Score = 32.1 bits (73), Expect = 2.2
 Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 177 NMDDDDDDDDDDETDDDNDEDNEE 200
           N DDD D+D+D + D+D DED +E
Sbjct: 887 NEDDDADEDEDQDEDEDEDEDEDE 910



 Score = 30.2 bits (68), Expect = 8.2
 Identities = 12/34 (35%), Positives = 22/34 (64%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEE 204
           ++ L     +++DDD D++ D D DED +E+ +E
Sbjct: 877 EEYLISQFQENEDDDADEDEDQDEDEDEDEDEDE 910



 Score = 30.2 bits (68), Expect = 8.9
 Identities = 10/22 (45%), Positives = 18/22 (81%)

Query: 193 DNDEDNEEESEEEEENEEDDDE 214
           D+D D +E+ +E+E+ +ED+DE
Sbjct: 889 DDDADEDEDQDEDEDEDEDEDE 910


>gnl|CDD|184582 PRK14236, PRK14236, phosphate transporter ATP-binding protein;
           Provisional.
          Length = 272

 Score = 31.5 bits (72), Expect = 2.2
 Identities = 11/19 (57%), Positives = 13/19 (68%)

Query: 438 VVAVVGPPQVGKSTLIRCL 456
           V A +GP   GKSTL+RC 
Sbjct: 53  VTAFIGPSGCGKSTLLRCF 71


>gnl|CDD|131368 TIGR02315, ABC_phnC, phosphonate ABC transporter, ATP-binding
           protein.  Phosphonates are a class of
           phosphorus-containing organic compound with a stable
           direct C-P bond rather than a C-O-P linkage. A number of
           bacterial species have operons, typically about 14 genes
           in size, with genes for ATP-dependent transport of
           phosphonates, degradation, and regulation of the
           expression of the system. Members of this protein family
           are the ATP-binding cassette component of tripartite ABC
           transporters of phosphonates [Transport and binding
           proteins, Anions].
          Length = 243

 Score = 31.5 bits (72), Expect = 2.3
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 438 VVAVVGPPQVGKSTLIRCL 456
            VA++GP   GKSTL+RC+
Sbjct: 30  FVAIIGPSGAGKSTLLRCI 48


>gnl|CDD|213191 cd03224, ABC_TM1139_LivF_branched, ATP-binding cassette domain of
           branched-chain amino acid transporter.  LivF (TM1139) is
           part of the LIV-I bacterial ABC-type two-component
           transport system that imports neutral, branched-chain
           amino acids. The E. coli branched-chain amino acid
           transporter comprises a heterodimer of ABC transporters
           (LivF and LivG), a heterodimer of six-helix TM domains
           (LivM and LivH), and one of two alternative soluble
           periplasmic substrate binding proteins (LivK or LivJ).
           ABC transporters are a large family of proteins involved
           in the transport of a wide variety of different
           compounds, like sugars, ions, peptides, and more complex
           organic molecules.
          Length = 222

 Score = 31.3 bits (72), Expect = 2.3
 Identities = 7/20 (35%), Positives = 15/20 (75%)

Query: 438 VVAVVGPPQVGKSTLIRCLI 457
           +VA++G    GK+TL++ ++
Sbjct: 28  IVALLGRNGAGKTTLLKTIM 47


>gnl|CDD|99707 cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of
           cellular Activities) superfamily represents an ancient
           group of ATPases belonging to the ASCE (for additional
           strand, catalytic E) division of the P-loop NTPase fold.
           The ASCE division also includes ABC, RecA-like,
           VirD4-like, PilT-like, and SF1/2 helicases. Members of
           the AAA+ ATPases function as molecular chaperons, ATPase
           subunits of proteases, helicases, or nucleic-acid
           stimulated ATPases. The AAA+ proteins contain several
           distinct features in addition to the conserved
           alpha-beta-alpha core domain structure and the Walker A
           and B motifs of the P-loop NTPases.
          Length = 151

 Score = 30.6 bits (69), Expect = 2.3
 Identities = 11/38 (28%), Positives = 18/38 (47%), Gaps = 1/38 (2%)

Query: 431 PLEPPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVI 468
            L PP  ++ + GPP  GK+TL R +     +     +
Sbjct: 15  ELPPPKNLL-LYGPPGTGKTTLARAIANELFRPGAPFL 51


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.  This
           family includes the HABP4 family of hyaluronan-binding
           proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
           HABP4 has been observed to bind hyaluronan (a
           glucosaminoglycan), but it is not known whether this is
           its primary role in vivo. It has also been observed to
           bind RNA, but with a lower affinity than that for
           hyaluronan. PAI-1 mRNA-binding protein specifically
           binds the mRNA of type-1 plasminogen activator inhibitor
           (PAI-1), and is thought to be involved in regulation of
           mRNA stability. However, in both cases, the sequence
           motifs predicted to be important for ligand binding are
           not conserved throughout the family, so it is not known
           whether members of this family share a common function.
          Length = 106

 Score = 30.1 bits (68), Expect = 2.3
 Identities = 13/50 (26%), Positives = 30/50 (60%)

Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELT 232
            D+  + T++  +E+ + E EE++  EE+ ++E   E++  + E  +E+T
Sbjct: 31  KDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKEMT 80


>gnl|CDD|221866 pfam12935, Sec16_N, Vesicle coat trafficking protein Sec16
           N-terminus.  Sec16 is a multi-domain vesicle coat
           protein. The overall function of Sec16 is in mediating
           the movement of protein-cargo between the organelles of
           the secretory pathway. Over-expression of truncated
           mutants of only the N-terminus are lethal, and this
           portion does not appear to be essential for function so
           may act as a stabilising region.
          Length = 246

 Score = 31.4 bits (71), Expect = 2.3
 Identities = 23/126 (18%), Positives = 51/126 (40%), Gaps = 12/126 (9%)

Query: 100 VHGESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSS 159
           +  +ST + +  +     ++S   S++  D    +++ S+    S  G+ +   +   SS
Sbjct: 99  LTRKSTSQVLDSLGLNPDSLSSPASAEPLDPT-AQNEFSNVLAASTDGNPESESQSEPSS 157

Query: 160 QDGRKRRKVMFDDDLEDNMDD-----------DDDDDDDDETDDDNDEDNEEESEEEEEN 208
           ++    R  + DD+ E    +           D+DDD   + +    E    ++ E  +N
Sbjct: 158 EEELAARAELSDDESESTPTEDDLAERWQAFLDNDDDLLLDDETALAEGPNGDTPENSQN 217

Query: 209 EEDDDE 214
             +DD 
Sbjct: 218 TLNDDS 223


>gnl|CDD|110106 pfam01081, Aldolase, KDPG and KHG aldolase.  This family includes
           the following members: 4-hydroxy-2-oxoglutarate aldolase
           (KHG-aldolase) Phospho-2-dehydro-3-deoxygluconate
           aldolase (KDPG-aldolase).
          Length = 196

 Score = 31.3 bits (71), Expect = 2.3
 Identities = 11/38 (28%), Positives = 20/38 (52%), Gaps = 1/38 (2%)

Query: 717 RSGTGFLAVQDVAKREPGFRVIATGTILDANQTAEVTK 754
           R+     A++ + K  P   ++  GT+L+A Q AE  +
Sbjct: 42  RTPCALDAIRLLRKNRPDA-LVGAGTVLNAQQLAEAAE 78


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 31.9 bits (72), Expect = 2.5
 Identities = 20/97 (20%), Positives = 38/97 (39%), Gaps = 13/97 (13%)

Query: 128 SDDENDED------DTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
           S DE ++D       +S  + ES      K K+ VV+  D     + + + + E      
Sbjct: 50  SSDEEEQDYELRPRVSSSWNNESYNRLPIKTKDNVVA--DVNNGEEFLSESESEA----- 102

Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
             + D D  D+      E++   E   ++  D E++ 
Sbjct: 103 SLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKER 139



 Score = 31.5 bits (71), Expect = 2.9
 Identities = 9/50 (18%), Positives = 24/50 (48%)

Query: 185 DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAK 234
           + + E   + D D ++E ++  E ++   E    ++ +S+ E +  +  K
Sbjct: 97  ESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKERIASICTK 146



 Score = 31.1 bits (70), Expect = 4.5
 Identities = 7/51 (13%), Positives = 22/51 (43%)

Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
           + + +   E D D  ++ ++  EE++   E   +++   +K        ++
Sbjct: 97  ESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKERIASICTKI 147


>gnl|CDD|163585 TIGR03873, F420-0_ABC_ATP, proposed F420-0 ABC transporter,
           ATP-binding protein.  This small clade of ABC-type
           transporter ATP-binding protein components is found as a
           three gene cassette along with a periplasmic
           substrate-binding protein (TIGR03868) and a permease
           (TIGR03869). The organisms containing this cassette are
           all Actinobacteria and all contain numerous genes
           requiring the coenzyme F420. This model was defined
           based on five such organisms, four of which are lacking
           all F420 biosynthetic capability save the final
           side-chain polyglutamate attachment step (via the gene
           cofE: TIGR01916). In Jonesia denitrificans DSM 20603 and
           marine actinobacterium PHSC20C1 this cassette is in an
           apparent operon with the cofE gene and, in PHSC20C1,
           also with a F420-dependent glucose-6-phosphate
           dehydrogenase (TIGR03554). Based on these observations
           we propose that this ATP-binding protein is a component
           of an F420-0 (that is, F420 lacking only the
           polyglutamate tail) transporter.
          Length = 256

 Score = 31.3 bits (71), Expect = 2.6
 Identities = 15/33 (45%), Positives = 19/33 (57%), Gaps = 2/33 (6%)

Query: 424 VPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCL 456
           V  VD T   PP  +  ++GP   GKSTL+R L
Sbjct: 17  VDGVDVTA--PPGSLTGLLGPNGSGKSTLLRLL 47


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 29.9 bits (67), Expect = 2.6
 Identities = 12/24 (50%), Positives = 14/24 (58%)

Query: 189 ETDDDNDEDNEEESEEEEENEEDD 212
            T     E  +EE +EEEE EEDD
Sbjct: 82  ATAGAKAEAKKEEKKEEEEEEEDD 105


>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
           Provisional.
          Length = 291

 Score = 31.3 bits (70), Expect = 2.7
 Identities = 19/60 (31%), Positives = 32/60 (53%), Gaps = 7/60 (11%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERK-------AEKKNSKNEHVQELT 232
           ++DD+D DD++++ +D+D   E EEEE+      +E          EKK   +E   EL+
Sbjct: 75  EEDDEDVDDDSEEGDDDDGGAEDEEEEKVRGQSGQEGTVALGSGSTEKKLIGSEKQTELS 134



 Score = 31.3 bits (70), Expect = 2.9
 Identities = 14/27 (51%), Positives = 19/27 (70%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEE 205
           +DD+D DDD E  DD+D   E+E EE+
Sbjct: 76  EDDEDVDDDSEEGDDDDGGAEDEEEEK 102



 Score = 30.9 bits (69), Expect = 4.2
 Identities = 16/34 (47%), Positives = 21/34 (61%), Gaps = 2/34 (5%)

Query: 179 DDDDDDDDDDETDDDND--EDNEEESEEEEENEE 210
           DD+D DDD +E DDD+   ED EEE    +  +E
Sbjct: 77  DDEDVDDDSEEGDDDDGGAEDEEEEKVRGQSGQE 110


>gnl|CDD|129528 TIGR00436, era, GTP-binding protein Era.  Era is an essential
           GTPase in Escherichia coli and many other bacteria. It
           plays a role in ribosome biogenesis. Few bacteria lack
           this protein [Protein synthesis, Other].
          Length = 270

 Score = 31.2 bits (71), Expect = 2.7
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 439 VAVVGPPQVGKSTLIRCLIK 458
           VA++G P VGKSTL+  L  
Sbjct: 3   VAILGRPNVGKSTLLNQLHG 22


>gnl|CDD|222800 PHA00458, PHA00458, single-stranded DNA-binding protein.
          Length = 233

 Score = 31.3 bits (71), Expect = 2.7
 Identities = 14/44 (31%), Positives = 25/44 (56%), Gaps = 5/44 (11%)

Query: 175 EDNMDDDDDDD-----DDDETDDDNDEDNEEESEEEEENEEDDD 213
           ED+  D+D++D     +       ++E+ +E+ EEE E +ED D
Sbjct: 189 EDDWADEDEEDGYVASEAKARKPQDEEEWDEDDEEESEEDEDGD 232



 Score = 29.4 bits (66), Expect = 9.5
 Identities = 14/37 (37%), Positives = 23/37 (62%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEE 207
           D+D ED     +      + +++ DED+EEESEE+E+
Sbjct: 194 DEDEEDGYVASEAKARKPQDEEEWDEDDEEESEEDED 230


>gnl|CDD|216194 pfam00922, Phosphoprotein, Vesiculovirus phosphoprotein. 
          Length = 283

 Score = 31.4 bits (71), Expect = 2.9
 Identities = 12/81 (14%), Positives = 25/81 (30%)

Query: 135 DDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDN 194
           D T  +  E  +    K       S +          ++L D+  +   +DD        
Sbjct: 14  DQTLSEIEEMEEQRADKSSTFQEDSVEEHTSPSYYLAEELSDSETEPSIEDDQGLYTQLP 73

Query: 195 DEDNEEESEEEEENEEDDDEE 215
             +  E   +   ++  DD+ 
Sbjct: 74  PAEQVEGFIQGPLDDIADDDI 94


>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
           [Translation, ribosomal structure and biogenesis].
          Length = 569

 Score = 31.6 bits (71), Expect = 2.9
 Identities = 24/126 (19%), Positives = 53/126 (42%), Gaps = 3/126 (2%)

Query: 130 DENDEDDTSDDDTESGQGSMKKYKEQVVS--SQDGRKRRKVMFDDDLEDNMDDDDDDDDD 187
            E+++ D   D     +     Y E      +++  K+   + D   ++ +D+++ D   
Sbjct: 187 SEDEDIDYFQDMPSDDEEEEAIYYEDFFDKPTKEPVKKHSDVKDPKEDEELDEEEHDSAM 246

Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK-NEHVQELTAKQKNPKAFTFQSV 246
           D+   D   D E+E   E   E  D      EK+  + +E ++EL  +   PK + +   
Sbjct: 247 DKVKLDLFADEEDEPNAEGVGEASDKNLSSFEKQQIEMDEQIEELEKELVAPKEWKYAGE 306

Query: 247 IKGERK 252
           +  +++
Sbjct: 307 VSAKKR 312



 Score = 30.0 bits (67), Expect = 9.4
 Identities = 26/143 (18%), Positives = 54/143 (37%), Gaps = 7/143 (4%)

Query: 104 STEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDD-DTESGQGSMKKYKEQVVSSQDG 162
             EK M+  D          S  ++D+E   DD     +    + + +K         D 
Sbjct: 104 FKEKQMLSADVSEIEEQSNDSLSENDEEPSMDDEKTSAEAAREEFAEEKRIPDPYGINDK 163

Query: 163 -----RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERK 217
                +  R  +  +D  +  +  +D+D D   D  +D++ EE    E+  ++   E  K
Sbjct: 164 FFDLEKFNRDTLAAEDSNEASEGSEDEDIDYFQDMPSDDEEEEAIYYEDFFDKPTKEPVK 223

Query: 218 AEKKNSKNEHVQELTAKQKNPKA 240
            +  + K+    E   ++++  A
Sbjct: 224 -KHSDVKDPKEDEELDEEEHDSA 245


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 29.5 bits (67), Expect = 2.9
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 199 EEESEEEEENEEDDDEERKAE 219
           E++ EEEEE E+++ EE  A 
Sbjct: 79  EKKEEEEEEEEKEESEEEAAA 99



 Score = 29.5 bits (67), Expect = 3.1
 Identities = 9/20 (45%), Positives = 15/20 (75%)

Query: 196 EDNEEESEEEEENEEDDDEE 215
            + ++E EEEEE +E+ +EE
Sbjct: 77  AEEKKEEEEEEEEKEESEEE 96


>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1.  Rec10 / Red1 is involved in meiotic
           recombination and chromosome segregation during
           homologous chromosome formation. This protein localises
           to the synaptonemal complex in S. cerevisiae and the
           analogous structures (linear elements) in S. pombe. This
           family is currently only found in fungi.
          Length = 706

 Score = 31.8 bits (72), Expect = 3.0
 Identities = 48/333 (14%), Positives = 98/333 (29%), Gaps = 64/333 (19%)

Query: 131 ENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDET 190
           ++ E +T D    S   S +         Q   +  K      + D  +D  +      T
Sbjct: 267 QDIEANTKD----SLHMSAQDNHYDNTQLQTPERSTKR--KSPIWDLKEDQKESKIKSGT 320

Query: 191 D-------DDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTF 243
           +       +   E +     EEE++     ++RK  +  SK     +   + +       
Sbjct: 321 NLKLSSEKESIPETSYVNVLEEEQSPLVRLQKRKLARSTSKTLESLKEVFEDQASSVKNK 380

Query: 244 QSVIK-GERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLN-------------SKEE 289
           Q+  +    +   T   V   P + D  +    K+ + L                    +
Sbjct: 381 QAQSEENLNESPKTPIAVTGDPHLKDPTIIAGKKLMNKLTSEKINNPVKVVKVSKYKGNK 440

Query: 290 EEEEQE----DSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAASA 345
            E++++    D+ F+   + +      ++ + K   + K     +     ++ +Q   S 
Sbjct: 441 SEKKRDINVLDTIFASPVSKELRKKVGKSKQTKLK-NFKPVPNKSKKQLANNNSQNIKSK 499

Query: 346 FLERQA--NIVNLAKYVYGDMEDVSVTMEGDEAPNKKVHRKRQAELTAKQKNPKAFTFQS 403
            + +    N  NL               E    PN K    +Q             T  S
Sbjct: 500 KVVKAKTNNKANLQD-----------VGECSSPPNNKEKNDKQTS-----------TSSS 537

Query: 404 VIKGERKFRRKEDIQAKKHHVPQVDRTPLEPPP 436
           V+K         D  + +   P  +   LE   
Sbjct: 538 VLK--------SDRSSIEVRNPNANVKKLEDTT 562


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 31.4 bits (71), Expect = 3.0
 Identities = 24/87 (27%), Positives = 35/87 (40%), Gaps = 8/87 (9%)

Query: 144 SGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESE 203
               S+     +   S   RKR++       E + D+D+DD D+DE D         E E
Sbjct: 337 FAISSLPDSDSEASDSGPTRKRKRRRVPPLPEYSSDEDEDDSDEDEVDY--------EKE 388

Query: 204 EEEENEEDDDEERKAEKKNSKNEHVQE 230
            +   EED +  R    + SK   V E
Sbjct: 389 RKRRREEDKNFLRLKALELSKYAGVNE 415


>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region.  The
           timeless (tim) gene is essential for circadian function
           in Drosophila. Putative homologues of Drosophila tim
           have been identified in both mice and humans (mTim and
           hTIM, respectively). Mammalian TIM is not the true
           orthologue of Drosophila TIM, but is the likely
           orthologue of a fly gene, timeout (also called tim-2).
           mTim has been shown to be essential for embryonic
           development, but does not have substantiated circadian
           function. Some family members contain a SANT domain in
           this region.
          Length = 507

 Score = 31.6 bits (71), Expect = 3.0
 Identities = 34/209 (16%), Positives = 69/209 (33%), Gaps = 19/209 (9%)

Query: 123 ISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDD 182
           + S      +D ++   D         ++ +E + + Q  R  + +              
Sbjct: 302 LQSCLIRAADDREEDECDQAVPLVPLTEENEEAMENEQFQRLLKALGLRPPRSGQEGFWR 361

Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFT 242
                  T       +    EEE E+E  DD + +   ++          A+Q+     +
Sbjct: 362 IPAKLSSTQLRRRAASLSGEEEEPEDELKDDVDGEQADESEHETLALRKNARQRKAGLAS 421

Query: 243 FQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDE 302
            +    GE + ++        P+    +   + + T   +G NS EE +         D 
Sbjct: 422 PEEEALGEEEQKAP-------PKKKQLNQKNK-QQTG--SGTNSDEERD---------DT 462

Query: 303 SADDNSSDEDETVEPKTHKDSKNPEEDTG 331
           S D++    D+   P+ HKD +     T 
Sbjct: 463 SLDEDRDLADDGGLPRIHKDKRAGASLTQ 491



 Score = 30.1 bits (67), Expect = 9.7
 Identities = 16/105 (15%), Positives = 37/105 (35%), Gaps = 9/105 (8%)

Query: 126 DDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDG-------RKRRKVMFDDDLEDNM 178
           DD D E  + D S+ +T + + + ++ K  + S ++        +   K    +      
Sbjct: 391 DDVDGE--QADESEHETLALRKNARQRKAGLASPEEEALGEEEQKAPPKKKQLNQKNKQQ 448

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS 223
                + D++  D   DED +   +        D     +  ++ 
Sbjct: 449 TGSGTNSDEERDDTSLDEDRDLADDGGLPRIHKDKRAGASLTQSP 493


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 31.6 bits (71), Expect = 3.1
 Identities = 10/40 (25%), Positives = 20/40 (50%)

Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
             D+ E N D      D +E DDD++E++ + +  ++   
Sbjct: 99  VQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKG 138



 Score = 30.0 bits (67), Expect = 9.4
 Identities = 12/43 (27%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
           D+ + + D   +  D++ED+++   EEE+N+    ++ K  KK
Sbjct: 101 DNTEQNQDTKGSKTDSEEDDDD--SEEEDNKSTSSKDGKGSKK 141


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 30.5 bits (69), Expect = 3.1
 Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 2/92 (2%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
           S+ +S D+ ++D   +D     + + KK K      +  +K +K     + E+  +   +
Sbjct: 26  STTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKK--KKEKEEVPELAAE 83

Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
           +  D E +++ND+  + E  + +    + + E
Sbjct: 84  ELSDSEENEENDKKVDYELPKVQNTAAEVNHE 115


>gnl|CDD|215443 PLN02829, PLN02829, Probable galacturonosyltransferase.
          Length = 639

 Score = 31.4 bits (71), Expect = 3.2
 Identities = 20/100 (20%), Positives = 42/100 (42%), Gaps = 10/100 (10%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNP 238
           DDD     DD       +  +++S+++E+N +          +    EHVQ+     +  
Sbjct: 107 DDDTHSQTDDIIKQVTQKAGQDDSDQQEKNSQSQSAS-----QAESLEHVQQS---AQTS 158

Query: 239 KAFTFQSVIKGERKFRSTKPTVVPQPRMSD-KDLAIRHKI 277
           +    +  +  +   + T  TV+P  R+   +D  I+ K+
Sbjct: 159 EKVDEKEPLLTKTD-KQTDQTVMPDARVRQLRDQLIKAKV 197


>gnl|CDD|172730 PRK14242, PRK14242, phosphate transporter ATP-binding protein;
           Provisional.
          Length = 253

 Score = 30.9 bits (70), Expect = 3.2
 Identities = 14/37 (37%), Positives = 22/37 (59%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
           V A++GP   GKST +RCL +     P + ++G + L
Sbjct: 34  VTALIGPSGCGKSTFLRCLNRMNDLIPGARVEGEILL 70


>gnl|CDD|206680 cd01893, Miro1, Mitochondrial Rho family 1 (Miro1), N-terminal.
           Miro1 subfamily. Miro (mitochondrial Rho) proteins have
           tandem GTP-binding domains separated by a linker region
           containing putative calcium-binding EF hand motifs.
           Genes encoding Miro-like proteins were found in several
           eukaryotic organisms. This CD represents the N-terminal
           GTPase domain of Miro proteins. These atypical Rho
           GTPases have roles in mitochondrial homeostasis and
           apoptosis. Most Rho proteins contain a lipid
           modification site at the C-terminus; however, Miro is
           one of few Rho subfamilies that lack this feature.
          Length = 168

 Score = 30.4 bits (69), Expect = 3.2
 Identities = 8/20 (40%), Positives = 13/20 (65%)

Query: 439 VAVVGPPQVGKSTLIRCLIK 458
           + ++G   VGKS+LI  L+ 
Sbjct: 5   IVLIGDEGVGKSSLIMSLVS 24


>gnl|CDD|234357 TIGR03797, NHLM_micro_ABC2, NHLM bacteriocin system ABC
           transporter, ATP-binding protein.  Members of this
           protein family are ABC transporter ATP-binding subunits,
           part of a three-gene putative bacteriocin transport
           operon. The other subunits include another ATP-binding
           subunit (TIGR03796), which has an N-terminal leader
           sequence cleavage domain, and an HlyD homolog
           (TIGR03794). In a number of genomes, members of protein
           families related to nitrile hydratase alpha subunit or
           to nif11 have undergone paralogous family expansions,
           with members possessing a putative bacteriocin cleavage
           region ending with a classic Gly-Gly motif. Those sets
           of putative bacteriocins, members of this protein family
           and its partners TIGR03794 and TIGR03796, and
           cyclodehydratase/docking scaffold fusion proteins of
           thiazole/oxazole biosynthesis frequently show correlated
           species distribution and co-clustering within many of
           those genomes [Transport and binding proteins, Amino
           acids, peptides and amines, Cellular processes,
           Biosynthesis of natural products].
          Length = 686

 Score = 31.5 bits (72), Expect = 3.3
 Identities = 12/19 (63%), Positives = 15/19 (78%)

Query: 439 VAVVGPPQVGKSTLIRCLI 457
           VA+VGP   GKSTL+R L+
Sbjct: 482 VAIVGPSGSGKSTLLRLLL 500


>gnl|CDD|213213 cd03246, ABCC_Protease_Secretion, ATP-binding cassette domain of
           PrtD, subfamily C.  This family represents the ABC
           component of the protease secretion system PrtD, a
           60-kDa integral membrane protein sharing 37% identity
           with HlyB, the ABC component of the alpha-hemolysin
           secretion pathway, in the C-terminal domain. They export
           degradative enzymes by using a type I protein secretion
           system and lack an N-terminal signal peptide, but
           contain a C-terminal secretion signal. The Type I
           secretion apparatus is made up of three components, an
           ABC transporter, a membrane fusion protein (MFP), and an
           outer membrane protein (OMP). For the HlyA transporter
           complex, HlyB (ABC transporter) and HlyD (MFP) reside in
           the inner membrane of E. coli. The OMP component is
           TolC, which is thought to interact with the MFP to form
           a continuous channel across the periplasm from the
           cytoplasm to the exterior. HlyB belongs to the family of
           ABC transporters, which are ubiquitous, ATP-dependent
           transmembrane pumps or channels. The spectrum of
           transport substrates ranges from inorganic ions,
           nutrients such as amino acids, sugars, or peptides,
           hydrophobic drugs, to large polypeptides, such as HlyA.
          Length = 173

 Score = 30.6 bits (70), Expect = 3.3
 Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 2/53 (3%)

Query: 434 PPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSV-IKG-PVTLIIKDSIRDCF 484
            P   +A++GP   GKSTL R ++     T   V + G  ++    + + D  
Sbjct: 26  EPGESLAIIGPSGSGKSTLARLILGLLRPTSGRVRLDGADISQWDPNELGDHV 78


>gnl|CDD|224025 COG1100, COG1100, GTPase SAR1 and related small G proteins [General
           function prediction only].
          Length = 219

 Score = 30.7 bits (69), Expect = 3.3
 Identities = 11/44 (25%), Positives = 19/44 (43%), Gaps = 2/44 (4%)

Query: 439 VAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLIIKDSIRD 482
           + V+G   VGK+TL+  L+ +  + P         L    +I  
Sbjct: 8   IVVLGDGGVGKTTLLNRLVGD--EFPEGYPPTIGNLDPAKTIEP 49


>gnl|CDD|219240 pfam06957, COPI_C, Coatomer (COPI) alpha subunit C-terminus.  This
           family represents the C-terminus (approximately 500
           residues) of the eukaryotic coatomer alpha subunit.
           Coatomer (COPI) is a large cytosolic protein complex
           which forms a coat around vesicles budding from the
           Golgi apparatus. Such coatomer-coated vesicles have been
           proposed to play a role in many distinct steps of
           intracellular transport. Note that many family members
           also contain the pfam04053 domain.
          Length = 421

 Score = 31.4 bits (71), Expect = 3.3
 Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 1/42 (2%)

Query: 175 EDNMDDDDDDDDDD-ETDDDNDEDNEEESEEEEENEEDDDEE 215
              M  DD  +D D   D+D   + E+   ++EE  E  DEE
Sbjct: 33  FAVMGGDDWGEDADLGLDEDGYLEGEDGLLDDEEGPEGGDEE 74


>gnl|CDD|227952 COG5665, NOT5, CCR4-NOT transcriptional regulation complex, NOT5
           subunit [Transcription].
          Length = 548

 Score = 31.6 bits (71), Expect = 3.3
 Identities = 36/193 (18%), Positives = 59/193 (30%), Gaps = 17/193 (8%)

Query: 170 FDDDLE---DNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD-------EERKAE 219
           F DD++   +N DD D  + D   +D   E     S  E   E ++          +K E
Sbjct: 170 FQDDIKYYVENNDDPDFIEYDTIYEDMGCEIQPSSSNNEAPKEGNNQTSLSSIRSSKKQE 229

Query: 220 KKNSKNEHVQELTAKQK-----NPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIR 274
           +   K    ++++   +      P   +    I       ST   +      S K     
Sbjct: 230 RSPKKKAPQRDVSISDRATTPIAPGVESASQSISSTPTPVSTDTPLHTVKDDSIKFDNST 289

Query: 275 HKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKT--HKDSKNPEEDTGL 332
               +    +  KE E + ++   F  +S D+        VE          N E    L
Sbjct: 290 LGTPTTHVSMKKKESENDSEQQLNFPKDSTDEIRKTIQHDVETNAAFQNPLFNDELKWWL 349

Query: 333 NWKSDLAQKAASA 345
             K  L Q     
Sbjct: 350 ASKRYLTQPLQEM 362


>gnl|CDD|238026 cd00071, GMPK, Guanosine monophosphate kinase (GMPK, EC 2.7.4.8),
           also known as guanylate kinase (GKase), catalyzes the
           reversible phosphoryl transfer from adenosine
           triphosphate (ATP) to guanosine monophosphate (GMP) to
           yield adenosine diphosphate (ADP) and guanosine
           diphosphate (GDP). It plays an essential role in the
           biosynthesis of guanosine triphosphate (GTP). This
           enzyme is also important for the activation of some
           antiviral and anticancer agents, such as acyclovir,
           ganciclovir, carbovir, and thiopurines.
          Length = 137

 Score = 29.8 bits (68), Expect = 3.4
 Identities = 14/33 (42%), Positives = 20/33 (60%), Gaps = 3/33 (9%)

Query: 436 PIVVAVVGPPQVGKSTLIRCLIKNF-TKTPLSV 467
            IV++  GP  VGKSTL++ L++ F      SV
Sbjct: 1   LIVLS--GPSGVGKSTLLKRLLEEFDPNFGFSV 31


>gnl|CDD|220245 pfam09444, MRC1, MRC1-like domain.  This putative domain is found
           to be the most conserved region in mediator of
           replication checkpoint protein 1.
          Length = 145

 Score = 30.0 bits (68), Expect = 3.6
 Identities = 18/49 (36%), Positives = 33/49 (67%), Gaps = 8/49 (16%)

Query: 180 DDDDDDDDDETDDD---NDEDNEEESEEEEEN-----EEDDDEERKAEK 220
           DD+D DDDD+++ +   +DE ++E++E +E       EE +D+E++ EK
Sbjct: 28  DDEDSDDDDDSELEEMIDDEYSKEKAENDELAAKRAEEEREDDEKEVEK 76


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 31.1 bits (71), Expect = 3.6
 Identities = 15/45 (33%), Positives = 24/45 (53%), Gaps = 5/45 (11%)

Query: 160 QDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEE 204
           Q  ++RRK       ++  +DDDDD+D+D+  +   E  EE   E
Sbjct: 186 QRKKQRRKRS-----KELREDDDDDEDEDDDGEGGGEGGEERQRE 225


>gnl|CDD|224054 COG1131, CcmA, ABC-type multidrug transport system, ATPase
           component [Defense mechanisms].
          Length = 293

 Score = 30.7 bits (70), Expect = 3.7
 Identities = 7/19 (36%), Positives = 13/19 (68%)

Query: 438 VVAVVGPPQVGKSTLIRCL 456
           +  ++GP   GK+TL++ L
Sbjct: 33  IFGLLGPNGAGKTTLLKIL 51


>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter.  The Monovalent
           Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
           CPA1 family is a large family of proteins derived from
           Gram-positive and Gram-negative bacteria, blue green
           bacteria, yeast, plants and animals. Transporters from
           eukaryotes have been functionally characterized, and all
           of these catalyze Na+:H+ exchange. Their primary
           physiological functions may be in (1) cytoplasmic pH
           regulation, extruding the H+ generated during
           metabolism, and (2) salt tolerance (in plants), due to
           Na+ uptake into vacuoles. This model is specific for the
           fungal members of this family [Transport and binding
           proteins, Cations and iron carrying compounds].
          Length = 810

 Score = 31.4 bits (71), Expect = 3.8
 Identities = 18/71 (25%), Positives = 32/71 (45%), Gaps = 9/71 (12%)

Query: 145 GQGSMKKYKEQVVSSQD--------GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDE 196
           G+  +KK  + VVS  +            R        ED M DD+ + ++ +  +D+D 
Sbjct: 741 GKRKIKKRNDSVVSVDEEKAIEGPSRVPERGNHDLLHSEDEMADDEAESENMDDYEDSD- 799

Query: 197 DNEEESEEEEE 207
           DN  ES++  +
Sbjct: 800 DNAYESKDHAD 810


>gnl|CDD|213788 TIGR03263, guanyl_kin, guanylate kinase.  Members of this family
           are the enzyme guanylate kinase, also called GMP kinase.
           This enzyme transfers a phosphate from ATP to GMP,
           yielding ADP and GDP [Purines, pyrimidines, nucleosides,
           and nucleotides, Nucleotide and nucleoside
           interconversions].
          Length = 179

 Score = 30.2 bits (69), Expect = 3.9
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query: 441 VVGPPQVGKSTLIRCLIKNFTKTPLSV 467
           + GP   GKSTL++ L++       S+
Sbjct: 5   ISGPSGAGKSTLVKALLEEDPNLKFSI 31


>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 944

 Score = 31.2 bits (71), Expect = 3.9
 Identities = 46/329 (13%), Positives = 89/329 (27%), Gaps = 47/329 (14%)

Query: 120 SGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVM---FDDDLED 176
           + +    D+    ++ D++ + +           +   +S +      V      +D  +
Sbjct: 474 ASSSLDADNSAVPEQIDSTAEQSVVNPSVTDTQVDDTSASNNSAADNTVDDNYSAEDTLE 533

Query: 177 N--MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAK 234
           +  +D+ D   D    D   D+     SE      +D+      +   S  E      + 
Sbjct: 534 SNGLDEGDYAQDSAPLDAYQDDYVAFSSESYNALSDDEQHSANVQSAQSAAEAQPSSQSL 593

Query: 235 QKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKD-----LAIRHKITSALAGLNSKEE 289
                                T          +D D     LA R  + S L  L+ KE 
Sbjct: 594 SPIS---------------AVTTAAASL----ADDDILDAVLAARDSLLSDLDALSPKEG 634

Query: 290 EE--EEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAASAFL 347
           +      +    +  S    +S       P   + S + + D      +  +   A+   
Sbjct: 635 DGKKSSADRKPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVPEAA--- 691

Query: 348 ERQANIVNLAKYVYGDMEDVSVTMEGDEAPNKKVHRKRQAELTAKQKNPKAFTFQSVIKG 407
              A+    A     D  D     E  E  +        AE    +    A   +     
Sbjct: 692 --LASGSAPAPPPVPDPYDRPPWEEAPEVASANDGPNNAAEGNLSESVEDASNSEL---- 745

Query: 408 ERKFRRKEDIQAKKHHVPQVDRTPLEPPP 436
                  + ++ +  H PQV      P  
Sbjct: 746 -------QAVEQQATHQPQVQAEAQSPAS 767


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 30.8 bits (70), Expect = 3.9
 Identities = 20/77 (25%), Positives = 31/77 (40%), Gaps = 6/77 (7%)

Query: 198 NEEESE------EEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGER 251
            EEE E      EEE  ++   +++K  KKN+      E T     P   +  +   GE 
Sbjct: 165 TEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEA 224

Query: 252 KFRSTKPTVVPQPRMSD 268
           K    K ++ P    S+
Sbjct: 225 KKLKKKRSIAPDNEKSE 241


>gnl|CDD|222347 pfam13729, TraF_2, F plasmid transfer operon, TraF, protein. 
          Length = 267

 Score = 30.7 bits (70), Expect = 4.0
 Identities = 9/23 (39%), Positives = 11/23 (47%)

Query: 172 DDLEDNMDDDDDDDDDDETDDDN 194
           D+L D  DD  D  D   +  DN
Sbjct: 33  DNLVDKFDDLSDIYDALSSALDN 55


>gnl|CDD|234034 TIGR02858, spore_III_AA, stage III sporulation protein AA.  Members
           of this protein are the stage III sporulation protein
           AA, encoded by one of several genes in the spoIIIA
           locus. It seems that this protein is found in a species
           if and only if that species is capable of endospore
           formation [Cellular processes, Sporulation and
           germination].
          Length = 270

 Score = 30.8 bits (70), Expect = 4.0
 Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 441 VVGPPQVGKSTLIRCLIKNFT 461
           ++ PPQ GK+TL+R L +  +
Sbjct: 116 IISPPQCGKTTLLRDLARILS 136


>gnl|CDD|172741 PRK14253, PRK14253, phosphate ABC transporter ATP-binding protein;
           Provisional.
          Length = 249

 Score = 30.6 bits (69), Expect = 4.1
 Identities = 13/23 (56%), Positives = 16/23 (69%)

Query: 434 PPPIVVAVVGPPQVGKSTLIRCL 456
           P   V A++GP   GKSTL+RCL
Sbjct: 27  PARQVTALIGPSGCGKSTLLRCL 49


>gnl|CDD|206676 cd01889, SelB_euk, SelB, the dedicated elongation factor for
           delivery of selenocysteinyl-tRNA to the ribosome.  SelB
           is an elongation factor needed for the co-translational
           incorporation of selenocysteine. Selenocysteine is coded
           by a UGA stop codon in combination with a specific
           downstream mRNA hairpin. In bacteria, the C-terminal
           part of SelB recognizes this hairpin, while the
           N-terminal part binds GTP and tRNA in analogy with
           elongation factor Tu (EF-Tu). It specifically recognizes
           the selenocysteine charged tRNAsec, which has a UCA
           anticodon, in an EF-Tu like manner. This allows
           insertion of selenocysteine at in-frame UGA stop codons.
           In E. coli SelB binds GTP, selenocysteyl-tRNAsec and a
           stem-loop structure immediately downstream of the UGA
           codon (the SECIS sequence). The absence of active SelB
           prevents the participation of selenocysteyl-tRNAsec in
           translation. Archaeal and animal mechanisms of
           selenocysteine incorporation are more complex. Although
           the SECIS elements have different secondary structures
           and conserved elements between archaea and eukaryotes,
           they do share a common feature. Unlike in E. coli, these
           SECIS elements are located in the 3' UTRs. This group
           contains eukaryotic SelBs and some from archaea.
          Length = 192

 Score = 30.4 bits (69), Expect = 4.1
 Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 3/68 (4%)

Query: 36  VCTAPNNEGIHRQEFEIEPITFIECNND---INSMIDISKVADLVLLLIDASFGFEMEIF 92
           V    + E     + E   IT ++C      I ++I  +++ DL+LL++DA  G + +  
Sbjct: 51  VDKPKHLEDNENPQIENYQITLVDCPGHASLIRTIIGGAQIIDLMLLVVDAKKGIQTQTA 110

Query: 93  EFLNICQV 100
           E L I ++
Sbjct: 111 ECLVIGEL 118


>gnl|CDD|177868 PLN02222, PLN02222, phosphoinositide phospholipase C 2.
          Length = 581

 Score = 31.2 bits (70), Expect = 4.1
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query: 164 KRRKVMFDDDLEDNMDDDDDDDDD 187
           +R K +  +D   + DDDDDD +D
Sbjct: 283 QRNKSVDKNDSNGDDDDDDDDGED 306



 Score = 30.8 bits (69), Expect = 5.2
 Identities = 12/37 (32%), Positives = 19/37 (51%)

Query: 162 GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDN 198
           GR+    +  +   D  D + DDDDDD+  +D  + N
Sbjct: 275 GREVPSFIQRNKSVDKNDSNGDDDDDDDDGEDKSKKN 311


>gnl|CDD|223487 COG0410, LivF, ABC-type branched-chain amino acid transport
           systems, ATPase component [Amino acid transport and
           metabolism].
          Length = 237

 Score = 30.5 bits (70), Expect = 4.3
 Identities = 7/19 (36%), Positives = 14/19 (73%)

Query: 438 VVAVVGPPQVGKSTLIRCL 456
           +VA++G    GK+TL++ +
Sbjct: 31  IVALLGRNGAGKTTLLKTI 49


>gnl|CDD|213220 cd03253, ABCC_ATM1_transporter, ATP-binding cassette domain of
           iron-sulfur clusters transporter, subfamily C.  ATM1 is
           an ABC transporter that is expressed in the
           mitochondria. Although the specific function of ATM1 is
           unknown, its disruption results in the accumulation of
           excess mitochondrial iron, loss of mitochondrial
           cytochromes, oxidative damage to mitochondrial DNA, and
           decreased levels of cytosolic heme proteins. ABC
           transporters are a large family of proteins involved in
           the transport of a wide variety of different compounds,
           like sugars, ions, peptides, and more complex organic
           molecules. The nucleotide binding domain shows the
           highest similarity between all members of the family.
           ABC transporters are a subset of nucleotide hydrolases
           that contain a signature motif, Q-loop, and
           H-loop/switch region, in addition to, the Walker A
           motif/P-loop and Walker B motif commonly found in a
           number of ATP- and GTP-binding and hydrolyzing proteins.
          Length = 236

 Score = 30.7 bits (70), Expect = 4.4
 Identities = 22/81 (27%), Positives = 33/81 (40%), Gaps = 20/81 (24%)

Query: 434 PPPIVVAVVGPPQVGKSTLIRCL-----------------IKNFTKTPLSVIKGPV---T 473
           P    VA+VGP   GKST++R L                 I+  T   L    G V   T
Sbjct: 25  PAGKKVAIVGPSGSGKSTILRLLFRFYDVSSGSILIDGQDIREVTLDSLRRAIGVVPQDT 84

Query: 474 LIIKDSIRDCFVTGKWKASED 494
           ++  D+I      G+  A+++
Sbjct: 85  VLFNDTIGYNIRYGRPDATDE 105


>gnl|CDD|237628 PRK14156, PRK14156, heat shock protein GrpE; Provisional.
          Length = 177

 Score = 30.2 bits (68), Expect = 4.4
 Identities = 20/71 (28%), Positives = 38/71 (53%), Gaps = 8/71 (11%)

Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERK-----AEKKNS 223
           M ++   + ++++ +  + +ET ++  E+  E+SE E  NE  D+ E K     AE +N 
Sbjct: 1   MSEEIKNEEVEEEVEATETEETVEEVVEETPEKSELELANERADEFENKYLRAHAEMQNI 60

Query: 224 K---NEHVQEL 231
           +   NE  Q+L
Sbjct: 61  QRRANEERQQL 71


>gnl|CDD|172756 PRK14268, PRK14268, phosphate ABC transporter ATP-binding protein;
           Provisional.
          Length = 258

 Score = 30.7 bits (69), Expect = 4.4
 Identities = 18/55 (32%), Positives = 25/55 (45%), Gaps = 17/55 (30%)

Query: 434 PPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLIIKDSIRDCFVTGK 488
           P   V A++GP   GKST IRCL +                 + D I++C + GK
Sbjct: 36  PKNSVTALIGPSGCGKSTFIRCLNR-----------------MNDLIKNCRIEGK 73


>gnl|CDD|238886 cd01903, Ntn_AC_NAAA, AC_NAAA This conserved domain includes two
           closely related proteins, acid ceramidase (AC, also
           known as N-acylsphingosine amidohydrolase), and
           N-acylethanolamine-hydrolyzing acid amidase (NAAA).  AC
           catalyzes the hydrolysis of ceramide to sphingosine and
           fatty acid. Ceramide is required for the biosynthesis of
           most sphingolipids and plays an important role in many
           signal transduction pathways by inducing apoptosis
           and/or arresting cell growth. An inherited deficiency of
           AC activity leads to the lysosomal storage disorder
           known as Farber disease.  AC is considered a "rheostat"
           important for maintaining the proper intracellular
           levels of these lipids since hydrolysis of ceramide is
           the only source of sphingosine in cells.  NAAA is a
           eukaryotic glycoprotein that hydrolyzes bioactive
           N-acylethanolamines, including anandamide (an
           endocannabinoid) and N-palmitoylethanolamine (an
           anti-inflammatory and neuroprotective substance), to
           fatty acids and ethanolamine at acidic pH.  NAAA shows
           structural and functional similarity to acid ceramidase,
           but lacks the ceramide-hydrolyzing activity of AC.
          Length = 231

 Score = 30.3 bits (69), Expect = 4.4
 Identities = 31/105 (29%), Positives = 40/105 (38%), Gaps = 34/105 (32%)

Query: 601 NARVELEGFRAGLYIRVELDGMPC-----ELIEN----------------FDPTYPLIVG 639
           N R  L+G   G+   ++ DG+P      E +EN                  P Y  IVG
Sbjct: 78  NERFSLDGGYNGILALLKKDGIPVSWLIRETLENATSYEDAVEKLSTTPILAPAY-FIVG 136

Query: 640 GLQPGEETIGCV--RARVKKHRWYGKILKSGNPVIMSVGWRRFQT 682
           G++PGE   G V  R R      Y   LK+G        W   QT
Sbjct: 137 GVKPGE---GVVITRNRDSVADVYPLDLKNG-------TWFLVQT 171


>gnl|CDD|172749 PRK14261, PRK14261, phosphate ABC transporter ATP-binding protein;
           Provisional.
          Length = 253

 Score = 30.6 bits (69), Expect = 4.5
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 438 VVAVVGPPQVGKSTLIRCL 456
           V A++GP   GKSTL+RC 
Sbjct: 34  VTALIGPSGCGKSTLLRCF 52


>gnl|CDD|213228 cd03261, ABC_Org_Solvent_Resistant, ATP-binding cassette transport
           system involved in resistant to organic solvents.  ABC
           transporters are a large family of proteins involved in
           the transport of a wide variety of different compounds,
           like sugars, ions, peptides, and more complex organic
           molecules. The nucleotide binding domain shows the
           highest similarity between all members of the family.
           ABC transporters are a subset of nucleotide hydrolases
           that contain a signature motif, Q-loop, and
           H-loop/switch region, in addition to, the Walker A
           motif/P-loop and Walker B motif commonly found in a
           number of ATP- and GTP-binding and hydrolyzing proteins.
          Length = 235

 Score = 30.5 bits (70), Expect = 4.6
 Identities = 9/20 (45%), Positives = 16/20 (80%)

Query: 438 VVAVVGPPQVGKSTLIRCLI 457
           ++A++GP   GKSTL+R ++
Sbjct: 28  ILAIIGPSGSGKSTLLRLIV 47


>gnl|CDD|237645 PRK14235, PRK14235, phosphate transporter ATP-binding protein;
           Provisional.
          Length = 267

 Score = 30.5 bits (69), Expect = 4.6
 Identities = 21/68 (30%), Positives = 27/68 (39%), Gaps = 27/68 (39%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLIIKDSIRDCFVTGKWKASEDASE 497
           V A +GP   GKST +RCL +                 + D+I  C VTGK         
Sbjct: 47  VTAFIGPSGCGKSTFLRCLNR-----------------MNDTIDGCRVTGK--------- 80

Query: 498 LLRLDDMD 505
            + LD  D
Sbjct: 81  -ITLDGED 87


>gnl|CDD|234003 TIGR02768, TraA_Ti, Ti-type conjugative transfer relaxase TraA.  This
            protein contains domains distinctive of a single strand
            exonuclease (N-terminus, MobA/MobL, pfam03389) as well as
            a helicase domain (central region, homologous to the
            corresponding region of the F-type relaxase TraI,
            TIGR02760). This protein likely fills the same role as
            TraI(F), nicking (at the oriT site) and unwinding the
            coiled plasmid prior to conjugative transfer.
          Length = 744

 Score = 30.9 bits (70), Expect = 4.7
 Identities = 23/96 (23%), Positives = 41/96 (42%), Gaps = 1/96 (1%)

Query: 924  VIHSEREQKVASLMKMLRTNYSEKNSKEKQAMKA-RMVALKLRAKAAEEAKQQRQRVMKK 982
             IH  RE+ +  ++   + +  E N    Q M A     ++   +AA EA  +R  + + 
Sbjct: 537  TIHDTREEAIEQVVADWKQDLREANPAGSQIMLAHTRKDVRALNEAAREALIERGELGES 596

Query: 983  DIFRTLRMPKVMGVLTHLDMLKNNKTLKTTKKMLKH 1018
             +F+T R  +       +  L+NN+ L     ML  
Sbjct: 597  ILFQTARGERKFAAGDRIVFLENNRDLGVKNGMLGT 632


>gnl|CDD|227468 COG5139, COG5139, Uncharacterized conserved protein [Function
           unknown].
          Length = 397

 Score = 30.8 bits (69), Expect = 4.8
 Identities = 15/86 (17%), Positives = 39/86 (45%), Gaps = 3/86 (3%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS--KNEHVQELTAKQKN 237
                + ++E    N     +E+ +   N+  D +  +  ++ +  +N +V+    K+K+
Sbjct: 24  QKSTINAENENTKQNQSMEPQETSKGTSNDTKDPDNGEKNEEAAIDENSNVEAAERKRKH 83

Query: 238 PKA-FTFQSVIKGERKFRSTKPTVVP 262
               F+  S+++  +  +S +PT  P
Sbjct: 84  ISTDFSDMSLLRKRKNDQSLQPTREP 109


>gnl|CDD|222307 pfam13671, AAA_33, AAA domain.  This family of domains contain only
           a P-loop motif, that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           just short fragments so there is no Walker B motif.
          Length = 143

 Score = 29.6 bits (67), Expect = 4.8
 Identities = 9/21 (42%), Positives = 14/21 (66%)

Query: 438 VVAVVGPPQVGKSTLIRCLIK 458
           ++ +VG P  GKST  R L++
Sbjct: 1   LILMVGLPGSGKSTFARRLLR 21


>gnl|CDD|184594 PRK14264, PRK14264, phosphate ABC transporter ATP-binding protein;
           Provisional.
          Length = 305

 Score = 30.6 bits (69), Expect = 4.8
 Identities = 16/41 (39%), Positives = 22/41 (53%)

Query: 434 PPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
           P   V A++GP   GKST +RCL +   +   + I G V L
Sbjct: 69  PEKSVTALIGPSGCGKSTFLRCLNRMNDRIKAARIDGSVEL 109


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 30.8 bits (70), Expect = 4.9
 Identities = 10/49 (20%), Positives = 24/49 (48%)

Query: 192 DDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKA 240
            D D+D+   +E   EN+E ++++   +KK  K +  ++   +  +   
Sbjct: 31  KDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAY 79


>gnl|CDD|213225 cd03258, ABC_MetN_methionine_transporter, ATP-binding cassette
           domain of methionine transporter.  MetN (also known as
           YusC) is an ABC-type transporter encoded by metN of the
           metNPQ operon in Bacillus subtilis that is involved in
           methionine transport. Other members of this system
           include the MetP permease and the MetQ substrate binding
           protein. ABC transporters are a subset of nucleotide
           hydrolases that contain a signature motif, Q-loop, and
           H-loop/switch region, in addition to, the Walker A
           motif/P-loop and Walker B motif commonly found in a
           number of ATP- and GTP-binding and hydrolyzing proteins.
          Length = 233

 Score = 30.2 bits (69), Expect = 4.9
 Identities = 12/29 (41%), Positives = 17/29 (58%), Gaps = 2/29 (6%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLS 466
           +  ++G    GKSTLIRC+  N  + P S
Sbjct: 33  IFGIIGRSGAGKSTLIRCI--NGLERPTS 59


>gnl|CDD|130570 TIGR01506, ribC_arch, riboflavin synthase.  This archaeal protein
           catalyzes the same reaction, the final step in
           riboflavin biosynthesis, as bacterial riboflavin
           biosynthesis alpha chain. However, it is more similar in
           sequence to 6,7-dimethyl-8-ribityllumazine synthase,
           which catalyzes the previous reaction and which (in
           bacteria) is called the riboflavin synthase beta chain
           [Biosynthesis of cofactors, prosthetic groups, and
           carriers, Riboflavin, FMN, and FAD].
          Length = 151

 Score = 29.9 bits (67), Expect = 4.9
 Identities = 13/36 (36%), Positives = 15/36 (41%)

Query: 207 ENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFT 242
           E+E +D EE K    N   EH Q L      P   T
Sbjct: 99  EDEAEDPEELKVLADNRAREHAQNLIMLLFKPDRLT 134


>gnl|CDD|221323 pfam11931, DUF3449, Domain of unknown function (DUF3449).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 181 to 207 amino acids in length. This domain
           has two conserved sequence motifs: PIP and CEICG. The
           domain carries a zinc-finger domain of the C2H2-type.
          Length = 187

 Score = 29.9 bits (68), Expect = 5.0
 Identities = 11/36 (30%), Positives = 25/36 (69%)

Query: 165 RRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEE 200
           R++ + +++ + + D+  +D  +D +DDD+D+D EE
Sbjct: 27  RKQALTEEERQASADESSEDASEDGSDDDSDDDEEE 62


>gnl|CDD|221125 pfam11500, Cut12, Spindle-body formation-associated protein.  This
           is the central coiled-coil region of cut12 also found in
           other fungi, barring S. cerevisiae. The full protein has
           two predicted coiled-coil regions, and one consensus
           phosphorylation site for p34cdc2 and two for MAP kinase.
           During fission yeast mitosis, the duplicated spindle
           pole bodies (SPBs) nucleate microtubule arrays that
           interdigitate to form the mitotic spindle. Cut12 is
           localised to the SPB throughout the cell cycle,
           predominantly around the inner face of the interphase
           SPB, adjacent to the nucleus. Cut12 associates with Fin1
           and is important in this context for the activity of
           Plo1.
          Length = 149

 Score = 29.7 bits (67), Expect = 5.1
 Identities = 10/54 (18%), Positives = 21/54 (38%)

Query: 161 DGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
           DG+ + +      LED+  + ++      TD       E+      + ++D D 
Sbjct: 1   DGKSKPRTKLTQALEDSRKNSENTTPGTSTDTKQAFPPEDAWNPSSKEDDDGDV 54


>gnl|CDD|224985 COG2074, COG2074, 2-phosphoglycerate kinase [Carbohydrate transport
           and metabolism].
          Length = 299

 Score = 30.4 bits (69), Expect = 5.1
 Identities = 36/152 (23%), Positives = 66/152 (43%), Gaps = 26/152 (17%)

Query: 376 APNK--KVHRKRQAELTAKQKNPKAFTFQSVIKGERKFRRKEDIQ-AKKHHVPQVDRTPL 432
            P+    +  + Q EL  K++  +  T   + +  +K   K D + AK++ + +  R   
Sbjct: 30  DPDLAYSIAIEIQEEL--KKEGIRLVTKDEIREVYQKLLEKGDPEVAKRYLLWRRIRKMK 87

Query: 433 EPPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLIIKDSIRDC---FVTGK- 488
            P  +++ + G   VGKST+   L +           G  ++I  DSIR+     ++ + 
Sbjct: 88  RP--LIILIGGASGVGKSTIAGELARRL---------GIRSVISTDSIREVLRKIISPEL 136

Query: 489 ----WKASEDASELLRLDDMDDDEELFGDFED 516
                 +S DA + LR  D  D+  +   FED
Sbjct: 137 LPTLHTSSYDAWKALR--DPTDENPIIAGFED 166


>gnl|CDD|206748 cd01855, YqeH, Circularly permuted YqeH GTPase.  YqeH is an
           essential GTP-binding protein. Depletion of YqeH induces
           an excess initiation of DNA replication, suggesting that
           it negatively controls initiation of chromosome
           replication. The YqeH subfamily is common in eukaryotes
           and sporadically present in bacteria with probable
           acquisition by plants from chloroplasts. Proteins of the
           YqeH family contain all sequence motifs typical of the
           vast class of P-loop-containing GTPases, but show a
           circular permutation, with a G4-G1-G3 pattern of motifs
           as opposed to the regular G1-G3-G4 pattern seen in most
           GTPases.
          Length = 191

 Score = 29.9 bits (68), Expect = 5.2
 Identities = 12/18 (66%), Positives = 13/18 (72%)

Query: 441 VVGPPQVGKSTLIRCLIK 458
           VVG   VGKSTLI  L+K
Sbjct: 130 VVGATNVGKSTLINALLK 147


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 31.0 bits (70), Expect = 5.3
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEER 216
           E  ++ DE+  EE + EE +EE ++EE+
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEEEK 308


>gnl|CDD|151958 pfam11521, TFIIE-A_C-term, C-terminal general transcription factor
           TFIIE alpha.  TFIIE is compiled of two subunits, alpha
           and beta. This family of proteins are the C terminal
           domain of the alpha subunit of the protein which is the
           largest subunit and contains several functional domains
           which are important for basal transcription and cell
           growth. The C terminal end of the protein binds directly
           to the amino-terminal PH domain of p62/Tfb1 (of IIH)
           which is involved in the recruitment of the general
           transcription factor IIH to the transcription
           preinitiation complex. P53 competes for the same binding
           site as TFIIE alpha which shows their structural
           similarity. Like p53, TFIIE alpha 336-439 can activate
           transcription in vivo.
          Length = 86

 Score = 28.6 bits (64), Expect = 5.3
 Identities = 12/38 (31%), Positives = 23/38 (60%), Gaps = 4/38 (10%)

Query: 186 DDDETDDDNDEDNEE----ESEEEEENEEDDDEERKAE 219
           + D ++ D+D  + E    +S +EEE++EDD+ E   +
Sbjct: 2   ESDTSESDDDSPSPEPPAGQSVDEEEDDEDDEFEEVED 39



 Score = 27.9 bits (62), Expect = 9.0
 Identities = 11/34 (32%), Positives = 17/34 (50%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
           + DDD    E       D EE+ E++E  E +D+
Sbjct: 7   ESDDDSPSPEPPAGQSVDEEEDDEDDEFEEVEDE 40


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 31.0 bits (70), Expect = 5.3
 Identities = 14/71 (19%), Positives = 30/71 (42%), Gaps = 3/71 (4%)

Query: 117  GAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLED 176
            G++ G  +++   + ++    S    +       K + Q     + ++   V+ D + E 
Sbjct: 1398 GSVLGRAATNKETESSENVSGSSSSEKDEIDVSAKPRPQ---RANRKQTTYVLSDSESES 1454

Query: 177  NMDDDDDDDDD 187
              D D DDD+D
Sbjct: 1455 ADDSDFDDDED 1465


>gnl|CDD|237549 PRK13897, PRK13897, type IV secretion system component VirD4;
           Provisional.
          Length = 606

 Score = 30.9 bits (70), Expect = 5.4
 Identities = 8/34 (23%), Positives = 21/34 (61%)

Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
           D   +++E  ++ +     E+ EE+E ++D+D++
Sbjct: 573 DPRANNNEASEETETITVPENNEEDEEDDDEDDD 606


>gnl|CDD|235636 PRK05886, yajC, preprotein translocase subunit YajC; Validated.
          Length = 109

 Score = 29.0 bits (65), Expect = 5.6
 Identities = 9/27 (33%), Positives = 14/27 (51%)

Query: 176 DNMDDDDDDDDDDETDDDNDEDNEEES 202
           D +  DD+DD++   D D D D+    
Sbjct: 82  DRILPDDEDDEELNADLDKDVDDVAGE 108


>gnl|CDD|185558 PTZ00329, PTZ00329, eukaryotic translation initiation factor 1A;
           Provisional.
          Length = 155

 Score = 29.4 bits (66), Expect = 5.6
 Identities = 9/29 (31%), Positives = 19/29 (65%)

Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDED 197
           +FDD+ +D ++  DD+ D++ TD+  +  
Sbjct: 120 IFDDEGDDGIEFQDDESDEEATDEMLNLT 148


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 30.8 bits (69), Expect = 5.7
 Identities = 20/68 (29%), Positives = 35/68 (51%), Gaps = 3/68 (4%)

Query: 196 EDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRS 255
           ED++E  E E E + D+D+   AE   S  + + +       PK F  ++ +K  +K R+
Sbjct: 357 EDDDESEEIESECDPDEDKS-GAEALASIPQTLPDPPTVYGRPKVFARKADLKSTKKCRA 415

Query: 256 --TKPTVV 261
             T P+V+
Sbjct: 416 IVTDPSVI 423


>gnl|CDD|222104 pfam13401, AAA_22, AAA domain. 
          Length = 124

 Score = 29.2 bits (66), Expect = 5.7
 Identities = 7/27 (25%), Positives = 12/27 (44%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTP 464
           +  + G    GK+TL+R L +      
Sbjct: 6   IGVLTGESGSGKTTLLRRLARQLPNRR 32


>gnl|CDD|218576 pfam05395, DARPP-32, Protein phosphatase inhibitor 1/DARPP-32.
           This family consists of several mammalian protein
           phosphatase inhibitor 1 (IPP-1) and dopamine- and
           cAMP-regulated neuronal phosphoprotein (DARPP-32)
           proteins. Protein phosphatase inhibitor-1 is involved in
           signal transduction and is an endogenous inhibitor of
           protein phosphatase-1. It has been demonstrated that
           DARPP-32, if phosphorylated, can inhibit
           protein-phosphatase-1. DARPP-32 has a key role in many
           neurotransmitter pathways throughout the brain and has
           been shown to be involved in controlling receptors, ion
           channels and other physiological factors including the
           brain's response to drugs of abuse, such as cocaine,
           opiates and nicotine. DARPP-32 is reciprocally regulated
           by the two neurotransmitters that are most often
           implicated in schizophrenia - dopamine and glutamate.
           Dopamine activates DARPP-32 through the D1 receptor
           pathway and disables DARPP-32 through the D2 receptor.
           Glutamate, acting through the N-methyl-d-aspartate
           receptor, renders DARPP-32 inactive. A mutant form of
           DARPP-32 has been linked with gastric cancers.
          Length = 170

 Score = 29.9 bits (67), Expect = 5.9
 Identities = 22/113 (19%), Positives = 44/113 (38%), Gaps = 24/113 (21%)

Query: 123 ISSDDSDDENDEDDTSDDDTESGQGSMKKYK--------------EQVVSSQDGRKRRKV 168
           + S+ S  E DE+ T +   + G   M   +              +++V S    +    
Sbjct: 40  LVSEQSSPEIDEERTPNQ-LQKGSLQMSPSQRKNMAYTPPTMKELQRIVESHLQSQ---- 94

Query: 169 MFDDDLEDNMDDDDDDDDD--DETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
                LE+N   +++D+ D  DE      E    E+++ EE      ++ ++E
Sbjct: 95  ---GSLEENQASEEEDELDSPDELGYPTGETESTEAQDSEEPGSCPTQDAQSE 144


>gnl|CDD|129833 TIGR00750, lao, LAO/AO transport system ATPase.  In E. coli,
           mutation of this kinase blocks phosphorylation of two
           transporter system periplasmic binding proteins and
           consequently inhibits those transporters. This kinase is
           also found in Gram-positive bacteria, archaea, and the
           roundworm C. elegans. It may have a more general, but
           still unknown function. Mutations have also been found
           that do not phosphorylate the periplasmic binding
           proteins, yet still allow transport. The ATPase activity
           of this protein seems to be necessary, however
           [Transport and binding proteins, Amino acids, peptides
           and amines, Regulatory functions, Protein interactions].
          Length = 300

 Score = 30.1 bits (68), Expect = 6.0
 Identities = 11/30 (36%), Positives = 14/30 (46%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSV 467
            V + G P  GKSTL+  L     +  L V
Sbjct: 36  RVGITGTPGAGKSTLLEALGMELRRRGLRV 65


>gnl|CDD|184587 PRK14241, PRK14241, phosphate transporter ATP-binding protein;
           Provisional.
          Length = 258

 Score = 30.1 bits (68), Expect = 6.0
 Identities = 15/41 (36%), Positives = 22/41 (53%)

Query: 434 PPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
            P  V A +GP   GKST++R L +     P + ++G V L
Sbjct: 28  EPRSVTAFIGPSGCGKSTVLRTLNRMHEVIPGARVEGEVLL 68


>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
          Length = 130

 Score = 29.1 bits (65), Expect = 6.0
 Identities = 12/44 (27%), Positives = 18/44 (40%)

Query: 178 MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
             +D D   +DE  +   E+  EE  E   N+E      + E K
Sbjct: 1   DYNDMDMGYEDEPSEPEIEEGAEEDSESNNNDEVKGSIGENEDK 44



 Score = 29.1 bits (65), Expect = 7.2
 Identities = 11/51 (21%), Positives = 21/51 (41%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
           D +D D   E +    E  E   E+ E N  D+ +    E ++ + +   +
Sbjct: 1   DYNDMDMGYEDEPSEPEIEEGAEEDSESNNNDEVKGSIGENEDKEEQEPVQ 51


>gnl|CDD|223529 COG0452, Dfp, Phosphopantothenoylcysteine synthetase/decarboxylase
           [Coenzyme metabolism].
          Length = 392

 Score = 30.4 bits (69), Expect = 6.1
 Identities = 11/61 (18%), Positives = 23/61 (37%), Gaps = 3/61 (4%)

Query: 148 SMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEE 207
            +++    ++ + D          DD E  + D D  +D +  +   +E  E   +E  E
Sbjct: 333 KLERKNLDLIVAND--VSEIGFGSDDNEVRIIDKDG-EDVELPEMSKEELAERILDEIAE 389

Query: 208 N 208
            
Sbjct: 390 L 390


>gnl|CDD|236442 PRK09270, PRK09270, nucleoside triphosphate hydrolase
           domain-containing protein; Reviewed.
          Length = 229

 Score = 29.9 bits (68), Expect = 6.2
 Identities = 10/20 (50%), Positives = 12/20 (60%)

Query: 437 IVVAVVGPPQVGKSTLIRCL 456
            +V + GPP  GKSTL   L
Sbjct: 34  TIVGIAGPPGAGKSTLAEFL 53


>gnl|CDD|184597 PRK14270, PRK14270, phosphate ABC transporter ATP-binding protein;
           Provisional.
          Length = 251

 Score = 29.9 bits (67), Expect = 6.2
 Identities = 14/37 (37%), Positives = 20/37 (54%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
           + A++GP   GKST +RCL +         I+G V L
Sbjct: 32  ITALIGPSGCGKSTFLRCLNRMNDLISNVKIEGEVLL 68


>gnl|CDD|226961 COG4598, HisP, ABC-type histidine transport system, ATPase
           component [Amino acid transport and metabolism].
          Length = 256

 Score = 30.1 bits (68), Expect = 6.4
 Identities = 11/27 (40%), Positives = 18/27 (66%), Gaps = 2/27 (7%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTP 464
           V++++G    GKST +RC+  NF + P
Sbjct: 34  VISIIGSSGSGKSTFLRCI--NFLEKP 58


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
           unknown].
          Length = 523

 Score = 30.6 bits (69), Expect = 6.4
 Identities = 21/116 (18%), Positives = 39/116 (33%), Gaps = 8/116 (6%)

Query: 198 NEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTK 257
               S  E + E   ++  K ++K SK     + ++ ++        S +    K + + 
Sbjct: 36  KNRNSHTENKMESGTNDNNKNKEKLSKLYSDVDSSSSEEEEDGSESISKLNVNSK-KISL 94

Query: 258 PTVVPQPRMSDKD----LAIRHKITSAL-AGLNSKEEEEEEQEDSAFSDESADDNS 308
             V  Q    + D    LA   K++ A  A      EEE+ +      D       
Sbjct: 95  NQVSTQKWRKELDLLASLAYLQKLSGAFKAAALLNNEEEDLKY--TIDDVKFFARL 148


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 30.1 bits (68), Expect = 6.4
 Identities = 15/54 (27%), Positives = 26/54 (48%), Gaps = 6/54 (11%)

Query: 546 KTRAELMEKKRKLKEQFD-----AEYDDKDGGGNTYYDDLKTQATRQAELNRQQ 594
           + R E  E+++ L+EQ       A+Y D+      Y  +L+ Q  +  EL + Q
Sbjct: 86  RARVEAEERRKTLQEQTQQEQQRAQYQDELAR-KRYQKELEQQRRQNEELLKMQ 138


>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 324

 Score = 30.2 bits (68), Expect = 6.5
 Identities = 24/122 (19%), Positives = 32/122 (26%), Gaps = 17/122 (13%)

Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDN--------------DEDNEEESEEEEEN 208
           RK R   F D     M    DDD  DE  DD                   E E+  +   
Sbjct: 28  RKERSSYFRDRPLKRMKSKRDDDPYDEVADDEGVGEVRVVRKNEAPQFTQEHEAARQSPQ 87

Query: 209 EEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRST---KPTVVPQPR 265
            +   E   A+ K    +  Q                  +   + R     +    PQP 
Sbjct: 88  HQYQPEYASAQIKIPVPQPPQISDPPAHPQPTQPALDQEQPPEEARQPVLPQEAPAPQPV 147

Query: 266 MS 267
            S
Sbjct: 148 HS 149


>gnl|CDD|165339 PHA03046, PHA03046, Hypothetical protein; Provisional.
          Length = 142

 Score = 29.3 bits (66), Expect = 6.5
 Identities = 13/41 (31%), Positives = 20/41 (48%)

Query: 186 DDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
           DDDE +D N+   +   E+    +  D EE +    N +NE
Sbjct: 11  DDDEENDRNEHREKTSEEDGHYKKRLDVEEEEPNLINIRNE 51


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 30.7 bits (69), Expect = 6.5
 Identities = 14/72 (19%), Positives = 32/72 (44%)

Query: 177 NMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK 236
           N D   + DD+ +   +  E  E+  +++EE   D+ +    E+ + +N  + +   K  
Sbjct: 13  NNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKST 72

Query: 237 NPKAFTFQSVIK 248
             ++     V+K
Sbjct: 73  KEESKQLLEVLK 84


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 30.4 bits (68), Expect = 6.6
 Identities = 24/120 (20%), Positives = 53/120 (44%), Gaps = 25/120 (20%)

Query: 138 SDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDED 197
            DDD E+   S       +   +   + RK +  D + +  ++ + ++D++  + +  + 
Sbjct: 110 HDDDEENLPSS-------IAPPEIDPEPRKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKV 162

Query: 198 NEEESEEEEENE--------EDDDEERKAEKKNSK----------NEHVQELTAKQKNPK 239
            +E+SEEEEE E        + + + +K   K++           ++ +Q+   KQK  K
Sbjct: 163 VDEKSEEEEEEELKTMKDFIDLESQTKKPSVKDNGKSFWSAASVFSKKLQKWRQKQKLKK 222


>gnl|CDD|221145 pfam11596, DUF3246, Protein of unknown function (DUF3246).  This is
           a small family of fungal proteins one of whose members
           from Pichia stipitis is described as being an extremely
           serine rich protein-mucin-like protein.
          Length = 208

 Score = 29.7 bits (66), Expect = 6.6
 Identities = 19/85 (22%), Positives = 31/85 (36%), Gaps = 10/85 (11%)

Query: 108 SMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRK 167
           S+    G    +    ++DD D +++E D     T +  G+       V +   G   + 
Sbjct: 28  SVSTTAGSSTILPTGSATDDDDYDDEETDCEGQTTANPTGT-------VTTDPTGTTSQT 80

Query: 168 VMFDDDLEDNMDDDDDDDDDDETDD 192
           V+          DDDDD    ET  
Sbjct: 81  VVPTKP---TTTDDDDDTTCVETTI 102


>gnl|CDD|234033 TIGR02857, CydD, thiol reductant ABC exporter, CydD subunit.  The
           gene pair cydCD encodes an ABC-family transporter in
           which each gene contains an N-terminal membrane-spanning
           domain (pfam00664) and a C-terminal ATP-binding domain
           (pfam00005). In E. coli these genes were discovered as
           mutants which caused the terminal heme-copper oxidase
           complex cytochrome bd to fail to assemble. Recent work
           has shown that the transporter is involved in export of
           redox-active thiol compounds such as cysteine and
           glutathione. The linkage to assembly of the cytochrome
           bd complex is further supported by the conserved operon
           structure found outside the gammaproteobacteria
           (cydABCD) containing both the transporter and oxidase
           genes components. The genes used as the seed members for
           this model are all either found in the
           gammproteobacterial context or the CydABCD context. All
           members of this family scoring above trusted at the time
           of its creation were from genomes which encode a
           cytochrome bd complex. Unfortunately, the gene symbol
           nomenclature adopted based on this operon in B. subtilis
           assigns cydC to the third gene in the operon where this
           gene is actually homologous to the E. coli cydD gene. We
           have chosen to name all homologs in this family in
           accordance with the precedence of publication of the E.
           coli name, CydD.
          Length = 529

 Score = 30.3 bits (69), Expect = 6.6
 Identities = 13/23 (56%), Positives = 15/23 (65%)

Query: 434 PPPIVVAVVGPPQVGKSTLIRCL 456
           PP   VA+VGP   GKSTL+  L
Sbjct: 346 PPGERVALVGPSGAGKSTLLNLL 368


>gnl|CDD|165222 PHA02896, PHA02896, A-type inclusion like protein; Provisional.
          Length = 616

 Score = 30.4 bits (68), Expect = 6.7
 Identities = 27/101 (26%), Positives = 42/101 (41%), Gaps = 10/101 (9%)

Query: 123 ISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGR--KRRKVMFDDDLEDNMDD 180
           I  D+ D  ND++D    D            +     +D R    +  ++DD+       
Sbjct: 394 IEEDNDDYNNDDEDDLIKDEAPILNRRHGQMDDKYDKRDHRYKNNKYDIYDDESPRYKYK 453

Query: 181 DDDDDDDDETDDD--------NDEDNEEESEEEEENEEDDD 213
           DDD DD+D+ DDD        N ED ++  +E  +   DDD
Sbjct: 454 DDDYDDNDDNDDDHIIPKKANNLEDAKDFEDEMMDAIADDD 494


>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking.  Sec14-like
           Golgi-trafficking domain The GOLD domain is always found
           combined with lipid- or membrane-association domains.
          Length = 136

 Score = 29.3 bits (66), Expect = 6.7
 Identities = 13/38 (34%), Positives = 20/38 (52%)

Query: 194 NDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
             E ++EE EEE E EE +  + +A  K+     V E+
Sbjct: 51  VSESSDEEEEEEAEEEEAETGDVEAGSKSQSRPLVDEI 88


>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
          Length = 1320

 Score = 30.7 bits (69), Expect = 6.8
 Identities = 19/100 (19%), Positives = 40/100 (40%), Gaps = 3/100 (3%)

Query: 134 EDDTSDDDTESGQGSMKKYKEQV-VSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDD 192
            D+  D++ E  + ++++   QV +      +   V  D    +++   D      +  +
Sbjct: 506 VDEDGDEEMEIDEEAVERLCVQVGLQPAGAAEGNNV--DMGRVESIHSSDQQSIIKQGSE 563

Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELT 232
           D D D EE   E+EE  E    +     +N++N    +  
Sbjct: 564 DTDVDMEEAISEQEEKHEITIVDCAEPVRNTQNSLQIDTL 603


>gnl|CDD|223455 COG0378, HypB, Ni2+-binding GTPase involved in regulation of
           expression and maturation of urease and hydrogenase
           [Posttranslational modification, protein turnover,
           chaperones / Transcription].
          Length = 202

 Score = 29.5 bits (67), Expect = 7.0
 Identities = 12/37 (32%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 435 PPIVVAVVGPPQVGKSTLIRCLIKNF-TKTPLSVIKG 470
           P + + V GPP  GK+ LI   ++    +  ++VI G
Sbjct: 12  PMLRIGVGGPPGSGKTALIEKTLRALKDEYKIAVITG 48


>gnl|CDD|235392 PRK05291, trmE, tRNA modification GTPase TrmE; Reviewed.
          Length = 449

 Score = 30.1 bits (69), Expect = 7.1
 Identities = 8/16 (50%), Positives = 11/16 (68%)

Query: 437 IVVAVVGPPQVGKSTL 452
           + V + G P VGKS+L
Sbjct: 216 LKVVIAGRPNVGKSSL 231


>gnl|CDD|213188 cd03221, ABCF_EF-3, ATP-binding cassette domain of elongation
           factor 3, subfamily F.  Elongation factor 3 (EF-3) is a
           cytosolic protein required by fungal ribosomes for in
           vitro protein synthesis and for in vivo growth. EF-3
           stimulates the binding of the EF-1: GTP: aa-tRNA ternary
           complex to the ribosomal A site by facilitated release
           of the deacylated tRNA from the E site. The reaction
           requires ATP hydrolysis. EF-3 contains two ATP
           nucleotide binding sequence (NBS) motifs. NBSI is
           sufficient for the intrinsic ATPase activity. NBSII is
           essential for the ribosome-stimulated functions.
          Length = 144

 Score = 29.0 bits (66), Expect = 7.1
 Identities = 10/37 (27%), Positives = 15/37 (40%)

Query: 439 VAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLI 475
           + +VG    GKSTL++ +          V  G    I
Sbjct: 29  IGLVGRNGAGKSTLLKLIAGELEPDEGIVTWGSTVKI 65


>gnl|CDD|218889 pfam06088, TLP-20, Nucleopolyhedrovirus telokin-like protein-20
           (TLP20).  This family consists of several
           Nucleopolyhedrovirus telokin-like protein-20 (TLP20)
           sequences. The function of this family is unknown but
           TLP20 is known to shares some antigenic similarities to
           the smooth muscle protein telokin although the amino
           acid sequence shows no homologies to telokin.
          Length = 162

 Score = 29.2 bits (66), Expect = 7.2
 Identities = 12/44 (27%), Positives = 21/44 (47%)

Query: 187 DDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
              ++D+NDED E+ ++        DD    + KK   +E  Q+
Sbjct: 119 HHSSEDENDEDEEDNADRAGIESGIDDSAPPSPKKQKLDESEQD 162


>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
           trafficking and secretion].
          Length = 340

 Score = 29.9 bits (68), Expect = 7.3
 Identities = 38/188 (20%), Positives = 66/188 (35%), Gaps = 24/188 (12%)

Query: 288 EEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSD-LAQKAASAF 346
               E+ E+     E+  +   +EDE  E +   +        GL+       +     F
Sbjct: 1   MGFFEKLEEELEELEAEKEKIEEEDEEEEKEGWFERLK----QGLSKTKKNFGKGIKGLF 56

Query: 347 LERQANIVNLAKYVYGDME------DVSVTMEGDEAPNKKVHRKRQAELTAKQKNPKAFT 400
           L++    ++    +  ++E      DV V     E   + +   R+ E   K+K     T
Sbjct: 57  LKKIKEKLDED--LLEELEELLIEADVGV-----ETAEEIIEELRKREGK-KKKIKDEET 108

Query: 401 FQSVIKGERKFRRKEDIQAKKHHVPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCLIKNF 460
            +  ++         +I      V      P E  P V+  VG   VGK+T I  L K  
Sbjct: 109 VKEALREAL-----IEILRPVDKVDLPLEIPKEKKPFVILFVGVNGVGKTTTIAKLAKYL 163

Query: 461 TKTPLSVI 468
            +   SV+
Sbjct: 164 KQQGKSVL 171


>gnl|CDD|217373 pfam03115, Astro_capsid, Astrovirus capsid protein precursor.  This
           product is encoded by astrovirus ORF2, one of the three
           astrovirus ORFs (1a, 1b, 2). The 87kD precursor protein
           undergoes an intracellular cleavage to form a 79kD
           protein. Subsequently, extracellular trypsin cleavage
           yields the three proteins forming the infectious virion.
          Length = 787

 Score = 30.5 bits (69), Expect = 7.4
 Identities = 20/91 (21%), Positives = 39/91 (42%), Gaps = 3/91 (3%)

Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPK 239
           ++   +D  +  +D     E ++E+E   +EDD+ +R     +S +E   +    +    
Sbjct: 655 EERAVEDAPDPYEDLISLEETDTEDESTEDEDDELDRFDLHDSSGSEPEDDDENNRVTLL 714

Query: 240 AFTFQSVIKGERKFRSTK---PTVVPQPRMS 267
           +      +  ER  R TK   PT+  + R S
Sbjct: 715 STLINQGMSVERATRITKRAFPTLAEKTRRS 745


>gnl|CDD|177546 PHA03151, PHA03151, hypothetical protein; Provisional.
          Length = 259

 Score = 29.7 bits (66), Expect = 7.4
 Identities = 35/158 (22%), Positives = 62/158 (39%), Gaps = 11/158 (6%)

Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
            SD ++D++ +    D+DT S   S   +   +VSS D   R     ++D E        
Sbjct: 95  VSDSNNDKDFDFKPQDEDTSSDDSSAPDFITSLVSS-DCEARGLSSSEEDGEPYSKQKMS 153

Query: 184 D--DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAF 241
                D +T++   E++    E+  ++EED  E    ++         ++  K+K  K  
Sbjct: 154 QPLTIDAKTEEITSEEDCCVQEDSSDSEEDVVEAFIRQR--------AQMAGKKKKGKRS 205

Query: 242 TFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITS 279
              S  +  RK R  + +          D   RHK+T 
Sbjct: 206 ISTSDDEPPRKSRRKRHSHRISSSTDSDDEEPRHKMTG 243


>gnl|CDD|226415 COG3899, COG3899, Predicted ATPase [General function prediction
           only].
          Length = 849

 Score = 30.4 bits (69), Expect = 7.6
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKG 470
           VV V G   +GKS L+  + K  T+     IKG
Sbjct: 26  VVLVAGESGIGKSALVNEVHKPITQQRGYFIKG 58


>gnl|CDD|221288 pfam11882, DUF3402, Domain of unknown function (DUF3402).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 350 to 473 amino acids in length. This domain is
           found associated with pfam07923.
          Length = 402

 Score = 30.0 bits (68), Expect = 7.7
 Identities = 10/53 (18%), Positives = 20/53 (37%), Gaps = 1/53 (1%)

Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTA 233
           D +    D  D       E + E      ED  ++ + E+ ++ +E +  L  
Sbjct: 39  DTESLVGDPLDISESV-KELKLEMYTSLAEDQPKKEEIERLSTDSEEIPRLDR 90


>gnl|CDD|225618 COG3076, COG3076, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 135

 Score = 29.0 bits (65), Expect = 7.8
 Identities = 11/16 (68%), Positives = 13/16 (81%)

Query: 177 NMDDDDDDDDDDETDD 192
           N +DDDDD+D DE DD
Sbjct: 116 NAEDDDDDEDVDEDDD 131


>gnl|CDD|183044 PRK11231, fecE, iron-dicitrate transporter ATP-binding subunit;
           Provisional.
          Length = 255

 Score = 29.6 bits (67), Expect = 7.9
 Identities = 10/25 (40%), Positives = 16/25 (64%)

Query: 434 PPPIVVAVVGPPQVGKSTLIRCLIK 458
           P   + A++GP   GKSTL++C  +
Sbjct: 26  PTGKITALIGPNGCGKSTLLKCFAR 50


>gnl|CDD|223272 COG0194, Gmk, Guanylate kinase [Nucleotide transport and
           metabolism].
          Length = 191

 Score = 29.4 bits (67), Expect = 7.9
 Identities = 12/30 (40%), Positives = 20/30 (66%), Gaps = 1/30 (3%)

Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSV 467
           ++ + GP  VGKSTL++ L+++  K   SV
Sbjct: 6   LIVLSGPSGVGKSTLVKALLEDD-KLRFSV 34


>gnl|CDD|213180 cd03213, ABCG_EPDR, Eye pigment and drug resistance transporter
           subfamily G of the ATP-binding cassette superfamily.
           ABCG transporters are involved in eye pigment (EP)
           precursor transport, regulation of lipid-trafficking
           mechanisms, and pleiotropic drug resistance (DR). DR is
           a well-described phenomenon occurring in fungi and
           shares several similarities with processes in bacteria
           and higher eukaryotes. Compared to other members of the
           ABC transporter subfamilies, the ABCG transporter family
           is composed of proteins that have an ATP-binding
           cassette domain at the N-terminus and a TM
           (transmembrane) domain at the C-terminus.
          Length = 194

 Score = 29.4 bits (67), Expect = 8.0
 Identities = 9/19 (47%), Positives = 13/19 (68%)

Query: 438 VVAVVGPPQVGKSTLIRCL 456
           + A++GP   GKSTL+  L
Sbjct: 37  LTAIMGPSGAGKSTLLNAL 55


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 28.5 bits (64), Expect = 8.2
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 195 DEDNEEESEEEEENEEDDDE 214
            ++ EEE +EEE  EE DD+
Sbjct: 83  ADEAEEEEKEEEAEEESDDD 102



 Score = 28.1 bits (63), Expect = 9.7
 Identities = 13/22 (59%), Positives = 14/22 (63%)

Query: 192 DDNDEDNEEESEEEEENEEDDD 213
            + DE  EEE EEE E E DDD
Sbjct: 81  AEADEAEEEEKEEEAEEESDDD 102


>gnl|CDD|220924 pfam10986, DUF2796, Protein of unknown function (DUF2796).  This
           bacterial family of proteins has no known function.
          Length = 161

 Score = 29.2 bits (66), Expect = 8.3
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query: 179 DDDDDDDDDDETDDDNDEDNEEESE 203
           D D D  D D  D  +D ++   S+
Sbjct: 83  DHDHDHHDHDHHDHHDDHEHAGHSD 107


>gnl|CDD|214991 smart01044, Btz, CASC3/Barentsz eIF4AIII binding.  This domain is
           found on CASC3 (cancer susceptibility candidate gene 3
           protein) which is also known as Barentsz (Btz). CASC3 is
           a component of the EJC (exon junction complex) which is
           a complex that is involved in post-transcriptional
           regulation of mRNA in metazoa. The complex is formed by
           the association of four proteins (eIF4AIII, Barentsz,
           Mago, and Y14), mRNA, and ATP. This domain wraps around
           eIF4AIII and stacks against the 5' nucleotide.
          Length = 106

 Score = 28.2 bits (63), Expect = 8.4
 Identities = 5/29 (17%), Positives = 10/29 (34%)

Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERK 217
              D+       +   +     +DDE +K
Sbjct: 4   GDGDEKSAPVANKETGKMRRRLEDDEYKK 32


>gnl|CDD|217326 pfam03029, ATP_bind_1, Conserved hypothetical ATP binding protein. 
           Members of this family are found in a range of archaea
           and eukaryotes and have hypothesised ATP binding
           activity.
          Length = 235

 Score = 29.6 bits (67), Expect = 8.6
 Identities = 17/70 (24%), Positives = 32/70 (45%), Gaps = 8/70 (11%)

Query: 441 VVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLIIKDSIRDCFVTGKWKASEDASELLR 500
           VVGP   GK+T +  L +        ++   V ++  D   +      ++A  D  EL+ 
Sbjct: 1   VVGPAGSGKTTFVGALSEIL-----PLLGRSVYVVNLDPAAENL---PYEADIDIRELIT 52

Query: 501 LDDMDDDEEL 510
           + D+ +D+ L
Sbjct: 53  VADVMEDDGL 62


>gnl|CDD|201520 pfam00945, Rhabdo_ncap, Rhabdovirus nucleocapsid protein.  The
           Nucleocapsid (N) Protein is said to have a "tight"
           structure. The carboxyl end of the N-terminal domain
           possesses an RNA binding domain. Sequence alignments
           show 2 regions of reasonable conservation, approx.
           64-103 and 201-329. A whole functional protein is
           required for encapsidation to take place.
          Length = 406

 Score = 30.0 bits (68), Expect = 8.7
 Identities = 12/51 (23%), Positives = 18/51 (35%), Gaps = 5/51 (9%)

Query: 272 AIRHKITSALA-GLNSKEEEEEEQEDSAFSDESADDNSSDED----ETVEP 317
           A   K T       + K+  E   +D   +D    D+  DE     + VE 
Sbjct: 329 AFGSKATLEQRFFRDEKKYTEYRDDDGDNTDAPLADDGDDEAPKSRDVVEW 379


>gnl|CDD|213197 cd03230, ABC_DR_subfamily_A, ATP-binding cassette domain of the
           drug resistance transporter and related proteins,
           subfamily A.  This family of ATP-binding proteins
           belongs to a multi-subunit transporter involved in drug
           resistance (BcrA and DrrA), nodulation, lipid transport,
           and lantibiotic immunity. In bacteria and archaea, these
           transporters usually include an ATP-binding protein and
           one or two integral membrane proteins. Eukaryotic
           systems of the ABCA subfamily display ABC domains that
           are quite similar to this family. The ATP-binding domain
           shows the highest similarity between all members of the
           ABC transporter family. ABC transporters are a subset of
           nucleotide hydrolases that contain a signature motif,
           Q-loop, and H-loop/switch region, in addition to, the
           Walker A motif/P-loop and Walker B motif commonly found
           in a number of ATP- and GTP-binding and hydrolyzing
           proteins.
          Length = 173

 Score = 29.3 bits (67), Expect = 8.8
 Identities = 7/19 (36%), Positives = 13/19 (68%)

Query: 438 VVAVVGPPQVGKSTLIRCL 456
           +  ++GP   GK+TLI+ +
Sbjct: 28  IYGLLGPNGAGKTTLIKII 46


>gnl|CDD|236876 PRK11191, PRK11191, RNase E inhibitor protein; Provisional.
          Length = 138

 Score = 28.8 bits (65), Expect = 8.9
 Identities = 9/21 (42%), Positives = 12/21 (57%)

Query: 175 EDNMDDDDDDDDDDETDDDND 195
           + N +D DD+D  DE DD   
Sbjct: 117 DPNAEDGDDEDFVDEDDDGVR 137


>gnl|CDD|237040 PRK12289, PRK12289, GTPase RsgA; Reviewed.
          Length = 352

 Score = 30.0 bits (68), Expect = 9.0
 Identities = 11/17 (64%), Positives = 12/17 (70%)

Query: 441 VVGPPQVGKSTLIRCLI 457
           V GP  VGKS+LI  LI
Sbjct: 177 VAGPSGVGKSSLINRLI 193


>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal.  This
           domain family is found in eukaryotes, and is typically
           between 99 and 114 amino acids in length. The family is
           found in association with pfam08697, pfam01585. There
           are two completely conserved residues (G and F) that may
           be functionally important. TIP is involved in enamel
           assembly by interacting with one of the major proteins
           responsible for biomineralisation of enamel - tuftelin.
          Length = 106

 Score = 28.1 bits (63), Expect = 9.0
 Identities = 22/75 (29%), Positives = 34/75 (45%), Gaps = 11/75 (14%)

Query: 126 DDSDDENDEDDTSDDDTESGQGSMKKYKEQV--VSSQDGRKRRKVMFDDDLEDNMDDDDD 183
           DD DD+  E         S +   K Y + V  VS         +       +  D +D+
Sbjct: 41  DDDDDDEGERRRGRGRRRSKKK--KDYSKPVAFVSG-------GIKQAAKEPEEDDKEDE 91

Query: 184 DDDDDETDDDNDEDN 198
            +DDDE+D+D+DED+
Sbjct: 92  SEDDDESDEDDDEDD 106


>gnl|CDD|227494 COG5165, POB3, Nucleosome-binding factor SPN, POB3 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 508

 Score = 30.0 bits (67), Expect = 9.1
 Identities = 10/45 (22%), Positives = 18/45 (40%)

Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
             D ED       ++D+ E +D     + + +EE +      D E
Sbjct: 455 ISDSEDINMGSAGEEDESEDEDFQMVSDSDVAEEYDLQAALSDAE 499


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 30.0 bits (68), Expect = 9.1
 Identities = 32/193 (16%), Positives = 64/193 (33%), Gaps = 31/193 (16%)

Query: 160 QDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDD----------NDEDNEEESEEEEENE 209
           Q+ R+ R     ++  D  D+    +  +  +++            E  E +  E  E  
Sbjct: 611 QNNRRDR-----NERRDTRDNRTRREGRENREENRRNRRQAQQQTAETRESQQAEVTEKA 665

Query: 210 EDDDEER---KAEKKNSKNE--HVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQP 264
              DE++   + E++  +N+     +  AK  N +  + Q   + ER           QP
Sbjct: 666 RTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQETEQEER-------VQQVQP 718

Query: 265 RMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSS----DEDETVEPKTH 320
           R   + L  + +I  ++A        EE         E     +           +    
Sbjct: 719 RRKQRQLNQKVRIEQSVAEEAVAPVVEETVAAEPVVQEVPAPRTELVKVPLPVVAQTAPE 778

Query: 321 KDSKNPEEDTGLN 333
           +D +N  E+   N
Sbjct: 779 QDEENNAENRDNN 791


>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
           Provisional.
          Length = 719

 Score = 30.2 bits (68), Expect = 9.2
 Identities = 15/72 (20%), Positives = 29/72 (40%), Gaps = 1/72 (1%)

Query: 149 MKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEEN 208
           M       VSS+  +K ++     D   N    +    D+++DDD +    E  +E E  
Sbjct: 405 MVNMASASVSSKKNQKPQESSPSLDQT-NAASRNSVMMDEDSDDDEEFQIPESEQEPETT 463

Query: 209 EEDDDEERKAEK 220
           + +  +    E+
Sbjct: 464 KNETKDTAMEEE 475


>gnl|CDD|213834 TIGR03597, GTPase_YqeH, ribosome biogenesis GTPase YqeH.  This
           family describes YqeH, a member of a larger family of
           GTPases involved in ribosome biogenesis. Like YqlF, it
           shows a cyclical permutation relative to GTPases EngA
           (in which the GTPase domain is duplicated), Era, and
           others. Members of this protein family are found in a
           relatively small number of bacterial species, including
           Bacillus subtilis but not Escherichia coli [Protein
           synthesis, Other].
          Length = 360

 Score = 29.9 bits (68), Expect = 9.3
 Identities = 12/26 (46%), Positives = 14/26 (53%)

Query: 439 VAVVGPPQVGKSTLIRCLIKNFTKTP 464
           V VVG   VGKS+LI  L+K      
Sbjct: 157 VYVVGVTNVGKSSLINKLLKQNNGDK 182


>gnl|CDD|234199 TIGR03410, urea_trans_UrtE, urea ABC transporter, ATP-binding
           protein UrtE.  Members of this protein family are ABC
           transporter ATP-binding subunits associated with urea
           transport and metabolism. This protein is found in a
           conserved five-gene transport operon typically found
           adjacent to urease genes. It was shown in Cyanobacteria
           that disruption leads to the loss of high-affinity urea
           transport activity [Transport and binding proteins,
           Amino acids, peptides and amines].
          Length = 230

 Score = 29.4 bits (67), Expect = 9.3
 Identities = 9/20 (45%), Positives = 14/20 (70%)

Query: 438 VVAVVGPPQVGKSTLIRCLI 457
           V  V+G   VGK+TL++ L+
Sbjct: 28  VTCVLGRNGVGKTTLLKTLM 47


>gnl|CDD|235210 PRK04040, PRK04040, adenylate kinase; Provisional.
          Length = 188

 Score = 29.1 bits (66), Expect = 9.4
 Identities = 9/20 (45%), Positives = 12/20 (60%)

Query: 435 PPIVVAVVGPPQVGKSTLIR 454
              VV V G P VGK+T++ 
Sbjct: 1   MMKVVVVTGVPGVGKTTVLN 20


>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
          Length = 196

 Score = 29.1 bits (65), Expect = 9.6
 Identities = 12/64 (18%), Positives = 31/64 (48%)

Query: 178 MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKN 237
           M++ + + + +E  D N+  + E+ E   E+E+    + + +  +   E  Q+L  K+  
Sbjct: 1   MEEVEKEINKEEEKDVNNLSSNEQMEGPPEDEQAQQNQPQQQTVDEIEELKQKLQQKEVE 60

Query: 238 PKAF 241
            + +
Sbjct: 61  AQEY 64


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 28.9 bits (65), Expect = 9.8
 Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 288 EEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEE 328
           EEEE+ +E++   +E        E+E  +  ++ +   PEE
Sbjct: 85  EEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELPTPEE 125


>gnl|CDD|233208 TIGR00956, 3a01205, Pleiotropic Drug Resistance (PDR) Family
           protein.  [Transport and binding proteins, Other].
          Length = 1394

 Score = 30.1 bits (68), Expect = 9.9
 Identities = 17/71 (23%), Positives = 34/71 (47%), Gaps = 3/71 (4%)

Query: 142 TESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETD-DDNDEDNEE 200
           TE  +G+ +K   +++  + G  +R     +    N +D +  +    TD  D  +D  +
Sbjct: 690 TEFNKGAKQK--GEILVFRRGSLKRAKKAGETSASNKNDIEAGEVLGSTDLTDESDDVND 747

Query: 201 ESEEEEENEED 211
           E + E+E+ ED
Sbjct: 748 EKDMEKESGED 758


>gnl|CDD|235865 PRK06814, PRK06814, acylglycerophosphoethanolamine acyltransferase;
           Provisional.
          Length = 1140

 Score = 29.9 bits (68), Expect = 9.9
 Identities = 11/50 (22%), Positives = 24/50 (48%)

Query: 755 KLKLTGVPMKIYKKTAFIKDMFNSTLEVAKFEGAKIRTVSGIRGQIKKAL 804
           ++ +TG  MKIY     I D   + +   + +G +    S ++ Q+++  
Sbjct: 533 RITVTGSLMKIYDGPGMIADKAGAMVVPVRIDGLQFTHFSRLKNQVRRKW 582


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.132    0.385 

Gapped
Lambda     K      H
   0.267   0.0663    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 60,290,475
Number of extensions: 6064000
Number of successful extensions: 21335
Number of sequences better than 10.0: 1
Number of HSP's gapped: 14958
Number of HSP's successfully gapped: 1066
Length of query: 1182
Length of database: 10,937,602
Length adjustment: 108
Effective length of query: 1074
Effective length of database: 6,147,370
Effective search space: 6602275380
Effective search space used: 6602275380
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 65 (29.0 bits)