RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5893
(1182 letters)
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 360 bits (925), Expect = e-106
Identities = 168/525 (32%), Positives = 272/525 (51%), Gaps = 30/525 (5%)
Query: 478 DSIRDCFVTGKWKASEDASELLRLDDMDDDEELFGDFEDLETGEKHSGDKSGGGSGGVSG 537
++ F S + E L DD G+FEDLE E S ++ G
Sbjct: 574 AQLKSRFQKDATLDSIEGEEELIQDDEK------GNFEDLEDEENSSDNEMEESRGSSVT 627
Query: 538 GGSGDDKPKT-----RAELMEKKRKLKEQFD-AEYDDKDGGGNTYYDDLKTQATRQAELN 591
+ + + R E KK +L+ F+ E D + +Y + K + Q ++N
Sbjct: 628 AENEESADEVDYETEREENARKKEELRGNFELEERGDPEKKDVDWYTEEKRKIEEQLKIN 687
Query: 592 RQQFHDLDDNARVELEGFRAGLYIRVELDGMPCELIENFDPTYPLIVGGLQPGEETIGCV 651
R +F + +RV +EG+RAG Y+R+ L +P E ++ F+ YP+++GGL P E+ +G V
Sbjct: 688 RSEFETMVPESRVVIEGYRAGRYVRIVLSHVPLEFVDEFNSRYPIVLGGLLPAEKEMGIV 747
Query: 652 RARVKKHRWYGKILKSGNPVIMSVGWRRFQTLPIYSKQEDNMRYRMLKYTPQHVACMAHF 711
+ R+K+HRW+ KILK+ +P+I SVGWRRFQ++P+YS ++ R RMLKYTP+H+ C F
Sbjct: 748 QGRIKRHRWHKKILKTNDPLIFSVGWRRFQSIPVYSMKDSRTRNRMLKYTPEHMHCNVSF 807
Query: 712 WGPITRSGTGFLAVQDVAKREPGFRVIATGTILDANQTAEVTKKLKLTGVPMKIYKKTAF 771
+GP+ TGF AVQ + FRV+A GTI D N A++ KKLKL G P +I + T F
Sbjct: 808 YGPVVPPNTGFCAVQ---SEKGDFRVLALGTITDVNGDAKLVKKLKLVGYPKQIVQNTVF 864
Query: 772 IKDMFNSTLEVAKFEGAKIRTVSGIRGQIKKALNKPQGAFRATFEDKIMLSDIVFCRTWY 831
++DMF S LEV KFEGA ++ VSG+RGQ+K K G +RA FE K+++SDI+ R +
Sbjct: 865 VRDMFTSDLEVLKFEGASLKAVSGLRGQVKGPHGK-NGEYRAVFEGKMLMSDIITLRCFV 923
Query: 832 KVDIPKLYNPVTSLLLPPEQKDSWTGMKTTGQLKRERGLHNDPQFDSMYTPIVRKPKTMT 891
V++ +++ PV +LL W G++ +++ GL + + +
Sbjct: 924 PVEVHRIFIPVDNLLG------KWRGLRRLHEIRESLGLTHSYAPQNDSSSEEMGYGAEE 977
Query: 892 KLKIPKALQKELPYHMKPKYKSKETPKPQRVAVIHSEREQKVASLMKMLRTNYSEKNSKE 951
+P+ ++ +LP + R + E + +K K+ +E
Sbjct: 978 DYSLPREIESKLP------LDKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEE 1031
Query: 952 KQAMKARMVALKLRAKAAEEAKQQRQRVMKKDIFRTL--RMPKVM 994
K+ M++ A + E+ ++QR R D ++ + + K
Sbjct: 1032 KERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKK 1076
Score = 117 bits (294), Expect = 2e-26
Identities = 52/83 (62%), Positives = 65/83 (78%)
Query: 990 MPKVMGVLTHLDMLKNNKTLKTTKKMLKHRFWTEVYAGAKLFYLSGIVHGEYMKNEVKNL 1049
MP+V+GV+THLD+ KN TL++ KK LKHRFWTE+Y GAKLFYLSG+ +G Y E+ NL
Sbjct: 163 MPRVLGVVTHLDLFKNPSTLRSIKKRLKHRFWTEIYQGAKLFYLSGVENGRYPDREILNL 222
Query: 1050 GRFIAVMKFRPLIWQTTHSYMLV 1072
RFI+VMKFRPL W+ H Y+L
Sbjct: 223 SRFISVMKFRPLEWRNMHPYVLA 245
Score = 105 bits (263), Expect = 8e-23
Identities = 42/75 (56%), Positives = 57/75 (76%)
Query: 1096 QARVKKHRWYGKILKSGNPVIMSVGWRRFQTLPIYSKQEDNMRYRMLKYTPQHVACMAHF 1155
Q R+K+HRW+ KILK+ +P+I SVGWRRFQ++P+YS ++ R RMLKYTP+H+ C F
Sbjct: 748 QGRIKRHRWHKKILKTNDPLIFSVGWRRFQSIPVYSMKDSRTRNRMLKYTPEHMHCNVSF 807
Query: 1156 WGPITRSGTGFLAVQ 1170
+GP+ TGF AVQ
Sbjct: 808 YGPVVPPNTGFCAVQ 822
Score = 89.8 bits (222), Expect = 5e-18
Identities = 47/104 (45%), Positives = 63/104 (60%), Gaps = 7/104 (6%)
Query: 379 KKVHRKRQAELTAKQK------NPKAFTFQSVIKGERKFRRKEDIQAKKHHVPQVDRTPL 432
K H K + + K N KAF ++ + R+ R DI+ KK HVP VDRTP
Sbjct: 5 KAKHSKAEKKKLKKVMDGKVGNNAKAFAVAAIGQMARQAMRTADIEEKKLHVPMVDRTPK 64
Query: 433 E-PPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLI 475
+ PPP +VAVVGPP GKSTLIR L++ FTK + I+GP+T++
Sbjct: 65 DLPPPFIVAVVGPPGTGKSTLIRSLVRRFTKQTIDEIRGPITVV 108
Score = 79.8 bits (196), Expect = 6e-15
Identities = 32/48 (66%), Positives = 40/48 (83%)
Query: 55 ITFIECNNDINSMIDISKVADLVLLLIDASFGFEMEIFEFLNICQVHG 102
ITF+EC +D++ MID++K+ADLVLLLID +FGFEME EFLNI HG
Sbjct: 115 ITFLECPSDLHQMIDVAKIADLVLLLIDGNFGFEMETMEFLNILISHG 162
Score = 33.6 bits (76), Expect = 0.78
Identities = 37/176 (21%), Positives = 69/176 (39%), Gaps = 32/176 (18%)
Query: 122 AISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVV-----------SSQDGRKRR---- 166
A SS+ DE + D ++ S+ E+++ SS K R
Sbjct: 524 AKSSESDLVVQDEPEDFFDVSKVANESISSNHEKLMESEFEELKKKWSSLAQLKSRFQKD 583
Query: 167 ----------KVMFDD---DLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
+++ DD + ED D+++ D++ E + E E +E + E +
Sbjct: 584 ATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETER 643
Query: 214 EERKAEKKNSK-NEHVQELTAKQKNPKAFTFQSVIKGERKF---RSTKPTVVPQPR 265
EE +K+ + N ++E +K + + K E + RS T+VP+ R
Sbjct: 644 EENARKKEELRGNFELEERGDPEKKDVDWYTEEKRKIEEQLKINRSEFETMVPESR 699
Score = 32.8 bits (74), Expect = 1.3
Identities = 46/294 (15%), Positives = 111/294 (37%), Gaps = 25/294 (8%)
Query: 127 DSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDD 186
+SD + D S + G+ + ++ + ++ + R+ ++ FDD D+++D
Sbjct: 388 NSDAIDTVDRESSEIDNVGRKTRRQPTGKAIAEETSRED-ELSFDDSDVST---SDENED 443
Query: 187 DDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSV 246
D T +NE+ES+ EE + D + ++E E + A ++
Sbjct: 444 VDFTGKKGAINNEDESDNEEVAFDSDSQFDESEGNLRWKEGLASKLAYSQS--------- 494
Query: 247 IKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADD 306
K + +S ++ +K SA + + ++E ++ S + +
Sbjct: 495 ----GKRGRNIQKIFYDESLSPEECIEEYKGESAKSSESDLVVQDEPEDFFDVSKVANES 550
Query: 307 NSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAASAFLERQANIVNLAKYVYGDMED 366
SS+ ++ +E + + K L KS + A +E + ++ + G+ ED
Sbjct: 551 ISSNHEKLMESEFEELKKKWSSLAQL--KSRFQKDATLDSIEGEEELIQDDE--KGNFED 606
Query: 367 VSVTMEGDEAPNKKVHRKRQAELTAKQKNPKAFTFQSVIKGERKFRRKEDIQAK 420
+ + ++ + + + + E R+KE+++
Sbjct: 607 LEDEENSSDNEMEESRGSSVTAENEESADEVDYETER----EENARKKEELRGN 656
Score = 32.8 bits (74), Expect = 1.5
Identities = 44/242 (18%), Positives = 90/242 (37%), Gaps = 30/242 (12%)
Query: 122 AISSDDSDDEN----DEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDN 177
AI+ + S ++ D D ++ D+ E + KK ++++D +V FD D +
Sbjct: 417 AIAEETSREDELSFDDSDVSTSDENEDVDFTGKKGA---INNEDESDNEEVAFDSDSQ-- 471
Query: 178 MDDDDDDDDDDETDDDNDEDNEEESE---------EEEENEEDDDEERKAEKKNSKNEHV 228
D+ + + E ++ +E + E+ EE K E S +
Sbjct: 472 FDESEGNLRWKEGLASKLAYSQSGKRGRNIQKIFYDESLSPEECIEEYKGESAKSSESDL 531
Query: 229 QELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQP--RMSDK-----DLAIRHKITSAL 281
Q P+ F F S ++ + K L R + + L
Sbjct: 532 VV----QDEPEDF-FDVSKVANESISSNHEKLMESEFEELKKKWSSLAQLKSRFQKDATL 586
Query: 282 AGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQK 341
+ +EE ++ E F D ++NSSD + + ++N E ++++++ +
Sbjct: 587 DSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREEN 646
Query: 342 AA 343
A
Sbjct: 647 AR 648
>gnl|CDD|218343 pfam04950, DUF663, Protein of unknown function (DUF663). This
family contains several uncharacterized eukaryotic
proteins.
Length = 293
Score = 297 bits (762), Expect = 4e-92
Identities = 110/292 (37%), Positives = 160/292 (54%), Gaps = 14/292 (4%)
Query: 548 RAELMEKKRKLKEQFDAEYDDKDGGGN------TYYDDLKTQATRQAELNRQQFHDLDDN 601
E K R LK + +D + N + + T++ LN +
Sbjct: 10 ARERFAKYRGLKSFRTSPWDPNEKDPNLPNDYARIFQFQNYKNTKKRILNEYEE------ 63
Query: 602 ARVELEGFRAGLYIRVELDGMPCELIENFDPTYPLIVGGLQPGEETIGCVRARVKKHRWY 661
+EG + G Y+R+E+ +P EL ENF+ +IV GL P E + V ++KKHRWY
Sbjct: 64 -EKRIEGVKPGWYVRLEIKNVPKELAENFNSKQLIIVFGLLPHEHKMSVVNFKIKKHRWY 122
Query: 662 GKILKSGNPVIMSVGWRRFQTLPIYSKQEDNMRYRMLKYTPQHVACMAHFWGPITRSGTG 721
K LKS P+I+ VG+RRFQ+ PIYS+ N +++ +Y P H C A F+GPIT +
Sbjct: 123 KKPLKSKEPLIIQVGFRRFQSQPIYSQNTSNDKHKYERYLPPHKTCNATFYGPITFPNSP 182
Query: 722 FLAVQDVAKREPGFRVIATGTILDANQTAEVTKKLKLTGVPMKIYKKTAFIKDMFNSTLE 781
LA + GFR+ ATG IL+ + + KKLKLTG P KI+KKTAF++ MF + +
Sbjct: 183 LLAFKKSFSDTNGFRIAATGCILECDPNRIIAKKLKLTGHPYKIFKKTAFVRYMFFNPED 242
Query: 782 VAKFEGAKIRTVSGIRGQIKKALNKPQGAFRATFEDKIMLSDIVFCRTWYKV 833
V F+ + T SG RG IK+AL G F+ATF+ K+ + DIV + + +V
Sbjct: 243 VEWFKPVPLFTKSGRRGHIKEALG-THGYFKATFDGKLNMQDIVIMKLYKRV 293
Score = 103 bits (259), Expect = 4e-24
Identities = 35/75 (46%), Positives = 49/75 (65%)
Query: 1096 QARVKKHRWYGKILKSGNPVIMSVGWRRFQTLPIYSKQEDNMRYRMLKYTPQHVACMAHF 1155
++KKHRWY K LKS P+I+ VG+RRFQ+ PIYS+ N +++ +Y P H C A F
Sbjct: 113 NFKIKKHRWYKKPLKSKEPLIIQVGFRRFQSQPIYSQNTSNDKHKYERYLPPHKTCNATF 172
Query: 1156 WGPITRSGTGFLAVQ 1170
+GPIT + LA +
Sbjct: 173 YGPITFPNSPLLAFK 187
>gnl|CDD|206669 cd01882, BMS1, Bms1, an essential GTPase, promotes assembly of
preribosomal RNA processing complexes. Bms1 is an
essential, evolutionarily conserved, nucleolar protein.
Its depletion interferes with processing of the 35S
pre-rRNA at sites A0, A1, and A2, and the formation of
40S subunits. Bms1, the putative endonuclease Rc11, and
the essential U3 small nucleolar RNA form a stable
subcomplex that is believed to control an early step in
the formation of the 40S subumit. The C-terminal domain
of Bms1 contains a GTPase-activating protein (GAP) that
functions intramolecularly. It is believed that Rc11
activates Bms1 by acting as a guanine-nucleotide exchange
factor (GEF) to promote GDP/GTP exchange, and that
activated (GTP-bound) Bms1 delivers Rc11 to the
preribosomes.
Length = 231
Score = 169 bits (431), Expect = 1e-47
Identities = 62/83 (74%), Positives = 68/83 (81%)
Query: 990 MPKVMGVLTHLDMLKNNKTLKTTKKMLKHRFWTEVYAGAKLFYLSGIVHGEYMKNEVKNL 1049
PKVMGVLTHLD KNNKTL+ TKK LKHRFWTEVY GAKLFYLSGIVHG Y K E+ NL
Sbjct: 133 FPKVMGVLTHLDKFKNNKTLRKTKKRLKHRFWTEVYDGAKLFYLSGIVHGRYPKTEILNL 192
Query: 1050 GRFIAVMKFRPLIWQTTHSYMLV 1072
RFI+VMKFRPL W+ +H Y+L
Sbjct: 193 ARFISVMKFRPLNWRNSHPYVLA 215
Score = 130 bits (329), Expect = 5e-34
Identities = 51/88 (57%), Positives = 64/88 (72%)
Query: 398 AFTFQSVIKGERKFRRKEDIQAKKHHVPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCLI 457
AF QS ++ R+F+R +D++ KK HVP VDRTP EPPP+VV VVGPP VGKSTLIR LI
Sbjct: 1 AFAVQSAVRAARQFQRTQDLEEKKLHVPVVDRTPEEPPPLVVVVVGPPGVGKSTLIRSLI 60
Query: 458 KNFTKTPLSVIKGPVTLIIKDSIRDCFV 485
K +TK LS IKGP+T++ R F+
Sbjct: 61 KRYTKQNLSDIKGPITIVTGKKRRLTFI 88
Score = 111 bits (280), Expect = 2e-27
Identities = 39/48 (81%), Positives = 44/48 (91%)
Query: 55 ITFIECNNDINSMIDISKVADLVLLLIDASFGFEMEIFEFLNICQVHG 102
+TFIEC NDINSMID++K+ADLVLLLID S+GFEME FEFLNI QVHG
Sbjct: 85 LTFIECPNDINSMIDVAKIADLVLLLIDGSYGFEMETFEFLNILQVHG 132
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 90.9 bits (225), Expect = 2e-18
Identities = 60/218 (27%), Positives = 101/218 (46%), Gaps = 5/218 (2%)
Query: 611 AGLYIRVELDGMPCELIENF-DPTYPLIVGGLQPGEETIGCVRARVKKHRWYGKILKSGN 669
G +R++L P L E +P L+V GL E+ +++H Y LKS
Sbjct: 535 DGQMVRIKLR-FPKFLYEGLIEPQILLVVYGLLEYEDKKTVHNFSLQRHFEYEVPLKSEE 593
Query: 670 PVIMSVGWRRFQTLPIYSKQED--NMRYRMLKYTPQHVACMAHFWGPITRSGTGFLAVQD 727
+++ +G RR P+ SK + N + + + +A F GPI+ + + +
Sbjct: 594 SMVVQLGHRRVDICPLISKGSNSPNNNQKYFRRLKPLESGVASFIGPISFGLSPVIIFKK 653
Query: 728 VAKREPGFRVIATGTILDANQTAEVTKKLKLTGVPMKIYKKTAFIKDMFNSTLEVAKFEG 787
A E ++A+G + + + + K+ LTG P K +K+ ++ MF S +V F+
Sbjct: 654 SALDELSATLLASGGMNNFDGDRVIAKRAVLTGHPFKNHKRYVTVRYMFFSPEDVMWFKN 713
Query: 788 AKIRTVSGIRGQIKKALNKPQGAFRATFEDKIMLSDIV 825
++ T G G IK+ L G F+ATF KI D V
Sbjct: 714 IQLFTKRGRTGFIKEPLGT-HGYFKATFSGKIKSQDKV 750
Score = 33.9 bits (77), Expect = 0.56
Identities = 18/71 (25%), Positives = 34/71 (47%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
D++ ED +D++ DDD E++ EE +EE+ DD+E + + ++E
Sbjct: 395 DEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLRE 454
Query: 231 LTAKQKNPKAF 241
+K + F
Sbjct: 455 FRDMEKEDREF 465
Score = 33.9 bits (77), Expect = 0.59
Identities = 27/116 (23%), Positives = 46/116 (39%), Gaps = 6/116 (5%)
Query: 120 SGAISSDDSDDENDEDDTSDDDTESGQGS---MKKYKEQVVSSQDGRKRRKVMFDDDLED 176
+ DD + TS S K +++ M D +D
Sbjct: 358 LTTVRIDDHGFLPGREQTSKKAAVPKGTSFYQAKWAEDEEEEDGQCNDEESTMSAIDDDD 417
Query: 177 NMDDDDDD-DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
++D+++ D+E+ D++E EE S EEEE + E R EK++ + EL
Sbjct: 418 PKENDNEEVAGDEESAIDDNEGFEELSPEEEERQL--REFRDMEKEDREFPDEAEL 471
Score = 32.0 bits (72), Expect = 2.3
Identities = 17/61 (27%), Positives = 28/61 (45%), Gaps = 2/61 (3%)
Query: 1101 KHRWYGKILKSGNPVIMSVGWRRFQTLPIYSKQED--NMRYRMLKYTPQHVACMAHFWGP 1158
+H Y LKS +++ +G RR P+ SK + N + + + +A F GP
Sbjct: 581 RHFEYEVPLKSEESMVVQLGHRRVDICPLISKGSNSPNNNQKYFRRLKPLESGVASFIGP 640
Query: 1159 I 1159
I
Sbjct: 641 I 641
Score = 30.8 bits (69), Expect = 5.9
Identities = 14/81 (17%), Positives = 36/81 (44%)
Query: 139 DDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDN 198
D++ E GQ + ++ + D ++ D E +DD++ ++ +++
Sbjct: 395 DEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLRE 454
Query: 199 EEESEEEEENEEDDDEERKAE 219
+ E+E+ D+ E + +E
Sbjct: 455 FRDMEKEDREFPDEAELQPSE 475
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 60.4 bits (147), Expect = 5e-09
Identities = 30/118 (25%), Positives = 57/118 (48%), Gaps = 8/118 (6%)
Query: 126 DDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDD 185
++ D+E ++ S DD + + Q + D ED DDDDD +
Sbjct: 286 EEDDEEEEDSKESADDLDDE--FEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLE 343
Query: 186 DDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTF 243
+++E D +DE+ +EE E+ ++ +++++EE + EKK K+ ++ FTF
Sbjct: 344 EEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKS------AESTRSELPFTF 395
Score = 48.8 bits (117), Expect = 2e-05
Identities = 28/93 (30%), Positives = 47/93 (50%), Gaps = 17/93 (18%)
Query: 165 RRKVMF---DDDLEDNM--------------DDDDDDDDDDETDDDNDEDNEEESEEEEE 207
++K MF DDD ++ + DD DDDD D+ D + ++ +
Sbjct: 99 KKKSMFNLEDDDEDEFVLTHLGQSLSEIDKDDDVRDDDDFDDDDLGDLASDDRAAHFGGG 158
Query: 208 NEEDDDEERKAEKKNSKNEHVQELTAKQKNPKA 240
++++DEE + E+K SK E ++E+ AK K KA
Sbjct: 159 EDDEEDEEEQPERKKSKKEVMKEVIAKSKFYKA 191
Score = 40.8 bits (96), Expect = 0.005
Identities = 21/66 (31%), Positives = 36/66 (54%), Gaps = 10/66 (15%)
Query: 164 KRRKVMFDDDLEDNMDDDDDDDDDDETDDDND----------EDNEEESEEEEENEEDDD 213
+R + +DD E+ + DD DDE + D+D ++ EEE ++E+EEDDD
Sbjct: 280 RRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDD 339
Query: 214 EERKAE 219
++ + E
Sbjct: 340 DDLEEE 345
Score = 38.1 bits (89), Expect = 0.034
Identities = 16/73 (21%), Positives = 30/73 (41%), Gaps = 6/73 (8%)
Query: 502 DDMDDDEELFGDFEDLETGEKHSGDKSGGGSGGVSGGGSGDDKPKTRAELMEK---KRKL 558
D DDD+ D DL + + GGG ++ K++ E+M++ K K
Sbjct: 131 DVRDDDDFDDDDLGDLASDD--RAAHFGGGEDDEEDEEEQPERKKSKKEVMKEVIAKSKF 188
Query: 559 -KEQFDAEYDDKD 570
K + ++ +
Sbjct: 189 YKAERQKAKEEDE 201
Score = 35.0 bits (81), Expect = 0.29
Identities = 53/283 (18%), Positives = 96/283 (33%), Gaps = 76/283 (26%)
Query: 107 KSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRR 166
+S+ EID DD D ++D+ D + + E+ Q RK+
Sbjct: 121 QSLSEIDKDDDVR------DDDDFDDDDLGDLASDDRAAHFGGGEDDEEDEEEQPERKKS 174
Query: 167 K--VM--------F-----------DDDLEDNMDDDDDD---------------DDDDET 190
K VM F D+DL + +DDD D +E
Sbjct: 175 KKEVMKEVIAKSKFYKAERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMTPEEK 234
Query: 191 DDDNDE------------------DNEEESEEEEENEEDDDEERKA----EKKNSKNEHV 228
DD+ D+ EE ++EE E + + ER E+++ + E
Sbjct: 235 DDEYDQRVRELTFDRRAQPTDRTKTEEELAKEEAERLKKLEAERLRRMRGEEEDDEEEED 294
Query: 229 QELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKE 288
+ +A + + G + + + + D+D +
Sbjct: 295 SKESADDLDDEFEPDDDDNFGLGQ--GEEDEEEEEDGVDDEDEED----------DDDDL 342
Query: 289 EEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTG 331
EEEEE D + +E +D SD+++ E + + K ++
Sbjct: 343 EEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAE 385
Score = 33.4 bits (77), Expect = 0.90
Identities = 23/155 (14%), Positives = 52/155 (33%), Gaps = 45/155 (29%)
Query: 178 MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKN 237
M +++DD+++E ++ +D ++E E ++++ + + E++
Sbjct: 282 MRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVD---------- 331
Query: 238 PKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDS 297
+ E D DL +E+ + E+
Sbjct: 332 ---------DEDEED--------------DDDDLE------------EEEEDVDLSDEEE 356
Query: 298 AFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGL 332
DE +DD +E+E E + K + L
Sbjct: 357 DEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391
>gnl|CDD|219725 pfam08142, AARP2CN, AARP2CN (NUC121) domain. This domain is the
central domain of AARP2. It is weakly similar to the
GTP-binding domain of elongation factor TU.
Length = 81
Score = 52.2 bits (126), Expect = 2e-08
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 1045 EVKNLGRFIAVMKFRPLIWQTTHSYMLV 1072
E+ NL RFI+VMK RPL W+ YML
Sbjct: 1 EILNLLRFISVMKPRPLSWRDQRPYMLA 28
>gnl|CDD|129021 smart00785, AARP2CN, AARP2CN (NUC121) domain. This domain is the
central domain of AARP2. It is weakly similar to the
GTP-binding domain of elongation factor TU.
Length = 83
Score = 52.2 bits (126), Expect = 2e-08
Identities = 16/28 (57%), Positives = 20/28 (71%)
Query: 1045 EVKNLGRFIAVMKFRPLIWQTTHSYMLV 1072
E+ NL RF++VMK RPL W+ H YML
Sbjct: 1 EILNLLRFLSVMKPRPLSWRDQHPYMLA 28
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 54.7 bits (132), Expect = 1e-07
Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 3/71 (4%)
Query: 144 SGQGSMKKYKEQVVSSQDGRKRRKVMFD---DDLEDNMDDDDDDDDDDETDDDNDEDNEE 200
G G+MK K+ ++ ++R D D E + DDDDD+D E+D D+ +D+
Sbjct: 218 EGGGAMKVLKQPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSDDDVS 277
Query: 201 ESEEEEENEED 211
+ + E+ + D
Sbjct: 278 DEDGEDLFDTD 288
Score = 37.4 bits (87), Expect = 0.038
Identities = 23/96 (23%), Positives = 39/96 (40%), Gaps = 10/96 (10%)
Query: 130 DENDEDDTSDDDTESGQGSM--KKYKEQVVSSQDGR------KRRKVMFDDDLEDNMDDD 181
D DD++E+ EQ + +G K+ K +
Sbjct: 185 PPQQTDGAGDDESEALVRLREADGTLEQRIKGAEGGGAMKVLKQPKKQAKSSKRRTIAQI 244
Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDD--DEE 215
D D DDE D +D+D+E+ E + ++ +DD DE+
Sbjct: 245 DGIDSDDEGDGSDDDDDEDAIESDLDDSDDDVSDED 280
Score = 37.0 bits (86), Expect = 0.046
Identities = 16/96 (16%), Positives = 32/96 (33%), Gaps = 14/96 (14%)
Query: 103 ESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDG 162
+ E+ + +G G + ++ + T +
Sbjct: 208 GTLEQRIKGAEG--GGAMKVLKQPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDD---- 261
Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDN 198
+D +E ++DD DDD D++ +D D DN
Sbjct: 262 --------EDAIESDLDDSDDDVSDEDGEDLFDTDN 289
>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
Provisional.
Length = 337
Score = 54.4 bits (131), Expect = 2e-07
Identities = 17/56 (30%), Positives = 28/56 (50%), Gaps = 1/56 (1%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEH 227
D + + D+DDDD D D + ++ S + + NEEDD E+++ S H
Sbjct: 280 IDEDSDYSSDEDDDDYDSYDSSDSASSDSNS-DVDTNEEDDRGEKESNGAKSNELH 334
Score = 36.7 bits (85), Expect = 0.064
Identities = 14/85 (16%), Positives = 24/85 (28%), Gaps = 29/85 (34%)
Query: 123 ISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDD 182
+ D+ D + ++D D D D D+ D
Sbjct: 278 EAIDEDSDYSSDEDDDDYD-----------------------------SYDSSDSASSDS 308
Query: 183 DDDDDDETDDDNDEDNEEESEEEEE 207
+ D D +DD E ++ E
Sbjct: 309 NSDVDTNEEDDRGEKESNGAKSNEL 333
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 54.2 bits (130), Expect = 3e-07
Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 9/125 (7%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQV--VSSQDGRKRRKVMFDDDLEDNMDDD 181
D +D+ DE D +D + + KK K D +K ++ DD E + DD
Sbjct: 218 LEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDG 277
Query: 182 DDD-------DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAK 234
DD+ D + +D +E ++ S E E + +E E + KNE L+ K
Sbjct: 278 DDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKK 337
Query: 235 QKNPK 239
K K
Sbjct: 338 GKKLK 342
Score = 39.2 bits (91), Expect = 0.012
Identities = 21/102 (20%), Positives = 48/102 (47%), Gaps = 16/102 (15%)
Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDD 184
SDD DDE E+D D + SG ++ E +S + K + + D
Sbjct: 274 SDDGDDEGREEDYISDSSASGNDPEER--EDKLSPEIPAKP--------------EIEQD 317
Query: 185 DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
+D +E++++ +E+ S++ ++ ++ ++ +K +S +
Sbjct: 318 EDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSG 359
Score = 36.1 bits (83), Expect = 0.12
Identities = 34/186 (18%), Positives = 64/186 (34%), Gaps = 28/186 (15%)
Query: 169 MFDDDLEDNMDDDDDDDDDDETDD------------DNDEDNEEESEEEEENEEDDDEER 216
M ++ D+D++ E + D + D+E++ +E ++ ED DEE+
Sbjct: 181 MMMKAAKNGPAAFGDEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEK 240
Query: 217 KAEKKNSKNEHVQELTAKQKNPK-----AFTFQSVIKGERKFRSTKPTVVPQPRMSDKDL 271
+KK ++ ++L +K + A + S G+ + + S D
Sbjct: 241 SKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSD-DGD--DEGREEDYISDSSASGNDP 297
Query: 272 AIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTG 331
R S E + E D + +E+E K K K +
Sbjct: 298 EEREDKLS--------PEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKN 349
Query: 332 LNWKSD 337
K D
Sbjct: 350 GLDKDD 355
Score = 35.3 bits (81), Expect = 0.18
Identities = 31/146 (21%), Positives = 55/146 (37%), Gaps = 9/146 (6%)
Query: 121 GAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDD 180
I D+ +E++E+ ++ S +G K K + G+K D D D+ DD
Sbjct: 312 PEIEQDEDSEESEEEKNEEEGGLSKKG-KKLKKLK------GKKNGLDKDDSDSGDDSDD 364
Query: 181 DDDDDDDDETDDDNDEDNEEESEEE-EENEEDDDEERKAE-KKNSKNEHVQELTAKQKNP 238
D D +D + + E + EE + N A SK++ ++ +
Sbjct: 365 SDIDGEDSVSLVTAKKQKEPKKEEPVDSNPSSPGNSGPARPSPESKDKGKRKAANEVSKS 424
Query: 239 KAFTFQSVIKGERKFRSTKPTVVPQP 264
A +K E +S+ PQ
Sbjct: 425 PASVPAKKLKTENAPKSSSGKSTPQT 450
Score = 33.0 bits (75), Expect = 0.94
Identities = 19/82 (23%), Positives = 34/82 (41%), Gaps = 3/82 (3%)
Query: 124 SSDDSDDENDEDDTSDDDTESG---QGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDD 180
S+ +D E ED S + Q + E+ + ++G +K L+ +
Sbjct: 291 SASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNG 350
Query: 181 DDDDDDDDETDDDNDEDNEEES 202
D DD D D D+ + + E+S
Sbjct: 351 LDKDDSDSGDDSDDSDIDGEDS 372
Score = 31.1 bits (70), Expect = 4.6
Identities = 31/221 (14%), Positives = 70/221 (31%), Gaps = 43/221 (19%)
Query: 129 DDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDD 188
D+ + + E G G K K + + DD+ + + D +D D+
Sbjct: 189 GPAAFGDEDEETEGEKGGGGRGK----------DLKIKDLEGDDEDDGDESDKGGEDGDE 238
Query: 189 ETD----------------------DDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
E +D+ +E +S++ ++ ++D + + E
Sbjct: 239 EKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPE 298
Query: 227 HVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNS 286
++ + + K Q E + ++++ + K
Sbjct: 299 EREDKLSPEIPAKPEIEQDEDSEESEEEK-----------NEEEGGLSKKGKKLKKLKGK 347
Query: 287 KEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPE 327
K +++ DS + +D + D V K K+ K E
Sbjct: 348 KNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQKEPKKEE 388
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 53.8 bits (130), Expect = 4e-07
Identities = 30/107 (28%), Positives = 48/107 (44%)
Query: 120 SGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMD 179
+ A + DE D+S + V + DDD +D D
Sbjct: 85 AAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDD 144
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
DD DDDDDDE DD++D+D++ + E+EE+ E + E+ + +E
Sbjct: 145 DDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDE 191
Score = 53.1 bits (128), Expect = 7e-07
Identities = 38/147 (25%), Positives = 73/147 (49%), Gaps = 4/147 (2%)
Query: 103 ESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSD----DDTESGQGSMKKYKEQVVS 158
ES +K+ +ID + +SG + D E+D + + KK K+++ S
Sbjct: 43 ESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDS 102
Query: 159 SQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
S+ K+ + DDDL D D + DD+ DDD+D+D +++ ++++++EDDDE+
Sbjct: 103 SKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDD 162
Query: 219 EKKNSKNEHVQELTAKQKNPKAFTFQS 245
+ + ++E +E +K F
Sbjct: 163 DDVDDEDEEKKEAKELEKLSDDDDFVW 189
Score = 46.5 bits (111), Expect = 6e-05
Identities = 29/112 (25%), Positives = 56/112 (50%), Gaps = 7/112 (6%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
SS ++ +N D D + + + + D + DDD++D+ DD+DD
Sbjct: 102 SSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDD-----DLDDDDIDDDDDDEDD 156
Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK--KNSKNEHVQELTA 233
D+DDD+ D D++++ ++E++E E+ +DDD + + +LTA
Sbjct: 157 DEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLTA 208
Score = 43.8 bits (104), Expect = 5e-04
Identities = 15/99 (15%), Positives = 37/99 (37%), Gaps = 1/99 (1%)
Query: 122 AISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
+ D++ D + + D DD+ +D+ D D
Sbjct: 107 EKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVD 166
Query: 182 DDDDDDDETDD-DNDEDNEEESEEEEENEEDDDEERKAE 219
D+D++ E + + D+++ +E+++E + A+
Sbjct: 167 DEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAK 205
Score = 32.7 bits (75), Expect = 1.3
Identities = 27/180 (15%), Positives = 55/180 (30%), Gaps = 12/180 (6%)
Query: 166 RKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
++ + D D +D + +++ + + KK K+
Sbjct: 39 KEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKD 98
Query: 226 EHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLN 285
E A++KN + K V D D L +
Sbjct: 99 ELDSSKKAEKKNALDK--------DDDLNYVKDIDVLNQADDDDDD----DDDDDLDDDD 146
Query: 286 SKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAASA 345
+++++E +D D+ DD ++ E E + D + D + A+K A
Sbjct: 147 IDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKL 206
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 53.5 bits (128), Expect = 8e-07
Identities = 43/222 (19%), Positives = 85/222 (38%), Gaps = 43/222 (19%)
Query: 127 DSDDENDEDDTSDDDTESGQGSMK-KYKEQVVSSQDGRKRRKVMFDDD------------ 173
+ +D ++ D + + + M K+ VVS + + D D
Sbjct: 3842 NEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSN 3901
Query: 174 -LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD---EERKAEKKNSKNEHVQ 229
L + + D+ +++D ET+ ++E + +E + ++EDD+ E++ ++K + E
Sbjct: 3902 SLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSD 3961
Query: 230 ELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEE 289
++ I E + + P P D DL L E+
Sbjct: 3962 DVG--------------IDDEIQPDIQENNSQPPPENEDLDLP---------EDLKLDEK 3998
Query: 290 EEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTG 331
E + +DS D D ++DE++ E KD +ED
Sbjct: 3999 EGDVSKDSDLED--MDMEAADENKE-EADAEKDEPMQDEDPL 4037
Score = 48.1 bits (114), Expect = 3e-05
Identities = 41/243 (16%), Positives = 93/243 (38%), Gaps = 48/243 (19%)
Query: 115 IHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDL 174
+ M+G DS +E+ S+++ + + E + +S + + ++D+
Sbjct: 3860 LESDMNGVT--KDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEK-----LWDEPN 3912
Query: 175 EDNMDDDD----------DDDDDDETDDDND--EDNEEESEEEEENEEDD---DEERKAE 219
E+++ + + ++ D +DDN ED + + +E+EE DD D+E + +
Sbjct: 3913 EEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPD 3972
Query: 220 KKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITS 279
+ + ++ E + + K + V + D D+
Sbjct: 3973 IQENNSQPPPENEDLD-----------LPEDLKLDEKEGDVSKDSDLEDMDMEAA----- 4016
Query: 280 ALAGLNSKEEEEEEQEDSAFSDESA--DDNSSDEDETVEPK---THKDSKNPEEDTGLNW 334
+ +EE + ++D DE ++N+ DED + D K E+ N
Sbjct: 4017 -----DENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENV 4071
Query: 335 KSD 337
+ +
Sbjct: 4072 QEN 4074
Score = 46.9 bits (111), Expect = 7e-05
Identities = 46/214 (21%), Positives = 80/214 (37%), Gaps = 28/214 (13%)
Query: 122 AISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
A+ D ++ DE++ SDD G + Q +SQ + + +DL+ + +
Sbjct: 3944 ALEDKDRQEKEDEEEMSDD---VGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEG 4000
Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAF 241
D D D D D + +E + E + E +E +E E+ N+ +E +Q
Sbjct: 4001 DVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQ----------QD 4050
Query: 242 TFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEE--EEQEDSAF 299
F + + + K + +++ S G+ S EE E E ED A
Sbjct: 4051 DFSDLAEDDEKMNEDGFE---ENVQENEE--------STEDGVKSDEELEQGEVPEDQAI 4099
Query: 300 -SDESADDNSSDEDETV-EPKTHKDSKNPEEDTG 331
+ D S+ E T K E+ G
Sbjct: 4100 DNHPKMDAKSTFASAEADEENTDKGIVGENEELG 4133
Score = 45.4 bits (107), Expect = 2e-04
Identities = 44/185 (23%), Positives = 81/185 (43%), Gaps = 20/185 (10%)
Query: 126 DDSDDENDED-DTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDN--MDDDD 182
D+ + + +D D D D E+ + KE+ D K + +D LE+N +D+D
Sbjct: 3996 DEKEGDVSKDSDLEDMDMEAAD----ENKEE----ADAEKDEPMQDEDPLEENNTLDEDI 4047
Query: 183 DDDDD-DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAF 241
DD D +DD + + E +ENEE ++ K++++ + E V E A +PK
Sbjct: 4048 QQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGE-VPEDQAIDNHPKMD 4106
Query: 242 TFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSD 301
+ E +T +V + ++ +R T + E E+ QED++
Sbjct: 4107 AKSTFASAEADEENTDKGIVGENEELGEEDGVRGNGT-------ADGEFEQVQEDTSTPK 4159
Query: 302 ESADD 306
E+ +
Sbjct: 4160 EAMSE 4164
Score = 38.1 bits (88), Expect = 0.035
Identities = 53/261 (20%), Positives = 98/261 (37%), Gaps = 35/261 (13%)
Query: 88 EMEIFEFLNICQVHGESTEKSMMEIDGIHGAMSGAISSDDSDDEN-------------DE 134
++++ E L + + G+ ++ S +E + A +D DE DE
Sbjct: 3986 DLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDE 4045
Query: 135 DDTSDDDTESGQGSMKK----YKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDET 190
D DD ++ + K ++E V +++ + D++LE +D D+
Sbjct: 4046 DIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKS-DEELEQGEVPEDQAIDNHPK 4104
Query: 191 DDDNDEDNEEESEEEE-------ENEE--DDDEERKAEKKNSKNEHVQELTAKQKNPKA- 240
D E++EE ENEE ++D R + + E VQE T+ K +
Sbjct: 4105 MDAKSTFASAEADEENTDKGIVGENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSE 4164
Query: 241 --FTFQSVIKGERKFRSTK-----PTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEE 293
+QS+ R+++ + + D H L + E++
Sbjct: 4165 ADRQYQSLGDHLREWQQANRIHEWEDLTESQSQAFDDSEFMHVKEDEEEDLQALGNAEKD 4224
Query: 294 QEDSAFSDESADDNSSDEDET 314
Q S DESA+ N + T
Sbjct: 4225 QIKSIDRDESANQNPDSMNST 4245
Score = 34.6 bits (79), Expect = 0.39
Identities = 43/175 (24%), Positives = 70/175 (40%), Gaps = 34/175 (19%)
Query: 64 INSMIDISKVA-----DLVLLLI------DASFGFEMEIFEFLNICQVHGESTEKSMMEI 112
NS + +SKV D + + D S F F FL +C+ HG TE +
Sbjct: 2854 SNSEVCLSKVVNQLSEDTTSMNLTRMCIEDPSIPFM--YFVFLFLCKQHGRDTELAGAFE 2911
Query: 113 DGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDD 172
DG G G + + S+ +ED+ D +E+V D K + +DD
Sbjct: 2912 DGT-GMKYGTGNINVSEQIEEEDEKDD---------EYGEQEKVSDKNDYLKD-FIDYDD 2960
Query: 173 D----------LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERK 217
D E + D +D ++++D++ E+ E E+N E E+ K
Sbjct: 2961 DGSISTESEPEEEKRECESGFDGWNDYEEEEDDDEKNEKECELEQNAEILIEDEK 3015
Score = 34.6 bits (79), Expect = 0.50
Identities = 36/207 (17%), Positives = 82/207 (39%), Gaps = 30/207 (14%)
Query: 102 GESTEKSMMEIDGI--HGAMSGAISSDDSDDENDEDDTSDDDTESGQ--GSMKKYKE--Q 155
GE+ E + E DG+ +G G D ++ S+ D + ++++++ +
Sbjct: 4127 GENEE--LGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSEADRQYQSLGDHLREWQQANR 4184
Query: 156 VVSSQDGRKRRKVMFDDDLEDNMDDDDDDD---------------DDDET-DDDNDEDNE 199
+ +D + + FDD ++ +D+++D D DE+ + + D N
Sbjct: 4185 IHEWEDLTESQSQAFDDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRDESANQNPDSMNS 4244
Query: 200 EESEEEEENEEDDDEERKAE-----KKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFR 254
E+E +E D + + + K+ ++ E + ++ K F E +
Sbjct: 4245 TNIAEDEADEVGDKQLQDGQDISDIKQTGEDTLPTEFGSINQSEKVFELSEDEDIEDELP 4304
Query: 255 STKPTVVPQ-PRMSDKDLAIRHKITSA 280
+ P P +DL +H+ ++
Sbjct: 4305 DYNVKITPAMPIDEARDLWNKHEDSTK 4331
>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
Reviewed.
Length = 129
Score = 48.5 bits (116), Expect = 1e-06
Identities = 18/52 (34%), Positives = 34/52 (65%)
Query: 154 EQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEE 205
E+++ ++ ++K F D +D +DDD DDD + +D ++ED+E+E +EE
Sbjct: 78 EEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129
Score = 43.8 bits (104), Expect = 5e-05
Identities = 15/43 (34%), Positives = 27/43 (62%)
Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
FD + MD DDD DDD DD+ ++ + + E++E+ E+++
Sbjct: 87 FDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129
Score = 42.7 bits (101), Expect = 1e-04
Identities = 15/45 (33%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNE-EESEEEEENEEDDDEE 215
++ D D DDD DDD D++ +E + +EE++ED+++E
Sbjct: 84 EEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDE 128
Score = 35.0 bits (81), Expect = 0.064
Identities = 19/95 (20%), Positives = 32/95 (33%), Gaps = 34/95 (35%)
Query: 107 KSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRR 166
+S +D I + D + D
Sbjct: 68 RSWYPVDEIDEEIIPLEEKFDKKKKKFMDG------------------------------ 97
Query: 167 KVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEE 201
DDD+ D+ DDD D+++ D+++DED E+E
Sbjct: 98 ----DDDIIDDDILPDDDFDEEDLDEEDDEDEEDE 128
Score = 33.1 bits (76), Expect = 0.27
Identities = 11/47 (23%), Positives = 26/47 (55%), Gaps = 3/47 (6%)
Query: 172 DDLEDNMDDDDDDDDDD---ETDDDNDEDNEEESEEEEENEEDDDEE 215
D++++ + ++ D D D+D +++ +++ +EED DEE
Sbjct: 74 DEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEE 120
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 52.3 bits (126), Expect = 1e-06
Identities = 20/81 (24%), Positives = 46/81 (56%), Gaps = 11/81 (13%)
Query: 167 KVMFDDDLEDNMDD-----------DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
+ D D+E+ + D ++DDDDD+E+D++++E ++ E +E++++++DDD+
Sbjct: 99 VIFDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDI 158
Query: 216 RKAEKKNSKNEHVQELTAKQK 236
E+ + +E K+
Sbjct: 159 ATRERSLERRRRRREWEEKRA 179
Score = 46.1 bits (110), Expect = 9e-05
Identities = 13/69 (18%), Positives = 36/69 (52%)
Query: 165 RRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
R + D ED+ DD++ D++D+E+ D++++++ +++++ + + ++
Sbjct: 115 RYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREW 174
Query: 225 NEHVQELTA 233
E EL
Sbjct: 175 EEKRAELEF 183
Score = 31.1 bits (71), Expect = 4.3
Identities = 36/166 (21%), Positives = 55/166 (33%), Gaps = 50/166 (30%)
Query: 59 ECNNDINSMIDISKVADLVLLLIDASFGFEMEIFEF-----LNICQVHGES--------- 104
E + DI S+I I+ +V L ++ ++ N+ V G
Sbjct: 47 ELDEDIKSVILIN-CGGMVDLEEFLQLDEDVIVYVIDSHRPWNLDNVFGSDQVVIFDDGD 105
Query: 105 TEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRK 164
E+ + + A DD D+E+DE+D +E
Sbjct: 106 IEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSE--------------------- 144
Query: 165 RRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEE 210
DD+DDDDDDD+ D E + E E EE
Sbjct: 145 --------------DDEDDDDDDDDDDIATRERSLERRRRRREWEE 176
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 50.8 bits (122), Expect = 2e-06
Identities = 21/70 (30%), Positives = 38/70 (54%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
DDD E+ + ++D+D DDE + + E ++E + E+EE+ DE K K++S E +E
Sbjct: 97 DDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEE 156
Query: 231 LTAKQKNPKA 240
+ +
Sbjct: 157 DEEEAAEEEE 166
Score = 46.6 bits (111), Expect = 5e-05
Identities = 25/98 (25%), Positives = 43/98 (43%), Gaps = 21/98 (21%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
DD ++E + ++ D D E + + D E D +D
Sbjct: 96 DDDDEEEEWEVEEDEDSDDEGEWIDV---------------------ESDKEIESSDSED 134
Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
+++ DE ED++EE EE+E E ++EE +AEK+
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172
Score = 43.5 bits (103), Expect = 4e-04
Identities = 19/102 (18%), Positives = 41/102 (40%), Gaps = 13/102 (12%)
Query: 129 DDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDD 188
E + + + +E V + D D E D + D + +
Sbjct: 78 WKEEERKKKEAEQGLESDDDDDEEEEWEVEEDE---------DSDDEGEWIDVESDKEIE 128
Query: 189 ETDDDNDEDNEE----ESEEEEENEEDDDEERKAEKKNSKNE 226
+D +++E+ +E E+ +E ++DEE AE++ ++ E
Sbjct: 129 SSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAE 170
Score = 43.5 bits (103), Expect = 5e-04
Identities = 18/72 (25%), Positives = 40/72 (55%), Gaps = 6/72 (8%)
Query: 173 DLEDNMDDDDD----DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHV 228
++E++ D DD+ D + D+ + +D ++EEE +E + ++D +E +E+ +
Sbjct: 105 EVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAA-- 162
Query: 229 QELTAKQKNPKA 240
+E A+ + KA
Sbjct: 163 EEEEAEAEKEKA 174
Score = 36.9 bits (86), Expect = 0.050
Identities = 14/42 (33%), Positives = 26/42 (61%)
Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
LE +++ + ++ + +D+D+EEE E EE+E+ DDE
Sbjct: 75 LEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEG 116
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 51.9 bits (124), Expect = 2e-06
Identities = 49/277 (17%), Positives = 100/277 (36%), Gaps = 30/277 (10%)
Query: 123 ISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDD 182
S +D D +D D D E + E+ S +D M DD+ E ++ ++
Sbjct: 98 ESHEDGSDGSDMDSEDSADDEEEE-------EEDESLEDE------MIDDEDEADLFNES 144
Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFT 242
+ +D +DD+ ++D E++ EEEE EE + E+ +K V + K F
Sbjct: 145 ESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNEFL 204
Query: 243 FQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKI-TSALAGLNSKEEEEEEQEDSAFSD 301
+ + E +D S E+EE+++E + +
Sbjct: 205 EATEAEEEAALGD---------EDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYE 255
Query: 302 ESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAASAFLERQANIVNLAKYVY 361
+ D D+ + + P+++ + ++D + + Q + + +
Sbjct: 256 DFFDPKEKDKKKDAGDDAELEDDEPDKEA-VKKEADSKPEEEDEEDDEQEDDQDEEEPPE 314
Query: 362 GDMEDVSVTMEG------DEAPNKKVHRKRQAELTAK 392
M+ V + + KRQA+L +
Sbjct: 315 AAMDKVKLDEPVLEGVDLESPKELSSFEKRQAKLKQQ 351
Score = 45.4 bits (107), Expect = 2e-04
Identities = 31/148 (20%), Positives = 55/148 (37%), Gaps = 15/148 (10%)
Query: 103 ESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDG 162
E+TE G D D ++DED S +D E E ++
Sbjct: 205 EATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSGED-EEDDEEGNIEYEDFFDPKEK 263
Query: 163 RKRRKVMFDDDLEDNMDD--------------DDDDDDDDETDDDNDEDNEEESEEEEEN 208
K++ D +LED+ D +D++DD+ E D D +E E ++ + +
Sbjct: 264 DKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLD 323
Query: 209 EEDDDEERKAEKKNSKNEHVQELTAKQK 236
E + K + ++ KQ+
Sbjct: 324 EPVLEGVDLESPKELSSFEKRQAKLKQQ 351
Score = 40.7 bits (95), Expect = 0.005
Identities = 42/258 (16%), Positives = 86/258 (33%), Gaps = 44/258 (17%)
Query: 103 ESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDG 162
ES E M++ + + S +D +D++ D++ + + + KE V +
Sbjct: 125 ESLEDEMIDDEDEADLFN--ESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATRE 182
Query: 163 RKRRKVMFDD---DLEDNMDDD------------DDDDDDDETDDDNDEDNEEESEEEEE 207
+K K DD L D M++ D+DD ++ D+ ED +++ +
Sbjct: 183 KKFDKSGVDDKFFKL-DEMNEFLEATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSG 241
Query: 208 NEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFT-FQSVIKGERKFRSTKPTVVPQPRM 266
+E+DDEE E ++ + K K A + K K
Sbjct: 242 EDEEDDEEGNIEYED-----FFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEE 296
Query: 267 SDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNP 326
EE++E+++D + + + +
Sbjct: 297 D--------------------EEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPK 336
Query: 327 EEDTGLNWKSDLAQKAAS 344
E + ++ L Q+
Sbjct: 337 ELSSFEKRQAKLKQQIEQ 354
Score = 40.0 bits (93), Expect = 0.009
Identities = 41/192 (21%), Positives = 73/192 (38%), Gaps = 17/192 (8%)
Query: 146 QGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEE 205
+KY E +++ + R K+ ++ +D D D D D +DE+ EEE E
Sbjct: 72 DDLNQKYVEFLINKEHIRVLAKL----QDSESHEDGSDGSDMDSEDSADDEEEEEEDESL 127
Query: 206 EENE-EDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQP 264
E+ +D+DE + S E + + + K + K +V
Sbjct: 128 EDEMIDDEDEADLFNESESSLEDLSDDETEDDEEKKMEEEEA-------GEEKESVEQAT 180
Query: 265 RMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDES-----ADDNSSDEDETVEPKT 319
R D + L +N E E +E++A DE D+S D + + +
Sbjct: 181 REKKFDKSGVDDKFFKLDEMNEFLEATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGS 240
Query: 320 HKDSKNPEEDTG 331
+D ++ EE
Sbjct: 241 GEDEEDDEEGNI 252
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 51.8 bits (124), Expect = 2e-06
Identities = 20/78 (25%), Positives = 46/78 (58%), Gaps = 2/78 (2%)
Query: 156 VVSSQDGRKRRKVMFD--DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
V+ + + + R + +F+ ++ ED++++ DDD+D+ E +D++ ++E + E+ EE+E++DD
Sbjct: 631 VLKADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDD 690
Query: 214 EERKAEKKNSKNEHVQEL 231
E + L
Sbjct: 691 AEANEGVVPIDKAVRRAL 708
Score = 40.6 bits (95), Expect = 0.005
Identities = 15/61 (24%), Positives = 29/61 (47%), Gaps = 5/61 (8%)
Query: 170 FDDDLEDNMDDDDD-----DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
+DDLE+ DD+D+ D + + D D + +E+ ++ E NE ++ + K
Sbjct: 651 DEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPK 710
Query: 225 N 225
Sbjct: 711 V 711
Score = 35.6 bits (82), Expect = 0.16
Identities = 23/111 (20%), Positives = 47/111 (42%), Gaps = 9/111 (8%)
Query: 124 SSDDSDDE---NDEDDTSDDDTESGQGSMKKYKEQ---VVSSQDGRKRRKVMFDDDLEDN 177
+ DD D+ D + S+ D E G+ ++ + V D RR + +L D
Sbjct: 658 TDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDA 717
Query: 178 MDDDDDDDDDDETDDDNDEDNEEESE---EEEENEEDDDEERKAEKKNSKN 225
+D D +D++ D+ + ++ E++E + E +K + +N
Sbjct: 718 LDGGDSEDEEGMDDEQMMRLDTYLAQIFKEKKERIQAGGETKKEAQSQKQN 768
Score = 34.8 bits (80), Expect = 0.28
Identities = 13/66 (19%), Positives = 29/66 (43%)
Query: 150 KKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
K +Q+ ++ + DDD ++ +D + + + +D +ED +E+ E E
Sbjct: 638 KSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGV 697
Query: 210 EDDDEE 215
D+
Sbjct: 698 VPIDKA 703
Score = 32.9 bits (75), Expect = 1.2
Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 13/79 (16%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEE----EEENEED--DDEERKAEKKNSKNEHVQEL-- 231
+++D+DD +ETDDD E+E E E E+E D D EE + E NE V +
Sbjct: 648 EEEDEDDLEETDDD-----EDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDK 702
Query: 232 TAKQKNPKAFTFQSVIKGE 250
++ PK + G
Sbjct: 703 AVRRALPKVLNLPDALDGG 721
Score = 30.2 bits (68), Expect = 7.2
Identities = 16/66 (24%), Positives = 29/66 (43%), Gaps = 4/66 (6%)
Query: 120 SGAISSDDS--DDENDEDDTSDDDTESGQGSMKKYKE--QVVSSQDGRKRRKVMFDDDLE 175
+ S +S D E+ E+D +DD E+ +G + K + + D + E
Sbjct: 667 AIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDE 726
Query: 176 DNMDDD 181
+ MDD+
Sbjct: 727 EGMDDE 732
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 49.7 bits (119), Expect = 3e-06
Identities = 30/95 (31%), Positives = 52/95 (54%), Gaps = 4/95 (4%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE-EDDDEERKAEKKNSKNEHVQ 229
+DD E +++++++++ D D D+ ED+E ES++EEE E E EER +KK K + +
Sbjct: 38 EDDEEFEIEEEEEEEEVDS-DFDDSEDDEPESDDEEEGEKELQREERLKKKKRVKTKAYK 96
Query: 230 ELTAKQKNPKAFTFQSVI--KGERKFRSTKPTVVP 262
E T K+K +S K +S + + P
Sbjct: 97 EPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAP 131
Score = 30.0 bits (68), Expect = 5.9
Identities = 20/115 (17%), Positives = 44/115 (38%), Gaps = 2/115 (1%)
Query: 123 ISSDDSDDENDED-DTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
I ++ ++E D D D S+DD + KE + + +K+++V E
Sbjct: 45 IEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKE-LQREERLKKKKRVKTKAYKEPTKKKK 103
Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK 236
D ++ +++SE D R+ ++S ++ + + K
Sbjct: 104 KKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKSSRSSTVQNKEATHERLK 158
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 49.6 bits (118), Expect = 4e-06
Identities = 30/111 (27%), Positives = 48/111 (43%), Gaps = 9/111 (8%)
Query: 127 DSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDD 186
D D+E +E D E ++ E+V S G R++ +++ E DD +D
Sbjct: 173 DVDNEEEERLEESDGRE------EEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVV 226
Query: 187 DDETDDDNDEDNEEESEEEE---ENEEDDDEERKAEKKNSKNEHVQELTAK 234
D E + + + EEE EEE E + +EE +E+ NE K
Sbjct: 227 DYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVPRNNEESPAKKQK 277
Score = 40.4 bits (94), Expect = 0.004
Identities = 24/102 (23%), Positives = 44/102 (43%), Gaps = 11/102 (10%)
Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDD 184
SD ++E DE+ SD E + ++ +E+ S DG +D ++ + D
Sbjct: 185 SDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDG--------EDVVDYEGERIDKK 236
Query: 185 DDDDETDD---DNDEDNEEESEEEEENEEDDDEERKAEKKNS 223
++E + N + E E E E ++EE A+K+
Sbjct: 237 QGEEEEMEEEVINLFEIEWEEESPSEEVPRNNEESPAKKQKV 278
Score = 36.5 bits (84), Expect = 0.061
Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 6/60 (10%)
Query: 177 NMDDDDDDDDDD------ETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
N+ D+D+D+D+ + D++ +E EE EEE +E+ + E NE +E
Sbjct: 155 NIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEE 214
Score = 33.4 bits (76), Expect = 0.58
Identities = 17/57 (29%), Positives = 27/57 (47%), Gaps = 5/57 (8%)
Query: 175 EDNMDDDDD-----DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
DN +D+D+ D D+E ++ +E + E EE+EE D E E + E
Sbjct: 158 WDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEE 214
Score = 30.3 bits (68), Expect = 6.2
Identities = 14/68 (20%), Positives = 31/68 (45%), Gaps = 2/68 (2%)
Query: 172 DDLEDNMDD--DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQ 229
D E+ D+ D + + + +E+ E E ++ E+ D + ER +K+ + E +
Sbjct: 186 DGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEE 245
Query: 230 ELTAKQKN 237
E+ +
Sbjct: 246 EVINLFEI 253
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 50.4 bits (120), Expect = 5e-06
Identities = 26/124 (20%), Positives = 52/124 (41%), Gaps = 1/124 (0%)
Query: 102 GESTEKSMMEIDG-IHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQ 160
E E +G I G + D+ + E + + E + + + +
Sbjct: 769 ETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEV 828
Query: 161 DGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
+ + ++ + D+ D +D + E+ EEE EEEEE EE+++EE + E+
Sbjct: 829 KDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEE 888
Query: 221 KNSK 224
+N +
Sbjct: 889 ENEE 892
Score = 47.3 bits (112), Expect = 5e-05
Identities = 31/140 (22%), Positives = 58/140 (41%), Gaps = 15/140 (10%)
Query: 102 GESTEKSMMEIDGIHGAMSGAISSDDSDD---ENDED-DTSDDDTESGQGSMKKYKEQVV 157
E + H + A +D D+ + ED + D+ G+ + E
Sbjct: 752 KHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGE 811
Query: 158 SSQDGRKRRKVMFDDDLED-------NMDDDDDDDDDDETDDDNDE----DNEEESEEEE 206
+ + D +++D N ++ + D++ D D+EEE EEEE
Sbjct: 812 KDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEE 871
Query: 207 ENEEDDDEERKAEKKNSKNE 226
E EE+++EE + E++ +NE
Sbjct: 872 EEEEEEEEEEEEEEEEEENE 891
Score = 43.4 bits (102), Expect = 9e-04
Identities = 23/99 (23%), Positives = 46/99 (46%), Gaps = 3/99 (3%)
Query: 131 ENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLE-DNMDDDDDDDD--D 187
+ + + + E+ G +++ Q + D + + + +L +N + D+ D
Sbjct: 794 DEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVD 853
Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
D + EEE EEEEE EE+++EE + E++ E
Sbjct: 854 GGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 39.2 bits (91), Expect = 0.014
Identities = 45/253 (17%), Positives = 95/253 (37%), Gaps = 27/253 (10%)
Query: 121 GAISSDDSDDENDEDDTSDDDTESG-QGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMD 179
G S +++ E + + ++++E K +E + K + + ++
Sbjct: 660 GEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEG 719
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEE-ENEEDDDEERKAEKKNSKNEHVQELTAKQKNP 238
+ + + +DE + + E+ EE +E E E E + E + ++K +++E E K+
Sbjct: 720 ETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDED 779
Query: 239 KAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSA-LAGLNSKEEEEEEQEDS 297
+ Q+ GE K V ++ H+ S A ++E EQE +
Sbjct: 780 EGEI-QAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELN 838
Query: 298 A-----------------------FSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNW 334
A +E ++ +E+E E + ++ + EE L W
Sbjct: 839 AENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEW 898
Query: 335 KSDLAQKAASAFL 347
++A FL
Sbjct: 899 PETRQKQAIYLFL 911
Score = 38.8 bits (90), Expect = 0.021
Identities = 22/121 (18%), Positives = 53/121 (43%), Gaps = 5/121 (4%)
Query: 99 QVHGESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVS 158
++ G+ + +E +G A D + E D + + G + E
Sbjct: 790 EMKGDEGAEGKVEHEGETEA-----GEKDEHEGQSETQADDTEVKDETGEQELNAENQGE 844
Query: 159 SQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
++ K D D+ ++++++++++E +++ +E+ EEE E EE + E R+
Sbjct: 845 AKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQK 904
Query: 219 E 219
+
Sbjct: 905 Q 905
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 50.4 bits (120), Expect = 6e-06
Identities = 30/106 (28%), Positives = 50/106 (47%), Gaps = 14/106 (13%)
Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDD---- 180
SDDS+ + EDD SD ++ K + +D + D E+ +D+
Sbjct: 705 SDDSELDFAEDDFSDSTSDDEP------KLDAIDDEDAKSEGSQESDQ--EEGLDEIFYS 756
Query: 181 -DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
D + D+ D + ++ED E E+EEE E + ++A+KK KN
Sbjct: 757 FDGEQDNSDSFAESSEEDESSEEEKEEE-ENKEVSAKRAKKKQRKN 801
Score = 38.9 bits (90), Expect = 0.017
Identities = 20/74 (27%), Positives = 36/74 (48%), Gaps = 13/74 (17%)
Query: 161 DGRKRRKVMFDDDLEDNMDDDD-------------DDDDDDETDDDNDEDNEEESEEEEE 207
DG+ ++ D +D MD+++ DD DD E D D+ ++ S++E +
Sbjct: 668 DGKGKKSNKASFDSDDEMDENEIWSALVKSRPDVEDDSDDSELDFAEDDFSDSTSDDEPK 727
Query: 208 NEEDDDEERKAEKK 221
+ DDE+ K+E
Sbjct: 728 LDAIDDEDAKSEGS 741
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 49.3 bits (118), Expect = 1e-05
Identities = 36/237 (15%), Positives = 86/237 (36%), Gaps = 44/237 (18%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSS-----------QDGRKRRKVMFDD 172
S++ D+++DEDD ++ + + K KE Q R+K D
Sbjct: 333 DSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDA 392
Query: 173 DLEDNMDDDDDDDDDDETDDDNDEDNEE------------ESEEEEENEEDDDEERKAEK 220
++E+ + + +++ DE +++ E+E ++ +E+ +E ++ ++
Sbjct: 393 EIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKE 452
Query: 221 KNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPT------VVPQPR-MSDKDLAI 273
+ + E E AK + + K +++ + S K
Sbjct: 453 SDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDS 512
Query: 274 RHKITSALAGLN--------------SKEEEEEEQEDSAFSDESADDNSSDEDETVE 316
+ K +S L KE+ + +D ++S + DE++ +
Sbjct: 513 KKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDD 569
Score = 46.2 bits (110), Expect = 1e-04
Identities = 38/269 (14%), Positives = 82/269 (30%), Gaps = 35/269 (13%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
S+++ DED S+++ E D M L + +DD
Sbjct: 320 GKSVSEEDEDEDSDSEEEDEDDDED------------DDDGENPWMLRKKLGKLKEGEDD 367
Query: 184 DDDDDET------DDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE--LTAKQ 235
+++ + + E ++E EE E + EE E++N +
Sbjct: 368 EENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGP 427
Query: 236 KNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEE-EEEQ 294
+N + +K E K + + + + K+ L + E+ ++E+
Sbjct: 428 ENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEE--EAKVEKVANKLLKRSEKAQKEE 485
Query: 295 EDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKA------------ 342
E+ +E+ ++ K K+ + K A
Sbjct: 486 EEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSID 545
Query: 343 ASAFLERQANIVNLAKYVYGDMEDVSVTM 371
L + + + L D +D +
Sbjct: 546 LDDDLIDEEDSIKLDVDDEEDEDDEELPF 574
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 48.8 bits (116), Expect = 1e-05
Identities = 28/96 (29%), Positives = 47/96 (48%), Gaps = 8/96 (8%)
Query: 135 DDTSDDDTESGQGSMKKY--------KEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDD 186
+ DDD G K+ KE + + M ++D ++ DDDDDD+D
Sbjct: 8 SELEDDDWVRGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDED 67
Query: 187 DDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKN 222
+D+ DDD+DED+E+E +++ +D + E N
Sbjct: 68 EDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDN 103
Score = 45.0 bits (106), Expect = 2e-04
Identities = 19/52 (36%), Positives = 37/52 (71%)
Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
ED+ ++DDDDDDD++ DDD+D+D+E++ +E++++ D+ + + NE
Sbjct: 53 EDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNE 104
Score = 43.1 bits (101), Expect = 7e-04
Identities = 27/89 (30%), Positives = 41/89 (46%), Gaps = 27/89 (30%)
Query: 126 DDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDD 185
++ DDE D+DD D+D + DDD +D+ DD+D+DD
Sbjct: 52 EEDDDEEDDDDDDDEDED---------------------------DDDDDDDEDDEDEDD 84
Query: 186 DDDETDDDNDEDNEEESEEEEENEEDDDE 214
DD DD+ D+ E++ E + DDE
Sbjct: 85 DDSTLHDDSSADDGNETDNEAGFADSDDE 113
Score = 41.9 bits (98), Expect = 0.002
Identities = 17/49 (34%), Positives = 36/49 (73%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
D++ +D+ DD+D+DDDDD+ D+D++++++++S +++ DD E E
Sbjct: 56 DEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNE 104
Score = 40.0 bits (93), Expect = 0.008
Identities = 19/49 (38%), Positives = 32/49 (65%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
DDD +++ DDDDDDDD+D+ D+D+D+ + ++ E D+E A+
Sbjct: 61 DDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNEAGFAD 109
Score = 39.2 bits (91), Expect = 0.011
Identities = 26/97 (26%), Positives = 41/97 (42%), Gaps = 29/97 (29%)
Query: 118 AMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDN 177
AM ++ DD++D++D DDD + D+D ED
Sbjct: 49 AMEEEDDDEEDDDDDDDEDEDDDDDDD--------------------------DEDDEDE 82
Query: 178 MDDDDDDDDDDETDDDNDEDNE---EESEEEEENEED 211
DDD DD DD N+ DNE +S++E ++ +
Sbjct: 83 DDDDSTLHDDSSADDGNETDNEAGFADSDDESDDGSE 119
Score = 32.3 bits (73), Expect = 1.7
Identities = 13/40 (32%), Positives = 29/40 (72%)
Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEH 227
+E DDD ++D++++ E+E+++++DDDE+ + E + H
Sbjct: 51 EEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLH 90
Score = 31.9 bits (72), Expect = 2.5
Identities = 18/69 (26%), Positives = 31/69 (44%), Gaps = 8/69 (11%)
Query: 269 KDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEE 328
K+ AIR L + EEE +E+ ++ DD+ DED+ + D ++
Sbjct: 34 KENAIRK--------LGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDD 85
Query: 329 DTGLNWKSD 337
D+ L+ S
Sbjct: 86 DSTLHDDSS 94
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 48.1 bits (115), Expect = 1e-05
Identities = 25/88 (28%), Positives = 48/88 (54%), Gaps = 8/88 (9%)
Query: 137 TSDDDTESG----QGSMKKYKEQVVSSQDGRKRR---KVMFDDDLEDNMDDDDDDDDDDE 189
T+ D + G +++ KE + K + +V+ D+ ED + + ++++E
Sbjct: 233 TTCSDKDKGMEIIGAALEAIKEVIKKKGGDFKVKGEPEVVGGDE-EDLEELLEKAEEEEE 291
Query: 190 TDDDNDEDNEEESEEEEENEEDDDEERK 217
DD ++ ++E+E +E+EE EEDDDE K
Sbjct: 292 EDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 38.1 bits (89), Expect = 0.024
Identities = 10/46 (21%), Positives = 28/46 (60%)
Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
D++D ++ + +E+ E++ E E+ +E+D++E + E + ++
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 46.4 bits (110), Expect = 1e-05
Identities = 17/53 (32%), Positives = 34/53 (64%)
Query: 173 DLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
D + D++++++D++E ++ D ++EEE E+EE EE+D+E+ K K
Sbjct: 45 DEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKK 97
Score = 37.5 bits (87), Expect = 0.013
Identities = 19/65 (29%), Positives = 34/65 (52%), Gaps = 4/65 (6%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEE----EENEEDDDEERKAEKKNSKNEHVQELTAK 234
D +D ++++ D+ ED+EEE EE+ EE E+ +DEE E + + E ++
Sbjct: 32 DKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDL 91
Query: 235 QKNPK 239
+ K
Sbjct: 92 KDIEK 96
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 47.1 bits (112), Expect = 2e-05
Identities = 35/136 (25%), Positives = 53/136 (38%), Gaps = 18/136 (13%)
Query: 97 ICQVHGESTEKSMMEIDGIHGAMSGAISSDDS-------DDENDEDD--------TSDDD 141
++G S K + +S I DS + E+D D D
Sbjct: 150 DGVMNGVSLSKIEWLNEDFPSELSPFIERIDSVNGPGEPEPEDDPKDSLGNGSSTNGLPD 209
Query: 142 TESGQGSM-KKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDET--DDDNDEDN 198
+ + ++Y E+ S + D DD++DDD D + + + ED
Sbjct: 210 SSQDKNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDE 269
Query: 199 EEESEEEEENEEDDDE 214
EEE EEEE EED+DE
Sbjct: 270 EEEETEEEEEEEDEDE 285
Score = 32.1 bits (73), Expect = 1.6
Identities = 26/165 (15%), Positives = 52/165 (31%), Gaps = 22/165 (13%)
Query: 156 VVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEE----D 211
VVSS D R D + +++D + + + N++ E S E +
Sbjct: 130 VVSSWDPGGERS---DSENGNSLDGVMNGVSLSKIEWLNEDFPSELSPFIERIDSVNGPG 186
Query: 212 DDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDL 271
+ E K + N K + + + E +
Sbjct: 187 EPEPEDDPKDSLGNGSSTNGLPDSSQDKNKSLEEYYEKE-------------SSDAAASQ 233
Query: 272 AIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVE 316
K + ++EE++ +D + +E +E+ET E
Sbjct: 234 DDGPKGSDVKN--KKSDDEEDDDQDGDYVEEKELKEDEEEEETEE 276
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 47.9 bits (114), Expect = 3e-05
Identities = 27/208 (12%), Positives = 65/208 (31%), Gaps = 35/208 (16%)
Query: 40 PNNEGIHRQEFEIEPITFIEC----NNDINSMIDISKVADLVLLLIDASFGFEMEIFEFL 95
I + I F+ + + + S G + +
Sbjct: 4 NALLAIWCILSYLYEIPFVGSLRLGLRSNYLQSTVEDIEGPSAVDERTS-GVLRDEGKHA 62
Query: 96 NICQVHGESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQ 155
NI + ++ S DD DD+ + DD+ ++G +
Sbjct: 63 NI------LYNSILCNQKKHASFLNQRKSLDDDDDDEFDFLYEDDEDDAGNAT------- 109
Query: 156 VVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
E + DDD + D +D + + N +++ + ++++ + +
Sbjct: 110 -----------------SGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQT 152
Query: 216 RKAEKKNSKNEHVQELTAKQKNPKAFTF 243
+ K++ ++L ++K TF
Sbjct: 153 DQGLKQSVNLSSAEKLIEEKKGQTENTF 180
Score = 46.0 bits (109), Expect = 1e-04
Identities = 36/169 (21%), Positives = 62/169 (36%), Gaps = 17/169 (10%)
Query: 176 DNMDDDDDDD--DDDETDDDNDEDNEEESEEEEENEEDDDEERKAE--KKNSKNEHVQEL 231
D+ DDD+ D +DDE D N E ++++ E D +E + N+ Q+
Sbjct: 87 DDDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDD 146
Query: 232 ------TAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLN 285
K + + E+K + T+ T +D + A +A G
Sbjct: 147 SSNSQTDQGLKQSVNLSSAEKLIEEKKGQ-TENTFKFYNFGNDGEEA------AAKDGGK 199
Query: 286 SKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNW 334
SK + DS + D S++ED+ +K GL +
Sbjct: 200 SKSSDPGPLNDSDGQGDDGDPESAEEDKAASNTRAAYTKATSVFPGLYF 248
Score = 32.9 bits (75), Expect = 1.4
Identities = 21/125 (16%), Positives = 37/125 (29%), Gaps = 34/125 (27%)
Query: 118 AMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVS-SQDGRKRRKVMFDDDLED 176
A SG S+DD D D DT++ + + S SQ + ++ + E
Sbjct: 108 ATSGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEK 167
Query: 177 NMDDDDDD---------------------------------DDDDETDDDNDEDNEEESE 203
+++ +D D DD D ++ EE +
Sbjct: 168 LIEEKKGQTENTFKFYNFGNDGEEAAAKDGGKSKSSDPGPLNDSDGQGDDGDPESAEEDK 227
Query: 204 EEEEN 208
Sbjct: 228 AASNT 232
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 46.3 bits (110), Expect = 3e-05
Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 6/84 (7%)
Query: 161 DGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEE---EEEN---EEDDDE 214
D KR+++ D D D D D D DD +D+D+E+E+ E E E +++
Sbjct: 103 DEIKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEK 162
Query: 215 ERKAEKKNSKNEHVQELTAKQKNP 238
ER+ E+K ++ E +E NP
Sbjct: 163 EREEEEKAAEEEKAREEEILTGNP 186
Score = 38.9 bits (91), Expect = 0.008
Identities = 20/103 (19%), Positives = 40/103 (38%), Gaps = 15/103 (14%)
Query: 150 KKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
K E S + + DDD ED + ++D +D D+ + + + +++++
Sbjct: 77 KLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSDSSSDSDSSDDDSD 136
Query: 210 EDDDE---------------ERKAEKKNSKNEHVQELTAKQKN 237
+DD E ER EK+ + E E ++
Sbjct: 137 DDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREE 179
Score = 33.2 bits (76), Expect = 0.67
Identities = 23/100 (23%), Positives = 34/100 (34%), Gaps = 44/100 (44%)
Query: 127 DSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDD 186
D+DD + D+ D DD DDDD
Sbjct: 118 DADDSDSSSDSDSSD-------------------------------------DDSDDDDS 140
Query: 187 DDET-------DDDNDEDNEEESEEEEENEEDDDEERKAE 219
+DET + E EE+ EEEE ++++ R+ E
Sbjct: 141 EDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180
Score = 31.2 bits (71), Expect = 2.8
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 120 SGAISSDDSDDENDEDDTSDDDTE 143
S + S DS D++ +DD S+D+T
Sbjct: 122 SDSSSDSDSSDDDSDDDDSEDETA 145
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 45.9 bits (109), Expect = 4e-05
Identities = 20/79 (25%), Positives = 39/79 (49%), Gaps = 11/79 (13%)
Query: 148 SMKKYKEQVVSSQDGRKRRKVMFDDD-----------LEDNMDDDDDDDDDDETDDDNDE 196
M K+ + +RKV + LE + + + +D D+E + D +E
Sbjct: 119 VMGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEE 178
Query: 197 DNEEESEEEEENEEDDDEE 215
+ EEE E+E+ +++DDD++
Sbjct: 179 EEEEEEEDEDFDDDDDDDD 197
Score = 44.4 bits (105), Expect = 1e-04
Identities = 20/77 (25%), Positives = 35/77 (45%), Gaps = 7/77 (9%)
Query: 134 EDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETD-- 191
+ +D E KK KE D + +++ E+ +D DDDDDDD+ D
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDDYN 201
Query: 192 -----DDNDEDNEEESE 203
D+ ++D+ ++ E
Sbjct: 202 AENYFDNGEDDDYDDDE 218
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 45.1 bits (107), Expect = 4e-05
Identities = 18/51 (35%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEE-ESEEEEENEEDDDEERKAE 219
+D+ D+DD+ D++E D +DE +EE +SEEE+ +EE E
Sbjct: 11 EEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPE 61
Score = 41.6 bits (98), Expect = 6e-04
Identities = 19/97 (19%), Positives = 39/97 (40%), Gaps = 26/97 (26%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
S+ D+ +ED+ ++ E D+ ++ D D
Sbjct: 2 GSESESDDGEEDEELPEEDED--------------------------DESSDEEEVDLPD 35
Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
D+ D+E+D + ++ EEE + E + + +R+ EK
Sbjct: 36 DEQDEESDSEEEQIFVTRQEEEVDPEAEAEFDREFEK 72
Score = 31.6 bits (72), Expect = 1.5
Identities = 10/42 (23%), Positives = 21/42 (50%), Gaps = 1/42 (2%)
Query: 288 EEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEED 329
+E +E+E D+ D+ S E+E + ++ +PE +
Sbjct: 23 DESSDEEEVDL-PDDEQDEESDSEEEQIFVTRQEEEVDPEAE 63
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair
of CXXC motifs, followed by a long region of low
complexity sequence in which roughly half the residues
are Asp and Glu, including multiple runs of five or more
acidic residues. The function of members of this family
is unknown.
Length = 104
Score = 43.5 bits (103), Expect = 4e-05
Identities = 19/44 (43%), Positives = 33/44 (75%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
+D+ + +DDDDDDDDDD+ D +D+D + + ++++ E+DDDE
Sbjct: 61 EDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDDE 104
Score = 42.3 bits (100), Expect = 1e-04
Identities = 19/45 (42%), Positives = 33/45 (73%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
++D +D + DDDDDDDDD+ D D+D+ + +++++ EDDD+E
Sbjct: 60 EEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDDE 104
Score = 41.5 bits (98), Expect = 2e-04
Identities = 14/57 (24%), Positives = 33/57 (57%), Gaps = 3/57 (5%)
Query: 162 GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEEN---EEDDDEE 215
+ R +D +D++D+DD DDD+D+D+++ + ++++ ++DDD+
Sbjct: 41 AKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDF 97
Score = 35.4 bits (82), Expect = 0.024
Identities = 13/40 (32%), Positives = 26/40 (65%)
Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
+D+ DDDDDD D + DD + +D++++ E++++E
Sbjct: 65 DVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDDE 104
Score = 33.8 bits (78), Expect = 0.089
Identities = 26/84 (30%), Positives = 37/84 (44%), Gaps = 27/84 (32%)
Query: 118 AMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDN 177
A + + D D+E+++D DDD + DDDL D
Sbjct: 48 ADAEDAAKKDEDEEDEDDVVLDDDDDDDD------------------------DDDLPD- 82
Query: 178 MDDDDDDDDDDETDDDNDEDNEEE 201
DDDD D DD+ DDD ED+++E
Sbjct: 83 -LDDDDVDLDDD-DDDFLEDDDDE 104
>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal. The family represents
the N-terminal region of RXT2-like proteins. In S.
cerevisiae, RXT2 has been demonstrated to be involved in
conjugation with cellular fusion (mating) and invasive
growth. A high throughput localisation study has
localised RXT2 to the nucleus.
Length = 141
Score = 43.5 bits (103), Expect = 7e-05
Identities = 17/47 (36%), Positives = 29/47 (61%)
Query: 152 YKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDN 198
Y+ V+S + + D+D D DDD+DD+DD+E D ++D++N
Sbjct: 29 YERDVLSRKRIKFNNPPRIDEDGGDIDDDDEDDEDDEEADAEDDDEN 75
Score = 35.1 bits (81), Expect = 0.059
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEE 205
D+D D DDD+ DD++DE+ + E ++E
Sbjct: 48 DEDGGDIDDDDEDDEDDEEADAEDDDE 74
Score = 33.9 bits (78), Expect = 0.18
Identities = 14/45 (31%), Positives = 27/45 (60%)
Query: 165 RRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
R V+ ++ N D+D D DDD D++++EE++ E+++E
Sbjct: 30 ERDVLSRKRIKFNNPPRIDEDGGDIDDDDEDDEDDEEADAEDDDE 74
Score = 33.5 bits (77), Expect = 0.21
Identities = 13/27 (48%), Positives = 20/27 (74%)
Query: 188 DETDDDNDEDNEEESEEEEENEEDDDE 214
DE D D+D+E++ ++EE + EDDDE
Sbjct: 48 DEDGGDIDDDDEDDEDDEEADAEDDDE 74
Score = 33.1 bits (76), Expect = 0.30
Identities = 11/28 (39%), Positives = 22/28 (78%)
Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEEN 208
D+D D D+ D+D+++D E ++E+++EN
Sbjct: 48 DEDGGDIDDDDEDDEDDEEADAEDDDEN 75
Score = 33.1 bits (76), Expect = 0.31
Identities = 9/28 (32%), Positives = 21/28 (75%)
Query: 184 DDDDDETDDDNDEDNEEESEEEEENEED 211
D+D + DDD+++D ++E + E+++E+
Sbjct: 48 DEDGGDIDDDDEDDEDDEEADAEDDDEN 75
Score = 31.2 bits (71), Expect = 1.3
Identities = 11/47 (23%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPK 239
D D + ++ +E++E++E+ D E ++ K ++E+ A +P
Sbjct: 48 DEDGGDIDDDDEDDEDDEEADAE-DDDENPYKLIRLEEILAPLTHPS 93
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 46.4 bits (110), Expect = 8e-05
Identities = 42/217 (19%), Positives = 86/217 (39%), Gaps = 13/217 (5%)
Query: 140 DDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNM--DDDDDDDDDDETDDDNDED 197
DDTE + ++ +E+ +S K +M + ++++ +E +++ +E+
Sbjct: 396 DDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEE 455
Query: 198 NEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTK 257
EEE E EEE ED++EE + E N E ++ + + + + E
Sbjct: 456 EEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGE--------EPEEDAERRN 507
Query: 258 PTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEP 317
+ RMS+ + + +S +E + E D+ E +D+ E+ +
Sbjct: 508 SEMAGISRMSEGQ---QPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSS 564
Query: 318 KTHKDSKNPEEDTGLNWKSDLAQKAASAFLERQANIV 354
T + +T ++ L S LE + V
Sbjct: 565 HTELEGVATPVETKISSSRKLPPPPVSTSLENDSATV 601
Score = 44.1 bits (104), Expect = 5e-04
Identities = 17/94 (18%), Positives = 47/94 (50%)
Query: 130 DENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDE 189
D+ +E++ QG+ + + +S + + + E+ +++++++++E
Sbjct: 396 DDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEE 455
Query: 190 TDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS 223
+++ + + EE +EEEE E + D + E + S
Sbjct: 456 EEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGS 489
Score = 43.0 bits (101), Expect = 0.001
Identities = 17/116 (14%), Positives = 51/116 (43%), Gaps = 18/116 (15%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
S + ++E+ E++ +++ E + + +E +D+ E+ + +
Sbjct: 436 SQESEEEESVEEEEEEEEEEEEEEQESEEEEG---------------EDEEEEE---EVE 477
Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPK 239
D+ E + + + + + EE EE+ E + E + S+ + + + + ++P+
Sbjct: 478 ADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQ 533
Score = 38.7 bits (90), Expect = 0.018
Identities = 19/122 (15%), Positives = 56/122 (45%), Gaps = 9/122 (7%)
Query: 131 ENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDET 190
E + TS ++ + S + ++SQ+ + V +++ E+ ++++ + +++E
Sbjct: 408 ERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEG 467
Query: 191 DDDNDEDNEEESEEEEENE--------EDDDEERKAEKKNSKNEHVQELTAKQKNPKAFT 242
+D+ +E+ E E++ E E D +E + ++ + ++ + P+ +
Sbjct: 468 EDE-EEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSS 526
Query: 243 FQ 244
Q
Sbjct: 527 VQ 528
Score = 30.3 bits (68), Expect = 8.4
Identities = 19/111 (17%), Positives = 46/111 (41%), Gaps = 11/111 (9%)
Query: 128 SDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDD 187
S++E ED+ +++ E+ GS +E++ S +G D E D + + +
Sbjct: 462 SEEEEGEDEEEEEEVEADNGS----EEEMEGSSEGDG-------DGEEPEEDAERRNSEM 510
Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNP 238
++ S + E+ +++ + ++ S E E +++P
Sbjct: 511 AGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESP 561
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 43.5 bits (103), Expect = 9e-05
Identities = 16/35 (45%), Positives = 26/35 (74%), Gaps = 1/35 (2%)
Query: 168 VMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEES 202
V ++D D+ ++D++++DD+E DDD DE EEES
Sbjct: 107 VASEEDESDDDEEDEEEEDDEE-DDDEDESEEEES 140
Score = 43.1 bits (102), Expect = 1e-04
Identities = 14/34 (41%), Positives = 25/34 (73%)
Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
++ DD++ED EEE +EE+++E++ +EE KK
Sbjct: 111 EDESDDDEEDEEEEDDEEDDDEDESEEEESPVKK 144
Score = 42.3 bits (100), Expect = 2e-04
Identities = 14/30 (46%), Positives = 26/30 (86%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEEN 208
+D+ DDD++DE ++D++ED++E+ EEEE+
Sbjct: 111 EDESDDDEEDEEEEDDEEDDDEDESEEEES 140
Score = 41.2 bits (97), Expect = 7e-04
Identities = 11/31 (35%), Positives = 25/31 (80%)
Query: 177 NMDDDDDDDDDDETDDDNDEDNEEESEEEEE 207
++D+ DDD+++ ++++DE++++E E EEE
Sbjct: 108 ASEEDESDDDEEDEEEEDDEEDDDEDESEEE 138
Score = 40.8 bits (96), Expect = 7e-04
Identities = 16/33 (48%), Positives = 27/33 (81%), Gaps = 2/33 (6%)
Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEE 206
E++ DDD++D+++E DD ++D+E+ESEEEE
Sbjct: 109 SEEDESDDDEEDEEEE--DDEEDDDEDESEEEE 139
Score = 38.5 bits (90), Expect = 0.005
Identities = 13/34 (38%), Positives = 24/34 (70%)
Query: 192 DDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
+D +D+EE+ EEE++ E+DD++E + E+ K
Sbjct: 111 EDESDDDEEDEEEEDDEEDDDEDESEEEESPVKK 144
Score = 38.5 bits (90), Expect = 0.005
Identities = 14/36 (38%), Positives = 25/36 (69%)
Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERK 217
+D+ DDDE D++ ++D E++ E+E E EE ++ K
Sbjct: 111 EDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146
Score = 36.9 bits (86), Expect = 0.016
Identities = 12/33 (36%), Positives = 23/33 (69%)
Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
+D++D+D E+E EE++E ++D+DE + E
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESP 141
Score = 28.8 bits (65), Expect = 8.3
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 112 IDGIHGAMSGAISSDDSDDENDEDDTSDDDTE 143
I G H S SDD +++ +E+D +DD E
Sbjct: 101 ISGQHLVASEEDESDDDEEDEEEEDDEEDDDE 132
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 43.0 bits (101), Expect = 9e-05
Identities = 19/48 (39%), Positives = 29/48 (60%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
D D D D+ D+++ETD+ DE +E E EE +E++DEE +A
Sbjct: 84 DADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEAEA 131
Score = 38.0 bits (88), Expect = 0.005
Identities = 15/61 (24%), Positives = 28/61 (45%), Gaps = 1/61 (1%)
Query: 171 DDDLEDNMDD-DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQ 229
+ E D D+D D+ E D ++ D EEE++E + D+ + E ++E +
Sbjct: 71 EAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEAE 130
Query: 230 E 230
Sbjct: 131 A 131
Score = 33.8 bits (77), Expect = 0.18
Identities = 16/62 (25%), Positives = 30/62 (48%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
+ + + D D+D DE + + D +E EE +E ++ +E AE + + E +E
Sbjct: 70 AEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEA 129
Query: 232 TA 233
A
Sbjct: 130 EA 131
Score = 33.4 bits (76), Expect = 0.24
Identities = 11/36 (30%), Positives = 20/36 (55%), Gaps = 1/36 (2%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
+ ++ E D DED +E +E + +E D++EE
Sbjct: 68 AAAEAAEEAEAADA-DEDADEAAEADAADEADEEEE 102
Score = 31.5 bits (71), Expect = 0.89
Identities = 8/45 (17%), Positives = 20/45 (44%)
Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
+ +E + + +++ +E+ E + +E D+EE E
Sbjct: 61 GPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDE 105
Score = 30.7 bits (69), Expect = 1.7
Identities = 8/48 (16%), Positives = 21/48 (43%)
Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
+ ++ D +E+++E E + D+ + + E + +E E
Sbjct: 66 ATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADE 113
Score = 30.3 bits (68), Expect = 2.7
Identities = 7/45 (15%), Positives = 20/45 (44%)
Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
+ ++ + D +D ++ + ++E +E EE D+ +
Sbjct: 67 TAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETA 111
>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D. SpoVID, the
stage VI sporulation protein D, is restricted to
endospore-forming members of the bacteria, all of which
are found among the Firmicutes. It is widely distributed
but not quite universal in this group. Between
well-conserved N-terminal and C-terminal domains is a
poorly conserved, low-complexity region of variable
length, rich enough in glutamic acid to cause spurious
BLAST search results unless a filter is used. The seed
alignment for this model was trimmed, in effect, by
choosing member sequences in which these regions are
relatively short. SpoVID is involved in spore coat
assembly by the mother cell compartment late in the
process of sporulation [Cellular processes, Sporulation
and germination].
Length = 338
Score = 45.6 bits (108), Expect = 9e-05
Identities = 25/120 (20%), Positives = 49/120 (40%), Gaps = 4/120 (3%)
Query: 119 MSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNM 178
+SG ++ D E +D ++++ S + +E + +R +
Sbjct: 133 ISGIQQENNLDAEPAREDEEEEESFSAEFEHPAQEE----TAGEEERTDEPKVEHEAHEQ 188
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNP 238
+ DDD DE E + ESE E EE++ EE + E + + + L + ++P
Sbjct: 189 HEQPADDDPDEWKISASEPFQLESEVEASPEEENYEEYEDETELEVEDEEKALDEQTEDP 248
>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
Length = 534
Score = 45.8 bits (108), Expect = 1e-04
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 13/101 (12%)
Query: 111 EIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMF 170
E D GA + A +D +D+ DE ++ D+ + S Y SS+D
Sbjct: 436 EFDQDPGAPAHADRADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSED--------- 486
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEED 211
E + DD D+ D + +D + EEEEE E
Sbjct: 487 ----ESDSADDSWGDESDSGIEHDDGGVGQAIEEEEEEERA 523
Score = 35.4 bits (81), Expect = 0.18
Identities = 17/61 (27%), Positives = 27/61 (44%), Gaps = 5/61 (8%)
Query: 171 DDDLEDNMDDDDDD-----DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
D D+E D+ D D D D D+ +E+ES +E + EDD + + S
Sbjct: 426 DQDVEAEAHDEFDQDPGAPAHADRADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSE 485
Query: 226 E 226
+
Sbjct: 486 D 486
Score = 32.3 bits (73), Expect = 1.7
Identities = 23/131 (17%), Positives = 47/131 (35%), Gaps = 30/131 (22%)
Query: 118 AMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDN 177
A + A ++ +D + + + + D + G + R DD E
Sbjct: 415 AAAAAAANAPADQDVEAEAHDEFDQDPG-------------APAHADRADSDEDDMDEQE 461
Query: 178 MDDDDDDDDDD----------ETDDDNDEDNEEESEEEEENEEDDD-------EERKAEK 220
D+ D +DD ++D++D ++ +E + E DD EE + E+
Sbjct: 462 SGDERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGGVGQAIEEEEEEE 521
Query: 221 KNSKNEHVQEL 231
+ + L
Sbjct: 522 RAVLGAVAEML 532
Score = 31.5 bits (71), Expect = 3.2
Identities = 24/97 (24%), Positives = 40/97 (41%), Gaps = 13/97 (13%)
Query: 143 ESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLE-----DNMDDDDDDDDDDETDDDNDED 197
E G+ + + QD FD D D D D+DD D+ E+ D+ D
Sbjct: 410 EEGRAAAAAAAANAPADQDVEAEAHDEFDQDPGAPAHADRADSDEDDMDEQESGDER-AD 468
Query: 198 NEEESEEEE-------ENEEDDDEERKAEKKNSKNEH 227
E++S+ E+E D ++ ++ +S EH
Sbjct: 469 GEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEH 505
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 45.5 bits (108), Expect = 1e-04
Identities = 25/160 (15%), Positives = 56/160 (35%), Gaps = 19/160 (11%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
D + D +DD+ + D + +D ++ N++++D++ ++ +
Sbjct: 33 DSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDS---- 88
Query: 231 LTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEE 290
N F ++ T + D + ++ I + L LNS +
Sbjct: 89 -----NNIIDFIYK-------NLPQTNINQLLTKNKYDDNYSLTTLIQN-LFNLNSDISD 135
Query: 291 EEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDT 330
E+ +S S D N + + + SK + D
Sbjct: 136 YEQPRNSEKSTN--DSNKNSDSSIKNDTDTQSSKQDKADN 173
Score = 37.1 bits (86), Expect = 0.065
Identities = 21/164 (12%), Positives = 57/164 (34%), Gaps = 30/164 (18%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
+D+D ++ + D +D+ S ++ + + + +
Sbjct: 157 IKNDTDTQSSKQDKADNQKAPS-------SNNTKPSTSNKQ------PNSPKPTQPNQSN 203
Query: 184 DD---DDDETDDDNDEDNE------------EESEEEEENEEDDDEERKAEKKNSKNEHV 228
DD + +DN+ + SE+ ++ ++D + K +K + N
Sbjct: 204 SQPASDDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQKDYASQSKKDKTETSNTKN 263
Query: 229 QELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQ--PRMSDKD 270
+L + + + + +T+ T + + P +S+ D
Sbjct: 264 PQLPTQDELKHKSKPAQSFENDVNQSNTRSTSLFETGPSLSNND 307
Score = 32.8 bits (75), Expect = 1.2
Identities = 45/256 (17%), Positives = 84/256 (32%), Gaps = 48/256 (18%)
Query: 120 SGAISSDDSDDENDEDDTSD---DDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLED 176
+ + S S+D+ DTS D ++ S + ++ S+ D D +
Sbjct: 38 TTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIYK 97
Query: 177 NMDDDDDDDD----------------------DDETDDDNDEDNEEESEEEEENEEDDDE 214
N+ + + + + D N E+S + D
Sbjct: 98 NLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSI 157
Query: 215 ERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIR 274
+ + ++SK + K N KA + + S K P+P ++ +
Sbjct: 158 KNDTDTQSSKQD-------KADNQKAPSSNNTKPST----SNKQPNSPKPTQPNQSNS-- 204
Query: 275 HKITSALAGLNSKEEEEEEQEDSAFS---DESADD---NSSDEDETVEPKTHKDS--KNP 326
+ A S ++ + DSA D+ ++D D + + S KNP
Sbjct: 205 QPASDDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQKDYASQSKKDKTETSNTKNP 264
Query: 327 EEDT--GLNWKSDLAQ 340
+ T L KS AQ
Sbjct: 265 QLPTQDELKHKSKPAQ 280
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 44.1 bits (105), Expect = 1e-04
Identities = 14/41 (34%), Positives = 22/41 (53%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEED 211
+ + D++D E DDD+DED +E+ EEE + D
Sbjct: 36 AAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPD 76
Score = 42.2 bits (100), Expect = 7e-04
Identities = 14/45 (31%), Positives = 25/45 (55%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
+ + D++D DDD+D+++E+E +EEE + D EE
Sbjct: 35 AAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79
Score = 41.8 bits (99), Expect = 0.001
Identities = 13/53 (24%), Positives = 28/53 (52%), Gaps = 3/53 (5%)
Query: 171 DDDLEDNMDD---DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
D + + + D+++ +DD+D+D +E+ ++EEE + D E E+
Sbjct: 31 DPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEEARER 83
Score = 38.7 bits (91), Expect = 0.008
Identities = 11/60 (18%), Positives = 27/60 (45%)
Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTA 233
++ N E++ D ++ +++ ++E+E+E+D++E E EL
Sbjct: 30 IDPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEEARERFAELRE 89
Score = 36.4 bits (85), Expect = 0.044
Identities = 12/49 (24%), Positives = 24/49 (48%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
DL D + E +EE+ E++++++ED+DE+ + E
Sbjct: 23 SDLISGFIDPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 45.8 bits (108), Expect = 2e-04
Identities = 25/57 (43%), Positives = 38/57 (66%)
Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
R+ R + +D + DDDD+D++ DDD++ED+EEE EEEEE + DDE+ + E
Sbjct: 135 RRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191
Score = 37.7 bits (87), Expect = 0.046
Identities = 14/50 (28%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 177 NMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
++ ++D D+ DD+DED +E+ +++E +++++EE + E K +E
Sbjct: 139 HLAEEDMSPRDNFVIDDDDEDEDED--DDDEEDDEEEEEEEEEIKGFDDE 186
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 43.2 bits (102), Expect = 2e-04
Identities = 29/111 (26%), Positives = 53/111 (47%), Gaps = 21/111 (18%)
Query: 108 SMMEIDGIHGAM--SGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKR 165
+ EID AM I + D + + E+ D+ Y E +D
Sbjct: 83 PLDEIDEEIQAMTEKKDIKAKDKEVDAFEEGDEDELD---------YDEDKEEEED---- 129
Query: 166 RKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNE-EESEEEEENEEDDDEE 215
D+ + ++DD+D+DDDE + ED+E +E E+++E+EED++++
Sbjct: 130 -----DEVDSLDDENDDEDEDDDEIVEILIEDDEVDEDEDDDEDEEDEEDK 175
Score = 40.5 bits (95), Expect = 0.001
Identities = 23/91 (25%), Positives = 48/91 (52%)
Query: 130 DENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDE 189
DE DE+ + + + + K+ +D + +++ ++ DD++DD+DE
Sbjct: 85 DEIDEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDE 144
Query: 190 TDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
DD+ E E+ E +E+ ++D+DEE + +K
Sbjct: 145 DDDEIVEILIEDDEVDEDEDDDEDEEDEEDK 175
Score = 35.9 bits (83), Expect = 0.051
Identities = 20/71 (28%), Positives = 36/71 (50%), Gaps = 8/71 (11%)
Query: 164 KRRKVMFDDDLEDNMDDDDD--------DDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
+ F++ ED +D D+D D D+ +DD DED++E E E++E D++E
Sbjct: 104 DKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIEDDEVDEDE 163
Query: 216 RKAEKKNSKNE 226
E + + +
Sbjct: 164 DDDEDEEDEED 174
Score = 33.2 bits (76), Expect = 0.42
Identities = 15/53 (28%), Positives = 29/53 (54%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELT 232
D + D E D+++ D +E+ EEEE++E D ++ ++ +E V+ L
Sbjct: 102 AKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILI 154
Score = 32.8 bits (75), Expect = 0.55
Identities = 16/74 (21%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 160 QDGRKRRKVMFDDDLEDNMDDD-DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
+ + K D E+ +D+ D D+D +E +DD + ++E+++E+E++++ E
Sbjct: 96 EKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIE 155
Query: 219 EKKNSKNEHVQELT 232
+ + ++E E
Sbjct: 156 DDEVDEDEDDDEDE 169
>gnl|CDD|221603 pfam12496, BNIP2, Bcl2-/adenovirus E1B nineteen kDa-interacting
protein 2. This domain family is found in eukaryotes,
and is typically between 119 and 133 amino acids in
length. There is a conserved HGGY sequence motif. This
family is Bcl2-/adenovirus E1B nineteen kDa-interacting
protein 2. It interacts with pro- and anti- apoptotic
molecules in the cell.
Length = 126
Score = 42.0 bits (99), Expect = 2e-04
Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 4/83 (4%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
DDL + DD D + DD +T DD+D + +E E E DD + ++ + +
Sbjct: 26 DDLLETPDDLDINVDDLDTPDDSDSLDFPGNELEWE----DDTPKAKAAESPPAGSIPQY 81
Query: 232 TAKQKNPKAFTFQSVIKGERKFR 254
TA+++ +++V GE++ R
Sbjct: 82 TAEEERRDGRLWRTVRIGEQEHR 104
>gnl|CDD|235030 PRK02315, PRK02315, adaptor protein; Provisional.
Length = 233
Score = 43.3 bits (103), Expect = 3e-04
Identities = 19/62 (30%), Positives = 31/62 (50%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
DL N +D D D++ +D ED + E+ E+++D D K EK+ + E +
Sbjct: 79 KLDLPLNFEDLIDLPSDEDIEDLFPEDFFKLLEQSLEDKDDLDATAKQEKEEKEAEDEVQ 138
Query: 231 LT 232
LT
Sbjct: 139 LT 140
Score = 39.8 bits (94), Expect = 0.004
Identities = 14/63 (22%), Positives = 30/63 (47%), Gaps = 1/63 (1%)
Query: 167 KVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
K+ + ED +D D+D +D +D E+S E++++ + ++ K EK+
Sbjct: 79 KLDLPLNFEDLIDLPSDEDIEDLFPED-FFKLLEQSLEDKDDLDATAKQEKEEKEAEDEV 137
Query: 227 HVQ 229
+
Sbjct: 138 QLT 140
Score = 36.0 bits (84), Expect = 0.081
Identities = 13/51 (25%), Positives = 24/51 (47%)
Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
+ D+D+ED +D + +D +D D + E+EE+ ED+
Sbjct: 87 EDLIDLPSDEDIEDLFPEDFFKLLEQSLEDKDDLDATAKQEKEEKEAEDEV 137
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 44.6 bits (106), Expect = 3e-04
Identities = 12/57 (21%), Positives = 30/57 (52%), Gaps = 1/57 (1%)
Query: 169 MFDDDLEDNMDDDDDDDDD-DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
+ D+E + DD DD++ + D D+D+E++ + ++++E++ + K
Sbjct: 211 LLKLDVEIQNELDDIDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIK 267
Score = 38.1 bits (89), Expect = 0.028
Identities = 12/50 (24%), Positives = 28/50 (56%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
DD++D ++ D+DD+ +DD + ++++ EE + E ++ +K
Sbjct: 223 DDIDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKEVSEK 272
Score = 35.0 bits (81), Expect = 0.25
Identities = 16/50 (32%), Positives = 27/50 (54%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
DD ++ + D+DDD+ DD D D+++E E + E E + +EK
Sbjct: 223 DDIDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKEVSEK 272
Score = 31.1 bits (71), Expect = 3.8
Identities = 17/68 (25%), Positives = 27/68 (39%), Gaps = 27/68 (39%)
Query: 126 DDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDD 185
DD DDE +E +D+D + D+D +MDDDD+++
Sbjct: 223 DDIDDEEEERVLADEDDD---------------------------DEDDMFDMDDDDEEE 255
Query: 186 DDDETDDD 193
D E +
Sbjct: 256 SDPEVERT 263
Score = 31.1 bits (71), Expect = 4.2
Identities = 12/36 (33%), Positives = 20/36 (55%)
Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEE 204
+ D+ +D+ DD D DDDDE + D + + +E
Sbjct: 233 VLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKE 268
Score = 30.7 bits (70), Expect = 5.0
Identities = 11/38 (28%), Positives = 22/38 (57%)
Query: 160 QDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDED 197
D + R + +DD +++ D DDDD++E+D + +
Sbjct: 226 DDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERT 263
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 41.1 bits (96), Expect = 3e-04
Identities = 27/86 (31%), Positives = 49/86 (56%), Gaps = 1/86 (1%)
Query: 151 KYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEE 210
K K++VV ++ K +++ E+ + DD+ +++E D++DE+ E E EEEE EE
Sbjct: 17 KEKKEVVEEKENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEE 76
Query: 211 DDDEERKAEKKNSKNEHVQELTAKQK 236
++ E K+ +++E T KQK
Sbjct: 77 EETEG-ATGKRAAEDEEDDAETKKQK 101
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 44.1 bits (104), Expect = 4e-04
Identities = 26/107 (24%), Positives = 49/107 (45%), Gaps = 9/107 (8%)
Query: 123 ISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDD 182
I +D + DD Q + + ++ S D + DD +E++ D++
Sbjct: 184 IEDKAGEDLDGLAAEIDD-----QQAFARVVRDMLGSMDMAEE---TGDDGIEED-ADEE 234
Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQ 229
D DDD ++++ E EESE +E+EED+ E E + + + +
Sbjct: 235 DGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAE 281
Score = 42.9 bits (101), Expect = 0.001
Identities = 21/113 (18%), Positives = 43/113 (38%), Gaps = 6/113 (5%)
Query: 109 MMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKV 168
M+ + +D+D+E+ +DD D++ +S G + +
Sbjct: 212 MLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEAT--- 268
Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
D + E+ D + +D E D++DED E E+ + + + K
Sbjct: 269 --DGEGEEGEMDAAEASEDSE-SDESDEDTETPGEDARPATPFTELMEEVDYK 318
Score = 32.9 bits (75), Expect = 1.2
Identities = 11/55 (20%), Positives = 24/55 (43%)
Query: 176 DNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
+M ++ DD +E D+ D D+++ E+ ++ E E + + E
Sbjct: 216 MDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDG 270
Score = 32.5 bits (74), Expect = 1.5
Identities = 18/121 (14%), Positives = 43/121 (35%), Gaps = 18/121 (14%)
Query: 135 DDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDD----DD--- 187
D D E + + K + D + ++ D++ED +D D DD
Sbjct: 145 TDKEDAPLEEAVALLVREKLTGDAPPDSAGKVLDLWRDEIEDKAGEDLDGLAAEIDDQQA 204
Query: 188 -----------DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK 236
+ ++ +D EE +EE+ ++D + + + + + + + +
Sbjct: 205 FARVVRDMLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDE 264
Query: 237 N 237
Sbjct: 265 A 265
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 44.1 bits (104), Expect = 4e-04
Identities = 24/112 (21%), Positives = 45/112 (40%), Gaps = 6/112 (5%)
Query: 120 SGAISSDDSDDEN--DEDDTSDDDTESGQGSMKKYKEQVVSSQ----DGRKRRKVMFDDD 173
A S D DDE + +D S + + S + +E S DG ++
Sbjct: 161 KEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEG 220
Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
E + ++D + E+D+ + E +E S + E ++ K +K +S
Sbjct: 221 EEAPSINYNEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTF 272
Score = 38.3 bits (89), Expect = 0.023
Identities = 21/97 (21%), Positives = 36/97 (37%), Gaps = 3/97 (3%)
Query: 131 ENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRR---KVMFDDDLEDNMDDDDDDDDD 187
+ T + + S K E+ S+D K + DD+ E++ D +D D
Sbjct: 149 GKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYD 208
Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
D +DE+ EE+ NE+ + E
Sbjct: 209 GMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEIS 245
Score = 34.0 bits (78), Expect = 0.48
Identities = 26/137 (18%), Positives = 50/137 (36%), Gaps = 43/137 (31%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGS--MKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
S D+S E +D SDD+ E S +Y +V S D + ++ N ++D
Sbjct: 177 SEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEE-----GEEAPSINYNED 231
Query: 182 DDDDDDDETDDDNDEDN------------------------------------EEESEEE 205
+ + DE+D + E E+E +++
Sbjct: 232 TSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPSLMGGYFSGSEDEDDDD 291
Query: 206 EENEEDDDEERKAEKKN 222
E+ + D ++ ++KN
Sbjct: 292 EDIDPDQVVKKPVKRKN 308
Score = 33.3 bits (76), Expect = 0.92
Identities = 12/58 (20%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 171 DDDLEDNMDDDDDD--DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
+ +++ D DD++ + +DE+ + +++ + EEEE+++ +D + +S +E
Sbjct: 160 KKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDE 217
Score = 30.6 bits (69), Expect = 5.1
Identities = 21/152 (13%), Positives = 47/152 (30%), Gaps = 28/152 (18%)
Query: 177 NMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK 236
+T ++ +E S++++E E + ++E K+E E + ++
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSE---ESAEDDSDDEEEED 198
Query: 237 NPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQED 296
+ + +V + + A E
Sbjct: 199 S-----------DSEDYSQYDGMLVDSSDEEEGEEAP--------------SINYNEDTS 233
Query: 297 SAFSDESADDNSSDEDETVEPKTHKDSKNPEE 328
+ SDES + S + ++ SK P+E
Sbjct: 234 ESESDESDSEISESRSVSDSEESSPPSKKPKE 265
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 44.0 bits (104), Expect = 4e-04
Identities = 19/66 (28%), Positives = 36/66 (54%), Gaps = 5/66 (7%)
Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK----- 236
D+++++++ + +E+ + EEE + EE+ +E++K KK + EL K K
Sbjct: 37 DEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTKPIWTR 96
Query: 237 NPKAFT 242
NPK T
Sbjct: 97 NPKDVT 102
Score = 33.6 bits (77), Expect = 0.76
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 6/51 (11%)
Query: 187 DDETDDDNDEDNEEESEEEEEN-----EEDDDEERKAEKKNSKNEHVQELT 232
+ E D+ +E+ +EE +EEEE EE D+EE K EKK K + V+E T
Sbjct: 32 EKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKK-KKTKKVKETT 81
Score = 30.5 bits (69), Expect = 6.2
Identities = 15/63 (23%), Positives = 29/63 (46%)
Query: 196 EDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRS 255
E +E +EEEE E+++ +E + + + + E +E ++K K +
Sbjct: 30 EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNK 89
Query: 256 TKP 258
TKP
Sbjct: 90 TKP 92
>gnl|CDD|214472 smart00017, OSTEO, Osteopontin. Osteopontin is an acidic
phosphorylated glycoprotein of about 40 Kd which is
abundant in the mineral matrix of bones and which binds
tightly to hydroxyapatite. It is suggested that
osteopontin might function as a cell attachment factor
and could play a key role in the adhesion of osteoclasts
to the mineral matrix of bone.
Length = 287
Score = 43.4 bits (102), Expect = 4e-04
Identities = 19/41 (46%), Positives = 29/41 (70%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
DDL+D+ DDD DD D+++DD D D+ +ES+E ++E D
Sbjct: 70 DDLDDDDDDDHVDDRDNDSDDAEDSDDSDESDESHHSDESD 110
Score = 37.3 bits (86), Expect = 0.040
Identities = 17/35 (48%), Positives = 24/35 (68%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEE 205
DDD +D++DD D+D DD E DD+DE +E +E
Sbjct: 74 DDDDDDHVDDRDNDSDDAEDSDDSDESDESHHSDE 108
Score = 35.7 bits (82), Expect = 0.100
Identities = 16/42 (38%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
DDD +D+ DD D+D D+ +D +D D +ES +E++ D
Sbjct: 73 DDDDDDD-HVDDRDNDSDDAEDSDDSDESDESHHSDESDVTD 113
>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein. This family consists of several
eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
to suppress GCR2 and is highly expressed in the muscle
and heart. The function of this family is unknown
although it has been speculated that SGT1 may be
functionally analogous to the Gcr2p protein of
Saccharomyces cerevisiae which is known to be a
regulatory factor of glycolytic gene expression.
Length = 557
Score = 43.6 bits (103), Expect = 6e-04
Identities = 30/114 (26%), Positives = 53/114 (46%), Gaps = 24/114 (21%)
Query: 107 KSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDD-----DTESGQGSMKKYKEQVVSSQD 161
SM E + GA +DD D+++DE D S+D D + +K
Sbjct: 419 SSMEEFLNKVSSFEGAEFADDEDEDDDEPDDSEDKDVSFDEDEFFEFLKNM--------- 469
Query: 162 GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
+ L+D+ D+D DD D+ D+D+DED++E+ + ++ ++ EE
Sbjct: 470 -------L---GLKDDEIDNDLPDDSDDADEDDDEDDDEDEDSSSDSTLEELEE 513
Score = 40.9 bits (96), Expect = 0.005
Identities = 27/134 (20%), Positives = 56/134 (41%), Gaps = 11/134 (8%)
Query: 90 EIFEFLNICQVHGESTEKSMM-EIDGIHGAMSGAISSDDSDDENDEDD----TSDDDTES 144
E E L + + ST S EI + + ++ D ++DD S D+ +S
Sbjct: 330 EYKELLEKAEAYFSSTASSPGEEILDLLQLDLEELRDEEQDLPPEDDDSWLNISPDELDS 389
Query: 145 ----GQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEE 200
QG K K + + + +E+ ++ + + DD++++D+E
Sbjct: 390 ELQERQGDKKDLKSNKEDANEVDDLEE--VVSSMEEFLNKVSSFEGAEFADDEDEDDDEP 447
Query: 201 ESEEEEENEEDDDE 214
+ E+++ D+DE
Sbjct: 448 DDSEDKDVSFDEDE 461
Score = 40.1 bits (94), Expect = 0.007
Identities = 19/73 (26%), Positives = 28/73 (38%), Gaps = 6/73 (8%)
Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDD 184
DD DD+ DED +SD E + M + ++ + +DD
Sbjct: 491 DDDEDDDEDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADIS------NSGSSGAEDD 544
Query: 185 DDDDETDDDNDED 197
DDD E + D D
Sbjct: 545 DDDIEGVEPVDID 557
Score = 34.3 bits (79), Expect = 0.39
Identities = 33/204 (16%), Positives = 71/204 (34%), Gaps = 22/204 (10%)
Query: 127 DSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDD--DLEDNMDDDDDD 184
+ D + S++ E + Y SS G + ++ D +L D D +
Sbjct: 317 EKGYFKDNLEGSEEYKE-LLEKAEAYFSSTASS-PGEEILDLLQLDLEELRDEEQDLPPE 374
Query: 185 DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQ 244
DDD + DE + E E + ++ D + K + + +E+ + + +
Sbjct: 375 DDDSWLNISPDELDSELQ--ERQGDKKDLKSNKEDANEVDDL--EEVVSSME--EFLNKV 428
Query: 245 SVIKGERKFRSTKPTVVPQPRMSDKDLAI---------RHKITSA---LAGLNSKEEEEE 292
S +G DKD++ ++ + + + ++
Sbjct: 429 SSFEGAEFADDEDEDDDEPDDSEDKDVSFDEDEFFEFLKNMLGLKDDEIDNDLPDDSDDA 488
Query: 293 EQEDSAFSDESADDNSSDEDETVE 316
+++D DE D +S E +E
Sbjct: 489 DEDDDEDDDEDEDSSSDSTLEELE 512
>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295). This
family is conserved in fungi but the function is not
known.
Length = 509
Score = 43.4 bits (102), Expect = 6e-04
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 11/84 (13%)
Query: 138 SDDDTESGQGSM---KK---YKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETD 191
SD+ +S M KK +KEQVV+ R + DDD ED ++ ++DD D+ +
Sbjct: 250 SDELGKSLPSLMSPRKKTASFKEQVVT----RTFPERTSDDD-EDAIETEEDDVDESAIE 304
Query: 192 DDNDEDNEEESEEEEENEEDDDEE 215
DD+D+ + E+S EE D++
Sbjct: 305 DDDDDSDWEDSVEESGRSSVDEKT 328
Score = 34.1 bits (78), Expect = 0.45
Identities = 23/119 (19%), Positives = 46/119 (38%), Gaps = 14/119 (11%)
Query: 118 AMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGR--------KRRKVM 169
+ G+ DD D D + D S K + S + G+ +++
Sbjct: 210 TLGGSSGDDDEDSFEDRMSSQDPKRSSLPKPKPKMFQLGGSDELGKSLPSLMSPRKKTAS 269
Query: 170 FDD-----DLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS 223
F + + DDD+D + E +DD DE E+ +++ + E+ +E ++
Sbjct: 270 FKEQVVTRTFPERTSDDDEDAIETE-EDDVDESAIEDDDDDSDWEDSVEESGRSSVDEK 327
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 42.4 bits (100), Expect = 0.001
Identities = 16/71 (22%), Positives = 33/71 (46%), Gaps = 5/71 (7%)
Query: 160 QDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
+D R+ R D + + D+D+D+++E D E+ E E EEE ++ +D ++
Sbjct: 355 RDKRRARLDPIDFE-----EVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESS 409
Query: 220 KKNSKNEHVQE 230
+ +
Sbjct: 410 SDVGSDSESKA 420
Score = 37.0 bits (86), Expect = 0.061
Identities = 17/65 (26%), Positives = 34/65 (52%), Gaps = 2/65 (3%)
Query: 176 DNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQ 235
D +D ++ D+D+DE ++ +E E EE E++EE+ + R+ S ++ + +K
Sbjct: 363 DPIDFEEVDEDEDE--EEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKA 420
Query: 236 KNPKA 240
A
Sbjct: 421 DKESA 425
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 43.1 bits (101), Expect = 0.001
Identities = 16/45 (35%), Positives = 33/45 (73%), Gaps = 1/45 (2%)
Query: 175 EDNMDDDDDDDDDDETDDD-NDEDNEEESEEEEENEEDDDEERKA 218
+D D+ D+D++ DE+ +D +++++E +S +EE+ E+ D+ E KA
Sbjct: 946 DDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESKA 990
Score = 36.1 bits (83), Expect = 0.12
Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 25/99 (25%)
Query: 121 GAISSDDSDDENDE--DDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNM 178
S D+SD+ +E + + D ES + D+D E +
Sbjct: 924 MVGSDDESDESEEEVSEYEASSDDESDET-----------------------DEDEESDE 960
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERK 217
+D +D+ E D ++ED E+ E E + D ++
Sbjct: 961 SSEDLSEDESENDSSDEEDGEDWDELESKAAYDSRPGKR 999
Score = 31.9 bits (72), Expect = 2.5
Identities = 25/73 (34%), Positives = 36/73 (49%), Gaps = 11/73 (15%)
Query: 286 SKEEEEEEQEDSA----FSDESADDNSSDE--DETVEPKTHKDSKNPEEDTGLNWKSDLA 339
S E EEE E A SDE+ +D SDE ++ E ++ DS + E+ G +W +L
Sbjct: 931 SDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEED--GEDWD-ELE 987
Query: 340 QKAASAFLERQAN 352
KA A+ R
Sbjct: 988 SKA--AYDSRPGK 998
Score = 31.5 bits (71), Expect = 3.1
Identities = 12/34 (35%), Positives = 23/34 (67%)
Query: 186 DDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
DDE+D+ +E +E E+ ++E++E D++E E
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDE 960
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 42.5 bits (101), Expect = 0.001
Identities = 16/69 (23%), Positives = 33/69 (47%), Gaps = 8/69 (11%)
Query: 152 YKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEED 211
+ +++ + +RR +L D D + ++D + +E +++E EEEEE+E D
Sbjct: 155 WYDRLEN----GERRL----RELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDEND 206
Query: 212 DDEERKAEK 220
D +
Sbjct: 207 DSLAADESE 215
Score = 34.8 bits (81), Expect = 0.28
Identities = 11/40 (27%), Positives = 20/40 (50%)
Query: 187 DDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
D D + +ED E EE ++D+DEE + ++ +
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLA 210
Score = 33.6 bits (78), Expect = 0.79
Identities = 13/66 (19%), Positives = 32/66 (48%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
++ +D +++++D++DD D E E+ E+ + + + RKA++K + Q
Sbjct: 190 EELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQH 249
Query: 231 LTAKQK 236
+
Sbjct: 250 KKYAKL 255
Score = 32.5 bits (75), Expect = 1.6
Identities = 13/39 (33%), Positives = 20/39 (51%)
Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
+E+ D DD ++E D DED EEE+ + E +
Sbjct: 60 VEEAPDADDLLLAENEADAQTDEDAEEEAAAALSSVESE 98
Score = 31.3 bits (72), Expect = 3.6
Identities = 11/38 (28%), Positives = 20/38 (52%)
Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
E D D+ E+E + + +ED +EE A + ++E
Sbjct: 61 EEAPDADDLLLAENEADAQTDEDAEEEAAAALSSVESE 98
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 38.9 bits (91), Expect = 0.002
Identities = 27/106 (25%), Positives = 42/106 (39%), Gaps = 15/106 (14%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
S DD ++E+D DD Y + + + R++ + DD DD
Sbjct: 8 SRRSRDDYDEEEDEDMDDFIEDDDEEDDYDRDEIWAMFNKGRKRYAYRDD--------DD 59
Query: 184 DDDDDETDDDNDEDNEEESE-------EEEENEEDDDEERKAEKKN 222
DDDD E + ++ E S EEE E ++E K +K
Sbjct: 60 DDDDMEAGGADIQEEERRSARLARLEDREEERLEKEEEREKRARKR 105
Score = 29.7 bits (67), Expect = 3.0
Identities = 23/98 (23%), Positives = 42/98 (42%), Gaps = 35/98 (35%)
Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDE-------------------TDDDNDED-----N 198
+R + +D++ +++MDD +DDD+++ DD+D+D
Sbjct: 8 SRRSRDDYDEEEDEDMDDFIEDDDEEDDYDRDEIWAMFNKGRKRYAYRDDDDDDDDMEAG 67
Query: 199 EEESEEEEE------NEEDDDEER-----KAEKKNSKN 225
+ +EEE ED +EER + EK+ K
Sbjct: 68 GADIQEEERRSARLARLEDREEERLEKEEEREKRARKR 105
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 42.2 bits (99), Expect = 0.002
Identities = 19/76 (25%), Positives = 37/76 (48%), Gaps = 5/76 (6%)
Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD-----DEERK 217
R R+++ +L + M DD + +D+++ DDD +NE+E + E E E + + E
Sbjct: 188 RVVREMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEAT 247
Query: 218 AEKKNSKNEHVQELTA 233
+ S E + +
Sbjct: 248 DRESESGEEEMVQSDQ 263
Score = 37.6 bits (87), Expect = 0.038
Identities = 19/113 (16%), Positives = 46/113 (40%)
Query: 123 ISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDD 182
++ + DD ED+ DD + + ++ E Q+G ++ D ++ +++
Sbjct: 199 LAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEM 258
Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQ 235
D D+ D++D+D+E E + + K + + A++
Sbjct: 259 VQSDQDDLPDESDDDSETPGEGARPARPFTSTGGEPDYKVFTTAFDETVDAEE 311
Score = 32.6 bits (74), Expect = 1.3
Identities = 19/125 (15%), Positives = 47/125 (37%), Gaps = 28/125 (22%)
Query: 119 MSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNM 178
M S+D +D +D+ T ++ E G+G E
Sbjct: 203 MGDDTESEDEEDGDDDQPTENEQEEQGEG---------------------------EGEG 235
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENE-EDDDEERKAEKKNSKNEHVQELTAKQKN 237
+ + + TD +++ EE + ++++ ++ D++ + + ++ T + +
Sbjct: 236 QEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDDSETPGEGARPARPFTSTGGEPD 295
Query: 238 PKAFT 242
K FT
Sbjct: 296 YKVFT 300
>gnl|CDD|206749 cd01856, YlqF, Circularly permuted YlqF GTPase. Proteins of the
YlqF family contain all sequence motifs typical of the
vast class of P-loop-containing GTPases, but show a
circular permutation, with a G4-G1-G3 pattern of motifs
as opposed to the regular G1-G3-G4 pattern seen in most
GTPases. The YlqF subfamily is represented in all
eukaryotes as well as a phylogenetically diverse array
of bacteria (including gram-positive bacteria,
proteobacteria, Synechocystis, Borrelia, and
Thermotoga).
Length = 171
Score = 39.8 bits (94), Expect = 0.002
Identities = 19/72 (26%), Positives = 29/72 (40%), Gaps = 1/72 (1%)
Query: 393 QKNPKAFTFQSVIKGERKFRRKEDIQAKKHHVPQVDRTPLEPPPIVVAVVGPPQVGKSTL 452
+ + F + G+ + + + ++ L P P+ VVG P VGKSTL
Sbjct: 72 KSQGEPVLFVNAKNGKGVKKLLKKAKKLLKENEKLKAKGLLPRPLRAMVVGIPNVGKSTL 131
Query: 453 IRCLI-KNFTKT 463
I L K K
Sbjct: 132 INRLRGKKVAKV 143
>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
N-terminal domain. Autophagocytosis is a
starvation-induced process responsible for transport of
cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
ubiquitin like modifier that is topologically similar to
the canonical E2 enzyme. It catalyzes the conjugation of
Atg8 and phosphatidylethanolamine.
Length = 146
Score = 39.6 bits (93), Expect = 0.002
Identities = 14/52 (26%), Positives = 28/52 (53%)
Query: 164 KRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
+R K M D + + +D+D+DD T N + +++ +EE+ E D++
Sbjct: 77 RRAKQMEYGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDD 128
Score = 33.8 bits (78), Expect = 0.20
Identities = 10/45 (22%), Positives = 23/45 (51%)
Query: 168 VMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
V + + DD D++D E DD+D+ + +E+++++
Sbjct: 102 VTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADEDDDDDIP 146
Score = 33.1 bits (76), Expect = 0.34
Identities = 11/55 (20%), Positives = 27/55 (49%), Gaps = 8/55 (14%)
Query: 167 KVMFDDDLED--------NMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
+++ D+D +D DD D+++ + D+D++ + + ++DDD
Sbjct: 90 EIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADEDDDDD 144
>gnl|CDD|214640 smart00382, AAA, ATPases associated with a variety of cellular
activities. AAA - ATPases associated with a variety of
cellular activities. This profile/alignment only detects
a fraction of this vast family. The poorly conserved
N-terminal helix is missing from the alignment.
Length = 148
Score = 39.7 bits (92), Expect = 0.002
Identities = 14/34 (41%), Positives = 18/34 (52%)
Query: 435 PPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVI 468
P V+ +VGPP GK+TL R L + VI
Sbjct: 1 PGEVILIVGPPGSGKTTLARALARELGPPGGGVI 34
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 41.0 bits (96), Expect = 0.003
Identities = 13/65 (20%), Positives = 31/65 (47%)
Query: 162 GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
R R ++ D+ + + D+ + D ++ +D++D +E+ ++ EE E + +
Sbjct: 196 ARVVRDMLSSMDMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSD 255
Query: 222 NSKNE 226
S E
Sbjct: 256 ASSEE 260
Score = 39.4 bits (92), Expect = 0.007
Identities = 13/60 (21%), Positives = 29/60 (48%), Gaps = 1/60 (1%)
Query: 172 DDLEDNMDDDDDD-DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
D EDN D+DD D+DD+ ++ + + + E+ + ++ E + E + + +
Sbjct: 219 ADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPD 278
Score = 39.4 bits (92), Expect = 0.008
Identities = 16/59 (27%), Positives = 29/59 (49%)
Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTA 233
E D +D++D+D+ +D D+D EE E + +D + +E+ S E +A
Sbjct: 215 EPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASA 273
Score = 39.0 bits (91), Expect = 0.011
Identities = 15/64 (23%), Positives = 32/64 (50%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
D E+ D+ + D +D D+D+ +++E++ + EEE D + +S+ E+
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEM 266
Query: 232 TAKQ 235
A +
Sbjct: 267 EAAE 270
Score = 38.6 bits (90), Expect = 0.014
Identities = 13/53 (24%), Positives = 27/53 (50%)
Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
+ D+ + +D++D+DD E +DD+ + EE + +E+ D + E
Sbjct: 212 LGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESG 264
Score = 34.4 bits (79), Expect = 0.28
Identities = 17/98 (17%), Positives = 30/98 (30%), Gaps = 26/98 (26%)
Query: 118 AMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDN 177
+M A D + D +D D+D +D
Sbjct: 205 SMDMAEELGDEPESADSEDNEDEDDPKEDED--------------------------DDQ 238
Query: 178 MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
++++ D ++D + E ES E E E D+
Sbjct: 239 GEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDT 276
Score = 33.6 bits (77), Expect = 0.56
Identities = 16/92 (17%), Positives = 35/92 (38%), Gaps = 27/92 (29%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
S+D D+E+++D D+D + G+ E++ D
Sbjct: 218 SADSEDNEDEDDPKEDEDDDQGEE---------------------------EESGSSDSL 250
Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
+D D + ++ + E +E ++ D D+
Sbjct: 251 SEDSDASSEEMESGEMEAAEASADDTPDSDDA 282
Score = 30.6 bits (69), Expect = 4.3
Identities = 14/71 (19%), Positives = 32/71 (45%), Gaps = 12/71 (16%)
Query: 172 DDLEDNMDD------------DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
D L +DD D ++ D+ D+E+ +E++ E++DD++ + E
Sbjct: 183 DRLAACVDDQQAFARVVRDMLSSMDMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEE 242
Query: 220 KKNSKNEHVQE 230
+ S + ++
Sbjct: 243 ESGSSDSLSED 253
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 37.5 bits (87), Expect = 0.003
Identities = 19/57 (33%), Positives = 37/57 (64%)
Query: 149 MKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEE 205
M+ E ++ + R VM + + ++ DDD+DDDDDD+ DD +D+D++++ ++E
Sbjct: 24 MEARTEALIEEEKVEIARSVMIEGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80
Score = 34.8 bits (80), Expect = 0.026
Identities = 12/33 (36%), Positives = 29/33 (87%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
++ +DDDDDE DDD+D+ ++++ ++++++E+D+
Sbjct: 48 EEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80
Score = 32.1 bits (73), Expect = 0.21
Identities = 12/33 (36%), Positives = 29/33 (87%)
Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
++ +DDDD+ DDD+D+D +++ +++++++EDD+
Sbjct: 48 EEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80
Score = 31.7 bits (72), Expect = 0.32
Identities = 10/33 (30%), Positives = 30/33 (90%)
Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
++ +DDD+ +DD+D+D++++ +++++++++DDE
Sbjct: 48 EEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80
Score = 28.6 bits (64), Expect = 4.4
Identities = 9/31 (29%), Positives = 27/31 (87%)
Query: 185 DDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
++ ++ DDD D+D++++ ++++++++DDDE+
Sbjct: 48 EEPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78
Score = 28.2 bits (63), Expect = 5.3
Identities = 7/32 (21%), Positives = 29/32 (90%)
Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
++ +D+ DD++D+D++++ ++++++++DD+++
Sbjct: 48 EEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDD 79
Score = 27.8 bits (62), Expect = 7.3
Identities = 7/30 (23%), Positives = 26/30 (86%)
Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERK 217
+E +DD+D++++++ +++++ ++DDD++ +
Sbjct: 48 EEPEDDDDDEDDDDDDDKDDKDDDDDDDDE 77
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 41.3 bits (97), Expect = 0.003
Identities = 38/179 (21%), Positives = 68/179 (37%), Gaps = 17/179 (9%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE---------------E 215
+ + N ++D+++ + +D D++E E+E+ +E+DDE
Sbjct: 4 QVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKA 63
Query: 216 RKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRH 275
R E K K + +Q++ +Q + KG K+ + + D+ +
Sbjct: 64 RLKELKKQKKQEIQKILEQQNAAIDADMNNKGKGRLKYLLQQTEIFAHFAKGDQ--SASA 121
Query: 276 KITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNW 334
K SK EEEE E+ +E S V+P K + GLNW
Sbjct: 122 KKAKGRGRHASKLTEEEEDEEYLKEEEDGLGGSGGTRLLVQPSCIKGKMRDYQLAGLNW 180
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 39.0 bits (91), Expect = 0.004
Identities = 14/52 (26%), Positives = 26/52 (50%)
Query: 185 DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK 236
DD+E + + + +E E E+ E + + E++K E K E + T +K
Sbjct: 94 SDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKK 145
Score = 35.1 bits (81), Expect = 0.075
Identities = 19/50 (38%), Positives = 27/50 (54%), Gaps = 2/50 (4%)
Query: 190 TDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPK 239
+ DD +E+ EEES +E E E D E K E K K V + +++ PK
Sbjct: 93 SSDDEEEETEEESTDETEQE--DPPETKTESKEKKKREVPKPKTEKEKPK 140
Score = 30.1 bits (68), Expect = 4.2
Identities = 13/42 (30%), Positives = 20/42 (47%)
Query: 199 EEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKA 240
+EE E EEE+ ++ ++E E K E + K K K
Sbjct: 96 DEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKE 137
Score = 29.3 bits (66), Expect = 7.3
Identities = 12/65 (18%), Positives = 25/65 (38%), Gaps = 3/65 (4%)
Query: 202 SEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVV 261
S ++E EE ++E ++ E E K+K + + E+ K
Sbjct: 92 SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVP---KPKTEKEKPKTEPKKPKP 148
Query: 262 PQPRM 266
+P++
Sbjct: 149 SKPKL 153
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 37.4 bits (87), Expect = 0.004
Identities = 12/67 (17%), Positives = 37/67 (55%)
Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQ 229
D + E + +++++++++D+ +D +DED + E E++ +R+ EK+ ++
Sbjct: 2 LDTEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEEL 61
Query: 230 ELTAKQK 236
+++
Sbjct: 62 AEYLRKR 68
Score = 28.6 bits (64), Expect = 5.3
Identities = 8/56 (14%), Positives = 32/56 (57%)
Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKN 237
D + + D+ +++ +E+ ++ + +E+E D+ E + ++++ + + +E ++
Sbjct: 3 DTEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDA 58
Score = 27.8 bits (62), Expect = 9.1
Identities = 14/35 (40%), Positives = 20/35 (57%)
Query: 288 EEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKD 322
EEEEEE+E+ D S +D DE E + + H+
Sbjct: 11 EEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRR 45
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 39.7 bits (93), Expect = 0.004
Identities = 14/55 (25%), Positives = 33/55 (60%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
++ E++ ++D + ++ ++ ED+EE +E + E +D++ + K E K +NE
Sbjct: 22 ENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKKLENE 76
Score = 38.6 bits (90), Expect = 0.009
Identities = 17/68 (25%), Positives = 30/68 (44%), Gaps = 3/68 (4%)
Query: 176 DNMDDDDDDDDDDETDDDNDEDNEEES---EEEEENEEDDDEERKAEKKNSKNEHVQELT 232
+ D ++ E D + +N+EE EE+ E EE + EE + + S ++EL
Sbjct: 1 MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60
Query: 233 AKQKNPKA 240
+ K
Sbjct: 61 DENNKLKE 68
Score = 38.2 bits (89), Expect = 0.014
Identities = 15/65 (23%), Positives = 32/65 (49%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
+ +D ++ ++D E ++ ++ +EE E EE E+++ E E K E +++
Sbjct: 3 KECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDE 62
Query: 232 TAKQK 236
K K
Sbjct: 63 NNKLK 67
Score = 37.8 bits (88), Expect = 0.018
Identities = 15/56 (26%), Positives = 27/56 (48%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
++D ++ ++D +E + + + EE E+ EE+ E EE K E K E
Sbjct: 14 EEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEE 69
Score = 36.7 bits (85), Expect = 0.045
Identities = 11/71 (15%), Positives = 31/71 (43%)
Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHV 228
+ + + ++D ++E +++ E+ EE E EE ++ ++ + +
Sbjct: 2 EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKD 61
Query: 229 QELTAKQKNPK 239
+ K++N K
Sbjct: 62 ENNKLKEENKK 72
>gnl|CDD|221970 pfam13191, AAA_16, AAA ATPase domain. This family of domains
contain a P-loop motif that is characteristic of the AAA
superfamily.
Length = 154
Score = 38.7 bits (90), Expect = 0.005
Identities = 12/52 (23%), Positives = 23/52 (44%), Gaps = 4/52 (7%)
Query: 411 FRRKEDIQAKKHHVPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCLIKNFTK 462
R+E+++ + R P V + GP GK++L+R L++
Sbjct: 3 VGREEELERLLDAL----RRARSGGPPSVLLTGPSGTGKTSLLRELLEGLLV 50
>gnl|CDD|216161 pfam00865, Osteopontin, Osteopontin.
Length = 293
Score = 40.1 bits (93), Expect = 0.005
Identities = 36/175 (20%), Positives = 68/175 (38%), Gaps = 26/175 (14%)
Query: 176 DNMDDDDDDDDDD----------ETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
++MDDDDDDDDDD E+DD + D+ + S+E ++E D+ +
Sbjct: 64 EHMDDDDDDDDDDHVDSQDSNSNESDDADHTDDSDHSDESHHSDESDESVTDFPTETPAT 123
Query: 226 EHVQELTAKQKNPK------AFTFQSVIKGERKFRSTKPTVVPQ---PRMSDKDLAIRHK 276
+ + A+ +S K R+ + P + + ++L K
Sbjct: 124 DVFTPVVPTVDTNDGRGDSVAYGLRSKSKKFRRSDAQYPDATEEDLTSHVESEELDGAPK 183
Query: 277 ITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTG 331
LN + + ++DS + S +D++VE + + SK +
Sbjct: 184 AIPVAQRLNVPSDWDSREKDS-------HETSQLDDQSVETHSREQSKEYKRKAN 231
Score = 31.2 bits (70), Expect = 2.7
Identities = 15/61 (24%), Positives = 31/61 (50%), Gaps = 2/61 (3%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEE--ESEEEEENEEDDDEERKAEKKNSKNEHVQ 229
DDL+ ++ + DDD+D+D+++ +S++ NE DD + + ++ H
Sbjct: 48 DDLKQETLPSKSNESHEHMDDDDDDDDDDHVDSQDSNSNESDDADHTDDSDHSDESHHSD 107
Query: 230 E 230
E
Sbjct: 108 E 108
Score = 30.1 bits (67), Expect = 7.0
Identities = 22/78 (28%), Positives = 31/78 (39%), Gaps = 24/78 (30%)
Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDD 184
S++S + D+DD DDD V SQD + DD D
Sbjct: 59 SNESHEHMDDDDDDDDDDH-------------VDSQDSN-----------SNESDDADHT 94
Query: 185 DDDDETDDDNDEDNEEES 202
DD D +D+ + D +ES
Sbjct: 95 DDSDHSDESHHSDESDES 112
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
kDa subunit. This family contains several DNA-directed
RNA polymerase 19 kDa polypeptides. The Poxvirus
DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
DNA-template-directed extension of the 3'-end of an RNA
strand by one nucleotide at a time.
Length = 167
Score = 38.5 bits (90), Expect = 0.005
Identities = 17/64 (26%), Positives = 37/64 (57%)
Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTA 233
+ED+ D D + DDD++++ +E+ +EE E E+ + ++ K S +E++++ T+
Sbjct: 1 MEDSDDIIDYESDDDDSEEYEEEEEDEEDAESLESSDVSSLKQSNYKIESASENIEDATS 60
Query: 234 KQKN 237
K
Sbjct: 61 NPKQ 64
Score = 31.2 bits (71), Expect = 1.8
Identities = 12/64 (18%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
DDD + +++++D++D E+ + +D + ++S + E+ ++ E+ + K +E +
Sbjct: 13 DDDDSEEYEEEEEDEEDAESLESSDVSSLKQSNYKIESASENIEDATSNPKQV-SEKISA 71
Query: 231 LTAK 234
+ +
Sbjct: 72 IKRR 75
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 40.8 bits (96), Expect = 0.006
Identities = 19/74 (25%), Positives = 31/74 (41%), Gaps = 8/74 (10%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
S+ + S + K SS+ R+ RK D ED+ D + D
Sbjct: 1323 KKKKSEKKTARKKKSK--------TRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVD 1374
Query: 184 DDDDDETDDDNDED 197
D +D++ +DD D+D
Sbjct: 1375 DSEDEDDEDDEDDD 1388
Score = 40.0 bits (94), Expect = 0.010
Identities = 29/211 (13%), Positives = 65/211 (30%), Gaps = 8/211 (3%)
Query: 102 GESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQD 161
+ ++K+ + + DD D + + D + ++ + SS
Sbjct: 1186 ADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDD----EEQKTKPKKSSVK 1241
Query: 162 GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
K +K EDN + DD + + + + D E K
Sbjct: 1242 RLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSK 1301
Query: 222 NSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSAL 281
S + + + A + + + + T V Q S +R
Sbjct: 1302 PSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRP----R 1357
Query: 282 AGLNSKEEEEEEQEDSAFSDESADDNSSDED 312
+ E+++ + S++ D++ D+D
Sbjct: 1358 KKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388
>gnl|CDD|220888 pfam10846, DUF2722, Protein of unknown function (DUF2722). This
eukaryotic family of proteins has no known function.
Length = 373
Score = 40.2 bits (94), Expect = 0.006
Identities = 20/99 (20%), Positives = 38/99 (38%)
Query: 138 SDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDED 197
S + S Q ++ E K DL + ++DDD D D+ E
Sbjct: 264 SQESMTSFQHIIQWKPESQQKKHRRTKSSSSFGVIDLNSISEASQVNEDDDPPDSDSKER 323
Query: 198 NEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK 236
EE+ + E DD++++ + +S++E +
Sbjct: 324 KNEENSDPESTPSDDNDDKTCSESSSRSESPNRTNTGRY 362
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 40.0 bits (93), Expect = 0.006
Identities = 34/113 (30%), Positives = 54/113 (47%), Gaps = 7/113 (6%)
Query: 117 GAMSGAISSDDSDDEND---EDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDD 173
+I++D SD+END E E +G+ K+ +E +D + +V +
Sbjct: 71 CGNVASIANDKSDNENDVELEGLNIIVKNEEERGTQKEEEED----EDVEEIEEVEEVEV 126
Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
+E+ DDD+D + DDE + D + D E + E E DDD E NS +E
Sbjct: 127 VEEEYDDDEDSEKDDEKESDAEGDENELAGEYIIEEVDDDVAILNENINSIDE 179
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 40.2 bits (94), Expect = 0.006
Identities = 18/91 (19%), Positives = 45/91 (49%), Gaps = 1/91 (1%)
Query: 140 DDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNE 199
DD + ++ K E+V S+ D ++ + + D + + + D E+ E
Sbjct: 340 DDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAE 399
Query: 200 EESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
+++EE+ +E++++++K E+ + E Q+
Sbjct: 400 AKAKEEKLKQEENEKKQK-EQADEDKEKRQK 429
>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
Length = 325
Score = 39.9 bits (94), Expect = 0.007
Identities = 9/47 (19%), Positives = 19/47 (40%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
DL ++ D D++ + + +E+ + EEE +R
Sbjct: 6 HDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVL 52
Score = 37.6 bits (88), Expect = 0.034
Identities = 10/48 (20%), Positives = 23/48 (47%), Gaps = 1/48 (2%)
Query: 174 LEDNMDDDDDDDDDDETDD-DNDEDNEEESEEEEENEEDDDEERKAEK 220
L+ + ++D D D++ + D EEE + + EE+ + ++
Sbjct: 3 LKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQR 50
Score = 33.8 bits (78), Expect = 0.56
Identities = 5/36 (13%), Positives = 17/36 (47%)
Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEE 205
FD++ + D+ +++ +D +E+ + +
Sbjct: 14 FDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQ 49
Score = 33.4 bits (77), Expect = 0.66
Identities = 8/39 (20%), Positives = 18/39 (46%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
D D ++N + D+ E + +++ EEE + +
Sbjct: 11 DADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQ 49
Score = 33.0 bits (76), Expect = 0.98
Identities = 10/46 (21%), Positives = 22/46 (47%)
Query: 178 MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS 223
+ D ++D D ++ + +E+ EEE ++ D EE + +
Sbjct: 3 LKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGAT 48
Score = 32.6 bits (75), Expect = 1.1
Identities = 10/49 (20%), Positives = 18/49 (36%)
Query: 178 MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
M D ++D D+N + +E EE D+D + +
Sbjct: 1 MTLKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQ 49
Score = 31.5 bits (72), Expect = 2.9
Identities = 10/45 (22%), Positives = 22/45 (48%)
Query: 177 NMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
D ++D + D++ E +E++ EEE ++D E + +
Sbjct: 1 MTLKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQ 45
Score = 31.5 bits (72), Expect = 3.0
Identities = 8/42 (19%), Positives = 16/42 (38%)
Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEED 211
D D D+ ++E D++ + E S+ + D
Sbjct: 12 ADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLD 53
>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2. This
domain family is found in eukaryotes, and is typically
between 138 and 153 amino acids in length. The family is
found in association with pfam00493. Mini-chromosome
maintenance (MCM) proteins are essential for DNA
replication. These proteins use ATPase activity to
perform this function.
Length = 145
Score = 38.1 bits (89), Expect = 0.008
Identities = 14/37 (37%), Positives = 24/37 (64%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEER 216
+D+ D++++ DD +D D E E EE+ E+ D+ ER
Sbjct: 1 EDEPDEEEELEDDIDDLDEEAEEEEDGEDLFGDNMER 37
Score = 37.7 bits (88), Expect = 0.009
Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 11/104 (10%)
Query: 128 SDDENDEDDTS--DDDTESGQGSMKKYK--EQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
++ E D D DD+++ + S + E ++ +D RR+ M L+D+ DDD D
Sbjct: 41 ANPELDRYDEDGLDDESDYEELSAAARRAAEARLNRRDRLLRRRRMPAAFLDDDDDDDGD 100
Query: 184 DDDDDET-------DDDNDEDNEEESEEEEENEEDDDEERKAEK 220
D + D+D D D+E+ EEE +D + KA
Sbjct: 101 FDLTAQPRRRRRQYDEDRDADDEDIDPLEEELSLEDLSDVKAAS 144
Score = 33.5 bits (77), Expect = 0.22
Identities = 16/41 (39%), Positives = 25/41 (60%), Gaps = 6/41 (14%)
Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEE---ENEEDD 212
ED D++++ +DD DD DE+ EEE + E+ +N E D
Sbjct: 1 EDEPDEEEELEDD---IDDLDEEAEEEEDGEDLFGDNMERD 38
Score = 29.2 bits (66), Expect = 7.4
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 25/85 (29%)
Query: 119 MSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNM 178
M A DD DD+ D D T+ +RR+ +D+D +
Sbjct: 86 MPAAFLDDDDDDDGDFDLTAQ-----------------------PRRRRRQYDEDRDA-- 120
Query: 179 DDDDDDDDDDETDDDNDEDNEEESE 203
DD+D D ++E ++ D + S
Sbjct: 121 DDEDIDPLEEELSLEDLSDVKAASI 145
Score = 28.8 bits (65), Expect = 8.2
Identities = 20/81 (24%), Positives = 34/81 (41%), Gaps = 15/81 (18%)
Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDD 184
D+ D+E + +D DD E + DG + +F D++E + + +
Sbjct: 1 EDEPDEEEELEDDIDDLDEEAEEEE-----------DG----EDLFGDNMERDYRANPEL 45
Query: 185 DDDDETDDDNDEDNEEESEEE 205
D DE D++ D EE S
Sbjct: 46 DRYDEDGLDDESDYEELSAAA 66
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 38.9 bits (91), Expect = 0.008
Identities = 23/71 (32%), Positives = 36/71 (50%), Gaps = 6/71 (8%)
Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTA------K 234
+ +D +D + DE++EEE+EEE+E D++E+ EK S E + +
Sbjct: 74 NSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPSWKPVGTE 133
Query: 235 QKNPKAFTFQS 245
Q P A TF S
Sbjct: 134 QTGPHAMTFDS 144
Score = 30.9 bits (70), Expect = 3.5
Identities = 10/73 (13%), Positives = 27/73 (36%)
Query: 168 VMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEH 227
+ D+ + D+ + E+ +EE +E +E+ +D+ ++ E
Sbjct: 35 FFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEE 94
Query: 228 VQELTAKQKNPKA 240
+E + +
Sbjct: 95 NEEEDEESSDENE 107
Score = 29.7 bits (67), Expect = 6.4
Identities = 14/70 (20%), Positives = 33/70 (47%)
Query: 168 VMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEH 227
+ F D D+ + + + + + +E E+E N ED +++ AEK++ ++E
Sbjct: 34 LFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEE 93
Query: 228 VQELTAKQKN 237
E ++ +
Sbjct: 94 ENEEEDEESS 103
>gnl|CDD|224292 COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General
function prediction only].
Length = 398
Score = 39.7 bits (93), Expect = 0.008
Identities = 19/62 (30%), Positives = 33/62 (53%), Gaps = 7/62 (11%)
Query: 408 ERKFRRKEDIQAKK------HHVPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCLIKNFT 461
K +ED KK +P++ + L+ P ++ ++GP QVGK+TL++ LIK
Sbjct: 4 FGKEIIEEDSDIKKWEIERRKLLPRLIK-KLDLRPFIILILGPRQVGKTTLLKLLIKGLL 62
Query: 462 KT 463
+
Sbjct: 63 EE 64
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 36.7 bits (84), Expect = 0.009
Identities = 17/35 (48%), Positives = 27/35 (77%), Gaps = 3/35 (8%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
++D D D DE +DD+DED E +++E+++EDDDE
Sbjct: 9 EEDSDSDSDEEEDDDDED---EEDDDEDDDEDDDE 40
Score = 36.7 bits (84), Expect = 0.009
Identities = 13/31 (41%), Positives = 25/31 (80%)
Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
E ++D+D D++EE ++++E+EEDDDE+ +
Sbjct: 7 EGEEDSDSDSDEEEDDDDEDEEDDDEDDDED 37
Score = 35.9 bits (82), Expect = 0.020
Identities = 13/31 (41%), Positives = 25/31 (80%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
+D D D D++E DDD DE++++E ++E+++E
Sbjct: 10 EDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40
Score = 34.4 bits (78), Expect = 0.071
Identities = 12/31 (38%), Positives = 27/31 (87%)
Query: 185 DDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
+ ++++D D+DE+ +++ E+EE+++EDDDE+
Sbjct: 7 EGEEDSDSDSDEEEDDDDEDEEDDDEDDDED 37
Score = 33.6 bits (76), Expect = 0.098
Identities = 13/30 (43%), Positives = 25/30 (83%)
Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEE 204
+ + D D+++DDDDE ++D+DED++E+ +E
Sbjct: 11 DSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40
Score = 33.2 bits (75), Expect = 0.17
Identities = 11/34 (32%), Positives = 24/34 (70%)
Query: 187 DDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
+ E D D+D D EE+ ++E+E ++D+D++ ++
Sbjct: 7 EGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40
Score = 32.8 bits (74), Expect = 0.22
Identities = 12/34 (35%), Positives = 26/34 (76%)
Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEE 207
LE D D D D++++ DD+++ED++E+ +E+++
Sbjct: 6 LEGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDD 39
Score = 31.3 bits (70), Expect = 0.69
Identities = 12/31 (38%), Positives = 26/31 (83%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEE 201
+D D+ +++DDDD+D+E DD++D+++++E
Sbjct: 10 EDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40
Score = 30.5 bits (68), Expect = 1.2
Identities = 9/34 (26%), Positives = 24/34 (70%)
Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
+ +ED++ +S+EEE+++++D+E+ + +E
Sbjct: 7 EGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 37.3 bits (87), Expect = 0.010
Identities = 15/53 (28%), Positives = 27/53 (50%)
Query: 164 KRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEER 216
K RK E+N D++D + +D E +D++ + E + EE DE++
Sbjct: 70 KERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDEDK 122
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 39.6 bits (92), Expect = 0.011
Identities = 29/118 (24%), Positives = 54/118 (45%), Gaps = 15/118 (12%)
Query: 104 STEKSMMEIDGIHGAMSGAISSDDSDDENDEDD------TSDDDTESGQGSMKKYKEQVV 157
+T+K+++EI S + + D D++ T + ++ E+ V
Sbjct: 80 ATKKTVVEISEPLEEGSELVV--NEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKV 137
Query: 158 SSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
RKRRKV D+ ++ + + D +E++ +N ESEEE + E+DD E+
Sbjct: 138 -----RKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLEN--ESEEELDLEKDDGED 188
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 38.9 bits (91), Expect = 0.011
Identities = 19/53 (35%), Positives = 29/53 (54%)
Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQ 235
++D+++ D+D+ E EE EEE E EE DD+E E K + + AK
Sbjct: 129 EEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQETKYKRGDTSLTPQAKD 181
Score = 35.8 bits (83), Expect = 0.11
Identities = 11/61 (18%), Positives = 26/61 (42%)
Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAK 234
ED ++ D+DD + ++ +E EE +++E +E + + ++ L
Sbjct: 130 EDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQETKYKRGDTSLTPQAKDVLESLIDA 189
Query: 235 Q 235
Sbjct: 190 A 190
Score = 33.9 bits (78), Expect = 0.44
Identities = 11/45 (24%), Positives = 30/45 (66%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS 223
++D++++ D++ + E+ EEE E EE ++++ +E K ++ ++
Sbjct: 129 EEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQETKYKRGDT 173
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 39.5 bits (93), Expect = 0.013
Identities = 20/101 (19%), Positives = 29/101 (28%), Gaps = 15/101 (14%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
D DD + D D +G ++ R+R + D D D
Sbjct: 196 RGRDGDDRDRRDRREQGDRREERGR----RDGGDRRGRRRRRDRRDARGDDNREDRGDRD 251
Query: 184 DDDDDET-----------DDDNDEDNEEESEEEEENEEDDD 213
DD + D + +E E E EDD
Sbjct: 252 GDDGEGRGGRRGRRFRDRDRRGRRGGDGGNEREPELREDDV 292
Score = 34.9 bits (81), Expect = 0.31
Identities = 18/97 (18%), Positives = 37/97 (38%), Gaps = 1/97 (1%)
Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDD 184
++ D+ D D E+ +G + +E+ D +R + D E+ D D
Sbjct: 166 EEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGD 225
Query: 185 DDDDETDDDNDEDNEEESEEEE-ENEEDDDEERKAEK 220
D + +++ E+ + + DD E R +
Sbjct: 226 RRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRR 262
Score = 33.0 bits (76), Expect = 1.2
Identities = 12/95 (12%), Positives = 35/95 (36%), Gaps = 2/95 (2%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
+++ D + + + D +G + + + + GR+ + DD + +
Sbjct: 153 ATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQG 212
Query: 184 D--DDDDETDDDNDEDNEEESEEEEENEEDDDEER 216
D ++ D + + + +D+ E+R
Sbjct: 213 DRREERGRRDGGDRRGRRRRRDRRDARGDDNREDR 247
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 39.0 bits (91), Expect = 0.013
Identities = 20/124 (16%), Positives = 49/124 (39%), Gaps = 17/124 (13%)
Query: 131 ENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKR-------RKVMFDDDLEDNMDDDDD 183
++ D + + +K + G+KR R V E ++ + D
Sbjct: 263 IGPIEEEVVDSQAANDEPRRVFK------KKGQKRTTRRVKMRPVRAKPSDEPSLPESDI 316
Query: 184 DDDDDETDDD--NDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAF 241
++ + D+ ++ +E++ +EDD+E ++ +K K + ++ NP +
Sbjct: 317 HEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKP--RKRKVNPVSN 374
Query: 242 TFQS 245
F+
Sbjct: 375 NFRR 378
Score = 35.9 bits (83), Expect = 0.13
Identities = 31/151 (20%), Positives = 59/151 (39%), Gaps = 26/151 (17%)
Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAK 234
E+++DD+ D + E ++ E EE +++ +DE R+ KK
Sbjct: 243 EESIDDELDVLREIEAEEAGIGPIE---EEVVDSQAANDEPRRVFKK------------- 286
Query: 235 QKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQ 294
K K T + V + + + +P+ + ++ + K S G +E++E
Sbjct: 287 -KGQKRTT-RRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDED 344
Query: 295 EDSAFSDESADDNSSDEDETVEPKTHKDSKN 325
ED D+ S +E E + K K
Sbjct: 345 ED--------DEESKEEVEKKQKVKKKPRKR 367
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 39.3 bits (92), Expect = 0.014
Identities = 18/82 (21%), Positives = 51/82 (62%), Gaps = 8/82 (9%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS-KNEHVQ 229
++ED + DD++++DE +++ ++++E S+E ++EE ++++ +++ ++S N
Sbjct: 378 ALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGNSS-- 435
Query: 230 ELTAKQKNPKAFTFQSVIKGER 251
L KN +++ V++G++
Sbjct: 436 -LAVGYKNDRSY----VVRGDK 452
Score = 32.4 bits (74), Expect = 1.7
Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 1/69 (1%)
Query: 187 DDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSV 246
E +D N E ++EE E+EEE EE++DE+ K++S +E +E + K + S+
Sbjct: 378 ALEIEDANTERDDEEEEDEEE-EEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGNSSL 436
Query: 247 IKGERKFRS 255
G + RS
Sbjct: 437 AVGYKNDRS 445
Score = 30.1 bits (68), Expect = 9.3
Identities = 14/29 (48%), Positives = 18/29 (62%)
Query: 286 SKEEEEEEQEDSAFSDESADDNSSDEDET 314
+EEEEEE ED S E +DD +ED+
Sbjct: 395 DEEEEEEEDEDEGPSKEHSDDEEFEEDDV 423
>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
Length = 852
Score = 39.0 bits (91), Expect = 0.017
Identities = 17/64 (26%), Positives = 27/64 (42%), Gaps = 2/64 (3%)
Query: 158 SSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERK 217
SS +G D E D +DD +D D+D+D+ + E EED + R+
Sbjct: 11 SSSNGSGGLVGT--DSGESRWVDGSEDDSEDPASLDDDDDDRDSGGGMLEEEEDGNMRRR 68
Query: 218 AEKK 221
+
Sbjct: 69 LIRT 72
>gnl|CDD|213833 TIGR03596, GTPase_YlqF, ribosome biogenesis GTP-binding protein
YlqF. Members of this protein family are GTP-binding
proteins involved in ribosome biogenesis, including the
essential YlqF protein of Bacillus subtilis, which is an
essential protein. They are related to Era, EngA, and
other GTPases of ribosome biogenesis, but are circularly
permuted. This family is not universal, and is not
present in Escherichia coli, and so is not as well
studied as some other GTPases. This model is built for
bacterial members [Protein synthesis, Other].
Length = 276
Score = 38.3 bits (90), Expect = 0.018
Identities = 16/33 (48%), Positives = 17/33 (51%), Gaps = 1/33 (3%)
Query: 432 LEPPPIVVAVVGPPQVGKSTLIRCLI-KNFTKT 463
L PI +VG P VGKSTLI L K K
Sbjct: 114 LLNRPIRAMIVGIPNVGKSTLINRLAGKKVAKV 146
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 38.9 bits (90), Expect = 0.020
Identities = 27/119 (22%), Positives = 50/119 (42%), Gaps = 16/119 (13%)
Query: 132 NDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD-------- 183
ND DD+ + D +GS+ E + D ++ +D E N ++D
Sbjct: 99 NDSDDSVNSDKLENEGSVSSIDENELVDLD-----TLLDNDQPEKNESGNNDHATDKENL 153
Query: 184 -DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELT--AKQKNPK 239
+ D ++D E+++ ESE E + + DDE ++ N +N V A ++
Sbjct: 154 LESDASSSNDSESEESDSESEIESSDSDHDDENSDSKLDNLRNYIVSLKKDEADAESVL 212
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 38.1 bits (88), Expect = 0.022
Identities = 16/87 (18%), Positives = 40/87 (45%), Gaps = 13/87 (14%)
Query: 164 KRRKVMFDDDLEDNMDDDD-------------DDDDDDETDDDNDEDNEEESEEEEENEE 210
K+ K +++ E+ M ++D + D ++ ++ + + E + + +
Sbjct: 258 KKEKQGAEEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPD 317
Query: 211 DDDEERKAEKKNSKNEHVQELTAKQKN 237
+ EE++ + +N +NE EL A + N
Sbjct: 318 EIGEEKEEDDENEENERHTELLADELN 344
Score = 34.7 bits (79), Expect = 0.30
Identities = 21/116 (18%), Positives = 49/116 (42%), Gaps = 6/116 (5%)
Query: 113 DGIHGAMSGAISSDDSDDEND-EDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFD 171
+ A+ + E + E+ S++D + G E S+ +++++ +
Sbjct: 247 EPGEPLSRPALKKEKQGAEEEGEEGMSEEDLDVGAA----EIENKEVSEGDKEQQQEEVE 302
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEE-ESEEEEENEEDDDEERKAEKKNSKNE 226
+ + D D+ + + D++NEE E E +E ++ E+ E+K + E
Sbjct: 303 NAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKGIEEKRRQME 358
>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22. The
mitochondrial protein translocase family, which is
responsible for movement of nuclear encoded pre-proteins
into mitochondria, is very complex with at least 19
components. These proteins include several chaperone
proteins, four proteins of the outer membrane
translocase (Tom) import receptor, five proteins of the
Tom channel complex, five proteins of the inner membrane
translocase (Tim) and three "motor" proteins. This
family represents the Tom22 proteins. The N terminal
region of Tom22 has been shown to have chaperone-like
activity, and the C terminal region faces the
intermembrane face.
Length = 136
Score = 36.1 bits (84), Expect = 0.024
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEES 202
DDDD+DD D ++D +D D E E+
Sbjct: 28 DDDDEDDTDTDSDISDDSDFENET 51
Score = 33.8 bits (78), Expect = 0.18
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEE 205
++ DDDD+D+TD D+D ++ + E E
Sbjct: 24 QEESDDDDEDDTDTDSDISDDSDFENE 50
Score = 31.8 bits (73), Expect = 0.85
Identities = 12/27 (44%), Positives = 17/27 (62%)
Query: 173 DLEDNMDDDDDDDDDDETDDDNDEDNE 199
E + DD+DD D D + DD+D +NE
Sbjct: 24 QEESDDDDEDDTDTDSDISDDSDFENE 50
Score = 31.1 bits (71), Expect = 1.4
Identities = 12/27 (44%), Positives = 17/27 (62%)
Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEE 207
++ DDDDE D D D D ++S+ E E
Sbjct: 24 QEESDDDDEDDTDTDSDISDDSDFENE 50
Score = 30.3 bits (69), Expect = 2.4
Identities = 10/34 (29%), Positives = 14/34 (41%)
Query: 111 EIDGIHGAMSGAISSDDSDDENDEDDTSDDDTES 144
+ + A D DDE+D D SD +S
Sbjct: 12 FQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDS 45
Score = 29.1 bits (66), Expect = 7.0
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 183 DDDDDDETDDDNDEDNEEESEEEEENE 209
++ DD+ +DD D D++ + + ENE
Sbjct: 24 QEESDDDDEDDTDTDSDISDDSDFENE 50
Score = 29.1 bits (66), Expect = 7.4
Identities = 4/25 (16%), Positives = 17/25 (68%)
Query: 191 DDDNDEDNEEESEEEEENEEDDDEE 215
+ D+D++++ +++ + ++ D + E
Sbjct: 26 ESDDDDEDDTDTDSDISDDSDFENE 50
>gnl|CDD|223052 PHA03346, PHA03346, US22 family homolog; Provisional.
Length = 520
Score = 38.2 bits (89), Expect = 0.025
Identities = 17/66 (25%), Positives = 31/66 (46%)
Query: 161 DGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
DG + DDDD+ D+ DDD+ ED + +E++ + D +E + E+
Sbjct: 406 DGGGGLAPPPEPPRPPPPPFPDDDDEGDDEDDDDWEDLGFDLDEDDVYDLKDVDEWRLER 465
Query: 221 KNSKNE 226
+ + E
Sbjct: 466 RIATAE 471
Score = 35.9 bits (83), Expect = 0.14
Identities = 20/92 (21%), Positives = 28/92 (30%), Gaps = 22/92 (23%)
Query: 129 DDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDD--- 185
+ DE D + R D +D DD+DDDD
Sbjct: 399 PENEDEADGGGGL----------------APPPEPPRPPPPPFPDDDDEGDDEDDDDWED 442
Query: 186 ---DDDETDDDNDEDNEEESEEEEENEEDDDE 214
D DE D + +D +E E + E
Sbjct: 443 LGFDLDEDDVYDLKDVDEWRLERRIATAERWE 474
Score = 32.4 bits (74), Expect = 1.4
Identities = 16/85 (18%), Positives = 33/85 (38%), Gaps = 14/85 (16%)
Query: 156 VVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDE-----------TDDDNDEDNEEESEE 204
V + +RR+ +N D+ D DD+DE ++E+ ++
Sbjct: 383 RVLRGEFAERRR---PRGPPENEDEADGGGGLAPPPEPPRPPPPPFPDDDDEGDDEDDDD 439
Query: 205 EEENEEDDDEERKAEKKNSKNEHVQ 229
E+ D DE+ + K+ ++
Sbjct: 440 WEDLGFDLDEDDVYDLKDVDEWRLE 464
>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2). All proteins
in this family for which functions are known are flap
endonucleases that generate the 3' incision next to DNA
damage as part of nucleotide excision repair. This
family is related to many other flap endonuclease
families including the fen1 family. This family is based
on the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1034
Score = 38.3 bits (89), Expect = 0.026
Identities = 31/134 (23%), Positives = 56/134 (41%), Gaps = 11/134 (8%)
Query: 102 GESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDD----TSDDDTESGQGSMKKYKE--- 154
GES + ++ M +SD E D T + S + +E
Sbjct: 630 GESADLLLISNPMEVEPMESEKEESESDGSFIEVDSVSSTLELQVPSKSQPTDESEENAE 689
Query: 155 QVVSSQDGRKRRKV---MFDDDLEDNMDDDDDDDDDDETDDDNDED-NEEESEEEEENEE 210
V+S +G R+++ +FD+ EDN+ +++ D + + +D + EE E E N
Sbjct: 690 NKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNEWQDISLEELEALEANLL 749
Query: 211 DDDEERKAEKKNSK 224
+ KA+K+ K
Sbjct: 750 AEQNSLKAQKQQQK 763
>gnl|CDD|216791 pfam01926, MMR_HSR1, 50S ribosome-binding GTPase. The full-length
GTPase protein is required for the complete activity of
the protein of interacting with the 50S ribosome and
binding of both adenine and guanine nucleotides, with a
preference for guanine nucleotide.
Length = 117
Score = 35.3 bits (82), Expect = 0.031
Identities = 13/20 (65%), Positives = 14/20 (70%)
Query: 439 VAVVGPPQVGKSTLIRCLIK 458
VA+VG P VGKSTLI L
Sbjct: 2 VALVGRPNVGKSTLINALTG 21
>gnl|CDD|221753 pfam12753, Nro1, Nuclear pore complex subunit Nro1. In fission
yeast, this protein is a positive regulator of the
stability of Sre1N, the sterol regulatory
element-binding protein which is an ER membrane-bound
transcription factor that controls adaptation to low
oxygen-growth. In addition, the fission yeast Nro1 is a
direct inhibitor of a protein that inhibits SreN1
degradation, Ofd1 (an oxoglutamate deoxygenase). The
outcome of this reactivity is that Ofd1 acts as an
oxygen sensor that regulates the binding of Nro1 to Ofd1
to control the stability of Sre1N. Solution of the
structure of Nro1 reveals it to be made up of a number
of TPR coils.
Length = 401
Score = 37.9 bits (88), Expect = 0.032
Identities = 13/37 (35%), Positives = 25/37 (67%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEEN 208
DDL D +++ +++ DE D ++E+ EEE E +E++
Sbjct: 209 DDLLDIVENFGKENEIDEGLDSDEEEEEEEVELDEKH 245
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 37.4 bits (87), Expect = 0.033
Identities = 13/35 (37%), Positives = 24/35 (68%)
Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
+D++ +DD D+D +++ E +E E EE++DEE
Sbjct: 36 EDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEE 70
Score = 35.1 bits (81), Expect = 0.17
Identities = 14/36 (38%), Positives = 25/36 (69%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEE 206
D++ ED++ D+D++D+ +D+N+ EEE EE E
Sbjct: 37 DEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72
Score = 34.7 bits (80), Expect = 0.23
Identities = 15/44 (34%), Positives = 28/44 (63%)
Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
+D +D++ +DD + D++++ EE+ E E EED++ E KA
Sbjct: 32 QDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEVKA 75
Score = 34.4 bits (79), Expect = 0.32
Identities = 14/44 (31%), Positives = 24/44 (54%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
++ +D++ +DD +D+ DE EE E E EE+D+E
Sbjct: 27 PFASAQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEE 70
Score = 33.2 bits (76), Expect = 0.64
Identities = 18/51 (35%), Positives = 29/51 (56%)
Query: 162 GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
G + DL ++ + +DD D+DE D+ E++E E EEEE+EE +
Sbjct: 22 GNGKGPFASAQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72
Score = 32.4 bits (74), Expect = 1.1
Identities = 17/63 (26%), Positives = 30/63 (47%), Gaps = 6/63 (9%)
Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVI 247
D T+D+ ED+ + +EE+E ++DE E++ E + +P A T +
Sbjct: 33 DLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEE------DEEGEVKASPDADTTILFV 86
Query: 248 KGE 250
K E
Sbjct: 87 KPE 89
>gnl|CDD|178458 PLN02867, PLN02867, Probable galacturonosyltransferase.
Length = 535
Score = 38.0 bits (88), Expect = 0.034
Identities = 18/55 (32%), Positives = 32/55 (58%), Gaps = 3/55 (5%)
Query: 206 EENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQS---VIKGERKFRSTK 257
E E+DD S N+ V+E+T+ +++ KAF F++ ++K ERK +S +
Sbjct: 104 EAKEQDDGGRGTKGSTESFNDLVKEMTSNRQDIKAFAFRTKAMLLKMERKVQSAR 158
>gnl|CDD|179525 PRK03003, PRK03003, GTP-binding protein Der; Reviewed.
Length = 472
Score = 37.6 bits (88), Expect = 0.035
Identities = 15/26 (57%), Positives = 17/26 (65%), Gaps = 1/26 (3%)
Query: 428 DRTPLEPPPIVVAVVGPPQVGKSTLI 453
+ P P VVAVVG P VGKSTL+
Sbjct: 31 EAAEGGPLP-VVAVVGRPNVGKSTLV 55
>gnl|CDD|227199 COG4862, MecA, Negative regulator of genetic competence,
sporulation and motility [Posttranslational
modification, protein turnover, chaperones / Signal
transduction mechanisms / Cell motility and secretion].
Length = 224
Score = 37.0 bits (86), Expect = 0.035
Identities = 14/53 (26%), Positives = 29/53 (54%)
Query: 176 DNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHV 228
+ DD D+ D D+D E+ E+ EE+ ++ E +++ +AEK+ ++
Sbjct: 81 EGEGPDDLDEFLDMPDEDGAEEGLEQLEEKFDDVEKEEQGDEAEKEPDYLYYI 133
Score = 31.6 bits (72), Expect = 1.7
Identities = 12/42 (28%), Positives = 23/42 (54%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
D D +M D+D ++ E ++ +D E+E + +E +E D
Sbjct: 87 DLDEFLDMPDEDGAEEGLEQLEEKFDDVEKEEQGDEAEKEPD 128
Score = 31.2 bits (71), Expect = 2.2
Identities = 17/53 (32%), Positives = 24/53 (45%), Gaps = 2/53 (3%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELT 232
+ + DD DE D DED EE E+ EE D+ K E+ + + L
Sbjct: 81 EGEGPDDLDEFLDMPDEDGAEEGLEQ--LEEKFDDVEKEEQGDEAEKEPDYLY 131
>gnl|CDD|206647 cd00881, GTP_translation_factor, GTP translation factor family
primarily contains translation initiation, elongation and
release factors. The GTP translation factor family
consists primarily of translation initiation, elongation,
and release factors, which play specific roles in protein
translation. In addition, the family includes Snu114p, a
component of the U5 small nuclear riboprotein particle
which is a component of the spliceosome and is involved
in excision of introns, TetM, a tetracycline resistance
gene that protects the ribosome from tetracycline
binding, and the unusual subfamily CysN/ATPS, which has
an unrelated function (ATP sulfurylase) acquired through
lateral transfer of the EF1-alpha gene and development of
a new function.
Length = 183
Score = 36.5 bits (85), Expect = 0.036
Identities = 9/67 (13%), Positives = 25/67 (37%), Gaps = 7/67 (10%)
Query: 982 KDIFRTLR--MPKVMGVLTHLDMLKNNK---TLKTTKKMLKHRFWTEV-YAGAKLFYLSG 1035
++ ++ + +D + L+ K++LK +T + + +S
Sbjct: 104 REHLNIALAGGLPIIVAVNKIDRVGEEDFDEVLREIKELLKLIGFTFLKGKDVPIIPISA 163
Query: 1036 IVHGEYM 1042
+ GE +
Sbjct: 164 -LTGEGI 169
Score = 36.5 bits (85), Expect = 0.039
Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 3/48 (6%)
Query: 55 ITFIECNNDINSMIDISK---VADLVLLLIDASFGFEMEIFEFLNICQ 99
I FI+ + + + AD LL++DA+ G E + E LNI
Sbjct: 64 INFIDTPGHEDFSKETVRGLAQADGALLVVDANEGVEPQTREHLNIAL 111
>gnl|CDD|234628 PRK00093, PRK00093, GTP-binding protein Der; Reviewed.
Length = 435
Score = 37.7 bits (89), Expect = 0.037
Identities = 14/36 (38%), Positives = 22/36 (61%)
Query: 423 HVPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCLIK 458
+P+ + E PI +A++G P VGKS+LI L+
Sbjct: 160 ELPEEEEEDEEDEPIKIAIIGRPNVGKSSLINALLG 195
Score = 31.9 bits (74), Expect = 2.4
Identities = 12/15 (80%), Positives = 13/15 (86%)
Query: 438 VVAVVGPPQVGKSTL 452
VVA+VG P VGKSTL
Sbjct: 3 VVAIVGRPNVGKSTL 17
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 37.3 bits (86), Expect = 0.039
Identities = 18/96 (18%), Positives = 39/96 (40%), Gaps = 11/96 (11%)
Query: 155 QVVSSQDGRKRRKVMFDDDLEDN-----MDDDDDDDDDDETDD------DNDEDNEEESE 203
+ + ++ R+ K + ++ E++ DD + + + D + ESE
Sbjct: 188 KAATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESE 247
Query: 204 EEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPK 239
EEE +E + DE+ + K + + K + P
Sbjct: 248 EEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPH 283
Score = 33.9 bits (77), Expect = 0.45
Identities = 19/97 (19%), Positives = 43/97 (44%), Gaps = 9/97 (9%)
Query: 133 DEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDD 192
DE + ++ E + E V + K +K DLE + D + E+++
Sbjct: 198 DEKERYVEEEEESDTEL----EAVTDDSEKEKTKK----KDLEKWLGSDQSMETS-ESEE 248
Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQ 229
+ ++E + +E+E+N+ + + + K S+ H+
Sbjct: 249 EESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIH 285
Score = 33.9 bits (77), Expect = 0.47
Identities = 15/121 (12%), Positives = 48/121 (39%), Gaps = 8/121 (6%)
Query: 115 IHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDL 174
+ E +E+ ++ + + +K K++ + K + D
Sbjct: 189 AATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDL--------EKWLGSDQS 240
Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAK 234
+ + ++++ + E+D+D DEDN+ + + + ++ + + + +++ A
Sbjct: 241 METSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQERENEKIPAV 300
Query: 235 Q 235
Q
Sbjct: 301 Q 301
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 37.2 bits (86), Expect = 0.040
Identities = 32/115 (27%), Positives = 57/115 (49%), Gaps = 16/115 (13%)
Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEE--------RKAEKKNSKNEHVQELTAKQKNPKA 240
E +D++ E+ EEESEEEEE + +DD E RK ++ +QE + KA
Sbjct: 8 EEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRIT-----IQEREREAAKEKA 62
Query: 241 FTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQE 295
++ K E + R T V + + K+L ++ + T A ++ + ++E +E
Sbjct: 63 LEEEAKRKAEERKRETLKIVEEEVK---KELELKKRNTLLEANIDDVDTDDENEE 114
>gnl|CDD|219912 pfam08574, DUF1762, Protein of unknown function (DUF1762). This is
a family of proteins of unknown function. Yeast IWR1 is
known to interact with RNA polymerase II and deletion of
this protein results in hypersensitivity to the K1
killer toxin.
Length = 77
Score = 34.3 bits (79), Expect = 0.040
Identities = 9/34 (26%), Positives = 23/34 (67%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
D+D++ +D +D++D+ ++ S++E+ N E+
Sbjct: 35 IDEDEEYHEDLANDEDDDADQVLSDDEDSNAENY 68
Score = 33.9 bits (78), Expect = 0.057
Identities = 14/44 (31%), Positives = 21/44 (47%), Gaps = 2/44 (4%)
Query: 157 VSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDD--NDEDN 198
S G R ++ ED +D+DDD D +DD+ N E+
Sbjct: 25 HRSDIGYIRIIDEDEEYHEDLANDEDDDADQVLSDDEDSNAENY 68
Score = 27.8 bits (62), Expect = 8.7
Identities = 11/50 (22%), Positives = 24/50 (48%), Gaps = 5/50 (10%)
Query: 171 DDDLEDNMDDDDD-----DDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
+D E D D+D+E +D D ++++++ ++ED + E
Sbjct: 17 PEDEEAANHRSDIGYIRIIDEDEEYHEDLANDEDDDADQVLSDDEDSNAE 66
>gnl|CDD|235052 PRK02597, rpoC2, DNA-directed RNA polymerase subunit beta';
Provisional.
Length = 1331
Score = 38.1 bits (89), Expect = 0.041
Identities = 16/84 (19%), Positives = 24/84 (28%), Gaps = 3/84 (3%)
Query: 130 DENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDE 189
+N D + D S + R R + D + D DD+
Sbjct: 1251 MQNLRPDYTVDMPARPYAS-SLDDPSDEDLEATRSRHGI--DPSSSNFAAFARPDADDEL 1307
Query: 190 TDDDNDEDNEEESEEEEENEEDDD 213
DD D +EE D+
Sbjct: 1308 LIDDQLPDPAALEGLQEEGLLSDE 1331
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 36.6 bits (85), Expect = 0.043
Identities = 30/119 (25%), Positives = 47/119 (39%), Gaps = 23/119 (19%)
Query: 100 VHGESTEKSMME-IDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVS 158
+HG+ E M I +S I+S +SD EN D+ D +ES G ++ K+
Sbjct: 83 IHGDVKESDPMSFITNPSLTVSVPIASGESDLENLNDE--DTSSESSYGFKEESKK---- 136
Query: 159 SQDGRKRRKVMFDDDLEDNMDDDDDDDDDDE--TDDDNDEDNEEESEEEEENEEDDDEE 215
+ + ++ D E T D + EEE EEE E + E+
Sbjct: 137 --------------GSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEEEEVKEPED 181
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression in
addition to DNA-based checkpoints.
Length = 76
Score = 34.1 bits (79), Expect = 0.047
Identities = 11/32 (34%), Positives = 23/32 (71%)
Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
D D + +E ++ D ++E+E +EEE++++D D
Sbjct: 43 DSDAEWEEEEEGEDLESEDEEDEEEDDDDDMD 74
Score = 33.7 bits (78), Expect = 0.048
Identities = 13/60 (21%), Positives = 29/60 (48%), Gaps = 5/60 (8%)
Query: 153 KEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
V+ ++ + D D D D + ++++E +D ED E+E E++++ + D
Sbjct: 21 PSSVILPRNPLSQDLPGLDYDY----DSDAEWEEEEEGEDLESEDEEDEEEDDDD-DMDG 75
Score = 33.3 bits (77), Expect = 0.074
Identities = 18/42 (42%), Positives = 28/42 (66%), Gaps = 3/42 (7%)
Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEE-ESEEEEENEEDDDEE 215
+D D D D D E +++ E+ E+ ESE+EE+ EEDDD++
Sbjct: 33 QDLPGLDYDYDSDAEWEEE--EEGEDLESEDEEDEEEDDDDD 72
Score = 29.9 bits (68), Expect = 1.3
Identities = 14/45 (31%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
DL +D D D D + E +++ ++ E+ E+EEE+++DD +
Sbjct: 31 LSQDLPG-LDYDYDSDAEWEEEEEGEDLESEDEEDEEEDDDDDMD 74
>gnl|CDD|221955 pfam13173, AAA_14, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily.
Length = 127
Score = 35.2 bits (82), Expect = 0.051
Identities = 9/23 (39%), Positives = 15/23 (65%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNF 460
++ + GP QVGK+TL+ +K
Sbjct: 4 IIVITGPRQVGKTTLLLQFLKEL 26
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 37.4 bits (87), Expect = 0.053
Identities = 51/237 (21%), Positives = 90/237 (37%), Gaps = 28/237 (11%)
Query: 122 AISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
I++ DD++ E SD S KK K++ + RK D+
Sbjct: 248 KINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKD-------------LDE 294
Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK----NEHVQELTAKQ-- 235
D+ + + E +D + ++ EEE ED ++RK E+++ ++ +Q AKQ
Sbjct: 295 DELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDDDFVEDDDDLQASLAKQRR 354
Query: 236 ---KNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSA-LAGLNSKEEEE 291
K K + + + + RS + + D I TS + L + EE
Sbjct: 355 LAQKKRKKLRPEDIARQIAEERSED----EEEAAENNDNGIVIDETSEFVRSLQKEPLEE 410
Query: 292 EEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAASAFLE 348
+ + +E +D DEDE E D + D + + + S LE
Sbjct: 411 KPENKDESVEEISDAEEDDEDEEDEDG-DGDVEMSAVDNDEEKEEEDKEAIPSTILE 466
Score = 33.2 bits (76), Expect = 0.92
Identities = 36/189 (19%), Positives = 72/189 (38%), Gaps = 18/189 (9%)
Query: 133 DEDDTSDDDTESGQGSMKKYKEQVVSSQDG--------RKRRKVMFDDDL-----EDNMD 179
EDD DDDT++ KK +++ + + ++R +DL ++
Sbjct: 82 GEDDDDDDDTKAWLKKSKKRQKKKEAERKKALLLDEKEKERAAEYTSEDLAGLKVGHKVE 141
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPK 239
+ ++ +D T D +E+ +E EN E ++E+ + K +
Sbjct: 142 EFEEGEDVILTLKDTGVLEDEDEGDELENVELVEKEKDKKNLELKKKKPDYDPDDDDKFN 201
Query: 240 AFTFQSV----IKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLN-SKEEEEEEQ 294
+ S I+G++K T+ D+ R ++ L N S +++ E
Sbjct: 202 KRSILSKYDEEIEGKKKKSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTET 261
Query: 295 EDSAFSDES 303
S + D S
Sbjct: 262 PASDYYDVS 270
>gnl|CDD|206683 cd01896, DRG, Developmentally Regulated GTP-binding protein (DRG).
The developmentally regulated GTP-binding protein (DRG)
subfamily is an uncharacterized member of the Obg
family, an evolutionary branch of GTPase superfamily
proteins. GTPases act as molecular switches regulating
diverse cellular processes. DRG2 and DRG1 comprise the
DRG subfamily in eukaryotes. In view of their widespread
expression in various tissues and high conservation
among distantly related species in eukaryotes and
archaea, DRG proteins may regulate fundamental cellular
processes. It is proposed that the DRG subfamily
proteins play their physiological roles through RNA
binding.
Length = 233
Score = 36.4 bits (85), Expect = 0.063
Identities = 12/18 (66%), Positives = 14/18 (77%)
Query: 439 VAVVGPPQVGKSTLIRCL 456
VA+VG P VGKSTL+ L
Sbjct: 3 VALVGFPSVGKSTLLSKL 20
>gnl|CDD|236570 PRK09563, rbgA, GTPase YlqF; Reviewed.
Length = 287
Score = 36.3 bits (85), Expect = 0.067
Identities = 12/28 (42%), Positives = 15/28 (53%)
Query: 432 LEPPPIVVAVVGPPQVGKSTLIRCLIKN 459
+ P I ++G P VGKSTLI L
Sbjct: 117 MRPRAIRAMIIGIPNVGKSTLINRLAGK 144
>gnl|CDD|234274 TIGR03594, GTPase_EngA, ribosome-associated GTPase EngA. EngA
(YfgK, Der) is a ribosome-associated essential GTPase
with a duplication of its GTP-binding domain. It is
broadly to universally distributed among bacteria. It
appears to function in ribosome biogenesis or stability
[Protein synthesis, Other].
Length = 429
Score = 36.7 bits (86), Expect = 0.074
Identities = 14/37 (37%), Positives = 22/37 (59%), Gaps = 1/37 (2%)
Query: 423 HVPQVDRT-PLEPPPIVVAVVGPPQVGKSTLIRCLIK 458
+P+ + E PI +A++G P VGKSTL+ L+
Sbjct: 158 LLPEEEEEEEEEDGPIKIAIIGRPNVGKSTLVNALLG 194
Score = 33.2 bits (77), Expect = 0.98
Identities = 13/21 (61%), Positives = 14/21 (66%)
Query: 438 VVAVVGPPQVGKSTLIRCLIK 458
VVA+VG P VGKSTL L
Sbjct: 1 VVAIVGRPNVGKSTLFNRLTG 21
>gnl|CDD|224009 COG1084, COG1084, Predicted GTPase [General function prediction
only].
Length = 346
Score = 36.1 bits (84), Expect = 0.099
Identities = 19/55 (34%), Positives = 27/55 (49%), Gaps = 6/55 (10%)
Query: 403 SVIKG-ERKFRRKEDIQAKKHHVPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCL 456
S+IK + + +P +D + P IVVA G P VGKS+L+R L
Sbjct: 139 SIIKKIDDDLEFLRKARDHLKKLPAID---PDLPTIVVA--GYPNVGKSSLVRKL 188
>gnl|CDD|221620 pfam12527, DUF3727, Protein of unknown function (DUF3727). This
domain family is found in bacteria and eukaryotes, and
is approximately 100 amino acids in length.
Length = 100
Score = 33.7 bits (78), Expect = 0.10
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
+ ++D+ + E +DD+DED EEE E +D E
Sbjct: 20 EPEEDEILELEEEDDDDEDEEEEYELLASFYHEDQE 55
Score = 32.6 bits (75), Expect = 0.26
Identities = 8/27 (29%), Positives = 17/27 (62%)
Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEE 207
+ ++D+ ++ D+D+E+E EE E
Sbjct: 19 PEPEEDEILELEEEDDDDEDEEEEYEL 45
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 36.3 bits (84), Expect = 0.10
Identities = 23/125 (18%), Positives = 48/125 (38%), Gaps = 25/125 (20%)
Query: 108 SMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRK 167
+E I +++G +D S +D + +DTE+ S
Sbjct: 284 DRLEKRPICPSLAGFQLTDPSGSLDDPFLKAPEDTENEDESDDN---------------- 327
Query: 168 VMFDDDLEDNMD------DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
DD +DN D + + +DD +D D + ++EE E +E + + +++
Sbjct: 328 ---ADDFDDNDKKGDVGFDINAEVEDDPDEDAEDGGDGGDAEEFEAEKEPMLKVCRVDRQ 384
Query: 222 NSKNE 226
++
Sbjct: 385 VTEII 389
Score = 33.2 bits (76), Expect = 0.92
Identities = 26/132 (19%), Positives = 47/132 (35%), Gaps = 25/132 (18%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQ---------------VVSSQDGRKRRKV 168
SS+D ++ + EDD DD E +K + ++ G
Sbjct: 252 SSEDPEELSSEDDEFPDDGEIDLTPLKFFPLLDRLEKRPICPSLAGFQLTDPSGSLDDPF 311
Query: 169 M------FDDDLEDNMDDDDDDDDDDET----DDDNDEDNEEESEEEEENEEDDDEERKA 218
+ ++D D+ DD DD+D + ED+ +E E+ + D +E
Sbjct: 312 LKAPEDTENEDESDDNADDFDDNDKKGDVGFDINAEVEDDPDEDAEDGGDGGDAEEFEAE 371
Query: 219 EKKNSKNEHVQE 230
++ K V
Sbjct: 372 KEPMLKVCRVDR 383
Score = 30.9 bits (70), Expect = 4.5
Identities = 12/42 (28%), Positives = 25/42 (59%)
Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
D D+ + + + D+ E+ +EE + E+D ++KA+KK +
Sbjct: 151 DSDNKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQR 192
Score = 30.9 bits (70), Expect = 5.4
Identities = 13/42 (30%), Positives = 16/42 (38%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
D + D DDDDD DD E + S+ E D
Sbjct: 548 DPNFCPVGQGADGDDDDDGPFDDAAEMLSDPSDGEPGASASD 589
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 36.1 bits (83), Expect = 0.11
Identities = 11/40 (27%), Positives = 24/40 (60%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
DD+ +D + ++D D+D+ E++++ N D+D + A
Sbjct: 275 DDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDNGA 314
Score = 35.3 bits (81), Expect = 0.15
Identities = 11/39 (28%), Positives = 19/39 (48%)
Query: 186 DDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
DDDE ND D EE+ ++++ + DD ++
Sbjct: 274 DDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDN 312
Score = 32.6 bits (74), Expect = 1.1
Identities = 11/34 (32%), Positives = 18/34 (52%), Gaps = 2/34 (5%)
Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
D+DE + +EEE+ +DDD E + + N
Sbjct: 274 DDDEGSSSNDYDEEEDGDDDDNEDNDD--TNTNH 305
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 34.6 bits (80), Expect = 0.11
Identities = 16/48 (33%), Positives = 23/48 (47%)
Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK 236
+ + +E+ EE EE + EE DE + E K E +E KQK
Sbjct: 97 DKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQK 144
Score = 32.3 bits (74), Expect = 0.75
Identities = 8/53 (15%), Positives = 28/53 (52%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
L + + +++++++ ++ DE+ + + E+E + E+R+ ++ K
Sbjct: 92 KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQK 144
Score = 30.3 bits (69), Expect = 2.8
Identities = 10/57 (17%), Positives = 29/57 (50%), Gaps = 3/57 (5%)
Query: 174 LEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
L+ +++++++ + + D +E +E E+E E++ E + + E ++E
Sbjct: 96 LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKL---KREKRRENERKQKEILKE 149
>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
terminus. The C terminus of the plasma membrane Nha1
antiporter plays an important role in the immediate cell
response to hypo-osmotic shock which prevents an
execessive loss of ions and water. This domain is found
with pfam00999.
Length = 430
Score = 35.9 bits (83), Expect = 0.11
Identities = 27/116 (23%), Positives = 40/116 (34%), Gaps = 11/116 (9%)
Query: 111 EIDGIHGAMSG-AISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVM 169
E+D A + D D ND +D +E + + + RK R+
Sbjct: 315 EVDLETNAPERVKAPARDEVDTNDSSPLTDSKSEEKLEDDLAHSLLGSEASERRKNREPS 374
Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE----------EDDDEE 215
D +D+D E DD + +D EE E +DDDEE
Sbjct: 375 SSSPPSRPRSRRDSEDEDTERDDSDSDDEEETPAERRRRLAALGELPSARDDDDEE 430
Score = 34.4 bits (79), Expect = 0.40
Identities = 20/115 (17%), Positives = 43/115 (37%), Gaps = 30/115 (26%)
Query: 137 TSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDN-----MDDDDDDDDDDETD 191
T D + + K + +V + +R K D+++ N D ++ +D+
Sbjct: 297 TKHDFIKQIRNLDPKARREVDLETNAPERVKAPARDEVDTNDSSPLTDSKSEEKLEDDLA 356
Query: 192 D-------------------------DNDEDNEEESEEEEENEEDDDEERKAEKK 221
+ D+E+E E ++++ DD+EE AE++
Sbjct: 357 HSLLGSEASERRKNREPSSSSPPSRPRSRRDSEDEDTERDDSDSDDEEETPAERR 411
>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
(DUF2076). This domain, found in various hypothetical
prokaryotic proteins, has no known function. The domain,
however, is found in various periplasmic ligand-binding
sensor proteins.
Length = 234
Score = 35.4 bits (82), Expect = 0.12
Identities = 19/95 (20%), Positives = 25/95 (26%), Gaps = 24/95 (25%)
Query: 101 HGESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQ 160
+ G + D N+ D D QGS
Sbjct: 163 GHSQPA--EIVDAIGEGGDGSGPAPADDTGINNYGDDDSDAAGGDQGS------------ 208
Query: 161 DGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDND 195
+ D +D D DDDD DDD+D
Sbjct: 209 ----------NGDDDDGGFADSGYDDDDMDDDDDD 233
Score = 31.6 bits (72), Expect = 2.0
Identities = 15/90 (16%), Positives = 28/90 (31%), Gaps = 7/90 (7%)
Query: 109 MMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKV 168
M+ +G+ G + D E + + Y + + G
Sbjct: 151 MLLGNGLENLFGGHSQPAEIVDAIGEGGDGSGPAPADDTGINNYGDDDSDAAGGD----- 205
Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDN 198
D+ D D D+ D D+D+D+
Sbjct: 206 --QGSNGDDDDGGFADSGYDDDDMDDDDDD 233
>gnl|CDD|206746 cd01849, YlqF_related_GTPase, Circularly permuted YlqF-related
GTPases. These proteins are found in bacteria,
eukaryotes, and archaea. They all exhibit a circular
permutation of the GTPase signature motifs so that the
order of the conserved G box motifs is G4-G5-G1-G2-G3,
with G4 and G5 being permuted from the C-terminal region
of proteins in the Ras superfamily to the N-terminus of
YlqF-related GTPases.
Length = 146
Score = 34.3 bits (79), Expect = 0.12
Identities = 24/94 (25%), Positives = 36/94 (38%), Gaps = 10/94 (10%)
Query: 374 DEAPNKKVHRKRQAELTAKQKNPKAFTFQSVIKGERKFRRKEDIQAKKHHVPQVDRTPLE 433
D P + + + AEL+ F + +G K + + Q K +
Sbjct: 39 DLVPKEVLRKWV-AELSELYGTKTFFISATNGQGILKLKAEITKQKLKLKYKK------- 90
Query: 434 PPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSV 467
I V VVG P VGKS+ I L+ F S+
Sbjct: 91 --GIRVGVVGLPNVGKSSFINALLNKFKLKVGSI 122
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 35.4 bits (82), Expect = 0.13
Identities = 14/55 (25%), Positives = 30/55 (54%)
Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQ 235
DD+D+DDD D ++E++EE+ EE+ + ++ + K + E + ++
Sbjct: 25 DDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEK 79
Score = 35.0 bits (81), Expect = 0.18
Identities = 20/78 (25%), Positives = 36/78 (46%), Gaps = 15/78 (19%)
Query: 178 MDDDDDDDD---------------DDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKN 222
M D DD++ DDE +DD+ +D+ +E E+EE+ EE KA+ K
Sbjct: 1 MADWDDEEFEPPAPPAKAVVKDKWDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKK 60
Query: 223 SKNEHVQELTAKQKNPKA 240
+ ++E ++ +
Sbjct: 61 ALKAKIEEKEKAKREKEE 78
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 36.0 bits (83), Expect = 0.13
Identities = 31/115 (26%), Positives = 53/115 (46%), Gaps = 23/115 (20%)
Query: 103 ESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDG 162
ES E+S D I S + D D++ DE S + ++S + + +K KE+
Sbjct: 243 ESEEESGKR-DVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEK------- 294
Query: 163 RKRRKVMFDDDLEDNMDDDDDDDD-------DDETDDDNDEDNEEESEEEEENEE 210
RKR K M M+D+D+D++ E ++ + + ++EEE EE
Sbjct: 295 RKRLKKM--------MEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEE 341
Score = 34.4 bits (79), Expect = 0.35
Identities = 17/74 (22%), Positives = 40/74 (54%), Gaps = 2/74 (2%)
Query: 159 SQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
S++ +R V+ +D+ + D+D+D+D+ D+EEE+EE+E+ + +K
Sbjct: 244 SEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKR--LKKM 301
Query: 219 EKKNSKNEHVQELT 232
+ ++E ++ +
Sbjct: 302 MEDEDEDEEMEIVP 315
>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein. This family includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 116
Score = 33.7 bits (77), Expect = 0.13
Identities = 23/109 (21%), Positives = 40/109 (36%), Gaps = 21/109 (19%)
Query: 125 SDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDD 184
S D + + D DD +DD E +F + + D D+
Sbjct: 19 SYDDEHDEDMDDFIEDDDEEQD-----------EIPYDSDEIWAIFGKGRKRSYYDRYDE 67
Query: 185 DDDDETDDDNDEDNEEESE----------EEEENEEDDDEERKAEKKNS 223
DD + + + ++E E E E+++E+RK +KKN
Sbjct: 68 DDALDNMEATFMEIQKEERRSARMARLEDERELAREEEEEKRKKKKKNK 116
Score = 28.7 bits (64), Expect = 8.3
Identities = 20/101 (19%), Positives = 40/101 (39%), Gaps = 19/101 (18%)
Query: 148 SMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDE----------- 196
S + SS + D+D++D ++DD D++ DE D+DE
Sbjct: 2 SSGPTLKPQTSSNKASRSYDDEHDEDMDDFIEDD--DEEQDEIPYDSDEIWAIFGKGRKR 59
Query: 197 ------DNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
D ++ + E + +E + + ++ E +EL
Sbjct: 60 SYYDRYDEDDALDNMEATFMEIQKEERRSARMARLEDEREL 100
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 35.8 bits (83), Expect = 0.13
Identities = 15/65 (23%), Positives = 32/65 (49%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
D++ + + ++ DDD ++E+DD++ + E E + KA++ K E
Sbjct: 235 DEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELER 294
Query: 232 TAKQK 236
AK++
Sbjct: 295 EAKEE 299
Score = 35.5 bits (82), Expect = 0.15
Identities = 14/66 (21%), Positives = 28/66 (42%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
+ L + DDD +++ DDE+ + E E + + +R EK+ + E +
Sbjct: 239 EGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKE 298
Query: 232 TAKQKN 237
+ K
Sbjct: 299 EKQLKK 304
Score = 33.9 bits (78), Expect = 0.47
Identities = 19/90 (21%), Positives = 41/90 (45%), Gaps = 2/90 (2%)
Query: 153 KEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD 212
K Q + + +K K+ + D M + ++ DD+ ++++D+++ E E E +
Sbjct: 212 KRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINK 271
Query: 213 DEERKAEKKNSKN--EHVQELTAKQKNPKA 240
K + K +N + +EL + K K
Sbjct: 272 PVRPKRKTKAQRNKEKRRKELEREAKEEKQ 301
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 35.6 bits (82), Expect = 0.14
Identities = 18/48 (37%), Positives = 26/48 (54%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
DD N D +DD+ +++ EEE EE EE+EE++ EE E
Sbjct: 54 DDPYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEE 101
Score = 34.8 bits (80), Expect = 0.26
Identities = 29/111 (26%), Positives = 54/111 (48%), Gaps = 29/111 (26%)
Query: 106 EKSMMEIDGIHGAMSGAISSDDSDD--ENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGR 163
E++M+ +D +++ D DD E E +TSDD G+ + E +GR
Sbjct: 28 EQNMLALD---------VAARDGDDAIELAEPETSDDP----YGNPDPFGED----DEGR 70
Query: 164 KRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
+E + +D ++++ +E ++D +E+ EEESEE E + +E
Sbjct: 71 ----------IEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEE 111
>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1). This family
consists of several mammalian dentin matrix protein 1
(DMP1) sequences. The dentin matrix acidic
phosphoprotein 1 (DMP1) gene has been mapped to human
chromosome 4q21. DMP1 is a bone and teeth specific
protein initially identified from mineralised dentin.
DMP1 is primarily localised in the nuclear compartment
of undifferentiated osteoblasts. In the nucleus, DMP1
acts as a transcriptional component for activation of
osteoblast-specific genes like osteocalcin. During the
early phase of osteoblast maturation, Ca(2+) surges into
the nucleus from the cytoplasm, triggering the
phosphorylation of DMP1 by a nuclear isoform of casein
kinase II. This phosphorylated DMP1 is then exported out
into the extracellular matrix, where it regulates
nucleation of hydroxyapatite. DMP1 is a unique molecule
that initiates osteoblast differentiation by
transcription in the nucleus and orchestrates
mineralised matrix formation extracellularly, at later
stages of osteoblast maturation. The DMP1 gene has been
found to be ectopically expressed in lung cancer
although the reason for this is unknown.
Length = 514
Score = 35.8 bits (82), Expect = 0.14
Identities = 55/239 (23%), Positives = 95/239 (39%), Gaps = 31/239 (12%)
Query: 102 GESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDD----TESGQGSM-------- 149
G+ST+ S E + G G S D + +DE SDD +E G M
Sbjct: 181 GDSTQDSESEEHWVGGGSEGESSHGDGSEFDDEGMQSDDPESTRSERGNSRMSSAGLKSK 240
Query: 150 --KKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEE 207
K E+ S+QD + + V + ++D E DD N + + +S E
Sbjct: 241 ESKGEDEEQASTQDSGESQSVEYPSRKFFRKSRISEEDGRGELDDSNTMEVKSDSTENAG 300
Query: 208 NEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSV-IKGERKFRSTKPTVVPQPRM 266
+ + R +++S+ QE + + ++P + + Q + + ++ VV + R
Sbjct: 301 LSQSREHSRSESQEDSEENQSQEDSQEVQDPSSESSQEADLPSQENSSESQEEVVSESRG 360
Query: 267 SDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDE-SADDNSSDEDETVEPKTHKDSK 324
+ D H E+QEDS S+E S D SS E ++ E + +S
Sbjct: 361 DNPDNTTSHS---------------EDQEDSESSEEDSLDTPSSSESQSTEEQADSESN 404
>gnl|CDD|213227 cd03260, ABC_PstB_phosphate_transporter, ATP-binding cassette
domain of the phosphate transport system. Phosphate
uptake is of fundamental importance in the cell
physiology of bacteria because phosphate is required as
a nutrient. The Pst system of E. coli comprises four
distinct subunits encoded by the pstS, pstA, pstB, and
pstC genes. The PstS protein is a phosphate-binding
protein located in the periplasmic space. PstA and PstC
are hydrophobic and they form the transmembrane portion
of the Pst system. PstB is the catalytic subunit, which
couples the energy of ATP hydrolysis to the import of
phosphate across cellular membranes through the Pst
system, often referred as ABC-protein. PstB belongs to
one of the largest superfamilies of proteins
characterized by a highly conserved adenosine
triphosphate (ATP) binding cassette (ABC), which is also
a nucleotide binding domain (NBD).
Length = 227
Score = 35.2 bits (82), Expect = 0.14
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 434 PPPIVVAVVGPPQVGKSTLIRCL 456
P + A++GP GKSTL+R L
Sbjct: 24 PKGEITALIGPSGCGKSTLLRLL 46
>gnl|CDD|220779 pfam10488, PP1c_bdg, Phosphatase-1 catalytic subunit binding
region. This conserved C-terminus appears to be a
protein phosphatase-1 catalytic subunit (PP1C) binding
region, which may in some circumstances also be
retroviral in origin since it is found in both herpes
simplex virus and in mouse and man. This domain is found
in Gadd-34 apoptosis-associated proteins as well as the
constitutive repressor of eIF2-alpha
phosphorylation/protein phosphatase 1, regulatory
(inhibitor) subunit 15b, otherwise known as CReP.
Diverse stressful conditions are associated with
phosphorylation of the {alpha} subunit of eukaryotic
translation initiation factor 2 (eIF2{alpha}) on serine
51. This signaling event, which is conserved from yeast
to mammals, negatively regulates the guanine nucleotide
exchange factor, eIF2-B and inhibits the recycling of
eIF2 to its active GTP bound form. In mammalian cells
eIF2{alpha} phosphorylation emerges as an important
event in stress signaling that impacts on gene
expression at both the translational and transcriptional
levels.
Length = 307
Score = 35.4 bits (81), Expect = 0.14
Identities = 30/141 (21%), Positives = 58/141 (41%), Gaps = 27/141 (19%)
Query: 173 DLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELT 232
DLE + D + D++ DD D D+ SE + E +++ + K+ + T
Sbjct: 25 DLESSSDVESISWDEESEDDGFDSDS-SLSESDREQDDEGLHLWNSFTKSVDPYNPLNFT 83
Query: 233 AKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEE 292
A T Q+ T+ P+P S+ D S ++S+E
Sbjct: 84 A--------TIQTAA-----------TIKPKPPSSESDW-------SGEENVSSQEGPLP 117
Query: 293 EQEDSAFSDESADDNSSDEDE 313
+ + S++ + ++S+DE+E
Sbjct: 118 STPEHSSSEDDSWESSADEEE 138
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 34.5 bits (79), Expect = 0.14
Identities = 20/67 (29%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 173 DLEDNMDDDDDDDDDDETDDDNDED-NEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
D E++ D ++ ++DE + ++ ED +EE EE EE EE+ KA +SK+ + +
Sbjct: 19 DEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKAPSSSSKSSSQETI 78
Query: 232 TAKQKNP 238
+ P
Sbjct: 79 SIPPTPP 85
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell cycle
progression and biogenesis of 60S ribosomal subunits.
The Schistosoma mansoni Mak16 has been shown to target
protein transport to the nucleolus.
Length = 97
Score = 32.9 bits (75), Expect = 0.15
Identities = 13/46 (28%), Positives = 30/46 (65%)
Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
F LE +++D+++++E +++++ + E S++EE EE +D E
Sbjct: 51 FKKALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLE 96
Score = 31.8 bits (72), Expect = 0.47
Identities = 11/45 (24%), Positives = 30/45 (66%)
Query: 167 KVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEED 211
K + ++ E+N +++++++++++ + ++EE EEE E+ ED
Sbjct: 53 KALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLED 97
Score = 30.2 bits (68), Expect = 1.5
Identities = 10/43 (23%), Positives = 28/43 (65%)
Query: 178 MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
++ ++ +++D+E +++ +E++E E E ++EE ++E E
Sbjct: 55 LEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLED 97
>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein. This family consists of
Ebola and Marburg virus nucleoproteins. These proteins
are responsible for encapsidation of genomic RNA. It has
been found that nucleoprotein DNA vaccines can offer
protection from the virus.
Length = 717
Score = 35.7 bits (82), Expect = 0.16
Identities = 34/172 (19%), Positives = 59/172 (34%), Gaps = 35/172 (20%)
Query: 171 DDDLEDNMDDDDDDDDD--------DETDDDNDEDNEEESEEEEE-------NEEDDDEE 215
DD MDDD DD D D D ++ E + N ++DD++
Sbjct: 417 DDVQPRPMDDDPDDSRDTTIPPGVVDPIDGESTEYSSYSDSFVGTNDDLVLFNLDEDDDD 476
Query: 216 RKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIK-----GERKFRSTKPTVVPQPRMSDKD 270
K + E A + Q + + + R+++ + D+
Sbjct: 477 SKP---------IPEQAASTYGQTSRERQGIPEPPPGSHQPGNRASQDLNNNNQKQEDES 527
Query: 271 LAIRHKITSALAGLNSKEEEEE------EQEDSAFSDESADDNSSDEDETVE 316
K + L +EE+E + + S ES DD SD ++ V+
Sbjct: 528 TNPIGKTSLRYQELTPVQEEDEPEDQTDDDDSSLPPLESDDDPGSDNEQGVD 579
Score = 33.4 bits (76), Expect = 0.87
Identities = 26/145 (17%), Positives = 47/145 (32%), Gaps = 39/145 (26%)
Query: 128 SDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDD 187
S D N+ + +D++ + G +++ Q+ +D+ ED DDDD
Sbjct: 513 SQDLNNNNQKQEDESTNPIGKTSLRYQELTPVQE---------EDEPEDQTDDDDSSLPP 563
Query: 188 DETDDDNDEDNEE--------------------------ESEEEEENEEDDDEERKAEKK 221
E+DDD DNE+ ++ + + +
Sbjct: 564 LESDDDPGSDNEQGVDLTEVAPPAPVYRDEKEQDEIPHPAQNPQDPTGSIGNVDSDILRS 623
Query: 222 NSKNEHVQELT----AKQKNPKAFT 242
NSK E T + + P
Sbjct: 624 NSKPSAPLEETYMHLLRTQGPFDAI 648
>gnl|CDD|206753 cd04178, Nucleostemin_like, A circularly permuted subfamily of the
Ras GTPases. Nucleostemin (NS) is a nucleolar protein
that functions as a regulator of cell growth and
proliferation in stem cells and in several types of
cancer cells, but is not expressed in the differentiated
cells of most mammalian adult tissues. NS shuttles
between the nucleolus and nucleoplasm bidirectionally at
a rate that is fast and independent of cell type.
Lowering GTP levels decreases the nucleolar retention of
NS, and expression of NS is abruptly down-regulated
during differentiation prior to terminal cell division.
Found only in eukaryotes, NS consists of an N-terminal
basic domain, a coiled-coil domain, a GTP-binding
domain, an intermediate domain, and a C-terminal acidic
domain. Experimental evidence indicates that NS uses its
GTP-binding property as a molecular switch to control
the transition between the nucleolus and nucleoplasm,
and this process involves interaction between the basic,
GTP-binding, and intermediate domains of the protein.
Length = 171
Score = 34.5 bits (80), Expect = 0.17
Identities = 11/18 (61%), Positives = 13/18 (72%)
Query: 436 PIVVAVVGPPQVGKSTLI 453
I V VVG P VGKS++I
Sbjct: 116 SITVGVVGYPNVGKSSVI 133
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 35.9 bits (82), Expect = 0.17
Identities = 44/205 (21%), Positives = 87/205 (42%), Gaps = 23/205 (11%)
Query: 129 DDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDD 188
D E E + + G+ + +Y + + S + + + D ++D DDD D
Sbjct: 350 DIETVEPQQEEPVQDVGEHQINEYGDILPSLKASINNSAINYYDAVKDGKYLDDDSSDAL 409
Query: 189 ETDDDNDEDNEEES------EEEEENEEDDDEERKAE---KKNSKNEHVQELTA-----K 234
TD+D D E++ E+E+ ED++EE + ++ S +EHV+E A +
Sbjct: 410 YTDEDLLFDLEKQKYMDMLDGSEDESVEDNEEEHSGDANEEELSVDEHVEEHNADDSGEQ 469
Query: 235 QKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQ 294
Q + ++ QSV E + V +P ++D + + +E +E
Sbjct: 470 QSDDESGEHQSV--NEIVEEQSVNEHVEEPTVAD-------IVEQETVDEHVEEPAVDEN 520
Query: 295 EDSAFSDESADDNSSDEDETVEPKT 319
E+ +DE ++ + E+ E +
Sbjct: 521 EEQQTADEHVEEPTIAEEHVEEEIS 545
Score = 35.9 bits (82), Expect = 0.18
Identities = 23/102 (22%), Positives = 59/102 (57%), Gaps = 1/102 (0%)
Query: 130 DENDEDDTSDDDTESGQGSMKKYKEQVVSS-QDGRKRRKVMFDDDLEDNMDDDDDDDDDD 188
+EN E++ ++ E+ + ++++Y E+ V ++ + +++E+N +++ +++ ++
Sbjct: 1000 EENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEE 1059
Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
++ ++E+ EE E EEN E++ EE E E+V+E
Sbjct: 1060 NIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEE 1101
Score = 34.8 bits (79), Expect = 0.35
Identities = 18/60 (30%), Positives = 42/60 (70%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
++++E+N++++ +++ ++ +++ +E +EE EE EEN E+ DEE E + + E+V+E
Sbjct: 996 EENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEE 1055
Score = 34.0 bits (77), Expect = 0.62
Identities = 31/134 (23%), Positives = 69/134 (51%), Gaps = 16/134 (11%)
Query: 103 ESTEKSMMEID--GIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQ 160
E+ E+++ E D + D+ + E E++ ++ E+ + ++++Y E+ V
Sbjct: 1013 ENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEE- 1071
Query: 161 DGRKRRKVMFDDDLEDNMDDDDDD--DDDDETDDDNDEDNEEESEEE--EENEEDDDEER 216
++++E+N++++ ++ +++ E ++N E+N EE+ EE EEN E++ EE
Sbjct: 1072 ---------IEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEY 1122
Query: 217 KAEKKNSKNEHVQE 230
E NE E
Sbjct: 1123 DDENPEEHNEEYDE 1136
Score = 33.6 bits (76), Expect = 0.84
Identities = 26/118 (22%), Positives = 61/118 (51%), Gaps = 13/118 (11%)
Query: 131 ENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDD--D 188
E + ++ +++ E + ++++Y E+ V + V ++++E+N+++ D+++ + +
Sbjct: 1016 EENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENV--EENIEENIEEYDEENVEEIE 1073
Query: 189 ETDDDNDEDNEEESEEE---------EENEEDDDEERKAEKKNSKNEHVQELTAKQKN 237
E ++N E+N EE+ EE EEN E++ EE E E + ++ N
Sbjct: 1074 ENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEEHN 1131
Score = 33.2 bits (75), Expect = 0.97
Identities = 26/103 (25%), Positives = 58/103 (56%), Gaps = 2/103 (1%)
Query: 130 DENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDE 189
+EN E++ ++ E+ + ++++ E+ V + + + D E+ + +++ ++ DE
Sbjct: 980 EENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDE 1039
Query: 190 TDDDNDEDNEEESEEE--EENEEDDDEERKAEKKNSKNEHVQE 230
+ + E+N EE+ EE EEN E+ DEE E + + E+++E
Sbjct: 1040 ENVEEIEENAEENVEENIEENIEEYDEENVEEIEENIEENIEE 1082
Score = 33.2 bits (75), Expect = 1.1
Identities = 23/112 (20%), Positives = 50/112 (44%), Gaps = 2/112 (1%)
Query: 122 AISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
A + S DE+ E+ +DD E + V + + + + + + D ++ +
Sbjct: 447 ANEEELSVDEHVEEHNADDSGEQQSDDESGEHQSVNEIVEEQSVNEHVEEPTVADIVEQE 506
Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTA 233
D+ +E DE+ E+++ +E E EE E+ ++ EH++E +
Sbjct: 507 TVDEHVEE--PAVDENEEQQTADEHVEEPTIAEEHVEEEISTAEEHIEEPAS 556
Score = 32.1 bits (72), Expect = 2.4
Identities = 35/168 (20%), Positives = 84/168 (50%), Gaps = 6/168 (3%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEE-EENEEDDDEERKAEK-KNSKNEHV 228
++D E+N+++D +++ ++ +++ +E+ EE EE EEN E++ EE E + + E+V
Sbjct: 948 EEDAEENVEEDAEENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENV 1007
Query: 229 QELTAK--QKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNS 286
+E + ++N + + ++V + E V + + ++ + I + +
Sbjct: 1008 EENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEE-NVEENIEENIEEYDE 1066
Query: 287 KEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKD-SKNPEEDTGLN 333
+ EE E+ +E+ ++N + E +E ++ +N EE+ N
Sbjct: 1067 ENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEEN 1114
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 34.6 bits (80), Expect = 0.18
Identities = 15/49 (30%), Positives = 29/49 (59%)
Query: 186 DDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAK 234
+++ +++ +E EE+ E+E EE +EE +AE + + + EL AK
Sbjct: 4 KNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAK 52
Score = 33.8 bits (78), Expect = 0.26
Identities = 21/92 (22%), Positives = 44/92 (47%), Gaps = 13/92 (14%)
Query: 149 MKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEEN 208
M++ EQV + + ++E+ ++D+ +++ +E + D E+ E E
Sbjct: 1 MEEKNEQVEEEVE---------ETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEA 51
Query: 209 EEDDDEER----KAEKKNSKNEHVQELTAKQK 236
+ D+ EER +A+ +N K +E A +K
Sbjct: 52 KLDELEERYLRLQADFENYKRRIQKENEAAEK 83
>gnl|CDD|206682 cd01895, EngA2, EngA2 GTPase contains the second domain of EngA.
This EngA2 subfamily CD represents the second GTPase
domain of EngA and its orthologs, which are composed of
two adjacent GTPase domains. Since the sequences of the
two domains are more similar to each other than to other
GTPases, it is likely that an ancient gene duplication,
rather than a fusion of evolutionarily distinct GTPases,
gave rise to this family. Although the exact function of
these proteins has not been elucidated, studies have
revealed that the E. coli EngA homolog, Der, and
Neisseria gonorrhoeae EngA are essential for cell
viability. A recent report suggests that E. coli Der
functions in ribosome assembly and stability.
Length = 174
Score = 34.3 bits (80), Expect = 0.18
Identities = 11/24 (45%), Positives = 17/24 (70%)
Query: 435 PPIVVAVVGPPQVGKSTLIRCLIK 458
PI +A++G P VGKS+L+ L+
Sbjct: 1 DPIKIAIIGRPNVGKSSLLNALLG 24
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 35.7 bits (82), Expect = 0.18
Identities = 18/88 (20%), Positives = 33/88 (37%), Gaps = 12/88 (13%)
Query: 164 KRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE--------- 214
+R+ + D DD + + + N EESE EEE E+ +DE
Sbjct: 347 ERKDYILDSSSVPLEKQFDDILYFE--KMEIENRNPEESEHEEEVEDYEDENDHSKRICD 404
Query: 215 -ERKAEKKNSKNEHVQELTAKQKNPKAF 241
+ + +E L +N +++
Sbjct: 405 DDELENHFRAADEKNSHLVVGFRNERSY 432
Score = 33.4 bits (76), Expect = 0.96
Identities = 12/77 (15%), Positives = 32/77 (41%), Gaps = 2/77 (2%)
Query: 161 DGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEK 220
+ ++ ++ + DD + ++ + + E EE E+ E+++D ++
Sbjct: 345 EAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRICD 404
Query: 221 KNSKNEHVQELTAKQKN 237
+ H + A +KN
Sbjct: 405 DDELENHFRA--ADEKN 419
>gnl|CDD|224082 COG1160, COG1160, Predicted GTPases [General function prediction
only].
Length = 444
Score = 35.2 bits (82), Expect = 0.18
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 433 EPPPIVVAVVGPPQVGKSTLIRCLIK 458
E PI +A++G P VGKS+LI ++
Sbjct: 175 ETDPIKIAIIGRPNVGKSSLINAILG 200
Score = 31.4 bits (72), Expect = 3.2
Identities = 13/21 (61%), Positives = 14/21 (66%)
Query: 438 VVAVVGPPQVGKSTLIRCLIK 458
VVA+VG P VGKSTL L
Sbjct: 5 VVAIVGRPNVGKSTLFNRLTG 25
>gnl|CDD|213223 cd03256, ABC_PhnC_transporter, ATP-binding cassette domain of the
binding protein-dependent phosphonate transport system.
Phosphonates are a class of organophosphorus compounds
characterized by a chemically stable
carbon-to-phosphorus (C-P) bond. Phosphonates are
widespread among naturally occurring compounds in all
kingdoms of wildlife, but only prokaryotic
microorganisms are able to cleave this bond. Certain
bacteria such as E. coli can use alkylphosphonates as a
phosphorus source. ABC transporters are a subset of
nucleotide hydrolases that contain a signature motif,
Q-loop, and H-loop/switch region, in addition to, the
Walker A motif/P-loop and Walker B motif commonly found
in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 241
Score = 34.9 bits (81), Expect = 0.19
Identities = 12/19 (63%), Positives = 15/19 (78%)
Query: 438 VVAVVGPPQVGKSTLIRCL 456
VA++GP GKSTL+RCL
Sbjct: 29 FVALIGPSGAGKSTLLRCL 47
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein is
found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 34.6 bits (80), Expect = 0.19
Identities = 14/43 (32%), Positives = 23/43 (53%)
Query: 186 DDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHV 228
DD+ ++D DE +E+ +E +E D+ +KK KN V
Sbjct: 14 GDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSV 56
>gnl|CDD|213229 cd03262, ABC_HisP_GlnQ, ATP-binding cassette domain of the
histidine and glutamine transporters. HisP and GlnQ are
the ATP-binding components of the bacterial periplasmic
histidine and glutamine permeases, respectively.
Histidine permease is a multi-subunit complex containing
the HisQ and HisM integral membrane subunits and two
copies of HisP. HisP has properties intermediate between
those of integral and peripheral membrane proteins and
is accessible from both sides of the membrane,
presumably by its interaction with HisQ and HisM. The
two HisP subunits form a homodimer within the complex.
The domain structure of the amino acid uptake systems is
typical for prokaryotic extracellular solute binding
protein-dependent uptake systems. All of the amino acid
uptake systems also have at least one, and in a few
cases, two extracellular solute binding proteins located
in the periplasm of Gram-negative bacteria, or attached
to the cell membrane of Gram-positive bacteria. The
best-studied member of the PAAT (polar amino acid
transport) family is the HisJQMP system of S.
typhimurium, where HisJ is the extracellular solute
binding proteins and HisP is the ABC protein.
Length = 213
Score = 34.4 bits (80), Expect = 0.20
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 438 VVAVVGPPQVGKSTLIRCL 456
VV ++GP GKSTL+RC+
Sbjct: 28 VVVIIGPSGSGKSTLLRCI 46
>gnl|CDD|213179 cd00267, ABC_ATPase, ATP-binding cassette transporter
nucleotide-binding domain. ABC transporters are a large
family of proteins involved in the transport of a wide
variety of different compounds, like sugars, ions,
peptides, and more complex organic molecules. The
nucleotide-binding domain shows the highest similarity
between all members of the family. ABC transporters are
a subset of nucleotide hydrolases that contain a
signature motif, Q-loop, and H-loop/switch region, in
addition to, the Walker A motif/P-loop and Walker B
motif commonly found in a number of ATP- and GTP-binding
and hydrolyzing proteins.
Length = 157
Score = 33.8 bits (78), Expect = 0.20
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 438 VVAVVGPPQVGKSTLIRCL 456
+VA+VGP GKSTL+R +
Sbjct: 27 IVALVGPNGSGKSTLLRAI 45
>gnl|CDD|224051 COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase
component [Amino acid transport and metabolism].
Length = 240
Score = 34.8 bits (81), Expect = 0.21
Identities = 12/19 (63%), Positives = 15/19 (78%)
Query: 438 VVAVVGPPQVGKSTLIRCL 456
VV ++GP GKSTL+RCL
Sbjct: 30 VVVIIGPSGSGKSTLLRCL 48
>gnl|CDD|118064 pfam09528, Ehrlichia_rpt, Ehrlichia tandem repeat (Ehrlichia_rpt).
This entry represents 77 residues of an 80 amino acid
(240 nucleotide) tandem repeat, found in a variable
number of copies in an immunodominant outer membrane
protein of Ehrlichia chaffeensis, a tick-borne obligate
intracellular pathogen.
Length = 707
Score = 35.4 bits (80), Expect = 0.23
Identities = 60/265 (22%), Positives = 107/265 (40%), Gaps = 26/265 (9%)
Query: 89 MEIFEFLN--ICQVHGESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQ 146
M I F N + E +E E++ G SS+ +EN + ++D +
Sbjct: 25 MRILGFGNKNDEKPQDEDSEILQEELEPAVDDNVGNPSSEVGKEENAPEVKAEDLEPAVA 84
Query: 147 GSMKKYKEQV---VSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESE 203
S++ +V VS + + + +DL+ +D D + + D EEE+
Sbjct: 85 ESVEHSSSEVGKEVSETEKEESNPEVKAEDLQPAVDGDIAHHESEVGDKPAKTSKEEENP 144
Query: 204 EEE----ENEEDDDEERKAEK------KNSKNEHVQELTAKQKNPK---AFTFQSVIKGE 250
E E E +DD E ++ ++SK E E+ A+ P + S GE
Sbjct: 145 EIEAEDGEPAKDDGIEESHQEEDEIVSESSKEEFTAEVKAEDLQPAVDGSIEHSSSEVGE 204
Query: 251 RKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLN------SKEEE--EEEQEDSAFSDE 302
++ K P+ + D A+ + + + SKEEE E + ED + +
Sbjct: 205 EVSKTEKEESNPEVKAEDLQPAVDDDVAHHESEVGDKPAETSKEEETPEVKAEDLQPAVD 264
Query: 303 SADDNSSDEDETVEPKTHKDSKNPE 327
+ ++SS E E + +T K+ PE
Sbjct: 265 GSVEHSSSEIEEHQGETEKEEGIPE 289
Score = 33.8 bits (76), Expect = 0.62
Identities = 38/200 (19%), Positives = 73/200 (36%), Gaps = 16/200 (8%)
Query: 133 DEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDD 192
D DDD + + + ++ + + ++ +++ + ++ + +
Sbjct: 222 DLQPAVDDDVAHHESEVGDKPAETSKEEETPEVKAEDLQPAVDGSVEHSSSEIEEHQGET 281
Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPK--AFTFQSVIKGE 250
+ +E E E+ + DD E + + E V E ++ NP+ A Q GE
Sbjct: 282 EKEEGIPESHAEDLQPAVDDIVEHPSSEPFVAEEEVSETEKEENNPEVLAEDLQDAADGE 341
Query: 251 RKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEE---EEEEQEDSAFSDESADDN 307
VV + R G KEE E ++D SD S +
Sbjct: 342 SGVSDQPAQVVEE----------RESEIEEHQGETEKEEGIPESHAEDDEIASDPSIEHF 391
Query: 308 SSDEDETVEPKTHKDSKNPE 327
S++ + V +T K+ NPE
Sbjct: 392 SAEVGKEVS-ETEKEESNPE 410
>gnl|CDD|131353 TIGR02300, FYDLN_acid, TIGR02300 family protein. Members of this
family are bacterial proteins with a conserved motif
[KR]FYDLN, sometimes flanked by a pair of CXXC motifs,
followed by a long region of low complexity sequence in
which roughly half the residues are Asp and Glu,
including multiple runs of five or more acidic residues.
The function of members of this family is unknown.
Length = 129
Score = 33.4 bits (76), Expect = 0.24
Identities = 25/80 (31%), Positives = 33/80 (41%), Gaps = 4/80 (5%)
Query: 129 DDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDD--- 185
DDE + + D E + + E+ D DD ED DDDDD
Sbjct: 50 DDEKEAVAVKEVDPEIDEAAELVSLEEADEEVDPGAPTPAGGDDLAEDEDIADDDDDVTF 109
Query: 186 -DDDETDDDNDEDNEEESEE 204
+D+E DD +DED S E
Sbjct: 110 LEDEEDDDIDDEDIIGVSGE 129
>gnl|CDD|233037 TIGR00592, pol2, DNA polymerase (pol2). All proteins in this
superfamily for which functions are known are DNA
polymerases.This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 1172
Score = 35.4 bits (81), Expect = 0.24
Identities = 41/275 (14%), Positives = 72/275 (26%), Gaps = 30/275 (10%)
Query: 126 DDSDDENDEDDTSDDDTESGQ-GSMKKYKEQVVSSQDGRKRRKVMFDDDLED-NMDDDDD 183
D + DEDD D D + G KK ++ K + D
Sbjct: 41 DGREFFPDEDDILDLDKDDGSAAEAKKKDKENHKKVTKPNNIKAVRIACAPKKKKDRKKS 100
Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTF 243
D D E N+ E+ + + + + ++ P F
Sbjct: 101 LGKDGLLGDILQELNKTETAQ------------RKITPRLVSVPKLKFSSPADVPAINDF 148
Query: 244 QSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEE---------- 293
+ V + + + + K G E + E
Sbjct: 149 SNHHPAVVDIVKKAIPVSTRYLLEKILIPVPLKRAEFAGGDVQMEGDPELKLASFDIETY 208
Query: 294 ---QEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAAS---AFL 347
+D DE+ D T D ++ E + WK S +
Sbjct: 209 FHDGKDFFPGDENPADEEIMISTTPVIAKQWDYESEPEARVVTWKKPDKPTTGSYVESVS 268
Query: 348 ERQANIVNLAKYVYGDMEDVSVTMEGDEAPNKKVH 382
E + I + + DV +T+ GD +
Sbjct: 269 EEISMIKRFWDVIDQEDTDVEITVNGDNFDLVYLA 303
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 34.8 bits (81), Expect = 0.24
Identities = 11/31 (35%), Positives = 23/31 (74%)
Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDNE 199
M+DD + +++ +++DDDD DD +++D+E
Sbjct: 352 MWDDYHREQLEEVEEEDDDDWDDDWDEDDDE 382
Score = 34.1 bits (79), Expect = 0.46
Identities = 11/24 (45%), Positives = 19/24 (79%)
Query: 176 DNMDDDDDDDDDDETDDDNDEDNE 199
+ ++++DDDD DD+ D+D+DE E
Sbjct: 362 EEVEEEDDDDWDDDWDEDDDEGVE 385
Score = 33.7 bits (78), Expect = 0.51
Identities = 13/45 (28%), Positives = 23/45 (51%)
Query: 159 SQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESE 203
+ + DD ++ +++DDD+ DDD DED++E E
Sbjct: 341 EEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDEGVE 385
Score = 30.2 bits (69), Expect = 7.4
Identities = 10/20 (50%), Positives = 13/20 (65%), Gaps = 2/20 (10%)
Query: 171 DDDLEDNMDDDDDDDDDDET 190
DDD +D D D+DDD+ E
Sbjct: 369 DDDWDD--DWDEDDDEGVEV 386
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 33.9 bits (78), Expect = 0.25
Identities = 9/53 (16%), Positives = 22/53 (41%)
Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEH 227
+ + D D D + D+ + ++E E E++ E+ + S+ +
Sbjct: 89 KKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKP 141
Score = 30.4 bits (69), Expect = 3.6
Identities = 10/58 (17%), Positives = 30/58 (51%), Gaps = 2/58 (3%)
Query: 185 DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFT 242
+ D D D+ ++++ ++ E+ +E + E++ + S +E + L+ + P+ +
Sbjct: 90 KSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLS--ELKPRKYA 145
>gnl|CDD|217424 pfam03205, MobB, Molybdopterin guanine dinucleotide synthesis
protein B. This protein contains a P-loop.
Length = 126
Score = 33.1 bits (76), Expect = 0.25
Identities = 12/21 (57%), Positives = 15/21 (71%)
Query: 438 VVAVVGPPQVGKSTLIRCLIK 458
+V VVGP GK+TLIR L+
Sbjct: 2 IVLVVGPKDSGKTTLIRKLLN 22
>gnl|CDD|224085 COG1163, DRG, Predicted GTPase [General function prediction only].
Length = 365
Score = 34.6 bits (80), Expect = 0.27
Identities = 12/18 (66%), Positives = 14/18 (77%)
Query: 439 VAVVGPPQVGKSTLIRCL 456
VA+VG P VGKSTL+ L
Sbjct: 66 VALVGFPSVGKSTLLNKL 83
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 33.9 bits (77), Expect = 0.28
Identities = 23/86 (26%), Positives = 44/86 (51%), Gaps = 3/86 (3%)
Query: 149 MKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEEN 208
+KK K ++ + K R+ ++LE+ M+ + ++ DE +D +++ E E + EE
Sbjct: 115 IKKKKSLIIRQEQIEKARQER--EELEERMEWERREEKIDEREDQEEQEREREEQTIEE- 171
Query: 209 EEDDDEERKAEKKNSKNEHVQELTAK 234
+ DD E E+ S+ E + T K
Sbjct: 172 QSDDSEHEIIEQDESETESDDDKTEK 197
>gnl|CDD|234401 TIGR03925, T7SS_EccC_b, type VII secretion protein EccCb. This
model represents the C-terminal domain or EccCb subunit
of the type VII secretion protein EccC as found in the
Actinobacteria. Type VII secretion is defined more
broadly as including secretion systems for ESAT-6-like
proteins in the Firmicutes as well as in the
Actinobacteria, but this family does not show close
homologs in the Firmicutes [Protein fate, Protein and
peptide secretion and trafficking].
Length = 566
Score = 34.9 bits (81), Expect = 0.28
Identities = 16/28 (57%), Positives = 19/28 (67%), Gaps = 2/28 (7%)
Query: 439 VAVVGPPQVGKSTLIRCLIKNF--TKTP 464
VA+VG PQ GKST +R LI + T TP
Sbjct: 82 VAIVGGPQSGKSTALRTLILSLALTHTP 109
>gnl|CDD|183063 PRK11264, PRK11264, putative amino-acid ABC transporter ATP-binding
protein YecC; Provisional.
Length = 250
Score = 34.3 bits (79), Expect = 0.28
Identities = 13/29 (44%), Positives = 20/29 (68%), Gaps = 2/29 (6%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLS 466
VVA++GP GK+TL+RC+ N + P +
Sbjct: 31 VVAIIGPSGSGKTTLLRCI--NLLEQPEA 57
>gnl|CDD|206726 cd04163, Era, E. coli Ras-like protein (Era) is a multifunctional
GTPase. Era (E. coli Ras-like protein) is a
multifunctional GTPase found in all bacteria except some
eubacteria. It binds to the 16S ribosomal RNA (rRNA) of
the 30S subunit and appears to play a role in the
assembly of the 30S subunit, possibly by chaperoning the
16S rRNA. It also contacts several assembly elements of
the 30S subunit. Era couples cell growth with
cytokinesis and plays a role in cell division and energy
metabolism. Homologs have also been found in eukaryotes.
Era contains two domains: the N-terminal GTPase domain
and a C-terminal domain KH domain that is critical for
RNA binding. Both domains are important for Era
function. Era is functionally able to compensate for
deletion of RbfA, a cold-shock adaptation protein that
is required for efficient processing of the 16S rRNA.
Length = 168
Score = 33.6 bits (78), Expect = 0.30
Identities = 10/15 (66%), Positives = 13/15 (86%)
Query: 439 VAVVGPPQVGKSTLI 453
VA++G P VGKSTL+
Sbjct: 6 VAIIGRPNVGKSTLL 20
>gnl|CDD|224742 COG1829, COG1829, Predicted archaeal kinase (sugar kinase
superfamily) [General function prediction only].
Length = 283
Score = 34.3 bits (79), Expect = 0.31
Identities = 18/78 (23%), Positives = 28/78 (35%), Gaps = 14/78 (17%)
Query: 700 YTPQHVACMAHFWGPITRS--------GTGFL---AVQDVAKREPGFRVIATGTILDANQ 748
+ P H+ F+ P+ G G V + G V G +D
Sbjct: 6 FVPGHITG---FFVPVIGKDPLKSGSIGAGVALERGVTVEVRFGEGTGVRLNGKKIDLPI 62
Query: 749 TAEVTKKLKLTGVPMKIY 766
T +V +KL GV ++I
Sbjct: 63 TRKVIEKLGPDGVGVRIE 80
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 34.1 bits (78), Expect = 0.31
Identities = 24/97 (24%), Positives = 38/97 (39%), Gaps = 7/97 (7%)
Query: 512 GDFEDLETGEKHSGDKSGGGSGGVSGGGSGDDKPKTRAELMEKK--RKLKEQFDAEYDDK 569
GD ++ E EK S S SG K K R E +K + K+Q + +D+
Sbjct: 135 GDVKNAEKFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDE 194
Query: 570 DG-----GGNTYYDDLKTQATRQAELNRQQFHDLDDN 601
D GG++ + + RQ+ HD +
Sbjct: 195 DKRPGGGGGSSGGQSGLSTKDEPPKEKRQKHHDPERR 231
>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin).
Length = 585
Score = 34.6 bits (79), Expect = 0.33
Identities = 21/91 (23%), Positives = 36/91 (39%), Gaps = 7/91 (7%)
Query: 142 TESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDE------TDDDND 195
Q ++K + V DG+ ++K D+ E + DD+DD D
Sbjct: 166 NPQSQATLKS-VFEEVGKLDGQSKQKRERSDEREKSYQDDEDDTYRQNNIPYEDVVGGED 224
Query: 196 EDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
+ EE EE++ EE + + K S +
Sbjct: 225 WNPIEEGEEDQTQEEVKRSKERTHKGRSLPD 255
>gnl|CDD|226907 COG4530, COG4530, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 129
Score = 32.9 bits (75), Expect = 0.33
Identities = 20/72 (27%), Positives = 36/72 (50%), Gaps = 2/72 (2%)
Query: 146 QGSMKKYKEQVVSSQDGRKRRK--VMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESE 203
+ ++ +E+VV D V +D +D DD D D+ D D D+D+++
Sbjct: 47 DAAAEEEEEEVVKEVDAENEEVEVVSLEDADDDPKGGDDLPDLGDDEDVDLDDDDDDTFL 106
Query: 204 EEEENEEDDDEE 215
E+EE+++DD
Sbjct: 107 EDEEDDDDDVSG 118
Score = 31.8 bits (72), Expect = 0.81
Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 2/40 (5%)
Query: 170 FDDDLEDNMDDDDDDD--DDDETDDDNDEDNEEESEEEEE 207
DD + ++DDDDDD +D+E DDD+ ++EE
Sbjct: 89 LGDDEDVDLDDDDDDTFLEDEEDDDDDVSGIIGVPGDDEE 128
>gnl|CDD|234624 PRK00089, era, GTPase Era; Reviewed.
Length = 292
Score = 34.3 bits (80), Expect = 0.37
Identities = 12/19 (63%), Positives = 15/19 (78%)
Query: 439 VAVVGPPQVGKSTLIRCLI 457
VA+VG P VGKSTL+ L+
Sbjct: 8 VAIVGRPNVGKSTLLNALV 26
>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex. This Mediator
complex subunit was formerly known as Srb4 in yeasts or
Trap80 in Drosophila and human. The Med17 subunit is
located within the head domain and is essential for cell
viability to the extent that a mutant strain of
cerevisiae lacking it shows all RNA polymerase
II-dependent transcription ceasing at non-permissive
temperatures.
Length = 454
Score = 34.3 bits (79), Expect = 0.39
Identities = 12/54 (22%), Positives = 29/54 (53%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
+ L + + + D E D+ ++D E+++++ EEN++ +E KA + +
Sbjct: 52 ESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARDEVLEQ 105
>gnl|CDD|224617 COG1703, ArgK, Putative periplasmic protein kinase ArgK and related
GTPases of G3E family [Amino acid transport and
metabolism].
Length = 323
Score = 34.2 bits (79), Expect = 0.39
Identities = 12/37 (32%), Positives = 16/37 (43%)
Query: 431 PLEPPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSV 467
P V+ + G P GKSTLI L + + V
Sbjct: 46 PRTGNAHVIGITGVPGAGKSTLIEALGRELRERGHRV 82
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 34.1 bits (78), Expect = 0.42
Identities = 22/122 (18%), Positives = 51/122 (41%), Gaps = 16/122 (13%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
D +DEN D+ + D T ++Q + +RR + + N+++
Sbjct: 186 GPDSDEDENPLDEEAPDMTPE-----TSKQDQKEERRAAVERRLAELVEMINWNLEERRR 240
Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEERK-----------AEKKNSKNEHVQELT 232
D ++ ++N E + ++ +E E+ ED + + + +E + K+E + L
Sbjct: 241 DLRKEQELEENVERDSDDEDEHGEDSEDGETKPESYITSEYIERISEIRKMKDERLSSLA 300
Query: 233 AK 234
+
Sbjct: 301 SM 302
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 33.9 bits (77), Expect = 0.47
Identities = 14/24 (58%), Positives = 18/24 (75%)
Query: 192 DDNDEDNEEESEEEEENEEDDDEE 215
D++DED EEE EEEEE E ++ E
Sbjct: 133 DESDEDEEEEEEEEEEEAEVEENE 156
Score = 32.4 bits (73), Expect = 1.2
Identities = 33/142 (23%), Positives = 53/142 (37%), Gaps = 12/142 (8%)
Query: 192 DDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIK--G 249
D ++E+ + +S EEE EE +EE NE ++ + T +V G
Sbjct: 49 DSSEENGDGDSSEEEGEEETSNEE-------ENNEDSDGNEDEEAEAENTTLSTVTLGYG 101
Query: 250 ERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDE---SADD 306
T + ++ K K T +EEEEEE+E+ A +E +
Sbjct: 102 GDATPGTGNIGLAALQLPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNG 161
Query: 307 NSSDEDETVEPKTHKDSKNPEE 328
S++ E N EE
Sbjct: 162 TSTNSTEVDHGNGSSGGDNGEE 183
Score = 31.2 bits (70), Expect = 2.9
Identities = 14/35 (40%), Positives = 20/35 (57%)
Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS 223
E + D DE+ EEE EEEE E++++ NS
Sbjct: 132 EDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNS 166
Score = 30.4 bits (68), Expect = 5.2
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 188 DETDDDNDEDNEEESEEEEENEED 211
DE+D+D +E+ EEE EE E E +
Sbjct: 133 DESDEDEEEEEEEEEEEAEVEENE 156
>gnl|CDD|206752 cd01859, MJ1464, An uncharacterized, circularly permuted subfamily
of the Ras GTPases. This family represents archaeal
GTPase typified by the protein MJ1464 from Methanococcus
jannaschii. The members of this family show a circular
permutation of the GTPase signature motifs so that
C-terminal strands 5, 6, and 7 (strands 6 contain the
NKxD motif) are relocated to the N terminus.
Length = 157
Score = 32.7 bits (75), Expect = 0.47
Identities = 12/21 (57%), Positives = 17/21 (80%)
Query: 436 PIVVAVVGPPQVGKSTLIRCL 456
P++V VVG P+VGKS++I L
Sbjct: 99 PVIVGVVGYPKVGKSSIINAL 119
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 34.0 bits (78), Expect = 0.52
Identities = 21/78 (26%), Positives = 36/78 (46%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
+++ ED DDDD DD+DE +D E + E + +++ + E + + V E
Sbjct: 140 EEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDADLSKKDVLE 199
Query: 231 LTAKQKNPKAFTFQSVIK 248
K P+ T+ V K
Sbjct: 200 EPKMFKKPEEITWDDVFK 217
Score = 33.2 bits (76), Expect = 0.96
Identities = 15/55 (27%), Positives = 27/55 (49%)
Query: 287 KEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQK 341
+EEE+E+ +D DE D++ E V + +ED + +DL++K
Sbjct: 141 EEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDADLSKK 195
Score = 30.2 bits (68), Expect = 7.3
Identities = 28/139 (20%), Positives = 53/139 (38%), Gaps = 29/139 (20%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
S D+ ++E+++DD D E E+V +S + R +
Sbjct: 137 SEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDR--------------VKE 182
Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDE----------ERKAEKKNSKNEHVQELT- 232
DD++DE D + +D EE + ++ EE + + +K ++E + L
Sbjct: 183 DDEEDEDADLSKKDVLEEPKMFKKPEEITWDDVFKKLKEIMSARGKKTTDRSEQIDLLEE 242
Query: 233 ----AKQKNPKAFTFQSVI 247
A+ K S+I
Sbjct: 243 LLTIAETPYQKVKILFSII 261
>gnl|CDD|227508 COG5181, HSH155, U2 snRNP spliceosome subunit [RNA processing and
modification].
Length = 975
Score = 34.2 bits (78), Expect = 0.54
Identities = 25/163 (15%), Positives = 47/163 (28%), Gaps = 8/163 (4%)
Query: 168 VMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEH 227
M D + ++ +D ++ E+ NE D ++ + +
Sbjct: 5 EMEDKAEDSRRSTKQASVWYEDRNDSRGNVSQWEATPRTANESKDLKQEQTPLGYIQVR- 63
Query: 228 VQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQP----RMSDKDLAIRHKITSAL-A 282
++L +K + F S K E + +T P R + +I L A
Sbjct: 64 -EDLYSKTRMDPGFQPDSDKKRELELNNTWDMEYPDSKRSSRWDEMGYEPPQEINMCLPA 122
Query: 283 GLNSKEEEEEEQEDSA-FSDESADDNSSDEDETVEPKTHKDSK 324
+ D F E + DE + P
Sbjct: 123 RGYKALTDFHGYADLGFFKVEDLKYFADDEKDFFMPLLEDREG 165
>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
Length = 613
Score = 34.1 bits (79), Expect = 0.57
Identities = 29/170 (17%), Positives = 51/170 (30%), Gaps = 50/170 (29%)
Query: 170 FDDDLEDNMDDDDDDDDDDETDD------DNDEDNEEESEEE----EENEEDDDEERKAE 219
+ + D +D+ +D +T D+ E + + +E EED+
Sbjct: 18 LVEAEAECSDGEDELEDLFDTGSDISDLIDDAEVVQGGNSLALFHQQEAEEDE------- 70
Query: 220 KKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQ---PRM-----SDKDL 271
+ +Q L +RK+ S+ PR+ S +
Sbjct: 71 ------QQIQAL------------------KRKYLSSPEKSPVADLSPRLEAISLSPRKK 106
Query: 272 AIRHKITSALAGL-NSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTH 320
A R +GL NS EEE + + + S E
Sbjct: 107 AKRRLFEEQDSGLGNSLEEESTDVVEEEGVESSGGGEGGSETGQGGGNGL 156
>gnl|CDD|238540 cd01120, RecA-like_NTPases, RecA-like NTPases. This family includes
the NTP binding domain of F1 and V1 H+ATPases, DnaB and
related helicases as well as bacterial RecA and related
eukaryotic and archaeal recombinases. This group also
includes bacterial conjugation proteins and related DNA
transfer proteins involved in type II and type IV
secretion.
Length = 165
Score = 32.5 bits (74), Expect = 0.58
Identities = 10/31 (32%), Positives = 14/31 (45%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVI 468
++ V GP GK+TL L N V+
Sbjct: 1 LILVFGPTGSGKTTLALQLALNIATKGGKVV 31
>gnl|CDD|236546 PRK09518, PRK09518, bifunctional cytidylate kinase/GTPase Der;
Reviewed.
Length = 712
Score = 34.0 bits (78), Expect = 0.62
Identities = 12/20 (60%), Positives = 16/20 (80%)
Query: 438 VVAVVGPPQVGKSTLIRCLI 457
VVA+VG P VGKSTL+ ++
Sbjct: 277 VVAIVGRPNVGKSTLVNRIL 296
>gnl|CDD|165163 PHA02811, PHA02811, putative host range protein; Provisional.
Length = 197
Score = 32.7 bits (74), Expect = 0.62
Identities = 12/36 (33%), Positives = 21/36 (58%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEE 207
D + + DDDDD+D D+D ++D + E++ E
Sbjct: 162 DACDYCIISSDDDDDNDNADNDEEDDEVNDIEDDYE 197
Score = 31.2 bits (70), Expect = 2.0
Identities = 11/23 (47%), Positives = 18/23 (78%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDD 193
DDD DN D+D++DD+ ++ +DD
Sbjct: 173 DDDDNDNADNDEEDDEVNDIEDD 195
Score = 31.2 bits (70), Expect = 2.4
Identities = 12/33 (36%), Positives = 21/33 (63%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESE 203
D + + DDDD+D+ D++ +DD D E++ E
Sbjct: 165 DYCIISSDDDDDNDNADNDEEDDEVNDIEDDYE 197
Score = 30.8 bits (69), Expect = 3.4
Identities = 18/50 (36%), Positives = 28/50 (56%), Gaps = 8/50 (16%)
Query: 174 LEDNMDDDDD-------DDDDDETDDDNDEDNEEESEEEEE-NEEDDDEE 215
L+D ++ DD D +DDD+D DN + EE++E N+ +DD E
Sbjct: 148 LKDYINISDDYYLYDACDYCIISSDDDDDNDNADNDEEDDEVNDIEDDYE 197
>gnl|CDD|224083 COG1161, COG1161, Predicted GTPases [General function prediction
only].
Length = 322
Score = 33.5 bits (77), Expect = 0.66
Identities = 21/79 (26%), Positives = 30/79 (37%), Gaps = 11/79 (13%)
Query: 382 HRKRQAELTAKQKNPKA--FTFQSVIKGERKFRRKEDIQAKKHHVPQVDRTPLEPPPIVV 439
+ K++ + + K+ + E+ K KK L I V
Sbjct: 85 YFKKEEGIKPIFVSAKSRQGGKKIRKALEKLSEEKIKRLKKKG---------LLKRKIRV 135
Query: 440 AVVGPPQVGKSTLIRCLIK 458
VVG P VGKSTLI L+
Sbjct: 136 GVVGYPNVGKSTLINRLLG 154
>gnl|CDD|217025 pfam02421, FeoB_N, Ferrous iron transport protein B. Escherichia
coli has an iron(II) transport system (feo) which may
make an important contribution to the iron supply of the
cell under anaerobic conditions. FeoB has been
identified as part of this transport system. FeoB is a
large 700-800 amino acid integral membrane protein. The
N terminus contains a P-loop motif suggesting that iron
transport may be ATP dependent.
Length = 190
Score = 32.8 bits (76), Expect = 0.67
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 437 IVVAVVGPPQVGKSTLIRCL 456
I +A+VG P VGK+TL L
Sbjct: 1 ITIALVGNPNVGKTTLFNAL 20
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 32.7 bits (75), Expect = 0.70
Identities = 11/41 (26%), Positives = 27/41 (65%)
Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
+ D+DDE + E++E++ +++E++++DD ++ AE
Sbjct: 1 ESKSDEDDEFEVVPAEEDEKKVDKDEDDDKDDIDDLTAEAL 41
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 33.4 bits (76), Expect = 0.73
Identities = 11/47 (23%), Positives = 21/47 (44%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
D ++ DE +D E +E EE E E + + ++ ++ E
Sbjct: 341 DVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAERLAETE 387
Score = 32.6 bits (74), Expect = 1.2
Identities = 9/47 (19%), Positives = 21/47 (44%)
Query: 173 DLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
D+ D++ +D+ E D+ E+ E E + ++ + + E
Sbjct: 341 DVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAERLAETE 387
Score = 31.4 bits (71), Expect = 2.7
Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 3/49 (6%)
Query: 171 DDDLEDNMDDDDDD---DDDDETDDDNDEDNEEESEEEEENEEDDDEER 216
D D +D + + E D +E +EESE+E E +E EE
Sbjct: 318 LADFGDYYEDIFEVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEV 366
>gnl|CDD|213196 cd03229, ABC_Class3, ATP-binding cassette domain of the binding
protein-dependent transport systems. This class is
comprised of all BPD (Binding Protein Dependent) systems
that are largely represented in archaea and eubacteria
and are primarily involved in scavenging solutes from
the environment. ABC transporters are a large family of
proteins involved in the transport of a wide variety of
different compounds, like sugars, ions, peptides, and
more complex organic molecules. The nucleotide binding
domain shows the highest similarity between all members
of the family. ABC transporters are a subset of
nucleotide hydrolases that contain a signature motif,
Q-loop, and H-loop/switch region, in addition to, the
Walker A motif/P-loop and Walker B motif commonly found
in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 178
Score = 32.5 bits (75), Expect = 0.79
Identities = 11/19 (57%), Positives = 16/19 (84%)
Query: 438 VVAVVGPPQVGKSTLIRCL 456
+VA++GP GKSTL+RC+
Sbjct: 28 IVALLGPSGSGKSTLLRCI 46
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 33.0 bits (75), Expect = 0.80
Identities = 12/38 (31%), Positives = 22/38 (57%)
Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
+++ + DE+ EEE EEEEE + E+R++ +
Sbjct: 224 SDSESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKAS 261
>gnl|CDD|165245 PHA02934, PHA02934, Hypothetical protein; Provisional.
Length = 253
Score = 33.1 bits (75), Expect = 0.80
Identities = 18/43 (41%), Positives = 25/43 (58%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
D D D++DD DD DD D+ D+ D DN + E+E +DD
Sbjct: 20 DIDDIDDIDDIDDIDDIDDIDNVQDIDNGIVQDIEDEASNNDD 62
Score = 30.0 bits (67), Expect = 7.0
Identities = 16/45 (35%), Positives = 27/45 (60%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEER 216
DD++D D DD DD DD D DN +D + ++ E+E ++++
Sbjct: 19 DDIDDIDDIDDIDDIDDIDDIDNVQDIDNGIVQDIEDEASNNDDH 63
>gnl|CDD|206750 cd01857, HSR1_MMR1, A circularly permuted subfamily of the Ras
GTPases. Human HSR1 is localized to the human MHC class
I region and is highly homologous to a putative
GTP-binding protein, MMR1 from mouse. These proteins
represent a new subfamily of GTP-binding proteins that
has only eukaryote members. This subfamily shows a
circular permutation of the GTPase signature motifs so
that the C-terminal strands 5, 6, and 7 (strand 6
contains the G4 box with sequence NKXD) are relocated to
the N-terminus.
Length = 140
Score = 31.8 bits (73), Expect = 0.84
Identities = 17/59 (28%), Positives = 24/59 (40%), Gaps = 11/59 (18%)
Query: 433 EPPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGP------VTLIIKDSIR--DC 483
+ +VG P VGKS+LI L +SV P T+ ++ I DC
Sbjct: 79 ALNEATIGLVGYPNVGKSSLINAL---VGSKKVSVSSTPGKTKHFQTIFLEPGITLCDC 134
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 33.4 bits (76), Expect = 0.88
Identities = 31/245 (12%), Positives = 84/245 (34%), Gaps = 17/245 (6%)
Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERK 252
+N + + EE + E E+ K + + + ++L +++N + + +
Sbjct: 184 ENLAELIIDLEELKLQELKLKEQAKKALEYYQLK--EKLELEEENLLYLDYLKLNEERID 241
Query: 253 FRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQ---EDSAFSDESADDNSS 309
+ S K + + A +KEEE+E++ E+ + ++ S
Sbjct: 242 LLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKS 301
Query: 310 DEDETVEPKTHKDSK--NPEEDTGLNWKSDLAQKAASAFLERQANIVNLAKYVYGDMEDV 367
+ + K + K E++ K +K LE++ + + + + E+
Sbjct: 302 ELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQ 361
Query: 368 SVTMEGDEAPNKKVH----------RKRQAELTAKQKNPKAFTFQSVIKGERKFRRKEDI 417
++ ++ A+L ++ K + ++ED+
Sbjct: 362 LEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDL 421
Query: 418 QAKKH 422
++
Sbjct: 422 LKEEK 426
Score = 30.7 bits (69), Expect = 6.5
Identities = 24/127 (18%), Positives = 48/127 (37%), Gaps = 1/127 (0%)
Query: 127 DSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDD 186
D+ E E++ + E +K E + ++ D+ E+ +
Sbjct: 910 DNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLA 969
Query: 187 DDETD-DDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQS 245
+E + E E +EE N+++ +ER E+K + E T ++ F S
Sbjct: 970 KEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVS 1029
Query: 246 VIKGERK 252
+ +G K
Sbjct: 1030 INRGLNK 1036
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 33.2 bits (76), Expect = 0.89
Identities = 12/55 (21%), Positives = 23/55 (41%)
Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFT 242
+D +E+ E + + + +++K EKK K+E +L K T
Sbjct: 48 KVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKT 102
>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
selection and in elongation by RNA polymerase II
[Transcription].
Length = 521
Score = 33.1 bits (75), Expect = 0.91
Identities = 30/179 (16%), Positives = 58/179 (32%), Gaps = 23/179 (12%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
DD+ + D+D++++ D ++ E NE+ EE+ +N + + K E + +
Sbjct: 30 LDDVLSSSSDEDNNENVDYAEESGGEGNEKSEEEKFKNPYRLEGKFKDE---ADRAKIMA 86
Query: 231 LTAKQKNPKAFTFQSVIKG--ERKF------------------RSTKPTVVPQPRMSDKD 270
+T ++ F + I ER+ RST+ + K
Sbjct: 87 MTEIERESILFEREEEISKLMERRELAIRMEQQHRSSGCTDTRRSTRYEPLTSAAEEKKK 146
Query: 271 LAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEED 329
+ K T E E Q + + + E+ T D
Sbjct: 147 KLLELKKTREREERLYSERHIELQRFKDYKELEESEQGLQEEYTPSYAEEAVEDISRTD 205
>gnl|CDD|213181 cd03214, ABC_Iron-Siderophores_B12_Hemin, ATP-binding component of
iron-siderophores, vitamin B12 and hemin transporters
and related proteins. ABC transporters, involved in the
uptake of siderophores, heme, and vitamin B12, are
widely conserved in bacteria and archaea. Only very few
species lack representatives of the siderophore family
transporters. The E. coli BtuCD protein is an ABC
transporter mediating vitamin B12 uptake. The two
ATP-binding cassettes (BtuD) are in close contact with
each other, as are the two membrane-spanning subunits
(BtuC); this arrangement is distinct from that observed
for the E. coli lipid flippase MsbA. The BtuC subunits
provide 20 transmembrane helices grouped around a
translocation pathway that is closed to the cytoplasm by
a gate region, whereas the dimer arrangement of the BtuD
subunits resembles the ATP-bound form of the Rad50 DNA
repair enzyme. A prominent cytoplasmic loop of BtuC
forms the contact region with the ATP-binding cassette
and represent a conserved motif among the ABC
transporters.
Length = 180
Score = 32.0 bits (74), Expect = 0.96
Identities = 12/37 (32%), Positives = 18/37 (48%), Gaps = 5/37 (13%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
+V ++GP GKSTL++ L L G + L
Sbjct: 27 IVGILGPNGAGKSTLLKTLAGL-----LKPSSGEILL 58
>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist. LIN-8
is a nuclear protein, present at the sites of
transcriptional repressor complexes, which interacts
with LIN-35 Rb.Lin35 Rb is a product of the class B
synMuv gene lin-35 which silences genes required for
vulval specification through chromatin modification and
remodelling. The biological role of the interaction has
not yet been determined however predictions have been
made. The interaction shows that class A synMuv genes
control vulval induction through the transcriptional
regulation of gene expression. LIN-8 normally functions
as part of a protein complex however when the complex is
absent, other family members can partially replace LIN-8
activity.
Length = 316
Score = 32.7 bits (75), Expect = 0.99
Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 5/46 (10%)
Query: 173 DLEDNMDDDDDD-----DDDDETDDDNDEDNEEESEEEEENEEDDD 213
ED DD+DD +D T + D + +E +E E E DD
Sbjct: 132 TGEDQEADDEDDIIYDGIFEDRTRESQDMEENDEVDEVEVEEVPDD 177
Score = 30.8 bits (70), Expect = 4.3
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
D + DD+DD D ED ES++ EEN+E D+ E
Sbjct: 135 DQEADDEDDIIYDGIFEDRTRESQDMEENDEVDEVE 170
>gnl|CDD|184539 PRK14150, PRK14150, heat shock protein GrpE; Provisional.
Length = 193
Score = 32.2 bits (74), Expect = 1.00
Identities = 13/46 (28%), Positives = 21/46 (45%)
Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTA 233
+E E EE E E E +E+ DE ++ + + + EL A
Sbjct: 3 NEEQKTPQEQVSEEIEAEVEQQEEADEAELEDELDEADARIAELEA 48
>gnl|CDD|237511 PRK13796, PRK13796, GTPase YqeH; Provisional.
Length = 365
Score = 32.9 bits (76), Expect = 1.0
Identities = 14/23 (60%), Positives = 15/23 (65%)
Query: 439 VAVVGPPQVGKSTLIRCLIKNFT 461
V VVG VGKSTLI +IK T
Sbjct: 163 VYVVGVTNVGKSTLINRIIKEIT 185
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 31.5 bits (72), Expect = 1.0
Identities = 17/70 (24%), Positives = 33/70 (47%), Gaps = 10/70 (14%)
Query: 179 DDDDDDDDDDETDDDNDEDNEE----ESEEEEENEEDDD-----EERKAEKKNSKNEHVQ 229
D D DD++DE + + E + + ++E+EE D +ER E++ + E +
Sbjct: 27 DGVDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRRE-QK 85
Query: 230 ELTAKQKNPK 239
E +K +
Sbjct: 86 EKEEIEKYRE 95
Score = 30.3 bits (69), Expect = 2.7
Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 11/75 (14%)
Query: 174 LEDNMDDDDDDDDDDETDDDN---------DEDNEEESEEEEENEEDDDEERKAEKKNSK 224
++DD++D+D D DN D+++EE E +E DE RK ++ +
Sbjct: 27 DGVDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQKE 86
Query: 225 NEHVQELTAKQKNPK 239
E +++ +NPK
Sbjct: 87 KEEIEKYRE--ENPK 99
>gnl|CDD|112137 pfam03308, ArgK, ArgK protein. The ArgK protein acts as an ATPase
enzyme and as a kinase, and phosphorylates periplasmic
binding proteins involved in the LAO (lysine, arginine,
ornithine)/AO transport systems.
Length = 267
Score = 32.7 bits (75), Expect = 1.1
Identities = 11/30 (36%), Positives = 13/30 (43%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSV 467
V + G P GKSTLI L + V
Sbjct: 31 RVGITGVPGAGKSTLIEALGMELRRRGHRV 60
>gnl|CDD|224081 COG1159, Era, GTPase [General function prediction only].
Length = 298
Score = 32.5 bits (75), Expect = 1.1
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 439 VAVVGPPQVGKSTLIRCLIK 458
VA++G P VGKSTL+ L+
Sbjct: 9 VAIIGRPNVGKSTLLNALVG 28
>gnl|CDD|213202 cd03235, ABC_Metallic_Cations, ATP-binding cassette domain of the
metal-type transporters. This family includes
transporters involved in the uptake of various metallic
cations such as iron, manganese, and zinc. The ATPases
of this group of transporters are very similar to
members of iron-siderophore uptake family suggesting
that they share a common ancestor. The best
characterized metal-type ABC transporters are the
YfeABCD system of Y. pestis, the SitABCD system of
Salmonella enterica serovar Typhimurium, and the SitABCD
transporter of Shigella flexneri. Moreover other
uncharacterized homologs of these metal-type
transporters are mainly found in pathogens like
Haemophilus or enteroinvasive E. coli isolates.
Length = 213
Score = 32.1 bits (74), Expect = 1.1
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 434 PPPIVVAVVGPPQVGKSTLIRCLI 457
P +A+VGP GKSTL++ ++
Sbjct: 23 KPGEFLAIVGPNGAGKSTLLKAIL 46
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 33.1 bits (75), Expect = 1.1
Identities = 29/136 (21%), Positives = 53/136 (38%), Gaps = 24/136 (17%)
Query: 136 DTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDND 195
++D ES + KK KE+ S KR++ + D + + +T + +
Sbjct: 53 SSNDSKAESTKKPNKKIKEEATSPLKSTKRQR---EKPASDTEEPERVTAKKSKTQELSR 109
Query: 196 EDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRS 255
++ E E E E E + + R ++ S +PK ++ RS
Sbjct: 110 PNSPSEGEGEGEGEGESSDSRSVNEEGS------------SDPKDI--------DQDNRS 149
Query: 256 TKPTV-VPQPRMSDKD 270
+ P++ PQ SD D
Sbjct: 150 SSPSIPSPQDNESDSD 165
>gnl|CDD|177557 PHA03209, PHA03209, serine/threonine kinase US3; Provisional.
Length = 357
Score = 32.9 bits (75), Expect = 1.1
Identities = 12/40 (30%), Positives = 21/40 (52%), Gaps = 3/40 (7%)
Query: 169 MFDDDLEDNMDDDD---DDDDDDETDDDNDEDNEEESEEE 205
D+DL ++ D D DDD DD+D+D ++++
Sbjct: 19 AADEDLYSDISDGDLEYSDDDSASESDDDDDDGLIPTKQK 58
Score = 31.8 bits (72), Expect = 2.2
Identities = 12/34 (35%), Positives = 18/34 (52%), Gaps = 2/34 (5%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEE 204
D DLE DDD + D+ DDD ++++ E
Sbjct: 30 DGDLEY--SDDDSASESDDDDDDGLIPTKQKARE 61
>gnl|CDD|224045 COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport
systems, ATPase components [Inorganic ion transport and
metabolism / Coenzyme metabolism].
Length = 258
Score = 32.5 bits (75), Expect = 1.1
Identities = 13/37 (35%), Positives = 18/37 (48%), Gaps = 5/37 (13%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
+ ++GP GKSTL++CL L G V L
Sbjct: 30 ITGILGPNGSGKSTLLKCLAGL-----LKPKSGEVLL 61
>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family. This protein is
found in bacteria. Proteins in this family are typically
between 234 to 465 amino acids in length. There is a
conserved GEV sequence motif.Most members are annotated
as being SprA-related.
Length = 261
Score = 32.4 bits (74), Expect = 1.2
Identities = 33/212 (15%), Positives = 70/212 (33%), Gaps = 51/212 (24%)
Query: 165 RRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK 224
++ + + +D + EEE E+++ +++ E++ +
Sbjct: 50 QKAKLKGQASTAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQE 109
Query: 225 NEHVQELTAKQKNPKA-----------------FTFQ------------------SVIKG 249
+ V+EL A+ + +A +T++ S + G
Sbjct: 110 EKQVRELKARDREVRAHEQAHAAVGGQYAGAPSYTYERGPDGKRYAVGGEVSIDVSPVPG 169
Query: 250 E-----RKFR-----STKPTVVPQPRMSDKDLA--IRHKITSALAGLNSKEEEEEEQEDS 297
+ K + + P +P D+ +A K A A L + +EE +ED
Sbjct: 170 DPQATINKMQQVRAAALAPA---EPSSQDRRVAAEATQKENEARAELAQERQEEASEEDE 226
Query: 298 AFSDESADDNSSDEDETVEPKTHKDSKNPEED 329
A + +N E + P T + E
Sbjct: 227 A-EALAGAENQITALEPLAPSTPQVDGQVEAT 257
>gnl|CDD|239388 cd03114, ArgK-like, The function of this protein family is unkown.
The protein sequences are similar to the ArgK protein in
E. coli. ArgK protein is a membrane ATPase which is
required for transporting arginine, ornithine and lysine
into the cells by the arginine and ornithine (AO system)
and lysine, arginine and ornithine (LAO) transport
systems.
Length = 148
Score = 31.5 bits (72), Expect = 1.2
Identities = 11/25 (44%), Positives = 13/25 (52%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTK 462
V+ + G P GKSTLI LI
Sbjct: 1 VIGITGVPGAGKSTLIDALITALRA 25
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 32.9 bits (75), Expect = 1.2
Identities = 20/125 (16%), Positives = 51/125 (40%), Gaps = 3/125 (2%)
Query: 127 DSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDD 186
+ D++D+S + +++K Q + +++E + + + +
Sbjct: 27 IQQNNGDKEDSSTSTSSLSVSAVEKTSNAKEEIQVDFQHNSESAVEEVEAEDEIEVEQNQ 86
Query: 187 DDETDDDNDEDNEEESEEEEENEEDD-DEERKAEKKNSKNEHVQELTAKQ--KNPKAFTF 243
D + E S + + ++D D + K E++N + ++EL + + K F +
Sbjct: 87 SDVLKSSSIVKEESISTDMDGIDDDSLDRKLKLERENLRKREIEELAEENFSRGNKLFVY 146
Query: 244 QSVIK 248
V+K
Sbjct: 147 PQVVK 151
>gnl|CDD|151773 pfam11332, DUF3134, Protein of unknown function (DUF3134). This
family of proteins with unknown function appears to be
restricted to Cyanobacteria.
Length = 72
Score = 30.0 bits (68), Expect = 1.2
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDD 193
++++ + + DDDD DDD+ +DD
Sbjct: 50 EEEISELIGDDDDYYDDDDDEDD 72
>gnl|CDD|223998 COG1072, CoaA, Panthothenate kinase [Coenzyme metabolism].
Length = 283
Score = 32.6 bits (75), Expect = 1.2
Identities = 10/27 (37%), Positives = 14/27 (51%)
Query: 430 TPLEPPPIVVAVVGPPQVGKSTLIRCL 456
T + P ++ + G VGKST R L
Sbjct: 76 TNNQQRPFIIGIAGSVAVGKSTTARIL 102
>gnl|CDD|215874 pfam00350, Dynamin_N, Dynamin family.
Length = 168
Score = 31.8 bits (73), Expect = 1.3
Identities = 23/99 (23%), Positives = 37/99 (37%), Gaps = 13/99 (13%)
Query: 439 VAVVGPPQVGKSTLIRCLI-KNFTKTPLSVIKGPVTLIIKDSIRDCFVTGKWKASEDASE 497
+AVVG GKS+++ L+ ++ P GP T + + A
Sbjct: 1 IAVVGDQSAGKSSVLNALLGRDI--LPRGP--GPTT--------RRPLVLRLGEEPGAIP 48
Query: 498 LLRLDDMDDDEELFGDFEDLETGEKHSGDKSGGGSGGVS 536
+ D + F DF +L + DK G G+S
Sbjct: 49 GAVKVEYKDGLKKFEDFSELREEIEDETDKISGTGKGIS 87
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 31.5 bits (70), Expect = 1.3
Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 2/87 (2%)
Query: 162 GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDN--EEESEEEEENEEDDDEERKAE 219
G K KV DD + DD+ +++++ E +++ +E+ EEE EEE E EE +EE + E
Sbjct: 44 GEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPE 103
Query: 220 KKNSKNEHVQELTAKQKNPKAFTFQSV 246
+ + + P A T +SV
Sbjct: 104 PEPEPEPEPEPEPEPEPEPGAATLKSV 130
Score = 30.3 bits (67), Expect = 3.1
Identities = 21/77 (27%), Positives = 36/77 (46%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNP 238
DD + D DDE +++ + + EEE EEE E EE+ +EE + E+ + + + P
Sbjct: 53 DDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEP 112
Query: 239 KAFTFQSVIKGERKFRS 255
+ G +S
Sbjct: 113 EPEPEPEPEPGAATLKS 129
>gnl|CDD|226164 COG3638, COG3638, ABC-type phosphate/phosphonate transport system,
ATPase component [Inorganic ion transport and
metabolism].
Length = 258
Score = 32.2 bits (74), Expect = 1.3
Identities = 11/18 (61%), Positives = 14/18 (77%)
Query: 439 VAVVGPPQVGKSTLIRCL 456
VA++GP GKSTL+R L
Sbjct: 33 VAIIGPSGAGKSTLLRSL 50
>gnl|CDD|215628 PLN03196, PLN03196, MOC1-like protein; Provisional.
Length = 487
Score = 32.8 bits (75), Expect = 1.3
Identities = 15/49 (30%), Positives = 23/49 (46%), Gaps = 3/49 (6%)
Query: 171 DDDLEDNMDDDDDDDDDDE---TDDDNDEDNEEESEEEEENEEDDDEER 216
DD E M D + ++ E + E ES +EE+++DDDE
Sbjct: 432 DDKFEQRMSGDFIEGEEMEPLFSMGGKLEMPGSESVSDEEDDDDDDEVL 480
>gnl|CDD|224042 COG1117, PstB, ABC-type phosphate transport system, ATPase
component [Inorganic ion transport and metabolism].
Length = 253
Score = 32.1 bits (74), Expect = 1.4
Identities = 16/37 (43%), Positives = 23/37 (62%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
V A++GP GKSTL+RCL + P + ++G V L
Sbjct: 35 VTALIGPSGCGKSTLLRCLNRMNDLIPGARVEGEVLL 71
>gnl|CDD|181906 PRK09493, glnQ, glutamine ABC transporter ATP-binding protein;
Reviewed.
Length = 240
Score = 32.0 bits (73), Expect = 1.4
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 438 VVAVVGPPQVGKSTLIRCL 456
VV ++GP GKSTL+RC+
Sbjct: 29 VVVIIGPSGSGKSTLLRCI 47
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 32.8 bits (74), Expect = 1.4
Identities = 44/227 (19%), Positives = 89/227 (39%), Gaps = 9/227 (3%)
Query: 195 DEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFR 254
D E ++ EE ++D +E +KAE++ + E + A+ + Q+ IK E +
Sbjct: 1222 DAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFA--RRQAAIKAEEARK 1279
Query: 255 STKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDET 314
+ + + + +D+ A + + K EE ++ +++ E A + +
Sbjct: 1280 ADELKKAEEKKKADE--AKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKK 1337
Query: 315 VEPKTHKDSKNPEEDTGLNWKSDLAQKAASAFLERQANIVNLAKYVYGDMEDVSVTMEGD 374
E E +++ A++ A A +++ A E+ + D
Sbjct: 1338 AEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKK---KAD 1394
Query: 375 EAPNKKVHRKRQAELTAKQKNPKAFTFQSVIKGERKFRRKEDIQAKK 421
EA K K++A+ K K ++ K E K +K D KK
Sbjct: 1395 EAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK--KKADEAKKK 1439
Score = 31.3 bits (70), Expect = 3.9
Identities = 41/208 (19%), Positives = 86/208 (41%), Gaps = 12/208 (5%)
Query: 133 DEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDD 192
+ED ++ + + KK E + + K+ + + + E+ ++ ++E
Sbjct: 1671 EEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKI 1730
Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERK 252
+E +E E++++ E E +K E++ K H+++ K+ ++VI+ E
Sbjct: 1731 KAEEAKKEAEEDKKKAE----EAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELD 1786
Query: 253 FRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKE---EEEEEQEDSAFSDESADDNSS 309
K + ++ D I + + G + +E EDSA E AD +
Sbjct: 1787 EEDEKRRMEVDKKIKD----IFDNFANIIEGGKEGNLVINDSKEMEDSA-IKEVADSKNM 1841
Query: 310 DEDETVEPKTHKDSKNPEEDTGLNWKSD 337
+E + HK +KN E N ++D
Sbjct: 1842 QLEEADAFEKHKFNKNNENGEDGNKEAD 1869
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 28.8 bits (65), Expect = 1.4
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 192 DDNDEDNEEESEEEEENEEDDDEER 216
++ ++EEE+E+EEE EE + +R
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVKR 46
Score = 28.5 bits (64), Expect = 2.0
Identities = 9/25 (36%), Positives = 18/25 (72%)
Query: 196 EDNEEESEEEEENEEDDDEERKAEK 220
E+ + E EEE E+EE+ +E+ + ++
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVKR 46
Score = 28.1 bits (63), Expect = 2.9
Identities = 11/23 (47%), Positives = 19/23 (82%)
Query: 199 EEESEEEEENEEDDDEERKAEKK 221
EE+ E+EEENE++++ E ++E K
Sbjct: 23 EEKREDEEENEDEEEGEEQSEVK 45
Score = 26.9 bits (60), Expect = 8.0
Identities = 8/25 (32%), Positives = 16/25 (64%)
Query: 197 DNEEESEEEEENEEDDDEERKAEKK 221
+ E+ +EEE +E++ EE+ K+
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVKR 46
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and
387 amino acids in length.
Length = 213
Score = 31.9 bits (73), Expect = 1.4
Identities = 8/25 (32%), Positives = 12/25 (48%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEE 204
DDD D+ D DE+ ++E
Sbjct: 189 GAPDDDLDEYDYGDEDEEEDDEPPW 213
>gnl|CDD|188098 TIGR00957, MRP_assoc_pro, multi drug resistance-associated protein
(MRP). This model describes multi drug
resistance-associated protein (MRP) in eukaryotes. The
multidrug resistance-associated protein is an integral
membrane protein that causes multidrug resistance when
overexpressed in mammalian cells. It belongs to ABC
transporter superfamily. The protein topology and
function was experimentally demonstrated by epitope
tagging and immunofluorescence. Insertion of tags in the
critical regions associated with drug efflux, abrogated
its function. The C-terminal domain seem to highly
conserved [Transport and binding proteins, Other].
Length = 1522
Score = 32.6 bits (74), Expect = 1.5
Identities = 36/116 (31%), Positives = 50/116 (43%), Gaps = 20/116 (17%)
Query: 434 PPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSV-IKGPVTL------IIKDSIRDCFVT 486
P +VAVVG GKS+L+ L+ K V +KG V I DS+R+ +
Sbjct: 662 PEGALVAVVGQVGCGKSSLLSALLAEMDKVEGHVHMKGSVAYVPQQAWIQNDSLRENILF 721
Query: 487 GK------WKASEDASELLRLDDMDDDEELFGDFEDLETGEKHSGDKSGGGSGGVS 536
GK ++ +A LL D E+ + E GEK + SGG VS
Sbjct: 722 GKALNEKYYQQVLEACALL------PDLEILPSGDRTEIGEK-GVNLSGGQKQRVS 770
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 29.9 bits (68), Expect = 1.5
Identities = 12/15 (80%), Positives = 13/15 (86%)
Query: 199 EEESEEEEENEEDDD 213
EE+ EEEEE EEDDD
Sbjct: 68 EEKKEEEEEEEEDDD 82
Score = 28.0 bits (63), Expect = 7.5
Identities = 11/15 (73%), Positives = 13/15 (86%)
Query: 199 EEESEEEEENEEDDD 213
EEE +EEEE EE+DD
Sbjct: 67 EEEKKEEEEEEEEDD 81
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 30.4 bits (69), Expect = 1.5
Identities = 13/20 (65%), Positives = 16/20 (80%)
Query: 196 EDNEEESEEEEENEEDDDEE 215
E+ EEE EEEEE EE+ +EE
Sbjct: 76 EEEEEEEEEEEEEEEESEEE 95
Score = 28.9 bits (65), Expect = 5.4
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 196 EDNEEESEEEEENEEDDDEERKAE 219
+ EEE EEEEE EE++ EE
Sbjct: 75 AEEEEEEEEEEEEEEEESEEEAMA 98
Score = 28.9 bits (65), Expect = 5.6
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 196 EDNEEESEEEEENEEDDDEERKA 218
EEE EEEEE EE+++ E +A
Sbjct: 74 AAEEEEEEEEEEEEEEEESEEEA 96
>gnl|CDD|237422 PRK13548, hmuV, hemin importer ATP-binding subunit; Provisional.
Length = 258
Score = 32.1 bits (74), Expect = 1.5
Identities = 12/19 (63%), Positives = 15/19 (78%)
Query: 438 VVAVVGPPQVGKSTLIRCL 456
VVA++GP GKSTL+R L
Sbjct: 30 VVAILGPNGAGKSTLLRAL 48
>gnl|CDD|240521 cd13394, Syo1_like, Fungal symportin 1 (syo1) and similar proteins.
This family of eukaryotic proteins includes
Saccharomyces cerevisiae Ydl063c and Chaetomium
thermophilum Syo1, which mediate the co-import of two
ribosomal proteins, Rpl5 and Rpl11 (which both interact
with 5S rRNA) into the nucleus. Import precedes their
association with rRNA and subsequent ribosome assembly
in the nucleolus. The primary structure of syo1 is a
mixture of Armadillo- (ARM, N-terminal part of syo1) and
HEAT-repeats (C-terminal part of syo1).
Length = 597
Score = 32.7 bits (75), Expect = 1.5
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
+ D +D+ +D+ D+DE +D++DE +E E E
Sbjct: 304 ELDKKDDEAMEDEQSDEDEEEDEDDEASETLVPELSPLE 342
Score = 30.7 bits (70), Expect = 5.2
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
+ D DD+ +D+ +++EEE E++E +E E E
Sbjct: 304 ELDKKDDEAMEDEQSDEDEEEDEDDEASETLVPELSPLE 342
>gnl|CDD|215427 PLN02796, PLN02796, D-glycerate 3-kinase.
Length = 347
Score = 32.4 bits (74), Expect = 1.5
Identities = 20/70 (28%), Positives = 31/70 (44%), Gaps = 10/70 (14%)
Query: 418 QAKKHHVPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLIIK 477
Q + H D E PP+V+ + P GK+TL+ L+ F T + +L
Sbjct: 84 QLEAHRSKFKDGD--EIPPLVIGISAPQGCGKTTLVFALVYLFNATG----RRAASL--- 134
Query: 478 DSIRDCFVTG 487
SI D ++T
Sbjct: 135 -SIDDFYLTA 143
>gnl|CDD|206646 cd00880, Era_like, E. coli Ras-like protein (Era)-like GTPase. The
Era (E. coli Ras-like protein)-like family includes
several distinct subfamilies (TrmE/ThdF, FeoB, YihA
(EngB), Era, and EngA/YfgK) that generally show sequence
conservation in the region between the Walker A and B
motifs (G1 and G3 box motifs), to the exclusion of other
GTPases. TrmE is ubiquitous in bacteria and is a
widespread mitochondrial protein in eukaryotes, but is
absent from archaea. The yeast member of TrmE family,
MSS1, is involved in mitochondrial translation;
bacterial members are often present in
translation-related operons. FeoB represents an unusual
adaptation of GTPases for high-affinity iron (II)
transport. YihA (EngB) family of GTPases is typified by
the E. coli YihA, which is an essential protein involved
in cell division control. Era is characterized by a
distinct derivative of the KH domain (the pseudo-KH
domain) which is located C-terminal to the GTPase
domain. EngA and its orthologs are composed of two
GTPase domains and, since the sequences of the two
domains are more similar to each other than to other
GTPases, it is likely that an ancient gene duplication,
rather than a fusion of evolutionarily distinct GTPases,
gave rise to this family.
Length = 161
Score = 31.4 bits (72), Expect = 1.5
Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 15/51 (29%)
Query: 440 AVVGPPQVGKSTLIRCLIK----------NFTKTPLSV-----IKGPVTLI 475
A+ G P VGKS+L+ L+ T+ P+ GPV LI
Sbjct: 1 AIFGRPNVGKSSLLNALLGQNVGIVSPIPGTTRDPVRKEWELLPLGPVVLI 51
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 32.3 bits (74), Expect = 1.6
Identities = 12/28 (42%), Positives = 17/28 (60%)
Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEE 210
+ + DE ++ DE+ EEE EE EE E
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEP 370
Score = 31.6 bits (72), Expect = 3.2
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 187 DDETDDDNDEDNEEESEEEEENEEDDDEE 215
+ E D+ +E +EE EEEEE E+ + E
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPE 371
>gnl|CDD|206665 cd01876, YihA_EngB, YihA (EngB) GTPase family. The YihA (EngB)
subfamily of GTPases is typified by the E. coli YihA, an
essential protein involved in cell division control.
YihA and its orthologs are small proteins that typically
contain less than 200 amino acid residues and consists
of the GTPase domain only (some of the eukaryotic
homologs contain an N-terminal extension of about 120
residues that might be involved in organellar
targeting). Homologs of yihA are found in most
Gram-positive and Gram-negative pathogenic bacteria,
with the exception of Mycobacterium tuberculosis. The
broad-spectrum nature of YihA and its essentiality for
cell viability in bacteria make it an attractive
antibacterial target.
Length = 170
Score = 31.3 bits (72), Expect = 1.6
Identities = 13/31 (41%), Positives = 15/31 (48%), Gaps = 5/31 (16%)
Query: 439 VAVVGPPQVGKSTLIRCLIKN-----FTKTP 464
VA G VGKS+LI L +KTP
Sbjct: 2 VAFAGRSNVGKSSLINALTNRKKLARTSKTP 32
>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1. This
family consists of several acidic phosphoprotein
precursor PCEMA1 sequences which appear to be found
exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
that is associated with the membrane of the infected
erythrocyte throughout the entire intraerythrocytic
cycle. The exact function of this family is unclear.
Length = 286
Score = 32.2 bits (73), Expect = 1.6
Identities = 17/97 (17%), Positives = 38/97 (39%), Gaps = 7/97 (7%)
Query: 129 DDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDD 188
DDE DE+ TS + E + + D + ++ + D ++D++
Sbjct: 188 DDELDEEVTSYLN-ELDDEVTSYFNDGENEENDDE------LEAEVISYLKDGENDNEVK 240
Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
E + + + E E +D+ E + E++ +
Sbjct: 241 EKIRREYREWKGDKANTNETEIEDESEDEYEEEAGEE 277
Score = 30.6 bits (69), Expect = 4.4
Identities = 14/67 (20%), Positives = 29/67 (43%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
+DD +DD+ D++ ++ D++ + E E DD+ E + E+ E
Sbjct: 179 EDDASTLHEDDELDEEVTSYLNELDDEVTSYFNDGENEENDDELEAEVISYLKDGENDNE 238
Query: 231 LTAKQKN 237
+ K +
Sbjct: 239 VKEKIRR 245
>gnl|CDD|223447 COG0370, FeoB, Fe2+ transport system protein B [Inorganic ion
transport and metabolism].
Length = 653
Score = 32.6 bits (75), Expect = 1.6
Identities = 10/17 (58%), Positives = 13/17 (76%)
Query: 437 IVVAVVGPPQVGKSTLI 453
+ VA+VG P VGK+TL
Sbjct: 4 LTVALVGNPNVGKTTLF 20
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 31.5 bits (72), Expect = 1.6
Identities = 17/90 (18%), Positives = 35/90 (38%), Gaps = 29/90 (32%)
Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE------- 215
K ++ LE + + ++ + + D+ DE E +EE+DD
Sbjct: 17 PPSPKEEEEEALELAIQEAHENALEKKLLDELDE---------ELDEEEDDRFLEEYRRK 67
Query: 216 -----RKAEKKN--------SKNEHVQELT 232
+ +K+ SK ++V+E+T
Sbjct: 68 RLAEMKALAEKSKFGEVYEISKPDYVREVT 97
>gnl|CDD|206727 cd04164, trmE, trmE is a tRNA modification GTPase. TrmE (MnmE,
ThdF, MSS1) is a 3-domain protein found in bacteria and
eukaryotes. It controls modification of the uridine at
the wobble position (U34) of tRNAs that read codons
ending with A or G in the mixed codon family boxes. TrmE
contains a GTPase domain that forms a canonical Ras-like
fold. It functions a molecular switch GTPase, and
apparently uses a conformational change associated with
GTP hydrolysis to promote the tRNA modification
reaction, in which the conserved cysteine in the
C-terminal domain is thought to function as a catalytic
residue. In bacteria that are able to survive in
extremely low pH conditions, TrmE regulates
glutamate-dependent acid resistance.
Length = 159
Score = 31.3 bits (72), Expect = 1.7
Identities = 10/22 (45%), Positives = 13/22 (59%)
Query: 437 IVVAVVGPPQVGKSTLIRCLIK 458
I V + G P VGKS+L+ L
Sbjct: 4 IKVVIAGKPNVGKSSLLNALAG 25
Score = 29.8 bits (68), Expect = 5.5
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 74 ADLVLLLIDASFGFEMEIFEFLNICQ 99
ADLVLL++DAS G + E E L +
Sbjct: 83 ADLVLLVVDASEGLDEEDLEILELPA 108
>gnl|CDD|114045 pfam05297, Herpes_LMP1, Herpesvirus latent membrane protein 1
(LMP1). This family consists of several latent membrane
protein 1 or LMP1s mostly from Epstein-Barr virus. LMP1
of EBV is a 62-65 kDa plasma membrane protein possessing
six membrane spanning regions, a short cytoplasmic
N-terminus and a long cytoplasmic carboxy tail of 200
amino acids. EBV latent membrane protein 1 (LMP1) is
essential for EBV-mediated transformation and has been
associated with several cases of malignancies. EBV-like
viruses in Cynomolgus monkeys (Macaca fascicularis) have
been associated with high lymphoma rates in
immunosuppressed monkeys.
Length = 382
Score = 32.3 bits (73), Expect = 1.7
Identities = 25/99 (25%), Positives = 42/99 (42%), Gaps = 13/99 (13%)
Query: 128 SDDENDEDDTSDDDTE-----SGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDD 182
+DD E D++ ++ SG G Q + + G D+ +DN D
Sbjct: 208 TDDSGHESDSNSNEGRHHLLVSGAGDGPPLCSQNLGAPGGGPDNGPQDPDNTDDNGPQDP 267
Query: 183 DDDDD------DETDDD--NDEDNEEESEEEEENEEDDD 213
D+ DD D TDD+ D DN ++ ++ + DD+
Sbjct: 268 DNTDDNGPQDPDNTDDNGPQDPDNTADNGPQDPDNTDDN 306
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 32.4 bits (73), Expect = 1.7
Identities = 24/132 (18%), Positives = 54/132 (40%), Gaps = 4/132 (3%)
Query: 122 AISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
A + D D++ T+ ++ S+ + K Q + S+ + +L D DD
Sbjct: 349 AFTDDGIRDKDFSAYTASKLSDEDDDSVMESKMQKLFSEK---EIDFGLNSELVDMSDDG 405
Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAE-KKNSKNEHVQELTAKQKNPKA 240
++ + +D NE ES+++ E + + + E ++N K +++
Sbjct: 406 ENGEMEDTFTSHLPASNESESDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDLK 465
Query: 241 FTFQSVIKGERK 252
+S+ K +K
Sbjct: 466 DKKESINKKNKK 477
>gnl|CDD|218055 pfam04386, SspB, Stringent starvation protein B. Escherichia coli
stringent starvation protein B (SspB), is thought to
enhance the specificity of degradation of tmRNA-tagged
proteins by the ClpXP protease. The tmRNA tag, also
known as ssrA, is an 11-aa peptide added to the C
terminus of proteins stalled during translation, targets
proteins for degradation by ClpXP and ClpAP. SspB a
cytoplasmic protein that specifically binds to residues
1-4 and 7 of the tag. Binding of SspB enhances
degradation of tagged proteins by ClpX, and masks
sequence elements important for ClpA interactions,
inhibiting degradation by ClpA. However, more recent
work has cast doubt on the importance of SspB in
wild-type cells. SspB is encoded in an operon whose
synthesis is stimulated by carbon, amino acid, and
phosphate starvation. SspB may play a special role
during nutrient stress, for example by ensuring rapid
degradation of the products of stalled translation,
without causing a global increase in degradation of all
ClpXP substrates.
Length = 153
Score = 31.0 bits (71), Expect = 1.8
Identities = 10/45 (22%), Positives = 21/45 (46%), Gaps = 3/45 (6%)
Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
F+ + D ++++++DDD D DED E + + +
Sbjct: 109 QFEPEEAD---EEEEEEDDDADDSPEDEDPEPKDPPKTKGRPSLK 150
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 32.3 bits (73), Expect = 1.8
Identities = 33/176 (18%), Positives = 73/176 (41%), Gaps = 7/176 (3%)
Query: 150 KKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
+K + + Q + + M +D +N ++++ ++ + + E+ E ++ E++N+
Sbjct: 81 QKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKND 140
Query: 210 EDDDEERKAEKK--NSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMS 267
D EE + E+K + E + + ++N F + E F
Sbjct: 141 WRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGK 200
Query: 268 DKDLAIRHKITSALAGLNSKEEEEE-----EQEDSAFSDESADDNSSDEDETVEPK 318
+ + + + +AL K++ EE E+E+ E AD S +E+E K
Sbjct: 201 EFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLK 256
Score = 31.9 bits (72), Expect = 2.2
Identities = 49/268 (18%), Positives = 94/268 (35%), Gaps = 28/268 (10%)
Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDD--DEERKAEK 220
R+RR+ ++ L ++ T + + N EE + + +D+ ER A +
Sbjct: 9 RERRRREREEQLRQ--KQEEGSLGQVTTQVEVNSQNSVPDEESKTSTDDEAALLERLARR 66
Query: 221 KNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSA 280
+ ++E E +QK E K ST ++ R +D ++
Sbjct: 67 EERRDERFSEALERQK-------------EFKPTSTDQSLSEPSRRMQEDSGAENETVEE 113
Query: 281 LAGLNSKEEEEEEQEDSAFSDESADDNSSDE----DETVEPKTHKDSKNPEEDTGLNWKS 336
S+EE EE +E + ++ D E EP+ ++ K N
Sbjct: 114 EEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGE 173
Query: 337 DLAQKAASAFLERQANIVNLAKYVYGDMEDVS------VTMEGDEAPNKKVHRKRQAELT 390
+ K A+ G + +E +E K+ R++ E
Sbjct: 174 FMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLE-E 232
Query: 391 AKQKNPKAFTFQSVIKGERKFRRKEDIQ 418
+Q+ + + + E K R KE+I+
Sbjct: 233 EEQRRKQEEADRKSREEEEKRRLKEEIE 260
>gnl|CDD|217943 pfam04180, LTV, Low temperature viability protein. The
low-temperature viability protein LTV1 is involved in
ribosome biogenesis 40S subunit production.
Length = 426
Score = 32.3 bits (73), Expect = 1.8
Identities = 21/143 (14%), Positives = 51/143 (35%), Gaps = 13/143 (9%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
D+ N ++ ++DD ++ + D++E +++ S +E +D + ++ K E
Sbjct: 168 DEAYVRNDEESEEDDGEEWEEMDDEEGSDDRSSAGFLDEFEDLSAPGSPQEAMKKHLFVE 227
Query: 231 LTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITS--ALAGLNSKE 288
+ +N + V++ + + K + ++ E
Sbjct: 228 EETESRNTEYSETSEVMRRNE-----------DWTLHFERFKKFEKKNDDVEIGAADNSE 276
Query: 289 EEEEEQEDSAFSDESADDNSSDE 311
E + DS E +D E
Sbjct: 277 LEGSIEVDSNRLQEVLNDYYKTE 299
Score = 30.7 bits (69), Expect = 4.8
Identities = 29/163 (17%), Positives = 57/163 (34%), Gaps = 26/163 (15%)
Query: 178 MDDDDDDDDD---DETDDDNDEDNEEESEEE--EENEEDDDEERKAEKKNSKNEHVQELT 232
D D + +D+ N+EESEE+ EE EE DDEE ++ ++ E
Sbjct: 151 FQPDMDPRLREVLEALEDEAYVRNDEESEEDDGEEWEEMDDEEGSDDRSSAGFLDEFEDL 210
Query: 233 AKQKNP-KAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEE 291
+ +P +A ++ E + R+T + + + E+
Sbjct: 211 SAPGSPQEAMKKHLFVEEETESRNT-------------------EYSETSEVMRRNEDWT 251
Query: 292 EEQED-SAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLN 333
E F ++ D D + + + N ++ +
Sbjct: 252 LHFERFKKFEKKNDDVEIGAADNSELEGSIEVDSNRLQEVLND 294
Score = 30.7 bits (69), Expect = 4.8
Identities = 24/111 (21%), Positives = 38/111 (34%), Gaps = 12/111 (10%)
Query: 125 SDDSDDENDEDDTSDDDTE-SGQGSMKKY--------KEQVVSSQDGR-KRRKVMFDDDL 174
SDD D+ D S Q +MKK+ S+ RR +
Sbjct: 195 SDDRSSAGFLDEFEDLSAPGSPQEAMKKHLFVEEETESRNTEYSETSEVMRRNEDWTLHF 254
Query: 175 EDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKN 225
E + +DD + DN E E S E + N + + + +K+
Sbjct: 255 ERFKKFEKKNDDVEIGAADNSE--LEGSIEVDSNRLQEVLNDYYKTEKAKS 303
>gnl|CDD|222046 pfam13324, GCIP, Grap2 and cyclin-D-interacting. GCIP, or Grap2
and cyclin-D-interacting protein, is found in
eukaryotes, and in the human protein CCNDBP1, residues
149-190 constitute a helix-loop-helix domain, residues
190-240 an acidic region, and 240-261 a leucine zipper
domain. GCIP interacts with full-length Grap2 protein
and with the COOH-terminal unique and SH3 domains
(designated QC domain) of Grap2. It is potentially
involved in the regulation of cell differentiation and
proliferation through Grap2 and cyclin D-mediated
signalling pathways. In mice, it is involved in
G1/S-phase progression of hepatocytes, which in older
animals is associated with the development of liver
tumours. In vitro it acts as an inhibitory HLH protein,
for example, blocking transcription of the HNF-4
promoter. In its function as a cyclin D1-binding protein
it is able to reduce CDK4-mediated phosphorylation of
the retinoblastoma protein and to inhibit E2F-mediated
transcriptional activity. GCIP has also been shown to
have interact physically with Rad (Ras associated with
diabetes), Rad being important in regulating cellular
senescence.
Length = 271
Score = 32.0 bits (73), Expect = 1.8
Identities = 17/34 (50%), Positives = 18/34 (52%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEE 205
DD ED D DDDD E DD ND+ E E E
Sbjct: 152 DDSEDPFSDGHHDDDDVEGDDLNDDRYWSEEEME 185
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 30.8 bits (70), Expect = 1.8
Identities = 10/59 (16%), Positives = 30/59 (50%), Gaps = 1/59 (1%)
Query: 174 LEDNMDDDDDDDDDDETDDDNDED-NEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
E++ + ++ D + + D +EE+E ++E +++++ +K+ + +EL
Sbjct: 18 AENDGKLQVKPNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEEL 76
Score = 30.0 bits (68), Expect = 3.6
Identities = 9/52 (17%), Positives = 20/52 (38%)
Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
D L+ + + D + TD +E ++ E ++ ++ EK
Sbjct: 20 NDGKLQVKPNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKP 71
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 32.0 bits (73), Expect = 1.9
Identities = 41/158 (25%), Positives = 59/158 (37%), Gaps = 25/158 (15%)
Query: 195 DEDNEEESEEEEENEEDDDEERKAEK--KNSKNE-----HVQELTAKQKNPKAFTFQSVI 247
DE+ EE E+E E E D +K E +SK E ++EL Q S++
Sbjct: 114 DEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQSRRADVDVNSML 173
Query: 248 KGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDN 307
E FR K + ++D A+ + S E EE A DE ++D+
Sbjct: 174 --EALFRREKKE---EEEEEEEDEALIKSL--------SFGPETEEDRRRA-DDEDSEDD 219
Query: 308 SSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAASA 345
D D T PK+ S A K + A
Sbjct: 220 EEDNDNTPSPKSGSSSPAKPTSI----LKKSAAKRSEA 253
>gnl|CDD|217059 pfam02480, Herpes_gE, Alphaherpesvirus glycoprotein E.
Glycoprotein E (gE) of Alphaherpesvirus forms a complex
with glycoprotein I (gI) (pfam01688), functioning as an
immunoglobulin G (IgG) Fc binding protein. gE is
involved in virus spread but is not essential for
propagation.
Length = 437
Score = 32.0 bits (73), Expect = 2.0
Identities = 6/28 (21%), Positives = 14/28 (50%)
Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENE 209
++ +D + D +D E +S+ + E
Sbjct: 400 TNELFEDVSSDSESDDEEFDSDSDREPS 427
>gnl|CDD|206684 cd01897, NOG, Nucleolar GTP-binding protein (NOG). NOG1 is a
nucleolar GTP-binding protein present in eukaryotes
ranging from trypanosomes to humans. NOG1 is
functionally linked to ribosome biogenesis and found in
association with the nuclear pore complexes and
identified in many preribosomal complexes. Thus, defects
in NOG1 can lead to defects in 60S biogenesis. The S.
cerevisiae NOG1 gene is essential for cell viability,
and mutations in the predicted G motifs abrogate
function. It is a member of the ODN family of
GTP-binding proteins that also includes the bacterial
Obg and DRG proteins.
Length = 167
Score = 31.0 bits (71), Expect = 2.0
Identities = 10/22 (45%), Positives = 14/22 (63%), Gaps = 2/22 (9%)
Query: 435 PPIVVAVVGPPQVGKSTLIRCL 456
+V+A G P VGKS+L+ L
Sbjct: 1 RTLVIA--GYPNVGKSSLVNKL 20
>gnl|CDD|237894 PRK15064, PRK15064, ABC transporter ATP-binding protein;
Provisional.
Length = 530
Score = 32.2 bits (74), Expect = 2.0
Identities = 9/19 (47%), Positives = 15/19 (78%)
Query: 439 VAVVGPPQVGKSTLIRCLI 457
+A++G VGK+TL+R L+
Sbjct: 348 LAIIGENGVGKTTLLRTLV 366
>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B. A
small number of taxonomically diverse prokaryotic
species, including Methanosarcina barkeri, have what
appears to be a second ATP synthase, in addition to the
normal F1F0 ATPase in bacteria and A1A0 ATPase in
archaea. These enzymes use ion gradients to synthesize
ATP, CC and in principle may run in either direction.
This model represents the F0 subunit B of this apparent
second ATP synthase.
Length = 246
Score = 31.6 bits (72), Expect = 2.1
Identities = 18/63 (28%), Positives = 29/63 (46%), Gaps = 16/63 (25%)
Query: 928 EREQKVASLM----------KMLRTNYSEKNSKEKQAMKARMVALKLRAKAAEEAKQQRQ 977
RE+K+A + + R Y EKN E + ++ KA EEA+ +RQ
Sbjct: 36 AREKKIAGELADADTKKREAEQERREYEEKNE-ELDQQREVLLT-----KAKEEAQAERQ 89
Query: 978 RVM 980
R++
Sbjct: 90 RLL 92
>gnl|CDD|206648 cd00882, Ras_like_GTPase, Rat sarcoma (Ras)-like superfamily of
small guanosine triphosphatases (GTPases). Ras-like
GTPase superfamily. The Ras-like superfamily of small
GTPases consists of several families with an extremely
high degree of structural and functional similarity. The
Ras superfamily is divided into at least four families
in eukaryotes: the Ras, Rho, Rab, and Sar1/Arf families.
This superfamily also includes proteins like the GTP
translation factors, Era-like GTPases, and G-alpha chain
of the heterotrimeric G proteins. Members of the Ras
superfamily regulate a wide variety of cellular
functions: the Ras family regulates gene expression, the
Rho family regulates cytoskeletal reorganization and
gene expression, the Rab and Sar1/Arf families regulate
vesicle trafficking, and the Ran family regulates
nucleocytoplasmic transport and microtubule
organization. The GTP translation factor family
regulates initiation, elongation, termination, and
release in translation, and the Era-like GTPase family
regulates cell division, sporulation, and DNA
replication. Members of the Ras superfamily are
identified by the GTP binding site, which is made up of
five characteristic sequence motifs, and the switch I
and switch II regions.
Length = 161
Score = 30.9 bits (70), Expect = 2.1
Identities = 9/19 (47%), Positives = 12/19 (63%)
Query: 440 AVVGPPQVGKSTLIRCLIK 458
VVG VGKS+L+ L+
Sbjct: 1 VVVGRGGVGKSSLLNALLG 19
>gnl|CDD|226371 COG3853, TelA, Uncharacterized protein involved in tellurite
resistance [Inorganic ion transport and metabolism].
Length = 386
Score = 32.0 bits (73), Expect = 2.1
Identities = 11/45 (24%), Positives = 21/45 (46%), Gaps = 1/45 (2%)
Query: 177 NMDDDDDDDDDDETDD-DNDEDNEEESEEEEENEEDDDEERKAEK 220
N D +++ +D+ D D E E ++E+ E D + +K
Sbjct: 4 NFDIENEATINDKVTDADLPEPENSEIVQKEKKAERDLQLENTKK 48
>gnl|CDD|227388 COG5055, RAD52, Recombination DNA repair protein (RAD52 pathway)
[DNA replication, recombination, and repair].
Length = 375
Score = 31.8 bits (72), Expect = 2.1
Identities = 18/64 (28%), Positives = 26/64 (40%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
D D D DD DD +DD D+D + + +E D K S N +
Sbjct: 256 DLKSLDASKQDQDDLLDDRSDDFQDDDLQNMGDTNVLTDEKDPVVAKQSPTASSNPQITF 315
Query: 231 LTAK 234
+TA+
Sbjct: 316 VTAR 319
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 32.2 bits (73), Expect = 2.1
Identities = 30/169 (17%), Positives = 62/169 (36%), Gaps = 11/169 (6%)
Query: 191 DDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGE 250
+ N+ EEE E+E+ EE +++K + K + + AK+K P + + E
Sbjct: 97 EPKNESGKEEEKEKEQVKEEK--KKKKEKPKEEPKDRKPKEEAKEKRPPK---EKEKEKE 151
Query: 251 RKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESAD----D 306
+K + + R + + R K N K+E EE++ + E+ +
Sbjct: 152 KKVEEPRDREEEKKRERVRAKS-RPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEE 210
Query: 307 NSSDEDETVEPKTHKDSKNPEEDTG-LNWKSDLAQKAASAFLERQANIV 354
+E+ E KD + + S+ ++ +
Sbjct: 211 PDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPS 259
Score = 31.4 bits (71), Expect = 3.6
Identities = 17/125 (13%), Positives = 38/125 (30%), Gaps = 10/125 (8%)
Query: 199 EEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTKP 258
+E + E EE+ ++E+ E+K K E +E +K + E++ K
Sbjct: 95 AKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEE-------AKEKRPPKEKE 147
Query: 259 TVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPK 318
+ +D R + ++ ++ + + E
Sbjct: 148 KEKEKKVEEPRD---REEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAV 204
Query: 319 THKDS 323
K
Sbjct: 205 KGKPE 209
Score = 30.2 bits (68), Expect = 7.2
Identities = 25/180 (13%), Positives = 64/180 (35%), Gaps = 10/180 (5%)
Query: 141 DTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEE 200
+ ++ G ++ +++ V + +K+ K + ++ + E + + ++ EE
Sbjct: 97 EPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156
Query: 201 ESEEEEENEEDDDEER---KAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTK 257
+ EEE + + + K K +E ++K +A E +
Sbjct: 157 PRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQA-------AREAVKGKPE 209
Query: 258 PTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEP 317
V + R ++D + T++ + + E S+ S + D + S
Sbjct: 210 EPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRE 269
>gnl|CDD|234719 PRK00300, gmk, guanylate kinase; Provisional.
Length = 205
Score = 31.2 bits (72), Expect = 2.1
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 441 VVGPPQVGKSTLIRCLIKNFTKTPLSV 467
+ GP GKSTL++ L++ LSV
Sbjct: 10 LSGPSGAGKSTLVKALLERDPNLQLSV 36
>gnl|CDD|188099 TIGR00972, 3a0107s01c2, phosphate ABC transporter, ATP-binding
protein. This model represents the ATP-binding protein
of a family of ABC transporters for inorganic phosphate.
In the model species Escherichia coli, a constitutive
transporter for inorganic phosphate, with low affinity,
is also present. The high affinity transporter that
includes this polypeptide is induced when extracellular
phosphate concentrations are low. The proteins most
similar to the members of this family but not included
appear to be amino acid transporters [Transport and
binding proteins, Anions].
Length = 247
Score = 31.5 bits (72), Expect = 2.2
Identities = 15/37 (40%), Positives = 20/37 (54%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
V A++GP GKSTL+R L + P I+G V
Sbjct: 29 VTALIGPSGCGKSTLLRSLNRMNDLVPGVRIEGKVLF 65
>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein. This
family consists of REP proteins from Dictyostelium
(Slime molds). REP protein is likely involved in
transcription regulation and control of DNA replication,
specifically amplification of plasmid at low copy
numbers. The formation of homomultimers may be required
for their regulatory activity.
Length = 910
Score = 32.1 bits (73), Expect = 2.2
Identities = 13/24 (54%), Positives = 18/24 (75%)
Query: 177 NMDDDDDDDDDDETDDDNDEDNEE 200
N DDD D+D+D + D+D DED +E
Sbjct: 887 NEDDDADEDEDQDEDEDEDEDEDE 910
Score = 30.2 bits (68), Expect = 8.2
Identities = 12/34 (35%), Positives = 22/34 (64%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEE 204
++ L +++DDD D++ D D DED +E+ +E
Sbjct: 877 EEYLISQFQENEDDDADEDEDQDEDEDEDEDEDE 910
Score = 30.2 bits (68), Expect = 8.9
Identities = 10/22 (45%), Positives = 18/22 (81%)
Query: 193 DNDEDNEEESEEEEENEEDDDE 214
D+D D +E+ +E+E+ +ED+DE
Sbjct: 889 DDDADEDEDQDEDEDEDEDEDE 910
>gnl|CDD|184582 PRK14236, PRK14236, phosphate transporter ATP-binding protein;
Provisional.
Length = 272
Score = 31.5 bits (72), Expect = 2.2
Identities = 11/19 (57%), Positives = 13/19 (68%)
Query: 438 VVAVVGPPQVGKSTLIRCL 456
V A +GP GKSTL+RC
Sbjct: 53 VTAFIGPSGCGKSTLLRCF 71
>gnl|CDD|131368 TIGR02315, ABC_phnC, phosphonate ABC transporter, ATP-binding
protein. Phosphonates are a class of
phosphorus-containing organic compound with a stable
direct C-P bond rather than a C-O-P linkage. A number of
bacterial species have operons, typically about 14 genes
in size, with genes for ATP-dependent transport of
phosphonates, degradation, and regulation of the
expression of the system. Members of this protein family
are the ATP-binding cassette component of tripartite ABC
transporters of phosphonates [Transport and binding
proteins, Anions].
Length = 243
Score = 31.5 bits (72), Expect = 2.3
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 438 VVAVVGPPQVGKSTLIRCL 456
VA++GP GKSTL+RC+
Sbjct: 30 FVAIIGPSGAGKSTLLRCI 48
>gnl|CDD|213191 cd03224, ABC_TM1139_LivF_branched, ATP-binding cassette domain of
branched-chain amino acid transporter. LivF (TM1139) is
part of the LIV-I bacterial ABC-type two-component
transport system that imports neutral, branched-chain
amino acids. The E. coli branched-chain amino acid
transporter comprises a heterodimer of ABC transporters
(LivF and LivG), a heterodimer of six-helix TM domains
(LivM and LivH), and one of two alternative soluble
periplasmic substrate binding proteins (LivK or LivJ).
ABC transporters are a large family of proteins involved
in the transport of a wide variety of different
compounds, like sugars, ions, peptides, and more complex
organic molecules.
Length = 222
Score = 31.3 bits (72), Expect = 2.3
Identities = 7/20 (35%), Positives = 15/20 (75%)
Query: 438 VVAVVGPPQVGKSTLIRCLI 457
+VA++G GK+TL++ ++
Sbjct: 28 IVALLGRNGAGKTTLLKTIM 47
>gnl|CDD|99707 cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of
cellular Activities) superfamily represents an ancient
group of ATPases belonging to the ASCE (for additional
strand, catalytic E) division of the P-loop NTPase fold.
The ASCE division also includes ABC, RecA-like,
VirD4-like, PilT-like, and SF1/2 helicases. Members of
the AAA+ ATPases function as molecular chaperons, ATPase
subunits of proteases, helicases, or nucleic-acid
stimulated ATPases. The AAA+ proteins contain several
distinct features in addition to the conserved
alpha-beta-alpha core domain structure and the Walker A
and B motifs of the P-loop NTPases.
Length = 151
Score = 30.6 bits (69), Expect = 2.3
Identities = 11/38 (28%), Positives = 18/38 (47%), Gaps = 1/38 (2%)
Query: 431 PLEPPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVI 468
L PP ++ + GPP GK+TL R + + +
Sbjct: 15 ELPPPKNLL-LYGPPGTGKTTLARAIANELFRPGAPFL 51
>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family. This
family includes the HABP4 family of hyaluronan-binding
proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
HABP4 has been observed to bind hyaluronan (a
glucosaminoglycan), but it is not known whether this is
its primary role in vivo. It has also been observed to
bind RNA, but with a lower affinity than that for
hyaluronan. PAI-1 mRNA-binding protein specifically
binds the mRNA of type-1 plasminogen activator inhibitor
(PAI-1), and is thought to be involved in regulation of
mRNA stability. However, in both cases, the sequence
motifs predicted to be important for ligand binding are
not conserved throughout the family, so it is not known
whether members of this family share a common function.
Length = 106
Score = 30.1 bits (68), Expect = 2.3
Identities = 13/50 (26%), Positives = 30/50 (60%)
Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELT 232
D+ + T++ +E+ + E EE++ EE+ ++E E++ + E +E+T
Sbjct: 31 KDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKEMT 80
>gnl|CDD|221866 pfam12935, Sec16_N, Vesicle coat trafficking protein Sec16
N-terminus. Sec16 is a multi-domain vesicle coat
protein. The overall function of Sec16 is in mediating
the movement of protein-cargo between the organelles of
the secretory pathway. Over-expression of truncated
mutants of only the N-terminus are lethal, and this
portion does not appear to be essential for function so
may act as a stabilising region.
Length = 246
Score = 31.4 bits (71), Expect = 2.3
Identities = 23/126 (18%), Positives = 51/126 (40%), Gaps = 12/126 (9%)
Query: 100 VHGESTEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSS 159
+ +ST + + + ++S S++ D +++ S+ S G+ + + SS
Sbjct: 99 LTRKSTSQVLDSLGLNPDSLSSPASAEPLDPT-AQNEFSNVLAASTDGNPESESQSEPSS 157
Query: 160 QDGRKRRKVMFDDDLEDNMDD-----------DDDDDDDDETDDDNDEDNEEESEEEEEN 208
++ R + DD+ E + D+DDD + + E ++ E +N
Sbjct: 158 EEELAARAELSDDESESTPTEDDLAERWQAFLDNDDDLLLDDETALAEGPNGDTPENSQN 217
Query: 209 EEDDDE 214
+DD
Sbjct: 218 TLNDDS 223
>gnl|CDD|110106 pfam01081, Aldolase, KDPG and KHG aldolase. This family includes
the following members: 4-hydroxy-2-oxoglutarate aldolase
(KHG-aldolase) Phospho-2-dehydro-3-deoxygluconate
aldolase (KDPG-aldolase).
Length = 196
Score = 31.3 bits (71), Expect = 2.3
Identities = 11/38 (28%), Positives = 20/38 (52%), Gaps = 1/38 (2%)
Query: 717 RSGTGFLAVQDVAKREPGFRVIATGTILDANQTAEVTK 754
R+ A++ + K P ++ GT+L+A Q AE +
Sbjct: 42 RTPCALDAIRLLRKNRPDA-LVGAGTVLNAQQLAEAAE 78
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 31.9 bits (72), Expect = 2.5
Identities = 20/97 (20%), Positives = 38/97 (39%), Gaps = 13/97 (13%)
Query: 128 SDDENDED------DTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDD 181
S DE ++D +S + ES K K+ VV+ D + + + + E
Sbjct: 50 SSDEEEQDYELRPRVSSSWNNESYNRLPIKTKDNVVA--DVNNGEEFLSESESEA----- 102
Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKA 218
+ D D D+ E++ E ++ D E++
Sbjct: 103 SLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKER 139
Score = 31.5 bits (71), Expect = 2.9
Identities = 9/50 (18%), Positives = 24/50 (48%)
Query: 185 DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAK 234
+ + E + D D ++E ++ E ++ E ++ +S+ E + + K
Sbjct: 97 ESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKERIASICTK 146
Score = 31.1 bits (70), Expect = 4.5
Identities = 7/51 (13%), Positives = 22/51 (43%)
Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
+ + + E D D ++ ++ EE++ E +++ +K ++
Sbjct: 97 ESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKERIASICTKI 147
>gnl|CDD|163585 TIGR03873, F420-0_ABC_ATP, proposed F420-0 ABC transporter,
ATP-binding protein. This small clade of ABC-type
transporter ATP-binding protein components is found as a
three gene cassette along with a periplasmic
substrate-binding protein (TIGR03868) and a permease
(TIGR03869). The organisms containing this cassette are
all Actinobacteria and all contain numerous genes
requiring the coenzyme F420. This model was defined
based on five such organisms, four of which are lacking
all F420 biosynthetic capability save the final
side-chain polyglutamate attachment step (via the gene
cofE: TIGR01916). In Jonesia denitrificans DSM 20603 and
marine actinobacterium PHSC20C1 this cassette is in an
apparent operon with the cofE gene and, in PHSC20C1,
also with a F420-dependent glucose-6-phosphate
dehydrogenase (TIGR03554). Based on these observations
we propose that this ATP-binding protein is a component
of an F420-0 (that is, F420 lacking only the
polyglutamate tail) transporter.
Length = 256
Score = 31.3 bits (71), Expect = 2.6
Identities = 15/33 (45%), Positives = 19/33 (57%), Gaps = 2/33 (6%)
Query: 424 VPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCL 456
V VD T PP + ++GP GKSTL+R L
Sbjct: 17 VDGVDVTA--PPGSLTGLLGPNGSGKSTLLRLL 47
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 29.9 bits (67), Expect = 2.6
Identities = 12/24 (50%), Positives = 14/24 (58%)
Query: 189 ETDDDNDEDNEEESEEEEENEEDD 212
T E +EE +EEEE EEDD
Sbjct: 82 ATAGAKAEAKKEEKKEEEEEEEDD 105
>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
Provisional.
Length = 291
Score = 31.3 bits (70), Expect = 2.7
Identities = 19/60 (31%), Positives = 32/60 (53%), Gaps = 7/60 (11%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERK-------AEKKNSKNEHVQELT 232
++DD+D DD++++ +D+D E EEEE+ +E EKK +E EL+
Sbjct: 75 EEDDEDVDDDSEEGDDDDGGAEDEEEEKVRGQSGQEGTVALGSGSTEKKLIGSEKQTELS 134
Score = 31.3 bits (70), Expect = 2.9
Identities = 14/27 (51%), Positives = 19/27 (70%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEE 205
+DD+D DDD E DD+D E+E EE+
Sbjct: 76 EDDEDVDDDSEEGDDDDGGAEDEEEEK 102
Score = 30.9 bits (69), Expect = 4.2
Identities = 16/34 (47%), Positives = 21/34 (61%), Gaps = 2/34 (5%)
Query: 179 DDDDDDDDDDETDDDND--EDNEEESEEEEENEE 210
DD+D DDD +E DDD+ ED EEE + +E
Sbjct: 77 DDEDVDDDSEEGDDDDGGAEDEEEEKVRGQSGQE 110
>gnl|CDD|129528 TIGR00436, era, GTP-binding protein Era. Era is an essential
GTPase in Escherichia coli and many other bacteria. It
plays a role in ribosome biogenesis. Few bacteria lack
this protein [Protein synthesis, Other].
Length = 270
Score = 31.2 bits (71), Expect = 2.7
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 439 VAVVGPPQVGKSTLIRCLIK 458
VA++G P VGKSTL+ L
Sbjct: 3 VAILGRPNVGKSTLLNQLHG 22
>gnl|CDD|222800 PHA00458, PHA00458, single-stranded DNA-binding protein.
Length = 233
Score = 31.3 bits (71), Expect = 2.7
Identities = 14/44 (31%), Positives = 25/44 (56%), Gaps = 5/44 (11%)
Query: 175 EDNMDDDDDDD-----DDDETDDDNDEDNEEESEEEEENEEDDD 213
ED+ D+D++D + ++E+ +E+ EEE E +ED D
Sbjct: 189 EDDWADEDEEDGYVASEAKARKPQDEEEWDEDDEEESEEDEDGD 232
Score = 29.4 bits (66), Expect = 9.5
Identities = 14/37 (37%), Positives = 23/37 (62%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEE 207
D+D ED + + +++ DED+EEESEE+E+
Sbjct: 194 DEDEEDGYVASEAKARKPQDEEEWDEDDEEESEEDED 230
>gnl|CDD|216194 pfam00922, Phosphoprotein, Vesiculovirus phosphoprotein.
Length = 283
Score = 31.4 bits (71), Expect = 2.9
Identities = 12/81 (14%), Positives = 25/81 (30%)
Query: 135 DDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDN 194
D T + E + K S + ++L D+ + +DD
Sbjct: 14 DQTLSEIEEMEEQRADKSSTFQEDSVEEHTSPSYYLAEELSDSETEPSIEDDQGLYTQLP 73
Query: 195 DEDNEEESEEEEENEEDDDEE 215
+ E + ++ DD+
Sbjct: 74 PAEQVEGFIQGPLDDIADDDI 94
>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
[Translation, ribosomal structure and biogenesis].
Length = 569
Score = 31.6 bits (71), Expect = 2.9
Identities = 24/126 (19%), Positives = 53/126 (42%), Gaps = 3/126 (2%)
Query: 130 DENDEDDTSDDDTESGQGSMKKYKEQVVS--SQDGRKRRKVMFDDDLEDNMDDDDDDDDD 187
E+++ D D + Y E +++ K+ + D ++ +D+++ D
Sbjct: 187 SEDEDIDYFQDMPSDDEEEEAIYYEDFFDKPTKEPVKKHSDVKDPKEDEELDEEEHDSAM 246
Query: 188 DETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSK-NEHVQELTAKQKNPKAFTFQSV 246
D+ D D E+E E E D EK+ + +E ++EL + PK + +
Sbjct: 247 DKVKLDLFADEEDEPNAEGVGEASDKNLSSFEKQQIEMDEQIEELEKELVAPKEWKYAGE 306
Query: 247 IKGERK 252
+ +++
Sbjct: 307 VSAKKR 312
Score = 30.0 bits (67), Expect = 9.4
Identities = 26/143 (18%), Positives = 54/143 (37%), Gaps = 7/143 (4%)
Query: 104 STEKSMMEIDGIHGAMSGAISSDDSDDENDEDDTSDD-DTESGQGSMKKYKEQVVSSQDG 162
EK M+ D S ++D+E DD + + + +K D
Sbjct: 104 FKEKQMLSADVSEIEEQSNDSLSENDEEPSMDDEKTSAEAAREEFAEEKRIPDPYGINDK 163
Query: 163 -----RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERK 217
+ R + +D + + +D+D D D +D++ EE E+ ++ E K
Sbjct: 164 FFDLEKFNRDTLAAEDSNEASEGSEDEDIDYFQDMPSDDEEEEAIYYEDFFDKPTKEPVK 223
Query: 218 AEKKNSKNEHVQELTAKQKNPKA 240
+ + K+ E ++++ A
Sbjct: 224 -KHSDVKDPKEDEELDEEEHDSA 245
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 29.5 bits (67), Expect = 2.9
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 199 EEESEEEEENEEDDDEERKAE 219
E++ EEEEE E+++ EE A
Sbjct: 79 EKKEEEEEEEEKEESEEEAAA 99
Score = 29.5 bits (67), Expect = 3.1
Identities = 9/20 (45%), Positives = 15/20 (75%)
Query: 196 EDNEEESEEEEENEEDDDEE 215
+ ++E EEEEE +E+ +EE
Sbjct: 77 AEEKKEEEEEEEEKEESEEE 96
>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1. Rec10 / Red1 is involved in meiotic
recombination and chromosome segregation during
homologous chromosome formation. This protein localises
to the synaptonemal complex in S. cerevisiae and the
analogous structures (linear elements) in S. pombe. This
family is currently only found in fungi.
Length = 706
Score = 31.8 bits (72), Expect = 3.0
Identities = 48/333 (14%), Positives = 98/333 (29%), Gaps = 64/333 (19%)
Query: 131 ENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDET 190
++ E +T D S S + Q + K + D +D + T
Sbjct: 267 QDIEANTKD----SLHMSAQDNHYDNTQLQTPERSTKR--KSPIWDLKEDQKESKIKSGT 320
Query: 191 D-------DDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTF 243
+ + E + EEE++ ++RK + SK + + +
Sbjct: 321 NLKLSSEKESIPETSYVNVLEEEQSPLVRLQKRKLARSTSKTLESLKEVFEDQASSVKNK 380
Query: 244 QSVIK-GERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLN-------------SKEE 289
Q+ + + T V P + D + K+ + L +
Sbjct: 381 QAQSEENLNESPKTPIAVTGDPHLKDPTIIAGKKLMNKLTSEKINNPVKVVKVSKYKGNK 440
Query: 290 EEEEQE----DSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAASA 345
E++++ D+ F+ + + ++ + K + K + ++ +Q S
Sbjct: 441 SEKKRDINVLDTIFASPVSKELRKKVGKSKQTKLK-NFKPVPNKSKKQLANNNSQNIKSK 499
Query: 346 FLERQA--NIVNLAKYVYGDMEDVSVTMEGDEAPNKKVHRKRQAELTAKQKNPKAFTFQS 403
+ + N NL E PN K +Q T S
Sbjct: 500 KVVKAKTNNKANLQD-----------VGECSSPPNNKEKNDKQTS-----------TSSS 537
Query: 404 VIKGERKFRRKEDIQAKKHHVPQVDRTPLEPPP 436
V+K D + + P + LE
Sbjct: 538 VLK--------SDRSSIEVRNPNANVKKLEDTT 562
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 31.4 bits (71), Expect = 3.0
Identities = 24/87 (27%), Positives = 35/87 (40%), Gaps = 8/87 (9%)
Query: 144 SGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESE 203
S+ + S RKR++ E + D+D+DD D+DE D E E
Sbjct: 337 FAISSLPDSDSEASDSGPTRKRKRRRVPPLPEYSSDEDEDDSDEDEVDY--------EKE 388
Query: 204 EEEENEEDDDEERKAEKKNSKNEHVQE 230
+ EED + R + SK V E
Sbjct: 389 RKRRREEDKNFLRLKALELSKYAGVNE 415
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim
have been identified in both mice and humans (mTim and
hTIM, respectively). Mammalian TIM is not the true
orthologue of Drosophila TIM, but is the likely
orthologue of a fly gene, timeout (also called tim-2).
mTim has been shown to be essential for embryonic
development, but does not have substantiated circadian
function. Some family members contain a SANT domain in
this region.
Length = 507
Score = 31.6 bits (71), Expect = 3.0
Identities = 34/209 (16%), Positives = 69/209 (33%), Gaps = 19/209 (9%)
Query: 123 ISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDD 182
+ S +D ++ D ++ +E + + Q R + +
Sbjct: 302 LQSCLIRAADDREEDECDQAVPLVPLTEENEEAMENEQFQRLLKALGLRPPRSGQEGFWR 361
Query: 183 DDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFT 242
T + EEE E+E DD + + ++ A+Q+ +
Sbjct: 362 IPAKLSSTQLRRRAASLSGEEEEPEDELKDDVDGEQADESEHETLALRKNARQRKAGLAS 421
Query: 243 FQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDE 302
+ GE + ++ P+ + + + T +G NS EE + D
Sbjct: 422 PEEEALGEEEQKAP-------PKKKQLNQKNK-QQTG--SGTNSDEERD---------DT 462
Query: 303 SADDNSSDEDETVEPKTHKDSKNPEEDTG 331
S D++ D+ P+ HKD + T
Sbjct: 463 SLDEDRDLADDGGLPRIHKDKRAGASLTQ 491
Score = 30.1 bits (67), Expect = 9.7
Identities = 16/105 (15%), Positives = 37/105 (35%), Gaps = 9/105 (8%)
Query: 126 DDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDG-------RKRRKVMFDDDLEDNM 178
DD D E + D S+ +T + + + ++ K + S ++ + K +
Sbjct: 391 DDVDGE--QADESEHETLALRKNARQRKAGLASPEEEALGEEEQKAPPKKKQLNQKNKQQ 448
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS 223
+ D++ D DED + + D + ++
Sbjct: 449 TGSGTNSDEERDDTSLDEDRDLADDGGLPRIHKDKRAGASLTQSP 493
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 31.6 bits (71), Expect = 3.1
Identities = 10/40 (25%), Positives = 20/40 (50%)
Query: 170 FDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENE 209
D+ E N D D +E DDD++E++ + + ++
Sbjct: 99 VQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKG 138
Score = 30.0 bits (67), Expect = 9.4
Identities = 12/43 (27%), Positives = 25/43 (58%), Gaps = 2/43 (4%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
D+ + + D + D++ED+++ EEE+N+ ++ K KK
Sbjct: 101 DNTEQNQDTKGSKTDSEEDDDD--SEEEDNKSTSSKDGKGSKK 141
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 30.5 bits (69), Expect = 3.1
Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 2/92 (2%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
S+ +S D+ ++D +D + + KK K + +K +K + E+ + +
Sbjct: 26 STTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKK--KKEKEEVPELAAE 83
Query: 184 DDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
+ D E +++ND+ + E + + + + E
Sbjct: 84 ELSDSEENEENDKKVDYELPKVQNTAAEVNHE 115
>gnl|CDD|215443 PLN02829, PLN02829, Probable galacturonosyltransferase.
Length = 639
Score = 31.4 bits (71), Expect = 3.2
Identities = 20/100 (20%), Positives = 42/100 (42%), Gaps = 10/100 (10%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNP 238
DDD DD + +++S+++E+N + + EHVQ+ +
Sbjct: 107 DDDTHSQTDDIIKQVTQKAGQDDSDQQEKNSQSQSAS-----QAESLEHVQQS---AQTS 158
Query: 239 KAFTFQSVIKGERKFRSTKPTVVPQPRMSD-KDLAIRHKI 277
+ + + + + T TV+P R+ +D I+ K+
Sbjct: 159 EKVDEKEPLLTKTD-KQTDQTVMPDARVRQLRDQLIKAKV 197
>gnl|CDD|172730 PRK14242, PRK14242, phosphate transporter ATP-binding protein;
Provisional.
Length = 253
Score = 30.9 bits (70), Expect = 3.2
Identities = 14/37 (37%), Positives = 22/37 (59%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
V A++GP GKST +RCL + P + ++G + L
Sbjct: 34 VTALIGPSGCGKSTFLRCLNRMNDLIPGARVEGEILL 70
>gnl|CDD|206680 cd01893, Miro1, Mitochondrial Rho family 1 (Miro1), N-terminal.
Miro1 subfamily. Miro (mitochondrial Rho) proteins have
tandem GTP-binding domains separated by a linker region
containing putative calcium-binding EF hand motifs.
Genes encoding Miro-like proteins were found in several
eukaryotic organisms. This CD represents the N-terminal
GTPase domain of Miro proteins. These atypical Rho
GTPases have roles in mitochondrial homeostasis and
apoptosis. Most Rho proteins contain a lipid
modification site at the C-terminus; however, Miro is
one of few Rho subfamilies that lack this feature.
Length = 168
Score = 30.4 bits (69), Expect = 3.2
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 439 VAVVGPPQVGKSTLIRCLIK 458
+ ++G VGKS+LI L+
Sbjct: 5 IVLIGDEGVGKSSLIMSLVS 24
>gnl|CDD|234357 TIGR03797, NHLM_micro_ABC2, NHLM bacteriocin system ABC
transporter, ATP-binding protein. Members of this
protein family are ABC transporter ATP-binding subunits,
part of a three-gene putative bacteriocin transport
operon. The other subunits include another ATP-binding
subunit (TIGR03796), which has an N-terminal leader
sequence cleavage domain, and an HlyD homolog
(TIGR03794). In a number of genomes, members of protein
families related to nitrile hydratase alpha subunit or
to nif11 have undergone paralogous family expansions,
with members possessing a putative bacteriocin cleavage
region ending with a classic Gly-Gly motif. Those sets
of putative bacteriocins, members of this protein family
and its partners TIGR03794 and TIGR03796, and
cyclodehydratase/docking scaffold fusion proteins of
thiazole/oxazole biosynthesis frequently show correlated
species distribution and co-clustering within many of
those genomes [Transport and binding proteins, Amino
acids, peptides and amines, Cellular processes,
Biosynthesis of natural products].
Length = 686
Score = 31.5 bits (72), Expect = 3.3
Identities = 12/19 (63%), Positives = 15/19 (78%)
Query: 439 VAVVGPPQVGKSTLIRCLI 457
VA+VGP GKSTL+R L+
Sbjct: 482 VAIVGPSGSGKSTLLRLLL 500
>gnl|CDD|213213 cd03246, ABCC_Protease_Secretion, ATP-binding cassette domain of
PrtD, subfamily C. This family represents the ABC
component of the protease secretion system PrtD, a
60-kDa integral membrane protein sharing 37% identity
with HlyB, the ABC component of the alpha-hemolysin
secretion pathway, in the C-terminal domain. They export
degradative enzymes by using a type I protein secretion
system and lack an N-terminal signal peptide, but
contain a C-terminal secretion signal. The Type I
secretion apparatus is made up of three components, an
ABC transporter, a membrane fusion protein (MFP), and an
outer membrane protein (OMP). For the HlyA transporter
complex, HlyB (ABC transporter) and HlyD (MFP) reside in
the inner membrane of E. coli. The OMP component is
TolC, which is thought to interact with the MFP to form
a continuous channel across the periplasm from the
cytoplasm to the exterior. HlyB belongs to the family of
ABC transporters, which are ubiquitous, ATP-dependent
transmembrane pumps or channels. The spectrum of
transport substrates ranges from inorganic ions,
nutrients such as amino acids, sugars, or peptides,
hydrophobic drugs, to large polypeptides, such as HlyA.
Length = 173
Score = 30.6 bits (70), Expect = 3.3
Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 2/53 (3%)
Query: 434 PPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSV-IKG-PVTLIIKDSIRDCF 484
P +A++GP GKSTL R ++ T V + G ++ + + D
Sbjct: 26 EPGESLAIIGPSGSGKSTLARLILGLLRPTSGRVRLDGADISQWDPNELGDHV 78
>gnl|CDD|224025 COG1100, COG1100, GTPase SAR1 and related small G proteins [General
function prediction only].
Length = 219
Score = 30.7 bits (69), Expect = 3.3
Identities = 11/44 (25%), Positives = 19/44 (43%), Gaps = 2/44 (4%)
Query: 439 VAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLIIKDSIRD 482
+ V+G VGK+TL+ L+ + + P L +I
Sbjct: 8 IVVLGDGGVGKTTLLNRLVGD--EFPEGYPPTIGNLDPAKTIEP 49
>gnl|CDD|219240 pfam06957, COPI_C, Coatomer (COPI) alpha subunit C-terminus. This
family represents the C-terminus (approximately 500
residues) of the eukaryotic coatomer alpha subunit.
Coatomer (COPI) is a large cytosolic protein complex
which forms a coat around vesicles budding from the
Golgi apparatus. Such coatomer-coated vesicles have been
proposed to play a role in many distinct steps of
intracellular transport. Note that many family members
also contain the pfam04053 domain.
Length = 421
Score = 31.4 bits (71), Expect = 3.3
Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 1/42 (2%)
Query: 175 EDNMDDDDDDDDDD-ETDDDNDEDNEEESEEEEENEEDDDEE 215
M DD +D D D+D + E+ ++EE E DEE
Sbjct: 33 FAVMGGDDWGEDADLGLDEDGYLEGEDGLLDDEEGPEGGDEE 74
>gnl|CDD|227952 COG5665, NOT5, CCR4-NOT transcriptional regulation complex, NOT5
subunit [Transcription].
Length = 548
Score = 31.6 bits (71), Expect = 3.3
Identities = 36/193 (18%), Positives = 59/193 (30%), Gaps = 17/193 (8%)
Query: 170 FDDDLE---DNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD-------EERKAE 219
F DD++ +N DD D + D +D E S E E ++ +K E
Sbjct: 170 FQDDIKYYVENNDDPDFIEYDTIYEDMGCEIQPSSSNNEAPKEGNNQTSLSSIRSSKKQE 229
Query: 220 KKNSKNEHVQELTAKQK-----NPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIR 274
+ K ++++ + P + I ST + S K
Sbjct: 230 RSPKKKAPQRDVSISDRATTPIAPGVESASQSISSTPTPVSTDTPLHTVKDDSIKFDNST 289
Query: 275 HKITSALAGLNSKEEEEEEQEDSAFSDESADDNSSDEDETVEPKT--HKDSKNPEEDTGL 332
+ + KE E + ++ F +S D+ VE N E L
Sbjct: 290 LGTPTTHVSMKKKESENDSEQQLNFPKDSTDEIRKTIQHDVETNAAFQNPLFNDELKWWL 349
Query: 333 NWKSDLAQKAASA 345
K L Q
Sbjct: 350 ASKRYLTQPLQEM 362
>gnl|CDD|238026 cd00071, GMPK, Guanosine monophosphate kinase (GMPK, EC 2.7.4.8),
also known as guanylate kinase (GKase), catalyzes the
reversible phosphoryl transfer from adenosine
triphosphate (ATP) to guanosine monophosphate (GMP) to
yield adenosine diphosphate (ADP) and guanosine
diphosphate (GDP). It plays an essential role in the
biosynthesis of guanosine triphosphate (GTP). This
enzyme is also important for the activation of some
antiviral and anticancer agents, such as acyclovir,
ganciclovir, carbovir, and thiopurines.
Length = 137
Score = 29.8 bits (68), Expect = 3.4
Identities = 14/33 (42%), Positives = 20/33 (60%), Gaps = 3/33 (9%)
Query: 436 PIVVAVVGPPQVGKSTLIRCLIKNF-TKTPLSV 467
IV++ GP VGKSTL++ L++ F SV
Sbjct: 1 LIVLS--GPSGVGKSTLLKRLLEEFDPNFGFSV 31
>gnl|CDD|220245 pfam09444, MRC1, MRC1-like domain. This putative domain is found
to be the most conserved region in mediator of
replication checkpoint protein 1.
Length = 145
Score = 30.0 bits (68), Expect = 3.6
Identities = 18/49 (36%), Positives = 33/49 (67%), Gaps = 8/49 (16%)
Query: 180 DDDDDDDDDETDDD---NDEDNEEESEEEEEN-----EEDDDEERKAEK 220
DD+D DDDD+++ + +DE ++E++E +E EE +D+E++ EK
Sbjct: 28 DDEDSDDDDDSELEEMIDDEYSKEKAENDELAAKRAEEEREDDEKEVEK 76
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 31.1 bits (71), Expect = 3.6
Identities = 15/45 (33%), Positives = 24/45 (53%), Gaps = 5/45 (11%)
Query: 160 QDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEE 204
Q ++RRK ++ +DDDDD+D+D+ + E EE E
Sbjct: 186 QRKKQRRKRS-----KELREDDDDDEDEDDDGEGGGEGGEERQRE 225
>gnl|CDD|224054 COG1131, CcmA, ABC-type multidrug transport system, ATPase
component [Defense mechanisms].
Length = 293
Score = 30.7 bits (70), Expect = 3.7
Identities = 7/19 (36%), Positives = 13/19 (68%)
Query: 438 VVAVVGPPQVGKSTLIRCL 456
+ ++GP GK+TL++ L
Sbjct: 33 IFGLLGPNGAGKTTLLKIL 51
>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter. The Monovalent
Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
CPA1 family is a large family of proteins derived from
Gram-positive and Gram-negative bacteria, blue green
bacteria, yeast, plants and animals. Transporters from
eukaryotes have been functionally characterized, and all
of these catalyze Na+:H+ exchange. Their primary
physiological functions may be in (1) cytoplasmic pH
regulation, extruding the H+ generated during
metabolism, and (2) salt tolerance (in plants), due to
Na+ uptake into vacuoles. This model is specific for the
fungal members of this family [Transport and binding
proteins, Cations and iron carrying compounds].
Length = 810
Score = 31.4 bits (71), Expect = 3.8
Identities = 18/71 (25%), Positives = 32/71 (45%), Gaps = 9/71 (12%)
Query: 145 GQGSMKKYKEQVVSSQD--------GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDE 196
G+ +KK + VVS + R ED M DD+ + ++ + +D+D
Sbjct: 741 GKRKIKKRNDSVVSVDEEKAIEGPSRVPERGNHDLLHSEDEMADDEAESENMDDYEDSD- 799
Query: 197 DNEEESEEEEE 207
DN ES++ +
Sbjct: 800 DNAYESKDHAD 810
>gnl|CDD|213788 TIGR03263, guanyl_kin, guanylate kinase. Members of this family
are the enzyme guanylate kinase, also called GMP kinase.
This enzyme transfers a phosphate from ATP to GMP,
yielding ADP and GDP [Purines, pyrimidines, nucleosides,
and nucleotides, Nucleotide and nucleoside
interconversions].
Length = 179
Score = 30.2 bits (69), Expect = 3.9
Identities = 9/27 (33%), Positives = 15/27 (55%)
Query: 441 VVGPPQVGKSTLIRCLIKNFTKTPLSV 467
+ GP GKSTL++ L++ S+
Sbjct: 5 ISGPSGAGKSTLVKALLEEDPNLKFSI 31
>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 944
Score = 31.2 bits (71), Expect = 3.9
Identities = 46/329 (13%), Positives = 89/329 (27%), Gaps = 47/329 (14%)
Query: 120 SGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVM---FDDDLED 176
+ + D+ ++ D++ + + + +S + V +D +
Sbjct: 474 ASSSLDADNSAVPEQIDSTAEQSVVNPSVTDTQVDDTSASNNSAADNTVDDNYSAEDTLE 533
Query: 177 N--MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAK 234
+ +D+ D D D D+ SE +D+ + S E +
Sbjct: 534 SNGLDEGDYAQDSAPLDAYQDDYVAFSSESYNALSDDEQHSANVQSAQSAAEAQPSSQSL 593
Query: 235 QKNPKAFTFQSVIKGERKFRSTKPTVVPQPRMSDKD-----LAIRHKITSALAGLNSKEE 289
T +D D LA R + S L L+ KE
Sbjct: 594 SPIS---------------AVTTAAASL----ADDDILDAVLAARDSLLSDLDALSPKEG 634
Query: 290 EE--EEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSDLAQKAASAFL 347
+ + + S +S P + S + + D + + A+
Sbjct: 635 DGKKSSADRKPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVPEAA--- 691
Query: 348 ERQANIVNLAKYVYGDMEDVSVTMEGDEAPNKKVHRKRQAELTAKQKNPKAFTFQSVIKG 407
A+ A D D E E + AE + A +
Sbjct: 692 --LASGSAPAPPPVPDPYDRPPWEEAPEVASANDGPNNAAEGNLSESVEDASNSEL---- 745
Query: 408 ERKFRRKEDIQAKKHHVPQVDRTPLEPPP 436
+ ++ + H PQV P
Sbjct: 746 -------QAVEQQATHQPQVQAEAQSPAS 767
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 30.8 bits (70), Expect = 3.9
Identities = 20/77 (25%), Positives = 31/77 (40%), Gaps = 6/77 (7%)
Query: 198 NEEESE------EEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGER 251
EEE E EEE ++ +++K KKN+ E T P + + GE
Sbjct: 165 TEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEA 224
Query: 252 KFRSTKPTVVPQPRMSD 268
K K ++ P S+
Sbjct: 225 KKLKKKRSIAPDNEKSE 241
>gnl|CDD|222347 pfam13729, TraF_2, F plasmid transfer operon, TraF, protein.
Length = 267
Score = 30.7 bits (70), Expect = 4.0
Identities = 9/23 (39%), Positives = 11/23 (47%)
Query: 172 DDLEDNMDDDDDDDDDDETDDDN 194
D+L D DD D D + DN
Sbjct: 33 DNLVDKFDDLSDIYDALSSALDN 55
>gnl|CDD|234034 TIGR02858, spore_III_AA, stage III sporulation protein AA. Members
of this protein are the stage III sporulation protein
AA, encoded by one of several genes in the spoIIIA
locus. It seems that this protein is found in a species
if and only if that species is capable of endospore
formation [Cellular processes, Sporulation and
germination].
Length = 270
Score = 30.8 bits (70), Expect = 4.0
Identities = 9/21 (42%), Positives = 15/21 (71%)
Query: 441 VVGPPQVGKSTLIRCLIKNFT 461
++ PPQ GK+TL+R L + +
Sbjct: 116 IISPPQCGKTTLLRDLARILS 136
>gnl|CDD|172741 PRK14253, PRK14253, phosphate ABC transporter ATP-binding protein;
Provisional.
Length = 249
Score = 30.6 bits (69), Expect = 4.1
Identities = 13/23 (56%), Positives = 16/23 (69%)
Query: 434 PPPIVVAVVGPPQVGKSTLIRCL 456
P V A++GP GKSTL+RCL
Sbjct: 27 PARQVTALIGPSGCGKSTLLRCL 49
>gnl|CDD|206676 cd01889, SelB_euk, SelB, the dedicated elongation factor for
delivery of selenocysteinyl-tRNA to the ribosome. SelB
is an elongation factor needed for the co-translational
incorporation of selenocysteine. Selenocysteine is coded
by a UGA stop codon in combination with a specific
downstream mRNA hairpin. In bacteria, the C-terminal
part of SelB recognizes this hairpin, while the
N-terminal part binds GTP and tRNA in analogy with
elongation factor Tu (EF-Tu). It specifically recognizes
the selenocysteine charged tRNAsec, which has a UCA
anticodon, in an EF-Tu like manner. This allows
insertion of selenocysteine at in-frame UGA stop codons.
In E. coli SelB binds GTP, selenocysteyl-tRNAsec and a
stem-loop structure immediately downstream of the UGA
codon (the SECIS sequence). The absence of active SelB
prevents the participation of selenocysteyl-tRNAsec in
translation. Archaeal and animal mechanisms of
selenocysteine incorporation are more complex. Although
the SECIS elements have different secondary structures
and conserved elements between archaea and eukaryotes,
they do share a common feature. Unlike in E. coli, these
SECIS elements are located in the 3' UTRs. This group
contains eukaryotic SelBs and some from archaea.
Length = 192
Score = 30.4 bits (69), Expect = 4.1
Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 3/68 (4%)
Query: 36 VCTAPNNEGIHRQEFEIEPITFIECNND---INSMIDISKVADLVLLLIDASFGFEMEIF 92
V + E + E IT ++C I ++I +++ DL+LL++DA G + +
Sbjct: 51 VDKPKHLEDNENPQIENYQITLVDCPGHASLIRTIIGGAQIIDLMLLVVDAKKGIQTQTA 110
Query: 93 EFLNICQV 100
E L I ++
Sbjct: 111 ECLVIGEL 118
>gnl|CDD|177868 PLN02222, PLN02222, phosphoinositide phospholipase C 2.
Length = 581
Score = 31.2 bits (70), Expect = 4.1
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 164 KRRKVMFDDDLEDNMDDDDDDDDD 187
+R K + +D + DDDDDD +D
Sbjct: 283 QRNKSVDKNDSNGDDDDDDDDGED 306
Score = 30.8 bits (69), Expect = 5.2
Identities = 12/37 (32%), Positives = 19/37 (51%)
Query: 162 GRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDN 198
GR+ + + D D + DDDDDD+ +D + N
Sbjct: 275 GREVPSFIQRNKSVDKNDSNGDDDDDDDDGEDKSKKN 311
>gnl|CDD|223487 COG0410, LivF, ABC-type branched-chain amino acid transport
systems, ATPase component [Amino acid transport and
metabolism].
Length = 237
Score = 30.5 bits (70), Expect = 4.3
Identities = 7/19 (36%), Positives = 14/19 (73%)
Query: 438 VVAVVGPPQVGKSTLIRCL 456
+VA++G GK+TL++ +
Sbjct: 31 IVALLGRNGAGKTTLLKTI 49
>gnl|CDD|213220 cd03253, ABCC_ATM1_transporter, ATP-binding cassette domain of
iron-sulfur clusters transporter, subfamily C. ATM1 is
an ABC transporter that is expressed in the
mitochondria. Although the specific function of ATM1 is
unknown, its disruption results in the accumulation of
excess mitochondrial iron, loss of mitochondrial
cytochromes, oxidative damage to mitochondrial DNA, and
decreased levels of cytosolic heme proteins. ABC
transporters are a large family of proteins involved in
the transport of a wide variety of different compounds,
like sugars, ions, peptides, and more complex organic
molecules. The nucleotide binding domain shows the
highest similarity between all members of the family.
ABC transporters are a subset of nucleotide hydrolases
that contain a signature motif, Q-loop, and
H-loop/switch region, in addition to, the Walker A
motif/P-loop and Walker B motif commonly found in a
number of ATP- and GTP-binding and hydrolyzing proteins.
Length = 236
Score = 30.7 bits (70), Expect = 4.4
Identities = 22/81 (27%), Positives = 33/81 (40%), Gaps = 20/81 (24%)
Query: 434 PPPIVVAVVGPPQVGKSTLIRCL-----------------IKNFTKTPLSVIKGPV---T 473
P VA+VGP GKST++R L I+ T L G V T
Sbjct: 25 PAGKKVAIVGPSGSGKSTILRLLFRFYDVSSGSILIDGQDIREVTLDSLRRAIGVVPQDT 84
Query: 474 LIIKDSIRDCFVTGKWKASED 494
++ D+I G+ A+++
Sbjct: 85 VLFNDTIGYNIRYGRPDATDE 105
>gnl|CDD|237628 PRK14156, PRK14156, heat shock protein GrpE; Provisional.
Length = 177
Score = 30.2 bits (68), Expect = 4.4
Identities = 20/71 (28%), Positives = 38/71 (53%), Gaps = 8/71 (11%)
Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERK-----AEKKNS 223
M ++ + ++++ + + +ET ++ E+ E+SE E NE D+ E K AE +N
Sbjct: 1 MSEEIKNEEVEEEVEATETEETVEEVVEETPEKSELELANERADEFENKYLRAHAEMQNI 60
Query: 224 K---NEHVQEL 231
+ NE Q+L
Sbjct: 61 QRRANEERQQL 71
>gnl|CDD|172756 PRK14268, PRK14268, phosphate ABC transporter ATP-binding protein;
Provisional.
Length = 258
Score = 30.7 bits (69), Expect = 4.4
Identities = 18/55 (32%), Positives = 25/55 (45%), Gaps = 17/55 (30%)
Query: 434 PPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLIIKDSIRDCFVTGK 488
P V A++GP GKST IRCL + + D I++C + GK
Sbjct: 36 PKNSVTALIGPSGCGKSTFIRCLNR-----------------MNDLIKNCRIEGK 73
>gnl|CDD|238886 cd01903, Ntn_AC_NAAA, AC_NAAA This conserved domain includes two
closely related proteins, acid ceramidase (AC, also
known as N-acylsphingosine amidohydrolase), and
N-acylethanolamine-hydrolyzing acid amidase (NAAA). AC
catalyzes the hydrolysis of ceramide to sphingosine and
fatty acid. Ceramide is required for the biosynthesis of
most sphingolipids and plays an important role in many
signal transduction pathways by inducing apoptosis
and/or arresting cell growth. An inherited deficiency of
AC activity leads to the lysosomal storage disorder
known as Farber disease. AC is considered a "rheostat"
important for maintaining the proper intracellular
levels of these lipids since hydrolysis of ceramide is
the only source of sphingosine in cells. NAAA is a
eukaryotic glycoprotein that hydrolyzes bioactive
N-acylethanolamines, including anandamide (an
endocannabinoid) and N-palmitoylethanolamine (an
anti-inflammatory and neuroprotective substance), to
fatty acids and ethanolamine at acidic pH. NAAA shows
structural and functional similarity to acid ceramidase,
but lacks the ceramide-hydrolyzing activity of AC.
Length = 231
Score = 30.3 bits (69), Expect = 4.4
Identities = 31/105 (29%), Positives = 40/105 (38%), Gaps = 34/105 (32%)
Query: 601 NARVELEGFRAGLYIRVELDGMPC-----ELIEN----------------FDPTYPLIVG 639
N R L+G G+ ++ DG+P E +EN P Y IVG
Sbjct: 78 NERFSLDGGYNGILALLKKDGIPVSWLIRETLENATSYEDAVEKLSTTPILAPAY-FIVG 136
Query: 640 GLQPGEETIGCV--RARVKKHRWYGKILKSGNPVIMSVGWRRFQT 682
G++PGE G V R R Y LK+G W QT
Sbjct: 137 GVKPGE---GVVITRNRDSVADVYPLDLKNG-------TWFLVQT 171
>gnl|CDD|172749 PRK14261, PRK14261, phosphate ABC transporter ATP-binding protein;
Provisional.
Length = 253
Score = 30.6 bits (69), Expect = 4.5
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 438 VVAVVGPPQVGKSTLIRCL 456
V A++GP GKSTL+RC
Sbjct: 34 VTALIGPSGCGKSTLLRCF 52
>gnl|CDD|213228 cd03261, ABC_Org_Solvent_Resistant, ATP-binding cassette transport
system involved in resistant to organic solvents. ABC
transporters are a large family of proteins involved in
the transport of a wide variety of different compounds,
like sugars, ions, peptides, and more complex organic
molecules. The nucleotide binding domain shows the
highest similarity between all members of the family.
ABC transporters are a subset of nucleotide hydrolases
that contain a signature motif, Q-loop, and
H-loop/switch region, in addition to, the Walker A
motif/P-loop and Walker B motif commonly found in a
number of ATP- and GTP-binding and hydrolyzing proteins.
Length = 235
Score = 30.5 bits (70), Expect = 4.6
Identities = 9/20 (45%), Positives = 16/20 (80%)
Query: 438 VVAVVGPPQVGKSTLIRCLI 457
++A++GP GKSTL+R ++
Sbjct: 28 ILAIIGPSGSGKSTLLRLIV 47
>gnl|CDD|237645 PRK14235, PRK14235, phosphate transporter ATP-binding protein;
Provisional.
Length = 267
Score = 30.5 bits (69), Expect = 4.6
Identities = 21/68 (30%), Positives = 27/68 (39%), Gaps = 27/68 (39%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLIIKDSIRDCFVTGKWKASEDASE 497
V A +GP GKST +RCL + + D+I C VTGK
Sbjct: 47 VTAFIGPSGCGKSTFLRCLNR-----------------MNDTIDGCRVTGK--------- 80
Query: 498 LLRLDDMD 505
+ LD D
Sbjct: 81 -ITLDGED 87
>gnl|CDD|234003 TIGR02768, TraA_Ti, Ti-type conjugative transfer relaxase TraA. This
protein contains domains distinctive of a single strand
exonuclease (N-terminus, MobA/MobL, pfam03389) as well as
a helicase domain (central region, homologous to the
corresponding region of the F-type relaxase TraI,
TIGR02760). This protein likely fills the same role as
TraI(F), nicking (at the oriT site) and unwinding the
coiled plasmid prior to conjugative transfer.
Length = 744
Score = 30.9 bits (70), Expect = 4.7
Identities = 23/96 (23%), Positives = 41/96 (42%), Gaps = 1/96 (1%)
Query: 924 VIHSEREQKVASLMKMLRTNYSEKNSKEKQAMKA-RMVALKLRAKAAEEAKQQRQRVMKK 982
IH RE+ + ++ + + E N Q M A ++ +AA EA +R + +
Sbjct: 537 TIHDTREEAIEQVVADWKQDLREANPAGSQIMLAHTRKDVRALNEAAREALIERGELGES 596
Query: 983 DIFRTLRMPKVMGVLTHLDMLKNNKTLKTTKKMLKH 1018
+F+T R + + L+NN+ L ML
Sbjct: 597 ILFQTARGERKFAAGDRIVFLENNRDLGVKNGMLGT 632
>gnl|CDD|227468 COG5139, COG5139, Uncharacterized conserved protein [Function
unknown].
Length = 397
Score = 30.8 bits (69), Expect = 4.8
Identities = 15/86 (17%), Positives = 39/86 (45%), Gaps = 3/86 (3%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNS--KNEHVQELTAKQKN 237
+ ++E N +E+ + N+ D + + ++ + +N +V+ K+K+
Sbjct: 24 QKSTINAENENTKQNQSMEPQETSKGTSNDTKDPDNGEKNEEAAIDENSNVEAAERKRKH 83
Query: 238 PKA-FTFQSVIKGERKFRSTKPTVVP 262
F+ S+++ + +S +PT P
Sbjct: 84 ISTDFSDMSLLRKRKNDQSLQPTREP 109
>gnl|CDD|222307 pfam13671, AAA_33, AAA domain. This family of domains contain only
a P-loop motif, that is characteristic of the AAA
superfamily. Many of the proteins in this family are
just short fragments so there is no Walker B motif.
Length = 143
Score = 29.6 bits (67), Expect = 4.8
Identities = 9/21 (42%), Positives = 14/21 (66%)
Query: 438 VVAVVGPPQVGKSTLIRCLIK 458
++ +VG P GKST R L++
Sbjct: 1 LILMVGLPGSGKSTFARRLLR 21
>gnl|CDD|184594 PRK14264, PRK14264, phosphate ABC transporter ATP-binding protein;
Provisional.
Length = 305
Score = 30.6 bits (69), Expect = 4.8
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 434 PPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
P V A++GP GKST +RCL + + + I G V L
Sbjct: 69 PEKSVTALIGPSGCGKSTFLRCLNRMNDRIKAARIDGSVEL 109
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 30.8 bits (70), Expect = 4.9
Identities = 10/49 (20%), Positives = 24/49 (48%)
Query: 192 DDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKA 240
D D+D+ +E EN+E ++++ +KK K + ++ + +
Sbjct: 31 KDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAY 79
>gnl|CDD|213225 cd03258, ABC_MetN_methionine_transporter, ATP-binding cassette
domain of methionine transporter. MetN (also known as
YusC) is an ABC-type transporter encoded by metN of the
metNPQ operon in Bacillus subtilis that is involved in
methionine transport. Other members of this system
include the MetP permease and the MetQ substrate binding
protein. ABC transporters are a subset of nucleotide
hydrolases that contain a signature motif, Q-loop, and
H-loop/switch region, in addition to, the Walker A
motif/P-loop and Walker B motif commonly found in a
number of ATP- and GTP-binding and hydrolyzing proteins.
Length = 233
Score = 30.2 bits (69), Expect = 4.9
Identities = 12/29 (41%), Positives = 17/29 (58%), Gaps = 2/29 (6%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLS 466
+ ++G GKSTLIRC+ N + P S
Sbjct: 33 IFGIIGRSGAGKSTLIRCI--NGLERPTS 59
>gnl|CDD|130570 TIGR01506, ribC_arch, riboflavin synthase. This archaeal protein
catalyzes the same reaction, the final step in
riboflavin biosynthesis, as bacterial riboflavin
biosynthesis alpha chain. However, it is more similar in
sequence to 6,7-dimethyl-8-ribityllumazine synthase,
which catalyzes the previous reaction and which (in
bacteria) is called the riboflavin synthase beta chain
[Biosynthesis of cofactors, prosthetic groups, and
carriers, Riboflavin, FMN, and FAD].
Length = 151
Score = 29.9 bits (67), Expect = 4.9
Identities = 13/36 (36%), Positives = 15/36 (41%)
Query: 207 ENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFT 242
E+E +D EE K N EH Q L P T
Sbjct: 99 EDEAEDPEELKVLADNRAREHAQNLIMLLFKPDRLT 134
>gnl|CDD|221323 pfam11931, DUF3449, Domain of unknown function (DUF3449). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 181 to 207 amino acids in length. This domain
has two conserved sequence motifs: PIP and CEICG. The
domain carries a zinc-finger domain of the C2H2-type.
Length = 187
Score = 29.9 bits (68), Expect = 5.0
Identities = 11/36 (30%), Positives = 25/36 (69%)
Query: 165 RRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEE 200
R++ + +++ + + D+ +D +D +DDD+D+D EE
Sbjct: 27 RKQALTEEERQASADESSEDASEDGSDDDSDDDEEE 62
>gnl|CDD|221125 pfam11500, Cut12, Spindle-body formation-associated protein. This
is the central coiled-coil region of cut12 also found in
other fungi, barring S. cerevisiae. The full protein has
two predicted coiled-coil regions, and one consensus
phosphorylation site for p34cdc2 and two for MAP kinase.
During fission yeast mitosis, the duplicated spindle
pole bodies (SPBs) nucleate microtubule arrays that
interdigitate to form the mitotic spindle. Cut12 is
localised to the SPB throughout the cell cycle,
predominantly around the inner face of the interphase
SPB, adjacent to the nucleus. Cut12 associates with Fin1
and is important in this context for the activity of
Plo1.
Length = 149
Score = 29.7 bits (67), Expect = 5.1
Identities = 10/54 (18%), Positives = 21/54 (38%)
Query: 161 DGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDE 214
DG+ + + LED+ + ++ TD E+ + ++D D
Sbjct: 1 DGKSKPRTKLTQALEDSRKNSENTTPGTSTDTKQAFPPEDAWNPSSKEDDDGDV 54
>gnl|CDD|224985 COG2074, COG2074, 2-phosphoglycerate kinase [Carbohydrate transport
and metabolism].
Length = 299
Score = 30.4 bits (69), Expect = 5.1
Identities = 36/152 (23%), Positives = 66/152 (43%), Gaps = 26/152 (17%)
Query: 376 APNK--KVHRKRQAELTAKQKNPKAFTFQSVIKGERKFRRKEDIQ-AKKHHVPQVDRTPL 432
P+ + + Q EL K++ + T + + +K K D + AK++ + + R
Sbjct: 30 DPDLAYSIAIEIQEEL--KKEGIRLVTKDEIREVYQKLLEKGDPEVAKRYLLWRRIRKMK 87
Query: 433 EPPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLIIKDSIRDC---FVTGK- 488
P +++ + G VGKST+ L + G ++I DSIR+ ++ +
Sbjct: 88 RP--LIILIGGASGVGKSTIAGELARRL---------GIRSVISTDSIREVLRKIISPEL 136
Query: 489 ----WKASEDASELLRLDDMDDDEELFGDFED 516
+S DA + LR D D+ + FED
Sbjct: 137 LPTLHTSSYDAWKALR--DPTDENPIIAGFED 166
>gnl|CDD|206748 cd01855, YqeH, Circularly permuted YqeH GTPase. YqeH is an
essential GTP-binding protein. Depletion of YqeH induces
an excess initiation of DNA replication, suggesting that
it negatively controls initiation of chromosome
replication. The YqeH subfamily is common in eukaryotes
and sporadically present in bacteria with probable
acquisition by plants from chloroplasts. Proteins of the
YqeH family contain all sequence motifs typical of the
vast class of P-loop-containing GTPases, but show a
circular permutation, with a G4-G1-G3 pattern of motifs
as opposed to the regular G1-G3-G4 pattern seen in most
GTPases.
Length = 191
Score = 29.9 bits (68), Expect = 5.2
Identities = 12/18 (66%), Positives = 13/18 (72%)
Query: 441 VVGPPQVGKSTLIRCLIK 458
VVG VGKSTLI L+K
Sbjct: 130 VVGATNVGKSTLINALLK 147
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 31.0 bits (70), Expect = 5.3
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEER 216
E ++ DE+ EE + EE +EE ++EE+
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEEEK 308
>gnl|CDD|151958 pfam11521, TFIIE-A_C-term, C-terminal general transcription factor
TFIIE alpha. TFIIE is compiled of two subunits, alpha
and beta. This family of proteins are the C terminal
domain of the alpha subunit of the protein which is the
largest subunit and contains several functional domains
which are important for basal transcription and cell
growth. The C terminal end of the protein binds directly
to the amino-terminal PH domain of p62/Tfb1 (of IIH)
which is involved in the recruitment of the general
transcription factor IIH to the transcription
preinitiation complex. P53 competes for the same binding
site as TFIIE alpha which shows their structural
similarity. Like p53, TFIIE alpha 336-439 can activate
transcription in vivo.
Length = 86
Score = 28.6 bits (64), Expect = 5.3
Identities = 12/38 (31%), Positives = 23/38 (60%), Gaps = 4/38 (10%)
Query: 186 DDDETDDDNDEDNEE----ESEEEEENEEDDDEERKAE 219
+ D ++ D+D + E +S +EEE++EDD+ E +
Sbjct: 2 ESDTSESDDDSPSPEPPAGQSVDEEEDDEDDEFEEVED 39
Score = 27.9 bits (62), Expect = 9.0
Identities = 11/34 (32%), Positives = 17/34 (50%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDD 213
+ DDD E D EE+ E++E E +D+
Sbjct: 7 ESDDDSPSPEPPAGQSVDEEEDDEDDEFEEVEDE 40
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 31.0 bits (70), Expect = 5.3
Identities = 14/71 (19%), Positives = 30/71 (42%), Gaps = 3/71 (4%)
Query: 117 GAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLED 176
G++ G +++ + ++ S + K + Q + ++ V+ D + E
Sbjct: 1398 GSVLGRAATNKETESSENVSGSSSSEKDEIDVSAKPRPQ---RANRKQTTYVLSDSESES 1454
Query: 177 NMDDDDDDDDD 187
D D DDD+D
Sbjct: 1455 ADDSDFDDDED 1465
>gnl|CDD|237549 PRK13897, PRK13897, type IV secretion system component VirD4;
Provisional.
Length = 606
Score = 30.9 bits (70), Expect = 5.4
Identities = 8/34 (23%), Positives = 21/34 (61%)
Query: 182 DDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
D +++E ++ + E+ EE+E ++D+D++
Sbjct: 573 DPRANNNEASEETETITVPENNEEDEEDDDEDDD 606
>gnl|CDD|235636 PRK05886, yajC, preprotein translocase subunit YajC; Validated.
Length = 109
Score = 29.0 bits (65), Expect = 5.6
Identities = 9/27 (33%), Positives = 14/27 (51%)
Query: 176 DNMDDDDDDDDDDETDDDNDEDNEEES 202
D + DD+DD++ D D D D+
Sbjct: 82 DRILPDDEDDEELNADLDKDVDDVAGE 108
>gnl|CDD|185558 PTZ00329, PTZ00329, eukaryotic translation initiation factor 1A;
Provisional.
Length = 155
Score = 29.4 bits (66), Expect = 5.6
Identities = 9/29 (31%), Positives = 19/29 (65%)
Query: 169 MFDDDLEDNMDDDDDDDDDDETDDDNDED 197
+FDD+ +D ++ DD+ D++ TD+ +
Sbjct: 120 IFDDEGDDGIEFQDDESDEEATDEMLNLT 148
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 30.8 bits (69), Expect = 5.7
Identities = 20/68 (29%), Positives = 35/68 (51%), Gaps = 3/68 (4%)
Query: 196 EDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRS 255
ED++E E E E + D+D+ AE S + + + PK F ++ +K +K R+
Sbjct: 357 EDDDESEEIESECDPDEDKS-GAEALASIPQTLPDPPTVYGRPKVFARKADLKSTKKCRA 415
Query: 256 --TKPTVV 261
T P+V+
Sbjct: 416 IVTDPSVI 423
>gnl|CDD|222104 pfam13401, AAA_22, AAA domain.
Length = 124
Score = 29.2 bits (66), Expect = 5.7
Identities = 7/27 (25%), Positives = 12/27 (44%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTP 464
+ + G GK+TL+R L +
Sbjct: 6 IGVLTGESGSGKTTLLRRLARQLPNRR 32
>gnl|CDD|218576 pfam05395, DARPP-32, Protein phosphatase inhibitor 1/DARPP-32.
This family consists of several mammalian protein
phosphatase inhibitor 1 (IPP-1) and dopamine- and
cAMP-regulated neuronal phosphoprotein (DARPP-32)
proteins. Protein phosphatase inhibitor-1 is involved in
signal transduction and is an endogenous inhibitor of
protein phosphatase-1. It has been demonstrated that
DARPP-32, if phosphorylated, can inhibit
protein-phosphatase-1. DARPP-32 has a key role in many
neurotransmitter pathways throughout the brain and has
been shown to be involved in controlling receptors, ion
channels and other physiological factors including the
brain's response to drugs of abuse, such as cocaine,
opiates and nicotine. DARPP-32 is reciprocally regulated
by the two neurotransmitters that are most often
implicated in schizophrenia - dopamine and glutamate.
Dopamine activates DARPP-32 through the D1 receptor
pathway and disables DARPP-32 through the D2 receptor.
Glutamate, acting through the N-methyl-d-aspartate
receptor, renders DARPP-32 inactive. A mutant form of
DARPP-32 has been linked with gastric cancers.
Length = 170
Score = 29.9 bits (67), Expect = 5.9
Identities = 22/113 (19%), Positives = 44/113 (38%), Gaps = 24/113 (21%)
Query: 123 ISSDDSDDENDEDDTSDDDTESGQGSMKKYK--------------EQVVSSQDGRKRRKV 168
+ S+ S E DE+ T + + G M + +++V S +
Sbjct: 40 LVSEQSSPEIDEERTPNQ-LQKGSLQMSPSQRKNMAYTPPTMKELQRIVESHLQSQ---- 94
Query: 169 MFDDDLEDNMDDDDDDDDD--DETDDDNDEDNEEESEEEEENEEDDDEERKAE 219
LE+N +++D+ D DE E E+++ EE ++ ++E
Sbjct: 95 ---GSLEENQASEEEDELDSPDELGYPTGETESTEAQDSEEPGSCPTQDAQSE 144
>gnl|CDD|129833 TIGR00750, lao, LAO/AO transport system ATPase. In E. coli,
mutation of this kinase blocks phosphorylation of two
transporter system periplasmic binding proteins and
consequently inhibits those transporters. This kinase is
also found in Gram-positive bacteria, archaea, and the
roundworm C. elegans. It may have a more general, but
still unknown function. Mutations have also been found
that do not phosphorylate the periplasmic binding
proteins, yet still allow transport. The ATPase activity
of this protein seems to be necessary, however
[Transport and binding proteins, Amino acids, peptides
and amines, Regulatory functions, Protein interactions].
Length = 300
Score = 30.1 bits (68), Expect = 6.0
Identities = 11/30 (36%), Positives = 14/30 (46%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSV 467
V + G P GKSTL+ L + L V
Sbjct: 36 RVGITGTPGAGKSTLLEALGMELRRRGLRV 65
>gnl|CDD|184587 PRK14241, PRK14241, phosphate transporter ATP-binding protein;
Provisional.
Length = 258
Score = 30.1 bits (68), Expect = 6.0
Identities = 15/41 (36%), Positives = 22/41 (53%)
Query: 434 PPPIVVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
P V A +GP GKST++R L + P + ++G V L
Sbjct: 28 EPRSVTAFIGPSGCGKSTVLRTLNRMHEVIPGARVEGEVLL 68
>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
Length = 130
Score = 29.1 bits (65), Expect = 6.0
Identities = 12/44 (27%), Positives = 18/44 (40%)
Query: 178 MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKK 221
+D D +DE + E+ EE E N+E + E K
Sbjct: 1 DYNDMDMGYEDEPSEPEIEEGAEEDSESNNNDEVKGSIGENEDK 44
Score = 29.1 bits (65), Expect = 7.2
Identities = 11/51 (21%), Positives = 21/51 (41%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
D +D D E + E E E+ E N D+ + E ++ + + +
Sbjct: 1 DYNDMDMGYEDEPSEPEIEEGAEEDSESNNNDEVKGSIGENEDKEEQEPVQ 51
>gnl|CDD|223529 COG0452, Dfp, Phosphopantothenoylcysteine synthetase/decarboxylase
[Coenzyme metabolism].
Length = 392
Score = 30.4 bits (69), Expect = 6.1
Identities = 11/61 (18%), Positives = 23/61 (37%), Gaps = 3/61 (4%)
Query: 148 SMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEE 207
+++ ++ + D DD E + D D +D + + +E E +E E
Sbjct: 333 KLERKNLDLIVAND--VSEIGFGSDDNEVRIIDKDG-EDVELPEMSKEELAERILDEIAE 389
Query: 208 N 208
Sbjct: 390 L 390
>gnl|CDD|236442 PRK09270, PRK09270, nucleoside triphosphate hydrolase
domain-containing protein; Reviewed.
Length = 229
Score = 29.9 bits (68), Expect = 6.2
Identities = 10/20 (50%), Positives = 12/20 (60%)
Query: 437 IVVAVVGPPQVGKSTLIRCL 456
+V + GPP GKSTL L
Sbjct: 34 TIVGIAGPPGAGKSTLAEFL 53
>gnl|CDD|184597 PRK14270, PRK14270, phosphate ABC transporter ATP-binding protein;
Provisional.
Length = 251
Score = 29.9 bits (67), Expect = 6.2
Identities = 14/37 (37%), Positives = 20/37 (54%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTL 474
+ A++GP GKST +RCL + I+G V L
Sbjct: 32 ITALIGPSGCGKSTFLRCLNRMNDLISNVKIEGEVLL 68
>gnl|CDD|226961 COG4598, HisP, ABC-type histidine transport system, ATPase
component [Amino acid transport and metabolism].
Length = 256
Score = 30.1 bits (68), Expect = 6.4
Identities = 11/27 (40%), Positives = 18/27 (66%), Gaps = 2/27 (7%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTP 464
V++++G GKST +RC+ NF + P
Sbjct: 34 VISIIGSSGSGKSTFLRCI--NFLEKP 58
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 30.6 bits (69), Expect = 6.4
Identities = 21/116 (18%), Positives = 39/116 (33%), Gaps = 8/116 (6%)
Query: 198 NEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRSTK 257
S E + E ++ K ++K SK + ++ ++ S + K + +
Sbjct: 36 KNRNSHTENKMESGTNDNNKNKEKLSKLYSDVDSSSSEEEEDGSESISKLNVNSK-KISL 94
Query: 258 PTVVPQPRMSDKD----LAIRHKITSAL-AGLNSKEEEEEEQEDSAFSDESADDNS 308
V Q + D LA K++ A A EEE+ + D
Sbjct: 95 NQVSTQKWRKELDLLASLAYLQKLSGAFKAAALLNNEEEDLKY--TIDDVKFFARL 148
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 30.1 bits (68), Expect = 6.4
Identities = 15/54 (27%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 546 KTRAELMEKKRKLKEQFD-----AEYDDKDGGGNTYYDDLKTQATRQAELNRQQ 594
+ R E E+++ L+EQ A+Y D+ Y +L+ Q + EL + Q
Sbjct: 86 RARVEAEERRKTLQEQTQQEQQRAQYQDELAR-KRYQKELEQQRRQNEELLKMQ 138
>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
partitioning].
Length = 324
Score = 30.2 bits (68), Expect = 6.5
Identities = 24/122 (19%), Positives = 32/122 (26%), Gaps = 17/122 (13%)
Query: 163 RKRRKVMFDDDLEDNMDDDDDDDDDDETDDDN--------------DEDNEEESEEEEEN 208
RK R F D M DDD DE DD E E+ +
Sbjct: 28 RKERSSYFRDRPLKRMKSKRDDDPYDEVADDEGVGEVRVVRKNEAPQFTQEHEAARQSPQ 87
Query: 209 EEDDDEERKAEKKNSKNEHVQELTAKQKNPKAFTFQSVIKGERKFRST---KPTVVPQPR 265
+ E A+ K + Q + + R + PQP
Sbjct: 88 HQYQPEYASAQIKIPVPQPPQISDPPAHPQPTQPALDQEQPPEEARQPVLPQEAPAPQPV 147
Query: 266 MS 267
S
Sbjct: 148 HS 149
>gnl|CDD|165339 PHA03046, PHA03046, Hypothetical protein; Provisional.
Length = 142
Score = 29.3 bits (66), Expect = 6.5
Identities = 13/41 (31%), Positives = 20/41 (48%)
Query: 186 DDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNE 226
DDDE +D N+ + E+ + D EE + N +NE
Sbjct: 11 DDDEENDRNEHREKTSEEDGHYKKRLDVEEEEPNLINIRNE 51
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 30.7 bits (69), Expect = 6.5
Identities = 14/72 (19%), Positives = 32/72 (44%)
Query: 177 NMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQK 236
N D + DD+ + + E E+ +++EE D+ + E+ + +N + + K
Sbjct: 13 NNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKST 72
Query: 237 NPKAFTFQSVIK 248
++ V+K
Sbjct: 73 KEESKQLLEVLK 84
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 30.4 bits (68), Expect = 6.6
Identities = 24/120 (20%), Positives = 53/120 (44%), Gaps = 25/120 (20%)
Query: 138 SDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDED 197
DDD E+ S + + + RK + D + + ++ + ++D++ + + +
Sbjct: 110 HDDDEENLPSS-------IAPPEIDPEPRKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKV 162
Query: 198 NEEESEEEEENE--------EDDDEERKAEKKNSK----------NEHVQELTAKQKNPK 239
+E+SEEEEE E + + + +K K++ ++ +Q+ KQK K
Sbjct: 163 VDEKSEEEEEEELKTMKDFIDLESQTKKPSVKDNGKSFWSAASVFSKKLQKWRQKQKLKK 222
>gnl|CDD|221145 pfam11596, DUF3246, Protein of unknown function (DUF3246). This is
a small family of fungal proteins one of whose members
from Pichia stipitis is described as being an extremely
serine rich protein-mucin-like protein.
Length = 208
Score = 29.7 bits (66), Expect = 6.6
Identities = 19/85 (22%), Positives = 31/85 (36%), Gaps = 10/85 (11%)
Query: 108 SMMEIDGIHGAMSGAISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRK 167
S+ G + ++DD D +++E D T + G+ V + G +
Sbjct: 28 SVSTTAGSSTILPTGSATDDDDYDDEETDCEGQTTANPTGT-------VTTDPTGTTSQT 80
Query: 168 VMFDDDLEDNMDDDDDDDDDDETDD 192
V+ DDDDD ET
Sbjct: 81 VVPTKP---TTTDDDDDTTCVETTI 102
>gnl|CDD|234033 TIGR02857, CydD, thiol reductant ABC exporter, CydD subunit. The
gene pair cydCD encodes an ABC-family transporter in
which each gene contains an N-terminal membrane-spanning
domain (pfam00664) and a C-terminal ATP-binding domain
(pfam00005). In E. coli these genes were discovered as
mutants which caused the terminal heme-copper oxidase
complex cytochrome bd to fail to assemble. Recent work
has shown that the transporter is involved in export of
redox-active thiol compounds such as cysteine and
glutathione. The linkage to assembly of the cytochrome
bd complex is further supported by the conserved operon
structure found outside the gammaproteobacteria
(cydABCD) containing both the transporter and oxidase
genes components. The genes used as the seed members for
this model are all either found in the
gammproteobacterial context or the CydABCD context. All
members of this family scoring above trusted at the time
of its creation were from genomes which encode a
cytochrome bd complex. Unfortunately, the gene symbol
nomenclature adopted based on this operon in B. subtilis
assigns cydC to the third gene in the operon where this
gene is actually homologous to the E. coli cydD gene. We
have chosen to name all homologs in this family in
accordance with the precedence of publication of the E.
coli name, CydD.
Length = 529
Score = 30.3 bits (69), Expect = 6.6
Identities = 13/23 (56%), Positives = 15/23 (65%)
Query: 434 PPPIVVAVVGPPQVGKSTLIRCL 456
PP VA+VGP GKSTL+ L
Sbjct: 346 PPGERVALVGPSGAGKSTLLNLL 368
>gnl|CDD|165222 PHA02896, PHA02896, A-type inclusion like protein; Provisional.
Length = 616
Score = 30.4 bits (68), Expect = 6.7
Identities = 27/101 (26%), Positives = 42/101 (41%), Gaps = 10/101 (9%)
Query: 123 ISSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGR--KRRKVMFDDDLEDNMDD 180
I D+ D ND++D D + +D R + ++DD+
Sbjct: 394 IEEDNDDYNNDDEDDLIKDEAPILNRRHGQMDDKYDKRDHRYKNNKYDIYDDESPRYKYK 453
Query: 181 DDDDDDDDETDDD--------NDEDNEEESEEEEENEEDDD 213
DDD DD+D+ DDD N ED ++ +E + DDD
Sbjct: 454 DDDYDDNDDNDDDHIIPKKANNLEDAKDFEDEMMDAIADDD 494
>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking. Sec14-like
Golgi-trafficking domain The GOLD domain is always found
combined with lipid- or membrane-association domains.
Length = 136
Score = 29.3 bits (66), Expect = 6.7
Identities = 13/38 (34%), Positives = 20/38 (52%)
Query: 194 NDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQEL 231
E ++EE EEE E EE + + +A K+ V E+
Sbjct: 51 VSESSDEEEEEEAEEEEAETGDVEAGSKSQSRPLVDEI 88
>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
Length = 1320
Score = 30.7 bits (69), Expect = 6.8
Identities = 19/100 (19%), Positives = 40/100 (40%), Gaps = 3/100 (3%)
Query: 134 EDDTSDDDTESGQGSMKKYKEQV-VSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDD 192
D+ D++ E + ++++ QV + + V D +++ D + +
Sbjct: 506 VDEDGDEEMEIDEEAVERLCVQVGLQPAGAAEGNNV--DMGRVESIHSSDQQSIIKQGSE 563
Query: 193 DNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELT 232
D D D EE E+EE E + +N++N +
Sbjct: 564 DTDVDMEEAISEQEEKHEITIVDCAEPVRNTQNSLQIDTL 603
>gnl|CDD|223455 COG0378, HypB, Ni2+-binding GTPase involved in regulation of
expression and maturation of urease and hydrogenase
[Posttranslational modification, protein turnover,
chaperones / Transcription].
Length = 202
Score = 29.5 bits (67), Expect = 7.0
Identities = 12/37 (32%), Positives = 20/37 (54%), Gaps = 1/37 (2%)
Query: 435 PPIVVAVVGPPQVGKSTLIRCLIKNF-TKTPLSVIKG 470
P + + V GPP GK+ LI ++ + ++VI G
Sbjct: 12 PMLRIGVGGPPGSGKTALIEKTLRALKDEYKIAVITG 48
>gnl|CDD|235392 PRK05291, trmE, tRNA modification GTPase TrmE; Reviewed.
Length = 449
Score = 30.1 bits (69), Expect = 7.1
Identities = 8/16 (50%), Positives = 11/16 (68%)
Query: 437 IVVAVVGPPQVGKSTL 452
+ V + G P VGKS+L
Sbjct: 216 LKVVIAGRPNVGKSSL 231
>gnl|CDD|213188 cd03221, ABCF_EF-3, ATP-binding cassette domain of elongation
factor 3, subfamily F. Elongation factor 3 (EF-3) is a
cytosolic protein required by fungal ribosomes for in
vitro protein synthesis and for in vivo growth. EF-3
stimulates the binding of the EF-1: GTP: aa-tRNA ternary
complex to the ribosomal A site by facilitated release
of the deacylated tRNA from the E site. The reaction
requires ATP hydrolysis. EF-3 contains two ATP
nucleotide binding sequence (NBS) motifs. NBSI is
sufficient for the intrinsic ATPase activity. NBSII is
essential for the ribosome-stimulated functions.
Length = 144
Score = 29.0 bits (66), Expect = 7.1
Identities = 10/37 (27%), Positives = 15/37 (40%)
Query: 439 VAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLI 475
+ +VG GKSTL++ + V G I
Sbjct: 29 IGLVGRNGAGKSTLLKLIAGELEPDEGIVTWGSTVKI 65
>gnl|CDD|218889 pfam06088, TLP-20, Nucleopolyhedrovirus telokin-like protein-20
(TLP20). This family consists of several
Nucleopolyhedrovirus telokin-like protein-20 (TLP20)
sequences. The function of this family is unknown but
TLP20 is known to shares some antigenic similarities to
the smooth muscle protein telokin although the amino
acid sequence shows no homologies to telokin.
Length = 162
Score = 29.2 bits (66), Expect = 7.2
Identities = 12/44 (27%), Positives = 21/44 (47%)
Query: 187 DDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQE 230
++D+NDED E+ ++ DD + KK +E Q+
Sbjct: 119 HHSSEDENDEDEEDNADRAGIESGIDDSAPPSPKKQKLDESEQD 162
>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
trafficking and secretion].
Length = 340
Score = 29.9 bits (68), Expect = 7.3
Identities = 38/188 (20%), Positives = 66/188 (35%), Gaps = 24/188 (12%)
Query: 288 EEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEEDTGLNWKSD-LAQKAASAF 346
E+ E+ E+ + +EDE E + + GL+ + F
Sbjct: 1 MGFFEKLEEELEELEAEKEKIEEEDEEEEKEGWFERLK----QGLSKTKKNFGKGIKGLF 56
Query: 347 LERQANIVNLAKYVYGDME------DVSVTMEGDEAPNKKVHRKRQAELTAKQKNPKAFT 400
L++ ++ + ++E DV V E + + R+ E K+K T
Sbjct: 57 LKKIKEKLDED--LLEELEELLIEADVGV-----ETAEEIIEELRKREGK-KKKIKDEET 108
Query: 401 FQSVIKGERKFRRKEDIQAKKHHVPQVDRTPLEPPPIVVAVVGPPQVGKSTLIRCLIKNF 460
+ ++ +I V P E P V+ VG VGK+T I L K
Sbjct: 109 VKEALREAL-----IEILRPVDKVDLPLEIPKEKKPFVILFVGVNGVGKTTTIAKLAKYL 163
Query: 461 TKTPLSVI 468
+ SV+
Sbjct: 164 KQQGKSVL 171
>gnl|CDD|217373 pfam03115, Astro_capsid, Astrovirus capsid protein precursor. This
product is encoded by astrovirus ORF2, one of the three
astrovirus ORFs (1a, 1b, 2). The 87kD precursor protein
undergoes an intracellular cleavage to form a 79kD
protein. Subsequently, extracellular trypsin cleavage
yields the three proteins forming the infectious virion.
Length = 787
Score = 30.5 bits (69), Expect = 7.4
Identities = 20/91 (21%), Positives = 39/91 (42%), Gaps = 3/91 (3%)
Query: 180 DDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPK 239
++ +D + +D E ++E+E +EDD+ +R +S +E + +
Sbjct: 655 EERAVEDAPDPYEDLISLEETDTEDESTEDEDDELDRFDLHDSSGSEPEDDDENNRVTLL 714
Query: 240 AFTFQSVIKGERKFRSTK---PTVVPQPRMS 267
+ + ER R TK PT+ + R S
Sbjct: 715 STLINQGMSVERATRITKRAFPTLAEKTRRS 745
>gnl|CDD|177546 PHA03151, PHA03151, hypothetical protein; Provisional.
Length = 259
Score = 29.7 bits (66), Expect = 7.4
Identities = 35/158 (22%), Positives = 62/158 (39%), Gaps = 11/158 (6%)
Query: 124 SSDDSDDENDEDDTSDDDTESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDD 183
SD ++D++ + D+DT S S + +VSS D R ++D E
Sbjct: 95 VSDSNNDKDFDFKPQDEDTSSDDSSAPDFITSLVSS-DCEARGLSSSEEDGEPYSKQKMS 153
Query: 184 D--DDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKNPKAF 241
D +T++ E++ E+ ++EED E ++ ++ K+K K
Sbjct: 154 QPLTIDAKTEEITSEEDCCVQEDSSDSEEDVVEAFIRQR--------AQMAGKKKKGKRS 205
Query: 242 TFQSVIKGERKFRSTKPTVVPQPRMSDKDLAIRHKITS 279
S + RK R + + D RHK+T
Sbjct: 206 ISTSDDEPPRKSRRKRHSHRISSSTDSDDEEPRHKMTG 243
>gnl|CDD|226415 COG3899, COG3899, Predicted ATPase [General function prediction
only].
Length = 849
Score = 30.4 bits (69), Expect = 7.6
Identities = 13/33 (39%), Positives = 17/33 (51%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSVIKG 470
VV V G +GKS L+ + K T+ IKG
Sbjct: 26 VVLVAGESGIGKSALVNEVHKPITQQRGYFIKG 58
>gnl|CDD|221288 pfam11882, DUF3402, Domain of unknown function (DUF3402). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 350 to 473 amino acids in length. This domain is
found associated with pfam07923.
Length = 402
Score = 30.0 bits (68), Expect = 7.7
Identities = 10/53 (18%), Positives = 20/53 (37%), Gaps = 1/53 (1%)
Query: 181 DDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTA 233
D + D D E + E ED ++ + E+ ++ +E + L
Sbjct: 39 DTESLVGDPLDISESV-KELKLEMYTSLAEDQPKKEEIERLSTDSEEIPRLDR 90
>gnl|CDD|225618 COG3076, COG3076, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 135
Score = 29.0 bits (65), Expect = 7.8
Identities = 11/16 (68%), Positives = 13/16 (81%)
Query: 177 NMDDDDDDDDDDETDD 192
N +DDDDD+D DE DD
Sbjct: 116 NAEDDDDDEDVDEDDD 131
>gnl|CDD|183044 PRK11231, fecE, iron-dicitrate transporter ATP-binding subunit;
Provisional.
Length = 255
Score = 29.6 bits (67), Expect = 7.9
Identities = 10/25 (40%), Positives = 16/25 (64%)
Query: 434 PPPIVVAVVGPPQVGKSTLIRCLIK 458
P + A++GP GKSTL++C +
Sbjct: 26 PTGKITALIGPNGCGKSTLLKCFAR 50
>gnl|CDD|223272 COG0194, Gmk, Guanylate kinase [Nucleotide transport and
metabolism].
Length = 191
Score = 29.4 bits (67), Expect = 7.9
Identities = 12/30 (40%), Positives = 20/30 (66%), Gaps = 1/30 (3%)
Query: 438 VVAVVGPPQVGKSTLIRCLIKNFTKTPLSV 467
++ + GP VGKSTL++ L+++ K SV
Sbjct: 6 LIVLSGPSGVGKSTLVKALLEDD-KLRFSV 34
>gnl|CDD|213180 cd03213, ABCG_EPDR, Eye pigment and drug resistance transporter
subfamily G of the ATP-binding cassette superfamily.
ABCG transporters are involved in eye pigment (EP)
precursor transport, regulation of lipid-trafficking
mechanisms, and pleiotropic drug resistance (DR). DR is
a well-described phenomenon occurring in fungi and
shares several similarities with processes in bacteria
and higher eukaryotes. Compared to other members of the
ABC transporter subfamilies, the ABCG transporter family
is composed of proteins that have an ATP-binding
cassette domain at the N-terminus and a TM
(transmembrane) domain at the C-terminus.
Length = 194
Score = 29.4 bits (67), Expect = 8.0
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 438 VVAVVGPPQVGKSTLIRCL 456
+ A++GP GKSTL+ L
Sbjct: 37 LTAIMGPSGAGKSTLLNAL 55
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 28.5 bits (64), Expect = 8.2
Identities = 10/20 (50%), Positives = 14/20 (70%)
Query: 195 DEDNEEESEEEEENEEDDDE 214
++ EEE +EEE EE DD+
Sbjct: 83 ADEAEEEEKEEEAEEESDDD 102
Score = 28.1 bits (63), Expect = 9.7
Identities = 13/22 (59%), Positives = 14/22 (63%)
Query: 192 DDNDEDNEEESEEEEENEEDDD 213
+ DE EEE EEE E E DDD
Sbjct: 81 AEADEAEEEEKEEEAEEESDDD 102
>gnl|CDD|220924 pfam10986, DUF2796, Protein of unknown function (DUF2796). This
bacterial family of proteins has no known function.
Length = 161
Score = 29.2 bits (66), Expect = 8.3
Identities = 8/25 (32%), Positives = 12/25 (48%)
Query: 179 DDDDDDDDDDETDDDNDEDNEEESE 203
D D D D D D +D ++ S+
Sbjct: 83 DHDHDHHDHDHHDHHDDHEHAGHSD 107
>gnl|CDD|214991 smart01044, Btz, CASC3/Barentsz eIF4AIII binding. This domain is
found on CASC3 (cancer susceptibility candidate gene 3
protein) which is also known as Barentsz (Btz). CASC3 is
a component of the EJC (exon junction complex) which is
a complex that is involved in post-transcriptional
regulation of mRNA in metazoa. The complex is formed by
the association of four proteins (eIF4AIII, Barentsz,
Mago, and Y14), mRNA, and ATP. This domain wraps around
eIF4AIII and stacks against the 5' nucleotide.
Length = 106
Score = 28.2 bits (63), Expect = 8.4
Identities = 5/29 (17%), Positives = 10/29 (34%)
Query: 189 ETDDDNDEDNEEESEEEEENEEDDDEERK 217
D+ + + +DDE +K
Sbjct: 4 GDGDEKSAPVANKETGKMRRRLEDDEYKK 32
>gnl|CDD|217326 pfam03029, ATP_bind_1, Conserved hypothetical ATP binding protein.
Members of this family are found in a range of archaea
and eukaryotes and have hypothesised ATP binding
activity.
Length = 235
Score = 29.6 bits (67), Expect = 8.6
Identities = 17/70 (24%), Positives = 32/70 (45%), Gaps = 8/70 (11%)
Query: 441 VVGPPQVGKSTLIRCLIKNFTKTPLSVIKGPVTLIIKDSIRDCFVTGKWKASEDASELLR 500
VVGP GK+T + L + ++ V ++ D + ++A D EL+
Sbjct: 1 VVGPAGSGKTTFVGALSEIL-----PLLGRSVYVVNLDPAAENL---PYEADIDIRELIT 52
Query: 501 LDDMDDDEEL 510
+ D+ +D+ L
Sbjct: 53 VADVMEDDGL 62
>gnl|CDD|201520 pfam00945, Rhabdo_ncap, Rhabdovirus nucleocapsid protein. The
Nucleocapsid (N) Protein is said to have a "tight"
structure. The carboxyl end of the N-terminal domain
possesses an RNA binding domain. Sequence alignments
show 2 regions of reasonable conservation, approx.
64-103 and 201-329. A whole functional protein is
required for encapsidation to take place.
Length = 406
Score = 30.0 bits (68), Expect = 8.7
Identities = 12/51 (23%), Positives = 18/51 (35%), Gaps = 5/51 (9%)
Query: 272 AIRHKITSALA-GLNSKEEEEEEQEDSAFSDESADDNSSDED----ETVEP 317
A K T + K+ E +D +D D+ DE + VE
Sbjct: 329 AFGSKATLEQRFFRDEKKYTEYRDDDGDNTDAPLADDGDDEAPKSRDVVEW 379
>gnl|CDD|213197 cd03230, ABC_DR_subfamily_A, ATP-binding cassette domain of the
drug resistance transporter and related proteins,
subfamily A. This family of ATP-binding proteins
belongs to a multi-subunit transporter involved in drug
resistance (BcrA and DrrA), nodulation, lipid transport,
and lantibiotic immunity. In bacteria and archaea, these
transporters usually include an ATP-binding protein and
one or two integral membrane proteins. Eukaryotic
systems of the ABCA subfamily display ABC domains that
are quite similar to this family. The ATP-binding domain
shows the highest similarity between all members of the
ABC transporter family. ABC transporters are a subset of
nucleotide hydrolases that contain a signature motif,
Q-loop, and H-loop/switch region, in addition to, the
Walker A motif/P-loop and Walker B motif commonly found
in a number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 173
Score = 29.3 bits (67), Expect = 8.8
Identities = 7/19 (36%), Positives = 13/19 (68%)
Query: 438 VVAVVGPPQVGKSTLIRCL 456
+ ++GP GK+TLI+ +
Sbjct: 28 IYGLLGPNGAGKTTLIKII 46
>gnl|CDD|236876 PRK11191, PRK11191, RNase E inhibitor protein; Provisional.
Length = 138
Score = 28.8 bits (65), Expect = 8.9
Identities = 9/21 (42%), Positives = 12/21 (57%)
Query: 175 EDNMDDDDDDDDDDETDDDND 195
+ N +D DD+D DE DD
Sbjct: 117 DPNAEDGDDEDFVDEDDDGVR 137
>gnl|CDD|237040 PRK12289, PRK12289, GTPase RsgA; Reviewed.
Length = 352
Score = 30.0 bits (68), Expect = 9.0
Identities = 11/17 (64%), Positives = 12/17 (70%)
Query: 441 VVGPPQVGKSTLIRCLI 457
V GP VGKS+LI LI
Sbjct: 177 VAGPSGVGKSSLINRLI 193
>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal. This
domain family is found in eukaryotes, and is typically
between 99 and 114 amino acids in length. The family is
found in association with pfam08697, pfam01585. There
are two completely conserved residues (G and F) that may
be functionally important. TIP is involved in enamel
assembly by interacting with one of the major proteins
responsible for biomineralisation of enamel - tuftelin.
Length = 106
Score = 28.1 bits (63), Expect = 9.0
Identities = 22/75 (29%), Positives = 34/75 (45%), Gaps = 11/75 (14%)
Query: 126 DDSDDENDEDDTSDDDTESGQGSMKKYKEQV--VSSQDGRKRRKVMFDDDLEDNMDDDDD 183
DD DD+ E S + K Y + V VS + + D +D+
Sbjct: 41 DDDDDDEGERRRGRGRRRSKKK--KDYSKPVAFVSG-------GIKQAAKEPEEDDKEDE 91
Query: 184 DDDDDETDDDNDEDN 198
+DDDE+D+D+DED+
Sbjct: 92 SEDDDESDEDDDEDD 106
>gnl|CDD|227494 COG5165, POB3, Nucleosome-binding factor SPN, POB3 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 508
Score = 30.0 bits (67), Expect = 9.1
Identities = 10/45 (22%), Positives = 18/45 (40%)
Query: 171 DDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEE 215
D ED ++D+ E +D + + +EE + D E
Sbjct: 455 ISDSEDINMGSAGEEDESEDEDFQMVSDSDVAEEYDLQAALSDAE 499
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 30.0 bits (68), Expect = 9.1
Identities = 32/193 (16%), Positives = 64/193 (33%), Gaps = 31/193 (16%)
Query: 160 QDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDD----------NDEDNEEESEEEEENE 209
Q+ R+ R ++ D D+ + + +++ E E + E E
Sbjct: 611 QNNRRDR-----NERRDTRDNRTRREGRENREENRRNRRQAQQQTAETRESQQAEVTEKA 665
Query: 210 EDDDEER---KAEKKNSKNE--HVQELTAKQKNPKAFTFQSVIKGERKFRSTKPTVVPQP 264
DE++ + E++ +N+ + AK N + + Q + ER QP
Sbjct: 666 RTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQETEQEER-------VQQVQP 718
Query: 265 RMSDKDLAIRHKITSALAGLNSKEEEEEEQEDSAFSDESADDNSS----DEDETVEPKTH 320
R + L + +I ++A EE E + +
Sbjct: 719 RRKQRQLNQKVRIEQSVAEEAVAPVVEETVAAEPVVQEVPAPRTELVKVPLPVVAQTAPE 778
Query: 321 KDSKNPEEDTGLN 333
+D +N E+ N
Sbjct: 779 QDEENNAENRDNN 791
>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
Provisional.
Length = 719
Score = 30.2 bits (68), Expect = 9.2
Identities = 15/72 (20%), Positives = 29/72 (40%), Gaps = 1/72 (1%)
Query: 149 MKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETDDDNDEDNEEESEEEEEN 208
M VSS+ +K ++ D N + D+++DDD + E +E E
Sbjct: 405 MVNMASASVSSKKNQKPQESSPSLDQT-NAASRNSVMMDEDSDDDEEFQIPESEQEPETT 463
Query: 209 EEDDDEERKAEK 220
+ + + E+
Sbjct: 464 KNETKDTAMEEE 475
>gnl|CDD|213834 TIGR03597, GTPase_YqeH, ribosome biogenesis GTPase YqeH. This
family describes YqeH, a member of a larger family of
GTPases involved in ribosome biogenesis. Like YqlF, it
shows a cyclical permutation relative to GTPases EngA
(in which the GTPase domain is duplicated), Era, and
others. Members of this protein family are found in a
relatively small number of bacterial species, including
Bacillus subtilis but not Escherichia coli [Protein
synthesis, Other].
Length = 360
Score = 29.9 bits (68), Expect = 9.3
Identities = 12/26 (46%), Positives = 14/26 (53%)
Query: 439 VAVVGPPQVGKSTLIRCLIKNFTKTP 464
V VVG VGKS+LI L+K
Sbjct: 157 VYVVGVTNVGKSSLINKLLKQNNGDK 182
>gnl|CDD|234199 TIGR03410, urea_trans_UrtE, urea ABC transporter, ATP-binding
protein UrtE. Members of this protein family are ABC
transporter ATP-binding subunits associated with urea
transport and metabolism. This protein is found in a
conserved five-gene transport operon typically found
adjacent to urease genes. It was shown in Cyanobacteria
that disruption leads to the loss of high-affinity urea
transport activity [Transport and binding proteins,
Amino acids, peptides and amines].
Length = 230
Score = 29.4 bits (67), Expect = 9.3
Identities = 9/20 (45%), Positives = 14/20 (70%)
Query: 438 VVAVVGPPQVGKSTLIRCLI 457
V V+G VGK+TL++ L+
Sbjct: 28 VTCVLGRNGVGKTTLLKTLM 47
>gnl|CDD|235210 PRK04040, PRK04040, adenylate kinase; Provisional.
Length = 188
Score = 29.1 bits (66), Expect = 9.4
Identities = 9/20 (45%), Positives = 12/20 (60%)
Query: 435 PPIVVAVVGPPQVGKSTLIR 454
VV V G P VGK+T++
Sbjct: 1 MMKVVVVTGVPGVGKTTVLN 20
>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
Length = 196
Score = 29.1 bits (65), Expect = 9.6
Identities = 12/64 (18%), Positives = 31/64 (48%)
Query: 178 MDDDDDDDDDDETDDDNDEDNEEESEEEEENEEDDDEERKAEKKNSKNEHVQELTAKQKN 237
M++ + + + +E D N+ + E+ E E+E+ + + + + E Q+L K+
Sbjct: 1 MEEVEKEINKEEEKDVNNLSSNEQMEGPPEDEQAQQNQPQQQTVDEIEELKQKLQQKEVE 60
Query: 238 PKAF 241
+ +
Sbjct: 61 AQEY 64
>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
(DUF2052). This entry is of sequences of two conserved
domains separated by a region of low complexity,
spanning some 200 residues. The function is unknown.
Length = 178
Score = 28.9 bits (65), Expect = 9.8
Identities = 11/41 (26%), Positives = 21/41 (51%)
Query: 288 EEEEEEQEDSAFSDESADDNSSDEDETVEPKTHKDSKNPEE 328
EEEE+ +E++ +E E+E + ++ + PEE
Sbjct: 85 EEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELPTPEE 125
>gnl|CDD|233208 TIGR00956, 3a01205, Pleiotropic Drug Resistance (PDR) Family
protein. [Transport and binding proteins, Other].
Length = 1394
Score = 30.1 bits (68), Expect = 9.9
Identities = 17/71 (23%), Positives = 34/71 (47%), Gaps = 3/71 (4%)
Query: 142 TESGQGSMKKYKEQVVSSQDGRKRRKVMFDDDLEDNMDDDDDDDDDDETD-DDNDEDNEE 200
TE +G+ +K +++ + G +R + N +D + + TD D +D +
Sbjct: 690 TEFNKGAKQK--GEILVFRRGSLKRAKKAGETSASNKNDIEAGEVLGSTDLTDESDDVND 747
Query: 201 ESEEEEENEED 211
E + E+E+ ED
Sbjct: 748 EKDMEKESGED 758
>gnl|CDD|235865 PRK06814, PRK06814, acylglycerophosphoethanolamine acyltransferase;
Provisional.
Length = 1140
Score = 29.9 bits (68), Expect = 9.9
Identities = 11/50 (22%), Positives = 24/50 (48%)
Query: 755 KLKLTGVPMKIYKKTAFIKDMFNSTLEVAKFEGAKIRTVSGIRGQIKKAL 804
++ +TG MKIY I D + + + +G + S ++ Q+++
Sbjct: 533 RITVTGSLMKIYDGPGMIADKAGAMVVPVRIDGLQFTHFSRLKNQVRRKW 582
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.132 0.385
Gapped
Lambda K H
0.267 0.0663 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 60,290,475
Number of extensions: 6064000
Number of successful extensions: 21335
Number of sequences better than 10.0: 1
Number of HSP's gapped: 14958
Number of HSP's successfully gapped: 1066
Length of query: 1182
Length of database: 10,937,602
Length adjustment: 108
Effective length of query: 1074
Effective length of database: 6,147,370
Effective search space: 6602275380
Effective search space used: 6602275380
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 65 (29.0 bits)