RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8152
(444 letters)
>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly; typically contains a GH dipeptide 11-24
residues from its N-terminus and the WD dipeptide at its
C-terminus and is 40 residues long, hence the name WD40;
between GH and WD lies a conserved core; serves as a
stable propeller-like platform to which proteins can
bind either stably or reversibly; forms a propeller-like
structure with several blades where each blade is
composed of a four-stranded anti-parallel b-sheet;
instances with few detectable copies are hypothesized to
form larger structures by dimerization; each WD40
sequence repeat forms the first three strands of one
blade and the last strand in the next blade; the last
C-terminal WD40 repeat completes the blade structure of
the first WD40 repeat to create the closed ring
propeller-structure; residues on the top and bottom
surface of the propeller are proposed to coordinate
interactions with other proteins and/or small ligands; 7
copies of the repeat are present in this alignment.
Length = 289
Score = 67.7 bits (166), Expect = 1e-12
Identities = 49/277 (17%), Positives = 105/277 (37%), Gaps = 35/277 (12%)
Query: 42 KASVLALNWGNVEESELLIAYSGQK-IKIYDTDFKNFNTNTDVSFGEGDIVGV-----SR 95
V + + + +LL SG IK++D + T + +G V S
Sbjct: 9 TGGVTCVAF--SPDGKLLATGSGDGTIKVWDLE-----TGELLRTLKGHTGPVRDVAASA 61
Query: 96 YDECILTAVKSGDVKLWRYKTEPDQWTNFNVGPQLECMKHNKHSNIIATGGKENDLKLWD 155
+ + ++LW +T T + + + I+++ ++ +K+WD
Sbjct: 62 DGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWD 121
Query: 156 LNTTKNIFSAKNVSHRVLVYSVKPDEFDLRRPVWVNGISFLPNSDKVAVCSKQGYVRVYD 215
+ T K + + + + WVN ++F P+ VA S+ G ++++D
Sbjct: 122 VETGKCLTTLRGHTD------------------WVNSVAFSPDGTFVASSSQDGTIKLWD 163
Query: 216 PKITTQRRPVLKVDIPDVALSCITTTNQSHYVIVGSTTGDMMAVDLRGKGKPCYKYKGFA 275
+ + V + ++ + + ++ S+ G + DL GK +G
Sbjct: 164 LR---TGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLS-TGKCLGTLRGHE 219
Query: 276 GCVKDIACPLNENLILSVSLDRHFRVHDFHSKKLLRK 312
V +A + L+ S S D RV D + + ++
Sbjct: 220 NGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQT 256
Score = 33.5 bits (77), Expect = 0.17
Identities = 36/203 (17%), Positives = 84/203 (41%), Gaps = 36/203 (17%)
Query: 24 SKDKCI-IKNIQNVKYITN----KASVLALNWGNVEESELLIAYSGQ--KIKIYDTDFKN 76
S+DK I + +++ K +T V ++ + +A S Q IK++D +
Sbjct: 112 SRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFS---PDGTFVASSSQDGTIKLWD--LRT 166
Query: 77 FNTNTDVSFGEGDIVGVSRYD--ECILTAVKSGDVKLWRYKTEPDQWTNFNVG--PQLEC 132
++ G++ V+ E +L++ G +KLW T + G +
Sbjct: 167 GKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLST--GKCLGTLRGHENGVNS 224
Query: 133 MKHNKHSNIIATGGKENDLKLWDLNTTKNIFSAKNVSHRVLVYSVKPDEFDLRRPVWVNG 192
+ + ++A+G ++ +++WDL T + + + ++ V
Sbjct: 225 VAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS------------------VTS 266
Query: 193 ISFLPNSDKVAVCSKQGYVRVYD 215
+++ P+ ++A S G +R++D
Sbjct: 267 LAWSPDGKRLASGSADGTIRIWD 289
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 51.1 bits (122), Expect = 1e-06
Identities = 30/114 (26%), Positives = 51/114 (44%), Gaps = 7/114 (6%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
E+E + + E E KGD + EH + ++ E ++ D + + + +
Sbjct: 778 EDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQEL 837
Query: 391 I--DNDEDKQDESEEDGEEEG-----EEEASENEYYTETDEDEKENEEDDGESD 437
+ E KQDE DG EEE E E E +E+E+E EE++ E++
Sbjct: 838 NAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 45.8 bits (108), Expect = 4e-05
Identities = 31/101 (30%), Positives = 49/101 (48%), Gaps = 10/101 (9%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
E+E + E E E +G S+ + D + +E AE++ DE ++
Sbjct: 802 EHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAK----QDEKGVDGGGG 857
Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
D + +++E EE+ EEE EEE E E E+E+ENEE
Sbjct: 858 SDGGDSEEEEEEEEEEEEEEEEEEEEE------EEEEENEE 892
Score = 44.6 bits (105), Expect = 1e-04
Identities = 26/118 (22%), Positives = 44/118 (37%), Gaps = 9/118 (7%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDI----- 385
E E + E + E +GD E E E+ E + ++ DE
Sbjct: 744 EGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEH 803
Query: 386 -EIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENE---EDDGESDGG 439
+ + DE + + + E ++E E E E + K++E + G SDGG
Sbjct: 804 EGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGG 861
Score = 44.6 bits (105), Expect = 1e-04
Identities = 30/120 (25%), Positives = 55/120 (45%), Gaps = 7/120 (5%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDES-DIEIVD 389
E+ + E++ +E KG+++ + + + E E+K + + + E + E
Sbjct: 662 ESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGET 721
Query: 390 EIDNDEDKQD-ESEEDGEEEGEEEASE----NEYYTETDEDEKENE-EDDGESDGGRRRG 443
E + ED+ + E+ E+GEE +E E +E TE D E E+E E + E G
Sbjct: 722 EAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEG 781
Score = 41.9 bits (98), Expect = 6e-04
Identities = 30/111 (27%), Positives = 51/111 (45%), Gaps = 8/111 (7%)
Query: 331 ENETKKE-EQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVD 389
E E E E E+ EE G + E + EN +E + + E + EI
Sbjct: 648 EGERPTEAEGENGEESGG-----EAEQEGETETKGENESEGEIPAERKG-EQEGEGEIEA 701
Query: 390 EIDNDEDKQDESEEDGEEEGEEEASENEYYTETDED-EKENEEDDGESDGG 439
+ + + + + E + E E E E +E+E ET E+ E+ +E +GE++G
Sbjct: 702 KEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGK 752
Score = 41.1 bits (96), Expect = 0.001
Identities = 29/114 (25%), Positives = 48/114 (42%), Gaps = 8/114 (7%)
Query: 331 ENETKKEEQEHSEEKGDSDEK--YDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
+ E + +E +G DE E E + E AE K T E++ E
Sbjct: 710 TEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGE 769
Query: 389 DEIDNDEDKQD---ESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
E + ED+ + ++ EDGE +G+E A E E + + + E+D+ E
Sbjct: 770 TEAEGKEDEDEGEIQAGEDGEMKGDEGA---EGKVEHEGETEAGEKDEHEGQSE 820
Score = 38.1 bits (88), Expect = 0.009
Identities = 26/100 (26%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 345 KGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEED 404
KGD E + R AE + N S + E + E + +N+ + + +E
Sbjct: 632 KGDVAEAEHTGERTGEEGERPTEAEGE-NGEESGGEAEQEGETETKGENESEGEIPAERK 690
Query: 405 GEEEGEEEASENEYYTETDEDEKENE-EDDGESDGGRRRG 443
GE+EGE E E + + + +E E E + E++G G
Sbjct: 691 GEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEG 730
Score = 35.7 bits (82), Expect = 0.057
Identities = 23/118 (19%), Positives = 46/118 (38%), Gaps = 4/118 (3%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQV---TRENYAESKTNKNTSDYDDESDIEI 387
E E + E + + E + EH + E E+ + + E + E
Sbjct: 692 EQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEG 751
Query: 388 VDEIDNDEDKQDESEEDGEE-EGEEEASENEYYTETDEDEKENEEDDGESDGGRRRGS 444
E++ + D+++ E E EG+E+ E E D + K +E +G+ + +
Sbjct: 752 KHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEA 809
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 48.7 bits (116), Expect = 4e-06
Identities = 30/104 (28%), Positives = 45/104 (43%), Gaps = 8/104 (7%)
Query: 337 EEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDED 396
E++ + G + E D + F +V R+ + T D E D DE D D+D
Sbjct: 185 EDKAGEDLDGLAAEIDDQQAF--ARVVRDMLGSMDMAEETGDDGIEED---ADEEDGDDD 239
Query: 397 --KQDESEEDGEEEGE-EEASENEYYTETDEDEKENEEDDGESD 437
+E E G EE E + SE + TD + +E E D E+
Sbjct: 240 QPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEAS 283
Score = 38.7 bits (90), Expect = 0.006
Identities = 20/81 (24%), Positives = 42/81 (51%), Gaps = 7/81 (8%)
Query: 364 RENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESE--EDGEEEGEEEASENEYYTE 421
E+ E + + D +++S+ + +DE ++DE+E + EEGE +A+E
Sbjct: 228 EEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAE-----A 282
Query: 422 TDEDEKENEEDDGESDGGRRR 442
+++ E + ++D E+ G R
Sbjct: 283 SEDSESDESDEDTETPGEDAR 303
Score = 33.3 bits (76), Expect = 0.32
Identities = 26/140 (18%), Positives = 49/140 (35%), Gaps = 27/140 (19%)
Query: 324 VRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRE------------------ 365
V + S E K ++ S+ D P + + RE
Sbjct: 124 VAKNLSAMLEEKYDKANFSKRTDKED---APLEEAVALLVREKLTGDAPPDSAGKVLDLW 180
Query: 366 -NYAESKTNKNTSDYDDESDIE-----IVDEIDNDEDKQDESEEDGEEEGEEEASENEYY 419
+ E K ++ E D + +V ++ D +E+ +DG EE +E ++
Sbjct: 181 RDEIEDKAGEDLDGLAAEIDDQQAFARVVRDMLGSMDMAEETGDDGIEEDADEEDGDDDQ 240
Query: 420 TETDEDEKENEEDDGESDGG 439
+ +ED + E+ SD
Sbjct: 241 PDNNEDSEAGREESEGSDES 260
Score = 31.0 bits (70), Expect = 1.5
Identities = 22/115 (19%), Positives = 48/115 (41%), Gaps = 10/115 (8%)
Query: 315 MQSRLTSILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTREN-----YAE 369
+R+ ++ S + ET + E ++ D D+ P++ E ++ RE +E
Sbjct: 204 AFARVVRDMLGSM-DMAEETGDDGIEEDADEEDGDDD-QPDNNEDSEAGREESEGSDESE 261
Query: 370 SKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDE 424
+ T +E +++ + ED + + ++ E E+A +TE E
Sbjct: 262 EDEAEATDGEGEEGEMDAAEA---SEDSESDESDEDTETPGEDARPATPFTELME 313
>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
only].
Length = 466
Score = 47.0 bits (110), Expect = 1e-05
Identities = 39/255 (15%), Positives = 82/255 (32%), Gaps = 30/255 (11%)
Query: 37 KYITNKASVLALNWGNVEESELLIAYSGQKIKIYDTDFKNFNTNTDVSFGEGDIVGVSRY 96
+ V +L + + S I+++D +T + + S
Sbjct: 193 TLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPD 252
Query: 97 DECILTAVKSGDVKLWRYKTEPDQWTNFNVGPQ-LECMKHNKHSNIIATGGKENDLKLWD 155
+ + G ++LW ++ + + + + ++A+G + ++LWD
Sbjct: 253 GSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWD 312
Query: 156 LNTTKNIFSAKNVSHRVLVYSVK--------------------------PDEFDLRRPVW 189
L T K + S H V S+ L
Sbjct: 313 LETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHSN 372
Query: 190 VNGISFLPNSDKVAVCSKQGYVRVYDPKITTQRRPVLKVDIPDVALSCITTTNQSHYVIV 249
V +SF P+ V+ S G VR++D + R +D ++ + + +
Sbjct: 373 VLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLR---NLDGHTSRVTSLDFSPDGKSLAS 429
Query: 250 GSTTGDMMAVDLRGK 264
GS+ + DL+
Sbjct: 430 GSSDNTIRLWDLKTS 444
Score = 46.2 bits (108), Expect = 2e-05
Identities = 32/184 (17%), Positives = 66/184 (35%), Gaps = 16/184 (8%)
Query: 132 CMKHNKHSNIIATGGKENDLKLWDLNTTKNIFSAKNVSHRVLVYSVKPDEFDLRRPVWVN 191
+ + ++ +G + +KLWDL+ + + + H V +
Sbjct: 70 SIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLA------------- 116
Query: 192 GISFLPNSDKVAVCSKQGYVRVYDPKITTQRRPVLKVDIPDVALSCITTTNQSHYVIVGS 251
P+ + + + S V ++T + + ++ +++ + + + GS
Sbjct: 117 --LSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGS 174
Query: 252 TTGDMMAVDLRGKGKPCYKYKGFAGCVKDIA-CPLNENLILSVSLDRHFRVHDFHSKKLL 310
+ + + GKP G V +A P LI S S D R+ D + KLL
Sbjct: 175 SLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLL 234
Query: 311 RKDY 314
R
Sbjct: 235 RSTL 238
Score = 46.2 bits (108), Expect = 2e-05
Identities = 58/283 (20%), Positives = 101/283 (35%), Gaps = 38/283 (13%)
Query: 70 YDTDFKNFNTNTDVSFGEGDIV---GVSRYDECILTAVKSGDVKLWRYKTEPDQWTNFNV 126
D+ + ++ + G D + S E +L+ G +KLW +
Sbjct: 47 SDSLVSLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEG 106
Query: 127 GP-----QLECMKHNKHSNIIATGGKENDLKLWDLNT-TKNIFSAKNVSHRVLVYSVKPD 180
+L + +S ++A+ + +KLWDL+T K I + + S V + PD
Sbjct: 107 LHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPD 166
Query: 181 E---------------FDLRRPV----------WVNGISFLPNSDK-VAVCSKQGYVRVY 214
+DLR V+ ++F P+ +A S G +R++
Sbjct: 167 GKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLW 226
Query: 215 DPKITTQRRPVLKVDIPDVALSCITTTNQSHYVIVGSTTGDMMAVDLRGKGKPCYKYKGF 274
D R L V S + + GS+ G + DLR G
Sbjct: 227 DLSTGKLLRSTLSGHSDSVVSSF---SPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGH 283
Query: 275 AGCVKDIACPLNENLILSVSLDRHFRVHDFHSKKLLRKDYMQS 317
+ V +A + L+ S S D R+ D + KLL ++
Sbjct: 284 SSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKG 326
Score = 39.7 bits (91), Expect = 0.003
Identities = 27/167 (16%), Positives = 59/167 (35%), Gaps = 3/167 (1%)
Query: 149 NDLKLWDLNTTKNIFSAKNVSHRVLVYSVKPDEFDLRRPVWVNGISFLPNSDKVAVCSKQ 208
+ L ++ I +S ++ + I+F P+ + + S
Sbjct: 27 SLSLLSLGSSESGILLLALLSDSLVSLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSD 86
Query: 209 GYVRVYDPKITTQRRPVLKVDIPDVALSCITTTNQSHYVIVGSTTGDMMAV--DLRGKGK 266
G ++++D + L+ ++ + +++ S++ D DL GK
Sbjct: 87 GTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGK 146
Query: 267 PCYKYKGFAGCVKDIA-CPLNENLILSVSLDRHFRVHDFHSKKLLRK 312
+G + V +A P + L SLD ++ D + K L
Sbjct: 147 LIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLST 193
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 46.5 bits (111), Expect = 2e-05
Identities = 28/105 (26%), Positives = 56/105 (53%), Gaps = 12/105 (11%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
E E +EE++ E D D++++P+ +N+ + ++ + +D D E DE
Sbjct: 285 EEEDDEEEEDSKESADDLDDEFEPD-------DDDNFGLGQGEEDEEEEEDGVDDE--DE 335
Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
D+D+D ++E E+ + EE+ + + DED++E EE++ E
Sbjct: 336 EDDDDDLEEEEEDVDLSDEEEDEEDED---SDDEDDEEEEEEEKE 377
Score = 45.8 bits (109), Expect = 4e-05
Identities = 25/109 (22%), Positives = 49/109 (44%), Gaps = 6/109 (5%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
E E KEE E ++ + M ++ E + ++ D DDE + + D
Sbjct: 260 EEELAKEEAERLKKLEAERLR------RMRGEEEDDEEEEDSKESADDLDDEFEPDDDDN 313
Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
+ ++DE EE+ + E+E +++ E +ED ++E++ E D
Sbjct: 314 FGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDED 362
Score = 41.5 bits (98), Expect = 8e-04
Identities = 26/104 (25%), Positives = 55/104 (52%), Gaps = 7/104 (6%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVT--RENYAESKTNKNTSDYDD-----ES 383
++E +++ +E +++ D E D ++F + Q E + +++ D DD E
Sbjct: 288 DDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEE 347
Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEK 427
D+++ DE +++ED+ + E+D EEE EE+ + + E+ E
Sbjct: 348 DVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 44.6 bits (105), Expect = 1e-04
Identities = 32/115 (27%), Positives = 48/115 (41%), Gaps = 10/115 (8%)
Query: 331 ENETKK--EEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
ENE E+ + E++GD + D E +M + EN E+ + DE +E
Sbjct: 3983 ENEDLDLPEDLKLDEKEGDVSKDSDLEDMDM-EAADENKEEAD-AEKDEPMQDEDPLEEN 4040
Query: 389 DEIDNDEDKQD-----ESEEDGEEEG-EEEASENEYYTETDEDEKENEEDDGESD 437
+ +D D + D E +E E+G EE ENE TE E E +
Sbjct: 4041 NTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPE 4095
Score = 40.8 bits (95), Expect = 0.002
Identities = 25/112 (22%), Positives = 51/112 (45%), Gaps = 7/112 (6%)
Query: 329 SLENETKKEEQEHSEEKGDSDEKYDPEHFE----MNQVTRENYAESKTNKNTSDYDDESD 384
+L + EE + E+ + + + E E MN VT+++ N ++ + + + D
Sbjct: 3830 TLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLD 3889
Query: 385 IEIVDEIDNDEDKQDES---EEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
E+ D ++ + +E E + E+ E E NE +E + ++EDD
Sbjct: 3890 EEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDD 3941
Score = 38.1 bits (88), Expect = 0.010
Identities = 28/114 (24%), Positives = 48/114 (42%), Gaps = 11/114 (9%)
Query: 336 KEEQEHSEEK----GDSDEKYDPEHFEMNQVTRENYA-----ESKTNKNTSDYDDESDIE 386
++E+E EE G DE +Q EN + K ++ D +SD+E
Sbjct: 3950 RQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLE 4009
Query: 387 IVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGR 440
+D DE+K++ E E +E+ E DED ++++ D D +
Sbjct: 4010 DMDMEAADENKEEADAEKDEPMQDEDPLEEN--NTLDEDIQQDDFSDLAEDDEK 4061
Score = 36.5 bits (84), Expect = 0.031
Identities = 25/104 (24%), Positives = 47/104 (45%), Gaps = 3/104 (2%)
Query: 336 KEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDE 395
KE+ + E D EK D E + V ++ + +N S E++ + E +
Sbjct: 3938 KEDDNKALEDKDRQEKEDEEEMS-DDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLD 3996
Query: 396 DKQDESEEDG--EEEGEEEASENEYYTETDEDEKENEEDDGESD 437
+K+ + +D E+ E A EN+ + ++DE +ED E +
Sbjct: 3997 EKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEEN 4040
Score = 35.0 bits (80), Expect = 0.10
Identities = 31/119 (26%), Positives = 45/119 (37%), Gaps = 14/119 (11%)
Query: 329 SLENETKKEEQEHSEEKGD---SDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDI 385
LE +E ++ D DEK + + FE N E ES + SD + E
Sbjct: 4036 PLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENE---ESTEDGVKSDEELEQGE 4092
Query: 386 EIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRGS 444
D+ ++ K D E +EE TD+ E+ GE DG R G+
Sbjct: 4093 VPEDQAIDNHPKMDAKSTFASAEADEEN--------TDKGIVGENEELGEEDGVRGNGT 4143
Score = 34.2 bits (78), Expect = 0.16
Identities = 36/152 (23%), Positives = 58/152 (38%), Gaps = 30/152 (19%)
Query: 315 MQSRLTSILVRSDFSLENE---TKKEEQEHSEEKGDS-------------DEKYDPEHFE 358
++S + + S S ENE +++E Q+ EE D DE + + E
Sbjct: 3860 LESDMNGVTKDSVVS-ENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLE 3918
Query: 359 MNQVTRENYAESKTNKNTSDYDDESDIEIVD--------EIDNDEDKQDESEEDGEEEGE 410
Q + E A + + S DD +E D E+ +D DE + D +E
Sbjct: 3919 TEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNS 3978
Query: 411 EEASENEYY-----TETDEDEKENEEDDGESD 437
+ ENE + DE E + +D D
Sbjct: 3979 QPPPENEDLDLPEDLKLDEKEGDVSKDSDLED 4010
Score = 33.4 bits (76), Expect = 0.32
Identities = 17/63 (26%), Positives = 33/63 (52%), Gaps = 3/63 (4%)
Query: 375 NTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDG 434
T+ +++ D+E E+ N+ED ++S+ D E E E+ N ++ E EN + +
Sbjct: 3827 ITNTLNEDDDLE---ELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEE 3883
Query: 435 ESD 437
E+
Sbjct: 3884 ENQ 3886
Score = 31.5 bits (71), Expect = 1.1
Identities = 13/59 (22%), Positives = 27/59 (45%), Gaps = 1/59 (1%)
Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTET-DEDEKENEEDDGESD 437
D + + D+++ +++D + + +E E E++ T D ENE D E +
Sbjct: 3826 DITNTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEE 3884
Score = 29.2 bits (65), Expect = 5.5
Identities = 27/107 (25%), Positives = 43/107 (40%), Gaps = 21/107 (19%)
Query: 332 NETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEI 391
N E+ E +EK D + ++ K+ DYDD+ I E
Sbjct: 2922 NINVSEQIEEEDEKDD-----------EYGEQEKVSDKNDYLKDFIDYDDDGSISTESEP 2970
Query: 392 DNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
+ +E ++ ES DG N+Y E D+DEK +E + E +
Sbjct: 2971 E-EEKRECESGFDGW---------NDYEEEEDDDEKNEKECELEQNA 3007
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 43.5 bits (102), Expect = 3e-04
Identities = 20/55 (36%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 381 DESDIEIVDE--IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
E D+ D ID+D++ +DE ++D E++ EEE E E DEDE++ +D
Sbjct: 141 AEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGED 195
Score = 29.6 bits (66), Expect = 4.9
Identities = 16/65 (24%), Positives = 36/65 (55%), Gaps = 6/65 (9%)
Query: 380 DDESDIEIVDEIDNDEDKQDESEED----GEEEGEEEASENEYYTETDEDEKE--NEEDD 433
DD+ D + D+ + D+++++E EE+ +E+ E+E E+ Y +++ D+ + +
Sbjct: 155 DDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTDCFKFIEA 214
Query: 434 GESDG 438
G D
Sbjct: 215 GAGDD 219
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 42.7 bits (101), Expect = 3e-04
Identities = 16/66 (24%), Positives = 34/66 (51%), Gaps = 2/66 (3%)
Query: 379 YDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
+DD E + + +D + EED +++ E + + E + ED++++++DD + D
Sbjct: 101 FDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEE--SSKSEDDEDDDDDDDDDDI 158
Query: 439 GRRRGS 444
R S
Sbjct: 159 ATRERS 164
Score = 29.2 bits (66), Expect = 5.8
Identities = 13/68 (19%), Positives = 29/68 (42%), Gaps = 8/68 (11%)
Query: 375 NTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDG 434
D D E D+D+D++ + E++ + E++ E D+D+ ++++
Sbjct: 109 ELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDD--------EDDDDDDDDDDIAT 160
Query: 435 ESDGGRRR 442
RR
Sbjct: 161 RERSLERR 168
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 42.6 bits (100), Expect = 4e-04
Identities = 18/71 (25%), Positives = 38/71 (53%)
Query: 369 ESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKE 428
S + + + + E + + +E + +E++++E +E EEEGE+E E E + +E+
Sbjct: 427 TSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEM 486
Query: 429 NEEDDGESDGG 439
+G+ DG
Sbjct: 487 EGSSEGDGDGE 497
Score = 40.7 bits (95), Expect = 0.001
Identities = 21/114 (18%), Positives = 48/114 (42%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
+++T++EE+ +E+ T + ++ + E + E +E
Sbjct: 395 QDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEE 454
Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRGS 444
+ +E + +E E + EEE EE ++N E + + + + + + RR S
Sbjct: 455 EEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNS 508
Score = 33.0 bits (75), Expect = 0.34
Identities = 21/113 (18%), Positives = 48/113 (42%), Gaps = 4/113 (3%)
Query: 329 SLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEI- 387
S E+ + ++ EE + +E+ + E E Q + E E + + + D+ S+ E+
Sbjct: 428 SGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Query: 388 ---VDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
+ D +E ++D + E G SE + + + +E+ + +
Sbjct: 488 GSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQEEPLQPE 540
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 41.4 bits (97), Expect = 5e-04
Identities = 36/126 (28%), Positives = 54/126 (42%), Gaps = 9/126 (7%)
Query: 314 YMQSRLTSILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTN 373
Y + S +V E+E + + GD D + + + Y N
Sbjct: 9 YASFGVNSAVVTGSTPTEHEQNMLALDVAARDGD-----DAIELAEPETSDDPYG----N 59
Query: 374 KNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
+ DDE IE+ D +E++ +E EED EEEGEEE+ E E +T E+ E E
Sbjct: 60 PDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTEASEQL 119
Query: 434 GESDGG 439
E + G
Sbjct: 120 EEHEEG 125
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 40.5 bits (95), Expect = 7e-04
Identities = 23/75 (30%), Positives = 36/75 (48%)
Query: 361 QVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYT 420
+ E+K + + + E E E N EDK+D+ + + E+E EE +E E
Sbjct: 42 DQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101
Query: 421 ETDEDEKENEEDDGE 435
+DE+EKE EE
Sbjct: 102 SSDENEKETEEKTES 116
Score = 34.7 bits (80), Expect = 0.059
Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 10/91 (10%)
Query: 342 SEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDES 401
++ +DE+ + + E E + K ++ +D+ D ++ D +ES
Sbjct: 40 PSDQAAADEQEAKKSDDQETAEIEEVKEEE--KEAANSEDKEDKGDAEKED------EES 91
Query: 402 EEDGEEEGEEEASENEYYTETDEDEKENEED 432
EE+ EEE EE + ENE ET+E + N E
Sbjct: 92 EEENEEEDEESSDENE--KETEEKTESNVEK 120
Score = 30.9 bits (70), Expect = 0.98
Identities = 20/107 (18%), Positives = 46/107 (42%), Gaps = 13/107 (12%)
Query: 319 LTSILVRSDFSLE-NETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTS 377
+ S S + ++E ++ +++ E+ E E +E K +K +
Sbjct: 31 AYQLFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAAN------SEDKEDKGDA 84
Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDG-EEEGEEEASENEYYTETD 423
+ +DE E +N+E+ ++ S+E+ E E + E++ + T
Sbjct: 85 EKEDEES-----EEENEEEDEESSDENEKETEEKTESNVEKEITNPS 126
Score = 28.2 bits (63), Expect = 7.0
Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 4/62 (6%)
Query: 380 DDESDIEIVDEIDNDEDKQDE--SEEDGEEEGEEEA--SENEYYTETDEDEKENEEDDGE 435
SD DE + + E E+ +EE +E A + E + +++++E+EE++ E
Sbjct: 38 SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97
Query: 436 SD 437
D
Sbjct: 98 ED 99
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 41.5 bits (97), Expect = 7e-04
Identities = 27/80 (33%), Positives = 39/80 (48%), Gaps = 8/80 (10%)
Query: 361 QVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDED--KQDESEEDGEEEGE-EEASENE 417
+V RE + + D D E DE D D+D ++E EE GE EGE +E S +
Sbjct: 188 RVVREMLRSMELAEEMGD-----DTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQ 242
Query: 418 YYTETDEDEKENEEDDGESD 437
TD + + EE+ +SD
Sbjct: 243 ESEATDRESESGEEEMVQSD 262
Score = 41.1 bits (96), Expect = 0.001
Identities = 24/103 (23%), Positives = 40/103 (38%), Gaps = 18/103 (17%)
Query: 340 EHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQD 399
E +EE GD E D E + +Q T E++ + + + E
Sbjct: 198 ELAEEMGDDTESEDEEDGDDDQPT-----ENEQEEQGEGEGEGQEGSAPQE--------S 244
Query: 400 ESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRR 442
E+ + E GEEE ++ +D + +DD E+ G R
Sbjct: 245 EATDRESESGEEEMVQS-----DQDDLPDESDDDSETPGEGAR 282
Score = 34.5 bits (79), Expect = 0.11
Identities = 15/79 (18%), Positives = 31/79 (39%), Gaps = 15/79 (18%)
Query: 332 NETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEI 391
++ + E E E+ E + + + ++ T + + E+V
Sbjct: 216 DDDQPTENEQEEQGEGEGEGQEG-------------SAPQESEATDRESESGEEEMVQS- 261
Query: 392 DNDEDKQDESEEDGEEEGE 410
+ +D DES++D E GE
Sbjct: 262 -DQDDLPDESDDDSETPGE 279
Score = 28.8 bits (64), Expect = 6.4
Identities = 13/59 (22%), Positives = 29/59 (49%), Gaps = 2/59 (3%)
Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
D ++ +V E+ + +E +D E E EE+ +++ +E E++ E + +G
Sbjct: 182 DQQAFARVVREMLRSMELAEEMGDDTESEDEEDGDDDQ--PTENEQEEQGEGEGEGQEG 238
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 40.7 bits (95), Expect = 0.001
Identities = 32/123 (26%), Positives = 52/123 (42%), Gaps = 8/123 (6%)
Query: 323 LVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYD-D 381
L D ++E +E + E D DE+ + + ++ + K K D D D
Sbjct: 213 LKIKDLEGDDEDDGDESDKGGE--DGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDAD 270
Query: 382 ESDIEIVDEIDNDEDKQDESEEDGEEEGEEEAS-----ENEYYTETDEDEKENEEDDGES 436
E D + D+ +ED +S G + E E + E DED +E+EE+ E
Sbjct: 271 EYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEE 330
Query: 437 DGG 439
+GG
Sbjct: 331 EGG 333
Score = 34.2 bits (78), Expect = 0.14
Identities = 24/114 (21%), Positives = 48/114 (42%), Gaps = 9/114 (7%)
Query: 330 LENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVD 389
+++ K + ++ DSD+ D E + ++ + + + + E I
Sbjct: 256 DDDKKGKRGGDDDADEYDSDDGDDEGR-EEDYISDSSASGNDPEEREDKLSPE--IPAKP 312
Query: 390 EIDNDEDKQDESEEDGEEEGEEEASENEYYT------ETDEDEKENEEDDGESD 437
EI+ DED ++ EE EEEG + D+D+ ++ +D +SD
Sbjct: 313 EIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSD 366
Score = 30.3 bits (68), Expect = 2.2
Identities = 20/70 (28%), Positives = 32/70 (45%), Gaps = 15/70 (21%)
Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEE---------------EASENEYYTETDEDEKE 428
D++I D +DED DES++ GE+ EE + + D+D E
Sbjct: 212 DLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADE 271
Query: 429 NEEDDGESDG 438
+ DDG+ +G
Sbjct: 272 YDSDDGDDEG 281
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 40.4 bits (94), Expect = 0.001
Identities = 29/117 (24%), Positives = 52/117 (44%), Gaps = 3/117 (2%)
Query: 324 VRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDES 383
VR + + NE + + + EK D E E + E ++ N + +D
Sbjct: 107 VRVGYYVNNEYPGITKLEKSDVEEPSEKVDEEDVERE-ILAEKPRVTRFNIVWDNDEDND 165
Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDED--EKENEEDDGESDG 438
+ ++E+++ E DG EE E+E ++ Y E + + E+E EE +G DG
Sbjct: 166 EAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDG 222
Score = 35.4 bits (81), Expect = 0.046
Identities = 21/117 (17%), Positives = 50/117 (42%), Gaps = 3/117 (2%)
Query: 325 RSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESD 384
R + +N+ +E ++ D++E+ E + + E E + + + + E +
Sbjct: 153 RFNIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGRE---EEEDEEVGSDSYGEGNRELN 209
Query: 385 IEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRR 441
E +E + +D +D + +GE +++ E E E + E++ S+ R
Sbjct: 210 EEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVPR 266
Score = 28.8 bits (64), Expect = 5.0
Identities = 27/97 (27%), Positives = 41/97 (42%), Gaps = 5/97 (5%)
Query: 337 EEQEHSEEKGDSDEKYDPE--HFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDND 394
EE+E EE +E+ D E + RE E + + D E ++ E +
Sbjct: 177 EEEERLEESDGREEEEDEEVGSDSYGEGNRE-LNEEEEEEAEGSDDGEDVVD--YEGERI 233
Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
+ KQ E EE EE E E + ++E + NEE
Sbjct: 234 DKKQGEEEEMEEEVINLFEIEWEEESPSEEVPRNNEE 270
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 40.9 bits (96), Expect = 0.001
Identities = 21/78 (26%), Positives = 41/78 (52%)
Query: 361 QVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYT 420
+ ++ + ++K + D S +EI D +D+++E EE+ EEE E+E E+
Sbjct: 355 TLNKQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSD 414
Query: 421 ETDEDEKENEEDDGESDG 438
+ + +E + E +SDG
Sbjct: 415 DEEFEEDDVESKYEDSDG 432
Score = 28.6 bits (64), Expect = 7.9
Identities = 13/67 (19%), Positives = 29/67 (43%)
Query: 363 TRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTET 422
T ++Y + + + + +E + +E+++DE E +E ++E E +
Sbjct: 367 TEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESK 426
Query: 423 DEDEKEN 429
ED N
Sbjct: 427 YEDSDGN 433
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 38.4 bits (89), Expect = 0.001
Identities = 17/75 (22%), Positives = 30/75 (40%)
Query: 363 TRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTET 422
T AE + + ++ + D D D+ E++ E + EEE E T
Sbjct: 53 TEAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETAD 112
Query: 423 DEDEKENEEDDGESD 437
+ D + E D+ E +
Sbjct: 113 EADAEAEEADEEEDE 127
Score = 29.9 bits (67), Expect = 1.0
Identities = 13/45 (28%), Positives = 24/45 (53%)
Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
+ +E+E +E +EA+E + E DE+E+ +E D +D
Sbjct: 70 AEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEA 114
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 39.4 bits (92), Expect = 0.002
Identities = 15/77 (19%), Positives = 35/77 (45%), Gaps = 1/77 (1%)
Query: 361 QVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYT 420
+V R+ + + D + +D E +E ++D + ++ ++ EEE S +E
Sbjct: 197 RVVRDMLSSMDMAEELGDEPESADSE-DNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSD 255
Query: 421 ETDEDEKENEEDDGESD 437
+ E+ + E + E+
Sbjct: 256 ASSEEMESGEMEAAEAS 272
Score = 32.5 bits (74), Expect = 0.34
Identities = 13/44 (29%), Positives = 23/44 (52%), Gaps = 4/44 (9%)
Query: 396 DKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
D +EE G+E ++ +NE ++D KE+E+DD +
Sbjct: 204 SSMDMAEELGDEPESADSEDNE----DEDDPKEDEDDDQGEEEE 243
Score = 31.3 bits (71), Expect = 0.79
Identities = 12/71 (16%), Positives = 26/71 (36%), Gaps = 5/71 (7%)
Query: 365 ENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDE 424
A+S+ N++ D ++ D D+ +E+ + + E E E E
Sbjct: 216 PESADSEDNEDEDDPKEDED----DDQGEEEESGSSDSLSEDSDASSEEME-SGEMEAAE 270
Query: 425 DEKENEEDDGE 435
++ D +
Sbjct: 271 ASADDTPDSDD 281
Score = 31.3 bits (71), Expect = 0.91
Identities = 22/92 (23%), Positives = 37/92 (40%), Gaps = 14/92 (15%)
Query: 336 KEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDE 395
+ +EE GD E D E E +E+ + DD+ + E + +D
Sbjct: 203 LSSMDMAEELGDEPESADSEDNEDEDDPKED-----------EDDDQGEEE--ESGSSDS 249
Query: 396 DKQD-ESEEDGEEEGEEEASENEYYTETDEDE 426
+D ++ + E GE EA+E D D+
Sbjct: 250 LSEDSDASSEEMESGEMEAAEASADDTPDSDD 281
Score = 29.8 bits (67), Expect = 2.9
Identities = 15/53 (28%), Positives = 26/53 (49%), Gaps = 5/53 (9%)
Query: 386 EIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
+++ +D E+ DE E E+ E+E E DED+ + EE++ S
Sbjct: 201 DMLSSMDMAEELGDEPESADSEDNEDEDDPKE-----DEDDDQGEEEESGSSD 248
>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
Length = 534
Score = 39.6 bits (92), Expect = 0.003
Identities = 20/55 (36%), Positives = 29/55 (52%), Gaps = 4/55 (7%)
Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEY-YTETDEDEKENEEDDG---ESDGG 439
D D+DED DE E E E+ S++ Y Y+ T +++ + DD ESD G
Sbjct: 448 DRADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSG 502
>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
Provisional.
Length = 337
Score = 38.6 bits (90), Expect = 0.004
Identities = 11/55 (20%), Positives = 23/55 (41%), Gaps = 2/55 (3%)
Query: 390 EIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRGS 444
E +++ E+D + + + S + ++++ D NEEDD S
Sbjct: 278 EAIDEDSDYSSDEDDDDYDSYD--SSDSASSDSNSDVDTNEEDDRGEKESNGAKS 330
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 38.8 bits (91), Expect = 0.005
Identities = 25/103 (24%), Positives = 50/103 (48%), Gaps = 2/103 (1%)
Query: 330 LENETKKEEQEHSEEKGDSDEK--YDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEI 387
L++E ++ + D D+ Y + +NQ ++ + + + D DD+ D E
Sbjct: 96 LKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDED 155
Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENE 430
DE D+D+D DE EE E + E+ S+++ + ++D +
Sbjct: 156 DDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALR 198
Score = 38.4 bits (90), Expect = 0.007
Identities = 24/108 (22%), Positives = 53/108 (49%), Gaps = 6/108 (5%)
Query: 333 ETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEID 392
+ + + + +E+K D+ D + + V + + + D DD+ DI+ D+ +
Sbjct: 97 KDELDSSKKAEKKNALDKDDDLNYVKDIDVL-NQADDDDDDDDDDDLDDD-DIDDDDDDE 154
Query: 393 NDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGR 440
+D++ D+ + D E+E ++EA E E D+ + D+ +S+ R
Sbjct: 155 DDDEDDDDDDVDDEDEEKKEAKE----LEKLSDDDDFVWDEDDSEALR 198
Score = 34.6 bits (80), Expect = 0.095
Identities = 18/98 (18%), Positives = 49/98 (50%), Gaps = 1/98 (1%)
Query: 329 SLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
L++ K E++ ++ D + D + ++ + + + D DD+ + +
Sbjct: 99 ELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDE 158
Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDE 426
D+ D+D D +DE +++ +E E+ + ++++ + D+ E
Sbjct: 159 DDDDDDVDDEDEEKKE-AKELEKLSDDDDFVWDEDDSE 195
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 39.0 bits (90), Expect = 0.005
Identities = 33/110 (30%), Positives = 61/110 (55%), Gaps = 6/110 (5%)
Query: 329 SLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
++E ++ ++E+ EE ++ E+YD E+ E + E E +N +YD+E+ EI
Sbjct: 1014 NIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIE 1073
Query: 389 DEID-NDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
+ I+ N E+ +E+ E+ EE EE EN +E+ +EN E++ E +
Sbjct: 1074 ENIEENIEENVEENVEENVEEIEENVEEN-----VEENAEENAEENAEEN 1118
Score = 34.4 bits (78), Expect = 0.13
Identities = 32/110 (29%), Positives = 55/110 (50%), Gaps = 3/110 (2%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
EN + E+ E ++ E+YD E+ E + E Y E + + ++ + I +
Sbjct: 1001 ENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEEN 1060
Query: 391 IDN-DEDKQDESEEDGEEEGEEEASEN--EYYTETDEDEKENEEDDGESD 437
I+ DE+ +E EE+ EE EE EN E E +E+ +EN E++ E +
Sbjct: 1061 IEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEEN 1110
Score = 29.0 bits (64), Expect = 5.9
Identities = 26/116 (22%), Positives = 44/116 (37%), Gaps = 11/116 (9%)
Query: 327 DFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIE 386
D S + + E+EHS + + + D EH E + +S +E E
Sbjct: 429 DGSEDESVEDNEEEHSGDANEEELSVD-EHVEEHNADDSGEQQSDDESGEHQSVNEIVEE 487
Query: 387 ----------IVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED 432
V +I E + EE +E EE+ + +E+ E E+ EE+
Sbjct: 488 QSVNEHVEEPTVADIVEQETVDEHVEEPAVDENEEQQTADEHVEEPTIAEEHVEEE 543
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 38.1 bits (89), Expect = 0.006
Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 369 ESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKE 428
+ K D+ + + E+V ++ED ++ E+ EEE E++ SE+E E DEDE+E
Sbjct: 253 KEVIKKKGGDFKVKGEPEVVG--GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEE 310
Query: 429 NEEDDGESD 437
E+DD
Sbjct: 311 EEDDDEGDK 319
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 38.1 bits (89), Expect = 0.006
Identities = 21/106 (19%), Positives = 45/106 (42%), Gaps = 18/106 (16%)
Query: 330 LENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVD 389
LE ++E ++ E+G + + + +DE + +
Sbjct: 75 LEKWKEEERKKKEAEQGLESD-----------------DDDDEEEEWEVEEDEDSDDEGE 117
Query: 390 EIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
ID + DK+ ES D E+E E++ + + ++DE+ E +E++
Sbjct: 118 WIDVESDKEIESS-DSEDEEEKDEAAKKAKEDSDEELSEEDEEEAA 162
Score = 37.7 bits (88), Expect = 0.009
Identities = 30/108 (27%), Positives = 49/108 (45%), Gaps = 19/108 (17%)
Query: 330 LENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDE-----SD 384
E E KK+E E E D D++ + E + SD + E SD
Sbjct: 79 KEEERKKKEAEQGLESDDDDDEEEEWEVEED--------------EDSDDEGEWIDVESD 124
Query: 385 IEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED 432
EI DE+++DE+ + +E+ +EE SE + +E+E E E++
Sbjct: 125 KEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172
Score = 36.6 bits (85), Expect = 0.019
Identities = 23/60 (38%), Positives = 32/60 (53%), Gaps = 1/60 (1%)
Query: 377 SDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGES 436
SD DD+ + E E D D D + E D E + E E+S++E E DE K+ +ED E
Sbjct: 94 SDDDDDEEEEWEVEEDEDSDDEGE-WIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEE 152
Score = 33.1 bits (76), Expect = 0.24
Identities = 15/60 (25%), Positives = 31/60 (51%)
Query: 379 YDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
D +E++++ +E K+ E+E+ E + +++ E E ++ + E E D ESD
Sbjct: 66 VDGIPGLELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDK 125
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 38.7 bits (90), Expect = 0.007
Identities = 26/130 (20%), Positives = 51/130 (39%), Gaps = 12/130 (9%)
Query: 310 LRKDYMQSRLTSILVRSDFSLE-NETKKEEQEHSEEKGDS-DEKYDPEHFEMNQVTRENY 367
LR +Y+QS + I S + ++E +H+ +S +NQ R++
Sbjct: 29 LRSNYLQSTVEDIEGPSAVDERTSGVLRDEGKHANILYNSILCNQKKHASFLNQ--RKSL 86
Query: 368 AESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEK 427
+ ++ Y+D+ D D ES D + E + + T+ + D+
Sbjct: 87 DDDDDDEFDFLYEDDED-------DAGNATSGESSTDDDSLLELPDRDEDADTQANNDQ- 138
Query: 428 ENEEDDGESD 437
N+ D +S
Sbjct: 139 TNDFDQDDSS 148
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 38.3 bits (90), Expect = 0.008
Identities = 17/113 (15%), Positives = 40/113 (35%), Gaps = 1/113 (0%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
E + E + E + + + + + E++ + ++E D E
Sbjct: 117 EAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERD-ERRRR 175
Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRG 443
D ++ + + + E ++ ++ + E+ G DGG RRG
Sbjct: 176 GDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRG 228
Score = 33.7 bits (78), Expect = 0.21
Identities = 16/110 (14%), Positives = 34/110 (30%), Gaps = 9/110 (8%)
Query: 336 KEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYD-DESDIEIVDEIDND 394
+ E + E + E E E + ++ D ++ E E
Sbjct: 133 RGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAE--AERGER 190
Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRGS 444
+++ + + + + + D E+ D G+ G RRR
Sbjct: 191 GRREERGRDGDDRDRRDRREQG------DRREERGRRDGGDRRGRRRRRD 234
Score = 32.2 bits (74), Expect = 0.61
Identities = 23/115 (20%), Positives = 42/115 (36%), Gaps = 13/115 (11%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
T++EE++ +GD +++ E + R E + + D D +
Sbjct: 161 AERTEEEERDERRRRGDREDRQA----EAERGERGRREERGRDGDDRDRRDRRE------ 210
Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTE--TDEDEKENEEDDGESDGGRRRG 443
D ++ + G+ G + ED + + DDGE GGRR
Sbjct: 211 -QGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGR 264
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 38.0 bits (88), Expect = 0.009
Identities = 13/106 (12%), Positives = 25/106 (23%), Gaps = 14/106 (13%)
Query: 337 EEQEHSEEKGDSDEKYD-PEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDE 395
+ H+ + P H E+ E + D + +
Sbjct: 151 PPESHNPSPNQQPSSFLQPSH--------EDSPEEPEPPTSEPEPDSPGPPQSETPTSSP 202
Query: 396 DKQDESEEDGEEEGE-----EEASENEYYTETDEDEKENEEDDGES 436
Q +E GE + + + DE + E
Sbjct: 203 PPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPPFP 248
Score = 36.1 bits (83), Expect = 0.030
Identities = 14/126 (11%), Positives = 27/126 (21%), Gaps = 4/126 (3%)
Query: 316 QSRLTSILVRSDFSLENETKKE-EQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNK 374
+SR R + G ++E N + + +
Sbjct: 78 ESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELAS-GLSPENTSGSSPESPASHSP 136
Query: 375 NTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED-- 432
S E ++ + + E+ E E T E +
Sbjct: 137 PPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSE 196
Query: 433 DGESDG 438
S
Sbjct: 197 TPTSSP 202
Score = 33.4 bits (76), Expect = 0.24
Identities = 20/119 (16%), Positives = 33/119 (27%), Gaps = 13/119 (10%)
Query: 331 ENETKKEEQEHSE--EKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
E E++E + G E + + E A + +NTS ES
Sbjct: 77 EESRHGEKEERGQGGPSGSGSESVGSPTPSPSG-SAEELASGLSPENTSGSSPESPAS-- 133
Query: 389 DEIDND--------EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
E ES + + + +E E E + +S G
Sbjct: 134 HSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGP 192
Score = 31.5 bits (71), Expect = 0.90
Identities = 16/110 (14%), Positives = 30/110 (27%), Gaps = 2/110 (1%)
Query: 330 LENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVD 389
L E S S P H ++ N+ S + S + +
Sbjct: 118 LSPENTSGSSPESPA-SHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPE 176
Query: 390 EIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
E + + E + G + E S + DE + ++
Sbjct: 177 EPE-PPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSP 225
Score = 31.1 bits (70), Expect = 1.1
Identities = 15/107 (14%), Positives = 19/107 (17%), Gaps = 15/107 (14%)
Query: 337 EEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDED 396
E S + + P ES
Sbjct: 121 ENTSGSSPESPASHSPPPS-------------PPSHPGPHEPAPPESHNP--SPNQQPSS 165
Query: 397 KQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRG 443
S ED EE E SE E + + G
Sbjct: 166 FLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPG 212
Score = 28.4 bits (63), Expect = 8.7
Identities = 11/82 (13%), Positives = 23/82 (28%), Gaps = 3/82 (3%)
Query: 361 QVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDG--EEEGEEEASENEY 418
Q R+ ++++T + + + E + E + EE S
Sbjct: 64 QGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENT 123
Query: 419 YTETDEDEKENEEDDGE-SDGG 439
+ E + S G
Sbjct: 124 SGSSPESPASHSPPPSPPSHPG 145
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 37.5 bits (87), Expect = 0.011
Identities = 29/133 (21%), Positives = 58/133 (43%), Gaps = 4/133 (3%)
Query: 308 KLLRKDYMQSRLTSILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENY 367
+LL +++ L SI+ D L ETK ++ + ++ S +K E + +
Sbjct: 118 RLLNSKPVKNVLESIMEGLDKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESES 177
Query: 368 AESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESE--EDGEEEGEEEASENEYYTETDED 425
+ K+ +D+SD E ++ D+++ Q + + +EE EEA Y +T E
Sbjct: 178 EDES--KSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSES 235
Query: 426 EKENEEDDGESDG 438
E + + +
Sbjct: 236 ESDESDSEISESR 248
Score = 30.6 bits (69), Expect = 1.8
Identities = 23/124 (18%), Positives = 51/124 (41%), Gaps = 4/124 (3%)
Query: 318 RLTSILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTS 377
T ++KK+E + S +K D +E + + + ++ + + + S
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDS 201
Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
+ + D +VD D +E ++ S E+ E E+ E ++++ E + D ES
Sbjct: 202 EDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDE----SDSEISESRSVSDSEESS 257
Query: 438 GGRR 441
+
Sbjct: 258 PPSK 261
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 37.5 bits (87), Expect = 0.013
Identities = 14/64 (21%), Positives = 32/64 (50%)
Query: 374 KNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
K++++YD E++ V E+ + ++++ + + EE E E +E + ED+
Sbjct: 371 KDSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDE 430
Query: 434 GESD 437
E +
Sbjct: 431 FEDE 434
Score = 30.1 bits (68), Expect = 2.4
Identities = 14/79 (17%), Positives = 33/79 (41%)
Query: 336 KEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDE 395
K+ E+ +E D+ E + + E + ++++ + + + DE
Sbjct: 371 KDSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDE 430
Query: 396 DKQDESEEDGEEEGEEEAS 414
+ ++ E+ + E EEEA
Sbjct: 431 FEDEDELEEAQPEEEEEAR 449
>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1. This
family consists of several acidic phosphoprotein
precursor PCEMA1 sequences which appear to be found
exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
that is associated with the membrane of the infected
erythrocyte throughout the entire intraerythrocytic
cycle. The exact function of this family is unclear.
Length = 286
Score = 36.8 bits (85), Expect = 0.014
Identities = 24/110 (21%), Positives = 42/110 (38%), Gaps = 14/110 (12%)
Query: 333 ETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEID 392
T E+ E EE + D E + + + +N + + E + D +
Sbjct: 183 STLHEDDELDEEVTSYLNELDDE-------VTSYFNDGENEENDDELEAEVISYLKDGEN 235
Query: 393 NDEDKQ------DESEEDGEEEGEEEAS-ENEYYTETDEDEKENEEDDGE 435
++E K+ E + D E E E+E E + E++ ED GE
Sbjct: 236 DNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEEEAGEEQENEDKGE 285
Score = 30.6 bits (69), Expect = 1.7
Identities = 20/78 (25%), Positives = 41/78 (52%), Gaps = 4/78 (5%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNK-NTSDYDDESDIEIVD 389
+ E ++ + E E + + ++ E+ + R Y E K +K NT++ + E + E D
Sbjct: 212 DGENEENDDELEAEVISYLKDGENDN-EVKEKIRREYREWKGDKANTNETEIEDESE--D 268
Query: 390 EIDNDEDKQDESEEDGEE 407
E + + ++ E+E+ GEE
Sbjct: 269 EYEEEAGEEQENEDKGEE 286
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 37.0 bits (86), Expect = 0.015
Identities = 16/64 (25%), Positives = 29/64 (45%)
Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGR 440
D D E VDE +++E++Q E + EE + E ++ + + + D ES +
Sbjct: 363 DPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADK 422
Query: 441 RRGS 444
S
Sbjct: 423 ESAS 426
Score = 37.0 bits (86), Expect = 0.018
Identities = 17/85 (20%), Positives = 39/85 (45%), Gaps = 4/85 (4%)
Query: 355 EHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEAS 414
E ++ + +DE + E +++E++ ++SEE+G + E+ +S
Sbjct: 351 ESKMRDKRRARLDPIDFEEVD----EDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSS 406
Query: 415 ENEYYTETDEDEKENEEDDGESDGG 439
E+ +D + K ++E +SD
Sbjct: 407 ESSSDVGSDSESKADKESASDSDSE 431
Score = 28.9 bits (65), Expect = 5.7
Identities = 19/112 (16%), Positives = 36/112 (32%), Gaps = 38/112 (33%)
Query: 324 VRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDES 383
R+ + + +E E EE+ SDE
Sbjct: 358 RRARLDPIDFEEVDEDEDEEEEQRSDE--------------------------------- 384
Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
E + ED ++E + E+ E +S+ +E+ D++ + D E
Sbjct: 385 -----HEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 37.3 bits (86), Expect = 0.016
Identities = 20/66 (30%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 376 TSDYDDESDIEIVDEIDNDEDKQDESEEDGE-EEGEEEASENEYYTETDEDEKENEEDDG 434
+ D DES+ E+ + + +D+ DE++ED E +E E+ SE+E ++ ++E + D+
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDEL 986
Query: 435 ESDGGR 440
ES
Sbjct: 987 ESKAAY 992
Score = 34.6 bits (79), Expect = 0.13
Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 4/73 (5%)
Query: 370 SKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKEN 429
++ + S+Y+ SD E DE D DE+ + SE+ E+E E ++S+ E + DE E +
Sbjct: 932 DESEEEVSEYEASSDDE-SDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESKA 990
Query: 430 EEDDGESDGGRRR 442
D G+RR
Sbjct: 991 AYDS---RPGKRR 1000
Score = 33.8 bits (77), Expect = 0.22
Identities = 22/76 (28%), Positives = 37/76 (48%), Gaps = 6/76 (7%)
Query: 346 GDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDG 405
G DE + E + E A S + +D D+ESD E +++ DE + D S+E+
Sbjct: 926 GSDDESDESE-----EEVSEYEASSDDESDETDEDEESD-ESSEDLSEDESENDSSDEED 979
Query: 406 EEEGEEEASENEYYTE 421
E+ +E S+ Y +
Sbjct: 980 GEDWDELESKAAYDSR 995
>gnl|CDD|197651 smart00320, WD40, WD40 repeats. Note that these repeats are
permuted with respect to the structural repeats (blades)
of the beta propeller domain.
Length = 40
Score = 32.7 bits (75), Expect = 0.019
Identities = 7/27 (25%), Positives = 15/27 (55%)
Query: 189 WVNGISFLPNSDKVAVCSKQGYVRVYD 215
V ++F P+ +A S G ++++D
Sbjct: 14 PVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 30.7 bits (70), Expect = 0.11
Identities = 10/40 (25%), Positives = 16/40 (40%)
Query: 264 KGKPCYKYKGFAGCVKDIACPLNENLILSVSLDRHFRVHD 303
G+ KG G V +A + + S S D ++ D
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 28.0 bits (63), Expect = 1.0
Identities = 6/24 (25%), Positives = 12/24 (50%)
Query: 132 CMKHNKHSNIIATGGKENDLKLWD 155
+ + +A+G + +KLWD
Sbjct: 17 SVAFSPDGKYLASGSDDGTIKLWD 40
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 37.0 bits (85), Expect = 0.019
Identities = 25/114 (21%), Positives = 50/114 (43%), Gaps = 1/114 (0%)
Query: 329 SLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
S+ K E++ +E+ S E + E E + + + E D+++
Sbjct: 221 SVSVVLKDEKELARQERVSSWENFKEEPGEPLSRPALKKEKQGAEEEGEEGMSEEDLDVG 280
Query: 389 D-EIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRR 441
EI+N E + + E+ EE EA + E ++ ++ E +E+D E++ R
Sbjct: 281 AAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENER 334
>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat.
Length = 39
Score = 32.7 bits (75), Expect = 0.022
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 189 WVNGISFLPNSDKVAVCSKQGYVRVYD 215
V ++F P+ + +A S G VRV+D
Sbjct: 13 PVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 28.9 bits (65), Expect = 0.48
Identities = 5/24 (20%), Positives = 14/24 (58%)
Query: 132 CMKHNKHSNIIATGGKENDLKLWD 155
+ + N++A+G + +++WD
Sbjct: 16 SVAFSPDGNLLASGSDDGTVRVWD 39
Score = 28.5 bits (64), Expect = 0.72
Identities = 15/39 (38%), Positives = 18/39 (46%)
Query: 265 GKPCYKYKGFAGCVKDIACPLNENLILSVSLDRHFRVHD 303
GK KG G V +A + NL+ S S D RV D
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 36.3 bits (84), Expect = 0.022
Identities = 24/101 (23%), Positives = 43/101 (42%), Gaps = 10/101 (9%)
Query: 346 GDSDEKYDPEHFEM-NQVTRENYAESKTNKNTS---DYDDESDIEIVDEIDNDED----- 396
E D + N + +S +KN S Y+ ES + D +
Sbjct: 185 PGEPEPEDDPKDSLGNGSSTNGLPDSSQDKNKSLEEYYEKESSDAAASQDDGPKGSDVKN 244
Query: 397 KQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
K+ + EED +++G+ E E + +E+E E EE++ + D
Sbjct: 245 KKSDDEEDDDQDGDY-VEEKELKEDEEEEETEEEEEEEDED 284
Score = 34.0 bits (78), Expect = 0.13
Identities = 26/97 (26%), Positives = 41/97 (42%), Gaps = 13/97 (13%)
Query: 321 SILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYD 380
S S +++ K E+ + +E D+ D + KN D
Sbjct: 202 SSTNGLPDSSQDKNKSLEEYYEKESSDAAASQDDGP------------KGSDVKNKKSDD 249
Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENE 417
+E D + D ++ E K+DE EE+ EE EEE E+E
Sbjct: 250 EEDDDQDGDYVEEKELKEDE-EEEETEEEEEEEDEDE 285
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 36.7 bits (85), Expect = 0.022
Identities = 23/113 (20%), Positives = 51/113 (45%), Gaps = 4/113 (3%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
E E+ K + +E + + E + + + E ++S+ + + ++ + E
Sbjct: 245 EEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGE-RSDSEEETEEKEKEKRKRLKKMME 303
Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRG 443
++++++ + E EE E E E + +E+++E DGGRRRG
Sbjct: 304 DEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTV---SPDGGRRRG 353
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 37.0 bits (86), Expect = 0.023
Identities = 20/110 (18%), Positives = 44/110 (40%), Gaps = 11/110 (10%)
Query: 336 KEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEI---- 391
K E E++ E+ D + E + + K + +D+ + + +
Sbjct: 321 KSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQ 380
Query: 392 DNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRR 441
+ K++E++ + EE E E +E ++E E+ + + GRR
Sbjct: 381 RAEARKKEENDAEIEELRRELEGE-------EESDEEENEEPSKKNVGRR 423
Score = 32.3 bits (74), Expect = 0.51
Identities = 33/140 (23%), Positives = 59/140 (42%), Gaps = 20/140 (14%)
Query: 303 DFHSKKLLRKDYMQSRLTSILVRSDFSLENETKKEEQE-----HSEEKGDSDEKYDPEHF 357
D + LL +MQ R++ + E E +E EE+ D +E +P
Sbjct: 367 DEENSGLLSMKFMQ--------RAEARKKEENDAEIEELRRELEGEEESDEEENEEPSK- 417
Query: 358 EMNQVTRENYAESKTNKNTSD--YDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASE 415
V R + K E+ E ++ ++DE+++ E EE+ + E+ A++
Sbjct: 418 --KNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKV--EKVANK 473
Query: 416 NEYYTETDEDEKENEEDDGE 435
+E + E+E EE D E
Sbjct: 474 LLKRSEKAQKEEEEEELDEE 493
Score = 30.4 bits (69), Expect = 2.5
Identities = 19/80 (23%), Positives = 26/80 (32%), Gaps = 4/80 (5%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPE---HFEMNQVTRENYAESKTNKNTSDYDDESDIEI 387
EN K + D K N++++ K K D D I
Sbjct: 493 ENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKS-IDLDDDLI 551
Query: 388 VDEIDNDEDKQDESEEDGEE 407
+E D DE +ED EE
Sbjct: 552 DEEDSIKLDVDDEEDEDDEE 571
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 35.2 bits (81), Expect = 0.027
Identities = 20/95 (21%), Positives = 44/95 (46%), Gaps = 4/95 (4%)
Query: 343 EEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESE 402
E G + EK E + + S ++ +++ + DE D+++++E +
Sbjct: 3 EFGGGAPEKKPEEEKKDENLLEHVKITSWDKEDIIKENED----VKDEKQEDDEEEEEED 58
Query: 403 EDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
E+ EE E+ E E + +E+E++ E++ D
Sbjct: 59 EEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKD 93
Score = 33.3 bits (76), Expect = 0.11
Identities = 21/89 (23%), Positives = 42/89 (47%), Gaps = 3/89 (3%)
Query: 347 DSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGE 406
+ ++ EH ++ +E+ K N++ D E D E +E D +E ++ E ED E
Sbjct: 16 EKKDENLLEHVKITSWDKEDI--IKENEDVKDEKQEDD-EEEEEEDEEEIEEPEDIEDEE 72
Query: 407 EEGEEEASENEYYTETDEDEKENEEDDGE 435
E E+E E E + + + +++ +
Sbjct: 73 EIVEDEEEEEEDEEDNVDLKDIEKKNIND 101
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 35.4 bits (82), Expect = 0.027
Identities = 16/61 (26%), Positives = 32/61 (52%)
Query: 377 SDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGES 436
SD +E + ++ D++ ++E + +E+ EE SE E T ++E+ + E + E
Sbjct: 7 SDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEAEAEF 66
Query: 437 D 437
D
Sbjct: 67 D 67
Score = 33.9 bits (78), Expect = 0.077
Identities = 10/47 (21%), Positives = 25/47 (53%)
Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
E ++D+ ++DE + +E+ E E + ++DE+ + E++
Sbjct: 3 SESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQI 49
Score = 31.2 bits (71), Expect = 0.53
Identities = 15/47 (31%), Positives = 24/47 (51%), Gaps = 3/47 (6%)
Query: 392 DNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
E + D+ EED E E+E E+ + +E + ++E D ESD
Sbjct: 1 SGSESESDDGEEDEELPEEDEDDESS---DEEEVDLPDDEQDEESDS 44
Score = 30.0 bits (68), Expect = 1.4
Identities = 11/57 (19%), Positives = 30/57 (52%)
Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
S+ E D +++E +++ +++ +E E + ++E E+D +E++ E +
Sbjct: 1 SGSESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEE 57
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 35.9 bits (83), Expect = 0.027
Identities = 17/80 (21%), Positives = 42/80 (52%)
Query: 358 EMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENE 417
E+ + R + ++ + D D ++++ +E D D+D++DE + EE + ++
Sbjct: 62 ELEEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADD 121
Query: 418 YYTETDEDEKENEEDDGESD 437
+ +D D +++ DD +S+
Sbjct: 122 SDSSSDSDSSDDDSDDDDSE 141
Score = 32.8 bits (75), Expect = 0.29
Identities = 24/104 (23%), Positives = 42/104 (40%), Gaps = 4/104 (3%)
Query: 329 SLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
++E+ K + S E + D++ D + + N ++ S D SD +
Sbjct: 79 AIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSDS--SSDSDSSDDDSD 136
Query: 389 DEIDNDEDKQ--DESEEDGEEEGEEEASENEYYTETDEDEKENE 430
D+ DE E E+ +E EE+ E E +E +E E
Sbjct: 137 DDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180
>gnl|CDD|206365 pfam14197, Cep57_CLD_2, Centrosome localisation domain of PPC89.
The N-terminal region of the fission yeast spindle pole
body protein PPC89 has low similarity to the human Cep57
protein. The CLD or centrosome localisation domain of
Cep57 and PPC89 is found at the N-terminus. This region
localises to the centrosome internally to gamma-tubulin,
suggesting that it is either on both centrioles or on a
centromatrix component. This N-terminal region can also
multimerise with the N-terminus of other Cep57
molecules. The C-terminal part, Family Cep57_MT_bd,
pfam06657, is the microtubule-binding region of Cep57
and PPC89.
Length = 69
Score = 33.1 bits (76), Expect = 0.028
Identities = 13/54 (24%), Positives = 21/54 (38%), Gaps = 2/54 (3%)
Query: 294 SLDRHFRVHDFHSKKLLRKDYMQSRLTSILVRS--DFSLENETKKEEQEHSEEK 345
S R VH+ K+L R+ R + + ENE ++E E +
Sbjct: 16 SATRKVAVHEIELKRLNRERDSALRQLGVAYLECQELKAENEALRKELEELRAQ 69
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 36.4 bits (84), Expect = 0.030
Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 6/67 (8%)
Query: 371 KTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENE 430
K ++N S + + E E D D+ ++ + +ED E E+ SE+E ED +E+E
Sbjct: 633 KADENKSRHQQLFEGE---EEDEDDLEETDDDEDECEAIEDSESESE---SDGEDGEEDE 686
Query: 431 EDDGESD 437
++D
Sbjct: 687 QEDDAEA 693
Score = 34.8 bits (80), Expect = 0.086
Identities = 15/56 (26%), Positives = 31/56 (55%), Gaps = 3/56 (5%)
Query: 369 ESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDE 424
E + + +D DDE + E ++ D++ + + + E+ E+E E++A NE D+
Sbjct: 650 EDEDDLEETD-DDEDECEAIE--DSESESESDGEDGEEDEQEDDAEANEGVVPIDK 702
Score = 34.1 bits (78), Expect = 0.16
Identities = 21/55 (38%), Positives = 29/55 (52%), Gaps = 6/55 (10%)
Query: 387 IVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEK---ENEEDDGESDG 438
I+D + DE+K EGEEE ++ T+ DEDE E+ E + ESDG
Sbjct: 628 ILDVLKADENKSRHQ---QLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDG 679
Score = 31.4 bits (71), Expect = 1.1
Identities = 20/84 (23%), Positives = 36/84 (42%), Gaps = 11/84 (13%)
Query: 362 VTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQ-DESEEDGEEEGEEEASENEYYT 420
V + + +S+ + +++ D + D DE + ++SE + E +GE+
Sbjct: 631 VLKADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDG-------- 682
Query: 421 ETDEDEKENE--EDDGESDGGRRR 442
E DE E + E E D RR
Sbjct: 683 EEDEQEDDAEANEGVVPIDKAVRR 706
Score = 29.5 bits (66), Expect = 5.1
Identities = 18/108 (16%), Positives = 37/108 (34%), Gaps = 7/108 (6%)
Query: 327 DFSLENETKKEEQEHSEEKGDSDEKYD--PEHFEMNQVTRENYAESKTNKNTSDYDDESD 384
D E + E E E G+ DE+ D + + + + K + D
Sbjct: 663 DECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKA--VRRALPKVLNLPDALDG 720
Query: 385 IEIVDEIDNDEDKQDESEEDGEE---EGEEEASENEYYTETDEDEKEN 429
+ DE D+++ + + E +E + + +K+N
Sbjct: 721 GDSEDEEGMDDEQMMRLDTYLAQIFKEKKERIQAGGETKKEAQSQKQN 768
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 35.5 bits (82), Expect = 0.035
Identities = 15/67 (22%), Positives = 33/67 (49%)
Query: 371 KTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENE 430
K E + +I +++ E K E E + +E +E+ E E E ++++ +++
Sbjct: 133 SKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDD 192
Query: 431 EDDGESD 437
+DD + D
Sbjct: 193 DDDDDDD 199
Score = 32.4 bits (74), Expect = 0.30
Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 8/55 (14%)
Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEK----ENEEDDGESDGG 439
+ D DE+ + + EE+ EEE E+E ++ + D+D+ EN D+GE D
Sbjct: 164 EAEDVDEEDEKDEEEEEEEEEEDE----DFDDDDDDDDDDYNAENYFDNGEDDDY 214
Score = 30.9 bits (70), Expect = 1.0
Identities = 14/57 (24%), Positives = 32/57 (56%), Gaps = 3/57 (5%)
Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEAS-ENEYYTETDEDEKENEEDDGESDGG 439
++ +E + DE++++E EE+ E+ +++ +++Y E D E+DD + D
Sbjct: 165 AEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDDYNAENYFD--NGEDDDYDDDEA 219
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 35.0 bits (80), Expect = 0.045
Identities = 15/49 (30%), Positives = 32/49 (65%), Gaps = 1/49 (2%)
Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
++ID ED++++ E E+ EE++ ++E+ ++DE E E DD +++
Sbjct: 149 EKIDEREDQEEQEREREEQTIEEQSDDSEHEI-IEQDESETESDDDKTE 196
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 34.7 bits (80), Expect = 0.046
Identities = 24/86 (27%), Positives = 42/86 (48%), Gaps = 7/86 (8%)
Query: 359 MNQVTRENYA--ESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEE-----DGEEEGEE 411
++++ E A E K K D + DE+D DEDK++E ++ D E + E+
Sbjct: 84 LDEIDEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDED 143
Query: 412 EASENEYYTETDEDEKENEEDDGESD 437
E + ++DE + +EDD E +
Sbjct: 144 EDDDEIVEILIEDDEVDEDEDDDEDE 169
Score = 32.4 bits (74), Expect = 0.22
Identities = 27/89 (30%), Positives = 42/89 (47%), Gaps = 10/89 (11%)
Query: 333 ETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTS---------DYDDES 383
E +E Q +E+K + + + FE +Y E K + D +DE
Sbjct: 86 EIDEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDED 145
Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEE 412
D EIV+ + D D+ DE E+D E+E +EE
Sbjct: 146 DDEIVEILIED-DEVDEDEDDDEDEEDEE 173
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 35.0 bits (81), Expect = 0.049
Identities = 18/50 (36%), Positives = 25/50 (50%)
Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
DE +E E+D E E EEE E E ++ D+ E + E D E +G
Sbjct: 25 DEFFWTYLLFEEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEG 74
Score = 31.2 bits (71), Expect = 0.91
Identities = 15/38 (39%), Positives = 21/38 (55%)
Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENE 417
+E + E + D D+ + DE E D EEEGE+E E
Sbjct: 45 IEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREE 82
Score = 28.9 bits (65), Expect = 5.4
Identities = 17/65 (26%), Positives = 33/65 (50%)
Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
D + + +E D++E + +E EE+ E + + + SE++ DE+E E E E
Sbjct: 25 DEFFWTYLLFEEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREERL 84
Query: 438 GGRRR 442
++R
Sbjct: 85 KKKKR 89
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 35.7 bits (82), Expect = 0.052
Identities = 20/107 (18%), Positives = 49/107 (45%), Gaps = 1/107 (0%)
Query: 330 LENETKKEEQEHSEEKGDSDEKYDP-EHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
LE +EE +E D D E + ++ E + +Y+D D +
Sbjct: 204 LEATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEK 263
Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
D+ + D + +++ ++E ++ ++++ E +ED+++ ++ D E
Sbjct: 264 DKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEE 310
Score = 35.0 bits (80), Expect = 0.080
Identities = 22/108 (20%), Positives = 43/108 (39%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
E+ + + SE D + D + + E + D +DE+D+ E
Sbjct: 86 EHIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESE 145
Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
++ DE+E+D E++ EEE + E + ++ + G D
Sbjct: 146 SSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDK 193
Score = 34.2 bits (78), Expect = 0.16
Identities = 28/132 (21%), Positives = 55/132 (41%), Gaps = 14/132 (10%)
Query: 324 VRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHF--------------EMNQVTRENYAE 369
+ D + ++E KK E+E + E+ +S E+ E EMN+ AE
Sbjct: 151 LSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNEFLEATEAE 210
Query: 370 SKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKEN 429
+ D ++ + ++ +DED +E+ +EEG E + E D+ +
Sbjct: 211 EEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAG 270
Query: 430 EEDDGESDGGRR 441
++ + E D +
Sbjct: 271 DDAELEDDEPDK 282
Score = 33.0 bits (75), Expect = 0.29
Identities = 20/112 (17%), Positives = 49/112 (43%)
Query: 327 DFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIE 386
D E+E+ E+ E + D ++K + E + + E K + D ++
Sbjct: 139 DLFNESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLD 198
Query: 387 IVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
++E + ++E+ E++ E+ ++ + DED E+++ + +G
Sbjct: 199 EMNEFLEATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEG 250
Score = 32.7 bits (74), Expect = 0.50
Identities = 14/60 (23%), Positives = 27/60 (45%)
Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
+D SD +D D+ +D+++E E++ E+ + + E E+ DD D
Sbjct: 101 EDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDE 160
Score = 32.3 bits (73), Expect = 0.54
Identities = 14/87 (16%), Positives = 38/87 (43%)
Query: 340 EHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQD 399
+ ++K D+ + + E E ++ + A+SK + + D++ D + +E +
Sbjct: 261 KEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKV 320
Query: 400 ESEEDGEEEGEEEASENEYYTETDEDE 426
+ +E E + E+ + E + +
Sbjct: 321 KLDEPVLEGVDLESPKELSSFEKRQAK 347
Score = 31.1 bits (70), Expect = 1.5
Identities = 19/107 (17%), Positives = 42/107 (39%), Gaps = 7/107 (6%)
Query: 335 KKEEQEHSEEKGDSDEKYDPEHFEMNQ---VTRENYAESKTNKNTSDYDDESDIEIVDEI 391
E+ + E+ G +++ D E + + + K + ++ +D+ + +
Sbjct: 228 DSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKK 287
Query: 392 DNDEDKQDESEEDGEEEG----EEEASENEYYTETDEDEKENEEDDG 434
+ D ++E EED E+E EE + DE E + +
Sbjct: 288 EADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLES 334
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 34.5 bits (80), Expect = 0.065
Identities = 12/49 (24%), Positives = 25/49 (51%)
Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDE 426
D + + I+++ D++D ++D ++E E+E E E D +E
Sbjct: 31 DPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79
Score = 33.7 bits (78), Expect = 0.11
Identities = 18/58 (31%), Positives = 29/58 (50%), Gaps = 7/58 (12%)
Query: 383 SDIEIVDEID-NDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
SD+ I ID N + E EE+ +++ D+DE E+E+D+ E+D G
Sbjct: 23 SDL-ISGFIDPNAAAAAATAAAIESELDEEDLEDDD-----DDDEDEDEDDEEEADLG 74
Score = 30.2 bits (69), Expect = 1.5
Identities = 10/39 (25%), Positives = 19/39 (48%)
Query: 375 NTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEA 413
N + + + + D + D+ +ED +E+ EEEA
Sbjct: 33 NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 35.2 bits (82), Expect = 0.069
Identities = 13/56 (23%), Positives = 25/56 (44%), Gaps = 3/56 (5%)
Query: 385 IEIVDEIDNDEDKQDE---SEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
+E D ++N E + E D E + +E D++++E EED+ +
Sbjct: 153 LEWYDRLENGERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDS 208
Score = 30.5 bits (70), Expect = 1.8
Identities = 9/42 (21%), Positives = 17/42 (40%)
Query: 385 IEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDE 426
I++V+E + +D E + E+ E + E E
Sbjct: 57 IQVVEEAPDADDLLLAENEADAQTDEDAEEEAAAALSSVESE 98
Score = 30.5 bits (70), Expect = 2.0
Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 7/56 (12%)
Query: 382 ESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
+ + + ID D +EED G E ++ DEDE+E E+++ +S
Sbjct: 161 NGERRLRELIDGFVD--PNAEEDPAHVGSELEELDD-----DEDEEEEEDENDDSL 209
Score = 29.8 bits (68), Expect = 3.1
Identities = 10/45 (22%), Positives = 18/45 (40%)
Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKE 428
D + + D E+ +++ +EE E+E DE E
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESE 215
Score = 28.6 bits (65), Expect = 8.2
Identities = 17/65 (26%), Positives = 26/65 (40%), Gaps = 4/65 (6%)
Query: 380 DDESDIEIVDEID---NDEDKQ-DESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
+D D E +++I ND Q E D ++ E + E E+E E
Sbjct: 37 EDIVDSEQIEDIIAMLNDMGIQVVEEAPDADDLLLAENEADAQTDEDAEEEAAAALSSVE 96
Query: 436 SDGGR 440
S+ GR
Sbjct: 97 SEIGR 101
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 35.0 bits (81), Expect = 0.077
Identities = 16/71 (22%), Positives = 26/71 (36%), Gaps = 3/71 (4%)
Query: 373 NKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED 432
+ + + + + E D +K DE E+D E+ G E + D EE
Sbjct: 315 EQPQGKDEKDLEEKPE-EPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPD--PQEEA 371
Query: 433 DGESDGGRRRG 443
DG+ G
Sbjct: 372 DGQGSSTDPAG 382
Score = 33.5 bits (77), Expect = 0.23
Identities = 7/54 (12%), Positives = 20/54 (37%)
Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRGS 444
+++D +++ EE G + + + E++ DD + +
Sbjct: 319 GKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDPQEEAD 372
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 34.7 bits (80), Expect = 0.081
Identities = 14/36 (38%), Positives = 22/36 (61%), Gaps = 4/36 (11%)
Query: 377 SDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEE 412
D DDE IE ++D+ +D D S+EDGE+ + +
Sbjct: 257 DDDDDEDAIE--SDLDDSDD--DVSDEDGEDLFDTD 288
Score = 29.7 bits (67), Expect = 3.4
Identities = 14/64 (21%), Positives = 30/64 (46%), Gaps = 5/64 (7%)
Query: 374 KNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
K + I +D ID+D++ ++D E+ E + +++ D+ E+ ED
Sbjct: 230 KKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSD-----DDVSDEDGEDL 284
Query: 434 GESD 437
++D
Sbjct: 285 FDTD 288
>gnl|CDD|182306 PRK10207, PRK10207, dipeptide/tripeptide permease B; Provisional.
Length = 489
Score = 34.8 bits (80), Expect = 0.083
Identities = 13/33 (39%), Positives = 18/33 (54%)
Query: 158 TTKNIFSAKNVSHRVLVYSVKPDEFDLRRPVWV 190
T+ N F+ NV H +L +S+ P F P WV
Sbjct: 290 TSLNFFAINNVHHEILGFSINPVSFQALNPFWV 322
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 34.6 bits (79), Expect = 0.11
Identities = 28/111 (25%), Positives = 50/111 (45%), Gaps = 14/111 (12%)
Query: 321 SILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYD 380
S LV+S +E+++ E + E SD D E E ++ + + D
Sbjct: 692 SALVKSRPDVEDDSDDSELD-FAEDDFSDSTSDDE-----PKLDAIDDEDAKSEGSQESD 745
Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
E E +DEI D + ++ + E EE+ S +E +++E+EN+E
Sbjct: 746 QE---EGLDEIFYSFDGEQDNSDSFAESSEEDES-----SEEEKEEEENKE 788
Score = 33.1 bits (75), Expect = 0.37
Identities = 28/111 (25%), Positives = 54/111 (48%), Gaps = 13/111 (11%)
Query: 325 RSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESD 384
++ F ++E + E + K D + D + E++ +AE + +TSD D
Sbjct: 676 KASFDSDDEMDENEIWSALVKSRPDVEDDSDDSELD------FAEDDFSDSTSD-----D 724
Query: 385 IEIVDEIDNDEDKQDESEEDGEEEGEEEA--SENEYYTETDEDEKENEEDD 433
+D ID+++ K + S+E +EEG +E S + +D + +EED+
Sbjct: 725 EPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEEDE 775
Score = 30.0 bits (67), Expect = 3.2
Identities = 22/93 (23%), Positives = 39/93 (41%), Gaps = 12/93 (12%)
Query: 350 EKYDPEHFEMNQVTRENYAESK-TNKNTSDYDDESDIEIVDEIDNDED-----KQDESEE 403
E PE Q + A+ K N + +D + DE+D +E K E
Sbjct: 649 EDIKPEDKFFYQYFTKKTADGKGKKSNKASFDSD------DEMDENEIWSALVKSRPDVE 702
Query: 404 DGEEEGEEEASENEYYTETDEDEKENEEDDGES 436
D ++ E + +E+++ T +DE + + D E
Sbjct: 703 DDSDDSELDFAEDDFSDSTSDDEPKLDAIDDED 735
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 34.2 bits (78), Expect = 0.11
Identities = 19/113 (16%), Positives = 44/113 (38%), Gaps = 18/113 (15%)
Query: 322 ILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDD 381
D S+ + K ++ + D D+ Y K N +++ Y D
Sbjct: 219 TTETKDLSMSTRSYKFGRQTRGFRSDDDQGYG---------------RYKQNLSSNKYGD 263
Query: 382 ESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDG 434
+ E D+DE ++ E+ +++ +N+ +T+ + + + D+G
Sbjct: 264 FKGVNYGYESDDDEGSSSNDYDEEEDGDDDDNEDND---DTNTNHGDEDCDNG 313
>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
Reviewed.
Length = 129
Score = 32.7 bits (75), Expect = 0.12
Identities = 16/59 (27%), Positives = 28/59 (47%)
Query: 352 YDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGE 410
Y + + + E + K K DD D +I+ + D DE+ DE +++ EE+ E
Sbjct: 71 YPVDEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129
Score = 32.7 bits (75), Expect = 0.13
Identities = 12/55 (21%), Positives = 32/55 (58%)
Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
DE D EI+ + + K+ + + ++ +++ ++ + E D DE+++E+++ E
Sbjct: 74 DEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDE 128
Score = 31.5 bits (72), Expect = 0.29
Identities = 13/62 (20%), Positives = 28/62 (45%)
Query: 364 RENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETD 423
R Y + ++ +++ D + +D D+D D+ ++ EE+ E + E D
Sbjct: 68 RSWYPVDEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEED 127
Query: 424 ED 425
E+
Sbjct: 128 EE 129
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 34.4 bits (79), Expect = 0.13
Identities = 19/81 (23%), Positives = 36/81 (44%), Gaps = 4/81 (4%)
Query: 367 YAESKTNKNTSDYDDESDI-EIVDEIDN--DEDKQDESEEDGEEEGEEEASENEYYT-ET 422
Y E K+N + S + + E ++ + + + + E D EE E +A E + E
Sbjct: 353 YEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEEN 412
Query: 423 DEDEKENEEDDGESDGGRRRG 443
++ +KE ++D E R
Sbjct: 413 EKKQKEQADEDKEKRQKDERK 433
Score = 31.3 bits (71), Expect = 1.0
Identities = 11/69 (15%), Positives = 27/69 (39%)
Query: 367 YAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDE 426
A +K + D S + + + K + + E +E + E + E++
Sbjct: 347 LALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEK 406
Query: 427 KENEEDDGE 435
+ EE++ +
Sbjct: 407 LKQEENEKK 415
>gnl|CDD|181969 PRK09584, tppB, putative tripeptide transporter permease; Reviewed.
Length = 500
Score = 34.3 bits (79), Expect = 0.13
Identities = 11/33 (33%), Positives = 21/33 (63%)
Query: 158 TTKNIFSAKNVSHRVLVYSVKPDEFDLRRPVWV 190
T+ N F+ +NV H +L +V+P+++ P W+
Sbjct: 297 TSLNFFAIRNVEHSILGIAVEPEQYQALNPFWI 329
>gnl|CDD|233189 TIGR00924, yjdL_sub1_fam, amino acid/peptide transporter
(Peptide:H+ symporter), bacterial. The model describes
proton-dependent oligopeptide transporters in bacteria.
This model is restricted in its range in recognizing
bacterial proton-dependent oligopeptide transporters,
although they are found in yeast, plants and animals.
They function by proton symport in a 1:1 stoichiometry,
which is variable in different species. All of them are
predicted to contain 12 transmembrane domains, for which
limited experimental evidence exists [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 475
Score = 34.0 bits (78), Expect = 0.15
Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 1/39 (2%)
Query: 153 LWD-LNTTKNIFSAKNVSHRVLVYSVKPDEFDLRRPVWV 190
L+ + T+ N F+ N+ H +L SV F P WV
Sbjct: 287 LYAQMPTSLNFFADNNMHHEMLGMSVPVIWFQSLNPFWV 325
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 33.2 bits (76), Expect = 0.19
Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 367 YAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASEN 416
A+ T ++ DD D + DE +ED+ + +EE+ +EEGE +AS +
Sbjct: 30 SAQDLTEDEEAE-DDVVDEDEEDEAVVEEDENELTEEEEDEEGEVKASPD 78
Score = 31.7 bits (72), Expect = 0.78
Identities = 14/40 (35%), Positives = 23/40 (57%)
Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKE 428
D +++E + D +ED E+E E ENE E +++E E
Sbjct: 33 DLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 33.6 bits (77), Expect = 0.23
Identities = 13/52 (25%), Positives = 26/52 (50%)
Query: 386 EIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
E V+ N+E+ ++E E G + + + DEDE+++EE + +
Sbjct: 3 EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAK 54
Score = 32.5 bits (74), Expect = 0.59
Identities = 11/44 (25%), Positives = 24/44 (54%)
Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
E++ + + E+E E EA +++D+DE E++D + +
Sbjct: 2 EEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEED 45
Score = 31.7 bits (72), Expect = 0.97
Identities = 16/54 (29%), Positives = 28/54 (51%)
Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRR 442
+E N + ++E EE+ E S+++ EDE E+EEDD E++ +
Sbjct: 2 EEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKA 55
Score = 29.4 bits (66), Expect = 4.3
Identities = 20/64 (31%), Positives = 34/64 (53%), Gaps = 4/64 (6%)
Query: 368 AESKTNKNTSDYDDESDIEIVDEI---DNDEDKQDESEEDGEEEGEEEASENEYYTETDE 424
E + N ++ +DE ++E V D+D+D+ +ED +EE +EEA E+ E +
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEA-ESPAKAEISK 59
Query: 425 DEKE 428
EK
Sbjct: 60 REKA 63
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 33.1 bits (75), Expect = 0.26
Identities = 18/87 (20%), Positives = 37/87 (42%)
Query: 327 DFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIE 386
L N+ E ++ +EK E+ + E+ VT ++ E K+ +
Sbjct: 182 REHLWNKAATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSM 241
Query: 387 IVDEIDNDEDKQDESEEDGEEEGEEEA 413
E + +E + ES+ED +E+ + +
Sbjct: 242 ETSESEEEESSESESDEDEDEDNKGKI 268
>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin).
Length = 585
Score = 33.4 bits (76), Expect = 0.27
Identities = 26/125 (20%), Positives = 49/125 (39%), Gaps = 8/125 (6%)
Query: 322 ILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDD 381
I+ + ENE +E+ ++ + +H E + + + K K Y+D
Sbjct: 88 IITEALRQAENEPGGHPRENQPYNLQVEKNFPTDHSEDYETQQWEEEKLKHMKFPRRYED 147
Query: 382 ESD---IEIVDEIDN---DEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
S+ + +EI + Q + EE G+ + + +DE EK ++D E
Sbjct: 148 NSEEKHSKRTNEIVEVFYNPQSQATLKSVFEEVGKLDGQSKQKRERSDEREKSYQDD--E 205
Query: 436 SDGGR 440
D R
Sbjct: 206 DDTYR 210
Score = 33.4 bits (76), Expect = 0.27
Identities = 20/142 (14%), Positives = 38/142 (26%), Gaps = 21/142 (14%)
Query: 323 LVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYA--ESKTNKNTSDYD 380
V + EE + EE S E+ ++ R E + + +
Sbjct: 218 DVVGGEDWNPIEEGEEDQTQEEVKRSKERTHKGRSLPDESKRSGQLGLEDEAEEEKEEKG 277
Query: 381 DESD-----IEIVDEIDNDEDK-QDESEEDGEEEGEEEASENE-------------YYTE 421
ES + N + + ++ + E E E E +
Sbjct: 278 QESRGLSAVQTYLLRWVNARGRGRSQNRAERERSEESEEKELDRASPYQEIEITANLQIP 337
Query: 422 TDEDEKENEEDDGESDGGRRRG 443
+ + + DG GR
Sbjct: 338 PSDLIRMLRKGDGRKPRGRVEE 359
>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
terminus. The C terminus of the plasma membrane Nha1
antiporter plays an important role in the immediate cell
response to hypo-osmotic shock which prevents an
execessive loss of ions and water. This domain is found
with pfam00999.
Length = 430
Score = 33.2 bits (76), Expect = 0.27
Identities = 22/115 (19%), Positives = 36/115 (31%), Gaps = 7/115 (6%)
Query: 329 SLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKT-NKNTSDYDDESDIEI 387
+ N K +E E + P E++ +SK+ K D
Sbjct: 304 QIRNLDPKARREVDLETNAPERVKAPARDEVDTNDSSPLTDSKSEEKLEDDLAHSLLGSE 363
Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRR 442
E + + S + SE+E D D +D+ E+ RRR
Sbjct: 364 ASERRKNREPSSSSPPSRPRSRRD--SEDEDTERDDSDS----DDEEETPAERRR 412
Score = 33.2 bits (76), Expect = 0.28
Identities = 13/88 (14%), Positives = 24/88 (27%)
Query: 346 GDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDG 405
S P+ + + D E+ ++ E
Sbjct: 94 PISQSAISPQRRSSTGENSAESDNTSYGLSKLAEDSENIDVRPVYESDERSGISEEGSRP 153
Query: 406 EEEGEEEASENEYYTETDEDEKENEEDD 433
+ E+E E Y E D+ EN + +
Sbjct: 154 SKLREQEQRPTEAYQEGDQIIIENRQGE 181
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 33.0 bits (75), Expect = 0.31
Identities = 20/71 (28%), Positives = 39/71 (54%), Gaps = 1/71 (1%)
Query: 369 ESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKE 428
E+ ++ + DD+ D + ++ D+D+D +D+ +ED ++ + S + ETD +
Sbjct: 48 EAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNEAGF 107
Query: 429 NEEDDGESDGG 439
+ DD ESD G
Sbjct: 108 ADSDD-ESDDG 117
Score = 30.7 bits (69), Expect = 1.9
Identities = 23/98 (23%), Positives = 38/98 (38%), Gaps = 21/98 (21%)
Query: 333 ETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEID 392
+ +EE E+ + D+ D + E D DD+ D + DE D
Sbjct: 43 KEAEEEAMEEEDDDEEDDDDDDDEDED------------------DDDDDDDEDDEDEDD 84
Query: 393 NDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENE 430
+D D+S D +G E +E + DE + +E
Sbjct: 85 DDSTLHDDSSAD---DGNETDNEAGFADSDDESDDGSE 119
Score = 30.0 bits (67), Expect = 2.6
Identities = 9/41 (21%), Positives = 26/41 (63%)
Query: 397 KQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
++ EED +EE +++ + + + D+D++++E++D +
Sbjct: 47 EEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDS 87
Score = 29.6 bits (66), Expect = 3.4
Identities = 23/98 (23%), Positives = 39/98 (39%), Gaps = 20/98 (20%)
Query: 335 KKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDND 394
KE +E + E+ D DE+ D + D D++ D + DE D D
Sbjct: 42 GKEAEEEAMEEEDDDEEDDDD--------------------DDDEDEDDDDDDDDEDDED 81
Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED 432
ED D + D + ++NE +DE ++ +
Sbjct: 82 EDDDDSTLHDDSSADDGNETDNEAGFADSDDESDDGSE 119
Score = 29.6 bits (66), Expect = 3.8
Identities = 15/43 (34%), Positives = 28/43 (65%)
Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
E++ E E+D EE+ +++ E+E + D+DE + +EDD +S
Sbjct: 46 EEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDST 88
Score = 29.6 bits (66), Expect = 3.9
Identities = 13/52 (25%), Positives = 29/52 (55%)
Query: 386 EIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
E +E +ED +E ++D ++E E++ +++ + DED+ ++ D S
Sbjct: 44 EAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSA 95
Score = 29.2 bits (65), Expect = 4.5
Identities = 13/31 (41%), Positives = 21/31 (67%)
Query: 407 EEGEEEASENEYYTETDEDEKENEEDDGESD 437
+E EEEA E E E D+D+ ++E++D + D
Sbjct: 43 KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDD 73
Score = 28.4 bits (63), Expect = 8.4
Identities = 12/35 (34%), Positives = 23/35 (65%)
Query: 403 EDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
++ EEE EE ++E + D+DE E+++DD + +
Sbjct: 43 KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDE 77
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 32.8 bits (75), Expect = 0.39
Identities = 15/48 (31%), Positives = 27/48 (56%)
Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
I + E + +++ DE EE+ +EE +EE + E ++E+E EE
Sbjct: 22 PIYLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE 69
Score = 30.1 bits (68), Expect = 2.4
Identities = 14/46 (30%), Positives = 26/46 (56%), Gaps = 5/46 (10%)
Query: 382 ESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEK 427
E + E+ DE + +E ++ + EE+ + EEE E E +++EK
Sbjct: 30 EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDE-----EEEKEEK 70
Score = 29.7 bits (67), Expect = 3.7
Identities = 16/62 (25%), Positives = 29/62 (46%), Gaps = 1/62 (1%)
Query: 367 YAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDE 426
+ E + K D ++E + E E + ++ E E D EEE EE+ + + ET +
Sbjct: 26 WVEKEVEKEVPDEEEEEEKEEKKE-EEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEW 84
Query: 427 KE 428
+
Sbjct: 85 EL 86
>gnl|CDD|226246 COG3723, RecT, Recombinational DNA repair protein (RecE pathway)
[DNA replication, recombination, and repair].
Length = 276
Score = 32.1 bits (73), Expect = 0.44
Identities = 17/56 (30%), Positives = 26/56 (46%), Gaps = 3/56 (5%)
Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGES 436
DE++ ++ +ED + + D E GE EA E + E D E +E ES
Sbjct: 224 DEAERREPLDVTPEEDSE---QLDSEPSGEPEAQETDEENEKPIDVSEEKEPITES 276
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 32.0 bits (73), Expect = 0.45
Identities = 9/59 (15%), Positives = 25/59 (42%), Gaps = 1/59 (1%)
Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGES 436
+ D + ++ + + ++E + E+ EE + E E E+ E + ++ +
Sbjct: 4 ECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEE-IIEDSEESNEVKIEELKD 61
Score = 31.3 bits (71), Expect = 0.75
Identities = 14/52 (26%), Positives = 25/52 (48%), Gaps = 3/52 (5%)
Query: 389 DEIDNDEDKQDESEEDGEEEGE---EEASENEYYTETDEDEKENEEDDGESD 437
E + + K + EED +E E E+ + E + +++E ED ES+
Sbjct: 1 MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESN 52
Score = 31.3 bits (71), Expect = 0.84
Identities = 20/85 (23%), Positives = 37/85 (43%), Gaps = 5/85 (5%)
Query: 355 EHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEE--EGEEE 412
E EN E +N + E D ++++ +E +++E ED EE E + E
Sbjct: 1 MEKECKDAKHENMEEDCCKENE---NKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIE 57
Query: 413 ASENEYYTETDEDEKENEEDDGESD 437
++E +E++K E + D
Sbjct: 58 ELKDENNKLKEENKKLENELEALKD 82
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 30.3 bits (68), Expect = 0.52
Identities = 19/59 (32%), Positives = 34/59 (57%)
Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
DDE + E + D++E++ + EE+GEEE E E + + E +ED+ E ++ + D
Sbjct: 48 DDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQKTDEDD 106
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 32.3 bits (73), Expect = 0.53
Identities = 25/80 (31%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 348 SDEKYDPEH---FEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDE---S 401
+++K D E+ E + +N E T K + +D +IE V+E++ E++ D+ S
Sbjct: 78 ANDKSDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDS 137
Query: 402 EEDGEEEGEEEASENEYYTE 421
E+D E+E + E ENE E
Sbjct: 138 EKDDEKESDAEGDENELAGE 157
Score = 29.3 bits (65), Expect = 4.7
Identities = 22/68 (32%), Positives = 39/68 (57%), Gaps = 8/68 (11%)
Query: 379 YDDESDIEIV---DEIDNDEDK--QDESEEDGEEEGEEEASENEYYTET---DEDEKENE 430
D+E+D+E+ + N+E++ Q E EED + E EE E E E DED ++++
Sbjct: 82 SDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDD 141
Query: 431 EDDGESDG 438
E + +++G
Sbjct: 142 EKESDAEG 149
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 32.0 bits (73), Expect = 0.63
Identities = 23/94 (24%), Positives = 32/94 (34%), Gaps = 7/94 (7%)
Query: 351 KYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV-DEIDNDEDKQDESEEDGEEEG 409
+ F EN ES N + D++ ++ D ED DE EDG + G
Sbjct: 305 GSLDDPFLKAPEDTENEDESDDNAD-DFDDNDKKGDVGFDINAEVEDDPDEDAEDGGDGG 363
Query: 410 EEEASENEYYT-----ETDEDEKENEEDDGESDG 438
+ E E E D E + D D
Sbjct: 364 DAEEFEAEKEPMLKVCRVDRQVTEIIDGDEPVDY 397
>gnl|CDD|107300 cd06305, PBP1_methylthioribose_binding_like,
Methylthioribose-binding protein-like of ABC-type
transport systems that belong to a family of
pentose/hexose sugar-binding proteins of the type I
periplasmic binding protein (PBP1) superfamily.
Methylthioribose-binding protein-like of ABC-type
transport systems that belong to a family of
pentose/hexose sugar-binding proteins of the type I
periplasmic binding protein (PBP1) superfamily, which
consists of two alpha/beta globular domains connected by
a three-stranded hinge. This Venus flytrap-like domain
undergoes transition from an open to a closed
conformational state upon ligand binding. The
sugar-binding domain of the periplasmic proteins in this
group is also homologous to the ligand-binding domain of
eukaryotic receptors such as metabotropic glutamate
receptor (mGluR), DNA-binding transcriptional repressors
such as LacI and GalR.
Length = 273
Score = 31.9 bits (73), Expect = 0.63
Identities = 16/57 (28%), Positives = 23/57 (40%), Gaps = 3/57 (5%)
Query: 224 PVLKVDIPDVALSCITTTNQSHYVIVGSTTGDMMAVDLRGKGKPCYKY-KGFAGCVK 279
PV+ D+ D + T Q Y + + D + DL GKG Y GF +
Sbjct: 82 PVVAFDV-DSDNPKVNNTTQDDYSL-ARLSLDQLVKDLGGKGNVGYVNVAGFPPLDR 136
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 32.1 bits (73), Expect = 0.64
Identities = 15/102 (14%), Positives = 37/102 (36%), Gaps = 16/102 (15%)
Query: 334 TKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEI---VDE 390
++K Q+++ +K DS T + + TS+ +E ++ +
Sbjct: 23 SQKRIQQNNGDKEDS-------------STSTSSLSVSAVEKTSNAKEEIQVDFQHNSES 69
Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED 432
+ + +DE E + + ++S D ++D
Sbjct: 70 AVEEVEAEDEIEVEQNQSDVLKSSSIVKEESISTDMDGIDDD 111
>gnl|CDD|216194 pfam00922, Phosphoprotein, Vesiculovirus phosphoprotein.
Length = 283
Score = 31.8 bits (72), Expect = 0.65
Identities = 13/69 (18%), Positives = 28/69 (40%), Gaps = 2/69 (2%)
Query: 370 SKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKEN 429
S+ + +Y + EI+ E+++ + +E+ EE + YY + + E
Sbjct: 1 SRLRQYLKNYPRLDQT--LSEIEEMEEQRADKSSTFQEDSVEEHTSPSYYLAEELSDSET 58
Query: 430 EEDDGESDG 438
E + G
Sbjct: 59 EPSIEDDQG 67
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 32.0 bits (73), Expect = 0.75
Identities = 23/127 (18%), Positives = 43/127 (33%), Gaps = 31/127 (24%)
Query: 330 LENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVD 389
+ +KEEQE + D D+ + ++ K K D I
Sbjct: 324 DSPKKRKEEQEDDDFVEDDDDLQ-------ASLAKQRRLAQKKRKKLRPED------IAR 370
Query: 390 EIDNDEDKQDESEEDGEEEG------------------EEEASENEYYTETDEDEKENEE 431
+I + + +E + + G EE+ + E D +E++E
Sbjct: 371 QIAEERSEDEEEAAENNDNGIVIDETSEFVRSLQKEPLEEKPENKDESVEEISDAEEDDE 430
Query: 432 DDGESDG 438
D+ + DG
Sbjct: 431 DEEDEDG 437
Score = 30.1 bits (68), Expect = 2.8
Identities = 32/148 (21%), Positives = 55/148 (37%), Gaps = 27/148 (18%)
Query: 313 DYMQSRLTSILVRSDFSLENETKKEEQE-HSEEKGDSDEKYDPEHFEMNQVTRENY---A 368
D + SIL + D +E + KK + + G +D++ + + E+ + + N
Sbjct: 196 DDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLD 255
Query: 369 ESKTNKNTSDYDDESDIEIV--------------DEIDNDE---------DKQDESEEDG 405
+ T SDY D S++ ++D DE S +D
Sbjct: 256 DDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDV 315
Query: 406 EEEGEEEASENEYYTETDEDEKENEEDD 433
EEE + E ED+ E+DD
Sbjct: 316 EEENARLEDSPKKRKEEQEDDDFVEDDD 343
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 31.2 bits (71), Expect = 0.80
Identities = 18/67 (26%), Positives = 27/67 (40%)
Query: 365 ENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDE 424
EN + T+ +S E + E E+K S E + E E E E +E
Sbjct: 115 ENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEEE 174
Query: 425 DEKENEE 431
+ KE E+
Sbjct: 175 EVKEPED 181
Score = 30.0 bits (68), Expect = 2.2
Identities = 22/131 (16%), Positives = 42/131 (32%), Gaps = 34/131 (25%)
Query: 337 EEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAES--KTNKNTSDYDD------------- 381
E +++ +K D + E N T + KT D
Sbjct: 45 EAEDNLLDKAVDDILSNHYLKEGNGTTSVAIGDGMEKTIHGDVKESDPMSFITNPSLTVS 104
Query: 382 ------ESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYT-------------ET 422
ESD+E +++ D + +E+ ++ E E + E
Sbjct: 105 VPIASGESDLENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEE 164
Query: 423 DEDEKENEEDD 433
+E+E EE++
Sbjct: 165 EEEEIAEEEEE 175
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 32.0 bits (72), Expect = 0.83
Identities = 23/94 (24%), Positives = 42/94 (44%), Gaps = 2/94 (2%)
Query: 340 EHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQD 399
E EE DEK + E E + + +N E + + ++ES E+ E + +E+ +
Sbjct: 590 EGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREENARK 649
Query: 400 ESEEDG--EEEGEEEASENEYYTETDEDEKENEE 431
+ E G E E + + + T+E K E+
Sbjct: 650 KEELRGNFELEERGDPEKKDVDWYTEEKRKIEEQ 683
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 30.4 bits (69), Expect = 0.95
Identities = 14/30 (46%), Positives = 23/30 (76%)
Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASENE 417
+E ++D+D++DE EED EE+ +E+ SE E
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEE 138
Score = 28.1 bits (63), Expect = 6.2
Identities = 13/27 (48%), Positives = 18/27 (66%), Gaps = 2/27 (7%)
Query: 378 DYDDESDIEIVDEIDNDEDKQDESEED 404
DDE D E +E D ++D +DESEE+
Sbjct: 114 SDDDEEDEE--EEDDEEDDDEDESEEE 138
Score = 27.7 bits (62), Expect = 6.9
Identities = 15/35 (42%), Positives = 24/35 (68%), Gaps = 5/35 (14%)
Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEAS 414
+DESD + + DE+++D+ E+D E+E EEE S
Sbjct: 111 EDESDDD-----EEDEEEEDDEEDDDEDESEEEES 140
Score = 27.3 bits (61), Expect = 9.9
Identities = 16/35 (45%), Positives = 24/35 (68%), Gaps = 5/35 (14%)
Query: 397 KQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
++DES++D E+E EE+ E++ DEDE E EE
Sbjct: 110 EEDESDDDEEDEEEEDDEEDD-----DEDESEEEE 139
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 31.6 bits (72), Expect = 0.96
Identities = 26/122 (21%), Positives = 47/122 (38%), Gaps = 4/122 (3%)
Query: 311 RKDYMQSRLTSILVRSDFSLENETKKEEQE-HSEEKGDSDEKYDPEHFEMNQVTRENYAE 369
K+ +++ + R E E E E EE + E E+ + RE AE
Sbjct: 328 LKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAE 387
Query: 370 SKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKEN 429
++ +E + E+ EI++ E++ + E E+ EE E + +E
Sbjct: 388 --LEAELAEIRNELE-ELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEEL 444
Query: 430 EE 431
E
Sbjct: 445 NE 446
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell cycle
progression and biogenesis of 60S ribosomal subunits.
The Schistosoma mansoni Mak16 has been shown to target
protein transport to the nucleolus.
Length = 97
Score = 29.5 bits (66), Expect = 0.99
Identities = 16/41 (39%), Positives = 24/41 (58%)
Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
E ++ E ++ EEE EEE E E +D++E E E +D E
Sbjct: 56 EAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLE 96
Score = 28.3 bits (63), Expect = 2.3
Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 1/49 (2%)
Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED 432
+ E+ ++ EE+ EEE EEE Y DE+ +E ED
Sbjct: 47 SQSAFKKALEAEESEENDEEEEEEE-EEEDEGEIEYVSDDEELEEEIED 94
Score = 27.9 bits (62), Expect = 3.8
Identities = 12/43 (27%), Positives = 24/43 (55%)
Query: 386 EIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKE 428
+ E + D+++E EE+ E+EGE E ++ E + ++ E
Sbjct: 54 ALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLE 96
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 31.6 bits (71), Expect = 1.0
Identities = 22/133 (16%), Positives = 55/133 (41%), Gaps = 25/133 (18%)
Query: 326 SDFSLENETKKEEQE-HSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESD 384
F E+E + + ++ + G S++ + +N + +S + N+ ++E
Sbjct: 60 EAFDEEDEKRFADWSFNASKSGKSNKDHK----NLNNTKEISLNDSDDSVNSDKLENEGS 115
Query: 385 IEIVDE---------IDNDEDKQDESEEDGEEE-----------GEEEASENEYYTETDE 424
+ +DE +DND+ +++ES + ++ E +E++
Sbjct: 116 VSSIDENELVDLDTLLDNDQPEKNESGNNDHATDKENLLESDASSSNDSESEESDSESEI 175
Query: 425 DEKENEEDDGESD 437
+ +++ DD SD
Sbjct: 176 ESSDSDHDDENSD 188
Score = 30.1 bits (67), Expect = 2.8
Identities = 22/113 (19%), Positives = 47/113 (41%), Gaps = 13/113 (11%)
Query: 327 DFSLENETKKEEQEHSEEKGDSDEKYDP---------EHFEMNQVTRENYAESKTNKNTS 377
D N TK+ S++ +SD+ + E +++ + + E + N
Sbjct: 86 DHKNLNNTKEISLNDSDDSVNSDKLENEGSVSSIDENELVDLDTLLDNDQPEKNESGNN- 144
Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENE 430
D +D E + E + D E+ + E E E+S++++ E + + +N
Sbjct: 145 --DHATDKENLLE-SDASSSNDSESEESDSESEIESSDSDHDDENSDSKLDNL 194
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 31.5 bits (71), Expect = 1.1
Identities = 17/106 (16%), Positives = 42/106 (39%), Gaps = 3/106 (2%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDY---DDESDIEI 387
E K++E ++ + D+ + + NQ A+ + KN + D ++
Sbjct: 249 EVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDL 308
Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
E E + ++ E + +++ E A + + E + + +D
Sbjct: 309 KQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTSLNED 354
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 30.0 bits (68), Expect = 1.1
Identities = 11/51 (21%), Positives = 25/51 (49%)
Query: 390 EIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGR 440
E +++++ EED E E E+ ++E+ + + EE+ + D +
Sbjct: 74 EALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDEDKYK 124
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 30.4 bits (69), Expect = 1.2
Identities = 16/52 (30%), Positives = 26/52 (50%)
Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
+E E V+E + +E +DE EE+ EE E +E + E E + +E
Sbjct: 4 KNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDE 55
Score = 28.0 bits (63), Expect = 6.6
Identities = 14/40 (35%), Positives = 19/40 (47%)
Query: 393 NDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED 432
+E ++E EE EE E+ E E E E E +EE
Sbjct: 4 KNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQ 43
>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22. The
mitochondrial protein translocase family, which is
responsible for movement of nuclear encoded pre-proteins
into mitochondria, is very complex with at least 19
components. These proteins include several chaperone
proteins, four proteins of the outer membrane
translocase (Tom) import receptor, five proteins of the
Tom channel complex, five proteins of the inner membrane
translocase (Tim) and three "motor" proteins. This
family represents the Tom22 proteins. The N terminal
region of Tom22 has been shown to have chaperone-like
activity, and the C terminal region faces the
intermembrane face.
Length = 136
Score = 29.9 bits (68), Expect = 1.2
Identities = 12/49 (24%), Positives = 25/49 (51%), Gaps = 1/49 (2%)
Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGES 436
+ E++ DE Q++ EE ++E T+TD D ++ + + E+
Sbjct: 4 LTEVE-DETFQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDSDFENET 51
Score = 28.0 bits (63), Expect = 4.9
Identities = 11/47 (23%), Positives = 25/47 (53%), Gaps = 5/47 (10%)
Query: 355 EHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDES 401
E F+ +N A+ + + D +D++D + +I +D D ++E+
Sbjct: 10 ETFQEKPAAPKNLAQEE---SDDDDEDDTDTD--SDISDDSDFENET 51
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression in
addition to DNA-based checkpoints.
Length = 76
Score = 28.7 bits (65), Expect = 1.2
Identities = 15/36 (41%), Positives = 23/36 (63%), Gaps = 1/36 (2%)
Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEA 413
DYD +SD E +E + ED + E EED EE+ +++
Sbjct: 39 DYDYDSDAEW-EEEEEGEDLESEDEEDEEEDDDDDM 73
Score = 28.0 bits (63), Expect = 1.9
Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 10/49 (20%)
Query: 390 EIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
+D D D E EE EEEGE+ E++++E E E+DD + DG
Sbjct: 37 GLDYDYDSDAEWEE--EEEGED--------LESEDEEDEEEDDDDDMDG 75
>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295). This
family is conserved in fungi but the function is not
known.
Length = 509
Score = 31.1 bits (70), Expect = 1.2
Identities = 14/60 (23%), Positives = 24/60 (40%), Gaps = 5/60 (8%)
Query: 369 ESKTNKNTSDYDDESDIEIVDEIDN-----DEDKQDESEEDGEEEGEEEASENEYYTETD 423
++T + DDE IE ++ + D+D + E+ EE G E + D
Sbjct: 275 VTRTFPERTSDDDEDAIETEEDDVDESAIEDDDDDSDWEDSVEESGRSSVDEKTMFQRVD 334
>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
Length = 325
Score = 30.7 bits (70), Expect = 1.4
Identities = 10/48 (20%), Positives = 22/48 (45%)
Query: 368 AESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASE 415
+ N DE+ +E+ DE E++ +++ EE + A++
Sbjct: 2 TLKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQ 49
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 30.2 bits (68), Expect = 1.5
Identities = 16/85 (18%), Positives = 35/85 (41%), Gaps = 1/85 (1%)
Query: 351 KYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGE 410
K + M + A+ K + + + E +I+ +E ++++ + E E
Sbjct: 19 KLEEAKANMQKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDI 78
Query: 411 EEASENEYYTETDEDEKENEEDDGE 435
E E E +E+++E ED +
Sbjct: 79 AEQQEKE-EIAQEEEKEEEAEDVKQ 102
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 30.8 bits (70), Expect = 1.6
Identities = 17/49 (34%), Positives = 23/49 (46%), Gaps = 4/49 (8%)
Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASENEY----YTETDEDEKENEED 432
V+ DE+++ E EE+ EEE E E E E T ED +D
Sbjct: 342 VNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPPLLTPISEDAPLPNDD 390
Score = 29.6 bits (67), Expect = 3.8
Identities = 15/42 (35%), Positives = 22/42 (52%), Gaps = 11/42 (26%)
Query: 393 NDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDG 434
N E K +E EE +EE EEE +E+ +E E ++G
Sbjct: 343 NPEQKDEE-EEQEDEEEEEE----------EEEPEEPEPEEG 373
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 30.7 bits (70), Expect = 1.6
Identities = 13/54 (24%), Positives = 28/54 (51%), Gaps = 6/54 (11%)
Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
D+EI +E+D+ +D+++E E++ +E+ + +D+ E E D
Sbjct: 215 DVEIQNELDDIDDEEEERVLADEDDDDEDDMFD------MDDDDEEESDPEVER 262
Score = 29.2 bits (66), Expect = 4.7
Identities = 13/37 (35%), Positives = 21/37 (56%)
Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASEN 416
D+E + + DE D+DED + ++D EEE + E
Sbjct: 227 DEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERT 263
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 29.0 bits (64), Expect = 1.7
Identities = 13/34 (38%), Positives = 23/34 (67%), Gaps = 1/34 (2%)
Query: 400 ESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
E EED + + +EE +++ E D+DE ++E+DD
Sbjct: 7 EGEEDSDSDSDEE-EDDDDEDEEDDDEDDDEDDD 39
>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
Length = 1463
Score = 30.9 bits (69), Expect = 1.7
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 413 ASENEYYTETDEDEKENEEDDGESDGGRRRG 443
+S + +E D DEK +ED + GG+RRG
Sbjct: 1267 SSSDSSSSEEDGDEKNEKEDRERAGGGKRRG 1297
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 30.4 bits (68), Expect = 1.8
Identities = 31/118 (26%), Positives = 48/118 (40%), Gaps = 13/118 (11%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDY-----DDESDI 385
E ET EE+ + + G+ DE+ + E+ ++ VT ++ ++
Sbjct: 65 EEETSNEEENNEDSDGNEDEEAEAENTTLSTVTLGYGGDATPGTGNIGLAALQLPKKAGN 124
Query: 386 EIVDEIDNDEDKQDESEEDGEEEGEEEASENEY--------YTETDEDEKENEEDDGE 435
DE +DE EE+ EEE E E ENE TE D + D+GE
Sbjct: 125 AGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGE 182
Score = 30.0 bits (67), Expect = 2.2
Identities = 15/42 (35%), Positives = 22/42 (52%), Gaps = 2/42 (4%)
Query: 399 DESEE--DGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
D SEE DG+ EE E E +ED NE+++ E++
Sbjct: 49 DSSEENGDGDSSEEEGEEETSNEEENNEDSDGNEDEEAEAEN 90
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 30.7 bits (69), Expect = 1.8
Identities = 14/66 (21%), Positives = 19/66 (28%), Gaps = 2/66 (3%)
Query: 377 SDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEE-EASENEYYTETDEDEKEN-EEDDG 434
+ E+DIE D + D E E DE E E E+
Sbjct: 363 VEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAP 422
Query: 435 ESDGGR 440
D +
Sbjct: 423 IPDPAK 428
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 30.4 bits (68), Expect = 2.0
Identities = 15/90 (16%), Positives = 39/90 (43%), Gaps = 1/90 (1%)
Query: 328 FSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEI 387
+ L+ + EE++ E + ++ E + + ++ + N N ++ ES+ E
Sbjct: 44 YDLKKSSSDEEEQDYELRPRVSSSWNNESYNRLPIKTKDNVVADVN-NGEEFLSESESEA 102
Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASENE 417
EID+D + + + ++ E + +
Sbjct: 103 SLEIDSDIKDEKQKSLEEQKIAPEIPVKQQ 132
Score = 30.0 bits (67), Expect = 3.0
Identities = 26/138 (18%), Positives = 49/138 (35%), Gaps = 32/138 (23%)
Query: 325 RSDFSLENET-KKEEQEHSEEKG--------DSDEK---------YDPEHFEMNQVTREN 366
S S E T KK E+E ++E D +K D E + R +
Sbjct: 6 NSQSSTEPRTTKKAERELTQEDIEFFNENPEDLGKKVIYDLKKSSSDEEEQDYELRPRVS 65
Query: 367 YAES---------KTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENE 417
+ + KT N +D+ +E ++ + + E D + + E++ S E
Sbjct: 66 SSWNNESYNRLPIKTKDNV-----VADVNNGEEFLSESESEASLEIDSDIKDEKQKSLEE 120
Query: 418 YYTETDEDEKENEEDDGE 435
+ K+ + + E
Sbjct: 121 QKIAPEIPVKQQIDSEKE 138
>gnl|CDD|218504 pfam05215, Spiralin, Spiralin. This family consists of Spiralin
proteins found in spiroplasma bacteria. Spiroplasmas are
helically shaped pathogenic bacteria related to the
mycoplasmas. The surface of spiroplasma bacteria is
crowded with the membrane-anchored lipoprotein spiralin
whose structure and function are unknown although its
cellular function is thought to be a structural and
mechanical one rather than a catalytic one.
Length = 239
Score = 29.9 bits (67), Expect = 2.3
Identities = 17/39 (43%), Positives = 21/39 (53%), Gaps = 4/39 (10%)
Query: 197 PNSDKVAVCSKQGYVRVYDPKITTQRRPVLKVDIPDVAL 235
P DK V K GY++V PK + V KVDI DV +
Sbjct: 111 PAKDKTVVIGKTGYIKVTLPKNS----EVKKVDISDVTV 145
>gnl|CDD|164860 PHA01625, PHA01625, hypothetical protein.
Length = 249
Score = 29.8 bits (67), Expect = 2.3
Identities = 17/55 (30%), Positives = 34/55 (61%), Gaps = 3/55 (5%)
Query: 21 VNLSKDKCIIKNIQNVKYITNKASVLALNWGNVEESELLIAYSGQKIKIYDTDFK 75
+NL+ + +I++I ++K I L L+ +++ ++LL+ YS I++YD FK
Sbjct: 77 INLTYRQKMIRDIDDLKQILES---LNLSKVSLKTNKLLVKYSNYSIELYDIYFK 128
>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
N-terminal domain. Autophagocytosis is a
starvation-induced process responsible for transport of
cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
ubiquitin like modifier that is topologically similar to
the canonical E2 enzyme. It catalyzes the conjugation of
Atg8 and phosphatidylethanolamine.
Length = 146
Score = 29.2 bits (66), Expect = 2.3
Identities = 10/44 (22%), Positives = 21/44 (47%)
Query: 370 SKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEA 413
+ DD +D E + EI +D+D +S + E++ ++
Sbjct: 103 TTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADEDDDDDIP 146
>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter. The Monovalent
Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
CPA1 family is a large family of proteins derived from
Gram-positive and Gram-negative bacteria, blue green
bacteria, yeast, plants and animals. Transporters from
eukaryotes have been functionally characterized, and all
of these catalyze Na+:H+ exchange. Their primary
physiological functions may be in (1) cytoplasmic pH
regulation, extruding the H+ generated during
metabolism, and (2) salt tolerance (in plants), due to
Na+ uptake into vacuoles. This model is specific for the
fungal members of this family [Transport and binding
proteins, Cations and iron carrying compounds].
Length = 810
Score = 30.3 bits (68), Expect = 2.3
Identities = 22/74 (29%), Positives = 33/74 (44%), Gaps = 7/74 (9%)
Query: 365 ENYAESKTNK-----NTSDYDD-ESDIEIVDEIDNDEDKQDESEEDG-EEEGEEEASENE 417
+ A N+ TS YD ES+I+ V +ND + SE ++ EEE
Sbjct: 589 DKLARDTENRDVTYVPTSRYDGIESEIDDVYTYENDSESIASSERRRIKKLREEEQQAYI 648
Query: 418 YYTETDEDEKENEE 431
YTE ++ EN +
Sbjct: 649 AYTEDNQVIIENRQ 662
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 30.1 bits (67), Expect = 2.4
Identities = 21/93 (22%), Positives = 43/93 (46%), Gaps = 4/93 (4%)
Query: 324 VRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDES 383
++ S++ + Q+ + ++ D + + E E +++T S +++ T +YD
Sbjct: 47 FKTTASVDRYGRPLNQDKATKEIDRLYELENESSESSEITDNEEVASASSELTDEYDPAR 106
Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASEN 416
+ I + + DES E+ EEE E SE
Sbjct: 107 G----EGIISTSESSDESREESEEEKANEISEK 135
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 30.4 bits (68), Expect = 2.4
Identities = 18/80 (22%), Positives = 37/80 (46%), Gaps = 4/80 (5%)
Query: 361 QVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEE----ASEN 416
V++E+ + T +N ++D E D+ +ED + S +DG+ + + S
Sbjct: 92 PVSQESEVQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSG 151
Query: 417 EYYTETDEDEKENEEDDGES 436
+ TD + K+++ G S
Sbjct: 152 STTSGTDLNTKQSQTGLGAS 171
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 30.2 bits (68), Expect = 2.4
Identities = 22/116 (18%), Positives = 41/116 (35%), Gaps = 25/116 (21%)
Query: 331 ENETKKEEQEHS--------EEKGDSDEKYDPEHFEMNQVT-----RENYAESKTN--KN 375
E E +KEEQ DSDE +P E +T ++ E + +
Sbjct: 164 EREKEKEEQPMKPAFKPERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKEERRAAVERR 223
Query: 376 TSDYDDESDIEIVDEIDNDEDKQ----------DESEEDGEEEGEEEASENEYYTE 421
++ + + + + + +Q D+ +E GE+ + E Y T
Sbjct: 224 LAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDSEDGETKPESYITS 279
>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
Length = 852
Score = 30.5 bits (69), Expect = 2.4
Identities = 7/35 (20%), Positives = 14/35 (40%)
Query: 376 TSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGE 410
S+ D E + D+ D+ + EE+ +
Sbjct: 32 GSEDDSEDPASLDDDDDDRDSGGGMLEEEEDGNMR 66
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 29.9 bits (67), Expect = 2.6
Identities = 14/52 (26%), Positives = 22/52 (42%), Gaps = 6/52 (11%)
Query: 383 SDIEIVD--EIDND-EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
+E+V+ E D E+ DE ED E E E E + E + ++
Sbjct: 330 EVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEV---AEMELLEVQVDD 378
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 30.3 bits (68), Expect = 2.6
Identities = 15/57 (26%), Positives = 27/57 (47%), Gaps = 3/57 (5%)
Query: 383 SDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
DI ++++ + +ESE EEE E+ EN++ +D+ E E +D
Sbjct: 365 DDILYFEKMEIENRNPEESEH--EEEVEDYEDENDHSKRICDDD-ELENHFRAADEK 418
Score = 29.2 bits (65), Expect = 5.0
Identities = 30/126 (23%), Positives = 45/126 (35%), Gaps = 11/126 (8%)
Query: 286 NENLILSVSLDRHFRVHDFHSKKLLRKDYMQSRLTSILVRSDFSLENETK-KEEQEHSEE 344
N + + D F L S+ L R+ L NE EE E +
Sbjct: 295 NTTFVWNYMEDNVFHSFSLRFLDNLDFLQFLSKYIGCLWRN---LNNEKWGNEEAERKDY 351
Query: 345 KGDSD------EKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQ 398
DS + D +FE ++ N ES+ + DY+DE+D D+DE +
Sbjct: 352 ILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDEND-HSKRICDDDELEN 410
Query: 399 DESEED 404
D
Sbjct: 411 HFRAAD 416
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
kDa subunit. This family contains several DNA-directed
RNA polymerase 19 kDa polypeptides. The Poxvirus
DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
DNA-template-directed extension of the 3'-end of an RNA
strand by one nucleotide at a time.
Length = 167
Score = 29.3 bits (66), Expect = 2.8
Identities = 13/36 (36%), Positives = 20/36 (55%), Gaps = 1/36 (2%)
Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASE 415
+D DI + D+D ++ +E EED EE+ E S
Sbjct: 2 EDSDDIIDYESDDDDSEEYEEEEED-EEDAESLESS 36
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 29.2 bits (66), Expect = 3.0
Identities = 14/40 (35%), Positives = 19/40 (47%), Gaps = 1/40 (2%)
Query: 398 QDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
E EE+ E +EA EN + DE + E D+ E D
Sbjct: 20 PKEEEEEALELAIQEAHENAL-EKKLLDELDEELDEEEDD 58
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 29.8 bits (67), Expect = 3.0
Identities = 26/111 (23%), Positives = 42/111 (37%), Gaps = 9/111 (8%)
Query: 323 LVRSDFSLENETKKEEQEH--SEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYD 380
++R + E E+E S+ D + + + R + +
Sbjct: 252 VLREIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKMRPVRAKPSDEPSL 311
Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
ESDI +EI ++K SE G G +E E+E DE+ KE E
Sbjct: 312 PESDIH--EEIPKLDEKS-LSEFLGYMGGIDEDDEDE----DDEESKEEVE 355
>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1). This family
consists of several mammalian dentin matrix protein 1
(DMP1) sequences. The dentin matrix acidic
phosphoprotein 1 (DMP1) gene has been mapped to human
chromosome 4q21. DMP1 is a bone and teeth specific
protein initially identified from mineralised dentin.
DMP1 is primarily localised in the nuclear compartment
of undifferentiated osteoblasts. In the nucleus, DMP1
acts as a transcriptional component for activation of
osteoblast-specific genes like osteocalcin. During the
early phase of osteoblast maturation, Ca(2+) surges into
the nucleus from the cytoplasm, triggering the
phosphorylation of DMP1 by a nuclear isoform of casein
kinase II. This phosphorylated DMP1 is then exported out
into the extracellular matrix, where it regulates
nucleation of hydroxyapatite. DMP1 is a unique molecule
that initiates osteoblast differentiation by
transcription in the nucleus and orchestrates
mineralised matrix formation extracellularly, at later
stages of osteoblast maturation. The DMP1 gene has been
found to be ectopically expressed in lung cancer
although the reason for this is unknown.
Length = 514
Score = 30.0 bits (67), Expect = 3.2
Identities = 28/121 (23%), Positives = 58/121 (47%), Gaps = 9/121 (7%)
Query: 329 SLENETKKEEQEHSEEKGDSDEKYDP--EHFEMNQVTRENYAESKTNKNTSDYDDESDIE 386
S EN ++ +E+ SE +GD+ + E E ++ + E+ ++ ++ + ++++D E
Sbjct: 343 SQENSSESQEEVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQSTEEQADSE 402
Query: 387 IVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTET------DEDEKENEEDDGESDGGR 440
+ + + E+ ES ED +E + TE+ E + +EEDD +S
Sbjct: 403 SNESLSSSEESP-ESTEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEEDDSDSQDSS 461
Query: 441 R 441
R
Sbjct: 462 R 462
>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
Length = 130
Score = 28.7 bits (64), Expect = 3.4
Identities = 19/53 (35%), Positives = 24/53 (45%), Gaps = 2/53 (3%)
Query: 392 DNDED--KQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRR 442
ND D +DE E EEG EE SE+ E ENE+ + + R R
Sbjct: 2 YNDMDMGYEDEPSEPEIEEGAEEDSESNNNDEVKGSIGENEDKEEQEPVQRPR 54
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair
of CXXC motifs, followed by a long region of low
complexity sequence in which roughly half the residues
are Asp and Glu, including multiple runs of five or more
acidic residues. The function of members of this family
is unknown.
Length = 104
Score = 28.0 bits (63), Expect = 3.5
Identities = 9/56 (16%), Positives = 29/56 (51%), Gaps = 3/56 (5%)
Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
D + + E ++ +D D+ ++D + +++ + + + D+D E+++D+
Sbjct: 52 DAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLD---DDDDDFLEDDDDE 104
>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2. This
domain family is found in eukaryotes, and is typically
between 138 and 153 amino acids in length. The family is
found in association with pfam00493. Mini-chromosome
maintenance (MCM) proteins are essential for DNA
replication. These proteins use ATPase activity to
perform this function.
Length = 145
Score = 28.8 bits (65), Expect = 3.5
Identities = 19/72 (26%), Positives = 35/72 (48%), Gaps = 10/72 (13%)
Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEE-------EASENEYYTETDEDEKENE 430
+ + + + E+ D+ID D D++ E EEDGE+ + E + Y E D++ +
Sbjct: 1 EDEPDEEEELEDDID-DLDEEAEEEEDGEDLFGDNMERDYRANPELDRYDEDGLDDESDY 59
Query: 431 EDDGESDGGRRR 442
E+ + RR
Sbjct: 60 EELSAAA--RRA 69
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 30.0 bits (68), Expect = 3.6
Identities = 10/111 (9%), Positives = 35/111 (31%)
Query: 324 VRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDES 383
V + K S + + + + + K ++ + +S
Sbjct: 1278 VSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKS 1337
Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDG 434
+ + + + + E ++ ++++++ E++EDD
Sbjct: 1338 KTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388
>gnl|CDD|222800 PHA00458, PHA00458, single-stranded DNA-binding protein.
Length = 233
Score = 29.4 bits (66), Expect = 3.6
Identities = 17/44 (38%), Positives = 26/44 (59%)
Query: 394 DEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
++D DE EEDG E +A + + E DED++E E+D + D
Sbjct: 189 EDDWADEDEEDGYVASEAKARKPQDEEEWDEDDEEESEEDEDGD 232
>gnl|CDD|223042 PHA03322, PHA03322, tegument protein VP11/12; Provisional.
Length = 674
Score = 29.6 bits (66), Expect = 3.8
Identities = 29/146 (19%), Positives = 50/146 (34%), Gaps = 16/146 (10%)
Query: 301 VHDFHSKKLLRKDYMQSRLTSILVRSDFSLENETKKEEQEHSEEK-GDSDEKYDPEHFEM 359
+H K L +SR +LVR S + ++ G +K P ++ +
Sbjct: 487 IHMDSPKSPLSISSFKSRARRVLVRPSTSTDTTRNDPSYWFADGDDGRIRDKTYPLNYTL 546
Query: 360 NQVTREN-YAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEY 418
N+ T E + T+++ D+ ++ D + ES + E E S Y
Sbjct: 547 NKATYEGPVSGPGTSEDGEYVDETNN-------DTATGPKCESPDHVYENIEPAESNYSY 599
Query: 419 YTETDED-------EKENEEDDGESD 437
+TD D D G
Sbjct: 600 ENDTDFDSVASDLGRNHRSTDGGSPV 625
>gnl|CDD|227494 COG5165, POB3, Nucleosome-binding factor SPN, POB3 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 508
Score = 29.6 bits (66), Expect = 3.9
Identities = 16/78 (20%), Positives = 31/78 (39%), Gaps = 3/78 (3%)
Query: 365 ENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQ-DESEEDGEEEGEEE--ASENEYYTE 421
E + SK K ++ E + I + ED + E+ E E E+ S+++ E
Sbjct: 429 EQFLHSKGIKARNEEVQERLQTDLGSISDSEDINMGSAGEEDESEDEDFQMVSDSDVAEE 488
Query: 422 TDEDEKENEEDDGESDGG 439
D ++ + G +
Sbjct: 489 YDLQAALSDAEGGSDEER 506
>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
(DUF2052). This entry is of sequences of two conserved
domains separated by a region of low complexity,
spanning some 200 residues. The function is unknown.
Length = 178
Score = 28.9 bits (65), Expect = 4.0
Identities = 13/49 (26%), Positives = 23/49 (46%)
Query: 387 IVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
+ +D +E + + E+ EEE EE E+ +E+E+ GE
Sbjct: 60 LESSLDREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGE 108
Score = 28.5 bits (64), Expect = 5.6
Identities = 21/71 (29%), Positives = 27/71 (38%), Gaps = 5/71 (7%)
Query: 377 SDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASE-----NEYYTETDEDEKENEE 431
S D E + E +E+ +E EED EEE EEE E E D D +
Sbjct: 62 SSLDREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELP 121
Query: 432 DDGESDGGRRR 442
E + R
Sbjct: 122 TPEEREELREE 132
>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex. This Mediator
complex subunit was formerly known as Srb4 in yeasts or
Trap80 in Drosophila and human. The Med17 subunit is
located within the head domain and is essential for cell
viability to the extent that a mutant strain of
cerevisiae lacking it shows all RNA polymerase
II-dependent transcription ceasing at non-permissive
temperatures.
Length = 454
Score = 29.6 bits (67), Expect = 4.1
Identities = 14/38 (36%), Positives = 19/38 (50%)
Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDE 426
+EI + K D SEE EEE +EE ++ D E
Sbjct: 56 EEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVE 93
>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein. This family consists of several
eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
to suppress GCR2 and is highly expressed in the muscle
and heart. The function of this family is unknown
although it has been speculated that SGT1 may be
functionally analogous to the Gcr2p protein of
Saccharomyces cerevisiae which is known to be a
regulatory factor of glycolytic gene expression.
Length = 557
Score = 29.3 bits (66), Expect = 4.9
Identities = 12/47 (25%), Positives = 26/47 (55%)
Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
DEIDND + ++ ++E ++E ++ + +E E+ ++ D E
Sbjct: 475 DEIDNDLPDDSDDADEDDDEDDDEDEDSSSDSTLEELEEYMDQMDAE 521
Score = 28.5 bits (64), Expect = 7.7
Identities = 25/106 (23%), Positives = 44/106 (41%), Gaps = 9/106 (8%)
Query: 337 EEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDI--EIVDEIDND 394
E+ + ++ D D +D + F +N K ++ +D D+SD E DE D+D
Sbjct: 442 EDDDEPDDSEDKDVSFDEDEFFEFL---KNMLGLKDDEIDNDLPDDSDDADEDDDE-DDD 497
Query: 395 EDKQDESE-EDGEEEGEEEASENEYY--TETDEDEKENEEDDGESD 437
ED+ S+ E E + + E ++ + N G D
Sbjct: 498 EDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADISNSGSSGAED 543
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 29.1 bits (65), Expect = 5.0
Identities = 13/31 (41%), Positives = 20/31 (64%), Gaps = 1/31 (3%)
Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDED 425
E + E E++ EE EEE SE E T++++D
Sbjct: 3 EVLELEEEDESGEE-EEEESEEEEETDSEDD 32
Score = 28.3 bits (63), Expect = 7.3
Identities = 26/109 (23%), Positives = 47/109 (43%), Gaps = 4/109 (3%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFE----MNQVTRENYAESKTNKNTSDYDDESDIE 386
E+E+ +EE+E SEE+ ++D + D E + R E + +E
Sbjct: 10 EDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEEEAKR 69
Query: 387 IVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
+E + K E E E E ++ + E + + + ENEE++ E
Sbjct: 70 KAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDDENEEEEYE 118
>gnl|CDD|147685 pfam05663, DUF809, Protein of unknown function (DUF809). This
family consists of several proteins of unknown function
Raphanus sativus (Radish) and Brassica napus (Rape).
Length = 138
Score = 28.2 bits (62), Expect = 5.1
Identities = 15/36 (41%), Positives = 22/36 (61%)
Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENE 430
E K+++ E GE EG+EE E + E E++KE E
Sbjct: 98 EGKEEKKEGKGEIEGKEEKKEGKGEIEGKEEKKEVE 133
>gnl|CDD|131353 TIGR02300, FYDLN_acid, TIGR02300 family protein. Members of this
family are bacterial proteins with a conserved motif
[KR]FYDLN, sometimes flanked by a pair of CXXC motifs,
followed by a long region of low complexity sequence in
which roughly half the residues are Asp and Glu,
including multiple runs of five or more acidic residues.
The function of members of this family is unknown.
Length = 129
Score = 28.0 bits (62), Expect = 5.2
Identities = 19/92 (20%), Positives = 43/92 (46%), Gaps = 3/92 (3%)
Query: 350 EKYDPEHFEMNQVTRENYAESKTNKNTSD--YDDESDIEIVDEIDNDEDKQDESEEDGEE 407
E++ PE ++ R + + D D+ +++ ++E D + D + G++
Sbjct: 34 EQFPPEEALKSRRVRADDEKEAVAVKEVDPEIDEAAELVSLEEADEEVDPGAPTPAGGDD 93
Query: 408 EGEEEASENEYYTET-DEDEKENEEDDGESDG 438
E+E ++ T EDE++++ DD + G
Sbjct: 94 LAEDEDIADDDDDVTFLEDEEDDDIDDEDIIG 125
>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2). All proteins
in this family for which functions are known are flap
endonucleases that generate the 3' incision next to DNA
damage as part of nucleotide excision repair. This
family is related to many other flap endonuclease
families including the fen1 family. This family is based
on the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1034
Score = 29.5 bits (66), Expect = 5.2
Identities = 24/111 (21%), Positives = 46/111 (41%), Gaps = 10/111 (9%)
Query: 332 NETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEI 391
N + E E +E+ +SD + E++ V+ + + +D +E+ V I
Sbjct: 640 NPMEVEPMESEKEESESDGSFI----EVDSVSSTLELQVPSKSQPTDESEENAENKVASI 695
Query: 392 DNDEDKQ------DESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGES 436
+ + K+ DESEED EE + + + +D E + E+
Sbjct: 696 EGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNEWQDISLEELEALEA 746
>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D. SpoVID, the
stage VI sporulation protein D, is restricted to
endospore-forming members of the bacteria, all of which
are found among the Firmicutes. It is widely distributed
but not quite universal in this group. Between
well-conserved N-terminal and C-terminal domains is a
poorly conserved, low-complexity region of variable
length, rich enough in glutamic acid to cause spurious
BLAST search results unless a filter is used. The seed
alignment for this model was trimmed, in effect, by
choosing member sequences in which these regions are
relatively short. SpoVID is involved in spore coat
assembly by the mother cell compartment late in the
process of sporulation [Cellular processes, Sporulation
and germination].
Length = 338
Score = 28.7 bits (64), Expect = 5.7
Identities = 20/116 (17%), Positives = 44/116 (37%), Gaps = 7/116 (6%)
Query: 322 ILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDD 381
+ + +D S+ ++ + + D +E+ + E +E A + + +
Sbjct: 125 LQLTADISISGIQQENNLDAEPAREDEEEE-ESFSAEFEHPAQEETAGEEERTDEPKVEH 183
Query: 382 ESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
E+ + D+D D+ S E E+E +E+ E ED+ E +
Sbjct: 184 EAHEQHEQPADDDPDEWKISA------SEPFQLESEVEASPEEENYEEYEDETELE 233
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 28.7 bits (65), Expect = 5.9
Identities = 22/140 (15%), Positives = 48/140 (34%), Gaps = 30/140 (21%)
Query: 305 HSKKLLRKDYMQSRLTSILV--RSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQV 362
+S KL K LV + D E E ++ + E G + ++
Sbjct: 269 YSPKLAEKP-------RWLVFNKIDLLDEEEAEERAKAIVEALGWEGP-----VYLISAA 316
Query: 363 TRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEE---EGEEEASENEYY 419
+ E +++ I+ + ++ E E E+ ++ E
Sbjct: 317 SGLGVKELC-------------WDLMTFIEENPREEAEEAEAPEKVEFMWDDYHREQLEE 363
Query: 420 TETDEDEKENEEDDGESDGG 439
E ++D+ +++ D + D G
Sbjct: 364 VEEEDDDDWDDDWDEDDDEG 383
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 28.9 bits (64), Expect = 6.0
Identities = 18/67 (26%), Positives = 27/67 (40%), Gaps = 3/67 (4%)
Query: 369 ESKTNKNTSDYDD---ESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDED 425
E + + +D + D + E DN+E DE + EG EE S E + E
Sbjct: 396 EEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLREF 455
Query: 426 EKENEED 432
+ED
Sbjct: 456 RDMEKED 462
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 27.5 bits (61), Expect = 6.2
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 390 EIDNDEDKQDESEEDGEEEGEEEASE 415
E +E ++++ +E+ +EE EEEA
Sbjct: 75 EEKAEEKEEEKKKEEEKEEEEEEALA 100
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 29.0 bits (64), Expect = 6.7
Identities = 26/121 (21%), Positives = 58/121 (47%), Gaps = 3/121 (2%)
Query: 324 VRSDFSLENETKKEEQEHSEE---KGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYD 380
V + E + K EE + +EE K + +K + E ++ Q+ ++ E K + +
Sbjct: 1597 VMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAE 1656
Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGR 440
+E+ I+ +E E+ + ++EE + E +E+ + E +E +K E E++ +
Sbjct: 1657 EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKK 1716
Query: 441 R 441
+
Sbjct: 1717 K 1717
>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family. This protein is
found in bacteria. Proteins in this family are typically
between 234 to 465 amino acids in length. There is a
conserved GEV sequence motif.Most members are annotated
as being SprA-related.
Length = 261
Score = 28.6 bits (64), Expect = 6.9
Identities = 15/57 (26%), Positives = 21/57 (36%), Gaps = 2/57 (3%)
Query: 331 ENETKKE-EQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIE 386
ENE + E QE EE + DE + NQ+T T + + I
Sbjct: 206 ENEARAELAQERQEEASEEDEA-EALAGAENQITALEPLAPSTPQVDGQVEATGAIA 261
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 27.2 bits (61), Expect = 7.0
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 394 DEDKQDESEEDGEEEGEEEASE 415
+E K++E EE+ +EE EEEA+
Sbjct: 78 EEKKEEEEEEEEKEESEEEAAA 99
Score = 26.8 bits (60), Expect = 8.2
Identities = 10/35 (28%), Positives = 16/35 (45%), Gaps = 3/35 (8%)
Query: 405 GEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
EE E E E +E+++E+EE+ G
Sbjct: 72 AAAAAAEEKKEEE---EEEEEKEESEEEAAAGLGA 103
>gnl|CDD|218803 pfam05904, DUF863, Plant protein of unknown function (DUF863).
This family consists of a number of hypothetical
proteins from Arabidopsis thaliana and Oryza sativa. The
function of this family is unknown.
Length = 766
Score = 28.9 bits (64), Expect = 7.0
Identities = 23/122 (18%), Positives = 36/122 (29%), Gaps = 22/122 (18%)
Query: 337 EEQEHSEEKGDSDEKYDPEHFEMNQVTRE------NYAESKTNKNTSDYDDES---DIEI 387
EE + E +K D E+ ++ E ES + T E+
Sbjct: 521 EEDQKRSEVLSPQQKDDDSSDELIKIAAESIVAICRDVESAASSLTDALSKSPLHWFAEL 580
Query: 388 VDEIDNDEDKQDESEEDGEEE-------------GEEEASENEYYTETDEDEKENEEDDG 434
D + + + E E EE E EY + E N E+ G
Sbjct: 581 ASSSKEDLEDKIDKEGCVESSSSDEIDYFEAMTLQLEETKEEEYMPKPLVPENLNFEETG 640
Query: 435 ES 436
+
Sbjct: 641 TA 642
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 28.2 bits (63), Expect = 7.2
Identities = 17/80 (21%), Positives = 31/80 (38%), Gaps = 1/80 (1%)
Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
E ET++ +QE E + + T + + N D++ E +
Sbjct: 102 EAETRQLQQELREAQEAHERARQELLEAAAAPTAPPHVAAPVN-GEQLEPDDNGEEASAD 160
Query: 391 IDNDEDKQDESEEDGEEEGE 410
++ D D +D SEE+ E
Sbjct: 161 LETDPDMKDRSEEERVTYAE 180
>gnl|CDD|217956 pfam04194, PDCD2_C, Programmed cell death protein 2, C-terminal
putative domain.
Length = 165
Score = 27.8 bits (62), Expect = 7.3
Identities = 14/56 (25%), Positives = 22/56 (39%), Gaps = 7/56 (12%)
Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
+Y+ + +E + E E E G+E+ ETDE EK +D
Sbjct: 12 EYETLHAPSLPEEPEEREIDAFEELEKEGGGGKEDE-------ETDESEKAKSDDK 60
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 28.0 bits (62), Expect = 7.4
Identities = 14/43 (32%), Positives = 21/43 (48%)
Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
E+ D E+ EE+ EE + E DE+ +E EE+ S
Sbjct: 21 EEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASS 63
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 27.0 bits (60), Expect = 7.7
Identities = 8/38 (21%), Positives = 15/38 (39%), Gaps = 5/38 (13%)
Query: 397 KQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDG 434
+ + E +E E E +E+ +E +DD
Sbjct: 71 AAAAAGAEAAAEADEAEEE-----EKEEEAEEESDDDM 103
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
Length = 193
Score = 28.2 bits (62), Expect = 7.8
Identities = 16/51 (31%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASE-NEYYTETDEDEKENEEDDGESD 437
+DE+ E + E EE E++ E+ E + E E E+ENEE + +
Sbjct: 18 IDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQKPKEE 68
>gnl|CDD|185626 PTZ00447, PTZ00447, apical membrane antigen 1-like protein;
Provisional.
Length = 508
Score = 28.4 bits (63), Expect = 8.0
Identities = 22/80 (27%), Positives = 39/80 (48%), Gaps = 6/80 (7%)
Query: 360 NQVTRENYAESKTNKNTSDYDDE--SDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENE 417
+++ +E Y + + + T+ DE +D+ DEI D DE E D +E E E +
Sbjct: 403 HKMIKEMYVQKDSLETTAGERDEFKNDVLEEDEISGDNIGPDEIENDHYQEKEIENDNIQ 462
Query: 418 YYTETDEDEKENEEDDGESD 437
Y DE++ E + D ++
Sbjct: 463 Y----DENKNEGKNDHIPNN 478
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 28.5 bits (63), Expect = 8.3
Identities = 27/107 (25%), Positives = 39/107 (36%), Gaps = 3/107 (2%)
Query: 334 TKKEEQEHSEEKGDSDEKY--DPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEI 391
TKK E SE + E + + K +SD ++E + V +
Sbjct: 81 TKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRK- 139
Query: 392 DNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
K DE ED E E E + + E+E E E D + DG
Sbjct: 140 RRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDDG 186
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 28.5 bits (63), Expect = 8.4
Identities = 17/67 (25%), Positives = 28/67 (41%), Gaps = 1/67 (1%)
Query: 365 ENYAESKT-NKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETD 423
ENY K + S + + + E+ Q SEED E+ E+ E D
Sbjct: 435 ENYCVGKRLPPHLSPFASVDSYDPRASLMTMEETQRHSEEDLVNRFEDVRYEHVAGEEDD 494
Query: 424 EDEKENE 430
+D++E +
Sbjct: 495 DDDEELQ 501
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 26.9 bits (60), Expect = 9.2
Identities = 12/35 (34%), Positives = 17/35 (48%), Gaps = 3/35 (8%)
Query: 405 GEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
EEE E E E +E+E+E+EE+ G
Sbjct: 71 AAAAAEEEEEEEE---EEEEEEEESEEEAMAGLGA 102
>gnl|CDD|218576 pfam05395, DARPP-32, Protein phosphatase inhibitor 1/DARPP-32.
This family consists of several mammalian protein
phosphatase inhibitor 1 (IPP-1) and dopamine- and
cAMP-regulated neuronal phosphoprotein (DARPP-32)
proteins. Protein phosphatase inhibitor-1 is involved in
signal transduction and is an endogenous inhibitor of
protein phosphatase-1. It has been demonstrated that
DARPP-32, if phosphorylated, can inhibit
protein-phosphatase-1. DARPP-32 has a key role in many
neurotransmitter pathways throughout the brain and has
been shown to be involved in controlling receptors, ion
channels and other physiological factors including the
brain's response to drugs of abuse, such as cocaine,
opiates and nicotine. DARPP-32 is reciprocally regulated
by the two neurotransmitters that are most often
implicated in schizophrenia - dopamine and glutamate.
Dopamine activates DARPP-32 through the D1 receptor
pathway and disables DARPP-32 through the D2 receptor.
Glutamate, acting through the N-methyl-d-aspartate
receptor, renders DARPP-32 inactive. A mutant form of
DARPP-32 has been linked with gastric cancers.
Length = 170
Score = 27.6 bits (61), Expect = 9.5
Identities = 10/54 (18%), Positives = 16/54 (29%)
Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRGS 444
+ E+ Q EED + +E T+ + E D G
Sbjct: 94 QGSLEENQASEEEDELDSPDELGYPTGETESTEAQDSEEPGSCPTQDAQSEVGL 147
>gnl|CDD|215541 PLN03020, PLN03020, low-temperature-induced protein; Provisional.
Length = 556
Score = 28.1 bits (62), Expect = 10.0
Identities = 19/80 (23%), Positives = 34/80 (42%), Gaps = 9/80 (11%)
Query: 337 EEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNT-------SDYDDESDIEIVD 389
EE H+ + E D H E V ++ ++K K+T +D+ + D
Sbjct: 10 EEDPHNVGLHSATEGED--HHEKKSVLKKVKDKAKKIKDTITKHGHGHHHDEHRVPDDHD 67
Query: 390 EIDNDEDKQDESEEDGEEEG 409
+ D+D+ +E ED + G
Sbjct: 68 LYEEDDDEDEEMAEDPQIHG 87
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.311 0.131 0.379
Gapped
Lambda K H
0.267 0.0696 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,230,928
Number of extensions: 2140521
Number of successful extensions: 6086
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3913
Number of HSP's successfully gapped: 567
Length of query: 444
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 344
Effective length of database: 6,502,202
Effective search space: 2236757488
Effective search space used: 2236757488
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 61 (27.3 bits)