RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9274
(318 letters)
>gnl|CDD|215646 pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin family).
This family contains, amongst other G-protein-coupled
receptors (GCPRs), members of the opsin family, which
have been considered to be typical members of the
rhodopsin superfamily. They share several motifs, mainly
the seven transmembrane helices, GCPRs of the rhodopsin
superfamily. All opsins bind a chromophore, such as
11-cis-retinal. The function of most opsins other than
the photoisomerases is split into two steps: light
absorption and G-protein activation. Photoisomerases, on
the other hand, are not coupled to G-proteins - they are
thought to generate and supply the chromophore that is
used by visual opsins.
Length = 251
Score = 131 bits (332), Expect = 9e-37
Identities = 48/134 (35%), Positives = 76/134 (56%), Gaps = 4/134 (2%)
Query: 85 VVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCK--MYLIVV 142
+V+ +++R+ TN+ ++NLA+ADLLF++ P+ V WPFGD CK +L VV
Sbjct: 1 LVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWPFGDALCKLVGFLFVV 60
Query: 143 TAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHGEV 202
YAS+ L +S+DR+LA+VHP+ +RT R A IL+ W++ +L ++P
Sbjct: 61 NGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILVVWVLALLLSLPPL--LFSW 118
Query: 203 SYTYASELHTACVF 216
T T C+
Sbjct: 119 LRTVEEGNVTTCLI 132
>gnl|CDD|222976 PHA03087, PHA03087, G protein-coupled chemokine receptor-like
protein; Provisional.
Length = 335
Score = 94.5 bits (235), Expect = 3e-22
Identities = 50/210 (23%), Positives = 93/210 (44%), Gaps = 24/210 (11%)
Query: 13 IVDIQHHYATCNISYNEYNRSNIPAICRNFSHEYINDNDVDAEAMERMVSIIVPIFFGII 72
+ I Y + A C D S I+ + + I
Sbjct: 4 TLTINTTIENTTDYYYDTYYDEDYADC---DLNIGYDT----------NSTILIVVYSTI 50
Query: 73 IILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGD 132
G GN ++V+ V ++++ ++ ++NLA++DLLF++ +PF Y+ W FG+
Sbjct: 51 FFFGLVGN-IIVIYVLTKTKIKTPMDIYLLNLAVSDLLFVMT-LPFQIYYYILFQWSFGE 108
Query: 133 VWCKM--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVL 190
CK+ L + Y S+ + +MS+DR++A+VHP+ S + T + L+ WI+ ++
Sbjct: 109 FACKIVSGLYYIGFYNSMNFITVMSVDRYIAIVHPVKSNKINTVKYGYIVSLVIWIISII 168
Query: 191 TAVPVFNMHGEVSYTYASELHT--ACVFLD 218
P+ YT + T C+F +
Sbjct: 169 ETTPIL-----FVYTTKKDHETLICCMFYN 193
>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
Provisional.
Length = 417
Score = 93.5 bits (232), Expect = 2e-21
Identities = 51/173 (29%), Positives = 101/173 (58%), Gaps = 12/173 (6%)
Query: 41 NFSHEYINDNDVDAEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLL 100
N ++E IN + + M +S + IF+ II ILG FGNA +++++ F +++++ T++
Sbjct: 80 NITYELIN---IKNKCMYPSISEYIKIFYIIIFILGLFGNAAIIMIL-FCKKIKTITDIY 135
Query: 101 IINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKMYLIVVTAY----ASVYTLVLMSL 156
I NLAI+DL+F++ PF + W FGD CK +I + Y ++++ + LMS+
Sbjct: 136 IFNLAISDLIFVID-FPFIIYNEFDQ-WIFGDFMCK--VISASYYIGFFSNMFLITLMSI 191
Query: 157 DRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHGEVSYTYASE 209
DR+ A+++PI+ RT + +I+WI+ ++ P + + + ++++
Sbjct: 192 DRYFAILYPISFQKYRTFNIGIILCIISWILSLIITSPAYFIFEASNIIFSAQ 244
>gnl|CDD|220695 pfam10328, 7TM_GPCR_Srx, Serpentine type 7TM GPCR chemoreceptor
Srx. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srx is part of the Srg
superfamily of chemoreceptors. Chemoperception is one of
the central senses of soil nematodes like C. elegans
which are otherwise 'blind' and 'deaf'.
Length = 275
Score = 59.1 bits (144), Expect = 4e-10
Identities = 32/150 (21%), Positives = 61/150 (40%), Gaps = 14/150 (9%)
Query: 71 IIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIAD----LLFIVCCVPFTATDYVTT 126
+I + G N L+ + ++++ +L N AI++ +F+ VP T D
Sbjct: 2 LISLFGIILNWLIFYIFLKLPSLKNSFGILCANKAISNAIICTIFLFYVVPMTLLDL--- 58
Query: 127 CWPFGDVWCKMYLIVVTAY----ASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAIL 182
F ++ + S T +L+SL+RF AV P + + +N I
Sbjct: 59 --SFLPELLNSHIGGLILLGLYEISPLTHLLISLNRFCAVFFPFKYKKIFSIKNTKIIIT 116
Query: 183 ITWIVIVLTAVPVFNMHGEVSYTYASELHT 212
WI+ ++ + + G + Y+ E T
Sbjct: 117 FIWIIAIIISTLFYFPLG-CHFYYSPESLT 145
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 57.3 bits (138), Expect = 4e-09
Identities = 33/63 (52%), Positives = 39/63 (61%), Gaps = 8/63 (12%)
Query: 244 EREVHEEEEEEEEEEEETKAK---ETKLKEKKKLGGGGG-----EEEEEEEEEEEEEEEE 295
E + + E ++E E+E A+ E K EK GGGG EEEEEEEEEEEEEEEE
Sbjct: 820 ETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEE 879
Query: 296 EEE 298
EEE
Sbjct: 880 EEE 882
Score = 54.6 bits (131), Expect = 3e-08
Identities = 30/57 (52%), Positives = 32/57 (56%), Gaps = 19/57 (33%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
GG + + EEEEEEEEEE EEEEEEEEEEEEEEEE EE
Sbjct: 855 GGGSDGGDSEEEEEEEEEE-------------------EEEEEEEEEEEEEEEENEE 892
Score = 47.3 bits (112), Expect = 7e-06
Identities = 28/62 (45%), Positives = 30/62 (48%), Gaps = 22/62 (35%)
Query: 241 KGGEREVHEEEE----EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
K E+ V + EEEEEE EEEEEEEEEEEEEEEEE
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEE------------------EEEEEEEEEEEEEEEEE 887
Query: 297 EE 298
EE
Sbjct: 888 EE 889
Score = 43.8 bits (103), Expect = 9e-05
Identities = 21/56 (37%), Positives = 31/56 (55%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ E ++E E+E + + K+ +K GGG + + EEEEEEEEEEEE
Sbjct: 820 ETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEE 875
Score = 41.1 bits (96), Expect = 7e-04
Identities = 27/92 (29%), Positives = 32/92 (34%), Gaps = 29/92 (31%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEE------------ 283
E+ KG E + E E E E E E + + + E E
Sbjct: 787 EDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAK 846
Query: 284 -----------------EEEEEEEEEEEEEEE 298
EEEEEEEEEEEEEEE
Sbjct: 847 QDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEE 878
Score = 38.4 bits (89), Expect = 0.004
Identities = 20/65 (30%), Positives = 31/65 (47%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
E D +G ++ E ++E E+ E + + K+ G G + + EEEEE
Sbjct: 809 AGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEE 868
Query: 294 EEEEE 298
EEEEE
Sbjct: 869 EEEEE 873
Score = 37.7 bits (87), Expect = 0.008
Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 2/79 (2%)
Query: 225 PDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGG--GEEE 282
+G +T ++E++ ER+ +E E E E +E ET+ +E + G G E+
Sbjct: 669 QEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTED 728
Query: 283 EEEEEEEEEEEEEEEEIGG 301
E E E EE EE E+E G
Sbjct: 729 EGEIETGEEGEEVEDEGEG 747
Score = 37.7 bits (87), Expect = 0.008
Identities = 26/74 (35%), Positives = 33/74 (44%), Gaps = 9/74 (12%)
Query: 234 QSENDYCKGGEREVHEEEEE--EEEEEEETKAK-------ETKLKEKKKLGGGGGEEEEE 284
+ EN GGE E E E E E E E A+ E +++ K+ G E EE
Sbjct: 656 EGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEV 715
Query: 285 EEEEEEEEEEEEEE 298
E E E E E E+E
Sbjct: 716 EHEGETEAEGTEDE 729
Score = 35.7 bits (82), Expect = 0.029
Identities = 21/60 (35%), Positives = 26/60 (43%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
+ E + E E EE E E E E + G GEE E+E E E E + E E G
Sbjct: 700 EAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEG 759
Score = 34.6 bits (79), Expect = 0.083
Identities = 22/64 (34%), Positives = 26/64 (40%), Gaps = 5/64 (7%)
Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
D KG E E EE E E + E+ GGE E+E E E + E E E
Sbjct: 629 DLSKGDVAEAEHTGERTGEEGERPTEAEGENGEE-----SGGEAEQEGETETKGENESEG 683
Query: 298 EIGG 301
EI
Sbjct: 684 EIPA 687
Score = 34.6 bits (79), Expect = 0.085
Identities = 19/61 (31%), Positives = 26/61 (42%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
E E EE E E+E ET+ K E + GE+E E E E +E + + E
Sbjct: 653 TEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEA 712
Query: 301 G 301
Sbjct: 713 E 713
Score = 34.2 bits (78), Expect = 0.095
Identities = 16/25 (64%), Positives = 17/25 (68%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKL 268
E E EEEEEEEEEEEE + E L
Sbjct: 870 EEEEEEEEEEEEEEEEEEEENEEPL 894
Score = 34.2 bits (78), Expect = 0.11
Identities = 20/59 (33%), Positives = 28/59 (47%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
G E E E+E E E E + E KE + G E +E+E+E E + E+ E G
Sbjct: 735 GEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKG 793
Score = 33.8 bits (77), Expect = 0.12
Identities = 18/58 (31%), Positives = 23/58 (39%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
GE E E+E E ETK + E G E E E E +E + + E E
Sbjct: 657 GENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEE 714
Score = 33.8 bits (77), Expect = 0.12
Identities = 21/60 (35%), Positives = 28/60 (46%), Gaps = 1/60 (1%)
Query: 243 GEREVHEE-EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
GE E EE EE E+E E E + K E + E E E +E+E+E E + G
Sbjct: 730 GEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDG 789
Score = 33.8 bits (77), Expect = 0.13
Identities = 18/59 (30%), Positives = 29/59 (49%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
+ H+ E E EE E E + + +++ ++ G EE E+E E E E + E E G
Sbjct: 701 AKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEG 759
Score = 33.4 bits (76), Expect = 0.19
Identities = 13/50 (26%), Positives = 19/50 (38%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E EE E E + + + + G E + E E E E E + E
Sbjct: 643 ERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGE 692
Score = 33.0 bits (75), Expect = 0.25
Identities = 16/26 (61%), Positives = 18/26 (69%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEE 260
SE + + E E EEEEEEEEEEEE
Sbjct: 863 SEEEEEEEEEEEEEEEEEEEEEEEEE 888
Score = 32.3 bits (73), Expect = 0.42
Identities = 21/57 (36%), Positives = 29/57 (50%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+G E E E EE EE E+ E + K + + G E E E E E E +E+E+E
Sbjct: 724 EGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDE 780
Score = 32.3 bits (73), Expect = 0.43
Identities = 20/59 (33%), Positives = 29/59 (49%), Gaps = 1/59 (1%)
Query: 244 EREVHEEEEEEEEEEE-ETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
E E+ EE EE E+E E +A+ E + E E E E +E+E+E E + G
Sbjct: 729 EGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGE 787
Score = 31.9 bits (72), Expect = 0.49
Identities = 22/75 (29%), Positives = 36/75 (48%), Gaps = 2/75 (2%)
Query: 241 KGGEREVHEEEEEEEEEEEETKA--KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+G E E E E E + E ET+ KET+ + + + G E+E E + E+ E + +E
Sbjct: 737 EGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEG 796
Query: 299 IGGVVVSAVDFGSGG 313
G V + +G
Sbjct: 797 AEGKVEHEGETEAGE 811
Score = 31.5 bits (71), Expect = 0.64
Identities = 19/62 (30%), Positives = 25/62 (40%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
G E E E E E EE +A++ E K GE E + E+E E E E +
Sbjct: 645 TGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEA 704
Query: 301 GV 302
Sbjct: 705 DH 706
Score = 31.5 bits (71), Expect = 0.70
Identities = 21/62 (33%), Positives = 30/62 (48%), Gaps = 1/62 (1%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLK-EKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+ GE E E+E E E + E ++ E+K G GE E +E + + E E EE E
Sbjct: 658 ENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEH 717
Query: 300 GG 301
G
Sbjct: 718 EG 719
Score = 29.6 bits (66), Expect = 2.7
Identities = 18/47 (38%), Positives = 24/47 (51%), Gaps = 7/47 (14%)
Query: 226 DGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEE-------ETKAKE 265
DG + + E + + E E EEEEEEEE EE ET+ K+
Sbjct: 859 DGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 28.0 bits (62), Expect = 8.3
Identities = 19/67 (28%), Positives = 27/67 (40%), Gaps = 1/67 (1%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E + + E E E+E E ET E + E + G G+ E E E + +E E E
Sbjct: 709 ETEAEEVEHEGETEAEGTEDEGEIETGE-EGEEVEDEGEGEAEGKHEVETEGDRKETEHE 767
Query: 296 EEEIGGV 302
E
Sbjct: 768 GETEAEG 774
>gnl|CDD|165177 PHA02834, PHA02834, chemokine receptor-like protein; Provisional.
Length = 323
Score = 54.1 bits (130), Expect = 2e-08
Identities = 40/161 (24%), Positives = 88/161 (54%), Gaps = 10/161 (6%)
Query: 43 SHEYINDNDVDAEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLII 102
++ Y + + D E + V+ V +F+ ++ I G GN LV+ V+ + M ++ +
Sbjct: 9 NYSYYFEEECDFEMVNSDVNYFVIVFYILLFIFGLIGNVLVIAVLIVKRFM-FVVDVYLF 67
Query: 103 NLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKMYLIV--VTAYASVYTLVLMSLDRFL 160
N+A++DL+ +V PF + + W FG+ CK+ L V V +++++ + L+S+DR++
Sbjct: 68 NIAMSDLM-LVFSFPFIIHNDLNE-WIFGEFMCKLVLGVYFVGFFSNMFFVTLISIDRYI 125
Query: 161 AVVHPIASMSVRTERNALKAIL--ITWIVIVLTAVPVFNMH 199
VV+ + ++ + +L +L W+ V+ ++P ++
Sbjct: 126 LVVN---ATKIKNKSISLSVLLSVAAWVCSVILSMPAMVLY 163
>gnl|CDD|192535 pfam10320, 7TM_GPCR_Srsx, Serpentine type 7TM GPCR chemoreceptor
Srsx. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srsx is a solo family
amongst the superfamilies of chemoreceptors.
Chemoperception is one of the central senses of soil
nematodes like C. elegans which are otherwise 'blind'
and 'deaf'.
Length = 257
Score = 50.7 bits (122), Expect = 3e-07
Identities = 27/110 (24%), Positives = 51/110 (46%), Gaps = 14/110 (12%)
Query: 73 IILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTA---TDYVTT--- 126
++G FGN +++++ +++RS + LI +ADLL + + F T T
Sbjct: 1 SVIGIFGNVIMIILTFKKKKLRSKCSYLICVQCLADLLCLSGEIIFVVLLFTGTQLTRNE 60
Query: 127 CWPFGDVWCKMYLIVVTAYASVYTLVL-MSLDRFLAVVHPIASMSVRTER 175
C+ Y+ TA + L+L + +DR +AV PI + + +
Sbjct: 61 CFLIIIP----YIFGQTAQS---PLMLMIGIDRLIAVKFPIFYRLLSSSK 103
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 49.3 bits (118), Expect = 7e-07
Identities = 21/50 (42%), Positives = 29/50 (58%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+EEE+EE+EEE K K KK L E+E+ + E+EE+ E EE
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEED 87
Score = 41.2 bits (97), Expect = 3e-04
Identities = 21/60 (35%), Positives = 34/60 (56%), Gaps = 14/60 (23%)
Query: 249 EEEEEEEEEEEETK---------AKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+EEE+EE+EEE+ K A + K++EK+K + E+EE+ E EE+ E+
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKA-----KREKEEKGLRELEEDTPEDE 92
Score = 31.2 bits (71), Expect = 0.58
Identities = 20/70 (28%), Positives = 32/70 (45%), Gaps = 19/70 (27%)
Query: 246 EVHEEEEEEEEE-----------------EEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
+ E+EE+EEE+ EE+ KAK + KE+K L + E+E E
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAK--REKEEKGLRELEEDTPEDELAE 95
Query: 289 EEEEEEEEEE 298
+ + +EE
Sbjct: 96 KLRLRKLQEE 105
Score = 30.8 bits (70), Expect = 0.84
Identities = 15/44 (34%), Positives = 25/44 (56%)
Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
++EE E A K K K +++ ++ +EEE+EE+EEE
Sbjct: 5 DDEEFEPPAPPAKAVVKDKWDDEDEDDDVKDSWDEEEDEEKEEE 48
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 47.3 bits (113), Expect = 4e-06
Identities = 20/57 (35%), Positives = 31/57 (54%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
G +V ++E E + E+ + K+ K+ K+ EE+EEE EEEE E E+E
Sbjct: 116 GEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172
Score = 43.9 bits (104), Expect = 5e-05
Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 7/58 (12%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E E + E+EEE++E KAKE +E EE+EEE EEEE E E+E+
Sbjct: 124 DKEIESSDSEDEEEKDEAAKKAKEDSDEELS-------EEDEEEAAEEEEAEAEKEKA 174
Score = 40.4 bits (95), Expect = 7e-04
Identities = 18/56 (32%), Positives = 30/56 (53%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E + E + E+EEE+ +A + ++ + EEE EEEE E E+E+ E+
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASEL 177
Score = 36.2 bits (84), Expect = 0.015
Identities = 17/66 (25%), Positives = 30/66 (45%), Gaps = 7/66 (10%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E K E+ + +++++EEEE E + E E G + E ++E E + E
Sbjct: 81 EERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDE-------GEWIDVESDKEIESSDSE 133
Query: 296 EEEIGG 301
+EE
Sbjct: 134 DEEEKD 139
Score = 36.2 bits (84), Expect = 0.017
Identities = 20/68 (29%), Positives = 33/68 (48%), Gaps = 11/68 (16%)
Query: 243 GEREVHEEEE-----EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
GE EV + E+ +EEE K + + E +++EEEE E EE+E+ ++
Sbjct: 61 GEEEVVDGIPGLELLEKWKEEERKKKEAEQGLE------SDDDDDEEEEWEVEEDEDSDD 114
Query: 298 EIGGVVVS 305
E + V
Sbjct: 115 EGEWIDVE 122
Score = 31.5 bits (72), Expect = 0.49
Identities = 14/65 (21%), Positives = 26/65 (40%), Gaps = 13/65 (20%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
K ER+ E E+ E ++++ E EE+E+ ++E E + E
Sbjct: 79 KEEERKKKEAEQGLESDDDDD-------------EEEEWEVEEDEDSDDEGEWIDVESDK 125
Query: 301 GVVVS 305
+ S
Sbjct: 126 EIESS 130
Score = 28.5 bits (64), Expect = 4.9
Identities = 10/38 (26%), Positives = 22/38 (57%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK 273
E + + ++ + +EE EE+EE A+E + + +K+
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172
Score = 28.1 bits (63), Expect = 6.5
Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 19/87 (21%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKE------------TKLKE-------KKKL 274
E+ + E + E EEEE E E+ KA E K++E K L
Sbjct: 145 AKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTPADFAKIQELRLEKGVDKAL 204
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIGG 301
GG +++ E +E + ++I G
Sbjct: 205 GGKLKRRDKDAPERHSDELVDADDIEG 231
Score = 27.3 bits (61), Expect = 9.9
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKK 272
+ E D +E +EE EE+EEE + +E + +++K
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 46.3 bits (110), Expect = 6e-06
Identities = 22/49 (44%), Positives = 32/49 (65%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
EEEE+ +E+ E KLKE + +E++EEEEEEEEEE+E+ +
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFD 190
Score = 42.0 bits (99), Expect = 2e-04
Identities = 12/54 (22%), Positives = 31/54 (57%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E E +E+ E++ + + + E+ + EEEEEE+E+ ++++++++
Sbjct: 144 EEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197
Score = 39.0 bits (91), Expect = 0.001
Identities = 12/63 (19%), Positives = 36/63 (57%)
Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
F+ + E ++ E+ E++ +E +A++ +++K EEEE+E+ +++++
Sbjct: 135 FKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194
Query: 293 EEE 295
+++
Sbjct: 195 DDD 197
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 46.6 bits (111), Expect = 9e-06
Identities = 22/86 (25%), Positives = 36/86 (41%), Gaps = 6/86 (6%)
Query: 222 VNRPDG---YNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEK---KKLG 275
V P YN + + + E+EV E++ +E E E K E E ++
Sbjct: 179 VEVPHAGASYNPSFEDHQELLQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMS 238
Query: 276 GGGGEEEEEEEEEEEEEEEEEEEIGG 301
G EE +++ EEE ++E E
Sbjct: 239 EGLLEESDDDGEEESDDESAWEGFES 264
Score = 36.2 bits (84), Expect = 0.020
Identities = 17/72 (23%), Positives = 27/72 (37%), Gaps = 13/72 (18%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGE--------EEEEEEE 287
E G E E E E E E K + K K K + E E E +
Sbjct: 243 EESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTK-----AQRNKEKRRKELEREAK 297
Query: 288 EEEEEEEEEEEI 299
EE++ +++ ++
Sbjct: 298 EEKQLKKKLAQL 309
Score = 33.1 bits (76), Expect = 0.16
Identities = 12/55 (21%), Positives = 29/55 (52%), Gaps = 6/55 (10%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
ERE EE++ +++ + + KE + +K E+ ++E+ +E E+++
Sbjct: 293 EREAKEEKQLKKKLAQLARLKEIAKEVAQK------EKARARKKEQRKERGEKKK 341
Score = 31.2 bits (71), Expect = 0.69
Identities = 7/25 (28%), Positives = 14/25 (56%)
Query: 279 GEEEEEEEEEEEEEEEEEEEIGGVV 303
E+E + E++ +E E EE+ +
Sbjct: 203 YEKEVKAEKKRQELERVEEKKLEKM 227
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 46.4 bits (110), Expect = 1e-05
Identities = 28/66 (42%), Positives = 35/66 (53%), Gaps = 6/66 (9%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E+ E E E EEEEEEEEE + +E + +E GE+EEEEEE E + E
Sbjct: 430 ESPSMASQESEEEESVEEEEEEEEEEEEEEQESEE------EEGEDEEEEEEVEADNGSE 483
Query: 296 EEEIGG 301
EE G
Sbjct: 484 EEMEGS 489
Score = 41.8 bits (98), Expect = 4e-04
Identities = 22/72 (30%), Positives = 35/72 (48%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+ E + + E E E EEEE E+EEE + E ++++ G + + EE EE+ E
Sbjct: 446 EEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAER 505
Query: 294 EEEEEIGGVVVS 305
E G +S
Sbjct: 506 RNSEMAGISRMS 517
Score = 38.4 bits (89), Expect = 0.005
Identities = 21/58 (36%), Positives = 25/58 (43%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
E E E E+EEEEEE E + E G G GEE EE+ E E +
Sbjct: 461 ESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSE 518
Score = 37.6 bits (87), Expect = 0.007
Identities = 25/69 (36%), Positives = 31/69 (44%), Gaps = 16/69 (23%)
Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
Q + E E EEEEEEEE+E E + E+EEEEEE E +
Sbjct: 436 SQESEEEESVEEEEEEEEEEEEEEQESEEEE----------------GEDEEEEEEVEAD 479
Query: 293 EEEEEEIGG 301
EEE+ G
Sbjct: 480 NGSEEEMEG 488
Score = 37.2 bits (86), Expect = 0.011
Identities = 19/66 (28%), Positives = 27/66 (40%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E + + E E+EEEEEE E E +++ + G G E EE+ E E
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGI 513
Query: 296 EEEIGG 301
G
Sbjct: 514 SRMSEG 519
Score = 35.7 bits (82), Expect = 0.028
Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 5/55 (9%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGE-----EEEEEEEEEEEEEEEEEE 298
EEEE + +E E + ++ + K GE +E EEEE EEEEEEEE
Sbjct: 399 EEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEE 453
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 46.1 bits (110), Expect = 2e-05
Identities = 25/50 (50%), Positives = 37/50 (74%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E++ EEE++E++ KA K KE+++ +EEE+EEEEEE EEE+EEE
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469
Score = 45.7 bits (109), Expect = 2e-05
Identities = 24/53 (45%), Positives = 36/53 (67%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
V + E++ EEE++E K K K+K++ E++EEE+EEEEEE EEE+E
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467
Score = 45.7 bits (109), Expect = 2e-05
Identities = 24/58 (41%), Positives = 39/58 (67%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
E+ + EEE++E++++ A + K +E+++ EE+EEEEEE EEE+EEEEE
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKK 474
Score = 44.5 bits (106), Expect = 4e-05
Identities = 26/85 (30%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 214 CVFLDLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK 273
FL+L + + + S+ K + E++ EEE++E+ K K++++
Sbjct: 389 AAFLELT-----EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEE 443
Query: 274 LGGGGGEEEEEEEEEEEEEEEEEEE 298
++EEE+EEEEEE EEE+EE
Sbjct: 444 EEEEKEKKEEEKEEEEEEAEEEKEE 468
Score = 44.5 bits (106), Expect = 5e-05
Identities = 22/55 (40%), Positives = 36/55 (65%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ ++ EEE++E+++ K +E+++ EEE+EEEEEE EEE+EEEE
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEE 470
Score = 40.3 bits (95), Expect = 0.001
Identities = 20/53 (37%), Positives = 33/53 (62%), Gaps = 6/53 (11%)
Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+ + E+ E++ EE K ++ K K ++EEEEEEE+E++EEE+EE
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGK------KKEEEEEEEKEKKEEEKEEE 458
Score = 39.5 bits (93), Expect = 0.002
Identities = 19/57 (33%), Positives = 38/57 (66%), Gaps = 4/57 (7%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
E++ +++ +++EE + +E + KE++K EEEE EEE+EEEEE+++++
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEK----EEEEEEAEEEKEEEEEKKKKQAT 478
Score = 38.0 bits (89), Expect = 0.006
Identities = 17/60 (28%), Positives = 34/60 (56%)
Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSA 306
++ ++ ++ KA++ + +EKK+ +++EEEEEEE+E++EEE A
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Score = 33.0 bits (76), Expect = 0.23
Identities = 24/57 (42%), Positives = 35/57 (61%), Gaps = 12/57 (21%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+ ++ +++EEEEEEE+ K +E K EEEEEE EEE+EEEEE++
Sbjct: 429 EKKKKAFAGKKKEEEEEEEKEKKEEEK------------EEEEEEAEEEKEEEEEKK 473
Score = 29.9 bits (68), Expect = 2.0
Identities = 13/40 (32%), Positives = 25/40 (62%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGE 280
G ++E EEEE+E++EEE+ + +E +EK++ +
Sbjct: 436 AGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 29.1 bits (66), Expect = 3.1
Identities = 14/47 (29%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EEE E T +K+ K KK + E++ EEE++E++++ +
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKA--EKKREEEKKEKKKKAFAGKK 439
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 44.2 bits (104), Expect = 6e-05
Identities = 16/63 (25%), Positives = 31/63 (49%), Gaps = 2/63 (3%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETK--LKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
E E E+ EE EEE+ E + +K K KK G G ++++ + ++ ++ + +
Sbjct: 312 PEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGED 371
Query: 301 GVV 303
V
Sbjct: 372 SVS 374
Score = 42.3 bits (99), Expect = 2e-04
Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 11/64 (17%)
Query: 243 GEREVHEEE--------EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
G+ E EE+ + EE E+ + E K + EE EEE+ EEE
Sbjct: 277 GDDEGREEDYISDSSASGNDPEEREDKLSPE---IPAKPEIEQDEDSEESEEEKNEEEGG 333
Query: 295 EEEE 298
++
Sbjct: 334 LSKK 337
Score = 37.2 bits (86), Expect = 0.009
Identities = 17/66 (25%), Positives = 34/66 (51%), Gaps = 3/66 (4%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
+ + E EEE+ EEE K K+ K K K K G ++++ + ++ ++ + + E
Sbjct: 316 QDEDSEESEEEKNEEEGGLSKKGKKLK-KLKGK--KNGLDKDDSDSGDDSDDSDIDGEDS 372
Query: 301 GVVVSA 306
+V+A
Sbjct: 373 VSLVTA 378
Score = 35.7 bits (82), Expect = 0.026
Identities = 17/56 (30%), Positives = 32/56 (57%), Gaps = 5/56 (8%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKL-----GGGGGEEEEEEEEEEEEEEEEEEEI 299
++ E+ +EE+ K K+ K KKKL G GG+++ +E + ++ ++E EE
Sbjct: 230 DKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREED 285
Score = 34.6 bits (79), Expect = 0.059
Identities = 13/45 (28%), Positives = 25/45 (55%)
Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+E+EE E K + K+ K G +E++ +E ++ E+ +EE
Sbjct: 195 DEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEE 239
Score = 33.0 bits (75), Expect = 0.20
Identities = 13/45 (28%), Positives = 23/45 (51%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
+E+EE E E+ + LK K G + +E ++ E+ +EE
Sbjct: 195 DEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEE 239
Score = 29.9 bits (67), Expect = 2.1
Identities = 13/65 (20%), Positives = 28/65 (43%), Gaps = 3/65 (4%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
+ D K G+R ++ +E + ++ + + +E G + EE E++ E
Sbjct: 254 KLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISD---SSASGNDPEEREDKLSPEIPA 310
Query: 296 EEEIG 300
+ EI
Sbjct: 311 KPEIE 315
Score = 29.9 bits (67), Expect = 2.2
Identities = 11/58 (18%), Positives = 25/58 (43%), Gaps = 7/58 (12%)
Query: 236 ENDYCKGGEREVHEEEE-------EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEE 286
E DY ++ EE E + E + ++++ E++K GG ++ ++
Sbjct: 283 EEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKK 340
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 44.2 bits (104), Expect = 6e-05
Identities = 14/55 (25%), Positives = 22/55 (40%), Gaps = 3/55 (5%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
+ H + E + E EE++ E KE++ GG G E EE+
Sbjct: 64 QGHRQTESDTETAEESRHGE---KEERGQGGPSGSGSESVGSPTPSPSGSAEELA 115
Score = 40.0 bits (93), Expect = 0.001
Identities = 13/83 (15%), Positives = 18/83 (21%), Gaps = 9/83 (10%)
Query: 219 LDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGG 278
L P N ++ HE E +
Sbjct: 118 LSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPS---------SFLQ 168
Query: 279 GEEEEEEEEEEEEEEEEEEEIGG 301
E+ EE E E E + G
Sbjct: 169 PSHEDSPEEPEPPTSEPEPDSPG 191
Score = 40.0 bits (93), Expect = 0.001
Identities = 9/64 (14%), Positives = 20/64 (31%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
S N + HE+ EE E ++ + + + +E E +
Sbjct: 158 SPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSP 217
Query: 295 EEEE 298
++
Sbjct: 218 TPQQ 221
Score = 39.2 bits (91), Expect = 0.002
Identities = 6/57 (10%), Positives = 9/57 (15%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
G E E + G E E ++
Sbjct: 108 SGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPS 164
Score = 39.2 bits (91), Expect = 0.002
Identities = 13/103 (12%), Positives = 30/103 (29%), Gaps = 8/103 (7%)
Query: 220 DPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKA--------KETKLKEK 271
P +P + + + + + E + + E T + + + +
Sbjct: 158 SPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSP 217
Query: 272 KKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSGGW 314
++ E E+E E E E + + GW
Sbjct: 218 TPQQAPSPNTQQAVEHEDEPTEPEREGPPFPGHRSHSYTVVGW 260
Score = 38.8 bits (90), Expect = 0.002
Identities = 7/62 (11%), Positives = 13/62 (20%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
+ + GE+E + E + + G E E
Sbjct: 75 TAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASH 134
Query: 296 EE 297
Sbjct: 135 SP 136
Score = 38.8 bits (90), Expect = 0.003
Identities = 5/59 (8%), Positives = 8/59 (13%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
EE E + + E E +
Sbjct: 105 PSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQP 163
Score = 37.3 bits (86), Expect = 0.009
Identities = 7/64 (10%), Positives = 12/64 (18%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
+ + +GG E A+E + G E
Sbjct: 83 EKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPS 142
Query: 295 EEEE 298
Sbjct: 143 HPGP 146
Score = 36.5 bits (84), Expect = 0.013
Identities = 5/64 (7%), Positives = 12/64 (18%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
S + + + ++ + E+
Sbjct: 116 SGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSP 175
Query: 295 EEEE 298
EE E
Sbjct: 176 EEPE 179
Score = 36.5 bits (84), Expect = 0.015
Identities = 6/62 (9%), Positives = 13/62 (20%), Gaps = 1/62 (1%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGG-GEEEEEEEEEEEEEEEEEEEI 299
G E+ E ++ + G E E ++ +
Sbjct: 108 SGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFL 167
Query: 300 GG 301
Sbjct: 168 QP 169
Score = 36.5 bits (84), Expect = 0.016
Identities = 10/59 (16%), Positives = 13/59 (22%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
E H E+EE + E+ G E E E
Sbjct: 75 TAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPAS 133
Score = 36.5 bits (84), Expect = 0.017
Identities = 16/70 (22%), Positives = 26/70 (37%), Gaps = 5/70 (7%)
Query: 237 NDYCKGGEREVHEEEEEEEEEEEETK-AKETKLKEKKKLGGGG----GEEEEEEEEEEEE 291
G + E+ + E +T+ A+E++ EK++ G GG G E
Sbjct: 49 PAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPS 108
Query: 292 EEEEEEEIGG 301
EE G
Sbjct: 109 GSAEELASGL 118
Score = 34.9 bits (80), Expect = 0.042
Identities = 6/56 (10%), Positives = 13/56 (23%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+ E+ + E E + G + E + +E
Sbjct: 156 NPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEP 211
Score = 33.8 bits (77), Expect = 0.099
Identities = 8/65 (12%), Positives = 15/65 (23%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
+ ++ + E+ E T E G E + +E
Sbjct: 152 PESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEP 211
Query: 296 EEEIG 300
E
Sbjct: 212 GEPQS 216
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 42.2 bits (99), Expect = 8e-05
Identities = 24/63 (38%), Positives = 40/63 (63%), Gaps = 7/63 (11%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
EN+ K ++E ++EEEEEE+EEE + E E++ + E+EEEEEE+EE+ +
Sbjct: 39 ENEDVKDEKQE--DDEEEEEEDEEEIEEPEDIEDEEEIV-----EDEEEEEEDEEDNVDL 91
Query: 296 EEE 298
++
Sbjct: 92 KDI 94
Score = 34.8 bits (80), Expect = 0.026
Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 9/55 (16%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ E EEE+EEE EE E E ++ E +EEEEEE+EE+ + ++ E
Sbjct: 50 DDEEEEEEDEEEIEEPEDIEDEEEIVE---------DEEEEEEDEEDNVDLKDIE 95
Score = 31.0 bits (70), Expect = 0.52
Identities = 23/79 (29%), Positives = 38/79 (48%), Gaps = 13/79 (16%)
Query: 221 PVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGE 280
P +P+ K E+ ++E +E E+ ++E++ +E E
Sbjct: 9 PEKKPEEEKKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEE-------------E 55
Query: 281 EEEEEEEEEEEEEEEEEEI 299
EE+EEE EE E+ E+EEEI
Sbjct: 56 EEDEEEIEEPEDIEDEEEI 74
Score = 30.6 bits (69), Expect = 0.57
Identities = 26/63 (41%), Positives = 37/63 (58%), Gaps = 5/63 (7%)
Query: 242 GGEREVHEEEEEEEEEEEE-----TKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
GG E EEE+++E E + KE +KE + + E++EEEEEE+EEE EE
Sbjct: 6 GGAPEKKPEEEKKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEP 65
Query: 297 EEI 299
E+I
Sbjct: 66 EDI 68
Score = 29.8 bits (67), Expect = 1.1
Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 14/55 (25%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E + E EE E+ E+EE ++ EEEEEE+EE+ + ++ E+
Sbjct: 56 EEDEEEIEEPEDIEDEEEIVED--------------EEEEEEDEEDNVDLKDIEK 96
Score = 29.8 bits (67), Expect = 1.1
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKE 270
E E+ E+EEEEEE+EE+ + K+
Sbjct: 71 EEEIVEDEEEEEEDEEDNVDLKDIEKK 97
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 43.1 bits (102), Expect = 1e-04
Identities = 29/86 (33%), Positives = 34/86 (39%), Gaps = 35/86 (40%)
Query: 249 EEEEEEEEEEEETKAKE-----------------------------------TKLKEKKK 273
EEEEEEEEEEE + E T + ++K
Sbjct: 288 EEEEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPEQK 347
Query: 274 LGGGGGEEEEEEEEEEEEEEEEEEEI 299
E+EEEEEEEEE EE E EE
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEEG 373
Score = 42.4 bits (100), Expect = 2e-04
Identities = 22/73 (30%), Positives = 28/73 (38%), Gaps = 18/73 (24%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKE------------------KKKLGGGGGEEEEEEEE 287
E E EE + E E + L ++ +E+EEEEE
Sbjct: 301 ERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPEQKDEEEEQEDEEEEE 360
Query: 288 EEEEEEEEEEEIG 300
EEEE EE E E G
Sbjct: 361 EEEEPEEPEPEEG 373
Score = 35.4 bits (82), Expect = 0.033
Identities = 19/49 (38%), Positives = 24/49 (48%), Gaps = 19/49 (38%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+++EEEE+E+EE EEEEEEE EE E EE
Sbjct: 346 QKDEEEEQEDEE-------------------EEEEEEEPEEPEPEEGPP 375
Score = 34.3 bits (79), Expect = 0.081
Identities = 30/72 (41%), Positives = 36/72 (50%), Gaps = 15/72 (20%)
Query: 231 TIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEE 290
I ++ DY E E+ E EEEEEEEE +E +E EEEEEEE
Sbjct: 152 KILGTQKDYYVA-EVELREGEEEEEEEEVGEADEE--------------DEGEEEEEEEP 196
Query: 291 EEEEEEEEIGGV 302
EE +EEE GV
Sbjct: 197 EEVPKEEEGTGV 208
Score = 33.9 bits (78), Expect = 0.096
Identities = 21/56 (37%), Positives = 24/56 (42%), Gaps = 19/56 (33%)
Query: 240 CKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
C E +EEEE+E+EEEE EEEEE EE E EE
Sbjct: 339 CTWVNPEQKDEEEEQEDEEEE-------------------EEEEEPEEPEPEEGPP 375
Score = 31.6 bits (72), Expect = 0.65
Identities = 15/30 (50%), Positives = 16/30 (53%)
Query: 237 NDYCKGGEREVHEEEEEEEEEEEETKAKET 266
N K E E +EEEEEEEEE E E
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPEE 372
>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin).
Length = 585
Score = 43.4 bits (102), Expect = 1e-04
Identities = 20/57 (35%), Positives = 27/57 (47%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
G E E EE+EE+ +E A +T L G G + E E EE EE+E +
Sbjct: 264 GLEDEAEEEKEEKGQESRGLSAVQTYLLRWVNARGRGRSQNRAERERSEESEEKELD 320
Score = 40.7 bits (95), Expect = 8e-04
Identities = 23/61 (37%), Positives = 32/61 (52%), Gaps = 6/61 (9%)
Query: 242 GGEREVHEEEEEEEEEEEETKA------KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
GGE EE EE++ +EE K K L ++ K G G E+E EEE+EE+ +E
Sbjct: 221 GGEDWNPIEEGEEDQTQEEVKRSKERTHKGRSLPDESKRSGQLGLEDEAEEEKEEKGQES 280
Query: 296 E 296
Sbjct: 281 R 281
Score = 36.1 bits (83), Expect = 0.023
Identities = 14/58 (24%), Positives = 26/58 (44%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K E E+ +++E++T + E G EE EE++ +EE + +E
Sbjct: 188 KQKRERSDEREKSYQDDEDDTYRQNNIPYEDVVGGEDWNPIEEGEEDQTQEEVKRSKE 245
Score = 31.9 bits (72), Expect = 0.44
Identities = 13/62 (20%), Positives = 24/62 (38%), Gaps = 4/62 (6%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEK----KKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E H + E E + LK KL G ++ E +E E+ +++E++
Sbjct: 148 NSEEKHSKRTNEIVEVFYNPQSQATLKSVFEEVGKLDGQSKQKRERSDEREKSYQDDEDD 207
Query: 299 IG 300
Sbjct: 208 TY 209
Score = 27.7 bits (61), Expect = 9.0
Identities = 15/77 (19%), Positives = 24/77 (31%), Gaps = 19/77 (24%)
Query: 241 KGGEREVHEEEEEEEEEE-------------------EETKAKETKLKEKKKLGGGGGEE 281
+ R E EE EE+E + + + +K G EE
Sbjct: 301 RSQNRAERERSEESEEKELDRASPYQEIEITANLQIPPSDLIRMLRKGDGRKPRGRVEEE 360
Query: 282 EEEEEEEEEEEEEEEEE 298
E E +E+ E +
Sbjct: 361 GALEAPEALDEKRELDL 377
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 40.8 bits (96), Expect = 1e-04
Identities = 24/84 (28%), Positives = 36/84 (42%), Gaps = 19/84 (22%)
Query: 244 EREVHEEEEEEE----------EEEEET---KAKETKLKEKK------KLGGGGGEEEEE 284
+ + +E E+EE ++EE+T +AK K K+KK K G E+E
Sbjct: 56 DEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGS 115
Query: 285 EEEEEEEEEEEEEEIGGVVVSAVD 308
+ EE +EEEE E
Sbjct: 116 KSSEESSDEEEEGEEDKQEEPVEI 139
Score = 33.1 bits (76), Expect = 0.078
Identities = 19/86 (22%), Positives = 31/86 (36%), Gaps = 26/86 (30%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKL--------------------------GGGGGEEE 282
E +E+ ++E E + + K +EKK+ G EE+
Sbjct: 53 ELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEK 112
Query: 283 EEEEEEEEEEEEEEEEIGGVVVSAVD 308
E + EE +EEEE V+
Sbjct: 113 EGSKSSEESSDEEEEGEEDKQEEPVE 138
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 42.4 bits (99), Expect = 2e-04
Identities = 24/65 (36%), Positives = 29/65 (44%), Gaps = 4/65 (6%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLK----EKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
E EEEEEEEEEE E + E ++ G G + EE EEE E E G
Sbjct: 137 EDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEESVTEAEAEG 196
Query: 302 VVVSA 306
V+
Sbjct: 197 TTVAG 201
Score = 41.2 bits (96), Expect = 4e-04
Identities = 26/65 (40%), Positives = 28/65 (43%), Gaps = 8/65 (12%)
Query: 241 KGGEREVHEEEEEEEEEEEE--------TKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
K E + EEEEEEEEEEE T T E G G + EE EEE
Sbjct: 131 KEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEESVT 190
Query: 293 EEEEE 297
E E E
Sbjct: 191 EAEAE 195
Score = 37.0 bits (85), Expect = 0.009
Identities = 17/29 (58%), Positives = 19/29 (65%)
Query: 270 EKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
KK +E+EEEEEEEEEEE E EE
Sbjct: 126 GKKATKEDESDEDEEEEEEEEEEEAEVEE 154
Score = 35.8 bits (82), Expect = 0.019
Identities = 22/73 (30%), Positives = 30/73 (41%), Gaps = 20/73 (27%)
Query: 240 CKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
K G +E+E +E+EE EEEEEEEEE E EE E+
Sbjct: 120 KKAGNAGKKATKEDESDEDEE--------------------EEEEEEEEEAEVEENEQGT 159
Query: 300 GGVVVSAVDFGSG 312
G ++ + G
Sbjct: 160 NGTSTNSTEVDHG 172
Score = 30.4 bits (68), Expect = 1.1
Identities = 20/63 (31%), Positives = 25/63 (39%), Gaps = 18/63 (28%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFG 310
+ EEE EEET EEE E+ + E+EE E +S V G
Sbjct: 58 DSSEEEGEEET------------------SNEEENNEDSDGNEDEEAEAENTTLSTVTLG 99
Query: 311 SGG 313
GG
Sbjct: 100 YGG 102
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 39.5 bits (93), Expect = 2e-04
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIGG 301
EEE++EEEEEEEE++++G
Sbjct: 59 AAAAAAAAEEEKKEEEEEEEEDDDMGF 85
Score = 37.6 bits (88), Expect = 9e-04
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
EEE++EEEEEEEE++
Sbjct: 58 AAAAAAAAAEEEKKEEEEEEEEDD 81
Score = 27.2 bits (61), Expect = 3.7
Identities = 10/20 (50%), Positives = 12/20 (60%)
Query: 241 KGGEREVHEEEEEEEEEEEE 260
EE++EEEEEEEE
Sbjct: 60 AAAAAAAEEEKKEEEEEEEE 79
Score = 26.4 bits (59), Expect = 6.4
Identities = 9/12 (75%), Positives = 12/12 (100%)
Query: 249 EEEEEEEEEEEE 260
E++EEEEEEEE+
Sbjct: 69 EKKEEEEEEEED 80
Score = 26.0 bits (58), Expect = 8.5
Identities = 8/12 (66%), Positives = 12/12 (100%)
Query: 249 EEEEEEEEEEEE 260
++EEEEEEEE++
Sbjct: 70 KKEEEEEEEEDD 81
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 42.3 bits (100), Expect = 2e-04
Identities = 20/53 (37%), Positives = 33/53 (62%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E +EEEE+ +E + E + + G G GEE+EEEEE+ ++E+EE++
Sbjct: 286 EEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDD 338
Score = 41.5 bits (98), Expect = 4e-04
Identities = 20/57 (35%), Positives = 36/57 (63%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
G E EEEE+ ++E+E + +E++ + EE+EE+E+ ++E++EEEEE
Sbjct: 317 GQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEE 373
Score = 41.5 bits (98), Expect = 5e-04
Identities = 17/56 (30%), Positives = 34/56 (60%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+ E E+ ++E+EE++ E + ++ EE+E+ ++E++EEEEEEE+
Sbjct: 322 DEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKE 377
Score = 40.4 bits (95), Expect = 0.001
Identities = 21/78 (26%), Positives = 44/78 (56%)
Query: 218 DLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGG 277
DLD PD + + + E V +E+EE+++++ E + ++ L ++++
Sbjct: 301 DLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEED 360
Query: 278 GGEEEEEEEEEEEEEEEE 295
++E++EEEEEEE+E+
Sbjct: 361 EDSDDEDDEEEEEEEKEK 378
Score = 39.6 bits (93), Expect = 0.002
Identities = 22/64 (34%), Positives = 34/64 (53%), Gaps = 8/64 (12%)
Query: 251 EEEEEEEEEETKAKETKLK--EKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
EEE +EE E +LK E ++L GEEE++EEEE+ +E ++ + +
Sbjct: 260 EEELAKEEAE------RLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDN 313
Query: 309 FGSG 312
FG G
Sbjct: 314 FGLG 317
Score = 39.2 bits (92), Expect = 0.002
Identities = 20/56 (35%), Positives = 32/56 (57%), Gaps = 3/56 (5%)
Query: 248 HEEEEEEEEEEEETKAKET---KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
EEE++EEEE+ + A + + G G EE+EEEEE+ ++E+EE+
Sbjct: 284 GEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDD 339
Score = 39.2 bits (92), Expect = 0.003
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
R E++EEEE+ +E + + + G G+ EE+EEEEE+ ++E+EE
Sbjct: 283 RGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDD 339
Score = 36.1 bits (84), Expect = 0.021
Identities = 15/64 (23%), Positives = 30/64 (46%), Gaps = 13/64 (20%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
++++ + + EE+EEEEE G +EE+++++ EEEEE+
Sbjct: 302 LDDEFEPDDDDNFGLGQGEEDEEEEE-------------DGVDDEDEEDDDDDLEEEEED 348
Query: 295 EEEE 298
+
Sbjct: 349 VDLS 352
Score = 36.1 bits (84), Expect = 0.022
Identities = 19/69 (27%), Positives = 34/69 (49%), Gaps = 12/69 (17%)
Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
F+ ++D G + +EEEEE+ ++E + + + EEEEE+ + +
Sbjct: 306 FEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDD------------DDDLEEEEEDVDLSD 353
Query: 293 EEEEEEIGG 301
EEE+EE
Sbjct: 354 EEEDEEDED 362
Score = 30.4 bits (69), Expect = 1.5
Identities = 11/53 (20%), Positives = 29/53 (54%), Gaps = 8/53 (15%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE--EEI 299
++ ++++ + + + GGG +EE+EEE+ E ++ ++E +E+
Sbjct: 136 DDFDDDDLGDLASDDRAAHF------GGGEDDEEDEEEQPERKKSKKEVMKEV 182
Score = 29.2 bits (66), Expect = 4.0
Identities = 12/26 (46%), Positives = 19/26 (73%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKL 274
E++EEEEEEE+E K K++ + +L
Sbjct: 366 EDDEEEEEEEKEKKKKKSAESTRSEL 391
Score = 28.8 bits (65), Expect = 4.5
Identities = 10/30 (33%), Positives = 19/30 (63%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKK 273
+ + +E++EEEEEEE+ K K+ + +
Sbjct: 360 DEDSDDEDDEEEEEEEKEKKKKKSAESTRS 389
Score = 28.4 bits (64), Expect = 5.4
Identities = 11/48 (22%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 254 EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE--EEEI 299
++ ++++ + + GG EE+EEE+ E ++ ++E +E I
Sbjct: 136 DDFDDDDLGDLASDDRAAHFGGGEDDEEDEEEQPERKKSKKEVMKEVI 183
>gnl|CDD|110514 pfam01517, HDV_ag, Hepatitis delta virus delta antigen. The
hepatitis delta virus (HDV) encodes a single protein,
the hepatitis delta antigen (HDAg). The central region
of this protein has been shown to bind RNA. Several
interactions are also mediated by a coiled-coil region
at the N terminus of the protein.
Length = 194
Score = 41.0 bits (96), Expect = 3e-04
Identities = 25/66 (37%), Positives = 32/66 (48%), Gaps = 7/66 (10%)
Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE----EEEEEEEEEEIGGVVVSAV 307
++E + KA E K KK+L GG EEEEE EE+EE E + G V V
Sbjct: 95 DQERRDHRRRKALENK---KKQLSSGGKHLSREEEEELRRLTEEDEERERRVAGPRVGGV 151
Query: 308 DFGSGG 313
+ GG
Sbjct: 152 NPLDGG 157
>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family. This
family includes the HABP4 family of hyaluronan-binding
proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
HABP4 has been observed to bind hyaluronan (a
glucosaminoglycan), but it is not known whether this is
its primary role in vivo. It has also been observed to
bind RNA, but with a lower affinity than that for
hyaluronan. PAI-1 mRNA-binding protein specifically
binds the mRNA of type-1 plasminogen activator inhibitor
(PAI-1), and is thought to be involved in regulation of
mRNA stability. However, in both cases, the sequence
motifs predicted to be important for ligand binding are
not conserved throughout the family, so it is not known
whether members of this family share a common function.
Length = 106
Score = 39.3 bits (92), Expect = 3e-04
Identities = 17/50 (34%), Positives = 27/50 (54%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
++E E EE +E K + ++K +E EEEE EEEE++E+
Sbjct: 30 VKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKEM 79
Score = 33.2 bits (76), Expect = 0.036
Identities = 18/51 (35%), Positives = 27/51 (52%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
G + E EE+ EEE K + + + ++ G EEEE EEEE++E
Sbjct: 28 GSVKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKE 78
Score = 33.2 bits (76), Expect = 0.039
Identities = 20/51 (39%), Positives = 28/51 (54%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
G ++ E EE+ EEEE E K +++ G EEEE EEEE++E
Sbjct: 28 GSVKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKE 78
Score = 32.0 bits (73), Expect = 0.12
Identities = 20/51 (39%), Positives = 25/51 (49%), Gaps = 9/51 (17%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
E EEEE+ E EE+ +E KE EEEE EEEE++E
Sbjct: 37 LTEEQGEEEEKNEVEEKQAVEEEANKEG---------VVEEEEVEEEEDKE 78
Score = 31.2 bits (71), Expect = 0.19
Identities = 22/55 (40%), Positives = 25/55 (45%), Gaps = 17/55 (30%)
Query: 246 EVHEE--EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E+ EE EEEE+ E EE EEE +E EEEE EEEE
Sbjct: 36 ELTEEQGEEEEKNEVEEK---------------QAVEEEANKEGVVEEEEVEEEE 75
Score = 31.2 bits (71), Expect = 0.20
Identities = 15/41 (36%), Positives = 24/41 (58%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKL 274
++E + + E E ++E EEEE EE + KE L+E +K
Sbjct: 47 KNEVEEKQAVEEEANKEGVVEEEEVEEEEDKEMTLEEYEKA 87
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 41.6 bits (98), Expect = 3e-04
Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 4/68 (5%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
+ + EE +E+E+EE + + + +E++ G EEE + E+ E ++G
Sbjct: 361 RLDPIDFEEVDEDEDEEEEQRSDEHEEEE----GEDSEEEGSQSREDGSSESSSDVGSDS 416
Query: 304 VSAVDFGS 311
S D S
Sbjct: 417 ESKADKES 424
Score = 35.9 bits (83), Expect = 0.021
Identities = 12/56 (21%), Positives = 25/56 (44%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E + E+EEEE+ +E + + +E+ G E + + E + ++E
Sbjct: 368 EEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKE 423
Score = 35.5 bits (82), Expect = 0.034
Identities = 15/52 (28%), Positives = 26/52 (50%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
+ HEEEE E+ EEE ++++E E G E + ++E + + E
Sbjct: 380 QRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431
Score = 35.1 bits (81), Expect = 0.041
Identities = 12/55 (21%), Positives = 26/55 (47%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E + +E EEEE E+ E + +++ + G + E + ++E + + E
Sbjct: 377 EEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431
Score = 34.7 bits (80), Expect = 0.050
Identities = 13/56 (23%), Positives = 26/56 (46%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
+ E+ E EEEE E+ EE ++ + + G + E + ++E + + E
Sbjct: 376 EEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431
Score = 33.6 bits (77), Expect = 0.13
Identities = 14/55 (25%), Positives = 25/55 (45%), Gaps = 9/55 (16%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ + EE +E+E+ E+++ EEEE E+ EEE + E+
Sbjct: 359 RARLDPIDFEEVDEDED---------EEEEQRSDEHEEEEGEDSEEEGSQSREDG 404
Score = 32.0 bits (73), Expect = 0.35
Identities = 11/55 (20%), Positives = 22/55 (40%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E +E EEEE E+ + ++ + E + ++E + + E
Sbjct: 377 EEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431
Score = 32.0 bits (73), Expect = 0.40
Identities = 13/60 (21%), Positives = 23/60 (38%), Gaps = 5/60 (8%)
Query: 244 EREVHEEEEE-----EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E +EEEE EEEE E +E + + + E + ++E + +
Sbjct: 370 VDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSD 429
Score = 31.2 bits (71), Expect = 0.65
Identities = 11/55 (20%), Positives = 25/55 (45%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E +E EEEE E+++ + ++ +E + E + ++E + +
Sbjct: 376 EEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDS 430
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 38.8 bits (91), Expect = 4e-04
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 278 GGEEEEEEEEEEEEEEEEEEEIGG 301
EE++EEEEEEEE+EE EEE
Sbjct: 76 AAEEKKEEEEEEEEKEESEEEAAA 99
Score = 37.2 bits (87), Expect = 0.002
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 277 GGGEEEEEEEEEEEEEEEEEEEIGG 301
E++EEEEEEEE+EE EEE G
Sbjct: 76 AAEEKKEEEEEEEEKEESEEEAAAG 100
Score = 33.0 bits (76), Expect = 0.045
Identities = 12/23 (52%), Positives = 15/23 (65%)
Query: 276 GGGGEEEEEEEEEEEEEEEEEEE 298
EE++EEEEEEEE+EE
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEES 93
Score = 31.5 bits (72), Expect = 0.15
Identities = 14/22 (63%), Positives = 16/22 (72%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGG 301
EEEEEEEE+EE EEE +G
Sbjct: 82 EEEEEEEEKEESEEEAAAGLGA 103
Score = 28.8 bits (65), Expect = 1.4
Identities = 15/46 (32%), Positives = 18/46 (39%), Gaps = 19/46 (41%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
EE++EEE EEEEE+EE EEE
Sbjct: 73 AAAAAEEKKEEE-------------------EEEEEKEESEEEAAA 99
Score = 26.8 bits (60), Expect = 7.2
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKE 265
E + EEEEEEE+EE E +A
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEEAAA 99
>gnl|CDD|223017 PHA03235, PHA03235, DNA packaging protein UL33; Provisional.
Length = 409
Score = 41.3 bits (97), Expect = 4e-04
Identities = 44/182 (24%), Positives = 74/182 (40%), Gaps = 28/182 (15%)
Query: 23 CNISYNEYNRSNIPAICRNFSHEYINDNDVDAEAMERMVSIIVPIFFGIIIIL-GFFGNA 81
+ +NE N N + H +ND V E + + F ++II G N
Sbjct: 2 VLLVHNETNDEN------DLLH--VNDTCVPTEGLS--AARTTETFINLLIISVGGPLNL 51
Query: 82 LVVVVVAFNQQMR--STTNLLIINLAIADLLFIVCCVPFTATD---YVTTCWPFGDVWCK 136
+V+V ++ ST L + NL +A+LL V +PF ++ G CK
Sbjct: 52 IVLVTQLLANRVHGFSTPTLYMTNLYLANLL-TVFVLPFIMLSNQGLLSGS-VAG---CK 106
Query: 137 MYLIVVTAYAS----VYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTA 192
+ YAS T+ L++ DR+ V+H R+ K + +TW ++ +
Sbjct: 107 --FASLLYYASCTVGFATVALIAADRY-RVIHQRTRARSSAYRSTYKILGLTWFASLICS 163
Query: 193 VP 194
P
Sbjct: 164 GP 165
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 41.1 bits (96), Expect = 5e-04
Identities = 19/27 (70%), Positives = 21/27 (77%)
Query: 278 GGEEEEEEEEEEEEEEEEEEEIGGVVV 304
G + E EE+EEEEEEEEEEEEI V V
Sbjct: 223 GSDSESEEDEEEEEEEEEEEEIDVVTV 249
Score = 35.3 bits (81), Expect = 0.032
Identities = 16/21 (76%), Positives = 17/21 (80%)
Query: 277 GGGEEEEEEEEEEEEEEEEEE 297
G E EE+EEEEEEEEEEEE
Sbjct: 223 GSDSESEEDEEEEEEEEEEEE 243
Score = 30.3 bits (68), Expect = 1.2
Identities = 14/17 (82%), Positives = 14/17 (82%)
Query: 244 EREVHEEEEEEEEEEEE 260
E E EEEEEEEEEEEE
Sbjct: 227 ESEEDEEEEEEEEEEEE 243
Score = 28.4 bits (63), Expect = 4.8
Identities = 13/19 (68%), Positives = 15/19 (78%)
Query: 242 GGEREVHEEEEEEEEEEEE 260
G + E E+EEEEEEEEEE
Sbjct: 223 GSDSESEEDEEEEEEEEEE 241
>gnl|CDD|220691 pfam10323, 7TM_GPCR_Srv, Serpentine type 7TM GPCR chemoreceptor
Srv. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srv is a member of the
Srg superfamily of chemoreceptors. Chemoperception is
one of the central senses of soil nematodes like C.
elegans which are otherwise 'blind' and 'deaf'.
Length = 283
Score = 40.7 bits (96), Expect = 6e-04
Identities = 32/154 (20%), Positives = 62/154 (40%), Gaps = 35/154 (22%)
Query: 60 MVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFT 119
+ + +P++F I+I L N F ++T L++ IAD++ T
Sbjct: 4 LSIVTLPLYFLILICLLKLRN----RSKTF----KTTFYTLLLQHCIADII--------T 47
Query: 120 ATDYVTTC----WPFGDV-------WC-KMYLIVVTAYASVY----TLVLMSLDRFLAVV 163
+Y + + Y AY ++Y + +S +R+LA+
Sbjct: 48 LLNYYFGAMFPKIGWFRDFYFNYQHYYAAAYY--NIAYYTLYIRCTGISFLSTNRYLAIC 105
Query: 164 HPIASMSVRTERNAL-KAILITWIVIVLTAVPVF 196
PI+ ++ + L K ILI W+ +L ++ V
Sbjct: 106 FPISRLTKIWQSARLWKIILIYWLPGLLISLVVL 139
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 39.8 bits (93), Expect = 6e-04
Identities = 19/76 (25%), Positives = 38/76 (50%)
Query: 223 NRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEE 282
+ F + K E+E +E + +EEE++ + +E + +++ E
Sbjct: 21 EEAKANMQKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAE 80
Query: 283 EEEEEEEEEEEEEEEE 298
++E+EE +EEE+EEE
Sbjct: 81 QQEKEEIAQEEEKEEE 96
Score = 36.0 bits (83), Expect = 0.011
Identities = 20/55 (36%), Positives = 32/55 (58%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E +EEE++ E EE K KE E++ + +EE +EEE+EEE E+ ++
Sbjct: 48 EHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQ 102
Score = 28.6 bits (64), Expect = 2.8
Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 8/60 (13%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
+E+EEE+E + K + ++K E EE E+E+E E E E+I
Sbjct: 36 ADEKEEEKENSDEHVKSKEEEQK--------IEYEEAEKEKEAGEPEREDIAEQQEKEEI 87
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 40.6 bits (96), Expect = 7e-04
Identities = 20/49 (40%), Positives = 26/49 (53%), Gaps = 5/49 (10%)
Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
+++ +EE K + + EEEEEEEEEEEEEE EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAA-----AEEEEEEEEEEEEEEPSEEEAAA 323
Score = 31.0 bits (71), Expect = 0.71
Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 8/50 (16%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
+ + EE +E + + A E + +E+ EEEEEEE EEE
Sbjct: 279 ADKDALDEELKEVLSAQAQAAAAEEEEEEE--------EEEEEEEPSEEE 320
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 40.5 bits (95), Expect = 7e-04
Identities = 20/46 (43%), Positives = 30/46 (65%), Gaps = 6/46 (13%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
E E+E +EEEE + +E K +E+K ++EEE +EEEE+EE
Sbjct: 30 EVEKEVPDEEEEEEKEEKKEEEEKT------TDKEEEVDEEEEKEE 69
Score = 40.1 bits (94), Expect = 0.001
Identities = 22/55 (40%), Positives = 33/55 (60%), Gaps = 17/55 (30%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E+EV +EEEEEE+EE++ + EE+ ++EEE +EEEE+EE
Sbjct: 32 EKEVPDEEEEEEKEEKKEE-----------------EEKTTDKEEEVDEEEEKEE 69
Score = 40.1 bits (94), Expect = 0.001
Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 14/54 (25%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E E +EEEEEE+EE K +E E+ ++EEE +EEEE+EE
Sbjct: 30 EVEKEVPDEEEEEEKEEKKEEE--------------EKTTDKEEEVDEEEEKEE 69
Score = 32.8 bits (75), Expect = 0.22
Identities = 16/43 (37%), Positives = 21/43 (48%), Gaps = 6/43 (13%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEE 283
+ + EEE +EEEE+EE K K K+KE E E
Sbjct: 50 EEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTT------EWEL 86
Score = 32.0 bits (73), Expect = 0.36
Identities = 17/53 (32%), Positives = 28/53 (52%), Gaps = 5/53 (9%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
E + K ++E E+ ++EEE +E + KE K K+ KK+ +E E E
Sbjct: 39 EEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKV-----KETTTEWEL 86
Score = 29.4 bits (66), Expect = 3.0
Identities = 13/24 (54%), Positives = 18/24 (75%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVV 303
EEEEEE+EE++EEEE+ + V
Sbjct: 38 EEEEEEKEEKKEEEEKTTDKEEEV 61
Score = 29.0 bits (65), Expect = 4.3
Identities = 17/39 (43%), Positives = 23/39 (58%)
Query: 259 EETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E+ KA +++ KLG E+EE EEE+EE E E E
Sbjct: 476 EDPKAFASRIYRMIKLGLSIDEDEEVEEEDEEAEVETTE 514
Score = 28.6 bits (64), Expect = 5.2
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
E +EEEEEE+EE++EEEE VD
Sbjct: 34 EVPDEEEEEEKEEKKEEEEKTTDKEEEVD 62
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 40.8 bits (96), Expect = 8e-04
Identities = 26/60 (43%), Positives = 28/60 (46%), Gaps = 19/60 (31%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
E EEEEEEEEE EEEEEEEEEEE EEE+ V A+
Sbjct: 303 PEPEEEEEEEEE-------------------VPEEEEEEEEEEERTFEEEVRATVAEAIR 343
Score = 39.2 bits (92), Expect = 0.002
Identities = 24/63 (38%), Positives = 25/63 (39%), Gaps = 19/63 (30%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVV 304
R E EEEEE EEEE EEEEEEEEEEE V
Sbjct: 295 RRRPPPSPPEPEEEEE-------------------EEEEVPEEEEEEEEEEERTFEEEVR 335
Query: 305 SAV 307
+ V
Sbjct: 336 ATV 338
Score = 28.5 bits (64), Expect = 5.1
Identities = 16/23 (69%), Positives = 17/23 (73%)
Query: 243 GEREVHEEEEEEEEEEEETKAKE 265
E EV EEEEEEEEEEE T +E
Sbjct: 311 EEEEVPEEEEEEEEEEERTFEEE 333
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 40.2 bits (94), Expect = 0.001
Identities = 13/59 (22%), Positives = 23/59 (38%), Gaps = 1/59 (1%)
Query: 241 KGGEREVHEEEEEE-EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ + EE E+ ++EEE + + + +EEE EE E+ E
Sbjct: 176 ESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSE 234
Score = 37.5 bits (87), Expect = 0.008
Identities = 13/56 (23%), Positives = 20/56 (35%), Gaps = 6/56 (10%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ ++EEEE+ + E+ L EEE EE E+ E
Sbjct: 186 SAEDDSDDEEEEDSDSEDYSQ------YDGMLVDSSDEEEGEEAPSINYNEDTSES 235
Score = 36.0 bits (83), Expect = 0.021
Identities = 10/50 (20%), Positives = 28/50 (56%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
++ ++ +++E + + + +E + EE E++ ++EEEE+ + E
Sbjct: 153 KKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202
Score = 34.4 bits (79), Expect = 0.078
Identities = 9/57 (15%), Positives = 31/57 (54%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
++ +++ ++ +++E ++ + +E+ + EE E++ ++EEEE+ +
Sbjct: 146 AKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202
Score = 34.0 bits (78), Expect = 0.082
Identities = 16/79 (20%), Positives = 34/79 (43%), Gaps = 9/79 (11%)
Query: 230 KTIFQS-ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK--------LGGGGGE 280
K + +S K E ++ + +++ +++K KE K K E
Sbjct: 126 KNVLESIMEGLDKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEE 185
Query: 281 EEEEEEEEEEEEEEEEEEI 299
E++ ++EEEE+ + E+
Sbjct: 186 SAEDDSDDEEEEDSDSEDY 204
Score = 32.9 bits (75), Expect = 0.23
Identities = 11/51 (21%), Positives = 25/51 (49%), Gaps = 6/51 (11%)
Query: 253 EEEEEEEETKAKETKLKEKKKL----GG--GGGEEEEEEEEEEEEEEEEEE 297
+ EE +K + K L GG G E+E++++E+ + ++ ++
Sbjct: 252 DSEESSPPSKKPKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDIDPDQVVKK 302
Score = 32.1 bits (73), Expect = 0.32
Identities = 14/69 (20%), Positives = 36/69 (52%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
+++ + +++E +E + +E E + E++ ++EEEE+ + E+ + G++
Sbjct: 152 KKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGML 211
Query: 304 VSAVDFGSG 312
V + D G
Sbjct: 212 VDSSDEEEG 220
Score = 29.0 bits (65), Expect = 3.1
Identities = 13/47 (27%), Positives = 21/47 (44%)
Query: 262 KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
K K K +K K ++++EEE E E+E + E S +
Sbjct: 148 KGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDE 194
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 39.3 bits (92), Expect = 0.001
Identities = 21/59 (35%), Positives = 35/59 (59%), Gaps = 4/59 (6%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
K R+ E EEEE EE ++ + K +++++ EEE E+++EEEE +E EE+
Sbjct: 14 KQARRQQREAEEEEREERKKLEEKREGERKEEEEL----EEEREKKKEEEERKEREEQA 68
Score = 34.7 bits (80), Expect = 0.030
Identities = 19/49 (38%), Positives = 32/49 (65%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EE++ ++ +A+E + +E+KKL E +EEEE EEE E+++EE
Sbjct: 11 LEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEE 59
Score = 33.5 bits (77), Expect = 0.087
Identities = 19/56 (33%), Positives = 27/56 (48%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E ++EEEE +E EE KE + EK K EE ++ +EE E E+
Sbjct: 50 EEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGTDKLSADEESNELLEDF 105
Score = 32.7 bits (75), Expect = 0.14
Identities = 22/68 (32%), Positives = 32/68 (47%), Gaps = 18/68 (26%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE----------E 290
+RE +EEEE EEE E K +E + KE+ EE+ +E+EE E
Sbjct: 34 LEEKREGERKEEEELEEEREKKKEEEERKER--------EEQARKEQEEYEKLKSSFVVE 85
Query: 291 EEEEEEEE 298
EE ++
Sbjct: 86 EEGTDKLS 93
Score = 31.6 bits (72), Expect = 0.30
Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 14/57 (24%)
Query: 244 EREVHEEEEEE--EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
ER+ EE+ E +EEEE + +E K +EEEE +E EE+ +E+EE
Sbjct: 30 ERKKLEEKREGERKEEEELEEEREKK------------KEEEERKEREEQARKEQEE 74
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 39.7 bits (93), Expect = 0.001
Identities = 18/67 (26%), Positives = 32/67 (47%), Gaps = 8/67 (11%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE--EEEE 292
+ + ++++ E+E + K+K+++ EE+E EEEE EEE
Sbjct: 121 DSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERA------EEKEREEEEKAAEEE 174
Query: 293 EEEEEEI 299
+ EEEI
Sbjct: 175 KAREEEI 181
Score = 29.3 bits (66), Expect = 2.3
Identities = 7/54 (12%), Positives = 27/54 (50%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E E+++EE+E + + ++ + + + +++ ++++ E+E +
Sbjct: 94 EGDEDDDEEDEIKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAAL 147
Score = 28.9 bits (65), Expect = 3.3
Identities = 14/28 (50%), Positives = 19/28 (67%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGG 276
E EE+E EEEE A+E K +E++ L G
Sbjct: 157 ERAEEKEREEEEKAAEEEKAREEEILTG 184
>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex. This Mediator
complex subunit was formerly known as Srb4 in yeasts or
Trap80 in Drosophila and human. The Med17 subunit is
located within the head domain and is essential for cell
viability to the extent that a mutant strain of
cerevisiae lacking it shows all RNA polymerase
II-dependent transcription ceasing at non-permissive
temperatures.
Length = 454
Score = 40.0 bits (94), Expect = 0.001
Identities = 14/58 (24%), Positives = 31/58 (53%), Gaps = 6/58 (10%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
G R++ EE EE +E K ++ ++++ ++EE++ ++ EE ++ EE
Sbjct: 44 GSFRDLTEESLREEIAKEAAKIDFSEESDEEE------DDEEDDNDDSEENKDTVEEF 95
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 39.4 bits (92), Expect = 0.001
Identities = 18/64 (28%), Positives = 29/64 (45%), Gaps = 14/64 (21%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E D CK E + ++ +EE+ E EE + +E E+ EE E + EE
Sbjct: 14 EEDCCKENENKEEDKGKEEDLEFEEIEKEEII--------------EDSEESNEVKIEEL 59
Query: 296 EEEI 299
++E
Sbjct: 60 KDEN 63
Score = 37.8 bits (88), Expect = 0.003
Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 5/59 (8%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
+ HE EE+ +E E K ++ +E + EE E+EE E+ EE E +I +
Sbjct: 6 KDAKHENMEEDCCKENENKEEDKGKEEDLEF-----EEIEKEEIIEDSEESNEVKIEEL 59
Score = 30.9 bits (70), Expect = 0.57
Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 9/51 (17%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E ++ + E EE KE + KE E++ +EE+ E EE E+EE I
Sbjct: 4 ECKDAKHENMEEDCCKENENKE---------EDKGKEEDLEFEEIEKEEII 45
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 39.6 bits (92), Expect = 0.001
Identities = 26/71 (36%), Positives = 34/71 (47%), Gaps = 1/71 (1%)
Query: 228 YNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE 287
+NK + E + ER V EEEE + E E T E + +KK L G ++ E
Sbjct: 186 WNKAATEREKRQDEK-ERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETS 244
Query: 288 EEEEEEEEEEE 298
E EEEE E E
Sbjct: 245 ESEEEESSESE 255
Score = 31.6 bits (71), Expect = 0.49
Identities = 18/57 (31%), Positives = 28/57 (49%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVS 305
EEEE E E +E + ++ K K +K+ + + E E+E E E+I V S
Sbjct: 247 EEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQERENEKIPAVQHS 303
Score = 29.2 bits (65), Expect = 2.8
Identities = 16/65 (24%), Positives = 27/65 (41%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
QS E E +E+E+E+ + K ++ K + KK E E+E E E+
Sbjct: 239 QSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQERENEKIP 298
Query: 294 EEEEE 298
+
Sbjct: 299 AVQHS 303
>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2. A Saccharomyces
cerevisiae member of this family (PGA2) is an ER protein
which has been implicated in protein trafficking.
Length = 139
Score = 37.9 bits (88), Expect = 0.001
Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 1/63 (1%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
K G + +E E+E E EE + K+ K+ L GG EE+ ++EE+EE+
Sbjct: 36 KLGAKAQEKEHEKERAEREEAREKKAKI-SPNALRGGATAGHGEEDTDDEEDEEDFATPS 94
Query: 301 GVV 303
V
Sbjct: 95 AVP 97
Score = 31.3 bits (71), Expect = 0.30
Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 18/71 (25%)
Query: 241 KGGEREVHEEEEEEEEEEEET------------KAKETKLKEKKKLGGGGGEEEEEEEEE 288
+GG H EE+ ++EE+EE KA++ + K +KL E EE+
Sbjct: 69 RGGATAGHGEEDTDDEEDEEDFATPSAVPQWGKKARKRQRKVIRKL------LEAEEQLR 122
Query: 289 EEEEEEEEEEI 299
E++ ++E+E+I
Sbjct: 123 EDQYDDEDEDI 133
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 38.1 bits (89), Expect = 0.002
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
G E E+E++E E+K E + K K+KL + ++EE E+E+E+EE +
Sbjct: 41 GSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKA 97
Score = 37.7 bits (88), Expect = 0.002
Identities = 21/95 (22%), Positives = 43/95 (45%), Gaps = 12/95 (12%)
Query: 205 TYASELHTACVFLDLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAK 264
+ ++E + RP+G K + + + + +EE E+E+E+EE K
Sbjct: 42 STSNEENEDEDDESTAESKRPEGRKK-----AKEKLRRDKLKAKKEEAEKEKEKEERFMK 96
Query: 265 ETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EK++ E E+++ E + +EE++I
Sbjct: 97 ALAEAEKERA-------ELEKKKAEAKLMKEEKKI 124
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 36.9 bits (86), Expect = 0.002
Identities = 18/22 (81%), Positives = 18/22 (81%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGG 301
EEEEEEEEEEEEEEE EEE
Sbjct: 77 EEEEEEEEEEEEEEESEEEAMA 98
Score = 36.2 bits (84), Expect = 0.004
Identities = 18/22 (81%), Positives = 19/22 (86%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGG 301
EEEEEEEEEEEEEE EEE + G
Sbjct: 78 EEEEEEEEEEEEEESEEEAMAG 99
Score = 36.2 bits (84), Expect = 0.004
Identities = 21/36 (58%), Positives = 23/36 (63%), Gaps = 1/36 (2%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFG 310
EEEEEEEEEEEEEEEE EE + A+ FG
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEEAMAGLGAL-FG 105
Score = 34.6 bits (80), Expect = 0.012
Identities = 17/24 (70%), Positives = 17/24 (70%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
EEEEEEEEEEEEEEEE E
Sbjct: 70 AAAAAAEEEEEEEEEEEEEEEESE 93
Score = 32.7 bits (75), Expect = 0.055
Identities = 15/24 (62%), Positives = 15/24 (62%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
EEEEEEEEEEEEEEE
Sbjct: 67 AAAAAAAAAEEEEEEEEEEEEEEE 90
Score = 31.6 bits (72), Expect = 0.16
Identities = 14/24 (58%), Positives = 14/24 (58%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
EEEEEEEEEEEEEE
Sbjct: 66 AAAAAAAAAAEEEEEEEEEEEEEE 89
Score = 27.3 bits (61), Expect = 4.5
Identities = 19/41 (46%), Positives = 19/41 (46%), Gaps = 19/41 (46%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
EEEEEEEEEE EEEEEE EEE
Sbjct: 74 AAEEEEEEEEEE-------------------EEEEEESEEE 95
Score = 26.9 bits (60), Expect = 6.6
Identities = 13/24 (54%), Positives = 14/24 (58%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKE 265
EEEEEEEEEEEE + E
Sbjct: 70 AAAAAAEEEEEEEEEEEEEEEESE 93
Score = 26.5 bits (59), Expect = 9.0
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKE 265
EEEEEEEEEEEE +++E
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEE 94
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 38.6 bits (90), Expect = 0.002
Identities = 21/61 (34%), Positives = 33/61 (54%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
K ++E E EE +EEE+E +++ + K + EEE EEE+EE +E E+E
Sbjct: 53 KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEE 112
Query: 301 G 301
Sbjct: 113 K 113
Score = 37.8 bits (88), Expect = 0.003
Identities = 14/56 (25%), Positives = 26/56 (46%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ E E E++E++ KE + E++ +E E+E EE+ E E+
Sbjct: 65 VKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEK 120
Score = 37.4 bits (87), Expect = 0.005
Identities = 21/55 (38%), Positives = 32/55 (58%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E + +++E E EE + + KE E K+ G +E+EE EEE EEE+EE +
Sbjct: 50 EAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSD 104
Score = 35.9 bits (83), Expect = 0.015
Identities = 22/59 (37%), Positives = 29/59 (49%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
E EE +EEE+E E K + EEE EEE+EE +E E+E S V+
Sbjct: 61 EIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVE 119
Score = 34.3 bits (79), Expect = 0.049
Identities = 14/55 (25%), Positives = 31/55 (56%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E+E E++E++ + E + +E++ + +++ E E+E EE+ E E+E
Sbjct: 69 EKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEIT 123
Score = 33.6 bits (77), Expect = 0.097
Identities = 13/49 (26%), Positives = 27/49 (55%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E++E++ + E+E + E + +E+ + E+E EE+ E E+E
Sbjct: 76 EDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITN 124
Score = 30.5 bits (69), Expect = 0.85
Identities = 17/54 (31%), Positives = 24/54 (44%), Gaps = 6/54 (11%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E + E+E+EE EEE + E E +K E EE+ E E+E
Sbjct: 79 EDKGDAEKEDEESEEENEEEDEESSDENEK------ETEEKTESNVEKEITNPS 126
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 39.1 bits (92), Expect = 0.002
Identities = 24/56 (42%), Positives = 29/56 (51%), Gaps = 8/56 (14%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
ER +EEE+E E EEE K E +EK E EEE E E E +EE+E
Sbjct: 136 ERIERKEEEKEREREEELKILE-YQREK-------AEREEEREAERRERKEEKERE 183
Score = 35.7 bits (83), Expect = 0.028
Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 6/50 (12%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
RE E EEE E E E K + K +E +L ++EE E+E EE +E
Sbjct: 160 REKAEREEEREAERRERKEE--KEREVARL----RAQQEEAEDEREELDE 203
Score = 33.0 bits (76), Expect = 0.17
Identities = 17/56 (30%), Positives = 27/56 (48%), Gaps = 7/56 (12%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E++ + EE+EEE + +E +LK EEEE E + +EE E +
Sbjct: 28 EKKRIKAEEKEEERRIDEMMEEERLKALA-------EEEERERKRKEERREGRAVL 76
Score = 31.8 bits (73), Expect = 0.48
Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 8/63 (12%)
Query: 245 REVHEEEEEEEEEEEETKAKETK--------LKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
+ E+EE E+E E+ + K + ++EK++ EEE EE E EEE E
Sbjct: 266 EKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEELEEGERLREEEAER 325
Query: 297 EEI 299
+
Sbjct: 326 QAR 328
Score = 31.4 bits (72), Expect = 0.55
Identities = 18/51 (35%), Positives = 31/51 (60%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+EE E +E ++E + E + ++K++L E+ EE+EE +EE EEE
Sbjct: 210 QEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAE 260
Score = 31.0 bits (71), Expect = 0.72
Identities = 17/57 (29%), Positives = 29/57 (50%), Gaps = 7/57 (12%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKE--KKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E + EEEE E + K + + + ++++ EE E+ +EE EE +E E
Sbjct: 48 EEERLKALAEEEERERKRKEERREGRAVLQEQI-----EEREKRRQEEYEERLQERE 99
Score = 31.0 bits (71), Expect = 0.72
Identities = 25/64 (39%), Positives = 35/64 (54%), Gaps = 13/64 (20%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEE--------EEEEEEEEEEEEEE 295
ERE +E E +EE+E +A+E + K+KK EE E +EEE+E E EE
Sbjct: 97 EREQMDEIIERIQEEDEAEAQEKREKQKKLR-----EEIDEFNEERIERKEEEKEREREE 151
Query: 296 EEEI 299
E +I
Sbjct: 152 ELKI 155
Score = 30.6 bits (70), Expect = 1.1
Identities = 21/58 (36%), Positives = 32/58 (55%), Gaps = 3/58 (5%)
Query: 244 EREVHEE-EEEEEEEEEETKAKETKLKEKKKLGGGGGEE--EEEEEEEEEEEEEEEEE 298
ER + E EEE + E + +E K KE+++ G +E EE E+ +EE EE +E
Sbjct: 40 ERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQE 97
Score = 28.7 bits (65), Expect = 3.6
Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 14/64 (21%)
Query: 244 EREVHEEEEEEE--------EEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
ER +EE+E E EE E+ + + +L+ +EE E +E ++E+EE
Sbjct: 172 ERRERKEEKEREVARLRAQQEEAEDEREELDELRADL------YQEEYERKERQKEKEEA 225
Query: 296 EEEI 299
E+
Sbjct: 226 EKRR 229
Score = 28.7 bits (65), Expect = 3.7
Identities = 18/54 (33%), Positives = 30/54 (55%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E +E +E ++E+EE K + K + ++ E+EE +EE EEE E E +
Sbjct: 211 EEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAERERM 264
Score = 28.0 bits (63), Expect = 6.7
Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 13/63 (20%)
Query: 244 EREVHEEEEEEEEE-EEETKAKETKLKEKKKLGGGGGEEEEEEE------EEEEEEEEEE 296
E E E E +EE E E + +E + E EE +E +EE E +E +
Sbjct: 166 EEEREAERRERKEEKEREVARLRAQQEEAED------EREELDELRADLYQEEYERKERQ 219
Query: 297 EEI 299
+E
Sbjct: 220 KEK 222
Score = 28.0 bits (63), Expect = 7.9
Identities = 20/68 (29%), Positives = 28/68 (41%), Gaps = 19/68 (27%)
Query: 245 REVHEEEEEEEEEEEETKAKETKL--------------KEKKKLGGGGGEEEEEEEEEEE 290
+ E++ E+EE E+E K KE+++ E EEE EE E
Sbjct: 262 ERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRA-----AEREEELEEGE 316
Query: 291 EEEEEEEE 298
EEE E
Sbjct: 317 RLREEEAE 324
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 39.7 bits (93), Expect = 0.002
Identities = 27/62 (43%), Positives = 34/62 (54%), Gaps = 5/62 (8%)
Query: 257 EEEETKAKETK--LKEKKK--LGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDF-GS 311
EE+ +E K LKEK+ + EEEEEEEEEEEEEEE E+ + + S
Sbjct: 378 EEDVKDIEELKKFLKEKEHPVVERWAAEEEEEEEEEEEEEEEPVAEVMMMPAPEMQLPAS 437
Query: 312 GG 313
GG
Sbjct: 438 GG 439
Score = 28.5 bits (64), Expect = 5.7
Identities = 14/23 (60%), Positives = 15/23 (65%)
Query: 244 EREVHEEEEEEEEEEEETKAKET 266
ER EEEEEEEEEEEE +
Sbjct: 400 ERWAAEEEEEEEEEEEEEEEPVA 422
Score = 27.7 bits (62), Expect = 9.6
Identities = 15/37 (40%), Positives = 17/37 (45%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGG 279
E EEEEEEEEEEE + +L GG
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPAPEMQLPASGG 439
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 39.7 bits (92), Expect = 0.002
Identities = 20/58 (34%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKE--KKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E ++ EE ++ EE++ KA+E K E +KK +E EE ++ EE +++E EE
Sbjct: 1658 ENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEK 1715
Score = 37.4 bits (86), Expect = 0.011
Identities = 21/56 (37%), Positives = 34/56 (60%), Gaps = 2/56 (3%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKE--KKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E + + EE ++ EE + KA+E K E KKK+ +E EE+++ EE ++ EEE
Sbjct: 1603 EEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEE 1658
Score = 37.0 bits (85), Expect = 0.012
Identities = 21/60 (35%), Positives = 34/60 (56%), Gaps = 3/60 (5%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKET---KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
K E + + EE ++ EE+E KA E + +E KK +E EE+++ EE ++ EEE
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEE 1727
Score = 37.0 bits (85), Expect = 0.014
Identities = 23/57 (40%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLK--EKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E ++ EE ++E EE++ KA+E K EKKK+ EEE++ EE +E+E EE
Sbjct: 1727 ENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEE 1783
Score = 35.9 bits (82), Expect = 0.028
Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 4/62 (6%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE----EE 296
K E E ++ EE ++ EEE K K + +K + EE ++ EE+E++ E E
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEA 1698
Query: 297 EE 298
EE
Sbjct: 1699 EE 1700
Score = 35.9 bits (82), Expect = 0.035
Identities = 22/66 (33%), Positives = 38/66 (57%), Gaps = 4/66 (6%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
++E D K E + EE+E++ E + +A+E K E+ K E EE+++ EE ++
Sbjct: 1669 KAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELK----KKEAEEKKKAEELKKA 1724
Query: 294 EEEEEI 299
EEE +I
Sbjct: 1725 EEENKI 1730
Score = 35.5 bits (81), Expect = 0.039
Identities = 25/75 (33%), Positives = 40/75 (53%), Gaps = 10/75 (13%)
Query: 234 QSENDYCKGGEREVHEEEE-------EEEEEEEETKAKETKLKE---KKKLGGGGGEEEE 283
++E D K E E EE +++E EE+ KA+E K E K K E EE
Sbjct: 1682 KAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE 1741
Query: 284 EEEEEEEEEEEEEEE 298
++++ EE +++EEE+
Sbjct: 1742 DKKKAEEAKKDEEEK 1756
Score = 34.3 bits (78), Expect = 0.092
Identities = 18/54 (33%), Positives = 36/54 (66%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E+ ++E EE+++ EE K E + K K + EE++++ EE +++EEE+++I
Sbjct: 1706 ELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKI 1759
Score = 34.3 bits (78), Expect = 0.10
Identities = 25/68 (36%), Positives = 34/68 (50%), Gaps = 9/68 (13%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLK---EKKKLGGGGGEEE------EEEEEEEEE 291
K E E E ++E EE KA+E K K EKKK EE EE ++E EE
Sbjct: 1682 KAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE 1741
Query: 292 EEEEEEEI 299
++++ EE
Sbjct: 1742 DKKKAEEA 1749
Score = 34.0 bits (77), Expect = 0.11
Identities = 16/57 (28%), Positives = 33/57 (57%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
K E+ +E EE+++ EE KA+E + + E++++ EE ++ EE+E++
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689
Score = 33.2 bits (75), Expect = 0.20
Identities = 16/59 (27%), Positives = 32/59 (54%), Gaps = 4/59 (6%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE----EEEEEEEEE 298
E ++ E+ +++E E K K +LK+ ++ EE ++ EE+ EE ++ EE+
Sbjct: 1628 AEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED 1686
Score = 33.2 bits (75), Expect = 0.21
Identities = 21/61 (34%), Positives = 32/61 (52%), Gaps = 3/61 (4%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLK---EKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
K E + EE ++ EEE+ K ++ K K EKKK EEE + + EE ++ EE
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEE 1672
Query: 298 E 298
+
Sbjct: 1673 D 1673
Score = 32.8 bits (74), Expect = 0.27
Identities = 20/64 (31%), Positives = 35/64 (54%), Gaps = 11/64 (17%)
Query: 246 EVHEEEEE--------EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE---E 294
E+ + EEE ++E EE+ KA+E K E++ E ++ EE++++ EE
Sbjct: 1624 ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKA 1683
Query: 295 EEEE 298
EE+E
Sbjct: 1684 EEDE 1687
Score = 32.0 bits (72), Expect = 0.46
Identities = 17/57 (29%), Positives = 31/57 (54%), Gaps = 4/57 (7%)
Query: 246 EVHEEEEEEEEEEEET----KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E + EE ++ EE++ KA+E K E+ ++ EEE++ + EE ++ EE
Sbjct: 1562 EKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618
Score = 31.6 bits (71), Expect = 0.70
Identities = 20/55 (36%), Positives = 32/55 (58%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
K E + + EE +++EEE+ K K +E+KK E+E EEE +EE+E+
Sbjct: 1737 KEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEK 1791
Score = 31.3 bits (70), Expect = 0.76
Identities = 19/61 (31%), Positives = 31/61 (50%), Gaps = 2/61 (3%)
Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKETKLK--EKKKLGGGGGEEEEEEEEEEEEEEEE 295
D K E + +E +++ EE KA E K K E KK + EE ++ +E +++ E
Sbjct: 1421 DEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAE 1480
Query: 296 E 296
E
Sbjct: 1481 E 1481
Score = 31.3 bits (70), Expect = 0.90
Identities = 19/58 (32%), Positives = 34/58 (58%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K E EE + EEE++ KA+E K E+ K+ ++ EEE+++ E+ +++E E
Sbjct: 1587 KKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAE 1644
Score = 30.5 bits (68), Expect = 1.4
Identities = 15/58 (25%), Positives = 32/58 (55%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K E + +E +++ EE++ KA E K K +++ EE+++ +E +++ EE
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEE 1442
Score = 30.5 bits (68), Expect = 1.6
Identities = 15/55 (27%), Positives = 27/55 (49%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
K E + +E ++ E + KA E K E+ K + EE ++ +E ++ EE
Sbjct: 1490 KKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEE 1544
Score = 30.1 bits (67), Expect = 1.7
Identities = 19/63 (30%), Positives = 31/63 (49%), Gaps = 2/63 (3%)
Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKETKLK--EKKKLGGGGGEEEEEEEEEEEEEEEE 295
D K E + EE +++ EE KA E K K E KK + EE +++ +E ++
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAA 1506
Query: 296 EEE 298
E +
Sbjct: 1507 EAK 1509
Score = 30.1 bits (67), Expect = 2.1
Identities = 20/67 (29%), Positives = 36/67 (53%), Gaps = 8/67 (11%)
Query: 241 KGGEREVHEEEEEEEEE--------EEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
K E++ EE ++ EE+ EE KA+E +++E KL + + EE ++ EE
Sbjct: 1559 KAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618
Query: 293 EEEEEEI 299
+ + EE+
Sbjct: 1619 KIKAEEL 1625
Score = 29.7 bits (66), Expect = 2.4
Identities = 14/58 (24%), Positives = 31/58 (53%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K E + + +E ++ + KA E K K ++K +++ EE ++ +E +++ EE
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1455
Score = 29.7 bits (66), Expect = 2.4
Identities = 12/51 (23%), Positives = 25/51 (49%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
E ++ E + E E A E + E+K +EE +++ + +++ EE
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE 1389
Score = 29.7 bits (66), Expect = 2.8
Identities = 16/61 (26%), Positives = 32/61 (52%)
Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
D K + +E +++ EE+ KA E K K ++ +++ EE ++ EE +++ E
Sbjct: 1408 DELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467
Query: 298 E 298
E
Sbjct: 1468 E 1468
Score = 29.7 bits (66), Expect = 2.8
Identities = 15/56 (26%), Positives = 28/56 (50%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
K E +E +++ EE + KA E K + K ++ EE ++ +E ++ EE
Sbjct: 1477 KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEE 1532
Score = 29.3 bits (65), Expect = 3.3
Identities = 14/65 (21%), Positives = 35/65 (53%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
++ D + E + E+++EE +++ A + K +EKKK + EE++++ +E ++
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK 1412
Query: 294 EEEEE 298
+
Sbjct: 1413 AAAAK 1417
Score = 29.3 bits (65), Expect = 3.5
Identities = 17/54 (31%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLK--EKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E + +E +++ EE KA E K K E KK ++ EE ++ E + E E
Sbjct: 1300 EKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353
Score = 29.0 bits (64), Expect = 4.2
Identities = 16/53 (30%), Positives = 28/53 (52%), Gaps = 2/53 (3%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLK--EKKKLGGGGGEEEEEEEEEEEEEEEEE 296
E + +E +++ EE KA E K K E KK + E +++ +E ++ EE
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEE 1520
Score = 29.0 bits (64), Expect = 4.3
Identities = 14/58 (24%), Positives = 27/58 (46%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K E + + +++ EE KA E E + EE+ E E+++EE +++
Sbjct: 1322 KKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
Score = 29.0 bits (64), Expect = 4.4
Identities = 16/55 (29%), Positives = 31/55 (56%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ E +E E EE+ +A E K +E KK ++ EE+++ +E +++ EE+
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEED 1403
Score = 29.0 bits (64), Expect = 4.9
Identities = 13/59 (22%), Positives = 31/59 (52%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
K E + EE ++ +E + KA+E K ++ K +++ +E ++ E +++ +E
Sbjct: 1457 KKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA 1515
Score = 28.6 bits (63), Expect = 6.7
Identities = 18/59 (30%), Positives = 26/59 (44%), Gaps = 2/59 (3%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETK--LKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
K E +E +++ EE + KA K +E KK E E +E E EE+ E
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367
Score = 28.6 bits (63), Expect = 6.7
Identities = 18/66 (27%), Positives = 33/66 (50%), Gaps = 4/66 (6%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEE----EEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
++E K E + E +++ +E EE KA E K E+ K + EE+++ +E
Sbjct: 1491 KAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADE 1550
Query: 290 EEEEEE 295
++ EE
Sbjct: 1551 LKKAEE 1556
Score = 28.2 bits (62), Expect = 7.1
Identities = 12/58 (20%), Positives = 24/58 (41%)
Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
D K E + E + E E + +EK + EE +++ + +++ EE
Sbjct: 1332 DAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE 1389
Score = 28.2 bits (62), Expect = 7.2
Identities = 16/61 (26%), Positives = 33/61 (54%)
Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
D K E + +E +++ EE KA+E K K ++ +++ EE ++ +E +++ E
Sbjct: 1434 DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE 1493
Query: 298 E 298
E
Sbjct: 1494 E 1494
Score = 28.2 bits (62), Expect = 7.6
Identities = 10/54 (18%), Positives = 29/54 (53%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E +EE +++ + + KA+E K ++ K +++ +E ++ +++ +E
Sbjct: 1370 EKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEA 1423
Score = 28.2 bits (62), Expect = 7.7
Identities = 8/56 (14%), Positives = 29/56 (51%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+++ E ++ E + E +A + + ++ +++EE +++ + +++ EE
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEK 1390
Score = 27.8 bits (61), Expect = 9.6
Identities = 17/61 (27%), Positives = 29/61 (47%), Gaps = 3/61 (4%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKE---KKKLGGGGGEEEEEEEEEEEEEEEEEE 297
K E + EE +++ + + KA+E K K + E E EE+ E E+++EE
Sbjct: 1316 KADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE 1375
Query: 298 E 298
Sbjct: 1376 A 1376
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 38.2 bits (88), Expect = 0.002
Identities = 21/65 (32%), Positives = 37/65 (56%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
Q E D + E + EE EEE + +E + +EK++ EEEE++E+EE+E
Sbjct: 49 QEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERI 108
Query: 294 EEEEE 298
++++E
Sbjct: 109 QKQKE 113
Score = 35.5 bits (81), Expect = 0.015
Identities = 18/55 (32%), Positives = 32/55 (58%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E+E E+EE++ E EE K + + + +++ EEE E+EE+ + + EEE
Sbjct: 43 EQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEE 97
Score = 35.1 bits (80), Expect = 0.021
Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 4/58 (6%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ E + EEE +EEE + KE EK K E++E+EE+E ++++EE E
Sbjct: 63 RAAEERLRREEEARRQEEERAREKE----EKAKRKAEEEEKQEQEEQERIQKQKEEAE 116
Score = 32.4 bits (73), Expect = 0.16
Identities = 16/57 (28%), Positives = 30/57 (52%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
+RE E+E E+EE++ + +E K + ++ E +EEE E+EE+ +
Sbjct: 38 QREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKA 94
Score = 32.4 bits (73), Expect = 0.19
Identities = 17/54 (31%), Positives = 27/54 (50%), Gaps = 4/54 (7%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
R+ E+ E+EE+E E + ++ +E+ K EE EEE +EEE
Sbjct: 33 RQAREQREQEEQERREQEEQDRLEREELK----RRAAEERLRREEEARRQEEER 82
Score = 31.2 bits (70), Expect = 0.36
Identities = 19/58 (32%), Positives = 30/58 (51%), Gaps = 11/58 (18%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEE----EEEEEEEEEEEE 297
E E +EEE E+EE ++ + +EK+ E+EE+E ++EE E EE
Sbjct: 72 EEEARRQEEERAREKEEKAKRKAEEEEKQ-------EQEEQERIQKQKEEAEARAREE 122
Score = 31.2 bits (70), Expect = 0.37
Identities = 15/55 (27%), Positives = 27/55 (49%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+R E+ E+EE+E + ++ +L+ ++ E EEE +EEE E
Sbjct: 31 KRRQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERARE 85
Score = 30.1 bits (67), Expect = 1.0
Identities = 17/54 (31%), Positives = 28/54 (51%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
R EE E+EE+ + KA+E + +E+++ ++EE E EE E E
Sbjct: 76 RRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERMRLE 129
Score = 28.9 bits (64), Expect = 2.8
Identities = 18/68 (26%), Positives = 37/68 (54%), Gaps = 7/68 (10%)
Query: 234 QSENDYCKGGEREVHEEEEEE--EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
+ E K E E E+EE+E ++++EE +A+ + E+ +L E E+ ++ E+E
Sbjct: 86 KEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERMRL-----EREKHFQQIEQE 140
Query: 292 EEEEEEEI 299
E ++ +
Sbjct: 141 RLERKKRL 148
Score = 28.5 bits (63), Expect = 3.2
Identities = 14/54 (25%), Positives = 26/54 (48%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+E E+EE+ + + + +E + +E+++ EE E EE E E E
Sbjct: 78 QEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERMRLERE 131
Score = 28.1 bits (62), Expect = 4.7
Identities = 15/65 (23%), Positives = 30/65 (46%), Gaps = 9/65 (13%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+ E + E E +E+EE+E +++ + E + +E E E E E+ +
Sbjct: 85 EKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARARE---------EAERMRLEREKHFQ 135
Query: 294 EEEEE 298
+ E+E
Sbjct: 136 QIEQE 140
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 38.7 bits (90), Expect = 0.002
Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 5/68 (7%)
Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
K +F + D ERE +E+ EEE + KA+E K + K + EEE ++E E
Sbjct: 38 KPVFTRKKDRITIQEREREAAKEKALEEEAKRKAEERKRETLKIV-----EEEVKKELEL 92
Query: 290 EEEEEEEE 297
++ E
Sbjct: 93 KKRNTLLE 100
Score = 36.0 bits (83), Expect = 0.016
Identities = 16/34 (47%), Positives = 23/34 (67%)
Query: 265 ETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
ET++ E ++ G EEEEE EEEEE + E++ E
Sbjct: 1 ETEVLELEEEDESGEEEEEESEEEEETDSEDDME 34
Score = 36.0 bits (83), Expect = 0.019
Identities = 23/65 (35%), Positives = 28/65 (43%), Gaps = 19/65 (29%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
E EV E EEE+E EEE EEE EEEEE + E++ E + V
Sbjct: 1 ETEVLELEEEDESGEEE-------------------EEESEEEEETDSEDDMEPRLKPVF 41
Query: 304 VSAVD 308
D
Sbjct: 42 TRKKD 46
Score = 35.3 bits (81), Expect = 0.027
Identities = 24/65 (36%), Positives = 32/65 (49%), Gaps = 16/65 (24%)
Query: 244 EREVHEEEEEEEEEEEET----------KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
E E EEEEEE EEEEET K T+ K++ + +E E E +E+
Sbjct: 10 EDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITI------QEREREAAKEKAL 63
Query: 294 EEEEE 298
EEE +
Sbjct: 64 EEEAK 68
Score = 33.0 bits (75), Expect = 0.17
Identities = 19/65 (29%), Positives = 32/65 (49%)
Query: 232 IFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
+ + E + G E E EEEEE + E++ + + + +KK E E E +E+
Sbjct: 4 VLELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKAL 63
Query: 292 EEEEE 296
EEE +
Sbjct: 64 EEEAK 68
Score = 29.5 bits (66), Expect = 1.9
Identities = 22/102 (21%), Positives = 36/102 (35%), Gaps = 28/102 (27%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEE---------------------ETKAKETKLKEKKK 273
+ + GE E E EEEEE + E E + + K K ++
Sbjct: 6 ELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEE 65
Query: 274 LGGGGGEEEE-------EEEEEEEEEEEEEEEIGGVVVSAVD 308
EE + EEE ++E E ++ + + VD
Sbjct: 66 EAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVD 107
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 38.7 bits (90), Expect = 0.002
Identities = 23/72 (31%), Positives = 37/72 (51%), Gaps = 7/72 (9%)
Query: 227 GYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEE 286
Y K + G + + ++ ++EE+++ + KE K E+EEEEE
Sbjct: 221 YYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELK-------EDEEEEE 273
Query: 287 EEEEEEEEEEEE 298
EEEEEEE+E+E
Sbjct: 274 TEEEEEEEDEDE 285
Score = 36.7 bits (85), Expect = 0.010
Identities = 23/81 (28%), Positives = 34/81 (41%), Gaps = 6/81 (7%)
Query: 221 PVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLK--EKKKLGGGG 278
PD E + E+E + +++ + + K K E G
Sbjct: 204 TNGLPDSSQDKNKSLE----EYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDY 259
Query: 279 GEEEEEEEEEEEEEEEEEEEI 299
EE+E +E+EEEEE EEEEE
Sbjct: 260 VEEKELKEDEEEEETEEEEEE 280
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 37.7 bits (88), Expect = 0.002
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 20/60 (33%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
G+ V EE+E +++EE+ EEEE++EE+++E+E EEE V
Sbjct: 103 GQHLVASEEDESDDDEED--------------------EEEEDDEEDDDEDESEEEESPV 142
Score = 33.1 bits (76), Expect = 0.082
Identities = 13/38 (34%), Positives = 26/38 (68%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKK 272
SE D E + EE++EE+++E+E++ +E+ +K+ K
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146
Score = 30.8 bits (70), Expect = 0.53
Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 19/60 (31%)
Query: 232 IFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
+ S E E ++EE+EEEE++E E+++E+E EEEE
Sbjct: 99 VHISGQHLVASEEDESDDDEEDEEEEDDE-------------------EDDDEDESEEEE 139
>gnl|CDD|220692 pfam10324, 7TM_GPCR_Srw, Serpentine type 7TM GPCR chemoreceptor
Srw. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srw is a solo family
amongst the superfamilies of chemoreceptors.
Chemoperception is one of the central senses of soil
nematodes like C. elegans which are otherwise 'blind'
and 'deaf'. The genes encoding Srw do not appear to be
under as strong an adaptive evolutionary pressure as
those of Srz.
Length = 317
Score = 38.7 bits (91), Expect = 0.003
Identities = 32/137 (23%), Positives = 63/137 (45%), Gaps = 21/137 (15%)
Query: 71 IIIILGFFGNALVVVVVAFNQQMR-STTNLLIINLAIADLLFIVCCV-----PFTATDYV 124
I+ I+GF N + +++ + MR S+ N+++I +AI D++ ++ + F +
Sbjct: 5 ILSIIGFIIN-IFHLIILTRKSMRTSSINIIMIGIAICDIITMLLTIYNFIPEFIISYEN 63
Query: 125 TTCWPF---GDVWCKMYLIVVTAYA---SVYTLVLMSLDRFLAVVHPIASMSVRTER--- 175
+ C P V L + + S + V M+L R L V +P MS + ++
Sbjct: 64 SECIPPDSYLKVLLDWILESLQDISRRCSTWLGVFMALIRTLVVKNP---MSNKIQKLSK 120
Query: 176 --NALKAILITWIVIVL 190
L I+I +I+ +
Sbjct: 121 PKFGLIIIIIVFILSLP 137
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 38.6 bits (90), Expect = 0.003
Identities = 18/54 (33%), Positives = 28/54 (51%), Gaps = 8/54 (14%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
++EEEEEEEE+E K L G EE+ ++E+ E++EE+
Sbjct: 177 FRREKKEEEEEEEEDE--------ALIKSLSFGPETEEDRRRADDEDSEDDEED 222
Score = 34.3 bits (79), Expect = 0.062
Identities = 22/93 (23%), Positives = 36/93 (38%), Gaps = 24/93 (25%)
Query: 233 FQSENDYCKGGEREVHEEEEEEE--------------EEEEETKAKETKLKEKK------ 272
++++ + ER E EEE + E E + +LKE +
Sbjct: 108 YEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQSRRADV 167
Query: 273 ----KLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
L E++EEEEEEEE+E + +
Sbjct: 168 DVNSMLEALFRREKKEEEEEEEEDEALIKSLSF 200
Score = 28.9 bits (65), Expect = 3.2
Identities = 18/60 (30%), Positives = 27/60 (45%), Gaps = 7/60 (11%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEK--KKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
G R ++ +EE+EE K +E +L KKL E + + E E E EE+
Sbjct: 103 GATRNYEADKLDEEQEERVEKEREEELAGDAMKKL-----ENRTADSKREMEVLERLEEL 157
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 38.5 bits (89), Expect = 0.003
Identities = 22/54 (40%), Positives = 32/54 (59%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
EE+EE EE EE + E K ++K EE ++EE+E E EEEE+ + G +
Sbjct: 114 EEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSL 167
Score = 38.5 bits (89), Expect = 0.003
Identities = 23/64 (35%), Positives = 31/64 (48%), Gaps = 9/64 (14%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
+EN+ + E+E EE EE EE E E K + EE ++EE+E E
Sbjct: 106 AENETVEEEEKEESREEREEVEETEGVTKSEQKNDW---------RDAEECQKEEKEPEP 156
Query: 295 EEEE 298
EEEE
Sbjct: 157 EEEE 160
Score = 38.1 bits (88), Expect = 0.005
Identities = 20/58 (34%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ + E EE EE EE E TK+++ + EE ++EE+E E EEEE+
Sbjct: 113 EEEKEESREEREEVEETEGVTKSEQKNDWR---------DAEECQKEEKEPEPEEEEK 161
Score = 34.6 bits (79), Expect = 0.068
Identities = 16/55 (29%), Positives = 31/55 (56%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E E EEEE+EE++ + +++E + + + + + EE ++EE+E E
Sbjct: 102 EDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEP 156
Score = 33.9 bits (77), Expect = 0.093
Identities = 14/60 (23%), Positives = 29/60 (48%)
Query: 239 YCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ R + E E EEE K + + +E+ + G + E++ + + EE ++EE+
Sbjct: 93 LSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEK 152
Score = 33.9 bits (77), Expect = 0.10
Identities = 15/55 (27%), Positives = 26/55 (47%), Gaps = 7/55 (12%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
++ EE + EEEE+ + +E ++ + EEEE+ +EE E E
Sbjct: 219 LKKKREERRKVLEEEEQRRKQEEADRKSR-------EEEEKRRLKEEIERRRAEA 266
Score = 32.7 bits (74), Expect = 0.25
Identities = 15/50 (30%), Positives = 24/50 (48%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E EE +++ EE K E + + +K+ EEEE+ +EE E
Sbjct: 215 ELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRA 264
Score = 31.9 bits (72), Expect = 0.42
Identities = 19/60 (31%), Positives = 27/60 (45%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
E E ++E + + E + ++ G EEEE+EE EE EE EE GV
Sbjct: 73 RFSEALERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGV 132
Score = 31.9 bits (72), Expect = 0.49
Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 4/57 (7%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
ER EEEE+ ++EE K + +EK++L EE E E E+ ++ E G
Sbjct: 225 ERRKVLEEEEQRRKQEEADRKSREEEEKRRL----KEEIERRRAEAAEKRQKVPEDG 277
Score = 30.4 bits (68), Expect = 1.3
Identities = 13/53 (24%), Positives = 24/53 (45%), Gaps = 8/53 (15%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
E EEEE+ ++E ++++ +E+K+ +EE E E E
Sbjct: 224 EERRKVLEEEEQRRKQEEADRKSREEEEKR--------RLKEEIERRRAEAAE 268
Score = 30.0 bits (67), Expect = 1.6
Identities = 19/58 (32%), Positives = 25/58 (43%), Gaps = 5/58 (8%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
E E EE E + E K +E +K E+E E EEEE+ + EE G
Sbjct: 120 REEREEVEETEGVTKSEQKNDWRDAEECQK-----EEKEPEPEEEEKPKRGSLEENNG 172
Score = 28.5 bits (63), Expect = 5.7
Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 7/68 (10%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAK-ETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
+EN + +GG E +E E+ ++ + + +L+E KK + EE + EEEE
Sbjct: 181 HTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKK------KREERRKVLEEEE 234
Query: 293 EEEEEEIG 300
+ ++E
Sbjct: 235 QRRKQEEA 242
Score = 27.7 bits (61), Expect = 8.5
Identities = 16/57 (28%), Positives = 24/57 (42%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+ RE +E E E + + K T + +E+ E E EEEE+EE
Sbjct: 62 RLARREERRDERFSEALERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEE 118
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 37.3 bits (87), Expect = 0.003
Identities = 17/60 (28%), Positives = 29/60 (48%), Gaps = 8/60 (13%)
Query: 240 CKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+ + EEEEE E EE + + + +L EK E + + E+ E E +++EI
Sbjct: 95 GLDKKEKEEEEEEEVEVEELDEEEQIDELLEK--------ELAKLKREKRRENERKQKEI 146
Score = 33.8 bits (78), Expect = 0.041
Identities = 18/33 (54%), Positives = 22/33 (66%)
Query: 267 KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+ K +K LG E+EEEEEEE E EE +EEE
Sbjct: 87 RKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQ 119
Score = 33.4 bits (77), Expect = 0.066
Identities = 17/38 (44%), Positives = 24/38 (63%), Gaps = 6/38 (15%)
Query: 262 KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
K ++ +KK+ +EEEEEEE E EE +EEE+I
Sbjct: 89 KVRKLLGLDKKE------KEEEEEEEVEVEELDEEEQI 120
Score = 31.1 bits (71), Expect = 0.35
Identities = 13/35 (37%), Positives = 19/35 (54%)
Query: 264 KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K L +K+K E E EE +EEE+ +E E+
Sbjct: 92 KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEK 126
Score = 29.6 bits (67), Expect = 1.3
Identities = 13/37 (35%), Positives = 19/37 (51%)
Query: 262 KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K KEK++ E EE +EEE+ +E E+E
Sbjct: 92 KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKEL 128
Score = 28.8 bits (65), Expect = 2.0
Identities = 16/31 (51%), Positives = 22/31 (70%)
Query: 268 LKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
LK +KK+ G +++E+EEEEEEE E EE
Sbjct: 84 LKWRKKVRKLLGLDKKEKEEEEEEEVEVEEL 114
Score = 27.3 bits (61), Expect = 7.6
Identities = 9/31 (29%), Positives = 18/31 (58%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKL 274
E ++ E E+E + + K +E + K+K+ L
Sbjct: 117 EEQIDELLEKELAKLKREKRRENERKQKEIL 147
>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
Length = 155
Score = 37.0 bits (86), Expect = 0.003
Identities = 10/49 (20%), Positives = 12/49 (24%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E + T T K+K E E EEE
Sbjct: 107 NAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEEEA 155
Score = 35.4 bits (82), Expect = 0.012
Identities = 9/53 (16%), Positives = 13/53 (24%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E+ E + +A K K+ E EEE
Sbjct: 103 LELFNAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEEEA 155
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 38.7 bits (90), Expect = 0.003
Identities = 20/66 (30%), Positives = 42/66 (63%)
Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
++E+ + E+E +EE+++++E+ + + K+ K KE+ K E+E+E+E++ EE
Sbjct: 98 PKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157
Query: 293 EEEEEE 298
+ EEE
Sbjct: 158 RDREEE 163
Score = 38.3 bits (89), Expect = 0.004
Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 5/50 (10%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+E + E +EE K KE +EKKK +E+ +EE ++ + +EE +E
Sbjct: 95 AKEPKNESGKEEEKEKEQVKEEKKK-----KKEKPKEEPKDRKPKEEAKE 139
Score = 36.8 bits (85), Expect = 0.014
Identities = 16/69 (23%), Positives = 30/69 (43%), Gaps = 11/69 (15%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKET-----------KLKEKKKLGGGGGEEEEEEEEEE 289
+ + E +++E+ +EE ++ K KE K KEKK EEE++ E
Sbjct: 111 EQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVR 170
Query: 290 EEEEEEEEE 298
+ ++
Sbjct: 171 AKSRPKKPP 179
Score = 34.1 bits (78), Expect = 0.100
Identities = 12/41 (29%), Positives = 26/41 (63%)
Query: 258 EEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ A +TK ++ K G EE+E+E+ +EE+++++E+
Sbjct: 84 GSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKP 124
Score = 32.6 bits (74), Expect = 0.27
Identities = 12/42 (28%), Positives = 24/42 (57%)
Query: 257 EEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ AK KE K G E+E+E+ +EE+++++E+ +
Sbjct: 84 GSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPK 125
Score = 32.6 bits (74), Expect = 0.29
Identities = 16/64 (25%), Positives = 31/64 (48%), Gaps = 2/64 (3%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
E + K E++V E + EEE++ E +++ K+ K ++E EEE++ +
Sbjct: 143 PKEKE--KEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAA 200
Query: 294 EEEE 297
E
Sbjct: 201 REAV 204
Score = 30.6 bits (69), Expect = 1.2
Identities = 13/59 (22%), Positives = 22/59 (37%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
R+ EE++ E + K K K K EE++ + E + + EE
Sbjct: 154 VEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPD 212
Score = 29.9 bits (67), Expect = 2.3
Identities = 14/66 (21%), Positives = 26/66 (39%), Gaps = 10/66 (15%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGG----------GGEEEEEEEEEEEEEE 293
E E E + ++ K+ K+K+ G+ EE + EE E+E
Sbjct: 161 EEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKE 220
Query: 294 EEEEEI 299
E++ +
Sbjct: 221 EDDGKD 226
Score = 28.3 bits (63), Expect = 6.0
Identities = 11/48 (22%), Positives = 22/48 (45%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E + +E + ++ K E + E +EEE+E+E+ +E
Sbjct: 68 ESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKE 115
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 36.1 bits (83), Expect = 0.004
Identities = 25/66 (37%), Positives = 31/66 (46%), Gaps = 19/66 (28%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
G +E +E EEEEE +E+ +EEEE E EEEE EEEEE G
Sbjct: 43 GAQEGDDEMEEEEEVDED-------------------DEEEEGEGEEEEGEEEEETEGAT 83
Query: 303 VVSAVD 308
A +
Sbjct: 84 GKRAAE 89
Score = 34.6 bits (79), Expect = 0.012
Identities = 20/49 (40%), Positives = 31/49 (63%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+E++E EE+E K E ++ G G++E EEEEE +E++EEEE
Sbjct: 17 KEKKEVVEEKENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEE 65
Score = 33.8 bits (77), Expect = 0.028
Identities = 25/64 (39%), Positives = 33/64 (51%), Gaps = 22/64 (34%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE-------EE 294
G+ E+ EEEE +E++EEE G GEEEE EEEEE E E+
Sbjct: 46 EGDDEMEEEEEVDEDDEEEE---------------GEGEEEEGEEEEETEGATGKRAAED 90
Query: 295 EEEE 298
EE++
Sbjct: 91 EEDD 94
Score = 32.6 bits (74), Expect = 0.062
Identities = 18/51 (35%), Positives = 28/51 (54%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
+E++E EE+ K + G E ++E EEEEE +E++EEE G
Sbjct: 17 KEKKEVVEEKENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGE 67
Score = 31.9 bits (72), Expect = 0.12
Identities = 19/49 (38%), Positives = 26/49 (53%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E EE+E + A + +E G EEEEE +E++EEEE E E
Sbjct: 21 EVVEEKENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGE 69
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 38.7 bits (90), Expect = 0.004
Identities = 17/50 (34%), Positives = 28/50 (56%), Gaps = 5/50 (10%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E EEE+E++ EET E + + E E E + E+ EE+E+E++
Sbjct: 646 EGEEEDEDDLEETDDDEDECEA-----IEDSESESESDGEDGEEDEQEDD 690
Score = 36.8 bits (85), Expect = 0.016
Identities = 14/47 (29%), Positives = 27/47 (57%)
Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E EEE+E++ + + E + + E E + E+ EE+E+E++ E
Sbjct: 646 EGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAE 692
Score = 29.8 bits (67), Expect = 2.4
Identities = 20/66 (30%), Positives = 34/66 (51%), Gaps = 3/66 (4%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
GE E ++ EE +++E+E +A E E + G + EE+E+E++ E E I
Sbjct: 647 GEEEDEDDLEETDDDEDECEAIEDSESESESDGE---DGEEDEQEDDAEANEGVVPIDKA 703
Query: 303 VVSAVD 308
V A+
Sbjct: 704 VRRALP 709
Score = 28.3 bits (63), Expect = 6.0
Identities = 16/63 (25%), Positives = 26/63 (41%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+SE+D G E E ++ E E KA L + L + E+EE ++E+
Sbjct: 674 ESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDEEGMDDEQ 733
Query: 294 EEE 296
Sbjct: 734 MMR 736
Score = 28.3 bits (63), Expect = 6.1
Identities = 13/36 (36%), Positives = 20/36 (55%), Gaps = 2/36 (5%)
Query: 267 KLKEKKK--LGGGGGEEEEEEEEEEEEEEEEEEEIG 300
K E K GEEE+E++ EE +++E+E E
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAI 668
Score = 27.9 bits (62), Expect = 9.2
Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 13/77 (16%)
Query: 236 ENDYCKGGEREVHEEEE-----EEEEEEEETKAKETKL--------KEKKKLGGGGGEEE 282
+ D C+ E E E EE+E+E++ +A E + K L +
Sbjct: 661 DEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDG 720
Query: 283 EEEEEEEEEEEEEEEEI 299
+ E+EE ++E+ +
Sbjct: 721 GDSEDEEGMDDEQMMRL 737
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 36.2 bits (84), Expect = 0.004
Identities = 13/27 (48%), Positives = 15/27 (55%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIGG 301
G E +E EEEE+EEE EEE
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEEESDDD 102
Score = 35.5 bits (82), Expect = 0.007
Identities = 12/33 (36%), Positives = 15/33 (45%)
Query: 269 KEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
E +E EEEE+EEE EEE +
Sbjct: 71 AAAAAGAEAAAEADEAEEEEKEEEAEEESDDDM 103
Score = 34.3 bits (79), Expect = 0.016
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 276 GGGGEEEEEEEEEEEEEEEEEEEIGG 301
+E EEEE+EEE EEE ++++
Sbjct: 80 AAEADEAEEEEKEEEAEEESDDDMLF 105
Score = 34.3 bits (79), Expect = 0.017
Identities = 12/27 (44%), Positives = 17/27 (62%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIGG 301
E EEEE+EEE EEE +++ + G
Sbjct: 80 AAEADEAEEEEKEEEAEEESDDDMLFG 106
Score = 28.1 bits (63), Expect = 2.1
Identities = 11/33 (33%), Positives = 16/33 (48%)
Query: 263 AKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
A + EEEE+EEE EEE +++
Sbjct: 70 AAAAAAGAEAAAEADEAEEEEKEEEAEEESDDD 102
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 37.7 bits (87), Expect = 0.004
Identities = 23/66 (34%), Positives = 28/66 (42%)
Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
E GE EEEEEE E ++ E +++ GEEEE EEE
Sbjct: 192 EDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLF 251
Query: 293 EEEEEE 298
E E EE
Sbjct: 252 EIEWEE 257
Score = 36.1 bits (83), Expect = 0.017
Identities = 21/66 (31%), Positives = 32/66 (48%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
+ND + +V EEEE EE + + +E + G G E EEEEEE E ++
Sbjct: 163 DNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDG 222
Query: 296 EEEIGG 301
E+ +
Sbjct: 223 EDVVDY 228
Score = 34.6 bits (79), Expect = 0.051
Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 13/73 (17%)
Query: 243 GEREVHEEEEEEEEEEEETKAKE--------TKLKEKKKLGGGGGEEEEEEEEEEE---- 290
E + E + EEEE+EE + + +E++ G GE+ + E E
Sbjct: 178 EEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQ 237
Query: 291 -EEEEEEEEIGGV 302
EEEE EEE+ +
Sbjct: 238 GEEEEMEEEVINL 250
Score = 31.5 bits (71), Expect = 0.47
Identities = 13/33 (39%), Positives = 17/33 (51%)
Query: 276 GGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
EEEE EE + EEEE+EE+G +
Sbjct: 173 DVDNEEEERLEESDGREEEEDEEVGSDSYGEGN 205
Score = 28.4 bits (63), Expect = 4.2
Identities = 12/49 (24%), Positives = 18/49 (36%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E+ ++ + EEE EE + EEEE+EE
Sbjct: 147 EKPRVTRFNIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEE 195
Score = 28.4 bits (63), Expect = 5.3
Identities = 23/67 (34%), Positives = 30/67 (44%), Gaps = 11/67 (16%)
Query: 243 GEREVHEEEEEEEE-----------EEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
G RE++EEEEEE E E E K+ + +E ++ E E EEE EE
Sbjct: 204 GNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEE 263
Query: 292 EEEEEEE 298
EE
Sbjct: 264 VPRNNEE 270
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 38.1 bits (88), Expect = 0.005
Identities = 18/63 (28%), Positives = 33/63 (52%), Gaps = 12/63 (19%)
Query: 248 HEEEEEEEEEEE------ETKAKET------KLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
HE E E+++ + K K T K +KLG EE EEE+++EE++++
Sbjct: 3 HEFASSELEDDDWVRGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDD 62
Query: 296 EEE 298
+++
Sbjct: 63 DDD 65
Score = 33.4 bits (76), Expect = 0.16
Identities = 14/72 (19%), Positives = 35/72 (48%), Gaps = 14/72 (19%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
G E E EEE+++EE++ + E+E++++++++E++E+E++
Sbjct: 41 LGKEAEEEAMEEEDDDEEDDDDDDD--------------EDEDDDDDDDDEDDEDEDDDD 86
Query: 301 GVVVSAVDFGSG 312
+ G
Sbjct: 87 STLHDDSSADDG 98
Score = 33.4 bits (76), Expect = 0.16
Identities = 15/74 (20%), Positives = 38/74 (51%), Gaps = 11/74 (14%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVV 304
R++ +E EEE EEE+ ++ +++E+E++++++++E++E+
Sbjct: 39 RKLGKEAEEEAMEEEDDDEED-----------DDDDDDEDEDDDDDDDDEDDEDEDDDDS 87
Query: 305 SAVDFGSGGWWFES 318
+ D S E+
Sbjct: 88 TLHDDSSADDGNET 101
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 38.1 bits (89), Expect = 0.005
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E EEE +EEE + K ++K G GE+E E ++ ++E + E +E
Sbjct: 400 ELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKE 449
Score = 38.1 bits (89), Expect = 0.006
Identities = 20/72 (27%), Positives = 33/72 (45%), Gaps = 11/72 (15%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE-----------EEE 290
E E + EEE+E+++E+ E +KKLG E++EE E
Sbjct: 326 EDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAEAR 385
Query: 291 EEEEEEEEIGGV 302
++EE + EI +
Sbjct: 386 KKEENDAEIEEL 397
Score = 37.0 bits (86), Expect = 0.012
Identities = 15/55 (27%), Positives = 28/55 (50%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E+ E E EEE +EE + +K ++ G E+E E ++ ++E + E +
Sbjct: 394 IEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFK 448
Score = 37.0 bits (86), Expect = 0.013
Identities = 17/78 (21%), Positives = 37/78 (47%), Gaps = 11/78 (14%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEE---------- 284
S ++ + + + EE+E+++E++++ + K+ KL G +EE
Sbjct: 322 SVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQR 381
Query: 285 -EEEEEEEEEEEEEEIGG 301
E ++EE + E EE+
Sbjct: 382 AEARKKEENDAEIEELRR 399
Score = 35.8 bits (83), Expect = 0.030
Identities = 19/47 (40%), Positives = 27/47 (57%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEE 290
+ + +EEEE E+EEE K ++ K K+ EEEEEE +EE
Sbjct: 448 KEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEEN 494
Score = 34.6 bits (80), Expect = 0.076
Identities = 24/74 (32%), Positives = 37/74 (50%), Gaps = 8/74 (10%)
Query: 243 GEREVHEEEEEEEEEEEETK--------AKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
GE E EEE EE ++ + KE + K+ KK +E++E +EEEE E+E
Sbjct: 403 GEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDE 462
Query: 295 EEEEIGGVVVSAVD 308
EE ++ V +
Sbjct: 463 EEAKVEKVANKLLK 476
Score = 32.7 bits (75), Expect = 0.24
Identities = 16/47 (34%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
EE + EE + +L+ +K G EE+E+E+ + EEE+E+++E
Sbjct: 298 EEARKALEEQLRQGEELR-RKIEGKSVSEEDEDEDSDSEEEDEDDDE 343
Score = 31.2 bits (71), Expect = 0.72
Identities = 17/61 (27%), Positives = 24/61 (39%), Gaps = 4/61 (6%)
Query: 243 GEREVHEEEEEEEEEEEETK----AKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ E++EE K A+ K +E E E EEE +EEE EE +
Sbjct: 358 LGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSK 417
Query: 299 I 299
Sbjct: 418 K 418
Score = 30.8 bits (70), Expect = 1.1
Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 8/50 (16%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
+ E ++EE A E ++L E EEE +EEE EE ++ +G
Sbjct: 380 QRAEARKKEENDA------EIEELRREL--EGEEESDEEENEEPSKKNVG 421
Score = 29.3 bits (66), Expect = 3.6
Identities = 17/49 (34%), Positives = 28/49 (57%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+EE+E++EE + K K+ G E E+E++E EEE+ +EI
Sbjct: 563 DEEDEDDEELPFLFKQKDLIKEAFAGDDVVAEFEKEKKEVIEEEDPKEI 611
>gnl|CDD|220688 pfam10317, 7TM_GPCR_Srd, Serpentine type 7TM GPCR chemoreceptor
Srd. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srd is part of the
larger Str superfamily of chemoreceptors.
Chemoperception is one of the central senses of soil
nematodes like C. elegans which are otherwise 'blind'
and 'deaf'.
Length = 293
Score = 37.5 bits (88), Expect = 0.005
Identities = 29/151 (19%), Positives = 57/151 (37%), Gaps = 32/151 (21%)
Query: 61 VSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNL-----LIINLAIADLLFIVCC 115
SI PIFF ILG N L++ ++ F +S +L L++N A+ ++ +
Sbjct: 1 FSIYYPIFF----ILGLILNLLLLYLIIF----KSPKSLRTYRILLLNTALTQIISCILA 52
Query: 116 VPFTATDYVTTCWP-----------FGDVWCK-MYLIVVTAY-ASVYTLVLMSLDRFLAV 162
FT + FG +C Y +++ S ++L+L R+ +
Sbjct: 53 F-FTQQRIIPNGVSLALISYGPCKYFGPWFCYSGYSLLLHFLLHSGWSLLLTFYYRYYIL 111
Query: 163 VHPIASMSVRTERNALKAILITWIVIVLTAV 193
H + + + +I + +
Sbjct: 112 KHV-----DPKRKKLILIFFLHYIPSLSQLL 137
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 37.9 bits (88), Expect = 0.005
Identities = 23/125 (18%), Positives = 55/125 (44%), Gaps = 15/125 (12%)
Query: 178 LKAILITWIVIVLTAVPVFNMHGEVSYTYASELHTACVFLDLDPVNRPDGYNKTIFQSEN 237
L ++L+ +++ L + + E+ A + +DP N+ I Q +
Sbjct: 5 LLSLLLHILLLGLLILGSLYHSVKPEPGGGGEIIQAVL---VDPGAVAQQANR-IQQQKK 60
Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKE----TKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
E+E ++ E++ EE E+ +A E +L+++ E+ ++ E+ ++
Sbjct: 61 P-AAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAA------AEKAAKQAEQAAKQ 113
Query: 294 EEEEE 298
EE++
Sbjct: 114 AEEKQ 118
Score = 31.7 bits (72), Expect = 0.42
Identities = 17/55 (30%), Positives = 27/55 (49%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E +EE +++ EEE KAK +KK E E + + E + + + EE
Sbjct: 139 EAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEE 193
Score = 30.6 bits (69), Expect = 1.2
Identities = 17/58 (29%), Positives = 28/58 (48%), Gaps = 6/58 (10%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K E + + EE + ++ E KAK EKK +EE +++ EEE + + E
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKA------KEEAKKQAEEEAKAKAAAE 163
Score = 30.2 bits (68), Expect = 1.4
Identities = 14/53 (26%), Positives = 28/53 (52%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+ EE++++ EE + +A E K K + + EE +++ EEE + + E
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAE 163
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 36.9 bits (86), Expect = 0.006
Identities = 17/51 (33%), Positives = 26/51 (50%), Gaps = 5/51 (9%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+EEEEE E + + +KKL +E +EE +EEE++ EE
Sbjct: 19 SPKEEEEEALELAIQEAHENALEKKL-----LDELDEELDEEEDDRFLEEY 64
Score = 35.7 bits (83), Expect = 0.014
Identities = 16/55 (29%), Positives = 24/55 (43%), Gaps = 7/55 (12%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ +EEEEE E + EKK +E +EE +EEE++ E
Sbjct: 15 PKPPSPKEEEEEALELAIQEAHENALEKK-------LLDELDEELDEEEDDRFLE 62
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 38.1 bits (88), Expect = 0.006
Identities = 23/88 (26%), Positives = 43/88 (48%), Gaps = 24/88 (27%)
Query: 218 DLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGG 277
DLD + + + +E D + +++E+E+E++++
Sbjct: 126 DLDMIIIKRRRARHL--AEEDMSPRDNFVIDDDDEDEDEDDDD----------------- 166
Query: 278 GGEEEEEEEEEEEE----EEEEEEEIGG 301
E++EEEEEEEEE ++E+EE+ GG
Sbjct: 167 -EEDDEEEEEEEEEIKGFDDEDEEDEGG 193
Score = 33.1 bits (75), Expect = 0.22
Identities = 15/50 (30%), Positives = 30/50 (60%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
E E ++++EE++EEEE + +E K + + GGE+ E+ E ++ +
Sbjct: 158 EDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTD 207
Score = 31.2 bits (70), Expect = 0.97
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 12/55 (21%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+ + +EE++EEEEEEE + K G +E+EE+E E+ E+ E
Sbjct: 160 EDEDDDDEEDDEEEEEEEEEIK------------GFDDEDEEDEGGEDFTYEKSE 202
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 36.2 bits (84), Expect = 0.006
Identities = 20/61 (32%), Positives = 34/61 (55%), Gaps = 7/61 (11%)
Query: 245 REVHEEEEEEEEEEEETKAKETK-------LKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+E +E+E EE EE + +E + LKE+K+ EE ++EE+ E E+ E+ E
Sbjct: 39 QEEAKEKEREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDEEDAETEDTEDVE 98
Query: 298 E 298
+
Sbjct: 99 D 99
Score = 35.4 bits (82), Expect = 0.009
Identities = 17/65 (26%), Positives = 29/65 (44%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVV 304
+E E + EEE ++ + ET+ E + G E + EEE +E++ V V
Sbjct: 70 KERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDEDKYKTTVTV 129
Query: 305 SAVDF 309
+D
Sbjct: 130 EELDP 134
Score = 30.8 bits (70), Expect = 0.39
Identities = 14/69 (20%), Positives = 32/69 (46%), Gaps = 21/69 (30%)
Query: 253 EEEEEEEETKAKETKLKEKKKLG---------------------GGGGEEEEEEEEEEEE 291
++ +EE + K +E +++E+K++ ++EE+ E E+ E
Sbjct: 36 KKAQEEAKEKEREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDEEDAETEDTE 95
Query: 292 EEEEEEEIG 300
+ E++E G
Sbjct: 96 DVEDDEWEG 104
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 37.9 bits (88), Expect = 0.006
Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 6/60 (10%)
Query: 245 REVHEEEEEEE------EEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
R V + +E EE K E L E GG E++E+E++EE +EE E+++
Sbjct: 299 RPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQ 358
Score = 28.2 bits (63), Expect = 5.7
Identities = 13/37 (35%), Positives = 20/37 (54%)
Query: 237 NDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK 273
Y G + + +E++EE +EE E K K K K+K
Sbjct: 332 LGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRK 368
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 37.8 bits (88), Expect = 0.006
Identities = 21/55 (38%), Positives = 30/55 (54%), Gaps = 1/55 (1%)
Query: 244 EREVHEEEEEEEEEEEETKAKET-KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E + E EE E E+ +A+E +L+E L E+E+E EE + EEEEE
Sbjct: 395 EEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEAR 449
Score = 37.5 bits (87), Expect = 0.007
Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 5/55 (9%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+RE E E E+ E E A+ +L + E E+E+E EE + EEEEE
Sbjct: 399 QREAEERLEAEQAERAEEDARLRELYPLPE-----DEFEDEDELEEAQPEEEEEA 448
Score = 35.9 bits (83), Expect = 0.025
Identities = 16/55 (29%), Positives = 26/55 (47%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ EE + E EE + + +E +L E+E E+E+E EE + EE
Sbjct: 390 SQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEE 444
Score = 34.8 bits (80), Expect = 0.058
Identities = 19/55 (34%), Positives = 28/55 (50%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E + E EE E E + + ++L +E E+E+E EE + EEEEE
Sbjct: 393 EEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEE 447
Score = 34.8 bits (80), Expect = 0.058
Identities = 17/65 (26%), Positives = 25/65 (38%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
Q D EEEE + E EE E + ++ E+E E+E+E
Sbjct: 378 QEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDEL 437
Query: 294 EEEEE 298
EE +
Sbjct: 438 EEAQP 442
Score = 33.2 bits (76), Expect = 0.17
Identities = 16/73 (21%), Positives = 30/73 (41%)
Query: 229 NKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
+ + E + K E EEEE + E E + + + + ++ E+E
Sbjct: 372 DSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEF 431
Query: 289 EEEEEEEEEEIGG 301
E+E+E EE +
Sbjct: 432 EDEDELEEAQPEE 444
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 36.6 bits (85), Expect = 0.006
Identities = 15/53 (28%), Positives = 34/53 (64%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
E+ E EE+++ + ++ K+K+ K K+K K ++ E+++E+E E++ E+
Sbjct: 72 EKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124
Score = 32.4 bits (74), Expect = 0.17
Identities = 11/55 (20%), Positives = 36/55 (65%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+++ E EE E+ ++E + K+ +KKK +++++++++++++ E+++E
Sbjct: 61 KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115
Score = 27.7 bits (62), Expect = 6.6
Identities = 8/54 (14%), Positives = 34/54 (62%), Gaps = 4/54 (7%)
Query: 249 EEEEEEEEEEEETKAKETKLK----EKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E +++++E + K+K EK+K + +++++++++++++++++
Sbjct: 55 AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDD 108
Score = 27.3 bits (61), Expect = 8.1
Identities = 9/50 (18%), Positives = 28/50 (56%), Gaps = 6/50 (12%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
++ +++++++++ K + K +KK +E+E E++ E+ + E
Sbjct: 89 KKSKKKKDKDKDKKDDKKDDKSEKK------DEKEAEDKLEDLTKSYSET 132
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 37.8 bits (87), Expect = 0.006
Identities = 13/46 (28%), Positives = 23/46 (50%)
Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E +E KE + +E + E + + +E EE+EE++E
Sbjct: 283 EIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDE 328
Score = 35.4 bits (81), Expect = 0.029
Identities = 13/48 (27%), Positives = 24/48 (50%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E +E E K ++ + E + + + +E EE+EE++E E
Sbjct: 283 EIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENE 330
Score = 33.5 bits (76), Expect = 0.12
Identities = 14/51 (27%), Positives = 25/51 (49%), Gaps = 2/51 (3%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E +E E ++E + +E + E + +E EE+EE++E EE
Sbjct: 284 IENKEVSEGDKEQQQEEVENAEAH--KEEVQSDRPDEIGEEKEEDDENEEN 332
Score = 32.4 bits (73), Expect = 0.29
Identities = 17/61 (27%), Positives = 26/61 (42%), Gaps = 4/61 (6%)
Query: 241 KGGEREVHEEEEEEE----EEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
+G E E E EE+ E E K KE+++ E +EE + + +E E
Sbjct: 262 QGAEEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGE 321
Query: 297 E 297
E
Sbjct: 322 E 322
Score = 29.3 bits (65), Expect = 3.2
Identities = 14/55 (25%), Positives = 21/55 (38%), Gaps = 19/55 (34%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E E +EE + + +E EE+EE++E EE E
Sbjct: 300 EVENAEAHKEEVQSDRP-------------------DEIGEEKEEDDENEENERH 335
Score = 28.9 bits (64), Expect = 3.4
Identities = 18/63 (28%), Positives = 29/63 (46%), Gaps = 6/63 (9%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE------EEEEEEEEE 297
++E EEE EE E + + K G +E+++EE E EE + + +
Sbjct: 258 KKEKQGAEEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPD 317
Query: 298 EIG 300
EIG
Sbjct: 318 EIG 320
Score = 28.5 bits (63), Expect = 4.8
Identities = 12/56 (21%), Positives = 25/56 (44%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E + + +E E K ++ + +E ++ +E E E+ EE+ + E
Sbjct: 304 AEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKGIEEKRRQMES 359
Score = 28.1 bits (62), Expect = 6.3
Identities = 9/32 (28%), Positives = 16/32 (50%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETK 267
EN E + +E EE+EE+ + +E +
Sbjct: 302 ENAEAHKEEVQSDRPDEIGEEKEEDDENEENE 333
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 37.5 bits (87), Expect = 0.007
Identities = 16/51 (31%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
+E + EE E KAKE KLK+++ E++++E+ +E++E+ +++E
Sbjct: 390 DETDASEEAEAKAKEEKLKQEEN------EKKQKEQADEDKEKRQKDERKK 434
Score = 36.7 bits (85), Expect = 0.014
Identities = 13/53 (24%), Positives = 32/53 (60%), Gaps = 4/53 (7%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ ++ E ++E + ++ E K KE+K EE E++++E+ +E++E+ +
Sbjct: 380 DYTKKLGEVKDETDASEEAEAKAKEEK----LKQEENEKKQKEQADEDKEKRQ 428
Score = 32.5 bits (74), Expect = 0.30
Identities = 10/50 (20%), Positives = 27/50 (54%), Gaps = 6/50 (12%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
++ ++ E ++ + + K K EE+ ++EE E++++E+ +E
Sbjct: 379 QDYTKKLGEVKDETDASEEAEAKAK------EEKLKQEENEKKQKEQADE 422
Score = 32.5 bits (74), Expect = 0.31
Identities = 12/39 (30%), Positives = 25/39 (64%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK 273
SE K E ++ +EE E++++E+ + KE + K+++K
Sbjct: 395 SEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433
Score = 30.9 bits (70), Expect = 0.78
Identities = 12/42 (28%), Positives = 22/42 (52%)
Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
++ +E K KL++ K G +E + EE E + +EE+
Sbjct: 365 DKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEK 406
Score = 30.5 bits (69), Expect = 1.1
Identities = 13/50 (26%), Positives = 26/50 (52%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ +E +E ++ + KL E K E E + +EE+ ++EE E++
Sbjct: 366 KRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKK 415
Score = 30.1 bits (68), Expect = 1.7
Identities = 12/38 (31%), Positives = 23/38 (60%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK 273
E D + E + EE+ ++EE E++ K + + KEK++
Sbjct: 391 ETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQ 428
Score = 29.4 bits (66), Expect = 2.8
Identities = 13/71 (18%), Positives = 37/71 (52%), Gaps = 1/71 (1%)
Query: 228 YNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE 287
K + +++ G++ +E ++ ++ TK K ++K++ + +EE+
Sbjct: 349 LTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTK-KLGEVKDETDASEEAEAKAKEEKL 407
Query: 288 EEEEEEEEEEE 298
++EE E++++E
Sbjct: 408 KQEENEKKQKE 418
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 36.9 bits (86), Expect = 0.007
Identities = 20/79 (25%), Positives = 36/79 (45%), Gaps = 15/79 (18%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI---- 299
RE E+ + E+ + + K+ + + KKK E+ ++ E EE + EE + +
Sbjct: 3 SREALLEQRRRKREQRKARKKQKRKEAKKK------EDAQKSEAEEVKNEENKSKKKAAP 56
Query: 300 -----GGVVVSAVDFGSGG 313
G +V S V+F G
Sbjct: 57 IENAEGNIVFSKVEFADGE 75
Score = 32.3 bits (74), Expect = 0.23
Identities = 11/47 (23%), Positives = 24/47 (51%), Gaps = 6/47 (12%)
Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E+ + K + K +KK ++ +E +++E+ ++ E EE
Sbjct: 2 SSREALLEQRRRKREQRKARKK------QKRKEAKKKEDAQKSEAEE 42
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 35.8 bits (83), Expect = 0.007
Identities = 26/60 (43%), Positives = 35/60 (58%), Gaps = 10/60 (16%)
Query: 244 EREVHEE---EEEEEEEEEETKAKETKLKE--KKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
ERE+ EE ++E E+E +E KE + KK+L EE + EEEE+E EE EEE
Sbjct: 64 EREIAEERKLQKEREKEGDEFADKEKFVTSAYKKQL-----EENRKLEEEEKEREELEEE 118
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 35.4 bits (82), Expect = 0.007
Identities = 10/26 (38%), Positives = 18/26 (69%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIG 300
E ++EE++EEEEEE ++++G
Sbjct: 74 AAAAAAEAKKEEKKEEEEEESDDDMG 99
Score = 34.6 bits (80), Expect = 0.011
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIGG 301
E ++EE++EEEEEE ++ G
Sbjct: 73 AAAAAAAEAKKEEKKEEEEEESDDDMG 99
Score = 31.1 bits (71), Expect = 0.22
Identities = 9/24 (37%), Positives = 14/24 (58%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
E ++EE++EEEEEE +
Sbjct: 72 AAAAAAAAEAKKEEKKEEEEEESD 95
Score = 29.6 bits (67), Expect = 0.63
Identities = 13/38 (34%), Positives = 17/38 (44%), Gaps = 6/38 (15%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSG 312
E ++EE++EEEEEE S D G G
Sbjct: 70 AAAAAAAAAAEAKKEEKKEEEEEE------SDDDMGFG 101
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 36.2 bits (84), Expect = 0.008
Identities = 16/51 (31%), Positives = 27/51 (52%)
Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
EE EE+E++E + +E L + ++ EEE+ +EEE + E E
Sbjct: 13 DEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEAE 63
Score = 30.4 bits (69), Expect = 0.80
Identities = 11/20 (55%), Positives = 16/20 (80%)
Query: 279 GEEEEEEEEEEEEEEEEEEE 298
GEE+EE EE+E++E +EE
Sbjct: 10 GEEDEELPEEDEDDESSDEE 29
Score = 30.4 bits (69), Expect = 0.82
Identities = 13/57 (22%), Positives = 27/57 (47%), Gaps = 11/57 (19%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+ E+EE EE+E++ + E + ++E++EE + EEE+ +
Sbjct: 6 ESDDGEEDEELPEEDEDDESSDE-----------EEVDLPDDEQDEESDSEEEQIFV 51
Score = 30.0 bits (68), Expect = 1.0
Identities = 17/51 (33%), Positives = 29/51 (56%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
++ EE+EE EE + E+ +E+ L +EE + EEE+ +EEE+
Sbjct: 8 DDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEV 58
Score = 29.3 bits (66), Expect = 1.8
Identities = 13/69 (18%), Positives = 25/69 (36%), Gaps = 18/69 (26%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
+D + E +E++E +EEE ++E++EE + EEE
Sbjct: 6 ESDDGEEDEELPEEDEDDESSDEEEVD------------------LPDDEQDEESDSEEE 47
Query: 295 EEEEIGGVV 303
+
Sbjct: 48 QIFVTRQEE 56
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 37.4 bits (87), Expect = 0.008
Identities = 21/54 (38%), Positives = 32/54 (59%), Gaps = 6/54 (11%)
Query: 253 EEEEEEEETKAKETKLK------EKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
E +++ TKAKET+ ++ E ++EEEE+EEEEEEE+E+ G
Sbjct: 354 ETLNKQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEG 407
Score = 34.0 bits (78), Expect = 0.11
Identities = 20/57 (35%), Positives = 29/57 (50%), Gaps = 13/57 (22%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
ER+ EEE+EEEEEEE+ G +E ++EE EE++ E + E
Sbjct: 385 NTERDDEEEEDEEEEEEEDE-------------DEGPSKEHSDDEEFEEDDVESKYE 428
Score = 32.0 bits (73), Expect = 0.45
Identities = 12/20 (60%), Positives = 16/20 (80%)
Query: 280 EEEEEEEEEEEEEEEEEEEI 299
EE+EEEEEEE+E+E +E
Sbjct: 393 EEDEEEEEEEDEDEGPSKEH 412
Score = 32.0 bits (73), Expect = 0.50
Identities = 11/22 (50%), Positives = 15/22 (68%)
Query: 279 GEEEEEEEEEEEEEEEEEEEIG 300
E+EEEEEEE+E+E +E
Sbjct: 393 EEDEEEEEEEDEDEGPSKEHSD 414
Score = 28.6 bits (64), Expect = 5.2
Identities = 20/73 (27%), Positives = 32/73 (43%), Gaps = 17/73 (23%)
Query: 226 DGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEE 285
D ++ + N E E EEEEEE+E+E +K E ++E
Sbjct: 374 DAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSK-----------------EHSDDE 416
Query: 286 EEEEEEEEEEEEE 298
E EE++ E + E+
Sbjct: 417 EFEEDDVESKYED 429
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 37.3 bits (86), Expect = 0.009
Identities = 16/57 (28%), Positives = 29/57 (50%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E +EEEEEE+E E + + + + E+ ++E E++EE++ EE
Sbjct: 110 DSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDEEKKMEE 166
Score = 36.5 bits (84), Expect = 0.019
Identities = 17/53 (32%), Positives = 28/53 (52%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
EEEEEE+E E+ + + E+ ++E E++EE++ EEE G
Sbjct: 118 EEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDEEKKMEEEEAG 170
Score = 35.7 bits (82), Expect = 0.033
Identities = 16/56 (28%), Positives = 30/56 (53%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
G E+ ++EEEEEE ++ E ++ + + E E E+ ++E E++EE
Sbjct: 106 GSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDEE 161
Score = 34.6 bits (79), Expect = 0.065
Identities = 9/68 (13%), Positives = 25/68 (36%)
Query: 231 TIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEE 290
+ D + + + E+ ++E + +E + E + + + E E
Sbjct: 89 RVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSL 148
Query: 291 EEEEEEEE 298
E+ ++E
Sbjct: 149 EDLSDDET 156
Score = 34.2 bits (78), Expect = 0.095
Identities = 12/56 (21%), Positives = 22/56 (39%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
G + E+ ++EEE + E+ E E E E+ ++E E+
Sbjct: 103 GSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETED 158
Score = 34.2 bits (78), Expect = 0.100
Identities = 13/49 (26%), Positives = 23/49 (46%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E +E + + E + G G + + E+ ++EEEEEE+E
Sbjct: 78 YVEFLINKEHIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDES 126
Score = 33.4 bits (76), Expect = 0.15
Identities = 15/64 (23%), Positives = 33/64 (51%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
S D G + + + ++EEEEEE+ ++ + ++ + E E+ ++E E+
Sbjct: 99 SHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETED 158
Query: 295 EEEE 298
+EE+
Sbjct: 159 DEEK 162
Score = 33.4 bits (76), Expect = 0.17
Identities = 22/74 (29%), Positives = 32/74 (43%), Gaps = 4/74 (5%)
Query: 232 IFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEE----EEEEE 287
+S + E E EE++ EEEE E K + +KK G +++ +E E
Sbjct: 143 ESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNE 202
Query: 288 EEEEEEEEEEEIGG 301
E E EEE G
Sbjct: 203 FLEATEAEEEAALG 216
Score = 32.7 bits (74), Expect = 0.29
Identities = 14/56 (25%), Positives = 30/56 (53%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ E ++EEEEEE+E + + +++ L E+ ++E E++EE++
Sbjct: 109 MDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDEEKKM 164
Score = 30.7 bits (69), Expect = 1.2
Identities = 12/66 (18%), Positives = 29/66 (43%)
Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
+ E+D + + + EEE+EE+ + ++ + +E+ + +E E +
Sbjct: 273 AELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDL 332
Query: 293 EEEEEE 298
E +E
Sbjct: 333 ESPKEL 338
Score = 29.6 bits (66), Expect = 2.3
Identities = 18/67 (26%), Positives = 34/67 (50%), Gaps = 7/67 (10%)
Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSA 306
+ E E++E ++E KE K ++ E+EE++E+E++++EEE E V
Sbjct: 270 GDDAELEDDEPDKEAVKKEADSKPEE-------EDEEDDEQEDDQDEEEPPEAAMDKVKL 322
Query: 307 VDFGSGG 313
+ G
Sbjct: 323 DEPVLEG 329
Score = 29.6 bits (66), Expect = 2.4
Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 14/67 (20%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+S D E E E E E+ + + E++EE++ EEEE
Sbjct: 125 ESLEDEMIDDEDEADLFNESESSLEDLSDDET--------------EDDEEKKMEEEEAG 170
Query: 294 EEEEEIG 300
EE+E +
Sbjct: 171 EEKESVE 177
Score = 29.2 bits (65), Expect = 3.1
Identities = 17/74 (22%), Positives = 36/74 (48%), Gaps = 10/74 (13%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEE----ETKAKETKLKEKKKLGGGGGEEE------EEE 285
++D G + ++EE E E+ + K K+ + +L ++E + +
Sbjct: 233 KDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSK 292
Query: 286 EEEEEEEEEEEEEI 299
EEE+EE++E+E+
Sbjct: 293 PEEEDEEDDEQEDD 306
Score = 29.2 bits (65), Expect = 3.4
Identities = 15/55 (27%), Positives = 23/55 (41%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E + EEE+EE++E+E E + E E E + E +E E
Sbjct: 288 EADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFE 342
Score = 28.8 bits (64), Expect = 3.9
Identities = 13/58 (22%), Positives = 26/58 (44%), Gaps = 1/58 (1%)
Query: 244 EREVHEEEEEEEEEEEETKAK-ETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
+ E HE+ + + + E A E + +E + L ++E+E + E E E+
Sbjct: 96 DSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSD 153
Score = 28.4 bits (63), Expect = 6.8
Identities = 15/65 (23%), Positives = 35/65 (53%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+ N + ++++++ ++ E + E + KK EEE+EE++E+E+++
Sbjct: 248 EEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQ 307
Query: 294 EEEEE 298
+EEE
Sbjct: 308 DEEEP 312
Score = 28.0 bits (62), Expect = 7.6
Identities = 12/57 (21%), Positives = 32/57 (56%), Gaps = 5/57 (8%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+E ++++ ++ E E + + +K+ + EEE+EE++E+E++++E
Sbjct: 259 DPKEKDKKKDAGDDAELEDDEPDKEAVKKE-----ADSKPEEEDEEDDEQEDDQDEE 310
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 34.8 bits (80), Expect = 0.010
Identities = 14/23 (60%), Positives = 20/23 (86%)
Query: 279 GEEEEEEEEEEEEEEEEEEEIGG 301
E+EEE+++EEE+EEEEEE + G
Sbjct: 79 EEKEEEKKKEEEKEEEEEEALAG 101
Score = 32.5 bits (74), Expect = 0.069
Identities = 13/19 (68%), Positives = 18/19 (94%)
Query: 280 EEEEEEEEEEEEEEEEEEE 298
EE+EEE+++EEE+EEEEE
Sbjct: 78 AEEKEEEKKKEEEKEEEEE 96
Score = 30.9 bits (70), Expect = 0.28
Identities = 12/19 (63%), Positives = 18/19 (94%)
Query: 280 EEEEEEEEEEEEEEEEEEE 298
+ EE+EEE+++EEE+EEEE
Sbjct: 77 KAEEKEEEKKKEEEKEEEE 95
Score = 30.5 bits (69), Expect = 0.31
Identities = 12/20 (60%), Positives = 18/20 (90%)
Query: 279 GEEEEEEEEEEEEEEEEEEE 298
EE+ EE+EEE+++EEE+EE
Sbjct: 74 AEEKAEEKEEEKKKEEEKEE 93
Score = 30.2 bits (68), Expect = 0.43
Identities = 12/19 (63%), Positives = 18/19 (94%)
Query: 280 EEEEEEEEEEEEEEEEEEE 298
E+ EE+EEE+++EEE+EEE
Sbjct: 76 EKAEEKEEEKKKEEEKEEE 94
Score = 29.4 bits (66), Expect = 0.73
Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 7/39 (17%)
Query: 263 AKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
A E K +EK+ EE+++EEE+EEEEEE +G
Sbjct: 73 AAEEKAEEKE-------EEKKKEEEKEEEEEEALAGLGA 104
Score = 27.8 bits (62), Expect = 3.1
Identities = 10/19 (52%), Positives = 15/19 (78%)
Query: 280 EEEEEEEEEEEEEEEEEEE 298
EE+ EE+EEE+++EEE
Sbjct: 72 AAAEEKAEEKEEEKKKEEE 90
Score = 26.3 bits (58), Expect = 9.4
Identities = 9/19 (47%), Positives = 14/19 (73%)
Query: 280 EEEEEEEEEEEEEEEEEEE 298
EE+ EE+EEE+++EE
Sbjct: 71 AAAAEEKAEEKEEEKKKEE 89
Score = 26.3 bits (58), Expect = 9.4
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 243 GEREVHEEEEEEEEEEEETKAKE 265
E + E+EEE+++EEE+ + +E
Sbjct: 74 AEEKAEEKEEEKKKEEEKEEEEE 96
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 35.3 bits (81), Expect = 0.011
Identities = 17/54 (31%), Positives = 27/54 (50%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E E + +E+ +E E A E +E+ +E + E EE +EEE+EE
Sbjct: 75 EAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEE 128
Score = 34.9 bits (80), Expect = 0.013
Identities = 13/58 (22%), Positives = 27/58 (46%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E + +E+ +E +A ++++ +E +E + E EE +EEE+
Sbjct: 69 AAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEED 126
Score = 34.9 bits (80), Expect = 0.015
Identities = 14/53 (26%), Positives = 26/53 (49%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+E+ +E E + + + +E + +E + E EE +EEE+EE E
Sbjct: 78 AADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEAE 130
Score = 34.9 bits (80), Expect = 0.016
Identities = 15/56 (26%), Positives = 27/56 (48%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E + +E+ +E E A + +E++ +E + E EE +EEE+EE
Sbjct: 73 AEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEE 128
Score = 32.2 bits (73), Expect = 0.13
Identities = 14/52 (26%), Positives = 25/52 (48%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+ E+ +E E + +A E + ++ + E EE +EEE+EE E
Sbjct: 80 DADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEAEA 131
Score = 31.5 bits (71), Expect = 0.21
Identities = 17/65 (26%), Positives = 26/65 (40%), Gaps = 8/65 (12%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
++E E E + +E +EEE + E + E EE +EEE+
Sbjct: 75 EAEAADADEDADEAAEADAADEADEEEETDEAVDETAD--------EADAEAEEADEEED 126
Query: 294 EEEEE 298
EE E
Sbjct: 127 EEAEA 131
Score = 28.8 bits (64), Expect = 1.9
Identities = 16/69 (23%), Positives = 21/69 (30%), Gaps = 14/69 (20%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
G E EE E A E + +E E + +E +EEEE
Sbjct: 60 KGPAAAATAAAEAAEEAEAADADE--------------DADEAAEADAADEADEEEETDE 105
Query: 302 VVVSAVDFG 310
V D
Sbjct: 106 AVDETADEA 114
Score = 28.0 bits (62), Expect = 3.6
Identities = 11/48 (22%), Positives = 18/48 (37%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E+ A+ + E E E + +E +EEEE +E
Sbjct: 58 AEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDE 105
Score = 28.0 bits (62), Expect = 3.8
Identities = 14/66 (21%), Positives = 25/66 (37%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E +Y G E E E E+ A + ++ +E+ +E E + +E
Sbjct: 38 EKNYDLGREARDLEWTEAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEA 97
Query: 296 EEEIGG 301
+EE
Sbjct: 98 DEEEET 103
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 37.2 bits (86), Expect = 0.012
Identities = 15/59 (25%), Positives = 33/59 (55%), Gaps = 4/59 (6%)
Query: 244 EREVHEEEEEEEEEEEETKAKE----TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
ER+ +EE+ +E+E ++ KA + KL+ ++ G ++ E++ + + E+E E
Sbjct: 19 ERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVEDENPE 77
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 37.0 bits (85), Expect = 0.013
Identities = 19/93 (20%), Positives = 34/93 (36%), Gaps = 13/93 (13%)
Query: 226 DGYNKTIFQSENDYCKG-------------GEREVHEEEEEEEEEEEETKAKETKLKEKK 272
D NK +YC G + EE + ++ E
Sbjct: 424 DSINKGKLACVENYCVGKRLPPHLSPFASVDSYDPRASLMTMEETQRHSEEDLVNRFEDV 483
Query: 273 KLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVS 305
+ GEE+++++EE + ++E E E G+ S
Sbjct: 484 RYEHVAGEEDDDDDEELQAQKELELEAQGIKYS 516
Score = 33.9 bits (77), Expect = 0.12
Identities = 15/65 (23%), Positives = 33/65 (50%)
Query: 232 IFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
F+ GE + ++EE + ++E E +A+ K E + + + ++ + +EE
Sbjct: 479 RFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEE 538
Query: 292 EEEEE 296
EEE++
Sbjct: 539 EEEKK 543
>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
(DUF2052). This entry is of sequences of two conserved
domains separated by a region of low complexity,
spanning some 200 residues. The function is unknown.
Length = 178
Score = 35.9 bits (83), Expect = 0.013
Identities = 21/55 (38%), Positives = 31/55 (56%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
E EEEE EEEEE+ + + + +E+ + G G EEEE+ + +E EE E
Sbjct: 75 EQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELPTPEEREEL 129
Score = 30.1 bits (68), Expect = 1.1
Identities = 17/66 (25%), Positives = 27/66 (40%), Gaps = 11/66 (16%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
++ + + E E EEEEE+ EEE + +E EEEEE+ +
Sbjct: 68 EARLELLEQQEEEESAEEEEEDPEEENEEEEEEY-----------QRGPFGEEEEEDGDS 116
Query: 294 EEEEEI 299
+E
Sbjct: 117 YDELPT 122
Score = 28.5 bits (64), Expect = 3.7
Identities = 21/66 (31%), Positives = 30/66 (45%), Gaps = 11/66 (16%)
Query: 244 EREVHEEEEEEEEEEEET--------KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
ERE + E +A+ L+++++ EEEEE+ EEE EEEE
Sbjct: 42 EREARGRAKRWSEGLSGVLESSLDREEARLELLEQQEEEES---AEEEEEDPEEENEEEE 98
Query: 296 EEEIGG 301
EE G
Sbjct: 99 EEYQRG 104
Score = 28.2 bits (63), Expect = 4.5
Identities = 14/46 (30%), Positives = 21/46 (45%)
Query: 254 EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+ EE ++ + +E + EEE EEEEEE + EE
Sbjct: 65 DREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEE 110
Score = 27.8 bits (62), Expect = 6.5
Identities = 17/63 (26%), Positives = 28/63 (44%), Gaps = 6/63 (9%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E+ + R E++EEEE EE + + E++ EE + EEEEE+
Sbjct: 61 ESSLDREEARLELLEQQEEEESAEEEEEDPEEENEEE------EEEYQRGPFGEEEEEDG 114
Query: 296 EEE 298
+
Sbjct: 115 DSY 117
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 36.0 bits (84), Expect = 0.014
Identities = 10/55 (18%), Positives = 23/55 (41%), Gaps = 14/55 (25%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
E E + E++++++E+E+E++EEE ++G
Sbjct: 34 AAAAAATAAAIESELDEEDL--------------EDDDDDDEDEDEDDEEEADLG 74
Score = 34.5 bits (80), Expect = 0.042
Identities = 10/57 (17%), Positives = 26/57 (45%), Gaps = 9/57 (15%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
G + + E++ E L++ +++++E+E+E++EEE +
Sbjct: 28 GFIDPNAAAAAATAAAIESELDEEDLED---------DDDDDEDEDEDDEEEADLGP 75
Score = 29.8 bits (68), Expect = 1.6
Identities = 13/55 (23%), Positives = 23/55 (41%), Gaps = 14/55 (25%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E +EE+ E+ + E+E+E++EEE + + EE
Sbjct: 39 ATAAAIESELDEEDLEDDDDDD--------------EDEDEDDEEEADLGPDPEE 79
Score = 29.5 bits (67), Expect = 1.7
Identities = 10/51 (19%), Positives = 20/51 (39%), Gaps = 9/51 (17%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E E + + E + +E+E+E++EEE + + E
Sbjct: 38 AATAAAIESELDEEDLEDDDDD---------DEDEDEDDEEEADLGPDPEE 79
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 34.9 bits (81), Expect = 0.014
Identities = 23/72 (31%), Positives = 33/72 (45%), Gaps = 8/72 (11%)
Query: 228 YNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE 287
KT E D E+ E+++EE++ E E AK + +EK E E E+ E
Sbjct: 4 EEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSA-EEKA-------EYELEKLE 55
Query: 288 EEEEEEEEEEEI 299
+E EE E E
Sbjct: 56 KELEELEAELAR 67
Score = 31.5 bits (72), Expect = 0.19
Identities = 17/55 (30%), Positives = 26/55 (47%), Gaps = 5/55 (9%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ EE++ E E+ + A+E E +KL E+E EE E E E + E
Sbjct: 24 WEKKQEEKKSEAEKLAKMSAEEKAEYELEKL-----EKELEELEAELARRELKAE 73
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 35.5 bits (82), Expect = 0.014
Identities = 14/51 (27%), Positives = 25/51 (49%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
++EEEE EEE T E + + K ++ E + + E+E+ + E
Sbjct: 94 SDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPK 144
Score = 30.9 bits (70), Expect = 0.47
Identities = 14/54 (25%), Positives = 24/54 (44%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
E E E EE +E E+E+ + K ++KK+ E+E+ + E
Sbjct: 89 RELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142
Score = 29.3 bits (66), Expect = 1.5
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 272 KKLGGGGGEEEEEEEEEEEEEEEEEEE 298
++L EEEE EEE +E E+E+
Sbjct: 89 RELSSSDDEEEETEEESTDETEQEDPP 115
>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2). This family
represents a conserved region approximately 60 residues
long within the eukaryotic targeting protein for Xklp2
(TPX2). Xklp2 is a kinesin-like protein localised on
centrosomes throughout the cell cycle and on spindle
pole microtubules during metaphase. In Xenopus, it has
been shown that Xklp2 protein is required for centrosome
separation and maintenance of spindle bi-polarity. TPX2
is a microtubule-associated protein that mediates the
binding of the C-terminal domain of Xklp2 to
microtubules. It is phosphorylated during mitosis in a
microtubule-dependent way.
Length = 57
Score = 33.1 bits (76), Expect = 0.015
Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 5/45 (11%)
Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+E E+ + KL+EK+K E E+EE E ++EEEEE I
Sbjct: 4 DERAEKRAEFDKKLEEKEKA-----LEAEKEEAEARQKEEEEEAI 43
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 35.3 bits (81), Expect = 0.017
Identities = 16/56 (28%), Positives = 28/56 (50%), Gaps = 9/56 (16%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+EEE+ + + EE + E ++++ E+ +EE+EE EE EEE
Sbjct: 14 PPPTKDEEEDWDSQAEEVEEDEEEMED---------WEDSLDEEDEEAEEVEEETA 60
Score = 31.0 bits (70), Expect = 0.42
Identities = 19/65 (29%), Positives = 32/65 (49%), Gaps = 17/65 (26%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEE-------EEEEEEE 293
KG +++ ++EEE+ ++ +++E +EEE E EE+EE E
Sbjct: 4 KGNAKKLKVRPPPTKDEEEDWDSQAEEVEE----------DEEEMEDWEDSLDEEDEEAE 53
Query: 294 EEEEE 298
E EEE
Sbjct: 54 EVEEE 58
Score = 28.3 bits (63), Expect = 3.7
Identities = 11/17 (64%), Positives = 12/17 (70%)
Query: 249 EEEEEEEEEEEETKAKE 265
EE+EE EE EEET A
Sbjct: 47 EEDEEAEEVEEETAASS 63
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 36.5 bits (84), Expect = 0.019
Identities = 13/54 (24%), Positives = 26/54 (48%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E +EE+ + + +E++A+E + E E + EEE ++ +EE
Sbjct: 3838 EELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEE 3891
Score = 31.5 bits (71), Expect = 0.73
Identities = 16/63 (25%), Positives = 28/63 (44%), Gaps = 9/63 (14%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE---EEEEEI 299
E +E+ ++++ + + K+ E G EE +E EE E+ +EE E
Sbjct: 4037 LEENNTLDEDIQQDDFSDLAEDDEKMNED------GFEENVQENEESTEDGVKSDEELEQ 4090
Query: 300 GGV 302
G V
Sbjct: 4091 GEV 4093
Score = 30.7 bits (69), Expect = 1.2
Identities = 14/60 (23%), Positives = 28/60 (46%), Gaps = 5/60 (8%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKL---GGGGGEEEEE--EEEEEEEEEEEEEEI 299
++E+++ EE EE A ++ L E + G ++ E E + EEE +++
Sbjct: 3829 NTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDL 3888
Score = 30.7 bits (69), Expect = 1.4
Identities = 12/48 (25%), Positives = 25/48 (52%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
E +EE + E++E E L+E L +++ + E++E+ E+
Sbjct: 4018 ENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNED 4065
Score = 29.6 bits (66), Expect = 3.3
Identities = 16/86 (18%), Positives = 37/86 (43%), Gaps = 13/86 (15%)
Query: 221 PVNRPDGYNKTIFQSEN-DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGG 279
P + + N+ ++ N + E++ +E+ E + +K + K E
Sbjct: 3896 PEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALED-------K 3948
Query: 280 EEEEEEEEEEEE-----EEEEEEEIG 300
+ +E+E+EEE ++E + +I
Sbjct: 3949 DRQEKEDEEEMSDDVGIDDEIQPDIQ 3974
Score = 29.2 bits (65), Expect = 4.4
Identities = 21/101 (20%), Positives = 39/101 (38%), Gaps = 8/101 (7%)
Query: 218 DLDPVNRPDGYNKTIFQSEN-DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGG 276
D DP+ + ++ I Q + D + E+ + EE +E EE+ K E+ + G
Sbjct: 4033 DEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGE 4092
Query: 277 -------GGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFG 310
+ + + E +EE G+V + G
Sbjct: 4093 VPEDQAIDNHPKMDAKSTFASAEADEENTDKGIVGENEELG 4133
Score = 28.4 bits (63), Expect = 6.3
Identities = 13/50 (26%), Positives = 23/50 (46%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
++E+ EE T ++ + + L + E+ EE +E EE E G
Sbjct: 4032 QDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDG 4081
Score = 28.0 bits (62), Expect = 8.1
Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 2/60 (3%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEK--KKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
G V E+ EEE+E+++E +E + K ++ E E EEE+ E E G
Sbjct: 2921 GNINVSEQIEEEDEKDDEYGEQEKVSDKNDYLKDFIDYDDDGSISTESEPEEEKRECESG 2980
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 35.9 bits (83), Expect = 0.020
Identities = 18/65 (27%), Positives = 24/65 (36%), Gaps = 5/65 (7%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEE-----EEEEEEEEEEEEEEEEIGGVV 303
E +E E + E E KE + G EE E+ + EE E E+
Sbjct: 211 ELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAE 270
Query: 304 VSAVD 308
SA D
Sbjct: 271 ASADD 275
Score = 35.6 bits (82), Expect = 0.024
Identities = 12/55 (21%), Positives = 26/55 (47%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
G+ + E+ E+E++ + ++ E+++ G E+ + EE E E E
Sbjct: 213 GDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEME 267
Score = 29.4 bits (66), Expect = 2.2
Identities = 11/49 (22%), Positives = 19/49 (38%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ EE +E E+ E E ++ EEEE + E+ +
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSD 255
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 36.3 bits (84), Expect = 0.020
Identities = 22/47 (46%), Positives = 26/47 (55%), Gaps = 4/47 (8%)
Query: 257 EEEETKAKETK--LKEKKK--LGGGGGEEEEEEEEEEEEEEEEEEEI 299
EE+ T E + LKEK + EEEEEEEEEEEE E E +
Sbjct: 377 EEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPM 423
Score = 33.6 bits (77), Expect = 0.12
Identities = 15/34 (44%), Positives = 18/34 (52%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSGG 313
EEEEEEEE E E EE + G V + + G
Sbjct: 408 EEEEEEEEAAEAEAPMEEPVPGFEVPEMPMPAAG 441
Score = 30.2 bits (68), Expect = 1.4
Identities = 16/29 (55%), Positives = 18/29 (62%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKK 272
ER EEEEEEEEEEEE E ++E
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPV 427
Score = 29.4 bits (66), Expect = 2.5
Identities = 12/30 (40%), Positives = 15/30 (50%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKE 270
+ E EEEEEEEEE E +A +
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVP 428
Score = 28.2 bits (63), Expect = 5.8
Identities = 21/63 (33%), Positives = 24/63 (38%), Gaps = 19/63 (30%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDF 309
EEEEEEEEEE EEE E E EE E+ + + A
Sbjct: 402 AEEEEEEEEEE-------------------EEEAAEAEAPMEEPVPGFEVPEMPMPAAGG 442
Query: 310 GSG 312
G G
Sbjct: 443 GGG 445
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 35.4 bits (82), Expect = 0.020
Identities = 19/56 (33%), Positives = 25/56 (44%), Gaps = 13/56 (23%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+ E EEE EE E EE + + EE EEE E E +EE+ +I
Sbjct: 4 KNEQVEEEVEETEVEEAVEDEVE-------------EETVEEESEAELLDEEQAKI 46
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 33.1 bits (76), Expect = 0.020
Identities = 15/30 (50%), Positives = 21/30 (70%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKK 273
++E+ EEE+ EEEEE K +E K +EK K
Sbjct: 26 KKELKAEEEKREEEEEARKREERKEREKNK 55
Score = 31.5 bits (72), Expect = 0.079
Identities = 17/32 (53%), Positives = 20/32 (62%), Gaps = 6/32 (18%)
Query: 267 KLKEKKK-LGGGGGEEEEEEEEEEEEEEEEEE 297
KLK KKK L + EEE+ EEEEE + EE
Sbjct: 21 KLKAKKKEL-----KAEEEKREEEEEARKREE 47
Score = 28.8 bits (65), Expect = 0.70
Identities = 15/36 (41%), Positives = 21/36 (58%), Gaps = 6/36 (16%)
Query: 262 KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
K KE K +E+K+ EEEEE + EE +E E+
Sbjct: 25 KKKELKAEEEKR------EEEEEARKREERKEREKN 54
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 34.5 bits (79), Expect = 0.023
Identities = 26/72 (36%), Positives = 41/72 (56%), Gaps = 11/72 (15%)
Query: 241 KGGEREVHEEEEEEE----EEEEETKAKETKLKEKKKLGGGG-------GEEEEEEEEEE 289
+G E + EE+EE + E EEE KAK K++E++++ G ++EE+EEE +
Sbjct: 26 EGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREVMRQGIRDKYGIKKKEEDEEEPQ 85
Query: 290 EEEEEEEEEIGG 301
EEEE +G
Sbjct: 86 AAAEEEEGRLGR 97
Score = 30.6 bits (69), Expect = 0.48
Identities = 13/30 (43%), Positives = 16/30 (53%)
Query: 268 LKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
L K+ G EEE+EE +E E EEE
Sbjct: 20 LGGKEDEGDESDAEEEDEEIQEALREAEEE 49
Score = 29.5 bits (66), Expect = 1.1
Identities = 18/42 (42%), Positives = 25/42 (59%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
EE+EEE + + +E +L KKK E EE+EEEEE+
Sbjct: 78 EEDEEEPQAAAEEEEGRLGRKKKTPEELAAEAGEEDEEEEEK 119
Score = 29.5 bits (66), Expect = 1.1
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 268 LKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
LK+ K + GG +E +E + EEE+EE +E
Sbjct: 13 LKDVKGMLGGKEDEGDESDAEEEDEEIQE 41
Score = 27.9 bits (62), Expect = 3.5
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 270 EKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
++ + EEE+EE +E E EEE +
Sbjct: 23 KEDEGDESDAEEEDEEIQEALREAEEERK 51
Score = 27.5 bits (61), Expect = 5.4
Identities = 11/30 (36%), Positives = 15/30 (50%)
Query: 269 KEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
KE + EE+EE +E E EEE +
Sbjct: 23 KEDEGDESDAEEEDEEIQEALREAEEERKA 52
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 35.9 bits (83), Expect = 0.024
Identities = 18/77 (23%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
++ + ++E EE+ E ++E E + + E ++ G G+ E + +EE+EEE++
Sbjct: 398 EFVRSLQKEPLEEKPENKDESVEEISDAEEDDEDEEDEDGDGDVEMSAVDNDEEKEEEDK 457
Query: 298 EIG--GVVVSAVDFGSG 312
E ++ G G
Sbjct: 458 EAIPSTILEEEPTVGGG 474
Score = 33.6 bits (77), Expect = 0.14
Identities = 16/55 (29%), Positives = 33/55 (60%), Gaps = 2/55 (3%)
Query: 253 EEEEEEEETKA--KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVS 305
E+++++++TKA K++K ++KKK +E+E+E E E++ G+ V
Sbjct: 83 EDDDDDDDTKAWLKKSKKRQKKKEAERKKALLLDEKEKERAAEYTSEDLAGLKVG 137
Score = 33.6 bits (77), Expect = 0.15
Identities = 15/57 (26%), Positives = 27/57 (47%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSA 306
E++++ + K + K++KKL + EE E+EEE E G+V+
Sbjct: 339 VEDDDDLQASLAKQRRLAQKKRKKLRPEDIARQIAEERSEDEEEAAENNDNGIVIDE 395
Score = 32.4 bits (74), Expect = 0.32
Identities = 18/66 (27%), Positives = 29/66 (43%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
Q E K ++ EE + EE++E + E + + EE+EEE++E
Sbjct: 404 QKEPLEEKPENKDESVEEISDAEEDDEDEEDEDGDGDVEMSAVDNDEEKEEEDKEAIPST 463
Query: 294 EEEEEI 299
EEE
Sbjct: 464 ILEEEP 469
Score = 29.3 bits (66), Expect = 2.9
Identities = 14/45 (31%), Positives = 25/45 (55%), Gaps = 4/45 (8%)
Query: 248 HEEEEEEEEEEEETKAKETKLKEK----KKLGGGGGEEEEEEEEE 288
+EE E + + EE + K K +EK KLGG E++++++
Sbjct: 45 RQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDD 89
Score = 28.6 bits (64), Expect = 5.9
Identities = 16/54 (29%), Positives = 25/54 (46%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
R++ EE E+EEE E + E + +E EE+ E ++E EE
Sbjct: 370 RQIAEERSEDEEEAAENNDNGIVIDETSEFVRSLQKEPLEEKPENKDESVEEIS 423
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 34.2 bits (79), Expect = 0.026
Identities = 18/47 (38%), Positives = 31/47 (65%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
EE+ EE+E E + KE K +++K+ G G ++EE E+ EE E+ + +
Sbjct: 15 EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNK 61
Score = 33.8 bits (78), Expect = 0.035
Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 4/50 (8%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
EE+ EE+E E+E K + KE+K+ G G ++EE E+ EE E+ +
Sbjct: 15 EEKREEKEREKEEKER----KEEKEKEWGKGLVQKEEREKRLEELEKAKN 60
Score = 30.0 bits (68), Expect = 0.76
Identities = 14/25 (56%), Positives = 21/25 (84%), Gaps = 1/25 (4%)
Query: 280 EEEEEEEEEEEEEEEEEEEIG-GVV 303
EE+E E+EE+E +EE+E+E G G+V
Sbjct: 19 EEKEREKEEKERKEEKEKEWGKGLV 43
Score = 28.4 bits (64), Expect = 2.9
Identities = 11/20 (55%), Positives = 16/20 (80%)
Query: 280 EEEEEEEEEEEEEEEEEEEI 299
EE+ EE+E E+EE+E +EE
Sbjct: 15 EEKREEKEREKEEKERKEEK 34
Score = 27.3 bits (61), Expect = 7.3
Identities = 10/20 (50%), Positives = 16/20 (80%)
Query: 280 EEEEEEEEEEEEEEEEEEEI 299
E+ EE+E E+EE+E +EE+
Sbjct: 16 EKREEKEREKEEKERKEEKE 35
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 35.9 bits (83), Expect = 0.030
Identities = 17/57 (29%), Positives = 29/57 (50%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
E +V+ + EEE+EEE + + +E+E+EE++EE E + EI
Sbjct: 2 EEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEIS 58
Score = 35.5 bits (82), Expect = 0.039
Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 7/64 (10%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
E E + EEE+EEE +A G +++E E+E+E+EE++EE
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARS-------AGSDSDDDEVPAEDEDEDEEDDEEAESPA 53
Query: 304 VSAV 307
+ +
Sbjct: 54 KAEI 57
Score = 34.8 bits (80), Expect = 0.060
Identities = 17/41 (41%), Positives = 23/41 (56%), Gaps = 2/41 (4%)
Query: 263 AKETKLKEKKKLGGGGGEEEEEEEEEEEEE--EEEEEEIGG 301
AK + KK G G + EEEE+EE +EEE+ +GG
Sbjct: 113 AKGDQSASAKKAKGRGRHASKLTEEEEDEEYLKEEEDGLGG 153
Score = 31.3 bits (71), Expect = 0.82
Identities = 14/34 (41%), Positives = 18/34 (52%)
Query: 269 KEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
KK G G + EEEE+EE +EEE+ G
Sbjct: 120 SAKKAKGRGRHASKLTEEEEDEEYLKEEEDGLGG 153
Score = 29.4 bits (66), Expect = 3.0
Identities = 11/29 (37%), Positives = 15/29 (51%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
EE+ + EEE+EEE E + S D
Sbjct: 2 EEQVNTQANEEEDEEELEAVARSAGSDSD 30
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and
387 amino acids in length.
Length = 213
Score = 35.0 bits (81), Expect = 0.031
Identities = 17/71 (23%), Positives = 26/71 (36%), Gaps = 10/71 (14%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG---VVVS 305
EEE+ + E E E E E EEE E++ +G V+
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECT-------GPEPEPEPEPEEEPEKKSGMGPLLLVLAV 167
Query: 306 AVDFGSGGWWF 316
A+ G ++F
Sbjct: 168 ALIGGGAYYYF 178
Score = 32.3 bits (74), Expect = 0.19
Identities = 16/83 (19%), Positives = 28/83 (33%), Gaps = 30/83 (36%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKK-------------LGGGGG------------- 279
V E E E + + +KK L GGG
Sbjct: 128 PVCSVNMTECTGPEPEPEPEPEEEPEKKSGMGPLLLVLAVALIGGGAYYYFKFYKPKQQE 187
Query: 280 ----EEEEEEEEEEEEEEEEEEE 298
+++ +E + +E+EEE++E
Sbjct: 188 KGAPDDDLDEYDYGDEDEEEDDE 210
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 35.8 bits (82), Expect = 0.033
Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 6/44 (13%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
EE E+ + E E KA+E + +EK+K E+E E E E E E
Sbjct: 582 EEAVEKAKREAEQKAREEREREKEK------EKEREREREREAE 619
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression in
addition to DNA-based checkpoints.
Length = 76
Score = 32.6 bits (75), Expect = 0.034
Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 20/53 (37%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ + E EEEEE GE+ E E+EE+EEE+++++
Sbjct: 41 DYDSDAEWEEEEE--------------------GEDLESEDEEDEEEDDDDDM 73
Score = 28.0 bits (63), Expect = 1.8
Identities = 13/46 (28%), Positives = 24/46 (52%), Gaps = 16/46 (34%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
+ E EEEEE E+ + E+EE+EEE+++++ +
Sbjct: 45 DAEWEEEEEGEDLE----------------SEDEEDEEEDDDDDMD 74
>gnl|CDD|146016 pfam03179, V-ATPase_G, Vacuolar (H+)-ATPase G subunit. This family
represents the eukaryotic vacuolar (H+)-ATPase
(V-ATPase) G subunit. V-ATPases generate an acidic
environment in several intracellular compartments.
Correspondingly, they are found as membrane-attached
proteins in several organelles. They are also found in
the plasma membranes of some specialised cells.
V-ATPases consist of peripheral (V1) and membrane
integral (V0) heteromultimeric complexes. The G subunit
is part of the V1 subunit, but is also thought to be
strongly attached to the V0 complex. It may be involved
in the coupling of ATP degradation to H+ translocation.
Length = 105
Score = 33.3 bits (77), Expect = 0.034
Identities = 17/50 (34%), Positives = 27/50 (54%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
++ EE E+E EE + E K E + G G E++ E+E EE+ +E
Sbjct: 32 KQAKEEAEKEIEEYRAQREAEFKEFEAEHSGSRGELEKKIEKETEEKIDE 81
Score = 26.8 bits (60), Expect = 7.4
Identities = 16/50 (32%), Positives = 22/50 (44%), Gaps = 9/50 (18%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E+E E E K + +LK+ K EE E+E EE + E E
Sbjct: 13 EKEAAEIVNEARKRRAKRLKQAK---------EEAEKEIEEYRAQREAEF 53
>gnl|CDD|221821 pfam12871, PRP38_assoc, Pre-mRNA-splicing factor 38-associated
hydrophilic C-term. This domain is a hydrophilic region
found at the C-terminus of plant and metazoan
pre-mRNA-splicing factor 38 proteins. The function is
not known.
Length = 97
Score = 33.2 bits (76), Expect = 0.034
Identities = 10/55 (18%), Positives = 20/55 (36%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E ++ EEEE EEEE++E ++ + + + +
Sbjct: 7 EEDLDEEEESEEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKRRRRR 61
Score = 30.9 bits (70), Expect = 0.24
Identities = 16/31 (51%), Positives = 20/31 (64%)
Query: 281 EEEEEEEEEEEEEEEEEEIGGVVVSAVDFGS 311
EE+ +EEEE EEEE++EEI VD G
Sbjct: 7 EEDLDEEEESEEEEDDEEIRRKAERDVDRGR 37
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 35.0 bits (81), Expect = 0.035
Identities = 23/59 (38%), Positives = 33/59 (55%), Gaps = 16/59 (27%)
Query: 242 GGEREVHEE--EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
GG+ E EE E+ EEEEEE+ E+E+EE+E+EEEEE+++E
Sbjct: 273 GGDEEDLEELLEKAEEEEEEDD--------------YSESEDEDEEDEDEEEEEDDDEG 317
Score = 28.9 bits (65), Expect = 3.3
Identities = 15/50 (30%), Positives = 30/50 (60%), Gaps = 9/50 (18%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
E+ E+ EEEEEE++ +++++ E+EEEEE+++E ++
Sbjct: 279 LEELLEKAEEEEEEDDYSESED---------EDEEDEDEEEEEDDDEGDK 319
Score = 28.1 bits (63), Expect = 6.0
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E +E + K + K+K + ++ GG E+ EE E+ EEEEEE++
Sbjct: 250 EAIKEVIKKKGGDFKVKGEPEVVGGDEEDLEELLEKAEEEEEEDDY 295
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 35.0 bits (80), Expect = 0.036
Identities = 21/80 (26%), Positives = 33/80 (41%), Gaps = 16/80 (20%)
Query: 235 SENDYCKGGEREVHEEEEE---------EEEEEEETKAKETKLKEKK-------KLGGGG 278
+EN + +H EE + + K K+ +L K K
Sbjct: 160 TENQSFQPWPERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQ 219
Query: 279 GEEEEEEEEEEEEEEEEEEE 298
+EE+EEEE EEE ++EE +
Sbjct: 220 PQEEQEEEEVEEEAKQEEGQ 239
Score = 33.5 bits (76), Expect = 0.11
Identities = 20/64 (31%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 239 YCKGGEREVHEEEEEEEEEEEETKA--KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
GG +V + ++E+ + + +E K +EK+ +EE+EEEE EEE ++E
Sbjct: 184 LSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQ-------PQEEQEEEEVEEEAKQE 236
Query: 297 EEIG 300
E G
Sbjct: 237 EGQG 240
Score = 32.7 bits (74), Expect = 0.18
Identities = 14/51 (27%), Positives = 29/51 (56%), Gaps = 6/51 (11%)
Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
++E+ + +E + +T+ K+ ++ E+EEEE EEE ++EE +
Sbjct: 197 PKQEQLLSKLQEYLQEHKTEEKQPQE------EQEEEEVEEEAKQEEGQGT 241
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 33.1 bits (76), Expect = 0.039
Identities = 16/57 (28%), Positives = 27/57 (47%), Gaps = 13/57 (22%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEE------EEEEEEEEEEEEEI 299
E+ E+ E++ KA+E +LK++K E E + E +EE+E E
Sbjct: 31 EKRMEKRLEQQAIKAREKELKDEK-------EAERQRRIQAIKERRAAKEEKERYEK 80
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 35.3 bits (81), Expect = 0.039
Identities = 20/65 (30%), Positives = 34/65 (52%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+ E E+ E+E ++ E+E K KE + +K+L + E EEEEEE+ E+
Sbjct: 305 KLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEK 364
Query: 294 EEEEE 298
+E+
Sbjct: 365 LQEKL 369
Score = 34.9 bits (80), Expect = 0.058
Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 4/56 (7%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E + E+E EEEE+ E E + +EK K EEE EEE +EE E E
Sbjct: 769 ELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQ----EEELRALEEELKEEAELLEE 820
Score = 34.6 bits (79), Expect = 0.068
Identities = 21/56 (37%), Positives = 37/56 (66%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E+E+ + E+E ++E+EE + E +LKE + EEEE+ E+ +E+ E+ EEE+
Sbjct: 321 EKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEEL 376
Score = 34.2 bits (78), Expect = 0.10
Identities = 16/50 (32%), Positives = 26/50 (52%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
EEE EEE + +A+ + ++ +EEE EE E +EE++ E
Sbjct: 801 EEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLE 850
Score = 33.0 bits (75), Expect = 0.22
Identities = 15/51 (29%), Positives = 22/51 (43%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EEEEEE+ E+ + E +E E + +EEE E + E
Sbjct: 354 AEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEE 404
Score = 32.6 bits (74), Expect = 0.34
Identities = 17/54 (31%), Positives = 28/54 (51%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E +EE+ +EEEEE K +E + E EE+EE ++E ++E +
Sbjct: 948 EKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERL 1001
Score = 31.9 bits (72), Expect = 0.50
Identities = 15/51 (29%), Positives = 25/51 (49%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EEEEEE+ E+ + K ++ + + K + +EEE E + EE
Sbjct: 355 EEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEE 405
Score = 31.9 bits (72), Expect = 0.58
Identities = 14/52 (26%), Positives = 24/52 (46%), Gaps = 5/52 (9%)
Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EE + +E +E K++L E+EEE + +E +EEE+
Sbjct: 235 LNEERIDLLQELLRDEQEEIESSKQEL-----EKEEEILAQVLKENKEEEKE 281
Score = 31.9 bits (72), Expect = 0.59
Identities = 21/69 (30%), Positives = 36/69 (52%)
Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
E E ++ E++ +E+EEEEE + + +E++K E+E EEEE+
Sbjct: 724 LADKVQEAQDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEK 783
Query: 290 EEEEEEEEE 298
E+ + EEE
Sbjct: 784 TEKLKVEEE 792
Score = 31.5 bits (71), Expect = 0.73
Identities = 16/55 (29%), Positives = 27/55 (49%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E E+ + +EEE + + + E + E+EE+ +EEE EE E +
Sbjct: 790 EEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELK 844
Score = 31.5 bits (71), Expect = 0.76
Identities = 20/56 (35%), Positives = 30/56 (53%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E E E EEE+ + ++ K +E ++L EE++ E+ EEE E EEEI
Sbjct: 809 EELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEI 864
Score = 31.1 bits (70), Expect = 0.87
Identities = 14/63 (22%), Positives = 32/63 (50%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
++E+ +EEE + +E K +E + K +++ +EEEE + E + E + + +
Sbjct: 258 KQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKL 317
Query: 304 VSA 306
+
Sbjct: 318 KES 320
Score = 31.1 bits (70), Expect = 1.0
Identities = 16/50 (32%), Positives = 25/50 (50%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+EEE EEE + E +E+ + +EEE EE E +EE++
Sbjct: 800 QEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKL 849
Score = 30.3 bits (68), Expect = 1.7
Identities = 16/70 (22%), Positives = 25/70 (35%), Gaps = 5/70 (7%)
Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
E K E + EEEEE + K + + E EE+EE
Sbjct: 938 PEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIA-----EFEEKEERYN 992
Query: 290 EEEEEEEEEI 299
++E ++E
Sbjct: 993 KDELKKERLE 1002
Score = 30.3 bits (68), Expect = 1.8
Identities = 13/55 (23%), Positives = 27/55 (49%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E + + + +EEE + + + + ++ +EE+ +EEE EE E +E
Sbjct: 791 EEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKE 845
Score = 29.6 bits (66), Expect = 2.9
Identities = 21/79 (26%), Positives = 37/79 (46%), Gaps = 11/79 (13%)
Query: 232 IFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKE-----------KKKLGGGGGE 280
+ E + +E EEE+E++ +EEE K + +E K E
Sbjct: 260 ELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKE 319
Query: 281 EEEEEEEEEEEEEEEEEEI 299
E+E ++ E+E ++E+EEI
Sbjct: 320 SEKELKKLEKELKKEKEEI 338
Score = 29.2 bits (65), Expect = 3.8
Identities = 13/56 (23%), Positives = 26/56 (46%), Gaps = 4/56 (7%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E+ +E ++ EE E A+ E+ KL E + +E+ ++ E + +E
Sbjct: 168 EKRKKKERLKKLIEETENLAELIIDLEELKLQ----ELKLKEQAKKALEYYQLKEK 219
Score = 29.2 bits (65), Expect = 4.0
Identities = 18/65 (27%), Positives = 28/65 (43%), Gaps = 9/65 (13%)
Query: 244 EREVHEEEEEE---EEEEEETKAKETK------LKEKKKLGGGGGEEEEEEEEEEEEEEE 294
EE E E+E+EE +E + L K++LG E EE+EE +
Sbjct: 934 YESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNK 993
Query: 295 EEEEI 299
+E +
Sbjct: 994 DELKK 998
Score = 28.8 bits (64), Expect = 4.7
Identities = 12/63 (19%), Positives = 25/63 (39%), Gaps = 5/63 (7%)
Query: 241 KGGEREVHEEE-----EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
K R EEE E+ +++E K E + + + +E + +E+ ++
Sbjct: 152 KPERRLEIEEEAAGSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKAL 211
Query: 296 EEE 298
E
Sbjct: 212 EYY 214
Score = 28.4 bits (63), Expect = 6.1
Identities = 16/55 (29%), Positives = 26/55 (47%), Gaps = 8/55 (14%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
E EE+EE + + K+ +L+E+KK E E EE + +E + V
Sbjct: 982 AEFEEKEERYNKDELKKERLEEEKK--------ELLREIIEETCQRFKEFLELFV 1028
>gnl|CDD|178778 PLN03240, PLN03240, putative Low-temperature-induced protein;
Provisional.
Length = 626
Score = 35.2 bits (80), Expect = 0.043
Identities = 22/63 (34%), Positives = 33/63 (52%), Gaps = 2/63 (3%)
Query: 241 KGGEREVHEEE--EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+G ++ V +E E+ EE KA + EK LGGGGGE ++ + EE E+
Sbjct: 471 QGQDKGVSAKEFLTEKLSPGEEDKALSEVVAEKLHLGGGGGETAPPKKGIVTQSEEVEKR 530
Query: 299 IGG 301
+GG
Sbjct: 531 LGG 533
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 34.5 bits (80), Expect = 0.049
Identities = 15/48 (31%), Positives = 28/48 (58%)
Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
+ EEE +EE +E K ++ K + + KL EE+ + EE+E +++
Sbjct: 273 LKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQA 320
Score = 31.1 bits (71), Expect = 0.67
Identities = 15/52 (28%), Positives = 30/52 (57%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E ++ + EEE +EE + K +E K +E++ EE+ + EE+E +++
Sbjct: 269 EEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQA 320
Score = 30.7 bits (70), Expect = 0.97
Identities = 15/50 (30%), Positives = 27/50 (54%), Gaps = 7/50 (14%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EEEEE+ + + ++ + +EKK EE+++EE E + + EE
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKK-------EEKKKEEREAKLAKLSPEEQ 308
Score = 29.9 bits (68), Expect = 1.8
Identities = 19/54 (35%), Positives = 30/54 (55%), Gaps = 10/54 (18%)
Query: 249 EEEEEE----EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
EEEEE+ EEE + +A+E K ++KK+ E E + + EE+ + EE
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKE------EREAKLAKLSPEEQRKLEE 313
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 32.8 bits (74), Expect = 0.049
Identities = 11/37 (29%), Positives = 24/37 (64%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSGGWWF 316
+EEE++++E+EE+++E+++ V FG +F
Sbjct: 17 DEEEDDDDEDEEDDDEDDDEDDDEVPVPSFGEAMAYF 53
Score = 30.1 bits (67), Expect = 0.50
Identities = 8/21 (38%), Positives = 17/21 (80%)
Query: 278 GGEEEEEEEEEEEEEEEEEEE 298
G E+ + + +EEE++++E+EE
Sbjct: 8 GEEDSDSDSDEEEDDDDEDEE 28
Score = 29.0 bits (64), Expect = 1.2
Identities = 11/55 (20%), Positives = 29/55 (52%), Gaps = 19/55 (34%)
Query: 239 YCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+ +G E + +EEE++++E+ EE+++E+++E+++E
Sbjct: 5 FLEGEEDSDSDSDEEEDDDDED-------------------EEDDDEDDDEDDDE 40
Score = 28.6 bits (63), Expect = 1.5
Identities = 8/25 (32%), Positives = 18/25 (72%)
Query: 274 LGGGGGEEEEEEEEEEEEEEEEEEE 298
L G + + +EEE++++E+EE++
Sbjct: 6 LEGEEDSDSDSDEEEDDDDEDEEDD 30
Score = 28.6 bits (63), Expect = 1.8
Identities = 7/24 (29%), Positives = 18/24 (75%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
G + + +EEE++++E+EE+++
Sbjct: 8 GEEDSDSDSDEEEDDDDEDEEDDD 31
Score = 28.2 bits (62), Expect = 1.9
Identities = 7/19 (36%), Positives = 18/19 (94%)
Query: 280 EEEEEEEEEEEEEEEEEEE 298
+ +EEE++++E+EE+++E+
Sbjct: 15 DSDEEEDDDDEDEEDDDED 33
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 34.9 bits (80), Expect = 0.050
Identities = 18/57 (31%), Positives = 30/57 (52%), Gaps = 2/57 (3%)
Query: 246 EVHEE--EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
E+ EE ++ E E+EE+ + E+++ G G GE +E +E E + E E G
Sbjct: 198 ELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESG 254
Score = 34.1 bits (78), Expect = 0.098
Identities = 16/60 (26%), Positives = 28/60 (46%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
+ E E+EE+ ++ + E + +E+ + G G E +E E + E E E V D
Sbjct: 206 DTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDD 265
Score = 33.4 bits (76), Expect = 0.19
Identities = 13/67 (19%), Positives = 34/67 (50%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+ + D + E E E+ E E E +E + +E++ +++ G + ++++ +E +
Sbjct: 212 EEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESD 271
Query: 294 EEEEEIG 300
++ E G
Sbjct: 272 DDSETPG 278
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 34.8 bits (81), Expect = 0.054
Identities = 19/53 (35%), Positives = 32/53 (60%), Gaps = 7/53 (13%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
ERE+ ++ EE E +E + + +L+EKK E+ +EEE++ EE E+E
Sbjct: 529 ERELEQKAEEAEALLKEAEKLKEELEEKK-------EKLQEEEDKLLEEAEKE 574
Score = 28.6 bits (65), Expect = 4.9
Identities = 21/49 (42%), Positives = 31/49 (63%), Gaps = 8/49 (16%)
Query: 253 EEEEEEEETKAKET--KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EE E E E KA+E LKE +KL +EE EE++E+ +EEE++ +
Sbjct: 526 EELERELEQKAEEAEALLKEAEKL------KEELEEKKEKLQEEEDKLL 568
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 33.2 bits (76), Expect = 0.055
Identities = 12/24 (50%), Positives = 18/24 (75%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
G G +EEEEEEEE E++++ +E
Sbjct: 90 GTGHTRQEEEEEEEENEKQQQSDE 113
Score = 32.9 bits (75), Expect = 0.074
Identities = 11/23 (47%), Positives = 17/23 (73%)
Query: 276 GGGGEEEEEEEEEEEEEEEEEEE 298
G +EEEEEEEE E++++ +E
Sbjct: 92 GHTRQEEEEEEEENEKQQQSDEA 114
Score = 30.9 bits (70), Expect = 0.36
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEI 299
G EEEEEEEE E++++ +E ++
Sbjct: 92 GHTRQEEEEEEEENEKQQQSDEAQV 116
Score = 30.9 bits (70), Expect = 0.40
Identities = 12/29 (41%), Positives = 21/29 (72%), Gaps = 2/29 (6%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKE 270
G R+ EEEEEEEE E++ ++ E ++++
Sbjct: 92 GHTRQ--EEEEEEEENEKQQQSDEAQVQQ 118
>gnl|CDD|215541 PLN03020, PLN03020, low-temperature-induced protein; Provisional.
Length = 556
Score = 34.6 bits (79), Expect = 0.057
Identities = 16/62 (25%), Positives = 30/62 (48%), Gaps = 9/62 (14%)
Query: 247 VHEEEEEEEEEEEET-----KAKETKLKEKKKLGGGGGEEEEEEE----EEEEEEEEEEE 297
+H E E+ E+++ K K K+K+ G G +E + EE+++E+E
Sbjct: 18 LHSATEGEDHHEKKSVLKKVKDKAKKIKDTITKHGHGHHHDEHRVPDDHDLYEEDDDEDE 77
Query: 298 EI 299
E+
Sbjct: 78 EM 79
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 34.7 bits (80), Expect = 0.057
Identities = 20/113 (17%), Positives = 35/113 (30%), Gaps = 16/113 (14%)
Query: 196 FNMHGEVSYTYASELH------TACVFLDLDPVNRPDGYNKTIFQSENDYCK-----GGE 244
GE+ Y + F + + +KT ++ Y K G
Sbjct: 319 LRQKGELLYANLQLIEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGA 378
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+ + + E +E EK + G++ EE EE EE +
Sbjct: 379 KVNLDRQLSELKEAIAYYESAKTALEKAE-----GKKAIEEIREELIEEGLLK 426
Score = 30.4 bits (69), Expect = 1.4
Identities = 20/85 (23%), Positives = 39/85 (45%), Gaps = 12/85 (14%)
Query: 216 FLDLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLG 275
+LD+ P+ K ++ ++ + + + E E++ E+E K + KL
Sbjct: 251 YLDVVPLKAYADLEKLFNEALDEKFERDKIKQLASELEKKLEKE-------LKKLENKL- 302
Query: 276 GGGGEEEEEEEEEEEEEEEEEEEIG 300
E++E+E EE E+ EE + G
Sbjct: 303 ----EKQEDELEELEKAAEELRQKG 323
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 34.4 bits (79), Expect = 0.060
Identities = 12/55 (21%), Positives = 32/55 (58%), Gaps = 7/55 (12%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+R+ E+++ EE ++++ +E +LK+ +K E +E++++ EE ++
Sbjct: 80 QRKKKEQQQAEELQQKQAAEQE-RLKQLEK------ERLAAQEQKKQAEEAAKQA 127
Score = 34.0 bits (78), Expect = 0.079
Identities = 13/58 (22%), Positives = 32/58 (55%), Gaps = 6/58 (10%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K +R + +++E+++ EE + K+ +E+ K + E+E +E++++ EE
Sbjct: 72 KSAKRAEEQRKKKEQQQAEELQQKQAAEQERLK------QLEKERLAAQEQKKQAEEA 123
Score = 33.2 bits (76), Expect = 0.17
Identities = 10/55 (18%), Positives = 28/55 (50%), Gaps = 6/55 (10%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E+ +++ E+E ++ + + +E+KK + EE ++ ++++ EE
Sbjct: 89 AEELQQKQAAEQERLKQLEKERLAAQEQKK------QAEEAAKQAALKQKQAEEA 137
Score = 32.5 bits (74), Expect = 0.30
Identities = 9/50 (18%), Positives = 28/50 (56%), Gaps = 5/50 (10%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+++ EE ++++ + + K EK++L +E++++ EE ++ +
Sbjct: 86 QQQAEELQQKQAAEQERLKQLEKERL-----AAQEQKKQAEEAAKQAALK 130
Score = 29.8 bits (67), Expect = 2.2
Identities = 12/48 (25%), Positives = 27/48 (56%), Gaps = 9/48 (18%)
Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EE+ +++E+ +A+E L++K+ E+E ++ E+E +E
Sbjct: 78 EEQRKKKEQQQAEE--LQQKQ-------AAEQERLKQLEKERLAAQEQ 116
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 34.6 bits (80), Expect = 0.061
Identities = 17/62 (27%), Positives = 28/62 (45%), Gaps = 17/62 (27%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
+N++ E E EEEEEE + E +L+E + + + EE +E+
Sbjct: 142 KNEF----------ELELPEEEEEEPEEMEEELEEDA-------ADRDARKRAAEEAKEQ 184
Query: 296 EE 297
EE
Sbjct: 185 EE 186
Score = 28.5 bits (64), Expect = 5.1
Identities = 14/19 (73%), Positives = 14/19 (73%)
Query: 280 EEEEEEEEEEEEEEEEEEE 298
E EEEEEE EE EEE EE
Sbjct: 148 ELPEEEEEEPEEMEEELEE 166
Score = 28.1 bits (63), Expect = 7.0
Identities = 12/20 (60%), Positives = 14/20 (70%)
Query: 280 EEEEEEEEEEEEEEEEEEEI 299
EEEEE EE EEE EE+ +
Sbjct: 152 EEEEEPEEMEEELEEDAADR 171
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 33.3 bits (76), Expect = 0.063
Identities = 18/60 (30%), Positives = 33/60 (55%), Gaps = 4/60 (6%)
Query: 239 YCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
Y K E+E + E+E E++ +KE + EKKK +E++E ++ + E+ E+E
Sbjct: 38 YLKLLEKEGYAVPEKESAEKQVKSSKEDRKFEKKK----KLDEKKEIAKQRKREQREKEL 93
>gnl|CDD|178593 PLN03021, PLN03021, Low-temperature-induced protein; Provisional.
Length = 619
Score = 34.6 bits (78), Expect = 0.065
Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 7/74 (9%)
Query: 247 VHEEEEEEEEEEEETKA----KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
+H EE+E E+ +K KE K K L G + + E++++E +E++ E+ G
Sbjct: 20 IHHPEEDEHHEKGASKVLKKVKEKAKKIKNSLTKHGNGHDHDVEDDDDEYDEQDPEVHGA 79
Query: 303 VV---SAVDFGSGG 313
V SAV G G
Sbjct: 80 PVYESSAVRGGVTG 93
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 118
Score = 32.6 bits (75), Expect = 0.066
Identities = 15/60 (25%), Positives = 30/60 (50%), Gaps = 9/60 (15%)
Query: 246 EVHEEEEEEEEEEEETKAKE---------TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
E ++ EEEE+EE + E T+ K K E + +E+E+++++++E
Sbjct: 28 EEFDKREEEEKEEAKEARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKE 87
Score = 26.8 bits (60), Expect = 6.5
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVS 305
+ EEEE+EE +E E +E G V V+
Sbjct: 32 KREEEEKEEAKEARNEPDEDGFVTVT 57
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 33.9 bits (78), Expect = 0.066
Identities = 21/47 (44%), Positives = 24/47 (51%), Gaps = 12/47 (25%)
Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
EEE EE+E EEEE++EE E EEEEEEEE
Sbjct: 19 EEELEEDEFFW------------TYLLFEEEEDDEEFEIEEEEEEEE 53
Score = 33.5 bits (77), Expect = 0.11
Identities = 17/48 (35%), Positives = 28/48 (58%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
EEEE++EE E + +E + + +E E ++EEE E+E + EE
Sbjct: 35 EEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREE 82
Score = 30.8 bits (70), Expect = 0.67
Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 22/68 (32%)
Query: 232 IFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
+F+ E D + E EEEEEEEE + ++ E++E E ++
Sbjct: 33 LFEEEED-----DEEFEIEEEEEEEEVDS-----------------DFDDSEDDEPESDD 70
Query: 292 EEEEEEEI 299
EEE E+E+
Sbjct: 71 EEEGEKEL 78
Score = 30.8 bits (70), Expect = 0.77
Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 5/54 (9%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E ++EE E EEEEE + ++ + + E E ++EEE E+E + EE +
Sbjct: 36 EEEDDEEFEIEEEEEEEEVDSDFDDSE-----DDEPESDDEEEGEKELQREERL 84
Score = 30.0 bits (68), Expect = 1.5
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 9/52 (17%)
Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E EEEEEEEE ++ +++ E E ++EEE E+E + EE
Sbjct: 43 FEIEEEEEEEEVDSDFDDSEDDE---------PESDDEEEGEKELQREERLK 85
Score = 29.7 bits (67), Expect = 1.6
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKK 273
E E E ++EEE E+ +E +LK+KK+
Sbjct: 62 EDDEPESDDEEEGEKELQREERLKKKKR 89
Score = 27.3 bits (61), Expect = 9.9
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKL 274
+E E ++EEE E + + + +KKK
Sbjct: 63 DDEPESDDEEEGEKELQREERLKKKKR 89
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 33.8 bits (78), Expect = 0.071
Identities = 18/56 (32%), Positives = 32/56 (57%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
++E EE + EE EE K++E + +E++ EEE++E E E + EE ++
Sbjct: 2 SDKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKD 57
Score = 30.0 bits (68), Expect = 1.0
Identities = 20/56 (35%), Positives = 30/56 (53%), Gaps = 8/56 (14%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E EE EE E+ EE +A+E + +E+ E EEE++E E E + EE+
Sbjct: 8 TEEPDAEETEEAEKSEEEEAEEEEPEEE--------NELEEEQQEIAELEAQLEEL 55
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 33.0 bits (74), Expect = 0.082
Identities = 26/66 (39%), Positives = 32/66 (48%), Gaps = 6/66 (9%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
E E EE E EEE EEE + +ET +E + E E E E E E E E E E G
Sbjct: 71 EPEEEGEEEPEPEEEGEEEPEPEETGEEEPEP------EPEPEPEPEPEPEPEPEPEPGA 124
Query: 302 VVVSAV 307
+ +V
Sbjct: 125 ATLKSV 130
Score = 32.2 bits (72), Expect = 0.15
Identities = 25/65 (38%), Positives = 31/65 (47%), Gaps = 4/65 (6%)
Query: 238 DYCKGGEREVHEEEEEE----EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
D G + + EEEEE EE EEE + +E +E + G E E E E E E E
Sbjct: 53 DDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEP 112
Query: 294 EEEEE 298
E E E
Sbjct: 113 EPEPE 117
Score = 32.2 bits (72), Expect = 0.18
Identities = 22/65 (33%), Positives = 31/65 (47%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+ E G + + ++E EEEEE + +E +E + G E E EE EEE E
Sbjct: 43 KGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEP 102
Query: 294 EEEEE 298
E E E
Sbjct: 103 EPEPE 107
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 34.1 bits (79), Expect = 0.085
Identities = 15/50 (30%), Positives = 29/50 (58%), Gaps = 6/50 (12%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E EEEE E + A+ KL++K++ + E++ E+EE E+ +++
Sbjct: 395 EGEEEEGENGNLSPAERKKLRKKQR------KAEKKAEKEEAEKAAAKKK 438
Score = 31.8 bits (73), Expect = 0.43
Identities = 14/54 (25%), Positives = 23/54 (42%), Gaps = 5/54 (9%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E E E E E K ++ + K +KK E+EE E+ +++ E
Sbjct: 395 EGEEEEGENGNLSPAERKKLRKKQRKAEKKA-----EKEEAEKAAAKKKAEAAA 443
Score = 31.4 bits (72), Expect = 0.61
Identities = 10/39 (25%), Positives = 19/39 (48%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
E++ E+EE E A + K + K G E ++ + +
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459
Score = 29.5 bits (67), Expect = 2.9
Identities = 10/39 (25%), Positives = 18/39 (46%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
E++ E+EE KA K E G + E ++ + +
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459
Score = 28.7 bits (65), Expect = 3.9
Identities = 12/39 (30%), Positives = 21/39 (53%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE 287
E++ E+EE E+ AK+ KK G GE ++ + +
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 33.5 bits (77), Expect = 0.094
Identities = 22/67 (32%), Positives = 30/67 (44%), Gaps = 10/67 (14%)
Query: 243 GEREVHEEEEEEEEEEEETKAKE-------TKLKEKKKLGG---GGGEEEEEEEEEEEEE 292
GE ++ +E+ E KE LK ++K G + E EEEEEE
Sbjct: 110 GESDLENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEI 169
Query: 293 EEEEEEI 299
EEEEE+
Sbjct: 170 AEEEEEV 176
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 33.9 bits (78), Expect = 0.095
Identities = 14/76 (18%), Positives = 35/76 (46%)
Query: 226 DGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEE 285
G+N I Q +++ G + + +E+E E++++++ + + +K + +
Sbjct: 40 KGFNADIPQPKDEGIIGDKMKAYEQEALEQKQKDKKRTLQDLADSFEKSDKESQDYSSSD 99
Query: 286 EEEEEEEEEEEEEIGG 301
+EE +E E G
Sbjct: 100 SGFDEEPDEPAETAGS 115
Score = 28.2 bits (63), Expect = 6.8
Identities = 12/58 (20%), Positives = 20/58 (34%), Gaps = 13/58 (22%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
RE+ E + +EE+ + +++ EE E E E EE
Sbjct: 130 NRELGSFYEYPKTDEEK--------ELLREV-----EELESRLATEPSPAPELEEQLA 174
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 34.0 bits (78), Expect = 0.10
Identities = 18/76 (23%), Positives = 30/76 (39%), Gaps = 2/76 (2%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
++D E E EE E +E++ E + + + G E E+ E +E +
Sbjct: 235 DGDDDQPDNNEDS--EAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESD 292
Query: 294 EEEEEIGGVVVSAVDF 309
E+ E G A F
Sbjct: 293 EDTETPGEDARPATPF 308
Score = 32.9 bits (75), Expect = 0.22
Identities = 16/55 (29%), Positives = 26/55 (47%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
E ++ EE+ +EE+ + E + G E +E EE+E E + E E G
Sbjct: 221 ETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEG 275
Score = 32.1 bits (73), Expect = 0.43
Identities = 16/67 (23%), Positives = 29/67 (43%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+ + D G + + E+ E EE + E++ E + G G E E + E E+ E
Sbjct: 228 EEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSE 287
Query: 294 EEEEEIG 300
+E +
Sbjct: 288 SDESDED 294
Score = 31.4 bits (71), Expect = 0.74
Identities = 16/57 (28%), Positives = 28/57 (49%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSA 306
EE+ +EE+ ++ + + E + G +E EE+E E + E EE E+ S
Sbjct: 228 EEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASE 284
Score = 30.2 bits (68), Expect = 1.5
Identities = 12/56 (21%), Positives = 29/56 (51%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
G + + +EE+ ++++ ++++ ++ G EE+E E + E EE E +
Sbjct: 223 GDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMD 278
Score = 27.9 bits (62), Expect = 9.1
Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 12/56 (21%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
G+ + E+ +EE+ ++++ E E EE E +E EE+E E
Sbjct: 223 GDDGIEEDADEEDGDDDQPDNNEDS------------EAGREESEGSDESEEDEAE 266
>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6. The VP6 protein a
minor protein in the core of the virion is probably the
viral helicase.
Length = 322
Score = 33.9 bits (77), Expect = 0.10
Identities = 16/61 (26%), Positives = 33/61 (54%), Gaps = 2/61 (3%)
Query: 254 EEEEEEETKAKETKLKEK--KKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGS 311
+ E E+K KE K ++ +++ G GE++++ +EE ++E E+ + + V GS
Sbjct: 26 DWFESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETEDASVDRRTHTEVGSGS 85
Query: 312 G 312
Sbjct: 86 S 86
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 33.9 bits (78), Expect = 0.11
Identities = 13/63 (20%), Positives = 31/63 (49%)
Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
Y + +E EEE A + K+KK+ +++ E+++++++E++E +
Sbjct: 25 SYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPK 84
Query: 298 EIG 300
G
Sbjct: 85 SEG 87
Score = 30.5 bits (69), Expect = 1.2
Identities = 19/91 (20%), Positives = 37/91 (40%), Gaps = 6/91 (6%)
Query: 229 NKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKL------KEKKKLGGGGGEEE 282
T + D +++ +++++++E++E ETKL K KK +
Sbjct: 52 TSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111
Query: 283 EEEEEEEEEEEEEEEEIGGVVVSAVDFGSGG 313
E+ + + E E V + A SGG
Sbjct: 112 NEDVDNAFNKIAELAEKSNVYIGAHVSASGG 142
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 33.2 bits (76), Expect = 0.11
Identities = 13/48 (27%), Positives = 21/48 (43%)
Query: 261 TKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
T K K+ KK EEE++ E ++++ EE E + D
Sbjct: 58 TPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGD 105
Score = 27.4 bits (61), Expect = 7.7
Identities = 10/60 (16%), Positives = 26/60 (43%), Gaps = 4/60 (6%)
Query: 240 CKGGEREVHEEEEEEEEEEEETKAKE---TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
K E+ EEE++ E ++++T+ E +E + + E++ ++ +
Sbjct: 70 KKDKEKLT-EEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSDIDNEDDMDDSD 128
>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1. This
family consists of several acidic phosphoprotein
precursor PCEMA1 sequences which appear to be found
exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
that is associated with the membrane of the infected
erythrocyte throughout the entire intraerythrocytic
cycle. The exact function of this family is unclear.
Length = 286
Score = 33.3 bits (76), Expect = 0.11
Identities = 19/63 (30%), Positives = 31/63 (49%), Gaps = 5/63 (7%)
Query: 243 GEREVHEEEEEEE-----EEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
GE E +++E E E ++ E + K++ + + G E E E+E E+E EE
Sbjct: 213 GENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEE 272
Query: 298 EIG 300
E G
Sbjct: 273 EAG 275
>gnl|CDD|223731 COG0658, ComEC, Predicted membrane metal-binding protein [General
function prediction only].
Length = 453
Score = 33.6 bits (77), Expect = 0.11
Identities = 13/66 (19%), Positives = 24/66 (36%), Gaps = 5/66 (7%)
Query: 63 IIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATD 122
+IV +I + G N L+ +V+F L+++ L +A LL
Sbjct: 393 LIVSPLISLISLPGLLANFLLSPLVSF-----LIVPLILLGLLLAPLLIGAALARVRPGV 447
Query: 123 YVTTCW 128
+
Sbjct: 448 FGLIQL 453
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 33.9 bits (78), Expect = 0.11
Identities = 20/55 (36%), Positives = 25/55 (45%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
ER E E EE EE + KE K++L EE E+ E EE +EE
Sbjct: 309 ERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEEL 363
Score = 33.1 bits (76), Expect = 0.24
Identities = 20/56 (35%), Positives = 32/56 (57%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
ER++ E + E EEE + +++L+E ++ EE EE +E EE EEE E +
Sbjct: 715 ERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESL 770
Score = 31.6 bits (72), Expect = 0.70
Identities = 17/55 (30%), Positives = 23/55 (41%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ E+ E E EE E+ E +E ++ EE EE E EE E E
Sbjct: 336 KEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEA 390
Score = 31.2 bits (71), Expect = 0.74
Identities = 20/53 (37%), Positives = 26/53 (49%), Gaps = 2/53 (3%)
Query: 249 EEEEEEEEEEEETKAKETKLKEK--KKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EE E EE E+ A+ + KE+ +KL E EE E EE E E E+
Sbjct: 340 EERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAEL 392
Score = 31.2 bits (71), Expect = 0.91
Identities = 17/66 (25%), Positives = 30/66 (45%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+ EN+ + ER +E+ E +EE + + + ++ L +EE EE+ E
Sbjct: 313 ELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLE 372
Query: 294 EEEEEI 299
E EE
Sbjct: 373 ELEELF 378
Score = 30.8 bits (70), Expect = 1.1
Identities = 15/52 (28%), Positives = 21/52 (40%)
Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+ E +EE EE E + +L E EE+ EE EE E +
Sbjct: 330 EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEAL 381
Score = 30.5 bits (69), Expect = 1.4
Identities = 18/50 (36%), Positives = 27/50 (54%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E EEE E+ +++ +L+E+ + EE +E EE EEE E EE
Sbjct: 723 RELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEE 772
Score = 30.5 bits (69), Expect = 1.4
Identities = 23/54 (42%), Positives = 35/54 (64%), Gaps = 6/54 (11%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E+ +E EE +EE EE +A++ +L+++ K E EEE+EE EEE E E E+
Sbjct: 853 ELEKELEELKEELEELEAEKEELEDELK------ELEEEKEELEEELRELESEL 900
Score = 29.3 bits (66), Expect = 3.7
Identities = 23/68 (33%), Positives = 36/68 (52%), Gaps = 1/68 (1%)
Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEK-KKLGGGGGEEEEEEEEEEEE 291
++E + E+ + EE E + EE K + L+E+ ++L E EEE EE EEE
Sbjct: 693 LKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEE 752
Query: 292 EEEEEEEI 299
EE +E +
Sbjct: 753 LEELQERL 760
Score = 29.3 bits (66), Expect = 3.8
Identities = 22/52 (42%), Positives = 28/52 (53%), Gaps = 1/52 (1%)
Query: 249 EEEEEEEEEEEETKAKETKLKEK-KKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E EE EEE E + KLKE+ ++L +EE EE EEE EE E +
Sbjct: 758 ERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRL 809
Score = 28.9 bits (65), Expect = 4.0
Identities = 16/53 (30%), Positives = 21/53 (39%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E EE E+ E + + +L+EK EE E EE E E E
Sbjct: 341 ERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELA 393
Score = 28.5 bits (64), Expect = 5.4
Identities = 26/58 (44%), Positives = 37/58 (63%), Gaps = 5/58 (8%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKE-KKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
E+E+ E EEE EE EE+ E +L+E +K+L E +EE EE E E+EE E+E+
Sbjct: 827 EQEIEELEEEIEELEEKLDELEEELEELEKELE----ELKEELEELEAEKEELEDELK 880
Score = 27.8 bits (62), Expect = 9.2
Identities = 19/52 (36%), Positives = 28/52 (53%), Gaps = 3/52 (5%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
EEE EEE + + +L+E +K E + E EE EE EE +EE+ +
Sbjct: 245 EEELSRLEEELEELQEELEEAEK---EIEELKSELEELREELEELQEELLEL 293
Score = 27.8 bits (62), Expect = 9.7
Identities = 19/53 (35%), Positives = 28/53 (52%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
RE+ E E +EE E+ + + +L+ K + E EEE EEE E+ E E
Sbjct: 894 RELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETE 946
>gnl|CDD|216868 pfam02084, Bindin, Bindin.
Length = 239
Score = 33.3 bits (76), Expect = 0.12
Identities = 16/45 (35%), Positives = 21/45 (46%), Gaps = 7/45 (15%)
Query: 243 GEREVHE-------EEEEEEEEEEETKAKETKLKEKKKLGGGGGE 280
G+ EV E EEEEEEE++ ++ L GGGG
Sbjct: 146 GDPEVREQVLSAMQEEEEEEEQDAANGVRDNVLNNLNNAPGGGGY 190
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 33.9 bits (77), Expect = 0.13
Identities = 11/52 (21%), Positives = 24/52 (46%), Gaps = 9/52 (17%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
R E +E E ++ T+ + G + + EE++++ EEE+ +
Sbjct: 87 RSFQEPVSQESEVQDNTEQNQ---------DTKGSKTDSEEDDDDSEEEDNK 129
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 33.0 bits (75), Expect = 0.13
Identities = 14/65 (21%), Positives = 30/65 (46%), Gaps = 1/65 (1%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
E E H EEE + + + E ++KE+ + ++EE+ +E + +E+ ++
Sbjct: 81 EAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDED-KFLL 139
Query: 304 VSAVD 308
D
Sbjct: 140 AEDSD 144
Score = 30.3 bits (68), Expect = 1.2
Identities = 14/55 (25%), Positives = 27/55 (49%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ +E+ +E E+ E TKA+ + E+ E+ + +EE +E EE+
Sbjct: 182 SEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKEELAKEPVEEQ 236
Score = 29.5 bits (66), Expect = 1.8
Identities = 12/53 (22%), Positives = 23/53 (43%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
++E +E + +E++ E ++ L G EE E+ EE E+
Sbjct: 120 KQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQ 172
Score = 29.5 bits (66), Expect = 2.1
Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 6/60 (10%)
Query: 244 EREVHEEEEEEEEE------EEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E EV E+ + E+ EE+ KE +L E K L ++ +E E + EE E+
Sbjct: 102 EEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETED 161
Score = 28.4 bits (63), Expect = 4.3
Identities = 13/59 (22%), Positives = 25/59 (42%), Gaps = 5/59 (8%)
Query: 244 EREVHEEEEE-----EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E+ V ++EE E + +E+ ++++ G EE E+ EE E+
Sbjct: 114 EKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQ 172
Score = 27.6 bits (61), Expect = 7.5
Identities = 14/58 (24%), Positives = 28/58 (48%), Gaps = 6/58 (10%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE--EEEEEEEEEEIGGVVV 304
EEE +E+ + K +K +E G G E + +E++ E+ ++ +E + V
Sbjct: 102 EEEVKEQLQSLLEKIVVSKQEE----DGPGKEPQLDEDKFLLAEDSDDRQETLEAGKV 155
Score = 27.6 bits (61), Expect = 7.6
Identities = 17/76 (22%), Positives = 34/76 (44%)
Query: 224 RPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEE 283
+ D + + +D + E EE E+ EET +++ K K+K E+ +
Sbjct: 131 QLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSK 190
Query: 284 EEEEEEEEEEEEEEEI 299
E E+ E + E +++
Sbjct: 191 EPVEKAERTKAETDDV 206
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 31.8 bits (72), Expect = 0.13
Identities = 16/47 (34%), Positives = 28/47 (59%), Gaps = 14/47 (29%)
Query: 268 LKEKKKLGGG--------------GGEEEEEEEEEEEEEEEEEEEIG 300
+K+ + +GGG G + E ++EE++EEEEEEE+++G
Sbjct: 61 MKKLQNIGGGVAAAAAPAAGAATAGAKAEAKKEEKKEEEEEEEDDLG 107
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 33.8 bits (78), Expect = 0.13
Identities = 22/75 (29%), Positives = 34/75 (45%), Gaps = 9/75 (12%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE------EEEEEEEEEEEIGGV 302
+E E EE EEET + E K G E+E EE EE+E+ ++ E++ +
Sbjct: 227 KETSETEEREEETDVEIETTSETK---GTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Query: 303 VVSAVDFGSGGWWFE 317
+ +WFE
Sbjct: 284 EILKEKKDEELFWFE 298
>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
represents the eukaryotic large ribosomal protein P2.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P2 is located in the L12 stalk, with proteins
P1, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers). Bacteria may have four or six copies
of L7/L12 (two or three homodimers) depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plants have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site in the C-terminal region
of P2. TCS inactivates the ribosome by depurinating a
specific adenine in the sarcin-ricin loop of 28S rRNA.
Length = 109
Score = 31.9 bits (73), Expect = 0.14
Identities = 8/26 (30%), Positives = 16/26 (61%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIG 300
++EE++EE EEE ++++G
Sbjct: 79 AAAAAAAAKKEEKKEESEEESDDDMG 104
Score = 31.5 bits (72), Expect = 0.16
Identities = 7/24 (29%), Positives = 12/24 (50%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
++EE++EE EEE +
Sbjct: 77 AAAAAAAAAAKKEEKKEESEEESD 100
Score = 30.7 bits (70), Expect = 0.30
Identities = 7/24 (29%), Positives = 13/24 (54%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
++EE++EE EEE ++
Sbjct: 78 AAAAAAAAAKKEEKKEESEEESDD 101
Score = 30.7 bits (70), Expect = 0.31
Identities = 7/24 (29%), Positives = 11/24 (45%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
++EE++EE EEE
Sbjct: 76 AAAAAAAAAAAKKEEKKEESEEES 99
Score = 28.4 bits (64), Expect = 1.9
Identities = 7/24 (29%), Positives = 11/24 (45%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
++EE++EE EEE
Sbjct: 75 AAAAAAAAAAAAKKEEKKEESEEE 98
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 33.1 bits (76), Expect = 0.15
Identities = 8/22 (36%), Positives = 11/22 (50%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGG 301
EEEEEEE+++G
Sbjct: 285 AAAAAAAAPAEEEEEEEDDMGF 306
Score = 33.1 bits (76), Expect = 0.16
Identities = 8/22 (36%), Positives = 9/22 (40%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGG 301
EEEEEEE+ G
Sbjct: 284 AAAAAAAAAPAEEEEEEEDDMG 305
Score = 30.8 bits (70), Expect = 0.87
Identities = 8/22 (36%), Positives = 10/22 (45%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGG 301
EEEEEEE++ G
Sbjct: 286 AAAAAAAPAEEEEEEEDDMGFG 307
Score = 30.0 bits (68), Expect = 1.6
Identities = 7/20 (35%), Positives = 7/20 (35%)
Query: 280 EEEEEEEEEEEEEEEEEEEI 299
EEEEEEE
Sbjct: 283 AAAAAAAAAAPAEEEEEEED 302
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 32.7 bits (75), Expect = 0.15
Identities = 14/46 (30%), Positives = 27/46 (58%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
E+ E EEEE+ + + +E L+ K +++ +E ++EE+EE
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165
Score = 30.4 bits (69), Expect = 1.1
Identities = 12/46 (26%), Positives = 24/46 (52%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
E+ E EEEE+ +E + + +++ +E ++EE+EE
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165
Score = 30.0 bits (68), Expect = 1.1
Identities = 15/45 (33%), Positives = 23/45 (51%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
+ E EEE+ +EEE E E+ +L E ++EE+EE
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165
Score = 30.0 bits (68), Expect = 1.2
Identities = 15/46 (32%), Positives = 26/46 (56%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
E+ EEEE+ +EEE E + K + ++ +E ++EE+EE
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 32.8 bits (75), Expect = 0.15
Identities = 15/46 (32%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E+E E EEEE K K+ K + KK+ +E+++++E+ E +
Sbjct: 143 VEKEAEVEEEEKKEKKKKKEVKKEK-----KEKKDKKEKMVEPKGS 183
Score = 31.2 bits (71), Expect = 0.49
Identities = 18/64 (28%), Positives = 32/64 (50%), Gaps = 12/64 (18%)
Query: 243 GEREVHEEEEEEEEE------EEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
E E E+E E E + +E K K+KKK E ++E++E+++++E+
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKK------EVKKEKKEKKDKKEKMV 178
Query: 297 EEIG 300
E G
Sbjct: 179 EPKG 182
Score = 30.8 bits (70), Expect = 0.54
Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 5/49 (10%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E ET KET K +K+ E EEEE++E+++++E ++E
Sbjct: 123 GPPSELGSESETSEKETTAKVEKEA-----EVEEEEKKEKKKKKEVKKE 166
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell cycle
progression and biogenesis of 60S ribosomal subunits.
The Schistosoma mansoni Mak16 has been shown to target
protein transport to the nucleolus.
Length = 97
Score = 31.4 bits (71), Expect = 0.15
Identities = 23/58 (39%), Positives = 28/58 (48%), Gaps = 14/58 (24%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K E E EE +EEEEEEEE + GE E ++EE EEE E+ E
Sbjct: 53 KALEAEESEENDEEEEEEEEEED--------------EGEIEYVSDDEELEEEIEDLE 96
Score = 31.0 bits (70), Expect = 0.24
Identities = 17/30 (56%), Positives = 19/30 (63%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVDF 309
E EE EE +EEEEEEEEEE G + D
Sbjct: 56 EAEESEENDEEEEEEEEEEDEGEIEYVSDD 85
Score = 30.6 bits (69), Expect = 0.28
Identities = 17/29 (58%), Positives = 20/29 (68%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
EE EE +EEEEEEEEEE+E VS +
Sbjct: 58 EESEENDEEEEEEEEEEDEGEIEYVSDDE 86
Score = 30.2 bits (68), Expect = 0.38
Identities = 16/32 (50%), Positives = 20/32 (62%)
Query: 281 EEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSG 312
EE EE +EEEEEEEEEE+ G + + D
Sbjct: 58 EESEENDEEEEEEEEEEDEGEIEYVSDDEELE 89
Score = 28.3 bits (63), Expect = 1.9
Identities = 17/28 (60%), Positives = 19/28 (67%)
Query: 271 KKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
KK L EE +EEEEEEEEEE+E E
Sbjct: 52 KKALEAEESEENDEEEEEEEEEEDEGEI 79
Score = 27.5 bits (61), Expect = 3.1
Identities = 13/29 (44%), Positives = 17/29 (58%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
+EEEEEEEEEE+E E E + +
Sbjct: 63 NDEEEEEEEEEEDEGEIEYVSDDEELEEE 91
Score = 27.1 bits (60), Expect = 5.0
Identities = 13/29 (44%), Positives = 17/29 (58%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
EEEEEEEEEE+E E E + ++
Sbjct: 65 EEEEEEEEEEDEGEIEYVSDDEELEEEIE 93
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 33.3 bits (76), Expect = 0.16
Identities = 16/50 (32%), Positives = 27/50 (54%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E ++ + ++ + + + ++ G EEE EE EE+EEEE EEE
Sbjct: 50 PETSDDPYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEES 99
Score = 32.1 bits (73), Expect = 0.35
Identities = 20/53 (37%), Positives = 26/53 (49%), Gaps = 7/53 (13%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
E++E E + E +E+ EE EE+EEEE EEE EE E G
Sbjct: 63 FGEDDEGRIEVRISEDGEEEE-------VEEGEEDEEEEGEEESEEFEPLGDT 108
Score = 30.2 bits (68), Expect = 1.5
Identities = 26/74 (35%), Positives = 29/74 (39%), Gaps = 18/74 (24%)
Query: 220 DPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGG 279
+P D Y E+D G EV E+ EEEE EE G
Sbjct: 49 EPETSDDPYGNPDPFGEDD---EGRIEVRISEDGEEEEVEE---------------GEED 90
Query: 280 EEEEEEEEEEEEEE 293
EEEE EEE EE E
Sbjct: 91 EEEEGEEESEEFEP 104
Score = 28.6 bits (64), Expect = 4.4
Identities = 27/69 (39%), Positives = 33/69 (47%), Gaps = 10/69 (14%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
G E EV E EE+EEEE EE + L G+ EE E E+ EE EE
Sbjct: 79 GEEEEVEEGEEDEEEEGEEESEEFEPL----------GDTPEELTEASEQLEEHEEGFQA 128
Query: 302 VVVSAVDFG 310
+V AV+ G
Sbjct: 129 MVEQAVERG 137
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 33.0 bits (75), Expect = 0.17
Identities = 17/54 (31%), Positives = 24/54 (44%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ E EE EEE+ AK K +++ E EE EEEEE ++
Sbjct: 109 AKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDK 162
Score = 31.1 bits (70), Expect = 0.89
Identities = 20/53 (37%), Positives = 25/53 (47%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
RE EE EEEEE ++ KAK K +++ E E EE EEE
Sbjct: 147 REGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEE 199
Score = 30.3 bits (68), Expect = 1.5
Identities = 19/51 (37%), Positives = 24/51 (47%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
RE EE EEE+ + KA + L E EE EEEEE ++E
Sbjct: 113 REGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKE 163
Score = 28.8 bits (64), Expect = 4.4
Identities = 16/47 (34%), Positives = 21/47 (44%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E EE EEE+ KAK K +++ E EE EEE+
Sbjct: 79 REGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKA 125
Score = 28.4 bits (63), Expect = 6.1
Identities = 16/56 (28%), Positives = 24/56 (42%), Gaps = 6/56 (10%)
Query: 245 REVHEEEEEEEEEE------EETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
RE EE EEE+ + KAK L ++K+ G EEE+ + +
Sbjct: 79 REGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAA 134
Score = 28.4 bits (63), Expect = 6.2
Identities = 13/48 (27%), Positives = 26/48 (54%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+ E++ E+ ++ A+ K + +K+L G E + EEE E+E+
Sbjct: 2 DPEKDLEDLKKEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALP 49
>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
synthase complex, beta subunit. Nomenclature follows
the description for Methanosarcina thermophila. The
CO-methylating acetyl-CoA synthase is considered the
defining enzyme of the Wood-Ljungdahl pathway, used for
acetate catabolism by sulfate reducing bacteria but for
acetate biosynthesis by acetogenic bacteria such as
oorella thermoacetica (f. Clostridium thermoaceticum)
[Energy metabolism, Chemoautotrophy].
Length = 458
Score = 33.3 bits (76), Expect = 0.17
Identities = 18/48 (37%), Positives = 23/48 (47%), Gaps = 2/48 (4%)
Query: 255 EEEEEETKAKETKLKEKKK--LGGGGGEEEEEEEEEEEEEEEEEEEIG 300
EE+ + T LKEK + E +EEE EEEEE + EE
Sbjct: 375 EEDAKTTDELRKFLKEKGHPVVKRVVREVDEEEIEEEEEAMQPEEMEM 422
>gnl|CDD|130217 TIGR01147, V_ATP_synt_G, vacuolar ATP synthase, subunit G. This
model describes the vacuolar ATP synthase G subunit in
eukaryotes and includes members from diverse groups
e.g., fungi, plants, parasites etc. V-ATPases are
multi-subunit enzymes composed of two functional
domains: A transmembrane Vo domain and a peripheral
catalytic domain V1. The G subunit is one of the
subunits of the catalytic domain. V-ATPases are
responsible for the acidification of endosomes and
lysosomes, which are part of the central vacuolar system
[Energy metabolism, ATP-proton motive force
interconversion].
Length = 113
Score = 31.3 bits (71), Expect = 0.18
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
++ EE ++E E+ ++ + KE K E K LGG G EE+ E E + + E
Sbjct: 34 KQAKEEAQKEVEKYKQQREKEFKEFEAKHLGGNGAAEEKAEAETQAKIRE 83
>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
Length = 196
Score = 32.6 bits (74), Expect = 0.18
Identities = 13/51 (25%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EE E+E +EEE L +++ G +E+ ++ + +++ +E EE+
Sbjct: 2 EEVEKEINKEEEKDVNN--LSSNEQMEGPPEDEQAQQNQPQQQTVDEIEEL 50
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 29.6 bits (67), Expect = 0.19
Identities = 13/19 (68%), Positives = 16/19 (84%)
Query: 280 EEEEEEEEEEEEEEEEEEE 298
EEE+ E+EEE E+EEE EE
Sbjct: 22 EEEKREDEEENEDEEEGEE 40
Score = 29.2 bits (66), Expect = 0.32
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 280 EEEEEEEEEEEEEEEEEEE 298
E+EEE E+EEE EE+ E +
Sbjct: 27 EDEEENEDEEEGEEQSEVK 45
Score = 28.5 bits (64), Expect = 0.50
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 280 EEEEEEEEEEEEEEEEEEE 298
E+EEE E+EEE EE+ E
Sbjct: 26 REDEEENEDEEEGEEQSEV 44
Score = 28.5 bits (64), Expect = 0.53
Identities = 12/20 (60%), Positives = 15/20 (75%)
Query: 281 EEEEEEEEEEEEEEEEEEIG 300
EEE+ E+EEE E+EEE E
Sbjct: 22 EEEKREDEEENEDEEEGEEQ 41
Score = 28.5 bits (64), Expect = 0.56
Identities = 12/19 (63%), Positives = 16/19 (84%)
Query: 280 EEEEEEEEEEEEEEEEEEE 298
EE+ E+EEE E+EEE EE+
Sbjct: 23 EEKREDEEENEDEEEGEEQ 41
Score = 26.9 bits (60), Expect = 1.6
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 280 EEEEEEEEEEEEEEEEEEE 298
E+ E+EEE E+EEE EE+
Sbjct: 24 EKREDEEENEDEEEGEEQS 42
Score = 26.5 bits (59), Expect = 2.9
Identities = 9/24 (37%), Positives = 16/24 (66%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKK 272
EE+ E+EEE E+ + E + + K+
Sbjct: 23 EEKREDEEENEDEEEGEEQSEVKR 46
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 33.1 bits (76), Expect = 0.20
Identities = 27/53 (50%), Positives = 35/53 (66%), Gaps = 2/53 (3%)
Query: 249 EEEEEEEEEEEETKAKETKLKE-KKKLGGGGGEEEEEEEEEEEEEEEE-EEEI 299
EE EE E EE KAK+ +L+ KK+L G E+ E+E EE E+ +EE EEEI
Sbjct: 355 EELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEI 407
Score = 32.7 bits (75), Expect = 0.24
Identities = 25/57 (43%), Positives = 33/57 (57%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
E+EV E EE +EE EE K E+ K+KL E EE EE ++E EE EE++
Sbjct: 227 EKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVK 283
Score = 32.0 bits (73), Expect = 0.51
Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 245 REVHEEEEEEEEEEEETKAKETKL-KEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
RE E+ E+E +E EE K + +L KE + L G + EE+ E EE EE ++EI
Sbjct: 220 REELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEI 275
Score = 31.6 bits (72), Expect = 0.65
Identities = 19/58 (32%), Positives = 30/58 (51%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
RE++E E E EE + E ++KE ++L E E+E E E + + EE+I
Sbjct: 206 LREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRE 263
Score = 31.2 bits (71), Expect = 0.89
Identities = 21/58 (36%), Positives = 30/58 (51%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
E+ +E EE EE+ +E K + K +E KL E +E E E+ EEEI G+
Sbjct: 269 EELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGI 326
Score = 30.4 bits (69), Expect = 1.6
Identities = 22/67 (32%), Positives = 29/67 (43%), Gaps = 11/67 (16%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKK-----------LGGGGGEEEEEEEEEE 289
K E+E+ EE+E ++ EEE +L E +K L EEE EE EE
Sbjct: 608 KDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREE 667
Query: 290 EEEEEEE 296
E E
Sbjct: 668 YLELSRE 674
Score = 28.9 bits (65), Expect = 4.3
Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 17/75 (22%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKE-----------------KKKLGGGGGEEEEEEE 286
E+E+ E EE+E + +E + E LK+ ++KL EE E++
Sbjct: 465 EKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKA 524
Query: 287 EEEEEEEEEEEEIGG 301
EE E+ +E+ ++ G
Sbjct: 525 EEYEKLKEKLIKLKG 539
Score = 28.9 bits (65), Expect = 4.6
Identities = 18/59 (30%), Positives = 31/59 (52%), Gaps = 6/59 (10%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE-EEEEEEEEEIGGVV 303
E+ E E+E E E + E K++E ++ EE ++E EE EE+ +E +E+
Sbjct: 239 EIEELEKELESLEGSKRKLEEKIRELEE-----RIEELKKEIEELEEKVKELKELKEKA 292
Score = 28.1 bits (63), Expect = 8.0
Identities = 16/59 (27%), Positives = 27/59 (45%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
K ++E+ + EE +++ E K + +E +L E E EE EE +E E
Sbjct: 542 KSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPF 600
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 32.0 bits (73), Expect = 0.20
Identities = 13/48 (27%), Positives = 29/48 (60%), Gaps = 9/48 (18%)
Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EE E+E++ +KET+ KE + ++E++E++E++ E++
Sbjct: 92 PEESEKEQKEVSKETEEKEA---------IKAKKEKKEKKEKKVAEKL 130
Score = 29.0 bits (65), Expect = 2.1
Identities = 11/42 (26%), Positives = 25/42 (59%), Gaps = 6/42 (14%)
Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
+ EE E+E++E + +E + + KK E++E++E++
Sbjct: 90 LFPEESEKEQKEVSKETEEKEAIKAKK------EKKEKKEKK 125
Score = 27.8 bits (62), Expect = 5.5
Identities = 15/49 (30%), Positives = 26/49 (53%), Gaps = 7/49 (14%)
Query: 225 PDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK 273
+GY+ +F E + +E +E EE+E KAK+ K ++K+K
Sbjct: 83 AEGYSINLFPEE-------SEKEQKEVSKETEEKEAIKAKKEKKEKKEK 124
Score = 27.0 bits (60), Expect = 9.6
Identities = 10/31 (32%), Positives = 19/31 (61%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKL 274
E +E++E +E EE +A + K ++K+K
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKK 122
>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D. SpoVID, the
stage VI sporulation protein D, is restricted to
endospore-forming members of the bacteria, all of which
are found among the Firmicutes. It is widely distributed
but not quite universal in this group. Between
well-conserved N-terminal and C-terminal domains is a
poorly conserved, low-complexity region of variable
length, rich enough in glutamic acid to cause spurious
BLAST search results unless a filter is used. The seed
alignment for this model was trimmed, in effect, by
choosing member sequences in which these regions are
relatively short. SpoVID is involved in spore coat
assembly by the mother cell compartment late in the
process of sporulation [Cellular processes, Sporulation
and germination].
Length = 338
Score = 32.9 bits (75), Expect = 0.20
Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 5/66 (7%)
Query: 233 FQSENDYCKGGEREVHEEEEEE-----EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE 287
FQ E++ E E +EE E+E E+EE+ + +++ L G + EEEEE
Sbjct: 208 FQLESEVEASPEEENYEEYEDETELEVEDEEKALDEQTEDPQQEDALAGDAKKALEEEEE 267
Query: 288 EEEEEE 293
+ E E
Sbjct: 268 KGERPE 273
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein is
found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 32.3 bits (74), Expect = 0.21
Identities = 23/79 (29%), Positives = 34/79 (43%), Gaps = 28/79 (35%)
Query: 249 EEEEEEEEEEEETKAKETK----------LKEKKKLGGG-----------GGEEEEEEEE 287
+++EEE+E+E E + K K KKK+G EE+E E
Sbjct: 15 DDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDTSFLPDKAREEKEAELR 74
Query: 288 EE-------EEEEEEEEEI 299
EE ++E +EEEI
Sbjct: 75 EELREEFLKKQEAVKEEEI 93
Score = 31.6 bits (72), Expect = 0.40
Identities = 13/32 (40%), Positives = 23/32 (71%)
Query: 267 KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K K+KKK G+++EEE+E+E E+E++ +
Sbjct: 2 KKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPK 33
Score = 31.6 bits (72), Expect = 0.41
Identities = 15/35 (42%), Positives = 24/35 (68%)
Query: 264 KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K+ K K+K KL G +EEE+E+E E+E++ +E
Sbjct: 1 KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKES 35
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 31.2 bits (71), Expect = 0.21
Identities = 15/72 (20%), Positives = 25/72 (34%), Gaps = 15/72 (20%)
Query: 227 GYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEE 286
+ + +D + E + +EEE A+ E+ EEE
Sbjct: 46 NKGRKRYAYRDDDDDDDDMEAGGADIQEEERRSARLARL---------------EDREEE 90
Query: 287 EEEEEEEEEEEE 298
E+EEE E+
Sbjct: 91 RLEKEEEREKRA 102
Score = 27.7 bits (62), Expect = 3.6
Identities = 13/38 (34%), Positives = 19/38 (50%)
Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETK 267
I + E + E EEE E+EEE E +A++ K
Sbjct: 69 ADIQEEERRSARLARLEDREEERLEKEEEREKRARKRK 106
Score = 26.6 bits (59), Expect = 8.2
Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 9/48 (18%)
Query: 235 SENDYCKGGEREVHEEE---------EEEEEEEEETKAKETKLKEKKK 273
++D + G ++ EEE E+ EEE E + + K K+K
Sbjct: 59 DDDDDMEAGGADIQEEERRSARLARLEDREEERLEKEEEREKRARKRK 106
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 33.1 bits (75), Expect = 0.22
Identities = 15/56 (26%), Positives = 27/56 (48%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
G + +E EEE E E + E+ ++ + E+ E+E E + +EE+ E
Sbjct: 926 GSDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGE 981
Score = 31.5 bits (71), Expect = 0.67
Identities = 16/55 (29%), Positives = 27/55 (49%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E EV E E ++E +ET E + + L E + +EE+ E+ +E E +
Sbjct: 935 EEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESK 989
Score = 30.8 bits (69), Expect = 1.1
Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 2/59 (3%)
Query: 242 GGEREVHEEEEE--EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
G + E E EEE E E + ++ ET E+ E+E E + +EE+ E+ +
Sbjct: 926 GSDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWD 984
Score = 30.4 bits (68), Expect = 1.7
Identities = 15/52 (28%), Positives = 24/52 (46%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
+E +E EEE E +A ++ E E+ E+E E + +EE G
Sbjct: 929 DESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDG 980
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 32.7 bits (74), Expect = 0.23
Identities = 21/46 (45%), Positives = 26/46 (56%), Gaps = 4/46 (8%)
Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
V EEEEE E EE+E E KE K+ EEEEEEE + ++
Sbjct: 139 VLEEEEEVEMEEDE----EYYEKEPGKVVDEKSEEEEEEELKTMKD 180
Score = 32.3 bits (73), Expect = 0.35
Identities = 21/51 (41%), Positives = 27/51 (52%), Gaps = 10/51 (19%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EEEE E EE+EE KE G +E+ EEEEEEE + ++ I
Sbjct: 142 EEEEVEMEEDEEYYEKE----------PGKVVDEKSEEEEEEELKTMKDFI 182
Score = 30.8 bits (69), Expect = 1.2
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVV 303
EEEEE E EE+EE E+E G VV
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVV 163
Score = 30.4 bits (68), Expect = 1.3
Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 7/50 (14%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKE-------TKLKEKKKLGG 276
+ + +Y + +V +E+ EEEEEEE K+ TK K G
Sbjct: 149 EEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVKDNGK 198
Score = 29.6 bits (66), Expect = 2.5
Identities = 14/24 (58%), Positives = 16/24 (66%)
Query: 281 EEEEEEEEEEEEEEEEEEIGGVVV 304
EEEEE E EE+EE E+ G VV
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVV 163
>gnl|CDD|222117 pfam13422, DUF4110, Domain of unknown function (DUF4110). This is
a family that is found predominantly at the C-terminus
of Kelch-containing proteins. However, the exact
function of this region is not known.
Length = 99
Score = 30.8 bits (70), Expect = 0.24
Identities = 22/63 (34%), Positives = 26/63 (41%), Gaps = 4/63 (6%)
Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEE----EEEEEEEEEEEEEEEEIGGVVVSAVDFG 310
E E + E KE KK EE EE E+EEEE+ I VV A +
Sbjct: 31 EIAMESLRDAELSGKELKKDAFDLAEERYWELREELRALEDEEEEQAGIEEVVSLAKEKD 90
Query: 311 SGG 313
GG
Sbjct: 91 WGG 93
Score = 28.9 bits (65), Expect = 1.3
Identities = 13/34 (38%), Positives = 17/34 (50%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGG 278
RE E+EEEE+ + + KEK GG G
Sbjct: 63 REELRALEDEEEEQAGIEEVVSLAKEKDWGGGEG 96
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 32.6 bits (75), Expect = 0.24
Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 8/64 (12%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE--------EEEEEEE 295
ERE+ E E + E +E L K + E E++E+E +E+EEE
Sbjct: 68 ERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEEL 127
Query: 296 EEEI 299
EE I
Sbjct: 128 EELI 131
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 32.8 bits (75), Expect = 0.25
Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 9/84 (10%)
Query: 224 RPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGE--- 280
+ + + + E + + ERE+ E EEE ++ EE+ E+K +L G G +
Sbjct: 404 ALEEIQEELEELEKELEEL-ERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPV 462
Query: 281 -----EEEEEEEEEEEEEEEEEEI 299
EE E+E E E E EE+
Sbjct: 463 CGQELPEEHEKELLELYELELEEL 486
Score = 32.8 bits (75), Expect = 0.30
Identities = 21/51 (41%), Positives = 28/51 (54%)
Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ E E+ E+ +A E +LKE KKL E+EEEE E+E E EE
Sbjct: 192 GQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLA 242
Score = 31.3 bits (71), Expect = 0.89
Identities = 18/56 (32%), Positives = 28/56 (50%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E + EE EE+ E+ E + +LKEK +L E + E+ +E +E EE
Sbjct: 524 EEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELR 579
Score = 29.0 bits (65), Expect = 3.8
Identities = 17/65 (26%), Positives = 28/65 (43%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
E + E+ E EE EE + K + +LKE+ + +E +E EE
Sbjct: 524 EEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRT 583
Query: 295 EEEEI 299
+EE+
Sbjct: 584 RKEEL 588
Score = 29.0 bits (65), Expect = 4.4
Identities = 20/52 (38%), Positives = 27/52 (51%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E+ E E E EE EEE +++ KL + +L EE EE+ EE E E
Sbjct: 618 SLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAE 669
Score = 28.6 bits (64), Expect = 5.0
Identities = 17/54 (31%), Positives = 26/54 (48%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E E+ E EEE + K +++E+++ E E EE E EEE+E
Sbjct: 197 LLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKER 250
Score = 28.6 bits (64), Expect = 5.3
Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 4/56 (7%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
ERE+ E EEE E + E L++ K L E E+ EE+ E+ E E EE+
Sbjct: 297 EREIEELEEELEGLRALLEELEELLEKLKSL----EERLEKLEEKLEKLESELEEL 348
Score = 28.6 bits (64), Expect = 6.0
Identities = 17/55 (30%), Positives = 28/55 (50%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
RE+ EE EE+ E + E +++E ++ G EE EE E+ + EE +
Sbjct: 277 RELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERL 331
Score = 28.2 bits (63), Expect = 7.3
Identities = 18/56 (32%), Positives = 28/56 (50%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E E EEE +E K +E + +++++ E EE E EEE+E EE+
Sbjct: 199 EDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEEL 254
Score = 27.8 bits (62), Expect = 9.2
Identities = 19/56 (33%), Positives = 29/56 (51%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E+E+ + E ++ EE + + +L E EE EE E+E EE E E EE+
Sbjct: 374 EKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEEL 429
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 32.2 bits (74), Expect = 0.28
Identities = 23/60 (38%), Positives = 26/60 (43%), Gaps = 8/60 (13%)
Query: 244 EREVHEEEEEEEEE--------EEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E E+ EE EEE +E E E K E K K+ K G EE E E EE E
Sbjct: 67 EEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEEPEP 126
>gnl|CDD|218544 pfam05296, TAS2R, Mammalian taste receptor protein (TAS2R). This
family consists of several forms of mammalian taste
receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled
receptors expressed in subsets of taste receptor cells
of the tongue and palate epithelia and are organised in
the genome in clusters. The proteins are genetically
linked to loci that influence bitter perception in mice
and humans.
Length = 303
Score = 32.2 bits (74), Expect = 0.29
Identities = 22/98 (22%), Positives = 43/98 (43%), Gaps = 23/98 (23%)
Query: 60 MVSIIVPIFFGIII---ILGFFGNALVVVVVA---FNQQMRSTTNLLIINLAIADLL--- 110
M+S + IF + + +LG GN +V+V ++ S+ +L++ +LAI+ +
Sbjct: 1 MLSALEGIFLIVAVVEFLLGILGNGFIVLVNCIDWVKKRKLSSIDLILTSLAISRICLQW 60
Query: 111 ------FIVCCVPFT-----ATDYVTTCWPFGD---VW 134
FI+ P T ++ W F + +W
Sbjct: 61 LILLDNFIIVFFPNTYLSGKLLRIISLFWTFLNHLSLW 98
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 32.5 bits (74), Expect = 0.30
Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 5/56 (8%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE-----EEEEEEEEEEI 299
EE+E+E+ + K E + ++++ EEEE EE ++EEE+EE+
Sbjct: 287 TEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEV 342
Score = 32.1 bits (73), Expect = 0.41
Identities = 14/69 (20%), Positives = 32/69 (46%), Gaps = 4/69 (5%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEE--- 290
+ E++ G + E+E E +E + ++ ++ EE+E+E+ +
Sbjct: 241 KEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKK 300
Query: 291 -EEEEEEEE 298
E+E+E+E
Sbjct: 301 MMEDEDEDE 309
Score = 31.7 bits (72), Expect = 0.55
Identities = 18/65 (27%), Positives = 31/65 (47%), Gaps = 14/65 (21%)
Query: 244 EREVHEEEEEEEEEEEETKAKE-------TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
E +E+E+E+E + E T+ KEK+K + ++ E+E+E+EE
Sbjct: 258 ESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEK-------RKRLKKMMEDEDEDEE 310
Query: 297 EEIGG 301
EI
Sbjct: 311 MEIVP 315
Score = 31.0 bits (70), Expect = 0.94
Identities = 14/56 (25%), Positives = 25/56 (44%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E E + + E+E + E + GE + EEE EE+E+E+ + +
Sbjct: 243 ESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRL 298
Score = 30.6 bits (69), Expect = 1.3
Identities = 20/52 (38%), Positives = 26/52 (50%), Gaps = 13/52 (25%)
Query: 254 EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE-----EEEEEEEIG 300
EEE EE+ K K +LK+ E+E+E+EE E EEEE E
Sbjct: 284 EEETEEKEKEKRKRLKKM--------MEDEDEDEEMEIVPESPVEEEESEEP 327
Score = 28.3 bits (63), Expect = 6.1
Identities = 20/76 (26%), Positives = 28/76 (36%), Gaps = 19/76 (25%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGE--------------EEE-----EEE 286
E EEEE EE E K+ + KE+ + GG +EE ++
Sbjct: 316 ESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRRRVMKKKTFKDEEGYLVTKKV 375
Query: 287 EEEEEEEEEEEEIGGV 302
E E E+E E
Sbjct: 376 YEWESFSEDEAEPPPT 391
>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and L12p.
Ribosomal proteins P1 and P2 are the eukaryotic proteins
that are functionally equivalent to bacterial L7/L12.
L12p is the archaeal homolog. Unlike other ribosomal
proteins, the archaeal L12p and eukaryotic P1 and P2 do
not share sequence similarity with their bacterial
counterparts. They are part of the ribosomal stalk
(called the L7/L12 stalk in bacteria), along with 28S
rRNA and the proteins L11 and P0 in eukaryotes (23S
rRNA, L11, and L10e in archaea). In bacterial ribosomes,
L7/L12 homodimers bind the extended C-terminal helix of
L10 to anchor the L7/L12 molecules to the ribosome.
Eukaryotic P1/P2 heterodimers and archaeal L12p
homodimers are believed to bind the L10 equivalent
proteins, eukaryotic P0 and archaeal L10e, in a similar
fashion. P1 and P2 (L12p, L7/L12) are the only proteins
in the ribosome to occur as multimers, always appearing
as sets of dimers. Recent data indicate that most
archaeal species contain six copies of L12p (three
homodimers), while eukaryotes have two copies each of P1
and P2 (two heterodimers). Bacteria may have four or six
copies (two or three homodimers), depending on the
species. As in bacteria, the stalk is crucial for
binding of initiation, elongation, and release factors
in eukaryotes and archaea.
Length = 105
Score = 30.7 bits (69), Expect = 0.30
Identities = 12/24 (50%), Positives = 16/24 (66%)
Query: 277 GGGEEEEEEEEEEEEEEEEEEEIG 300
E+ EE +EEEEEEE+E+ G
Sbjct: 78 TAEPAEKAEEAKEEEEEEEDEDFG 101
Score = 30.7 bits (69), Expect = 0.35
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIG 300
E+ EE +EEEEEEE+E+ G
Sbjct: 78 TAEPAEKAEEAKEEEEEEEDEDFGFG 103
Score = 30.3 bits (68), Expect = 0.38
Identities = 13/25 (52%), Positives = 16/25 (64%)
Query: 278 GGEEEEEEEEEEEEEEEEEEEIGGV 302
E E+ EE +EEEEEEE+E G
Sbjct: 78 TAEPAEKAEEAKEEEEEEEDEDFGF 102
Score = 28.8 bits (64), Expect = 1.6
Identities = 11/24 (45%), Positives = 14/24 (58%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
E E+ EE +EEEEEEE+
Sbjct: 74 TAAATAEPAEKAEEAKEEEEEEED 97
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 31.4 bits (72), Expect = 0.33
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+ E+ ++E EE ++ A+ EK K G EE EE E EE ++ E+E+
Sbjct: 68 KTRLEKLKKELEELKQRIAELQAQIEKLK---KGREETEERTELLEELKQLEKEL 119
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 32.2 bits (73), Expect = 0.36
Identities = 16/45 (35%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
+E ++EEE + +E + KE L G E +EE+E+ E EE +
Sbjct: 73 DELKKEEERLLDELEELE-KEDDDLDGELVELQEEKEQLENEELQ 116
Score = 27.9 bits (62), Expect = 7.9
Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 7/56 (12%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVV 304
+ E + E +A +++L E KK EEE +E EE E+E++++ G +V
Sbjct: 53 SKLESQNVEISNYEALDSELDELKK-------EEERLLDELEELEKEDDDLDGELV 101
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 31.6 bits (72), Expect = 0.37
Identities = 19/64 (29%), Positives = 34/64 (53%), Gaps = 11/64 (17%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
S ER V +EE ++ +E E K KLK+++ ++ EEE ++ ++++
Sbjct: 163 SSLSGSAKPERNVSQEEAKKRLQEWELK----KLKQQQ-------QKREEERRKQRKKQQ 211
Query: 295 EEEE 298
EEEE
Sbjct: 212 EEEE 215
Score = 30.9 bits (70), Expect = 0.71
Identities = 17/54 (31%), Positives = 29/54 (53%), Gaps = 2/54 (3%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EV E + +E E AK+ + +KKL E++++E E+E EE E + +
Sbjct: 78 EVKLERQAQEAYENWLSAKQ--AQRQKKLQKLLEEKQKQEREKEREEAELRQRL 129
Score = 28.5 bits (64), Expect = 3.7
Identities = 7/30 (23%), Positives = 20/30 (66%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKK 273
E + ++++++ EEE + K+ + +E++K
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQEEEERK 217
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 31.6 bits (72), Expect = 0.39
Identities = 20/60 (33%), Positives = 29/60 (48%), Gaps = 4/60 (6%)
Query: 244 EREVHEEEEE---EEEEEEETKAKETKLKE-KKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E E EE+ EEEE + + K +L+E ++L EEE E E E +E E+
Sbjct: 32 EETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEERERLEAEVDEATAEV 91
Score = 27.8 bits (62), Expect = 7.6
Identities = 20/49 (40%), Positives = 23/49 (46%), Gaps = 1/49 (2%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
EE EE E E K K+ + E + L E EEE EEE EEE
Sbjct: 29 EEYEETALELEEKLKQEEE-EAQLLEKKADELEEENRRLEEEAAASEEE 76
Score = 27.4 bits (61), Expect = 8.7
Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 7/48 (14%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
EV + EEE E++E ET+ + +L+E + E E +E E
Sbjct: 88 TAEVAKLEEEREKKEAETRQLQQELREAQ-------EAHERARQELLE 128
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 31.6 bits (71), Expect = 0.40
Identities = 19/52 (36%), Positives = 29/52 (55%), Gaps = 8/52 (15%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E +E EE EE E + +E K+ E+ E++EE+E E EE+ EE+
Sbjct: 129 EKARQEREELEERMEWERREEKIDER--------EDQEEQEREREEQTIEEQ 172
Score = 30.0 bits (67), Expect = 1.0
Identities = 17/54 (31%), Positives = 30/54 (55%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
ERE EE E E EE+ +E + +++++ EE+ ++ E E E++E E
Sbjct: 134 EREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESE 187
Score = 28.1 bits (62), Expect = 5.7
Identities = 13/55 (23%), Positives = 30/55 (54%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E + E EE+ +E E + ++ + +E++ + + E E E++E E E +++
Sbjct: 139 EERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDD 193
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 32.0 bits (72), Expect = 0.42
Identities = 17/39 (43%), Positives = 26/39 (66%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
EEEE E +EE +A+E K + +K+G G EE+ + +EE
Sbjct: 242 EEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEE 280
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 32.3 bits (74), Expect = 0.43
Identities = 9/79 (11%), Positives = 35/79 (44%)
Query: 220 DPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGG 279
P + + K E++ +++ + ++ + ++ ++L + +
Sbjct: 1304 SPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDS 1363
Query: 280 EEEEEEEEEEEEEEEEEEE 298
E++++ E ++ E+E++E
Sbjct: 1364 SSEDDDDSEVDDSEDEDDE 1382
>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
N-terminal domain. Autophagocytosis is a
starvation-induced process responsible for transport of
cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
ubiquitin like modifier that is topologically similar to
the canonical E2 enzyme. It catalyzes the conjugation of
Atg8 and phosphatidylethanolamine.
Length = 146
Score = 30.8 bits (70), Expect = 0.43
Identities = 12/67 (17%), Positives = 32/67 (47%), Gaps = 7/67 (10%)
Query: 240 CKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGG-------GGEEEEEEEEEEEEE 292
K E EE E+E+E++ ++K+K G+++++ + + +
Sbjct: 79 AKQMEYGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDAD 138
Query: 293 EEEEEEI 299
E+++++I
Sbjct: 139 EDDDDDI 145
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 31.8 bits (73), Expect = 0.44
Identities = 12/57 (21%), Positives = 30/57 (52%), Gaps = 2/57 (3%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKE--KKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+ E EE EE E + E + +++L E++++ +++ +E+++E E+
Sbjct: 330 TFIEENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDEGVEV 386
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 31.9 bits (72), Expect = 0.44
Identities = 14/52 (26%), Positives = 28/52 (53%), Gaps = 3/52 (5%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
+ E ++ +EE KAK+ K + K+ E+E E++E E +++ E +
Sbjct: 287 QIEIKKNDEEALKAKDHKAFDLKQ---ESKASEKEAEDKELEAQKKREPVAE 335
Score = 30.4 bits (68), Expect = 1.3
Identities = 11/53 (20%), Positives = 26/53 (49%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
++ ++ E E+ + E K + + + K ++E + E+E E++E E
Sbjct: 273 KQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELE 325
Score = 30.0 bits (67), Expect = 1.8
Identities = 10/46 (21%), Positives = 23/46 (50%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
++++E + A + KE K++ E E+ + E ++ +EE
Sbjct: 251 RQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEE 296
Score = 28.8 bits (64), Expect = 3.9
Identities = 12/57 (21%), Positives = 24/57 (42%), Gaps = 2/57 (3%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLK--EKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ ++ E ++ E E E K E K + ++E + E+E E++E
Sbjct: 268 SPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKEL 324
Score = 28.0 bits (62), Expect = 7.7
Identities = 16/57 (28%), Positives = 29/57 (50%), Gaps = 5/57 (8%)
Query: 244 EREVHEEEEEEEE--EEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+++V E ++ E E + E K E LK K + ++E + E+E E++E E
Sbjct: 272 DKQVAENQKREIEKAQIEIKKNDEEALKAKDH---KAFDLKQESKASEKEAEDKELE 325
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 31.0 bits (71), Expect = 0.45
Identities = 19/57 (33%), Positives = 34/57 (59%), Gaps = 5/57 (8%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
G+ E+ +E ++ EEE+EE + + +L+ K + E+ EEEE + EE+ +EI
Sbjct: 121 GKSELEQEIKKLEEEKEELEKRVAELEAKLE-----AIEKREEEERQIEEKRHADEI 172
>gnl|CDD|225053 COG2142, SdhD, Succinate dehydrogenase, hydrophobic anchor subunit
[Energy production and conversion].
Length = 117
Score = 30.3 bits (69), Expect = 0.45
Identities = 15/76 (19%), Positives = 26/76 (34%), Gaps = 13/76 (17%)
Query: 123 YVTTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSV--------RTE 174
Y W Y V A+ + V + L A++H + V
Sbjct: 36 YFLLTWLNAT-----YAAWVAFLANPFWKVFLLLLLVAALIHAWNGLRVIIEDYIKPEKL 90
Query: 175 RNALKAILITWIVIVL 190
R AL+ +L+ +V+
Sbjct: 91 RLALQILLVLALVLTG 106
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 30.7 bits (70), Expect = 0.45
Identities = 16/82 (19%), Positives = 37/82 (45%), Gaps = 9/82 (10%)
Query: 218 DLDPVNRPDGYNKTIFQSENDYCKGGEREV---HEEEEEEEEEEEETKAKETKLKEKKKL 274
D+ P + + Y G + + ++E+EE + ++ + ++ E++K
Sbjct: 21 DIGPARDGVDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRK- 79
Query: 275 GGGGGEEEEEEEEEEEEEEEEE 296
+ E++E+EE E+ EE
Sbjct: 80 -----KRREQKEKEEIEKYREE 96
>gnl|CDD|223068 PHA03384, PHA03384, early DNA-binding protein E2A; Provisional.
Length = 445
Score = 32.0 bits (73), Expect = 0.46
Identities = 19/58 (32%), Positives = 26/58 (44%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K G R V EEEEEEEE E A + G G+ +EE++ E++
Sbjct: 29 KKGRRRVSPVEEEEEEEEAEVVAVGFSYPPVRISRGKDGKRPVRPLKEEKDSEKKAST 86
>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal. This
domain family is found in eukaryotes, and is typically
between 99 and 114 amino acids in length. The family is
found in association with pfam08697, pfam01585. There
are two completely conserved residues (G and F) that may
be functionally important. TIP is involved in enamel
assembly by interacting with one of the major proteins
responsible for biomineralisation of enamel - tuftelin.
Length = 106
Score = 30.0 bits (68), Expect = 0.47
Identities = 12/62 (19%), Positives = 32/62 (51%), Gaps = 14/62 (22%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKL--------------GGGGGEEEEEEEEEEEEEEEEE 296
++++++E E + + + +KKK EE+++E+E E+++E +E
Sbjct: 41 DDDDDDEGERRRGRGRRRSKKKKDYSKPVAFVSGGIKQAAKEPEEDDKEDESEDDDESDE 100
Query: 297 EE 298
++
Sbjct: 101 DD 102
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 31.3 bits (71), Expect = 0.48
Identities = 11/52 (21%), Positives = 20/52 (38%), Gaps = 3/52 (5%)
Query: 250 EEEEEEEEEEETKAK---ETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+++ + ++ KA E K+ E + + E EE EEE
Sbjct: 118 KDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEE 169
Score = 30.5 bits (69), Expect = 0.72
Identities = 15/44 (34%), Positives = 20/44 (45%)
Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E++ +A+ + EKK EE EEEE EE E
Sbjct: 134 EAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAE 177
Score = 30.5 bits (69), Expect = 0.83
Identities = 17/57 (29%), Positives = 24/57 (42%), Gaps = 8/57 (14%)
Query: 244 EREVHEEEEEE--EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E++V+E E E++ E A + + EEE EE EE EE E
Sbjct: 136 EKKVNEARAEAVAEKKAAEAAAVAAEEAAAAE------EEEAEEAPAEEAPAEESAE 186
>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
Reviewed.
Length = 129
Score = 30.4 bits (69), Expect = 0.48
Identities = 11/43 (25%), Positives = 24/43 (55%)
Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+E +EE E K +KKK G ++ +++ +++ +EE+
Sbjct: 74 DEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEED 116
Score = 29.6 bits (67), Expect = 0.94
Identities = 10/48 (20%), Positives = 26/48 (54%), Gaps = 4/48 (8%)
Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+E +EE + K +KKK G+++ +++ +++ +EE++
Sbjct: 74 DEIDEEIIPLEEKF----DKKKKKFMDGDDDIIDDDILPDDDFDEEDL 117
Score = 27.3 bits (61), Expect = 5.2
Identities = 15/53 (28%), Positives = 27/53 (50%), Gaps = 6/53 (11%)
Query: 249 EEEEEEEEEEEETKAK------ETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
+EE EE+ + K K + + + EE+ +EE++E+EE+EE
Sbjct: 77 DEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129
>gnl|CDD|187756 cd09299, TDT, The Tellurite-resistance/Dicarboxylate Transporter
(TDT) family. The Tellurite-resistance/Dicarboxylate
Transporter (TDT) family includes members from all three
kingdoms, but only three members of the family have been
functionally characterized: the TehA protein of E. coli
functioning as a tellurite-resistance uptake permease,
the Mae1 protein of S. pombe functioning in the uptake
of malate and other dicarboxylates, and the sulfite
efflux pump (SSU1) of Saccharomyces cerevisiae. In
plants, the plasma membrane protein SLAC1 (Slow Anion
Channel-Associated 1), which is preferentially expressed
in guard cells, encodes a distant homolog of fungal and
bacterial dicarboxylate/malic acid transport proteins.
SLAC1 is essential in mediating stomatal responses to
physiological and stress stimuli. Members of the TDT
family exhibit 10 putative transmembrane alpha-helical
spanners (TMSs).
Length = 326
Score = 31.5 bits (72), Expect = 0.48
Identities = 19/93 (20%), Positives = 31/93 (33%), Gaps = 14/93 (15%)
Query: 61 VSIIVPIFFGIIIILGFFGN---------ALVVVVVAFNQQMRSTTNLLIINLAIADLLF 111
+ + IF G II N L++V+ Q S LL++ + F
Sbjct: 179 IGFFLFIFIGAIIFSRLVINKLPPGQLAPTLMIVLGPMGQ---SIFALLLLGKNARLIFF 235
Query: 112 -IVCCVPFTATDYVTTCWPFGDVWCKMYLIVVT 143
+ A W FG +W + +V
Sbjct: 236 LYASFLQALAVFASIILWGFG-LWWLLLALVSI 267
>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein. All proteins in this family
for which functions are known are used for the
recognition of DNA damage as part of nucleotide excision
repair. This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 172
Score = 30.9 bits (70), Expect = 0.49
Identities = 22/65 (33%), Positives = 28/65 (43%), Gaps = 7/65 (10%)
Query: 242 GGEREVHEEEEEEEEEEEETKAK--ETKLKEKKK-----LGGGGGEEEEEEEEEEEEEEE 294
G E + EE+E EE +EE K K E KLKE ++ E E E E
Sbjct: 91 GSEEALDEEKERREESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEGRVHEHEFGPETNG 150
Query: 295 EEEEI 299
EE+
Sbjct: 151 VEEDT 155
>gnl|CDD|219227 pfam06912, DUF1275, Protein of unknown function (DUF1275). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown although a few members are
thought to be membrane proteins.
Length = 206
Score = 31.0 bits (71), Expect = 0.51
Identities = 12/63 (19%), Positives = 24/63 (38%), Gaps = 1/63 (1%)
Query: 66 PIFFGIIIILGF-FGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYV 124
++ +L F G AL +++ ++ R L ++ L A LL +P +
Sbjct: 48 LALHKLLALLAFVLGAALAGLLIRLLRRRRLRRWLALLLLLEAVLLLAAALLPLALPPLI 107
Query: 125 TTC 127
Sbjct: 108 AIA 110
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 31.6 bits (71), Expect = 0.52
Identities = 19/58 (32%), Positives = 36/58 (62%), Gaps = 3/58 (5%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
ER +EEEE+E+ EE + +E ++ E++ E+ E+++E+E + E +E E+ G
Sbjct: 102 ERGTQKEEEEDEDVEEIEEVEEVEVVEEEY---DDDEDSEKDDEKESDAEGDENELAG 156
Score = 28.9 bits (64), Expect = 3.3
Identities = 19/67 (28%), Positives = 38/67 (56%), Gaps = 3/67 (4%)
Query: 233 FQSENDY-CKGGEREVHEEEEE--EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
+END +G V EEE ++EEEE+ +E + E+ ++ +++E+ E+++
Sbjct: 82 SDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDD 141
Query: 290 EEEEEEE 296
E+E + E
Sbjct: 142 EKESDAE 148
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 31.9 bits (73), Expect = 0.53
Identities = 16/50 (32%), Positives = 22/50 (44%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
E E EE+ + E + +L E+ E E EEE EE EE+
Sbjct: 526 AERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVA 575
Score = 27.7 bits (62), Expect = 9.5
Identities = 19/50 (38%), Positives = 28/50 (56%), Gaps = 5/50 (10%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+EE E EE+ +A+ET+ + + L EE EE EE E E E E++
Sbjct: 219 DEEIERYEEQREQARETRDEADEVL-----EEHEERREELETLEAEIEDL 263
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 31.7 bits (73), Expect = 0.56
Identities = 18/56 (32%), Positives = 33/56 (58%), Gaps = 3/56 (5%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E+E+ E E ++ E+ KE L K +L + EEE E++E+E E++++E+
Sbjct: 74 EKELRERRNELQKLEKRLLQKEENLDRKLEL---LEKREEELEKKEKELEQKQQEL 126
Score = 29.4 bits (67), Expect = 3.0
Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 3/58 (5%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
E + + E E+ EEE + KE +L++K++ ++EEE EE EE+ +E E I G
Sbjct: 95 EENLDRKLELLEKREEELEKKEKELEQKQQ---ELEKKEEELEELIEEQLQELERISG 149
Score = 27.8 bits (63), Expect = 8.1
Identities = 17/54 (31%), Positives = 35/54 (64%), Gaps = 6/54 (11%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E+ + ++EE + + E + +E +L++K+K E E++++E E++EEE EE
Sbjct: 88 EKRLLQKEENLDRKLELLEKREEELEKKEK------ELEQKQQELEKKEEELEE 135
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 31.5 bits (71), Expect = 0.56
Identities = 15/65 (23%), Positives = 35/65 (53%), Gaps = 3/65 (4%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKA-KETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
+ E K E+ E + ++ E+E K ++ +LK +++ EE E++ + E+++
Sbjct: 76 KGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQ--QKQAEEAEKQAQLEQKQ 133
Query: 293 EEEEE 297
+EE+
Sbjct: 134 QEEQA 138
Score = 30.7 bits (69), Expect = 0.99
Identities = 33/138 (23%), Positives = 70/138 (50%), Gaps = 11/138 (7%)
Query: 169 MSVRTERNA--LKAILITWIV-IVLTAVPV---FNMHGEVSYTYA-SELHTACVFLDLDP 221
MS TE+N +AI+I+ ++ I+L A+ + + E S + A + +DP
Sbjct: 1 MSKATEQNDKLKRAIIISAVLHIILFALLIWGSLDETIEASGGGGGGSVIDAVM---VDP 57
Query: 222 VNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEE 281
Y + I ++ KG ++ +EE+ EE + + A++ +LK+ +K E+
Sbjct: 58 GAVVQQYGR-IQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQ 116
Query: 282 EEEEEEEEEEEEEEEEEI 299
+++ EE E++ + E+++
Sbjct: 117 QKQAEEAEKQAQLEQKQQ 134
Score = 28.4 bits (63), Expect = 5.8
Identities = 12/65 (18%), Positives = 33/65 (50%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+E + K E+E + +E++++ EE K + + K++++ E++++ E + +
Sbjct: 97 AAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKA 156
Query: 294 EEEEE 298
E
Sbjct: 157 AAEAA 161
>gnl|CDD|219521 pfam07695, 7TMR-DISM_7TM, 7TM diverse intracellular signalling.
This entry represents the transmembrane region of the
7TM-DISM (7TM Receptors with Diverse Intracellular
Signalling Modules).
Length = 207
Score = 31.1 bits (71), Expect = 0.56
Identities = 24/159 (15%), Positives = 54/159 (33%), Gaps = 39/159 (24%)
Query: 67 IFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATD---- 122
+F+GI++ L + N + + R + L + L+I+ + + +
Sbjct: 7 LFYGILLALALY-NLFLFFSL------RDRSYLYYV-------LYILSFLLYQLSLNGLG 52
Query: 123 --YVTTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKA 180
Y+ WP W L+ + V L+ FL ++ +
Sbjct: 53 FQYL---WPNAPPWLNNKLLYLFLALLVIFFALLFARSFL-----------ELKKYLPRL 98
Query: 181 --ILITWIVIVLTAVPVFNMHGEVSYTYASELHTACVFL 217
+L+ +++L + + + YT + L L
Sbjct: 99 DRLLLGLALLLLLLLLLAPL---FPYTLSLRLAQLLALL 134
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 31.7 bits (73), Expect = 0.58
Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 5/42 (11%)
Query: 235 SENDYCKGGEREVHEEEEEEEEEEEET-----KAKETKLKEK 271
+E D G +++E+EEEEE+ A E++L EK
Sbjct: 178 AEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEK 219
Score = 31.3 bits (72), Expect = 0.73
Identities = 13/55 (23%), Positives = 24/55 (43%), Gaps = 7/55 (12%)
Query: 259 EETKAKETKLKEKKKL-------GGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSA 306
+ + E +L+E E EE +++E+EEEEE+ ++A
Sbjct: 157 DRLENGERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAA 211
Score = 29.0 bits (66), Expect = 3.8
Identities = 16/70 (22%), Positives = 28/70 (40%), Gaps = 10/70 (14%)
Query: 243 GEREVHE----EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
GER + E + EE+ E + + +EEEEE+E ++ +E E
Sbjct: 162 GERRLRELIDGFVDPNAEEDPAHVGSELEELD------DDEDEEEEEDENDDSLAADESE 215
Query: 299 IGGVVVSAVD 308
+ V+
Sbjct: 216 LPEKVLEKFK 225
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 31.5 bits (72), Expect = 0.63
Identities = 6/54 (11%), Positives = 33/54 (61%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+V + +++++++++ + + + ++++++ ++E+EE++E +E+
Sbjct: 125 DVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKEL 178
Score = 31.5 bits (72), Expect = 0.68
Identities = 6/50 (12%), Positives = 30/50 (60%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+++++++++++ + + + ++++ ++E+EE++E +E E
Sbjct: 130 ADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELE 179
Score = 30.0 bits (68), Expect = 1.6
Identities = 9/61 (14%), Positives = 29/61 (47%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
+++++++++ ++ + E + ++E+EE++E +E E+ V D
Sbjct: 133 DDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDED 192
Query: 309 F 309
Sbjct: 193 D 193
Score = 29.6 bits (67), Expect = 2.2
Identities = 9/90 (10%), Positives = 35/90 (38%), Gaps = 9/90 (10%)
Query: 220 DPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGG 279
+ Y K I + + + ++++ +++++ + + +
Sbjct: 112 LDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDD--------- 162
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVDF 309
++ ++E+EE++E +E E+ +
Sbjct: 163 DDVDDEDEEKKEAKELEKLSDDDDFVWDED 192
>gnl|CDD|218134 pfam04535, DUF588, Domain of unknown function (DUF588). This
family of plant proteins contains a domain that may have
a catalytic activity. It has a conserved arginine and
aspartate that could form an active site. These proteins
are predicted to contain 3 or 4 transmembrane helices.
Length = 150
Score = 30.4 bits (69), Expect = 0.63
Identities = 18/101 (17%), Positives = 35/101 (34%), Gaps = 26/101 (25%)
Query: 104 LAIADLLFIVCCVPFT-------ATDYVTTCWPFGDVWCK-------MYLIVVTAYASVY 149
L +A+L+ + T+ T + F YL+V A A+ Y
Sbjct: 4 LRLAELVLRLAAFVLALAAAVVMGTNKQTKPFFFIQKKASFSDLPAFRYLVVANAIAAGY 63
Query: 150 TLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVL 190
+L+ + L+V R + + W++ +L
Sbjct: 64 SLLQL----VLSVYLLS--------RKKPRTKGLAWLLFIL 92
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 31.6 bits (71), Expect = 0.65
Identities = 20/52 (38%), Positives = 24/52 (46%), Gaps = 6/52 (11%)
Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
H E E+E KA E K+ EKK++ E E EE E E E E I
Sbjct: 434 HAERARIEKENAHRKALEMKILEKKRI------ERLEREERERLERERMERI 479
>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
motility and secretion / Intracellular trafficking and
secretion].
Length = 696
Score = 31.5 bits (72), Expect = 0.67
Identities = 16/72 (22%), Positives = 29/72 (40%), Gaps = 10/72 (13%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
+E++ EE+E+ + + EEEEE ++ + E E+G
Sbjct: 319 LAYLLSKRKEQQAEEKEKPAEEAKK----------EEEEEEEESVDDVLLIDPIELELGY 368
Query: 302 VVVSAVDFGSGG 313
++ VD GG
Sbjct: 369 GLIPLVDEQQGG 380
>gnl|CDD|225153 COG2244, RfbX, Membrane protein involved in the export of O-antigen
and teichoic acid [General function prediction only].
Length = 480
Score = 31.3 bits (71), Expect = 0.72
Identities = 23/179 (12%), Positives = 58/179 (32%), Gaps = 33/179 (18%)
Query: 36 PAICRNFSHEYINDNDVDAEAMERMVSIIVPIFFGIIIILGFFG---------------- 79
AI R + + + ++ +++ + ++++L +
Sbjct: 65 AAITREIAEYREKGEYLLLILLSVLLLLLLALILLLLLLLIAYLLAPIDPVLALLLRILS 124
Query: 80 -----NALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVW 134
L V+ Q L + ++ LL V + F A
Sbjct: 125 LALLLLPLSSVLRGLFQGFGRFGPLALSIVSSIFLLAAVFALLFAALGLA---------- 174
Query: 135 CKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAV 193
++ +V+ A S+ L+++ + + PI S+ + L+ L + +L +
Sbjct: 175 --VWALVLGAVVSLLVLLILLGKKKRGLKRPILRFSLALLKELLRFGLPLLLSSLLNFL 231
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 29.9 bits (68), Expect = 0.74
Identities = 14/27 (51%), Positives = 21/27 (77%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSA 306
++EEEEE+EE+E+E E +E G V V+
Sbjct: 31 DKEEEEEKEEKEKEAEPDEDGWVTVTK 57
Score = 27.2 bits (61), Expect = 5.3
Identities = 16/59 (27%), Positives = 33/59 (55%), Gaps = 8/59 (13%)
Query: 246 EVHEEEEEEEEEEEETKAKE--------TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
E +++EEEEE+EE+E +A+ TK + K + E+E+++++++E
Sbjct: 28 EEYDKEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKE 86
>gnl|CDD|225887 COG3351, FlaD, Putative archaeal flagellar protein D/E [Cell
motility and secretion].
Length = 214
Score = 30.6 bits (69), Expect = 0.76
Identities = 12/49 (24%), Positives = 23/49 (46%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+E+ EE + K+ + ++E + E E+ + EEE E+E
Sbjct: 24 KEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEKVDSIEEEISEKE 72
Score = 28.3 bits (63), Expect = 4.9
Identities = 13/53 (24%), Positives = 24/53 (45%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ E +E+ EE + K + +L E+ +E E+ + EEE E+
Sbjct: 19 ALEELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEKVDSIEEEISEK 71
Score = 27.9 bits (62), Expect = 6.1
Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 1/62 (1%)
Query: 244 EREVHEEEEEEEEE-EEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
E+ E+ + EEE KE + EK K EEEE++ + EE E+ + +
Sbjct: 50 EQTKENSLIEKVDSIEEEISEKEKVMSEKLKEPAQMSSTSEEEEKKAKLEEIPEDVVSIM 109
Query: 303 VV 304
V
Sbjct: 110 VA 111
>gnl|CDD|216683 pfam01758, SBF, Sodium Bile acid symporter family. This family
consists of Na+/bile acid co-transporters. These
transmembrane proteins function in the liver in the
uptake of bile acids from portal blood plasma a process
mediated by the co-transport of Na+. Also in the family
is ARC3 from S. cerevisiae, this is a putative
transmembrane protein involved in resistance to arsenic
compounds.
Length = 188
Score = 30.3 bits (69), Expect = 0.78
Identities = 23/121 (19%), Positives = 45/121 (37%), Gaps = 14/121 (11%)
Query: 72 IIILGFFGNALV--VVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWP 129
++LG ++ +++ + L + L +V C P A V T
Sbjct: 29 ALLLGLLLQWVIMPLLMFILAWFLLRLPPELAVGL------ILVGCAPGGAMSNVWTYLA 82
Query: 130 FGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIV 189
GDV +V+ A +++ ++ L FL + V T + L I I ++
Sbjct: 83 KGDV---ELSVVMVALSTLTAPLVTPLLSFLLSGLLLD---VGTAVSPLSLIKSVLIYVI 136
Query: 190 L 190
+
Sbjct: 137 I 137
>gnl|CDD|218576 pfam05395, DARPP-32, Protein phosphatase inhibitor 1/DARPP-32.
This family consists of several mammalian protein
phosphatase inhibitor 1 (IPP-1) and dopamine- and
cAMP-regulated neuronal phosphoprotein (DARPP-32)
proteins. Protein phosphatase inhibitor-1 is involved in
signal transduction and is an endogenous inhibitor of
protein phosphatase-1. It has been demonstrated that
DARPP-32, if phosphorylated, can inhibit
protein-phosphatase-1. DARPP-32 has a key role in many
neurotransmitter pathways throughout the brain and has
been shown to be involved in controlling receptors, ion
channels and other physiological factors including the
brain's response to drugs of abuse, such as cocaine,
opiates and nicotine. DARPP-32 is reciprocally regulated
by the two neurotransmitters that are most often
implicated in schizophrenia - dopamine and glutamate.
Dopamine activates DARPP-32 through the D1 receptor
pathway and disables DARPP-32 through the D2 receptor.
Glutamate, acting through the N-methyl-d-aspartate
receptor, renders DARPP-32 inactive. A mutant form of
DARPP-32 has been linked with gastric cancers.
Length = 170
Score = 30.3 bits (68), Expect = 0.81
Identities = 12/51 (23%), Positives = 21/51 (41%)
Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ + EE + +E +L +LG GE E E ++ EE +
Sbjct: 89 SHLQSQGSLEENQASEEEDELDSPDELGYPTGETESTEAQDSEEPGSCPTQ 139
Score = 29.5 bits (66), Expect = 1.7
Identities = 12/61 (19%), Positives = 19/61 (31%), Gaps = 4/61 (6%)
Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSGGW 314
EE ++E + G E E E ++ EE + + G G
Sbjct: 95 GSLEENQASEEEDELDSPDELGYPTGETESTEAQDSEEPGSCPTQD----AQSEVGLEGP 150
Query: 315 W 315
W
Sbjct: 151 W 151
>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
Length = 194
Score = 30.6 bits (69), Expect = 0.83
Identities = 14/48 (29%), Positives = 25/48 (52%), Gaps = 4/48 (8%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
EE E++ +E E KE K+ +EE++E++ E+ E+E
Sbjct: 4 EEFPSEKDLPQEETTDEAPKKEAKE----APKEEDQEKQNPVEDLEKE 47
>gnl|CDD|225029 COG2118, COG2118, DNA-binding protein [General function prediction
only].
Length = 116
Score = 29.6 bits (67), Expect = 0.84
Identities = 14/47 (29%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
++EE EE + + KL E ++ + E +EE+ +EEE +++ I
Sbjct: 1 MDDEELEEIRRR--KLAELQRQAKLEEQREAQEEQARQEEEAQKQAI 45
Score = 28.1 bits (63), Expect = 2.7
Identities = 16/49 (32%), Positives = 27/49 (55%), Gaps = 6/49 (12%)
Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
++EE EE + K +L+ + KL EE+ E +EE+ +EEE +
Sbjct: 1 MDDEELEEIRRRKLAELQRQAKL------EEQREAQEEQARQEEEAQKQ 43
>gnl|CDD|219921 pfam08592, DUF1772, Domain of unknown function (DUF1772). This
domain is of unknown function.
Length = 137
Score = 30.0 bits (68), Expect = 0.84
Identities = 16/69 (23%), Positives = 26/69 (37%), Gaps = 7/69 (10%)
Query: 53 DAEAMERMVSI--IVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLL 110
+A+ I P F + A + V+A + + LL LA A L
Sbjct: 19 PDQALAAWQRINRRGPAFMPLFAGTSALLFA-YLAVLALRGRGGPGSRLL---LAAA-AL 73
Query: 111 FIVCCVPFT 119
+++ VP T
Sbjct: 74 YLLGVVPVT 82
>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein. This family includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 116
Score = 29.5 bits (66), Expect = 0.84
Identities = 13/44 (29%), Positives = 20/44 (45%), Gaps = 7/44 (15%)
Query: 238 DYCKGGEREVHEEEE-------EEEEEEEETKAKETKLKEKKKL 274
D + E+ +EE E+E E + +E K K+KKK
Sbjct: 72 DNMEATFMEIQKEERRSARMARLEDERELAREEEEEKRKKKKKN 115
Score = 28.7 bits (64), Expect = 1.7
Identities = 14/65 (21%), Positives = 21/65 (32%), Gaps = 18/65 (27%)
Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
E+D E E ++EE + E+E E EEE
Sbjct: 64 RYDEDDALDNMEAT---FMEIQKEERRSARM---------------ARLEDERELAREEE 105
Query: 293 EEEEE 297
EE+ +
Sbjct: 106 EEKRK 110
Score = 28.3 bits (63), Expect = 2.2
Identities = 14/71 (19%), Positives = 25/71 (35%), Gaps = 14/71 (19%)
Query: 225 PDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEE 284
G ++ + ++ E E ++EE A+ E+E E
Sbjct: 54 GKGRKRSYYDRYDEDDALDNMEATFMEIQKEERRSARMARL--------------EDERE 99
Query: 285 EEEEEEEEEEE 295
EEEEE+ +
Sbjct: 100 LAREEEEEKRK 110
Score = 27.9 bits (62), Expect = 2.8
Identities = 14/50 (28%), Positives = 21/50 (42%), Gaps = 14/50 (28%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E ++EE A+ +L E+E E EEEEE+ +
Sbjct: 75 EATFMEIQKEERRSARMARL--------------EDERELAREEEEEKRK 110
>gnl|CDD|218218 pfam04702, Vicilin_N, Vicilin N terminal region. This region is
found in plant seed storage proteins, N-terminal to the
Cupin domain (pfam00190). In Macadamia integrifolia,
this region is processed into peptides of approximately
50 amino acids containing a C-X-X-X-C-(10-12)X-C-X-X-X-C
motif. These peptides exhibit antimicrobial activity in
vitro.
Length = 147
Score = 30.1 bits (67), Expect = 0.88
Identities = 11/44 (25%), Positives = 22/44 (50%)
Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
+++E+ + + + +E+ + G G E +E EEE EE
Sbjct: 104 QQQEQRPRRQQQCQRECREQYQEHGRGEGENINQEGEEERSEEG 147
Score = 30.1 bits (67), Expect = 0.92
Identities = 15/69 (21%), Positives = 30/69 (43%), Gaps = 6/69 (8%)
Query: 228 YNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE 287
Y K Q + Y + ++ ++E+ +++ + + +E + GE E +E
Sbjct: 85 YEKEQQQQQRQY-QECQQRCQQQEQRPRRQQQCQRECREQYQEHGR-----GEGENINQE 138
Query: 288 EEEEEEEEE 296
EEE EE
Sbjct: 139 GEEERSEEG 147
Score = 28.1 bits (62), Expect = 3.5
Identities = 8/59 (13%), Positives = 28/59 (47%)
Query: 240 CKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ +R +E + E+++ + + + KE+++ E+ + E+ ++ ++ E
Sbjct: 13 YEQCQRRCEQETRGQREQQQCERRCKEQYKEEQQQQRQREEDPQRRYEQCQQRCQQHEP 71
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 28.9 bits (65), Expect = 0.93
Identities = 8/44 (18%), Positives = 23/44 (52%)
Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
+++E++ + K KE K K+ E E+ + ++++++
Sbjct: 19 DKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKD 62
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 31.1 bits (71), Expect = 0.94
Identities = 14/50 (28%), Positives = 25/50 (50%), Gaps = 5/50 (10%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ EEE ++E L+E +EE +EE+EE + E++E+
Sbjct: 104 GDIEEELQDEPRYDDAYRDLEE-----DDDDDEESDEEDEESSKSEDDED 148
Score = 28.0 bits (63), Expect = 8.8
Identities = 10/56 (17%), Positives = 30/56 (53%), Gaps = 19/56 (33%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+ E ++++EE +EE+E E + E++E++++++++++I
Sbjct: 122 DLEEDDDDDEESDEEDE-------------------ESSKSEDDEDDDDDDDDDDI 158
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 31.0 bits (70), Expect = 0.98
Identities = 19/56 (33%), Positives = 32/56 (57%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+E + EEE E+E E+ +E L ++ L G E E++EE +E +EE+E+
Sbjct: 378 VKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKELSKEEKEL 433
Score = 30.2 bits (68), Expect = 1.7
Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 8/66 (12%)
Query: 244 EREVHEEEEEEEEEEE--------ETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E E+ +E E+ +EEE E KE + E+ L EE+E E+ + E E
Sbjct: 385 EEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKELSKEEKELLEKLKMEASEV 444
Query: 296 EEEIGG 301
E+ G
Sbjct: 445 EKLFGR 450
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 30.5 bits (69), Expect = 0.98
Identities = 16/57 (28%), Positives = 25/57 (43%), Gaps = 19/57 (33%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
+ E ++ +E+EE+E EE+E E EEEE+EE E+
Sbjct: 37 DEEAEDDVVDEDEEDE-------------------AVVEEDENELTEEEEDEEGEVK 74
Score = 28.6 bits (64), Expect = 4.5
Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 18/60 (30%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
+EE E++ +E+ E+E EE+E E EEEE+ G V ++ D
Sbjct: 37 DEEAEDDVVDEDE------------------EDEAVVEEDENELTEEEEDEEGEVKASPD 78
>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306). This
family of proteobacterial species proteins has no known
function.
Length = 115
Score = 29.2 bits (66), Expect = 1.0
Identities = 16/55 (29%), Positives = 19/55 (34%), Gaps = 14/55 (25%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
V EE E E E +A EEEEE E E+EE+
Sbjct: 11 LAVRAEEPAEPAETAEEEAA--------------AAAPAPAPEEEEEAELEDEEL 51
Score = 26.9 bits (60), Expect = 6.5
Identities = 18/51 (35%), Positives = 21/51 (41%), Gaps = 9/51 (17%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EE E E EE A EEEEE E E+EE EE ++
Sbjct: 16 EEPAEPAETAEEEAAAAAPA---------PAPEEEEEAELEDEELLEELDL 57
>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH. This
bacillus clade of FliH proteins is not found by the Pfam
FliH model pfam02108, but is closely related to the
sequences identified by that model. Sequences identified
by this model are observed in flagellar operons in an
analogous position relative to other flagellar operon
genes.
Length = 255
Score = 30.5 bits (69), Expect = 1.1
Identities = 18/74 (24%), Positives = 34/74 (45%)
Query: 222 VNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEE 281
P I + E+E+ +EE+E E+ E+ +A+ ++ E+ + E
Sbjct: 10 SVIPAQERGIIPLRQVTDVPETEQELADEEQEFEQILEKAEAEAAQIIEQAEAQAAAIRE 69
Query: 282 EEEEEEEEEEEEEE 295
+ E+E + EEE E
Sbjct: 70 QIEQERAQWEEERE 83
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 30.2 bits (68), Expect = 1.1
Identities = 13/38 (34%), Positives = 17/38 (44%)
Query: 261 TKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ K+T K KKK E EE + EE EE +
Sbjct: 58 PRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEEND 95
Score = 28.6 bits (64), Expect = 3.0
Identities = 14/37 (37%), Positives = 18/37 (48%)
Query: 262 KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K K TK +KKK E EE + EE EE ++
Sbjct: 60 KKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDK 96
Score = 27.9 bits (62), Expect = 6.7
Identities = 13/38 (34%), Positives = 16/38 (42%)
Query: 261 TKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
T K+ K+ KK E E EE + EE EE
Sbjct: 57 TPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEEN 94
>gnl|CDD|184732 PRK14539, PRK14539, 50S ribosomal protein L11/unknown domain fusion
protein; Provisional.
Length = 196
Score = 29.9 bits (67), Expect = 1.1
Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 1/50 (2%)
Query: 249 EEEEEEEEEEEETKAKETKLKE-KKKLGGGGGEEEEEEEEEEEEEEEEEE 297
EE + + KAKE LK + +L G+ E EE+E E++E
Sbjct: 147 EEAKAAAKAAALAKAKEASLKSAEAELKASKGKSIEVNVIGEEDEGEKDE 196
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 30.6 bits (68), Expect = 1.2
Identities = 17/65 (26%), Positives = 39/65 (60%), Gaps = 4/65 (6%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
++EN + + + E+EE++ E+E+ K + KE +L + E+E+++ E+E++
Sbjct: 136 EAENARDRANKSGIELEQEEQKTEQEKQKTE----KEGIELANSQIKAEQEKQKTEQEKQ 191
Query: 294 EEEEE 298
+ E+E
Sbjct: 192 KTEQE 196
Score = 28.7 bits (63), Expect = 4.3
Identities = 14/49 (28%), Positives = 30/49 (61%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E E+EE++ E+ K K K + E+E+++ E+E+++ E+E++
Sbjct: 150 ELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQ 198
>gnl|CDD|240539 cd13134, MATE_like_8, Uncharacterized subfamily of the multidrug
and toxic compound extrusion (MATE) proteins. The
integral membrane proteins from the MATE family are
involved in exporting metabolites across the cell
membrane and are responsible for multidrug resistance
(MDR) in many bacteria and animals. A number of family
members are involved in the synthesis of peptidoglycan
components in bacteria.
Length = 438
Score = 30.5 bits (70), Expect = 1.2
Identities = 27/156 (17%), Positives = 57/156 (36%), Gaps = 28/156 (17%)
Query: 46 YINDNDVDAEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAF-----NQQMRSTTNL- 99
+DN V A V + I F +I+I F +++ + ++ R +
Sbjct: 32 RYSDNAVAA------VGVANQILFLLILIFSFISTGTSILIAQYLGAKDKKKARQIAGVA 85
Query: 100 LIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKMYLI--VVTAYASVYTLVLMSLD 157
L+ NL + + ++ FG ++ + + A S Y L ++
Sbjct: 86 LLFNLLLGLVFSLLLLF-------------FGAELLRLMGLPPELLALGSTY-LRIVGGS 131
Query: 158 RFLAVVHPIASMSVRTERNALKAILITWIVIVLTAV 193
FL + S +R+ + ++ I+ +L V
Sbjct: 132 LFLQALSLTLSAILRSHGRTKIPMYVSLIMNILNIV 167
>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
Length = 200
Score = 29.9 bits (67), Expect = 1.2
Identities = 10/36 (27%), Positives = 18/36 (50%)
Query: 262 KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
K ++ K + +K GG ++ E + E E E E+
Sbjct: 94 KKEKKKKETEKPAQGGEKPDQGPEAKGEGEGHEPED 129
Score = 29.9 bits (67), Expect = 1.2
Identities = 14/37 (37%), Positives = 20/37 (54%)
Query: 262 KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K KE K KE +K GG + ++ E + E E E E+
Sbjct: 93 KKKEKKKKETEKPAQGGEKPDQGPEAKGEGEGHEPED 129
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 29.5 bits (67), Expect = 1.2
Identities = 14/61 (22%), Positives = 34/61 (55%), Gaps = 8/61 (13%)
Query: 246 EVHEEEEEEEEEEEETKAKE--------TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E +++ EEE +EE + +++ T+ K+K G + EE + +E+++++++
Sbjct: 28 EKYDKREEEAKEEAKARSEPDEDGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKK 87
Query: 298 E 298
E
Sbjct: 88 E 88
Score = 27.2 bits (61), Expect = 5.6
Identities = 9/25 (36%), Positives = 14/25 (56%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVV 304
++ EEE +EE + E +E G V
Sbjct: 31 DKREEEAKEEAKARSEPDEDGWTTV 55
>gnl|CDD|183538 PRK12460, PRK12460, 2-keto-3-deoxygluconate permease; Provisional.
Length = 312
Score = 30.3 bits (69), Expect = 1.3
Identities = 8/46 (17%), Positives = 21/46 (45%), Gaps = 6/46 (13%)
Query: 60 MVSIIVPIFFGIII------ILGFFGNALVVVVVAFNQQMRSTTNL 99
+V+ ++P+ G+I+ + F +++ F + + NL
Sbjct: 166 LVAALLPLVLGMILGNLDPDMRKFLTKGGPLLIPFFAFALGAGINL 211
>gnl|CDD|218270 pfam04795, PAPA-1, PAPA-1-like conserved region. Family of
proteins with a conserved region found in PAPA-1, a
PAP-1 binding protein.
Length = 89
Score = 28.5 bits (64), Expect = 1.3
Identities = 12/46 (26%), Positives = 25/46 (54%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
E+ EEE+ ET K K + ++K ++ +E++ +E+ E+
Sbjct: 18 EKRAEEEKMETINKLLKKQARRKKREDKIKKGDEKKAQEKAARAEK 63
>gnl|CDD|220112 pfam09110, HAND, HAND. The HAND domain adopts a secondary
structure consisting of four alpha helices, three of
which (H2, H3, H4) form an L-like configuration. Helix
H2 runs antiparallel to helices H3 and H4, packing
closely against helix H4, whilst helix H1 reposes in the
concave surface formed by these three helices and runs
perpendicular to them. The domain confers DNA and
nucleosome binding properties to the protein.
Length = 109
Score = 28.8 bits (65), Expect = 1.3
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 277 GGGEEEEEEEEEEEEEEEEEEEI 299
G +EE+EEE E + + E+E+I
Sbjct: 66 VGDGDEEDEEEREAKRKLEQEKI 88
Score = 28.8 bits (65), Expect = 1.4
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 6/52 (11%)
Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ + + +EE+EEE +AK +EK E EEEEEE++ EE
Sbjct: 63 LDDVGDGDEEDEEEREAKRKLEQEKID------NAEPLTEEEEEEKQRLLEE 108
Score = 28.4 bits (64), Expect = 1.9
Identities = 12/34 (35%), Positives = 18/34 (52%)
Query: 263 AKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
K+ K G G EE+EEE E + + E+E+
Sbjct: 54 KKKIGYKVPLDDVGDGDEEDEEEREAKRKLEQEK 87
Score = 26.9 bits (60), Expect = 6.0
Identities = 14/34 (41%), Positives = 21/34 (61%), Gaps = 7/34 (20%)
Query: 271 KKKLG-------GGGGEEEEEEEEEEEEEEEEEE 297
KKK+G G G+EE+EEE E + + E+E+
Sbjct: 54 KKKIGYKVPLDDVGDGDEEDEEEREAKRKLEQEK 87
>gnl|CDD|223608 COG0534, NorM, Na+-driven multidrug efflux pump [Defense
mechanisms].
Length = 455
Score = 30.3 bits (69), Expect = 1.3
Identities = 32/159 (20%), Positives = 53/159 (33%), Gaps = 40/159 (25%)
Query: 61 VSIIVPIFFGIIIILGFFGNALVVVVVA-------FNQQMRSTTNLLIINLAIADLLFIV 113
V + PIFF II I G V+VA + R L++ L + LL I+
Sbjct: 55 VGLANPIFFLIIAIFIGLGTG-TTVLVAQAIGAGDRKKAKRVLGQGLLLALLLGLLLAIL 113
Query: 114 CCVPFTATDYVTTCWPFGDVWCKMYLI----VVTAYASVYTLVLMSLDRFLAVVHPIASM 169
F + + L+ V A+ Y ++ L + S
Sbjct: 114 -------------LLFFAEPL--LRLLGAPAEVLELAAEYLRII-----LLGAPFALLSF 153
Query: 170 S----VRTERNALKAILITWIV----IVLTAVPVFNMHG 200
+R + + I + IVL + +F + G
Sbjct: 154 VLSGILRGLGDTKTPMYILLLGNLLNIVLNYLLIFGLFG 192
>gnl|CDD|232958 TIGR00399, metG_C_term, methionyl-tRNA synthetase C-terminal
region/beta chain. The methionyl-tRNA synthetase (metG)
is a class I amino acyl-tRNA ligase. This model
describes a region of the methionyl-tRNA synthetase that
is present at the C-terminus of MetG in some species (E.
coli, B. subtilis, Thermotoga maritima, Methanobacterium
thermoautotrophicum), and as a separate beta chain in
Aquifex aeolicus. It is absent in a number of other
species (e.g. Mycoplasma genitalium, Mycobacterium
tuberculosis), while Pyrococcus horikoshii has both a
full length MetG and a second protein homologous to the
beta chain only. Proteins hit by This model should
called methionyl-tRNA synthetase beta chain if and only
if the model metG hits a separate protein not also hit
by This model [Protein synthesis, tRNA aminoacylation].
Length = 137
Score = 29.3 bits (66), Expect = 1.3
Identities = 12/31 (38%), Positives = 17/31 (54%)
Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEE 285
+++ EE K K K KEKK G E ++E
Sbjct: 1 DKKIEELKLKGAKKKEKKDEGEKALEPQKET 31
>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
Tim54. Mitochondrial function depends on the import of
hundreds of different proteins synthesised in the
cytosol. Protein import is a multi-step pathway which
includes the binding of precursor proteins to surface
receptors, translocation of the precursor across one or
both mitochondrial membranes, and folding and assembly
of the imported protein inside the mitochondrion. Most
precursor proteins carry amino-terminal targeting
signals, called pre-sequences, and are imported into
mitochondria via import complexes located in both the
outer and the inner membrane (IM). The IM complex, TIM,
is made up of at least two proteins which mediate
translocation of proteins into the matrix by removing
their signal peptide and another pair of proteins, Tim54
and Tim22, that insert the polytopic proteins, that
carry internal targetting information, into the inner
membrane.
Length = 377
Score = 30.5 bits (69), Expect = 1.3
Identities = 12/51 (23%), Positives = 17/51 (33%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
E E +E E E +A + EE E+ EEE +
Sbjct: 199 EPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKP 249
Score = 28.9 bits (65), Expect = 3.7
Identities = 14/55 (25%), Positives = 20/55 (36%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
E +E ET+ + T E E EE E+ EEE + + V
Sbjct: 200 PPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPPV 254
Score = 28.2 bits (63), Expect = 5.9
Identities = 15/60 (25%), Positives = 24/60 (40%), Gaps = 13/60 (21%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVV 304
R+ EE EEE EEE ++ ++ LG + ++ GGV+V
Sbjct: 128 RKAGEEPEEELPLEEEATKEDDLADPRQVLG-------------VYYKAKDPPGAGGVIV 174
>gnl|CDD|223605 COG0531, PotE, Amino acid transporters [Amino acid transport and
metabolism].
Length = 466
Score = 30.5 bits (69), Expect = 1.4
Identities = 17/133 (12%), Positives = 45/133 (33%), Gaps = 16/133 (12%)
Query: 61 VSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTA 120
+ +I ++ +L G + + + I+ + I L+FI+ +
Sbjct: 134 IILIALALIALLTLLNLRGIKASAKINSI---------ITILKIIIL-LIFIILGLFAFG 183
Query: 121 TDYVTTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKA 180
PF + L + F A+ +A +R +A
Sbjct: 184 FSNGNLFAPFNPGG-----GSFGGILAAILLAFFAFTGFEAIAT-LAEEVKNPKRTIPRA 237
Query: 181 ILITWIVIVLTAV 193
I+++ +++++ +
Sbjct: 238 IILSLLIVLILYI 250
>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2. This
domain family is found in eukaryotes, and is typically
between 138 and 153 amino acids in length. The family is
found in association with pfam00493. Mini-chromosome
maintenance (MCM) proteins are essential for DNA
replication. These proteins use ATPase activity to
perform this function.
Length = 145
Score = 29.2 bits (66), Expect = 1.4
Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 18/50 (36%)
Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
E+E +EEEE E++ ++ +EE EEEE+ E++ G
Sbjct: 1 EDEPDEEEEL------------------EDDIDDLDEEAEEEEDGEDLFG 32
Score = 26.9 bits (60), Expect = 8.6
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 279 GEEEEEEEEEEEEEEEEEEEIGGVVV 304
EEEE E++ ++ +EE EEE G +
Sbjct: 5 DEEEELEDDIDDLDEEAEEEEDGEDL 30
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 30.0 bits (68), Expect = 1.4
Identities = 20/93 (21%), Positives = 39/93 (41%), Gaps = 14/93 (15%)
Query: 226 DGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETK----AKETKLKEK-----KKLGG 276
Y + + G + E E E E++++T +E +L +K K+L
Sbjct: 90 KEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKEL-- 147
Query: 277 GGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDF 309
E+ ++ EE E+ +E + EI + A +
Sbjct: 148 ---EDAKKALEENEKLKELKAEIDELKKKAREI 177
>gnl|CDD|233194 TIGR00930, 2a30, K-Cl cotransporter. [Transport and binding
proteins, Other].
Length = 953
Score = 30.5 bits (69), Expect = 1.4
Identities = 15/60 (25%), Positives = 21/60 (35%), Gaps = 7/60 (11%)
Query: 256 EEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSGGWW 315
EE + K L G+ +E EE E+ E +E+ G V GW
Sbjct: 29 EELHDLLDKVVSLLGPLADYTNNGQGMKEHEEAEDAEGTKEKPPAGAV-------KFGWV 81
>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
Length = 356
Score = 30.3 bits (68), Expect = 1.5
Identities = 17/43 (39%), Positives = 25/43 (58%), Gaps = 3/43 (6%)
Query: 264 KETKLKEKKKLGGGGGEEEE---EEEEEEEEEEEEEEEIGGVV 303
KE K + +++L +EE E +E++EEEE EEEE V
Sbjct: 311 KEPKQESQEQLFNPFAIDEEMLEETQEQQEEEENEEEEENDTV 353
Score = 29.6 bits (66), Expect = 2.6
Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 256 EEEEETKAK-ETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
EE+E K + + +L + EE +E++EEEE EEEEE
Sbjct: 308 GEEKEPKQESQEQLFNPFAIDEEMLEETQEQQEEEENEEEEE 349
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 29.2 bits (66), Expect = 1.5
Identities = 9/30 (30%), Positives = 13/30 (43%)
Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEE 282
E +E K K T+ +KK E+E
Sbjct: 100 RERTKERAEKEKRTRKNREKKFKRRQKEKE 129
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 30.3 bits (68), Expect = 1.5
Identities = 13/87 (14%), Positives = 24/87 (27%)
Query: 212 TACVFLDLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEK 271
V ++ V + ++ D E E + E E+ + + +
Sbjct: 330 AEAVKAEVAEVTDEVAAESVVQVADRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQV 389
Query: 272 KKLGGGGGEEEEEEEEEEEEEEEEEEE 298
EE E +E E E
Sbjct: 390 DAEAASAAPEEPAALASEAHDETEPEV 416
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 30.4 bits (69), Expect = 1.6
Identities = 21/74 (28%), Positives = 29/74 (39%), Gaps = 8/74 (10%)
Query: 234 QSENDYCKGGEREVHEEEE-------EEEEEEEETKAKETKL-KEKKKLGGGGGEEEEEE 285
Q + K + + +E E E EEE EE +A L L E E +
Sbjct: 839 QEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQL 898
Query: 286 EEEEEEEEEEEEEI 299
E E + EE E +I
Sbjct: 899 RELERKIEELEAQI 912
Score = 29.3 bits (66), Expect = 3.5
Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 8/64 (12%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKK--------LGGGGGEEEEEEEEEEEEEEEEEE 297
+ E+E E+E +E + + LKE+ K L G E EEE EE E + E
Sbjct: 823 RLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLES 882
Query: 298 EIGG 301
+G
Sbjct: 883 RLGD 886
Score = 28.9 bits (65), Expect = 4.6
Identities = 18/69 (26%), Positives = 30/69 (43%), Gaps = 11/69 (15%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLK-----------EKKKLGGGGGEEEEEEEEEEEEE 292
E+E+ + E+EEE+ +E + E L E K+L E EE+ + EE
Sbjct: 722 EKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEAL 781
Query: 293 EEEEEEIGG 301
+ E +
Sbjct: 782 NDLEARLSH 790
>gnl|CDD|220098 pfam09057, Smac_DIABLO, Second Mitochondria-derived Activator of
Caspases. Second Mitochondria-derived Activator of
Caspases promotes apoptosis by activating caspases in
the cytochrome c/Apaf-1/caspase-9 pathway, and by
opposing the inhibitory activity of inhibitor of
apoptosis proteins (XIAP-BIR3). The protein assumes an
elongated three-helix bundle structure, and forms a
dimer in solution.
Length = 234
Score = 29.9 bits (67), Expect = 1.6
Identities = 11/42 (26%), Positives = 20/42 (47%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
EE + +E E K E+K +E +++ + E E+ E
Sbjct: 187 EEVRQLSKEAEKKLAESKAEEIQRMAEYASSIDLSELEDIPE 228
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 28.8 bits (65), Expect = 1.6
Identities = 14/54 (25%), Positives = 21/54 (38%), Gaps = 17/54 (31%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E+ E E + +EE K +EEE + + EE +E E E
Sbjct: 88 EIDERAEAQRARDEEKK-----------------LDEEEAKRQHEEAKEREREK 124
>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
This family consists of several brain acid soluble
protein 1 (BASP1) or neuronal axonal membrane protein
NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
calmodulin-binding protein of unknown function.
Length = 233
Score = 29.8 bits (66), Expect = 1.6
Identities = 18/55 (32%), Positives = 26/55 (47%), Gaps = 1/55 (1%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E +E EE + E T+ KE K +EK + EE+E E+E +EE
Sbjct: 32 EEGTPKENEEAQAAAETTEVKEAK-EEKPDKDAQDTANKTEEKEGEKEAAAAKEE 85
>gnl|CDD|143643 cd07883, RHD-n_NFkB, N-terminal sub-domain of the Rel homology
domain (RHD) of nuclear factor of kappa light
polypeptide gene enhancer in B-cells (NF-kappa B).
Proteins containing the Rel homology domain (RHD) are
metazoan transcription factors. The RHD is composed of
two structural sub-domains; this model characterizes the
N-terminal RHD sub-domain of the NF-kappa B1 and B2
families of transcription factors, also referred to as
class I members of the NF-kappa B family. In class I
NF-kappa Bs, the RHD domain co-occurs with C-terminal
ankyrin repeats. Family members include NF-kappa B1 and
NF-kappa B2. NF-kappa B1 is commonly referred to as p105
or p50 (proteolytically processed form), while NF-kappa
B2 is called p100 or p52 (proteolytically processed
form). NF-kappa B proteins are part of a protein complex
that acts as a transcription factor, which is
responsible for regulating a host of cellular responses
to a variety of stimuli. This complex tightly regulates
the expression of a large number of genes, and is
involved in processes such as adaptive and innate
immunity, stress response, inflammation, cell adhesion,
proliferation and apoptosis. The cytosolic NF-kappa B
complex is activated via phosphorylation of the
ankyrin-repeat containing inhibitory protein I-kappa B,
which dissociates from the complex and exposes the
nuclear localization signal of the heterodimer (NF-kappa
B and REL). p105 and p100 may also act as I-kappa Bs due
to their C-terminal ankyrin repeats.
Length = 197
Score = 29.4 bits (66), Expect = 1.6
Identities = 10/41 (24%), Positives = 19/41 (46%)
Query: 228 YNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKL 268
D GG+R++ +EE+ E ++ + +AK L
Sbjct: 123 GYNPGDLVHVDAEGGGDRQLTDEEQAEIRQKAKQQAKSMDL 163
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 29.3 bits (66), Expect = 1.7
Identities = 17/64 (26%), Positives = 36/64 (56%), Gaps = 6/64 (9%)
Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E DY + E E +E + ++E ++ + ++ E E++E +E+E+++E+EE
Sbjct: 117 ELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVE------ILIEDDEVDEDEDDDEDEE 170
Query: 296 EEEI 299
+EE
Sbjct: 171 DEED 174
Score = 29.3 bits (66), Expect = 1.7
Identities = 17/79 (21%), Positives = 38/79 (48%)
Query: 221 PVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGE 280
P++ D + + + ++ K E + EE +E+E + +E K +E + E
Sbjct: 83 PLDEIDEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDE 142
Query: 281 EEEEEEEEEEEEEEEEEEI 299
+E+++E E E++E +
Sbjct: 143 DEDDDEIVEILIEDDEVDE 161
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 30.2 bits (68), Expect = 1.7
Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 5/38 (13%)
Query: 264 KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
KE K K+++K EE+ EE + EE +EE EEE G
Sbjct: 277 KELKEKKEEK-----DEEKSEEVKTEEVDEEFEEEEKG 309
Score = 29.8 bits (67), Expect = 2.7
Identities = 18/55 (32%), Positives = 27/55 (49%), Gaps = 12/55 (21%)
Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAV 307
E++EE++E K++E K EE +EE EEEE+ E V + A
Sbjct: 281 EKKEEKDEEKSEEVK------------TEEVDEEFEEEEKGFYYELYEKVNIEAN 323
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 30.1 bits (68), Expect = 1.7
Identities = 11/59 (18%), Positives = 23/59 (38%), Gaps = 3/59 (5%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE---EEEEEEEI 299
E + + + EE K + ++L+ K + EE+ +E + EE+
Sbjct: 871 EGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEEL 929
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation of
60S ribosomal subunits.
Length = 132
Score = 28.9 bits (65), Expect = 1.7
Identities = 7/29 (24%), Positives = 18/29 (62%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKK 273
+++ + +++ + E+ E KAK EK++
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRAEKRQ 56
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 30.3 bits (68), Expect = 1.7
Identities = 16/61 (26%), Positives = 26/61 (42%), Gaps = 11/61 (18%)
Query: 244 EREVHEEEEEEE------EEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E+ +EE E ++ E + + EK ++ E EE E EEE E+ E
Sbjct: 339 EKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEI-----ENRNPEESEHEEEVEDYE 393
Query: 298 E 298
+
Sbjct: 394 D 394
Score = 29.2 bits (65), Expect = 3.2
Identities = 11/47 (23%), Positives = 21/47 (44%), Gaps = 8/47 (17%)
Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
E++ ++ + E ++ + EE E EEE E+ E+E
Sbjct: 357 SVPLEKQFDDILYFEKME--IENRNP------EESEHEEEVEDYEDE 395
Score = 28.8 bits (64), Expect = 4.3
Identities = 14/54 (25%), Positives = 22/54 (40%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
R ++ E+ EE E + ++ +K E+ E E EE E EE
Sbjct: 334 RNLNNEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEE 387
>gnl|CDD|180207 PRK05696, fliL, flagellar basal body-associated protein FliL;
Reviewed.
Length = 170
Score = 29.2 bits (66), Expect = 1.8
Identities = 18/70 (25%), Positives = 22/70 (31%), Gaps = 26/70 (37%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKL------------GGGGG-------EEEEEEEEEEE 290
EEE E E+E K+ KKKL GGGG ++ E
Sbjct: 3 EEEALELEDEAPKS-------KKKLIIIIVIGVLLALGGGGAAWFFMGSSDKAAAAAAEA 55
Query: 291 EEEEEEEEIG 300
E G
Sbjct: 56 AAEAAAAVKG 65
>gnl|CDD|222300 pfam13664, DUF4149, Domain of unknown function (DUF4149).
Length = 98
Score = 28.3 bits (64), Expect = 1.8
Identities = 8/49 (16%), Positives = 20/49 (40%), Gaps = 2/49 (4%)
Query: 59 RMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIA 107
+ + P++F + + L L+ ++ + R LL + L +
Sbjct: 32 ALQGKLFPVYFLLGLALAVL--LLLTELLLGGKAERWQLLLLAVLLLLT 78
>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
Length = 249
Score = 29.6 bits (67), Expect = 1.9
Identities = 12/52 (23%), Positives = 14/52 (26%), Gaps = 8/52 (15%)
Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ EE EE+EE E EE EE E
Sbjct: 203 FWRDPEEAEEKEEAAAETAGV--------EDAAAAAAEAEEGEEWVAAANNE 246
Score = 28.5 bits (64), Expect = 4.3
Identities = 10/34 (29%), Positives = 15/34 (44%)
Query: 282 EEEEEEEEEEEEEEEEEIGGVVVSAVDFGSGGWW 315
+ EE EE+EE E + +A + G W
Sbjct: 206 DPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEW 239
>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
Length = 325
Score = 29.5 bits (67), Expect = 1.9
Identities = 11/55 (20%), Positives = 21/55 (38%), Gaps = 9/55 (16%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+VH+ E+ + +E + + K EEE + + EEE +
Sbjct: 2 TLKVHDLNEDADFDENGVEVFDEKALV---------EEEPSDNDLAEEELLSQGA 47
>gnl|CDD|152814 pfam12379, DUF3655, Protein of unknown function (DUF3655). This
domain family is found in viruses, and is approximately
70 amino acids in length. The family is found in
association with pfam08716, pfam01661, pfam05409,
pfam06471, pfam08717, pfam06478, pfam09401, pfam06460,
pfam08715, pfam08710.
Length = 70
Score = 27.7 bits (61), Expect = 2.0
Identities = 23/56 (41%), Positives = 29/56 (51%), Gaps = 3/56 (5%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKK---LGGGGGEEEEEEEEEEEEEEEEE 296
E E EE E EEEE EET E ++ K L G E + EEEEEE+ ++
Sbjct: 10 EEEDCEEYECEEEEIEETCEHEYGTEDDYKGLPLEFGASTEIVQVEEEEEEDWLDD 65
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 28.2 bits (63), Expect = 2.0
Identities = 11/24 (45%), Positives = 17/24 (70%)
Query: 279 GEEEEEEEEEEEEEEEEEEEIGGV 302
+EEEEEEEEE++ E+ +E +
Sbjct: 9 DDEEEEEEEEEDDLEDLSDEDEFI 32
Score = 28.2 bits (63), Expect = 2.0
Identities = 11/20 (55%), Positives = 16/20 (80%)
Query: 280 EEEEEEEEEEEEEEEEEEEI 299
E ++EEEEEEEEE++ E+
Sbjct: 7 EVDDEEEEEEEEEDDLEDLS 26
Score = 27.4 bits (61), Expect = 2.9
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 281 EEEEEEEEEEEEEEEEEEIGGV 302
E ++EEEEEEEEE++ E++
Sbjct: 7 EVDDEEEEEEEEEDDLEDLSDE 28
Score = 27.4 bits (61), Expect = 3.8
Identities = 18/55 (32%), Positives = 28/55 (50%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E EV +EEEEEEEEE++ + + + + + E+EEEE+ E
Sbjct: 5 EAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAE 59
Score = 26.6 bits (59), Expect = 5.4
Identities = 13/28 (46%), Positives = 17/28 (60%)
Query: 282 EEEEEEEEEEEEEEEEEIGGVVVSAVDF 309
+ E E ++EEEEEEEEE +S D
Sbjct: 3 DTEAEVDDEEEEEEEEEDDLEDLSDEDE 30
Score = 26.6 bits (59), Expect = 6.1
Identities = 11/19 (57%), Positives = 16/19 (84%)
Query: 280 EEEEEEEEEEEEEEEEEEE 298
+ E E ++EEEEEEEEE++
Sbjct: 3 DTEAEVDDEEEEEEEEEDD 21
Score = 26.6 bits (59), Expect = 6.9
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 280 EEEEEEEEEEEEEEEEEEE 298
E E ++EEEEEEEEE++
Sbjct: 4 TEAEVDDEEEEEEEEEDDL 22
Score = 26.3 bits (58), Expect = 7.7
Identities = 11/31 (35%), Positives = 17/31 (54%)
Query: 281 EEEEEEEEEEEEEEEEEEIGGVVVSAVDFGS 311
+EEEEEEEEE++ E+ + + A
Sbjct: 10 DEEEEEEEEEDDLEDLSDEDEFIDEAEAEDD 40
>gnl|CDD|216828 pfam01984, dsDNA_bind, Double-stranded DNA-binding domain. This
domain is believed to bind double-stranded DNA of 20
bases length.
Length = 105
Score = 28.3 bits (64), Expect = 2.0
Identities = 9/30 (30%), Positives = 21/30 (70%)
Query: 264 KETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+ +L E ++ GG E++++EE++++EE
Sbjct: 2 RRARLAELQQQQGGLEEQQQQEEQQQQEEA 31
>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 29.4 bits (66), Expect = 2.0
Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 10/62 (16%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
++E+ E+ ++ E E KA E K E K L EEE E ++ E G +V
Sbjct: 76 QKELLEKLKDLAEVNERLKALEKKKAELKDL----------EEEREGILRSKQAEDGKLV 125
Query: 304 VS 305
Sbjct: 126 KI 127
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this family
(Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 28.0 bits (63), Expect = 2.0
Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 4/57 (7%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEK----KKLGGGGGEEEEEEEEEEEEEEEEE 296
ER+ E+ + EEE + K +++EK KK+ G G + EE+ + + + E+
Sbjct: 2 ERKEEEKAQREEELKRLKNLKREEIEEKLEKIKKVAGLRGADLSEEDLADGDFDPEK 58
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 30.1 bits (67), Expect = 2.0
Identities = 15/55 (27%), Positives = 28/55 (50%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
GE E+ +++E+ E+ E + + + ++ G E EE +E + E E EE
Sbjct: 591 GEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREE 645
Score = 28.6 bits (63), Expect = 5.8
Identities = 19/84 (22%), Positives = 37/84 (44%), Gaps = 1/84 (1%)
Query: 215 VFLDLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKL 274
+ D + N D ++ G E EE +E + ET+ +E ++K++L
Sbjct: 595 LIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETE-REENARKKEEL 653
Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
G EE + E+++ + EE+
Sbjct: 654 RGNFELEERGDPEKKDVDWYTEEK 677
>gnl|CDD|220672 pfam10277, Frag1, Frag1/DRAM/Sfk1 family. This family includes
Frag1, DRAM and Sfk1 proteins. Frag1 (FGF receptor
activating protein 1) is a protein that is conserved
from fungi to humans. There are four potential
iso-prenylation sites throughout the peptide, viz CILW,
CIIW and CIGL. Frag1 is a membrane-spanning protein that
is ubiquitously expressed in adult tissues suggesting an
important cellular function. Dram is a family of
proteins conserved from nematodes to humans with six
hydrophobic transmembrane regions and an Endoplasmic
Reticulum signal peptide. It is a lysosomal protein that
induces macro-autophagy as an effector of p53-mediated
death, where p53 is the tumour-suppressor gene that is
frequently mutated in cancer. Expression of Dram is
stress-induced. This region is also part of a family of
small plasma membrane proteins, referred to as Sfk1,
that may act together with or upstream of Stt4p to
generate normal levels of the essential phospholipid
PI4P, thus allowing proper localisation of Stt4p to the
actin cytoskeleton.
Length = 216
Score = 29.1 bits (66), Expect = 2.1
Identities = 6/57 (10%), Positives = 16/57 (28%), Gaps = 1/57 (1%)
Query: 67 IFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDY 123
I + + G L + + L ++ ++ + +PF
Sbjct: 122 IGAILFFVGGLLYMLLQTALSYRLGPTPVLRKSFRLKLILS-IIAFISAIPFGVFFI 177
Score = 29.1 bits (66), Expect = 2.1
Identities = 11/86 (12%), Positives = 30/86 (34%), Gaps = 8/86 (9%)
Query: 71 IIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTT-CWP 129
+ ++ G G + V F +T ++ A +LF V + + +
Sbjct: 93 LALVFGIIGALGLGGVANF-----QSTEDHDVHDIGA-ILFFVGGLLYMLLQTALSYRLG 146
Query: 130 FGDVWCKMYLI-VVTAYASVYTLVLM 154
V K + + ++ + + + +
Sbjct: 147 PTPVLRKSFRLKLILSIIAFISAIPF 172
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 28.7 bits (65), Expect = 2.1
Identities = 13/44 (29%), Positives = 27/44 (61%)
Query: 264 KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAV 307
K K K+KKK G EE++++ EE +E+++ + +++A+
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAALLAAI 125
>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination,
and repair].
Length = 1187
Score = 30.0 bits (68), Expect = 2.1
Identities = 16/51 (31%), Positives = 24/51 (47%)
Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVS 305
EE E+ + KL+ K EE E E E E++ ++G +VVS
Sbjct: 232 SEEVIESAYELIKLRRKLYGEKRAERVREELREVEREREKKRRKLGILVVS 282
>gnl|CDD|218301 pfam04871, Uso1_p115_C, Uso1 / p115 like vesicle tethering protein,
C terminal region. Also known as General vesicular
transport factor, Transcytosis associate protein (TAP)
and Vesicle docking protein, this myosin-shaped molecule
consists of an N-terminal globular head region, a
coiled-coil tail which mediates dimerisation, and a
short C-terminal acidic region. p115 tethers COP1
vesicles to the Golgi by binding the coiled coil
proteins giantin (on the vesicles) and GM130 (on the
Golgi), via its C-terminal acidic region. It is required
for intercisternal transport in the golgi stack. This
family consists of the acidic C-terminus, which binds to
the golgins giantin and GM130. p115 is thought to
juxtapose two membranes by binding giantin with one
acidic region, and GM130 with another.
Length = 136
Score = 28.8 bits (64), Expect = 2.2
Identities = 21/62 (33%), Positives = 33/62 (53%), Gaps = 9/62 (14%)
Query: 246 EVHEEEEEEEEEEEE--------TKAKETKLKEK-KKLGGGGGEEEEEEEEEEEEEEEEE 296
++ EE +E ++E++ + K TK KEK K LG +E E +E E+EEE
Sbjct: 73 KLDEEALKESQKEQDDLLVLLADQEEKVTKYKEKLKDLGVPVSNDEPLEAIGDESEDEEE 132
Query: 297 EE 298
+E
Sbjct: 133 DE 134
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 29.7 bits (66), Expect = 2.2
Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 8/67 (11%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKL--------KEKKKLGGGGGEEEEEEEEEEEEE 292
KG + E+EEEE+ + +E+ + KE G EE ++ E EE
Sbjct: 383 KGTSFYQAKWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEE 442
Query: 293 EEEEEEI 299
EEE
Sbjct: 443 LSPEEEE 449
>gnl|CDD|224535 COG1620, LldP, L-lactate permease [Energy production and
conversion].
Length = 522
Score = 29.5 bits (67), Expect = 2.2
Identities = 11/54 (20%), Positives = 22/54 (40%), Gaps = 4/54 (7%)
Query: 154 MSLDRFLAVVHPIASMSVR-TERN--ALKAILITWIVIVLTAVPVFNMHGEVSY 204
+ L +A++ PI + A IT ++ +L A+ F M ++
Sbjct: 2 LWLSALVALL-PIVLFFLLLVVLKLKGYVAAPITLLLTLLIALFYFKMPVVMAS 54
>gnl|CDD|188087 TIGR00841, bass, bile acid transporter. The Bile Acid:Na+
Symporter (BASS) Family (TC 2.A.28) Functionally
characterized members of the BASS family catalyze
Na+:bile acid symport. These systems have been
identified in intestinal, liver and kidney tissues of
animals. These symporters exhibit broad specificity,
taking up a variety of non bile organic compounds as
well as taurocholate and other bile salts. Functionally
uncharacterised homologues are found in plants, yeast,
archaea and bacteria [Transport and binding proteins,
Carbohydrates, organic alcohols, and acids].
Length = 286
Score = 29.6 bits (67), Expect = 2.2
Identities = 17/85 (20%), Positives = 32/85 (37%), Gaps = 8/85 (9%)
Query: 72 IIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFG 131
+I+G ++ + F + + +A + IV C P V T G
Sbjct: 39 GVIIGLLAQYGIMPLTGFLLAK-----VFKLPPELAVGVLIVGCCPGGTASNVFTYLLKG 93
Query: 132 DVWCKMYLIVVTAYASVYTLVLMSL 156
D+ I +T +++ L +M L
Sbjct: 94 DM---ALSISMTTCSTLLALGMMPL 115
>gnl|CDD|224401 COG1484, DnaC, DNA replication protein [DNA replication,
recombination, and repair].
Length = 254
Score = 29.3 bits (66), Expect = 2.3
Identities = 10/38 (26%), Positives = 16/38 (42%)
Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKE 270
+ D + G E E EEE+ E + E +L+
Sbjct: 30 LEQAADCKEWGYAEFLEYLLEEEKLAREARKIERRLRS 67
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 29.5 bits (66), Expect = 2.4
Identities = 10/30 (33%), Positives = 15/30 (50%)
Query: 270 EKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E + G EE +EE E+E E +E +
Sbjct: 333 EVVEKQEGDVVTEESTDEESEDEVEIDESV 362
Score = 28.7 bits (64), Expect = 3.8
Identities = 15/59 (25%), Positives = 21/59 (35%), Gaps = 12/59 (20%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EV E++E + EE T + E E +E EE E E E+
Sbjct: 328 IFEVVEVVEKQEGDVVTEESTDEESED------------EVEIDESVIEEVAEMELLEV 374
Score = 28.7 bits (64), Expect = 4.2
Identities = 15/65 (23%), Positives = 26/65 (40%), Gaps = 10/65 (15%)
Query: 236 ENDYCKGGEREVHE-EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
E + G+ E +EE E+E E ++ ++ E E E + ++ E
Sbjct: 333 EVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEM---------ELLEVQVDDLAERL 383
Query: 295 EEEEI 299
E EI
Sbjct: 384 AETEI 388
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 29.7 bits (67), Expect = 2.4
Identities = 20/58 (34%), Positives = 26/58 (44%), Gaps = 3/58 (5%)
Query: 245 REVHEEEEEEEEEEEETKAKE---TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+V EEE E+E E + + K KKL EEE E + E EE + EI
Sbjct: 395 SKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREI 452
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
Length = 193
Score = 28.9 bits (64), Expect = 2.4
Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 4/54 (7%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E+ E E+ ++E EE + K + E+ + EE E+EEE EE++ +EEI
Sbjct: 20 ELAEREKLKKEVEEIPEQKPEDIVEELE----DQPEEPPEQEEENEEQKPKEEI 69
Score = 27.8 bits (61), Expect = 5.9
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 7/55 (12%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
R++ E E E+ ++E + E K ++ EE E++ EE E+EEE EE
Sbjct: 16 RDIDELAEREKLKKEVEEIPEQKPEDIV-------EELEDQPEEPPEQEEENEEQ 63
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 29.5 bits (66), Expect = 2.4
Identities = 12/51 (23%), Positives = 26/51 (50%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+ E E + +E+ + E +L+++++ + E EE+ E E+EE
Sbjct: 668 DLEAEHKSLQEKISSLEDELEKERQNHQELIAKCRELEEKIERAEQEENMQ 718
>gnl|CDD|224251 COG1332, COG1332, CRISPR system related protein, RAMP superfamily
[Defense mechanisms].
Length = 369
Score = 29.3 bits (66), Expect = 2.6
Identities = 25/134 (18%), Positives = 37/134 (27%), Gaps = 25/134 (18%)
Query: 172 RTERNALKAILITWIVIVLTAVPVF-NMHGEVSYTYASELHTACVF-----------LDL 219
R R + L T + N+H + + F ++
Sbjct: 182 RAIRVSDSPALRTSVSDKRIEYKKLMNVHEKGIPLDNLPIEREFFFNVVISIRTRVEVED 241
Query: 220 DPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK------ 273
R +F + K E E+ EE E KA E L + +K
Sbjct: 242 PETKREILAIVNVFNKLD--IKIFSGERFEKTLEENFESYNQKAYEEILAKLEKVDAING 299
Query: 274 -----LGGGGGEEE 282
LG GGG
Sbjct: 300 NIELHLGFGGGWRS 313
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim
have been identified in both mice and humans (mTim and
hTIM, respectively). Mammalian TIM is not the true
orthologue of Drosophila TIM, but is the likely
orthologue of a fly gene, timeout (also called tim-2).
mTim has been shown to be essential for embryonic
development, but does not have substantiated circadian
function. Some family members contain a SANT domain in
this region.
Length = 507
Score = 29.7 bits (66), Expect = 2.6
Identities = 15/53 (28%), Positives = 28/53 (52%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
EEE EEE++ K+ ++ K+ G G +EE ++ +E+ + + GG
Sbjct: 423 EEEALGEEEQKAPPKKKQLNQKNKQQTGSGTNSDEERDDTSLDEDRDLADDGG 475
Score = 29.3 bits (65), Expect = 3.4
Identities = 17/60 (28%), Positives = 30/60 (50%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
K G EE EEE++ K K+ K K++ G G +EE ++ +E+ + ++ G
Sbjct: 416 KAGLASPEEEALGEEEQKAPPKKKQLNQKNKQQTGSGTNSDEERDDTSLDEDRDLADDGG 475
>gnl|CDD|227621 COG5305, COG5305, Predicted membrane protein [Function unknown].
Length = 552
Score = 29.3 bits (66), Expect = 2.7
Identities = 30/140 (21%), Positives = 43/140 (30%), Gaps = 19/140 (13%)
Query: 67 IFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLF--IVCCVPFTATDYV 124
I++ + G L ++V F R LI I L I P A +
Sbjct: 243 IWWLLATAAGL-LAFLPWLLVYFANSFRVGIPDLITLDGIVGQLINPI---YPLIAAWLL 298
Query: 125 TTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSV--RTERNALKAIL 182
F D+ L L + L + ++ A ++ TER L IL
Sbjct: 299 AFSLLFFDL--LHGLD-FPINGVARFLYPIWL--AIILI-LGALYTLCRATERKILLFIL 352
Query: 183 ITWIVIVLTA-----VPVFN 197
I L V VF
Sbjct: 353 TLIIAPALPLILSDLVSVFI 372
>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases.and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain.
Expression of a construct consisting of only the
N-terminal domain of Saccharomyces pombe Cdc37 results
in cellular viability. This indicates that interactions
with the cochaperone Hsp90 may not be essential for
Cdc37 function.
Length = 154
Score = 28.5 bits (64), Expect = 2.7
Identities = 16/56 (28%), Positives = 29/56 (51%), Gaps = 9/56 (16%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
++E+ E + E EE K KLK E++E ++ +E E+EE+++I
Sbjct: 96 KKELEEANGDSEGLLEELKKHRDKLK---------KEQKELRKKLDELEKEEKKKI 142
>gnl|CDD|198151 smart01083, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved region
at the N-terminal end of a family of proteins referred
to as CIRs (CBF1-interacting co-repressors). CBF1
(centromere-binding factor 1) acts as a transcription
factor that causes repression by binding specifically to
GTGGGAA motifs in responsive promoters, and it requires
CIR as a co-repressor. CIR binds to histone deacetylase
and to SAP30 and serves as a linker between CBF1 and the
histone deacetylase complex.
Length = 37
Score = 26.3 bits (59), Expect = 2.7
Identities = 8/20 (40%), Positives = 15/20 (75%)
Query: 280 EEEEEEEEEEEEEEEEEEEI 299
+ E++ EEE+++ EE +EI
Sbjct: 15 KAEQKAEEEKKKIEERRKEI 34
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 29.2 bits (66), Expect = 2.7
Identities = 19/75 (25%), Positives = 32/75 (42%), Gaps = 7/75 (9%)
Query: 226 DGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEE 285
+ Y + + +E+HE EEE E + K K++EK+K EE E
Sbjct: 285 ENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKK-FTEKIREKEK------RLEELE 337
Query: 286 EEEEEEEEEEEEEIG 300
+ EE +E ++
Sbjct: 338 QNLIEERKELNSKLE 352
>gnl|CDD|131972 TIGR02926, AhaH, ATP synthase archaeal, H subunit. he A1/A0 ATP
synthase is homologous to the V-type (V1/V0, vacuolar)
ATPase, but functions in the ATP synthetic direction as
does the F1/F0 ATPase of bacteria. The hydrophilic A1
"stalk" complex (AhaABCDEFG) is the site of ATP
generation and is coupled to the membrane-embedded
proton translocating A0 complex. It is unclear precisely
where AhaH fits into these complexes.
Length = 85
Score = 27.5 bits (61), Expect = 2.8
Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 11/77 (14%)
Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEK-----KKLGGGGGEEEEE 284
+ I ++E D E + E EEE ++ E + + +L E+ KLG +E EE
Sbjct: 2 EEIKKAEED----AEELIEEAEEERKQRIAEAREEARELLEEAEEEASKLGEEIIKEAEE 57
Query: 285 EEEEEEEE--EEEEEEI 299
E E+E E+ EE E+EI
Sbjct: 58 EIEKEAEKIREEGEKEI 74
>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
region. Members of this family have very varied
localisations within the eukaryotic cell. pinin is known
to localise at the desmosomes and is implicated in
anchoring intermediate filaments to the desmosomal
plaque. SDK2/3 is a dynamically localised nuclear
protein thought to be involved in modulation of
alternative pre-mRNA splicing. memA is a tumour marker
preferentially expressed in human melanoma cell lines. A
common feature of the members of this family is that
they may all participate in regulating protein-protein
interactions.
Length = 131
Score = 28.2 bits (63), Expect = 2.9
Identities = 16/56 (28%), Positives = 30/56 (53%), Gaps = 10/56 (17%)
Query: 244 EREVHEEEEEEEEEEEETKA-----KETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
E+++ E+E++E EE + K + K E +KL E++ E+E+ +E E
Sbjct: 41 EQKLEEQEKQEREELRKEKRELFEERRRKQLELRKL-----EQKMEDEKLQETWHE 91
>gnl|CDD|222800 PHA00458, PHA00458, single-stranded DNA-binding protein.
Length = 233
Score = 29.0 bits (65), Expect = 3.0
Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 11/57 (19%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
GGE + +E+EE+ E KA++ EEE +E++EEE EE+E+ +
Sbjct: 187 GGEDDWADEDEEDGYVASEAKARK-----------PQDEEEWDEDDEEESEEDEDGD 232
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 27.4 bits (61), Expect = 3.0
Identities = 8/41 (19%), Positives = 23/41 (56%), Gaps = 6/41 (14%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
E EE E+++++E + +K ++++++E++E
Sbjct: 46 EGEEPEDDDDDEDDDDDDDKDDKDD------DDDDDDEDDE 80
Score = 25.9 bits (57), Expect = 10.0
Identities = 5/20 (25%), Positives = 19/20 (95%)
Query: 279 GEEEEEEEEEEEEEEEEEEE 298
GEE E+++++E+++++++++
Sbjct: 47 GEEPEDDDDDEDDDDDDDKD 66
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 29.2 bits (66), Expect = 3.1
Identities = 20/59 (33%), Positives = 29/59 (49%), Gaps = 9/59 (15%)
Query: 243 GERE----VHEEEEEEEEEEEETK-AKETKLKEKKKLGG----GGGEEEEEEEEEEEEE 292
GE+ + E E E++EEE K K+ K + L GE + EEEEE++ E
Sbjct: 85 GEQGLQRKLKELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQME 143
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 29.4 bits (66), Expect = 3.1
Identities = 15/76 (19%), Positives = 31/76 (40%)
Query: 228 YNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE 287
NK Y + E E EEEE+E+++++ + ++ E +
Sbjct: 120 NNKQFEDDITRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDR 179
Query: 288 EEEEEEEEEEEIGGVV 303
+E++EE+E+
Sbjct: 180 VKEDDEEDEDADLSKK 195
Score = 27.8 bits (62), Expect = 8.9
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 281 EEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSG 312
E+ E E+EEEEE+E++++ G + G G
Sbjct: 133 EDPESEDEEEEEDEDDDDDGSDDEDEDEDGVG 164
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 29.2 bits (66), Expect = 3.1
Identities = 15/55 (27%), Positives = 27/55 (49%), Gaps = 6/55 (10%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+E EE + EEE + + + +K+L E E E+EE ++ EE++
Sbjct: 195 KEKREELLSKLEEELLARLESKEAALEKQL------RLEFEREKEELRKKYEEKL 243
>gnl|CDD|226341 COG3820, COG3820, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 230
Score = 28.7 bits (64), Expect = 3.2
Identities = 24/75 (32%), Positives = 32/75 (42%), Gaps = 6/75 (8%)
Query: 214 CVFLDLDP-VNRPDGYNKTIFQSENDYCKGGEREVHE-EEEEEEEEEEETKAKETKLKEK 271
C +DLD V R T + E E + EEE EEE E + A KL
Sbjct: 160 CSQIDLDKEVERASRGRPTAKELGPTLLPASETEQLDYEEEREEERELDADAVFAKLSSL 219
Query: 272 KKLGGGGGEEEEEEE 286
K G ++EE+E+
Sbjct: 220 K----SGNKDEEDED 230
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 29.2 bits (66), Expect = 3.3
Identities = 20/56 (35%), Positives = 34/56 (60%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E ++ EEEEEE E EE + +E + ++K+K G + E +E++ E+ EE E+
Sbjct: 17 EEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIEDLLEELEL 72
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 29.0 bits (66), Expect = 3.4
Identities = 17/34 (50%), Positives = 19/34 (55%)
Query: 265 ETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E + + EE EEEEEEEEEEE EE E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAE 256
Score = 28.6 bits (65), Expect = 3.8
Identities = 23/53 (43%), Positives = 24/53 (45%), Gaps = 19/53 (35%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
R+ E EEEEEE EEEEEEEEEEE EE E E
Sbjct: 225 RQGRLAEAAEEEEEEA-------------------EEEEEEEEEEEAEEAEAE 258
Score = 27.8 bits (63), Expect = 6.3
Identities = 21/51 (41%), Positives = 21/51 (41%), Gaps = 19/51 (37%)
Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E EEEEE E EEEEEEEEEEE EE E
Sbjct: 226 QGRLAEAAEEEEE-------------------EAEEEEEEEEEEEAEEAEA 257
Score = 27.8 bits (63), Expect = 6.3
Identities = 22/53 (41%), Positives = 23/53 (43%), Gaps = 19/53 (35%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+ G EEEEEE EEEE EEEEEEE EE E E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEE-------------------EEEEEEEAEEAEAE 258
Score = 27.4 bits (62), Expect = 8.6
Identities = 22/53 (41%), Positives = 22/53 (41%), Gaps = 20/53 (37%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
R EEEEEE EEE EEEEEEEE EE E E
Sbjct: 226 QGRLAEAAEEEEEEAEEE--------------------EEEEEEEEAEEAEAE 258
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 28.8 bits (64), Expect = 3.4
Identities = 10/54 (18%), Positives = 22/54 (40%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
R+ ++++ K + K G G E +++E + +EEE+
Sbjct: 236 RQTRGFRSDDDQGYGRYKQNLSSNKYGDFKGVNYGYESDDDEGSSSNDYDEEED 289
>gnl|CDD|218358 pfam04979, IPP-2, Protein phosphatase inhibitor 2 (IPP-2). Protein
phosphotase inhibitor 2 (IPP-2) is a phosphoprotein
conserved among all eukaryotes, and it appears in both
the nucleus and cytoplasm of tissue culture cells.
Length = 123
Score = 27.8 bits (62), Expect = 3.4
Identities = 14/51 (27%), Positives = 19/51 (37%), Gaps = 2/51 (3%)
Query: 249 EEEEEEEEEEEETKAKE--TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
E+ + E E E + KL + EE E E+EE EE
Sbjct: 40 EDSDSESEGNESLTPESLAEKLAAAESSDPSFSIEESESSSSEDEEFSPEE 90
Score = 27.0 bits (60), Expect = 7.0
Identities = 12/56 (21%), Positives = 22/56 (39%), Gaps = 4/56 (7%)
Query: 247 VHEEEEEEEEEEEETKAKETKLKEKK----KLGGGGGEEEEEEEEEEEEEEEEEEE 298
++E+ + E E ++ + +K + EE E E+EE EE
Sbjct: 35 HRMDDEDSDSESEGNESLTPESLAEKLAAAESSDPSFSIEESESSSSEDEEFSPEE 90
>gnl|CDD|234397 TIGR03920, T7SS_EccD, type VII secretion integral membrane protein
EccD. Members of this family are EccD, a component of
actinobacterial type VII secretion systems (T7SS) with
ten to eleven predicted transmembrane helix regions
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 431
Score = 28.9 bits (65), Expect = 3.6
Identities = 28/145 (19%), Positives = 51/145 (35%), Gaps = 23/145 (15%)
Query: 62 SIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTAT 121
+ ++G I G L+V+ A + + ++L A L+ + A
Sbjct: 135 GRGLGGWWGAPIAAG--VIGLLVLAAAVVARTLRDRAVAALSLLGAALVLV-----AVAG 187
Query: 122 DYVTTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSV--------RT 173
G + L+ T A++ L+L R LAV + +++V
Sbjct: 188 ----ALAVPGALGVPHVLLAGTVLAALAVLLLRLTGRGLAVFTAVVTVAVILALAAAVGM 243
Query: 174 ER----NALKAILITWIVIVLTAVP 194
+ A LI + VL+A P
Sbjct: 244 LWQTPVPGIAAGLILLGLAVLSAAP 268
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 29.2 bits (65), Expect = 3.6
Identities = 16/54 (29%), Positives = 25/54 (46%), Gaps = 4/54 (7%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEE----EEEEEEEEEEEEEEEEI 299
+ EEE+ E++ + + K + + G E E EE E E E EEE+
Sbjct: 126 SDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEEL 179
>gnl|CDD|218637 pfam05558, DREPP, DREPP plasma membrane polypeptide. This family
contains several plant plasma membrane proteins termed
DREPPs as they are developmentally regulated plasma
membrane polypeptides.
Length = 201
Score = 28.5 bits (63), Expect = 3.7
Identities = 16/47 (34%), Positives = 24/47 (51%), Gaps = 11/47 (23%)
Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EE++ EEE A ET+ + +E+E EEE++EEE
Sbjct: 132 EEKKPEEEAPAAETE-----------KKPAVKEKEIVVEEEKKEEEA 167
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 29.0 bits (65), Expect = 3.7
Identities = 12/46 (26%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 254 EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E+EE EE E + +E+++L ++EEEE++ + + ++ +
Sbjct: 143 EQEELEEALEFEKEEEEQRRL-----LLQKEEEEQQMNKRKNKQAL 183
>gnl|CDD|217525 pfam03383, Serpentine_r_xa, Caenorhabditis serpentine receptor-like
protein, class xa. This family contains various
Caenorhabditis proteins, some of which are annotated as
being serpentine receptors, mainly of the xa class.
Length = 153
Score = 28.1 bits (63), Expect = 3.7
Identities = 13/52 (25%), Positives = 23/52 (44%), Gaps = 1/52 (1%)
Query: 146 ASVYTLVLMSLDRFLAVVHPIASMSVRTERN-ALKAILITWIVIVLTAVPVF 196
+ VLM+L+R V+ P + T+R I + ++L +P F
Sbjct: 57 HPLCLSVLMTLNRISIVISPFSQPIWFTDRRLFGYCAGIAILSLILLLIPYF 108
>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
protein. This domain family is found in eukaryotes, and
is typically between 212 and 238 amino acids in length.
The family is found in association with pfam01805. There
are two completely conserved residues (W and H) that may
be functionally important. PRP21 is required for
assembly of the prespliceosome and it interacts with U2
snRNP and/or pre-mRNA in the prespliceosome. This family
also contains proteins similar to PRP21, such as the
mammalian SF3a. SF3a also interacts with U2 snRNP from
the prespliceosome, converting it to its active form.
Length = 230
Score = 28.6 bits (64), Expect = 3.7
Identities = 14/30 (46%), Positives = 19/30 (63%)
Query: 268 LKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
L+E EEE EEEEE+ +EE++EE
Sbjct: 96 LEEPDAAAAPEEEEEMEEEEEDIDEEKKEE 125
Score = 27.8 bits (62), Expect = 6.8
Identities = 13/39 (33%), Positives = 19/39 (48%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGG 279
+ E E EE+ +EE++EE K +KE K G
Sbjct: 107 EEEEMEEEEEDIDEEKKEESAKKSTPPIKEMKIPAAGES 145
Score = 27.8 bits (62), Expect = 7.1
Identities = 13/36 (36%), Positives = 20/36 (55%)
Query: 260 ETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
E K+K + + E EEEEE+ +EE++EE
Sbjct: 90 EQKSKALEEPDAAAAPEEEEEMEEEEEDIDEEKKEE 125
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
kDa subunit. This family contains several DNA-directed
RNA polymerase 19 kDa polypeptides. The Poxvirus
DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
DNA-template-directed extension of the 3'-end of an RNA
strand by one nucleotide at a time.
Length = 167
Score = 28.1 bits (63), Expect = 3.8
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
E ++++ EE EEEEE+EE+ + S V
Sbjct: 11 ESDDDDSEEYEEEEEDEEDAESLESSDVS 39
>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal. The C-terminal domain of
FCP-1 is required for interaction with the carboxy
terminal domain of RAP74. Interaction relies extensively
on van der Waals contacts between hydrophobic residues
situated within alpha-helices in both domains.
Length = 263
Score = 28.8 bits (64), Expect = 3.8
Identities = 20/89 (22%), Positives = 33/89 (37%), Gaps = 19/89 (21%)
Query: 232 IFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLG----------GGGG-- 279
I +D +++ EEE+E+ + K + + + G GG G
Sbjct: 155 ILGEGSDDSDSEKKKPPGNEEEQEQAPQPRKQQPPGPRREPDEGTPSSSERSNIGGRGPR 214
Query: 280 -------EEEEEEEEEEEEEEEEEEEIGG 301
EE+EE+ E E E E+ G
Sbjct: 215 GHKRKLNEEDEEDAESESSFESSNEDEEG 243
>gnl|CDD|221753 pfam12753, Nro1, Nuclear pore complex subunit Nro1. In fission
yeast, this protein is a positive regulator of the
stability of Sre1N, the sterol regulatory
element-binding protein which is an ER membrane-bound
transcription factor that controls adaptation to low
oxygen-growth. In addition, the fission yeast Nro1 is a
direct inhibitor of a protein that inhibits SreN1
degradation, Ofd1 (an oxoglutamate deoxygenase). The
outcome of this reactivity is that Ofd1 acts as an
oxygen sensor that regulates the binding of Nro1 to Ofd1
to control the stability of Sre1N. Solution of the
structure of Nro1 reveals it to be made up of a number
of TPR coils.
Length = 401
Score = 28.7 bits (64), Expect = 3.9
Identities = 11/21 (52%), Positives = 16/21 (76%)
Query: 279 GEEEEEEEEEEEEEEEEEEEI 299
G+E E +E + +EEEEEEE+
Sbjct: 219 GKENEIDEGLDSDEEEEEEEV 239
>gnl|CDD|224667 COG1753, COG1753, Predicted antotoxin, copG family [Signal
transduction mechanisms].
Length = 74
Score = 26.7 bits (59), Expect = 3.9
Identities = 17/59 (28%), Positives = 31/59 (52%), Gaps = 5/59 (8%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKK-----LGGGGGEEEEEEEEEEEEEEEEEEEI 299
+V+E+ + + +E +L EKK+ L G EEE E+ ++EE+E E++
Sbjct: 11 DVYEKLVKMKRGKESFSDVIRELIEKKRGNLEVLMRAFGTLSEEEVEKIKKEEKEAEKM 69
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 29.3 bits (66), Expect = 3.9
Identities = 21/70 (30%), Positives = 32/70 (45%), Gaps = 22/70 (31%)
Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
K IF+ N K E ++ + EE+ TK K+KK E+++EEE
Sbjct: 722 KKIFR--NWNGKDAEFKISDSVEEK-----------TKKKKKK---------EKKKEEEY 759
Query: 290 EEEEEEEEEI 299
+ EE+ EI
Sbjct: 760 KREEKARIEI 769
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 29.0 bits (65), Expect = 3.9
Identities = 20/87 (22%), Positives = 39/87 (44%), Gaps = 7/87 (8%)
Query: 219 LDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEE-------EEEETKAKETKLKEK 271
L+ + R + + Q+E YCK + + E+E EEE +EEE +A + +EK
Sbjct: 368 LECMLRYQSSQRELKQTEAAYCKRLDEKRLFEDEAEEEARQRLADDEEEVRAGDEAREEK 427
Query: 272 KKLGGGGGEEEEEEEEEEEEEEEEEEE 298
++EE + + +++
Sbjct: 428 IAANSQVIDKEEVCNLYDRRDTAWQKQ 454
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 28.2 bits (63), Expect = 3.9
Identities = 13/59 (22%), Positives = 22/59 (37%), Gaps = 8/59 (13%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
ERE+ EE E EEE E + G + +E E++ ++ +
Sbjct: 68 ERELKEEAEAEEEGEVDA--------SPDAGAVAGESSADRKEAEQQGAAQKRKSCRDK 118
>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
Provisional.
Length = 130
Score = 27.9 bits (62), Expect = 4.0
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKL 274
++EEE+EEE E +E L E + L
Sbjct: 96 KKEEEKEEEIPEPTKEEELLGEIRDL 121
Score = 27.5 bits (61), Expect = 4.7
Identities = 15/33 (45%), Positives = 18/33 (54%), Gaps = 6/33 (18%)
Query: 267 KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
KL KK+ EE+EEE E +EEE EI
Sbjct: 92 KLTSKKE------EEKEEEIPEPTKEEELLGEI 118
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 28.9 bits (65), Expect = 4.0
Identities = 19/57 (33%), Positives = 21/57 (36%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
E E EE EE EE E L E+ L E EE EE E E +
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRS 911
>gnl|CDD|235246 PRK04184, PRK04184, DNA topoisomerase VI subunit B; Validated.
Length = 535
Score = 29.1 bits (66), Expect = 4.0
Identities = 15/45 (33%), Positives = 20/45 (44%)
Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E+EE + + KL KK EEEEEE E + E+
Sbjct: 491 EKEEPDIEEVLAKLIGKKLEEEEVEEEEEEEAVVVESAKNYTEKD 535
>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
Length = 815
Score = 29.0 bits (65), Expect = 4.0
Identities = 20/51 (39%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
KGG R+ E +E E E+ KAK K K++ G G G+ E+ +E EE
Sbjct: 18 KGGTRKQKAENKEHEGEQSPKKAK--KEKKQDDSGNGNGKSAEDAVKEFEE 66
>gnl|CDD|216887 pfam02124, Marek_A, Marek's disease glycoprotein A.
Length = 211
Score = 28.5 bits (64), Expect = 4.1
Identities = 11/54 (20%), Positives = 18/54 (33%), Gaps = 5/54 (9%)
Query: 36 PAICRNFSHEYINDNDVDAEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAF 89
P F D A ++S+I G+I L G +V+ +
Sbjct: 161 PDGLPVFEDTGTYDASPSAVTGPMVISVI-----GVICGLVALGLLVVLAALCL 209
>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
Length = 177
Score = 28.3 bits (64), Expect = 4.2
Identities = 7/18 (38%), Positives = 15/18 (83%)
Query: 282 EEEEEEEEEEEEEEEEEI 299
++++EEE EEEE ++++
Sbjct: 147 SDDDDEEESEEEESDDDL 164
Score = 28.0 bits (63), Expect = 5.3
Identities = 8/19 (42%), Positives = 15/19 (78%)
Query: 279 GEEEEEEEEEEEEEEEEEE 297
G ++++EEE EEEE +++
Sbjct: 145 GISDDDDEEESEEEESDDD 163
Score = 27.6 bits (62), Expect = 6.4
Identities = 8/19 (42%), Positives = 15/19 (78%)
Query: 278 GGEEEEEEEEEEEEEEEEE 296
G ++++EEE EEEE +++
Sbjct: 145 GISDDDDEEESEEEESDDD 163
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 29.0 bits (65), Expect = 4.3
Identities = 21/65 (32%), Positives = 28/65 (43%), Gaps = 3/65 (4%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+E D K E E+ +E+ +E E E L E K L + EE +E E
Sbjct: 259 DTEEDRTKEREAELEALQEQIDELESSI---EEVLSEIKALASKIKQVNEELTTVRQENE 315
Query: 294 EEEEE 298
E EEE
Sbjct: 316 ELEEE 320
Score = 28.2 bits (63), Expect = 7.4
Identities = 18/56 (32%), Positives = 26/56 (46%), Gaps = 8/56 (14%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+ ++ E E EE E KA +K+K+ EE +E EE EEE +I
Sbjct: 276 QEQIDELESSIEEVLSEIKALASKIKQV--------NEELTTVRQENEELEEEYKI 323
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 28.8 bits (64), Expect = 4.3
Identities = 12/57 (21%), Positives = 27/57 (47%), Gaps = 9/57 (15%)
Query: 244 EREVHEEEEE-EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+V+ EE E E E + ++ +K++K ++ EE++ E +++I
Sbjct: 85 VADVNNGEEFLSESESEASLEIDSDIKDEK--------QKSLEEQKIAPEIPVKQQI 133
>gnl|CDD|179798 PRK04239, PRK04239, hypothetical protein; Provisional.
Length = 110
Score = 27.2 bits (61), Expect = 4.5
Identities = 14/44 (31%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 256 EEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EE EE + + KL+E +K + +EE+EE + + E +++ I
Sbjct: 1 EELEEIRRR--KLEELQKQAQEQQQAQEEQEEAQAQAEAQKQAI 42
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 28.9 bits (64), Expect = 4.5
Identities = 18/51 (35%), Positives = 26/51 (50%), Gaps = 8/51 (15%)
Query: 226 DGYNKTI--FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKL 274
DG F ++ E E EEE+EEEE +E + AK K K++K +
Sbjct: 758 DGEQDNSDSFAESSE-----EDESSEEEKEEEENKEVS-AKRAKKKQRKNM 802
>gnl|CDD|226122 COG3594, NolL, Fucose 4-O-acetylase and related acetyltransferases
[Carbohydrate transport and metabolism].
Length = 343
Score = 28.5 bits (64), Expect = 4.5
Identities = 29/153 (18%), Positives = 48/153 (31%), Gaps = 18/153 (11%)
Query: 58 ERMVSIIVP--IFFGIIIILGFF-GNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVC 114
++ ++IVP FF I IL F + + F L+I+ LLFI
Sbjct: 69 KKARTLIVPYIFFFLIYSILYFLLRKFNSELTLNFILPGWELWFLVILFFWFLPLLFITK 128
Query: 115 CVPFTATDYVTTCWPFGDVWCKM------YLIVVTAYASVYTLV--LMSLDRFL--AVVH 164
+ + W V + + I ++ + TLV L L +
Sbjct: 129 LIYN-----LIAVWLAVSVSLIILILVGFFFIPLSLGSFSRTLVFFPFFLLGTLFAKLHF 183
Query: 165 PIASMSVRTERNALKAILITWIVIVLTAVPVFN 197
+ + + VI L FN
Sbjct: 184 WFGGLLRTYKGLRVNIKSFLAAVIFLAVGLAFN 216
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 28.9 bits (65), Expect = 4.5
Identities = 20/105 (19%), Positives = 35/105 (33%), Gaps = 4/105 (3%)
Query: 193 VPVFNMHGEVSYTYASELHTACVFLDLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEE 252
V +FN +E+ + + + ++ I E +
Sbjct: 773 VSLFNNPNAEKILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMR 832
Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
EE E + K K + K +L E +E EEE + E + E
Sbjct: 833 PEELELINEEQKNLKQEIKLEL----SEIQEAEEEIQNINENKNE 873
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 27.6 bits (62), Expect = 4.7
Identities = 15/38 (39%), Positives = 23/38 (60%), Gaps = 6/38 (15%)
Query: 242 GGEREVHEEEEEEEEEEEETKAKETK------LKEKKK 273
G E EEE++E+E+++ K KE + L+EKKK
Sbjct: 67 GEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKK 104
>gnl|CDD|183731 PRK12766, PRK12766, 50S ribosomal protein L32e; Provisional.
Length = 232
Score = 28.3 bits (63), Expect = 4.9
Identities = 16/32 (50%), Positives = 18/32 (56%)
Query: 271 KKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
K +GG EE E E E+E EEEEEE V
Sbjct: 53 KADVGGLEVSEETEAEVEDEGGEEEEEEDADV 84
Score = 28.3 bits (63), Expect = 5.5
Identities = 16/50 (32%), Positives = 23/50 (46%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
G EV EE E E E+E + +E + +L G E+ E +EE
Sbjct: 57 GGLEVSEETEAEVEDEGGEEEEEEDADVETELRPRGLTEKTPELSDEEAR 106
>gnl|CDD|227225 COG4888, COG4888, Uncharacterized Zn ribbon-containing protein
[General function prediction only].
Length = 104
Score = 27.1 bits (60), Expect = 5.1
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 275 GGGGGEEEEEEEEEEEEEEEEEE 297
G G GE +EE ++E E EE
Sbjct: 80 GRGCGESGTDEENDQEIESPGEE 102
Score = 26.7 bits (59), Expect = 6.8
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 276 GGGGEEEEEEEEEEEEEEEEEEEI 299
G G E +EE ++E E EE+
Sbjct: 80 GRGCGESGTDEENDQEIESPGEEV 103
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 28.6 bits (63), Expect = 5.4
Identities = 22/71 (30%), Positives = 30/71 (42%)
Query: 229 NKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
N + EN E ++EE EE EE + E ++E + EE EE E
Sbjct: 1018 NVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIEENIE 1077
Query: 289 EEEEEEEEEEI 299
E EE EE +
Sbjct: 1078 ENIEENVEENV 1088
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 28.5 bits (64), Expect = 5.4
Identities = 20/105 (19%), Positives = 40/105 (38%), Gaps = 10/105 (9%)
Query: 62 SIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTAT 121
+ G + I+ ++V + R+ LL+I + A L ++ V T
Sbjct: 387 DGLASDVIGQLGIIAALLAPALLVARLKDAAWRAYPRLLLITVLSAAPLALI--VA-ALT 443
Query: 122 DYVTTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPI 166
Y+ T + L +V A ++T ++ + R L+V
Sbjct: 444 GYLYT-------ALFLALRLVVTVAILFTWYVLYILRGLSVSARR 481
>gnl|CDD|148790 pfam07376, Prosystemin, Prosystemin. This family consists of
several plant specific prosystemin proteins. Prosystemin
is the precursor protein of the 18 amino acid wound
signal systemin which activates systemic defence in
plant leaves against insect herbivores.
Length = 193
Score = 28.1 bits (61), Expect = 5.5
Identities = 21/67 (31%), Positives = 33/67 (49%)
Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
+T Q + + E EE +E+E+ KET K K+ G +E+ + E EE
Sbjct: 37 ETPSQDIKNKDDDAQEIPKVEHEEGGDEKEKIIEKETISKCIIKIEGDDAQEKIKVEYEE 96
Query: 290 EEEEEEE 296
EE E+E+
Sbjct: 97 EECEKEK 103
>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
Length = 586
Score = 28.6 bits (64), Expect = 5.5
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 268 LKEKKKLGGGGGEEEEEEEEEEEEEEE 294
L++K++ GEE +E+EEE+ E+E
Sbjct: 52 LEQKEEKEEDAGEEGDEDEEEQGEDEH 78
Score = 27.8 bits (62), Expect = 8.7
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 267 KLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
+L E+K+ EE +E+EEE+ E+E
Sbjct: 50 QLLEQKEEKEEDAGEEGDEDEEEQGEDEH 78
>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
(CLPTM1). This family consists of several eukaryotic
cleft lip and palate transmembrane protein 1 sequences.
Cleft lip with or without cleft palate is a common birth
defect that is genetically complex. The nonsyndromic
forms have been studied genetically using linkage and
candidate-gene association studies with only partial
success in defining the loci responsible for orofacial
clefting. CLPTM1 encodes a transmembrane protein and has
strong homology to two Caenorhabditis elegans genes,
suggesting that CLPTM1 may belong to a new gene family.
This family also contains the human cisplatin resistance
related protein CRR9p which is associated with
CDDP-induced apoptosis.
Length = 437
Score = 28.4 bits (64), Expect = 5.6
Identities = 13/34 (38%), Positives = 18/34 (52%)
Query: 261 TKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
T K +KK L GG E+EE EEE+ ++
Sbjct: 145 TTYLPKKKVKKKNLLGGKSEKEEPEEEKTPAPDK 178
>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain.
Length = 225
Score = 28.2 bits (63), Expect = 5.6
Identities = 19/81 (23%), Positives = 34/81 (41%), Gaps = 15/81 (18%)
Query: 220 DPVNRPDGY-NKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGG 278
+ P G N +Q +D G + + + E EE E + K +E++ L
Sbjct: 73 NEAVAPGGTINGDGYQEPDDPTSGYA-SISQADRVEGEEPESIR----KWRERRDL---- 123
Query: 279 GEEEEEEEEEEEEEEEEEEEI 299
EE +E E+++EE+
Sbjct: 124 -----RIEERDEASEKKKEEL 139
>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
Length = 103
Score = 27.1 bits (60), Expect = 5.7
Identities = 22/65 (33%), Positives = 33/65 (50%)
Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
K I + + K E E+ ++ EEE ++ E K KE + + KK L G E EE + + E
Sbjct: 27 KKIIRKAKEEAKKIEEEIIKKAEEEAQKLIEKKKKEGEEEAKKILEEGEKEIEELKVKAE 86
Query: 290 EEEEE 294
E E
Sbjct: 87 ENFET 91
>gnl|CDD|227635 COG5325, COG5325, t-SNARE complex subunit, syntaxin [Intracellular
trafficking and secretion].
Length = 283
Score = 28.3 bits (63), Expect = 5.7
Identities = 15/55 (27%), Positives = 25/55 (45%), Gaps = 7/55 (12%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
+ H EEEE+EE + + L+++ EE E ++ E +EEI
Sbjct: 151 DQHPLEEEEDEESLSSLGSQQTLQQQGLS-------NEELEYQQILITERDEEIK 198
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 28.1 bits (63), Expect = 5.8
Identities = 16/53 (30%), Positives = 22/53 (41%)
Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EE E+ E+E KE + +K L EEE E EE +E +
Sbjct: 113 AELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEIREEGQELSSKR 165
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 28.5 bits (63), Expect = 5.8
Identities = 13/43 (30%), Positives = 19/43 (44%)
Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
V EEE + E +E K +EK+K G E +E+
Sbjct: 82 VKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDED 124
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 28.1 bits (63), Expect = 5.8
Identities = 19/56 (33%), Positives = 29/56 (51%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+E+ EE EE+ E E +E K + KK E+ + + E E+EE EEE+
Sbjct: 57 AKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWEEEL 112
>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1). This family
consists of several mammalian dentin matrix protein 1
(DMP1) sequences. The dentin matrix acidic
phosphoprotein 1 (DMP1) gene has been mapped to human
chromosome 4q21. DMP1 is a bone and teeth specific
protein initially identified from mineralised dentin.
DMP1 is primarily localised in the nuclear compartment
of undifferentiated osteoblasts. In the nucleus, DMP1
acts as a transcriptional component for activation of
osteoblast-specific genes like osteocalcin. During the
early phase of osteoblast maturation, Ca(2+) surges into
the nucleus from the cytoplasm, triggering the
phosphorylation of DMP1 by a nuclear isoform of casein
kinase II. This phosphorylated DMP1 is then exported out
into the extracellular matrix, where it regulates
nucleation of hydroxyapatite. DMP1 is a unique molecule
that initiates osteoblast differentiation by
transcription in the nucleus and orchestrates
mineralised matrix formation extracellularly, at later
stages of osteoblast maturation. The DMP1 gene has been
found to be ectopically expressed in lung cancer
although the reason for this is unknown.
Length = 514
Score = 28.5 bits (63), Expect = 5.8
Identities = 13/54 (24%), Positives = 23/54 (42%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
R+ EE+ E +++ E K + G E E +E+ EE + +E
Sbjct: 270 RKSRISEEDGRGELDDSNTMEVKSDSTENAGLSQSREHSRSESQEDSEENQSQE 323
>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal. This is the N terminal of
the DNA repair protein Rad54.
Length = 191
Score = 27.7 bits (62), Expect = 5.9
Identities = 8/20 (40%), Positives = 14/20 (70%)
Query: 279 GEEEEEEEEEEEEEEEEEEE 298
+ + EEE+ E+++E EE E
Sbjct: 143 DKPKIEEEKAEKDQEPEESE 162
Score = 27.7 bits (62), Expect = 5.9
Identities = 8/24 (33%), Positives = 15/24 (62%)
Query: 280 EEEEEEEEEEEEEEEEEEEIGGVV 303
+ EEE+ E+++E EE E ++
Sbjct: 146 KIEEEKAEKDQEPEESETKLSNGP 169
>gnl|CDD|233199 TIGR00939, 2a57, Equilibrative Nucleoside Transporter (ENT).
[Transport and binding proteins, Nucleosides, purines
and pyrimidines].
Length = 437
Score = 28.1 bits (63), Expect = 5.9
Identities = 14/83 (16%), Positives = 30/83 (36%), Gaps = 7/83 (8%)
Query: 118 FTATDYVTTCWPF-GDVW------CKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMS 170
TA Y + + ++ KM+ T Y L + + + ++
Sbjct: 16 ITAPQYFIEYYKYAQNIPEAIPSSSKMWKHFNTYYTLASQLPSLLFNSLNLFLIFRIPVT 75
Query: 171 VRTERNALKAILITWIVIVLTAV 193
VR + +++ +V+VL V
Sbjct: 76 VRLLGGLVILLVVVILVMVLVKV 98
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 28.4 bits (64), Expect = 5.9
Identities = 12/56 (21%), Positives = 21/56 (37%), Gaps = 5/56 (8%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E E + E+ + + KE +L ++ E E + E + EEI
Sbjct: 15 SYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQ-----RLEAEGLRLKREVDRLREEI 65
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 27.2 bits (61), Expect = 6.1
Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 20/69 (28%)
Query: 245 REVHEEEEEEEEEEEETKAKETKLKE--------------KKKLGGGGGEEEEEEEEEEE 290
E EE ++ EEE + KE +L+E K++ E++ EEE++
Sbjct: 21 EEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRR------RAEKKAEEEKK 74
Query: 291 EEEEEEEEI 299
+E+EEEI
Sbjct: 75 LRKEKEEEI 83
>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking. Sec14-like
Golgi-trafficking domain The GOLD domain is always found
combined with lipid- or membrane-association domains.
Length = 136
Score = 27.3 bits (61), Expect = 6.1
Identities = 13/24 (54%), Positives = 15/24 (62%)
Query: 279 GEEEEEEEEEEEEEEEEEEEIGGV 302
+EEEEEE EEEE E + E G
Sbjct: 55 SDEEEEEEAEEEEAETGDVEAGSK 78
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 28.1 bits (63), Expect = 6.1
Identities = 18/128 (14%), Positives = 36/128 (28%), Gaps = 14/128 (10%)
Query: 184 TWIVIVLTAVPVFNMHGEVSYTYASELHTACVFLDLDPVNRPDGYNKTIFQSENDYCKGG 243
+ ++++L + + V + L + E
Sbjct: 4 SILLVILLLLIGLLLGVLVVLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLR 63
Query: 244 EREVHEEEEEEEEEEE--------------ETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
ER +E E EE + +A++ E + E E EE
Sbjct: 64 ERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEE 123
Query: 290 EEEEEEEE 297
E++ + E
Sbjct: 124 LEKQLDNE 131
>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2). All proteins
in this family for which functions are known are flap
endonucleases that generate the 3' incision next to DNA
damage as part of nucleotide excision repair. This
family is related to many other flap endonuclease
families including the fen1 family. This family is based
on the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1034
Score = 28.3 bits (63), Expect = 6.2
Identities = 14/63 (22%), Positives = 26/63 (41%), Gaps = 2/63 (3%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEE--EEEEEEEEEEEEEEEIGGVVVSA 306
+ E + +K++ T E+ E E +E E+ +E EE+ I G++
Sbjct: 664 DSVSSTLELQVPSKSQPTDESEENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEE 723
Query: 307 VDF 309
D
Sbjct: 724 KDA 726
Score = 28.3 bits (63), Expect = 6.7
Identities = 12/67 (17%), Positives = 25/67 (37%)
Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+ E + E + + E++ + K+ GE +E E+ +E
Sbjct: 652 EESESDGSFIEVDSVSSTLELQVPSKSQPTDESEENAENKVASIEGEHRKEIEDLLFDES 711
Query: 294 EEEEEIG 300
EE+ +G
Sbjct: 712 EEDNIVG 718
>gnl|CDD|219905 pfam08564, CDC37_C, Cdc37 C terminal domain. Cdc37 is a protein
required for the activity of numerous eukaryotic protein
kinases. This domains corresponds to the C terminal
domain whose function is unclear. It is found C terminal
to the Hsp90 chaperone (Heat shocked protein 90) binding
domain pfam08565 and the N terminal kinase binding
domain of Cdc37 pfam03234.
Length = 89
Score = 26.5 bits (59), Expect = 6.8
Identities = 14/32 (43%), Positives = 16/32 (50%)
Query: 270 EKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
K G +E EEE EEE E+EE EE
Sbjct: 58 NAKIEGEKEFKELEEEYNEEEAEKEEIEEEDE 89
>gnl|CDD|217956 pfam04194, PDCD2_C, Programmed cell death protein 2, C-terminal
putative domain.
Length = 165
Score = 27.4 bits (61), Expect = 7.0
Identities = 15/53 (28%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 247 VHEEEEEEEE---EEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
V EEEE E + +E ++ ++L GG +E+EE +E E+ + +
Sbjct: 6 VFPEEEEYETLHAPSLPEEPEEREIDAFEELEKEGGGGKEDEETDESEKAKSD 58
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 27.9 bits (62), Expect = 7.0
Identities = 13/49 (26%), Positives = 24/49 (48%), Gaps = 7/49 (14%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E+ EE EEEE + + + ++ E+++E+E + E EI
Sbjct: 85 EKLEELEEEEREWYPSLNQMLEEN-------REQQKEKEARRQAREAEI 126
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 28.0 bits (63), Expect = 7.2
Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 5/60 (8%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE-----EEEEEEEEE 298
E E E+++EEE+ +A+E +E K E E E+ E E + +E+EE
Sbjct: 217 EAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQEQEE 276
Score = 27.6 bits (62), Expect = 8.1
Identities = 15/52 (28%), Positives = 24/52 (46%), Gaps = 10/52 (19%)
Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
E+ + E + E E E + L+EK+K EEE+ E +E +E
Sbjct: 202 KEKAIEAERAKAEAAEAEQEL----LREKQK------EEEQMMEAQERSYQE 243
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 28.1 bits (63), Expect = 7.2
Identities = 11/50 (22%), Positives = 28/50 (56%), Gaps = 6/50 (12%)
Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
E + + + K+K + EK E +E+++E+++EE+++++I
Sbjct: 302 LFEMISDLKRKLKSKFERDNEKLDA------EVKEKKKEKKKEEKKKKQI 345
>gnl|CDD|147845 pfam05914, RIB43A, RIB43A. This family consists of several
RIB43A-like eukaryotic proteins. Ciliary and flagellar
microtubules contain a specialised set of
protofilaments, termed ribbons, that are composed of
tubulin and several associated proteins. RIB43A was
first characterized in the unicellular biflagellate,
Chlamydomonas reinhardtii although highly related
sequences are present in several higher eukaryotes
including humans. The function of this protein is
unknown although the structure of RIB43A and its
association with the specialised protofilament ribbons
and with basal bodies is relevant to the proposed role
of ribbons in forming and stabilising doublet and
triplet microtubules and in organising their
three-dimensional structure. Human RIB43A homologues
could represent a structural requirement in centriole
replication in dividing cells.
Length = 379
Score = 28.1 bits (63), Expect = 7.2
Identities = 12/60 (20%), Positives = 26/60 (43%), Gaps = 6/60 (10%)
Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
KG E+++ E+E EEE+ K + + + + E +E +E +++
Sbjct: 287 KGQEQQLQEKERRREEEQLREKEWDRQAINQARAA------VLLERQERRLRKELRKQLD 340
>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana and other plants. This region is
often found associated with Agenet domains and may
contain coiled-coil.
Length = 190
Score = 27.4 bits (61), Expect = 7.3
Identities = 17/62 (27%), Positives = 32/62 (51%)
Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
+ KG E+E+ E+E +E + E E K+ E ++ E++E E++E + E
Sbjct: 110 EERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEIARLKSEAS 169
Query: 298 EI 299
+I
Sbjct: 170 KI 171
>gnl|CDD|240533 cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic
compound extrusion (MATE)-like proteins. Escherichia
coli Wzx and related proteins from other gram-negative
bacteria are thought to act as flippases, assisting in
the membrane translocation of lipopolysaccharides
including those containing O-antigens. Proteins from the
MATE family are involved in exporting metabolites across
the cell membrane and are often responsible for
multidrug resistance (MDR).
Length = 402
Score = 27.9 bits (63), Expect = 7.5
Identities = 16/87 (18%), Positives = 34/87 (39%), Gaps = 8/87 (9%)
Query: 35 IPAICRNFSHEYINDNDVDAEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMR 94
P I + ++ + ++ + + I +PI G+ F ++ ++ F ++
Sbjct: 267 FPRISELYKNDKDELKKLLKKSFKYLFLISLPIAIGLF----LFAEPIITLL--FGEEYL 320
Query: 95 STTNLLIINLAIADLLFIVCCVPFTAT 121
+L I LA LFI F +
Sbjct: 321 PAALVLQI-LAWG-FLFIFLNGIFGSI 345
>gnl|CDD|218316 pfam04893, Yip1, Yip1 domain. The Yip1 integral membrane domain
contains four transmembrane alpha helices. The domain is
characterized by the motifs DLYGP and GY. The Yip1
protein is a golgi protein involved in vesicular
transport that interacts with GTPases.
Length = 171
Score = 27.4 bits (61), Expect = 7.6
Identities = 21/144 (14%), Positives = 56/144 (38%), Gaps = 21/144 (14%)
Query: 50 NDVDAEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADL 109
+ + + ++ I + ++LG F AL++ ++A + + L++
Sbjct: 49 SGETIQGLTGGSALGSIIGGYLGLLLGLFLLALLLHLIA--KLFGGDGSFKQT-LSLVGY 105
Query: 110 LFIVCCVPFTATDYVTTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASM 169
+ + ++ + L++V A +++L L+ L L H +
Sbjct: 106 ALLPLILGGLIALLLS-------LLLGALLLLVGLLALIWSLYLLYL--GLKAAHGL--- 153
Query: 170 SVRTERNALKAILITWIVIVLTAV 193
+ KA+L+ ++++L V
Sbjct: 154 ------SKKKALLVALLLLLLLIV 171
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair
of CXXC motifs, followed by a long region of low
complexity sequence in which roughly half the residues
are Asp and Glu, including multiple runs of five or more
acidic residues. The function of members of this family
is unknown.
Length = 104
Score = 26.5 bits (59), Expect = 7.6
Identities = 6/46 (13%), Positives = 25/46 (54%)
Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
EE E +++A ++ K +E++ +++++++++++
Sbjct: 34 EEVPPEVAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDD 79
>gnl|CDD|236476 PRK09354, recA, recombinase A; Provisional.
Length = 349
Score = 27.8 bits (63), Expect = 7.6
Identities = 20/39 (51%), Positives = 24/39 (61%)
Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
+E E E K++EK L EEEEEE+EEEEEEE
Sbjct: 311 KENPELADEIEKKIREKLGLSAAAAEEEEEEDEEEEEEE 349
>gnl|CDD|221739 pfam12730, ABC2_membrane_4, ABC-2 family transporter protein. This
family is related to the ABC-2 membrane transporter
family pfam01061.
Length = 230
Score = 27.7 bits (62), Expect = 7.6
Identities = 19/145 (13%), Positives = 57/145 (39%), Gaps = 15/145 (10%)
Query: 61 VSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTA 120
+ +I+P+ ++ +L F N + ++ ++ LL + +
Sbjct: 15 ILLILPLLLALLALLLFGVNGD----TGLSPGSLLLSSFSFLSALFLPLLIAILASLLVS 70
Query: 121 TDY--------VTTCWPFGDVW-CKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSV 171
++ ++ G ++ K L+V+ + + +L+L+ L ++ + S+
Sbjct: 71 REFKNGTIKLLLSLPISRGKIFLAK--LLVLLILSLLASLLLLLLSLIAGLLLGGSGFSL 128
Query: 172 RTERNALKAILITWIVIVLTAVPVF 196
AL +L+ ++ +L + +
Sbjct: 129 SLLLGALLLLLLLSLLALLLILLIL 153
>gnl|CDD|236332 PRK08668, PRK08668, NADH dehydrogenase subunit M; Validated.
Length = 610
Score = 28.0 bits (63), Expect = 7.8
Identities = 13/52 (25%), Positives = 19/52 (36%), Gaps = 14/52 (26%)
Query: 63 IIVPIFFGIIIILGFFG--NALVVVVVAFNQQMRSTTNLLIINLAIADLLFI 112
I PI + I FG NAL V L ++ IA +++
Sbjct: 481 GIGPIDYEGWEIKTGFGSWNALTV------------FILFVVGFIIAAIIYF 520
>gnl|CDD|234456 TIGR04074, bacter_Hen1, 3' terminal RNA ribose
2'-O-methyltransferase Hen1. Members of this protein
family are bacterial Hen1, a 3' terminal RNA ribose
2'-O-methyltransferase that acts in bacterial RNA
repair. All members of the seed alignment belong to a
cassette with the RNA repair enzyme polynucleotide
kinase-phosphatase (Pnkp). Chemically similar Hen1 in
eukaryotes acts instead on small regulatory RNAs
[Transcription, RNA processing, Protein synthesis, tRNA
and rRNA base modification].
Length = 462
Score = 28.0 bits (63), Expect = 7.9
Identities = 16/54 (29%), Positives = 20/54 (37%), Gaps = 15/54 (27%)
Query: 274 LGGGGGEEEEEEEEEEEEEEEEEEEIG------GVVVSAV---------DFGSG 312
L E EE E EE +EE E+ VV+A+ D G G
Sbjct: 239 LAEADEAEPEEAETEEAQEEAAEKPPSLNRQRLEAVVAALRESGARSVLDLGCG 292
>gnl|CDD|223751 COG0679, COG0679, Predicted permeases [General function prediction
only].
Length = 311
Score = 27.6 bits (62), Expect = 8.0
Identities = 9/41 (21%), Positives = 14/41 (34%), Gaps = 4/41 (9%)
Query: 67 IFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIA 107
F G+ + L FG + V F L+ L +
Sbjct: 109 GFLGLPVALSLFGEKGLAYAVIFLI----IGLFLMFTLGVI 145
>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
Length = 470
Score = 27.9 bits (63), Expect = 8.1
Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 7/57 (12%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
V EE E E +E +E + + K+ L E E +EE+ ++ + G+
Sbjct: 385 YVPVEELLEIEGFDEETVEELRERAKEAL-------ETEALAQEEKLADDLLSLEGL 434
Score = 27.9 bits (63), Expect = 8.5
Identities = 12/59 (20%), Positives = 23/59 (38%), Gaps = 4/59 (6%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE----EEEEEEEEEEE 298
E +EE+ ++ + + KL G + E+ E E + E +EE+
Sbjct: 412 ALETEALAQEEKLADDLLSLEGLDRELAFKLAEKGIKTLEDLAEQAVDELIDIEGDEEK 470
>gnl|CDD|191647 pfam06947, DUF1290, Protein of unknown function (DUF1290). This
family consists of several bacterial small basic
proteins of around 100 residues in length. The function
of this family is unknown.
Length = 88
Score = 26.4 bits (59), Expect = 8.1
Identities = 7/18 (38%), Positives = 12/18 (66%)
Query: 72 IIILGFFGNALVVVVVAF 89
+ I GFFGN L+ ++ +
Sbjct: 37 VFISGFFGNVLLAALLVY 54
>gnl|CDD|145556 pfam02478, Pneumo_phosprot, Pneumovirus phosphoprotein. This
family represents the phosphoprotein of Paramyxoviridae,
a putative RNA polymerase alpha subunit that may
function in template binding.
Length = 266
Score = 27.5 bits (61), Expect = 8.1
Identities = 20/57 (35%), Positives = 28/57 (49%), Gaps = 3/57 (5%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEK-KKLGG--GGGEEEEEEEEEEEEEEEEEEEI 299
+ E EEE + + KL EK K+L E EEEEE+E+ EE++I
Sbjct: 210 KAAEMLREEESQRAKIGDGSVKLTEKAKELNKILEDESSSGESEEEEEDEDLEEDDI 266
>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118). This is
the highly conserved family of the major tail subunit
protein.
Length = 121
Score = 27.0 bits (59), Expect = 8.1
Identities = 12/51 (23%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
+EE ++ E E E + KE+K +E+ K E++ +++ + ++++ I
Sbjct: 49 KEERDKMETEREKRDKESK-EERDKFISTMNEQQRLMDKQNDMMGQQQQSI 98
>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
Length = 499
Score = 28.1 bits (62), Expect = 8.2
Identities = 16/32 (50%), Positives = 19/32 (59%), Gaps = 6/32 (18%)
Query: 274 LGGGGGEEEEEEEEE------EEEEEEEEEEI 299
L GEE EEE+ + E EEE+EEEEI
Sbjct: 78 LFAEAGEEAEEEDNDRECPDTEAEEEDEEEEI 109
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 28.1 bits (62), Expect = 8.4
Identities = 10/50 (20%), Positives = 19/50 (38%)
Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
+ E+ E + KE L+ EE + E E E + + ++
Sbjct: 135 QPEKNESGNNDHATDKENLLESDASSSNDSESEESDSESEIESSDSDHDD 184
>gnl|CDD|227499 COG5171, YRB1, Ran GTPase-activating protein (Ran-binding protein)
[Intracellular trafficking and secretion].
Length = 211
Score = 27.3 bits (60), Expect = 8.4
Identities = 11/54 (20%), Positives = 19/54 (35%), Gaps = 10/54 (18%)
Query: 256 EEEEETKAKETKLKEKKKLGGGGGEEEE----------EEEEEEEEEEEEEEEI 299
EE K +++ E G G+ E E + + EE+E +
Sbjct: 38 AGGEEKKVQQSPFLENAVPEGDEGKGPESPNIHFEPVVELQRVHLKTNEEDETV 91
>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
production and conversion].
Length = 108
Score = 26.6 bits (59), Expect = 8.5
Identities = 19/54 (35%), Positives = 26/54 (48%), Gaps = 9/54 (16%)
Query: 249 EEEEEEEEEEEETKAKETK---LKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
EE E EE E +A++ L+E ++ E EEE EE E E+E I
Sbjct: 38 REEAREIIEEAEEEAEKLAQEILEEARE------EAEEEAEEILAEAEKEASAI 85
Score = 26.6 bits (59), Expect = 8.6
Identities = 20/55 (36%), Positives = 26/55 (47%), Gaps = 6/55 (10%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
E+ + E EE E EE + + KL ++ EE EE EEE EE E E
Sbjct: 31 EQIIKEAREEAREIIEEAEEEAEKLAQEIL------EEAREEAEEEAEEILAEAE 79
>gnl|CDD|239657 cd03685, ClC_6_like, ClC-6-like chloride channel proteins. This CD
includes ClC-6, ClC-7 and ClC-B, C, D in plants.
Proteins in this family are ubiquitous in eukarotes and
their functions are unclear. They are expressed in
intracellular organelles membranes. This family belongs
to the ClC superfamily of chloride ion channels, which
share the unique double-barreled architecture and
voltage-dependent gating mechanism. The gating is
conferred by the permeating anion itself, acting as the
gating charge. ClC chloride ion channel superfamily
perform a variety of functions including cellular
excitability regulation, cell volume regulation,
membrane potential stabilization, acidification of
intracellular organelles, signal transduction, and
transepithelial transport in animals.
Length = 466
Score = 28.0 bits (63), Expect = 8.6
Identities = 22/102 (21%), Positives = 36/102 (35%), Gaps = 8/102 (7%)
Query: 38 ICRNFSHEYINDNDVDAEAME-RMVSIIVPIFFGIII-ILGFFGNALVVVVVAFNQQMRS 95
C F + D + +I + G+I +LG N L V F +++
Sbjct: 251 KCGLFGPGGLIMFDGSSTKYLYTYFELIPFMLIGVIGGLLGALFNHLNHKVTRFRKRINH 310
Query: 96 TTNLL--IINLAIADLLFIVCCVP----FTATDYVTTCWPFG 131
LL + L ++ + +V F Y CW FG
Sbjct: 311 KGKLLKVLEALLVSLVTSVVAFPQTLLIFFVLYYFLACWTFG 352
>gnl|CDD|148072 pfam06244, DUF1014, Protein of unknown function (DUF1014). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 122
Score = 26.6 bits (59), Expect = 8.6
Identities = 12/45 (26%), Positives = 16/45 (35%)
Query: 254 EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
EE E+E + E ++ KK EE EE E
Sbjct: 6 EERLEKEQRKLEAPEEKSKKNAAREVPLEENLNVNRNRLEEGTVE 50
>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
outer membrane].
Length = 170
Score = 27.0 bits (60), Expect = 9.2
Identities = 12/46 (26%), Positives = 25/46 (54%)
Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
++E ++ ++E KAKE KL++ K+ + E E ++E+
Sbjct: 57 QKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVNAF 102
>gnl|CDD|217401 pfam03169, OPT, OPT oligopeptide transporter protein. The OPT
family of oligopeptide transporters is distinct from the
ABC pfam00005 and PTR pfam00854 transporter families.
OPT transporters were first recognised in fungi (Candida
albicans and Schizosaccharomyces pombe), but this
alignment also includes orthologues from Arabidopsis
thaliana. OPT transporters are thought to have 12-14
transmembrane domains and contain the following motif:
SPYxEVRxxVxxxDDP.
Length = 619
Score = 27.7 bits (62), Expect = 9.6
Identities = 9/51 (17%), Positives = 22/51 (43%), Gaps = 2/51 (3%)
Query: 65 VPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCC 115
VP+++ + ++ L + +VA + L++ L +A + I
Sbjct: 349 VPMWWYLAGLVLSLV--LGIALVAALFPTQLPVWGLLLALLLAFVFAIPSA 397
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 27.7 bits (62), Expect = 9.7
Identities = 13/62 (20%), Positives = 21/62 (33%)
Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVS 305
V E ++E + EE + E + E + + E E E E E+ V
Sbjct: 948 PVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEPEVAPAQVP 1007
Query: 306 AV 307
Sbjct: 1008 EA 1009
>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal. The family represents
the N-terminal region of RXT2-like proteins. In S.
cerevisiae, RXT2 has been demonstrated to be involved in
conjugation with cellular fusion (mating) and invasive
growth. A high throughput localisation study has
localised RXT2 to the nucleus.
Length = 141
Score = 27.0 bits (60), Expect = 9.8
Identities = 9/35 (25%), Positives = 22/35 (62%)
Query: 264 KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
K K ++ GG+ ++++E++E++EE + E+
Sbjct: 37 KRIKFNNPPRIDEDGGDIDDDDEDDEDDEEADAED 71
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 27.7 bits (62), Expect = 9.8
Identities = 11/31 (35%), Positives = 18/31 (58%)
Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKL 274
E+E E +E++ + K +E K KE +KL
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEKL 586
>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 486
Score = 27.5 bits (61), Expect = 9.9
Identities = 16/48 (33%), Positives = 25/48 (52%), Gaps = 4/48 (8%)
Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
E+ EE+T E K +K E EE ++E+E+E+E E + V
Sbjct: 424 EKSEEDTLDLEIKSIKKYF----PFIEFEEVKKEKEKEKERPEFVEKV 467
>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
(DUF2098). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 91
Score = 26.1 bits (58), Expect = 9.9
Identities = 18/47 (38%), Positives = 26/47 (55%), Gaps = 6/47 (12%)
Query: 267 KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSGG 313
K+KEKK+ E EE++EE E ++EEE V + + G GG
Sbjct: 51 KVKEKKE------EREEDKEELIERIKKEEETFEDVDLGSAGCGGGG 91
>gnl|CDD|223724 COG0651, HyfB, Formate hydrogenlyase subunit 3/Multisubunit Na+/H+
antiporter, MnhD subunit [Energy production and
conversion / Inorganic ion transport and metabolism].
Length = 504
Score = 27.7 bits (62), Expect = 9.9
Identities = 22/148 (14%), Positives = 47/148 (31%), Gaps = 22/148 (14%)
Query: 56 AMERMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCC 115
++ I+ ++L F G + + L + L V
Sbjct: 1 MNNLLLLPIILPLLAAALLLLFLGLRRLKAKRFL-----ALLGALGLLLLSILGFLEV-- 53
Query: 116 VPFTATDYVTTCW--PFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRT 173
W PFG ++ ++++ L++ + LA ++ I M
Sbjct: 54 ---GGPVGGLGGWLAPFG------IVLAADPLSAIFLLIIALV-GLLASLYSIGYMRHEK 103
Query: 174 ERN---ALKAILITWIVIVLTAVPVFNM 198
ER L +L+ ++ +FN+
Sbjct: 104 ERAYYYFLLLLLLAGMLGAFLTGDLFNL 131
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.136 0.404
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,111,907
Number of extensions: 1766906
Number of successful extensions: 18951
Number of sequences better than 10.0: 1
Number of HSP's gapped: 12019
Number of HSP's successfully gapped: 1615
Length of query: 318
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 221
Effective length of database: 6,635,264
Effective search space: 1466393344
Effective search space used: 1466393344
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.3 bits)