RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10650
(446 letters)
>gnl|CDD|217215 pfam02752, Arrestin_C, Arrestin (or S-antigen), C-terminal domain.
Ig-like beta-sandwich fold. Scop reports duplication
with N-terminal domain.
Length = 136
Score = 108 bits (271), Expect = 3e-28
Identities = 47/136 (34%), Positives = 71/136 (52%), Gaps = 4/136 (2%)
Query: 279 SSGPVLCRVSLDRGGYVPGETIIINATVYNRSKITVKSTKAALTETIQYLARNKVLES-- 336
SGPV VSL + GYVPGETI I + N+S +K K +L + + Y A+ + +
Sbjct: 1 WSGPVSLSVSLPKKGYVPGETIPITVEIDNQSSKKIKKIKISLVQQVTYKAKTPLRRTKN 60
Query: 337 ETRELASLTRGKIKPGERDDWT-NQQLYVPP-LPPTNLRGCHLIKIQYDVFFIVDPKSIE 394
E R +A G ++PG D + L +P LPP++ C +IK+ Y + VD
Sbjct: 61 EERVVAKEESGGVEPGSSDKFEKELSLQIPTSLPPSSTTKCKIIKVSYKLKVTVDLSGSH 120
Query: 395 KPVKLQLPIMLATYPF 410
++L+LPI + T P
Sbjct: 121 SELRLELPITIGTSPL 136
>gnl|CDD|214976 smart01017, Arrestin_C, Arrestin (or S-antigen), C-terminal domain.
Ig-like beta-sandwich fold. Scop reports duplication
with N-terminal domain. Arrestins comprise a family of
closely-related proteins that includes beta-arrestin-1
and -2, which regulate the function of beta-adrenergic
receptors by binding to their phosphorylated forms,
impairing their capacity to activate G(S) proteins; Cone
photoreceptors C-arrestin (arrestin-X). which could bind
to phosphorylated red/green opsins; and Drosophila
phosrestins I and II, which undergo light-induced
phosphorylation, and probably play a role in
photoreceptor transduction.
Length = 142
Score = 100 bits (252), Expect = 2e-25
Identities = 52/144 (36%), Positives = 74/144 (51%), Gaps = 14/144 (9%)
Query: 279 SSGPVLCRVSLDRGGYVPGETIIINATVYNRSKITVKSTKAALTETIQY-LARNKVLESE 337
SGP+ VSL + GYVPGETI + + N SK TVK K +L +T+ Y + V S
Sbjct: 1 WSGPLSLEVSLPKKGYVPGETIPVTIKITNLSKKTVKKIKVSLVQTVTYVSSDGPVKRSL 60
Query: 338 TRE-----------LASLTRGKIKPGERDDWTNQQLYVPPLPPTNLRGCHLIKIQYDVFF 386
+ + L G + PG +D + QL VPPLPPT+ R C LIK++Y +
Sbjct: 61 AEKSKEKKADRKTLVKELDGGPVLPGNKDKFE-GQLKVPPLPPTS-RTCRLIKVEYKLKV 118
Query: 387 IVDPKSIEKPVKLQLPIMLATYPF 410
+ ++L+LPI + T P
Sbjct: 119 KLRLSGKHSELRLELPITIGTVPL 142
>gnl|CDD|215866 pfam00339, Arrestin_N, Arrestin (or S-antigen), N-terminal domain.
Ig-like beta-sandwich fold. Scop reports duplication
with C-terminal domain.
Length = 148
Score = 63.1 bits (154), Expect = 5e-12
Identities = 23/60 (38%), Positives = 33/60 (55%)
Query: 117 LSPGIHSFPFKLGLPLGLPSTFLGKHGWVQYFCKAALREPTGYTHKNQQVFIIMSPIDLN 176
L G H+FPF LP P +F G+ G ++Y K L P HK ++VF ++ +DLN
Sbjct: 89 LPAGTHAFPFSFELPPNCPPSFEGQPGGIRYEVKVELDRPWKKDHKRKKVFTVIRKLDLN 148
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 46.6 bits (111), Expect = 4e-06
Identities = 20/54 (37%), Positives = 31/54 (57%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
EE E +KE + K ++ + +KKK K K KK KK E++ E+E ++KL
Sbjct: 69 EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122
Score = 35.4 bits (82), Expect = 0.024
Identities = 19/47 (40%), Positives = 30/47 (63%), Gaps = 10/47 (21%)
Query: 23 KEEETE----------EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K++E EEK+K K KKKK KKKK ++++K+++KD+K
Sbjct: 63 KKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109
Score = 31.6 bits (72), Expect = 0.47
Identities = 15/33 (45%), Positives = 23/33 (69%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E TE +KKKK+ ++ +K KK YEE++K + K
Sbjct: 55 AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWK 87
Score = 31.6 bits (72), Expect = 0.55
Identities = 11/64 (17%), Positives = 33/64 (51%), Gaps = 6/64 (9%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
+K ++K+D++K+ K+++ +++ +KK +K+ + K + + + ++L
Sbjct: 87 KKKKSKKKKDKDKD------KKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELK 140
Query: 62 HNCY 65
Y
Sbjct: 141 PRKY 144
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 45.9 bits (109), Expect = 3e-05
Identities = 22/54 (40%), Positives = 34/54 (62%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E+ EEKE++++E + +EE EEE+K++KKKK KK K+ E E + K
Sbjct: 38 EEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTK 91
Score = 44.8 bits (106), Expect = 7e-05
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEE 50
E+ E++E +E+E + +E + EEEK++KKKK KK K+ E
Sbjct: 37 DEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86
Score = 43.2 bits (102), Expect = 2e-04
Identities = 20/53 (37%), Positives = 33/53 (62%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+E+E++EKE K + EEE ++++K++KKKK K+ +E E E NK
Sbjct: 37 DEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNK 89
Score = 34.7 bits (80), Expect = 0.091
Identities = 15/35 (42%), Positives = 25/35 (71%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
KE EEE+++K++KK++++K EEE EEE+
Sbjct: 33 KEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEK 67
Score = 34.4 bits (79), Expect = 0.13
Identities = 15/40 (37%), Positives = 30/40 (75%), Gaps = 3/40 (7%)
Query: 23 KEEETE---EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
KE E E EE++++K++KK++++K +EEE +EE++ +
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKE 68
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 45.1 bits (107), Expect = 5e-05
Identities = 19/59 (32%), Positives = 28/59 (47%), Gaps = 1/59 (1%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE-EEKDNKLLHN 63
GEE+ D E G E + + K+K K ++ K+K+R E E + EEK K
Sbjct: 249 GEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLA 307
Score = 28.5 bits (64), Expect = 6.8
Identities = 16/53 (30%), Positives = 28/53 (52%), Gaps = 6/53 (11%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEE------TEEEKKKKKKKKKKKKKKKRYEE 49
E+ EE+ D+ G ++ +E+ T +E KKK++KK KK + +E
Sbjct: 18 EQFLEEQRDERITGGVLAEKPDEDLFVVDTTGDEATIKKKRRKKLKKPLKSDE 70
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 44.1 bits (104), Expect = 1e-04
Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 4/56 (7%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+ G K K KE + E K+++K+K++ K++KK+ +E+ KEE KD K
Sbjct: 81 EKGGSKGPAAKT----KPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132
Score = 44.1 bits (104), Expect = 1e-04
Identities = 16/56 (28%), Positives = 36/56 (64%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
+ K E++++Q KE + K +E +++K K++ K+K+ K++ +E+EK+ E+
Sbjct: 102 SGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157
Score = 40.3 bits (94), Expect = 0.002
Identities = 19/55 (34%), Positives = 36/55 (65%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
NE G EE++++E+ + KE+ EE K +K K++ K+K+ + +E+EKE++
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKV 154
Score = 39.1 bits (91), Expect = 0.004
Identities = 17/55 (30%), Positives = 38/55 (69%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+ G+E+E ++++V + K+E+ +EE K +K K++ K+K+ E+E+++E+K
Sbjct: 99 KNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153
Score = 38.7 bits (90), Expect = 0.005
Identities = 14/55 (25%), Positives = 31/55 (56%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
K +EK +E + +E+ ++K+K+K+KK ++ + R EE+++E +
Sbjct: 118 KKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAK 172
Score = 38.3 bits (89), Expect = 0.006
Identities = 20/58 (34%), Positives = 33/58 (56%), Gaps = 3/58 (5%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+ K +E + GKEEE E+E+ K++KKKKK+K K+ ++ + +EE K
Sbjct: 86 KGPAAKTKPAKEPKNES---GKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140
Score = 37.6 bits (87), Expect = 0.014
Identities = 18/55 (32%), Positives = 27/55 (49%), Gaps = 3/55 (5%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EK EE D+E+E R + KK KKK KKK+ EE++++ +
Sbjct: 150 KEKKVEEPRDREEEK---KRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAR 201
Score = 37.2 bits (86), Expect = 0.016
Identities = 16/64 (25%), Positives = 36/64 (56%), Gaps = 7/64 (10%)
Query: 3 EKGGEEKEDQEKEVGGIS-------RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
E+ +E+ +EK+ R +EE +E++ K+K+K+K+KK + + E+E++
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKK 165
Query: 56 KDNK 59
++
Sbjct: 166 RERV 169
Score = 36.8 bits (85), Expect = 0.023
Identities = 18/58 (31%), Positives = 30/58 (51%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K KE + + + KE+ EE+KKKK+K K++ K +K EE +++ K
Sbjct: 89 AAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEK 146
Score = 36.4 bits (84), Expect = 0.025
Identities = 16/53 (30%), Positives = 34/53 (64%), Gaps = 1/53 (1%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E+++++EK+V R +EEE + E+ + K + KK KKK ++++ E++ +
Sbjct: 145 EKEKEKEKKVEE-PRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQ 196
Score = 36.0 bits (83), Expect = 0.038
Identities = 11/50 (22%), Positives = 27/50 (54%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
+ + + ++ KE EE++++ ++ K K ++ EE+E+E+D
Sbjct: 173 SRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEED 222
Score = 35.6 bits (82), Expect = 0.044
Identities = 18/58 (31%), Positives = 36/58 (62%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E E +++EKE + K+++ E+ K++ K +K K++ K++ +EKE+EK+ K
Sbjct: 96 KEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153
Score = 35.2 bits (81), Expect = 0.066
Identities = 19/56 (33%), Positives = 36/56 (64%), Gaps = 1/56 (1%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK-RYEEEEKEEEK 56
++ ++K+++ KE + KEE E+ K+K+K+K+KK ++ R EEEK+ E+
Sbjct: 113 VKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRER 168
Score = 34.5 bits (79), Expect = 0.13
Identities = 21/56 (37%), Positives = 33/56 (58%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
KG K KE S +EE+ +E+ K++KKKKK+K K++ + + KEE K+
Sbjct: 84 GSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKE 139
Score = 34.1 bits (78), Expect = 0.15
Identities = 16/57 (28%), Positives = 28/57 (49%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
++ +EKE + +E K+ E K + KK KKK K+ E E+E+++
Sbjct: 144 KEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAA 200
Score = 33.3 bits (76), Expect = 0.27
Identities = 13/67 (19%), Positives = 30/67 (44%), Gaps = 10/67 (14%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKK----------KKKRYEEEEK 52
+K E+ +++ K+ KE+ +EK+K+K+KK ++ + + +K
Sbjct: 118 KKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKK 177
Query: 53 EEEKDNK 59
+K
Sbjct: 178 PPKKKPP 184
Score = 32.2 bits (73), Expect = 0.56
Identities = 14/53 (26%), Positives = 27/53 (50%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
K E K ++ V + +G + + K K+ K + K++ E+E+ +EEK
Sbjct: 65 KCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEK 117
Score = 28.7 bits (64), Expect = 8.1
Identities = 14/55 (25%), Positives = 32/55 (58%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
E+ E++ +EKE + +E EE+KK+++ + K + KK +++ ++K+
Sbjct: 135 EEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKE 189
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 43.8 bits (104), Expect = 1e-04
Identities = 22/53 (41%), Positives = 40/53 (75%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E+E +EK+ + K+EE EEE+K+KK+++K++++++ EE+E+EEEK K
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 43.0 bits (102), Expect = 2e-04
Identities = 18/53 (33%), Positives = 41/53 (77%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+ +E+++++ GK++E EEE++K+KK+++K+++++ EEE++EEE+ K
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKK 474
Score = 42.2 bits (100), Expect = 4e-04
Identities = 19/37 (51%), Positives = 26/37 (70%)
Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
K E+ EE+KK+KKKK KKK EEEE++E+K+
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452
Score = 41.8 bits (99), Expect = 6e-04
Identities = 22/47 (46%), Positives = 32/47 (68%)
Query: 13 EKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K++ I E++ EEEKK+KKKK KKK+ EEEEKE++++ K
Sbjct: 409 TKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Score = 39.9 bits (94), Expect = 0.002
Identities = 20/58 (34%), Positives = 37/58 (63%), Gaps = 4/58 (6%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
E+ +EK+ + KEEE EEEK+KK+++K++++++ E+EE+EE+K
Sbjct: 422 KREEEKKEKKKKAFAGKK----KEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 39.5 bits (93), Expect = 0.003
Identities = 16/37 (43%), Positives = 27/37 (72%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K+ E E+++KKKK KKK+++ EE+EK+EE+ +
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457
Score = 39.1 bits (92), Expect = 0.004
Identities = 15/57 (26%), Positives = 40/57 (70%), Gaps = 3/57 (5%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EK +++E+++KE + ++E++++++K+KK+++K+ EEE +EE+++ +
Sbjct: 417 EKAEKKREEEKKE---KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEE 470
Score = 39.1 bits (92), Expect = 0.004
Identities = 20/55 (36%), Positives = 36/55 (65%)
Query: 5 GGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
G ++ + K++ + K EE ++EKKKK KKK++++ E+E+KEEEK+ +
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458
Score = 38.0 bits (89), Expect = 0.010
Identities = 17/57 (29%), Positives = 37/57 (64%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K + E EK+ + K+++ KKK+++++++K+KK+ +EEE+EE ++ K
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466
Score = 36.8 bits (86), Expect = 0.023
Identities = 14/53 (26%), Positives = 37/53 (69%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EK ++++E + K+ ++K+++++++K+KK++++ EEEE+ EE+ +
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468
Score = 36.4 bits (85), Expect = 0.030
Identities = 19/59 (32%), Positives = 43/59 (72%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
+K EEK++++K+ + +EEE EE++KK+++K++++++ + +EEE+E++K L
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQATL 479
Score = 34.5 bits (80), Expect = 0.10
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
T +K KK KK +K +K+ EEE+KE++K
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKK 432
Score = 33.7 bits (78), Expect = 0.19
Identities = 11/50 (22%), Positives = 32/50 (64%)
Query: 10 EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E++ + + G + ++ + +K +KK++++KK+KK+ K++E++ +
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEE 445
Score = 29.1 bits (66), Expect = 5.5
Identities = 13/48 (27%), Positives = 21/48 (43%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+ + ++ E EE + KK KK KK E+ EK+ E+
Sbjct: 378 FKHNPELAARLAAFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREE 425
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 40.4 bits (95), Expect = 3e-04
Identities = 23/61 (37%), Positives = 41/61 (67%), Gaps = 5/61 (8%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEK--KKKKKKKKKKKKKKRYEEEEKEEEKDN 58
M+EK +E ED+E + R +++ +EEK KK+ K++KKK+KKK+ ++ +K +K+
Sbjct: 55 MDEKWKKETEDEEFQ---QKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEE 111
Query: 59 K 59
K
Sbjct: 112 K 112
Score = 30.4 bits (69), Expect = 0.84
Identities = 14/42 (33%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 20 SRGKEEE--TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
R +E E ++K KK+ + ++ ++KR E++ K+EEK K
Sbjct: 44 LRRREYERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAK 85
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 41.2 bits (97), Expect = 3e-04
Identities = 21/57 (36%), Positives = 31/57 (54%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E EKE K +EE+ E++KKK+ KK+KK+KK K+ + E + K K
Sbjct: 131 ESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKK 187
Score = 40.9 bits (96), Expect = 4e-04
Identities = 18/49 (36%), Positives = 33/49 (67%)
Query: 5 GGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
G E + +++ + + E E EE+K+KKKKK+ KK+KK++ +++EK
Sbjct: 129 GSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177
Score = 36.2 bits (84), Expect = 0.017
Identities = 14/57 (24%), Positives = 32/57 (56%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
E + + + K E+ E ++++KK+KKKKK+ K+ ++E+K++++
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPK 181
Score = 35.8 bits (83), Expect = 0.022
Identities = 17/55 (30%), Positives = 32/55 (58%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
G + E E + E E E ++++KK+KKKKK+ + E++EK+++K+
Sbjct: 121 PDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKE 175
Score = 35.8 bits (83), Expect = 0.024
Identities = 14/38 (36%), Positives = 23/38 (60%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
EK+ +++ K+++ + + K KKKKKKKKKK
Sbjct: 156 EKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 32.8 bits (75), Expect = 0.19
Identities = 16/55 (29%), Positives = 34/55 (61%)
Query: 5 GGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
G + E+G S E+ET + +K+ + ++++KK+K+ ++E K+E+K+ K
Sbjct: 117 GYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKK 171
Score = 32.4 bits (74), Expect = 0.33
Identities = 14/62 (22%), Positives = 30/62 (48%), Gaps = 3/62 (4%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKK---KKRYEEEEKEEEKDNKLLH 62
G E + + +E + ++++KK+KKKK KK +E++ ++EK +
Sbjct: 123 GPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKG 182
Query: 63 NC 64
+
Sbjct: 183 SK 184
Score = 31.2 bits (71), Expect = 0.70
Identities = 18/49 (36%), Positives = 32/49 (65%), Gaps = 8/49 (16%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKK---KKKKKKKRYEEEEK 52
EE+E +EK+ + KE + E+++KK KK+K K KKK+ ++++K
Sbjct: 150 EEEEKKEKK-----KKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation
factor 3 (eIF3). In yeast it is called Hcr1. The
Saccharomyces cerevisiae protein eIF3j (HCR1) has been
shown to be required for processing of 20S pre-rRNA and
binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 40.8 bits (96), Expect = 6e-04
Identities = 14/54 (25%), Positives = 29/54 (53%), Gaps = 6/54 (11%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
+E+ED+EKE ++ + + K KK K K ++K++ + E++E+
Sbjct: 38 DEEEDEEKE------EEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELE 85
Score = 40.0 bits (94), Expect = 0.001
Identities = 19/56 (33%), Positives = 30/56 (53%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
E+++D K+ +E+E E+ K K K KK K K E+E+ + EK+ K L
Sbjct: 26 DEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGL 81
Score = 35.8 bits (83), Expect = 0.031
Identities = 15/52 (28%), Positives = 27/52 (51%), Gaps = 7/52 (13%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
+E++++EK K + KK K K ++K+K KR +EE+ E +
Sbjct: 42 DEEKEEEKA-------KVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86
Score = 34.2 bits (79), Expect = 0.080
Identities = 16/58 (27%), Positives = 34/58 (58%), Gaps = 3/58 (5%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEE---KEEEKDNKLL 61
E++E +E++ ++ K ++ + K ++K+K K++K++K E E E+E KL
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLR 98
Score = 34.2 bits (79), Expect = 0.10
Identities = 14/54 (25%), Positives = 27/54 (50%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E+ EEK + K + E+EK K++K++K ++ + E++ EK
Sbjct: 43 EEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEK 96
Score = 31.6 bits (72), Expect = 0.62
Identities = 10/23 (43%), Positives = 12/23 (52%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKK 45
E+ E+ K KKKK K K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAK 214
Score = 31.2 bits (71), Expect = 0.98
Identities = 9/18 (50%), Positives = 10/18 (55%)
Query: 29 EEKKKKKKKKKKKKKKKR 46
EK K KKKK K K +
Sbjct: 197 AEKAAKGGKKKKGKAKAK 214
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 41.1 bits (96), Expect = 9e-04
Identities = 25/73 (34%), Positives = 34/73 (46%), Gaps = 21/73 (28%)
Query: 6 GEEKEDQEKEVGGISRGK---------------------EEETEEEKKKKKKKKKKKKKK 44
G+E E+ E E GG RGK E+ +EEK KKKKKK K KK
Sbjct: 194 GDEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKK 253
Query: 45 KRYEEEEKEEEKD 57
K ++++ + D
Sbjct: 254 KLDDDKKGKRGGD 266
Score = 38.4 bits (89), Expect = 0.006
Identities = 15/56 (26%), Positives = 24/56 (42%), Gaps = 2/56 (3%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
K E+++ +E EEE KK KK KK K KK +++ + +
Sbjct: 310 AKPEIEQDEDSEESE--EEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSD 363
Score = 38.4 bits (89), Expect = 0.007
Identities = 19/55 (34%), Positives = 25/55 (45%), Gaps = 4/55 (7%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEK----KKKKKKKKKKKKKKRYEEEEKEEEKD 57
E E E S EEE EE+ KK KK KK K KK ++++ + D
Sbjct: 307 EIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDD 361
Score = 35.7 bits (82), Expect = 0.055
Identities = 13/52 (25%), Positives = 23/52 (44%), Gaps = 6/52 (11%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
EE E+++ E G +++ KK KK K KK K + + + +
Sbjct: 321 EESEEEKNEEEG------GLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSD 366
Score = 31.1 bits (70), Expect = 1.2
Identities = 15/62 (24%), Positives = 29/62 (46%), Gaps = 3/62 (4%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGK---EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
+E ++ +D+ +E IS + E E K + K + ++ + EE EEEK+
Sbjct: 269 ADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKN 328
Query: 58 NK 59
+
Sbjct: 329 EE 330
Score = 29.9 bits (67), Expect = 2.8
Identities = 17/62 (27%), Positives = 26/62 (41%), Gaps = 7/62 (11%)
Query: 2 NEKGGEEK----EDQEKEVGGISRGKEEETEEEKKKKKKK--KKKKKKKKRYEEEEKEEE 55
N+ E E K EE+EEEK +++ KK KK KK ++ ++
Sbjct: 295 NDPEEREDKLSPEIPAKPEIEQD-EDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDK 353
Query: 56 KD 57
D
Sbjct: 354 DD 355
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 41.1 bits (97), Expect = 0.001
Identities = 19/50 (38%), Positives = 27/50 (54%), Gaps = 7/50 (14%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
E E++E E G +S E+KK +KK++K +KK EE EK K
Sbjct: 395 EGEEEEGENGNLS-------PAERKKLRKKQRKAEKKAEKEEAEKAAAKK 437
Score = 33.8 bits (78), Expect = 0.19
Identities = 16/45 (35%), Positives = 22/45 (48%), Gaps = 5/45 (11%)
Query: 23 KEEETEEEKKKKKKKKKK--KKKKKRYEEEEKEEEKD---NKLLH 62
K+ E EE +K KKK + KK K + E K+ + D KL
Sbjct: 423 KKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGEKLAR 467
Score = 31.1 bits (71), Expect = 1.4
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
+ ++EE E+ KKK + KK K E +K +
Sbjct: 424 KAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDP 458
Score = 29.9 bits (68), Expect = 2.7
Identities = 14/40 (35%), Positives = 23/40 (57%), Gaps = 1/40 (2%)
Query: 22 GKEEETEEEKKKKKKKKK-KKKKKKRYEEEEKEEEKDNKL 60
G+EEE E ++KK +KK++K ++ EKEE +
Sbjct: 396 GEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAA 435
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 37.8 bits (88), Expect = 0.001
Identities = 17/29 (58%), Positives = 21/29 (72%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
KKKKKKKKKK K K+ E++EEEK +
Sbjct: 20 KKKKKKKKKKNKSKEEVVTEKEEEEKSSA 48
Score = 35.1 bits (81), Expect = 0.009
Identities = 17/27 (62%), Positives = 19/27 (70%)
Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+ KKKKKKKKKK K K EKEEE+
Sbjct: 18 DVKKKKKKKKKKNKSKEEVVTEKEEEE 44
Score = 33.2 bits (76), Expect = 0.041
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
KK KKKKKKKKK+ + +E+ + +
Sbjct: 14 GKKIDVKKKKKKKKKKNKSKEEVVTEKEE 42
Score = 32.8 bits (75), Expect = 0.057
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K KK KKKKKKKK+ + ++E + +
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKE 41
Score = 31.6 bits (72), Expect = 0.15
Identities = 14/29 (48%), Positives = 20/29 (68%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K KKKKKKKKKK + +EE E+++ +
Sbjct: 16 KIDVKKKKKKKKKKNKSKEEVVTEKEEEE 44
Score = 29.7 bits (67), Expect = 0.84
Identities = 15/39 (38%), Positives = 27/39 (69%)
Query: 22 GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
GK+ + +++KKKKKKK K K++ +EEE++ ++ L
Sbjct: 14 GKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDL 52
Score = 28.5 bits (64), Expect = 1.9
Identities = 12/30 (40%), Positives = 18/30 (60%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
K K KK KKKKKK+ ++ + +EE +
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEK 40
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 38.0 bits (89), Expect = 0.002
Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
E E+ KEV +E+ + EKK+++KK + +K K EE+ E E +
Sbjct: 1 EPEEEKTFTD-KEVDKAI--AKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELE 52
Score = 33.0 bits (76), Expect = 0.079
Identities = 12/34 (35%), Positives = 20/34 (58%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
K+EE + E +K K ++K + E+ EKE E+
Sbjct: 27 KQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60
Score = 33.0 bits (76), Expect = 0.097
Identities = 12/37 (32%), Positives = 22/37 (59%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
E++ EE+K + +K K ++K E EK E++ +L
Sbjct: 25 EKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEEL 61
Score = 33.0 bits (76), Expect = 0.097
Identities = 14/58 (24%), Positives = 27/58 (46%), Gaps = 2/58 (3%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EK EK+ +EK+ + + EEK + + +K +K+ ++ E + E K
Sbjct: 19 KEKAKWEKKQEEKKSEAEKL--AKMSAEEKAEYELEKLEKELEELEAELARRELKAEA 74
Score = 31.5 bits (72), Expect = 0.36
Identities = 14/39 (35%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 23 KEEETE-EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
+EE+T +++ K K+K K +K+ EE++ E EK K+
Sbjct: 3 EEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKM 41
Score = 28.4 bits (64), Expect = 3.6
Identities = 10/35 (28%), Positives = 20/35 (57%), Gaps = 2/35 (5%)
Query: 27 TEEEKKK--KKKKKKKKKKKKRYEEEEKEEEKDNK 59
T++E K K+K K +KK++ + E ++ K +
Sbjct: 9 TDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSA 43
Score = 28.0 bits (63), Expect = 4.4
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E EE+ K+ K K+K E+++EE+K
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEA 35
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 37.3 bits (87), Expect = 0.004
Identities = 12/32 (37%), Positives = 22/32 (68%)
Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
R + + EK+K+ +K ++KK K+R +E+EK
Sbjct: 99 RRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 31.9 bits (73), Expect = 0.24
Identities = 17/44 (38%), Positives = 24/44 (54%), Gaps = 5/44 (11%)
Query: 23 KEEETEEEKKKKKKKKKKKK---KKKRYEE--EEKEEEKDNKLL 61
KE E+EKKKK+K+ KK+ + R E E E+EK +
Sbjct: 73 KELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKN 116
Score = 31.1 bits (71), Expect = 0.47
Identities = 10/39 (25%), Positives = 20/39 (51%)
Query: 18 GISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+ + E E++KK+K+ KK++ + E KE +
Sbjct: 70 KVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAE 108
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this
family (Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 36.5 bits (85), Expect = 0.004
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
+E EEEK +++++ K+ K KR E EEK
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEEK 29
Score = 28.0 bits (63), Expect = 3.0
Identities = 7/23 (30%), Positives = 17/23 (73%), Gaps = 1/23 (4%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKK 45
KEEE + +++++ K+ K K+++
Sbjct: 4 KEEE-KAQREEELKRLKNLKREE 25
Score = 26.8 bits (60), Expect = 8.7
Identities = 10/38 (26%), Positives = 22/38 (57%), Gaps = 7/38 (18%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
E KE+++ + +EEE + K K+++ ++K +K
Sbjct: 2 ERKEEEKAQ-------REEELKRLKNLKREEIEEKLEK 32
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 38.1 bits (89), Expect = 0.004
Identities = 18/58 (31%), Positives = 36/58 (62%), Gaps = 2/58 (3%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
+ K + + +EK+ R EEE EE+KK ++K++ ++K++ EEE+E++K+
Sbjct: 3 IGAK--KRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEE 58
Score = 36.6 bits (85), Expect = 0.010
Identities = 16/56 (28%), Positives = 39/56 (69%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
++ EE+E +E++ R E + EEE +++++KKK+++++K EE+ ++E+++
Sbjct: 19 QQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEE 74
Score = 36.2 bits (84), Expect = 0.017
Identities = 19/50 (38%), Positives = 34/50 (68%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E K+ +EK G +E E E EKKK+++++K+++++ R E+EE E+ K
Sbjct: 30 ERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLK 79
Score = 32.7 bits (75), Expect = 0.24
Identities = 12/50 (24%), Positives = 31/50 (62%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
++ E++ + EEE E++K+++++K+++++ +K EE EK +
Sbjct: 32 KKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSS 81
Score = 27.7 bits (62), Expect = 8.5
Identities = 16/61 (26%), Positives = 34/61 (55%), Gaps = 4/61 (6%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
E +E+E+ E+E K+EE E ++++++ +K++++ +K EEE +KL
Sbjct: 37 KREGERKEEEELEEE----REKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGTDKL 92
Query: 61 L 61
Sbjct: 93 S 93
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 39.4 bits (91), Expect = 0.005
Identities = 19/60 (31%), Positives = 35/60 (58%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
+K E K+ + +E K+ E E + K ++ KK+ ++ KK+ EE +K+EE+ K+ H
Sbjct: 1702 KKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAH 1761
Score = 39.0 bits (90), Expect = 0.006
Identities = 18/57 (31%), Positives = 31/57 (54%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+ E D+ + + E++ EE KKK KKK ++KK+ +E +K+ E+D K
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405
Score = 37.4 bits (86), Expect = 0.016
Identities = 17/57 (29%), Positives = 31/57 (54%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K + K+ + +E K+ E E + K ++ KK ++ KK+ EE +K EE + K
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689
Score = 36.3 bits (83), Expect = 0.039
Identities = 11/50 (22%), Positives = 24/50 (48%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
++K D K+ ++ E + E + + + ++K E++KEE K
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK 1377
Score = 36.3 bits (83), Expect = 0.041
Identities = 17/56 (30%), Positives = 30/56 (53%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K EE + E++ K+ E +E+K + KK+ ++ KK E ++KE E+ K
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717
Score = 35.9 bits (82), Expect = 0.047
Identities = 19/53 (35%), Positives = 34/53 (64%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EE + E+E + + K++E EE+KK ++ KK +++ K + EE K+ E+D K
Sbjct: 1623 EELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKK 1675
Score = 35.9 bits (82), Expect = 0.051
Identities = 20/59 (33%), Positives = 31/59 (52%), Gaps = 2/59 (3%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKK--KKKKKKKKKRYEEEEKEEEKDNK 59
EK E+ + +E I +E + EE KKK + KK ++ +KK E +KE E+ K
Sbjct: 1645 EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703
Score = 35.9 bits (82), Expect = 0.056
Identities = 10/52 (19%), Positives = 26/52 (50%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K+ +E + + E E ++ + ++K + +KK+ E ++K + K
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386
Score = 35.5 bits (81), Expect = 0.060
Identities = 19/57 (33%), Positives = 34/57 (59%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K E + + +E K++E EE+KK ++ KK +++ K + EE +KE E+D K
Sbjct: 1688 KKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKK 1744
Score = 35.5 bits (81), Expect = 0.064
Identities = 19/56 (33%), Positives = 29/56 (51%), Gaps = 2/56 (3%)
Query: 3 EKGGEE--KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+K EE K + + + E E EEK + +KKK++ KKK ++K EEK
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEK 1390
Score = 35.1 bits (80), Expect = 0.10
Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 4/57 (7%)
Query: 7 EEKEDQEKEVGGISRGKEEET----EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
++KE +EK+ + EEE EE KK ++ KKK ++ K+ EE+EK+ + K
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695
Score = 34.7 bits (79), Expect = 0.10
Identities = 18/63 (28%), Positives = 36/63 (57%), Gaps = 10/63 (15%)
Query: 8 EKEDQEKEVGGISRGKEEET----------EEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
+K +++K+ ++ EE+ EE KK ++ KKK+ ++K+ EE K+ E++
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEE 1727
Query: 58 NKL 60
NK+
Sbjct: 1728 NKI 1730
Score = 34.7 bits (79), Expect = 0.11
Identities = 19/59 (32%), Positives = 35/59 (59%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
+K E K+ +E + ++ EE+ + ++ KK ++ KK ++KK+ EE +K EE N L
Sbjct: 1522 KKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMAL 1580
Score = 34.3 bits (78), Expect = 0.16
Identities = 19/57 (33%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K EE + + E+ + K++ +E KKK ++KKK + KK+ EE +K +E K
Sbjct: 1397 KKKAEEDKKKADELKKAAAAKKK-ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKK 1452
Score = 34.3 bits (78), Expect = 0.17
Identities = 21/63 (33%), Positives = 33/63 (52%), Gaps = 5/63 (7%)
Query: 2 NEKGGEE--KEDQEKEVGGISRGKEEE---TEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
++K +E K K+ ++ K EE +E KKK ++ KK + KK+ EE +K EE
Sbjct: 1403 DKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEA 1462
Query: 57 DNK 59
K
Sbjct: 1463 KKK 1465
Score = 34.0 bits (77), Expect = 0.19
Identities = 21/58 (36%), Positives = 38/58 (65%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
EK + +E ++ E I + ++ EEEKKK ++ KKK+ ++K+ EE K+ E++NK+
Sbjct: 1604 EKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKI 1661
Score = 34.0 bits (77), Expect = 0.20
Identities = 20/59 (33%), Positives = 32/59 (54%), Gaps = 5/59 (8%)
Query: 3 EKGGEE--KEDQEKEVGGISRGKEE---ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+K EE K D+ K+ ++ +E + EE KKK + KK + KK+ +E +K EE
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEA 1521
Score = 33.6 bits (76), Expect = 0.30
Identities = 17/60 (28%), Positives = 33/60 (55%), Gaps = 4/60 (6%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKK----KKRYEEEEKEEEKDNK 59
+ E+K+++ K+ ++ K EE ++ + KKK ++ KKK KK ++K +E K
Sbjct: 1367 EAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKK 1426
Score = 33.2 bits (75), Expect = 0.31
Identities = 18/58 (31%), Positives = 31/58 (53%), Gaps = 1/58 (1%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKK-KKRYEEEEKEEEKDNK 59
+K E + + + + EE+ E +KKK++ KKK KK+ EE++K +E K
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
Score = 33.2 bits (75), Expect = 0.31
Identities = 17/51 (33%), Positives = 29/51 (56%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K D+ K+ + E + EEKKK + KKK ++ K+ +E +K+ E+ K
Sbjct: 1279 KADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1329
Score = 33.2 bits (75), Expect = 0.36
Identities = 19/55 (34%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 8 EKEDQEKEVGGISRGKEEE---TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K+ +EK+ ++ K EE +E KKK ++ KK ++ KK+ EE +K +E K
Sbjct: 1424 KKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKK 1478
Score = 33.2 bits (75), Expect = 0.38
Identities = 11/57 (19%), Positives = 24/57 (42%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K K+ E+ K E + + ++K + +K+ EE +K+ + K
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385
Score = 33.2 bits (75), Expect = 0.38
Identities = 15/59 (25%), Positives = 28/59 (47%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K E K+ E +E E+K + +KKK++ KK+ + +K+ E+ K
Sbjct: 1334 AKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392
Score = 33.2 bits (75), Expect = 0.38
Identities = 12/57 (21%), Positives = 23/57 (40%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K E K+ E+ K++ E +K + K + + E E++ E K
Sbjct: 1315 KKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK 1371
Score = 33.2 bits (75), Expect = 0.39
Identities = 16/54 (29%), Positives = 28/54 (51%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+K + + + +E K++ E++KK + KK KKK E ++K EEK
Sbjct: 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK 1430
Score = 33.2 bits (75), Expect = 0.39
Identities = 18/57 (31%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K EEK+ + E + +++ +E KK KKK + KK+ EE++K +E K
Sbjct: 1384 KKKAEEKKKAD-EAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKK 1439
Score = 32.8 bits (74), Expect = 0.43
Identities = 8/54 (14%), Positives = 26/54 (48%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E+ ++ + +K+ + E E + + + ++K + E++++E +K
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKK 1378
Score = 32.8 bits (74), Expect = 0.47
Identities = 20/54 (37%), Positives = 33/54 (61%), Gaps = 1/54 (1%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+K E K+ +E+ K+E EE+KKK ++ KK +++KK+ +KEEEK
Sbjct: 1716 KKAEELKKAEEENKIKAEEAKKEA-EEDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768
Score = 32.8 bits (74), Expect = 0.52
Identities = 10/57 (17%), Positives = 25/57 (43%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
K +E + + +E + +++ EE KK + K + + E +E+ + +
Sbjct: 1315 KKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK 1371
Score = 32.4 bits (73), Expect = 0.57
Identities = 18/54 (33%), Positives = 32/54 (59%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+K E K+ +E + ++ EE + ++ KK ++KKK + K+ EE +K EEK
Sbjct: 1510 KKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEK 1563
Score = 32.4 bits (73), Expect = 0.58
Identities = 15/51 (29%), Positives = 31/51 (60%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K ++ ++ + + +E++ +E KK ++KKK + KK+ EE +K +E K
Sbjct: 1273 KAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKK 1323
Score = 32.0 bits (72), Expect = 0.74
Identities = 20/57 (35%), Positives = 32/57 (56%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K E K+ +E E K+E E +K ++ KKK+ ++KKK E ++ EEE K
Sbjct: 1675 KKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIK 1731
Score = 32.0 bits (72), Expect = 0.81
Identities = 15/56 (26%), Positives = 25/56 (44%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K +E + + +E K++ E +KK KKK ++ KK E + E E
Sbjct: 1302 KKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357
Score = 32.0 bits (72), Expect = 0.84
Identities = 17/60 (28%), Positives = 29/60 (48%), Gaps = 3/60 (5%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEE---ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E E+ E EK+ + + + EE+KK + KKK ++ KK+ +E +K K
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419
Score = 32.0 bits (72), Expect = 0.85
Identities = 19/58 (32%), Positives = 32/58 (55%), Gaps = 4/58 (6%)
Query: 3 EKGGEE--KEDQEKEVGGISRGKEEETE--EEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+K EE K D+ K+ ++ K +E + E KKK + KK ++ K+ +E +K EE
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEA 1533
Score = 32.0 bits (72), Expect = 0.91
Identities = 10/53 (18%), Positives = 25/53 (47%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
++K ++ K+ ++ K EE ++ + K + + + EE+ E + K
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373
Score = 31.6 bits (71), Expect = 0.92
Identities = 16/58 (27%), Positives = 29/58 (50%), Gaps = 4/58 (6%)
Query: 3 EKGGEE--KEDQEKEVGGISRGKEEETEE--EKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+K EE K D+ K+ ++ K + ++ E+ KK + K + + +E E EEK
Sbjct: 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365
Score = 31.6 bits (71), Expect = 1.1
Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 4/63 (6%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEE----TEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+K E K+ +E + K +E EE KK + KKK ++ KK+ +E +K E
Sbjct: 1449 AKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEA 1508
Query: 57 DNK 59
K
Sbjct: 1509 KKK 1511
Score = 31.3 bits (70), Expect = 1.3
Identities = 16/55 (29%), Positives = 31/55 (56%), Gaps = 3/55 (5%)
Query: 8 EKEDQEKEVGGISRGKEEE---TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K+ +E + ++ K EE +E KKK ++ KKK + K+ E +K+ ++ K
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKK 1517
Score = 31.3 bits (70), Expect = 1.5
Identities = 15/57 (26%), Positives = 27/57 (47%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EK ++ + +E K++ E +K + KKK ++ KKK ++K EE
Sbjct: 1288 EKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344
Score = 30.9 bits (69), Expect = 1.7
Identities = 15/50 (30%), Positives = 29/50 (58%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
++K D+ K+ + +E + E+ KK + KK ++ K+ +E +K EEK
Sbjct: 1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEK 1545
Score = 30.5 bits (68), Expect = 2.1
Identities = 13/56 (23%), Positives = 26/56 (46%), Gaps = 4/56 (7%)
Query: 8 EKEDQEKEVGGISRGKEEET----EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K+ +E + ++ K EE + KKK ++ KK + K E +E + +
Sbjct: 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAE 1363
Score = 30.5 bits (68), Expect = 2.4
Identities = 14/57 (24%), Positives = 30/57 (52%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E+ ++ E+ +K + +E + +E KKK ++ KKK + + E K++ + K
Sbjct: 1460 EEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAK 1516
Score = 30.5 bits (68), Expect = 2.4
Identities = 15/55 (27%), Positives = 31/55 (56%), Gaps = 2/55 (3%)
Query: 3 EKGGEE--KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
+K +E K + K+ ++ EE + ++ KK ++ KK + K+ EE++K +E
Sbjct: 1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADE 1550
Score = 30.5 bits (68), Expect = 2.5
Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETE--EEKKKKKKKKKKKKKKKRYEEEEKEEE 55
+ E E+ G K+ ET EE +K ++ KKK + ++ EE K E+
Sbjct: 1085 EDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAED 1138
Score = 30.1 bits (67), Expect = 2.8
Identities = 12/53 (22%), Positives = 30/53 (56%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
++ ++ E+ +K+ + E + + ++ KK ++ KK + K+ EE +K +E
Sbjct: 1486 DEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADE 1538
Score = 29.7 bits (66), Expect = 4.1
Identities = 17/54 (31%), Positives = 31/54 (57%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+K E K+ E + ++ +E + E+KKK + KK ++ K+ EE++K EE
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEA 1569
Score = 29.7 bits (66), Expect = 4.2
Identities = 14/55 (25%), Positives = 29/55 (52%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
+ +K+ E + ++ +E + E+ KK + KK ++KK+ +E +K EE
Sbjct: 1502 AKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE 1556
Score = 29.7 bits (66), Expect = 4.6
Identities = 13/47 (27%), Positives = 26/47 (55%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
K ++E++ + ++ + E KK ++ KK ++ K+ EEE EE
Sbjct: 1207 KAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEE 1253
Score = 29.3 bits (65), Expect = 4.9
Identities = 19/55 (34%), Positives = 32/55 (58%), Gaps = 3/55 (5%)
Query: 8 EKEDQEKEVGGISRGKEEETE--EEKKKKKKKKKKKKK-KKRYEEEEKEEEKDNK 59
+K+ +E + ++ K EE + EE KKK ++ KK + KK+ EE +K +E K
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491
Score = 29.3 bits (65), Expect = 5.9
Identities = 16/53 (30%), Positives = 27/53 (50%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
K +E + + +E K++ E +KK + KK + KKK E ++ EE K
Sbjct: 1470 KKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK 1522
Score = 29.3 bits (65), Expect = 6.0
Identities = 18/57 (31%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETE-EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K E K+ +EK+ ++ EE+ + +E KKK ++ KK + K+ EE K++ K
Sbjct: 1279 KADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAK 1335
Score = 29.0 bits (64), Expect = 6.8
Identities = 17/57 (29%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK----KRYEEEEKEEEKDNK 59
++K +Q K+ + K EE ++ +++ K K ++ KK K+ EE K+ E+D K
Sbjct: 1632 KKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688
Score = 29.0 bits (64), Expect = 7.3
Identities = 12/56 (21%), Positives = 24/56 (42%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+ + G + ++ + + +E EE K ++ KK + K E + EE K
Sbjct: 1066 VGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAK 1121
Score = 29.0 bits (64), Expect = 7.7
Identities = 14/60 (23%), Positives = 33/60 (55%), Gaps = 3/60 (5%)
Query: 3 EKGGEEK---EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K EE ++ +K+ + +E + + E+ KK + KKK ++ + +E K++ ++ K
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAK 1496
Score = 28.6 bits (63), Expect = 9.0
Identities = 18/52 (34%), Positives = 32/52 (61%), Gaps = 4/52 (7%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
K E K++ E++ + EE ++E++KKK KK+++K+ EE KE+E
Sbjct: 1731 KAEEAKKEAEED----KKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKE 1778
Score = 28.6 bits (63), Expect = 9.3
Identities = 15/54 (27%), Positives = 28/54 (51%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E+ EE E+ ++ + E+ +K + KK ++KK+ +E +K EEK
Sbjct: 1248 ERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEK 1301
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 36.5 bits (85), Expect = 0.006
Identities = 20/63 (31%), Positives = 33/63 (52%), Gaps = 9/63 (14%)
Query: 5 GGEEKEDQEKEVGGISRGK--------EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
GG DQ EV I+R + +E E ++ + ++KK +++ KR EE KE E+
Sbjct: 64 GGGLFVDQA-EVDAIARSRVQPVLDEIDERAEAQRARDEEKKLDEEEAKRQHEEAKERER 122
Query: 57 DNK 59
+ K
Sbjct: 123 EKK 125
Score = 28.4 bits (64), Expect = 3.1
Identities = 9/38 (23%), Positives = 23/38 (60%)
Query: 19 ISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
I E + +++KK +++ K++ + +E E+E++K
Sbjct: 89 IDERAEAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association
with pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 35.4 bits (82), Expect = 0.006
Identities = 17/48 (35%), Positives = 29/48 (60%), Gaps = 2/48 (4%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
GEE E+ ++++ + ++ ++KKKKKKK KKK K + +KE
Sbjct: 17 GEEGEEDDEQID--ASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKE 62
Score = 33.8 bits (78), Expect = 0.021
Identities = 14/39 (35%), Positives = 21/39 (53%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
E+ E++ I K + ++KKKKKKK KKK +
Sbjct: 16 LGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54
Score = 31.5 bits (72), Expect = 0.15
Identities = 15/43 (34%), Positives = 22/43 (51%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
G E+ +++ E S+ + +KKKKKKK KKK K
Sbjct: 12 LPNGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54
Score = 26.9 bits (60), Expect = 5.8
Identities = 20/61 (32%), Positives = 29/61 (47%), Gaps = 5/61 (8%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKK-----KKKKKKRYEEEEKEEEKDN 58
G E +ED E+ R + +KKKKKK KK +KK++ KE + D+
Sbjct: 16 LGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKEVKPDS 75
Query: 59 K 59
K
Sbjct: 76 K 76
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 36.9 bits (86), Expect = 0.009
Identities = 11/36 (30%), Positives = 27/36 (75%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
E E++++K++++K +KK+KR E ++KE+ + ++
Sbjct: 5 EALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEA 40
Score = 36.1 bits (84), Expect = 0.016
Identities = 14/38 (36%), Positives = 25/38 (65%)
Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
R K E+ + KK+K+K+ KKK+ ++ E EE + E++
Sbjct: 12 RRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENK 49
Score = 35.4 bits (82), Expect = 0.034
Identities = 13/32 (40%), Positives = 23/32 (71%)
Query: 28 EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E+ K +KK+K+K+ KKK+ ++ E EE K+ +
Sbjct: 16 EQRKARKKQKRKEAKKKEDAQKSEAEEVKNEE 47
Score = 32.3 bits (74), Expect = 0.34
Identities = 8/37 (21%), Positives = 22/37 (59%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+++++K++++K +KK+ +E K++E K
Sbjct: 1 PSSREALLEQRRRKREQRKARKKQKRKEAKKKEDAQK 37
Score = 31.5 bits (72), Expect = 0.53
Identities = 12/32 (37%), Positives = 22/32 (68%)
Query: 28 EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E++K +KK+K+K+ KKK ++ + EE N+
Sbjct: 15 REQRKARKKQKRKEAKKKEDAQKSEAEEVKNE 46
Score = 30.7 bits (70), Expect = 1.1
Identities = 8/37 (21%), Positives = 26/37 (70%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+E E+ ++K++++K +KK+K++ +++++ +K
Sbjct: 4 REALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEA 40
Score = 30.0 bits (68), Expect = 2.2
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
KK K+K+K+KKK K+ +E KE+ + K
Sbjct: 145 KKALKRKEKQKKKSKKEWKERKEKVEKKK 173
Score = 29.6 bits (67), Expect = 2.3
Identities = 10/16 (62%), Positives = 13/16 (81%)
Query: 29 EEKKKKKKKKKKKKKK 44
++KK KKKKK KKK +
Sbjct: 189 DDKKNKKKKKAKKKGR 204
Score = 29.6 bits (67), Expect = 2.6
Identities = 11/17 (64%), Positives = 11/17 (64%)
Query: 31 KKKKKKKKKKKKKKKRY 47
KK KKKKK KKK R
Sbjct: 189 DDKKNKKKKKAKKKGRI 205
Score = 29.6 bits (67), Expect = 2.6
Identities = 13/23 (56%), Positives = 15/23 (65%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKR 46
EE ++ K KK KKKKK KKK
Sbjct: 181 EENLKKRKDDKKNKKKKKAKKKG 203
Score = 29.6 bits (67), Expect = 2.7
Identities = 23/61 (37%), Positives = 31/61 (50%), Gaps = 3/61 (4%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEE--ETEEEKKKKKKKKKKKKKKKRYEEE-EKEEEKDNK 59
K ++ E G I K E + E+ KK K KKKKKKKK Y++ +K E + K
Sbjct: 48 NKSKKKAAPIENAEGNIVFSKVEFADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKK 107
Query: 60 L 60
L
Sbjct: 108 L 108
Score = 28.8 bits (65), Expect = 4.1
Identities = 10/16 (62%), Positives = 12/16 (75%)
Query: 30 EKKKKKKKKKKKKKKK 45
+ KK KKKKK KKK +
Sbjct: 189 DDKKNKKKKKAKKKGR 204
Score = 28.8 bits (65), Expect = 4.7
Identities = 12/26 (46%), Positives = 21/26 (80%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEK 56
+K+K+KKK KK+ K+R E+ EK++ +
Sbjct: 150 RKEKQKKKSKKEWKERKEKVEKKKAE 175
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 36.3 bits (84), Expect = 0.010
Identities = 17/58 (29%), Positives = 30/58 (51%), Gaps = 3/58 (5%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+E+E+ E+E + E+E E K + K+KKK++ K E+EK + + K
Sbjct: 91 LSSSDDEEEETEEES---TDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKK 145
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 37.6 bits (88), Expect = 0.010
Identities = 15/54 (27%), Positives = 25/54 (46%), Gaps = 3/54 (5%)
Query: 9 KEDQEKEVGGISR---GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K EK+ G S+ K E +E++K+K K KK+ + K + K +
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 36.5 bits (85), Expect = 0.027
Identities = 13/48 (27%), Positives = 22/48 (45%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
+ K+V K+E+ +E+ K KK+ + K KR + EE
Sbjct: 387 KTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 36.1 bits (84), Expect = 0.034
Identities = 14/33 (42%), Positives = 19/33 (57%)
Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
T+ +KK K KK KR E++EKE+EK
Sbjct: 380 TKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKV 412
Score = 35.3 bits (82), Expect = 0.063
Identities = 13/51 (25%), Positives = 23/51 (45%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
+ + +K + + KE+E E+ K KK+ + K K+R EE
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 33.0 bits (76), Expect = 0.33
Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 6/53 (11%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+E + K GK + K+ +KK+K+K+K K K+ +D K
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKV------KKRHRDTK 420
Score = 29.5 bits (67), Expect = 3.9
Identities = 10/35 (28%), Positives = 18/35 (51%)
Query: 28 EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
+KK K KK K+ ++E+++E+ K H
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 37.0 bits (86), Expect = 0.011
Identities = 13/68 (19%), Positives = 31/68 (45%), Gaps = 11/68 (16%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEE-----EKKKKKKKKKKKKKKKRYEEEEKEEE 55
+ +D + + E+ET EK KK++ ++K+++++ EE++
Sbjct: 124 SSSDSDSSDDDSDDD------DSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAR 177
Query: 56 KDNKLLHN 63
++ L N
Sbjct: 178 EEEILTGN 185
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 37.4 bits (87), Expect = 0.013
Identities = 17/34 (50%), Positives = 22/34 (64%)
Query: 22 GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
E + KK KKKKKKKKK++K +E+E E E
Sbjct: 267 YDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPE 300
Score = 37.4 bits (87), Expect = 0.014
Identities = 17/33 (51%), Positives = 23/33 (69%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
+ +E K KK KKKKKKKKK+R + +E E E +
Sbjct: 268 DVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPE 300
Score = 34.0 bits (78), Expect = 0.18
Identities = 15/62 (24%), Positives = 35/62 (56%), Gaps = 6/62 (9%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEE------KKKKKKKKKKKKKKKRYEEEEKEEEKD 57
EK ++ ++GGI E++ +++ KK KK++KKK+ ++K+ +++E++
Sbjct: 63 AKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQKKKEAERKKALLLDEKEKER 122
Query: 58 NK 59
Sbjct: 123 AA 124
Score = 32.8 bits (75), Expect = 0.42
Identities = 14/25 (56%), Positives = 18/25 (72%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEE 55
K KKKKKKKKK++K E+E + E
Sbjct: 277 KPKKKKKKKKKRRKDLDEDELEPEA 301
Score = 32.8 bits (75), Expect = 0.43
Identities = 19/71 (26%), Positives = 33/71 (46%), Gaps = 12/71 (16%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEK------------KKKKKKKKKKKKKKRYEE 49
+E + +E ++K ++ TE KK KKKKKKKKK+++ +
Sbjct: 234 DEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLD 293
Query: 50 EEKEEEKDNKL 60
E++ E + L
Sbjct: 294 EDELEPEAEGL 304
Score = 31.7 bits (72), Expect = 0.86
Identities = 14/29 (48%), Positives = 17/29 (58%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
KKKKKKKKK++K +E E E E
Sbjct: 278 PKKKKKKKKKRRKDLDEDELEPEAEGLGS 306
Score = 30.1 bits (68), Expect = 2.4
Identities = 13/51 (25%), Positives = 25/51 (49%), Gaps = 1/51 (1%)
Query: 7 EEKEDQEKEVGGISRGKEEE-TEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EE E ++K+ + T++E +KK+++ KKK K ++ E
Sbjct: 211 EEIEGKKKKSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTET 261
Score = 29.0 bits (65), Expect = 5.6
Identities = 17/69 (24%), Positives = 27/69 (39%), Gaps = 16/69 (23%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKK----------------KKKKKKKKKRYEEE 50
E K +E+ I++ +E+ K K KK KK++KK+ E
Sbjct: 50 EAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQKKKEAER 109
Query: 51 EKEEEKDNK 59
+K D K
Sbjct: 110 KKALLLDEK 118
Score = 29.0 bits (65), Expect = 6.1
Identities = 14/46 (30%), Positives = 24/46 (52%), Gaps = 7/46 (15%)
Query: 24 EEETEE-------EKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
E+E +E EK+K KK + KKKK Y+ ++ ++ +L
Sbjct: 162 EDEGDELENVELVEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILS 207
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 37.6 bits (87), Expect = 0.014
Identities = 14/55 (25%), Positives = 26/55 (47%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
E +E E + G ++ KEE ++ K K K + KK + +E + + + L
Sbjct: 435 ELEESLETKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETKLVKLLEQLEL 489
Score = 36.1 bits (83), Expect = 0.048
Identities = 18/60 (30%), Positives = 35/60 (58%), Gaps = 1/60 (1%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKK-KKKKKKKKRYEEEEKEEEKDNKL 60
+ +++E++E++ + KE EEEK +K K ++K++K K EEE + E++ K
Sbjct: 754 EKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKE 813
Score = 34.6 bits (79), Expect = 0.14
Identities = 12/54 (22%), Positives = 24/54 (44%), Gaps = 2/54 (3%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKK--KKKKKKRYEEEEKEEEKDN 58
++E + I+ +E+E K + KK++ +KK+ R EE +
Sbjct: 969 AKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKE 1022
Score = 34.2 bits (78), Expect = 0.17
Identities = 18/49 (36%), Positives = 27/49 (55%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
E K + K +E+ E EK+ KK +K+ KK+K+ EE EKE +
Sbjct: 298 ELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELK 346
Score = 33.4 bits (76), Expect = 0.30
Identities = 13/43 (30%), Positives = 25/43 (58%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLY 67
+ E + ++ K K++ KK + Y+ +EK E ++ LL+ YL
Sbjct: 192 DLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLK 234
Score = 32.2 bits (73), Expect = 0.59
Identities = 18/54 (33%), Positives = 34/54 (62%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
+EK+ QE+E+ +++ +EE E K +++K ++K K E+E K+ EK+ K
Sbjct: 280 KEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKK 333
Score = 32.2 bits (73), Expect = 0.65
Identities = 11/57 (19%), Positives = 24/57 (42%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
EE Q + +++ EEE K K++++ K + E K ++++
Sbjct: 265 EEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESE 321
Score = 31.9 bits (72), Expect = 0.80
Identities = 17/63 (26%), Positives = 34/63 (53%), Gaps = 2/63 (3%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
E+ ++ EKE+ + KE + E+E+ ++ +K+ K+ + KR EEE+EE+ +
Sbjct: 309 RKVDDEEKLKESEKELKKLE--KELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQ 366
Query: 61 LHN 63
Sbjct: 367 EKL 369
Score = 31.5 bits (71), Expect = 1.1
Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 6/66 (9%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKK------KKKKKKKKKKKRYEEEEKEEE 55
EK E+ ++E + I R EEE EE+ +K +++ KKK E +
Sbjct: 333 KEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAK 392
Query: 56 KDNKLL 61
+ L
Sbjct: 393 LKEEEL 398
Score = 31.1 bits (70), Expect = 1.7
Identities = 12/47 (25%), Positives = 24/47 (51%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
EE E+ + + + K +E + +++ KK + + K+K EEE
Sbjct: 181 EETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENL 227
Score = 30.7 bits (69), Expect = 1.8
Identities = 13/58 (22%), Positives = 31/58 (53%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
E+ +E+ + +E + +E+ EEE ++ + K+++K ++ EEE E ++
Sbjct: 806 ALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEE 863
Score = 30.7 bits (69), Expect = 1.8
Identities = 13/56 (23%), Positives = 29/56 (51%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
+ + + + K +E+E+E KK +K+ KK+K++ E+E +E + +
Sbjct: 300 KSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAE 355
Score = 30.7 bits (69), Expect = 2.2
Identities = 13/61 (21%), Positives = 27/61 (44%), Gaps = 3/61 (4%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
+ KE + E E+ EE+K++K K ++++ + EE ++E E +
Sbjct: 765 EEKSELSLKEKELAEEEEK---TEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEE 821
Query: 61 L 61
Sbjct: 822 Q 822
Score = 29.9 bits (67), Expect = 3.2
Identities = 17/49 (34%), Positives = 29/49 (59%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
K++ EKE +++ +E EEEK+KK ++++ K K EE + E K
Sbjct: 257 SKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLK 305
Score = 29.9 bits (67), Expect = 3.7
Identities = 11/56 (19%), Positives = 23/56 (41%), Gaps = 6/56 (10%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
E K + K ++ + E + K+K + +++ Y + K E+ LL
Sbjct: 195 ELKLQELKLK------EQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQ 244
Score = 29.2 bits (65), Expect = 5.1
Identities = 15/56 (26%), Positives = 23/56 (41%), Gaps = 1/56 (1%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK-KRYEEEEKEEEKDNKLL 61
E ++ E KEEE E K+ K++ E E++EE+ NK
Sbjct: 940 ELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDE 995
Score = 29.2 bits (65), Expect = 5.2
Identities = 14/59 (23%), Positives = 29/59 (49%), Gaps = 6/59 (10%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
E +E+E+ E +E E EEE + K+ K+++K++ +EE+ + +
Sbjct: 244 QELLRDEQEEIESSK------QELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEE 296
Score = 28.8 bits (64), Expect = 7.6
Identities = 13/54 (24%), Positives = 25/54 (46%), Gaps = 2/54 (3%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
E +E +E+ K+E EEEKK+ ++ ++ ++ E E + L
Sbjct: 983 EFEEKEERYNK--DELKKERLEEEKKELLREIIEETCQRFKEFLELFVSINRGL 1034
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 36.3 bits (84), Expect = 0.017
Identities = 25/63 (39%), Positives = 33/63 (52%), Gaps = 7/63 (11%)
Query: 18 GIS-RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLYSYYRVTHIC 76
GIS R K T EKKKK K KK K K K+ E+++K E N + Y + +THI
Sbjct: 77 GISWRRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKARE-----GWNNFSY-FPEITHIV 130
Query: 77 FSD 79
+
Sbjct: 131 IKE 133
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 36.8 bits (86), Expect = 0.018
Identities = 20/52 (38%), Positives = 34/52 (65%), Gaps = 5/52 (9%)
Query: 13 EKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRY-----EEEEKEEEKDNK 59
E+E I + EEE +EE ++KK++KKK++++ + EE+ K EEK+ K
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERK 317
Score = 32.2 bits (74), Expect = 0.51
Identities = 12/39 (30%), Positives = 25/39 (64%), Gaps = 5/39 (12%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
EEK+ +E+E + + EE++K ++K++KK+ +K
Sbjct: 289 EEKKKEERE-----AKLAKLSPEEQRKLEEKERKKQARK 322
Score = 31.8 bits (73), Expect = 0.72
Identities = 12/45 (26%), Positives = 28/45 (62%), Gaps = 3/45 (6%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
E+ E+KE+++KE K + E+++K ++K++KK+ ++
Sbjct: 281 QEEAQEKKEEKKKEE---REAKLAKLSPEEQRKLEEKERKKQARK 322
Score = 28.4 bits (64), Expect = 7.8
Identities = 11/38 (28%), Positives = 26/38 (68%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
+EEE + K +++++++ ++KK +++E+ E K KL
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKL 303
Score = 28.4 bits (64), Expect = 8.0
Identities = 14/38 (36%), Positives = 27/38 (71%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
K E EEEK K +++++++ + +EE+K+EE++ KL
Sbjct: 263 KTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKL 300
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 35.5 bits (82), Expect = 0.020
Identities = 11/33 (33%), Positives = 25/33 (75%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EE+E+E+K+ K+ ++K+ K +E+++++EK
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEK 124
Score = 33.6 bits (77), Expect = 0.099
Identities = 12/39 (30%), Positives = 26/39 (66%)
Query: 22 GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
+E E E+++ K+ ++K+ K K+ ++E+KE++ KL
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKL 130
Score = 31.3 bits (71), Expect = 0.53
Identities = 8/31 (25%), Positives = 20/31 (64%)
Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
EE +K++K+ K+ ++K + +KE+++
Sbjct: 91 FPEESEKEQKEVSKETEEKEAIKAKKEKKEK 121
Score = 31.3 bits (71), Expect = 0.57
Identities = 19/54 (35%), Positives = 29/54 (53%), Gaps = 6/54 (11%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EE E ++KE +E E+E K KK+KK+KK+KK E+ K++
Sbjct: 92 PEESEKEQKE------VSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTT 139
Score = 30.9 bits (70), Expect = 0.63
Identities = 11/36 (30%), Positives = 24/36 (66%)
Query: 26 ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
E EK++K+ K+ ++K+ ++EK+E+K+ K+
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVA 127
Score = 30.1 bits (68), Expect = 1.2
Identities = 17/54 (31%), Positives = 27/54 (50%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EK +E + +E I KE++ ++EKK +K KKK K +K +K
Sbjct: 96 EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKK 149
Score = 29.3 bits (66), Expect = 2.0
Identities = 13/53 (24%), Positives = 29/53 (54%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E+ ++E++ ++E + +K+KK+KK+KK +K ++ K + K
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTK 144
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 37.1 bits (86), Expect = 0.022
Identities = 14/57 (24%), Positives = 28/57 (49%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
++++ + +E +EK +RG+ ++ ++ KK KK K+ E E EE
Sbjct: 1168 LDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETY 1224
Score = 31.0 bits (70), Expect = 1.8
Identities = 18/58 (31%), Positives = 24/58 (41%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K E E E+ G + E E K K + KKK E+EE++E D K
Sbjct: 1210 TTKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLK 1267
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 35.6 bits (82), Expect = 0.023
Identities = 19/42 (45%), Positives = 29/42 (69%)
Query: 11 DQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
D E G + K+++ E++K++KKKKK+KKKKKKR+ E
Sbjct: 133 DSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHP 174
Score = 33.3 bits (76), Expect = 0.14
Identities = 13/54 (24%), Positives = 29/54 (53%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
K + + S + + E+K KKKK + K++KK+ +E++K++++
Sbjct: 115 HKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKR 168
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 35.3 bits (82), Expect = 0.024
Identities = 17/55 (30%), Positives = 32/55 (58%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
+EKE++E+E + EEE +E +K+ K K++K++ E ++KE K+
Sbjct: 95 GLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKE 149
Score = 30.7 bits (70), Expect = 0.89
Identities = 13/50 (26%), Positives = 29/50 (58%), Gaps = 7/50 (14%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E +E E+E + +E E++ K K++K+++ +++ +E KE+ K
Sbjct: 110 EVEELDEEE-------QIDELLEKELAKLKREKRRENERKQKEILKEQMK 152
Score = 29.2 bits (66), Expect = 2.7
Identities = 10/36 (27%), Positives = 22/36 (61%)
Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNC 64
+ KK+K+++++++ EE EEE+ ++LL
Sbjct: 92 KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKE 127
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 36.5 bits (85), Expect = 0.025
Identities = 19/63 (30%), Positives = 33/63 (52%), Gaps = 11/63 (17%)
Query: 7 EEKEDQEKEVGGISRGKEEE-----------TEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
EE QEKE G G E+KK++KK+++K+ K+ R ++++ E+E
Sbjct: 149 EEPVQQEKEAAGSGGGGGGTWGLVGVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDE 208
Query: 56 KDN 58
D+
Sbjct: 209 DDD 211
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 36.5 bits (85), Expect = 0.026
Identities = 14/35 (40%), Positives = 23/35 (65%)
Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
EEK+ K++K+ +K KK+ E+++KE EK K
Sbjct: 558 EEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIP 592
Score = 34.6 bits (80), Expect = 0.12
Identities = 11/34 (32%), Positives = 26/34 (76%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+E+E +E K++K+ +K KK++++ ++E ++ EK
Sbjct: 555 REKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 34.2 bits (79), Expect = 0.13
Identities = 13/33 (39%), Positives = 24/33 (72%)
Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
EE +++K++K+ K++KR + +K+EEK K L
Sbjct: 551 EELQREKEEKEALKEQKRLRKLKKQEEKKKKEL 583
Score = 33.1 bits (76), Expect = 0.37
Identities = 15/48 (31%), Positives = 32/48 (66%), Gaps = 7/48 (14%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
++KE+ ++E KEE+ +++K+ +K KK+++KK+ E E+ E+
Sbjct: 548 DDKEELQRE-------KEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 33.1 bits (76), Expect = 0.37
Identities = 15/51 (29%), Positives = 31/51 (60%)
Query: 10 EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
ED+ +EE + EK++K+ K++K+ +K ++EEK++++ KL
Sbjct: 536 EDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKL 586
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
This presumed domain is found at the N-terminus of
Ribosomal L30 proteins and has been termed RL30NT or
NUC018.
Length = 71
Score = 33.4 bits (77), Expect = 0.027
Identities = 14/46 (30%), Positives = 23/46 (50%), Gaps = 9/46 (19%)
Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKR---------YEEEEKEEEKD 57
R + E+ ++ KK+ KK +KKKR Y +E + E+D
Sbjct: 6 RKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERD 51
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 34.8 bits (81), Expect = 0.030
Identities = 17/55 (30%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
EE +D ++ + K+E EK +KK +K K K +K ++ K+EE +L
Sbjct: 48 NEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEK---KKAKKEEPKPRL 99
Score = 29.0 bits (66), Expect = 3.0
Identities = 9/25 (36%), Positives = 16/25 (64%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRY 47
K+ E + K +KKK KK++ K + +
Sbjct: 76 KKAEKAKAKAEKKKAKKEEPKPRLF 100
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion
and trafficking].
Length = 150
Score = 34.7 bits (80), Expect = 0.031
Identities = 16/55 (29%), Positives = 32/55 (58%), Gaps = 4/55 (7%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
EK+D E + +EETE +K+ +++KK ++++E+ ++EE N+L
Sbjct: 31 YEKKDIEIN----TDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLF 81
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 35.1 bits (81), Expect = 0.042
Identities = 11/50 (22%), Positives = 31/50 (62%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EE +++EKE +++ E++ ++ +++ +++ ++ +E EKE E+
Sbjct: 63 EEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEE 112
Score = 35.1 bits (81), Expect = 0.047
Identities = 18/53 (33%), Positives = 26/53 (49%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E K+ ++E I KEEE E + K+ K +K+ EEE EEE +
Sbjct: 50 EAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEES 102
Score = 33.6 bits (77), Expect = 0.15
Identities = 17/58 (29%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E+ E ++KE G + ++EE+EEE +++ ++ + +K E EEK E K
Sbjct: 66 KEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEK---ETEEKTESNVEK 120
Score = 32.4 bits (74), Expect = 0.35
Identities = 14/61 (22%), Positives = 30/61 (49%), Gaps = 2/61 (3%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKK--KKKKKKKKKRYEEEEKEEEKDN 58
++ E+E + E EE K+++K+ + K+ K E+E++E E++N
Sbjct: 36 FPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEEN 95
Query: 59 K 59
+
Sbjct: 96 E 96
Score = 28.5 bits (64), Expect = 6.3
Identities = 18/57 (31%), Positives = 28/57 (49%), Gaps = 4/57 (7%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
N + E+K D EKE EEE EEE ++ + +K+ ++K EKE +
Sbjct: 74 NSEDKEDKGDAEKE----DEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPS 126
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 36.0 bits (83), Expect = 0.045
Identities = 13/43 (30%), Positives = 19/43 (44%)
Query: 15 EVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
+ R + +TE+ KKK K KKKK+ + E K
Sbjct: 110 LAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKY 152
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 35.3 bits (82), Expect = 0.053
Identities = 20/49 (40%), Positives = 28/49 (57%), Gaps = 7/49 (14%)
Query: 5 GGEEKEDQEKEVGGISR-------GKEEETEEEKKKKKKKKKKKKKKKR 46
G E KE+ K + I K EE + +K+KKKKK+KKK KK++
Sbjct: 361 GDELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRK 409
Score = 32.6 bits (75), Expect = 0.38
Identities = 11/24 (45%), Positives = 20/24 (83%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKR 46
K ++ +++KK+KKK KK+KKK ++
Sbjct: 391 KPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 29.2 bits (66), Expect = 4.5
Identities = 13/41 (31%), Positives = 26/41 (63%), Gaps = 5/41 (12%)
Query: 24 EEETEEEKKK-----KKKKKKKKKKKKRYEEEEKEEEKDNK 59
+ EE K+K KKK+++KK +K++ +++ K++ K K
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRK 409
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 35.5 bits (82), Expect = 0.054
Identities = 16/55 (29%), Positives = 34/55 (61%), Gaps = 3/55 (5%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
+K GE K++ + ++ KEE+ ++E+ +KK+K++ + K E+ +K+E K
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK---EKRQKDERKK 434
Score = 35.2 bits (81), Expect = 0.073
Identities = 13/52 (25%), Positives = 33/52 (63%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K + K+ S E + +EEK K+++ +KK+K++ ++E++++++ K
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 34.0 bits (78), Expect = 0.16
Identities = 15/56 (26%), Positives = 32/56 (57%), Gaps = 3/56 (5%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKK---KKKKKKRYEEEEKEEEKDNK 59
++ +D K++G + + E E K K++K K +KK+K +E+KE+ + ++
Sbjct: 376 KKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431
Score = 30.1 bits (68), Expect = 2.4
Identities = 10/52 (19%), Positives = 24/52 (46%), Gaps = 7/52 (13%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
G+++++ KE ++ ++ KK + K + + E + KEE+
Sbjct: 364 GDKRQELLKEY-------NKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLK 408
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A
peptide from the central, syntaxin binding domain of
synaphin competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal
exocytosis by promoting interaction between the
complementary syntaxin and synaptobrevin transmembrane
regions that reside in opposing membranes prior to
fusion.
Length = 139
Score = 34.1 bits (78), Expect = 0.055
Identities = 11/47 (23%), Positives = 22/47 (46%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
+E E +E+E +E ++ ++++K K +K EE E
Sbjct: 17 KGMLGGKEDEGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREV 63
Score = 32.9 bits (75), Expect = 0.12
Identities = 17/52 (32%), Positives = 27/52 (51%), Gaps = 7/52 (13%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
GG+E E E EEE EE ++ ++ ++++K K R EEE+E
Sbjct: 19 MLGGKEDEGDES-------DAEEEDEEIQEALREAEEERKAKHRKMEEEREV 63
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called
The RES complex, pre-mRNA retention and splicing
complex. Subunits of this complex are not essential for
viability of yeasts but they are required for efficient
splicing in vitro and in vivo. Furthermore,
inactivation of this complex causes pre-mRNA leakage
from the nucleus. Bud13 contains a unique,
phylogenetically conserved C-terminal region of unknown
function.
Length = 141
Score = 33.8 bits (78), Expect = 0.057
Identities = 19/54 (35%), Positives = 35/54 (64%), Gaps = 8/54 (14%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKK--KKKKKKKKRYEEEEKEEEKDN 58
E++E++E+E +E+E +EEK+K+ K +K++++KR EE EK + K
Sbjct: 16 EKREEKEREK------EEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPL 63
Score = 33.8 bits (78), Expect = 0.058
Identities = 15/47 (31%), Positives = 29/47 (61%), Gaps = 7/47 (14%)
Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKRY-------EEEEKEEEKDNKL 60
EE+ EE++++K++K++K++K+K + EE EK E+ K
Sbjct: 12 IDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKA 58
Score = 29.2 bits (66), Expect = 2.1
Identities = 19/59 (32%), Positives = 32/59 (54%), Gaps = 3/59 (5%)
Query: 3 EKGGEEKEDQEKEVGG---ISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
++ E KE++EKE G +E+ EE +K K K + + Y+EE KE+E+ +
Sbjct: 25 KEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDYDEELKEQERWD 83
>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
II/NMEMI. CRISPR loci appear to be mobile elements with
a wide host range. This model represents a protein found
only in CRISPR-containing species, near other
CRISPR-associated proteins (cas), as part of the NMENI
subtype of CRISPR/Cas locus. The species range so far
for this protein is animal pathogens and commensals only
[Mobile and extrachromosomal element functions, Other].
Length = 805
Score = 35.5 bits (82), Expect = 0.057
Identities = 18/65 (27%), Positives = 27/65 (41%), Gaps = 7/65 (10%)
Query: 19 ISRGKEEETEEEKKKKKKKKKKKKKKKRYE-------EEEKEEEKDNKLLHNCYLYSYYR 71
I +EE+ K+ K++ KK + K E E KEE +N + L YY+
Sbjct: 520 IEMAREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSSKNILKLRLYYQ 579
Query: 72 VTHIC 76
C
Sbjct: 580 QNGKC 584
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 32.5 bits (74), Expect = 0.058
Identities = 6/34 (17%), Positives = 15/34 (44%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
E E E+ + K + ++++ ++E D
Sbjct: 46 EGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDD 79
Score = 28.6 bits (64), Expect = 1.7
Identities = 5/32 (15%), Positives = 15/32 (46%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
+ E+ ++++ K K +++E +E
Sbjct: 49 EPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80
Score = 27.4 bits (61), Expect = 3.7
Identities = 10/46 (21%), Positives = 16/46 (34%), Gaps = 12/46 (26%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEE 51
GEE ED + + E++ K K E++E
Sbjct: 47 GEEPEDDDDD------------EDDDDDDDKDDKDDDDDDDDEDDE 80
>gnl|CDD|191136 pfam04949, Transcrip_act, Transcriptional activator. This family
of proteins may act as a transcriptional activator. It
plays a role in stress response in plants.
Length = 160
Score = 34.1 bits (78), Expect = 0.061
Identities = 13/52 (25%), Positives = 27/52 (51%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
+++D+E +S K +E E E++K + +++ + + R EEE K
Sbjct: 15 FGGSPMMDEKDEELSRSALSLFKAKEEEIERRKMEVRERVQAQLGRVEEETK 66
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 35.1 bits (81), Expect = 0.063
Identities = 16/49 (32%), Positives = 25/49 (51%), Gaps = 2/49 (4%)
Query: 13 EKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
KEV + KE+ TE + K+ KK+ +K+ E +E + D KL
Sbjct: 60 NKEVAKLKIAKEDATELIAETKELKKEITEKEA--EVQEAKAALDAKLK 106
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 33.3 bits (76), Expect = 0.064
Identities = 15/27 (55%), Positives = 19/27 (70%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEE 50
E TE+ KKKKKK KK KK KK +++
Sbjct: 94 PEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 32.2 bits (73), Expect = 0.16
Identities = 13/27 (48%), Positives = 18/27 (66%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEE 51
E E+ KKKKKK KK KK K+ +++
Sbjct: 94 PEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 30.6 bits (69), Expect = 0.54
Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 3/34 (8%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
++ E +K KKKKKK KK K+ +K +KD
Sbjct: 90 DDHPPEPTEKPKKKKKKSKKTKK---PKKSSKKD 120
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 35.0 bits (80), Expect = 0.073
Identities = 16/34 (47%), Positives = 26/34 (76%)
Query: 28 EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
E++ KK KKK+KK+K+K+R ++++KE E LL
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLL 229
Score = 33.1 bits (75), Expect = 0.34
Identities = 14/35 (40%), Positives = 25/35 (71%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
K + EKK KK KKK+KK+K++ +++K++E +
Sbjct: 189 KGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVE 223
Score = 32.7 bits (74), Expect = 0.45
Identities = 11/37 (29%), Positives = 25/37 (67%)
Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
R + E++K++KKK++K+K+ ++R+ E ++D
Sbjct: 88 RRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDED 124
Score = 32.3 bits (73), Expect = 0.61
Identities = 10/33 (30%), Positives = 24/33 (72%)
Query: 28 EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
EEE++ +++ +K K++KK+ E+E++ + + L
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHHSL 117
Score = 32.0 bits (72), Expect = 0.71
Identities = 12/32 (37%), Positives = 23/32 (71%)
Query: 26 ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
E + +K KKK+KK+K+K++ + +++E E K
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKS 227
Score = 30.8 bits (69), Expect = 1.4
Identities = 13/34 (38%), Positives = 23/34 (67%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
K E + +KK KK KKK+K+ +E+E++++K
Sbjct: 185 KSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDK 218
Score = 30.8 bits (69), Expect = 1.6
Identities = 12/37 (32%), Positives = 25/37 (67%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
E+++++ KKK+KK+K+K++ K + +E E + L
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLAL 232
Score = 30.4 bits (68), Expect = 2.1
Identities = 14/38 (36%), Positives = 26/38 (68%)
Query: 20 SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
S+ E+ +KK KK KKK+KK++ +E +K+++K+
Sbjct: 184 SKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKE 221
Score = 29.3 bits (65), Expect = 5.5
Identities = 18/47 (38%), Positives = 24/47 (51%)
Query: 11 DQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
D EK R E EK +KK KK K+ E++EKE+E+D
Sbjct: 169 DSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERD 215
Score = 28.5 bits (63), Expect = 7.7
Identities = 17/35 (48%), Positives = 23/35 (65%), Gaps = 3/35 (8%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E+ + +KK KK KKK+KK E+EKE +KD K
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKK---EKEKERDKDKK 219
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 35.1 bits (80), Expect = 0.076
Identities = 17/68 (25%), Positives = 27/68 (39%), Gaps = 14/68 (20%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKK--------------KKKKKKKKKKKKKRYEEEEKE 53
E E G +EE + +K+ + K K K KKR +EE+E
Sbjct: 481 EDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEE 540
Query: 54 EEKDNKLL 61
E+K ++
Sbjct: 541 EKKLKMIM 548
Score = 33.1 bits (75), Expect = 0.34
Identities = 19/50 (38%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+ +++ E E GI K ET E K K K KK+K EEEEK+ +
Sbjct: 501 QAQKELELEAQGI---KYSETSEADKDVNKSKNKKRKVDE-EEEEKKLKM 546
Score = 29.7 bits (66), Expect = 3.4
Identities = 21/66 (31%), Positives = 30/66 (45%), Gaps = 8/66 (12%)
Query: 3 EKGG---EEKEDQEKEVGGISRGKEEETEEEKKKKKKK-----KKKKKKKKRYEEEEKEE 54
E G E + +K+V K + EEE++KK K K+KK KK K+E
Sbjct: 509 EAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKE 568
Query: 55 EKDNKL 60
E+ L
Sbjct: 569 EQAENL 574
Score = 28.5 bits (63), Expect = 9.9
Identities = 17/59 (28%), Positives = 32/59 (54%), Gaps = 8/59 (13%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKK--KKKRYEEEEKEEEKDN 58
+K + ++++++V EEE E++ K K+KK KK +Y +KEE+ +N
Sbjct: 521 ADKDVNKSKNKKRKV------DEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAEN 573
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein
7 (Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal
RNA recognition motif (RRM), also termed RBD (RNA
binding domain) or RNP (ribonucleoprotein domain), and
a C-terminal RRP7 domain.
Length = 118
Score = 33.0 bits (76), Expect = 0.079
Identities = 14/29 (48%), Positives = 18/29 (62%)
Query: 17 GGISRGKEEETEEEKKKKKKKKKKKKKKK 45
G + EE E K K+K+KKKKKKK+
Sbjct: 60 GRKGKTAREEAVEAKAKEKEKKKKKKKEL 88
Score = 32.2 bits (74), Expect = 0.13
Identities = 14/30 (46%), Positives = 18/30 (60%)
Query: 16 VGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
G +GK E + K K+K+KKKKKKK
Sbjct: 57 TRGGRKGKTAREEAVEAKAKEKEKKKKKKK 86
Score = 31.8 bits (73), Expect = 0.20
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 18 GISRGKEEETEEEKKKKKKKKKKKKKK 44
G + +E + K+K+KKKKKKK+ +
Sbjct: 63 GKTAREEAVEAKAKEKEKKKKKKKELE 89
Score = 30.7 bits (70), Expect = 0.56
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 17 GGISRGKEEETEEEKKKKKKKKKKKKK 43
G +R + E + ++K+KKKKKKK+ +
Sbjct: 63 GKTAREEAVEAKAKEKEKKKKKKKELE 89
Score = 29.9 bits (68), Expect = 1.0
Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 15/52 (28%)
Query: 16 VGGISRGKEEETEEEKKKKKKKKKK---------KKKK------KRYEEEEK 52
+ + E+EKKKKKKK+ + KKK+ K++EE++K
Sbjct: 65 TAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEELAELRKKFEEDKK 116
Score = 28.4 bits (64), Expect = 3.3
Identities = 14/29 (48%), Positives = 17/29 (58%)
Query: 17 GGISRGKEEETEEEKKKKKKKKKKKKKKK 45
GG E E K K+K+KKKKKKK+
Sbjct: 59 GGRKGKTAREEAVEAKAKEKEKKKKKKKE 87
Score = 27.2 bits (61), Expect = 8.2
Identities = 17/44 (38%), Positives = 22/44 (50%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
E E ED V R + EE + K K+K+KKKKKK+
Sbjct: 43 EARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKKK 86
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 33.2 bits (76), Expect = 0.087
Identities = 15/40 (37%), Positives = 24/40 (60%), Gaps = 6/40 (15%)
Query: 23 KEEETEEEKKKKKKK------KKKKKKKKRYEEEEKEEEK 56
KE E E ++ K+K KK+ ++ ++ EEEEKE E+
Sbjct: 75 KEREKEGDEFADKEKFVTSAYKKQLEENRKLEEEEKEREE 114
Score = 29.7 bits (67), Expect = 1.4
Identities = 15/48 (31%), Positives = 26/48 (54%), Gaps = 4/48 (8%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEET----EEEKKKKKKKKKKKKKKK 45
EK G+E D+EK V + + EE EEEK++++ +++ K
Sbjct: 77 REKEGDEFADKEKFVTSAYKKQLEENRKLEEEEKEREELEEENDVTKG 124
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 34.7 bits (80), Expect = 0.095
Identities = 14/42 (33%), Positives = 23/42 (54%), Gaps = 2/42 (4%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYE 48
EK + +K + I +E E K KKKK+KKK+ +++
Sbjct: 403 LEKAEGKKAIEEIR--EELIEEGLLKSKKKKRKKKEWFEKFR 442
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 34.0 bits (78), Expect = 0.096
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
+K KK KKK K++ EEEK+ E D+
Sbjct: 60 RKPATTKKSKKKDKEKLTEEEKKPESDD 87
Score = 31.7 bits (72), Expect = 0.47
Identities = 16/33 (48%), Positives = 19/33 (57%)
Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
T K KK KKK K+K EEE+K E D+K
Sbjct: 57 TTPRKPATTKKSKKKDKEKLTEEEKKPESDDDK 89
Score = 29.7 bits (67), Expect = 2.2
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKD 57
KK KKK K+K + ++ E +++K
Sbjct: 64 TTKKSKKKDKEKLTEEEKKPESDDDKT 90
Score = 29.0 bits (65), Expect = 4.3
Identities = 12/36 (33%), Positives = 24/36 (66%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
++ T ++ KKK K+K +++KK +++K EE +N
Sbjct: 60 RKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENEN 95
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 34.4 bits (79), Expect = 0.10
Identities = 15/38 (39%), Positives = 23/38 (60%)
Query: 20 SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
G E + ++ K KKK KK KKK+ K +++ EEE +
Sbjct: 139 ILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESE 176
Score = 34.4 bits (79), Expect = 0.11
Identities = 15/65 (23%), Positives = 24/65 (36%), Gaps = 9/65 (13%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK---------RYEEEEK 52
NE E + D+ SR + E KK K+KK E+E
Sbjct: 229 NEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPSLMGGYFSGSEDED 288
Query: 53 EEEKD 57
++++D
Sbjct: 289 DDDED 293
Score = 33.7 bits (77), Expect = 0.22
Identities = 14/44 (31%), Positives = 27/44 (61%)
Query: 14 KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
K +G ++ K+ + +++ KK KKK+ K+ K EEE + E++
Sbjct: 138 KILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDES 181
Score = 31.7 bits (72), Expect = 0.81
Identities = 13/39 (33%), Positives = 20/39 (51%)
Query: 20 SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
K + ++ KK KKK+ K+ K EE E E+E +
Sbjct: 145 KAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKS 183
Score = 31.0 bits (70), Expect = 1.3
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E K KK K KKK KK K+ E +E ++ D +
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEE 173
Score = 31.0 bits (70), Expect = 1.3
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 28 EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E + KK K KKK KK KK+ +E +++ + +
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEE 174
Score = 30.2 bits (68), Expect = 2.3
Identities = 16/53 (30%), Positives = 29/53 (54%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E + ++ GI ++ ++K KK KKK+ K+ + +EEE E E ++K
Sbjct: 130 ESIMEGLDKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESK 182
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 34.5 bits (79), Expect = 0.10
Identities = 15/53 (28%), Positives = 19/53 (35%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
K EK E E + K E EEK +KKK K K + +
Sbjct: 201 KAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAE 253
Score = 29.5 bits (66), Expect = 3.7
Identities = 11/54 (20%), Positives = 25/54 (46%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E+ + E+ + + + K+ E E + +K K + + K K ++ E E+
Sbjct: 174 EEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEE 227
Score = 28.4 bits (63), Expect = 8.6
Identities = 15/58 (25%), Positives = 38/58 (65%), Gaps = 3/58 (5%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKK--KKKKRYEEEEKEEEKDNK 59
KG ++++ +E++V + ++ E+E+ K+ +K++ K +++K+ EE EK+ + + K
Sbjct: 76 KGEQQRKKKEEQVAEELK-PKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQK 132
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 34.8 bits (81), Expect = 0.11
Identities = 19/51 (37%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKK--KKKKKKKKKKKKRYEEEEKEEEKD 57
KE ++ + GG + K E E +K+ K +KK+KKKKK+ E++E+ + D
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGD 641
Score = 29.8 bits (68), Expect = 3.1
Identities = 15/46 (32%), Positives = 28/46 (60%), Gaps = 5/46 (10%)
Query: 10 EDQEKEVGGISRGKEEETEEEKKKKKKKKKK-KKKKKRYEEEEKEE 54
E+ E+E+ K EE E K+ +K K++ ++KK++ +EEE +
Sbjct: 526 EELEREL----EQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKL 567
Score = 29.4 bits (67), Expect = 4.7
Identities = 22/63 (34%), Positives = 28/63 (44%), Gaps = 15/63 (23%)
Query: 25 EETEEEKKKKKKKKKKKKKK------------KRYEEEEKEEEKDNKLLHNCYLYSYYRV 72
E+ ++ KKKKKKK K K K RYEE E+ +K L + L Y V
Sbjct: 680 EKIQKPKKKKKKKPKTVKPKPRTVSLELDLRGMRYEE---ALERLDKYLDDALLAGYGEV 736
Query: 73 THI 75
I
Sbjct: 737 LII 739
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 33.0 bits (76), Expect = 0.11
Identities = 17/49 (34%), Positives = 22/49 (44%), Gaps = 8/49 (16%)
Query: 3 EKGGEEKED--------QEKEVGGISRGKEEETEEEKKKKKKKKKKKKK 43
+ E ED K G SR K E + K+KKKKKKK+ +
Sbjct: 42 KARSEPDEDGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKKELE 90
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 34.6 bits (79), Expect = 0.11
Identities = 15/55 (27%), Positives = 28/55 (50%), Gaps = 4/55 (7%)
Query: 8 EKEDQEKEVGGISRGK----EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
+ + K+ +R K + EEEK +KK +K++K KK + E++ E +
Sbjct: 105 ALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEV 159
Score = 28.9 bits (64), Expect = 7.2
Identities = 11/53 (20%), Positives = 25/53 (47%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
K E + +E + ++ ++E KK ++ ++K + EE++ EK
Sbjct: 83 KTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEK 135
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 33.9 bits (78), Expect = 0.11
Identities = 16/29 (55%), Positives = 17/29 (58%)
Query: 28 EEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EEE+ KKKKKKKKKK KK E
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEAT 205
Score = 33.1 bits (76), Expect = 0.25
Identities = 14/32 (43%), Positives = 15/32 (46%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EE +KKKKKKKKK KK E
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEATVSS 208
Score = 31.9 bits (73), Expect = 0.53
Identities = 16/37 (43%), Positives = 17/37 (45%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K EE KKKKKKKKKK KK E +
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSA 209
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 34.2 bits (79), Expect = 0.12
Identities = 15/52 (28%), Positives = 27/52 (51%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
GE + + + S ++EE ++E KK K+ ++ + EEE EEE+
Sbjct: 116 GEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEA 167
Score = 32.3 bits (74), Expect = 0.47
Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 2/60 (3%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEET--EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+ E+ +D+ + + S + E + E+E++K + KK K+ EE EEE +
Sbjct: 105 EVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEE 164
Score = 29.6 bits (67), Expect = 3.4
Identities = 15/57 (26%), Positives = 29/57 (50%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
++G + +KE+ EEE +E KK K+ ++ ++ EE +EEE + +
Sbjct: 114 DEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAE 170
Score = 28.9 bits (65), Expect = 6.1
Identities = 10/38 (26%), Positives = 23/38 (60%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
EEK++ K+ S + E +EE+ ++++ + +K+K
Sbjct: 136 EEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173
Score = 28.5 bits (64), Expect = 6.8
Identities = 9/42 (21%), Positives = 23/42 (54%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEE 49
E+E E EE +EE++++ ++++ + +K++ E
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASE 176
>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
Length = 64
Score = 31.0 bits (71), Expect = 0.12
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 26 ETEEEKKKKKKKKKKKKKKKRYEEEE 51
E EK+K+KK +K+++K+ EE
Sbjct: 39 EKPSEKRKRKKAAARKRRRKKLAREE 64
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 34.0 bits (78), Expect = 0.13
Identities = 14/54 (25%), Positives = 29/54 (53%), Gaps = 9/54 (16%)
Query: 7 EEKEDQEKEVGGISRGKEE----ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
++ E+ EK+ R E+ E E+ +K K+ ++ K+ EE++K+ E+
Sbjct: 75 QQAEEAEKQ-----RAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEE 123
Score = 33.3 bits (76), Expect = 0.21
Identities = 14/57 (24%), Positives = 31/57 (54%), Gaps = 7/57 (12%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E+ ++ E+++K+ EE ++ + K K + + +KK EE +K+ E++ K
Sbjct: 108 EQAAKQAEEKQKQA-------EEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAK 157
Score = 32.9 bits (75), Expect = 0.29
Identities = 18/63 (28%), Positives = 24/63 (38%), Gaps = 15/63 (23%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKK----------KKKKKKKKKKKRYEEEEKEEEK 56
E K Q E + E E EKK K K K + KK+ E +K+ E
Sbjct: 123 EAKAKQAAE-----AKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEA 177
Query: 57 DNK 59
+ K
Sbjct: 178 EAK 180
Score = 31.7 bits (72), Expect = 0.66
Identities = 9/55 (16%), Positives = 25/55 (45%), Gaps = 2/55 (3%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE--EKDNK 59
E ++ + E ++ E+ K+ ++ K+ ++K+ + EE + + K
Sbjct: 79 EAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAK 133
Score = 31.7 bits (72), Expect = 0.74
Identities = 8/37 (21%), Positives = 23/37 (62%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+ EE E+++ ++ ++K+ +++ E+ K+ E+ K
Sbjct: 76 QAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112
Score = 31.3 bits (71), Expect = 0.88
Identities = 15/53 (28%), Positives = 24/53 (45%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E E ++K + EEE + + + KKK + KKK E + + E K
Sbjct: 136 AEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188
Score = 31.0 bits (70), Expect = 1.4
Identities = 14/53 (26%), Positives = 26/53 (49%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
E + + + EK+ ++ + EE + K + KKK + KK+ E E K +
Sbjct: 131 EAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA 183
Score = 30.6 bits (69), Expect = 1.8
Identities = 14/57 (24%), Positives = 30/57 (52%), Gaps = 2/57 (3%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
++ EK ++ E ++ EE+ ++ ++ K K+ + K K E E+K +E+ K
Sbjct: 96 QRAAAEKAAKQAE--QAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK 150
Score = 29.8 bits (67), Expect = 2.7
Identities = 11/62 (17%), Positives = 29/62 (46%), Gaps = 3/62 (4%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKK---KKKKKRYEEEEKEEEKD 57
++ E+ Q++ + K + E+ K+ ++K+K + K K+ E + + E +
Sbjct: 80 AEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAE 139
Query: 58 NK 59
+
Sbjct: 140 AE 141
Score = 29.4 bits (66), Expect = 4.3
Identities = 10/52 (19%), Positives = 21/52 (40%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
E + + +K+ + E E + + + K K K ++ K K + K
Sbjct: 155 EAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAA 206
Score = 28.6 bits (64), Expect = 7.7
Identities = 13/52 (25%), Positives = 22/52 (42%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E+E + K + E ++ + + K K + K K K E + K E K
Sbjct: 153 EEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAK 204
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 32.0 bits (73), Expect = 0.14
Identities = 17/56 (30%), Positives = 22/56 (39%), Gaps = 4/56 (7%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
E + ++ E T E+ K KK KK KK K E KD+ L N
Sbjct: 31 EGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKW----EGNTCKDSSFLVN 82
Score = 29.3 bits (66), Expect = 1.2
Identities = 10/37 (27%), Positives = 16/37 (43%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K ++ E++ K K K+ K KK + E K
Sbjct: 17 KWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEK 53
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 34.3 bits (79), Expect = 0.14
Identities = 18/35 (51%), Positives = 24/35 (68%)
Query: 22 GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
GK+ E + ++K KKKKKK+K+ EEE K EEK
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEK 764
Score = 33.9 bits (78), Expect = 0.21
Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 4/51 (7%)
Query: 4 KGGEEKEDQEKEVGGIS----RGKEEETEEEKKKKKKKKKKKKKKKRYEEE 50
GE + D+EK+ S + K E + + KK+K KK+ K E E
Sbjct: 1532 GQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEAE 1582
Score = 32.0 bits (73), Expect = 0.85
Identities = 14/30 (46%), Positives = 23/30 (76%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
+ EE+ KKKKKK+KKK+++ + EE+ + E
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREEKARIE 768
Score = 31.6 bits (72), Expect = 1.1
Identities = 16/35 (45%), Positives = 26/35 (74%), Gaps = 2/35 (5%)
Query: 19 ISRGKEEETEEEKKKKKKKKK--KKKKKKRYEEEE 51
IS EE+T+++KKK+KKK++ K+++K R E E
Sbjct: 737 ISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771
Score = 30.8 bits (70), Expect = 2.0
Identities = 14/34 (41%), Positives = 22/34 (64%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E + ++K KKKKKK+KK+ EE ++EE+
Sbjct: 732 DAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKA 765
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many
hypothetical proteins.
Length = 98
Score = 32.1 bits (73), Expect = 0.14
Identities = 17/63 (26%), Positives = 27/63 (42%), Gaps = 5/63 (7%)
Query: 2 NEKGGEEKEDQEK-----EVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E+ EK D+ + K++ T + + K KK +KK KK+ E E EE
Sbjct: 1 KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEES 60
Query: 57 DNK 59
+
Sbjct: 61 KRR 63
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 34.2 bits (78), Expect = 0.14
Identities = 21/59 (35%), Positives = 41/59 (69%), Gaps = 5/59 (8%)
Query: 3 EKGGEEKEDQEK-EVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
E GE K+D++ + GG S G + E EEE+++++++++++++ EEEE+EEE + L
Sbjct: 840 ENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEE----EEEEEEEENEEPL 894
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 33.6 bits (77), Expect = 0.15
Identities = 16/63 (25%), Positives = 35/63 (55%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
M E +E E++E++ G + EE E+E+ + ++ + K + ++E + +++NK
Sbjct: 13 MEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKK 72
Query: 61 LHN 63
L N
Sbjct: 73 LEN 75
Score = 32.0 bits (73), Expect = 0.42
Identities = 13/59 (22%), Positives = 28/59 (47%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
K +ED KE K +E + E ++ +K++ + ++ E + +E + +N L
Sbjct: 8 AKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKL 66
Score = 30.1 bits (68), Expect = 1.9
Identities = 22/71 (30%), Positives = 32/71 (45%), Gaps = 3/71 (4%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
++ E E+ EKE + E + E+ K + K K++ KK E E E KD +L
Sbjct: 28 KGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKKL--ENELEALKD-RL 84
Query: 61 LHNCYLYSYYR 71
L Y YR
Sbjct: 85 LRTVAEYDNYR 95
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 33.7 bits (78), Expect = 0.16
Identities = 18/68 (26%), Positives = 36/68 (52%), Gaps = 13/68 (19%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRY-------------EEEEKE 53
EE E++ +E + E EE++ + ++K++K+KK + EE+E+E
Sbjct: 89 EEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKERE 148
Query: 54 EEKDNKLL 61
E++ K+L
Sbjct: 149 REEELKIL 156
Score = 31.8 bits (73), Expect = 0.63
Identities = 12/35 (34%), Positives = 28/35 (80%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
+EEE E E++++K++K+++ + R ++EE E+E++
Sbjct: 165 REEEREAERRERKEEKEREVARLRAQQEEAEDERE 199
Score = 31.0 bits (71), Expect = 1.1
Identities = 14/37 (37%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEE--EKEEEKD 57
KE E EEE K + +++K ++++ E E E++EEK+
Sbjct: 145 KEREREEELKILEYQREKAEREEEREAERRERKEEKE 181
Score = 30.3 bits (69), Expect = 2.3
Identities = 10/48 (20%), Positives = 28/48 (58%), Gaps = 13/48 (27%)
Query: 23 KEEETEEEKKKKKKKKKKKK-------------KKKRYEEEEKEEEKD 57
+ EEE++++K+K+++++ K+++ E EE+ +E++
Sbjct: 52 LKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQERE 99
Score = 30.3 bits (69), Expect = 2.3
Identities = 13/52 (25%), Positives = 28/52 (53%), Gaps = 6/52 (11%)
Query: 12 QEKEVGGISRGKEEETEEEKKKKKK-KKKKKKKKKRYEEEEKEEEKDNKLLH 62
+EK+ R K EE EEE++ + ++++ K EE E++ +++ +
Sbjct: 27 EEKK-----RIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGR 73
Score = 29.9 bits (68), Expect = 2.6
Identities = 11/41 (26%), Positives = 26/41 (63%), Gaps = 7/41 (17%)
Query: 25 EETEEEKKKKKKKKKKKKKK-------KRYEEEEKEEEKDN 58
E EEE++ +++++K++K++ ++ E E++ EE D
Sbjct: 163 AEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDE 203
Score = 29.9 bits (68), Expect = 3.0
Identities = 7/32 (21%), Positives = 21/32 (65%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
+ E EEE ++ ++ ++++ +++ EEE++
Sbjct: 304 RAAEREEELEEGERLREEEAERQARIEEERQR 335
Score = 28.3 bits (64), Expect = 7.5
Identities = 8/38 (21%), Positives = 22/38 (57%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
KEE E++++ ++ ++ ++++ + EEE+ L
Sbjct: 300 KEERRAAEREEELEEGERLREEEAERQARIEEERQRLL 337
Score = 28.3 bits (64), Expect = 9.5
Identities = 12/60 (20%), Positives = 32/60 (53%), Gaps = 11/60 (18%)
Query: 7 EEKEDQEKEVGGISRGKE------EETEEEKKKKKKKKKKKKKK-----KRYEEEEKEEE 55
EE+E + K G+ EE E+ ++++ +++ +++++ +R +EE++ E
Sbjct: 57 EEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEA 116
>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
Length = 155
Score = 32.7 bits (75), Expect = 0.16
Identities = 7/36 (19%), Positives = 9/36 (25%)
Query: 22 GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
G E KKKK + + E
Sbjct: 116 GPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAA 151
Score = 32.0 bits (73), Expect = 0.29
Identities = 8/51 (15%), Positives = 12/51 (23%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
E K + + + E KKKK + E
Sbjct: 97 EPKPSKLELFNAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAA 147
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 32.4 bits (74), Expect = 0.18
Identities = 17/32 (53%), Positives = 22/32 (68%), Gaps = 3/32 (9%)
Query: 28 EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E+ K+KK+KKKKKKKKKK + K+ K K
Sbjct: 107 EKAKQKKQKKKKKKKKKK---KTSKKAAKKKK 135
Score = 32.0 bits (73), Expect = 0.25
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKK 45
K+++ +++KKKKK KK KKKK
Sbjct: 113 KQKKKKKKKKKKKTSKKAAKKKK 135
Score = 32.0 bits (73), Expect = 0.26
Identities = 13/33 (39%), Positives = 22/33 (66%)
Query: 14 KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
K+V ++++ +++KKKKKKK KK KK+
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 31.6 bits (72), Expect = 0.35
Identities = 15/28 (53%), Positives = 22/28 (78%)
Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+EK K+KK+KKKKKKKK+ + +K +K
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKK 133
Score = 29.7 bits (67), Expect = 1.3
Identities = 13/36 (36%), Positives = 23/36 (63%), Gaps = 5/36 (13%)
Query: 29 EEKK-----KKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EE+K K+K K+KK+KKK+ ++++K+ K
Sbjct: 96 EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAA 131
Score = 28.9 bits (65), Expect = 2.6
Identities = 15/38 (39%), Positives = 25/38 (65%), Gaps = 5/38 (13%)
Query: 27 TEEEK-----KKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EE+K K+K K+KK+KKKK+ ++++K +K K
Sbjct: 95 VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132
Score = 27.7 bits (62), Expect = 7.2
Identities = 11/32 (34%), Positives = 23/32 (71%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EE + K+ K+K K+KK+KK+ ++++K++
Sbjct: 96 EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTS 127
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 33.7 bits (77), Expect = 0.18
Identities = 16/52 (30%), Positives = 30/52 (57%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
++ E E G+ ++EEE ++K+K+K+K+ KK +E+E EE +
Sbjct: 262 PTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEI 313
Score = 31.0 bits (70), Expect = 1.3
Identities = 18/66 (27%), Positives = 33/66 (50%), Gaps = 11/66 (16%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKK-----------KKKKKKKKKKKKRYEEEE 51
E+ G+ E E + E+E E+E K ++K+K+K+K+ K+ E+E
Sbjct: 246 EESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDE 305
Query: 52 KEEEKD 57
E+E+
Sbjct: 306 DEDEEM 311
Score = 29.0 bits (65), Expect = 5.1
Identities = 13/30 (43%), Positives = 19/30 (63%)
Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
KKK K+KK+KK+ + +EE EEE +
Sbjct: 222 FKKKTKEKKEKKEASESTVKEESEEESGKR 251
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 33.8 bits (78), Expect = 0.19
Identities = 15/37 (40%), Positives = 26/37 (70%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
EE +EE ++ KK + ++ ++ R EEE+ EEE+D+K
Sbjct: 261 EELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKE 297
Score = 30.0 bits (68), Expect = 3.5
Identities = 15/65 (23%), Positives = 28/65 (43%), Gaps = 8/65 (12%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKK--------KKKKKKKKKRYEEEEKE 53
+ + GG ++EE + E+KK KK K K K +++ +EE E
Sbjct: 142 DLGDLASDDRAAHFGGGEDDEEDEEEQPERKKSKKEVMKEVIAKSKFYKAERQKAKEEDE 201
Query: 54 EEKDN 58
+ ++
Sbjct: 202 DLREE 206
Score = 29.6 bits (67), Expect = 4.1
Identities = 13/41 (31%), Positives = 25/41 (60%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRY 47
+ +++E E S +++E EEE++K+KKKKK + +
Sbjct: 350 DLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390
Score = 28.8 bits (65), Expect = 7.0
Identities = 13/48 (27%), Positives = 28/48 (58%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEE 50
E+ E+ + ++E ++E +EE+++++K+KKKKK + E
Sbjct: 343 EEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).
This family consists of several Plasmodium falciparum
SPAM (secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the
alanine heptad-repeat domain. Heptad repeats in which
the a and d position contain hydrophobic residues
generate amphipathic alpha-helices which give rise to
helical bundles or coiled-coil structures in proteins.
SPAM is an example of a P. falciparum antigen in which
a repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 32.5 bits (74), Expect = 0.20
Identities = 15/57 (26%), Positives = 29/57 (50%), Gaps = 11/57 (19%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EK +++E++E++ I ++ E EEE + ++ EEEE EE+ +
Sbjct: 46 EKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEE-----------EEEEDEEDNVDL 91
Score = 32.1 bits (73), Expect = 0.26
Identities = 16/61 (26%), Positives = 26/61 (42%), Gaps = 6/61 (9%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEE--EEKEEEKDNKLLHNC 64
EE ED E E E+E EEE+ ++ K +KK + +++ L+
Sbjct: 63 EEPEDIEDE----EEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQNLISKN 118
Query: 65 Y 65
Y
Sbjct: 119 Y 119
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 33.9 bits (77), Expect = 0.20
Identities = 25/62 (40%), Positives = 32/62 (51%), Gaps = 6/62 (9%)
Query: 1 MNEKGGEEK---EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
M EK E++ ED+E G+E + EE K+ + KKKK KK K E KE E
Sbjct: 206 MVEKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVK---EVTKEYEVQ 262
Query: 58 NK 59
NK
Sbjct: 263 NK 264
>gnl|CDD|221012 pfam11169, DUF2956, Protein of unknown function (DUF2956). This
family of proteins with unknown function appears to be
restricted to Gammaproteobacteria.
Length = 103
Score = 31.5 bits (72), Expect = 0.21
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 6/60 (10%)
Query: 4 KGGEEKEDQEKEVG-GISRGKEEETEEEKKKK----KKKKKKKKKKKRYEEEEKEEEKDN 58
K G+ KE Q K + GI +G E +++K K K +K++ K K+R + EE+
Sbjct: 17 KPGQTKE-QTKLIAQGIEKGIAEYKKQQKAKAREADKARKQQLKAKQRQAANDDEEDTIE 75
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 33.9 bits (78), Expect = 0.21
Identities = 17/49 (34%), Positives = 29/49 (59%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
+E +++ KE +E E EEE K +K K K+ ++ ++EE+EEE
Sbjct: 441 KENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEE 489
Score = 33.5 bits (77), Expect = 0.24
Identities = 17/62 (27%), Positives = 30/62 (48%), Gaps = 1/62 (1%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE-EKDNKLL 61
+ EE + + +E+ G+EE EEE ++ KK ++K E++ E +K K
Sbjct: 384 ARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKEN 443
Query: 62 HN 63
N
Sbjct: 444 KN 445
Score = 33.1 bits (76), Expect = 0.34
Identities = 14/30 (46%), Positives = 22/30 (73%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
KK+KKK+K+K++ + E +KE+ KD KL
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKL 656
Score = 32.7 bits (75), Expect = 0.38
Identities = 17/57 (29%), Positives = 30/57 (52%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+ +K ++ G + KE E+++ KK+ K + K+KK+ EE E EEE +
Sbjct: 412 NEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVE 468
Score = 32.7 bits (75), Expect = 0.41
Identities = 17/55 (30%), Positives = 30/55 (54%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
+ E +EK+ E+E E + +K K K+ +K + EEEE+E +++N L
Sbjct: 443 NKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWL 497
Score = 30.4 bits (69), Expect = 2.1
Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 4/54 (7%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
+ E+KE E+E EEE + EK K K+ +K +K EEEE +EE
Sbjct: 444 KNEFKEKKESDEEE----ELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEE 493
Score = 28.9 bits (65), Expect = 6.9
Identities = 17/38 (44%), Positives = 21/38 (55%), Gaps = 7/38 (18%)
Query: 28 EEEKKKKKKKKKKKK----KKKRYEEEEKEEEKDNKLL 61
EE K K+ KK K KK KK E+ KEE K+ + L
Sbjct: 216 EEAKAKRIKKIKSKKYHRVHKK---EKLKEELKEFEEL 250
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 33.5 bits (76), Expect = 0.21
Identities = 19/59 (32%), Positives = 30/59 (50%), Gaps = 4/59 (6%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
M EE+E E E S E+E + K +K+K KK +K + E E+E++N+
Sbjct: 241 METSESEEEESSESE----SDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQERENE 295
>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein. This family includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 116
Score = 31.8 bits (72), Expect = 0.24
Identities = 11/39 (28%), Positives = 23/39 (58%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
E Q++E + E+ E ++++++K+KKKKK +
Sbjct: 78 FMEIQKEERRSARMARLEDERELAREEEEEKRKKKKKNK 116
Score = 31.0 bits (70), Expect = 0.43
Identities = 12/42 (28%), Positives = 25/42 (59%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
E E + +E+ ++R ++E +++++K+KKKKK K
Sbjct: 75 EATFMEIQKEERRSARMARLEDERELAREEEEEKRKKKKKNK 116
>gnl|CDD|176682 cd07261, Glo_EDI_BRP_like_11, This conserved domain belongs to a
superfamily including the bleomycin resistance protein,
glyoxalase I, and type I ring-cleaving dioxygenases.
This protein family belongs to a conserved domain
superfamily that is found in a variety of structurally
related metalloproteins, including the bleomycin
resistance protein, glyoxalase I, and type I
ring-cleaving dioxygenases. A bound metal ion is
required for protein activities for the members of this
superfamily. A variety of metal ions have been found in
the catalytic centers of these proteins including
Fe(II), Mn(II), Zn(II), Ni(II) and Mg(II). The protein
superfamily contains members with or without domain
swapping.
Length = 114
Score = 31.4 bits (72), Expect = 0.24
Identities = 13/28 (46%), Positives = 15/28 (53%)
Query: 107 STFLGQAPVLLSPGIHSFPFKLGLPLGL 134
S LG+ PV LSP F G+ LGL
Sbjct: 17 SELLGREPVELSPTFALFVLGSGVKLGL 44
Score = 29.1 bits (66), Expect = 1.8
Identities = 11/24 (45%), Positives = 13/24 (54%)
Query: 82 GQAPVLLSPGIHSFPFKLGLPLGL 105
G+ PV LSP F G+ LGL
Sbjct: 21 GREPVELSPTFALFVLGSGVKLGL 44
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 33.2 bits (76), Expect = 0.28
Identities = 13/53 (24%), Positives = 29/53 (54%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+ +E Q+K+ R K+ E E +++KK+ ++ K+ ++++ EE K
Sbjct: 88 QAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAK 140
Score = 30.9 bits (70), Expect = 1.5
Identities = 8/54 (14%), Positives = 27/54 (50%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
+++ +++ + + + + E++K+ ++ K+ K+K+ EE + K
Sbjct: 94 QKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKA 147
Score = 29.0 bits (65), Expect = 5.8
Identities = 5/57 (8%), Positives = 32/57 (56%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K +++ ++ ++ + + ++ E+E+ +++KK+ ++ + ++++ ++
Sbjct: 82 KKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAA 138
Score = 29.0 bits (65), Expect = 5.9
Identities = 12/55 (21%), Positives = 24/55 (43%), Gaps = 6/55 (10%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKK--KKKKKKKKKRYEEEEKEEEKDNK 59
++ E+ K+ + K+++ EE K K K + + KR K+ + K
Sbjct: 118 KQAEEAAKQ----AALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAK 168
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This
is a family of uncharacterized proteins.
Length = 449
Score = 33.0 bits (75), Expect = 0.29
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKD 57
KK+ +K K+ ++ EEE ++E+D
Sbjct: 33 KKENAIRKLGKEAEEEAMEEEDDDEED 59
Score = 31.1 bits (70), Expect = 1.3
Identities = 16/56 (28%), Positives = 26/56 (46%), Gaps = 1/56 (1%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
E E ED + V G KE+ T + KK+ +K K+ E E+E++ +
Sbjct: 4 EFASSELEDDD-WVRGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEE 58
Score = 31.1 bits (70), Expect = 1.3
Identities = 9/28 (32%), Positives = 18/28 (64%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
K+ +K K+ +++ EEE+ +EE D+
Sbjct: 34 KENAIRKLGKEAEEEAMEEEDDDEEDDD 61
Score = 28.8 bits (64), Expect = 5.8
Identities = 6/27 (22%), Positives = 17/27 (62%)
Query: 32 KKKKKKKKKKKKKKRYEEEEKEEEKDN 58
KK+ +K K+ + EE+++++++
Sbjct: 33 KKENAIRKLGKEAEEEAMEEEDDDEED 59
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 31.5 bits (72), Expect = 0.31
Identities = 13/32 (40%), Positives = 21/32 (65%)
Query: 28 EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E ++ K++KKK+ K+KK E+E K + K K
Sbjct: 93 LEHERNKQEKKKRSKEKKEEEKERKRQLKQQK 124
Score = 30.7 bits (70), Expect = 0.54
Identities = 14/40 (35%), Positives = 27/40 (67%)
Query: 13 EKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
E + GIS ++ + E ++ K++KKK+ K+K+ EE+E+
Sbjct: 77 EVKKPGISTKAQQALKLEHERNKQEKKKRSKEKKEEEKER 116
Score = 29.9 bits (68), Expect = 1.1
Identities = 16/39 (41%), Positives = 27/39 (69%), Gaps = 4/39 (10%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
E+ QEK+ R KE++ EE+++K++ K++KKK K R
Sbjct: 96 ERNKQEKK----KRSKEKKEEEKERKRQLKQQKKKAKHR 130
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 33.1 bits (76), Expect = 0.31
Identities = 17/50 (34%), Positives = 29/50 (58%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+ E+E+ GI +E E+E++ ++ KKK K+ +KR EE E+ E
Sbjct: 314 KRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHEL 363
Score = 33.1 bits (76), Expect = 0.39
Identities = 20/58 (34%), Positives = 28/58 (48%), Gaps = 4/58 (6%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKK----KKKKKKKKKRYEEEEKEEEK 56
G E EKE+ + + KEE EE K + KK+ K+ KK EE +K + K
Sbjct: 380 RLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGK 437
Score = 32.3 bits (74), Expect = 0.59
Identities = 16/54 (29%), Positives = 31/54 (57%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EK +E ++ E+ I ++E E K+K ++K ++ ++R EE +KE E+
Sbjct: 224 EKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEE 277
Score = 32.0 bits (73), Expect = 0.76
Identities = 17/57 (29%), Positives = 30/57 (52%), Gaps = 3/57 (5%)
Query: 3 EKGGEEKEDQEKEVGGIS---RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EK EE E ++++ + + ++E E+ ++ KKK + +KK EEE E K
Sbjct: 521 EKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLK 577
Score = 31.6 bits (72), Expect = 0.94
Identities = 14/67 (20%), Positives = 27/67 (40%), Gaps = 10/67 (14%)
Query: 7 EEKEDQEKEVGGIS----------RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+E E E + ++E EE ++K K+ K+ K+K + Y + + E+
Sbjct: 245 KELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEE 304
Query: 57 DNKLLHN 63
L
Sbjct: 305 YLDELRE 311
Score = 31.6 bits (72), Expect = 1.2
Identities = 16/54 (29%), Positives = 33/54 (61%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
E+++ +K + + EE E EK+ ++ +K+ ++ +K+Y EEE EE ++ L
Sbjct: 616 REEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYL 669
Score = 30.8 bits (70), Expect = 1.9
Identities = 16/58 (27%), Positives = 33/58 (56%), Gaps = 3/58 (5%)
Query: 7 EEKEDQEKEVGGIS---RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
+E + EK + + G EE +E ++K+++ ++ KKK K E+ +E E+ ++L
Sbjct: 307 DELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELY 364
Score = 30.4 bits (69), Expect = 2.2
Identities = 18/58 (31%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Query: 7 EEKEDQEKEVGGISRGKEE---ETEEEKKKKKKKKKK-KKKKKRYEEEEKEEEKDNKL 60
EE E + + + G R EE E EE ++ KK+ ++ ++K K +E +++ E+ KL
Sbjct: 241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKL 298
Score = 30.0 bits (68), Expect = 3.4
Identities = 12/50 (24%), Positives = 26/50 (52%), Gaps = 2/50 (4%)
Query: 7 EEKEDQEKEVGGISRGKE--EETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
E + +E+ + + + EE +EE ++ +K+ + + KR EE+ E
Sbjct: 214 SELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRE 263
Score = 29.6 bits (67), Expect = 4.6
Identities = 17/57 (29%), Positives = 32/57 (56%), Gaps = 2/57 (3%)
Query: 5 GGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
G E E+ KE+ + E EK+ K+ ++K++K +K E EK +K+++L+
Sbjct: 442 GRELTEEHRKEL--LEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELI 496
Score = 28.9 bits (65), Expect = 6.4
Identities = 15/54 (27%), Positives = 29/54 (53%), Gaps = 4/54 (7%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E+ EE + + +E+ K +E +E K+K ++ K + + Y +E +E EK
Sbjct: 265 EERIEELKKEIEEL----EEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEK 314
Score = 28.9 bits (65), Expect = 7.2
Identities = 18/54 (33%), Positives = 29/54 (53%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+E E+ +E+ IS E EE +K +K+ K+ ++ K+ EE EKE E
Sbjct: 199 EKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEG 252
Score = 28.5 bits (64), Expect = 8.6
Identities = 13/52 (25%), Positives = 28/52 (53%), Gaps = 1/52 (1%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
++ ++ EK + + + E EE K KK++ ++ KK+ E+ E+E +
Sbjct: 345 KKLKELEKRLEEL-EERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEE 395
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation
of 60S ribosomal subunits.
Length = 132
Score = 31.6 bits (72), Expect = 0.32
Identities = 12/36 (33%), Positives = 22/36 (61%)
Query: 19 ISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
+SR K+ ++K K +K +KK K++ R E+ + E
Sbjct: 24 LSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALE 59
Score = 30.4 bits (69), Expect = 0.82
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
+ KKK K +K +KK KR EK + +
Sbjct: 32 KAKKKLKSEKLEKKAKRQLRAEKRQALE 59
Score = 27.7 bits (62), Expect = 6.8
Identities = 6/33 (18%), Positives = 19/33 (57%)
Query: 14 KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
K++ + + E E+K K++ + +K++ ++
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEK 60
>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
[Translation, ribosomal structure and biogenesis].
Length = 569
Score = 33.1 bits (75), Expect = 0.33
Identities = 14/54 (25%), Positives = 29/54 (53%)
Query: 5 GGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
G EK+ + + G++ KEE T E+K + ++ K+K+ K K ++++
Sbjct: 476 GKAEKDGEIRLKNGVAMSKEELTREDKNRLRRALKRKRSKANLPNVNKRSKRND 529
Score = 29.7 bits (66), Expect = 4.1
Identities = 10/32 (31%), Positives = 14/32 (43%)
Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
E KK K K+ + +EEE + D L
Sbjct: 220 EPVKKHSDVKDPKEDEELDEEEHDSAMDKVKL 251
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 31.6 bits (72), Expect = 0.34
Identities = 17/57 (29%), Positives = 30/57 (52%), Gaps = 8/57 (14%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKK-KKKRYEEEEKEEEKD 57
+K +KE+ EKE KE+E K + +K++ + +KK+ E + +EEK
Sbjct: 74 RDKLKAKKEEAEKE-------KEKEERFMKALAEAEKERAELEKKKAEAKLMKEEKK 123
Score = 30.8 bits (70), Expect = 0.61
Identities = 14/56 (25%), Positives = 30/56 (53%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
N G ++ ++ S + + E KK K+K ++ K K K+ E E+++E+++
Sbjct: 37 NSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEE 92
Score = 30.0 bits (68), Expect = 1.3
Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 5/41 (12%)
Query: 25 EETEEEK-----KKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
E T E K KK K+K ++ K K + EE EKE+EK+ +
Sbjct: 54 ESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERF 94
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 31.9 bits (73), Expect = 0.34
Identities = 15/53 (28%), Positives = 30/53 (56%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
+ +D+E+E+ + +E+E +E KK KK + KK +++ E +K +L
Sbjct: 99 KNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQL 151
>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein.
Length = 122
Score = 31.1 bits (71), Expect = 0.35
Identities = 14/57 (24%), Positives = 31/57 (54%), Gaps = 5/57 (8%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKK-----KKKKKKRYEEEEKEEEKDN 58
E Q++E+ + E+ E+ ++ +++KK +++KKK +EE + E+K
Sbjct: 59 EAIAQQQQELEQAEKQVEQAREQWQEANQERKKLEKLLEREKKKEQKEENRREQKLL 115
>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
domain [Chromatin structure and dynamics].
Length = 211
Score = 32.1 bits (73), Expect = 0.36
Identities = 15/53 (28%), Positives = 20/53 (37%), Gaps = 1/53 (1%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
+ G + E I K +E KKKK K KK K+ Y+ E
Sbjct: 160 PKVEGPSPDKALVEETKI-ISKAWSELDESKKKKYIDKYKKLKEEYDSFYPEA 211
Score = 27.9 bits (62), Expect = 9.1
Identities = 15/55 (27%), Positives = 21/55 (38%), Gaps = 5/55 (9%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+G + +E IS+ E E KKKK K KK EE + +
Sbjct: 161 KVEGPSPDKALVEETKIISKAWSELDE-----SKKKKYIDKYKKLKEEYDSFYPE 210
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized.
This family of proteins is found in bacteria. Proteins
in this family are approximately 90 amino acids in
length. There are two completely conserved L residues
that may be functionally important.
Length = 70
Score = 30.0 bits (68), Expect = 0.37
Identities = 13/36 (36%), Positives = 22/36 (61%), Gaps = 7/36 (19%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
KE + EEEK++++++ +K EE KE EK+
Sbjct: 27 KELKAEEEKREEEEEARK-------REERKEREKNK 55
Score = 29.2 bits (66), Expect = 0.63
Identities = 14/36 (38%), Positives = 27/36 (75%), Gaps = 1/36 (2%)
Query: 23 KEEETEEEKKKKKKKKKKKKKK-KRYEEEEKEEEKD 57
+EE+ EEE++ +K++++K+++K K +EE E E D
Sbjct: 32 EEEKREEEEEARKREERKEREKNKSFEELLNESELD 67
Score = 29.2 bits (66), Expect = 0.72
Identities = 9/31 (29%), Positives = 23/31 (74%)
Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
+E K +++K++++++ +KR E +E+E+ K
Sbjct: 26 KKELKAEEEKREEEEEARKREERKEREKNKS 56
Score = 29.2 bits (66), Expect = 0.75
Identities = 8/30 (26%), Positives = 23/30 (76%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
K+E EE+K++++++ +K+++++ E+ K
Sbjct: 26 KKELKAEEEKREEEEEARKREERKEREKNK 55
Score = 27.3 bits (61), Expect = 2.9
Identities = 7/31 (22%), Positives = 23/31 (74%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
+ EE+++++++ +K++++K+R + + EE
Sbjct: 30 KAEEEKREEEEEARKREERKEREKNKSFEEL 60
>gnl|CDD|220245 pfam09444, MRC1, MRC1-like domain. This putative domain is found
to be the most conserved region in mediator of
replication checkpoint protein 1.
Length = 145
Score = 31.6 bits (72), Expect = 0.39
Identities = 19/69 (27%), Positives = 36/69 (52%), Gaps = 15/69 (21%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEE-----------KKKKKKKKKKKKKKKRYEEEEKEEE 55
EE ED+ +GG +E+++++ + K+K + + KR EEE +++E
Sbjct: 15 EESEDEWAGLGGAD---DEDSDDDDDSELEEMIDDEYSKEKAENDELAAKRAEEEREDDE 71
Query: 56 KD-NKLLHN 63
K+ KLL +
Sbjct: 72 KEVEKLLDD 80
Score = 28.9 bits (65), Expect = 2.8
Identities = 16/68 (23%), Positives = 32/68 (47%), Gaps = 19/68 (27%)
Query: 7 EEKEDQEKEV---------GGISRGK----------EEETEEEKKKKKKKKKKKKKKKRY 47
EE+ED EKEV GG+ R + + E E ++++ K+++ ++R
Sbjct: 65 EEREDDEKEVEKLLDDITTGGLRRRRGKNGLDLELSDSEDELLQRRRLKRRELALMRQRL 124
Query: 48 EEEEKEEE 55
E+ ++
Sbjct: 125 LEDGDLDK 132
>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
region. Members of this family have very varied
localisations within the eukaryotic cell. pinin is known
to localise at the desmosomes and is implicated in
anchoring intermediate filaments to the desmosomal
plaque. SDK2/3 is a dynamically localised nuclear
protein thought to be involved in modulation of
alternative pre-mRNA splicing. memA is a tumour marker
preferentially expressed in human melanoma cell lines. A
common feature of the members of this family is that
they may all participate in regulating protein-protein
interactions.
Length = 131
Score = 31.3 bits (71), Expect = 0.40
Identities = 17/77 (22%), Positives = 39/77 (50%), Gaps = 9/77 (11%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKK-------KKKKKKKKKRYEEEEKEEEK 56
+ +EK I + EE+ ++E+++ +K ++++K+ + R E++ E+EK
Sbjct: 25 QEESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKMEDEK 84
Query: 57 DNKL--LHNCYLYSYYR 71
+ HN L ++ R
Sbjct: 85 LQETWHEHNLALANFIR 101
Score = 28.6 bits (64), Expect = 3.0
Identities = 15/54 (27%), Positives = 33/54 (61%), Gaps = 1/54 (1%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE-EKDNK 59
EEK+ + G + ++ ++EE + K+K++ + +++ EE+EK+E E+ K
Sbjct: 4 EEKKRNRRMFGLLLGTLQKFSQEESRLTSKEKRRAEIEQKLEEQEKQEREELRK 57
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 32.8 bits (75), Expect = 0.40
Identities = 17/37 (45%), Positives = 24/37 (64%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K+E+ E KKK +KKKKKKK+K+ + E E + K
Sbjct: 58 KKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFK 94
Score = 32.0 bits (73), Expect = 0.69
Identities = 17/53 (32%), Positives = 26/53 (49%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EE + + + ++K +KKKKKKK+KK+ + E E K K K
Sbjct: 45 EENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPK 97
Score = 30.8 bits (70), Expect = 1.6
Identities = 17/69 (24%), Positives = 27/69 (39%), Gaps = 9/69 (13%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK---------RYEEEEK 52
K EK+ ++K+ + + E K KK KK KKK K + + +
Sbjct: 65 ESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAE 124
Query: 53 EEEKDNKLL 61
EK N +
Sbjct: 125 LAEKSNVYI 133
Score = 29.7 bits (67), Expect = 3.4
Identities = 13/44 (29%), Positives = 21/44 (47%), Gaps = 7/44 (15%)
Query: 23 KEEETEEEKKKKKKKKKKK-------KKKKRYEEEEKEEEKDNK 59
E E + KK KK KKK ++++K+E+K+ K
Sbjct: 41 TFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPK 84
Score = 29.3 bits (66), Expect = 4.8
Identities = 16/61 (26%), Positives = 30/61 (49%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYL 66
++K+ ++++ S G+ + + KK KK KKK K K E+ + K +L +
Sbjct: 72 KKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNV 131
Query: 67 Y 67
Y
Sbjct: 132 Y 132
Score = 28.5 bits (64), Expect = 7.4
Identities = 15/35 (42%), Positives = 25/35 (71%)
Query: 26 ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
+ +E+K + KKK +KKKKK+ E++E + E + KL
Sbjct: 57 DKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91
>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved
region at the N-terminal end of a family of proteins
referred to as CIRs (CBF1-interacting co-repressors).
CBF1 (centromere-binding factor 1) acts as a
transcription factor that causes repression by binding
specifically to GTGGGAA motifs in responsive promoters,
and it requires CIR as a co-repressor. CIR binds to
histone deacetylase and to SAP30 and serves as a linker
between CBF1 and the histone deacetylase complex.
Length = 37
Score = 29.1 bits (66), Expect = 0.40
Identities = 9/23 (39%), Positives = 16/23 (69%)
Query: 28 EEEKKKKKKKKKKKKKKKRYEEE 50
E E+K +++KK ++ +K EEE
Sbjct: 15 EAEQKALEEQKKIEELRKEIEEE 37
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 33.1 bits (75), Expect = 0.41
Identities = 16/60 (26%), Positives = 26/60 (43%), Gaps = 7/60 (11%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEE-------EKEEEKDNKLL 61
KE Q + I + E E + K + K+ + K + + E +K+ EKDN L
Sbjct: 229 KEAQLESSREIVKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSEL 288
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 31.1 bits (71), Expect = 0.42
Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 7/47 (14%)
Query: 16 VGGISRGKEEETEEEKKKKKKKKKKKKK-----KKRYEEEEKEEEKD 57
G GK E E+++K+KKKKKKK+ + + R E++KEE+ D
Sbjct: 65 AAGEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLR--EKKKEEQAD 109
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 32.8 bits (75), Expect = 0.44
Identities = 10/48 (20%), Positives = 23/48 (47%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
E+ E ++E+ EE + + ++ K++ ++K EE +E
Sbjct: 330 EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEEL 377
Score = 32.8 bits (75), Expect = 0.47
Identities = 13/53 (24%), Positives = 24/53 (45%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+EK + KE EE E+ + ++ K++ ++K EE EE +
Sbjct: 329 KEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEAL 381
Score = 29.3 bits (66), Expect = 5.4
Identities = 16/55 (29%), Positives = 31/55 (56%), Gaps = 3/55 (5%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
EE E+ E E IS +E E E + ++ +++ ++ K++ E ++E E+ LL
Sbjct: 295 EEIEELEGE---ISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLL 346
Score = 28.9 bits (65), Expect = 6.2
Identities = 18/61 (29%), Positives = 33/61 (54%), Gaps = 5/61 (8%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEE-----EKEEEKDNKLL 61
EE E+ EKE+ + EE EE ++ +++ + K++ + E E E+ EE +N+L
Sbjct: 260 EELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELE 319
Query: 62 H 62
Sbjct: 320 E 320
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 32.0 bits (73), Expect = 0.46
Identities = 18/54 (33%), Positives = 31/54 (57%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
E D+ + IS+ ++ +EK+KKK KKK+K K +E ++ E+K K+
Sbjct: 147 PELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKV 200
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region
covers both the Brf homology II and III regions. This
region is involved in binding TATA binding protein.
Length = 95
Score = 30.3 bits (69), Expect = 0.49
Identities = 16/41 (39%), Positives = 23/41 (56%), Gaps = 7/41 (17%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEE 49
+E +EKE +++ +E KKKKK+K KKKR E
Sbjct: 30 EEQEEKE-------LKQKADEGNNSGKKKKKRKAKKKRDEA 63
Score = 28.8 bits (65), Expect = 1.6
Identities = 10/33 (30%), Positives = 17/33 (51%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
KE + + ++ KKKKK+K K+ +E
Sbjct: 35 KELKQKADEGNNSGKKKKKRKAKKKRDEAGPAS 67
Score = 27.2 bits (61), Expect = 5.0
Identities = 11/35 (31%), Positives = 21/35 (60%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
++EE E ++K + KKKKKR +++++E
Sbjct: 31 EQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGP 65
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 30.4 bits (69), Expect = 0.49
Identities = 13/47 (27%), Positives = 32/47 (68%), Gaps = 1/47 (2%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
E++ ++ E I + +E+E ++EK+ +++++ + K++R +EEKE
Sbjct: 31 EKRMEKRLEQQAI-KAREKELKDEKEAERQRRIQAIKERRAAKEEKE 76
Score = 27.8 bits (62), Expect = 5.1
Identities = 12/40 (30%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKRYE-EEEKEEEKDNK 59
R K + T EK+ +K+ +++ K + E ++EKE E+ +
Sbjct: 22 RPKSKLTSWEKRMEKRLEQQAIKAREKELKDEKEAERQRR 61
>gnl|CDD|219528 pfam07703, A2M_N_2, Alpha-2-macroglobulin family N-terminal region.
This family includes a region of the
alpha-2-macroglobulin family.
Length = 136
Score = 31.0 bits (71), Expect = 0.50
Identities = 16/57 (28%), Positives = 23/57 (40%), Gaps = 10/57 (17%)
Query: 287 VSLDRGGYVPGETIIINATVYNRSKITVKSTKAALTETIQYLARNKVLESETRELAS 343
+SLD+ Y GET K+TV S T +R K+L S ++
Sbjct: 3 LSLDKKSYKVGETA----------KVTVTSPFPIGTALYTVESRGKILSSGVVDVKG 49
>gnl|CDD|236335 PRK08724, fliD, flagellar capping protein; Validated.
Length = 673
Score = 32.5 bits (74), Expect = 0.51
Identities = 17/62 (27%), Positives = 31/62 (50%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYL 66
E+K QEK + K E T E+ K+ + K + ++++R E+ +K + + L
Sbjct: 369 EQKLAQEKAQLDAAVEKGELTPEQAKQIARAKLEPEERERLEKIDKAQAALKQAQSAFDL 428
Query: 67 YS 68
YS
Sbjct: 429 YS 430
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein
7 homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 30.7 bits (70), Expect = 0.52
Identities = 19/49 (38%), Positives = 24/49 (48%), Gaps = 6/49 (12%)
Query: 3 EKGGEEKEDQEKEVGGI------SRGKEEETEEEKKKKKKKKKKKKKKK 45
EK +EKE + E G + R K E K +K+KKKKKKK
Sbjct: 37 EKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKK 85
Score = 29.6 bits (67), Expect = 1.5
Identities = 16/66 (24%), Positives = 33/66 (50%), Gaps = 12/66 (18%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEE-------KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+++E++EKE ++E +E KK ++ K +K+ E+++++K K
Sbjct: 31 DKEEEEEKE-----EKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKK 85
Query: 60 LLHNCY 65
L N Y
Sbjct: 86 ELKNFY 91
Score = 27.2 bits (61), Expect = 9.9
Identities = 13/36 (36%), Positives = 22/36 (61%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
KE + +K+KKKKKKK+ K Y + +E +++
Sbjct: 67 KESVAAKAAEKEKKKKKKKELKNFYRFQIRESKRNE 102
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 31.9 bits (73), Expect = 0.53
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 19 ISRGKEEETEEEKKKKKKKKKKKKKKK 45
SR +++ +E+ KKKK K+ K K
Sbjct: 245 ESRAEKKRKSKEEIKKKKPKESKGVKA 271
Score = 30.4 bits (69), Expect = 1.6
Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 5/39 (12%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
E+ E E K+ +++EE KKKK K+ K K
Sbjct: 239 EDVEAAESRAE-----KKRKSKEEIKKKKPKESKGVKAL 272
>gnl|CDD|238686 cd01390, HMGB-UBF_HMG-box, HMGB-UBF_HMG-box, class II and III
members of the HMG-box superfamily of DNA-binding
proteins. These proteins bind the minor groove of DNA
in a non-sequence specific fashion and contain two or
more tandem HMG boxes. Class II members include
non-histone chromosomal proteins, HMG1 and HMG2, which
bind to bent or distorted DNA such as four-way DNA
junctions, synthetic DNA cruciforms, kinked
cisplatin-modified DNA, DNA bulges, cross-overs in
supercoiled DNA, and can cause looping of linear DNA.
Class III members include nucleolar and mitochondrial
transcription factors, UBF and mtTF1, which bind
four-way DNA junctions.
Length = 66
Score = 29.5 bits (67), Expect = 0.54
Identities = 11/21 (52%), Positives = 18/21 (85%)
Query: 30 EKKKKKKKKKKKKKKKRYEEE 50
E++KKK ++K +K K+RYE+E
Sbjct: 44 EEEKKKYEEKAEKDKERYEKE 64
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 32.0 bits (73), Expect = 0.56
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E E E +++++ KKKK+ K K E +K+++KD
Sbjct: 73 EGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPT 108
Score = 31.6 bits (72), Expect = 0.57
Identities = 18/58 (31%), Positives = 25/58 (43%), Gaps = 14/58 (24%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKK-------KKKKKRYEEEEKEEEKD 57
E +++E E KE + EE KKKK+ K K KKKKK + +
Sbjct: 67 ESDDEEEGE-------KELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAA 117
Score = 29.3 bits (66), Expect = 4.0
Identities = 14/36 (38%), Positives = 23/36 (63%), Gaps = 3/36 (8%)
Query: 23 KEEETEEEKKK---KKKKKKKKKKKKRYEEEEKEEE 55
+ +E E +KK K +K+K+KKK+K +EE+ E
Sbjct: 156 RLKEREIRRKKIQAKARKRKEKKKEKELTQEERLAE 191
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 31.5 bits (72), Expect = 0.57
Identities = 15/28 (53%), Positives = 20/28 (71%)
Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EE+K+KKK K++KK EE+EK EE
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 30.7 bits (70), Expect = 1.3
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEK 52
EE +E+KK K++KK K+++ + EE
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 31.3 bits (71), Expect = 0.57
Identities = 10/62 (16%), Positives = 25/62 (40%), Gaps = 3/62 (4%)
Query: 1 MNEKGGEEKEDQEKEVGGISR---GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
E E + D + G ++ +E E++ +K+K + K++K ++ +
Sbjct: 74 EAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDPRGGTQDV 133
Query: 58 NK 59
Sbjct: 134 VD 135
>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090). This
family consists of several bacterial proteins of unknown
function and is known as YqjC in E. coli.
Length = 115
Score = 30.3 bits (69), Expect = 0.62
Identities = 14/49 (28%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
+ E++E+EV +E+E +E + K K +KK++K E + + +E
Sbjct: 70 AKVEEKEQEV----AEREQELKEAQAKGDADKIEKKQRKLAEAQAELQE 114
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 32.0 bits (73), Expect = 0.64
Identities = 13/36 (36%), Positives = 19/36 (52%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E +E + K+ KKKKKKKKK+ ++ E
Sbjct: 44 ISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAY 79
Score = 31.6 bits (72), Expect = 0.81
Identities = 21/65 (32%), Positives = 30/65 (46%), Gaps = 3/65 (4%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKK---KKKKKKKKKKKKKRYEEEEKEEEKDN 58
+K + K+D + + ++ E E E K KKKKKKKKKKKKK E
Sbjct: 23 KKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLP 82
Query: 59 KLLHN 63
+ +
Sbjct: 83 VVWSS 87
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 32.4 bits (73), Expect = 0.64
Identities = 11/54 (20%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Query: 5 GGEEKEDQE--KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E ++ K+ R K++E +E + ++ K+++ KK E E++ +
Sbjct: 1007 VPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKT 1060
Score = 30.1 bits (67), Expect = 2.8
Identities = 12/62 (19%), Positives = 28/62 (45%), Gaps = 5/62 (8%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEE-----EKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E +++ +E G S E E E ++++ +KK++ + +E EE+ + +
Sbjct: 607 LEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREENARKKEELRGNFELEERGDPE 666
Query: 57 DN 58
Sbjct: 667 KK 668
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 31.7 bits (73), Expect = 0.66
Identities = 9/33 (27%), Positives = 18/33 (54%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+ ++K+ K K +KKK+K+ + K + K
Sbjct: 57 KAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
Score = 30.6 bits (70), Expect = 1.5
Identities = 11/40 (27%), Positives = 22/40 (55%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
E+ ++ ++E+ K+E K +KKK+K++ K K
Sbjct: 46 NEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAK 85
Score = 29.0 bits (66), Expect = 5.2
Identities = 8/23 (34%), Positives = 13/23 (56%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKK 45
K ++KK+K++ K K K K
Sbjct: 67 KAWHKAQKKKEKQEAKAAKAKSK 89
Score = 28.3 bits (64), Expect = 9.0
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 33 KKKKKKKKKKKKKRYEEEEKEEEKDNK 59
KK+ K K +KK+ ++E K + +K
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSK 89
Score = 28.3 bits (64), Expect = 9.9
Identities = 8/29 (27%), Positives = 16/29 (55%), Gaps = 3/29 (10%)
Query: 27 TEEEKK---KKKKKKKKKKKKKRYEEEEK 52
++E K K +KKK+K++ K + +
Sbjct: 62 DKKELKAWHKAQKKKEKQEAKAAKAKSKP 90
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 31.6 bits (72), Expect = 0.67
Identities = 12/35 (34%), Positives = 22/35 (62%)
Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
E+EKK++ K +KK+ ++Y EE ++ +LL
Sbjct: 164 KEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELL 198
Score = 28.9 bits (65), Expect = 4.3
Identities = 13/37 (35%), Positives = 20/37 (54%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+ E+ EE +KK K KK+ YE+E++E D
Sbjct: 25 EYEKLEELRKKLKAKKEDDSDDTDYEDEDEELYLDES 61
Score = 28.6 bits (64), Expect = 6.0
Identities = 8/32 (25%), Positives = 20/32 (62%)
Query: 26 ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
E E+ ++ +KK K KK+ + + ++E+++
Sbjct: 24 EEYEKLEELRKKLKAKKEDDSDDTDYEDEDEE 55
Score = 28.2 bits (63), Expect = 9.2
Identities = 12/45 (26%), Positives = 22/45 (48%)
Query: 26 ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLYSYY 70
+ +E+K++ K +KK+ +K EE ++ LL Y Y
Sbjct: 164 KEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYVEY 208
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 32.2 bits (73), Expect = 0.67
Identities = 15/38 (39%), Positives = 26/38 (68%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
K+ TEEE ++KKKK++K K+K+ + + ++E KL
Sbjct: 10 KKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKL 47
Score = 31.4 bits (71), Expect = 1.1
Identities = 21/78 (26%), Positives = 31/78 (39%), Gaps = 23/78 (29%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKK---------------------KK 39
E E+K E+E+ + K+EE +EK+ KK K K
Sbjct: 3 RTESEAEKKILTEEELE--RKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPK 60
Query: 40 KKKKKKRYEEEEKEEEKD 57
K +KK R + E E +D
Sbjct: 61 KSEKKSRKRDVEDENPED 78
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 31.6 bits (72), Expect = 0.69
Identities = 8/34 (23%), Positives = 20/34 (58%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
++E+ + ++ KKK++ + +++E EE K
Sbjct: 75 EDEKELSASSLEAEQAKKKEEAEAKKKKEMEELK 108
Score = 31.6 bits (72), Expect = 0.75
Identities = 11/39 (28%), Positives = 23/39 (58%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
+ E ++ KKK++ + KKKK EE + ++K ++ +
Sbjct: 82 ASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKE 120
Score = 28.2 bits (63), Expect = 9.3
Identities = 9/33 (27%), Positives = 19/33 (57%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
K E+ +E + ++ KKK++ +++KE E
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEME 105
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 30.3 bits (69), Expect = 0.73
Identities = 11/73 (15%), Positives = 38/73 (52%), Gaps = 11/73 (15%)
Query: 1 MNEKGGEEKED---QEKEVGGISRGK--EEETEEEKK------KKKKKKKKKKKKKRYEE 49
++ E+++ Q+ + G+ ++E EE + ++ +++KK+++++ +E
Sbjct: 29 VDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQKEKE 88
Query: 50 EEKEEEKDNKLLH 62
E ++ ++N +
Sbjct: 89 EIEKYREENPKIQ 101
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 31.3 bits (71), Expect = 0.76
Identities = 10/26 (38%), Positives = 20/26 (76%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEE 49
+++ +EEKKK K+ K+++K++KR
Sbjct: 185 QQKEKEEKKKVKEAKRREKEEKRMAA 210
Score = 28.2 bits (63), Expect = 6.6
Identities = 13/53 (24%), Positives = 24/53 (45%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EE +Q+KE + +E E + K + + +KR E++ K+ K
Sbjct: 106 EENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAAKERK 158
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 31.3 bits (71), Expect = 0.77
Identities = 11/54 (20%), Positives = 25/54 (46%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
EK K+ G + ++++ +K K K + +R ++ +E+K+ K
Sbjct: 18 SEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKK 71
Score = 30.1 bits (68), Expect = 1.9
Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 5/40 (12%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKK 41
EK + Q E R K+ ++E+K++KKK KKKK
Sbjct: 43 IEKFKLLIKAQMAE-----RVKKLHSQEKKEEKKKPKKKK 77
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 31.9 bits (73), Expect = 0.81
Identities = 17/69 (24%), Positives = 31/69 (44%), Gaps = 3/69 (4%)
Query: 8 EKEDQEKEVGGISRGKEE---ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNC 64
E E Q +E+ + EE + E E +KK+++ +K K K + +K +K + L
Sbjct: 92 EIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEA 151
Query: 65 YLYSYYRVT 73
Y+
Sbjct: 152 LKGLNYKKN 160
Score = 30.0 bits (68), Expect = 2.7
Identities = 14/42 (33%), Positives = 24/42 (57%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNC 64
E E + E+ KK+ KK ++K ++ E EK+EE+ K +
Sbjct: 91 IEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKF 132
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 29.3 bits (66), Expect = 0.81
Identities = 12/37 (32%), Positives = 18/37 (48%)
Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
TE + KKKKKKKKKK + + ++ +
Sbjct: 18 TEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEG 54
Score = 28.5 bits (64), Expect = 1.9
Identities = 12/37 (32%), Positives = 19/37 (51%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K E +KKKKKKKKK + + ++ E ++
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSE 53
Score = 26.6 bits (59), Expect = 8.3
Identities = 8/30 (26%), Positives = 15/30 (50%)
Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+KKKKKKK + + ++ E+ +
Sbjct: 26 KKKKKKKKAEDTAATAKAKKATAEDVSEGA 55
>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein. The PcfK-like protein family
includes the Enterococcus faecalis PcfK protein, which
is functionally uncharacterized. This family of proteins
is found in bacteria and viruses. Proteins in this
family are typically between 137 and 257 amino acids in
length. There are two completely conserved residues (D
and L) that may be functionally important.
Length = 136
Score = 30.4 bits (69), Expect = 0.85
Identities = 12/46 (26%), Positives = 22/46 (47%), Gaps = 7/46 (15%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEE 51
EEK + KE + ++E+ +K +K+ KK KK + +
Sbjct: 92 EEEKAEARKE-------ALKAYQQEELRKIQKRSKKSKKAEPVQGQ 130
>gnl|CDD|197700 smart00398, HMG, high mobility group.
Length = 70
Score = 28.8 bits (65), Expect = 0.85
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
E++K ++K KK K+RYEEE E +
Sbjct: 42 LLSEEEKAPYEEKAKKDKERYEEEMPEYK 70
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
This domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 29.4 bits (67), Expect = 0.88
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKD 57
KKK+KK+ K + K+ ++ E+ K
Sbjct: 62 KKKEKKEVKAESKRYNAKKLTLEQRKA 88
Score = 28.7 bits (65), Expect = 1.6
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 30 EKKKKKKKKKKKKKKKRYEEEEKE 53
EKK+ K + K+ KK E+ K
Sbjct: 65 EKKEVKAESKRYNAKKLTLEQRKA 88
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 31.5 bits (71), Expect = 0.94
Identities = 22/62 (35%), Positives = 32/62 (51%), Gaps = 6/62 (9%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKK------KKKKKKRYEEEEKEE 54
M E G E E E+E SR + EE EE + K ++K ++ +K+ E E +EE
Sbjct: 100 MQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEE 159
Query: 55 EK 56
EK
Sbjct: 160 EK 161
Score = 31.2 bits (70), Expect = 1.1
Identities = 18/54 (33%), Positives = 30/54 (55%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
E E++EKE R + EETE K ++K + ++ + EE+E E E++ K
Sbjct: 109 ETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKP 162
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 31.6 bits (71), Expect = 0.95
Identities = 16/64 (25%), Positives = 33/64 (51%), Gaps = 6/64 (9%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
E+G +++E+++++V I +E E EE+ + +K +E+E + E D L
Sbjct: 102 ERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKD------DEKESDAEGDENELA 155
Query: 63 NCYL 66
Y+
Sbjct: 156 GEYI 159
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 31.0 bits (71), Expect = 0.95
Identities = 13/28 (46%), Positives = 19/28 (67%)
Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E +K+KKK K++KK EE++K EE
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is
functionally equivalent to L7/L12 in bacteria and the
P1 and P2 proteins in eukaryotes. L12p is homologous to
P1 and P2 but is not homologous to bacterial L7/L12. It
is located in the L12 stalk, with proteins L10, L11,
and 23S rRNA. L12p is the only protein in the ribosome
to occur as multimers, always appearing as sets of
dimers. Recent data indicate that most archaeal species
contain six copies of L12p (three homodimers), while
eukaryotes have four copies (two heterodimers), and
bacteria may have four or six copies (two or three
homodimers), depending on the species. The organization
of proteins within the stalk has been characterized
primarily in bacteria, where L7/L12 forms either two or
three homodimers and each homodimer binds to the
extended C-terminal helix of L10. L7/L12 is attached to
the ribosome through L10 and is the only ribosomal
protein that does not directly interact with rRNA.
Archaeal L12p is believed to function in a similar
fashion. However, hybrid ribosomes containing the large
subunit from E. coli with an archaeal stalk are able to
bind archaeal and eukaryotic elongation factors but not
bacterial elongation factors. In several mesophilic and
thermophilic archaeal species, the binding of 23S rRNA
to protein L11 and to the L10/L12p pentameric complex
was found to be temperature-dependent and cooperative.
Length = 106
Score = 29.8 bits (67), Expect = 0.96
Identities = 10/26 (38%), Positives = 21/26 (80%)
Query: 32 KKKKKKKKKKKKKKRYEEEEKEEEKD 57
++K ++K+++KK+ EE+E+EEE+
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEEA 98
Score = 29.4 bits (66), Expect = 1.2
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEE 55
++K ++K+++KKK E+EE+EEE
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEE 97
Score = 29.4 bits (66), Expect = 1.2
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 28 EEEKKKKKKKKKKKKKKKRYEEEE 51
EEK ++K+++KKK+++K EEEE
Sbjct: 74 AEEKAEEKEEEKKKEEEKEEEEEE 97
Score = 29.0 bits (65), Expect = 1.6
Identities = 10/27 (37%), Positives = 20/27 (74%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEE 50
EE+ ++K+++KKK+++K+ EEE
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEE 97
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 31.3 bits (71), Expect = 0.96
Identities = 11/40 (27%), Positives = 17/40 (42%)
Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
+E+ K K KK KK R + + +E+ D L
Sbjct: 188 SSTASSGDEKSPKSKAAPKKAGKKMRKWDLDGDEDDDAVL 227
>gnl|CDD|193475 pfam13002, LDB19, Arrestin_N terminal like. This is a family of
proteins related to the Arrestin_N terminal family.
Length = 183
Score = 30.6 bits (69), Expect = 1.0
Identities = 18/65 (27%), Positives = 27/65 (41%), Gaps = 3/65 (4%)
Query: 114 PVLLSPGIHSFPFKLGLPLGLPSTFLGKHGWVQYFCKAALREPT---GYTHKNQQVFIIM 170
LS G HS+PF P LP++ V+Y A + T G + Q+ +
Sbjct: 38 TTDLSVGSHSYPFSYLFPGSLPASTSNSETQVKYELIATVTYKTPRRGISPSKPQLLKLN 97
Query: 171 SPIDL 175
PI +
Sbjct: 98 MPIAV 102
>gnl|CDD|223403 COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational
modification, protein turnover, chaperones].
Length = 623
Score = 31.5 bits (72), Expect = 1.0
Identities = 11/45 (24%), Positives = 22/45 (48%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
E + K I++G + E++ + +++KK+ K E KE
Sbjct: 469 PEFEGKPFKSITKGDLDLELLEEEDEADSEEEKKEFKPLLERVKE 513
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 31.2 bits (71), Expect = 1.0
Identities = 15/64 (23%), Positives = 28/64 (43%), Gaps = 3/64 (4%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKK---KKKKKKKRYEEEEKEEEKD 57
+ EK K ++ I + + K K K ++ K + K ++E+K+EEK
Sbjct: 282 LQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKK 341
Query: 58 NKLL 61
K +
Sbjct: 342 KKQI 345
Score = 30.4 bits (69), Expect = 2.0
Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 1/57 (1%)
Query: 7 EEKEDQEKE-VGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
E D +++ R E+ E K+KKK+KKK++KKKK+ E E+ EK
Sbjct: 304 EMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQAT 360
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 31.6 bits (72), Expect = 1.1
Identities = 12/40 (30%), Positives = 21/40 (52%)
Query: 20 SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EE+ ++++ K K K K K R + +K+E+K K
Sbjct: 1144 QEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKK 1183
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 31.7 bits (72), Expect = 1.1
Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 7/62 (11%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKK-------KKKKKKKKKRYEEEEKEEEKDNK 59
EEKE E+ + + E E K ++++ ++ ++K +R EE E+E E+ +
Sbjct: 246 EEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEE 305
Query: 60 LL 61
L
Sbjct: 306 EL 307
Score = 29.3 bits (66), Expect = 5.5
Identities = 13/58 (22%), Positives = 27/58 (46%), Gaps = 4/58 (6%)
Query: 10 EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKK----KKRYEEEEKEEEKDNKLLHN 63
+ E + +EEE E ++ ++ ++K ++ ++ EE E+E E LL
Sbjct: 259 LEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEE 316
Score = 29.0 bits (65), Expect = 6.5
Identities = 14/51 (27%), Positives = 26/51 (50%), Gaps = 4/51 (7%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
E ED E + EEE +E KK ++ ++++++++ E E EE
Sbjct: 196 ELLEDIEDLL----EALEEELKELKKLEEIQEEQEEEELEQEIEALEERLA 242
Score = 28.6 bits (64), Expect = 8.3
Identities = 10/53 (18%), Positives = 24/53 (45%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E E+ E+E+ + E E+ + ++ ++K + E E + E + +
Sbjct: 626 NELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIE 678
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 31.0 bits (70), Expect = 1.1
Identities = 17/40 (42%), Positives = 21/40 (52%), Gaps = 6/40 (15%)
Query: 12 QEKEVGGISRGKEEETE------EEKKKKKKKKKKKKKKK 45
Q + G GK+E + E K+KKKKK KKKK K
Sbjct: 65 QAERKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPK 104
>gnl|CDD|146145 pfam03357, Snf7, Snf7. This family of proteins are involved in
protein sorting and transport from the endosome to the
vacuole/lysosome in eukaryotic cells.
Vacuoles/lysosomes play an important role in the
degradation of both lipids and cellular proteins. In
order to perform this degradative function,
vacuoles/lysosomes contain numerous hydrolases which
have been transported in the form of inactive
precursors via the biosynthetic pathway and are
proteolytically activated upon delivery to the
vacuole/lysosome. The delivery of transmembrane
proteins, such as activated cell surface receptors to
the lumen of the vacuole/lysosome, either for
degradation/downregulation, or in the case of
hydrolases, for proper localisation, requires the
formation of multivesicular bodies (MVBs). These late
endosomal structures are formed by invaginating and
budding of the limiting membrane into the lumen of the
compartment. During this process, a subset of the
endosomal membrane proteins is sorted into the forming
vesicles. Mature MVBs fuse with the vacuole/lysosome,
thereby releasing cargo containing vesicles into its
hydrolytic lumen for degradation. Endosomal proteins
that are not sorted into the intralumenal MVB vesicles
are either recycled back to the plasma membrane or
Golgi complex, or remain in the limiting membrane of
the MVB and are thereby transported to the limiting
membrane of the vacuole/lysosome as a consequence of
fusion. Therefore, the MVB sorting pathway plays a
critical role in the decision between recycling and
degradation of membrane proteins. A few archaeal
sequences are also present within this family.
Length = 169
Score = 30.3 bits (69), Expect = 1.1
Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 7/35 (20%)
Query: 23 KEEETEEEKKKKKKKKKKK-------KKKKRYEEE 50
K+ + E + KK KK K K+KKRYE++
Sbjct: 22 KKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEKQ 56
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 30.8 bits (70), Expect = 1.2
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+ +K++KK K++KK E+EK EE+
Sbjct: 95 KAQKEEKKAMSKEEKKAIKAEKEKLEEE 122
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 30.2 bits (68), Expect = 1.2
Identities = 19/53 (35%), Positives = 32/53 (60%), Gaps = 3/53 (5%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
EKE EK+V KE+ E+KKK +KK+ K++KR + E++ ++ +L
Sbjct: 51 EKESAEKQVKSS---KEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKEL 100
Score = 27.5 bits (61), Expect = 9.7
Identities = 15/50 (30%), Positives = 24/50 (48%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E EKE +E + +KKKK +KK+ K+R E+ ++E
Sbjct: 45 EGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELA 94
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of
70kDa MW N terminal. This domain is found in
eukaryotes. This domain is about 90 amino acids in
length. This domain is found associated with pfam00076.
This domain is part of U1 snRNP, which is the pre-mRNA
binding protein of the penta-snRNP spliceosome complex.
It extends over a distance of 180 A from its RNA
binding domain, wraps around the core domain of U1
snRNP consisting of the seven Sm proteins and finally
contacts U1-C, which is crucial for 5'-splice-site
recognition.
Length = 94
Score = 29.2 bits (66), Expect = 1.4
Identities = 14/28 (50%), Positives = 23/28 (82%)
Query: 26 ETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
ET EK++++K++KK+K +K+ EEE KE
Sbjct: 60 ETWLEKREREKREKKEKLEKKLEEELKE 87
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 29.3 bits (66), Expect = 1.4
Identities = 11/42 (26%), Positives = 29/42 (69%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
E GG + +++E+ ++R ++ E E +K+++++K+ +K+K
Sbjct: 65 EAGGADIQEEERRSARLARLEDREEERLEKEEEREKRARKRK 106
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast)
and GNL3L (human) are putative GTPases which are
required for growth and play a role in processing of
nucleolar pre-rRNA. This family contains a potential
nuclear localisation signal.
Length = 80
Score = 28.8 bits (65), Expect = 1.4
Identities = 10/31 (32%), Positives = 23/31 (74%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
EE EE+K+K++++K+++K+ ++ E E +
Sbjct: 49 EEIEEKKRKQEEEKERRKEARKAERAEARKR 79
Score = 27.6 bits (62), Expect = 3.2
Identities = 9/29 (31%), Positives = 21/29 (72%)
Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
EEKK+K++++K+++K+ R E + ++
Sbjct: 52 EEKKRKQEEEKERRKEARKAERAEARKRG 80
>gnl|CDD|187774 cd09643, Csn1, CRISPR/Cas system-associated protein Cas9. CRISPR
(Clustered Regularly Interspaced Short Palindromic
Repeats) and associated Cas proteins comprise a system
for heritable host defense by prokaryotic cells against
phage and other foreign DNA; Very large protein
containing McrA/HNH-nuclease related domain and a
RuvC-like nuclease domain; signature gene for type II.
Length = 799
Score = 31.2 bits (71), Expect = 1.4
Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 5/63 (7%)
Query: 19 ISRGKEEETEEEKKKKKKKKKK-----KKKKKRYEEEEKEEEKDNKLLHNCYLYSYYRVT 73
I +E T + K +KK++KK K+ E++ KE D K + L YY+
Sbjct: 518 IEMARENGTNKGTKNRKKRQKKNEDNIKEAASALEQKLKELPLDIKSKNILKLRLYYQQN 577
Query: 74 HIC 76
C
Sbjct: 578 GKC 580
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 30.1 bits (67), Expect = 1.5
Identities = 17/54 (31%), Positives = 36/54 (66%), Gaps = 4/54 (7%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+ EE+ +E+E +R +EEE EK++K K+K ++++K+ EE+E+ +++
Sbjct: 62 RRAAEERLRREEE----ARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQ 111
Score = 28.9 bits (64), Expect = 3.3
Identities = 17/60 (28%), Positives = 37/60 (61%), Gaps = 3/60 (5%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEET---EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E+ E+E+ ++ +EEE EEE+ ++K++K K+K ++ ++E++E+E+ K
Sbjct: 51 EQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQK 110
Score = 27.8 bits (61), Expect = 9.1
Identities = 15/54 (27%), Positives = 30/54 (55%), Gaps = 1/54 (1%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E+ EKE++ K K+E+ E+E+ +K+K++ + + ++ E E EK
Sbjct: 80 EERAREKEEKAKRKAEE-EEKQEQEEQERIQKQKEEAEARAREEAERMRLEREK 132
>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777). This is
a family of eukaryotic proteins of unknown function.
Some of the proteins in this family are putative nucleic
acid binding proteins.
Length = 158
Score = 29.8 bits (67), Expect = 1.6
Identities = 10/40 (25%), Positives = 23/40 (57%)
Query: 20 SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
SR + ++K+++ K ++ KK+ ++ KEE+ + K
Sbjct: 61 SRSRSPSRRRDRKRERDKDAREPKKRERQKLIKEEDLEGK 100
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 30.9 bits (69), Expect = 1.6
Identities = 12/55 (21%), Positives = 26/55 (47%)
Query: 5 GGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
E E +K I + + E ++ +K+++ KK K + ++K+E + K
Sbjct: 420 ASNESESDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKK 474
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 30.5 bits (69), Expect = 1.6
Identities = 10/53 (18%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEE--EEKEEEKDNK 59
K+ Q ++ ++++ E EK++++ + +++ K++YEE +K ++ +
Sbjct: 96 KQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQ 148
Score = 30.1 bits (68), Expect = 1.9
Identities = 8/29 (27%), Positives = 26/29 (89%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
++ +++++K++++++K++KK++ EEE K+
Sbjct: 190 KKLKQQQQKREEERRKQRKKQQEEEERKQ 218
Score = 29.7 bits (67), Expect = 2.7
Identities = 9/32 (28%), Positives = 28/32 (87%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+E E +K K++++K++++++K+ +++++EEE+
Sbjct: 185 QEWELKKLKQQQQKREEERRKQRKKQQEEEER 216
Score = 29.3 bits (66), Expect = 3.2
Identities = 11/37 (29%), Positives = 28/37 (75%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K+ E E KK K++++K+++++R + ++++EE++ K
Sbjct: 181 KKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERK 217
>gnl|CDD|240709 cd12263, RRM_ABT1_like, RNA recognition motif found in activator
of basal transcription 1 (ABT1) and similar proteins.
This subfamily corresponds to the RRM of novel nuclear
proteins termed ABT1 and its homologous counterpart,
pre-rRNA-processing protein ESF2 (eighteen S factor 2),
from yeast. ABT1 associates with the TATA-binding
protein (TBP) and enhances basal transcription activity
of class II promoters. Meanwhile, ABT1 could be a
transcription cofactor that can bind to DNA in a
sequence-independent manner. The yeast ABT1 homolog,
ESF2, is a component of 90S preribosomes and 5'
ETS-based RNPs. It is previously identified as a
putative partner of the TATA-element binding protein.
However, it is primarily localized to the nucleolus and
physically associates with pre-rRNA processing factors.
ESF2 may play a role in ribosome biogenesis. It is
required for normal pre-rRNA processing, as well as for
SSU processome assembly and function. Both ABT1 and
ESF2 contain an RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain). .
Length = 98
Score = 28.7 bits (65), Expect = 1.6
Identities = 10/22 (45%), Positives = 16/22 (72%)
Query: 28 EEEKKKKKKKKKKKKKKKRYEE 49
E+ K+K++KKK KKK++ E
Sbjct: 35 EDPAKRKRRKKKGGNKKKKFTE 56
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 30.7 bits (70), Expect = 1.7
Identities = 12/43 (27%), Positives = 19/43 (44%), Gaps = 8/43 (18%)
Query: 31 KKKKKKKKKKKKKKKRYE--------EEEKEEEKDNKLLHNCY 65
++KKK KK +K+ + Y E + EK +LH
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRHVLHYHN 126
>gnl|CDD|236295 PRK08570, rpl19e, 50S ribosomal protein L19e; Reviewed.
Length = 150
Score = 29.4 bits (67), Expect = 1.9
Identities = 15/44 (34%), Positives = 26/44 (59%), Gaps = 3/44 (6%)
Query: 12 QEKEVGGISRGKEEETEEEKKKKKKK---KKKKKKKKRYEEEEK 52
+ K GISRG+ E E++KK +++ +K KK R ++E+
Sbjct: 51 KAKPKKGISRGRARERHEKRKKGRRRGPGSRKGKKGARTPKKER 94
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 30.0 bits (68), Expect = 1.9
Identities = 15/62 (24%), Positives = 27/62 (43%), Gaps = 9/62 (14%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKK--KKKKKKKKKKRYE----EEEKEEEKDNKL 60
E ED E +V S+ + E E ++ K+ +K K ++ E + +E+ N L
Sbjct: 52 IELEDLENQV---SQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSL 108
Query: 61 LH 62
Sbjct: 109 ED 110
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 29.7 bits (67), Expect = 1.9
Identities = 10/56 (17%), Positives = 20/56 (35%), Gaps = 8/56 (14%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKKK--------KKKKKKKKKKKRYEEEEKEEEK 56
KE++E +V + + KK + + + +KK E E+
Sbjct: 107 KEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEE 162
Score = 28.6 bits (64), Expect = 4.5
Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 3/47 (6%)
Query: 17 GGISRGKEEETEEEKK---KKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
GG+++G E + E K K++K+ K KK + K+ K L
Sbjct: 87 GGVAKGAFTEAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAAL 133
>gnl|CDD|151885 pfam11446, DUF2897, Protein of unknown function (DUF2897). This
is a bacterial family of uncharacterized proteins.
Length = 55
Score = 27.6 bits (62), Expect = 1.9
Identities = 10/26 (38%), Positives = 15/26 (57%)
Query: 32 KKKKKKKKKKKKKKRYEEEEKEEEKD 57
K + K+KKKK KR E++ + D
Sbjct: 30 KFPQFDKQKKKKAKRNAEQDAKNSAD 55
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 30.4 bits (68), Expect = 1.9
Identities = 13/43 (30%), Positives = 19/43 (44%)
Query: 10 EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
++QEKE+ I + + E K K K + K K E E
Sbjct: 236 KEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEA 278
>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020). This
family of fungal proteins is conserved towards the
C-terminus of HMG domains. The function is not known.
Length = 49
Score = 27.6 bits (61), Expect = 2.0
Identities = 11/35 (31%), Positives = 16/35 (45%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
E E K + + KKK K+ +E +EKD
Sbjct: 9 REANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDA 43
Score = 27.2 bits (60), Expect = 2.2
Identities = 14/45 (31%), Positives = 22/45 (48%), Gaps = 6/45 (13%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEE 51
E ++ K+ SR K+ K KKK K+ K+K Y ++E
Sbjct: 10 EANSERNKDNDLRSRVKK------KAKKKFGKEDSKEKDAYIDDE 48
Score = 25.6 bits (56), Expect = 8.8
Identities = 9/31 (29%), Positives = 17/31 (54%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
E ++ + + KKK KKK + + +EK+
Sbjct: 12 NSERNKDNDLRSRVKKKAKKKFGKEDSKEKD 42
>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
(CLPTM1). This family consists of several eukaryotic
cleft lip and palate transmembrane protein 1 sequences.
Cleft lip with or without cleft palate is a common birth
defect that is genetically complex. The nonsyndromic
forms have been studied genetically using linkage and
candidate-gene association studies with only partial
success in defining the loci responsible for orofacial
clefting. CLPTM1 encodes a transmembrane protein and has
strong homology to two Caenorhabditis elegans genes,
suggesting that CLPTM1 may belong to a new gene family.
This family also contains the human cisplatin resistance
related protein CRR9p which is associated with
CDDP-induced apoptosis.
Length = 437
Score = 30.3 bits (69), Expect = 2.0
Identities = 12/37 (32%), Positives = 18/37 (48%)
Query: 35 KKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLYSYYR 71
KKK KKK + EKEE ++ K + S++
Sbjct: 148 LPKKKVKKKNLLGGKSEKEEPEEEKTPAPDKIVSHWH 184
>gnl|CDD|238037 cd00084, HMG-box, High Mobility Group (HMG)-box is found in a
variety of eukaryotic chromosomal proteins and
transcription factors. HMGs bind to the minor groove of
DNA and have been classified by DNA binding
preferences. Two phylogenically distinct groups of
Class I proteins bind DNA in a sequence specific
fashion and contain a single HMG box. One group
(SOX-TCF) includes transcription factors, TCF-1, -3,
-4; and also SRY and LEF-1, which bind four-way DNA
junctions and duplex DNA targets. The second group
(MATA) includes fungal mating type gene products MC,
MATA1 and Ste11. Class II and III proteins (HMGB-UBF)
bind DNA in a non-sequence specific fashion and contain
two or more tandem HMG boxes. Class II members include
non-histone chromosomal proteins, HMG1 and HMG2, which
bind to bent or distorted DNA such as four-way DNA
junctions, synthetic DNA cruciforms, kinked
cisplatin-modified DNA, DNA bulges, cross-overs in
supercoiled DNA, and can cause looping of linear DNA.
Class III members include nucleolar and mitochondrial
transcription factors, UBF and mtTF1, which bind
four-way DNA junctions.
Length = 66
Score = 28.0 bits (63), Expect = 2.0
Identities = 17/46 (36%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 9 KEDQEKEVGGISR--GKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
E+ VG IS+ G+ ++ E++KKK ++K +K K+RYE+E
Sbjct: 21 AENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKERYEKEMP 66
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 29.5 bits (67), Expect = 2.2
Identities = 9/27 (33%), Positives = 19/27 (70%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKD 57
KK KK+K+K++K+ ++ ++ +E K
Sbjct: 18 KKAKKEKRKQRKQARKGADDGDDELKQ 44
Score = 27.6 bits (62), Expect = 9.3
Identities = 8/30 (26%), Positives = 20/30 (66%)
Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+K+K+K++K+ +K ++E K+ ++ K
Sbjct: 21 KKEKRKQRKQARKGADDGDDELKQAAEEAK 50
>gnl|CDD|203738 pfam07716, bZIP_2, Basic region leucine zipper.
Length = 54
Score = 27.6 bits (62), Expect = 2.2
Identities = 13/46 (28%), Positives = 27/46 (58%), Gaps = 8/46 (17%)
Query: 26 ETEEEKKKKKK-----KKKKKKKKKRYEEEE---KEEEKDNKLLHN 63
+ +E + ++++ ++ ++KKK+R EE E KE E++N L
Sbjct: 1 KDDEYRDRRRRNNEAARRSREKKKQREEELEERVKELEEENAQLRQ 46
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.
This family represents the C-terminus (approximately
300 residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 29.9 bits (67), Expect = 2.2
Identities = 16/48 (33%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 12 QEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
QE+E +E+ EE+ K+K +++K++ K EEE K+E + K
Sbjct: 51 QERER----EAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKK 94
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 30.5 bits (69), Expect = 2.2
Identities = 11/56 (19%), Positives = 24/56 (42%), Gaps = 3/56 (5%)
Query: 5 GGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
E+ QE + G ++++ + K KK K + ++K + ++KL
Sbjct: 193 EKAEERKQESKKG---AKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLESKL 245
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 30.2 bits (68), Expect = 2.3
Identities = 13/54 (24%), Positives = 27/54 (50%)
Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLYSYYRVTHICFSDP 80
T+E K ++ + ++ KKR E + ++D + + C + + TH+C P
Sbjct: 153 TDEAKVEEFLETAREIYKKRDERTKALSDEDVDVFYGCTMCQSFAPTHVCVVTP 206
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 29.5 bits (67), Expect = 2.6
Identities = 18/53 (33%), Positives = 31/53 (58%), Gaps = 3/53 (5%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+ + K + E+E I + +EE+ E EK+ + + K + +KR EEE + EEK
Sbjct: 117 QAEQGKSELEQE---IKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEK 166
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 29.5 bits (67), Expect = 2.6
Identities = 7/34 (20%), Positives = 19/34 (55%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+E E+EK ++++K K K ++ + ++ +
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155
Score = 29.5 bits (67), Expect = 3.1
Identities = 8/34 (23%), Positives = 19/34 (55%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
K E E++K ++++K K K R + + + ++
Sbjct: 121 KREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154
Score = 28.7 bits (65), Expect = 4.5
Identities = 9/37 (24%), Positives = 20/37 (54%)
Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
+ K+ E EK+K ++++K K K ++ + +K
Sbjct: 118 QAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154
Score = 28.0 bits (63), Expect = 7.6
Identities = 13/57 (22%), Positives = 26/57 (45%), Gaps = 4/57 (7%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K ++ +Q+ + R E E+ ++++K K K +KKR +K E +
Sbjct: 108 AKVQAQRAEQQAK----KREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTP 160
>gnl|CDD|215038 PLN00040, PLN00040, Protein MAK16 homolog; Provisional.
Length = 233
Score = 29.7 bits (67), Expect = 2.7
Identities = 10/37 (27%), Positives = 15/37 (40%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+ EE + K KKK + E E++ E K
Sbjct: 196 EAEEELPKSDKNPNSKKKSRVHVEIEYEDEIEYKSLM 232
Score = 28.6 bits (64), Expect = 6.2
Identities = 13/43 (30%), Positives = 18/43 (41%)
Query: 20 SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
EE EE K K KKK + E E ++E + L+
Sbjct: 191 MEEVEEAEEELPKSDKNPNSKKKSRVHVEIEYEDEIEYKSLMS 233
Score = 28.6 bits (64), Expect = 6.5
Identities = 10/40 (25%), Positives = 20/40 (50%)
Query: 11 DQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEE 50
++ E+ + +EE + +K KKK + + YE+E
Sbjct: 186 NKVLEMEEVEEAEEELPKSDKNPNSKKKSRVHVEIEYEDE 225
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 29.6 bits (67), Expect = 2.7
Identities = 12/56 (21%), Positives = 28/56 (50%), Gaps = 8/56 (14%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+ + E+E++EK +E E K + ++ + K+K + ++E++EE
Sbjct: 119 LEQL---EREEEEK-----RDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEM 166
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim
have been identified in both mice and humans (mTim and
hTIM, respectively). Mammalian TIM is not the true
orthologue of Drosophila TIM, but is the likely
orthologue of a fly gene, timeout (also called tim-2).
mTim has been shown to be essential for embryonic
development, but does not have substantiated circadian
function. Some family members contain a SANT domain in
this region.
Length = 507
Score = 30.1 bits (67), Expect = 2.7
Identities = 19/101 (18%), Positives = 34/101 (33%), Gaps = 13/101 (12%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL----- 60
GEE+E +++ + + +E+E E +K +++K EEE E++ K
Sbjct: 380 GEEEEPEDELKDDVDGEQADESEHETLALRKNARQRKAGLASPEEEALGEEEQKAPPKKK 439
Query: 61 --------LHNCYLYSYYRVTHICFSDPPGQAPVLLSPGIH 93
S + A P IH
Sbjct: 440 QLNQKNKQQTGSGTNSDEERDDTSLDEDRDLADDGGLPRIH 480
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is
a small protein module with a triple-stranded
beta-sheet fold. This is a family of WW domain binding
proteins.
Length = 78
Score = 27.6 bits (62), Expect = 2.7
Identities = 9/20 (45%), Positives = 15/20 (75%)
Query: 27 TEEEKKKKKKKKKKKKKKKR 46
T+ +K++KKK+ KK K +R
Sbjct: 7 TDAYRKEQKKKELKKNKAER 26
Score = 26.9 bits (60), Expect = 5.7
Identities = 8/20 (40%), Positives = 14/20 (70%)
Query: 29 EEKKKKKKKKKKKKKKKRYE 48
E+KKK+ KK K +++ +R
Sbjct: 13 EQKKKELKKNKAERQARREA 32
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 28.5 bits (64), Expect = 2.7
Identities = 4/27 (14%), Positives = 19/27 (70%), Gaps = 2/27 (7%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEE 51
E+++++++++++++++ EEE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEE--SEEE 95
>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein. The PcfJ-like protein family
includes the E. faecalis PcfJ protein, which is
functionally uncharacterized. It is found in bacteria
and viruses, and is typically between 159 and 170 amino
acids in length. There is a conserved HCV sequence
motif.
Length = 169
Score = 29.2 bits (66), Expect = 2.8
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+ + + KK+K+K +K +KRY E EK +
Sbjct: 32 DRAVAKYQAKKEKEKAEKMRKRYPELEKIFRELKS 66
>gnl|CDD|181091 PRK07720, fliJ, flagellar biosynthesis chaperone; Validated.
Length = 146
Score = 28.9 bits (65), Expect = 2.8
Identities = 10/22 (45%), Positives = 15/22 (68%)
Query: 34 KKKKKKKKKKKKRYEEEEKEEE 55
KK +K K+KK++ + EEK E
Sbjct: 109 KKYEKMKEKKQEMFALEEKAAE 130
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 30.1 bits (69), Expect = 2.8
Identities = 7/48 (14%), Positives = 17/48 (35%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEE 49
+ED + ++E EEE++ + + + E+
Sbjct: 172 GFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEK 219
>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
Tim54. Mitochondrial function depends on the import of
hundreds of different proteins synthesised in the
cytosol. Protein import is a multi-step pathway which
includes the binding of precursor proteins to surface
receptors, translocation of the precursor across one or
both mitochondrial membranes, and folding and assembly
of the imported protein inside the mitochondrion. Most
precursor proteins carry amino-terminal targeting
signals, called pre-sequences, and are imported into
mitochondria via import complexes located in both the
outer and the inner membrane (IM). The IM complex, TIM,
is made up of at least two proteins which mediate
translocation of proteins into the matrix by removing
their signal peptide and another pair of proteins, Tim54
and Tim22, that insert the polytopic proteins, that
carry internal targetting information, into the inner
membrane.
Length = 377
Score = 29.7 bits (67), Expect = 2.8
Identities = 17/84 (20%), Positives = 30/84 (35%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYL 66
+ E E V + E E + + ++ + E E+ EE++NK +
Sbjct: 196 DPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPPVP 255
Query: 67 YSYYRVTHICFSDPPGQAPVLLSP 90
Y + P + P LL P
Sbjct: 256 KPYISPDEYPSAPLPPELPQLLQP 279
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 29.5 bits (66), Expect = 2.9
Identities = 13/49 (26%), Positives = 31/49 (63%), Gaps = 3/49 (6%)
Query: 13 EKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK---EEEKDN 58
EKE +SR K ++ ++ K+K+K++ + ++++E+ + EE++ N
Sbjct: 144 EKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRN 192
Score = 29.1 bits (65), Expect = 4.0
Identities = 9/42 (21%), Positives = 25/42 (59%)
Query: 14 KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
K+ R + + +E+++ + +K K+ ++K+ EE+ + +E
Sbjct: 153 KKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDE 194
>gnl|CDD|115706 pfam07070, Spo0M, SpoOM protein. This family consists of several
bacterial SpoOM proteins which are thought to control
sporulation in Bacillus subtilis.Spo0M exerts certain
negative effects on sporulation and its gene expression
is controlled by sigmaH.
Length = 218
Score = 29.6 bits (67), Expect = 2.9
Identities = 12/30 (40%), Positives = 13/30 (43%), Gaps = 1/30 (3%)
Query: 117 LSPG-IHSFPFKLGLPLGLPSTFLGKHGWV 145
+ PG FPF L LP P T WV
Sbjct: 82 IQPGEERVFPFTLTLPWETPITIGDAKVWV 111
>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
Length = 472
Score = 30.0 bits (68), Expect = 2.9
Identities = 12/28 (42%), Positives = 20/28 (71%)
Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EEK+K+KK KK+ K+++R + K E+
Sbjct: 394 EEKEKEKKAKKEGKEERRIHFQNKSIEE 421
Score = 28.8 bits (65), Expect = 5.8
Identities = 11/30 (36%), Positives = 21/30 (70%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
E E+EK+KK KK+ K++++ ++ + EE
Sbjct: 392 EREEKEKEKKAKKEGKEERRIHFQNKSIEE 421
Score = 28.4 bits (64), Expect = 9.5
Identities = 11/32 (34%), Positives = 21/32 (65%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
+ EE E+EKK KK+ K++++ + + E+ E
Sbjct: 392 EREEKEKEKKAKKEGKEERRIHFQNKSIEEAE 423
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 30.0 bits (67), Expect = 3.0
Identities = 15/56 (26%), Positives = 29/56 (51%), Gaps = 3/56 (5%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKK---KKKKKKKKKRYEEEEKEEEKDNK 59
++ ED + + S EE+ EE + + K+K KKK ++ E E+++ +K
Sbjct: 227 DDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDK 282
Score = 29.2 bits (65), Expect = 5.1
Identities = 14/43 (32%), Positives = 22/43 (51%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
E E+ D E E + +EEE EEK+ ++ ++KK K
Sbjct: 145 ESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDK 187
>gnl|CDD|235602 PRK05776, PRK05776, DNA topoisomerase I; Provisional.
Length = 670
Score = 30.0 bits (68), Expect = 3.0
Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 2/50 (4%)
Query: 325 IQYLARNKVLESETRELASLTRGKIKP--GERDDWTNQQLYVPPLPPTNL 372
++ L+R+ L T+G +KP G +DD + +Y PP NL
Sbjct: 331 LKGLSRSPQYRPLVNLLLKETKGVLKPVQGPKDDPAHPAIYPTGEPPKNL 380
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 29.8 bits (67), Expect = 3.0
Identities = 10/42 (23%), Positives = 20/42 (47%)
Query: 20 SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
+ KE E E++ + K+ +K+ + E EK+ E +
Sbjct: 111 EKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEE 152
>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
Length = 194
Score = 29.4 bits (66), Expect = 3.1
Identities = 8/38 (21%), Positives = 23/38 (60%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
+E +E++ +++ + KK+ + +E+++EK N +
Sbjct: 4 EEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPV 41
Score = 27.9 bits (62), Expect = 8.4
Identities = 9/34 (26%), Positives = 17/34 (50%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
K+ EE + KK+ K+ K+ +E++ E
Sbjct: 10 KDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVED 43
>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases.and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain.
Expression of a construct consisting of only the
N-terminal domain of Saccharomyces pombe Cdc37 results
in cellular viability. This indicates that interactions
with the cochaperone Hsp90 may not be essential for
Cdc37 function.
Length = 154
Score = 28.9 bits (65), Expect = 3.2
Identities = 18/55 (32%), Positives = 32/55 (58%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
NE E ++ +KE+ + E EE KK + K KK++K+ ++ +E ++EEK
Sbjct: 85 NEMLAELQDQLKKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEK 139
>gnl|CDD|226776 COG4326, Spo0M, Sporulation control protein [General function
prediction only].
Length = 270
Score = 29.4 bits (66), Expect = 3.3
Identities = 20/73 (27%), Positives = 31/73 (42%), Gaps = 16/73 (21%)
Query: 49 EEEKEEEKDNKLLHNCYLYSYYRVTHICFSDPPGQAPVLLSPGIHSFPFKLGLPLGLPST 108
E + E + L N +R+ + F+ PG+ +FPF+L LP P T
Sbjct: 79 EVDDERGQQQGTLAN------WRLPY-AFTIQPGEE--------RNFPFELSLPWNTPVT 123
Query: 109 FLGQAPVLLSPGI 121
G A V + G+
Sbjct: 124 I-GDAKVWVETGL 135
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 29.6 bits (67), Expect = 3.3
Identities = 18/52 (34%), Positives = 31/52 (59%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
EK E+ E++E+E + +EEE +EE+KK+K K K +E++ E+
Sbjct: 14 EKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIED 65
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 30.1 bits (68), Expect = 3.3
Identities = 14/43 (32%), Positives = 19/43 (44%), Gaps = 7/43 (16%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
N K EEK QE R ++ EE K+ K+ + K K
Sbjct: 907 NNKYNEEKSAQE-------RMPGADSPEELMKRAKEYQDKHKG 942
>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361).
This is a region of 120 amino acids that is conserved
in a family of proteins found from plants to fungi. The
function is not known.
Length = 114
Score = 28.3 bits (64), Expect = 3.3
Identities = 10/35 (28%), Positives = 18/35 (51%)
Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNC 64
E+KK ++ K+ +KK +EKE + + L
Sbjct: 60 ERKKATRRLKQLEKKLEETSDEKEIKDLKEQLSKA 94
>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit. This
family includes the vacuolar ATP synthase E subunit, as
well as the archaebacterial ATP synthase E subunit.
Length = 195
Score = 29.2 bits (66), Expect = 3.3
Identities = 15/47 (31%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
+++ E++ I EEE E EK + ++ +KK ++ YE++EK+ E
Sbjct: 3 RQEAEEKAEEIRAEAEEEFEIEKAEAVEEAEKKIEE-IYEKKEKQAE 48
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 29.7 bits (67), Expect = 3.4
Identities = 10/53 (18%), Positives = 26/53 (49%), Gaps = 4/53 (7%)
Query: 9 KEDQEKEVGGISRGKEEET----EEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
+E+ EK + + + E KK++ + ++KKK++ + E+ ++
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 29.8 bits (67), Expect = 3.4
Identities = 15/45 (33%), Positives = 24/45 (53%), Gaps = 4/45 (8%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
+E G E + +EE++EE +KK+K KKK +K+K
Sbjct: 329 SEFLGYMGGIDEDD----EDEDDEESKEEVEKKQKVKKKPRKRKV 369
Score = 28.6 bits (64), Expect = 6.6
Identities = 15/44 (34%), Positives = 27/44 (61%), Gaps = 7/44 (15%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
+ GG +++D++++ EE EE +KK+K KKK +K+K
Sbjct: 332 LGYMGGIDEDDEDED-------DEESKEEVEKKQKVKKKPRKRK 368
Score = 28.6 bits (64), Expect = 6.7
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 8/39 (20%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
E+ ED++ E E + E EKK+K KKK +K+K
Sbjct: 340 EDDEDEDDE--------ESKEEVEKKQKVKKKPRKRKVN 370
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 27.9 bits (63), Expect = 3.4
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 2/48 (4%)
Query: 14 KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
KE+G + K+E+ E + K+ K + K + E E E E D LL
Sbjct: 57 KEIGKAKK-KKEDAEALIAEVKELKDELKALEA-ELRELEAELDKLLL 102
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 29.5 bits (66), Expect = 3.5
Identities = 11/58 (18%), Positives = 24/58 (41%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
+ E+E Q + +EE E+ + +K K+++ +E + +E L
Sbjct: 82 AEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLL 139
>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
production and conversion].
Length = 108
Score = 28.2 bits (63), Expect = 3.5
Identities = 14/50 (28%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E KE+ E+ + +R + E EE +++ +K ++ ++ EE E+E E+
Sbjct: 25 EAKEEAEQIIKE-AREEAREIIEEAEEEAEKLAQEILEEAREEAEEEAEE 73
Score = 27.8 bits (62), Expect = 4.3
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
+EE EE K++ ++ K+ +++ R EE EEE
Sbjct: 20 DEEIEEAKEEAEQIIKEAREEAREIIEEAEEE 51
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 29.5 bits (67), Expect = 3.5
Identities = 20/51 (39%), Positives = 27/51 (52%), Gaps = 4/51 (7%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
EE E++ KE ++E E EKK +KK KK KK + EE EE +
Sbjct: 72 EEYEEELKE----YEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLE 118
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 29.6 bits (67), Expect = 3.5
Identities = 15/57 (26%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 1 MNEKGGEEKEDQEKEV-GGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+NE+ E K+ +++ R +E E +++K+ K ++ +K+ E+ EK EK
Sbjct: 311 LNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEK 367
>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
synthase complex, beta subunit. Nomenclature follows
the description for Methanosarcina thermophila. The
CO-methylating acetyl-CoA synthase is considered the
defining enzyme of the Wood-Ljungdahl pathway, used for
acetate catabolism by sulfate reducing bacteria but for
acetate biosynthesis by acetogenic bacteria such as
oorella thermoacetica (f. Clostridium thermoaceticum)
[Energy metabolism, Chemoautotrophy].
Length = 458
Score = 29.8 bits (67), Expect = 3.5
Identities = 13/54 (24%), Positives = 29/54 (53%)
Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLYSYYRVTHICFSDP 80
T+ +K K++ +K ++ +KR E + ++D + + C + + TH+C P
Sbjct: 151 TDPDKVKEELEKAREIYEKRDERTKALSDEDVDVFYGCVMCQSFAPTHVCIVTP 204
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 29.7 bits (67), Expect = 3.6
Identities = 6/50 (12%), Positives = 22/50 (44%), Gaps = 3/50 (6%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E+ + + S + ++ + + +++KK + ++K+ E+
Sbjct: 182 AEQAELTTLL---SEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEE 228
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease
M16C (pfam05193).
Length = 248
Score = 29.1 bits (66), Expect = 3.6
Identities = 14/38 (36%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEE-EKDNKLL 61
+ET EEK +++++K +KKK EE+KE+ + L
Sbjct: 4 DETLEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLEL 41
>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
structure and biogenesis].
Length = 67
Score = 27.3 bits (61), Expect = 3.6
Identities = 9/25 (36%), Positives = 19/25 (76%)
Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKE 53
E++K+KK +K+K K+ +E+++E
Sbjct: 43 EKRKRKKAAARKRKFKRLRKEQQRE 67
Score = 26.5 bits (59), Expect = 6.9
Identities = 10/21 (47%), Positives = 14/21 (66%)
Query: 26 ETEEEKKKKKKKKKKKKKKKR 46
E EK+K+KK +K+K KR
Sbjct: 39 EKPSEKRKRKKAAARKRKFKR 59
>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
Length = 112
Score = 28.0 bits (61), Expect = 3.7
Identities = 15/41 (36%), Positives = 25/41 (60%)
Query: 19 ISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
IS E +T E KK+ + +KK +K E+E+K E+ +N+
Sbjct: 46 ISNEDEMKTTHEHIKKENEDEKKPEKPENEDEKKPEKPENE 86
>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2). This family
represents a conserved region approximately 60 residues
long within the eukaryotic targeting protein for Xklp2
(TPX2). Xklp2 is a kinesin-like protein localised on
centrosomes throughout the cell cycle and on spindle
pole microtubules during metaphase. In Xenopus, it has
been shown that Xklp2 protein is required for
centrosome separation and maintenance of spindle
bi-polarity. TPX2 is a microtubule-associated protein
that mediates the binding of the C-terminal domain of
Xklp2 to microtubules. It is phosphorylated during
mitosis in a microtubule-dependent way.
Length = 57
Score = 27.0 bits (60), Expect = 3.7
Identities = 12/33 (36%), Positives = 22/33 (66%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E+ E +KK ++K+K + +K+ E +KEEE+
Sbjct: 8 EKRAEFDKKLEEKEKALEAEKEEAEARQKEEEE 40
>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase. This domain family is found in
eukaryotes, and is approximately 140 amino acids in
length. The family is found in association with
pfam00069. Polo-like kinase 1 (Plx1) is essential during
mitosis for the activation of Cdc25C, for spindle
assembly, and for cyclin B degradation. This family is
Polo kinase kinase (PKK) which phosphorylates Polo
kinase and Polo-like kinase to activate them. PKK is a
serine/threonine kinase.
Length = 142
Score = 28.4 bits (64), Expect = 3.8
Identities = 13/45 (28%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKRYEE-EEKEEEKDNKLLHNC 64
KE + E EK + ++++KK+ K EE E+K ++++ + L
Sbjct: 75 EKKELKQEVEKLPRFQEQEKKRMKAEKEEQEQKHQKQEREFLAKQ 119
>gnl|CDD|205311 pfam13130, DUF3952, Domain of unknown function (DUF3952). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria, and is approximately
110 amino acids in length. There is a conserved VMSAS
sequence motif.
Length = 107
Score = 28.1 bits (63), Expect = 3.9
Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 10/56 (17%)
Query: 7 EEKEDQEKEV------GGISRGKEE----ETEEEKKKKKKKKKKKKKKKRYEEEEK 52
EEKED E GI K++ ET +E +K+K K Y+EE+
Sbjct: 52 EEKEDGEHSKTIYQTTEGIYNTKDKSLYGETTQEITTDIDNEKEKDKNNNYKEEKV 107
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 29.4 bits (67), Expect = 4.0
Identities = 12/37 (32%), Positives = 22/37 (59%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
++E E KK+ + K++ K R E E++ E+ N+L
Sbjct: 48 KKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNEL 84
Score = 29.4 bits (67), Expect = 4.1
Identities = 14/56 (25%), Positives = 37/56 (66%), Gaps = 1/56 (1%)
Query: 2 NEKGGEEKEDQ-EKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EK +KE+ ++++ + + +EE ++EK+ ++K+++ +KK++ EE +E+ +
Sbjct: 87 LEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ 142
Score = 29.0 bits (66), Expect = 5.8
Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 4/56 (7%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
EE D++ E K EE E+K+K+ ++K+++ +KK E EE EE+ +L
Sbjct: 95 EENLDRKLE----LLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97
family. This model family represents the major capsid
protein component of the heads (capsids) of
bacteriophage HK97, phi-105, P27, and related phage.
This model represents one of several analogous families
lacking detectable sequence similarity. The gene
encoding this component is typically located in an
operon encoding the small and large terminase subunits,
the portal protein and the prohead or maturation
protease [Mobile and extrachromosomal element
functions, Prophage functions].
Length = 384
Score = 29.2 bits (66), Expect = 4.0
Identities = 9/55 (16%), Positives = 23/55 (41%), Gaps = 2/55 (3%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK--KRYEEEEKEEEKDNK 59
E + KE+ ++ ++ EE+K + K++ K + E+ ++
Sbjct: 6 EALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 29.5 bits (67), Expect = 4.0
Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 14 KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
KE+G + E++ EE + K+ K+K K+ + +E E E D LL
Sbjct: 57 KEIGRALKRGEDDAEELIAEVKELKEKLKELEA-ALDELEAELDTLLL 103
>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
Length = 763
Score = 29.5 bits (66), Expect = 4.1
Identities = 14/31 (45%), Positives = 16/31 (51%)
Query: 15 EVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
E G + + E E E K KK KK KKKK
Sbjct: 228 EGGELIEKPQAEAENEAGKSDKKDKKSKKKK 258
>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 199 to 238 amino acids in length. This domain
is found associated with pfam06512, pfam00520. This
domain has a conserved ADD sequence motif.
Length = 222
Score = 29.0 bits (65), Expect = 4.1
Identities = 9/29 (31%), Positives = 16/29 (55%)
Query: 28 EEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E K+++ + K KK++ EE + EK
Sbjct: 9 ESAKERRNRNDKNKKEEHSIGSEEGDSEK 37
>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV.
Length = 336
Score = 29.4 bits (66), Expect = 4.3
Identities = 11/26 (42%), Positives = 20/26 (76%)
Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEE 55
++ K +K K+ KK+KKR E++E ++E
Sbjct: 26 DEAKPRKIKRVKKRKKREEKDELDDE 51
Score = 28.3 bits (63), Expect = 8.5
Identities = 10/26 (38%), Positives = 18/26 (69%)
Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEE 54
+E K +K K+ KK+KK+ ++E +E
Sbjct: 26 DEAKPRKIKRVKKRKKREEKDELDDE 51
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 29.2 bits (65), Expect = 4.3
Identities = 16/63 (25%), Positives = 24/63 (38%), Gaps = 12/63 (19%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKKKKKKK------------KKKKKKKRYEEEEKEEEK 56
+++ G EEE E +K+K K K K+K + EE EE+
Sbjct: 140 AALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEE 199
Query: 57 DNK 59
K
Sbjct: 200 KAK 202
>gnl|CDD|220624 pfam10187, Nefa_Nip30_N, N-terminal domain of NEFA-interacting
nuclear protein NIP30. This is a the N-terminal 100
amino acids of a family of proteins conserved from
plants to humans. The full-length protein has
putatively been called NEFA-interacting nuclear protein
NIP30, however no reference could be found to confirm
this.
Length = 99
Score = 27.7 bits (62), Expect = 4.4
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
E+E E + ++ ++ +KK+KR E EE EE
Sbjct: 67 EDEVEFLDEVEESRRAAEKKRKREEAEELEE 97
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 29.0 bits (65), Expect = 4.4
Identities = 16/56 (28%), Positives = 27/56 (48%), Gaps = 3/56 (5%)
Query: 6 GEEKEDQEKEVGGISRGK---EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
G K+ +K + K E EE+ +K +KK K+ E+ ++E+EKD
Sbjct: 121 GINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDE 176
Score = 28.2 bits (63), Expect = 6.7
Identities = 16/63 (25%), Positives = 34/63 (53%), Gaps = 6/63 (9%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKK------KKKKKRYEEEEKEEE 55
+ G + + K G+ +EE+ +E+ +KK K+ ++ ++ EEEE+EEE
Sbjct: 124 KKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEE 183
Query: 56 KDN 58
+++
Sbjct: 184 EED 186
Score = 28.2 bits (63), Expect = 8.1
Identities = 15/56 (26%), Positives = 29/56 (51%), Gaps = 4/56 (7%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
K G E++E +S +++ E E + ++ +K ++ EEEE+EEE +
Sbjct: 136 KRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEE----EEEEEEEEDE 187
>gnl|CDD|223232 COG0154, GatA, Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit
and related amidases [Translation, ribosomal structure
and biogenesis].
Length = 475
Score = 29.2 bits (66), Expect = 4.4
Identities = 18/76 (23%), Positives = 25/76 (32%), Gaps = 27/76 (35%)
Query: 86 VLLSPGIHSFPFKLG---------------LPLGLPSTFLGQAPVLLSPGIHSFPF---K 127
VLL+P + K+G +P+ G P I S P
Sbjct: 385 VLLTPTTPTPAPKIGESESDGDDPLEMYLLDVFTVPANLAGL------PAI-SVPAGFTA 437
Query: 128 LGLPLGLPSTFLGKHG 143
GLP+GL +G
Sbjct: 438 DGLPVGL--QLIGPAF 451
>gnl|CDD|153294 cd07610, FCH_F-BAR, The Extended FES-CIP4 Homology (FCH) or F-BAR
(FCH and Bin/Amphiphysin/Rvs) domain, a dimerization
module that binds and bends membranes. F-BAR domains
are dimerization modules that bind and bend membranes
and are found in proteins involved in membrane dynamics
and actin reorganization. F-BAR domain containing
proteins, also known as Pombe Cdc15 homology (PCH)
family proteins, include Fes and Fer tyrosine kinases,
PACSINs/Syndapins, FCHO, PSTPIP, CIP4-like proteins and
srGAPs. Many members also contain an SH3 domain and play
roles in endocytosis. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
These tubules have diameters larger than those observed
with N-BARs. The F-BAR domains of some members such as
NOSTRIN and Rgd1 are important for the subcellular
localization of the protein.
Length = 191
Score = 28.8 bits (65), Expect = 4.4
Identities = 18/60 (30%), Positives = 26/60 (43%), Gaps = 11/60 (18%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKKKK-----KKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
KED+E+ K+E E EK KKK K KK ++ E+ EK ++
Sbjct: 92 KEDKEQA------RKKELAEGEKLKKKLQELWAKLAKKADEEYREQVEKLNPAQSEYEEE 145
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 29.3 bits (66), Expect = 4.5
Identities = 13/39 (33%), Positives = 23/39 (58%), Gaps = 6/39 (15%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
G++ ED ++E G + E + KKK KKK+++K +
Sbjct: 164 GDDGEDDDEE------GSDGEEDGAKKKAKKKRQRKPES 196
Score = 28.2 bits (63), Expect = 9.8
Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 4/40 (10%)
Query: 8 EKEDQEKEVGGISRGKEEE----TEEEKKKKKKKKKKKKK 43
KE E G ++E EE+ KKK KKK+++K
Sbjct: 154 NKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRK 193
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 28.3 bits (64), Expect = 4.5
Identities = 13/26 (50%), Positives = 20/26 (76%)
Query: 32 KKKKKKKKKKKKKKRYEEEEKEEEKD 57
K+KKKKKKKKK+ + +EE++K+
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVS 107
Score = 27.5 bits (62), Expect = 9.6
Identities = 12/29 (41%), Positives = 20/29 (68%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K+KKKKKKKKK+ + E+++K + +
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEER 110
>gnl|CDD|221544 pfam12353, eIF3g, Eukaryotic translation initiation factor 3
subunit G. This domain family is found in eukaryotes,
and is approximately 130 amino acids in length. The
family is found in association with pfam00076. This
family is subunit G of the eukaryotic translation
initiation factor 3. Subunit G is required for eIF3
integrity.
Length = 126
Score = 28.0 bits (63), Expect = 4.5
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 19 ISRGKEEETEEEKKKKKKKKKKKKKKKRY 47
+ R +EE EEE+++K KK KK K R
Sbjct: 81 LIRNWKEEEEEEEEEKAKKSGKKLVKCRI 109
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ
homolog and probable assembly protein of the Mycoplasma
terminal organelle. The terminal organelle is involved
in both cytadherence and gliding motility [Cellular
processes, Chemotaxis and motility].
Length = 871
Score = 29.4 bits (65), Expect = 4.6
Identities = 14/38 (36%), Positives = 21/38 (55%)
Query: 19 ISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
I++ E + + KKKKKKKKK + + K+E K
Sbjct: 91 INKSGEFDNITDDDTPSKKKKKKKKKGWFWAKSKQESK 128
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents
the seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is
recognized by the pfam model tRNA-synt_2b. The
seryl-tRNA synthetases of two archaeal species,
Methanococcus jannaschii and Methanobacterium
thermoautotrophicum, differ considerably and are
included in a different model [Protein synthesis, tRNA
aminoacylation].
Length = 418
Score = 29.3 bits (66), Expect = 4.6
Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 6/64 (9%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETE-EEKKKK-----KKKKKKKKKKKRYEEEEKEEE 55
+G D EK + K+ +E EE + K K+ K K +KK EE K+E
Sbjct: 19 KARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKEL 78
Query: 56 KDNK 59
K+ K
Sbjct: 79 KELK 82
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 27.6 bits (61), Expect = 4.7
Identities = 12/37 (32%), Positives = 22/37 (59%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKK 38
+G EE+ ++E+E G + + E EE+ + KK+K
Sbjct: 65 EGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQK 101
Score = 27.2 bits (60), Expect = 7.8
Identities = 15/53 (28%), Positives = 27/53 (50%), Gaps = 7/53 (13%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EE+E E + G+EEE EEE++ + K+ E++E++ + K
Sbjct: 53 EEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAA-------EDEEDDAETK 98
>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein. All proteins in this family
for which functions are known are used for the
recognition of DNA damage as part of nucleotide excision
repair. This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 172
Score = 28.6 bits (64), Expect = 4.8
Identities = 10/29 (34%), Positives = 24/29 (82%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
EE +E+K+++++ K++ K+K++E++ KE
Sbjct: 93 EEALDEEKERREESKEEMKEKKFEKKLKE 121
Score = 27.4 bits (61), Expect = 9.6
Identities = 18/64 (28%), Positives = 29/64 (45%), Gaps = 11/64 (17%)
Query: 5 GGEEKEDQEKEVGGISRGKEEETEEEKKKKK-----------KKKKKKKKKKRYEEEEKE 53
G EE D+EKE S+ + +E + EKK K+ KK+ + + + E
Sbjct: 91 GSEEALDEEKERREESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEGRVHEHEFGPETNG 150
Query: 54 EEKD 57
E+D
Sbjct: 151 VEED 154
>gnl|CDD|180505 PRK06278, PRK06278, cobyrinic acid a,c-diamide synthase; Validated.
Length = 476
Score = 29.2 bits (66), Expect = 4.8
Identities = 14/37 (37%), Positives = 20/37 (54%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
KE+E EE KK K K+K K + K+EE++
Sbjct: 202 KEDEKEEIFKKNKILKEKLKSRSGISTNNKKEERNKP 238
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 29.2 bits (65), Expect = 4.8
Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 9/65 (13%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKK---------KKKKKKKKRYEEEEKE 53
G EE+ED+E G E EEE+++ + + ++ KK+ EEEE E
Sbjct: 185 SDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEME 244
Query: 54 EEKDN 58
EE N
Sbjct: 245 EEVIN 249
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 29.0 bits (65), Expect = 4.8
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
K++KKK++KK ++ + EEE + EK +
Sbjct: 12 TKREKKKREKKSIRQSKLEEELNKLEKKEEE 42
Score = 28.6 bits (64), Expect = 7.8
Identities = 12/49 (24%), Positives = 23/49 (46%), Gaps = 3/49 (6%)
Query: 10 EDQEKEVGGISRGKEEETEEEKKKKKKKKK---KKKKKKRYEEEEKEEE 55
++E++ I K + E++ KK K K K +++ +EEE
Sbjct: 95 YEREEQKKAILPRKLRSSTSEREPKKFKAKPVPKSIYIPLLKDKMQEEE 143
Score = 28.2 bits (63), Expect = 8.6
Identities = 12/59 (20%), Positives = 29/59 (49%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
++++ E+ K ++ E K + ++K+K+ K++ E++E E+ K K
Sbjct: 245 LDQENISAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKK 303
>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. This
model represents a group of paralogous families in
plasmodium species alternately annotated as reticulocyte
binding protein, 235-kDa family protein and rhoptry
protein. Rhoptry protein is localized on the cell surface
and is extremely large (although apparently lacking in
repeat structure) and is important for the process of
invasion of the RBCs by the parasite. These proteins are
found in P. falciparum, P. vivax and P. yoelii.
Length = 2757
Score = 29.6 bits (66), Expect = 5.0
Identities = 16/53 (30%), Positives = 30/53 (56%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
+ K+D E + I ++ E +K ++K K+ KK+K R E++ + +K NK
Sbjct: 1541 KTKKDSEIIIKEIKDAHKKFILEAEKSEQKIKEIKKEKFRIEDDAAKNDKSNK 1593
>gnl|CDD|240597 cd12933, eIF3G, eIF3G domain found in eukaryotic translation
initiation factor 3 subunit G (eIF-3G) and similar
proteins. eIF-3G, also termed eIF-3 subunit 4, or
eIF-3-delta, or eIF3-p42, or eIF3-p44, is the
RNA-binding subunit of eIF3. eIF3 is a large
multi-subunit complex that plays a central role in the
initiation of translation by binding to the 40 S
ribosomal subunit and promoting the binding of
methionyl-tRNAi and mRNA. eIF-3G binds 18 S rRNA and
beta-globin mRNA, and therefore appears to be a
nonspecific RNA-binding protein. Besides, eIF-3G is one
of the cytosolic targets; it interacts with mature
apoptosis-inducing factor (AIF). This family also
includes yeast eIF3-p33, a homolog of vertebrate
eIF-3G; it plays an important role in the initiation
phase of protein synthesis in yeast. It binds both mRNA
and rRNA fragments due to an RNA recognition motif near
its C-terminus.
Length = 114
Score = 27.6 bits (62), Expect = 5.0
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 19 ISRGKEEETEEEKKKKKKKKKKKK 42
+ R KEE EEE++ KKKKKK KK
Sbjct: 71 LIRNKEELEEEEEEAKKKKKKGKK 94
>gnl|CDD|148072 pfam06244, DUF1014, Protein of unknown function (DUF1014). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 122
Score = 27.8 bits (62), Expect = 5.1
Identities = 11/36 (30%), Positives = 20/36 (55%)
Query: 20 SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
+R + EE E++++K + ++K KK E EE
Sbjct: 1 TRAQIEERLEKEQRKLEAPEEKSKKNAAREVPLEEN 36
Score = 27.8 bits (62), Expect = 5.2
Identities = 8/35 (22%), Positives = 19/35 (54%)
Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
EE+ +K+++K + ++K + +E + L N
Sbjct: 6 EERLEKEQRKLEAPEEKSKKNAAREVPLEENLNVN 40
>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 28.6 bits (64), Expect = 5.1
Identities = 10/56 (17%), Positives = 17/56 (30%)
Query: 4 KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
G E ED E+ + E E+ + + E EK ++
Sbjct: 34 PVGAEAEDAAAELAEKKAAAQSEIEKFCANIADAAADQLYALQKELLEKLKDLAEV 89
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 28.9 bits (65), Expect = 5.2
Identities = 13/61 (21%), Positives = 29/61 (47%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
+ ++ ++ +E + E ++ K + K+K K+++KR E +E+ EE K
Sbjct: 217 VELSKKIDELHEEFRNLQNELRELEKKIKALRAKEKAAKRREKREELKERAEEIYEKFKR 276
Query: 63 N 63
Sbjct: 277 G 277
>gnl|CDD|146016 pfam03179, V-ATPase_G, Vacuolar (H+)-ATPase G subunit. This
family represents the eukaryotic vacuolar (H+)-ATPase
(V-ATPase) G subunit. V-ATPases generate an acidic
environment in several intracellular compartments.
Correspondingly, they are found as membrane-attached
proteins in several organelles. They are also found in
the plasma membranes of some specialised cells.
V-ATPases consist of peripheral (V1) and membrane
integral (V0) heteromultimeric complexes. The G subunit
is part of the V1 subunit, but is also thought to be
strongly attached to the V0 complex. It may be involved
in the coupling of ATP degradation to H+ translocation.
Length = 105
Score = 27.5 bits (62), Expect = 5.3
Identities = 10/36 (27%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 24 EEETEEEKKKKKKKKKKKKKKK--RYEEEEKEEEKD 57
E+E EE + +++ + K+ + + E EK+ EK+
Sbjct: 39 EKEIEEYRAQREAEFKEFEAEHSGSRGELEKKIEKE 74
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 29.1 bits (66), Expect = 5.4
Identities = 13/51 (25%), Positives = 29/51 (56%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
E+E ++ + + E++ ++++KKK KKKK K + + + KE + +
Sbjct: 382 EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQD 432
>gnl|CDD|206402 pfam14234, DUF4336, Domain of unknown function (DUF4336).
Length = 314
Score = 28.7 bits (65), Expect = 5.5
Identities = 13/27 (48%), Positives = 17/27 (62%), Gaps = 1/27 (3%)
Query: 83 QAPVLLSPGIHSFPFKLGLP-LGLPST 108
+A V ++PG SFP L L LG+P T
Sbjct: 84 KAQVWVAPGQWSFPLNLPLSWLGIPRT 110
Score = 28.7 bits (65), Expect = 5.5
Identities = 13/27 (48%), Positives = 17/27 (62%), Gaps = 1/27 (3%)
Query: 112 QAPVLLSPGIHSFPFKLGLP-LGLPST 137
+A V ++PG SFP L L LG+P T
Sbjct: 84 KAQVWVAPGQWSFPLNLPLSWLGIPRT 110
>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
(Spore_III_AF). This family represents the stage III
sporulation protein AF (Spore_III_AF) of the bacterial
endospore formation program, which exists in some but
not all members of the Firmicutes (formerly called
low-GC Gram-positives). The C-terminal region of these
proteins is poorly conserved.
Length = 185
Score = 28.4 bits (64), Expect = 5.5
Identities = 9/55 (16%), Positives = 23/55 (41%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
++E D ++ + +E+ + + + + K K+ EE E+ E+
Sbjct: 111 IDEDLESNNFDIKEVNVTLKEESKEKQKSKVEPVVIDTQTSKPKEEEEESEEAEK 165
>gnl|CDD|217940 pfam04177, TAP42, TAP42-like family. The TOR signalling pathway
activates a cell-growth program in response to
nutrients. TIP41 (pfam04176) interacts with TAP42 and
negatively regulates the TOR signaling pathway.
Length = 335
Score = 28.8 bits (65), Expect = 5.5
Identities = 8/51 (15%), Positives = 23/51 (45%), Gaps = 7/51 (13%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
+E +QE + GG+ + + ++++ + E++++E K
Sbjct: 281 DEYLEQEMKRGGV-------PQGGGEAAASAEEEEDDEDDEEDDDEETLKA 324
>gnl|CDD|218660 pfam05620, DUF788, Protein of unknown function (DUF788). This
family consists of several eukaryotic proteins of
unknown function.
Length = 166
Score = 28.5 bits (64), Expect = 5.6
Identities = 8/21 (38%), Positives = 14/21 (66%)
Query: 24 EEETEEEKKKKKKKKKKKKKK 44
ET E K K+++K +K+ +K
Sbjct: 146 GAETNETKSKRQEKLEKRGEK 166
Score = 28.1 bits (63), Expect = 6.4
Identities = 6/26 (23%), Positives = 13/26 (50%)
Query: 20 SRGKEEETEEEKKKKKKKKKKKKKKK 45
+ E K K+++K +K+ +K
Sbjct: 141 LPSSQGAETNETKSKRQEKLEKRGEK 166
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 29.1 bits (65), Expect = 5.6
Identities = 14/23 (60%), Positives = 18/23 (78%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKE 53
+KKKKKKK K+KKKK+ E K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 29.2 bits (66), Expect = 5.6
Identities = 16/63 (25%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKK-KKRYEEEEKEEEKDNK 59
+ EK E E+E+ ++R + +E EK+ + + +++K++ +K+YEE+ ++E +
Sbjct: 194 LKEKREELLSKLEEEL--LARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQA 251
Query: 60 LLH 62
H
Sbjct: 252 EAH 254
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This
family includes outer membrane proteins such as OmpH
among others. Skp (OmpH) has been characterized as a
molecular chaperone that interacts with unfolded
proteins as they emerge in the periplasm from the Sec
translocation machinery.
Length = 140
Score = 27.9 bits (63), Expect = 5.6
Identities = 11/40 (27%), Positives = 25/40 (62%)
Query: 22 GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
GK + + EK+ KK++ + +K +K ++ +++ +KD L
Sbjct: 16 GKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATL 55
>gnl|CDD|107059 PHA01746, PHA01746, hypothetical protein.
Length = 131
Score = 28.0 bits (62), Expect = 5.7
Identities = 11/27 (40%), Positives = 21/27 (77%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEE 50
EE+ E++++K++K ++K +KK EEE
Sbjct: 104 EEDDMEQQEEKEEKGREKGRKKNVEEE 130
Score = 27.2 bits (60), Expect = 9.2
Identities = 11/29 (37%), Positives = 23/29 (79%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEE 51
+EEE + E++++K++K ++K +K+ EEE
Sbjct: 102 QEEEDDMEQQEEKEEKGREKGRKKNVEEE 130
>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078). This
family is found from fungi to humans, but its exact
function is not known.
Length = 88
Score = 27.2 bits (61), Expect = 5.7
Identities = 8/37 (21%), Positives = 25/37 (67%), Gaps = 1/37 (2%)
Query: 21 RGKE-EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
R ++ EE E+ +++ ++++K+++++K + EE +
Sbjct: 45 RKEQMEELEKAREETERERKEREERKEKRKRAIEERR 81
>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
Members of this family of small ribosomal proteins
possess one of three conserved blocks of sequence found
in proteins that stimulate the dissociation of guanine
nucleotides from G-proteins, leaving open the
possibility that MRP-S27 might be a functional partner
of GTP-binding ribosomal proteins.
Length = 417
Score = 29.0 bits (65), Expect = 5.7
Identities = 11/50 (22%), Positives = 24/50 (48%), Gaps = 2/50 (4%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E EKE + +++ E +K ++KKK +++ +EE ++
Sbjct: 366 ERLPTIEKEDLELYEQRQQLWFFENRKLWQRKKKLREQA--DEEYQQRHA 413
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 28.0 bits (63), Expect = 5.8
Identities = 18/61 (29%), Positives = 33/61 (54%), Gaps = 7/61 (11%)
Query: 14 KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE----KDNKLLHNCYLYSY 69
+E+ + E+E E +K+ +K+ ++ KK EEE +E + K N L+ LYS+
Sbjct: 72 REMLLFWKKNEKEERELRKRAEKEALEQAKK---EEELREAKRQQRKLNFLITQTELYSH 128
Query: 70 Y 70
+
Sbjct: 129 F 129
>gnl|CDD|238706 cd01418, Ribosomal_L19e_A, Ribosomal protein L19e, archaeal.
L19e is found in the large ribosomal subunit of
eukaryotes and archaea. L19e is distinct from the
ribosomal subunit L19, which is found in prokaryotes.
It consists of two small globular domains connected by
an extended segment. It is located toward the surface
of the large subunit, with one exposed end involved in
forming the intersubunit bridge with the small subunit.
The other exposed end is involved in forming the
translocon binding site, along with L22, L23, L24, L29,
and L31e subunits.
Length = 145
Score = 28.1 bits (63), Expect = 5.8
Identities = 15/46 (32%), Positives = 27/46 (58%), Gaps = 3/46 (6%)
Query: 18 GISRGKEEETEEEKKKKKKKK---KKKKKKKRYEEEEKEEEKDNKL 60
GISRG+ +E E++KK +++ +K KK R ++E+ + L
Sbjct: 54 GISRGRLKERHEKRKKGRRRGPGSRKGKKGARTPKKERWIKTIRAL 99
>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
Length = 111
Score = 27.6 bits (61), Expect = 5.8
Identities = 13/51 (25%), Positives = 23/51 (45%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K +QE E+ EE+ ++ +++ + R E E+K E NK
Sbjct: 30 KNEQENEIKEAKSKYEEKAKKTEEESLNMYNAALMEARKEAEKKAVEIINK 80
>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
CCCH-type Zn-finger protein [General function
prediction only].
Length = 299
Score = 28.9 bits (64), Expect = 5.8
Identities = 11/30 (36%), Positives = 21/30 (70%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
K+ ET K++ +KK+K + ++KR E E++
Sbjct: 46 KQIETLNLKEQLEKKEKMRMEEKRREPEKQ 75
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4;
Provisional.
Length = 191
Score = 28.2 bits (63), Expect = 6.0
Identities = 16/49 (32%), Positives = 24/49 (48%)
Query: 10 EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
+D E I K + ++ +KKK KK KKKK+ +EE E +
Sbjct: 36 DDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEE 84
Score = 28.2 bits (63), Expect = 6.9
Identities = 12/33 (36%), Positives = 18/33 (54%)
Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
KKK +KKK KK + +++EKEE +
Sbjct: 53 KKKTTPRKKKTTKKTKKKKKEKEEVPELAAEEL 85
Score = 28.2 bits (63), Expect = 7.0
Identities = 13/40 (32%), Positives = 22/40 (55%), Gaps = 4/40 (10%)
Query: 24 EEETEEEKKKKKKKKKKKKK----KKRYEEEEKEEEKDNK 59
E +++ KKKK +KKK KK+ +E+E+ E +
Sbjct: 44 EIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAE 83
>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752). This
domain family is found in eukaryotes, and is typically
between 140 and 163 amino acids in length.
Length = 148
Score = 28.1 bits (63), Expect = 6.0
Identities = 14/59 (23%), Positives = 26/59 (44%), Gaps = 7/59 (11%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK-------KRYEEEEKEEEKDN 58
E S + +E +K K+ +KK+ K K+ ++++KEEE D+
Sbjct: 51 LGVEAGASAPAAASAKASKRDKEMARKVKEYNEKKRGKSLVEQHQKKQKKKKKEEENDD 109
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 29.0 bits (66), Expect = 6.0
Identities = 12/23 (52%), Positives = 17/23 (73%)
Query: 32 KKKKKKKKKKKKKKRYEEEEKEE 54
+K++ KKKKK+KK + E EK E
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 28.9 bits (65), Expect = 6.1
Identities = 12/53 (22%), Positives = 29/53 (54%), Gaps = 8/53 (15%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EK+++E+E EEE E K + ++ ++R ++E+ E+++++
Sbjct: 179 REKKEEEEE--------EEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDN 223
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 27.6 bits (61), Expect = 6.1
Identities = 16/39 (41%), Positives = 22/39 (56%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
KE++ E+ K K KKK KKK+ EE + D +LL
Sbjct: 9 KEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELL 47
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 28.3 bits (64), Expect = 6.3
Identities = 16/65 (24%), Positives = 32/65 (49%), Gaps = 4/65 (6%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYE--EEEKEE--EKD 57
E+ ++ +D EK+ + K E EK+ K K + + ++R+E E E++E +K
Sbjct: 71 VEELRKKLKDYEKDKQSLKNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELYDKF 130
Query: 58 NKLLH 62
+
Sbjct: 131 EAAIQ 135
>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain. This family
consists of several ATP synthase E chain sequences
which are components of the CF(0) subunit.
Length = 83
Score = 27.1 bits (60), Expect = 6.4
Identities = 11/38 (28%), Positives = 19/38 (50%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
K E E ++K +K K KKK+ ++E K + +
Sbjct: 38 KIREYEAQEKLIEKAKAAYAKKKQAKQEAKSLAEGSSA 75
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a
molecular chaperone that interacts with unfolded
proteins as they emerge in the periplasm from the Sec
translocation machinery.
Length = 157
Score = 28.0 bits (63), Expect = 6.5
Identities = 13/40 (32%), Positives = 25/40 (62%)
Query: 22 GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
GK + + EK+ KK + + +KK+K ++EE++ +K L
Sbjct: 33 GKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATL 72
>gnl|CDD|232861 TIGR00181, pepF, oligoendopeptidase F. This family represents
the oligoendopeptidase F clade of the family of larger
M3 or thimet (for thiol-dependent metallopeptidase)
oligopeptidase family. Lactococcus lactis PepF
hydrolyzed peptides of 7 and 17 amino acids with fairly
broad specificity. The homolog of lactococcal PepF in
group B Streptococcus was named PepB (PMID:8757883),
with the name difference reflecting a difference in
species of origin rather activity; substrate profiles
were quite similar. Differences in substrate
specificity should be expected in other species. The
gene is duplicated in Lactococcus lactis on the plasmid
that bears it. A shortened second copy is found in
Bacillus subtilis [Protein fate, Degradation of
proteins, peptides, and glycopeptides].
Length = 591
Score = 28.8 bits (65), Expect = 6.6
Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 7/63 (11%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYL 66
+ KE+ E + E E+ K+ K KK K+ + E EEK LL+ Y
Sbjct: 17 KNKEEWELFL-------EALEEDIKEIKAFKKGLLHSKETFLEALALEEKILILLNRLYN 69
Query: 67 YSY 69
Y+
Sbjct: 70 YAS 72
>gnl|CDD|221339 pfam11957, efThoc1, THO complex subunit 1 transcription elongation
factor. The THO complex plays a role in coupling
transcription elongation to mRNA export. It is composed
of subunits THP2, HPR1, THO2 and MFT1. The THO complex
is a nuclear complex that is required for transcription
elongation through genes containing tandemly repeated
DNA sequences. The THO complex is also part of the TREX
(TRanscription EXport) complex that is involved in
coupling transcription to export of mRNAs to the
cytoplasm.
Length = 494
Score = 28.9 bits (65), Expect = 6.6
Identities = 12/50 (24%), Positives = 17/50 (34%), Gaps = 2/50 (4%)
Query: 23 KEEETEEEKKKKKKKKKK--KKKKKRYEEEEKEEEKDNKLLHNCYLYSYY 70
E E + K K + K + KR EE + + YL S
Sbjct: 185 IETELYNKIKSKTAEIKVDSRALNKRKRNPLDGEELASSFFNPKYLTSRD 234
>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
Length = 405
Score = 28.6 bits (64), Expect = 6.6
Identities = 11/34 (32%), Positives = 17/34 (50%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
+ E + K KK+K KK+K+ +EE K
Sbjct: 342 RMALLAEAARVKAKKEKLAKKRKQVSKEEAAAIK 375
>gnl|CDD|235245 PRK04183, PRK04183, glutamyl-tRNA(Gln) amidotransferase subunit
D; Validated.
Length = 419
Score = 28.7 bits (65), Expect = 6.6
Identities = 9/36 (25%), Positives = 15/36 (41%)
Query: 18 GISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
GI K E E +K +K K++ K+ +
Sbjct: 42 GIDIDKIAEIELLEKGEKPKQEPPPKEIEKDPGLPN 77
>gnl|CDD|220839 pfam10661, EssA, WXG100 protein secretion system (Wss), protein
EssA. The WXG100 protein secretion system (Wss) is
responsible for the secretion of WXG100 proteins
(pfam06013) such as ESAT-6 and CFP-10 in Mycobacterium
tuberculosis or EsxA and EsxB in Staphylococcus aureus.
In S. aureus, the Wss seems to be encoded by a locus of
eight CDS, called ess (eSAT-6 secretion system). This
locus encodes, amongst several other proteins, EssA, a
protein predicted to possess one transmembrane domain.
Due to its predicted membrane location and its absolute
requirement for WXG100 protein secretion, it has been
speculated that EssA could form a secretion apparatus
in conjunction with the polytopic membrane protein
EsaA, YukC (pfam10140) and YukAB, which is a
membrane-bound ATPase containing Ftsk/SpoIIIE domains
(pfam01580) called EssC in S. aureus and Snm1/Snm2 in
Mycobacterium tuberculosis. Proteins homologous to
EssA, YukC, EsaA and YukD seem absent from
mycobacteria.
Length = 145
Score = 27.9 bits (62), Expect = 6.7
Identities = 20/72 (27%), Positives = 31/72 (43%), Gaps = 1/72 (1%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKK-KKRYEEEEKEEEKDNKLL 61
EK ++ + E + GI E EE KKK ++K+ + + E+KE K
Sbjct: 28 EKNEKKLRETELDKLGIELFTTETEEEINKKKNAEQKEMEDIENSLFSEDKEGNVAVKET 87
Query: 62 HNCYLYSYYRVT 73
+ S Y VT
Sbjct: 88 KDSLFSSEYEVT 99
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 29.0 bits (65), Expect = 6.9
Identities = 18/68 (26%), Positives = 31/68 (45%), Gaps = 3/68 (4%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
GE KE+ + +S E E E+K + +K K+++K +E E+E E+ +
Sbjct: 503 KTFYGEFKEEINVLIEKLS---ALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERE 559
Query: 62 HNCYLYSY 69
N L
Sbjct: 560 RNKKLELE 567
>gnl|CDD|216295 pfam01093, Clusterin, Clusterin.
Length = 434
Score = 28.9 bits (65), Expect = 6.9
Identities = 23/61 (37%), Positives = 33/61 (54%), Gaps = 10/61 (16%)
Query: 6 GEEKEDQE--KEVGGISRGKE--EETEEEKKK-----KKKKKKKKKKKKRYEE-EEKEEE 55
GE+ D+E + G+ + K E TEEE K +K KK+K++ K E EEK EE
Sbjct: 10 GEKYVDEEVKNALIGVKQMKTLMERTEEEHKNLMSTLEKTKKEKEEALKLANEVEEKLEE 69
Query: 56 K 56
+
Sbjct: 70 E 70
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 28.4 bits (64), Expect = 6.9
Identities = 15/44 (34%), Positives = 26/44 (59%)
Query: 18 GISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
GI GK+++ ++E++ KK KKK + + E +K + D LL
Sbjct: 118 GIDLGKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALL 161
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 28.8 bits (65), Expect = 6.9
Identities = 7/41 (17%), Positives = 18/41 (43%)
Query: 8 EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYE 48
+ +E + + +E +++ K K K + K++Y
Sbjct: 364 KAAIEELQAEFVDNAEELAKLQDELDKIVKTKSELVKEKYH 404
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 28.7 bits (64), Expect = 7.1
Identities = 8/32 (25%), Positives = 22/32 (68%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
K+++TEEE+++K+++++++ R + K
Sbjct: 394 KQDDTEEEERRKRQERERQGTSSRSSDPSKAS 425
Score = 28.3 bits (63), Expect = 8.8
Identities = 14/36 (38%), Positives = 28/36 (77%)
Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
EEE EEE+++++++++ ++++ EEEE+E E DN
Sbjct: 446 EEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNG 481
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 28.9 bits (65), Expect = 7.1
Identities = 11/50 (22%), Positives = 23/50 (46%), Gaps = 4/50 (8%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EE E++ +E+ + E KK++ + + + R E + EE +
Sbjct: 864 EELEEELEEL----EAALRDLESRLGDLKKERDELEAQLRELERKIEELE 909
Score = 28.5 bits (64), Expect = 9.8
Identities = 13/58 (22%), Positives = 23/58 (39%), Gaps = 4/58 (6%)
Query: 3 EKGGEEKEDQEKEVGGI----SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EK EKE QE + I E+ E KK++ +++ ++ + E
Sbjct: 827 EKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRL 884
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 28.4 bits (64), Expect = 7.3
Identities = 9/32 (28%), Positives = 19/32 (59%), Gaps = 3/32 (9%)
Query: 27 TEEEKKKKKKKKKK---KKKKKRYEEEEKEEE 55
T +EK + ++ K + +++ E++KEEE
Sbjct: 200 TAKEKAIEAERAKAEAAEAEQELLREKQKEEE 231
>gnl|CDD|236012 PRK07406, PRK07406, RNA polymerase sigma factor RpoD; Validated.
Length = 373
Score = 28.6 bits (64), Expect = 7.3
Identities = 12/50 (24%), Positives = 22/50 (44%)
Query: 1 MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEE 50
+ + D +++ S +E +E+E K K + K KK Y E+
Sbjct: 14 DRQPAATDDLDSDEDEESSSENADELSEKEAKAKALARIKIGPKKVYTED 63
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 28.8 bits (64), Expect = 7.5
Identities = 16/55 (29%), Positives = 25/55 (45%), Gaps = 3/55 (5%)
Query: 10 EDQEKEVGGISRGKEEETEEEKKKKKKKK---KKKKKKKRYEEEEKEEEKDNKLL 61
E+Q++E+ ++ EE K K K K++ K E E+KE E K
Sbjct: 277 ENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKRE 331
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 28.5 bits (64), Expect = 7.5
Identities = 17/54 (31%), Positives = 29/54 (53%)
Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLYSYYRVTHICFSDP 80
T+EEK +++ +K ++ KKR E E+D + + C L + TH+C P
Sbjct: 154 TDEEKVEEELEKAREIYKKRDERARDLHEEDVDVFYGCTLCQSFAPTHVCVITP 207
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 28.6 bits (64), Expect = 7.6
Identities = 11/27 (40%), Positives = 14/27 (51%)
Query: 20 SRGKEEETEEEKKKKKKKKKKKKKKKR 46
S + + + KKK K K KK KKR
Sbjct: 757 SASTKGKAAKTVKKKSKAKSKKTTKKR 783
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 28.0 bits (63), Expect = 7.8
Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 3/60 (5%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
NE+ EE E+ E E +EE EEE + + +++ K E E K +E + + L
Sbjct: 5 NEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIA---ELEAKLDELEERYL 61
>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
This family consists of the N-terminal region of the
prokaryotic fibronectin-binding protein. Fibronectin
binding is considered to be an important virulence
factor in streptococcal infections. Fibronectin is a
dimeric glycoprotein that is present in a soluble form
in plasma and extracellular fluids; it is also present
in a fibrillar form on cell surfaces. Both the soluble
and cellular forms of fibronectin may be incorporated
into the extracellular tissue matrix. While fibronectin
has critical roles in eukaryotic cellular processes,
such as adhesion, migration and differentiation, it is
also a substrate for the attachment of bacteria. The
binding of pathogenic Streptococcus pyogenes and
Staphylococcus aureus to epithelial cells via
fibronectin facilitates their internalisation and
systemic spread within the host.
Length = 447
Score = 28.4 bits (64), Expect = 7.8
Identities = 14/23 (60%), Positives = 18/23 (78%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKK 45
+EE E+ K+KK+KKKKKKKK
Sbjct: 425 REELIEQGYLKEKKRKKKKKKKK 447
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 28.7 bits (65), Expect = 7.9
Identities = 10/50 (20%), Positives = 25/50 (50%)
Query: 11 DQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
D KE ++EE +E+ K+ + KK + ++ ++ + + K+ +
Sbjct: 166 DDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPV 215
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 28.1 bits (62), Expect = 7.9
Identities = 15/54 (27%), Positives = 31/54 (57%), Gaps = 4/54 (7%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
E+ +E+EDQE++ R +EE+T EE+ + + ++ + E ++ + EK
Sbjct: 148 EEKIDEREDQEEQ----EREREEQTIEEQSDDSEHEIIEQDESETESDDDKTEK 197
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 28.8 bits (65), Expect = 7.9
Identities = 10/69 (14%), Positives = 29/69 (42%), Gaps = 10/69 (14%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
+++ EE E + +++ ++ ++++++++ EEK +L
Sbjct: 133 SDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREW----------EEKRAELE 182
Query: 62 HNCYLYSYY 70
Y +YY
Sbjct: 183 FYYYQGTYY 191
>gnl|CDD|153331 cd07647, F-BAR_PSTPIP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
Phosphatase-Interacting Proteins. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Vetebrates contain two
Proline-Serine-Threonine Phosphatase-Interacting
Proteins (PSTPIPs), PSTPIP1 and PSTPIP2. PSTPIPs are
mainly expressed in hematopoietic cells and are involved
in the regulation of cell adhesion and motility.
Mutations in PSTPIPs have been shown to cause
autoinflammatory disorders. PSTPIP1 contains an
N-terminal F-BAR domain, PEST motifs, and a C-terminal
SH3 domain, while PSTPIP2 contains only the N-terminal
F-BAR domain. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 239
Score = 28.2 bits (63), Expect = 8.0
Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 7/56 (12%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKK--KKKKKKKKRYEEEEKEEEK 56
EK E +E Q++E R K E+ + +K KK+ KK K KK YE++ +E++K
Sbjct: 88 EKLEEFREKQKEE-----RKKTEDIMKRSQKNKKELYKKTMKAKKSYEQKCREKDK 138
>gnl|CDD|204055 pfam08764, Coagulase, Staphylococcus aureus coagulase.
Staphylococcus aureus secretes a cofactor called
coagulase. Coagulase is an extracellular protein that
forms a complex with human prothrombin, and activates it
without the usual proteolytic cleavages. The resulting
complex directly initiates blood clotting.
Length = 282
Score = 28.2 bits (63), Expect = 8.1
Identities = 11/40 (27%), Positives = 22/40 (55%)
Query: 34 KKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLYSYYRVT 73
+KKK + +K KK Y++ ++E + + Y+ Y +T
Sbjct: 73 EKKKAEYEKYKKWYKKYKEENPNSKLKMKTFHKYNLYDLT 112
>gnl|CDD|147685 pfam05663, DUF809, Protein of unknown function (DUF809). This
family consists of several proteins of unknown function
Raphanus sativus (Radish) and Brassica napus (Rape).
Length = 138
Score = 27.4 bits (60), Expect = 8.3
Identities = 17/43 (39%), Positives = 28/43 (65%)
Query: 14 KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
K+V I +G+ E EE+K+ K + + K++KK+ E E +EEK
Sbjct: 87 KKVSPIIKGEIEGKEEKKEGKGEIEGKEEKKEGKGEIEGKEEK 129
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 28.2 bits (63), Expect = 8.3
Identities = 14/57 (24%), Positives = 25/57 (43%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
K E E + + K + ++ KK K K K K K K + + + K ++ +K
Sbjct: 67 PKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSK 123
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 28.2 bits (63), Expect = 8.3
Identities = 11/57 (19%), Positives = 27/57 (47%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
EE+E E EV + + EE +KK+ + ++ +++ + +E + ++
Sbjct: 75 EERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHERARQELLEAAA 131
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 28.4 bits (64), Expect = 8.4
Identities = 9/59 (15%), Positives = 21/59 (35%)
Query: 2 NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
+ + + G + + + KKK K K K +++ ++E D+
Sbjct: 46 KKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSK 104
>gnl|CDD|206050 pfam13879, KIAA1430, KIAA1430 homologue. This is a family of
KIAA1430 homologues. The function is not known.
Length = 98
Score = 26.9 bits (60), Expect = 8.4
Identities = 11/33 (33%), Positives = 18/33 (54%), Gaps = 2/33 (6%)
Query: 31 KKKKKKKKKKKKKKKRYEEEE--KEEEKDNKLL 61
K ++ K KK +EEE E E++N++L
Sbjct: 16 KPPRRPAHLLLKLKKLQQEEERLAEIERENQIL 48
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 27.9 bits (62), Expect = 8.4
Identities = 18/47 (38%), Positives = 24/47 (51%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
+ E QEKE KEEE E+ K+++ K+KK K EEK
Sbjct: 76 EDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKRKKPFKEMNLEEK 122
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 28.1 bits (63), Expect = 8.5
Identities = 10/22 (45%), Positives = 11/22 (50%)
Query: 24 EEETEEEKKKKKKKKKKKKKKK 45
E EKKK + KKK K K
Sbjct: 148 RTEILAEKKKPRSKKKSSKNSK 169
Score = 28.1 bits (63), Expect = 9.7
Identities = 10/26 (38%), Positives = 13/26 (50%)
Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKR 46
E E++K + KKK K KK R
Sbjct: 147 ERTEILAEKKKPRSKKKSSKNSKKLR 172
>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 27.8 bits (62), Expect = 8.5
Identities = 13/51 (25%), Positives = 27/51 (52%)
Query: 10 EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
E+ K+ G I + + + T+ + + +K + + +K EE K +K N+L
Sbjct: 129 EENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNEL 179
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 28.2 bits (63), Expect = 8.6
Identities = 12/34 (35%), Positives = 19/34 (55%)
Query: 7 EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKK 40
EE + EKE R ++ EEE++ K+K K+
Sbjct: 148 EEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQ 181
>gnl|CDD|220121 pfam09135, Alb1, Alb1. Alb1 is a nuclear shuttling factor
involved in ribosome biogenesis.
Length = 112
Score = 27.1 bits (60), Expect = 8.6
Identities = 11/42 (26%), Positives = 25/42 (59%)
Query: 9 KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEE 50
K K G ++ K E ++K++KKK K+ +K+++ +++
Sbjct: 28 KRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRKRQKK 69
>gnl|CDD|227474 COG5145, RAD14, DNA excision repair protein [DNA replication,
recombination, and repair].
Length = 292
Score = 28.0 bits (62), Expect = 8.7
Identities = 13/32 (40%), Positives = 22/32 (68%), Gaps = 1/32 (3%)
Query: 22 GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
G EE + EK++++K K +K+KK E++ KE
Sbjct: 209 GSLEELDREKQRREKMKDDRKEKK-LEKKIKE 239
>gnl|CDD|219589 pfam07808, RED_N, RED-like protein N-terminal region. This
family contains sequences that are similar to the
N-terminal region of Red protein. This and related
proteins contain a RED repeat which consists of a
number of RE and RD sequence elements. The region in
question has several conserved NLS sequences and a
putative trimeric coiled-coil region, suggesting that
these proteins are expressed in the nucleus. The
function of Red protein is unknown, but efficient
sequestration to nuclear bodies suggests that its
expression may be tightly regulated of that the protein
self-aggregates extremely efficiently.
Length = 238
Score = 27.9 bits (62), Expect = 8.8
Identities = 9/22 (40%), Positives = 13/22 (59%)
Query: 38 KKKKKKKKRYEEEEKEEEKDNK 59
KKKKK ++EE E++ N
Sbjct: 1 KKKKKYAYLRKQEENAEKEINP 22
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 27.9 bits (62), Expect = 8.9
Identities = 12/26 (46%), Positives = 19/26 (73%)
Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEE 50
EE +E+KK K++K+ KKK+R + E
Sbjct: 94 EEEKEKKKNLNKEEKEAKKKERAKRE 119
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 28.6 bits (64), Expect = 8.9
Identities = 13/43 (30%), Positives = 23/43 (53%)
Query: 3 EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
+KG ++K+ + G + + EE+ KK KK +KK + K
Sbjct: 417 DKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDK 459
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 28.7 bits (65), Expect = 8.9
Identities = 17/42 (40%), Positives = 26/42 (61%)
Query: 19 ISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
SR KE++ +EE K K KK ++ E++ +EEEKD +L
Sbjct: 488 SSRRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQL 529
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein
is found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 28.1 bits (63), Expect = 9.0
Identities = 12/23 (52%), Positives = 19/23 (82%)
Query: 35 KKKKKKKKKKKRYEEEEKEEEKD 57
KKKKKKKK K + ++++EE++D
Sbjct: 1 KKKKKKKKSKLSFGDDDEEEDED 23
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 28.3 bits (63), Expect = 9.0
Identities = 11/39 (28%), Positives = 19/39 (48%), Gaps = 1/39 (2%)
Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEE-EKEEEKDNKL 60
+T+ KKK + K++ + EE EKE K ++
Sbjct: 410 TLYKTKIFYHKKKINQNNKEQDIKKEELLEKEFVKKSEK 448
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 28.7 bits (65), Expect = 9.1
Identities = 9/30 (30%), Positives = 17/30 (56%)
Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
EEE +K++ K++ +++ K EE E
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEA 68
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 28.4 bits (63), Expect = 9.2
Identities = 14/42 (33%), Positives = 24/42 (57%)
Query: 19 ISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
S K++ +E+ + +K+ KK K+ EEEE+ E+D L
Sbjct: 4 ESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAML 45
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 27.7 bits (62), Expect = 9.6
Identities = 13/40 (32%), Positives = 18/40 (45%)
Query: 6 GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
E K+ ++KE KE KK +K + KKKK
Sbjct: 157 AERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|212164 cd11650, AT4G37440_like, Uncharacterized protein domain conserved
in plants. This domain contains an extensive protein
sequence fragment that appears conserved in a number of
plant proteins, including the gene product of
Arabidopsis thaliana locus AT4G37440, which has been
identified in transcriptional profiling as expressed at
different levels in white cabbage cultivars.
Length = 253
Score = 27.8 bits (62), Expect = 9.7
Identities = 14/50 (28%), Positives = 24/50 (48%), Gaps = 13/50 (26%)
Query: 13 EKEVGGISRGKE-----EETEEEKKK--------KKKKKKKKKKKKRYEE 49
++E+ I + KE + E E K ++K K++K+KR EE
Sbjct: 114 DRELALIEQEKELQLEGTKAEGENSKSAPFSGERHERKVMKRRKRKRVEE 163
>gnl|CDD|233048 TIGR00605, rad4, DNA repair protein rad4. All proteins in this
family for which functions are known are involved in
targeting nucleotide excision repair to specific
regions of the genome.This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 713
Score = 28.3 bits (63), Expect = 9.9
Identities = 21/82 (25%), Positives = 34/82 (41%), Gaps = 7/82 (8%)
Query: 10 EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK--LLHN---- 63
ED KE + R E E E EK+ K +++K +++ + K+ K L H
Sbjct: 3 EDLAKEYFELIRKVENEKEAEKQPKSRRRKVRRENEPSLRRRKKRFKTGLNELPHEVVLM 62
Query: 64 CYLYSYYRVTHICFSDPPGQAP 85
C L S + + S P +
Sbjct: 63 CNLDSTHSDDRVV-SVPDSLSV 83
>gnl|CDD|215349 PLN02647, PLN02647, acyl-CoA thioesterase.
Length = 437
Score = 28.2 bits (63), Expect = 9.9
Identities = 16/42 (38%), Positives = 24/42 (57%), Gaps = 6/42 (14%)
Query: 26 ETEEEK------KKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
ETEEEK + + K +KKK+ +++ E E E E+ LL
Sbjct: 225 ETEEEKLLFEEAEARNKLRKKKRGEQKREFENGEAERLEALL 266
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.137 0.409
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,926,567
Number of extensions: 2282164
Number of successful extensions: 14838
Number of sequences better than 10.0: 1
Number of HSP's gapped: 11304
Number of HSP's successfully gapped: 1122
Length of query: 446
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 346
Effective length of database: 6,502,202
Effective search space: 2249761892
Effective search space used: 2249761892
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (27.4 bits)