RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy13443
(188 letters)
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 39.2 bits (91), Expect = 9e-04
Identities = 15/21 (71%), Positives = 18/21 (85%)
Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
GG+ EEEEEEEEEEE+ E+E
Sbjct: 860 GGDSEEEEEEEEEEEEEEEEE 880
Score = 38.4 bits (89), Expect = 0.002
Identities = 19/31 (61%), Positives = 21/31 (67%), Gaps = 6/31 (19%)
Query: 160 VDKKGG------EEEEEEEEEEEEEDNIEQE 184
VD GG EEEEEEEEEEEEE+ E+E
Sbjct: 852 VDGGGGSDGGDSEEEEEEEEEEEEEEEEEEE 882
Score = 37.7 bits (87), Expect = 0.002
Identities = 15/19 (78%), Positives = 17/19 (89%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
EEEEEEEEEEEEE+ E+E
Sbjct: 868 EEEEEEEEEEEEEEEEEEE 886
Score = 37.3 bits (86), Expect = 0.003
Identities = 16/24 (66%), Positives = 19/24 (79%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
D + EEEEEEEEEEEEE+ E+E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEE 885
Score = 37.3 bits (86), Expect = 0.003
Identities = 15/17 (88%), Positives = 16/17 (94%)
Query: 166 EEEEEEEEEEEEEDNIE 182
EEEEEEEEEEEEE+N E
Sbjct: 876 EEEEEEEEEEEEEENEE 892
Score = 37.3 bits (86), Expect = 0.004
Identities = 15/19 (78%), Positives = 17/19 (89%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
EEEEEEEEEEEEE+ E+E
Sbjct: 871 EEEEEEEEEEEEEEEEEEE 889
Score = 37.3 bits (86), Expect = 0.004
Identities = 15/24 (62%), Positives = 20/24 (83%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
+++ EEEEEEEEEEEEE+ E+E
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEE 888
Score = 36.9 bits (85), Expect = 0.005
Identities = 15/19 (78%), Positives = 16/19 (84%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
EEEEEEEEEEEEE+ E E
Sbjct: 873 EEEEEEEEEEEEEEEEENE 891
Score = 36.5 bits (84), Expect = 0.006
Identities = 15/23 (65%), Positives = 17/23 (73%)
Query: 166 EEEEEEEEEEEEEDNIEQEFLFE 188
EEEEEEEEEEEEE+ E+ E
Sbjct: 875 EEEEEEEEEEEEEEENEEPLSLE 897
Score = 36.5 bits (84), Expect = 0.007
Identities = 15/21 (71%), Positives = 17/21 (80%)
Query: 166 EEEEEEEEEEEEEDNIEQEFL 186
EEEEEEEEEEEEE+ +E L
Sbjct: 874 EEEEEEEEEEEEEEEENEEPL 894
Score = 35.7 bits (82), Expect = 0.010
Identities = 14/19 (73%), Positives = 16/19 (84%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
EEEEEEEEEEEEE+ E+
Sbjct: 872 EEEEEEEEEEEEEEEEEEN 890
Score = 33.0 bits (75), Expect = 0.083
Identities = 13/20 (65%), Positives = 16/20 (80%)
Query: 166 EEEEEEEEEEEEEDNIEQEF 185
EEEEEEEEEEE E+ + E+
Sbjct: 879 EEEEEEEEEEENEEPLSLEW 898
>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain.
Length = 26
Score = 33.1 bits (76), Expect = 0.004
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 121 HLDDHMRRHTGEKPHACGMCGYECAQ 146
+L HMR HTGEKP+ C +CG +
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 27.4 bits (61), Expect = 0.57
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 93 YMRTHIDTHNGEKPFRCALCAYSARR 118
+R H+ TH GEKP++C +C S
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 25.4 bits (56), Expect = 2.7
Identities = 7/25 (28%), Positives = 13/25 (52%)
Query: 1 MLRHVRTHLAKQQFECDFCPYGAKQ 25
+ RH+RTH ++ ++C C
Sbjct: 2 LRRHMRTHTGEKPYKCPVCGKSFSS 26
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 34.5 bits (80), Expect = 0.004
Identities = 12/14 (85%), Positives = 14/14 (100%)
Query: 166 EEEEEEEEEEEEED 179
EEE++EEEEEEEED
Sbjct: 67 EEEKKEEEEEEEED 80
Score = 34.1 bits (79), Expect = 0.006
Identities = 10/14 (71%), Positives = 14/14 (100%)
Query: 166 EEEEEEEEEEEEED 179
E++EEEEEEEE++D
Sbjct: 69 EKKEEEEEEEEDDD 82
Score = 34.1 bits (79), Expect = 0.007
Identities = 11/14 (78%), Positives = 14/14 (100%)
Query: 166 EEEEEEEEEEEEED 179
EE++EEEEEEEE+D
Sbjct: 68 EEKKEEEEEEEEDD 81
Score = 31.1 bits (71), Expect = 0.080
Identities = 10/14 (71%), Positives = 13/14 (92%)
Query: 166 EEEEEEEEEEEEED 179
EEE++EEEEEEE+
Sbjct: 66 AEEEKKEEEEEEEE 79
Score = 29.5 bits (67), Expect = 0.34
Identities = 9/14 (64%), Positives = 12/14 (85%)
Query: 166 EEEEEEEEEEEEED 179
EEE++EEEEEE+
Sbjct: 65 AAEEEKKEEEEEEE 78
Score = 28.7 bits (65), Expect = 0.52
Identities = 7/16 (43%), Positives = 9/16 (56%)
Query: 163 KGGEEEEEEEEEEEEE 178
EEE++EEEE
Sbjct: 60 AAAAAAAEEEKKEEEE 75
Score = 28.7 bits (65), Expect = 0.58
Identities = 8/18 (44%), Positives = 11/18 (61%)
Query: 162 KKGGEEEEEEEEEEEEED 179
EEE++EEEEE+
Sbjct: 60 AAAAAAAEEEKKEEEEEE 77
Score = 27.6 bits (62), Expect = 1.5
Identities = 7/21 (33%), Positives = 12/21 (57%)
Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
EEE++EEE+ E++
Sbjct: 60 AAAAAAAEEEKKEEEEEEEED 80
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 34.2 bits (79), Expect = 0.009
Identities = 12/24 (50%), Positives = 16/24 (66%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNIE 182
++K EEEEEEE+EE EE+
Sbjct: 76 AAEEKKEEEEEEEEKEESEEEAAA 99
Score = 32.6 bits (75), Expect = 0.036
Identities = 12/22 (54%), Positives = 17/22 (77%)
Query: 163 KGGEEEEEEEEEEEEEDNIEQE 184
EE++EEEEEEEE++ E+E
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEE 96
Score = 31.5 bits (72), Expect = 0.088
Identities = 11/22 (50%), Positives = 16/22 (72%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIE 182
E++EEEEEEEE+E++ E
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEE 95
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 35.2 bits (81), Expect = 0.013
Identities = 17/37 (45%), Positives = 24/37 (64%), Gaps = 1/37 (2%)
Query: 149 QLMQHLRKV-HKVDKKGGEEEEEEEEEEEEEDNIEQE 184
+L + L++ H V ++ EEEEEEEEEEEEE +
Sbjct: 385 ELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEA 421
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 34.7 bits (80), Expect = 0.020
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 166 EEEEEEEEEEEEEDNIEQEFLFE 188
EEE+EEEEEEE+ED + +
Sbjct: 392 EEEDEEEEEEEDEDEGPSKEHSD 414
Score = 30.5 bits (69), Expect = 0.52
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 161 DKKGGEEEEEEEEEEEEED 179
D + E+EEEEEEE+E+E
Sbjct: 389 DDEEEEDEEEEEEEDEDEG 407
Score = 29.3 bits (66), Expect = 1.3
Identities = 10/21 (47%), Positives = 17/21 (80%)
Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
++EEEE+EEEEEE++ ++
Sbjct: 387 ERDDEEEEDEEEEEEEDEDEG 407
Score = 28.6 bits (64), Expect = 2.4
Identities = 12/24 (50%), Positives = 16/24 (66%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
D ++EEEE+EEEEE+ E E
Sbjct: 383 DANTERDDEEEEDEEEEEEEDEDE 406
Score = 27.8 bits (62), Expect = 4.2
Identities = 9/26 (34%), Positives = 19/26 (73%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
+++ E ++EEEE+EEEE+ +++
Sbjct: 380 EIEDANTERDDEEEEDEEEEEEEDED 405
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 33.5 bits (77), Expect = 0.024
Identities = 14/33 (42%), Positives = 21/33 (63%)
Query: 152 QHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
QHL + + EE+EEEE++EE++D E E
Sbjct: 104 QHLVASEEDESDDDEEDEEEEDDEEDDDEDESE 136
Score = 30.8 bits (70), Expect = 0.22
Identities = 8/19 (42%), Positives = 17/19 (89%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
E+EEEE++EE+++++ +E
Sbjct: 119 EDEEEEDDEEDDDEDESEE 137
Score = 29.6 bits (67), Expect = 0.48
Identities = 9/19 (47%), Positives = 18/19 (94%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
EEEE++EE+++E+++ E+E
Sbjct: 121 EEEEDDEEDDDEDESEEEE 139
Score = 28.4 bits (64), Expect = 1.4
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 153 HLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
H+ H V + E +++EE+EEEE+D + +
Sbjct: 100 HISGQHLVASEEDESDDDEEDEEEEDDEEDDD 131
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 34.3 bits (79), Expect = 0.033
Identities = 13/30 (43%), Positives = 19/30 (63%)
Query: 155 RKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
R ++ EEEE+E+EEEEEE+ +E
Sbjct: 338 RCTWVNPEQKDEEEEQEDEEEEEEEEEPEE 367
Score = 33.5 bits (77), Expect = 0.045
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 156 KVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
++K EEE+E+EEEEEEE+ E+
Sbjct: 340 TWVNPEQKDEEEEQEDEEEEEEEEEPEEP 368
Score = 32.7 bits (75), Expect = 0.085
Identities = 13/24 (54%), Positives = 18/24 (75%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
D++ +E+EEEEEEEEE + E E
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPE 371
Score = 31.2 bits (71), Expect = 0.27
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
K EE+E+EEEEEEEE+ E E
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEPE 369
Score = 30.8 bits (70), Expect = 0.39
Identities = 11/24 (45%), Positives = 18/24 (75%)
Query: 165 GEEEEEEEEEEEEEDNIEQEFLFE 188
E+++EEEE+E+EE+ E+E E
Sbjct: 344 PEQKDEEEEQEDEEEEEEEEEPEE 367
Score = 30.0 bits (68), Expect = 0.72
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
+ + E+E+EEEEEEEEE +
Sbjct: 345 EQKDEEEEQEDEEEEEEEEEPEEPEP 370
Score = 29.3 bits (66), Expect = 1.1
Identities = 12/28 (42%), Positives = 16/28 (57%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQEFL 186
+ +++ EEEEEEEE EE E L
Sbjct: 350 EEEQEDEEEEEEEEEPEEPEPEEGPPLL 377
Score = 26.6 bits (59), Expect = 10.0
Identities = 14/26 (53%), Positives = 19/26 (73%)
Query: 163 KGGEEEEEEEEEEEEEDNIEQEFLFE 188
+G + EEEEEEEEEE+ E++ L E
Sbjct: 282 RGLYQLEEEEEEEEEEEPAERDELEE 307
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 34.1 bits (79), Expect = 0.033
Identities = 11/18 (61%), Positives = 14/18 (77%)
Query: 162 KKGGEEEEEEEEEEEEED 179
++ EEEEEEEEE EE+
Sbjct: 303 EEEEEEEEEEEEEPSEEE 320
Score = 33.3 bits (77), Expect = 0.048
Identities = 15/19 (78%), Positives = 17/19 (89%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
EEEEEEEEEEEEE+ E+E
Sbjct: 302 EEEEEEEEEEEEEEPSEEE 320
Score = 32.5 bits (75), Expect = 0.11
Identities = 11/23 (47%), Positives = 14/23 (60%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNI 181
++ EEEEEEEEEE E+
Sbjct: 299 AAAEEEEEEEEEEEEEEPSEEEA 321
Score = 31.4 bits (72), Expect = 0.26
Identities = 13/19 (68%), Positives = 15/19 (78%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
EEEEEEEEEEEE+ +E
Sbjct: 301 AEEEEEEEEEEEEEEPSEE 319
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 33.2 bits (76), Expect = 0.043
Identities = 13/34 (38%), Positives = 21/34 (61%)
Query: 151 MQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
++ L ++ +EEEEEEEEEE+ED + +
Sbjct: 160 LKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDD 193
Score = 31.6 bits (72), Expect = 0.17
Identities = 14/33 (42%), Positives = 23/33 (69%)
Query: 148 SQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDN 180
S L + L+++ D +E++EEEEEEEEE++
Sbjct: 154 SMLEKKLKELEAEDVDEEDEKDEEEEEEEEEED 186
Score = 31.3 bits (71), Expect = 0.22
Identities = 13/24 (54%), Positives = 19/24 (79%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
D+K EEEEEEEEE+E+ D+ + +
Sbjct: 172 DEKDEEEEEEEEEEDEDFDDDDDD 195
Score = 29.7 bits (67), Expect = 0.73
Identities = 8/29 (27%), Positives = 19/29 (65%), Gaps = 4/29 (13%)
Query: 161 DKKGGEEEEEEEEEEEEED----NIEQEF 185
+++ EEE+E+ ++++++D N E F
Sbjct: 178 EEEEEEEEDEDFDDDDDDDDDDYNAENYF 206
Score = 29.3 bits (66), Expect = 0.90
Identities = 11/24 (45%), Positives = 18/24 (75%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
+ + EEEEEEEEEE+E+ + + +
Sbjct: 171 EDEKDEEEEEEEEEEDEDFDDDDD 194
Score = 27.8 bits (62), Expect = 2.8
Identities = 11/34 (32%), Positives = 21/34 (61%)
Query: 155 RKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
+ + EE+E++EEEEEEE+ +++F +
Sbjct: 159 KLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDD 192
>gnl|CDD|219038 pfam06459, RR_TM4-6, Ryanodine Receptor TM 4-6. This region covers
TM regions 4-6 of the ryanodine receptor 1 family.
Length = 217
Score = 33.1 bits (76), Expect = 0.051
Identities = 15/37 (40%), Positives = 18/37 (48%), Gaps = 6/37 (16%)
Query: 158 HKVDKKGGEEEEEEE------EEEEEEDNIEQEFLFE 188
+KV EEEEEE EE+EE+ I L E
Sbjct: 145 YKVSTSPAEEEEEEGSGSGDDGEEDEEEGIVYFVLQE 181
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 33.5 bits (77), Expect = 0.054
Identities = 16/32 (50%), Positives = 22/32 (68%), Gaps = 1/32 (3%)
Query: 148 SQLMQHLRKV-HKVDKKGGEEEEEEEEEEEEE 178
+L + L++ H V ++ EEEEEEEEEEEE
Sbjct: 385 EELKKFLKEKEHPVVERWAAEEEEEEEEEEEE 416
Score = 30.0 bits (68), Expect = 0.70
Identities = 13/23 (56%), Positives = 17/23 (73%)
Query: 166 EEEEEEEEEEEEEDNIEQEFLFE 188
EEEEEEEEEEEEE+ + + +
Sbjct: 406 EEEEEEEEEEEEEEPVAEVMMMP 428
Score = 27.0 bits (60), Expect = 6.7
Identities = 11/17 (64%), Positives = 13/17 (76%)
Query: 168 EEEEEEEEEEEDNIEQE 184
EEEEEEEEE+ E+E
Sbjct: 403 AAEEEEEEEEEEEEEEE 419
Score = 27.0 bits (60), Expect = 7.2
Identities = 11/18 (61%), Positives = 13/18 (72%)
Query: 167 EEEEEEEEEEEEDNIEQE 184
EEEEEEEEEE+ E+
Sbjct: 403 AAEEEEEEEEEEEEEEEP 420
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 33.1 bits (76), Expect = 0.076
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 164 GGEEEEEEEEEEEEEDNIEQEFLFE 188
EEEE EEEEEEE+ E+ F E
Sbjct: 309 EEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 28.5 bits (64), Expect = 2.5
Identities = 12/20 (60%), Positives = 13/20 (65%)
Query: 166 EEEEEEEEEEEEEDNIEQEF 185
E EEEEEEEEE E+E
Sbjct: 303 PEPEEEEEEEEEVPEEEEEE 322
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 31.6 bits (72), Expect = 0.078
Identities = 12/22 (54%), Positives = 17/22 (77%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIE 182
+++ EEEEEEEEEE EE+ +
Sbjct: 77 EEEEEEEEEEEEEEESEEEAMA 98
Score = 30.8 bits (70), Expect = 0.13
Identities = 15/19 (78%), Positives = 17/19 (89%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
EEEEEEEEEEEEE+ E+E
Sbjct: 77 EEEEEEEEEEEEEEESEEE 95
Score = 30.8 bits (70), Expect = 0.14
Identities = 14/19 (73%), Positives = 16/19 (84%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
EEEEEEEEEEEEE+ +E
Sbjct: 76 EEEEEEEEEEEEEEEESEE 94
Score = 30.0 bits (68), Expect = 0.24
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNI 181
+++ EEEEEEEEEEE E+
Sbjct: 74 AAEEEEEEEEEEEEEEEESEEEA 96
Score = 29.6 bits (67), Expect = 0.40
Identities = 12/21 (57%), Positives = 14/21 (66%)
Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
EEEEEEEEEEE+ E+
Sbjct: 72 AAAAEEEEEEEEEEEEEEEES 92
Score = 29.6 bits (67), Expect = 0.40
Identities = 14/19 (73%), Positives = 15/19 (78%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
EEEEEEEEEEEE+ E E
Sbjct: 75 AEEEEEEEEEEEEEEEESE 93
Score = 28.5 bits (64), Expect = 0.79
Identities = 12/21 (57%), Positives = 14/21 (66%)
Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
EEEEEEEEEE+ E+E
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEE 91
Score = 26.5 bits (59), Expect = 4.4
Identities = 11/21 (52%), Positives = 13/21 (61%)
Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
EEEEEEEEE+ E+E
Sbjct: 70 AAAAAAEEEEEEEEEEEEEEE 90
>gnl|CDD|215121 PLN02189, PLN02189, cellulose synthase.
Length = 1040
Score = 33.1 bits (75), Expect = 0.092
Identities = 28/106 (26%), Positives = 41/106 (38%), Gaps = 26/106 (24%)
Query: 98 IDTHNGEKPFR------CALCAYSARRKTHLDDHMRRHTGEKPHACGMCG---------Y 142
I H KP R C +C D+ G+ AC CG Y
Sbjct: 20 IHGHEEPKPLRNLDGQVCEICG---------DEIGLTVDGDLFVACNECGFPVCRPCYEY 70
Query: 143 ECAQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
E +G+Q + +K K G E +++EE+ D+IE EF +
Sbjct: 71 ERREGTQNCPQCKTRYKRLK--GSPRVEGDDDEEDIDDIEHEFNID 114
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 31.2 bits (71), Expect = 0.098
Identities = 11/14 (78%), Positives = 12/14 (85%)
Query: 166 EEEEEEEEEEEEED 179
EEEE+EEE EEE D
Sbjct: 87 EEEEKEEEAEEESD 100
Score = 31.2 bits (71), Expect = 0.10
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 162 KKGGEEEEEEEEEEEEEDNIE 182
+ E EEEE+EEE EE++ +
Sbjct: 81 AEADEAEEEEKEEEAEEESDD 101
Score = 30.8 bits (70), Expect = 0.14
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQEFLF 187
+E EEEE+EEE E+ + + LF
Sbjct: 79 AAAEADEAEEEEKEEEAEEESDDDMLF 105
Score = 28.5 bits (64), Expect = 0.83
Identities = 8/25 (32%), Positives = 13/25 (52%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQEF 185
+ +E EEEE+EEE + +
Sbjct: 78 EAAAEADEAEEEEKEEEAEEESDDD 102
Score = 27.4 bits (61), Expect = 2.4
Identities = 10/28 (35%), Positives = 14/28 (50%), Gaps = 2/28 (7%)
Query: 159 KVDKKGGEEEEEEEE--EEEEEDNIEQE 184
G E E +E EEE+E+ E+E
Sbjct: 71 AAAAAGAEAAAEADEAEEEEKEEEAEEE 98
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 30.9 bits (70), Expect = 0.13
Identities = 10/22 (45%), Positives = 18/22 (81%)
Query: 162 KKGGEEEEEEEEEEEEEDNIEQ 183
+K E+EEE+++EEE+E+ E+
Sbjct: 76 EKAEEKEEEKKKEEEKEEEEEE 97
Score = 29.8 bits (67), Expect = 0.28
Identities = 11/24 (45%), Positives = 19/24 (79%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNIE 182
K ++K E+++EEE+EEEEE+ +
Sbjct: 77 KAEEKEEEKKKEEEKEEEEEEALA 100
Score = 29.8 bits (67), Expect = 0.31
Identities = 10/23 (43%), Positives = 19/23 (82%)
Query: 162 KKGGEEEEEEEEEEEEEDNIEQE 184
++ EE+EEE+++EEE++ E+E
Sbjct: 75 EEKAEEKEEEKKKEEEKEEEEEE 97
Score = 28.2 bits (63), Expect = 1.1
Identities = 9/18 (50%), Positives = 15/18 (83%)
Query: 161 DKKGGEEEEEEEEEEEEE 178
+K +EEE+++EEE+EE
Sbjct: 76 EKAEEKEEEKKKEEEKEE 93
Score = 26.7 bits (59), Expect = 4.3
Identities = 9/21 (42%), Positives = 16/21 (76%)
Query: 159 KVDKKGGEEEEEEEEEEEEED 179
+ + EEE+++EEE+EEE+
Sbjct: 75 EEKAEEKEEEKKKEEEKEEEE 95
>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
only].
Length = 467
Score = 32.0 bits (72), Expect = 0.15
Identities = 25/77 (32%), Positives = 32/77 (41%), Gaps = 2/77 (2%)
Query: 85 NYHTPTRKYMRTHIDTHNGEKPFRCALCAYSARRKTHLDDHMRRHTGEKPHACGMCGYEC 144
N + T N +P C C S R HL H+R HTGEKP C G +
Sbjct: 12 NNSVLSSTPKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDK 71
Query: 145 AQG--SQLMQHLRKVHK 159
+ +L +HLR H
Sbjct: 72 SFSRPLELSRHLRTHHN 88
Score = 27.4 bits (60), Expect = 6.3
Identities = 18/65 (27%), Positives = 26/65 (40%), Gaps = 2/65 (3%)
Query: 73 PDASYTYMCYLCNYHTPTRKYMRTHIDTHNGEKPFRCAL--CAYSARRKTHLDDHMRRHT 130
+A C C +++ HI +H GEKP +C+ C S R L H+R H
Sbjct: 28 SNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHH 87
Query: 131 GEKPH 135
Sbjct: 88 NNPSD 92
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 32.0 bits (72), Expect = 0.15
Identities = 13/23 (56%), Positives = 17/23 (73%)
Query: 162 KKGGEEEEEEEEEEEEEDNIEQE 184
+ E+EEEEEEEEEEE +E+
Sbjct: 133 DESDEDEEEEEEEEEEEAEVEEN 155
Score = 32.0 bits (72), Expect = 0.15
Identities = 13/30 (43%), Positives = 20/30 (66%)
Query: 155 RKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
+K KK +E+E +E+EEEEE+ E+E
Sbjct: 120 KKAGNAGKKATKEDESDEDEEEEEEEEEEE 149
Score = 31.2 bits (70), Expect = 0.26
Identities = 16/30 (53%), Positives = 20/30 (66%)
Query: 155 RKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
+K K D+ +EEEEEEEEEEE + E E
Sbjct: 127 KKATKEDESDEDEEEEEEEEEEEAEVEENE 156
Score = 29.3 bits (65), Expect = 1.2
Identities = 13/18 (72%), Positives = 14/18 (77%)
Query: 166 EEEEEEEEEEEEEDNIEQ 183
EEEEEEEEEE E + EQ
Sbjct: 140 EEEEEEEEEEAEVEENEQ 157
Score = 26.6 bits (58), Expect = 8.6
Identities = 9/19 (47%), Positives = 10/19 (52%)
Query: 164 GGEEEEEEEEEEEEEDNIE 182
GG+ EE EEE E E
Sbjct: 177 GGDNGEEGEEESVTEAEAE 195
>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
(DUF2098). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 91
Score = 30.4 bits (69), Expect = 0.15
Identities = 11/37 (29%), Positives = 20/37 (54%)
Query: 148 SQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
+ L + DK ++EE EE++EE + I++E
Sbjct: 37 TDLYYRTDYLEVTDKVKEKKEEREEDKEELIERIKKE 73
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 30.8 bits (70), Expect = 0.16
Identities = 9/20 (45%), Positives = 16/20 (80%)
Query: 162 KKGGEEEEEEEEEEEEEDNI 181
+ +EE++EEEEEE +D++
Sbjct: 79 AEAKKEEKKEEEEEESDDDM 98
Score = 30.0 bits (68), Expect = 0.28
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQEFLF 187
++EE++EEEEEE D+ LF
Sbjct: 77 AAAEAKKEEKKEEEEEESDDDMGFGLF 103
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 31.1 bits (71), Expect = 0.19
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
K +K+ EEEE E EE +EE+ I++
Sbjct: 98 KKEKEEEEEEEVEVEELDEEEQIDEL 123
Score = 28.4 bits (64), Expect = 1.5
Identities = 18/32 (56%), Positives = 21/32 (65%)
Query: 153 HLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
+RK+ +DKK EEEEEEE E EE D EQ
Sbjct: 89 KVRKLLGLDKKEKEEEEEEEVEVEELDEEEQI 120
Score = 28.0 bits (63), Expect = 1.9
Identities = 11/26 (42%), Positives = 14/26 (53%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
+K EEEEE E EE +E+ E
Sbjct: 97 DKKEKEEEEEEEVEVEELDEEEQIDE 122
Score = 27.3 bits (61), Expect = 3.2
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
+ +++ EE E EE +EEE+ + E
Sbjct: 100 EKEEEEEEEVEVEELDEEEQIDELLE 125
Score = 26.9 bits (60), Expect = 4.9
Identities = 8/26 (30%), Positives = 18/26 (69%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
+ +++ E EE +EEE+ ++ +E+E
Sbjct: 102 EEEEEEEVEVEELDEEEQIDELLEKE 127
Score = 26.5 bits (59), Expect = 6.1
Identities = 12/24 (50%), Positives = 16/24 (66%)
Query: 162 KKGGEEEEEEEEEEEEEDNIEQEF 185
K +EEEEEEE E EE + E++
Sbjct: 97 DKKEKEEEEEEEVEVEELDEEEQI 120
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 30.6 bits (69), Expect = 0.20
Identities = 11/20 (55%), Positives = 17/20 (85%)
Query: 162 KKGGEEEEEEEEEEEEEDNI 181
K ++EE++EEEEEEED++
Sbjct: 87 KAEAKKEEKKEEEEEEEDDL 106
Score = 29.1 bits (65), Expect = 0.55
Identities = 12/23 (52%), Positives = 15/23 (65%), Gaps = 3/23 (13%)
Query: 157 VHKVDKKGGEEEEEEEEEEEEED 179
KK EE++EEEEEEE+D
Sbjct: 86 AKAEAKK---EEKKEEEEEEEDD 105
Score = 27.2 bits (60), Expect = 3.1
Identities = 9/20 (45%), Positives = 15/20 (75%)
Query: 161 DKKGGEEEEEEEEEEEEEDN 180
K ++EE++EEEEEE++
Sbjct: 85 GAKAEAKKEEKKEEEEEEED 104
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 31.4 bits (71), Expect = 0.24
Identities = 13/40 (32%), Positives = 23/40 (57%)
Query: 149 QLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
+L+ L ++ K D E E +EE+E+ +N E ++L E
Sbjct: 81 RLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLRE 120
>gnl|CDD|238704 cd01413, SIR2_Af2, SIR2_Af2: Archaeal and prokaryotic group which
includes Archaeoglobus fulgidus Sir2-Af2, Sulfolobus
solfataricus ssSir2, and several bacterial homologs; and
are members of the SIR2 family of proteins, silent
information regulator 2 (Sir2) enzymes which catalyze
NAD+-dependent protein/histone deacetylation. Sir2
proteins have been shown to regulate gene silencing, DNA
repair, metabolic enzymes, and life span. The Sir2
homolog from the archaea Sulfolobus solftaricus
deacetylates the non-specific DNA protein Alba to
mediate transcription repression.
Length = 222
Score = 30.8 bits (70), Expect = 0.35
Identities = 15/56 (26%), Positives = 24/56 (42%), Gaps = 9/56 (16%)
Query: 22 GAKQAADVHNHVQQIHMGVNFVCVHCKQ---FEVVPTRKTQTLEHCATCVDMVRPD 74
G+K ++H +Q + CV+C E V K + C C ++RPD
Sbjct: 100 GSKNVIELHGTLQTAY------CVNCGSKYDLEEVKYAKKHEVPRCPKCGGIIRPD 149
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 29.8 bits (67), Expect = 0.38
Identities = 10/15 (66%), Positives = 11/15 (73%)
Query: 164 GGEEEEEEEEEEEEE 178
G +EEEEEEEE E
Sbjct: 92 GHTRQEEEEEEEENE 106
Score = 28.6 bits (64), Expect = 1.1
Identities = 10/20 (50%), Positives = 14/20 (70%)
Query: 165 GEEEEEEEEEEEEEDNIEQE 184
G +EEEEEEE+N +Q+
Sbjct: 90 GTGHTRQEEEEEEEENEKQQ 109
Score = 27.9 bits (62), Expect = 2.2
Identities = 10/19 (52%), Positives = 12/19 (63%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
+EEEEEEEE E+ E
Sbjct: 95 RQEEEEEEEENEKQQQSDE 113
Score = 27.5 bits (61), Expect = 2.5
Identities = 10/21 (47%), Positives = 14/21 (66%)
Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
G +EEEEEEEE+ +Q+
Sbjct: 90 GTGHTRQEEEEEEEENEKQQQ 110
Score = 27.1 bits (60), Expect = 3.2
Identities = 10/20 (50%), Positives = 14/20 (70%)
Query: 163 KGGEEEEEEEEEEEEEDNIE 182
EEEEEEEE E+++ + E
Sbjct: 94 TRQEEEEEEEENEKQQQSDE 113
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 30.8 bits (70), Expect = 0.38
Identities = 8/18 (44%), Positives = 9/18 (50%)
Query: 162 KKGGEEEEEEEEEEEEED 179
EEEEEEE+D
Sbjct: 286 AAAAAAAPAEEEEEEEDD 303
Score = 29.6 bits (67), Expect = 0.86
Identities = 8/21 (38%), Positives = 8/21 (38%)
Query: 159 KVDKKGGEEEEEEEEEEEEED 179
EEEEEEED
Sbjct: 282 AAAAAAAAAAAPAEEEEEEED 302
Score = 29.2 bits (66), Expect = 1.2
Identities = 7/14 (50%), Positives = 9/14 (64%)
Query: 166 EEEEEEEEEEEEED 179
EEEEEEE++
Sbjct: 291 AAPAEEEEEEEDDM 304
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 31.1 bits (70), Expect = 0.39
Identities = 8/26 (30%), Positives = 15/26 (57%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
++ + EE EEE+ EEE + ++
Sbjct: 312 PEIEQDEDSEESEEEKNEEEGGLSKK 337
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 28.1 bits (63), Expect = 0.40
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
E+ E+EEE E+EE+ EQ
Sbjct: 24 EKREDEEENEDEEEGEEQS 42
Score = 27.3 bits (61), Expect = 0.80
Identities = 11/19 (57%), Positives = 13/19 (68%)
Query: 167 EEEEEEEEEEEEDNIEQEF 185
EEE+ E+EEE ED E E
Sbjct: 22 EEEKREDEEENEDEEEGEE 40
Score = 25.8 bits (57), Expect = 2.5
Identities = 9/18 (50%), Positives = 12/18 (66%)
Query: 167 EEEEEEEEEEEEDNIEQE 184
E+EEE E+EEE + E
Sbjct: 26 REDEEENEDEEEGEEQSE 43
Score = 25.0 bits (55), Expect = 5.0
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 161 DKKGGEEEEEEEEEEEEED 179
++K +EEE E+EEE EE
Sbjct: 23 EEKREDEEENEDEEEGEEQ 41
Score = 25.0 bits (55), Expect = 6.0
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
E+EEE E+EEE E+ E +
Sbjct: 27 EDEEENEDEEEGEEQSEVK 45
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 30.7 bits (69), Expect = 0.46
Identities = 10/31 (32%), Positives = 20/31 (64%)
Query: 154 LRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
++K + + K G E EEE EEE++++ + +
Sbjct: 32 MKKENAIRKLGKEAEEEAMEEEDDDEEDDDD 62
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 30.8 bits (69), Expect = 0.54
Identities = 8/20 (40%), Positives = 18/20 (90%)
Query: 165 GEEEEEEEEEEEEEDNIEQE 184
++E+E+E++++EED+ E+E
Sbjct: 155 DDDEDEDEDDDDEEDDEEEE 174
Score = 30.0 bits (67), Expect = 0.80
Identities = 9/24 (37%), Positives = 19/24 (79%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
D + +E++++EE++EEE+ E+E
Sbjct: 156 DDEDEDEDDDDEEDDEEEEEEEEE 179
Score = 28.9 bits (64), Expect = 1.8
Identities = 8/21 (38%), Positives = 17/21 (80%)
Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
+E+E+E++++EE+D E+E
Sbjct: 155 DDDEDEDEDDDDEEDDEEEEE 175
Score = 27.7 bits (61), Expect = 4.2
Identities = 9/30 (30%), Positives = 20/30 (66%)
Query: 155 RKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
R +D +E+E++++EE++E+ E+E
Sbjct: 148 RDNFVIDDDDEDEDEDDDDEEDDEEEEEEE 177
>gnl|CDD|215224 PLN02400, PLN02400, cellulose synthase.
Length = 1085
Score = 30.7 bits (69), Expect = 0.55
Identities = 26/109 (23%), Positives = 43/109 (39%), Gaps = 26/109 (23%)
Query: 92 KYMRTHIDTHNGEKPFR------CALCAYSARRKTHLDDHMRRHTGEKPHACGMCG---- 141
+ +R D+ +G KP + C +C DD TG+ AC C
Sbjct: 16 ELVRIRHDSDSGPKPLKNLNGQICQICG---------DDVGVTETGDVFVACNECAFPVC 66
Query: 142 -----YECAQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEF 185
YE G+Q + + + G E +E+E++ D++E EF
Sbjct: 67 RPCYEYERKDGTQCCPQCKT--RYRRHKGSPRVEGDEDEDDVDDLENEF 113
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 30.5 bits (69), Expect = 0.56
Identities = 13/37 (35%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 148 SQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
S++ R ++D EE +E+E+EEEE+ + E E
Sbjct: 352 SKMRDKRRA--RLDPIDFEEVDEDEDEEEEQRSDEHE 386
Score = 27.8 bits (62), Expect = 3.4
Identities = 8/20 (40%), Positives = 14/20 (70%)
Query: 165 GEEEEEEEEEEEEEDNIEQE 184
E +E+E+EEEE+ + +E
Sbjct: 368 EEVDEDEDEEEEQRSDEHEE 387
Score = 27.4 bits (61), Expect = 4.4
Identities = 8/19 (42%), Positives = 13/19 (68%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
E+E+EEEE+ +E E+
Sbjct: 372 EDEDEEEEQRSDEHEEEEG 390
>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and L12p.
Ribosomal proteins P1 and P2 are the eukaryotic proteins
that are functionally equivalent to bacterial L7/L12.
L12p is the archaeal homolog. Unlike other ribosomal
proteins, the archaeal L12p and eukaryotic P1 and P2 do
not share sequence similarity with their bacterial
counterparts. They are part of the ribosomal stalk
(called the L7/L12 stalk in bacteria), along with 28S
rRNA and the proteins L11 and P0 in eukaryotes (23S
rRNA, L11, and L10e in archaea). In bacterial ribosomes,
L7/L12 homodimers bind the extended C-terminal helix of
L10 to anchor the L7/L12 molecules to the ribosome.
Eukaryotic P1/P2 heterodimers and archaeal L12p
homodimers are believed to bind the L10 equivalent
proteins, eukaryotic P0 and archaeal L10e, in a similar
fashion. P1 and P2 (L12p, L7/L12) are the only proteins
in the ribosome to occur as multimers, always appearing
as sets of dimers. Recent data indicate that most
archaeal species contain six copies of L12p (three
homodimers), while eukaryotes have two copies each of P1
and P2 (two heterodimers). Bacteria may have four or six
copies (two or three homodimers), depending on the
species. As in bacteria, the stalk is crucial for
binding of initiation, elongation, and release factors
in eukaryotes and archaea.
Length = 105
Score = 29.2 bits (65), Expect = 0.56
Identities = 11/14 (78%), Positives = 13/14 (92%)
Query: 166 EEEEEEEEEEEEED 179
EE +EEEEEEE+ED
Sbjct: 86 EEAKEEEEEEEDED 99
Score = 28.0 bits (62), Expect = 1.4
Identities = 12/21 (57%), Positives = 13/21 (61%)
Query: 167 EEEEEEEEEEEEDNIEQEFLF 187
EE +EEEEEEED LF
Sbjct: 85 AEEAKEEEEEEEDEDFGFGLF 105
Score = 26.8 bits (59), Expect = 3.3
Identities = 9/15 (60%), Positives = 13/15 (86%)
Query: 166 EEEEEEEEEEEEEDN 180
EE +EEEEEEE+++
Sbjct: 85 AEEAKEEEEEEEDED 99
>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
synthase complex, beta subunit. Nomenclature follows
the description for Methanosarcina thermophila. The
CO-methylating acetyl-CoA synthase is considered the
defining enzyme of the Wood-Ljungdahl pathway, used for
acetate catabolism by sulfate reducing bacteria but for
acetate biosynthesis by acetogenic bacteria such as
oorella thermoacetica (f. Clostridium thermoaceticum)
[Energy metabolism, Chemoautotrophy].
Length = 458
Score = 30.2 bits (68), Expect = 0.62
Identities = 15/37 (40%), Positives = 22/37 (59%), Gaps = 1/37 (2%)
Query: 149 QLMQHLRKV-HKVDKKGGEEEEEEEEEEEEEDNIEQE 184
+L + L++ H V K+ E +EEE EEEEE +E
Sbjct: 383 ELRKFLKEKGHPVVKRVVREVDEEEIEEEEEAMQPEE 419
>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
(CLPTM1). This family consists of several eukaryotic
cleft lip and palate transmembrane protein 1 sequences.
Cleft lip with or without cleft palate is a common birth
defect that is genetically complex. The nonsyndromic
forms have been studied genetically using linkage and
candidate-gene association studies with only partial
success in defining the loci responsible for orofacial
clefting. CLPTM1 encodes a transmembrane protein and has
strong homology to two Caenorhabditis elegans genes,
suggesting that CLPTM1 may belong to a new gene family.
This family also contains the human cisplatin resistance
related protein CRR9p which is associated with
CDDP-induced apoptosis.
Length = 437
Score = 30.0 bits (68), Expect = 0.76
Identities = 14/41 (34%), Positives = 19/41 (46%), Gaps = 6/41 (14%)
Query: 146 QGSQLMQHLRKVHKVDKK----GGEEEEEEEEEEEEEDNIE 182
L +L K K KK GG+ E+EE EEE+ +
Sbjct: 140 FVFPLTTYLPK--KKVKKKNLLGGKSEKEEPEEEKTPAPDK 178
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression in
addition to DNA-based checkpoints.
Length = 76
Score = 28.0 bits (63), Expect = 0.80
Identities = 6/16 (37%), Positives = 13/16 (81%)
Query: 165 GEEEEEEEEEEEEEDN 180
+EE+EEE+++++ D
Sbjct: 60 EDEEDEEEDDDDDMDG 75
Score = 26.8 bits (60), Expect = 2.5
Identities = 10/25 (40%), Positives = 16/25 (64%), Gaps = 5/25 (20%)
Query: 166 EEEEE-----EEEEEEEEDNIEQEF 185
EEEEE E+EE+EE++ + +
Sbjct: 49 EEEEEGEDLESEDEEDEEEDDDDDM 73
Score = 25.6 bits (57), Expect = 6.3
Identities = 5/16 (31%), Positives = 12/16 (75%)
Query: 164 GGEEEEEEEEEEEEED 179
EE+EEE+++++ +
Sbjct: 60 EDEEDEEEDDDDDMDG 75
Score = 25.3 bits (56), Expect = 8.7
Identities = 10/15 (66%), Positives = 13/15 (86%)
Query: 165 GEEEEEEEEEEEEED 179
GE+ E E+EE+EEED
Sbjct: 54 GEDLESEDEEDEEED 68
>gnl|CDD|227003 COG4656, RnfC, Predicted NADH:ubiquinone oxidoreductase, subunit
RnfC [Energy production and conversion].
Length = 529
Score = 30.0 bits (68), Expect = 0.82
Identities = 16/60 (26%), Positives = 24/60 (40%), Gaps = 11/60 (18%)
Query: 107 FRCALCAYSARR-----KTHLDDHMRRHTGEKPHA------CGMCGYECAQGSQLMQHLR 155
RC+LCA + + + +H E+ H CG C Y C L+Q+ R
Sbjct: 368 IRCSLCADACPVNLLPQQLYWFAKGEQHDEEEEHNLLDCIECGACAYVCPSNIPLVQYFR 427
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 29.5 bits (66), Expect = 0.87
Identities = 13/17 (76%), Positives = 16/17 (94%)
Query: 166 EEEEEEEEEEEEEDNIE 182
EE+EEEEEEEEEE+ I+
Sbjct: 229 EEDEEEEEEEEEEEEID 245
Score = 29.1 bits (65), Expect = 1.2
Identities = 13/22 (59%), Positives = 16/22 (72%)
Query: 158 HKVDKKGGEEEEEEEEEEEEED 179
D + E+EEEEEEEEEEE+
Sbjct: 222 SGSDSESEEDEEEEEEEEEEEE 243
Score = 27.2 bits (60), Expect = 6.1
Identities = 12/17 (70%), Positives = 14/17 (82%)
Query: 166 EEEEEEEEEEEEEDNIE 182
E EE+EEEEEEEE+ E
Sbjct: 227 ESEEDEEEEEEEEEEEE 243
Score = 26.8 bits (59), Expect = 6.7
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 165 GEEEEEEEEEEEEEDNIEQE 184
G + E EE+EEEEE+ E+E
Sbjct: 223 GSDSESEEDEEEEEEEEEEE 242
Score = 26.8 bits (59), Expect = 7.1
Identities = 12/21 (57%), Positives = 16/21 (76%)
Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
G + E EE+EEEEEE+ E+E
Sbjct: 223 GSDSESEEDEEEEEEEEEEEE 243
>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
Provisional.
Length = 130
Score = 28.6 bits (64), Expect = 0.96
Identities = 11/22 (50%), Positives = 15/22 (68%)
Query: 167 EEEEEEEEEEEEDNIEQEFLFE 188
++EEE+EEE E E+E L E
Sbjct: 96 KKEEEKEEEIPEPTKEEELLGE 117
Score = 27.9 bits (62), Expect = 1.8
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 154 LRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
++ +K+ K EE+EEE E +E+ + E
Sbjct: 87 VKVFNKLTSKKEEEKEEEIPEPTKEEELLGE 117
>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
Length = 613
Score = 29.3 bits (67), Expect = 1.1
Identities = 11/43 (25%), Positives = 21/43 (48%), Gaps = 5/43 (11%)
Query: 148 SQLMQHLR-----KVHKVDKKGGEEEEEEEEEEEEEDNIEQEF 185
+ +L V + + +E+EEE+EE++ E+EF
Sbjct: 452 EFWISYLEEFDGKPFKSVARGDLDLGKEDEEEKEEKEEAEEEF 494
Score = 28.2 bits (64), Expect = 2.8
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 154 LRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFL 186
+ V + D G+E+EEE+EE+EE + + L
Sbjct: 466 FKSVARGDLDLGKEDEEEKEEKEEAEEEFKPLL 498
>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
represents the eukaryotic large ribosomal protein P2.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P2 is located in the L12 stalk, with proteins
P1, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers). Bacteria may have four or six copies
of L7/L12 (two or three homodimers) depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plants have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site in the C-terminal region
of P2. TCS inactivates the ribosome by depurinating a
specific adenine in the sarcin-ricin loop of 28S rRNA.
Length = 109
Score = 28.4 bits (64), Expect = 1.1
Identities = 8/14 (57%), Positives = 12/14 (85%)
Query: 166 EEEEEEEEEEEEED 179
EE++EE EEE ++D
Sbjct: 89 EEKKEESEEESDDD 102
Score = 27.6 bits (62), Expect = 1.9
Identities = 8/14 (57%), Positives = 12/14 (85%)
Query: 166 EEEEEEEEEEEEED 179
+EE++EE EEE +D
Sbjct: 88 KEEKKEESEEESDD 101
Score = 27.2 bits (61), Expect = 2.2
Identities = 8/14 (57%), Positives = 12/14 (85%)
Query: 166 EEEEEEEEEEEEED 179
++EE++EE EEE D
Sbjct: 87 KKEEKKEESEEESD 100
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 29.4 bits (66), Expect = 1.2
Identities = 8/29 (27%), Positives = 15/29 (51%), Gaps = 5/29 (17%)
Query: 165 GEEEEEEEEEEEEEDNIEQ-----EFLFE 188
EEE +E ++ + +D E E +F+
Sbjct: 285 PEEEPDEPDQTDPDDGEETDQIPEELMFD 313
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 29.0 bits (65), Expect = 1.2
Identities = 14/24 (58%), Positives = 18/24 (75%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
+K+ E+EEEEE EEEEE+ E E
Sbjct: 262 EKELKEDEEEEETEEEEEEEDEDE 285
Score = 27.5 bits (61), Expect = 4.3
Identities = 12/20 (60%), Positives = 17/20 (85%)
Query: 160 VDKKGGEEEEEEEEEEEEED 179
V++K +E+EEEEE EEEE+
Sbjct: 260 VEEKELKEDEEEEETEEEEE 279
Score = 26.7 bits (59), Expect = 7.7
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 152 QHLRKVHKVDKKGGEEEEEEEEEEEEE 178
K K D++ E EEEEEEE+E+E
Sbjct: 259 YVEEKELKEDEEEEETEEEEEEEDEDE 285
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 29.3 bits (65), Expect = 1.3
Identities = 11/36 (30%), Positives = 22/36 (61%), Gaps = 1/36 (2%)
Query: 145 AQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDN 180
+Q S++ Q + ++ K + EE++++ EEEDN
Sbjct: 94 SQESEV-QDNTEQNQDTKGSKTDSEEDDDDSEEEDN 128
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 28.7 bits (64), Expect = 1.3
Identities = 11/17 (64%), Positives = 16/17 (94%)
Query: 168 EEEEEEEEEEEDNIEQE 184
E+EEEEEE+EEDN++ +
Sbjct: 76 EDEEEEEEDEEDNVDLK 92
Score = 27.9 bits (62), Expect = 2.3
Identities = 9/19 (47%), Positives = 15/19 (78%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
E+EEEEEE+EE+ +++
Sbjct: 76 EDEEEEEEDEEDNVDLKDI 94
Score = 27.9 bits (62), Expect = 2.3
Identities = 12/28 (42%), Positives = 18/28 (64%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
D + EEE+EEE EE E+ E+E + +
Sbjct: 50 DDEEEEEEDEEEIEEPEDIEDEEEIVED 77
Score = 27.9 bits (62), Expect = 2.5
Identities = 12/23 (52%), Positives = 19/23 (82%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQ 183
D K ++E++EEEEEE+E+ IE+
Sbjct: 42 DVKDEKQEDDEEEEEEDEEEIEE 64
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 29.0 bits (65), Expect = 1.4
Identities = 14/25 (56%), Positives = 17/25 (68%)
Query: 164 GGEEEEEEEEEEEEEDNIEQEFLFE 188
G EEE EE EE+EEE+ E+ FE
Sbjct: 79 GEEEEVEEGEEDEEEEGEEESEEFE 103
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 28.7 bits (64), Expect = 1.5
Identities = 12/24 (50%), Positives = 16/24 (66%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
++ EEEEEE EEEEE + E +
Sbjct: 9 EEDESGEEEEEESEEEEETDSEDD 32
>gnl|CDD|129294 TIGR00190, thiC, thiamine biosynthesis protein ThiC. The thiC
ortholog is designated thiA in Bacillus subtilis
[Biosynthesis of cofactors, prosthetic groups, and
carriers, Thiamine].
Length = 423
Score = 28.9 bits (65), Expect = 1.5
Identities = 13/45 (28%), Positives = 15/45 (33%), Gaps = 6/45 (13%)
Query: 101 HNGEKPFRCALCAYSARRKTHLDDHMRRHTGEKPHACGMCGYECA 145
+ E+ F AL AR E C MCG CA
Sbjct: 376 FDWERQFELALDPEKAREY-----RDETPP-EDEDFCSMCGKYCA 414
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 29.0 bits (65), Expect = 1.7
Identities = 11/43 (25%), Positives = 23/43 (53%)
Query: 146 QGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
+G ++ L++ + + EE + EE +EE + E+ F +E
Sbjct: 271 EGWDWLKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYE 313
>gnl|CDD|237359 PRK13352, PRK13352, thiamine biosynthesis protein ThiC;
Provisional.
Length = 431
Score = 28.9 bits (66), Expect = 1.7
Identities = 15/42 (35%), Positives = 16/42 (38%), Gaps = 7/42 (16%)
Query: 104 EKPFRCALCAYSARRKTHLDDHMRRHTGEKPHACGMCGYECA 145
EK F AL AR R + AC MCG CA
Sbjct: 382 EKQFELALDPEKARE-------YRDERLKDEEACSMCGKFCA 416
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 28.7 bits (65), Expect = 1.7
Identities = 11/34 (32%), Positives = 20/34 (58%)
Query: 151 MQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
+ R+ K +KK +++EEEEEE+ ++E
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452
Score = 27.2 bits (61), Expect = 6.5
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
K +EEEEEEE+E++E+ E+E
Sbjct: 433 KAFAGKKKEEEEEEEKEKKEEEKEEE 458
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 28.9 bits (65), Expect = 2.1
Identities = 13/38 (34%), Positives = 21/38 (55%), Gaps = 3/38 (7%)
Query: 145 AQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIE 182
QG +L RK+ +E+E+ + EEE+ED+ E
Sbjct: 309 RQGEELR---RKIEGKSVSEEDEDEDSDSEEEDEDDDE 343
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 28.1 bits (62), Expect = 2.2
Identities = 14/40 (35%), Positives = 24/40 (60%), Gaps = 4/40 (10%)
Query: 151 MQHLRKVHKVDKKGGEEEEEEEEE----EEEEDNIEQEFL 186
M+ R+ K+D++ +EE+E E E EE+ D+ E E +
Sbjct: 142 MEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEII 181
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 28.2 bits (63), Expect = 2.4
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
K + KG EE+ E EE E+EE + E
Sbjct: 24 KEEDKGKEEDLEFEEIEKEEIIEDSE 49
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 28.5 bits (63), Expect = 2.5
Identities = 5/25 (20%), Positives = 11/25 (44%)
Query: 160 VDKKGGEEEEEEEEEEEEEDNIEQE 184
+ EE + + +E E+ E +
Sbjct: 303 NAEAHKEEVQSDRPDEIGEEKEEDD 327
>gnl|CDD|145910 pfam03012, PP_M1, Phosphoprotein. This family includes the M1
phosphoprotein non-structural RNA polymerase alpha
subunit, which is thought to be a component of the
active polymerase, and may be involved in template
binding.
Length = 298
Score = 28.2 bits (63), Expect = 2.7
Identities = 8/30 (26%), Positives = 16/30 (53%)
Query: 150 LMQHLRKVHKVDKKGGEEEEEEEEEEEEED 179
L + + ++ D K + EEE+E+E +
Sbjct: 49 LPEDMSRLQISDAKPSQFTEEEDEDEGSSE 78
>gnl|CDD|220661 pfam10263, SprT-like, SprT-like family. This family represents a
domain found in eukaryotes and prokaryotes. The domain
contains a characteristic motif of the zinc
metallopeptidases. This family includes the bacterial
SprT protein.
Length = 153
Score = 27.7 bits (62), Expect = 2.7
Identities = 11/46 (23%), Positives = 18/46 (39%), Gaps = 8/46 (17%)
Query: 67 CVDMVRPDASYTYMCYLCNYHTPTRKYMRTHIDTHNGEKPFRCALC 112
+++V Y Y C C P ++ +R H +RC C
Sbjct: 108 DIEVVSGRKRYIYRCGSCGQLYPRKRRIRRHK--------YRCGRC 145
>gnl|CDD|236153 PRK08116, PRK08116, hypothetical protein; Validated.
Length = 268
Score = 28.1 bits (63), Expect = 2.8
Identities = 14/50 (28%), Positives = 26/50 (52%), Gaps = 3/50 (6%)
Query: 137 CGMCGYECAQGSQLMQHLRKVH---KVDKKGGEEEEEEEEEEEEEDNIEQ 183
C +CG + +L+ R V + +++ E +E EEE E++ IE+
Sbjct: 19 CEVCGKRIEKVVELLGKNRIVPIMCECEREAEEAKEREEENREKQRRIER 68
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 28.1 bits (63), Expect = 2.9
Identities = 8/24 (33%), Positives = 16/24 (66%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
+ E E+E+EE+E+EE+ + +
Sbjct: 292 EDDYSESEDEDEEDEDEEEEEDDD 315
Score = 26.5 bits (59), Expect = 8.2
Identities = 10/26 (38%), Positives = 18/26 (69%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
K +++ E++ E E+E+EED E+E
Sbjct: 285 KAEEEEEEDDYSESEDEDEEDEDEEE 310
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 26.8 bits (59), Expect = 3.0
Identities = 11/18 (61%), Positives = 13/18 (72%)
Query: 161 DKKGGEEEEEEEEEEEEE 178
++ GEEEE EEEEE E
Sbjct: 64 EEGEGEEEEGEEEEETEG 81
Score = 26.1 bits (57), Expect = 5.8
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
D + EEEE +E++EEEE E+E
Sbjct: 48 DDEMEEEEEVDEDDEEEEGEGEEE 71
Score = 25.7 bits (56), Expect = 8.0
Identities = 12/24 (50%), Positives = 14/24 (58%)
Query: 165 GEEEEEEEEEEEEEDNIEQEFLFE 188
GE EEEE EEEEE + + E
Sbjct: 66 GEGEEEEGEEEEETEGATGKRAAE 89
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 28.0 bits (62), Expect = 3.0
Identities = 10/20 (50%), Positives = 12/20 (60%)
Query: 160 VDKKGGEEEEEEEEEEEEED 179
VD + E EE + EEEED
Sbjct: 174 VDNEEEERLEESDGREEEED 193
Score = 26.5 bits (58), Expect = 9.3
Identities = 14/28 (50%), Positives = 16/28 (57%), Gaps = 3/28 (10%)
Query: 159 KVDKKGGEEEEEEEE---EEEEEDNIEQ 183
++DKK GEEEE EEE E E E
Sbjct: 232 RIDKKQGEEEEMEEEVINLFEIEWEEES 259
>gnl|CDD|172627 PRK14138, PRK14138, NAD-dependent deacetylase; Provisional.
Length = 244
Score = 27.9 bits (62), Expect = 3.0
Identities = 13/57 (22%), Positives = 29/57 (50%), Gaps = 10/57 (17%)
Query: 22 GAKQAADVHNHVQQIHMGVNFVCVHC-KQF---EVVPTRKTQTLEHCATCVDMVRPD 74
G+K+ ++H +V+ + CV C K++ +V+ + + C C ++RP+
Sbjct: 106 GSKKVIELHGNVE------EYYCVRCGKRYTVEDVIEKLEKSDVPRCDDCSGLIRPN 156
>gnl|CDD|129414 TIGR00314, cdhA, CO dehydrogenase/acetyl-CoA synthase complex,
epsilon subunit. Acetyl-CoA decarbonylase/synthase
(ACDS) is a multienzyme complex. Carbon monoxide
dehydrogenase is a synonym. The ACDS complex carries out
an unusual reaction involving the reversible cleavage
and synthesis of acetyl-CoA in methanogens. The model
contains the prosite signature for 4Fe-4S ferredoxins
[C-x(2)-C-x(2)-C-x(3)-C-[PEG]] between residues 448-462
of the model [Energy metabolism, Chemoautotrophy].
Length = 784
Score = 27.9 bits (62), Expect = 3.4
Identities = 14/54 (25%), Positives = 19/54 (35%)
Query: 117 RRKTHLDDHMRRHTGEKPHACGMCGYECAQGSQLMQHLRKVHKVDKKGGEEEEE 170
RK DD K CG C C ++ + + K D E+ EE
Sbjct: 384 ERKFLPDDEELMELANKCTQCGNCVRTCPNSLRVDEAMAHAQKGDLSKLEQLEE 437
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell cycle
progression and biogenesis of 60S ribosomal subunits.
The Schistosoma mansoni Mak16 has been shown to target
protein transport to the nucleolus.
Length = 97
Score = 26.8 bits (59), Expect = 3.5
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 160 VDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
++ + EE +EEEEEEEEE++ +
Sbjct: 55 LEAEESEENDEEEEEEEEEEDEGEIEYVS 83
>gnl|CDD|153040 pfam12606, RELT, Tumour necrosis factor receptor superfamily member
19. This family of proteins is found in bacteria and
eukaryotes. Proteins in this family are typically
between 49 and 288 amino acids in length. There are two
completely conserved residues (K and Y) that may be
functionally important. The members of tumor necrosis
factor receptor (TNFR) superfamily have been designated
as the "guardians of the immune system" due to their
roles in immune cell proliferation, differentiation,
activation, and death (apoptosis). The messenger RNA of
RELT is especially abundant in hematologic tissues such
as spleen, lymph node, and peripheral blood leukocytes
as well as in leukemias and lymphomas. RELT is able to
activate the NF-kappaB pathway and selectively binds
tumor necrosis factor receptor-associated factor 1.
Length = 50
Score = 25.7 bits (57), Expect = 3.5
Identities = 10/20 (50%), Positives = 13/20 (65%)
Query: 158 HKVDKKGGEEEEEEEEEEEE 177
VD E EE++EEE+EE
Sbjct: 31 CTVDPLEDEPEEKKEEEKEE 50
>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal. This
domain family is found in eukaryotes, and is typically
between 99 and 114 amino acids in length. The family is
found in association with pfam08697, pfam01585. There
are two completely conserved residues (G and F) that may
be functionally important. TIP is involved in enamel
assembly by interacting with one of the major proteins
responsible for biomineralisation of enamel - tuftelin.
Length = 106
Score = 26.9 bits (60), Expect = 3.6
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 162 KKGGEEEEEEEEEEEEEDNIEQE 184
K+ +E EE+++E+E ED+ E +
Sbjct: 77 KQAAKEPEEDDKEDESEDDDESD 99
>gnl|CDD|234013 TIGR02785, addA_Gpos, helicase-exonuclease AddAB, AddA subunit,
Firmicutes type. AddAB, also called RexAB, substitutes
for RecBCD in several bacterial lineages. These DNA
recombination proteins act before synapse and are
particularly important for DNA repair of double-stranded
breaks by homologous recombination. The term AddAB is
used broadly, with AddA homologous between the
Firmicutes (as modeled here) and the
alphaproteobacteria, while the partner AddB proteins
show no strong homology across the two groups of species
[DNA metabolism, DNA replication, recombination, and
repair].
Length = 1230
Score = 28.1 bits (63), Expect = 3.8
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 157 VHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
+++ EEEE +EE E D +QE
Sbjct: 518 LYEKLLIEEAEEEEIDEEAEILDKAQQE 545
>gnl|CDD|216813 pfam01964, ThiC, ThiC family. ThiC is found within the thiamine
biosynthesis operon. ThiC is involved in pyrimidine
biosynthesis. The precise catalytic function of ThiC is
still not known. ThiC participates in the formation of
4-Amino-5-hydroxymethyl-2-methylpyrimidine from AIR, an
intermediate in the de novo pyrimidine biosynthesis.
Length = 421
Score = 27.8 bits (63), Expect = 3.9
Identities = 15/43 (34%), Positives = 16/43 (37%), Gaps = 8/43 (18%)
Query: 104 EKPFRCALCAYSARRKTHLDDHMRRHTG-EKPHACGMCGYECA 145
EK F AL AR +R E C MCG CA
Sbjct: 378 EKQFELALDPEKARE-------IRDERLPEDADFCSMCGKFCA 413
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 27.9 bits (62), Expect = 3.9
Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 11/64 (17%)
Query: 131 GEKPHACGMCG---------YECAQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNI 181
GE AC C YE +G+Q + +K K G E +EEE++ D++
Sbjct: 52 GEPFVACNECAFPVCRPCYEYERREGNQACPQCKTRYKRIK--GSPRVEGDEEEDDIDDL 109
Query: 182 EQEF 185
E EF
Sbjct: 110 ENEF 113
>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal. The family represents
the N-terminal region of RXT2-like proteins. In S.
cerevisiae, RXT2 has been demonstrated to be involved in
conjugation with cellular fusion (mating) and invasive
growth. A high throughput localisation study has
localised RXT2 to the nucleus.
Length = 141
Score = 27.0 bits (60), Expect = 4.1
Identities = 6/26 (23%), Positives = 21/26 (80%)
Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
++D+ GG+ ++++E++E++E+ ++
Sbjct: 46 RIDEDGGDIDDDDEDDEDDEEADAED 71
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 27.6 bits (61), Expect = 4.2
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 157 VHKVDKKGGEEEEEEEEEEEEEDNIEQ 183
V V+K+ G+ EE +EE ED +E
Sbjct: 332 VEVVEKQEGDVVTEESTDEESEDEVEI 358
Score = 27.2 bits (60), Expect = 6.3
Identities = 9/24 (37%), Positives = 14/24 (58%)
Query: 160 VDKKGGEEEEEEEEEEEEEDNIEQ 183
V + +EE E+E E +E IE+
Sbjct: 342 VVTEESTDEESEDEVEIDESVIEE 365
>gnl|CDD|236631 PRK09885, PRK09885, putative toxin YafO; Provisional.
Length = 132
Score = 27.1 bits (60), Expect = 4.2
Identities = 10/40 (25%), Positives = 14/40 (35%), Gaps = 4/40 (10%)
Query: 87 HTPTRKYMR----THIDTHNGEKPFRCALCAYSARRKTHL 122
+ T ++ HI N PF L +S HL
Sbjct: 47 DSFTWPLLKFERVAHIHLANVNNPFPLQLRQFSRTSGYHL 86
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 27.6 bits (61), Expect = 4.2
Identities = 12/23 (52%), Positives = 15/23 (65%)
Query: 162 KKGGEEEEEEEEEEEEEDNIEQE 184
+ E E+EEEEEE E DN +E
Sbjct: 462 SEEEEGEDEEEEEEVEADNGSEE 484
Score = 26.8 bits (59), Expect = 7.9
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
+ + E EEEEEEEEEE+ EQE
Sbjct: 438 ESEEEESVEEEEEEEEEEEEEEQE 461
Score = 26.8 bits (59), Expect = 9.9
Identities = 14/31 (45%), Positives = 18/31 (58%)
Query: 158 HKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
++ EEE EEEEEEEE+ E+E E
Sbjct: 433 SMASQESEEEESVEEEEEEEEEEEEEEQESE 463
>gnl|CDD|132613 TIGR03574, selen_PSTK, L-seryl-tRNA(Sec) kinase, archaeal. Members
of this protein are L-seryl-tRNA(Sec) kinase. This
enzyme is part of a two-step pathway in Eukaryota and
Archaea for performing selenocysteine biosynthesis by
changing serine misacylated on selenocysteine-tRNA to
selenocysteine. This enzyme performs the first step,
phosphorylation of the OH group of the serine side
chain. This family represents archaeal proteins with
this activity [Protein synthesis, tRNA aminoacylation].
Length = 249
Score = 27.5 bits (61), Expect = 4.3
Identities = 12/49 (24%), Positives = 23/49 (46%), Gaps = 11/49 (22%)
Query: 149 QLMQHLRKVHKVDKKGGEEEEE-----------EEEEEEEEDNIEQEFL 186
Q++ L K K+DK+ ++ E + E+E+ +E+EF
Sbjct: 197 QIVGELIKTGKLDKELIKKLSELRKEFLKSIKDIKAEKEDFRRVEKEFK 245
>gnl|CDD|145556 pfam02478, Pneumo_phosprot, Pneumovirus phosphoprotein. This
family represents the phosphoprotein of Paramyxoviridae,
a putative RNA polymerase alpha subunit that may
function in template binding.
Length = 266
Score = 27.5 bits (61), Expect = 4.4
Identities = 12/17 (70%), Positives = 15/17 (88%)
Query: 165 GEEEEEEEEEEEEEDNI 181
GE EEEEE+E+ EED+I
Sbjct: 250 GESEEEEEDEDLEEDDI 266
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 27.7 bits (61), Expect = 4.4
Identities = 6/18 (33%), Positives = 13/18 (72%)
Query: 167 EEEEEEEEEEEEDNIEQE 184
+ +E+E+EEE D++ +
Sbjct: 3949 DRQEKEDEEEMSDDVGID 3966
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 27.4 bits (62), Expect = 4.5
Identities = 13/33 (39%), Positives = 18/33 (54%)
Query: 152 QHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
+ R +++ E EEEEEEEEEEE +
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEA 257
>gnl|CDD|180707 PRK06811, PRK06811, RNA polymerase factor sigma-70; Validated.
Length = 189
Score = 27.2 bits (61), Expect = 4.6
Identities = 9/34 (26%), Positives = 17/34 (50%), Gaps = 4/34 (11%)
Query: 155 RKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
RK+ K + E + +E E++IE E + +
Sbjct: 90 RKLTKNN----EIDSIDEFILISEESIENEIILK 119
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 27.3 bits (61), Expect = 5.0
Identities = 11/29 (37%), Positives = 15/29 (51%)
Query: 156 KVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
K K D EE+EEE EEE+ ++
Sbjct: 143 KKAKEDSDEELSEEDEEEAAEEEEAEAEK 171
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 27.3 bits (60), Expect = 5.2
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 146 QGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
Q +Q + HK ++K +EE+EEEE EEE E +
Sbjct: 201 QLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQ 239
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 27.0 bits (60), Expect = 5.2
Identities = 13/20 (65%), Positives = 14/20 (70%)
Query: 159 KVDKKGGEEEEEEEEEEEEE 178
VD + EEEEEE EEEEE
Sbjct: 156 TVDLEPPEEEEEEIAEEEEE 175
Score = 26.2 bits (58), Expect = 9.7
Identities = 12/19 (63%), Positives = 13/19 (68%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
EEEEEE EEEE+ E E
Sbjct: 162 PEEEEEEIAEEEEEVKEPE 180
>gnl|CDD|237609 PRK14096, pgi, glucose-6-phosphate isomerase; Provisional.
Length = 528
Score = 27.5 bits (62), Expect = 5.2
Identities = 12/25 (48%), Positives = 15/25 (60%), Gaps = 1/25 (4%)
Query: 21 YGAKQAADVHNHVQQIHMGV-NFVC 44
YG K + D H +VQQ+ GV NF
Sbjct: 342 YGNKGSTDQHAYVQQLRDGVDNFFV 366
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 27.3 bits (60), Expect = 5.3
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
EEEEE E EE+E+ E+E
Sbjct: 140 LEEEEEVEMEEDEEYYEKE 158
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 27.0 bits (60), Expect = 5.4
Identities = 15/39 (38%), Positives = 23/39 (58%), Gaps = 1/39 (2%)
Query: 149 QLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLF 187
L++ L K+ K ++ +E EEEE+ EEE E+E L
Sbjct: 146 ALLRELEKI-KKERAEEKEREEEEKAAEEEKAREEEILT 183
>gnl|CDD|178484 PLN02896, PLN02896, cinnamyl-alcohol dehydrogenase.
Length = 353
Score = 27.1 bits (60), Expect = 5.5
Identities = 18/79 (22%), Positives = 32/79 (40%), Gaps = 8/79 (10%)
Query: 103 GEKPFRCALCAYSARRKT----HLDDHMRRHT--GEKPHACG--MCGYECAQGSQLMQHL 154
G+ L A ++R + H++D H E+ A G +C + S+L+ HL
Sbjct: 228 GDSKLFSILSAVNSRMGSIALVHIEDICDAHIFLMEQTKAEGRYICCVDSYDMSELINHL 287
Query: 155 RKVHKVDKKGGEEEEEEEE 173
K + +EE+
Sbjct: 288 SKEYPCSNIQVRLDEEKRG 306
>gnl|CDD|183731 PRK12766, PRK12766, 50S ribosomal protein L32e; Provisional.
Length = 232
Score = 27.1 bits (60), Expect = 5.6
Identities = 10/18 (55%), Positives = 14/18 (77%)
Query: 167 EEEEEEEEEEEEDNIEQE 184
E+E EEEEEE+ ++E E
Sbjct: 70 EDEGGEEEEEEDADVETE 87
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 25.9 bits (57), Expect = 6.0
Identities = 13/23 (56%), Positives = 18/23 (78%)
Query: 166 EEEEEEEEEEEEEDNIEQEFLFE 188
EEEEEEEE++ E+ + E EF+ E
Sbjct: 12 EEEEEEEEDDLEDLSDEDEFIDE 34
Score = 25.5 bits (56), Expect = 9.8
Identities = 10/20 (50%), Positives = 16/20 (80%)
Query: 165 GEEEEEEEEEEEEEDNIEQE 184
+EEEEEEEEE++ +++ E
Sbjct: 9 DDEEEEEEEEEDDLEDLSDE 28
>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
Length = 356
Score = 27.2 bits (60), Expect = 6.2
Identities = 10/15 (66%), Positives = 14/15 (93%)
Query: 166 EEEEEEEEEEEEEDN 180
+E++EEEE EEEE+N
Sbjct: 336 QEQQEEEENEEEEEN 350
Score = 26.5 bits (58), Expect = 9.2
Identities = 10/15 (66%), Positives = 14/15 (93%)
Query: 166 EEEEEEEEEEEEEDN 180
E++EEEE EEEEE++
Sbjct: 337 EQQEEEENEEEEEND 351
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 27.1 bits (61), Expect = 6.3
Identities = 9/25 (36%), Positives = 15/25 (60%)
Query: 160 VDKKGGEEEEEEEEEEEEEDNIEQE 184
G E EE +++E+EEE+ E +
Sbjct: 182 PAHVGSELEELDDDEDEEEEEDEND 206
>gnl|CDD|238703 cd01412, SIRT5_Af1_CobB, SIRT5_Af1_CobB: Eukaryotic, archaeal and
prokaryotic group (class3) which includes human sirtuin
SIRT5, Archaeoglobus fulgidus Sir2-Af1, and E. coli
CobB; and are members of the SIR2 family of proteins,
silent information regulator 2 (Sir2) enzymes which
catalyze NAD+-dependent protein/histone deacetylation.
Sir2 proteins have been shown to regulate gene
silencing, DNA repair, metabolic enzymes, and life span.
CobB is a bacterial sirtuin that deacetylates acetyl-CoA
synthetase at an active site lysine to stimulate its
enzymatic activity. .
Length = 224
Score = 26.8 bits (60), Expect = 6.3
Identities = 8/32 (25%), Positives = 11/32 (34%), Gaps = 1/32 (3%)
Query: 44 CVHCKQ-FEVVPTRKTQTLEHCATCVDMVRPD 74
C C E + L C C ++RP
Sbjct: 112 CSSCGYVGENNEEIPEEELPRCPKCGGLLRPG 143
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 26.6 bits (59), Expect = 6.4
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 156 KVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
K KK E+ EEE++ E +D+ +E
Sbjct: 64 TTKKSKKKDKEKLTEEEKKPESDDDKTEE 92
>gnl|CDD|237078 PRK12366, PRK12366, replication factor A; Reviewed.
Length = 637
Score = 27.3 bits (61), Expect = 6.4
Identities = 9/22 (40%), Positives = 14/22 (63%)
Query: 160 VDKKGGEEEEEEEEEEEEEDNI 181
V ++ G + E +EEEEE+ I
Sbjct: 45 VAQELGIDIEYGQEEEEEDFKI 66
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 27.3 bits (61), Expect = 6.5
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQEF 185
D G + EE+EEEEE+ + E E
Sbjct: 312 DNFGLGQGEEDEEEEEDGVDDEDEE 336
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 27.1 bits (60), Expect = 6.7
Identities = 11/47 (23%), Positives = 17/47 (36%)
Query: 142 YECAQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
Y+ + L + ++ E EE E E+ E E L E
Sbjct: 376 YDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRE 422
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 26.4 bits (58), Expect = 6.9
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 162 KKGGEEEEEEEEEEEEEDNIEQE 184
+G E + EEE+EE +E E E
Sbjct: 25 DEGDESDAEEEDEEIQEALREAE 47
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 26.4 bits (58), Expect = 7.0
Identities = 9/18 (50%), Positives = 11/18 (61%)
Query: 166 EEEEEEEEEEEEEDNIEQ 183
+EE+EE EE EEE
Sbjct: 46 DEEDEEAEEVEEETAASS 63
Score = 26.4 bits (58), Expect = 8.1
Identities = 11/24 (45%), Positives = 18/24 (75%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
D + E EE+EEE E+ ED++++E
Sbjct: 25 DSQAEEVEEDEEEMEDWEDSLDEE 48
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 26.5 bits (59), Expect = 7.0
Identities = 8/30 (26%), Positives = 13/30 (43%)
Query: 149 QLMQHLRKVHKVDKKGGEEEEEEEEEEEEE 178
Q K+ + EE +EEE++ E
Sbjct: 33 QEAHENALEKKLLDELDEELDEEEDDRFLE 62
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 26.8 bits (60), Expect = 7.2
Identities = 7/20 (35%), Positives = 14/20 (70%)
Query: 161 DKKGGEEEEEEEEEEEEEDN 180
+ ++E+E+E++EEE D
Sbjct: 54 EDDDDDDEDEDEDDEEEADL 73
Score = 26.4 bits (59), Expect = 9.5
Identities = 6/23 (26%), Positives = 16/23 (69%)
Query: 161 DKKGGEEEEEEEEEEEEEDNIEQ 183
+ +++E+E+E++EEE ++
Sbjct: 53 LEDDDDDDEDEDEDDEEEADLGP 75
>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
Length = 177
Score = 26.4 bits (59), Expect = 7.3
Identities = 9/20 (45%), Positives = 16/20 (80%)
Query: 166 EEEEEEEEEEEEEDNIEQEF 185
+++EEE EEEE +D++ + F
Sbjct: 149 DDDEEESEEEESDDDLLRLF 168
>gnl|CDD|237731 PRK14494, PRK14494, putative molybdopterin-guanine dinucleotide
biosynthesis protein MobB/FeS domain-containing protein
protein; Provisional.
Length = 229
Score = 26.5 bits (59), Expect = 7.9
Identities = 5/10 (50%), Positives = 6/10 (60%)
Query: 137 CGMCGYECAQ 146
CG CG+ C
Sbjct: 153 CGHCGFNCKG 162
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 26.9 bits (60), Expect = 8.0
Identities = 7/31 (22%), Positives = 16/31 (51%)
Query: 158 HKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
D ++E+EE++E +E + + + F
Sbjct: 158 EDDDDDDVDDEDEEKKEAKELEKLSDDDDFV 188
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 26.9 bits (60), Expect = 8.2
Identities = 14/74 (18%), Positives = 31/74 (41%), Gaps = 6/74 (8%)
Query: 121 HLDDHMRRHTGEKPHACGMCGYECAQGSQLMQ----HLRK--VHKVDKKGGEEEEEEEEE 174
H + G + + Y G +++ L K V ++ ++EEEE
Sbjct: 174 HKTKSKKVLVGYVRNLLRLTEYCPELGFDILELIIERLLKLDVEIQNELDDIDDEEEERV 233
Query: 175 EEEEDNIEQEFLFE 188
+ED+ +++ +F+
Sbjct: 234 LADEDDDDEDDMFD 247
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 26.6 bits (59), Expect = 8.5
Identities = 10/17 (58%), Positives = 12/17 (70%)
Query: 166 EEEEEEEEEEEEEDNIE 182
EEEEEE EE E+ +E
Sbjct: 149 LPEEEEEEPEEMEEELE 165
>gnl|CDD|178497 PLN02909, PLN02909, Endoglucanase.
Length = 486
Score = 26.8 bits (59), Expect = 8.7
Identities = 14/46 (30%), Positives = 21/46 (45%)
Query: 70 MVRPDASYTYMCYLCNYHTPTRKYMRTHIDTHNGEKPFRCALCAYS 115
MV + Y L N K+ + H T++GE PF C+ Y+
Sbjct: 194 MVFRHVDHKYSRRLLNKAKLLFKFAKAHKGTYDGECPFYCSYSGYN 239
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 26.9 bits (59), Expect = 8.7
Identities = 9/14 (64%), Positives = 10/14 (71%)
Query: 166 EEEEEEEEEEEEED 179
EE E EEE E+ ED
Sbjct: 381 EESEHEEEVEDYED 394
Score = 26.9 bits (59), Expect = 9.4
Identities = 8/33 (24%), Positives = 13/33 (39%)
Query: 156 KVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
V K+ K E E++ E ++ FE
Sbjct: 47 SVRKIGKFDYELAVYRVEDDSEGGTGDEAENFE 79
>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
Length = 586
Score = 26.7 bits (59), Expect = 8.7
Identities = 9/29 (31%), Positives = 18/29 (62%)
Query: 149 QLMQHLRKVHKVDKKGGEEEEEEEEEEEE 177
QL++ + + + G+E+EEE+ E+E
Sbjct: 50 QLLEQKEEKEEDAGEEGDEDEEEQGEDEH 78
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 26.2 bits (58), Expect = 9.0
Identities = 9/43 (20%), Positives = 27/43 (62%)
Query: 141 GYECAQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQ 183
GY+ ++ ++++ K+ ++ KK ++E++ ++ + ED E+
Sbjct: 13 GYDPSEYKEVVEEYEKLEELRKKLKAKKEDDSDDTDYEDEDEE 55
>gnl|CDD|224459 COG1542, COG1542, Uncharacterized conserved protein [Function
unknown].
Length = 593
Score = 26.8 bits (59), Expect = 9.7
Identities = 10/33 (30%), Positives = 18/33 (54%)
Query: 154 LRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFL 186
L KV +++++G E + EE +E +E E
Sbjct: 11 LLKVKELEERGEAIEVKALTEEFKERLLELELQ 43
>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger.
Length = 23
Score = 24.0 bits (52), Expect = 9.7
Identities = 9/23 (39%), Positives = 13/23 (56%)
Query: 107 FRCALCAYSARRKTHLDDHMRRH 129
+RC C + K+ L +HMR H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
>gnl|CDD|234635 PRK00104, scpA, segregation and condensation protein A; Reviewed.
Length = 242
Score = 26.3 bits (59), Expect = 9.8
Identities = 8/17 (47%), Positives = 13/17 (76%)
Query: 167 EEEEEEEEEEEEDNIEQ 183
EE EEEEE+ ++ +E+
Sbjct: 78 EEFEEEEEDPRQELLEK 94
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
kDa subunit. This family contains several DNA-directed
RNA polymerase 19 kDa polypeptides. The Poxvirus
DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
DNA-template-directed extension of the 3'-end of an RNA
strand by one nucleotide at a time.
Length = 167
Score = 26.2 bits (58), Expect = 9.8
Identities = 9/19 (47%), Positives = 14/19 (73%)
Query: 166 EEEEEEEEEEEEEDNIEQE 184
++ EE EEEEE+E++ E
Sbjct: 15 DDSEEYEEEEEDEEDAESL 33
>gnl|CDD|240275 PTZ00113, PTZ00113, proliferating cell nuclear antigen;
Provisional.
Length = 275
Score = 26.3 bits (58), Expect = 9.9
Identities = 18/56 (32%), Positives = 26/56 (46%), Gaps = 2/56 (3%)
Query: 133 KPHACGMCGYECAQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
K H G Y C + L ++ V KV K + + EE++DNI F+FE
Sbjct: 51 KLHDSGFSHYRCDRERALGINIASVTKVFKLCSNNDSVLIQSEEDKDNI--NFVFE 104
>gnl|CDD|227515 COG5188, PRP9, Splicing factor 3a, subunit 3 [RNA processing and
modification].
Length = 470
Score = 26.5 bits (58), Expect = 9.9
Identities = 20/100 (20%), Positives = 34/100 (34%), Gaps = 18/100 (18%)
Query: 101 HNGEKPFRCALCA---YSARRKTHLDDHMRRHTGEKPHACGMCGYECAQGS--------- 148
H + F C +C Y R + RH E H G+
Sbjct: 369 HGLDIEFECEICGNYVYYGRDR------FDRHFEEDRHIYGLECLGIKPSRVFKGITRIG 422
Query: 149 QLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
+ M+ ++ + E EE E+EE N+ + ++E
Sbjct: 423 EAMKLWNRMEESSSSLKVPTEYSEEFEDEEGNVMSKKVYE 462
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.134 0.434
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,278,775
Number of extensions: 795552
Number of successful extensions: 4955
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4136
Number of HSP's successfully gapped: 390
Length of query: 188
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 97
Effective length of database: 6,901,388
Effective search space: 669434636
Effective search space used: 669434636
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 56 (25.2 bits)