RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4652
(291 letters)
>gnl|CDD|224953 COG2042, COG2042, Uncharacterized conserved protein [Function
unknown].
Length = 179
Score = 179 bits (456), Expect = 2e-56
Identities = 86/251 (34%), Positives = 113/251 (45%), Gaps = 87/251 (34%)
Query: 1 MWDMKHCDPKKCSGRKLARHNLIKILKLGERFRGFPKEAKFYLDKFKWGRTFLTLNEELL 60
++ CDPKKC+G+KL R L ++L++G +F G
Sbjct: 16 IYHAGQCDPKKCTGKKLERFGLARLLRVGGKFPG-------------------------- 49
Query: 61 DLYKACDTVLFIFLMRICLTPVGVQCVSKADHDIIQENGAAVVDCSWAKIEETPFNKMKT 120
+ LTP + +S AD DI++ G VVDCSW ++E F K++
Sbjct: 50 ----------------VVLTPFAEKALSPADRDIVERFGITVVDCSWNRVERV-FKKLRG 92
Query: 121 NHPRLLPFLIAANPINYGKPCKLSCVEALAATFIIAGFPKEAKFYLDKFKWGRTFLTLNE 180
R LPFL+AANP+NYGKP KLS EALAA I GF EA
Sbjct: 93 REHRRLPFLVAANPVNYGKPFKLSSAEALAAALYIVGFKDEA-----------------S 135
Query: 181 ELLDLYKECDTIFYEDLDGERFPKEAKFYLDKFKWGRTFLTLNEELLDLYKECDTSTDVV 240
EL L KFKWG TFL LN+ELL+ Y + S +VV
Sbjct: 136 EL---------------------------LSKFKWGHTFLELNKELLEEYSNAEDSAEVV 168
Query: 241 KVQNEYLKKIE 251
++Q EYL+K E
Sbjct: 169 EIQEEYLEKGE 179
>gnl|CDD|112833 pfam04034, DUF367, Domain of unknown function (DUF367).
Length = 128
Score = 172 bits (438), Expect = 2e-54
Identities = 69/114 (60%), Positives = 83/114 (72%)
Query: 77 ICLTPVGVQCVSKADHDIIQENGAAVVDCSWAKIEETPFNKMKTNHPRLLPFLIAANPIN 136
+ LTP +S D DI++ +G AVVDCSWA++EE PF K++ H RLLPFL+AANP+N
Sbjct: 1 LVLTPFAEIALSPEDKDIVERSGIAVVDCSWARLEEVPFKKIRGRHERLLPFLVAANPVN 60
Query: 137 YGKPCKLSCVEALAATFIIAGFPKEAKFYLDKFKWGRTFLTLNEELLDLYKECD 190
YGKPCKLSCVEALAA I GF EA L KFKWG TFL LN+ELL+ Y C+
Sbjct: 61 YGKPCKLSCVEALAAALYIVGFKDEASELLSKFKWGHTFLELNKELLEAYSNCE 114
Score = 50.0 bits (120), Expect = 7e-08
Identities = 20/34 (58%), Positives = 23/34 (67%)
Query: 34 GFPKEAKFYLDKFKWGRTFLTLNEELLDLYKACD 67
GF EA L KFKWG TFL LN+ELL+ Y C+
Sbjct: 81 GFKDEASELLSKFKWGHTFLELNKELLEAYSNCE 114
>gnl|CDD|235024 PRK02287, PRK02287, hypothetical protein; Provisional.
Length = 171
Score = 168 bits (427), Expect = 3e-52
Identities = 80/247 (32%), Positives = 106/247 (42%), Gaps = 85/247 (34%)
Query: 5 KHCDPKKCSGRKLARHNLIKILKLGERFRGFPKEAKFYLDKFKWGRTFLTLNEELLDLYK 64
DPKKC+ RKL R L ++++ R P+ +
Sbjct: 10 NQDDPKKCTARKLVRFGLARLVRSI---RKIPRGS------------------------- 41
Query: 65 ACDTVLFIFLMRICLTPVGVQCVSKADHDIIQENGAAVVDCSWAKIEETPFNKMKTNHPR 124
I L P + +S AD DI+++ G +DCSW + E F K+ H R
Sbjct: 42 ------------IVLNPFAEKALSPADRDIVEKRGIVALDCSWNEAERV-FFKLLRGHHR 88
Query: 125 LLPFLIAANPINYGKPCKLSCVEALAATFIIAGFPKEAKFYLDKFKWGRTFLTLNEELLD 184
LPFL+AANP+NYGKP KLS VEALAA I GF +EA E++
Sbjct: 89 RLPFLVAANPVNYGKPFKLSSVEALAAALYILGFKEEA-----------------EKI-- 129
Query: 185 LYKECDTIFYEDLDGERFPKEAKFYLDKFKWGRTFLTLNEELLDLYKECDTSTDVVKVQN 244
L KFKWG TFL LN+E L+ Y S ++V++Q
Sbjct: 130 -------------------------LSKFKWGHTFLELNKEPLEAYARAKDSEEIVEIQK 164
Query: 245 EYLKKIE 251
EYL KI
Sbjct: 165 EYLGKIL 171
>gnl|CDD|217870 pfam04068, RLI, Possible Fer4-like domain in RNase L inhibitor,
RLI. Possible metal-binding domain in endoribonuclease
RNase L inhibitor. Found at the N-terminal end of RNase
L inhibitor proteins, adjacent to the 4Fe-4S binding
domain, fer4, pfam00037. Also often found adjacent to
the DUF367 domain pfam04034 in uncharacterized
proteins. The RNase L system plays a major role in the
anti-viral and anti-proliferative activities of
interferons, and could possibly play a more general
role in the regulation of RNA stability in mammalian
cells. Inhibitory activity requires
concentration-dependent association of RLI with RNase
L.
Length = 35
Score = 49.4 bits (119), Expect = 1e-08
Identities = 13/31 (41%), Positives = 22/31 (70%)
Query: 1 MWDMKHCDPKKCSGRKLARHNLIKILKLGER 31
+ D CDPKKC+GRK AR +++++ G++
Sbjct: 5 IVDFDQCDPKKCTGRKCARFCPVRLVRTGKK 35
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 40.0 bits (94), Expect = 0.001
Identities = 15/31 (48%), Positives = 19/31 (61%)
Query: 257 PNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
P + E E EEEEEEEEEE++ E+E
Sbjct: 303 PEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 36.9 bits (86), Expect = 0.010
Identities = 17/37 (45%), Positives = 19/37 (51%)
Query: 254 RNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGR 290
R P P P E EEEEE EEEEEE++ E
Sbjct: 296 RRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEE 332
Score = 31.5 bits (72), Expect = 0.50
Identities = 17/34 (50%), Positives = 20/34 (58%)
Query: 254 RNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
R P P E EEEEEEEEE EE++ E+E
Sbjct: 292 RRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEE 325
Score = 27.7 bits (62), Expect = 7.9
Identities = 18/46 (39%), Positives = 21/46 (45%), Gaps = 8/46 (17%)
Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAI--------EDEL 288
E E + P E EEEEEEE EEE +A E+EL
Sbjct: 304 EPEEEEEEEEEVPEEEEEEEEEEERTFEEEVRATVAEAIRLLEEEL 349
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 38.8 bits (90), Expect = 0.003
Identities = 17/23 (73%), Positives = 20/23 (86%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
SE EEEEEEEEEEEEE++ E+E
Sbjct: 863 SEEEEEEEEEEEEEEEEEEEEEE 885
Score = 37.7 bits (87), Expect = 0.006
Identities = 16/22 (72%), Positives = 20/22 (90%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
+SEEEEEEEEEEEEE++ E+E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEE 883
Score = 37.3 bits (86), Expect = 0.010
Identities = 16/22 (72%), Positives = 19/22 (86%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
E EEEEEEEEEEEEE++ E+E
Sbjct: 870 EEEEEEEEEEEEEEEEEEEENE 891
Score = 36.9 bits (85), Expect = 0.011
Identities = 16/22 (72%), Positives = 19/22 (86%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
E EEEEEEEEEEEEE++ E+E
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEE 888
Score = 36.9 bits (85), Expect = 0.011
Identities = 16/23 (69%), Positives = 19/23 (82%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDEL 288
E EEEEEEEEEEEEE++ E+ L
Sbjct: 872 EEEEEEEEEEEEEEEEEENEEPL 894
Score = 36.9 bits (85), Expect = 0.012
Identities = 16/22 (72%), Positives = 19/22 (86%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
E EEEEEEEEEEEEE++ E+E
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEE 887
Score = 36.9 bits (85), Expect = 0.012
Identities = 16/22 (72%), Positives = 19/22 (86%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
E EEEEEEEEEEEEE++ E+E
Sbjct: 868 EEEEEEEEEEEEEEEEEEEEEE 889
Score = 36.5 bits (84), Expect = 0.014
Identities = 16/22 (72%), Positives = 19/22 (86%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
E EEEEEEEEEEEEE++ E+E
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEE 886
Score = 36.1 bits (83), Expect = 0.023
Identities = 15/22 (68%), Positives = 18/22 (81%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
E EEEEEEEEEEEEE++ E+
Sbjct: 869 EEEEEEEEEEEEEEEEEEEEEN 890
Score = 36.1 bits (83), Expect = 0.023
Identities = 15/22 (68%), Positives = 18/22 (81%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
E EEEEEEEEEEEEE++ +E
Sbjct: 871 EEEEEEEEEEEEEEEEEEENEE 892
Score = 33.8 bits (77), Expect = 0.12
Identities = 14/22 (63%), Positives = 17/22 (77%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
E EEEEEEEEEEEE ++ + E
Sbjct: 876 EEEEEEEEEEEEEENEEPLSLE 897
Score = 30.7 bits (69), Expect = 1.0
Identities = 15/29 (51%), Positives = 16/29 (55%)
Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEE 279
EEE + E EEEEEEEEE EE
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 29.6 bits (66), Expect = 2.4
Identities = 13/23 (56%), Positives = 18/23 (78%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
S+ + EEEEEEEEEE++ E+E
Sbjct: 858 SDGGDSEEEEEEEEEEEEEEEEE 880
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 35.8 bits (83), Expect = 0.020
Identities = 23/58 (39%), Positives = 31/58 (53%), Gaps = 2/58 (3%)
Query: 228 DLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIE 285
+L + T DV ++ LKK +E+ P V + E EEEEEEEEEEEE +
Sbjct: 370 ELRDKIATEEDVKDIEE--LKKFLKEKEHPVVERWAAEEEEEEEEEEEEEEEPVAEVM 425
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 33.4 bits (77), Expect = 0.023
Identities = 11/19 (57%), Positives = 16/19 (84%)
Query: 263 PPSESEEEEEEEEEEEEED 281
+E E++EEEEEEEE++D
Sbjct: 64 AAAEEEKKEEEEEEEEDDD 82
Score = 32.6 bits (75), Expect = 0.044
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 263 PPSESEEEEEEEEEEEEED 281
+ EE++EEEEEEEE+D
Sbjct: 63 AAAAEEEKKEEEEEEEEDD 81
Score = 31.4 bits (72), Expect = 0.094
Identities = 12/19 (63%), Positives = 16/19 (84%)
Query: 263 PPSESEEEEEEEEEEEEED 281
+ +EEE++EEEEEEEED
Sbjct: 62 AAAAAEEEKKEEEEEEEED 80
Score = 30.3 bits (69), Expect = 0.26
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
+ + EEE++EEEE++ +D++G
Sbjct: 58 AAAAAAAAAEEEKKEEEEEEEEDDDMG 84
Score = 26.0 bits (58), Expect = 8.0
Identities = 8/25 (32%), Positives = 15/25 (60%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
+ + EEE++EEE++ ED+
Sbjct: 57 AAAAAAAAAAEEEKKEEEEEEEEDD 81
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 33.4 bits (77), Expect = 0.027
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
+ + EE++EEEEEEEE + E+E
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESEEEAA 98
Score = 33.4 bits (77), Expect = 0.029
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 259 VPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
+ + EE++EEEEEEE+K +E
Sbjct: 67 AAAAAAAAAAAEEKKEEEEEEEEKEESEE 95
Score = 32.2 bits (74), Expect = 0.068
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
E +EEEEEEEE+EE ++ LG
Sbjct: 76 AAEEKKEEEEEEEEKEESEEEAAAGLG 102
Score = 31.8 bits (73), Expect = 0.094
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
+ +EE++EEEEEEEE++++ E+
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEEEAAA 99
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 35.3 bits (81), Expect = 0.030
Identities = 14/18 (77%), Positives = 15/18 (83%)
Query: 263 PPSESEEEEEEEEEEEEE 280
SE +EEEEEEEEEEEE
Sbjct: 226 SESEEDEEEEEEEEEEEE 243
Score = 34.9 bits (80), Expect = 0.039
Identities = 15/21 (71%), Positives = 15/21 (71%)
Query: 261 DFPPSESEEEEEEEEEEEEED 281
D E EEEEEEEEEEEE D
Sbjct: 225 DSESEEDEEEEEEEEEEEEID 245
Score = 30.3 bits (68), Expect = 1.1
Identities = 13/17 (76%), Positives = 16/17 (94%)
Query: 265 SESEEEEEEEEEEEEED 281
S+SE EE+EEEEEEEE+
Sbjct: 224 SDSESEEDEEEEEEEEE 240
Score = 28.4 bits (63), Expect = 4.7
Identities = 10/17 (58%), Positives = 10/17 (58%)
Query: 260 PDFPPSESEEEEEEEEE 276
E EEEEEEEEE
Sbjct: 227 ESEEDEEEEEEEEEEEE 243
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 35.2 bits (81), Expect = 0.034
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 2/58 (3%)
Query: 228 DLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIE 285
DLY + T D + + L++ +E+ P V + E EEEEEEEEE E + +E
Sbjct: 369 DLYDKIATEEDATTI--DELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPME 424
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 34.7 bits (80), Expect = 0.047
Identities = 12/26 (46%), Positives = 19/26 (73%)
Query: 262 FPPSESEEEEEEEEEEEEEDKAIEDE 287
P + +EEEE+E+EEEEE++ +E
Sbjct: 342 VNPEQKDEEEEQEDEEEEEEEEEPEE 367
Score = 32.0 bits (73), Expect = 0.37
Identities = 15/27 (55%), Positives = 18/27 (66%)
Query: 264 PSESEEEEEEEEEEEEEDKAIEDELGR 290
E E+E+EEEEEEEEE + E E G
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEEGP 374
Score = 27.7 bits (62), Expect = 8.5
Identities = 11/20 (55%), Positives = 12/20 (60%)
Query: 261 DFPPSESEEEEEEEEEEEEE 280
+ E EEEEEE EE E E
Sbjct: 352 EQEDEEEEEEEEEPEEPEPE 371
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 34.5 bits (80), Expect = 0.048
Identities = 14/27 (51%), Positives = 19/27 (70%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
+ +EEEEEEEEEEEEE+ + E+
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPSEEEAAA 323
Score = 34.5 bits (80), Expect = 0.051
Identities = 14/27 (51%), Positives = 19/27 (70%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
+ + EEEEEEEEEEEE++ E+E
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEPSEEEAA 322
Score = 32.1 bits (74), Expect = 0.25
Identities = 14/27 (51%), Positives = 16/27 (59%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
E EEEEEEEEEEE ++ LG
Sbjct: 300 AAEEEEEEEEEEEEEEPSEEEAAAGLG 326
Score = 28.7 bits (65), Expect = 3.7
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
+++ EEEEEEEEEE++ E
Sbjct: 294 AQAQAAAAEEEEEEEEEEEEEEPSE 318
Score = 27.9 bits (63), Expect = 7.2
Identities = 14/35 (40%), Positives = 18/35 (51%)
Query: 245 EYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEE 279
E LK++ + + E EEEEEEE EEE
Sbjct: 286 EELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 35.0 bits (80), Expect = 0.051
Identities = 18/68 (26%), Positives = 31/68 (45%)
Query: 221 TLNEELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEE 280
+LNE+L D E D K + E + + + + +E+E+EEE +
Sbjct: 3902 SLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSD 3961
Query: 281 DKAIEDEL 288
D I+DE+
Sbjct: 3962 DVGIDDEI 3969
Score = 28.8 bits (64), Expect = 5.1
Identities = 17/75 (22%), Positives = 37/75 (49%), Gaps = 6/75 (8%)
Query: 220 LTLNEELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEE 279
L L+E+ D+ K+ D ++ +E ++ + E++ P + D P E +E+ ++++
Sbjct: 3993 LKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEP-MQDEDPLEENNTLDEDIQQDD 4051
Query: 280 -----EDKAIEDELG 289
ED +E G
Sbjct: 4052 FSDLAEDDEKMNEDG 4066
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 34.3 bits (78), Expect = 0.055
Identities = 14/22 (63%), Positives = 19/22 (86%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
ES+E+EEEEEEEEEE+ +E+
Sbjct: 134 ESDEDEEEEEEEEEEEAEVEEN 155
Score = 32.0 bits (72), Expect = 0.31
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDELG 289
E+EEEEEEEEEEE + E+E G
Sbjct: 134 ESDEDEEEEEEEEEEEAEVEENEQG 158
Score = 30.0 bits (67), Expect = 1.4
Identities = 12/22 (54%), Positives = 18/22 (81%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
+ +E+EEEEEEEEE++A +E
Sbjct: 133 DESDEDEEEEEEEEEEEAEVEE 154
Score = 27.7 bits (61), Expect = 8.0
Identities = 13/22 (59%), Positives = 17/22 (77%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
E E +E+EEEEEEEE++ E E
Sbjct: 132 EDESDEDEEEEEEEEEEEAEVE 153
Score = 27.4 bits (60), Expect = 8.8
Identities = 11/19 (57%), Positives = 13/19 (68%)
Query: 266 ESEEEEEEEEEEEEEDKAI 284
E EEEEEEE E EE ++
Sbjct: 141 EEEEEEEEEAEVEENEQGT 159
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 32.3 bits (74), Expect = 0.075
Identities = 13/25 (52%), Positives = 18/25 (72%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
+ + EEEEEEEEEEEE++ +E
Sbjct: 70 AAAAAAEEEEEEEEEEEEEEEESEE 94
Score = 31.6 bits (72), Expect = 0.11
Identities = 14/26 (53%), Positives = 20/26 (76%)
Query: 264 PSESEEEEEEEEEEEEEDKAIEDELG 289
+ EEEEEEEEEEEEE+++ E+ +
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEAMA 98
Score = 31.6 bits (72), Expect = 0.12
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
+ + EEEEEEEEEEE++ E E
Sbjct: 69 AAAAAAAEEEEEEEEEEEEEEEESE 93
Score = 30.8 bits (70), Expect = 0.24
Identities = 15/25 (60%), Positives = 20/25 (80%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
+ +EEEEEEEEEEEEE++ E+E
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEE 95
Score = 30.4 bits (69), Expect = 0.32
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
+ + EEEEEEEEEE++ E+
Sbjct: 68 AAAAAAAAEEEEEEEEEEEEEEEES 92
Score = 29.6 bits (67), Expect = 0.63
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDELG 289
E EEEEEEEEEEEE ++ LG
Sbjct: 78 EEEEEEEEEEEEEESEEEAMAGLG 101
Score = 28.1 bits (63), Expect = 2.3
Identities = 13/38 (34%), Positives = 20/38 (52%)
Query: 250 IEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
I++ P + + EEEEEEEEE++ E+E
Sbjct: 54 IKKAAAAPVAAAAAAAAAAAAAEEEEEEEEEEEEEEEE 91
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 32.5 bits (74), Expect = 0.089
Identities = 11/23 (47%), Positives = 14/23 (60%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
+ +EEEEEEEE E + DE
Sbjct: 91 TGHTRQEEEEEEEENEKQQQSDE 113
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 32.7 bits (75), Expect = 0.095
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
S+ +EE+EEEE++EE+D E E
Sbjct: 114 SDDDEEDEEEEDDEEDDDEDESE 136
Score = 31.9 bits (73), Expect = 0.17
Identities = 11/22 (50%), Positives = 20/22 (90%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
E +EEEE++EE+++ED++ E+E
Sbjct: 118 EEDEEEEDDEEDDDEDESEEEE 139
Score = 30.8 bits (70), Expect = 0.41
Identities = 12/23 (52%), Positives = 19/23 (82%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
ES+++EE+EEEE++E+ EDE
Sbjct: 112 DESDDDEEDEEEEDDEEDDDEDE 134
Score = 28.4 bits (64), Expect = 2.2
Identities = 9/23 (39%), Positives = 18/23 (78%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
+ E+EEEE++EE++++ E+E
Sbjct: 116 DDEEDEEEEDDEEDDDEDESEEE 138
Score = 27.7 bits (62), Expect = 4.5
Identities = 11/22 (50%), Positives = 18/22 (81%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
E E +++EE+EEEE+D+ +DE
Sbjct: 111 EDESDDDEEDEEEEDDEEDDDE 132
Score = 27.3 bits (61), Expect = 6.1
Identities = 9/23 (39%), Positives = 16/23 (69%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDEL 288
++EE+EEEE++EED ++
Sbjct: 114 SDDDEEDEEEEDDEEDDDEDESE 136
Score = 26.9 bits (60), Expect = 7.8
Identities = 11/23 (47%), Positives = 18/23 (78%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
SE +E +++EE+EEEED +D+
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDD 131
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 33.5 bits (76), Expect = 0.13
Identities = 18/50 (36%), Positives = 24/50 (48%), Gaps = 2/50 (4%)
Query: 244 NEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEE--EEEDKAIEDELGRI 291
+ +I+ E P VPD E EE E EE+EE E+E + DE
Sbjct: 120 SIAPPEIDPEPRKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEE 169
>gnl|CDD|152814 pfam12379, DUF3655, Protein of unknown function (DUF3655). This
domain family is found in viruses, and is approximately
70 amino acids in length. The family is found in
association with pfam08716, pfam01661, pfam05409,
pfam06471, pfam08717, pfam06478, pfam09401, pfam06460,
pfam08715, pfam08710.
Length = 70
Score = 29.6 bits (66), Expect = 0.29
Identities = 14/28 (50%), Positives = 18/28 (64%)
Query: 262 FPPSESEEEEEEEEEEEEEDKAIEDELG 289
+PP E E+ EE E EEEE ++ E E G
Sbjct: 6 YPPDEEEDCEEYECEEEEIEETCEHEYG 33
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 30.5 bits (69), Expect = 0.29
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 256 MPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
P +E + EE+EEE+++EE+K E+E
Sbjct: 65 APAAAAAAAAEEKAEEKEEEKKKEEEKEEEEE 96
Score = 29.8 bits (67), Expect = 0.49
Identities = 10/26 (38%), Positives = 19/26 (73%)
Query: 260 PDFPPSESEEEEEEEEEEEEEDKAIE 285
+ + EE+++EEE+EEEE++A+
Sbjct: 75 EEKAEEKEEEKKKEEEKEEEEEEALA 100
Score = 29.4 bits (66), Expect = 0.89
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
E EE+EEE+++EEE + E+E
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEE 97
Score = 28.2 bits (63), Expect = 1.9
Identities = 10/26 (38%), Positives = 20/26 (76%)
Query: 264 PSESEEEEEEEEEEEEEDKAIEDELG 289
++EE+EEE+++EEE+++ E+ L
Sbjct: 75 EEKAEEKEEEKKKEEEKEEEEEEALA 100
Score = 27.8 bits (62), Expect = 2.5
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
E EEE+++EEE+EEE++ LG
Sbjct: 77 KAEEKEEEKKKEEEKEEEEEEALAGLG 103
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 30.4 bits (69), Expect = 0.32
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
+E++E EEEE+EEE E+++ +D L
Sbjct: 79 AAAEADEAEEEEKEEEAEEESDDDMLF 105
Score = 28.9 bits (65), Expect = 1.2
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 264 PSESEEEEEEEEEEEEEDKAIEDE 287
+ +E +E EEEE+EEE + D+
Sbjct: 78 EAAAEADEAEEEEKEEEAEEESDD 101
Score = 28.5 bits (64), Expect = 1.8
Identities = 10/24 (41%), Positives = 17/24 (70%)
Query: 264 PSESEEEEEEEEEEEEEDKAIEDE 287
+E+ E +E EEEE+E++A E+
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEEES 99
Score = 27.8 bits (62), Expect = 3.1
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
+ +E EEEE+EEE ++ +D++
Sbjct: 78 EAAAEADEAEEEEKEEEAEEESDDDML 104
>gnl|CDD|240630 cd05305, L-AlaDH, Alanine dehydrogenase NAD-binding and catalytic
domains. Alanine dehydrogenase (L-AlaDH) catalyzes the
NAD-dependent conversion of pyruvate to L-alanine via
reductive amination. Like formate dehydrogenase and
related enzymes, L-AlaDH is comprised of 2 domains
connected by a long alpha helical stretch, each
resembling a Rossmann fold NAD-binding domain. The
NAD-binding domain is inserted within the linear
sequence of the more divergent catalytic domain. Ligand
binding and active site residues are found in the cleft
between the subdomains. L-AlaDH is typically hexameric
and is critical in carbon and nitrogen metabolism in
micro-organisms.
Length = 359
Score = 32.0 bits (74), Expect = 0.33
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 76 RICLTPVGVQCVSKADHDIIQENGA 100
R+ LTP GV + A H+++ E GA
Sbjct: 15 RVALTPAGVAELVAAGHEVLVEKGA 39
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 32.2 bits (74), Expect = 0.36
Identities = 24/136 (17%), Positives = 44/136 (32%), Gaps = 11/136 (8%)
Query: 160 KEAKFYLDKFKWGRTFLTLNEELLDLYKECDTIFYEDLDGERFPKEAKFYLDKFKWGRTF 219
K L + L NE+ + + + D E P E +
Sbjct: 747 APLKVVLVGSEELEDRLEANEDDIKGLANLEELEILSPDPEEPPVEVTAVVGGA----EL 802
Query: 220 LTLNEELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDF----PPSESEEEEEEEE 275
L+DL E + K + K+I+ + F P E+E+E+
Sbjct: 803 FLPLAGLIDLAAELA---RLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLA 859
Query: 276 EEEEEDKAIEDELGRI 291
E + + +E+ L +
Sbjct: 860 EYQVKLAKLEERLAVL 875
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 32.1 bits (73), Expect = 0.37
Identities = 11/37 (29%), Positives = 19/37 (51%)
Query: 252 EERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDEL 288
E + P P P E EEE +E ++ + +D D++
Sbjct: 270 EPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQI 306
Score = 30.2 bits (68), Expect = 1.3
Identities = 11/38 (28%), Positives = 16/38 (42%)
Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDEL 288
E E P P P EEE +E ++ + D E +
Sbjct: 268 EPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQ 305
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 31.9 bits (72), Expect = 0.42
Identities = 11/52 (21%), Positives = 29/52 (55%)
Query: 236 STDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
D++ ++ + + EE P + E+E+E++++EE++++ E+E
Sbjct: 126 DLDMIIIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEE 177
Score = 29.6 bits (66), Expect = 2.7
Identities = 14/22 (63%), Positives = 18/22 (81%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
+ EE++EEEEEEEEE K +DE
Sbjct: 165 DDEEDDEEEEEEEEEIKGFDDE 186
Score = 28.1 bits (62), Expect = 7.0
Identities = 9/23 (39%), Positives = 20/23 (86%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDEL 288
E E+E++++EE++EE++ E+E+
Sbjct: 158 EDEDEDDDDEEDDEEEEEEEEEI 180
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 32.0 bits (73), Expect = 0.43
Identities = 13/33 (39%), Positives = 17/33 (51%)
Query: 255 NMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
+ D E EEEEEEE+E+E K D+
Sbjct: 383 DANTERDDEEEEDEEEEEEEDEDEGPSKEHSDD 415
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 30.1 bits (67), Expect = 0.47
Identities = 11/25 (44%), Positives = 20/25 (80%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDELG 289
S+S+EEE++++E+EE+D +DE
Sbjct: 14 SDSDEEEDDDDEDEEDDDEDDDEDD 38
Score = 29.4 bits (65), Expect = 0.79
Identities = 8/22 (36%), Positives = 18/22 (81%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
+S+ + +EEE++++ED+ +DE
Sbjct: 11 DSDSDSDEEEDDDDEDEEDDDE 32
Score = 28.2 bits (62), Expect = 2.1
Identities = 9/23 (39%), Positives = 19/23 (82%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
S+S+ +EEE++++E+E+ ED+
Sbjct: 12 SDSDSDEEEDDDDEDEEDDDEDD 34
Score = 27.4 bits (60), Expect = 4.0
Identities = 8/23 (34%), Positives = 19/23 (82%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDEL 288
E ++++E+EE+++E+D +DE+
Sbjct: 19 EEDDDDEDEEDDDEDDDEDDDEV 41
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 31.8 bits (72), Expect = 0.51
Identities = 19/45 (42%), Positives = 26/45 (57%), Gaps = 3/45 (6%)
Query: 248 KKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKA---IEDELG 289
+ +EEE + ESEEEE E+EEEEEE +A E+E+
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Score = 30.3 bits (68), Expect = 1.5
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 260 PDFPPSESEEEEEEEEEEEEEDKAIEDE 287
P ESEEEE EEEEEEE++ E+E
Sbjct: 432 PSMASQESEEEESVEEEEEEEEEEEEEE 459
Score = 29.5 bits (66), Expect = 2.4
Identities = 16/59 (27%), Positives = 29/59 (49%)
Query: 231 KECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELG 289
K TS + + ++ ++ E + E +E EEEE E+EEE++ +E + G
Sbjct: 423 KASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNG 481
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 31.5 bits (71), Expect = 0.59
Identities = 14/40 (35%), Positives = 19/40 (47%)
Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGR 290
EER P+ P E++E+ EE EEE E L +
Sbjct: 297 PEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSK 336
>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking. Sec14-like
Golgi-trafficking domain The GOLD domain is always found
combined with lipid- or membrane-association domains.
Length = 136
Score = 30.0 bits (68), Expect = 0.60
Identities = 15/26 (57%), Positives = 18/26 (69%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDELGR 290
SES +EEEEEE EEEE + + E G
Sbjct: 52 SESSDEEEEEEAEEEEAETGDVEAGS 77
>gnl|CDD|183731 PRK12766, PRK12766, 50S ribosomal protein L32e; Provisional.
Length = 232
Score = 31.0 bits (70), Expect = 0.64
Identities = 14/24 (58%), Positives = 18/24 (75%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDEL 288
+E+E E+E EEEEEED +E EL
Sbjct: 65 TEAEVEDEGGEEEEEEDADVETEL 88
>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family. This
family includes the HABP4 family of hyaluronan-binding
proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
HABP4 has been observed to bind hyaluronan (a
glucosaminoglycan), but it is not known whether this is
its primary role in vivo. It has also been observed to
bind RNA, but with a lower affinity than that for
hyaluronan. PAI-1 mRNA-binding protein specifically
binds the mRNA of type-1 plasminogen activator inhibitor
(PAI-1), and is thought to be involved in regulation of
mRNA stability. However, in both cases, the sequence
motifs predicted to be important for ligand binding are
not conserved throughout the family, so it is not known
whether members of this family share a common function.
Length = 106
Score = 29.7 bits (67), Expect = 0.66
Identities = 17/49 (34%), Positives = 23/49 (46%), Gaps = 3/49 (6%)
Query: 234 DTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDK 282
D ++ + Q E +K E E + + E EEEE EEEEDK
Sbjct: 32 DEIAELTEEQGEEEEKNEVEEKQAVEEE---ANKEGVVEEEEVEEEEDK 77
>gnl|CDD|235562 PRK05687, fliH, flagellar assembly protein H; Validated.
Length = 246
Score = 30.7 bits (70), Expect = 0.77
Identities = 13/39 (33%), Positives = 15/39 (38%)
Query: 253 ERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
P P+ E E EE EEE E+EL I
Sbjct: 26 PPPEPPPPEPAAPPPEPPEPEEVPEEEAAPLTEEELEAI 64
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 30.6 bits (69), Expect = 0.80
Identities = 14/24 (58%), Positives = 18/24 (75%)
Query: 264 PSESEEEEEEEEEEEEEDKAIEDE 287
E +E+EEEEE EEEE++ EDE
Sbjct: 262 EKELKEDEEEEETEEEEEEEDEDE 285
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 30.4 bits (69), Expect = 0.90
Identities = 15/40 (37%), Positives = 22/40 (55%)
Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGR 290
EEE DF SE +E E ++EEE E++ E+ L +
Sbjct: 47 EEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREERLKK 86
>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and L12p.
Ribosomal proteins P1 and P2 are the eukaryotic proteins
that are functionally equivalent to bacterial L7/L12.
L12p is the archaeal homolog. Unlike other ribosomal
proteins, the archaeal L12p and eukaryotic P1 and P2 do
not share sequence similarity with their bacterial
counterparts. They are part of the ribosomal stalk
(called the L7/L12 stalk in bacteria), along with 28S
rRNA and the proteins L11 and P0 in eukaryotes (23S
rRNA, L11, and L10e in archaea). In bacterial ribosomes,
L7/L12 homodimers bind the extended C-terminal helix of
L10 to anchor the L7/L12 molecules to the ribosome.
Eukaryotic P1/P2 heterodimers and archaeal L12p
homodimers are believed to bind the L10 equivalent
proteins, eukaryotic P0 and archaeal L10e, in a similar
fashion. P1 and P2 (L12p, L7/L12) are the only proteins
in the ribosome to occur as multimers, always appearing
as sets of dimers. Recent data indicate that most
archaeal species contain six copies of L12p (three
homodimers), while eukaryotes have two copies each of P1
and P2 (two heterodimers). Bacteria may have four or six
copies (two or three homodimers), depending on the
species. As in bacteria, the stalk is crucial for
binding of initiation, elongation, and release factors
in eukaryotes and archaea.
Length = 105
Score = 29.2 bits (65), Expect = 0.97
Identities = 12/19 (63%), Positives = 16/19 (84%)
Query: 263 PPSESEEEEEEEEEEEEED 281
P ++EE +EEEEEEE+ED
Sbjct: 81 PAEKAEEAKEEEEEEEDED 99
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 30.4 bits (69), Expect = 0.99
Identities = 12/56 (21%), Positives = 19/56 (33%)
Query: 226 LLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEED 281
L + E + + + E L + + EEEEEEE+D
Sbjct: 248 LAAIGLESGFTFPLAEKIKEALANPSAAAAAAAAAAAAAAAAAAAPAEEEEEEEDD 303
Score = 27.3 bits (61), Expect = 9.3
Identities = 7/18 (38%), Positives = 11/18 (61%)
Query: 264 PSESEEEEEEEEEEEEED 281
+ + EEEEEEE++
Sbjct: 287 AAAAAAPAEEEEEEEDDM 304
>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal. This is the N terminal of
the DNA repair protein Rad54.
Length = 191
Score = 30.0 bits (68), Expect = 0.99
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 257 PNVPDFPPSESEEEEEEEEEEEEEDK 282
P V D P E E+ E+++E EE E K
Sbjct: 139 PTVDDKPKIEEEKAEKDQEPEESETK 164
>gnl|CDD|151958 pfam11521, TFIIE-A_C-term, C-terminal general transcription factor
TFIIE alpha. TFIIE is compiled of two subunits, alpha
and beta. This family of proteins are the C terminal
domain of the alpha subunit of the protein which is the
largest subunit and contains several functional domains
which are important for basal transcription and cell
growth. The C terminal end of the protein binds directly
to the amino-terminal PH domain of p62/Tfb1 (of IIH)
which is involved in the recruitment of the general
transcription factor IIH to the transcription
preinitiation complex. P53 competes for the same binding
site as TFIIE alpha which shows their structural
similarity. Like p53, TFIIE alpha 336-439 can activate
transcription in vivo.
Length = 86
Score = 28.6 bits (64), Expect = 0.99
Identities = 12/36 (33%), Positives = 23/36 (63%), Gaps = 1/36 (2%)
Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIED 286
E + + P+ P+ P +S +EEE++E++E E+ E
Sbjct: 7 ESDDDSPS-PEPPAGQSVDEEEDDEDDEFEEVEDEP 41
>gnl|CDD|225411 COG2855, COG2855, Predicted membrane protein [Function unknown].
Length = 334
Score = 30.3 bits (69), Expect = 1.1
Identities = 15/77 (19%), Positives = 25/77 (32%), Gaps = 9/77 (11%)
Query: 22 LIKILKLGERFRGFPKEAKFYLDKFKWG-------RTFLTLNEELLDLYKACDTVLFIFL 74
L IL+ K K + F G + + E++ T L
Sbjct: 232 LSLILRRQSGAGSGSKGLKITIPWFILGFLLVALLNSLGLIPAEVVSALVTLSTFLLAMA 291
Query: 75 M-RICLTPVGVQCVSKA 90
M + LT ++ + KA
Sbjct: 292 MAALGLT-THIKALKKA 307
>gnl|CDD|219735 pfam08170, POPLD, POPLD (NUC188) domain. This domain is found in
POP1-like nucleolar proteins.
Length = 92
Score = 28.8 bits (65), Expect = 1.1
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 1/37 (2%)
Query: 247 LKKIEEERNMPNVP-DFPPSESEEEEEEEEEEEEEDK 282
L++I E P P DFP +E+ EE EE EE E K
Sbjct: 31 LRQIHFESGQPFFPGDFPDTEAGWLEELEEREELEKK 67
>gnl|CDD|217634 pfam03601, Cons_hypoth698, Conserved hypothetical protein 698.
Length = 305
Score = 30.3 bits (69), Expect = 1.2
Identities = 12/50 (24%), Positives = 18/50 (36%), Gaps = 8/50 (16%)
Query: 39 AKFYLDKFKWG-------RTFLTLNEELLDLYKACDTVLFIFLMR-ICLT 80
+ + F G T + L E++ K T L + M I LT
Sbjct: 239 IRITIPWFVLGFLLVALLNTLVLLPPEVVSALKTLSTFLLLMAMAAIGLT 288
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 28.7 bits (64), Expect = 1.2
Identities = 11/17 (64%), Positives = 17/17 (100%)
Query: 265 SESEEEEEEEEEEEEED 281
+E+++EE++EEEEEEED
Sbjct: 88 AEAKKEEKKEEEEEEED 104
Score = 28.3 bits (63), Expect = 2.0
Identities = 10/17 (58%), Positives = 14/17 (82%)
Query: 265 SESEEEEEEEEEEEEED 281
+EE++EEEEEEE+D
Sbjct: 89 EAKKEEKKEEEEEEEDD 105
Score = 27.2 bits (60), Expect = 5.0
Identities = 12/23 (52%), Positives = 19/23 (82%), Gaps = 3/23 (13%)
Query: 267 SEEEEEEEEEEEEEDKAIEDELG 289
+E ++EE++EEEEE+ ED+LG
Sbjct: 88 AEAKKEEKKEEEEEE---EDDLG 107
>gnl|CDD|132829 cd07296, PX_PLD1, The phosphoinositide binding Phox Homology domain
of Phospholipase D1. The PX domain is a
phosphoinositide binding module present in many proteins
with diverse functions such as cell signaling, vesicular
trafficking, protein sorting, and lipid modification,
among others. Phospholipase D (PLD) catalyzes the
hydrolysis of the phosphodiester bond of
phosphatidylcholine to generate membrane-bound
phosphatidic acid and choline. PLDs are implicated in
many cellular functions like signaling, cytoskeletal
reorganization, vesicular transport, stress responses,
and the control of differentiation, proliferation, and
survival. PLD1 contains PX and Pleckstrin Homology (PH)
domains in addition to the catalytic domain. It acts as
an effector of Rheb in the signaling of the mammalian
target of rapamycin (mTOR), a serine/threonine protein
kinase that transduces nutrients and other stimuli to
regulate many cellular processes. PLD1 also regulates
the secretion of the procoagulant von Willebrand factor
(VWF) in endothelial cells. The PX domain is involved in
targeting of proteins to PI-enriched membranes, and may
also be involved in protein-protein interaction. The PX
domain of PLD1 specifically binds to
phosphatidylinositol-3,4,5-trisphosphate [PI(3,4,5)P3],
which enables PLD1 to mediate signals via the ERK1/2
pathway.
Length = 135
Score = 29.1 bits (65), Expect = 1.3
Identities = 15/43 (34%), Positives = 22/43 (51%), Gaps = 5/43 (11%)
Query: 249 KIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
K E R+MP++P EEE EE+ K +ED L ++
Sbjct: 79 KRGEPRHMPSLP-----RGAEEEAREEQFSSRRKQLEDYLSKL 116
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 28.4 bits (64), Expect = 1.4
Identities = 9/19 (47%), Positives = 15/19 (78%)
Query: 263 PPSESEEEEEEEEEEEEED 281
++ EE++EEEEEE ++D
Sbjct: 79 AEAKKEEKKEEEEEESDDD 97
Score = 27.3 bits (61), Expect = 3.5
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 263 PPSESEEEEEEEEEEEEED 281
+EE++EEEEEE +D
Sbjct: 78 AAEAKKEEKKEEEEEESDD 96
Score = 27.3 bits (61), Expect = 3.8
Identities = 8/27 (29%), Positives = 19/27 (70%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
+ + E ++EE++EEE++ +D++G
Sbjct: 73 AAAAAAAEAKKEEKKEEEEEESDDDMG 99
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 29.1 bits (65), Expect = 1.6
Identities = 18/60 (30%), Positives = 31/60 (51%)
Query: 228 DLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
D+ KE + D + +E ++ +EE E E+EEEEEE+EE++ ++D
Sbjct: 35 DIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDI 94
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 29.3 bits (66), Expect = 1.6
Identities = 14/22 (63%), Positives = 18/22 (81%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
E +EEEEEEEEEE+ED +D+
Sbjct: 173 EKDEEEEEEEEEEDEDFDDDDD 194
Score = 28.6 bits (64), Expect = 3.3
Identities = 16/54 (29%), Positives = 29/54 (53%)
Query: 234 DTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
T+ + +E L +E++ D + ++EEEEEEEEEE++ +D+
Sbjct: 140 GLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDD 193
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 29.8 bits (67), Expect = 1.8
Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 15/65 (23%)
Query: 224 EELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKA 283
EE D +E D D + E ++ E E SES+ E+ EE+E+E++ +A
Sbjct: 649 EEDEDDLEETDDDED----ECEAIEDSESE-----------SESDGEDGEEDEQEDDAEA 693
Query: 284 IEDEL 288
E +
Sbjct: 694 NEGVV 698
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 29.8 bits (68), Expect = 1.9
Identities = 17/46 (36%), Positives = 22/46 (47%), Gaps = 6/46 (13%)
Query: 249 KIEEERNMPNVPDFPPSESEEEEEEEEEEEEED------KAIEDEL 288
+ N P SE EE +++E+EEEEED A E EL
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESEL 216
Score = 28.2 bits (64), Expect = 6.8
Identities = 10/41 (24%), Positives = 19/41 (46%)
Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
P E +++E+EEEEE+E D ++ + +
Sbjct: 176 PNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESEL 216
>gnl|CDD|216284 pfam01074, Glyco_hydro_38, Glycosyl hydrolases family 38 N-terminal
domain. Glycosyl hydrolases are key enzymes of
carbohydrate metabolism.
Length = 269
Score = 29.5 bits (67), Expect = 2.1
Identities = 12/46 (26%), Positives = 19/46 (41%), Gaps = 4/46 (8%)
Query: 223 NEELLDLYKECDTSTDVVKVQ----NEYLKKIEEERNMPNVPDFPP 264
+ELL+ + + KVQ ++Y +EE DFP
Sbjct: 222 TDELLEYINRWNELPGLPKVQFGTPSDYFDALEEATWPTKSGDFPT 267
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 29.2 bits (66), Expect = 2.1
Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 8/46 (17%)
Query: 245 EYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGR 290
E L+K EEE + E E+E+EE+E+E++ +D+ G
Sbjct: 281 ELLEKAEEEEE--------EDDYSESEDEDEEDEDEEEEEDDDEGD 318
Score = 28.1 bits (63), Expect = 6.0
Identities = 12/37 (32%), Positives = 20/37 (54%)
Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
+EE + E E++ E E+E+EED+ E+E
Sbjct: 275 DEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEE 311
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 29.6 bits (67), Expect = 2.1
Identities = 7/36 (19%), Positives = 16/36 (44%)
Query: 256 MPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
P P+ P E++++ E+++ +D R
Sbjct: 261 PPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRA 296
Score = 29.6 bits (67), Expect = 2.4
Identities = 12/36 (33%), Positives = 21/36 (58%)
Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIED 286
E P P P + + E+EEE+++ ED+A+E+
Sbjct: 264 PPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299
Score = 28.5 bits (64), Expect = 5.2
Identities = 12/41 (29%), Positives = 20/41 (48%)
Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
+ P PP E++++ E+EEE+D A + L I
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEI 300
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 29.2 bits (65), Expect = 2.4
Identities = 12/31 (38%), Positives = 16/31 (51%)
Query: 257 PNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
P PD E E EE + EEEED+ + +
Sbjct: 169 PAQPDVDNEEEERLEESDGREEEEDEEVGSD 199
>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
Length = 390
Score = 29.1 bits (66), Expect = 2.5
Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 5/55 (9%)
Query: 238 DVVKVQNEYLKKIEE---ERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELG 289
D ++ +N++L KI+E E+ P P S EEE E EEE K +E G
Sbjct: 233 DFIRQKNKWLAKIKEWVGEKGG--GPIIPYSAEFEEELAELGSEEERKEYLEEYG 285
>gnl|CDD|217192 pfam02703, Adeno_E1A, Early E1A protein. This is a family of
adenovirus early E1A proteins. The E1A protein is 32 kDa
it can however be cleaved to yield the 28 kDa protein.
The E1A protein is responsible for the transcriptional
activation of the early genes with in the viral genome
at the start of the infection process as well as some
cellular genes.
Length = 264
Score = 28.9 bits (65), Expect = 2.5
Identities = 10/22 (45%), Positives = 13/22 (59%)
Query: 262 FPPSESEEEEEEEEEEEEEDKA 283
FPPS+SE+EE E+ E
Sbjct: 117 FPPSDSEDEEREQGTSFESAAN 138
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 29.3 bits (66), Expect = 2.5
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
P E +E+E+EEEE+ E+E
Sbjct: 364 PIDFEEVDEDEDEEEEQRSDEHEEE 388
Score = 28.2 bits (63), Expect = 5.9
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 261 DFPPSESEEEEEEEEEEEEEDKAIEDE 287
D E +E+E+EEEE+ D+ E+E
Sbjct: 363 DPIDFEEVDEDEDEEEEQRSDEHEEEE 389
Score = 27.8 bits (62), Expect = 8.4
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 249 KIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
K+ ++R P E+E+EEEE+ +E + E E
Sbjct: 353 KMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGE 391
Score = 27.8 bits (62), Expect = 8.5
Identities = 8/28 (28%), Positives = 15/28 (53%)
Query: 262 FPPSESEEEEEEEEEEEEEDKAIEDELG 289
E +E+E+EEEE+ ++ E+
Sbjct: 365 IDFEEVDEDEDEEEEQRSDEHEEEEGED 392
Score = 27.4 bits (61), Expect = 9.2
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 267 SEEEEEEEEEEEEEDKAIEDELGR 290
EE +E+E+EEEE ++ E E
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEE 389
>gnl|CDD|217714 pfam03755, YicC_N, YicC-like family, N-terminal region. Family of
bacterial proteins. Although poorly characterized, the
members of this protein family have been demonstrated to
play a role in stationary phase survival. These proteins
are not essential during stationary phase.
Length = 159
Score = 28.3 bits (64), Expect = 2.8
Identities = 21/75 (28%), Positives = 27/75 (36%), Gaps = 19/75 (25%)
Query: 220 LTLNEELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPD------FPPSESEEEEEE 273
L +NE LL Y LK++ EE + P EEEE+
Sbjct: 77 LKVNEALLKAYLAA-------------LKELAEELGLAAPISLDDLLRLPGVLEIEEEED 123
Query: 274 EEEEEEEDKAIEDEL 288
EEE E A+ L
Sbjct: 124 EEELEALWAALLAAL 138
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 28.6 bits (64), Expect = 2.8
Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 4/38 (10%)
Query: 254 RNMPNV----PDFPPSESEEEEEEEEEEEEEDKAIEDE 287
RN+P + + S+ EEEE EEE +E ++ E
Sbjct: 79 RNLPAMWKIFRELSSSDDEEEETEEESTDETEQEDPPE 116
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 29.2 bits (65), Expect = 2.9
Identities = 21/91 (23%), Positives = 32/91 (35%), Gaps = 13/91 (14%)
Query: 201 RFPKEAKFYLDKFKWGRT-FLTLNEELLDLYKECDTSTDVVKVQNEYLKK---IEEERNM 256
RF F K+ + LN E + + +Y+ + E+
Sbjct: 314 RFLDNLDFLQFLSKYIGCLWRNLNNE---------KWGNEEAERKDYILDSSSVPLEKQF 364
Query: 257 PNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
++ F E E EE E EEE + EDE
Sbjct: 365 DDILYFEKMEIENRNPEESEHEEEVEDYEDE 395
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 29.0 bits (65), Expect = 2.9
Identities = 11/48 (22%), Positives = 23/48 (47%)
Query: 240 VKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
+K EY ++ E+ + + E+ + E EE E E+ + E++
Sbjct: 370 IKDSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEED 417
>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
represents the eukaryotic large ribosomal protein P2.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P2 is located in the L12 stalk, with proteins
P1, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers). Bacteria may have four or six copies
of L7/L12 (two or three homodimers) depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plants have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site in the C-terminal region
of P2. TCS inactivates the ribosome by depurinating a
specific adenine in the sarcin-ricin loop of 28S rRNA.
Length = 109
Score = 27.6 bits (62), Expect = 3.0
Identities = 8/19 (42%), Positives = 13/19 (68%)
Query: 263 PPSESEEEEEEEEEEEEED 281
+ +EE++EE EEE +D
Sbjct: 83 AAAAKKEEKKEESEEESDD 101
Score = 27.2 bits (61), Expect = 4.0
Identities = 8/22 (36%), Positives = 14/22 (63%)
Query: 260 PDFPPSESEEEEEEEEEEEEED 281
++ EE++EE EEE ++D
Sbjct: 81 AAAAAAKKEEKKEESEEESDDD 102
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 29.0 bits (65), Expect = 3.0
Identities = 12/43 (27%), Positives = 27/43 (62%)
Query: 245 EYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
+Y KK+ E ++ + + ++++EE+ ++EE E++ K DE
Sbjct: 380 DYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADE 422
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 28.3 bits (64), Expect = 3.2
Identities = 14/61 (22%), Positives = 28/61 (45%), Gaps = 8/61 (13%)
Query: 235 TST-------DVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
TST V K+ + Y K+ ++++ E +++ EE +E+ KA++
Sbjct: 62 TSTKVDISGVKVPKLNDAYFKRKKKKKK-KKKEGEFFEEKKKKYVVSEERKEDQKAVDAA 120
Query: 288 L 288
L
Sbjct: 121 L 121
>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH. This
bacillus clade of FliH proteins is not found by the Pfam
FliH model pfam02108, but is closely related to the
sequences identified by that model. Sequences identified
by this model are observed in flagellar operons in an
analogous position relative to other flagellar operon
genes.
Length = 255
Score = 28.6 bits (64), Expect = 3.2
Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 1/34 (2%)
Query: 258 NVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
V D P +E E +EE+E E+ +KA E E +I
Sbjct: 24 QVTDVPETEQELADEEQEFEQILEKA-EAEAAQI 56
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 29.0 bits (65), Expect = 3.3
Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 15/67 (22%)
Query: 230 YKECDTSTDVVKVQNEYL---------KKIEEERNMPNVPDFPPSESEEEEEEEEEEEEE 280
Y E ++VK +E++ K++E+E VPD E E+EE++EEEE+
Sbjct: 2 YLEERRIKELVKKHSEFINYPIYLWVEKEVEKE-----VPD-EEEEEEKEEKKEEEEKTT 55
Query: 281 DKAIEDE 287
DK E +
Sbjct: 56 DKEEEVD 62
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 28.3 bits (64), Expect = 3.5
Identities = 9/24 (37%), Positives = 18/24 (75%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDELG 289
E E++++++E+E+ED E +LG
Sbjct: 51 EDLEDDDDDDEDEDEDDEEEADLG 74
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 28.6 bits (64), Expect = 3.7
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 9/60 (15%)
Query: 241 KVQNEYLKKIEEERN----MPNVPDFPPSESEEEE-----EEEEEEEEEDKAIEDELGRI 291
K + + LKK+ E+ + M VP+ P E E EE ++EEE+E+ + + GR
Sbjct: 292 KEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRR 351
Score = 28.3 bits (63), Expect = 5.9
Identities = 17/45 (37%), Positives = 24/45 (53%), Gaps = 2/45 (4%)
Query: 245 EYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEED--KAIEDE 287
E E+E P S+SEEE EE+E+E+ + K +EDE
Sbjct: 261 EPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDE 305
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression in
addition to DNA-based checkpoints.
Length = 76
Score = 26.8 bits (60), Expect = 3.7
Identities = 8/16 (50%), Positives = 15/16 (93%)
Query: 266 ESEEEEEEEEEEEEED 281
ESE+EE+EEE+++++
Sbjct: 58 ESEDEEDEEEDDDDDM 73
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 29.0 bits (64), Expect = 3.7
Identities = 16/50 (32%), Positives = 27/50 (54%)
Query: 241 KVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGR 290
K++ E LKK EEE+ +E +++ EE ++ EEE+K E +
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAK 1668
Score = 27.8 bits (61), Expect = 8.2
Identities = 19/68 (27%), Positives = 34/68 (50%)
Query: 224 EELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKA 283
E L +E + ++ K + E KK EE + E+++E EE++++ EE K
Sbjct: 1692 EALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751
Query: 284 IEDELGRI 291
E+E +I
Sbjct: 1752 DEEEKKKI 1759
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
kDa subunit. This family contains several DNA-directed
RNA polymerase 19 kDa polypeptides. The Poxvirus
DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
DNA-template-directed extension of the 3'-end of an RNA
strand by one nucleotide at a time.
Length = 167
Score = 28.1 bits (63), Expect = 3.9
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 256 MPNVPDFPPSESEEEEEEEEEEEEED 281
M + D ES++++ EE EEEEED
Sbjct: 1 MEDSDDIIDYESDDDDSEEYEEEEED 26
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 28.8 bits (65), Expect = 3.9
Identities = 9/23 (39%), Positives = 16/23 (69%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
E+EEEEE+ ++ED+ +D+
Sbjct: 318 QGEEDEEEEEDGVDDEDEEDDDD 340
Score = 28.0 bits (63), Expect = 7.9
Identities = 18/47 (38%), Positives = 23/47 (48%), Gaps = 14/47 (29%)
Query: 245 EYLKKIEEER--NMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELG 289
E LKK+E ER M EEE++EEEE+ K D+L
Sbjct: 269 ERLKKLEAERLRRM------------RGEEEDDEEEEDSKESADDLD 303
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 28.5 bits (64), Expect = 4.0
Identities = 14/20 (70%), Positives = 14/20 (70%)
Query: 264 PSESEEEEEEEEEEEEEDKA 283
P E EEE EE EEE EED A
Sbjct: 150 PEEEEEEPEEMEEELEEDAA 169
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 27.6 bits (62), Expect = 4.5
Identities = 14/41 (34%), Positives = 21/41 (51%)
Query: 248 KKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDEL 288
KK+ + + E E E EE +EEE+ D+ +E EL
Sbjct: 88 KKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKEL 128
>gnl|CDD|146371 pfam03700, Sorting_nexin, Sorting nexin, N-terminal domain. These
proteins bins to the cytoplasmic domain of plasma
membrane receptors. and are involved in endocytic
protein trafficking. The N-terminal domain appears to be
specific to sorting nexins 1 and 2.
Length = 127
Score = 27.6 bits (61), Expect = 4.5
Identities = 12/24 (50%), Positives = 13/24 (54%)
Query: 252 EERNMPNVPDFPPSESEEEEEEEE 275
ER P PD P E EE EE E+
Sbjct: 2 AEREPPPFPDGEPEEPEELEEGED 25
>gnl|CDD|222049 pfam13330, Mucin2_WxxW, Mucin-2 protein WxxW repeating region.
This family is repeating region found on mucins 2 and 5.
The function is not known, but the repeat can be present
in up to 32 copies, as in a member from Branchiostoma
floridae. The region carries a highly conserved WxxW
sequence motif and also has at least six well conserved
cysteine residues.
Length = 88
Score = 26.9 bits (60), Expect = 4.7
Identities = 7/32 (21%), Positives = 9/32 (28%), Gaps = 1/32 (3%)
Query: 76 RICLTPVGVQC-VSKADHDIIQENGAAVVDCS 106
C P ++C V G V C
Sbjct: 29 SFCEMPTDIECRVVGTPQVPASMTGQNFVTCD 60
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 28.5 bits (63), Expect = 4.9
Identities = 15/84 (17%), Positives = 34/84 (40%)
Query: 200 ERFPKEAKFYLDKFKWGRTFLTLNEELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNV 259
E F +E L + + EE + E D +++N+ + + ++E+ V
Sbjct: 242 ENFKEEPGEPLSRPALKKEKQGAEEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEV 301
Query: 260 PDFPPSESEEEEEEEEEEEEEDKA 283
+ + E + + +E EE +
Sbjct: 302 ENAEAHKEEVQSDRPDEIGEEKEE 325
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 28.2 bits (63), Expect = 4.9
Identities = 10/43 (23%), Positives = 21/43 (48%)
Query: 245 EYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
E + E + N + P E E++++ EEEE ++ ++
Sbjct: 211 ELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSED 253
Score = 27.9 bits (62), Expect = 6.0
Identities = 19/82 (23%), Positives = 38/82 (46%), Gaps = 16/82 (19%)
Query: 224 EELLDLYKE---------CDTSTDVVKVQ-------NEYLKKIEEERNMPNVPDFPPSES 267
E+L+DL++ D V Q + L ++ + + P+ SE
Sbjct: 164 EKLVDLWRPWIEDKAGKDLDRLAACVDDQQAFARVVRDMLSSMDMAEELGDEPESADSED 223
Query: 268 EEEEEEEEEEEEEDKAIEDELG 289
E+E++ +E+E++D+ E+E G
Sbjct: 224 NEDEDDPKEDEDDDQGEEEESG 245
Score = 27.5 bits (61), Expect = 8.1
Identities = 9/27 (33%), Positives = 14/27 (51%)
Query: 261 DFPPSESEEEEEEEEEEEEEDKAIEDE 287
D P + ++++ EEEE D ED
Sbjct: 228 DDPKEDEDDDQGEEEESGSSDSLSEDS 254
>gnl|CDD|227277 COG4941, COG4941, Predicted RNA polymerase sigma factor containing
a TPR repeat domain [Transcription].
Length = 415
Score = 28.6 bits (64), Expect = 4.9
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDELGRI 291
S+E+EE EE E +D+ I D+ R+
Sbjct: 87 LSDEDEEMEEAEALDDEHIRDDRLRL 112
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 28.7 bits (64), Expect = 5.0
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 264 PSESEEEEEEEEEEEEEDKAIEDE 287
P +E+ E EE E D++ EDE
Sbjct: 241 PDNNEDSEAGREESEGSDESEEDE 264
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 27.8 bits (62), Expect = 5.0
Identities = 11/47 (23%), Positives = 22/47 (46%)
Query: 242 VQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDEL 288
++ + K+ E + + E E+EE E+ EE ++ +EL
Sbjct: 13 MEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEEL 59
>gnl|CDD|237061 PRK12327, nusA, transcription elongation factor NusA; Provisional.
Length = 362
Score = 28.3 bits (64), Expect = 5.1
Identities = 13/23 (56%), Positives = 16/23 (69%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
SESE EEE EEE E+ +E+E
Sbjct: 340 SESEAEEEGAIEEEVEEDVVEEE 362
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 27.7 bits (62), Expect = 5.2
Identities = 9/24 (37%), Positives = 14/24 (58%)
Query: 264 PSESEEEEEEEEEEEEEDKAIEDE 287
P E E++E +EEE + +DE
Sbjct: 17 PEEDEDDESSDEEEVDLPDDEQDE 40
>gnl|CDD|220438 pfam09845, DUF2072, Zn-ribbon containing protein (DUF2072). This
archaeal protein has no known function.
Length = 121
Score = 27.0 bits (60), Expect = 5.3
Identities = 16/42 (38%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 251 EEERNMPNVP-DFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
E +R P+VP + P E E E +EE E ++D+ +E RI
Sbjct: 35 ERKRPKPDVPEEKPVEEIERETDEELLEVKQDREQGEEGERI 76
>gnl|CDD|129772 TIGR00689, rpiB_lacA_lacB, sugar-phosphate isomerase,
RpiB/LacA/LacB family. Proteins of known function in
this family act as sugar (pentose and/or
hexose)-phosphate isomerases, including the LacA and
LacB subunits of galactose-6-phosphate isomerases from
Gram-positive bacteria and RpiB. RpiB is the second
ribose phosphate isomerase of E. coli. It lacks homology
to RpiA, its inducer is unknown (but is not ribose), and
it can be replaced by the homologous
galactose-6-phosphate isomerase of Streptococcus mutans,
all of which suggests that the ribose phosphate
isomerase activity of RpiB is a secondary function. On
the other hand, there appear to be a significant number
of species which contain rpiB, lack rpiA and seem to
require rpi activity in order to complete the pentose
phosphate pathway.
Length = 144
Score = 27.3 bits (61), Expect = 5.4
Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 2/32 (6%)
Query: 16 KLAR-HNLIKILKLGERFRGFPKEAKFYLDKF 46
LAR HN +L LG R G + A +D F
Sbjct: 91 ALARQHNDANVLCLGSRVVG-VELALSIVDAF 121
>gnl|CDD|203922 pfam08377, MAP2_projctn, MAP2/Tau projection domain. This domain
is found in the MAP2/Tau family of proteins which
includes MAP2, MAP4, Tau, and their homologs. All
isoforms contain a conserved C-terminal domain
containing tubulin-binding repeats (pfam00418), and a
N-terminal projection domain of varying size. This
domain has a net negative charge and exerts a long-range
repulsive force. This provides a mechanism that can
regulate microtubule spacing which might facilitate
efficient organelle transport.
Length = 1134
Score = 28.6 bits (63), Expect = 5.4
Identities = 23/67 (34%), Positives = 31/67 (46%), Gaps = 16/67 (23%)
Query: 232 ECDTSTD-VVKVQNEYLK-KIEEERNMPNVPDFP--------------PSESEEEEEEEE 275
E D+S D +VQ E+++ EE P++P P P+E EEEE E
Sbjct: 811 EADSSIDEKAEVQMEFIQLPKEESTETPDIPAIPSDVTQPQPEAIVSEPAEVRGEEEEIE 870
Query: 276 EEEEEDK 282
E E DK
Sbjct: 871 AEGEYDK 877
>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
Length = 112
Score = 27.3 bits (59), Expect = 5.4
Identities = 16/53 (30%), Positives = 29/53 (54%)
Query: 235 TSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
++ D +K +E++KK E+ P P+ + E+ E E+E++ E EDE
Sbjct: 47 SNEDEMKTTHEHIKKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDE 99
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 25.4 bits (56), Expect = 5.9
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDELGR 290
E E+ E+EEE E+EE+ + E+ R
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVKR 46
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 26.8 bits (59), Expect = 5.9
Identities = 15/37 (40%), Positives = 19/37 (51%)
Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
E E N D E EE +E++EEEE E + E E
Sbjct: 38 ENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGE 74
>gnl|CDD|215313 PLN02575, PLN02575, haloacid dehalogenase-like hydrolase.
Length = 381
Score = 27.9 bits (62), Expect = 6.1
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 260 PDFPPSESEEEEEEEEEEEEEDKAIEDEL 288
P+F P E E E E+EE+ E A D++
Sbjct: 351 PEFGPPEPELEMEKEEDRELPSSAGVDDI 379
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 28.0 bits (63), Expect = 6.2
Identities = 16/48 (33%), Positives = 26/48 (54%)
Query: 240 VKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
V+ K+ EE++ + EEEEEE+E++EEE + E+E
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 26.6 bits (59), Expect = 6.3
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 266 ESEEEEEEEEEEEEEDKAIEDELGRI 291
E ++EEEEEEEEE++ + + DE I
Sbjct: 7 EVDDEEEEEEEEEDDLEDLSDEDEFI 32
Score = 26.3 bits (58), Expect = 7.7
Identities = 14/24 (58%), Positives = 18/24 (75%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDEL 288
+ EEEEEEEEE++ ED + EDE
Sbjct: 8 VDDEEEEEEEEEDDLEDLSDEDEF 31
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 27.8 bits (62), Expect = 7.2
Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 11/42 (26%)
Query: 247 LKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDEL 288
L+KI++ER E+E EEEE+ EE+KA E+E+
Sbjct: 151 LEKIKKER-----------AEEKEREEEEKAAEEEKAREEEI 181
Score = 27.4 bits (61), Expect = 8.4
Identities = 8/27 (29%), Positives = 19/27 (70%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDELGRI 291
S+S +++ ++++ E+E A+ EL +I
Sbjct: 128 SDSSDDDSDDDDSEDETAALLRELEKI 154
>gnl|CDD|222891 PHA02585, 16, small terminase protein; Provisional.
Length = 161
Score = 27.1 bits (60), Expect = 7.2
Identities = 18/69 (26%), Positives = 31/69 (44%), Gaps = 5/69 (7%)
Query: 221 TLNEELLDLYKECD--TSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEE 278
N+E+L ++KE T + + ++ N+ N F S +E +E + E
Sbjct: 94 NTNKEILKIHKEMKDITEEQTGTKKG---EPQGDQMNIENATVFMGSPTELMDEVGDAYE 150
Query: 279 EEDKAIEDE 287
EE+K I E
Sbjct: 151 EEEKVINGE 159
>gnl|CDD|224250 COG1331, COG1331, Highly conserved protein containing a thioredoxin
domain [Posttranslational modification, protein
turnover, chaperones].
Length = 667
Score = 28.1 bits (63), Expect = 7.3
Identities = 28/99 (28%), Positives = 40/99 (40%), Gaps = 14/99 (14%)
Query: 193 FYEDLDGERFPKEAKFYLDKFKWGRTFLTLNEELLDLYKECDTSTDVVKVQNEYLKKIEE 252
FY LD + +E K+Y W + L E L + + DV + N E
Sbjct: 316 FYSSLDADSDGEEGKYYT----W--SVEELKEVLGEDAELACKYFDVSEEGN------FE 363
Query: 253 ERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
RN+ +VP P E EE EE+ E +K + R
Sbjct: 364 GRNVLHVP--GPLEEAIEEAEEKLERAREKLLAAREKRK 400
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 28.1 bits (63), Expect = 7.4
Identities = 6/57 (10%), Positives = 21/57 (36%), Gaps = 2/57 (3%)
Query: 231 KECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
K + K + + + + P + + E++++ E D + +++
Sbjct: 1326 KSEKKTARKKKSKTRVKQASASQSSRLLRR--PRKKKSDSSSEDDDDSEVDDSEDED 1380
>gnl|CDD|220112 pfam09110, HAND, HAND. The HAND domain adopts a secondary
structure consisting of four alpha helices, three of
which (H2, H3, H4) form an L-like configuration. Helix
H2 runs antiparallel to helices H3 and H4, packing
closely against helix H4, whilst helix H1 reposes in the
concave surface formed by these three helices and runs
perpendicular to them. The domain confers DNA and
nucleosome binding properties to the protein.
Length = 109
Score = 26.5 bits (59), Expect = 7.5
Identities = 8/30 (26%), Positives = 16/30 (53%)
Query: 257 PNVPDFPPSESEEEEEEEEEEEEEDKAIED 286
+ D + E+EEE E + + E + I++
Sbjct: 61 VPLDDVGDGDEEDEEEREAKRKLEQEKIDN 90
>gnl|CDD|218562 pfam05342, Peptidase_M26_N, M26 IgA1-specific Metallo-endopeptidase
N-terminal region. These peptidases, which cleave
mammalian IgA, are found in Gram-positive bacteria.
Often found associated with pfam00746, they may be
attached to the cell wall.
Length = 252
Score = 27.3 bits (61), Expect = 8.7
Identities = 17/61 (27%), Positives = 23/61 (37%), Gaps = 20/61 (32%)
Query: 225 ELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAI 284
E LD YKE ST +V + E SE E EE+ + + K +
Sbjct: 1 ENLDYYKEYTLSTTMVYDRGE--------------------GSETETLEEKPVQLDLKKV 40
Query: 285 E 285
E
Sbjct: 41 E 41
>gnl|CDD|107059 PHA01746, PHA01746, hypothetical protein.
Length = 131
Score = 26.8 bits (59), Expect = 8.7
Identities = 14/45 (31%), Positives = 22/45 (48%), Gaps = 1/45 (2%)
Query: 244 NEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDEL 288
NEY+K N + E +EE+EE+ E+ K +E+E
Sbjct: 88 NEYVKFTPRNNNAIQEEEDD-MEQQEEKEEKGREKGRKKNVEEEF 131
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 26.8 bits (59), Expect = 9.7
Identities = 10/24 (41%), Positives = 19/24 (79%)
Query: 265 SESEEEEEEEEEEEEEDKAIEDEL 288
S++EE EE+EEE E+ + ++++E
Sbjct: 26 SQAEEVEEDEEEMEDWEDSLDEED 49
Score = 26.8 bits (59), Expect = 10.0
Identities = 14/33 (42%), Positives = 16/33 (48%)
Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKA 283
E+E M + D E EE EE EEE KA
Sbjct: 33 EDEEEMEDWEDSLDEEDEEAEEVEEETAASSKA 65
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.139 0.429
Gapped
Lambda K H
0.267 0.0685 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,608,325
Number of extensions: 1559347
Number of successful extensions: 7162
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5727
Number of HSP's successfully gapped: 430
Length of query: 291
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 195
Effective length of database: 6,679,618
Effective search space: 1302525510
Effective search space used: 1302525510
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.6 bits)