RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6462
(234 letters)
>gnl|CDD|114751 pfam06047, SynMuv_product, Ras-induced vulval development
antagonist. This family is from synthetic multi-vulval
genes which encode chromatin-associated proteins
involved in transcriptional repression. This protein has
a role in antagonising Ras-induced vulval development.
Length = 105
Score = 175 bits (445), Expect = 1e-56
Identities = 79/103 (76%), Positives = 90/103 (87%)
Query: 126 QKDYGKALLPGEGAAMAAYVAEGKRIPRRGEIGLTSDEIERYEAVGYVMSGSRHRRMEAV 185
YG ALLPGEGAAMA YV +GKRIPRRGEIGLTS+EIE +E VGYVMSGSRH+RM AV
Sbjct: 3 PLSYGGALLPGEGAAMAQYVQKGKRIPRRGEIGLTSEEIENFEKVGYVMSGSRHKRMNAV 62
Query: 186 RIRKENQIYSADEKRALAMFSKEERQKRENRILTQFKEMVSSK 228
RIRKENQ+YSA+EKRALAMF+ EER KREN+IL+QFKE++ K
Sbjct: 63 RIRKENQVYSAEEKRALAMFNYEERAKRENKILSQFKELIDRK 105
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 37.8 bits (88), Expect = 0.001
Identities = 23/53 (43%), Positives = 29/53 (54%)
Query: 35 RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSED 87
RK S KK KK +H + K +KKR++ SSSSSS SSSSS +
Sbjct: 68 RKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSSSSSSSE 120
Score = 30.5 bits (69), Expect = 0.29
Identities = 23/62 (37%), Positives = 28/62 (45%), Gaps = 14/62 (22%)
Query: 45 SSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSS 104
S KK+K + +S K KK++ SSSSSS SSSSS SS S
Sbjct: 75 SQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSSS--------------SSSSE 120
Query: 105 DS 106
S
Sbjct: 121 SS 122
Score = 29.3 bits (66), Expect = 0.69
Identities = 20/50 (40%), Positives = 28/50 (56%), Gaps = 8/50 (16%)
Query: 47 KKSKKKKHKKDKKK--------KSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
++ + HKK +KK K S K++++ SSSSSS SSSSS E
Sbjct: 71 QQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSSSSSSSE 120
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 38.1 bits (89), Expect = 0.003
Identities = 17/75 (22%), Positives = 28/75 (37%), Gaps = 8/75 (10%)
Query: 36 KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAV 95
K S ++++K K K K R+ +K S SSS+ SE +
Sbjct: 1322 KKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVD------- 1374
Query: 96 KSSDSSHSSDSDNSD 110
S+ D ++ D
Sbjct: 1375 -DSEDEDDEDDEDDD 1388
Score = 37.3 bits (87), Expect = 0.006
Identities = 26/85 (30%), Positives = 36/85 (42%), Gaps = 2/85 (2%)
Query: 35 RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSA 94
R S T G S + K K K+ KKK K S + KK +S S + + DK
Sbjct: 1157 RLKSKTKGKASKLRKPKLKKKE--KKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPD 1214
Query: 95 VKSSDSSHSSDSDNSDRDTVGPAPK 119
K S+SS S D+ ++ T
Sbjct: 1215 NKKSNSSGSDQEDDEEQKTKPKKSS 1239
Score = 36.2 bits (84), Expect = 0.012
Identities = 18/67 (26%), Positives = 25/67 (37%)
Query: 46 SKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSD 105
KK KK + K +KKKSK K+ SS D + D S D
Sbjct: 1321 LKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDED 1380
Query: 106 SDNSDRD 112
++ + D
Sbjct: 1381 DEDDEDD 1387
Score = 30.4 bits (69), Expect = 1.1
Identities = 16/78 (20%), Positives = 26/78 (33%)
Query: 33 SGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDK 92
+++ S S K K KK + KK KK +KK S + +
Sbjct: 1292 PDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSR 1351
Query: 93 SAVKSSDSSHSSDSDNSD 110
+ S S++ D
Sbjct: 1352 LLRRPRKKKSDSSSEDDD 1369
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 36.5 bits (85), Expect = 0.009
Identities = 10/32 (31%), Positives = 17/32 (53%)
Query: 44 ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSS 75
E +K K+K K + + +K KR+K +S
Sbjct: 401 EKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTS 432
Score = 36.1 bits (84), Expect = 0.012
Identities = 13/42 (30%), Positives = 23/42 (54%)
Query: 36 KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSS 77
K S ++ K SKK K+ +KK+ +K K + KK++ +
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTK 420
Score = 36.1 bits (84), Expect = 0.013
Identities = 15/52 (28%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 32 PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKK--KKYSSSSSSDS 81
PS +K S +K+++KK+ +K+K K K+ + K K+ S +S+
Sbjct: 383 PSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 34.1 bits (79), Expect = 0.056
Identities = 12/34 (35%), Positives = 18/34 (52%)
Query: 39 PTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
P + S KK+ K K K+ KK K+++K K
Sbjct: 378 PKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPK 411
Score = 32.2 bits (74), Expect = 0.20
Identities = 13/57 (22%), Positives = 25/57 (43%)
Query: 32 PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
P + S + SKK K+ +K +K+K K K++ + + S + +E
Sbjct: 378 PKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 35.4 bits (82), Expect = 0.014
Identities = 20/67 (29%), Positives = 32/67 (47%), Gaps = 2/67 (2%)
Query: 21 LMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
L+ N+ S PT +SS S+K+K K KK +KKRK+K+ +
Sbjct: 198 LLKLNDNDKTAASSASVVPTP--QSSSPSRKRKAPKKVAKKVAAAKKRKQKRSPPPPFNM 255
Query: 81 SSSSSED 87
S +++D
Sbjct: 256 VSDNTDD 262
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 35.5 bits (82), Expect = 0.022
Identities = 21/90 (23%), Positives = 29/90 (32%), Gaps = 1/90 (1%)
Query: 32 PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSS-SSSDSSSSSEDEWV 90
+ P S S ++ S R SSS SS SS+SS E
Sbjct: 277 GPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESS 336
Query: 91 DKSAVKSSDSSHSSDSDNSDRDTVGPAPKQ 120
+AV S S S + P+ +
Sbjct: 337 RGAAVSPGPSPSRSPSPSRPPPPADPSSPR 366
Score = 27.4 bits (61), Expect = 8.4
Identities = 12/48 (25%), Positives = 17/48 (35%), Gaps = 1/48 (2%)
Query: 74 SSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVGPAPKQH 121
S+S+S + + D + SSDSS S S P
Sbjct: 213 ISASASSPAPAPGRSAADDAGASSSDSSSSESSG-CGWGPENECPLPR 259
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 33.9 bits (78), Expect = 0.041
Identities = 17/49 (34%), Positives = 22/49 (44%)
Query: 35 RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSS 83
A E ++ KK+K KK + KK KK KK KK+K S
Sbjct: 139 TTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKK 187
Score = 29.7 bits (67), Expect = 1.00
Identities = 16/41 (39%), Positives = 20/41 (48%)
Query: 32 PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
K + K+ K KK K + K SKK KK+KKKK
Sbjct: 153 EKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 27.4 bits (61), Expect = 6.2
Identities = 15/48 (31%), Positives = 23/48 (47%)
Query: 35 RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSS 82
K + +E +K KKK+ KK+KK+K K +K + K S
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191
Score = 27.0 bits (60), Expect = 7.2
Identities = 16/60 (26%), Positives = 25/60 (41%)
Query: 38 SPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKS 97
S TS E++ K +K+ ++++KK KK KK KK+ K K
Sbjct: 132 SETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 33.3 bits (76), Expect = 0.049
Identities = 17/26 (65%), Positives = 22/26 (84%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKK 72
KK KKKKH+ DK++K KK +K+KKKK
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKK 167
Score = 31.0 bits (70), Expect = 0.30
Identities = 15/34 (44%), Positives = 20/34 (58%)
Query: 39 PTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
T D K +KKHKK K + K+ KK+KK+K
Sbjct: 129 ETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEK 162
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 33.7 bits (78), Expect = 0.063
Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 4/80 (5%)
Query: 45 SSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSS 104
SKK+ KK K +K + K+ +++K+KK + + E+E K K + +
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Query: 105 DSDNSDRDTVGPAPKQHVTL 124
+ + K+ TL
Sbjct: 464 EEK----EEEEEKKKKQATL 479
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 33.1 bits (76), Expect = 0.099
Identities = 21/67 (31%), Positives = 29/67 (43%), Gaps = 5/67 (7%)
Query: 9 EKKEKELDYQEFLMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKD---KKKKSKKS 65
++ K L EF LD L R+ E+++ +KK K KKKK K+S
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETE--EDVEAAESRAEKKRKSKEEIKKKKPKES 266
Query: 66 KKRKKKK 72
K K K
Sbjct: 267 KGVKALK 273
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 32.6 bits (75), Expect = 0.14
Identities = 15/35 (42%), Positives = 20/35 (57%)
Query: 37 ASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKK 71
P KK +K+K KK +KKK KK KK+ +K
Sbjct: 380 PKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 32.2 bits (74), Expect = 0.23
Identities = 14/26 (53%), Positives = 19/26 (73%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKK 72
K+ +KK K+ KKKK KK K++KKK
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKRKKK 411
Score = 30.3 bits (69), Expect = 0.77
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKK 72
K K+++ K K+KK KK KK+ KK+
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKR 408
Score = 29.9 bits (68), Expect = 1.1
Identities = 14/26 (53%), Positives = 20/26 (76%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKK 72
KK ++KK +K KKKK +K K +K+KK
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKK 410
Score = 29.6 bits (67), Expect = 1.6
Identities = 14/26 (53%), Positives = 19/26 (73%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKK 72
++ KK + +K KKK+ KK KKRKKK
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKG 412
Score = 29.6 bits (67), Expect = 1.7
Identities = 14/25 (56%), Positives = 17/25 (68%)
Query: 48 KSKKKKHKKDKKKKSKKSKKRKKKK 72
K KKK ++ K +K KK KKRKKK
Sbjct: 381 KPPKKKREEKKPQKRKKKKKRKKKG 405
Score = 29.2 bits (66), Expect = 1.7
Identities = 12/26 (46%), Positives = 19/26 (73%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKK 72
KK +++K + +KKK K+ KK KK+K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRK 409
Score = 28.8 bits (65), Expect = 2.9
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKK 72
++ K +K KK KK+K K K++KK +
Sbjct: 388 EEKKPQKRKKKKKRKKKGKKRKKKGR 413
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 32.1 bits (73), Expect = 0.15
Identities = 25/76 (32%), Positives = 37/76 (48%), Gaps = 10/76 (13%)
Query: 45 SSKKSKKKKHKKDKKKKSKKSKKRKKKKY----------SSSSSSDSSSSSEDEWVDKSA 94
S K SKKKK KKK +KK+KK+KK+K S S ++ + D + K
Sbjct: 47 SKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQ 106
Query: 95 VKSSDSSHSSDSDNSD 110
+++ +H D SD
Sbjct: 107 NTAAEVNHEDVIDLSD 122
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 32.9 bits (75), Expect = 0.16
Identities = 23/85 (27%), Positives = 34/85 (40%), Gaps = 3/85 (3%)
Query: 29 SVLPSGRKASPTSGDESS--KKSKKKKHK-KDKKKKSKKSKKRKKKKYSSSSSSDSSSSS 85
S G A DE KK KK + K +DK K K KK S S D+++
Sbjct: 427 STSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKSQEDNNNIP 486
Query: 86 EDEWVDKSAVKSSDSSHSSDSDNSD 110
+EWV K ++ +++
Sbjct: 487 PEEWVMKKILEWVPDLEEDGTEDPG 511
Score = 27.9 bits (62), Expect = 6.4
Identities = 21/82 (25%), Positives = 28/82 (34%), Gaps = 2/82 (2%)
Query: 36 KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWV--DKS 93
+AS SS+KS S K K+KK K +S+ + S +DE K
Sbjct: 390 QASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKG 449
Query: 94 AVKSSDSSHSSDSDNSDRDTVG 115
S SD G
Sbjct: 450 KKNQKKGRDKSSKVPSDSKAGG 471
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 32.7 bits (75), Expect = 0.16
Identities = 7/44 (15%), Positives = 22/44 (50%)
Query: 44 ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSED 87
+K+++++K K++ K + + K + S+++ D +
Sbjct: 17 LKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQ 60
Score = 28.0 bits (63), Expect = 5.1
Identities = 13/55 (23%), Positives = 30/55 (54%), Gaps = 3/55 (5%)
Query: 46 SKKSKKKKHKKDKKKKSKKSKKRKKKKY---SSSSSSDSSSSSEDEWVDKSAVKS 97
SK + KK+ K + ++ K +K+ K ++ S S++++DE +D + ++
Sbjct: 9 SKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRA 63
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 31.3 bits (72), Expect = 0.19
Identities = 11/27 (40%), Positives = 14/27 (51%)
Query: 46 SKKSKKKKHKKDKKKKSKKSKKRKKKK 72
KK K K +KK + K+K KKK
Sbjct: 64 DKKELKAWEKAEKKAEKAKAKAEKKKA 90
Score = 30.6 bits (70), Expect = 0.35
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 43 DESSKKSKKKKHKKDKKKKSKKSKKRKK 70
D+ K+ +K KK +K K+K KK+ K
Sbjct: 64 DKKELKAWEKAEKKAEKAKAKAEKKKAK 91
Score = 28.6 bits (65), Expect = 1.6
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 48 KSKKKKHKKDKKKKSKKSKKRKKKK 72
K++KK K K + KK+KK + K
Sbjct: 73 KAEKKAEKAKAKAEKKKAKKEEPKP 97
>gnl|CDD|221569 pfam12420, DUF3671, Protein of unknown function. This domain
family is found in eukaryotes, and is typically between
96 and 116 amino acids in length.
Length = 106
Score = 30.7 bits (70), Expect = 0.23
Identities = 11/25 (44%), Positives = 11/25 (44%)
Query: 49 SKKKKHKKDKKKKSKKSKKRKKKKY 73
K KKD KK KK KKY
Sbjct: 21 DYIDKLKKDPNIDKKKFKKIIFKKY 45
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 31.8 bits (72), Expect = 0.34
Identities = 25/68 (36%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 34 GRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKS 93
++ S T+ ++SS K K+K HK D++ +SKK++ ++ SSS S SSSS +K
Sbjct: 733 KKQTSKTASEKSSSKGKRK-HKNDEEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNKE 791
Query: 94 AVKSSDSS 101
+ KSS +
Sbjct: 792 SRKSSRNK 799
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 30.4 bits (69), Expect = 0.41
Identities = 17/29 (58%), Positives = 19/29 (65%)
Query: 44 ESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
+ K KK+K KK KKKK K SKK KKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 28.1 bits (63), Expect = 2.1
Identities = 16/29 (55%), Positives = 19/29 (65%)
Query: 44 ESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
E +K+ K+KK KK KKKK K KKKK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 28.1 bits (63), Expect = 2.2
Identities = 13/31 (41%), Positives = 18/31 (58%)
Query: 48 KSKKKKHKKDKKKKSKKSKKRKKKKYSSSSS 78
K K+ K KK+K KK KK+KKK ++
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132
Score = 28.1 bits (63), Expect = 2.2
Identities = 14/31 (45%), Positives = 16/31 (51%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSS 77
K K K+ K+ KKKK KK KK KK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 27.7 bits (62), Expect = 2.8
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
K+ K+K K+ K+KK KK KK+KK ++
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 31.2 bits (71), Expect = 0.47
Identities = 17/69 (24%), Positives = 26/69 (37%), Gaps = 7/69 (10%)
Query: 36 KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSS-------EDE 88
K + SKK +KK KK K+KK KS+ K + + S + ED
Sbjct: 56 KDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDV 115
Query: 89 WVDKSAVKS 97
+ +
Sbjct: 116 DNAFNKIAE 124
Score = 29.3 bits (66), Expect = 1.7
Identities = 17/50 (34%), Positives = 26/50 (52%)
Query: 28 FSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSS 77
+L + + S+KK KK+ + KKKS+K KK+KK+K S
Sbjct: 36 KEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKS 85
Score = 28.9 bits (65), Expect = 2.1
Identities = 16/60 (26%), Positives = 26/60 (43%)
Query: 40 TSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSD 99
T D+ K+ + K K +KKKK KK KK K + + + S+ K ++
Sbjct: 54 TKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNE 113
>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein.
The proteins in this family are poorly characterized,
but an investigation has indicated that the immediate
early protein is required the down-regulation of MHC
class I expression in dendritic cells. Human herpesvirus
6 immediate early protein is also referred to as U90.
Length = 993
Score = 31.2 bits (70), Expect = 0.47
Identities = 27/93 (29%), Positives = 35/93 (37%), Gaps = 14/93 (15%)
Query: 30 VLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD----SSSSS 85
V PS + S K KK +K K+ K KK K S + SS S
Sbjct: 638 VTPSKTFYKRDKKKDISTNRKVKKRTAKRKTVGYKTDKSKKIKSDSLPTDTNVIVISSES 697
Query: 86 EDEWVDKSAVKSS----------DSSHSSDSDN 108
EDE + +K S S SS+SD+
Sbjct: 698 EDEEDGFNIIKKSQLKKKIKSELKSESSSESDD 730
>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease. The potyviridae
family positive stand RNA viruses with genome encoding a
polyprotein. members include zucchini yellow mosaic
virus, and turnip mosaic viruses which cause
considerable losses of crops worldwide. This family
consists of a C terminus region from various plant
potyvirus P1 proteins (found at the N terminus of the
polyprotein). The C terminus of P1 is a serine-type
protease responsible for autocatalytic cleavage between
P1 and the helper component protease pfam00851. The
entire P1 protein may be involved in virus-host
interactions.
Length = 245
Score = 30.8 bits (70), Expect = 0.51
Identities = 15/67 (22%), Positives = 26/67 (38%), Gaps = 6/67 (8%)
Query: 6 RRGEKKEKELDYQEFLMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKS 65
R K+E+ + Q + S + + + S ++ K +KK
Sbjct: 41 REKLKREEREERQFLQGAYASIVSKITPIGTDKVSKTESVSFRTPYYKR------TTKKM 94
Query: 66 KKRKKKK 72
KK+KKKK
Sbjct: 95 KKKKKKK 101
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 31.0 bits (71), Expect = 0.55
Identities = 9/30 (30%), Positives = 18/30 (60%)
Query: 43 DESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
D+ K+ K KK +K+++K +K + K +
Sbjct: 62 DKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 30.1 bits (68), Expect = 0.56
Identities = 9/53 (16%), Positives = 23/53 (43%)
Query: 27 LFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSS 79
+F L S + +ES+ +++++ + K + +K K+ K +
Sbjct: 87 IFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKP 139
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 30.4 bits (69), Expect = 0.63
Identities = 17/69 (24%), Positives = 28/69 (40%), Gaps = 4/69 (5%)
Query: 43 DESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSH 102
+E +KK KKKK KK KK + S + + S + E + S
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGE----AKKLKKKRSI 233
Query: 103 SSDSDNSDR 111
+ D++ S+
Sbjct: 234 APDNEKSEV 242
>gnl|CDD|216818 pfam01970, TctA, Tripartite tricarboxylate transporter TctA family.
This family, formerly known as DUF112, is a family of
bacterial and archaeal tripartite tricarboxylate
transporters of the extracytoplasmic solute binding
receptor-dependent transporter group of families,
distinct from the ABC and TRAP-T families. TctA is part
of the tripartite TctABC system which, as characterized
in S. typhimurium, is a secondary carrier that depends
for activity on the extracytoplasmic
tricarboxylate-binding receptor TctC as well as two
integral membrane proteins, TctA and TctB. complete
three-component systems are found only in bacteria. TctA
is a large transmembrane protein with up to 12 predicted
membrane spanning regions in bacteria and up to 11 such
in archaea, with the N-terminal within the cytoplasm.
TctA is thought to be a permease, and in most other
bacteria functions without TctB and TctC molecules.
Length = 418
Score = 30.5 bits (70), Expect = 0.66
Identities = 11/27 (40%), Positives = 16/27 (59%), Gaps = 2/27 (7%)
Query: 132 ALLPGEGAAMAAYVA--EGKRIPRRGE 156
+LPG GA +AA++A KR + E
Sbjct: 244 GILPGAGATIAAFLAYGLAKRTSKNPE 270
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 30.8 bits (70), Expect = 0.66
Identities = 20/61 (32%), Positives = 26/61 (42%), Gaps = 8/61 (13%)
Query: 10 KKEKELDYQEFLMLDNE----LFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKS 65
++EKE Q L D E L SVL + K +E +S KK K K+ KS
Sbjct: 1525 QEEKEPAGQGELESDKEKKGNLESVLSNQEKNI----EEDYAESDIKKRKNKKQYKSNTE 1580
Query: 66 K 66
Sbjct: 1581 A 1581
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 30.6 bits (69), Expect = 0.69
Identities = 13/24 (54%), Positives = 19/24 (79%)
Query: 45 SSKKSKKKKHKKDKKKKSKKSKKR 68
S KK KKKK K+ KKKK++++ K+
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQ 706
>gnl|CDD|218662 pfam05623, DUF789, Protein of unknown function (DUF789). This
family consists of several plant proteins of unknown
function.
Length = 289
Score = 30.4 bits (69), Expect = 0.75
Identities = 14/42 (33%), Positives = 19/42 (45%), Gaps = 1/42 (2%)
Query: 74 SSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVG 115
S +SS+ SS + W +S SD S SSD +G
Sbjct: 102 SRDTSSEGSSDCSESWDSRSD-GDSDGSFSSDESELSSQRLG 142
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 30.4 bits (68), Expect = 0.82
Identities = 19/87 (21%), Positives = 38/87 (43%), Gaps = 2/87 (2%)
Query: 37 ASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVK 96
SP + ++ ++ ++ +KKSK ++ + +S S + E E D +V
Sbjct: 74 TSPLKSTKRQREKPASDTEEPERVTAKKSKTQELSRPNSPSEGEGEGEGEGESSDSRSVN 133
Query: 97 SSDSSHSSDSDNSDRDTVG--PAPKQH 121
SS D D +R + P+P+ +
Sbjct: 134 EEGSSDPKDIDQDNRSSSPSIPSPQDN 160
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 30.2 bits (68), Expect = 0.85
Identities = 19/75 (25%), Positives = 31/75 (41%), Gaps = 10/75 (13%)
Query: 45 SSKKSKKK--KHKKDKKKKSKKSKKRKKKKYSSSSSSDS--------SSSSEDEWVDKSA 94
++KK K K+ D++K+ +SKK K +S + + + + E S
Sbjct: 7 ATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYSD 66
Query: 95 VKSSDSSHSSDSDNS 109
V SS S D S
Sbjct: 67 VDSSSSEEEEDGSES 81
Score = 28.7 bits (64), Expect = 3.3
Identities = 15/54 (27%), Positives = 23/54 (42%)
Query: 35 RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
RK S K++ + + + + K K+K S DSSSS E+E
Sbjct: 23 RKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYSDVDSSSSEEEE 76
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 30.1 bits (68), Expect = 1.1
Identities = 24/97 (24%), Positives = 39/97 (40%), Gaps = 7/97 (7%)
Query: 37 ASPTSGDESSKKSKKKKHKKDKKKKSKKSK----KRKKKKYSSSSSSDSSSSSEDEWVDK 92
S D ++ SK+ K K S +K ++ + + S+S + D
Sbjct: 154 DSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASD--DT 211
Query: 93 SAVKSSDSSHSSDSDNSDRDTVGPAPKQHVTLTQKDY 129
+ KSS + S SD S D++ + TQKDY
Sbjct: 212 ANQKSSSKDNQSMSD-SALDSILDQYSEDAKKTQKDY 247
Score = 28.6 bits (64), Expect = 3.0
Identities = 15/70 (21%), Positives = 29/70 (41%)
Query: 37 ASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVK 96
SPT+ + +K K KKS + K + +D++++S + DK
Sbjct: 21 TSPTAYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFST 80
Query: 97 SSDSSHSSDS 106
S+ S++
Sbjct: 81 IDSSTSDSNN 90
Score = 28.2 bits (63), Expect = 4.1
Identities = 15/74 (20%), Positives = 26/74 (35%), Gaps = 7/74 (9%)
Query: 62 SKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSD-------NSDRDTV 114
K + K SS + + SS+ + D SS+++ S S+ + +
Sbjct: 143 EKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQS 202
Query: 115 GPAPKQHVTLTQKD 128
P T QK
Sbjct: 203 NSQPASDDTANQKS 216
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 30.0 bits (67), Expect = 1.2
Identities = 18/60 (30%), Positives = 29/60 (48%)
Query: 50 KKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNS 109
+ K K+ + K KK+K++ SSS SS SS + + +S S S+S +S
Sbjct: 97 RSKTPIKNNDNVTTPIKANKKEKHNLDSSSSSSISSSLTNISFFSSPTSIYSCLSNSLSS 156
Score = 29.2 bits (65), Expect = 2.5
Identities = 19/67 (28%), Positives = 29/67 (43%), Gaps = 1/67 (1%)
Query: 46 SKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDS-SHSS 104
+++SK D K+ K++K SSSSS SSS + S S+S
Sbjct: 95 NERSKTPIKNNDNVTTPIKANKKEKHNLDSSSSSSISSSLTNISFFSSPTSIYSCLSNSL 154
Query: 105 DSDNSDR 111
S +S +
Sbjct: 155 SSKHSPK 161
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 29.9 bits (67), Expect = 1.2
Identities = 14/52 (26%), Positives = 19/52 (36%)
Query: 38 SPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEW 89
+P E K K + K K K KK KK+ S S+ +W
Sbjct: 140 APPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDW 191
Score = 29.1 bits (65), Expect = 2.1
Identities = 15/50 (30%), Positives = 17/50 (34%)
Query: 39 PTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
P S K KK K K K K +K KK S S+ D
Sbjct: 136 PISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDP 185
Score = 27.9 bits (62), Expect = 4.6
Identities = 14/49 (28%), Positives = 19/49 (38%)
Query: 32 PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
P ++ +S K K K KK KKK S ++ K S D
Sbjct: 142 PEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPD 190
>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
Length = 852
Score = 30.1 bits (68), Expect = 1.2
Identities = 19/45 (42%), Positives = 24/45 (53%), Gaps = 4/45 (8%)
Query: 74 SSSSSSD---SSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVG 115
SSS+ S + S E WVD S S D + S D D+ DRD+ G
Sbjct: 11 SSSNGSGGLVGTDSGESRWVDGSEDDSEDPA-SLDDDDDDRDSGG 54
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 29.8 bits (67), Expect = 1.3
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSS 78
KK+K KK K K+ K+ K+K K S ++
Sbjct: 749 KKTKAKKKSASTKGKAAKTVKKKSKAKSKKTT 780
Score = 27.9 bits (62), Expect = 5.5
Identities = 11/45 (24%), Positives = 16/45 (35%)
Query: 36 KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
K + +S K K K K K K K +K+ + S
Sbjct: 749 KKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPA 793
Score = 27.5 bits (61), Expect = 6.7
Identities = 12/33 (36%), Positives = 16/33 (48%)
Query: 51 KKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSS 83
+KK K KK S K K K K S + S ++
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTT 780
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 28.7 bits (64), Expect = 1.3
Identities = 14/31 (45%), Positives = 18/31 (58%)
Query: 42 GDESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
D + ++K K KK K KK+KK KK KK
Sbjct: 90 DDHPPEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 26.0 bits (57), Expect = 9.2
Identities = 12/34 (35%), Positives = 17/34 (50%)
Query: 39 PTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
P+ D+ + +K KK KK K K K+ KK
Sbjct: 86 PSHPDDHPPEPTEKPKKKKKKSKKTKKPKKSSKK 119
>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
Length = 479
Score = 29.7 bits (67), Expect = 1.3
Identities = 17/73 (23%), Positives = 29/73 (39%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDS 106
KK KKKK K K+ + +K + S + E + + +S SS + +
Sbjct: 9 KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNEEGTFNNYGTNESQSSSTQNSN 68
Query: 107 DNSDRDTVGPAPK 119
N + + PK
Sbjct: 69 KNEENIVLEDFPK 81
>gnl|CDD|215953 pfam00500, Late_protein_L1, L1 (late) protein.
Length = 500
Score = 29.6 bits (67), Expect = 1.3
Identities = 17/57 (29%), Positives = 25/57 (43%), Gaps = 3/57 (5%)
Query: 14 ELDYQEFLMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKK 70
E+D +E LD + F P GRK +G K +K + +S KRK+
Sbjct: 447 EVDLKEKFSLDLDQF---PLGRKFLLQAGLRQRPSKKTRKRSATSSSTTSRSAKRKR 500
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 29.8 bits (67), Expect = 1.3
Identities = 18/73 (24%), Positives = 27/73 (36%)
Query: 21 LMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
L +D++ + S + +G + +K K K +K KK K KK SS
Sbjct: 87 LHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSK 146
Query: 81 SSSSSEDEWVDKS 93
D KS
Sbjct: 147 DELIKYDNNKPKS 159
Score = 29.0 bits (65), Expect = 2.3
Identities = 19/80 (23%), Positives = 32/80 (40%)
Query: 9 EKKEKELDYQEFLMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKR 68
+KK+ + DN S S + A P + K K + K K +K + K+
Sbjct: 79 QKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKK 138
Query: 69 KKKKYSSSSSSDSSSSSEDE 88
KKK SS +++ +
Sbjct: 139 KKKVLSSKDELIKYDNNKPK 158
Score = 29.0 bits (65), Expect = 2.8
Identities = 17/50 (34%), Positives = 21/50 (42%)
Query: 36 KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSS 85
K D+ +KK K KD KK K KK+ KKK D+ S
Sbjct: 50 KKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDS 99
>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
Length = 534
Score = 29.6 bits (66), Expect = 1.7
Identities = 19/76 (25%), Positives = 31/76 (40%), Gaps = 11/76 (14%)
Query: 32 PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVD 91
P+ + + D+ + + + D + S S YS++SS D S S++D W D
Sbjct: 444 PAHADRADSDEDDMDE-QESGDERADGEDDSDSSYS-----YSTTSSEDESDSADDSWGD 497
Query: 92 KSAVKSSDSSHSSDSD 107
SDS D
Sbjct: 498 -----ESDSGIEHDDG 508
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 29.5 bits (66), Expect = 1.7
Identities = 16/67 (23%), Positives = 32/67 (47%)
Query: 44 ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHS 103
+++ ++K KK++K K K+ KK K + + + + +S+ V K + K S
Sbjct: 12 ILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDV 71
Query: 104 SDSDNSD 110
D + D
Sbjct: 72 EDENPED 78
Score = 29.1 bits (65), Expect = 2.6
Identities = 25/76 (32%), Positives = 35/76 (46%), Gaps = 10/76 (13%)
Query: 43 DESSKKSKKKKHKKDKKKKSKKSK---KRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSS- 98
E + KKKK +K K+K+ KK K K K K + +SD ++ + KS KS
Sbjct: 14 TEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPK-----KSEKKSRK 68
Query: 99 -DSSHSSDSDNSDRDT 113
D + D D DT
Sbjct: 69 RDVEDENPEDFIDPDT 84
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 29.2 bits (65), Expect = 1.8
Identities = 16/54 (29%), Positives = 29/54 (53%), Gaps = 3/54 (5%)
Query: 60 KKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSS---HSSDSDNSD 110
K+SKK+ +R ++K +++SS +E + + VK D S+S+ SD
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSD 161
Score = 28.9 bits (64), Expect = 2.9
Identities = 21/111 (18%), Positives = 39/111 (35%), Gaps = 3/111 (2%)
Query: 9 EKKEKELDYQEFLMLD---NELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKS 65
E E + E ++ + + S R + S + +K + K K++K KK
Sbjct: 87 EISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKM 146
Query: 66 KKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVGP 116
+ + + S S SD S ++ + + D D S P
Sbjct: 147 DEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDDGEDISHTYGWPP 197
>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
(ETRAMP). These sequences represent a family of
proteins from the malaria parasite Plasmodium
falciparum, several of which have been shown to be
expressed specifically in the ring stage as well as the
rodent parasite Plasmodium yoelii. A homologue from
Plasmodium chabaudi was localized to the
parasitophorous vacuole membrane. Members have an
initial hydrophobic, Phe/Tyr-rich, stretch long enough
to span the membrane, a highly charged region rich in
Lys, a second putative transmembrane region and a
second highly charged, low complexity sequence region.
Some members have up to 100 residues of additional
C-terminal sequence. These genes have been shown to be
found in the sub-telomeric regions of both Plasmodium
falciparum and P. yoelii chromosomes.
Length = 82
Score = 27.6 bits (62), Expect = 1.8
Identities = 15/42 (35%), Positives = 19/42 (45%)
Query: 38 SPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSS 79
+P + K KK K K +K KK KKK SS +S
Sbjct: 20 APGLNNNVVAKKKKLKKLKKIDDDLEKKKKNKKKIIISSIAS 61
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 29.4 bits (66), Expect = 1.8
Identities = 19/82 (23%), Positives = 32/82 (39%), Gaps = 4/82 (4%)
Query: 43 DESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSH 102
DE + K K KK K+ + + KK + SS +SS + KS++S+
Sbjct: 39 DELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPA----KPRKSAESTR 94
Query: 103 SSDSDNSDRDTVGPAPKQHVTL 124
SS + T ++
Sbjct: 95 SSHPKSKAPSTESEEEEEPEET 116
>gnl|CDD|185600 PTZ00410, PTZ00410, NAD-dependent SIR2; Provisional.
Length = 349
Score = 29.1 bits (65), Expect = 1.9
Identities = 26/90 (28%), Positives = 37/90 (41%), Gaps = 11/90 (12%)
Query: 65 SKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVGPAPKQHVTL 124
+K+ + SS SSSDSS+SS + + + +D RD P Q
Sbjct: 263 AKEGRSSSSSSRSSSDSSTSSSSDGYGQFG------DYEADPGGVCRDIFFPGDCQESVR 316
Query: 125 TQKDYGKALLPGEGAAMAAYVAEGKRIPRR 154
+AL G G A+A V EG+ P
Sbjct: 317 R---LAEAL--GLGEALAKRVREGEAAPET 341
>gnl|CDD|227651 COG5347, COG5347, GTPase-activating protein that regulates ARFs
(ADP-ribosylation factors), involved in ARF-mediated
vesicular transport [Intracellular trafficking and
secretion].
Length = 319
Score = 29.0 bits (65), Expect = 1.9
Identities = 19/106 (17%), Positives = 34/106 (32%), Gaps = 4/106 (3%)
Query: 25 NELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKK----SKKSKKRKKKKYSSSSSSD 80
N + + P S +KK KK + KK S SS+ + D
Sbjct: 85 NRFYEKNLLDQLLLPIKAKYDSSVAKKYIRKKYELKKFIDDSSSPSDFSSFSASSTRTVD 144
Query: 81 SSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVGPAPKQHVTLTQ 126
S D + +S + + D + ++ + +LT
Sbjct: 145 SVDDRLDSESQSRSSSASLGNSNRPDDELNVESFQSTGSKPRSLTS 190
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 29.2 bits (65), Expect = 2.0
Identities = 28/109 (25%), Positives = 36/109 (33%), Gaps = 2/109 (1%)
Query: 43 DESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSH 102
+E + + KK KK K K KK K S S DS S D S V +
Sbjct: 324 EEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQKE 383
Query: 103 SSDSDNSDRDTVGPAP--KQHVTLTQKDYGKALLPGEGAAMAAYVAEGK 149
+ D + P + KD GK E + A V K
Sbjct: 384 PKKEEPVDSNPSSPGNSGPARPSPESKDKGKRKAANEVSKSPASVPAKK 432
>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
topoisomerase IA [General function prediction only].
Length = 298
Score = 28.9 bits (65), Expect = 2.0
Identities = 13/39 (33%), Positives = 19/39 (48%)
Query: 33 SGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKK 71
G + ++ K+KK KK KK+ K+KK KK
Sbjct: 258 KGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296
>gnl|CDD|222911 PHA02616, PHA02616, VP2/VP3; Provisional.
Length = 259
Score = 28.9 bits (64), Expect = 2.1
Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 2/40 (5%)
Query: 39 PTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSS 78
P S + + SKK K K S+ SKKR+ + S++
Sbjct: 216 PVSPTKKAALSKKSKWTG--TKSSQSSKKRRGRSTGRSTT 253
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 28.5 bits (64), Expect = 2.1
Identities = 19/67 (28%), Positives = 27/67 (40%)
Query: 43 DESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSH 102
E +K KK+ +K K K KK K+KK K SE + ++ K D +
Sbjct: 68 AEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTK 127
Query: 103 SSDSDNS 109
S S
Sbjct: 128 SYSETLS 134
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 29.3 bits (65), Expect = 2.1
Identities = 22/90 (24%), Positives = 37/90 (41%), Gaps = 3/90 (3%)
Query: 40 TSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSD 99
+ +E+ K K K ++ + S K + K S S+S S+ + SA S
Sbjct: 124 ANPNEAGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDE---SATDSGK 180
Query: 100 SSHSSDSDNSDRDTVGPAPKQHVTLTQKDY 129
+S S + PAPK +T ++ Y
Sbjct: 181 ASASVAGIVGADEEAPPAPKNTLTPLEELY 210
>gnl|CDD|217363 pfam03094, Mlo, Mlo family. A family of plant integral membrane
proteins, first discovered in barley. Mutants lacking
wild-type Mlo proteins show broad spectrum resistance to
the powdery mildew fungus, and dysregulated cell death
control, with spontaneous cell death in response to
developmental or abiotic stimuli. Thus wild-type Mlo
proteins are thought to be inhibitors of cell death
whose deficiency lowers the threshold required to
trigger the cascade of events that result in plant cell
death. Mlo proteins are localised in the plasma membrane
and possess seven transmembrane regions; thus the Mlo
family is the only major higher plant family to possess
7 transmembrane domains. It has been suggested that Mlo
proteins function as G-protein coupled receptors in
plants; however the molecular and biological functions
of Mlo proteins remain to be fully determined.
Length = 481
Score = 29.3 bits (66), Expect = 2.2
Identities = 16/52 (30%), Positives = 21/52 (40%)
Query: 34 GRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSS 85
G DE +K+ KK HK KKKK K + SS ++ S
Sbjct: 414 GSSMKKAVFDEQVQKALKKWHKTAKKKKKHKRSVKSGSTTPGSSRDETPSRG 465
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 29.3 bits (65), Expect = 2.2
Identities = 15/51 (29%), Positives = 25/51 (49%)
Query: 32 PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSS 82
P K + + S K KKKK +K+K++K KK K + + S ++
Sbjct: 263 PDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQP 313
Score = 28.9 bits (64), Expect = 2.9
Identities = 23/79 (29%), Positives = 39/79 (49%), Gaps = 4/79 (5%)
Query: 38 SPTSGD----ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKS 93
SP GD E K KKK KK+K+K+ K KK++ + + S + S + V ++
Sbjct: 186 SPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEA 245
Query: 94 AVKSSDSSHSSDSDNSDRD 112
S ++ S + +S+ D
Sbjct: 246 DEASLANTVSGTAPDSEPD 264
Score = 27.3 bits (60), Expect = 7.4
Identities = 16/75 (21%), Positives = 38/75 (50%), Gaps = 5/75 (6%)
Query: 43 DESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE-----WVDKSAVKS 97
D+ K ++++H++ +K ++ KKR+K+K + S+++ VD +
Sbjct: 79 DQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEM 138
Query: 98 SDSSHSSDSDNSDRD 112
+++ SD D+ D +
Sbjct: 139 PENALPSDEDDKDPN 153
Score = 27.0 bits (59), Expect = 9.8
Identities = 16/57 (28%), Positives = 25/57 (43%)
Query: 32 PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
P +A T KK K++K K++KKKK K R + + + E+E
Sbjct: 266 PKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGAVEEE 322
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 28.9 bits (65), Expect = 2.2
Identities = 19/78 (24%), Positives = 31/78 (39%)
Query: 35 RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSA 94
+K++ + S KK + K + S +KK + S+S S SS+E
Sbjct: 244 KKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKAAPESTSQSPSSAEPTSESPQT 303
Query: 95 VKSSDSSHSSDSDNSDRD 112
+S S D +SD
Sbjct: 304 AGNSSLSSLGDYSDSDES 321
>gnl|CDD|220662 pfam10265, DUF2217, Uncharacterized conserved protein (DUF2217).
This is a family of conserved proteins of from 500 - 600
residues found from worms to humans. Its function is not
known.
Length = 515
Score = 29.0 bits (65), Expect = 2.4
Identities = 18/74 (24%), Positives = 30/74 (40%)
Query: 45 SSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSS 104
++ + K+ + ++ +++ RK K SS SS D S+ SS S SS
Sbjct: 35 FLRRRRGKRKMEGEQLGTRRPLSRKIGKCSSRRVRSPSSKPNDTLSGASSKLSSKHSGSS 94
Query: 105 DSDNSDRDTVGPAP 118
S S D +
Sbjct: 95 HSLASVSDRNSSSS 108
>gnl|CDD|153366 cd07682, F-BAR_srGAP2, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Protein 2. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs. srGAP2 is expressed in zones
of neuronal differentiation. It plays a role in the
regeneration of neurons and axons. srGAP2 contains an
N-terminal F-BAR domain, a Rho GAP domain, and a
C-terminal SH3 domain. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
Length = 263
Score = 28.5 bits (63), Expect = 2.5
Identities = 26/89 (29%), Positives = 51/89 (57%), Gaps = 4/89 (4%)
Query: 9 EKKEKELDYQEFLM-LDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKK 67
+ KE L QE LM + NEL++V+ + + D S +SK K+ +K ++K+ +S +
Sbjct: 116 KSKEVGLQLQEDLMKVLNELYTVM---KTYHMYNADSISAQSKLKEAEKQEEKQMSRSVR 172
Query: 68 RKKKKYSSSSSSDSSSSSEDEWVDKSAVK 96
++ ++ S S ++ E++ V +S+VK
Sbjct: 173 QEDRQTPRSPDSTTNIRIEEKHVRRSSVK 201
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 28.1 bits (63), Expect = 2.7
Identities = 17/62 (27%), Positives = 27/62 (43%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDS 106
+K+ KK+ K+ KKK+ KK KK+ KK S SS E ++ +
Sbjct: 81 EKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIM 140
Query: 107 DN 108
+
Sbjct: 141 EK 142
>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
function prediction only].
Length = 386
Score = 28.7 bits (64), Expect = 2.8
Identities = 11/25 (44%), Positives = 14/25 (56%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKK 71
K +K K D+K S K+KK KK
Sbjct: 353 KAEAARKRKGDRKGVSHKAKKGGKK 377
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 29.0 bits (66), Expect = 2.8
Identities = 9/42 (21%), Positives = 15/42 (35%)
Query: 43 DESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSS 84
E + K KK+ RK K+ +S D ++
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKAT 859
Score = 27.8 bits (63), Expect = 6.6
Identities = 6/30 (20%), Positives = 11/30 (36%)
Query: 43 DESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
E +K + K KK+ + +K
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRK 844
>gnl|CDD|114659 pfam05950, Orthopox_A36R, Orthopoxvirus A36R protein. This family
consists of several Orthopoxvirus A36R proteins. The
A36R protein is predicted to be a type Ib membrane
protein.
Length = 158
Score = 27.8 bits (61), Expect = 3.1
Identities = 18/48 (37%), Positives = 25/48 (52%), Gaps = 4/48 (8%)
Query: 67 KRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTV 114
K KK K S+SS S S+ SE +W D S ++D DN R+ +
Sbjct: 42 KLKKIKSSNSSKSSKSTDSESDWED----HCSAMEQNNDVDNISRNEI 85
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 26.9 bits (60), Expect = 3.1
Identities = 13/38 (34%), Positives = 18/38 (47%)
Query: 50 KKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSED 87
KKK KK KKK + K KK ++ S+ + E
Sbjct: 23 VKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEG 60
Score = 25.8 bits (57), Expect = 8.3
Identities = 10/39 (25%), Positives = 18/39 (46%)
Query: 50 KKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
KK KK KKKK++ + K K +++ + +
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEG 60
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many
hypothetical proteins.
Length = 98
Score = 27.0 bits (60), Expect = 3.3
Identities = 15/61 (24%), Positives = 28/61 (45%)
Query: 11 KEKELDYQEFLMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKK 70
KE++ + + +++FS + +K + + K+ KK KKDKKK +
Sbjct: 1 KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEES 60
Query: 71 K 71
K
Sbjct: 61 K 61
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 28.7 bits (64), Expect = 3.4
Identities = 9/46 (19%), Positives = 23/46 (50%)
Query: 44 ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEW 89
E +++ + KK S+++ K+ K ++ +S+S ++ E
Sbjct: 1180 EKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYG 1225
Score = 28.7 bits (64), Expect = 3.8
Identities = 25/91 (27%), Positives = 37/91 (40%), Gaps = 16/91 (17%)
Query: 36 KASPTSGDESSKKSKKKKHKK-DKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSA 94
K + G KK +K + +KK S + K+ SS + SSSSE + +D SA
Sbjct: 1372 KPAEAIGISPEKKVRKMRASPFNKKSGSVLGRAATNKETESSENVSGSSSSEKDEIDVSA 1431
Query: 95 ---------------VKSSDSSHSSDSDNSD 110
+ S+S + DSD D
Sbjct: 1432 KPRPQRANRKQTTYVLSDSESESADDSDFDD 1462
Score = 28.3 bits (63), Expect = 4.0
Identities = 11/77 (14%), Positives = 29/77 (37%), Gaps = 8/77 (10%)
Query: 44 ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHS 103
E +++ ++ + + +KK ++ KK + ++ +S SE S
Sbjct: 1176 EEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETT--------EETYGSS 1227
Query: 104 SDSDNSDRDTVGPAPKQ 120
+ + + V P +
Sbjct: 1228 AMETENVAEVVKPKGRA 1244
>gnl|CDD|233034 TIGR00584, mug, mismatch-specific thymine-DNA glycosylate (mug).
All proteins in this family for whcih functions are
known are G-T or G-U mismatch glycosylases that
function in base excision repair. This family is based
on the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University). Used 2pf model [DNA
metabolism, DNA replication, recombination, and
repair].
Length = 328
Score = 28.5 bits (63), Expect = 3.6
Identities = 14/41 (34%), Positives = 19/41 (46%)
Query: 31 LPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKK 71
P RK P S + KK K KS KSK++++K
Sbjct: 56 APKFRKRKPRSNEPYRPVEPKKPSDSKKSGKSTKSKEKQEK 96
>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1. Mating in
fungi is controlled by the loci that determine the
mating type of an individual, and only individuals with
differing mating types can mate. Basidiomycete fungi
have evolved a unique mating system, termed tetrapolar
or bifactorial incompatibility, in which mating type is
determined by two unlinked loci; compatibility at both
loci is required for mating to occur. The multi-allelic
tetrapolar mating system is considered to be a novel
innovation that could have only evolved once, and is
thus unique to the mushroom fungi. This domain is
C-terminal to the homeodomain transcription factor
region.
Length = 418
Score = 28.2 bits (63), Expect = 3.8
Identities = 18/81 (22%), Positives = 32/81 (39%), Gaps = 8/81 (9%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKKY-----SSSSSSDSSSSSEDEWVDKSAVKSSDSS 101
++ K +K ++ + + + KR Y S + SS+ S +D S V +S +
Sbjct: 45 EQLKDEKRRRRQTPRQSRRSKRAAHAYPSPERSPALSSERLLSPSPSVLDLSPVLASPQT 104
Query: 102 ---HSSDSDNSDRDTVGPAPK 119
S S + D D K
Sbjct: 105 GKRRRSSSPSDDEDEAERPSK 125
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 28.3 bits (63), Expect = 3.8
Identities = 21/86 (24%), Positives = 30/86 (34%), Gaps = 2/86 (2%)
Query: 32 PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVD 91
++ +S S SK K K + S + + S S S S +E +
Sbjct: 310 SKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNESK--Q 367
Query: 92 KSAVKSSDSSHSSDSDNSDRDTVGPA 117
K A KSS + S D S G
Sbjct: 368 KRASKSSSGARDSKKDASGMSANGTV 393
Score = 28.0 bits (62), Expect = 5.3
Identities = 45/206 (21%), Positives = 83/206 (40%), Gaps = 26/206 (12%)
Query: 37 ASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD-----SSSSSEDEWVD 91
D + + ++ +K + S S K+ S+SSS+ S SS + +
Sbjct: 278 HHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAKHSN 337
Query: 92 KSAVKSSDSSHSSDSDNSDRDTVGPAPKQHVTLTQKDYGKALLPGEGAAMAAYVAE---- 147
++ SS SHSS + G P V+ + +A GA + A
Sbjct: 338 RNKSNSSPKSHSSAN--------GSVPSSSVSDNESKQKRASKSSSGARDSKKDASGMSA 389
Query: 148 -GKRIPRRGEIGLTS-DEIERYEAVGYVMSG--SRHRRMEAVRIRKENQIYSADEKRALA 203
G E +++ IER E + + R+ E+ +R + + ++ E+ +
Sbjct: 390 NGTVENCIPENKISTPSAIERLEQDIKKLQAELQQARQNES-ELRNQISLLTSLER---S 445
Query: 204 MFSKEERQKRENRIL-TQFKEMVSSK 228
+ S + K+EN +L T+ MVS+K
Sbjct: 446 LKSDLGQLKKENDMLQTKLNSMVSAK 471
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 28.6 bits (64), Expect = 3.9
Identities = 14/44 (31%), Positives = 23/44 (52%)
Query: 29 SVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
S +KA P + + ++K+KK K K + K+K+ KKK
Sbjct: 767 SAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKK 810
Score = 27.8 bits (62), Expect = 6.0
Identities = 9/42 (21%), Positives = 20/42 (47%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
+K K K K +K+KK + +K +++ + ++ E
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region
covers both the Brf homology II and III regions. This
region is involved in binding TATA binding protein.
Length = 95
Score = 26.8 bits (60), Expect = 4.1
Identities = 12/40 (30%), Positives = 23/40 (57%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSE 86
++ K+ K K D+ S K KK++K K + +S+++E
Sbjct: 32 QEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAE 71
>gnl|CDD|227371 COG5038, COG5038, Ca2+-dependent lipid-binding protein, contains C2
domain [General function prediction only].
Length = 1227
Score = 28.2 bits (63), Expect = 4.3
Identities = 18/111 (16%), Positives = 35/111 (31%), Gaps = 13/111 (11%)
Query: 32 PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSS--DSSSSSEDE- 88
+ ++ +S + S + K + + + KK ++++ + +
Sbjct: 2 STKQQHYRSSDNYSGNRPIPTIPKFFRSRGQRAEKKEEEQEMQPEDEKLFAPIAQRTVQI 61
Query: 89 ----------WVDKSAVKSSDSSHSSDSDNSDRDTVGPAPKQHVTLTQKDY 129
D S S SS + SD DT P L + DY
Sbjct: 62 ADVNFQGAKGIDDLSFTVPKQSIESSSPEKSDVDTSNTRPSVSRELHKDDY 112
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 28.0 bits (63), Expect = 4.4
Identities = 13/77 (16%), Positives = 26/77 (33%)
Query: 36 KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAV 95
+ + KK K K K K K + + S + ++ +D+ +
Sbjct: 65 DTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDI 124
Query: 96 KSSDSSHSSDSDNSDRD 112
+ + D D+ D D
Sbjct: 125 DVLNQADDDDDDDDDDD 141
Score = 28.0 bits (63), Expect = 4.4
Identities = 15/86 (17%), Positives = 25/86 (29%)
Query: 29 SVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
+ D++++ KK K K + KKK SS +
Sbjct: 53 DQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNAL 112
Query: 89 WVDKSAVKSSDSSHSSDSDNSDRDTV 114
D D + +D+ D D
Sbjct: 113 DKDDDLNYVKDIDVLNQADDDDDDDD 138
Score = 28.0 bits (63), Expect = 5.0
Identities = 16/75 (21%), Positives = 28/75 (37%)
Query: 36 KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAV 95
P +++ K+ K KK K + +K +K ++ D + +D V A
Sbjct: 72 SDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQAD 131
Query: 96 KSSDSSHSSDSDNSD 110
D D D+ D
Sbjct: 132 DDDDDDDDDDLDDDD 146
>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain. This is a
short domain found in bacterial type II/III secretory
system proteins. The architecture of these proteins
suggest that this family may be functionally analogous
to pfam03958.
Length = 95
Score = 26.5 bits (59), Expect = 4.4
Identities = 16/41 (39%), Positives = 20/41 (48%)
Query: 74 SSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTV 114
+S S S S S+S + S SS SS S DS + R T
Sbjct: 22 TSGSVSSSGSNSSSSSSNSSNGGSSSSSSSGDSSSGTRITT 62
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 27.5 bits (61), Expect = 4.4
Identities = 15/34 (44%), Positives = 18/34 (52%), Gaps = 1/34 (2%)
Query: 35 RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKR 68
RK + T+ E KK K KK+K KK K K R
Sbjct: 82 RKPATTA-VEKKKKGKSKKNKLKGKKDEDKKKAR 114
>gnl|CDD|217667 pfam03666, NPR3, Nitrogen Permease regulator of amino acid
transport activity 3. This family, also known in yeasts
as Rmd11, complexes with NPR2, pfam06218. This complex
heterodimer is responsible for inactivating TORC1. an
evolutionarily conserved protein complex that controls
cell size via nutritional input signals, specifically,
in response to amino acid starvation.
Length = 424
Score = 28.1 bits (63), Expect = 4.5
Identities = 19/64 (29%), Positives = 28/64 (43%), Gaps = 2/64 (3%)
Query: 63 KKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVGPAPKQHV 122
+K KK+K+KK + +D S D S KSS S S NSD + +
Sbjct: 45 RKKKKKKQKKSDRADPNDDREPSVDSEDSSS--KSSSKSESGSLANSDPASDPSSEDSKS 102
Query: 123 TLTQ 126
++T
Sbjct: 103 SMTM 106
Score = 28.1 bits (63), Expect = 4.6
Identities = 27/89 (30%), Positives = 38/89 (42%), Gaps = 7/89 (7%)
Query: 21 LMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
L +D+ F P + + +K KKKK K K ++ + R+ S SSS
Sbjct: 24 LKIDDLRFVGHPIHVDE-----NGTWRKKKKKKQK--KSDRADPNDDREPSVDSEDSSSK 76
Query: 81 SSSSSEDEWVDKSAVKSSDSSHSSDSDNS 109
SSS SE + S S SS S S +
Sbjct: 77 SSSKSESGSLANSDPASDPSSEDSKSSMT 105
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 27.5 bits (61), Expect = 5.2
Identities = 21/93 (22%), Positives = 38/93 (40%), Gaps = 6/93 (6%)
Query: 35 RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSA 94
R+A G ++ KK + +++ K KK KK KKKK S S V++S
Sbjct: 64 RQAERKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKK------SPKSLLLNFVTVEESK 117
Query: 95 VKSSDSSHSSDSDNSDRDTVGPAPKQHVTLTQK 127
S++ + + + +++ T K
Sbjct: 118 PDDEVHEQKSETLSLPIELEEQSMGRYLPTTFK 150
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 28.0 bits (63), Expect = 5.2
Identities = 12/60 (20%), Positives = 27/60 (45%), Gaps = 1/60 (1%)
Query: 34 GRKASPTSGDESSKKSKK-KKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDK 92
G + +G+ S + KK +K ++ +KK++K + K + ++ + D K
Sbjct: 396 GEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455
>gnl|CDD|221825 pfam12877, DUF3827, Domain of unknown function (DUF3827). This
family contains the human KIAA1549 protein which has
been found to be fused fused to BRAF gene in many cases
of pilocytic astrocytomas. The fusion is due mainly to a
tandem duplication of 2 Mb at 7q34. Although nothing is
known about the function of KIAA1549 protein, the BRAF
protein is a well characterized oncoprotein. It is a
serine/threonine protein kinase which is implicated in
MAP/ERK signalling, a critical pathway for the
regulation of cell division, differentiation and
secretion.
Length = 684
Score = 28.0 bits (62), Expect = 5.5
Identities = 20/98 (20%), Positives = 35/98 (35%), Gaps = 4/98 (4%)
Query: 21 LMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
M+ E + P +++ P E K K + S K ++R +K S S
Sbjct: 338 EMIIQEPAPLPPLKKESLPIEDAEVPTP----KSKSSQDGSSNKKRRRGRKSPSDGDSEG 393
Query: 81 SSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVGPAP 118
SS S +KS S+ S ++ + +
Sbjct: 394 SSVISNRSSREKSGRPSTTPSVTAQQKPTKEEGRKKPA 431
>gnl|CDD|215456 PLN02850, PLN02850, aspartate-tRNA ligase.
Length = 530
Score = 27.7 bits (62), Expect = 5.9
Identities = 12/54 (22%), Positives = 21/54 (38%)
Query: 38 SPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVD 91
S E S + KK K K++K ++ K +++S D + D
Sbjct: 1 SSQEAVEESGEKISKKAAKKAAAKAEKLRREATAKAAAASLEDEDDPLASNYGD 54
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 27.4 bits (61), Expect = 6.0
Identities = 20/80 (25%), Positives = 30/80 (37%), Gaps = 10/80 (12%)
Query: 34 GRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKS 93
+KA+ S SS K K K KK KK +K +D +D S
Sbjct: 181 AKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRKW----------DLDGDEDDDAVLDYS 230
Query: 94 AVKSSDSSHSSDSDNSDRDT 113
A ++D + + D + D
Sbjct: 231 APDANDENADAPEDVEEVDQ 250
>gnl|CDD|220232 pfam09421, FRQ, Frequency clock protein. The frequency clock
protein, is the central component of the frq-based
circadian negative feedback loop, regulates various
aspects of the circadian clock in Neurospora crassa.
This protein has been shown to interact with itself via
a coiled-coil.
Length = 989
Score = 27.7 bits (61), Expect = 6.3
Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 1/62 (1%)
Query: 41 SGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDS 100
S D S KS++ D + S K+ +RKK+K ++ S + SSDS
Sbjct: 532 SSDSSGDKSQRSPETDDTEDGSDKNGRRKKRKTQTADSEIGRNGLSKF-GGPQVSSSSDS 590
Query: 101 SH 102
H
Sbjct: 591 FH 592
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 27.3 bits (61), Expect = 6.9
Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query: 29 SVLPSGRKASPTSGDESSKK-SKKKKHKKDKKKKSKKSKKRK 69
+ P+ SP + E S K+K + DKK+K KKSKK K
Sbjct: 333 TPPPASLSHSPPAAFERPLALSPKRKREGDKKQKKKKSKKLK 374
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 27.3 bits (61), Expect = 7.2
Identities = 8/37 (21%), Positives = 16/37 (43%)
Query: 52 KKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
+KKK+ KK ++++ K+ D + E
Sbjct: 177 HSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGE 213
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 27.1 bits (60), Expect = 7.2
Identities = 20/60 (33%), Positives = 25/60 (41%), Gaps = 17/60 (28%)
Query: 5 WRRGEKKEKELDYQEFLMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKK 64
++ EK+E EL Q+F L EL S KK K KK K D + K K
Sbjct: 44 YKVYEKREFELKKQQFEQLKAEL-----------------SKKKKKFKKEKVDVRVKVVK 86
>gnl|CDD|215562 PLN03078, PLN03078, Putative tRNA pseudouridine synthase;
Provisional.
Length = 513
Score = 27.6 bits (61), Expect = 7.5
Identities = 16/63 (25%), Positives = 25/63 (39%)
Query: 51 KKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSD 110
K + K K K + ++ K S SS S + E ++ S SS S S + +
Sbjct: 246 KYRKKLPGKHKQRNGAVSRRAKSSKEMSSSESEENHGEISEEDEEDLSFSSIPSGSSDEN 305
Query: 111 RDT 113
D
Sbjct: 306 EDI 308
>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
Members of the NSCC2 family have been sequenced from
various yeast, fungal and animals species including
Saccharomyces cerevisiae, Drosophila melanogaster and
Homo sapiens. These proteins are the Sec62 proteins,
believed to be associated with the Sec61 and Sec63
constituents of the general protein secretary systems of
yeast microsomes. They are also the non-selective cation
(NS) channels of the mammalian cytoplasmic membrane. The
yeast Sec62 protein has been shown to be essential for
cell growth. The mammalian NS channel proteins has been
implicated in platelet derived growth factor(PGDF)
dependent single channel current in fibroblasts. These
channels are essentially closed in serum deprived
tissue-culture cells and are specifically opened by
exposure to PDGF. These channels are reported to exhibit
equal selectivity for Na+, K+ and Cs+ with low
permeability to Ca2+, and no permeability to anions
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 232
Score = 27.2 bits (60), Expect = 7.5
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 51 KKKHKKDKKKKSKKSKKRKKKKYSS 75
K K+ KK KSKK KK++ K+ ++
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKREAN 232
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 27.5 bits (61), Expect = 7.6
Identities = 16/118 (13%), Positives = 41/118 (34%), Gaps = 5/118 (4%)
Query: 6 RRGEKKEKELDYQEFLMLDNELFSVLPSGRKASPTSGDESSKKSKKK-----KHKKDKKK 60
+ K+EK+ ++ + + K P ++ +K ++ + KK ++
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERV 169
Query: 61 KSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVGPAP 118
++K K+ KK + + + + AVK + + + G
Sbjct: 170 RAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDR 227
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 27.4 bits (61), Expect = 7.7
Identities = 15/44 (34%), Positives = 18/44 (40%)
Query: 44 ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSED 87
+ S+ K KK KKKK KK K+RK S D
Sbjct: 265 DYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSD 308
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 27.3 bits (61), Expect = 8.0
Identities = 14/41 (34%), Positives = 17/41 (41%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSED 87
KK KK + K K KK KK S +S SS +
Sbjct: 350 KKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKK 390
Score = 27.3 bits (61), Expect = 8.2
Identities = 11/40 (27%), Positives = 16/40 (40%)
Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSE 86
+KK K + K K KK KK+ S + + S
Sbjct: 347 TATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSS 386
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 26.9 bits (60), Expect = 8.0
Identities = 14/38 (36%), Positives = 19/38 (50%)
Query: 44 ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDS 81
+ K + ++ KK KKKKS K KK KK + S
Sbjct: 171 DKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKAISS 208
>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
Length = 417
Score = 27.1 bits (60), Expect = 8.1
Identities = 14/69 (20%), Positives = 24/69 (34%), Gaps = 4/69 (5%)
Query: 40 TSGDESSKK----SKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAV 95
T D++ K S K KK K ++ K + + + S S V V
Sbjct: 86 TFEDKTQVKVKVVSAPTKKKKAMPKSVVRAPKPLENPVPAQAESSGSKPVPSIPVSTPEV 145
Query: 96 KSSDSSHSS 104
K+ + +
Sbjct: 146 KAPAPALTP 154
>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 944
Score = 27.4 bits (61), Expect = 8.7
Identities = 16/68 (23%), Positives = 27/68 (39%)
Query: 42 GDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSS 101
S K+ KK D+K K+ S+ +SS D+S +S +D ++ S
Sbjct: 627 DALSPKEGDGKKSSADRKPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQS 686
Query: 102 HSSDSDNS 109
+ S
Sbjct: 687 VPEAALAS 694
>gnl|CDD|215299 PLN02543, PLN02543, pfkB-type carbohydrate kinase family protein.
Length = 496
Score = 27.2 bits (60), Expect = 8.9
Identities = 10/47 (21%), Positives = 23/47 (48%)
Query: 34 GRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
GRK PTS +K ++++ K D++ + +++ ++ D
Sbjct: 70 GRKKKPTSSPPKAKTTRRRTKKTDQELDPEGAEEDQEAAEDGEDYDD 116
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 26.7 bits (59), Expect = 9.0
Identities = 13/22 (59%), Positives = 13/22 (59%)
Query: 48 KSKKKKHKKDKKKKSKKSKKRK 69
K K KKDKK K KK KKR
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217
>gnl|CDD|119000 pfam10480, ICAP-1_inte_bdg, Beta-1 integrin binding protein.
ICAP-1 is a serine/threonine-rich protein that binds to
the cytoplasmic domains of beta-1 integrins in a highly
specific manner, binding to a NPXY sequence motif on the
beta-1 integrin. The cytoplasmic domains of integrins
are essential for cell adhesion, and the fact that
phosphorylation of ICAP-1 by interaction with the
cell-matrix implies an important role of ICAP-1 during
integrin-dependent cell adhesion. Overexpression of
ICAP-1 strongly reduces the integrin-mediated cell
spreading on extracellular matrix and inhibits both
Cdc42 and Rac1. In addition, ICAP-1 induces release of
Cdc42 from cellular membranes and prevents the
dissociation of GDP from this GTPase. An additional
function of ICAP-1 is to promote differentiation of
osteoprogenitors by supporting their condensation
through modulating the integrin high affinity state,.
Length = 200
Score = 27.0 bits (59), Expect = 9.0
Identities = 19/57 (33%), Positives = 30/57 (52%), Gaps = 10/57 (17%)
Query: 67 KRKKKKYSSSSSSDSSSSSEDEWVDKS----------AVKSSDSSHSSDSDNSDRDT 113
++ KK++SSSSS S S++ + VD S A +DS+ SS N++ D
Sbjct: 3 RKGKKRHSSSSSQSSEISTKSKSVDSSLGGLSRSSTVASLDTDSTKSSGQSNANSDA 59
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 27.2 bits (60), Expect = 9.5
Identities = 15/60 (25%), Positives = 26/60 (43%)
Query: 37 ASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVK 96
A + ++K KK K K K SKK+ K S+ + +++ + K+A K
Sbjct: 199 AKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKK 258
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.304 0.120 0.316
Gapped
Lambda K H
0.267 0.0710 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,202,972
Number of extensions: 1020309
Number of successful extensions: 5055
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3444
Number of HSP's successfully gapped: 640
Length of query: 234
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 140
Effective length of database: 6,768,326
Effective search space: 947565640
Effective search space used: 947565640
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 57 (25.8 bits)