RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14604
(209 letters)
>gnl|CDD|215770 pfam00176, SNF2_N, SNF2 family N-terminal domain. This domain is
found in proteins involved in a variety of processes
including transcription regulation (e.g., SNF2, STH1,
brahma, MOT1), DNA repair (e.g. ERCC6, RAD16, RAD5), DNA
recombination (e.g. RAD54), and chromatin unwinding
(e.g. ISWI) as well as a variety of other proteins with
little functional information (e.g. lodestar, ETL1).
Length = 301
Score = 115 bits (290), Expect = 2e-31
Identities = 52/101 (51%), Positives = 59/101 (58%), Gaps = 1/101 (0%)
Query: 108 YQLEGINWLRYSWGQNIDTILADEMGLGKTIQTITFLYSLFKEGHC-KGPFLVSAPLSTI 166
YQLEG+NWL + ILADEMGLGKT+QTI L + KEG +GP LV PLST+
Sbjct: 1 YQLEGVNWLISLESNGLGGILADEMGLGKTLQTIALLATYLKEGKDRRGPTLVVCPLSTL 60
Query: 167 INWEREFETWAPDFYVVTYVGDKDCRIVLRDHDISWEDTAN 207
NW EFE WAP VV Y GD R LR DT +
Sbjct: 61 HNWLNEFEKWAPALRVVVYHGDGRERSKLRQSMAKRLDTYD 101
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 96.0 bits (239), Expect = 5e-23
Identities = 56/159 (35%), Positives = 74/159 (46%), Gaps = 25/159 (15%)
Query: 46 KKKKSKGRGKTKTKE-------LQEDEDGASGSKPRRYTPPPEKPTTDLRKKYEKQPPFL 98
KK K +GR +K E L+E+EDG GS R P +R
Sbjct: 122 KKAKGRGRHASKLTEEEEDEEYLKEEEDGLGGSGGTRLLVQPSCIKGKMRD--------- 172
Query: 99 DDTGMQLHAYQLEGINWLRYSWGQNIDTILADEMGLGKTIQTITFLYSLFKEGHCKGPFL 158
YQL G+NWL + I+ ILADEMGLGKT+QTI+ L L + GP +
Sbjct: 173 ---------YQLAGLNWLIRLYENGINGILADEMGLGKTLQTISLLGYLHEYRGITGPHM 223
Query: 159 VSAPLSTIINWEREFETWAPDFYVVTYVGDKDCRIVLRD 197
V AP ST+ NW E + P V + G+ + R R+
Sbjct: 224 VVAPKSTLGNWMNEIRRFCPVLRAVKFHGNPEERAHQRE 262
>gnl|CDD|223627 COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family
[Transcription / DNA replication, recombination, and
repair].
Length = 866
Score = 90.9 bits (225), Expect = 3e-21
Identities = 54/162 (33%), Positives = 74/162 (45%), Gaps = 8/162 (4%)
Query: 34 DEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSKPRRYTPPPEKPTTDLR-KKYE 92
E LK + R ++ L++ R ++ +LR +
Sbjct: 267 PEQLKEEDLFARLRLLDPLRLADLSQILEKFVRETLKLSARDLKDELKELLAELRLSEDL 326
Query: 93 KQPPFLDDTGMQLHAYQLEGINWLR---YSWGQNIDTILADEMGLGKTIQTITFLYSLFK 149
P D +L YQLEG+NWL S ILAD+MGLGKT+QTI L SL +
Sbjct: 327 LNAPEPVDLSAELRPYQLEGVNWLSELLRSNLLGG--ILADDMGLGKTVQTIALLLSLLE 384
Query: 150 EGHC-KGPFLVSAPLSTIINWEREFETWAPDF-YVVTYVGDK 189
GP L+ P S + NW+REFE +APD V+ Y G+K
Sbjct: 385 SIKVYLGPALIVVPASLLSNWKREFEKFAPDLRLVLVYHGEK 426
>gnl|CDD|214692 smart00487, DEXDc, DEAD-like helicases superfamily.
Length = 201
Score = 67.5 bits (165), Expect = 5e-14
Identities = 29/97 (29%), Positives = 32/97 (32%), Gaps = 7/97 (7%)
Query: 104 QLHAYQLEGINWLRYSWGQNIDTILADEMGLGKTIQTITFLYSLFKEGHCKGPFLVSAPL 163
L YQ E I L D ILA G GKT+ + K G G LV P
Sbjct: 8 PLRPYQKEAIEALLSGLR---DVILAAPTGSGKTLAALLPALEALKRGK-GGRVLVLVPT 63
Query: 164 -STIINWEREFETWAPDFY--VVTYVGDKDCRIVLRD 197
W E + P VV G R LR
Sbjct: 64 RELAEQWAEELKKLGPSLGLKVVGLYGGDSKREQLRK 100
>gnl|CDD|238005 cd00046, DEXDc, DEAD-like helicases superfamily. A diverse family
of proteins involved in ATP-dependent RNA or DNA
unwinding. This domain contains the ATP-binding region.
Length = 144
Score = 55.0 bits (133), Expect = 6e-10
Identities = 17/70 (24%), Positives = 24/70 (34%), Gaps = 3/70 (4%)
Query: 125 DTILADEMGLGKTIQTITFLYSLFKEGHCKGPFLVSAPLSTIINWEREFETWAPDFY--V 182
D +LA G GKT+ + + L G LV AP + N E V
Sbjct: 2 DVLLAAPTGSGKTLAALLPILELLDSLK-GGQVLVLAPTRELANQVAERLKELFGEGIKV 60
Query: 183 VTYVGDKDCR 192
+G +
Sbjct: 61 GYLIGGTSIK 70
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 34.0 bits (78), Expect = 0.051
Identities = 23/93 (24%), Positives = 36/93 (38%), Gaps = 6/93 (6%)
Query: 14 RGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSKP 73
+G KKKK ++K + D + GKK + KGR K+ + A G K
Sbjct: 418 KGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKS---SKVPSDSKAGGKKE 474
Query: 74 RRYTPPPEK---PTTDLRKKYEKQPPFLDDTGM 103
+ P + KK + P L++ G
Sbjct: 475 SVKSQEDNNNIPPEEWVMKKILEWVPDLEEDGT 507
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 32.1 bits (73), Expect = 0.15
Identities = 14/74 (18%), Positives = 24/74 (32%)
Query: 8 VRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDG 67
A +R K + R+ K + ED + + KK++ K R +
Sbjct: 146 ECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSS 205
Query: 68 ASGSKPRRYTPPPE 81
S PP+
Sbjct: 206 GGQSGLSTKDEPPK 219
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 31.6 bits (72), Expect = 0.19
Identities = 17/67 (25%), Positives = 27/67 (40%)
Query: 11 AAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASG 70
+ K KK + K D D +D K +K+++ + + TK E S
Sbjct: 79 EEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSE 138
Query: 71 SKPRRYT 77
KPR+Y
Sbjct: 139 LKPRKYA 145
Score = 30.8 bits (70), Expect = 0.34
Identities = 14/77 (18%), Positives = 26/77 (33%), Gaps = 12/77 (15%)
Query: 16 KKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSKPRR 75
K+K K + SK D D K KK + K+ +++ + +
Sbjct: 81 KQKWKWKKKKSKKKKDKD------------KDKKDDKKDDKSEKKDEKEAEDKLEDLTKS 128
Query: 76 YTPPPEKPTTDLRKKYE 92
Y+ + +KY
Sbjct: 129 YSETLSTLSELKPRKYA 145
Score = 26.6 bits (59), Expect = 8.4
Identities = 17/79 (21%), Positives = 28/79 (35%), Gaps = 1/79 (1%)
Query: 24 RASKAADDDDDEDLKAAYFN-DGKKKKSKGRGKTKTKELQEDEDGASGSKPRRYTPPPEK 82
+ K ++ E +K Y K K K K K K+ + +D ++ E
Sbjct: 61 KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120
Query: 83 PTTDLRKKYEKQPPFLDDT 101
DL K Y + L +
Sbjct: 121 KLEDLTKSYSETLSTLSEL 139
>gnl|CDD|235319 PRK04914, PRK04914, ATP-dependent helicase HepA; Validated.
Length = 956
Score = 31.7 bits (73), Expect = 0.26
Identities = 10/11 (90%), Positives = 11/11 (100%)
Query: 128 LADEMGLGKTI 138
LADE+GLGKTI
Sbjct: 174 LADEVGLGKTI 184
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 31.0 bits (70), Expect = 0.42
Identities = 17/94 (18%), Positives = 36/94 (38%)
Query: 2 KSKRRPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKEL 61
K++ A AR K+ + A + + + ++ AA ++ + K +
Sbjct: 334 KTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDA 393
Query: 62 QEDEDGASGSKPRRYTPPPEKPTTDLRKKYEKQP 95
+ED S ++ + +PP + TT E
Sbjct: 394 SPNEDTPSENEESKGSPPQVEATTTAEPNREPSQ 427
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 30.5 bits (69), Expect = 0.59
Identities = 17/78 (21%), Positives = 28/78 (35%), Gaps = 4/78 (5%)
Query: 16 KKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSKPRR 75
K ST + K +++ + KKKK + + E + SK +
Sbjct: 48 KVATTSTKKDKKEDKNNESKKKSE----KKKKKKKEKKEPKSEGETKLGFKTPKKSKKTK 103
Query: 76 YTPPPEKPTTDLRKKYEK 93
PP KP D+ + K
Sbjct: 104 KKPPKPKPNEDVDNAFNK 121
>gnl|CDD|222918 PHA02687, PHA02687, ORF061 late transcription factor VLTF-4;
Provisional.
Length = 231
Score = 30.0 bits (67), Expect = 0.66
Identities = 19/70 (27%), Positives = 28/70 (40%), Gaps = 10/70 (14%)
Query: 4 KRRPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQE 63
KRR + A+ K +KS K DD + D ++K + +E +
Sbjct: 87 KRRTKK--KAKADKPEKSPKAVEKLCPPDDRD--------DKNEEKEPTEEAQRNEESGD 136
Query: 64 DEDGASGSKP 73
E GASG P
Sbjct: 137 AEGGASGRSP 146
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 30.2 bits (68), Expect = 0.86
Identities = 13/74 (17%), Positives = 22/74 (29%)
Query: 13 ARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSK 72
R KKK+ A + ++D+ DLK K + A+ K
Sbjct: 1242 GRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETVKAVPARRAAARK 1301
Query: 73 PRRYTPPPEKPTTD 86
+ + D
Sbjct: 1302 KPLASVSVISDSDD 1315
Score = 28.3 bits (63), Expect = 3.9
Identities = 22/85 (25%), Positives = 31/85 (36%), Gaps = 7/85 (8%)
Query: 3 SKRRPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQ 62
K P R AAAR KK S S + DDDDD ++ + KKK + K
Sbjct: 1289 VKAVPARRAAAR-KKPLASVSVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKA-- 1345
Query: 63 EDEDGASGSKPRRYTPPPEKPTTDL 87
+ ++ P + L
Sbjct: 1346 ----AKPPAAAKKRGPATVQSGQKL 1366
Score = 27.1 bits (60), Expect = 9.3
Identities = 15/69 (21%), Positives = 22/69 (31%), Gaps = 11/69 (15%)
Query: 2 KSKRRPVRGAAARGKKKKKSTGRASKAADDD----DDEDLKAAY-------FNDGKKKKS 50
K +R RG + KK + + + E + Y N + K
Sbjct: 1181 KLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKP 1240
Query: 51 KGRGKTKTK 59
KGR K K
Sbjct: 1241 KGRAGAKKK 1249
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 29.6 bits (67), Expect = 1.1
Identities = 10/40 (25%), Positives = 17/40 (42%)
Query: 2 KSKRRPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAY 41
K + + A KKK K+ + + +DDD+D
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDDYDGYD 231
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 29.8 bits (67), Expect = 1.2
Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 10/66 (15%)
Query: 46 KKKKSKGRGKTKTKE-------LQEDEDGASGSKPRRYTPPPEKPTTDLRKKYEKQPPFL 98
KKKKSK + K K +E L+ E+G GS + E P L KK+ + P F+
Sbjct: 688 KKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEK---SSETPQLKLEKKFGQSPVFV 744
Query: 99 DDTGMQ 104
T ++
Sbjct: 745 ASTLLE 750
>gnl|CDD|178493 PLN02905, PLN02905, beta-amylase.
Length = 702
Score = 29.2 bits (65), Expect = 1.9
Identities = 19/74 (25%), Positives = 28/74 (37%), Gaps = 7/74 (9%)
Query: 27 KAADDDDDED------LKAAYFNDGKKKKSKGRGK-TKTKELQEDEDGASGSKPRRYTPP 79
K DD D + + DG S G + + +QE G RR P
Sbjct: 25 KEEDDGDRRNREKHIPVTGMVSVDGAAVSSSGNDEFMHQQSIQEQVGTPGGGGSRRSRPL 84
Query: 80 PEKPTTDLRKKYEK 93
EK T LR+++ +
Sbjct: 85 EEKERTKLRERHRR 98
>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator. This
family of trans-activating transcriptional regulator
(TATR), also known as intermediate early protein 1, are
common to the Nucleopolyhedroviruses.
Length = 571
Score = 28.7 bits (64), Expect = 2.3
Identities = 20/90 (22%), Positives = 28/90 (31%), Gaps = 4/90 (4%)
Query: 6 RPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDE 65
P + K+ T K ++ D D D + K K R K K +Q
Sbjct: 107 SPGESLPESLSETKQVTVSKRKGSELDSDSDSSDSSKGKKLVNKPKKRQKYKKATIQ--- 163
Query: 66 DGASGSKPRRY-TPPPEKPTTDLRKKYEKQ 94
S ++ Y T TD KY
Sbjct: 164 SPTSLTEEENYNTEICTVAPTDQIAKYFLM 193
>gnl|CDD|224107 COG1186, PrfB, Protein chain release factor B [Translation,
ribosomal structure and biogenesis].
Length = 239
Score = 28.4 bits (64), Expect = 2.7
Identities = 11/43 (25%), Positives = 20/43 (46%), Gaps = 4/43 (9%)
Query: 55 KTKTKELQEDEDGAS----GSKPRRYTPPPEKPTTDLRKKYEK 93
+ +++E + G++ R Y P +PT DLR E+
Sbjct: 174 EKRSQEKNRERALKKLIGWGNQIRSYVLDPYQPTKDLRTGVER 216
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 28.5 bits (64), Expect = 2.9
Identities = 17/89 (19%), Positives = 26/89 (29%), Gaps = 8/89 (8%)
Query: 16 KKKKKSTGRASKAADDDDDEDLKAAYFND--------GKKKKSKGRGKTKTKELQEDEDG 67
+K K + SK DDDD L + KKKK K + K K + +
Sbjct: 19 QKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLA 78
Query: 68 ASGSKPRRYTPPPEKPTTDLRKKYEKQPP 96
+ + +Q
Sbjct: 79 YDLPVVWSSAAFQDNSHIRKLGNWPEQEW 107
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 28.6 bits (63), Expect = 2.9
Identities = 8/59 (13%), Positives = 19/59 (32%)
Query: 48 KKSKGRGKTKTKELQEDEDGASGSKPRRYTPPPEKPTTDLRKKYEKQPPFLDDTGMQLH 106
K++ G + +E + T P ++ LR ++ P + +
Sbjct: 712 KETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQATVP 770
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 28.4 bits (63), Expect = 3.2
Identities = 23/74 (31%), Positives = 35/74 (47%), Gaps = 7/74 (9%)
Query: 11 AAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGK-KKKSKGRGKTKTKELQEDEDGAS 69
AAA+ K + +AS+ D DED KA K K + R KTK E +++E+
Sbjct: 209 AAAKAKAAALAKQKASQGNGDSGDEDAKAKAIAAAKAKAAAAARAKTKGAEGKKEEE--- 265
Query: 70 GSKPRRYTPPPEKP 83
P++ P +P
Sbjct: 266 ---PKQEEPSVNQP 276
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 28.6 bits (64), Expect = 3.3
Identities = 18/77 (23%), Positives = 32/77 (41%), Gaps = 8/77 (10%)
Query: 1 GKSKR-RPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRG-KTKT 58
GK+ R + +G A + K++ G+ D K A KK K + KT
Sbjct: 735 GKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKT 794
Query: 59 KELQEDEDGASGSKPRR 75
+++ A+ +K +R
Sbjct: 795 QKI------AAATKAKR 805
>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
Length = 935
Score = 28.5 bits (63), Expect = 3.5
Identities = 22/100 (22%), Positives = 38/100 (38%), Gaps = 22/100 (22%)
Query: 15 GKKKKKSTGRASKAADD----DDDEDLKAAYFNDGKKKKS---------------KGRGK 55
G+ + +TG + ++D DDE+L +G K+K G+
Sbjct: 339 GEHDEGATGETREESEDTESDGDDEELPRIVSREGTKRKRPPIFLRRLHRLLLMRAGKLT 398
Query: 56 TKTKELQEDEDGASGSKPRRYTPPPEKPTTDLRKKYEKQP 95
+ +E E + PR PP EKP ++ + E
Sbjct: 399 ERAREALEKASEPTYGTPR---PPVEKPRPEVPQSLETAT 435
>gnl|CDD|215553 PLN03052, PLN03052, acetate--CoA ligase; Provisional.
Length = 728
Score = 28.1 bits (63), Expect = 3.6
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 182 VVTYVGDKDCRIVLRDHDISWED 204
+V K R+ LR+ D+SW+D
Sbjct: 312 IVLPADGKSVRVKLREGDMSWDD 334
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 27.8 bits (62), Expect = 3.8
Identities = 16/83 (19%), Positives = 35/83 (42%), Gaps = 1/83 (1%)
Query: 11 AAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASG 70
AA +++ K + A ++ E+ K A ++ K+ K + + E +E+E+
Sbjct: 43 QAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE-EENEEEDEE 101
Query: 71 SKPRRYTPPPEKPTTDLRKKYEK 93
S EK +++ K+
Sbjct: 102 SSDENEKETEEKTESNVEKEITN 124
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 28.2 bits (63), Expect = 3.8
Identities = 19/67 (28%), Positives = 29/67 (43%)
Query: 4 KRRPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQE 63
R + A + ++ K G + DDDDD+D KA K++K K + K L E
Sbjct: 58 LREKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQKKKEAERKKALLLDE 117
Query: 64 DEDGASG 70
E +
Sbjct: 118 KEKERAA 124
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This
is a family of proteins conserved in fungi. The
function is not known.
Length = 436
Score = 27.8 bits (62), Expect = 4.6
Identities = 16/78 (20%), Positives = 23/78 (29%), Gaps = 4/78 (5%)
Query: 10 GAAARGKKKKKSTGRASKAADDDDD--EDLK--AAYFNDGKKKKSKGRGKTKTKELQEDE 65
KK K + + +DD+D E L KK K + T +
Sbjct: 9 PDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKS 68
Query: 66 DGASGSKPRRYTPPPEKP 83
+ S P KP
Sbjct: 69 SKPTESSAASSEEKPAKP 86
>gnl|CDD|238071 cd00124, MYSc, Myosin motor domain. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Myosins are actin-dependent molecular
motors that play important roles in muscle contraction,
cell motility, and organelle transport. The head domain
is a molecular motor, which utilizes ATP hydrolysis to
generate directed movement toward the plus end along
actin filaments. A cyclical interaction between myosin
and actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 679
Score = 28.0 bits (63), Expect = 4.7
Identities = 20/75 (26%), Positives = 27/75 (36%), Gaps = 15/75 (20%)
Query: 108 YQLEGINWLRYSWGQN---IDTILADEMGL-----------GKTIQT-ITFLYSLFKEGH 152
YQ EGI+W + N ID I GL T +T + L + K +
Sbjct: 408 YQEEGIDWESIDFTDNQEVIDLIEKKPGGLLSLLDEECLFPKGTDETFLEKLNNKLKSNN 467
Query: 153 CKGPFLVSAPLSTII 167
P +AP I
Sbjct: 468 AFYPAKKNAPTEFTI 482
>gnl|CDD|165245 PHA02934, PHA02934, Hypothetical protein; Provisional.
Length = 253
Score = 27.3 bits (60), Expect = 5.0
Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 2/39 (5%)
Query: 118 YSWGQNIDTILADEMGLGKTIQTITFLYSLFKEGHCKGP 156
Y ++TIL D GLG + TI+F+ L + C P
Sbjct: 155 YDINNEVNTILMDNKGLGVRLATISFITELGRR--CMNP 191
>gnl|CDD|181766 PRK09295, PRK09295, bifunctional cysteine
desulfurase/selenocysteine lyase; Validated.
Length = 406
Score = 27.8 bits (62), Expect = 5.1
Identities = 11/26 (42%), Positives = 14/26 (53%), Gaps = 4/26 (15%)
Query: 111 EGINWLRYSWG----QNIDTILADEM 132
EGIN + SWG + D I+ EM
Sbjct: 96 EGINLVANSWGNSNVRAGDNIIISEM 121
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 27.7 bits (61), Expect = 5.3
Identities = 15/57 (26%), Positives = 24/57 (42%), Gaps = 10/57 (17%)
Query: 16 KKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSK 72
KK + T R + AA D +E +K + K R + K K++ ED +
Sbjct: 111 KKTPRRTRRKAAAASSDVEE----------EKTEKKVRKRRKVKKMDEDVEDQGSES 157
>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
Length = 1052
Score = 27.6 bits (61), Expect = 5.8
Identities = 19/57 (33%), Positives = 23/57 (40%), Gaps = 3/57 (5%)
Query: 2 KSKRRPVRGAAA---RGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGK 55
K+ RR V+ AAA KKK A KAA AA KK +K +
Sbjct: 976 KTVRRSVKKAAATRAAMKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAKKAPAKKVAR 1032
>gnl|CDD|218777 pfam05847, Baculo_LEF-3, Nucleopolyhedrovirus late expression
factor 3 (LEF-3). This family consists of LEF-3
Nucleopolyhedrovirus late expression factor 3 (LEF-3)
sequences which are known to be ssDNA-binding proteins.
Alkaline nuclease (AN) and LEF-3 may participate in
homologous recombination of the baculovirus genome in a
manner similar to that of exonuclease (Redalpha) and
DNA-binding protein (Redbeta) of the Red-mediated
homologous recombination system of bacteriophage lambda.
LEF-3 is essential for transporting the putative
baculovirus helicase protein P143 into the nucleus where
they function together during viral DNA replication.
LEF-3 and other proteins have been shown to bind to
closely linked sites on viral chromatin in vivo,
suggesting that they may form part of the baculovirus
replisome.
Length = 348
Score = 27.2 bits (61), Expect = 6.3
Identities = 10/34 (29%), Positives = 18/34 (52%)
Query: 31 DDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQED 64
+D +E +KA ++ D K K K ++L+ D
Sbjct: 269 EDSNEWIKAVFYVDNKNNDKKNNDDDKLQKLEMD 302
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many
hypothetical proteins.
Length = 98
Score = 25.9 bits (57), Expect = 6.5
Identities = 15/67 (22%), Positives = 23/67 (34%), Gaps = 3/67 (4%)
Query: 26 SKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSKPRRYTPPPEKPTT 85
K + D D+ + N K+KK K R + + K PE P
Sbjct: 3 EKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDE---FPEFPEE 59
Query: 86 DLRKKYE 92
R++ E
Sbjct: 60 SKRRRTE 66
>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
Length = 479
Score = 27.0 bits (60), Expect = 7.3
Identities = 12/58 (20%), Positives = 25/58 (43%)
Query: 16 KKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSKP 73
KKKKK +ASK + + K + + ++K + + +E +S ++
Sbjct: 9 KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNEEGTFNNYGTNESQSSSTQN 66
>gnl|CDD|153108 cd01049, RNRR2, Ribonucleotide Reductase, R2/beta subunit,
ferritin-like diiron-binding domain. Ribonucleotide
Reductase, R2/beta subunit (RNRR2) is a member of a
broad superfamily of ferritin-like diiron-carboxylate
proteins. The RNR protein catalyzes the conversion of
ribonucleotides to deoxyribonucleotides and is found in
all eukaryotes, many prokaryotes, several viruses, and
few archaea. The catalytically active form of RNR is a
proposed alpha2-beta2 tetramer. The homodimeric alpha
subunit (R1) contains the active site and redox active
cysteines as well as the allosteric binding sites. The
beta subunit (R2) contains a diiron cluster that, in its
reduced state, reacts with dioxygen to form a stable
tyrosyl radical and a diiron(III) cluster. This
essential tyrosyl radical is proposed to generate a
thiyl radical, located on a cysteine residue in the R1
active site that initiates ribonucleotide reduction. The
beta subunit is composed of 10-13 helices, the 8 longest
helices form an alpha-helical bundle; some have 2
addition beta strands. Yeast is unique in that it
assembles both homodimers and heterodimers of RNRR2. The
yeast heterodimer, Y2Y4, contains R2 (Y2) and a R2
homolog (Y4) that lacks the diiron center and is
proposed to only assist in cofactor assembly, and
perhaps stabilize R1 (Y1) in its active conformation.
Length = 288
Score = 26.8 bits (60), Expect = 7.9
Identities = 6/30 (20%), Positives = 12/30 (40%), Gaps = 1/30 (3%)
Query: 103 MQLHAYQLEGINWLRYSWGQNIDTILADEM 132
+ A +LE + R I + ++M
Sbjct: 223 LIKEAVELE-KEFARDLLPDGILGLNKEDM 251
>gnl|CDD|130307 TIGR01240, mevDPdecarb, diphosphomevalonate decarboxylase. This
enzyme catalyzes the last step in the synthesis of
isopentenyl diphosphate (IPP) in the mevalonate pathway.
Alternate names: mevalonate diphosphate decarboxylase;
pyrophosphomevalonate decarboxylase [Central
intermediary metabolism, Other].
Length = 305
Score = 26.8 bits (59), Expect = 8.4
Identities = 19/69 (27%), Positives = 23/69 (33%), Gaps = 2/69 (2%)
Query: 1 GKSKRRPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYF--NDGKKKKSKGRGKTKT 58
G + R G A K K + A + ADD D ND KK S +G T
Sbjct: 136 GSACRSLFGGYVAWEKGKDDHSSAAVQVADDSDWPQXAMCVLVVNDIKKDVSSRQGMQLT 195
Query: 59 KELQEDEDG 67
E
Sbjct: 196 VATSELFKE 204
>gnl|CDD|221144 pfam11595, DUF3245, Protein of unknown function (DUF3245). This is
a family of proteins conserved in fungi. The function is
not known, and there is no S. cerevisiae member.
Length = 145
Score = 26.1 bits (57), Expect = 9.1
Identities = 17/52 (32%), Positives = 24/52 (46%), Gaps = 4/52 (7%)
Query: 2 KSKRRPVRGAAARGKKKKKSTGRASKAADDDDDED----LKAAYFNDGKKKK 49
K++ PV+ AA K+ K + + DDDDDED KK+K
Sbjct: 94 KAEGGPVKRKAAVVAKEAKQSSKGVGDDDDDDDEDESRSAAFGKKGSNKKRK 145
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns
from pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and
Prp22. ATP hydrolysis by the DEAH-box enzyme Prp16
promotes a conformational change in the spliceosome
that leads to protection of the 3'ss from targeted
RNase H cleavage. This change, which probably reflects
binding of the 3'ss PyAG in the catalytic centre of the
spliceosome, requires the ordered recruitment of Slu7,
Prp18, and Prp22 to the spliceosome. There is a close
functional relationship between Prp8, Prp18, and Slu7,
and Prp18 interacts with Slu7, so that together they
recruit Prp22 to the spliceosome. Most members of the
family carry a zinc-finger of the CCHC-type upstream of
this domain.
Length = 236
Score = 26.6 bits (59), Expect = 9.2
Identities = 11/53 (20%), Positives = 24/53 (45%)
Query: 16 KKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGA 68
+KK + + D D +++ + Y ++ S + +T + L+ ED A
Sbjct: 32 LRKKLKAKKEDDSDDTDYEDEDEELYLDESDMGDSDSKTRTTVRNLRIREDTA 84
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 26.8 bits (59), Expect = 10.0
Identities = 19/94 (20%), Positives = 30/94 (31%), Gaps = 2/94 (2%)
Query: 2 KSKRRPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKEL 61
A A+ KK S +A+K A AA KK +K K K +
Sbjct: 208 AKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAV 267
Query: 62 QEDEDGASGSKPRRYTPPPEKPTTDLRKKYEKQP 95
+ + +K T + K +K+
Sbjct: 268 K--KAAKKAAKAAAKAAKGAAKATKGKAKAKKKA 299
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.134 0.408
Gapped
Lambda K H
0.267 0.0643 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,994,145
Number of extensions: 1029466
Number of successful extensions: 1133
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1062
Number of HSP's successfully gapped: 93
Length of query: 209
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 116
Effective length of database: 6,812,680
Effective search space: 790270880
Effective search space used: 790270880
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 57 (25.9 bits)