RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy2633
(764 letters)
>gnl|CDD|202677 pfam03531, SSrecog, Structure-specific recognition protein (SSRP1).
SSRP1 has been implicated in transcriptional initiation
and elongation and in DNA replication and repair. This
domain belongs to the Pleckstrin homology fold
superfamily.
Length = 213
Score = 318 bits (818), Expect = e-104
Identities = 126/215 (58%), Positives = 161/215 (74%), Gaps = 2/215 (0%)
Query: 74 FAGFKETEIDKICSFFKTNFKIDVPEKELSVRGWNWGGTKFIGNVLSFDVNNTTAFEIPL 133
F GF++ ++DK+ +F K+NF + EKELSV+GWNWG F G+ L+FDV + AFE+PL
Sbjct: 1 FDGFRDQDLDKLQNFIKSNFSKSIDEKELSVKGWNWGEADFKGSNLTFDVGSKPAFEVPL 60
Query: 134 RDVSQCTTGKNEVTVEFHPNDEAAESLIEMRFYIPTNEIAGDTDPVEAFKDSVMNQASVI 193
+VSQ GKNEVT+EFH ND++ SL+EMRF++P D D VE FK++V+ +A VI
Sbjct: 61 SNVSQTLQGKNEVTLEFHQNDDSGVSLMEMRFHVPVTNTEDDGDAVEVFKETVLAKADVI 120
Query: 194 NATGDAIAVFNEIQCLTPRGRYDIKIFNSFFQLHGKTFDYKIPISTVLRMFTLPHKDGRQ 253
AT DAI F EI LTPRGRYDIK++ +F +LHGKT+DYKIP S++ R+F LPHKD RQ
Sbjct: 121 QATEDAIVTFEEILILTPRGRYDIKLYPTFLRLHGKTYDYKIPYSSINRLFLLPHKDQRQ 180
Query: 254 NFFVISLDPPIKQGQTRYHFLTLLFNQDETSEMEL 288
FFVISLDPPI+QGQTRY +L L F +DE EMEL
Sbjct: 181 VFFVISLDPPIRQGQTRYPYLVLQFEKDE--EMEL 213
>gnl|CDD|227494 COG5165, POB3, Nucleosome-binding factor SPN, POB3 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 508
Score = 324 bits (831), Expect = e-102
Identities = 173/506 (34%), Positives = 265/506 (52%), Gaps = 34/506 (6%)
Query: 20 PGKLKITDQNIVFK-NKKTGKVEQINGPDLDFVNWQKLVGTWALRIFLKNGSLHRFAGFK 78
G ++I + +K + K + ++ W + V + L+I +K +++ GF
Sbjct: 16 KGTVRIARSGLGWKASDSERKPFTLPRNEVKDAEWSRGVRGYKLKIRVKGNAVYELDGFS 75
Query: 79 ETEIDKICSFFKTNFKIDVPEKELSVRGWNWGGTKFIGNVLSFDVNNTTAFEIPLRDV-S 137
+ +ID++ + F F+I + +KELS+ GWNWG G F N FEIP+ D+ +
Sbjct: 76 QNDIDELKNIFSEYFRITLEQKELSIAGWNWGELGINGQEAVFFRNTKPIFEIPVDDIEN 135
Query: 138 QCTTGKNEVTVEFHPNDE----AAESLIEMRFYIP----TNEIAGDTD----PVEAFKDS 185
KNEV+VEF DE A + L+EMRFY P +IAG EAF +
Sbjct: 136 TNLDIKNEVSVEFRIQDEEYQPAGDELVEMRFYSPGVKTKEDIAGGESVEKSMAEAFYEE 195
Query: 186 VMNQASVINATGDAIAVFNEIQCLTPRGRYDIKIFNSFFQLHGKTFDYKIPISTVLRMFT 245
+ +A + + GDAI F + TPRGRYDI + + +L GKT+DYKI ++ ++
Sbjct: 196 LKEKADIGESAGDAIVSFEGLSLATPRGRYDIDFYRDYLRLRGKTYDYKIYYKSIKMLYV 255
Query: 246 LPHKDGRQNFFVISLDPPIKQGQTRYHFLTLLFNQDETSEMELPFSEQELKEKYEGKLDK 305
LP D + VI +PP++QGQTRY FL + F +DE E+EL +++ +E Y+ KL
Sbjct: 256 LPKIDDGHRYVVIGAEPPLRQGQTRYPFLVVQFQKDEDVEVELNVEDEDYEENYKDKLKG 315
Query: 306 ELSGPTYEVMAKIMKVIVNRKITVPGSFRGHTGTSAVTCSYKAAAGYIYPLERGFIFIHK 365
E G EV +++M+ + RK+ P F G AV CS KA G +YPL+ F+F+ K
Sbjct: 316 EYDGLLSEVFSEVMEGLTVRKVVRPSEFESRDGMRAVRCSMKANEGQLYPLDDCFLFLPK 375
Query: 366 PPIHIRFEEIATVNFARSGGST---RSFDFEIELKSGVLHTFSSIEKEEYGKLFDFIREK 422
P + + +I+ V F+R G S+ R+FD + L+S +TF++I K+E G L F+ K
Sbjct: 376 PTLRLDLSDISLVEFSRIGLSSMQARTFDLTLFLRSPGSYTFNNISKDEQGALEQFLHSK 435
Query: 423 KLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNP 482
++ +N E+ + D GS D E +EDE E DEDF
Sbjct: 436 GIKARNEEVQER--LQTDLGSISDS------------EDINMGSAGEEDESE--DEDFQM 479
Query: 483 DQAESDVAEEYDSNPTPTDSDEDSDA 508
++SDVAEEYD +D++ SD
Sbjct: 480 -VSDSDVAEEYDLQAALSDAEGGSDE 504
>gnl|CDD|241384 cd13230, PH1_SSRP1-like, Structure Specific Recognition protein 1
(SSRP1) Pleckstrin homology (PH) domain, repeat 1.
SSRP1 is a component of FACT (facilitator of chromatin
transcription), an essential chromatin reorganizing
factor. In yeast FACT (yFACT) is composed of three
proteins: Spt16/Cdc68, Pob3, and Nhp6. In metazoans the
Pob3 and Nhp6 orthologs are fused to form SSRP1/T160 in
human and mouse, respectively. The middle domain of the
Pob3 subunit (Pob3-M) has an unusual double pleckstrin
homology (PH) architecture. yFACT interacts in a
physiologically important way with the central
single-strand DNA binding factor RPA to promote a step
in DNA Replication. Coordinated function by yFACT and
RPA is important during nucleosome deposition. These
results support the model that the FACT family has an
essential role in constructing nucleosomes during DNA
replication, and suggest that RPA contributes to this
process. Members of this cd are composed of the first
PH-like repeat. PH domains have diverse functions, but
in general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 137
Score = 234 bits (599), Expect = 2e-73
Identities = 83/137 (60%), Positives = 108/137 (78%)
Query: 195 ATGDAIAVFNEIQCLTPRGRYDIKIFNSFFQLHGKTFDYKIPISTVLRMFTLPHKDGRQN 254
ATGDAI F ++ CLTPRGRYDI ++ SF +LHGKT+DYKIP S++ R+F LP D R
Sbjct: 1 ATGDAIVTFEDVLCLTPRGRYDIDMYPSFLRLHGKTYDYKIPYSSISRLFLLPKPDDRHV 60
Query: 255 FFVISLDPPIKQGQTRYHFLTLLFNQDETSEMELPFSEQELKEKYEGKLDKELSGPTYEV 314
FFVISLDPPI+QGQTRY FL + F++DE E+EL +E+EL+EKY+GKL+KE GP YEV
Sbjct: 61 FFVISLDPPIRQGQTRYPFLVMQFDKDEEIELELNLTEEELEEKYKGKLEKEYEGPLYEV 120
Query: 315 MAKIMKVIVNRKITVPG 331
++++ K + +KITVPG
Sbjct: 121 VSRVFKALTGKKITVPG 137
>gnl|CDD|241385 cd13231, PH2_SSRP1-like, Structure Specific Recognition protein 1
(SSRP1) Pleckstrin homology (PH) domain, repeat 2.
SSRP1 is a component of FACT (facilitator of chromatin
transcription), an essential chromatin reorganizing
factor. In yeast FACT (yFACT) is composed of three
proteins: Spt16/Cdc68, Pob3, and Nhp6. In metazoans the
Pob3 and Nhp6 orthologs are fused to form SSRP1/T160 in
human and mouse, respectively.The middle domain of the
Pob3 subunit (Pob3-M) has an unusual double pleckstrin
homology (PH) architecture. yFACT interacts in a
physiologically important way with the central
single-strand DNA binding factor RPA to promote a step
in DNA Replication. Coordinated function by yFACT and
RPA is important during nucleosome deposition. These
results support the model that the FACT family has an
essential role in constructing nucleosomes during DNA
replication, and suggest that RPA contributes to this
process. Members of this cd are composed of the second
PH-like repeat. PH domains have diverse functions, but
in general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 100
Score = 164 bits (418), Expect = 4e-48
Identities = 56/100 (56%), Positives = 76/100 (76%), Gaps = 3/100 (3%)
Query: 333 FRGHTGTSAVTCSYKAAAGYIYPLERGFIFIHKPPIHIRFEEIATVNFARSGG---STRS 389
F+ +G A+ CSYKA G +YPLE+ F+F+HKPPI IRF++I++V F+R GG S+R+
Sbjct: 1 FKSSSGQHAIKCSYKANEGLLYPLEKSFLFVHKPPILIRFDDISSVEFSRVGGSSTSSRT 60
Query: 390 FDFEIELKSGVLHTFSSIEKEEYGKLFDFIREKKLRVKNT 429
FD EIELK G HTFSSI++EEYG L DF++ K L++KN
Sbjct: 61 FDLEIELKGGTEHTFSSIDREEYGPLEDFLKSKGLKIKNE 100
>gnl|CDD|219876 pfam08512, Rtt106, Histone chaperone Rttp106-like. This family
includes Rttp106, a histone chaperone involved in
heterochromatin-mediated silencing. This domain belongs
to the Pleckstrin homology domain superfamily.
Length = 93
Score = 118 bits (297), Expect = 1e-31
Identities = 40/93 (43%), Positives = 59/93 (63%), Gaps = 2/93 (2%)
Query: 339 TSAVTCSYKAAAGYIYPLERGFIF-IHKPPIHIRFEEIATVNFARSGGSTRSFDFEIELK 397
V CSYKA+ G +YPLE+G +F KPP+ I ++I +V+F R G R+FD + K
Sbjct: 1 FPGVPCSYKASEGLLYPLEKGLLFLFEKPPLVITLDDIESVSFERVSGGLRTFDLVVVFK 60
Query: 398 -SGVLHTFSSIEKEEYGKLFDFIREKKLRVKNT 429
SG +H FSSI++EE L DF++ K +++K
Sbjct: 61 YSGPVHEFSSIDREELDVLKDFLKSKGIKIKEG 93
>gnl|CDD|189580 pfam00505, HMG_box, HMG (high mobility group) box.
Length = 69
Score = 84.2 bits (209), Expect = 4e-20
Identities = 31/69 (44%), Positives = 45/69 (65%)
Query: 554 PKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKED 613
PK+P+SA+ ++ + R K+K +NPG+ EI+K GE WK +S +EK Y+EK K K
Sbjct: 1 PKRPLSAFFLFSQEQRAKLKAENPGLKNAEISKILGEKWKNLSEEEKKPYEEKAEKEKAR 60
Query: 614 YNEALKAYK 622
Y +A AYK
Sbjct: 61 YEKAYPAYK 69
>gnl|CDD|238686 cd01390, HMGB-UBF_HMG-box, HMGB-UBF_HMG-box, class II and III
members of the HMG-box superfamily of DNA-binding
proteins. These proteins bind the minor groove of DNA in
a non-sequence specific fashion and contain two or more
tandem HMG boxes. Class II members include non-histone
chromosomal proteins, HMG1 and HMG2, which bind to bent
or distorted DNA such as four-way DNA junctions,
synthetic DNA cruciforms, kinked cisplatin-modified DNA,
DNA bulges, cross-overs in supercoiled DNA, and can
cause looping of linear DNA. Class III members include
nucleolar and mitochondrial transcription factors, UBF
and mtTF1, which bind four-way DNA junctions.
Length = 66
Score = 81.5 bits (202), Expect = 4e-19
Identities = 31/66 (46%), Positives = 45/66 (68%)
Query: 554 PKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKED 613
PK+P+SAY ++ + R K+KK+NP S TE+ K GE WK +S +EK +Y+EK K KE
Sbjct: 1 PKRPLSAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSEEEKKKYEEKAEKDKER 60
Query: 614 YNEALK 619
Y + +K
Sbjct: 61 YEKEMK 66
>gnl|CDD|238037 cd00084, HMG-box, High Mobility Group (HMG)-box is found in a
variety of eukaryotic chromosomal proteins and
transcription factors. HMGs bind to the minor groove of
DNA and have been classified by DNA binding preferences.
Two phylogenically distinct groups of Class I proteins
bind DNA in a sequence specific fashion and contain a
single HMG box. One group (SOX-TCF) includes
transcription factors, TCF-1, -3, -4; and also SRY and
LEF-1, which bind four-way DNA junctions and duplex DNA
targets. The second group (MATA) includes fungal mating
type gene products MC, MATA1 and Ste11. Class II and III
proteins (HMGB-UBF) bind DNA in a non-sequence specific
fashion and contain two or more tandem HMG boxes. Class
II members include non-histone chromosomal proteins,
HMG1 and HMG2, which bind to bent or distorted DNA such
as four-way DNA junctions, synthetic DNA cruciforms,
kinked cisplatin-modified DNA, DNA bulges, cross-overs
in supercoiled DNA, and can cause looping of linear DNA.
Class III members include nucleolar and mitochondrial
transcription factors, UBF and mtTF1, which bind
four-way DNA junctions.
Length = 66
Score = 80.4 bits (199), Expect = 8e-19
Identities = 29/66 (43%), Positives = 47/66 (71%)
Query: 554 PKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKED 613
PK+P+SAY ++ + R ++K +NPG+S EI+K GE+WK++S +EK +Y+EK K KE
Sbjct: 1 PKRPLSAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKER 60
Query: 614 YNEALK 619
Y + +
Sbjct: 61 YEKEMP 66
>gnl|CDD|197700 smart00398, HMG, high mobility group.
Length = 70
Score = 78.9 bits (195), Expect = 4e-18
Identities = 36/70 (51%), Positives = 48/70 (68%)
Query: 553 APKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKE 612
PK+PMSA+M++ + R KIK +NP +S EI+KK GE WK +S +EKA Y+EK K KE
Sbjct: 1 KPKRPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKE 60
Query: 613 DYNEALKAYK 622
Y E + YK
Sbjct: 61 RYEEEMPEYK 70
>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
domain [Chromatin structure and dynamics].
Length = 211
Score = 79.9 bits (197), Expect = 6e-17
Identities = 53/179 (29%), Positives = 81/179 (45%), Gaps = 23/179 (12%)
Query: 529 KPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKG 588
KPRK K K RKKKD N PK+P+SAY ++ + R++I+K+NP ++F E+ K
Sbjct: 51 KPRKKTKSK-----RLVRKKKDPNGPKRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLL 105
Query: 589 GELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETA 648
E WK ++ +EK Y ++ N +E Y + Y + K E
Sbjct: 106 SEKWKELTDEEKEPYYKEANSDRERYQREKEEYNK----------KLPNKAPIGPFIENE 155
Query: 649 STAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDS 707
K++ K S + ++ K SK + DE K K K +K + E DS
Sbjct: 156 P------KIRPKVEGPSPDKA--LVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDS 206
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 68.1 bits (167), Expect = 1e-11
Identities = 57/263 (21%), Positives = 86/263 (32%), Gaps = 18/263 (6%)
Query: 504 EDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMM 563
E+ + E + K+++ + + K K+ K+KK + KK
Sbjct: 1142 EEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKK---KEKKKKKSSADKSKKASVVGNS 1198
Query: 564 WFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKE 623
D EK K D+ KK E+ + K + K ++ + K
Sbjct: 1199 KRVDSDEKRKLDD-----KPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKS 1253
Query: 624 SGGGQDSDDGKTSKSKKPAKKKETASTAV-SPHKVKSKEFIESNGSSSDSDDDKKASSKR 682
S + SK KP + S SP + ESNG S S KK KR
Sbjct: 1254 SEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKR 1313
Query: 683 KRDSDEDSKAKTKKKKEKSES-ESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYD 741
S K K K +K+ + +S + + S+ K KS E DD+
Sbjct: 1314 LEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSE- 1372
Query: 742 EPVESTPPSSDAGSDASSSEDDD 764
D EDDD
Sbjct: 1373 -------VDDSEDEDDEDDEDDD 1388
Score = 43.5 bits (103), Expect = 5e-04
Identities = 26/164 (15%), Positives = 57/164 (34%)
Query: 590 ELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETAS 649
++W K + +E+ +++ + + ++ G K K+ KKK +A
Sbjct: 1128 DMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSAD 1187
Query: 650 TAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGE 709
+ V + + ++S+ D S ED + + K K+ S S +
Sbjct: 1188 KSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKK 1247
Query: 710 SEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDA 753
+ K S+ + + + + + S PP S
Sbjct: 1248 NNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKR 1291
>gnl|CDD|238684 cd01388, SOX-TCF_HMG-box, SOX-TCF_HMG-box, class I member of the
HMG-box superfamily of DNA-binding proteins. These
proteins contain a single HMG box, and bind the minor
groove of DNA in a highly sequence-specific manner.
Members include SRY and its homologs in insects and
vertebrates, and transcription factor-like proteins,
TCF-1, -3, -4, and LEF-1. They appear to bind the minor
groove of the A/T C A A A G/C-motif.
Length = 72
Score = 53.5 bits (129), Expect = 4e-09
Identities = 23/68 (33%), Positives = 39/68 (57%)
Query: 555 KKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDY 614
K+PM+A+M++ R K+ ++ P I+K G+ WK +S++EK Y E+ K KE +
Sbjct: 3 KRPMNAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSNEEKQPYYEEAKKLKELH 62
Query: 615 NEALKAYK 622
+ YK
Sbjct: 63 MKLYPDYK 70
>gnl|CDD|185511 PTZ00199, PTZ00199, high mobility group protein; Provisional.
Length = 94
Score = 53.7 bits (129), Expect = 5e-09
Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 2/94 (2%)
Query: 532 KSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGIS--FTEIAKKGG 589
+KK + KRKKKD NAPK+ +SAYM + + R +I +NP ++ + K G
Sbjct: 1 MAKKQGKVLVRKNKRKKKDPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVG 60
Query: 590 ELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKE 623
E W +S +EKA Y++K + K Y + Y +
Sbjct: 61 EAWNKLSEEEKAPYEKKAQEDKVRYEKEKAEYAK 94
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 57.5 bits (138), Expect = 2e-08
Identities = 73/305 (23%), Positives = 130/305 (42%), Gaps = 22/305 (7%)
Query: 431 KGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVA 490
K ++ K + DE + A A+ K +AA++ E+ + E+ + E++ A
Sbjct: 1303 KADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAA 1362
Query: 491 EEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKD 550
EE + A KKK E+ +K +K + KK + + KKK
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAA--KKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA 1420
Query: 551 KNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKA 610
A KK EK K D E KK E K +K+KAE +K +A
Sbjct: 1421 DEAKKKA-----------EEKKKAD-------EAKKKAEEAKKADEAKKKAEEAKKAEEA 1462
Query: 611 KEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSS 670
K+ EA KA + +++ +K K KK+ + K + + +
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK 1522
Query: 671 DSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSK 730
+D+ KKA +K +DE KA+ KKK ++ + + ++E++KK++ K ++ K+ +
Sbjct: 1523 KADEAKKAEEAKK--ADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMAL 1580
Query: 731 KEDEI 735
++ E
Sbjct: 1581 RKAEE 1585
Score = 55.9 bits (134), Expect = 7e-08
Identities = 75/295 (25%), Positives = 128/295 (43%), Gaps = 19/295 (6%)
Query: 458 KREAAERDDEDDEDEDESTDEDFNPDQA-----ESDVAEEYDSNPTPTDSDEDSDASEGT 512
K+ ER++E+ +E+ F QA E+ A+E +DE A E
Sbjct: 1243 KKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKK 1302
Query: 513 K----KKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDM 568
K KKK E+++K +K ++KK +++ + KK A K A
Sbjct: 1303 KADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAA 1362
Query: 569 REKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQ 628
EK + E KK K K+KA DE KA+ED +A + K + +
Sbjct: 1363 EEKAEAAEKKKE--EAKKKADAAKKKAEEKKKA--DEAKKKAEEDKKKADELKKAAAAKK 1418
Query: 629 DSDDGKTSKSKK----PAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKR 684
+D+ K +K AKKK + K K++E ++ + +++ KKA +K+
Sbjct: 1419 KADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKK 1478
Query: 685 DSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEE 739
+ + KKK E+++ ++D E++K ++K K K ++KK DE E
Sbjct: 1479 AEEAKKADEAKKKAEEAKKKAD--EAKKAAEAKKKADEAKKAEEAKKADEAKKAE 1531
Score = 51.7 bits (123), Expect = 2e-06
Identities = 73/310 (23%), Positives = 133/310 (42%), Gaps = 15/310 (4%)
Query: 439 EDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPT 498
E+ + + + +A A + EAAE E E + E + D A+ E+ ++
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKE--EAKKKADAAKKKAEEKKKADEA 1396
Query: 499 PTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPM 558
++ED ++ KK K K ++K + KK + + + KK A KK
Sbjct: 1397 KKKAEEDKKKADELKKAAAAK--KKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAE 1454
Query: 559 SAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEAL 618
A E KK E KK E K +K+KA E+ K ++ +A
Sbjct: 1455 EA-----KKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKA---EEAKKKADEAKKAA 1506
Query: 619 KAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKA 678
+A K++ + +++ K + K A++ + A A + K + ++ +++ KKA
Sbjct: 1507 EAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKA 1566
Query: 679 SSKRKRDSDEDS---KAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEI 735
+K + D++ KA+ KK E++ E E+EKK K++ ++K K E+
Sbjct: 1567 EEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELK 1626
Query: 736 DDEEYDEPVE 745
EE + VE
Sbjct: 1627 KAEEEKKKVE 1636
Score = 48.6 bits (115), Expect = 1e-05
Identities = 63/309 (20%), Positives = 122/309 (39%), Gaps = 10/309 (3%)
Query: 457 VKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKK 516
V++ R ED + + + E+ AE+ ++E +E KK +
Sbjct: 1187 VRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAE 1246
Query: 517 KEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDN 576
+E++ + I E+ R + + + + +K K EK K D
Sbjct: 1247 EERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKAD- 1305
Query: 577 PGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTS 636
E KK E K +K+KAE +K KA +A +A K + + +
Sbjct: 1306 ------EAKKKAEEAKKADEAKKKAEEAKK--KADAAKKKAEEAKKAAEAAKAEAEAAAD 1357
Query: 637 KSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDE-DSKAKTK 695
+++ +K E A K K+ + +D+ KK + + K+ +DE A K
Sbjct: 1358 EAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAK 1417
Query: 696 KKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGS 755
KK ++++ +++ + E K K++ ++K K E+ EE + E + +A
Sbjct: 1418 KKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKK 1477
Query: 756 DASSSEDDD 764
A ++ D
Sbjct: 1478 KAEEAKKAD 1486
Score = 48.6 bits (115), Expect = 1e-05
Identities = 66/302 (21%), Positives = 122/302 (40%), Gaps = 21/302 (6%)
Query: 458 KREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKK-- 515
K E A + ++ + E E+ D E+ AEE +N +E A ++
Sbjct: 1213 KAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAI 1272
Query: 516 KKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKD 575
K E++RK + + K K D+ + KKK A KK E K D
Sbjct: 1273 KAEEARKADELKKAEEKKKADEAKK---AEEKKKADEAKKKA-----------EEAKKAD 1318
Query: 576 NPGISFTEIAKKGGELWKTVS-SKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGK 634
E KK K +K+ AE + +A D EA + E+ + + K
Sbjct: 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKK 1378
Query: 635 TS----KSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDS 690
+ K + KK + A K K+ E ++ + +D+ KK + ++K+ +
Sbjct: 1379 KADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK 1438
Query: 691 KAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPS 750
KA+ KK ++++ +++ + +E K K++ ++K K E+ +E + E
Sbjct: 1439 KAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKK 1498
Query: 751 SD 752
+D
Sbjct: 1499 AD 1500
Score = 45.9 bits (108), Expect = 1e-04
Identities = 80/350 (22%), Positives = 156/350 (44%), Gaps = 22/350 (6%)
Query: 408 EKEEYGKLFDFIREKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDE 467
+KEE K D ++K K + +K +ED +D+ K+ A A+ K + A++ E
Sbjct: 1372 KKEEAKKKADAAKKKAEEKKKADEAKK-KAEEDKKKADELKKAAA--AKKKADEAKKKAE 1428
Query: 468 DDEDEDES---TDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTI 524
+ + DE+ +E D+A+ E + ++E A E KKK E+++K
Sbjct: 1429 EKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEA--KKKAEEAKKAD 1486
Query: 525 TISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEI 584
+K ++KK + + + + KKK A K + E K D E
Sbjct: 1487 EAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADE--AKKAEE 1544
Query: 585 AKKGGELWKT--VSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDS---------DDG 633
KK EL K + E+ + E+ KA+ED N AL+ +E+ +++ ++
Sbjct: 1545 KKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEE 1604
Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
K K+++ K +E A K + ++ ++++ KKA +K + + KA
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAA 1664
Query: 694 TKKKKEKSES-ESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDE 742
+ KK + + +++ + +E + K+ K ++KK +E+ +E +E
Sbjct: 1665 EEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE 1714
Score = 45.5 bits (107), Expect = 1e-04
Identities = 64/330 (19%), Positives = 135/330 (40%), Gaps = 21/330 (6%)
Query: 428 NTGKGEKPNYKE-DFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAE 486
+G KP+YK+ DF + +D + +A + + E + E+ E+ +
Sbjct: 1067 GQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAED 1126
Query: 487 SDVAEEYDSNPTPTDSDEDSDASEGTK---KKKKEKSRKTITISEKPRKSKKDKESSGGS 543
+ AEE ++E A + + +K E +RK + K +KK + +
Sbjct: 1127 ARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARK-AEEARKAEDAKKAEAARKAE 1185
Query: 544 TKRKKKDKNAPKKPMSAYMMWFNDMREKIK--------KDNPGISFTEIAKKGGELWKTV 595
RK ++ + A + K + K + E AKK E K
Sbjct: 1186 EVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKA 1245
Query: 596 SSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPA--------KKKET 647
+ E K +A+ + +A ++ + +D+ K ++ KK A KKK
Sbjct: 1246 EEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKAD 1305
Query: 648 ASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDS 707
+ + K+ E + + D K ++ + + E +KA+ + +++E+ +
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365
Query: 708 GESEKEKKSKSKPKGDKSKSKSKKEDEIDD 737
E+ ++KK ++K K D +K K++++ + D+
Sbjct: 1366 AEAAEKKKEEAKKKADAAKKKAEEKKKADE 1395
Score = 44.4 bits (104), Expect = 3e-04
Identities = 71/338 (21%), Positives = 137/338 (40%), Gaps = 7/338 (2%)
Query: 421 EKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDF 480
E K + + K ++ K + DE + A A+ K + A++ E + DE+ +
Sbjct: 1461 EAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEE 1520
Query: 481 NPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESS 540
E+ AEE ++E A E K ++ +K+ + E + + +
Sbjct: 1521 AKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMAL 1580
Query: 541 GGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEK 600
+ + KK ++ ++ M Y E+ KK E KK E K V +K
Sbjct: 1581 RKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKK 1640
Query: 601 AEYDEK-----VNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPH 655
E +EK + KA+E+ + + K+++ KK A +
Sbjct: 1641 KEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEE 1700
Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKK 715
K++E + ++ K + + + E++K + ++ K+K+E E+ E EK+K
Sbjct: 1701 AKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE-EAKKDEEEKKKI 1759
Query: 716 SKSKPKGDKSKSKSKKEDE-IDDEEYDEPVESTPPSSD 752
+ K + +K + +KE E + +EE DE E D
Sbjct: 1760 AHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797
Score = 44.4 bits (104), Expect = 3e-04
Identities = 74/331 (22%), Positives = 130/331 (39%), Gaps = 39/331 (11%)
Query: 421 EKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDF 480
E + + K E+ E+ ++D K+ +A VK+ + D ++ + E +
Sbjct: 1197 EDARKAEAARKAEEERKAEEARKAEDAKKAEA----VKKAEEAKKDAEEAKKAEEERNNE 1252
Query: 481 NPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPR--KSKKDKE 538
+ E + ++E A E K ++K+K+ + EK + ++KK E
Sbjct: 1253 EIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAE 1312
Query: 539 SSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSK 598
+ + + KKK + A KK +A E K + E A E +++
Sbjct: 1313 EAKKADEAKKKAEEAKKKADAAK----KKAEEAKKAAEAAKAEAEAAADEAE-----AAE 1363
Query: 599 EKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPA---KKKETASTAVSPH 655
EKAE EK + + +A K E D K + KK A KK A
Sbjct: 1364 EKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEA 1423
Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKE--------KSESESDS 707
K K++E KKA +K+ + + KKK E K ++E
Sbjct: 1424 KKKAEE-------------KKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK 1470
Query: 708 GESEKEKKSKSKPKGDKSKSKSKKEDEIDDE 738
E +KK++ K D++K K+++ + DE
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADE 1501
Score = 42.4 bits (99), Expect = 0.001
Identities = 68/349 (19%), Positives = 138/349 (39%), Gaps = 9/349 (2%)
Query: 421 EKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTD-ED 479
E K + K K D E++ A A+ K E A++ DE + +E+ E+
Sbjct: 1402 EDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEE 1461
Query: 480 FNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKES 539
E+ A+E +++ KKK ++++K +K ++KK +E+
Sbjct: 1462 AKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEA 1521
Query: 540 SGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKE 599
+K ++ + A D +K ++ + + +
Sbjct: 1522 KKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALR 1581
Query: 600 KAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPA---KKKETASTAVSPHK 656
KAE +K +A+ + E +K Y+E + + K ++K A KK E V K
Sbjct: 1582 KAEEAKKAEEARIE--EVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLK 1639
Query: 657 VKSKEFI---ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKE 713
K E E + + + K A +K + D+ + KK +E + +++ + E E
Sbjct: 1640 KKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAE 1699
Query: 714 KKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSED 762
+ K++ K + KK +E+ E + +++ +A D +E+
Sbjct: 1700 EAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEE 1748
Score = 37.4 bits (86), Expect = 0.039
Identities = 64/340 (18%), Positives = 126/340 (37%), Gaps = 11/340 (3%)
Query: 431 KGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVA 490
K E+ +E ++EK+ A A+ EA + +E + E+E +Q + A
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKK----KVEQLKKKEA 1643
Query: 491 EEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKD 550
EE ++E++ + KK E+ +K ++K + +K + + K
Sbjct: 1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703
Query: 551 KNAPKKPMSAYMMWFNDMRE-----KIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDE 605
KK + ++++ KIK + E KK E K K+K + +
Sbjct: 1704 AEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLK 1763
Query: 606 KVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIES 665
K + K + K + D+ + + K K + + I
Sbjct: 1764 KEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVIND 1823
Query: 666 NGSSSDSDDDKKASSK--RKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGD 723
+ DS + A SK + ++D K K K E E + + KEK K + +
Sbjct: 1824 SKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEE 1883
Query: 724 KSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDD 763
++ ++ + DD E + P + ++ D +D+
Sbjct: 1884 IEEADEIEKIDKDDIEREIPNNNMAGKNNDIIDDKLDKDE 1923
Score = 36.7 bits (84), Expect = 0.056
Identities = 61/304 (20%), Positives = 117/304 (38%), Gaps = 11/304 (3%)
Query: 446 DEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDED 505
DE + + + +A DE + + D D D + EE E
Sbjct: 1049 DEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTET 1108
Query: 506 SDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWF 565
A E K ++ +K + +E+ RK++ +++ K +K A
Sbjct: 1109 GKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEE 1168
Query: 566 NDMREKIKKDNPGISFTEIAK----KGGELWKTVSSKEKAEYDEKVNKAK--EDYNEALK 619
E KK E+ K + E + + KAE + K +A+ ED +A
Sbjct: 1169 ARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEA 1228
Query: 620 AYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKAS 679
K +D+++ K ++ ++ ++ A H + + I++ + +D+ KKA
Sbjct: 1229 VKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAE-EARKADELKKAE 1287
Query: 680 SKRK----RDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEI 735
K+K + ++E KA KKK + ++D + + E+ K K ++KK E
Sbjct: 1288 EKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEA 1347
Query: 736 DDEE 739
E
Sbjct: 1348 AKAE 1351
Score = 31.6 bits (71), Expect = 2.0
Identities = 65/345 (18%), Positives = 141/345 (40%), Gaps = 23/345 (6%)
Query: 408 EKEEYGKLFDFIREKKLRVKNTGKGEKPNYK-EDFGSSDDEKEPDAYLARVKREAAERDD 466
E + + EKK++ + K E+ K E+ +++EK+ L + + E ++ +
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE 1650
Query: 467 EDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITI 526
E + E+E+ + E+ ++ED + KK+ E+++K +
Sbjct: 1651 ELKKAEEENK---IKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEEL 1707
Query: 527 SEKPRKSKKDKES-SGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIA 585
+K + KK E + K K + A K+ +++ +K E
Sbjct: 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEE 1767
Query: 586 KKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKES------GGGQDSDDGKTSKSK 639
KK E+ K + + E DE+ K + + ++ +K ++ GG + + SK
Sbjct: 1768 KKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEM 1827
Query: 640 KPAKKKETASTAVS----PHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTK 695
+ + KE A + + +F ++N + D + K+A +++D ED + + +
Sbjct: 1828 EDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGN--KEADFNKEKDLKEDDEEEIE 1885
Query: 696 KKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEY 740
+ E + + D E E + K + +D++D +EY
Sbjct: 1886 EADEIEKIDKDDIEREI-PNNNMAGKNNDI-----IDDKLDKDEY 1924
>gnl|CDD|204115 pfam09011, DUF1898, Domain of unknown function (DUF1898). This
domain is predominantly found in Maelstrom homolog
proteins. It has no known function.
Length = 69
Score = 48.6 bits (116), Expect = 2e-07
Identities = 22/64 (34%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 552 NAPKKPMSAYMMWFNDMREKIKKDNPGIS-FTEIAKKGGELWKTVSSKEKAEYDEKVNKA 610
N PK +AY + MR ++K++ P + E +K E WK +S +EK +Y+EK +
Sbjct: 2 NKPKAKRNAYFFFVQTMRPELKREGPQVPGVAEFSKLCSEKWKAMSEEEKEKYEEKARED 61
Query: 611 KEDY 614
K+ Y
Sbjct: 62 KKRY 65
>gnl|CDD|238685 cd01389, MATA_HMG-box, MATA_HMG-box, class I member of the HMG-box
superfamily of DNA-binding proteins. These proteins
contain a single HMG box, and bind the minor groove of
DNA in a highly sequence-specific manner. Members
include the fungal mating type gene products MC, MATA1
and Ste11.
Length = 77
Score = 45.3 bits (108), Expect = 3e-06
Identities = 20/69 (28%), Positives = 39/69 (56%)
Query: 554 PKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKED 613
+P +A++++ D ++K +NPG++ EI++ G +W++ S + KA Y E + KE
Sbjct: 2 IPRPRNAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESPEVKAYYKELAEEEKER 61
Query: 614 YNEALKAYK 622
+ YK
Sbjct: 62 HAREYPDYK 70
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 45.0 bits (107), Expect = 1e-04
Identities = 27/204 (13%), Positives = 60/204 (29%), Gaps = 9/204 (4%)
Query: 562 MMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAY 621
M + E ++ ++A K SK +E + +A E + +
Sbjct: 1 MTTASTKAELAAEEEAKKKLKKLAAKS-------KSKGFITKEE-IKEALESKKKTPEQI 52
Query: 622 KESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDD-DKKASS 680
+ T + + K+ TA K+ + S +KK +
Sbjct: 53 DQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNAL 112
Query: 681 KRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEY 740
+ D + + + + + D + + + D + + + +DEE
Sbjct: 113 DKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEK 172
Query: 741 DEPVESTPPSSDAGSDASSSEDDD 764
E E S D + +
Sbjct: 173 KEAKELEKLSDDDDFVWDEDDSEA 196
Score = 41.1 bits (97), Expect = 0.002
Identities = 23/153 (15%), Positives = 53/153 (34%), Gaps = 5/153 (3%)
Query: 593 KTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAV 652
K + ++ + ++ +D ++A ++ + + +K+ K K+ ++
Sbjct: 45 KKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSK 104
Query: 653 SPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKK-----EKSESESDS 707
K + + + D D +A D D+D + + + D
Sbjct: 105 KAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDD 164
Query: 708 GESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEY 740
+ E E+K ++K S DE D E
Sbjct: 165 VDDEDEEKKEAKELEKLSDDDDFVWDEDDSEAL 197
Score = 40.0 bits (94), Expect = 0.005
Identities = 25/234 (10%), Positives = 58/234 (24%), Gaps = 26/234 (11%)
Query: 507 DASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFN 566
+ TK + + + + KSK + K + K +
Sbjct: 1 MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPE---------- 50
Query: 567 DMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGG 626
+ + +K A+ AK+ + L + K++
Sbjct: 51 --QIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEK 108
Query: 627 GQDSDDGKTSKSKKPAKKKETASTAVSP---HKVKSKEFIESNGSSSDSDDDKKASSKRK 683
D K A + + + + D +DD +
Sbjct: 109 KNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDE 168
Query: 684 RDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDD 737
+ +++K + S+ D +++ + K K D
Sbjct: 169 DEEKKEAKE------LEKLSDDDDFVWDEDDSEALRQ-----ARKDAKLTATAD 211
Score = 36.9 bits (86), Expect = 0.038
Identities = 22/117 (18%), Positives = 43/117 (36%)
Query: 409 KEEYGKLFDFIREKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDED 468
K K + + + +K + D + D + + DD D
Sbjct: 89 KAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDID 148
Query: 469 DEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTIT 525
D+D+DE DED + D + + E+ ++ SD+D + + ++RK
Sbjct: 149 DDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAK 205
Score = 36.5 bits (85), Expect = 0.059
Identities = 30/136 (22%), Positives = 57/136 (41%)
Query: 403 TFSSIEKEEYGKLFDFIREKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAA 462
T S I K++ K K + K + ++ D+ + Y+ +
Sbjct: 70 TESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQ 129
Query: 463 ERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRK 522
DD+DD+D+D+ D+D + D + D E+ D + + +E +A E K +
Sbjct: 130 ADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVW 189
Query: 523 TITISEKPRKSKKDKE 538
SE R+++KD +
Sbjct: 190 DEDDSEALRQARKDAK 205
Score = 32.3 bits (74), Expect = 1.0
Identities = 25/148 (16%), Positives = 59/148 (39%), Gaps = 4/148 (2%)
Query: 420 REKKLRVKNTGKG---EKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDEST 476
K + K K + P K+ D K+ + + ++ +D + +++
Sbjct: 73 DIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEK-KNALDKDDDLNYVKDIDVLNQAD 131
Query: 477 DEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKD 536
D+D + D + D + D + D ++D D + ++K+++++ +S+ +
Sbjct: 132 DDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDE 191
Query: 537 KESSGGSTKRKKKDKNAPKKPMSAYMMW 564
+S RK A P+ AY+
Sbjct: 192 DDSEALRQARKDAKLTATADPVKAYLKQ 219
Score = 30.3 bits (69), Expect = 4.3
Identities = 28/182 (15%), Positives = 58/182 (31%), Gaps = 14/182 (7%)
Query: 470 EDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEK 529
E + DQ ++ T+SD ++ K K+ + ++
Sbjct: 40 EALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDE 99
Query: 530 PRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGG 589
SKK ++K + N + D+ ++
Sbjct: 100 LDSSKK--------AEKKNALDKDDDLNYVKDIDVLNQADDDDDDDD----DDDLDDDDI 147
Query: 590 ELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKK--KET 647
+ ++ + D+ V+ E+ EA + K S D S++ + A+K K T
Sbjct: 148 DDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLT 207
Query: 648 AS 649
A+
Sbjct: 208 AT 209
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 44.2 bits (104), Expect = 2e-04
Identities = 49/275 (17%), Positives = 92/275 (33%), Gaps = 24/275 (8%)
Query: 444 SDDEKEPDAYLARVKREAAER--------------DDEDDEDEDESTDEDFNPDQAESDV 489
D EK+ L +K+EAA R + +E+ E P + +
Sbjct: 1 MDPEKD----LEDLKKEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTI 56
Query: 490 AEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKK 549
E + + A K+K + + +T EK + K ++ K
Sbjct: 57 EE------AKRRAAAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAK 110
Query: 550 DKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNK 609
K + ++ + + + ++G E + E +
Sbjct: 111 QKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAA 170
Query: 610 AKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSS 669
A A A +++ + + T + K AK K A+ + ++ + NG S
Sbjct: 171 AAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGDS 230
Query: 670 SDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESE 704
D D KA + K + ++AKTK + K E E
Sbjct: 231 GDEDAKAKAIAAAKAKAAAAARAKTKGAEGKKEEE 265
Score = 31.1 bits (70), Expect = 2.7
Identities = 33/140 (23%), Positives = 59/140 (42%), Gaps = 7/140 (5%)
Query: 583 EIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPA 642
+AK+ E + V+ +EKA+ K A + AL K G T + K A
Sbjct: 73 ALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTE-----EVTEEEKAAA 127
Query: 643 KKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTK-KKKEKS 701
K K A+ + ++ + + + ++ K +K K + +KA K+K
Sbjct: 128 KAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAE 187
Query: 702 ESESDSGESEKEKKSKSKPK 721
E + E +E+K+K+K K
Sbjct: 188 AGE-GTEEVTEEEKAKAKAK 206
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 44.5 bits (105), Expect = 2e-04
Identities = 30/184 (16%), Positives = 63/184 (34%), Gaps = 8/184 (4%)
Query: 584 IAKKGGELWKTVSSKEKAEYD-----EKVNKAKEDYNEALKAYKE-SGGGQDSDDGKTSK 637
K L+ ++ +K + ++ +D + L E G S + T
Sbjct: 58 EGKHANILYNSILCNQKKHASFLNQRKSLDDDDDDEFDFLYEDDEDDAGNATSGESSTDD 117
Query: 638 SKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSD--EDSKAKTK 695
+ ++ +F + + S+S +D K S E+ K +T+
Sbjct: 118 DSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTE 177
Query: 696 KKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGS 755
+ +D E+ + KSK + S + + D E E ++ + A +
Sbjct: 178 NTFKFYNFGNDGEEAAAKDGGKSKSSDPGPLNDSDGQGDDGDPESAEEDKAASNTRAAYT 237
Query: 756 DASS 759
A+S
Sbjct: 238 KATS 241
Score = 34.5 bits (79), Expect = 0.25
Identities = 37/185 (20%), Positives = 66/185 (35%), Gaps = 14/185 (7%)
Query: 392 FEIELKSGVLHTFSSIEKEEYGKLFDFIREKKLRVKNTGKGEKPNYKEDFGSSDDEKEPD 451
+ L+S L S++E E D LR + Y +
Sbjct: 25 LRLGLRSNYLQ--STVEDIEGPSAVDERTSGVLRDEGKHANIL--YNSILCNQKKHASFL 80
Query: 452 AYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEG 511
+R++ + DD+D+ D DED + + + + DS D DED+D
Sbjct: 81 N-----QRKSLDDDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDAD---- 131
Query: 512 TKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKK-KDKNAPKKPMSAYMMWFNDMRE 570
T+ + + S + + K+S S+ K ++K + + + ND E
Sbjct: 132 TQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTENTFKFYNFGNDGEE 191
Query: 571 KIKKD 575
KD
Sbjct: 192 AAAKD 196
Score = 31.8 bits (72), Expect = 1.6
Identities = 20/111 (18%), Positives = 37/111 (33%), Gaps = 12/111 (10%)
Query: 665 SNGSSSDSDD-------DKKASSKRKRDS----DEDSKAKTKKKKEKSESESDSGESEKE 713
S SS+D D D+ A ++ D D+D + ++ + +S + S EK
Sbjct: 110 SGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSA-EKL 168
Query: 714 KKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
+ K + K + D + D + +D D
Sbjct: 169 IEEKKGQTENTFKFYNFGNDGEEAAAKDGGKSKSSDPGPLNDSDGQGDDGD 219
Score = 30.6 bits (69), Expect = 3.7
Identities = 17/119 (14%), Positives = 38/119 (31%), Gaps = 12/119 (10%)
Query: 646 ETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESES 705
+ H F+ S D DDD+ + DED + ++ +S
Sbjct: 67 NSILCNQKKHAS----FLNQRKSLDDDDDDEFDFL---YEDDEDDAGNATSGESSTDDDS 119
Query: 706 DSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
++++ + ++ D + + DD + + S + S E+
Sbjct: 120 LLELPDRDEDADTQANND-----QTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKK 173
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 44.3 bits (104), Expect = 2e-04
Identities = 47/169 (27%), Positives = 76/169 (44%), Gaps = 30/169 (17%)
Query: 598 KEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKK-KETASTAVSPHK 656
K+ A D + + ED + + S S+D K +KKP KK KE A+ SP K
Sbjct: 23 KQTASPDGRASPTNEDQRSSGR-NSPSAASTSSNDSKAESTKKPNKKIKEEAT---SPLK 78
Query: 657 VKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKS 716
+K E +SD+++ ++ ++K K+KT++ + GE E E
Sbjct: 79 -STKRQREK--PASDTEEPERVTAK---------KSKTQELSRPNSPSEGEGEGEGE--- 123
Query: 717 KSKPKGDKSKSKSKKED-EIDDEEYDEPVESTPPS----SDAGSDASSS 760
G+ S S+S E+ D ++ D+ S+ PS D SD+ SS
Sbjct: 124 -----GESSDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNESDSDSS 167
Score = 38.5 bits (89), Expect = 0.013
Identities = 33/128 (25%), Positives = 57/128 (44%), Gaps = 1/128 (0%)
Query: 594 TVSSKEKAEYDEKVNKA-KEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAV 652
T S+ KAE +K NK KE+ LK+ K SD + + K + S
Sbjct: 52 TSSNDSKAESTKKPNKKIKEEATSPLKSTKRQREKPASDTEEPERVTAKKSKTQELSRPN 111
Query: 653 SPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEK 712
SP + + + E S S S +++ +S + D D S + + + +ES+SDS ++
Sbjct: 112 SPSEGEGEGEGEGESSDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNESDSDSSAQQQ 171
Query: 713 EKKSKSKP 720
+ + P
Sbjct: 172 LLQPQGPP 179
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 43.7 bits (103), Expect = 4e-04
Identities = 50/283 (17%), Positives = 91/283 (32%), Gaps = 51/283 (18%)
Query: 504 EDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMM 563
ED+ A E +K ++ +R ++K + K + +TK+ + + + S+ M
Sbjct: 1171 EDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAME 1230
Query: 564 WFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKE 623
N K + G K ++ ++ E ++++ D + L AY
Sbjct: 1231 TENVAEVVKPK-----------GRAGAKKKAPAAAKEKEEEDEI----LDLKDRLAAYNL 1275
Query: 624 SGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDD-------- 675
S + + PA++ ++ V S S D DDD
Sbjct: 1276 DSAPAQSAKMEETVKAVPARRAAARKKPLASVSVISD--------SDDDDDDFAVEVSLA 1327
Query: 676 ----KKASSKRKRDSDEDSKAKTKKKKE----------------KSESESDSGESEKEKK 715
KK K + + +K KK K +K +K
Sbjct: 1328 ERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKPAEAIGISPEKKVRK 1387
Query: 716 SKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDAS 758
++ P KS S + + E E V + S D S
Sbjct: 1388 MRASPFNKKSGSVLGRAATNKETESSENVSGSSSSEKDEIDVS 1430
Score = 39.1 bits (91), Expect = 0.010
Identities = 47/268 (17%), Positives = 89/268 (33%), Gaps = 21/268 (7%)
Query: 456 RVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKK 515
+V R+A ++ + S E S + E + + K
Sbjct: 1196 KVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAAK 1255
Query: 516 KKEKSRKTITISEKPRKSKKDKESSGGSTKRK------KKDKNAPKKPMSAYMMWFNDMR 569
+KE+ + + + ++ D + + + + A KKP+++ + +
Sbjct: 1256 EKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETVKAVPARRAAARKKPLASVSVISDSDD 1315
Query: 570 EKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYD---------EKVNKAKEDYNEALKA 620
+ + KKGG K ++ +KA V ++ E LK
Sbjct: 1316 DDDDFAVEVSLAERLKKKGGR--KPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKP 1373
Query: 621 YKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIE-SNGSSSDSDDDKKAS 679
+ G S + K K + K++ S K E E +GSSS D+ S
Sbjct: 1374 AEAIG---ISPEKKVRKMRASPFNKKSGSVLGRAATNKETESSENVSGSSSSEKDEIDVS 1430
Query: 680 SKRKRDSDEDSKAKTKKKKEKSESESDS 707
+K + + +SES DS
Sbjct: 1431 AKPRPQRANRKQTTYVLSDSESESADDS 1458
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 43.4 bits (102), Expect = 5e-04
Identities = 61/343 (17%), Positives = 129/343 (37%), Gaps = 37/343 (10%)
Query: 438 KEDFGSSDDEKEPDAY-----LARVKREA--AERDDEDDEDEDESTDE---DFNPDQAES 487
+ED + D E +A + V +++ +E ++ D E+E++ DE D D + S
Sbjct: 3843 EEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNS 3902
Query: 488 ------DVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSG 541
D E D T S+E S A+ + KE K + ++ K +++ S
Sbjct: 3903 LNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDD 3962
Query: 542 GSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKA 601
+ + D+ E +K D +K G+ SK+
Sbjct: 3963 VGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLD----------EKEGD-----VSKDSD 4007
Query: 602 EYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKE 661
D + A E+ EA E QD D + + + +++ S + +++
Sbjct: 4008 LEDMDMEAADENKEEADAEKDEPM--QDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNED 4065
Query: 662 FIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPK 721
E N ++ + S + + E + + K +++S +E ++++ K
Sbjct: 4066 GFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEENTDKGI 4125
Query: 722 GDKSKSKSKKED----EIDDEEYDEPVESTPPSSDAGSDASSS 760
+++ +++ D E+++ E T +A S+A
Sbjct: 4126 VGENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSEADRQ 4168
Score = 38.8 bits (90), Expect = 0.014
Identities = 57/330 (17%), Positives = 121/330 (36%), Gaps = 37/330 (11%)
Query: 443 SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDS 502
+ + D+ L + EAA DE+++E+ E P Q E D EE ++
Sbjct: 3997 EKEGDVSKDSDLEDMDMEAA------DENKEEADAEKDEPMQDE-DPLEENNTLDEDIQQ 4049
Query: 503 DEDSDASEGTKKKKKEKSRKTITISEKPRK--SKKDKESSGGSTKRKKKDKNAPKKPMSA 560
D+ SD +E +K ++ + + +E+ + K D+E G + N PK +
Sbjct: 4050 DDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKS 4109
Query: 561 YMMWFNDMREKIKKDNP----GISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNE 616
+ ++N E+ ++ G + E + E + KE +E
Sbjct: 4110 -----TFASAEADEENTDKGIVGENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSE 4164
Query: 617 ALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEF--IESNGS------ 668
A + Y+ G D + + + E + + S EF ++ +
Sbjct: 4165 ADRQYQSLG-----DHLREWQQANRIHEWEDLTESQS-QAFDDSEFMHVKEDEEEDLQAL 4218
Query: 669 -SSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKS 727
+++ D K + + DS T +++++ D + + S K G+ +
Sbjct: 4219 GNAEKDQIKSIDRDESANQNPDSMNSTNIAEDEADEVGDKQLQDGQDISDIKQTGEDTLP 4278
Query: 728 ----KSKKEDEIDDEEYDEPVESTPPSSDA 753
+ +++ + DE +E P +
Sbjct: 4279 TEFGSINQSEKVFELSEDEDIEDELPDYNV 4308
Score = 32.7 bits (74), Expect = 0.98
Identities = 17/100 (17%), Positives = 43/100 (43%), Gaps = 8/100 (8%)
Query: 665 SNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDK 724
+N + + D S R+ +SD + K E S+S+ + +++ P+
Sbjct: 3841 ANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLS 3900
Query: 725 SKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
+ ++++ +DEP E ++ S+ S+ +++
Sbjct: 3901 --------NSLNEKLWDEPNEEDLLETEQKSNEQSAANNE 3932
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 43.1 bits (101), Expect = 7e-04
Identities = 49/274 (17%), Positives = 112/274 (40%), Gaps = 27/274 (9%)
Query: 502 SDEDSDASEGT-KKKKKEKSRKTITISEKPRKSKKDKESSG-----------GSTKRKKK 549
S D +E T ++ +E R T E +S + E G G ++K
Sbjct: 631 SKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERK 690
Query: 550 DKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEK-------AE 602
+ + + A E + ++ G + E + GE+ +E AE
Sbjct: 691 GEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAE 750
Query: 603 YDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSK-- 660
+V + + E+ G +D D+G+ + K + + H+ +++
Sbjct: 751 GKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAG 810
Query: 661 ---EFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSK 717
E + + +D + K + +++ +++ +AK +K SD G+SE+E++ +
Sbjct: 811 EKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEE 870
Query: 718 SKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSS 751
+ ++ + + ++E+E ++EE +EP+ P +
Sbjct: 871 EE---EEEEEEEEEEEEEEEEENEEPLSLEWPET 901
Score = 32.7 bits (74), Expect = 0.92
Identities = 23/161 (14%), Positives = 61/161 (37%), Gaps = 13/161 (8%)
Query: 610 AKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPH------KVKSKEFI 663
A D ++ A E G + ++G+ + +E+ A + +S+ I
Sbjct: 626 ALGDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEI 685
Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGD 723
+ + + + + + +++ + + ++E D GE E ++ +
Sbjct: 686 PAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEG 745
Query: 724 KSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
+ +++ K E E + + + E ++A ED+D
Sbjct: 746 EGEAEGKHEVETEGDRKETEHE-------GETEAEGKEDED 779
Score = 29.6 bits (66), Expect = 7.9
Identities = 28/180 (15%), Positives = 70/180 (38%), Gaps = 8/180 (4%)
Query: 586 KKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKK 645
+ GGE + ++ K E N+++ + K +E G ++ + + + +
Sbjct: 662 ESGGEAEQEGETETKGE-----NESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVE 716
Query: 646 ETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESES 705
T + + + G + + + +A K + +++ D K + + ++E +
Sbjct: 717 HEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKE 776
Query: 706 DSGESEKEKKSKSKPKGDK---SKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSED 762
D E E + + KGD+ K + + E E +++ E T D + ++
Sbjct: 777 DEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQE 836
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 42.2 bits (99), Expect = 8e-04
Identities = 29/148 (19%), Positives = 54/148 (36%), Gaps = 4/148 (2%)
Query: 606 KVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIES 665
K ++K +EA+K + GG KT +K+P + + K+ +
Sbjct: 65 KCAESKLSSDEAVK--RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKK--KK 120
Query: 666 NGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKS 725
+ D+K + K K K K+KK + + + + + ++KS+PK
Sbjct: 121 KEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPK 180
Query: 726 KSKSKKEDEIDDEEYDEPVESTPPSSDA 753
K K+ E +EE
Sbjct: 181 KKPPNKKKEPPEEEKQRQAAREAVKGKP 208
Score = 40.6 bits (95), Expect = 0.003
Identities = 32/152 (21%), Positives = 58/152 (38%), Gaps = 4/152 (2%)
Query: 595 VSSKEKAEYDEKVNKAKEDYNEAL-KAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVS 653
S E ++V K A K KE ++ K + K KKK+
Sbjct: 68 ESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEE 127
Query: 654 PHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKK--EKSESESDSGESE 711
P K KE + + + +K+ + RD +E+ K + + K K + +
Sbjct: 128 PKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKK 187
Query: 712 KEKKSKSKPKGDKSKS-KSKKEDEIDDEEYDE 742
KE + K + ++ K K E+ +EE ++
Sbjct: 188 KEPPEEEKQRQAAREAVKGKPEEPDVNEEREK 219
Score = 39.1 bits (91), Expect = 0.009
Identities = 22/129 (17%), Positives = 48/129 (37%)
Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
++ S A K+ + P N S + + +K+ + K+ E K +
Sbjct: 68 ESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEE 127
Query: 694 TKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDA 753
K +K K E++ EKEK+ + K + + + + KK + + + + P +
Sbjct: 128 PKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKK 187
Query: 754 GSDASSSED 762
+
Sbjct: 188 KEPPEEEKQ 196
Score = 35.6 bits (82), Expect = 0.088
Identities = 39/271 (14%), Positives = 92/271 (33%), Gaps = 13/271 (4%)
Query: 491 EEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSK-KDKESSGGSTKRKKK 549
E + + + +++ E KKK+K K ++ K K KE K+ ++
Sbjct: 97 EPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156
Query: 550 DKNAPKKPMSAYMMWFN--DMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAE-YDEK 606
++ ++ + + K K N E K+ + V K + +E+
Sbjct: 157 PRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEE 216
Query: 607 VNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESN 666
K ++D + + + S+ + KK S +++ + +
Sbjct: 217 REKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETR-------- 268
Query: 667 GSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSK 726
SS + + +S R + S + ++ E + +++ K S + K
Sbjct: 269 -ESSKRTETRPRTSLRPPSARPASARPAPPRVKRKEIVTVLQDAQGVGKIVSNVILEGKK 327
Query: 727 SKSKKEDEIDDEEYDEPVESTPPSSDAGSDA 757
S+ + ++ E + + D D
Sbjct: 328 SEDEDDENFVVEAAAQAPDIVAGGEDEAEDG 358
Score = 32.6 bits (74), Expect = 0.99
Identities = 33/155 (21%), Positives = 59/155 (38%), Gaps = 7/155 (4%)
Query: 593 KTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAV 652
K ++K K + K KE+ + KE + + K + +K + +
Sbjct: 86 KGPAAKTKPAKEPKNESGKEE-----EKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140
Query: 653 SPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK--TKKKKEKSESESDSGES 710
P K K KE + D +++KK R + + K KKKE E E +
Sbjct: 141 RPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAA 200
Query: 711 EKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVE 745
+ K K + + + +++D D E P+E
Sbjct: 201 REAVKGKPEEPDVNEEREKEEDDGKDRETTTSPME 235
Score = 32.2 bits (73), Expect = 1.0
Identities = 42/257 (16%), Positives = 78/257 (30%), Gaps = 33/257 (12%)
Query: 502 SDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAY 561
S SD + +K K T K K++ KE + K++ K +KP
Sbjct: 69 SKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKE-- 126
Query: 562 MMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAY 621
K +K + + E +++ K E+ + +
Sbjct: 127 -------EPKDRKPKE---------------EAKEKRPPKEKEKEKEKKVEEPRDREEE- 163
Query: 622 KESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSK 681
K+ + K KKP KK+ K + G + D +++ +
Sbjct: 164 KKRERVRAKSRPKKPPKKKPPNKKKEPPE---EEKQRQAAREAVKGKPEEPDVNEEREKE 220
Query: 682 RKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYD 741
+D K + E ES + S S K D S S + E + +
Sbjct: 221 E-----DDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTE 275
Query: 742 EPVESTPPSSDAGSDAS 758
++ A ++
Sbjct: 276 TRPRTSLRPPSARPASA 292
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 42.4 bits (100), Expect = 9e-04
Identities = 53/290 (18%), Positives = 93/290 (32%), Gaps = 45/290 (15%)
Query: 457 VKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDAS------- 509
K + E +DED + E+E D+D + D E+ D++ ++
Sbjct: 320 GKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFM 379
Query: 510 EGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMR 569
+ + +KKE++ I E+ R+ + +E S + KN ++
Sbjct: 380 QRAEARKKEENDAEI---EELRRELEGEEESDEEENEEPSKKNVGRRKFGP--------- 427
Query: 570 EKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQD 629
+ S K K E + + KE E +D
Sbjct: 428 ------------------ENGEKEAESKKLKKENKNEFKEKKES--------DEEEELED 461
Query: 630 SDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDED 689
++ K K K+ + + +E +SS KK SK+K S D
Sbjct: 462 EEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLD 521
Query: 690 SKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEE 739
A K + E + + D K E++ DDEE
Sbjct: 522 KAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDEE 571
Score = 33.5 bits (77), Expect = 0.47
Identities = 22/147 (14%), Positives = 53/147 (36%), Gaps = 3/147 (2%)
Query: 413 GKLFDFIREKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDE 472
+ + E++ + + + + G + E + + + + + DE++E E
Sbjct: 401 LEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELE 460
Query: 473 DESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRK 532
DE E+ ++ + + + + + +E + + K S+K
Sbjct: 461 DE---EEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSS 517
Query: 533 SKKDKESSGGSTKRKKKDKNAPKKPMS 559
SK DK ++ S K K K+
Sbjct: 518 SKLDKAANKISKAAVKVKKKKKKEKSI 544
Score = 32.7 bits (75), Expect = 0.96
Identities = 36/191 (18%), Positives = 74/191 (38%), Gaps = 15/191 (7%)
Query: 582 TEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKP 641
E ++G EL + + K +E DE + E+ ++ D + KK
Sbjct: 305 EEQLRQGEELRRKIEGKSVSEEDEDEDSDSEE------EDEDDDEDDDDGENPWMLRKK- 357
Query: 642 AKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDK-----KASSKRKRDSDEDS---KAK 693
K + + S +F++ + ++D + + + +SDE+ +K
Sbjct: 358 LGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSK 417
Query: 694 TKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDA 753
+ K E+ E+E +K K K K +S +E+E++DEE + +
Sbjct: 418 KNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKR 477
Query: 754 GSDASSSEDDD 764
A E+++
Sbjct: 478 SEKAQKEEEEE 488
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 41.2 bits (97), Expect = 9e-04
Identities = 27/113 (23%), Positives = 43/113 (38%), Gaps = 15/113 (13%)
Query: 460 EAAERDDEDDED----EDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEG---- 511
E +DE E+E DE+F ++ E + + D + + D E D EG
Sbjct: 19 EEELEEDEFFWTYLLFEEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKEL 78
Query: 512 -------TKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKP 557
KK+ K K+ K T +K + K + + KKK + P
Sbjct: 79 QREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAP 131
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 42.3 bits (100), Expect = 0.001
Identities = 24/100 (24%), Positives = 45/100 (45%), Gaps = 8/100 (8%)
Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDED 479
+ K + +P+ ++FG E++ + + + D+EDD+D+ E +ED
Sbjct: 293 EDSKESADDLDDEFEPDDDDNFGLGQGEEDEE----EEEDGVDDEDEEDDDDDLEEEEED 348
Query: 480 FNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEK 519
+ E D +E + + DE+ + E KKKKK
Sbjct: 349 VDLSDEEEDEEDEDSDD----EDDEEEEEEEKEKKKKKSA 384
Score = 40.4 bits (95), Expect = 0.004
Identities = 59/301 (19%), Positives = 102/301 (33%), Gaps = 65/301 (21%)
Query: 462 AERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSR 521
+E D +DD +D+ D+D D A D A + D++ D E ++KK S+
Sbjct: 124 SEIDKDDDVRDDDDFDDDDLGDLASDDRAAHFGG-----GEDDEEDEEEQPERKK---SK 175
Query: 522 KTITISEKPRKSK-------KDKE-----------------SSGGSTKRKKKDKNAPKKP 557
K + + E KSK K KE S + K K P++
Sbjct: 176 KEV-MKEVIAKSKFYKAERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMTPEEK 234
Query: 558 MSAYMMWFN----DMR----EKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNK 609
Y D R ++ K + E+AK+ E +K E E++ +
Sbjct: 235 DDEYDQRVRELTFDRRAQPTDRTKTEE------ELAKEEAE------RLKKLE-AERLRR 281
Query: 610 AKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSS 669
+ + + + D DD +E +
Sbjct: 282 MRGEEEDDEEEEDSKESADDLDDEFEPDDDD--NFGLGQGEEDEE----EEEDGVDDEDE 335
Query: 670 SDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKS 729
D DDD + + D D + + ++++ + D E E+E+K K K K +S
Sbjct: 336 EDDDDDLEEE-----EEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390
Query: 730 K 730
Sbjct: 391 L 391
Score = 40.4 bits (95), Expect = 0.004
Identities = 32/115 (27%), Positives = 58/115 (50%), Gaps = 4/115 (3%)
Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDA----YLARVKREAAERDDEDDEDEDES 475
R +++R + E+ + KE DDE EPD L + + + E +D D++++E
Sbjct: 278 RLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEED 337
Query: 476 TDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKP 530
D+D ++ + D+++E + DED + E +K+KK+K T SE P
Sbjct: 338 DDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELP 392
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 41.4 bits (97), Expect = 0.002
Identities = 15/78 (19%), Positives = 34/78 (43%)
Query: 687 DEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVES 746
E K K KK+ +S+ + +K + + + + + ++ D+E +E +S
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDS 201
Query: 747 TPPSSDAGSDASSSEDDD 764
S G SS++++
Sbjct: 202 EDYSQYDGMLVDSSDEEE 219
Score = 35.2 bits (81), Expect = 0.14
Identities = 32/128 (25%), Positives = 53/128 (41%), Gaps = 3/128 (2%)
Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDD--EDEDESTD 477
+K K T K +K KE D+E+ ++ E+AE D +D+ ED D
Sbjct: 146 AKKGKAKKKTKKSKKKEAKESSDKDDEEES-ESEDESKSEESAEDDSDDEEEEDSDSEDY 204
Query: 478 EDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDK 537
++ +S EE + P+ +++ S++ + +SR E SKK K
Sbjct: 205 SQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSPPSKKPK 264
Query: 538 ESSGGSTK 545
E ST
Sbjct: 265 EKKTSSTF 272
Score = 34.0 bits (78), Expect = 0.29
Identities = 29/166 (17%), Positives = 62/166 (37%), Gaps = 20/166 (12%)
Query: 593 KTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAV 652
K +K++ E + +D E ++ ES + ++D + ++ + ++ +
Sbjct: 150 KAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDG 209
Query: 653 SPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEK 712
SS + + ++ S D+ E ++ + +S S SDS ES
Sbjct: 210 -----------MLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSP 258
Query: 713 EKKSKSKPKGDKSK---------SKSKKEDEIDDEEYDEPVESTPP 749
K + K + S S+ ED+ D++ + V P
Sbjct: 259 PSKKPKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDIDPDQVVKKPV 304
Score = 33.7 bits (77), Expect = 0.33
Identities = 15/109 (13%), Positives = 33/109 (30%)
Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKK 715
K K ++ S + + ED + ++ S+ E + ++
Sbjct: 146 AKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYS 205
Query: 716 SKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
D S + +E + D + S S++ S D +
Sbjct: 206 QYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSE 254
Score = 30.2 bits (68), Expect = 4.5
Identities = 24/120 (20%), Positives = 45/120 (37%)
Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
+T K AKKK S + K+ E + S +S ++ A + +EDS ++
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202
Query: 694 TKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDA 753
+ + +S E +E S + + + + EI + E + P S
Sbjct: 203 DYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSPPSKK 262
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 39.7 bits (93), Expect = 0.002
Identities = 20/68 (29%), Positives = 33/68 (48%)
Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGD 723
GS S++ + + + K E+ + K KKKK++ + E + +KEK + K
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185
Query: 724 KSKSKSKK 731
K K K KK
Sbjct: 186 KKKKKKKK 193
Score = 34.3 bits (79), Expect = 0.12
Identities = 19/66 (28%), Positives = 36/66 (54%)
Query: 668 SSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKS 727
S+S+ +K ++ + E + + K+KK+K E + + E + +K+ +PKG K K
Sbjct: 128 LGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKK 187
Query: 728 KSKKED 733
K KK+
Sbjct: 188 KKKKKK 193
Score = 33.9 bits (78), Expect = 0.20
Identities = 20/70 (28%), Positives = 39/70 (55%)
Query: 482 PDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSG 541
PD S++ E +++ T + + +A ++KK++K +K + +K +K KK+K
Sbjct: 121 PDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEP 180
Query: 542 GSTKRKKKDK 551
+K+KKK K
Sbjct: 181 KGSKKKKKKK 190
Score = 32.4 bits (74), Expect = 0.55
Identities = 15/63 (23%), Positives = 26/63 (41%)
Query: 669 SSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSK 728
D S++++ AK +K+ E E E + +KE K + K K DK +
Sbjct: 118 YGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177
Query: 729 SKK 731
+
Sbjct: 178 VEP 180
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 41.1 bits (97), Expect = 0.002
Identities = 18/78 (23%), Positives = 38/78 (48%)
Query: 686 SDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVE 745
+ + K KK EK+E + + + EK+KK+ + K ++ + + K++ E + EE +E E
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Query: 746 STPPSSDAGSDASSSEDD 763
+ ++ D
Sbjct: 464 EEKEEEEEKKKKQATLFD 481
Score = 39.1 bits (92), Expect = 0.007
Identities = 12/79 (15%), Positives = 39/79 (49%)
Query: 672 SDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
++++ + + K+ + + K K +K++ E + + + K K + + ++ + K ++
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 732 EDEIDDEEYDEPVESTPPS 750
++E ++E +E E
Sbjct: 455 KEEEEEEAEEEKEEEEEKK 473
Score = 37.2 bits (87), Expect = 0.031
Identities = 12/77 (15%), Positives = 38/77 (49%)
Query: 658 KSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSK 717
+ EF+ + ++ +++KR+ ++ K K +K E E + + +KE++ +
Sbjct: 397 EEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456
Query: 718 SKPKGDKSKSKSKKEDE 734
+ + + + + ++E +
Sbjct: 457 EEEEEAEEEKEEEEEKK 473
Score = 33.7 bits (78), Expect = 0.42
Identities = 20/75 (26%), Positives = 29/75 (38%), Gaps = 5/75 (6%)
Query: 690 SKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPP 749
SK TKK K+ E E EK++K K G K K+E+E +++E E +
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAG-----KKKEEEEEEEKEKKEEEKEEEE 459
Query: 750 SSDAGSDASSSEDDD 764
E
Sbjct: 460 EEAEEEKEEEEEKKK 474
Score = 32.6 bits (75), Expect = 0.85
Identities = 23/83 (27%), Positives = 38/83 (45%), Gaps = 12/83 (14%)
Query: 637 KSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKK 696
SKK KK K + +E E KK + K+ +E+ + K KK
Sbjct: 404 GSKKATKKI--KKIVEKAEKKREEEKKE----------KKKKAFAGKKKEEEEEEEKEKK 451
Query: 697 KKEKSESESDSGESEKEKKSKSK 719
++EK E E ++ E ++E++ K K
Sbjct: 452 EEEKEEEEEEAEEEKEEEEEKKK 474
Score = 31.4 bits (72), Expect = 1.9
Identities = 13/70 (18%), Positives = 34/70 (48%)
Query: 670 SDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKS 729
KK ++ + + K +KKK+ + E E+EK+ K + K ++ +
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Query: 730 KKEDEIDDEE 739
++++E ++++
Sbjct: 464 EEKEEEEEKK 473
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 39.7 bits (93), Expect = 0.003
Identities = 15/82 (18%), Positives = 39/82 (47%), Gaps = 1/82 (1%)
Query: 672 SDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
S +A++ + D + + ++ K E + + +KE K ++ + ++S+ ++++
Sbjct: 38 SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97
Query: 732 EDEIDDEEYDEPVESTPPSSDA 753
EDE +E +E S+
Sbjct: 98 EDEESSDE-NEKETEEKTESNV 118
Score = 37.4 bits (87), Expect = 0.014
Identities = 16/94 (17%), Positives = 41/94 (43%), Gaps = 3/94 (3%)
Query: 630 SDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDED 689
S + + + KK +VK +E ++S+ +DK + K +S+E+
Sbjct: 38 SSPSDQAAADEQEAKKSDDQETAEIEEVKEEE---KEAANSEDKEDKGDAEKEDEESEEE 94
Query: 690 SKAKTKKKKEKSESESDSGESEKEKKSKSKPKGD 723
++ + ++ +++E E++ +K + P
Sbjct: 95 NEEEDEESSDENEKETEEKTESNVEKEITNPSWK 128
Score = 36.6 bits (85), Expect = 0.024
Identities = 16/90 (17%), Positives = 44/90 (48%), Gaps = 4/90 (4%)
Query: 658 KSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSK 717
S + + SDD + A + ++ ++++ A ++ K++K ++E + ESE+E +
Sbjct: 40 PSDQAAADEQEAKKSDDQETAEIEEVKEEEKEA-ANSEDKEDKGDAEKEDEESEEENE-- 96
Query: 718 SKPKGDKSKSKSKKEDEIDDEEYDEPVEST 747
+ + S K+ +E + ++ + +
Sbjct: 97 -EEDEESSDENEKETEEKTESNVEKEITNP 125
Score = 29.3 bits (66), Expect = 5.9
Identities = 18/78 (23%), Positives = 32/78 (41%)
Query: 443 SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDS 502
++ DE+E + E E +E+ E + ED + E + +EE + S
Sbjct: 44 AAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESS 103
Query: 503 DEDSDASEGTKKKKKEKS 520
DE+ +E + EK
Sbjct: 104 DENEKETEEKTESNVEKE 121
Score = 29.3 bits (66), Expect = 6.6
Identities = 15/86 (17%), Positives = 37/86 (43%), Gaps = 3/86 (3%)
Query: 427 KNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAE 486
+ +E+ +++ E + D A + E +E ++E++++E +E ++ E
Sbjct: 56 DQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTE 115
Query: 487 SDVAEEY---DSNPTPTDSDEDSDAS 509
S+V +E P T+ +
Sbjct: 116 SNVEKEITNPSWKPVGTEQTGPHAMT 141
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 39.9 bits (93), Expect = 0.005
Identities = 34/168 (20%), Positives = 64/168 (38%), Gaps = 11/168 (6%)
Query: 582 TEIAKKGGELWKTVSSKEKAEYDE----KVNKAKEDYNEALKAYKESGGGQDSDDGKTSK 637
TE K GG K + K+ DE + +K ED +E K+ ++ K
Sbjct: 200 TEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDK 259
Query: 638 SKKPAKKKETASTAVSPHKVKSKEFI---ESNGSSSDSDDDKKASSKRKRDSDEDSKAKT 694
K + + +E +S+ S +D ++ + S E + +
Sbjct: 260 KGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDED 319
Query: 695 KKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDE 742
++ E+ ++E + G S+K K K K K K +D+ D + +
Sbjct: 320 SEESEEEKNEEEGGLSKKGK----KLKKLKGKKNGLDKDDSDSGDDSD 363
Score = 37.2 bits (86), Expect = 0.036
Identities = 27/181 (14%), Positives = 69/181 (38%), Gaps = 11/181 (6%)
Query: 592 WKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTA 651
+KT++++E E ++ K + + ++G D+ + ++ +K +
Sbjct: 156 YKTLTAEEAEEEMKRRKKTANGFQLMMMKAAKNGPAAFGDEDEETEGEKGGGGR---GKD 212
Query: 652 VSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKS--------ES 703
+ ++ + + + S +D + SK+K+ +K K K+ E
Sbjct: 213 LKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEY 272
Query: 704 ESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDD 763
+SD G+ E ++ ++ED++ E +P S+ + + E+
Sbjct: 273 DSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEG 332
Query: 764 D 764
Sbjct: 333 G 333
Score = 36.9 bits (85), Expect = 0.039
Identities = 61/310 (19%), Positives = 99/310 (31%), Gaps = 48/310 (15%)
Query: 456 RVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKK 515
K A DED+E E E D D+ + + + +D + E +KKK
Sbjct: 185 AAKNGPAAFGDEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKK 244
Query: 516 KKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKD 575
KK+ + K +K D + ++ D +A + RE+
Sbjct: 245 KKKLA--------KNKKKLDDDKK-----GKRGGDDDADEYDSDDGDDE---GREEDYIS 288
Query: 576 NPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKT 635
+ S + +E E K + + +E + +E + + G +
Sbjct: 289 DSSASGND-----------PEEREDKLSPEIPAKPEIEQDEDSEESEEE--KNEEEGGLS 335
Query: 636 SKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTK 695
K KK K K + + SDS DD S DS AK +
Sbjct: 336 KKGKKLKKLKGKKN--------------GLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQ 381
Query: 696 KKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEP-----VESTPPS 750
K+ +K E + S K K K K +E+ P E+ P S
Sbjct: 382 KEPKKEEPVDSNPSSPGNSGPARPSPESKDKGKRKAANEVSKSPASVPAKKLKTENAPKS 441
Query: 751 SDAGSDASSS 760
S S +
Sbjct: 442 SSGKSTPQTF 451
Score = 30.7 bits (69), Expect = 3.6
Identities = 39/191 (20%), Positives = 62/191 (32%), Gaps = 14/191 (7%)
Query: 587 KGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGK--------TSKS 638
KGGE SK+K + K K +D + + + DSDDG S S
Sbjct: 231 KGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDS 290
Query: 639 KKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDK-----KASSKRKRDSDEDSKAK 693
E +SP +K IE + S +S+++K S K K+ K
Sbjct: 291 SASGNDPEEREDKLSPEIP-AKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKN 349
Query: 694 TKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDA 753
K + + + S S K K K+E + PS ++
Sbjct: 350 GLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQKEPKKEEPVDSNPSSPGNSGPARPSPES 409
Query: 754 GSDASSSEDDD 764
++
Sbjct: 410 KDKGKRKAANE 420
Score = 29.2 bits (65), Expect = 8.8
Identities = 27/120 (22%), Positives = 48/120 (40%), Gaps = 4/120 (3%)
Query: 431 KGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVA 490
K +K K DD+K +A E D +D +DE D + + +D
Sbjct: 243 KKKKKLAKNKKKLDDDKKGKRG----GDDDADEYDSDDGDDEGREEDYISDSSASGNDPE 298
Query: 491 EEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKD 550
E D + + + E +++ ++EK+ + +S+K +K KK K G K
Sbjct: 299 EREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDS 358
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 39.8 bits (93), Expect = 0.005
Identities = 32/125 (25%), Positives = 46/125 (36%), Gaps = 5/125 (4%)
Query: 631 DDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDS 690
D K+ KK AS + EF++ S + KK S +
Sbjct: 6 DSLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEA----SRPGT 61
Query: 691 KAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPS 750
KK + +ES + S E + K KS S KSK +EE +EP E+ P
Sbjct: 62 PRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEE-EEPEETPDPI 120
Query: 751 SDAGS 755
+ G
Sbjct: 121 ASIGG 125
Score = 38.2 bits (89), Expect = 0.013
Identities = 27/111 (24%), Positives = 40/111 (36%), Gaps = 11/111 (9%)
Query: 661 EFIESNGSSSDSDDDKK--ASSKRKRDSDEDS-------KAKTKKKKEKSESESDSGESE 711
EF++S K +SK + DED + K K K E+ +
Sbjct: 3 EFLDSLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTP 62
Query: 712 KEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSED 762
+ K SKP + S +K + E S+ P S A S S E+
Sbjct: 63 RNPKKSSKPTESSAASSEEKPAKPRKSA--ESTRSSHPKSKAPSTESEEEE 111
Score = 30.5 bits (69), Expect = 3.6
Identities = 30/119 (25%), Positives = 39/119 (32%), Gaps = 11/119 (9%)
Query: 439 EDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPT 498
E S DEK P K +A++ EDDED E DE E P
Sbjct: 3 EFLDSLPDEKAPTK--KPKKGDASKDSTEDDEDILEFLDEL---------EQSEKAKPPK 51
Query: 499 PTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKP 557
KK K + EKP K +K ES+ S + K ++
Sbjct: 52 KPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEE 110
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 39.4 bits (92), Expect = 0.006
Identities = 31/129 (24%), Positives = 47/129 (36%), Gaps = 6/129 (4%)
Query: 632 DGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDED-- 689
+GK SK P S K + + + K AS KR +
Sbjct: 159 NGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIM 218
Query: 690 ---SKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVES 746
K KTK+KKEK E+ + + E E++S + ++ +DE+ DEP S
Sbjct: 219 SSFFKKKTKEKKEKKEASESTVKEESEEESGKRDV-ILEDESAEPTGLDEDEDEDEPKPS 277
Query: 747 TPPSSDAGS 755
S
Sbjct: 278 GERSDSEEE 286
Score = 38.3 bits (89), Expect = 0.014
Identities = 34/115 (29%), Positives = 50/115 (43%), Gaps = 6/115 (5%)
Query: 637 KSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKK 696
KK +KKE + S K +S+E ES +D+ + D DED K
Sbjct: 222 FKKKTKEKKEKKEASESTVKEESEE--ESGKRDVILEDESAEPTGLDEDEDEDEP---KP 276
Query: 697 KKEKSESESDSGESEKEK-KSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPS 750
E+S+SE ++ E EKEK K K D+ + + + E +E E PP
Sbjct: 277 SGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPP 331
Score = 32.9 bits (75), Expect = 0.61
Identities = 19/66 (28%), Positives = 31/66 (46%)
Query: 457 VKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKK 516
K + E+ E+E D + AE +E + P S E SD+ E T++K+
Sbjct: 232 KKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKE 291
Query: 517 KEKSRK 522
KEK ++
Sbjct: 292 KEKRKR 297
Score = 32.9 bits (75), Expect = 0.61
Identities = 28/153 (18%), Positives = 61/153 (39%), Gaps = 10/153 (6%)
Query: 622 KESGGGQDSDDGKTSKSKKPAKKKETASTAVSPH----------KVKSKEFIESNGSSSD 671
K + QD+ T++ + + AS +P K K+KE E +S
Sbjct: 179 KSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASES 238
Query: 672 SDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
+ ++ KRD + ++ ++ E E + S + S+ + + + + + +
Sbjct: 239 TVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRL 298
Query: 732 EDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
+ ++DE+ DE +E P S ++ E
Sbjct: 299 KKMMEDEDEDEEMEIVPESPVEEEESEEPEPPP 331
Score = 32.5 bits (74), Expect = 0.78
Identities = 24/121 (19%), Positives = 47/121 (38%), Gaps = 11/121 (9%)
Query: 633 GKTSKSKKPAKKKETA-STAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSK 691
K K+ + ++ +VK K S+ + D K ++ K + K
Sbjct: 151 SPALKPTANGKRPSSKPPKSIMSPEVKVK-------SAKKTQDTSKETTTEKTEGKTSVK 203
Query: 692 AKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSS 751
A + K+ +S S +K+ K K + K S+S ++E ++E V S+
Sbjct: 204 AASLKRNPPKKSNIMSSFFKKKTKEKKE---KKEASESTVKEESEEESGKRDVILEDESA 260
Query: 752 D 752
+
Sbjct: 261 E 261
Score = 32.5 bits (74), Expect = 0.81
Identities = 43/256 (16%), Positives = 93/256 (36%), Gaps = 30/256 (11%)
Query: 482 PDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSG 541
++ + T + S + K+ +KS + +K K KK+K+ +
Sbjct: 177 KVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEAS 236
Query: 542 GSTKRKKKDKNAPKKPM------SAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTV 595
ST +++ ++ + K+ + + D E K + S +E +
Sbjct: 237 ESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEE------- 289
Query: 596 SSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPH 655
KEK + +K+ + +++ E + ++S++ + K ++KE + +
Sbjct: 290 KEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGG 349
Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKK 715
+ + + + K+K DE+ TKK E ES S+ K
Sbjct: 350 RRRGRRRVM----------------KKKTFKDEEGYLVTKKVYE-WESFSEDEAEPPPTK 392
Query: 716 SKSKPKGDKSKSKSKK 731
K K + +KK
Sbjct: 393 PKPKVSTPAVPAAAKK 408
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 39.3 bits (92), Expect = 0.006
Identities = 28/110 (25%), Positives = 45/110 (40%), Gaps = 4/110 (3%)
Query: 632 DGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSK 691
G T KSK ++ V ++ S E N ++ S K++ ++E K
Sbjct: 13 SGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTS----TKKDKKEDKNNESKK 68
Query: 692 AKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYD 741
KKKK+K E + E E + K+ K K+K K K +D +
Sbjct: 69 KSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNA 118
Score = 33.2 bits (76), Expect = 0.54
Identities = 27/89 (30%), Positives = 42/89 (47%), Gaps = 6/89 (6%)
Query: 491 EEYDSNPTPTDSDEDSDASEGTKKK-KKEKSRKTITISEKPRKSKKDKESSGGSTKRKKK 549
EE T T D+ D + +KKK +K+K +K +K KS+ + + + K+ KK
Sbjct: 45 EENKVATTSTKKDKKEDKNNESKKKSEKKKKKK---KEKKEPKSEGETKLGFKTPKKSKK 101
Query: 550 DKNAPKKPMSAYMMW--FNDMREKIKKDN 576
K P KP + FN + E +K N
Sbjct: 102 TKKKPPKPKPNEDVDNAFNKIAELAEKSN 130
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 39.2 bits (91), Expect = 0.008
Identities = 23/113 (20%), Positives = 41/113 (36%), Gaps = 9/113 (7%)
Query: 452 AYLARVKRE------AAERDDEDDEDEDESTDEDFNP-DQAESDVAEEYDSNPTPTDSDE 504
ARV RE AE +D E EDE +D P + + + E + E
Sbjct: 184 QAFARVVREMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQE 243
Query: 505 DSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKP 557
++ ++E + + P +S D E+ G + + + +P
Sbjct: 244 SEATDRESESGEEEMVQ--SDQDDLPDESDDDSETPGEGARPARPFTSTGGEP 294
Score = 29.1 bits (65), Expect = 9.1
Identities = 23/97 (23%), Positives = 35/97 (36%), Gaps = 16/97 (16%)
Query: 430 GKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDE---------DDEDEDESTDEDF 480
G ++P E + E E A + EA +R+ E D +D + +D+D
Sbjct: 215 GDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDDS 274
Query: 481 -------NPDQAESDVAEEYDSNPTPTDSDEDSDASE 510
P + + E D T DE DA E
Sbjct: 275 ETPGEGARPARPFTSTGGEPDYKVFTTAFDETVDAEE 311
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 37.7 bits (88), Expect = 0.008
Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 5/84 (5%)
Query: 674 DDKKASSKRKRDSDEDSKAKTKK-----KKEKSESESDSGESEKEKKSKSKPKGDKSKSK 728
D + +K+K+ + K KK +K K + + + +K+K K K DKS+ K
Sbjct: 54 DAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK 113
Query: 729 SKKEDEIDDEEYDEPVESTPPSSD 752
+KE E E+ + T +
Sbjct: 114 DEKEAEDKLEDLTKSYSETLSTLS 137
Score = 33.5 bits (77), Expect = 0.24
Identities = 13/59 (22%), Positives = 28/59 (47%)
Query: 681 KRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEE 739
++ + + SK K K K+K + + D +K++K D +KS S+ + + +
Sbjct: 82 QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELK 140
Score = 32.7 bits (75), Expect = 0.36
Identities = 18/71 (25%), Positives = 28/71 (39%), Gaps = 1/71 (1%)
Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKK 715
KVK KE+ E KK K K D+ K++KK EK + ++ +
Sbjct: 73 KVK-KEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSE 131
Query: 716 SKSKPKGDKSK 726
+ S K +
Sbjct: 132 TLSTLSELKPR 142
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 38.4 bits (89), Expect = 0.009
Identities = 28/101 (27%), Positives = 47/101 (46%), Gaps = 9/101 (8%)
Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
KT++ KKPA KK T S + K +K ++ + KKA+ +K S + + K
Sbjct: 11 KTTEEKKPAAKKATTSKETAKTKKTAK--------TTSTKAAKKAAKVKKTKSVKTTTKK 62
Query: 694 TKKKKEKSES-ESDSGESEKEKKSKSKPKGDKSKSKSKKED 733
K EK+ES + +S + KK + ++ +K K
Sbjct: 63 VTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNT 103
Score = 30.7 bits (69), Expect = 3.1
Identities = 31/120 (25%), Positives = 45/120 (37%), Gaps = 13/120 (10%)
Query: 593 KTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAV 652
K +++EK +K +KE A S T +KK AK K+T S
Sbjct: 9 KKKTTEEKKPAAKKATTSKETAKTKKTAKTTS----------TKAAKKAAKVKKTKSVKT 58
Query: 653 SPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEK 712
+ KV K S + K + K++ S E +A K K S+ SEK
Sbjct: 59 TTKKVTVKF---EKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKKLFASEK 115
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 39.0 bits (91), Expect = 0.010
Identities = 49/290 (16%), Positives = 88/290 (30%), Gaps = 34/290 (11%)
Query: 469 DEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISE 528
D+ + +ST + + D +S+ E S T + + +D + + + +K TI S
Sbjct: 28 DDPQKDSTAKTTSHDSKKSNDDET--SKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSST 85
Query: 529 KPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKG 588
+ D + K + S T + +
Sbjct: 86 SDSNNIIDFIYKNLPQTNINQLLTKNKY-------------------DDNYSLTTLIQNL 126
Query: 589 GELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETA 648
L +S E+ EK +++ + K Q S K K P+
Sbjct: 127 FNLNSDISDYEQPRNSEKSTNDSNKNSDS--SIKNDTDTQSSKQDKADNQKAPSSNNTKP 184
Query: 649 STAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDS-----------DEDSKAKTKKK 697
ST+ + S SDD S K + D+ S+ K +
Sbjct: 185 STSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQ 244
Query: 698 KEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVEST 747
K+ + K+ P D+ K KSK +++ ST
Sbjct: 245 KDYASQSKKDKTETSNTKNPQLPTQDELKHKSKPAQSFENDVNQSNTRST 294
Score = 38.2 bits (89), Expect = 0.016
Identities = 25/164 (15%), Positives = 46/164 (28%), Gaps = 30/164 (18%)
Query: 629 DSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRD--- 685
D+D + + + + + S I+ + + + +K K D
Sbjct: 59 DTDKADNNNTSNQ-DNNDKKFSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNY 117
Query: 686 ----------------SDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKS 729
SD + ++K S SDS SK ++
Sbjct: 118 SLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAP 177
Query: 730 KKEDEIDDEEYDEP----------VESTPPSSDAGSDASSSEDD 763
+ +P S P S D + SSS+D+
Sbjct: 178 SSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDN 221
Score = 29.8 bits (67), Expect = 7.7
Identities = 11/104 (10%), Positives = 32/104 (30%), Gaps = 3/104 (2%)
Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGD 723
++ S +D + S + + ++ ++ + + +S +
Sbjct: 37 KTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIY 96
Query: 724 KSKSKSKKEDEIDDEEYDEP---VESTPPSSDAGSDASSSEDDD 764
K+ ++ + +YD+ + SD S E
Sbjct: 97 KNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPR 140
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 38.8 bits (90), Expect = 0.011
Identities = 19/74 (25%), Positives = 37/74 (50%), Gaps = 3/74 (4%)
Query: 443 SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDS 502
S + E+E Y A E D+ D+++E + + ED + D++E+D ++E D
Sbjct: 931 SDESEEEVSEYEAS---SDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELE 987
Query: 503 DEDSDASEGTKKKK 516
+ + S K++K
Sbjct: 988 SKAAYDSRPGKRRK 1001
Score = 30.0 bits (67), Expect = 5.5
Identities = 18/80 (22%), Positives = 36/80 (45%), Gaps = 7/80 (8%)
Query: 461 AAERDDEDDED-----EDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKK 515
DDE DE E E++ +D + + E + ++E + + +S+ DS E +
Sbjct: 924 MVGSDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDW 983
Query: 516 KKEKSRKTITISEKPRKSKK 535
+ +S+ +P K +K
Sbjct: 984 DELESKA--AYDSRPGKRRK 1001
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 39.0 bits (91), Expect = 0.011
Identities = 45/169 (26%), Positives = 67/169 (39%), Gaps = 12/169 (7%)
Query: 599 EKAEYDEKVN---KAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPH 655
++ E DE N KE + L+ K+ DD K +K +K E
Sbjct: 161 DEDEGDELENVELVEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKK-- 218
Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKK 715
K + ++S GS+ D + K+ K+K + S + S+ D E K KK
Sbjct: 219 KSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTETPASD-YYDVSEMVKFKK 277
Query: 716 SKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
K K K K KK + DE+ EP SSD+GS E++
Sbjct: 278 PKKK------KKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENA 320
Score = 32.4 bits (74), Expect = 0.93
Identities = 38/198 (19%), Positives = 67/198 (33%), Gaps = 17/198 (8%)
Query: 545 KRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYD 604
K KK + KKP +D + K+ EI K + + D
Sbjct: 178 KDKKNLELKKKKPDYDP----DDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTD 233
Query: 605 EKVNKAKEDYNEALKAYKESGGGQD--------SDDGKTSKSKKPAKKKETASTAVSPHK 656
++ K +++ + LK S D + K KKP KKK+ +
Sbjct: 234 DEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKK-----R 288
Query: 657 VKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKS 716
K + E + S K + + + KK+KE+ E + + + + S
Sbjct: 289 RKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDDDFVEDDDDLQAS 348
Query: 717 KSKPKGDKSKSKSKKEDE 734
+K + K + K E
Sbjct: 349 LAKQRRLAQKKRKKLRPE 366
Score = 30.1 bits (68), Expect = 5.5
Identities = 30/162 (18%), Positives = 54/162 (33%), Gaps = 12/162 (7%)
Query: 593 KTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDD------GKTSKSKKPAKKKE 646
K + K+K + + K + L Y E G+ + A+KK
Sbjct: 181 KNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTDDEAEKKR 240
Query: 647 TASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESD 706
S + + +SD D + +K + K K +K ++ E E +
Sbjct: 241 QEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPE 300
Query: 707 SGESEKEK-KSKSKPKGDK-----SKSKSKKEDEIDDEEYDE 742
+ S+ + + S K K+E E DD D+
Sbjct: 301 AEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDDDFVEDD 342
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 37.8 bits (88), Expect = 0.013
Identities = 37/160 (23%), Positives = 64/160 (40%), Gaps = 18/160 (11%)
Query: 605 EKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIE 664
K KA + E+ K+ K + K ++K+ +A SP E
Sbjct: 105 AKSPKAMRTFEESKKSKKTVDSM--IERKPKEPGLKRKQRKKAQESATSP---------E 153
Query: 665 SNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDK 724
S+ SS+ + K +K ++ +K+ S + S EK KSK K
Sbjct: 154 SSPSSTPNSSRPSTPHLLK------AKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKK 207
Query: 725 SKSKSKKEDEIDDEEYDEPVE-STPPSSDAGSDASSSEDD 763
+ K +K D DE+ D ++ S P ++D +DA ++
Sbjct: 208 AGKKMRKWDLDGDEDDDAVLDYSAPDANDENADAPEDVEE 247
Score = 29.3 bits (66), Expect = 7.0
Identities = 17/74 (22%), Positives = 29/74 (39%), Gaps = 2/74 (2%)
Query: 688 EDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVEST 747
E K K + + +S +S+K S + K + K K+ + E ES+
Sbjct: 98 EKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQ--ESATSPESS 155
Query: 748 PPSSDAGSDASSSE 761
P S+ S S+
Sbjct: 156 PSSTPNSSRPSTPH 169
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 37.7 bits (87), Expect = 0.022
Identities = 21/154 (13%), Positives = 46/154 (29%), Gaps = 12/154 (7%)
Query: 607 VNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESN 666
V++A E L A + + +P + + A + + E +
Sbjct: 292 VDEALEREPAELAAAAVAS---AASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAES 348
Query: 667 GSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSK 726
D + + +E S+A E+ + +G++ + ++ +
Sbjct: 349 VVQVADRDGESTPA-----VEETSEADI----EREQPGDLAGQAPAAHQVDAEAASAAPE 399
Query: 727 SKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSS 760
+ E DE E E P D +
Sbjct: 400 EPAALASEAHDETEPEVPEKAAPIPDPAKPDELA 433
Score = 36.5 bits (84), Expect = 0.052
Identities = 19/102 (18%), Positives = 36/102 (35%), Gaps = 8/102 (7%)
Query: 439 EDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNP--------DQAESDVA 490
E + ++ E D + A + D + ++ P D+ E +V
Sbjct: 358 ESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVP 417
Query: 491 EEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRK 532
E+ P P DE + A G + + R+ S + R+
Sbjct: 418 EKAAPIPDPAKPDELAVAGPGDDPAEPDGIRRQDDFSSRRRR 459
Score = 35.3 bits (81), Expect = 0.12
Identities = 18/112 (16%), Positives = 29/112 (25%), Gaps = 1/112 (0%)
Query: 449 EPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDA 508
+PD VK E AE DE + + + E D D A
Sbjct: 325 QPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPAVEETSEADIE-REQPGDLAGQA 383
Query: 509 SEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSA 560
+ + S + ++ + E D P + A
Sbjct: 384 PAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVA 435
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 37.3 bits (87), Expect = 0.023
Identities = 25/128 (19%), Positives = 51/128 (39%), Gaps = 10/128 (7%)
Query: 422 KKLRVKNTGKGEKPNYKEDFGSSDDE---------KEPDAYLARVKREAAERDDEDDEDE 472
KK R K G + K+ +G + ++ + K + +DD+DE
Sbjct: 41 KKDRGKPASMGLEAGEKKGYGEEEVVDGIPGLELLEKWKEEERKKKEAEQGLESDDDDDE 100
Query: 473 DESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRK 532
+E + + + D + + +S+ SD + D E + KK K +SE+ +
Sbjct: 101 EEEWEVEEDEDSDDEGEWIDVESDKEIESSDSE-DEEEKDEAAKKAKEDSDEELSEEDEE 159
Query: 533 SKKDKESS 540
++E +
Sbjct: 160 EAAEEEEA 167
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 37.4 bits (86), Expect = 0.031
Identities = 58/280 (20%), Positives = 107/280 (38%), Gaps = 19/280 (6%)
Query: 465 DDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTI 524
DD D DE+ D DF + + +E D+ DSD D SEG + K+ + K
Sbjct: 431 DDSDVSTSDENEDVDFTGKKGAINNEDESDNEEVAFDSDSQFDESEGNLRWKEGLASKL- 489
Query: 525 TISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREK--IKKDNPGISFT 582
+S K + K + +P++ + Y E + +D P F
Sbjct: 490 ----AYSQSGKRGRNIQ---KIFYDESLSPEECIEEYKGESAKSSESDLVVQDEPEDFF- 541
Query: 583 EIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPA 642
+++K E + K E++ K LK+ + DS +G+ +
Sbjct: 542 DVSKVANESISSNHEKLMESEFEELKKKWSSL-AQLKSRFQKDATLDSIEGEEELIQD-- 598
Query: 643 KKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSE 702
+K E ES GSS +++++ A E++ K ++ + E
Sbjct: 599 DEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREENARKKEELRGNFE 658
Query: 703 SESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDE 742
E + G+ EK K + K K +++ +I+ E++
Sbjct: 659 LE-ERGDPEK----KDVDWYTEEKRKIEEQLKINRSEFET 693
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 36.8 bits (86), Expect = 0.044
Identities = 41/218 (18%), Positives = 76/218 (34%), Gaps = 43/218 (19%)
Query: 528 EKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKK 587
+ + KK E R KDK+A + ++K AKK
Sbjct: 468 AREARHKKAAE------ARAAKDKDAVAAAL-----------ARVK-----------AKK 499
Query: 588 GGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKET 647
V + V A+E +A + + D KK
Sbjct: 500 AAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQAAAAAD----------PKKAA 549
Query: 648 ASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDS 707
+ A++ K K +N + + D KKA + ++AK KK +++ S
Sbjct: 550 VAAAIARAKAKKAAQQAANAEAEEEVDPKKA-----AVAAAIARAKAKKAAQQAASAEPE 604
Query: 708 GESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVE 745
+ + K+ +++K+KK ++ + E +EPV+
Sbjct: 605 EQVAEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPVD 642
Score = 36.1 bits (84), Expect = 0.083
Identities = 46/261 (17%), Positives = 91/261 (34%), Gaps = 43/261 (16%)
Query: 447 EKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDS 506
E+E A AR K+ A R +D + + +A + + P +S +
Sbjct: 463 EREKAAREARHKKAAEARAAKDKDAVAAAL-ARVKAKKAAATQPIVIKAGARPDNSAVIA 521
Query: 507 DASEGTKKKKKEKSRKTITISEKPRKS-----------KKDKESSGGSTKRKKKDKNAPK 555
+ + ++ K + P+K+ KK + + + ++ D PK
Sbjct: 522 AREARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVD---PK 578
Query: 556 KPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYN 615
K A + R K KK + E ++ +KA + +AK
Sbjct: 579 KAAVAAAI----ARAKAKKAAQQAASAEPEEQV-----AEVDPKKAAVAAAIARAK---- 625
Query: 616 EALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDD 675
A KA ++ + +K + A++ K + ++N +++D
Sbjct: 626 -AKKAEQQ---------ANAEPEEPVDPRKAAVAAAIARAKARKAAQQQANAEPEEAEDP 675
Query: 676 KKASSKRKRDSDEDSKAKTKK 696
KKA + ++AK KK
Sbjct: 676 KKA-----AVAAAIARAKAKK 691
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 36.5 bits (84), Expect = 0.047
Identities = 15/64 (23%), Positives = 33/64 (51%)
Query: 454 LARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTK 513
L + E A +++DDE++D+ D++ D + D ++ D + + +DS A +G +
Sbjct: 41 LGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNE 100
Query: 514 KKKK 517
+
Sbjct: 101 TDNE 104
Score = 31.9 bits (72), Expect = 1.2
Identities = 36/151 (23%), Positives = 63/151 (41%), Gaps = 23/151 (15%)
Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDED 479
+E +R E+ +ED DD+ + D + E + DD+D++DEDE D+
Sbjct: 34 KENAIRKLGKEAEEEAMEEEDDDEEDDDDDDD------EDEDDDDDDDDEDDEDEDDDDS 87
Query: 480 FNPDQAESDVAEEYDSNPTPTDSDEDSD------------ASEGTKKKKKEKSRKTITIS 527
D + +D E D+ DSD++SD + T +K E R+
Sbjct: 88 TLHDDSSADDGNETDNEAGFADSDDESDDGSEYVFWAPGTTTAATSPRKLETMRRKSR-- 145
Query: 528 EKPRKSKKDKESSGGSTKRKKKDKNAPKKPM 558
R++ S K+++K K + P+
Sbjct: 146 ---RRTSDSSADSLNERKQRRKWKRPRRSPI 173
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 36.8 bits (85), Expect = 0.048
Identities = 27/131 (20%), Positives = 43/131 (32%), Gaps = 2/131 (1%)
Query: 622 KESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSK 681
K+ + S S S + S K +E + S S S K
Sbjct: 764 KKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSRN-KEEEMLPSPSSPLSSSSPKPEHPS 822
Query: 682 RKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKG-DKSKSKSKKEDEIDDEEY 740
RKR ++ + + S ++S S S K K++ KG SK + ++
Sbjct: 823 RKRPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHRKTEGKGSSTSKEHKGSSGDTPNKAS 882
Query: 741 DEPVESTPPSS 751
PV S
Sbjct: 883 SFPVPPLSNGS 893
Score = 31.4 bits (71), Expect = 2.4
Identities = 17/43 (39%), Positives = 27/43 (62%), Gaps = 3/43 (6%)
Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDK 676
K+S K+K + + + P KS+EFIE++ SSSDS +D+
Sbjct: 595 KSSVPPAAEKRKYKSPSKIVP---KSREFIETDSSSSDSPEDE 634
Score = 29.9 bits (67), Expect = 6.3
Identities = 24/107 (22%), Positives = 39/107 (36%), Gaps = 7/107 (6%)
Query: 643 KKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSE 702
+K+ S+ S + +E + S S+D + + K S + E +
Sbjct: 336 QKQYDPSSKTSTNSQSGTSMLEDDLKLSSSEDSDEEQATEKPPSRNTPPSAPSSNPEPAA 395
Query: 703 SESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPP 749
S S S S +S S S S+ E D E +EP + P
Sbjct: 396 SSSGSSSSSSGSES-------SSGSDSESESSSSDSEENEPPRTASP 435
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 36.4 bits (84), Expect = 0.053
Identities = 45/209 (21%), Positives = 69/209 (33%), Gaps = 11/209 (5%)
Query: 517 KEKSRK-TITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIK-- 573
KE++ ++T EK S K S+ + N S + EK K
Sbjct: 203 KEENHTLSVTDKEKSEASSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDH 262
Query: 574 KDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDG 633
+ I S + E E + + + KE+ S
Sbjct: 263 HHSHNHQHHSIGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNSSSAAA 322
Query: 634 KT--SKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSK 691
+ SKS K AK + SP S + S SD++ SK+KR S S
Sbjct: 323 GSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNE------SKQKRASKSSSG 376
Query: 692 AKTKKKKEKSESESDSGESEKEKKSKSKP 720
A+ KK S + + E+ + S P
Sbjct: 377 ARDSKKDASGMSANGTVENCIPENKISTP 405
Score = 30.3 bits (68), Expect = 5.0
Identities = 51/255 (20%), Positives = 93/255 (36%), Gaps = 29/255 (11%)
Query: 504 EDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSG----GSTKR-KKKDKNAPKKPM 558
S A+ G+ K KS K ++ K ++G S + K K A K
Sbjct: 316 NSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNESKQKRASKSSS 375
Query: 559 SAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEAL 618
A KKD G+S + K + ++ + K + + +A
Sbjct: 376 GARD---------SKKDASGMSANGTVENCIPENKISTPSAIERLEQDIKKLQAELQQAR 426
Query: 619 KAYKE--SGGGQDSDDGKTSKSKKPAKKKE-----TASTAVSPHKVKSKEFIESNGSSSD 671
+ E + + ++ KS KKE T ++ K K K+ ++S
Sbjct: 427 QNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLK 486
Query: 672 SDDDKKAS------SKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKS 725
S+ D + + ++KR +E+ A + + + S + ES K+ K + + K
Sbjct: 487 SEADSRVNAEKQLAEEKKRKKEEEETAA-RAAAQAAASREECAESLKQAKQDLEMEIKKL 545
Query: 726 KSKSK-KEDEIDDEE 739
+ K KE+E E
Sbjct: 546 EHDLKLKEEECRMLE 560
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 35.6 bits (82), Expect = 0.058
Identities = 16/96 (16%), Positives = 40/96 (41%)
Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKK 715
+ KE + SD + K++ ++++ EDS+ + +++K E + + E ++
Sbjct: 141 EKFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGG 200
Query: 716 SKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSS 751
G S + + +++ +P P S
Sbjct: 201 GGGSSGGQSGLSTKDEPPKEKRQKHHDPERRLEPQS 236
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 36.2 bits (83), Expect = 0.062
Identities = 28/141 (19%), Positives = 54/141 (38%), Gaps = 14/141 (9%)
Query: 626 GGQDSDDGKTSKSKKPAKKKET--ASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRK 683
D + + K + K+ T +S + E + + D+ DK++ +
Sbjct: 60 EEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELV----VNEDAALDKESKKTPR 115
Query: 684 RDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEP 743
R + + A + ++EK+E +K +K + K D+ E E+ D E E
Sbjct: 116 RTRRKAAAASSDVEEEKTE--------KKVRKRRKVKKMDEDVEDQGSESEVSDVEESEF 167
Query: 744 VESTPPSSDAGSDASSSEDDD 764
V S S+ D + +D
Sbjct: 168 VTSLENESEEELDLEKDDGED 188
Score = 29.6 bits (66), Expect = 6.6
Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 8/67 (11%)
Query: 443 SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDS 502
+S D +E KR ++ DED ED+ ++ SDV E +S
Sbjct: 124 ASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESE--------VSDVEESEFVTSLENES 175
Query: 503 DEDSDAS 509
+E+ D
Sbjct: 176 EEELDLE 182
>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1). This family
consists of several mammalian dentin matrix protein 1
(DMP1) sequences. The dentin matrix acidic
phosphoprotein 1 (DMP1) gene has been mapped to human
chromosome 4q21. DMP1 is a bone and teeth specific
protein initially identified from mineralised dentin.
DMP1 is primarily localised in the nuclear compartment
of undifferentiated osteoblasts. In the nucleus, DMP1
acts as a transcriptional component for activation of
osteoblast-specific genes like osteocalcin. During the
early phase of osteoblast maturation, Ca(2+) surges into
the nucleus from the cytoplasm, triggering the
phosphorylation of DMP1 by a nuclear isoform of casein
kinase II. This phosphorylated DMP1 is then exported out
into the extracellular matrix, where it regulates
nucleation of hydroxyapatite. DMP1 is a unique molecule
that initiates osteoblast differentiation by
transcription in the nucleus and orchestrates
mineralised matrix formation extracellularly, at later
stages of osteoblast maturation. The DMP1 gene has been
found to be ectopically expressed in lung cancer
although the reason for this is unknown.
Length = 514
Score = 36.2 bits (83), Expect = 0.067
Identities = 70/331 (21%), Positives = 138/331 (41%), Gaps = 29/331 (8%)
Query: 459 REAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSD----ASEGTKK 514
R + D+ D+DED+S D+ F D + E + SDEDS +SE +
Sbjct: 93 RSTGKEGDDKDDDEDDSGDDTFGDDDSGPGPEERQWGGNSRLGSDEDSADTTQSSEDSTP 152
Query: 515 KKKEKSRKTITIS-EKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMRE--- 570
+ + ++ T + S + + + D GG + + + + S D E
Sbjct: 153 QGENSAQDTTSESRDLDNEDEVDSRPEGGDSTQDSESEEHWVGGGSEGESSHGDGSEFDD 212
Query: 571 -KIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALK---------- 619
++ D+P + +E S + K E +E+ + +++++
Sbjct: 213 EGMQSDDPESTRSERGNSRMSSAGLKSKESKGEDEEQASTQDSGESQSVEYPSRKFFRKS 272
Query: 620 AYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKAS 679
E G + DD T + K + + S + + +S+E E N S DS + + S
Sbjct: 273 RISEEDGRGELDDSNTMEVKSDSTENAGLSQSREHSRSESQEDSEENQSQEDSQEVQDPS 332
Query: 680 SKRKRDSDEDSKAKTKKKKEKSESES---------DSGESEKEKKSKSKPKGDK-SKSKS 729
S+ +++D S+ + + +E+ SES E +++ +S + D S S+S
Sbjct: 333 SESSQEADLPSQENSSESQEEVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSES 392
Query: 730 KKEDEIDDEEYDEPVESTPPSSDAGSDASSS 760
+ +E D E +E + S+ S ++ D +SS
Sbjct: 393 QSTEEQADSESNESLSSSEESPESTEDENSS 423
Score = 32.7 bits (74), Expect = 0.78
Identities = 29/137 (21%), Positives = 65/137 (47%), Gaps = 3/137 (2%)
Query: 628 QDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSD 687
+DS+ + P+ + ++ +S E + S+ S +S +D+ +SS+ S
Sbjct: 374 EDSESSEEDSLDTPSSSESQSTE--EQADSESNESLSSSEESPESTEDENSSSQEGLQS- 430
Query: 688 EDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVEST 747
+ +++ ++ +SE +S S E + + + S+ K D + ++S E D + + +ES
Sbjct: 431 HSASTESRSQESQSEQDSRSEEDDSDSQDSSRSKEDSNSTESASSSEEDGQPKNTEIESR 490
Query: 748 PPSSDAGSDASSSEDDD 764
+ DA + + DD
Sbjct: 491 KLTVDAYHNKPIGDQDD 507
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 36.0 bits (83), Expect = 0.076
Identities = 23/130 (17%), Positives = 54/130 (41%), Gaps = 5/130 (3%)
Query: 429 TGKGEKPNYK--EDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAE 486
+ GE P+ E E+E + + E ++E +++E+E E N + E
Sbjct: 426 STSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEE 485
Query: 487 SDVAEEYDSNPTPTDSDE---DSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGS 543
+ + E D + + D +S+ + ++ + ++ R + E P++ ES
Sbjct: 486 MEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAE 545
Query: 544 TKRKKKDKNA 553
+ ++ D+
Sbjct: 546 SVGEESDEEL 555
Score = 30.3 bits (68), Expect = 4.8
Identities = 14/90 (15%), Positives = 37/90 (41%)
Query: 675 DKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDE 734
+++ R SD + T + S+ E E++ + + + ++ + +S++E+
Sbjct: 408 ERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEG 467
Query: 735 IDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
D+EE +E + + D +
Sbjct: 468 EDEEEEEEVEADNGSEEEMEGSSEGDGDGE 497
Score = 30.3 bits (68), Expect = 5.4
Identities = 26/117 (22%), Positives = 49/117 (41%), Gaps = 5/117 (4%)
Query: 643 KKKETASTAVSPH-KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKS 701
K++E S SK S S S + + + + + +E+ + +++E+
Sbjct: 405 KRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEE----EEEEEQ 460
Query: 702 ESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDAS 758
ESE + GE E+E++ G + + + E + D EE +E E S S
Sbjct: 461 ESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMS 517
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 35.8 bits (82), Expect = 0.11
Identities = 20/108 (18%), Positives = 41/108 (37%), Gaps = 5/108 (4%)
Query: 439 EDFGSSDDEKEPDAYLARVK----REAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYD 494
+D G ++ A K +A +DE++ED + DE+ + D +E D
Sbjct: 364 DDHGFLPGREQTSKKAAVPKGTSFYQAKWAEDEEEED-GQCNDEESTMSAIDDDDPKEND 422
Query: 495 SNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGG 542
+ D + D +EG ++ E+ + + K ++
Sbjct: 423 NEEVAGDEESAIDDNEGFEELSPEEEERQLREFRDMEKEDREFPDEAE 470
>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein. This family consists of several
eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
to suppress GCR2 and is highly expressed in the muscle
and heart. The function of this family is unknown
although it has been speculated that SGT1 may be
functionally analogous to the Gcr2p protein of
Saccharomyces cerevisiae which is known to be a
regulatory factor of glycolytic gene expression.
Length = 557
Score = 35.5 bits (82), Expect = 0.12
Identities = 19/89 (21%), Positives = 35/89 (39%), Gaps = 3/89 (3%)
Query: 660 KEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKT---KKKKEKSESESDSGESEKEKKS 716
K + D+D + + D ++D + + E+ E D ++E ++
Sbjct: 467 KNMLGLKDDEIDNDLPDDSDDADEDDDEDDDEDEDSSSDSTLEELEEYMDQMDAELKQTD 526
Query: 717 KSKPKGDKSKSKSKKEDEIDDEEYDEPVE 745
S + S ED+ DD E EPV+
Sbjct: 527 SSNNADISNSGSSGAEDDDDDIEGVEPVD 555
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair
of CXXC motifs, followed by a long region of low
complexity sequence in which roughly half the residues
are Asp and Glu, including multiple runs of five or more
acidic residues. The function of members of this family
is unknown.
Length = 104
Score = 32.7 bits (75), Expect = 0.17
Identities = 14/51 (27%), Positives = 26/51 (50%)
Query: 455 ARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDED 505
A+ + + DD +D+D+ D+D PD + DV + D + D D++
Sbjct: 54 AKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDDE 104
Score = 29.6 bits (67), Expect = 1.7
Identities = 15/54 (27%), Positives = 26/54 (48%)
Query: 455 ARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDA 508
A+ + AA+ +D +DEDE ++D D + D ++ + D D D D
Sbjct: 41 AKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDD 94
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 35.0 bits (80), Expect = 0.17
Identities = 18/65 (27%), Positives = 29/65 (44%)
Query: 433 EKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEE 492
E+ +Y D S EK+ D L K E R+ E+ E E+E D + D ++ ++
Sbjct: 347 ERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDD 406
Query: 493 YDSNP 497
N
Sbjct: 407 ELENH 411
Score = 33.8 bits (77), Expect = 0.35
Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 1/78 (1%)
Query: 445 DDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDV-AEEYDSNPTPTDSD 503
++E E Y+ E+ +D ++ E+ NP+++E + E+Y+ +
Sbjct: 343 NEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRI 402
Query: 504 EDSDASEGTKKKKKEKSR 521
D D E + EK+
Sbjct: 403 CDDDELENHFRAADEKNS 420
>gnl|CDD|153272 cd07588, BAR_Amphiphysin, The Bin/Amphiphysin/Rvs (BAR) domain of
Amphiphysins. BAR domains are dimerization, lipid
binding and curvature sensing modules found in many
different proteins with diverse functions. Amphiphysins
function primarily in endocytosis and other membrane
remodeling events. They contain an N-terminal BAR domain
with an additional N-terminal amphipathic helix (an
N-BAR), a variable central domain, and a C-terminal SH3
domain. This subfamily is composed of different isoforms
of amphiphysin and Bridging integrator 2 (Bin2).
Amphiphysin I proteins, enriched in the brain and
nervous system, contain domains that bind clathrin,
Adaptor Protein complex 2 (AP2), dynamin and
synaptojanin. They function in synaptic vesicle
endocytosis. Some amphiphysin II isoforms, also called
Bridging integrator 1 (Bin1), are localized in many
different tissues and may function in intracellular
vesicle trafficking. In skeletal muscle, Bin1 plays a
role in the organization and maintenance of the T-tubule
network. Bin2 is mainly expressed in hematopoietic cells
and is upregulated during granulocyte differentiation.
The N-BAR domains of amphiphysins form a curved dimer
with a positively-charged concave face that can drive
membrane bending and curvature.
Length = 211
Score = 34.3 bits (79), Expect = 0.17
Identities = 19/67 (28%), Positives = 39/67 (58%), Gaps = 4/67 (5%)
Query: 557 PMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNE 616
P++AY F +++++I K + + A+ E + +K+K + D+K+ KA+E+ +
Sbjct: 95 PLTAYQSQFPEVKKRIAKRGRKLVDYDSARHNLE---ALKAKKKVD-DQKLTKAEEELQQ 150
Query: 617 ALKAYKE 623
A K Y+E
Sbjct: 151 AKKVYEE 157
>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
Provisional.
Length = 337
Score = 34.4 bits (79), Expect = 0.20
Identities = 12/59 (20%), Positives = 25/59 (42%), Gaps = 3/59 (5%)
Query: 463 ERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSR 521
E DED + + D+D++ + + + +S+ D++E+ D E K
Sbjct: 278 EAIDEDSDYSSDEDDDDYDSYDSSDSASSDSNSD---VDTNEEDDRGEKESNGAKSNEL 333
Score = 31.3 bits (71), Expect = 2.0
Identities = 11/56 (19%), Positives = 28/56 (50%), Gaps = 5/56 (8%)
Query: 467 EDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRK 522
D++ + S ++D + D +S DS + ++SD D++ + +K+ ++
Sbjct: 280 IDEDSDYSSDEDDDDYDSYDSS-----DSASSDSNSDVDTNEEDDRGEKESNGAKS 330
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 34.7 bits (80), Expect = 0.20
Identities = 14/61 (22%), Positives = 32/61 (52%)
Query: 677 KASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEID 736
K K D +E+ + + KK++E+ ++ + E+E+K + K K K K + + + ++
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLN 88
Query: 737 D 737
Sbjct: 89 K 89
Score = 30.1 bits (68), Expect = 5.0
Identities = 18/72 (25%), Positives = 33/72 (45%), Gaps = 10/72 (13%)
Query: 463 ERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRK 522
E + E ++E+E E+ +E + T + + D + + KKKK +K ++
Sbjct: 30 EVEKEVPDEEEEEEKEE----------KKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKE 79
Query: 523 TITISEKPRKSK 534
T T E K+K
Sbjct: 80 TTTEWELLNKTK 91
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 34.7 bits (79), Expect = 0.21
Identities = 33/155 (21%), Positives = 57/155 (36%), Gaps = 9/155 (5%)
Query: 422 KKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREA-AERDDEDDEDEDESTDEDF 480
+KL V+ E E EK+ + K+E ERD + ++ +
Sbjct: 171 EKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLL 230
Query: 481 NPDQ---AESDVAEEYDSNPTPTDSDEDSDAS-EGTKKKKKEKSRKTITISEKPRKSKKD 536
D + + VAE +++ T S D+ + K + E+++K S K +K K+
Sbjct: 231 ALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKK----SPKHKKKKQR 286
Query: 537 KESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREK 571
KE K+K A N E+
Sbjct: 287 KEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGAVEE 321
Score = 32.7 bits (74), Expect = 0.75
Identities = 51/255 (20%), Positives = 79/255 (30%), Gaps = 46/255 (18%)
Query: 458 KREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKK 517
KRE +R +DED P Q V EE N P+D D D D ++ +
Sbjct: 103 KREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPENALPSDED-DKDPNDPYRALDI 161
Query: 518 EKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNP 577
+ + + P +K ++ K P K D P
Sbjct: 162 DLDKPLADSEKLP--------------VQKHRNAETSKSPE--------------KGDVP 193
Query: 578 GISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSK 637
+ KK K KEK E +++ +K K K E +
Sbjct: 194 AVE-----KKS----KKPKKKEKKEKEKERDKDK-------KKEVEGFKSLLLALDDSPA 237
Query: 638 SKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDE-DSKAKTKK 696
S + + AS A + + + + K K+K+ E + K K KK
Sbjct: 238 SAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKK 297
Query: 697 KKEKSESESDSGESE 711
SD G +
Sbjct: 298 HHHHRCHHSDGGAEQ 312
Score = 32.0 bits (72), Expect = 1.5
Identities = 34/136 (25%), Positives = 53/136 (38%), Gaps = 12/136 (8%)
Query: 628 QDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSD 687
+ D K K KKKE K K KE +E S + DD AS+ ++D
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKE-VEGFKSLLLALDDSPASAASVAEAD 246
Query: 688 EDSKAKT-----------KKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEID 736
E S A T + K ++E S + +K+K+ K K + K K
Sbjct: 247 EASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHS 306
Query: 737 DEEYDEPVESTPPSSD 752
D ++PV++ +
Sbjct: 307 DGGAEQPVQNGAVEEE 322
Score = 31.6 bits (71), Expect = 1.7
Identities = 22/87 (25%), Positives = 42/87 (48%), Gaps = 1/87 (1%)
Query: 671 DSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSK 730
D D DK + K + A+T K EK + + +S+K K K K + +K + K K
Sbjct: 160 DIDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKK-PKKKEKKEKEKERDKDK 218
Query: 731 KEDEIDDEEYDEPVESTPPSSDAGSDA 757
K++ + ++ +P S+ + ++A
Sbjct: 219 KKEVEGFKSLLLALDDSPASAASVAEA 245
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 33.9 bits (78), Expect = 0.24
Identities = 15/62 (24%), Positives = 31/62 (50%), Gaps = 2/62 (3%)
Query: 672 SDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
++++ + + R +E +K K KKKK+K++ + +G S + S + P S +
Sbjct: 165 TEEEVEL--LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVG 222
Query: 732 ED 733
E
Sbjct: 223 EA 224
Score = 28.9 bits (65), Expect = 9.5
Identities = 16/71 (22%), Positives = 25/71 (35%), Gaps = 4/71 (5%)
Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
K K KK KK ++ + S+G+ + K + +E S+
Sbjct: 184 KKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAPDNEKSEVY 243
Query: 694 ----TKKKKEK 700
T KKEK
Sbjct: 244 KSLFTSHKKEK 254
>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
Length = 534
Score = 34.2 bits (78), Expect = 0.25
Identities = 26/86 (30%), Positives = 39/86 (45%), Gaps = 14/86 (16%)
Query: 671 DSDDDKKASSKRKR-DSDEDSKAKTKKKKEKSESESDSG--------ESEKEKKSKSKPK 721
+ D D A + R DSDED + + E+++ E DS SE E S
Sbjct: 436 EFDQDPGAPAHADRADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSADDSW 495
Query: 722 GDKSKSKSKKEDE-----IDDEEYDE 742
GD+S S + +D I++EE +E
Sbjct: 496 GDESDSGIEHDDGGVGQAIEEEEEEE 521
Score = 33.8 bits (77), Expect = 0.37
Identities = 19/72 (26%), Positives = 32/72 (44%)
Query: 667 GSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSK 726
S + D D++ S + D ++DS + SE ESDS + +S S + D
Sbjct: 450 ADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGG 509
Query: 727 SKSKKEDEIDDE 738
E+E ++E
Sbjct: 510 VGQAIEEEEEEE 521
Score = 33.4 bits (76), Expect = 0.54
Identities = 18/101 (17%), Positives = 40/101 (39%), Gaps = 9/101 (8%)
Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGD 723
+ +++ +D D +A + + D+D A + S+ + + ++++ + D
Sbjct: 417 AAAAANAPADQDVEAEAHD--EFDQDPGAPAHADRADSDEDDMDEQESGDERADGEDDSD 474
Query: 724 KSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
S S S E + + D+ S SD+ DD
Sbjct: 475 SSYSYSTTSSEDESDSADD-------SWGDESDSGIEHDDG 508
Score = 31.1 bits (70), Expect = 2.4
Identities = 16/62 (25%), Positives = 25/62 (40%), Gaps = 1/62 (1%)
Query: 445 DDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDE 504
+ E + A+R D D++D DE D D E D Y + T ++ +
Sbjct: 430 EAEAHDEFDQDPGAPAHADRADSDEDDMDEQESGDERAD-GEDDSDSSYSYSTTSSEDES 488
Query: 505 DS 506
DS
Sbjct: 489 DS 490
Score = 30.7 bits (69), Expect = 3.7
Identities = 18/82 (21%), Positives = 28/82 (34%)
Query: 455 ARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKK 514
R AA + D+D + ++F+ D A+ DS+ D E D +
Sbjct: 412 GRAAAAAAAANAPADQDVEAEAHDEFDQDPGAPAHADRADSDEDDMDEQESGDERADGED 471
Query: 515 KKKEKSRKTITISEKPRKSKKD 536
+ T SE S D
Sbjct: 472 DSDSSYSYSTTSSEDESDSADD 493
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 34.4 bits (79), Expect = 0.27
Identities = 38/166 (22%), Positives = 55/166 (33%), Gaps = 26/166 (15%)
Query: 590 ELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETAS 649
+ K S K + E D K + D N+ L + D K K K K++
Sbjct: 24 KNLKHSSYKIRLESDIKDSLLNLDINKKL---------HEKLDKKNKKFNKTDDLKDSKK 74
Query: 650 TAVSPHKVKSKEFIESNGSSSDSDDD----KKASSKRKRDSDEDSKAKTKKKKEKSESES 705
T K+K K+ I+ D D+ K S + K K K EK
Sbjct: 75 T-----KLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEK----- 124
Query: 706 DSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSS 751
+K +KS K K SK E D + + P +
Sbjct: 125 ---LKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPKSISIHSPLT 167
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 33.9 bits (78), Expect = 0.29
Identities = 12/63 (19%), Positives = 25/63 (39%), Gaps = 1/63 (1%)
Query: 460 EAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSN-PTPTDSDEDSDASEGTKKKKKE 518
+DED+E+E S + + + + + + + + SD SD+ K+
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESAS 426
Query: 519 KSR 521
S
Sbjct: 427 DSD 429
>gnl|CDD|223052 PHA03346, PHA03346, US22 family homolog; Provisional.
Length = 520
Score = 34.0 bits (78), Expect = 0.32
Identities = 16/91 (17%), Positives = 31/91 (34%), Gaps = 4/91 (4%)
Query: 434 KPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEY 493
+P + D A R +DD++ D+ D+D+ + D + Y
Sbjct: 394 RPRGPPENEDEADGGGGLAPPPEPPRPPPPPFPDDDDEGDDEDDDDWEDLGFDLDEDDVY 453
Query: 494 DSNPTPTDSDEDSDASEGTKKKKKEKSRKTI 524
D D DE ++ E ++ +
Sbjct: 454 D----LKDVDEWRLERRIATAERWELGQRLL 480
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 33.8 bits (77), Expect = 0.39
Identities = 49/275 (17%), Positives = 97/275 (35%), Gaps = 34/275 (12%)
Query: 440 DFGSSDDEKEPDAYLARVKREAAERDDEDDED-EDESTDEDFNPDQAESDVAEEYDSNPT 498
D + + ++ + D+++E+ EDES +++ D+ E+D+ E +S+
Sbjct: 90 VLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESS-- 147
Query: 499 PTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPM 558
+ D + + +KK +E+ S + +K + SG K K D+
Sbjct: 148 -LEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDE------- 199
Query: 559 SAYMMWFNDMREKIKKDNPGISFTEIAKKG--------GELWKTVSSKEKAEYDEKVNKA 610
N+ E + + E + G+ + S E E DE+ N
Sbjct: 200 ------MNEFLEATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIE 253
Query: 611 KEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSS 670
ED+ + + K+ G D++ K+ KK+ K +E E +
Sbjct: 254 YEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEA---------DSKPEEEDEEDDEQE 304
Query: 671 DSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESES 705
D D+++ D E + S
Sbjct: 305 DDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELS 339
Score = 33.4 bits (76), Expect = 0.44
Identities = 52/277 (18%), Positives = 106/277 (38%), Gaps = 48/277 (17%)
Query: 463 ERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRK 522
DDE++E+EDES +++ D+ E+D+ E +S+ + D + + +KK +E+
Sbjct: 114 SADDEEEEEEDESLEDEMIDDEDEADLFNESESS---LEDLSDDETEDDEEKKMEEEEAG 170
Query: 523 TITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFT 582
S + +K + SG K K D+ N+ E + +
Sbjct: 171 EEKESVEQATREKKFDKSGVDDKFFKLDE-------------MNEFLEATEAEEEAALGD 217
Query: 583 EIAKKG--------GELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGK 634
E + G+ + S E E DE+ N ED+ + + K+ G D++
Sbjct: 218 EDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELED 277
Query: 635 TSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKT 694
K+ KK+ +DS +++ +++ D+D +
Sbjct: 278 DEPDKEAVKKE------------------------ADSKPEEEDEEDDEQEDDQDEEEPP 313
Query: 695 KKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
+ +K + + E + K +K ++K K+
Sbjct: 314 EAAMDKVKLDEPVLEGVDLESPKELSSFEKRQAKLKQ 350
Score = 30.7 bits (69), Expect = 3.0
Identities = 30/123 (24%), Positives = 53/123 (43%), Gaps = 5/123 (4%)
Query: 431 KGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVA 490
K +K + +D DDE + +A + E D+EDDE ED+ +E+ +
Sbjct: 263 KDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKL 322
Query: 491 EEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKD 550
+E P D +S + +K++ K ++ I EK + K G KK+
Sbjct: 323 DE----PVLEGVDLESPKELSSFEKRQAKLKQQIEQLEKENLAPKSWTLK-GEVTAKKRP 377
Query: 551 KNA 553
+N+
Sbjct: 378 QNS 380
Score = 29.2 bits (65), Expect = 9.0
Identities = 17/96 (17%), Positives = 36/96 (37%)
Query: 669 SSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSK 728
S D DD+ S + DE+ + + + E + + + +P + K +
Sbjct: 229 SEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKE 288
Query: 729 SKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
+ + E +DEE DE + + + D+
Sbjct: 289 ADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDE 324
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 33.5 bits (76), Expect = 0.40
Identities = 24/80 (30%), Positives = 38/80 (47%)
Query: 663 IESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKG 722
+E+ S+ + KK ++ SD+ + +++E SESESD E E K K K
Sbjct: 214 LEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEESSESESDEDEDEDNKGKIRKRKT 273
Query: 723 DKSKSKSKKEDEIDDEEYDE 742
D +K K I+ E+ E
Sbjct: 274 DDAKKSRKPHIHIEYEQERE 293
Score = 33.5 bits (76), Expect = 0.40
Identities = 25/91 (27%), Positives = 37/91 (40%), Gaps = 3/91 (3%)
Query: 444 SDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSD 503
DEKE Y+ + E + D+ E E T + SD + E + S+
Sbjct: 196 RQDEKER--YVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEESSE 253
Query: 504 EDSDASEGTKKKKKEKSRKTITISEKPRKSK 534
+SD E K K + RKT ++K RK
Sbjct: 254 SESDEDEDEDNKGKIRKRKTDD-AKKSRKPH 283
Score = 32.7 bits (74), Expect = 0.68
Identities = 33/173 (19%), Positives = 70/173 (40%), Gaps = 9/173 (5%)
Query: 545 KRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGELW-KTVSSKEKAEY 603
K +++K +K ++A N +E + + GI LW K + +EK +
Sbjct: 139 KMNRREKLRERKALNAAKFEKNIEKELMLRLKSGIYGDTPLNVREHLWNKAATEREKRQD 198
Query: 604 DEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFI 663
+++ +E+ ++ +DD + K+KK +K S +E
Sbjct: 199 EKERYVEEEEESDTELEA-------VTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEES 251
Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKS 716
+ S D D+D K ++ R +D+ K++ + E E ++ + + S
Sbjct: 252 SESESDEDEDEDNKGKIRK-RKTDDAKKSRKPHIHIEYEQERENEKIPAVQHS 303
Score = 30.4 bits (68), Expect = 3.4
Identities = 28/138 (20%), Positives = 56/138 (40%), Gaps = 18/138 (13%)
Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDED 479
+E LR+K+ G+ P + E + ++ E++ +E+E+ T+ +
Sbjct: 163 KELMLRLKSGIYGDTP-------LNVREHLWNKAATEREKRQDEKERYVEEEEESDTELE 215
Query: 480 FNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKES 539
D +E + ++ D S T + ++E+S SE +D+++
Sbjct: 216 AVTDDSEKEKTKKKDLEKWLGSDQ-----SMETSESEEEES------SESESDEDEDEDN 264
Query: 540 SGGSTKRKKKDKNAPKKP 557
G KRK D +KP
Sbjct: 265 KGKIRKRKTDDAKKSRKP 282
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 33.7 bits (77), Expect = 0.41
Identities = 24/86 (27%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 667 GSSSDSDDDKKASS----KRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKG 722
GS ++S+ +KK + +RK+ +E +K K KK + ++ E+ + + + ++ S
Sbjct: 1 GSRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKA-KLQAQQASDGTNVP 59
Query: 723 DKSKSKSKKEDEIDDEEYDEPVESTP 748
KS+ KS+K D D+ D TP
Sbjct: 60 KKSEKKSRKRDVEDENPEDFIDPDTP 85
Score = 30.6 bits (69), Expect = 3.8
Identities = 17/69 (24%), Positives = 33/69 (47%), Gaps = 4/69 (5%)
Query: 495 SNPTPTDSDEDSDASEGTKKKK---KEKSRKTITISEKPRKSK-KDKESSGGSTKRKKKD 550
+ + E KKK+ KEK K + ++K K+K + +++S G+ KK +
Sbjct: 4 TESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSE 63
Query: 551 KNAPKKPMS 559
K + K+ +
Sbjct: 64 KKSRKRDVE 72
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 33.4 bits (77), Expect = 0.47
Identities = 15/79 (18%), Positives = 33/79 (41%), Gaps = 12/79 (15%)
Query: 444 SDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSD 503
+ + EP A E + DDE+ ++EDE + + + + + D D D
Sbjct: 108 EELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDD------------DDD 155
Query: 504 EDSDASEGTKKKKKEKSRK 522
+D E + ++++ +
Sbjct: 156 DDIATRERSLERRRRRREW 174
Score = 30.0 bits (68), Expect = 5.5
Identities = 18/85 (21%), Positives = 38/85 (44%), Gaps = 5/85 (5%)
Query: 440 DFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTP 499
D G ++E + + R+ E DD+D+E ++E + + D + D ++ D T
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATR 161
Query: 500 TDSDEDSDASEGTKKKKKEKSRKTI 524
S E ++++ E+ R +
Sbjct: 162 ERSLERRR-----RRREWEEKRAEL 181
Score = 29.6 bits (67), Expect = 8.1
Identities = 15/78 (19%), Positives = 31/78 (39%), Gaps = 3/78 (3%)
Query: 465 DDEDDEDEDE---STDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSR 521
DD D E+E + D+ + + + D EE D + ED + + +R
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATR 161
Query: 522 KTITISEKPRKSKKDKES 539
+ + R+ ++K +
Sbjct: 162 ERSLERRRRRREWEEKRA 179
>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
Length = 815
Score = 33.6 bits (77), Expect = 0.47
Identities = 19/84 (22%), Positives = 36/84 (42%), Gaps = 7/84 (8%)
Query: 686 SDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVE 745
+ E+ K T+K+K +++ +K KK K + KS ++ + EE+ + +E
Sbjct: 13 AAEEGKGGTRKQKAENKEHEGEQSPKKAKKEKKQDDSGNGNGKSAEDAVKEFEEFCKAIE 72
Query: 746 STPPSSDA-------GSDASSSED 762
G D+S S+D
Sbjct: 73 EHLSIEQMREILEENGQDSSGSDD 96
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 33.5 bits (76), Expect = 0.50
Identities = 31/99 (31%), Positives = 45/99 (45%), Gaps = 10/99 (10%)
Query: 430 GKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDV 489
GKG+K N K F S D+ E + + A VK DD DD + D ++DF SD
Sbjct: 669 GKGKKSN-KASFDSDDEMDENEIWSALVKSRPDVEDDSDDSELD-FAEDDF------SDS 720
Query: 490 AEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISE 528
D P D++ SEG+++ +E+ I S
Sbjct: 721 TS--DDEPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSF 757
Score = 30.4 bits (68), Expect = 4.3
Identities = 20/105 (19%), Positives = 48/105 (45%), Gaps = 1/105 (0%)
Query: 434 KPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEY 493
+P+ ++D S+ + D + + + D DDED ++ + ++ ++ +
Sbjct: 698 RPDVEDDSDDSELDFAEDDFSDSTSDDEPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSF 757
Query: 494 DSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKE 538
D +DS +S + + +++KE+ + +S K K K+ K
Sbjct: 758 DGEQDNSDSFAESSEEDESSEEEKEE-EENKEVSAKRAKKKQRKN 801
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 33.2 bits (76), Expect = 0.52
Identities = 23/129 (17%), Positives = 36/129 (27%), Gaps = 9/129 (6%)
Query: 645 KETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESE 704
S + P + + + S D +D+ S + S KK SE
Sbjct: 193 ALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGSSSPAKPTSILKKSAAKRSE 252
Query: 705 SDSGESEKEKKSKSKPKGDKSKS--KSKKE-------DEIDDEEYDEPVESTPPSSDAGS 755
+ S K+ D S KK E E ++ SS +
Sbjct: 253 APSSSKAKKNSRGIPKPRDALSSLVVRKKAAPESTSQSPSSAEPTSESPQTAGNSSLSSL 312
Query: 756 DASSSEDDD 764
S D+
Sbjct: 313 GDYSDSDES 321
Score = 30.5 bits (69), Expect = 3.6
Identities = 33/201 (16%), Positives = 69/201 (34%), Gaps = 34/201 (16%)
Query: 385 GSTRSFDFEIELKSGVLHTFSSIEKEEYGKLFDFIREKKLRVKNTGKGEKPNYKE--DFG 442
G+TR+++ + + E+E G E + +E +
Sbjct: 103 GATRNYEADKLDEEQEERVEKEREEELAGDAM-KKLENRTADSKREMEVLERLEELKELQ 161
Query: 443 SSDDEKEPDAYL---ARVKREAAERDDED--------------------DEDEDESTDED 479
S + + ++ L R +++ E ++E+ +DED DE+
Sbjct: 162 SRRADVDVNSMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEE 221
Query: 480 FNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKES 539
N + + PT S A++ ++ K++K KPR +
Sbjct: 222 DNDNTPSPKSGSSSPAKPT---SILKKSAAKRSEAPSSSKAKKNSRGIPKPRDA-----L 273
Query: 540 SGGSTKRKKKDKNAPKKPMSA 560
S ++K ++ + P SA
Sbjct: 274 SSLVVRKKAAPESTSQSPSSA 294
>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal. The C-terminal domain of
FCP-1 is required for interaction with the carboxy
terminal domain of RAP74. Interaction relies extensively
on van der Waals contacts between hydrophobic residues
situated within alpha-helices in both domains.
Length = 263
Score = 32.7 bits (74), Expect = 0.53
Identities = 26/106 (24%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 660 KEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKS-ESESDSGESEKEKKSKS 718
KE + G SD D +K K+ ++E+ + + +K++ + E +S
Sbjct: 150 KEVDDILGEGSDDSDSEK---KKPPGNEEEQEQAPQPRKQQPPGPRREPDEGTPSSSERS 206
Query: 719 KPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
G + +K +E ++E D ES+ SS+ + SSSE D+
Sbjct: 207 NIGGRGPRGHKRKLNE--EDEEDAESESSFESSNEDEEGSSSEADE 250
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 32.9 bits (75), Expect = 0.53
Identities = 23/106 (21%), Positives = 34/106 (32%), Gaps = 13/106 (12%)
Query: 625 GGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKR 684
G KT +K AKK + + + K K K A S +K
Sbjct: 195 GAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKA-------------AKTAVSAKKA 241
Query: 685 DSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSK 730
AK KK K + + +K K +K +K +K
Sbjct: 242 AKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAK 287
Score = 31.8 bits (72), Expect = 1.2
Identities = 26/99 (26%), Positives = 42/99 (42%), Gaps = 3/99 (3%)
Query: 633 GKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKA 692
KT+ S K A K TA+ A K +K+ ++ + KKA+ + + +KA
Sbjct: 232 AKTAVSAKKAAK--TAAKAAKKAKKTAKKALKKAAKAV-KKAAKKAAKAAAKAAKGAAKA 288
Query: 693 KTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
K K K ++ + K K + PK K+KK
Sbjct: 289 TKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKK 327
Score = 30.6 bits (69), Expect = 3.4
Identities = 28/155 (18%), Positives = 46/155 (29%), Gaps = 20/155 (12%)
Query: 506 SDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWF 565
++A + K + K + K K K+ + + KK K
Sbjct: 197 ANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAK------ 250
Query: 566 NDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESG 625
K KK AKK + K + + KA + +A K ++
Sbjct: 251 -----KAKKT---------AKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAK 296
Query: 626 GGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSK 660
SK+K AK + + KV K
Sbjct: 297 KKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331
Score = 30.2 bits (68), Expect = 4.1
Identities = 11/58 (18%), Positives = 20/58 (34%)
Query: 674 DDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
+ KKA+ S + K K S+ + ++ K +K +K K
Sbjct: 198 NAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKT 255
Score = 30.2 bits (68), Expect = 4.4
Identities = 31/136 (22%), Positives = 48/136 (35%), Gaps = 10/136 (7%)
Query: 606 KVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHK--VKSKEFI 663
N K A K+ + + K KK AK +A A K +
Sbjct: 196 AANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKT 255
Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSK--AKTKKKKEKSESE------SDSGESEKEKK 715
+ KKA+ K + + + +K AK K K K++ + + S K
Sbjct: 256 AKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKA 315
Query: 716 SKSKPKGDKSKSKSKK 731
K KG K+K +KK
Sbjct: 316 PKRGAKGKKAKKVTKK 331
Score = 29.5 bits (66), Expect = 7.6
Identities = 24/157 (15%), Positives = 50/157 (31%), Gaps = 2/157 (1%)
Query: 562 MMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAY 621
MM +++ +I + + K +K+ + + K A A
Sbjct: 177 MMTIDEVTREIGRVLAAVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAV 236
Query: 622 KESGGGQDSDDGKTSKSKKPAK--KKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKAS 679
+ + K K KK + + K + G++ + KA
Sbjct: 237 SAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAK 296
Query: 680 SKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKS 716
K + + SKAK K K ++ + +K++
Sbjct: 297 KKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKKRA 333
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 33.1 bits (76), Expect = 0.55
Identities = 18/108 (16%), Positives = 39/108 (36%), Gaps = 1/108 (0%)
Query: 644 KKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDED-SKAKTKKKKEKSE 702
+ E + +K+ + + D DD A + E+ K KKKK+K +
Sbjct: 7 ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKK 66
Query: 703 SESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPS 750
+ + GE+ + + ++ + E ++ PP+
Sbjct: 67 KKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWKQTQPPT 114
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 32.7 bits (74), Expect = 0.57
Identities = 17/118 (14%), Positives = 42/118 (35%)
Query: 438 KEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNP 497
++ +PD +R E++EDE+ +D ++ ++ EE
Sbjct: 160 NDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGS 219
Query: 498 TPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPK 555
+ D + KK+ +E+ + I+ + +++ S ++ K
Sbjct: 220 DDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVPRNNEESPAKKQK 277
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 31.5 bits (71), Expect = 0.58
Identities = 21/77 (27%), Positives = 34/77 (44%), Gaps = 1/77 (1%)
Query: 429 TGKGEKPNYKEDFGSSDDEKEPDAYLA-RVKREAAERDDEDDEDEDESTDEDFNPDQAES 487
G+ EK +++ +E +A A EAAE D D+ DE+E TDE + E+
Sbjct: 55 AGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEA 114
Query: 488 DVAEEYDSNPTPTDSDE 504
D E +++
Sbjct: 115 DAEAEEADEEEDEEAEA 131
Score = 29.9 bits (67), Expect = 1.9
Identities = 11/52 (21%), Positives = 18/52 (34%)
Query: 459 REAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASE 510
A + +E E DED + E + T DE +D ++
Sbjct: 64 AAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEAD 115
Score = 29.5 bits (66), Expect = 3.1
Identities = 21/93 (22%), Positives = 42/93 (45%), Gaps = 2/93 (2%)
Query: 427 KNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERD-DEDDEDEDESTDEDFNPDQA 485
KN G + E + EK P A A E + + DED DE+ + D D+A
Sbjct: 39 KNYDLGREARDLEWTEAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEAD-AADEA 97
Query: 486 ESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKE 518
+ + + + T ++D +++ ++ + ++ E
Sbjct: 98 DEEEETDEAVDETADEADAEAEEADEEEDEEAE 130
>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936).
This family consists of several hypothetical proteins
from Arabidopsis thaliana and Oryza sativa. The function
of this family is unknown.
Length = 564
Score = 33.3 bits (76), Expect = 0.59
Identities = 25/136 (18%), Positives = 48/136 (35%), Gaps = 5/136 (3%)
Query: 631 DDGKTSKSKKPAKKKETASTAVSPHKV---KSKEFIES--NGSSSDSDDDKKASSKRKRD 685
D T +P E T +P KV +S+ S G+S S + SS R
Sbjct: 156 DSSPTVIGPRPRSFSELNLTDRTPAKVRSSRSELGAPSPSGGTSCPSSSGGRRSSIGSRR 215
Query: 686 SDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVE 745
+ + K + + S S+ + +++ KS KS + + +
Sbjct: 216 LRGSASLRKKVAVLSAPRKPGSRSSDCKSSPRARSSSAKSPFKSSIQRKATKALSKLSLR 275
Query: 746 STPPSSDAGSDASSSE 761
++P + S + +
Sbjct: 276 ASPKDTSKSSKSEVAP 291
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 32.8 bits (75), Expect = 0.63
Identities = 12/45 (26%), Positives = 18/45 (40%)
Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEK 700
K+ K E G D+D + + + + K K KKK K
Sbjct: 322 KLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRK 366
Score = 32.1 bits (73), Expect = 1.2
Identities = 28/132 (21%), Positives = 54/132 (40%), Gaps = 2/132 (1%)
Query: 472 EDESTDEDFN-PDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKP 530
E+ES D++ + + E++ A DS +D KKK +K R T + +P
Sbjct: 242 EEESIDDELDVLREIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQK-RTTRRVKMRP 300
Query: 531 RKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTEIAKKGGE 590
++K E S + ++ +K +S ++ + + E + ++ S E+ KK
Sbjct: 301 VRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKV 360
Query: 591 LWKTVSSKEKAE 602
K K
Sbjct: 361 KKKPRKRKVNPV 372
Score = 30.5 bits (69), Expect = 3.4
Identities = 20/103 (19%), Positives = 38/103 (36%), Gaps = 3/103 (2%)
Query: 622 KESGGGQDSDDGKTSKSKKPAKKKET--ASTAVSPHKVKSKEFI-ESNGSSSDSDDDKKA 678
+E Q ++D KK +K+ T K + + ES+ D+K+
Sbjct: 268 EEVVDSQAANDEPRRVFKKKGQKRTTRRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKS 327
Query: 679 SSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPK 721
S+ + + E+S+ E + + K+K K K
Sbjct: 328 LSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 33.2 bits (76), Expect = 0.66
Identities = 25/141 (17%), Positives = 58/141 (41%), Gaps = 16/141 (11%)
Query: 602 EYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSK- 660
E + A D ++ ++ ++ D+ S +K K+E A + K +
Sbjct: 17 ELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARLKELKKQKKQEI 76
Query: 661 -EFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSK 719
+ +E ++ D+D ++K K + K ++E + ++ ++ + +K
Sbjct: 77 QKILEQQNAAIDADM--------------NNKGKGRLKYLLQQTEIFAHFAKGDQSASAK 122
Query: 720 PKGDKSKSKSKKEDEIDDEEY 740
+ + SK +E +DEEY
Sbjct: 123 KAKGRGRHASKLTEEEEDEEY 143
Score = 32.5 bits (74), Expect = 0.98
Identities = 31/156 (19%), Positives = 59/156 (37%), Gaps = 12/156 (7%)
Query: 463 ERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNP---TPTDSDEDSDASEGTKKK--KK 517
E + + + +E +E+ + + D P D ++D +A K + K+
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60
Query: 518 EKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNP 577
EK+R K +K ++ ++ D N K Y++ ++ K +
Sbjct: 61 EKARLK---ELKKQKKQEIQKILEQQNAAIDADMNNKGKGRLKYLLQQTEIFAHFAKGDQ 117
Query: 578 GISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKED 613
S AKK + S + E DE+ K +ED
Sbjct: 118 SAS----AKKAKGRGRHASKLTEEEEDEEYLKEEED 149
Score = 30.5 bits (69), Expect = 4.7
Identities = 13/54 (24%), Positives = 27/54 (50%)
Query: 439 EDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEE 492
E + +E+E + L V R A D+D+ ++ +++ + ++AES E
Sbjct: 3 EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAE 56
>gnl|CDD|215584 PLN03113, PLN03113, DNA ligase 1; Provisional.
Length = 744
Score = 33.0 bits (75), Expect = 0.70
Identities = 32/121 (26%), Positives = 54/121 (44%), Gaps = 9/121 (7%)
Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
+++ K+ +T S + SP K K E ++N ++ + ++ + DS
Sbjct: 13 SNARAAAKKKQPQTQSQSSSPKKRKIGETQDANLGKTNVSEGTLPKTEDTIEPKSDSA-- 70
Query: 694 TKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSK----SKKEDEIDDEE--YDEPVEST 747
K + S DS K+ ++ SKPK D+ KSK KK ++ D E+ Y E E
Sbjct: 71 -KPRSSTSSIAEDSKTGTKKAQTLSKPKKDEMKSKIGLLKKKPNDFDPEKVAYWEKGERV 129
Query: 748 P 748
P
Sbjct: 130 P 130
>gnl|CDD|220710 pfam10351, Apt1, Golgi-body localisation protein domain. This is
the C-terminus of a family of proteins conserved from
plants to humans. The plant members are localised to the
Golgi proteins and appear to regulate membrane
trafficking, as they are required for rapid vesicle
accumulation at the tip of the pollen tube. The
C-terminus probably contains the Golgi localisation
signal and it is well-conserved.
Length = 451
Score = 32.6 bits (75), Expect = 0.76
Identities = 16/76 (21%), Positives = 29/76 (38%), Gaps = 2/76 (2%)
Query: 623 ESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKR 682
+S ++ D +S S + T+ ++ S + S SSD DK+ S K
Sbjct: 306 DSSKREEDSDISSSSSSGSRRSSSTSRSSSSSSSLLS--SSSILSKSSDKSKDKRFSLKL 363
Query: 683 KRDSDEDSKAKTKKKK 698
+ E+S +
Sbjct: 364 SKSEKEESDDLEEMIS 379
>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1. Rec10 / Red1 is involved in meiotic
recombination and chromosome segregation during
homologous chromosome formation. This protein localises
to the synaptonemal complex in S. cerevisiae and the
analogous structures (linear elements) in S. pombe. This
family is currently only found in fungi.
Length = 706
Score = 32.6 bits (74), Expect = 0.86
Identities = 52/337 (15%), Positives = 95/337 (28%), Gaps = 33/337 (9%)
Query: 443 SSDDEKEPDAYLARVKREAAERDDEDD--EDEDESTDEDFNPDQAESDVAEEYDSNPTPT 500
S+ D + L +R + D ED+ ES + + S+ P
Sbjct: 280 SAQDNHYDNTQLQTPERSTKRKSPIWDLKEDQKESKIKSGTNLKLSSE------KESIPE 333
Query: 501 DSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSA 560
S + E + + +K + + S+ K+ E S K K+
Sbjct: 334 TSYVNVLEEEQSPLVRLQKRKLARSTSKTLESLKEVFEDQASSVKNKQAQSEENLNESPK 393
Query: 561 YMMWFNDMREKIKKDNPGISFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKA 620
+ +K + K E K K K + +
Sbjct: 394 TPIAVTG-DPHLKDPTIIAGKKLMNKLTSEKINNPVKVVKVS---KYKGNKSEKKRDINV 449
Query: 621 YKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSD-------SD 673
S KK K K+T P KSK+ + +N S + +
Sbjct: 450 LDTIFASPVS----KELRKKVGKSKQTKLKNFKPVPNKSKKQLANNNSQNIKSKKVVKAK 505
Query: 674 DDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSK------SKPKGDKSKS 727
+ KA+ + + K K K+ S S S + + K + +
Sbjct: 506 TNNKANLQDVGECSSPPNNKEKNDKQTSTSSSVLKSDRSSIEVRNPNANVKKLEDTTYNA 565
Query: 728 KSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
K + + + S DA + A + +
Sbjct: 566 KFPTVSKNNAYTLVDI----STSEDAVNSADDTRSVN 598
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 31.3 bits (71), Expect = 0.98
Identities = 19/67 (28%), Positives = 28/67 (41%)
Query: 660 KEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSK 719
K F E + S + ++ ++ S+ D K+K+K E E EK K K
Sbjct: 86 KIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKK 145
Query: 720 PKGDKSK 726
PK K K
Sbjct: 146 PKPSKPK 152
Score = 30.9 bits (70), Expect = 1.3
Identities = 23/79 (29%), Positives = 34/79 (43%), Gaps = 12/79 (15%)
Query: 456 RVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKK 515
++ RE + DDE++E E+ESTDE D E ++ E K K
Sbjct: 86 KIFRELSSSDDEEEETEEESTDETEQEDPPE------------TKTESKEKKKREVPKPK 133
Query: 516 KKEKSRKTITISEKPRKSK 534
+++ KT KP K K
Sbjct: 134 TEKEKPKTEPKKPKPSKPK 152
Score = 28.6 bits (64), Expect = 6.6
Identities = 17/61 (27%), Positives = 30/61 (49%), Gaps = 5/61 (8%)
Query: 498 TPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKP 557
+ +D +E+ E T + ++E +T T S++ +K + K K+K K PKKP
Sbjct: 92 SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPK-----PKTEKEKPKTEPKKP 146
Query: 558 M 558
Sbjct: 147 K 147
>gnl|CDD|151958 pfam11521, TFIIE-A_C-term, C-terminal general transcription factor
TFIIE alpha. TFIIE is compiled of two subunits, alpha
and beta. This family of proteins are the C terminal
domain of the alpha subunit of the protein which is the
largest subunit and contains several functional domains
which are important for basal transcription and cell
growth. The C terminal end of the protein binds directly
to the amino-terminal PH domain of p62/Tfb1 (of IIH)
which is involved in the recruitment of the general
transcription factor IIH to the transcription
preinitiation complex. P53 competes for the same binding
site as TFIIE alpha which shows their structural
similarity. Like p53, TFIIE alpha 336-439 can activate
transcription in vivo.
Length = 86
Score = 29.8 bits (67), Expect = 1.0
Identities = 15/42 (35%), Positives = 21/42 (50%)
Query: 703 SESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPV 744
SESD+ ES+ + S P G + ED+ +E DEP
Sbjct: 1 SESDTSESDDDSPSPEPPAGQSVDEEEDDEDDEFEEVEDEPT 42
Score = 27.9 bits (62), Expect = 5.6
Identities = 7/37 (18%), Positives = 19/37 (51%)
Query: 443 SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDED 479
+S+ + + + + E +D++D++ +E DE
Sbjct: 5 TSESDDDSPSPEPPAGQSVDEEEDDEDDEFEEVEDEP 41
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 32.5 bits (74), Expect = 1.1
Identities = 25/119 (21%), Positives = 45/119 (37%), Gaps = 15/119 (12%)
Query: 440 DFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAE-SDVAEEYDSNPT 498
D S D E DD +ED DE +D PD E S+ E
Sbjct: 211 DMLGSMDMAEETG------------DDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSD 258
Query: 499 PTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKP 557
++ DE ++A++G + ++ E + + +S +D E+ G + ++
Sbjct: 259 ESEEDE-AEATDG-EGEEGEMDAAEASEDSESDESDEDTETPGEDARPATPFTELMEEV 315
Score = 31.4 bits (71), Expect = 2.3
Identities = 22/83 (26%), Positives = 36/83 (43%), Gaps = 4/83 (4%)
Query: 432 GEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDE-DESTD---EDFNPDQAES 487
G + + D D+ + D + +AAE ++ + DE DE T+ ED P +
Sbjct: 250 GREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDEDTETPGEDARPATPFT 309
Query: 488 DVAEEYDSNPTPTDSDEDSDASE 510
++ EE D + DE A E
Sbjct: 310 ELMEEVDYKVFTREFDEIVLAEE 332
Score = 31.0 bits (70), Expect = 2.5
Identities = 23/98 (23%), Positives = 40/98 (40%), Gaps = 8/98 (8%)
Query: 659 SKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGE-SEKEKKSK 717
+ IE + D DDD+ D++EDS+A ++ + ESE D E ++ E +
Sbjct: 223 GDDGIEEDADEEDGDDDQP-------DNNEDSEAGREESEGSDESEEDEAEATDGEGEEG 275
Query: 718 SKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGS 755
+ S+ E + D E E P ++
Sbjct: 276 EMDAAEASEDSESDESDEDTETPGEDARPATPFTELME 313
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 32.3 bits (73), Expect = 1.1
Identities = 35/160 (21%), Positives = 67/160 (41%), Gaps = 6/160 (3%)
Query: 604 DEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFI 663
D K +++ED A + +E Q D K + A+ K + + +
Sbjct: 202 DLKERESQEDAKRAQQLKEELDKKQIDAD-KAQQKADFAQDNADKQRDEVRQKQQEAKNL 260
Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSG--ESEKEKKSKSKPK 721
+S +DK+ + +KR+ E ++ + KK E++ D + ++E K+ K
Sbjct: 261 PKPADTSSPKEDKQVAENQKREI-EKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEA 319
Query: 722 GDKSKSKSKKEDEI--DDEEYDEPVESTPPSSDAGSDASS 759
DK KK + + D ++ VE+ P S + + SS
Sbjct: 320 EDKELEAQKKREPVAEDLQKTKPQVEAQPTSLNEDAIDSS 359
Score = 29.2 bits (65), Expect = 10.0
Identities = 26/115 (22%), Positives = 47/115 (40%), Gaps = 9/115 (7%)
Query: 455 ARVKREAAERDDEDDEDEDE-STDEDFNPDQAESDVAE------EYDSNPTPTDSDEDSD 507
A+ ++ E D+ D D+ DF D A+ E E + P P D+ +
Sbjct: 212 AKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKE 271
Query: 508 ASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKR--KKKDKNAPKKPMSA 560
+ + +K+E + I I + ++ K K+ K+ K +K A K + A
Sbjct: 272 DKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEA 326
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 31.9 bits (73), Expect = 1.1
Identities = 17/75 (22%), Positives = 29/75 (38%), Gaps = 4/75 (5%)
Query: 659 SKEFIESNGSSSDSDDDKK----ASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEK 714
KE ++S DK KR+ + D ++ +KK KS+ E + ++ K
Sbjct: 208 YKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESK 267
Query: 715 KSKSKPKGDKSKSKS 729
K+ K K
Sbjct: 268 GVKALKKVVAKGMKK 282
>gnl|CDD|234027 TIGR02824, quinone_pig3, putative NAD(P)H quinone oxidoreductase,
PIG3 family. Members of this family are putative
quinone oxidoreductases that belong to the broader
superfamily (modeled by Pfam pfam00107) of
zinc-dependent alcohol (of medium chain length)
dehydrogenases and quinone oxiooreductases. The
alignment shows no motif of conserved Cys residues as
are found in zinc-binding members of the superfamily,
and members are likely to be quinone oxidoreductases
instead. A member of this family in Homo sapiens, PIG3,
is induced by p53 but is otherwise uncharacterized
[Unknown function, Enzymes of unknown specificity].
Length = 325
Score = 31.8 bits (73), Expect = 1.1
Identities = 19/58 (32%), Positives = 25/58 (43%), Gaps = 9/58 (15%)
Query: 578 GISFTEI--AK-KGGELWKTVSSKEKAEYDEK------VNKAKEDYNEALKAYKESGG 626
GI T I AK G ++ T S EK E +N +ED+ E +KA G
Sbjct: 151 GIGTTAIQLAKAFGARVFTTAGSDEKCAACEALGADIAINYREEDFVEVVKAETGGKG 208
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 32.5 bits (74), Expect = 1.1
Identities = 22/90 (24%), Positives = 37/90 (41%), Gaps = 4/90 (4%)
Query: 675 DKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDE 734
+K+ + + S ++K +ES + S+K K K KG + +K +
Sbjct: 381 EKEMDAFSIQASSAGLIGSSEKSLGSNESSPAASNSDKGSKKK---KGKSTSTKGGTAES 437
Query: 735 IDDEEYDEPVESTPPSSDAGSDASSSEDDD 764
I D+E D P + + G D SS D
Sbjct: 438 IPDDEEDAPKKGKK-NQKKGRDKSSKVPSD 466
Score = 32.1 bits (73), Expect = 1.2
Identities = 24/108 (22%), Positives = 42/108 (38%), Gaps = 11/108 (10%)
Query: 664 ESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEK-KSKSKPKG 722
SN SS + + K S K+K S + + E G+ ++K + KS
Sbjct: 405 GSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVP 464
Query: 723 DKSKSKSKKE-DEIDDEEYDEP---------VESTPPSSDAGSDASSS 760
SK+ KKE + ++ + P +E P + G++ S
Sbjct: 465 SDSKAGGKKESVKSQEDNNNIPPEEWVMKKILEWVPDLEEDGTEDPGS 512
Score = 32.1 bits (73), Expect = 1.3
Identities = 20/98 (20%), Positives = 34/98 (34%), Gaps = 4/98 (4%)
Query: 447 EKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQA----ESDVAEEYDSNPTPTDS 502
EKE DA+ + E +ES+ N D+ +
Sbjct: 381 EKEMDAFSIQASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPD 440
Query: 503 DEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESS 540
DE+ +G K +KK + + + S+ KK+ S
Sbjct: 441 DEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKS 478
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 32.2 bits (73), Expect = 1.1
Identities = 28/142 (19%), Positives = 54/142 (38%), Gaps = 32/142 (22%)
Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDED 479
E K R + +GE+ ++D +DD+++ + EA E + + E + E +ED
Sbjct: 635 DENKSRHQQLFEGEE-EDEDDLEETDDDED--------ECEAIEDSESESESDGEDGEED 685
Query: 480 FNPDQAES-----DVAEEYDS------NPTPTDSDEDSDASEGTKKKK------------ 516
D AE+ + + N DS+ EG ++
Sbjct: 686 EQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDEEGMDDEQMMRLDTYLAQIF 745
Query: 517 KEKSRKTITISEKPRKSKKDKE 538
KEK + E ++++ K+
Sbjct: 746 KEKKERIQAGGETKKEAQSQKQ 767
>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
structure and dynamics].
Length = 395
Score = 32.1 bits (73), Expect = 1.1
Identities = 13/62 (20%), Positives = 24/62 (38%)
Query: 674 DDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKED 733
+K K+ + SD + E S S+ E + + SK + ++ K +
Sbjct: 72 TEKGKKEKKPKVSDRMDLDNENVQLEMLYSISNEREIRQLRFGGSKVQNPHEGARVKNIN 131
Query: 734 EI 735
EI
Sbjct: 132 EI 133
>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein.
The proteins in this family are poorly characterized,
but an investigation has indicated that the immediate
early protein is required the down-regulation of MHC
class I expression in dendritic cells. Human herpesvirus
6 immediate early protein is also referred to as U90.
Length = 993
Score = 32.4 bits (73), Expect = 1.2
Identities = 23/85 (27%), Positives = 37/85 (43%), Gaps = 2/85 (2%)
Query: 632 DGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSK 691
D K S KK TA +K + I+S+ +D++ SS+ + + D +
Sbjct: 648 DKKKDISTNRKVKKRTAKRKTVGYKTDKSKKIKSDSLPTDTNV-IVISSESEDEEDGFNI 706
Query: 692 AKTKKKKEKSESESDSGESEKEKKS 716
K + K+K +SE S ES E
Sbjct: 707 IKKSQLKKKIKSELKS-ESSSESDD 730
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 31.8 bits (73), Expect = 1.3
Identities = 18/82 (21%), Positives = 33/82 (40%), Gaps = 13/82 (15%)
Query: 653 SPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEK 712
P + +E E + + KK K++ KA+ K +KE++E + K
Sbjct: 390 KPLLAEGEEE-EGENGNLSPAERKKLRKKQR-------KAEKKAEKEEAEK-----AAAK 436
Query: 713 EKKSKSKPKGDKSKSKSKKEDE 734
+K + K ++KK D
Sbjct: 437 KKAEAAAKKAKGPDGETKKVDP 458
Score = 31.1 bits (71), Expect = 2.9
Identities = 14/70 (20%), Positives = 31/70 (44%)
Query: 675 DKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDE 734
+ + ++ S A+ KK ++K E E+ +K+ +K K + + K+K D
Sbjct: 392 LLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDG 451
Query: 735 IDDEEYDEPV 744
+ +P+
Sbjct: 452 ETKKVDPDPL 461
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 31.7 bits (72), Expect = 1.4
Identities = 22/149 (14%), Positives = 55/149 (36%), Gaps = 15/149 (10%)
Query: 611 KEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSS 670
K + K + Q + K K+ ++ ++P + +
Sbjct: 276 KYPLTDEFKEHHAEQRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELH--------A 327
Query: 671 DSDDDKKASSKRKRDSDEDSKAKTKKKKE-KSESESDSGESEKEKKSKSKPKGDKSKSKS 729
++ + KK + K ++ K +K++ + E ++ + ++++ + K+K
Sbjct: 328 ENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKK 387
Query: 730 KKEDEIDDEEYDEPVES-----TPPSSDA 753
K + E D P E+ +PP +A
Sbjct: 388 KGLIDASPNE-DTPSENEESKGSPPQVEA 415
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 31.3 bits (71), Expect = 1.5
Identities = 19/73 (26%), Positives = 34/73 (46%)
Query: 691 KAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPS 750
K T +K + +S + EK + + KP+ D K++ + D ++EE + ES +
Sbjct: 54 KRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASAN 113
Query: 751 SDAGSDASSSEDD 763
S + D DD
Sbjct: 114 SLSDIDNEDDMDD 126
>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter. The Monovalent
Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
CPA1 family is a large family of proteins derived from
Gram-positive and Gram-negative bacteria, blue green
bacteria, yeast, plants and animals. Transporters from
eukaryotes have been functionally characterized, and all
of these catalyze Na+:H+ exchange. Their primary
physiological functions may be in (1) cytoplasmic pH
regulation, extruding the H+ generated during
metabolism, and (2) salt tolerance (in plants), due to
Na+ uptake into vacuoles. This model is specific for the
fungal members of this family [Transport and binding
proteins, Cations and iron carrying compounds].
Length = 810
Score = 31.8 bits (72), Expect = 1.5
Identities = 55/316 (17%), Positives = 102/316 (32%), Gaps = 52/316 (16%)
Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAA---ERDDEDDEDEDEST 476
+KLR + E D G +KE +A+ A + E+
Sbjct: 512 WRQKLRNNK--EMEDDIEMNDLGRERLQKEKEAHAATFDLSTTTTTQLGTENGRGGGLEE 569
Query: 477 DEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKD 536
N + +E ++ D+ E SR SE +
Sbjct: 570 RSKTN----IKERSENVNTIYGLDKLARDT---ENRDVTYVPTSRYDGIESEIDDVYTYE 622
Query: 537 KESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMR-----------EKIKKDNPGISFTEIA 585
+S ++ +++ K ++ AY+ + D + E + + G E+
Sbjct: 623 NDSESIASSERRRIKKLREEEQQAYIAYTEDNQVIIENRQGEILEYVDIHDRGARDAEVG 682
Query: 586 KKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKESGGGQD----SDDGKTSKSKKP 641
G K S + + N+AK+ A K G D + G+ + +
Sbjct: 683 VHNGGRLKRALSPPLEKLHQITNEAKKSKYYAYKV------GNDLIIEDESGEVFRRYR- 735
Query: 642 AKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKA------SSKRKRDSDEDSKAKTK 695
+SPH K K I+ S S D++KA +R S+ +
Sbjct: 736 ----------ISPHGGKRK--IKKRNDSVVSVDEEKAIEGPSRVPERGNHDLLHSEDEMA 783
Query: 696 KKKEKSESESDSGESE 711
+ +SE+ D +S+
Sbjct: 784 DDEAESENMDDYEDSD 799
>gnl|CDD|241458 cd13304, PH2_Rtt106, Histone chaperone RTT106, regulator of Ty1
transposition protein 106 Pleckstrin homology (PH)
domain. Rtt106 is a histone chaperone. Rtt106 contains
an N-terminal homodimerization domain and two C-terminal
pleckstrin-homology (PH) domains (PH1 and PH2). The
binding of Rtt106 to H3K56-acetylated (H3-H4)2 tetramers
contributes to nucleosome assembly in terms of DNA
replication, gene silencing and maintenance of genomic
stability. The N-terminal domain homodimerizes
homodimerizes and interacts with H3-H4 independently of
acetylation while the double PH domain binds the
K56-containing region of H3. Rtt106 also interacts with
both the SWI/SNF and RSC chromatin remodeling complexes
and is involved in their cell-cycle dependent
recruitment to histone gene pairs regulated by the HIR
co-repressor complex (HTA1-HTB1, HHT1-HHF1, and
HHT2-HHF2). This cd contains the second PH domain
repeat. PH domains have diverse functions, but in
general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 89
Score = 29.2 bits (66), Expect = 1.7
Identities = 19/84 (22%), Positives = 38/84 (45%), Gaps = 7/84 (8%)
Query: 342 VTCSYKAAAGYIYPLERGFIFIHKPPIHI-RFEEIATVNFARSGGSTRSFDFEIELKSGV 400
V A G + L+ IF K PI + +I ++++ S + +F+ + +K+
Sbjct: 7 VEAHKGAKEGVLLFLQDYIIFGFKKPILLFDISQIESISY--SNITRLTFNLNLTVKNAK 64
Query: 401 LHT----FSSIEKEEYGKLFDFIR 420
FS I++ + + DFI+
Sbjct: 65 KEEETLEFSMIDQAFFQVIDDFIK 88
>gnl|CDD|235040 PRK02463, PRK02463, OxaA-like protein precursor; Provisional.
Length = 307
Score = 31.2 bits (71), Expect = 1.7
Identities = 18/68 (26%), Positives = 27/68 (39%), Gaps = 22/68 (32%)
Query: 489 VAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKK 548
+AEE+ NP K K RK +T S+ + K+KK
Sbjct: 259 IAEEFAKNPP--------------KAYKSPSGRKDVTPSQNQAIASL--------PKKKK 296
Query: 549 KDKNAPKK 556
K++NA K+
Sbjct: 297 KNRNAGKQ 304
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 31.7 bits (72), Expect = 1.8
Identities = 17/80 (21%), Positives = 40/80 (50%), Gaps = 2/80 (2%)
Query: 459 REAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKE 518
RE E +DE++E++++ D+ + + + D + ++S D + + + +
Sbjct: 132 REDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDAD 191
Query: 519 KSRKTITISEKPRKSKKDKE 538
S+K + E+P+ KK +E
Sbjct: 192 LSKKDVL--EEPKMFKKPEE 209
Score = 29.4 bits (66), Expect = 8.3
Identities = 19/62 (30%), Positives = 27/62 (43%)
Query: 458 KREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKK 517
+ E + DD+DD +DE DED E + E + D +ED DA K +
Sbjct: 140 EEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDADLSKKDVLE 199
Query: 518 EK 519
E
Sbjct: 200 EP 201
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 31.2 bits (71), Expect = 1.8
Identities = 15/80 (18%), Positives = 33/80 (41%), Gaps = 3/80 (3%)
Query: 688 EDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK---EDEIDDEEYDEPV 744
E+++ K KKE + D+ +S S + D + + K+ E++ + + D+
Sbjct: 64 EEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSD 123
Query: 745 ESTPPSSDAGSDASSSEDDD 764
S+ S +D+
Sbjct: 124 SSSDSDSSDDDSDDDDSEDE 143
Score = 28.9 bits (65), Expect = 7.9
Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 3/51 (5%)
Query: 460 EAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASE 510
E E DDE+DE + + +ED A+ + + +D D D D SE
Sbjct: 94 EGDEDDDEEDEIKRKRIEEDARNSDADDSDSSSDSDS---SDDDSDDDDSE 141
>gnl|CDD|238859 cd01821, Rhamnogalacturan_acetylesterase_like,
Rhamnogalacturan_acetylesterase_like subgroup of
SGNH-hydrolases. Rhamnogalacturan acetylesterase removes
acetyl esters from rhamnogalacturonan substrates, and
renders them susceptible to degradation by
rhamnogalacturonases. Rhamnogalacturonans are highly
branched regions in pectic polysaccharides, consisting
of repeating -(1,2)-L-Rha-(1,4)-D-GalUA disaccharide
units, with many rhamnose residues substituted by
neutral oligosaccharides such as arabinans, galactans
and arabinogalactans. Extracellular enzymes
participating in the degradation of plant cell wall
polymers, such as Rhamnogalacturonan acetylesterase,
would typically be found in saprophytic and plant
pathogenic fungi and bacteria.
Length = 198
Score = 30.6 bits (70), Expect = 1.8
Identities = 10/21 (47%), Positives = 12/21 (57%)
Query: 373 EEIATVNFARSGGSTRSFDFE 393
I VN A+ G S+RSF E
Sbjct: 33 TGITVVNHAKGGRSSRSFRDE 53
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 31.6 bits (70), Expect = 1.8
Identities = 23/98 (23%), Positives = 52/98 (53%)
Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
K KS+K +K+ A + KV +K+ E ++++ K+A+ + + + + +
Sbjct: 69 KVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQER 128
Query: 694 TKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
+K++EK E++ ++EK+K+ K + + K+KK
Sbjct: 129 IQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKK 166
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 31.6 bits (72), Expect = 2.0
Identities = 22/84 (26%), Positives = 38/84 (45%), Gaps = 6/84 (7%)
Query: 431 KGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVA 490
K + +Y D + + DA R E + ++E++EDEDE ++ ++ +
Sbjct: 365 KETEQDYILD--AFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEH----SDDEEF 418
Query: 491 EEYDSNPTPTDSDEDSDASEGTKK 514
EE D DSD +S + G K
Sbjct: 419 EEDDVESKYEDSDGNSSLAVGYKN 442
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 31.4 bits (71), Expect = 2.0
Identities = 19/117 (16%), Positives = 38/117 (32%), Gaps = 3/117 (2%)
Query: 620 AYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKAS 679
A + S + + + PA +S +P + E SD +S
Sbjct: 285 ARRISTPMTEEIKSWQTPLQTPA---MYSSDYQAPKPEPIYTWEELLRERFPSDLFAISS 341
Query: 680 SKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEID 736
DS K+K+ + + E E S + + K ++E++ +
Sbjct: 342 LPDSDSEASDSGPTRKRKRRRVPPLPEYSSDEDEDDSDEDEVDYEKERKRRREEDKN 398
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 31.3 bits (71), Expect = 2.1
Identities = 18/70 (25%), Positives = 27/70 (38%), Gaps = 11/70 (15%)
Query: 425 RVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQ 484
+ K K K D + E D DD+DDED ES +D + D
Sbjct: 228 QPKKQAKSSKRRTIAQIDGIDSDDEGD-----------GSDDDDDEDAIESDLDDSDDDV 276
Query: 485 AESDVAEEYD 494
++ D + +D
Sbjct: 277 SDEDGEDLFD 286
Score = 30.5 bits (69), Expect = 3.5
Identities = 17/70 (24%), Positives = 29/70 (41%), Gaps = 10/70 (14%)
Query: 443 SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDS 502
++ + +R A+ D D +DE + +D+D + D ESD DS
Sbjct: 223 MKVLKQPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESD----------LDDS 272
Query: 503 DEDSDASEGT 512
D+D +G
Sbjct: 273 DDDVSDEDGE 282
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 31.3 bits (71), Expect = 2.2
Identities = 20/89 (22%), Positives = 34/89 (38%), Gaps = 9/89 (10%)
Query: 641 PAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEK 700
+E V ++KE K +KRK+D+ D K+K K K ++
Sbjct: 172 SDSDQEAVLPLVKKSPSEAKEEKAEERKQE-----SKKGAKRKKDASGDDKSK-KAKTDR 225
Query: 701 SESESDSGESEKEKKSKSKPKGDKSKSKS 729
S + +KKS +++SK
Sbjct: 226 DVST---STAASQKKSSDLESKLEAQSKE 251
Score = 30.5 bits (69), Expect = 4.5
Identities = 14/60 (23%), Positives = 30/60 (50%)
Query: 497 PTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKK 556
+P+++ E+ + KK K +K + +K +K+K D++ S + +KK + K
Sbjct: 185 KSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLESK 244
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 31.1 bits (70), Expect = 2.3
Identities = 24/135 (17%), Positives = 64/135 (47%), Gaps = 1/135 (0%)
Query: 598 KEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKV 657
++A E++ + +++ +A + K++ + + + ++ A+K A
Sbjct: 94 PKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARK-AAAEQKKKAEAA 152
Query: 658 KSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSK 717
K+K E+ + ++ KKA K + +KA+ K+K+E+E+ + + + +++
Sbjct: 153 KAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAE 212
Query: 718 SKPKGDKSKSKSKKE 732
+K K +K + +E
Sbjct: 213 AKAKAEKKAEAAAEE 227
Score = 30.3 bits (68), Expect = 4.3
Identities = 27/143 (18%), Positives = 60/143 (41%), Gaps = 1/143 (0%)
Query: 598 KEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKV 657
KE+ + E+ +A+E +A K+ + K + AK K A A
Sbjct: 108 KERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAA 167
Query: 658 KSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKK-KKEKSESESDSGESEKEKKS 716
++K+ E +++ K ++ K+ ++ ++KA +K K E ++E +
Sbjct: 168 EAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEE 227
Query: 717 KSKPKGDKSKSKSKKEDEIDDEE 739
K+ + K+ +K+K + +
Sbjct: 228 KAAAEKKKAAAKAKADKAAAAAK 250
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 31.5 bits (71), Expect = 2.3
Identities = 14/64 (21%), Positives = 25/64 (39%), Gaps = 1/64 (1%)
Query: 689 DSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTP 748
KA + K++ E + E + E S +G++ K ++E ID + P
Sbjct: 790 TIKAAARAKQKPKEKGPNDKEIKIESPSVET-EGERCTIKQREEKGIDAPAILNVKKKKP 848
Query: 749 PSSD 752
D
Sbjct: 849 YKVD 852
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 30.0 bits (68), Expect = 2.4
Identities = 18/83 (21%), Positives = 42/83 (50%), Gaps = 1/83 (1%)
Query: 649 STAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSG 708
A K +S S + + D+D +++++ KR + KAK K +++K +++ +
Sbjct: 26 KKASKKKKKRSNSSPGSTSNEENEDEDDESTAESKR-PEGRKKAKEKLRRDKLKAKKEEA 84
Query: 709 ESEKEKKSKSKPKGDKSKSKSKK 731
E EKEK+ + +++ + +
Sbjct: 85 EKEKEKEERFMKALAEAEKERAE 107
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 30.5 bits (69), Expect = 2.4
Identities = 25/97 (25%), Positives = 41/97 (42%), Gaps = 9/97 (9%)
Query: 418 FIREKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAE------RDDEDDED 471
I +K + K ++ D +++ ++K AE DE++E+
Sbjct: 121 GINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEE 180
Query: 472 EDESTDEDFNPDQAESD---VAEEYDSNPTPTDSDED 505
E+E DEDF+ D + D AE Y N D D+D
Sbjct: 181 EEEEEDEDFDDDDDDDDDDYNAENYFDNGEDDDYDDD 217
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 31.0 bits (70), Expect = 2.5
Identities = 13/96 (13%), Positives = 38/96 (39%), Gaps = 7/96 (7%)
Query: 654 PHKVKSKEFIESNGSSSDSDDDK-------KASSKRKRDSDEDSKAKTKKKKEKSESESD 706
P K K G SD D++ K+D E+ +A +++ + +
Sbjct: 173 PMKPAFKPERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKEERRAAVERRLAELVEMIN 232
Query: 707 SGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDE 742
E+ + + + + +++ + +++ E+ ++
Sbjct: 233 WNLEERRRDLRKEQELEENVERDSDDEDEHGEDSED 268
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 31.2 bits (70), Expect = 2.5
Identities = 24/104 (23%), Positives = 48/104 (46%), Gaps = 9/104 (8%)
Query: 668 SSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEK-EKKSKSKPKGDKSK 726
+ ++D + +SD S ++ ++ SESE +S +S+ ++ S SK ++
Sbjct: 138 KNESGNNDHATDKENLLESDASSSNDSESEESDSESEIESSDSDHDDENSDSKLDNLRNY 197
Query: 727 SKSKKEDE--------IDDEEYDEPVESTPPSSDAGSDASSSED 762
S K+DE DD + E ++ P ++ S +S + D
Sbjct: 198 IVSLKKDEADAESVLSSDDNDSIEEIKYDPHETNKESGSSETID 241
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 30.8 bits (70), Expect = 2.8
Identities = 18/99 (18%), Positives = 36/99 (36%), Gaps = 6/99 (6%)
Query: 456 RVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDS---DEDSDASEGT 512
R +R+ E+ DE + E E+ PD + D E+ P D +
Sbjct: 308 RRRRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDP 367
Query: 513 KKKKKEKSRKTITISE--KPRK-SKKDKESSGGSTKRKK 548
+++ + T + + R + + G S +R +
Sbjct: 368 QEEADGQGSSTDPAGDIFRIRVLAPPQARARGASGRRSR 406
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 30.9 bits (70), Expect = 2.8
Identities = 19/86 (22%), Positives = 38/86 (44%), Gaps = 4/86 (4%)
Query: 657 VKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKS 716
K E ++SN D DK+ K + + K + K E+++ K K+
Sbjct: 350 TKLYEEVKSNT---DLSGDKR-QELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEE 405
Query: 717 KSKPKGDKSKSKSKKEDEIDDEEYDE 742
K K + ++ K K + +++ + + DE
Sbjct: 406 KLKQEENEKKQKEQADEDKEKRQKDE 431
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 30.7 bits (70), Expect = 3.0
Identities = 18/61 (29%), Positives = 26/61 (42%), Gaps = 8/61 (13%)
Query: 462 AERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSR 521
E DD DDE+E+ ++ D + D + D D +E+SD KE S
Sbjct: 220 NELDDIDDEEEERVLADE---DDDDEDDMFDMD-----DDDEEESDPEVERTSTIKEVSE 271
Query: 522 K 522
K
Sbjct: 272 K 272
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 30.4 bits (69), Expect = 3.0
Identities = 24/102 (23%), Positives = 41/102 (40%)
Query: 630 SDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDED 689
DD + KS PA+ +T + + + + S D KK++ K+++ E
Sbjct: 128 VDDEEEQKSFDPAEYFDTTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKEL 187
Query: 690 SKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
+ K ++KK K + + E KK K K K K
Sbjct: 188 KERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKV 229
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 30.9 bits (70), Expect = 3.3
Identities = 20/102 (19%), Positives = 32/102 (31%), Gaps = 19/102 (18%)
Query: 636 SKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKA-----SSKRKRDSDEDS 690
S + P +TA + K+K+ KK S+ R +
Sbjct: 726 SSERAPRNVGKTA-------REKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPK 778
Query: 691 KAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKE 732
AK +K K S K +K + K ++ K E
Sbjct: 779 AAKKDARKAKK-------PSAKTQKIAAATKAKRAAKKKVAE 813
>gnl|CDD|148314 pfam06632, XRCC4, DNA double-strand break repair and V(D)J
recombination protein XRCC4. This family consists of
several eukaryotic DNA double-strand break repair and
V(D)J recombination protein XRCC4 sequences. In the
non-homologous end joining pathway of DNA double-strand
break repair, the ligation step is catalyzed by a
complex of XRCC4 and DNA ligase IV. It is thought that
XRCC4 and ligase IV are essential for alignment-based
gap filling, as well as for final ligation of the
breaks.
Length = 331
Score = 30.6 bits (69), Expect = 3.3
Identities = 22/68 (32%), Positives = 30/68 (44%)
Query: 695 KKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAG 754
KK K +S + S E EK K K + K+ ED D DE E+ P S++
Sbjct: 187 KKAKIRSLHKLLSEAQELEKSIKQKKETSACSDKTPDEDSKYDGSTDEEQEAPPKPSESM 246
Query: 755 SDASSSED 762
A S +D
Sbjct: 247 PAAVSKDD 254
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 30.8 bits (69), Expect = 3.3
Identities = 29/119 (24%), Positives = 46/119 (38%), Gaps = 7/119 (5%)
Query: 448 KEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDS----NPTPTDSD 503
K+ LA ++ E +++ DE E + S P E + E DS P
Sbjct: 490 KKSKKKLAPIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKP 549
Query: 504 EDSDASEGTKK---KKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMS 559
++ E KK K+ K K T+S+KP K K +K K + ++P
Sbjct: 550 GETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTR 608
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 30.2 bits (68), Expect = 3.5
Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 2/69 (2%)
Query: 685 DSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPV 744
DS+ + K + + D GE E+ S S + + S+ + E++ E
Sbjct: 216 PESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASA-- 273
Query: 745 ESTPPSSDA 753
+ TP S DA
Sbjct: 274 DDTPDSDDA 282
>gnl|CDD|236289 PRK08563, PRK08563, DNA-directed RNA polymerase subunit E';
Provisional.
Length = 187
Score = 29.8 bits (68), Expect = 3.6
Identities = 10/15 (66%), Positives = 14/15 (93%)
Query: 293 QELKEKYEGKLDKEL 307
+ L+EKYEG++DKEL
Sbjct: 28 EVLREKYEGRIDKEL 42
>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 29.8 bits (67), Expect = 3.6
Identities = 14/74 (18%), Positives = 26/74 (35%), Gaps = 6/74 (8%)
Query: 660 KEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSK 719
+E +E N D +A DS + K + + + E K ++
Sbjct: 125 EEMLEENAKKGGKIDKMEA------DSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNE 178
Query: 720 PKGDKSKSKSKKED 733
+ K SKK++
Sbjct: 179 LREQVQKESSKKKE 192
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 30.5 bits (69), Expect = 3.7
Identities = 24/135 (17%), Positives = 58/135 (42%), Gaps = 1/135 (0%)
Query: 598 KEKAEYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKV 657
KE+ E+ +A+E +A K++ + +K+K A+ K A+ A
Sbjct: 108 KERLAAQEQKKQAEEAAKQAALKQKQAEEAA-AKAAAAAKAKAEAEAKRAAAAAKKAAAE 166
Query: 658 KSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSK 717
K+ + ++ KKA ++ + ++K K + + +K + ++ E K+
Sbjct: 167 AKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAA 226
Query: 718 SKPKGDKSKSKSKKE 732
+ ++K+ ++K
Sbjct: 227 AAKAAAEAKAAAEKA 241
>gnl|CDD|176221 cd08260, Zn_ADH6, Alcohol dehydrogenases of the MDR family.
NAD(P)(H)-dependent oxidoreductases are the major
enzymes in the interconversion of alcohols and
aldehydes, or ketones. This group has the characteristic
catalytic and structural zinc sites of the
zinc-dependent alcohol dehydrogenases. Alcohol
dehydrogenase in the liver converts ethanol and NAD+ to
acetaldehyde and NADH, while in yeast and some other
microorganisms ADH catalyzes the conversion acetaldehyde
to ethanol in alcoholic fermentation. ADH is a member of
the medium chain alcohol dehydrogenase family (MDR),
which has a NAD(P)(H)-binding domain in a Rossmann fold
of a beta-alpha form. The NAD(H)-binding region is
comprised of 2 structurally similar halves, each of
which contacts a mononucleotide. A GxGxxG motif after
the first mononucleotide contact half allows the close
contact of the coenzyme with the ADH backbone. The
N-terminal catalytic domain has a distant homology to
GroES. These proteins typically form dimers (typically
higher plants, mammals) or tetramers (yeast, bacteria),
and have 2 tightly bound zinc atoms per subunit, a
catalytic zinc at the active site and a structural zinc
in a lobe of the catalytic domain. NAD(H)-binding occurs
in the cleft between the catalytic and coenzyme-binding
domains at the active site, and coenzyme binding induces
a conformational closing of this cleft. Coenzyme binding
typically precedes and contributes to substrate binding.
In human ADH catalysis, the zinc ion helps coordinate
the alcohol, followed by deprotonation of a histidine,
the ribose of NAD, a serine, then the alcohol, which
allows the transfer of a hydride to NAD+, creating NADH
and a zinc-bound aldehyde or ketone. In yeast and some
bacteria, the active site zinc binds an aldehyde,
polarizing it, and leading to the reverse reaction.
Length = 345
Score = 30.3 bits (69), Expect = 3.8
Identities = 12/43 (27%), Positives = 17/43 (39%), Gaps = 1/43 (2%)
Query: 173 AGDTDPVEAFKDSVMNQASV-INATGDAIAVFNEIQCLTPRGR 214
+ D A +D A V ++A G N + L RGR
Sbjct: 217 SEVEDVAAAVRDLTGGGAHVSVDALGIPETCRNSVASLRKRGR 259
>gnl|CDD|226947 COG4581, COG4581, Superfamily II RNA helicase [DNA replication,
recombination, and repair].
Length = 1041
Score = 30.8 bits (70), Expect = 3.9
Identities = 19/104 (18%), Positives = 34/104 (32%), Gaps = 3/104 (2%)
Query: 638 SKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSD-DDKKASSKRKRDSDEDSKAKTKK 696
+K E + +V+ K S + D D S +++ + K
Sbjct: 4 TKNFRVPSEYGFVSTKEKRVERKAATAEEESLKNIDISDLFERSVDMNKDPKEATDQVKA 63
Query: 697 KKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEY 740
+ SE ES + + + SK D + D + D E
Sbjct: 64 VDKHSEIESLNVGMRLDVRLSSKELSDVAVPDD--YDMVPDAES 105
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 30.2 bits (68), Expect = 4.1
Identities = 19/82 (23%), Positives = 36/82 (43%), Gaps = 7/82 (8%)
Query: 633 GKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKA 692
GK SK+ K K +T ++ + + + + + ++ K +S +
Sbjct: 2 GKASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKN-------RNSHTENKMESGTNDNN 54
Query: 693 KTKKKKEKSESESDSGESEKEK 714
K K+K K S+ DS SE+E+
Sbjct: 55 KNKEKLSKLYSDVDSSSSEEEE 76
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 30.5 bits (70), Expect = 4.1
Identities = 18/70 (25%), Positives = 30/70 (42%), Gaps = 5/70 (7%)
Query: 411 EYGKLFDFIREKKLRVKNTGKGEKPNYKEDF-GSSDDEKEPDAYLARVKREAAERDDEDD 469
E D I E R++N + +E G D E D + E + D++++
Sbjct: 144 ESPLTIDAILEWYDRLEN----GERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEE 199
Query: 470 EDEDESTDED 479
E+EDE+ D
Sbjct: 200 EEEDENDDSL 209
>gnl|CDD|239823 cd04331, RNAP_E_N, RNAP_E_N: RpoE, N-terminal ribonucleoprotein
(RNP) domain. RpoE (subunit E) is a subunit of the
archaeal RNA polymerase (RNAP) that is homologous to
Rpb7 of eukaryotic RNAP II, Rpc25 of eukaryotic RNAP
III, and Rpa43 of eukaryotic RNAP I. RpoE
heterodimerizes with RpoF, another RNA polymerase
subunit. RpoE has an elongated two-domain structure that
includes an N-terminal RNP domain and a C-terminal
oligonucleotide-binding (OB) domain. Both domains of
RpoE bind single-stranded RNA.
Length = 80
Score = 27.9 bits (63), Expect = 4.1
Identities = 11/14 (78%), Positives = 13/14 (92%)
Query: 294 ELKEKYEGKLDKEL 307
LKEKYEG+LDK+L
Sbjct: 28 ILKEKYEGRLDKDL 41
>gnl|CDD|218908 pfam06136, DUF966, Domain of unknown function (DUF966). Family of
plant proteins with unknown function.
Length = 308
Score = 30.1 bits (68), Expect = 4.3
Identities = 20/94 (21%), Positives = 36/94 (38%)
Query: 669 SSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSK 728
SS S D + +S RK + D+ ++ + S + + + + +
Sbjct: 91 SSSSKGDPEEASSRKLQEESDTPPVNRRANQSWSSSDLAEYKVYKAEEPADASTQTDDRR 150
Query: 729 SKKEDEIDDEEYDEPVESTPPSSDAGSDASSSED 762
S+ E + E S P SS + S +SS E
Sbjct: 151 SRDSSEAESTELSREEISPPSSSSSPSSSSSPET 184
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 29.6 bits (67), Expect = 4.4
Identities = 12/40 (30%), Positives = 20/40 (50%)
Query: 692 AKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKK 731
A+ K K E ++ E+ K + +G+K +SK KK
Sbjct: 157 AEWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|221750 pfam12745, HGTP_anticodon2, Anticodon binding domain of tRNAs.
This is an HGTP_anticodon binding domain, found largely
on Gcn2 proteins which bind tRNA to down regulate
translation in certain stress situations.
Length = 260
Score = 30.0 bits (68), Expect = 4.5
Identities = 22/104 (21%), Positives = 41/104 (39%), Gaps = 15/104 (14%)
Query: 420 REKKLRVKNTGKGEKPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDED 479
+EK L+VKN G+ E D + + D + ++ E ER+ + E +
Sbjct: 72 KEKPLKVKNLGRKE-----------DTDIDYDELVPWLRSEIRERNGQKREGSALRSTLL 120
Query: 480 FNPDQAESDVAEEYDSNPTPTDSDEDS----DASEGTKKKKKEK 519
+P Q E + S T+ + A+ K +++K
Sbjct: 121 RHPSQPEDANSSSGSSQIGDTERQKVVVVLNAATRSKKSNRRDK 164
>gnl|CDD|185272 PRK15374, PRK15374, pathogenicity island 1 effector protein SipB;
Provisional.
Length = 593
Score = 30.3 bits (68), Expect = 4.5
Identities = 20/120 (16%), Positives = 41/120 (34%), Gaps = 8/120 (6%)
Query: 592 WKTVSSKEKA---EYDEKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETA 648
W+ + +K + ++ A + EA Y+ S D+ S KK A
Sbjct: 111 WQAMIESQKEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAK---SVYDAAEKKLTQA 167
Query: 649 STAVSPHKVKSKEFIESNGSSSDSDDD-KKASSKRKRDSDEDSKAKTKKKKEKSESESDS 707
+ + ++ + + + +A + +D KA T K K+E +
Sbjct: 168 QNKLQSLDPADPGYAQAEAAVEQAGKEATEAKEALDKATDATVKAGT-DAKAKAEKADNI 226
>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22. The
mitochondrial protein translocase family, which is
responsible for movement of nuclear encoded pre-proteins
into mitochondria, is very complex with at least 19
components. These proteins include several chaperone
proteins, four proteins of the outer membrane
translocase (Tom) import receptor, five proteins of the
Tom channel complex, five proteins of the inner membrane
translocase (Tim) and three "motor" proteins. This
family represents the Tom22 proteins. The N terminal
region of Tom22 has been shown to have chaperone-like
activity, and the C terminal region faces the
intermembrane face.
Length = 136
Score = 29.1 bits (66), Expect = 4.5
Identities = 13/48 (27%), Positives = 20/48 (41%), Gaps = 5/48 (10%)
Query: 445 DDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEE 492
+DE + A E DD+D++D D +D +SD E
Sbjct: 8 EDETFQEKPAAPKNLAQEESDDDDEDDTDTDSDISD-----DSDFENE 50
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 30.1 bits (68), Expect = 4.6
Identities = 15/44 (34%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 460 EAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSD 503
E AE D D+++EDE+ E+ + E + EE + +P D+D
Sbjct: 38 EEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEVKASP-DAD 80
Score = 29.4 bits (66), Expect = 7.3
Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 6/48 (12%)
Query: 460 EAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSD 507
+A+ ED+E ED+ DED E D A + T+ +ED +
Sbjct: 29 ASAQDLTEDEEAEDDVVDED------EEDEAVVEEDENELTEEEEDEE 70
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 30.5 bits (69), Expect = 4.6
Identities = 23/86 (26%), Positives = 39/86 (45%), Gaps = 13/86 (15%)
Query: 658 KSKEFIESNGSSS-----DSDDDKKASSKRKRDSDEDSKAKTKKKKEKSES--ESDSGES 710
K K FI SN S + ++K+ + + + +SD++ K + E E D ES
Sbjct: 1505 KKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAES 1564
Query: 711 EKEKKSKSKPKGDKSKSKSKKEDEID 736
+ +K+ K + KS E E+D
Sbjct: 1565 DIKKRKNKK------QYKSNTEAELD 1584
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 29.8 bits (67), Expect = 4.6
Identities = 22/122 (18%), Positives = 48/122 (39%), Gaps = 17/122 (13%)
Query: 643 KKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSE 702
++ ++ + P + NGSS++ D S +D ++ EK
Sbjct: 177 ERIDSVNGPGEPEPEDDPKDSLGNGSSTNGLPD----------SSQDKNKSLEEYYEKES 226
Query: 703 SESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSED 762
S++ + + + PKG K+K ++E DD++ D E + + E+
Sbjct: 227 SDAAASQDDG-------PKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEE 279
Query: 763 DD 764
++
Sbjct: 280 EE 281
>gnl|CDD|234780 PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha;
Validated.
Length = 339
Score = 30.1 bits (69), Expect = 4.6
Identities = 11/44 (25%), Positives = 19/44 (43%), Gaps = 4/44 (9%)
Query: 580 SFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKE 623
TE+ K + +E+ E +N+ K+ AL+ KE
Sbjct: 38 ELTELLKG----LGKLPPEERKEAGALINELKQAIEAALEERKE 77
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 30.3 bits (69), Expect = 4.7
Identities = 14/58 (24%), Positives = 24/58 (41%), Gaps = 4/58 (6%)
Query: 676 KKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKED 733
K S K+ + AK +KKEK + + K++ +K G + K E+
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKV----KKRHRDTKNIGKRRKPSGTSEE 434
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 29.7 bits (67), Expect = 5.2
Identities = 24/77 (31%), Positives = 33/77 (42%), Gaps = 9/77 (11%)
Query: 471 DEDESTDEDF--NPDQAES-DVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITIS 527
D ES F NP S +A D D S++S G K++ K+ S +
Sbjct: 86 DVKESDPMSFITNPSLTVSVPIASGESDLENLNDEDTSSESSYGFKEESKKGS------A 139
Query: 528 EKPRKSKKDKESSGGST 544
E + +KD SS GST
Sbjct: 140 ETLKLEEKDSSSSEGST 156
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 30.0 bits (68), Expect = 5.5
Identities = 15/55 (27%), Positives = 26/55 (47%), Gaps = 2/55 (3%)
Query: 690 SKAKTKK--KKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEEYDE 742
SK KK K +++E E + E K + + KG KS + ++ +D +Y
Sbjct: 9 SKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRA 63
>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2. This
domain family is found in eukaryotes, and is typically
between 138 and 153 amino acids in length. The family is
found in association with pfam00493. Mini-chromosome
maintenance (MCM) proteins are essential for DNA
replication. These proteins use ATPase activity to
perform this function.
Length = 145
Score = 28.8 bits (65), Expect = 5.7
Identities = 13/42 (30%), Positives = 20/42 (47%)
Query: 437 YKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDE 478
+ +D D + + A R +R+ E D DDED D +E
Sbjct: 90 FLDDDDDDDGDFDLTAQPRRRRRQYDEDRDADDEDIDPLEEE 131
>gnl|CDD|227508 COG5181, HSH155, U2 snRNP spliceosome subunit [RNA processing and
modification].
Length = 975
Score = 29.9 bits (67), Expect = 5.8
Identities = 20/128 (15%), Positives = 40/128 (31%), Gaps = 9/128 (7%)
Query: 443 SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAE-EYDSNPTPTD 501
SS+ E E A +R + A ED D + + + ++ + + + P
Sbjct: 1 SSNPEMEDKAEDSRRSTKQASVWYEDRNDSRGNVSQWEATPRTANESKDLKQEQTPLGYI 60
Query: 502 SDEDSDAS--------EGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNA 553
+ S + KK+E E P + + G ++ +
Sbjct: 61 QVREDLYSKTRMDPGFQPDSDKKRELELNNTWDMEYPDSKRSSRWDEMGYEPPQEINMCL 120
Query: 554 PKKPMSAY 561
P + A
Sbjct: 121 PARGYKAL 128
>gnl|CDD|213844 TIGR03657, IsdB, heme uptake protein IsdB. Isd proteins are
iron-regulated surface proteins found in Bacillus,
Staphylococcus and Listeria species and are responsible
for heme scavenging from hemoproteins. The IsdB protein
is only observed in Staphylococcus and consists of an
N-terminal hydrophobic signal sequence, a pair of tandem
NEAT (NEAr Transporter, pfam05031) domains which confers
the ability to bind heme and a C-terminal sortase
processing signal which targets the protein to the cell
wall. IsdB is believed to make a direct contact with
methemoglobin facilitating transfer of heme to IsdB. The
heme is then transferred to other cell wall-bound NEAT
domain proteins such as IsdA and IsdC.
Length = 644
Score = 29.9 bits (66), Expect = 5.9
Identities = 29/127 (22%), Positives = 59/127 (46%), Gaps = 4/127 (3%)
Query: 605 EKVNKAKEDYNEALKAYKESGGGQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIE 664
+K + K+D ++ KE+ +S KT +K + E++ST + ++ +
Sbjct: 493 QKQDSQKDDNKQSPSVEKENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAK 552
Query: 665 SNGSSSDSDDD---KKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPK 721
+SS++ D A S +DS KA K + ++S + ++ +E K+KS P+
Sbjct: 553 PTTASSETTKDVVQTSAGSSEAKDSAPLQKANIKNTND-GHTQSQNNKNTQENKAKSLPQ 611
Query: 722 GDKSKSK 728
+ +K
Sbjct: 612 TGEESNK 618
>gnl|CDD|177546 PHA03151, PHA03151, hypothetical protein; Provisional.
Length = 259
Score = 29.4 bits (65), Expect = 5.9
Identities = 22/107 (20%), Positives = 35/107 (32%)
Query: 443 SSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDS 502
SSDD PD + V + R E++ E + D E ++
Sbjct: 114 SSDDSSAPDFITSLVSSDCEARGLSSSEEDGEPYSKQKMSQPLTIDAKTEEITSEEDCCV 173
Query: 503 DEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKK 549
EDS SE + + R + +K K K ++K
Sbjct: 174 QEDSSDSEEDVVEAFIRQRAQMAGKKKKGKRSISTSDDEPPRKSRRK 220
>gnl|CDD|221825 pfam12877, DUF3827, Domain of unknown function (DUF3827). This
family contains the human KIAA1549 protein which has
been found to be fused fused to BRAF gene in many cases
of pilocytic astrocytomas. The fusion is due mainly to a
tandem duplication of 2 Mb at 7q34. Although nothing is
known about the function of KIAA1549 protein, the BRAF
protein is a well characterized oncoprotein. It is a
serine/threonine protein kinase which is implicated in
MAP/ERK signalling, a critical pathway for the
regulation of cell division, differentiation and
secretion.
Length = 684
Score = 29.9 bits (67), Expect = 6.0
Identities = 21/94 (22%), Positives = 36/94 (38%), Gaps = 17/94 (18%)
Query: 675 DKKASSKRKRDSDEDSKAKTKKKKEKSESESD-------SGESEKEKKSK-SKPKGDKSK 726
D + + + + S + S K +++ KS S+ D S S +EK + S ++
Sbjct: 359 DAEVPTPKSKSSQDGSSNKKRRRGRKSPSDGDSEGSSVISNRSSREKSGRPSTTPSVTAQ 418
Query: 727 SKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSS 760
K KE E + P S SS+
Sbjct: 419 QKPTKE---------EGRKKPAPPSGTDEQLSSA 443
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 29.6 bits (67), Expect = 6.0
Identities = 11/44 (25%), Positives = 22/44 (50%)
Query: 444 SDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAES 487
DE++ + L + + E E D + EDEDE +++ + +
Sbjct: 273 GGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDE 316
>gnl|CDD|220388 pfam09766, FimP, Fms-interacting protein. This entry carries part
of the crucial 144 N-terminal residues of the FmiP
protein, which is essential for the binding of the
protein to the cytoplasmic domain of activated
Fms-molecules in M-CSF induced haematopoietic
differentiation of macrophages. The C-terminus contains
a putative nuclear localisation sequence and a leucine
zipper which suggest further, as yet unknown, nuclear
functions. The level of FMIP expression might form a
threshold that determines whether cells differentiate
into macrophages or into granulocytes.
Length = 352
Score = 29.6 bits (67), Expect = 6.3
Identities = 14/47 (29%), Positives = 24/47 (51%)
Query: 489 VAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKK 535
++ +S+ D D DSD E K++KK +S + +K K+ K
Sbjct: 205 FEQQKESSEQSQDDDSDSDEEEEQKRQKKRRSTDQEKLDDKRLKNSK 251
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 30.0 bits (67), Expect = 6.3
Identities = 39/162 (24%), Positives = 63/162 (38%), Gaps = 10/162 (6%)
Query: 470 EDEDESTDEDFNPDQAESDVAEEYDSNPT---PTDSDEDSDASEGTKKKKKEKSRKTITI 526
E++D D +F + E+ + N + SD S AS + K K ++ +
Sbjct: 112 EEDDSKDDMEFKANPNEAGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSS 171
Query: 527 SEKPRKSKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISFTE--- 583
E S K S G ++ APK ++ + +P E
Sbjct: 172 DESATDSGKASASVAGIVGADEEAPPAPKNTLTPLEELYETNVNLFALKHPLEKLEEEID 231
Query: 584 IAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEAL-KAYKES 624
I K G+ V+ +E+ E DE+ +A+ED EAL K E
Sbjct: 232 ILKNDGDK---VAEEEEFELDEEHEEAEEDKKEALEKIGAEG 270
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 29.0 bits (65), Expect = 6.4
Identities = 14/74 (18%), Positives = 28/74 (37%)
Query: 634 KTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDEDSKAK 693
K + KK ++ +T + K K KE + + SD ++ + +K D +
Sbjct: 48 KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQN 107
Query: 694 TKKKKEKSESESDS 707
T + + S
Sbjct: 108 TAAEVNHEDVIDLS 121
>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 944
Score = 29.7 bits (67), Expect = 7.0
Identities = 30/173 (17%), Positives = 58/173 (33%), Gaps = 23/173 (13%)
Query: 603 YDEKVNKAKEDYNEA---LKAYKES----GGGQDSDDGKTSKSKKPAKKKETASTAVSPH 655
E + DY + L AY++ + + + ++A+ A
Sbjct: 531 TLESNGLDEGDYAQDSAPLDAYQDDYVAFSSESYNALSDDEQHSANVQSAQSAAEAQPSS 590
Query: 656 KVKSKEFIESNGSSSDSDDDKKASSKRKRDS---DEDSKAKTKKKKEKSESESDSGESEK 712
+ S + ++S +DDD + RDS D D S E D +S
Sbjct: 591 QSLSPISAVTTAAASLADDDILDAVLAARDSLLSDLD---------ALSPKEGDGKKSSA 641
Query: 713 EKKSKSKPKG----DKSKSKSKKEDEIDDEEYDEPVESTPPSSDAGSDASSSE 761
++K K+ P SK S + +D + + + +A+ +
Sbjct: 642 DRKPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVPEAALAS 694
>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
Provisional.
Length = 719
Score = 29.8 bits (67), Expect = 7.1
Identities = 28/137 (20%), Positives = 55/137 (40%), Gaps = 19/137 (13%)
Query: 438 KEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDED-ESTDEDFNPDQAESDVAEEYDSN 496
K++ + D A R + D++ D+DE+ + + + P+ +++ ++
Sbjct: 416 KKNQKPQESSPSLDQTNA-ASRNSVMMDEDSDDDEEFQIPESEQEPETTKNET-KDTAME 473
Query: 497 PTPTDSDEDSDASEGTKKKKKEKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPKK 556
P D + S S ++ ++K+R IS+ K S K DK+ K
Sbjct: 474 EEPQDKIDLSCFSGNLRRDDRDKARDCWRISD----GNNFKVRS----KNFCYDKS--KI 523
Query: 557 PMSAYMM------WFND 567
P ++M WF D
Sbjct: 524 PAGKHLMDLVAVDWFKD 540
>gnl|CDD|111764 pfam02912, Phe_tRNA-synt_N, Aminoacyl tRNA synthetase class II,
N-terminal domain.
Length = 73
Score = 27.1 bits (61), Expect = 7.2
Identities = 15/45 (33%), Positives = 24/45 (53%), Gaps = 4/45 (8%)
Query: 580 SFTEIAKKGGELWKTVSSKEKAEYDEKVNKAKEDYNEALKAYKES 624
TE+ K G+L S +E+ + +N+AKE EAL+ K +
Sbjct: 22 PLTELLKGLGKL----SPEERPKVGALINEAKEAVEEALEERKAA 62
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 29.5 bits (65), Expect = 7.6
Identities = 18/83 (21%), Positives = 38/83 (45%)
Query: 670 SDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKS 729
D DKK + + E+++ + K + E E E EK+K K + S+ K+
Sbjct: 120 FADDQDKKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKA 179
Query: 730 KKEDEIDDEEYDEPVESTPPSSD 752
++E + ++E + + +S+
Sbjct: 180 EQEKQKTEQEKQKTEQEKQKTSN 202
>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
Provisional.
Length = 418
Score = 29.5 bits (66), Expect = 7.8
Identities = 15/108 (13%), Positives = 31/108 (28%), Gaps = 8/108 (7%)
Query: 662 FIESNGSSSDSDDDKKASSKRKRDSDEDSKAKTKKKKEKSESE--SDSGESEKEKKSKSK 719
E + +R R ++ + +E++ + S K S S
Sbjct: 298 GSEHSDPEPLPASLPPPPVRRPRVKHPEAGKEEPDGARNAEAKEPAQPATSTSSKGSSSA 357
Query: 720 PKGDKSKSK------SKKEDEIDDEEYDEPVESTPPSSDAGSDASSSE 761
D + + DD+ P++ T S + +S
Sbjct: 358 QNKDSGSTGPGSSLAAASSFLEDDDFGSPPLDLTTSLRHMPSPSVTSA 405
>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295). This
family is conserved in fungi but the function is not
known.
Length = 509
Score = 29.5 bits (66), Expect = 8.1
Identities = 17/74 (22%), Positives = 31/74 (41%), Gaps = 3/74 (4%)
Query: 434 KPNYKEDFGSSDDEKEPDAYLARVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEY 493
KP + GS + K + ++ K+ A+ ++ E T +D D+ + E+
Sbjct: 241 KPKMFQLGGSDELGKSLPSLMSPRKKTASFKEQVVTRTFPERTSDD---DEDAIETEEDD 297
Query: 494 DSNPTPTDSDEDSD 507
D D+DSD
Sbjct: 298 VDESAIEDDDDDSD 311
>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
UL32. The large phosphorylated protein (UL32-like) of
herpes viruses is the polypeptide most frequently
reactive in immuno-blotting analyses with antisera when
compared with other viral proteins.
Length = 777
Score = 29.5 bits (66), Expect = 8.2
Identities = 54/311 (17%), Positives = 90/311 (28%), Gaps = 32/311 (10%)
Query: 459 REAAERDDED----DEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASE-GTK 513
++ E DD DED+D+ D + D+ + DED D S +
Sbjct: 272 EDSLEYDDPGLESTDEDDDDDGDSSLQTFKPLLDLTGSSLWSDDEESGDEDGDGSGFAPE 331
Query: 514 KKKKEKSRKTITISEKPRKSKKDKESSGG--STKRKKKDKNAPKKPMSAYMMWFNDMREK 571
K SR T+ + R K S T + P S
Sbjct: 332 PLIKTDSRSNDTLVDLGRGGGSLKLDSVDAPGTSSYLFEPGLSPSPNSG----------- 380
Query: 572 IKKDNPGISFTEIAKKGGELWKTV----SSKEKAEYDEKVNKAKEDYNEA-LKAYKESGG 626
K+ PGI TE + K + + A +
Sbjct: 381 --KEMPGILTTENLDLPLASTDSTEMDPEDKRGGAVKINNSGILAWGLKTPGLAVNDERS 438
Query: 627 GQDSDDGKTSKSKKPAKKKETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDS 686
S DG T P+ + +S V + S +S D + + S
Sbjct: 439 IAVSSDGITDVLDPPSPLRLHSSDKVIDSVSPPSKRRVSAPASRLDDAKRPEVTATPESS 498
Query: 687 DEDSKAKTKKKKEKSESESDSGESEKEKKSKSK-------PKGDKSKSKSKKEDEIDDEE 739
DS+ +++++ S+++S S KE + + PK ++
Sbjct: 499 GSDSEGGASGREDETSSDAESVVSIKELRPRIGFINKSPPPKSPPKSRRTLIVALSLASP 558
Query: 740 YDEPVESTPPS 750
PS
Sbjct: 559 STAGSPRPKPS 569
>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator. This
family of trans-activating transcriptional regulator
(TATR), also known as intermediate early protein 1, are
common to the Nucleopolyhedroviruses.
Length = 571
Score = 29.5 bits (66), Expect = 8.2
Identities = 16/75 (21%), Positives = 27/75 (36%), Gaps = 4/75 (5%)
Query: 466 DEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKKKKEKSRKTIT 525
E ES E + E DS+ DS + S + K KK + K T
Sbjct: 106 LSPGESLPESLSETKQVTVS-KRKGSELDSDS---DSSDSSKGKKLVNKPKKRQKYKKAT 161
Query: 526 ISEKPRKSKKDKESS 540
I ++++ ++
Sbjct: 162 IQSPTSLTEEENYNT 176
>gnl|CDD|193106 pfam12628, Inhibitor_I71, Falstatin, cysteine peptidase inhibitor.
This family of peptidase inhibitors is expressed from
plasmodial protozoal species. Falstatin is found to be a
potent reversible inhibitor of the P. falciparum
cysteine proteases falcipain-2 and falcipain-3, as well
as other parasite- and non-parasite-derived cysteine
proteases, but is only a relatively weak inhibitor of
the P. falciparum cysteine proteases falcipain-1 and
dipeptidyl aminopeptidase 1. Thus, P. falciparum
requires expression of falstatin to limit proteolysis by
certain host or parasite cysteine proteases during
erythrocyte invasion.
Length = 173
Score = 28.7 bits (64), Expect = 8.5
Identities = 14/49 (28%), Positives = 23/49 (46%)
Query: 533 SKKDKESSGGSTKRKKKDKNAPKKPMSAYMMWFNDMREKIKKDNPGISF 581
S K+ ++ S+ + + D N P KP + + +R IK PG F
Sbjct: 100 SSKNVINNNQSSSQNETDSNKPPKPRMQLVGGPSMLRSVIKGHKPGKYF 148
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 28.0 bits (63), Expect = 8.9
Identities = 18/64 (28%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 680 SKRKRDSDEDSKAKTKKKKEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKEDEIDDEE 739
++ +++ + + K++KE+ E E G +KE++ K + +K+K+K DDE+
Sbjct: 13 DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLA-RYADDED 71
Query: 740 YDEP 743
YDE
Sbjct: 72 YDEE 75
>gnl|CDD|236615 PRK09709, PRK09709, exonuclease VIII; Reviewed.
Length = 877
Score = 29.6 bits (66), Expect = 9.4
Identities = 25/103 (24%), Positives = 44/103 (42%), Gaps = 5/103 (4%)
Query: 456 RVKREAAERDDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSDASEGTKKK 515
RV E + D+E + + DE + E D A E+ + P D+ ++ + ++
Sbjct: 388 RVTEEGKQDDEEKPQPSGTTADEQATAETVEPD-ATEHHQDTQPLDAQSQVNSVDAKYQE 446
Query: 516 KK---EKSRKTITISEKPRKSKKDKESSGGSTKRKKKDKNAPK 555
+ ++RK I S+ P + K +S G D N PK
Sbjct: 447 LRAELHEARKNIP-SKNPVDADKLLAASRGEFVEGISDPNDPK 488
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 28.1 bits (63), Expect = 9.4
Identities = 12/25 (48%), Positives = 13/25 (52%)
Query: 709 ESEKEKKSKSKPKGDKSKSKSKKED 733
E K+KK K K K K K SKK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAA 131
>gnl|CDD|217373 pfam03115, Astro_capsid, Astrovirus capsid protein precursor. This
product is encoded by astrovirus ORF2, one of the three
astrovirus ORFs (1a, 1b, 2). The 87kD precursor protein
undergoes an intracellular cleavage to form a 79kD
protein. Subsequently, extracellular trypsin cleavage
yields the three proteins forming the infectious virion.
Length = 787
Score = 29.4 bits (66), Expect = 9.4
Identities = 11/43 (25%), Positives = 18/43 (41%)
Query: 465 DDEDDEDEDESTDEDFNPDQAESDVAEEYDSNPTPTDSDEDSD 507
D +D E TD + + E D + +D + + ED D
Sbjct: 664 DPYEDLISLEETDTEDESTEDEDDELDRFDLHDSSGSEPEDDD 706
>gnl|CDD|113815 pfam05058, ActA, ActA Protein. The ActA family is found in
Listeria and is associated with motility. ActA protein
acts as a scaffold to assemble and activate host cell
actin cytoskeletal factors at the bacterial surface,
resulting in directional actin polymerisation and
propulsion of the bacterium through the cytoplasm of the
host cell.
Length = 601
Score = 29.4 bits (65), Expect = 9.6
Identities = 44/155 (28%), Positives = 65/155 (41%), Gaps = 16/155 (10%)
Query: 590 ELWKTVSSKEKAEYDEKVNKAKE----DYNEALKAYKESGGGQDSDDGKTSKSKKPAKKK 645
E + VSS++ E EK NK K+ D LKA E G ++++ S+ + A
Sbjct: 56 ETAREVSSRDIKEL-EKSNKVKDANKADLIAMLKAKAEKGPNINNNNNNYSEQSENAAIN 114
Query: 646 ETASTAVSPHKVKSKEFIESNGSSSDSDDDKKASSKRKRDSDE--------DSKAKTKKK 697
E AS A P + G SDS + K K SD D KT KK
Sbjct: 115 EEASGADRPAIQVERR---HPGLPSDSAAEIKKRRKAIASSDSELESLTYPDKPTKTNKK 171
Query: 698 KEKSESESDSGESEKEKKSKSKPKGDKSKSKSKKE 732
K ES +D+ ES+ + +S + K+ ++
Sbjct: 172 KVAKESIADASESDLDSSMQSADESSPQPLKANQQ 206
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.308 0.128 0.357
Gapped
Lambda K H
0.267 0.0722 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 38,314,376
Number of extensions: 3817270
Number of successful extensions: 10802
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7295
Number of HSP's successfully gapped: 968
Length of query: 764
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 660
Effective length of database: 6,324,786
Effective search space: 4174358760
Effective search space used: 4174358760
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 63 (28.1 bits)