RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy18075
(999 letters)
>gnl|CDD|238064 cd00116, LRR_RI, Leucine-rich repeats (LRRs), ribonuclease
inhibitor (RI)-like subfamily. LRRs are 20-29 residue
sequence motifs present in many proteins that
participate in protein-protein interactions and have
different functions and cellular locations. LRRs
correspond to structural units consisting of a beta
strand (LxxLxLxxN/CxL conserved pattern) and an alpha
helix. This alignment contains 12 strands corresponding
to 11 full repeats, consistent with the extent observed
in the subfamily acting as Ran GTPase Activating
Proteins (RanGAP1).
Length = 319
Score = 212 bits (543), Expect = 7e-62
Identities = 131/331 (39%), Positives = 185/331 (55%), Gaps = 22/331 (6%)
Query: 386 KLDNKEDAKVIVDAINEVKVLVSLNLEGNTLGVNAAKAIADALSKHEHFKRALWKDMFTG 445
+L E A ++ + ++VL LEGNTLG AAKA+A AL K TG
Sbjct: 8 ELLKTERATELLPKLLCLQVL---RLEGNTLGEEAAKALASALRPQPSLKELCLSLNETG 64
Query: 446 RMKTEIPDALRYLGNGLQQAGARLVELDLSDNAFGPIGVEGLADLLRSSCCFALEELKLN 505
R IP L+ L GL G L ELDLSDNA GP G L LLRSS +L+ELKLN
Sbjct: 65 R----IPRGLQSLLQGLT-KGCGLQELDLSDNALGPDGCGVLESLLRSS---SLQELKLN 116
Query: 506 NNGLGITGCKLLSKALHDCYESSKKEGSPLALKVFIAGRNRLENEGAKMLAAVFKKLKTL 565
NNGLG G +LL+K L D P AL+ + GRNRLE + LA + + L
Sbjct: 117 NNGLGDRGLRLLAKGLKDL---------PPALEKLVLGRNRLEGASCEALAKALRANRDL 167
Query: 566 ERVEMPQNGIYHVGITALSDAFEENKNLRHLNLNDNTITYKGAIPLGQALSKLPSLAILN 625
+ + + NGI GI AL++ + N NL L+LN+N +T +GA L + L+ L SL +LN
Sbjct: 168 KELNLANNGIGDAGIRALAEGLKANCNLEVLDLNNNGLTDEGASALAETLASLKSLEVLN 227
Query: 626 LGDCLLKSAGASSIAK-YLTDNTTLEDVNLTCNEISVQGGLDLVKAMKNKTKLKQINVSE 684
LGD L AGA+++A L+ N +L ++L+CN+I+ G DL + + K L ++++
Sbjct: 228 LGDNNLTDAGAAALASALLSPNISLLTLSLSCNDITDDGAKDLAEVLAEKESLLELDLRG 287
Query: 685 NQFGEEGVEEMEKLMKSFG-MAAALVLEDDE 714
N+FGEEG + + + + G +L ++DD
Sbjct: 288 NKFGEEGAQLLAESLLEPGNELESLWVKDDS 318
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 172 bits (437), Expect = 7e-47
Identities = 119/367 (32%), Positives = 178/367 (48%), Gaps = 17/367 (4%)
Query: 377 KLSFAGQGLKLDNKEDAKVIVDAINEVKVLVSLNLEGNTLGVNAAKAIADALSKHEHFKR 436
K S G+ LKL+ KED K +V+ + + LV ++L GNT+G A + + + ++ + +
Sbjct: 3 KFSIGGKKLKLETKEDVKGVVEELEMMDELVEVDLSGNTIGTEAMEELCNVIANVRNLRV 62
Query: 437 ALWKDMFTGRMKTEIPDALRYLGNGLQQAGARLVELDLSDNAFGPIGVEGLADLLRSSCC 496
+ D FTGR K E+ L L L + RL ++DLSDNAFG E L DL+ SS
Sbjct: 63 VNFSDAFTGRDKDELYSNLVMLLKALLKC-PRLQKVDLSDNAFGSEFPEELGDLISSST- 120
Query: 497 FALEELKLNNNGLGITGCKLLSKALHDCYESSKKEGSPLALKVFIAGRNRLENEGAKMLA 556
L LKLNNNGLG + KAL + K P L+V I GRNRLEN ++ A
Sbjct: 121 -DLVHLKLNNNGLGPIAGGRIGKALFHLAYNKKAADKP-KLEVVICGRNRLENGSKELSA 178
Query: 557 AVFKKLKTLERVEMPQNGIYHVGITALS-DAFEENKNLRHLNLNDNTITYKGAIPLGQAL 615
A+ + + L+ V++ QNGI G+T L+ + +L L+L DNT T +G+ L AL
Sbjct: 179 ALLESHENLKEVKIQQNGIRPEGVTMLAFLGLFYSHSLEVLDLQDNTFTLEGSRYLADAL 238
Query: 616 SKLPSLAILNLGDCLLKSAGASSIAKYLTD--NTTLEDVNLTCNEISVQGGLDL-VKAMK 672
+ L L L DCLL + G S+ + + L + NE LD+ + +
Sbjct: 239 CEWNLLRELRLNDCLLSNEGVKSVLRRFNEKFVPNLMPLPGDYNERRGGIILDISLNEFE 298
Query: 673 --NKTKLKQINVSENQFGEEGV--EEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEE 728
L + + N+ E + E + + V+E EG+ EE +E E
Sbjct: 299 QDAVPLLVDLERNGNRIKELADFGDYYEDIFEVVE-----VVEKQEGDVVTEESTDEESE 353
Query: 729 ENDSDAE 735
+ E
Sbjct: 354 DEVEIDE 360
>gnl|CDD|203777 pfam07834, RanGAP1_C, RanGAP1 C-terminal domain. Ran-GTPase
activating protein 1 (RanGAP1) is a GTPase activator for
the nuclear Ras-related regulatory protein Ran,
converting it to the putatively inactive GDP-bound
state. Its C-terminal domain is required for RanGAP1
localisation at the vertebrate nuclear pore complex, and
is sumoylated by the small ubiquitin-related modifier
protein (SUMO-1). This domain is composed almost
entirely of helical substructures that are organised
into an alpha-alpha superhelix fold, with the exception
of the peptide containing the lysine residue required
for SUMO-1 conjugation.
Length = 169
Score = 48.8 bits (116), Expect = 2e-06
Identities = 38/138 (27%), Positives = 70/138 (50%), Gaps = 10/138 (7%)
Query: 781 VTDFLAAPSVSLFNSLGPNCAQLFLDEINSEPEDRYLESMLLAIMKVSSLVPSTTTLQSP 840
++ FL PS LGP +QL +++ ++ +E A +KVSS+ ++
Sbjct: 17 LSTFLVFPSPEKLIRLGPKRSQLIAQQVDVTDAEKVVE----AFLKVSSVYKDEPEVKQ- 71
Query: 841 SSSLFECTKVLYRELFNWASNSDKLSTVTNALLVHLGLIKCEDKSFKINYNLNGCMTAVR 900
++FE T L R+ F+ ++ + ++ +LLV++GL+K EDK KI+ L G + +
Sbjct: 72 --AVFETTDALMRKAFS--NSPFQSNSFITSLLVNMGLLKSEDKVKKIS-VLPGPLLTLN 126
Query: 901 LLSEAKLLPSLTESQLLA 918
+ + + P LLA
Sbjct: 127 HMVQQEYFPKDLAGVLLA 144
Score = 38.8 bits (90), Expect = 0.005
Identities = 29/99 (29%), Positives = 53/99 (53%), Gaps = 4/99 (4%)
Query: 174 ESMLLAIMKVSSLVPSTTTLPSSSLFECTKVLYRELFNWASNSDKLSTVTNALLVHLGLI 233
E ++ A +KVSS+ + ++FE T L R+ F+ ++ + ++ +LLV++GL+
Sbjct: 50 EKVVEAFLKVSSVYKDEPEV-KQAVFETTDALMRKAFS--NSPFQSNSFITSLLVNMGLL 106
Query: 234 KCEDKSFKINYNLNGCMTAVRLLSEAKLLPSLTESQLLA 272
K EDK KI+ L G + + + + + P LLA
Sbjct: 107 KSEDKVKKIS-VLPGPLLTLNHMVQQEYFPKDLAGVLLA 144
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 40.1 bits (94), Expect = 0.003
Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 3/84 (3%)
Query: 665 LDLVKAMKNKTKLKQINVSENQFGEEGVEEMEKL-MKSFGMAAALVLEDDEGECSDEEQD 723
+ + K K L + F EE + EKL M + + DE + DEE++
Sbjct: 120 MGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEE 179
Query: 724 EESEEENDSDAEGDNSNLSHNDSN 747
EE EEE++ + D+ + +D N
Sbjct: 180 EEEEEEDEDFDDDDDDD--DDDYN 201
Score = 34.0 bits (78), Expect = 0.25
Identities = 13/38 (34%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 7 LEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSN 44
+ DE + DEE++EE EEE++ + D+ + +D N
Sbjct: 166 EDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD--DDDYN 201
Score = 29.7 bits (67), Expect = 5.8
Identities = 9/36 (25%), Positives = 23/36 (63%)
Query: 2 AAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSN 37
A +++++ + +EE++EE E+E+ D + D+ +
Sbjct: 163 LEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198
>gnl|CDD|197686 smart00368, LRR_RI, Leucine rich repeat, ribonuclease inhibitor
type.
Length = 28
Score = 34.3 bits (80), Expect = 0.008
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 499 LEELKLNNNGLGITGCKLLSKALHD 523
L EL L+NN LG G + L++AL D
Sbjct: 4 LRELDLSNNKLGDEGARALAEALKD 28
Score = 31.2 bits (72), Expect = 0.10
Identities = 12/26 (46%), Positives = 15/26 (57%)
Query: 468 RLVELDLSDNAFGPIGVEGLADLLRS 493
L ELDLS+N G G LA+ L+
Sbjct: 3 SLRELDLSNNKLGDEGARALAEALKD 28
Score = 30.5 bits (70), Expect = 0.20
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 590 NKNLRHLNLNDNTITYKGAIPLGQALS 616
N +LR L+L++N + +GA L +AL
Sbjct: 1 NPSLRELDLSNNKLGDEGARALAEALK 27
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 37.6 bits (87), Expect = 0.022
Identities = 12/41 (29%), Positives = 24/41 (58%)
Query: 9 DDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNA 49
DD+ S + DEE + ++D + + D++N +H D + + A
Sbjct: 274 DDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDNGA 314
Score = 37.6 bits (87), Expect = 0.022
Identities = 12/41 (29%), Positives = 24/41 (58%)
Query: 712 DDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNA 752
DD+ S + DEE + ++D + + D++N +H D + + A
Sbjct: 274 DDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDNGA 314
Score = 30.3 bits (68), Expect = 5.4
Identities = 9/36 (25%), Positives = 21/36 (58%)
Query: 12 GECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSH 47
G SD+++ S + ++ + D+ N ++D+N +H
Sbjct: 270 GYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNH 305
Score = 30.3 bits (68), Expect = 5.4
Identities = 9/36 (25%), Positives = 21/36 (58%)
Query: 715 GECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSH 750
G SD+++ S + ++ + D+ N ++D+N +H
Sbjct: 270 GYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNH 305
>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
Provisional.
Length = 337
Score = 36.7 bits (85), Expect = 0.057
Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 1/57 (1%)
Query: 12 GECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSH 68
GE DE+ D S+E++D D+S+ + +DSN S +N+ + + N + +
Sbjct: 277 GEAIDEDSDYSSDEDDDDYDSYDSSDSASSDSN-SDVDTNEEDDRGEKESNGAKSNE 332
Score = 36.7 bits (85), Expect = 0.057
Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 1/57 (1%)
Query: 715 GECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSH 771
GE DE+ D S+E++D D+S+ + +DSN S +N+ + + N + +
Sbjct: 277 GEAIDEDSDYSSDEDDDDYDSYDSSDSASSDSN-SDVDTNEEDDRGEKESNGAKSNE 332
Score = 34.0 bits (78), Expect = 0.32
Identities = 15/58 (25%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDNSNL-SHNDSNHSHNASNQSHNNSNQSHNTS 64
ED + +++ D +S + +DS + NS++ ++ + + SN + SN+ H TS
Sbjct: 282 EDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEKESNGA--KSNELHLTS 337
Score = 34.0 bits (78), Expect = 0.32
Identities = 15/58 (25%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNL-SHNDSNHSHNASNQSHNNSNQSHNTS 767
ED + +++ D +S + +DS + NS++ ++ + + SN + SN+ H TS
Sbjct: 282 EDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEKESNGA--KSNELHLTS 337
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The
CAF-1 or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression
in addition to DNA-based checkpoints.
Length = 76
Score = 33.3 bits (77), Expect = 0.065
Identities = 11/26 (42%), Positives = 19/26 (73%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEG 33
E++EGE + E +E+ EE++D D +G
Sbjct: 50 EEEEGEDLESEDEEDEEEDDDDDMDG 75
Score = 33.3 bits (77), Expect = 0.065
Identities = 11/26 (42%), Positives = 19/26 (73%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEG 736
E++EGE + E +E+ EE++D D +G
Sbjct: 50 EEEEGEDLESEDEEDEEEDDDDDMDG 75
>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
Length = 613
Score = 36.4 bits (85), Expect = 0.088
Identities = 21/90 (23%), Positives = 38/90 (42%), Gaps = 12/90 (13%)
Query: 6 VLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSN 65
L + E ECSD E + E + SD S+L +D+ ++ + + ++
Sbjct: 17 FLVEAEAECSDGEDELEDLFDTGSDI----SDLI-DDAEVVQGGNSLALFHQQEAEEDEQ 71
Query: 66 QSHSVSQLKQHSVTDFLAAPSVSLFNSLGP 95
Q + LK+ +L++P S L P
Sbjct: 72 Q---IQALKRK----YLSSPEKSPVADLSP 94
Score = 36.4 bits (85), Expect = 0.088
Identities = 21/90 (23%), Positives = 38/90 (42%), Gaps = 12/90 (13%)
Query: 709 VLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSN 768
L + E ECSD E + E + SD S+L +D+ ++ + + ++
Sbjct: 17 FLVEAEAECSDGEDELEDLFDTGSDI----SDLI-DDAEVVQGGNSLALFHQQEAEEDEQ 71
Query: 769 QSHSVSQLKQHSVTDFLAAPSVSLFNSLGP 798
Q + LK+ +L++P S L P
Sbjct: 72 Q---IQALKRK----YLSSPEKSPVADLSP 94
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 35.7 bits (82), Expect = 0.13
Identities = 21/89 (23%), Positives = 41/89 (46%), Gaps = 21/89 (23%)
Query: 671 MKNKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEEN 730
MK + ++++ G+E EE E+D+ E D++ D+E E+++
Sbjct: 32 MKKENAIRKL-------GKEAEEEAM-------------EEEDDDEEDDDDDDDEDEDDD 71
Query: 731 DSDAEGDNSNLSHNDSNHSHNASNQSHNN 759
D D + D+ + +DS H+ S+ N
Sbjct: 72 DDDDDEDDEDEDDDDST-LHDDSSADDGN 99
Score = 33.4 bits (76), Expect = 0.58
Identities = 15/50 (30%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 7 LEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNN 56
E+D+ E D++ D+E E+++D D + D+ + +DS H+ S+ N
Sbjct: 51 EEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDST-LHDDSSADDGN 99
Score = 30.3 bits (68), Expect = 5.4
Identities = 9/48 (18%), Positives = 29/48 (60%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHN 55
++++ + D+E +++ ++++D D E ++ + S + S + N++ N
Sbjct: 56 DEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDN 103
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 33.5 bits (77), Expect = 0.15
Identities = 18/69 (26%), Positives = 37/69 (53%), Gaps = 3/69 (4%)
Query: 672 KNKTKLKQINVSENQF-GEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEEN 730
+ K K ++ + E + EE +E+EK +K A L+ E+++ E ++ + E++E+
Sbjct: 41 EAKEKEREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDE--EDAETEDTEDVE 98
Query: 731 DSDAEGDNS 739
D + EG
Sbjct: 99 DDEWEGFPE 107
Score = 30.4 bits (69), Expect = 2.1
Identities = 10/57 (17%), Positives = 20/57 (35%), Gaps = 9/57 (15%)
Query: 689 EEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDE---------ESEEENDSDAEG 736
E+ ++E ++ +K + + + D E DE EE D +
Sbjct: 66 EKQLKERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDEDK 122
>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat.
Length = 60
Score = 31.7 bits (73), Expect = 0.17
Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 8/67 (11%)
Query: 565 LERVEMPQNGIYHVGITALSDAFEENKNLRHLNLNDNTITYKGAIPLGQALSKLPSLAIL 624
L+ +++ N + + AF+ NL+ L+L+ N +T +I A S LPSL L
Sbjct: 2 LKSLDLSNNRLTVIP----DGAFKGLPNLKVLDLSGNNLT---SISPE-AFSGLPSLRSL 53
Query: 625 NLGDCLL 631
+L L
Sbjct: 54 DLSGNNL 60
Score = 28.3 bits (64), Expect = 2.3
Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 8/68 (11%)
Query: 592 NLRHLNLNDNTITYKGAIPLGQALSKLPSLAILNLGDCLLKSAGASSIAKYLTDNTTLED 651
NL+ L+L++N +T A LP+L +L+L L S + + +L
Sbjct: 1 NLKSLDLSNNRLTVIP----DGAFKGLPNLKVLDLSGNNLTSISPEA----FSGLPSLRS 52
Query: 652 VNLTCNEI 659
++L+ N +
Sbjct: 53 LDLSGNNL 60
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 35.1 bits (81), Expect = 0.23
Identities = 20/75 (26%), Positives = 35/75 (46%), Gaps = 6/75 (8%)
Query: 693 EEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNA 752
E + ++ +F AL +ED E DEE+++E EEE + + EG + H+D
Sbjct: 367 TEQDYILDAFS---ALEIEDANTERDDEEEEDEEEEEEEDEDEGPSK--EHSDDE-EFEE 420
Query: 753 SNQSHNNSNQSHNTS 767
+ + N+S
Sbjct: 421 DDVESKYEDSDGNSS 435
Score = 35.1 bits (81), Expect = 0.23
Identities = 18/61 (29%), Positives = 29/61 (47%), Gaps = 3/61 (4%)
Query: 4 ALVLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNT 63
AL +ED E DEE+++E EEE + + EG + H+D + + N+
Sbjct: 378 ALEIEDANTERDDEEEEDEEEEEEEDEDEGPSK--EHSDDE-EFEEDDVESKYEDSDGNS 434
Query: 64 S 64
S
Sbjct: 435 S 435
>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies). Leucine rich
repeats are short sequence motifs present in a number of
proteins with diverse functions and cellular locations.
These repeats are usually involved in protein-protein
interactions. Each Leucine Rich Repeat is composed of a
beta-alpha unit. These units form elongated non-globular
structures. Leucine Rich Repeats are often flanked by
cysteine rich domains.
Length = 43
Score = 30.5 bits (70), Expect = 0.30
Identities = 15/43 (34%), Positives = 22/43 (51%), Gaps = 6/43 (13%)
Query: 591 KNLRHLNLNDNTITYKGAIPLGQALSKLPSLAILNLGDCLLKS 633
NL L+L++N IT +P LS LP+L L+L +
Sbjct: 1 TNLETLDLSNNQIT---DLP---PLSNLPNLETLDLSGNKITD 37
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 34.2 bits (79), Expect = 0.44
Identities = 20/63 (31%), Positives = 30/63 (47%), Gaps = 9/63 (14%)
Query: 684 ENQFGEEGV------EEMEKLMKSFGM---AAALVLEDDEGECSDEEQDEESEEENDSDA 734
+N FG + V + E+L L +DD+ E SDEE +E S+ E+D D
Sbjct: 90 DNVFGSDQVVIFDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDD 149
Query: 735 EGD 737
+ D
Sbjct: 150 DDD 152
Score = 30.3 bits (69), Expect = 5.2
Identities = 14/31 (45%), Positives = 20/31 (64%)
Query: 4 ALVLEDDEGECSDEEQDEESEEENDSDAEGD 34
L +DD+ E SDEE +E S+ E+D D + D
Sbjct: 122 DLEEDDDDDEESDEEDEESSKSEDDEDDDDD 152
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 31.9 bits (73), Expect = 0.78
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 1 MAAALVLEDDEGECSDEEQDEESEEENDSDAE 32
+A+ DD+ E +EE DEE ++E++S+ E
Sbjct: 107 VASEEDESDDDEEDEEEEDDEEDDDEDESEEE 138
Score = 31.9 bits (73), Expect = 0.78
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 704 MAAALVLEDDEGECSDEEQDEESEEENDSDAE 735
+A+ DD+ E +EE DEE ++E++S+ E
Sbjct: 107 VASEEDESDDDEEDEEEEDDEEDDDEDESEEE 138
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 32.7 bits (75), Expect = 0.95
Identities = 18/59 (30%), Positives = 28/59 (47%), Gaps = 5/59 (8%)
Query: 685 NQFGEE----GVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNS 739
+GEE G+ +E L K E ++G SD++ DEE E E + D + D+
Sbjct: 58 KGYGEEEVVDGIPGLELLEKWK-EEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDE 115
Score = 30.0 bits (68), Expect = 6.2
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDNS 36
E ++G SD++ DEE E E + D + D+
Sbjct: 87 EAEQGLESDDDDDEEEEWEVEEDEDSDDE 115
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 32.7 bits (74), Expect = 1.0
Identities = 22/106 (20%), Positives = 35/106 (33%), Gaps = 7/106 (6%)
Query: 673 NKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDS 732
+ ++ E++ E VEE E + KS D EC EE++ E EEE
Sbjct: 109 ETVEEEEK--EESREEREEVEETEGVTKSE----QKNDWRDAEECQKEEKEPEPEEEEKP 162
Query: 733 DAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSHSVSQLKQ 778
N + H + + + +LKQ
Sbjct: 163 KRGSLEEN-NGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQ 207
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 32.3 bits (74), Expect = 1.2
Identities = 14/28 (50%), Positives = 19/28 (67%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDN 35
EDD E DE++++E EEE + D EGD
Sbjct: 292 EDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 32.3 bits (74), Expect = 1.2
Identities = 14/28 (50%), Positives = 19/28 (67%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDN 738
EDD E DE++++E EEE + D EGD
Sbjct: 292 EDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 29.6 bits (67), Expect = 7.8
Identities = 11/69 (15%), Positives = 29/69 (42%), Gaps = 10/69 (14%)
Query: 689 EEGVEEMEKLMKSFG-----MAAALVLEDDEGECSD-----EEQDEESEEENDSDAEGDN 738
+E +++++K G V+ DE + + EE++EE + D + ++
Sbjct: 246 GAALEAIKEVIKKKGGDFKVKGEPEVVGGDEEDLEELLEKAEEEEEEDDYSESEDEDEED 305
Query: 739 SNLSHNDSN 747
+ + +
Sbjct: 306 EDEEEEEDD 314
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 32.4 bits (73), Expect = 1.6
Identities = 22/91 (24%), Positives = 41/91 (45%), Gaps = 1/91 (1%)
Query: 685 NQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHN 744
N GEE E+ + G CS + + ++ ++ + EG+ NL
Sbjct: 3 NHIGEEASAEVLSGRNNTGDDGVHGGSCHPSHCSPQPRGDKFNDKFEFAGEGEKPNLL-V 61
Query: 745 DSNHSHNASNQSHNNSNQSHNTSNQSHSVSQ 775
D+N +H + S NNS ++ N + ++S S+
Sbjct: 62 DANENHINTRSSDNNSCENANMNRDANSSSE 92
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 31.6 bits (71), Expect = 1.7
Identities = 13/53 (24%), Positives = 26/53 (49%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQS 60
++DE E +EE++E EEN+ G ++N + D + + + +S
Sbjct: 136 DEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEES 188
Score = 31.6 bits (71), Expect = 1.7
Identities = 13/53 (24%), Positives = 26/53 (49%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQS 763
++DE E +EE++E EEN+ G ++N + D + + + +S
Sbjct: 136 DEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEES 188
Score = 31.6 bits (71), Expect = 2.1
Identities = 19/90 (21%), Positives = 41/90 (45%), Gaps = 4/90 (4%)
Query: 11 EGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSHSV 70
+ + SDE+++EE EEE + +AE + + N ++ + + + +S + + SV
Sbjct: 131 KEDESDEDEEEEEEEEEE-EAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEESV 189
Query: 71 SQLKQHSVTDFLAAPSV-SLFNSLGPNCAQ 99
++ + T +A P+ S P
Sbjct: 190 TEAEAEGTT--VAGPTTTSPNGGFQPTTPP 217
Score = 31.6 bits (71), Expect = 2.1
Identities = 19/90 (21%), Positives = 41/90 (45%), Gaps = 4/90 (4%)
Query: 714 EGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSHSV 773
+ + SDE+++EE EEE + +AE + + N ++ + + + +S + + SV
Sbjct: 131 KEDESDEDEEEEEEEEEE-EAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEESV 189
Query: 774 SQLKQHSVTDFLAAPSV-SLFNSLGPNCAQ 802
++ + T +A P+ S P
Sbjct: 190 TEAEAEGTT--VAGPTTTSPNGGFQPTTPP 217
>gnl|CDD|225191 COG2309, AmpS, Leucyl aminopeptidase (aminopeptidase T) [Amino acid
transport and metabolism].
Length = 385
Score = 31.9 bits (73), Expect = 1.9
Identities = 10/32 (31%), Positives = 15/32 (46%)
Query: 12 GECSDEEQDEESEEENDSDAEGDNSNLSHNDS 43
G + +E +N+S GDN + H DS
Sbjct: 326 GSAYPFNVEGGAEMQNESAIHGDNVSDVHVDS 357
Score = 31.9 bits (73), Expect = 1.9
Identities = 10/32 (31%), Positives = 15/32 (46%)
Query: 715 GECSDEEQDEESEEENDSDAEGDNSNLSHNDS 746
G + +E +N+S GDN + H DS
Sbjct: 326 GSAYPFNVEGGAEMQNESAIHGDNVSDVHVDS 357
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 31.6 bits (71), Expect = 1.9
Identities = 16/53 (30%), Positives = 24/53 (45%), Gaps = 7/53 (13%)
Query: 688 GEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSN 740
E+ + + L K G D E S+ E++E SE E+D D + DN
Sbjct: 221 SEKEKTKKKDLEKWLG-------SDQSMETSESEEEESSESESDEDEDEDNKG 266
Score = 30.4 bits (68), Expect = 4.8
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDNSN 37
D E S+ E++E SE E+D D + DN
Sbjct: 237 SDQSMETSESEEEESSESESDEDEDEDNKG 266
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.
This family represents the C-terminus (approximately
300 residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 31.4 bits (71), Expect = 2.1
Identities = 16/30 (53%), Positives = 23/30 (76%)
Query: 6 VLEDDEGECSDEEQDEESEEENDSDAEGDN 35
VLE +E + S EE++EESEEE ++D+E D
Sbjct: 4 VLELEEEDESGEEEEEESEEEEETDSEDDM 33
Score = 31.4 bits (71), Expect = 2.1
Identities = 16/30 (53%), Positives = 23/30 (76%)
Query: 709 VLEDDEGECSDEEQDEESEEENDSDAEGDN 738
VLE +E + S EE++EESEEE ++D+E D
Sbjct: 4 VLELEEEDESGEEEEEESEEEEETDSEDDM 33
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 31.9 bits (72), Expect = 2.1
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAE 32
E++E E +EE++EE EEE + + E
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 31.9 bits (72), Expect = 2.1
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAE 735
E++E E +EE++EE EEE + + E
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 30.3 bits (68), Expect = 6.4
Identities = 13/32 (40%), Positives = 21/32 (65%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDNSNLS 39
E++E E +EE++EE EEE + + E + LS
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLS 895
Score = 30.3 bits (68), Expect = 6.4
Identities = 13/32 (40%), Positives = 21/32 (65%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLS 742
E++E E +EE++EE EEE + + E + LS
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLS 895
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 31.6 bits (71), Expect = 2.2
Identities = 18/71 (25%), Positives = 37/71 (52%)
Query: 672 KNKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEEND 731
+N +L+ +N+ E G ++ E+ + + + E D+++D E ++E +
Sbjct: 85 ENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKE 144
Query: 732 SDAEGDNSNLS 742
SDAEGD + L+
Sbjct: 145 SDAEGDENELA 155
>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
Length = 470
Score = 31.8 bits (73), Expect = 2.2
Identities = 20/92 (21%), Positives = 41/92 (44%), Gaps = 9/92 (9%)
Query: 647 TTLEDV-NLTCNEISVQGGLD--LVKAMKNKTK--LKQINVSENQFGEEGVEEMEKLMKS 701
++LE++ + E+ G D V+ ++ + K L+ +++ + + + +E L +
Sbjct: 378 SSLEELAYVPVEELLEIEGFDEETVEELRERAKEALETEALAQEEKLADDLLSLEGLDRE 437
Query: 702 FGMAAA----LVLEDDEGECSDEEQDEESEEE 729
A LED + DE D E +EE
Sbjct: 438 LAFKLAEKGIKTLEDLAEQAVDELIDIEGDEE 469
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association
with mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 29.3 bits (66), Expect = 2.5
Identities = 11/34 (32%), Positives = 19/34 (55%)
Query: 9 DDEGECSDEEQDEESEEENDSDAEGDNSNLSHND 42
D E E DEE++EE EE++ D ++ + +
Sbjct: 3 DTEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAE 36
Score = 29.3 bits (66), Expect = 2.5
Identities = 11/34 (32%), Positives = 19/34 (55%)
Query: 712 DDEGECSDEEQDEESEEENDSDAEGDNSNLSHND 745
D E E DEE++EE EE++ D ++ + +
Sbjct: 3 DTEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAE 36
>gnl|CDD|218773 pfam05841, Apc15p, Apc15p protein. The anaphase-promoting complex
(APC) is a conserved multi-subunit ubiquitin ligase
required for the degradation of key cell cycle
regulators Members of this family are components of the
anaphase-promoting complex homologous to Apc15p.
Length = 114
Score = 29.7 bits (67), Expect = 2.5
Identities = 16/80 (20%), Positives = 29/80 (36%), Gaps = 9/80 (11%)
Query: 42 DSNHSHNASNQSHNNSNQSHNTSNQSHSVSQLKQHSVTDFLAAPSVSLFNSLGPNCAQLF 101
D S N ++SN ++T + S L + + + S ++ + A+L
Sbjct: 10 DEQSSRNYRKGRQSHSN--NSTDRELLSAPPLAAYGASSEQPYYTRSSQSASRGSLARLT 67
Query: 102 LDEINSEPEDRYLESMLLAI 121
L+E LE I
Sbjct: 68 LEE-------EALERRKQNI 80
Score = 29.7 bits (67), Expect = 2.5
Identities = 16/80 (20%), Positives = 29/80 (36%), Gaps = 9/80 (11%)
Query: 745 DSNHSHNASNQSHNNSNQSHNTSNQSHSVSQLKQHSVTDFLAAPSVSLFNSLGPNCAQLF 804
D S N ++SN ++T + S L + + + S ++ + A+L
Sbjct: 10 DEQSSRNYRKGRQSHSN--NSTDRELLSAPPLAAYGASSEQPYYTRSSQSASRGSLARLT 67
Query: 805 LDEINSEPEDRYLESMLLAI 824
L+E LE I
Sbjct: 68 LEE-------EALERRKQNI 80
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 31.3 bits (71), Expect = 3.0
Identities = 11/72 (15%), Positives = 28/72 (38%), Gaps = 1/72 (1%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQS 67
+D + SD +++ ++ + DN + +N + SN+ N+ + + S
Sbjct: 146 TNDSNKNSDSSIKNDTDTQSSKQDKADNQKAP-SSNNTKPSTSNKQPNSPKPTQPNQSNS 204
Query: 68 HSVSQLKQHSVT 79
S + +
Sbjct: 205 QPASDDTANQKS 216
Score = 31.3 bits (71), Expect = 3.0
Identities = 11/72 (15%), Positives = 28/72 (38%), Gaps = 1/72 (1%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQS 770
+D + SD +++ ++ + DN + +N + SN+ N+ + + S
Sbjct: 146 TNDSNKNSDSSIKNDTDTQSSKQDKADNQKAP-SSNNTKPSTSNKQPNSPKPTQPNQSNS 204
Query: 771 HSVSQLKQHSVT 782
S + +
Sbjct: 205 QPASDDTANQKS 216
Score = 30.5 bits (69), Expect = 5.0
Identities = 20/146 (13%), Positives = 56/146 (38%), Gaps = 18/146 (12%)
Query: 698 LMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSH 757
+ A A +D + + + + +S++ ND + D S+ + ++ ++N SNQ +
Sbjct: 18 PTLTSPTAYA---DDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKAD-NNNTSNQDN 73
Query: 758 NNSNQSHNTSNQSHSVSQLKQHSVTDFLAAPSVSLFNSLGPNCAQLFLDEINSEPEDRYL 817
N+ S + + + + + P ++ L N ++ L
Sbjct: 74 NDKKFS-TIDSSTSDSNNIIDFIYKNL---PQTNINQLLTKNKYD----------DNYSL 119
Query: 818 ESMLLAIMKVSSLVPSTTTLQSPSSS 843
+++ + ++S + ++ S
Sbjct: 120 TTLIQNLFNLNSDISDYEQPRNSEKS 145
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase
19 kDa subunit. This family contains several
DNA-directed RNA polymerase 19 kDa polypeptides. The
Poxvirus DNA-directed RNA polymerase (EC: 2.7.7.6)
catalyzes DNA-template-directed extension of the 3'-end
of an RNA strand by one nucleotide at a time.
Length = 167
Score = 30.1 bits (68), Expect = 3.1
Identities = 17/68 (25%), Positives = 33/68 (48%), Gaps = 4/68 (5%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQS 67
E D+ + + E++EE EE+ +S D S+L ++ + S N + + N S
Sbjct: 11 ESDDDDSEEYEEEEEDEEDAESLESSDVSSLKQSNY----KIESASENIEDATSNPKQVS 66
Query: 68 HSVSQLKQ 75
+S +K+
Sbjct: 67 EKISAIKR 74
Score = 30.1 bits (68), Expect = 3.1
Identities = 17/68 (25%), Positives = 33/68 (48%), Gaps = 4/68 (5%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQS 770
E D+ + + E++EE EE+ +S D S+L ++ + S N + + N S
Sbjct: 11 ESDDDDSEEYEEEEEDEEDAESLESSDVSSLKQSNY----KIESASENIEDATSNPKQVS 66
Query: 771 HSVSQLKQ 778
+S +K+
Sbjct: 67 EKISAIKR 74
>gnl|CDD|205694 pfam13516, LRR_6, Leucine Rich repeat.
Length = 24
Score = 27.2 bits (61), Expect = 3.3
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 590 NKNLRHLNLNDNTITYKGAIPLGQ 613
NL L+L+ N I +GA L +
Sbjct: 1 LTNLTSLDLSGNQIGDEGASALAK 24
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 31.0 bits (70), Expect = 3.5
Identities = 15/46 (32%), Positives = 25/46 (54%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQS 53
E++E E +EE+++ESEEE D E + + N S S++
Sbjct: 447 EEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEG 492
Score = 31.0 bits (70), Expect = 3.5
Identities = 15/46 (32%), Positives = 25/46 (54%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQS 756
E++E E +EE+++ESEEE D E + + N S S++
Sbjct: 447 EEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEG 492
Score = 29.9 bits (67), Expect = 9.6
Identities = 13/52 (25%), Positives = 22/52 (42%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQ 59
+DE E + E D SEEE + +EGD + N+ + ++
Sbjct: 467 GEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSE 518
Score = 29.9 bits (67), Expect = 9.6
Identities = 13/52 (25%), Positives = 22/52 (42%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQ 762
+DE E + E D SEEE + +EGD + N+ + ++
Sbjct: 467 GEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSE 518
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 31.0 bits (70), Expect = 3.5
Identities = 18/89 (20%), Positives = 30/89 (33%), Gaps = 10/89 (11%)
Query: 700 KSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEG-----DNSNLSHNDSNHSHNASN 754
SF + +DD+ E +D+E + N + E L D + A+N
Sbjct: 77 ASFLNQRKSLDDDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANN 136
Query: 755 QSHNNSNQ-----SHNTSNQSHSVSQLKQ 778
N+ +Q S SV+
Sbjct: 137 DQTNDFDQDDSSNSQTDQGLKQSVNLSSA 165
Score = 31.0 bits (70), Expect = 4.5
Identities = 16/78 (20%), Positives = 27/78 (34%), Gaps = 10/78 (12%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEG-----DNSNLSHNDSNHSHNASNQSHNNSNQ--- 59
+DD+ E +D+E + N + E L D + A+N N+ +Q
Sbjct: 88 DDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDS 147
Query: 60 --SHNTSNQSHSVSQLKQ 75
S SV+
Sbjct: 148 SNSQTDQGLKQSVNLSSA 165
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 31.1 bits (70), Expect = 3.7
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHS 46
E+D SD ++ E E E+D + +S +S N+++ S
Sbjct: 3843 EEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDS 3881
Score = 31.1 bits (70), Expect = 3.7
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHS 749
E+D SD ++ E E E+D + +S +S N+++ S
Sbjct: 3843 EEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDS 3881
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The
domain is found in the primary vegetative sigma factor.
The function of this domain is unclear and can be
removed without loss of function.
Length = 211
Score = 30.2 bits (69), Expect = 3.9
Identities = 14/36 (38%), Positives = 21/36 (58%), Gaps = 1/36 (2%)
Query: 1 MAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNS 36
AAA+ E DE + D++ D+E E+E D + E D
Sbjct: 40 TAAAIESELDEEDLEDDDDDDEDEDE-DDEEEADLG 74
Score = 30.2 bits (69), Expect = 3.9
Identities = 14/36 (38%), Positives = 21/36 (58%), Gaps = 1/36 (2%)
Query: 704 MAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNS 739
AAA+ E DE + D++ D+E E+E D + E D
Sbjct: 40 TAAAIESELDEEDLEDDDDDDEDEDE-DDEEEADLG 74
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 29.9 bits (67), Expect = 4.0
Identities = 16/79 (20%), Positives = 30/79 (37%), Gaps = 8/79 (10%)
Query: 671 MKNKTKLKQINVSENQFGEEGVEEMEKLMKSFGMAAALV---LEDDEGECSDEEQDEESE 727
M K K++ V ++ E+ S +ED E +E+++ E
Sbjct: 1 MPPKGNAKKLKVRPPP-----TKDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEV 55
Query: 728 EENDSDAEGDNSNLSHNDS 746
EE + + S+ S + S
Sbjct: 56 EEETAASSKAPSSSSKSSS 74
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 30.8 bits (69), Expect = 4.4
Identities = 14/51 (27%), Positives = 27/51 (52%)
Query: 689 EEGVEEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNS 739
+E E E++ + + E DE E SDE ++ SE+E+++D+ +
Sbjct: 929 DESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEED 979
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 30.7 bits (70), Expect = 4.5
Identities = 10/35 (28%), Positives = 20/35 (57%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHND 42
+DD+ E +E+ D EEE++ D + D+ + +
Sbjct: 338 DDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEE 372
Score = 30.7 bits (70), Expect = 4.5
Identities = 10/35 (28%), Positives = 20/35 (57%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHND 745
+DD+ E +E+ D EEE++ D + D+ + +
Sbjct: 338 DDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEE 372
>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. This
model represents a group of paralogous families in
plasmodium species alternately annotated as reticulocyte
binding protein, 235-kDa family protein and rhoptry
protein. Rhoptry protein is localized on the cell surface
and is extremely large (although apparently lacking in
repeat structure) and is important for the process of
invasion of the RBCs by the parasite. These proteins are
found in P. falciparum, P. vivax and P. yoelii.
Length = 2757
Score = 30.8 bits (69), Expect = 5.0
Identities = 33/131 (25%), Positives = 55/131 (41%), Gaps = 20/131 (15%)
Query: 637 SSIAKYLTDNTTLEDVNLTCNEISVQGGLDLVKAMKNKTKLKQINVSENQFGEEGVEEME 696
+ IA+ D T+LE+V I++ G +L K L++I+ E + E ++ ME
Sbjct: 1197 NEIAEIEKDKTSLEEVK----GINLSYGKNL-----GKLFLEKID-EEKKKSEHMIKAME 1246
Query: 697 KLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQS 756
++ DE + E + E E D AE + N+SH+D H S +
Sbjct: 1247 AYIEDL----------DEIKEKSPEIENEMGIEMDIKAEMETFNISHDDDKDHHIISKKH 1296
Query: 757 HNNSNQSHNTS 767
N + S
Sbjct: 1297 DENISDIREKS 1307
>gnl|CDD|237078 PRK12366, PRK12366, replication factor A; Reviewed.
Length = 637
Score = 30.8 bits (70), Expect = 5.0
Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 6/72 (8%)
Query: 676 KLKQINVSENQFGEEGVEEMEKLMKSFG-----MAAALVLEDDEGECSDEEQDEESEEEN 730
KLKQ + E EE +++ + ++ G AA +++ + G + Q+EE E+
Sbjct: 7 KLKQK-ILEKISEEELEKKINEKIEENGGFLGEEAALMMVAQELGIDIEYGQEEEEEDFK 65
Query: 731 DSDAEGDNSNLS 742
SD E N+
Sbjct: 66 ISDIEEGQINVE 77
>gnl|CDD|220419 pfam09814, DUF2351, Uncharacterized conserved protein (DUF2351).
Members of this family of proteins have no known
function.
Length = 350
Score = 30.2 bits (68), Expect = 5.1
Identities = 17/62 (27%), Positives = 29/62 (46%), Gaps = 7/62 (11%)
Query: 800 CAQLFLDEINSEPEDRYL-------ESMLLAIMKVSSLVPSTTTLQSPSSSLFECTKVLY 852
A L L+ +NS R++ + +LL + LV S+ + ++ S S KVLY
Sbjct: 237 VAALLLELLNSHSTRRFVIQSSDGKQYILLWVFNSDLLVSSSGSPRAGSLSARNAMKVLY 296
Query: 853 RE 854
+
Sbjct: 297 QP 298
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 30.6 bits (69), Expect = 5.2
Identities = 17/82 (20%), Positives = 38/82 (46%), Gaps = 15/82 (18%)
Query: 665 LDLVKAMKNKTKLKQINVSENQFGEEGVE-EMEKLMKSFGMAAALVLEDDEGECSDEEQD 723
LD++KA +NK++ +Q+ E + ++ E + + ED+ D E +
Sbjct: 629 LDVLKADENKSRHQQLFEGEEEDEDDLEETDDD--------------EDECEAIEDSESE 674
Query: 724 EESEEENDSDAEGDNSNLSHND 745
ES+ E+ + E ++ ++
Sbjct: 675 SESDGEDGEEDEQEDDAEANEG 696
>gnl|CDD|240339 PTZ00265, PTZ00265, multidrug resistance protein (mdr1);
Provisional.
Length = 1466
Score = 30.4 bits (68), Expect = 6.0
Identities = 31/156 (19%), Positives = 66/156 (42%), Gaps = 19/156 (12%)
Query: 718 SDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSHSVSQLK 777
++++ + ND+D + D + ++ DS ++ + N+ +++ + SN+ +
Sbjct: 726 NNQKVSSKKSSNNDNDKDSDMKSSAYKDSERGYDPDEMNGNSKHENESASNKKSCKMSDE 785
Query: 778 QHSVTDFLAAPSVSLFNSL------GPNCAQLFLDEINSEPEDRYLESMLLAIMKVSSLV 831
S + A + +L PN ++ EI S +D + + L+I+ L
Sbjct: 786 NASENN--AGGKLPFLRNLFKRKPKAPNNLRIVYREIFSYKKD--VTIIALSILVAGGLY 841
Query: 832 PSTTTLQSP-SSSLFECTKVLYRELFNWASNSDKLS 866
P L + S+LF + N +NS+K S
Sbjct: 842 PVFALLYAKYVSTLF--------DFANLEANSNKYS 869
Score = 30.0 bits (67), Expect = 9.0
Identities = 22/121 (18%), Positives = 52/121 (42%), Gaps = 10/121 (8%)
Query: 15 SDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQSHNNSNQSHNTSNQSHSVSQLK 74
++++ + ND+D + D + ++ DS ++ + N+ +++ + SN+ +
Sbjct: 726 NNQKVSSKKSSNNDNDKDSDMKSSAYKDSERGYDPDEMNGNSKHENESASNKKSCKMSDE 785
Query: 75 QHSVTDFLAAPSVSLFNSL------GPNCAQLFLDEINSEPEDRYLESMLLAIMKVSSLV 128
S + A + +L PN ++ EI S +D + + L+I+ L
Sbjct: 786 NASENN--AGGKLPFLRNLFKRKPKAPNNLRIVYREIFSYKKD--VTIIALSILVAGGLY 841
Query: 129 P 129
P
Sbjct: 842 P 842
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel,
four-helix bundle structure with a large hydrophobic
patch in which 23 residues of one monomer form van der
Waals contacts with the other monomer. This CENP-B
dimer configuration may be suitable for capturing two
distant CENP-B boxes during centromeric heterochromatin
formation.
Length = 101
Score = 28.2 bits (62), Expect = 6.5
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGD 34
E+D SDEE+D++ E+E D D + D
Sbjct: 9 EEDSDSDSDEEEDDDDEDEEDDDEDDD 35
Score = 28.2 bits (62), Expect = 6.5
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEGD 737
E+D SDEE+D++ E+E D D + D
Sbjct: 9 EEDSDSDSDEEEDDDDEDEEDDDEDDD 35
>gnl|CDD|219905 pfam08564, CDC37_C, Cdc37 C terminal domain. Cdc37 is a protein
required for the activity of numerous eukaryotic protein
kinases. This domains corresponds to the C terminal
domain whose function is unclear. It is found C terminal
to the Hsp90 chaperone (Heat shocked protein 90) binding
domain pfam08565 and the N terminal kinase binding
domain of Cdc37 pfam03234.
Length = 89
Score = 28.1 bits (63), Expect = 7.2
Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 6/56 (10%)
Query: 687 FGEEGVEEMEKLMKSF---GM---AAALVLEDDEGECSDEEQDEESEEENDSDAEG 736
+ VEE E M+ G+ A + E + E +E +EE+E+E + +
Sbjct: 34 LAKMPVEEAEYHMERCIDSGLWVPNAKIEGEKEFKELEEEYNEEEAEKEEIEEEDE 89
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell
cycle progression and biogenesis of 60S ribosomal
subunits. The Schistosoma mansoni Mak16 has been shown
to target protein transport to the nucleolus.
Length = 97
Score = 27.9 bits (62), Expect = 7.8
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDNSNLS 39
E+DE E +EE+++E E E SD E +
Sbjct: 62 ENDEEEEEEEEEEDEGEIEYVSDDEELEEEIE 93
Score = 27.9 bits (62), Expect = 7.8
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLS 742
E+DE E +EE+++E E E SD E +
Sbjct: 62 ENDEEEEEEEEEEDEGEIEYVSDDEELEEEIE 93
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 29.4 bits (66), Expect = 8.3
Identities = 9/46 (19%), Positives = 23/46 (50%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQS 53
D + DE+ +E E+++ + E S+ S ++ + + + +S
Sbjct: 218 SADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMES 263
Score = 29.4 bits (66), Expect = 8.3
Identities = 9/46 (19%), Positives = 23/46 (50%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDNSNLSHNDSNHSHNASNQS 756
D + DE+ +E E+++ + E S+ S ++ + + + +S
Sbjct: 218 SADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMES 263
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 29.7 bits (67), Expect = 8.8
Identities = 11/46 (23%), Positives = 21/46 (45%)
Query: 693 EEMEKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDN 738
E+M + +LE+ + + +E DE + E +S+ E N
Sbjct: 225 EKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPIN 270
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 29.6 bits (67), Expect = 9.0
Identities = 18/53 (33%), Positives = 30/53 (56%), Gaps = 8/53 (15%)
Query: 690 EGVEEM----EKLMKSFGMAAALVLEDDEGECSDEEQDEESEEENDSDAEGDN 738
+ +EE E+L + + V E+DE E SD E EE E++++ D +G+N
Sbjct: 302 KALEEQLRQGEELRRK--IEGKSVSEEDEDEDSDSE--EEDEDDDEDDDDGEN 350
>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal. The family represents
the N-terminal region of RXT2-like proteins. In S.
cerevisiae, RXT2 has been demonstrated to be involved
in conjugation with cellular fusion (mating) and
invasive growth. A high throughput localisation study
has localised RXT2 to the nucleus.
Length = 141
Score = 28.5 bits (64), Expect = 9.1
Identities = 9/28 (32%), Positives = 21/28 (75%)
Query: 8 EDDEGECSDEEQDEESEEENDSDAEGDN 35
++D G+ D+++D+E +EE D++ + +N
Sbjct: 48 DEDGGDIDDDDEDDEDDEEADAEDDDEN 75
Score = 28.5 bits (64), Expect = 9.1
Identities = 9/28 (32%), Positives = 21/28 (75%)
Query: 711 EDDEGECSDEEQDEESEEENDSDAEGDN 738
++D G+ D+++D+E +EE D++ + +N
Sbjct: 48 DEDGGDIDDDDEDDEDDEEADAEDDDEN 75
>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
Length = 356
Score = 29.6 bits (66), Expect = 9.3
Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 9/71 (12%)
Query: 668 VKAMKNKTKLKQINVSENQF---GEEG---VEEMEKLMKSFGMAAALVLEDDEGECSDEE 721
VK + KT + + V+ GEE E E+L F + ++ E E EE
Sbjct: 285 VKPVIGKTFQEMVKVAAKDLILGGEEKEPKQESQEQLFNPFAIDEEMLEETQE---QQEE 341
Query: 722 QDEESEEENDS 732
++ E EEEND+
Sbjct: 342 EENEEEEENDT 352
>gnl|CDD|139605 PRK13484, PRK13484, putative iron-regulated outer membrane
virulence protein; Provisional.
Length = 682
Score = 29.6 bits (66), Expect = 9.4
Identities = 13/45 (28%), Positives = 23/45 (51%)
Query: 338 PAVIELNHKGRMTDKPNDDVSDILKKLESISVESGQDSTKLSFAG 382
PA + + ++ KP D+ D +K +E IS+ G + +S G
Sbjct: 48 PASVSVITSEQLQKKPVSDLVDAVKDVEGISITGGNEKPDISIRG 92
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.128 0.355
Gapped
Lambda K H
0.267 0.0685 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 47,249,974
Number of extensions: 4536590
Number of successful extensions: 6302
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5632
Number of HSP's successfully gapped: 195
Length of query: 999
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 893
Effective length of database: 6,236,078
Effective search space: 5568817654
Effective search space used: 5568817654
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 64 (28.5 bits)