RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy13068
(950 letters)
>gnl|CDD|215083 PLN00162, PLN00162, transport protein sec23; Provisional.
Length = 761
Score = 319 bits (820), Expect = 4e-96
Identities = 107/195 (54%), Positives = 144/195 (73%), Gaps = 1/195 (0%)
Query: 749 GAPIPQGGRGCIQFITHYQAPSGEKKVRVTTIARNWADATTQLDHISSGFDQEAAAVIMG 808
P P G + +QF+T YQ +G+ ++RVTT+ R W + + + + +GFDQEAAAV+M
Sbjct: 469 SNPQPPGQQFFLQFLTRYQHSNGQTRLRVTTVTRRWVEGS-SSEELVAGFDQEAAAVVMA 527
Query: 809 RMVVNRAEQDDGPDVMRWADRTLIRLCQKFGHYNKDDPHSFQLPENFQMYPQFMYHLRRS 868
R+ ++ E ++ D RW DR LIRLC KFG Y KDDP SF+L NF +YPQFM++LRRS
Sbjct: 528 RLASHKMETEEEFDATRWLDRALIRLCSKFGDYRKDDPSSFRLSPNFSLYPQFMFNLRRS 587
Query: 869 QFLQVFNNSPDETSYYRHVLMREDLSQSLIMIQPVLYSYGFNGPPVPVLLDTSSIKPDCI 928
QF+QVFNNSPDET+Y+R +L RE+++ SL+MIQP L SY FNGPP PVLLD +SI D I
Sbjct: 588 QFVQVFNNSPDETAYFRMMLNRENVTNSLVMIQPTLISYSFNGPPEPVLLDVASIAADRI 647
Query: 929 LLMDTFFQLLIFHGE 943
LL+D++F ++IFHG
Sbjct: 648 LLLDSYFSVVIFHGS 662
Score = 195 bits (498), Expect = 4e-52
Identities = 71/138 (51%), Positives = 89/138 (64%), Gaps = 29/138 (21%)
Query: 519 CQKFGHYNKDDPHSFQLPENFQMYPQFMYHLRRSQFLQVFNNSPDETSYYRTLLIHLPHY 578
C KFG Y KDDP SF+L NF +YPQFM++LRRSQF+QVF
Sbjct: 554 CSKFGDYRKDDPSSFRLSPNFSLYPQFMFNLRRSQFVQVF-------------------- 593
Query: 579 SRFQFLQVFNNSPDETSYYRHVLMREDLSQSLIMIQPVLYSYGFNGPPVPVLLDTSSIKP 638
NNSPDET+Y+R +L RE+++ SL+MIQP L SY FNGPP PVLLD +SI
Sbjct: 594 ---------NNSPDETAYFRMMLNRENVTNSLVMIQPTLISYSFNGPPEPVLLDVASIAA 644
Query: 639 DCILLMDTFFQLLIFHGE 656
D ILL+D++F ++IFHG
Sbjct: 645 DRILLLDSYFSVVIFHGS 662
Score = 189 bits (482), Expect = 4e-50
Identities = 71/132 (53%), Positives = 96/132 (72%), Gaps = 7/132 (5%)
Query: 203 EERDGIRCTWNVWPSSKLEASRLVMPVGCLYQPLKEKPDLPPLQYEPLLCMRNQCRAILN 262
E DG+R +WNVWPSSK+EAS+ V+P+ LY PLK P+LP L Y+PL C CRA+LN
Sbjct: 7 EAIDGVRMSWNVWPSSKIEASKCVIPLAALYTPLKPLPELPVLPYDPLRCRT--CRAVLN 64
Query: 263 PLCQVDYKSKLWVCNFCFQRNAFPPQYAAITEQHQPAELHPQFTTIEYTIPVSQCFPKYP 322
P C+VD+++K+W+C FCFQRN FPP Y++I+E + PAEL PQ+TT+EYT+P P
Sbjct: 65 PYCRVDFQAKIWICPFCFQRNHFPPHYSSISETNLPAELFPQYTTVEYTLPPGSGGAPSP 124
Query: 323 LRSLPVFLPFLI 334
PVF+ F++
Sbjct: 125 ----PVFV-FVV 131
Score = 128 bits (324), Expect = 2e-30
Identities = 44/75 (58%), Positives = 59/75 (78%), Gaps = 2/75 (2%)
Query: 351 SPNLPPLQYEPLLCMRNQCRAILNPLCQVDYKSKLWVCNFCFQRNAFPPQYAAITEQHQP 410
P LP L Y+PL C CRA+LNP C+VD+++K+W+C FCFQRN FPP Y++I+E + P
Sbjct: 43 LPELPVLPYDPLRCRT--CRAVLNPYCRVDFQAKIWICPFCFQRNHFPPHYSSISETNLP 100
Query: 411 AELHPQFTTIEYTIP 425
AEL PQ+TT+EYT+P
Sbjct: 101 AELFPQYTTVEYTLP 115
Score = 99.2 bits (248), Expect = 4e-21
Identities = 37/64 (57%), Positives = 47/64 (73%), Gaps = 2/64 (3%)
Query: 12 EERDGIRCTWNVWPSSKLEASRLVMPVGCLYQPLKEKPDLPPLQYEPLLCMRNQCRAILN 71
E DG+R +WNVWPSSK+EAS+ V+P+ LY PLK P+LP L Y+PL C CRA+LN
Sbjct: 7 EAIDGVRMSWNVWPSSKIEASKCVIPLAALYTPLKPLPELPVLPYDPLRCRT--CRAVLN 64
Query: 72 PLCQ 75
P C+
Sbjct: 65 PYCR 68
Score = 77.3 bits (191), Expect = 2e-14
Identities = 30/62 (48%), Positives = 39/62 (62%)
Query: 656 EEPIRSHNDIHKGNNKYQKMATKHYEGLALRAAIVGHAIDIYSCALDQTGLLEMKLCCNA 715
EPIRSH D+ K Y K A K YEGLA + GH +D+++C+LDQ G+ EMK+
Sbjct: 307 SEPIRSHKDLDKDAAPYYKKAVKFYEGLAKQLVAQGHVLDVFACSLDQVGVAEMKVAVER 366
Query: 716 TG 717
TG
Sbjct: 367 TG 368
>gnl|CDD|227380 COG5047, SEC23, Vesicle coat complex COPII, subunit SEC23
[Intracellular trafficking and secretion].
Length = 755
Score = 261 bits (669), Expect = 4e-75
Identities = 118/201 (58%), Positives = 140/201 (69%), Gaps = 3/201 (1%)
Query: 743 STTAAHGAPIPQGGRGCIQFITHYQAPSGEKKVRVTTIARNWADATTQLDHISSGFDQEA 802
+ AA G+ + IQFIT YQ SG ++RVTT+AR + D L I+ FDQEA
Sbjct: 459 ALGAASGSA-QRPAEAYIQFITTYQHSSGTYRIRVTTVARMFTDG--GLPKINRSFDQEA 515
Query: 803 AAVIMGRMVVNRAEQDDGPDVMRWADRTLIRLCQKFGHYNKDDPHSFQLPENFQMYPQFM 862
AAV M R+ +AE +D DV RW DR LIRLCQKF Y KDDP SF+L NF +YPQFM
Sbjct: 516 AAVFMARIAAFKAETEDIIDVFRWIDRNLIRLCQKFADYRKDDPSSFRLDPNFTLYPQFM 575
Query: 863 YHLRRSQFLQVFNNSPDETSYYRHVLMREDLSQSLIMIQPVLYSYGFNGPPVPVLLDTSS 922
YHLRRS FL VFNNSPDET++YRH+L D++ SLIMIQP L SY F VPVLLD+ S
Sbjct: 576 YHLRRSPFLSVFNNSPDETAFYRHMLNNADVNDSLIMIQPTLQSYSFEKGGVPVLLDSVS 635
Query: 923 IKPDCILLMDTFFQLLIFHGE 943
+KPD ILL+DTFF +LIFHG
Sbjct: 636 VKPDVILLLDTFFHILIFHGS 656
Score = 158 bits (400), Expect = 8e-40
Identities = 74/138 (53%), Positives = 85/138 (61%), Gaps = 29/138 (21%)
Query: 519 CQKFGHYNKDDPHSFQLPENFQMYPQFMYHLRRSQFLQVFNNSPDETSYYRTLLIHLPHY 578
CQKF Y KDDP SF+L NF +YPQFMYHLRRS FL VFNNSPDET
Sbjct: 548 CQKFADYRKDDPSSFRLDPNFTLYPQFMYHLRRSPFLSVFNNSPDET------------- 594
Query: 579 SRFQFLQVFNNSPDETSYYRHVLMREDLSQSLIMIQPVLYSYGFNGPPVPVLLDTSSIKP 638
++YRH+L D++ SLIMIQP L SY F VPVLLD+ S+KP
Sbjct: 595 ----------------AFYRHMLNNADVNDSLIMIQPTLQSYSFEKGGVPVLLDSVSVKP 638
Query: 639 DCILLMDTFFQLLIFHGE 656
D ILL+DTFF +LIFHG
Sbjct: 639 DVILLLDTFFHILIFHGS 656
Score = 151 bits (384), Expect = 7e-38
Identities = 60/132 (45%), Positives = 80/132 (60%), Gaps = 8/132 (6%)
Query: 199 FYQNEERDGIRCTWNVWPSSKLEASRLVMPVGCLYQPLKEKPDLPPLQYEPLLCMRNQCR 258
F EE DGIR TWNV+P+++ +A+R V+P+ CLY PL E L YEP+ C C+
Sbjct: 3 FEIIEENDGIRLTWNVFPATRGDATRTVIPIACLYTPLHEDDALTVNYYEPVKCTA-PCK 61
Query: 259 AILNPLCQVDYKSKLWVCNFCFQRNAFPPQYAAITEQHQPAELHPQFTTIEYTIPVSQCF 318
A+LNP C +D +++ W+C FC QRN PPQY I+ + P EL PQ +TIEYT+
Sbjct: 62 AVLNPYCHIDERNQSWICPFCNQRNTLPPQYRDISNANLPLELLPQSSTIEYTLSKPVIL 121
Query: 319 PKYPLRSLPVFL 330
P PVF
Sbjct: 122 P-------PVFF 126
Score = 95.7 bits (238), Expect = 4e-20
Identities = 33/67 (49%), Positives = 44/67 (65%), Gaps = 1/67 (1%)
Query: 8 FYQNEERDGIRCTWNVWPSSKLEASRLVMPVGCLYQPLKEKPDLPPLQYEPLLCMRNQCR 67
F EE DGIR TWNV+P+++ +A+R V+P+ CLY PL E L YEP+ C C+
Sbjct: 3 FEIIEENDGIRLTWNVFPATRGDATRTVIPIACLYTPLHEDDALTVNYYEPVKCTA-PCK 61
Query: 68 AILNPLC 74
A+LNP C
Sbjct: 62 AVLNPYC 68
Score = 89.2 bits (221), Expect = 4e-18
Identities = 34/74 (45%), Positives = 46/74 (62%), Gaps = 1/74 (1%)
Query: 352 PNLPPLQYEPLLCMRNQCRAILNPLCQVDYKSKLWVCNFCFQRNAFPPQYAAITEQHQPA 411
L YEP+ C C+A+LNP C +D +++ W+C FC QRN PPQY I+ + P
Sbjct: 44 DALTVNYYEPVKCTA-PCKAVLNPYCHIDERNQSWICPFCNQRNTLPPQYRDISNANLPL 102
Query: 412 ELHPQFTTIEYTIP 425
EL PQ +TIEYT+
Sbjct: 103 ELLPQSSTIEYTLS 116
Score = 62.2 bits (151), Expect = 8e-10
Identities = 30/61 (49%), Positives = 43/61 (70%)
Query: 657 EPIRSHNDIHKGNNKYQKMATKHYEGLALRAAIVGHAIDIYSCALDQTGLLEMKLCCNAT 716
EP+RSH+DI + ++ K ATK Y+GLA R A GHA+DI++ LDQ G++EM+ +T
Sbjct: 303 EPMRSHHDIESDSAQHSKKATKFYKGLAERVANQGHALDIFAGCLDQIGIMEMEPLTTST 362
Query: 717 G 717
G
Sbjct: 363 G 363
>gnl|CDD|218277 pfam04815, Sec23_helical, Sec23/Sec24 helical domain. COPII-coated
vesicles carry proteins from the endoplasmic reticulum
to the Golgi complex. This vesicular transport can be
reconstituted by using three cytosolic components
containing five proteins: the small GTPase Sar1p, the
Sec23p/24p complex, and the Sec13p/Sec31p complex. This
domain is composed of five alpha helices.
Length = 103
Score = 138 bits (350), Expect = 2e-38
Identities = 33/103 (32%), Positives = 52/103 (50%), Gaps = 2/103 (1%)
Query: 800 QEAAAVIMGRMVVNRAEQDDGPDVMRWADRTLIRLCQKFGHYNKDDPHS--FQLPENFQM 857
QEA AV++ + V +A D RW D+ L+ + + Y S LPE+ ++
Sbjct: 1 QEAIAVLLAKKAVEKALTSSLKDARRWLDKKLVDILAAYRKYCASSSSSGQLILPESLKL 60
Query: 858 YPQFMYHLRRSQFLQVFNNSPDETSYYRHVLMREDLSQSLIMI 900
P +M L +S L+ N SPDE +Y R +L+ + L+MI
Sbjct: 61 LPLYMLALLKSPALRGGNVSPDERAYARCLLLSLPVENLLLMI 103
Score = 85.3 bits (212), Expect = 6e-20
Identities = 22/116 (18%), Positives = 39/116 (33%), Gaps = 31/116 (26%)
Query: 500 QEDGNEEIEDERETKMEEECQKFGHYNKDDPHS--FQLPENFQMYPQFMYHLRRSQFLQV 557
++ + K+ + + Y S LPE+ ++ P +M L +S
Sbjct: 17 LTSSLKDARRWLDKKLVDILAAYRKYCASSSSSGQLILPESLKLLPLYMLALLKS----- 71
Query: 558 FNNSPDETSYYRTLLIHLPHYSRFQFLQVFNNSPDETSYYRHVLMREDLSQSLIMI 613
L+ N SPDE +Y R +L+ + L+MI
Sbjct: 72 ------------------------PALRGGNVSPDERAYARCLLLSLPVENLLLMI 103
>gnl|CDD|200443 cd11287, Sec23_C, C-terminal Actin depolymerization factor-homology
domain of Sec23. The C-terminal domain of the Sec23
subunit of the coat protein complex II (COPII) is
distantly related to gelsolin-like repeats and the actin
depolymerizing domains found in cofilin and similar
proteins. Sec23 forms a tight complex with Sec24. The
cytoplasmic Sec23/24 complex is recruited together with
Sar1-GTP and Sec13/31 to induce coat polymerization and
membrane deformation in the forming of COPII-coated
endoplasmic reticulum vesicles. The function of the
Sec23 C-terminal domain is unclear.
Length = 121
Score = 105 bits (263), Expect = 1e-26
Identities = 41/53 (77%), Positives = 46/53 (86%)
Query: 604 EDLSQSLIMIQPVLYSYGFNGPPVPVLLDTSSIKPDCILLMDTFFQLLIFHGE 656
ED+S SLIMIQP LYSY FNGPP PVLLD+SSI PD ILL+DTFF +LI+HGE
Sbjct: 1 EDVSNSLIMIQPTLYSYSFNGPPEPVLLDSSSILPDRILLLDTFFHILIYHGE 53
Score = 105 bits (263), Expect = 1e-26
Identities = 41/53 (77%), Positives = 46/53 (86%)
Query: 891 EDLSQSLIMIQPVLYSYGFNGPPVPVLLDTSSIKPDCILLMDTFFQLLIFHGE 943
ED+S SLIMIQP LYSY FNGPP PVLLD+SSI PD ILL+DTFF +LI+HGE
Sbjct: 1 EDVSNSLIMIQPTLYSYSFNGPPEPVLLDSSSILPDRILLLDTFFHILIYHGE 53
>gnl|CDD|238755 cd01478, Sec23-like, Sec23-like: Protein and membrane traffic in
eukaryotes is mediated by at least in part by the
budding and fusion of intracellular transport vesicles
that selectively carry cargo proteins and lipids from
donor to acceptor organelles. The two main classes of
vesicular carriers within the endocytic and the
biosynthetic pathways are COP- and clathrin-coated
vesicles. Formation of COPII vesicles requires the
ordered assembly of the coat built from several
cytosolic components GTPase Sar1, complexes of
Sec23-Sec24 and Sec13-Sec31. The process is initiated by
the conversion of GDP to GTP by the GTPase Sar1 which
then recruits the heterodimeric complex of Sec23 and
Sec24. This heterodimeric complex generates the
pre-budding complex. The final step leading to membrane
deformation and budding of COPII-coated vesicles is
carried by the heterodimeric complex Sec13-Sec31. The
members of this CD belong to the Sec23-like family. Sec
23 is very similar to Sec24. The Sec23 and Sec24
polypeptides fold into five distinct domains: a
beta-barrel, a zinc finger, a vWA or trunk, an all
helical region and a carboxy Gelsolin domain. The
members of this subgroup lack the consensus MIDAS motif
but have the overall Para-Rossmann type fold that is
characteristic of this superfamily.
Length = 267
Score = 102 bits (257), Expect = 4e-24
Identities = 39/72 (54%), Positives = 50/72 (69%), Gaps = 2/72 (2%)
Query: 657 EPIRSHNDIHKGNNKYQKMATKHYEGLALRAAIVGHAIDIYSCALDQTGLLEMKLCCNAT 716
+PIRSH+DI K N KY K A K Y+ LA R A GHA+DI++ LDQ GLLEMK+ N+T
Sbjct: 187 DPIRSHHDIDKDNAKYYKKAVKFYDSLAKRLAANGHAVDIFAGCLDQVGLLEMKVLVNST 246
Query: 717 GKPSHVLIAIEF 728
G HV+++ F
Sbjct: 247 G--GHVVLSDSF 256
>gnl|CDD|203092 pfam04810, zf-Sec23_Sec24, Sec23/Sec24 zinc finger. COPII-coated
vesicles carry proteins from the endoplasmic reticulum
to the Golgi complex. This vesicular transport can be
reconstituted by using three cytosolic components
containing five proteins: the small GTPase Sar1p, the
Sec23p/24p complex, and the Sec13p/Sec31p complex. This
domain is found to be zinc binding domain.
Length = 39
Score = 66.4 bits (163), Expect = 5e-14
Identities = 22/41 (53%), Positives = 26/41 (63%), Gaps = 2/41 (4%)
Query: 249 PLLCMRNQCRAILNPLCQVDYKSKLWVCNFCFQRNAFPPQY 289
P+ C R CRA LNP CQ D+ + W CN C QRN PP+Y
Sbjct: 1 PVRCRR--CRAYLNPYCQFDFGGRRWTCNLCGQRNDLPPEY 39
Score = 66.4 bits (163), Expect = 5e-14
Identities = 22/41 (53%), Positives = 26/41 (63%), Gaps = 2/41 (4%)
Query: 361 PLLCMRNQCRAILNPLCQVDYKSKLWVCNFCFQRNAFPPQY 401
P+ C R CRA LNP CQ D+ + W CN C QRN PP+Y
Sbjct: 1 PVRCRR--CRAYLNPYCQFDFGGRRWTCNLCGQRNDLPPEY 39
Score = 28.3 bits (64), Expect = 1.5
Identities = 11/18 (61%), Positives = 12/18 (66%), Gaps = 2/18 (11%)
Query: 58 PLLCMRNQCRAILNPLCQ 75
P+ C R CRA LNP CQ
Sbjct: 1 PVRCRR--CRAYLNPYCQ 16
>gnl|CDD|147125 pfam04811, Sec23_trunk, Sec23/Sec24 trunk domain. COPII-coated
vesicles carry proteins from the endoplasmic reticulum
to the Golgi complex. This vesicular transport can be
reconstituted by using three cytosolic components
containing five proteins: the small GTPase Sar1p, the
Sec23p/24p complex, and the Sec13p/Sec31p complex. This
domain is known as the trunk domain and has an
alpha/beta vWA fold and forms the dimer interface.
Length = 241
Score = 70.7 bits (174), Expect = 2e-13
Identities = 25/91 (27%), Positives = 39/91 (42%), Gaps = 15/91 (16%)
Query: 642 LLMDTFFQ---LLIFHGEEPI------------RSHNDIHKGNNKYQKMATKHYEGLALR 686
LL+ F +++F G P SH+D K K K A K Y+ LA
Sbjct: 125 LLLKAAFTGGKIMLFQGGLPTVGPGGKLKSRLDESHHDTDKEKAKLVKKADKFYKSLAKE 184
Query: 687 AAIVGHAIDIYSCALDQTGLLEMKLCCNATG 717
GH++D+++ +LD + E+ TG
Sbjct: 185 CVAQGHSVDLFAFSLDYVDVAELGCLSRLTG 215
>gnl|CDD|219707 pfam08033, Sec23_BS, Sec23/Sec24 beta-sandwich domain.
Length = 86
Score = 55.2 bits (134), Expect = 1e-09
Identities = 14/38 (36%), Positives = 20/38 (52%)
Query: 748 HGAPIPQGGRGCIQFITHYQAPSGEKKVRVTTIARNWA 785
P+ G + IQF Y SGE+++RVTT+A
Sbjct: 49 IDEPLDSGKQAYIQFALLYTHSSGERRIRVTTVALPVT 86
>gnl|CDD|238745 cd01468, trunk_domain, trunk domain. COPII-coated vesicles carry
proteins from the endoplasmic reticulum to the Golgi
complex. This vesicular transport can be reconstituted
by using three cytosolic components containing five
proteins: the small GTPase Sar1p, the Sec23p/24p
complex, and the Sec13p/Sec31p complex. This domain is
known as the trunk domain and has an alpha/beta vWA fold
and forms the dimer interface. Some members of this
family possess a partial MIDAS motif that is a
characteristic feature of most vWA domain proteins.
Length = 239
Score = 58.8 bits (143), Expect = 2e-09
Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 5/61 (8%)
Query: 657 EPIRSHNDIHKGNNKYQKMATKHYEGLALRAAIVGHAIDIYSCALDQTGLLEMKLCCNAT 716
EPIRSH++ + K ATK Y+ LA G +D+++ +LD + +K +T
Sbjct: 160 EPIRSHDEA-----QLLKPATKFYKSLAKECVKSGICVDLFAFSLDYVDVATLKQLAKST 214
Query: 717 G 717
G
Sbjct: 215 G 215
>gnl|CDD|200436 cd11280, gelsolin_like, Tandemly repeated domains found in
gelsolin, severin, villin, and related proteins.
Gelsolin repeats occur in gelsolin, severin, villin,
advillin, villidin, supervillin, flightless, quail,
fragmin, and other proteins, usually in several copies.
They co-occur with villin headpiece domains,
leucine-rich repeats, and several other domains. These
gelsolin-related actin binding proteins (GRABPs) play
regulatory roles in the assembly and disassembly of
actin filaments; they are involved in F-actin capping,
uncapping, severing, or the nucleation of actin
filaments. Severing of actin filaments is Ca2+
dependent. Villins are also linked to generating bundles
of F-actin with uniform filament polarity, which is most
likely mediated by their extra villin headpiece domain.
Many family members have also adopted functions in the
nucleus, including the regulation of transcription.
Supervillin, gelsolin, and flightless I are involved in
intracellular signaling via nuclear hormone receptors.
The gelsolin-like domain is distantly related to the
actin depolymerizing domains found in cofilin and
similar proteins.
Length = 88
Score = 42.7 bits (101), Expect = 3e-05
Identities = 10/43 (23%), Positives = 19/43 (44%)
Query: 904 LYSYGFNGPPVPVLLDTSSIKPDCILLMDTFFQLLIFHGEVSN 946
V L +SS+ D + ++DT ++ I+ G S+
Sbjct: 6 RVRGSKAIEIEEVPLASSSLDSDDVFVLDTGSEIYIWQGRASS 48
Score = 40.8 bits (96), Expect = 2e-04
Identities = 9/42 (21%), Positives = 17/42 (40%)
Query: 617 LYSYGFNGPPVPVLLDTSSIKPDCILLMDTFFQLLIFHGEEP 658
V L +SS+ D + ++DT ++ I+ G
Sbjct: 6 RVRGSKAIEIEEVPLASSSLDSDDVFVLDTGSEIYIWQGRAS 47
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 44.6 bits (105), Expect = 3e-04
Identities = 24/66 (36%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
G QE N E + + E+ + GDS E+EE EE+ E+EE++E+ EE E+E E
Sbjct: 833 GEQELNAENQGEAKQDEKGVDGGGGSDGGDS-EEEEEEEEEEEEEEEEEEEEEEEEEENE 891
Query: 513 TKMEEE 518
+ E
Sbjct: 892 EPLSLE 897
Score = 37.3 bits (86), Expect = 0.043
Identities = 22/119 (18%), Positives = 50/119 (42%), Gaps = 5/119 (4%)
Query: 436 NDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIE 495
++ + E + E + + E E +QE ++Q + + +E+ ++
Sbjct: 794 DEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVD 853
Query: 496 DEEDQEDGNEEIEDERETKMEEECQKFGHYNKDD-----PHSFQLPENFQMYPQFMYHL 549
+ G+ E E+E E + EEE ++ +++ P S + PE Q +++ L
Sbjct: 854 GGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQAIYLFLL 912
Score = 33.8 bits (77), Expect = 0.48
Identities = 19/70 (27%), Positives = 34/70 (48%), Gaps = 8/70 (11%)
Query: 455 QEENIE-EKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGN-------EE 506
E I E+ + E EA++ + +G+++ +E E E E +++G EE
Sbjct: 681 SEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEE 740
Query: 507 IEDERETKME 516
+EDE E + E
Sbjct: 741 VEDEGEGEAE 750
Score = 33.4 bits (76), Expect = 0.72
Identities = 19/81 (23%), Positives = 38/81 (46%), Gaps = 6/81 (7%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQ------EGQGDSQEDEEMEEQRIEDEEDQEDGNEEIE 508
+ + + K + ++E E E + EG+ ++ E+ E E E E + + E
Sbjct: 700 EAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEG 759
Query: 509 DERETKMEEECQKFGHYNKDD 529
D +ET+ E E + G ++D+
Sbjct: 760 DRKETEHEGETEAEGKEDEDE 780
Score = 32.3 bits (73), Expect = 1.6
Identities = 20/69 (28%), Positives = 30/69 (43%), Gaps = 5/69 (7%)
Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIE---D 509
G E E + + E +QEG+G+ + E + E EE + +G E E D
Sbjct: 671 GETETKGENESEGEIPAE--RKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTED 728
Query: 510 ERETKMEEE 518
E E + EE
Sbjct: 729 EGEIETGEE 737
Score = 31.5 bits (71), Expect = 2.7
Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 3/72 (4%)
Query: 455 QEENIEEKFDDSSQEEYFEAE---DQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
E E+ ++ E EAE + E +GD +E E E E +ED+++G + ++
Sbjct: 730 GEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDG 789
Query: 512 ETKMEEECQKFG 523
E K +E +
Sbjct: 790 EMKGDEGAEGKV 801
Score = 31.1 bits (70), Expect = 3.2
Identities = 19/70 (27%), Positives = 34/70 (48%), Gaps = 5/70 (7%)
Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNE----EIE 508
+ E E++ + + EE E + EG+G+++ E+E + E + E E E E
Sbjct: 720 ETEAEGTEDEGEIETGEEG-EEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDE 778
Query: 509 DERETKMEEE 518
DE E + E+
Sbjct: 779 DEGEIQAGED 788
Score = 31.1 bits (70), Expect = 3.3
Identities = 22/76 (28%), Positives = 35/76 (46%), Gaps = 13/76 (17%)
Query: 455 QEENIEEKFDDSSQEEYFEAE---DQEGQGDSQED------EEMEEQRIEDEEDQEDGNE 505
QE E K ++ S+ E AE +QEG+G+ + E E+ + E + +G E
Sbjct: 669 QEGETETKGENESEGE-IPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTE 727
Query: 506 ---EIEDERETKMEEE 518
EIE E + E+
Sbjct: 728 DEGEIETGEEGEEVED 743
Score = 30.0 bits (67), Expect = 9.2
Identities = 20/55 (36%), Positives = 25/55 (45%), Gaps = 4/55 (7%)
Query: 465 DSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNE-EIEDERETKMEEE 518
D S+ + EAE G+ +E E E +E G E E E E ETK E E
Sbjct: 629 DLSKGDVAEAEHT---GERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENE 680
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 43.8 bits (104), Expect = 4e-04
Identities = 27/107 (25%), Positives = 51/107 (47%), Gaps = 14/107 (13%)
Query: 456 EENIEEKFDDSSQEEY-------FEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIE 508
E + ++ F EE + ED+E D E+EE + ++EED+ED + + E
Sbjct: 307 EPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDE 366
Query: 509 DERETKMEEECQK----FGHYNKDDPHSFQLPENFQMYPQFMYHLRR 551
D+ E + EE+ +K + P +F P++ + +F+ L+
Sbjct: 367 DDEEEEEEEKEKKKKKSAESTRSELPFTFPCPKS---HEEFLELLKG 410
Score = 38.1 bits (89), Expect = 0.027
Identities = 25/65 (38%), Positives = 36/65 (55%), Gaps = 2/65 (3%)
Query: 456 EENIEEKFDDSSQEEYFEAED--QEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
EE+ +E DD E + +D GQG+ E+EE + EDEED +D EE E++ +
Sbjct: 292 EEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDL 351
Query: 514 KMEEE 518
EEE
Sbjct: 352 SDEEE 356
Score = 35.7 bits (83), Expect = 0.14
Identities = 17/57 (29%), Positives = 32/57 (56%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
EE ++ D +E+ +D E++E EE+ D+ED+ED ++++E+E E
Sbjct: 291 EEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEE 347
Score = 33.4 bits (77), Expect = 0.75
Identities = 17/60 (28%), Positives = 33/60 (55%), Gaps = 5/60 (8%)
Query: 464 DDSSQEEYFEAEDQEGQGDSQEDEEM-EEQRIEDEEDQEDG----NEEIEDERETKMEEE 518
D+ ++ A+D + + + +D+ Q EDEE++EDG +EE +D+ + EE+
Sbjct: 289 DEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEED 348
Score = 30.7 bits (70), Expect = 4.6
Identities = 13/64 (20%), Positives = 25/64 (39%), Gaps = 9/64 (14%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK-- 514
I++ D ++ D + GD + ED E+ EE + +++K
Sbjct: 124 SEIDKDDDVRDDDD----FDDDDLGDL--ASDDRAAHFGGGEDDEEDEEEQPERKKSKKE 177
Query: 515 -MEE 517
M+E
Sbjct: 178 VMKE 181
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 41.4 bits (97), Expect = 5e-04
Identities = 25/69 (36%), Positives = 36/69 (52%), Gaps = 6/69 (8%)
Query: 473 EAED--QEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQKFGHYNKDDP 530
E ED E Q D +E+EE +E+ IE+ ED ED E +EDE E + +EE + D
Sbjct: 39 ENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNV----DLKDI 94
Query: 531 HSFQLPENF 539
+ + F
Sbjct: 95 EKKNINDIF 103
Score = 41.0 bits (96), Expect = 8e-04
Identities = 19/61 (31%), Positives = 36/61 (59%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
E+ +EK +D +EE + E+ E D +++EE+ E E+EED+ED + + E++
Sbjct: 41 EDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNIN 100
Query: 516 E 516
+
Sbjct: 101 D 101
Score = 38.7 bits (90), Expect = 0.005
Identities = 21/64 (32%), Positives = 38/64 (59%), Gaps = 3/64 (4%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDE--EDQEDGNEEIEDERETK 514
E+I ++ +D E+ + E++E + D +E EE E+ E+E ED+E+ E+ ED + K
Sbjct: 34 EDIIKENEDVKDEKQEDDEEEE-EEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLK 92
Query: 515 MEEE 518
E+
Sbjct: 93 DIEK 96
Score = 34.4 bits (79), Expect = 0.13
Identities = 24/99 (24%), Positives = 47/99 (47%), Gaps = 14/99 (14%)
Query: 433 ITENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQ 492
I EN++V + +++ EE+ D+ EE + ED+E + +E+EE +E+
Sbjct: 37 IKENEDVKDEKQ----------EDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEE 86
Query: 493 RIEDEEDQEDGNEEIEDERETKMEEECQKF--GHYNKDD 529
D +D E + I D + ++ Q +Y K++
Sbjct: 87 DNVDLKDIE--KKNINDIFNSTQDDNAQNLISKNYKKNE 123
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 39.9 bits (93), Expect = 6e-04
Identities = 22/64 (34%), Positives = 34/64 (53%), Gaps = 10/64 (15%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGN-----EEIEDE 510
EEN ++ DD E E++ + D +E+ E EE+ E+EE+ E E+ ED+
Sbjct: 40 EENGAQEGDDEM-----EEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDD 94
Query: 511 RETK 514
ETK
Sbjct: 95 AETK 98
Score = 38.4 bits (89), Expect = 0.002
Identities = 16/53 (30%), Positives = 26/53 (49%)
Query: 461 EKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
E ++ E+ +GD + +EE E ++EE+ E EE E+E ET
Sbjct: 27 ENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEET 79
Score = 35.3 bits (81), Expect = 0.025
Identities = 19/62 (30%), Positives = 32/62 (51%), Gaps = 4/62 (6%)
Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQ---RI-EDEEDQEDGNEEIE 508
G QE + E + ++ E+ E E + + + +E+EE E R EDEED + ++
Sbjct: 43 GAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQKT 102
Query: 509 DE 510
DE
Sbjct: 103 DE 104
Score = 33.8 bits (77), Expect = 0.093
Identities = 12/44 (27%), Positives = 26/44 (59%)
Query: 475 EDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
E++E +DE EE+ +++++++E+G E E+ E + E
Sbjct: 38 ENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEG 81
Score = 30.3 bits (68), Expect = 1.5
Identities = 14/50 (28%), Positives = 25/50 (50%)
Query: 469 EEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
EE ++ G+ E+ +E E EE++E ++ E+E E + EE
Sbjct: 24 EEKENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEG 73
>gnl|CDD|227361 COG5028, COG5028, Vesicle coat complex COPII, subunit SEC24/subunit
SFB2/subunit SFB3 [Intracellular trafficking and
secretion].
Length = 861
Score = 42.5 bits (100), Expect = 0.001
Identities = 33/170 (19%), Positives = 63/170 (37%), Gaps = 9/170 (5%)
Query: 759 CIQFITHYQAPSGEKKVRVTTIARNWADATTQLDHISSGFDQEAAAVIMGRMVVNRAEQD 818
Q Y GE+++RV ++ ++ + + + DQ A A I+ + +A
Sbjct: 568 YFQVALLYTLNDGERRIRVVNLS---LPTSSSIREVYASADQLAIACILAKKASTKALNS 624
Query: 819 DGPDVMRWADRTLIRLCQ--KFGHYNKDDPHSFQLPENFQMYPQFMYHLRRSQFLQVFNN 876
+ +++++ + + K + LP N ++ P M L +S + +
Sbjct: 625 SLKEARVLINKSMVDILKAYKKELVKSNTSTQLPLPANLKLLPLLMLALLKSSAFRSGST 684
Query: 877 SPDETSYYRHVLMREDLSQSLIMIQPVLYS----YGFNGPPVPVLLDTSS 922
D + L L Q + I P LY+ G P LL S
Sbjct: 685 PSDIRISALNRLTSLPLKQLMRNIYPTLYALHDMPIEAGLPDEGLLVLPS 734
Score = 42.1 bits (99), Expect = 0.002
Identities = 30/118 (25%), Positives = 46/118 (38%), Gaps = 14/118 (11%)
Query: 182 QLSLETLKISMTTYEEFF---YQNEERDGIRCTWNVWP--SSKLEASRLVMPVGCLYQP- 235
L L I F N +R T P + L+ S++ P G + +P
Sbjct: 124 DLFLPPPPIVPPLTTNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKI--PFGLVIRPF 181
Query: 236 --LKEKPDLPPLQYE--PLLCMRNQCRAILNPLCQVDYKSKLWVCNFCFQRNAFPPQY 289
L + D PL + + C R CR+ +NP Q + + W CN C +N P +
Sbjct: 182 LELYPEEDPVPLVEDGSIVRCRR--CRSYINPFVQFIEQGRKWRCNICRSKNDVPEGF 237
Score = 39.8 bits (93), Expect = 0.007
Identities = 35/158 (22%), Positives = 56/158 (35%), Gaps = 38/158 (24%)
Query: 298 PAELHPQFTTIEYTIPVSQCFPKYPLRSLPVFLPFLIQFVFFLAITHLINQVGSP----- 352
P + P TT S C PKY +RS +P L+ + P
Sbjct: 129 PPPIVPPLTTNFVGSEQSNCSPKY-VRSTMYAIP---------ETNDLLKKSKIPFGLVI 178
Query: 353 ----------NLPPLQYE--PLLCMRNQCRAILNPLCQVDYKSKLWVCNFCFQRNAFPPQ 400
+ PL + + C R CR+ +NP Q + + W CN C +N P
Sbjct: 179 RPFLELYPEEDPVPLVEDGSIVRCRR--CRSYINPFVQFIEQGRKWRCNICRSKNDVPEG 236
Query: 401 YAAITEQHQP-------AELHPQFTTIEYTIPLLYSIT 431
+ + + P EL + +++ P YS+
Sbjct: 237 FDNPSGPNDPRSDRYSRPEL--KSGVVDFLAPKEYSLR 272
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 41.8 bits (98), Expect = 0.001
Identities = 30/98 (30%), Positives = 50/98 (51%), Gaps = 7/98 (7%)
Query: 469 EEYFEAEDQEGQGDSQEDEEMEEQ--RIEDEEDQEDGN-EEIEDERETKMEEECQKFGHY 525
+EA D E +E+E + ++ +E E+D DG E+++E+E EE Q Y
Sbjct: 62 ISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREY 121
Query: 526 NKDDPHSFQLPENFQ-MYPQFMYHLRRSQFLQ---VFN 559
N D ++ QL +N Q + Q+ Y L + L+ +FN
Sbjct: 122 NLFDRNNLQLEDNLQSLELQYEYSLNQLDKLRKTNIFN 159
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 41.2 bits (97), Expect = 0.002
Identities = 21/64 (32%), Positives = 32/64 (50%), Gaps = 2/64 (3%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
+ E E DD +EE +E E+ E D E ++E + + D ED EE +DE K
Sbjct: 87 EAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSED--EEEKDEAAKK 144
Query: 515 MEEE 518
+E+
Sbjct: 145 AKED 148
Score = 38.9 bits (91), Expect = 0.008
Identities = 17/82 (20%), Positives = 34/82 (41%), Gaps = 4/82 (4%)
Query: 444 EIHIKPRICGL----QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEED 499
E + I GL + + E K ++ Q + +D E + E++E + E +
Sbjct: 62 EEEVVDGIPGLELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDV 121
Query: 500 QEDGNEEIEDERETKMEEECQK 521
+ D E D + + ++E K
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAK 143
Score = 36.6 bits (85), Expect = 0.047
Identities = 20/68 (29%), Positives = 39/68 (57%), Gaps = 5/68 (7%)
Query: 456 EENIEEKFDDSSQEEYF----EAEDQEG-QGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
E+ +E DD +EE+ E D EG D + D+E+E EDEE++++ ++ +++
Sbjct: 89 EQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKED 148
Query: 511 RETKMEEE 518
+ ++ EE
Sbjct: 149 SDEELSEE 156
Score = 30.4 bits (69), Expect = 4.2
Identities = 17/59 (28%), Positives = 32/59 (54%), Gaps = 1/59 (1%)
Query: 464 DDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGN-EEIEDERETKMEEECQK 521
DD + E++ + DS+++EE +E + +ED ++ EE E+E + E E +K
Sbjct: 113 DDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEK 171
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 40.4 bits (94), Expect = 0.003
Identities = 22/84 (26%), Positives = 37/84 (44%), Gaps = 7/84 (8%)
Query: 440 IRRNEIHIKPRIC--GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDE 497
+ R + KPR+ + +N +E D++ + D E + +E + EE+ EDE
Sbjct: 140 VEREILAEKPRVTRFNIVWDN-DEDNDEAPPAQ--PDVDNEEEERLEESDGREEE--EDE 194
Query: 498 EDQEDGNEEIEDERETKMEEECQK 521
E D E E + EEE +
Sbjct: 195 EVGSDSYGEGNRELNEEEEEEAEG 218
Score = 40.4 bits (94), Expect = 0.003
Identities = 19/64 (29%), Positives = 30/64 (46%), Gaps = 4/64 (6%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
+ + EE+ D EE EG + E+EE E + +D ED D E D+++ +
Sbjct: 184 ESDGREEEED----EEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGE 239
Query: 515 MEEE 518
EE
Sbjct: 240 EEEM 243
Score = 39.6 bits (92), Expect = 0.005
Identities = 26/77 (33%), Positives = 36/77 (46%), Gaps = 12/77 (15%)
Query: 453 GLQEENIEEKFDDSSQEEYFEA-----EDQEGQGDSQEDEEMEEQRIED------EEDQE 501
G +EE EE DS E E E+ EG D ++ + E +RI+ E ++E
Sbjct: 187 GREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEE 246
Query: 502 DGN-EEIEDERETKMEE 517
N EIE E E+ EE
Sbjct: 247 VINLFEIEWEEESPSEE 263
Score = 39.2 bits (91), Expect = 0.006
Identities = 17/64 (26%), Positives = 33/64 (51%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
+EE +EE +E+ D G+G+ + +EE EE+ ++ ++ + E E + +
Sbjct: 178 EEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQ 237
Query: 515 MEEE 518
EEE
Sbjct: 238 GEEE 241
Score = 38.8 bits (90), Expect = 0.008
Identities = 18/86 (20%), Positives = 38/86 (44%), Gaps = 4/86 (4%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGD----SQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
+ D+ +E E++ +E + D S E + E+EE++ +G+++ ED +
Sbjct: 168 PPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVD 227
Query: 513 TKMEEECQKFGHYNKDDPHSFQLPEN 538
+ E +K G + + L E
Sbjct: 228 YEGERIDKKQGEEEEMEEEVINLFEI 253
Score = 33.0 bits (75), Expect = 0.54
Identities = 14/69 (20%), Positives = 27/69 (39%), Gaps = 2/69 (2%)
Query: 449 PRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIE 508
P + E E + +EE + E E EE+ E+E + D E++
Sbjct: 169 PAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEE--EEEAEGSDDGEDVV 226
Query: 509 DERETKMEE 517
D ++++
Sbjct: 227 DYEGERIDK 235
Score = 33.0 bits (75), Expect = 0.57
Identities = 16/63 (25%), Positives = 25/63 (39%), Gaps = 3/63 (4%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
+EE E DD +E E + Q +EE E+ + + + E E +E
Sbjct: 211 EEEEEAEGSDDGEDVVDYEGERID---KKQGEEEEMEEEVINLFEIEWEEESPSEEVPRN 267
Query: 515 MEE 517
EE
Sbjct: 268 NEE 270
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 40.7 bits (95), Expect = 0.003
Identities = 21/90 (23%), Positives = 37/90 (41%), Gaps = 3/90 (3%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
+ + E+ DD +EE E+ + E D E + E E + +E ED+ E KM
Sbjct: 108 DMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNES--ESSLEDLSDDET-EDDEEKKM 164
Query: 516 EEECQKFGHYNKDDPHSFQLPENFQMYPQF 545
EEE + + + + + +F
Sbjct: 165 EEEEAGEEKESVEQATREKKFDKSGVDDKF 194
Score = 38.8 bits (90), Expect = 0.015
Identities = 19/80 (23%), Positives = 36/80 (45%), Gaps = 7/80 (8%)
Query: 450 RICGLQEENIEEKFDDSSQEEYFEAEDQEG-QGDSQEDEEMEEQRIEDE------EDQED 502
I L + E +D S ++ED + + +EDE +E++ I+DE + E
Sbjct: 87 HIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESES 146
Query: 503 GNEEIEDERETKMEEECQKF 522
E++ D+ EE+ +
Sbjct: 147 SLEDLSDDETEDDEEKKMEE 166
Score = 33.0 bits (75), Expect = 0.84
Identities = 13/61 (21%), Positives = 30/61 (49%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
+E ++++ D +EE E ++QE D +E E +++ +E +G + + +
Sbjct: 282 KEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSF 341
Query: 516 E 516
E
Sbjct: 342 E 342
Score = 32.7 bits (74), Expect = 1.2
Identities = 17/57 (29%), Positives = 25/57 (43%), Gaps = 2/57 (3%)
Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKME 516
+EK + E ED E ++ + E + EDEED E E+ +DE E
Sbjct: 261 KEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDE--QEDDQDEEEPPEA 315
Score = 30.3 bits (68), Expect = 5.9
Identities = 15/87 (17%), Positives = 37/87 (42%), Gaps = 1/87 (1%)
Query: 444 EIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEM-EEQRIEDEEDQED 502
++ KP + L ++ +E + + +D E D + +M E +DEE++E+
Sbjct: 64 KLQNKPILDDLNQKYVEFLINKEHIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEE 123
Query: 503 GNEEIEDERETKMEEECQKFGHYNKDD 529
++ + + E + + +D
Sbjct: 124 DESLEDEMIDDEDEADLFNESESSLED 150
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 37.4 bits (86), Expect = 0.005
Identities = 12/31 (38%), Positives = 21/31 (67%)
Query: 475 EDQEGQGDSQEDEEMEEQRIEDEEDQEDGNE 505
ED + D +ED++ E++ +DE+D ED +E
Sbjct: 10 EDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40
Score = 31.7 bits (71), Expect = 0.37
Identities = 13/33 (39%), Positives = 23/33 (69%), Gaps = 2/33 (6%)
Query: 478 EGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
EG+ DS D + EE +D+ED+ED +E+ +++
Sbjct: 7 EGEEDSDSDSDEEED--DDDEDEEDDDEDDDED 37
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 38.9 bits (90), Expect = 0.005
Identities = 24/84 (28%), Positives = 47/84 (55%), Gaps = 5/84 (5%)
Query: 440 IRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIED--- 496
+++++I K + QE+ + + + EE E E +E + D +ED+E +E+ E+
Sbjct: 110 VKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTI 169
Query: 497 EEDQEDGNEEI--EDERETKMEEE 518
EE +D EI +DE ET+ +++
Sbjct: 170 EEQSDDSEHEIIEQDESETESDDD 193
Score = 30.8 bits (69), Expect = 2.3
Identities = 21/81 (25%), Positives = 44/81 (54%), Gaps = 4/81 (4%)
Query: 445 IHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEME-EQRIEDEEDQEDG 503
I I+ ++ +++++I++K ++E E QE + + +E ME E+R E +++ED
Sbjct: 101 IRIRDQLRSVKKDDIKKKKSLIIRQEQIEKARQERE---ELEERMEWERREEKIDEREDQ 157
Query: 504 NEEIEDERETKMEEECQKFGH 524
E+ + E +EE+ H
Sbjct: 158 EEQEREREEQTIEEQSDDSEH 178
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 39.9 bits (94), Expect = 0.006
Identities = 24/89 (26%), Positives = 42/89 (47%), Gaps = 6/89 (6%)
Query: 435 ENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQ-EEYFEAEDQEGQGDSQEDEEMEEQR 493
E + RRNE+ R +EE ++ K + + EE E +++E + +E EE+
Sbjct: 68 ERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEEL 127
Query: 494 IEDEEDQEDGNEEI-----EDERETKMEE 517
E +Q + E I E+ +E +EE
Sbjct: 128 EELIAEQREELERISGLTQEEAKEILLEE 156
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 38.6 bits (90), Expect = 0.008
Identities = 19/63 (30%), Positives = 36/63 (57%), Gaps = 4/63 (6%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
+ E K +D +EE E E+ E + ++ EE E +IE+ +D+ N ++++E + K+
Sbjct: 18 CKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDE---NNKLKEENK-KL 73
Query: 516 EEE 518
E E
Sbjct: 74 ENE 76
Score = 38.2 bits (89), Expect = 0.009
Identities = 23/73 (31%), Positives = 37/73 (50%), Gaps = 9/73 (12%)
Query: 456 EENIEEKFDDSSQEEYF-EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE-- 512
E+ ++ ++ +E+ E E++E +ED E EE IE EE ED E E + E
Sbjct: 2 EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEE--IEKEEIIEDSEESNEVKIEEL 59
Query: 513 ----TKMEEECQK 521
K++EE +K
Sbjct: 60 KDENNKLKEENKK 72
Score = 37.8 bits (88), Expect = 0.013
Identities = 19/63 (30%), Positives = 34/63 (53%), Gaps = 1/63 (1%)
Query: 452 CGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDE-EDQEDGNEEIEDE 510
C + EN EE E+ E E +E DS+E E++ + ++DE ++ N+++E+E
Sbjct: 17 CCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKKLENE 76
Query: 511 RET 513
E
Sbjct: 77 LEA 79
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 38.6 bits (90), Expect = 0.008
Identities = 13/56 (23%), Positives = 30/56 (53%)
Query: 463 FDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
F + ++ + E + E E+++E+ +DEE++E+ EE ED + +++
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197
Score = 34.7 bits (80), Expect = 0.15
Identities = 11/43 (25%), Positives = 22/43 (51%)
Query: 466 SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIE 508
E E+ E + +E+EE E++ +D++D +D + E
Sbjct: 161 KELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDDYNAE 203
Score = 33.6 bits (77), Expect = 0.32
Identities = 18/56 (32%), Positives = 34/56 (60%), Gaps = 5/56 (8%)
Query: 456 EENIEEKFDDSSQEEYF-EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
EE+I+EK ++ EAED + + + E+EE EE +EED++ +++ +D+
Sbjct: 146 EEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEE----EEEDEDFDDDDDDDD 197
Score = 29.3 bits (66), Expect = 7.4
Identities = 11/54 (20%), Positives = 28/54 (51%), Gaps = 2/54 (3%)
Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEE 506
L + K ++ + + +D+E + + +E++E + +D++D +D N E
Sbjct: 152 KLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDD--DDDDDDDDYNAE 203
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 37.7 bits (88), Expect = 0.009
Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 2/66 (3%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDS-QEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
E E++ D+SS EE + D E +S E+E++ R E+E D E E + E E
Sbjct: 15 ELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEA-EAEFDREFEKM 73
Query: 515 MEEECQ 520
M E +
Sbjct: 74 MAESLE 79
Score = 34.3 bits (79), Expect = 0.13
Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 4/59 (6%)
Query: 464 DDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEE-DQEDGNEEI---EDERETKMEEE 518
D +E+ E+ E S E+E +DEE D E+ + E+E + + E E
Sbjct: 7 SDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEAEAE 65
Score = 33.5 bits (77), Expect = 0.27
Identities = 15/55 (27%), Positives = 27/55 (49%), Gaps = 1/55 (1%)
Query: 464 DDSSQEEYFEAEDQ-EGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE 517
+S ++ E E+ E D + +E E +DE+D+E +EE + + EE
Sbjct: 3 SESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEE 57
Score = 29.7 bits (67), Expect = 4.5
Identities = 18/57 (31%), Positives = 28/57 (49%), Gaps = 3/57 (5%)
Query: 465 DSSQEEYFEAEDQEGQGDSQEDEE---MEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
S+ E + E+ E + ED+E EE + D+E E+ + E E T+ EEE
Sbjct: 1 SGSESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEE 57
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 38.6 bits (90), Expect = 0.010
Identities = 21/83 (25%), Positives = 34/83 (40%), Gaps = 10/83 (12%)
Query: 443 NEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEE---MEE-QRIEDEE 498
E IK + N + DDS ++ D + D EDE + E ++I+ E
Sbjct: 101 EEDEIKRKRIEEDARNSDA--DDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKER 158
Query: 499 DQEDGNEEIEDERETKMEEECQK 521
+ E+ +E E EEE +
Sbjct: 159 AE----EKEREEEEKAAEEEKAR 177
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 38.1 bits (89), Expect = 0.013
Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 2/74 (2%)
Query: 450 RICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRI--EDEEDQEDGNEEI 507
R+ L EE +EE + + E ED E +E+EE E + E+D+ + ++E
Sbjct: 13 RMKKLLEEELEEDEFFWTYLLFEEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEE 72
Query: 508 EDERETKMEEECQK 521
E E+E + EE +K
Sbjct: 73 EGEKELQREERLKK 86
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 38.2 bits (88), Expect = 0.024
Identities = 23/67 (34%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIED-EEDQEDGNEEIEDERETK 514
EENIEE ++ +E E E+ + D + EE+EE E+ EE+ E+ EE ++E +
Sbjct: 1012 EENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEE 1071
Query: 515 MEEECQK 521
+EE ++
Sbjct: 1072 IEENIEE 1078
Score = 37.1 bits (85), Expect = 0.062
Identities = 31/100 (31%), Positives = 48/100 (48%), Gaps = 6/100 (6%)
Query: 432 YITENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEE 491
Y EN + N EEN EE +++ EE E D+E + +E EE E
Sbjct: 1022 YDEENVEEVEENVEEYDEENVEEIEENAEENVEENI-EENIEEYDEE---NVEEIEENIE 1077
Query: 492 QRIED--EEDQEDGNEEIEDERETKMEEECQKFGHYNKDD 529
+ IE+ EE+ E+ EEIE+ E +EE ++ N ++
Sbjct: 1078 ENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEE 1117
Score = 35.5 bits (81), Expect = 0.18
Identities = 22/74 (29%), Positives = 41/74 (55%), Gaps = 2/74 (2%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
EENIEE +++ +E E ++ + + +E EE E+ DEE+ E+ E E+ E +
Sbjct: 1000 EENIEENVEENVEENIEENVEEYDEENVEEVEENVEEY--DEENVEEIEENAEENVEENI 1057
Query: 516 EEECQKFGHYNKDD 529
EE +++ N ++
Sbjct: 1058 EENIEEYDEENVEE 1071
Score = 34.4 bits (78), Expect = 0.40
Identities = 25/78 (32%), Positives = 40/78 (51%), Gaps = 5/78 (6%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQ-EGQGDSQEDEEMEEQRIEDEEDQEDGNEE-IEDERET 513
EENIEE ++ +E E E+ E + +E +EE E EE+ E+ EE E+ E
Sbjct: 1054 EENIEENIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEE 1113
Query: 514 KMEEECQKFGHYNKDDPH 531
EE ++ Y+ ++P
Sbjct: 1114 NAEENAEE---YDDENPE 1128
Score = 33.6 bits (76), Expect = 0.59
Identities = 30/88 (34%), Positives = 50/88 (56%), Gaps = 5/88 (5%)
Query: 437 DNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQED-EEMEEQRIE 495
+NV E +I+ + EEN+EE + + EEY E +E + +++E+ EE E+ IE
Sbjct: 1005 ENVEENVEENIEENVEEYDEENVEEV--EENVEEYDEENVEEIEENAEENVEENIEENIE 1062
Query: 496 --DEEDQEDGNEEIEDERETKMEEECQK 521
DEE+ E+ E IE+ E +EE ++
Sbjct: 1063 EYDEENVEEIEENIEENIEENVEENVEE 1090
Score = 33.2 bits (75), Expect = 0.92
Identities = 20/69 (28%), Positives = 35/69 (50%), Gaps = 6/69 (8%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEE------MEEQRIEDEEDQEDGNEEIED 509
+E++EE D S E+ + E E Q ++ EE +EE + D +QE +E +E+
Sbjct: 455 DEHVEEHNADDSGEQQSDDESGEHQSVNEIVEEQSVNEHVEEPTVADIVEQETVDEHVEE 514
Query: 510 ERETKMEEE 518
+ EE+
Sbjct: 515 PAVDENEEQ 523
Score = 33.2 bits (75), Expect = 0.93
Identities = 18/59 (30%), Positives = 31/59 (52%), Gaps = 5/59 (8%)
Query: 465 DSSQEEYFEAEDQEGQGDSQE-----DEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
D S++E E ++E GD+ E DE +EE +D +Q+ +E E + ++ EE
Sbjct: 429 DGSEDESVEDNEEEHSGDANEEELSVDEHVEEHNADDSGEQQSDDESGEHQSVNEIVEE 487
Score = 32.8 bits (74), Expect = 1.2
Identities = 18/62 (29%), Positives = 33/62 (53%), Gaps = 2/62 (3%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
EENIEE +++ +E E E+ + + EE E+ EE+ E+ ++E +E +
Sbjct: 1077 EENIEENVEENVEENVEEIEENVEENVEENAEENAEENA--EENAEEYDDENPEEHNEEY 1134
Query: 516 EE 517
+E
Sbjct: 1135 DE 1136
Score = 32.8 bits (74), Expect = 1.2
Identities = 21/77 (27%), Positives = 46/77 (59%), Gaps = 3/77 (3%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQED-EEMEEQRIED--EEDQEDGNEEIEDERE 512
EEN+EE +++ +E E ++ + + +E+ EE E+ +E+ EE+ E+ EE ++E
Sbjct: 968 EENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENV 1027
Query: 513 TKMEEECQKFGHYNKDD 529
++EE +++ N ++
Sbjct: 1028 EEVEENVEEYDEENVEE 1044
Score = 32.5 bits (73), Expect = 1.2
Identities = 27/99 (27%), Positives = 55/99 (55%), Gaps = 4/99 (4%)
Query: 435 ENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQED-EEMEEQR 493
+N+NV + H + I EEN+EE +++ +E E ++ + + +E+ EE E+
Sbjct: 931 QNENVPEHLKEHAEANIEEDAEENVEEDAEENVEENVEENVEENVEENVEENVEENVEEN 990
Query: 494 IED--EEDQEDGNEE-IEDERETKMEEECQKFGHYNKDD 529
+E+ EE+ E+ EE +E+ E +EE +++ N ++
Sbjct: 991 VEENVEENVEENIEENVEENVEENIEENVEEYDEENVEE 1029
Score = 31.7 bits (71), Expect = 2.4
Identities = 22/76 (28%), Positives = 37/76 (48%), Gaps = 10/76 (13%)
Query: 456 EENIEEKFDDSSQEEYFE----------AEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNE 505
EEN+EE +++ +E E E+ E + ++E +EE EE E+ E
Sbjct: 984 EENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVE 1043
Query: 506 EIEDERETKMEEECQK 521
EIE+ E +EE ++
Sbjct: 1044 EIEENAEENVEENIEE 1059
Score = 31.3 bits (70), Expect = 3.5
Identities = 19/64 (29%), Positives = 33/64 (51%), Gaps = 2/64 (3%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIED--EEDQEDGNEEIEDERETK 514
E IEE +++ +E E ++ + + EE E+ E+ EE+ E+ EE +DE +
Sbjct: 1070 EEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEE 1129
Query: 515 MEEE 518
EE
Sbjct: 1130 HNEE 1133
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 36.0 bits (83), Expect = 0.025
Identities = 12/41 (29%), Positives = 26/41 (63%)
Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
+ ++ + +EDEE++E E EE+++ + ++E+ERE
Sbjct: 23 KEDEGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREV 63
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 37.6 bits (87), Expect = 0.032
Identities = 15/70 (21%), Positives = 25/70 (35%), Gaps = 2/70 (2%)
Query: 461 EKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQ 520
+ DD +EE + ++ D +E E+ E E +E +E E + EE
Sbjct: 275 DDGDDEGREEDYISDSSASGNDPEEREDKLSP--EIPAKPEIEQDEDSEESEEEKNEEEG 332
Query: 521 KFGHYNKDDP 530
K
Sbjct: 333 GLSKKGKKLK 342
Score = 37.6 bits (87), Expect = 0.035
Identities = 15/62 (24%), Positives = 25/62 (40%), Gaps = 1/62 (1%)
Query: 456 EENIEEKF-DDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
+E EE + DSS E ++ + EQ + EE +E+ NEE +
Sbjct: 279 DEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKG 338
Query: 515 ME 516
+
Sbjct: 339 KK 340
Score = 31.5 bits (71), Expect = 2.3
Identities = 16/61 (26%), Positives = 24/61 (39%), Gaps = 2/61 (3%)
Query: 458 NIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE 517
N ++ DD + + + E D +DE EE I D GN+ E E + E
Sbjct: 251 NKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSA--SGNDPEEREDKLSPEI 308
Query: 518 E 518
Sbjct: 309 P 309
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 37.6 bits (87), Expect = 0.036
Identities = 19/64 (29%), Positives = 33/64 (51%), Gaps = 2/64 (3%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
+ EE+ ++ +EE E E +E +G+ +E+EE E EE+ E +E D E +
Sbjct: 443 ESVEEEEEEEEEEEEE--EQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPE 500
Query: 515 MEEE 518
+ E
Sbjct: 501 EDAE 504
Score = 37.6 bits (87), Expect = 0.039
Identities = 22/68 (32%), Positives = 34/68 (50%), Gaps = 7/68 (10%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEED-------QEDGNEEIE 508
EE++EE+ ++ +EE E E +E +G+ +E+EE E EE+ DG E E
Sbjct: 442 EESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEE 501
Query: 509 DERETKME 516
D E
Sbjct: 502 DAERRNSE 509
Score = 37.2 bits (86), Expect = 0.052
Identities = 21/55 (38%), Positives = 33/55 (60%), Gaps = 2/55 (3%)
Query: 466 SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQ 520
+SQE E E+ + + +E+EE EE++ +EE+ ED EE E E + EEE +
Sbjct: 435 ASQES--EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Score = 34.1 bits (78), Expect = 0.43
Identities = 17/51 (33%), Positives = 28/51 (54%)
Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
+E ++ S EE E E++E + + + +EE E E+EE + D E E E
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Score = 33.3 bits (76), Expect = 0.67
Identities = 21/75 (28%), Positives = 39/75 (52%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKME 516
E+ +S +EE E E++E + + +E++E EE+ EDEE++E+ + E E +
Sbjct: 430 ESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGS 489
Query: 517 EECQKFGHYNKDDPH 531
E G ++D
Sbjct: 490 SEGDGDGEEPEEDAE 504
Score = 31.4 bits (71), Expect = 2.7
Identities = 15/53 (28%), Positives = 27/53 (50%)
Query: 466 SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
SS + E + ++ E EE E+EE++E+ EE ++ E + E+E
Sbjct: 418 SSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDE 470
Score = 31.4 bits (71), Expect = 2.8
Identities = 16/66 (24%), Positives = 27/66 (40%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
++E EE+ + + E EG GD +E EE E+R + +E + +
Sbjct: 468 EDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSV 527
Query: 515 MEEECQ 520
E Q
Sbjct: 528 QPESPQ 533
Score = 30.7 bits (69), Expect = 4.2
Identities = 19/56 (33%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 477 QEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQKFGHYNKDDPHS 532
QE + + +EE EE+ E+EE++E +EE E E E + EE G + + S
Sbjct: 437 QESEEEESVEEEEEEE--EEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSS 490
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 37.4 bits (87), Expect = 0.038
Identities = 16/63 (25%), Positives = 28/63 (44%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
EE E + + E A +E + ++ + +EE+ + EE E E+E+ E K
Sbjct: 294 EELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKN 353
Query: 516 EEE 518
E
Sbjct: 354 ELA 356
Score = 31.3 bits (71), Expect = 3.3
Identities = 21/74 (28%), Positives = 37/74 (50%), Gaps = 5/74 (6%)
Query: 450 RICGLQE-ENIEEKFDDSSQEEYFEAEDQEGQ-GDSQEDEEMEEQRIEDEEDQEDGNEEI 507
+ GL++ E + E + +E + E+ EGQ + ED E + +E+E + EEI
Sbjct: 162 ELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEI 221
Query: 508 ED---ERETKMEEE 518
++ E E + E E
Sbjct: 222 QEEQEEEELEQEIE 235
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 36.9 bits (86), Expect = 0.039
Identities = 16/66 (24%), Positives = 34/66 (51%), Gaps = 13/66 (19%)
Query: 447 IKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEE 506
+P + G EE++EE + +AE++E + D E E+ +E E++ E+ E+
Sbjct: 267 GEPEVVGGDEEDLEELLE--------KAEEEEEEDDYSESEDEDE-----EDEDEEEEED 313
Query: 507 IEDERE 512
++ +
Sbjct: 314 DDEGDK 319
Score = 36.2 bits (84), Expect = 0.075
Identities = 13/46 (28%), Positives = 26/46 (56%), Gaps = 1/46 (2%)
Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
+ ED E + E+EE EE + ED+++ +E+ E+E + ++
Sbjct: 275 DEEDLEELLEKAEEEE-EEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 35.4 bits (82), Expect = 0.041
Identities = 13/37 (35%), Positives = 23/37 (62%), Gaps = 4/37 (10%)
Query: 478 EGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
+ +S +DEE EE +E+D+ED +E+ +E E+
Sbjct: 109 SEEDESDDDEEDEE----EEDDEEDDDEDESEEEESP 141
Score = 29.6 bits (67), Expect = 3.9
Identities = 10/37 (27%), Positives = 20/37 (54%)
Query: 466 SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQED 502
S Q ED+ + E+EE +E+ +++E +E+
Sbjct: 102 SGQHLVASEEDESDDDEEDEEEEDDEEDDDEDESEEE 138
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 37.1 bits (86), Expect = 0.043
Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 3/58 (5%)
Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE 517
EE DD +E+ E + + Q D+ ED E E+ E ++ E+ + + + EE
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDNNED---SEAGREESEGSDESEEDEAEATDGEGEE 274
Score = 36.0 bits (83), Expect = 0.11
Identities = 18/83 (21%), Positives = 36/83 (43%), Gaps = 2/83 (2%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
+E+ +++ D++ E ++ D E++E E E EE + D E ED +
Sbjct: 233 EEDGDDDQPDNNEDSE--AGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDE 290
Query: 515 MEEECQKFGHYNKDDPHSFQLPE 537
+E+ + G + +L E
Sbjct: 291 SDEDTETPGEDARPATPFTELME 313
Score = 34.8 bits (80), Expect = 0.22
Identities = 14/65 (21%), Positives = 28/65 (43%), Gaps = 3/65 (4%)
Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
+ EE ++ ++ + EE + +D + + + EE DE +ED E + E E
Sbjct: 217 DMAEETGDDGIEEDADEE--DGDDDQPDNNEDSEAGREESEGSDES-EEDEAEATDGEGE 273
Query: 513 TKMEE 517
+
Sbjct: 274 EGEMD 278
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 37.0 bits (86), Expect = 0.046
Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 13/75 (17%)
Query: 460 EEKFDDSSQEE--Y----FEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDG-------NEE 506
++K+ + + E Y F A + E ++DEE E++ E+EED+++G +EE
Sbjct: 358 KQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEE 417
Query: 507 IEDERETKMEEECQK 521
E++ E+
Sbjct: 418 FEEDDVESKYEDSDG 432
Score = 35.5 bits (82), Expect = 0.14
Identities = 18/56 (32%), Positives = 28/56 (50%), Gaps = 4/56 (7%)
Query: 453 GLQEENIEEKFDD----SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGN 504
L+ E+ + DD +EE E ED+ + +DEE EE +E + + DGN
Sbjct: 378 ALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGN 433
Score = 33.6 bits (77), Expect = 0.65
Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 3/54 (5%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
E ++ D+ E + E + +EDEE EE E++ED+ E +DE
Sbjct: 366 ETEQDYILDAFSALEIEDANTERDDEEEEDEEEEE---EEDEDEGPSKEHSDDE 416
Score = 32.0 bits (73), Expect = 1.7
Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 1/59 (1%)
Query: 459 IEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE 517
I + F E+ D E + D +E+EE +E +E +D E ED+ E+K E+
Sbjct: 372 ILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDD-EEFEEDDVESKYED 429
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell cycle
progression and biogenesis of 60S ribosomal subunits.
The Schistosoma mansoni Mak16 has been shown to target
protein transport to the nucleolus.
Length = 97
Score = 34.5 bits (79), Expect = 0.046
Identities = 16/45 (35%), Positives = 25/45 (55%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEED 499
+ EE ++ +EE E E+ EG+ + D+E E+ IED ED
Sbjct: 53 KALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLED 97
Score = 31.8 bits (72), Expect = 0.40
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 2/55 (3%)
Query: 458 NIEEKFDDSSQEEYFEAEDQE--GQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
+ S+ ++ EAE+ E + + +E+EE +E IE D E+ EEIED
Sbjct: 41 DEPYNISQSAFKKALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDL 95
Score = 30.6 bits (69), Expect = 0.86
Identities = 14/50 (28%), Positives = 30/50 (60%)
Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIED 509
+ F + + E E D+E + + +E++E E + + D+E+ E+ E++ED
Sbjct: 48 QSAFKKALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLED 97
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 36.9 bits (85), Expect = 0.053
Identities = 25/75 (33%), Positives = 34/75 (45%), Gaps = 8/75 (10%)
Query: 455 QEENIEEKFDDSSQE-EYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDG----NEEIED 509
Q+E D S E ED + ++ E+EE EE R E EE +E +E+ D
Sbjct: 81 QKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKND 140
Query: 510 ERETKMEEECQKFGH 524
R+ EECQK
Sbjct: 141 WRD---AEECQKEEK 152
Score = 32.7 bits (74), Expect = 0.86
Identities = 19/86 (22%), Positives = 41/86 (47%), Gaps = 6/86 (6%)
Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
G + E +EE+ + S+EE E +E +G ++ +++ + + E+ + +E E E+E+
Sbjct: 105 GAENETVEEEEKEESREE--REEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKP 162
Query: 513 TKMEEECQKFGHYNKDDPHSFQLPEN 538
+ E + H + EN
Sbjct: 163 KRGSLEENN----GEFMTHKLKHTEN 184
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 36.8 bits (85), Expect = 0.054
Identities = 16/61 (26%), Positives = 27/61 (44%), Gaps = 1/61 (1%)
Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEEC 519
EE DD+ E+ + +D + + QE++ E E +E + E E+ EE
Sbjct: 201 EEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGE-GQEGSAPQESEATDRESESGEEEMV 259
Query: 520 Q 520
Q
Sbjct: 260 Q 260
Score = 36.1 bits (83), Expect = 0.098
Identities = 9/60 (15%), Positives = 24/60 (40%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
+ E E+ + +E ++ E E E E + + ++ ++ +++ E E
Sbjct: 222 ENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDDSETPGEGA 281
Score = 32.6 bits (74), Expect = 1.1
Identities = 14/89 (15%), Positives = 31/89 (34%), Gaps = 3/89 (3%)
Query: 455 QEENIEEKFDD---SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
E EE DD + E+ + E + + +E E E E +E+ + +D+
Sbjct: 207 TESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDL 266
Query: 512 ETKMEEECQKFGHYNKDDPHSFQLPENFQ 540
+ +++ + G +
Sbjct: 267 PDESDDDSETPGEGARPARPFTSTGGEPD 295
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 36.8 bits (85), Expect = 0.054
Identities = 13/58 (22%), Positives = 23/58 (39%), Gaps = 1/58 (1%)
Query: 464 DDSSQEEYFEAEDQEGQGDSQE-DEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQ 520
+ Y + + + QE D EE E+ ED+ + +E + +E E Q
Sbjct: 318 LADFGDYYEDIFEVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQ 375
>gnl|CDD|219236 pfam06949, DUF1292, Protein of unknown function (DUF1292). This
family consists of several hypothetical bacterial
proteins of around 90 residues in length. The function
of this family is unknown.
Length = 74
Score = 33.3 bits (77), Expect = 0.057
Identities = 23/69 (33%), Positives = 34/69 (49%), Gaps = 9/69 (13%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQE------GQGDSQEDEEMEEQRIEDEEDQEDGNEEIED 509
EE E +F+ + FE E +E + ED E+ +IE++ED E+ IED
Sbjct: 1 EEGNEVEFE---VLDTFEVEGKEYILLLPVGEEEDEDIEVYIFKIEEDEDGEEELLPIED 57
Query: 510 ERETKMEEE 518
+ E M EE
Sbjct: 58 DEEWDMVEE 66
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 36.9 bits (86), Expect = 0.058
Identities = 17/83 (20%), Positives = 40/83 (48%), Gaps = 7/83 (8%)
Query: 437 DNVIRRNEIHIKPRICGLQEENIEEKFDD-SSQEEYFEAEDQEGQGDSQEDEEMEEQRIE 495
DNV +++ I + +IEE+ D ++ + +++ D + DEE EE
Sbjct: 90 DNVFGSDQVVI------FDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKS 143
Query: 496 DEEDQEDGNEEIEDERETKMEEE 518
++++ +D +++ +D + E
Sbjct: 144 EDDEDDDDDDDDDDIATRERSLE 166
>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein. This family includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 116
Score = 34.5 bits (79), Expect = 0.059
Identities = 19/60 (31%), Positives = 28/60 (46%), Gaps = 3/60 (5%)
Query: 459 IEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
I F + Y++ D++ D+ E ME Q+ EE + +EDERE EEE
Sbjct: 49 IWAIFGKGRKRSYYDRYDEDDALDNMEATFMEIQK---EERRSARMARLEDERELAREEE 105
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 35.5 bits (82), Expect = 0.065
Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 3/62 (4%)
Query: 460 EEKFDDSSQEEYFEAEDQEGQG---DSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKME 516
E D+ +E E E+ + D +DE+ ++ I + ++D +E ED+ E + +
Sbjct: 112 EGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIEDDEVDEDEDDDEDEED 171
Query: 517 EE 518
EE
Sbjct: 172 EE 173
Score = 30.9 bits (70), Expect = 1.8
Identities = 15/70 (21%), Positives = 36/70 (51%)
Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEEC 519
++K D+ +E + D + + +ED+E++ E++++ ED +E +E E +E
Sbjct: 103 KDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIEDDEVDED 162
Query: 520 QKFGHYNKDD 529
+ +D+
Sbjct: 163 EDDDEDEEDE 172
Score = 30.1 bits (68), Expect = 3.2
Identities = 14/54 (25%), Positives = 30/54 (55%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
E+ EE+ DD E +D++ D + +E+ ++++ED ++ E+ ED+
Sbjct: 122 EDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIEDDEVDEDEDDDEDEEDEEDK 175
Score = 30.1 bits (68), Expect = 3.2
Identities = 16/67 (23%), Positives = 37/67 (55%), Gaps = 3/67 (4%)
Query: 454 LQEENI--EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
+ EE EK D ++++ +A ++ + + DE+ EE+ ++ + +D N++ EDE
Sbjct: 87 IDEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDD-EDED 145
Query: 512 ETKMEEE 518
+ ++ E
Sbjct: 146 DDEIVEI 152
Score = 29.0 bits (65), Expect = 8.8
Identities = 18/61 (29%), Positives = 32/61 (52%), Gaps = 5/61 (8%)
Query: 456 EENIEEKFD---DSSQEEYFEAED-QEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
EE E++ D D +EE E + + D ED++ E I E+D+ D +E+ +++
Sbjct: 111 EEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDD-EIVEILIEDDEVDEDEDDDEDE 169
Query: 512 E 512
E
Sbjct: 170 E 170
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 35.8 bits (83), Expect = 0.073
Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 5/68 (7%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEM----EEQRIEDEEDQEDGNEEIEDERE 512
EN+ ++ S F+ E ++G ++ + EE E D E E+ EEI +E E
Sbjct: 115 ENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEEE 174
Query: 513 TKME-EEC 519
E E+C
Sbjct: 175 EVKEPEDC 182
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 36.0 bits (83), Expect = 0.077
Identities = 19/62 (30%), Positives = 33/62 (53%), Gaps = 9/62 (14%)
Query: 456 EENIEEKFDDSSQEEY----FEAEDQEGQ-----GDSQEDEEMEEQRIEDEEDQEDGNEE 506
++ IE ++S + Y ED EG+ + E+EE+EE ++EE+ E+ +EE
Sbjct: 42 DDAIELAEPETSDDPYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEE 101
Query: 507 IE 508
E
Sbjct: 102 FE 103
Score = 33.7 bits (77), Expect = 0.46
Identities = 20/62 (32%), Positives = 35/62 (56%), Gaps = 4/62 (6%)
Query: 464 DDS-SQEEYFEAEDQEGQGD-SQEDEEMEEQ-RIEDEEDQEDGNEEIEDERETKMEEECQ 520
DD+ E ++D G D ED+E + RI ++ ++E+ EE E++ E + EEE +
Sbjct: 42 DDAIELAEPETSDDPYGNPDPFGEDDEGRIEVRISEDGEEEE-VEEGEEDEEEEGEEESE 100
Query: 521 KF 522
+F
Sbjct: 101 EF 102
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 35.8 bits (83), Expect = 0.099
Identities = 16/64 (25%), Positives = 27/64 (42%), Gaps = 5/64 (7%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEG-----QGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
E E+K + + E E EG D +E+ + E E + E N+ + +
Sbjct: 217 ERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPK 276
Query: 511 RETK 514
R+TK
Sbjct: 277 RKTK 280
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 36.2 bits (83), Expect = 0.11
Identities = 22/121 (18%), Positives = 48/121 (39%), Gaps = 21/121 (17%)
Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM-----------EEECQK 521
AED+E + DEE I+D++ +E+ NEE+ + E+ + EEE ++
Sbjct: 392 WAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQ 451
Query: 522 FGHYNKDDPHSFQLPENFQMYPQFMYHLRRSQF----------LQVFNNSPDETSYYRTL 571
+ + + P+ ++ P R ++ + P +++L
Sbjct: 452 LREFRDMEKEDREFPDEAELQPSESAIERYKEYRGLRNLYTCSWRSDEKDPSFPEEWKSL 511
Query: 572 L 572
+
Sbjct: 512 V 512
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 34.5 bits (80), Expect = 0.12
Identities = 20/123 (16%), Positives = 43/123 (34%), Gaps = 30/123 (24%)
Query: 458 NIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER-----E 512
+ K +EE E + E+ +++E D+E+ + +E+ R E
Sbjct: 12 ILPPKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAE 71
Query: 513 TKMEEECQKFGHYNKDDPHSFQLPENFQMYPQFMYHLRRSQFLQVFNNSPDETSYYRTLL 572
K E KFG +Y + + +++ + +T ++
Sbjct: 72 MKALAEKSKFGE---------------------VYEISKPDYVREVTEASKDT----WVV 106
Query: 573 IHL 575
+HL
Sbjct: 107 VHL 109
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 33.8 bits (77), Expect = 0.14
Identities = 11/66 (16%), Positives = 30/66 (45%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
E +++ + E D+ + D+ ++ + EE+ E ++ D + +E + +
Sbjct: 65 AATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEE 124
Query: 515 MEEECQ 520
+EE +
Sbjct: 125 EDEEAE 130
Score = 33.0 bits (75), Expect = 0.25
Identities = 13/64 (20%), Positives = 26/64 (40%), Gaps = 1/64 (1%)
Query: 456 EENIEEKFDDSSQEEYFEA-EDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
+ + + E +A ED + ++ +E +E+ DE E +E + E
Sbjct: 63 AAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEAD 122
Query: 515 MEEE 518
EE+
Sbjct: 123 EEED 126
Score = 31.8 bits (72), Expect = 0.60
Identities = 16/67 (23%), Positives = 23/67 (34%), Gaps = 11/67 (16%)
Query: 463 FDDSSQEE-YF---EAEDQE-------GQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
F DS E+ Y EA D E +G + E E E D + + E
Sbjct: 32 FVDSKCEKNYDLGREARDLEWTEAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEA 91
Query: 512 ETKMEEE 518
+ E +
Sbjct: 92 DAADEAD 98
>gnl|CDD|201354 pfam00626, Gelsolin, Gelsolin repeat.
Length = 76
Score = 32.3 bits (74), Expect = 0.15
Identities = 10/33 (30%), Positives = 16/33 (48%)
Query: 626 PVPVLLDTSSIKPDCILLMDTFFQLLIFHGEEP 658
P PV L S+ L+DT F + ++ G+
Sbjct: 5 PPPVPLSQESLNSGDCYLLDTGFTIFLWVGKGS 37
Score = 31.1 bits (71), Expect = 0.35
Identities = 11/34 (32%), Positives = 18/34 (52%)
Query: 913 PVPVLLDTSSIKPDCILLMDTFFQLLIFHGEVSN 946
P PV L S+ L+DT F + ++ G+ S+
Sbjct: 5 PPPVPLSQESLNSGDCYLLDTGFTIFLWVGKGSS 38
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 35.5 bits (82), Expect = 0.15
Identities = 22/70 (31%), Positives = 33/70 (47%), Gaps = 3/70 (4%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE---R 511
+EE + + ++ EE EA + DS +DE E EDEED E+ + E R
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60
Query: 512 ETKMEEECQK 521
E +E +K
Sbjct: 61 EKARLKELKK 70
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 35.8 bits (82), Expect = 0.16
Identities = 20/72 (27%), Positives = 41/72 (56%), Gaps = 3/72 (4%)
Query: 454 LQEENI--EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQR-IEDEEDQEDGNEEIEDE 510
L EE++ + F +E + +D + + D +E+EE EE + +DE+++++G E+ E
Sbjct: 140 LAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYE 199
Query: 511 RETKMEEECQKF 522
+ + +C KF
Sbjct: 200 KSEVDKTDCFKF 211
Score = 33.5 bits (76), Expect = 0.67
Identities = 14/71 (19%), Positives = 33/71 (46%), Gaps = 8/71 (11%)
Query: 475 EDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQK--------FGHYN 526
ED + + D++ E++ +D+++++D EE E+E ++E ++ +
Sbjct: 143 EDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSE 202
Query: 527 KDDPHSFQLPE 537
D F+ E
Sbjct: 203 VDKTDCFKFIE 213
Score = 30.8 bits (69), Expect = 4.4
Identities = 23/107 (21%), Positives = 43/107 (40%), Gaps = 17/107 (15%)
Query: 441 RRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQE-----------DEEM 489
+ + IK R C +E + + ++ + GD EE
Sbjct: 85 KDTSVKIKQRTCPFHKEGSSFEMGSKNITCFYPIHGKAEIGDLDMIIIKRRRARHLAEED 144
Query: 490 EEQR----IEDEEDQE--DGNEEIEDERETKMEEECQKFGHYNKDDP 530
R I+D+++ E D ++E +DE E + EEE + F +++D
Sbjct: 145 MSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 35.1 bits (81), Expect = 0.17
Identities = 21/87 (24%), Positives = 33/87 (37%), Gaps = 11/87 (12%)
Query: 435 ENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRI 494
EN + E+ + R+ L+E + D EA + + +E EE+
Sbjct: 138 ENRTADSKREMEVLERLEELKELQ--SRRADVDVNSMLEALFRREK------KEEEEEEE 189
Query: 495 EDEEDQED---GNEEIEDERETKMEEE 518
EDE + G E ED R E+
Sbjct: 190 EDEALIKSLSFGPETEEDRRRADDEDS 216
>gnl|CDD|178439 PLN02847, PLN02847, triacylglycerol lipase.
Length = 633
Score = 35.2 bits (81), Expect = 0.17
Identities = 26/89 (29%), Positives = 35/89 (39%), Gaps = 9/89 (10%)
Query: 464 DDSSQEEYFEAEDQEGQGDSQED---------EEMEEQRIEDEEDQEDGNEEIEDERETK 514
DD +EE +ED+ +E+ E E QR E E D + EE +E
Sbjct: 471 DDEEEEEPLLSEDRVITSSVEEEVTEGELWYELEKELQRQETEVDAQAQEEEAAAAKEIT 530
Query: 515 MEEECQKFGHYNKDDPHSFQLPENFQMYP 543
EE +K S L E+ Q YP
Sbjct: 531 EEENVLAKAVESKTSITSSDLSESQQFYP 559
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 35.4 bits (82), Expect = 0.17
Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 1/76 (1%)
Query: 462 KFDDSSQ-EEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQ 520
K ++S+ EE E D E + S+ +EQ EED +EE ED +T+ ++ +
Sbjct: 225 KLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLE 284
Query: 521 KFGHYNKDDPHSFQLP 536
++ F+ P
Sbjct: 285 ILKEKKDEELFWFEKP 300
Score = 30.0 bits (68), Expect = 7.1
Identities = 14/61 (22%), Positives = 26/61 (42%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
EE EE + + QE +G ++ED + + ED + ED ++ + +
Sbjct: 233 EEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDE 292
Query: 516 E 516
E
Sbjct: 293 E 293
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 34.7 bits (79), Expect = 0.18
Identities = 21/53 (39%), Positives = 29/53 (54%), Gaps = 9/53 (16%)
Query: 461 EKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQE-DGNEEIEDERE 512
+ DSS+E G GDS E+E EE E+E +++ DGNE+ E E E
Sbjct: 45 QSGSDSSEEN--------GDGDSSEEEGEEETSNEEENNEDSDGNEDEEAEAE 89
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 35.4 bits (81), Expect = 0.19
Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 13/91 (14%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQED-----EEMEEQRIEDEEDQEDGNEEIEDE 510
EE++ E + S E+ A + E S+ED E+ + Q EDEE+ D + I+DE
Sbjct: 3913 EEDLLET-EQKSNEQ--SAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSD-DVGIDDE 3968
Query: 511 RETKMEEECQKFGHYNKDDPHSFQLPENFQM 541
+ ++E + + LPE+ ++
Sbjct: 3969 IQPDIQENNSQ----PPPENEDLDLPEDLKL 3995
Score = 33.0 bits (75), Expect = 0.88
Identities = 17/75 (22%), Positives = 40/75 (53%), Gaps = 11/75 (14%)
Query: 455 QEENIEEKFDDSSQEEYFE------AEDQEGQGDSQEDEEMEEQRIEDEE---DQEDGNE 505
++ ++EK D S++ E A++ + + D+++DE M+++ +E D++ +
Sbjct: 3991 EDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQD 4050
Query: 506 EIED--ERETKMEEE 518
+ D E + KM E+
Sbjct: 4051 DFSDLAEDDEKMNED 4065
Score = 32.7 bits (74), Expect = 1.4
Identities = 20/80 (25%), Positives = 38/80 (47%), Gaps = 2/80 (2%)
Query: 454 LQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEE-MEEQRIEDEEDQEDGNEEIEDERE 512
+Q+E+ E+ + ++ + + D + +E+ EE E+EE EDG + ++E E
Sbjct: 4031 MQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDG-VKSDEELE 4089
Query: 513 TKMEEECQKFGHYNKDDPHS 532
E Q ++ K D S
Sbjct: 4090 QGEVPEDQAIDNHPKMDAKS 4109
Score = 32.3 bits (73), Expect = 1.8
Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 9/82 (10%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDE--EMEEQ---RIEDEEDQEDGNEEIEDE 510
E+ +D EE ED Q D E E E+E +D E+ N + E+E
Sbjct: 3825 EDITNTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEE 3884
Query: 511 RETKMEEECQKFGHYNKDDPHS 532
+ ++EE +D +S
Sbjct: 3885 NQ-DLDEEVNDI---PEDLSNS 3902
Score = 32.3 bits (73), Expect = 1.9
Identities = 23/81 (28%), Positives = 37/81 (45%), Gaps = 14/81 (17%)
Query: 455 QEENIEEKFDDSSQEEYFEAE---DQEGQGDSQEDEEMEEQRIEDEEDQEDGNEE--IED 509
+EE EK + E+ E D++ Q D + ++ E +DE+ EDG EE E+
Sbjct: 4020 KEEADAEKDEPMQDEDPLEENNTLDEDIQQD--DFSDLAE---DDEKMNEDGFEENVQEN 4074
Query: 510 ERET----KMEEECQKFGHYN 526
E T K +EE ++
Sbjct: 4075 EESTEDGVKSDEELEQGEVPE 4095
Score = 31.5 bits (71), Expect = 2.7
Identities = 19/98 (19%), Positives = 37/98 (37%), Gaps = 31/98 (31%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEG----------------------QGDSQEDEEMEEQ 492
++E+ EE DD ++ + + QE +GD +D ++E+
Sbjct: 3952 EKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDM 4011
Query: 493 RIE---------DEEDQEDGNEEIEDERETKMEEECQK 521
+E D E E +E E ++E+ Q+
Sbjct: 4012 DMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQ 4049
Score = 31.5 bits (71), Expect = 2.8
Identities = 14/86 (16%), Positives = 33/86 (38%), Gaps = 11/86 (12%)
Query: 441 RRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEA---EDQEGQGDSQEDEEM---EEQRI 494
+ N IH + + D S+ + + ED + G++++D+ ++
Sbjct: 4181 QANRIH-EWEDLTESQSQAF----DDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRDESA 4235
Query: 495 EDEEDQEDGNEEIEDERETKMEEECQ 520
D + EDE + +++ Q
Sbjct: 4236 NQNPDSMNSTNIAEDEADEVGDKQLQ 4261
Score = 30.4 bits (68), Expect = 6.1
Identities = 15/56 (26%), Positives = 23/56 (41%), Gaps = 2/56 (3%)
Query: 465 DSSQEEYFEAEDQEGQG--DSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
D QE+ E + G G E++ ED++ +E NEE + E E
Sbjct: 3801 DLPQEKSNSGELESGTGLGSGVGAEDITNTLNEDDDLEELANEEDTANQSDLDESE 3856
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 34.8 bits (80), Expect = 0.19
Identities = 9/60 (15%), Positives = 32/60 (53%), Gaps = 2/60 (3%)
Query: 459 IEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
+K D +++ E + + + +E++++ EE+++ E+ ++++E + ++E
Sbjct: 374 YNKKLQDYTKKL--GEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431
Score = 32.1 bits (73), Expect = 1.7
Identities = 17/60 (28%), Positives = 33/60 (55%), Gaps = 7/60 (11%)
Query: 454 LQE--ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
LQ+ + + E D++ E EA+ +E + +E+E ++Q+ E+ ED + +DER
Sbjct: 378 LQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENE--KKQK---EQADEDKEKRQKDER 432
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 34.7 bits (80), Expect = 0.23
Identities = 17/60 (28%), Positives = 25/60 (41%)
Query: 459 IEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
+ D EE E ED+E + S E EE E + E+E Q + E + + E
Sbjct: 358 RRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSE 417
Score = 32.0 bits (73), Expect = 1.6
Identities = 11/63 (17%), Positives = 29/63 (46%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
E ++E D+ ++ E E++EG+ +E + E + + E + ++E+
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESAS 426
Query: 516 EEE 518
+ +
Sbjct: 427 DSD 429
Score = 31.6 bits (72), Expect = 2.0
Identities = 17/61 (27%), Positives = 22/61 (36%), Gaps = 6/61 (9%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQG---DSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
E+ EE+ D +EE E ED E +G E D E + D E D
Sbjct: 374 EDEEEEQRSDEHEEE--EGEDSEEEGSQSREDGSSESSSDVGSDSESKAD-KESASDSDS 430
Query: 513 T 513
Sbjct: 431 E 431
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 35.0 bits (80), Expect = 0.26
Identities = 18/71 (25%), Positives = 29/71 (40%), Gaps = 4/71 (5%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDE-EDQEDGNEEIEDERET 513
EE + E S E ED+E + E++ E E++ D+EDG + E E +
Sbjct: 934 SEEEVSEYEASSDDESDETDEDEES---DESSEDLSEDESENDSSDEEDGEDWDELESKA 990
Query: 514 KMEEECQKFGH 524
+ K
Sbjct: 991 AYDSRPGKRRK 1001
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 34.4 bits (80), Expect = 0.28
Identities = 21/84 (25%), Positives = 42/84 (50%), Gaps = 10/84 (11%)
Query: 434 TENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQR 493
E + RRNE+ + +EEN++ K E E ++E ++ E+ EQ+
Sbjct: 73 FEKELRERRNELQKLEKRLLQKEENLDRK------LELLEKREEE----LEKKEKELEQK 122
Query: 494 IEDEEDQEDGNEEIEDERETKMEE 517
++ E +E+ EE+ +E+ ++E
Sbjct: 123 QQELEKKEEELEELIEEQLQELER 146
>gnl|CDD|240272 PTZ00109, PTZ00109, DNA gyrase subunit b; Provisional.
Length = 903
Score = 34.5 bits (79), Expect = 0.29
Identities = 21/95 (22%), Positives = 42/95 (44%), Gaps = 9/95 (9%)
Query: 425 PLLYSITYI-TENDNVIRRNE---IHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQ 480
P LY IT + NV +N I+ +E N+ K + E +
Sbjct: 722 PPLYRITNNRMKQFNVSTKNSKKYIYTWSD----EELNVLIKLLNKDYSSKETTRSVEEK 777
Query: 481 GDSQ-EDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
G++ D E E+++++++ +E+ +E+E + E
Sbjct: 778 GNAPDLDNEYEDEKLDNKNMRENNVDEVELKTELG 812
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 34.0 bits (78), Expect = 0.32
Identities = 17/62 (27%), Positives = 28/62 (45%), Gaps = 3/62 (4%)
Query: 457 ENIEEKFDD--SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
+ EE D+ S+ E E ED + D +D+ EE+ + ED + E+ +
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDP-KEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGE 265
Query: 515 ME 516
ME
Sbjct: 266 ME 267
Score = 32.5 bits (74), Expect = 0.81
Identities = 15/62 (24%), Positives = 25/62 (40%), Gaps = 6/62 (9%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQ------EDEEMEEQRIEDEEDQEDGNEEIE 508
E+N +E +++ E++ G DS EEME +E E D + +
Sbjct: 221 SEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPDSD 280
Query: 509 DE 510
D
Sbjct: 281 DA 282
Score = 31.7 bits (72), Expect = 1.4
Identities = 6/46 (13%), Positives = 21/46 (45%)
Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
A+ ++ + + E+ ++ + E+EE + + + ++ E
Sbjct: 218 SADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMES 263
Score = 31.7 bits (72), Expect = 1.7
Identities = 15/80 (18%), Positives = 30/80 (37%), Gaps = 6/80 (7%)
Query: 447 IKPRICGLQEENIEEK--FDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIED---EEDQE 501
+ L ++++ F ++ + E GD E + E+ ED E++ +
Sbjct: 178 AGKDLDRL-AACVDDQQAFARVVRDMLSSMDMAEELGDEPESADSEDNEDEDDPKEDEDD 236
Query: 502 DGNEEIEDERETKMEEECQK 521
D EE E + E+
Sbjct: 237 DQGEEEESGSSDSLSEDSDA 256
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 33.9 bits (77), Expect = 0.35
Identities = 20/62 (32%), Positives = 28/62 (45%), Gaps = 5/62 (8%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
+ + E + QEE AE + + S +E+ E E EED E NEE E E
Sbjct: 286 NKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGE---EKEEDDE--NEENERHTELLA 340
Query: 516 EE 517
+E
Sbjct: 341 DE 342
Score = 33.1 bits (75), Expect = 0.63
Identities = 16/65 (24%), Positives = 31/65 (47%), Gaps = 7/65 (10%)
Query: 461 EKFDDSSQEEYFEAEDQEG-------QGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
+K ++EE E +E + + E +EQ+ E+ E+ E EE++ +R
Sbjct: 258 KKEKQGAEEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPD 317
Query: 514 KMEEE 518
++ EE
Sbjct: 318 EIGEE 322
Score = 30.1 bits (67), Expect = 7.1
Identities = 14/64 (21%), Positives = 29/64 (45%), Gaps = 2/64 (3%)
Query: 460 EEKFDDSSQEEY--FEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE 517
EE + S+E+ AE + + + E+ +E+ E +E+ + DE + EE
Sbjct: 266 EEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEE 325
Query: 518 ECQK 521
+ +
Sbjct: 326 DDEN 329
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 33.2 bits (76), Expect = 0.40
Identities = 19/65 (29%), Positives = 34/65 (52%), Gaps = 1/65 (1%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEME-EQRIEDEEDQEDGNEEIEDERET 513
QE E+ + +E + ++ +EDEE E E EDEE ++ +E E++ E+
Sbjct: 57 QETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTES 116
Query: 514 KMEEE 518
+E+E
Sbjct: 117 NVEKE 121
Score = 32.0 bits (73), Expect = 1.0
Identities = 13/64 (20%), Positives = 31/64 (48%), Gaps = 1/64 (1%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
+ +E+ S ++ AE +E + + +E E++ + + ++ED E E+E E +
Sbjct: 42 DQAAADEQEAKKSDDQE-TAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDE 100
Query: 515 MEEE 518
+
Sbjct: 101 ESSD 104
Score = 31.2 bits (71), Expect = 2.0
Identities = 15/64 (23%), Positives = 29/64 (45%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
QE + + + EE E E + + +ED+ E+ E+ E++ + +E + K
Sbjct: 49 QEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEK 108
Query: 515 MEEE 518
EE
Sbjct: 109 ETEE 112
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 31.3 bits (71), Expect = 0.43
Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 9/77 (11%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDS---QEDEEMEEQRIEDEEDQED------GNEE 506
EE EE + +E ++ E + D + D E++ ED E+ + G+E
Sbjct: 14 EEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEELAEYLRKRYGDEA 73
Query: 507 IEDERETKMEEECQKFG 523
D ++ +E Q+
Sbjct: 74 DADADDSDSDEVPQRLL 90
Score = 31.3 bits (71), Expect = 0.47
Identities = 21/63 (33%), Positives = 31/63 (49%), Gaps = 9/63 (14%)
Query: 464 DDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEE-----DQEDGNEEIEDERETKMEEE 518
DD +EE E E+ + + S EDE ++E ED+ D+ EE ED E + E
Sbjct: 9 DDEEEEE--EEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEE--LAEY 64
Query: 519 CQK 521
+K
Sbjct: 65 LRK 67
>gnl|CDD|215628 PLN03196, PLN03196, MOC1-like protein; Provisional.
Length = 487
Score = 33.5 bits (77), Expect = 0.49
Identities = 17/79 (21%), Positives = 32/79 (40%), Gaps = 11/79 (13%)
Query: 444 EIHIKPRICGLQEENIE-----------EKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQ 492
E IKPR + ++ I+ +KF+ ++ E E+ E EM
Sbjct: 405 ESRIKPRYERVAKKGIKCSLAWFLNCSDDKFEQRMSGDFIEGEEMEPLFSMGGKLEMPGS 464
Query: 493 RIEDEEDQEDGNEEIEDER 511
+E+ +D ++E+ R
Sbjct: 465 ESVSDEEDDDDDDEVLYRR 483
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 33.9 bits (77), Expect = 0.51
Identities = 12/64 (18%), Positives = 30/64 (46%), Gaps = 2/64 (3%)
Query: 468 QEEYFEAEDQEGQGDSQEDEEMEEQRIEDE--EDQEDGNEEIEDERETKMEEECQKFGHY 525
+ E+ + + E+ R E E+ +D +EE++ ++E ++E + K+
Sbjct: 459 RASLMTMEETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSET 518
Query: 526 NKDD 529
++ D
Sbjct: 519 SEAD 522
Score = 31.6 bits (71), Expect = 2.3
Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 17/79 (21%)
Query: 461 EKFDDSSQEE---YFEAEDQE---GQGDSQEDEEMEEQR-----------IEDEEDQEDG 503
E+ S+E+ FE E G+ D +DEE++ Q+ E E +D
Sbjct: 466 EETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDV 525
Query: 504 NEEIEDERETKMEEECQKF 522
N+ +R+ EEE +K
Sbjct: 526 NKSKNKKRKVDEEEEEKKL 544
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 33.3 bits (76), Expect = 0.54
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 11/66 (16%)
Query: 464 DDSSQEEYFEAEDQEG---QGDSQEDEEMEEQRIEDEED-QEDGNE-------EIEDERE 512
+ S EEY+E E + Q D + +++ ++ +DEED +DG+ E E+E E
Sbjct: 214 KNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEE 273
Query: 513 TKMEEE 518
T+ EEE
Sbjct: 274 TEEEEE 279
Score = 29.4 bits (66), Expect = 8.1
Identities = 12/57 (21%), Positives = 29/57 (50%), Gaps = 4/57 (7%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
+ + + + E+ D Q+ + +EE+ ++++E++E+ EE E+E E
Sbjct: 232 SQDDGPKGSDVKNKKSDDEED----DDQDGDYVEEKELKEDEEEEETEEEEEEEDED 284
Score = 29.4 bits (66), Expect = 8.3
Identities = 14/35 (40%), Positives = 23/35 (65%), Gaps = 4/35 (11%)
Query: 475 EDQEGQGDSQEDEEM----EEQRIEDEEDQEDGNE 505
ED + GD E++E+ EE+ E+EE++ED +E
Sbjct: 251 EDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 33.6 bits (77), Expect = 0.54
Identities = 11/48 (22%), Positives = 22/48 (45%)
Query: 474 AEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQK 521
E ++ + Q +E ++Q E E ++ E + + + K EE K
Sbjct: 78 EEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAK 125
Score = 33.2 bits (76), Expect = 0.63
Identities = 12/52 (23%), Positives = 25/52 (48%), Gaps = 5/52 (9%)
Query: 475 EDQEGQGDSQEDEEMEEQRIEDEEDQEDGNE-----EIEDERETKMEEECQK 521
+ Q+ ++E + +EQ+ +E Q+ E ++E ER E++ Q
Sbjct: 69 QQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQA 120
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 31.6 bits (72), Expect = 0.54
Identities = 21/83 (25%), Positives = 37/83 (44%), Gaps = 11/83 (13%)
Query: 433 ITENDNV--------IRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQ 484
I E +N+ + +E +K + L+ NIEE ++ A +
Sbjct: 17 INE-ENLKAVLEAAGVEVDEARVKALVAALEGVNIEEAIKKAAAAPVAAAAAAAAAAAAA 75
Query: 485 EDEEMEEQRIEDEEDQEDGNEEI 507
E+EE EE+ E+EE++ + EE
Sbjct: 76 EEEEEEEEEEEEEEEESE--EEA 96
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this family
(Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 31.1 bits (71), Expect = 0.56
Identities = 11/44 (25%), Positives = 26/44 (59%), Gaps = 4/44 (9%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEED 499
+E + + F D +Y+E ED+E + +DE++ + +++E+
Sbjct: 60 DEEMAKLFGD----DYYEEEDEEEKPTWDDDEDIGDLVPDEDEE 99
Score = 29.5 bits (67), Expect = 2.3
Identities = 19/75 (25%), Positives = 35/75 (46%), Gaps = 11/75 (14%)
Query: 441 RRNEI-----HIKPRICGLQEENI-EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRI 494
+R EI IK ++ GL+ ++ EE D F+ E + + ++ E+
Sbjct: 22 KREEIEEKLEKIK-KVAGLRGADLSEEDLADGD----FDPEKWDEEMAKLFGDDYYEEED 76
Query: 495 EDEEDQEDGNEEIED 509
E+E+ D +E+I D
Sbjct: 77 EEEKPTWDDDEDIGD 91
>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6. The VP6 protein a
minor protein in the core of the virion is probably the
viral helicase.
Length = 322
Score = 33.1 bits (75), Expect = 0.58
Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 10/64 (15%)
Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEE--DQEDGNEEIEDE 510
L++ I+ D E E + +DE+ +EQRI D E ++ G +E E +
Sbjct: 14 ELKQRQIQINLVD-------WFESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKE-ESD 65
Query: 511 RETK 514
+ET+
Sbjct: 66 KETE 69
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 33.7 bits (77), Expect = 0.59
Identities = 13/60 (21%), Positives = 26/60 (43%)
Query: 458 NIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE 517
+E Q E ED++ ++ +DE+ E + E + E E+ E++ + E
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAE 692
Score = 33.3 bits (76), Expect = 0.70
Identities = 13/61 (21%), Positives = 26/61 (42%), Gaps = 5/61 (8%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
EE E+ +++ +E ++ + +S+ D E E EDE++ + E E
Sbjct: 648 EEEDEDDLEETDDDEDECEAIEDSESESESDGEDGE---EDEQEDDA--EANEGVVPIDK 702
Query: 516 E 516
Sbjct: 703 A 703
Score = 32.2 bits (73), Expect = 1.7
Identities = 17/75 (22%), Positives = 32/75 (42%), Gaps = 1/75 (1%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEME-EQRIEDEEDQEDGNEEIEDERETK 514
E+++EE DD + E E + E + D ++ EE E E E E ++ +
Sbjct: 652 EDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKV 711
Query: 515 MEEECQKFGHYNKDD 529
+ G ++D+
Sbjct: 712 LNLPDALDGGDSEDE 726
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 30.5 bits (69), Expect = 0.65
Identities = 9/30 (30%), Positives = 19/30 (63%)
Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQED 502
E ED + D +D++ +++ +D++D ED
Sbjct: 49 EPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78
>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
Length = 111
Score = 31.4 bits (71), Expect = 0.65
Identities = 12/60 (20%), Positives = 27/60 (45%), Gaps = 1/60 (1%)
Query: 439 VIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEE 498
+I+ E I I ++ E E + ++ + +A+ E + + + + E R E E+
Sbjct: 14 IIKEKEESIDKEIANIKNEQ-ENEIKEAKSKYEEKAKKTEEESLNMYNAALMEARKEAEK 72
>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
Reviewed.
Length = 129
Score = 31.5 bits (72), Expect = 0.68
Identities = 11/52 (21%), Positives = 31/52 (59%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEE 506
+E E+ D ++++ + +D D D++ +E+ +++E+D+++ +EE
Sbjct: 78 EEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129
Score = 30.0 bits (68), Expect = 2.3
Identities = 9/57 (15%), Positives = 23/57 (40%)
Query: 462 KFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
D+ +E E + + D + + + D + E++++E + E+E
Sbjct: 72 PVDEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDE 128
Score = 28.8 bits (65), Expect = 6.3
Identities = 14/59 (23%), Positives = 31/59 (52%), Gaps = 8/59 (13%)
Query: 465 DSSQEEYFEAEDQE----GQGDSQEDEEMEEQRIEDEE-DQEDGNEEIEDERETKMEEE 518
D EE E++ + +D+ +++ + D++ D+ED +EE +++ E +EE
Sbjct: 74 DEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEE---DEE 129
>gnl|CDD|241092 cd12648, RRM3_UHM_PUF60, RNA recognition motif 3 in UHM domain of
poly(U)-binding-splicing factor PUF60 and similar
proteins. This subgroup corresponds to the RRM3 of
PUF60, also termed FUSE-binding protein-interacting
repressor (FBP-interacting repressor or FIR), or
Ro-binding protein 1 (RoBP1), or Siah-binding protein 1
(Siah-BP1), an essential splicing factor that functions
as a poly-U RNA-binding protein required to reconstitute
splicing in depleted nuclear extracts. Its function is
enhanced through interaction with U2 auxiliary factor
U2AF65. PUF60 also controls human c-myc gene expression
by binding and inhibiting the transcription factor far
upstream sequence element (FUSE)-binding-protein (FBP),
an activator of c-myc promoters. PUF60 contains two
central RNA recognition motifs (RRMs), also termed RBDs
(RNA binding domains) or RNPs (ribonucleoprotein
domains), and a C-terminal U2AF (U2 auxiliary factor)
homology motifs (UHM) that harbors another RRM and binds
to tryptophan-containing linear peptide motifs (UHM
ligand motifs, ULMs) in several nuclear proteins. The
research indicates that PUF60 binds FUSE as a dimer, and
only the first two RRM domains participate in the
single-stranded DNA recognition. .
Length = 98
Score = 30.8 bits (70), Expect = 0.69
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 503 GNEEIEDERETKMEEECQKFGHYNK 527
G E+++D+ E ++ EEC KFG N+
Sbjct: 11 GPEDLDDDLEGEVTEECGKFGAVNR 35
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 33.0 bits (75), Expect = 0.72
Identities = 14/50 (28%), Positives = 28/50 (56%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNE 505
EE +++ DD ++ E +D + + EDE+ ++ + D+ +DGNE
Sbjct: 51 EEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNE 100
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 33.1 bits (76), Expect = 0.74
Identities = 18/70 (25%), Positives = 32/70 (45%), Gaps = 8/70 (11%)
Query: 477 QEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQKFGHYNKDDPHSFQLP 536
EG+ S+EDE+ E +EED++D ++ + E + ++ K D+ +S
Sbjct: 318 IEGKSVSEEDED--EDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENS---- 371
Query: 537 ENFQMYPQFM 546
M FM
Sbjct: 372 GLLSMK--FM 379
Score = 32.7 bits (75), Expect = 1.1
Identities = 22/111 (19%), Positives = 38/111 (34%), Gaps = 12/111 (10%)
Query: 434 TENDNVIRRNEIHIKPRICGLQEENIEE-KFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQ 492
EN ++R+ +K + + KF ++ E D E + +E E EE
Sbjct: 348 GENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEES 407
Query: 493 RIEDEEDQEDGNE----------EIEDERETKMEEECQKF-GHYNKDDPHS 532
E+ E+ N E E E + +E +F D+
Sbjct: 408 DEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEE 458
>gnl|CDD|240678 cd12232, RRM3_U2AF65, RNA recognition motif 3 found in U2 large
nuclear ribonucleoprotein auxiliary factor U2AF 65 kDa
subunit (U2AF65) and similar proteins. This subfamily
corresponds to the RRM3 of U2AF65 and dU2AF50. U2AF65,
also termed U2AF2, is the large subunit of U2 small
nuclear ribonucleoprotein (snRNP) auxiliary factor
(U2AF), which has been implicated in the recruitment of
U2 snRNP to pre-mRNAs and is a highly conserved
heterodimer composed of large and small subunits. U2AF65
specifically recognizes the intron polypyrimidine tract
upstream of the 3' splice site and promotes binding of
U2 snRNP to the pre-mRNA branchpoint. U2AF65 also plays
an important role in the nuclear export of mRNA. It
facilitates the formation of a messenger
ribonucleoprotein export complex, containing both the
NXF1 receptor and the RNA substrate. Moreover, U2AF65
interacts directly and specifically with expanded CAG
RNA, and serves as an adaptor to link expanded CAG RNA
to NXF1 for RNA export. U2AF65 contains an N-terminal RS
domain rich in arginine and serine, followed by a
proline-rich segment and three C-terminal RNA
recognition motifs (RRMs), also termed RBDs (RNA binding
domains) or RNPs (ribonucleoprotein domains). The
N-terminal RS domain stabilizes the interaction of U2
snRNP with the branch point (BP) by contacting the
branch region, and further promotes base pair
interactions between U2 snRNA and the BP. The
proline-rich segment mediates protein-protein
interactions with the RRM domain of the small U2AF
subunit (U2AF35 or U2AF1). The RRM1 and RRM2 are
sufficient for specific RNA binding, while RRM3 is
responsible for protein-protein interactions. The family
also includes Splicing factor U2AF 50 kDa subunit
(dU2AF50), the Drosophila ortholog of U2AF65. dU2AF50
functions as an essential pre-mRNA splicing factor in
flies. It associates with intronless mRNAs and plays a
significant and unexpected role in the nuclear export of
a large number of intronless mRNAs.
Length = 89
Score = 30.6 bits (70), Expect = 0.75
Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 5/31 (16%)
Query: 494 IEDEEDQEDGNEEIEDERETKMEEECQKFGH 524
E+ ED E+ E +ED ++EEC K+G
Sbjct: 12 PEELEDDEEYEEILED-----VKEECGKYGK 37
>gnl|CDD|204194 pfam09304, Cortex-I_coil, Cortexillin I, coiled coil. Members of
this family are predominantly found in the
actin-bundling protein Cortexillin I from Dictyostelium
discoideum. They adopt a structure consisting of an
18-heptad-repeat alpha-helical coiled-coil, and are a
prerequisite for the assembly of Cortexillin I.
Length = 107
Score = 31.1 bits (70), Expect = 0.79
Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 6/75 (8%)
Query: 450 RICGLQEENIEEKFDDSSQEEYFEAEDQ--EGQGDSQEDEEMEEQRIEDEEDQ-EDGNEE 506
++ GL+ EK +SQEE + +DQ + + EQR+ + + + ++ +
Sbjct: 17 KLAGLENSLESEK---ASQEELAKQKDQLRSLLASLEAQKAEREQRLRELQAKIDELLKN 73
Query: 507 IEDERETKMEEECQK 521
+E E+ KME E +
Sbjct: 74 LELEKLAKMELESRL 88
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 32.0 bits (73), Expect = 0.89
Identities = 18/73 (24%), Positives = 36/73 (49%), Gaps = 8/73 (10%)
Query: 454 LQEENIEE--KFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEED-QEDGNEE---- 506
+EE EE K ++ + E E E+ E + + +++EE ++R E QE+ +
Sbjct: 23 AEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSF 82
Query: 507 -IEDERETKMEEE 518
+E+E K+ +
Sbjct: 83 VVEEEGTDKLSAD 95
Score = 31.2 bits (71), Expect = 1.5
Identities = 21/77 (27%), Positives = 35/77 (45%), Gaps = 4/77 (5%)
Query: 449 PRICGLQEENIEEKFDDSSQEEYFEAEDQE----GQGDSQEDEEMEEQRIEDEEDQEDGN 504
+I + +EEK Q E E E +E + E +E EE E E+ +E+
Sbjct: 1 KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60
Query: 505 EEIEDERETKMEEECQK 521
+ +E+ K +EE +K
Sbjct: 61 RKEREEQARKEQEEYEK 77
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains separated
by a hinge in the middle. The eukaryotic SMC proteins
form two kind of heterodimers: the SMC1/SMC3 and the
SMC2/SMC4 types. These heterodimers constitute an
essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 33.0 bits (75), Expect = 0.89
Identities = 25/122 (20%), Positives = 50/122 (40%), Gaps = 9/122 (7%)
Query: 433 ITENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQ 492
++ DN++ E I+ RI +E I K++ +E E D++ + + ++EE E
Sbjct: 906 ESQKDNLLEEKENEIEERIA--EEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERN 963
Query: 493 RIEDEEDQEDGNEEIEDERETKMEEECQKFGH-------YNKDDPHSFQLPENFQMYPQF 545
+ +E GN + E + +EE K + + E Q + +F
Sbjct: 964 KRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEF 1023
Query: 546 MY 547
+
Sbjct: 1024 LE 1025
Score = 31.1 bits (70), Expect = 3.3
Identities = 19/124 (15%), Positives = 36/124 (29%), Gaps = 8/124 (6%)
Query: 403 AITEQHQPAELHPQFTTIEYTIPLLYSITYITENDNVIRRNEIHIKP---RICGLQEENI 459
+ E+ E I + E + + E + + R L
Sbjct: 912 LLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKE 971
Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIE-----DEEDQEDGNEEIEDERETK 514
E + FE +++ D + E +EE++ E EE + E +E
Sbjct: 972 ELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVSIN 1031
Query: 515 MEEE 518
Sbjct: 1032 RGLN 1035
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 32.4 bits (74), Expect = 0.93
Identities = 13/37 (35%), Positives = 23/37 (62%)
Query: 484 QEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQ 520
+E +E+EE+ + EED +E+E+ ET +E E +
Sbjct: 5 REQQELEERMEQMEEDMRRAQKELEEYEETALELEEK 41
>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1. This
family consists of several acidic phosphoprotein
precursor PCEMA1 sequences which appear to be found
exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
that is associated with the membrane of the infected
erythrocyte throughout the entire intraerythrocytic
cycle. The exact function of this family is unclear.
Length = 286
Score = 32.2 bits (73), Expect = 1.0
Identities = 17/103 (16%), Positives = 40/103 (38%), Gaps = 14/103 (13%)
Query: 441 RRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQE-----------GQGDSQEDEEM 489
++ E+ ++ + + E++ +D +E + E G+ + +DE
Sbjct: 164 KKQELPVEETLSTIHEDDASTLHEDDELDEEVTSYLNELDDEVTSYFNDGENEENDDELE 223
Query: 490 EEQ--RIEDEEDQEDGNEEIEDE-RETKMEEECQKFGHYNKDD 529
E ++D E+ + E+I E RE K ++ +
Sbjct: 224 AEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDES 266
Score = 29.5 bits (66), Expect = 7.7
Identities = 28/143 (19%), Positives = 49/143 (34%), Gaps = 26/143 (18%)
Query: 393 QRNAFPPQYAAITEQHQPAELHPQFTTI----EYTIPLLYSITYITENDNVIRRNEIHIK 448
Q +A PP + Q + +TI T+ + + + NE+ +
Sbjct: 152 QSDAPPPVP--DMPKKQELPVEETLSTIHEDDASTLHEDDEL----DEEVTSYLNELDDE 205
Query: 449 PRICGLQEENIEEK--FDDSSQEEYFEAED------------QEGQGDSQEDEEMEEQRI 494
EN E + + E+ +E +GD E E +
Sbjct: 206 VTSYFNDGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDE 265
Query: 495 EDEEDQEDGNEEIEDERETKMEE 517
++E +E+ EE E+E K EE
Sbjct: 266 SEDEYEEEAGEEQENE--DKGEE 286
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 32.7 bits (75), Expect = 1.0
Identities = 10/62 (16%), Positives = 36/62 (58%), Gaps = 4/62 (6%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKME 516
+++ + ++ + +D D +D+++++ ++++D++D +++++DE E K E
Sbjct: 119 NYVKDIDVLNQADDDDDDDDD----DDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKE 174
Query: 517 EE 518
+
Sbjct: 175 AK 176
Score = 31.1 bits (71), Expect = 2.7
Identities = 10/55 (18%), Positives = 28/55 (50%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
+++ ++ DD ++ + +D E D D+E EE++ E ++ +++ +
Sbjct: 136 DDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWD 190
>gnl|CDD|217940 pfam04177, TAP42, TAP42-like family. The TOR signalling pathway
activates a cell-growth program in response to
nutrients. TIP41 (pfam04176) interacts with TAP42 and
negatively regulates the TOR signaling pathway.
Length = 335
Score = 32.3 bits (74), Expect = 1.1
Identities = 15/48 (31%), Positives = 21/48 (43%), Gaps = 2/48 (4%)
Query: 467 SQEEYFEAEDQEGQGDSQEDEEMEEQRIE--DEEDQEDGNEEIEDERE 512
+ +EY E E + G E E DE+D+ED +EE R
Sbjct: 279 TVDEYLEQEMKRGGVPQGGGEAAASAEEEEDDEDDEEDDDEETLKARA 326
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 31.6 bits (71), Expect = 1.1
Identities = 18/67 (26%), Positives = 33/67 (49%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
QE +E+ D +EE +E +E EE+R ++E++ E E+++E +
Sbjct: 44 QERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQE 103
Query: 515 MEEECQK 521
+E QK
Sbjct: 104 EQERIQK 110
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 32.6 bits (74), Expect = 1.1
Identities = 10/66 (15%), Positives = 15/66 (22%), Gaps = 8/66 (12%)
Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
GL EN +S A E E N++ +
Sbjct: 117 GLSPENTSGSSPES------PASHSPPPSPPSHPGPHEPA--PPESHNPSPNQQPSSFLQ 168
Query: 513 TKMEEE 518
E+
Sbjct: 169 PSHEDS 174
Score = 30.3 bits (68), Expect = 5.9
Identities = 9/65 (13%), Positives = 19/65 (29%), Gaps = 2/65 (3%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQED--EEMEEQRIEDEEDQEDGNEEIEDERET 513
+ + + E + + +Q+ Q + EE E + D + E T
Sbjct: 140 PPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPT 199
Query: 514 KMEEE 518
Sbjct: 200 SSPPP 204
Score = 29.6 bits (66), Expect = 9.7
Identities = 8/65 (12%), Positives = 18/65 (27%), Gaps = 2/65 (3%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQE--GQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
+ S + A + + Q ++ + E+ E E E +
Sbjct: 132 ASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPG 191
Query: 514 KMEEE 518
+ E
Sbjct: 192 PPQSE 196
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 32.4 bits (74), Expect = 1.1
Identities = 18/66 (27%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 466 SSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK-MEEECQKFGH 524
+E E D+E + + +E +E EE+ + EE+ ++ E+ E +++TK ++E ++
Sbjct: 27 VEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86
Query: 525 YNKDDP 530
NK P
Sbjct: 87 LNKTKP 92
Score = 31.7 bits (72), Expect = 1.9
Identities = 15/59 (25%), Positives = 33/59 (55%), Gaps = 3/59 (5%)
Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
E + + +EE E E+++ + + D+E E ++EE++E+ ++ + +ET E E
Sbjct: 30 EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEV---DEEEEKEEKKKKTKKVKETTTEWE 85
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 32.1 bits (73), Expect = 1.4
Identities = 11/63 (17%), Positives = 29/63 (46%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
E I ++ ++ + + E + Q + E++ + ED+ + +E+E+ + +
Sbjct: 386 AELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEE 445
Query: 516 EEE 518
EE
Sbjct: 446 EEA 448
Score = 30.5 bits (69), Expect = 4.3
Identities = 14/60 (23%), Positives = 23/60 (38%), Gaps = 6/60 (10%)
Query: 465 DSSQEEYFEAEDQEGQGDSQEDEEMEEQRIE------DEEDQEDGNEEIEDERETKMEEE 518
D E+ AE + + + + E+R+E EED E E + E+E
Sbjct: 377 DQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDE 436
Score = 29.4 bits (66), Expect = 10.0
Identities = 19/64 (29%), Positives = 29/64 (45%), Gaps = 6/64 (9%)
Query: 462 KFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQK 521
K D EY DQE + + E+ QR E+E Q + E +E E+ + EE+ +
Sbjct: 367 KIDIKDSAEY----DQEAED--AKVAELISQREEEEALQREAEERLEAEQAERAEEDARL 420
Query: 522 FGHY 525
Y
Sbjct: 421 RELY 424
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 32.1 bits (73), Expect = 1.4
Identities = 16/70 (22%), Positives = 26/70 (37%), Gaps = 6/70 (8%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDG------NEEIE 508
+E DD + E + E + D+E EE ++ Q DG +EE
Sbjct: 161 KEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEG 220
Query: 509 DERETKMEEE 518
+E + E
Sbjct: 221 EEAPSINYNE 230
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 32.1 bits (73), Expect = 1.4
Identities = 11/45 (24%), Positives = 22/45 (48%)
Query: 474 AEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
E + + Q+ EE E+QR ++ Q++ + E+ K E+
Sbjct: 65 KEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQ 109
>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
[Translation, ribosomal structure and biogenesis].
Length = 569
Score = 32.4 bits (73), Expect = 1.5
Identities = 18/77 (23%), Positives = 33/77 (42%), Gaps = 5/77 (6%)
Query: 464 DDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIED-EEDQEDGNEEIEDERETKMEEECQKF 522
+ S+ E E+ +E Q S + E+EEQ + E+ E+ + + E EE +
Sbjct: 92 GEESELEEAESVFKEKQMLSADVSEIEEQSNDSLSENDEEPSMDDEKTSAEAAREEFAEE 151
Query: 523 GHYNKDDPHSFQLPENF 539
K P + + + F
Sbjct: 152 ----KRIPDPYGINDKF 164
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 32.3 bits (74), Expect = 1.6
Identities = 23/90 (25%), Positives = 46/90 (51%), Gaps = 6/90 (6%)
Query: 429 SITYITENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEE 488
+ + E + + ++ + L+++ EE++++ EEY E +E G E EE
Sbjct: 631 AFEELAETEKRLEE----LRKELEELEKKYSEEEYEELR-EEYLELS-RELAGLRAELEE 684
Query: 489 MEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
+E++R E ++ E EE+E+ + K E E
Sbjct: 685 LEKRREEIKKTLEKLKEELEEREKAKKELE 714
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 31.0 bits (70), Expect = 1.7
Identities = 23/106 (21%), Positives = 41/106 (38%), Gaps = 21/106 (19%)
Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQKFGHYNKDDPHS 532
E + + E EE E+++ E ++ED E+ E + E EEE ++
Sbjct: 48 EHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQE-KEEIAQEEEKEE----------- 95
Query: 533 FQLPENFQMYPQFMYHLRRSQFLQVFNNSPDETSYYRTLLIHLPHY 578
+ R+ F ++ + +E + L HLPHY
Sbjct: 96 ---EAEDVKQQEVFSFKRKKPFKEM---NLEEKIDF---LAHLPHY 132
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast) and
GNL3L (human) are putative GTPases which are required
for growth and play a role in processing of nucleolar
pre-rRNA. This family contains a potential nuclear
localisation signal.
Length = 80
Score = 29.1 bits (66), Expect = 1.8
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 487 EEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
EE+EE++ + EE++E E + ER +
Sbjct: 49 EEIEEKKRKQEEEKERRKEARKAERAEARKRG 80
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 30.7 bits (70), Expect = 1.8
Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 5/49 (10%)
Query: 470 EYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
+ + +E + + +E+ E+EE DEE+Q D E +E E E+
Sbjct: 92 KLLGLDKKEKEEEEEEEVEVEE---LDEEEQID--ELLEKELAKLKREK 135
Score = 29.6 bits (67), Expect = 4.2
Identities = 12/46 (26%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 473 EAEDQEGQGDSQEDEEM-EEQRIEDEEDQEDGNEEIEDERETKMEE 517
+ E +E + + E EE+ EE++I++ ++E + E RE + ++
Sbjct: 98 KKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQ 143
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 32.0 bits (72), Expect = 2.0
Identities = 14/60 (23%), Positives = 28/60 (46%), Gaps = 2/60 (3%)
Query: 456 EENIEEKFDDS-SQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
E NI F + SQE + ++ Q + EE +D E++++ + +D + +K
Sbjct: 82 EPNITRSFQEPVSQESEVQDNTEQNQDTKGSKTDSEEDD-DDSEEEDNKSTSSKDGKGSK 140
>gnl|CDD|240820 cd12374, RRM_UHM_SPF45_PUF60, RNA recognition motif in UHM domain
of 45 kDa-splicing factor (SPF45) and similar proteins.
This subfamily corresponds to the RRM found in UHM
domain of 45 kDa-splicing factor (SPF45 or RBM17),
poly(U)-binding-splicing factor PUF60 (FIR or Hfp or
RoBP1 or Siah-BP1), and similar proteins. SPF45 is an
RNA-binding protein consisting of an unstructured
N-terminal region, followed by a G-patch motif and a
C-terminal U2AF (U2 auxiliary factor) homology motifs
(UHM) that harbors a RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain) and an Arg-Xaa-Phe sequence
motif. SPF45 regulates alternative splicing of the
apoptosis regulatory gene FAS (also known as CD95). It
induces exon 6 skipping in FAS pre-mRNA through the UHM
domain that binds to tryptophan-containing linear
peptide motifs (UHM ligand motifs, ULMs) present in the
3' splice site-recognizing factors U2AF65, SF1 and
SF3b155. PUF60 is an essential splicing factor that
functions as a poly-U RNA-binding protein required to
reconstitute splicing in depleted nuclear extracts. Its
function is enhanced through interaction with U2
auxiliary factor U2AF65. PUF60 also controls human c-myc
gene expression by binding and inhibiting the
transcription factor far upstream sequence element
(FUSE)-binding-protein (FBP), an activator of c-myc
promoters. PUF60 contains two central RRMs and a
C-terminal UHM domain. .
Length = 85
Score = 29.5 bits (67), Expect = 2.0
Identities = 8/23 (34%), Positives = 16/23 (69%)
Query: 505 EEIEDERETKMEEECQKFGHYNK 527
EI+++ + ++EEEC+K+G
Sbjct: 13 GEIDEDLKDEIEEECEKYGKVLN 35
>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein. This family consists of mammalian
MRVI1 proteins which are related to the
lymphoid-restricted membrane protein (JAW1) and the IP3
receptor associated cGMP kinase substrates A and B
(IRAGA and IRAGB). The function of MRVI1 is unknown
although mutations in the Mrvi1 gene induces myeloid
leukaemia by altering the expression of a gene important
for myeloid cell growth and/or differentiation so it has
been speculated that Mrvi1 is a tumour suppressor gene.
IRAG is very similar in sequence to MRVI1 and is an
essential NO/cGKI-dependent regulator of IP3-induced
calcium release. Activation of cGKI decreases
IP3-stimulated elevations in intracellular calcium,
induces smooth muscle relaxation and contributes to the
antiproliferative and pro-apoptotic effects of NO/cGMP.
Jaw1 is a member of a class of proteins with
COOH-terminal hydrophobic membrane anchors and is
structurally similar to proteins involved in vesicle
targeting and fusion. This suggests that the function
and/or the structure of the ER in lymphocytes may be
modified by lymphoid-restricted resident ER proteins.
Length = 538
Score = 31.9 bits (72), Expect = 2.1
Identities = 10/51 (19%), Positives = 21/51 (41%), Gaps = 4/51 (7%)
Query: 467 SQEEYFEAEDQ----EGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
S + +AE++ + Q + + E +R E+ E E ++E
Sbjct: 407 STYSWADAEEEKKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEEWDKEQ 457
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 31.6 bits (71), Expect = 2.2
Identities = 20/68 (29%), Positives = 37/68 (54%), Gaps = 3/68 (4%)
Query: 443 NEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQED 502
NE ++ + +N EE+ +Q+E E ED E + +E E +EE+ +DE+ ++D
Sbjct: 84 NENDVELEGLNIIVKNEEER---GTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKD 140
Query: 503 GNEEIEDE 510
+E + E
Sbjct: 141 DEKESDAE 148
Score = 30.8 bits (69), Expect = 3.3
Identities = 30/116 (25%), Positives = 48/116 (41%), Gaps = 19/116 (16%)
Query: 405 TEQHQPAELHPQFTTIEYT----IPLLYSITYITEND------NVIRRNEIHIKPRICGL 454
E+ +E EY+ + + + END N+I +NE + R G
Sbjct: 51 PEEQDLSEACDYIAIEEYSACGNVASIANDKSDNENDVELEGLNIIVKNE---EER--GT 105
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
Q+E E++ EE E E+ E + +D+E E+ E E D E E+ E
Sbjct: 106 QKEEEEDE----DVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENELAGE 157
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 31.5 bits (71), Expect = 2.2
Identities = 18/63 (28%), Positives = 32/63 (50%), Gaps = 6/63 (9%)
Query: 475 EDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE----ECQKFGHYNKDDP 530
++E + + +EDEE E+ E + ++ +EE E+E M++ E Q KD+
Sbjct: 140 LEEEEEVEMEEDEEYYEK--EPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVKDNG 197
Query: 531 HSF 533
SF
Sbjct: 198 KSF 200
>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1). This family
consists of several mammalian dentin matrix protein 1
(DMP1) sequences. The dentin matrix acidic
phosphoprotein 1 (DMP1) gene has been mapped to human
chromosome 4q21. DMP1 is a bone and teeth specific
protein initially identified from mineralised dentin.
DMP1 is primarily localised in the nuclear compartment
of undifferentiated osteoblasts. In the nucleus, DMP1
acts as a transcriptional component for activation of
osteoblast-specific genes like osteocalcin. During the
early phase of osteoblast maturation, Ca(2+) surges into
the nucleus from the cytoplasm, triggering the
phosphorylation of DMP1 by a nuclear isoform of casein
kinase II. This phosphorylated DMP1 is then exported out
into the extracellular matrix, where it regulates
nucleation of hydroxyapatite. DMP1 is a unique molecule
that initiates osteoblast differentiation by
transcription in the nucleus and orchestrates
mineralised matrix formation extracellularly, at later
stages of osteoblast maturation. The DMP1 gene has been
found to be ectopically expressed in lung cancer
although the reason for this is unknown.
Length = 514
Score = 31.6 bits (71), Expect = 2.3
Identities = 18/64 (28%), Positives = 30/64 (46%), Gaps = 1/64 (1%)
Query: 454 LQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
L + N E DS++ + + +SQED E E Q ED ++ +D + E E +
Sbjct: 283 LDDSNTMEVKSDSTENAGLSQSREHSRSESQEDSE-ENQSQEDSQEVQDPSSESSQEADL 341
Query: 514 KMEE 517
+E
Sbjct: 342 PSQE 345
Score = 30.8 bits (69), Expect = 4.2
Identities = 21/73 (28%), Positives = 35/73 (47%), Gaps = 6/73 (8%)
Query: 454 LQEENIEEKFD-DSSQEEYFEAEDQEGQGDSQEDEEMEEQRIE-DEEDQEDGNEEIEDER 511
EE+ E D +SS +E ++ + SQE + ++ R E D+ D +D + ED
Sbjct: 409 SSEESPESTEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEEDDSDSQDSSRSKEDSN 468
Query: 512 ETK----MEEECQ 520
T+ EE+ Q
Sbjct: 469 STESASSSEEDGQ 481
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression in
addition to DNA-based checkpoints.
Length = 76
Score = 28.7 bits (65), Expect = 2.3
Identities = 15/69 (21%), Positives = 32/69 (46%), Gaps = 12/69 (17%)
Query: 441 RRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQ 500
+ I PR L ++ +D S E+ E E+ G+ E E+ +E+++
Sbjct: 19 TKPSSVILPR-NPLSQDLPGLDYDYDSDAEWEEEEE----GEDLESED-------EEDEE 66
Query: 501 EDGNEEIED 509
ED +++++
Sbjct: 67 EDDDDDMDG 75
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 31.7 bits (72), Expect = 2.5
Identities = 13/63 (20%), Positives = 24/63 (38%), Gaps = 3/63 (4%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDS---QEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
+ + FD +++ E ED E GD +E E +E ++ E I+ +
Sbjct: 336 KKGDVGFDINAEVEDDPDEDAEDGGDGGDAEEFEAEKEPMLKVCRVDRQVTEIIDGDEPV 395
Query: 514 KME 516
Sbjct: 396 DYC 398
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 31.6 bits (71), Expect = 2.5
Identities = 20/67 (29%), Positives = 31/67 (46%), Gaps = 1/67 (1%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
EE + + D EE +AE+ E + +E EE + + EE ++ EE + E K
Sbjct: 1664 AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK-KAEELKKKEAEEKKKAEELK 1722
Query: 515 MEEECQK 521
EE K
Sbjct: 1723 KAEEENK 1729
Score = 30.5 bits (68), Expect = 5.2
Identities = 22/86 (25%), Positives = 42/86 (48%), Gaps = 9/86 (10%)
Query: 440 IRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAED----QEGQGDSQEDEEMEEQRIE 495
+R+ E K +E IEE +E+ +AE+ +E + ++E ++ EE++ +
Sbjct: 1580 LRKAEEAKK-----AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKK 1634
Query: 496 DEEDQEDGNEEIEDERETKMEEECQK 521
E+ ++ EE + E K EE K
Sbjct: 1635 VEQLKKKEAEEKKKAEELKKAEEENK 1660
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 27.7 bits (62), Expect = 2.6
Identities = 13/25 (52%), Positives = 18/25 (72%), Gaps = 1/25 (4%)
Query: 490 EEQRIEDEEDQEDGNEEIEDERETK 514
EE++ EDEE+ ED EE E++ E K
Sbjct: 22 EEEKREDEEENEDE-EEGEEQSEVK 45
>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH. This
bacillus clade of FliH proteins is not found by the Pfam
FliH model pfam02108, but is closely related to the
sequences identified by that model. Sequences identified
by this model are observed in flagellar operons in an
analogous position relative to other flagellar operon
genes.
Length = 255
Score = 30.9 bits (70), Expect = 2.7
Identities = 21/88 (23%), Positives = 35/88 (39%), Gaps = 3/88 (3%)
Query: 433 ITENDNVIRRNEIHIKP-RICGLQEENIEEKFDDSSQEEYF--EAEDQEGQGDSQEDEEM 489
I +VI E I P R E +E D+ + E +AE + Q Q + +
Sbjct: 5 IKSTKSVIPAQERGIIPLRQVTDVPETEQELADEEQEFEQILEKAEAEAAQIIEQAEAQA 64
Query: 490 EEQRIEDEEDQEDGNEEIEDERETKMEE 517
R + E+++ EE E + +E
Sbjct: 65 AAIREQIEQERAQWEEERERLIQEAKQE 92
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 31.6 bits (72), Expect = 2.7
Identities = 17/61 (27%), Positives = 34/61 (55%), Gaps = 3/61 (4%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKME 516
IE+K + + E+ E ++E Q ++ +++EQ E++ E+ N + E E E ++E
Sbjct: 815 REIEQKLNRLTLEK--EYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKE-ELEEELE 871
Query: 517 E 517
E
Sbjct: 872 E 872
>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin).
Length = 585
Score = 31.5 bits (71), Expect = 2.7
Identities = 17/92 (18%), Positives = 33/92 (35%), Gaps = 11/92 (11%)
Query: 455 QEENIEEKFDDSSQEEYFEAED---------QEGQGDSQEDEEMEEQRIEDEEDQEDGNE 505
Q + E+ D ++ Y + ED E ED E+ ED+ +E
Sbjct: 186 QSKQKRERSD-EREKSYQDDEDDTYRQNNIPYEDVV-GGEDWNPIEEGEEDQTQEEVKRS 243
Query: 506 EIEDERETKMEEECQKFGHYNKDDPHSFQLPE 537
+ + + +E ++ G +D + E
Sbjct: 244 KERTHKGRSLPDESKRSGQLGLEDEAEEEKEE 275
Score = 30.3 bits (68), Expect = 6.0
Identities = 17/60 (28%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 453 GLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEE-QRIEDEEDQEDGNEEIEDER 511
G IEE +D +QEE ++++ +G S DE Q ++E +E+ E+ ++ R
Sbjct: 222 GEDWNPIEEGEEDQTQEEVKRSKERTHKGRSLPDESKRSGQLGLEDEAEEEKEEKGQESR 281
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 31.5 bits (71), Expect = 2.8
Identities = 26/115 (22%), Positives = 42/115 (36%), Gaps = 20/115 (17%)
Query: 406 EQHQPAELHPQFTTIEYTIPLLYSITYITENDNVIRRNEIHIKPRICGLQEENIEEKFDD 465
E+ Q EL P+ ++ Y+ I DNV+ +++ N EE +
Sbjct: 53 EEEQDYELRPRVSSSWNN--ESYNRLPIKTKDNVVA--DVN-----------NGEEFLSE 97
Query: 466 SSQEEYFE---AEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE 517
S E E E Q +E + E ++ + D E E I +EE
Sbjct: 98 SESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSE--KERIASICTKIIEE 150
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 31.2 bits (71), Expect = 2.9
Identities = 22/96 (22%), Positives = 38/96 (39%), Gaps = 21/96 (21%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEG-QGDSQEDE--------------------EMEEQRI 494
EE EE+ + + ++E E D EG + D E++
Sbjct: 290 EEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPEQKDE 349
Query: 495 EDEEDQEDGNEEIEDERETKMEEECQKFGHYNKDDP 530
E+E++ E+ EE E+ E + EE ++D P
Sbjct: 350 EEEQEDEEEEEEEEEPEEPEPEEGPPLLTPISEDAP 385
>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
selection and in elongation by RNA polymerase II
[Transcription].
Length = 521
Score = 31.2 bits (70), Expect = 3.2
Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 7/66 (10%)
Query: 464 DDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIE----DEEDQEDGNEEIEDERETKM---E 516
D++ +Y E EG S+E++ R+E DE D+ E ERE+ + E
Sbjct: 41 DNNENVDYAEESGGEGNEKSEEEKFKNPYRLEGKFKDEADRAKIMAMTEIERESILFERE 100
Query: 517 EECQKF 522
EE K
Sbjct: 101 EEISKL 106
>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
(DUF2052). This entry is of sequences of two conserved
domains separated by a region of low complexity,
spanning some 200 residues. The function is unknown.
Length = 178
Score = 30.1 bits (68), Expect = 3.2
Identities = 20/68 (29%), Positives = 33/68 (48%), Gaps = 3/68 (4%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
QEE + ++ EE E E++E Q +EE E+ DE + EE+ +E ++
Sbjct: 77 QEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELPTPEEREELREEFISE 136
Query: 515 MEEECQKF 522
M Q+F
Sbjct: 137 MH---QRF 141
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 30.8 bits (70), Expect = 3.2
Identities = 20/82 (24%), Positives = 34/82 (41%), Gaps = 2/82 (2%)
Query: 437 DNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIED 496
+N + + E I + + + EEK E A + E Q + +E++ E
Sbjct: 58 ENQVSQLESEI--QEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAEL 115
Query: 497 EEDQEDGNEEIEDERETKMEEE 518
E+ E +EIED +E E
Sbjct: 116 MEEIEKLEKEIEDLKERLERLE 137
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 30.7 bits (69), Expect = 3.2
Identities = 18/69 (26%), Positives = 25/69 (36%), Gaps = 4/69 (5%)
Query: 454 LQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDE----EDQEDGNEEIED 509
QE K + +++ Y E Q E+ EQ ED E E E +D
Sbjct: 146 RQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDD 205
Query: 510 ERETKMEEE 518
E +EE
Sbjct: 206 VTEEDYDEE 214
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 31.1 bits (70), Expect = 3.4
Identities = 11/53 (20%), Positives = 22/53 (41%)
Query: 458 NIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
N E + D + +++ ++ E R +E + E+ E+ EDE
Sbjct: 343 NEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDE 395
Score = 30.7 bits (69), Expect = 5.0
Identities = 18/56 (32%), Positives = 30/56 (53%), Gaps = 6/56 (10%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
+E++FDD YFE + E + +E E+ +ED ED+ D ++ I D+ E
Sbjct: 358 VPLEKQFDD---ILYFEKMEIE---NRNPEESEHEEEVEDYEDENDHSKRICDDDE 407
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 30.2 bits (69), Expect = 3.4
Identities = 12/41 (29%), Positives = 20/41 (48%)
Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
A + E E EE +D++D ED +E+ E+E +
Sbjct: 33 NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADL 73
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 30.8 bits (69), Expect = 3.7
Identities = 16/86 (18%), Positives = 33/86 (38%), Gaps = 17/86 (19%)
Query: 446 HIKPRICGLQEENIEEKFDDSSQEEY------FEAEDQEGQ-----------GDSQEDEE 488
H+ + +E+ +EK +EE +D E + G Q E
Sbjct: 184 HLWNKAATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMET 243
Query: 489 MEEQRIEDEEDQEDGNEEIEDERETK 514
E + E E + D +E+ +++ + +
Sbjct: 244 SESEEEESSESESDEDEDEDNKGKIR 269
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 29.9 bits (67), Expect = 3.9
Identities = 16/72 (22%), Positives = 31/72 (43%), Gaps = 6/72 (8%)
Query: 454 LQEENI---EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
L++E E++ + + E E + E +E+ +QR E +E + + E
Sbjct: 42 LEKEGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQR--KREQREKELAKRQKE 99
Query: 511 RETKMEEECQKF 522
E K+E +K
Sbjct: 100 LE-KIELSKKKQ 110
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 30.4 bits (69), Expect = 4.3
Identities = 11/34 (32%), Positives = 19/34 (55%)
Query: 482 DSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
+ +E EEMEE+ ED D++ E+ +E +
Sbjct: 153 EEEEPEEMEEELEEDAADRDARKRAAEEAKEQEE 186
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 30.6 bits (70), Expect = 4.5
Identities = 9/52 (17%), Positives = 26/52 (50%)
Query: 460 EEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
E + ++ ++ F +D + + +EE ++ +D ++ +D E+ +R
Sbjct: 339 EAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDEGVEVIYKR 390
>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2. This
domain family is found in eukaryotes, and is typically
between 138 and 153 amino acids in length. The family is
found in association with pfam00493. Mini-chromosome
maintenance (MCM) proteins are essential for DNA
replication. These proteins use ATPase activity to
perform this function.
Length = 145
Score = 29.2 bits (66), Expect = 4.7
Identities = 13/28 (46%), Positives = 19/28 (67%), Gaps = 2/28 (7%)
Query: 485 EDEEMEEQRIEDEEDQEDGNEEIEDERE 512
EDE EE+ +ED+ D D +EE E+E +
Sbjct: 1 EDEPDEEEELEDDID--DLDEEAEEEED 26
Score = 29.2 bits (66), Expect = 5.2
Identities = 12/42 (28%), Positives = 21/42 (50%), Gaps = 5/42 (11%)
Query: 470 EYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQED-GNEEIEDE 510
E E++E + D + +E E+EED ED + +E +
Sbjct: 1 EDEPDEEEELEDDIDDLDEEA----EEEEDGEDLFGDNMERD 38
>gnl|CDD|225281 COG2425, COG2425, Uncharacterized protein containing a von
Willebrand factor type A (vWA) domain [General function
prediction only].
Length = 437
Score = 30.4 bits (69), Expect = 4.7
Identities = 29/184 (15%), Positives = 63/184 (34%), Gaps = 21/184 (11%)
Query: 461 EKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEECQ 520
E++++ Q+ E + +G+ + +++ I + E EE+E+ ++ +
Sbjct: 110 ERWEELLQDLQREGSEDFLEGEREGLLSEKQEEISLSGEMEGILEEVEELISGRLWDMSA 169
Query: 521 KFGHYNKDDPHSFQLPENFQMYPQFM-YHLRRSQFLQVFNNSPDETSYYRTLLIHLPHYS 579
+L Y F+ + L+ E +T +
Sbjct: 170 GELKKG-----DLELLGK---YADFLLKFPELRKILEQLGREDLEIEEGKTKIGRSFERR 221
Query: 580 -RFQFLQVFNN----SPDET-SYYRHVLMREDLSQSLIMIQPVLYSYGFNG---PPVPVL 630
+ ++ P E S +L E + +++ L +Y G PV +L
Sbjct: 222 GEVSGITQSDDLLRLLPIERASLADPILSVEFYRK---LVEKKLLTYRLQGKSEGPVILL 278
Query: 631 LDTS 634
LD S
Sbjct: 279 LDKS 282
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 30.8 bits (70), Expect = 5.0
Identities = 15/65 (23%), Positives = 26/65 (40%)
Query: 454 LQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERET 513
L+E + + + E E ++ + +E E E + E E E EE++ RE
Sbjct: 745 LEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREA 804
Query: 514 KMEEE 518
E
Sbjct: 805 LDELR 809
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 30.4 bits (68), Expect = 5.3
Identities = 13/58 (22%), Positives = 23/58 (39%), Gaps = 1/58 (1%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERE 512
Q E E +D + ++ E + E EE + + E E D ++E D +
Sbjct: 135 QPEKNESGNNDHATDKENLLESDASSSNDSESEESDSES-EIESSDSDHDDENSDSKL 191
>gnl|CDD|220624 pfam10187, Nefa_Nip30_N, N-terminal domain of NEFA-interacting
nuclear protein NIP30. This is a the N-terminal 100
amino acids of a family of proteins conserved from
plants to humans. The full-length protein has putatively
been called NEFA-interacting nuclear protein NIP30,
however no reference could be found to confirm this.
Length = 99
Score = 28.5 bits (64), Expect = 5.3
Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 2/48 (4%)
Query: 473 EAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNE--EIEDERETKMEEE 518
E+E E + QE+ + E++ DG E E + K +EE
Sbjct: 4 ESELDEARKRRQEEVRAPRDPKAEPEEEYDGRSLYERLQENKDKKQEE 51
>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein. This family consists of several
eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
to suppress GCR2 and is highly expressed in the muscle
and heart. The function of this family is unknown
although it has been speculated that SGT1 may be
functionally analogous to the Gcr2p protein of
Saccharomyces cerevisiae which is known to be a
regulatory factor of glycolytic gene expression.
Length = 557
Score = 30.1 bits (68), Expect = 6.1
Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 4/59 (6%)
Query: 456 EENIEEKFDDSSQ---EEYFEAEDQE-GQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
++ E+ DS+ EEY + D E Q DS + ++ ED +D E +E
Sbjct: 496 DDEDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADISNSGSSGAEDDDDDIEGVEPV 554
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 30.2 bits (68), Expect = 6.3
Identities = 18/79 (22%), Positives = 31/79 (39%), Gaps = 10/79 (12%)
Query: 454 LQEENIEEKFDDSSQEEY-FEAEDQEGQGDSQEDEEME--EQRIEDEEDQEDGNEE---- 506
LQ+E + + + E E E E + D E + + E E + + N+
Sbjct: 95 LQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERL 154
Query: 507 ---IEDERETKMEEECQKF 522
E + E +E E +KF
Sbjct: 155 KFENEKKLEESLELEREKF 173
Score = 29.4 bits (66), Expect = 9.2
Identities = 21/93 (22%), Positives = 38/93 (40%), Gaps = 18/93 (19%)
Query: 443 NEIHIKPRICGLQEENI------------EEKFDDSSQEEYFEAEDQEGQGDS-----QE 485
I KP E++I E++ + E FE E E + +E
Sbjct: 6 VPIQTKPFTIPKCEDSIKGEQARYKQIEQEDQSRILNTLEEFEKEANEKRAQYRSAKKKE 65
Query: 486 DEEMEEQRIEDEEDQED-GNEEIEDERETKMEE 517
++EEQ I +++Q++ NE+I+ +E
Sbjct: 66 LSQLEEQLINQKKEQKNLFNEQIKQFELALQDE 98
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 30.5 bits (69), Expect = 6.3
Identities = 19/66 (28%), Positives = 35/66 (53%), Gaps = 6/66 (9%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQ-----GDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
+ ++E+ ++ +EE EAE + ++ E EQ IE+ E++ + EE DE
Sbjct: 789 QALQEELEEL-EEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDEL 847
Query: 512 ETKMEE 517
E ++EE
Sbjct: 848 EEELEE 853
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 28.5 bits (64), Expect = 6.5
Identities = 14/46 (30%), Positives = 25/46 (54%)
Query: 469 EEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETK 514
++ + +D E G ++EE R+ ED+E+ E E+ERE +
Sbjct: 56 DDDDDDDDMEAGGADIQEEERRSARLARLEDREEERLEKEEEREKR 101
>gnl|CDD|187810 cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Multidomain protein with permuted HD nuclease domain,
palm domain and Zn-ribbon; MTH326-like has inactivated
polymerase catalytic domain; alr1562 and slr7011 -
predicted only on the basis of size, presence of HD
domain, and location with RAMPs in one operon; signature
gene for type III; also known as Crm2 family.
Length = 475
Score = 30.1 bits (68), Expect = 6.7
Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 2/66 (3%)
Query: 454 LQEENIEEKFDDSSQ--EEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDER 511
L+EE EE ++ +EY + E+ E + E RIED + + EE+E R
Sbjct: 96 LKEEIEEETWESILNKVKEYLDIENVEPFLLKSDALERISDRIEDGNEDREEAEELEAAR 155
Query: 512 ETKMEE 517
+ +
Sbjct: 156 KNVRDF 161
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 28.8 bits (65), Expect = 7.0
Identities = 13/56 (23%), Positives = 25/56 (44%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDE 510
E+ ++E+ + E ++++ + + ED E +E E D EE DE
Sbjct: 65 LEKQLKERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDE 120
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 29.6 bits (66), Expect = 7.3
Identities = 17/62 (27%), Positives = 31/62 (50%), Gaps = 1/62 (1%)
Query: 456 EENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKM 515
EE ++ + + ++ Q S+ E ++E + E+++ QE+ EE E E E K
Sbjct: 177 EELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEE-EVEEEAKQ 235
Query: 516 EE 517
EE
Sbjct: 236 EE 237
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
Length = 193
Score = 29.3 bits (65), Expect = 7.4
Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 4/69 (5%)
Query: 464 DDSSQEEYFEAEDQEGQGDSQED--EEMEEQRIEDEEDQEDGNEEIEDERETKME-EECQ 520
D+ ++ E + E +E ED EE+E+Q E+ +QE+ NEE + + E +E +
Sbjct: 19 DELAEREKLKKEVEEIPEQKPEDIVEELEDQP-EEPPEQEEENEEQKPKEEIDYPIQENK 77
Query: 521 KFGHYNKDD 529
F N DD
Sbjct: 78 SFDEKNLDD 86
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 29.2 bits (66), Expect = 7.5
Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 7/75 (9%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEE-------DQEDGNEEI 507
+E EK ++ EE E+ E + + QE E+E Q E ++ + E+ +
Sbjct: 14 EETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQAEFENLRKRT 73
Query: 508 EDERETKMEEECQKF 522
E ERE + +KF
Sbjct: 74 EREREEAKKYAIEKF 88
>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
Provisional.
Length = 337
Score = 29.8 bits (67), Expect = 7.6
Identities = 11/52 (21%), Positives = 20/52 (38%)
Query: 455 QEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEE 506
++ + DD + Y ++ +S D E+ R E E + NE
Sbjct: 282 EDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEKESNGAKSNEL 333
>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist. LIN-8
is a nuclear protein, present at the sites of
transcriptional repressor complexes, which interacts
with LIN-35 Rb.Lin35 Rb is a product of the class B
synMuv gene lin-35 which silences genes required for
vulval specification through chromatin modification and
remodelling. The biological role of the interaction has
not yet been determined however predictions have been
made. The interaction shows that class A synMuv genes
control vulval induction through the transcriptional
regulation of gene expression. LIN-8 normally functions
as part of a protein complex however when the complex is
absent, other family members can partially replace LIN-8
activity.
Length = 316
Score = 29.7 bits (67), Expect = 8.0
Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 2/74 (2%)
Query: 464 DDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEE-ECQKF 522
DD + FE +E Q D +E++E++E +E+ D N +++ ++ M + K
Sbjct: 142 DDIIYDGIFEDRTRESQ-DMEENDEVDEVEVEEVPDDYGANLAVDEPEQSTMSRPQEVKQ 200
Query: 523 GHYNKDDPHSFQLP 536
+ P + Q P
Sbjct: 201 SVPAQQAPPNPQQP 214
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 29.7 bits (67), Expect = 8.4
Identities = 17/69 (24%), Positives = 27/69 (39%), Gaps = 15/69 (21%)
Query: 465 DSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGN---------------EEIED 509
+ +EE + +D E + E+E E+ +DE QE E + D
Sbjct: 129 EEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQETKYKRGDTSLTPQAKDVLESLID 188
Query: 510 ERETKMEEE 518
E K+E E
Sbjct: 189 AAEWKLEVE 197
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 29.8 bits (67), Expect = 8.7
Identities = 20/84 (23%), Positives = 39/84 (46%), Gaps = 7/84 (8%)
Query: 431 TYITENDNVIRRNEIHIKPRICGLQEENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEME 490
++ DN I+ E K ++ L++ IEE + E E+E QE D ++ +
Sbjct: 2549 ELLSYIDNEIKELENE-KLKL--LEKAKIEE---SRKERERIESETQEDNTDEEQINRQQ 2602
Query: 491 EQRI-EDEEDQEDGNEEIEDERET 513
++R+ ++EE + E + E
Sbjct: 2603 QERLQKEEEQKAYSQERLNREVSG 2626
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 29.5 bits (66), Expect = 8.9
Identities = 20/75 (26%), Positives = 37/75 (49%), Gaps = 12/75 (16%)
Query: 457 ENIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEME----------EQRIEDEEDQEDGNEE 506
E +E + +D S EE E E+ E + ++ +++ME + RI +E + + +E
Sbjct: 3 EVLELEEEDESGEE--EEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKE 60
Query: 507 IEDERETKMEEECQK 521
E E K + E +K
Sbjct: 61 KALEEEAKRKAEERK 75
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 29.8 bits (68), Expect = 9.3
Identities = 13/47 (27%), Positives = 22/47 (46%)
Query: 472 FEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGNEEIEDERETKMEEE 518
F + E E E EDEE++ED N++ E+++ E+
Sbjct: 173 FVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEK 219
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 29.5 bits (67), Expect = 9.4
Identities = 21/59 (35%), Positives = 36/59 (61%), Gaps = 4/59 (6%)
Query: 448 KPRICGLQEE--NIEEKFDDSSQEEYFEAEDQEGQGDSQEDEEMEEQRIEDEEDQEDGN 504
K R+ L++E +EE++ D EE ++AE QG Q EE+E+ R+E E+ + +G+
Sbjct: 433 KERLEDLEKELAELEEEYAD--LEEQWKAEKAAIQGIQQIKEEIEQVRLELEQAEREGD 489
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.138 0.431
Gapped
Lambda K H
0.267 0.0696 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 48,885,622
Number of extensions: 4836736
Number of successful extensions: 7219
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6054
Number of HSP's successfully gapped: 360
Length of query: 950
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 844
Effective length of database: 6,236,078
Effective search space: 5263249832
Effective search space used: 5263249832
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 64 (28.5 bits)