RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14939
(971 letters)
>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain.
Length = 26
Score = 43.9 bits (104), Expect = 4e-06
Identities = 18/25 (72%), Positives = 21/25 (84%)
Query: 46 NLTRHLRTHTGEQPYKCKYCERSFS 70
NL RH+RTHTGE+PYKC C +SFS
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFS 25
Score = 43.9 bits (104), Expect = 4e-06
Identities = 18/25 (72%), Positives = 21/25 (84%)
Query: 777 NLTRHLRTHTGEQPYKCKYCERSFS 801
NL RH+RTHTGE+PYKC C +SFS
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFS 25
Score = 37.8 bits (88), Expect = 6e-04
Identities = 14/25 (56%), Positives = 19/25 (76%), Gaps = 1/25 (4%)
Query: 74 NLQRHVRNIHNKEKPFKCPLCDRCF 98
NL+RH+R H EKP+KCP+C + F
Sbjct: 1 NLRRHMR-THTGEKPYKCPVCGKSF 24
Score = 37.8 bits (88), Expect = 6e-04
Identities = 14/25 (56%), Positives = 19/25 (76%), Gaps = 1/25 (4%)
Query: 805 NLQRHVRNIHNKEKPFKCPLCDRCF 829
NL+RH+R H EKP+KCP+C + F
Sbjct: 1 NLRRHMR-THTGEKPYKCPVCGKSF 24
Score = 36.6 bits (85), Expect = 0.001
Identities = 13/27 (48%), Positives = 20/27 (74%), Gaps = 1/27 (3%)
Query: 239 NLQRHIRTHHVGARQHACQECGKTFAT 265
NL+RH+RTH G + + C CGK+F++
Sbjct: 1 NLRRHMRTHT-GEKPYKCPVCGKSFSS 26
Score = 33.9 bits (78), Expect = 0.013
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 269 LKQHTHIHSSVKPFRCEVCHKAYTQ 293
L++H H+ KP++C VC K+++
Sbjct: 2 LRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 25.8 bits (57), Expect = 8.6
Identities = 8/25 (32%), Positives = 11/25 (44%), Gaps = 1/25 (4%)
Query: 212 HLNFHQAQVHG-RKFPCENCTKVFS 235
+L H G + + C C K FS
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFS 25
>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type. The C2H2 zinc finger
is the classical zinc finger domain. The two conserved
cysteines and histidines co-ordinate a zinc ion. The
following pattern describes the zinc finger.
#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
be any amino acid, and numbers in brackets indicate the
number of residues. The positions marked # are those
that are important for the stable fold of the zinc
finger. The final position can be either his or cys.
The C2H2 zinc finger is composed of two short beta
strands followed by an alpha helix. The amino terminal
part of the helix binds the major groove in DNA binding
zinc fingers. The accepted consensus binding sequence
for Sp1 is usually defined by the asymmetric
hexanucleotide core GGGCGG but this sequence does not
include, among others, the GAG (=CTC) repeat that
constitutes a high-affinity site for Sp1 binding to the
wt1 promoter.
Length = 22
Score = 38.9 bits (91), Expect = 2e-04
Identities = 14/21 (66%), Positives = 15/21 (71%)
Query: 34 CKYCGKVFPRSANLTRHLRTH 54
C CGK F R +NL RHLRTH
Sbjct: 2 CPDCGKSFSRKSNLKRHLRTH 22
Score = 38.9 bits (91), Expect = 2e-04
Identities = 14/21 (66%), Positives = 15/21 (71%)
Query: 765 CKYCGKVFPRSANLTRHLRTH 785
C CGK F R +NL RHLRTH
Sbjct: 2 CPDCGKSFSRKSNLKRHLRTH 22
Score = 38.1 bits (89), Expect = 4e-04
Identities = 13/21 (61%), Positives = 16/21 (76%)
Query: 227 CENCTKVFSDPSNLQRHIRTH 247
C +C K FS SNL+RH+RTH
Sbjct: 2 CPDCGKSFSRKSNLKRHLRTH 22
Score = 34.2 bits (79), Expect = 0.009
Identities = 10/22 (45%), Positives = 15/22 (68%)
Query: 283 RCEVCHKAYTQFSNLCRHKRMH 304
+C C K++++ SNL RH R H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 33.9 bits (78), Expect = 0.014
Identities = 11/22 (50%), Positives = 16/22 (72%)
Query: 90 KCPLCDRCFGQQTNLDRHLKKH 111
KCP C + F +++NL RHL+ H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 33.9 bits (78), Expect = 0.014
Identities = 11/22 (50%), Positives = 16/22 (72%)
Query: 821 KCPLCDRCFGQQTNLDRHLKKH 842
KCP C + F +++NL RHL+ H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 32.3 bits (74), Expect = 0.044
Identities = 10/21 (47%), Positives = 14/21 (66%)
Query: 256 CQECGKTFATSSGLKQHTHIH 276
C +CGK+F+ S LK+H H
Sbjct: 2 CPDCGKSFSRKSNLKRHLRTH 22
Score = 30.8 bits (70), Expect = 0.13
Identities = 12/20 (60%), Positives = 15/20 (75%)
Query: 61 KCKYCERSFSISSNLQRHVR 80
KC C +SFS SNL+RH+R
Sbjct: 1 KCPDCGKSFSRKSNLKRHLR 20
Score = 30.8 bits (70), Expect = 0.13
Identities = 12/20 (60%), Positives = 15/20 (75%)
Query: 792 KCKYCERSFSISSNLQRHVR 811
KC C +SFS SNL+RH+R
Sbjct: 1 KCPDCGKSFSRKSNLKRHLR 20
Score = 28.5 bits (64), Expect = 0.94
Identities = 9/22 (40%), Positives = 13/22 (59%)
Query: 312 KCVKCDQSFSTVTSLSKHKRFC 333
KC C +SFS ++L +H R
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger. This family contains
a number of divergent C2H2 type zinc fingers.
Length = 24
Score = 36.8 bits (85), Expect = 0.001
Identities = 12/24 (50%), Positives = 14/24 (58%)
Query: 225 FPCENCTKVFSDPSNLQRHIRTHH 248
F C C K FS L+RH+R HH
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKHH 24
Score = 33.8 bits (77), Expect = 0.012
Identities = 11/23 (47%), Positives = 12/23 (52%)
Query: 32 YACKYCGKVFPRSANLTRHLRTH 54
+ C CGK F L RHLR H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 33.8 bits (77), Expect = 0.012
Identities = 11/23 (47%), Positives = 12/23 (52%)
Query: 763 YACKYCGKVFPRSANLTRHLRTH 785
+ C CGK F L RHLR H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 33.0 bits (75), Expect = 0.026
Identities = 13/23 (56%), Positives = 16/23 (69%)
Query: 89 FKCPLCDRCFGQQTNLDRHLKKH 111
FKCPLC + F + L RHL+KH
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 33.0 bits (75), Expect = 0.026
Identities = 13/23 (56%), Positives = 16/23 (69%)
Query: 820 FKCPLCDRCFGQQTNLDRHLKKH 842
FKCPLC + F + L RHL+KH
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 32.2 bits (73), Expect = 0.044
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 60 YKCKYCERSFSISSNLQRHVRNIH 83
+KC C +SFS L+RH+R H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKHH 24
Score = 32.2 bits (73), Expect = 0.044
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 791 YKCKYCERSFSISSNLQRHVRNIH 814
+KC C +SFS L+RH+R H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKHH 24
Score = 29.1 bits (65), Expect = 0.58
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 282 FRCEVCHKAYTQFSNLCRHKRMH 304
F+C +C K+++ L RH R H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 28.0 bits (62), Expect = 1.8
Identities = 9/23 (39%), Positives = 13/23 (56%)
Query: 254 HACQECGKTFATSSGLKQHTHIH 276
C CGK+F++ LK+H H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 27.6 bits (61), Expect = 2.1
Identities = 6/24 (25%), Positives = 12/24 (50%)
Query: 198 YRCESCSQTFCWRPHLNFHQAQVH 221
++C C ++F + L H + H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKHH 24
Score = 26.4 bits (58), Expect = 6.2
Identities = 9/22 (40%), Positives = 13/22 (59%)
Query: 312 KCVKCDQSFSTVTSLSKHKRFC 333
KC C +SFS+ +L +H R
Sbjct: 2 KCPLCGKSFSSKDALKRHLRKH 23
>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger.
Length = 23
Score = 36.7 bits (85), Expect = 0.001
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 32 YACKYCGKVFPRSANLTRHLRTH 54
Y C CGKVF + L H+RTH
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 36.7 bits (85), Expect = 0.001
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 763 YACKYCGKVFPRSANLTRHLRTH 785
Y C CGKVF + L H+RTH
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 35.5 bits (82), Expect = 0.003
Identities = 11/23 (47%), Positives = 14/23 (60%)
Query: 225 FPCENCTKVFSDPSNLQRHIRTH 247
+ C C KVF S L+ H+RTH
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 33.6 bits (77), Expect = 0.018
Identities = 7/23 (30%), Positives = 14/23 (60%)
Query: 89 FKCPLCDRCFGQQTNLDRHLKKH 111
++CP C + F ++ L H++ H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 33.6 bits (77), Expect = 0.018
Identities = 7/23 (30%), Positives = 14/23 (60%)
Query: 820 FKCPLCDRCFGQQTNLDRHLKKH 842
++CP C + F ++ L H++ H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 32.8 bits (75), Expect = 0.032
Identities = 9/23 (39%), Positives = 11/23 (47%)
Query: 282 FRCEVCHKAYTQFSNLCRHKRMH 304
+RC C K + S L H R H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 31.7 bits (72), Expect = 0.079
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 254 HACQECGKTFATSSGLKQHTHIH 276
+ C ECGK F + S L++H H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 30.1 bits (68), Expect = 0.31
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 60 YKCKYCERSFSISSNLQRHVR 80
Y+C C + F S L+ H+R
Sbjct: 1 YRCPECGKVFKSKSALREHMR 21
Score = 30.1 bits (68), Expect = 0.31
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 791 YKCKYCERSFSISSNLQRHVR 811
Y+C C + F S L+ H+R
Sbjct: 1 YRCPECGKVFKSKSALREHMR 21
Score = 25.9 bits (57), Expect = 8.7
Identities = 6/20 (30%), Positives = 13/20 (65%)
Query: 312 KCVKCDQSFSTVTSLSKHKR 331
+C +C + F + ++L +H R
Sbjct: 2 RCPECGKVFKSKSALREHMR 21
>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
Length = 128
Score = 36.0 bits (83), Expect = 0.019
Identities = 21/55 (38%), Positives = 29/55 (52%), Gaps = 6/55 (10%)
Query: 59 PYKCKYCERSFSISSNLQRHVR-NIHNKEKPFKCPLCDRCF-GQQTNLDRHLKKH 111
PY C C FS S +L++H+R H+K CP+C + F + LD KKH
Sbjct: 73 PYVCPLCLMPFSSSVSLKQHIRYTEHSK----VCPVCGKEFRNTDSTLDHVCKKH 123
Score = 36.0 bits (83), Expect = 0.019
Identities = 21/55 (38%), Positives = 29/55 (52%), Gaps = 6/55 (10%)
Query: 790 PYKCKYCERSFSISSNLQRHVR-NIHNKEKPFKCPLCDRCF-GQQTNLDRHLKKH 842
PY C C FS S +L++H+R H+K CP+C + F + LD KKH
Sbjct: 73 PYVCPLCLMPFSSSVSLKQHIRYTEHSK----VCPVCGKEFRNTDSTLDHVCKKH 123
Score = 32.5 bits (74), Expect = 0.35
Identities = 16/53 (30%), Positives = 22/53 (41%), Gaps = 2/53 (3%)
Query: 32 YACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHN 84
Y C C F S +L +H+R E C C + F + + HV HN
Sbjct: 74 YVCPLCLMPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHVCKKHN 124
Score = 32.5 bits (74), Expect = 0.35
Identities = 16/53 (30%), Positives = 22/53 (41%), Gaps = 2/53 (3%)
Query: 763 YACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHN 815
Y C C F S +L +H+R E C C + F + + HV HN
Sbjct: 74 YVCPLCLMPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHVCKKHN 124
Score = 28.3 bits (63), Expect = 8.7
Identities = 16/49 (32%), Positives = 21/49 (42%), Gaps = 5/49 (10%)
Query: 225 FPCENCTKVFSDPSNLQRHIR-THHVGARQHACQECGKTFATSSGLKQH 272
+ C C FS +L++HIR T H C CGK F + H
Sbjct: 74 YVCPLCLMPFSSSVSLKQHIRYTEH----SKVCPVCGKEFRNTDSTLDH 118
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 37.1 bits (86), Expect = 0.053
Identities = 39/235 (16%), Positives = 62/235 (26%), Gaps = 26/235 (11%)
Query: 473 DSPPTAEEANSNQRPSPARPTISASNAVTYPDPREQLEVVTTDVKSDVNQKPRDKLTNFS 532
+PP A +R SP + S+ + ++D S + N
Sbjct: 196 STPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENEC 255
Query: 533 IDKLSGKMDSPKDEDASEESKPKKAKYCDQPLDLSIASKSSIDSDKQDANETFVKTPATS 592
+ P + ++ S +S S + +P S
Sbjct: 256 PLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRAS 315
Query: 593 PEIGKD-ESVINKSSPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTTPMA 651
ES + +S + S VS PSP +RSPSP P +
Sbjct: 316 SSSSSSRESSSSSTSSSSE---------SSRGAAVSPGPSP--SRSPSPSRPPPPADPSS 364
Query: 652 YPKPIHPMSLEAFFPLKNFGSRSYPPFPPPHPTDPHERLCPPSLPGFNPQRSFPF 706
K P P P P R ++ G +R
Sbjct: 365 PRKRPRP--------------SRAPSSPAASAGRPTRRRARAAVAGRARRRDATG 405
Score = 33.2 bits (76), Expect = 0.81
Identities = 26/155 (16%), Positives = 39/155 (25%), Gaps = 12/155 (7%)
Query: 546 EDASEESKPKKAKYCDQPLDLSIASKSSIDSDKQDANETFVKTPATSPEIGKDESVINKS 605
+ E A D DL S+ + SD + V A + + + +
Sbjct: 15 AEGGEFFPRPPATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPG 74
Query: 606 SPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTTPMAYPKPIHPMSLEAFF 665
T E SP + SP+PP + P P P S
Sbjct: 75 PGTEAPAN-ESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPP---PPTPPPAS----- 125
Query: 666 PLKNFGSRSYPPFPPPH---PTDPHERLCPPSLPG 697
P + P P + P
Sbjct: 126 PPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPA 160
>gnl|CDD|202452 pfam02892, zf-BED, BED zinc finger.
Length = 43
Score = 32.2 bits (74), Expect = 0.064
Identities = 11/23 (47%), Positives = 12/23 (52%), Gaps = 3/23 (13%)
Query: 34 CKYCGKVFPRSA---NLTRHLRT 53
CKYCGK+ NL RHL
Sbjct: 18 CKYCGKILSGGGGTSNLKRHLVR 40
Score = 32.2 bits (74), Expect = 0.064
Identities = 11/23 (47%), Positives = 12/23 (52%), Gaps = 3/23 (13%)
Query: 765 CKYCGKVFPRSA---NLTRHLRT 784
CKYCGK+ NL RHL
Sbjct: 18 CKYCGKILSGGGGTSNLKRHLVR 40
Score = 30.7 bits (70), Expect = 0.26
Identities = 10/26 (38%), Positives = 14/26 (53%), Gaps = 3/26 (11%)
Query: 61 KCKYCERSFSIS---SNLQRHVRNIH 83
+CKYC + S SNL+RH+
Sbjct: 17 RCKYCGKILSGGGGTSNLKRHLVRKC 42
Score = 30.7 bits (70), Expect = 0.26
Identities = 10/26 (38%), Positives = 14/26 (53%), Gaps = 3/26 (11%)
Query: 792 KCKYCERSFSIS---SNLQRHVRNIH 814
+CKYC + S SNL+RH+
Sbjct: 17 RCKYCGKILSGGGGTSNLKRHLVRKC 42
>gnl|CDD|214746 smart00614, ZnF_BED, BED zinc finger. DNA-binding domain in
chromatin-boundary-element-binding proteins and
transposases.
Length = 50
Score = 32.4 bits (74), Expect = 0.065
Identities = 14/28 (50%), Positives = 17/28 (60%), Gaps = 5/28 (17%)
Query: 61 KCKYCERSFSIS-----SNLQRHVRNIH 83
KCKYC + S S SNL+RH+R H
Sbjct: 20 KCKYCGKKLSRSSKGGTSNLRRHLRRKH 47
Score = 32.4 bits (74), Expect = 0.065
Identities = 14/28 (50%), Positives = 17/28 (60%), Gaps = 5/28 (17%)
Query: 792 KCKYCERSFSIS-----SNLQRHVRNIH 814
KCKYC + S S SNL+RH+R H
Sbjct: 20 KCKYCGKKLSRSSKGGTSNLRRHLRRKH 47
Score = 30.1 bits (68), Expect = 0.50
Identities = 14/25 (56%), Positives = 15/25 (60%), Gaps = 5/25 (20%)
Query: 34 CKYCGKVFPRSA-----NLTRHLRT 53
CKYCGK RS+ NL RHLR
Sbjct: 21 CKYCGKKLSRSSKGGTSNLRRHLRR 45
Score = 30.1 bits (68), Expect = 0.50
Identities = 14/25 (56%), Positives = 15/25 (60%), Gaps = 5/25 (20%)
Query: 765 CKYCGKVFPRSA-----NLTRHLRT 784
CKYCGK RS+ NL RHLR
Sbjct: 21 CKYCGKKLSRSSKGGTSNLRRHLRR 45
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 36.4 bits (84), Expect = 0.078
Identities = 38/224 (16%), Positives = 60/224 (26%), Gaps = 14/224 (6%)
Query: 478 AEEANSNQRPSPARPTISASNAVTYPDPREQLEVVTTDVKSDVNQKPRDKLTNFSIDKLS 537
E+ S + +P N + +E+ +V K K K K
Sbjct: 80 VEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK---PKEE 136
Query: 538 GKMDSPKDE--DASEESKPKKAKYCDQPLDLSIASKSSIDSDKQDANETFVKTPATSPEI 595
K P E E+ + ++ + +KS + K P +
Sbjct: 137 AKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQ 196
Query: 596 GKDESVINKSSPTPPDDPVE------ESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTTP 649
+ K P PD E + K T ++ S S + P
Sbjct: 197 RQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKP 256
Query: 650 MAYPKPIHPMSLEAFFPLKNFGSRSYPPFPPPHPTDPHERLCPP 693
P P + E K +R PP R PP
Sbjct: 257 DPSPSMASPETRE---SSKRTETRPRTSLRPPSARPASARPAPP 297
>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
UL32. The large phosphorylated protein (UL32-like) of
herpes viruses is the polypeptide most frequently
reactive in immuno-blotting analyses with antisera when
compared with other viral proteins.
Length = 777
Score = 36.4 bits (84), Expect = 0.086
Identities = 31/178 (17%), Positives = 46/178 (25%), Gaps = 10/178 (5%)
Query: 532 SIDKLSGKMDSPKDEDASEESKPKKAKYCDQPLDLSIASKSSIDSDKQDANETFVKTPAT 591
S DK+ + P S ++ D A+ S SD + ++
Sbjct: 460 SSDKVIDSVSPPSKRRVS----APASRLDDAKRPEVTATPESSGSDSEGGASGREDETSS 515
Query: 592 SPE----IGKD--ESVINKSSPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKA 645
E I + SP P P + S S + PS
Sbjct: 516 DAESVVSIKELRPRIGFINKSPPPKSPPKSRRTLIVALSLASPSTAGSPRPKPSLGKFVI 575
Query: 646 NTTPMAYPKPIHPMSLEAFFPLKNFGSRSYPPFPPPHPTDPHERLCPPSLPGFNPQRS 703
T P A+ + + T P L P +L G P S
Sbjct: 576 GTDPFAFANTVRLTDNMRGGNGVGSSVKPKGSASSKPLTGPGSDLKPATLNGKTPSSS 633
Score = 31.0 bits (70), Expect = 4.0
Identities = 30/178 (16%), Positives = 50/178 (28%), Gaps = 12/178 (6%)
Query: 527 KLTNFSIDKLSGKMDSPKDEDASEESKPKKAKYCDQPLDLSIASKSSIDSDKQDANETFV 586
+ +S D+ SG D A E ++ D +DL S +
Sbjct: 308 GSSLWSDDEESGDEDGDGSGFAPEPLIKTDSRSNDTLVDLGRGGGSLKLDSVDAPGTSSY 367
Query: 587 KT-------PATSPEIGKDESVINKSSPTPPDDPVEESKRSPTPQIVSRSPSPIKN-RSP 638
P + E+ + N P D E V + S I
Sbjct: 368 LFEPGLSPSPNSGKEMPGILTTENLDLPLASTDSTEMDPEDKRGGAVKINNSGILAWGLK 427
Query: 639 SPPVSKANTTPMAYPKPIHPMSLEAFFPLK----NFGSRSYPPFPPPHPTDPHERLCP 692
+P ++ + +A L+ PL+ + S P + P RL
Sbjct: 428 TPGLAVNDERSIAVSSDGITDVLDPPSPLRLHSSDKVIDSVSPPSKRRVSAPASRLDD 485
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 36.5 bits (84), Expect = 0.087
Identities = 65/407 (15%), Positives = 104/407 (25%), Gaps = 68/407 (16%)
Query: 331 RFCDSTTPGTTPSGQNSLSAMSHQTPGIPNPYLM---------YPR-------------- 367
RF + P G + P P ++ +PR
Sbjct: 2487 RFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASD 2546
Query: 368 -PGGTLPFYPPSLMPPYPGLFQPNQTPTPHHPFLSNPLLFNQSMKSNEELDRAKININLS 426
G P PP+ P P P P P P + +++ + + A+ +
Sbjct: 2547 DAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPS---EPAVTSRARRPDAPPQSARPRAPVD 2603
Query: 427 VEAYYRFQLQKNVQQSEEKRKNSPSPRKSDVPKTSGGKLFEFSESKDSPPTAEEANSNQR 486
R + + + P P S + P + + R
Sbjct: 2604 DRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPH---PPPTVPPPERPRDDPAPGR 2660
Query: 487 PSPARPTISASNAVTYPDPREQLEVVTTDVKSDVNQKPRDKLTNFSIDKLSGKMDSPKDE 546
S R A P Q+PR + ++ L+ D P
Sbjct: 2661 VSRPRRARRLGRAAQASSP---------------PQRPRRRAARPTVGSLTSLAD-PPPP 2704
Query: 547 DASEESKPKKAKYCDQPLDLSIASKSSIDSDKQDANETFVKTPAT-SPEIGKDESVINKS 605
+ E P S + + A + PA +P
Sbjct: 2705 PPTPEPAPHAL-----------VSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG 2753
Query: 606 SPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTTPMAYPKP----IHPMSL 661
P P +P P + P R P V+ + + + P P P ++
Sbjct: 2754 PARPARPPTTAGPPAPAPPAAPAAGPP--RRLTRPAVASLSESRESLPSPWDPADPPAAV 2811
Query: 662 EAFFPLKNFGSRSYPPFPPPHPTDPHERLCPPSLPGFNPQRSFPFLG 708
A + P PPP P P P P S P G
Sbjct: 2812 LAPAAALPPAASPAGPLPPPTSAQP----TAPPPPPGPPPPSLPLGG 2854
Score = 36.5 bits (84), Expect = 0.100
Identities = 63/371 (16%), Positives = 102/371 (27%), Gaps = 29/371 (7%)
Query: 334 DSTTPGTTPSGQNSLSAMSHQT--PGIPNPYLMYPRPGGTLPFYPPSLMPPYPGLFQPNQ 391
D + P P+ + S A++ + P P P PR P PP P P+
Sbjct: 2565 DRSVPPPRPAPRPSEPAVTSRARRPDAP-PQSARPRAPVDDRGDPRGPAPPSP--LPPDT 2621
Query: 392 TPTPHHPFLSNPLLFNQSMKSNEELDRAKININLSVEAYYRFQLQKNVQ-QSEEKRKNSP 450
P +P + A R + + + +SP
Sbjct: 2622 HAPDPPPPSPSPAAN--EPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP 2679
Query: 451 S--PRKSDVPKTSGGKLFEFSESKDSPPTAEEANSNQRPSPARPTISASNAVTYPDPREQ 508
PR+ T G L ++ PPT E A + P A+ P
Sbjct: 2680 PQRPRRRAARPTVG-SLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL--- 2735
Query: 509 LEVVTTDVKSDVNQKPRDKLTNFSIDKLSGKMDSPKDEDASEESKPKKAKYCDQPLDLSI 568
P + + P P A P L+
Sbjct: 2736 ----------PAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR 2785
Query: 569 ASKSSIDSDKQDANETFVKTPATSPEIGKDESVINKSSPT---PPDDPVEESKRSPTPQI 625
+ +S+ ++ + + + ++ +SP PP + + P P
Sbjct: 2786 PAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP 2845
Query: 626 VSRSPSPIKNRSPSPPVSKANTTPMAYPKPIHPMSLEAFFPLKNFGSRSYPPFPPPHPTD 685
S + +P V + + KP P + SRS F P D
Sbjct: 2846 PPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESF--ALPPD 2903
Query: 686 PHERLCPPSLP 696
ER P P
Sbjct: 2904 QPERPPQPQAP 2914
Score = 30.7 bits (69), Expect = 6.0
Identities = 20/92 (21%), Positives = 24/92 (26%), Gaps = 6/92 (6%)
Query: 607 PTPPDDPVEESKRSPTPQIVSRSPSP-IKNRSPSPPVSKANTTPMAYPKPIHPMS-LEAF 664
P P P + P P P R P + P P L A
Sbjct: 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967
Query: 665 FPLKNFGSRSYPPFPPPHPTDPHERLCPPSLP 696
P G + P F P P E + P
Sbjct: 2968 VP----GRVAVPRFRVPQPAPSREAPASSTPP 2995
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 35.4 bits (82), Expect = 0.13
Identities = 24/135 (17%), Positives = 43/135 (31%), Gaps = 8/135 (5%)
Query: 587 KTPATSPEIGKDE-----SVINKSSPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPP 641
+ P P + +++ + N + P +D S P S + P
Sbjct: 146 EDPNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPS 205
Query: 642 VSKANTTPMAYPKPIHPMSLEAFFPLKNFGSRSYPPFPPPHPTDPHERLCPPSLPGFNPQ 701
++ P P P S ++ + S PP P P P +++ P S P
Sbjct: 206 SPSDSSLP---PAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPP 262
Query: 702 RSFPFLGLMNGINNS 716
+ + GI
Sbjct: 263 SASATPAPIGGITLD 277
Score = 31.6 bits (72), Expect = 2.1
Identities = 24/111 (21%), Positives = 35/111 (31%), Gaps = 7/111 (6%)
Query: 545 DEDASEESKPKKAKYCDQPLDLSIASKSSIDSDKQDANETFVKTPATSPEIGKDESVINK 604
DEDA + + + D + AS S D + P+ S +
Sbjct: 158 DEDADVATTNSDNSFPGEDADPASASPS-------DPPSSSPGVPSFPSPPEDPSSPSDS 210
Query: 605 SSPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTTPMAYPKP 655
S P P ++ + SP P P PPV + A P P
Sbjct: 211 SLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTP 261
Score = 30.0 bits (68), Expect = 5.7
Identities = 14/66 (21%), Positives = 19/66 (28%), Gaps = 3/66 (4%)
Query: 338 PGTTPSGQNSLSAMSHQTPGIPNPYLMYPRPGGTLPFYPPSLMPPYPGLFQPNQTPTPHH 397
P ++ S P P+ + P P+ P PG P P
Sbjct: 195 PSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPP---SPESPTNPSPPPGPAAPPPPPVQQV 251
Query: 398 PFLSNP 403
P LS
Sbjct: 252 PPLSTA 257
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 35.4 bits (81), Expect = 0.19
Identities = 58/289 (20%), Positives = 102/289 (35%), Gaps = 50/289 (17%)
Query: 440 QQSEEKRKNSPSPRKSDVPKTSGGKLFEFSESKDSPPTAEEANSNQRPSPARPTISASNA 499
Q E+ RK +PSP V + S P +S I+A+
Sbjct: 182 QTREKVRKYTPSPLGRMVNSYAPSTFMTMPRSHGITPRTTVKDSE---------ITATYK 232
Query: 500 VTYPDPREQL--EVVTTDVKSDVNQKPRDKLTNFSIDKLSGKMDSPKD-EDASEESKPKK 556
+ +P ++ + T +K + P D L+ SP+ + + + P++
Sbjct: 233 MLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLT----SPRSVVEKNTLTTPRR 288
Query: 557 AKYCDQPLDLSIASKSSIDSDKQDANETFVKTPATSPEIGKDESVINKSSPTPPDDPVEE 616
+ + K+++ + + E TPATS E S++ SSP
Sbjct: 289 VESNSSTNHWGLVGKNNLTTPQGTVLE---HTPATS-EGQVTISIMTGSSPAETKASTAA 344
Query: 617 SK-RSPTPQ----IVSRSPS----PIKNRSPSP-----PVSKANTTPMAY------PKPI 656
K R+P + V + + KN S +P P +A T + P P
Sbjct: 345 WKIRNPLSRTSAPAVRIASATFRGLEKNPSTAPSTPATPRVRAVLTTQVHHCVVVKPAPA 404
Query: 657 HPMSLEAFFPLKNFGSRSYPPFPPPHPTDPHERLCPPSLPGFNPQRSFP 705
P + P + + +P P P P+ PP P +P+ +P
Sbjct: 405 VPTT-----PSPSLTTALFPEAPSPSPS-----ALPPGQPDLHPKAEYP 443
>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor. This family is secreted by
gram-negative Gammaproteobacteria such as Pseudomonas
syringae of tomato and the fire blight plant pathogen
Erwinia amylovora, amongst others. It is an essential
pathogenicity factor of approximately 198 kDa. Its
injection into the host-plant is dependent upon the
bacterial type III or Hrp secretion system. The family
is long and carries a number of predicted functional
regions, including an ERMS or endoplasmic reticulum
membrane retention signal at both the C- and the
N-termini, a leucine-zipper motif from residues 539-560,
and a nuclear localisation signal at 1358-1361. this
conserved AvrE-family of effectors is among the few that
are required for full virulence of many phytopathogenic
pseudomonads, erwinias and pantoeas.
Length = 1771
Score = 34.8 bits (80), Expect = 0.33
Identities = 49/313 (15%), Positives = 90/313 (28%), Gaps = 65/313 (20%)
Query: 436 QKNVQQSEE----------KRKNSPSPRKSDV--------PKTSGGKLFEFS----ESKD 473
K Q E ++ + +++ + K+F+ S + +
Sbjct: 10 TKTAVQPEATPSAGAPTGLQQSSESPTQRASHSLASEGKKNRKKMPKVFQKSSAPRQIQA 69
Query: 474 SPPTAEEANSNQRPSPARPTISASNAVTYPDPREQLEVVTTDVKSDVNQKPRDKLTNFSI 533
+PP A + S PT+ A+ D Q + + + + +
Sbjct: 70 APPQALNPTAAAPQSSRGPTLRELLALPEDDGETQAPESSPSARRLTRSEGVAR---HEM 126
Query: 534 DKLSGKMDSPKDEDAS--EESKPKKAKYCDQPLDLSIASKSSIDSDKQDANETFVKTPAT 591
+ L+G+ D D ++ K + P+ SK S K PAT
Sbjct: 127 EDLAGRPVVKPDADRQLRQDILNKSSSSRRPPV-----SKEEGTSSKM---------PAT 172
Query: 592 SPEIGKDESVINKSSPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTTPMA 651
+ ++ D+ +E + + Q S + P+ P A PM
Sbjct: 173 ALA---------SAALFKDDEIRQEVDAARSDQASQSRLSRSRGNPPAIPPDAAPRQPML 223
Query: 652 YPKPIHPMSLEAFFPLKNFGSRSYPPFPPPHPTDPHERLCPPSLPGFNPQRSFPFLGLMN 711
S F ++ P D + GFNP L
Sbjct: 224 ------TRSAGGRFEGEDENLERNLQPQSPITLDKKGK---LDFSGFNPP------ALNT 268
Query: 712 GINNSLGVNRTPY 724
+ +LG Y
Sbjct: 269 LLQQTLGKPGQTY 281
>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
transition [Transcription / Cell division and chromosome
partitioning].
Length = 423
Score = 33.5 bits (76), Expect = 0.49
Identities = 28/105 (26%), Positives = 39/105 (37%), Gaps = 26/105 (24%)
Query: 223 RKFPCENCTKVFSDPSNLQRHIRT-----HHVGARQHACQ--ECGKTFATSSGLKQHT-H 274
RK PC N + +R+I T + + C C K + +GLK H H
Sbjct: 314 RKLPCTNSSSNGKLAHGGERNIDTPSRMLKVKDGKPYKCPVEGCNKKYKNQNGLKYHMLH 373
Query: 275 IHSSV------------------KPFRCEVCHKAYTQFSNLCRHK 301
H + KP+RCEVC K Y + L H+
Sbjct: 374 GHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR 418
>gnl|CDD|187811 cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Multidomain protein with permuted HD nuclease domain,
palm domain and Zn-ribbon; signature gene for type III;
also known as Csm1 family.
Length = 650
Score = 33.5 bits (77), Expect = 0.63
Identities = 18/115 (15%), Positives = 34/115 (29%), Gaps = 23/115 (20%)
Query: 7 KPFQEILTSQNNAVNGSSSKTTKDRYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCE 66
K + + + S + +K CK CG+ P + E C CE
Sbjct: 370 KRYSYLELAA--LNPRDSKEGSKGTRECKVCGREEPIAE----------DEDEGLCPTCE 417
Query: 67 RSFSISSNLQRHVRNI--------HNKEKPFKCPLC---DRCFGQQTNLDRHLKK 110
R + + L + + K L + ++ L+ L +
Sbjct: 418 RLYELGKELLKDDSFLVTEKEDGGKKLPKFNGYYLLFAYEADEYEELALEDELVR 472
Score = 33.5 bits (77), Expect = 0.63
Identities = 18/115 (15%), Positives = 34/115 (29%), Gaps = 23/115 (20%)
Query: 738 KPFQEILTSQNNAVNGSSSKTTKDRYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCE 797
K + + + S + +K CK CG+ P + E C CE
Sbjct: 370 KRYSYLELAA--LNPRDSKEGSKGTRECKVCGREEPIAE----------DEDEGLCPTCE 417
Query: 798 RSFSISSNLQRHVRNI--------HNKEKPFKCPLC---DRCFGQQTNLDRHLKK 841
R + + L + + K L + ++ L+ L +
Sbjct: 418 RLYELGKELLKDDSFLVTEKEDGGKKLPKFNGYYLLFAYEADEYEELALEDELVR 472
>gnl|CDD|147567 pfam05454, DAG1, Dystroglycan (Dystrophin-associated glycoprotein
1). Dystroglycan is one of the dystrophin-associated
glycoproteins, which is encoded by a 5.5 kb transcript
in human. The protein product is cleaved into two
non-covalently associated subunits, [alpha] (N-terminal)
and [beta] (C-terminal). In skeletal muscle the
dystroglycan complex works as a transmembrane linkage
between the extracellular matrix and the cytoskeleton.
[alpha]-dystroglycan is extracellular and binds to
merosin ([alpha]-2 laminin) in the basement membrane,
while [beta]-dystroglycan is a transmembrane protein and
binds to dystrophin, which is a large rod-like
cytoskeletal protein, absent in Duchenne muscular
dystrophy patients. Dystrophin binds to intracellular
actin cables. In this way, the dystroglycan complex,
which links the extracellular matrix to the
intracellular actin cables, is thought to provide
structural integrity in muscle tissues. The dystroglycan
complex is also known to serve as an agrin receptor in
muscle, where it may regulate agrin-induced
acetylcholine receptor clustering at the neuromuscular
junction. There is also evidence which suggests the
function of dystroglycan as a part of the signal
transduction pathway because it is shown that Grb2, a
mediator of the Ras-related signal pathway, can interact
with the cytoplasmic domain of dystroglycan. In general,
aberrant expression of dystrophin-associated protein
complex underlies the pathogenesis of Duchenne muscular
dystrophy, Becker muscular dystrophy and severe
childhood autosomal recessive muscular dystrophy.
Interestingly, no genetic disease has been described for
either [alpha]- or [beta]-dystroglycan. Dystroglycan is
widely distributed in non-muscle tissues as well as in
muscle tissues. During epithelial morphogenesis of
kidney, the dystroglycan complex is shown to act as a
receptor for the basement membrane. Dystroglycan
expression in mouse brain and neural retina has also
been reported. However, the physiological role of
dystroglycan in non-muscle tissues has remained unclear.
Length = 290
Score = 33.0 bits (75), Expect = 0.74
Identities = 28/114 (24%), Positives = 44/114 (38%), Gaps = 9/114 (7%)
Query: 594 EIGKDESVINKSSPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTTPMAYP 653
+ + I K P D +++SK P S P +K P P + +
Sbjct: 179 TLEDQATFIKKGVPIIFADELDDSK----PPPSSSMPLILKEEKPPLPPPEYPNQNVPET 234
Query: 654 KPIHPMSLEAFFPLKNFGSRSYPPFPPPHPTDPHE--RLCPPSLPGFNPQRSFP 705
P++ L + PL++ + P PPP T P E P ++ P RS P
Sbjct: 235 TPLNQDLLGEYTPLRDEDPNAPPYQPPPPFTTPMEGKGSRPKNMT---PYRSPP 285
>gnl|CDD|204841 pfam12171, zf-C2H2_jaz, Zinc-finger double-stranded RNA-binding.
This domain family is found in archaea and eukaryotes,
and is approximately 30 amino acids in length. The
mammalian members of this group occur multiple times
along the protein, joined by flexible linkers, and are
referred to as JAZ - dsRNA-binding ZF protein -
zinc-fingers. The JAZ proteins are expressed in all
tissues tested and localise in the nucleus, particularly
the nucleolus. JAZ preferentially binds to
double-stranded (ds) RNA or RNA/DNA hybrids rather than
DNA. In addition to binding double-stranded RNA, these
zinc-fingers are required for nucleolar localisation.
Length = 27
Score = 29.1 bits (66), Expect = 0.74
Identities = 7/20 (35%), Positives = 10/20 (50%)
Query: 253 QHACQECGKTFATSSGLKQH 272
Q C C K F + + L+ H
Sbjct: 1 QFYCVACDKYFKSENALENH 20
Score = 26.8 bits (60), Expect = 4.2
Identities = 7/23 (30%), Positives = 13/23 (56%)
Query: 224 KFPCENCTKVFSDPSNLQRHIRT 246
+F C C K F + L+ H+++
Sbjct: 1 QFYCVACDKYFKSENALENHLKS 23
Score = 26.4 bits (59), Expect = 6.7
Identities = 11/26 (42%), Positives = 14/26 (53%), Gaps = 2/26 (7%)
Query: 88 PFKCPLCDRCFGQQTNLDRHL--KKH 111
F C CD+ F + L+ HL KKH
Sbjct: 1 QFYCVACDKYFKSENALENHLKSKKH 26
Score = 26.4 bits (59), Expect = 6.7
Identities = 11/26 (42%), Positives = 14/26 (53%), Gaps = 2/26 (7%)
Query: 819 PFKCPLCDRCFGQQTNLDRHL--KKH 842
F C CD+ F + L+ HL KKH
Sbjct: 1 QFYCVACDKYFKSENALENHLKSKKH 26
Score = 26.0 bits (58), Expect = 8.5
Identities = 7/23 (30%), Positives = 11/23 (47%)
Query: 31 RYACKYCGKVFPRSANLTRHLRT 53
++ C C K F L HL++
Sbjct: 1 QFYCVACDKYFKSENALENHLKS 23
Score = 26.0 bits (58), Expect = 8.5
Identities = 7/23 (30%), Positives = 11/23 (47%)
Query: 762 RYACKYCGKVFPRSANLTRHLRT 784
++ C C K F L HL++
Sbjct: 1 QFYCVACDKYFKSENALENHLKS 23
>gnl|CDD|220445 pfam09858, DUF2085, Predicted membrane protein (DUF2085). This
domain, found in various hypothetical prokaryotic
proteins, has no known function.
Length = 93
Score = 30.7 bits (70), Expect = 0.82
Identities = 8/24 (33%), Positives = 10/24 (41%), Gaps = 2/24 (8%)
Query: 76 QRHVRNIHNKEKPFKCPLCDRCFG 99
Q R+ F P+C RC G
Sbjct: 4 QLPERSFFFGGHQF--PVCARCTG 25
Score = 30.7 bits (70), Expect = 0.82
Identities = 8/24 (33%), Positives = 10/24 (41%), Gaps = 2/24 (8%)
Query: 807 QRHVRNIHNKEKPFKCPLCDRCFG 830
Q R+ F P+C RC G
Sbjct: 4 QLPERSFFFGGHQF--PVCARCTG 25
>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
only].
Length = 467
Score = 32.7 bits (74), Expect = 1.1
Identities = 20/57 (35%), Positives = 28/57 (49%)
Query: 32 YACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKP 88
+C C F R +LTRH+R+HTGE+P +C Y S S L+ + P
Sbjct: 34 DSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNP 90
Score = 32.7 bits (74), Expect = 1.1
Identities = 20/57 (35%), Positives = 28/57 (49%)
Query: 763 YACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKP 819
+C C F R +LTRH+R+HTGE+P +C Y S S L+ + P
Sbjct: 34 DSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNP 90
Score = 31.2 bits (70), Expect = 3.3
Identities = 37/275 (13%), Positives = 73/275 (26%), Gaps = 2/275 (0%)
Query: 532 SIDKLSGKMDSPKDEDASEESKPKKAKYCDQPLDLSIASKSSIDSDKQDANETFVKTPAT 591
S + +S S + + +N P
Sbjct: 95 SKSLPLSNSKASSSSLSSSSSNSNDNNLLSSHSLPPSSRDPQLPDLLSISNLRNNPLPGN 154
Query: 592 SPEIGKDESVINKSSPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTTPMA 651
+ + P P + ++ + + I S + I + S + P+S + + P +
Sbjct: 155 NSSSVNTPQSNSLHPPLPANSLSKDPSSNLSLLISSNVSTSIPSSSENSPLSSSYSIPSS 214
Query: 652 YPKPIHPMSLEAFFPLKNFGSRSYPPFPPPHPTDPHERLCPPSLPGFNPQRSFPFLGLMN 711
S + N + + +
Sbjct: 215 SSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRSSLPTASSQSSS 274
Query: 712 GINNSLGVNRTPYEHLLRSQISPYANKPFQEILTSQNNAVNGSSSKTTKDRYACKYCGKV 771
+ + + Q + ++ ++ +G S K Y+ CGK+
Sbjct: 275 PNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYS--LCGKL 332
Query: 772 FPRSANLTRHLRTHTGEQPYKCKYCERSFSISSNL 806
F R+ L RH+ HT P K K S S L
Sbjct: 333 FSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 32.5 bits (74), Expect = 1.2
Identities = 21/112 (18%), Positives = 35/112 (31%), Gaps = 19/112 (16%)
Query: 532 SIDKLSGKMDSPKDEDASEE---------------SKPKKAKYCDQPLDLSIASKSSIDS 576
+ K K D+ KD +E PKK K +P KSS
Sbjct: 13 APTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSS--- 69
Query: 577 DKQDANETFVKTPATSPE-IGKDESVINKSSPTPPDDPVEESKRSPTPQIVS 627
+++ + P + + S P + EE + TP ++
Sbjct: 70 KPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEEPEETPDPIA 121
>gnl|CDD|165133 PHA02768, PHA02768, hypothetical protein; Provisional.
Length = 55
Score = 29.2 bits (65), Expect = 1.3
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 2/36 (5%)
Query: 32 YACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCER 67
Y C CG+++ + ++ HLR H K C+R
Sbjct: 6 YECPICGEIYIKRKSMITHLRKHNTN--LKLSNCKR 39
Score = 29.2 bits (65), Expect = 1.3
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 2/36 (5%)
Query: 763 YACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCER 798
Y C CG+++ + ++ HLR H K C+R
Sbjct: 6 YECPICGEIYIKRKSMITHLRKHNTN--LKLSNCKR 39
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 32.4 bits (74), Expect = 1.3
Identities = 15/86 (17%), Positives = 20/86 (23%), Gaps = 4/86 (4%)
Query: 578 KQDANETFVK-TPATSPEIGKDESVINKSSPTPPDDPVEESKRSP---TPQIVSRSPSPI 633
KQ F + A P S S +S P P + P+ +
Sbjct: 376 KQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAV 435
Query: 634 KNRSPSPPVSKANTTPMAYPKPIHPM 659
PS K I
Sbjct: 436 PVNPPSTAPQAVRPAQFKEEKKIPVS 461
>gnl|CDD|224119 COG1198, PriA, Primosomal protein N' (replication factor Y) -
superfamily II helicase [DNA replication, recombination,
and repair].
Length = 730
Score = 32.3 bits (74), Expect = 1.5
Identities = 15/65 (23%), Positives = 21/65 (32%), Gaps = 23/65 (35%)
Query: 33 ACKYCGKVF--PR-SANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPF 89
C+ CG + P + LT H T +C YC + + P
Sbjct: 437 LCRDCGYIAECPNCDSPLTLHKATG----QLRCHYCG----------------YQEPIPQ 476
Query: 90 KCPLC 94
CP C
Sbjct: 477 SCPEC 481
Score = 32.3 bits (74), Expect = 1.5
Identities = 15/65 (23%), Positives = 21/65 (32%), Gaps = 23/65 (35%)
Query: 764 ACKYCGKVF--PR-SANLTRHLRTHTGEQPYKCKYCERSFSISSNLQRHVRNIHNKEKPF 820
C+ CG + P + LT H T +C YC + + P
Sbjct: 437 LCRDCGYIAECPNCDSPLTLHKATG----QLRCHYCG----------------YQEPIPQ 476
Query: 821 KCPLC 825
CP C
Sbjct: 477 SCPEC 481
>gnl|CDD|206083 pfam13912, zf-C2H2_6, C2H2-type zinc finger.
Length = 27
Score = 28.0 bits (63), Expect = 1.5
Identities = 8/23 (34%), Positives = 11/23 (47%)
Query: 32 YACKYCGKVFPRSANLTRHLRTH 54
+ C CGK F L H ++H
Sbjct: 2 HTCGVCGKTFSSLQALGGHKKSH 24
Score = 28.0 bits (63), Expect = 1.5
Identities = 8/23 (34%), Positives = 11/23 (47%)
Query: 763 YACKYCGKVFPRSANLTRHLRTH 785
+ C CGK F L H ++H
Sbjct: 2 HTCGVCGKTFSSLQALGGHKKSH 24
Score = 27.2 bits (61), Expect = 2.8
Identities = 8/23 (34%), Positives = 11/23 (47%)
Query: 282 FRCEVCHKAYTQFSNLCRHKRMH 304
C VC K ++ L HK+ H
Sbjct: 2 HTCGVCGKTFSSLQALGGHKKSH 24
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 32.0 bits (73), Expect = 1.6
Identities = 11/89 (12%), Positives = 20/89 (22%), Gaps = 5/89 (5%)
Query: 605 SSPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKANT-----TPMAYPKPIHPM 659
+ P + P + +P+ +P PV+ P A P +
Sbjct: 389 PAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALA 448
Query: 660 SLEAFFPLKNFGSRSYPPFPPPHPTDPHE 688
+ P P
Sbjct: 449 PAPPAQAAPETVAIPVRVAPEPAVASAAP 477
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 32.4 bits (73), Expect = 1.7
Identities = 50/257 (19%), Positives = 73/257 (28%), Gaps = 46/257 (17%)
Query: 467 EFSESKDSPPTAEEANSNQRPSPAR---------PTISASNAVTYPDPREQLEVVTTDVK 517
E S+ D PP EA+ +P + + P E E
Sbjct: 502 EDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKP 561
Query: 518 SDVNQKPRDKLTNFSIDKLSGKMDSPKDEDASEESKPKKAKYCDQPLDLSIASKSSIDSD 577
+ K+ S K D +D E KPK+ + +P + S
Sbjct: 562 GPAKEHKPSKIPTLS-KKPEFPKDPKHPKDPEEPKKPKRPRSAQRP--------TRPKSP 612
Query: 578 KQDANETFVKTPATSPEIGKDESVINKSSPTPPDDPVEESKRSPTPQIVSRSPSPIKNRS 637
K K+P + ES + P PP P +R P+I+ P +S
Sbjct: 613 KLPELLDIPKSPK------RPESPKSPKRPPPPQRPSS-PERPEGPKIIKSPKPP---KS 662
Query: 638 PSPPV------------------SKANTTPMAYPKPIHPMSLEAFFPLKNFGSRSYPPFP 679
P PP SK T + + + E + P P
Sbjct: 663 PKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLP 722
Query: 680 PPHPTDPHERLCPPSLP 696
P P D P P
Sbjct: 723 PKLPRDEEFPFEPIGDP 739
>gnl|CDD|220661 pfam10263, SprT-like, SprT-like family. This family represents a
domain found in eukaryotes and prokaryotes. The domain
contains a characteristic motif of the zinc
metallopeptidases. This family includes the bacterial
SprT protein.
Length = 153
Score = 30.8 bits (70), Expect = 1.7
Identities = 13/47 (27%), Positives = 19/47 (40%), Gaps = 8/47 (17%)
Query: 23 SSSKTTKDRYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSF 69
S + Y C CG+++PR R +R H Y+C C
Sbjct: 111 VVSGRKRYIYRCGSCGQLYPR----KRRIRRHK----YRCGRCGGKL 149
Score = 30.8 bits (70), Expect = 1.7
Identities = 13/47 (27%), Positives = 19/47 (40%), Gaps = 8/47 (17%)
Query: 754 SSSKTTKDRYACKYCGKVFPRSANLTRHLRTHTGEQPYKCKYCERSF 800
S + Y C CG+++PR R +R H Y+C C
Sbjct: 111 VVSGRKRYIYRCGSCGQLYPR----KRRIRRHK----YRCGRCGGKL 149
>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
Length = 306
Score = 31.6 bits (71), Expect = 1.9
Identities = 20/81 (24%), Positives = 30/81 (37%), Gaps = 3/81 (3%)
Query: 620 SPTPQIVSRSPSPIKNRSPSPPVSKANTTPMAYPKPIHPMSLEAFFPLKNFGSRSYPPFP 679
+P P V + +P++ P P A T P P M+ ++ ++ P P
Sbjct: 26 TPQPNPVIQPQAPVQPGQPGAPQQLAIPTQQPQPVPTSAMTPHV---VQQAPAQPAPAAP 82
Query: 680 PPHPTDPHERLCPPSLPGFNP 700
P E L P P F P
Sbjct: 83 PAAGAALPEALEVPPPPAFTP 103
>gnl|CDD|227499 COG5171, YRB1, Ran GTPase-activating protein (Ran-binding protein)
[Intracellular trafficking and secretion].
Length = 211
Score = 31.1 bits (70), Expect = 2.2
Identities = 24/105 (22%), Positives = 35/105 (33%), Gaps = 10/105 (9%)
Query: 436 QKNVQQSEEKRKNSPSPRKSDVPKTS----GGKLFEFSESKDSPPTAEEA--NSNQRPSP 489
+K Q EK +N R DV GK + P E A ++ P
Sbjct: 5 RKKKQAKIEKEENEQKERSLDVVSKGDAFGDGKAGGEEKKVQQSPFLENAVPEGDEGKGP 64
Query: 490 ARPTISASNAVTYPDPREQLEVVTTDVKSDVNQKPRDKLTNFSID 534
P I V +++ + T + V K R KL F +
Sbjct: 65 ESPNIHFEPVVE----LQRVHLKTNEEDETVLFKARAKLFRFDEE 105
>gnl|CDD|150045 pfam09237, GAGA, GAGA factor. Members of this family bind to a
5'-GAGAG-3' DNA consensus binding site, and contain a
Cys2-His2 zinc finger core as well as an N-terminal
extension containing two highly basic regions. The zinc
finger core binds in the DNA major groove and recognises
the first three GAG bases of the consensus in a manner
similar to that seen in other classical zinc finger-DNA
complexes. The second basic region forms a helix that
interacts in the major groove recognising the last G of
the consensus, while the first basic region wraps around
the DNA in the minor groove and recognises the A in the
fourth position of the consensus sequence.
Length = 54
Score = 28.4 bits (63), Expect = 2.5
Identities = 11/31 (35%), Positives = 17/31 (54%)
Query: 79 VRNIHNKEKPFKCPLCDRCFGQQTNLDRHLK 109
+++ E+P CP+C Q NL RHL+
Sbjct: 15 IKSKSQSEQPATCPICQAVIRQSRNLRRHLE 45
Score = 28.4 bits (63), Expect = 2.5
Identities = 11/31 (35%), Positives = 17/31 (54%)
Query: 810 VRNIHNKEKPFKCPLCDRCFGQQTNLDRHLK 840
+++ E+P CP+C Q NL RHL+
Sbjct: 15 IKSKSQSEQPATCPICQAVIRQSRNLRRHLE 45
>gnl|CDD|223037 PHA03301, PHA03301, envelope glycoprotein L; Provisional.
Length = 226
Score = 30.6 bits (69), Expect = 2.8
Identities = 25/110 (22%), Positives = 42/110 (38%), Gaps = 2/110 (1%)
Query: 546 EDASEESKPKKAKYCDQPLDLSIASKSSIDSDKQDANETFVKTPATSPEIGKDESVINKS 605
ED S P A L + ++ S Q A+ T V + E + + +
Sbjct: 107 EDLSHALFPANALETTTRRALYKEVRLALASRSQAASHTPVPPGCVNFEYSRTRRCVGRR 166
Query: 606 SPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTTPMAYPKP 655
+P P ++P ++R + SP P +P P + + TP KP
Sbjct: 167 TPGPANEP--RTRRPLSAPDDEASPQPKSLATPPPVAAPSRRTPRPRRKP 214
>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
Length = 416
Score = 30.7 bits (69), Expect = 3.6
Identities = 21/81 (25%), Positives = 26/81 (32%)
Query: 606 SPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTTPMAYPKPIHPMSLEAFF 665
SP PP D E+K P P P + + PKP P
Sbjct: 274 SPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGGEPKPGPPRPAPDAD 333
Query: 666 PLKNFGSRSYPPFPPPHPTDP 686
+ + S FPPP P P
Sbjct: 334 RPEGWPSLEAITFPPPTPATP 354
>gnl|CDD|236692 PRK10431, PRK10431, N-acetylmuramoyl-l-alanine amidase II;
Provisional.
Length = 445
Score = 31.0 bits (70), Expect = 3.6
Identities = 19/53 (35%), Positives = 26/53 (49%), Gaps = 4/53 (7%)
Query: 587 KTPATSPEIGKDESV---INKSSPTPPDDPVEESKRSPTPQIVSRSPS-PIKN 635
KT A + G + +V IN P PP P +KR TP +V+ S P +N
Sbjct: 107 KTEAVKRQNGSNYTVVFTINADVPPPPPPPPVVAKRVETPAVVAPRVSEPARN 159
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 30.8 bits (69), Expect = 4.0
Identities = 20/80 (25%), Positives = 29/80 (36%), Gaps = 5/80 (6%)
Query: 570 SKSSIDSDKQDANETFVKTPATSPEIGKDESVINKSSPTPPDDPVEESKRSPTPQIVSRS 629
S +S ++ + + V P+TSP G S P P V S P + S S
Sbjct: 576 SAASSNTTFSSDSPSTVVAPSTSPPAG-----HLGSPPATPSKIVSPSTSPPASHLGSPS 630
Query: 630 PSPIKNRSPSPPVSKANTTP 649
+P S S +P
Sbjct: 631 TTPSSPESSIKVASTETASP 650
>gnl|CDD|234471 TIGR04104, cxxc_20_cxxc, cxxc_20_cxxc protein. This small,
uncommon, poorly conserved protein is found primarily
in the Firmicutes. It features are pair of CxxC motifs
separated by about 20 amino acids, followed by a highly
hydrophobic region of about 45 amino acids. It has no
conserved gene neighborhood, and its function is
unknown.
Length = 94
Score = 28.5 bits (64), Expect = 5.1
Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 4/34 (11%)
Query: 61 KCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLC 94
CK C FS L +++ + +P KCP C
Sbjct: 2 ICKNCNEKFSYKELL----KSLFSLYRPIKCPNC 31
Score = 28.5 bits (64), Expect = 5.1
Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 4/34 (11%)
Query: 792 KCKYCERSFSISSNLQRHVRNIHNKEKPFKCPLC 825
CK C FS L +++ + +P KCP C
Sbjct: 2 ICKNCNEKFSYKELL----KSLFSLYRPIKCPNC 31
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 30.7 bits (69), Expect = 5.2
Identities = 25/193 (12%), Positives = 62/193 (32%), Gaps = 20/193 (10%)
Query: 479 EEANSNQRPSPARPTISASNAVTYPDPREQLEVVTTDVKSDVNQKPRDKLTNFSIDKLSG 538
E + P+ ++S + P Q V+ + ++ ++
Sbjct: 408 ERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEE-------EEEEEEQ 460
Query: 539 KMDSPKDEDASEESKPKKAKYCDQPLDLSIASKSSIDSDKQDANETFVKTPATSPEIGKD 598
+ + + ED EE + + ++ ++ S + ++DA + S
Sbjct: 461 ESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQ 520
Query: 599 ESVINKSSPTPPDDPVEESKRSPTPQIVS----------RSPSPIKNRSPSPPVSKANTT 648
+ S P+ P EE + + S SP+ + + V+ T
Sbjct: 521 ---QPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSSHTELEGVATPVET 577
Query: 649 PMAYPKPIHPMSL 661
++ + + P +
Sbjct: 578 KISSSRKLPPPPV 590
>gnl|CDD|205121 pfam12874, zf-met, Zinc-finger of C2H2 type. This is a zinc-finger
domain with the CxxCx(12)Hx(6)H motif, found in multiple
copies in a wide range of proteins from plants to
metazoans. Some member proteins, particularly those from
plants, are annotated as being RNA-binding.
Length = 25
Score = 26.3 bits (59), Expect = 5.6
Identities = 11/25 (44%), Positives = 14/25 (56%), Gaps = 2/25 (8%)
Query: 89 FKCPLCDRCFGQQTNLDRHL--KKH 111
F C LC+ F ++ L HL KKH
Sbjct: 1 FYCELCNVTFTSESQLKSHLRGKKH 25
Score = 26.3 bits (59), Expect = 5.6
Identities = 11/25 (44%), Positives = 14/25 (56%), Gaps = 2/25 (8%)
Query: 820 FKCPLCDRCFGQQTNLDRHL--KKH 842
F C LC+ F ++ L HL KKH
Sbjct: 1 FYCELCNVTFTSESQLKSHLRGKKH 25
>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
Provisional.
Length = 418
Score = 30.4 bits (69), Expect = 5.6
Identities = 22/101 (21%), Positives = 33/101 (32%), Gaps = 19/101 (18%)
Query: 573 SIDSDKQ--DAN--------ETFVKTPATSPEIGKDESVINKSSPTPPDDPVE------- 615
I++DK D + F + T E+G S I + PP
Sbjct: 80 IIETDKVSVDIRAPASGVITKIFAEEGDTV-EVGAPLSEI-DTGGAPPAAAPAAAAAAKA 137
Query: 616 ESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTTPMAYPKPI 656
E P+ + +P P P+PP + P KP
Sbjct: 138 EKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPP 178
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 30.2 bits (68), Expect = 5.7
Identities = 23/91 (25%), Positives = 27/91 (29%), Gaps = 22/91 (24%)
Query: 607 PTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTTPMAYPKPIHPMSLEAFFP 666
P PP P + +P P P P P+P N P P
Sbjct: 41 PAPPPPPPSTAAAAPAPAAPPPPPPP--AAPPAPQPDDPNAAPPPPPAD----------- 87
Query: 667 LKNFGSRSYPPFPPPHPTDPHERLCPPSLPG 697
P PPP P DP+ P PG
Sbjct: 88 ---------PNAPPPPPVDPNAPPPPAPEPG 109
>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
Provisional.
Length = 418
Score = 30.2 bits (68), Expect = 5.9
Identities = 27/182 (14%), Positives = 48/182 (26%), Gaps = 22/182 (12%)
Query: 472 KDSPPTAEEANSNQRPSPARPTISASNAVTYPDPREQLEVVTTDVKSDVNQKPRDKLTNF 531
+A NS+ S + T D+ D + D
Sbjct: 255 GQLTSSAFMLNSSLPTSAPKRRSRR------AGAMRAAAGETADLADDDGSEHSDPEPLP 308
Query: 532 SIDKLS-GKMDSPKDEDASEESKPKKAKYCDQPLDLSIASKSSIDSDKQDANETFVKTPA 590
+ + K +A +E + S SS S ++ P
Sbjct: 309 ASLPPPPVRRPRVKHPEAGKEEPDGARNAEAKEPAQPATSTSSKGSSSAQNKDSGSTGPG 368
Query: 591 TSPEIGKDESVINKSSPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTTPM 650
+S ++ + +D S + PSP +P PP + P+
Sbjct: 369 SS----------LAAASSFLEDDDFGSPPLDLTTSLRHMPSPSVTSAPEPP-----SIPL 413
Query: 651 AY 652
Y
Sbjct: 414 TY 415
>gnl|CDD|222450 pfam13909, zf-H2C2_5, C2H2-type zinc-finger domain.
Length = 24
Score = 26.4 bits (59), Expect = 6.0
Identities = 9/24 (37%), Positives = 14/24 (58%), Gaps = 1/24 (4%)
Query: 225 FPCENCTKVFSDPSNLQRHIRTHH 248
+ C +C S P L+RH++ HH
Sbjct: 1 YKCSHCP-YSSTPKKLERHLKLHH 23
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 30.2 bits (68), Expect = 6.5
Identities = 17/67 (25%), Positives = 24/67 (35%), Gaps = 3/67 (4%)
Query: 630 PSPIKNRSPSPPVSKANTTPMAYPKPIHPMSLEAFFPLKNFGSRSYPPFPPPHPTDPHER 689
P P K + +P + P PK ++ P + + PP PP P P
Sbjct: 366 PQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIP---PKEPVRETATPPPVPPRPVAPPVP 422
Query: 690 LCPPSLP 696
P S P
Sbjct: 423 HTPESAP 429
Score = 29.4 bits (66), Expect = 9.9
Identities = 24/119 (20%), Positives = 35/119 (29%), Gaps = 31/119 (26%)
Query: 563 PLDLSIASKSSIDSDKQDANETFVKTPATSPEIGKDESVINKSSPTPPDDPVEESKRSPT 622
P ++ S + P+T P+ ++ PP +PV E+ P
Sbjct: 364 PAPQPAKPTAAAPSPVRPT-----PAPSTRPKAAA-------AANIPPKEPVRETATPPP 411
Query: 623 PQIVSRSPSPIKNRSPSPPVSKANTTPMAYPKPIHPMSLEAFFPLKNFGSRSYPPFPPP 681
P P+ P P S T A P P + PP PP
Sbjct: 412 V-----PPRPVAPPVPHTPESAPKLTRAAIPVDEKP--------------KYTPPAPPK 451
>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator. Shisa is a
transcription factor-type molecule that physically
interacts with immature forms of the Wnt receptor
Frizzled and the FGF receptor within the endoplasmic
reticulum to inhibit their post-translational maturation
and trafficking to the cell surface.
Length = 177
Score = 29.4 bits (66), Expect = 6.5
Identities = 15/72 (20%), Positives = 24/72 (33%), Gaps = 12/72 (16%)
Query: 619 RSPTPQIVSRSPSPIKNRSPSPPVSKANTTPMAYPKPIHPMSLEAFFPLKNFGSRSYPPF 678
R P + + + ++ P S A + P + HPM + PP+
Sbjct: 108 RPQRPVMTRATSTTVQTTPLPQPPSTAPSYPGPQYQGYHPMPPQ--------PGMPAPPY 159
Query: 679 ----PPPHPTDP 686
PPP P
Sbjct: 160 SLQYPPPGLLQP 171
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 30.1 bits (67), Expect = 6.5
Identities = 30/186 (16%), Positives = 58/186 (31%), Gaps = 24/186 (12%)
Query: 471 SKDSPPTAEEANSNQRPSPARPTISASNAVTYPDPREQLEVVTTDVKSDVNQKPRDKLTN 530
+ S P A + P T + + + T P+ T+ + P +T
Sbjct: 451 TTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTS--ATPNATSPTPAVTT 508
Query: 531 FSIDKLSGKMDSPKDEDASEESKPKKAKYCDQPLDLSIASKSSIDSDKQDANETFVKTPA 590
+ + + S S P + I ++ S A
Sbjct: 509 PNATSPTTQKTSDTPNATS-------------PTPIVIGVTTTATSPPTGTTSV---PNA 552
Query: 591 TSPEIGKDESVINKSSPTPPDDP------VEESKRSPTPQIVSRSPSPIKNRSPSPPVSK 644
TSP++ ++ V N ++P P V + S+ P + +P +
Sbjct: 553 TSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNS 612
Query: 645 ANTTPM 650
+TTP+
Sbjct: 613 TSTTPL 618
>gnl|CDD|202808 pfam03920, TLE_N, Groucho/TLE N-terminal Q-rich domain. The
N-terminal domain of the Grouch/TLE co-repressor
proteins are involved in oligomerisation.
Length = 134
Score = 29.0 bits (65), Expect = 6.6
Identities = 13/30 (43%), Positives = 17/30 (56%), Gaps = 3/30 (10%)
Query: 214 NFHQAQVHGRKFPCENCTKVFSDPSNLQRH 243
F QAQ H K CE K+ S+ + +QRH
Sbjct: 33 QFLQAQYHSLKLECE---KLASEKTEMQRH 59
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 29.9 bits (67), Expect = 6.8
Identities = 23/145 (15%), Positives = 43/145 (29%), Gaps = 19/145 (13%)
Query: 522 QKPRDKLTNFSIDKLSGKMDSPKDEDASEE---SKPKKAKYCDQPLDLSIASKSSIDSDK 578
K ++ + + ++E EE SK K + + S D
Sbjct: 302 DKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGD- 360
Query: 579 QDANETFVKTPATSPEIGKDESVINKSSPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSP 638
D++++ + + + K ++PV+ + SP S P
Sbjct: 361 -DSDDSDIDGEDSVSLVTA-----KKQKEPKKEEPVDSNPSSPGN---SGPA------RP 405
Query: 639 SPPVSKANTTPMAYPKPIHPMSLEA 663
SP A P S+ A
Sbjct: 406 SPESKDKGKRKAANEVSKSPASVPA 430
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 29.0 bits (65), Expect = 7.1
Identities = 18/61 (29%), Positives = 22/61 (36%), Gaps = 8/61 (13%)
Query: 343 SGQNSLSAMSHQTPGIPNPYLMYPRPGGTLPFYPPSLMPPYPGLFQPNQTPTPHHPFLSN 402
GQ+S++ H P +P P PF P L PP P Q P P L
Sbjct: 77 PGQHSMTPTQHHQPNLPQPA--------QQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPP 128
Query: 403 P 403
Sbjct: 129 Q 129
>gnl|CDD|222579 pfam14179, YppG, YppG-like protein. The YppG-like protein family
includes the B. subtilis YppG protein, which is
functionally uncharacterized. This family of proteins is
found in bacteria. Proteins in this family are typically
between 115 and 181 amino acids in length. There are two
completely conserved residues (F and G) that may be
functionally important.
Length = 110
Score = 28.1 bits (63), Expect = 7.1
Identities = 13/49 (26%), Positives = 14/49 (28%)
Query: 353 HQTPGIPNPYLMYPRPGGTLPFYPPSLMPPYPGLFQPNQTPTPHHPFLS 401
Q P Y P + P PYP Q P FLS
Sbjct: 22 QQPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPKQSPQQQQPPQFSSFLS 70
>gnl|CDD|223041 PHA03321, PHA03321, tegument protein VP11/12; Provisional.
Length = 694
Score = 29.9 bits (67), Expect = 7.8
Identities = 25/100 (25%), Positives = 31/100 (31%), Gaps = 12/100 (12%)
Query: 605 SSPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVS--KANTTPMAYPKPIHPMSLE 662
SS PP P P P P + R S P +A P + L
Sbjct: 426 SSRQPPGAPAPRRDNDPPP--------PPRARPGSTPACARRARAQRARDAGPEYVDPLG 477
Query: 663 AFFPLKNFGSRSYPPFPPPHPTDPHERL--CPPSLPGFNP 700
A L + P P P + R+ PP LP N
Sbjct: 478 ALRRLPAGAAPPPEPAAAPSPATYYTRMGGGPPRLPPRNR 517
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 29.6 bits (66), Expect = 8.7
Identities = 30/163 (18%), Positives = 53/163 (32%), Gaps = 8/163 (4%)
Query: 492 PTISASNAVTYPDPREQLE-VVTTDVKSDVNQKPRDKLTNFSIDKLS-----GKMDSPKD 545
P + + + T P E + V + T + +LS G +
Sbjct: 168 PKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTS 227
Query: 546 EDASEESKPKKAKYCDQPLDLSIASKSSIDSDKQDANETFVKTPATSPEIGKDESVINKS 605
+ +++P ++ S +S D + + T P S S
Sbjct: 228 SNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPH--S 285
Query: 606 SPTPPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTT 648
+ TPP + PTP+ + + S SPP +AN T
Sbjct: 286 TATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPT 328
>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
Length = 417
Score = 29.8 bits (67), Expect = 8.7
Identities = 28/135 (20%), Positives = 43/135 (31%), Gaps = 28/135 (20%)
Query: 422 NINLSVEAYYRFQLQKNVQQSEEKRKNSPSP-RKSDVPKTSGGKLFEFSESKDSPPTAEE 480
+S E RF + +++ K K +P +K S + P AE
Sbjct: 71 RCRVSDEDINRFLTKTFEDKTQVKVKVVSAPTKKKKAMPKSVVRA-PKPLENPVPAQAES 129
Query: 481 ANSNQRPSPARPTISASNAVTYPDPREQLEVVTTDVKSDVNQKPRDKLTNFSIDKLSGKM 540
+ S PS V + + P LT D+L +
Sbjct: 130 SGSKPVPSI-------------------------PVSTPEVKAPAPALTPSQKDRLETLL 164
Query: 541 DSPKDEDASEESKPK 555
SPKD+ + KPK
Sbjct: 165 -SPKDKISLNSEKPK 178
Score = 29.8 bits (67), Expect = 8.9
Identities = 11/49 (22%), Positives = 19/49 (38%), Gaps = 2/49 (4%)
Query: 616 ESKRSPTPQIVSRSPSPIKNR-SPSPPVSKANTTPMAYPKPIHPMSLEA 663
K+ P+ V R+P P++N S + P + P + A
Sbjct: 102 TKKKKAMPKSVVRAPKPLENPVPAQAESSGSKPVP-SIPVSTPEVKAPA 149
>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 153
Score = 28.7 bits (64), Expect = 9.0
Identities = 11/47 (23%), Positives = 21/47 (44%), Gaps = 1/47 (2%)
Query: 609 PPDDPVEESKRSPTPQIVSRSPSPIKNRSPSPPVSKANTTPMAYPKP 655
+ V +P P + S +P+P+ +P+P + A + P P
Sbjct: 47 AVQEQVPTPAPAPAPAVPS-APTPVAPAAPAPAPASAGENVVTAPMP 92
>gnl|CDD|233037 TIGR00592, pol2, DNA polymerase (pol2). All proteins in this
superfamily for which functions are known are DNA
polymerases.This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 1172
Score = 30.0 bits (67), Expect = 9.1
Identities = 20/118 (16%), Positives = 33/118 (27%), Gaps = 6/118 (5%)
Query: 518 SDVNQKPRDKLTNFSIDKLSGKMDSPKDEDASEESKPKKAKYCDQPLDLSIASKSSIDSD 577
+ +K +D+ + D G + E E+ +K L K S +D
Sbjct: 88 ACAPKKKKDRKKSLGKD---GLLGDILQELNKTETAQRKITP---RLVSVPKLKFSSPAD 141
Query: 578 KQDANETFVKTPATSPEIGKDESVINKSSPTPPDDPVEESKRSPTPQIVSRSPSPIKN 635
N+ PA + K V + PV + V P
Sbjct: 142 VPAINDFSNHHPAVVDIVKKAIPVSTRYLLEKILIPVPLKRAEFAGGDVQMEGDPELK 199
>gnl|CDD|235030 PRK02315, PRK02315, adaptor protein; Provisional.
Length = 233
Score = 29.0 bits (66), Expect = 9.8
Identities = 12/50 (24%), Positives = 19/50 (38%), Gaps = 2/50 (4%)
Query: 505 PREQ-LEVVTTDVKSDVNQKPRDKLTNFSIDKLSGKMDSPKDEDASEESK 553
P E+ LEV T K D+ D + S + + + E+S
Sbjct: 66 PNEKGLEVFVTKSKLDLPLNFEDLIDLPSDEDIE-DLFPEDFFKLLEQSL 114
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.313 0.130 0.392
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 47,472,627
Number of extensions: 4484386
Number of successful extensions: 4181
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3985
Number of HSP's successfully gapped: 253
Length of query: 971
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 865
Effective length of database: 6,236,078
Effective search space: 5394207470
Effective search space used: 5394207470
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 64 (28.6 bits)