RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12290
(530 letters)
>gnl|CDD|238046 cd00094, HX, Hemopexin-like repeats.; Hemopexin is a heme-binding
protein that transports heme to the liver.
Hemopexin-like repeats occur in vitronectin and some
matrix metalloproteinases family (matrixins). The HX
repeats of some matrixins bind tissue inhibitor of
metalloproteinases (TIMPs). This CD contains 4 instances
of the repeat.
Length = 194
Score = 192 bits (490), Expect = 1e-58
Identities = 81/198 (40%), Positives = 120/198 (60%), Gaps = 6/198 (3%)
Query: 232 PDTCDT-SYDAIAVIRTEVFVFKDRWMWRIGDKGVLPGYPVLIHQFWSELPHNLTHIDAV 290
PD CD S+DA+ +R E++ FK R+ WR+ G PG P LI FW LP + +DA
Sbjct: 1 PDACDPLSFDAVTTLRGELYFFKGRYFWRL-SPGKPPGSPFLISSFWPSLP---SPVDAA 56
Query: 291 YQRID-NHIAFFIGRQYFLFEGIRLLPGYPRPLTALGLPASLERIDAAMVWGYNSKTYLF 349
++R D I FF G +Y+++ G L PGYP+P++ LG P ++++IDAA+ W N KTY F
Sbjct: 57 FERPDTGKIYFFKGDKYWVYTGKNLEPGYPKPISDLGFPPTVKQIDAALRWPDNGKTYFF 116
Query: 350 SGTRYWKLDDETGRSEPDYPRNMTENWRGVGVDIDDAFQWKDGATYFFKGKGFWKFKDVS 409
G +YW+ D++T + +P YP+ + ++ GV +D AF+W DG YFFKG +W+F S
Sbjct: 117 KGDKYWRYDEKTQKMDPGYPKLIETDFPGVPDKVDAAFRWLDGYYYFFKGDQYWRFDPRS 176
Query: 410 MRVEKEKPSPSAQFWMKC 427
V P + W+ C
Sbjct: 177 KEVRVGYPLKISSDWLGC 194
>gnl|CDD|214524 smart00120, HX, Hemopexin-like repeats. Hemopexin is a
heme-binding protein that transports heme to the liver.
Hemopexin-like repeats occur in vitronectin and some
matrix metalloproteinases family (matrixins). The HX
repeats of some matrixins bind tissue inhibitor of
metalloproteinases (TIMPs).
Length = 45
Score = 45.7 bits (109), Expect = 6e-07
Identities = 17/47 (36%), Positives = 27/47 (57%), Gaps = 3/47 (6%)
Query: 334 IDAAMVWGYNSKTYLFSGTRYWKLDDETGRSEPDYPRNMTENWRGVG 380
IDAA + KTY F G +YW+ D + R +P YP+ ++ + G+
Sbjct: 1 IDAAFELR-DGKTYFFKGDKYWRFDPK--RVDPGYPKLISSFFPGLP 44
Score = 42.2 bits (100), Expect = 1e-05
Identities = 17/44 (38%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 287 IDAVYQRIDNHIAFFIGRQYFLFEGIRLLPGYPRPLTAL--GLP 328
IDA ++ D FF G +Y+ F+ R+ PGYP+ +++ GLP
Sbjct: 1 IDAAFELRDGKTYFFKGDKYWRFDPKRVDPGYPKLISSFFPGLP 44
Score = 41.5 bits (98), Expect = 2e-05
Identities = 18/44 (40%), Positives = 22/44 (50%), Gaps = 1/44 (2%)
Query: 240 DAIAVIR-TEVFVFKDRWMWRIGDKGVLPGYPVLIHQFWSELPH 282
DA +R + + FK WR K V PGYP LI F+ LP
Sbjct: 2 DAAFELRDGKTYFFKGDKYWRFDPKRVDPGYPKLISSFFPGLPC 45
Score = 41.1 bits (97), Expect = 3e-05
Identities = 18/42 (42%), Positives = 25/42 (59%), Gaps = 2/42 (4%)
Query: 383 IDDAFQWKDGATYFFKGKGFWKFKDVSMRVEKEKPSPSAQFW 424
ID AF+ +DG TYFFKG +W+F RV+ P + F+
Sbjct: 1 IDAAFELRDGKTYFFKGDKYWRFDPK--RVDPGYPKLISSFF 40
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 49.6 bits (119), Expect = 2e-06
Identities = 24/107 (22%), Positives = 38/107 (35%), Gaps = 5/107 (4%)
Query: 117 VPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPE----TPQDSPRREPSLRQPNQ-YPE 171
V R H + EH SP+ + P + + P P ++ +P Q P+
Sbjct: 68 VHRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQ 127
Query: 172 TPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPA 218
+P P P Q P +P+ PA P + +P P
Sbjct: 128 PVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPV 174
Score = 40.8 bits (96), Expect = 0.001
Identities = 22/102 (21%), Positives = 31/102 (30%), Gaps = 8/102 (7%)
Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHP------TPESPRREPPPRHPNQYPETPQDSPR 158
P R+P P P + P +P P P Q P +PQ +P
Sbjct: 94 PPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPA 153
Query: 159 REPSLRQPNQYPET--PHEPRRDSPPRHPNQFPETPRKPPAR 198
+P P + P EP P + KP +
Sbjct: 154 PQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDKPKRK 195
Score = 37.7 bits (88), Expect = 0.012
Identities = 23/114 (20%), Positives = 34/114 (29%), Gaps = 20/114 (17%)
Query: 94 EEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETP 153
E ++ P++ P + +P + P PE+ P Q P
Sbjct: 80 EHEAARPSPQHQYQPPYASAQPR-QPVQQP----------PEAQVPPQHAPRPAQPAPQP 128
Query: 154 QDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEP 207
P +P QP Q P +P +P P P AEP
Sbjct: 129 VQQPAYQPQPEQPLQQPVSPQ----VAPAPQPVH-----SAPQPAQQAFQPAEP 173
>gnl|CDD|215676 pfam00045, Hemopexin, Hemopexin. Hemopexin is a heme-binding
protein that transports heme to the liver.
Hemopexin-like repeats occur in vitronectin and some
matrix metallopeptidases family (matrixins). The HX
repeats of some matrixins bind tissue inhibitor of
metallopeptidases (TIMPs).
Length = 45
Score = 44.1 bits (105), Expect = 3e-06
Identities = 18/44 (40%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 287 IDAVYQRIDNHIAFFIGRQYFLFEGIRLLPGYPRPLTAL--GLP 328
IDA ++ D FF GR+Y+ F+ R+ PGYP+ ++ GLP
Sbjct: 1 IDAAFEDRDGKTYFFKGRKYWRFDPQRVEPGYPKLISDFWPGLP 44
Score = 43.3 bits (103), Expect = 5e-06
Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 3/47 (6%)
Query: 334 IDAAMVWGYNSKTYLFSGTRYWKLDDETGRSEPDYPRNMTENWRGVG 380
IDAA KTY F G +YW+ D + R EP YP+ +++ W G+
Sbjct: 1 IDAAFEDRDG-KTYFFKGRKYWRFDPQ--RVEPGYPKLISDFWPGLP 44
Score = 43.3 bits (103), Expect = 5e-06
Identities = 19/44 (43%), Positives = 22/44 (50%), Gaps = 1/44 (2%)
Query: 240 DAIAVIRT-EVFVFKDRWMWRIGDKGVLPGYPVLIHQFWSELPH 282
DA R + + FK R WR + V PGYP LI FW LP
Sbjct: 2 DAAFEDRDGKTYFFKGRKYWRFDPQRVEPGYPKLISDFWPGLPC 45
Score = 41.0 bits (97), Expect = 3e-05
Identities = 20/42 (47%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 383 IDDAFQWKDGATYFFKGKGFWKFKDVSMRVEKEKPSPSAQFW 424
ID AF+ +DG TYFFKG+ +W+F RVE P + FW
Sbjct: 1 IDAAFEDRDGKTYFFKGRKYWRFDPQ--RVEPGYPKLISDFW 40
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 49.9 bits (119), Expect = 3e-06
Identities = 31/143 (21%), Positives = 38/143 (26%), Gaps = 10/143 (6%)
Query: 98 WAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPE---TPQ 154
W + P P P P P +P P P P+ P
Sbjct: 2802 WDPADPPAAVLAPAAALPPAASPAGPLP--PPTSAQPTAPPPPPGPPPPSLPLGGSVAPG 2859
Query: 155 DSPRREPSLRQPNQYPETPHEPR-----RDSPPRHPNQFPETPRKPPARYPDIARAEPRT 209
RR P R P P P P R + R F P +P A P+
Sbjct: 2860 GDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQP 2919
Query: 210 TPTTTTPPATRRGESNPGPSDKP 232
P PP + P P
Sbjct: 2920 QPQPPPPPQPQPPPPPPPRPQPP 2942
Score = 44.9 bits (106), Expect = 1e-04
Identities = 32/137 (23%), Positives = 40/137 (29%), Gaps = 16/137 (11%)
Query: 102 PRYSPSPDLTTRRPEV-----PRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDS 156
R + RP V P T PE P + P P Q P
Sbjct: 2868 SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPP-QPQPQPPPP 2926
Query: 157 PRREPSLRQPNQYPETPHEPRRD----SPPRHPNQFPETPRKPPARYPDIAR-----AEP 207
P+ +P P + P+ P P D P P P R A
Sbjct: 2927 PQPQPPPPPPPR-PQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985
Query: 208 RTTPTTTTPPATRRGES 224
R P ++TPP T S
Sbjct: 2986 REAPASSTPPLTGHSLS 3002
Score = 43.8 bits (103), Expect = 2e-04
Identities = 32/142 (22%), Positives = 40/142 (28%), Gaps = 11/142 (7%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPR-REPPPRHPNQYPETPQDS---- 156
PR R P P P P H P+ P SP EP P P P +
Sbjct: 2598 PRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPA 2657
Query: 157 ------PRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTT 210
PRR L + Q P PRR + PP +
Sbjct: 2658 PGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVS 2717
Query: 211 PTTTTPPATRRGESNPGPSDKP 232
T P +++P P
Sbjct: 2718 ATPLPPGPAAARQASPALPAAP 2739
Score = 43.0 bits (101), Expect = 4e-04
Identities = 29/140 (20%), Positives = 36/140 (25%), Gaps = 26/140 (18%)
Query: 105 SPSPDLTTR---RPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
+P D+ R R +P P P R R + P D P R P
Sbjct: 2857 APGGDVRRRPPSRSPAAKPAAPA------RPPVRRLARPAVSRSTESFAL-PPDQPERPP 2909
Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTP----- 216
Q P P + PP P P P +P P+ P
Sbjct: 2910 ----QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG 2965
Query: 217 -------PATRRGESNPGPS 229
R P PS
Sbjct: 2966 ALVPGRVAVPRFRVPQPAPS 2985
Score = 41.1 bits (96), Expect = 0.002
Identities = 30/133 (22%), Positives = 36/133 (27%), Gaps = 18/133 (13%)
Query: 105 SPSPDLTTRRPE-VPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSL 163
+PD + P PRP P P PP + P + PR
Sbjct: 2561 PAAPDRSVPPPRPAPRPSEPAVTSRARRPDA------PPQSARPRAPVDDRGDPRGPA-- 2612
Query: 164 RQPNQYPETPHEPRRDSPPRHPNQF----PETPRKPPARYPDIARAEPRTTPTTTTPPAT 219
P+ P H P P P P PP P A R + P
Sbjct: 2613 -PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSR----PRRA 2667
Query: 220 RRGESNPGPSDKP 232
RR S P
Sbjct: 2668 RRLGRAAQASSPP 2680
Score = 40.3 bits (94), Expect = 0.003
Identities = 35/137 (25%), Positives = 43/137 (31%), Gaps = 8/137 (5%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPR---HPNQYPETPQDSPR 158
+ SP RR + P P +P EP P P P + +
Sbjct: 2673 AAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP--EPAPHALVSATPLPPGPAAARQ 2730
Query: 159 REPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAE--PRTTPTTTTP 216
P+L P P P P P + P T PPA P A A PR
Sbjct: 2731 ASPALPAAPAPPAVPAGPATPGGPARPAR-PPTTAGPPAPAPPAAPAAGPPRRLTRPAVA 2789
Query: 217 PATRRGESNPGPSDKPD 233
+ ES P P D D
Sbjct: 2790 SLSESRESLPSPWDPAD 2806
Score = 38.8 bits (90), Expect = 0.009
Identities = 26/134 (19%), Positives = 32/134 (23%), Gaps = 3/134 (2%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTH-HHPEHHPTPESPRREPPPR--HPNQYPETPQDSPR 158
P P P+ P R P SP + P R P T P
Sbjct: 2643 PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
Query: 159 REPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPA 218
P +P + P P P P P A P PP
Sbjct: 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT 2762
Query: 219 TRRGESNPGPSDKP 232
T + P+
Sbjct: 2763 TAGPPAPAPPAAPA 2776
Score = 37.6 bits (87), Expect = 0.021
Identities = 26/110 (23%), Positives = 30/110 (27%), Gaps = 4/110 (3%)
Query: 123 PTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPP 182
P P P R PPPR + E S R P + P P + R P
Sbjct: 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDD-RGDPR 2609
Query: 183 RHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKP 232
P +P P PD P P P D P
Sbjct: 2610 GPA---PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDP 2656
Score = 36.1 bits (83), Expect = 0.052
Identities = 35/144 (24%), Positives = 39/144 (27%), Gaps = 16/144 (11%)
Query: 77 GESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPES 136
G P R + R + R D T VP P PTP
Sbjct: 371 GRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASV------PTPAPTPVP 424
Query: 137 PRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHP------NQFPE 190
PPP P E D P RQP P D R + PE
Sbjct: 425 ASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPE 484
Query: 191 TPRKPPA----RYPDIARAEPRTT 210
P A R+PD A R
Sbjct: 485 PPGADLAELLGRHPDTAGTVVRLA 508
Score = 35.7 bits (82), Expect = 0.081
Identities = 34/145 (23%), Positives = 39/145 (26%), Gaps = 17/145 (11%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
P +P P P P P P +P P P P P R
Sbjct: 2704 PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGP-ATPGGPARPARPPT 2762
Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQ-------------FPETPRKPPARYPDIARAEPR 208
+ P P P P P R P P PPA A A P
Sbjct: 2763 TAGPPA--PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
Query: 209 T-TPTTTTPPATRRGESNPGPSDKP 232
+P PP T + P P P
Sbjct: 2821 AASPAGPLPPPTSAQPTAPPPPPGP 2845
Score = 35.7 bits (82), Expect = 0.082
Identities = 29/136 (21%), Positives = 34/136 (25%), Gaps = 15/136 (11%)
Query: 105 SPSPDLTTRRPE-VPRPRHPTHHHP---------EHHPTPESPRREPPPRHPNQYPETPQ 154
SP P P P+ R PT P HHP R P R
Sbjct: 338 SPLPRPRQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHP----KRASLPTRKRRSARHAAT 393
Query: 155 DSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTT 214
R Q P +P P P P P + + P
Sbjct: 394 PFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPS-AEPGSDDGPAPPPER 452
Query: 215 TPPATRRGESNPGPSD 230
PPA + P D
Sbjct: 453 QPPAPATEPAPDDPDD 468
Score = 33.4 bits (76), Expect = 0.36
Identities = 26/112 (23%), Positives = 30/112 (26%), Gaps = 6/112 (5%)
Query: 116 EVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHE 175
R PT P P +P PP R + +S PS P P
Sbjct: 2754 PARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA 2813
Query: 176 PRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPG 227
P + P P P PP P P P G PG
Sbjct: 2814 P---AAALPPAASPAGPLPPP---TSAQPTAPPPPPGPPPPSLPLGGSVAPG 2859
Score = 31.8 bits (72), Expect = 1.2
Identities = 32/130 (24%), Positives = 35/130 (26%), Gaps = 12/130 (9%)
Query: 105 SPSPDLTTRRPEVPR-PRHP-THHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPS 162
SP+ P VP P P P PT P PP P P PRR
Sbjct: 2732 SPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP------PRRLTR 2785
Query: 163 LRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRG 222
+ P P P P P A P A P T+ P
Sbjct: 2786 PAVASLSESRESLP----SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841
Query: 223 ESNPGPSDKP 232
P P P
Sbjct: 2842 PPGPPPPSLP 2851
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 46.7 bits (111), Expect = 2e-05
Identities = 31/140 (22%), Positives = 38/140 (27%), Gaps = 6/140 (4%)
Query: 99 AVIPRYSPSPDLTTRRPEVPRP-RHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSP 157
A P P T PE+PRP R P + P + P P P
Sbjct: 92 AGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQ 151
Query: 158 RREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPP 217
R EP + R PPR P P + R EP
Sbjct: 152 RPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQERD-----REPYDAGRPEYDQ 206
Query: 218 ATRRGESNPGPSDKPDTCDT 237
R + D+P T
Sbjct: 207 RRRDYDHPRPDWDRPRRDRT 226
Score = 39.0 bits (91), Expect = 0.005
Identities = 32/149 (21%), Positives = 37/149 (24%), Gaps = 21/149 (14%)
Query: 99 AVIPRYSPSPDLTTRRPEVP----RPRHPTHHHPEHHPTPESPRREPPPRHPNQYP--ET 152
P L T RP P RP + R PPR P P
Sbjct: 128 DRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYA 187
Query: 153 PQDSPRREPSLRQPNQYP-------------ETPHEPRRDSPPRHPNQFPETPRKPPARY 199
P+ REP +Y + P R D P P P
Sbjct: 188 PEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPE 247
Query: 200 PDIARAEPRTTPTTTTPPATRRGESNPGP 228
D A P A + PGP
Sbjct: 248 RDDAPVVPIRPSAPGPLAA--QPAPAPGP 274
Score = 37.9 bits (88), Expect = 0.015
Identities = 22/100 (22%), Positives = 27/100 (27%), Gaps = 8/100 (8%)
Query: 133 TPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETP 192
P + PPP H + E P R P Y + R PR Q P
Sbjct: 89 DPSAGEPAPPPPHARRTSEPELPRPGRRPY----EGYGGPRADDRPPGLPRQD-QLPTAR 143
Query: 193 RKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKP 232
PA R EP P ++ P
Sbjct: 144 PAYPAYQQ---RPEPGAWPRAADDYGWQQQRLGFPPRAPY 180
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 44.3 bits (104), Expect = 1e-04
Identities = 33/122 (27%), Positives = 52/122 (42%), Gaps = 25/122 (20%)
Query: 112 TRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPN--QY 169
+++PE P+ P+H PE P++ PR Q P P+ SP+ L P +
Sbjct: 576 SKKPEFPK-------DPKHPKDPEEPKKPKRPRSA-QRPTRPK-SPKLPELLDIPKSPKR 626
Query: 170 PETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPS 229
PE+P P+R PP Q P +P +P E + PP + + +P
Sbjct: 627 PESPKSPKRPPPP----QRPSSPERP----------EGPKIIKSPKPPKSPKPPFDPKFK 672
Query: 230 DK 231
+K
Sbjct: 673 EK 674
Score = 38.5 bits (89), Expect = 0.010
Identities = 41/180 (22%), Positives = 59/180 (32%), Gaps = 26/180 (14%)
Query: 85 RTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVP-RPRHPTH-HHPEHHPTPESP----R 138
+T + D E +P TT RP P PR P P E P
Sbjct: 690 KTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEF 749
Query: 139 REPPPRHPNQYPETPQDSPR--------REPSLRQPNQYPETPHEPRRDSPPRHPNQFPE 190
PP + ETP D+P +E + P+ + R DSP H +
Sbjct: 750 FTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMK-RPDSPSEHED---- 804
Query: 191 TPRKPPARYPDIARAEPRTT--PTTTTPPATRRGESNPGPSDKPDTC--DTSYDAIAVIR 246
KPP +P + + R +TT + G S K S+D + +
Sbjct: 805 ---KPPGDHPSLPKKRHRLDGLALSTTDLESDAGRIAKDASGKIVKLKRSKSFDDLTTVE 861
Score = 38.1 bits (88), Expect = 0.013
Identities = 22/125 (17%), Positives = 43/125 (34%), Gaps = 1/125 (0%)
Query: 108 PDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPN 167
P+ + P+ P + E + P+ P + + P +E +
Sbjct: 514 PEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIP 573
Query: 168 QYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPG 227
+ P P+ P+ P + P+ P++P + P+ P + +R ES
Sbjct: 574 TLSKKPEFPKDPKHPKDPEE-PKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKS 632
Query: 228 PSDKP 232
P P
Sbjct: 633 PKRPP 637
Score = 28.9 bits (64), Expect = 8.4
Identities = 27/138 (19%), Positives = 47/138 (34%), Gaps = 8/138 (5%)
Query: 101 IPRYSPSPDLTT--RRPEVP-RPRHPTH-HHPEHHPTPESPRREPPPRHPN---QYPETP 153
IP+ P+ +RP P RP P P+ +P+ P+ PP P ++ +
Sbjct: 620 IPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYDDY 679
Query: 154 QDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTT 213
D+ + + E+ +++ P P TPR P + P P
Sbjct: 680 LDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPR-DEEFPFEPIGD 738
Query: 214 TTPPATRRGESNPGPSDK 231
E P ++
Sbjct: 739 PDAEQPDDIEFFTPPEEE 756
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 43.9 bits (103), Expect = 2e-04
Identities = 34/130 (26%), Positives = 42/130 (32%), Gaps = 9/130 (6%)
Query: 102 PRYSPSPDLTT--RRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRR 159
P+ +P T + +P PT + +PPPR P TP P
Sbjct: 655 PQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAP-----TPMRPPAA 709
Query: 160 EPSLRQPNQYPETPHEPRRDSPPR-HPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPA 218
P Q P +P R P +PPA P AR P P PPA
Sbjct: 710 PPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR-PPAAAPGRARPPA 768
Query: 219 TRRGESNPGP 228
G P P
Sbjct: 769 AAPGAPTPQP 778
Score = 43.5 bits (102), Expect = 2e-04
Identities = 36/148 (24%), Positives = 43/148 (29%), Gaps = 8/148 (5%)
Query: 95 EKSWAVIPRYSPSPDLTTRRPEVPRPRHP-THHHPEHHPTPESPRREPP-PRHPNQYPET 152
+ +W I P T +P P T P PTP P PP
Sbjct: 663 KPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATG 722
Query: 153 PQDSPRREPSLRQPNQYPETPHEPRRDSPPR-HPNQFPETPRKPPARYPDIARAEPRTTP 211
P P +P P +P R P +PPA P P P
Sbjct: 723 RARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAP----GAPTPQP 778
Query: 212 TTTTPPATR-RGESNPGPSDKPDTCDTS 238
PPA + R P P P TS
Sbjct: 779 PPQAPPAPQQRPRGAPTPQPPPQAGPTS 806
Score = 40.1 bits (93), Expect = 0.003
Identities = 38/149 (25%), Positives = 46/149 (30%), Gaps = 20/149 (13%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTP-ESPRREPPPRHPN--QYPETP-QDSP 157
PR P P P P P + PTP + P+ E P P Q P Q SP
Sbjct: 621 PRQWPMP--LRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSP 678
Query: 158 RREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIAR------------- 204
++ P T P R P P P + PA AR
Sbjct: 679 TGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPA 738
Query: 205 -AEPRTTPTTTTPPATRRGESNPGPSDKP 232
A R P P R + PG + P
Sbjct: 739 AAPGRARPPAAAPGRARPPAAAPGRARPP 767
Score = 32.3 bits (73), Expect = 0.78
Identities = 19/70 (27%), Positives = 24/70 (34%), Gaps = 3/70 (4%)
Query: 118 PRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPR 177
P P P P + P P+ P Q P PQ PR P+ P P+
Sbjct: 751 PGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPT---PQPPPQAGPTSM 807
Query: 178 RDSPPRHPNQ 187
+ P P Q
Sbjct: 808 QLMPRAAPGQ 817
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 40.4 bits (94), Expect = 0.002
Identities = 21/86 (24%), Positives = 35/86 (40%), Gaps = 3/86 (3%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
PR +P+P L RH P PT ++P ++P ++P Q PQ P+++P
Sbjct: 177 PREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNPAQ---QPQQQPQQQP 233
Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQ 187
P+ P + + H
Sbjct: 234 VQPAQQPTPQNPAQQPPQTEQGHKRS 259
Score = 34.3 bits (78), Expect = 0.17
Identities = 15/87 (17%), Positives = 26/87 (29%), Gaps = 10/87 (11%)
Query: 134 PESPRREPPP----------RHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPR 183
P PR P P H +++ P P + +QP ++
Sbjct: 174 PNPPREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQP 233
Query: 184 HPNQFPETPRKPPARYPDIARAEPRTT 210
TP+ P + P + R+
Sbjct: 234 VQPAQQPTPQNPAQQPPQTEQGHKRSR 260
Score = 34.3 bits (78), Expect = 0.18
Identities = 19/71 (26%), Positives = 30/71 (42%), Gaps = 8/71 (11%)
Query: 103 RYSPSPDLTTRRPEVPRP-RHPTHHHPEHHPTPESPRREPPPRH----PNQYPETPQDSP 157
R++P P T++P V P + PT +P P + ++ P N + PQ
Sbjct: 198 RHAPKP---TQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQ 254
Query: 158 RREPSLRQPNQ 168
+ S Q NQ
Sbjct: 255 GHKRSREQGNQ 265
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 40.8 bits (95), Expect = 0.002
Identities = 23/131 (17%), Positives = 41/131 (31%), Gaps = 6/131 (4%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYP------ETPQD 155
P +P+P+ ++ + +P + + P+ P P P Q P E P
Sbjct: 371 PVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQ 430
Query: 156 SPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTT 215
P P+ QP E ++ + E + PA + +
Sbjct: 431 QPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVV 490
Query: 216 PPATRRGESNP 226
P E+ P
Sbjct: 491 EPEPVVEETKP 501
Score = 40.5 bits (94), Expect = 0.002
Identities = 34/153 (22%), Positives = 52/153 (33%), Gaps = 9/153 (5%)
Query: 104 YSPSPDLTTRRPEVP------RPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSP 157
+SP L P P P + P++ P+ Q P+ P
Sbjct: 729 FSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQ 788
Query: 158 RREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPD-IARAEPRTTPTTTTP 216
+ +QP +P++ P+ Q P+ P P +Y P+ T P
Sbjct: 789 PQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHP 848
Query: 217 PATRRGESNPGPSDKPDTCDTSYDAIAVIRTEV 249
R G+S P KP T S D + +EV
Sbjct: 849 LLMRNGDSR--PLHKPTTPLPSLDLLTPPPSEV 879
Score = 35.1 bits (80), Expect = 0.10
Identities = 21/100 (21%), Positives = 34/100 (34%), Gaps = 1/100 (1%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQ-YPETPQDSPRRE 160
P Y+P+ + ++P P + P PE P + Q PQ + + E
Sbjct: 406 PYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTE 465
Query: 161 PSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYP 200
+ +QP +P+ P PAR P
Sbjct: 466 QTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPP 505
Score = 34.3 bits (78), Expect = 0.20
Identities = 21/121 (17%), Positives = 38/121 (31%), Gaps = 12/121 (9%)
Query: 80 QPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPE-VPRPRHPTHHHPEHHPTPESPR 138
QP + + E+ + P Y+P+P+ ++P P P P + +S
Sbjct: 399 QPVQPQQPYYAPAAEQPAQQ--PYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTF 456
Query: 139 REPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPAR 198
Q ++P Q Y + ++ P P +PP
Sbjct: 457 APQSTYQTEQ--------TYQQP-AAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLY 507
Query: 199 Y 199
Y
Sbjct: 508 Y 508
Score = 32.4 bits (73), Expect = 0.84
Identities = 22/121 (18%), Positives = 34/121 (28%), Gaps = 7/121 (5%)
Query: 106 PSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQ 165
S D+ +P V P E P P P Q + Q + + L+Q
Sbjct: 347 ASVDVPPAQPTVAWQPVPGPQTGE-------PVIAPAPEGYPQQSQYAQPAVQYNEPLQQ 399
Query: 166 PNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESN 225
P Q + + P + P + P P + Y A +P
Sbjct: 400 PVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQ 459
Query: 226 P 226
Sbjct: 460 S 460
Score = 29.7 bits (66), Expect = 5.8
Identities = 23/99 (23%), Positives = 38/99 (38%), Gaps = 9/99 (9%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREP-PPRHPNQYPETPQDSPRRE 160
P+ +P ++P+ P P + P+ P+ ++P P P + PQ +
Sbjct: 768 PQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQ 827
Query: 161 PSLRQPNQYP-ETPHEP------RRDSPPRHPNQFPETP 192
P +QP Q P + R+ R P P TP
Sbjct: 828 PQYQQPQQPVAPQPQDTLLHPLLMRNGDSR-PLHKPTTP 865
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 39.9 bits (93), Expect = 0.003
Identities = 24/128 (18%), Positives = 39/128 (30%), Gaps = 6/128 (4%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
P+ S + P P+ P P++ + +PPP+ P Q P
Sbjct: 104 PQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQS------PP 157
Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRR 221
Q Q P + +P + P P+ P P + + + P
Sbjct: 158 QQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPSSMAMQPPYSGAPPSQQFY 217
Query: 222 GESNPGPS 229
G P P
Sbjct: 218 GPPQPSPY 225
Score = 34.9 bits (80), Expect = 0.083
Identities = 26/138 (18%), Positives = 45/138 (32%), Gaps = 7/138 (5%)
Query: 64 LSPRNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHP 123
LS + E ++ K E + A P + P P P+ P
Sbjct: 29 LSHEEAQSSEAHSFHVDSTKQPPAPEQVAKHELADA------PLQQVNAALPPAPAPQSP 82
Query: 124 THHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPR 183
+ P S + Q +P++EP P+Q P + + P+
Sbjct: 83 QPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQ 142
Query: 184 HPNQFPETPR-KPPARYP 200
P Q P+ + + P + P
Sbjct: 143 PPPQVPQQQQYQSPPQQP 160
>gnl|CDD|218658 pfam05616, Neisseria_TspB, Neisseria meningitidis TspB protein.
This family consists of several Neisseria meningitidis
TspB virulence factor proteins.
Length = 502
Score = 40.0 bits (93), Expect = 0.003
Identities = 35/102 (34%), Positives = 40/102 (39%), Gaps = 22/102 (21%)
Query: 100 VIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTP-ESPRREPPPRHPNQYPETPQDSPR 158
VIPR PDLT E P P +P E+P P PR E P P
Sbjct: 313 VIPR----PDLTPGSAEAP------EAQPLPEVSPAENPANNPNPR------ENPGTRPN 356
Query: 159 REPSL-RQPNQYPETPHEP--RRDSP--PRHPNQFPETPRKP 195
EP P+ P+T +P R DSP P PN RK
Sbjct: 357 PEPDPDLNPDANPDTDGQPGTRPDSPAVPDRPNGRHRKERKE 398
>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
Length = 484
Score = 39.6 bits (93), Expect = 0.004
Identities = 16/117 (13%), Positives = 24/117 (20%), Gaps = 6/117 (5%)
Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPET 191
+ + PP P + P L + + E
Sbjct: 56 AAAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAP----AAP 111
Query: 192 PRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSYDAIAVIRTE 248
+ PA + A AE A R D I I +
Sbjct: 112 EAQAPAAPAERAAAENAARRLARAAAAAPR--PRVPADAAAAVADAVKARIERIVND 166
Score = 33.0 bits (76), Expect = 0.45
Identities = 18/121 (14%), Positives = 26/121 (21%), Gaps = 9/121 (7%)
Query: 99 AVIPRYSPSPDLTTRRPEVPRP--RHPTHHHPEHHPTP---ESPRREPPPRHP-NQYPET 152
A+ P + +P P P E P P E +R R
Sbjct: 50 ALAPPAAAAPAAAQPPPAAAPAAVSRPAAPAAE--PAPWLVEHAKRLTAQREQLVARAAA 107
Query: 153 PQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPT 212
P + P+ E R + A + R
Sbjct: 108 PAAPEAQAPAAPAERAAAENA-ARRLARAAAAAPRPRVPADAAAAVADAVKARIERIVND 166
Query: 213 T 213
T
Sbjct: 167 T 167
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 38.9 bits (91), Expect = 0.004
Identities = 22/106 (20%), Positives = 27/106 (25%), Gaps = 8/106 (7%)
Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRRE--PSLRQPNQYPETPHEPRRDSPPRHPNQ-- 187
P P P+ P P E S + P P + D+PP P
Sbjct: 174 GEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSL-PPAPSSFQSDTPPPSPESPT 232
Query: 188 ---FPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSD 230
P P PP PT + AT D
Sbjct: 233 NPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDD 278
Score = 32.7 bits (75), Expect = 0.46
Identities = 16/105 (15%), Positives = 26/105 (24%), Gaps = 14/105 (13%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
P + P+ + + P P+ + P P P P P
Sbjct: 195 PSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPL 254
Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAE 206
S +P P + P IA+A+
Sbjct: 255 STAKPTP-------PSASATPAPIGGITLDDD-------AIAKAQ 285
Score = 32.0 bits (73), Expect = 0.78
Identities = 22/103 (21%), Positives = 28/103 (27%), Gaps = 9/103 (8%)
Query: 132 PTPESPRRE-------PPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRH 184
P P P E N +P D PS P+ P P P P
Sbjct: 148 PNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPS-DPPSSSPGVPSFPSPPEDPSS 206
Query: 185 PNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPG 227
P+ P P + D P + + PP P
Sbjct: 207 PSDSSLPPA-PSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPV 248
Score = 30.4 bits (69), Expect = 2.2
Identities = 18/100 (18%), Positives = 24/100 (24%), Gaps = 1/100 (1%)
Query: 104 YSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSL 163
+ + + P P P SP P P+ + P+
Sbjct: 174 GEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTN 233
Query: 164 RQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIA 203
P P P P P P TP A I
Sbjct: 234 PSPPPGPAAPPPPPVQQVPPLSTAKP-TPPSASATPAPIG 272
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 39.8 bits (93), Expect = 0.005
Identities = 24/136 (17%), Positives = 31/136 (22%), Gaps = 10/136 (7%)
Query: 101 IPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRRE 160
+ S T P P P P SP PP P P +P
Sbjct: 78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSP------- 130
Query: 161 PSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARA---EPRTTPTTTTPP 217
P P + P P A A T ++PP
Sbjct: 131 APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPP 190
Query: 218 ATRRGESNPGPSDKPD 233
A + P +
Sbjct: 191 AEPPPSTPPAAASPRP 206
Score = 38.6 bits (90), Expect = 0.009
Identities = 20/137 (14%), Positives = 36/137 (26%), Gaps = 10/137 (7%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
R P+ ++ R P P + +P + R + + R
Sbjct: 280 SRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGA 339
Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDI------ARAEPRTTPTTTT 215
++ +P S P P ++P R R
Sbjct: 340 AVS----PGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVA 395
Query: 216 PPATRRGESNPGPSDKP 232
A RR + P+ +P
Sbjct: 396 GRARRRDATGRFPAGRP 412
Score = 35.9 bits (83), Expect = 0.069
Identities = 25/133 (18%), Positives = 28/133 (21%), Gaps = 9/133 (6%)
Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
L P TP+ P P P R S +
Sbjct: 797 RRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPP-PARSSESSK 855
Query: 165 QPNQYPETP---HEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRR 221
RR P P P P A A A P P R
Sbjct: 856 SKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKA-----AAAAPPAGAPAPRPRPAPR 910
Query: 222 GESNPGPSDKPDT 234
+ P P PD
Sbjct: 911 VKLGPMPPGGPDP 923
Score = 35.1 bits (81), Expect = 0.11
Identities = 30/129 (23%), Positives = 35/129 (27%), Gaps = 13/129 (10%)
Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
S S P R P+ P P SPR+ P P P P R +
Sbjct: 333 SESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARA 392
Query: 165 QPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
RRD+ R P +P D A P T GE
Sbjct: 393 AVA-----GRARRRDATGRFPAG------RPRPSPLDAGAASGA--FYARYPLLTPSGEP 439
Query: 225 NPGPSDKPD 233
PG P
Sbjct: 440 WPGSPPPPP 448
Score = 31.7 bits (72), Expect = 1.4
Identities = 18/130 (13%), Positives = 29/130 (22%), Gaps = 22/130 (16%)
Query: 112 TRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYP---------------ETPQDS 156
P R P P R + E P
Sbjct: 200 AAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPR 259
Query: 157 PRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTP 216
P + + + P P P R P + + P + P ++P
Sbjct: 260 PAPITLPTRIWE-ASGWNGPSSRPGPASS------SSSPRERSPSPSPSSPGSGPAPSSP 312
Query: 217 PATRRGESNP 226
A+ S+
Sbjct: 313 RASSSSSSSR 322
Score = 31.3 bits (71), Expect = 1.6
Identities = 20/139 (14%), Positives = 32/139 (23%), Gaps = 11/139 (7%)
Query: 95 EKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQ 154
E + RRP R P + + +R+ P+ E+
Sbjct: 779 EPQRGAGSSPPVRAEAAFRRPGRLRRSGPA----ADAASRTASKRKSRSHTPDGGSESS- 833
Query: 155 DSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTT 214
R + +P P +S P R R
Sbjct: 834 GPARPPGAAARP------PPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAA 887
Query: 215 TPPATRRGESNPGPSDKPD 233
P A P+ +P
Sbjct: 888 PPKAAAAAPPAGAPAPRPR 906
Score = 30.5 bits (69), Expect = 2.9
Identities = 18/119 (15%), Positives = 27/119 (22%), Gaps = 2/119 (1%)
Query: 120 PRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRD 179
+ + T + P P + R + + P
Sbjct: 41 GQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPA-- 98
Query: 180 SPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTS 238
SP R + P P P P A P +P R S P +
Sbjct: 99 SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGA 157
Score = 29.8 bits (67), Expect = 5.4
Identities = 24/130 (18%), Positives = 34/130 (26%), Gaps = 7/130 (5%)
Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
SP+ T R P P P P +P PRR P +P +P R +
Sbjct: 176 LSSPEETARAPSSPPAEPPPST-PPAAASPRPPRRSSPISAS---ASSPAPAPGRSAADD 231
Query: 165 QPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
++ N E P PA R + + S
Sbjct: 232 AGASSSDSSSSESSGCGWGPEN---ECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSS 288
Query: 225 NPGPSDKPDT 234
+ P
Sbjct: 289 SSPRERSPSP 298
Score = 29.0 bits (65), Expect = 7.5
Identities = 28/172 (16%), Positives = 40/172 (23%), Gaps = 21/172 (12%)
Query: 78 ESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESP 137
E+ R+ + SP+P + P P P P
Sbjct: 78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEM 137
Query: 138 RREPPPRHPN--QYPETPQDSPRREPS--------------LRQPNQYPETPHEPRRDSP 181
R P P SP S + + P +P S
Sbjct: 138 LRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPST 197
Query: 182 PRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPD 233
P +PP R I+ + P A G S+ S
Sbjct: 198 PPAA-----ASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSES 244
>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
Length = 328
Score = 38.0 bits (88), Expect = 0.010
Identities = 19/83 (22%), Positives = 30/83 (36%), Gaps = 11/83 (13%)
Query: 140 EPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH---EPRRDSPPRHPNQFPETPRKPP 196
EP P + + + + + + R P EP++ P +T KP
Sbjct: 126 EPATVAPVRNGNASRQTAKTQTAERPATTRPARKQAVIEPKK------PQATAKTEPKPV 179
Query: 197 ARYPDIARAEPRTTPTTTTPPAT 219
A+ P R EP +T PA
Sbjct: 180 AQTPK--RTEPAAPVASTKAPAA 200
Score = 34.5 bits (79), Expect = 0.12
Identities = 20/97 (20%), Positives = 33/97 (34%), Gaps = 9/97 (9%)
Query: 133 TPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETP 192
+ ++ + + R P Q EP ++P +T +P +P R P
Sbjct: 139 SRQTAKTQTAERPATTRPARKQAVI--EP--KKPQATAKTEPKPVAQTPKRTEPAAPVAS 194
Query: 193 RKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPS 229
K PA A + P T TT P + +
Sbjct: 195 TKAPA-----ATSTPAPKETATTAPVQTASPAQTTAT 226
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 37.4 bits (87), Expect = 0.019
Identities = 27/124 (21%), Positives = 42/124 (33%), Gaps = 15/124 (12%)
Query: 118 PRPRHPTHHHPEHHPTPESPRREPPPRH-----PNQYPETPQDSPRREP---SLRQPNQY 169
+ + P PR+ P+ P YP+ PQ P + L P+Q
Sbjct: 167 QQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQA 226
Query: 170 PETPHEPRRDSPPRHPN-----QFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
P P P P+ P QFP ++ P P + + + PP +
Sbjct: 227 PAQPPLPP--QLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTP 284
Query: 225 NPGP 228
+PG
Sbjct: 285 HPGL 288
Score = 34.4 bits (79), Expect = 0.15
Identities = 20/107 (18%), Positives = 32/107 (29%), Gaps = 11/107 (10%)
Query: 106 PSPDLTTRRPEVPRPRHPTH----HHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
P L+ + P P P+ P P+ P P++ P Q P +
Sbjct: 247 QFPGLSQQMPPPP-PQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLL 305
Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPR 208
L Q Q + + R Q + + RA+ R
Sbjct: 306 PLVQQPQGQQRGPQFREQLVQLSQQQ------REALSQEEAKRAKRR 346
Score = 33.2 bits (76), Expect = 0.35
Identities = 19/83 (22%), Positives = 31/83 (37%)
Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH 174
PE +P+ + P P + P P Q P+ P S + P QP Q + P
Sbjct: 211 PEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPP 270
Query: 175 EPRRDSPPRHPNQFPETPRKPPA 197
+P+ PP++ +
Sbjct: 271 QPQAQPPPQNQPTPHPGLPQGQN 293
Score = 33.2 bits (76), Expect = 0.40
Identities = 30/140 (21%), Positives = 49/140 (35%), Gaps = 16/140 (11%)
Query: 97 SWAVIPRYSPSPDLTTRRPEVPR--PRHPTHHHPE-------HHPTPESPRREPPPRHPN 147
+ PR + P PE P P+ P H + P+ + PP+ P
Sbjct: 181 PQGMPPRQAAFP--QQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQ 238
Query: 148 QYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEP 207
Q P Q S + P P+ P + ++ PP+ Q P P+ P +P + + +
Sbjct: 239 Q-PPPLQQPQFPGLSQQMPPPPPQPPQQQQQ--PPQPQAQPP--PQNQPTPHPGLPQGQN 293
Query: 208 RTTPTTTTPPATRRGESNPG 227
P P + G
Sbjct: 294 APLPPPQQPQLLPLVQQPQG 313
Score = 31.3 bits (71), Expect = 1.4
Identities = 14/90 (15%), Positives = 28/90 (31%), Gaps = 6/90 (6%)
Query: 118 PRPRHPTHHHPEHHPTPESPRR--EPPPRHPNQYPETPQDS----PRREPSLRQPNQYPE 171
P + P ++ PPP+ P Q + PQ P+ +P+ +
Sbjct: 233 PPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQ 292
Query: 172 TPHEPRRDSPPRHPNQFPETPRKPPARYPD 201
P P P ++ ++ +
Sbjct: 293 NAPLPPPQQPQLLPLVQQPQGQQRGPQFRE 322
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 37.2 bits (86), Expect = 0.021
Identities = 31/165 (18%), Positives = 54/165 (32%), Gaps = 14/165 (8%)
Query: 66 PRNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTH 125
+++KE ++ E + DR+EEK + S P ++ + + P
Sbjct: 144 KEKEKEKEKKVEEPR----------DREEEKKRERVRAKS-RPKKPPKKKPPNKKKEPPE 192
Query: 126 HHPEHHPTPESPRREPP-PRHPNQYPETPQDSPRREPSLRQPNQYPETPH-EPRRDSPPR 183
+ E+ + +P P + + D RE + + E R S
Sbjct: 193 EEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSS 252
Query: 184 HPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGP 228
P P R E R T+ PP+ R + P P
Sbjct: 253 LKKPDPSPSMASPETRESSKRTETRPR-TSLRPPSARPASARPAP 296
Score = 35.6 bits (82), Expect = 0.063
Identities = 31/186 (16%), Positives = 59/186 (31%), Gaps = 28/186 (15%)
Query: 67 RNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHH 126
+ Q K+E + + +P E DRK ++ ++ E PR R
Sbjct: 110 KEQVKEEKKKKKEKPKEEPK----DRKPKEEAKEKRPPKEKEKEKEKKVEEPRDRE-EEK 164
Query: 127 HPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETP-----------HE 175
E P++ P + PN+ E P++ +R+ + PE P
Sbjct: 165 KRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDG 224
Query: 176 PRRDSPPRHPNQF------------PETPRKPPARYPDIARAEPRTTPTTTTPPATRRGE 223
R++ + + +KP + ++ T T P T
Sbjct: 225 KDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTETRPRTSLRP 284
Query: 224 SNPGPS 229
+ P+
Sbjct: 285 PSARPA 290
Score = 32.9 bits (75), Expect = 0.48
Identities = 25/160 (15%), Positives = 51/160 (31%), Gaps = 13/160 (8%)
Query: 65 SPRNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPT 124
P+N+ KE+ + Q E+ +++ K P + +RP + +
Sbjct: 97 EPKNESGKEEEKEKEQVKEEKKKK---KEKPKEEP--KDRKPKEEAKEKRPPKEKEKE-- 149
Query: 125 HHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRH 184
+ E RE + ++ P ++ + + P + R+ +
Sbjct: 150 ----KEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKE-PPEEEKQRQAAREA- 203
Query: 185 PNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
PE P R + + R T T+ R S
Sbjct: 204 VKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSS 243
Score = 29.9 bits (67), Expect = 3.5
Identities = 18/103 (17%), Positives = 24/103 (23%), Gaps = 7/103 (6%)
Query: 62 EVLSPRNQRKKED-------RIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRR 114
E + K+ED + R S R+ S TR
Sbjct: 210 EPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRE 269
Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSP 157
P P S R PP + QD+
Sbjct: 270 SSKRTETRPRTSLRPPSARPASARPAPPRVKRKEIVTVLQDAQ 312
>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1). The
homeobox gene Prox1 is expressed in a subpopulation of
endothelial cells that, after budding from veins, gives
rise to the mammalian lymphatic system. Prox1 has been
found to be an early specific marker for the developing
liver and pancreas in the mammalian foregut endoderm.
This family contains an atypical homeobox domain.
Length = 908
Score = 37.0 bits (85), Expect = 0.030
Identities = 40/161 (24%), Positives = 51/161 (31%), Gaps = 35/161 (21%)
Query: 61 TEVLSPRNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRP 120
TE LS KK+ RT+S R AV P
Sbjct: 572 TEALSLVVTPKKKRHKVTDTRITPRTVS---RILALRDAVGPAAG--------------- 613
Query: 121 RHPTHHHPEHHPTPESPR---REPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPR 177
HH HP+ S PP RHP P T PSL Q + P +
Sbjct: 614 ----THHQPLHPSSLSASMGFHPPPFRHPFPLPLT---VAIPNPSLHQSEVFMGYPFQSP 666
Query: 178 R-----DSPPRHPNQFPETPRKPPARYPDIARA--EPRTTP 211
SPP P+ PR + +P + A P ++P
Sbjct: 667 HLGAPSGSPPGKDRDSPDLPRPTTSLHPKLLSAHHHPGSSP 707
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 36.6 bits (84), Expect = 0.033
Identities = 39/186 (20%), Positives = 56/186 (30%), Gaps = 35/186 (18%)
Query: 82 FRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPE------ 135
R + +S C KE K+ + + P EV P +P+ P+ P +
Sbjct: 157 LRGKDVSCC--KEPKTAVTTSKTTSWPT------EVSHPTYPSQVTPQSQPATQGHQTAT 208
Query: 136 -----------------SPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPE---TPHE 175
+ P P P SP+ PS +Q H
Sbjct: 209 ANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHT 268
Query: 176 PRRDSPPRHPN-QFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDT 234
RR +PP N + P + PP P PT TT + S+P T
Sbjct: 269 QRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPT 328
Query: 235 CDTSYD 240
D
Sbjct: 329 TQNLVD 334
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 36.0 bits (83), Expect = 0.040
Identities = 11/51 (21%), Positives = 14/51 (27%), Gaps = 1/51 (1%)
Query: 112 TRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQ-DSPRREP 161
+ P P T + P E P+ D PRRE
Sbjct: 15 EQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRER 65
Score = 34.5 bits (79), Expect = 0.15
Identities = 17/75 (22%), Positives = 26/75 (34%), Gaps = 7/75 (9%)
Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPR-----HPN 186
P P SP P P P + + P+ P +PRR+ P+
Sbjct: 19 PAPPSPAAAPAPPPPAKTAAPA--TKAAAPAAAAPRAEKPKKDKPRRERKPKPASLWKLE 76
Query: 187 QFPETPRKPPARYPD 201
F P++ R+ D
Sbjct: 77 DFVVEPQEGKTRFHD 91
Score = 32.6 bits (74), Expect = 0.54
Identities = 14/63 (22%), Positives = 23/63 (36%), Gaps = 2/63 (3%)
Query: 135 ESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH--EPRRDSPPRHPNQFPETP 192
E+ + P P P P P + P+ + P +P++D P R P +
Sbjct: 13 EAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPASL 72
Query: 193 RKP 195
K
Sbjct: 73 WKL 75
Score = 29.5 bits (66), Expect = 4.6
Identities = 10/43 (23%), Positives = 13/43 (30%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPR 144
+P P T P P + PRRE P+
Sbjct: 26 AAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPK 68
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 35.8 bits (83), Expect = 0.052
Identities = 20/128 (15%), Positives = 28/128 (21%), Gaps = 13/128 (10%)
Query: 130 HHPTPESPRREP-PPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQF 188
P +P P P + P+ P +P
Sbjct: 382 ARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAP-----PAAAPPAPVAAPAAAAP-- 434
Query: 189 PETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSYDAIAVIRTE 248
P PA + P T P R P + + A + TE
Sbjct: 435 AAAPAAAPAAVALAPAPPAQAAPETVAIP--VRVAPEPAVASAAPAPAAAPAAARLTPTE 492
Query: 249 VFVFKDRW 256
D W
Sbjct: 493 EG---DVW 497
Score = 34.3 bits (79), Expect = 0.19
Identities = 21/131 (16%), Positives = 25/131 (19%), Gaps = 8/131 (6%)
Query: 99 AVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPR 158
A P +P P P P + P P
Sbjct: 372 AAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAA------PPAAAPPAP 425
Query: 159 REPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARY-PDIARAEPRTTPTTTTPP 217
P P Q P R P+ A A P P
Sbjct: 426 VAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAA-PA 484
Query: 218 ATRRGESNPGP 228
A R + G
Sbjct: 485 AARLTPTEEGD 495
>gnl|CDD|215479 PLN02887, PLN02887, hydrolase family protein.
Length = 580
Score = 36.0 bits (83), Expect = 0.054
Identities = 20/99 (20%), Positives = 28/99 (28%), Gaps = 12/99 (12%)
Query: 134 PESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEP---RRDSPPRHPNQFPE 190
P R P R P +P ++ P L QP+ +P R D + F +
Sbjct: 3 PMLSRVSLPLRLPRLFPAAAREMPPAARLLLQPSSLSFSPKTSNFNRLDVTSTEFSSFRK 62
Query: 191 -----TPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
KP R A R + E
Sbjct: 63 GFHAFRSTKPRGR----VWAAHRGYRKIRRRAPPPKKEQ 97
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 35.2 bits (81), Expect = 0.059
Identities = 26/140 (18%), Positives = 44/140 (31%), Gaps = 14/140 (10%)
Query: 76 IGESQPFRERTLSICDRK---EEKSWAVIPRYSPSPDLTTRRPEVPRP-------RHPTH 125
+ + ER ++ R E++ A++ R + L +P + H
Sbjct: 51 SEKKAAYAERDMAFLQRDAWIAERNNALMERDNKFAALQPVENSLPPGCGILPGTKGIHH 110
Query: 126 HHPEHHPTPESPRREPPPRHPNQYPETPQDSP-RREPSLRQPNQYPETPHEPRRDSP--- 181
HHP H + P P +E + Q P+ P P+ P
Sbjct: 111 LQMLHHPQLGDNPYGTREMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKK 170
Query: 182 PRHPNQFPETPRKPPARYPD 201
N+ + P P PD
Sbjct: 171 GSVSNRSVKMPGIDPRSKPD 190
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 35.7 bits (83), Expect = 0.060
Identities = 17/94 (18%), Positives = 20/94 (21%)
Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH 174
P P P P P P P +P +P+ T
Sbjct: 420 AAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAA 479
Query: 175 EPRRDSPPRHPNQFPETPRKPPARYPDIARAEPR 208
P P P P A A R
Sbjct: 480 PAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLR 513
Score = 35.0 bits (81), Expect = 0.11
Identities = 24/136 (17%), Positives = 36/136 (26%), Gaps = 7/136 (5%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRR-- 159
P +P P P +P P + P +
Sbjct: 615 PAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAG 674
Query: 160 --EPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPP 217
P+ P P P P +P + P PPA D A+P + P
Sbjct: 675 GAAPAAPPPAPAPAAPAAPAGAAPAQ---PAPAPAATPPAGQADDPAAQPPQAAQGASAP 731
Query: 218 ATRRGESNPGPSDKPD 233
+ + P P + D
Sbjct: 732 SPAADDPVPLPPEPDD 747
Score = 35.0 bits (81), Expect = 0.12
Identities = 18/117 (15%), Positives = 22/117 (18%)
Query: 106 PSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQ 165
P P P P ++ P Q P + L
Sbjct: 684 APAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPP 743
Query: 166 PNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRG 222
P P PP P P P E + RR
Sbjct: 744 EPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRD 800
Score = 34.2 bits (79), Expect = 0.20
Identities = 26/139 (18%), Positives = 37/139 (26%), Gaps = 14/139 (10%)
Query: 106 PSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPR------REPPPRHPNQYPETPQDSPRR 159
P+ P V HHP+H P++ + P P P +
Sbjct: 636 PAEASAAPAPGVA----APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPA 691
Query: 160 EPSLRQPNQ----YPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTT 215
P+ P Q TP + D P P Q + P D P
Sbjct: 692 APAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDP 751
Query: 216 PPATRRGESNPGPSDKPDT 234
A + P P+
Sbjct: 752 AGAPAQPPPPPAPAPAAAP 770
Score = 33.8 bits (78), Expect = 0.26
Identities = 22/124 (17%), Positives = 30/124 (24%), Gaps = 6/124 (4%)
Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
P+P + E RP P P +P P + P
Sbjct: 600 PPAPASSGPPEEAARPA-----APAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHP 654
Query: 165 QPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
+ P+ + P + P P PA A A P AT
Sbjct: 655 KHVAVPD-ASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQ 713
Query: 225 NPGP 228
P
Sbjct: 714 ADDP 717
Score = 33.8 bits (78), Expect = 0.26
Identities = 20/106 (18%), Positives = 28/106 (26%), Gaps = 7/106 (6%)
Query: 132 PTPESPRREPPPRHPNQYPETPQDSPR---REPSLRQPNQYPETPHEPRRDSPPRHPNQF 188
P P P Q +P P+ + + + P + S P
Sbjct: 678 PAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADD 737
Query: 189 ----PETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSD 230
P P PP A+ P P PA S P +
Sbjct: 738 PVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783
Score = 32.7 bits (75), Expect = 0.54
Identities = 16/119 (13%), Positives = 21/119 (17%), Gaps = 13/119 (10%)
Query: 132 PTPESPR--------REPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPR 183
P P P Q P +P P +
Sbjct: 406 PAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQ 465
Query: 184 HPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSYDAI 242
P P P PA + P +D T + I
Sbjct: 466 -----PAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPEI 519
Score = 31.5 bits (72), Expect = 1.3
Identities = 14/84 (16%), Positives = 17/84 (20%)
Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH 174
P+ P P P P P P PS + P
Sbjct: 416 APAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPE 475
Query: 175 EPRRDSPPRHPNQFPETPRKPPAR 198
+P P PA
Sbjct: 476 PTAAPAPAPPAAPAPAAAPAAPAA 499
Score = 31.1 bits (71), Expect = 1.9
Identities = 19/125 (15%), Positives = 26/125 (20%), Gaps = 11/125 (8%)
Query: 105 SPSPDLTTRRPEVPRPRHPTHH-HPEHHPTPESPRREPPPR--HPNQYPETPQDSPR--R 159
+P+ + P P P P P P +P +
Sbjct: 393 APAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGG 452
Query: 160 EPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPAT 219
PS P +P P P A P A P P
Sbjct: 453 APSPPPAAAPSAQPAPAPAAAPEPTAAPAPA---PPAAPAPA---AAPAAPAAPAAPAGA 506
Query: 220 RRGES 224
+
Sbjct: 507 DDAAT 511
Score = 30.7 bits (70), Expect = 2.4
Identities = 15/129 (11%), Positives = 25/129 (19%), Gaps = 1/129 (0%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
P + P P + P H ++ P S +
Sbjct: 609 PEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDG 668
Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRR 221
+ P P +P A P P A +
Sbjct: 669 WPAKAGGAAPAAPPPAPAPAAPAAPA-GAAPAQPAPAPAATPPAGQADDPAAQPPQAAQG 727
Query: 222 GESNPGPSD 230
+ +D
Sbjct: 728 ASAPSPAAD 736
>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
This family consists mainly of the potato leaf roll
virus readthrough protein. This is generated via a
readthrough of open reading frame 3 a coat protein
allowing transcription of open reading frame 5 to give
an extended coat protein with a large c-terminal
addition or read through domain. The readthrough protein
is thought to play a role in the circulative aphid
transmission of potato leaf roll virus. Also in the
family is open reading frame 6 from beet western yellows
virus and potato leaf roll virus both luteovirus and an
unknown protein from cucurbit aphid-borne yellows virus
a closterovirus.
Length = 460
Score = 35.5 bits (82), Expect = 0.061
Identities = 30/151 (19%), Positives = 41/151 (27%), Gaps = 11/151 (7%)
Query: 112 TRRPEVPRPRHPTH---HHPEHHPTP-ESPRREPPPRHPNQYPETPQDSPRREPSLRQPN 167
RRP PRP H P P E + P Q PE+ D+ SL +
Sbjct: 244 VRRP--PRPGHLLVVSTPELVLPPEPDEQSSERQTFKTPPQ-PESSSDAENGLVSLVDED 300
Query: 168 QYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPG 227
E + D+PP + A P++ A P P
Sbjct: 301 DKEEVSRDSESDAPPDDTDLTRALAEYEAA-APEVPDAARTVLQGKEQPSPDPVESPGP- 358
Query: 228 PSDKPDTCDTSYDAIAVIRTEVFVFKDRWMW 258
D S + V + R
Sbjct: 359 --DLTPGYPKSDEVAGTYLGGGSVAEGRDPL 387
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 34.8 bits (80), Expect = 0.12
Identities = 17/97 (17%), Positives = 23/97 (23%), Gaps = 9/97 (9%)
Query: 133 TPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETP 192
P + P P P +P P P P EP R++ P
Sbjct: 361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIP--PKEPVRETAT------PPPV 412
Query: 193 RKPPARYP-DIARAEPRTTPTTTTPPATRRGESNPGP 228
P P P + + P P
Sbjct: 413 PPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAP 449
Score = 31.7 bits (72), Expect = 1.1
Identities = 16/78 (20%), Positives = 19/78 (24%), Gaps = 1/78 (1%)
Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
+PSP T P + P P E+ P P P P R
Sbjct: 375 APSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHT-PESAPKLTR 433
Query: 165 QPNQYPETPHEPRRDSPP 182
E P P
Sbjct: 434 AAIPVDEKPKYTPPAPPK 451
Score = 31.7 bits (72), Expect = 1.2
Identities = 17/81 (20%), Positives = 23/81 (28%), Gaps = 9/81 (11%)
Query: 150 PETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRT 209
P P + P P + PP+ P + TP P R P
Sbjct: 368 PAKPT-AAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR--------PVA 418
Query: 210 TPTTTTPPATRRGESNPGPSD 230
P TP + + P D
Sbjct: 419 PPVPHTPESAPKLTRAAIPVD 439
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 34.9 bits (80), Expect = 0.12
Identities = 21/106 (19%), Positives = 24/106 (22%), Gaps = 5/106 (4%)
Query: 132 PTPESPRREPPPRHPNQYPET--PQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFP 189
P +P PP P P + P+ R P P
Sbjct: 392 PAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPA 451
Query: 190 ETPRKPPARYPDIARAEPRTTPTTTTPPATRR---GESNPGPSDKP 232
P PA A A PR R P D P
Sbjct: 452 PAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPP 497
Score = 34.1 bits (78), Expect = 0.18
Identities = 24/134 (17%), Positives = 30/134 (22%), Gaps = 9/134 (6%)
Query: 99 AVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPR 158
A P + +P R PR +P R P P + P
Sbjct: 449 APAPAPAAAPAAAARPA-AAGPRPVA------AAAAAAPARAAPAAAPAPADDDPPPWEE 501
Query: 159 REPSLRQPNQYPETPHEPRRD--SPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTP 216
P P S P P+ + A P A A T
Sbjct: 502 LPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVV 561
Query: 217 PATRRGESNPGPSD 230
S G D
Sbjct: 562 APRPPRASASGLPD 575
Score = 32.5 bits (74), Expect = 0.55
Identities = 27/150 (18%), Positives = 33/150 (22%), Gaps = 14/150 (9%)
Query: 97 SWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDS 156
+ A P P+P P P P +P R P +
Sbjct: 376 TAAAAPVAQPAPAAAAPAAAAPAPAAPPA------APAAAPAAAAAARAVAAAPARRSPA 429
Query: 157 PRREPSLRQ-----PNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTP 211
P + RQ P P P A ARA P P
Sbjct: 430 PEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAP 489
Query: 212 TTTTPPATRRGESNPG--PSDKPDTCDTSY 239
E P S P D +
Sbjct: 490 -APADDDPPPWEELPPEFASPAPAQPDAAP 518
>gnl|CDD|233927 TIGR02557, HpaP, type III secretion protein HpaP. This family of
genes is always found in type III secretion operons,
althought its function in the processes of secretion and
virulence is unclear. Hpa stands for Hrp-associated
gene, where Hrp stands for hypersensitivity response and
virulence.
Length = 201
Score = 33.7 bits (77), Expect = 0.13
Identities = 23/107 (21%), Positives = 31/107 (28%), Gaps = 23/107 (21%)
Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPET 191
+ P P P RR L Q + + P
Sbjct: 1 IIQGARLARPAP-------ADPARPARRRTPLAQLRRRDALAYAP--------------P 39
Query: 192 PRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTS 238
PR P D R EPR + PP +++ P+ PD D S
Sbjct: 40 PRPEPPPPCDEDRPEPRADTRASDPPPEAPTDAD--PAQPPDDPDAS 84
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 34.5 bits (79), Expect = 0.14
Identities = 24/114 (21%), Positives = 31/114 (27%), Gaps = 8/114 (7%)
Query: 134 PESPRREPPPRHPNQY----PETPQDSPRREPS----LRQPNQYPETPHEPRRDSPPRHP 185
E PP + P P +PR P+ P P T P P
Sbjct: 168 AEGTLAAPPLGEGSADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTT 227
Query: 186 NQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSY 239
P T P+ +A T P T G P + P + Y
Sbjct: 228 TSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPEASRY 281
Score = 31.8 bits (72), Expect = 0.81
Identities = 19/92 (20%), Positives = 23/92 (25%), Gaps = 3/92 (3%)
Query: 108 PDLTTRRPEV-PRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQP 166
P L P + P T SP P P P T +P + Q
Sbjct: 188 PALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQA 247
Query: 167 NQYPETPHEPRRDSPPRHPNQFPETPRKPPAR 198
PE P P + P P
Sbjct: 248 GTTPEAEGTP--APPTPGGGEAPPANATPAPE 277
>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
Length = 200
Score = 33.4 bits (76), Expect = 0.16
Identities = 33/123 (26%), Positives = 43/123 (34%), Gaps = 14/123 (11%)
Query: 110 LTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQY 169
L T P P PT HP P P P P HP Q +T + +P +
Sbjct: 45 LETGYPFCP----PTTPHPSSQPPPCP----PSPGHPPQTNDTHEKDLALQPPPGGKKKE 96
Query: 170 PETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPS 229
+ + P+Q PE + P+ +P P T PP GE GPS
Sbjct: 97 KKKKETEKPAQGGEKPDQGPEAKGEGEGHEPE----DP--PPEDTPPPPGGEGEVEGGPS 150
Query: 230 DKP 232
P
Sbjct: 151 PGP 153
>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
Length = 580
Score = 34.4 bits (79), Expect = 0.18
Identities = 16/118 (13%), Positives = 29/118 (24%), Gaps = 1/118 (0%)
Query: 111 TTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHP-NQYPETPQDSPRREPSLRQPNQY 169
T+ VPRP + + + R PP P SPR P
Sbjct: 68 TSTIYTVPRPPRGPEQTLDKPDSLPASRELPPGPTPVPPGGFRGASSPRLGADSTSPRFL 127
Query: 170 PETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPG 227
+ + E + P + + P + +++
Sbjct: 128 YQVNFPVILAPIGESNSSSEELSEEEEHSRPPPSESLKVKNGGKVYPKGFSKHKTHKR 185
Score = 30.1 bits (68), Expect = 3.3
Identities = 27/186 (14%), Positives = 41/186 (22%), Gaps = 23/186 (12%)
Query: 66 PRNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVI-PRYSPSPDLTTRR-PEVPRPRHP 123
PR Q + + S + + + + P S S + P
Sbjct: 5 PRGQPGRWN---YDSSDESPEGSRDENFDAERDDFLTPLGSTSEATSEDDDDLYPPRETG 61
Query: 124 THHHPEHHPTPESPRREPPPRHPNQYPETP--QDSPRREPSLRQPNQ----------YPE 171
+ PR PPR P Q + P + R P P
Sbjct: 62 SGGGVATSTIYTVPR---PPRGPEQTLDKPDSLPASRELPPGPTPVPPGGFRGASSPRLG 118
Query: 172 TPHEPRRDS--PPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPS 229
R P + E P + + G+ P
Sbjct: 119 ADSTSPRFLYQVNFPVILAPIGESNSSSEELS-EEEEHSRPPPSESLKVKNGGKVYPKGF 177
Query: 230 DKPDTC 235
K T
Sbjct: 178 SKHKTH 183
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 34.1 bits (78), Expect = 0.19
Identities = 25/84 (29%), Positives = 34/84 (40%), Gaps = 13/84 (15%)
Query: 97 SWAVIPRYSPSPDLTTRRPEVPRP---RHPTHHHPEH-------HPTPES---PRREPPP 143
V P + SPD+ + PE P RHP PE P+P S P ++PP
Sbjct: 377 DGGVAPSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPV 436
Query: 144 RHPNQYPETPQDSPRREPSLRQPN 167
P +PQ S R + +P
Sbjct: 437 ARSAPLPPSPQASAPRNVASGKPG 460
Score = 29.5 bits (66), Expect = 5.1
Identities = 16/76 (21%), Positives = 21/76 (27%), Gaps = 7/76 (9%)
Query: 123 PTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPP 182
P PE +P PE P P PS P + P + P
Sbjct: 384 PAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPS-------PASAPTPEQQPPV 436
Query: 183 RHPNQFPETPRKPPAR 198
P +P+ R
Sbjct: 437 ARSAPLPPSPQASAPR 452
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 34.1 bits (78), Expect = 0.19
Identities = 17/100 (17%), Positives = 26/100 (26%), Gaps = 11/100 (11%)
Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPET 191
TP R P + S + P +TP D P
Sbjct: 269 QTPMPERSWQTPAQTPARRISTPM-TEEIKSWQTP---LQTPAMYSSDYQAPKPEPIYTW 324
Query: 192 PRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDK 231
R+P ++ P + S+ GP+ K
Sbjct: 325 EELLRERFPS-------DLFAISSLPDSDSEASDSGPTRK 357
>gnl|CDD|165564 PHA03309, PHA03309, transcriptional regulator ICP4; Provisional.
Length = 2033
Score = 34.4 bits (78), Expect = 0.19
Identities = 34/137 (24%), Positives = 45/137 (32%), Gaps = 16/137 (11%)
Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
S S ++ P R P P+P PRR P R + +D P P
Sbjct: 1818 SSSSSSSSSSSSSPSSRPSRSATPSLSPSPSPPRRAPVDRSRSGR-RRERDRPSANPFRW 1876
Query: 165 QPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
P Q H P +P P + P R P TP++ T +R
Sbjct: 1877 APRQRSRADHSPDGTAPGDAPLNLEDGP----------GRGRPIWTPSSATTLPSRS--- 1923
Query: 225 NPGPSDKPDTCDTSYDA 241
GP D D +T A
Sbjct: 1924 --GPEDSVDETETEDSA 1938
>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 504
Score = 34.0 bits (78), Expect = 0.21
Identities = 19/87 (21%), Positives = 22/87 (25%), Gaps = 11/87 (12%)
Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPET 191
P+ PP P + R E +R P P F
Sbjct: 346 PSEGVAAVAPPAPAPADLTQRLN---RLEKEVRSLRSAPTAAATAAGAPLP----DFDPR 398
Query: 192 PRKPPARYPDIARAEPRTTPTTTTPPA 218
PR PPA A P P A
Sbjct: 399 PRGPPA----PEPARSAEAPPLVAPAA 421
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 33.5 bits (76), Expect = 0.22
Identities = 21/101 (20%), Positives = 29/101 (28%), Gaps = 7/101 (6%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
P P P +P P P +P P P R ++PN P + R +P
Sbjct: 180 PNMGMRPGFNQMPPHMPG-MPPNQMRPGFNPMPGMPPRPGFNQNPNMMPNMNRPGFRPQP 238
Query: 162 S-LRQPNQYPETPHEPRRDSPPR-----HPNQFPETPRKPP 196
P + R P P+ PP
Sbjct: 239 GGFNHPGTPMGPNMQQRPGFNPNQGMNPPPHMAGPRAGFPP 279
>gnl|CDD|218163 pfam04592, SelP_N, Selenoprotein P, N terminal region. SelP is the
only known eukaryotic selenoprotein that contains
multiple selenocysteine (Sec) residues, and accounts for
more than 50% of the selenium content of rat and human
plasma. It is thought to be glycosylated. SelP may have
antioxidant properties. It can attach to epithelial
cells, and may protect vascular endothelial cells
against peroxynitrite toxicity. The high selenium
content of SelP suggests that it may be involved in
selenium intercellular transport or storage. The
promoter structure of bovine SelP suggest that it may be
involved in countering heavy metal intoxication, and may
also have a developmental function. The N-terminal
region of SelP can exist independently of the C terminal
region. Zebrafish selenoprotein Pb lacks the C terminal
Sec-rich region, and a protein encoded by the rat SelP
gene and lacking this region has also been reported.
N-terminal region contains a conserved SecxxCys motif,
which is similar to the CysxxCys found in thioredoxins.
It is speculated that the N terminal region may adopt a
thioredoxin fold and catalyze redox reactions. The
N-terminal region also contains a His-rich region, which
is thought to mediate heparin binding. Binding to
heparan proteoglycans could account for the membrane
binding properties of SelP. The function of the
bacterial members of this family is uncharcterised.
Length = 238
Score = 33.3 bits (76), Expect = 0.23
Identities = 15/70 (21%), Positives = 23/70 (32%)
Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
+T P P HP HH E H + +P + S ++P
Sbjct: 163 DECKKVTLATVNKPVEAEPRQDHPHHHSHHEHQGHAHHHPHGHHHPGSNSHSESQQPDPD 222
Query: 165 QPNQYPETPH 174
+P + P H
Sbjct: 223 KPTEPPSGLH 232
>gnl|CDD|223041 PHA03321, PHA03321, tegument protein VP11/12; Provisional.
Length = 694
Score = 33.8 bits (77), Expect = 0.29
Identities = 27/114 (23%), Positives = 37/114 (32%), Gaps = 5/114 (4%)
Query: 132 PTPESPRRE-PPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH---EPRRDSPPRHPNQ 187
P +PRR+ PP P P + RR + R + PE RR P
Sbjct: 431 PGAPAPRRDNDPPPPPRARPGSTPACARRARAQRARDAGPEYVDPLGALRRLPAGAAPPP 490
Query: 188 FPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSYDA 241
P P Y + PR P T R + P + P+ + Y
Sbjct: 491 EPAAAPSPATYYTRMGGGPPRLPPRNRATE-TLRPDWGPPAAAPPEQMEDPYLE 543
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 33.2 bits (76), Expect = 0.32
Identities = 12/44 (27%), Positives = 15/44 (34%), Gaps = 3/44 (6%)
Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHE 175
P P+ P PPP PE D P + + P E
Sbjct: 269 PEPQPP--PPPPPPEPPEPEEEPDEPDQTDPDDGEET-DQIPEE 309
Score = 29.4 bits (66), Expect = 6.0
Identities = 11/50 (22%), Positives = 17/50 (34%), Gaps = 2/50 (4%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPE 151
PR + P+ + P P P P E ++ +Q PE
Sbjct: 261 PRATRLPEPEPQPPPPPPPPEPPEPEEEPDEPDQTD--PDDGEETDQIPE 308
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 33.4 bits (77), Expect = 0.36
Identities = 15/77 (19%), Positives = 22/77 (28%), Gaps = 3/77 (3%)
Query: 126 HHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHP 185
+ + P P+ P P P P P P P + + P + P
Sbjct: 519 NTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPP--PPPTATQASSNAPAQIPADSSP 576
Query: 186 NQ-FPETPRKPPARYPD 201
PE P P +
Sbjct: 577 PPPIPEEPTPSPTKDSS 593
Score = 30.7 bits (70), Expect = 2.1
Identities = 22/79 (27%), Positives = 26/79 (32%), Gaps = 1/79 (1%)
Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
S S T P P P P PT +P PPP P + +
Sbjct: 516 SASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSS 575
Query: 165 QPNQYPETP-HEPRRDSPP 182
P PE P P +DS P
Sbjct: 576 PPPPIPEEPTPSPTKDSSP 594
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 33.1 bits (75), Expect = 0.44
Identities = 16/62 (25%), Positives = 18/62 (29%), Gaps = 6/62 (9%)
Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH 174
P VP P P P+ P PP SP S R Q +P
Sbjct: 432 PSVPSPGGPGSQPPQSVSGGMIP---SPPALMPSPSPQMSQSP---ASQRTIQQDMVSPG 485
Query: 175 EP 176
P
Sbjct: 486 GP 487
>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
partitioning].
Length = 324
Score = 32.5 bits (74), Expect = 0.49
Identities = 23/134 (17%), Positives = 37/134 (27%), Gaps = 4/134 (2%)
Query: 103 RYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPS 162
S D ++ + E + P + S + +
Sbjct: 43 MKSKRDDDPYDEVADDEGVGEVRVVRKNEAPQFTQEHEAARQSPQHQYQPEYASAQIKIP 102
Query: 163 LRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPP----ARYPDIARAEPRTTPTTTTPPA 218
+ QP Q + P P+ P Q PE R+P A P + T PA
Sbjct: 103 VPQPPQISDPPAHPQPTQPALDQEQPPEEARQPVLPQEAPAPQPVHSAAPQPAVQTVQPA 162
Query: 219 TRRGESNPGPSDKP 232
+ P +P
Sbjct: 163 VPEQQVQPEEVVEP 176
Score = 32.1 bits (73), Expect = 0.71
Identities = 17/89 (19%), Positives = 27/89 (30%)
Query: 107 SPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQP 166
S + P+ P+ P H P + + R P E P P + +
Sbjct: 96 SAQIKIPVPQPPQISDPPAHPQPTQPALDQEQPPEEARQPVLPQEAPAPQPVHSAAPQPA 155
Query: 167 NQYPETPHEPRRDSPPRHPNQFPETPRKP 195
Q + ++ P PE R P
Sbjct: 156 VQTVQPAVPEQQVQPEEVVEPAPEVKRPP 184
>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9. The KAR9 protein in
Saccharomyces cerevisiae is a cytoskeletal protein
required for karyogamy, correct positioning of the
mitotic spindle and for orientation of cytoplasmic
microtubules. KAR9 localises at the shmoo tip in mating
cells and at the tip of the growing bud in anaphase.
Length = 626
Score = 32.9 bits (75), Expect = 0.50
Identities = 24/136 (17%), Positives = 33/136 (24%), Gaps = 16/136 (11%)
Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPET 191
PT S E P E + SP LR P + P
Sbjct: 445 PTLASIPDEKPS--NISVFEDSETSPNSSTLLRDPPPKKCGEESGHLPNNPFFNKLKLTL 502
Query: 192 PRKPPARYPDIARAEPRTTPTTTTPPAT---------RRGESNPGPSDKPDTC-DTSYDA 241
PP R T PT + P + S P P
Sbjct: 503 SSIPPLSP----RQSIITLPTPSRPASRISSLSLRLGSYSGSIVSPPPYPTLVSRKGAAG 558
Query: 242 IAVIRTEVFVFKDRWM 257
++ R+ + +R
Sbjct: 559 LSFNRSVSDIEGERIG 574
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 32.7 bits (74), Expect = 0.55
Identities = 21/80 (26%), Positives = 28/80 (35%), Gaps = 3/80 (3%)
Query: 98 WAVIPRYSPSPDLTTRRPEV-PRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDS 156
+ S +P T P V HH P P P P +PE P S
Sbjct: 367 RGLEKNPSTAPS-TPATPRVRAVLTTQVHHCVVVKPAPAVPTTPSPSLTTALFPEAPSPS 425
Query: 157 PRREPSLRQPNQYPETPHEP 176
P P QP+ +P+ + P
Sbjct: 426 PSALPP-GQPDLHPKAEYPP 444
>gnl|CDD|239805 cd04278, ZnMc_MMP, Zinc-dependent metalloprotease, matrix
metalloproteinase (MMP) sub-family. MMPs are responsible
for a great deal of pericellular proteolysis of
extracellular matrix and cell surface molecules, playing
crucial roles in morphogenesis, cell fate specification,
cell migration, tissue repair, tumorigenesis, gain or
loss of tissue-specific functions, and apoptosis. In
many instances, they are anchored to cell membranes via
trans-membrane domains, and their activity is controlled
via TIMPs (tissue inhibitors of metalloproteinases).
Length = 157
Score = 31.4 bits (72), Expect = 0.55
Identities = 8/16 (50%), Positives = 12/16 (75%)
Query: 6 WELPEDDKMAIQSLYG 21
++L +DD IQ+LYG
Sbjct: 142 FKLSQDDIRGIQALYG 157
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 32.7 bits (75), Expect = 0.58
Identities = 19/92 (20%), Positives = 25/92 (27%), Gaps = 7/92 (7%)
Query: 173 PHEPRR----DSPPRHPNQFPETPR--KPPARYPDIARAEPRTTPTTTTPPATRRGESNP 226
PH RR + E P P PD + + P E P
Sbjct: 306 PHRRRRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEP 365
Query: 227 GPSDKPDTCDTSYDAIAVI-RTEVFVFKDRWM 257
P ++ D +S D I R V
Sbjct: 366 DPQEEADGQGSSTDPAGDIFRIRVLAPPQARA 397
Score = 30.4 bits (69), Expect = 2.7
Identities = 12/93 (12%), Positives = 25/93 (26%), Gaps = 3/93 (3%)
Query: 138 RREPPPRHPNQYPETPQDSPRR---EPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRK 194
RR+P + + + ++ P +P + PR P + ++ P++
Sbjct: 310 RRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDPQE 369
Query: 195 PPARYPDIARAEPRTTPTTTTPPATRRGESNPG 227
P R G
Sbjct: 370 EADGQGSSTDPAGDIFRIRVLAPPQARARGASG 402
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 32.4 bits (73), Expect = 0.58
Identities = 23/120 (19%), Positives = 36/120 (30%), Gaps = 12/120 (10%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHH--PTPESPRREPPPRHPNQYPETPQDSPRR 159
P +P + P P+ PTP + + P P+Q D P
Sbjct: 42 PAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPD-PAV 100
Query: 160 EPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPAT 219
P L + P+ E + H + ++ PD A T +PP
Sbjct: 101 APQLAAAPK-PD-AAEAFTSAAQAHEAPA-DAGTSAASKKPDPA------AHTQHSPPPF 151
>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
region.
Length = 295
Score = 32.4 bits (73), Expect = 0.59
Identities = 34/159 (21%), Positives = 45/159 (28%), Gaps = 26/159 (16%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
SPSP ++ R HH P P P +P +P +
Sbjct: 116 LFSSPSPQDSSPRLSA----FTQHHRPVITGHSGISASPHPTPSPLHFPTSPILPQQPSS 171
Query: 162 SLRQPN-QYPETPH---------------------EPRRDSPPRHPNQFPETPRKPPARY 199
+YP H +P + PN F TP P
Sbjct: 172 YFPHTAIRYPPHLHPQDPLKEFVQLVCDPSSQQAGQPNGSGQGKVPNHFLPTPMLAPPPP 231
Query: 200 PDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTS 238
P +AR P P T P + G + S T TS
Sbjct: 232 PPMARPVPLPMPDTKPPTTSTEGGATSPTSPTYSTPSTS 270
>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in
biomineralisation. They seem to regulate the formation
of crystallites during the secretory stage of tooth
enamel development. thought to play a major role in the
structural organisation and mineralisation of developing
enamel. They are found in the extracellular matrix.
Mutations in X-chromosomal amelogenin can cause
Amelogenesis imperfecta.
Length = 174
Score = 31.4 bits (71), Expect = 0.59
Identities = 25/102 (24%), Positives = 37/102 (36%), Gaps = 11/102 (10%)
Query: 106 PSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDS-------PR 158
P L+ + P+ + HP HH P ++P P +P P P
Sbjct: 49 PIIPLSPQMPQQQQSAHPKL--TPHHQLLILPPQQPMMPVPGHHPMVPMTGQQPHLQPPA 106
Query: 159 REPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYP 200
+ P Q P+ P +P PP P Q P P+ +P
Sbjct: 107 QHPLQPTYGQNPQ-PQQPTHTQPPVQPQQ-PADPQPGQPMFP 146
Score = 31.4 bits (71), Expect = 0.68
Identities = 23/78 (29%), Positives = 35/78 (44%), Gaps = 6/78 (7%)
Query: 120 PRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRD 179
P+ P P HHP ++P + P Q+P P + P +QP + P +P++
Sbjct: 79 PQQPMMPVPGHHPMVPMTGQQPHLQPPAQHPLQPT--YGQNPQPQQPTH-TQPPVQPQQP 135
Query: 180 SPPRHPNQ--FPETPRKP 195
+ P P Q FP P P
Sbjct: 136 ADP-QPGQPMFPMQPLPP 152
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 32.7 bits (74), Expect = 0.60
Identities = 38/130 (29%), Positives = 47/130 (36%), Gaps = 17/130 (13%)
Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
SP P + + P P + P P HHP P PPP + Q P + P L
Sbjct: 372 SPFPQMPSNLPPPPALK-PLSSLPTHHP----PSAHPPPLQLMPQSQPLQSVPAQPPVLT 426
Query: 165 Q----PNQYPETPHEPRRDSPPRHP-NQFPETPRKPPARYPDIARAEPRTTPTTTTPPAT 219
Q P + PH PP+ P Q P T PA P + T+TP A
Sbjct: 427 QSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPA-------IGPPPSLPTSTPAAP 479
Query: 220 RRGESNPGPS 229
R S P
Sbjct: 480 PRASSGSQPP 489
Score = 32.4 bits (73), Expect = 0.77
Identities = 30/133 (22%), Positives = 41/133 (30%), Gaps = 5/133 (3%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
P P +P H P P S +P +Q P P S +
Sbjct: 260 SSRHPQSSHHGPGPPMP---HALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPH 316
Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRR 221
S P+Q P +P R+ P P P P + + P P +
Sbjct: 317 SHTPPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQ 376
Query: 222 GESN--PGPSDKP 232
SN P P+ KP
Sbjct: 377 MPSNLPPPPALKP 389
Score = 32.0 bits (72), Expect = 0.94
Identities = 19/108 (17%), Positives = 30/108 (27%), Gaps = 7/108 (6%)
Query: 122 HPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSP 181
P P + PP P+ PQ SP + + Q + +P
Sbjct: 174 QPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSP---IAAQPAPQPQQPSPLSLISAP 230
Query: 182 PRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPS 229
HP + P P ++ P + + PGP
Sbjct: 231 SLHPQRLPS----PHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPP 274
Score = 29.7 bits (66), Expect = 4.4
Identities = 27/110 (24%), Positives = 46/110 (41%), Gaps = 9/110 (8%)
Query: 116 EVPRPRHPTHHHPEHHPTPESPRREPP----PRH---PNQYPETPQDSPRREPSLRQPNQ 168
E P P P+ H + PT P+ P H P+ +P+ P + P P+L+ +
Sbjct: 334 EQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPSNLPP-PPALKPLSS 392
Query: 169 YPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPA 218
P T H P PP + + PA+ P + +++ +T P +
Sbjct: 393 LP-THHPPSAHPPPLQLMPQSQPLQSVPAQPPVLTQSQSLPPKASTHPHS 441
>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486). This
family is made up of members from various Burkholderia
spp. The function is unknown.
Length = 245
Score = 31.8 bits (72), Expect = 0.61
Identities = 28/148 (18%), Positives = 37/148 (25%), Gaps = 13/148 (8%)
Query: 91 DRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYP 150
D A P + E P H E PE EP P
Sbjct: 30 DAAGPHDDAAEPVLTDQIVPG---AEQAASAAPVHAAREATADPEFVAVEPVPTPHVPAV 86
Query: 151 ETPQDSP-RREPSLRQ--PNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEP 207
P D+ EP + P + Q P A D A P
Sbjct: 87 ALPGDTDAPAEPGAAPHVVAERAAAMQAPLPSALAADDPQAPPA----GATAADAGDAAP 142
Query: 208 RTTP---TTTTPPATRRGESNPGPSDKP 232
TP +PPA + ++ +
Sbjct: 143 DATPPAAGDASPPAAAQAAASAAAALTD 170
>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
Provisional.
Length = 576
Score = 32.2 bits (73), Expect = 0.65
Identities = 28/144 (19%), Positives = 40/144 (27%), Gaps = 35/144 (24%)
Query: 101 IPRYSPSPDLTTRRPEVP----RPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDS 156
+P S P P VP +P + P P+ P +EP + E P+D
Sbjct: 354 VPDESNVPPNP---PNVPGGSNSEFSSDVENPPNPPNPDIPEQEPNIPEDSN-KEVPEDV 409
Query: 157 PRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTP 216
P PE + + P + N+ P P
Sbjct: 410 PME----------PEDDRDNNFNEPKKPENKGDGQNE-----------------PVIPKP 442
Query: 217 PATRRGESNPGPSDKPDTCDTSYD 240
R +SN P S D
Sbjct: 443 LDNERDQSNKNKQVNPGNRHNSED 466
Score = 29.2 bits (65), Expect = 6.1
Identities = 27/129 (20%), Positives = 39/129 (30%), Gaps = 20/129 (15%)
Query: 106 PSPDLTTRRPEVPRPRHPTHHHPEHH---PTPESPRREPPPRHPNQYPETPQDSPRREPS 162
P+P T + PRP P + P++P+ PP P +D
Sbjct: 292 PAPVPPTPEDDNPRPTDDEFAVPNFNEGLDVPDNPQDPVPP------PNEGKDG------ 339
Query: 163 LRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRG 222
N E P D P N P P P ++ P P +
Sbjct: 340 ----NPNEENLFPPGDDEVPDESNVPP-NPPNVPGGSNSEFSSDVENPPNPPNPDIPEQE 394
Query: 223 ESNPGPSDK 231
+ P S+K
Sbjct: 395 PNIPEDSNK 403
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 32.5 bits (74), Expect = 0.65
Identities = 18/115 (15%), Positives = 30/115 (26%), Gaps = 7/115 (6%)
Query: 113 RRPEVPRPRHPTH-HHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPE 171
R+ + P P + P+ P P P +PN
Sbjct: 383 RQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL---RPNGLA- 438
Query: 172 TPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNP 226
+P R+ + P P YP ++ P + +G N
Sbjct: 439 --PMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNK 491
>gnl|CDD|173550 PTZ00357, PTZ00357, methyltransferase; Provisional.
Length = 1072
Score = 32.4 bits (73), Expect = 0.79
Identities = 26/117 (22%), Positives = 42/117 (35%), Gaps = 9/117 (7%)
Query: 136 SPRREPPPRHPNQYPETP------QDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFP 189
+P P HP+ P+ P +D + + YP+ P R +PP + +
Sbjct: 428 APHGVAPQEHPHYRPDAPGTTLAEEDVVEAQEYEHPNHGYPDEDGAPLRRTPPTYHDGES 487
Query: 190 ETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSYDAIAVIR 246
E + A Y + T T+ + + G DTSY+A A R
Sbjct: 488 EGSQ---ADYSLFDWYDSVTELETSHEADSAVAAAKGGSKGSGSASDTSYEAAADCR 541
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 31.6 bits (72), Expect = 0.82
Identities = 31/143 (21%), Positives = 49/143 (34%), Gaps = 27/143 (18%)
Query: 93 KEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHP---TPE---------SPRRE 140
K E+ W I +L R+ VP P P+ + P T E + R+
Sbjct: 44 KPEERWRYIK------ELENRQIGVPTPTEPSAGGEVNSPTQLTDEQRQLLEQMQADMRQ 97
Query: 141 PPPRHP-----NQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKP 195
P + Q P+ P+ + + + +Q P T +P+ +PPR P P+ P
Sbjct: 98 QPTQLSEVPYNEQTPQVPRSTVQIQQQAQQQQP-PATTAQPQPVTPPRQTTA-PVQPQTP 155
Query: 196 PARYPDIARAEPRTTPTTTTPPA 218
A P T
Sbjct: 156 APVRT--QPAAPVTQAVEAPKVE 176
Score = 31.6 bits (72), Expect = 0.93
Identities = 24/96 (25%), Positives = 35/96 (36%), Gaps = 8/96 (8%)
Query: 164 RQPNQYPETPHEPRRDSPPRHPNQFPET--PRKPPARYPDIARAEPRTTPTTTTPPATRR 221
+QP Q E P+ + PR Q + ++PPA A+ +P T P TT P +
Sbjct: 97 QQPTQLSEVPYNEQTPQVPRSTVQIQQQAQQQQPPAT---TAQPQPVTPPRQTTAPVQPQ 153
Query: 222 GESNPGPSDKPDTCDTSYDAIAVIRTEVFVFKDRWM 257
P P + A + RWM
Sbjct: 154 ---TPAPVRTQPAAPVTQAVEAPKVEAEKEKEQRWM 186
Score = 30.0 bits (68), Expect = 2.8
Identities = 17/81 (20%), Positives = 29/81 (35%), Gaps = 2/81 (2%)
Query: 128 PEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQ 187
P + TP+ PR + Q + P + + +P P + P +P+ +P R
Sbjct: 106 PYNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQP--VTPPRQTTAPVQPQTPAPVRTQPA 163
Query: 188 FPETPRKPPARYPDIARAEPR 208
P T + E R
Sbjct: 164 APVTQAVEAPKVEAEKEKEQR 184
>gnl|CDD|219053 pfam06482, Endostatin, Collagenase NC10 and Endostatin. NC10
stands for Non-helical region 10 and is taken from human
COL15A1. A mutation in this region in human COL18A1 is
associated with an increased risk of prostrate cancer.
This domain is cleaved from the precursor and forms
endostatin. Endostatin is a key tumour suppressor and
has been used highly successfully to treat cancer. It is
a potent angiogenesis inhibitor. Endostatin also binds a
zinc ion near the N-terminus; this is likely to be of
structural rather than functional importance according
to.
Length = 291
Score = 31.6 bits (72), Expect = 0.83
Identities = 13/85 (15%), Positives = 20/85 (23%), Gaps = 13/85 (15%)
Query: 96 KSWAVIP--RYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETP 153
W + P P T +P P P P P P+ +
Sbjct: 42 NGWRKVLLGELVPLPGTTATQPPPVV------LTPWSDPRLPDPPHLPDPQTHSATAHRN 95
Query: 154 QDSPRREPSLRQPNQYPETPHEPRR 178
P L P + + +
Sbjct: 96 P-----HPPLNSPARIAHVHQDFQP 115
Score = 30.5 bits (69), Expect = 2.2
Identities = 12/58 (20%), Positives = 20/58 (34%)
Query: 129 EHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPN 186
E P P + +PPP + + P P + + P +SP R +
Sbjct: 51 ELVPLPGTTATQPPPVVLTPWSDPRLPDPPHLPDPQTHSATAHRNPHPPLNSPARIAH 108
>gnl|CDD|234706 PRK00269, zipA, cell division protein ZipA; Reviewed.
Length = 293
Score = 31.7 bits (72), Expect = 0.87
Identities = 25/121 (20%), Positives = 31/121 (25%), Gaps = 14/121 (11%)
Query: 102 PRYSPSPDLTTRRPEVPRPRH--PTHHHPE--HHPTPESPRREPPPRHPNQYPETPQDSP 157
YS PD E+ P TH P+ H P R R + + +
Sbjct: 41 RSYSNLPDEEEGSAELLGPARVLDTHKEPQLDEHDLPSMSARPRERRRDTKTAKQQKRGR 100
Query: 158 RREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPP 217
EP Q D P F + EPR P PP
Sbjct: 101 GSEP---QQGDLNL-------DLDEVEPALFSDRDDDFTPDKRKSKGREPRIEPPKELPP 150
Query: 218 A 218
Sbjct: 151 V 151
>gnl|CDD|132911 cd06844, STAS, Sulphate Transporter and Anti-Sigma factor
antagonist domain found in the C-terminal region of
sulphate transporters as well as in bacterial and
archaeal proteins involved in the regulation of sigma
factors. The STAS (Sulphate Transporter and Anti-Sigma
factor antagonist) domain is found in the C-terminal
region of sulphate transporters as well as in bacterial
and archaeal proteins involved in the regulation of
sigma factors, like anti-anti-sigma factors and
"stressosome" components. The sigma factor regulators
are involved in protein-protein interaction which is
regulated by phosphorylation.
Length = 100
Score = 29.8 bits (67), Expect = 0.89
Identities = 21/78 (26%), Positives = 26/78 (33%), Gaps = 12/78 (15%)
Query: 275 QFWSELPHNLTHIDAVYQRIDNHIAFFIGRQYFL-FEGIRLLPGYPRPLTALGLPASLER 333
QF EL HN+T I I F+ G +L R A+G L
Sbjct: 25 QFKEELLHNIT------NVAGKTIVIDISALEFMDSSGTGVLLERSRLAEAVGGQFVLTG 78
Query: 334 IDAAMVWGYNSKTYLFSG 351
I A+ T SG
Sbjct: 79 ISPAV-----RITLTESG 91
>gnl|CDD|214832 smart00817, Amelin, Ameloblastin precursor (Amelin). This family
consists of several mammalian Ameloblastin precursor
(Amelin) proteins. Matrix proteins of tooth enamel
consist mainly of amelogenin but also of non-amelogenin
proteins, which, although their volumetric percentage is
low, have an important role in enamel mineralisation.
One of the non-amelogenin proteins is ameloblastin, also
known as amelin and sheathlin. Ameloblastin (AMBN) is
one of the enamel sheath proteins which is though to
have a role in determining the prismatic structure of
growing enamel crystals.
Length = 411
Score = 31.8 bits (72), Expect = 0.90
Identities = 25/92 (27%), Positives = 37/92 (40%), Gaps = 9/92 (9%)
Query: 127 HPEHHPTPESPRREP--PPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRH 184
P H T + P PP P+Q PQ P +P L QP P P+++ P
Sbjct: 80 RPREHETQQYEYSLPVHPPPLPSQPSLQPQ-QPGLKPFL-QPTALPTNQATPQKNGP--- 134
Query: 185 PNQFPETPRKPPARYPDIARAEPRTTPTTTTP 216
Q P +PP + ++ P+ P+ P
Sbjct: 135 --QPPMHLGQPPLQQAELPMIPPQVAPSDKPP 164
>gnl|CDD|218429 pfam05097, DUF688, Protein of unknown function (DUF688). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana.
Length = 446
Score = 31.7 bits (71), Expect = 0.91
Identities = 24/97 (24%), Positives = 36/97 (37%), Gaps = 2/97 (2%)
Query: 109 DLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQ 168
+L V R E T + R P P E P REP P +
Sbjct: 7 NLNAPLLSVRRVPSTPESSSESENTKTTSTRSIKPSCPVTKSEIDSG-PVREPG-SVPFK 64
Query: 169 YPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARA 205
+ +TP +P+ +S P+ TP+ PP R + +
Sbjct: 65 WEQTPGKPKDESKPQIQEHPDFTPKLPPGRSRTVPVS 101
>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
Length = 1463
Score = 32.1 bits (72), Expect = 0.93
Identities = 35/134 (26%), Positives = 51/134 (38%), Gaps = 17/134 (12%)
Query: 90 CDRKEEKS---WAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREP----- 141
CD +E S + P P D TR+ R + H + P + ++
Sbjct: 768 CDEQEPLSVSPYGPEPDRPPDDDFETRKG---LKRKSSEDHADPIPEGNATKKTCGLQGL 824
Query: 142 PPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPD 201
P P PET +D+P P P P P E P+ P Q P+ P+ P +
Sbjct: 825 PDSLPPAVPETDRDNPLLPPCPITPEGPPCPPREE-----PQQP-QEPQEPQSPSFHISE 878
Query: 202 IARAEPRTTPTTTT 215
I A +TP + T
Sbjct: 879 IGEALFHSTPVSPT 892
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 31.9 bits (72), Expect = 0.97
Identities = 14/93 (15%), Positives = 17/93 (18%), Gaps = 14/93 (15%)
Query: 132 PTPESPRREPPPRHP----NQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQ 187
+ P H Q P+ P P P+ YP P N
Sbjct: 362 AAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNP 421
Query: 188 FPETPRKPPARYPDIARAEPRTTPTTTTPPATR 220
P P P
Sbjct: 422 QSPGTSYGP----------EPVGPVPPQPTNPY 444
>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein. This is a viral
attachment glycoprotein from region G of metaviruses. It
is high in serine and threonine suggesting it is highly
glycosylated.
Length = 183
Score = 31.1 bits (70), Expect = 1.0
Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 1/56 (1%)
Query: 180 SPPRHPN-QFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDT 234
SPP + + P TP P P+ ++T ++T P AT P+ PDT
Sbjct: 34 SPPTESSKKTPTTPTDNPDTNPNSQHPTQQSTESSTLPAATSESHLETEPTSTPDT 89
Score = 28.4 bits (63), Expect = 7.8
Identities = 20/115 (17%), Positives = 33/115 (28%), Gaps = 13/115 (11%)
Query: 137 PRREPPPRHPNQ--------YPETPQDSPRREPS----LRQPNQYPETPHEPRRDSPPR- 183
P P +HP Q T + EP+ Q + P S +
Sbjct: 51 PDTNPNSQHPTQQSTESSTLPAATSESHLETEPTSTPDTTNRQQTVDRHTTPPSSSRTQT 110
Query: 184 HPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTS 238
+ + +R TTTT + G+ S +P + T+
Sbjct: 111 TQAVHEKKNTRTTSRTQTPPTTSTAAVQTTTTTNTSSTGKEPTTTSVQPRSSATT 165
>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein. Tymoviruses
are single stranded RNA viruses. This family includes a
protein of unknown function that has been named based on
its molecular weight. Tymoviruses such as the ononis
yellow mosaic tymovirus encode only three proteins. Of
these two are overlapping this protein overlaps a larger
ORF that is thought to be the polymerase.
Length = 458
Score = 31.6 bits (72), Expect = 1.0
Identities = 31/130 (23%), Positives = 45/130 (34%), Gaps = 21/130 (16%)
Query: 107 SPDLTTRRPEVPRPRHPTHHHPE-----HHPTPESPR--REPPPRHPNQYP--------- 150
S +L P+ HP H P P P+ R+ P P + P
Sbjct: 37 SSNLPLDHPKGTSSFPQLLRHPSLRLRLHPPPPPCPQNPRDFPSLQPLELPLHHSLLRHV 96
Query: 151 -ETPQDS--PRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEP 207
ET + S P +P L + Q P H R S P H + + + AR D+ +
Sbjct: 97 HETLKVSQAPGEQPKLPRAPQLP--SHPRRHRSLPHHLHHSSKHRNRLHARRSDVLPSLS 154
Query: 208 RTTPTTTTPP 217
+ P P
Sbjct: 155 NSRPVHELPK 164
Score = 30.1 bits (68), Expect = 3.2
Identities = 27/123 (21%), Positives = 36/123 (29%), Gaps = 17/123 (13%)
Query: 107 SPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQP 166
SP+L R ++P P P P + R PS+ P
Sbjct: 351 SPNL--RSSQLPPP--TKRRLRL---LPVPPPKVQALPLT-----ALAPLVRHSPSIPLP 398
Query: 167 NQYPETPHEPRRDSPPRH--PNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGES 224
+ P S H P PR P + + P P T PPA+ S
Sbjct: 399 HPPSALPSHVGASSSKHHRLPPSVLPGPRLSS---PSPSPSLPTRRPGTPPPPASPPTPS 455
Query: 225 NPG 227
P
Sbjct: 456 PPS 458
>gnl|CDD|177618 PHA03381, PHA03381, tegument protein VP22; Provisional.
Length = 290
Score = 31.5 bits (71), Expect = 1.1
Identities = 27/140 (19%), Positives = 37/140 (26%), Gaps = 21/140 (15%)
Query: 107 SPDLTTRRPEVP---RPRHPTHHHPEH-----HPTP-------ESPRREPPPRHPNQYPE 151
P RR R + H +P E P P R P+ PE
Sbjct: 39 EPADRARRGAGQARGRSQAERRFHHYDEARADYPYYTGSSSEDERPADPRPSRRPHAQPE 98
Query: 152 TPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTP 211
P R + P R + P P R A A+
Sbjct: 99 ASGPGPARGA------RGPAGSRGRGRRAESPSPRDPPNPKGASAPRGRKSACADSAALL 152
Query: 212 TTTTPPATRRGESNPGPSDK 231
P A +R ++ G + K
Sbjct: 153 DAPAPAAPKRQKTPAGLARK 172
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 30.9 bits (70), Expect = 1.2
Identities = 21/112 (18%), Positives = 32/112 (28%), Gaps = 8/112 (7%)
Query: 128 PEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQ 187
P P PE E P + EP+ +P + P+ +P++ P+ +
Sbjct: 52 PTEEPQPEPEPPEEQP------KPPTEPETPPEPTPPKPKEKPKPEKKPKK-PKPKPKPK 104
Query: 188 FPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSY 239
P+ P P P R S S SY
Sbjct: 105 PKPKPKVKPQPKPKK-PPSKTAAKAPAAPNQPARPPSAASASGAATGPSASY 155
Score = 28.6 bits (64), Expect = 7.4
Identities = 24/110 (21%), Positives = 36/110 (32%), Gaps = 12/110 (10%)
Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETP----QDSPRRE 160
P+ T P P+P+ PE P P+ +P P+ + P S
Sbjct: 69 PPTEPETPPEPTPPKPKEKPK--PEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAA 126
Query: 161 PSLRQPNQY--PETPHEPRRDSPPRHPNQFPETPRKP---PARYPDIARA 205
+ PNQ P + + + R+ RYP ARA
Sbjct: 127 KAPAAPNQPARPPSAASASGAATGP-SASYLSGLRRAIRRAPRYPAQARA 175
>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
family. Members of this protein family occur in
Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
related Mycoplasmas in small paralogous families that
may also include truncated forms and/or pseudogenes.
Members are predicted lipoproteins with a conserved
signal peptidase II processing and lipid attachment
site. Note that the name for certain characterized
members, p72, reflects an anomalous apparent molecular
weight, given a theoretical MW of about 61 kDa.
Length = 541
Score = 31.3 bits (71), Expect = 1.4
Identities = 18/92 (19%), Positives = 34/92 (36%), Gaps = 8/92 (8%)
Query: 91 DRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYP 150
+ E + P+ PD E + + E +P+ +P ++P P +
Sbjct: 40 IKPNENT----PKIPKKPD-NKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNKEEI 94
Query: 151 ETPQDSPRR---EPSLRQPNQYPETPHEPRRD 179
E P+D P++ +P QPN +
Sbjct: 95 EKPKDEPKKPDKKPQADQPNNVHADQPNNNKV 126
>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210). This is
a family of proteins conserved in yeasts. The function
is not known. The Schizosaccharomyces pombe member is
SPBC18E5.07 and the Saccharomyces cerevisiae member is
AIM21.
Length = 671
Score = 31.4 bits (71), Expect = 1.4
Identities = 28/168 (16%), Positives = 40/168 (23%), Gaps = 21/168 (12%)
Query: 63 VLSPRNQRKKEDRIGESQPFRERTLS--------ICDRKEEKSWAVIPRYSPSPDLTTRR 114
+ SP ER + +S AV P + +R
Sbjct: 286 LSSPAPDSSSFSSPSGESGLEEREAEEPILASDEVAKEPAGESPAVSPSFEREKSEKSRH 345
Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDS-------PRREPSLRQPN 167
P+ R + + P+ R P P P+D P E S R
Sbjct: 346 ESDPKSRENSKPASIYGSVPDLIRHTPLEDVEEYEPLFPEDESEIAVKPPTEESSRRPEE 405
Query: 168 QYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTT 215
+ P E + P P TP T
Sbjct: 406 EKHRFPSEDVWEDSPSSLQDTATVST------PSNPPPRASETPEQET 447
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 31.0 bits (70), Expect = 1.4
Identities = 19/82 (23%), Positives = 24/82 (29%), Gaps = 1/82 (1%)
Query: 123 PTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQY-PETPHEPRRDSP 181
P + + P P P P P P + P PN P P +P P
Sbjct: 34 PATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPP 93
Query: 182 PRHPNQFPETPRKPPARYPDIA 203
P P P P R +
Sbjct: 94 PPVDPNAPPPPAPEPGRIDNAV 115
>gnl|CDD|225629 COG3087, FtsN, Cell division protein [Cell division and chromosome
partitioning].
Length = 264
Score = 31.0 bits (70), Expect = 1.5
Identities = 26/128 (20%), Positives = 38/128 (29%), Gaps = 18/128 (14%)
Query: 103 RYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESP----------RREPPPRHPNQYPET 152
+S L R+ VP+P P TPE + +P
Sbjct: 63 VWSYIKALEDRQIGVPQPTEPAAVKDAERLTPEQRQLLEQMEVDQKAQPTQLGEQPEQAR 122
Query: 153 PQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRK-PPARYPDIARAEPRTTP 211
++ PR + Q Q T + P P PE P+ PA P+ P+
Sbjct: 123 IEEQPRTQS---QKAQSQATTVQT----QPVKPKPRPEKPQPVAPAPAPEPVEKAPKAEA 175
Query: 212 TTTTPPAT 219
P
Sbjct: 176 APPPKPKA 183
>gnl|CDD|115071 pfam06390, NESP55, Neuroendocrine-specific golgi protein P55
(NESP55). This family consists of several mammalian
neuroendocrine-specific golgi protein P55 (NESP55)
sequences. NESP55 is a novel member of the chromogranin
family and is a soluble, acidic, heat-stable secretory
protein that is expressed exclusively in endocrine and
nervous tissues, although less widely than
chromogranins.
Length = 261
Score = 30.6 bits (68), Expect = 1.6
Identities = 29/112 (25%), Positives = 41/112 (36%), Gaps = 18/112 (16%)
Query: 128 PEHHPTPESPRREPPPRHPNQYP-----ETPQDSPRREPSLRQPNQYPETPHEPRR---D 179
PE P P P+ + + ++ SPRR P Q + EP R D
Sbjct: 152 PEDEPGPVVPKGATFHQSLTERLHALKLQSADASPRRAPPSTQEPESAREGEEPERGPLD 211
Query: 180 SPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDK 231
PR P + E ++ + P R +P+ PA RR S P K
Sbjct: 212 KDPRDPEEEEE-EKEEEKQQPH--RCKPKK-------PARRRDPSPESPPKK 253
Score = 28.7 bits (63), Expect = 7.9
Identities = 20/75 (26%), Positives = 29/75 (38%), Gaps = 3/75 (4%)
Query: 118 PRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPR 177
PR P+ PE E P R P + P E ++ +E +QP++ R
Sbjct: 186 PRRAPPSTQEPESAREGEEPERGPLDKDPRDPEEEEEE---KEEEKQQPHRCKPKKPARR 242
Query: 178 RDSPPRHPNQFPETP 192
RD P P + P
Sbjct: 243 RDPSPESPPKKGAIP 257
>gnl|CDD|233777 TIGR02205, septum_zipA, cell division protein ZipA. This model
represents the full length of bacterial cell division
protein ZipA. The N-terminal hydrophobic stretch is an
uncleaved signal-anchor sequence. This is followed by an
unconserved, variable length, low complexity region, and
then a conserved C-terminal region of about 140 amino
acids (see pfam04354) that interacts with the
tubulin-like cell division protein FtsZ [Cellular
processes, Cell division].
Length = 284
Score = 30.6 bits (69), Expect = 1.7
Identities = 15/128 (11%), Positives = 25/128 (19%), Gaps = 21/128 (16%)
Query: 92 RKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPE--------------SP 137
K + P+ + + P R HP P +
Sbjct: 34 DKAPLDRMKLKSDDPTSEEMVQPDNSPNTRVERGEHPIPQPREQHLPSISELVAYQRDKS 93
Query: 138 RREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHE-PRRDSPPRHPNQFPETP--RK 194
+ P Q P P Q P + +
Sbjct: 94 VDDEEASIPMQ----PTQQQYDMPQPNNVAQQTVEPRVAKSLPEASPQEEEVGKNLEVTA 149
Query: 195 PPARYPDI 202
PP + +
Sbjct: 150 PPKQKDKV 157
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 31.0 bits (71), Expect = 1.8
Identities = 21/128 (16%), Positives = 35/128 (27%), Gaps = 7/128 (5%)
Query: 99 AVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPP---PRHPNQYPETPQD 155
A + + +P P+ P P P+ P P R Q +
Sbjct: 378 AASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATK 437
Query: 156 SPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARY----PDIARAEPRTTP 211
+ + EP+ + + E P +K R+ P + EP TP
Sbjct: 438 AKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATP 497
Query: 212 TTTTPPAT 219
Sbjct: 498 KALKKALE 505
>gnl|CDD|191194 pfam05111, Amelin, Ameloblastin precursor (Amelin). This family
consists of several mammalian Ameloblastin precursor
(Amelin) proteins. Matrix proteins of tooth enamel
consist mainly of amelogenin but also of non-amelogenin
proteins, which, although their volumetric percentage is
low, have an important role in enamel mineralisation.
One of the non-amelogenin proteins is ameloblastin, also
known as amelin and sheathlin. Ameloblastin (AMBN) is
one of the enamel sheath proteins which is though to
have a role in determining the prismatic structure of
growing enamel crystals.
Length = 410
Score = 30.7 bits (69), Expect = 1.9
Identities = 23/89 (25%), Positives = 35/89 (39%), Gaps = 3/89 (3%)
Query: 123 PTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPP 182
P H + + P P P+ P+ P P +P+ Q +TP + PP
Sbjct: 96 PREHETQQYEYSVPVHPPPLPSQPSLQPQQPGLKPFLQPTAATAIQ--DTPQKA-GPQPP 152
Query: 183 RHPNQFPETPRKPPARYPDIARAEPRTTP 211
HP Q P + P + +A +E TP
Sbjct: 153 MHPGQLPLQEAELPEVHEQVAPSEKPPTP 181
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 30.8 bits (69), Expect = 2.0
Identities = 8/57 (14%), Positives = 10/57 (17%), Gaps = 7/57 (12%)
Query: 140 EPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPP 196
P P P +P+ PH P P
Sbjct: 208 AEPAEEEAPAPSEAGSEPAPDPA-------ARAPHAAPDPPAPAPAPAKTAAPAAAA 257
Score = 28.8 bits (64), Expect = 9.0
Identities = 9/55 (16%), Positives = 15/55 (27%), Gaps = 2/55 (3%)
Query: 166 PNQYPETPHEPRRDSPPRH-PNQFPETPRKPPARYPDIARAEPRTTPTTTTPPAT 219
P + E ++ P+ P P A A +T P +
Sbjct: 207 PAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAP-APAPAKTAAPAAAAPVS 260
>gnl|CDD|220828 pfam10630, DUF2476, Protein of unknown function (DUF2476). This is
a family of proteins of unknown function. The family is
rich in proline residues.
Length = 255
Score = 30.6 bits (68), Expect = 2.0
Identities = 19/71 (26%), Positives = 32/71 (45%), Gaps = 6/71 (8%)
Query: 132 PTPESPRREPPPRHPNQYPETPQDSPR----REPSLRQPNQYPETPHEPRRDSPPRHPNQ 187
+P + P P P+ P SPR + S+ +P P++P +P SP +P +
Sbjct: 182 FSPYRDGQVPGPWSPSASPVAESCSPRFIWSLKGSMLKP--LPDSPLQPLPPSPSPNPQE 239
Query: 188 FPETPRKPPAR 198
P +PP +
Sbjct: 240 QSPLPTRPPCK 250
>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist. LIN-8
is a nuclear protein, present at the sites of
transcriptional repressor complexes, which interacts
with LIN-35 Rb.Lin35 Rb is a product of the class B
synMuv gene lin-35 which silences genes required for
vulval specification through chromatin modification and
remodelling. The biological role of the interaction has
not yet been determined however predictions have been
made. The interaction shows that class A synMuv genes
control vulval induction through the transcriptional
regulation of gene expression. LIN-8 normally functions
as part of a protein complex however when the complex is
absent, other family members can partially replace LIN-8
activity.
Length = 316
Score = 30.4 bits (69), Expect = 2.2
Identities = 12/81 (14%), Positives = 21/81 (25%), Gaps = 2/81 (2%)
Query: 135 ESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRK 194
E P ++ ++ P+ Q P P +P + ++
Sbjct: 173 EVPDDYGANLAVDEPEQSTMSRPQEVKQSVPAQQAPPNPQQP--MPSASSESATSKSAST 230
Query: 195 PPARYPDIARAEPRTTPTTTT 215
P PR P T
Sbjct: 231 SRESSPQPQSPPPRRVPAPTV 251
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 30.7 bits (69), Expect = 2.2
Identities = 33/151 (21%), Positives = 48/151 (31%), Gaps = 10/151 (6%)
Query: 84 ERTLSICDRKEEKSWAVIPR----YSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRR 139
E + + + AV+PR Y+ DL + P P P P+
Sbjct: 350 EAPSPPIEEEPPQPKAVVPRPLSPYTAYEDL--KPPTSPIPTPPSSSPAS--SKSVDAVA 405
Query: 140 EPPPRHPNQYPETPQDSPRREP-SLRQPNQYPETPHEPRRD-SPPRHPNQFPETPRKPPA 197
+P P + + P EP + P +P+ D PP P+ T P
Sbjct: 406 KPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARYEDLKPPTSPSPTAPTGVSPSV 465
Query: 198 RYPDIARAEPRTTPTTTTPPATRRGESNPGP 228
A P T P T A +N P
Sbjct: 466 SSTSSVPAVPDTAPATAATDAAAPPPANMRP 496
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 30.5 bits (69), Expect = 2.3
Identities = 20/130 (15%), Positives = 33/130 (25%), Gaps = 16/130 (12%)
Query: 103 RYSPSPDLTTRRPEVPRPRHPTHHH----PEHHPTPESPRREPPPRHPNQYPETPQDSPR 158
+ S P + R T + R + P + E Q
Sbjct: 159 QSSRLPPRMAKHEAQERLTKKKKRGQKKSKYKKKTFKPKRAKSIP-DFEKLHEKFQKQLA 217
Query: 159 R-----EPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTT 213
P++ +P + E+ R + E KP R P +T
Sbjct: 218 EKKKSKRPTVPEPFNFQESHKSSSRTYLDQENISAGEENLKP------TRRKLPPSTKKW 271
Query: 214 TTPPATRRGE 223
+ R E
Sbjct: 272 ESLVKFLRTE 281
>gnl|CDD|193258 pfam12782, Innate_immun, Invertebrate innate immunity transcript
family. The immune response of the purple sea urchin
appears to be more complex than previously believed in
that it uses immune-related gene families homologous to
vertebrate Toll-like and NOD/NALP-like receptor families
as well as C-type lectins and a rudimentary complement
system. In addition, the species also produces this
unusual family of mRNAs, also known as 185/333, which is
strongly upregulated in response to pathogen challenge.
Length = 312
Score = 30.4 bits (68), Expect = 2.3
Identities = 33/153 (21%), Positives = 45/153 (29%), Gaps = 22/153 (14%)
Query: 78 ESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTP--- 134
E QPF +R S EE R P R + T
Sbjct: 159 EEQPFGQRNES----SEEDG-----RPHPHHHRHHGHNHFGRKPFGGRPFGRRNHTEGHQ 209
Query: 135 -ESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRD---------SPPRH 184
+ + P RH N+ + QD P E + N + P R SP R
Sbjct: 210 GHNETGDHPHRHHNKTGDGDQDRPMFEMRPFRFNPFGRKPFGDRPFDRRNGTEEGSPRRD 269
Query: 185 PNQFPETPRKPPARYPDIARAEPRTTPTTTTPP 217
++ P R + P T TT+ P
Sbjct: 270 GHRRPYGNRGRWGENESEEKEHPTTESVTTSSP 302
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 30.5 bits (69), Expect = 2.5
Identities = 12/68 (17%), Positives = 13/68 (19%), Gaps = 2/68 (2%)
Query: 105 SPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLR 164
PS P P P P P P P P P R
Sbjct: 384 PPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPA--AAAPPARSADPA 441
Query: 165 QPNQYPET 172
+
Sbjct: 442 AAASAGDR 449
Score = 29.3 bits (66), Expect = 5.5
Identities = 15/63 (23%), Positives = 18/63 (28%), Gaps = 2/63 (3%)
Query: 170 PETPHEPRRDSPPRHPNQFPET--PRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPG 227
P P P P P P PPA A A P P A ++P
Sbjct: 382 PAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPARSADPA 441
Query: 228 PSD 230
+
Sbjct: 442 AAA 444
Score = 28.9 bits (65), Expect = 7.7
Identities = 20/107 (18%), Positives = 23/107 (21%), Gaps = 22/107 (20%)
Query: 98 WAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPR---HPNQYPETPQ 154
W +P+ P VP P P P PP P
Sbjct: 389 WGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPARSADPAAAASAGD 448
Query: 155 -----------DSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPE 190
P SL Q + P S FPE
Sbjct: 449 RWRAFVAFVKGKKPALGASLEQGS--------PLGVSAGLLEIGFPE 487
>gnl|CDD|215908 pfam00413, Peptidase_M10, Matrixin. The members of this family are
enzymes that cleave peptides. These proteases require
zinc for catalysis.
Length = 159
Score = 29.5 bits (67), Expect = 2.6
Identities = 9/15 (60%), Positives = 10/15 (66%)
Query: 7 ELPEDDKMAIQSLYG 21
L +DD IQSLYG
Sbjct: 145 RLDQDDIKGIQSLYG 159
>gnl|CDD|222579 pfam14179, YppG, YppG-like protein. The YppG-like protein family
includes the B. subtilis YppG protein, which is
functionally uncharacterized. This family of proteins is
found in bacteria. Proteins in this family are typically
between 115 and 181 amino acids in length. There are two
completely conserved residues (F and G) that may be
functionally important.
Length = 110
Score = 28.5 bits (64), Expect = 3.0
Identities = 15/56 (26%), Positives = 23/56 (41%)
Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYP 170
P + + P P H P P P + + P PQ P++ P +QP Q+
Sbjct: 11 PPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPKQSPQQQQPPQFS 66
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 29.4 bits (66), Expect = 3.2
Identities = 21/77 (27%), Positives = 28/77 (36%), Gaps = 4/77 (5%)
Query: 120 PRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRD 179
P+ P P H + +P P Q P PQ +P Q P+ P P
Sbjct: 69 PQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPL---QPPQPQQPMQPQPPVHPIPP 125
Query: 180 SPPRHPNQFPETPRKPP 196
PP+ P P P +P
Sbjct: 126 LPPQPPLP-PMFPMQPL 141
Score = 29.0 bits (65), Expect = 4.0
Identities = 25/86 (29%), Positives = 33/86 (38%), Gaps = 4/86 (4%)
Query: 101 IPRYSPSPDLTTRRPEVPRPRH----PTHHHPEHHPTPESPRREPPPRHPNQYPETPQDS 156
IP + ++P +P P PT HH + P P +P P P Q + Q
Sbjct: 58 IPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQ 117
Query: 157 PRREPSLRQPNQYPETPHEPRRDSPP 182
P P P Q P P P + PP
Sbjct: 118 PPVHPIPPLPPQPPLPPMFPMQPLPP 143
Score = 28.6 bits (64), Expect = 4.9
Identities = 27/92 (29%), Positives = 37/92 (40%), Gaps = 16/92 (17%)
Query: 123 PTHHHPEHHPTPESPRREP-PPRHP-----NQYPETPQDSPRREPSLRQPNQYPETP--- 173
PTH HH P P ++P P+ P Q+ TP + +P+L QP Q P P
Sbjct: 48 PTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTP--TQHHQPNLPQPAQQPFQPQPL 105
Query: 174 -----HEPRRDSPPRHPNQFPETPRKPPARYP 200
+P + PP HP P +P
Sbjct: 106 QPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFP 137
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 30.1 bits (68), Expect = 3.3
Identities = 20/95 (21%), Positives = 28/95 (29%), Gaps = 3/95 (3%)
Query: 141 PPPRHPNQY--PETPQDSPRREPSLR-QPNQYPETPHEPRRDSPPRHPNQFPETPRKPPA 197
R P Q+ P Q + +PS + P + S P+ Q TP
Sbjct: 370 SGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSV 429
Query: 198 RYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKP 232
P P +T PA + E S
Sbjct: 430 DPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVS 464
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 29.6 bits (66), Expect = 3.7
Identities = 17/43 (39%), Positives = 23/43 (53%), Gaps = 4/43 (9%)
Query: 180 SPPRHPNQFPETPR-KPPARYPDIARAE-PRTTPTTTTPPATR 220
S P P+ E P+ KP A P + E P+ TP TTP +T+
Sbjct: 43 STP--PSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSSTK 83
Score = 29.2 bits (65), Expect = 5.9
Identities = 13/47 (27%), Positives = 20/47 (42%), Gaps = 5/47 (10%)
Query: 190 ETPRKPPARYPDIARAEPRTTPTTTTPPATR--RGESNPGPSDKPDT 234
E P+ P P P++ P TTPP+T+ + P + T
Sbjct: 39 EAPQSTP---PSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSST 82
>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain. Syndecans are transmembrane
heparin sulfate proteoglycans which are implicated in
the binding of extracellular matrix components and
growth factors.
Length = 207
Score = 29.3 bits (66), Expect = 3.7
Identities = 9/57 (15%), Positives = 14/57 (24%), Gaps = 8/57 (14%)
Query: 111 TTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPN 167
+T+ T P T E+ ETP + + N
Sbjct: 97 STKTSPTVSTTVTTTTSPSETDTEEATTTVS--------TETPTEGGSSAATDPSKN 145
>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
Length = 416
Score = 30.0 bits (67), Expect = 3.8
Identities = 23/103 (22%), Positives = 28/103 (27%), Gaps = 15/103 (14%)
Query: 104 YSPSPDLTTRRPEVPRPRHPTHHHPEHHP-TPESPRREPPPRHPNQYPETPQDSPRREPS 162
Y P P + P P PE P +P RE P + EP
Sbjct: 264 YEPPPAPSGGSPA-PPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGGEPK 322
Query: 163 LR---------QPNQYPETPHEPRRDSPPRHPNQFPETPRKPP 196
+P +P PP P P PR P
Sbjct: 323 PGPPRPAPDADRPEGWPSLEAITF---PPPTPAT-PAVPRARP 361
>gnl|CDD|220260 pfam09483, HpaP, Type III secretion protein (HpaP). This entry
represents proteins encoded by genes which are always
found in type III secretion operons, although their
function in the processes of secretion and virulence is
unclear. Hpa stands for Hrp-associated gene, where Hrp
stands for hypersensitivity response and virulence. see
also PMID:18584024.
Length = 185
Score = 29.0 bits (65), Expect = 4.0
Identities = 17/98 (17%), Positives = 23/98 (23%), Gaps = 7/98 (7%)
Query: 155 DSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTT 214
+ R + R + P P P P PR P
Sbjct: 3 AGAAPRAARRSFDYAR-----LMRRAARAGPPGTPA-PPGPAEDAHPEFPERPRDAPAPP 56
Query: 215 TPPATRRGESNPGPS-DKPDTCDTSYDAIAVIRTEVFV 251
PP G+ +P P D A + V
Sbjct: 57 APPRATDGDRDPQPLADALAEAMARAAAARFAAGQRMV 94
>gnl|CDD|236779 PRK10864, PRK10864, putative methyltransferase; Provisional.
Length = 346
Score = 29.7 bits (67), Expect = 4.1
Identities = 21/84 (25%), Positives = 29/84 (34%), Gaps = 2/84 (2%)
Query: 91 DRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREP-PPRHPNQY 149
D + KS V Y S D + +R PR P + RR R+
Sbjct: 3 DELKGKSGKVKVMYVRSDDDSDKRTHNPRTGKGGGR-PSGKSRADGGRRPARDDRNSQSR 61
Query: 150 PETPQDSPRREPSLRQPNQYPETP 173
+DSP R S ++ PE
Sbjct: 62 DRKWEDSPWRTVSRAPGDETPEKA 85
>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509). This
family consists of several uncharacterized viral
proteins from the Marek's disease-like viruses. Members
of this family are typically around 400 residues in
length. The function of this family is unknown.
Length = 377
Score = 29.6 bits (66), Expect = 4.1
Identities = 20/81 (24%), Positives = 26/81 (32%), Gaps = 5/81 (6%)
Query: 94 EEKSWAVIPRYSPSPDLTTRRPEVPRPRH-PTHHHPEHHPTPESPRREPPPRHPNQYPET 152
+E A P P R P P + P E+P PPR P+
Sbjct: 191 DEPDAAAQRPPQPGPSGQNRSPRTPTLSNVRVLDAPVATNRGEAP---SPPRTDTLDPDP 247
Query: 153 PQDSPRREPSLRQPNQYPETP 173
P R R P+ P +P
Sbjct: 248 AIAGPSR-AVNRTPSPRPSSP 267
Score = 29.6 bits (66), Expect = 4.8
Identities = 36/195 (18%), Positives = 55/195 (28%), Gaps = 19/195 (9%)
Query: 57 TNLSTEVLSPRNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPE 116
T + P R + + P R + + + + PS E
Sbjct: 122 TPIPCFAEVPVFPRPYQSSGDDDGPSTSRGSGVARVRPTVIQHRVDKTRPS------DYE 175
Query: 117 VPRPRHPTHHHPEHHPTPE---SPRREPPPRHPNQYPETP---------QDSPRREPSLR 164
RPR +P P+ +P P N+ P TP
Sbjct: 176 NHRPRPFAMANPSWVDEPDAAAQRPPQPGPSGQNRSPRTPTLSNVRVLDAPVATNRGEAP 235
Query: 165 QPNQYPETPHEPRRDSPPRHPNQFPE-TPRKPPARYPDIARAEPRTTPTTTTPPATRRGE 223
P + +P P R N+ P P PP + A+P T R G
Sbjct: 236 SPPRTDTLDPDPAIAGPSRAVNRTPSPRPSSPPPEIDEEYNAQPDPWGTNAQTWDNRAGT 295
Query: 224 SNPGPSDKPDTCDTS 238
+ G + D+ S
Sbjct: 296 PDDGSAISEDSAGGS 310
>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain. This region of the APC
family of proteins is known as the basic domain. It
contains a high proportion of positively charged amino
acids and interacts with microtubules.
Length = 359
Score = 29.7 bits (66), Expect = 4.1
Identities = 24/110 (21%), Positives = 40/110 (36%), Gaps = 10/110 (9%)
Query: 128 PEHHPTPESPRREPPP-----RHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPP 182
P + +P ++ PP P + P Q R +P++ E PR +P
Sbjct: 14 PANRSQSTTPSKKGPPLKTQPSDPPKSPSPGQQRSRSLHRPAKPSELAELSPPPRSATP- 72
Query: 183 RHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKP 232
P + +TP ++ ++ PR P T R PGP +
Sbjct: 73 --PARLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAG--RNSILPGPGNSL 118
>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1. Mating in
fungi is controlled by the loci that determine the
mating type of an individual, and only individuals with
differing mating types can mate. Basidiomycete fungi
have evolved a unique mating system, termed tetrapolar
or bifactorial incompatibility, in which mating type is
determined by two unlinked loci; compatibility at both
loci is required for mating to occur. The multi-allelic
tetrapolar mating system is considered to be a novel
innovation that could have only evolved once, and is
thus unique to the mushroom fungi. This domain is
C-terminal to the homeodomain transcription factor
region.
Length = 418
Score = 29.8 bits (67), Expect = 4.3
Identities = 26/122 (21%), Positives = 37/122 (30%), Gaps = 19/122 (15%)
Query: 113 RRPEVPRPRHPTHHHPEHHPTPE-----SPRREPPPRHPNQYPETPQDSP----RREPSL 163
RR + PR + +P+PE S R P SP RR S
Sbjct: 53 RRRQTPRQSRRSKRAAHAYPSPERSPALSSERLLSPSPSVLDLSPVLASPQTGKRRRSSS 112
Query: 164 RQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYP--------DIARAEPRTTPTTTT 215
++ R S + P P P A P +++ A PT +
Sbjct: 113 PSDDEDEAERPSKRPRSDSISSSSSPAKP--PEACLPSPAASTQDELSEASAAPLPTPSL 170
Query: 216 PP 217
P
Sbjct: 171 SP 172
>gnl|CDD|215581 PLN03109, PLN03109, ETHYLENE-INSENSITIVE3-like3 protein;
Provisional.
Length = 599
Score = 29.8 bits (67), Expect = 4.8
Identities = 18/88 (20%), Positives = 27/88 (30%), Gaps = 14/88 (15%)
Query: 111 TTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHP---NQYPETPQDSPRRE-----PS 162
+ PE R P P+ T + +R+ P Q E Q+ P E P
Sbjct: 347 VAQEPERARDDAPNQVVPDKEKTKKPRKRKRPRGRSTVAEQEVEVTQEHPPAESRNALPD 406
Query: 163 LRQ------PNQYPETPHEPRRDSPPRH 184
+ Q T HE + +
Sbjct: 407 MNHVDAQGMEYQITGTSHENDTVTALQA 434
>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824). This is
a repeating domain found in fungal proteins. It is
proline-rich, and the function is not known.
Length = 135
Score = 28.3 bits (63), Expect = 4.9
Identities = 28/121 (23%), Positives = 35/121 (28%), Gaps = 13/121 (10%)
Query: 65 SPRNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPT 124
S R +RKK +R ER +R A Y D T P P
Sbjct: 24 SQRRERKKAER--------ER-----ERYRHDHDAYSDSYEEPYDPTPYPPSPPVSDPRY 70
Query: 125 HHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRH 184
+ + + P P PPP Y P Q YP P + PR
Sbjct: 71 YPNSNYFPPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGPGQDPYAPRPRR 130
Query: 185 P 185
Sbjct: 131 A 131
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 29.3 bits (66), Expect = 4.9
Identities = 13/69 (18%), Positives = 22/69 (31%), Gaps = 7/69 (10%)
Query: 128 PEHHPTPESPRREPPPRHPNQYPET---PQDSPRREPSLRQPNQYPETPHEPRRDSPPRH 184
P H + SP PP + P T P S P +P +R+ +
Sbjct: 310 PRHLSSDSSP---SPPDTSDSDPSTETPPPASLSHSPPAAFERPLALSPKR-KREGDKKQ 365
Query: 185 PNQFPETPR 193
+ + +
Sbjct: 366 KKKKSKKLK 374
>gnl|CDD|173181 PRK14718, PRK14718, ribonuclease III; Provisional.
Length = 467
Score = 29.4 bits (65), Expect = 4.9
Identities = 31/156 (19%), Positives = 45/156 (28%), Gaps = 19/156 (12%)
Query: 70 RKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRP-----EVPRPRHPT 124
R + + GE++ R + A +P D R P V R H
Sbjct: 299 RADDAKAGETKAGEARASADAKAGAHTHAAAMPAVPEQADDAARSPATTPVAVIRAAHVE 358
Query: 125 HHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRRE-------------PSLRQPNQYPE 171
H + P P +P ++ PE D P E + E
Sbjct: 359 HGLDKGEPRASKPAEKPAAA-TDKPPEKASDKPSPEKTSEKTPDKSHEKQLDKSSEPVAE 417
Query: 172 TPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEP 207
+ D P ET +PP + AEP
Sbjct: 418 KALDKTADKPDAAARLPAETADRPPRARDASSSAEP 453
>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
Length = 463
Score = 29.3 bits (65), Expect = 5.0
Identities = 14/49 (28%), Positives = 20/49 (40%), Gaps = 2/49 (4%)
Query: 150 PETPQDSPRREPSLRQPNQYPETP--HEPRRDSPPRHPNQFPETPRKPP 196
PET P +Q + P +P+ S P P Q + P+ PP
Sbjct: 182 PETTDPKPSPPAEDKQKPKVESAPVAEKPKAPSSPPPPKQSAKEPQLPP 230
Score = 28.6 bits (63), Expect = 8.6
Identities = 18/75 (24%), Positives = 27/75 (36%), Gaps = 11/75 (14%)
Query: 134 PESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPR 193
PE+ +P P P + P+ E + E P P PP+ + P+ P
Sbjct: 182 PETTDPKPSP------PAEDKQKPKVESA-----PVAEKPKAPSSPPPPKQSAKEPQLPP 230
Query: 194 KPPARYPDIARAEPR 208
K R + R R
Sbjct: 231 KERERRVPMTRLRKR 245
Score = 28.6 bits (63), Expect = 9.9
Identities = 21/75 (28%), Positives = 26/75 (34%), Gaps = 6/75 (8%)
Query: 96 KSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHP--EHHPTPESPRREPPPRHPNQYPETP 153
KS + +PS + P P P E P E P+ P P Q + P
Sbjct: 167 KSEDAASQVTPSQKIPETTDPKPSPPAEDKQKPKVESAPVAEKPKAPSSPPPPKQSAKEP 226
Query: 154 QDSP----RREPSLR 164
Q P RR P R
Sbjct: 227 QLPPKERERRVPMTR 241
>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
Length = 432
Score = 29.3 bits (66), Expect = 5.2
Identities = 29/112 (25%), Positives = 37/112 (33%), Gaps = 22/112 (19%)
Query: 132 PTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQF-PE 190
PTP ++E N + P R P+L E+ E R Q+ E
Sbjct: 93 PTPVETQQEEKA--VNPFTPQPGQREERRPTL-------ESEEEWRARLKREQEEQYLRE 143
Query: 191 TPRKPPARYP------------DIARAEPRTTPTTTTPPATRRGESNPGPSD 230
R+ AR DI + E RT T T T S G SD
Sbjct: 144 RQRQRMARLQANAAAYDSPLAVDIGKVEKRTADTNDTATQTTNATSTAGASD 195
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 29.1 bits (65), Expect = 5.8
Identities = 13/66 (19%), Positives = 18/66 (27%), Gaps = 1/66 (1%)
Query: 172 TPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDK 231
P S P P +P + P A E ++ + S PS
Sbjct: 85 APPAATPTSAPTPTPSPPASPASGMSAAPASAV-EEKSPSEESATATAPESPSTSVPSSG 143
Query: 232 PDTCDT 237
D T
Sbjct: 144 SDAAST 149
>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
Length = 1000
Score = 29.3 bits (65), Expect = 6.1
Identities = 25/125 (20%), Positives = 36/125 (28%), Gaps = 15/125 (12%)
Query: 77 GESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPES 136
G + P+ R R W+ P + + P PRP H +
Sbjct: 792 GYAGPWGLRAQHPRYRHSWAYWSQYPGHGH-----PQGPWAPRPPHLPPQWDGSAGHGQD 846
Query: 137 PRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKP- 195
+ P P Q S + Q + P SP P P +P
Sbjct: 847 QVSQFPHLQSETGPPRLQLSQVPQLPYSQT---LVSSSAPSWSSPQ------PRAPIRPI 897
Query: 196 PARYP 200
P R+P
Sbjct: 898 PTRFP 902
>gnl|CDD|114645 pfam05934, MCLC, Mid-1-related chloride channel (MCLC). This
family consists of several mid-1-related chloride
channels. mid-1-related chloride channel (MCLC) proteins
function as a chloride channel when incorporated in the
planar lipid bilayer.
Length = 577
Score = 29.3 bits (65), Expect = 6.2
Identities = 20/139 (14%), Positives = 36/139 (25%), Gaps = 10/139 (7%)
Query: 106 PSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQ 165
P R E+ R + +PE + R P +
Sbjct: 435 PYAKTYEGRREILRDNDVDLRFQTGNKSPEILKAFD----------LDDADAREHPPVVP 484
Query: 166 PNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESN 225
++ P +P+ + E ++ I + T P A + +
Sbjct: 485 SHKSPTADADPKDTGGIKGEGTGEELSQEDHQIAKPIKESGNDERGNTEGPEAAEKAQLK 544
Query: 226 PGPSDKPDTCDTSYDAIAV 244
+ PD T A V
Sbjct: 545 SEAAGSPDQGSTYSPARGV 563
>gnl|CDD|111993 pfam03157, Glutenin_hmw, High molecular weight glutenin subunit.
Members of this family include high molecular weight
subunits of glutenin. This group of gluten proteins is
thought to be largely responsible for the elastic
properties of gluten, and hence, doughs. Indeed,
glutenin high molecular weight subunits are classified
as elastomeric proteins, because the glutenin network
can withstand significant deformations without breaking,
and return to the original conformation when the stress
is removed. Elastomeric proteins differ considerably in
amino acid sequence, but they are all polymers whose
subunits consist of elastomeric domains, composed of
repeated motifs, and non-elastic domains that mediate
cross-linking between the subunits. The elastomeric
domain motifs are all rich in glycine residues in
addition to other hydrophobic residues. High molecular
weight glutenin subunits have an extensive central
elastomeric domain, flanked by two terminal non-elastic
domains that form disulphide cross-links. The central
elastomeric domain is characterized by the following
three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It
possesses overlapping beta-turns within and between the
repeated motifs, and assumes a regular helical secondary
structure with a diameter of approx. 1.9 nm and a pitch
of approx. 1.5 nm.
Length = 779
Score = 29.3 bits (64), Expect = 6.6
Identities = 14/75 (18%), Positives = 27/75 (36%)
Query: 126 HHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHP 185
H+ P ++ + P Q + Q ++P+ Q P +P + P +P
Sbjct: 244 HYLASPQQPGQGQQPGQLQQPAQGQQPEQGQQGQQPAQGQQGHQPAQGQQPGQGQPGHYP 303
Query: 186 NQFPETPRKPPARYP 200
+ + P YP
Sbjct: 304 ASPQQPGQGQPGHYP 318
>gnl|CDD|206085 pfam13914, Phostensin, Phostensin PP1-binding and SH3-binding
region. Phostensin has been identified as a PP1
regulatory protein binding PP1 at the KISF motif. The
domain also appears to carry an incomplete incomplete
SH3-binding domain PxRxP further upstream. It is likely
that Phostensin targets PP1 to the F-actin cytoskeleton.
Phostensin binds to actin and decreases the elongation
and depolymerisation rates of actin filament pointed
ends.
Length = 132
Score = 28.1 bits (62), Expect = 6.7
Identities = 17/83 (20%), Positives = 23/83 (27%), Gaps = 2/83 (2%)
Query: 115 PEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPH 174
P+ +P HP + R N + P+ P P Q N ET
Sbjct: 2 PQAAKPPGSPLPSRHLHPAKPGHQSGLQRRGGNTFTVVPKRKPGSLPHGSQAN--GETAP 59
Query: 175 EPRRDSPPRHPNQFPETPRKPPA 197
+P D R P
Sbjct: 60 QPAEDEEAISLLGPTGKKRYPTV 82
>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
Length = 226
Score = 28.7 bits (64), Expect = 6.7
Identities = 20/94 (21%), Positives = 24/94 (25%), Gaps = 3/94 (3%)
Query: 147 NQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQF---PETPRKPPARYPDIA 203
E S + + P E S P + E PPA
Sbjct: 79 GDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAAARD 138
Query: 204 RAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDT 237
P T TP R P S +P T T
Sbjct: 139 GPTPDPTAQPATPDERRSPRQRPPVSGEPPTPST 172
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 27.9 bits (62), Expect = 6.8
Identities = 13/53 (24%), Positives = 16/53 (30%), Gaps = 3/53 (5%)
Query: 166 PNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPA 218
P P P +PP + P P PP P + A P P
Sbjct: 81 PLTPPAPPEPV---TPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPP 130
>gnl|CDD|132858 cd07219, Pat_PNPLA1, Patatin-like phospholipase domain containing
protein 1. Members of this family share a patatin
domain, initially discovered in potato tubers. Some
members of PNPLA1 subfamily do not have the lipase
consensus sequence Gly-X-Ser-X-Gly which is essential
for hydrolase activity. This family includes PNPLA1
from Homo sapiens and Gallus gallus. Currently, there is
no literature available on the physiological role,
structure, or enzymatic activity of PNPLA1. It is
expressed in various human tissues in low mRNA levels.
Length = 382
Score = 29.1 bits (65), Expect = 6.8
Identities = 23/107 (21%), Positives = 33/107 (30%), Gaps = 10/107 (9%)
Query: 123 PTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPP 182
P P S E + +P L P PE+ E +SP
Sbjct: 286 PQRSQPSLQDGQASL--EESWQPHLARAPKGDGRGLHDPPLSPPLAAPESTAEWVVESPV 343
Query: 183 RHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPS 229
P E+ P D++ P + P + P+ S PG S
Sbjct: 344 SSPASPLESSPSLPGSLTDLS---PASLPAVHSLPS-----STPGLS 382
>gnl|CDD|220233 pfam09422, WTX, WTX protein. The WTX protein is found to be
inactivated in one third of Wilms tumours. The WTX
protein is functionally uncharacterized.
Length = 467
Score = 29.2 bits (65), Expect = 6.9
Identities = 21/131 (16%), Positives = 31/131 (23%), Gaps = 15/131 (11%)
Query: 116 EVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRRE----PSLRQP----- 166
E + P +E PR + PQD+ + +L P
Sbjct: 103 EEAKEPEGGLILPSSLTASLECVKEETPRAAREENAPPQDADGGKVSPASALEAPGTSCV 162
Query: 167 ---NQYPETPHEPRRDSPPRHPN-QFPETPRKPPARYPDIARAEPRTT-PTTTTPPATRR 221
+ + P P P +A T T P
Sbjct: 163 ECGDPAKPGSESSAFEDPGPGLGADSLCQPLTPETGEKVVAEDASITGDVPVKTVPPVDS 222
Query: 222 GESNPGPSDKP 232
+P PSD
Sbjct: 223 STVDP-PSDPS 232
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 29.0 bits (64), Expect = 7.7
Identities = 23/114 (20%), Positives = 31/114 (27%), Gaps = 6/114 (5%)
Query: 104 YSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSL 163
+ +PD T V T H E H T P P + D P+
Sbjct: 422 FHKAPDTTKSVIFVY-----TLVHVEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTG 476
Query: 164 RQPNQYPETPHEPRRDSPPRHPNQFPE-TPRKPPARYPDIARAEPRTTPTTTTP 216
+ PE R + P P A P + T+ TP
Sbjct: 477 TTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTP 530
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 28.9 bits (64), Expect = 7.9
Identities = 25/92 (27%), Positives = 35/92 (38%), Gaps = 11/92 (11%)
Query: 152 TPQDSPRREPSLRQPNQYPETPHEP------RRDSPPRHPNQFPETPRKPPARYPDIARA 205
T Q + R+ + + P T E + S P+ Q PA + ++
Sbjct: 62 TTQAANTRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQT 121
Query: 206 EPR-TTPTT--TTPPATRRGESNPGPSDKPDT 234
TTP T TTPP+T P S K DT
Sbjct: 122 TTESTTPKTKVTTPPST--NTPQPMQSTKSDT 151
>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor. This family is secreted by
gram-negative Gammaproteobacteria such as Pseudomonas
syringae of tomato and the fire blight plant pathogen
Erwinia amylovora, amongst others. It is an essential
pathogenicity factor of approximately 198 kDa. Its
injection into the host-plant is dependent upon the
bacterial type III or Hrp secretion system. The family
is long and carries a number of predicted functional
regions, including an ERMS or endoplasmic reticulum
membrane retention signal at both the C- and the
N-termini, a leucine-zipper motif from residues 539-560,
and a nuclear localisation signal at 1358-1361. this
conserved AvrE-family of effectors is among the few that
are required for full virulence of many phytopathogenic
pseudomonads, erwinias and pantoeas.
Length = 1771
Score = 29.0 bits (65), Expect = 7.9
Identities = 13/84 (15%), Positives = 26/84 (30%)
Query: 93 KEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPET 152
KEE + + +P + + + E+ + R PP P+ P
Sbjct: 161 KEEGTSSKMPATALASAALFKDDEIRQEVDAARSDQASQSRLSRSRGNPPAIPPDAAPRQ 220
Query: 153 PQDSPRREPSLRQPNQYPETPHEP 176
P + ++ E +P
Sbjct: 221 PMLTRSAGGRFEGEDENLERNLQP 244
>gnl|CDD|147567 pfam05454, DAG1, Dystroglycan (Dystrophin-associated glycoprotein
1). Dystroglycan is one of the dystrophin-associated
glycoproteins, which is encoded by a 5.5 kb transcript
in human. The protein product is cleaved into two
non-covalently associated subunits, [alpha] (N-terminal)
and [beta] (C-terminal). In skeletal muscle the
dystroglycan complex works as a transmembrane linkage
between the extracellular matrix and the cytoskeleton.
[alpha]-dystroglycan is extracellular and binds to
merosin ([alpha]-2 laminin) in the basement membrane,
while [beta]-dystroglycan is a transmembrane protein and
binds to dystrophin, which is a large rod-like
cytoskeletal protein, absent in Duchenne muscular
dystrophy patients. Dystrophin binds to intracellular
actin cables. In this way, the dystroglycan complex,
which links the extracellular matrix to the
intracellular actin cables, is thought to provide
structural integrity in muscle tissues. The dystroglycan
complex is also known to serve as an agrin receptor in
muscle, where it may regulate agrin-induced
acetylcholine receptor clustering at the neuromuscular
junction. There is also evidence which suggests the
function of dystroglycan as a part of the signal
transduction pathway because it is shown that Grb2, a
mediator of the Ras-related signal pathway, can interact
with the cytoplasmic domain of dystroglycan. In general,
aberrant expression of dystrophin-associated protein
complex underlies the pathogenesis of Duchenne muscular
dystrophy, Becker muscular dystrophy and severe
childhood autosomal recessive muscular dystrophy.
Interestingly, no genetic disease has been described for
either [alpha]- or [beta]-dystroglycan. Dystroglycan is
widely distributed in non-muscle tissues as well as in
muscle tissues. During epithelial morphogenesis of
kidney, the dystroglycan complex is shown to act as a
receptor for the basement membrane. Dystroglycan
expression in mouse brain and neural retina has also
been reported. However, the physiological role of
dystroglycan in non-muscle tissues has remained unclear.
Length = 290
Score = 28.4 bits (63), Expect = 8.2
Identities = 30/125 (24%), Positives = 45/125 (36%), Gaps = 12/125 (9%)
Query: 67 RNQRKKEDRIGESQPFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHH 126
R +RK + + + F ++ + I E P S L +P +P P +P +
Sbjct: 171 RKKRKGKLTLEDQATFIKKGVPIIFADELDDSKPPPSSSMPLILKEEKPPLPPPEYPNQN 230
Query: 127 HPEHHPTPE------SPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDS 180
PE P + +P R+ P P P P +P R N P R
Sbjct: 231 VPETTPLNQDLLGEYTPLRDEDPNAPPYQPPPPFTTPMEGKGSRPKNM------TPYRSP 284
Query: 181 PPRHP 185
PP P
Sbjct: 285 PPYVP 289
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 28.0 bits (61), Expect = 8.2
Identities = 13/59 (22%), Positives = 17/59 (28%)
Query: 124 THHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPP 182
T E E E P E ++ E +P PE EP + P
Sbjct: 58 TDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEP 116
Score = 27.6 bits (60), Expect = 9.0
Identities = 11/59 (18%), Positives = 14/59 (23%)
Query: 118 PRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEP 176
P PE E E P ++ +P PE EP
Sbjct: 62 DEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEPEP 120
>gnl|CDD|216263 pfam01044, Vinculin, Vinculin family.
Length = 850
Score = 28.9 bits (65), Expect = 8.3
Identities = 17/62 (27%), Positives = 20/62 (32%), Gaps = 6/62 (9%)
Query: 142 PPRHP---NQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPAR 198
P R N+ P E R+ P P PP P PET R+ R
Sbjct: 607 PIRQAVLMNRSPPELPTDSEFE---REDEPADPRPPPPVETEPPPRPPPPPETDREEMRR 663
Query: 199 YP 200
P
Sbjct: 664 LP 665
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 28.7 bits (64), Expect = 8.3
Identities = 20/75 (26%), Positives = 28/75 (37%), Gaps = 7/75 (9%)
Query: 158 RREPSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPP 217
R++ +L QP P P P P P PPA P + PT +PP
Sbjct: 136 RKKEALPQP-----PPPAPVVMMQP--PPPHAMPPASPPAAQPAPSAPASSPPPTPASPP 188
Query: 218 ATRRGESNPGPSDKP 232
+ +S+ P P
Sbjct: 189 PAKAPKSSHPPLKSP 203
>gnl|CDD|222914 PHA02666, PHA02666, hypothetical protein; Provisional.
Length = 287
Score = 28.7 bits (63), Expect = 8.3
Identities = 20/135 (14%), Positives = 29/135 (21%), Gaps = 7/135 (5%)
Query: 102 PRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREP 161
R + + R+ R P + P RH Q+
Sbjct: 31 HRRRANSMESRRKSRPSRQHRSAERTPTTASSLTHENNTAPSRHGKQHSCKASSRSSHN- 89
Query: 162 SLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRR 221
R H R H + R Y ++ P T R
Sbjct: 90 --RGSTSSSHNHHAHRGPHQSAHRRSKHDAVRDT---YQPCPQSPETDLYKGRLPGETER 144
Query: 222 GESNPGP-SDKPDTC 235
P D P+
Sbjct: 145 HYETPDHIYDVPEDV 159
>gnl|CDD|180523 PRK06305, PRK06305, DNA polymerase III subunits gamma and tau;
Validated.
Length = 451
Score = 28.6 bits (64), Expect = 8.6
Identities = 9/60 (15%), Positives = 14/60 (23%), Gaps = 9/60 (15%)
Query: 132 PTPESPRREPPP---RHPNQYPETPQDSPRREPSLRQ------PNQYPETPHEPRRDSPP 182
P R E P + N P+ L Q + + + P
Sbjct: 365 PAFTQLRLESCPPQIKSSNSQQTVSSPQPQPVAKLEQGSLLTAASPQTKKETIHQEVPSP 424
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 29.0 bits (65), Expect = 8.8
Identities = 22/118 (18%), Positives = 31/118 (26%), Gaps = 7/118 (5%)
Query: 117 VPRPRHPTHHHPEHHPTPESPR-REPPPRHPNQYPET-PQDSPRREPSLR--QPNQYPET 172
P P+++ E+ P P T R +R +
Sbjct: 635 FPPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARGDPVRPTAHHAALRA 694
Query: 173 PHEPRRDSPPRHPNQFPETPRKPPARYPD---IARAEPRTTPTTTTPPATRRGESNPG 227
P PR PP P P P A P + T P PP + +
Sbjct: 695 PQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGAEQADGAEN 752
>gnl|CDD|234080 TIGR02987, met_A_Alw26, type II restriction m6 adenine DNA
methyltransferase, Alw26I/Eco31I/Esp3I family. Members
of this family are the m6-adenine DNA methyltransferase
protein, or domain of a fusion protein that also carries
m5 cytosine methyltransferase activity, of type II
restriction systems of the Alw26I/Eco31I/Esp3I family. A
methyltransferase of this family is alway accompanied by
a type II restriction endonuclease from the
Alw26I/Eco31I/Esp3I family (TIGR02986) and by an
adenine-specific modification methyltransferase. Members
of this family are unusual in that regions of similarity
to homologs outside this family are circularly permuted
[DNA metabolism, Restriction/modification].
Length = 524
Score = 28.6 bits (64), Expect = 9.2
Identities = 15/75 (20%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 41 IEAEFKWFDGGN-STICTNLSTEVLSPRNQRKKEDRIGESQPFRERTLSICDRKEEKSWA 99
+ E K F G N +T +L++ S + + D+ + + + K
Sbjct: 225 FQEEAKLFSGVNQATSIIHLNSGSKSKKIELIFYDKSLSEKGDNITYVDLNIIKSMNLSI 284
Query: 100 VIPRYSPSPDLTTRR 114
IP+ S + LT +
Sbjct: 285 PIPQGS-NHLLTILK 298
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 28.1 bits (63), Expect = 9.4
Identities = 23/101 (22%), Positives = 38/101 (37%), Gaps = 16/101 (15%)
Query: 128 PEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRDSPPRHPNQ 187
PE P PE P+ P + P+ P+ +P + Q P+R+ P P
Sbjct: 76 PEPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQ-------PKREVKPVEP-- 126
Query: 188 FPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGP 228
+P + + + A A P ++ T S+ GP
Sbjct: 127 ------RPASPFENTAPARPTSSTATAAASKPVTSVSS-GP 160
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 28.7 bits (64), Expect = 9.4
Identities = 20/132 (15%), Positives = 27/132 (20%), Gaps = 8/132 (6%)
Query: 118 PRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPR 177
P R P + + R E P P T
Sbjct: 383 PGARAAAAVGASAVP-AVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAP-PATADRGD 440
Query: 178 RDSPPRHPNQFPETPRKPPARYPDIARAEPRT------TPTTTTPPATRRGESNPGPSDK 231
+ P R D A+P P + PP + +
Sbjct: 441 DAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPS 500
Query: 232 PDTCDTSYDAIA 243
T DA A
Sbjct: 501 AATPAAVPDARA 512
>gnl|CDD|227889 COG5602, SIN3, Histone deacetylase complex, SIN3 component
[Chromatin structure and dynamics].
Length = 1163
Score = 28.7 bits (64), Expect = 9.6
Identities = 21/115 (18%), Positives = 34/115 (29%), Gaps = 4/115 (3%)
Query: 120 PRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRREPSLRQPNQYPETPHEPRRD 179
P P H +P S ++ P HP+ + ++ + N P +
Sbjct: 7 PVDPPHDNPGDKVMTGSLKQISPKYHPSMSRDHDVNTSLQNSEPWPDNGSARKPSY-SPE 65
Query: 180 SPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGE-SNPGPSDKPD 233
S + P R RY + + P+T G SN P
Sbjct: 66 SMSSYGG--PMDERSGTRRYYEDVEQILSSLAGDNPDPSTDHGSASNQNMERSPP 118
>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
Length = 1320
Score = 28.7 bits (64), Expect = 10.0
Identities = 18/75 (24%), Positives = 26/75 (34%), Gaps = 6/75 (8%)
Query: 120 PRHP----THHHPEHHPTPESPRREPPPRHPNQYPE-TPQDSPRREPSLRQPNQYPETPH 174
PR+ T E P P S + +P R E P S +P+
Sbjct: 7 PRNAILRETSSGEEQSPNPSSHKSKPSSRKLKSSKENAPPPDLNSLTSDLKPDHRS-ASA 65
Query: 175 EPRRDSPPRHPNQFP 189
+ + PPR P+ P
Sbjct: 66 KLKSPLPPRPPSSNP 80
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.136 0.448
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 28,580,410
Number of extensions: 2809071
Number of successful extensions: 4503
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3537
Number of HSP's successfully gapped: 392
Length of query: 530
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 429
Effective length of database: 6,457,848
Effective search space: 2770416792
Effective search space used: 2770416792
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.4 bits)