RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16012
(712 letters)
>gnl|CDD|215740 pfam00134, Cyclin_N, Cyclin, N-terminal domain. Cyclins regulate
cyclin dependent kinases (CDKs). Human cyclin-O is a
Uracil-DNA glycosylase that is related to other cyclins.
Cyclins contain two domains of similar all-alpha fold,
of which this family corresponds with the N-terminal
domain.
Length = 127
Score = 145 bits (369), Expect = 4e-41
Identities = 49/99 (49%), Positives = 70/99 (70%), Gaps = 2/99 (2%)
Query: 244 RTVLLDWLSEVCEVYTLHRETFYLTIDYIDRYLSTNANIVPKQQLQLIGVAALMIAAKVE 303
R +L+DWL EV E + L ET YL ++Y+DR+LS VP+ +LQL+GV L+IAAK E
Sbjct: 31 RAILIDWLVEVHEEFKLLPETLYLAVNYLDRFLSKQ--PVPRTKLQLVGVTCLLIAAKYE 88
Query: 304 EIYPPKVSDYAYVTDGACSSQDILSTEMNILAVLEWNIT 342
EIYPP V D+ Y+TD A + ++IL E+ IL+ L W+++
Sbjct: 89 EIYPPSVEDFVYITDNAYTKEEILRMELLILSTLNWDLS 127
>gnl|CDD|227357 COG5024, COG5024, Cyclin [Cell division and chromosome
partitioning].
Length = 440
Score = 99.5 bits (248), Expect = 4e-22
Identities = 56/167 (33%), Positives = 88/167 (52%), Gaps = 17/167 (10%)
Query: 244 RTVLLDWLSEVCEVYTLHRETFYLTIDYIDRYLSTNANIVPKQQLQLIGVAALMIAAKVE 303
R++L+DWL EV + L ET +L I+ IDR+LS + +V ++ QL+G++AL IA+K E
Sbjct: 213 RSILVDWLVEVHGKFGLLPETLFLAINIIDRFLS--SRVVSLEKYQLVGISALFIASKYE 270
Query: 304 EIYPPKVSDYAYVTDGACSSQDILSTEMNILAVLEWNITPVTAHYWLNVFLQVVYHRSSR 363
E+ P + D Y TDGA + DI+ E +L VL++NI+ + +L +
Sbjct: 271 EVNCPSIKDLVYATDGAFTRDDIIRAERYMLEVLDFNISWPSPMSFLRRISKAS------ 324
Query: 364 CHNLGFIYPAFSRTLYTQCIRLLDLCHLDIRCLNYSYSVLATTAIYL 410
Y FSRT ++ +D + + S S A A+YL
Sbjct: 325 ------DYDIFSRTPAKF---SSEISPVDYKFIQISPSWCAAAAMYL 362
>gnl|CDD|238003 cd00043, CYCLIN, Cyclin box fold. Protein binding domain
functioning in cell-cycle and transcription control.
Present in cyclins, TFIIB and Retinoblastoma (RB).The
cyclins consist of 8 classes of cell cycle regulators
that regulate cyclin dependent kinases (CDKs). TFIIB is
a transcription factor that binds the TATA box. Cyclins,
TFIIB and RB contain 2 copies of the domain.
Length = 88
Score = 78.0 bits (193), Expect = 1e-17
Identities = 34/91 (37%), Positives = 49/91 (53%), Gaps = 4/91 (4%)
Query: 244 RTVLLDWLSEVCEVYTLHRETFYLTIDYIDRYLSTNANIVPKQQLQLIGVAALMIAAKVE 303
R LD+L V + L ET L ++ +DR+L V + L+ AAL +AAKVE
Sbjct: 2 RPTPLDFLRRVAKALGLSPETLTLAVNLLDRFLLD--YSVLGRSPSLVAAAALYLAAKVE 59
Query: 304 EIYPPKVSDYAYVTDGACSSQDILSTEMNIL 334
EI PP + D +VT A + ++IL E +L
Sbjct: 60 EI-PPWLKDLVHVTGYA-TEEEILRMEKLLL 88
Score = 29.9 bits (68), Expect = 0.99
Identities = 14/36 (38%), Positives = 18/36 (50%), Gaps = 1/36 (2%)
Query: 377 TLYTQCIRLLDLCHLDIRCLNYSYSVLATTAIYLVC 412
TL + LLD LD L S S++A A+YL
Sbjct: 22 TLTL-AVNLLDRFLLDYSVLGRSPSLVAAAALYLAA 56
>gnl|CDD|214641 smart00385, CYCLIN, domain present in cyclins, TFIIB and
Retinoblastoma. A helical domain present in cyclins and
TFIIB (twice) and Retinoblastoma (once). A protein
recognition domain functioning in cell-cycle and
transcription control.
Length = 83
Score = 74.2 bits (183), Expect = 2e-16
Identities = 28/86 (32%), Positives = 40/86 (46%), Gaps = 4/86 (4%)
Query: 249 DWLSEVCEVYTLHRETFYLTIDYIDRYLSTNANIVPKQQLQLIGVAALMIAAKVEEIYPP 308
D+L VC+ L ET L ++ +DR+LS K LI AAL +A+K EE P
Sbjct: 1 DFLRRVCKALNLDPETLNLAVNLLDRFLSDY--KFLKYSPSLIAAAALYLASKTEETPPW 58
Query: 309 KVSDYAYVTDGACSSQDILSTEMNIL 334
Y + ++IL E +L
Sbjct: 59 TKELVHYTGY--FTEEEILRMERLLL 82
Score = 30.3 bits (69), Expect = 0.69
Identities = 14/36 (38%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Query: 377 TLYTQCIRLLDLCHLDIRCLNYSYSVLATTAIYLVC 412
TL + LLD D + L YS S++A A+YL
Sbjct: 16 TLNL-AVNLLDRFLSDYKFLKYSPSLIAAAALYLAS 50
>gnl|CDD|217307 pfam02984, Cyclin_C, Cyclin, C-terminal domain. Cyclins regulate
cyclin dependent kinases (CDKs). Human CCNO is a
Uracil-DNA glycosylase that is related to other cyclins.
Cyclins contain two domains of similar all-alpha fold,
of which this family corresponds with the C-terminal
domain.
Length = 117
Score = 41.5 bits (98), Expect = 2e-04
Identities = 17/59 (28%), Positives = 26/59 (44%), Gaps = 7/59 (11%)
Query: 385 LLDLCHLDIRCLNYSYSVLATTAIYLVCS-------KELACMISGLRLESLEDCIEWMN 436
LL+L LD L Y S++A A+YL E +G E L+ C++ +
Sbjct: 27 LLELSLLDYDFLKYPPSLIAAAAVYLARKTLGSPPWTETLEHYTGYSEEDLKPCVKLLL 85
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 40.3 bits (94), Expect = 0.004
Identities = 29/91 (31%), Positives = 37/91 (40%), Gaps = 11/91 (12%)
Query: 503 PTTPG---NRTYSVPCDMTP-PPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILP 558
P PG R S P PP+ G TP AR AR PT++ P P P
Sbjct: 2720 PLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAP-------P 2772
Query: 559 ATPSSSKASRKSAPPAITTPCSSKSLRSGSD 589
A P++ R + P + S +SL S D
Sbjct: 2773 AAPAAGPPRRLTRPAVASLSESRESLPSPWD 2803
Score = 38.8 bits (90), Expect = 0.014
Identities = 15/85 (17%), Positives = 23/85 (27%)
Query: 503 PTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPS 562
P P P +T P +S ++ S + + + PA P
Sbjct: 2769 PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPL 2828
Query: 563 SSKASRKSAPPAITTPCSSKSLRSG 587
S + P SL G
Sbjct: 2829 PPPTSAQPTAPPPPPGPPPPSLPLG 2853
Score = 36.1 bits (83), Expect = 0.075
Identities = 17/93 (18%), Positives = 24/93 (25%), Gaps = 14/93 (15%)
Query: 489 AIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPT 548
+ L P TP P S P A + +P AP
Sbjct: 2691 TVGSLTSLADPPPPPPTPEPA---------PHALVSATPLPPGPAAARQASPALPAAPAP 2741
Query: 549 TPSSSKTILPATPSSSKASRKSAPPAITTPCSS 581
+ P P +R + PP P +
Sbjct: 2742 PAVPAGPATPGGP-----ARPARPPTTAGPPAP 2769
Score = 33.0 bits (75), Expect = 0.81
Identities = 18/92 (19%), Positives = 32/92 (34%), Gaps = 2/92 (2%)
Query: 489 AIAESAKLLKEEQQPTTPGNRTYSV--PCDMTPPPSSSKGQTTPSSKARQARTPTSSKAP 546
A+A ++ + P P + +V P PP +S G P + A+ P P
Sbjct: 2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPP 2846
Query: 547 PTTPSSSKTILPATPSSSKASRKSAPPAITTP 578
P + ++ P + +S P
Sbjct: 2847 PPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP 2878
Score = 32.6 bits (74), Expect = 0.92
Identities = 16/81 (19%), Positives = 28/81 (34%), Gaps = 4/81 (4%)
Query: 503 PTTPGNRTYSVPCD--MTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT 560
PTT G + P PP ++ S++R++ A P + A
Sbjct: 2761 PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA--AAL 2818
Query: 561 PSSSKASRKSAPPAITTPCSS 581
P ++ + PP P +
Sbjct: 2819 PPAASPAGPLPPPTSAQPTAP 2839
Score = 30.7 bits (69), Expect = 3.6
Identities = 18/70 (25%), Positives = 24/70 (34%)
Query: 504 TTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSS 563
+ P VP PPP++ P S A P P T + AT +
Sbjct: 414 SVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRKA 473
Query: 564 SKASRKSAPP 573
A R+ PP
Sbjct: 474 LDALRERRPP 483
Score = 29.5 bits (66), Expect = 9.6
Identities = 13/78 (16%), Positives = 25/78 (32%), Gaps = 3/78 (3%)
Query: 501 QQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT 560
+ + ++++P D P + P + + P PP P P
Sbjct: 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP---PLA 2944
Query: 561 PSSSKASRKSAPPAITTP 578
P++ A A+ P
Sbjct: 2945 PTTDPAGAGEPSGAVPQP 2962
Score = 29.1 bits (65), Expect = 9.9
Identities = 17/88 (19%), Positives = 22/88 (25%), Gaps = 1/88 (1%)
Query: 501 QQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPA- 559
P + P S ++ + R P PP P P
Sbjct: 2870 SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP 2929
Query: 560 TPSSSKASRKSAPPAITTPCSSKSLRSG 587
P R P A TT + SG
Sbjct: 2930 QPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
Score = 29.1 bits (65), Expect = 10.0
Identities = 14/81 (17%), Positives = 19/81 (23%), Gaps = 3/81 (3%)
Query: 503 PTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPS 562
P G P P+S A P + S PA P
Sbjct: 394 PFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDG---PAPPP 450
Query: 563 SSKASRKSAPPAITTPCSSKS 583
+ + PA P +
Sbjct: 451 ERQPPAPATEPAPDDPDDATR 471
>gnl|CDD|213844 TIGR03657, IsdB, heme uptake protein IsdB. Isd proteins are
iron-regulated surface proteins found in Bacillus,
Staphylococcus and Listeria species and are responsible
for heme scavenging from hemoproteins. The IsdB protein
is only observed in Staphylococcus and consists of an
N-terminal hydrophobic signal sequence, a pair of tandem
NEAT (NEAr Transporter, pfam05031) domains which confers
the ability to bind heme and a C-terminal sortase
processing signal which targets the protein to the cell
wall. IsdB is believed to make a direct contact with
methemoglobin facilitating transfer of heme to IsdB. The
heme is then transferred to other cell wall-bound NEAT
domain proteins such as IsdA and IsdC.
Length = 644
Score = 39.9 bits (92), Expect = 0.005
Identities = 30/109 (27%), Positives = 41/109 (37%), Gaps = 2/109 (1%)
Query: 466 HNNVFDYAYSYHKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSK 525
H DY YH V E A A+ KE+Q + T + P T PP +
Sbjct: 433 HVKTIDYDGQYHVRIVDKEAFTKANADKTNK-KEQQDNSAKKETTPATPSKPTTPPVEKE 491
Query: 526 GQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT-PSSSKASRKSAPP 573
Q S K ++P+ K + S K PAT P+ + S P
Sbjct: 492 SQKQDSQKDDNKQSPSVEKENDASSESGKDKTPATKPAKGEVESSSTTP 540
>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
Length = 592
Score = 39.3 bits (91), Expect = 0.007
Identities = 24/101 (23%), Positives = 38/101 (37%), Gaps = 1/101 (0%)
Query: 499 EEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILP 558
+E + N T P TP ++ ++ ++ K P P
Sbjct: 59 DEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQVEKAPAEP 118
Query: 559 ATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQ-LQTITA 598
AT S+ + S+ PA +LRSG+ Q +QTI A
Sbjct: 119 ATVSNPDNATSSSTPATYNLLQKSALRSGATVQSFIQTIQA 159
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 38.5 bits (90), Expect = 0.010
Identities = 19/105 (18%), Positives = 41/105 (39%), Gaps = 5/105 (4%)
Query: 484 ELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSS 543
E++ +A SA+L + Q P + + + TP P++ T +S+ ++
Sbjct: 142 EITTMADQSSAELSQNSGQSV-PLDTSTTTDPATTPAPAAPVDTTPTNSQT----PAVAT 196
Query: 544 KAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGS 588
P ++ + ++ + AP A TP + L +
Sbjct: 197 APAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQ 241
Score = 31.9 bits (73), Expect = 0.97
Identities = 17/91 (18%), Positives = 26/91 (28%), Gaps = 3/91 (3%)
Query: 497 LKEEQQPTTPGNRTYSVPCDMTPPPSSS---KGQTTPSSKARQARTPTSSKAPPTTPSSS 553
L + P D TP S + P+ +Q S+A T ++
Sbjct: 164 LDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATP 223
Query: 554 KTILPATPSSSKASRKSAPPAITTPCSSKSL 584
PATP + T +L
Sbjct: 224 APAAPATPDGAAPLPTDQAGVSTPAADPNAL 254
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 38.1 bits (88), Expect = 0.010
Identities = 25/92 (27%), Positives = 36/92 (39%), Gaps = 10/92 (10%)
Query: 481 VSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTP 540
V + A A + K E +TP P P S TTP S +A
Sbjct: 20 VITTTTQAANATTPSSTKVEAPQSTP-------PSTKVEAPQSKPNATTPPSTKVEAPQQ 72
Query: 541 TSSKAPPTTPSSSKTILPATPSSSKASRKSAP 572
T + TTPSS+K P +P++ + + P
Sbjct: 73 TPN---ATTPSSTKVETPQSPTTKQVPTEINP 101
Score = 38.1 bits (88), Expect = 0.011
Identities = 25/67 (37%), Positives = 35/67 (52%), Gaps = 8/67 (11%)
Query: 524 SKGQTTPSSKARQARTPTSSK--APPTTPSSSKTILP------ATPSSSKASRKSAPPAI 575
+ G T +++A A TP+S+K AP +TP S+K P TP S+K P
Sbjct: 17 TTGVITTTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNA 76
Query: 576 TTPCSSK 582
TTP S+K
Sbjct: 77 TTPSSTK 83
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 37.8 bits (88), Expect = 0.021
Identities = 26/91 (28%), Positives = 34/91 (37%), Gaps = 3/91 (3%)
Query: 502 QPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATP 561
PT + P P+SS S + +P S AP + +SS +
Sbjct: 265 LPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRES 324
Query: 562 SSSKASRKSAPP--AITTPCSSKSLRSGSDS 590
SSS S S A +P S S RS S S
Sbjct: 325 SSSSTSSSSESSRGAAVSPGPSPS-RSPSPS 354
Score = 37.1 bits (86), Expect = 0.045
Identities = 15/68 (22%), Positives = 28/68 (41%)
Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTP 578
SS T+ SS++ + + +P +PS S+ PA PSS + + + +
Sbjct: 320 SSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPA 379
Query: 579 CSSKSLRS 586
S+
Sbjct: 380 ASAGRPTR 387
Score = 35.1 bits (81), Expect = 0.17
Identities = 15/82 (18%), Positives = 22/82 (26%)
Query: 493 SAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSS 552
+ + + S +P S PSS TP + PP+
Sbjct: 74 GPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPD 133
Query: 553 SKTILPATPSSSKASRKSAPPA 574
+L S S P A
Sbjct: 134 LSEMLRPVGSPGPPPAASPPAA 155
Score = 34.4 bits (79), Expect = 0.29
Identities = 11/61 (18%), Positives = 20/61 (32%)
Query: 514 PCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPP 573
++P PS S+ + + + P+ SS P+ +A A
Sbjct: 338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGR 397
Query: 574 A 574
A
Sbjct: 398 A 398
Score = 34.0 bits (78), Expect = 0.32
Identities = 19/100 (19%), Positives = 33/100 (33%), Gaps = 4/100 (4%)
Query: 503 PTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPT----SSKAPPTTPSSSKTILP 558
+ + P S G S +A + + + SS ++ SS +
Sbjct: 283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVS 342
Query: 559 ATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTITA 598
PS S++ S PP P S + S + +A
Sbjct: 343 PGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASA 382
Score = 33.2 bits (76), Expect = 0.68
Identities = 18/91 (19%), Positives = 33/91 (36%), Gaps = 6/91 (6%)
Query: 504 TTPGNRTYSVPCDMTP---PPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT 560
+R ++P SS P S A +P ++P S+ PA
Sbjct: 165 DAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPP--RRSSPISASASSPA- 221
Query: 561 PSSSKASRKSAPPAITTPCSSKSLRSGSDSQ 591
P+ +++ A + + SS+S G +
Sbjct: 222 PAPGRSAADDAGASSSDSSSSESSGCGWGPE 252
Score = 32.1 bits (73), Expect = 1.3
Identities = 17/99 (17%), Positives = 36/99 (36%), Gaps = 7/99 (7%)
Query: 492 ESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPS 551
+ +++L+ P P S P P + + + A +P + P++P
Sbjct: 133 DLSEMLRPVGSPGPPPAA--SPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPP 190
Query: 552 SSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDS 590
+ PS+ A+ PP ++P S+ +
Sbjct: 191 A-----EPPPSTPPAAASPRPPRRSSPISASASSPAPAP 224
Score = 31.3 bits (71), Expect = 2.6
Identities = 20/87 (22%), Positives = 29/87 (33%)
Query: 504 TTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSS 563
++ + P + P P + SS+ P + SSS +PS
Sbjct: 241 SSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSP 300
Query: 564 SKASRKSAPPAITTPCSSKSLRSGSDS 590
S AP + SS S R S S
Sbjct: 301 SSPGSGPAPSSPRASSSSSSSRESSSS 327
Score = 30.9 bits (70), Expect = 3.0
Identities = 10/83 (12%), Positives = 25/83 (30%), Gaps = 3/83 (3%)
Query: 518 TPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSK--TILPATPSSSKASRKSAPPAI 575
+ S + S +R + +P+ P S K A S + ++ +
Sbjct: 332 SSESSRGAAVSPGPSPSR-SPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRA 390
Query: 576 TTPCSSKSLRSGSDSQQLQTITA 598
+ ++ R + +
Sbjct: 391 RAAVAGRARRRDATGRFPAGRPR 413
Score = 30.5 bits (69), Expect = 3.6
Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 8/84 (9%)
Query: 502 QPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPA-- 559
P++PG+ P +P SSS + SS + + + SS+ +P S + P+
Sbjct: 299 SPSSPGSG----PAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPS 354
Query: 560 --TPSSSKASRKSAPPAITTPCSS 581
P + +S + P P S
Sbjct: 355 RPPPPADPSSPRKRPRPSRAPSSP 378
Score = 29.4 bits (66), Expect = 8.1
Identities = 21/99 (21%), Positives = 37/99 (37%), Gaps = 17/99 (17%)
Query: 498 KEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSS--------KARQARTPTSSKAPPTT 549
E PT PG + PPP+ PS + + P + +PP
Sbjct: 101 AREGSPTPPGPSS-----PDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAA 155
Query: 550 PSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGS 588
+S A S + +SR++A P + ++++ S
Sbjct: 156 GASP----AAVASDAASSRQAALPLSSPEETARAPSSPP 190
>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
Length = 328
Score = 37.2 bits (86), Expect = 0.024
Identities = 24/109 (22%), Positives = 48/109 (44%), Gaps = 6/109 (5%)
Query: 489 AIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQART--PTSSKAP 546
A ++AK E+ TT R +V + P + +T P A+ + P + A
Sbjct: 138 ASRQTAKTQTAERPATTRPARKQAV---IEPKKPQATAKTEPKPVAQTPKRTEPAAPVAS 194
Query: 547 PTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQT 595
P+++ T P +++ A ++A PA TT + ++ + L++
Sbjct: 195 TKAPAATSTPAP-KETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKS 242
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 37.2 bits (86), Expect = 0.028
Identities = 16/95 (16%), Positives = 32/95 (33%), Gaps = 6/95 (6%)
Query: 487 DLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAP 546
D + E +KE+ ++ + + PP ++ TP+ + S AP
Sbjct: 56 DKTVKEYK--IKEKDFVVVMVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAP 113
Query: 547 PTT--PSSS--KTILPATPSSSKASRKSAPPAITT 577
+ S ++ P S S S+ +
Sbjct: 114 ASAVEEKSPSEESATATAPESPSTSVPSSGSDAAS 148
Score = 35.3 bits (81), Expect = 0.098
Identities = 16/85 (18%), Positives = 27/85 (31%), Gaps = 5/85 (5%)
Query: 513 VPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAP 572
V ++ P + + P++ A TPT S S A P+S+ + +
Sbjct: 70 VVVMVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMS-----AAPASAVEEKSPSE 124
Query: 573 PAITTPCSSKSLRSGSDSQQLQTIT 597
+ T S S T
Sbjct: 125 ESATATAPESPSTSVPSSGSDAAST 149
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 36.4 bits (85), Expect = 0.056
Identities = 13/88 (14%), Positives = 26/88 (29%), Gaps = 1/88 (1%)
Query: 499 EEQQPT-TPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTIL 557
+ P T + ++ + A ++ A P P+++
Sbjct: 34 ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAA 93
Query: 558 PATPSSSKASRKSAPPAITTPCSSKSLR 585
A + A+ +AP A LR
Sbjct: 94 AAPAAPPAAAAAAAPAAAAVEDEVTPLR 121
>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
Length = 327
Score = 35.8 bits (83), Expect = 0.063
Identities = 19/92 (20%), Positives = 32/92 (34%), Gaps = 10/92 (10%)
Query: 502 QPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPT-TPSSSKTILPAT 560
PT P P +K T + + P + K +++ + PAT
Sbjct: 174 LPTAPA---------TVAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPAT 224
Query: 561 PSSSKASRKSAPPAITTPCSSKSLRSGSDSQQ 592
K+ SA + P S +L+ S S+
Sbjct: 225 SGKPKSGAASARALSSAPASHYTLQLSSASRS 256
Score = 31.2 bits (71), Expect = 1.8
Identities = 24/104 (23%), Positives = 38/104 (36%), Gaps = 13/104 (12%)
Query: 501 QQPTTP-GNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPA 559
P TP G + +P +++ S +GQ +S+ Q T T AP T S +PA
Sbjct: 132 APPQTPNGQQRIELPGNISDALSQQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAKVPA 191
Query: 560 TPSSSK------------ASRKSAPPAITTPCSSKSLRSGSDSQ 591
T + + P +S +SG+ S
Sbjct: 192 TAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAASA 235
Score = 30.0 bits (68), Expect = 4.3
Identities = 14/73 (19%), Positives = 25/73 (34%), Gaps = 6/73 (8%)
Query: 501 QQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT 560
T ++ VP P+ + T +T T + P T+ P +
Sbjct: 177 APATVAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGK------PKS 230
Query: 561 PSSSKASRKSAPP 573
++S + SAP
Sbjct: 231 GAASARALSSAPA 243
>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
Length = 580
Score = 36.3 bits (84), Expect = 0.063
Identities = 31/222 (13%), Positives = 53/222 (23%), Gaps = 16/222 (7%)
Query: 444 EKSPGTDYCDSDSNELP-PGPLRHNNVFDYAYSY----HKHSVSMELSDLAIAESAKLLK 498
K G + + P R N + S + D + +
Sbjct: 3 GKPRGQPGRWNYDSSDESPEGSRDENFDAERDDFLTPLGSTSEATSEDDDDLYPPRETGS 62
Query: 499 EEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILP 558
T+ Y+VP PP + P S P P P
Sbjct: 63 GGGVATST---IYTVP---RPPRGPEQTLDKPDSLPASRELP-PGPTPVPPGGFRGASSP 115
Query: 559 ATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTITAEYIIYHAIISRRLQTKISRT 618
+ S + R + + S + + + + K+
Sbjct: 116 RLGADSTSPRFLYQVNFPVILAPIGESNSSSEELSEEEEHSRPPPSESLKVKNGGKVYPK 175
Query: 619 AFYMMCA-KDQDSLSLRDPHMLDRHPSIV---PRMRTLSVSY 656
F K + L R S V +++ LS S
Sbjct: 176 GFSKHKTHKRSEFSGLTKKAARKRKGSFVFKPSQLKELSGSL 217
>gnl|CDD|113413 pfam04642, DUF601, Protein of unknown function, DUF601. This
family represents a conserved region found in several
uncharacterized plant proteins.
Length = 311
Score = 35.4 bits (81), Expect = 0.077
Identities = 30/114 (26%), Positives = 42/114 (36%), Gaps = 12/114 (10%)
Query: 482 SMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPT 541
++E S L AE AK + E+ G+R P + K P+S++R T
Sbjct: 4 TLEDSKLRAAEKAKQPQAEEDS---GSR-------QKPSTLAGKNPDAPTSESRTPSKAT 53
Query: 542 SSKAPPTTPSSSKTILPAT--PSSSKASRKSAPPAITTPCSSKSLRSGSDSQQL 593
SSK P + K S ++SR P K DSQ L
Sbjct: 54 SSKDPSKRYADKKRKQSEKDARSPPRSSRPRTEEKDAGPSQQKEKGKKGDSQDL 107
>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
Validated.
Length = 709
Score = 35.8 bits (82), Expect = 0.080
Identities = 21/91 (23%), Positives = 38/91 (41%), Gaps = 7/91 (7%)
Query: 508 NRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT------- 560
N P T ++ + P +A A+TP + + PS KT P +
Sbjct: 374 NTELQSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAAAMPSEGKTAGPVSNQENNDV 433
Query: 561 PSSSKASRKSAPPAITTPCSSKSLRSGSDSQ 591
P A ++ A T S+KS+++ S+++
Sbjct: 434 PPWEDAPDEAQTAAGTAQTSAKSIQTASEAE 464
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 35.3 bits (80), Expect = 0.11
Identities = 22/97 (22%), Positives = 39/97 (40%), Gaps = 7/97 (7%)
Query: 489 AIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPT 548
A A AK + P + + P PP+ K P A+ A P + APP
Sbjct: 239 AAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPA--KAAAPP---AKAAAPPAKAAAPPA 293
Query: 549 TPSSS--KTILPATPSSSKASRKSAPPAITTPCSSKS 583
+++ K +++ ++ +APPA +K+
Sbjct: 294 KAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKA 330
Score = 33.0 bits (74), Expect = 0.56
Identities = 24/101 (23%), Positives = 40/101 (39%), Gaps = 13/101 (12%)
Query: 489 AIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTP----SSKARQARTPTSSK 544
A A AK + P + + P PP+ K P ++ A+ A P +
Sbjct: 260 AAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPA--KAAAAPAKAAAAPAKAAAAPAKAA 317
Query: 545 APPTTPSSSKTILPATPSSSKASRKSAPP--AITTPCSSKS 583
APP +K P +++ ++ +APP A P K+
Sbjct: 318 APP-----AKAAAPPAKAATPPAKAAAPPAKAAAAPVGKKA 353
Score = 32.6 bits (73), Expect = 0.72
Identities = 20/99 (20%), Positives = 39/99 (39%), Gaps = 4/99 (4%)
Query: 489 AIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSK--ARQARTPTSSKAP 546
A A AK + P + + P PP+ + ++ A+ A P + A
Sbjct: 246 AAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAA 305
Query: 547 PTTPSSSKTILPATPSSSKA--SRKSAPPAITTPCSSKS 583
P +++ A P+ + A ++ + PPA +K+
Sbjct: 306 PAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPAKA 344
Score = 30.7 bits (68), Expect = 2.6
Identities = 20/99 (20%), Positives = 40/99 (40%), Gaps = 4/99 (4%)
Query: 487 DLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAP 546
D A A +AK ++ P + + P +++ + A+ A P + A
Sbjct: 192 DAAAAAAAKQKAAAKKAAAPSGKKSAKAA--APAKAAAAPAKAAAPPAKAAAAPAKAAAA 249
Query: 547 PTTPSSSKTILPATPSSSKA--SRKSAPPAITTPCSSKS 583
P ++ A P+ + A ++ +APPA +K+
Sbjct: 250 PAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKA 288
>gnl|CDD|148820 pfam07432, Hc1, Histone H1-like protein Hc1. This family consists
of several bacterial histone H1-like Hc1 proteins. In
Chlamydia, Hc1 is expressed in the late stages of the
life cycle, concomitant with the reorganisation of
chlamydial reticulate bodies into elementary bodies.
This suggests that Hc1 protein plays a role in the
condensation of chromatin during intracellular
differentiation.
Length = 123
Score = 33.2 bits (75), Expect = 0.12
Identities = 13/65 (20%), Positives = 27/65 (41%), Gaps = 2/65 (3%)
Query: 523 SSKGQTTPSSKARQ-ARTPTSSKAPPTTPSSSKTILPATPSSSK-ASRKSAPPAITTPCS 580
S + P+ KA + A+ P + P +++ A +K A++K+A +
Sbjct: 58 SGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAAPAKKAKAVKAKPATKKTAAKKVKKMSK 117
Query: 581 SKSLR 585
+S
Sbjct: 118 KRSTA 122
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 35.5 bits (81), Expect = 0.12
Identities = 22/85 (25%), Positives = 35/85 (41%), Gaps = 8/85 (9%)
Query: 518 TPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAI-- 575
TP S++ TT SS +P++ AP T+P + P S S ++PPA
Sbjct: 572 TPTGSAASSNTTFSSD-----SPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPASHL 626
Query: 576 -TTPCSSKSLRSGSDSQQLQTITAE 599
+ + S S +T + E
Sbjct: 627 GSPSTTPSSPESSIKVASTETASPE 651
Score = 34.3 bits (78), Expect = 0.22
Identities = 30/114 (26%), Positives = 47/114 (41%), Gaps = 19/114 (16%)
Query: 503 PTTPG------NRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSS--KAPPTTPS--- 551
P PG T S P +++ +PS+ + +P + +PP TPS
Sbjct: 556 PYIPGLFAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIV 615
Query: 552 SSKTILPA--------TPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTIT 597
S T PA TPSS ++S K A +P SS + S S + +++
Sbjct: 616 SPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSVSMVSMS 669
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 35.1 bits (81), Expect = 0.14
Identities = 16/102 (15%), Positives = 34/102 (33%), Gaps = 4/102 (3%)
Query: 491 AESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTP 550
+ A + +Q P ++ P P +++ +PS + A+ A
Sbjct: 366 GDDASGGRGPKQHIKP---VFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAG 422
Query: 551 SSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQ 592
+ T+ P++ + S P P K + S+
Sbjct: 423 -TPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKV 463
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 34.7 bits (79), Expect = 0.17
Identities = 21/82 (25%), Positives = 37/82 (45%), Gaps = 5/82 (6%)
Query: 493 SAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQAR--TPTSSKAPPTTP 550
SA +++ QP ++ ++ TP P + QTT S + + TP S+ P
Sbjct: 88 SASKIEKISQPKQEEQKSLNISA--TPAPKQEQSQTTTESTTPKTKVTTPPSTNT-PQPM 144
Query: 551 SSSKTILPATPSSSKASRKSAP 572
S+K+ P +P+ +A P
Sbjct: 145 QSTKSDTPQSPTIKQAQTDMTP 166
Score = 29.7 bits (66), Expect = 5.1
Identities = 21/79 (26%), Positives = 32/79 (40%), Gaps = 5/79 (6%)
Query: 522 SSSKGQTTPSSKARQARTPTSSKAPPT----TPSSSKTILPATPSSS-KASRKSAPPAIT 576
+ + TT ++ RQ RTP KAP T T +S + K+ SA PA
Sbjct: 56 AGANSATTQAANTRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLNISATPAPK 115
Query: 577 TPCSSKSLRSGSDSQQLQT 595
S + S + ++ T
Sbjct: 116 QEQSQTTTESTTPKTKVTT 134
Score = 29.3 bits (65), Expect = 7.3
Identities = 24/93 (25%), Positives = 36/93 (38%), Gaps = 11/93 (11%)
Query: 491 AESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQT-----TPSSKARQARTPTSSKA 545
+ K E+ P T +T + + P + ++ TP+ K Q++T T S
Sbjct: 68 TRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTT 127
Query: 546 PP---TTPSSSKTILPATPSSSKASRKSAPPAI 575
P TTP S+ T P S S P I
Sbjct: 128 PKTKVTTPPSTNT---PQPMQSTKSDTPQSPTI 157
>gnl|CDD|178141 PLN02526, PLN02526, acyl-coenzyme A oxidase.
Length = 412
Score = 34.4 bits (79), Expect = 0.20
Identities = 16/50 (32%), Positives = 23/50 (46%), Gaps = 12/50 (24%)
Query: 305 IYPPKVSDYAYVTDGACSSQDILSTE-----MNILAVLEWNITPVTAHYW 349
I+PP VSDY Y D D+L+ E + +E + P+ YW
Sbjct: 14 IFPPSVSDY-YQFD------DLLTPEEQALRKRVRECMEKEVAPIMTEYW 56
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 32.9 bits (75), Expect = 0.21
Identities = 18/86 (20%), Positives = 28/86 (32%), Gaps = 8/86 (9%)
Query: 476 YHKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKAR 535
+++H +SME E+ K+L P P + P +P P+ T +
Sbjct: 64 HNEHILSME------QEARKILAPLTPPAPP--EPVTPPTAQSPAPAVPTPPPTSTPAVP 115
Query: 536 QARTPTSSKAPPTTPSSSKTILPATP 561
A P S P P
Sbjct: 116 PAPAAAVPAPAAAPPPSDPPQPPRAP 141
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 34.0 bits (78), Expect = 0.26
Identities = 27/93 (29%), Positives = 35/93 (37%), Gaps = 17/93 (18%)
Query: 516 DMTPPPSSSKGQTTP---SSKARQARTPTSSKAPPTTP---------SSSKTILPATPSS 563
SSS + T S A+++ P+SSKA + SS A P S
Sbjct: 227 PSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKAAPES 286
Query: 564 SKASRKSAPPAITTP-----CSSKSLRSGSDSQ 591
+ S SA P +P S SL SDS
Sbjct: 287 TSQSPSSAEPTSESPQTAGNSSLSSLGDYSDSD 319
>gnl|CDD|219262 pfam07001, BAT2_N, BAT2 N-terminus. This family represents the
N-terminus (approximately 200 residues) of the
proline-rich protein BAT2. BAT2 is similar to other
proteins with large proline-rich domains, such as some
nuclear proteins, collagens, elastin, and synapsin.
Length = 189
Score = 33.4 bits (76), Expect = 0.26
Identities = 20/71 (28%), Positives = 26/71 (36%), Gaps = 1/71 (1%)
Query: 524 SKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKS 583
K T +SK Q+ +S PP TP S + T +SS P P +KS
Sbjct: 81 PKDGTGWASKQDQSDPKSSDALPPQTPESQPPAVSQTSASSPPP-PPQPATPLVPGGAKS 139
Query: 584 LRSGSDSQQLQ 594
S Q
Sbjct: 140 WAVASAKPGAQ 150
>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
Length = 590
Score = 34.3 bits (79), Expect = 0.27
Identities = 14/71 (19%), Positives = 21/71 (29%), Gaps = 1/71 (1%)
Query: 509 RTYSVPCDMTPPPSSSKGQT-TPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKAS 567
S SSS Q+ PSS + + R S + S + + S +
Sbjct: 5 NPTSRSQSHASSSSSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSPPSV 64
Query: 568 RKSAPPAITTP 578
TP
Sbjct: 65 TLPPAATTQTP 75
>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
Length = 209
Score = 32.9 bits (75), Expect = 0.36
Identities = 12/79 (15%), Positives = 25/79 (31%), Gaps = 2/79 (2%)
Query: 513 VPCDMTPPPSSSKGQTTPSS--KARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKS 570
VP P ++ ++ + R + A P ++TI + A+ K
Sbjct: 130 VPVPAPAPAVAAAAAQAAAAPKAPAKPRAKSPRPAAKAAPKPTETITAKKAKKTAAAPKP 189
Query: 571 APPAITTPCSSKSLRSGSD 589
TP + + +
Sbjct: 190 TADKTATPAKKTTKKKKTK 208
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 33.7 bits (77), Expect = 0.39
Identities = 9/74 (12%), Positives = 17/74 (22%), Gaps = 4/74 (5%)
Query: 501 QQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT 560
P D ++ S + P S P + +
Sbjct: 445 GDAPVPAKANARASADSRCDERDAQPPADSGSAS----APASDAPPDAAFEPAPRAAAPS 500
Query: 561 PSSSKASRKSAPPA 574
++ A + PA
Sbjct: 501 AATPAAVPDARAPA 514
Score = 32.5 bits (74), Expect = 0.97
Identities = 16/84 (19%), Positives = 26/84 (30%), Gaps = 5/84 (5%)
Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSS---KASRKSAPPAI 575
P + P+ AR A +S P T + P ++ A+R APPA
Sbjct: 372 VPARVAGA--VPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAA 429
Query: 576 TTPCSSKSLRSGSDSQQLQTITAE 599
P ++ +
Sbjct: 430 PAPPATADRGDDAADGDAPVPAKA 453
Score = 31.4 bits (71), Expect = 2.1
Identities = 20/105 (19%), Positives = 35/105 (33%), Gaps = 8/105 (7%)
Query: 489 AIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPT 548
A A+S ++ Q P + S P PP ++ + ++ + P
Sbjct: 457 ASADSRCDERDAQPPADS--GSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPA 514
Query: 549 TPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQL 593
S A P + +A PA P + R+G + L
Sbjct: 515 AASREDAPAAAAPPAPEAR--PPTPAAAAP----AARAGGAAAAL 553
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 33.5 bits (76), Expect = 0.41
Identities = 20/98 (20%), Positives = 34/98 (34%)
Query: 479 HSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQAR 538
H + D A+A ++ ++ P P ++ P+ + A
Sbjct: 47 HQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPDPAVAPQLAA 106
Query: 539 TPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAIT 576
P A T ++ PA +S AS+K P A T
Sbjct: 107 APKPDAAEAFTSAAQAHEAPADAGTSAASKKPDPAAHT 144
>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24).
This family consists of several MGC-24 (or Cd164
antigen) proteins from eukaryotic organisms.
MGC-24/CD164 is a sialomucin expressed in many normal
and cancerous tissues. In humans, soluble and
transmembrane forms of MGC-24 are produced by
alternative splicing.
Length = 187
Score = 32.7 bits (74), Expect = 0.43
Identities = 22/90 (24%), Positives = 27/90 (30%), Gaps = 7/90 (7%)
Query: 502 QPTTPGNRTYSVPCDMTPPPSSSKGQTTPS-SKARQARTPTSSKAPPTTPSSSKTILPAT 560
Q P Y TT S S A PT+S A PT S T
Sbjct: 78 QQCGPEEPGYCSSQAEVVKSGCQIYNTTDSCSVATTTPVPTNSTAKPTITPSP------T 131
Query: 561 PSSSKASRKSAPPAITTPCSSKSLRSGSDS 590
S + + TP S +S D+
Sbjct: 132 TSHHHVTSEPKTNTTVTPTSQPDRKSTFDA 161
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 32.9 bits (75), Expect = 0.49
Identities = 17/56 (30%), Positives = 24/56 (42%), Gaps = 1/56 (1%)
Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAP-PTTPSSSKTILPATPSSSKASRKSAPP 573
PPP + P + P P+ P+SS PA+P +KA + S PP
Sbjct: 144 PPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSSHPP 199
>gnl|CDD|227600 COG5275, COG5275, BRCT domain type II [General function prediction
only].
Length = 276
Score = 32.4 bits (73), Expect = 0.61
Identities = 25/104 (24%), Positives = 40/104 (38%), Gaps = 5/104 (4%)
Query: 490 IAESAKLLKEEQQPTTPGN--RTYSVPCDMTPPPSSSKGQTTPSSKARQARTP--TSSKA 545
+ E KL+ + TTP S P S+K TT +T T+S +
Sbjct: 1 VHEDDKLVGSDGVSTTPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDS 60
Query: 546 PPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSD 589
P + T PA P + K++ + TT ++ + RS
Sbjct: 61 KPVVHQTRATRKPAQPKAEKSTTSKSKSHTTTA-TTHTSRSSKS 103
Score = 32.4 bits (73), Expect = 0.66
Identities = 14/74 (18%), Positives = 21/74 (28%), Gaps = 8/74 (10%)
Query: 499 EEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILP 558
+ P D P ++ T +A T T+ S S T
Sbjct: 42 TSKDVVHPVKTELDTTSDSKPVVHQTR--ATRKPAQPKAEKST------TSKSKSHTTTA 93
Query: 559 ATPSSSKASRKSAP 572
T +S + K P
Sbjct: 94 TTHTSRSSKSKGLP 107
>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein. This is a viral
attachment glycoprotein from region G of metaviruses. It
is high in serine and threonine suggesting it is highly
glycosylated.
Length = 183
Score = 31.9 bits (72), Expect = 0.63
Identities = 29/101 (28%), Positives = 42/101 (41%), Gaps = 1/101 (0%)
Query: 475 SYHKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPS-SK 533
S H S E S L A S L+ E T T PPSSS+ QTT + +
Sbjct: 57 SQHPTQQSTESSTLPAATSESHLETEPTSTPDTTNRQQTVDRHTTPPSSSRTQTTQAVHE 116
Query: 534 ARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPA 574
+ RT + ++ PPTT +++ T +SS +
Sbjct: 117 KKNTRTTSRTQTPPTTSTAAVQTTTTTNTSSTGKEPTTTSV 157
>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 814
Score = 32.9 bits (75), Expect = 0.69
Identities = 21/103 (20%), Positives = 38/103 (36%), Gaps = 3/103 (2%)
Query: 478 KHSVS---MELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKA 534
KH S + L+ + S L + ++ T T + TP S + + +
Sbjct: 3 KHYFSKSAVALTLALLTASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETP 62
Query: 535 RQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITT 577
A T + + T +S AT +++ +R P A T
Sbjct: 63 VAATTAAEAPSSSETAETSDPTSEATDTTTSEARTVTPAATET 105
Score = 29.4 bits (66), Expect = 7.7
Identities = 13/73 (17%), Positives = 28/73 (38%), Gaps = 3/73 (4%)
Query: 528 TTPSSKARQARTPTSSKAPPTTPSSSKTILPA-TPSSSKASRKSAPPAITTPCSSKSLRS 586
T + K QA + P T + ++ P + ++ +A P A TT + S
Sbjct: 19 TASNPKLAQAEEIVT--TTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSE 76
Query: 587 GSDSQQLQTITAE 599
+++ + +
Sbjct: 77 TAETSDPTSEATD 89
>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus. This family represents
the C-terminus of eukaryotic neogenin precursor
proteins, which contains several potential
phosphorylation sites. Neogenin is a member of the N-CAM
family of cell adhesion molecules (and therefore
contains multiple copies of pfam00047 and pfam00041) and
is closely related to the DCC tumour suppressor gene
product - these proteins may play an integral role in
regulating differentiation programmes and/or cell
migration events within many adult and embryonic
tissues.
Length = 295
Score = 32.6 bits (74), Expect = 0.70
Identities = 19/108 (17%), Positives = 35/108 (32%), Gaps = 11/108 (10%)
Query: 474 YSYHKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSK-------- 525
Y +H+ S+ + + + +T P S
Sbjct: 120 YLHHQFSLRPMPFSTLTVQRLYQHGDRAESVESVRQTPE--PPYLPAAQSESSNAAEEAP 177
Query: 526 GQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPP 573
++ P++ R P S A P P+S TI P PS+ +++
Sbjct: 178 SRSIPTAHVRPTH-PLKSFAVPALPASMSTIEPKLPSTPLLTQQGPTL 224
>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
Length = 1320
Score = 33.0 bits (75), Expect = 0.74
Identities = 30/91 (32%), Positives = 38/91 (41%), Gaps = 11/91 (12%)
Query: 497 LKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSK--APPTTPSSSK 554
+K P R S + +P PSS K + PSS R SSK APP +S
Sbjct: 1 MKHFMLPRNAILRETSSGEEQSPNPSSHK--SKPSS-----RKLKSSKENAPPPDLNSLT 53
Query: 555 TILPATPSSSKASRKSAPPAITTPCSSKSLR 585
+ L S+ A KS P P SS L+
Sbjct: 54 SDLKPDHRSASAKLKSPLPP--RPPSSNPLK 82
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 32.8 bits (74), Expect = 0.81
Identities = 21/81 (25%), Positives = 32/81 (39%), Gaps = 7/81 (8%)
Query: 478 KHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQA 537
S+ +E+ I E+ L P PG+ T+ + C P +S G TP S
Sbjct: 194 SFSIRLEMGGNGIDETCDLEDNYISPVLPGSNTFFITCSGDKPHFASGGILTPIS----- 248
Query: 538 RTPTSSKAPPTTPSSSKTILP 558
P + P T + S + P
Sbjct: 249 --PMKTPTPGTGYNYSLWLGP 267
Score = 32.4 bits (73), Expect = 0.96
Identities = 22/96 (22%), Positives = 36/96 (37%), Gaps = 5/96 (5%)
Query: 504 TTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATP-- 561
TP T S + T P S + TP++ + T + PTT +S T +P
Sbjct: 472 GTPTGTTSSTLPEDTSPTSRTT-SATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTP 530
Query: 562 --SSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQT 595
+ S P T+ ++ S + +S T
Sbjct: 531 IVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNT 566
Score = 31.3 bits (70), Expect = 2.4
Identities = 19/83 (22%), Positives = 32/83 (38%), Gaps = 6/83 (7%)
Query: 500 EQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPS------SKARQARTPTSSKAPPTTPSSS 553
E TT T S+P T P S+ T+ + S + +PTS T ++S
Sbjct: 442 EPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATS 501
Query: 554 KTILPATPSSSKASRKSAPPAIT 576
T TP+++ + +
Sbjct: 502 PTPAVTTPNATSPTTQKTSDTPN 524
Score = 30.1 bits (67), Expect = 5.1
Identities = 16/90 (17%), Positives = 30/90 (33%), Gaps = 3/90 (3%)
Query: 511 YSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKS 570
+ P T P++ + PTS TT S+ T ++ A+ +
Sbjct: 440 HVEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNA 499
Query: 571 APPAITTPC---SSKSLRSGSDSQQLQTIT 597
P +S + + SD+ + T
Sbjct: 500 TSPTPAVTTPNATSPTTQKTSDTPNATSPT 529
Score = 29.8 bits (66), Expect = 7.5
Identities = 13/75 (17%), Positives = 34/75 (45%), Gaps = 1/75 (1%)
Query: 504 TTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSS 563
T+ N T + +P +++ T + + T ++P+S + +P++ S
Sbjct: 547 TSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHS 606
Query: 564 S-KASRKSAPPAITT 577
+ +++ S P +T+
Sbjct: 607 TPRSNSTSTTPLLTS 621
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 32.6 bits (74), Expect = 0.84
Identities = 22/118 (18%), Positives = 43/118 (36%), Gaps = 3/118 (2%)
Query: 478 KHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQA 537
K S E + L L TP +R + P + +++ A +
Sbjct: 546 KASAGDERTGLRPESEPGTLPYGSSVQTPPDRPKAATKGSRKPSPRKEPKSSVPPAAEKR 605
Query: 538 RTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQT 595
+ + SK P + +T + SS +S PP+ +P +++S + S +
Sbjct: 606 KYKSPSKIVPKSREFIET---DSSSSDSPEDESLPPSSQSPGNTESSKESCASLRTPV 660
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 31.9 bits (73), Expect = 0.95
Identities = 19/82 (23%), Positives = 27/82 (32%), Gaps = 13/82 (15%)
Query: 501 QQPTT----PGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTI 556
QQPT P N + TP S Q ++ +Q T+ P T P +
Sbjct: 97 QQPTQLSEVPYN-------EQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTA- 148
Query: 557 LPATPSSSKASRKSAPPAITTP 578
P P + R +T
Sbjct: 149 -PVQPQTPAPVRTQPAAPVTQA 169
Score = 29.6 bits (67), Expect = 5.5
Identities = 13/69 (18%), Positives = 26/69 (37%), Gaps = 6/69 (8%)
Query: 516 DMTPPPSSSK-----GQTTPSSKAR-QARTPTSSKAPPTTPSSSKTILPATPSSSKASRK 569
DM P+ QT ++ Q + + PP T + + + P +++ +
Sbjct: 94 DMRQQPTQLSEVPYNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQ 153
Query: 570 SAPPAITTP 578
+ P T P
Sbjct: 154 TPAPVRTQP 162
>gnl|CDD|239731 cd03762, proteasome_beta_type_6, proteasome beta type-6 subunit.
The 20S proteasome, multisubunit proteolytic complex, is
the central enzyme of nonlysosomal protein degradation
in both the cytosol and nucleus. It is composed of 28
subunits arranged as four homoheptameric rings that
stack on top of one another forming an elongated
alpha-beta-beta-alpha cylinder with a central cavity.
The proteasome alpha and beta subunits are members of
the N-terminal nucleophile (Ntn)-hydrolase superfamily.
Their N-terminal threonine residues are exposed as a
nucleophile in peptide bond hydrolysis. Mammals have 7
alpha and 7 beta proteasome subunits while archaea have
one of each.
Length = 188
Score = 31.4 bits (72), Expect = 0.99
Identities = 18/78 (23%), Positives = 29/78 (37%), Gaps = 12/78 (15%)
Query: 567 SRKSAPPAITTPCSSK---------SLRSGS--DSQQLQTITAEYIIYHAII-SRRLQTK 614
SR S + + K RSGS D+Q + Y+ H+I K
Sbjct: 18 SRTSTGSYVANRVTDKLTQLHDRIYCCRSGSAADTQAIADYVRYYLDMHSIELGEPPLVK 77
Query: 615 ISRTAFYMMCAKDQDSLS 632
+ + F +C ++ LS
Sbjct: 78 TAASLFKNLCYNYKEMLS 95
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 32.5 bits (74), Expect = 1.0
Identities = 25/120 (20%), Positives = 41/120 (34%), Gaps = 13/120 (10%)
Query: 486 SDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKA 545
+ +AK +EE + +R + D P S+ +T + AR+A A
Sbjct: 1246 AKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETVKAVPARRAAARKKPLA 1305
Query: 546 PPTTPSSSKT--------ILPATPSSSKASRKSA---PPAITTPCSSKSLRSGSDSQQLQ 594
+ S S + A K RK A A P ++K + G + Q
Sbjct: 1306 SVSVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAK--KRGPATVQSG 1363
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 32.2 bits (73), Expect = 1.1
Identities = 23/84 (27%), Positives = 33/84 (39%), Gaps = 8/84 (9%)
Query: 498 KEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSS-----KAPPTTPSS 552
KEE + P + S P P P S ++ +++P S K SS
Sbjct: 386 KEEPVDSNPSSPGNSGPA--RPSPESKDKGKRKAA-NEVSKSPASVPAKKLKTENAPKSS 442
Query: 553 SKTILPATPSSSKASRKSAPPAIT 576
S P T S SK+S +A +T
Sbjct: 443 SGKSTPQTFSGSKSSSNAADGGVT 466
>gnl|CDD|215039 PLN00041, PLN00041, photosystem I reaction center subunit II;
Provisional.
Length = 196
Score = 31.5 bits (71), Expect = 1.2
Identities = 14/59 (23%), Positives = 26/59 (44%), Gaps = 5/59 (8%)
Query: 520 PPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTP 578
P+SS + P + S+ P + S+++ + A + +A+ K AP T P
Sbjct: 5 TPASSGRKLVPWASFL----SKSTSKAPASLSATRAVRAAAAAE-EAAAKEAPVGFTPP 58
>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain. This region of the APC
family of proteins is known as the basic domain. It
contains a high proportion of positively charged amino
acids and interacts with microtubules.
Length = 359
Score = 32.0 bits (72), Expect = 1.2
Identities = 18/71 (25%), Positives = 32/71 (45%), Gaps = 5/71 (7%)
Query: 505 TPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTT-----PSSSKTILPA 559
+P R+ + P + PSSS QT+ S+ P +++ P +S + +P
Sbjct: 64 SPPPRSATPPARLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNSLSQVPR 123
Query: 560 TPSSSKASRKS 570
T S ++A S
Sbjct: 124 TSSPARALLAS 134
Score = 31.2 bits (70), Expect = 1.8
Identities = 20/81 (24%), Positives = 33/81 (40%), Gaps = 4/81 (4%)
Query: 513 VPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSK-TILPATPSSSKASRKSA 571
P TPP +K ++ SS+ P P T S+ + +ILP +S +++
Sbjct: 66 PPRSATPPARLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNSLSQVPRTS 125
Query: 572 PPAITTPCSSKSLRSGSDSQQ 592
PA S S +Q+
Sbjct: 126 SPARA---LLASSGSQHKTQK 143
Score = 31.2 bits (70), Expect = 1.9
Identities = 23/81 (28%), Positives = 33/81 (40%), Gaps = 5/81 (6%)
Query: 518 TPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSA---PPA 574
P P++ TTPS K +T S PP +PS + + +K S + PP
Sbjct: 11 IPGPANRSQSTTPSKKGPPLKTQPS--DPPKSPSPGQQRSRSLHRPAKPSELAELSPPPR 68
Query: 575 ITTPCSSKSLRSGSDSQQLQT 595
TP + + S S Q T
Sbjct: 69 SATPPARLAKTPSSSSSQTST 89
>gnl|CDD|177618 PHA03381, PHA03381, tegument protein VP22; Provisional.
Length = 290
Score = 31.5 bits (71), Expect = 1.2
Identities = 22/108 (20%), Positives = 37/108 (34%), Gaps = 7/108 (6%)
Query: 477 HKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTP-PPSSSKGQTTPSSKAR 535
H +D + E P R ++ P P P ++G + R
Sbjct: 60 RFHHYDEARADYPYYTGSSSEDERPADPRPSRRPHAQPEASGPGPARGARGPAGSRGRGR 119
Query: 536 QARTPTSSKAPP-----TTPSSSKTILPATPSSSKASRKSAPPAITTP 578
+A +P S + PP + P K+ + + A +AP TP
Sbjct: 120 RAESP-SPRDPPNPKGASAPRGRKSACADSAALLDAPAPAAPKRQKTP 166
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 31.9 bits (72), Expect = 1.2
Identities = 30/161 (18%), Positives = 53/161 (32%), Gaps = 18/161 (11%)
Query: 440 VILCEKSPGTDYCDSDSNELPPGPLRHNNVFDYAYSYHKHSVSMELSDLAIAESAKLLKE 499
+ LCE P C D N P + VS ++K
Sbjct: 132 LALCESGP---ICQRDYN-PRDRPKCRCTL-------RGKDVSCCKEPKTAVTTSKTTSW 180
Query: 500 EQQPTTPGNRTYSVPCDMTPPP-SSSKGQTTPSSKARQARTPTSSKAPPTTPSSSK-TIL 557
+ + P + P +TP +++G T ++ R + T TT S+ +
Sbjct: 181 PTEVSHP-----TYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEPQTE 235
Query: 558 PATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTITA 598
P + PP+ T+ S + +Q+ +T A
Sbjct: 236 PPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPA 276
Score = 30.4 bits (68), Expect = 4.1
Identities = 19/91 (20%), Positives = 35/91 (38%), Gaps = 3/91 (3%)
Query: 509 RTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASR 568
+T S P +++ P S+ T S A Q ++ ++ T T S+ +
Sbjct: 176 KTTSWPTEVSHPTYPSQ-VTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPE--P 232
Query: 569 KSAPPAITTPCSSKSLRSGSDSQQLQTITAE 599
++ PP S S + Q Q+ T +
Sbjct: 233 QTEPPPSQRGPSGSPQHPPSTTSQDQSTTGD 263
Score = 30.0 bits (67), Expect = 4.3
Identities = 17/74 (22%), Positives = 26/74 (35%)
Query: 505 TPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSS 564
T G T S P T PP S +G + + S + + P S+
Sbjct: 221 TQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNR 280
Query: 565 KASRKSAPPAITTP 578
++ +A P TT
Sbjct: 281 RSPHSTATPPPTTK 294
Score = 30.0 bits (67), Expect = 5.0
Identities = 26/131 (19%), Positives = 51/131 (38%), Gaps = 23/131 (17%)
Query: 500 EQQPTTPGNRTYSVPCDMTPPPSSSKGQTT-------------PSSKARQARTPTSSKAP 546
E Q P ++ PP ++S+ Q+T P + + + R+P S+ P
Sbjct: 231 EPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNR-RSPHSTATP 289
Query: 547 PTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSG--------SDSQQLQTITA 598
P T +T P TP + ++ + P ++P ++ + D + +I
Sbjct: 290 PPTTKRQETGRP-TPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKELDPPKPNSICY 348
Query: 599 EYIIYHAIISR 609
IY+ + R
Sbjct: 349 GVGIYNEALPR 359
Score = 30.0 bits (67), Expect = 5.1
Identities = 19/88 (21%), Positives = 36/88 (40%), Gaps = 2/88 (2%)
Query: 500 EQQPTTPGNRTYSVPCDMTP-PPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILP 558
+ QP T G++T + ++ P ++G TT S+ Q P S+ P+
Sbjct: 196 QSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEPQT-EPPPSQRGPSGSPQHPPSTT 254
Query: 559 ATPSSSKASRKSAPPAITTPCSSKSLRS 586
+ S+ + TP ++ + RS
Sbjct: 255 SQDQSTTGDGQEHTQRRKTPPATSNRRS 282
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 32.0 bits (72), Expect = 1.3
Identities = 21/88 (23%), Positives = 32/88 (36%), Gaps = 7/88 (7%)
Query: 491 AESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTP 550
+ K +E ++P P R+ P T P S + K+ + P S K+P P
Sbjct: 585 PKHPKDPEEPKKPKRP--RSAQRP---TRPKSPKLPELLDIPKS--PKRPESPKSPKRPP 637
Query: 551 SSSKTILPATPSSSKASRKSAPPAITTP 578
+ P P K + PP P
Sbjct: 638 PPQRPSSPERPEGPKIIKSPKPPKSPKP 665
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 31.8 bits (72), Expect = 1.3
Identities = 19/89 (21%), Positives = 35/89 (39%), Gaps = 6/89 (6%)
Query: 514 PCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPP 573
P D+ P + T +S TP+++ +PP+T + + A P + P
Sbjct: 199 PADVFVPATPRPTPRTTAS-PETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTP 257
Query: 574 AITTPCS-----SKSLRSGSDSQQLQTIT 597
A TP + + + S+ T+T
Sbjct: 258 APPTPGGGEAPPANATPAPEASRYELTVT 286
>gnl|CDD|219392 pfam07382, HC2, Histone H1-like nucleoprotein HC2. This family
contains the bacterial histone H1-like nucleoprotein HC2
(approximately 200 residues long), which seems to be
found mostly in Chlamydia. HC2 functions in DNA
condensation, although it has been suggested that it
also has other roles.
Length = 187
Score = 31.3 bits (70), Expect = 1.3
Identities = 13/60 (21%), Positives = 23/60 (38%), Gaps = 1/60 (1%)
Query: 532 SKARQARTPTSSKAPPTTPSSSKTIL-PATPSSSKASRKSAPPAITTPCSSKSLRSGSDS 590
+K A+ + KA P++ K + A + K + +A S+ GS S
Sbjct: 110 AKKPAAKKAVAKKAVARKPAAKKAVAKKAASTCHKNHKHTAACKRVASSSATRAACGSKS 169
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 31.7 bits (72), Expect = 1.4
Identities = 16/62 (25%), Positives = 24/62 (38%), Gaps = 7/62 (11%)
Query: 517 MTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAIT 576
+ P P+ + P++ A PT AP T P K A + R++A P
Sbjct: 360 LVPVPAPQPAK--PTAAAPSPVRPT--PAPSTRP---KAAAAANIPPKEPVRETATPPPV 412
Query: 577 TP 578
P
Sbjct: 413 PP 414
Score = 30.5 bits (69), Expect = 3.8
Identities = 17/95 (17%), Positives = 23/95 (24%), Gaps = 4/95 (4%)
Query: 502 QPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATP 561
+PT + PP +T P P S L
Sbjct: 380 RPTPAPSTRPKAAAAANIPPKEPVRETATPPPV----PPRPVAPPVPHTPESAPKLTRAA 435
Query: 562 SSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTI 596
K PPA + G +QL+ I
Sbjct: 436 IPVDEKPKYTPPAPPKEEEKALIADGDVLEQLEAI 470
Score = 29.8 bits (67), Expect = 6.1
Identities = 11/74 (14%), Positives = 21/74 (28%), Gaps = 1/74 (1%)
Query: 509 RTYSVPCDMTPP-PSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKAS 567
P T PS + PS++ + A P +++ +P P +
Sbjct: 363 VPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVP 422
Query: 568 RKSAPPAITTPCSS 581
T +
Sbjct: 423 HTPESAPKLTRAAI 436
>gnl|CDD|215299 PLN02543, PLN02543, pfkB-type carbohydrate kinase family protein.
Length = 496
Score = 31.8 bits (72), Expect = 1.5
Identities = 16/117 (13%), Positives = 36/117 (30%), Gaps = 12/117 (10%)
Query: 475 SYHKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTY--SVPCDMTPPPSSSKGQTTPSS 532
H S++ + S++ + +P + + S P + ++ PS
Sbjct: 9 HLHHSYSSLDRREKTCLRSSQKTRRFPKPKASLHPSIKRSRPGRCSTNGAAVPESPKPSR 68
Query: 533 KARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSD 589
+ R+ + +S TT +K K ++ P + D
Sbjct: 69 RGRKKKPTSSPPKAKTTRRRTK----------KTDQELDPEGAEEDQEAAEDGEDYD 115
>gnl|CDD|219401 pfam07404, TEBP_beta, Telomere-binding protein beta subunit (TEBP
beta). This family consists of several telomere-binding
protein beta subunits which appear to be specific to the
family Oxytrichidae. Telomeres are specialised
protein-DNA complexes that compose the ends of
eukaryotic chromosomes. Telomeres protect chromosome
termini from degradation and recombination and act
together with telomerase to ensure complete genome
replication. TEBP beta forms a complex with TEBP alpha
and this complex is able to recognise and bind ssDNA to
form a sequence-specific, telomeric nucleoprotein
complex that caps the very 3' ends of chromosomes.
Length = 375
Score = 31.6 bits (71), Expect = 1.5
Identities = 16/47 (34%), Positives = 24/47 (51%), Gaps = 5/47 (10%)
Query: 529 TPSSKARQARTPTSSKAPPTTPSS----SKTILPATPS-SSKASRKS 570
TP+ + + TP S+A ++ SK +PA PS S K S K+
Sbjct: 285 TPNKPSSRKETPQKSQAGKSSAKKTTTGSKKAVPANPSPSGKKSTKT 331
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 31.8 bits (72), Expect = 1.5
Identities = 13/74 (17%), Positives = 25/74 (33%), Gaps = 2/74 (2%)
Query: 516 DMTPPPSSSK--GQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPP 573
P +++ Q P++ A A P + P ++ A ++ +R+S P
Sbjct: 371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAP 430
Query: 574 AITTPCSSKSLRSG 587
S R
Sbjct: 431 EALAAARQASARGP 444
Score = 31.4 bits (71), Expect = 2.1
Identities = 17/66 (25%), Positives = 27/66 (40%), Gaps = 3/66 (4%)
Query: 521 PSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCS 580
P S G P A A P + AP ++ PA P ++ A+ +A A +
Sbjct: 365 PGQSGGGAGP---ATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAA 421
Query: 581 SKSLRS 586
+ + RS
Sbjct: 422 APARRS 427
Score = 31.0 bits (70), Expect = 2.6
Identities = 14/67 (20%), Positives = 26/67 (38%), Gaps = 7/67 (10%)
Query: 512 SVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSA 571
+ P +P P + +++ AR P + AP P+++ A ++ R A
Sbjct: 421 AAPARRSPAPEA-----LAAARQASARGPGGAPAPAPAPAAAPA--AAARPAAAGPRPVA 473
Query: 572 PPAITTP 578
A P
Sbjct: 474 AAAAAAP 480
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 31.6 bits (71), Expect = 1.6
Identities = 25/105 (23%), Positives = 37/105 (35%), Gaps = 12/105 (11%)
Query: 503 PTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAP-------PTTPSSSKT 555
P + G T P P ++ TPSS A P+++ +P P P S
Sbjct: 389 PASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSG- 447
Query: 556 ILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTITAEY 600
+P P+ P A P S S + + TAE+
Sbjct: 448 -IPPRPAPRMPEASPVPGA---PDSVASASDAPPTLGDPSDTAEH 488
>gnl|CDD|218639 pfam05564, Auxin_repressed, Dormancy/auxin associated protein.
This family contains several plant dormancy-associated
and auxin-repressed proteins the function of which are
poorly understood.
Length = 119
Score = 29.9 bits (67), Expect = 1.8
Identities = 14/58 (24%), Positives = 22/58 (37%), Gaps = 1/58 (1%)
Query: 518 TPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKT-ILPATPSSSKASRKSAPPA 574
T P S SSK + R+ + + PT+P++ T P + R P
Sbjct: 24 TSPLSIKDSSEGGSSKDKVQRSISMPASTPTSPATPTTPTSPFSARKENVWRSVFHPG 81
>gnl|CDD|218190 pfam04651, Pox_A12, Poxvirus A12 protein.
Length = 188
Score = 30.5 bits (69), Expect = 1.9
Identities = 18/81 (22%), Positives = 30/81 (37%), Gaps = 6/81 (7%)
Query: 518 TPPPSSSKGQTTPSS-KARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAIT 576
+ + + PS + + + TS+ + + S+ T S S SR AP
Sbjct: 38 NRGGNLAGPENNPSDNEVKAGKRVTSASSSKSKRCSTSTSKTKPCSRSSRSRSGAPRRRG 97
Query: 577 TPCSSKSLRSGSDSQQLQTIT 597
T S D Q +Q +T
Sbjct: 98 TAFG-----SMEDPQIVQAVT 113
>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
component of pyruvate dehydrogenase complex.
Length = 539
Score = 31.4 bits (71), Expect = 1.9
Identities = 17/46 (36%), Positives = 22/46 (47%), Gaps = 2/46 (4%)
Query: 529 TPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPA 574
PSS A A +K P P + PA+ KAS+ SAPP+
Sbjct: 201 KPSSSAAPA--APKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPS 244
>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210). This is
a family of proteins conserved in yeasts. The function
is not known. The Schizosaccharomyces pombe member is
SPBC18E5.07 and the Saccharomyces cerevisiae member is
AIM21.
Length = 671
Score = 31.4 bits (71), Expect = 2.0
Identities = 21/108 (19%), Positives = 33/108 (30%), Gaps = 4/108 (3%)
Query: 492 ESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPS 551
E A E+ P + P + S S Q T + P +S+ P S
Sbjct: 389 EIAVKPPTEESSRRPEEEKHRFPSEDVWEDSPSSLQDTATVSTPSNPPPRASETPEQETS 448
Query: 552 SSKTILPATPSSSKASR----KSAPPAITTPCSSKSLRSGSDSQQLQT 595
S + + P S+ + S +SQ+L T
Sbjct: 449 RSSSEVSLDPHQSELKSEKKKARPEVSKQRFPSRDVWEDAPESQELVT 496
Score = 31.1 bits (70), Expect = 2.3
Identities = 18/91 (19%), Positives = 33/91 (36%), Gaps = 8/91 (8%)
Query: 486 SDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKA 545
S++++ LK E++ P P + P S+ T +
Sbjct: 452 SEVSLDPHQSELKSEKKKARPE------VSKQRFPSRDVW-EDAPESQELVTTEETPEEV 504
Query: 546 PPTTPSSSKTILPATPSSSK-ASRKSAPPAI 575
++P +K +P+ P K S K PP +
Sbjct: 505 KSSSPGVTKPAIPSRPKKGKPTSEKRKPPPV 535
Score = 30.3 bits (68), Expect = 4.3
Identities = 20/64 (31%), Positives = 31/64 (48%), Gaps = 3/64 (4%)
Query: 521 PSSSKGQTTPSSKAR-QARTPTSSK-APPTTPSSSKTILPATPSSSKASRKSAPPAITTP 578
SSS G T P+ +R + PTS K PP P K +PA P+ + +++ A ++
Sbjct: 505 KSSSPGVTKPAIPSRPKKGKPTSEKRKPPPVPKKPKPQIPARPAKLQ-KQQAGEEANSSA 563
Query: 579 CSSK 582
K
Sbjct: 564 FKPK 567
>gnl|CDD|222011 pfam13257, DUF4048, Domain of unknown function (DUF4048). This
presumed domain is functionally uncharacterized. This
domain family is found in eukaryotes, and is typically
between 228 and 257 amino acids in length.
Length = 242
Score = 30.9 bits (70), Expect = 2.0
Identities = 17/83 (20%), Positives = 30/83 (36%), Gaps = 17/83 (20%)
Query: 518 TPPPSSSKGQTTPSSKARQ-------ARTPTSSKAPPTTPSSSKTILPATPSSSKAS--- 567
PPP S + + +S++R + + +S + +S K A S S
Sbjct: 122 VPPPRSRRSGSRSTSRSRLRLQGGSLSSSRSSRSSTSKGATSGKDSKSADIDVSFWSEFG 181
Query: 568 -------RKSAPPAITTPCSSKS 583
KS A +TP + +
Sbjct: 182 IDTPGQKSKSPQKASSTPAGNTN 204
>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain. Syndecans are transmembrane
heparin sulfate proteoglycans which are implicated in
the binding of extracellular matrix components and
growth factors.
Length = 207
Score = 30.5 bits (69), Expect = 2.1
Identities = 18/80 (22%), Positives = 30/80 (37%), Gaps = 6/80 (7%)
Query: 516 DMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAI 575
+T + K TT SS T T+S T+ +S T+ +++ S A
Sbjct: 70 PVTTSATPPKLTTTSSSP--SNDTTTAS----TSTKTSPTVSTTVTTTTSPSETDTEEAT 123
Query: 576 TTPCSSKSLRSGSDSQQLQT 595
TT + GS + +
Sbjct: 124 TTVSTETPTEGGSSAATDPS 143
>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein. Tymoviruses
are single stranded RNA viruses. This family includes a
protein of unknown function that has been named based on
its molecular weight. Tymoviruses such as the ononis
yellow mosaic tymovirus encode only three proteins. Of
these two are overlapping this protein overlaps a larger
ORF that is thought to be the polymerase.
Length = 458
Score = 31.2 bits (71), Expect = 2.1
Identities = 18/69 (26%), Positives = 30/69 (43%), Gaps = 5/69 (7%)
Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRK--SAPPAIT 576
P + + P +R+P+S +P PS+ + +LP P +ASR P +
Sbjct: 245 PLHPHTTRPSPPRP--AFSRSPSSPLSPLPRPSTRRGLLP-NPRLPRASRGHLPPPTSSA 301
Query: 577 TPCSSKSLR 585
P + LR
Sbjct: 302 PPRPNGGLR 310
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 31.2 bits (71), Expect = 2.1
Identities = 17/95 (17%), Positives = 29/95 (30%), Gaps = 5/95 (5%)
Query: 489 AIAESAKLLKEEQQP--TTPGNRTYSVPCDMTPPPSSSKGQTTP---SSKARQARTPTSS 543
A AE + E P + + P P +++ P + A A ++
Sbjct: 374 APAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAA 433
Query: 544 KAPPTTPSSSKTILPATPSSSKASRKSAPPAITTP 578
A + + L P + A A P P
Sbjct: 434 PAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAP 468
Score = 29.3 bits (66), Expect = 9.3
Identities = 15/58 (25%), Positives = 26/58 (44%)
Query: 521 PSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTP 578
P+++ P+ K AR ++ A ++ PA ++ AS +APPA P
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPP 423
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 31.0 bits (70), Expect = 2.2
Identities = 25/140 (17%), Positives = 45/140 (32%), Gaps = 22/140 (15%)
Query: 470 FDYAYSYHKHSVS-MELSDLAI--AESAKLLKEEQQ--------------PTTPGN--RT 510
F+ S H +SV L D + +LKE P T N R
Sbjct: 79 FEEVTSVHVYSVQKAPLKDSNVLYDVDYDILKENLHNCSKNSLEYGKQAGPITNPNVKRR 138
Query: 511 YSVPCDMTPPPSSSKGQTTPSSKARQARTPTSS-KAPPTTPSSSKTILPATPSSSKASRK 569
V P +S + T + K ++ P S S+ KT + ++++
Sbjct: 139 TGVGLPPVAPAASPALKPTANGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEK--T 196
Query: 570 SAPPAITTPCSSKSLRSGSD 589
++ ++ S+
Sbjct: 197 EGKTSVKAASLKRNPPKKSN 216
>gnl|CDD|221361 pfam11990, DUF3487, Protein of unknown function (DUF3487). This
family of proteins is functionally uncharacterized. This
protein is found in bacteria. Proteins in this family
are typically between 121 to 136 amino acids in length.
This protein has a conserved RLN sequence motif.
Length = 122
Score = 29.5 bits (67), Expect = 2.2
Identities = 6/27 (22%), Positives = 10/27 (37%)
Query: 139 PHMLDRHPSIVPRMRTPVFWIITGLSL 165
P L+R P + + + GL
Sbjct: 8 PDRLNREPVVFRGLTADELGVAAGLGA 34
>gnl|CDD|213867 TIGR03824, FlgM_jcvi, flagellar biosynthesis anti-sigma factor
FlgM. FlgM interacts with and inhibits the alternative
sigma factor sigma(28) FliA. The C-terminus of FlgM
contains the sigma(28)-binding domain.
Length = 96
Score = 28.8 bits (65), Expect = 2.5
Identities = 12/52 (23%), Positives = 19/52 (36%)
Query: 548 TTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTITAE 599
+ S A++ SA S S+ S +QQLQ++ A
Sbjct: 5 GIGPLPVNAALRSTRSVAAAKASAAAQAAAASSGDSVSLSSLAQQLQSLEAA 56
>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 292
Score = 30.7 bits (69), Expect = 2.5
Identities = 26/106 (24%), Positives = 35/106 (33%), Gaps = 11/106 (10%)
Query: 485 LSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSK 544
LS L L E P R SVP P T P + K
Sbjct: 116 LSQLNNVAVTSTLPTEPATVAP-VRNASVPTAERPAI------TRPVRAQAVSEPAVEPK 168
Query: 545 APPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDS 590
A T ++ + A+P+ + A PPA +S L+S S
Sbjct: 169 AAKTATATEAKVQTASPAQTPA----TPPAGKGAAASGQLKSAPSS 210
>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
Length = 1467
Score = 31.3 bits (70), Expect = 2.5
Identities = 18/78 (23%), Positives = 29/78 (37%), Gaps = 1/78 (1%)
Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPS-SSKTILPATPSSSKASRKSAPPAITT 577
P+ + + + + P + PP S + + S S+PPA TT
Sbjct: 1180 GIPAPQLLSSYIPTGSVLYQDPIFTYIPPGIIGMSGTNTFTFKAAQLQLSAASSPPAATT 1239
Query: 578 PCSSKSLRSGSDSQQLQT 595
P S S S +Q + T
Sbjct: 1240 PTPPPSSSSSSSAQSIST 1257
>gnl|CDD|135173 PRK04654, PRK04654, sec-independent translocase; Provisional.
Length = 214
Score = 30.6 bits (68), Expect = 2.5
Identities = 14/69 (20%), Positives = 27/69 (39%), Gaps = 7/69 (10%)
Query: 530 PSSKARQARTPTSSKAPPTTPSSSKTILPA-------TPSSSKASRKSAPPAITTPCSSK 582
+ A P ++A P P+ +T++PA P ++ A P P +
Sbjct: 137 QAHTPVPAPAPVIAQAQPIAPAPHQTLVPAPHDTIVPAPHAAHLPSAPATPVSVAPVDAG 196
Query: 583 SLRSGSDSQ 591
+ S + S+
Sbjct: 197 TSASPTPSE 205
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 30.7 bits (69), Expect = 2.6
Identities = 16/73 (21%), Positives = 36/73 (49%), Gaps = 1/73 (1%)
Query: 527 QTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRS 586
+ T K + + P + KA T+ ++KT A +S+KA++K+A T + + +
Sbjct: 5 KKTTKKKTTEEKKPAAKKAT-TSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKV 63
Query: 587 GSDSQQLQTITAE 599
++ +++ E
Sbjct: 64 TVKFEKTESVKKE 76
Score = 30.3 bits (68), Expect = 3.0
Identities = 9/74 (12%), Positives = 19/74 (25%), Gaps = 5/74 (6%)
Query: 518 TPPPSSSKGQTTPSSKARQARTPTSSKAPPT-----TPSSSKTILPATPSSSKASRKSAP 572
P + + + A+T ++ A T S T T K
Sbjct: 17 KPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKE 76
Query: 573 PAITTPCSSKSLRS 586
+++ +
Sbjct: 77 SVAKKTVKKEAVSA 90
>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
family).
Length = 619
Score = 31.0 bits (70), Expect = 2.8
Identities = 20/90 (22%), Positives = 30/90 (33%), Gaps = 5/90 (5%)
Query: 501 QQPTTPGNRTYSVPCDMTPPP-SSSKGQT--TPSSKARQARTPTSSKAPPTTPSSSKTIL 557
G+ SVP TP + + T TPS K T +S + S S
Sbjct: 456 VMEPPYGSTESSVP--STPSTRRNDRNITSNTPSLKRTPNLTKSSLSQEASLISKSTGNT 513
Query: 558 PATPSSSKASRKSAPPAITTPCSSKSLRSG 587
+ + + PA + +RSG
Sbjct: 514 HKHSTPRRLTTLPKLPAASRSSKGNLIRSG 543
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 30.8 bits (69), Expect = 2.9
Identities = 16/67 (23%), Positives = 26/67 (38%), Gaps = 4/67 (5%)
Query: 514 PCDMTPP---PSSSKGQTTPSSKARQAR-TPTSSKAPPTTPSSSKTILPATPSSSKASRK 569
P PP + S TPS++A + +P +++ P S L + PS
Sbjct: 179 PSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLP 238
Query: 570 SAPPAIT 576
S P +
Sbjct: 239 SPHPPLQ 245
>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936).
This family consists of several hypothetical proteins
from Arabidopsis thaliana and Oryza sativa. The function
of this family is unknown.
Length = 564
Score = 31.0 bits (70), Expect = 2.9
Identities = 17/81 (20%), Positives = 27/81 (33%), Gaps = 11/81 (13%)
Query: 517 MTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASR-------- 568
+ + K + S R +SS +P S AT + SK S
Sbjct: 226 VAVLSAPRKPGSRSSDCKSSPRARSSS---AKSPFKSSIQRKATKALSKLSLRASPKDTS 282
Query: 569 KSAPPAITTPCSSKSLRSGSD 589
KS+ + P S++ S
Sbjct: 283 KSSKSEVAPPKKSEAKVPSSS 303
Score = 29.0 bits (65), Expect = 9.4
Identities = 19/70 (27%), Positives = 28/70 (40%), Gaps = 5/70 (7%)
Query: 521 PSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSK----TILPATPSSSKASRKSAPPAIT 576
PS S G + PSS + + S+ + S K P P S + KS+P A +
Sbjct: 192 PSPSGGTSCPSSSGGRRSSI-GSRRLRGSASLRKKVAVLSAPRKPGSRSSDCKSSPRARS 250
Query: 577 TPCSSKSLRS 586
+ S S
Sbjct: 251 SSAKSPFKSS 260
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 30.6 bits (69), Expect = 3.0
Identities = 16/54 (29%), Positives = 27/54 (50%), Gaps = 3/54 (5%)
Query: 521 PSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPA 574
P++ G S+KA+ A S K T S+K A +++KA++K+ A
Sbjct: 206 PAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAK---KAAKTAAKAAKKAKKTA 256
Score = 30.2 bits (68), Expect = 4.0
Identities = 10/68 (14%), Positives = 23/68 (33%), Gaps = 3/68 (4%)
Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPS---SSKTILPATPSSSKASRKSAPPAI 575
+ + + A A+ + A + + K + A + KA++K+A A
Sbjct: 220 KSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAA 279
Query: 576 TTPCSSKS 583
+
Sbjct: 280 KAAKGAAK 287
>gnl|CDD|233367 TIGR01349, PDHac_trf_mito, pyruvate dehydrogenase complex
dihydrolipoamide acetyltransferase, long form. This
model represents one of several closely related clades
of the dihydrolipoamide acetyltransferase subunit of the
pyruvate dehydrogenase complex. It includes sequences
from mitochondria and from alpha and beta branches of
the proteobacteria, as well as from some other bacteria.
Sequences from Gram-positive bacteria are not included.
The non-enzymatic homolog protein X, which serves as an
E3 component binding protein, falls within the clade
phylogenetically but is rejected by its low score
[Energy metabolism, Pyruvate dehydrogenase].
Length = 436
Score = 30.5 bits (69), Expect = 3.0
Identities = 14/52 (26%), Positives = 18/52 (34%), Gaps = 5/52 (9%)
Query: 529 TPSSKARQARTP--TSSKAPPTTPSSSKTILPATPSSS---KASRKSAPPAI 575
S A A P + APP+ P S +P S S K + I
Sbjct: 89 KLESSASPAPKPSEIAPTAPPSAPKPSPAPQKQSPEPSSPAPLSDKESGDRI 140
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 30.8 bits (69), Expect = 3.0
Identities = 17/76 (22%), Positives = 29/76 (38%)
Query: 508 NRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKAS 567
N Y+ P + P S+ P S + TP S+ P SS + ++
Sbjct: 431 NTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQH 490
Query: 568 RKSAPPAITTPCSSKS 583
R + PA P +++
Sbjct: 491 RAANQPAANLPTANQP 506
Score = 30.4 bits (68), Expect = 4.8
Identities = 20/70 (28%), Positives = 31/70 (44%), Gaps = 2/70 (2%)
Query: 508 NRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKAS 567
N YS P + P +++ TP + + TP S+ P + P S TP S+ A
Sbjct: 411 NAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSN-PPNSNPPYSNLPYSNTPYSN-AP 468
Query: 568 RKSAPPAITT 577
+APP+
Sbjct: 469 LSNAPPSSAK 478
>gnl|CDD|237803 PRK14724, PRK14724, DNA topoisomerase III; Provisional.
Length = 987
Score = 30.7 bits (69), Expect = 3.0
Identities = 15/66 (22%), Positives = 25/66 (37%), Gaps = 4/66 (6%)
Query: 520 PPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTIL----PATPSSSKASRKSAPPAI 575
PP + ++ A T A P +++K + A A +K+APPA
Sbjct: 855 PPRKTAAAKAGAASAAFGGTVAVKAAKPAKKAAAKKVAAKTAAAKTPRKAAKKKAAPPAA 914
Query: 576 TTPCSS 581
S+
Sbjct: 915 GLKPSA 920
>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9. The KAR9 protein in
Saccharomyces cerevisiae is a cytoskeletal protein
required for karyogamy, correct positioning of the
mitotic spindle and for orientation of cytoplasmic
microtubules. KAR9 localises at the shmoo tip in mating
cells and at the tip of the growing bud in anaphase.
Length = 626
Score = 30.6 bits (69), Expect = 3.1
Identities = 18/112 (16%), Positives = 31/112 (27%), Gaps = 3/112 (2%)
Query: 477 HKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQ 536
+ E + + T N + + D P + P++
Sbjct: 438 LRRMNIKPTLASIPDEKPSNISVFEDSETSPNSSTLL-RDPPPKKCGEESGHLPNNPFFN 496
Query: 537 ARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGS 588
T S PP S ++I+ S ASR S+ S +
Sbjct: 497 KLKLTLSSIPPL--SPRQSIITLPTPSRPASRISSLSLRLGSYSGSIVSPPP 546
>gnl|CDD|205278 pfam13097, CENP-U, CENP-A nucleosome associated complex (NAC)
subunit. CENP-U is one of the components that assembles
onto the CENP-A-nucleosome associated complex (NAC). The
centromere, which is the basic element of chromosome
inheritance, is epigenetically determined in mammals.
CENP-A, the centromere-specific histone H3 variant,
assembles an array of nucleosomes and it is this that
seems to be the prime candidate for specifying
centromere identity. CENP-A nucleosomes directly recruit
a proximal CENP-A nucleosome associated complex (NAC)
comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
CENP-C and CENP-H. Assembly of the CENP-A NAC at
centromeres is dependent on CENP-M, CENP-N and CENP-T.
Additionally, there are seven other subunits which make
up the CENP-A-nucleosome distal (CAD) centromere,
CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
CENP-S, also assembling on the CENP-A NAC. FTA4 is the
equivalent component of the fission yeast Sim4 complex.
Length = 174
Score = 29.9 bits (67), Expect = 3.1
Identities = 24/140 (17%), Positives = 47/140 (33%), Gaps = 10/140 (7%)
Query: 498 KEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTIL 557
++P++ V D TP + + P+S P S S+T
Sbjct: 10 PSAERPSS---ERPEVTSDAASLSEKPASDVTPPKTSPSSSKPSSEDESPANQSQSETQK 66
Query: 558 PATPSSSKASRKSA-----PPAITTPCSSKSLRSGSDSQQLQTITAEYIIYHAIISRRLQ 612
+P K R S ++ C + SD +L + +E+ +R++
Sbjct: 67 KLSPGRRKKPRSSHSGSDSSDSVHVWCLEG--KKRSDITELDVVLSEFEKICLEYRQRVE 124
Query: 613 TKISRTAFYMMCAKDQDSLS 632
+K + A ++ L
Sbjct: 125 SKACKQAIKDFFNNLKEELI 144
>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
This family consists of several brain acid soluble
protein 1 (BASP1) or neuronal axonal membrane protein
NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
calmodulin-binding protein of unknown function.
Length = 233
Score = 30.2 bits (67), Expect = 3.2
Identities = 21/108 (19%), Positives = 39/108 (36%), Gaps = 10/108 (9%)
Query: 491 AESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTP 550
E +E+ P +T PP +S + P++ A +AP +
Sbjct: 75 GEKEAAAAKEEAPKAEPEKTEGAAEAKAEPPKASDPEQEPAAAPGPA---AGGEAPKASE 131
Query: 551 SSSKTILPATPSSSK-------ASRKSAPPAITTPCSSKSLRSGSDSQ 591
+SS+ A P+ + ++K+ PA + SDS+
Sbjct: 132 ASSQPAESAAPAKEEEKSKEEGEAKKTEAPAAAAQETKSDAAPASDSK 179
>gnl|CDD|221472 pfam12228, DUF3604, Protein of unknown function (DUF3604). This
family of proteins is found in bacteria. Proteins in
this family are typically between 621 and 693 amino
acids in length.
Length = 592
Score = 30.7 bits (70), Expect = 3.2
Identities = 15/42 (35%), Positives = 21/42 (50%), Gaps = 4/42 (9%)
Query: 102 TKLPPRPSPFPTLDWANEEDLWDLL---CAKDQDSLSLRDPH 140
+P + PF +LD N EDLWD + + S+ L PH
Sbjct: 178 GAVPAQVLPFSSLDSPNPEDLWDWMDAYEEQTGGSV-LAIPH 218
>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
Length = 226
Score = 30.2 bits (68), Expect = 3.4
Identities = 20/92 (21%), Positives = 35/92 (38%), Gaps = 9/92 (9%)
Query: 499 EEQQPTTPGNRT--YSVPCDMTPPP-SSSKGQTTPSSKARQARTPTSSKAPP------TT 549
+ + T P + S D P + + Q+ P + + T ++ PP
Sbjct: 80 DGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAAARDG 139
Query: 550 PSSSKTILPATPSSSKASRKSAPPAITTPCSS 581
P+ T PATP ++ R+ P + P S
Sbjct: 140 PTPDPTAQPATPDERRSPRQRPPVSGEPPTPS 171
Score = 29.8 bits (67), Expect = 4.5
Identities = 18/86 (20%), Positives = 28/86 (32%), Gaps = 12/86 (13%)
Query: 500 EQQPTTPGNRTYSVPCDMTPPPSSSKGQTT-------PSSKARQARTPTSSKAPPTTPSS 552
+ QP + D + PP +S T P++ A + A P TP
Sbjct: 99 DAQPAAE-----AEAADQSAPPEASSTSATDEAATDPPATAAARDGPTPDPTAQPATPDE 153
Query: 553 SKTILPATPSSSKASRKSAPPAITTP 578
++ P S + S P A
Sbjct: 154 RRSPRQRPPVSGEPPTPSTPDAHVAG 179
>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
dihydrolipoamide acetyltransferase, long form. This
model describes a subset of pyruvate dehydrogenase
complex dihydrolipoamide acetyltransferase specifically
close by both phylogenetic and per cent identity (UPGMA)
trees. Members of this set include two or three copies
of the lipoyl-binding domain. E. coli AceF is a member
of this model, while mitochondrial and some other
bacterial forms belong to a separate model [Energy
metabolism, Pyruvate dehydrogenase].
Length = 546
Score = 30.2 bits (68), Expect = 3.8
Identities = 11/52 (21%), Positives = 20/52 (38%), Gaps = 1/52 (1%)
Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPS-SSKTILPATPSSSKASRK 569
P + S T P A A + AP + + + A P+ + +R+
Sbjct: 206 QPAAQSPAATQPEPAAAPAAAKAQAPAPQQAGTQNPAKVDHAAPAVRRLARE 257
>gnl|CDD|225711 COG3170, FimV, Tfp pilus assembly protein FimV [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 755
Score = 30.3 bits (68), Expect = 3.9
Identities = 13/53 (24%), Positives = 22/53 (41%), Gaps = 3/53 (5%)
Query: 509 RTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATP 561
R Y+V D PP S + + P ++ APP P +++ + P
Sbjct: 133 REYTVLLD---PPGYSPKSALSAEPSHPVPAPAAASAPPPPPRAARPVRQPAP 182
>gnl|CDD|227952 COG5665, NOT5, CCR4-NOT transcriptional regulation complex, NOT5
subunit [Transcription].
Length = 548
Score = 30.4 bits (68), Expect = 4.0
Identities = 39/164 (23%), Positives = 53/164 (32%), Gaps = 37/164 (22%)
Query: 429 EDCIEWMNIYWVILCEKSPGTDYCDSDSNELPPGPLRHNNVFDYAYSYHKHSVSMELSDL 488
D IE+ IY + CE P S SN P NN + +
Sbjct: 184 PDFIEYDTIYEDMGCEIQP------SSSNN--EAPKEGNN----------QTSLSSIRSS 225
Query: 489 AIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPT 548
E + K Q+ S S TTP + ++ + + S P
Sbjct: 226 KKQERSPKKKAPQRDV-----------------SISDRATTPIAPGVESASQSISSTP-- 266
Query: 549 TPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQ 592
TP S+ T L S S TT S K S +DS+Q
Sbjct: 267 TPVSTDTPLHTVKDDSIKFDNSTLGTPTTHVSMKKKESENDSEQ 310
>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
function prediction only].
Length = 872
Score = 30.2 bits (68), Expect = 4.3
Identities = 16/75 (21%), Positives = 23/75 (30%), Gaps = 4/75 (5%)
Query: 505 TPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT---- 560
T T S + Q +T TSS +P T S ++T T
Sbjct: 781 TKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTS 840
Query: 561 PSSSKASRKSAPPAI 575
PS + P +
Sbjct: 841 PSQTTTGGGICGPIV 855
>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 153
Score = 29.1 bits (65), Expect = 4.4
Identities = 16/57 (28%), Positives = 23/57 (40%), Gaps = 7/57 (12%)
Query: 531 SSKARQARTPTSSKAP--PTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLR 585
S+ Q TP + AP P+ P+ PA +S + +T P K LR
Sbjct: 46 SAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPAS-----AGENVVTAPMPGKILR 97
>gnl|CDD|115805 pfam07175, Osteoregulin, Osteoregulin. This family represents a
conserved region approximately 180 residues long within
osteoregulin, a bone-remodelling protein expressed
highly in osteocytes within trabecular and cortical
bone. A conserved RGD motif is found towards the
C-terminal end of this region, and this is potentially
involved in integrin recognition.
Length = 163
Score = 29.2 bits (65), Expect = 4.4
Identities = 14/60 (23%), Positives = 24/60 (40%)
Query: 494 AKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSS 553
+LL EE + TP + +P D+ + SK + ++P SK S+
Sbjct: 68 IELLGEENKEKTPQSVLSVIPADVNYAKAHSKDKKNQQRDLLAQKSPVKSKHTHRIRRST 127
>gnl|CDD|221743 pfam12735, Trs65, TRAPP trafficking subunit Trs65. This family is
one of the subunits of the TRAPP Golgi trafficking
complex. TRAPP subunits are found in two different sized
complexes, TRAPP I and TRAPP II. While both complexes
contain the same seven subunits, Bet3p, Bet5p, Trs20p,
Trs23p, Trs31p, Trs33p and Trs85p, with TRAPPC human
equivalents, TRAPP II has the additional three
subunits,Trs65p, Trs120p and Trs130p. While it has been
implicated in cell wall biogenesis and stress response,
the role of Trs65 in TRAPP II is supported by the
findings that the protein co-localises with Trs130p, and
deletion of TRS65 in yeast leads to a conditional lethal
phenotype if either one of the other TRAPP II-specific
subunits is modified. Furthermore, the trs65 mutant has
reduced Ypt31/32p guanine nucleotide exchange, GEF,
activity.
Length = 271
Score = 29.8 bits (67), Expect = 4.8
Identities = 21/75 (28%), Positives = 36/75 (48%)
Query: 494 AKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSS 553
K+L E + + ++ D + PP S + + + +P++S AP PSSS
Sbjct: 62 VKILVESENCSPRIETSWETNVDFSLPPPISTALRRSNRPSSLSISPSTSSAPSVGPSSS 121
Query: 554 KTILPATPSSSKASR 568
+T L A S+SK +
Sbjct: 122 RTSLRAANSNSKLNG 136
>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
Provisional.
Length = 481
Score = 29.8 bits (66), Expect = 5.2
Identities = 10/60 (16%), Positives = 24/60 (40%)
Query: 528 TTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSG 587
P+ + ++ T + T + K AT + K + P + ++ +++SG
Sbjct: 148 VAPTQEVKKETTTQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVKSG 207
>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
factor N terminal domain. The N terminus of the PEA3
transcription factors is implicated in transactivation
and in inhibition of DNA binding. Transactivation is
potentiated by activation of the Ras/MAP kinase and
protein kinase A signalling cascades. The N terminal
region contains conserved MAP kinase phosphorylation
sites.
Length = 336
Score = 29.8 bits (67), Expect = 5.3
Identities = 23/111 (20%), Positives = 40/111 (36%), Gaps = 10/111 (9%)
Query: 494 AKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSK-----APPT 548
K+ +E Q P+ N C PP + S A + + K + P
Sbjct: 85 TKIKREPQSPSKELNS----SCSQKQPPYPYGEKCLYSYSAYDRKPASGFKPPTPPSTPC 140
Query: 549 TPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTITAE 599
+P + + + S S S+PP+ TP ++S S + +E
Sbjct: 141 SPVNPQETVRQLQPSGPLS-NSSPPSPHTPLPNQSPLPPPMSSPDSSYPSE 190
>gnl|CDD|215362 PLN02676, PLN02676, polyamine oxidase.
Length = 487
Score = 29.7 bits (67), Expect = 5.6
Identities = 15/37 (40%), Positives = 19/37 (51%), Gaps = 2/37 (5%)
Query: 171 YEYRLYCVNNYFLFTIFTPSNIVLPILTLNKVTRQVS 207
YE +Y + FL T I P L LNKV R++S
Sbjct: 223 YESLVYYLAEQFLST--KSGKITDPRLKLNKVVREIS 257
>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1. Mating in
fungi is controlled by the loci that determine the
mating type of an individual, and only individuals with
differing mating types can mate. Basidiomycete fungi
have evolved a unique mating system, termed tetrapolar
or bifactorial incompatibility, in which mating type is
determined by two unlinked loci; compatibility at both
loci is required for mating to occur. The multi-allelic
tetrapolar mating system is considered to be a novel
innovation that could have only evolved once, and is
thus unique to the mushroom fungi. This domain is
C-terminal to the homeodomain transcription factor
region.
Length = 418
Score = 29.8 bits (67), Expect = 5.7
Identities = 14/78 (17%), Positives = 20/78 (25%), Gaps = 4/78 (5%)
Query: 516 DMTP--PPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPP 573
DMTP + RQ+R + +P S + S S
Sbjct: 38 DMTPDLKEQLKDEKRRRRQTPRQSRRSKRAAHAYPSPERSPALSSERLLSPSPS--VLDL 95
Query: 574 AITTPCSSKSLRSGSDSQ 591
+ R S S
Sbjct: 96 SPVLASPQTGKRRRSSSP 113
Score = 29.0 bits (65), Expect = 9.0
Identities = 18/81 (22%), Positives = 28/81 (34%), Gaps = 10/81 (12%)
Query: 527 QTTPSSKARQA-----RTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAIT----- 576
Q+ S +A A R+P S +PS S L +S + ++ + +
Sbjct: 60 QSRRSKRAAHAYPSPERSPALSSERLLSPSPSVLDLSPVLASPQTGKRRRSSSPSDDEDE 119
Query: 577 TPCSSKSLRSGSDSQQLQTIT 597
SK RS S S
Sbjct: 120 AERPSKRPRSDSISSSSSPAK 140
>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566). Family of
related proteins that is plant specific.
Length = 313
Score = 29.5 bits (66), Expect = 5.8
Identities = 20/119 (16%), Positives = 41/119 (34%), Gaps = 6/119 (5%)
Query: 477 HKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSK--- 533
+ + + S + L T+ +R + PPSSS + T SS
Sbjct: 5 VSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLN-APASPPSSSPARNTSSSSSFG 63
Query: 534 --ARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDS 590
++ + + + S S+ A S + + ++ ++P S+ S S
Sbjct: 64 LSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRRTTSSDLS 122
>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle.
Length = 519
Score = 29.9 bits (66), Expect = 5.9
Identities = 29/144 (20%), Positives = 48/144 (33%), Gaps = 8/144 (5%)
Query: 444 EKSPGTDYCDSDSNELPPGPLRHNNVFDYAYSYHKHSVSMELSDLAIAESAKLLKEEQQP 503
E+ + DSDS E P K S A A + + ++ P
Sbjct: 21 EEDSESSSEDSDSEEEMPAAKNPPQA--------KPSGKSPQVKAASAPAKESPQKGAPP 72
Query: 504 TTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSS 563
TPG + SS+ ++ + A T T+S A + + PA+ +
Sbjct: 73 VTPGKAGPAAAQAGEEEAKSSEEESDSEGETPTAATLTTSPAQAKPLGKNSQVRPASTVT 132
Query: 564 SKASRKSAPPAITTPCSSKSLRSG 587
S K A S +++ G
Sbjct: 133 PGPSGKGANLPCPQKAGSAAVQVG 156
Score = 29.5 bits (65), Expect = 8.1
Identities = 23/81 (28%), Positives = 34/81 (41%), Gaps = 12/81 (14%)
Query: 514 PCDMTP---PPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKS 570
P MT P+ QT S + TPTS+K PP TP+ KA +
Sbjct: 233 PAAMTAAQAKPALKTPQTKASPRKGTPITPTSAKVPPVRVG--------TPAPRKAGAVT 284
Query: 571 APPAITTPCSSK-SLRSGSDS 590
+P ++P ++ + R DS
Sbjct: 285 SPACASSPALARGTQRPDEDS 305
>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 226
Score = 29.1 bits (65), Expect = 6.1
Identities = 10/67 (14%), Positives = 18/67 (26%), Gaps = 3/67 (4%)
Query: 518 TPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAIT- 576
PP ++ A + P PP + + +P + K + P
Sbjct: 69 QPPEGVAQEIQDAGDAAAASVDPQPVAQPPVESTPAG--VPVAAQTPKPVKPPKQPPAGA 126
Query: 577 TPCSSKS 583
P
Sbjct: 127 VPAKPTP 133
>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
Length = 171
Score = 29.0 bits (65), Expect = 6.2
Identities = 17/77 (22%), Positives = 28/77 (36%), Gaps = 3/77 (3%)
Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAI-TT 577
P + +T P S A TP K + P ++ K+ P T
Sbjct: 63 PRLAEGNFRTMPVQTVTATTPSGSRPAVTATPEQ-KLLAPRPLFTAAREVKTVVPVSSVT 121
Query: 578 PCS-SKSLRSGSDSQQL 593
P + + LR+G + +L
Sbjct: 122 PVTPPRPLRTGEQTARL 138
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 29.9 bits (67), Expect = 6.3
Identities = 15/68 (22%), Positives = 27/68 (39%), Gaps = 3/68 (4%)
Query: 518 TPPPSSSKGQTTPSSKARQARTPTSSKA---PPTTPSSSKTILPATPSSSKASRKSAPPA 574
T P +++ P + +P + PPT+PS + + PSS+ K A
Sbjct: 477 TAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSA 536
Query: 575 ITTPCSSK 582
T + +
Sbjct: 537 PPTALADE 544
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 29.5 bits (66), Expect = 6.4
Identities = 10/54 (18%), Positives = 18/54 (33%)
Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAP 572
P P S P A+ A T + AP ++ P + + ++
Sbjct: 19 PAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPASL 72
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 29.5 bits (67), Expect = 6.5
Identities = 13/78 (16%), Positives = 19/78 (24%), Gaps = 6/78 (7%)
Query: 493 SAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSS 552
K P P + P A QA + ++ P +
Sbjct: 524 PPPPQKSPPPPAPTP------PLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPP 577
Query: 553 SKTILPATPSSSKASRKS 570
TPS +K S
Sbjct: 578 PPIPEEPTPSPTKDSSPE 595
>gnl|CDD|227578 COG5253, MSS4, Phosphatidylinositol-4-phosphate 5-kinase [Signal
transduction mechanisms].
Length = 612
Score = 29.5 bits (66), Expect = 6.7
Identities = 16/74 (21%), Positives = 31/74 (41%), Gaps = 2/74 (2%)
Query: 517 MTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAIT 576
++ PS P S++ ++ PTS +P P S + ++ + + P +
Sbjct: 175 LSSQPSRKPTSENPKSESDNSKLPTSVNSP--LPDKSLLKRTLSNFWAERNSYNWKPLVY 232
Query: 577 TPCSSKSLRSGSDS 590
C S+ + S SD
Sbjct: 233 PSCPSEHIFSDSDV 246
>gnl|CDD|223023 PHA03249, PHA03249, DNA packaging tegument protein UL25;
Provisional.
Length = 653
Score = 29.6 bits (66), Expect = 7.1
Identities = 16/52 (30%), Positives = 29/52 (55%), Gaps = 2/52 (3%)
Query: 526 GQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITT 577
G+ + +R R T+ P + ++ +PATPSS K+ + S+PP+I +
Sbjct: 86 GRNKATKPSR--RNKTTQCRPTSLALATAATMPATPSSGKSPKVSSPPSIPS 135
>gnl|CDD|240626 cd05301, GDH, D-glycerate dehydrogenase/hydroxypyruvate reductase
(GDH). D-glycerate dehydrogenase (GDH, also known as
hydroxypyruvate reductase, HPR) catalyzes the reversible
reaction of (R)-glycerate + NAD+ to hydroxypyruvate +
NADH + H+. In humans, HPR deficiency causes primary
hyperoxaluria type 2, characterized by over-excretion of
L-glycerate and oxalate in the urine, possibly due to an
imbalance in competition with L-lactate dehydrogenase,
another formate dehydrogenase (FDH)-like enzyme. GDH,
like FDH and other members of the D-specific hydroxyacid
dehydrogenase family that also includes L-alanine
dehydrogenase and S-adenosylhomocysteine hydrolase,
typically have a characteristic arrangement of 2 similar
subdomains of the alpha/beta Rossmann-fold NAD+ binding
form, despite often low sequence identity. The NAD+
binding domain is inserted within the linear sequence of
the mostly N-terminal catalytic domain, which has a
similar domain structure to the internal NAD binding
domain. Structurally, these domains are connected by
extended alpha helices and create a cleft in which NAD
is bound, primarily to the C-terminal portion of the 2nd
(internal) domain. Some related proteins have similar
structural subdomain but with a tandem arrangement of
the catalytic and NAD-binding subdomains in the linear
sequence. While many members of this family are dimeric,
alanine DH is hexameric and phosphoglycerate DH is
tetrameric.
Length = 309
Score = 29.3 bits (67), Expect = 7.7
Identities = 12/34 (35%), Positives = 15/34 (44%), Gaps = 3/34 (8%)
Query: 31 KRLRQDEEALAAERKPLSELITR---VSLSSPGS 61
R + EE L A L EL+ VSL P +
Sbjct: 176 SRKPEAEEELGARYVSLDELLAESDFVSLHCPLT 209
>gnl|CDD|227725 COG5438, COG5438, Predicted multitransmembrane protein [Function
unknown].
Length = 385
Score = 29.2 bits (66), Expect = 7.7
Identities = 20/71 (28%), Positives = 29/71 (40%), Gaps = 11/71 (15%)
Query: 275 YLSTNANIVPKQQL---QLIGV-AALM-----IAAKVEEIYP--PKVSDYAYVTDGACSS 323
YLS N I +IG A+M IA+ + E++ P ++ G
Sbjct: 244 YLSKNVGINFSGLFLAGVIIGSLGAIMDVAISIASSINELHEQNPNITVKELFKSGMNIG 303
Query: 324 QDILSTEMNIL 334
QDI+ T N L
Sbjct: 304 QDIMGTMTNTL 314
>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B
N-terminal. This is the eukaryotic DNA polymerase alpha
subunit B N-terminal domain which is involved in complex
formation. Also see pfam04058.
Length = 239
Score = 28.9 bits (65), Expect = 7.8
Identities = 20/125 (16%), Positives = 43/125 (34%), Gaps = 9/125 (7%)
Query: 473 AYSYHKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCD----MTPPPSSSKGQT 528
++S ++ +L+ + + + L+ + + + S T
Sbjct: 34 SFSTNRGLEDTDLTLENLRKFEEELQSQLEKRVRTPASIKTSKRLIEVPEAEESLLDSYT 93
Query: 529 TPSSKARQARTPTSSKAP--PTTPSSSKTI---LPATPSSSKASRKSAPPAITTPCSSKS 583
TPS K R ++ + P + S+S L +P+S S + P+ + S
Sbjct: 94 TPSDKGGMLRILSTPELPKRKRSFSASSLESPSLFFSPASFSPSAAPSTPSPNSAKFSSR 153
Query: 584 LRSGS 588
G
Sbjct: 154 SNPGE 158
>gnl|CDD|236886 PRK11249, katE, hydroperoxidase II; Provisional.
Length = 752
Score = 29.2 bits (66), Expect = 8.2
Identities = 10/34 (29%), Positives = 13/34 (38%)
Query: 516 DMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTT 549
+ P S + P+ Q P S KAP T
Sbjct: 25 SLAPEDGSHRPAPEPTPPGAQPTAPGSLKAPDTR 58
>gnl|CDD|165528 PHA03270, PHA03270, envelope glycoprotein C; Provisional.
Length = 466
Score = 29.1 bits (65), Expect = 9.2
Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 2/60 (3%)
Query: 517 MTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSS--KTILPATPSSSKASRKSAPPA 574
+ + +G + +S+A + +P S++ P TTP+ + K S K+ +PPA
Sbjct: 19 LCAGAGAPRGAVSNASEAPTSGSPGSAEGPRTTPTPTRGKGTPTGPASPPKSGPPKSPPA 78
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 29.2 bits (65), Expect = 9.3
Identities = 15/62 (24%), Positives = 21/62 (33%)
Query: 514 PCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPP 573
+ P PS + + P AR AP P+ + A P SS S P
Sbjct: 211 AEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGDSGPYVTP 270
Query: 574 AI 575
+
Sbjct: 271 LV 272
>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
Length = 417
Score = 29.0 bits (65), Expect = 9.7
Identities = 13/56 (23%), Positives = 21/56 (37%), Gaps = 3/56 (5%)
Query: 523 SSKGQTTPSSKARQARTPTSSK--APPTTPSSSKTILPATPSSSKASRKSAPPAIT 576
S+ + + R P + P SS +P+ P S K+ PA+T
Sbjct: 99 SAPTKKKKAMPKSVVRAPKPLENPVPAQAESSGSKPVPSIP-VSTPEVKAPAPALT 153
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.132 0.395
Gapped
Lambda K H
0.267 0.0783 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 35,437,736
Number of extensions: 3390375
Number of successful extensions: 4260
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3834
Number of HSP's successfully gapped: 254
Length of query: 712
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 608
Effective length of database: 6,324,786
Effective search space: 3845469888
Effective search space used: 3845469888
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (27.9 bits)