RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy834
(926 letters)
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 58.6 bits (142), Expect = 1e-08
Identities = 58/372 (15%), Positives = 101/372 (27%), Gaps = 41/372 (11%)
Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVN-----------PNPNSRANPNPNPNSRA 612
P T P ++ S LS P P A+P P+P
Sbjct: 75 PGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDL 134
Query: 613 NPNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSR 672
+ L P ++ + + V S + Q++L + S E+ + S
Sbjct: 135 SEMLRPVGS---PGPPPAASPPAAGASPAAVASDAASSR--QAALPLSSPEETARAPSSP 189
Query: 673 VQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRV 732
SP R ++ + +P +
Sbjct: 190 PAEPPPSTPPAAASPRPPRR-------------------SSPISASASSPAPAPGRSAAD 230
Query: 733 QSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAE 792
+ ++ S + + R + S + ++
Sbjct: 231 DAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPG---PASS 287
Query: 793 QGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAE 852
SP RS + S S S+ S S+ + S ++ + +
Sbjct: 288 SSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSP 347
Query: 853 QRSPNTRSRVQSSATTRSTEQRSRS-RSPLTRSTEQGSPNIRSTE-QRASRTRSADQASQ 910
RSP + SR A S +R R R+P + + G P R A R R D +
Sbjct: 348 SRSP-SPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGR 406
Query: 911 SVVTRSRSKLPS 922
R R
Sbjct: 407 FPAGRPRPSPLD 418
Score = 57.1 bits (138), Expect = 4e-08
Identities = 54/356 (15%), Positives = 98/356 (27%), Gaps = 28/356 (7%)
Query: 568 PPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNSAQ 627
PP P +P P+P ++ +P P P + + A + A
Sbjct: 108 PPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAA 167
Query: 628 NPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSP 687
+ + ++ + + S SP R + S+ +P
Sbjct: 168 SSRQAALPLSSPEETARAPSSPPAEPPPS---TPPAAASPRPPRRSSPISASASSPAPAP 224
Query: 688 ISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPT 747
S+ S ++++ P E + + S +
Sbjct: 225 GRSAADDA----GASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS 280
Query: 748 TRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPN 807
SS + R R S +P++ SS++ R + S S S R
Sbjct: 281 RPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAA 340
Query: 808 TRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSAT 867
G +RS P P S+ ++ P S A
Sbjct: 341 VSP-------------GPSPSRSPSPSRP-------PPPADPSSPRKRPRPSRAPSSPAA 380
Query: 868 TRSTEQRSRSR-SPLTRSTEQGSPNIRSTEQRASRTRSADQASQSVVTRSRSKLPS 922
+ R R+R + R+ + + + A AS + R PS
Sbjct: 381 SAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPS 436
Score = 49.8 bits (119), Expect = 8e-06
Identities = 55/310 (17%), Positives = 94/310 (30%), Gaps = 11/310 (3%)
Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
P +P P G PP P + +P + + + + A+A
Sbjct: 127 PPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAP 186
Query: 624 NSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAE 683
+S P + R + S+ + A S + SS + S
Sbjct: 187 SSPPAEPPPSTPPAAASPR--PPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSES 244
Query: 684 QGS---PISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRS 740
G P + + + + + + N + P P SSS R R S +
Sbjct: 245 SGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSS-PRERSPSPSPSSP 303
Query: 741 RVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRS 800
+P++ SS ++R SS + S + RS P +
Sbjct: 304 GSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSP---GPSPSRSPSPSRPPPPA 360
Query: 801 AVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTV-QGAPRIHSREQSSPNTHSAEQRSPNTR 859
SP R R + ++ +A G PT R A R R+ + + SP
Sbjct: 361 DP-SSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDA 419
Query: 860 SRVQSSATTR 869
+ R
Sbjct: 420 GAASGAFYAR 429
Score = 46.3 bits (110), Expect = 9e-05
Identities = 46/267 (17%), Positives = 81/267 (30%), Gaps = 21/267 (7%)
Query: 563 RPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQA 622
R + PP P S SP P + P S + +P P + A +
Sbjct: 184 RAPSSPPAEPPPSTPPAAASPRPPRRSS-------PISASASSPAPAPGRSAADDAGASS 236
Query: 623 NNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSA 682
++S+ + P + + S + SS RS
Sbjct: 237 SDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSP 296
Query: 683 EQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRV 742
+ + S+ S ++++S+ + E S +RS
Sbjct: 297 SPSPSSPGSGPAP-SSPRASSSSSSSRESSSSSTSSSSESSRGAAVSP---GPSPSRSPS 352
Query: 743 QSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRSAV 802
S P + SSP R R + + +A G P TR R +++ R+ + +
Sbjct: 353 PSRPPPPADP-SSPRKRPRPSRAPSSPAASAGRP-TRRRARAAVAGRARRRDATGRF--- 407
Query: 803 QRSPNTRSRVQSSLTTHSAEQGSPTTR 829
R + S A G+ R
Sbjct: 408 -----PAGRPRPSPLDAGAASGAFYAR 429
Score = 39.0 bits (91), Expect = 0.015
Identities = 52/361 (14%), Positives = 88/361 (24%), Gaps = 26/361 (7%)
Query: 568 PPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNSAQ 627
D SG+QG+L + A P +A +
Sbjct: 30 DAADDLLSGSQGQLVSD-----SAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANES 84
Query: 628 NPLPEVSSSPNT--RSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQG 685
P S S + SPT + SP S +R
Sbjct: 85 RSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSP 144
Query: 686 SPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSS 745
P + A +S Q LP S R+ +
Sbjct: 145 GPPPAASPPAAGASPAAV----ASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPA 200
Query: 746 PTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPS-IRSAVQR 804
+ + S+ + A S + SS + S G +
Sbjct: 201 AASPR---PPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPL 257
Query: 805 SPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQS 864
+ + + S G + + SSP S + S +
Sbjct: 258 PRPAPITLPTRIWEASGWNGPSSRPG--------PASSSSSPRERSPSPSPSSPGSG-PA 308
Query: 865 SATTRSTEQRSRSRSPLTRSTEQGSPNIRSTEQRASRTRSADQASQSVVTRSRSKLPSKK 924
++ R++ S SR + ST S + R + S + + S +
Sbjct: 309 PSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSR--PPPPADPSSPR 366
Query: 925 K 925
K
Sbjct: 367 K 367
Score = 31.7 bits (72), Expect = 2.2
Identities = 30/192 (15%), Positives = 56/192 (29%), Gaps = 12/192 (6%)
Query: 560 QQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRR 619
+ PL P + P P + + +P+P+P+S +
Sbjct: 252 ENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSS 311
Query: 620 AQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANI 679
+A++S+ + SSS ++ S + SR A+
Sbjct: 312 PRASSSSSSSRESSSSSTSSSSESSRGAAVSP---------GPSPSRSPSPSRPPPPADP 362
Query: 680 RSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTR 739
S + SR S RRA+A + + ++ R + SP
Sbjct: 363 SSPRKRPRPSRA--PSSPAASAGRPTRRRARAAVAGRA-RRRDATGRFPAGRPRPSPLDA 419
Query: 740 SRVQSSPTTRSR 751
+ R
Sbjct: 420 GAASGAFYARYP 431
Score = 29.8 bits (67), Expect = 9.2
Identities = 17/106 (16%), Positives = 28/106 (26%), Gaps = 1/106 (0%)
Query: 566 TPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNS 625
+ D S + G PP P + + +R N RR +
Sbjct: 821 SRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAAAGGRARGK-NGRRRPRPPEP 879
Query: 626 AQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRS 671
P + +P R R + + G P+ R
Sbjct: 880 RARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDPRG 925
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 57.0 bits (138), Expect = 4e-08
Identities = 29/161 (18%), Positives = 57/161 (35%), Gaps = 45/161 (27%)
Query: 395 QLNKEQQRLAPLLDRLRRYEADLKALYKRRWILQKKFDKVKHTRRQKFENMYDFVNDNID 454
+ + ++R L + E + L + +++ +R++F+ +D +N+N
Sbjct: 971 EYEEVEERYEELKSQREDLEEAKEKL-------LEVIEELDKEKRERFKETFDKINENFS 1023
Query: 455 ATYKVHLKSSACQVYLIVSGWPKKKENLVIVVQSNLFSPQALTQSFGHAFLDITGAEQEP 514
+K ++ L +P
Sbjct: 1024 EIFKELFGGGTAELELTEPD--------------------------------------DP 1045
Query: 515 YLAGMNFSCTPQGRSYVGIQHLSGGERTLAALAMIFTIWKL 555
AG+ S P G+ + LSGGE++L ALA++F I K
Sbjct: 1046 LTAGIEISARPPGKKLQSLSLLSGGEKSLTALALLFAIQKY 1086
>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
Rho/Rac/Cdc42-like GTPases [Signal transduction
mechanisms].
Length = 1175
Score = 52.2 bits (125), Expect = 1e-06
Identities = 51/261 (19%), Positives = 87/261 (33%), Gaps = 34/261 (13%)
Query: 568 PPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPN---------PNPNSRANPNLSR 618
Q P Q +L+P + N NS + + + ++P L +
Sbjct: 28 SKQLLPPRRLQRKLNPISIRNGA---DNDIINSESKESFGKYALGHQIFSSFSSSPKLFQ 84
Query: 619 RAQA--NNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSS 676
R + + +SS N+ Q SP + S + + QS + P S VQ
Sbjct: 85 RRNSAGPITHSPSATSSTSSLNSNDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQKR 144
Query: 677 ANI-----RSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSR 731
N + PI T + + + + S LP SP+ R +
Sbjct: 145 KNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSMQLP----SPHFRQK 200
Query: 732 VQSSPTTRSRVQSSPTTRSR-VQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRS 790
SS T S S P+ R SS + S SS + + S + S+
Sbjct: 201 FSSSDT--SNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNI---- 254
Query: 791 AEQGSPSIRSAVQRSPNTRSR 811
+PS ++ S +++
Sbjct: 255 ----TPSSSNSEAMSTSSKRP 271
Score = 44.9 bits (106), Expect = 2e-04
Identities = 57/295 (19%), Positives = 105/295 (35%), Gaps = 27/295 (9%)
Query: 639 TRSRVQSSPTTRSRVQSSL---TIQSAEQ-GSPNTRSRVQSSANIRSAEQGSPISRTRFT 694
+ + V R+Q L +I++ N+ S+ + S
Sbjct: 23 SDAFVSKQLLPPRRLQRKLNPISIRNGADNDIINSESKESFGKYALGHQIFS-------- 74
Query: 695 KYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQS 754
S K R + + +SS N+ Q SP + S + +T+SR S
Sbjct: 75 -SFSSSPKLFQRRNSAGPITHSPSATSSTSSLNSNDGDQFSPASDSLSFNPSSTQSRKDS 133
Query: 755 SPTTRSRVQSS---LTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTRSR 811
P S VQ L S+ G+ ++ + G+P+ RS + P+ S+
Sbjct: 134 GPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNG----SHAGAPNARS-RKEIPSLGSQ 188
Query: 812 VQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSATTRST 871
+ H ++ S + S P I + S N+ + P++ + V + S+
Sbjct: 189 SMQLPSPHFRQKFSSSDTSNGFSYPSIRKNSRHSSNSMPS---FPHSSTAVLLKRHSGSS 245
Query: 872 EQRSRSRSPLTRSTEQGSPNIRSTEQRASRTRSADQASQSVVTRSRSKLPSKKKD 926
S S +T S+ S+++ A + +V + R +L KKD
Sbjct: 246 -GASLISSNITPSSSNSEAMSTSSKRP--YIYPALLSRVAVEFKMRLQLGDHKKD 297
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 51.9 bits (124), Expect = 2e-06
Identities = 61/364 (16%), Positives = 102/364 (28%), Gaps = 16/364 (4%)
Query: 557 VRVQQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNL 616
V + RP PP + P + GP P S + P+ ++ P P+P+ AN
Sbjct: 2582 VTSRARRPDAPPQSARPR--APVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPD 2639
Query: 617 SRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSS 676
+ P + + +R R R+ SS + + + T + S
Sbjct: 2640 PHPPPTVPPPERPRDDPAPGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSL 2698
Query: 677 ANIRSAE---QGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQ 733
A+ + +P + T + + A A +P ++P +R
Sbjct: 2699 ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPA 2758
Query: 734 SSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQ 793
PTT +P +SL+ SP + ++ +A
Sbjct: 2759 RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
Query: 794 GSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQ 853
+ + P + P S G SP A
Sbjct: 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP 2878
Query: 854 RSPNTRSRVQSSATTRSTE---------QRSRSRSPLTRSTEQGSPNIRSTEQRASRTRS 904
P R R+ A +RSTE +R Q P Q
Sbjct: 2879 ARPPVR-RLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937
Query: 905 ADQA 908
Q
Sbjct: 2938 RPQP 2941
Score = 41.8 bits (98), Expect = 0.002
Identities = 55/344 (15%), Positives = 83/344 (24%), Gaps = 20/344 (5%)
Query: 563 RPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQA 622
R S P + + PT G P P P P+ A P A A
Sbjct: 2669 RLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAA 2728
Query: 623 NNS--------AQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQ 674
+ A +P ++P +R PTT + A
Sbjct: 2729 RQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAV 2788
Query: 675 SSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQS 734
+S + SP L + + A + ++ P
Sbjct: 2789 ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP 2848
Query: 735 SPTTRSRV------QSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANI 788
S V + P +RS + P +R + S +T S
Sbjct: 2849 SLPLGGSVAPGGDVRRRPPSRSPA-AKPAAPARPPVRRL--ARPAVSRSTESFALPPDQP 2905
Query: 789 RSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNT 848
Q Q P + Q Q + GA
Sbjct: 2906 ERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG 2965
Query: 849 HSAEQRSPNTRSRVQSSATTRSTEQRS---RSRSPLTRSTEQGS 889
R R RV A +R S + L+R + S
Sbjct: 2966 ALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWAS 3009
Score = 35.3 bits (81), Expect = 0.19
Identities = 22/102 (21%), Positives = 32/102 (31%), Gaps = 6/102 (5%)
Query: 560 QQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRR 619
Q RP P P+ Q PP P P P P P P +P P+ +
Sbjct: 2904 QPERPPQPQAPPPPQPQPQ---PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP 2960
Query: 620 AQANNS---AQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLT 658
+ + +P SR + +T SL+
Sbjct: 2961 QPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLS 3002
Score = 32.2 bits (73), Expect = 1.9
Identities = 15/77 (19%), Positives = 23/77 (29%), Gaps = 8/77 (10%)
Query: 560 QQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRR 619
Q RP P P S P P P P P P++ + + A P +
Sbjct: 403 DQTRPAAPVPASVPTPA-----PTPVPASAPPPPATPLPSAEP---GSDDGPAPPPERQP 454
Query: 620 AQANNSAQNPLPEVSSS 636
P+ ++
Sbjct: 455 PAPATEPAPDDPDDATR 471
Score = 31.4 bits (71), Expect = 2.7
Identities = 28/150 (18%), Positives = 42/150 (28%), Gaps = 8/150 (5%)
Query: 563 RPLTPPPQSDPESGTQ---GELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRR 619
RP + P + P + + L+ P + S P P P P
Sbjct: 2865 RPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPP 2924
Query: 620 AQANNSAQ---NPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSS 676
P P+ +P T P+ + P R RV
Sbjct: 2925 PPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVP--RFRVPQP 2982
Query: 677 ANIRSAEQGSPISRTRFTKYTLQSTKSNLS 706
A R A S T + + S S+L+
Sbjct: 2983 APSREAPASSTPPLTGHSLSRVSSWASSLA 3012
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains separated
by a hinge in the middle. The eukaryotic SMC proteins
form two kind of heterodimers: the SMC1/SMC3 and the
SMC2/SMC4 types. These heterodimers constitute an
essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 46.5 bits (110), Expect = 7e-05
Identities = 18/40 (45%), Positives = 26/40 (65%)
Query: 513 EPYLAGMNFSCTPQGRSYVGIQHLSGGERTLAALAMIFTI 552
+P+ G+ S P G+ + +LSGGE+TL ALA+IF I
Sbjct: 1056 DPFSGGIEISARPPGKGVKNLDNLSGGEKTLVALALIFAI 1095
>gnl|CDD|218658 pfam05616, Neisseria_TspB, Neisseria meningitidis TspB protein.
This family consists of several Neisseria meningitidis
TspB virulence factor proteins.
Length = 502
Score = 44.6 bits (105), Expect = 2e-04
Identities = 24/84 (28%), Positives = 31/84 (36%), Gaps = 3/84 (3%)
Query: 567 PPPQSDPESGTQGELSP---PTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
P P P S E P +P NP + NP N PNP P+ NP+ +
Sbjct: 315 PRPDLTPGSAEAPEAQPLPEVSPAENPANNPNPRENPGTRPNPEPDPDLNPDANPDTDGQ 374
Query: 624 NSAQNPLPEVSSSPNTRSRVQSSP 647
+ P V PN R R +
Sbjct: 375 PGTRPDSPAVPDRPNGRHRKERKE 398
Score = 36.5 bits (84), Expect = 0.059
Identities = 22/67 (32%), Positives = 29/67 (43%), Gaps = 8/67 (11%)
Query: 249 RP-LTPPPQSDPESGTQGELSP-------PTPGPNPNSRANPNPNPNSRANPNLSRRAQA 300
RP LTP PE+ E+SP P P NP +R NP P+P+ + N Q
Sbjct: 316 RPDLTPGSAEAPEAQPLPEVSPAENPANNPNPRENPGTRPNPEPDPDLNPDANPDTDGQP 375
Query: 301 NNSAQNP 307
+P
Sbjct: 376 GTRPDSP 382
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily
archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and C-terminal
domains of this protein are well conserved, but the
central hinge region is skewed in composition and highly
divergent [Cellular processes, Cell division, DNA
metabolism, Chromosome-associated proteins].
Length = 1164
Score = 44.7 bits (106), Expect = 2e-04
Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 7/51 (13%)
Query: 514 PYLAGMNFSCTPQGRSYVGIQHLSGGERTLAALAMIFTIWKLVVRVQQLRP 564
P+ G+ S P+G+ ++ +SGGE++L AL+ IF I Q+ +P
Sbjct: 1053 PFAGGLELSAKPKGKPVQRLEAMSGGEKSLTALSFIFAI-------QRYKP 1096
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 39.4 bits (92), Expect = 0.008
Identities = 28/141 (19%), Positives = 42/141 (29%), Gaps = 25/141 (17%)
Query: 555 LVVRVQQLRPLTP-PPQSDPESGTQGELSPPTPGPNPHSRVNP---------------NP 598
L + QL+P P P G G+ PP G P + N P
Sbjct: 371 LQDQFMQLQPRMRQLPMGSPMGGAMGQ--PPYYGQGPQQQFNGQPLGWPRMSMMPTPMGP 428
Query: 599 NSRANPN---PNPNSRANPNLSRRAQANNSAQNPLPEVSSS---PNTRSRVQSSPTTRSR 652
PN P RA ++ A P+ + P ++ Q T
Sbjct: 429 GGPLRPNGLAPMNAVRAPSRNAQNAAQKPP-MQPVMYPPNYQSLPLSQDLPQPQSTASQG 487
Query: 653 VQSSLTIQSAEQGSPNTRSRV 673
Q+ Q +P + +V
Sbjct: 488 GQNKKLAQVLASATPQMQKQV 508
>gnl|CDD|213245 cd03278, ABC_SMC_barmotin, ATP-binding cassette domain of barmotin,
a member of the SMC protein family. Barmotin is a tight
junction-associated protein expressed in rat epithelial
cells which is thought to have an important regulatory
role in tight junction barrier function. Barmotin
belongs to the SMC protein family. SMC proteins are
large (approximately 110 to 170 kDa), and each is
arranged into five recognizable domains. Amino-acid
sequence homology of SMC proteins between species is
largely confined to the amino- and carboxy-terminal
globular domains. The amino-terminal domain contains a
'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the
single-letter amino-acid code), which by mutational
studies has been shown to be essential in several
proteins. The carboxy-terminal domain contains a
sequence (the DA-box) that resembles a 'Walker B' motif,
and a motif with homology to the signature sequence of
the ATP-binding cassette (ABC) family of ATPases. The
sequence homology within the carboxy-terminal domain is
relatively high within the SMC1-SMC4 group, whereas SMC5
and SMC6 show some divergence in both of these
sequences. In eukaryotic cells, the proteins are found
as heterodimers of SMC1 paired with SMC3, SMC2 with
SMC4, and SMC5 with SMC6 (formerly known as Rad18).
Length = 197
Score = 37.8 bits (89), Expect = 0.010
Identities = 13/29 (44%), Positives = 18/29 (62%), Gaps = 7/29 (24%)
Query: 536 LSGGERTLAALAMIFTIWKLVVRVQQLRP 564
LSGGE+ L ALA++F I ++RP
Sbjct: 114 LSGGEKALTALALLFAI-------FRVRP 135
>gnl|CDD|213242 cd03275, ABC_SMC1_euk, ATP-binding cassette domain of eukaryotic
SMC1 proteins. The structural maintenance of
chromosomes (SMC) proteins are large (approximately 110
to 170 kDa), and each is arranged into five recognizable
domains. Amino-acid sequence homology of SMC proteins
between species is largely confined to the amino- and
carboxy-terminal globular domains. The amino-terminal
domain contains a 'Walker A' nucleotide-binding domain
(GxxGxGKS/T, in the single-letter amino-acid code),
which by mutational studies has been shown to be
essential in several proteins. The carboxy-terminal
domain contains a sequence (the DA-box) that resembles a
'Walker B' motif, and a motif with homology to the
signature sequence of the ATP-binding cassette (ABC)
family of ATPases. The sequence homology within the
carboxy-terminal domain is relatively high within the
SMC1-SMC4 group, whereas SMC5 and SMC6 show some
divergence in both of these sequences. In eukaryotic
cells, the proteins are found as heterodimers of SMC1
paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6
(formerly known as Rad18).
Length = 247
Score = 38.3 bits (90), Expect = 0.012
Identities = 14/29 (48%), Positives = 22/29 (75%)
Query: 524 TPQGRSYVGIQHLSGGERTLAALAMIFTI 552
P G+ + + +LSGGE+T+AALA++F I
Sbjct: 144 NPPGKRFRDMDNLSGGEKTMAALALLFAI 172
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 38.0 bits (88), Expect = 0.022
Identities = 39/185 (21%), Positives = 64/185 (34%), Gaps = 23/185 (12%)
Query: 563 RPLTPPPQSDPESGTQGELSPPT-PGPNPHSRVNPNPNS--------RANPNPNPNSRAN 613
RPL+P + +L PPT P P P S + S + P+P S +N
Sbjct: 369 RPLSPYTAYE-------DLKPPTSPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASN 421
Query: 614 PNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRV 673
AQ PL + + + SPT + V S++ S+ P+T
Sbjct: 422 VPEVEPAQVEAKKTRPLSPYARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDTAP-- 479
Query: 674 QSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQ 733
++A +A R + + +L + +A SS+
Sbjct: 480 -ATAATDAAAPPPANMR----PLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGN 534
Query: 734 SSPTT 738
S+P T
Sbjct: 535 SAPPT 539
>gnl|CDD|213206 cd03239, ABC_SMC_head, The SMC head domain belongs to the
ATP-binding cassette superfamily. The structural
maintenance of chromosomes (SMC) proteins are essential
for successful chromosome transmission during
replication and segregation of the genome in all
organisms. SMCs are generally present as single proteins
in bacteria, and as at least six distinct proteins in
eukaryotes. The proteins range in size from
approximately 110 to 170 kDa, and each has five distinct
domains: amino- and carboxy-terminal globular domains,
which contain sequences characteristic of ATPases, two
coiled-coil regions separating the terminal domains ,
and a central flexible hinge. SMC proteins function
together with other proteins in a range of chromosomal
transactions, including chromosome condensation,
sister-chromatid cohesion, recombination, DNA repair,
and epigenetic silencing of gene expression.
Length = 178
Score = 36.9 bits (86), Expect = 0.022
Identities = 14/31 (45%), Positives = 22/31 (70%), Gaps = 3/31 (9%)
Query: 525 PQGRSYVGIQHLSGGERTLAALAMIFTIWKL 555
QG+ Q LSGGE++L+ALA+IF + ++
Sbjct: 87 LQGKVE---QILSGGEKSLSALALIFALQEI 114
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 37.2 bits (87), Expect = 0.045
Identities = 43/244 (17%), Positives = 67/244 (27%), Gaps = 31/244 (12%)
Query: 560 QQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRR 619
TP P P +P T P S P P + +P
Sbjct: 372 SAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPAN 431
Query: 620 AQANNSAQNPLPEVSSSPNTRSRVQSS---PTTRSRVQS-----SLTIQSAE-QGSPNTR 670
A + N L E+ ++ + P+TR + SL A SPN
Sbjct: 432 AANAPPSLN-LEEL------WQQILAKLELPSTRMLLSQQAELVSLDSNRAVIAVSPNWL 484
Query: 671 SRVQSSAN-IRSAEQ---GSPISRTRFTKYTLQSTKSNLSRRAQ----ANNSAQNPLPEV 722
VQS + A G I K L+S + S A+ S P P
Sbjct: 485 GMVQSRKPLLEQAFAKVLGRSI------KLNLESQSGSASNTAKTPPPPQKSPPPPAPTP 538
Query: 723 SSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRV 782
T + +P + ++ P S + ++ +
Sbjct: 539 PLPQPTATAPPPTPPPPPPTATQASSN-APAQIPADSSPPPPIPEEPTPSPTKDSSPEEI 597
Query: 783 QSSA 786
+A
Sbjct: 598 DKAA 601
Score = 29.9 bits (68), Expect = 8.4
Identities = 12/78 (15%), Positives = 22/78 (28%), Gaps = 3/78 (3%)
Query: 554 KLVVRVQQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPN---PNS 610
P P PQ + PP S + ++P P +
Sbjct: 526 PPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPT 585
Query: 611 RANPNLSRRAQANNSAQN 628
+ S + + +A+N
Sbjct: 586 PSPTKDSSPEEIDKAAKN 603
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 37.0 bits (85), Expect = 0.061
Identities = 68/377 (18%), Positives = 130/377 (34%), Gaps = 47/377 (12%)
Query: 567 PPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRA--QANN 624
P SG +PPT G S + + N N + A N + + N
Sbjct: 152 APHAQHNHSGKTNGDNPPTGGQYHQSGGTSRNHQMMDSNKNCPADALFNETNPSGEHKRN 211
Query: 625 SAQNPLPE---VSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGS------PNTRSRVQS 675
S +P + S PN VQ + + + +S T A + + N + R S
Sbjct: 212 SIDGDIPSDIYIDSQPN-EGDVQKTNPWQGKQGNSATSPPANENNAVTLSCSNDQQRGAS 270
Query: 676 SANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQ-- 733
SA AE G R S ++ + ++A NPL +++ R+ +Q
Sbjct: 271 SA----AESGYAHHRG--------SNIASHTPNDNIMHAANNPLN--NTNDAQRNAIQGD 316
Query: 734 ---SSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRV-QSSANIR 789
+P ++ ++ S +S G+ + Q + R
Sbjct: 317 LVRGAPNDKNSFDRGNEKTYQIYGGFHDGSPNAASAGAPFNGLGNQADGGHINQVHPDAR 376
Query: 790 SAEQGSP--------SIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSR 841
A G P + S +S +S S +S + AP ++
Sbjct: 377 GAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAGYS---NPGNSNPGYNNAPNSNTP 433
Query: 842 EQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSPNIRSTEQRASR 901
+ PN+++ PN+ + + + S +PL+ + + + S A +
Sbjct: 434 YNNPPNSNTPYSNPPNSNPPYSNLPYSNT----PYSNAPLSNAPPSSAKDHHSAYHAAYQ 489
Query: 902 TRSADQASQSVVTRSRS 918
R+A+Q + ++ T ++
Sbjct: 490 HRAANQPAANLPTANQP 506
Score = 33.1 bits (75), Expect = 0.94
Identities = 51/277 (18%), Positives = 104/277 (37%), Gaps = 14/277 (5%)
Query: 601 RANPNPNPN-SRANPNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTI 659
R PN + R N + Q + P +S+ + + + Q
Sbjct: 319 RGAPNDKNSFDRGN---EKTYQIYGGFHDGSPNAASAGAPFNGLGNQADGGHINQVHPDA 375
Query: 660 QSAEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPL 719
+ A G P++ + +A +A+ + S F+ + SN N+A N
Sbjct: 376 RGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFS----NAGYSNPGNSNPGYNNAPNSN 431
Query: 720 PEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTR 779
++ PN+ + + P + + P + + ++P + + S+ SA + R
Sbjct: 432 TPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHR 491
Query: 780 SRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAE-QGSPTTRSTVQGAPRI 838
+ Q +AN+ +A Q P+ + + N+ +S SA G+ T + G +
Sbjct: 492 AANQPAANLPTANQ--PAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNGIAKR 549
Query: 839 HSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRS 875
+ N EQ + V+SS++ S+E +
Sbjct: 550 EDHPEGGTNRQKYEQSDEES---VESSSSENSSENEN 583
Score = 30.4 bits (68), Expect = 6.1
Identities = 45/276 (16%), Positives = 91/276 (32%), Gaps = 33/276 (11%)
Query: 648 TTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSR 707
+ + + + ++ + + R SS N+ +A + +R + +
Sbjct: 63 ANENHINTRSSDNNSCENANMNRDANSSSENVGNANGNNYDNRAHVGR--AYGAGACEQS 120
Query: 708 RAQANNSAQNPLPEVSSSPN---TRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQS 764
Q+N +P+ + PN R R + + Q + + ++ + PT QS
Sbjct: 121 NQQSNIPIGDPVNHLRGHPNFGEPRERAEDAAP---HAQHNHSGKTNGDNPPTGGQYHQS 177
Query: 765 SLTIQSAEQGSPNTRSRVQSSAN--IRSAEQGSPSIRS---------------AVQRSPN 807
T ++ + N + N S E SI VQ++
Sbjct: 178 GGTSRNHQMMDSNKNCPADALFNETNPSGEHKRNSIDGDIPSDIYIDSQPNEGDVQKTNP 237
Query: 808 TRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSAT 867
+ + +S T+ A + + T S R S S H R N S +
Sbjct: 238 WQGKQGNSATSPPANENNAVTLSCSNDQQRGASSAAESGYAH---HRGSNIASHTPNDNI 294
Query: 868 TRSTEQRSRSRSPLTRSTEQ-----GSPNIRSTEQR 898
+ + + R+ Q G+PN +++ R
Sbjct: 295 MHAANNPLNNTNDAQRNAIQGDLVRGAPNDKNSFDR 330
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 36.7 bits (84), Expect = 0.062
Identities = 45/273 (16%), Positives = 83/273 (30%), Gaps = 11/273 (4%)
Query: 620 AQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANI 679
+ + + P +T V ++ T + + E SP +R+ +SA
Sbjct: 441 VEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRT---TSATP 497
Query: 680 RSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTR 739
+ ++ T T Q T S+ +S P + V + T+
Sbjct: 498 NATSPTPAVTTPNATSPTTQKT-SDTPNATSPTPIVIGVTTTATSPPTGTTSV-PNATSP 555
Query: 740 SRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIR 799
+ SP + + S + S++T GS T + I S+ +P
Sbjct: 556 QVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQ----PGIPSSSHSTPRSN 611
Query: 800 SAVQRSPNTRSRVQSSLTTHSAEQGSPTTR--STVQGAPRIHSREQSSPNTHSAEQRSPN 857
S T + P+T ST+ P + Q S +S+ R P
Sbjct: 612 STSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSSTSRYPG 671
Query: 858 TRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSP 890
+ +T + S T +
Sbjct: 672 EVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTG 704
Score = 36.7 bits (84), Expect = 0.065
Identities = 54/316 (17%), Positives = 97/316 (30%), Gaps = 22/316 (6%)
Query: 566 TPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNS 625
T P S +GT P P + + PN+ + A +++ +
Sbjct: 466 TADPTSGTPTGTTSSTLPEDTSPTSRT-TSATPNATSPTPAVTTPNATSPTTQKTSDTPN 524
Query: 626 AQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAE---QGSPNTRSRVQSSANIRSA 682
A +P P V T S PT + V ++ + Q E + NT + + + SA
Sbjct: 525 ATSPTPIVIGVTTT---ATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSA 581
Query: 683 EQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVS---SSPNTRSRVQSSPTTR 739
S+ ++N+++ PL + N S P+T
Sbjct: 582 VTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTT 641
Query: 740 SRVQSSPTTRSRV--QSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPS 797
SP Q S S +G PN + ++
Sbjct: 642 HVSTLSPGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNA---------TSPSAPSG 692
Query: 798 IRSAVQRSPNTRSRVQSSLTTHSAEQ-GSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSP 856
++AV +T + S+ S + T+ T +GA R ++ S +
Sbjct: 693 QKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLR 752
Query: 857 NTRSRVQSSATTRSTE 872
+ TT+
Sbjct: 753 PRWTFTSPPVTTKQAT 768
Score = 30.9 bits (69), Expect = 4.4
Identities = 38/254 (14%), Positives = 72/254 (28%), Gaps = 22/254 (8%)
Query: 677 ANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSP 736
AN + + I+RT T T + K + S V P+ + V ++P
Sbjct: 395 ANPVADAKTLIITRTA-TNATTTTHKVVFHKAPDTTKSVIFVYTLVHVEPHKTTAVPTTP 453
Query: 737 ------TTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRS 790
T + + PT+ + ++ +T S + ++ + + + ++ N S
Sbjct: 454 SLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATS 513
Query: 791 AEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGA---PRIHSREQSSPN 847
S V ++ T+ S ++ Q P ++ +
Sbjct: 514 PTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTS 573
Query: 848 THSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRST------------EQGSPNIRST 895
S + T S+ T S RS G NI
Sbjct: 574 APSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEE 633
Query: 896 EQRASRTRSADQAS 909
T S
Sbjct: 634 TPSVPSTTHVSTLS 647
>gnl|CDD|213239 cd03272, ABC_SMC3_euk, ATP-binding cassette domain of eukaryotic
SMC3 proteins. The structural maintenance of
chromosomes (SMC) proteins are large (approximately 110
to 170 kDa), and each is arranged into five recognizable
domains. Amino-acid sequence homology of SMC proteins
between species is largely confined to the amino- and
carboxy-terminal globular domains. The amino-terminal
domain contains a 'Walker A' nucleotide-binding domain
(GxxGxGKS/T, in the single-letter amino-acid code),
which by mutational studies has been shown to be
essential in several proteins. The carboxy-terminal
domain contains a sequence (the DA-box) that resembles a
'Walker B' motif, and a motif with homology to the
signature sequence of the ATP-binding cassette (ABC)
family of ATPases. The sequence homology within the
carboxy-terminal domain is relatively high within the
SMC1-SMC4 group, whereas SMC5 and SMC6 show some
divergence in both of these sequences. In eukaryotic
cells, the proteins are found as heterodimers of SMC1
paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6
(formerly known as Rad18).
Length = 243
Score = 35.7 bits (83), Expect = 0.072
Identities = 16/40 (40%), Positives = 22/40 (55%), Gaps = 10/40 (25%)
Query: 525 PQGR----------SYVGIQHLSGGERTLAALAMIFTIWK 554
PQG+ +Q LSGG+++L ALA+IF I K
Sbjct: 138 PQGKINSLTNMKQDEQQEMQQLSGGQKSLVALALIFAIQK 177
>gnl|CDD|213241 cd03274, ABC_SMC4_euk, ATP-binding cassette domain of eukaryotic
SMC4 proteins. The structural maintenance of
chromosomes (SMC) proteins are large (approximately 110
to 170 kDa), and each is arranged into five recognizable
domains. Amino-acid sequence homology of SMC proteins
between species is largely confined to the amino- and
carboxy-terminal globular domains. The amino-terminal
domain contains a 'Walker A' nucleotide-binding domain
(GxxGxGKS/T, in the single-letter amino-acid code),
which by mutational studies has been shown to be
essential in several proteins. The carboxy-terminal
domain contains a sequence (the DA-box) that resembles a
'Walker B' motif, and a motif with homology to the
signature sequence of the ATP-binding cassette (ABC)
family of ATPases. The sequence homology within the
carboxy-terminal domain is relatively high within the
SMC1-SMC4 group, whereas SMC5 and SMC6 show some
divergence in both of these sequences. In eukaryotic
cells, the proteins are found as heterodimers of SMC1
paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6
(formerly known as Rad18).
Length = 212
Score = 35.3 bits (82), Expect = 0.087
Identities = 12/23 (52%), Positives = 20/23 (86%)
Query: 528 RSYVGIQHLSGGERTLAALAMIF 550
+S+ I +LSGGE+TL++LA++F
Sbjct: 120 KSWKNISNLSGGEKTLSSLALVF 142
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 34.7 bits (80), Expect = 0.20
Identities = 22/131 (16%), Positives = 38/131 (29%), Gaps = 9/131 (6%)
Query: 566 TPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNS 625
P P + + T N + +P S + +P +S P+ + +S
Sbjct: 147 DPNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSS 206
Query: 626 AQNP-LPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQ 684
+ LP SS + + P S T S G S +
Sbjct: 207 PSDSSLPPAPSSFQS----DTPPP----SPESPTNPSPPPGPAAPPPPPVQQVPPLSTAK 258
Query: 685 GSPISRTRFTK 695
+P S +
Sbjct: 259 PTPPSASATPA 269
Score = 33.1 bits (76), Expect = 0.60
Identities = 17/76 (22%), Positives = 23/76 (30%)
Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
P P S PE + S P P+ P P+ + NP+P Q
Sbjct: 192 PGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQV 251
Query: 624 NSAQNPLPEVSSSPNT 639
P S+ T
Sbjct: 252 PPLSTAKPTPPSASAT 267
Score = 30.8 bits (70), Expect = 2.9
Identities = 20/99 (20%), Positives = 28/99 (28%), Gaps = 6/99 (6%)
Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
P S + + P P P + +S P P+ P S + N
Sbjct: 176 DADPASASPSDPPSSSPGVPSFPSPPEDP--SSPSDSSLPPAPSSFQSDTPPPSPESPTN 233
Query: 624 NSAQNPLPEVSSSPNTRSRVQSS---PTTRSRVQSSLTI 659
S P P P + S PT S + I
Sbjct: 234 PSPP-PGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPI 271
Score = 30.4 bits (69), Expect = 4.2
Identities = 9/84 (10%), Positives = 16/84 (19%)
Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
P S + P P S + P NP+ P
Sbjct: 190 SSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQ 249
Query: 624 NSAQNPLPEVSSSPNTRSRVQSSP 647
+ + + +
Sbjct: 250 QVPPLSTAKPTPPSASATPAPIGG 273
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc gene
is often associated with scpB (TIGR00281) and scpA genes,
where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 35.0 bits (81), Expect = 0.21
Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 1/55 (1%)
Query: 501 GHAFLDITGAEQEPYLAGMNFSCTPQGRSYVGIQHLSGGERTLAALAMIFTIWKL 555
G A L +T E + AG+ P G+ + LSGGE+ L ALA++F I+K+
Sbjct: 1056 GEAELRLTDPE-DLLEAGIEIFAQPPGKKNQNLSLLSGGEKALTALALLFAIFKV 1109
>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor. This family is secreted by
gram-negative Gammaproteobacteria such as Pseudomonas
syringae of tomato and the fire blight plant pathogen
Erwinia amylovora, amongst others. It is an essential
pathogenicity factor of approximately 198 kDa. Its
injection into the host-plant is dependent upon the
bacterial type III or Hrp secretion system. The family
is long and carries a number of predicted functional
regions, including an ERMS or endoplasmic reticulum
membrane retention signal at both the C- and the
N-termini, a leucine-zipper motif from residues 539-560,
and a nuclear localisation signal at 1358-1361. this
conserved AvrE-family of effectors is among the few that
are required for full virulence of many phytopathogenic
pseudomonads, erwinias and pantoeas.
Length = 1771
Score = 35.1 bits (81), Expect = 0.21
Identities = 50/220 (22%), Positives = 72/220 (32%), Gaps = 22/220 (10%)
Query: 713 NSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSA- 771
+ Q + +P + SPT R+ + + + P + + IQ+A
Sbjct: 12 TAVQPEATPSAGAPTGLQQSSESPTQRASHSLASEGKKNRKKMPKVFQKSSAPRQIQAAP 71
Query: 772 EQGSPNTRSRVQSSANIR----SAEQGSPSIRSAVQRSPNTRSRVQSSLTT-HSAEQGSP 826
Q T + QSS A A + SP+ R +S H E +
Sbjct: 72 PQALNPTAAAPQSSRGPTLRELLALPEDDGETQAPESSPSARRLTRSEGVARHEMEDLA- 130
Query: 827 TTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTE 886
R V+ R Q N S+ +R P V T S P T
Sbjct: 131 -GRPVVKPDADRQLR-QDILNKSSSSRRPP-----VSKEEGTSSKM-------PATALAS 176
Query: 887 QGSPNIRSTEQRASRTRSADQASQSVVTRSRSKLPSKKKD 926
Q RS DQASQS ++RSR P+ D
Sbjct: 177 AALFKDDEIRQEVDAARS-DQASQSRLSRSRGNPPAIPPD 215
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 34.3 bits (78), Expect = 0.34
Identities = 29/145 (20%), Positives = 43/145 (29%), Gaps = 1/145 (0%)
Query: 524 TPQGRSYVGIQHLSGGERTLAALAMIFTIWKLVVRVQQLRPLTP-PPQSDPESGTQGELS 582
TP ++ I H+ A M+ W R TP P + P Q +
Sbjct: 660 TPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAA 719
Query: 583 PPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPLPEVSSSPNTRSR 642
P + A P A P +R A P +P +
Sbjct: 720 ATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP 779
Query: 643 VQSSPTTRSRVQSSLTIQSAEQGSP 667
Q+ P + R + + T Q Q P
Sbjct: 780 PQAPPAPQQRPRGAPTPQPPPQAGP 804
>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
Provisional.
Length = 576
Score = 34.2 bits (78), Expect = 0.35
Identities = 18/82 (21%), Positives = 31/82 (37%), Gaps = 9/82 (10%)
Query: 573 PESGTQGELSPPTPGPNPHSRVNPNPNSRANPN---------PNPNSRANPNLSRRAQAN 623
PE+ G+ P P P + R N N + NP P+ R N N + + +
Sbjct: 427 PENKGDGQNEPVIPKPLDNERDQSNKNKQVNPGNRHNSEDRYTRPHGRNNENRNYNNKNS 486
Query: 624 NSAQNPLPEVSSSPNTRSRVQS 645
+ ++P P + + S
Sbjct: 487 DIPKHPERSEHEQPEDKKKKSS 508
>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
Provisional.
Length = 481
Score = 34.0 bits (77), Expect = 0.35
Identities = 29/139 (20%), Positives = 58/139 (41%), Gaps = 13/139 (9%)
Query: 660 QSAEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQ----ANNSA 715
Q+A +P + ++ A + +Q +P+ + T + K + + Q A A
Sbjct: 245 QTANTATPKAEVKTEAPA---AEKQAAPVVKENTNTNTATTEKKETTTQQQTAPKAPTEA 301
Query: 716 QNPLPEVSSSPNTRS-RVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQG 774
P P S++ N ++ T + S+P+ + ++ T + ++ A QG
Sbjct: 302 AKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTN-----ANQG 356
Query: 775 SPNTRSRVQSSANIRSAEQ 793
S N S +SA I A++
Sbjct: 357 SSNNNSNSSASAIIAEAQK 375
Score = 34.0 bits (77), Expect = 0.40
Identities = 30/143 (20%), Positives = 54/143 (37%), Gaps = 23/143 (16%)
Query: 596 PNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQS 655
P ++ P P+ N+ AN + N+ P +++ NT S ++ T
Sbjct: 298 PTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNT------ 351
Query: 656 SLTIQSAEQGSPNTRSRVQSSANIRSAEQG------------SPISRTRFTKYTLQSTKS 703
+A QGS N S +SA I A++ + + +TKY
Sbjct: 352 -----NANQGSSNNNSNSSASAIIAEAQKHLGKAYSWGGNGPTTFDCSGYTKYVFAKAGI 406
Query: 704 NLSRRAQANNSAQNPLPEVSSSP 726
+L R + A ++ + E + P
Sbjct: 407 SLPRTSGAQYASTTRISESQAKP 429
>gnl|CDD|177556 PHA03195, PHA03195, tegument protein VP11/12; Provisional.
Length = 746
Score = 34.2 bits (78), Expect = 0.37
Identities = 20/77 (25%), Positives = 34/77 (44%)
Query: 249 RPLTPPPQSDPESGTQGELSPPTPGPNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPL 308
P P + P S T+G P PG SR P P++ +P + A+ + +A+N
Sbjct: 439 CPPQGPLPAPPNSKTRGTFRRPRPGSVRGSRQLPASPPSNIVSPRTNPAAEGSTAAKNGQ 498
Query: 309 FSKGEHLCGTLNFQDCT 325
++ + + F DC
Sbjct: 499 GAETIQVRSSGEFNDCI 515
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 33.6 bits (77), Expect = 0.48
Identities = 20/90 (22%), Positives = 29/90 (32%), Gaps = 3/90 (3%)
Query: 701 TKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTT-- 758
S L ++ + P P T S + R +Q T + S T
Sbjct: 236 KISPLKKKKTGTLKSSKPEPGTPLKRQTSPASSSQKSRRRSLQRVLTDERKSSSRRTPSL 295
Query: 759 -RSRVQSSLTIQSAEQGSPNTRSRVQSSAN 787
RSR SSL + S N + S +
Sbjct: 296 LRSRTNSSLIEFLKRESSENLLPSLSSRTS 325
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 33.1 bits (75), Expect = 0.55
Identities = 23/94 (24%), Positives = 31/94 (32%), Gaps = 5/94 (5%)
Query: 558 RVQQLRPLTPP-PQSDPESGTQGELSPPTPGPNPHSR---VNPNPNSRANPNPNPNSRAN 613
R+ R P P G ++ P PG P+ NP P P N N
Sbjct: 168 RINPQRFGFPMQPNMGMRPGFN-QMPPHMPGMPPNQMRPGFNPMPGMPPRPGFNQNPNMM 226
Query: 614 PNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSP 647
PN++R P PN + R +P
Sbjct: 227 PNMNRPGFRPQPGGFNHPGTPMGPNMQQRPGFNP 260
>gnl|CDD|140307 PTZ00284, PTZ00284, protein kinase; Provisional.
Length = 467
Score = 33.4 bits (76), Expect = 0.60
Identities = 21/79 (26%), Positives = 29/79 (36%), Gaps = 3/79 (3%)
Query: 847 NTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSPNIRSTEQRASRTRSAD 906
N S N + T+RST RS S R T + + R+ + T
Sbjct: 25 NALSGNSPKANNSAS-TGQTTSRSTNSARRSGSKRDRETATSTDSGRTKSHEGAATT--K 81
Query: 907 QASQSVVTRSRSKLPSKKK 925
QA+ + T P KKK
Sbjct: 82 QATTTPTTNVEVAPPPKKK 100
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 32.9 bits (75), Expect = 0.71
Identities = 16/75 (21%), Positives = 23/75 (30%), Gaps = 5/75 (6%)
Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
P PP + PP P P + + + P +PN+ P
Sbjct: 44 PPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPP-----PVDP 98
Query: 624 NSAQNPLPEVSSSPN 638
N+ P PE N
Sbjct: 99 NAPPPPAPEPGRIDN 113
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 31.9 bits (71), Expect = 0.76
Identities = 14/65 (21%), Positives = 16/65 (24%), Gaps = 5/65 (7%)
Query: 566 TPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNS 625
+ +PE E P P P P P P P P A S
Sbjct: 75 EGEEEPEPE-----EEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEPEPEPGAATLKS 129
Query: 626 AQNPL 630
P
Sbjct: 130 VALPF 134
Score = 29.2 bits (64), Expect = 6.1
Identities = 10/48 (20%), Positives = 13/48 (27%), Gaps = 1/48 (2%)
Query: 567 PPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANP 614
P+ + E + E P P P P P P P
Sbjct: 62 DEPEEEEEPEPE-EEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEP 108
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 32.8 bits (74), Expect = 0.95
Identities = 27/107 (25%), Positives = 46/107 (42%), Gaps = 5/107 (4%)
Query: 684 QGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTR---SRVQSSPTTRS 740
+G P F + + A +N + + P +P+T + S P T S
Sbjct: 553 KGVPYIPGLFAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPS 612
Query: 741 RVQSSPTT--RSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSS 785
++ S T+ S + S TT S +SS+ + S E SP + +V S+
Sbjct: 613 KIVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIKVAST 659
Score = 32.8 bits (74), Expect = 1.0
Identities = 31/106 (29%), Positives = 47/106 (44%), Gaps = 3/106 (2%)
Query: 712 NNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSA 771
N + N P S++ + + SP+T +SP + S P T S++ S T A
Sbjct: 565 NPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPA-GHLGSPPATPSKIVSPSTSPPA 623
Query: 772 EQ-GSPNTR-SRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSS 815
GSP+T S +SS + S E SP V + ++ S V S
Sbjct: 624 SHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSVSMVSMS 669
>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 433 to 538 amino acids in length. This domain is
found associated with pfam00616, pfam00168. This domain
has two conserved sequence motifs: DLQ and PLSFQNP.
Length = 489
Score = 32.8 bits (74), Expect = 1.0
Identities = 54/310 (17%), Positives = 88/310 (28%), Gaps = 21/310 (6%)
Query: 597 NPNSRANPNPNPNSRANPNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSS 656
NP + P+ R S ++ Q + + S +R+ S PT ++
Sbjct: 10 NPTPVQQQSRRPHERNPSPPSVSCGLSSGLQKIMMDDLDSSVDFTRLPS-PTPENKDLFF 68
Query: 657 LTIQSAEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQ 716
+T S Q SP + S + I E G P + LQ S +
Sbjct: 69 VTRPSLAQSSP---AYSSSYSEINEPELGMPNGGRSLSLVDLQDAASGQVHSSPMGLPGA 125
Query: 717 NPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQ------- 769
L S + RS S Q R+ + + ++ L+ Q
Sbjct: 126 LGLRRASVPWSARSGQSSIAGGLRLSQPQSAPTVRLLHAAPAQPQLLQPLSFQNPVYHMA 185
Query: 770 -------SAEQGSPNTRSRVQSSANIRSAEQGSPSIRSA--VQRSPNTRSRVQSSLTTHS 820
+ GS N S S + + S R +R Q SL
Sbjct: 186 APLPVSPRGDSGSENHSSASSHSNSEDCSLSAGFSNSGGEEFTRRSTDFTRRQLSLPDRQ 245
Query: 821 AEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSP 880
+ P S R+ +H S + S +S +R
Sbjct: 246 HQPALPRQNSAGP-QRRVDQPSPPGGGSHRGRIPPSLLSSLPSEGSMLSSEWPQSGARPR 304
Query: 881 LTRSTEQGSP 890
S+ +G
Sbjct: 305 QQSSSSKGDS 314
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 32.6 bits (75), Expect = 1.1
Identities = 11/36 (30%), Positives = 16/36 (44%), Gaps = 3/36 (8%)
Query: 515 YLAGMNFSCTPQGRSYVGIQHLSGGERTLAALAMIF 550
YL F+ Q + + LSGGE+ LA +
Sbjct: 422 YLGRFGFTGEDQEKP---VGVLSGGEKARLLLAKLL 454
>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2). All proteins
in this family for which functions are known are flap
endonucleases that generate the 3' incision next to DNA
damage as part of nucleotide excision repair. This
family is related to many other flap endonuclease
families including the fen1 family. This family is based
on the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1034
Score = 32.6 bits (74), Expect = 1.3
Identities = 30/201 (14%), Positives = 61/201 (30%), Gaps = 9/201 (4%)
Query: 29 EIHIPTVDSTSSVSSEEIRRHESNLNLLRRKTQQRRTIQHTESSTSSSNSEQNLRELTKA 88
H + + +S E + +ESNL+ + SS + + + E
Sbjct: 551 SDHPEQFEFQNELSPLETKNNESNLSSDAETEGSPNPEMPSWSSVTVPSEALDNYE---- 606
Query: 89 TTYIQNIESLLSQASDLIERRQLEAHRILYVLLKFRTELEGTHQSYDSGSDQAGFISPME 148
TT N + + + A I+ + L ++ + +S SD S +E
Sbjct: 607 TTNPSNAKEVRNFAETGIQTTNVGESADLLLISNPMEVEPMESEKEESESD----GSFIE 662
Query: 149 TQDENPPSEHSSEFPGVERCLRRIESSLLNVRDLLENRPSDTLSPAQSEVQAEFLSGESR 208
+ E E++ V + + E + + + G
Sbjct: 663 VDSVSSTLELQVPSKSQPTDESE-ENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIE 721
Query: 209 EELTFANESSETNSQGVPRAE 229
EE + +E + E
Sbjct: 722 EEKDADDFKNEWQDISLEELE 742
>gnl|CDD|215814 pfam00242, DNA_pol_viral_N, DNA polymerase (viral) N-terminal
domain.
Length = 379
Score = 32.1 bits (73), Expect = 1.4
Identities = 20/124 (16%), Positives = 34/124 (27%), Gaps = 7/124 (5%)
Query: 806 PNTRSRVQSSLTTHSAEQGSPTTRSTVQGA---PRIHSREQSSPNTHSAEQRSPNTRSRV 862
R + T S + Q R+ + H + RS + R R
Sbjct: 215 SENRRTRNLANNTSRKSDTSRSVGPVRQSQIQRSRLGLQANQGKLAHGQQGRSGSIRGRK 274
Query: 863 QSSATTRSTEQR---SRSRSPLTRSTEQGSPNIRSTEQRASRTRSADQASQSVVTRSRSK 919
S+ T R S + S+ E S ++++ S S
Sbjct: 275 HST-TRRPFGVEPSSSGVTTNRASSSSSCFHQSAVRETAYSSLSTSERHSSSGHAVELRS 333
Query: 920 LPSK 923
+P
Sbjct: 334 IPGG 337
Score = 30.9 bits (70), Expect = 3.1
Identities = 25/155 (16%), Positives = 47/155 (30%), Gaps = 16/155 (10%)
Query: 666 SPNTRSRVQSSANI--RSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNS--------A 715
S+ I RS + + ++ + S R++Q S
Sbjct: 198 RHGDEPFGAQSSGILSRSENRRTRNLANNTSRKSDTSRSVGPVRQSQIQRSRLGLQANQG 257
Query: 716 QNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGS 775
+ + S + R R S+ V+ S + + ++S ++ QS+ S
Sbjct: 258 KLAHGQQGRSGSIRGRKHSTTRRPFGVEPSSSGVTTNRASSSSSCFHQSA---VRETAYS 314
Query: 776 PNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTRS 810
+ S SS+ SI S N
Sbjct: 315 SLSTSERHSSS---GHAVELRSIPGGSVSSQNAGP 346
Score = 29.4 bits (66), Expect = 8.7
Identities = 34/152 (22%), Positives = 54/152 (35%), Gaps = 14/152 (9%)
Query: 609 NSRANPNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPN 668
SR+ N R ANN+++ S P +S++Q S Q L + S +
Sbjct: 212 LSRSE-NRRTRNLANNTSRKSDTSRSVGPVRQSQIQRSRLGLQANQGKLAH-GQQGRSGS 269
Query: 669 TRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNT 728
R R S+ R G S + T S+ S Q+ + E + S +
Sbjct: 270 IRGRKHSTTR-RP--FGVEPSSSGVTTNRASSSSSCFH---------QSAVRETAYSSLS 317
Query: 729 RSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRS 760
S SS ++S P Q++ S
Sbjct: 318 TSERHSSSGHAVELRSIPGGSVSSQNAGPLLS 349
>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal. Mediator is a
large complex of up to 33 proteins that is conserved
from plants to fungi to humans - the number and
representation of individual subunits varying with
species. It is arranged into four different sections, a
core, a head, a tail and a kinase-activity part, and the
number of subunits within each of these is what varies
with species. Overall, Mediator regulates the
transcriptional activity of RNA polymerase II but it
would appear that each of the four different sections
has a slightly different function. Mediator subunit
Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast
transcriptional co-repressor.
Length = 381
Score = 31.9 bits (72), Expect = 1.5
Identities = 21/106 (19%), Positives = 41/106 (38%)
Query: 800 SAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTR 859
+ Q S + + S + +T +T A + + S+PNT + + T
Sbjct: 118 TYNQLGNAGASASITKTSNGSDAATTSSTANTPAAAKVLKANAASAPNTTTGVGSAATTA 177
Query: 860 SRVQSSATTRSTEQRSRSRSPLTRSTEQGSPNIRSTEQRASRTRSA 905
+ ++ATT +T Q+ + T+ T + +A SA
Sbjct: 178 AISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASA 223
Score = 30.0 bits (67), Expect = 6.0
Identities = 18/184 (9%), Positives = 53/184 (28%), Gaps = 5/184 (2%)
Query: 559 VQQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSR 618
Q + +G+ + + P + N+ + PN + +
Sbjct: 120 NQLGNAGASASITKTSNGSDAA-TTSSTANTPAAAKVLKANAASAPNTTTGVGSAATTAA 178
Query: 619 RAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSAN 678
+ + + P + + +++ + Q++ + Q+++
Sbjct: 179 ISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMGSLGVPQNTSM 238
Query: 679 IRSAEQGSPISRTRFTKYTLQSTKSNLSR----RAQANNSAQNPLPEVSSSPNTRSRVQS 734
+ +P+ + + S L+ R QN +++ S N +
Sbjct: 239 LAQIPNPTPLMQLLNGVSPNNAMASPLNNMSPMRNLNQMGNQNNGGQMTPSANNGNMNNQ 298
Query: 735 SPTT 738
S
Sbjct: 299 SREN 302
>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein.
Length = 346
Score = 31.6 bits (72), Expect = 1.7
Identities = 11/60 (18%), Positives = 18/60 (30%), Gaps = 2/60 (3%)
Query: 572 DPESGTQGELSPPTPGPNPHSRVNPNPNSRAN--PNPNPNSRANPNLSRRAQANNSAQNP 629
+P + L P SR++ + NSR S R + N +
Sbjct: 128 NPNNDEAIPLRFSPGLPKGFYIEGFRGRSRSSSRSSSRSNSRGPSRGSSRNNSRNRNSSS 187
>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA. Members
of this protein share two defining regions. One is a
histidine/serine-rich cluster, typically
H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
the context of a pair of radical SAM proteins, HxsB and
HxsC, and a fourth protein HxsD. The system is predicted
to perform peptide modifications, likely in the
His-Xaa-Ser region, to produce some uncharacterized
natural product.
Length = 186
Score = 31.0 bits (70), Expect = 1.9
Identities = 19/89 (21%), Positives = 34/89 (38%)
Query: 835 APRIHSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSPNIRS 894
R HS +S + S + + S +T S S S SP S+ Q P+
Sbjct: 52 GHRSHSSHRSHSSHSSHYSGAGGSYSVPSGDTSTYSYPVPSPSYSPSPGSSIQSLPSTTG 111
Query: 895 TEQRASRTRSADQASQSVVTRSRSKLPSK 923
++S + + + +V R ++ L
Sbjct: 112 VRPQSSAENANSEKRKLLVIRVQTALVVF 140
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 31.4 bits (71), Expect = 2.2
Identities = 21/86 (24%), Positives = 33/86 (38%), Gaps = 8/86 (9%)
Query: 841 REQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSPNIRSTEQRAS 900
RE+ ++ S R R + + R +RSR RS S R ++R
Sbjct: 7 REREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDS--------RPRDRRRY 58
Query: 901 RTRSADQASQSVVTRSRSKLPSKKKD 926
+RS S V RSR + + +
Sbjct: 59 DSRSPRSLRYSSVRRSRDRPRRRSRS 84
>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9. The KAR9 protein in
Saccharomyces cerevisiae is a cytoskeletal protein
required for karyogamy, correct positioning of the
mitotic spindle and for orientation of cytoplasmic
microtubules. KAR9 localises at the shmoo tip in mating
cells and at the tip of the growing bud in anaphase.
Length = 626
Score = 31.8 bits (72), Expect = 2.3
Identities = 31/260 (11%), Positives = 70/260 (26%), Gaps = 11/260 (4%)
Query: 605 NPNPNSRANPNLSRRAQANNSAQNPLPE---VSSSPNTRSRVQSSPTTRSRVQSSLTIQS 661
+ +S+ + + + NP SSSP++ + + SS + +
Sbjct: 366 RDSQSSKIQQIRDSISVSGSDYSNPGSSIDTPSSSPSSSVIMTPPDSGPGSNVSSRRVGT 425
Query: 662 AEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPE 721
S + + NI+ S + ++++ + + E
Sbjct: 426 PGSKSDRVGAVLLRRMNIKPTL-ASIPDEKPSNISVFEDSETSPNSSTLLRDPPPKKCGE 484
Query: 722 VSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSR 781
S T S + +S + +R + S S +
Sbjct: 485 ESGHLPNNPFFNKLKLTLSSIPPLSPRQSIITLPTPSRPASRIS----SLSLRLGSYSGS 540
Query: 782 VQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSR 841
+ S + + + RS S ++ + + SR
Sbjct: 541 IVSPPPYPTLVSRKGAAGLSFNRS---VSDIEGERIGRYNLLPTRIPALPFKAESTTSSR 597
Query: 842 EQSSPNTHSAEQRSPNTRSR 861
SS + + P + R
Sbjct: 598 RSSSLPSPTGVIGFPGSVPR 617
>gnl|CDD|182059 PRK09752, PRK09752, adhesin; Provisional.
Length = 1250
Score = 31.6 bits (71), Expect = 2.3
Identities = 11/27 (40%), Positives = 14/27 (51%)
Query: 582 SPPTPGPNPHSRVNPNPNSRANPNPNP 608
SPP P P P P+P+ +P P P
Sbjct: 922 SPPDPDPTPDPDPTPDPDPTPDPEPTP 948
Score = 31.2 bits (70), Expect = 3.2
Identities = 13/41 (31%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 580 ELSPPTP-GPNPHSRVNPNPNSRANPNPNPNSRANPNLSRR 619
E++PP+P P+P +P P+ P+P P P L+ +
Sbjct: 917 EVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQPVLNAK 957
Score = 30.4 bits (68), Expect = 6.2
Identities = 15/47 (31%), Positives = 22/47 (46%), Gaps = 11/47 (23%)
Query: 246 LELRPLTPPPQSDPESGTQGELSPPTPGPNPNSRANPNPNPNSRANP 292
L + +TPP DP+ PTP P+P +P+P P+ P
Sbjct: 913 LRSQEVTPPSPPDPD---------PTPDPDPT--PDPDPTPDPEPTP 948
>gnl|CDD|213194 cd03227, ABC_Class2, ATP-binding cassette domain of non-transporter
proteins. ABC-type Class 2 contains systems involved in
cellular processes other than transport. These families
are characterized by the fact that the ABC subunit is
made up of duplicated, fused ABC modules (ABC2). No
known transmembrane proteins or domains are associated
with these proteins.
Length = 162
Score = 30.4 bits (69), Expect = 2.4
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 536 LSGGERTLAALAMIFTIWKL 555
LSGGE+ L+ALA+I + L
Sbjct: 78 LSGGEKELSALALILALASL 97
>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
UL32. The large phosphorylated protein (UL32-like) of
herpes viruses is the polypeptide most frequently
reactive in immuno-blotting analyses with antisera when
compared with other viral proteins.
Length = 777
Score = 31.4 bits (71), Expect = 2.6
Identities = 44/357 (12%), Positives = 92/357 (25%), Gaps = 24/357 (6%)
Query: 571 SDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPL 630
+P PG ++ S + +P + + +
Sbjct: 369 FEPGLSPSPNSGKEMPGILTTENLDLPLASTDSTEMDPEDKRGGAVKINNSGILAWGLKT 428
Query: 631 PEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTR--SRVQSSANIRSAEQGSPI 688
P ++ + V S S + S S SA
Sbjct: 429 PGLAVNDERSIAVSSDG----ITDVLDPPSPLRLHSSDKVIDSVSPPSKRRVSAPASRLD 484
Query: 689 SRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTT 748
R + T + S + + A E SS + V S R R
Sbjct: 485 DAKRP-----EVTATPESSGSDSEGGASGREDETSSDAES---VVSIKELRPR---IGFI 533
Query: 749 RSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNT 808
+SR ++ + S S R + S + + V+ + N
Sbjct: 534 NKSPPPKSPPKSR-RTLIVALSLASPSTAGSPRPKPSLGKFVIGTDPFAFANTVRLTDNM 592
Query: 809 RSRVQSSLTTHSAEQGSPTT----RSTVQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQS 864
R + S S ++ A + + SS +A +++ ++ S
Sbjct: 593 RGGNGVGSSVKPKGSASSKPLTGPGSDLKPATL-NGKTPSSSLVGAARNAGASSKVKIPS 651
Query: 865 SATTRSTEQRSRSRSPLTRSTEQGSPNIRSTEQRASRTRSADQASQSVVTRSRSKLP 921
++ + T S ++ + ++A + S + +
Sbjct: 652 GLGGFTSPISLLESALEDVLTSATSTPVKK-NDPYLWDTNGEKAGGGTESASTTDVF 707
>gnl|CDD|171912 PRK13223, PRK13223, phosphoglycolate phosphatase; Provisional.
Length = 272
Score = 31.0 bits (70), Expect = 2.7
Identities = 22/65 (33%), Positives = 32/65 (49%), Gaps = 23/65 (35%)
Query: 376 LRWQDRGGVCRQHDVKMQLQLNKEQQRLAPLLDRLR--RYEADLKALYKRRWIL------ 427
L+W + GV +M L NK ++ +APLLD+++ RY RWI+
Sbjct: 110 LKWLKKQGV------EMALITNKPERFVAPLLDQMKIGRY---------FRWIIGGDTLP 154
Query: 428 QKKFD 432
QKK D
Sbjct: 155 QKKPD 159
>gnl|CDD|227952 COG5665, NOT5, CCR4-NOT transcriptional regulation complex, NOT5
subunit [Transcription].
Length = 548
Score = 31.2 bits (70), Expect = 3.0
Identities = 32/178 (17%), Positives = 71/178 (39%), Gaps = 8/178 (4%)
Query: 713 NSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTT-RSRVQSSLTIQSA 771
+S+ N P+ ++ + S ++SS + + SP ++ + + R+ + ++SA
Sbjct: 203 SSSNNEAPKEGNNQTSLSSIRSS----KKQERSPKKKAPQRDVSISDRATTPIAPGVESA 258
Query: 772 EQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRST 831
Q +T + V + + + + S ++ +P T ++ + + +EQ + +
Sbjct: 259 SQSISSTPTPVSTDTPLHTVKDDSIKFDNSTLGTPTTHVSMKKKESENDSEQQLNFPKDS 318
Query: 832 VQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGS 889
I Q T++A Q A+ R Q + SP ST + S
Sbjct: 319 TD---EIRKTIQHDVETNAAFQNPLFNDELKWWLASKRYLTQPLQEMSPSMVSTLENS 373
Score = 30.8 bits (69), Expect = 4.1
Identities = 25/127 (19%), Positives = 50/127 (39%), Gaps = 7/127 (5%)
Query: 699 QSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTT 758
Q++ S++ + S + P+ S + R+ +P S QS +T + V +
Sbjct: 216 QTSLSSIRSSKKQERSPKKKAPQRDVSISDRATTPIAPGVESASQSISSTPTPVSTDTPL 275
Query: 759 RSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQG------SPS-IRSAVQRSPNTRSR 811
+ S+ ++ G+P T ++ + +EQ S IR +Q T +
Sbjct: 276 HTVKDDSIKFDNSTLGTPTTHVSMKKKESENDSEQQLNFPKDSTDEIRKTIQHDVETNAA 335
Query: 812 VQSSLTT 818
Q+ L
Sbjct: 336 FQNPLFN 342
>gnl|CDD|220096 pfam09052, SipA, Salmonella invasion protein A. Salmonella
invasion protein A is an actin-binding protein that
contributes to host cytoskeletal rearrangements by
stimulating actin polymerisation and counteracting
F-actin destabilising proteins. Members of this family
possess an all-helical fold consisting of eight
alpha-helices arranged so that six long, amphipathic
helices form a compact fold that surrounds a final,
predominantly hydrophobic helix in the middle of the
molecule.
Length = 674
Score = 31.2 bits (70), Expect = 3.1
Identities = 42/211 (19%), Positives = 68/211 (32%), Gaps = 29/211 (13%)
Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
P P P P G G+ S P + H ++ + S N + +RR N
Sbjct: 277 PPEPMPDGGPTPGGNGKTSQPV-EIHYHINIHNDNRSYDNRVFDNRGDTYLGGARRHYDN 335
Query: 624 -------NSAQNPLPEVSSSPNTRSRVQSSPTT-----RSRVQS---------SLTIQSA 662
N AQ+P + + + + S P + + VQ S T+ +
Sbjct: 336 SYHENSENDAQSPTSQTNDLSRNGNSLLSPPASPAAGQHALVQKVTSVLPHSISGTVDTF 395
Query: 663 EQGS---PNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPL 719
S + S +R A GS T + + ++ S R NS+
Sbjct: 396 ANNSAEKVFNHTPDNSDGAVRLAGIGSDGLTTSSQERSANNSLSRGGRPLNIQNSSVTD- 454
Query: 720 PEVSSSPNTRSRVQSSPTTRSRVQSSPTTRS 750
P + + S SS TT+S
Sbjct: 455 ---PLHPVLTAADGAEGVKSSTDNSSDTTKS 482
>gnl|CDD|213181 cd03214, ABC_Iron-Siderophores_B12_Hemin, ATP-binding component of
iron-siderophores, vitamin B12 and hemin transporters
and related proteins. ABC transporters, involved in the
uptake of siderophores, heme, and vitamin B12, are
widely conserved in bacteria and archaea. Only very few
species lack representatives of the siderophore family
transporters. The E. coli BtuCD protein is an ABC
transporter mediating vitamin B12 uptake. The two
ATP-binding cassettes (BtuD) are in close contact with
each other, as are the two membrane-spanning subunits
(BtuC); this arrangement is distinct from that observed
for the E. coli lipid flippase MsbA. The BtuC subunits
provide 20 transmembrane helices grouped around a
translocation pathway that is closed to the cytoplasm by
a gate region, whereas the dimer arrangement of the BtuD
subunits resembles the ATP-bound form of the Rad50 DNA
repair enzyme. A prominent cytoplasmic loop of BtuC
forms the contact region with the ATP-binding cassette
and represent a conserved motif among the ABC
transporters.
Length = 180
Score = 30.1 bits (69), Expect = 3.1
Identities = 12/32 (37%), Positives = 14/32 (43%), Gaps = 8/32 (25%)
Query: 525 PQGRSYVGIQH--------LSGGERTLAALAM 548
PQ +G+ H LSGGER LA
Sbjct: 79 PQALELLGLAHLADRPFNELSGGERQRVLLAR 110
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 31.2 bits (71), Expect = 3.2
Identities = 32/166 (19%), Positives = 55/166 (33%), Gaps = 26/166 (15%)
Query: 527 GRSYVGIQHLSGGERTLAALAMIFTIWKLVVRVQQLRPLTPPPQSDPESGTQGELSPPTP 586
GR V + + R A L L R ++ P + + + + E P P
Sbjct: 288 GRRRVTAEDV----REAAELV-------LPHRRRRKPFEQPQGKDEKDLEEKPEEPGPDP 336
Query: 587 -----GPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPL-----PEVSSS 636
G + + R +P P + P+ A S+ +P V +
Sbjct: 337 EKPDEGEDDAEQ----SGPRGHPTPGNDDEKEPDPQEEADGQGSSTDPAGDIFRIRVLAP 392
Query: 637 PNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSA 682
P R+R S +R+R S A + + A +R+A
Sbjct: 393 PQARARGASGRRSRTRSDSRGRYVRARRNRGPP-DDLAVDATLRAA 437
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 30.5 bits (69), Expect = 3.3
Identities = 19/89 (21%), Positives = 28/89 (31%), Gaps = 6/89 (6%)
Query: 559 VQQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSR 618
P P P+ P+ + + P P P P P P + P P P + ++
Sbjct: 74 ETPPEPTPPKPKEKPK--PEKKPKKPKPKPKPK----PKPKPKVKPQPKPKKPPSKTAAK 127
Query: 619 RAQANNSAQNPLPEVSSSPNTRSRVQSSP 647
A N P S+S S
Sbjct: 128 APAAPNQPARPPSAASASGAATGPSASYL 156
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 31.0 bits (70), Expect = 3.3
Identities = 12/44 (27%), Positives = 14/44 (31%), Gaps = 4/44 (9%)
Query: 252 TPP---PQSDPESGTQGELSPPTPGPNPNSRANPNPNPNSRANP 292
T P S P G +PP P P + P P P
Sbjct: 904 TAPGGDAASAPPPGAGPP-APPQAVPPPRTTQPPAAPPRGPDVP 946
>gnl|CDD|215187 PLN02328, PLN02328, lysine-specific histone demethylase 1 homolog.
Length = 808
Score = 31.1 bits (70), Expect = 3.3
Identities = 32/126 (25%), Positives = 47/126 (37%), Gaps = 23/126 (18%)
Query: 184 ENRPSDTLSPAQSEVQAEFLSGESREELTFANESSETNSQGVPRAERIMQRIESSLLQVR 243
E + + + ++V +E S E+ +L+ + SE N + + Q + L
Sbjct: 4 ETKEPEDPADNVNDVVSEASSPET--DLSLSPSQSEQNIENDGQNSPETQSPLTELQPS- 60
Query: 244 NLLELRPLTPP-PQSDPESGTQGELSPPTPGP-NPNSRANPNPNPNSR-----------A 290
PL P P S +QG+ S P NPNS P P P R A
Sbjct: 61 ------PLPPNTTLDAPVSDSQGDESSSEQQPQNPNS-TEPAPPPKKRRRRKRFFTEINA 113
Query: 291 NPNLSR 296
NP R
Sbjct: 114 NPAFRR 119
Score = 30.3 bits (68), Expect = 5.6
Identities = 17/56 (30%), Positives = 20/56 (35%), Gaps = 2/56 (3%)
Query: 566 TPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSR--ANPNLSRR 619
+P QS L P T P S + +S NPNS A P RR
Sbjct: 47 SPETQSPLTELQPSPLPPNTTLDAPVSDSQGDESSSEQQPQNPNSTEPAPPPKKRR 102
>gnl|CDD|213844 TIGR03657, IsdB, heme uptake protein IsdB. Isd proteins are
iron-regulated surface proteins found in Bacillus,
Staphylococcus and Listeria species and are responsible
for heme scavenging from hemoproteins. The IsdB protein
is only observed in Staphylococcus and consists of an
N-terminal hydrophobic signal sequence, a pair of tandem
NEAT (NEAr Transporter, pfam05031) domains which confers
the ability to bind heme and a C-terminal sortase
processing signal which targets the protein to the cell
wall. IsdB is believed to make a direct contact with
methemoglobin facilitating transfer of heme to IsdB. The
heme is then transferred to other cell wall-bound NEAT
domain proteins such as IsdA and IsdC.
Length = 644
Score = 31.1 bits (69), Expect = 3.3
Identities = 27/152 (17%), Positives = 62/152 (40%), Gaps = 7/152 (4%)
Query: 706 SRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSS 765
+++ Q +NSA+ ++P T S+ + P + + QS + SS
Sbjct: 462 NKKEQQDNSAKK-----ETTPATPSKPTTPPVEKESQKQDSQKDDNKQSPSVEKENDASS 516
Query: 766 LTIQSAEQGSPNTRSRVQSSANI--RSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQ 823
+ + + + V+SS+ + + S T+ VQ+S + A+
Sbjct: 517 ESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETTKDVVQTSAGSSEAKD 576
Query: 824 GSPTTRSTVQGAPRIHSREQSSPNTHSAEQRS 855
+P ++ ++ H++ Q++ NT + +S
Sbjct: 577 SAPLQKANIKNTNDGHTQSQNNKNTQENKAKS 608
Score = 31.1 bits (69), Expect = 3.7
Identities = 29/148 (19%), Positives = 61/148 (41%), Gaps = 8/148 (5%)
Query: 574 ESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANN-SAQNPLPE 632
++ + E +P TP S+ P + + + N + N+ S+++ +
Sbjct: 468 DNSAKKETTPATP-----SKPTTPPVEKESQKQDSQKDDNKQSPSVEKENDASSESGKDK 522
Query: 633 VSSSPNTRSRVQSSPTTRSRVQSSL--TIQSAEQGSPNTRSRVQSSANIRSAEQGSPISR 690
++ + V+SS TT ++V S+ + S T+ VQ+SA A+ +P+ +
Sbjct: 523 TPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETTKDVVQTSAGSSEAKDSAPLQK 582
Query: 691 TRFTKYTLQSTKSNLSRRAQANNSAQNP 718
T+S ++ Q N + P
Sbjct: 583 ANIKNTNDGHTQSQNNKNTQENKAKSLP 610
>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
translational repressor [Translation, ribosomal
structure and biogenesis].
Length = 777
Score = 31.3 bits (71), Expect = 3.4
Identities = 46/269 (17%), Positives = 80/269 (29%), Gaps = 34/269 (12%)
Query: 596 PNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQS 655
S A + S++ + N S + S+ +T+ +S T S +
Sbjct: 81 SGSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQGNANSSVTLSSSTA 140
Query: 656 SLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSA 715
S S + PN + + + S S T S+ S ++
Sbjct: 141 SSMFNSNKLPLPN------PNHSNSATTNQSGSS----FINTPASSSSQPLTNLVVSSIK 190
Query: 716 QNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRV-QSSPTTRSRVQSSLTI---QSA 771
+ P S SP + S + + + + + SP + QS
Sbjct: 191 RFPY-LTSLSPFFNYLIDPSSDSATASADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQSV 249
Query: 772 EQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRST 831
E S S + S PSIR+ +SA S+
Sbjct: 250 ENNIILNSS--SSINELTSIYGSVPSIRNLRG--------------LNSALVSFLNVSSS 293
Query: 832 VQGAPRIHSREQS---SPNTHSAEQRSPN 857
++ +E S SP+T S + P
Sbjct: 294 SLAFSALNGKEVSPTGSPSTRSFARVLPK 322
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 31.1 bits (70), Expect = 3.6
Identities = 39/245 (15%), Positives = 85/245 (34%), Gaps = 50/245 (20%)
Query: 702 KSNLSRRAQANNSAQN---PLPEVSSSP--NTRSRVQSSPTTRSRVQSSPTTRSRVQSSP 756
NL Q N+ + L E S +P N + + + ++ S SS
Sbjct: 74 NLNLPDYNQIQNNTHDFYIDLNERSKTPIKNNDNVTTPIKANKKEKHNLDSSSSSSISSS 133
Query: 757 TTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSL 816
T SS + S + +S + + SP + ++ + +T + S
Sbjct: 134 LTNISFFSSP---------TSIYSCLSNS----LSSKHSPKV---IKENQSTHVNISSDN 177
Query: 817 TTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQ----RSPNTRSRVQSSATTRSTE 872
+ + E + ++ + TH+ RSP S ++++ ++ E
Sbjct: 178 SPRNKEISN-------------KQLKKQTNVTHTTCYDKMRRSPRNTSTIKNNTNDKNKE 224
Query: 873 QRSRSRSPLTRSTEQGSPNIRSTEQRA------------SRTRSADQASQSVVTRSRSKL 920
+ + + + R++E+ S T+ + ++VV+ RS +
Sbjct: 225 KNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSGIRSSV 284
Query: 921 PSKKK 925
K+K
Sbjct: 285 LLKRK 289
>gnl|CDD|236504 PRK09418, PRK09418, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 780
Score = 30.8 bits (69), Expect = 3.6
Identities = 22/112 (19%), Positives = 39/112 (34%), Gaps = 6/112 (5%)
Query: 565 LTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANN 624
+T D E+G + +PPT + P N NP + + NN
Sbjct: 652 ITKKNDDDKETGGENPTTPPTGEGDNGENPTTPPTGEGNNGENPTTPPT------GEGNN 705
Query: 625 SAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSS 676
P N S ++ S+ ++++ E+ P T + V S+
Sbjct: 706 GGNPTTPSTDEGNNAGSGQTTTDNQNSKETTTVSENKEERDLPKTGTSVAST 757
>gnl|CDD|177646 PHA03418, PHA03418, hypothetical E4 protein; Provisional.
Length = 230
Score = 30.5 bits (68), Expect = 3.8
Identities = 16/68 (23%), Positives = 24/68 (35%), Gaps = 3/68 (4%)
Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
P P PQ DP+ PP P P PN +++ P + A
Sbjct: 39 PHHPNPQEDPDKNPS---PPPDPPLTPRPPAQPNGHNKPPVTKQPGGEGTEEDHQAPLAA 95
Query: 624 NSAQNPLP 631
++ +P P
Sbjct: 96 DADDDPRP 103
>gnl|CDD|182276 PRK10160, PRK10160, taurine transporter subunit; Provisional.
Length = 275
Score = 30.5 bits (69), Expect = 3.8
Identities = 11/30 (36%), Positives = 18/30 (60%), Gaps = 1/30 (3%)
Query: 542 TLAALAMIFTIWKLVVRVQQLRPL-TPPPQ 570
++ LA++ +W V +Q + PL PPPQ
Sbjct: 27 SIGTLAVLLAVWWAVAALQLISPLFLPPPQ 56
>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183). This
family consists of several eukaryotic proteins of around
360 residues in length. The function of this family is
unknown.
Length = 317
Score = 30.5 bits (69), Expect = 3.9
Identities = 16/74 (21%), Positives = 23/74 (31%), Gaps = 4/74 (5%)
Query: 543 LAALAMIFTIWKLVVRVQQLRPLTPPPQSDPESGTQGELSPPTPGPNPH-SRVNPNPNSR 601
LA LA I ++K + +R P P+ G G G P S
Sbjct: 166 LAVLAFI--LYKFFLSCGGVRG-GPRPERAGYGGGGGGGGGGGGGGGSGPGPPPPGFKSS 222
Query: 602 ANPNPNPNSRANPN 615
P P + +
Sbjct: 223 FPPPYGPGAGPSSG 236
>gnl|CDD|128420 smart00110, C1Q, Complement component C1q domain. Globular domain
found in many collagens and eponymously in complement
C1q. When part of full length proteins these domains
form a 'bouquet' due to the multimerization of
heterotrimers. The C1q fold is similar to that of tumour
necrosis factor.
Length = 135
Score = 29.6 bits (67), Expect = 4.0
Identities = 12/63 (19%), Positives = 23/63 (36%), Gaps = 18/63 (28%)
Query: 430 KFDKVKHTRRQKFEN-----------MYDFVNDNIDATYKVHLKSSACQVYLIVSGWPKK 478
+FDKV + ++ ++ +Y F +Y V K +V L+ +G
Sbjct: 26 RFDKVLYNQQGHYDPRTGKFTCPVPGVYYF-------SYHVESKGRNVKVSLMKNGIQVM 78
Query: 479 KEN 481
Sbjct: 79 STY 81
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 30.5 bits (68), Expect = 4.4
Identities = 26/110 (23%), Positives = 41/110 (37%), Gaps = 7/110 (6%)
Query: 701 TKSNLSRRAQANNSAQNPLPEVSSSPNTRS-RVQSSPTTRSRVQSSPTTRSRVQSSPTTR 759
T S QA N+ Q P++ +PNT + +S + +S S+
Sbjct: 55 TAGANSATTQAANTRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLNISATPAP 114
Query: 760 SRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTR 809
+ QS T +S P T+ S N Q S +S +SP +
Sbjct: 115 KQEQSQTTTESTT---PKTKVTTPPSTNT---PQPMQSTKSDTPQSPTIK 158
>gnl|CDD|177475 PHA02693, PHA02693, hypothetical protein; Provisional.
Length = 710
Score = 30.8 bits (69), Expect = 4.7
Identities = 27/114 (23%), Positives = 47/114 (41%), Gaps = 14/114 (12%)
Query: 721 EVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTR- 779
+ S + ++ SR S+ +TRS + T + TT + +S ++ P TR
Sbjct: 271 DESDTADSCSRSFSTQSTRSTRSTRSTRSGA--ETDTTDPDLDPDDD-ESFDEVGPLTRR 327
Query: 780 ----------SRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQ 823
S SSA++R +GS I + S +V + T +AE+
Sbjct: 328 FTATSFAPRASVRSSSASMRLHARGSTRISEPLMSSAARVPKVSMAPTLDTAEE 381
>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal. Members
of this family are found in the N terminal region of
cytochrome B561, as well as in various other putative
uncharacterized proteins.
Length = 559
Score = 30.5 bits (69), Expect = 5.4
Identities = 34/209 (16%), Positives = 70/209 (33%), Gaps = 17/209 (8%)
Query: 569 PQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNSAQN 628
+S P S T ++ P HS + +P+ A+P+ + + S + +N
Sbjct: 117 KKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSASPSRKFSPSSTIQQSPQLTPSNK--- 173
Query: 629 PLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPI 688
+SP++ + S ++ V SS + R A S ++
Sbjct: 174 -----PASPSSSYQSPSYSSSLGPVNSSGN---------RSNLRSSPWALRSSGDKKDIT 219
Query: 689 SRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTT 748
+ ++ + L + P ++S ++ + S S
Sbjct: 220 TDEKYLETFLAEVDEEQHMITSSAGKNATPPETINSFGSSSPSFWNYSRNASDAARSLKK 279
Query: 749 RSRVQSSPTTRSRVQSSLTIQSAEQGSPN 777
RS S S+ ++S + + E PN
Sbjct: 280 RSYQLSPSPVPSKQKASTSPKKGEGEPPN 308
>gnl|CDD|236049 PRK07562, PRK07562, ribonucleotide-diphosphate reductase subunit
alpha; Validated.
Length = 1220
Score = 30.3 bits (69), Expect = 5.6
Identities = 10/71 (14%), Positives = 28/71 (39%)
Query: 641 SRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQS 700
+ V S+ TR R + + + + + +++ +++ + + ++
Sbjct: 1113 TNVVSTGLTRGRKPTLVVVTGGGAAAGAAVAAAPAASAATFSDKAAAAAAAAAAAAAAEA 1172
Query: 701 TKSNLSRRAQA 711
K RRA+A
Sbjct: 1173 KKVEAERRAEA 1183
>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
family).
Length = 619
Score = 30.2 bits (68), Expect = 5.7
Identities = 42/205 (20%), Positives = 69/205 (33%), Gaps = 7/205 (3%)
Query: 539 GERTLAALAMIFTIWKLVVRVQQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNP 598
G L +A I W+ R ++ R ++ S T P S +
Sbjct: 419 GVPLLERMAQIEAQWERH-RQEKQRAKARKKLANKTSTVMEPPYGSTESSVP-STPSTRR 476
Query: 599 NSRANPNPNPNSRANPNL---SRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQS 655
N R + P+ + PNL S +A+ +++ S R SR
Sbjct: 477 NDRNITSNTPSLKRTPNLTKSSLSQEASLISKSTGNTHKHSTPRRLTTLPKLPAASRSSK 536
Query: 656 SLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSA 715
I+S G+ ++ S N +S E P+ R +TK S + N
Sbjct: 537 GNLIRSGANGNASSDLSSPGSINSKSPEHSVPLVRVFDIHLRASTTKGRHSTPST-NEKK 595
Query: 716 QNPLPEVSSSPNTRSRVQSSPTTRS 740
+ L SP + V ++P S
Sbjct: 596 KRLLKRSPLSP-PKESVATTPRLNS 619
>gnl|CDD|237854 PRK14898, PRK14898, DNA-directed RNA polymerase subunit A'';
Provisional.
Length = 858
Score = 30.2 bits (68), Expect = 5.9
Identities = 14/44 (31%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Query: 29 EIHIPTVDSTSSVS---SEEIRRHESNLNLLRRKTQQRRTIQHT 69
EI+ ++D V + RH++N L++ KT+ RTI+ T
Sbjct: 123 EIYALSLDQDEKVHWKRIISVIRHKANGKLIKIKTESGRTIRAT 166
>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein. SOG2 proteins in
Saccharomyces cerevisiae are involved in cell separation
and cytokinesis.
Length = 419
Score = 30.1 bits (68), Expect = 6.0
Identities = 30/187 (16%), Positives = 55/187 (29%), Gaps = 13/187 (6%)
Query: 713 NSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAE 772
N+ + P + S S R R +PT R SS
Sbjct: 143 NAWSSLGPPLQHRKRDAVTASPSSMIARNTPISDRLRPR-SVTPTRGRRPSSS------- 194
Query: 773 QGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTV 832
S + + ++S +N++ P + RS S S+ + A S + +
Sbjct: 195 PRSLSNPTTLESPSNLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLATPRSGESFRST 254
Query: 833 QGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSPNI 892
P S +P + E ++ T +T+ R LT +
Sbjct: 255 ---PTSGSS-SINPVSGLDEAEEDRIDEQLFLKLRT-ATDMALRVLPQLTEQFSKSLIAS 309
Query: 893 RSTEQRA 899
++
Sbjct: 310 TTSRNIT 316
>gnl|CDD|223020 PHA03246, PHA03246, large tegument protein UL36; Provisional.
Length = 3095
Score = 30.3 bits (68), Expect = 6.2
Identities = 58/374 (15%), Positives = 115/374 (30%), Gaps = 45/374 (12%)
Query: 573 PESGTQGELSPPTPGPNPHSRVNP-NPNSRANPNPNPNSRANPNLSR------------- 618
P TQG+ + P N + A P+++ + +
Sbjct: 2645 PALATQGDQDGFVSKVGSAMKYKPPNVTAMAQSIVQPDTQKSSIKKKTDIRLAGYRTMPA 2704
Query: 619 --RAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSS 676
R + N P S + V SP +R ++ G Q S
Sbjct: 2705 DNRQKGNGLRHKPKTGRSHTTIKNVEVYPSPPRWTRTAAAQKRHGEILG--------QVS 2756
Query: 677 ANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTR------- 729
I + + +P++ R K+++ + + R S + S S +
Sbjct: 2757 PPIVTLDNATPVNGIRHEKHSIPYDRLDEHRTDPDALSTPPVIAAPSPSLIYKELSGGIG 2816
Query: 730 ------SRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQ 783
R Q + +RS+ + S + + + +RS + + S R
Sbjct: 2817 KNILKMGRKQIN--SRSKERMSSKASNSPRKNNISRSTADILVGMASTTSSHSPLLQRAD 2874
Query: 784 ----SSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAP-RI 838
+ NI E ++ N V++ L + + P ++ + R
Sbjct: 2875 DMYINIKNIPVPEVKEEGNWKGCKKRNNHVPEVETRLYDNESFPPPPNIIASWKQRVVRF 2934
Query: 839 HSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSPNIRSTEQR 898
+P ++ ++ T + +ATT + S R P+ E G +
Sbjct: 2935 WEILPHTPRGNAHRKKVSQTTHK-PPNATTAKHDHLSMYRGPVLTEAESGENKFNNVNVD 2993
Query: 899 ASRTRSADQASQSV 912
R RS ++ S
Sbjct: 2994 PIRKRSESESLSST 3007
>gnl|CDD|234058 TIGR02916, PEP_his_kin, putative PEP-CTERM system histidine kinase.
Members of this protein family have a novel N-terminal
domain, a single predicted membrane-spanning helix, and
a predicted cystosolic histidine kinase domain. We
designate this protein PrsK, and its companion
DNA-binding response regulator protein (TIGR02915) PrsR.
These predicted signal-transducing proteins appear to
enable enhancer-dependent transcriptional activation.
The prsK gene is often associated with exopolysaccharide
biosynthesis genes [Protein fate, Protein and peptide
secretion and trafficking, Signal transduction,
Two-component systems].
Length = 679
Score = 30.1 bits (68), Expect = 6.4
Identities = 29/100 (29%), Positives = 41/100 (41%), Gaps = 18/100 (18%)
Query: 616 LSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLT--IQSAEQGSPNT---- 669
L RRA A+ AQ P PEVS + R R R++ L +Q+A + +P
Sbjct: 548 LLRRAIASKRAQGPRPEVSIDTDLSVRAD-----RERLERVLGHLVQNALEATPGEGRVA 602
Query: 670 --RSRVQSSANIRSAEQGSPIS----RTRFTKYTLQSTKS 703
R +A I + G +S R R K +TK
Sbjct: 603 IRVERECGAARIEIEDSGCGMSPAFIRERLFK-PFDTTKG 641
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 30.1 bits (67), Expect = 7.3
Identities = 18/97 (18%), Positives = 34/97 (35%), Gaps = 5/97 (5%)
Query: 567 PPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNSA 626
PE T P P P PH + P+ P + + + AQA A
Sbjct: 23 NTNIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDL-----AQAPTPA 77
Query: 627 QNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAE 663
+ + + +P+ + P ++ ++ +AE
Sbjct: 78 ASEKFDPAPAPHQAASRAPDPAVAPQLAAAPKPDAAE 114
>gnl|CDD|233376 TIGR01363, strep_his_triad, streptococcal histidine triad protein.
This model represents the N-terminal half of a family of
Streptococcal proteins that contain a signal peptide and
then up to five repeats of a region that includes a
His-X-X-His-X-His (histidine triad) motif. Three repeats
are found in the seed alignment. Members of this family
from Streptococcus pneumoniae are suggested to cleave
human C3, and the member PhpA has been shown in vaccine
studies to be a protective antigen in mice [Cellular
processes, Pathogenesis].
Length = 348
Score = 29.5 bits (66), Expect = 7.9
Identities = 8/48 (16%), Positives = 15/48 (31%), Gaps = 2/48 (4%)
Query: 264 QGELSPPTPGPNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPLFSK 311
+ SR P+ NP + N+++QN +
Sbjct: 235 KQGRGARPSDYRQGSR--KASIPDVTPNPGQPAPPRPNDASQNKTQRQ 280
Score = 29.5 bits (66), Expect = 8.6
Identities = 9/44 (20%), Positives = 14/44 (31%), Gaps = 3/44 (6%)
Query: 270 PTPGPNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPLFSKGE 313
G S + PNP P R A+ + +G+
Sbjct: 245 YRQGSRKASIPDVTPNP---GQPAPPRPNDASQNKTQRQEFQGK 285
>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
Length = 327
Score = 29.7 bits (67), Expect = 8.3
Identities = 33/204 (16%), Positives = 70/204 (34%), Gaps = 17/204 (8%)
Query: 596 PNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQ- 654
P + + N + N +LS + ++ Q+ ++ + S + Q
Sbjct: 61 PTEHESQQSSNNAGAEKNIDLSGSSSLSSGNQSSPSA--ANNTSDGHDASGVKNTAPPQD 118
Query: 655 ------SSLTIQSAEQGSPNTRSRVQSSANIRSA--EQGSPISRTRFTKYTLQSTKSNLS 706
S Q+A +PN + R++ NI A +Q ++ ++ +T + +
Sbjct: 119 ISAPPISPTPTQAAPPQTPNGQQRIELPGNISDALSQQQGQVNA--ASQNAQGNTSTLPT 176
Query: 707 RRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSL 766
A S +P + + T + + + V P T + +S
Sbjct: 177 APATVAPSKGAKVPATAETHPT----PPQKPATKKPAVNHHKTATVAVPPATSGKPKSGA 232
Query: 767 TIQSAEQGSPNTRSRVQSSANIRS 790
A +P + +Q S+ RS
Sbjct: 233 ASARALSSAPASHYTLQLSSASRS 256
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 29.6 bits (66), Expect = 9.7
Identities = 66/341 (19%), Positives = 126/341 (36%), Gaps = 49/341 (14%)
Query: 558 RVQQLRPLTPPPQSDPE---SGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANP 614
R ++ P TP P + E +P TP + ++ + R + P R
Sbjct: 117 RTAKITPTTPKNNYSPTAAGTERVKEDTPATPSRALNHYISTSGRQRVK-SYTPKPRGEV 175
Query: 615 NLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTI--QSAEQGSP--NTR 670
S Q + P SP R +P+T + S I ++ + S T
Sbjct: 176 KSSSPTQTREKVRKYTP----SPLGRMVNSYAPSTFMTMPRSHGITPRTTVKDSEITATY 231
Query: 671 SRVQSSANIRSAEQGSPISRTRFTKYTLQ-----------------STKSNLS--RRAQA 711
++++ + R+A + +P T T K+ L+ RR ++
Sbjct: 232 KMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVES 291
Query: 712 NNSAQNPLPEVSSSPNT--RSRVQSSP-TTRSRVQSSPTTRSRVQ----SSPTTRSRVQS 764
N+S + ++ T + ++ +P T+ +V S T S S+ + R
Sbjct: 292 NSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPL 351
Query: 765 SLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRSA----------VQRSPNTRSRVQS 814
S T A + + T ++ + + + +P +R+ V+ +P +
Sbjct: 352 SRTSAPAVRIASATFRGLEKNPSTAPSTPATPRVRAVLTTQVHHCVVVKPAPAVPTTPSP 411
Query: 815 SLTTHS-AEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQR 854
SLTT E SP+ + G P +H + + P+ S E+R
Sbjct: 412 SLTTALFPEAPSPSPSALPPGQPDLHPKAEYPPDLFSVEER 452
>gnl|CDD|227367 COG5034, TNG2, Chromatin remodeling protein, contains PhD zinc
finger [Chromatin structure and dynamics].
Length = 271
Score = 29.1 bits (65), Expect = 10.0
Identities = 18/106 (16%), Positives = 33/106 (31%), Gaps = 11/106 (10%)
Query: 809 RSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTRS------RV 862
R + L A++ + ++ SR + + + +A + S RS
Sbjct: 96 LRRHRKLLDDRIAKRPHEKVAARIENCHDAVSRLERNSYSSAARRSSGEHRSAASSQGSR 155
Query: 863 QSSATTRSTEQRSRSRSPLT-----RSTEQGSPNIRSTEQRASRTR 903
+ R + RSP S SP++ T R
Sbjct: 156 HTKLKKRKNIHNLKRRSPELSSKREVSFTLESPSVPDTATRVKEGN 201
>gnl|CDD|227436 COG5105, MIH1, Mitotic inducer, protein phosphatase [Cell division
and chromosome partitioning].
Length = 427
Score = 29.2 bits (65), Expect = 10.0
Identities = 22/211 (10%), Positives = 58/211 (27%), Gaps = 14/211 (6%)
Query: 712 NNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSA 771
+ + L + ++ + ++S +S + + +++ +L
Sbjct: 1 MKNIFHGLEDECANEDVFFFQKAS--KKSIFGDKKNIFRNIATFFKPKAKH--ALADDDL 56
Query: 772 EQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRST 831
+ +N RS E P N + +G T
Sbjct: 57 INKENFAFDKRPLLSNHRSKEIAGPF--------LNIKQLGHRDELDEKENEGDDATLHL 108
Query: 832 VQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSPN 891
R+ S ++ + + + + S+ + P S N
Sbjct: 109 HFALQRMTSSSANASSDNEQCPADVDQMYIKKFYEIPWSSSENIEFEDPGHDPFVDNSDN 168
Query: 892 IRSTEQRAS--RTRSADQASQSVVTRSRSKL 920
+ R S + + ++ + +V T +
Sbjct: 169 SKMNHLRGSGKQPKCREKIAFAVWTSLQGMR 199
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.309 0.122 0.338
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 43,769,469
Number of extensions: 4116647
Number of successful extensions: 5523
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4820
Number of HSP's successfully gapped: 418
Length of query: 926
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 820
Effective length of database: 6,236,078
Effective search space: 5113583960
Effective search space used: 5113583960
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 64 (28.6 bits)