RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12968
(402 letters)
>gnl|CDD|239088 cd02395, SF1_like-KH, Splicing factor 1 (SF1) K homology
RNA-binding domain (KH). Splicing factor 1 (SF1)
specifically recognizes the intron branch point sequence
(BPS) UACUAAC in the pre-mRNA transcripts during
spliceosome assembly. We show that the KH-QUA2 region of
SF1 defines an enlarged KH (hnRNP K) fold which is
necessary and sufficient for BPS binding. KH binds
single-stranded RNA or DNA. It is found in a wide
variety of proteins including ribosomal proteins,
transcription factors and post-transcriptional modifiers
of mRNA.
Length = 120
Score = 162 bits (411), Expect = 4e-49
Identities = 61/113 (53%), Positives = 83/113 (73%), Gaps = 3/113 (2%)
Query: 74 KVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGKGSMRDRSKEEELRKEGDP 133
KV +PVK++P++NFVG +LGP+GN+L++L++ET K+SI GKGSM+D KEEELR P
Sbjct: 3 KVYIPVKQYPKYNFVGLILGPRGNTLKQLEKETGAKISIRGKGSMKDGKKEEELRG---P 59
Query: 134 KNAHLVEDLHVQVSALAPPAEAHARIAYALSELRKYIIPDVNDEIRQEQFREM 186
K AHL E LHV ++A PP EA A+ A+ EL K I NDE+++EQ RE+
Sbjct: 60 KYAHLNEPLHVLITAETPPEEALAKAVEAIEELLKPAIEGGNDELKREQLREL 112
>gnl|CDD|227503 COG5176, MSL5, Splicing factor (branch point binding protein) [RNA
processing and modification].
Length = 269
Score = 55.0 bits (132), Expect = 2e-08
Identities = 25/75 (33%), Positives = 45/75 (60%), Gaps = 4/75 (5%)
Query: 74 KVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGKGSMRDRSKEEELRKEGDP 133
K+ +PV+E+P NFVG L+GP+G++L++L+ + K++I G GS+ KE ++ +
Sbjct: 151 KIYIPVQEYPESNFVGLLIGPRGSTLKQLERISRAKIAIRGSGSV----KEGKISSDTPE 206
Query: 134 KNAHLVEDLHVQVSA 148
+ LH + A
Sbjct: 207 SLKNAEAVLHCLIEA 221
>gnl|CDD|197652 smart00322, KH, K homology RNA-binding domain.
Length = 68
Score = 47.7 bits (114), Expect = 2e-07
Identities = 21/72 (29%), Positives = 39/72 (54%), Gaps = 11/72 (15%)
Query: 71 VKVKVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGKGSMRDRSKEEELRKE 130
V ++VL+P + VG ++G G+++++++EET K+ I G GS + E+
Sbjct: 3 VTIEVLIPA------DKVGLIIGKGGSTIKKIEEETGVKIDIPGPGS---EERVVEIT-- 51
Query: 131 GDPKNAHLVEDL 142
G P+N +L
Sbjct: 52 GPPENVEKAAEL 63
>gnl|CDD|238053 cd00105, KH-I, K homology RNA-binding domain, type I. KH binds
single-stranded RNA or DNA. It is found in a wide
variety of proteins including ribosomal proteins,
transcription factors and post-transcriptional modifiers
of mRNA. There are two different KH domains that belong
to different protein folds, but they share a single KH
motif. The KH motif is folded into a beta alpha alpha
beta unit. In addition to the core, type II KH domains
(e.g. ribosomal protein S3) include N-terminal extension
and type I KH domains (e.g. hnRNP K) contain C-terminal
extension.
Length = 64
Score = 45.6 bits (109), Expect = 8e-07
Identities = 19/70 (27%), Positives = 35/70 (50%), Gaps = 9/70 (12%)
Query: 73 VKVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGKGSMRDRSKEEELRKEGD 132
+VLVP VG+++G G++++ ++EET K+ I G S+E + G
Sbjct: 2 ERVLVPSS------LVGRIIGKGGSTIKEIREETGAKIKIPDSG---SGSEERIVTITGT 52
Query: 133 PKNAHLVEDL 142
P+ ++L
Sbjct: 53 PEAVEKAKEL 62
>gnl|CDD|215657 pfam00013, KH_1, KH domain. KH motifs bind RNA in vitro.
Autoantibodies to Nova, a KH domain protein, cause
paraneoplastic opsoclonus ataxia.
Length = 59
Score = 36.0 bits (84), Expect = 0.002
Identities = 11/48 (22%), Positives = 24/48 (50%), Gaps = 6/48 (12%)
Query: 72 KVKVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGKGSMR 119
++L+P VG+++G G++++ ++EET K+ I
Sbjct: 1 TERILIPPD------KVGRIIGKGGSNIKEIREETGVKIRIPDDRDDT 42
>gnl|CDD|221895 pfam13014, KH_3, KH domain. KH motifs bind RNA in vitro. This
RNA-binding domain is required for the efficient
anchoring of ASH1-mRNA to the distal tip of the daughter
cell. ASH1 is a specific repressor of transcription that
localizes asymmetrically to the daughter cell nucleus.
RNA localisation is a widespread mechanism for achieving
localised protein synthesis.
Length = 42
Score = 35.6 bits (83), Expect = 0.002
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 87 FVGKLLGPKGNSLRRLQEETMTKMSI 112
VG ++G G +++ ++EET K+ I
Sbjct: 1 LVGAIIGKGGETIKEIREETGAKIQI 26
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 40.2 bits (94), Expect = 0.002
Identities = 43/178 (24%), Positives = 56/178 (31%), Gaps = 16/178 (8%)
Query: 145 QVSALAPPAEAHARIAYALSELRKYIIPDVNDEIRQEQFREMGYVHHEPRPQMRHPRPPP 204
+ A PA A A A A + R+ P+ RQ R G P P P
Sbjct: 403 PAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPG----------GAPAPAP 452
Query: 205 HMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEKPAPSA 264
AA PA A P P P A+ + D E+ P
Sbjct: 453 APAAA-PAAA--ARPAAAGPRPV---AAAAAAAPARAAPAAAPAPADDDPPPWEELPPEF 506
Query: 265 GVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPSTSSR 322
P AA A + + DPA DP + +A+ AA A P + R
Sbjct: 507 ASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPR 564
Score = 32.2 bits (73), Expect = 0.53
Identities = 38/180 (21%), Positives = 54/180 (30%), Gaps = 13/180 (7%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
P P P AA PA AP A P P A + + S
Sbjct: 385 PAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP 444
Query: 253 ARAAMEKPAPSAGVYEYEDPHAAHAYYET-TASYDPAMYDPYASTAGDYYDT-------- 303
A PAP+A P AA A+ PA P A+ A D
Sbjct: 445 GGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPP 504
Query: 304 ---SASYYETAAPAPLPSTSSRTGGSGWKSSSGYAEHHSSSRSKSSAPIRSSSSEKVVNA 360
S + + AP + E + + + + AP ++++E VV
Sbjct: 505 EFASPAPAQP-DAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
Score = 29.1 bits (65), Expect = 5.3
Identities = 29/125 (23%), Positives = 33/125 (26%), Gaps = 5/125 (4%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
P P P AA PA AP A P A+ S
Sbjct: 374 PATAAAAPVAQPAPAAAAPAA--AAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPE 431
Query: 253 ARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAA 312
A AA + + A A A+ PA P A A AA
Sbjct: 432 ALAAARQASARGPGGAPAPAPAPAA--APAAAARPAAAGPRPVAAAA-AAAPARAAPAAA 488
Query: 313 PAPLP 317
PAP
Sbjct: 489 PAPAD 493
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 39.7 bits (93), Expect = 0.002
Identities = 23/73 (31%), Positives = 25/73 (34%), Gaps = 10/73 (13%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
P P P A P P P A P RP P PPAA+
Sbjct: 385 PSAAWGAPTPAA---PAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAA-------PP 434
Query: 253 ARAAMEKPAPSAG 265
AR+A A SAG
Sbjct: 435 ARSADPAAAASAG 447
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 39.5 bits (92), Expect = 0.003
Identities = 36/134 (26%), Positives = 45/134 (33%), Gaps = 4/134 (2%)
Query: 192 EPRPQMRHPRPPPH-MVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSIL 250
+P P P P PH +V+A P P A R A P PP P PA G
Sbjct: 2700 DPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPAR 2759
Query: 251 DRARAAMEKPAPSAGVYEYEDPHA---AHAYYETTASYDPAMYDPYASTAGDYYDTSASY 307
A PAP A A A + P+ +DP A +A
Sbjct: 2760 PPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP 2819
Query: 308 YETAAPAPLPSTSS 321
+ PLP +S
Sbjct: 2820 PAASPAGPLPPPTS 2833
Score = 34.1 bits (78), Expect = 0.17
Identities = 17/48 (35%), Positives = 19/48 (39%), Gaps = 4/48 (8%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPA 240
P Q P P +P P PP+ Q PP PP P PP A
Sbjct: 2901 PPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP----PPRPQPPLA 2944
Score = 33.8 bits (77), Expect = 0.22
Identities = 21/71 (29%), Positives = 23/71 (32%), Gaps = 3/71 (4%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
P AA PA L P Q APP PP PPPP G V D
Sbjct: 2806 DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP---PPPGPPPPSLPLGGSVAPGGDV 2862
Query: 253 ARAAMEKPAPS 263
R + +
Sbjct: 2863 RRRPPSRSPAA 2873
Score = 32.2 bits (73), Expect = 0.57
Identities = 34/132 (25%), Positives = 40/132 (30%), Gaps = 12/132 (9%)
Query: 194 RPQMRHPRPPPHMVA--ARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGK--VLSI 249
RP+ R RP + A P P PP AP +V PP PAA+R L
Sbjct: 2682 RPRRRAARPTVGSLTSLADPPP----PPPTPEPAPHALVSATPLPPGPAAARQASPALPA 2737
Query: 250 LDRARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYE 309
A PA G P PA P T AS E
Sbjct: 2738 APAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP----AVASLSE 2793
Query: 310 TAAPAPLPSTSS 321
+ P P +
Sbjct: 2794 SRESLPSPWDPA 2805
Score = 31.4 bits (71), Expect = 0.98
Identities = 32/135 (23%), Positives = 36/135 (26%), Gaps = 15/135 (11%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
P P H PP P+P P P RP P P SR + L R
Sbjct: 2616 PLPPDTHAPDPP---PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGR 2672
Query: 253 ARAAMEKP---------APSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDT 303
A A P + DP T A+ G
Sbjct: 2673 AAQASSPPQRPRRRAARPTVGSLTSLADPPPPP---PTPEPAPHALVSATPLPPGPAAAR 2729
Query: 304 SASYYETAAPAPLPS 318
AS AAPAP
Sbjct: 2730 QASPALPAAPAPPAV 2744
Score = 31.1 bits (70), Expect = 1.3
Identities = 31/149 (20%), Positives = 40/149 (26%), Gaps = 12/149 (8%)
Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEK 259
P PP G P R A P PP P R ++ + +
Sbjct: 2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL 2798
Query: 260 PAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPST 319
P+P DP A A+ P P S TA P P
Sbjct: 2799 PSPW-------DPADPPAAVLAPAAALPPAASPAGPLP-----PPTSAQPTAPPPPPGPP 2846
Query: 320 SSRTGGSGWKSSSGYAEHHSSSRSKSSAP 348
G + G SRS ++ P
Sbjct: 2847 PPSLPLGGSVAPGGDVRRRPPSRSPAAKP 2875
Score = 31.1 bits (70), Expect = 1.5
Identities = 18/53 (33%), Positives = 20/53 (37%), Gaps = 4/53 (7%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAA----PPHMVRPPHPPPPPAA 241
P PPP AA P P + AA P + PP PPPP A
Sbjct: 269 PETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAPA 321
Score = 30.7 bits (69), Expect = 1.8
Identities = 19/88 (21%), Positives = 22/88 (25%)
Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEK 259
P P P A P P P P P P +S RAR
Sbjct: 2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRA 2673
Query: 260 PAPSAGVYEYEDPHAAHAYYETTASYDP 287
S+ A T+ DP
Sbjct: 2674 AQASSPPQRPRRRAARPTVGSLTSLADP 2701
Score = 29.5 bits (66), Expect = 4.7
Identities = 25/91 (27%), Positives = 33/91 (36%), Gaps = 10/91 (10%)
Query: 181 EQFREMGYVHHEPRPQMRHP-----RPPPHMVAARPAPGLLAPPRMQHAAP----PHMVR 231
E+ RE R + RP P R AP +LA R P +
Sbjct: 998 EEARERAEAARAARERFFQKLQGVLRPLPDFGGLRAAPAVLATLRADLPGGWTDLPDAAQ 1057
Query: 232 PPHPPPPPAASRGKVLSILDRARAAMEKPAP 262
PP AA R + +L + R A+E P P
Sbjct: 1058 AA-PPEVRAALRADLWGLLGQYREALEHPTP 1087
Score = 28.8 bits (64), Expect = 7.2
Identities = 11/50 (22%), Positives = 14/50 (28%)
Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAA 241
+P+ P +P P PPR Q P P A
Sbjct: 2910 QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAV 2959
>gnl|CDD|239089 cd02396, PCBP_like_KH, K homology RNA-binding domain, PCBP_like.
Members of this group possess KH domains in a tandem
arrangement. Most members, similar to the poly(C)
binding proteins (PCBPs) and Nova, containing three KH
domains, with the first and second domains, which are
represented here, in tandem arrangement, followed by a
large spacer region, with the third domain near the
C-terminal end of the protein. The poly(C) binding
proteins (PCBPs) can be divided into two groups, hnRNPs
K/J and the alphaCPs, which share a triple KH domain
configuration and poly(C) binding specificity. They
play roles in mRNA stabilization, translational
activation, and translational silencing. Nova-1 and
Nova-2 are nuclear RNA-binding proteins that regulate
splicing. This group also contains plant proteins that
seem to have two tandem repeat arrrangements, like Hen4,
a protein that plays a role in AGAMOUS (AG) pre-mRNA
processing and important step in plant development. In
general, KH binds single-stranded RNA or DNA. It is
found in a wide variety of proteins including ribosomal
proteins, transcription factors and post-transcriptional
modifiers of mRNA.
Length = 65
Score = 35.1 bits (82), Expect = 0.005
Identities = 10/43 (23%), Positives = 24/43 (55%), Gaps = 6/43 (13%)
Query: 73 VKVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGK 115
+++LVP + G ++G G++++ ++EET K+ +
Sbjct: 2 LRLLVPSSQ------AGSIIGKGGSTIKEIREETGAKIRVSKS 38
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 37.8 bits (88), Expect = 0.009
Identities = 33/136 (24%), Positives = 41/136 (30%), Gaps = 14/136 (10%)
Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPP---RMQHAAPPHMVRPPHPPPPPAASRGKVLS 248
+ P AA PA +A AAP P PP A V +
Sbjct: 370 AEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPP-APVAA 428
Query: 249 ILDRARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYY 308
A AA AP+A P A ET A +P ++A
Sbjct: 429 PAAAAPAAAPAAAPAAVALAPAPPAQAAP--ETVAIPVRVAPEPAVASAAP--------A 478
Query: 309 ETAAPAPLPSTSSRTG 324
AAPA T + G
Sbjct: 479 PAAAPAAARLTPTEEG 494
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 37.8 bits (88), Expect = 0.010
Identities = 12/49 (24%), Positives = 18/49 (36%), Gaps = 1/49 (2%)
Query: 191 HEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPP 239
+ P+PP P P PP+ Q P + + + P PP
Sbjct: 251 LSQQMPPPPPQPPQQQ-QQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPP 298
Score = 34.7 bits (80), Expect = 0.092
Identities = 11/64 (17%), Positives = 13/64 (20%), Gaps = 4/64 (6%)
Query: 180 QEQFREMGYVHHEPRPQMRHPR----PPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHP 235
Q + P Q P+ P P Q P P
Sbjct: 172 LPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPP 231
Query: 236 PPPP 239
PP
Sbjct: 232 LPPQ 235
Score = 30.9 bits (70), Expect = 1.3
Identities = 17/63 (26%), Positives = 22/63 (34%), Gaps = 4/63 (6%)
Query: 179 RQEQFREMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPP 238
RQ F + G +P +P+PP L P A PP + P PPP
Sbjct: 187 RQAAFPQQGP---PEQPPG-YPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPP 242
Query: 239 PAA 241
Sbjct: 243 LQQ 245
Score = 29.7 bits (67), Expect = 3.1
Identities = 15/62 (24%), Positives = 18/62 (29%), Gaps = 2/62 (3%)
Query: 180 QEQFREMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPP 239
Q V P+ + P P P PP Q P + P PPP P
Sbjct: 205 QPPQGHPEQVQ--PQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQP 262
Query: 240 AA 241
Sbjct: 263 PQ 264
Score = 29.4 bits (66), Expect = 4.0
Identities = 12/52 (23%), Positives = 14/52 (26%), Gaps = 4/52 (7%)
Query: 191 HEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAAS 242
P + PPP P P A PP +P P P
Sbjct: 245 QPQFPGLSQQMPPPPP----QPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQ 292
Score = 29.0 bits (65), Expect = 5.6
Identities = 14/70 (20%), Positives = 18/70 (25%), Gaps = 5/70 (7%)
Query: 194 RPQMRHPRPPPHMVAARP-APGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
R Q PP V + P A P+ P P P P +
Sbjct: 166 RQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGY-PQPPQGHPEQVQP---QQFLP 221
Query: 253 ARAAMEKPAP 262
A + P
Sbjct: 222 APSQAPAQPP 231
Score = 28.2 bits (63), Expect = 8.7
Identities = 13/49 (26%), Positives = 18/49 (36%), Gaps = 5/49 (10%)
Query: 195 PQMRHPRPPPH-----MVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPP 238
P ++ P+ P +P PP+ Q PP PHP P
Sbjct: 241 PPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLP 289
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 35.7 bits (83), Expect = 0.040
Identities = 17/73 (23%), Positives = 18/73 (24%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
P P A AP A P A P P P PP A
Sbjct: 397 AAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSP 456
Query: 253 ARAAMEKPAPSAG 265
AA P+
Sbjct: 457 PPAAAPSAQPAPA 469
Score = 31.9 bits (73), Expect = 0.72
Identities = 21/81 (25%), Positives = 24/81 (29%), Gaps = 1/81 (1%)
Query: 184 REMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASR 243
R +G P P AA PAP AP AP P P P PA +
Sbjct: 383 RRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAA-APQPAPAPAPAPAP 441
Query: 244 GKVLSILDRARAAMEKPAPSA 264
A PA +
Sbjct: 442 PSPAGNAPAGGAPSPPPAAAP 462
Score = 31.5 bits (72), Expect = 1.0
Identities = 23/132 (17%), Positives = 32/132 (24%), Gaps = 2/132 (1%)
Query: 207 VAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAME--KPAPSA 264
PA + AAP P P P AA+ S A P A
Sbjct: 599 GPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVA 658
Query: 265 GVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPSTSSRTG 324
+ A A P A+ A A A P +
Sbjct: 659 VPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPA 718
Query: 325 GSGWKSSSGYAE 336
+++ G +
Sbjct: 719 AQPPQAAQGASA 730
Score = 30.3 bits (69), Expect = 1.9
Identities = 17/75 (22%), Positives = 20/75 (26%), Gaps = 6/75 (8%)
Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILD 251
P P + PA A P AA P P P PP A +
Sbjct: 439 PAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPA------A 492
Query: 252 RARAAMEKPAPSAGV 266
A AP+
Sbjct: 493 APAAPAAPAAPAGAD 507
Score = 30.3 bits (69), Expect = 2.1
Identities = 18/77 (23%), Positives = 21/77 (27%), Gaps = 6/77 (7%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
P P A A Q A P P PP PA + +
Sbjct: 401 AAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPA---PAPAPPSPAGNAPAGGAPSPP 457
Query: 253 ARAAMEK---PAPSAGV 266
AA PAP+A
Sbjct: 458 PAAAPSAQPAPAPAAAP 474
Score = 30.0 bits (68), Expect = 3.1
Identities = 18/116 (15%), Positives = 28/116 (24%)
Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEK 259
P AA AP P A P P P G+ + A +
Sbjct: 668 GWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQG 727
Query: 260 PAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAP 315
+ + + P + PA P + A +A +
Sbjct: 728 ASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783
Score = 29.6 bits (67), Expect = 3.3
Identities = 29/156 (18%), Positives = 35/156 (22%)
Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEK 259
P P AA PA AP A H P + A
Sbjct: 625 AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPA 684
Query: 260 PAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPST 319
PAP+A A A DP A SA P PLP
Sbjct: 685 PAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPE 744
Query: 320 SSRTGGSGWKSSSGYAEHHSSSRSKSSAPIRSSSSE 355
+ + + +A S
Sbjct: 745 PDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
Score = 29.6 bits (67), Expect = 4.1
Identities = 27/133 (20%), Positives = 36/133 (27%)
Query: 190 HHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSI 249
P H+ + G P A P P P PAA G +
Sbjct: 641 AAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ 700
Query: 250 LDRARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYE 309
A AA + A A + A+ DP P D A
Sbjct: 701 PAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPP 760
Query: 310 TAAPAPLPSTSSR 322
APAP + ++
Sbjct: 761 PPAPAPAAAPAAA 773
Score = 29.2 bits (66), Expect = 4.5
Identities = 25/125 (20%), Positives = 35/125 (28%), Gaps = 18/125 (14%)
Query: 198 RHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAM 257
R R + A A A AA P P P AA+ A AA
Sbjct: 377 RLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAA--------PAPAAA 428
Query: 258 EKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLP 317
+PAP P A A + + + + + AAP P
Sbjct: 429 PQPAP--------APAPAPA--PPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTA 478
Query: 318 STSSR 322
+ +
Sbjct: 479 APAPA 483
Score = 28.8 bits (65), Expect = 6.4
Identities = 21/90 (23%), Positives = 22/90 (24%), Gaps = 11/90 (12%)
Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRP---PHPPPPPAASRGKVLS 248
P P P A P P +PP P P P P A
Sbjct: 422 APAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPT---- 477
Query: 249 ILDRARAAMEKPAPSAGVYEYEDPHAAHAY 278
A A PA A P A A
Sbjct: 478 ----AAPAPAPPAAPAPAAAPAAPAAPAAP 503
Score = 28.4 bits (64), Expect = 8.3
Identities = 28/135 (20%), Positives = 36/135 (26%), Gaps = 5/135 (3%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
R + R A AP A AP P PAA+
Sbjct: 379 ERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPA-----P 433
Query: 253 ARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAA 312
A A P AG + +A PA TA A+ AA
Sbjct: 434 APAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAA 493
Query: 313 PAPLPSTSSRTGGSG 327
PA + ++ G
Sbjct: 494 PAAPAAPAAPAGADD 508
Score = 28.4 bits (64), Expect = 9.5
Identities = 15/74 (20%), Positives = 18/74 (24%), Gaps = 7/74 (9%)
Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILD 251
P P AA P AP + P A S
Sbjct: 405 APAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSP-------P 457
Query: 252 RARAAMEKPAPSAG 265
A A +PAP+
Sbjct: 458 PAAAPSAQPAPAPA 471
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 35.6 bits (82), Expect = 0.045
Identities = 21/75 (28%), Positives = 26/75 (34%), Gaps = 7/75 (9%)
Query: 194 RPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAP----PHMVRPPHPPPPPAASRGKVLSI 249
RP P P AA P P + A P P V PP P P +A + +I
Sbjct: 380 RPT-PAPSTRPKAAAAANIPP--KEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAI 436
Query: 250 LDRARAAMEKPAPSA 264
+ PAP
Sbjct: 437 PVDEKPKYTPPAPPK 451
Score = 34.0 bits (78), Expect = 0.13
Identities = 18/57 (31%), Positives = 24/57 (42%), Gaps = 1/57 (1%)
Query: 192 EPRPQMRHPRPPPHMVAARPAPGL-LAPPRMQHAAPPHMVRPPHPPPPPAASRGKVL 247
EP + P P P A P P + P++ AA P +P + PP P K L
Sbjct: 401 EPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKEEEKAL 457
Score = 30.2 bits (68), Expect = 2.5
Identities = 13/73 (17%), Positives = 18/73 (24%), Gaps = 6/73 (8%)
Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILD 251
PP A PPR APP PH P + + +
Sbjct: 387 TRPKAAAAANIPPKEPVRETATPPPVPPRP--VAPP----VPHTPESAPKLTRAAIPVDE 440
Query: 252 RARAAMEKPAPSA 264
+ + P
Sbjct: 441 KPKYTPPAPPKEE 453
Score = 28.2 bits (63), Expect = 8.1
Identities = 18/73 (24%), Positives = 21/73 (28%), Gaps = 2/73 (2%)
Query: 193 PRPQMRHPRPP-PHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILD 251
P PQ P P V PAP PP PPP R V +
Sbjct: 364 PAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR-PVAPPVP 422
Query: 252 RARAAMEKPAPSA 264
+ K +A
Sbjct: 423 HTPESAPKLTRAA 435
>gnl|CDD|145536 pfam02444, HEV_ORF1, Hepatitis E virus ORF-2 (Putative capsid
protein). The Hepatitis E virus (HEV) genome is a
single-stranded, positive-sense RNA molecule of
approximately 7.5 kb. Three open reading frames (ORF)
were identified within the HEV genome: ORF1 encodes
non-structural proteins, ORF2 encodes the putative
structural protein(s), and ORF3 encodes a protein of
unknown function. ORF2 contains a consensus signal
peptide sequence at its amino terminus and a capsid-like
region with a high content of basic amino acids similar
to that seen with other virus capsid proteins.
Length = 114
Score = 33.5 bits (76), Expect = 0.052
Identities = 18/49 (36%), Positives = 19/49 (38%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAA 241
P P P P M RP L R H AP + RP PP PP
Sbjct: 57 QSPIFIQPTPSPPMSPLRPGLDLAFANRPAHLAPLGVTRPSAPPLPPVV 105
>gnl|CDD|226193 COG3667, PcoB, Uncharacterized protein involved in copper
resistance [Inorganic ion transport and metabolism].
Length = 321
Score = 34.9 bits (80), Expect = 0.069
Identities = 24/74 (32%), Positives = 27/74 (36%), Gaps = 3/74 (4%)
Query: 190 HHEP--RPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVL 247
H P P P H + P P + AP M H A PHM P P P A+
Sbjct: 36 EHAPMDAPHPAMPGMDHHAHSKMPGPEMAAPQ-MDHGAMPHMDHAPPPIPTQHAAERSRS 94
Query: 248 SILDRARAAMEKPA 261
AR A PA
Sbjct: 95 PASAAARVAAFPPA 108
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 34.8 bits (80), Expect = 0.077
Identities = 31/142 (21%), Positives = 42/142 (29%), Gaps = 19/142 (13%)
Query: 128 RKEGDPKNAHLVEDLHVQVSALAPPAEAHARIAYALSELRKYIIPDVNDEIRQEQFREMG 187
R P ++ + V VS P E R+ YA P + + + Q G
Sbjct: 850 RISTSPSSSTFPQLSRVLVSVREPRTEGGVRVGYA---------PTLAEALDQVFGPGTG 900
Query: 188 YVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVL 247
V P P A PAP Q PP +PP PP +
Sbjct: 901 RVATAPGGD---AASAPPPGAGPPAP-------PQAVPPPRTTQPPAAPPRGPDVPPAAV 950
Query: 248 SILDRARAAMEKPAPSAGVYEY 269
+ L A + S Y
Sbjct: 951 AELRETLADLRSAQRSGDFTAY 972
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 34.8 bits (80), Expect = 0.088
Identities = 40/221 (18%), Positives = 54/221 (24%), Gaps = 31/221 (14%)
Query: 190 HHEPRPQMRHPRPPPHMVAARPAPGLLAPPR---MQHAAPPHMVRPPHPPPPPAASRGKV 246
P P PP +P P M P PP AS V
Sbjct: 103 EGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAV 162
Query: 247 LSILDRARAAME-----------KPAPSAGVYEYEDPHAA----HAYYETTASY----DP 287
S +R A +P A P AA ++ P
Sbjct: 163 ASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAP 222
Query: 288 AMYDPYASTAGDYYDTSASYY--------ETAAPAPLPSTSSRTGGSGWKSSSGYAEHHS 339
A A AG S+S E P P P+ W++S
Sbjct: 223 APGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAP-ITLPTRIWEASGWNGPSSR 281
Query: 340 SSRSKSSAPIRSSSSEKVVNASLVGLQGAQSTEYGYEDRKR 380
+ SS+ R S ++ G + R
Sbjct: 282 PGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSR 322
Score = 32.8 bits (75), Expect = 0.40
Identities = 34/169 (20%), Positives = 44/169 (26%), Gaps = 29/169 (17%)
Query: 191 HEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSIL 250
E P + PA P R PP PPPP
Sbjct: 77 TEAPANESRSTPTWSLSTLAPAS----PAREGSPTPPG--PSSPDPPPPTPPPASPPPSP 130
Query: 251 DRARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYET 310
+ M +P S G P AA A AS +AS +
Sbjct: 131 APDLSEMLRPVGSPGPPPAASPPAAGASPAAVAS------------------DAASSRQA 172
Query: 311 AAPAPLPSTSSRTGGS-----GWKSSSGYAEHHSSSRSKSSAPIRSSSS 354
A P P ++R S + A RS + SS +
Sbjct: 173 ALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPA 221
Score = 31.7 bits (72), Expect = 0.78
Identities = 36/159 (22%), Positives = 49/159 (30%), Gaps = 9/159 (5%)
Query: 203 PPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEKPAP 262
P A +P PP AA P P ++ + AA + A
Sbjct: 179 PEETARAPSSPPAEPPPSTPPAAASP---RPPRRSSPISASASSPAPAPGRSAADDAGAS 235
Query: 263 SAGVYEYEDPHAAHAYYETTASYDPAM------YDPYASTAGDYYDTSASYYETAAPAPL 316
S+ E PA + G + ++
Sbjct: 236 SSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERS 295
Query: 317 PSTSSRTGGSGWKSSSGYAEHHSSSRSKSSAPIRSSSSE 355
PS S + GSG SS A SSS +SS+ SSSSE
Sbjct: 296 PSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSE 334
>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
Validated.
Length = 598
Score = 34.1 bits (79), Expect = 0.15
Identities = 18/85 (21%), Positives = 27/85 (31%), Gaps = 8/85 (9%)
Query: 158 RIAYALSELRKYIIPDVNDEIRQEQFREMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLA 217
R+AYA ++L P ++ +R+ +E P AA APG A
Sbjct: 370 RLAYA-ADL-----PTPDEALRR--LQEGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAA 421
Query: 218 PPRMQHAAPPHMVRPPHPPPPPAAS 242
+ P AA
Sbjct: 422 AAPAAGGPAAALAAVPDAAAAAAAP 446
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 33.3 bits (76), Expect = 0.16
Identities = 24/76 (31%), Positives = 28/76 (36%), Gaps = 4/76 (5%)
Query: 190 HHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSI 249
+ +P P P PPP AA PAP PP A P P+ PPP +
Sbjct: 38 NADPAP----PPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPP 93
Query: 250 LDRARAAMEKPAPSAG 265
A PAP G
Sbjct: 94 PPVDPNAPPPPAPEPG 109
Score = 29.9 bits (67), Expect = 2.6
Identities = 17/47 (36%), Positives = 21/47 (44%), Gaps = 1/47 (2%)
Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKV 246
P PP P PP +A PP V P+ PPPPA G++
Sbjct: 66 PAAPPAPQPDDPNAAPPPPPADPNAPPPPPV-DPNAPPPPAPEPGRI 111
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 33.5 bits (77), Expect = 0.18
Identities = 17/104 (16%), Positives = 34/104 (32%), Gaps = 5/104 (4%)
Query: 172 PDVNDEIRQEQFREMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVR 231
++ R + + +P+ +PP V + AP P P + +
Sbjct: 78 AQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQ 137
Query: 232 PPHPPPPPAASRGKVLSILDRARAAMEKPAPSAGVYEYEDPHAA 275
P P P + + + P P+ ++ +P AA
Sbjct: 138 PEQPLQQPVSP-----QVAPAPQPVHSAPQPAQQAFQPAEPVAA 176
Score = 28.5 bits (64), Expect = 6.1
Identities = 15/53 (28%), Positives = 18/53 (33%), Gaps = 4/53 (7%)
Query: 195 PQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRP-PHPPPPPAASRGKV 246
P P P V + P P A Q A P +P P P P + K
Sbjct: 145 PVSPQVAPAPQPVHSAPQP---AQQAFQPAEPVAAPQPEPVAEPAPVMDKPKR 194
Score = 28.1 bits (63), Expect = 8.2
Identities = 14/44 (31%), Positives = 18/44 (40%)
Query: 190 HHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPP 233
P PQ H P P A +PA + AP A P ++ P
Sbjct: 149 QVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDKP 192
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 33.6 bits (77), Expect = 0.18
Identities = 20/78 (25%), Positives = 22/78 (28%), Gaps = 5/78 (6%)
Query: 192 EPRPQMRHP-RPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSIL 250
R P RP H A R PR PP P P AA S L
Sbjct: 674 PSRVARGDPVRPTAHHAALRAPQ----APRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSL 729
Query: 251 DRARAAMEKPAPSAGVYE 268
+ A +P P
Sbjct: 730 IASPTAPPEPEPPGAEQA 747
Score = 31.3 bits (71), Expect = 1.2
Identities = 19/90 (21%), Positives = 19/90 (21%), Gaps = 9/90 (10%)
Query: 193 PRPQMRHPRPPPHMVAA-RPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILD 251
P P H R A G P HAA R P P P G
Sbjct: 658 GVPGPVPVGMPAHTARPSRVARGDPVRPTAHHAAL----RAPQAPRPGGPPGGGGGLPPP 713
Query: 252 RARAAMEKPAPSAGVYEYED----PHAAHA 277
A PAP
Sbjct: 714 PDLPAAAGPAPCGSSLIASPTAPPEPEPPG 743
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 32.9 bits (75), Expect = 0.32
Identities = 31/151 (20%), Positives = 35/151 (23%), Gaps = 20/151 (13%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAP---------------PRMQHAAPPHMVRP-PHPP 236
P R P RP G P P + A P + RP P
Sbjct: 99 PPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAW 158
Query: 237 PPPAASRGKVLSILDRARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYAST 296
P A G L A P S Y E Y YD Y
Sbjct: 159 PRAADDYGWQQQRLGFPPRA---PYASPASYAPEQERDREPYDAGRPEYDQR-RRDYDHP 214
Query: 297 AGDYYDTSASYYETAAPAPLPSTSSRTGGSG 327
D+ + P P R G
Sbjct: 215 RPDWDRPRRDRTDRPEPPPGAGHVHRGGPGP 245
>gnl|CDD|219053 pfam06482, Endostatin, Collagenase NC10 and Endostatin. NC10
stands for Non-helical region 10 and is taken from human
COL15A1. A mutation in this region in human COL18A1 is
associated with an increased risk of prostrate cancer.
This domain is cleaved from the precursor and forms
endostatin. Endostatin is a key tumour suppressor and
has been used highly successfully to treat cancer. It is
a potent angiogenesis inhibitor. Endostatin also binds a
zinc ion near the N-terminus; this is likely to be of
structural rather than functional importance according
to.
Length = 291
Score = 32.0 bits (73), Expect = 0.48
Identities = 17/73 (23%), Positives = 24/73 (32%), Gaps = 2/73 (2%)
Query: 183 FREMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPP--HMVRPPHPPPPPA 240
+R++ P P +PPP ++ P L PP + R PHPP
Sbjct: 44 WRKVLLGELVPLPGTTATQPPPVVLTPWSDPRLPDPPHLPDPQTHSATAHRNPHPPLNSP 103
Query: 241 ASRGKVLSILDRA 253
A V
Sbjct: 104 ARIAHVHQDFQPV 116
>gnl|CDD|172502 PRK14001, PRK14001, potassium-transporting ATPase subunit C;
Provisional.
Length = 189
Score = 31.6 bits (71), Expect = 0.49
Identities = 35/140 (25%), Positives = 58/140 (41%), Gaps = 29/140 (20%)
Query: 274 AAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPSTSSRTGGSGWKSSSG 333
+AH + TA+ + P S+AGD YD +AS + S G + K +
Sbjct: 53 SAHIGQQFTAA---KYFHPRPSSAGDGYDAAAS------------SGSNLGPTNEKLLAA 97
Query: 334 YAEHHSSSRSKSSAP---------IRSSSSE-----KVVNASLVGLQGAQSTEYGYEDRK 379
AE ++ R +++ P + S S VVNA L + AQ+ +
Sbjct: 98 VAERVTAYRKENNLPADTLVPVDAVTGSGSGLDPAISVVNAKLQAPRVAQARNISIRQVE 157
Query: 380 RRLDPYSDVESKKFVGSRVV 399
R ++ ++D F+G R V
Sbjct: 158 RLIEDHTDARGLGFLGERAV 177
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 32.1 bits (73), Expect = 0.53
Identities = 27/121 (22%), Positives = 39/121 (32%), Gaps = 15/121 (12%)
Query: 202 PPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEKPA 261
A AP A AA PP P PPA + DR A + A
Sbjct: 401 AVTGAAGAALAPKAAAA-----AAATRAEAPPAAPAPPATA--------DRGDDAADGDA 447
Query: 262 PSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPSTSS 321
P + +A + + + PA ++ A D +A A AP +T +
Sbjct: 448 PVPA--KANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPA 505
Query: 322 R 322
Sbjct: 506 A 506
Score = 31.0 bits (70), Expect = 1.2
Identities = 29/148 (19%), Positives = 42/148 (28%), Gaps = 6/148 (4%)
Query: 206 MVAARPAPGL-LAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEKPAPSA 264
M+A PA AP A V P A V ++ AA AP A
Sbjct: 355 MLAFEPAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKA 414
Query: 265 GV----YEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPSTS 320
E P AA A T D A A +++ P
Sbjct: 415 AAAAAATRAEAPPAAPAPPATADRGDDAADGD-APVPAKANARASADSRCDERDAQPPAD 473
Query: 321 SRTGGSGWKSSSGYAEHHSSSRSKSSAP 348
S + + + A + R+ + +
Sbjct: 474 SGSASAPASDAPPDAAFEPAPRAAAPSA 501
Score = 30.6 bits (69), Expect = 1.8
Identities = 19/124 (15%), Positives = 27/124 (21%), Gaps = 4/124 (3%)
Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEK 259
R + A A P AA+ + A A
Sbjct: 374 ARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAA--AAAATRAEAPPAAPA 431
Query: 260 PAPSAGVYE--YEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLP 317
P +A + + A AS D + A D SA + A
Sbjct: 432 PPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFE 491
Query: 318 STSS 321
Sbjct: 492 PAPR 495
Score = 30.6 bits (69), Expect = 1.8
Identities = 25/114 (21%), Positives = 35/114 (30%), Gaps = 3/114 (2%)
Query: 202 PPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEKPA 261
P + A PAPG A + +A P + A + + RA A PA
Sbjct: 372 VPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPA 431
Query: 262 PSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAP 315
P A +D A A+ A D + +APA
Sbjct: 432 PPATADRGDDAADGDAPVPAKAN-ARASADSRCDERDAQPPADSG--SASAPAS 482
>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
Length = 484
Score = 31.9 bits (73), Expect = 0.67
Identities = 24/115 (20%), Positives = 31/115 (26%), Gaps = 6/115 (5%)
Query: 146 VSALAPPAEAHARIAYALSELRKYIIPDVNDEIRQEQFREMGYVHHEPRPQMRHPR-PPP 204
++ALAPPA A + E E ++ R
Sbjct: 48 LAALAPPAAAAPA-----AAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLV 102
Query: 205 HMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEK 259
AA AP AP A AA R +V + A A K
Sbjct: 103 ARAAAPAAPEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADAAAAVADAVK 157
Score = 30.3 bits (69), Expect = 1.7
Identities = 17/105 (16%), Positives = 27/105 (25%), Gaps = 1/105 (0%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGL-LAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILD 251
P P P V+ AP AP ++HA R A + + +
Sbjct: 59 PAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAPAAPEAQAPAA 118
Query: 252 RARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYAST 296
A A + A + A+ A+
Sbjct: 119 PAERAAAENAARRLARAAAAAPRPRVPADAAAAVADAVKARIERI 163
Score = 28.8 bits (65), Expect = 6.1
Identities = 14/69 (20%), Positives = 20/69 (28%), Gaps = 2/69 (2%)
Query: 190 HHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHM--VRPPHPPPPPAASRGKVL 247
+ + P P A PA A + A RP P AA V
Sbjct: 98 REQLVARAAAPAAPEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADAAAAVADAVK 157
Query: 248 SILDRARAA 256
+ ++R
Sbjct: 158 ARIERIVND 166
>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
Length = 318
Score = 31.4 bits (71), Expect = 0.69
Identities = 23/85 (27%), Positives = 27/85 (31%), Gaps = 4/85 (4%)
Query: 194 RPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRA 253
R + R P PP RP P +P PP P PPP A+ R
Sbjct: 3 RARSRSPSPPRRPSPPRPTP-PRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRG 61
Query: 254 RAA---MEKPAPSAGVYEYEDPHAA 275
A AP +D AA
Sbjct: 62 CPAGVTFSSSAPPRPPLGLDDAPAA 86
>gnl|CDD|239087 cd02394, vigilin_like_KH, K homology RNA-binding
domain_vigilin_like. The vigilin family is a large and
extended family of multiple KH-domain proteins,
including vigilin, also called high density lipoprotein
binding protien (HBP), fungal Scp160 and bicaudal-C.
Yeast Scp160p has been shown to bind RNA and to
associate with both soluble and membrane-bound
polyribosomes as a mRNP component. Bicaudal-C is a
RNA-binding molecule believed to function in embryonic
development at the post-transcriptional level. In
general, KH binds single-stranded RNA or DNA. It is
found in a wide variety of proteins including ribosomal
proteins, transcription factors and post-transcriptional
modifiers of mRNA.
Length = 62
Score = 28.6 bits (65), Expect = 0.71
Identities = 16/69 (23%), Positives = 31/69 (44%), Gaps = 11/69 (15%)
Query: 73 VKVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGKGSMRDRSKEEELRKEGD 132
+V +P K H ++G KG+++R++ EET K+ SK + + G
Sbjct: 2 EEVEIPKKLHRF------IIGKKGSNIRKIMEETGVKIRF-----PDPGSKSDTITITGP 50
Query: 133 PKNAHLVED 141
+N ++
Sbjct: 51 KENVEKAKE 59
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 31.8 bits (73), Expect = 0.79
Identities = 27/140 (19%), Positives = 39/140 (27%), Gaps = 24/140 (17%)
Query: 194 RPQMRHPRPPPHMVAARPAPGLLAP--PRMQHAAPPHMVRPPHPPPPPAAS--------- 242
P P P +A PA A P A P PP P P +
Sbjct: 360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETT 419
Query: 243 ------RGKVLSILDRARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYD-PYAS 295
R ++ +A +PA ++ ++ E AS PA A
Sbjct: 420 SQLLAARQQLQRAQGATKAKKSEPAAAS------RARPVNSALERLASVRPAPSALEKAP 473
Query: 296 TAGDYYDTSASYYETAAPAP 315
+ Y A+ P
Sbjct: 474 AKKEAYRWKATNPVEVKKEP 493
Score = 28.7 bits (65), Expect = 6.6
Identities = 20/89 (22%), Positives = 30/89 (33%), Gaps = 7/89 (7%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDR 252
P P P+ P + LLA R Q + P A+ V S L+R
Sbjct: 400 PPPPASAPQQAPAVPLPETTSQLLAA-RQQLQRAQGATKAKKSEPAAASRARPVNSALER 458
Query: 253 ------ARAAMEKPAPSAGVYEYEDPHAA 275
A +A+EK Y ++ +
Sbjct: 459 LASVRPAPSALEKAPAKKEAYRWKATNPV 487
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 31.4 bits (71), Expect = 0.97
Identities = 14/54 (25%), Positives = 16/54 (29%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKV 246
P P PPP AA A P +P P AS K+
Sbjct: 22 PSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPASLWKL 75
>gnl|CDD|218233 pfam04732, Filament_head, Intermediate filament head (DNA binding)
region. This family represents the N-terminal head
region of intermediate filaments. Intermediate filament
heads bind DNA. Vimentin heads are able to alter nuclear
architecture and chromatin distribution, and the
liberation of heads by HIV-1 protease liberates may play
an important role in HIV-1 associated cytopathogenesis
and carcinogenesis. Phosphorylation of the head region
can affect filament stability. The head has been shown
to interaction with the rod domain of the same protein.
Length = 89
Score = 28.6 bits (64), Expect = 1.5
Identities = 18/54 (33%), Positives = 24/54 (44%), Gaps = 2/54 (3%)
Query: 303 TSASYYETAAPAPLPSTSSRTGGSG--WKSSSGYAEHHSSSRSKSSAPIRSSSS 354
+++SY T S+S +GGS SSSG+ S S SS SS
Sbjct: 2 STSSYRRTFGDPTRSSSSRSSGGSSGRSSSSSGFRSRSVSRSSSSSPSPSLKSS 55
Score = 27.1 bits (60), Expect = 5.6
Identities = 16/57 (28%), Positives = 24/57 (42%), Gaps = 3/57 (5%)
Query: 304 SASYYETAAPAPLPSTSSRTGGSGW---KSSSGYAEHHSSSRSKSSAPIRSSSSEKV 357
S+S + + S+SS SSS SS R +SS+ S SS+ +
Sbjct: 16 SSSSRSSGGSSGRSSSSSGFRSRSVSRSSSSSPSPSLKSSYRKRSSSAPPSLSSDSL 72
Score = 26.7 bits (59), Expect = 6.9
Identities = 20/49 (40%), Positives = 26/49 (53%), Gaps = 2/49 (4%)
Query: 318 STSS--RTGGSGWKSSSGYAEHHSSSRSKSSAPIRSSSSEKVVNASLVG 364
STSS RT G +SSS + SS RS SS+ RS S + ++S
Sbjct: 2 STSSYRRTFGDPTRSSSSRSSGGSSGRSSSSSGFRSRSVSRSSSSSPSP 50
>gnl|CDD|236234 PRK08312, PRK08312, putative indolepyruvate oxidoreductase subunit
B; Reviewed.
Length = 510
Score = 30.7 bits (70), Expect = 1.6
Identities = 15/63 (23%), Positives = 22/63 (34%), Gaps = 6/63 (9%)
Query: 224 AAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEKPAPS---AGVY---EYEDPHAAHA 277
AAPP P P ++L R A A + G+ +Y+D A
Sbjct: 217 AAPPPPAAAAPTAPRPRPPALAARALLARVEAEFPAAARAMLLEGLRRLVDYQDAAYAAE 276
Query: 278 YYE 280
Y +
Sbjct: 277 YLD 279
>gnl|CDD|237802 PRK14723, flhF, flagellar biosynthesis regulator FlhF; Provisional.
Length = 767
Score = 30.5 bits (69), Expect = 1.6
Identities = 14/53 (26%), Positives = 19/53 (35%), Gaps = 1/53 (1%)
Query: 207 VAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEK 259
A PAP L + A + P P P A G + L R +E+
Sbjct: 57 AYAPPAPAPLPAALVAPAPAAASIAAPAAVPAPGAI-GDLRGELQSMRGMLER 108
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 30.5 bits (69), Expect = 1.6
Identities = 9/48 (18%), Positives = 10/48 (20%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPA 240
P P P + P P V PP P
Sbjct: 400 PSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAV 447
>gnl|CDD|233927 TIGR02557, HpaP, type III secretion protein HpaP. This family of
genes is always found in type III secretion operons,
althought its function in the processes of secretion and
virulence is unclear. Hpa stands for Hrp-associated
gene, where Hrp stands for hypersensitivity response and
virulence.
Length = 201
Score = 29.9 bits (67), Expect = 1.7
Identities = 15/74 (20%), Positives = 18/74 (24%), Gaps = 7/74 (9%)
Query: 176 DEIRQEQFREMGYVHHEPRPQMR-------HPRPPPHMVAARPAPGLLAPPRMQHAAPPH 228
+ + R + P PPP RP P P
Sbjct: 11 APADPARPARRRTPLAQLRRRDALAYAPPPRPEPPPPCDEDRPEPRADTRASDPPPEAPT 70
Query: 229 MVRPPHPPPPPAAS 242
P PP P AS
Sbjct: 71 DADPAQPPDDPDAS 84
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 30.2 bits (68), Expect = 1.8
Identities = 16/46 (34%), Positives = 17/46 (36%), Gaps = 3/46 (6%)
Query: 197 MRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAAS 242
M P PP M A P AP + PP P PPP A
Sbjct: 152 MMQPPPPHAMPPASPPAAQPAPSAPASSPPP---TPASPPPAKAPK 194
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 30.4 bits (68), Expect = 1.8
Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 1/87 (1%)
Query: 274 AAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAPLPSTSSRTGGSGWKSSSG 333
+A A E S D + + AG + E A + S + SG KS+S
Sbjct: 106 SAAAILEEDDSKDDMEFKANPNEAGKPGKPKGNQGEGLASSS-DGKSKASAKSGSKSASK 164
Query: 334 YAEHHSSSRSKSSAPIRSSSSEKVVNA 360
+ E +SS S + + S+S +V A
Sbjct: 165 HGESNSSDESATDSGKASASVAGIVGA 191
>gnl|CDD|236128 PRK07908, PRK07908, hypothetical protein; Provisional.
Length = 349
Score = 30.4 bits (69), Expect = 1.9
Identities = 20/62 (32%), Positives = 23/62 (37%), Gaps = 2/62 (3%)
Query: 118 MRDRSKEEELRKEGDPKNAHLVEDLHVQVSALAPPAEAHARIAYALSELRKYIIPDVNDE 177
D S LR GD + D V V PP R+A L +L Y P DE
Sbjct: 3 SSDTSPLAALRHHGDQDAGPGLLDFAVNVRHDTPPEWLRERLAARLGDLAAY--PSTEDE 60
Query: 178 IR 179
R
Sbjct: 61 RR 62
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 30.3 bits (68), Expect = 1.9
Identities = 23/91 (25%), Positives = 26/91 (28%), Gaps = 7/91 (7%)
Query: 192 EPRPQMRHPRPPPHMV----AARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVL 247
+P + PR P V RP P A P P P PA S
Sbjct: 187 DPALPLSAPRLGPADVFVPATPRPTPRTTASPET--TPTPSTTTSPPSTTIPAPSTTIAA 244
Query: 248 SILDRARAAMEKPAPSA-GVYEYEDPHAAHA 277
A PAP G E +A A
Sbjct: 245 PQAGTTPEAEGTPAPPTPGGGEAPPANATPA 275
>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486). This
family is made up of members from various Burkholderia
spp. The function is unknown.
Length = 245
Score = 29.9 bits (67), Expect = 2.1
Identities = 31/150 (20%), Positives = 38/150 (25%), Gaps = 26/150 (17%)
Query: 190 HHEPRPQMRHPRPPPHMVAARPA--------PGLLAPPRMQHAAPPHMVRPP----HPPP 237
H P PH AA P A HAA P P P
Sbjct: 20 HPVPARSSSADAAGPHDDAAEPVLTDQIVPGAEQAASAAPVHAAREATADPEFVAVEPVP 79
Query: 238 PP-------------AASRGKVLSILDRARAAMEKPAPSAGVYEYEDPHAAHAYYETTAS 284
P A G ++ AAM+ P PSA + A A
Sbjct: 80 TPHVPAVALPGDTDAPAEPGAAPHVVAERAAAMQAPLPSALAADDPQAPPAGATAADAGD 139
Query: 285 YDPAMYDPYASTAGDYYDTSASYYETAAPA 314
P P A+ + + AA
Sbjct: 140 AAPD-ATPPAAGDASPPAAAQAAASAAAAL 168
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 29.6 bits (67), Expect = 2.2
Identities = 14/49 (28%), Positives = 16/49 (32%), Gaps = 2/49 (4%)
Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPA 240
P PP A P AP R Q AAP + + P A
Sbjct: 131 PATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAP--VTQAVEAPKVEA 177
Score = 28.1 bits (63), Expect = 7.7
Identities = 12/50 (24%), Positives = 14/50 (28%), Gaps = 1/50 (2%)
Query: 193 PRPQM-RHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAA 241
PR + + A A P Q AP P PAA
Sbjct: 115 PRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAA 164
>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator. Shisa is a
transcription factor-type molecule that physically
interacts with immature forms of the Wnt receptor
Frizzled and the FGF receptor within the endoplasmic
reticulum to inhibit their post-translational maturation
and trafficking to the cell surface.
Length = 177
Score = 29.4 bits (66), Expect = 2.2
Identities = 17/47 (36%), Positives = 22/47 (46%), Gaps = 6/47 (12%)
Query: 191 HEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPP 237
P PQ + P P P PG+ APP PP +++P PPP
Sbjct: 136 SYPGPQYQGYHPMP------PQPGMPAPPYSLQYPPPGLLQPQGPPP 176
Score = 29.4 bits (66), Expect = 2.3
Identities = 12/83 (14%), Positives = 17/83 (20%), Gaps = 15/83 (18%)
Query: 199 HPRPPPHMVAARPAPGLLAPPRMQHAAPP-HMVRPPHPPPPPAASRGKVLSILDR--ARA 255
P M A +Q P P P P +
Sbjct: 107 CRPQRPVMTRATSTT-------VQTTPLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPY 159
Query: 256 AMEKPAPSAGVYEYEDPHAAHAY 278
+++ P P P Y
Sbjct: 160 SLQYPPPGLL-----QPQGPPPY 177
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 30.3 bits (68), Expect = 2.3
Identities = 14/54 (25%), Positives = 18/54 (33%), Gaps = 11/54 (20%)
Query: 189 VHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAAS 242
V P P P A+ PAP PP +++ PPP P
Sbjct: 219 VSFNPFLPGPSPAQPSAPPASIPAP--PIPP---------VIQYVAPPPVPPPQ 261
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 30.2 bits (68), Expect = 2.3
Identities = 14/67 (20%), Positives = 18/67 (26%), Gaps = 8/67 (11%)
Query: 195 PQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPP----HMVRPPHPPPPPAASRGKVLSIL 250
M R P PP PP + P P AS+G L
Sbjct: 438 APMNAVRAPSRNAQNAAQK----PPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKL 493
Query: 251 DRARAAM 257
+ A+
Sbjct: 494 AQVLASA 500
Score = 30.2 bits (68), Expect = 2.4
Identities = 14/63 (22%), Positives = 18/63 (28%)
Query: 180 QEQFREMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPP 239
Q + +G+ P P P P + AP R A P PP
Sbjct: 408 QFNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYPPN 467
Query: 240 AAS 242
S
Sbjct: 468 YQS 470
>gnl|CDD|180801 PRK07033, PRK07033, hypothetical protein; Provisional.
Length = 427
Score = 30.0 bits (68), Expect = 2.4
Identities = 15/72 (20%), Positives = 25/72 (34%), Gaps = 10/72 (13%)
Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPA-------ASRGKVLSILDR 252
P P PG P A P P P A A+ +L+++ +
Sbjct: 6 PFSAGSGGFVPPNPGDRTPAAAPAAGAP---FQPRPGRGAASGLNPLVAAANPLLNLIPQ 62
Query: 253 ARAAMEKPAPSA 264
R+ + P P++
Sbjct: 63 IRSTVHHPDPAS 74
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 29.0 bits (65), Expect = 2.6
Identities = 14/50 (28%), Positives = 17/50 (34%), Gaps = 4/50 (8%)
Query: 190 HHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPP 239
HH+P +P P P MQ P H + P P PP
Sbjct: 87 HHQPNLPQPAQQPFQPQPLQPPQP----QQPMQPQPPVHPIPPLPPQPPL 132
Score = 28.6 bits (64), Expect = 4.1
Identities = 11/49 (22%), Positives = 18/49 (36%)
Query: 190 HHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPP 238
+ Q + +PP +P P + P + P + P P PP
Sbjct: 95 PAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPP 143
Score = 27.8 bits (62), Expect = 7.6
Identities = 13/49 (26%), Positives = 17/49 (34%)
Query: 191 HEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPP 239
+PQ P P + +P + P Q PP P PP P
Sbjct: 98 QPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPLLP 146
>gnl|CDD|100009 cd06152, YjgF_YER057c_UK114_like_4, YjgF, YER057c, and UK114
belong to a large family of proteins present in
bacteria, archaea, and eukaryotes with no definitive
function. The conserved domain is similar in structure
to chorismate mutase but there is no sequence
similarity and no functional connection. Members of
this family have been implicated in isoleucine (Yeo7,
Ibm1, aldR) and purine (YjgF) biosynthesis, as well as
threonine anaerobic degradation (tdcF) and
mitochondrial DNA maintenance (Ibm1). This domain
homotrimerizes forming a distinct intersubunit cavity
that may serve as a small molecule binding site.
Length = 114
Score = 28.4 bits (64), Expect = 2.9
Identities = 7/23 (30%), Positives = 13/23 (56%)
Query: 34 HSYTLKLLDDEINKAMASNKKKY 56
+SY + + ++E M N KK+
Sbjct: 62 NSYHVDIKNEEAFGLMVENFKKW 84
>gnl|CDD|234012 TIGR02784, addA_alphas, double-strand break repair helicase AddA,
alphaproteobacterial type. AddAB, also called RexAB,
substitutes for RecBCD in several bacterial lineages.
These DNA recombination proteins act before synapse and
are particularly important for DNA repair of
double-stranded breaks by homologous recombination. The
term AddAB is used broadly, with AddA homologous between
the alphaproteobacteria (as modeled here) and the
Firmicutes, while the partner AddB proteins show no
strong homology across the two groups of species [DNA
metabolism, DNA replication, recombination, and repair].
Length = 1135
Score = 30.0 bits (68), Expect = 3.0
Identities = 16/75 (21%), Positives = 22/75 (29%), Gaps = 8/75 (10%)
Query: 192 EPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILD 251
+ ++R R V A A P +R P P PA R S
Sbjct: 901 KAEWRLRFTRRDWDPVGLPVE----AAQTDTLEALPDWLRAP-APAEPALPRPLAPS--- 952
Query: 252 RARAAMEKPAPSAGV 266
A++ P V
Sbjct: 953 GLGGAIDSALPGEAV 967
>gnl|CDD|233909 TIGR02520, pilus_B_mal_scr, type IVB pilus formation outer membrane
protein, R64 PilN family. Several related protein
families encode outer membrane pore proteins for type II
secretion, type III secretion, and type IV pilus
formation. This protein family appears to encode a
secretin for pilus formation, although it is quite
different from PilQ. Members include the PilN
lipoprotein of the plasmid R64 thin pilus, a type IV
pilus. Scoring between the trusted and noise cutoffs are
examples of bundle-forming pilus B (bfpB) [Cell
envelope, Surface structures, Protein fate, Protein and
peptide secretion and trafficking].
Length = 497
Score = 29.9 bits (67), Expect = 3.1
Identities = 35/164 (21%), Positives = 49/164 (29%), Gaps = 41/164 (25%)
Query: 208 AARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRG------------KVLSILDRARA 255
AAR P +L P RPP P A S + +
Sbjct: 75 AARAIPAVLP-------RPDEGGRPPLPSGLGATSTQSEGPGGTAGLITDISWEGEPLGG 127
Query: 256 AMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYYETAAPAP 315
+++ G+ + A +Y D + A +TA +
Sbjct: 128 LLDQVTSRLGLSWKYENGAVRIFY-----TDTRTFQVKALPG-----------DTALNSS 171
Query: 316 LPSTSSRTGGSGWKSSSGYAEHHSSSRSKSSAPIRSSSSEKVVN 359
+ STSS T GSG SS G S S S+ V N
Sbjct: 172 VTSTSSSTAGSGSSSSGG------SGNSGSTQSTAVKLESSVHN 209
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 29.7 bits (66), Expect = 3.2
Identities = 31/135 (22%), Positives = 45/135 (33%), Gaps = 19/135 (14%)
Query: 189 VHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLS 248
+H Q P P PH A+R +AP P P PAAS
Sbjct: 31 LHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKP-----DLAQAPTPAASEK---- 81
Query: 249 ILDRARAAMEKPAPSAGVYEYEDPHAAHAYYETTASYDPAMYDPYASTAGDYYDTSASYY 308
D A PAP DP A + A+ P + + S A + + +
Sbjct: 82 -FDPA------PAPHQAASRAPDPAVAP---QLAAAPKPDAAEAFTSAAQAHEAPADAGT 131
Query: 309 ETAAPAPLPSTSSRT 323
A+ P P+ ++
Sbjct: 132 SAASKKPDPAAHTQH 146
>gnl|CDD|214981 smart01025, BEN, The BEN domain is found in diverse animal
proteins. Proteins containing BEN domains are
BANP/SMAR1, NAC1 and the Drosophila mod(mdg4) isoform C,
the chordopoxvirus virosomal protein E5R and several
proteins of polydnaviruses. Computational analysis
suggests that the BEN domain mediates protein-DNA and
protein-protein interactions during chromatin
organisation and transcription.
Length = 80
Score = 27.3 bits (61), Expect = 3.5
Identities = 18/78 (23%), Positives = 29/78 (37%), Gaps = 15/78 (19%)
Query: 104 EETMTKMSILGKGSMRDRSKEEELRKEGDPKNAHLVEDLHVQVSALAPPAEAHARIAYAL 163
ET+ S+ GK S S +K DP V+ + + P + L
Sbjct: 14 RETLANSSVTGKRSRTGGSG----KKPLDPNK---VQAIREYLEERFPSVD--------L 58
Query: 164 SELRKYIIPDVNDEIRQE 181
E K I +N++ R+
Sbjct: 59 DEDWKVCIQAINNKCRRL 76
>gnl|CDD|223041 PHA03321, PHA03321, tegument protein VP11/12; Provisional.
Length = 694
Score = 29.5 bits (66), Expect = 3.6
Identities = 21/87 (24%), Positives = 29/87 (33%), Gaps = 5/87 (5%)
Query: 206 MVAARPAPGLLAPPRMQHAAPPHMVRPPHPP-----PPPAASRGKVLSILDRARAAMEKP 260
++++R PG AP R PP RP P +R +D A P
Sbjct: 424 LLSSRQPPGAPAPRRDNDPPPPPRARPGSTPACARRARAQRARDAGPEYVDPLGALRRLP 483
Query: 261 APSAGVYEYEDPHAAHAYYETTASYDP 287
A +A E + YY P
Sbjct: 484 AGAAPPPEPAAAPSPATYYTRMGGGPP 510
>gnl|CDD|131743 TIGR02696, pppGpp_PNP, guanosine pentaphosphate synthetase
I/polynucleotide phosphorylase. Sohlberg, et al present
characterization of two proteins from Streptomyces
coelicolor. The protein in this family was shown to have
poly(A) polymerase activity and may be responsible for
polyadenylating RNA in this species. Reference 2 showed
that a nearly identical plasmid-encoded protein from
Streptomyces antibioticus is a bifunctional enzyme that
acts also as a guanosine pentaphosphate synthetase.
Length = 719
Score = 29.4 bits (66), Expect = 4.1
Identities = 14/46 (30%), Positives = 29/46 (63%), Gaps = 6/46 (13%)
Query: 73 VKVLVPVKEHPRFNFVGKLLGPKGNSLRRLQEETMTKMSILGKGSM 118
+ V +PV + +G+++GPKG + ++Q+ET ++SI G++
Sbjct: 580 ITVKIPVDK------IGEVIGPKGKMINQIQDETGAEISIEDDGTV 619
>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
Length = 416
Score = 29.2 bits (65), Expect = 4.5
Identities = 16/58 (27%), Positives = 19/58 (32%), Gaps = 3/58 (5%)
Query: 184 REMGYVHHEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAA 241
RE G P P R +P P AP + P + PPP PA
Sbjct: 299 RETGGEGEGPEPAGRDGAAGGE---PKPGPPRPAPDADRPEGWPSLEAITFPPPTPAT 353
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 29.3 bits (65), Expect = 4.6
Identities = 11/53 (20%), Positives = 16/53 (30%), Gaps = 3/53 (5%)
Query: 191 HEPRPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASR 243
H P+P +P A +P A Q + P P A +
Sbjct: 199 HAPKPT---QQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQ 248
>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
Length = 931
Score = 29.4 bits (66), Expect = 4.6
Identities = 21/89 (23%), Positives = 26/89 (29%), Gaps = 22/89 (24%)
Query: 209 ARPAPGLLAPPRMQHAAPPHMVRPP-------------HPPPPPAASRGKVLSILDRARA 255
A A G +A Q AAP P PP PPA + +
Sbjct: 40 AEGAAGPVAKAAEQMAAPEAAEAAPLPAAAESIASPEVPPPVPPAPA---------QEGE 90
Query: 256 AMEKPAPSAGVYEYEDPHAAHAYYETTAS 284
A PSA P A + A+
Sbjct: 91 APAAEQPSAVPAPSAAPAPAEPVEPSLAA 119
>gnl|CDD|220243 pfam09440, eIF3_N, eIF3 subunit 6 N terminal domain. This is the N
terminal domain of subunit 6 translation initiation
factor eIF3.
Length = 133
Score = 28.0 bits (63), Expect = 4.7
Identities = 16/57 (28%), Positives = 25/57 (43%), Gaps = 4/57 (7%)
Query: 1 MDIERELEDGNDTKANEYVKT---LLKEKMTLDSDSHSYTLKLLDDEINKAMASNKK 54
D+ +EL ++ E +LKE L + T L D E+ K + S+KK
Sbjct: 51 ADLRKELY-PDEEMPAELAAKRQEVLKELQELQEEVAPITKLLEDPEVIKNLRSDKK 106
>gnl|CDD|235357 PRK05177, minC, septum formation inhibitor; Reviewed.
Length = 239
Score = 28.8 bits (65), Expect = 4.9
Identities = 16/47 (34%), Positives = 20/47 (42%)
Query: 194 RPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPA 240
RP + P PP +V RPA + P A PP P P P+
Sbjct: 87 RPSLLGPGMPPALVGGRPAGDVEIPEEEPAAPPPAPAAPEAPAAVPS 133
>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
Provisional.
Length = 376
Score = 28.7 bits (64), Expect = 5.4
Identities = 14/44 (31%), Positives = 17/44 (38%), Gaps = 8/44 (18%)
Query: 200 PRPPPHMVAA-----RPAPGLLAPPRMQHAAPPHMVRPPHPPPP 238
P PP M+A PAP + P A V+P PP
Sbjct: 59 PAPPSTMIATNTKPFHPAPIDVPPDP---PAAQEAVQPTAPPSA 99
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 29.0 bits (64), Expect = 5.6
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 303 TSASYYETAAPAPLPSTSSRTGGSGWKSSSGY 334
S ++ T +P P P T+S+ G G S+S Y
Sbjct: 638 PSTTHVSTLSPGPGPGTTSQVSGPGNSSTSRY 669
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 28.9 bits (64), Expect = 5.6
Identities = 24/83 (28%), Positives = 29/83 (34%), Gaps = 5/83 (6%)
Query: 200 PRPPPHMVAARPAPGLLAPPRMQH-AAPPHMVRPPHPPP----PPAASRGKVLSILDRAR 254
P R P AP R + AA P RPP P PPAA+ G+
Sbjct: 713 RAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG 772
Query: 255 AAMEKPAPSAGVYEYEDPHAAHA 277
A +P P A + P A
Sbjct: 773 APTPQPPPQAPPAPQQRPRGAPT 795
>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
Length = 280
Score = 28.7 bits (63), Expect = 6.0
Identities = 16/49 (32%), Positives = 16/49 (32%), Gaps = 1/49 (2%)
Query: 193 PRPQMRHPRPP-PHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPA 240
P P P P PAP PP PP V P P PA
Sbjct: 91 PAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPA 139
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 29.1 bits (66), Expect = 6.2
Identities = 18/64 (28%), Positives = 19/64 (29%), Gaps = 10/64 (15%)
Query: 201 RPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAMEKP 260
P A A AAP P PPAA A AA KP
Sbjct: 37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAA----------AAPAAPPKP 86
Query: 261 APSA 264
A +A
Sbjct: 87 AAAA 90
>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183). This
family consists of several eukaryotic proteins of around
360 residues in length. The function of this family is
unknown.
Length = 317
Score = 28.5 bits (64), Expect = 6.4
Identities = 12/38 (31%), Positives = 15/38 (39%), Gaps = 5/38 (13%)
Query: 317 PSTSSRTGGSGWKSSSGYAEHHSSSRSKSSAPIRSSSS 354
+ S SSS SS S SS+ R+SS
Sbjct: 278 GRSYGSGSPSYSPSSSSN-----SSSSSSSSSTRTSSG 310
>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR. This region is found
to the N-terminus of the pfam00170 transcription factor
domain. It is between 150 and 200 amino acids in length.
The N-terminal half is rather rich in proline residues
and has been termed the PRD (proline rich domain),
whereas the C-terminal half is more polar and has been
called the MFMR (multifunctional mosaic region). It has
been suggested that this family is composed of three
sub-families called A, B and C, classified according to
motif composition. It has been suggested that some of
these motifs may be involved in mediating
protein-protein interactions. The MFMR region contains a
nuclear localisation signal in bZIP opaque and GBF-2.
The MFMR also contains a transregulatory activity in
TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
signals.
Length = 189
Score = 27.9 bits (62), Expect = 6.6
Identities = 12/40 (30%), Positives = 14/40 (35%), Gaps = 8/40 (20%)
Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPP 239
PRPPP + A P+M P P PP
Sbjct: 40 PRPPPPYFNSSVASS--------PQPHPYMWGPQQPMMPP 71
>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 559
Score = 28.8 bits (64), Expect = 6.8
Identities = 17/79 (21%), Positives = 27/79 (34%), Gaps = 2/79 (2%)
Query: 150 APPAEAHARIAYALSELRKYIIPDVNDEIRQEQFR--EMGYVHHEPRPQMRHPRPPPHMV 207
P +A A A + + + + + G+ + PQ+ P P +V
Sbjct: 178 RAPVQAPVVAAPAPVPAIAAALAAHAAYAQDDDEQLDDDGFDLDDALPQILPPAALPPIV 237
Query: 208 AARPAPGLLAPPRMQHAAP 226
A AP LA AP
Sbjct: 238 VAPAAPAALAAVAAAAPAP 256
>gnl|CDD|197247 cd09149, PLDc_vPLD5_2, Putative catalytic domain, repeat 2, of
inactive veterbrate phospholipase PLD5. Putative
catalytic domain, repeat 2, of inactive veterbrate
phospholipases D5 (PLD5, EC 3.1.4.4), homologs of the
vaccinia virus protein K4 encoded by the HindIII K4L
gene. Vertebrate PLD5 has been assigned to the PLD
superfamily, since it shows high sequence similarity to
other human homologs of protein K4, which contain two
copies of the conserved HKD motif (H-x-K-x(4)-D, where
x represents any amino acid residue). However, due to
the lack of functionally important histidine and lysine
residues in the HKD motif, vetebrate PLD5 has been
characterized as an inactive PLD.
Length = 188
Score = 27.9 bits (62), Expect = 7.1
Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 5/46 (10%)
Query: 51 SNKKKYDSPYLDLFREKPIL--VKVKVLVPV--KEHP-RFNFVGKL 91
S ++Y S RE +L V+V++L+ K P FNFV L
Sbjct: 46 SYARRYWSRIDSKIREALVLRSVRVRLLISFWRKTDPLTFNFVSSL 91
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 28.4 bits (63), Expect = 7.3
Identities = 12/42 (28%), Positives = 13/42 (30%)
Query: 200 PRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAA 241
P P A P+ P A PH P P P A
Sbjct: 207 PAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPA 248
>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
Provisional.
Length = 172
Score = 28.0 bits (62), Expect = 7.4
Identities = 14/48 (29%), Positives = 15/48 (31%)
Query: 194 RPQMRHPRPPPHMVAARPAPGLLAPPRMQHAAPPHMVRPPHPPPPPAA 241
RPQ + AP A Q A P P P PAA
Sbjct: 114 RPQGDDQGGQGGGNYNQSAPRQQAQRPQQAAQQQSRPAPQQPAPQPAA 161
>gnl|CDD|178608 PLN03046, PLN03046, D-glycerate 3-kinase; Provisional.
Length = 460
Score = 28.0 bits (62), Expect = 9.3
Identities = 14/99 (14%), Positives = 26/99 (26%), Gaps = 7/99 (7%)
Query: 272 PHAAHAYYETTAS-------YDPAMYDPYASTAGDYYDTSASYYETAAPAPLPSTSSRTG 324
P + + Y P + D S S Y ++ + + S G
Sbjct: 28 PIPCASAAAFNFNPISNPFNLKRRPYSPSSHKFNDRVAASCSSYPSSKLSSRKTHLSCPG 87
Query: 325 GSGWKSSSGYAEHHSSSRSKSSAPIRSSSSEKVVNASLV 363
H ++ + +S S + S V
Sbjct: 88 CGCSWIQDNSMVHDYATTAATSKRCSSLPTSSPALVSSV 126
>gnl|CDD|106738 PRK13797, PRK13797, putative bifunctional allantoicase/OHCU
decarboxylase; Provisional.
Length = 516
Score = 28.0 bits (62), Expect = 9.6
Identities = 13/33 (39%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 231 RPPHPPPPPAASRGKVLSILD--RARAAMEKPA 261
RP P PP ++R V+S+ R +AAM++ A
Sbjct: 412 RPTQAPAPPTSARATVVSLDAPRREQAAMDQAA 444
>gnl|CDD|181337 PRK08266, PRK08266, hypothetical protein; Provisional.
Length = 542
Score = 28.1 bits (63), Expect = 9.7
Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 7/58 (12%)
Query: 206 MVAARPAPGLLAPP------RMQHAAPPHMVRPPHPPPPPAASRGKVLSILDRARAAM 257
M++ RP P L P R AA P + P P P P A +++ A+ M
Sbjct: 153 MLSGRPRPVALEMPWDVFGQRAPVAAAPPLRPAPPPAPDPDAI-AAAAALIAAAKNPM 209
>gnl|CDD|221509 pfam12287, Caprin-1_C, Cytoplasmic
activation/proliferation-associated protein-1 C term.
This family of proteins is found in eukaryotes. Proteins
in this family are typically between 343 and 708 amino
acids in length. This family is the C terminal region of
caprin-1. Caprin-1 is a protein involved in regulating
cellular proliferation. In mutated phenotypes, the G1
phase of the cell cycle is greatly lengthened, impairing
normal proliferation. The C terminal region of caprin-1
contains RGG motifs which are characteristic of RNA
binding domains. It is possible that caprin-1 functions
through an RNA binding mechanism.
Length = 319
Score = 28.0 bits (62), Expect = 9.9
Identities = 13/53 (24%), Positives = 21/53 (39%), Gaps = 9/53 (16%)
Query: 189 VHHEPRPQMRHPRPPPHMVAARPAPG---LLAPPRMQHAAPPHMVRPPHPPPP 238
VH E R P V +P P +++P + + P + +P H P
Sbjct: 45 VHSESRL------SQPSAVPVQPEPTQVPMVSPTSEGYTSSPPLYQPSHTAEP 91
>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN). This family
consists of several eukaryotic survival motor neuron
(SMN) proteins. The Survival of Motor Neurons (SMN)
protein, the product of the spinal muscular
atrophy-determining gene, is part of a large
macromolecular complex (SMN complex) that functions in
the assembly of spliceosomal small nuclear
ribonucleoproteins (snRNPs). The SMN complex functions
as a specificity factor essential for the efficient
assembly of Sm proteins on U snRNAs and likely protects
cells from illicit, and potentially deleterious,
non-specific binding of Sm proteins to RNAs.
Length = 264
Score = 27.6 bits (61), Expect = 9.9
Identities = 19/58 (32%), Positives = 22/58 (37%), Gaps = 10/58 (17%)
Query: 193 PRPQMRHPRPPPHMVAARPAPGL--------LAPPRMQHAAPPHMVRPPHPPPPPAAS 242
P +PR PP P PG PP + PP + PP PPPP S
Sbjct: 163 GPPSPWNPRFPP--GPPPPPPGFGRHGEKPSGWPPFLSGWPPPFPLGPPMIPPPPPMS 218
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.311 0.129 0.371
Gapped
Lambda K H
0.267 0.0703 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,925,085
Number of extensions: 2065459
Number of successful extensions: 3608
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2766
Number of HSP's successfully gapped: 328
Length of query: 402
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 303
Effective length of database: 6,546,556
Effective search space: 1983606468
Effective search space used: 1983606468
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 60 (26.9 bits)