BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 026649
(235 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q04832|HEXP_LEIMA DNA-binding protein HEXBP OS=Leishmania major GN=HEXBP PE=4 SV=1
Length = 271
Score = 93.6 bits (231), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 83/281 (29%), Positives = 109/281 (38%), Gaps = 99/281 (35%)
Query: 15 CNNCKRPGHFARECPNV--------AVCNNCGLPGHIASECTTQAR--------CWNCRE 58
C NC + GH+ARECP C CG GH++ EC +AR C+ C E
Sbjct: 18 CRNCGKEGHYARECPEADSKGDERSTTCFRCGEEGHMSRECPNEARSGAAGAMTCFRCGE 77
Query: 59 PGHMASNCHNEGI--------CHSCGKTGHRARDCSTH----------------VQSG-- 92
GHM+ +C N C+ CG+ GH +RDC + Q G
Sbjct: 78 AGHMSRDCPNSAKPGAAKGFECYKCGQEGHLSRDCPSSQGGSRGGYGQKRGRSGAQGGYS 137
Query: 93 GDLRLCNNCYKPGHIAADCTN---------DKACKNCRKTGHIARDCQN---------EP 134
GD R C C GHI+ DC N D+ C C GHI+RDC N +
Sbjct: 138 GD-RTCYKCGDAGHISRDCPNGQGGYSGAGDRTCYKCGDAGHISRDCPNGQGGYSGAGDR 196
Query: 135 VCNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDGGGGGGRYVGYHDVICRSCNQ 194
C C +GH++R+CP S G D C C +
Sbjct: 197 KCYKCGESGHMSRECPSAGSTGS---------------------------GDRACYKCGK 229
Query: 195 MGHMSRDCVGPLI-----------ICRNCGGRGHMAYECPS 224
GH+SR+C C CG GH++ +CPS
Sbjct: 230 PGHISRECPEAGGSYGGSRGGGDRTCYKCGEAGHISRDCPS 270
Score = 65.5 bits (158), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/178 (26%), Positives = 74/178 (41%), Gaps = 54/178 (30%)
Query: 9 MSQGNLCNNCKRPGHFARECPNVA------------------------VCNNCGLPGHIA 44
++G C C + GH +R+CP+ C CG GHI+
Sbjct: 93 AAKGFECYKCGQEGHLSRDCPSSQGGSRGGYGQKRGRSGAQGGYSGDRTCYKCGDAGHIS 152
Query: 45 SECTT---------QARCWNCREPGHMASNCHN---------EGICHSCGKTGHRARDCS 86
+C C+ C + GH++ +C N + C+ CG++GH +R+C
Sbjct: 153 RDCPNGQGGYSGAGDRTCYKCGDAGHISRDCPNGQGGYSGAGDRKCYKCGESGHMSRECP 212
Query: 87 THVQSGGDLRLCNNCYKPGHIAADCTN------------DKACKNCRKTGHIARDCQN 132
+ +G R C C KPGHI+ +C D+ C C + GHI+RDC +
Sbjct: 213 SAGSTGSGDRACYKCGKPGHISRECPEAGGSYGGSRGGGDRTCYKCGEAGHISRDCPS 270
Score = 49.3 bits (116), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 51/134 (38%), Gaps = 41/134 (30%)
Query: 113 NDKACKNCRKTGHIARDC--------QNEPVCNLCNIAGHVARQCPKGDSLGERGGGGGG 164
+ +C+NC K GH AR+C + C C GH++R+CP G G
Sbjct: 14 SSTSCRNCGKEGHYARECPEADSKGDERSTTCFRCGEEGHMSRECPNEARSGAAGA---- 69
Query: 165 ERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDC-------VGPLIICRNCGGRGH 217
+ C C + GHMSRDC C CG GH
Sbjct: 70 ----------------------MTCFRCGEAGHMSRDCPNSAKPGAAKGFECYKCGQEGH 107
Query: 218 MAYECPSGRIADRG 231
++ +CPS + RG
Sbjct: 108 LSRDCPSSQGGSRG 121
>sp|P53849|GIS2_YEAST Zinc finger protein GIS2 OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=GIS2 PE=1 SV=1
Length = 153
Score = 92.8 bits (229), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/157 (36%), Positives = 87/157 (55%), Gaps = 17/157 (10%)
Query: 9 MSQGNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQ-----ARCWNCREPGHMA 63
MSQ C C + GH A +C + +C NC PGH+ ++CT +C+NC E GH+
Sbjct: 1 MSQ-KACYVCGKIGHLAEDCDSERLCYNCNKPGHVQTDCTMPRTVEFKQCYNCGETGHVR 59
Query: 64 SNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYK---PGHIAADCTNDK----- 115
S C + C +C +TGH +R+C ++ ++ +CYK P H+A DC +
Sbjct: 60 SECTVQR-CFNCNQTGHISRECPEPKKTSRFSKV--SCYKCGGPNHMAKDCMKEDGISGL 116
Query: 116 ACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPKG 152
C C + GH++RDCQN+ +C CN GH+++ CPK
Sbjct: 117 KCYTCGQAGHMSRDCQNDRLCYNCNETGHISKDCPKA 153
Score = 83.6 bits (205), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/163 (34%), Positives = 78/163 (47%), Gaps = 16/163 (9%)
Query: 68 NEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCT-----NDKACKNCRK 122
++ C+ CGK GH A DC + RLC NC KPGH+ DCT K C NC +
Sbjct: 2 SQKACYVCGKIGHLAEDCDSE-------RLCYNCNKPGHVQTDCTMPRTVEFKQCYNCGE 54
Query: 123 TGHIARDCQNEPVCNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDGGGGGGRYV 182
TGH+ +C + C CN GH++R+CP+ + GG +
Sbjct: 55 TGHVRSECTVQ-RCFNCNQTGHISRECPEPKKTSRFSKVSCYK--CGGPNHMAKDCMKED 111
Query: 183 GYHDVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSG 225
G + C +C Q GHMSRDC + C NC GH++ +CP
Sbjct: 112 GISGLKCYTCGQAGHMSRDCQNDRL-CYNCNETGHISKDCPKA 153
Score = 72.4 bits (176), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 82/162 (50%), Gaps = 21/162 (12%)
Query: 49 TQARCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIA 108
+Q C+ C + GH+A +C +E +C++C K GH DC+ + + + C NC + GH+
Sbjct: 2 SQKACYVCGKIGHLAEDCDSERLCYNCNKPGHVQTDCT--MPRTVEFKQCYNCGETGHVR 59
Query: 109 ADCTNDKACKNCRKTGHIARDCQ--------NEPVCNLCNIAGHVARQCPKGDSLGERGG 160
++CT + C NC +TGHI+R+C ++ C C H+A+ C K D +
Sbjct: 60 SECTVQR-CFNCNQTGHISRECPEPKKTSRFSKVSCYKCGGPNHMAKDCMKEDGIS---- 114
Query: 161 GGGGERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDC 202
G G G +D +C +CN+ GH+S+DC
Sbjct: 115 ------GLKCYTCGQAGHMSRDCQNDRLCYNCNETGHISKDC 150
Score = 43.9 bits (102), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 47/118 (39%), Gaps = 31/118 (26%)
Query: 113 NDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGGG 172
+ KAC C K GH+A DC +E +C CN GHV C ++ +
Sbjct: 2 SQKACYVCGKIGHLAEDCDSERLCYNCNKPGHVQTDCTMPRTVEFKQ------------- 48
Query: 173 DGGGGGGRYVGYHDVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSGRIADR 230
C +C + GH+ +C + C NC GH++ ECP + R
Sbjct: 49 ----------------CYNCGETGHVRSECT--VQRCFNCNQTGHISRECPEPKKTSR 88
>sp|P36627|BYR3_SCHPO Cellular nucleic acid-binding protein homolog
OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=byr3 PE=4 SV=1
Length = 179
Score = 83.6 bits (205), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/167 (38%), Positives = 80/167 (47%), Gaps = 19/167 (11%)
Query: 2 LKNVLSFMSQGNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECT---TQARCWNCRE 58
+ V G C NC GH AREC ++C NC GH ASECT + C+ C
Sbjct: 6 VPTVPQTTRPGPRCYNCGENGHQARECTKGSICYNCNQTGHKASECTEPQQEKTCYACGT 65
Query: 59 PGHMASNCHNE------GICHSCGKTGHRARDCSTH-VQSGGDL---RLCNNCY---KPG 105
GH+ +C + C+ CG+ GH ARDC T+ QSGG R NCY G
Sbjct: 66 AGHLVRDCPSSPNPRQGAECYKCGRVGHIARDCRTNGQQSGGRFGGHRSNMNCYACGSYG 125
Query: 106 HIAADCTNDKACKNCRKTGHIARDCQNEP---VCNLCNIAGHVARQC 149
H A DCT C +C K GH + +CQ +C CN GH+A C
Sbjct: 126 HQARDCTMGVKCYSCGKIGHRSFECQQASDGQLCYKCNQPGHIAVNC 172
Score = 67.0 bits (162), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/178 (29%), Positives = 75/178 (42%), Gaps = 52/178 (29%)
Query: 52 RCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC 111
RC+NC E GH A C IC++C +TGH+A +C+ Q
Sbjct: 18 RCYNCGENGHQARECTKGSICYNCNQTGHKASECTEPQQ--------------------- 56
Query: 112 TNDKACKNCRKTGHIARDCQNEP------VCNLCNIAGHVARQCPKGDSLGERGGGGGGE 165
+K C C GH+ RDC + P C C GH+AR C
Sbjct: 57 --EKTCYACGTAGHLVRDCPSSPNPRQGAECYKCGRVGHIARDCRT-------------- 100
Query: 166 RGGGGGGDGGGGGGRYVGYH-DVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYEC 222
+G GGR+ G+ ++ C +C GH +RDC + C +CG GH ++EC
Sbjct: 101 -------NGQQSGGRFGGHRSNMNCYACGSYGHQARDCTM-GVKCYSCGKIGHRSFEC 150
Score = 61.6 bits (148), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 55/168 (32%), Positives = 76/168 (45%), Gaps = 21/168 (12%)
Query: 72 CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCT---NDKACKNCRKTGHIAR 128
C++CG+ GH+AR+C+ +C NC + GH A++CT +K C C GH+ R
Sbjct: 19 CYNCGENGHQARECT-------KGSICYNCNQTGHKASECTEPQQEKTCYACGTAGHLVR 71
Query: 129 DCQNEP------VCNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDGGGGGGRYV 182
DC + P C C GH+AR C + GG GG R G G +
Sbjct: 72 DCPSSPNPRQGAECYKCGRVGHIARDCRTNGQ--QSGGRFGGHRSNMNCYACGSYGHQAR 129
Query: 183 G-YHDVICRSCNQMGHMSRDC--VGPLIICRNCGGRGHMAYECPSGRI 227
V C SC ++GH S +C +C C GH+A C S I
Sbjct: 130 DCTMGVKCYSCGKIGHRSFECQQASDGQLCYKCNQPGHIAVNCTSPVI 177
Score = 35.0 bits (79), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 25/50 (50%), Gaps = 2/50 (4%)
Query: 188 ICRSCNQMGHMSRDCVGPL--IICRNCGGRGHMAYECPSGRIADRGYRRY 235
IC +CNQ GH + +C P C CG GH+ +CPS +G Y
Sbjct: 37 ICYNCNQTGHKASECTEPQQEKTCYACGTAGHLVRDCPSSPNPRQGAECY 86
>sp|Q3T0Q6|CNBP_BOVIN Cellular nucleic acid-binding protein OS=Bos taurus GN=CNBP PE=2
SV=1
Length = 170
Score = 82.0 bits (201), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/168 (36%), Positives = 81/168 (48%), Gaps = 36/168 (21%)
Query: 12 GNLCNNCKRPGHFARECPNVA----------------------VCNNCGLPGHIASECTT 49
N C C R GH+ARECP +C CG GH+A +C
Sbjct: 3 SNECFKCGRSGHWARECPTGGGRGRGMRSRGRGFQFVSSSLPDICYRCGESGHLAKDCDL 62
Query: 50 QAR-CWNCREPGHMASNCHN-----EGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYK 103
Q C+NC GH+A +C E C++CGK GH ARDC D + C +C +
Sbjct: 63 QEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDH-----ADEQKCYSCGE 117
Query: 104 PGHIAADCTNDKACKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
GHI DCT K C C +TGH+A +C +E C C +GH+AR+C
Sbjct: 118 FGHIQKDCTKVK-CYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 164
Score = 73.6 bits (179), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 76/142 (53%), Gaps = 20/142 (14%)
Query: 6 LSFMSQG--NLCNNCKRPGHFARECP-NVAVCNNCGLPGHIASECTTQAR-----CWNCR 57
F+S ++C C GH A++C C NCG GHIA +C R C+NC
Sbjct: 36 FQFVSSSLPDICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCG 95
Query: 58 EPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC--TN 113
+PGH+A +C +E C+SCG+ GH +DC T V+ C C + GH+A +C T+
Sbjct: 96 KPGHLARDCDHADEQKCYSCGEFGHIQKDC-TKVK-------CYRCGETGHVAINCSKTS 147
Query: 114 DKACKNCRKTGHIARDCQNEPV 135
+ C C ++GH+AR+C E
Sbjct: 148 EVNCYRCGESGHLARECTIEAT 169
Score = 59.7 bits (143), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 50/81 (61%), Gaps = 5/81 (6%)
Query: 15 CNNCKRPGHFARECPNV--AVCNNCGLPGHIASECTTQARCWNCREPGHMASNCH--NEG 70
C NC +PGH AR+C + C +CG GHI +CT + +C+ C E GH+A NC +E
Sbjct: 91 CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCT-KVKCYRCGETGHVAINCSKTSEV 149
Query: 71 ICHSCGKTGHRARDCSTHVQS 91
C+ CG++GH AR+C+ +
Sbjct: 150 NCYRCGESGHLARECTIEATA 170
Score = 32.0 bits (71), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 188 ICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSGR 226
IC C + GH+++DC C NCG GH+A +C +
Sbjct: 46 ICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPK 84
>sp|O42395|CNBP_CHICK Cellular nucleic acid-binding protein OS=Gallus gallus GN=CNBP PE=2
SV=1
Length = 172
Score = 81.3 bits (199), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 81/170 (47%), Gaps = 38/170 (22%)
Query: 12 GNLCNNCKRPGHFARECPNVA-----------------------VCNNCGLPGHIASECT 48
N C C R GH+ARECP +C CG GH+A +C
Sbjct: 3 SNECFKCGRTGHWARECPTGIGRGRGMRSRGRAGFQFMSSSLPDICYRCGESGHLAKDCD 62
Query: 49 TQAR--CWNCREPGHMASNCHN-----EGICHSCGKTGHRARDCSTHVQSGGDLRLCNNC 101
Q C+NC GH+A +C E C++CGK GH ARDC D + C +C
Sbjct: 63 LQEDKACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDH-----ADEQKCYSC 117
Query: 102 YKPGHIAADCTNDKACKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
+ GHI DCT K C C +TGH+A +C +E C C +GH+AR+C
Sbjct: 118 GEFGHIQKDCTKVK-CYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 166
Score = 75.1 bits (183), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 76/143 (53%), Gaps = 21/143 (14%)
Query: 6 LSFMSQG--NLCNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQAR-----CWNC 56
FMS ++C C GH A++C C NCG GHIA +C R C+NC
Sbjct: 37 FQFMSSSLPDICYRCGESGHLAKDCDLQEDKACYNCGRGGHIAKDCKEPKREREQCCYNC 96
Query: 57 REPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC--T 112
+PGH+A +C +E C+SCG+ GH +DC T V+ C C + GH+A +C T
Sbjct: 97 GKPGHLARDCDHADEQKCYSCGEFGHIQKDC-TKVK-------CYRCGETGHVAINCSKT 148
Query: 113 NDKACKNCRKTGHIARDCQNEPV 135
++ C C ++GH+AR+C E
Sbjct: 149 SEVNCYRCGESGHLARECTIEAT 171
Score = 63.2 bits (152), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/174 (30%), Positives = 68/174 (39%), Gaps = 57/174 (32%)
Query: 72 CHSCGKTGHRARDCST----------------HVQSGGDLRLCNNCYKPGHIAADC--TN 113
C CG+TGH AR+C T S +C C + GH+A DC
Sbjct: 6 CFKCGRTGHWARECPTGIGRGRGMRSRGRAGFQFMSSSLPDICYRCGESGHLAKDCDLQE 65
Query: 114 DKACKNCRKTGHIARDCQN-----EPVCNLCNIAGHVARQCPKGDSLGERGGGGGGERGG 168
DKAC NC + GHIA+DC+ E C C GH+AR C D
Sbjct: 66 DKACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHAD--------------- 110
Query: 169 GGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYEC 222
+ C SC + GH+ +DC + C CG GH+A C
Sbjct: 111 -----------------EQKCYSCGEFGHIQKDCTK--VKCYRCGETGHVAINC 145
Score = 59.7 bits (143), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 50/81 (61%), Gaps = 5/81 (6%)
Query: 15 CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQARCWNCREPGHMASNCH--NEG 70
C NC +PGH AR+C + C +CG GHI +CT + +C+ C E GH+A NC +E
Sbjct: 93 CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCT-KVKCYRCGETGHVAINCSKTSEV 151
Query: 71 ICHSCGKTGHRARDCSTHVQS 91
C+ CG++GH AR+C+ +
Sbjct: 152 NCYRCGESGHLARECTIEATA 172
>sp|P62634|CNBP_RAT Cellular nucleic acid-binding protein OS=Rattus norvegicus GN=Cnbp
PE=2 SV=1
Length = 177
Score = 79.7 bits (195), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/175 (35%), Positives = 81/175 (46%), Gaps = 43/175 (24%)
Query: 12 GNLCNNCKRPGHFARECPNVA-----------------------------VCNNCGLPGH 42
N C C R GH+ARECP +C CG GH
Sbjct: 3 SNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGH 62
Query: 43 IASECTTQAR-CWNCREPGHMASNCHN-----EGICHSCGKTGHRARDCSTHVQSGGDLR 96
+A +C Q C+NC GH+A +C E C++CGK GH ARDC D +
Sbjct: 63 LAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDH-----ADEQ 117
Query: 97 LCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
C +C + GHI DCT K C C +TGH+A +C +E C C +GH+AR+C
Sbjct: 118 KCYSCGEFGHIQKDCTKVK-CYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 171
Score = 73.9 bits (180), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 76/142 (53%), Gaps = 20/142 (14%)
Query: 6 LSFMSQG--NLCNNCKRPGHFARECP-NVAVCNNCGLPGHIASECTTQAR-----CWNCR 57
F+S ++C C GH A++C C NCG GHIA +C R C+NC
Sbjct: 43 FQFVSSSLPDICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCG 102
Query: 58 EPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC--TN 113
+PGH+A +C +E C+SCG+ GH +DC T V+ C C + GH+A +C T+
Sbjct: 103 KPGHLARDCDHADEQKCYSCGEFGHIQKDC-TKVK-------CYRCGETGHVAINCSKTS 154
Query: 114 DKACKNCRKTGHIARDCQNEPV 135
+ C C ++GH+AR+C E
Sbjct: 155 EVNCYRCGESGHLARECTIEAT 176
Score = 59.7 bits (143), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 50/81 (61%), Gaps = 5/81 (6%)
Query: 15 CNNCKRPGHFARECPNV--AVCNNCGLPGHIASECTTQARCWNCREPGHMASNCH--NEG 70
C NC +PGH AR+C + C +CG GHI +CT + +C+ C E GH+A NC +E
Sbjct: 98 CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCT-KVKCYRCGETGHVAINCSKTSEV 156
Query: 71 ICHSCGKTGHRARDCSTHVQS 91
C+ CG++GH AR+C+ +
Sbjct: 157 NCYRCGESGHLARECTIEATA 177
Score = 32.0 bits (71), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 188 ICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSGR 226
IC C + GH+++DC C NCG GH+A +C +
Sbjct: 53 ICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPK 91
>sp|Q5R5R5|CNBP_PONAB Cellular nucleic acid-binding protein OS=Pongo abelii GN=CNBP PE=2
SV=1
Length = 177
Score = 79.7 bits (195), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/175 (35%), Positives = 81/175 (46%), Gaps = 43/175 (24%)
Query: 12 GNLCNNCKRPGHFARECPNVA-----------------------------VCNNCGLPGH 42
N C C R GH+ARECP +C CG GH
Sbjct: 3 SNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGH 62
Query: 43 IASECTTQAR-CWNCREPGHMASNCHN-----EGICHSCGKTGHRARDCSTHVQSGGDLR 96
+A +C Q C+NC GH+A +C E C++CGK GH ARDC D +
Sbjct: 63 LAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDH-----ADEQ 117
Query: 97 LCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
C +C + GHI DCT K C C +TGH+A +C +E C C +GH+AR+C
Sbjct: 118 KCYSCGEFGHIQKDCTKVK-CYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 171
Score = 73.9 bits (180), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 76/142 (53%), Gaps = 20/142 (14%)
Query: 6 LSFMSQG--NLCNNCKRPGHFARECP-NVAVCNNCGLPGHIASECTTQAR-----CWNCR 57
F+S ++C C GH A++C C NCG GHIA +C R C+NC
Sbjct: 43 FQFVSSSLPDICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCG 102
Query: 58 EPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC--TN 113
+PGH+A +C +E C+SCG+ GH +DC T V+ C C + GH+A +C T+
Sbjct: 103 KPGHLARDCDHADEQKCYSCGEFGHIQKDC-TKVK-------CYRCGETGHVAINCSKTS 154
Query: 114 DKACKNCRKTGHIARDCQNEPV 135
+ C C ++GH+AR+C E
Sbjct: 155 EVNCYRCGESGHLARECTIEAT 176
Score = 59.7 bits (143), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 50/81 (61%), Gaps = 5/81 (6%)
Query: 15 CNNCKRPGHFARECPNV--AVCNNCGLPGHIASECTTQARCWNCREPGHMASNCH--NEG 70
C NC +PGH AR+C + C +CG GHI +CT + +C+ C E GH+A NC +E
Sbjct: 98 CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCT-KVKCYRCGETGHVAINCSKTSEV 156
Query: 71 ICHSCGKTGHRARDCSTHVQS 91
C+ CG++GH AR+C+ +
Sbjct: 157 NCYRCGESGHLARECTIEATA 177
Score = 32.0 bits (71), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 188 ICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSGR 226
IC C + GH+++DC C NCG GH+A +C +
Sbjct: 53 ICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPK 91
>sp|P62633|CNBP_HUMAN Cellular nucleic acid-binding protein OS=Homo sapiens GN=CNBP PE=1
SV=1
Length = 177
Score = 79.7 bits (195), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/175 (35%), Positives = 81/175 (46%), Gaps = 43/175 (24%)
Query: 12 GNLCNNCKRPGHFARECPNVA-----------------------------VCNNCGLPGH 42
N C C R GH+ARECP +C CG GH
Sbjct: 3 SNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGH 62
Query: 43 IASECTTQAR-CWNCREPGHMASNCHN-----EGICHSCGKTGHRARDCSTHVQSGGDLR 96
+A +C Q C+NC GH+A +C E C++CGK GH ARDC D +
Sbjct: 63 LAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDH-----ADEQ 117
Query: 97 LCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
C +C + GHI DCT K C C +TGH+A +C +E C C +GH+AR+C
Sbjct: 118 KCYSCGEFGHIQKDCTKVK-CYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 171
Score = 73.9 bits (180), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 76/142 (53%), Gaps = 20/142 (14%)
Query: 6 LSFMSQG--NLCNNCKRPGHFARECP-NVAVCNNCGLPGHIASECTTQAR-----CWNCR 57
F+S ++C C GH A++C C NCG GHIA +C R C+NC
Sbjct: 43 FQFVSSSLPDICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCG 102
Query: 58 EPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC--TN 113
+PGH+A +C +E C+SCG+ GH +DC T V+ C C + GH+A +C T+
Sbjct: 103 KPGHLARDCDHADEQKCYSCGEFGHIQKDC-TKVK-------CYRCGETGHVAINCSKTS 154
Query: 114 DKACKNCRKTGHIARDCQNEPV 135
+ C C ++GH+AR+C E
Sbjct: 155 EVNCYRCGESGHLARECTIEAT 176
Score = 59.7 bits (143), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 50/81 (61%), Gaps = 5/81 (6%)
Query: 15 CNNCKRPGHFARECPNV--AVCNNCGLPGHIASECTTQARCWNCREPGHMASNCH--NEG 70
C NC +PGH AR+C + C +CG GHI +CT + +C+ C E GH+A NC +E
Sbjct: 98 CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCT-KVKCYRCGETGHVAINCSKTSEV 156
Query: 71 ICHSCGKTGHRARDCSTHVQS 91
C+ CG++GH AR+C+ +
Sbjct: 157 NCYRCGESGHLARECTIEATA 177
Score = 32.0 bits (71), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 188 ICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSGR 226
IC C + GH+++DC C NCG GH+A +C +
Sbjct: 53 ICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPK 91
>sp|P53996|CNBP_MOUSE Cellular nucleic acid-binding protein OS=Mus musculus GN=Cnbp PE=2
SV=2
Length = 178
Score = 79.3 bits (194), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/176 (35%), Positives = 81/176 (46%), Gaps = 44/176 (25%)
Query: 12 GNLCNNCKRPGHFARECPNVA-----------------------------VCNNCGLPGH 42
N C C R GH+ARECP +C CG GH
Sbjct: 3 SNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGH 62
Query: 43 IASECTTQAR--CWNCREPGHMASNCHN-----EGICHSCGKTGHRARDCSTHVQSGGDL 95
+A +C Q C+NC GH+A +C E C++CGK GH ARDC D
Sbjct: 63 LAKDCDLQEDEACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDH-----ADE 117
Query: 96 RLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
+ C +C + GHI DCT K C C +TGH+A +C +E C C +GH+AR+C
Sbjct: 118 QKCYSCGEFGHIQKDCTKVK-CYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 172
Score = 73.6 bits (179), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/143 (36%), Positives = 76/143 (53%), Gaps = 21/143 (14%)
Query: 6 LSFMSQG--NLCNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQAR-----CWNC 56
F+S ++C C GH A++C C NCG GHIA +C R C+NC
Sbjct: 43 FQFVSSSLPDICYRCGESGHLAKDCDLQEDEACYNCGRGGHIAKDCKEPKREREQCCYNC 102
Query: 57 REPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC--T 112
+PGH+A +C +E C+SCG+ GH +DC T V+ C C + GH+A +C T
Sbjct: 103 GKPGHLARDCDHADEQKCYSCGEFGHIQKDC-TKVK-------CYRCGETGHVAINCSKT 154
Query: 113 NDKACKNCRKTGHIARDCQNEPV 135
++ C C ++GH+AR+C E
Sbjct: 155 SEVNCYRCGESGHLARECTIEAT 177
Score = 59.7 bits (143), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 50/81 (61%), Gaps = 5/81 (6%)
Query: 15 CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQARCWNCREPGHMASNCH--NEG 70
C NC +PGH AR+C + C +CG GHI +CT + +C+ C E GH+A NC +E
Sbjct: 99 CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCT-KVKCYRCGETGHVAINCSKTSEV 157
Query: 71 ICHSCGKTGHRARDCSTHVQS 91
C+ CG++GH AR+C+ +
Sbjct: 158 NCYRCGESGHLARECTIEATA 178
Score = 58.2 bits (139), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 51/180 (28%), Positives = 68/180 (37%), Gaps = 63/180 (35%)
Query: 72 CHSCGKTGHRARDCST----------------------HVQSGGDLRLCNNCYKPGHIAA 109
C CG++GH AR+C T S +C C + GH+A
Sbjct: 6 CFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGHLAK 65
Query: 110 DC--TNDKACKNCRKTGHIARDCQN-----EPVCNLCNIAGHVARQCPKGDSLGERGGGG 162
DC D+AC NC + GHIA+DC+ E C C GH+AR C D
Sbjct: 66 DCDLQEDEACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHAD--------- 116
Query: 163 GGERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYEC 222
+ C SC + GH+ +DC + C CG GH+A C
Sbjct: 117 -----------------------EQKCYSCGEFGHIQKDCTK--VKCYRCGETGHVAINC 151
>sp|O76743|GLH4_CAEEL ATP-dependent RNA helicase glh-4 OS=Caenorhabditis elegans GN=glh-4
PE=2 SV=2
Length = 1156
Score = 72.8 bits (177), Expect = 2e-12, Method: Composition-based stats.
Identities = 42/116 (36%), Positives = 54/116 (46%), Gaps = 18/116 (15%)
Query: 34 CNNCGLPGHIASEC----TTQARCWNCREPGHMASNCHN----EGICHSCGKTGHRARDC 85
C+NCG GHI+ EC + C NC + GH AS+C G C +CG GH A DC
Sbjct: 572 CHNCGEEGHISKECDKPKVPRFPCRNCEQLGHFASDCDQPRVPRGPCRNCGIEGHFAVDC 631
Query: 86 STHVQSGGDLRLCNNCYKPGHIAADCTNDKA-------CKNCRKTGHIARDCQNEP 134
G C NC + GH A DC N++ C+ C + GH +C P
Sbjct: 632 DQPKVPRGP---CRNCGQEGHFAKDCQNERVRMEPTEPCRRCAEEGHWGYECPTRP 684
Score = 70.9 bits (172), Expect = 7e-12, Method: Composition-based stats.
Identities = 42/113 (37%), Positives = 51/113 (45%), Gaps = 18/113 (15%)
Query: 53 CWNCREPGHMASNCHNEGI----CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIA 108
C NC E GH++ C + C +C + GH A DC G C NC GH A
Sbjct: 572 CHNCGEEGHISKECDKPKVPRFPCRNCEQLGHFASDCDQPRVPRGP---CRNCGIEGHFA 628
Query: 109 ADCTNDKA----CKNCRKTGHIARDCQNEPV-------CNLCNIAGHVARQCP 150
DC K C+NC + GH A+DCQNE V C C GH +CP
Sbjct: 629 VDCDQPKVPRGPCRNCGQEGHFAKDCQNERVRMEPTEPCRRCAEEGHWGYECP 681
Score = 66.6 bits (161), Expect = 1e-10, Method: Composition-based stats.
Identities = 45/148 (30%), Positives = 57/148 (38%), Gaps = 54/148 (36%)
Query: 96 RLCNNCYKPGHIAADCTNDKA----CKNCRKTGHIARDCQNEPV----CNLCNIAGHVAR 147
R C+NC + GHI+ +C K C+NC + GH A DC V C C I GH A
Sbjct: 570 RGCHNCGEEGHISKECDKPKVPRFPCRNCEQLGHFASDCDQPRVPRGPCRNCGIEGHFAV 629
Query: 148 QC-----PKGDSLGERGGGGGGERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDC 202
C P+G CR+C Q GH ++DC
Sbjct: 630 DCDQPKVPRGP-----------------------------------CRNCGQEGHFAKDC 654
Query: 203 ------VGPLIICRNCGGRGHMAYECPS 224
+ P CR C GH YECP+
Sbjct: 655 QNERVRMEPTEPCRRCAEEGHWGYECPT 682
Score = 61.6 bits (148), Expect = 5e-09, Method: Composition-based stats.
Identities = 32/88 (36%), Positives = 43/88 (48%), Gaps = 15/88 (17%)
Query: 15 CNNCKRPGHFARECPNVAV----CNNCGLPGHIASEC----TTQARCWNCREPGHMASNC 66
C NC++ GHFA +C V C NCG+ GH A +C + C NC + GH A +C
Sbjct: 595 CRNCEQLGHFASDCDQPRVPRGPCRNCGIEGHFAVDCDQPKVPRGPCRNCGQEGHFAKDC 654
Query: 67 HNEGI-------CHSCGKTGHRARDCST 87
NE + C C + GH +C T
Sbjct: 655 QNERVRMEPTEPCRRCAEEGHWGYECPT 682
Score = 42.7 bits (99), Expect = 0.002, Method: Composition-based stats.
Identities = 18/42 (42%), Positives = 25/42 (59%), Gaps = 3/42 (7%)
Query: 189 CRSCNQMGHMSRDCVGPLI---ICRNCGGRGHMAYECPSGRI 227
CR+C Q+GH + DC P + CRNCG GH A +C ++
Sbjct: 595 CRNCEQLGHFASDCDQPRVPRGPCRNCGIEGHFAVDCDQPKV 636
Score = 35.8 bits (81), Expect = 0.25, Method: Composition-based stats.
Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 7/45 (15%)
Query: 15 CNNCKRPGHFARECPNVAV-------CNNCGLPGHIASECTTQAR 52
C NC + GHFA++C N V C C GH EC T+ +
Sbjct: 641 CRNCGQEGHFAKDCQNERVRMEPTEPCRRCAEEGHWGYECPTRPK 685
>sp|O65639|CSP1_ARATH Cold shock protein 1 OS=Arabidopsis thaliana GN=CSP1 PE=2 SV=1
Length = 299
Score = 72.0 bits (175), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 70/167 (41%), Gaps = 52/167 (31%)
Query: 15 CNNCKRPGHFARECPNVAV-------------CNNCGLPGHIASECTTQAR--------- 52
C NC GHFAR+C + C CG GH+A +CT ++
Sbjct: 134 CYNCGDTGHFARDCTSAGNGDQRGATKGGNDGCYTCGDVGHVARDCTQKSVGNGDQRGAV 193
Query: 53 ------CWNCREPGHMASNCHNE-------------GICHSCGKTGHRARDCSTHVQSGG 93
C+ C + GH A +C + G C+SCG GH ARDC+T Q
Sbjct: 194 KGGNDGCYTCGDVGHFARDCTQKVAAGNVRSGGGGSGTCYSCGGVGHIARDCATKRQP-- 251
Query: 94 DLRLCNNCYKPGHIAADCT--------NDKACKNCRKTGHIARDCQN 132
R C C GH+A DC ND AC C K GH AR+C +
Sbjct: 252 -SRGCYQCGGSGHLARDCDQRGSGGGGNDNACYKCGKEGHFARECSS 297
Score = 70.5 bits (171), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 85/220 (38%), Gaps = 77/220 (35%)
Query: 34 CNNCGLPGHIASEC-------------TTQARCWNCREPGHMASNCHNEGI--------- 71
C NCG GHI+ +C C+NC + GH A +C + G
Sbjct: 102 CYNCGELGHISKDCGIGGGGGGGERRSRGGEGCYNCGDTGHFARDCTSAGNGDQRGATKG 161
Query: 72 ----CHSCGKTGHRARDCSTHVQSGGDLR--------LCNNCYKPGHIAADCTNDKACKN 119
C++CG GH ARDC+ GD R C C GH A DCT A N
Sbjct: 162 GNDGCYTCGDVGHVARDCTQKSVGNGDQRGAVKGGNDGCYTCGDVGHFARDCTQKVAAGN 221
Query: 120 CRK-------------TGHIARDC--QNEPV--CNLCNIAGHVARQCPKGDSLGERGGGG 162
R GHIARDC + +P C C +GH+AR C
Sbjct: 222 VRSGGGGSGTCYSCGGVGHIARDCATKRQPSRGCYQCGGSGHLARDCD------------ 269
Query: 163 GGERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDC 202
+RG GGGG+ D C C + GH +R+C
Sbjct: 270 --QRGSGGGGN------------DNACYKCGKEGHFAREC 295
Score = 63.9 bits (154), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 78/202 (38%), Gaps = 65/202 (32%)
Query: 53 CWNCREPGHMASNCHNEGI-------------CHSCGKTGHRARDCSTHVQSGGDLRLCN 99
C+NC E GH++ +C G C++CG TGH ARDC++ GD R
Sbjct: 102 CYNCGELGHISKDCGIGGGGGGGERRSRGGEGCYNCGDTGHFARDCTS--AGNGDQR--- 156
Query: 100 NCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPV---------------CNLCNIAGH 144
A + C C GH+ARDC + V C C GH
Sbjct: 157 --------GATKGGNDGCYTCGDVGHVARDCTQKSVGNGDQRGAVKGGNDGCYTCGDVGH 208
Query: 145 VARQCPKGDSLGERGGGGGGERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDCVG 204
AR C + + G GGGG C SC +GH++RDC
Sbjct: 209 FARDCTQKVAAGNVRSGGGGSG---------------------TCYSCGGVGHIARDCAT 247
Query: 205 ---PLIICRNCGGRGHMAYECP 223
P C CGG GH+A +C
Sbjct: 248 KRQPSRGCYQCGGSGHLARDCD 269
Score = 38.9 bits (89), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 60/154 (38%), Gaps = 40/154 (25%)
Query: 117 CKNCRKTGHIARDCQNEPV-------------CNLCNIAGHVARQCPKGDSLGERGGGGG 163
C NC + GHI++DC C C GH AR C + +RG G
Sbjct: 102 CYNCGELGHISKDCGIGGGGGGGERRSRGGEGCYNCGDTGHFARDCTSAGNGDQRGATKG 161
Query: 164 GERGGGGGGDGG-----------GGG---GRYVGYHDVICRSCNQMGHMSRDCVGPLI-- 207
G G GD G G G G G +D C +C +GH +RDC +
Sbjct: 162 GNDGCYTCGDVGHVARDCTQKSVGNGDQRGAVKGGNDG-CYTCGDVGHFARDCTQKVAAG 220
Query: 208 ----------ICRNCGGRGHMAYECPSGRIADRG 231
C +CGG GH+A +C + R RG
Sbjct: 221 NVRSGGGGSGTCYSCGGVGHIARDCATKRQPSRG 254
Score = 36.2 bits (82), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 38/88 (43%), Gaps = 15/88 (17%)
Query: 4 NVLSFMSQGNLCNNCKRPGHFARECPNVAV----CNNCGLPGHIASECTTQARCWNCREP 59
NV S C +C GH AR+C C CG GH+A +C +
Sbjct: 221 NVRSGGGGSGTCYSCGGVGHIARDCATKRQPSRGCYQCGGSGHLARDCDQRG-------- 272
Query: 60 GHMASNCHNEGICHSCGKTGHRARDCST 87
+ N+ C+ CGK GH AR+CS+
Sbjct: 273 ---SGGGGNDNACYKCGKEGHFARECSS 297
>sp|Q8T8R1|Y3800_DROME CCHC-type zinc finger protein CG3800 OS=Drosophila melanogaster
GN=CG3800 PE=1 SV=1
Length = 165
Score = 65.1 bits (157), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 36/161 (22%)
Query: 9 MSQGNLCNNCKRPGHFARECP-------------------------------NVAVCNNC 37
MS C C RPGHFAR+C N C C
Sbjct: 1 MSMSATCYKCNRPGHFARDCSLGGGGGPGGVGGGGGGGGGGMRGNDGGGMRRNREKCYKC 60
Query: 38 GLPGHIASECTTQA-RCWNCREPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGD 94
GH A C +A RC+ C GH++ +C + C+ C KTGH R+C V G
Sbjct: 61 NQFGHFARACPEEAERCYRCNGIGHISKDCTQADNPTCYRCNKTGHWVRNCPEAVNERGP 120
Query: 95 LRL-CNNCYKPGHIAADC-TNDKACKNCRKTGHIARDCQNE 133
+ C C + GHI+ +C K C C K+GH+ R+C +
Sbjct: 121 TNVSCYKCNRTGHISKNCPETSKTCYGCGKSGHLRRECDEK 161
Score = 57.8 bits (138), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 50/180 (27%), Positives = 68/180 (37%), Gaps = 53/180 (29%)
Query: 70 GICHSCGKTGHRARDCSTHVQSGG------------------------DLRLCNNCYKPG 105
C+ C + GH ARDCS G + C C + G
Sbjct: 5 ATCYKCNRPGHFARDCSLGGGGGPGGVGGGGGGGGGGMRGNDGGGMRRNREKCYKCNQFG 64
Query: 106 HIAADCTND-KACKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQCPKGDSLGERGGGG 162
H A C + + C C GHI++DC + P C CN GH R CP +++ ERG
Sbjct: 65 HFARACPEEAERCYRCNGIGHISKDCTQADNPTCYRCNKTGHWVRNCP--EAVNERGPT- 121
Query: 163 GGERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYEC 222
+V C CN+ GH+S++C C CG GH+ EC
Sbjct: 122 -----------------------NVSCYKCNRTGHISKNCPETSKTCYGCGKSGHLRREC 158
Score = 30.8 bits (68), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 14/37 (37%), Positives = 18/37 (48%)
Query: 189 CRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSG 225
C CNQ GH +R C C C G GH++ +C
Sbjct: 57 CYKCNQFGHFARACPEEAERCYRCNGIGHISKDCTQA 93
>sp|Q8WW36|ZCH13_HUMAN Zinc finger CCHC domain-containing protein 13 OS=Homo sapiens
GN=ZCCHC13 PE=2 SV=1
Length = 166
Score = 55.5 bits (132), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/108 (38%), Positives = 55/108 (50%), Gaps = 6/108 (5%)
Query: 48 TTQARCWNCREPGHMASNCHNEG-ICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGH 106
T C+ C E G A NC G IC++CG++GH A+DC + + C C + GH
Sbjct: 42 TLSYTCYCCGESGRNAKNCVLLGNICYNCGRSGHIAKDCKDPKRE--RRQHCYTCGRLGH 99
Query: 107 IAADCTNDK--ACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPKG 152
+A DC K C +C K GHI +DC + C C GHVA C K
Sbjct: 100 LARDCDRQKEQKCYSCGKLGHIQKDCA-QVKCYRCGEIGHVAINCSKA 146
Score = 51.6 bits (122), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 51/91 (56%), Gaps = 8/91 (8%)
Query: 3 KNVLSFMSQGNLCNNCKRPGHFARECPNVAV-----CNNCGLPGHIASECTTQA--RCWN 55
+N + + GN+C NC R GH A++C + C CG GH+A +C Q +C++
Sbjct: 55 RNAKNCVLLGNICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQKCYS 114
Query: 56 CREPGHMASNCHNEGICHSCGKTGHRARDCS 86
C + GH+ +C + C+ CG+ GH A +CS
Sbjct: 115 CGKLGHIQKDC-AQVKCYRCGEIGHVAINCS 144
Score = 50.8 bits (120), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 52/109 (47%), Gaps = 16/109 (14%)
Query: 15 CNNCKRPGHFARECPNVA-VCNNCGLPGHIASECTTQAR-----CWNCREPGHMASNC-- 66
C C G A+ C + +C NCG GHIA +C R C+ C GH+A +C
Sbjct: 47 CYCCGESGRNAKNCVLLGNICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDR 106
Query: 67 HNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDK 115
E C+SCGK GH +DC+ C C + GH+A +C+ +
Sbjct: 107 QKEQKCYSCGKLGHIQKDCAQ--------VKCYRCGEIGHVAINCSKAR 147
Score = 38.9 bits (89), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 45/115 (39%), Gaps = 39/115 (33%)
Query: 117 CKNCRKTGHIARDCQNEPV-----CNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGG 171
C NC ++GHIA+DC++ C C GH+AR C +
Sbjct: 67 CYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQK------------------ 108
Query: 172 GDGGGGGGRYVGYHDVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSGR 226
+ C SC ++GH+ +DC + C CG GH+A C R
Sbjct: 109 --------------EQKCYSCGKLGHIQKDCAQ--VKCYRCGEIGHVAINCSKAR 147
>sp|Q94C69|CSP3_ARATH Cold shock domain-containing protein 3 OS=Arabidopsis thaliana
GN=CSP3 PE=2 SV=1
Length = 301
Score = 53.5 bits (127), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 72/207 (34%), Gaps = 67/207 (32%)
Query: 11 QGNLCNNCKRPGHFARECP-----------------NVAVCNNCGLPGHIASECTTQAR- 52
G C NC GH A++C C CG GH A +C
Sbjct: 92 SGGNCFNCGEVGHMAKDCDGGSGGKSFGGGGGRRSGGEGECYMCGDVGHFARDCRQSGGG 151
Query: 53 -----------CWNCREPGHMASNC----------------HNEGICHSCGKTGHRARDC 85
C++C E GH+A +C C+ CG GH ARDC
Sbjct: 152 NSGGGGGGGRPCYSCGEVGHLAKDCRGGSGGNRYGGGGGRGSGGDGCYMCGGVGHFARDC 211
Query: 86 --STHVQSGGDLRLCNNCYKPGHIAADCTND---------KACKNCRKTGHIARDCQNEP 134
+ GG C C GHIA CT+ +AC C TGH+ARDC
Sbjct: 212 RQNGGGNVGGGGSTCYTCGGVGHIAKVCTSKIPSGGGGGGRACYECGGTGHLARDCDRRG 271
Query: 135 V-----------CNLCNIAGHVARQCP 150
C +C GH AR+C
Sbjct: 272 SGSSGGGGGSNKCFICGKEGHFARECT 298
Score = 44.3 bits (103), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 75/233 (32%), Gaps = 90/233 (38%)
Query: 53 CWNCREPGHMASNCHNEGI-----------------CHSCGKTGHRARDC-----STHVQ 90
C+NC E GHMA +C C+ CG GH ARDC
Sbjct: 96 CFNCGEVGHMAKDCDGGSGGKSFGGGGGRRSGGEGECYMCGDVGHFARDCRQSGGGNSGG 155
Query: 91 SGGDLRLCNNCYKPGHIAADC----------------TNDKACKNCRKTGHIARDCQN-- 132
GG R C +C + GH+A DC + C C GH ARDC+
Sbjct: 156 GGGGGRPCYSCGEVGHLAKDCRGGSGGNRYGGGGGRGSGGDGCYMCGGVGHFARDCRQNG 215
Query: 133 -------EPVCNLCNIAGHVARQC----PKGDSLGERGGGGGGERGGGGGGDGGGGGGRY 181
C C GH+A+ C P G G R
Sbjct: 216 GGNVGGGGSTCYTCGGVGHIAKVCTSKIPSGGGGGGRA---------------------- 253
Query: 182 VGYHDVICRSCNQMGHMSRDC----------VGPLIICRNCGGRGHMAYECPS 224
C C GH++RDC G C CG GH A EC S
Sbjct: 254 -------CYECGGTGHLARDCDRRGSGSSGGGGGSNKCFICGKEGHFARECTS 299
Score = 42.0 bits (97), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 43/102 (42%), Gaps = 29/102 (28%)
Query: 15 CNNCKRPGHFARECPN---------VAVCNNCGLPGHIASECTTQA---------RCWNC 56
C C GHFAR+C + C CG GHIA CT++ C+ C
Sbjct: 198 CYMCGGVGHFARDCRQNGGGNVGGGGSTCYTCGGVGHIAKVCTSKIPSGGGGGGRACYEC 257
Query: 57 REPGHMASNCHNEGI-----------CHSCGKTGHRARDCST 87
GH+A +C G C CGK GH AR+C++
Sbjct: 258 GGTGHLARDCDRRGSGSSGGGGGSNKCFICGKEGHFARECTS 299
>sp|Q05313|GAG_FIVWO Gag polyprotein OS=Feline immunodeficiency virus (isolate Wo)
GN=gag PE=3 SV=1
Length = 450
Score = 53.5 bits (127), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/44 (54%), Positives = 29/44 (65%), Gaps = 6/44 (13%)
Query: 12 GNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWN 55
G +C NCKRPGH AR+C +V CN CG PGH+A A+CW
Sbjct: 374 GPVCFNCKRPGHLARQCRDVKKCNKCGKPGHLA------AKCWQ 411
Score = 43.5 bits (101), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 2 LKNVLSFMSQGNLCNNCKRP-GHFAREC---PNVAVCNNCGLPGH---IASECTTQAR-- 52
LK LS N +CK+ H E + C G PG+ + +E T+ +
Sbjct: 312 LKQSLSI---ANANADCKKAMSHLKPESTLEEKLRACQEIGFPGYKMQLLAEALTKVQVV 368
Query: 53 --------CWNCREPGHMASNCHNEGICHSCGKTGHRARDC 85
C+NC+ PGH+A C + C+ CGK GH A C
Sbjct: 369 QSKGPGPVCFNCKRPGHLARQCRDVKKCNKCGKPGHLAAKC 409
Score = 41.2 bits (95), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 39/90 (43%), Gaps = 10/90 (11%)
Query: 41 GHIASECTTQARCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNN 100
H+ E T + + C+E G + + + K VQS G +C N
Sbjct: 330 SHLKPESTLEEKLRACQEIGF--PGYKMQLLAEALTKV--------QVVQSKGPGPVCFN 379
Query: 101 CYKPGHIAADCTNDKACKNCRKTGHIARDC 130
C +PGH+A C + K C C K GH+A C
Sbjct: 380 CKRPGHLARQCRDVKKCNKCGKPGHLAAKC 409
Score = 35.8 bits (81), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 24/37 (64%)
Query: 116 ACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPKG 152
C NC++ GH+AR C++ CN C GH+A +C +G
Sbjct: 376 VCFNCKRPGHLARQCRDVKKCNKCGKPGHLAAKCWQG 412
>sp|P19027|GAG_FIVSD Gag polyprotein OS=Feline immunodeficiency virus (strain San Diego)
GN=gag PE=3 SV=1
Length = 450
Score = 52.4 bits (124), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 29/44 (65%), Gaps = 6/44 (13%)
Query: 12 GNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWN 55
G +C NCK+PGH AR+C +V CN CG PGH+A A+CW
Sbjct: 374 GPVCFNCKKPGHLARQCRDVKKCNKCGKPGHLA------AKCWQ 411
Score = 42.7 bits (99), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 32/65 (49%), Gaps = 6/65 (9%)
Query: 72 CHSCGKTGHRARDCSTH------VQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGH 125
C G G++ + + VQS G +C NC KPGH+A C + K C C K GH
Sbjct: 345 CQEIGSPGYKMQLLAEALTKVQVVQSKGSGPVCFNCKKPGHLARQCRDVKKCNKCGKPGH 404
Query: 126 IARDC 130
+A C
Sbjct: 405 LAAKC 409
Score = 42.4 bits (98), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 33/65 (50%), Gaps = 13/65 (20%)
Query: 34 CNNCGLPGH---IASECTTQAR----------CWNCREPGHMASNCHNEGICHSCGKTGH 80
C G PG+ + +E T+ + C+NC++PGH+A C + C+ CGK GH
Sbjct: 345 CQEIGSPGYKMQLLAEALTKVQVVQSKGSGPVCFNCKKPGHLARQCRDVKKCNKCGKPGH 404
Query: 81 RARDC 85
A C
Sbjct: 405 LAAKC 409
Score = 38.1 bits (87), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 37/82 (45%), Gaps = 3/82 (3%)
Query: 80 HRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNLC 139
+ R C G ++L + + C NC+K GH+AR C++ CN C
Sbjct: 340 EKLRACQEIGSPGYKMQLLAEALTKVQVVQSKGSGPVCFNCKKPGHLARQCRDVKKCNKC 399
Query: 140 NIAGHVARQCPKGDSLGERGGG 161
GH+A +C +G G+R G
Sbjct: 400 GKPGHLAAKCWQG---GKRNSG 418
>sp|P16087|GAG_FIVPE Gag polyprotein OS=Feline immunodeficiency virus (isolate Petaluma)
GN=gag PE=3 SV=1
Length = 450
Score = 51.6 bits (122), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 6/44 (13%)
Query: 12 GNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWN 55
G +C NCK+PGH AR+C V CN CG PGH+A A+CW
Sbjct: 374 GPVCFNCKKPGHLARQCREVKKCNKCGKPGHLA------AKCWQ 411
Score = 42.4 bits (98), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 31/65 (47%), Gaps = 6/65 (9%)
Query: 72 CHSCGKTGHRARDCSTH------VQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGH 125
C G G++ + + VQS G +C NC KPGH+A C K C C K GH
Sbjct: 345 CQEIGSPGYKMQLLAEALTKVQVVQSKGSGPVCFNCKKPGHLARQCREVKKCNKCGKPGH 404
Query: 126 IARDC 130
+A C
Sbjct: 405 LAAKC 409
Score = 42.0 bits (97), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 2 LKNVLSFMSQGNLCNNCKRP-GHFAREC---PNVAVCNNCGLPGH---IASECTTQAR-- 52
LK LS N +CK+ H E + C G PG+ + +E T+ +
Sbjct: 312 LKQSLSI---ANANADCKKAMSHLKPESTLEEKLRACQEIGSPGYKMQLLAEALTKVQVV 368
Query: 53 --------CWNCREPGHMASNCHNEGICHSCGKTGHRARDC 85
C+NC++PGH+A C C+ CGK GH A C
Sbjct: 369 QSKGSGPVCFNCKKPGHLARQCREVKKCNKCGKPGHLAAKC 409
Score = 37.7 bits (86), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 21/74 (28%), Positives = 33/74 (44%)
Query: 80 HRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNLC 139
+ R C G ++L + + C NC+K GH+AR C+ CN C
Sbjct: 340 EKLRACQEIGSPGYKMQLLAEALTKVQVVQSKGSGPVCFNCKKPGHLARQCREVKKCNKC 399
Query: 140 NIAGHVARQCPKGD 153
GH+A +C +G+
Sbjct: 400 GKPGHLAAKCWQGN 413
>sp|P03354|POL_RSVP Gag-Pro-Pol polyprotein OS=Rous sarcoma virus (strain Prague C)
GN=gag-pro-pol PE=1 SV=2
Length = 1603
Score = 50.8 bits (120), Expect = 9e-06, Method: Composition-based stats.
Identities = 35/96 (36%), Positives = 40/96 (41%), Gaps = 22/96 (22%)
Query: 80 HRARDCSTHVQSGGDLR-LCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNL 138
+R RD T SGG R LC C PGH A C + N R+ C L
Sbjct: 492 NRERDGQT--GSGGRARGLCYTCGSPGHYQAQCPKKRKSGNSRER------------CQL 537
Query: 139 CNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDG 174
CN GH A+QC K D G G+R G G G
Sbjct: 538 CNGMGHNAKQCRKRD-------GNQGQRPGKGLSSG 566
Score = 40.0 bits (92), Expect = 0.014, Method: Composition-based stats.
Identities = 18/57 (31%), Positives = 22/57 (38%)
Query: 55 NCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC 111
N G S G+C++CG GH C +SG C C GH A C
Sbjct: 492 NRERDGQTGSGGRARGLCYTCGSPGHYQAQCPKKRKSGNSRERCQLCNGMGHNAKQC 548
Score = 37.0 bits (84), Expect = 0.11, Method: Composition-based stats.
Identities = 16/54 (29%), Positives = 22/54 (40%), Gaps = 12/54 (22%)
Query: 32 AVCNNCGLPGHIASECTTQARCWNCREPGHMASNCHNEGICHSCGKTGHRARDC 85
+C CG PGH ++C + + N RE C C GH A+ C
Sbjct: 507 GLCYTCGSPGHYQAQCPKKRKSGNSRER------------CQLCNGMGHNAKQC 548
Score = 33.9 bits (76), Expect = 0.97, Method: Composition-based stats.
Identities = 16/53 (30%), Positives = 20/53 (37%), Gaps = 12/53 (22%)
Query: 14 LCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWNCREPGHMASNC 66
LC C PGH+ +CP N ++ RC C GH A C
Sbjct: 508 LCYTCGSPGHYQAQCPKKRKSGN------------SRERCQLCNGMGHNAKQC 548
Score = 33.5 bits (75), Expect = 1.4, Method: Composition-based stats.
Identities = 22/87 (25%), Positives = 29/87 (33%), Gaps = 27/87 (31%)
Query: 116 ACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDGG 175
A N + G + +C C GH QCPK +R G ER
Sbjct: 489 AVVNRERDGQTGSGGRARGLCYTCGSPGHYQAQCPK-----KRKSGNSRER--------- 534
Query: 176 GGGGRYVGYHDVICRSCNQMGHMSRDC 202
C+ CN MGH ++ C
Sbjct: 535 -------------CQLCNGMGHNAKQC 548
>sp|Q1A267|POL_SIVMB Gag-Pol polyprotein OS=Simian immunodeficiency virus (isolate MB66)
GN=gag-pol PE=3 SV=4
Length = 1438
Score = 50.1 bits (118), Expect = 1e-05, Method: Composition-based stats.
Identities = 25/63 (39%), Positives = 29/63 (46%), Gaps = 5/63 (7%)
Query: 9 MSQGNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQAR--CWNCREPGHMASNC 66
MSQ N+ KR + + C NCG GHIA C R CW C + GH NC
Sbjct: 365 MSQAQHSNDAKRQFKGPKR---IVKCFNCGKEGHIARNCKAPRRKGCWKCGQEGHQMRNC 421
Query: 67 HNE 69
NE
Sbjct: 422 TNE 424
Score = 41.6 bits (96), Expect = 0.005, Method: Composition-based stats.
Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 5/52 (9%)
Query: 72 CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKT 123
C +CGK GH AR+C + G C C + GH +CTN++ R+T
Sbjct: 387 CFNCGKEGHIARNCKAPRRKG-----CWKCGQEGHQMRNCTNERQANFFRET 433
Score = 41.6 bits (96), Expect = 0.005, Method: Composition-based stats.
Identities = 19/43 (44%), Positives = 22/43 (51%), Gaps = 2/43 (4%)
Query: 98 CNNCYKPGHIAADCT--NDKACKNCRKTGHIARDCQNEPVCNL 138
C NC K GHIA +C K C C + GH R+C NE N
Sbjct: 387 CFNCGKEGHIARNCKAPRRKGCWKCGQEGHQMRNCTNERQANF 429
Score = 37.4 bits (85), Expect = 0.091, Method: Composition-based stats.
Identities = 18/46 (39%), Positives = 21/46 (45%), Gaps = 2/46 (4%)
Query: 15 CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQARCWNCRE 58
C NC + GH AR C P C CG GH CT + + RE
Sbjct: 387 CFNCGKEGHIARNCKAPRRKGCWKCGQEGHQMRNCTNERQANFFRE 432
Score = 35.4 bits (80), Expect = 0.31, Method: Composition-based stats.
Identities = 16/44 (36%), Positives = 23/44 (52%), Gaps = 1/44 (2%)
Query: 187 VICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYECPSGRIAD 229
V C +C + GH++R+C P C CG GH C + R A+
Sbjct: 385 VKCFNCGKEGHIARNCKAPRRKGCWKCGQEGHQMRNCTNERQAN 428
Score = 35.4 bits (80), Expect = 0.37, Method: Composition-based stats.
Identities = 16/35 (45%), Positives = 18/35 (51%), Gaps = 2/35 (5%)
Query: 117 CKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
C NC K GHIAR+C+ C C GH R C
Sbjct: 387 CFNCGKEGHIARNCKAPRRKGCWKCGQEGHQMRNC 421
>sp|Q82851|POL_JEMBR Gag-Pol polyprotein OS=Jembrana disease virus GN=gag-pol PE=3 SV=1
Length = 1432
Score = 50.1 bits (118), Expect = 1e-05, Method: Composition-based stats.
Identities = 20/48 (41%), Positives = 31/48 (64%), Gaps = 1/48 (2%)
Query: 86 STHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNE 133
+ +V+ G+++ C C KPGHI DC N K C C K GH+ R+C+++
Sbjct: 358 AINVKGDGEVQRCYGCGKPGHIRRDCKNQK-CFKCGKPGHLQRNCKSK 404
Score = 48.1 bits (113), Expect = 5e-05, Method: Composition-based stats.
Identities = 22/67 (32%), Positives = 33/67 (49%), Gaps = 8/67 (11%)
Query: 3 KNVLSFMSQGNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWNCREPGHM 62
K + F+++ N K G R C CG PGHI +C Q +C+ C +PGH+
Sbjct: 346 KQKMQFLAEAFAAINVKGDGEVQR-------CYGCGKPGHIRRDCKNQ-KCFKCGKPGHL 397
Query: 63 ASNCHNE 69
NC ++
Sbjct: 398 QRNCKSK 404
Score = 45.8 bits (107), Expect = 3e-04, Method: Composition-based stats.
Identities = 16/36 (44%), Positives = 23/36 (63%), Gaps = 1/36 (2%)
Query: 52 RCWNCREPGHMASNCHNEGICHSCGKTGHRARDCST 87
RC+ C +PGH+ +C N+ C CGK GH R+C +
Sbjct: 369 RCYGCGKPGHIRRDCKNQK-CFKCGKPGHLQRNCKS 403
Score = 43.5 bits (101), Expect = 0.001, Method: Composition-based stats.
Identities = 16/38 (42%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 15 CNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQAR 52
C C +PGH R+C N C CG PGH+ C ++ R
Sbjct: 370 CYGCGKPGHIRRDCKNQK-CFKCGKPGHLQRNCKSKNR 406
Score = 33.5 bits (75), Expect = 1.4, Method: Composition-based stats.
Identities = 14/36 (38%), Positives = 17/36 (47%), Gaps = 2/36 (5%)
Query: 189 CRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPS 224
C C + GH+ RDC C CG GH+ C S
Sbjct: 370 CYGCGKPGHIRRDCKNQK--CFKCGKPGHLQRNCKS 403
>sp|P31821|GAG_FIVT2 Gag polyprotein OS=Feline immunodeficiency virus (isolate TM2)
GN=gag PE=3 SV=1
Length = 449
Score = 49.7 bits (117), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 20/34 (58%), Positives = 25/34 (73%)
Query: 14 LCNNCKRPGHFARECPNVAVCNNCGLPGHIASEC 47
+C NCK+PGH AR+C CNNCG PGH+A+ C
Sbjct: 376 VCFNCKKPGHLARQCKEAKRCNNCGKPGHLAANC 409
Score = 45.8 bits (107), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 42/101 (41%), Gaps = 14/101 (13%)
Query: 72 CHSCGKTGHRARDCST------HVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGH 125
C G G++ + + VQ+ G +C NC KPGH+A C K C NC K GH
Sbjct: 345 CQEVGSPGYKMQLLAEALTRVQTVQTKGPRLVCFNCKKPGHLARQCKEAKRCNNCGKPGH 404
Query: 126 IARDCQ--------NEPVCNLCNIAGHVARQCPKGDSLGER 158
+A +C NE V V + P + E+
Sbjct: 405 LAANCWQGGRKTSGNEKVGRAAAPVNQVQQIVPSAPPMEEK 445
Score = 43.9 bits (102), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 31/58 (53%)
Query: 33 VCNNCGLPGHIASECTTQARCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQ 90
VC NC PGH+A +C RC NC +PGH+A+NC G S + RA VQ
Sbjct: 376 VCFNCKKPGHLARQCKEAKRCNNCGKPGHLAANCWQGGRKTSGNEKVGRAAAPVNQVQ 433
Score = 43.1 bits (100), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 46/101 (45%), Gaps = 20/101 (19%)
Query: 2 LKNVLSFMSQGNLCNNCKRP-GHFAREC---PNVAVCNNCGLPGH---IASECTTQAR-- 52
LK LS N +CKR H E + C G PG+ + +E T+ +
Sbjct: 312 LKQSLSI---ANANPDCKRAMSHLKPESTLEEKLRACQEVGSPGYKMQLLAEALTRVQTV 368
Query: 53 --------CWNCREPGHMASNCHNEGICHSCGKTGHRARDC 85
C+NC++PGH+A C C++CGK GH A +C
Sbjct: 369 QTKGPRLVCFNCKKPGHLARQCKEAKRCNNCGKPGHLAANC 409
>sp|O92956|POL_RSVSB Gag-Pro-Pol polyprotein OS=Rous sarcoma virus (strain
Schmidt-Ruppin B) GN=gag-pro-pol PE=1 SV=2
Length = 1603
Score = 48.5 bits (114), Expect = 4e-05, Method: Composition-based stats.
Identities = 32/85 (37%), Positives = 39/85 (45%), Gaps = 16/85 (18%)
Query: 80 HRARDCSTHVQSGGDLR-LCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNL 138
+R RD T SGG R LC C PGH A C + N R+ C L
Sbjct: 492 NRERDGQT--GSGGRARRLCYTCGSPGHYQAQCPKKRKSGNSRER------------CQL 537
Query: 139 CNIAGHVARQCPKGDS-LGERGGGG 162
C+ GH A+QC + DS G+R G G
Sbjct: 538 CDGMGHNAKQCRRRDSNQGQRPGRG 562
Score = 36.6 bits (83), Expect = 0.14, Method: Composition-based stats.
Identities = 16/53 (30%), Positives = 22/53 (41%), Gaps = 12/53 (22%)
Query: 33 VCNNCGLPGHIASECTTQARCWNCREPGHMASNCHNEGICHSCGKTGHRARDC 85
+C CG PGH ++C + + N RE C C GH A+ C
Sbjct: 508 LCYTCGSPGHYQAQCPKKRKSGNSRER------------CQLCDGMGHNAKQC 548
Score = 36.2 bits (82), Expect = 0.19, Method: Composition-based stats.
Identities = 17/57 (29%), Positives = 21/57 (36%)
Query: 55 NCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC 111
N G S +C++CG GH C +SG C C GH A C
Sbjct: 492 NRERDGQTGSGGRARRLCYTCGSPGHYQAQCPKKRKSGNSRERCQLCDGMGHNAKQC 548
Score = 33.5 bits (75), Expect = 1.4, Method: Composition-based stats.
Identities = 16/53 (30%), Positives = 20/53 (37%), Gaps = 12/53 (22%)
Query: 14 LCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWNCREPGHMASNC 66
LC C PGH+ +CP N ++ RC C GH A C
Sbjct: 508 LCYTCGSPGHYQAQCPKKRKSGN------------SRERCQLCDGMGHNAKQC 548
>sp|Q79666|POL_HV1MV Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group O
(isolate MVP5180) GN=gag-pol PE=3 SV=3
Length = 1446
Score = 48.5 bits (114), Expect = 4e-05, Method: Composition-based stats.
Identities = 33/92 (35%), Positives = 41/92 (44%), Gaps = 9/92 (9%)
Query: 98 CNNCYKPGHIAADC--TNDKACKNCRKTGHIARDCQNEPVCN----LCNIAGHVARQ-CP 150
C NC K GHIA +C + C C + GH +DC+N N + GH ARQ C
Sbjct: 396 CFNCGKEGHIAKNCRAPRKRGCWKCGQEGHQMKDCKNGRQANFFREVLASGGHEARQLCA 455
Query: 151 KGDSLGERGGGGGGERGGGGGGDGGGGGGRYV 182
+ GGG E G + GG GR V
Sbjct: 456 ETSVPISPTNGGGSE--GTRESESEGGSGRAV 485
Score = 42.4 bits (98), Expect = 0.003, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 19/37 (51%), Gaps = 2/37 (5%)
Query: 34 CNNCGLPGHIASECTTQAR--CWNCREPGHMASNCHN 68
C NCG GHIA C + CW C + GH +C N
Sbjct: 396 CFNCGKEGHIAKNCRAPRKRGCWKCGQEGHQMKDCKN 432
Score = 42.0 bits (97), Expect = 0.003, Method: Composition-based stats.
Identities = 16/42 (38%), Positives = 25/42 (59%), Gaps = 2/42 (4%)
Query: 52 RCWNCREPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQS 91
+C+NC + GH+A NC + C CG+ GH+ +DC Q+
Sbjct: 395 KCFNCGKEGHIAKNCRAPRKRGCWKCGQEGHQMKDCKNGRQA 436
Score = 37.4 bits (85), Expect = 0.079, Method: Composition-based stats.
Identities = 15/42 (35%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 189 CRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYECPSGRIAD 229
C +C + GH++++C P C CG GH +C +GR A+
Sbjct: 396 CFNCGKEGHIAKNCRAPRKRGCWKCGQEGHQMKDCKNGRQAN 437
Score = 33.9 bits (76), Expect = 1.0, Method: Composition-based stats.
Identities = 14/35 (40%), Positives = 17/35 (48%), Gaps = 2/35 (5%)
Query: 15 CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASEC 47
C NC + GH A+ C P C CG GH +C
Sbjct: 396 CFNCGKEGHIAKNCRAPRKRGCWKCGQEGHQMKDC 430
>sp|Q04095|POL_RSVSA Gag-Pro-Pol polyprotein OS=Avian leukosis virus RSA GN=gag-pro-pol
PE=3 SV=2
Length = 1603
Score = 47.8 bits (112), Expect = 6e-05, Method: Composition-based stats.
Identities = 33/96 (34%), Positives = 40/96 (41%), Gaps = 22/96 (22%)
Query: 80 HRARDCSTHVQSGGDLR-LCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNL 138
+R RD T SGG R LC C PGH A C + N R+ C L
Sbjct: 492 NRERDGQT--GSGGRARGLCYTCGSPGHYQAQCPKKRKSGNSRER------------CQL 537
Query: 139 CNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDG 174
C+ GH A+QC + D G G+R G G G
Sbjct: 538 CDGMGHNAKQCRRRD-------GNQGQRPGKGLSSG 566
Score = 39.7 bits (91), Expect = 0.019, Method: Composition-based stats.
Identities = 18/57 (31%), Positives = 22/57 (38%)
Query: 55 NCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC 111
N G S G+C++CG GH C +SG C C GH A C
Sbjct: 492 NRERDGQTGSGGRARGLCYTCGSPGHYQAQCPKKRKSGNSRERCQLCDGMGHNAKQC 548
Score = 36.6 bits (83), Expect = 0.14, Method: Composition-based stats.
Identities = 16/54 (29%), Positives = 22/54 (40%), Gaps = 12/54 (22%)
Query: 32 AVCNNCGLPGHIASECTTQARCWNCREPGHMASNCHNEGICHSCGKTGHRARDC 85
+C CG PGH ++C + + N RE C C GH A+ C
Sbjct: 507 GLCYTCGSPGHYQAQCPKKRKSGNSRER------------CQLCDGMGHNAKQC 548
Score = 33.5 bits (75), Expect = 1.4, Method: Composition-based stats.
Identities = 16/53 (30%), Positives = 20/53 (37%), Gaps = 12/53 (22%)
Query: 14 LCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWNCREPGHMASNC 66
LC C PGH+ +CP N ++ RC C GH A C
Sbjct: 508 LCYTCGSPGHYQAQCPKKRKSGN------------SRERCQLCDGMGHNAKQC 548
>sp|Q9P795|AIR1_SCHPO Protein air1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=air1 PE=1 SV=2
Length = 315
Score = 47.8 bits (112), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 56/121 (46%), Gaps = 19/121 (15%)
Query: 10 SQGNLCNNCKRPGHFARECPNVAVCNNCG-LPGHIASECTTQARCWNCREPGHMASNCHN 68
S+ +C+NCK GH +++CP+V +C CG + HI+ C +C NC GH+A+ C
Sbjct: 86 SESIVCHNCKGNGHISKDCPHV-LCTTCGAIDDHISVRCPWTKKCMNCGLLGHIAARCSE 144
Query: 69 -----EGICHSCGKTGHRARDCST----HVQSGGDLRL--------CNNCYKPGHIAADC 111
+C +C H + C +V+ +R+ C NC H DC
Sbjct: 145 PRKRGPRVCRTCHTDTHTSSTCPLIWRYYVEKEHPVRIDVSEVRKFCYNCASDEHFGDDC 204
Query: 112 T 112
T
Sbjct: 205 T 205
Score = 39.7 bits (91), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 43/97 (44%), Gaps = 16/97 (16%)
Query: 61 HMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNC-YKPGHIAADCTNDKACKN 119
+ S+ +CH+C GH ++DC HV LC C HI+ C K C N
Sbjct: 80 YFGSDPSESIVCHNCKGNGHISKDC-PHV-------LCTTCGAIDDHISVRCPWTKKCMN 131
Query: 120 CRKTGHIARDCQNEP------VCNLCNIAGHVARQCP 150
C GHIA C +EP VC C+ H + CP
Sbjct: 132 CGLLGHIAARC-SEPRKRGPRVCRTCHTDTHTSSTCP 167
Score = 35.0 bits (79), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 24/96 (25%), Positives = 41/96 (42%), Gaps = 8/96 (8%)
Query: 132 NEPVCNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDGGGGGGRYVGY---HDVI 188
++ + N ++ H + DS + + G D G RY G ++
Sbjct: 33 DQSITNSVSLEKHDFQGSDDHDSSTDLSDSTLEDVEGSEWADVSRG--RYFGSDPSESIV 90
Query: 189 CRSCNQMGHMSRDCVGPLIICRNCGG-RGHMAYECP 223
C +C GH+S+DC P ++C CG H++ CP
Sbjct: 91 CHNCKGNGHISKDC--PHVLCTTCGAIDDHISVRCP 124
Score = 32.7 bits (73), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 40/162 (24%), Positives = 56/162 (34%), Gaps = 23/162 (14%)
Query: 11 QGNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWNC-REPGHMASNCHNE 69
+G+ + R +F + VC+NC GHI+ +C C C H++ C
Sbjct: 68 EGSEWADVSRGRYFGSDPSESIVCHNCKGNGHISKDC-PHVLCTTCGAIDDHISVRCPWT 126
Query: 70 GICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC------------------ 111
C +CG GH A CS + G R+C C+ H ++ C
Sbjct: 127 KKCMNCGLLGHIAARCSEPRKRGP--RVCRTCHTDTHTSSTCPLIWRYYVEKEHPVRIDV 184
Query: 112 -TNDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPKG 152
K C NC H DC N CP G
Sbjct: 185 SEVRKFCYNCASDEHFGDDCTLPSRSNYPESTAFCEANCPSG 226
>sp|O91080|POL_HV1YF Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group N
(isolate YBF30) GN=gag-pol PE=3 SV=3
Length = 1449
Score = 47.0 bits (110), Expect = 1e-04, Method: Composition-based stats.
Identities = 18/39 (46%), Positives = 21/39 (53%), Gaps = 2/39 (5%)
Query: 34 CNNCGLPGHIASECTTQAR--CWNCREPGHMASNCHNEG 70
C NCG GH+A C R CW C + GH +C NEG
Sbjct: 395 CFNCGKEGHLARNCKAPRRGGCWKCGQEGHQMKDCKNEG 433
Score = 44.3 bits (103), Expect = 7e-04, Method: Composition-based stats.
Identities = 16/36 (44%), Positives = 23/36 (63%), Gaps = 2/36 (5%)
Query: 52 RCWNCREPGHMASNCH--NEGICHSCGKTGHRARDC 85
+C+NC + GH+A NC G C CG+ GH+ +DC
Sbjct: 394 KCFNCGKEGHLARNCKAPRRGGCWKCGQEGHQMKDC 429
Score = 39.7 bits (91), Expect = 0.017, Method: Composition-based stats.
Identities = 16/38 (42%), Positives = 22/38 (57%), Gaps = 2/38 (5%)
Query: 98 CNNCYKPGHIAADCTNDK--ACKNCRKTGHIARDCQNE 133
C NC K GH+A +C + C C + GH +DC+NE
Sbjct: 395 CFNCGKEGHLARNCKAPRRGGCWKCGQEGHQMKDCKNE 432
Score = 39.3 bits (90), Expect = 0.024, Method: Composition-based stats.
Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 5/43 (11%)
Query: 72 CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTND 114
C +CGK GH AR+C + G C C + GH DC N+
Sbjct: 395 CFNCGKEGHLARNCKAPRRGG-----CWKCGQEGHQMKDCKNE 432
Score = 33.5 bits (75), Expect = 1.3, Method: Composition-based stats.
Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 1/35 (2%)
Query: 189 CRSCNQMGHMSRDCVGPLI-ICRNCGGRGHMAYEC 222
C +C + GH++R+C P C CG GH +C
Sbjct: 395 CFNCGKEGHLARNCKAPRRGGCWKCGQEGHQMKDC 429
Score = 32.3 bits (72), Expect = 3.1, Method: Composition-based stats.
Identities = 14/35 (40%), Positives = 18/35 (51%), Gaps = 2/35 (5%)
Query: 117 CKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
C NC K GH+AR+C+ C C GH + C
Sbjct: 395 CFNCGKEGHLARNCKAPRRGGCWKCGQEGHQMKDC 429
>sp|Q9IDV9|POL_HV1YB Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group N
(isolate YBF106) GN=gag-pol PE=3 SV=3
Length = 1449
Score = 47.0 bits (110), Expect = 1e-04, Method: Composition-based stats.
Identities = 18/39 (46%), Positives = 21/39 (53%), Gaps = 2/39 (5%)
Query: 34 CNNCGLPGHIASECTTQAR--CWNCREPGHMASNCHNEG 70
C NCG GH+A C R CW C + GH +C NEG
Sbjct: 394 CFNCGKEGHLARNCKAPRRRGCWKCGQEGHQMKDCKNEG 432
Score = 42.4 bits (98), Expect = 0.003, Method: Composition-based stats.
Identities = 20/51 (39%), Positives = 25/51 (49%), Gaps = 5/51 (9%)
Query: 72 CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRK 122
C +CGK GH AR+C + G C C + GH DC N+ N RK
Sbjct: 394 CFNCGKEGHLARNCKAPRRRG-----CWKCGQEGHQMKDCKNEGXQANFRK 439
Score = 41.2 bits (95), Expect = 0.006, Method: Composition-based stats.
Identities = 33/119 (27%), Positives = 49/119 (41%), Gaps = 39/119 (32%)
Query: 3 KNVLSFMSQGNLCNN----CKR------PGHFARECPNVAVCNNCGLPGHIA---SECTT 49
++V ++M++ L N CK+ PG E + C G P H A +E
Sbjct: 313 QDVKNWMTETLLVQNANPDCKQILKALGPGATLEEM--MTACQGVGGPAHKARVLAEAMA 370
Query: 50 QA--------------------RCWNCREPGHMASNC---HNEGICHSCGKTGHRARDC 85
QA +C+NC + GH+A NC G C CG+ GH+ +DC
Sbjct: 371 QAQTATSVFVQRGNFKGIRKTIKCFNCGKEGHLARNCKAPRRRG-CWKCGQEGHQMKDC 428
Score = 41.2 bits (95), Expect = 0.007, Method: Composition-based stats.
Identities = 25/80 (31%), Positives = 33/80 (41%), Gaps = 18/80 (22%)
Query: 72 CHSCGKTGHRAR-----------DCSTHVQSGGDLRL-----CNNCYKPGHIAADCT--N 113
C G H+AR S VQ G + C NC K GH+A +C
Sbjct: 352 CQGVGGPAHKARVLAEAMAQAQTATSVFVQRGNFKGIRKTIKCFNCGKEGHLARNCKAPR 411
Query: 114 DKACKNCRKTGHIARDCQNE 133
+ C C + GH +DC+NE
Sbjct: 412 RRGCWKCGQEGHQMKDCKNE 431
Score = 34.7 bits (78), Expect = 0.58, Method: Composition-based stats.
Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 2/46 (4%)
Query: 179 GRYVGYHDVI-CRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
G + G I C +C + GH++R+C P C CG GH +C
Sbjct: 383 GNFKGIRKTIKCFNCGKEGHLARNCKAPRRRGCWKCGQEGHQMKDC 428
Score = 32.3 bits (72), Expect = 3.1, Method: Composition-based stats.
Identities = 14/35 (40%), Positives = 18/35 (51%), Gaps = 2/35 (5%)
Query: 117 CKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
C NC K GH+AR+C+ C C GH + C
Sbjct: 394 CFNCGKEGHLARNCKAPRRRGCWKCGQEGHQMKDC 428
>sp|Q9QBZ9|POL_HV197 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
subtype K (isolate 97ZR-EQTB11) GN=gag-pol PE=3 SV=2
Length = 1429
Score = 46.6 bits (109), Expect = 1e-04, Method: Composition-based stats.
Identities = 26/80 (32%), Positives = 35/80 (43%), Gaps = 24/80 (30%)
Query: 31 VAVCNNCGLPGHIA-------SECTTQA---------------RCWNCREPGHMASNCH- 67
+ C G PGH A S+ T A +C+NC + GH+A NC
Sbjct: 343 MTACQGVGGPGHKARVLAEAMSQVTNSAVMMQRGNFKGQRRIIKCFNCGKEGHLARNCRA 402
Query: 68 -NEGICHSCGKTGHRARDCS 86
+ C CGK GH+ +DCS
Sbjct: 403 PRKKGCWKCGKEGHQMKDCS 422
Score = 39.7 bits (91), Expect = 0.017, Method: Composition-based stats.
Identities = 18/45 (40%), Positives = 23/45 (51%), Gaps = 5/45 (11%)
Query: 72 CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKA 116
C +CGK GH AR+C + G C C K GH DC+ +A
Sbjct: 387 CFNCGKEGHLARNCRAPRKKG-----CWKCGKEGHQMKDCSERQA 426
Score = 37.0 bits (84), Expect = 0.12, Method: Composition-based stats.
Identities = 19/59 (32%), Positives = 25/59 (42%), Gaps = 10/59 (16%)
Query: 2 LKNVLSFMSQGNL--------CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQ 50
+ N M +GN C NC + GH AR C P C CG GH +C+ +
Sbjct: 366 VTNSAVMMQRGNFKGQRRIIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDCSER 424
Score = 35.4 bits (80), Expect = 0.30, Method: Composition-based stats.
Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 179 GRYVGYHDVI-CRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
G + G +I C +C + GH++R+C P C CG GH +C
Sbjct: 376 GNFKGQRRIIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDC 421
Score = 33.5 bits (75), Expect = 1.5, Method: Composition-based stats.
Identities = 23/83 (27%), Positives = 33/83 (39%), Gaps = 12/83 (14%)
Query: 72 CHSCGKTGHRAR---DCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIAR 128
C G GH+AR + + V + + N I C NC K GH+AR
Sbjct: 346 CQGVGGPGHKARVLAEAMSQVTNSAVMMQRGNFKGQRRIIK-------CFNCGKEGHLAR 398
Query: 129 DCQ--NEPVCNLCNIAGHVARQC 149
+C+ + C C GH + C
Sbjct: 399 NCRAPRKKGCWKCGKEGHQMKDC 421
>sp|P05959|POL_HV1RH Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
subtype B (isolate RF/HAT3) GN=gag-pol PE=1 SV=3
Length = 1436
Score = 46.6 bits (109), Expect = 2e-04, Method: Composition-based stats.
Identities = 21/51 (41%), Positives = 27/51 (52%), Gaps = 3/51 (5%)
Query: 22 GHFARECPNVAVCNNCGLPGHIASECTTQAR--CWNCREPGHMASNCHNEG 70
G+F R+ + C NCG GHIA C + CW C + GH +C NEG
Sbjct: 381 GNF-RDQRKIVKCFNCGKVGHIAKNCRAPRKKGCWKCGKEGHQMKDCTNEG 430
Score = 44.7 bits (104), Expect = 5e-04, Method: Composition-based stats.
Identities = 27/80 (33%), Positives = 34/80 (42%), Gaps = 18/80 (22%)
Query: 72 CHSCGKTGHRAR----------DCSTHVQSGGDLR------LCNNCYKPGHIAADC--TN 113
C G H+AR + +T + G+ R C NC K GHIA +C
Sbjct: 350 CQGVGGPSHKARILAEAMSQVTNSATIMLQKGNFRDQRKIVKCFNCGKVGHIAKNCRAPR 409
Query: 114 DKACKNCRKTGHIARDCQNE 133
K C C K GH +DC NE
Sbjct: 410 KKGCWKCGKEGHQMKDCTNE 429
Score = 44.7 bits (104), Expect = 6e-04, Method: Composition-based stats.
Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 52 RCWNCREPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGDLR 96
+C+NC + GH+A NC + C CGK GH+ +DC+ + LR
Sbjct: 391 KCFNCGKVGHIAKNCRAPRKKGCWKCGKEGHQMKDCTNEGRQANFLR 437
Score = 38.5 bits (88), Expect = 0.036, Method: Composition-based stats.
Identities = 17/43 (39%), Positives = 21/43 (48%), Gaps = 2/43 (4%)
Query: 15 CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQARCWN 55
C NC + GH A+ C P C CG GH +CT + R N
Sbjct: 392 CFNCGKVGHIAKNCRAPRKKGCWKCGKEGHQMKDCTNEGRQAN 434
Score = 34.7 bits (78), Expect = 0.56, Method: Composition-based stats.
Identities = 16/45 (35%), Positives = 26/45 (57%), Gaps = 2/45 (4%)
Query: 187 VICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYECPS-GRIAD 229
V C +C ++GH++++C P C CG GH +C + GR A+
Sbjct: 390 VKCFNCGKVGHIAKNCRAPRKKGCWKCGKEGHQMKDCTNEGRQAN 434
>sp|P19560|POL_BIV29 Gag-Pol polyprotein OS=Bovine immunodeficiency virus (strain R29)
GN=gag-pol PE=1 SV=2
Length = 1475
Score = 46.2 bits (108), Expect = 2e-04, Method: Composition-based stats.
Identities = 16/36 (44%), Positives = 24/36 (66%), Gaps = 1/36 (2%)
Query: 52 RCWNCREPGHMASNCHNEGICHSCGKTGHRARDCST 87
RC+ C + GH+ NC + C+ CGK GH+AR+C +
Sbjct: 404 RCYGCGKTGHLKRNCKQQK-CYHCGKPGHQARNCRS 438
Score = 42.4 bits (98), Expect = 0.003, Method: Composition-based stats.
Identities = 15/36 (41%), Positives = 22/36 (61%), Gaps = 1/36 (2%)
Query: 34 CNNCGLPGHIASECTTQARCWNCREPGHMASNCHNE 69
C CG GH+ C Q +C++C +PGH A NC ++
Sbjct: 405 CYGCGKTGHLKRNCKQQ-KCYHCGKPGHQARNCRSK 439
Score = 40.8 bits (94), Expect = 0.008, Method: Composition-based stats.
Identities = 17/40 (42%), Positives = 22/40 (55%), Gaps = 8/40 (20%)
Query: 72 CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC 111
C+ CGKTGH R+C + C +C KPGH A +C
Sbjct: 405 CYGCGKTGHLKRNCKQ--------QKCYHCGKPGHQARNC 436
Score = 40.0 bits (92), Expect = 0.014, Method: Composition-based stats.
Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 3/46 (6%)
Query: 90 QSGG--DLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNE 133
Q+ G D R C C K GH+ +C K C +C K GH AR+C+++
Sbjct: 395 QTSGPEDGRRCYGCGKTGHLKRNCKQQK-CYHCGKPGHQARNCRSK 439
Score = 39.3 bits (90), Expect = 0.026, Method: Composition-based stats.
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 1/41 (2%)
Query: 12 GNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQAR 52
G C C + GH R C C +CG PGH A C ++ R
Sbjct: 402 GRRCYGCGKTGHLKRNCKQQK-CYHCGKPGHQARNCRSKNR 441
Score = 38.1 bits (87), Expect = 0.050, Method: Composition-based stats.
Identities = 15/35 (42%), Positives = 20/35 (57%), Gaps = 1/35 (2%)
Query: 115 KACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQC 149
+ C C KTGH+ R+C+ + C C GH AR C
Sbjct: 403 RRCYGCGKTGHLKRNCKQQK-CYHCGKPGHQARNC 436
Score = 33.5 bits (75), Expect = 1.3, Method: Composition-based stats.
Identities = 14/36 (38%), Positives = 18/36 (50%), Gaps = 2/36 (5%)
Query: 189 CRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPS 224
C C + GH+ R+C C +CG GH A C S
Sbjct: 405 CYGCGKTGHLKRNCKQQK--CYHCGKPGHQARNCRS 438
>sp|P23425|GAG_VILV2 Gag polyprotein OS=Maedi visna virus (strain 1514 / clone LV1-1KS2)
GN=gag PE=3 SV=1
Length = 442
Score = 46.2 bits (108), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%)
Query: 52 RCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLR 96
+C+NC +PGH+A C ICH CGK GH +DC Q G + R
Sbjct: 386 KCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQKKQQGNNRR 430
Score = 35.8 bits (81), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 15 CNNCKRPGHFARECPNVAVCNNCGLPGHIASEC 47
C NC +PGH AR+C +C++CG GH+ +C
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419
Score = 34.3 bits (77), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 98 CNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNE 133
C NC KPGH+A C C +C K GH+ +DC+ +
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQK 422
Score = 30.8 bits (68), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 18/59 (30%), Positives = 29/59 (49%)
Query: 91 SGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQC 149
G ++L +P A ++ C NC K GH+AR C+ +C+ C GH+ + C
Sbjct: 361 EGFKMQLLAQALRPQGKAGHKGVNQKCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419
>sp|P35955|GAG_VILVK Gag polyprotein OS=Maedi visna virus (strain KV1772) GN=gag PE=3
SV=1
Length = 442
Score = 46.2 bits (108), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%)
Query: 52 RCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLR 96
+C+NC +PGH+A C ICH CGK GH +DC Q G + R
Sbjct: 386 KCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQKKQQGNNRR 430
Score = 35.8 bits (81), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 15 CNNCKRPGHFARECPNVAVCNNCGLPGHIASEC 47
C NC +PGH AR+C +C++CG GH+ +C
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419
Score = 34.3 bits (77), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 98 CNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNE 133
C NC KPGH+A C C +C K GH+ +DC+ +
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQK 422
Score = 32.0 bits (71), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 18/59 (30%), Positives = 29/59 (49%)
Query: 91 SGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQC 149
G ++L +P A ++ C NC K GH+AR C+ +C+ C GH+ + C
Sbjct: 361 EGFKMQLLAQALRPQGKAGQKGVNQKCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419
>sp|P23424|GAG_VILV1 Gag polyprotein OS=Maedi visna virus (strain 1514 / clone LV1-1KS1)
GN=gag PE=3 SV=1
Length = 442
Score = 46.2 bits (108), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%)
Query: 52 RCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLR 96
+C+NC +PGH+A C ICH CGK GH +DC Q G + R
Sbjct: 386 KCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQKKQQGNNRR 430
Score = 35.8 bits (81), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 15 CNNCKRPGHFARECPNVAVCNNCGLPGHIASEC 47
C NC +PGH AR+C +C++CG GH+ +C
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419
Score = 34.3 bits (77), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 98 CNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNE 133
C NC KPGH+A C C +C K GH+ +DC+ +
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQK 422
Score = 30.8 bits (68), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 18/59 (30%), Positives = 29/59 (49%)
Query: 91 SGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQC 149
G ++L +P A ++ C NC K GH+AR C+ +C+ C GH+ + C
Sbjct: 361 EGFKMQLLAQALRPQGKAGHKGVNQKCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419
>sp|P03352|GAG_VILV Gag polyprotein OS=Maedi visna virus (strain 1514) GN=gag PE=3 SV=1
Length = 442
Score = 46.2 bits (108), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%)
Query: 52 RCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLR 96
+C+NC +PGH+A C ICH CGK GH +DC Q G + R
Sbjct: 386 KCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQKKQQGNNRR 430
Score = 35.8 bits (81), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 15 CNNCKRPGHFARECPNVAVCNNCGLPGHIASEC 47
C NC +PGH AR+C +C++CG GH+ +C
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419
Score = 34.3 bits (77), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 98 CNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNE 133
C NC KPGH+A C C +C K GH+ +DC+ +
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQK 422
Score = 31.2 bits (69), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 18/59 (30%), Positives = 29/59 (49%)
Query: 91 SGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQC 149
G ++L +P A ++ C NC K GH+AR C+ +C+ C GH+ + C
Sbjct: 361 EGFKMQLLAQALRPQGKAGHKGVNQKCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419
>sp|Q8AII1|POL_SIVTN Gag-Pol polyprotein OS=Simian immunodeficiency virus (isolate TAN1)
GN=gag-pol PE=3 SV=4
Length = 1462
Score = 45.8 bits (107), Expect = 3e-04, Method: Composition-based stats.
Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 2/46 (4%)
Query: 50 QARCWNCREPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGG 93
Q +C+NC + GH A NC + C CG+ GH+ +DC+T S G
Sbjct: 416 QLQCFNCGKVGHTARNCRAPRKKGCWRCGQEGHQMKDCTTRNNSTG 461
Score = 38.9 bits (89), Expect = 0.029, Method: Composition-based stats.
Identities = 19/45 (42%), Positives = 24/45 (53%), Gaps = 3/45 (6%)
Query: 9 MSQGNL-CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQ 50
+ +G L C NC + GH AR C P C CG GH +CTT+
Sbjct: 412 LKKGQLQCFNCGKVGHTARNCRAPRKKGCWRCGQEGHQMKDCTTR 456
Score = 34.7 bits (78), Expect = 0.51, Method: Composition-based stats.
Identities = 15/35 (42%), Positives = 18/35 (51%), Gaps = 2/35 (5%)
Query: 98 CNNCYKPGHIAADC--TNDKACKNCRKTGHIARDC 130
C NC K GH A +C K C C + GH +DC
Sbjct: 419 CFNCGKVGHTARNCRAPRKKGCWRCGQEGHQMKDC 453
Score = 33.9 bits (76), Expect = 0.92, Method: Composition-based stats.
Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)
Query: 189 CRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYECPS 224
C +C ++GH +R+C P C CG GH +C +
Sbjct: 419 CFNCGKVGHTARNCRAPRKKGCWRCGQEGHQMKDCTT 455
Score = 32.3 bits (72), Expect = 2.6, Method: Composition-based stats.
Identities = 14/35 (40%), Positives = 18/35 (51%), Gaps = 2/35 (5%)
Query: 117 CKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
C NC K GH AR+C+ + C C GH + C
Sbjct: 419 CFNCGKVGHTARNCRAPRKKGCWRCGQEGHQMKDC 453
>sp|P24740|POL_HV1U4 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
subtype A (isolate U455) GN=gag-pol PE=1 SV=3
Length = 1428
Score = 45.4 bits (106), Expect = 3e-04, Method: Composition-based stats.
Identities = 24/79 (30%), Positives = 35/79 (44%), Gaps = 23/79 (29%)
Query: 31 VAVCNNCGLPGHIA---SECTTQA------------------RCWNCREPGHMASNCH-- 67
+ C G PGH A +E +Q +C+NC + GH+A NC
Sbjct: 343 MTACQGVGGPGHKARVLAEAMSQVQQTSIMMQRGNFRGPRRIKCFNCGKEGHLAKNCRAP 402
Query: 68 NEGICHSCGKTGHRARDCS 86
+ C CGK GH+ +DC+
Sbjct: 403 RKKGCWKCGKEGHQMKDCT 421
Score = 37.4 bits (85), Expect = 0.095, Method: Composition-based stats.
Identities = 24/83 (28%), Positives = 34/83 (40%), Gaps = 13/83 (15%)
Query: 72 CHSCGKTGHRAR---DCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIAR 128
C G GH+AR + + VQ + N P I C NC K GH+A+
Sbjct: 346 CQGVGGPGHKARVLAEAMSQVQQTSIMMQRGNFRGPRRIK--------CFNCGKEGHLAK 397
Query: 129 DCQ--NEPVCNLCNIAGHVARQC 149
+C+ + C C GH + C
Sbjct: 398 NCRAPRKKGCWKCGKEGHQMKDC 420
Score = 37.0 bits (84), Expect = 0.13, Method: Composition-based stats.
Identities = 18/51 (35%), Positives = 23/51 (45%), Gaps = 9/51 (17%)
Query: 9 MSQGNL-------CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQ 50
M +GN C NC + GH A+ C P C CG GH +CT +
Sbjct: 373 MQRGNFRGPRRIKCFNCGKEGHLAKNCRAPRKKGCWKCGKEGHQMKDCTER 423
Score = 35.4 bits (80), Expect = 0.37, Method: Composition-based stats.
Identities = 14/45 (31%), Positives = 23/45 (51%), Gaps = 1/45 (2%)
Query: 179 GRYVGYHDVICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
G + G + C +C + GH++++C P C CG GH +C
Sbjct: 376 GNFRGPRRIKCFNCGKEGHLAKNCRAPRKKGCWKCGKEGHQMKDC 420
>sp|P04584|POL_HV2RO Gag-Pol polyprotein OS=Human immunodeficiency virus type 2 subtype
A (isolate ROD) GN=gag-pol PE=1 SV=3
Length = 1464
Score = 45.4 bits (106), Expect = 3e-04, Method: Composition-based stats.
Identities = 17/35 (48%), Positives = 21/35 (60%), Gaps = 2/35 (5%)
Query: 34 CNNCGLPGHIASECTTQAR--CWNCREPGHMASNC 66
C NCG GH A +C R CW C +PGH+ +NC
Sbjct: 391 CWNCGKEGHSARQCRAPRRQGCWKCGKPGHIMTNC 425
Score = 44.3 bits (103), Expect = 8e-04, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 31/63 (49%), Gaps = 7/63 (11%)
Query: 72 CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQ 131
C +CGK GH AR C + G C C KPGHI +C + +A +TG + ++
Sbjct: 391 CWNCGKEGHSARQCRAPRRQG-----CWKCGKPGHIMTNCPDRQA--GFLRTGPLGKEAP 443
Query: 132 NEP 134
P
Sbjct: 444 QLP 446
Score = 41.2 bits (95), Expect = 0.006, Method: Composition-based stats.
Identities = 17/35 (48%), Positives = 20/35 (57%), Gaps = 2/35 (5%)
Query: 15 CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASEC 47
C NC + GH AR+C P C CG PGHI + C
Sbjct: 391 CWNCGKEGHSARQCRAPRRQGCWKCGKPGHIMTNC 425
Score = 38.5 bits (88), Expect = 0.042, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 20/37 (54%), Gaps = 4/37 (10%)
Query: 52 RCWNCREPGHMASNC---HNEGICHSCGKTGHRARDC 85
+CWNC + GH A C +G C CGK GH +C
Sbjct: 390 KCWNCGKEGHSARQCRAPRRQG-CWKCGKPGHIMTNC 425
Score = 33.9 bits (76), Expect = 0.88, Method: Composition-based stats.
Identities = 14/36 (38%), Positives = 18/36 (50%), Gaps = 1/36 (2%)
Query: 189 CRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYECP 223
C +C + GH +R C P C CG GH+ CP
Sbjct: 391 CWNCGKEGHSARQCRAPRRQGCWKCGKPGHIMTNCP 426
>sp|P24107|POL_HV2CA Gag-Pol polyprotein OS=Human immunodeficiency virus type 2 subtype
A (isolate CAM2) GN=gag-pol PE=3 SV=3
Length = 1462
Score = 45.4 bits (106), Expect = 3e-04, Method: Composition-based stats.
Identities = 17/35 (48%), Positives = 21/35 (60%), Gaps = 2/35 (5%)
Query: 34 CNNCGLPGHIASECTTQAR--CWNCREPGHMASNC 66
C NCG GH A +C R CW C +PGH+ +NC
Sbjct: 391 CWNCGKEGHSARQCRAPRRQGCWKCGKPGHIMTNC 425
Score = 44.7 bits (104), Expect = 5e-04, Method: Composition-based stats.
Identities = 27/79 (34%), Positives = 34/79 (43%), Gaps = 14/79 (17%)
Query: 15 CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQA----RCWNC--------REPG 60
C NC + GH AR+C P C CG PGHI + C + R W R P
Sbjct: 391 CWNCGKEGHSARQCRAPRRQGCWKCGKPGHIMTNCPDRQAGFLRDWPLGKEAPQFPRGPS 450
Query: 61 HMASNCHNEGICHSCGKTG 79
+N ++ I S G TG
Sbjct: 451 STGANTNSTPIGSSSGSTG 469
Score = 43.5 bits (101), Expect = 0.001, Method: Composition-based stats.
Identities = 19/45 (42%), Positives = 24/45 (53%), Gaps = 5/45 (11%)
Query: 72 CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKA 116
C +CGK GH AR C + G C C KPGHI +C + +A
Sbjct: 391 CWNCGKEGHSARQCRAPRRQG-----CWKCGKPGHIMTNCPDRQA 430
Score = 38.5 bits (88), Expect = 0.042, Method: Composition-based stats.
Identities = 16/37 (43%), Positives = 20/37 (54%), Gaps = 4/37 (10%)
Query: 52 RCWNCREPGHMASNC---HNEGICHSCGKTGHRARDC 85
+CWNC + GH A C +G C CGK GH +C
Sbjct: 390 KCWNCGKEGHSARQCRAPRRQG-CWKCGKPGHIMTNC 425
Score = 35.0 bits (79), Expect = 0.47, Method: Composition-based stats.
Identities = 22/60 (36%), Positives = 27/60 (45%), Gaps = 3/60 (5%)
Query: 98 CNNCYKPGHIAADC--TNDKACKNCRKTGHIARDCQNEPVCNLCNIA-GHVARQCPKGDS 154
C NC K GH A C + C C K GHI +C + L + G A Q P+G S
Sbjct: 391 CWNCGKEGHSARQCRAPRRQGCWKCGKPGHIMTNCPDRQAGFLRDWPLGKEAPQFPRGPS 450
Score = 33.9 bits (76), Expect = 0.88, Method: Composition-based stats.
Identities = 14/36 (38%), Positives = 18/36 (50%), Gaps = 1/36 (2%)
Query: 189 CRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYECP 223
C +C + GH +R C P C CG GH+ CP
Sbjct: 391 CWNCGKEGHSARQCRAPRRQGCWKCGKPGHIMTNCP 426
>sp|O12158|POL_HV192 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
subtype C (isolate 92BR025) GN=gag-pol PE=1 SV=2
Length = 1431
Score = 45.4 bits (106), Expect = 4e-04, Method: Composition-based stats.
Identities = 29/89 (32%), Positives = 34/89 (38%), Gaps = 8/89 (8%)
Query: 31 VAVCNNCGLPGHIASECTTQARCWNCREPGHMASNCHNEG---ICHSCGKTGHRARDCST 87
+ C G PGH A N SNC C +CGK GH AR+C
Sbjct: 345 MTACQGVGGPGHKARVLAEAMSKVNNTNIMMQRSNCKGPKRTIKCFNCGKEGHLARNCRA 404
Query: 88 HVQSGGDLRLCNNCYKPGHIAADCTNDKA 116
+ G C C K GH DCT +A
Sbjct: 405 PRKKG-----CWKCGKEGHQVKDCTERQA 428
Score = 39.7 bits (91), Expect = 0.018, Method: Composition-based stats.
Identities = 24/83 (28%), Positives = 35/83 (42%), Gaps = 12/83 (14%)
Query: 72 CHSCGKTGHRAR---DCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIAR 128
C G GH+AR + + V + + +NC P C NC K GH+AR
Sbjct: 348 CQGVGGPGHKARVLAEAMSKVNNTNIMMQRSNCKGPKRTIK-------CFNCGKEGHLAR 400
Query: 129 DCQ--NEPVCNLCNIAGHVARQC 149
+C+ + C C GH + C
Sbjct: 401 NCRAPRKKGCWKCGKEGHQVKDC 423
Score = 38.1 bits (87), Expect = 0.059, Method: Composition-based stats.
Identities = 16/38 (42%), Positives = 19/38 (50%), Gaps = 2/38 (5%)
Query: 15 CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQ 50
C NC + GH AR C P C CG GH +CT +
Sbjct: 389 CFNCGKEGHLARNCRAPRKKGCWKCGKEGHQVKDCTER 426
Score = 34.3 bits (77), Expect = 0.70, Method: Composition-based stats.
Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 1/35 (2%)
Query: 189 CRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
C +C + GH++R+C P C CG GH +C
Sbjct: 389 CFNCGKEGHLARNCRAPRKKGCWKCGKEGHQVKDC 423
>sp|P35963|POL_HV1Y2 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
subtype B (isolate YU-2) GN=gag-pol PE=1 SV=3
Length = 1435
Score = 45.1 bits (105), Expect = 4e-04, Method: Composition-based stats.
Identities = 25/81 (30%), Positives = 35/81 (43%), Gaps = 25/81 (30%)
Query: 31 VAVCNNCGLPGHIA-------SECTTQA----------------RCWNCREPGHMASNCH 67
+ C G PGH A S+ T A +C+NC + GH+A NC
Sbjct: 347 MTACQGVGGPGHKARVLAEAMSQVTNSATIMMQRGNFRNQRKTVKCFNCGKEGHIAKNCR 406
Query: 68 --NEGICHSCGKTGHRARDCS 86
+ C CGK GH+ +DC+
Sbjct: 407 APRKKGCWKCGKEGHQMKDCT 427
Score = 34.7 bits (78), Expect = 0.52, Method: Composition-based stats.
Identities = 21/83 (25%), Positives = 34/83 (40%), Gaps = 11/83 (13%)
Query: 72 CHSCGKTGHRAR---DCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIAR 128
C G GH+AR + + V + + + ++ C N C K GHIA+
Sbjct: 350 CQGVGGPGHKARVLAEAMSQVTNSATIMMQRGNFRNQRKTVKCFN------CGKEGHIAK 403
Query: 129 DCQ--NEPVCNLCNIAGHVARQC 149
+C+ + C C GH + C
Sbjct: 404 NCRAPRKKGCWKCGKEGHQMKDC 426
Score = 32.7 bits (73), Expect = 2.2, Method: Composition-based stats.
Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)
Query: 187 VICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
V C +C + GH++++C P C CG GH +C
Sbjct: 390 VKCFNCGKEGHIAKNCRAPRKKGCWKCGKEGHQMKDC 426
>sp|Q73368|POL_HV1B9 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
subtype B (strain 89.6) GN=gag-pol PE=3 SV=3
Length = 1435
Score = 45.1 bits (105), Expect = 4e-04, Method: Composition-based stats.
Identities = 25/81 (30%), Positives = 35/81 (43%), Gaps = 25/81 (30%)
Query: 31 VAVCNNCGLPGHIA-------SECTTQA----------------RCWNCREPGHMASNCH 67
+ C G PGH A S+ T A +C+NC + GH+A NC
Sbjct: 347 MTACQGVGGPGHKARVLAEAMSQVTNSATIMMQRGNFRNQRKTVKCFNCGKEGHIAKNCR 406
Query: 68 --NEGICHSCGKTGHRARDCS 86
+ C CGK GH+ +DC+
Sbjct: 407 APRKKGCWKCGKEGHQMKDCT 427
Score = 34.7 bits (78), Expect = 0.52, Method: Composition-based stats.
Identities = 21/83 (25%), Positives = 34/83 (40%), Gaps = 11/83 (13%)
Query: 72 CHSCGKTGHRAR---DCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIAR 128
C G GH+AR + + V + + + ++ C N C K GHIA+
Sbjct: 350 CQGVGGPGHKARVLAEAMSQVTNSATIMMQRGNFRNQRKTVKCFN------CGKEGHIAK 403
Query: 129 DCQ--NEPVCNLCNIAGHVARQC 149
+C+ + C C GH + C
Sbjct: 404 NCRAPRKKGCWKCGKEGHQMKDC 426
Score = 32.7 bits (73), Expect = 2.2, Method: Composition-based stats.
Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)
Query: 187 VICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
V C +C + GH++++C P C CG GH +C
Sbjct: 390 VKCFNCGKEGHIAKNCRAPRKKGCWKCGKEGHQMKDC 426
>sp|Q9WC63|POL_HV1S9 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
subtype J (isolate SE9173) GN=gag-pol PE=3 SV=3
Length = 1432
Score = 44.7 bits (104), Expect = 5e-04, Method: Composition-based stats.
Identities = 24/80 (30%), Positives = 35/80 (43%), Gaps = 24/80 (30%)
Query: 31 VAVCNNCGLPGHIA---SECTTQA-------------------RCWNCREPGHMASNCH- 67
+ C G PGH A +E +Q +C+NC + GH+A NC
Sbjct: 347 MTACQGVGGPGHKARVLAEAMSQVTNTNIMMQRGNFRDHKRIVKCFNCGKQGHIAKNCRA 406
Query: 68 -NEGICHSCGKTGHRARDCS 86
+ C CGK GH+ +DC+
Sbjct: 407 PRKKGCWKCGKEGHQMKDCT 426
Score = 33.9 bits (76), Expect = 0.91, Method: Composition-based stats.
Identities = 23/80 (28%), Positives = 34/80 (42%), Gaps = 6/80 (7%)
Query: 72 CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQ 131
C G GH+AR ++ + N + G+ D C NC K GHIA++C+
Sbjct: 350 CQGVGGPGHKAR---VLAEAMSQVTNTNIMMQRGNFR-DHKRIVKCFNCGKQGHIAKNCR 405
Query: 132 --NEPVCNLCNIAGHVARQC 149
+ C C GH + C
Sbjct: 406 APRKKGCWKCGKEGHQMKDC 425
Score = 33.5 bits (75), Expect = 1.4, Method: Composition-based stats.
Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)
Query: 187 VICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
V C +C + GH++++C P C CG GH +C
Sbjct: 389 VKCFNCGKQGHIAKNCRAPRKKGCWKCGKEGHQMKDC 425
>sp|Q9WC54|POL_HV1S2 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
subtype J (isolate SE9280) GN=gag-pol PE=3 SV=3
Length = 1432
Score = 44.7 bits (104), Expect = 5e-04, Method: Composition-based stats.
Identities = 24/80 (30%), Positives = 35/80 (43%), Gaps = 24/80 (30%)
Query: 31 VAVCNNCGLPGHIA---SECTTQA-------------------RCWNCREPGHMASNCH- 67
+ C G PGH A +E +Q +C+NC + GH+A NC
Sbjct: 347 MTACQGVGGPGHKARVLAEAMSQVTNTNIMMQRGNFRDHKRIVKCFNCGKQGHIAKNCRA 406
Query: 68 -NEGICHSCGKTGHRARDCS 86
+ C CGK GH+ +DC+
Sbjct: 407 PRKKGCWKCGKEGHQMKDCT 426
Score = 33.9 bits (76), Expect = 0.91, Method: Composition-based stats.
Identities = 23/80 (28%), Positives = 34/80 (42%), Gaps = 6/80 (7%)
Query: 72 CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQ 131
C G GH+AR ++ + N + G+ D C NC K GHIA++C+
Sbjct: 350 CQGVGGPGHKAR---VLAEAMSQVTNTNIMMQRGNFR-DHKRIVKCFNCGKQGHIAKNCR 405
Query: 132 --NEPVCNLCNIAGHVARQC 149
+ C C GH + C
Sbjct: 406 APRKKGCWKCGKEGHQMKDC 425
Score = 33.5 bits (75), Expect = 1.4, Method: Composition-based stats.
Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)
Query: 187 VICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
V C +C + GH++++C P C CG GH +C
Sbjct: 389 VKCFNCGKQGHIAKNCRAPRKKGCWKCGKEGHQMKDC 425
>sp|P12497|POL_HV1N5 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
subtype B (isolate NY5) GN=gag-pol PE=1 SV=4
Length = 1435
Score = 44.7 bits (104), Expect = 6e-04, Method: Composition-based stats.
Identities = 25/81 (30%), Positives = 35/81 (43%), Gaps = 25/81 (30%)
Query: 31 VAVCNNCGLPGHIA-------SECTTQA----------------RCWNCREPGHMASNCH 67
+ C G PGH A S+ T A +C+NC + GH+A NC
Sbjct: 347 MTACQGVGGPGHKARVLAEAMSQVTNPATIMIQKGNFRNQRKTVKCFNCGKEGHIAKNCR 406
Query: 68 --NEGICHSCGKTGHRARDCS 86
+ C CGK GH+ +DC+
Sbjct: 407 APRKKGCWKCGKEGHQMKDCT 427
Score = 33.9 bits (76), Expect = 0.96, Method: Composition-based stats.
Identities = 21/83 (25%), Positives = 34/83 (40%), Gaps = 11/83 (13%)
Query: 72 CHSCGKTGHRAR---DCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIAR 128
C G GH+AR + + V + + + ++ C N C K GHIA+
Sbjct: 350 CQGVGGPGHKARVLAEAMSQVTNPATIMIQKGNFRNQRKTVKCFN------CGKEGHIAK 403
Query: 129 DCQ--NEPVCNLCNIAGHVARQC 149
+C+ + C C GH + C
Sbjct: 404 NCRAPRKKGCWKCGKEGHQMKDC 426
Score = 32.7 bits (73), Expect = 2.2, Method: Composition-based stats.
Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)
Query: 187 VICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
V C +C + GH++++C P C CG GH +C
Sbjct: 390 VKCFNCGKEGHIAKNCRAPRKKGCWKCGKEGHQMKDC 426
>sp|Q9HFF2|YL92_SCHPO Uncharacterized protein C683.02c OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=SPAC683.02c PE=4 SV=1
Length = 218
Score = 44.3 bits (103), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 41/90 (45%), Gaps = 17/90 (18%)
Query: 13 NLCNNCKRPGHFARECP----NVAVCNNCGLPGHIASECTTQ-----ARCWNCREPGHMA 63
C C++ GH ++CP NV++C CG H + C+ + A+C+ C E GH++
Sbjct: 77 KFCFACRQQGHIVQDCPEAKDNVSICFRCGSKEHSLNACSKKGPLKFAKCFICHENGHLS 136
Query: 64 SNCHNE--------GICHSCGKTGHRARDC 85
C G C C H A+DC
Sbjct: 137 GQCEQNPKGLYPKGGCCKFCSSVHHLAKDC 166
Score = 39.7 bits (91), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 40/92 (43%), Gaps = 18/92 (19%)
Query: 53 CWNCREPGHMASNC----HNEGICHSCGKTGHRARDCSTHVQSGGDLRL--CNNCYKPGH 106
C+ CR+ GH+ +C N IC CG H CS G L+ C C++ GH
Sbjct: 79 CFACRQQGHIVQDCPEAKDNVSICFRCGSKEHSLNACS----KKGPLKFAKCFICHENGH 134
Query: 107 IAADCTNDKA--------CKNCRKTGHIARDC 130
++ C + CK C H+A+DC
Sbjct: 135 LSGQCEQNPKGLYPKGGCCKFCSSVHHLAKDC 166
>sp|A1L2T6|ZCHC7_XENLA Zinc finger CCHC domain-containing protein 7 OS=Xenopus laevis
GN=zcchc7 PE=2 SV=2
Length = 563
Score = 42.7 bits (99), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 38/85 (44%), Gaps = 8/85 (9%)
Query: 8 FMSQGNLCNNCKRPGHFARECP---NVAVCNNCGLPGHIASECTTQARCWNCREPGHMAS 64
+ + +C NC + GH ++ CP + C CG GH + C ++ C NC PGH
Sbjct: 260 YTDKNVVCRNCDKRGHLSKNCPVPKKLPACCLCGERGHYQNSCPSRY-CLNCFLPGHFFK 318
Query: 65 NC----HNEGICHSCGKTGHRARDC 85
C + CH C GH A C
Sbjct: 319 ECIERAYWRKTCHRCSMPGHYADAC 343
Score = 42.0 bits (97), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 35/79 (44%), Gaps = 6/79 (7%)
Query: 152 GDSLGERGGGGGGERGGGGGGDGGGG----GGRYVGYHDVICRSCNQMGHMSRDCVGP-- 205
D GERG G RY +V+CR+C++ GH+S++C P
Sbjct: 226 SDDEGERGAAWSISEKDVEAQIGNYTPLRRSNRYYTDKNVVCRNCDKRGHLSKNCPVPKK 285
Query: 206 LIICRNCGGRGHMAYECPS 224
L C CG RGH CPS
Sbjct: 286 LPACCLCGERGHYQNSCPS 304
Score = 37.4 bits (85), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 44/181 (24%), Positives = 61/181 (33%), Gaps = 61/181 (33%)
Query: 53 CWNCREPGHMASNC---HNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAA 109
C NC + GH++ NC C CG+ GH C + R C NC+ PGH
Sbjct: 267 CRNCDKRGHLSKNCPVPKKLPACCLCGERGHYQNSCPS--------RYCLNCFLPGHFFK 318
Query: 110 DCTNDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCP---KGDSLGERGGGGGGER 166
+C RKT C+ C++ GH A CP + L + G +
Sbjct: 319 ECIERAY---WRKT------------CHRCSMPGHYADACPEIWRQYHLTIKAGPIKKPK 363
Query: 167 GGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSGR 226
G D ++ C NC +GH YEC R
Sbjct: 364 SHSGQKD--------------------------------IVYCCNCAKKGHCIYECKERR 391
Query: 227 I 227
+
Sbjct: 392 M 392
Score = 35.8 bits (81), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 43/105 (40%), Gaps = 16/105 (15%)
Query: 15 CNNCKRPGHFARECPNVA----VCNNCGLPGHIASECTTQARCWNC-------REPGHMA 63
C NC PGHF +EC A C+ C +PGH A C R ++ ++P
Sbjct: 307 CLNCFLPGHFFKECIERAYWRKTCHRCSMPGHYADACPEIWRQYHLTIKAGPIKKP---K 363
Query: 64 SNCHNEGICH--SCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGH 106
S+ + I + +C K GH +C + L C + H
Sbjct: 364 SHSGQKDIVYCCNCAKKGHCIYECKERRMNSDKLPTCQLVFSYDH 408
Score = 31.2 bits (69), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 20/71 (28%), Positives = 31/71 (43%), Gaps = 11/71 (15%)
Query: 83 RDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDC---QNEPVCNLC 139
+D + + LR N Y ++ C+NC K GH++++C + P C LC
Sbjct: 241 KDVEAQIGNYTPLRRSNRYYTDKNVV--------CRNCDKRGHLSKNCPVPKKLPACCLC 292
Query: 140 NIAGHVARQCP 150
GH CP
Sbjct: 293 GERGHYQNSCP 303
>sp|P33458|GAG_CAEVC Gag polyprotein OS=Caprine arthritis encephalitis virus (strain
Cork) GN=gag PE=3 SV=1
Length = 441
Score = 42.0 bits (97), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 23/34 (67%)
Query: 52 RCWNCREPGHMASNCHNEGICHSCGKTGHRARDC 85
RC+NC +PGH A C ICH+CGK GH ++C
Sbjct: 380 RCYNCGKPGHQARQCRQGIICHNCGKRGHMQKEC 413
Score = 38.9 bits (89), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%)
Query: 11 QGNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASEC 47
Q C NC +PGH AR+C +C+NCG GH+ EC
Sbjct: 377 QPQRCYNCGKPGHQARQCRQGIICHNCGKRGHMQKEC 413
Score = 34.7 bits (78), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 32/75 (42%), Gaps = 5/75 (6%)
Query: 61 HMASNCHNEGICHSCGKTGHRARDCSTHVQSG-----GDLRLCNNCYKPGHIAADCTNDK 115
AS C G G + + + ++ G G + C NC KPGH A C
Sbjct: 339 QQASVEEKMQACRDVGSEGFKMQLLAQALRPGKGKGNGQPQRCYNCGKPGHQARQCRQGI 398
Query: 116 ACKNCRKTGHIARDC 130
C NC K GH+ ++C
Sbjct: 399 ICHNCGKRGHMQKEC 413
Score = 33.1 bits (74), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Query: 189 CRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYEC 222
C +C + GH +R C IIC NCG RGHM EC
Sbjct: 381 CYNCGKPGHQARQCRQG-IICHNCGKRGHMQKEC 413
Score = 30.8 bits (68), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 18/33 (54%)
Query: 34 CNNCGLPGHIASECTTQARCWNCREPGHMASNC 66
C NCG PGH A +C C NC + GHM C
Sbjct: 381 CYNCGKPGHQARQCRQGIICHNCGKRGHMQKEC 413
>sp|P34689|GLH1_CAEEL ATP-dependent RNA helicase glh-1 OS=Caenorhabditis elegans GN=glh-1
PE=1 SV=3
Length = 763
Score = 42.0 bits (97), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 47/120 (39%), Gaps = 47/120 (39%)
Query: 13 NLCNNCKRPGHFARECP------NVAVCNNCGLPGHIASECTTQ---------------- 50
N C NC++PGH + +CP VC NC PGH + ECT +
Sbjct: 158 NNCFNCQQPGHRSSDCPEPRKEREPRVCYNCQQPGHTSRECTEERKPREGRTGGFGGGAG 217
Query: 51 ------------------------ARCWNCREPGHMASNCHNEGI-CHSCGKTGHRARDC 85
+C+NC+ GH ++ C C +CG+ GHR+ +C
Sbjct: 218 FGNNGGNDGFGGDGGFGGGEERGPMKCFNCKGEGHRSAECPEPPRGCFNCGEQGHRSNEC 277
Score = 38.1 bits (87), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 62/132 (46%), Gaps = 19/132 (14%)
Query: 98 CNNCYKPGHIAADCTNDK------ACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPK 151
C NC +PGH ++DC + C NC++ GH +R+C E R+ +
Sbjct: 160 CFNCQQPGHRSSDCPEPRKEREPRVCYNCQQPGHTSRECTEE-------------RKPRE 206
Query: 152 GDSLGERGGGGGGERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDCVGPLIICRN 211
G + G GG G G GG G G GG G + C +C GH S +C P C N
Sbjct: 207 GRTGGFGGGAGFGNNGGNDGFGGDGGFGGGEERGPMKCFNCKGEGHRSAECPEPPRGCFN 266
Query: 212 CGGRGHMAYECP 223
CG +GH + ECP
Sbjct: 267 CGEQGHRSNECP 278
>sp|P40507|AIR1_YEAST Protein AIR1 OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=AIR1 PE=1 SV=1
Length = 360
Score = 41.6 bits (96), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 48/121 (39%), Gaps = 18/121 (14%)
Query: 15 CNNCKRPGHFARECPNVAVCNNCG-LPGHIASECTTQARCWNCREPGHMASNCHNEG--- 70
CNNC + GH R CP+V +C CG + H + C C NC GH S C ++
Sbjct: 76 CNNCSQRGHLKRNCPHV-ICTYCGFMDDHYSQHCPKAIICTNCNANGHYKSQCPHKWKKV 134
Query: 71 ICHSCGKTGH---------RARDCSTHVQSGGDLR----LCNNCYKPGHIAADCTNDKAC 117
C C H R+ T + GD C NC GH DC ++
Sbjct: 135 FCTLCNSKRHSRERCPSIWRSYLLKTKDANQGDFDFQTVFCYNCGNAGHFGDDCAERRSS 194
Query: 118 K 118
+
Sbjct: 195 R 195
Score = 32.0 bits (71), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 31/92 (33%), Gaps = 37/92 (40%)
Query: 133 EPVCNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDGGGGGGRYVGYHDVICRSC 192
EP CN C+ GH+ R CP VIC C
Sbjct: 73 EPKCNNCSQRGHLKRNCPH-----------------------------------VICTYC 97
Query: 193 NQMG-HMSRDCVGPLIICRNCGGRGHMAYECP 223
M H S+ C IIC NC GH +CP
Sbjct: 98 GFMDDHYSQHCPK-AIICTNCNANGHYKSQCP 128
>sp|Q9NUD5|ZCHC3_HUMAN Zinc finger CCHC domain-containing protein 3 OS=Homo sapiens
GN=ZCCHC3 PE=1 SV=1
Length = 404
Score = 41.6 bits (96), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 93 GDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPKG 152
G + C C H++ CT D+ C C + GH++ C+ VCNLC GH QCPK
Sbjct: 331 GQPKTCFKCGSRTHMSGSCTQDR-CFRCGEEGHLSPYCRKGIVCNLCGKRGHAFAQCPKA 389
Score = 40.4 bits (93), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 1/57 (1%)
Query: 33 VCNNCGLPGHIASECTTQARCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHV 89
C CG H++ CT Q RC+ C E GH++ C +C+ CGK GH C V
Sbjct: 335 TCFKCGSRTHMSGSCT-QDRCFRCGEEGHLSPYCRKGIVCNLCGKRGHAFAQCPKAV 390
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.323 0.141 0.500
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 107,698,315
Number of Sequences: 539616
Number of extensions: 5394359
Number of successful extensions: 108989
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 1340
Number of HSP's successfully gapped in prelim test: 567
Number of HSP's that attempted gapping in prelim test: 44388
Number of HSP's gapped (non-prelim): 35501
length of query: 235
length of database: 191,569,459
effective HSP length: 114
effective length of query: 121
effective length of database: 130,053,235
effective search space: 15736441435
effective search space used: 15736441435
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 59 (27.3 bits)