BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy16574
         (152 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q8T8R1|Y3800_DROME CCHC-type zinc finger protein CG3800 OS=Drosophila melanogaster
           GN=CG3800 PE=1 SV=1
          Length = 165

 Score =  102 bits (254), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 51/114 (44%), Positives = 67/114 (58%), Gaps = 18/114 (15%)

Query: 39  CYKCNNYGHFARECATESVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSS 98
           CYKCN +GHFAR C  E+  CY C+G GH++KDCT   +  CY CN +GH+ RNCP   +
Sbjct: 57  CYKCNQFGHFARACPEEAERCYRCNGIGHISKDCTQADNPTCYRCNKTGHWVRNCPEAVN 116

Query: 99  KR------CYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +R      CY C++ GH++K CP            + S TCY CG  GHL  +C
Sbjct: 117 ERGPTNVSCYKCNRTGHISKNCP------------ETSKTCYGCGKSGHLRREC 158



 Score = 53.1 bits (126), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 37/67 (55%), Gaps = 3/67 (4%)

Query: 9   CYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECATESVTCYNCSGQGHV 68
           CY C   GH+  +CP+  + + RG    + CYKCN  GH ++ C   S TCY C   GH+
Sbjct: 98  CYRCNKTGHWVRNCPE--AVNERGP-TNVSCYKCNRTGHISKNCPETSKTCYGCGKSGHL 154

Query: 69  AKDCTVK 75
            ++C  K
Sbjct: 155 RRECDEK 161



 Score = 30.8 bits (68), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 16/48 (33%), Positives = 20/48 (41%), Gaps = 10/48 (20%)

Query: 5   STIQCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           + + CY C   GH   +CP+ S            CY C   GH  REC
Sbjct: 121 TNVSCYKCNRTGHISKNCPETSK----------TCYGCGKSGHLRREC 158


>sp|Q04832|HEXP_LEIMA DNA-binding protein HEXBP OS=Leishmania major GN=HEXBP PE=4 SV=1
          Length = 271

 Score = 94.7 bits (234), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 57/183 (31%), Positives = 81/183 (44%), Gaps = 42/183 (22%)

Query: 3   STSTIQCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECATE------- 55
           + S+  C NC   GHY   CP+   AD++GD+    C++C   GH +REC  E       
Sbjct: 12  TESSTSCRNCGKEGHYARECPE---ADSKGDERSTTCFRCGEEGHMSRECPNEARSGAAG 68

Query: 56  SVTCYNCSGQGHVAKDCT------VKSSIICYNCNSSGHFARNCPN-------------- 95
           ++TC+ C   GH+++DC             CY C   GH +R+CP+              
Sbjct: 69  AMTCFRCGEAGHMSRDCPNSAKPGAAKGFECYKCGQEGHLSRDCPSSQGGSRGGYGQKRG 128

Query: 96  --------DSSKRCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
                      + CY C  AGH++++CP    G S         TCY CG  GH+S DC 
Sbjct: 129 RSGAQGGYSGDRTCYKCGDAGHISRDCPNGQGGYS----GAGDRTCYKCGDAGHISRDCP 184

Query: 148 LVQ 150
             Q
Sbjct: 185 NGQ 187



 Score = 84.7 bits (208), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 57/173 (32%), Positives = 74/173 (42%), Gaps = 34/173 (19%)

Query: 7   IQCYNCFDFGHYQYSCP--QKSSADARGDKVGI-----------VCYKCNNYGHFAREC- 52
            +CY C   GH    CP  Q  S    G K G             CYKC + GH +R+C 
Sbjct: 97  FECYKCGQEGHLSRDCPSSQGGSRGGYGQKRGRSGAQGGYSGDRTCYKCGDAGHISRDCP 156

Query: 53  -------ATESVTCYNCSGQGHVAKDC-------TVKSSIICYNCNSSGHFARNCPNDSS 98
                       TCY C   GH+++DC       +      CY C  SGH +R CP+  S
Sbjct: 157 NGQGGYSGAGDRTCYKCGDAGHISRDCPNGQGGYSGAGDRKCYKCGESGHMSRECPSAGS 216

Query: 99  -----KRCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
                + CY C + GH+++ECP +  G           TCY CG  GH+S DC
Sbjct: 217 TGSGDRACYKCGKPGHISRECP-EAGGSYGGSRGGGDRTCYKCGEAGHISRDC 268



 Score = 84.3 bits (207), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 51/181 (28%), Positives = 74/181 (40%), Gaps = 46/181 (25%)

Query: 3   STSTIQCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECATE------- 55
           +   + C+ C + GH    CP  +   A     G  CYKC   GH +R+C +        
Sbjct: 66  AAGAMTCFRCGEAGHMSRDCPNSAKPGA---AKGFECYKCGQEGHLSRDCPSSQGGSRGG 122

Query: 56  ----------------SVTCYNCSGQGHVAKDC-------TVKSSIICYNCNSSGHFARN 92
                             TCY C   GH+++DC       +      CY C  +GH +R+
Sbjct: 123 YGQKRGRSGAQGGYSGDRTCYKCGDAGHISRDCPNGQGGYSGAGDRTCYKCGDAGHISRD 182

Query: 93  CPND-------SSKRCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYD 145
           CPN          ++CY C ++GHM++ECP      S          CY CG  GH+S +
Sbjct: 183 CPNGQGGYSGAGDRKCYKCGESGHMSRECP------SAGSTGSGDRACYKCGKPGHISRE 236

Query: 146 C 146
           C
Sbjct: 237 C 237



 Score = 68.6 bits (166), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 56/131 (42%), Gaps = 27/131 (20%)

Query: 9   CYNCFDFGHYQYSCPQKSSA-DARGDKVGIVCYKCNNYGHFAREC--------ATESVTC 59
           CY C D GH    CP         GD+    CYKC + GH +R+C              C
Sbjct: 142 CYKCGDAGHISRDCPNGQGGYSGAGDRT---CYKCGDAGHISRDCPNGQGGYSGAGDRKC 198

Query: 60  YNCSGQGHVAKDCTVKSSI-----ICYNCNSSGHFARNCPN----------DSSKRCYAC 104
           Y C   GH++++C    S       CY C   GH +R CP              + CY C
Sbjct: 199 YKCGESGHMSRECPSAGSTGSGDRACYKCGKPGHISRECPEAGGSYGGSRGGGDRTCYKC 258

Query: 105 HQAGHMAKECP 115
            +AGH++++CP
Sbjct: 259 GEAGHISRDCP 269



 Score = 58.2 bits (139), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 46/111 (41%), Gaps = 20/111 (18%)

Query: 2   SSTSTIQCYNCFDFGHYQYSCPQKSSA-DARGDKVGIVCYKCNNYGHFARECATESVT-- 58
           S      CY C D GH    CP         GD+    CYKC   GH +REC +   T  
Sbjct: 163 SGAGDRTCYKCGDAGHISRDCPNGQGGYSGAGDRK---CYKCGESGHMSRECPSAGSTGS 219

Query: 59  ----CYNCSGQGHVAKDC----------TVKSSIICYNCNSSGHFARNCPN 95
               CY C   GH++++C                 CY C  +GH +R+CP+
Sbjct: 220 GDRACYKCGKPGHISRECPEAGGSYGGSRGGGDRTCYKCGEAGHISRDCPS 270


>sp|O65639|CSP1_ARATH Cold shock protein 1 OS=Arabidopsis thaliana GN=CSP1 PE=2 SV=1
          Length = 299

 Score = 88.6 bits (218), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 76/170 (44%), Gaps = 34/170 (20%)

Query: 9   CYNCFDFGHYQYSCPQKSSADARGDKVG--IVCYKCNNYGHFARECATESV--------- 57
           CYNC D GH+   C    + D RG   G    CY C + GH AR+C  +SV         
Sbjct: 134 CYNCGDTGHFARDCTSAGNGDQRGATKGGNDGCYTCGDVGHVARDCTQKSVGNGDQRGAV 193

Query: 58  -----TCYNCSGQGHVAKDCTVK-----------SSIICYNCNSSGHFARNCPNDS--SK 99
                 CY C   GH A+DCT K            S  CY+C   GH AR+C      S+
Sbjct: 194 KGGNDGCYTCGDVGHFARDCTQKVAAGNVRSGGGGSGTCYSCGGVGHIARDCATKRQPSR 253

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCKLV 149
            CY C  +GH+A++C  + +G            CY CG +GH + +C  V
Sbjct: 254 GCYQCGGSGHLARDCDQRGSGGGGNDNA-----CYKCGKEGHFARECSSV 298



 Score = 73.6 bits (179), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 58/176 (32%), Positives = 73/176 (41%), Gaps = 47/176 (26%)

Query: 9   CYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFARECATE----------- 55
           CYNC + GH    C          R  + G  CY C + GHFAR+C +            
Sbjct: 102 CYNCGELGHISKDCGIGGGGGGGERRSRGGEGCYNCGDTGHFARDCTSAGNGDQRGATKG 161

Query: 56  -SVTCYNCSGQGHVAKDCTVKS-------------SIICYNCNSSGHFARNCP------- 94
            +  CY C   GHVA+DCT KS             +  CY C   GHFAR+C        
Sbjct: 162 GNDGCYTCGDVGHVARDCTQKSVGNGDQRGAVKGGNDGCYTCGDVGHFARDCTQKVAAGN 221

Query: 95  ----NDSSKRCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
                  S  CY+C   GH+A++C         +P    S  CY CG  GHL+ DC
Sbjct: 222 VRSGGGGSGTCYSCGGVGHIARDCA-----TKRQP----SRGCYQCGGSGHLARDC 268


>sp|Q3T0Q6|CNBP_BOVIN Cellular nucleic acid-binding protein OS=Bos taurus GN=CNBP PE=2
           SV=1
          Length = 170

 Score = 88.2 bits (217), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 50/165 (30%), Positives = 76/165 (46%), Gaps = 21/165 (12%)

Query: 5   STIQCYNCFDFGHYQYSCPQKSSADAR-----------GDKVGIVCYKCNNYGHFARECA 53
           S+ +C+ C   GH+   CP                      +  +CY+C   GH A++C 
Sbjct: 2   SSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGFQFVSSSLPDICYRCGESGHLAKDCD 61

Query: 54  TESVTCYNCSGQGHVAKDC---TVKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHM 110
            +   CYNC   GH+AKDC     +    CYNC   GH AR+C +   ++CY+C + GH+
Sbjct: 62  LQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYSCGEFGHI 121

Query: 111 AKECP-------GQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCKL 148
            K+C        G+T   +        + CY CG  GHL+ +C +
Sbjct: 122 QKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTI 166



 Score = 68.6 bits (166), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 45/136 (33%), Positives = 59/136 (43%), Gaps = 36/136 (26%)

Query: 39  CYKCNNYGHFARECATESV---------------------TCYNCSGQGHVAKDCTVKSS 77
           C+KC   GH+AREC T                         CY C   GH+AKDC ++  
Sbjct: 6   CFKCGRSGHWARECPTGGGRGRGMRSRGRGFQFVSSSLPDICYRCGESGHLAKDCDLQED 65

Query: 78  IICYNCNSSGHFARNCPNDSSKR---CYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCY 134
             CYNC   GH A++C     +R   CY C + GH+A++C      K           CY
Sbjct: 66  A-CYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQK-----------CY 113

Query: 135 VCGHQGHLSYDCKLVQ 150
            CG  GH+  DC  V+
Sbjct: 114 SCGEFGHIQKDCTKVK 129



 Score = 48.9 bits (115), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 13/71 (18%)

Query: 8   QCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECA-TESVTCYNCSGQG 66
           +CY+C +FGH Q  C +            + CY+C   GH A  C+ T  V CY C   G
Sbjct: 111 KCYSCGEFGHIQKDCTK------------VKCYRCGETGHVAINCSKTSEVNCYRCGESG 158

Query: 67  HVAKDCTVKSS 77
           H+A++CT++++
Sbjct: 159 HLARECTIEAT 169


>sp|P62634|CNBP_RAT Cellular nucleic acid-binding protein OS=Rattus norvegicus GN=Cnbp
           PE=2 SV=1
          Length = 177

 Score = 86.3 bits (212), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 52/172 (30%), Positives = 78/172 (45%), Gaps = 28/172 (16%)

Query: 5   STIQCYNCFDFGHYQYSCPQKSSADA-------------RG-----DKVGIVCYKCNNYG 46
           S+ +C+ C   GH+   CP                    RG       +  +CY+C   G
Sbjct: 2   SSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESG 61

Query: 47  HFARECATESVTCYNCSGQGHVAKDC---TVKSSIICYNCNSSGHFARNCPNDSSKRCYA 103
           H A++C  +   CYNC   GH+AKDC     +    CYNC   GH AR+C +   ++CY+
Sbjct: 62  HLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYS 121

Query: 104 CHQAGHMAKECP-------GQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCKL 148
           C + GH+ K+C        G+T   +        + CY CG  GHL+ +C +
Sbjct: 122 CGEFGHIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTI 173



 Score = 49.3 bits (116), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 13/71 (18%)

Query: 8   QCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECA-TESVTCYNCSGQG 66
           +CY+C +FGH Q  C +            + CY+C   GH A  C+ T  V CY C   G
Sbjct: 118 KCYSCGEFGHIQKDCTK------------VKCYRCGETGHVAINCSKTSEVNCYRCGESG 165

Query: 67  HVAKDCTVKSS 77
           H+A++CT++++
Sbjct: 166 HLARECTIEAT 176


>sp|Q5R5R5|CNBP_PONAB Cellular nucleic acid-binding protein OS=Pongo abelii GN=CNBP PE=2
           SV=1
          Length = 177

 Score = 86.3 bits (212), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 52/172 (30%), Positives = 78/172 (45%), Gaps = 28/172 (16%)

Query: 5   STIQCYNCFDFGHYQYSCPQKSSADA-------------RG-----DKVGIVCYKCNNYG 46
           S+ +C+ C   GH+   CP                    RG       +  +CY+C   G
Sbjct: 2   SSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESG 61

Query: 47  HFARECATESVTCYNCSGQGHVAKDC---TVKSSIICYNCNSSGHFARNCPNDSSKRCYA 103
           H A++C  +   CYNC   GH+AKDC     +    CYNC   GH AR+C +   ++CY+
Sbjct: 62  HLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYS 121

Query: 104 CHQAGHMAKECP-------GQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCKL 148
           C + GH+ K+C        G+T   +        + CY CG  GHL+ +C +
Sbjct: 122 CGEFGHIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTI 173



 Score = 49.3 bits (116), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 13/71 (18%)

Query: 8   QCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECA-TESVTCYNCSGQG 66
           +CY+C +FGH Q  C +            + CY+C   GH A  C+ T  V CY C   G
Sbjct: 118 KCYSCGEFGHIQKDCTK------------VKCYRCGETGHVAINCSKTSEVNCYRCGESG 165

Query: 67  HVAKDCTVKSS 77
           H+A++CT++++
Sbjct: 166 HLARECTIEAT 176


>sp|P62633|CNBP_HUMAN Cellular nucleic acid-binding protein OS=Homo sapiens GN=CNBP PE=1
           SV=1
          Length = 177

 Score = 86.3 bits (212), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 52/172 (30%), Positives = 78/172 (45%), Gaps = 28/172 (16%)

Query: 5   STIQCYNCFDFGHYQYSCPQKSSADA-------------RG-----DKVGIVCYKCNNYG 46
           S+ +C+ C   GH+   CP                    RG       +  +CY+C   G
Sbjct: 2   SSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESG 61

Query: 47  HFARECATESVTCYNCSGQGHVAKDC---TVKSSIICYNCNSSGHFARNCPNDSSKRCYA 103
           H A++C  +   CYNC   GH+AKDC     +    CYNC   GH AR+C +   ++CY+
Sbjct: 62  HLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYS 121

Query: 104 CHQAGHMAKECP-------GQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCKL 148
           C + GH+ K+C        G+T   +        + CY CG  GHL+ +C +
Sbjct: 122 CGEFGHIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTI 173



 Score = 49.3 bits (116), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 13/71 (18%)

Query: 8   QCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECA-TESVTCYNCSGQG 66
           +CY+C +FGH Q  C +            + CY+C   GH A  C+ T  V CY C   G
Sbjct: 118 KCYSCGEFGHIQKDCTK------------VKCYRCGETGHVAINCSKTSEVNCYRCGESG 165

Query: 67  HVAKDCTVKSS 77
           H+A++CT++++
Sbjct: 166 HLARECTIEAT 176


>sp|O42395|CNBP_CHICK Cellular nucleic acid-binding protein OS=Gallus gallus GN=CNBP PE=2
           SV=1
          Length = 172

 Score = 85.1 bits (209), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 51/167 (30%), Positives = 76/167 (45%), Gaps = 23/167 (13%)

Query: 5   STIQCYNCFDFGHYQYSCPQKSSADAR------------GDKVGIVCYKCNNYGHFAREC 52
           S+ +C+ C   GH+   CP                       +  +CY+C   GH A++C
Sbjct: 2   SSNECFKCGRTGHWARECPTGIGRGRGMRSRGRAGFQFMSSSLPDICYRCGESGHLAKDC 61

Query: 53  -ATESVTCYNCSGQGHVAKDCT---VKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAG 108
              E   CYNC   GH+AKDC     +    CYNC   GH AR+C +   ++CY+C + G
Sbjct: 62  DLQEDKACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYSCGEFG 121

Query: 109 HMAKECP-------GQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCKL 148
           H+ K+C        G+T   +        + CY CG  GHL+ +C +
Sbjct: 122 HIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTI 168



 Score = 49.3 bits (116), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 13/71 (18%)

Query: 8   QCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECA-TESVTCYNCSGQG 66
           +CY+C +FGH Q  C +            + CY+C   GH A  C+ T  V CY C   G
Sbjct: 113 KCYSCGEFGHIQKDCTK------------VKCYRCGETGHVAINCSKTSEVNCYRCGESG 160

Query: 67  HVAKDCTVKSS 77
           H+A++CT++++
Sbjct: 161 HLARECTIEAT 171


>sp|P53996|CNBP_MOUSE Cellular nucleic acid-binding protein OS=Mus musculus GN=Cnbp PE=2
           SV=2
          Length = 178

 Score = 83.6 bits (205), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 53/173 (30%), Positives = 78/173 (45%), Gaps = 29/173 (16%)

Query: 5   STIQCYNCFDFGHYQYSCPQKSSADA-------------RG-----DKVGIVCYKCNNYG 46
           S+ +C+ C   GH+   CP                    RG       +  +CY+C   G
Sbjct: 2   SSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESG 61

Query: 47  HFAREC-ATESVTCYNCSGQGHVAKDCT---VKSSIICYNCNSSGHFARNCPNDSSKRCY 102
           H A++C   E   CYNC   GH+AKDC     +    CYNC   GH AR+C +   ++CY
Sbjct: 62  HLAKDCDLQEDEACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCY 121

Query: 103 ACHQAGHMAKECP-------GQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCKL 148
           +C + GH+ K+C        G+T   +        + CY CG  GHL+ +C +
Sbjct: 122 SCGEFGHIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTI 174



 Score = 49.3 bits (116), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 13/71 (18%)

Query: 8   QCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECA-TESVTCYNCSGQG 66
           +CY+C +FGH Q  C +            + CY+C   GH A  C+ T  V CY C   G
Sbjct: 119 KCYSCGEFGHIQKDCTK------------VKCYRCGETGHVAINCSKTSEVNCYRCGESG 166

Query: 67  HVAKDCTVKSS 77
           H+A++CT++++
Sbjct: 167 HLARECTIEAT 177


>sp|P53849|GIS2_YEAST Zinc finger protein GIS2 OS=Saccharomyces cerevisiae (strain ATCC
           204508 / S288c) GN=GIS2 PE=1 SV=1
          Length = 153

 Score = 77.0 bits (188), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/118 (37%), Positives = 62/118 (52%), Gaps = 24/118 (20%)

Query: 8   QCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECAT-------ESVTCY 60
           QCYNC + GH +  C  +             C+ CN  GH +REC           V+CY
Sbjct: 48  QCYNCGETGHVRSECTVQR------------CFNCNQTGHISRECPEPKKTSRFSKVSCY 95

Query: 61  NCSGQGHVAKDCTVK---SSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECP 115
            C G  H+AKDC  +   S + CY C  +GH +R+C ND  + CY C++ GH++K+CP
Sbjct: 96  KCGGPNHMAKDCMKEDGISGLKCYTCGQAGHMSRDCQND--RLCYNCNETGHISKDCP 151



 Score = 76.6 bits (187), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 43/113 (38%), Positives = 63/113 (55%), Gaps = 14/113 (12%)

Query: 38  VCYKCNNYGHFARECATESVTCYNCSGQGHVAKDCTVKSSII---CYNCNSSGHFARNCP 94
            CY C   GH A +C +E + CYNC+  GHV  DCT+  ++    CYNC  +GH    C 
Sbjct: 5   ACYVCGKIGHLAEDCDSERL-CYNCNKPGHVQTDCTMPRTVEFKQCYNCGETGHVRSEC- 62

Query: 95  NDSSKRCYACHQAGHMAKECPGQTAGKSPEPVVDMS-LTCYVCGHQGHLSYDC 146
             + +RC+ C+Q GH+++ECP       P+     S ++CY CG   H++ DC
Sbjct: 63  --TVQRCFNCNQTGHISRECP------EPKKTSRFSKVSCYKCGGPNHMAKDC 107



 Score = 76.6 bits (187), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 64/149 (42%), Gaps = 32/149 (21%)

Query: 9   CYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECA----TESVTCYNCSG 64
           CY C   GH    C  +            +CY CN  GH   +C      E   CYNC  
Sbjct: 6   CYVCGKIGHLAEDCDSER-----------LCYNCNKPGHVQTDCTMPRTVEFKQCYNCGE 54

Query: 65  QGHVAKDCTVKSSIICYNCNSSGHFARNCPND------SSKRCYACHQAGHMAKECPGQT 118
            GHV  +CTV+    C+NCN +GH +R CP        S   CY C    HMAK+C  + 
Sbjct: 55  TGHVRSECTVQR---CFNCNQTGHISRECPEPKKTSRFSKVSCYKCGGPNHMAKDCMKED 111

Query: 119 AGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
                       L CY CG  GH+S DC+
Sbjct: 112 GISG--------LKCYTCGQAGHMSRDCQ 132



 Score = 70.9 bits (172), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 65/147 (44%), Gaps = 30/147 (20%)

Query: 9   CYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECATESVTCYNCSGQGHV 68
           CYNC   GH Q  C    + + +       CY C   GH   EC  +   C+NC+  GH+
Sbjct: 25  CYNCNKPGHVQTDCTMPRTVEFK------QCYNCGETGHVRSECTVQR--CFNCNQTGHI 76

Query: 69  AKDCTVK------SSIICYNCNSSGHFARNCPND---SSKRCYACHQAGHMAKECPGQTA 119
           +++C         S + CY C    H A++C  +   S  +CY C QAGHM+++C     
Sbjct: 77  SRECPEPKKTSRFSKVSCYKCGGPNHMAKDCMKEDGISGLKCYTCGQAGHMSRDCQN--- 133

Query: 120 GKSPEPVVDMSLTCYVCGHQGHLSYDC 146
                        CY C   GH+S DC
Sbjct: 134 ----------DRLCYNCNETGHISKDC 150



 Score = 64.3 bits (155), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 37/99 (37%), Positives = 55/99 (55%), Gaps = 10/99 (10%)

Query: 1   MSSTSTIQ-CYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECATES--- 56
           + S  T+Q C+NC   GH    CP+     +R  KV   CYKC    H A++C  E    
Sbjct: 58  VRSECTVQRCFNCNQTGHISRECPEPKKT-SRFSKVS--CYKCGGPNHMAKDCMKEDGIS 114

Query: 57  -VTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCP 94
            + CY C   GH+++DC  ++  +CYNCN +GH +++CP
Sbjct: 115 GLKCYTCGQAGHMSRDC--QNDRLCYNCNETGHISKDCP 151



 Score = 31.2 bits (69), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 17/52 (32%), Positives = 23/52 (44%), Gaps = 13/52 (25%)

Query: 97  SSKRCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCKL 148
           S K CY C + GH+A++C             D    CY C   GH+  DC +
Sbjct: 2   SQKACYVCGKIGHLAEDC-------------DSERLCYNCNKPGHVQTDCTM 40


>sp|P36627|BYR3_SCHPO Cellular nucleic acid-binding protein homolog
           OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
           GN=byr3 PE=4 SV=1
          Length = 179

 Score = 76.6 bits (187), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 48/126 (38%), Positives = 68/126 (53%), Gaps = 11/126 (8%)

Query: 34  KVGIVCYKCNNYGHFARECATESVTCYNCSGQGHVAKDCT-VKSSIICYNCNSSGHFARN 92
           + G  CY C   GH AREC   S+ CYNC+  GH A +CT  +    CY C ++GH  R+
Sbjct: 14  RPGPRCYNCGENGHQARECTKGSI-CYNCNQTGHKASECTEPQQEKTCYACGTAGHLVRD 72

Query: 93  CPNDSSKR----CYACHQAGHMAKECPG---QTAGKSPEPVVDMSLTCYVCGHQGHLSYD 145
           CP+  + R    CY C + GH+A++C     Q+ G+      +M+  CY CG  GH + D
Sbjct: 73  CPSSPNPRQGAECYKCGRVGHIARDCRTNGQQSGGRFGGHRSNMN--CYACGSYGHQARD 130

Query: 146 CKLVQK 151
           C +  K
Sbjct: 131 CTMGVK 136



 Score = 73.9 bits (180), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 51/158 (32%), Positives = 68/158 (43%), Gaps = 42/158 (26%)

Query: 8   QCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECAT--ESVTCYNCSGQ 65
           +CYNC + GH    C +           G +CY CN  GH A EC    +  TCY C   
Sbjct: 18  RCYNCGENGHQARECTK-----------GSICYNCNQTGHKASECTEPQQEKTCYACGTA 66

Query: 66  GHVAKDC----TVKSSIICYNCNSSGHFARNCPND------------SSKRCYACHQAGH 109
           GH+ +DC      +    CY C   GH AR+C  +            S+  CYAC   GH
Sbjct: 67  GHLVRDCPSSPNPRQGAECYKCGRVGHIARDCRTNGQQSGGRFGGHRSNMNCYACGSYGH 126

Query: 110 MAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
            A++C              M + CY CG  GH S++C+
Sbjct: 127 QARDC-------------TMGVKCYSCGKIGHRSFECQ 151



 Score = 64.3 bits (155), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 46/155 (29%), Positives = 65/155 (41%), Gaps = 37/155 (23%)

Query: 9   CYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECAT-----ESVTCYNCS 63
           CYNC   GH    C +              CY C   GH  R+C +     +   CY C 
Sbjct: 38  CYNCNQTGHKASECTEPQQEK--------TCYACGTAGHLVRDCPSSPNPRQGAECYKCG 89

Query: 64  GQGHVAKDCTV------------KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMA 111
             GH+A+DC              +S++ CY C S GH AR+C      +CY+C + GH +
Sbjct: 90  RVGHIARDCRTNGQQSGGRFGGHRSNMNCYACGSYGHQARDC--TMGVKCYSCGKIGHRS 147

Query: 112 KECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
            EC   + G+           CY C   GH++ +C
Sbjct: 148 FECQQASDGQ----------LCYKCNQPGHIAVNC 172



 Score = 41.6 bits (96), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 35/71 (49%), Gaps = 13/71 (18%)

Query: 5   STIQCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFAREC--ATESVTCYNC 62
           S + CY C  +GH            AR   +G+ CY C   GH + EC  A++   CY C
Sbjct: 114 SNMNCYACGSYGH-----------QARDCTMGVKCYSCGKIGHRSFECQQASDGQLCYKC 162

Query: 63  SGQGHVAKDCT 73
           +  GH+A +CT
Sbjct: 163 NQPGHIAVNCT 173



 Score = 37.4 bits (85), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 26/54 (48%), Gaps = 8/54 (14%)

Query: 4   TSTIQCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECATESV 57
           T  ++CY+C   GH  + C Q S         G +CYKCN  GH A  C +  +
Sbjct: 132 TMGVKCYSCGKIGHRSFECQQASD--------GQLCYKCNQPGHIAVNCTSPVI 177


>sp|O76743|GLH4_CAEEL ATP-dependent RNA helicase glh-4 OS=Caenorhabditis elegans GN=glh-4
           PE=2 SV=2
          Length = 1156

 Score = 70.5 bits (171), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 41/116 (35%), Positives = 49/116 (42%), Gaps = 15/116 (12%)

Query: 39  CYKCNNYGHFARECATESVT---CYNCSGQGHVAKDCTVKSSI--ICYNCNSSGHFARNC 93
           C+ C   GH ++EC    V    C NC   GH A DC         C NC   GHFA +C
Sbjct: 572 CHNCGEEGHISKECDKPKVPRFPCRNCEQLGHFASDCDQPRVPRGPCRNCGIEGHFAVDC 631

Query: 94  PNDSSKR--CYACHQAGHMAKECPGQTAGKSP-EPVVDMSLTCYVCGHQGHLSYDC 146
                 R  C  C Q GH AK+C  +     P EP       C  C  +GH  Y+C
Sbjct: 632 DQPKVPRGPCRNCGQEGHFAKDCQNERVRMEPTEP-------CRRCAEEGHWGYEC 680



 Score = 66.6 bits (161), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 38/117 (32%), Positives = 49/117 (41%), Gaps = 17/117 (14%)

Query: 9   CYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECATESVT---CYNCSGQ 65
           C+NC + GH    C +              C  C   GHFA +C    V    C NC  +
Sbjct: 572 CHNCGEEGHISKECDKPKVPR-------FPCRNCEQLGHFASDCDQPRVPRGPCRNCGIE 624

Query: 66  GHVAKDCTVKSSI--ICYNCNSSGHFARNCPNDS-----SKRCYACHQAGHMAKECP 115
           GH A DC         C NC   GHFA++C N+      ++ C  C + GH   ECP
Sbjct: 625 GHFAVDCDQPKVPRGPCRNCGQEGHFAKDCQNERVRMEPTEPCRRCAEEGHWGYECP 681



 Score = 58.5 bits (140), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 30/93 (32%), Positives = 41/93 (44%), Gaps = 13/93 (13%)

Query: 59  CYNCSGQGHVAKDCTVKS--SIICYNCNSSGHFARNCPNDSSKR--CYACHQAGHMAKEC 114
           C+NC  +GH++K+C         C NC   GHFA +C      R  C  C   GH A +C
Sbjct: 572 CHNCGEEGHISKECDKPKVPRFPCRNCEQLGHFASDCDQPRVPRGPCRNCGIEGHFAVDC 631

Query: 115 PGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
                 + P         C  CG +GH + DC+
Sbjct: 632 DQPKVPRGP---------CRNCGQEGHFAKDCQ 655


>sp|Q8WW36|ZCH13_HUMAN Zinc finger CCHC domain-containing protein 13 OS=Homo sapiens
           GN=ZCCHC13 PE=2 SV=1
          Length = 166

 Score = 68.2 bits (165), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 52/112 (46%), Gaps = 17/112 (15%)

Query: 38  VCYKCNNYGHFARECATESVTCYNCSGQGHVAKDCT---VKSSIICYNCNSSGHFARNCP 94
            CY C   G  A+ C      CYNC   GH+AKDC     +    CY C   GH AR+C 
Sbjct: 46  TCYCCGESGRNAKNCVLLGNICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCD 105

Query: 95  NDSSKRCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
               ++CY+C + GH+ K+C                + CY CG  GH++ +C
Sbjct: 106 RQKEQKCYSCGKLGHIQKDCA--------------QVKCYRCGEIGHVAINC 143



 Score = 59.3 bits (142), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 40/139 (28%), Positives = 60/139 (43%), Gaps = 21/139 (15%)

Query: 2   SSTSTIQCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECATESVT--- 58
           S+T +  CY C + G    +C            +G +CY C   GH A++C         
Sbjct: 40  STTLSYTCYCCGESGRNAKNCVL----------LGNICYNCGRSGHIAKDCKDPKRERRQ 89

Query: 59  -CYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKEC--- 114
            CY C   GH+A+DC  +    CY+C   GH  ++C   +  +CY C + GH+A  C   
Sbjct: 90  HCYTCGRLGHLARDCDRQKEQKCYSCGKLGHIQKDC---AQVKCYRCGEIGHVAINCSKA 146

Query: 115 -PGQTAGKSPEPVVDMSLT 132
            PGQ       P     ++
Sbjct: 147 RPGQLLPLRQIPTSSQGMS 165


>sp|Q94C69|CSP3_ARATH Cold shock domain-containing protein 3 OS=Arabidopsis thaliana
           GN=CSP3 PE=2 SV=1
          Length = 301

 Score = 65.5 bits (158), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 57/183 (31%), Positives = 67/183 (36%), Gaps = 42/183 (22%)

Query: 2   SSTSTIQCYNCFDFGHYQYSCPQKSSADARGDKVGIV------CYKCNNYGHFARECATE 55
           S  S   C+NC + GH    C   S   + G   G        CY C + GHFAR+C   
Sbjct: 89  SRGSGGNCFNCGEVGHMAKDCDGGSGGKSFGGGGGRRSGGEGECYMCGDVGHFARDCRQS 148

Query: 56  SVT-----------CYNCSGQGHVAKDC--------------TVKSSIICYNCNSSGHFA 90
                         CY+C   GH+AKDC                     CY C   GHFA
Sbjct: 149 GGGNSGGGGGGGRPCYSCGEVGHLAKDCRGGSGGNRYGGGGGRGSGGDGCYMCGGVGHFA 208

Query: 91  RNCPN-------DSSKRCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLS 143
           R+C              CY C   GH+AK C      K P         CY CG  GHL+
Sbjct: 209 RDCRQNGGGNVGGGGSTCYTCGGVGHIAKVC----TSKIPSGGGGGGRACYECGGTGHLA 264

Query: 144 YDC 146
            DC
Sbjct: 265 RDC 267



 Score = 61.2 bits (147), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 57/135 (42%), Gaps = 29/135 (21%)

Query: 9   CYNCFDFGHYQYSCPQKSSADARGDKVGIV-----CYKCNNYGHFAREC--------ATE 55
           CY+C + GH    C   S  +  G   G       CY C   GHFAR+C           
Sbjct: 163 CYSCGEVGHLAKDCRGGSGGNRYGGGGGRGSGGDGCYMCGGVGHFARDCRQNGGGNVGGG 222

Query: 56  SVTCYNCSGQGHVAKDCTVK-------SSIICYNCNSSGHFARNCPN---------DSSK 99
             TCY C G GH+AK CT K           CY C  +GH AR+C             S 
Sbjct: 223 GSTCYTCGGVGHIAKVCTSKIPSGGGGGGRACYECGGTGHLARDCDRRGSGSSGGGGGSN 282

Query: 100 RCYACHQAGHMAKEC 114
           +C+ C + GH A+EC
Sbjct: 283 KCFICGKEGHFAREC 297



 Score = 60.8 bits (146), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 49/171 (28%), Positives = 68/171 (39%), Gaps = 32/171 (18%)

Query: 9   CYNCFDFGHYQYSCPQKSSADARGDKVGIV-CYKCNNYGHFARECATESVT--------- 58
           CY C D GH+   C Q    ++ G   G   CY C   GH A++C   S           
Sbjct: 132 CYMCGDVGHFARDCRQSGGGNSGGGGGGGRPCYSCGEVGHLAKDCRGGSGGNRYGGGGGR 191

Query: 59  ------CYNCSGQGHVAKDC-------TVKSSIICYNCNSSGHFARNCPND-------SS 98
                 CY C G GH A+DC              CY C   GH A+ C +          
Sbjct: 192 GSGGDGCYMCGGVGHFARDCRQNGGGNVGGGGSTCYTCGGVGHIAKVCTSKIPSGGGGGG 251

Query: 99  KRCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCKLV 149
           + CY C   GH+A++C  + +          S  C++CG +GH + +C  V
Sbjct: 252 RACYECGGTGHLARDCDRRGS--GSSGGGGGSNKCFICGKEGHFARECTSV 300



 Score = 49.7 bits (117), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 46/117 (39%), Gaps = 28/117 (23%)

Query: 59  CYNCSGQGHVAKDC---------------TVKSSIICYNCNSSGHFARNCPNDSS----- 98
           C+NC   GH+AKDC                      CY C   GHFAR+C          
Sbjct: 96  CFNCGEVGHMAKDCDGGSGGKSFGGGGGRRSGGEGECYMCGDVGHFARDCRQSGGGNSGG 155

Query: 99  -----KRCYACHQAGHMAKEC---PGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
                + CY+C + GH+AK+C    G                CY+CG  GH + DC+
Sbjct: 156 GGGGGRPCYSCGEVGHLAKDCRGGSGGNRYGGGGGRGSGGDGCYMCGGVGHFARDCR 212



 Score = 32.3 bits (72), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 28/61 (45%), Gaps = 4/61 (6%)

Query: 91  RNCPNDSSKRCYACHQAGHMAKECP----GQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
            N    S   C+ C + GHMAK+C     G++ G            CY+CG  GH + DC
Sbjct: 86  ENSSRGSGGNCFNCGEVGHMAKDCDGGSGGKSFGGGGGRRSGGEGECYMCGDVGHFARDC 145

Query: 147 K 147
           +
Sbjct: 146 R 146


>sp|P05895|POL_SIVVT Gag-Pol polyprotein OS=Simian immunodeficiency virus agm.vervet
           (isolate AGM TYO-1) GN=gag-pol PE=3 SV=2
          Length = 1467

 Score = 54.3 bits (129), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 19/43 (44%), Positives = 24/43 (55%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQ 117
           +  + CYNC   GH  R CP     RC  C + GH+AK+C GQ
Sbjct: 399 RPPVKCYNCGKFGHMQRQCPEPRKMRCLKCGKPGHLAKDCRGQ 441



 Score = 43.9 bits (102), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 17/46 (36%), Positives = 23/46 (50%), Gaps = 9/46 (19%)

Query: 7   IQCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           ++CYNC  FGH Q  CP+            + C KC   GH A++C
Sbjct: 402 VKCYNCGKFGHMQRQCPEPRK---------MRCLKCGKPGHLAKDC 438



 Score = 42.7 bits (99), Expect = 0.001,   Method: Composition-based stats.
 Identities = 18/48 (37%), Positives = 25/48 (52%), Gaps = 11/48 (22%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
           +CY C + GHM ++C        PEP     + C  CG  GHL+ DC+
Sbjct: 403 KCYNCGKFGHMQRQC--------PEP---RKMRCLKCGKPGHLAKDCR 439



 Score = 41.2 bits (95), Expect = 0.002,   Method: Composition-based stats.
 Identities = 14/37 (37%), Positives = 19/37 (51%)

Query: 57  VTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNC 93
           V CYNC   GH+ + C     + C  C   GH A++C
Sbjct: 402 VKCYNCGKFGHMQRQCPEPRKMRCLKCGKPGHLAKDC 438


>sp|P27980|POL_SIVVG Gag-Pol polyprotein OS=Simian immunodeficiency virus agm.vervet
           (isolate AGM3) GN=gag-pol PE=3 SV=2
          Length = 1465

 Score = 54.3 bits (129), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 19/43 (44%), Positives = 24/43 (55%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQ 117
           +  + CYNC   GH  R CP     RC  C + GH+AK+C GQ
Sbjct: 399 RPPVKCYNCGKFGHMQRQCPEPRKMRCLKCGKPGHLAKDCRGQ 441



 Score = 43.9 bits (102), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 17/46 (36%), Positives = 23/46 (50%), Gaps = 9/46 (19%)

Query: 7   IQCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           ++CYNC  FGH Q  CP+            + C KC   GH A++C
Sbjct: 402 VKCYNCGKFGHMQRQCPEPRK---------MRCLKCGKPGHLAKDC 438



 Score = 42.7 bits (99), Expect = 0.001,   Method: Composition-based stats.
 Identities = 18/48 (37%), Positives = 25/48 (52%), Gaps = 11/48 (22%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
           +CY C + GHM ++C        PEP     + C  CG  GHL+ DC+
Sbjct: 403 KCYNCGKFGHMQRQC--------PEP---RKMRCLKCGKPGHLAKDCR 439



 Score = 41.2 bits (95), Expect = 0.002,   Method: Composition-based stats.
 Identities = 14/37 (37%), Positives = 19/37 (51%)

Query: 57  VTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNC 93
           V CYNC   GH+ + C     + C  C   GH A++C
Sbjct: 402 VKCYNCGKFGHMQRQCPEPRKMRCLKCGKPGHLAKDC 438


>sp|P17283|POL_SIVCZ Gag-Pol polyprotein OS=Simian immunodeficiency virus (isolate CPZ
           GAB1) GN=gag-pol PE=3 SV=2
          Length = 1384

 Score = 53.1 bits (126), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 20/43 (46%), Positives = 24/43 (55%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQ 117
           K  I C+NC   GH ARNC     K C+ C Q GH  K+C G+
Sbjct: 338 KRKIKCFNCGKEGHLARNCKAPRRKGCWRCGQEGHQMKDCTGR 380



 Score = 44.3 bits (103), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 19/43 (44%), Positives = 22/43 (51%), Gaps = 1/43 (2%)

Query: 32  GDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCT 73
           G K  I C+ C   GH AR C A     C+ C  +GH  KDCT
Sbjct: 336 GPKRKIKCFNCGKEGHLARNCKAPRRKGCWRCGQEGHQMKDCT 378



 Score = 41.2 bits (95), Expect = 0.003,   Method: Composition-based stats.
 Identities = 12/37 (32%), Positives = 21/37 (56%)

Query: 57  VTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNC 93
           + C+NC  +GH+A++C       C+ C   GH  ++C
Sbjct: 341 IKCFNCGKEGHLARNCKAPRRKGCWRCGQEGHQMKDC 377



 Score = 35.4 bits (80), Expect = 0.14,   Method: Composition-based stats.
 Identities = 15/47 (31%), Positives = 23/47 (48%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C+ C + GH+A+ C      K+P         C+ CG +GH   DC
Sbjct: 342 KCFNCGKEGHLARNC------KAPR-----RKGCWRCGQEGHQMKDC 377



 Score = 31.2 bits (69), Expect = 2.5,   Method: Composition-based stats.
 Identities = 14/56 (25%), Positives = 24/56 (42%), Gaps = 13/56 (23%)

Query: 7   IQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFARECATESVTCY 60
           I+C+NC   GH   +C  P++             C++C   GH  ++C    V  +
Sbjct: 341 IKCFNCGKEGHLARNCKAPRRKG-----------CWRCGQEGHQMKDCTGRQVNFF 385


>sp|P27978|GAG_SIVVG Gag polyprotein OS=Simian immunodeficiency virus agm.vervet
           (isolate AGM3) GN=gag PE=3 SV=1
          Length = 521

 Score = 52.8 bits (125), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 19/43 (44%), Positives = 24/43 (55%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQ 117
           +  + CYNC   GH  R CP     RC  C + GH+AK+C GQ
Sbjct: 399 RPPVKCYNCGKFGHMQRQCPEPRKMRCLKCGKPGHLAKDCRGQ 441



 Score = 42.7 bits (99), Expect = 0.001,   Method: Composition-based stats.
 Identities = 17/46 (36%), Positives = 23/46 (50%), Gaps = 9/46 (19%)

Query: 7   IQCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           ++CYNC  FGH Q  CP+            + C KC   GH A++C
Sbjct: 402 VKCYNCGKFGHMQRQCPEPRK---------MRCLKCGKPGHLAKDC 438



 Score = 40.8 bits (94), Expect = 0.003,   Method: Composition-based stats.
 Identities = 18/48 (37%), Positives = 25/48 (52%), Gaps = 11/48 (22%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
           +CY C + GHM ++CP        EP     + C  CG  GHL+ DC+
Sbjct: 403 KCYNCGKFGHMQRQCP--------EP---RKMRCLKCGKPGHLAKDCR 439


>sp|Q1A249|POL_SIVEK Gag-Pol polyprotein OS=Simian immunodeficiency virus (isolate
           EK505) GN=gag-pol PE=3 SV=3
          Length = 1448

 Score = 52.4 bits (124), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 18/41 (43%), Positives = 24/41 (58%)

Query: 74  VKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKEC 114
           ++ +I C+NC   GH ARNC     K C+ C Q GH  K+C
Sbjct: 389 IRKTIKCFNCGKEGHLARNCKAPRKKGCWKCGQEGHQMKDC 429



 Score = 42.4 bits (98), Expect = 0.001,   Method: Composition-based stats.
 Identities = 12/39 (30%), Positives = 23/39 (58%)

Query: 55  ESVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNC 93
           +++ C+NC  +GH+A++C       C+ C   GH  ++C
Sbjct: 391 KTIKCFNCGKEGHLARNCKAPRKKGCWKCGQEGHQMKDC 429



 Score = 42.0 bits (97), Expect = 0.002,   Method: Composition-based stats.
 Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 1/43 (2%)

Query: 31  RGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDC 72
           +G +  I C+ C   GH AR C A     C+ C  +GH  KDC
Sbjct: 387 KGIRKTIKCFNCGKEGHLARNCKAPRKKGCWKCGQEGHQMKDC 429



 Score = 36.6 bits (83), Expect = 0.060,   Method: Composition-based stats.
 Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 11/48 (22%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
           +C+ C + GH+A+ C      K+P         C+ CG +GH   DC+
Sbjct: 394 KCFNCGKEGHLARNC------KAPR-----KKGCWKCGQEGHQMKDCR 430



 Score = 34.3 bits (77), Expect = 0.33,   Method: Composition-based stats.
 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           TI+C+NC   GH   +C  P+K             C+KC   GH  ++C
Sbjct: 392 TIKCFNCGKEGHLARNCKAPRKKG-----------CWKCGQEGHQMKDC 429


>sp|Q89928|POL_HV2EH Gag-Pol polyprotein OS=Human immunodeficiency virus type 2 subtype
           B (isolate EHO) GN=gag-pol PE=3 SV=3
          Length = 1464

 Score = 52.4 bits (124), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 18/46 (39%), Positives = 27/46 (58%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTAG 120
           K ++ C+NC  +GH AR C     + C+ C Q GH+  +CP + AG
Sbjct: 384 KRTVTCWNCGKAGHTARQCKAPRRQGCWKCGQQGHIMSKCPERQAG 429



 Score = 48.9 bits (115), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 22/70 (31%), Positives = 29/70 (41%), Gaps = 9/70 (12%)

Query: 56  SVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECP 115
           +VTC+NC   GH A+ C       C+ C   GH    CP           QAG +     
Sbjct: 386 TVTCWNCGKAGHTARQCKAPRRQGCWKCGQQGHIMSKCPE---------RQAGFLRVRPL 436

Query: 116 GQTAGKSPEP 125
           G+ A + P P
Sbjct: 437 GKEASQFPRP 446



 Score = 43.1 bits (100), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 17/48 (35%), Positives = 24/48 (50%), Gaps = 1/48 (2%)

Query: 26  SSADARGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDC 72
           ++A  R  K  + C+ C   GH AR+C A     C+ C  QGH+   C
Sbjct: 376 AAAQPRAGKRTVTCWNCGKAGHTARQCKAPRRQGCWKCGQQGHIMSKC 423



 Score = 36.6 bits (83), Expect = 0.053,   Method: Composition-based stats.
 Identities = 22/65 (33%), Positives = 31/65 (47%), Gaps = 15/65 (23%)

Query: 85  SSGHFARNCPNDSSKR---CYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGH 141
           S+  FA   P  + KR   C+ C +AGH A++C      K+P         C+ CG QGH
Sbjct: 371 STNPFAAAQPR-AGKRTVTCWNCGKAGHTARQC------KAPR-----RQGCWKCGQQGH 418

Query: 142 LSYDC 146
           +   C
Sbjct: 419 IMSKC 423



 Score = 31.2 bits (69), Expect = 2.4,   Method: Composition-based stats.
 Identities = 14/49 (28%), Positives = 20/49 (40%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           T+ C+NC   GH    C  P++             C+KC   GH   +C
Sbjct: 386 TVTCWNCGKAGHTARQCKAPRRQG-----------CWKCGQQGHIMSKC 423


>sp|P27973|POL_SIVV1 Gag-Pol polyprotein OS=Simian immunodeficiency virus agm.vervet
           (isolate AGM155) GN=gag-pol PE=3 SV=2
          Length = 1470

 Score = 52.0 bits (123), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 18/38 (47%), Positives = 22/38 (57%)

Query: 80  CYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQ 117
           CYNC   GH  R CP     +C  C + GH+AK+C GQ
Sbjct: 400 CYNCGKFGHMQRQCPEPRKIKCLKCGKPGHLAKDCRGQ 437



 Score = 44.3 bits (103), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 18/45 (40%), Positives = 22/45 (48%), Gaps = 9/45 (20%)

Query: 8   QCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           +CYNC  FGH Q  CP+            I C KC   GH A++C
Sbjct: 399 KCYNCGKFGHMQRQCPEPRK---------IKCLKCGKPGHLAKDC 434



 Score = 43.5 bits (101), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 15/35 (42%), Positives = 19/35 (54%), Gaps = 1/35 (2%)

Query: 39  CYKCNNYGHFARECAT-ESVTCYNCSGQGHVAKDC 72
           CY C  +GH  R+C     + C  C   GH+AKDC
Sbjct: 400 CYNCGKFGHMQRQCPEPRKIKCLKCGKPGHLAKDC 434



 Score = 42.7 bits (99), Expect = 0.001,   Method: Composition-based stats.
 Identities = 18/48 (37%), Positives = 25/48 (52%), Gaps = 11/48 (22%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
           +CY C + GHM ++C        PEP     + C  CG  GHL+ DC+
Sbjct: 399 KCYNCGKFGHMQRQC--------PEP---RKIKCLKCGKPGHLAKDCR 435



 Score = 41.6 bits (96), Expect = 0.002,   Method: Composition-based stats.
 Identities = 14/35 (40%), Positives = 18/35 (51%)

Query: 59  CYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNC 93
           CYNC   GH+ + C     I C  C   GH A++C
Sbjct: 400 CYNCGKFGHMQRQCPEPRKIKCLKCGKPGHLAKDC 434


>sp|Q1A250|GAG_SIVEK Gag polyprotein OS=Simian immunodeficiency virus (isolate EK505)
           GN=gag PE=3 SV=3
          Length = 511

 Score = 52.0 bits (123), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 18/41 (43%), Positives = 24/41 (58%)

Query: 74  VKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKEC 114
           ++ +I C+NC   GH ARNC     K C+ C Q GH  K+C
Sbjct: 389 IRKTIKCFNCGKEGHLARNCKAPRKKGCWKCGQEGHQMKDC 429



 Score = 42.0 bits (97), Expect = 0.001,   Method: Composition-based stats.
 Identities = 12/39 (30%), Positives = 23/39 (58%)

Query: 55  ESVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNC 93
           +++ C+NC  +GH+A++C       C+ C   GH  ++C
Sbjct: 391 KTIKCFNCGKEGHLARNCKAPRKKGCWKCGQEGHQMKDC 429



 Score = 41.6 bits (96), Expect = 0.002,   Method: Composition-based stats.
 Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 1/43 (2%)

Query: 31  RGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDC 72
           +G +  I C+ C   GH AR C A     C+ C  +GH  KDC
Sbjct: 387 KGIRKTIKCFNCGKEGHLARNCKAPRKKGCWKCGQEGHQMKDC 429



 Score = 36.2 bits (82), Expect = 0.082,   Method: Composition-based stats.
 Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 11/48 (22%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
           +C+ C + GH+A+ C      K+P         C+ CG +GH   DC+
Sbjct: 394 KCFNCGKEGHLARNC------KAPR-----KKGCWKCGQEGHQMKDCR 430



 Score = 33.5 bits (75), Expect = 0.44,   Method: Composition-based stats.
 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           TI+C+NC   GH   +C  P+K             C+KC   GH  ++C
Sbjct: 392 TIKCFNCGKEGHLARNCKAPRKKG-----------CWKCGQEGHQMKDC 429


>sp|P17282|GAG_SIVCZ Gag polyprotein OS=Simian immunodeficiency virus (isolate CPZ GAB1)
           GN=gag PE=3 SV=1
          Length = 508

 Score = 52.0 bits (123), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 20/43 (46%), Positives = 24/43 (55%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQ 117
           K  I C+NC   GH ARNC     K C+ C Q GH  K+C G+
Sbjct: 397 KRKIKCFNCGKEGHLARNCKAPRRKGCWRCGQEGHQMKDCTGR 439



 Score = 42.7 bits (99), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 19/43 (44%), Positives = 22/43 (51%), Gaps = 1/43 (2%)

Query: 32  GDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCT 73
           G K  I C+ C   GH AR C A     C+ C  +GH  KDCT
Sbjct: 395 GPKRKIKCFNCGKEGHLARNCKAPRRKGCWRCGQEGHQMKDCT 437



 Score = 40.0 bits (92), Expect = 0.005,   Method: Composition-based stats.
 Identities = 12/37 (32%), Positives = 21/37 (56%)

Query: 57  VTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNC 93
           + C+NC  +GH+A++C       C+ C   GH  ++C
Sbjct: 400 IKCFNCGKEGHLARNCKAPRRKGCWRCGQEGHQMKDC 436



 Score = 34.3 bits (77), Expect = 0.33,   Method: Composition-based stats.
 Identities = 15/47 (31%), Positives = 23/47 (48%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C+ C + GH+A+ C      K+P         C+ CG +GH   DC
Sbjct: 401 KCFNCGKEGHLARNC------KAPR-----RKGCWRCGQEGHQMKDC 436



 Score = 30.0 bits (66), Expect = 6.1,   Method: Composition-based stats.
 Identities = 17/62 (27%), Positives = 27/62 (43%), Gaps = 16/62 (25%)

Query: 7   IQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFARECATESVTCYNCSG 64
           I+C+NC   GH   +C  P++             C++C   GH  ++C    V   N  G
Sbjct: 400 IKCFNCGKEGHLARNCKAPRRKG-----------CWRCGQEGHQMKDCTGRQV---NFLG 445

Query: 65  QG 66
           +G
Sbjct: 446 KG 447


>sp|O12158|POL_HV192 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype C (isolate 92BR025) GN=gag-pol PE=1 SV=2
          Length = 1431

 Score = 51.6 bits (122), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 28/82 (34%), Positives = 39/82 (47%), Gaps = 2/82 (2%)

Query: 39  CYKCNNYGHFARECATESVTCYNCSGQGHVAKDCT-VKSSIICYNCNSSGHFARNCPNDS 97
           C      GH AR  A E+++  N +       +C   K +I C+NC   GH ARNC    
Sbjct: 348 CQGVGGPGHKARVLA-EAMSKVNNTNIMMQRSNCKGPKRTIKCFNCGKEGHLARNCRAPR 406

Query: 98  SKRCYACHQAGHMAKECPGQTA 119
            K C+ C + GH  K+C  + A
Sbjct: 407 KKGCWKCGKEGHQVKDCTERQA 428



 Score = 47.0 bits (110), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 19/51 (37%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 28  ADARGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCTVKSS 77
           ++ +G K  I C+ C   GH AR C A     C+ C  +GH  KDCT + +
Sbjct: 378 SNCKGPKRTIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQVKDCTERQA 428



 Score = 35.4 bits (80), Expect = 0.14,   Method: Composition-based stats.
 Identities = 24/90 (26%), Positives = 39/90 (43%), Gaps = 16/90 (17%)

Query: 59  CYNCSGQGHVAKDCTVKSSIICYNCNSSGHFAR-NCPN-DSSKRCYACHQAGHMAKECPG 116
           C    G GH A+   V +  +    N++    R NC     + +C+ C + GH+A+ C  
Sbjct: 348 CQGVGGPGHKAR---VLAEAMSKVNNTNIMMQRSNCKGPKRTIKCFNCGKEGHLARNC-- 402

Query: 117 QTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
               ++P         C+ CG +GH   DC
Sbjct: 403 ----RAPR-----KKGCWKCGKEGHQVKDC 423



 Score = 34.3 bits (77), Expect = 0.28,   Method: Composition-based stats.
 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           TI+C+NC   GH   +C  P+K             C+KC   GH  ++C
Sbjct: 386 TIKCFNCGKEGHLARNCRAPRKKG-----------CWKCGKEGHQVKDC 423


>sp|Q77373|POL_HV1AN Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group O
           (isolate ANT70) GN=gag-pol PE=3 SV=3
          Length = 1435

 Score = 51.6 bits (122), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 19/40 (47%), Positives = 23/40 (57%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKEC 114
           K +I C+NC   GH ARNC     K C+ C Q GH  K+C
Sbjct: 391 KGTIKCFNCGKEGHIARNCRAPRKKGCWKCGQEGHQMKDC 430



 Score = 46.6 bits (109), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 17/64 (26%), Positives = 31/64 (48%)

Query: 56  SVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECP 115
           ++ C+NC  +GH+A++C       C+ C   GH  ++C N            GH A++  
Sbjct: 393 TIKCFNCGKEGHIARNCRAPRKKGCWKCGQEGHQMKDCRNGKQFFRQILASGGHEARQLC 452

Query: 116 GQTA 119
            +T+
Sbjct: 453 AETS 456



 Score = 33.9 bits (76), Expect = 0.36,   Method: Composition-based stats.
 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           TI+C+NC   GH   +C  P+K             C+KC   GH  ++C
Sbjct: 393 TIKCFNCGKEGHIARNCRAPRKKG-----------CWKCGQEGHQMKDC 430


>sp|Q9Q720|POL_HV1V9 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype H (isolate VI991) GN=gag-pol PE=3 SV=3
          Length = 1436

 Score = 51.6 bits (122), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 19/45 (42%), Positives = 25/45 (55%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTA 119
           + ++ C NC   GH ARNC     K C+ C Q GH  K+C G+ A
Sbjct: 390 RRTVKCSNCGKEGHIARNCRAPRKKGCWKCGQEGHQMKDCTGRQA 434



 Score = 41.6 bits (96), Expect = 0.002,   Method: Composition-based stats.
 Identities = 19/70 (27%), Positives = 30/70 (42%), Gaps = 2/70 (2%)

Query: 56  SVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSK--RCYACHQAGHMAKE 113
           +V C NC  +GH+A++C       C+ C   GH  ++C    +   R     Q G   + 
Sbjct: 392 TVKCSNCGKEGHIARNCRAPRKKGCWKCGQEGHQMKDCTGRQANFFRENLAFQQGKAREF 451

Query: 114 CPGQTAGKSP 123
            P +    SP
Sbjct: 452 PPEEARANSP 461



 Score = 41.6 bits (96), Expect = 0.002,   Method: Composition-based stats.
 Identities = 17/44 (38%), Positives = 22/44 (50%), Gaps = 1/44 (2%)

Query: 31  RGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCT 73
           +G +  + C  C   GH AR C A     C+ C  +GH  KDCT
Sbjct: 387 KGPRRTVKCSNCGKEGHIARNCRAPRKKGCWKCGQEGHQMKDCT 430



 Score = 32.3 bits (72), Expect = 1.1,   Method: Composition-based stats.
 Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C  C + GH+A+ C      ++P         C+ CG +GH   DC
Sbjct: 394 KCSNCGKEGHIARNC------RAPR-----KKGCWKCGQEGHQMKDC 429



 Score = 31.6 bits (70), Expect = 2.1,   Method: Composition-based stats.
 Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           T++C NC   GH   +C  P+K             C+KC   GH  ++C
Sbjct: 392 TVKCSNCGKEGHIARNCRAPRKKG-----------CWKCGQEGHQMKDC 429


>sp|Q9Q721|GAG_HV1V9 Gag polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype H (isolate VI991) GN=gag PE=3 SV=3
          Length = 498

 Score = 51.6 bits (122), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 19/45 (42%), Positives = 25/45 (55%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTA 119
           + ++ C NC   GH ARNC     K C+ C Q GH  K+C G+ A
Sbjct: 390 RRTVKCSNCGKEGHIARNCRAPRKKGCWKCGQEGHQMKDCTGRQA 434



 Score = 41.6 bits (96), Expect = 0.002,   Method: Composition-based stats.
 Identities = 17/44 (38%), Positives = 22/44 (50%), Gaps = 1/44 (2%)

Query: 31  RGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCT 73
           +G +  + C  C   GH AR C A     C+ C  +GH  KDCT
Sbjct: 387 KGPRRTVKCSNCGKEGHIARNCRAPRKKGCWKCGQEGHQMKDCT 430



 Score = 41.2 bits (95), Expect = 0.002,   Method: Composition-based stats.
 Identities = 13/38 (34%), Positives = 21/38 (55%)

Query: 56  SVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNC 93
           +V C NC  +GH+A++C       C+ C   GH  ++C
Sbjct: 392 TVKCSNCGKEGHIARNCRAPRKKGCWKCGQEGHQMKDC 429



 Score = 32.3 bits (72), Expect = 1.1,   Method: Composition-based stats.
 Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C  C + GH+A+ C      ++P         C+ CG +GH   DC
Sbjct: 394 KCSNCGKEGHIARNC------RAPR-----KKGCWKCGQEGHQMKDC 429



 Score = 31.6 bits (70), Expect = 2.1,   Method: Composition-based stats.
 Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           T++C NC   GH   +C  P+K             C+KC   GH  ++C
Sbjct: 392 TVKCSNCGKEGHIARNCRAPRKKG-----------CWKCGQEGHQMKDC 429


>sp|O12157|GAG_HV192 Gag polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype C (isolate 92BR025) GN=gag PE=3 SV=3
          Length = 496

 Score = 51.6 bits (122), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 28/82 (34%), Positives = 39/82 (47%), Gaps = 2/82 (2%)

Query: 39  CYKCNNYGHFARECATESVTCYNCSGQGHVAKDCT-VKSSIICYNCNSSGHFARNCPNDS 97
           C      GH AR  A E+++  N +       +C   K +I C+NC   GH ARNC    
Sbjct: 348 CQGVGGPGHKARVLA-EAMSKVNNTNIMMQRSNCKGPKRTIKCFNCGKEGHLARNCRAPR 406

Query: 98  SKRCYACHQAGHMAKECPGQTA 119
            K C+ C + GH  K+C  + A
Sbjct: 407 KKGCWKCGKEGHQVKDCTERQA 428



 Score = 46.6 bits (109), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 19/51 (37%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 28  ADARGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCTVKSS 77
           ++ +G K  I C+ C   GH AR C A     C+ C  +GH  KDCT + +
Sbjct: 378 SNCKGPKRTIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQVKDCTERQA 428



 Score = 35.0 bits (79), Expect = 0.19,   Method: Composition-based stats.
 Identities = 24/90 (26%), Positives = 39/90 (43%), Gaps = 16/90 (17%)

Query: 59  CYNCSGQGHVAKDCTVKSSIICYNCNSSGHFAR-NCPN-DSSKRCYACHQAGHMAKECPG 116
           C    G GH A+   V +  +    N++    R NC     + +C+ C + GH+A+ C  
Sbjct: 348 CQGVGGPGHKAR---VLAEAMSKVNNTNIMMQRSNCKGPKRTIKCFNCGKEGHLARNC-- 402

Query: 117 QTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
               ++P         C+ CG +GH   DC
Sbjct: 403 ----RAPR-----KKGCWKCGKEGHQVKDC 423



 Score = 33.9 bits (76), Expect = 0.34,   Method: Composition-based stats.
 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           TI+C+NC   GH   +C  P+K             C+KC   GH  ++C
Sbjct: 386 TIKCFNCGKEGHLARNCRAPRKKG-----------CWKCGKEGHQVKDC 423


>sp|Q9IDV9|POL_HV1YB Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group N
           (isolate YBF106) GN=gag-pol PE=3 SV=3
          Length = 1449

 Score = 51.2 bits (121), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 17/44 (38%), Positives = 25/44 (56%)

Query: 74  VKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQ 117
           ++ +I C+NC   GH ARNC     + C+ C Q GH  K+C  +
Sbjct: 388 IRKTIKCFNCGKEGHLARNCKAPRRRGCWKCGQEGHQMKDCKNE 431



 Score = 44.7 bits (104), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 13/45 (28%), Positives = 26/45 (57%)

Query: 55  ESVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSK 99
           +++ C+NC  +GH+A++C       C+ C   GH  ++C N+  +
Sbjct: 390 KTIKCFNCGKEGHLARNCKAPRRRGCWKCGQEGHQMKDCKNEGXQ 434



 Score = 41.6 bits (96), Expect = 0.002,   Method: Composition-based stats.
 Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 1/43 (2%)

Query: 31  RGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDC 72
           +G +  I C+ C   GH AR C A     C+ C  +GH  KDC
Sbjct: 386 KGIRKTIKCFNCGKEGHLARNCKAPRRRGCWKCGQEGHQMKDC 428



 Score = 37.4 bits (85), Expect = 0.033,   Method: Composition-based stats.
 Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 11/48 (22%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
           +C+ C + GH+A+ C      K+P         C+ CG +GH   DCK
Sbjct: 393 KCFNCGKEGHLARNC------KAPR-----RRGCWKCGQEGHQMKDCK 429



 Score = 35.4 bits (80), Expect = 0.12,   Method: Composition-based stats.
 Identities = 17/50 (34%), Positives = 24/50 (48%), Gaps = 9/50 (18%)

Query: 6   TIQCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECATE 55
           TI+C+NC   GH   +C    +   RG      C+KC   GH  ++C  E
Sbjct: 391 TIKCFNCGKEGHLARNC---KAPRRRG------CWKCGQEGHQMKDCKNE 431


>sp|Q1A268|GAG_SIVMB Gag polyprotein OS=Simian immunodeficiency virus (isolate MB66)
           GN=gag PE=3 SV=3
          Length = 499

 Score = 51.2 bits (121), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 23/69 (33%), Positives = 34/69 (49%), Gaps = 2/69 (2%)

Query: 57  VTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPG 116
           V C+NC  +GH+A++C       C+ C   GH  RNC N+  ++     +    +K  PG
Sbjct: 385 VKCFNCGKEGHIARNCKAPRRKGCWKCGQEGHQMRNCTNE--RQANFLGKLWPSSKGRPG 442

Query: 117 QTAGKSPEP 125
               K PEP
Sbjct: 443 NFLQKRPEP 451



 Score = 40.0 bits (92), Expect = 0.005,   Method: Composition-based stats.
 Identities = 16/44 (36%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 31  RGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCT 73
           +G K  + C+ C   GH AR C A     C+ C  +GH  ++CT
Sbjct: 379 KGPKRIVKCFNCGKEGHIARNCKAPRRKGCWKCGQEGHQMRNCT 422



 Score = 33.1 bits (74), Expect = 0.73,   Method: Composition-based stats.
 Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C+ C + GH+A+ C      K+P         C+ CG +GH   +C
Sbjct: 386 KCFNCGKEGHIARNC------KAPR-----RKGCWKCGQEGHQMRNC 421



 Score = 32.7 bits (73), Expect = 0.77,   Method: Composition-based stats.
 Identities = 15/51 (29%), Positives = 22/51 (43%), Gaps = 13/51 (25%)

Query: 7   IQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFARECATE 55
           ++C+NC   GH   +C  P++             C+KC   GH  R C  E
Sbjct: 385 VKCFNCGKEGHIARNCKAPRRKG-----------CWKCGQEGHQMRNCTNE 424


>sp|Q74230|GAG_HV2EH Gag polyprotein OS=Human immunodeficiency virus type 2 subtype B
           (isolate EHO) GN=gag PE=3 SV=3
          Length = 519

 Score = 51.2 bits (121), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 18/46 (39%), Positives = 27/46 (58%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTAG 120
           K ++ C+NC  +GH AR C     + C+ C Q GH+  +CP + AG
Sbjct: 384 KRTVTCWNCGKAGHTARQCKAPRRQGCWKCGQQGHIMSKCPERQAG 429



 Score = 43.5 bits (101), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 15/39 (38%), Positives = 19/39 (48%)

Query: 56  SVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCP 94
           +VTC+NC   GH A+ C       C+ C   GH    CP
Sbjct: 386 TVTCWNCGKAGHTARQCKAPRRQGCWKCGQQGHIMSKCP 424



 Score = 42.4 bits (98), Expect = 0.001,   Method: Composition-based stats.
 Identities = 17/48 (35%), Positives = 24/48 (50%), Gaps = 1/48 (2%)

Query: 26  SSADARGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDC 72
           ++A  R  K  + C+ C   GH AR+C A     C+ C  QGH+   C
Sbjct: 376 AAAQPRAGKRTVTCWNCGKAGHTARQCKAPRRQGCWKCGQQGHIMSKC 423



 Score = 35.4 bits (80), Expect = 0.13,   Method: Composition-based stats.
 Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 13/64 (20%)

Query: 85  SSGHFARNCPNDSSKR--CYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHL 142
           S+  FA   P    +   C+ C +AGH A++C      K+P         C+ CG QGH+
Sbjct: 371 STNPFAAAQPRAGKRTVTCWNCGKAGHTARQC------KAPR-----RQGCWKCGQQGHI 419

Query: 143 SYDC 146
              C
Sbjct: 420 MSKC 423



 Score = 30.4 bits (67), Expect = 4.4,   Method: Composition-based stats.
 Identities = 14/49 (28%), Positives = 20/49 (40%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           T+ C+NC   GH    C  P++             C+KC   GH   +C
Sbjct: 386 TVTCWNCGKAGHTARQCKAPRRQG-----------CWKCGQQGHIMSKC 423


>sp|P05892|GAG_SIVVT Gag polyprotein OS=Simian immunodeficiency virus agm.vervet
           (isolate AGM TYO-1) GN=gag PE=3 SV=1
          Length = 519

 Score = 51.2 bits (121), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 18/38 (47%), Positives = 22/38 (57%)

Query: 80  CYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQ 117
           CYNC   GH  R CP     +C  C + GH+AK+C GQ
Sbjct: 399 CYNCGKFGHMQRQCPEPRKTKCLKCGKLGHLAKDCRGQ 436



 Score = 43.1 bits (100), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 17/46 (36%), Positives = 22/46 (47%), Gaps = 9/46 (19%)

Query: 7   IQCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           ++CYNC  FGH Q  CP+              C KC   GH A++C
Sbjct: 397 LRCYNCGKFGHMQRQCPEPRKTK---------CLKCGKLGHLAKDC 433



 Score = 40.8 bits (94), Expect = 0.003,   Method: Composition-based stats.
 Identities = 19/48 (39%), Positives = 24/48 (50%), Gaps = 11/48 (22%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
           RCY C + GHM ++C        PEP       C  CG  GHL+ DC+
Sbjct: 398 RCYNCGKFGHMQRQC--------PEP---RKTKCLKCGKLGHLAKDCR 434


>sp|Q77372|GAG_HV1AN Gag polyprotein OS=Human immunodeficiency virus type 1 group O
           (isolate ANT70) GN=gag PE=3 SV=3
          Length = 498

 Score = 51.2 bits (121), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 19/40 (47%), Positives = 23/40 (57%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKEC 114
           K +I C+NC   GH ARNC     K C+ C Q GH  K+C
Sbjct: 391 KGTIKCFNCGKEGHIARNCRAPRKKGCWKCGQEGHQMKDC 430



 Score = 43.5 bits (101), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 13/40 (32%), Positives = 23/40 (57%)

Query: 56  SVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPN 95
           ++ C+NC  +GH+A++C       C+ C   GH  ++C N
Sbjct: 393 TIKCFNCGKEGHIARNCRAPRKKGCWKCGQEGHQMKDCRN 432



 Score = 33.5 bits (75), Expect = 0.52,   Method: Composition-based stats.
 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           TI+C+NC   GH   +C  P+K             C+KC   GH  ++C
Sbjct: 393 TIKCFNCGKEGHIARNCRAPRKKG-----------CWKCGQEGHQMKDC 430


>sp|Q9IDV8|GAG_HV1YB Gag polyprotein OS=Human immunodeficiency virus type 1 group N
           (isolate YBF106) GN=gag PE=3 SV=3
          Length = 511

 Score = 50.4 bits (119), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 17/44 (38%), Positives = 25/44 (56%)

Query: 74  VKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQ 117
           ++ +I C+NC   GH ARNC     + C+ C Q GH  K+C  +
Sbjct: 388 IRKTIKCFNCGKEGHLARNCKAPRRRGCWKCGQEGHQMKDCKNE 431



 Score = 44.3 bits (103), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 13/45 (28%), Positives = 26/45 (57%)

Query: 55  ESVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSK 99
           +++ C+NC  +GH+A++C       C+ C   GH  ++C N+  +
Sbjct: 390 KTIKCFNCGKEGHLARNCKAPRRRGCWKCGQEGHQMKDCKNEGXQ 434



 Score = 40.8 bits (94), Expect = 0.003,   Method: Composition-based stats.
 Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 1/43 (2%)

Query: 31  RGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDC 72
           +G +  I C+ C   GH AR C A     C+ C  +GH  KDC
Sbjct: 386 KGIRKTIKCFNCGKEGHLARNCKAPRRRGCWKCGQEGHQMKDC 428



 Score = 36.6 bits (83), Expect = 0.056,   Method: Composition-based stats.
 Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 11/48 (22%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
           +C+ C + GH+A+ C      K+P         C+ CG +GH   DCK
Sbjct: 393 KCFNCGKEGHLARNC------KAPR-----RRGCWKCGQEGHQMKDCK 429



 Score = 35.0 bits (79), Expect = 0.19,   Method: Composition-based stats.
 Identities = 17/50 (34%), Positives = 24/50 (48%), Gaps = 9/50 (18%)

Query: 6   TIQCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECATE 55
           TI+C+NC   GH   +C    +   RG      C+KC   GH  ++C  E
Sbjct: 391 TIKCFNCGKEGHLARNC---KAPRRRG------CWKCGQEGHQMKDCKNE 431


>sp|Q75002|POL_HV1ET Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype C (isolate ETH2220) GN=gag-pol PE=3 SV=3
          Length = 1439

 Score = 50.4 bits (119), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 19/45 (42%), Positives = 25/45 (55%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTA 119
           K +I C+NC   GH ARNC     K C+ C + GH  K+C  + A
Sbjct: 384 KRAIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDCTERQA 428



 Score = 47.0 bits (110), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 24  QKSSADARGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCTVKSS 77
           QKS  + +G K  I C+ C   GH AR C A     C+ C  +GH  KDCT + +
Sbjct: 376 QKS--NFKGPKRAIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDCTERQA 428



 Score = 44.3 bits (103), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 22/71 (30%), Positives = 34/71 (47%), Gaps = 4/71 (5%)

Query: 56  SVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSK--RCYACHQAGHMAKE 113
           ++ C+NC  +GH+A++C       C+ C   GH  ++C    +   R     Q G  A+E
Sbjct: 386 AIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDCTERQANFFRETLAFQQGK-ARE 444

Query: 114 CPG-QTAGKSP 123
            P  QT   SP
Sbjct: 445 FPSEQTRANSP 455



 Score = 34.3 bits (77), Expect = 0.31,   Method: Composition-based stats.
 Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C+ C + GH+A+ C      ++P         C+ CG +GH   DC
Sbjct: 388 KCFNCGKEGHLARNC------RAPR-----KKGCWKCGKEGHQMKDC 423



 Score = 32.0 bits (71), Expect = 1.3,   Method: Composition-based stats.
 Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 13/48 (27%)

Query: 7   IQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           I+C+NC   GH   +C  P+K             C+KC   GH  ++C
Sbjct: 387 IKCFNCGKEGHLARNCRAPRKKG-----------CWKCGKEGHQMKDC 423


>sp|Q75001|GAG_HV1ET Gag polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype C (isolate ETH2220) GN=gag PE=3 SV=3
          Length = 504

 Score = 50.4 bits (119), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 19/45 (42%), Positives = 25/45 (55%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTA 119
           K +I C+NC   GH ARNC     K C+ C + GH  K+C  + A
Sbjct: 384 KRAIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDCTERQA 428



 Score = 47.0 bits (110), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 24  QKSSADARGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCTVKSS 77
           QKS  + +G K  I C+ C   GH AR C A     C+ C  +GH  KDCT + +
Sbjct: 376 QKS--NFKGPKRAIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDCTERQA 428



 Score = 42.0 bits (97), Expect = 0.001,   Method: Composition-based stats.
 Identities = 18/70 (25%), Positives = 32/70 (45%), Gaps = 3/70 (4%)

Query: 56  SVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECP 115
           ++ C+NC  +GH+A++C       C+ C   GH  ++C   + ++     +     K  P
Sbjct: 386 AIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDC---TERQANFLGRLWPSNKGRP 442

Query: 116 GQTAGKSPEP 125
           G      PEP
Sbjct: 443 GNFLQSRPEP 452



 Score = 34.3 bits (77), Expect = 0.31,   Method: Composition-based stats.
 Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C+ C + GH+A+ C      ++P         C+ CG +GH   DC
Sbjct: 388 KCFNCGKEGHLARNC------RAPR-----KKGCWKCGKEGHQMKDC 423



 Score = 32.0 bits (71), Expect = 1.3,   Method: Composition-based stats.
 Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 13/48 (27%)

Query: 7   IQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           I+C+NC   GH   +C  P+K             C+KC   GH  ++C
Sbjct: 387 IKCFNCGKEGHLARNCRAPRKKG-----------CWKCGKEGHQMKDC 423


>sp|P27972|GAG_SIVV1 Gag polyprotein OS=Simian immunodeficiency virus agm.vervet
           (isolate AGM155) GN=gag PE=3 SV=1
          Length = 520

 Score = 50.1 bits (118), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 18/38 (47%), Positives = 22/38 (57%)

Query: 80  CYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQ 117
           CYNC   GH  R CP     +C  C + GH+AK+C GQ
Sbjct: 400 CYNCGKFGHMQRQCPEPRKIKCLKCGKPGHLAKDCRGQ 437



 Score = 42.4 bits (98), Expect = 0.001,   Method: Composition-based stats.
 Identities = 18/45 (40%), Positives = 22/45 (48%), Gaps = 9/45 (20%)

Query: 8   QCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           +CYNC  FGH Q  CP+            I C KC   GH A++C
Sbjct: 399 KCYNCGKFGHMQRQCPEPRK---------IKCLKCGKPGHLAKDC 434



 Score = 42.4 bits (98), Expect = 0.001,   Method: Composition-based stats.
 Identities = 16/44 (36%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 39  CYKCNNYGHFARECAT-ESVTCYNCSGQGHVAKDCTVKSSIICY 81
           CY C  +GH  R+C     + C  C   GH+AKDC  + + + Y
Sbjct: 400 CYNCGKFGHMQRQCPEPRKIKCLKCGKPGHLAKDCRGQVNFLGY 443



 Score = 40.4 bits (93), Expect = 0.004,   Method: Composition-based stats.
 Identities = 18/48 (37%), Positives = 25/48 (52%), Gaps = 11/48 (22%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
           +CY C + GHM ++C        PEP     + C  CG  GHL+ DC+
Sbjct: 399 KCYNCGKFGHMQRQC--------PEP---RKIKCLKCGKPGHLAKDCR 435



 Score = 40.0 bits (92), Expect = 0.006,   Method: Composition-based stats.
 Identities = 14/35 (40%), Positives = 18/35 (51%)

Query: 59  CYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNC 93
           CYNC   GH+ + C     I C  C   GH A++C
Sbjct: 400 CYNCGKFGHMQRQCPEPRKIKCLKCGKPGHLAKDC 434


>sp|P20876|POL_HV2ST Gag-Pol polyprotein OS=Human immunodeficiency virus type 2 subtype
           A (isolate ST) GN=gag-pol PE=3 SV=3
          Length = 1463

 Score = 50.1 bits (118), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 18/46 (39%), Positives = 27/46 (58%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTAG 120
           + +I C+NC   GH AR C     + C+ C +AGH+  +CP + AG
Sbjct: 386 RRTIKCWNCGKEGHSARQCRAPRRQGCWKCGKAGHIMAKCPERQAG 431



 Score = 48.5 bits (114), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 21/68 (30%), Positives = 31/68 (45%), Gaps = 4/68 (5%)

Query: 56  SVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECP 115
           ++ C+NC  +GH A+ C       C+ C  +GH    CP    +R     + G M KE P
Sbjct: 388 TIKCWNCGKEGHSARQCRAPRRQGCWKCGKAGHIMAKCP----ERQAGFLRVGPMGKEAP 443

Query: 116 GQTAGKSP 123
               G +P
Sbjct: 444 QFPCGPNP 451



 Score = 31.6 bits (70), Expect = 1.7,   Method: Composition-based stats.
 Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           TI+C+NC   GH    C  P++             C+KC   GH   +C
Sbjct: 388 TIKCWNCGKEGHSARQCRAPRRQG-----------CWKCGKAGHIMAKC 425


>sp|P20875|POL_HV1JR Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype B (isolate JRCSF) GN=gag-pol PE=1 SV=3
          Length = 1439

 Score = 49.7 bits (117), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 18/45 (40%), Positives = 25/45 (55%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTA 119
           + ++ C+NC   GH ARNC     K C+ C + GH  KEC  + A
Sbjct: 387 RKNVKCFNCGKEGHIARNCRAPRKKGCWKCGKEGHQMKECTERQA 431



 Score = 42.4 bits (98), Expect = 0.001,   Method: Composition-based stats.
 Identities = 13/39 (33%), Positives = 22/39 (56%)

Query: 55  ESVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNC 93
           ++V C+NC  +GH+A++C       C+ C   GH  + C
Sbjct: 388 KNVKCFNCGKEGHIARNCRAPRKKGCWKCGKEGHQMKEC 426



 Score = 42.0 bits (97), Expect = 0.001,   Method: Composition-based stats.
 Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 1/48 (2%)

Query: 31  RGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCTVKSS 77
           R  +  + C+ C   GH AR C A     C+ C  +GH  K+CT + +
Sbjct: 384 RNQRKNVKCFNCGKEGHIARNCRAPRKKGCWKCGKEGHQMKECTERQA 431



 Score = 32.7 bits (73), Expect = 0.81,   Method: Composition-based stats.
 Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 13/48 (27%)

Query: 7   IQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           ++C+NC   GH   +C  P+K             C+KC   GH  +EC
Sbjct: 390 VKCFNCGKEGHIARNCRAPRKKG-----------CWKCGKEGHQMKEC 426



 Score = 32.0 bits (71), Expect = 1.4,   Method: Composition-based stats.
 Identities = 13/47 (27%), Positives = 23/47 (48%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C+ C + GH+A+ C      ++P         C+ CG +GH   +C
Sbjct: 391 KCFNCGKEGHIARNC------RAPR-----KKGCWKCGKEGHQMKEC 426


>sp|P34689|GLH1_CAEEL ATP-dependent RNA helicase glh-1 OS=Caenorhabditis elegans GN=glh-1
           PE=1 SV=3
          Length = 763

 Score = 49.7 bits (117), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 48/131 (36%), Gaps = 45/131 (34%)

Query: 9   CYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECATES------------ 56
           C+NC   GH    CP+      R ++   VCY C   GH +REC  E             
Sbjct: 160 CFNCQQPGHRSSDCPE-----PRKEREPRVCYNCQQPGHTSRECTEERKPREGRTGGFGG 214

Query: 57  ---------------------------VTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHF 89
                                      + C+NC G+GH + +C  +    C+NC   GH 
Sbjct: 215 GAGFGNNGGNDGFGGDGGFGGGEERGPMKCFNCKGEGHRSAECP-EPPRGCFNCGEQGHR 273

Query: 90  ARNCPNDSSKR 100
           +  CPN +  R
Sbjct: 274 SNECPNPAKPR 284



 Score = 42.0 bits (97), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/120 (23%), Positives = 43/120 (35%), Gaps = 44/120 (36%)

Query: 39  CYKCNNYGHFARECAT-----ESVTCYNCSGQGHVAKDCT-------------------- 73
           C+ C   GH + +C       E   CYNC   GH +++CT                    
Sbjct: 160 CFNCQQPGHRSSDCPEPRKEREPRVCYNCQQPGHTSRECTEERKPREGRTGGFGGGAGFG 219

Query: 74  ------------------VKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECP 115
                              +  + C+NC   GH +  CP +  + C+ C + GH + ECP
Sbjct: 220 NNGGNDGFGGDGGFGGGEERGPMKCFNCKGEGHRSAECP-EPPRGCFNCGEQGHRSNECP 278



 Score = 31.6 bits (70), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 24/51 (47%), Gaps = 7/51 (13%)

Query: 101 CYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCKLVQK 151
           C+ C Q GH + +CP     K  EP V     CY C   GH S +C   +K
Sbjct: 160 CFNCQQPGHRSSDCP--EPRKEREPRV-----CYNCQQPGHTSRECTEERK 203



 Score = 30.0 bits (66), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 19/72 (26%), Positives = 30/72 (41%), Gaps = 10/72 (13%)

Query: 7   IQCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECATESVTCYNCSGQG 66
           ++C+NC   GH    CP+      RG      C+ C   GH + EC   +       G+G
Sbjct: 242 MKCFNCKGEGHRSAECPEPP----RG------CFNCGEQGHRSNECPNPAKPREGVEGEG 291

Query: 67  HVAKDCTVKSSI 78
             A    V+ ++
Sbjct: 292 PKATYVPVEDNM 303


>sp|P20873|GAG_HV1JR Gag polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype B (isolate JRCSF) GN=gag PE=3 SV=3
          Length = 504

 Score = 49.7 bits (117), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 18/45 (40%), Positives = 25/45 (55%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTA 119
           + ++ C+NC   GH ARNC     K C+ C + GH  KEC  + A
Sbjct: 387 RKNVKCFNCGKEGHIARNCRAPRKKGCWKCGKEGHQMKECTERQA 431



 Score = 42.7 bits (99), Expect = 0.001,   Method: Composition-based stats.
 Identities = 19/71 (26%), Positives = 32/71 (45%), Gaps = 3/71 (4%)

Query: 55  ESVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKEC 114
           ++V C+NC  +GH+A++C       C+ C   GH  + C   + ++     +     K  
Sbjct: 388 KNVKCFNCGKEGHIARNCRAPRKKGCWKCGKEGHQMKEC---TERQANFLGKIWPSYKGR 444

Query: 115 PGQTAGKSPEP 125
           PG      PEP
Sbjct: 445 PGNFLQSRPEP 455



 Score = 42.0 bits (97), Expect = 0.001,   Method: Composition-based stats.
 Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 1/48 (2%)

Query: 31  RGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCTVKSS 77
           R  +  + C+ C   GH AR C A     C+ C  +GH  K+CT + +
Sbjct: 384 RNQRKNVKCFNCGKEGHIARNCRAPRKKGCWKCGKEGHQMKECTERQA 431



 Score = 32.7 bits (73), Expect = 0.87,   Method: Composition-based stats.
 Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 13/48 (27%)

Query: 7   IQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           ++C+NC   GH   +C  P+K             C+KC   GH  +EC
Sbjct: 390 VKCFNCGKEGHIARNCRAPRKKG-----------CWKCGKEGHQMKEC 426



 Score = 32.0 bits (71), Expect = 1.5,   Method: Composition-based stats.
 Identities = 13/47 (27%), Positives = 23/47 (48%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C+ C + GH+A+ C      ++P         C+ CG +GH   +C
Sbjct: 391 KCFNCGKEGHIARNC------RAPR-----KKGCWKCGKEGHQMKEC 426


>sp|P12451|POL_HV2SB Gag-Pol polyprotein OS=Human immunodeficiency virus type 2 subtype
           A (isolate SBLISY) GN=gag-pol PE=3 SV=3
          Length = 1462

 Score = 49.7 bits (117), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 31/68 (45%), Gaps = 4/68 (5%)

Query: 48  FARECATESVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSKRCYACHQA 107
           FA      ++ C+NC  +GH A+ C       C+ C  SGH   NCP+    R     +A
Sbjct: 379 FAAAQQKRAIKCWNCGKEGHSARQCRAPRRQGCWKCGKSGHIMANCPD----RQAGFLRA 434

Query: 108 GHMAKECP 115
             M KE P
Sbjct: 435 WTMGKEAP 442



 Score = 49.7 bits (117), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 18/46 (39%), Positives = 26/46 (56%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTAG 120
           K +I C+NC   GH AR C     + C+ C ++GH+   CP + AG
Sbjct: 385 KRAIKCWNCGKEGHSARQCRAPRRQGCWKCGKSGHIMANCPDRQAG 430



 Score = 38.1 bits (87), Expect = 0.021,   Method: Composition-based stats.
 Identities = 16/44 (36%), Positives = 21/44 (47%), Gaps = 1/44 (2%)

Query: 30  ARGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDC 72
           A   K  I C+ C   GH AR+C A     C+ C   GH+  +C
Sbjct: 381 AAQQKRAIKCWNCGKEGHSARQCRAPRRQGCWKCGKSGHIMANC 424



 Score = 32.0 bits (71), Expect = 1.4,   Method: Composition-based stats.
 Identities = 13/47 (27%), Positives = 23/47 (48%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C+ C + GH A++C      ++P         C+ CG  GH+  +C
Sbjct: 389 KCWNCGKEGHSARQC------RAPR-----RQGCWKCGKSGHIMANC 424


>sp|O41798|POL_HV19N Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype G (isolate 92NG083) GN=gag-pol PE=3 SV=3
          Length = 1435

 Score = 49.7 bits (117), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 19/42 (45%), Positives = 23/42 (54%)

Query: 78  IICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTA 119
           I C+NC   GH ARNC     K C+ C + GH  KEC  + A
Sbjct: 391 IKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKECTERQA 432



 Score = 45.1 bits (105), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 29/69 (42%), Gaps = 2/69 (2%)

Query: 57  VTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSK--RCYACHQAGHMAKEC 114
           + C+NC  +GH+A++C       C+ C   GH  + C    +   R     Q G   K  
Sbjct: 391 IKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKECTERQANFLRENLAFQQGEARKLS 450

Query: 115 PGQTAGKSP 123
           P Q    SP
Sbjct: 451 PEQDRANSP 459



 Score = 42.0 bits (97), Expect = 0.001,   Method: Composition-based stats.
 Identities = 17/51 (33%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 28  ADARGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCTVKSS 77
           ++ +G +  I C+ C   GH AR C A     C+ C  +GH  K+CT + +
Sbjct: 382 SNFKGPRRIIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKECTERQA 432



 Score = 33.5 bits (75), Expect = 0.49,   Method: Composition-based stats.
 Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 13/48 (27%)

Query: 7   IQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           I+C+NC   GH   +C  P+K             C+KC   GH  +EC
Sbjct: 391 IKCFNCGKEGHLARNCRAPRKKG-----------CWKCGKEGHQMKEC 427



 Score = 32.3 bits (72), Expect = 1.1,   Method: Composition-based stats.
 Identities = 13/47 (27%), Positives = 23/47 (48%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C+ C + GH+A+ C      ++P         C+ CG +GH   +C
Sbjct: 392 KCFNCGKEGHLARNC------RAPR-----KKGCWKCGKEGHQMKEC 427


>sp|P0C1K7|GAG_HV19N Gag polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype G (isolate 92NG083) GN=gag PE=3 SV=2
          Length = 497

 Score = 49.7 bits (117), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 19/42 (45%), Positives = 23/42 (54%)

Query: 78  IICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTA 119
           I C+NC   GH ARNC     K C+ C + GH  KEC  + A
Sbjct: 391 IKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKECTERQA 432



 Score = 42.4 bits (98), Expect = 0.001,   Method: Composition-based stats.
 Identities = 22/71 (30%), Positives = 34/71 (47%), Gaps = 6/71 (8%)

Query: 28  ADARGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCTVKSS-----IICY 81
           ++ +G +  I C+ C   GH AR C A     C+ C  +GH  K+CT + +     I   
Sbjct: 382 SNFKGPRRIIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKECTERQANFLGKIWPS 441

Query: 82  NCNSSGHFARN 92
           N    G+F +N
Sbjct: 442 NKGRPGNFLQN 452



 Score = 40.4 bits (93), Expect = 0.004,   Method: Composition-based stats.
 Identities = 12/37 (32%), Positives = 20/37 (54%)

Query: 57  VTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNC 93
           + C+NC  +GH+A++C       C+ C   GH  + C
Sbjct: 391 IKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKEC 427



 Score = 33.5 bits (75), Expect = 0.49,   Method: Composition-based stats.
 Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 13/48 (27%)

Query: 7   IQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           I+C+NC   GH   +C  P+K             C+KC   GH  +EC
Sbjct: 391 IKCFNCGKEGHLARNCRAPRKKG-----------CWKCGKEGHQMKEC 427



 Score = 32.3 bits (72), Expect = 1.1,   Method: Composition-based stats.
 Identities = 13/47 (27%), Positives = 23/47 (48%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C+ C + GH+A+ C      ++P         C+ CG +GH   +C
Sbjct: 392 KCFNCGKEGHLARNC------RAPR-----KKGCWKCGKEGHQMKEC 427


>sp|P15833|POL_HV2D2 Gag-Pol polyprotein OS=Human immunodeficiency virus type 2 subtype
           B (isolate D205) GN=gag-pol PE=3 SV=3
          Length = 1465

 Score = 49.7 bits (117), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 22/68 (32%), Positives = 30/68 (44%), Gaps = 9/68 (13%)

Query: 56  SVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECP 115
           +VTC+NC  QGH A+ C       C+ C  +GH    CP           QAG +     
Sbjct: 387 TVTCWNCGKQGHTARQCRAPRRQGCWKCGKTGHIMSKCPE---------RQAGFLRVRTL 437

Query: 116 GQTAGKSP 123
           G+ A + P
Sbjct: 438 GKEASQLP 445



 Score = 48.9 bits (115), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 16/46 (34%), Positives = 26/46 (56%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTAG 120
           + ++ C+NC   GH AR C     + C+ C + GH+  +CP + AG
Sbjct: 385 RGTVTCWNCGKQGHTARQCRAPRRQGCWKCGKTGHIMSKCPERQAG 430



 Score = 38.1 bits (87), Expect = 0.018,   Method: Composition-based stats.
 Identities = 19/68 (27%), Positives = 29/68 (42%), Gaps = 2/68 (2%)

Query: 32  GDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFA 90
           G +  + C+ C   GH AR+C A     C+ C   GH+   C  + +       + G  A
Sbjct: 383 GKRGTVTCWNCGKQGHTARQCRAPRRQGCWKCGKTGHIMSKCPERQAGF-LRVRTLGKEA 441

Query: 91  RNCPNDSS 98
              P+D S
Sbjct: 442 SQLPHDPS 449



 Score = 30.8 bits (68), Expect = 3.6,   Method: Composition-based stats.
 Identities = 14/49 (28%), Positives = 20/49 (40%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           T+ C+NC   GH    C  P++             C+KC   GH   +C
Sbjct: 387 TVTCWNCGKQGHTARQCRAPRRQG-----------CWKCGKTGHIMSKC 424


>sp|O89940|POL_HV1SE Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype G (isolate SE6165) GN=gag-pol PE=3 SV=3
          Length = 1433

 Score = 49.3 bits (116), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 18/45 (40%), Positives = 25/45 (55%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTA 119
           + +I C+NC   GH ARNC     K C+ C + GH  K+C  + A
Sbjct: 388 RRTIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDCTERQA 432



 Score = 44.7 bits (104), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 28  ADARGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCTVKSS 77
           ++ +G +  I C+ C   GH AR C A     C+ C  +GH  KDCT + +
Sbjct: 382 SNFKGPRRTIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDCTERQA 432



 Score = 42.0 bits (97), Expect = 0.002,   Method: Composition-based stats.
 Identities = 12/38 (31%), Positives = 22/38 (57%)

Query: 56  SVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNC 93
           ++ C+NC  +GH+A++C       C+ C   GH  ++C
Sbjct: 390 TIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDC 427



 Score = 34.3 bits (77), Expect = 0.31,   Method: Composition-based stats.
 Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C+ C + GH+A+ C      ++P         C+ CG +GH   DC
Sbjct: 392 KCFNCGKEGHLARNC------RAPR-----KKGCWKCGKEGHQMKDC 427



 Score = 33.9 bits (76), Expect = 0.35,   Method: Composition-based stats.
 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           TI+C+NC   GH   +C  P+K             C+KC   GH  ++C
Sbjct: 390 TIKCFNCGKEGHLARNCRAPRKKG-----------CWKCGKEGHQMKDC 427


>sp|P20874|GAG_HV2ST Gag polyprotein OS=Human immunodeficiency virus type 2 subtype A
           (isolate ST) GN=gag PE=3 SV=3
          Length = 521

 Score = 49.3 bits (116), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 18/46 (39%), Positives = 27/46 (58%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTAG 120
           + +I C+NC   GH AR C     + C+ C +AGH+  +CP + AG
Sbjct: 386 RRTIKCWNCGKEGHSARQCRAPRRQGCWKCGKAGHIMAKCPERQAG 431



 Score = 41.6 bits (96), Expect = 0.002,   Method: Composition-based stats.
 Identities = 13/39 (33%), Positives = 20/39 (51%)

Query: 56  SVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCP 94
           ++ C+NC  +GH A+ C       C+ C  +GH    CP
Sbjct: 388 TIKCWNCGKEGHSARQCRAPRRQGCWKCGKAGHIMAKCP 426



 Score = 35.4 bits (80), Expect = 0.15,   Method: Composition-based stats.
 Identities = 14/37 (37%), Positives = 18/37 (48%), Gaps = 1/37 (2%)

Query: 37  IVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDC 72
           I C+ C   GH AR+C A     C+ C   GH+   C
Sbjct: 389 IKCWNCGKEGHSARQCRAPRRQGCWKCGKAGHIMAKC 425



 Score = 31.2 bits (69), Expect = 2.8,   Method: Composition-based stats.
 Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           TI+C+NC   GH    C  P++             C+KC   GH   +C
Sbjct: 388 TIKCWNCGKEGHSARQCRAPRRQG-----------CWKCGKAGHIMAKC 425


>sp|O89939|GAG_HV1SE Gag polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype G (isolate SE6165) GN=gag PE=3 SV=3
          Length = 495

 Score = 48.9 bits (115), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 18/45 (40%), Positives = 25/45 (55%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTA 119
           + +I C+NC   GH ARNC     K C+ C + GH  K+C  + A
Sbjct: 388 RRTIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDCTERQA 432



 Score = 44.3 bits (103), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 23/71 (32%), Positives = 34/71 (47%), Gaps = 6/71 (8%)

Query: 28  ADARGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDCTVKSS-----IICY 81
           ++ +G +  I C+ C   GH AR C A     C+ C  +GH  KDCT + +     I   
Sbjct: 382 SNFKGPRRTIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDCTERQANFLGKIWPS 441

Query: 82  NCNSSGHFARN 92
           N    G+F +N
Sbjct: 442 NKGRPGNFLQN 452



 Score = 41.6 bits (96), Expect = 0.002,   Method: Composition-based stats.
 Identities = 12/38 (31%), Positives = 22/38 (57%)

Query: 56  SVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNC 93
           ++ C+NC  +GH+A++C       C+ C   GH  ++C
Sbjct: 390 TIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDC 427



 Score = 33.9 bits (76), Expect = 0.41,   Method: Composition-based stats.
 Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C+ C + GH+A+ C      ++P         C+ CG +GH   DC
Sbjct: 392 KCFNCGKEGHLARNC------RAPR-----KKGCWKCGKEGHQMKDC 427



 Score = 33.5 bits (75), Expect = 0.46,   Method: Composition-based stats.
 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 13/49 (26%)

Query: 6   TIQCYNCFDFGHYQYSC--PQKSSADARGDKVGIVCYKCNNYGHFAREC 52
           TI+C+NC   GH   +C  P+K             C+KC   GH  ++C
Sbjct: 390 TIKCFNCGKEGHLARNCRAPRKKG-----------CWKCGKEGHQMKDC 427


>sp|O91080|POL_HV1YF Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group N
           (isolate YBF30) GN=gag-pol PE=3 SV=3
          Length = 1449

 Score = 48.9 bits (115), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 17/44 (38%), Positives = 23/44 (52%)

Query: 74  VKSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQ 117
           ++  I C+NC   GH ARNC       C+ C Q GH  K+C  +
Sbjct: 389 IRKPIKCFNCGKEGHLARNCKAPRRGGCWKCGQEGHQMKDCKNE 432



 Score = 43.5 bits (101), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 13/43 (30%), Positives = 24/43 (55%)

Query: 57  VTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPNDSSK 99
           + C+NC  +GH+A++C       C+ C   GH  ++C N+  +
Sbjct: 393 IKCFNCGKEGHLARNCKAPRRGGCWKCGQEGHQMKDCKNEGRQ 435



 Score = 41.6 bits (96), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/59 (37%), Positives = 26/59 (44%), Gaps = 5/59 (8%)

Query: 19  QYSCPQKSSADARGDKVGI----VCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDC 72
           Q   P  S    RG+  GI     C+ C   GH AR C A     C+ C  +GH  KDC
Sbjct: 371 QVQQPTTSVFAQRGNFKGIRKPIKCFNCGKEGHLARNCKAPRRGGCWKCGQEGHQMKDC 429



 Score = 37.7 bits (86), Expect = 0.024,   Method: Composition-based stats.
 Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 11/48 (22%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDCK 147
           +C+ C + GH+A+ C     G            C+ CG +GH   DCK
Sbjct: 394 KCFNCGKEGHLARNCKAPRRGG-----------CWKCGQEGHQMKDCK 430



 Score = 34.7 bits (78), Expect = 0.22,   Method: Composition-based stats.
 Identities = 17/49 (34%), Positives = 23/49 (46%), Gaps = 9/49 (18%)

Query: 7   IQCYNCFDFGHYQYSCPQKSSADARGDKVGIVCYKCNNYGHFARECATE 55
           I+C+NC   GH   +C     A  RG      C+KC   GH  ++C  E
Sbjct: 393 IKCFNCGKEGHLARNC----KAPRRGG-----CWKCGQEGHQMKDCKNE 432


>sp|P12450|GAG_HV2SB Gag polyprotein OS=Human immunodeficiency virus type 2 subtype A
           (isolate SBLISY) GN=gag PE=3 SV=3
          Length = 520

 Score = 48.9 bits (115), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 18/46 (39%), Positives = 26/46 (56%)

Query: 75  KSSIICYNCNSSGHFARNCPNDSSKRCYACHQAGHMAKECPGQTAG 120
           K +I C+NC   GH AR C     + C+ C ++GH+   CP + AG
Sbjct: 385 KRAIKCWNCGKEGHSARQCRAPRRQGCWKCGKSGHIMANCPDRQAG 430



 Score = 47.0 bits (110), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 17/48 (35%), Positives = 24/48 (50%)

Query: 48  FARECATESVTCYNCSGQGHVAKDCTVKSSIICYNCNSSGHFARNCPN 95
           FA      ++ C+NC  +GH A+ C       C+ C  SGH   NCP+
Sbjct: 379 FAAAQQKRAIKCWNCGKEGHSARQCRAPRRQGCWKCGKSGHIMANCPD 426



 Score = 37.4 bits (85), Expect = 0.032,   Method: Composition-based stats.
 Identities = 16/44 (36%), Positives = 21/44 (47%), Gaps = 1/44 (2%)

Query: 30  ARGDKVGIVCYKCNNYGHFAREC-ATESVTCYNCSGQGHVAKDC 72
           A   K  I C+ C   GH AR+C A     C+ C   GH+  +C
Sbjct: 381 AAQQKRAIKCWNCGKEGHSARQCRAPRRQGCWKCGKSGHIMANC 424



 Score = 31.2 bits (69), Expect = 2.2,   Method: Composition-based stats.
 Identities = 13/47 (27%), Positives = 23/47 (48%), Gaps = 11/47 (23%)

Query: 100 RCYACHQAGHMAKECPGQTAGKSPEPVVDMSLTCYVCGHQGHLSYDC 146
           +C+ C + GH A++C      ++P         C+ CG  GH+  +C
Sbjct: 389 KCWNCGKEGHSARQC------RAPR-----RQGCWKCGKSGHIMANC 424


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.321    0.132    0.448 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 57,588,933
Number of Sequences: 539616
Number of extensions: 2177138
Number of successful extensions: 8638
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 267
Number of HSP's successfully gapped in prelim test: 281
Number of HSP's that attempted gapping in prelim test: 5366
Number of HSP's gapped (non-prelim): 2292
length of query: 152
length of database: 191,569,459
effective HSP length: 107
effective length of query: 45
effective length of database: 133,830,547
effective search space: 6022374615
effective search space used: 6022374615
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 56 (26.2 bits)