BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 026649
         (235 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q04832|HEXP_LEIMA DNA-binding protein HEXBP OS=Leishmania major GN=HEXBP PE=4 SV=1
          Length = 271

 Score = 93.6 bits (231), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 83/281 (29%), Positives = 109/281 (38%), Gaps = 99/281 (35%)

Query: 15  CNNCKRPGHFARECPNV--------AVCNNCGLPGHIASECTTQAR--------CWNCRE 58
           C NC + GH+ARECP            C  CG  GH++ EC  +AR        C+ C E
Sbjct: 18  CRNCGKEGHYARECPEADSKGDERSTTCFRCGEEGHMSRECPNEARSGAAGAMTCFRCGE 77

Query: 59  PGHMASNCHNEGI--------CHSCGKTGHRARDCSTH----------------VQSG-- 92
            GHM+ +C N           C+ CG+ GH +RDC +                  Q G  
Sbjct: 78  AGHMSRDCPNSAKPGAAKGFECYKCGQEGHLSRDCPSSQGGSRGGYGQKRGRSGAQGGYS 137

Query: 93  GDLRLCNNCYKPGHIAADCTN---------DKACKNCRKTGHIARDCQN---------EP 134
           GD R C  C   GHI+ DC N         D+ C  C   GHI+RDC N         + 
Sbjct: 138 GD-RTCYKCGDAGHISRDCPNGQGGYSGAGDRTCYKCGDAGHISRDCPNGQGGYSGAGDR 196

Query: 135 VCNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDGGGGGGRYVGYHDVICRSCNQ 194
            C  C  +GH++R+CP   S G                             D  C  C +
Sbjct: 197 KCYKCGESGHMSRECPSAGSTGS---------------------------GDRACYKCGK 229

Query: 195 MGHMSRDCVGPLI-----------ICRNCGGRGHMAYECPS 224
            GH+SR+C                 C  CG  GH++ +CPS
Sbjct: 230 PGHISRECPEAGGSYGGSRGGGDRTCYKCGEAGHISRDCPS 270



 Score = 65.5 bits (158), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 47/178 (26%), Positives = 74/178 (41%), Gaps = 54/178 (30%)

Query: 9   MSQGNLCNNCKRPGHFARECPNVA------------------------VCNNCGLPGHIA 44
            ++G  C  C + GH +R+CP+                           C  CG  GHI+
Sbjct: 93  AAKGFECYKCGQEGHLSRDCPSSQGGSRGGYGQKRGRSGAQGGYSGDRTCYKCGDAGHIS 152

Query: 45  SECTT---------QARCWNCREPGHMASNCHN---------EGICHSCGKTGHRARDCS 86
            +C              C+ C + GH++ +C N         +  C+ CG++GH +R+C 
Sbjct: 153 RDCPNGQGGYSGAGDRTCYKCGDAGHISRDCPNGQGGYSGAGDRKCYKCGESGHMSRECP 212

Query: 87  THVQSGGDLRLCNNCYKPGHIAADCTN------------DKACKNCRKTGHIARDCQN 132
           +   +G   R C  C KPGHI+ +C              D+ C  C + GHI+RDC +
Sbjct: 213 SAGSTGSGDRACYKCGKPGHISRECPEAGGSYGGSRGGGDRTCYKCGEAGHISRDCPS 270



 Score = 49.3 bits (116), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/134 (27%), Positives = 51/134 (38%), Gaps = 41/134 (30%)

Query: 113 NDKACKNCRKTGHIARDC--------QNEPVCNLCNIAGHVARQCPKGDSLGERGGGGGG 164
           +  +C+NC K GH AR+C        +    C  C   GH++R+CP     G  G     
Sbjct: 14  SSTSCRNCGKEGHYARECPEADSKGDERSTTCFRCGEEGHMSRECPNEARSGAAGA---- 69

Query: 165 ERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDC-------VGPLIICRNCGGRGH 217
                                 + C  C + GHMSRDC             C  CG  GH
Sbjct: 70  ----------------------MTCFRCGEAGHMSRDCPNSAKPGAAKGFECYKCGQEGH 107

Query: 218 MAYECPSGRIADRG 231
           ++ +CPS +   RG
Sbjct: 108 LSRDCPSSQGGSRG 121


>sp|P53849|GIS2_YEAST Zinc finger protein GIS2 OS=Saccharomyces cerevisiae (strain ATCC
           204508 / S288c) GN=GIS2 PE=1 SV=1
          Length = 153

 Score = 92.8 bits (229), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 57/157 (36%), Positives = 87/157 (55%), Gaps = 17/157 (10%)

Query: 9   MSQGNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQ-----ARCWNCREPGHMA 63
           MSQ   C  C + GH A +C +  +C NC  PGH+ ++CT        +C+NC E GH+ 
Sbjct: 1   MSQ-KACYVCGKIGHLAEDCDSERLCYNCNKPGHVQTDCTMPRTVEFKQCYNCGETGHVR 59

Query: 64  SNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYK---PGHIAADCTNDK----- 115
           S C  +  C +C +TGH +R+C    ++    ++  +CYK   P H+A DC  +      
Sbjct: 60  SECTVQR-CFNCNQTGHISRECPEPKKTSRFSKV--SCYKCGGPNHMAKDCMKEDGISGL 116

Query: 116 ACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPKG 152
            C  C + GH++RDCQN+ +C  CN  GH+++ CPK 
Sbjct: 117 KCYTCGQAGHMSRDCQNDRLCYNCNETGHISKDCPKA 153



 Score = 83.6 bits (205), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 57/163 (34%), Positives = 78/163 (47%), Gaps = 16/163 (9%)

Query: 68  NEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCT-----NDKACKNCRK 122
           ++  C+ CGK GH A DC +        RLC NC KPGH+  DCT       K C NC +
Sbjct: 2   SQKACYVCGKIGHLAEDCDSE-------RLCYNCNKPGHVQTDCTMPRTVEFKQCYNCGE 54

Query: 123 TGHIARDCQNEPVCNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDGGGGGGRYV 182
           TGH+  +C  +  C  CN  GH++R+CP+             +   GG         +  
Sbjct: 55  TGHVRSECTVQ-RCFNCNQTGHISRECPEPKKTSRFSKVSCYK--CGGPNHMAKDCMKED 111

Query: 183 GYHDVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSG 225
           G   + C +C Q GHMSRDC    + C NC   GH++ +CP  
Sbjct: 112 GISGLKCYTCGQAGHMSRDCQNDRL-CYNCNETGHISKDCPKA 153



 Score = 72.4 bits (176), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 82/162 (50%), Gaps = 21/162 (12%)

Query: 49  TQARCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIA 108
           +Q  C+ C + GH+A +C +E +C++C K GH   DC+  +    + + C NC + GH+ 
Sbjct: 2   SQKACYVCGKIGHLAEDCDSERLCYNCNKPGHVQTDCT--MPRTVEFKQCYNCGETGHVR 59

Query: 109 ADCTNDKACKNCRKTGHIARDCQ--------NEPVCNLCNIAGHVARQCPKGDSLGERGG 160
           ++CT  + C NC +TGHI+R+C         ++  C  C    H+A+ C K D +     
Sbjct: 60  SECTVQR-CFNCNQTGHISRECPEPKKTSRFSKVSCYKCGGPNHMAKDCMKEDGIS---- 114

Query: 161 GGGGERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDC 202
                 G      G  G       +D +C +CN+ GH+S+DC
Sbjct: 115 ------GLKCYTCGQAGHMSRDCQNDRLCYNCNETGHISKDC 150



 Score = 43.9 bits (102), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 47/118 (39%), Gaps = 31/118 (26%)

Query: 113 NDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGGG 172
           + KAC  C K GH+A DC +E +C  CN  GHV   C    ++  +              
Sbjct: 2   SQKACYVCGKIGHLAEDCDSERLCYNCNKPGHVQTDCTMPRTVEFKQ------------- 48

Query: 173 DGGGGGGRYVGYHDVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSGRIADR 230
                           C +C + GH+  +C   +  C NC   GH++ ECP  +   R
Sbjct: 49  ----------------CYNCGETGHVRSECT--VQRCFNCNQTGHISRECPEPKKTSR 88


>sp|P36627|BYR3_SCHPO Cellular nucleic acid-binding protein homolog
           OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
           GN=byr3 PE=4 SV=1
          Length = 179

 Score = 83.6 bits (205), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 64/167 (38%), Positives = 80/167 (47%), Gaps = 19/167 (11%)

Query: 2   LKNVLSFMSQGNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECT---TQARCWNCRE 58
           +  V      G  C NC   GH AREC   ++C NC   GH ASECT    +  C+ C  
Sbjct: 6   VPTVPQTTRPGPRCYNCGENGHQARECTKGSICYNCNQTGHKASECTEPQQEKTCYACGT 65

Query: 59  PGHMASNCHNE------GICHSCGKTGHRARDCSTH-VQSGGDL---RLCNNCY---KPG 105
            GH+  +C +         C+ CG+ GH ARDC T+  QSGG     R   NCY     G
Sbjct: 66  AGHLVRDCPSSPNPRQGAECYKCGRVGHIARDCRTNGQQSGGRFGGHRSNMNCYACGSYG 125

Query: 106 HIAADCTNDKACKNCRKTGHIARDCQNEP---VCNLCNIAGHVARQC 149
           H A DCT    C +C K GH + +CQ      +C  CN  GH+A  C
Sbjct: 126 HQARDCTMGVKCYSCGKIGHRSFECQQASDGQLCYKCNQPGHIAVNC 172



 Score = 67.0 bits (162), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 53/178 (29%), Positives = 75/178 (42%), Gaps = 52/178 (29%)

Query: 52  RCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC 111
           RC+NC E GH A  C    IC++C +TGH+A +C+   Q                     
Sbjct: 18  RCYNCGENGHQARECTKGSICYNCNQTGHKASECTEPQQ--------------------- 56

Query: 112 TNDKACKNCRKTGHIARDCQNEP------VCNLCNIAGHVARQCPKGDSLGERGGGGGGE 165
             +K C  C   GH+ RDC + P       C  C   GH+AR C                
Sbjct: 57  --EKTCYACGTAGHLVRDCPSSPNPRQGAECYKCGRVGHIARDCRT-------------- 100

Query: 166 RGGGGGGDGGGGGGRYVGYH-DVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYEC 222
                  +G   GGR+ G+  ++ C +C   GH +RDC    + C +CG  GH ++EC
Sbjct: 101 -------NGQQSGGRFGGHRSNMNCYACGSYGHQARDCTM-GVKCYSCGKIGHRSFEC 150



 Score = 61.6 bits (148), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 55/168 (32%), Positives = 76/168 (45%), Gaps = 21/168 (12%)

Query: 72  CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCT---NDKACKNCRKTGHIAR 128
           C++CG+ GH+AR+C+          +C NC + GH A++CT    +K C  C   GH+ R
Sbjct: 19  CYNCGENGHQARECT-------KGSICYNCNQTGHKASECTEPQQEKTCYACGTAGHLVR 71

Query: 129 DCQNEP------VCNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDGGGGGGRYV 182
           DC + P       C  C   GH+AR C       + GG  GG R        G  G +  
Sbjct: 72  DCPSSPNPRQGAECYKCGRVGHIARDCRTNGQ--QSGGRFGGHRSNMNCYACGSYGHQAR 129

Query: 183 G-YHDVICRSCNQMGHMSRDC--VGPLIICRNCGGRGHMAYECPSGRI 227
                V C SC ++GH S +C       +C  C   GH+A  C S  I
Sbjct: 130 DCTMGVKCYSCGKIGHRSFECQQASDGQLCYKCNQPGHIAVNCTSPVI 177



 Score = 35.0 bits (79), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 25/50 (50%), Gaps = 2/50 (4%)

Query: 188 ICRSCNQMGHMSRDCVGPL--IICRNCGGRGHMAYECPSGRIADRGYRRY 235
           IC +CNQ GH + +C  P     C  CG  GH+  +CPS     +G   Y
Sbjct: 37  ICYNCNQTGHKASECTEPQQEKTCYACGTAGHLVRDCPSSPNPRQGAECY 86


>sp|Q3T0Q6|CNBP_BOVIN Cellular nucleic acid-binding protein OS=Bos taurus GN=CNBP PE=2
           SV=1
          Length = 170

 Score = 82.0 bits (201), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 62/168 (36%), Positives = 81/168 (48%), Gaps = 36/168 (21%)

Query: 12  GNLCNNCKRPGHFARECPNVA----------------------VCNNCGLPGHIASECTT 49
            N C  C R GH+ARECP                         +C  CG  GH+A +C  
Sbjct: 3   SNECFKCGRSGHWARECPTGGGRGRGMRSRGRGFQFVSSSLPDICYRCGESGHLAKDCDL 62

Query: 50  QAR-CWNCREPGHMASNCHN-----EGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYK 103
           Q   C+NC   GH+A +C       E  C++CGK GH ARDC        D + C +C +
Sbjct: 63  QEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDH-----ADEQKCYSCGE 117

Query: 104 PGHIAADCTNDKACKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
            GHI  DCT  K C  C +TGH+A +C   +E  C  C  +GH+AR+C
Sbjct: 118 FGHIQKDCTKVK-CYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 164



 Score = 73.6 bits (179), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 52/142 (36%), Positives = 76/142 (53%), Gaps = 20/142 (14%)

Query: 6   LSFMSQG--NLCNNCKRPGHFARECP-NVAVCNNCGLPGHIASECTTQAR-----CWNCR 57
             F+S    ++C  C   GH A++C      C NCG  GHIA +C    R     C+NC 
Sbjct: 36  FQFVSSSLPDICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCG 95

Query: 58  EPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC--TN 113
           +PGH+A +C   +E  C+SCG+ GH  +DC T V+       C  C + GH+A +C  T+
Sbjct: 96  KPGHLARDCDHADEQKCYSCGEFGHIQKDC-TKVK-------CYRCGETGHVAINCSKTS 147

Query: 114 DKACKNCRKTGHIARDCQNEPV 135
           +  C  C ++GH+AR+C  E  
Sbjct: 148 EVNCYRCGESGHLARECTIEAT 169



 Score = 59.7 bits (143), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 50/81 (61%), Gaps = 5/81 (6%)

Query: 15  CNNCKRPGHFARECPNV--AVCNNCGLPGHIASECTTQARCWNCREPGHMASNCH--NEG 70
           C NC +PGH AR+C +     C +CG  GHI  +CT + +C+ C E GH+A NC   +E 
Sbjct: 91  CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCT-KVKCYRCGETGHVAINCSKTSEV 149

Query: 71  ICHSCGKTGHRARDCSTHVQS 91
            C+ CG++GH AR+C+    +
Sbjct: 150 NCYRCGESGHLARECTIEATA 170



 Score = 32.0 bits (71), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 188 ICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSGR 226
           IC  C + GH+++DC      C NCG  GH+A +C   +
Sbjct: 46  ICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPK 84


>sp|O42395|CNBP_CHICK Cellular nucleic acid-binding protein OS=Gallus gallus GN=CNBP PE=2
           SV=1
          Length = 172

 Score = 81.3 bits (199), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 62/170 (36%), Positives = 81/170 (47%), Gaps = 38/170 (22%)

Query: 12  GNLCNNCKRPGHFARECPNVA-----------------------VCNNCGLPGHIASECT 48
            N C  C R GH+ARECP                          +C  CG  GH+A +C 
Sbjct: 3   SNECFKCGRTGHWARECPTGIGRGRGMRSRGRAGFQFMSSSLPDICYRCGESGHLAKDCD 62

Query: 49  TQAR--CWNCREPGHMASNCHN-----EGICHSCGKTGHRARDCSTHVQSGGDLRLCNNC 101
            Q    C+NC   GH+A +C       E  C++CGK GH ARDC        D + C +C
Sbjct: 63  LQEDKACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDH-----ADEQKCYSC 117

Query: 102 YKPGHIAADCTNDKACKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
            + GHI  DCT  K C  C +TGH+A +C   +E  C  C  +GH+AR+C
Sbjct: 118 GEFGHIQKDCTKVK-CYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 166



 Score = 75.1 bits (183), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 53/143 (37%), Positives = 76/143 (53%), Gaps = 21/143 (14%)

Query: 6   LSFMSQG--NLCNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQAR-----CWNC 56
             FMS    ++C  C   GH A++C       C NCG  GHIA +C    R     C+NC
Sbjct: 37  FQFMSSSLPDICYRCGESGHLAKDCDLQEDKACYNCGRGGHIAKDCKEPKREREQCCYNC 96

Query: 57  REPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC--T 112
            +PGH+A +C   +E  C+SCG+ GH  +DC T V+       C  C + GH+A +C  T
Sbjct: 97  GKPGHLARDCDHADEQKCYSCGEFGHIQKDC-TKVK-------CYRCGETGHVAINCSKT 148

Query: 113 NDKACKNCRKTGHIARDCQNEPV 135
           ++  C  C ++GH+AR+C  E  
Sbjct: 149 SEVNCYRCGESGHLARECTIEAT 171



 Score = 63.2 bits (152), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 53/174 (30%), Positives = 68/174 (39%), Gaps = 57/174 (32%)

Query: 72  CHSCGKTGHRARDCST----------------HVQSGGDLRLCNNCYKPGHIAADC--TN 113
           C  CG+TGH AR+C T                   S     +C  C + GH+A DC    
Sbjct: 6   CFKCGRTGHWARECPTGIGRGRGMRSRGRAGFQFMSSSLPDICYRCGESGHLAKDCDLQE 65

Query: 114 DKACKNCRKTGHIARDCQN-----EPVCNLCNIAGHVARQCPKGDSLGERGGGGGGERGG 168
           DKAC NC + GHIA+DC+      E  C  C   GH+AR C   D               
Sbjct: 66  DKACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHAD--------------- 110

Query: 169 GGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYEC 222
                            +  C SC + GH+ +DC    + C  CG  GH+A  C
Sbjct: 111 -----------------EQKCYSCGEFGHIQKDCTK--VKCYRCGETGHVAINC 145



 Score = 59.7 bits (143), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 50/81 (61%), Gaps = 5/81 (6%)

Query: 15  CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQARCWNCREPGHMASNCH--NEG 70
           C NC +PGH AR+C   +   C +CG  GHI  +CT + +C+ C E GH+A NC   +E 
Sbjct: 93  CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCT-KVKCYRCGETGHVAINCSKTSEV 151

Query: 71  ICHSCGKTGHRARDCSTHVQS 91
            C+ CG++GH AR+C+    +
Sbjct: 152 NCYRCGESGHLARECTIEATA 172


>sp|P62634|CNBP_RAT Cellular nucleic acid-binding protein OS=Rattus norvegicus GN=Cnbp
           PE=2 SV=1
          Length = 177

 Score = 79.7 bits (195), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/175 (35%), Positives = 81/175 (46%), Gaps = 43/175 (24%)

Query: 12  GNLCNNCKRPGHFARECPNVA-----------------------------VCNNCGLPGH 42
            N C  C R GH+ARECP                                +C  CG  GH
Sbjct: 3   SNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGH 62

Query: 43  IASECTTQAR-CWNCREPGHMASNCHN-----EGICHSCGKTGHRARDCSTHVQSGGDLR 96
           +A +C  Q   C+NC   GH+A +C       E  C++CGK GH ARDC        D +
Sbjct: 63  LAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDH-----ADEQ 117

Query: 97  LCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
            C +C + GHI  DCT  K C  C +TGH+A +C   +E  C  C  +GH+AR+C
Sbjct: 118 KCYSCGEFGHIQKDCTKVK-CYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 171



 Score = 73.9 bits (180), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 52/142 (36%), Positives = 76/142 (53%), Gaps = 20/142 (14%)

Query: 6   LSFMSQG--NLCNNCKRPGHFARECP-NVAVCNNCGLPGHIASECTTQAR-----CWNCR 57
             F+S    ++C  C   GH A++C      C NCG  GHIA +C    R     C+NC 
Sbjct: 43  FQFVSSSLPDICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCG 102

Query: 58  EPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC--TN 113
           +PGH+A +C   +E  C+SCG+ GH  +DC T V+       C  C + GH+A +C  T+
Sbjct: 103 KPGHLARDCDHADEQKCYSCGEFGHIQKDC-TKVK-------CYRCGETGHVAINCSKTS 154

Query: 114 DKACKNCRKTGHIARDCQNEPV 135
           +  C  C ++GH+AR+C  E  
Sbjct: 155 EVNCYRCGESGHLARECTIEAT 176



 Score = 59.7 bits (143), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 50/81 (61%), Gaps = 5/81 (6%)

Query: 15  CNNCKRPGHFARECPNV--AVCNNCGLPGHIASECTTQARCWNCREPGHMASNCH--NEG 70
           C NC +PGH AR+C +     C +CG  GHI  +CT + +C+ C E GH+A NC   +E 
Sbjct: 98  CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCT-KVKCYRCGETGHVAINCSKTSEV 156

Query: 71  ICHSCGKTGHRARDCSTHVQS 91
            C+ CG++GH AR+C+    +
Sbjct: 157 NCYRCGESGHLARECTIEATA 177



 Score = 32.0 bits (71), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 188 ICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSGR 226
           IC  C + GH+++DC      C NCG  GH+A +C   +
Sbjct: 53  ICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPK 91


>sp|Q5R5R5|CNBP_PONAB Cellular nucleic acid-binding protein OS=Pongo abelii GN=CNBP PE=2
           SV=1
          Length = 177

 Score = 79.7 bits (195), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/175 (35%), Positives = 81/175 (46%), Gaps = 43/175 (24%)

Query: 12  GNLCNNCKRPGHFARECPNVA-----------------------------VCNNCGLPGH 42
            N C  C R GH+ARECP                                +C  CG  GH
Sbjct: 3   SNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGH 62

Query: 43  IASECTTQAR-CWNCREPGHMASNCHN-----EGICHSCGKTGHRARDCSTHVQSGGDLR 96
           +A +C  Q   C+NC   GH+A +C       E  C++CGK GH ARDC        D +
Sbjct: 63  LAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDH-----ADEQ 117

Query: 97  LCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
            C +C + GHI  DCT  K C  C +TGH+A +C   +E  C  C  +GH+AR+C
Sbjct: 118 KCYSCGEFGHIQKDCTKVK-CYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 171



 Score = 73.9 bits (180), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 52/142 (36%), Positives = 76/142 (53%), Gaps = 20/142 (14%)

Query: 6   LSFMSQG--NLCNNCKRPGHFARECP-NVAVCNNCGLPGHIASECTTQAR-----CWNCR 57
             F+S    ++C  C   GH A++C      C NCG  GHIA +C    R     C+NC 
Sbjct: 43  FQFVSSSLPDICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCG 102

Query: 58  EPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC--TN 113
           +PGH+A +C   +E  C+SCG+ GH  +DC T V+       C  C + GH+A +C  T+
Sbjct: 103 KPGHLARDCDHADEQKCYSCGEFGHIQKDC-TKVK-------CYRCGETGHVAINCSKTS 154

Query: 114 DKACKNCRKTGHIARDCQNEPV 135
           +  C  C ++GH+AR+C  E  
Sbjct: 155 EVNCYRCGESGHLARECTIEAT 176



 Score = 59.7 bits (143), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 50/81 (61%), Gaps = 5/81 (6%)

Query: 15  CNNCKRPGHFARECPNV--AVCNNCGLPGHIASECTTQARCWNCREPGHMASNCH--NEG 70
           C NC +PGH AR+C +     C +CG  GHI  +CT + +C+ C E GH+A NC   +E 
Sbjct: 98  CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCT-KVKCYRCGETGHVAINCSKTSEV 156

Query: 71  ICHSCGKTGHRARDCSTHVQS 91
            C+ CG++GH AR+C+    +
Sbjct: 157 NCYRCGESGHLARECTIEATA 177



 Score = 32.0 bits (71), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 188 ICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSGR 226
           IC  C + GH+++DC      C NCG  GH+A +C   +
Sbjct: 53  ICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPK 91


>sp|P62633|CNBP_HUMAN Cellular nucleic acid-binding protein OS=Homo sapiens GN=CNBP PE=1
           SV=1
          Length = 177

 Score = 79.7 bits (195), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/175 (35%), Positives = 81/175 (46%), Gaps = 43/175 (24%)

Query: 12  GNLCNNCKRPGHFARECPNVA-----------------------------VCNNCGLPGH 42
            N C  C R GH+ARECP                                +C  CG  GH
Sbjct: 3   SNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGH 62

Query: 43  IASECTTQAR-CWNCREPGHMASNCHN-----EGICHSCGKTGHRARDCSTHVQSGGDLR 96
           +A +C  Q   C+NC   GH+A +C       E  C++CGK GH ARDC        D +
Sbjct: 63  LAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDH-----ADEQ 117

Query: 97  LCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
            C +C + GHI  DCT  K C  C +TGH+A +C   +E  C  C  +GH+AR+C
Sbjct: 118 KCYSCGEFGHIQKDCTKVK-CYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 171



 Score = 73.9 bits (180), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 52/142 (36%), Positives = 76/142 (53%), Gaps = 20/142 (14%)

Query: 6   LSFMSQG--NLCNNCKRPGHFARECP-NVAVCNNCGLPGHIASECTTQAR-----CWNCR 57
             F+S    ++C  C   GH A++C      C NCG  GHIA +C    R     C+NC 
Sbjct: 43  FQFVSSSLPDICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCG 102

Query: 58  EPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC--TN 113
           +PGH+A +C   +E  C+SCG+ GH  +DC T V+       C  C + GH+A +C  T+
Sbjct: 103 KPGHLARDCDHADEQKCYSCGEFGHIQKDC-TKVK-------CYRCGETGHVAINCSKTS 154

Query: 114 DKACKNCRKTGHIARDCQNEPV 135
           +  C  C ++GH+AR+C  E  
Sbjct: 155 EVNCYRCGESGHLARECTIEAT 176



 Score = 59.7 bits (143), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 50/81 (61%), Gaps = 5/81 (6%)

Query: 15  CNNCKRPGHFARECPNV--AVCNNCGLPGHIASECTTQARCWNCREPGHMASNCH--NEG 70
           C NC +PGH AR+C +     C +CG  GHI  +CT + +C+ C E GH+A NC   +E 
Sbjct: 98  CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCT-KVKCYRCGETGHVAINCSKTSEV 156

Query: 71  ICHSCGKTGHRARDCSTHVQS 91
            C+ CG++GH AR+C+    +
Sbjct: 157 NCYRCGESGHLARECTIEATA 177



 Score = 32.0 bits (71), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 188 ICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSGR 226
           IC  C + GH+++DC      C NCG  GH+A +C   +
Sbjct: 53  ICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPK 91


>sp|P53996|CNBP_MOUSE Cellular nucleic acid-binding protein OS=Mus musculus GN=Cnbp PE=2
           SV=2
          Length = 178

 Score = 79.3 bits (194), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/176 (35%), Positives = 81/176 (46%), Gaps = 44/176 (25%)

Query: 12  GNLCNNCKRPGHFARECPNVA-----------------------------VCNNCGLPGH 42
            N C  C R GH+ARECP                                +C  CG  GH
Sbjct: 3   SNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGH 62

Query: 43  IASECTTQAR--CWNCREPGHMASNCHN-----EGICHSCGKTGHRARDCSTHVQSGGDL 95
           +A +C  Q    C+NC   GH+A +C       E  C++CGK GH ARDC        D 
Sbjct: 63  LAKDCDLQEDEACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDH-----ADE 117

Query: 96  RLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
           + C +C + GHI  DCT  K C  C +TGH+A +C   +E  C  C  +GH+AR+C
Sbjct: 118 QKCYSCGEFGHIQKDCTKVK-CYRCGETGHVAINCSKTSEVNCYRCGESGHLAREC 172



 Score = 73.6 bits (179), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 52/143 (36%), Positives = 76/143 (53%), Gaps = 21/143 (14%)

Query: 6   LSFMSQG--NLCNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQAR-----CWNC 56
             F+S    ++C  C   GH A++C       C NCG  GHIA +C    R     C+NC
Sbjct: 43  FQFVSSSLPDICYRCGESGHLAKDCDLQEDEACYNCGRGGHIAKDCKEPKREREQCCYNC 102

Query: 57  REPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC--T 112
            +PGH+A +C   +E  C+SCG+ GH  +DC T V+       C  C + GH+A +C  T
Sbjct: 103 GKPGHLARDCDHADEQKCYSCGEFGHIQKDC-TKVK-------CYRCGETGHVAINCSKT 154

Query: 113 NDKACKNCRKTGHIARDCQNEPV 135
           ++  C  C ++GH+AR+C  E  
Sbjct: 155 SEVNCYRCGESGHLARECTIEAT 177



 Score = 59.7 bits (143), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 50/81 (61%), Gaps = 5/81 (6%)

Query: 15  CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQARCWNCREPGHMASNCH--NEG 70
           C NC +PGH AR+C   +   C +CG  GHI  +CT + +C+ C E GH+A NC   +E 
Sbjct: 99  CYNCGKPGHLARDCDHADEQKCYSCGEFGHIQKDCT-KVKCYRCGETGHVAINCSKTSEV 157

Query: 71  ICHSCGKTGHRARDCSTHVQS 91
            C+ CG++GH AR+C+    +
Sbjct: 158 NCYRCGESGHLARECTIEATA 178



 Score = 58.2 bits (139), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 51/180 (28%), Positives = 68/180 (37%), Gaps = 63/180 (35%)

Query: 72  CHSCGKTGHRARDCST----------------------HVQSGGDLRLCNNCYKPGHIAA 109
           C  CG++GH AR+C T                         S     +C  C + GH+A 
Sbjct: 6   CFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGESGHLAK 65

Query: 110 DC--TNDKACKNCRKTGHIARDCQN-----EPVCNLCNIAGHVARQCPKGDSLGERGGGG 162
           DC    D+AC NC + GHIA+DC+      E  C  C   GH+AR C   D         
Sbjct: 66  DCDLQEDEACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHAD--------- 116

Query: 163 GGERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYEC 222
                                  +  C SC + GH+ +DC    + C  CG  GH+A  C
Sbjct: 117 -----------------------EQKCYSCGEFGHIQKDCTK--VKCYRCGETGHVAINC 151


>sp|O76743|GLH4_CAEEL ATP-dependent RNA helicase glh-4 OS=Caenorhabditis elegans GN=glh-4
           PE=2 SV=2
          Length = 1156

 Score = 72.8 bits (177), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 42/116 (36%), Positives = 54/116 (46%), Gaps = 18/116 (15%)

Query: 34  CNNCGLPGHIASEC----TTQARCWNCREPGHMASNCHN----EGICHSCGKTGHRARDC 85
           C+NCG  GHI+ EC      +  C NC + GH AS+C       G C +CG  GH A DC
Sbjct: 572 CHNCGEEGHISKECDKPKVPRFPCRNCEQLGHFASDCDQPRVPRGPCRNCGIEGHFAVDC 631

Query: 86  STHVQSGGDLRLCNNCYKPGHIAADCTNDKA-------CKNCRKTGHIARDCQNEP 134
                  G    C NC + GH A DC N++        C+ C + GH   +C   P
Sbjct: 632 DQPKVPRGP---CRNCGQEGHFAKDCQNERVRMEPTEPCRRCAEEGHWGYECPTRP 684



 Score = 70.9 bits (172), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 42/113 (37%), Positives = 51/113 (45%), Gaps = 18/113 (15%)

Query: 53  CWNCREPGHMASNCHNEGI----CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIA 108
           C NC E GH++  C    +    C +C + GH A DC       G    C NC   GH A
Sbjct: 572 CHNCGEEGHISKECDKPKVPRFPCRNCEQLGHFASDCDQPRVPRGP---CRNCGIEGHFA 628

Query: 109 ADCTNDKA----CKNCRKTGHIARDCQNEPV-------CNLCNIAGHVARQCP 150
            DC   K     C+NC + GH A+DCQNE V       C  C   GH   +CP
Sbjct: 629 VDCDQPKVPRGPCRNCGQEGHFAKDCQNERVRMEPTEPCRRCAEEGHWGYECP 681



 Score = 66.6 bits (161), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 45/148 (30%), Positives = 57/148 (38%), Gaps = 54/148 (36%)

Query: 96  RLCNNCYKPGHIAADCTNDKA----CKNCRKTGHIARDCQNEPV----CNLCNIAGHVAR 147
           R C+NC + GHI+ +C   K     C+NC + GH A DC    V    C  C I GH A 
Sbjct: 570 RGCHNCGEEGHISKECDKPKVPRFPCRNCEQLGHFASDCDQPRVPRGPCRNCGIEGHFAV 629

Query: 148 QC-----PKGDSLGERGGGGGGERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDC 202
            C     P+G                                    CR+C Q GH ++DC
Sbjct: 630 DCDQPKVPRGP-----------------------------------CRNCGQEGHFAKDC 654

Query: 203 ------VGPLIICRNCGGRGHMAYECPS 224
                 + P   CR C   GH  YECP+
Sbjct: 655 QNERVRMEPTEPCRRCAEEGHWGYECPT 682



 Score = 61.6 bits (148), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 32/88 (36%), Positives = 43/88 (48%), Gaps = 15/88 (17%)

Query: 15  CNNCKRPGHFARECPNVAV----CNNCGLPGHIASEC----TTQARCWNCREPGHMASNC 66
           C NC++ GHFA +C    V    C NCG+ GH A +C      +  C NC + GH A +C
Sbjct: 595 CRNCEQLGHFASDCDQPRVPRGPCRNCGIEGHFAVDCDQPKVPRGPCRNCGQEGHFAKDC 654

Query: 67  HNEGI-------CHSCGKTGHRARDCST 87
            NE +       C  C + GH   +C T
Sbjct: 655 QNERVRMEPTEPCRRCAEEGHWGYECPT 682



 Score = 42.7 bits (99), Expect = 0.002,   Method: Composition-based stats.
 Identities = 18/42 (42%), Positives = 25/42 (59%), Gaps = 3/42 (7%)

Query: 189 CRSCNQMGHMSRDCVGPLI---ICRNCGGRGHMAYECPSGRI 227
           CR+C Q+GH + DC  P +    CRNCG  GH A +C   ++
Sbjct: 595 CRNCEQLGHFASDCDQPRVPRGPCRNCGIEGHFAVDCDQPKV 636



 Score = 35.8 bits (81), Expect = 0.25,   Method: Composition-based stats.
 Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 7/45 (15%)

Query: 15  CNNCKRPGHFARECPNVAV-------CNNCGLPGHIASECTTQAR 52
           C NC + GHFA++C N  V       C  C   GH   EC T+ +
Sbjct: 641 CRNCGQEGHFAKDCQNERVRMEPTEPCRRCAEEGHWGYECPTRPK 685


>sp|O65639|CSP1_ARATH Cold shock protein 1 OS=Arabidopsis thaliana GN=CSP1 PE=2 SV=1
          Length = 299

 Score = 72.0 bits (175), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 55/167 (32%), Positives = 70/167 (41%), Gaps = 52/167 (31%)

Query: 15  CNNCKRPGHFARECPNVAV-------------CNNCGLPGHIASECTTQAR--------- 52
           C NC   GHFAR+C +                C  CG  GH+A +CT ++          
Sbjct: 134 CYNCGDTGHFARDCTSAGNGDQRGATKGGNDGCYTCGDVGHVARDCTQKSVGNGDQRGAV 193

Query: 53  ------CWNCREPGHMASNCHNE-------------GICHSCGKTGHRARDCSTHVQSGG 93
                 C+ C + GH A +C  +             G C+SCG  GH ARDC+T  Q   
Sbjct: 194 KGGNDGCYTCGDVGHFARDCTQKVAAGNVRSGGGGSGTCYSCGGVGHIARDCATKRQP-- 251

Query: 94  DLRLCNNCYKPGHIAADCT--------NDKACKNCRKTGHIARDCQN 132
             R C  C   GH+A DC         ND AC  C K GH AR+C +
Sbjct: 252 -SRGCYQCGGSGHLARDCDQRGSGGGGNDNACYKCGKEGHFARECSS 297



 Score = 70.5 bits (171), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 67/220 (30%), Positives = 85/220 (38%), Gaps = 77/220 (35%)

Query: 34  CNNCGLPGHIASEC-------------TTQARCWNCREPGHMASNCHNEGI--------- 71
           C NCG  GHI+ +C                  C+NC + GH A +C + G          
Sbjct: 102 CYNCGELGHISKDCGIGGGGGGGERRSRGGEGCYNCGDTGHFARDCTSAGNGDQRGATKG 161

Query: 72  ----CHSCGKTGHRARDCSTHVQSGGDLR--------LCNNCYKPGHIAADCTNDKACKN 119
               C++CG  GH ARDC+      GD R         C  C   GH A DCT   A  N
Sbjct: 162 GNDGCYTCGDVGHVARDCTQKSVGNGDQRGAVKGGNDGCYTCGDVGHFARDCTQKVAAGN 221

Query: 120 CRK-------------TGHIARDC--QNEPV--CNLCNIAGHVARQCPKGDSLGERGGGG 162
            R               GHIARDC  + +P   C  C  +GH+AR C             
Sbjct: 222 VRSGGGGSGTCYSCGGVGHIARDCATKRQPSRGCYQCGGSGHLARDCD------------ 269

Query: 163 GGERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDC 202
             +RG GGGG+            D  C  C + GH +R+C
Sbjct: 270 --QRGSGGGGN------------DNACYKCGKEGHFAREC 295



 Score = 63.9 bits (154), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 60/202 (29%), Positives = 78/202 (38%), Gaps = 65/202 (32%)

Query: 53  CWNCREPGHMASNCHNEGI-------------CHSCGKTGHRARDCSTHVQSGGDLRLCN 99
           C+NC E GH++ +C   G              C++CG TGH ARDC++     GD R   
Sbjct: 102 CYNCGELGHISKDCGIGGGGGGGERRSRGGEGCYNCGDTGHFARDCTS--AGNGDQR--- 156

Query: 100 NCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPV---------------CNLCNIAGH 144
                    A    +  C  C   GH+ARDC  + V               C  C   GH
Sbjct: 157 --------GATKGGNDGCYTCGDVGHVARDCTQKSVGNGDQRGAVKGGNDGCYTCGDVGH 208

Query: 145 VARQCPKGDSLGERGGGGGGERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDCVG 204
            AR C +  + G    GGGG                        C SC  +GH++RDC  
Sbjct: 209 FARDCTQKVAAGNVRSGGGGSG---------------------TCYSCGGVGHIARDCAT 247

Query: 205 ---PLIICRNCGGRGHMAYECP 223
              P   C  CGG GH+A +C 
Sbjct: 248 KRQPSRGCYQCGGSGHLARDCD 269



 Score = 38.9 bits (89), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 46/154 (29%), Positives = 60/154 (38%), Gaps = 40/154 (25%)

Query: 117 CKNCRKTGHIARDCQNEPV-------------CNLCNIAGHVARQCPKGDSLGERGGGGG 163
           C NC + GHI++DC                  C  C   GH AR C    +  +RG   G
Sbjct: 102 CYNCGELGHISKDCGIGGGGGGGERRSRGGEGCYNCGDTGHFARDCTSAGNGDQRGATKG 161

Query: 164 GERGGGGGGDGG-----------GGG---GRYVGYHDVICRSCNQMGHMSRDCVGPLI-- 207
           G  G    GD G           G G   G   G +D  C +C  +GH +RDC   +   
Sbjct: 162 GNDGCYTCGDVGHVARDCTQKSVGNGDQRGAVKGGNDG-CYTCGDVGHFARDCTQKVAAG 220

Query: 208 ----------ICRNCGGRGHMAYECPSGRIADRG 231
                      C +CGG GH+A +C + R   RG
Sbjct: 221 NVRSGGGGSGTCYSCGGVGHIARDCATKRQPSRG 254



 Score = 36.2 bits (82), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 38/88 (43%), Gaps = 15/88 (17%)

Query: 4   NVLSFMSQGNLCNNCKRPGHFARECPNVAV----CNNCGLPGHIASECTTQARCWNCREP 59
           NV S       C +C   GH AR+C         C  CG  GH+A +C  +         
Sbjct: 221 NVRSGGGGSGTCYSCGGVGHIARDCATKRQPSRGCYQCGGSGHLARDCDQRG-------- 272

Query: 60  GHMASNCHNEGICHSCGKTGHRARDCST 87
              +    N+  C+ CGK GH AR+CS+
Sbjct: 273 ---SGGGGNDNACYKCGKEGHFARECSS 297


>sp|Q8T8R1|Y3800_DROME CCHC-type zinc finger protein CG3800 OS=Drosophila melanogaster
           GN=CG3800 PE=1 SV=1
          Length = 165

 Score = 65.1 bits (157), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 36/161 (22%)

Query: 9   MSQGNLCNNCKRPGHFARECP-------------------------------NVAVCNNC 37
           MS    C  C RPGHFAR+C                                N   C  C
Sbjct: 1   MSMSATCYKCNRPGHFARDCSLGGGGGPGGVGGGGGGGGGGMRGNDGGGMRRNREKCYKC 60

Query: 38  GLPGHIASECTTQA-RCWNCREPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGD 94
              GH A  C  +A RC+ C   GH++ +C   +   C+ C KTGH  R+C   V   G 
Sbjct: 61  NQFGHFARACPEEAERCYRCNGIGHISKDCTQADNPTCYRCNKTGHWVRNCPEAVNERGP 120

Query: 95  LRL-CNNCYKPGHIAADC-TNDKACKNCRKTGHIARDCQNE 133
             + C  C + GHI+ +C    K C  C K+GH+ R+C  +
Sbjct: 121 TNVSCYKCNRTGHISKNCPETSKTCYGCGKSGHLRRECDEK 161



 Score = 57.8 bits (138), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 50/180 (27%), Positives = 68/180 (37%), Gaps = 53/180 (29%)

Query: 70  GICHSCGKTGHRARDCSTHVQSGG------------------------DLRLCNNCYKPG 105
             C+ C + GH ARDCS     G                         +   C  C + G
Sbjct: 5   ATCYKCNRPGHFARDCSLGGGGGPGGVGGGGGGGGGGMRGNDGGGMRRNREKCYKCNQFG 64

Query: 106 HIAADCTND-KACKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQCPKGDSLGERGGGG 162
           H A  C  + + C  C   GHI++DC   + P C  CN  GH  R CP  +++ ERG   
Sbjct: 65  HFARACPEEAERCYRCNGIGHISKDCTQADNPTCYRCNKTGHWVRNCP--EAVNERGPT- 121

Query: 163 GGERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYEC 222
                                  +V C  CN+ GH+S++C      C  CG  GH+  EC
Sbjct: 122 -----------------------NVSCYKCNRTGHISKNCPETSKTCYGCGKSGHLRREC 158



 Score = 30.8 bits (68), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 14/37 (37%), Positives = 18/37 (48%)

Query: 189 CRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSG 225
           C  CNQ GH +R C      C  C G GH++ +C   
Sbjct: 57  CYKCNQFGHFARACPEEAERCYRCNGIGHISKDCTQA 93


>sp|Q8WW36|ZCH13_HUMAN Zinc finger CCHC domain-containing protein 13 OS=Homo sapiens
           GN=ZCCHC13 PE=2 SV=1
          Length = 166

 Score = 55.5 bits (132), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 42/108 (38%), Positives = 55/108 (50%), Gaps = 6/108 (5%)

Query: 48  TTQARCWNCREPGHMASNCHNEG-ICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGH 106
           T    C+ C E G  A NC   G IC++CG++GH A+DC    +     + C  C + GH
Sbjct: 42  TLSYTCYCCGESGRNAKNCVLLGNICYNCGRSGHIAKDCKDPKRE--RRQHCYTCGRLGH 99

Query: 107 IAADCTNDK--ACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPKG 152
           +A DC   K   C +C K GHI +DC  +  C  C   GHVA  C K 
Sbjct: 100 LARDCDRQKEQKCYSCGKLGHIQKDCA-QVKCYRCGEIGHVAINCSKA 146



 Score = 51.6 bits (122), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 51/91 (56%), Gaps = 8/91 (8%)

Query: 3   KNVLSFMSQGNLCNNCKRPGHFARECPNVAV-----CNNCGLPGHIASECTTQA--RCWN 55
           +N  + +  GN+C NC R GH A++C +        C  CG  GH+A +C  Q   +C++
Sbjct: 55  RNAKNCVLLGNICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQKCYS 114

Query: 56  CREPGHMASNCHNEGICHSCGKTGHRARDCS 86
           C + GH+  +C  +  C+ CG+ GH A +CS
Sbjct: 115 CGKLGHIQKDC-AQVKCYRCGEIGHVAINCS 144



 Score = 50.8 bits (120), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 52/109 (47%), Gaps = 16/109 (14%)

Query: 15  CNNCKRPGHFARECPNVA-VCNNCGLPGHIASECTTQAR-----CWNCREPGHMASNC-- 66
           C  C   G  A+ C  +  +C NCG  GHIA +C    R     C+ C   GH+A +C  
Sbjct: 47  CYCCGESGRNAKNCVLLGNICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDR 106

Query: 67  HNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDK 115
             E  C+SCGK GH  +DC+           C  C + GH+A +C+  +
Sbjct: 107 QKEQKCYSCGKLGHIQKDCAQ--------VKCYRCGEIGHVAINCSKAR 147



 Score = 38.9 bits (89), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 31/115 (26%), Positives = 45/115 (39%), Gaps = 39/115 (33%)

Query: 117 CKNCRKTGHIARDCQNEPV-----CNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGG 171
           C NC ++GHIA+DC++        C  C   GH+AR C +                    
Sbjct: 67  CYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQK------------------ 108

Query: 172 GDGGGGGGRYVGYHDVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSGR 226
                         +  C SC ++GH+ +DC    + C  CG  GH+A  C   R
Sbjct: 109 --------------EQKCYSCGKLGHIQKDCAQ--VKCYRCGEIGHVAINCSKAR 147


>sp|Q94C69|CSP3_ARATH Cold shock domain-containing protein 3 OS=Arabidopsis thaliana
           GN=CSP3 PE=2 SV=1
          Length = 301

 Score = 53.5 bits (127), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 58/207 (28%), Positives = 72/207 (34%), Gaps = 67/207 (32%)

Query: 11  QGNLCNNCKRPGHFARECP-----------------NVAVCNNCGLPGHIASECTTQAR- 52
            G  C NC   GH A++C                      C  CG  GH A +C      
Sbjct: 92  SGGNCFNCGEVGHMAKDCDGGSGGKSFGGGGGRRSGGEGECYMCGDVGHFARDCRQSGGG 151

Query: 53  -----------CWNCREPGHMASNC----------------HNEGICHSCGKTGHRARDC 85
                      C++C E GH+A +C                     C+ CG  GH ARDC
Sbjct: 152 NSGGGGGGGRPCYSCGEVGHLAKDCRGGSGGNRYGGGGGRGSGGDGCYMCGGVGHFARDC 211

Query: 86  --STHVQSGGDLRLCNNCYKPGHIAADCTND---------KACKNCRKTGHIARDCQNEP 134
             +     GG    C  C   GHIA  CT+          +AC  C  TGH+ARDC    
Sbjct: 212 RQNGGGNVGGGGSTCYTCGGVGHIAKVCTSKIPSGGGGGGRACYECGGTGHLARDCDRRG 271

Query: 135 V-----------CNLCNIAGHVARQCP 150
                       C +C   GH AR+C 
Sbjct: 272 SGSSGGGGGSNKCFICGKEGHFARECT 298



 Score = 44.3 bits (103), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 63/233 (27%), Positives = 75/233 (32%), Gaps = 90/233 (38%)

Query: 53  CWNCREPGHMASNCHNEGI-----------------CHSCGKTGHRARDC-----STHVQ 90
           C+NC E GHMA +C                      C+ CG  GH ARDC          
Sbjct: 96  CFNCGEVGHMAKDCDGGSGGKSFGGGGGRRSGGEGECYMCGDVGHFARDCRQSGGGNSGG 155

Query: 91  SGGDLRLCNNCYKPGHIAADC----------------TNDKACKNCRKTGHIARDCQN-- 132
            GG  R C +C + GH+A DC                +    C  C   GH ARDC+   
Sbjct: 156 GGGGGRPCYSCGEVGHLAKDCRGGSGGNRYGGGGGRGSGGDGCYMCGGVGHFARDCRQNG 215

Query: 133 -------EPVCNLCNIAGHVARQC----PKGDSLGERGGGGGGERGGGGGGDGGGGGGRY 181
                     C  C   GH+A+ C    P G   G R                       
Sbjct: 216 GGNVGGGGSTCYTCGGVGHIAKVCTSKIPSGGGGGGRA---------------------- 253

Query: 182 VGYHDVICRSCNQMGHMSRDC----------VGPLIICRNCGGRGHMAYECPS 224
                  C  C   GH++RDC           G    C  CG  GH A EC S
Sbjct: 254 -------CYECGGTGHLARDCDRRGSGSSGGGGGSNKCFICGKEGHFARECTS 299



 Score = 42.0 bits (97), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 43/102 (42%), Gaps = 29/102 (28%)

Query: 15  CNNCKRPGHFARECPN---------VAVCNNCGLPGHIASECTTQA---------RCWNC 56
           C  C   GHFAR+C            + C  CG  GHIA  CT++           C+ C
Sbjct: 198 CYMCGGVGHFARDCRQNGGGNVGGGGSTCYTCGGVGHIAKVCTSKIPSGGGGGGRACYEC 257

Query: 57  REPGHMASNCHNEGI-----------CHSCGKTGHRARDCST 87
              GH+A +C   G            C  CGK GH AR+C++
Sbjct: 258 GGTGHLARDCDRRGSGSSGGGGGSNKCFICGKEGHFARECTS 299


>sp|Q05313|GAG_FIVWO Gag polyprotein OS=Feline immunodeficiency virus (isolate Wo)
           GN=gag PE=3 SV=1
          Length = 450

 Score = 53.5 bits (127), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 24/44 (54%), Positives = 29/44 (65%), Gaps = 6/44 (13%)

Query: 12  GNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWN 55
           G +C NCKRPGH AR+C +V  CN CG PGH+A      A+CW 
Sbjct: 374 GPVCFNCKRPGHLARQCRDVKKCNKCGKPGHLA------AKCWQ 411



 Score = 43.5 bits (101), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)

Query: 2   LKNVLSFMSQGNLCNNCKRP-GHFAREC---PNVAVCNNCGLPGH---IASECTTQAR-- 52
           LK  LS     N   +CK+   H   E      +  C   G PG+   + +E  T+ +  
Sbjct: 312 LKQSLSI---ANANADCKKAMSHLKPESTLEEKLRACQEIGFPGYKMQLLAEALTKVQVV 368

Query: 53  --------CWNCREPGHMASNCHNEGICHSCGKTGHRARDC 85
                   C+NC+ PGH+A  C +   C+ CGK GH A  C
Sbjct: 369 QSKGPGPVCFNCKRPGHLARQCRDVKKCNKCGKPGHLAAKC 409



 Score = 41.2 bits (95), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 39/90 (43%), Gaps = 10/90 (11%)

Query: 41  GHIASECTTQARCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNN 100
            H+  E T + +   C+E G        + +  +  K           VQS G   +C N
Sbjct: 330 SHLKPESTLEEKLRACQEIGF--PGYKMQLLAEALTKV--------QVVQSKGPGPVCFN 379

Query: 101 CYKPGHIAADCTNDKACKNCRKTGHIARDC 130
           C +PGH+A  C + K C  C K GH+A  C
Sbjct: 380 CKRPGHLARQCRDVKKCNKCGKPGHLAAKC 409



 Score = 35.8 bits (81), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 16/37 (43%), Positives = 24/37 (64%)

Query: 116 ACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPKG 152
            C NC++ GH+AR C++   CN C   GH+A +C +G
Sbjct: 376 VCFNCKRPGHLARQCRDVKKCNKCGKPGHLAAKCWQG 412


>sp|P19027|GAG_FIVSD Gag polyprotein OS=Feline immunodeficiency virus (strain San Diego)
           GN=gag PE=3 SV=1
          Length = 450

 Score = 52.4 bits (124), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 23/44 (52%), Positives = 29/44 (65%), Gaps = 6/44 (13%)

Query: 12  GNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWN 55
           G +C NCK+PGH AR+C +V  CN CG PGH+A      A+CW 
Sbjct: 374 GPVCFNCKKPGHLARQCRDVKKCNKCGKPGHLA------AKCWQ 411



 Score = 42.7 bits (99), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 32/65 (49%), Gaps = 6/65 (9%)

Query: 72  CHSCGKTGHRARDCSTH------VQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGH 125
           C   G  G++ +  +        VQS G   +C NC KPGH+A  C + K C  C K GH
Sbjct: 345 CQEIGSPGYKMQLLAEALTKVQVVQSKGSGPVCFNCKKPGHLARQCRDVKKCNKCGKPGH 404

Query: 126 IARDC 130
           +A  C
Sbjct: 405 LAAKC 409



 Score = 42.4 bits (98), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 33/65 (50%), Gaps = 13/65 (20%)

Query: 34  CNNCGLPGH---IASECTTQAR----------CWNCREPGHMASNCHNEGICHSCGKTGH 80
           C   G PG+   + +E  T+ +          C+NC++PGH+A  C +   C+ CGK GH
Sbjct: 345 CQEIGSPGYKMQLLAEALTKVQVVQSKGSGPVCFNCKKPGHLARQCRDVKKCNKCGKPGH 404

Query: 81  RARDC 85
            A  C
Sbjct: 405 LAAKC 409



 Score = 38.1 bits (87), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 37/82 (45%), Gaps = 3/82 (3%)

Query: 80  HRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNLC 139
            + R C      G  ++L         +     +   C NC+K GH+AR C++   CN C
Sbjct: 340 EKLRACQEIGSPGYKMQLLAEALTKVQVVQSKGSGPVCFNCKKPGHLARQCRDVKKCNKC 399

Query: 140 NIAGHVARQCPKGDSLGERGGG 161
              GH+A +C +G   G+R  G
Sbjct: 400 GKPGHLAAKCWQG---GKRNSG 418


>sp|P16087|GAG_FIVPE Gag polyprotein OS=Feline immunodeficiency virus (isolate Petaluma)
           GN=gag PE=3 SV=1
          Length = 450

 Score = 51.6 bits (122), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 6/44 (13%)

Query: 12  GNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWN 55
           G +C NCK+PGH AR+C  V  CN CG PGH+A      A+CW 
Sbjct: 374 GPVCFNCKKPGHLARQCREVKKCNKCGKPGHLA------AKCWQ 411



 Score = 42.4 bits (98), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 31/65 (47%), Gaps = 6/65 (9%)

Query: 72  CHSCGKTGHRARDCSTH------VQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGH 125
           C   G  G++ +  +        VQS G   +C NC KPGH+A  C   K C  C K GH
Sbjct: 345 CQEIGSPGYKMQLLAEALTKVQVVQSKGSGPVCFNCKKPGHLARQCREVKKCNKCGKPGH 404

Query: 126 IARDC 130
           +A  C
Sbjct: 405 LAAKC 409



 Score = 42.0 bits (97), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)

Query: 2   LKNVLSFMSQGNLCNNCKRP-GHFAREC---PNVAVCNNCGLPGH---IASECTTQAR-- 52
           LK  LS     N   +CK+   H   E      +  C   G PG+   + +E  T+ +  
Sbjct: 312 LKQSLSI---ANANADCKKAMSHLKPESTLEEKLRACQEIGSPGYKMQLLAEALTKVQVV 368

Query: 53  --------CWNCREPGHMASNCHNEGICHSCGKTGHRARDC 85
                   C+NC++PGH+A  C     C+ CGK GH A  C
Sbjct: 369 QSKGSGPVCFNCKKPGHLARQCREVKKCNKCGKPGHLAAKC 409



 Score = 37.7 bits (86), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 21/74 (28%), Positives = 33/74 (44%)

Query: 80  HRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNLC 139
            + R C      G  ++L         +     +   C NC+K GH+AR C+    CN C
Sbjct: 340 EKLRACQEIGSPGYKMQLLAEALTKVQVVQSKGSGPVCFNCKKPGHLARQCREVKKCNKC 399

Query: 140 NIAGHVARQCPKGD 153
              GH+A +C +G+
Sbjct: 400 GKPGHLAAKCWQGN 413


>sp|P03354|POL_RSVP Gag-Pro-Pol polyprotein OS=Rous sarcoma virus (strain Prague C)
           GN=gag-pro-pol PE=1 SV=2
          Length = 1603

 Score = 50.8 bits (120), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 40/96 (41%), Gaps = 22/96 (22%)

Query: 80  HRARDCSTHVQSGGDLR-LCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNL 138
           +R RD  T   SGG  R LC  C  PGH  A C   +   N R+             C L
Sbjct: 492 NRERDGQT--GSGGRARGLCYTCGSPGHYQAQCPKKRKSGNSRER------------CQL 537

Query: 139 CNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDG 174
           CN  GH A+QC K D       G  G+R G G   G
Sbjct: 538 CNGMGHNAKQCRKRD-------GNQGQRPGKGLSSG 566



 Score = 40.0 bits (92), Expect = 0.014,   Method: Composition-based stats.
 Identities = 18/57 (31%), Positives = 22/57 (38%)

Query: 55  NCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC 111
           N    G   S     G+C++CG  GH    C    +SG     C  C   GH A  C
Sbjct: 492 NRERDGQTGSGGRARGLCYTCGSPGHYQAQCPKKRKSGNSRERCQLCNGMGHNAKQC 548



 Score = 37.0 bits (84), Expect = 0.11,   Method: Composition-based stats.
 Identities = 16/54 (29%), Positives = 22/54 (40%), Gaps = 12/54 (22%)

Query: 32  AVCNNCGLPGHIASECTTQARCWNCREPGHMASNCHNEGICHSCGKTGHRARDC 85
            +C  CG PGH  ++C  + +  N RE             C  C   GH A+ C
Sbjct: 507 GLCYTCGSPGHYQAQCPKKRKSGNSRER------------CQLCNGMGHNAKQC 548



 Score = 33.9 bits (76), Expect = 0.97,   Method: Composition-based stats.
 Identities = 16/53 (30%), Positives = 20/53 (37%), Gaps = 12/53 (22%)

Query: 14  LCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWNCREPGHMASNC 66
           LC  C  PGH+  +CP      N            ++ RC  C   GH A  C
Sbjct: 508 LCYTCGSPGHYQAQCPKKRKSGN------------SRERCQLCNGMGHNAKQC 548



 Score = 33.5 bits (75), Expect = 1.4,   Method: Composition-based stats.
 Identities = 22/87 (25%), Positives = 29/87 (33%), Gaps = 27/87 (31%)

Query: 116 ACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDGG 175
           A  N  + G      +   +C  C   GH   QCPK     +R  G   ER         
Sbjct: 489 AVVNRERDGQTGSGGRARGLCYTCGSPGHYQAQCPK-----KRKSGNSRER--------- 534

Query: 176 GGGGRYVGYHDVICRSCNQMGHMSRDC 202
                        C+ CN MGH ++ C
Sbjct: 535 -------------CQLCNGMGHNAKQC 548


>sp|Q1A267|POL_SIVMB Gag-Pol polyprotein OS=Simian immunodeficiency virus (isolate MB66)
           GN=gag-pol PE=3 SV=4
          Length = 1438

 Score = 50.1 bits (118), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 25/63 (39%), Positives = 29/63 (46%), Gaps = 5/63 (7%)

Query: 9   MSQGNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQAR--CWNCREPGHMASNC 66
           MSQ    N+ KR     +    +  C NCG  GHIA  C    R  CW C + GH   NC
Sbjct: 365 MSQAQHSNDAKRQFKGPKR---IVKCFNCGKEGHIARNCKAPRRKGCWKCGQEGHQMRNC 421

Query: 67  HNE 69
            NE
Sbjct: 422 TNE 424



 Score = 41.6 bits (96), Expect = 0.005,   Method: Composition-based stats.
 Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 5/52 (9%)

Query: 72  CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKT 123
           C +CGK GH AR+C    + G     C  C + GH   +CTN++     R+T
Sbjct: 387 CFNCGKEGHIARNCKAPRRKG-----CWKCGQEGHQMRNCTNERQANFFRET 433



 Score = 41.6 bits (96), Expect = 0.005,   Method: Composition-based stats.
 Identities = 19/43 (44%), Positives = 22/43 (51%), Gaps = 2/43 (4%)

Query: 98  CNNCYKPGHIAADCT--NDKACKNCRKTGHIARDCQNEPVCNL 138
           C NC K GHIA +C     K C  C + GH  R+C NE   N 
Sbjct: 387 CFNCGKEGHIARNCKAPRRKGCWKCGQEGHQMRNCTNERQANF 429



 Score = 37.4 bits (85), Expect = 0.091,   Method: Composition-based stats.
 Identities = 18/46 (39%), Positives = 21/46 (45%), Gaps = 2/46 (4%)

Query: 15  CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQARCWNCRE 58
           C NC + GH AR C  P    C  CG  GH    CT + +    RE
Sbjct: 387 CFNCGKEGHIARNCKAPRRKGCWKCGQEGHQMRNCTNERQANFFRE 432



 Score = 35.4 bits (80), Expect = 0.31,   Method: Composition-based stats.
 Identities = 16/44 (36%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 187 VICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYECPSGRIAD 229
           V C +C + GH++R+C  P    C  CG  GH    C + R A+
Sbjct: 385 VKCFNCGKEGHIARNCKAPRRKGCWKCGQEGHQMRNCTNERQAN 428



 Score = 35.4 bits (80), Expect = 0.37,   Method: Composition-based stats.
 Identities = 16/35 (45%), Positives = 18/35 (51%), Gaps = 2/35 (5%)

Query: 117 CKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
           C NC K GHIAR+C+      C  C   GH  R C
Sbjct: 387 CFNCGKEGHIARNCKAPRRKGCWKCGQEGHQMRNC 421


>sp|Q82851|POL_JEMBR Gag-Pol polyprotein OS=Jembrana disease virus GN=gag-pol PE=3 SV=1
          Length = 1432

 Score = 50.1 bits (118), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 20/48 (41%), Positives = 31/48 (64%), Gaps = 1/48 (2%)

Query: 86  STHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNE 133
           + +V+  G+++ C  C KPGHI  DC N K C  C K GH+ R+C+++
Sbjct: 358 AINVKGDGEVQRCYGCGKPGHIRRDCKNQK-CFKCGKPGHLQRNCKSK 404



 Score = 48.1 bits (113), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 22/67 (32%), Positives = 33/67 (49%), Gaps = 8/67 (11%)

Query: 3   KNVLSFMSQGNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWNCREPGHM 62
           K  + F+++     N K  G   R       C  CG PGHI  +C  Q +C+ C +PGH+
Sbjct: 346 KQKMQFLAEAFAAINVKGDGEVQR-------CYGCGKPGHIRRDCKNQ-KCFKCGKPGHL 397

Query: 63  ASNCHNE 69
             NC ++
Sbjct: 398 QRNCKSK 404



 Score = 45.8 bits (107), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 16/36 (44%), Positives = 23/36 (63%), Gaps = 1/36 (2%)

Query: 52  RCWNCREPGHMASNCHNEGICHSCGKTGHRARDCST 87
           RC+ C +PGH+  +C N+  C  CGK GH  R+C +
Sbjct: 369 RCYGCGKPGHIRRDCKNQK-CFKCGKPGHLQRNCKS 403



 Score = 43.5 bits (101), Expect = 0.001,   Method: Composition-based stats.
 Identities = 16/38 (42%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 15  CNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQAR 52
           C  C +PGH  R+C N   C  CG PGH+   C ++ R
Sbjct: 370 CYGCGKPGHIRRDCKNQK-CFKCGKPGHLQRNCKSKNR 406



 Score = 33.5 bits (75), Expect = 1.4,   Method: Composition-based stats.
 Identities = 14/36 (38%), Positives = 17/36 (47%), Gaps = 2/36 (5%)

Query: 189 CRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPS 224
           C  C + GH+ RDC      C  CG  GH+   C S
Sbjct: 370 CYGCGKPGHIRRDCKNQK--CFKCGKPGHLQRNCKS 403


>sp|P31821|GAG_FIVT2 Gag polyprotein OS=Feline immunodeficiency virus (isolate TM2)
           GN=gag PE=3 SV=1
          Length = 449

 Score = 49.7 bits (117), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 20/34 (58%), Positives = 25/34 (73%)

Query: 14  LCNNCKRPGHFARECPNVAVCNNCGLPGHIASEC 47
           +C NCK+PGH AR+C     CNNCG PGH+A+ C
Sbjct: 376 VCFNCKKPGHLARQCKEAKRCNNCGKPGHLAANC 409



 Score = 45.8 bits (107), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 42/101 (41%), Gaps = 14/101 (13%)

Query: 72  CHSCGKTGHRARDCST------HVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGH 125
           C   G  G++ +  +        VQ+ G   +C NC KPGH+A  C   K C NC K GH
Sbjct: 345 CQEVGSPGYKMQLLAEALTRVQTVQTKGPRLVCFNCKKPGHLARQCKEAKRCNNCGKPGH 404

Query: 126 IARDCQ--------NEPVCNLCNIAGHVARQCPKGDSLGER 158
           +A +C         NE V         V +  P    + E+
Sbjct: 405 LAANCWQGGRKTSGNEKVGRAAAPVNQVQQIVPSAPPMEEK 445



 Score = 43.9 bits (102), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/58 (43%), Positives = 31/58 (53%)

Query: 33  VCNNCGLPGHIASECTTQARCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQ 90
           VC NC  PGH+A +C    RC NC +PGH+A+NC   G   S  +   RA      VQ
Sbjct: 376 VCFNCKKPGHLARQCKEAKRCNNCGKPGHLAANCWQGGRKTSGNEKVGRAAAPVNQVQ 433



 Score = 43.1 bits (100), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 46/101 (45%), Gaps = 20/101 (19%)

Query: 2   LKNVLSFMSQGNLCNNCKRP-GHFAREC---PNVAVCNNCGLPGH---IASECTTQAR-- 52
           LK  LS     N   +CKR   H   E      +  C   G PG+   + +E  T+ +  
Sbjct: 312 LKQSLSI---ANANPDCKRAMSHLKPESTLEEKLRACQEVGSPGYKMQLLAEALTRVQTV 368

Query: 53  --------CWNCREPGHMASNCHNEGICHSCGKTGHRARDC 85
                   C+NC++PGH+A  C     C++CGK GH A +C
Sbjct: 369 QTKGPRLVCFNCKKPGHLARQCKEAKRCNNCGKPGHLAANC 409


>sp|O92956|POL_RSVSB Gag-Pro-Pol polyprotein OS=Rous sarcoma virus (strain
           Schmidt-Ruppin B) GN=gag-pro-pol PE=1 SV=2
          Length = 1603

 Score = 48.5 bits (114), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 32/85 (37%), Positives = 39/85 (45%), Gaps = 16/85 (18%)

Query: 80  HRARDCSTHVQSGGDLR-LCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNL 138
           +R RD  T   SGG  R LC  C  PGH  A C   +   N R+             C L
Sbjct: 492 NRERDGQT--GSGGRARRLCYTCGSPGHYQAQCPKKRKSGNSRER------------CQL 537

Query: 139 CNIAGHVARQCPKGDS-LGERGGGG 162
           C+  GH A+QC + DS  G+R G G
Sbjct: 538 CDGMGHNAKQCRRRDSNQGQRPGRG 562



 Score = 36.6 bits (83), Expect = 0.14,   Method: Composition-based stats.
 Identities = 16/53 (30%), Positives = 22/53 (41%), Gaps = 12/53 (22%)

Query: 33  VCNNCGLPGHIASECTTQARCWNCREPGHMASNCHNEGICHSCGKTGHRARDC 85
           +C  CG PGH  ++C  + +  N RE             C  C   GH A+ C
Sbjct: 508 LCYTCGSPGHYQAQCPKKRKSGNSRER------------CQLCDGMGHNAKQC 548



 Score = 36.2 bits (82), Expect = 0.19,   Method: Composition-based stats.
 Identities = 17/57 (29%), Positives = 21/57 (36%)

Query: 55  NCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC 111
           N    G   S      +C++CG  GH    C    +SG     C  C   GH A  C
Sbjct: 492 NRERDGQTGSGGRARRLCYTCGSPGHYQAQCPKKRKSGNSRERCQLCDGMGHNAKQC 548



 Score = 33.5 bits (75), Expect = 1.4,   Method: Composition-based stats.
 Identities = 16/53 (30%), Positives = 20/53 (37%), Gaps = 12/53 (22%)

Query: 14  LCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWNCREPGHMASNC 66
           LC  C  PGH+  +CP      N            ++ RC  C   GH A  C
Sbjct: 508 LCYTCGSPGHYQAQCPKKRKSGN------------SRERCQLCDGMGHNAKQC 548


>sp|Q79666|POL_HV1MV Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group O
           (isolate MVP5180) GN=gag-pol PE=3 SV=3
          Length = 1446

 Score = 48.5 bits (114), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 33/92 (35%), Positives = 41/92 (44%), Gaps = 9/92 (9%)

Query: 98  CNNCYKPGHIAADC--TNDKACKNCRKTGHIARDCQNEPVCN----LCNIAGHVARQ-CP 150
           C NC K GHIA +C     + C  C + GH  +DC+N    N    +    GH ARQ C 
Sbjct: 396 CFNCGKEGHIAKNCRAPRKRGCWKCGQEGHQMKDCKNGRQANFFREVLASGGHEARQLCA 455

Query: 151 KGDSLGERGGGGGGERGGGGGGDGGGGGGRYV 182
           +         GGG E  G    +  GG GR V
Sbjct: 456 ETSVPISPTNGGGSE--GTRESESEGGSGRAV 485



 Score = 42.4 bits (98), Expect = 0.003,   Method: Composition-based stats.
 Identities = 16/37 (43%), Positives = 19/37 (51%), Gaps = 2/37 (5%)

Query: 34  CNNCGLPGHIASECTTQAR--CWNCREPGHMASNCHN 68
           C NCG  GHIA  C    +  CW C + GH   +C N
Sbjct: 396 CFNCGKEGHIAKNCRAPRKRGCWKCGQEGHQMKDCKN 432



 Score = 42.0 bits (97), Expect = 0.003,   Method: Composition-based stats.
 Identities = 16/42 (38%), Positives = 25/42 (59%), Gaps = 2/42 (4%)

Query: 52  RCWNCREPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQS 91
           +C+NC + GH+A NC    +  C  CG+ GH+ +DC    Q+
Sbjct: 395 KCFNCGKEGHIAKNCRAPRKRGCWKCGQEGHQMKDCKNGRQA 436



 Score = 37.4 bits (85), Expect = 0.079,   Method: Composition-based stats.
 Identities = 15/42 (35%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 189 CRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYECPSGRIAD 229
           C +C + GH++++C  P    C  CG  GH   +C +GR A+
Sbjct: 396 CFNCGKEGHIAKNCRAPRKRGCWKCGQEGHQMKDCKNGRQAN 437



 Score = 33.9 bits (76), Expect = 1.0,   Method: Composition-based stats.
 Identities = 14/35 (40%), Positives = 17/35 (48%), Gaps = 2/35 (5%)

Query: 15  CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASEC 47
           C NC + GH A+ C  P    C  CG  GH   +C
Sbjct: 396 CFNCGKEGHIAKNCRAPRKRGCWKCGQEGHQMKDC 430


>sp|Q04095|POL_RSVSA Gag-Pro-Pol polyprotein OS=Avian leukosis virus RSA GN=gag-pro-pol
           PE=3 SV=2
          Length = 1603

 Score = 47.8 bits (112), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 33/96 (34%), Positives = 40/96 (41%), Gaps = 22/96 (22%)

Query: 80  HRARDCSTHVQSGGDLR-LCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNL 138
           +R RD  T   SGG  R LC  C  PGH  A C   +   N R+             C L
Sbjct: 492 NRERDGQT--GSGGRARGLCYTCGSPGHYQAQCPKKRKSGNSRER------------CQL 537

Query: 139 CNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDG 174
           C+  GH A+QC + D       G  G+R G G   G
Sbjct: 538 CDGMGHNAKQCRRRD-------GNQGQRPGKGLSSG 566



 Score = 39.7 bits (91), Expect = 0.019,   Method: Composition-based stats.
 Identities = 18/57 (31%), Positives = 22/57 (38%)

Query: 55  NCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC 111
           N    G   S     G+C++CG  GH    C    +SG     C  C   GH A  C
Sbjct: 492 NRERDGQTGSGGRARGLCYTCGSPGHYQAQCPKKRKSGNSRERCQLCDGMGHNAKQC 548



 Score = 36.6 bits (83), Expect = 0.14,   Method: Composition-based stats.
 Identities = 16/54 (29%), Positives = 22/54 (40%), Gaps = 12/54 (22%)

Query: 32  AVCNNCGLPGHIASECTTQARCWNCREPGHMASNCHNEGICHSCGKTGHRARDC 85
            +C  CG PGH  ++C  + +  N RE             C  C   GH A+ C
Sbjct: 507 GLCYTCGSPGHYQAQCPKKRKSGNSRER------------CQLCDGMGHNAKQC 548



 Score = 33.5 bits (75), Expect = 1.4,   Method: Composition-based stats.
 Identities = 16/53 (30%), Positives = 20/53 (37%), Gaps = 12/53 (22%)

Query: 14  LCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWNCREPGHMASNC 66
           LC  C  PGH+  +CP      N            ++ RC  C   GH A  C
Sbjct: 508 LCYTCGSPGHYQAQCPKKRKSGN------------SRERCQLCDGMGHNAKQC 548


>sp|Q9P795|AIR1_SCHPO Protein air1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
           GN=air1 PE=1 SV=2
          Length = 315

 Score = 47.8 bits (112), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 56/121 (46%), Gaps = 19/121 (15%)

Query: 10  SQGNLCNNCKRPGHFARECPNVAVCNNCG-LPGHIASECTTQARCWNCREPGHMASNCHN 68
           S+  +C+NCK  GH +++CP+V +C  CG +  HI+  C    +C NC   GH+A+ C  
Sbjct: 86  SESIVCHNCKGNGHISKDCPHV-LCTTCGAIDDHISVRCPWTKKCMNCGLLGHIAARCSE 144

Query: 69  -----EGICHSCGKTGHRARDCST----HVQSGGDLRL--------CNNCYKPGHIAADC 111
                  +C +C    H +  C      +V+    +R+        C NC    H   DC
Sbjct: 145 PRKRGPRVCRTCHTDTHTSSTCPLIWRYYVEKEHPVRIDVSEVRKFCYNCASDEHFGDDC 204

Query: 112 T 112
           T
Sbjct: 205 T 205



 Score = 39.7 bits (91), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 33/97 (34%), Positives = 43/97 (44%), Gaps = 16/97 (16%)

Query: 61  HMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNC-YKPGHIAADCTNDKACKN 119
           +  S+     +CH+C   GH ++DC  HV       LC  C     HI+  C   K C N
Sbjct: 80  YFGSDPSESIVCHNCKGNGHISKDC-PHV-------LCTTCGAIDDHISVRCPWTKKCMN 131

Query: 120 CRKTGHIARDCQNEP------VCNLCNIAGHVARQCP 150
           C   GHIA  C +EP      VC  C+   H +  CP
Sbjct: 132 CGLLGHIAARC-SEPRKRGPRVCRTCHTDTHTSSTCP 167



 Score = 35.0 bits (79), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 24/96 (25%), Positives = 41/96 (42%), Gaps = 8/96 (8%)

Query: 132 NEPVCNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDGGGGGGRYVGY---HDVI 188
           ++ + N  ++  H  +     DS  +       +  G    D   G  RY G      ++
Sbjct: 33  DQSITNSVSLEKHDFQGSDDHDSSTDLSDSTLEDVEGSEWADVSRG--RYFGSDPSESIV 90

Query: 189 CRSCNQMGHMSRDCVGPLIICRNCGG-RGHMAYECP 223
           C +C   GH+S+DC  P ++C  CG    H++  CP
Sbjct: 91  CHNCKGNGHISKDC--PHVLCTTCGAIDDHISVRCP 124



 Score = 32.7 bits (73), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 40/162 (24%), Positives = 56/162 (34%), Gaps = 23/162 (14%)

Query: 11  QGNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQARCWNC-REPGHMASNCHNE 69
           +G+   +  R  +F  +     VC+NC   GHI+ +C     C  C     H++  C   
Sbjct: 68  EGSEWADVSRGRYFGSDPSESIVCHNCKGNGHISKDC-PHVLCTTCGAIDDHISVRCPWT 126

Query: 70  GICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC------------------ 111
             C +CG  GH A  CS   + G   R+C  C+   H ++ C                  
Sbjct: 127 KKCMNCGLLGHIAARCSEPRKRGP--RVCRTCHTDTHTSSTCPLIWRYYVEKEHPVRIDV 184

Query: 112 -TNDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPKG 152
               K C NC    H   DC      N           CP G
Sbjct: 185 SEVRKFCYNCASDEHFGDDCTLPSRSNYPESTAFCEANCPSG 226


>sp|O91080|POL_HV1YF Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group N
           (isolate YBF30) GN=gag-pol PE=3 SV=3
          Length = 1449

 Score = 47.0 bits (110), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 18/39 (46%), Positives = 21/39 (53%), Gaps = 2/39 (5%)

Query: 34  CNNCGLPGHIASECTTQAR--CWNCREPGHMASNCHNEG 70
           C NCG  GH+A  C    R  CW C + GH   +C NEG
Sbjct: 395 CFNCGKEGHLARNCKAPRRGGCWKCGQEGHQMKDCKNEG 433



 Score = 44.3 bits (103), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 16/36 (44%), Positives = 23/36 (63%), Gaps = 2/36 (5%)

Query: 52  RCWNCREPGHMASNCH--NEGICHSCGKTGHRARDC 85
           +C+NC + GH+A NC     G C  CG+ GH+ +DC
Sbjct: 394 KCFNCGKEGHLARNCKAPRRGGCWKCGQEGHQMKDC 429



 Score = 39.7 bits (91), Expect = 0.017,   Method: Composition-based stats.
 Identities = 16/38 (42%), Positives = 22/38 (57%), Gaps = 2/38 (5%)

Query: 98  CNNCYKPGHIAADCTNDK--ACKNCRKTGHIARDCQNE 133
           C NC K GH+A +C   +   C  C + GH  +DC+NE
Sbjct: 395 CFNCGKEGHLARNCKAPRRGGCWKCGQEGHQMKDCKNE 432



 Score = 39.3 bits (90), Expect = 0.024,   Method: Composition-based stats.
 Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 5/43 (11%)

Query: 72  CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTND 114
           C +CGK GH AR+C    + G     C  C + GH   DC N+
Sbjct: 395 CFNCGKEGHLARNCKAPRRGG-----CWKCGQEGHQMKDCKNE 432



 Score = 33.5 bits (75), Expect = 1.3,   Method: Composition-based stats.
 Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 1/35 (2%)

Query: 189 CRSCNQMGHMSRDCVGPLI-ICRNCGGRGHMAYEC 222
           C +C + GH++R+C  P    C  CG  GH   +C
Sbjct: 395 CFNCGKEGHLARNCKAPRRGGCWKCGQEGHQMKDC 429



 Score = 32.3 bits (72), Expect = 3.1,   Method: Composition-based stats.
 Identities = 14/35 (40%), Positives = 18/35 (51%), Gaps = 2/35 (5%)

Query: 117 CKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
           C NC K GH+AR+C+      C  C   GH  + C
Sbjct: 395 CFNCGKEGHLARNCKAPRRGGCWKCGQEGHQMKDC 429


>sp|Q9IDV9|POL_HV1YB Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group N
           (isolate YBF106) GN=gag-pol PE=3 SV=3
          Length = 1449

 Score = 47.0 bits (110), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 18/39 (46%), Positives = 21/39 (53%), Gaps = 2/39 (5%)

Query: 34  CNNCGLPGHIASECTTQAR--CWNCREPGHMASNCHNEG 70
           C NCG  GH+A  C    R  CW C + GH   +C NEG
Sbjct: 394 CFNCGKEGHLARNCKAPRRRGCWKCGQEGHQMKDCKNEG 432



 Score = 42.4 bits (98), Expect = 0.003,   Method: Composition-based stats.
 Identities = 20/51 (39%), Positives = 25/51 (49%), Gaps = 5/51 (9%)

Query: 72  CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRK 122
           C +CGK GH AR+C    + G     C  C + GH   DC N+    N RK
Sbjct: 394 CFNCGKEGHLARNCKAPRRRG-----CWKCGQEGHQMKDCKNEGXQANFRK 439



 Score = 41.2 bits (95), Expect = 0.006,   Method: Composition-based stats.
 Identities = 33/119 (27%), Positives = 49/119 (41%), Gaps = 39/119 (32%)

Query: 3   KNVLSFMSQGNLCNN----CKR------PGHFARECPNVAVCNNCGLPGHIA---SECTT 49
           ++V ++M++  L  N    CK+      PG    E   +  C   G P H A   +E   
Sbjct: 313 QDVKNWMTETLLVQNANPDCKQILKALGPGATLEEM--MTACQGVGGPAHKARVLAEAMA 370

Query: 50  QA--------------------RCWNCREPGHMASNC---HNEGICHSCGKTGHRARDC 85
           QA                    +C+NC + GH+A NC      G C  CG+ GH+ +DC
Sbjct: 371 QAQTATSVFVQRGNFKGIRKTIKCFNCGKEGHLARNCKAPRRRG-CWKCGQEGHQMKDC 428



 Score = 41.2 bits (95), Expect = 0.007,   Method: Composition-based stats.
 Identities = 25/80 (31%), Positives = 33/80 (41%), Gaps = 18/80 (22%)

Query: 72  CHSCGKTGHRAR-----------DCSTHVQSGGDLRL-----CNNCYKPGHIAADCT--N 113
           C   G   H+AR             S  VQ G    +     C NC K GH+A +C    
Sbjct: 352 CQGVGGPAHKARVLAEAMAQAQTATSVFVQRGNFKGIRKTIKCFNCGKEGHLARNCKAPR 411

Query: 114 DKACKNCRKTGHIARDCQNE 133
            + C  C + GH  +DC+NE
Sbjct: 412 RRGCWKCGQEGHQMKDCKNE 431



 Score = 34.7 bits (78), Expect = 0.58,   Method: Composition-based stats.
 Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 2/46 (4%)

Query: 179 GRYVGYHDVI-CRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
           G + G    I C +C + GH++R+C  P    C  CG  GH   +C
Sbjct: 383 GNFKGIRKTIKCFNCGKEGHLARNCKAPRRRGCWKCGQEGHQMKDC 428



 Score = 32.3 bits (72), Expect = 3.1,   Method: Composition-based stats.
 Identities = 14/35 (40%), Positives = 18/35 (51%), Gaps = 2/35 (5%)

Query: 117 CKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
           C NC K GH+AR+C+      C  C   GH  + C
Sbjct: 394 CFNCGKEGHLARNCKAPRRRGCWKCGQEGHQMKDC 428


>sp|Q9QBZ9|POL_HV197 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype K (isolate 97ZR-EQTB11) GN=gag-pol PE=3 SV=2
          Length = 1429

 Score = 46.6 bits (109), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/80 (32%), Positives = 35/80 (43%), Gaps = 24/80 (30%)

Query: 31  VAVCNNCGLPGHIA-------SECTTQA---------------RCWNCREPGHMASNCH- 67
           +  C   G PGH A       S+ T  A               +C+NC + GH+A NC  
Sbjct: 343 MTACQGVGGPGHKARVLAEAMSQVTNSAVMMQRGNFKGQRRIIKCFNCGKEGHLARNCRA 402

Query: 68  -NEGICHSCGKTGHRARDCS 86
             +  C  CGK GH+ +DCS
Sbjct: 403 PRKKGCWKCGKEGHQMKDCS 422



 Score = 39.7 bits (91), Expect = 0.017,   Method: Composition-based stats.
 Identities = 18/45 (40%), Positives = 23/45 (51%), Gaps = 5/45 (11%)

Query: 72  CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKA 116
           C +CGK GH AR+C    + G     C  C K GH   DC+  +A
Sbjct: 387 CFNCGKEGHLARNCRAPRKKG-----CWKCGKEGHQMKDCSERQA 426



 Score = 37.0 bits (84), Expect = 0.12,   Method: Composition-based stats.
 Identities = 19/59 (32%), Positives = 25/59 (42%), Gaps = 10/59 (16%)

Query: 2   LKNVLSFMSQGNL--------CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQ 50
           + N    M +GN         C NC + GH AR C  P    C  CG  GH   +C+ +
Sbjct: 366 VTNSAVMMQRGNFKGQRRIIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDCSER 424



 Score = 35.4 bits (80), Expect = 0.30,   Method: Composition-based stats.
 Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 2/46 (4%)

Query: 179 GRYVGYHDVI-CRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
           G + G   +I C +C + GH++R+C  P    C  CG  GH   +C
Sbjct: 376 GNFKGQRRIIKCFNCGKEGHLARNCRAPRKKGCWKCGKEGHQMKDC 421



 Score = 33.5 bits (75), Expect = 1.5,   Method: Composition-based stats.
 Identities = 23/83 (27%), Positives = 33/83 (39%), Gaps = 12/83 (14%)

Query: 72  CHSCGKTGHRAR---DCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIAR 128
           C   G  GH+AR   +  + V +   +    N      I         C NC K GH+AR
Sbjct: 346 CQGVGGPGHKARVLAEAMSQVTNSAVMMQRGNFKGQRRIIK-------CFNCGKEGHLAR 398

Query: 129 DCQ--NEPVCNLCNIAGHVARQC 149
           +C+   +  C  C   GH  + C
Sbjct: 399 NCRAPRKKGCWKCGKEGHQMKDC 421


>sp|P05959|POL_HV1RH Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype B (isolate RF/HAT3) GN=gag-pol PE=1 SV=3
          Length = 1436

 Score = 46.6 bits (109), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 21/51 (41%), Positives = 27/51 (52%), Gaps = 3/51 (5%)

Query: 22  GHFARECPNVAVCNNCGLPGHIASECTTQAR--CWNCREPGHMASNCHNEG 70
           G+F R+   +  C NCG  GHIA  C    +  CW C + GH   +C NEG
Sbjct: 381 GNF-RDQRKIVKCFNCGKVGHIAKNCRAPRKKGCWKCGKEGHQMKDCTNEG 430



 Score = 44.7 bits (104), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 27/80 (33%), Positives = 34/80 (42%), Gaps = 18/80 (22%)

Query: 72  CHSCGKTGHRAR----------DCSTHVQSGGDLR------LCNNCYKPGHIAADC--TN 113
           C   G   H+AR          + +T +   G+ R       C NC K GHIA +C    
Sbjct: 350 CQGVGGPSHKARILAEAMSQVTNSATIMLQKGNFRDQRKIVKCFNCGKVGHIAKNCRAPR 409

Query: 114 DKACKNCRKTGHIARDCQNE 133
            K C  C K GH  +DC NE
Sbjct: 410 KKGCWKCGKEGHQMKDCTNE 429



 Score = 44.7 bits (104), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 2/47 (4%)

Query: 52  RCWNCREPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGGDLR 96
           +C+NC + GH+A NC    +  C  CGK GH+ +DC+   +    LR
Sbjct: 391 KCFNCGKVGHIAKNCRAPRKKGCWKCGKEGHQMKDCTNEGRQANFLR 437



 Score = 38.5 bits (88), Expect = 0.036,   Method: Composition-based stats.
 Identities = 17/43 (39%), Positives = 21/43 (48%), Gaps = 2/43 (4%)

Query: 15  CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQARCWN 55
           C NC + GH A+ C  P    C  CG  GH   +CT + R  N
Sbjct: 392 CFNCGKVGHIAKNCRAPRKKGCWKCGKEGHQMKDCTNEGRQAN 434



 Score = 34.7 bits (78), Expect = 0.56,   Method: Composition-based stats.
 Identities = 16/45 (35%), Positives = 26/45 (57%), Gaps = 2/45 (4%)

Query: 187 VICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYECPS-GRIAD 229
           V C +C ++GH++++C  P    C  CG  GH   +C + GR A+
Sbjct: 390 VKCFNCGKVGHIAKNCRAPRKKGCWKCGKEGHQMKDCTNEGRQAN 434


>sp|P19560|POL_BIV29 Gag-Pol polyprotein OS=Bovine immunodeficiency virus (strain R29)
           GN=gag-pol PE=1 SV=2
          Length = 1475

 Score = 46.2 bits (108), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 16/36 (44%), Positives = 24/36 (66%), Gaps = 1/36 (2%)

Query: 52  RCWNCREPGHMASNCHNEGICHSCGKTGHRARDCST 87
           RC+ C + GH+  NC  +  C+ CGK GH+AR+C +
Sbjct: 404 RCYGCGKTGHLKRNCKQQK-CYHCGKPGHQARNCRS 438



 Score = 42.4 bits (98), Expect = 0.003,   Method: Composition-based stats.
 Identities = 15/36 (41%), Positives = 22/36 (61%), Gaps = 1/36 (2%)

Query: 34  CNNCGLPGHIASECTTQARCWNCREPGHMASNCHNE 69
           C  CG  GH+   C  Q +C++C +PGH A NC ++
Sbjct: 405 CYGCGKTGHLKRNCKQQ-KCYHCGKPGHQARNCRSK 439



 Score = 40.8 bits (94), Expect = 0.008,   Method: Composition-based stats.
 Identities = 17/40 (42%), Positives = 22/40 (55%), Gaps = 8/40 (20%)

Query: 72  CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADC 111
           C+ CGKTGH  R+C          + C +C KPGH A +C
Sbjct: 405 CYGCGKTGHLKRNCKQ--------QKCYHCGKPGHQARNC 436



 Score = 40.0 bits (92), Expect = 0.014,   Method: Composition-based stats.
 Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 3/46 (6%)

Query: 90  QSGG--DLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNE 133
           Q+ G  D R C  C K GH+  +C   K C +C K GH AR+C+++
Sbjct: 395 QTSGPEDGRRCYGCGKTGHLKRNCKQQK-CYHCGKPGHQARNCRSK 439



 Score = 39.3 bits (90), Expect = 0.026,   Method: Composition-based stats.
 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 1/41 (2%)

Query: 12  GNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASECTTQAR 52
           G  C  C + GH  R C     C +CG PGH A  C ++ R
Sbjct: 402 GRRCYGCGKTGHLKRNCKQQK-CYHCGKPGHQARNCRSKNR 441



 Score = 38.1 bits (87), Expect = 0.050,   Method: Composition-based stats.
 Identities = 15/35 (42%), Positives = 20/35 (57%), Gaps = 1/35 (2%)

Query: 115 KACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQC 149
           + C  C KTGH+ R+C+ +  C  C   GH AR C
Sbjct: 403 RRCYGCGKTGHLKRNCKQQK-CYHCGKPGHQARNC 436



 Score = 33.5 bits (75), Expect = 1.3,   Method: Composition-based stats.
 Identities = 14/36 (38%), Positives = 18/36 (50%), Gaps = 2/36 (5%)

Query: 189 CRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPS 224
           C  C + GH+ R+C      C +CG  GH A  C S
Sbjct: 405 CYGCGKTGHLKRNCKQQK--CYHCGKPGHQARNCRS 438


>sp|P23425|GAG_VILV2 Gag polyprotein OS=Maedi visna virus (strain 1514 / clone LV1-1KS2)
           GN=gag PE=3 SV=1
          Length = 442

 Score = 46.2 bits (108), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 27/45 (60%)

Query: 52  RCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLR 96
           +C+NC +PGH+A  C    ICH CGK GH  +DC    Q G + R
Sbjct: 386 KCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQKKQQGNNRR 430



 Score = 35.8 bits (81), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 15  CNNCKRPGHFARECPNVAVCNNCGLPGHIASEC 47
           C NC +PGH AR+C    +C++CG  GH+  +C
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419



 Score = 34.3 bits (77), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 22/36 (61%)

Query: 98  CNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNE 133
           C NC KPGH+A  C     C +C K GH+ +DC+ +
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQK 422



 Score = 30.8 bits (68), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 18/59 (30%), Positives = 29/59 (49%)

Query: 91  SGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQC 149
            G  ++L     +P   A     ++ C NC K GH+AR C+   +C+ C   GH+ + C
Sbjct: 361 EGFKMQLLAQALRPQGKAGHKGVNQKCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419


>sp|P35955|GAG_VILVK Gag polyprotein OS=Maedi visna virus (strain KV1772) GN=gag PE=3
           SV=1
          Length = 442

 Score = 46.2 bits (108), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 27/45 (60%)

Query: 52  RCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLR 96
           +C+NC +PGH+A  C    ICH CGK GH  +DC    Q G + R
Sbjct: 386 KCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQKKQQGNNRR 430



 Score = 35.8 bits (81), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 15  CNNCKRPGHFARECPNVAVCNNCGLPGHIASEC 47
           C NC +PGH AR+C    +C++CG  GH+  +C
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419



 Score = 34.3 bits (77), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 22/36 (61%)

Query: 98  CNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNE 133
           C NC KPGH+A  C     C +C K GH+ +DC+ +
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQK 422



 Score = 32.0 bits (71), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 18/59 (30%), Positives = 29/59 (49%)

Query: 91  SGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQC 149
            G  ++L     +P   A     ++ C NC K GH+AR C+   +C+ C   GH+ + C
Sbjct: 361 EGFKMQLLAQALRPQGKAGQKGVNQKCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419


>sp|P23424|GAG_VILV1 Gag polyprotein OS=Maedi visna virus (strain 1514 / clone LV1-1KS1)
           GN=gag PE=3 SV=1
          Length = 442

 Score = 46.2 bits (108), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 27/45 (60%)

Query: 52  RCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLR 96
           +C+NC +PGH+A  C    ICH CGK GH  +DC    Q G + R
Sbjct: 386 KCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQKKQQGNNRR 430



 Score = 35.8 bits (81), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 15  CNNCKRPGHFARECPNVAVCNNCGLPGHIASEC 47
           C NC +PGH AR+C    +C++CG  GH+  +C
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419



 Score = 34.3 bits (77), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 22/36 (61%)

Query: 98  CNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNE 133
           C NC KPGH+A  C     C +C K GH+ +DC+ +
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQK 422



 Score = 30.8 bits (68), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 18/59 (30%), Positives = 29/59 (49%)

Query: 91  SGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQC 149
            G  ++L     +P   A     ++ C NC K GH+AR C+   +C+ C   GH+ + C
Sbjct: 361 EGFKMQLLAQALRPQGKAGHKGVNQKCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419


>sp|P03352|GAG_VILV Gag polyprotein OS=Maedi visna virus (strain 1514) GN=gag PE=3 SV=1
          Length = 442

 Score = 46.2 bits (108), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 27/45 (60%)

Query: 52  RCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHVQSGGDLR 96
           +C+NC +PGH+A  C    ICH CGK GH  +DC    Q G + R
Sbjct: 386 KCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQKKQQGNNRR 430



 Score = 35.8 bits (81), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 15  CNNCKRPGHFARECPNVAVCNNCGLPGHIASEC 47
           C NC +PGH AR+C    +C++CG  GH+  +C
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419



 Score = 34.3 bits (77), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 22/36 (61%)

Query: 98  CNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNE 133
           C NC KPGH+A  C     C +C K GH+ +DC+ +
Sbjct: 387 CYNCGKPGHLARQCRQGIICHHCGKRGHMQKDCRQK 422



 Score = 31.2 bits (69), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 18/59 (30%), Positives = 29/59 (49%)

Query: 91  SGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQC 149
            G  ++L     +P   A     ++ C NC K GH+AR C+   +C+ C   GH+ + C
Sbjct: 361 EGFKMQLLAQALRPQGKAGHKGVNQKCYNCGKPGHLARQCRQGIICHHCGKRGHMQKDC 419


>sp|Q8AII1|POL_SIVTN Gag-Pol polyprotein OS=Simian immunodeficiency virus (isolate TAN1)
           GN=gag-pol PE=3 SV=4
          Length = 1462

 Score = 45.8 bits (107), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 2/46 (4%)

Query: 50  QARCWNCREPGHMASNCH--NEGICHSCGKTGHRARDCSTHVQSGG 93
           Q +C+NC + GH A NC    +  C  CG+ GH+ +DC+T   S G
Sbjct: 416 QLQCFNCGKVGHTARNCRAPRKKGCWRCGQEGHQMKDCTTRNNSTG 461



 Score = 38.9 bits (89), Expect = 0.029,   Method: Composition-based stats.
 Identities = 19/45 (42%), Positives = 24/45 (53%), Gaps = 3/45 (6%)

Query: 9   MSQGNL-CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQ 50
           + +G L C NC + GH AR C  P    C  CG  GH   +CTT+
Sbjct: 412 LKKGQLQCFNCGKVGHTARNCRAPRKKGCWRCGQEGHQMKDCTTR 456



 Score = 34.7 bits (78), Expect = 0.51,   Method: Composition-based stats.
 Identities = 15/35 (42%), Positives = 18/35 (51%), Gaps = 2/35 (5%)

Query: 98  CNNCYKPGHIAADC--TNDKACKNCRKTGHIARDC 130
           C NC K GH A +C     K C  C + GH  +DC
Sbjct: 419 CFNCGKVGHTARNCRAPRKKGCWRCGQEGHQMKDC 453



 Score = 33.9 bits (76), Expect = 0.92,   Method: Composition-based stats.
 Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 189 CRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYECPS 224
           C +C ++GH +R+C  P    C  CG  GH   +C +
Sbjct: 419 CFNCGKVGHTARNCRAPRKKGCWRCGQEGHQMKDCTT 455



 Score = 32.3 bits (72), Expect = 2.6,   Method: Composition-based stats.
 Identities = 14/35 (40%), Positives = 18/35 (51%), Gaps = 2/35 (5%)

Query: 117 CKNCRKTGHIARDCQ--NEPVCNLCNIAGHVARQC 149
           C NC K GH AR+C+   +  C  C   GH  + C
Sbjct: 419 CFNCGKVGHTARNCRAPRKKGCWRCGQEGHQMKDC 453


>sp|P24740|POL_HV1U4 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype A (isolate U455) GN=gag-pol PE=1 SV=3
          Length = 1428

 Score = 45.4 bits (106), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 24/79 (30%), Positives = 35/79 (44%), Gaps = 23/79 (29%)

Query: 31  VAVCNNCGLPGHIA---SECTTQA------------------RCWNCREPGHMASNCH-- 67
           +  C   G PGH A   +E  +Q                   +C+NC + GH+A NC   
Sbjct: 343 MTACQGVGGPGHKARVLAEAMSQVQQTSIMMQRGNFRGPRRIKCFNCGKEGHLAKNCRAP 402

Query: 68  NEGICHSCGKTGHRARDCS 86
            +  C  CGK GH+ +DC+
Sbjct: 403 RKKGCWKCGKEGHQMKDCT 421



 Score = 37.4 bits (85), Expect = 0.095,   Method: Composition-based stats.
 Identities = 24/83 (28%), Positives = 34/83 (40%), Gaps = 13/83 (15%)

Query: 72  CHSCGKTGHRAR---DCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIAR 128
           C   G  GH+AR   +  + VQ    +    N   P  I         C NC K GH+A+
Sbjct: 346 CQGVGGPGHKARVLAEAMSQVQQTSIMMQRGNFRGPRRIK--------CFNCGKEGHLAK 397

Query: 129 DCQ--NEPVCNLCNIAGHVARQC 149
           +C+   +  C  C   GH  + C
Sbjct: 398 NCRAPRKKGCWKCGKEGHQMKDC 420



 Score = 37.0 bits (84), Expect = 0.13,   Method: Composition-based stats.
 Identities = 18/51 (35%), Positives = 23/51 (45%), Gaps = 9/51 (17%)

Query: 9   MSQGNL-------CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQ 50
           M +GN        C NC + GH A+ C  P    C  CG  GH   +CT +
Sbjct: 373 MQRGNFRGPRRIKCFNCGKEGHLAKNCRAPRKKGCWKCGKEGHQMKDCTER 423



 Score = 35.4 bits (80), Expect = 0.37,   Method: Composition-based stats.
 Identities = 14/45 (31%), Positives = 23/45 (51%), Gaps = 1/45 (2%)

Query: 179 GRYVGYHDVICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
           G + G   + C +C + GH++++C  P    C  CG  GH   +C
Sbjct: 376 GNFRGPRRIKCFNCGKEGHLAKNCRAPRKKGCWKCGKEGHQMKDC 420


>sp|P04584|POL_HV2RO Gag-Pol polyprotein OS=Human immunodeficiency virus type 2 subtype
           A (isolate ROD) GN=gag-pol PE=1 SV=3
          Length = 1464

 Score = 45.4 bits (106), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 17/35 (48%), Positives = 21/35 (60%), Gaps = 2/35 (5%)

Query: 34  CNNCGLPGHIASECTTQAR--CWNCREPGHMASNC 66
           C NCG  GH A +C    R  CW C +PGH+ +NC
Sbjct: 391 CWNCGKEGHSARQCRAPRRQGCWKCGKPGHIMTNC 425



 Score = 44.3 bits (103), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 31/63 (49%), Gaps = 7/63 (11%)

Query: 72  CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQ 131
           C +CGK GH AR C    + G     C  C KPGHI  +C + +A     +TG + ++  
Sbjct: 391 CWNCGKEGHSARQCRAPRRQG-----CWKCGKPGHIMTNCPDRQA--GFLRTGPLGKEAP 443

Query: 132 NEP 134
             P
Sbjct: 444 QLP 446



 Score = 41.2 bits (95), Expect = 0.006,   Method: Composition-based stats.
 Identities = 17/35 (48%), Positives = 20/35 (57%), Gaps = 2/35 (5%)

Query: 15  CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASEC 47
           C NC + GH AR+C  P    C  CG PGHI + C
Sbjct: 391 CWNCGKEGHSARQCRAPRRQGCWKCGKPGHIMTNC 425



 Score = 38.5 bits (88), Expect = 0.042,   Method: Composition-based stats.
 Identities = 16/37 (43%), Positives = 20/37 (54%), Gaps = 4/37 (10%)

Query: 52  RCWNCREPGHMASNC---HNEGICHSCGKTGHRARDC 85
           +CWNC + GH A  C     +G C  CGK GH   +C
Sbjct: 390 KCWNCGKEGHSARQCRAPRRQG-CWKCGKPGHIMTNC 425



 Score = 33.9 bits (76), Expect = 0.88,   Method: Composition-based stats.
 Identities = 14/36 (38%), Positives = 18/36 (50%), Gaps = 1/36 (2%)

Query: 189 CRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYECP 223
           C +C + GH +R C  P    C  CG  GH+   CP
Sbjct: 391 CWNCGKEGHSARQCRAPRRQGCWKCGKPGHIMTNCP 426


>sp|P24107|POL_HV2CA Gag-Pol polyprotein OS=Human immunodeficiency virus type 2 subtype
           A (isolate CAM2) GN=gag-pol PE=3 SV=3
          Length = 1462

 Score = 45.4 bits (106), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 17/35 (48%), Positives = 21/35 (60%), Gaps = 2/35 (5%)

Query: 34  CNNCGLPGHIASECTTQAR--CWNCREPGHMASNC 66
           C NCG  GH A +C    R  CW C +PGH+ +NC
Sbjct: 391 CWNCGKEGHSARQCRAPRRQGCWKCGKPGHIMTNC 425



 Score = 44.7 bits (104), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 27/79 (34%), Positives = 34/79 (43%), Gaps = 14/79 (17%)

Query: 15  CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQA----RCWNC--------REPG 60
           C NC + GH AR+C  P    C  CG PGHI + C  +     R W          R P 
Sbjct: 391 CWNCGKEGHSARQCRAPRRQGCWKCGKPGHIMTNCPDRQAGFLRDWPLGKEAPQFPRGPS 450

Query: 61  HMASNCHNEGICHSCGKTG 79
              +N ++  I  S G TG
Sbjct: 451 STGANTNSTPIGSSSGSTG 469



 Score = 43.5 bits (101), Expect = 0.001,   Method: Composition-based stats.
 Identities = 19/45 (42%), Positives = 24/45 (53%), Gaps = 5/45 (11%)

Query: 72  CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKA 116
           C +CGK GH AR C    + G     C  C KPGHI  +C + +A
Sbjct: 391 CWNCGKEGHSARQCRAPRRQG-----CWKCGKPGHIMTNCPDRQA 430



 Score = 38.5 bits (88), Expect = 0.042,   Method: Composition-based stats.
 Identities = 16/37 (43%), Positives = 20/37 (54%), Gaps = 4/37 (10%)

Query: 52  RCWNCREPGHMASNC---HNEGICHSCGKTGHRARDC 85
           +CWNC + GH A  C     +G C  CGK GH   +C
Sbjct: 390 KCWNCGKEGHSARQCRAPRRQG-CWKCGKPGHIMTNC 425



 Score = 35.0 bits (79), Expect = 0.47,   Method: Composition-based stats.
 Identities = 22/60 (36%), Positives = 27/60 (45%), Gaps = 3/60 (5%)

Query: 98  CNNCYKPGHIAADC--TNDKACKNCRKTGHIARDCQNEPVCNLCNIA-GHVARQCPKGDS 154
           C NC K GH A  C     + C  C K GHI  +C +     L +   G  A Q P+G S
Sbjct: 391 CWNCGKEGHSARQCRAPRRQGCWKCGKPGHIMTNCPDRQAGFLRDWPLGKEAPQFPRGPS 450



 Score = 33.9 bits (76), Expect = 0.88,   Method: Composition-based stats.
 Identities = 14/36 (38%), Positives = 18/36 (50%), Gaps = 1/36 (2%)

Query: 189 CRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYECP 223
           C +C + GH +R C  P    C  CG  GH+   CP
Sbjct: 391 CWNCGKEGHSARQCRAPRRQGCWKCGKPGHIMTNCP 426


>sp|O12158|POL_HV192 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype C (isolate 92BR025) GN=gag-pol PE=1 SV=2
          Length = 1431

 Score = 45.4 bits (106), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 29/89 (32%), Positives = 34/89 (38%), Gaps = 8/89 (8%)

Query: 31  VAVCNNCGLPGHIASECTTQARCWNCREPGHMASNCHNEG---ICHSCGKTGHRARDCST 87
           +  C   G PGH A          N        SNC        C +CGK GH AR+C  
Sbjct: 345 MTACQGVGGPGHKARVLAEAMSKVNNTNIMMQRSNCKGPKRTIKCFNCGKEGHLARNCRA 404

Query: 88  HVQSGGDLRLCNNCYKPGHIAADCTNDKA 116
             + G     C  C K GH   DCT  +A
Sbjct: 405 PRKKG-----CWKCGKEGHQVKDCTERQA 428



 Score = 39.7 bits (91), Expect = 0.018,   Method: Composition-based stats.
 Identities = 24/83 (28%), Positives = 35/83 (42%), Gaps = 12/83 (14%)

Query: 72  CHSCGKTGHRAR---DCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIAR 128
           C   G  GH+AR   +  + V +   +   +NC  P            C NC K GH+AR
Sbjct: 348 CQGVGGPGHKARVLAEAMSKVNNTNIMMQRSNCKGPKRTIK-------CFNCGKEGHLAR 400

Query: 129 DCQ--NEPVCNLCNIAGHVARQC 149
           +C+   +  C  C   GH  + C
Sbjct: 401 NCRAPRKKGCWKCGKEGHQVKDC 423



 Score = 38.1 bits (87), Expect = 0.059,   Method: Composition-based stats.
 Identities = 16/38 (42%), Positives = 19/38 (50%), Gaps = 2/38 (5%)

Query: 15  CNNCKRPGHFAREC--PNVAVCNNCGLPGHIASECTTQ 50
           C NC + GH AR C  P    C  CG  GH   +CT +
Sbjct: 389 CFNCGKEGHLARNCRAPRKKGCWKCGKEGHQVKDCTER 426



 Score = 34.3 bits (77), Expect = 0.70,   Method: Composition-based stats.
 Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 1/35 (2%)

Query: 189 CRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
           C +C + GH++R+C  P    C  CG  GH   +C
Sbjct: 389 CFNCGKEGHLARNCRAPRKKGCWKCGKEGHQVKDC 423


>sp|P35963|POL_HV1Y2 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype B (isolate YU-2) GN=gag-pol PE=1 SV=3
          Length = 1435

 Score = 45.1 bits (105), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 25/81 (30%), Positives = 35/81 (43%), Gaps = 25/81 (30%)

Query: 31  VAVCNNCGLPGHIA-------SECTTQA----------------RCWNCREPGHMASNCH 67
           +  C   G PGH A       S+ T  A                +C+NC + GH+A NC 
Sbjct: 347 MTACQGVGGPGHKARVLAEAMSQVTNSATIMMQRGNFRNQRKTVKCFNCGKEGHIAKNCR 406

Query: 68  --NEGICHSCGKTGHRARDCS 86
              +  C  CGK GH+ +DC+
Sbjct: 407 APRKKGCWKCGKEGHQMKDCT 427



 Score = 34.7 bits (78), Expect = 0.52,   Method: Composition-based stats.
 Identities = 21/83 (25%), Positives = 34/83 (40%), Gaps = 11/83 (13%)

Query: 72  CHSCGKTGHRAR---DCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIAR 128
           C   G  GH+AR   +  + V +   + +    ++       C N      C K GHIA+
Sbjct: 350 CQGVGGPGHKARVLAEAMSQVTNSATIMMQRGNFRNQRKTVKCFN------CGKEGHIAK 403

Query: 129 DCQ--NEPVCNLCNIAGHVARQC 149
           +C+   +  C  C   GH  + C
Sbjct: 404 NCRAPRKKGCWKCGKEGHQMKDC 426



 Score = 32.7 bits (73), Expect = 2.2,   Method: Composition-based stats.
 Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 187 VICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
           V C +C + GH++++C  P    C  CG  GH   +C
Sbjct: 390 VKCFNCGKEGHIAKNCRAPRKKGCWKCGKEGHQMKDC 426


>sp|Q73368|POL_HV1B9 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype B (strain 89.6) GN=gag-pol PE=3 SV=3
          Length = 1435

 Score = 45.1 bits (105), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 25/81 (30%), Positives = 35/81 (43%), Gaps = 25/81 (30%)

Query: 31  VAVCNNCGLPGHIA-------SECTTQA----------------RCWNCREPGHMASNCH 67
           +  C   G PGH A       S+ T  A                +C+NC + GH+A NC 
Sbjct: 347 MTACQGVGGPGHKARVLAEAMSQVTNSATIMMQRGNFRNQRKTVKCFNCGKEGHIAKNCR 406

Query: 68  --NEGICHSCGKTGHRARDCS 86
              +  C  CGK GH+ +DC+
Sbjct: 407 APRKKGCWKCGKEGHQMKDCT 427



 Score = 34.7 bits (78), Expect = 0.52,   Method: Composition-based stats.
 Identities = 21/83 (25%), Positives = 34/83 (40%), Gaps = 11/83 (13%)

Query: 72  CHSCGKTGHRAR---DCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIAR 128
           C   G  GH+AR   +  + V +   + +    ++       C N      C K GHIA+
Sbjct: 350 CQGVGGPGHKARVLAEAMSQVTNSATIMMQRGNFRNQRKTVKCFN------CGKEGHIAK 403

Query: 129 DCQ--NEPVCNLCNIAGHVARQC 149
           +C+   +  C  C   GH  + C
Sbjct: 404 NCRAPRKKGCWKCGKEGHQMKDC 426



 Score = 32.7 bits (73), Expect = 2.2,   Method: Composition-based stats.
 Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 187 VICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
           V C +C + GH++++C  P    C  CG  GH   +C
Sbjct: 390 VKCFNCGKEGHIAKNCRAPRKKGCWKCGKEGHQMKDC 426


>sp|Q9WC63|POL_HV1S9 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype J (isolate SE9173) GN=gag-pol PE=3 SV=3
          Length = 1432

 Score = 44.7 bits (104), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 35/80 (43%), Gaps = 24/80 (30%)

Query: 31  VAVCNNCGLPGHIA---SECTTQA-------------------RCWNCREPGHMASNCH- 67
           +  C   G PGH A   +E  +Q                    +C+NC + GH+A NC  
Sbjct: 347 MTACQGVGGPGHKARVLAEAMSQVTNTNIMMQRGNFRDHKRIVKCFNCGKQGHIAKNCRA 406

Query: 68  -NEGICHSCGKTGHRARDCS 86
             +  C  CGK GH+ +DC+
Sbjct: 407 PRKKGCWKCGKEGHQMKDCT 426



 Score = 33.9 bits (76), Expect = 0.91,   Method: Composition-based stats.
 Identities = 23/80 (28%), Positives = 34/80 (42%), Gaps = 6/80 (7%)

Query: 72  CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQ 131
           C   G  GH+AR      ++   +   N   + G+   D      C NC K GHIA++C+
Sbjct: 350 CQGVGGPGHKAR---VLAEAMSQVTNTNIMMQRGNFR-DHKRIVKCFNCGKQGHIAKNCR 405

Query: 132 --NEPVCNLCNIAGHVARQC 149
              +  C  C   GH  + C
Sbjct: 406 APRKKGCWKCGKEGHQMKDC 425



 Score = 33.5 bits (75), Expect = 1.4,   Method: Composition-based stats.
 Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 187 VICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
           V C +C + GH++++C  P    C  CG  GH   +C
Sbjct: 389 VKCFNCGKQGHIAKNCRAPRKKGCWKCGKEGHQMKDC 425


>sp|Q9WC54|POL_HV1S2 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype J (isolate SE9280) GN=gag-pol PE=3 SV=3
          Length = 1432

 Score = 44.7 bits (104), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 35/80 (43%), Gaps = 24/80 (30%)

Query: 31  VAVCNNCGLPGHIA---SECTTQA-------------------RCWNCREPGHMASNCH- 67
           +  C   G PGH A   +E  +Q                    +C+NC + GH+A NC  
Sbjct: 347 MTACQGVGGPGHKARVLAEAMSQVTNTNIMMQRGNFRDHKRIVKCFNCGKQGHIAKNCRA 406

Query: 68  -NEGICHSCGKTGHRARDCS 86
             +  C  CGK GH+ +DC+
Sbjct: 407 PRKKGCWKCGKEGHQMKDCT 426



 Score = 33.9 bits (76), Expect = 0.91,   Method: Composition-based stats.
 Identities = 23/80 (28%), Positives = 34/80 (42%), Gaps = 6/80 (7%)

Query: 72  CHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQ 131
           C   G  GH+AR      ++   +   N   + G+   D      C NC K GHIA++C+
Sbjct: 350 CQGVGGPGHKAR---VLAEAMSQVTNTNIMMQRGNFR-DHKRIVKCFNCGKQGHIAKNCR 405

Query: 132 --NEPVCNLCNIAGHVARQC 149
              +  C  C   GH  + C
Sbjct: 406 APRKKGCWKCGKEGHQMKDC 425



 Score = 33.5 bits (75), Expect = 1.4,   Method: Composition-based stats.
 Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 187 VICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
           V C +C + GH++++C  P    C  CG  GH   +C
Sbjct: 389 VKCFNCGKQGHIAKNCRAPRKKGCWKCGKEGHQMKDC 425


>sp|P12497|POL_HV1N5 Gag-Pol polyprotein OS=Human immunodeficiency virus type 1 group M
           subtype B (isolate NY5) GN=gag-pol PE=1 SV=4
          Length = 1435

 Score = 44.7 bits (104), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 25/81 (30%), Positives = 35/81 (43%), Gaps = 25/81 (30%)

Query: 31  VAVCNNCGLPGHIA-------SECTTQA----------------RCWNCREPGHMASNCH 67
           +  C   G PGH A       S+ T  A                +C+NC + GH+A NC 
Sbjct: 347 MTACQGVGGPGHKARVLAEAMSQVTNPATIMIQKGNFRNQRKTVKCFNCGKEGHIAKNCR 406

Query: 68  --NEGICHSCGKTGHRARDCS 86
              +  C  CGK GH+ +DC+
Sbjct: 407 APRKKGCWKCGKEGHQMKDCT 427



 Score = 33.9 bits (76), Expect = 0.96,   Method: Composition-based stats.
 Identities = 21/83 (25%), Positives = 34/83 (40%), Gaps = 11/83 (13%)

Query: 72  CHSCGKTGHRAR---DCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIAR 128
           C   G  GH+AR   +  + V +   + +    ++       C N      C K GHIA+
Sbjct: 350 CQGVGGPGHKARVLAEAMSQVTNPATIMIQKGNFRNQRKTVKCFN------CGKEGHIAK 403

Query: 129 DCQ--NEPVCNLCNIAGHVARQC 149
           +C+   +  C  C   GH  + C
Sbjct: 404 NCRAPRKKGCWKCGKEGHQMKDC 426



 Score = 32.7 bits (73), Expect = 2.2,   Method: Composition-based stats.
 Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 187 VICRSCNQMGHMSRDCVGPLII-CRNCGGRGHMAYEC 222
           V C +C + GH++++C  P    C  CG  GH   +C
Sbjct: 390 VKCFNCGKEGHIAKNCRAPRKKGCWKCGKEGHQMKDC 426


>sp|Q9HFF2|YL92_SCHPO Uncharacterized protein C683.02c OS=Schizosaccharomyces pombe
           (strain 972 / ATCC 24843) GN=SPAC683.02c PE=4 SV=1
          Length = 218

 Score = 44.3 bits (103), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 41/90 (45%), Gaps = 17/90 (18%)

Query: 13  NLCNNCKRPGHFARECP----NVAVCNNCGLPGHIASECTTQ-----ARCWNCREPGHMA 63
             C  C++ GH  ++CP    NV++C  CG   H  + C+ +     A+C+ C E GH++
Sbjct: 77  KFCFACRQQGHIVQDCPEAKDNVSICFRCGSKEHSLNACSKKGPLKFAKCFICHENGHLS 136

Query: 64  SNCHNE--------GICHSCGKTGHRARDC 85
             C           G C  C    H A+DC
Sbjct: 137 GQCEQNPKGLYPKGGCCKFCSSVHHLAKDC 166



 Score = 39.7 bits (91), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 40/92 (43%), Gaps = 18/92 (19%)

Query: 53  CWNCREPGHMASNC----HNEGICHSCGKTGHRARDCSTHVQSGGDLRL--CNNCYKPGH 106
           C+ CR+ GH+  +C     N  IC  CG   H    CS      G L+   C  C++ GH
Sbjct: 79  CFACRQQGHIVQDCPEAKDNVSICFRCGSKEHSLNACS----KKGPLKFAKCFICHENGH 134

Query: 107 IAADCTNDKA--------CKNCRKTGHIARDC 130
           ++  C  +          CK C    H+A+DC
Sbjct: 135 LSGQCEQNPKGLYPKGGCCKFCSSVHHLAKDC 166


>sp|A1L2T6|ZCHC7_XENLA Zinc finger CCHC domain-containing protein 7 OS=Xenopus laevis
           GN=zcchc7 PE=2 SV=2
          Length = 563

 Score = 42.7 bits (99), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 38/85 (44%), Gaps = 8/85 (9%)

Query: 8   FMSQGNLCNNCKRPGHFARECP---NVAVCNNCGLPGHIASECTTQARCWNCREPGHMAS 64
           +  +  +C NC + GH ++ CP    +  C  CG  GH  + C ++  C NC  PGH   
Sbjct: 260 YTDKNVVCRNCDKRGHLSKNCPVPKKLPACCLCGERGHYQNSCPSRY-CLNCFLPGHFFK 318

Query: 65  NC----HNEGICHSCGKTGHRARDC 85
            C    +    CH C   GH A  C
Sbjct: 319 ECIERAYWRKTCHRCSMPGHYADAC 343



 Score = 42.0 bits (97), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 35/79 (44%), Gaps = 6/79 (7%)

Query: 152 GDSLGERGGGGGGERGGGGGGDGGGG----GGRYVGYHDVICRSCNQMGHMSRDCVGP-- 205
            D  GERG              G         RY    +V+CR+C++ GH+S++C  P  
Sbjct: 226 SDDEGERGAAWSISEKDVEAQIGNYTPLRRSNRYYTDKNVVCRNCDKRGHLSKNCPVPKK 285

Query: 206 LIICRNCGGRGHMAYECPS 224
           L  C  CG RGH    CPS
Sbjct: 286 LPACCLCGERGHYQNSCPS 304



 Score = 37.4 bits (85), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 44/181 (24%), Positives = 61/181 (33%), Gaps = 61/181 (33%)

Query: 53  CWNCREPGHMASNC---HNEGICHSCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGHIAA 109
           C NC + GH++ NC        C  CG+ GH    C +        R C NC+ PGH   
Sbjct: 267 CRNCDKRGHLSKNCPVPKKLPACCLCGERGHYQNSCPS--------RYCLNCFLPGHFFK 318

Query: 110 DCTNDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCP---KGDSLGERGGGGGGER 166
           +C         RKT            C+ C++ GH A  CP   +   L  + G     +
Sbjct: 319 ECIERAY---WRKT------------CHRCSMPGHYADACPEIWRQYHLTIKAGPIKKPK 363

Query: 167 GGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYECPSGR 226
              G  D                                ++ C NC  +GH  YEC   R
Sbjct: 364 SHSGQKD--------------------------------IVYCCNCAKKGHCIYECKERR 391

Query: 227 I 227
           +
Sbjct: 392 M 392



 Score = 35.8 bits (81), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 43/105 (40%), Gaps = 16/105 (15%)

Query: 15  CNNCKRPGHFARECPNVA----VCNNCGLPGHIASECTTQARCWNC-------REPGHMA 63
           C NC  PGHF +EC   A     C+ C +PGH A  C    R ++        ++P    
Sbjct: 307 CLNCFLPGHFFKECIERAYWRKTCHRCSMPGHYADACPEIWRQYHLTIKAGPIKKP---K 363

Query: 64  SNCHNEGICH--SCGKTGHRARDCSTHVQSGGDLRLCNNCYKPGH 106
           S+   + I +  +C K GH   +C     +   L  C   +   H
Sbjct: 364 SHSGQKDIVYCCNCAKKGHCIYECKERRMNSDKLPTCQLVFSYDH 408



 Score = 31.2 bits (69), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 20/71 (28%), Positives = 31/71 (43%), Gaps = 11/71 (15%)

Query: 83  RDCSTHVQSGGDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDC---QNEPVCNLC 139
           +D    + +   LR  N  Y   ++         C+NC K GH++++C   +  P C LC
Sbjct: 241 KDVEAQIGNYTPLRRSNRYYTDKNVV--------CRNCDKRGHLSKNCPVPKKLPACCLC 292

Query: 140 NIAGHVARQCP 150
              GH    CP
Sbjct: 293 GERGHYQNSCP 303


>sp|P33458|GAG_CAEVC Gag polyprotein OS=Caprine arthritis encephalitis virus (strain
           Cork) GN=gag PE=3 SV=1
          Length = 441

 Score = 42.0 bits (97), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 23/34 (67%)

Query: 52  RCWNCREPGHMASNCHNEGICHSCGKTGHRARDC 85
           RC+NC +PGH A  C    ICH+CGK GH  ++C
Sbjct: 380 RCYNCGKPGHQARQCRQGIICHNCGKRGHMQKEC 413



 Score = 38.9 bits (89), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 23/37 (62%)

Query: 11  QGNLCNNCKRPGHFARECPNVAVCNNCGLPGHIASEC 47
           Q   C NC +PGH AR+C    +C+NCG  GH+  EC
Sbjct: 377 QPQRCYNCGKPGHQARQCRQGIICHNCGKRGHMQKEC 413



 Score = 34.7 bits (78), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 32/75 (42%), Gaps = 5/75 (6%)

Query: 61  HMASNCHNEGICHSCGKTGHRARDCSTHVQSG-----GDLRLCNNCYKPGHIAADCTNDK 115
             AS       C   G  G + +  +  ++ G     G  + C NC KPGH A  C    
Sbjct: 339 QQASVEEKMQACRDVGSEGFKMQLLAQALRPGKGKGNGQPQRCYNCGKPGHQARQCRQGI 398

Query: 116 ACKNCRKTGHIARDC 130
            C NC K GH+ ++C
Sbjct: 399 ICHNCGKRGHMQKEC 413



 Score = 33.1 bits (74), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 21/34 (61%), Gaps = 1/34 (2%)

Query: 189 CRSCNQMGHMSRDCVGPLIICRNCGGRGHMAYEC 222
           C +C + GH +R C    IIC NCG RGHM  EC
Sbjct: 381 CYNCGKPGHQARQCRQG-IICHNCGKRGHMQKEC 413



 Score = 30.8 bits (68), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 18/33 (54%)

Query: 34  CNNCGLPGHIASECTTQARCWNCREPGHMASNC 66
           C NCG PGH A +C     C NC + GHM   C
Sbjct: 381 CYNCGKPGHQARQCRQGIICHNCGKRGHMQKEC 413


>sp|P34689|GLH1_CAEEL ATP-dependent RNA helicase glh-1 OS=Caenorhabditis elegans GN=glh-1
           PE=1 SV=3
          Length = 763

 Score = 42.0 bits (97), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 47/120 (39%), Gaps = 47/120 (39%)

Query: 13  NLCNNCKRPGHFARECP------NVAVCNNCGLPGHIASECTTQ---------------- 50
           N C NC++PGH + +CP         VC NC  PGH + ECT +                
Sbjct: 158 NNCFNCQQPGHRSSDCPEPRKEREPRVCYNCQQPGHTSRECTEERKPREGRTGGFGGGAG 217

Query: 51  ------------------------ARCWNCREPGHMASNCHNEGI-CHSCGKTGHRARDC 85
                                    +C+NC+  GH ++ C      C +CG+ GHR+ +C
Sbjct: 218 FGNNGGNDGFGGDGGFGGGEERGPMKCFNCKGEGHRSAECPEPPRGCFNCGEQGHRSNEC 277



 Score = 38.1 bits (87), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 46/132 (34%), Positives = 62/132 (46%), Gaps = 19/132 (14%)

Query: 98  CNNCYKPGHIAADCTNDK------ACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPK 151
           C NC +PGH ++DC   +       C NC++ GH +R+C  E             R+  +
Sbjct: 160 CFNCQQPGHRSSDCPEPRKEREPRVCYNCQQPGHTSRECTEE-------------RKPRE 206

Query: 152 GDSLGERGGGGGGERGGGGGGDGGGGGGRYVGYHDVICRSCNQMGHMSRDCVGPLIICRN 211
           G + G  GG G G  GG  G  G GG G       + C +C   GH S +C  P   C N
Sbjct: 207 GRTGGFGGGAGFGNNGGNDGFGGDGGFGGGEERGPMKCFNCKGEGHRSAECPEPPRGCFN 266

Query: 212 CGGRGHMAYECP 223
           CG +GH + ECP
Sbjct: 267 CGEQGHRSNECP 278


>sp|P40507|AIR1_YEAST Protein AIR1 OS=Saccharomyces cerevisiae (strain ATCC 204508 /
           S288c) GN=AIR1 PE=1 SV=1
          Length = 360

 Score = 41.6 bits (96), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 48/121 (39%), Gaps = 18/121 (14%)

Query: 15  CNNCKRPGHFARECPNVAVCNNCG-LPGHIASECTTQARCWNCREPGHMASNCHNEG--- 70
           CNNC + GH  R CP+V +C  CG +  H +  C     C NC   GH  S C ++    
Sbjct: 76  CNNCSQRGHLKRNCPHV-ICTYCGFMDDHYSQHCPKAIICTNCNANGHYKSQCPHKWKKV 134

Query: 71  ICHSCGKTGH---------RARDCSTHVQSGGDLR----LCNNCYKPGHIAADCTNDKAC 117
            C  C    H         R+    T   + GD       C NC   GH   DC   ++ 
Sbjct: 135 FCTLCNSKRHSRERCPSIWRSYLLKTKDANQGDFDFQTVFCYNCGNAGHFGDDCAERRSS 194

Query: 118 K 118
           +
Sbjct: 195 R 195



 Score = 32.0 bits (71), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 31/92 (33%), Gaps = 37/92 (40%)

Query: 133 EPVCNLCNIAGHVARQCPKGDSLGERGGGGGGERGGGGGGDGGGGGGRYVGYHDVICRSC 192
           EP CN C+  GH+ R CP                                    VIC  C
Sbjct: 73  EPKCNNCSQRGHLKRNCPH-----------------------------------VICTYC 97

Query: 193 NQMG-HMSRDCVGPLIICRNCGGRGHMAYECP 223
             M  H S+ C    IIC NC   GH   +CP
Sbjct: 98  GFMDDHYSQHCPK-AIICTNCNANGHYKSQCP 128


>sp|Q9NUD5|ZCHC3_HUMAN Zinc finger CCHC domain-containing protein 3 OS=Homo sapiens
           GN=ZCCHC3 PE=1 SV=1
          Length = 404

 Score = 41.6 bits (96), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 93  GDLRLCNNCYKPGHIAADCTNDKACKNCRKTGHIARDCQNEPVCNLCNIAGHVARQCPKG 152
           G  + C  C    H++  CT D+ C  C + GH++  C+   VCNLC   GH   QCPK 
Sbjct: 331 GQPKTCFKCGSRTHMSGSCTQDR-CFRCGEEGHLSPYCRKGIVCNLCGKRGHAFAQCPKA 389



 Score = 40.4 bits (93), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 1/57 (1%)

Query: 33  VCNNCGLPGHIASECTTQARCWNCREPGHMASNCHNEGICHSCGKTGHRARDCSTHV 89
            C  CG   H++  CT Q RC+ C E GH++  C    +C+ CGK GH    C   V
Sbjct: 335 TCFKCGSRTHMSGSCT-QDRCFRCGEEGHLSPYCRKGIVCNLCGKRGHAFAQCPKAV 390


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.323    0.141    0.500 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 107,698,315
Number of Sequences: 539616
Number of extensions: 5394359
Number of successful extensions: 108989
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 1340
Number of HSP's successfully gapped in prelim test: 567
Number of HSP's that attempted gapping in prelim test: 44388
Number of HSP's gapped (non-prelim): 35501
length of query: 235
length of database: 191,569,459
effective HSP length: 114
effective length of query: 121
effective length of database: 130,053,235
effective search space: 15736441435
effective search space used: 15736441435
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 59 (27.3 bits)