BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy10308
         (225 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|340727004|ref|XP_003401841.1| PREDICTED: hypothetical protein LOC100648841 [Bombus terrestris]
          Length = 1992

 Score =  211 bits (538), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 130/229 (56%), Positives = 152/229 (66%), Gaps = 16/229 (6%)

Query: 3    SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWG-TSQPQGGWSGTWV 59
            +++LWG P  K RGPPPG+        SNGW     G     ++WG  S     W  TW+
Sbjct: 1774 TSELWGAPMSKVRGPPPGLSSKTTGNTSNGWAGF--GTVSRSSSWGFQSSTNAAWVSTWL 1831

Query: 60   LLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGN 119
            LLKNLTPQIDGSTLKTLC+QHGP+Q+F LYLNH +AL KYS+R+EAIKAQG LNNC+LGN
Sbjct: 1832 LLKNLTPQIDGSTLKTLCMQHGPVQDFRLYLNHGIALTKYSSRDEAIKAQGALNNCVLGN 1891

Query: 120  TTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSAL-SNKDTWSSGGGGGNT 178
            TTIFAE+P+D EV SLL  LS           G G   R S+      DTW     GG++
Sbjct: 1892 TTIFAESPADTEVHSLLQQLSHGGQQQAGATTGAGWGLRPSNKTGPPPDTW-----GGSS 1946

Query: 179  SQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGESM 225
            SQLWG P  PSS  SLW +  +DS D  RATPSSLNS+LPGDLLGGESM
Sbjct: 1947 SQLWGAP--PSS-NSLWSSAGIDSNDQQRATPSSLNSYLPGDLLGGESM 1992


>gi|350414279|ref|XP_003490265.1| PREDICTED: hypothetical protein LOC100744615 [Bombus impatiens]
          Length = 1991

 Score =  211 bits (537), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 130/229 (56%), Positives = 152/229 (66%), Gaps = 16/229 (6%)

Query: 3    SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWG-TSQPQGGWSGTWV 59
            +++LWG P  K RGPPPG+        SNGW     G     ++WG  S     W  TW+
Sbjct: 1773 TSELWGAPMSKVRGPPPGLSSKTTGNTSNGWAGF--GTVSRSSSWGFQSSTNAAWVSTWL 1830

Query: 60   LLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGN 119
            LLKNLTPQIDGSTLKTLC+QHGP+Q+F LYLNH +AL KYS+R+EAIKAQG LNNC+LGN
Sbjct: 1831 LLKNLTPQIDGSTLKTLCMQHGPVQDFRLYLNHGIALTKYSSRDEAIKAQGALNNCVLGN 1890

Query: 120  TTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSAL-SNKDTWSSGGGGGNT 178
            TTIFAE+P+D EV SLL  LS           G G   R S+      DTW     GG++
Sbjct: 1891 TTIFAESPADTEVHSLLQQLSHGGQQQAGATTGAGWGLRPSNKTGPPPDTW-----GGSS 1945

Query: 179  SQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGESM 225
            SQLWG P  PSS  SLW +  +DS D  RATPSSLNS+LPGDLLGGESM
Sbjct: 1946 SQLWGAP--PSS-NSLWSSAGIDSNDQQRATPSSLNSYLPGDLLGGESM 1991


>gi|383860126|ref|XP_003705542.1| PREDICTED: protein Gawky-like [Megachile rotundata]
          Length = 1832

 Score =  205 bits (521), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 122/224 (54%), Positives = 145/224 (64%), Gaps = 13/224 (5%)

Query: 3    SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVL 60
            +++LWG P  K RGPPPG+        SNGW      G    +    S    GW  TW+L
Sbjct: 1586 TSELWGAPMSKVRGPPPGLSSKATGNTSNGWAGLGTVGRSSSSWGLQSSTNAGWVSTWLL 1645

Query: 61   LKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNT 120
            LKNLTPQIDGSTLKTLC+QHGP+Q+F LYLNH +AL KYS+R+EAIKAQG LNNC+LGNT
Sbjct: 1646 LKNLTPQIDGSTLKTLCMQHGPVQDFRLYLNHGIALTKYSSRDEAIKAQGALNNCVLGNT 1705

Query: 121  TIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSAL-SNKDTWSSGGGGGNTS 179
            TIFAE+P+D+EV +LL  LS           G G   R S+      DTW     GG++S
Sbjct: 1706 TIFAESPADSEVHALLQQLSHGGQQQTGATTGAGWSLRPSNKTGPPPDTW-----GGSSS 1760

Query: 180  QLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLG 221
            QLWG    P S  SLW +  +DS D  RATPSSLNS+LPGDLLG
Sbjct: 1761 QLWGA---PQSSNSLWSSTGIDSNDQQRATPSSLNSYLPGDLLG 1801


>gi|328783711|ref|XP_395115.4| PREDICTED: hypothetical protein LOC411646 [Apis mellifera]
          Length = 1801

 Score =  203 bits (516), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 126/225 (56%), Positives = 147/225 (65%), Gaps = 16/225 (7%)

Query: 3    SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWG-TSQPQGGWSGTWV 59
            +++LWG P  K RGPPPG+        SNGW     G     ++WG  S     W  TW+
Sbjct: 1575 TSELWGAPMSKVRGPPPGLSSKATGNASNGWAGF--GTVSRSSSWGFQSSTNAAWVSTWL 1632

Query: 60   LLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGN 119
            LLKNLTPQIDGSTLKTLC+QHGP+Q+F LYLNH +AL KYS+R+EAIKAQG LNNC+LGN
Sbjct: 1633 LLKNLTPQIDGSTLKTLCMQHGPVQDFRLYLNHGIALTKYSSRDEAIKAQGALNNCVLGN 1692

Query: 120  TTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSAL-SNKDTWSSGGGGGNT 178
            TTIFAE+P+D EV SLL  LS           G G   R S+      DTW     GG++
Sbjct: 1693 TTIFAESPADTEVHSLLQQLSHGGQQQAGATTGAGWGLRPSNKTGPPPDTW-----GGSS 1747

Query: 179  SQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLG 221
            SQLWG P  PSS  SLW    +DS D  RATPSSLNS+LPGDLLG
Sbjct: 1748 SQLWGAP--PSS-NSLWSNAGIDSNDQQRATPSSLNSYLPGDLLG 1789


>gi|380028808|ref|XP_003698078.1| PREDICTED: uncharacterized protein LOC100863913 [Apis florea]
          Length = 1807

 Score =  203 bits (516), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 127/229 (55%), Positives = 149/229 (65%), Gaps = 16/229 (6%)

Query: 3    SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWG-TSQPQGGWSGTWV 59
            +++LWG P  K RGPPPG+        SNGW     G     ++WG  S     W  TW+
Sbjct: 1560 TSELWGAPMSKVRGPPPGLSSKATGNASNGWAGF--GTVSRSSSWGFQSSTNAAWVSTWL 1617

Query: 60   LLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGN 119
            LLKNLTPQIDGSTLKTLC+QHGP+Q+F LYLNH +AL KYS+R+EAIKAQG LNNC+LGN
Sbjct: 1618 LLKNLTPQIDGSTLKTLCMQHGPVQDFRLYLNHGIALTKYSSRDEAIKAQGALNNCVLGN 1677

Query: 120  TTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSAL-SNKDTWSSGGGGGNT 178
            TTIFAE+P+D EV SLL  LS           G G   R S+      DTW     GG++
Sbjct: 1678 TTIFAESPADTEVHSLLQQLSHGGQQQAGATTGAGWGLRPSNKTGPPPDTW-----GGSS 1732

Query: 179  SQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGESM 225
            SQLWG P  PSS  SLW    +DS D  RATPSSLNS+LPGDLLG  S+
Sbjct: 1733 SQLWGAP--PSS-NSLWSNAGIDSNDQQRATPSSLNSYLPGDLLGDGSV 1778


>gi|307198673|gb|EFN79509.1| Trinucleotide repeat-containing gene 6C protein [Harpegnathos
            saltator]
          Length = 2031

 Score =  196 bits (499), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 123/230 (53%), Positives = 151/230 (65%), Gaps = 15/230 (6%)

Query: 3    SNDLWGPP--KPRGPPPGMMGGGGKPPSNGW--MVRPNGGGGGGNTWGTSQPQGGWSGTW 58
            +++LWG P  K RGPPPG+   G    SNGW  +   +          ++    GW  TW
Sbjct: 1810 TSELWGAPMSKARGPPPGLGSKGATNTSNGWAGLGSVSRSSSSWGLQSSTVSNSGWMSTW 1869

Query: 59   VLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILG 118
            +LLKNLTPQIDGSTLKTLC QHGP+Q+F LY NH +AL KYSTR+EAIKAQG LNNC+LG
Sbjct: 1870 LLLKNLTPQIDGSTLKTLCAQHGPVQDFRLYQNHGIALTKYSTRDEAIKAQGALNNCVLG 1929

Query: 119  NTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGS-SALSNKDTWSSGGGGGN 177
            NTTIFAE+P+++EV ++L  L          +GG G   R +  A    DTW     GG+
Sbjct: 1930 NTTIFAESPAESEVHTILQQLGHGGQQQAGGSGGAGWGLRPTNKAGPPPDTW-----GGS 1984

Query: 178  TSQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGESM 225
            +SQLWG P  P+S  SLW    +D+ D  RATPSSLNS+LPGDLLGGESM
Sbjct: 1985 SSQLWGVP--PTS-NSLWSNAGIDNSDQQRATPSSLNSYLPGDLLGGESM 2031


>gi|332026373|gb|EGI66502.1| Trinucleotide repeat-containing gene 6A protein [Acromyrmex
            echinatior]
          Length = 1888

 Score =  195 bits (496), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 125/236 (52%), Positives = 154/236 (65%), Gaps = 26/236 (11%)

Query: 3    SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGT--------SQPQG 52
            +++LWG P  K RGPPPG+   G    SNGW     G G G  +  +        +    
Sbjct: 1666 TSELWGAPMSKARGPPPGLSSKGATNASNGW-----GAGLGSVSRSSSSWGLQSSTVSNS 1720

Query: 53   GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNL 112
            GW  TW+LLKNLTPQIDGSTLKTLC+QHGP+Q+F LY NH +AL KYS+R+EAIKAQG L
Sbjct: 1721 GWMSTWLLLKNLTPQIDGSTLKTLCMQHGPVQDFRLYQNHGIALTKYSSRDEAIKAQGAL 1780

Query: 113  NNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGS-SALSNKDTWSS 171
            NNC+LGNTTIFAE+P+++EV ++L  L          +GG G   R +  A    DTW  
Sbjct: 1781 NNCVLGNTTIFAESPAESEVAAILQQLGHGGQQQAGGSGGAGWGLRPTNKAGPPPDTW-- 1838

Query: 172  GGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGESM 225
               GG++SQLWG P  P+S  SLW    +D+ D  RATPSSLNS+LPGDLLGGESM
Sbjct: 1839 ---GGSSSQLWGAP--PTS-NSLWSNAGIDNSDQQRATPSSLNSYLPGDLLGGESM 1888


>gi|307185285|gb|EFN71385.1| Trinucleotide repeat-containing gene 6A protein [Camponotus
            floridanus]
          Length = 2022

 Score =  192 bits (488), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 125/236 (52%), Positives = 154/236 (65%), Gaps = 26/236 (11%)

Query: 3    SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQG-------- 52
            +++LWG P  K RGPPPG+   G    SNGW     G G G  +  +S            
Sbjct: 1800 TSELWGAPMSKARGPPPGLGSKGATNASNGW-----GAGLGTVSRSSSSWGLQSSSVSNT 1854

Query: 53   GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNL 112
            GW  TW+LL+NLTPQIDGSTLKTLC+QHGP+Q+F LY NH +AL KYS+R+EAIKAQG L
Sbjct: 1855 GWMSTWLLLRNLTPQIDGSTLKTLCMQHGPVQDFRLYQNHGIALTKYSSRDEAIKAQGAL 1914

Query: 113  NNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGS-SALSNKDTWSS 171
            NNC+LGNTTIFAE+P ++EV ++L  L         ++GG G   R +  A    DTW  
Sbjct: 1915 NNCVLGNTTIFAESPGESEVHTILQQLGHGGQQQAGSSGGAGWGLRPTNKAGPPPDTW-- 1972

Query: 172  GGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGESM 225
               GG++SQLWG P  P+S  SLW    +D+ D  RATPSSLNS+LPGDLLGGESM
Sbjct: 1973 ---GGSSSQLWGAP--PTS-NSLWSNAGIDNSDQQRATPSSLNSYLPGDLLGGESM 2022


>gi|322794818|gb|EFZ17765.1| hypothetical protein SINV_10484 [Solenopsis invicta]
          Length = 2013

 Score =  189 bits (481), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 122/234 (52%), Positives = 152/234 (64%), Gaps = 26/234 (11%)

Query: 3    SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGT--------SQPQG 52
            +++LWG P  K RGPPPG+   G    SNGW     G G G  +  +        +    
Sbjct: 1771 TSELWGAPMGKARGPPPGLSTKGATNASNGW-----GAGLGSVSRSSSSWGLQSSTVSNS 1825

Query: 53   GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNL 112
            GW  TW+LLKNLTPQIDGSTLKTLC+QHGP+Q+F LY NH +AL KYS+R+EAIKAQG L
Sbjct: 1826 GWMSTWLLLKNLTPQIDGSTLKTLCMQHGPVQDFRLYQNHGIALTKYSSRDEAIKAQGAL 1885

Query: 113  NNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGS-SALSNKDTWSS 171
            NNC+LGNTTIFAE+P+++EV ++L  L          +GG G   R +  A    DTW  
Sbjct: 1886 NNCVLGNTTIFAESPAESEVAAILQQLGHGGQQQAGGSGGAGWGLRPTNKAGPPPDTW-- 1943

Query: 172  GGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGE 223
               GG++SQLWG P  P+S  SLW    +D+ D  RATPSSLNS+LPGDLLGG+
Sbjct: 1944 ---GGSSSQLWGAP--PTS-NSLWSNAGIDNSDQQRATPSSLNSYLPGDLLGGD 1991


>gi|189240445|ref|XP_973043.2| PREDICTED: similar to gawky CG31992-PA [Tribolium castaneum]
          Length = 1014

 Score =  188 bits (478), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 121/229 (52%), Positives = 148/229 (64%), Gaps = 35/229 (15%)

Query: 3    SNDLWGPPKPRGPPPGMMGGGGKPPSNGWMVRPNGGGG--GGNTWGTSQPQGGWSGTWVL 60
            +++LW  PK RGPPPG+   GG    NGW    + GGG  G  +WG S         W+L
Sbjct: 815  TSELWAAPKSRGPPPGLSAKGGAL-VNGWSSAASWGGGQRGSGSWGGS--------PWLL 865

Query: 61   LKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNT 120
            L+NLT QIDGSTL+TLC+QHGPLQ+FHLYL+   ALAKYSTREEA KAQ  LNNC+LGNT
Sbjct: 866  LRNLTAQIDGSTLRTLCMQHGPLQSFHLYLHQGFALAKYSTREEATKAQTALNNCVLGNT 925

Query: 121  TIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSAL--SNKDTWSSGGGGGNT 178
            TI AE PS+ +  +LL  +++  ++       +G W RGS+    +  DTWS+G      
Sbjct: 926  TILAENPSEWDANALLQQVASQQSS-------SGAW-RGSTKQPSTGSDTWSTG------ 971

Query: 179  SQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGESM 225
               W   SN  S  SLWG+  LD+ D  RATPSSLNSFLPGDLLGGESM
Sbjct: 972  ---W---SNSQSSASLWGSTTLDTTDPARATPSSLNSFLPGDLLGGESM 1014


>gi|403182875|gb|EAT40858.2| AAEL007447-PA [Aedes aegypti]
          Length = 1541

 Score =  169 bits (428), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 111/230 (48%), Positives = 136/230 (59%), Gaps = 29/230 (12%)

Query: 5    DLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLK 62
            DLW  P  K R  PPG+   GGK  SNGW     G G  G   G +     WS TW+LLK
Sbjct: 1328 DLWDNPLGKSRVGPPGLKTAGGKLDSNGWSSHSAGSGAAGWNSGAAT----WSSTWILLK 1383

Query: 63   NLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTI 122
            NL+ QIDG TL+TLC+QHGPL  FHLYLNH +AL KYSTREEA KAQ  LNNC+LG+TTI
Sbjct: 1384 NLSAQIDGPTLRTLCIQHGPLLAFHLYLNHGIALCKYSTREEANKAQMALNNCMLGSTTI 1443

Query: 123  FAEAPSDAEVQSLLAHLSATANNNNNNNGGTGG--WARGSSALSNK------DTWSSGGG 174
             AE P++++VQ++L HL      N      +GG  W  G++A S        D W S   
Sbjct: 1444 CAETPTESDVQNILQHLGPPNGTNGLTGSQSGGQNWRLGAAAQSQSVRTPAADAWGSA-- 1501

Query: 175  GGNTSQLWGTPSNPSSGGSLWGAPPLD-SVDRATPSSLNSFLPGDLLGGE 223
                   W T     +G +LWG  PL+   DRATP++LNS+LP  LLG +
Sbjct: 1502 -------WPT---TGAGSNLWG--PLEGPSDRATPANLNSYLPESLLGTD 1539


>gi|312384473|gb|EFR29197.1| hypothetical protein AND_02094 [Anopheles darlingi]
          Length = 698

 Score =  168 bits (425), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 86/147 (58%), Positives = 101/147 (68%), Gaps = 14/147 (9%)

Query: 5   DLWG------PPKPRGPPPGMMGG------GGKPPSNGWMVRPNGGGGGGNTWGTSQPQG 52
           D+W       P  PRGPPPG+  G      GG   +NGW+ RP+    G   W      G
Sbjct: 448 DVWAGGSSGVPKTPRGPPPGLSSGKPAGTPGGPTGTNGWIQRPSHSSAG--NWSAGGATG 505

Query: 53  GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNL 112
            W  TW+LLKNLT QIDGSTL+TLC+QHGPLQNFHLYLNH +AL KY TREEA KAQ  L
Sbjct: 506 AWYSTWLLLKNLTAQIDGSTLRTLCMQHGPLQNFHLYLNHGIALCKYLTREEASKAQLAL 565

Query: 113 NNCILGNTTIFAEAPSDAEVQSLLAHL 139
           NNC+LGNTTI AE+P+D+EVQ++L HL
Sbjct: 566 NNCVLGNTTICAESPTDSEVQAILQHL 592


>gi|270012524|gb|EFA08972.1| hypothetical protein TcasGA2_TC006679 [Tribolium castaneum]
          Length = 1344

 Score =  168 bits (425), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 110/218 (50%), Positives = 135/218 (61%), Gaps = 35/218 (16%)

Query: 3    SNDLWGPPKPRGPPPGMMGGGGKPPSNGWMVRPNGGGG--GGNTWGTSQPQGGWSGTWVL 60
            +++LW  PK RGPPPG+   GG    NGW    + GGG  G  +WG S         W+L
Sbjct: 1132 TSELWAAPKSRGPPPGLSAKGGAL-VNGWSSAASWGGGQRGSGSWGGS--------PWLL 1182

Query: 61   LKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNT 120
            L+NLT QIDGSTL+TLC+QHGPLQ+FHLYL+   ALAKYSTREEA KAQ  LNNC+LGNT
Sbjct: 1183 LRNLTAQIDGSTLRTLCMQHGPLQSFHLYLHQGFALAKYSTREEATKAQTALNNCVLGNT 1242

Query: 121  TIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSAL--SNKDTWSSGGGGGNT 178
            TI AE PS+ +  +LL  +++           +G W RGS+    +  DTWS+G      
Sbjct: 1243 TILAENPSEWDANALLQQVASQQ-------SSSGAW-RGSTKQPSTGSDTWSTG------ 1288

Query: 179  SQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSF 214
               W   SN  S  SLWG+  LD+ D  RATPSSLNSF
Sbjct: 1289 ---W---SNSQSSASLWGSTTLDTTDPARATPSSLNSF 1320


>gi|157116102|ref|XP_001652769.1| hypothetical protein AaeL_AAEL007447 [Aedes aegypti]
          Length = 1270

 Score =  167 bits (424), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 111/230 (48%), Positives = 136/230 (59%), Gaps = 29/230 (12%)

Query: 5    DLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLK 62
            DLW  P  K R  PPG+   GGK  SNGW     G G  G   G +     WS TW+LLK
Sbjct: 1057 DLWDNPLGKSRVGPPGLKTAGGKLDSNGWSSHSAGSGAAGWNSGAAT----WSSTWILLK 1112

Query: 63   NLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTI 122
            NL+ QIDG TL+TLC+QHGPL  FHLYLNH +AL KYSTREEA KAQ  LNNC+LG+TTI
Sbjct: 1113 NLSAQIDGPTLRTLCIQHGPLLAFHLYLNHGIALCKYSTREEANKAQMALNNCMLGSTTI 1172

Query: 123  FAEAPSDAEVQSLLAHLSATANNNNNNNGGTGG--WARGSSALSNK------DTWSSGGG 174
             AE P++++VQ++L HL      N      +GG  W  G++A S        D W S   
Sbjct: 1173 CAETPTESDVQNILQHLGPPNGTNGLTGSQSGGQNWRLGAAAQSQSVRTPAADAWGSA-- 1230

Query: 175  GGNTSQLWGTPSNPSSGGSLWGAPPLD-SVDRATPSSLNSFLPGDLLGGE 223
                   W T     +G +LWG  PL+   DRATP++LNS+LP  LLG +
Sbjct: 1231 -------WPT---TGAGSNLWG--PLEGPSDRATPANLNSYLPESLLGTD 1268


>gi|357617904|gb|EHJ71059.1| hypothetical protein KGM_13480 [Danaus plexippus]
          Length = 1088

 Score =  162 bits (411), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 101/215 (46%), Positives = 117/215 (54%), Gaps = 40/215 (18%)

Query: 16   PPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGT----SQPQGGW-SGTWVLLKNLTPQIDG 70
            PP    GG KP  + W  +P     G N W      S+    W + TW+LL+NLT QIDG
Sbjct: 909  PPATSAGGLKP-LDVWGAKPRPAPPGLNKWPQHHVNSRAAPSWQTSTWLLLRNLTAQIDG 967

Query: 71   STLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDA 130
            STLKTLCVQHGPLQNFHLYLN  LALA+YSTREEA KAQ  LNNC+L NTTIFAE+P+++
Sbjct: 968  STLKTLCVQHGPLQNFHLYLNQGLALARYSTREEAAKAQMALNNCVLSNTTIFAESPAES 1027

Query: 131  EVQSLLAHLSATANNNNNNNGGTGGWARGSSALSNKDTWSSGGGGGNTSQLWGTPSNPSS 190
            +VQ +L HL +             GW                   G    LW        
Sbjct: 1028 DVQLILQHLGSGGGGAWRGGASKDGW------------------NGAFPGLWQE------ 1063

Query: 191  GGSLWGAPPLDSVDRATPSSLNSFLPGDLLGGESM 225
                          RATPSSLNSFLP DLLGGES+
Sbjct: 1064 ----------QHEQRATPSSLNSFLPPDLLGGESI 1088


>gi|157116104|ref|XP_001652770.1| hypothetical protein AaeL_AAEL007449 [Aedes aegypti]
 gi|108876634|gb|EAT40859.1| AAEL007449-PA [Aedes aegypti]
          Length = 1501

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 115/240 (47%), Positives = 139/240 (57%), Gaps = 42/240 (17%)

Query: 5    DLWGPP--KP-RGPPPGMMGGGGKPPS---NGWM----VRPNGGGGGGNTWGTSQPQGGW 54
            DLWG P  KP RGPPPG+  G  K  S   NGW     V+ +G GG    W +     GW
Sbjct: 1281 DLWGAPVGKPTRGPPPGL--GANKNVSSAPNGWPGSSGVQRSGSGG---NWPS-----GW 1330

Query: 55   SGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNN 114
              +W+LLKNLTPQID +TL+TLC+QHGPLQN  LY NH LAL KYS+REEA KAQ  LNN
Sbjct: 1331 GSSWLLLKNLTPQIDVATLRTLCMQHGPLQNLQLYANHGLALIKYSSREEANKAQQALNN 1390

Query: 115  CILGNTTIFAEAPSDAEVQSLLAHLSATANNNN--------NNNGGTGGWA---RGSSAL 163
            C LG++TI AE PSD EVQ+ L  L   A +          N++GG    A   R +   
Sbjct: 1391 CPLGSSTIGAECPSDTEVQAYLQQLGTQAGSITSNAMVAPPNSSGGVTSVAQSWRQAPRT 1450

Query: 164  SNKDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSFLPGDLLGGE 223
               DTW SG         W  P   +  GS++ AP   + +R+TPS+LNSFLP  LLG E
Sbjct: 1451 GGSDTWGSG---------W--PPTSTGTGSMFWAPIEGATERSTPSNLNSFLPESLLGSE 1499


>gi|170049180|ref|XP_001854407.1| gawky [Culex quinquefasciatus]
 gi|167871061|gb|EDS34444.1| gawky [Culex quinquefasciatus]
          Length = 1406

 Score =  154 bits (389), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 104/185 (56%), Positives = 124/185 (67%), Gaps = 18/185 (9%)

Query: 54   WSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLN 113
            W  TWVLLKNLT QIDGSTL+TLC+QHGP+QNFHLYLNH +AL KY +REEA KAQ  LN
Sbjct: 1223 WYSTWVLLKNLTAQIDGSTLRTLCMQHGPVQNFHLYLNHGIALCKYLSREEANKAQQALN 1282

Query: 114  NCILGNTTIFAEAPSDAEVQSLLAHLSAT--ANNNNNNNGGTGGWARGSSAL------SN 165
            NC+LGNTTI AE+P  +EVQ++L HL     ANNNN NN   G    GSS L      +N
Sbjct: 1283 NCVLGNTTICAESPLASEVQTILQHLGIPGGANNNNINNNNNGNINVGSSGLGNNNNNNN 1342

Query: 166  KDTWSSGGGGG----NTSQLWGT--PSNPSSGGSLWGAPPLD-SVDRATPSSLNSFLPGD 218
               W S G       + +  WG+  PS+  +G +LW   PLD   +R TPS+LNSFLP +
Sbjct: 1343 AQPWRSSGSQQANIRSAADTWGSGWPSS-GAGANLWT--PLDGPTERGTPSNLNSFLPEN 1399

Query: 219  LLGGE 223
            LLGGE
Sbjct: 1400 LLGGE 1404


>gi|321479469|gb|EFX90425.1| hypothetical protein DAPPUDRAFT_300013 [Daphnia pulex]
          Length = 1645

 Score =  150 bits (378), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 113/271 (41%), Positives = 137/271 (50%), Gaps = 56/271 (20%)

Query: 3    SNDLWGPP-KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQP---------QG 52
            S D W  P KPRGPPPG+    G        V P   G   +  G+  P          G
Sbjct: 1383 SADPWSAPNKPRGPPPGITPSAG--------VGPKTAGRDWSAAGSRSPWPGTNTTGSGG 1434

Query: 53   GWSGT------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAI 106
             W+G+      W++L+NLTPQIDGSTLKTLCVQHGPL NFHLYLNH +AL +YST EEA 
Sbjct: 1435 TWAGSLNGSSSWLVLRNLTPQIDGSTLKTLCVQHGPLHNFHLYLNHGVALIRYSTGEEAA 1494

Query: 107  KAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNG-----------GTGG 155
            KAQ  LNNC+LGNTTI+A+  ++++VQ  L  L   A                    + G
Sbjct: 1495 KAQSALNNCVLGNTTIYADLANESDVQGWLQQLGMPAQQQQQQQQQQNQQQQQQAVSSSG 1554

Query: 156  W-ARGSS---------ALSNKDTWSSGGGGGNTSQLWGTPSNPSSGG-------SLWGAP 198
            W  RGS+                 ++  G  N    WG+ +   S         S+W  P
Sbjct: 1555 WGVRGSTPGAGSNSGSGNGGGVGSNASKGSANAGDNWGSGAGGPSSSPWSTGPNSVWSTP 1614

Query: 199  PLDSVDRATPS----SLNSFLPGDLLGGESM 225
             LD   R TPS    SLNSFLPGDLLG ESM
Sbjct: 1615 NLDRDLRTTPSSLNASLNSFLPGDLLGNESM 1645


>gi|157116106|ref|XP_001652771.1| hypothetical protein AaeL_AAEL007436 [Aedes aegypti]
 gi|108876635|gb|EAT40860.1| AAEL007436-PA, partial [Aedes aegypti]
          Length = 1086

 Score =  149 bits (377), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 83/132 (62%), Positives = 96/132 (72%), Gaps = 6/132 (4%)

Query: 12   PRGPPPGMMGGGGKPP----SNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKNLTPQ 67
            PRGPPPG+    GK P    SNGW  RP  GG    T G      GW  TW+LLKNLT Q
Sbjct: 928  PRGPPPGL--SAGKNPGGFGSNGWNQRPGPGGNNWPTGGGGGGGPGWYSTWILLKNLTTQ 985

Query: 68   IDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAP 127
            IDG TL+TLC+QHGPLQNFHLYLNH +AL KY +REEA KAQ  LNNC+LGNTTI AE+P
Sbjct: 986  IDGPTLRTLCMQHGPLQNFHLYLNHGIALCKYQSREEANKAQQALNNCVLGNTTICAESP 1045

Query: 128  SDAEVQSLLAHL 139
            +++EVQ++L HL
Sbjct: 1046 TESEVQTILQHL 1057


>gi|158297477|ref|XP_317704.4| AGAP007802-PA [Anopheles gambiae str. PEST]
 gi|157015214|gb|EAA12443.4| AGAP007802-PA [Anopheles gambiae str. PEST]
          Length = 1218

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 81/136 (59%), Positives = 95/136 (69%), Gaps = 8/136 (5%)

Query: 12   PRGPPPGMMGGGGKPPSNG--------WMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKN 63
            PRGPPPG+  G      +         WM R + G GG  + G     G W  TW+LLKN
Sbjct: 1046 PRGPPPGLSSGKVSGSGSVGGGPGSNGWMPRTSHGQGGNWSAGGGGASGSWYSTWLLLKN 1105

Query: 64   LTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIF 123
            LT QIDGSTL+TLC+QHGPLQNFHLYLNH +AL KY TREEA KAQ  LNNC+LGNTTI 
Sbjct: 1106 LTAQIDGSTLRTLCMQHGPLQNFHLYLNHGIALCKYLTREEANKAQLALNNCVLGNTTIC 1165

Query: 124  AEAPSDAEVQSLLAHL 139
            AE+P+D+EVQ++L HL
Sbjct: 1166 AESPTDSEVQTILQHL 1181


>gi|442614446|ref|NP_001014691.2| gawky, isoform J [Drosophila melanogaster]
 gi|440218154|gb|AAX52511.2| gawky, isoform J [Drosophila melanogaster]
          Length = 1382

 Score =  144 bits (362), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 81/167 (48%), Positives = 101/167 (60%), Gaps = 36/167 (21%)

Query: 2    SSNDLWGPP----KPRGPPPGMMGGGGKPP-------------SNGWMVRPNGGG----- 39
            ++++LW  P      RGPPPG+     K               +NGW+ +P  GG     
Sbjct: 1044 ATSELWTSPLNKSSSRGPPPGLTANSNKSANSNASTPTTITGGANGWL-QPRSGGVQTTN 1102

Query: 40   ----GGGNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLA 95
                GG  TWG+S         W+LLKNLT QIDG TL+TLC+QHGPL +FH YLN  +A
Sbjct: 1103 TNWTGGNTTWGSS---------WLLLKNLTAQIDGPTLRTLCMQHGPLVSFHPYLNQGIA 1153

Query: 96   LAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSAT 142
            L KY+TREEA KAQ  LNNC+L NTTIFAE+PS+ EVQS++ HL  T
Sbjct: 1154 LCKYTTREEANKAQMALNNCVLANTTIFAESPSENEVQSIMQHLPQT 1200


>gi|24638679|ref|NP_726596.1| gawky, isoform A [Drosophila melanogaster]
 gi|24638681|ref|NP_726597.1| gawky, isoform B [Drosophila melanogaster]
 gi|24638687|ref|NP_726600.1| gawky, isoform E [Drosophila melanogaster]
 gi|24638689|ref|NP_726601.1| gawky, isoform F [Drosophila melanogaster]
 gi|75017682|sp|Q8SY33.1|GAWKY_DROME RecName: Full=Protein Gawky
 gi|18447359|gb|AAL68245.1| LD47780p [Drosophila melanogaster]
 gi|22759367|gb|AAF59323.2| gawky, isoform A [Drosophila melanogaster]
 gi|22759368|gb|AAF59322.2| gawky, isoform B [Drosophila melanogaster]
 gi|22759371|gb|AAN06508.1| gawky, isoform E [Drosophila melanogaster]
 gi|22759372|gb|AAN06509.1| gawky, isoform F [Drosophila melanogaster]
          Length = 1384

 Score =  144 bits (362), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 81/167 (48%), Positives = 101/167 (60%), Gaps = 36/167 (21%)

Query: 2    SSNDLWGPP----KPRGPPPGMMGGGGKPP-------------SNGWMVRPNGGG----- 39
            ++++LW  P      RGPPPG+     K               +NGW+ +P  GG     
Sbjct: 1046 ATSELWTSPLNKSSSRGPPPGLTANSNKSANSNASTPTTITGGANGWL-QPRSGGVQTTN 1104

Query: 40   ----GGGNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLA 95
                GG  TWG+S         W+LLKNLT QIDG TL+TLC+QHGPL +FH YLN  +A
Sbjct: 1105 TNWTGGNTTWGSS---------WLLLKNLTAQIDGPTLRTLCMQHGPLVSFHPYLNQGIA 1155

Query: 96   LAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSAT 142
            L KY+TREEA KAQ  LNNC+L NTTIFAE+PS+ EVQS++ HL  T
Sbjct: 1156 LCKYTTREEANKAQMALNNCVLANTTIFAESPSENEVQSIMQHLPQT 1202


>gi|442614444|ref|NP_726599.2| gawky, isoform I [Drosophila melanogaster]
 gi|440218153|gb|AAN06507.2| gawky, isoform I [Drosophila melanogaster]
          Length = 1381

 Score =  144 bits (362), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 81/167 (48%), Positives = 101/167 (60%), Gaps = 36/167 (21%)

Query: 2    SSNDLWGPP----KPRGPPPGMMGGGGKPP-------------SNGWMVRPNGGG----- 39
            ++++LW  P      RGPPPG+     K               +NGW+ +P  GG     
Sbjct: 1043 ATSELWTSPLNKSSSRGPPPGLTANSNKSANSNASTPTTITGGANGWL-QPRSGGVQTTN 1101

Query: 40   ----GGGNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLA 95
                GG  TWG+S         W+LLKNLT QIDG TL+TLC+QHGPL +FH YLN  +A
Sbjct: 1102 TNWTGGNTTWGSS---------WLLLKNLTAQIDGPTLRTLCMQHGPLVSFHPYLNQGIA 1152

Query: 96   LAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSAT 142
            L KY+TREEA KAQ  LNNC+L NTTIFAE+PS+ EVQS++ HL  T
Sbjct: 1153 LCKYTTREEANKAQMALNNCVLANTTIFAESPSENEVQSIMQHLPQT 1199


>gi|195354411|ref|XP_002043691.1| GM26772 [Drosophila sechellia]
 gi|194128879|gb|EDW50922.1| GM26772 [Drosophila sechellia]
          Length = 1385

 Score =  143 bits (360), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 80/155 (51%), Positives = 99/155 (63%), Gaps = 18/155 (11%)

Query: 2    SSNDLWGPP----KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTW-----GTSQPQ- 51
            ++++LW  P      RGPPPG+     K  +N     P    GG N W     G+ QP  
Sbjct: 1047 ATSELWTSPLNKSSSRGPPPGLTANSNKS-ANCNTSTPTTITGGANGWLQPRSGSVQPTN 1105

Query: 52   ----GG---WSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREE 104
                GG   W  +W+LLKNLT QIDG TL+TLC+QHGPL +FH YLN  +AL KY+TREE
Sbjct: 1106 TNWTGGNTTWGSSWLLLKNLTAQIDGPTLRTLCMQHGPLVSFHPYLNQGIALCKYTTREE 1165

Query: 105  AIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHL 139
            A KAQ  LNNC+L NTTIFAE+PS+ EVQS++ HL
Sbjct: 1166 ANKAQMALNNCVLANTTIFAESPSETEVQSIMQHL 1200


>gi|195564306|ref|XP_002105762.1| GD24374 [Drosophila simulans]
 gi|194201637|gb|EDX15213.1| GD24374 [Drosophila simulans]
          Length = 1164

 Score =  143 bits (360), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 81/158 (51%), Positives = 100/158 (63%), Gaps = 18/158 (11%)

Query: 2    SSNDLWGPP----KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTW-----GTSQPQ- 51
            ++++LW  P      RGPPPG+     K  +N     P    GG N W     G+ QP  
Sbjct: 986  ATSELWTSPLNKSSSRGPPPGLTANSNKS-ANSNTSTPTTITGGANGWLQPRSGSVQPTN 1044

Query: 52   ----GG---WSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREE 104
                GG   W  +W+LLKNLT QIDG TL+TLC+QHGPL +FH YLN  +AL KY+TREE
Sbjct: 1045 TNWTGGNTTWGSSWLLLKNLTAQIDGPTLRTLCMQHGPLVSFHPYLNQGIALCKYTTREE 1104

Query: 105  AIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSAT 142
            A KAQ  LNNC+L NTTIFAE+PS+ EVQS++ HL  T
Sbjct: 1105 ANKAQMALNNCVLANTTIFAESPSETEVQSIMQHLPQT 1142


>gi|427794461|gb|JAA62682.1| Hypothetical protein, partial [Rhipicephalus pulchellus]
          Length = 967

 Score =  142 bits (359), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 103/262 (39%), Positives = 129/262 (49%), Gaps = 63/262 (24%)

Query: 5   DLWGP-PKPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKN 63
           D W P PK RGPPPG+        S+GW + P      GN             ++++LKN
Sbjct: 728 DPWNPAPKIRGPPPGLS-------SSGWELHPVKQTSSGN------------NSFLVLKN 768

Query: 64  LTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIF 123
           LTPQIDGSTLKTLC+QHGPLQ FHL+L H LALA+YST EEA KAQ  L+NC+L NTT+ 
Sbjct: 769 LTPQIDGSTLKTLCMQHGPLQLFHLFLKHGLALAQYSTCEEASKAQSALHNCVLSNTTMV 828

Query: 124 AEAPSDAEVQSLLAHL------------------SATANNNNNNNGGTGGWARGSSALSN 165
           A  P++ EV   L  L                  +  A  N  +      +A    +   
Sbjct: 829 AYIPNEVEVAQFLQQLGNGLGQHPSSQQQQHQQQAWGAPTNAYHPPRPAQFAPSRPSKQP 888

Query: 166 KDTWSS---------GGGGGNTSQLWGTPSNPSSGGSLWGAPPL---------DSVDR-- 205
            + W++              NT+ LW   S P +  SLW AP           + +D   
Sbjct: 889 AEPWNTAAPPSVSSAAVSSSNTNHLW---SFPGAASSLWAAPQTSQAGGSSGSNQIDHDH 945

Query: 206 --ATPSSLNSFLPGDLLGGESM 225
                SSLNSFLPGDLL GESM
Sbjct: 946 GGGPQSSLNSFLPGDLLNGESM 967


>gi|195172562|ref|XP_002027066.1| GL18179 [Drosophila persimilis]
 gi|194112844|gb|EDW34887.1| GL18179 [Drosophila persimilis]
          Length = 1226

 Score =  142 bits (359), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 82/180 (45%), Positives = 108/180 (60%), Gaps = 22/180 (12%)

Query: 2    SSNDLWGPP----KPRGPPPGMMGGGGKP----------------PSNGWMVRPNGGGGG 41
            S+++LW  P      RGPPPG+     K                  +NGW+   +G    
Sbjct: 897  STSELWTSPLNKASSRGPPPGLTTNANKSGNGVSGVTSTSSTIAGSANGWLQTRSGVPTT 956

Query: 42   GNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYST 101
              T   +     WS +W+LLKNLT QIDG TL+TLC+QHGPL +FHLYLN  +AL KY+T
Sbjct: 957  NTT--WTGGNTSWSSSWLLLKNLTAQIDGPTLRTLCMQHGPLVSFHLYLNQGIALCKYTT 1014

Query: 102  REEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSS 161
            REEA KAQ  LNNC+LGNTTIFAE PS+ EVQ++L HL    ++ N+  G + G + G++
Sbjct: 1015 REEASKAQMALNNCVLGNTTIFAETPSENEVQNILQHLPQVPSSTNSAIGSSVGSSVGTA 1074


>gi|427797345|gb|JAA64124.1| Putative trinucleotide repeat-containing protein, partial
           [Rhipicephalus pulchellus]
          Length = 804

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 103/262 (39%), Positives = 129/262 (49%), Gaps = 63/262 (24%)

Query: 5   DLWGP-PKPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKN 63
           D W P PK RGPPPG+        S+GW + P      GN             ++++LKN
Sbjct: 565 DPWNPAPKIRGPPPGLS-------SSGWELHPVKQTSSGNN------------SFLVLKN 605

Query: 64  LTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIF 123
           LTPQIDGSTLKTLC+QHGPLQ FHL+L H LALA+YST EEA KAQ  L+NC+L NTT+ 
Sbjct: 606 LTPQIDGSTLKTLCMQHGPLQLFHLFLKHGLALAQYSTCEEASKAQSALHNCVLSNTTMV 665

Query: 124 AEAPSDAEVQSLLAHL------------------SATANNNNNNNGGTGGWARGSSALSN 165
           A  P++ EV   L  L                  +  A  N  +      +A    +   
Sbjct: 666 AYIPNEVEVAQFLQQLGNGLGQHPSSQQQQHQQQAWGAPTNAYHPPRPAQFAPSRPSKQP 725

Query: 166 KDTWSS---------GGGGGNTSQLWGTPSNPSSGGSLWGAPPL---------DSVDR-- 205
            + W++              NT+ LW   S P +  SLW AP           + +D   
Sbjct: 726 AEPWNTAAPPSVSSAAVSSSNTNHLW---SFPGAASSLWAAPQTSQAGGSSGSNQIDHDH 782

Query: 206 --ATPSSLNSFLPGDLLGGESM 225
                SSLNSFLPGDLL GESM
Sbjct: 783 GGGPQSSLNSFLPGDLLNGESM 804


>gi|198462026|ref|XP_001352316.2| GA16600 [Drosophila pseudoobscura pseudoobscura]
 gi|198140166|gb|EAL29242.2| GA16600 [Drosophila pseudoobscura pseudoobscura]
          Length = 1396

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 82/180 (45%), Positives = 108/180 (60%), Gaps = 22/180 (12%)

Query: 2    SSNDLWGPP----KPRGPPPGMMGGGGKP----------------PSNGWMVRPNGGGGG 41
            S+++LW  P      RGPPPG+     K                  +NGW+   +G    
Sbjct: 1067 STSELWTSPLNKASSRGPPPGLTTNANKSGNGVSGVTSTSSTIAGSANGWLQTRSGVPTT 1126

Query: 42   GNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYST 101
              T   +     WS +W+LLKNLT QIDG TL+TLC+QHGPL +FHLYLN  +AL KY+T
Sbjct: 1127 NTT--WTGGNTSWSSSWLLLKNLTAQIDGPTLRTLCMQHGPLVSFHLYLNQGIALCKYTT 1184

Query: 102  REEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSS 161
            REEA KAQ  LNNC+LGNTTIFAE PS+ EVQ++L HL    ++ N+  G + G + G++
Sbjct: 1185 REEASKAQMALNNCVLGNTTIFAETPSENEVQNILQHLPQVPSSTNSAIGSSVGSSVGTA 1244


>gi|194770632|ref|XP_001967395.1| GF19039 [Drosophila ananassae]
 gi|190618126|gb|EDV33650.1| GF19039 [Drosophila ananassae]
          Length = 1375

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 78/161 (48%), Positives = 100/161 (62%), Gaps = 28/161 (17%)

Query: 2    SSNDLWGPP----KPRGPPPGMMGGGGKPP---------------SNGWMVRPNGGGGGG 42
            S+++LW  P      RGPPPG+     K                 +NGW+        GG
Sbjct: 1029 STSELWTSPLNKSSSRGPPPGLTASSNKSGNGGSTTSTSTAISGGANGWLQT-----RGG 1083

Query: 43   NTWGTSQPQGG----WSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAK 98
            +   TS    G    WS +W+LLKNLT QIDGSTL+TLC+QHGPL +FHLYL+  +AL K
Sbjct: 1084 SVQATSTTWSGGNAPWSSSWLLLKNLTAQIDGSTLRTLCMQHGPLVSFHLYLSQGIALCK 1143

Query: 99   YSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHL 139
            Y+TREEA KAQ  LNNC+L NTTIFAE+P++ EVQ+++ HL
Sbjct: 1144 YATREEANKAQMALNNCVLANTTIFAESPNENEVQNIMQHL 1184


>gi|427793715|gb|JAA62309.1| Putative alpha-1 collagen type iii, partial [Rhipicephalus
            pulchellus]
          Length = 1160

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 103/262 (39%), Positives = 129/262 (49%), Gaps = 63/262 (24%)

Query: 5    DLWGP-PKPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKN 63
            D W P PK RGPPPG+        S+GW + P      GN             ++++LKN
Sbjct: 921  DPWNPAPKIRGPPPGLS-------SSGWELHPVKQTSSGNN------------SFLVLKN 961

Query: 64   LTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIF 123
            LTPQIDGSTLKTLC+QHGPLQ FHL+L H LALA+YST EEA KAQ  L+NC+L NTT+ 
Sbjct: 962  LTPQIDGSTLKTLCMQHGPLQLFHLFLKHGLALAQYSTCEEASKAQSALHNCVLSNTTMV 1021

Query: 124  AEAPSDAEVQSLLAHL------------------SATANNNNNNNGGTGGWARGSSALSN 165
            A  P++ EV   L  L                  +  A  N  +      +A    +   
Sbjct: 1022 AYIPNEVEVAQFLQQLGNGLGQHPSSQQQQHQQQAWGAPTNAYHPPRPAQFAPSRPSKQP 1081

Query: 166  KDTWSS---------GGGGGNTSQLWGTPSNPSSGGSLWGAPPL---------DSVDR-- 205
             + W++              NT+ LW   S P +  SLW AP           + +D   
Sbjct: 1082 AEPWNTAAPPSVSSAAVSSSNTNHLW---SFPGAASSLWAAPQTSQAGGSSGSNQIDHDH 1138

Query: 206  --ATPSSLNSFLPGDLLGGESM 225
                 SSLNSFLPGDLL GESM
Sbjct: 1139 GGGPQSSLNSFLPGDLLNGESM 1160


>gi|410981860|ref|XP_003997284.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
            [Felis catus]
          Length = 1727

 Score =  140 bits (354), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 100/238 (42%), Positives = 131/238 (55%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1495 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1551

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1552 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1611

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWAR-----GSSALSNKDT--WS 170
            GNTTI AE   + EV   LA   A  + +   +G   G AR     GS  L   DT  WS
Sbjct: 1612 GNTTILAEFAGEEEVNRFLAQGQAVPSTSGWQSGTGAGQARLGASGGSHGLVRSDTGHWS 1671

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
            +    G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1672 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1727


>gi|427788417|gb|JAA59660.1| Putative trinucleotide repeat-containing protein [Rhipicephalus
            pulchellus]
          Length = 1449

 Score =  140 bits (353), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 103/262 (39%), Positives = 129/262 (49%), Gaps = 63/262 (24%)

Query: 5    DLWGP-PKPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKN 63
            D W P PK RGPPPG+        S+GW + P      GN             ++++LKN
Sbjct: 1210 DPWNPAPKIRGPPPGLS-------SSGWELHPVKQTSSGNN------------SFLVLKN 1250

Query: 64   LTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIF 123
            LTPQIDGSTLKTLC+QHGPLQ FHL+L H LALA+YST EEA KAQ  L+NC+L NTT+ 
Sbjct: 1251 LTPQIDGSTLKTLCMQHGPLQLFHLFLKHGLALAQYSTCEEASKAQSALHNCVLSNTTMV 1310

Query: 124  AEAPSDAEVQSLLAHL------------------SATANNNNNNNGGTGGWARGSSALSN 165
            A  P++ EV   L  L                  +  A  N  +      +A    +   
Sbjct: 1311 AYIPNEVEVAQFLQQLGNGLGQHPSSQQQQHQQQAWGAPTNAYHPPRPAQFAPSRPSKQP 1370

Query: 166  KDTWSS---------GGGGGNTSQLWGTPSNPSSGGSLWGAPPL---------DSVDR-- 205
             + W++              NT+ LW   S P +  SLW AP           + +D   
Sbjct: 1371 AEPWNTAAPPSVSSAAVSSSNTNHLW---SFPGAASSLWAAPQTSQAGGSSGSNQIDHDH 1427

Query: 206  --ATPSSLNSFLPGDLLGGESM 225
                 SSLNSFLPGDLL GESM
Sbjct: 1428 GGGPQSSLNSFLPGDLLNGESM 1449


>gi|410981862|ref|XP_003997285.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
            [Felis catus]
          Length = 1691

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 100/238 (42%), Positives = 131/238 (55%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1459 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1515

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1516 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1575

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWAR-----GSSALSNKDT--WS 170
            GNTTI AE   + EV   LA   A  + +   +G   G AR     GS  L   DT  WS
Sbjct: 1576 GNTTILAEFAGEEEVNRFLAQGQAVPSTSGWQSGTGAGQARLGASGGSHGLVRSDTGHWS 1635

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
            +    G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1636 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1691


>gi|427788411|gb|JAA59657.1| Putative trinucleotide repeat-containing protein [Rhipicephalus
            pulchellus]
          Length = 1471

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 103/262 (39%), Positives = 129/262 (49%), Gaps = 63/262 (24%)

Query: 5    DLWGP-PKPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKN 63
            D W P PK RGPPPG+        S+GW + P      GN             ++++LKN
Sbjct: 1232 DPWNPAPKIRGPPPGLS-------SSGWELHPVKQTSSGNN------------SFLVLKN 1272

Query: 64   LTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIF 123
            LTPQIDGSTLKTLC+QHGPLQ FHL+L H LALA+YST EEA KAQ  L+NC+L NTT+ 
Sbjct: 1273 LTPQIDGSTLKTLCMQHGPLQLFHLFLKHGLALAQYSTCEEASKAQSALHNCVLSNTTMV 1332

Query: 124  AEAPSDAEVQSLLAHL------------------SATANNNNNNNGGTGGWARGSSALSN 165
            A  P++ EV   L  L                  +  A  N  +      +A    +   
Sbjct: 1333 AYIPNEVEVAQFLQQLGNGLGQHPSSQQQQHQQQAWGAPTNAYHPPRPAQFAPSRPSKQP 1392

Query: 166  KDTWSS---------GGGGGNTSQLWGTPSNPSSGGSLWGAPPL---------DSVDR-- 205
             + W++              NT+ LW   S P +  SLW AP           + +D   
Sbjct: 1393 AEPWNTAAPPSVSSAAVSSSNTNHLW---SFPGAASSLWAAPQTSQAGGSSGSNQIDHDH 1449

Query: 206  --ATPSSLNSFLPGDLLGGESM 225
                 SSLNSFLPGDLL GESM
Sbjct: 1450 GGGPQSSLNSFLPGDLLNGESM 1471


>gi|449283096|gb|EMC89799.1| Trinucleotide repeat-containing gene 6C protein [Columba livia]
          Length = 1719

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 101/238 (42%), Positives = 134/238 (56%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1487 SHELWKVPRNTTAPTRPPPGLTN---TKPSSTWGASPLGWTSSYSSGSAWSTDSSGRTSS 1543

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1544 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1603

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTGGWARGSS----ALSNKDT--WS 170
            GNTTI AE   + EV   LA   A    ++  +N G+G    GSS    AL   DT  W+
Sbjct: 1604 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSNSGSGQTRLGSSSSSHALVRSDTGHWN 1663

Query: 171  --SGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRA-TPSSLNSFLPGDLLGGESM 225
                GG G++  LWG   +P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1664 PPCLGGKGSSDLLWG--GDPQCSSSLWGPPSTDDGGVIGSPTPLNTLLPGDLLSGESI 1719


>gi|74210597|dbj|BAE23657.1| unnamed protein product [Mus musculus]
          Length = 727

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 100/253 (39%), Positives = 138/253 (54%), Gaps = 35/253 (13%)

Query: 3   SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
           +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 480 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 537

Query: 55  SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
             +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 538 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 597

Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
           AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 598 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 657

Query: 161 SALSNKDTWSSGGGGG------NTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
           S+ ++ + W+  G  G      + + LWGT   P    SLWG P  D    ++PS +N+F
Sbjct: 658 SSRTDVNHWNGAGLSGANCGDLHGTSLWGT---PHYSTSLWGPPSSDPRGISSPSPINAF 714

Query: 215 LPGDLL--GGESM 225
           L  D L  GGESM
Sbjct: 715 LSVDHLGGGGESM 727


>gi|26336695|dbj|BAC32030.1| unnamed protein product [Mus musculus]
          Length = 627

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 100/253 (39%), Positives = 138/253 (54%), Gaps = 35/253 (13%)

Query: 3   SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
           +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 380 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 437

Query: 55  SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
             +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 438 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 497

Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
           AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 498 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 557

Query: 161 SALSNKDTWSSGGGGG------NTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
           S+ ++ + W+  G  G      + + LWGT   P    SLWG P  D    ++PS +N+F
Sbjct: 558 SSRTDVNHWNGAGLSGANCGDLHGTSLWGT---PHYSTSLWGPPSSDPRGISSPSPINAF 614

Query: 215 LPGDLL--GGESM 225
           L  D L  GGESM
Sbjct: 615 LSVDHLGGGGESM 627


>gi|26341344|dbj|BAC34334.1| unnamed protein product [Mus musculus]
          Length = 337

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 100/253 (39%), Positives = 139/253 (54%), Gaps = 35/253 (13%)

Query: 3   SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
           +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 90  AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 147

Query: 55  SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
             +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 148 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 207

Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
           AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 208 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 267

Query: 161 SALSNKDTWSSGGGGG------NTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
           S+ ++ + W+  G  G      + + LWGTP   +   SLWG P  D    ++PS +N+F
Sbjct: 268 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 324

Query: 215 LPGDLL--GGESM 225
           L  D L  GGESM
Sbjct: 325 LSVDHLGGGGESM 337


>gi|195450735|ref|XP_002072610.1| GK13697 [Drosophila willistoni]
 gi|194168695|gb|EDW83596.1| GK13697 [Drosophila willistoni]
          Length = 1437

 Score =  139 bits (351), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 62/89 (69%), Positives = 76/89 (85%)

Query: 54   WSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLN 113
            WS +W+LLKNLT QIDG TL+TLC+QHGPL +FHLYLN  +AL KY+TREE+ KAQ  LN
Sbjct: 1166 WSSSWLLLKNLTAQIDGPTLRTLCMQHGPLVSFHLYLNQGIALCKYATREESNKAQMTLN 1225

Query: 114  NCILGNTTIFAEAPSDAEVQSLLAHLSAT 142
            NC+LGNTTIFAE+P++AEVQ++L HL  T
Sbjct: 1226 NCVLGNTTIFAESPNEAEVQNILQHLPQT 1254


>gi|344241798|gb|EGV97901.1| Trinucleotide repeat-containing gene 6C protein [Cricetulus
           griseus]
          Length = 802

 Score =  139 bits (351), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 97/243 (39%), Positives = 129/243 (53%), Gaps = 30/243 (12%)

Query: 3   SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
           S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 570 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDASGRTSS 626

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 627 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 686

Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN------------NNNGGTGGWARGSSALSN 165
           GNTTI AE   + EV   LA   A    ++             ++G T G  R  +A   
Sbjct: 687 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQPSSGGSQPRLGSSGSTHGLVRSDTA--- 743

Query: 166 KDTWSSG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRA-TPSSLNSFLPGDLLGG 222
              WS+    G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL G
Sbjct: 744 --HWSTPCLSGKGSSELLWG--GVPQYSSSLWGPPSADDARVIGSPTPLNTLLPGDLLSG 799

Query: 223 ESM 225
           ESM
Sbjct: 800 ESM 802


>gi|74218630|dbj|BAE25197.1| unnamed protein product [Mus musculus]
 gi|74218632|dbj|BAE25198.1| unnamed protein product [Mus musculus]
          Length = 500

 Score =  139 bits (351), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 100/253 (39%), Positives = 138/253 (54%), Gaps = 35/253 (13%)

Query: 3   SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
           +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 253 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 310

Query: 55  SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
             +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 311 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 370

Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
           AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 371 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 430

Query: 161 SALSNKDTWSSGGGGG------NTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
           S+ ++ + W+  G  G      + + LWGT   P    SLWG P  D    ++PS +N+F
Sbjct: 431 SSRTDVNHWNGAGLSGANCGDLHGTSLWGT---PHYSTSLWGPPSSDPRGISSPSPINAF 487

Query: 215 LPGDLL--GGESM 225
           L  D L  GGESM
Sbjct: 488 LSVDHLGGGGESM 500


>gi|391334963|ref|XP_003741867.1| PREDICTED: uncharacterized protein LOC100898741 [Metaseiulus
            occidentalis]
          Length = 1067

 Score =  139 bits (351), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 98/214 (45%), Positives = 117/214 (54%), Gaps = 32/214 (14%)

Query: 39   GGGGNTWGTS-QPQGGWSGT----------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFH 87
            G  G  WG S QP    SG           +++LKNLT QIDGSTLKTLC+QHGP+Q FH
Sbjct: 859  GSNGKKWGESDQPGSILSGLPGPVSPTGKGFLVLKNLTAQIDGSTLKTLCIQHGPVQLFH 918

Query: 88   LYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNN 147
            L+LNH  AL +Y TREEA+KA+  LNNC+L NTTI A  PS+ EVQ LL   +  + N  
Sbjct: 919  LFLNHGFALIQYMTREEALKAESALNNCVLSNTTILAYVPSEREVQQLLYLANYQSLNQG 978

Query: 148  NNN---------GGTGGWARGSSALSNKDTWSSG------GGGGNTSQLWGTPSNPSSG- 191
              N                +G+  L   +   SG      G      Q  G PS  +SG 
Sbjct: 979  RPNPQQQQQQQQQQANHSQQGNPQLQGVNPQQSGPRLPPSGVLQQPQQNCGWPSAANSGA 1038

Query: 192  GSLWGAPPLDSVDRATPSSLNSFLPGDLLGGESM 225
            G+LWG P  D+ D A    LNSFLPGDLL GESM
Sbjct: 1039 GALWGPP--DANDTA---PLNSFLPGDLLSGESM 1067


>gi|444727787|gb|ELW68265.1| Trinucleotide repeat-containing 6C protein [Tupaia chinensis]
          Length = 922

 Score =  139 bits (350), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 97/238 (40%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3   SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
           S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 690 SHELWKVPRNTTAPTRPPPGLTNPK---PSSTWGASPLGWTSSYSSGSAWSADTSGRTSS 746

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 747 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 806

Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTGGWARGSSALSNKDTWSSG---- 172
           GNTTI AE   + EV   LA   A    ++  ++GGT     G+S  S+    S      
Sbjct: 807 GNTTILAEFAGEEEVNRFLAQGQALPTTSSWQSSGGTSQPRLGASGSSHGLVRSDAGHWN 866

Query: 173 ----GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
               G  GN+  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 867 APCLGAKGNSELLWG--GVPQYSSSLWGPPSADDGRVIGSPTPLNTLLPGDLLSGESI 922


>gi|195402239|ref|XP_002059714.1| GJ14351 [Drosophila virilis]
 gi|194155928|gb|EDW71112.1| GJ14351 [Drosophila virilis]
          Length = 1377

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 80/154 (51%), Positives = 99/154 (64%), Gaps = 19/154 (12%)

Query: 2    SSNDLWGPP----KPRGPPPGMM-----GGGGKPPS-------NGWMVRPNGGGGGGNTW 45
            ++++LW  P      RGPPPG+      G     PS       NGW+  PN      NT 
Sbjct: 1052 ATSELWTSPLNKSSSRGPPPGLSTNKSGGVTATTPSPTVAGNSNGWL--PNRSVPNTNTT 1109

Query: 46   GTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEA 105
             T      W+ +W+LLKNL  QIDGSTL+TLC+QHGPL +FH YLN  +AL KY+TREEA
Sbjct: 1110 WTGA-NAAWNSSWLLLKNLNAQIDGSTLRTLCMQHGPLVSFHPYLNQGIALCKYTTREEA 1168

Query: 106  IKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHL 139
             KAQ  LNNC+LGNTTIFAE+PS+ EVQ++L HL
Sbjct: 1169 NKAQMALNNCVLGNTTIFAESPSENEVQNILQHL 1202


>gi|355725495|gb|AES08575.1| trinucleotide repeat containing 6A [Mustela putorius furo]
          Length = 486

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 100/252 (39%), Positives = 136/252 (53%), Gaps = 40/252 (15%)

Query: 3   SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRPNGGGGG----------GNTW 45
           +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG          G +W
Sbjct: 239 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGCSW 296

Query: 46  GTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEA 105
           G S    G    W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE 
Sbjct: 297 GESS--SGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEV 354

Query: 106 IKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWAR 158
           +KAQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   + 
Sbjct: 355 VKAQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSH 414

Query: 159 GSSALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSS 210
             S+ ++ + W+  G  G        + LWGT   P    SLWG PP  S  R  ++PS 
Sbjct: 415 SFSSRTDLNHWNGAGLSGTNCGDLHGTSLWGT---PHYSTSLWG-PPSSSDPRGISSPSP 470

Query: 211 LNSFLPGDLLGG 222
           +N+FL  D LGG
Sbjct: 471 INAFLSVDHLGG 482


>gi|449475873|ref|XP_002196372.2| PREDICTED: trinucleotide repeat-containing gene 6A protein
            [Taeniopygia guttata]
          Length = 1913

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 105/253 (41%), Positives = 141/253 (55%), Gaps = 35/253 (13%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWS 55
            +++LW    PPK    P  PPPG+ G   KPP + W      GGG GN+     P   W 
Sbjct: 1666 AHELWKVPLPPKSITAPSRPPPGLTGQ--KPPLSTWDNSLRLGGGWGNSDARYTPGSSWG 1723

Query: 56   GT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKA 108
             +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +KA
Sbjct: 1724 ESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVKA 1783

Query: 109  QGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTG----GWARGSSALS 164
            Q +L+ C+LGNTTI AE  S+ E+    A   +   +    + G+     G   GS + S
Sbjct: 1784 QKSLHMCVLGNTTILAEFASEEEISRFFAQGQSLTPSPGWQSLGSSQSRLGSIDGSHSFS 1843

Query: 165  NKDT---WSSGGGGGNTS------QLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSF 214
            N++    W+  G  G +S       LWG+P+  S   SLWGAP   D+   ++PS +N+F
Sbjct: 1844 NRNDLNHWNGAGLSGTSSGDLHGTSLWGSPNYSS---SLWGAPSSNDTRGISSPSPINAF 1900

Query: 215  LPGDLL--GGESM 225
            L  D L  GGESM
Sbjct: 1901 LSVDHLGGGGESM 1913


>gi|263359644|gb|ACY70480.1| hypothetical protein DVIR88_6g0017 [Drosophila virilis]
          Length = 1394

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 80/154 (51%), Positives = 99/154 (64%), Gaps = 19/154 (12%)

Query: 2    SSNDLWGPP----KPRGPPPGMM-----GGGGKPPS-------NGWMVRPNGGGGGGNTW 45
            ++++LW  P      RGPPPG+      G     PS       NGW+  PN      NT 
Sbjct: 1069 ATSELWTSPLNKSSSRGPPPGLSTNKSGGVTATTPSPTVAGNSNGWL--PNRSVPNTNTT 1126

Query: 46   GTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEA 105
             T      W+ +W+LLKNL  QIDGSTL+TLC+QHGPL +FH YLN  +AL KY+TREEA
Sbjct: 1127 WTGA-NAAWNSSWLLLKNLNAQIDGSTLRTLCMQHGPLVSFHPYLNQGIALCKYTTREEA 1185

Query: 106  IKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHL 139
             KAQ  LNNC+LGNTTIFAE+PS+ EVQ++L HL
Sbjct: 1186 NKAQMALNNCVLGNTTIFAESPSENEVQNILQHL 1219


>gi|444725719|gb|ELW66274.1| Trinucleotide repeat-containing 6A protein [Tupaia chinensis]
          Length = 1894

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 103/257 (40%), Positives = 140/257 (54%), Gaps = 42/257 (16%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRPNGGGGG----------GNTW 45
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG          G+ W
Sbjct: 1646 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSNW 1703

Query: 46   GTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEA 105
            G S    G    W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE 
Sbjct: 1704 GESS--SGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEV 1761

Query: 106  IKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWAR 158
            +KAQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   + 
Sbjct: 1762 VKAQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSH 1821

Query: 159  GSSALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSS 210
              S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS 
Sbjct: 1822 SFSSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSP 1877

Query: 211  LNSFLPGDLL--GGESM 225
            +N+FL  D L  GGESM
Sbjct: 1878 INAFLSVDHLGGGGESM 1894


>gi|148685346|gb|EDL17293.1| mCG20982, isoform CRA_d [Mus musculus]
          Length = 1937

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 101/253 (39%), Positives = 139/253 (54%), Gaps = 35/253 (13%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1690 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1747

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1748 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1807

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1808 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1867

Query: 161  SALSNKDTWSSGG-GGGNT-----SQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
            S+ ++ + W+  G  G N      + LWGTP   +   SLWG P  D    ++PS +N+F
Sbjct: 1868 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 1924

Query: 215  LPGDLL--GGESM 225
            L  D L  GGESM
Sbjct: 1925 LSVDHLGGGGESM 1937


>gi|117190552|ref|NP_659174.3| trinucleotide repeat-containing gene 6A protein [Mus musculus]
          Length = 1896

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 101/253 (39%), Positives = 139/253 (54%), Gaps = 35/253 (13%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1649 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1706

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1707 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1766

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1767 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1826

Query: 161  SALSNKDTWSSGG-GGGNT-----SQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
            S+ ++ + W+  G  G N      + LWGTP   +   SLWG P  D    ++PS +N+F
Sbjct: 1827 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 1883

Query: 215  LPGDLL--GGESM 225
            L  D L  GGESM
Sbjct: 1884 LSVDHLGGGGESM 1896


>gi|123791339|sp|Q3UHK8.1|TNR6A_MOUSE RecName: Full=Trinucleotide repeat-containing gene 6A protein
 gi|74181174|dbj|BAE27849.1| unnamed protein product [Mus musculus]
          Length = 1896

 Score =  137 bits (346), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 101/253 (39%), Positives = 139/253 (54%), Gaps = 35/253 (13%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1649 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1706

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1707 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1766

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1767 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1826

Query: 161  SALSNKDTWSSGG-GGGNT-----SQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
            S+ ++ + W+  G  G N      + LWGTP   +   SLWG P  D    ++PS +N+F
Sbjct: 1827 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 1883

Query: 215  LPGDLL--GGESM 225
            L  D L  GGESM
Sbjct: 1884 LSVDHLGGGGESM 1896


>gi|344240423|gb|EGV96526.1| Trinucleotide repeat-containing gene 6A protein [Cricetulus
           griseus]
          Length = 687

 Score =  137 bits (346), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 138/255 (54%), Gaps = 38/255 (14%)

Query: 3   SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
           +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 439 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 496

Query: 55  SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
             +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 497 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLQHGNALVRYSSKEEVVK 556

Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
           AQ +L+ C+LGNTTI AE  S+ E+    A   +   +        + +  G+   +   
Sbjct: 557 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGTSQSRLGSLDCSHSF 616

Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
           S+ ++ + W+  G  G        + LWGT   P    SLWG PP  S  R  ++PS +N
Sbjct: 617 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGT---PHYSTSLWG-PPSSSDPRGISSPSPIN 672

Query: 213 SFLPGDLL--GGESM 225
           +FL  D L  GGESM
Sbjct: 673 AFLSVDHLGGGGESM 687


>gi|449278991|gb|EMC86719.1| Trinucleotide repeat-containing gene 6A protein [Columba livia]
          Length = 1892

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 104/253 (41%), Positives = 141/253 (55%), Gaps = 35/253 (13%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWS 55
            +++LW    PPK    P  PPPG+ G   KPP + W      GGG GN+     P   W 
Sbjct: 1645 AHELWKVPLPPKSITAPSRPPPGLTGQ--KPPLSTWDNSLRLGGGWGNSDARYTPGSSWG 1702

Query: 56   GT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKA 108
             +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +KA
Sbjct: 1703 ESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVKA 1762

Query: 109  QGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTG----GWARGSSALS 164
            Q +L+ C+LGNTTI AE  S+ E+    A   +   +    + G+     G   GS + S
Sbjct: 1763 QKSLHMCVLGNTTILAEFASEEEISRFFAQGQSLTPSPGWQSLGSSQNRLGSIDGSHSFS 1822

Query: 165  NKDT---WSSGGGGGNTS------QLWGTPSNPSSGGSLWGAP-PLDSVDRATPSSLNSF 214
            N++    W+  G  G +S       LWG+P+  +   SLWGAP   D+   ++PS +N+F
Sbjct: 1823 NRNDLNHWNGAGLSGTSSGDLHGTSLWGSPNYST---SLWGAPSSSDTRGISSPSPINAF 1879

Query: 215  LPGDLL--GGESM 225
            L  D L  GGESM
Sbjct: 1880 LSVDHLGGGGESM 1892


>gi|158297465|ref|XP_001689052.1| AGAP007808-PA [Anopheles gambiae str. PEST]
 gi|157015208|gb|EDO63615.1| AGAP007808-PA [Anopheles gambiae str. PEST]
          Length = 1216

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 73/132 (55%), Positives = 92/132 (69%), Gaps = 4/132 (3%)

Query: 25   KPPSNGWMVRPNGGGGGGNTWGT-SQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPL 83
            K  +NGW   P+   GGG+TW + +     WS TW++L+NLT QI+GSTL+TLC+QHGP+
Sbjct: 1067 KLDANGWNT-PSTQAGGGSTWNSGASAANTWSSTWIMLRNLTAQIEGSTLRTLCLQHGPV 1125

Query: 84   QNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATA 143
             NFHLYLN  +AL KY TREEA KAQ  LNNC LGNTTI AE P+++E+Q +L H     
Sbjct: 1126 VNFHLYLNQGIALCKYGTREEAQKAQLALNNCQLGNTTIIAEIPNESEIQYILPH--HVG 1183

Query: 144  NNNNNNNGGTGG 155
            N+N   NG T G
Sbjct: 1184 NSNGMTNGLTSG 1195


>gi|241646725|ref|XP_002409881.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215501453|gb|EEC10947.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 1089

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 108/280 (38%), Positives = 126/280 (45%), Gaps = 82/280 (29%)

Query: 1    MSSNDLWGPPKPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVL 60
            M S+D WG PK RGPPPG+        + GW                       S T+++
Sbjct: 837  MPSSDPWGAPKTRGPPPGLSSSS----TQGWDQS--------------------SCTFLV 872

Query: 61   LKNLTPQ----------------------------------IDGSTLKTLCVQHGPLQNF 86
            LKNLTPQ                                  IDGSTLKTLC+QHGPLQ F
Sbjct: 873  LKNLTPQVGPSHVPFPSTLAAPLGYADCGIRGASWLAKQARIDGSTLKTLCMQHGPLQLF 932

Query: 87   HLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNN 146
            HL+L H LALA+YS+REEA KAQ  L+NCIL NTT+ A  PS+AEV   L     T    
Sbjct: 933  HLFLKHGLALAQYSSREEAAKAQSALHNCILSNTTMLAYIPSEAEVAQFLQLAQGTQQGP 992

Query: 147  NNNNGGTGG-------WARGSSALSNKDTWSSGGGGG-------NTSQLWGTPSNPSSGG 192
                 G GG       +  GS   + +  W+               S LW   S P +GG
Sbjct: 993  PCWAPGGGGGGPSFHRFPYGSRPKAPEAPWNPASTAAPPTSSSSGASHLW---SFPGAGG 1049

Query: 193  SLWGAPPL-------DSVDRATPSSLNSFLPGDLLGGESM 225
             LW AP         D       SSLNSFLPGDLL GESM
Sbjct: 1050 GLWAAPQAPQGPQGGDDHPGGQQSSLNSFLPGDLLSGESM 1089


>gi|194913515|ref|XP_001982714.1| GG16439 [Drosophila erecta]
 gi|190647930|gb|EDV45233.1| GG16439 [Drosophila erecta]
          Length = 1392

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 74/157 (47%), Positives = 96/157 (61%), Gaps = 19/157 (12%)

Query: 2    SSNDLWGPP----KPRGPPPGMMGGGGKPPS--NGWMVRPNGGGGGGNTWGTSQPQG--- 52
            ++++LW  P      RGPPPG+     K  +  N     P    GG N W  ++  G   
Sbjct: 1047 ATSELWTSPLNKSSSRGPPPGLTANSNKSGNGGNSCTSTPTTITGGANGWLQARSGGVPT 1106

Query: 53   ----------GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTR 102
                       W  +W+LL+NLT QIDG TL+TLC+QHGPL +FH YLN  +AL KY+TR
Sbjct: 1107 TNTTWTGGNTSWGSSWLLLRNLTAQIDGPTLRTLCMQHGPLVSFHPYLNQGIALCKYTTR 1166

Query: 103  EEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHL 139
            EEA KAQ  LNNC+L NTTIFAE+PS+ EVQ+++ HL
Sbjct: 1167 EEANKAQMALNNCVLANTTIFAESPSENEVQNIMQHL 1203


>gi|195064375|ref|XP_001996557.1| GH23931 [Drosophila grimshawi]
 gi|193892103|gb|EDV90969.1| GH23931 [Drosophila grimshawi]
          Length = 1432

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 80/154 (51%), Positives = 99/154 (64%), Gaps = 19/154 (12%)

Query: 2    SSNDLWGPP----KPRGPPPGM---MGGGGKPP---------SNGWMVRPNGGGGGGNTW 45
            ++++LW  P      RGPPPG+     GG   P         SNGW+  PN      NT 
Sbjct: 1077 ATSELWTSPLSKGSSRGPPPGLSTSKTGGVTAPTPSPTVAGNSNGWL--PNRSVPSTNTA 1134

Query: 46   GTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEA 105
             T      W+ +W+LLKNL  QIDG TL+TLC+QHGPL +FH YLN  +AL KY+TREEA
Sbjct: 1135 WTGT-NVSWNSSWLLLKNLNAQIDGPTLRTLCMQHGPLVSFHPYLNQGIALCKYATREEA 1193

Query: 106  IKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHL 139
             KAQ  LNNC+LGNTTIFAE+PS+ EVQ++L HL
Sbjct: 1194 NKAQMALNNCVLGNTTIFAESPSENEVQNILQHL 1227


>gi|395533346|ref|XP_003768721.1| PREDICTED: trinucleotide repeat-containing gene 6C protein, partial
           [Sarcophilus harrisii]
          Length = 928

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 97/238 (40%), Positives = 128/238 (53%), Gaps = 20/238 (8%)

Query: 3   SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
           S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 696 SHELWKVPRNTTAPTRPPPGLTNTK---PSSTWGTSPLGWTSSYSSGSAWSTDSSGRTSS 752

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 753 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 812

Query: 118 GNTTIFAEAPSDAEVQSLLAH---LSATA--NNNNNNNGGTGGWARGSSALSNKDT--WS 170
           GNTTI AE   + EV   LA    L  T+   +N   N    G    S  L   D   W+
Sbjct: 813 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNTGTNQTRMGSTSSSHGLVRNDAGHWN 872

Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
           +   G  G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 873 TPCLGSKGSSDLLWG--GVPQYSSSLWGPPSTDDGRVIGSPTPLNTLLPGDLLSGESI 928


>gi|426346617|ref|XP_004040968.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
            [Gorilla gorilla gorilla]
          Length = 1726

 Score =  136 bits (342), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 97/241 (40%), Positives = 130/241 (53%), Gaps = 25/241 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG----GGGGGNTWGTSQPQGGW 54
            S++LW  P+    P  PPPG+       PS+ W   P G       G   W T     G 
Sbjct: 1493 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSACWSTDT--SGR 1547

Query: 55   SGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNN 114
            + +W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ 
Sbjct: 1548 TSSWLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHM 1607

Query: 115  CILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKD 167
            C+LGNTTI AE   + EV   LA   A    ++  +       R S+A        S+  
Sbjct: 1608 CVLGNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAG 1667

Query: 168  TWSSG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
             W++   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES
Sbjct: 1668 HWNAPCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGES 1725

Query: 225  M 225
            +
Sbjct: 1726 L 1726


>gi|26338668|dbj|BAC33005.1| unnamed protein product [Mus musculus]
          Length = 340

 Score =  136 bits (342), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 96/241 (39%), Positives = 128/241 (53%), Gaps = 26/241 (10%)

Query: 3   SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
           S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 108 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 164

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 165 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 224

Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
           GNTTI AE   + EV   LA   A    ++              +G T G  R  +A  N
Sbjct: 225 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 284

Query: 166 KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
               S   G G++  LWG    P    SLWG P   D+    +P+ LN+ LPGDLL GES
Sbjct: 285 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 339

Query: 225 M 225
           +
Sbjct: 340 I 340


>gi|211830506|gb|AAH24324.2| TNRC6A protein [Homo sapiens]
          Length = 448

 Score =  136 bits (342), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 139/255 (54%), Gaps = 38/255 (14%)

Query: 3   SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
           +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 200 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 257

Query: 55  SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
             +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 258 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 317

Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
           AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 318 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 377

Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
           S+ ++ + W+  G  G        + LWGT   P    SLWG PP  S  R  ++PS +N
Sbjct: 378 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGT---PHYSTSLWG-PPSSSDPRGISSPSPIN 433

Query: 213 SFLPGDLL--GGESM 225
           +FL  D L  GGESM
Sbjct: 434 AFLSVDHLGGGGESM 448


>gi|7023252|dbj|BAA91899.1| unnamed protein product [Homo sapiens]
          Length = 440

 Score =  136 bits (342), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 139/255 (54%), Gaps = 38/255 (14%)

Query: 3   SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
           +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 192 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 249

Query: 55  SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
             +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 250 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 309

Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
           AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 310 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 369

Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
           S+ ++ + W+  G  G        + LWGTP       SLWG PP  S  R  ++PS +N
Sbjct: 370 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHY---STSLWG-PPSSSDPRGISSPSPIN 425

Query: 213 SFLPGDLL--GGESM 225
           +FL  D L  GGESM
Sbjct: 426 AFLSVDHLGGGGESM 440


>gi|426346619|ref|XP_004040969.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
            [Gorilla gorilla gorilla]
          Length = 1690

 Score =  136 bits (342), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 97/241 (40%), Positives = 130/241 (53%), Gaps = 25/241 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG----GGGGGNTWGTSQPQGGW 54
            S++LW  P+    P  PPPG+       PS+ W   P G       G   W T     G 
Sbjct: 1457 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSACWSTDT--SGR 1511

Query: 55   SGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNN 114
            + +W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ 
Sbjct: 1512 TSSWLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHM 1571

Query: 115  CILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKD 167
            C+LGNTTI AE   + EV   LA   A    ++  +       R S+A        S+  
Sbjct: 1572 CVLGNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAG 1631

Query: 168  TWSSG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
             W++   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES
Sbjct: 1632 HWNAPCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGES 1689

Query: 225  M 225
            +
Sbjct: 1690 L 1690


>gi|354473299|ref|XP_003498873.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
            [Cricetulus griseus]
          Length = 1888

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 100/238 (42%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1656 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDASGRTSS 1712

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1713 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1772

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWAR-GSS----ALSNKDT--WS 170
            GNTTI AE   + EV   LA   A    ++      G   R GSS     L   DT  WS
Sbjct: 1773 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQPSSGGSQPRLGSSGSTHGLVRSDTAHWS 1832

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRA-TPSSLNSFLPGDLLGGESM 225
            +    G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GESM
Sbjct: 1833 TPCLSGKGSSELLWG--GVPQYSSSLWGPPSADDARVIGSPTPLNTLLPGDLLSGESM 1888


>gi|363739602|ref|XP_414871.3| PREDICTED: trinucleotide repeat-containing gene 6A protein [Gallus
            gallus]
          Length = 1950

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 103/253 (40%), Positives = 140/253 (55%), Gaps = 35/253 (13%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWS 55
            +++LW    PPK    P  PPPG+ G   KPP + W      GGG GN+     P   W 
Sbjct: 1703 AHELWKVPLPPKSITAPSRPPPGLTGQ--KPPLSTWDNSLRLGGGWGNSDARYTPGSSWG 1760

Query: 56   GT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKA 108
             +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +KA
Sbjct: 1761 ESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVKA 1820

Query: 109  QGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTG----GWARGSSALS 164
            Q +L+ C+LGNTTI AE  S+ E+    A   +   +    + G+     G   GS + S
Sbjct: 1821 QKSLHMCVLGNTTILAEFASEEEISRFFAQGQSLTPSPGWQSLGSSQNRLGSIDGSHSFS 1880

Query: 165  NKDT---WSSGGGGGNTS------QLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSF 214
            N++    W+  G  G +S       LWG+P+  +   SLWG P   D+   ++PS +N+F
Sbjct: 1881 NRNDLNHWNGAGLSGTSSGDLHGTSLWGSPNYST---SLWGTPSSNDTRGISSPSPINAF 1937

Query: 215  LPGDLL--GGESM 225
            L  D L  GGESM
Sbjct: 1938 LSVDHLGGGGESM 1950


>gi|281342794|gb|EFB18378.1| hypothetical protein PANDA_006877 [Ailuropoda melanoleuca]
          Length = 1730

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 100/238 (42%), Positives = 132/238 (55%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1498 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1554

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1555 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1614

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTG----GWARGSSALSNKDT--WS 170
            GNTTI AE   + EV   LA   A    ++  ++ GTG    G A  S  L   DT  WS
Sbjct: 1615 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSTGTGQTRLGAAGSSHGLVRSDTGHWS 1674

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
            +    G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1675 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1730


>gi|301766006|ref|XP_002918420.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
            [Ailuropoda melanoleuca]
          Length = 1720

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 100/238 (42%), Positives = 132/238 (55%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1488 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1544

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1545 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1604

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTG----GWARGSSALSNKDT--WS 170
            GNTTI AE   + EV   LA   A    ++  ++ GTG    G A  S  L   DT  WS
Sbjct: 1605 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSTGTGQTRLGAAGSSHGLVRSDTGHWS 1664

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
            +    G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1665 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1720


>gi|345305142|ref|XP_001505551.2| PREDICTED: trinucleotide repeat-containing gene 6A protein
            [Ornithorhynchus anatinus]
          Length = 1906

 Score =  135 bits (341), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 105/254 (41%), Positives = 141/254 (55%), Gaps = 36/254 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1658 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRLGGGWGNSDARYTPGSSW 1715

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1716 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1775

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTG----GWARGSSAL 163
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +    + G+     G   GS + 
Sbjct: 1776 AQKSLHMCVLGNTTILAEFASEEEISRFFAQGQSLTPSPGWQSLGSSQNRLGSIDGSHSF 1835

Query: 164  SNKDT---WSSGGGGGNTS------QLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNS 213
            SN++    W+  G  G +S       LWGTP+  +   SLWG P   D+   ++PS +N+
Sbjct: 1836 SNRNDLNHWNGAGLSGTSSGDLHGTSLWGTPNYST---SLWGTPSSNDTRGISSPSPINA 1892

Query: 214  FLPGDLL--GGESM 225
            FL  D L  GGESM
Sbjct: 1893 FLSVDHLGGGGESM 1906


>gi|148702675|gb|EDL34622.1| mCG19297, isoform CRA_b [Mus musculus]
          Length = 1630

 Score =  135 bits (341), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 98/241 (40%), Positives = 132/241 (54%), Gaps = 26/241 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1398 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1454

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1455 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1514

Query: 118  GNTTIFAEAPSDAEVQSLLAH---LSATANNNNNN---------NGGTGGWARGSSALSN 165
            GNTTI AE   + EV   LA    L  T++  +N+         +G T G  R  +A  N
Sbjct: 1515 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSNSGGSQPRLGTSGSTHGLVRSDTAHWN 1574

Query: 166  KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
                S   G G++  LWG    P    SLWG P   D+    +P+ LN+ LPGDLL GES
Sbjct: 1575 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 1629

Query: 225  M 225
            +
Sbjct: 1630 I 1630


>gi|392351792|ref|XP_003751024.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
            [Rattus norvegicus]
          Length = 1919

 Score =  135 bits (341), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 96/241 (39%), Positives = 127/241 (52%), Gaps = 26/241 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1687 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1743

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1744 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1803

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
            GNTTI AE   + EV   LA   A    ++              +G T G  R  +A  N
Sbjct: 1804 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 1863

Query: 166  KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRA-TPSSLNSFLPGDLLGGES 224
                S   G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES
Sbjct: 1864 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSADDTRVIGSPTPLNTLLPGDLLSGES 1918

Query: 225  M 225
            +
Sbjct: 1919 I 1919


>gi|392332214|ref|XP_003752510.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
            [Rattus norvegicus]
          Length = 1815

 Score =  135 bits (341), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 96/241 (39%), Positives = 127/241 (52%), Gaps = 26/241 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1583 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1639

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1640 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1699

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
            GNTTI AE   + EV   LA   A    ++              +G T G  R  +A  N
Sbjct: 1700 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 1759

Query: 166  KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRA-TPSSLNSFLPGDLLGGES 224
                S   G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES
Sbjct: 1760 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSADDTRVIGSPTPLNTLLPGDLLSGES 1814

Query: 225  M 225
            +
Sbjct: 1815 I 1815


>gi|301612973|ref|XP_002935969.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
            [Xenopus (Silurana) tropicalis]
          Length = 1663

 Score =  135 bits (341), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 97/238 (40%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMV-RPNGGGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   +        +    S    G + +
Sbjct: 1431 SHELWKVPRNTTAPSRPPPGLTNAK---PSSAWSSNQLGWTSSYSSGSTWSTDSSGRTSS 1487

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1488 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEATKAQKSLHMCVL 1547

Query: 118  GNTTIFAEAPSDAEVQSLLAH-----LSATANNNNNNNGGTGGWARGSSALSNKDT--WS 170
            GNTTI AE   + EV   LA       +++  +N  N+    G A GS  L   D   W+
Sbjct: 1548 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNTGNSQPRLGSAGGSHTLVRSDAAHWN 1607

Query: 171  --SGGGGGNTSQLWGTPSNPSSGGSLWGAP-PLDSVDRATPSSLNSFLPGDLLGGESM 225
                G  GN   LWG    P    SLWG P   D+    +P+ LN+ LPGDLL GESM
Sbjct: 1608 PPCLGSKGNNDLLWG--GVPQYSSSLWGPPGSEDARIIRSPTPLNTLLPGDLLSGESM 1663


>gi|148702674|gb|EDL34621.1| mCG19297, isoform CRA_a [Mus musculus]
          Length = 1580

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 98/241 (40%), Positives = 132/241 (54%), Gaps = 26/241 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1348 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1404

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1405 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1464

Query: 118  GNTTIFAEAPSDAEVQSLLAH---LSATANNNNNN---------NGGTGGWARGSSALSN 165
            GNTTI AE   + EV   LA    L  T++  +N+         +G T G  R  +A  N
Sbjct: 1465 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSNSGGSQPRLGTSGSTHGLVRSDTAHWN 1524

Query: 166  KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
                S   G G++  LWG    P    SLWG P   D+    +P+ LN+ LPGDLL GES
Sbjct: 1525 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 1579

Query: 225  M 225
            +
Sbjct: 1580 I 1580


>gi|432117597|gb|ELK37833.1| Trinucleotide repeat-containing protein 6A protein [Myotis davidii]
          Length = 1886

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 103/255 (40%), Positives = 139/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1638 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1695

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1696 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1755

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ N  G+   +   
Sbjct: 1756 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQNRLGSLDCSHPF 1815

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++   W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1816 SSRTDLSHWNGAGLAGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGMSSPSPIN 1871

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1872 AFLSVDHLGGGGESM 1886


>gi|334323032|ref|XP_001380459.2| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
            [Monodelphis domestica]
          Length = 1887

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 96/238 (40%), Positives = 128/238 (53%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1655 SHELWKVPRNTTAPTRPPPGLTN---TKPSSTWGTSPLGWTSSYSSGSAWSTDSSGRTSS 1711

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1712 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1771

Query: 118  GNTTIFAEAPSDAEVQSLLAH-----LSATANNNNNNNGGTGGWARGSSALSNKDT--WS 170
            GNTTI AE   + EV   LA       +++  +N   N    G    S  L   D   W+
Sbjct: 1772 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNTGTNQTRMGSTNSSHGLVRNDAGHWN 1831

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
            +   G  G+T  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1832 TPCLGSKGSTDLLWG--GVPQYSSSLWGPPSTDDGRVIGSPTPLNTLLPGDLLSGESI 1887


>gi|211826331|gb|AAH05741.2| Tnrc6a protein [Mus musculus]
          Length = 661

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 95/246 (38%), Positives = 133/246 (54%), Gaps = 33/246 (13%)

Query: 3   SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
           +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 408 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 465

Query: 55  SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
             +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 466 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 525

Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
           AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 526 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 585

Query: 161 SALSNKDTWSSGGGGG------NTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
           S+ ++ + W+  G  G      + + LWGT   P    SLWG P  D    ++PS +N+F
Sbjct: 586 SSRTDVNHWNGAGLSGANCGDLHGTSLWGT---PHYSTSLWGPPSSDPRGISSPSPINAF 642

Query: 215 LPGDLL 220
           L  D L
Sbjct: 643 LSVDHL 648


>gi|431908726|gb|ELK12318.1| Trinucleotide repeat-containing protein 6C protein [Pteropus
           alecto]
          Length = 670

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3   SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
           S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 438 SHELWKVPRNTTAPTRPPPGLTNPK---PSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 494

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS+++EA KAQ +L+ C+L
Sbjct: 495 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKDEAAKAQKSLHMCVL 554

Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWAR-----GSSALSNKDT--WS 170
           GNTTI AE   + EV   LA   A    ++  +    G  R      S  L+  DT  W+
Sbjct: 555 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGAGQTRLGASGSSHGLARSDTGHWN 614

Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
           +    G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 615 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 670


>gi|16551820|dbj|BAB71179.1| unnamed protein product [Homo sapiens]
          Length = 582

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3   SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
           S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 350 SHELWKVPRNSTAPTRPPPGLTNPK---PSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 406

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 407 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 466

Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
           GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 467 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 526

Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
           +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 527 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 582


>gi|363740796|ref|XP_415612.3| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 3
            [Gallus gallus]
          Length = 1897

 Score =  135 bits (339), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 97/240 (40%), Positives = 130/240 (54%), Gaps = 25/240 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1666 SHELWKVPRNTTAPTRPPPGLTN---TKPSSTWGASPLGWTSSYSSGSAWSTDSSGRTSS 1722

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1723 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1782

Query: 118  GNTTIFAEAPSDAEVQSLLAH---LSATANNNNNN--------NGGTGGWARGSSALSNK 166
            GNTTI AE   + EV   LA    L  T++  +N         + G+ G  RG +   N 
Sbjct: 1783 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNAGSTQPRLGSAGSHGLVRGDTGHWNS 1842

Query: 167  DTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
                  GG G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1843 PCL---GGKGSSELLWG--GVPQYSSSLWGPPSADDGRVIGSPTPLNTLLPGDLLSGESI 1897


>gi|358418930|ref|XP_614640.5| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 1
            [Bos taurus]
          Length = 1958

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1710 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSSWDNSPLRVGGGWGNSDARYTPGSSW 1767

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1768 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1827

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1828 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPSWQSLGSSQSRLGSLDCSHSF 1887

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1888 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1943

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1944 AFLSVDHLGGGGESM 1958


>gi|440898230|gb|ELR49767.1| Trinucleotide repeat-containing 6A protein, partial [Bos grunniens
            mutus]
          Length = 1928

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1680 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSSWDNSPLRVGGGWGNSDARYTPGSSW 1737

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1738 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1797

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1798 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPSWQSLGSSQSRLGSLDCSHSF 1857

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1858 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1913

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1914 AFLSVDHLGGGGESM 1928


>gi|359079715|ref|XP_002698073.2| PREDICTED: trinucleotide repeat-containing gene 6A protein [Bos
            taurus]
          Length = 1921

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1673 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSSWDNSPLRVGGGWGNSDARYTPGSSW 1730

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1731 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1790

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1791 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPSWQSLGSSQSRLGSLDCSHSF 1850

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1851 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1906

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1907 AFLSVDHLGGGGESM 1921


>gi|363740798|ref|XP_003642381.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
            [Gallus gallus]
          Length = 1719

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 97/240 (40%), Positives = 130/240 (54%), Gaps = 25/240 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1488 SHELWKVPRNTTAPTRPPPGLTN---TKPSSTWGASPLGWTSSYSSGSAWSTDSSGRTSS 1544

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1545 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1604

Query: 118  GNTTIFAEAPSDAEVQSLLAH---LSATANNNNNN--------NGGTGGWARGSSALSNK 166
            GNTTI AE   + EV   LA    L  T++  +N         + G+ G  RG +   N 
Sbjct: 1605 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNAGSTQPRLGSAGSHGLVRGDTGHWNS 1664

Query: 167  DTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
                  GG G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1665 PCL---GGKGSSELLWG--GVPQYSSSLWGPPSADDGRVIGSPTPLNTLLPGDLLSGESI 1719


>gi|363740794|ref|XP_003642380.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
            [Gallus gallus]
          Length = 1683

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 97/240 (40%), Positives = 130/240 (54%), Gaps = 25/240 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1452 SHELWKVPRNTTAPTRPPPGLTN---TKPSSTWGASPLGWTSSYSSGSAWSTDSSGRTSS 1508

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1509 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1568

Query: 118  GNTTIFAEAPSDAEVQSLLAH---LSATANNNNNN--------NGGTGGWARGSSALSNK 166
            GNTTI AE   + EV   LA    L  T++  +N         + G+ G  RG +   N 
Sbjct: 1569 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNAGSTQPRLGSAGSHGLVRGDTGHWNS 1628

Query: 167  DTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
                  GG G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1629 PCL---GGKGSSELLWG--GVPQYSSSLWGPPSADDGRVIGSPTPLNTLLPGDLLSGESI 1683


>gi|297283685|ref|XP_001098013.2| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 1
            [Macaca mulatta]
          Length = 1971

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1723 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1780

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1781 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1840

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1841 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1900

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1901 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1956

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1957 AFLSVDHLGGGGESM 1971


>gi|124378035|ref|NP_932139.2| trinucleotide repeat-containing gene 6C protein [Mus musculus]
          Length = 1900

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 96/241 (39%), Positives = 128/241 (53%), Gaps = 26/241 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1668 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1724

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1725 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1784

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
            GNTTI AE   + EV   LA   A    ++              +G T G  R  +A  N
Sbjct: 1785 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 1844

Query: 166  KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
                S   G G++  LWG    P    SLWG P   D+    +P+ LN+ LPGDLL GES
Sbjct: 1845 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 1899

Query: 225  M 225
            +
Sbjct: 1900 I 1900


>gi|74184652|dbj|BAE27937.1| unnamed protein product [Mus musculus]
          Length = 1900

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 96/241 (39%), Positives = 128/241 (53%), Gaps = 26/241 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1668 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1724

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1725 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1784

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
            GNTTI AE   + EV   LA   A    ++              +G T G  R  +A  N
Sbjct: 1785 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 1844

Query: 166  KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
                S   G G++  LWG    P    SLWG P   D+    +P+ LN+ LPGDLL GES
Sbjct: 1845 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 1899

Query: 225  M 225
            +
Sbjct: 1900 I 1900


>gi|355710058|gb|EHH31522.1| hypothetical protein EGK_12611 [Macaca mulatta]
          Length = 1940

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1692 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1749

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1750 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1809

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1810 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1869

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1870 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1925

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1926 AFLSVDHLGGGGESM 1940


>gi|345802132|ref|XP_547086.3| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 1
            [Canis lupus familiaris]
          Length = 1931

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1683 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1740

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1741 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1800

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1801 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1860

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1861 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1916

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1917 AFLSVDHLGGGGESM 1931


>gi|28972790|dbj|BAC65811.1| mKIAA1582 protein [Mus musculus]
          Length = 1362

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 96/241 (39%), Positives = 128/241 (53%), Gaps = 26/241 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1130 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1186

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1187 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1246

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
            GNTTI AE   + EV   LA   A    ++              +G T G  R  +A  N
Sbjct: 1247 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 1306

Query: 166  KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
                S   G G++  LWG    P    SLWG P   D+    +P+ LN+ LPGDLL GES
Sbjct: 1307 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 1361

Query: 225  M 225
            +
Sbjct: 1362 I 1362


>gi|355756645|gb|EHH60253.1| hypothetical protein EGM_11578 [Macaca fascicularis]
          Length = 1942

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1694 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1751

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1752 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1811

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1812 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1871

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1872 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1927

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1928 AFLSVDHLGGGGESM 1942


>gi|426254457|ref|XP_004020895.1| PREDICTED: trinucleotide repeat-containing gene 6A protein [Ovis
            aries]
          Length = 1706

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1458 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSSWDNSPLRVGGGWGNSDARYTPGSSW 1515

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1516 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1575

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1576 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPSWQSLGSSQSRLGSLDCSHSF 1635

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1636 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1691

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1692 AFLSVDHLGGGGESM 1706


>gi|395846174|ref|XP_003795787.1| PREDICTED: trinucleotide repeat-containing gene 6A protein [Otolemur
            garnettii]
          Length = 1926

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1678 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1735

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1736 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1795

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1796 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1855

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1856 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1911

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1912 AFLSVDHLGGGGESM 1926


>gi|344294499|ref|XP_003418954.1| PREDICTED: trinucleotide repeat-containing gene 6A protein [Loxodonta
            africana]
          Length = 1931

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1683 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPIRVGGGWGNSDARYTPGSSW 1740

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1741 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1800

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1801 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1860

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1861 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1916

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1917 AFLSVDHLGGGGESM 1931


>gi|297462713|ref|XP_580298.5| PREDICTED: trinucleotide repeat-containing gene 6C protein [Bos
            taurus]
 gi|297487379|ref|XP_002696206.1| PREDICTED: trinucleotide repeat-containing gene 6C protein [Bos
            taurus]
 gi|296476006|tpg|DAA18121.1| TPA: hypothetical protein BOS_19462 [Bos taurus]
          Length = 1724

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 97/239 (40%), Positives = 129/239 (53%), Gaps = 22/239 (9%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1492 SHELWKVPRNTTAPTRPPPGLTN---PKPSSAWGASPLGWTSSYSSGSAWSTDASGRTSS 1548

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1549 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGHAVVRYSSKEEAAKAQKSLHMCVL 1608

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSAT--------ANNNNNNNGGTGGWARGSSALSNKDTW 169
            GNTTI AE   + EV   LA   A         +   +    GT G A G    S+   W
Sbjct: 1609 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQPSPGTSQTRLGTSGSAHG-LVRSDAGHW 1667

Query: 170  SSGG--GGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
            ++ G  G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1668 NAPGLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1724


>gi|149067987|gb|EDM17539.1| trinucleotide repeat containing 6 (predicted), isoform CRA_a [Rattus
            norvegicus]
          Length = 1937

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1689 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1746

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1747 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1806

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1807 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1866

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1867 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1922

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1923 AFLSVDHLGGGGESM 1937


>gi|296219794|ref|XP_002756022.1| PREDICTED: trinucleotide repeat-containing gene 6A protein
            [Callithrix jacchus]
          Length = 1963

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1715 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1772

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1773 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1832

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1833 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1892

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1893 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1948

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1949 AFLSVDHLGGGGESM 1963


>gi|402907996|ref|XP_003916744.1| PREDICTED: trinucleotide repeat-containing gene 6A protein [Papio
            anubis]
          Length = 1926

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1678 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1735

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1736 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1795

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1796 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1855

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1856 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1911

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1912 AFLSVDHLGGGGESM 1926


>gi|403277188|ref|XP_003930258.1| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 1
            [Saimiri boliviensis boliviensis]
          Length = 1923

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1675 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1732

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1733 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1792

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1793 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1852

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1853 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1908

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1909 AFLSVDHLGGGGESM 1923


>gi|345804568|ref|XP_540459.3| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
            [Canis lupus familiaris]
          Length = 1723

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 99/238 (41%), Positives = 132/238 (55%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1491 SHELWKVPRNTTAPTRPPPGL---SNPKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1547

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1548 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1607

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTG----GWARGSSALSNKD--TWS 170
            GNTTI AE   + EV   LA   A    ++  ++ GTG    G + GS  L   D   WS
Sbjct: 1608 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSTGTGQTRLGASGGSHGLVRSDPGHWS 1667

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
            +    G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1668 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1723


>gi|397485193|ref|XP_003813742.1| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 1
            [Pan paniscus]
          Length = 1940

 Score =  134 bits (337), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1692 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1749

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1750 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1809

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1810 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPSWQSLGSSQSRLGSLDCSHSF 1869

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1870 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1925

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1926 AFLSVDHLGGGGESM 1940


>gi|116805348|ref|NP_055309.2| trinucleotide repeat-containing gene 6A protein [Homo sapiens]
 gi|296452846|sp|Q8NDV7.2|TNR6A_HUMAN RecName: Full=Trinucleotide repeat-containing gene 6A protein;
            AltName: Full=CAG repeat protein 26; AltName: Full=EMSY
            interactor protein; AltName: Full=GW182 autoantigen;
            Short=Protein GW1; AltName: Full=Glycine-tryptophan
            protein of 182 kDa
 gi|225000816|gb|AAI72409.1| Trinucleotide repeat containing 6A [synthetic construct]
 gi|306921199|dbj|BAJ17679.1| trinucleotide repeat containing 6A [synthetic construct]
          Length = 1962

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1714 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1771

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1772 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1831

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1832 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1891

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1892 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1947

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1948 AFLSVDHLGGGGESM 1962


>gi|395747616|ref|XP_002826294.2| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
            6A protein [Pongo abelii]
          Length = 1932

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1684 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1741

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1742 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1801

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1802 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1861

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1862 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1917

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1918 AFLSVDHLGGGGESM 1932


>gi|390463863|ref|XP_002748839.2| PREDICTED: trinucleotide repeat-containing gene 6C protein
            [Callithrix jacchus]
          Length = 1976

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 130/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1744 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1800

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1801 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1860

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
            GNTTI AE   + EV   LA   A  + ++  +       R S+A        S+   W+
Sbjct: 1861 GNTTILAEFAGEEEVNRFLAQGQALPSTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1920

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
            +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 1921 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1976


>gi|281344769|gb|EFB20353.1| hypothetical protein PANDA_019391 [Ailuropoda melanoleuca]
          Length = 1905

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1657 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDSRYTPGSSW 1714

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1715 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1774

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1775 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1834

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1835 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1890

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1891 AFLSVDHLGGGGESM 1905


>gi|119576188|gb|EAW55784.1| trinucleotide repeat containing 6A, isoform CRA_c [Homo sapiens]
          Length = 1935

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1687 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1744

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1745 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1804

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1805 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1864

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1865 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1920

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1921 AFLSVDHLGGGGESM 1935


>gi|345804566|ref|XP_003435198.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
            [Canis lupus familiaris]
          Length = 1687

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 99/238 (41%), Positives = 132/238 (55%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1455 SHELWKVPRNTTAPTRPPPGL---SNPKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1511

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1512 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1571

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTG----GWARGSSALSNKD--TWS 170
            GNTTI AE   + EV   LA   A    ++  ++ GTG    G + GS  L   D   WS
Sbjct: 1572 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSTGTGQTRLGASGGSHGLVRSDPGHWS 1631

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
            +    G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1632 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1687


>gi|126253814|sp|Q3UHC0.2|TNR6C_MOUSE RecName: Full=Trinucleotide repeat-containing gene 6C protein
          Length = 1690

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 96/241 (39%), Positives = 128/241 (53%), Gaps = 26/241 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1458 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1514

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1515 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1574

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
            GNTTI AE   + EV   LA   A    ++              +G T G  R  +A  N
Sbjct: 1575 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 1634

Query: 166  KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
                S   G G++  LWG    P    SLWG P   D+    +P+ LN+ LPGDLL GES
Sbjct: 1635 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 1689

Query: 225  M 225
            +
Sbjct: 1690 I 1690


>gi|301787707|ref|XP_002929270.1| PREDICTED: trinucleotide repeat-containing gene 6A protein-like
            [Ailuropoda melanoleuca]
          Length = 1939

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1691 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDSRYTPGSSW 1748

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1749 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1808

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1809 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1868

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1869 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1924

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1925 AFLSVDHLGGGGESM 1939


>gi|21740153|emb|CAD39090.1| hypothetical protein [Homo sapiens]
          Length = 1064

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 832  SHELWKVPRNSTAPTRPPPGLTNPK---PSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 888

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 889  WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 948

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
            GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 949  GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1008

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
            +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 1009 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1064


>gi|52545832|emb|CAH56236.1| hypothetical protein [Homo sapiens]
          Length = 1053

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 139/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 805  AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 862

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 863  GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 922

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 923  AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 982

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGT   P    SLWG PP  S  R  ++PS +N
Sbjct: 983  SSRTDLNHWNGAGLSGTNCGDLHGTSLWGT---PHYSTSLWG-PPSSSDPRGISSPSPIN 1038

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1039 AFLSVDHLGGGGESM 1053


>gi|21693029|emb|CAD37348.1| EDIE protein [Homo sapiens]
          Length = 1962

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1714 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1771

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1772 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1831

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1832 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1891

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1892 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1947

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1948 AFLSVDHLGGGGESM 1962


>gi|195469377|ref|XP_002099614.1| GE14506 [Drosophila yakuba]
 gi|194185715|gb|EDW99326.1| GE14506 [Drosophila yakuba]
          Length = 1386

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 71/142 (50%), Positives = 88/142 (61%), Gaps = 15/142 (10%)

Query: 13   RGPPPGMMGGGGKPPS--NGWMVRPNGGGGGGNTWGTSQPQG-------------GWSGT 57
            RGPPPG+     K  +  N     P    GG N W   +  G              W  +
Sbjct: 1056 RGPPPGLTANSNKSGNGGNSCTSTPTTVAGGANGWLQGRSGGVQTTNTTWTGGNSSWGSS 1115

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W+LLKNLT QIDG TL+TLC+QHGPL +FH YL+  +AL KY+TREEA KAQ  LNNC+L
Sbjct: 1116 WLLLKNLTAQIDGPTLRTLCMQHGPLVSFHPYLSQGIALCKYTTREEANKAQMALNNCVL 1175

Query: 118  GNTTIFAEAPSDAEVQSLLAHL 139
             NTTIFAE+PS+ EVQ+++ HL
Sbjct: 1176 ANTTIFAESPSENEVQNIMQHL 1197


>gi|426381585|ref|XP_004057417.1| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 1
            [Gorilla gorilla gorilla]
          Length = 1935

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1687 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1744

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1745 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1804

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1805 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1864

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1865 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1920

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1921 AFLSVDHLGGGGESM 1935


>gi|148685347|gb|EDL17294.1| mCG20982, isoform CRA_e [Mus musculus]
          Length = 1953

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 95/247 (38%), Positives = 134/247 (54%), Gaps = 33/247 (13%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1690 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1747

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1748 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1807

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1808 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1867

Query: 161  SALSNKDTWSSGGGGG------NTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
            S+ ++ + W+  G  G      + + LWGTP   +   SLWG P  D    ++PS +N+F
Sbjct: 1868 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 1924

Query: 215  LPGDLLG 221
            L  D L 
Sbjct: 1925 LSVDHLA 1931


>gi|440892464|gb|ELR45644.1| Trinucleotide repeat-containing 6C protein, partial [Bos grunniens
            mutus]
          Length = 1738

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 97/239 (40%), Positives = 129/239 (53%), Gaps = 22/239 (9%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1506 SHELWKVPRNTTAPTRPPPGLTN---PKPSSAWGASPLGWTSSYSSGSAWSTDASGRTSS 1562

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1563 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1622

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSAT--------ANNNNNNNGGTGGWARGSSALSNKDTW 169
            GNTTI AE   + EV   LA   A         +   +    GT G A G    S+   W
Sbjct: 1623 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQPSPGTSQTRLGTSGSAHG-LVRSDAGHW 1681

Query: 170  SSGG--GGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
            ++ G  G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1682 NAPGLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1738


>gi|148685344|gb|EDL17291.1| mCG20982, isoform CRA_b [Mus musculus]
          Length = 1893

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 96/247 (38%), Positives = 134/247 (54%), Gaps = 33/247 (13%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1630 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1687

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1688 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1747

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1748 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1807

Query: 161  SALSNKDTWSSGG-GGGNT-----SQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
            S+ ++ + W+  G  G N      + LWGTP   +   SLWG P  D    ++PS +N+F
Sbjct: 1808 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 1864

Query: 215  LPGDLLG 221
            L  D L 
Sbjct: 1865 LSVDHLA 1871


>gi|148685343|gb|EDL17290.1| mCG20982, isoform CRA_a [Mus musculus]
          Length = 1892

 Score =  134 bits (336), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 96/247 (38%), Positives = 134/247 (54%), Gaps = 33/247 (13%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1629 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1686

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1687 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1746

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1747 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1806

Query: 161  SALSNKDTWSSGG-GGGNT-----SQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
            S+ ++ + W+  G  G N      + LWGTP   +   SLWG P  D    ++PS +N+F
Sbjct: 1807 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 1863

Query: 215  LPGDLLG 221
            L  D L 
Sbjct: 1864 LSVDHLA 1870


>gi|403277190|ref|XP_003930259.1| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 2
            [Saimiri boliviensis boliviensis]
          Length = 1706

 Score =  134 bits (336), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1458 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1515

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1516 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1575

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1576 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1635

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1636 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1691

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1692 AFLSVDHLGGGGESM 1706


>gi|7959181|dbj|BAA95984.1| KIAA1460 protein [Homo sapiens]
          Length = 1400

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1152 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1209

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1210 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1269

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1270 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1329

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1330 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1385

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1386 AFLSVDHLGGGGESM 1400


>gi|148685345|gb|EDL17292.1| mCG20982, isoform CRA_c [Mus musculus]
          Length = 1884

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 95/247 (38%), Positives = 134/247 (54%), Gaps = 33/247 (13%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1621 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1678

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1679 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1738

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1739 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1798

Query: 161  SALSNKDTWSSGGGGG------NTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
            S+ ++ + W+  G  G      + + LWGTP   +   SLWG P  D    ++PS +N+F
Sbjct: 1799 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 1855

Query: 215  LPGDLLG 221
            L  D L 
Sbjct: 1856 LSVDHLA 1862


>gi|119576187|gb|EAW55783.1| trinucleotide repeat containing 6A, isoform CRA_b [Homo sapiens]
          Length = 1601

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1353 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1410

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1411 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1470

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1471 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1530

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1531 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1586

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1587 AFLSVDHLGGGGESM 1601


>gi|28374385|gb|AAH45631.1| TNRC6C protein, partial [Homo sapiens]
          Length = 999

 Score =  133 bits (335), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 128/238 (53%), Gaps = 20/238 (8%)

Query: 3   SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
           S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 767 SHELWKVPRNSTAPTRPPPGLTNPK---PSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 823

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+  YS++EEA KAQ +L+ C+L
Sbjct: 824 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVPYSSKEEAAKAQKSLHMCVL 883

Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
           GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 884 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 943

Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
           +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 944 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 999


>gi|21307718|gb|AAK62026.1| GW182 autoantigen [Homo sapiens]
 gi|119576190|gb|EAW55786.1| trinucleotide repeat containing 6A, isoform CRA_e [Homo sapiens]
          Length = 1709

 Score =  133 bits (335), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1461 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1518

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1519 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1578

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1579 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1638

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1639 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1694

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1695 AFLSVDHLGGGGESM 1709


>gi|426381587|ref|XP_004057418.1| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 2
            [Gorilla gorilla gorilla]
          Length = 1709

 Score =  133 bits (335), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1461 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1518

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1519 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1578

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1579 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1638

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1639 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1694

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1695 AFLSVDHLGGGGESM 1709


>gi|397485195|ref|XP_003813743.1| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 2
            [Pan paniscus]
          Length = 1709

 Score =  133 bits (335), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1461 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1518

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1519 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1578

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1579 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPSWQSLGSSQSRLGSLDCSHSF 1638

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1639 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1694

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1695 AFLSVDHLGGGGESM 1709


>gi|74137224|dbj|BAE21997.1| unnamed protein product [Mus musculus]
          Length = 337

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 95/241 (39%), Positives = 127/241 (52%), Gaps = 26/241 (10%)

Query: 3   SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
           S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 105 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 161

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L+N TPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 162 WLVLRNPTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 221

Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
           GNTTI AE   + EV   LA   A    ++              +G T G  R  +A  N
Sbjct: 222 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 281

Query: 166 KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
               S   G G++  LWG    P    SLWG P   D+    +P+ LN+ LPGDLL GES
Sbjct: 282 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 336

Query: 225 M 225
           +
Sbjct: 337 I 337


>gi|410984990|ref|XP_003998808.1| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
            6A protein [Felis catus]
          Length = 1927

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 101/255 (39%), Positives = 140/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1679 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1736

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1737 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1796

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1797 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1856

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G +      + LWG P   +   SLWG PP  S  R  ++PS +N
Sbjct: 1857 SSRTDLNHWNGAGLSGTSCGDLHGTSLWGAPHYST---SLWG-PPSSSDPRGMSSPSPIN 1912

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1913 AFLSVDHLGGGGESM 1927


>gi|338712781|ref|XP_001501299.3| PREDICTED: trinucleotide repeat-containing gene 6A protein [Equus
            caballus]
          Length = 1924

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 138/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1676 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDTRYTPGSSW 1733

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1734 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1793

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G    +   
Sbjct: 1794 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGALDCSHPF 1853

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGT   P    SLWG PP  S  R  ++PS +N
Sbjct: 1854 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGT---PHYSASLWG-PPSSSDPRGISSPSPIN 1909

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1910 AFLSVDHLGGGGESM 1924


>gi|403280805|ref|XP_003931900.1| PREDICTED: trinucleotide repeat-containing gene 6C protein [Saimiri
            boliviensis boliviensis]
          Length = 2119

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1887 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1943

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1944 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 2003

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
            GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 2004 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 2063

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
            +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 2064 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 2119


>gi|332849222|ref|XP_001144739.2| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
            6C protein [Pan troglodytes]
          Length = 1942

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1710 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1766

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1767 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1826

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
            GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 1827 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1886

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
            +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 1887 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1942


>gi|441643594|ref|XP_004090530.1| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
            6C protein [Nomascus leucogenys]
          Length = 1725

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 127/238 (53%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1493 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1549

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1550 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1609

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSALSNKDTWSSG----- 172
            GNTTI AE   + EV   LA   A    ++  +       R S+A S+     S      
Sbjct: 1610 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAASSSHGLVRSDAGHWN 1669

Query: 173  ----GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
                GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 1670 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1725


>gi|327264967|ref|XP_003217280.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
            [Anolis carolinensis]
          Length = 1884

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1652 SHELWKVPRNTTAPTRPPPGLTN---TKPSSTWGASPLGWTSSYSSGSAWSTDSSGRTSS 1708

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1709 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1768

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTGGWARGSSALSNKDTWSSG---- 172
            GNTTI AE   + EV   LA        ++  +N GT     GS++ S+    +      
Sbjct: 1769 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNTGTTQTRLGSTSSSHGMVRNEAGHWN 1828

Query: 173  ----GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
                GG G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1829 APCLGGKGSSDLLWG--GVPQYSSSLWGPPSTDDGRVIGSPTPLNTLLPGDLLSGESI 1884


>gi|395749509|ref|XP_002827930.2| PREDICTED: trinucleotide repeat-containing gene 6C protein [Pongo
            abelii]
          Length = 1935

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1703 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1759

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1760 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1819

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
            GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 1820 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1879

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
            +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 1880 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1935


>gi|386781810|ref|NP_001247675.1| trinucleotide repeat-containing gene 6C protein [Macaca mulatta]
 gi|402901227|ref|XP_003913556.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
            [Papio anubis]
 gi|355754415|gb|EHH58380.1| hypothetical protein EGM_08214 [Macaca fascicularis]
 gi|380815096|gb|AFE79422.1| trinucleotide repeat-containing gene 6C protein isoform 1 [Macaca
            mulatta]
 gi|383420321|gb|AFH33374.1| trinucleotide repeat-containing gene 6C protein isoform 1 [Macaca
            mulatta]
          Length = 1725

 Score =  132 bits (333), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1493 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDASGRTSS 1549

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1550 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1609

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
            GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 1610 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1669

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
            +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 1670 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1725


>gi|119609889|gb|EAW89483.1| trinucleotide repeat containing 6C, isoform CRA_d [Homo sapiens]
          Length = 1687

 Score =  132 bits (333), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1455 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1511

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1512 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1571

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
            GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 1572 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1631

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
            +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 1632 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1687


>gi|397494949|ref|XP_003818329.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
            [Pan paniscus]
          Length = 1689

 Score =  132 bits (333), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1457 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1513

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1514 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1573

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
            GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 1574 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1633

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
            +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 1634 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1689


>gi|217416332|ref|NP_001136112.1| trinucleotide repeat-containing gene 6C protein isoform 1 [Homo
            sapiens]
 gi|119609886|gb|EAW89480.1| trinucleotide repeat containing 6C, isoform CRA_a [Homo sapiens]
          Length = 1726

 Score =  132 bits (333), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1494 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1550

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1551 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1610

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
            GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 1611 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1670

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
            +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 1671 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1726


>gi|397494947|ref|XP_003818328.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
            [Pan paniscus]
          Length = 1725

 Score =  132 bits (333), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1493 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1549

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1550 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1609

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
            GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 1610 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1669

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
            +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 1670 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1725


>gi|355568964|gb|EHH25245.1| hypothetical protein EGK_09030 [Macaca mulatta]
          Length = 1725

 Score =  132 bits (333), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1493 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDASGRTSS 1549

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1550 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1609

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
            GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 1610 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1669

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
            +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 1670 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1725


>gi|402901229|ref|XP_003913557.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
            [Papio anubis]
          Length = 1689

 Score =  132 bits (333), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1457 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDASGRTSS 1513

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1514 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1573

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
            GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 1574 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1633

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
            +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 1634 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1689


>gi|33413425|ref|NP_061869.2| trinucleotide repeat-containing gene 6C protein isoform 2 [Homo
            sapiens]
 gi|126253813|sp|Q9HCJ0.3|TNR6C_HUMAN RecName: Full=Trinucleotide repeat-containing gene 6C protein
 gi|119609891|gb|EAW89485.1| trinucleotide repeat containing 6C, isoform CRA_f [Homo sapiens]
 gi|162317668|gb|AAI56367.1| Trinucleotide repeat containing 6C [synthetic construct]
 gi|162318186|gb|AAI57116.1| Trinucleotide repeat containing 6C [synthetic construct]
 gi|168275508|dbj|BAG10474.1| trinucleotide repeat-containing 6C protein [synthetic construct]
          Length = 1690

 Score =  132 bits (333), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1458 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1514

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1515 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1574

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
            GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 1575 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1634

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
            +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 1635 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1690


>gi|20521948|dbj|BAB13408.2| KIAA1582 protein [Homo sapiens]
          Length = 1740

 Score =  132 bits (333), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1508 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1564

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1565 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1624

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
            GNTTI AE   + EV   LA   A    ++  +       R S+A        S+   W+
Sbjct: 1625 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1684

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
            +   GG G++  LWG    P    SLWG P   DS    +P+ L + LPGDLL GES+
Sbjct: 1685 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1740


>gi|338711312|ref|XP_003362511.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
            [Equus caballus]
          Length = 1721

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 96/238 (40%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1489 SHELWKVPRNTAAPTRPPPGLTT---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1545

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1546 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1605

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTGGWARGSSALSNKDTWSSGG--- 173
            GNTTI AE   + EV   LA   A    ++  ++ GT     G+S  S+    S  G   
Sbjct: 1606 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSTGTSQTRLGASGSSHGLVRSDAGHWN 1665

Query: 174  -----GGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
                 G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1666 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1721


>gi|338711310|ref|XP_001491270.3| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
            [Equus caballus]
          Length = 1686

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 96/238 (40%), Positives = 129/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1454 SHELWKVPRNTAAPTRPPPGLTT---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1510

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1511 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1570

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTGGWARGSSALSNKDTWSSGG--- 173
            GNTTI AE   + EV   LA   A    ++  ++ GT     G+S  S+    S  G   
Sbjct: 1571 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSTGTSQTRLGASGSSHGLVRSDAGHWN 1630

Query: 174  -----GGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
                 G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1631 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1686


>gi|344291110|ref|XP_003417279.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
            [Loxodonta africana]
          Length = 1683

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 98/241 (40%), Positives = 129/241 (53%), Gaps = 26/241 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1451 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1507

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1508 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1567

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSA--------TANNNNNNNGGTGGWARGSSALSNKDT- 168
            GNTTI AE   + EV   LA   A        +++  +    G  G A G   L   DT 
Sbjct: 1568 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGTSQTRLGASGSAHG---LVRSDTG 1624

Query: 169  -WSSG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGES 224
             WS+   G  G++  LW     P    SLWG P  D      +P+ LN+ LPGDLL GES
Sbjct: 1625 HWSAPCLGSKGSSDLLWS--GVPQYSSSLWGPPSADDGRVIGSPTPLNTLLPGDLLSGES 1682

Query: 225  M 225
            M
Sbjct: 1683 M 1683


>gi|417406796|gb|JAA50040.1| Putative thyroid hormone receptor-associated protein complex subunit
            [Desmodus rotundus]
          Length = 1889

 Score =  132 bits (331), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 83/183 (45%), Positives = 110/183 (60%), Gaps = 12/183 (6%)

Query: 53   GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNL 112
            G + +W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L
Sbjct: 1709 GRTSSWLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSL 1768

Query: 113  NNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWAR----GSS---ALSN 165
            + C+LGNTTI AE   + EV   LA   A    ++  +G   G AR    GSS     S+
Sbjct: 1769 HMCVLGNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSGTGTGQARLGAAGSSHGLVRSD 1828

Query: 166  KDTWSSG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGG 222
               W++    G G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL G
Sbjct: 1829 AGHWNAPCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSG 1886

Query: 223  ESM 225
            ES+
Sbjct: 1887 ESL 1889


>gi|344291112|ref|XP_003417280.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
            [Loxodonta africana]
          Length = 1719

 Score =  132 bits (331), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 98/241 (40%), Positives = 129/241 (53%), Gaps = 26/241 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1487 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1543

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1544 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1603

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSA--------TANNNNNNNGGTGGWARGSSALSNKDT- 168
            GNTTI AE   + EV   LA   A        +++  +    G  G A G   L   DT 
Sbjct: 1604 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGTSQTRLGASGSAHG---LVRSDTG 1660

Query: 169  -WSSG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGES 224
             WS+   G  G++  LW     P    SLWG P  D      +P+ LN+ LPGDLL GES
Sbjct: 1661 HWSAPCLGSKGSSDLLWS--GVPQYSSSLWGPPSADDGRVIGSPTPLNTLLPGDLLSGES 1718

Query: 225  M 225
            M
Sbjct: 1719 M 1719


>gi|312384471|gb|EFR29195.1| hypothetical protein AND_02092 [Anopheles darlingi]
          Length = 378

 Score =  132 bits (331), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 104/259 (40%), Positives = 126/259 (48%), Gaps = 54/259 (20%)

Query: 4   NDLWGPP------KPRGPPPGMMGGGGKP---PSNGWMVRPNGGGGGGNTWGTSQPQGGW 54
            D+W  P        RGPPPG+ G  G      + G   R +     G+  G      G 
Sbjct: 133 TDVWSAPIGKLSATTRGPPPGLGGANGNKHIGSTGGVASRISANATWGSASGGGAGTAGS 192

Query: 55  SGT----WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQG 110
            GT    W+LL+NLT QID STL+TLC+QHGP+ +FH Y  H LAL +YS+ +EA+KAQ 
Sbjct: 193 WGTTGTSWLLLRNLTSQIDASTLRTLCMQHGPILSFHPYPAHGLALCRYSSSDEAMKAQQ 252

Query: 111 NLNNCILGNTTIFAEAP-SDAEVQSLLAHL-SATANNNNNNNGGTGGWARG--------- 159
            LNNC LG +TI AE P S+AEVQ+ L  L   TA     ++ GT               
Sbjct: 253 ALNNCPLGASTISAECPSSEAEVQTYLQQLGGGTAITATVSSTGTASGTGSISSISSQSW 312

Query: 160 -----SSALSNKDTWSS----------GGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVD 204
                ++A    DTW S          G G  NTS LW                PLD  D
Sbjct: 313 RLRTPTAATGGTDTWGSGWPIGRDTGDGSGSTNTSNLWA---------------PLDGGD 357

Query: 205 RATPSSLNSFLPGDLLGGE 223
           R TPSSLNSFLP  LLG E
Sbjct: 358 RETPSSLNSFLPESLLGSE 376


>gi|224074955|ref|XP_002194333.1| PREDICTED: trinucleotide repeat-containing gene 6C protein
            [Taeniopygia guttata]
          Length = 1719

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 98/238 (41%), Positives = 130/238 (54%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1487 SHELWKVPRNTTAPTRPPPGLTN---TKPSSTWGASPLGWTSSYSSGSAWSTDSSGRTSS 1543

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1544 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1603

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNN--NNNGGTG---GWARGSSALSNKDT--WS 170
            GNTTI AE   + EV   LA   A    ++  +N G T    G +  S  L   D   W+
Sbjct: 1604 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSNTGSTPSRLGSSGSSHGLVRPDAGHWN 1663

Query: 171  --SGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
                GG G++  LWG    P    SLWG P  D      +P+ LN+ LPGDLL GES+
Sbjct: 1664 PPCLGGKGSSDLLWG--GVPQYSSSLWGPPSADDGRVIGSPTPLNTLLPGDLLSGESI 1719


>gi|395826836|ref|XP_003786620.1| PREDICTED: trinucleotide repeat-containing gene 6C protein [Otolemur
            garnettii]
          Length = 1696

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 96/238 (40%), Positives = 127/238 (53%), Gaps = 20/238 (8%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1464 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1520

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1521 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1580

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWAR----GSS---ALSNKDTWS 170
            GNTTI AE   + EV   LA   A    ++  +       R    GSS     S+   WS
Sbjct: 1581 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLGASGSSHGLVRSDAGHWS 1640

Query: 171  SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
            +   GG G +  LWG    P    SLWG P  D      +P+ L + LPGDLL GES+
Sbjct: 1641 APCLGGKGGSELLWG--GGPQYSSSLWGPPSADDGRVIGSPTPLTTLLPGDLLSGESL 1696


>gi|334333496|ref|XP_001369211.2| PREDICTED: trinucleotide repeat-containing gene 6A protein
            [Monodelphis domestica]
          Length = 1884

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 103/254 (40%), Positives = 138/254 (54%), Gaps = 36/254 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   K P + W   P   GGG GN+     P   W
Sbjct: 1636 AHELWKVPLPPKNITAPSRPPPGLTGQ--KAPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1693

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1694 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1753

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTG----GWARGSSAL 163
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +      G+     G    S + 
Sbjct: 1754 AQKSLHMCVLGNTTILAEFASEEEISRFFAQGQSLTPSPGWQPLGSSQSRLGSIDSSHSF 1813

Query: 164  SNKDT---WSSGGGGGNTS------QLWGTPSNPSSGGSLWGAP-PLDSVDRATPSSLNS 213
            SN++    W+  G  G +S       LWGTP+  +   SLWG P   D+   ++PS +N+
Sbjct: 1814 SNRNDLNHWNGAGLSGTSSGDLHGTSLWGTPNYST---SLWGTPNSSDTRGISSPSPINA 1870

Query: 214  FLPGDLL--GGESM 225
            FL  D L  GGESM
Sbjct: 1871 FLSVDHLGGGGESM 1884


>gi|157817055|ref|NP_001101019.1| trinucleotide repeat-containing gene 6A protein [Rattus norvegicus]
 gi|149067989|gb|EDM17541.1| trinucleotide repeat containing 6 (predicted), isoform CRA_c [Rattus
            norvegicus]
          Length = 1954

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 97/249 (38%), Positives = 135/249 (54%), Gaps = 36/249 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1689 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1746

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1747 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1806

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1807 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1866

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1867 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1922

Query: 213  SFLPGDLLG 221
            +FL  D L 
Sbjct: 1923 AFLSVDHLA 1931


>gi|395515519|ref|XP_003761950.1| PREDICTED: trinucleotide repeat-containing gene 6A protein
            [Sarcophilus harrisii]
          Length = 1978

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 103/254 (40%), Positives = 138/254 (54%), Gaps = 36/254 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   K P + W   P   GGG GN+     P   W
Sbjct: 1730 AHELWKVPLPPKNITAPSRPPPGLTGQ--KAPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1787

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1788 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1847

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTG----GWARGSSAL 163
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +      G+     G    S + 
Sbjct: 1848 AQKSLHMCVLGNTTILAEFASEEEISRFFAQGQSLTPSPGWQPLGSSQSRLGSIDSSHSF 1907

Query: 164  SNKDT---WSSGGGGGNTS------QLWGTPSNPSSGGSLWGAP-PLDSVDRATPSSLNS 213
            SN++    W+  G  G +S       LWGTP+  +   SLWG P   D+   ++PS +N+
Sbjct: 1908 SNRNDLNHWNGAGLSGTSSGDLHGTSLWGTPNYST---SLWGTPNSSDTRGISSPSPINA 1964

Query: 214  FLPGDLL--GGESM 225
            FL  D L  GGESM
Sbjct: 1965 FLSVDHLGGGGESM 1978


>gi|301605745|ref|XP_002932511.1| PREDICTED: trinucleotide repeat-containing gene 6A protein [Xenopus
            (Silurana) tropicalis]
          Length = 1835

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 97/250 (38%), Positives = 136/250 (54%), Gaps = 32/250 (12%)

Query: 3    SNDLWGPP-------KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWS 55
            +++LW  P        P  PPPG+ G   KPP + W       GG G++     P   WS
Sbjct: 1591 AHELWKVPLPSKNISAPSRPPPGLTGQ--KPPLSTWDTNSLRLGGWGSSDSRYTPGSTWS 1648

Query: 56   GT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKA 108
                     W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +KA
Sbjct: 1649 ENSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGTALVRYSSKEEVVKA 1708

Query: 109  QGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGS----SALS 164
            Q +L+ C+LGNTTI AE  S+ E+    A   +   +    + G+G    GS     ++S
Sbjct: 1709 QKSLHMCVLGNTTILAEFASEEEISRFFAQGQSLTPSPGWQSLGSGHSRLGSLDSPHSIS 1768

Query: 165  NK---DTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSFL 215
            N+   + W+S G  G++      + LWGTP+  +   SLWG P  +    ++PS + +FL
Sbjct: 1769 NRGDINHWNSPGASGSSSGDLHGTSLWGTPNYST---SLWGNPSNEGRGLSSPSPVPAFL 1825

Query: 216  PGDLLGGESM 225
              D L GE M
Sbjct: 1826 SVDQLNGEPM 1835


>gi|158297475|ref|XP_001237966.2| AGAP007803-PA [Anopheles gambiae str. PEST]
 gi|157015213|gb|EAU76399.2| AGAP007803-PA [Anopheles gambiae str. PEST]
          Length = 1188

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 92/200 (46%), Positives = 115/200 (57%), Gaps = 17/200 (8%)

Query: 36   NGGGGGGNTWGTSQPQG-GWSG--TWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNH 92
            N  G GG   GT Q +G GWS   +W+LLKN T QID STL+TLC+QHGP+  FH Y  H
Sbjct: 992  NKSGNGGTAAGTQQQRGAGWSTGTSWLLLKNFTSQIDASTLRTLCMQHGPILTFHSYPAH 1051

Query: 93   SLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAP-SDAEVQSLLAHLSATANNNNNNNG 151
             LAL +Y+TREEA KAQ  LNNC LG++TI AE P S++EVQ+ L  L   A   +    
Sbjct: 1052 GLALCRYATREEAAKAQQALNNCTLGSSTISAECPASESEVQTYLQQLGGAAAAASVAVS 1111

Query: 152  GTGG------WARG-SSALSNKDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-V 203
             +        W +  +S+ S  DTW   G G       G     ++  +LW   PLD+  
Sbjct: 1112 SSASSLTSPTWRQERTSSSSGADTW---GSGWAIGGSSGASGAGAAAANLWA--PLDAGT 1166

Query: 204  DRATPSSLNSFLPGDLLGGE 223
            D  TP+SLNSFLP  LLG E
Sbjct: 1167 DSGTPTSLNSFLPDSLLGPE 1186


>gi|149067992|gb|EDM17544.1| trinucleotide repeat containing 6 (predicted), isoform CRA_f [Rattus
            norvegicus]
          Length = 1904

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 97/249 (38%), Positives = 135/249 (54%), Gaps = 36/249 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1639 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1696

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1697 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1756

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1757 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1816

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1817 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1872

Query: 213  SFLPGDLLG 221
            +FL  D L 
Sbjct: 1873 AFLSVDHLA 1881


>gi|149067990|gb|EDM17542.1| trinucleotide repeat containing 6 (predicted), isoform CRA_d [Rattus
            norvegicus]
          Length = 1915

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 97/249 (38%), Positives = 135/249 (54%), Gaps = 36/249 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1650 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1707

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1708 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1767

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1768 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1827

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1828 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1883

Query: 213  SFLPGDLLG 221
            +FL  D L 
Sbjct: 1884 AFLSVDHLA 1892


>gi|149067991|gb|EDM17543.1| trinucleotide repeat containing 6 (predicted), isoform CRA_e [Rattus
            norvegicus]
          Length = 1894

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 97/249 (38%), Positives = 135/249 (54%), Gaps = 36/249 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1629 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1686

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1687 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1746

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1747 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1806

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1807 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1862

Query: 213  SFLPGDLLG 221
            +FL  D L 
Sbjct: 1863 AFLSVDHLA 1871


>gi|348584988|ref|XP_003478254.1| PREDICTED: trinucleotide repeat-containing gene 6A protein-like
            [Cavia porcellus]
          Length = 1924

 Score =  130 bits (327), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 101/255 (39%), Positives = 139/255 (54%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1676 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1733

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +        ++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1734 GESSSGRITNCLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1793

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1794 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1853

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1854 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1909

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1910 AFLSVDHLGGGGESM 1924


>gi|326668565|ref|XP_002662398.2| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
            6C protein [Danio rerio]
          Length = 1740

 Score =  130 bits (327), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 95/236 (40%), Positives = 125/236 (52%), Gaps = 28/236 (11%)

Query: 12   PRGPPPGMM--------GGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKN 63
            P  PPPG+         GG     S GW    N     G TW +  P  G   +W++L+N
Sbjct: 1511 PSRPPPGLTNTKPSSTWGGNSLGLSQGW----NNSYSSGGTWSSDSPNRG--SSWLVLRN 1564

Query: 64   LTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIF 123
            LTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+LGNTTI 
Sbjct: 1565 LTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVLGNTTIL 1624

Query: 124  AEAPSDAEVQSLLAH---LSATANNNNNNNGGTGGWARGSSALSNKDTWSSGGGGGNTSQ 180
            AE  S+ +V    A    L+ T +   +   G+     G+   S+     SGGGG  +  
Sbjct: 1625 AEFASEEDVNRFFAQGQSLTPTTSWQASPAPGSSQPRLGNPTASHPTGLWSGGGGTKSVC 1684

Query: 181  LWGTPSNPSSGGSLWG---------APP--LDSVDRATPSSLNSFLPGDLLGGESM 225
              G  S+ + G  LWG         APP   D+    +P  +N+ LPGDLL GESM
Sbjct: 1685 SAGNSSSGNGGDMLWGGVPQYSSLWAPPNGDDARVIGSPIPINTLLPGDLLSGESM 1740


>gi|351698083|gb|EHB01002.1| Trinucleotide repeat-containing gene 6C protein [Heterocephalus
            glaber]
          Length = 1984

 Score =  130 bits (326), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 95/231 (41%), Positives = 128/231 (55%), Gaps = 25/231 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1771 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1827

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1828 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1887

Query: 118  GNTTIFAEAPSDAEVQSLLAH---LSATANNNNNNNGGTGGWARGSSALSNKDTWSSGGG 174
            GNTTI AE   + EV   LA    L  T++   ++       A  +   S++  W  GGG
Sbjct: 1888 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQPSSGSSQPQAAPMACKGSSELLW--GGG 1945

Query: 175  GGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSFLPGDLLGGESM 225
               +S LWG PS  +  G L G+P        TP  LN+ LPGDLL GES+
Sbjct: 1946 PQYSSSLWGPPS--TDDGRLIGSP--------TP--LNTLLPGDLLSGESI 1984


>gi|348558232|ref|XP_003464922.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
            [Cavia porcellus]
          Length = 1886

 Score =  129 bits (325), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 96/243 (39%), Positives = 128/243 (52%), Gaps = 30/243 (12%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1654 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGPSPLGWTSSYSSGSAWSTDTSGRTSS 1710

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1711 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1770

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNN------------NGGTGGWARGSSALSN 165
            GNTTI AE   + EV   LA   A    ++              +G T G  R     S+
Sbjct: 1771 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSNSGSSQSRLGASGSTHGLVR-----SD 1825

Query: 166  KDTWSSG--GGGGNTSQLWGTPSNPSSGGSLWGAP-PLDSVDRATPSSLNSFLPGDLLGG 222
               W +   G  G++  LWG    P    SLWG P   DS    +P+ LN+ LPGDLL G
Sbjct: 1826 ATHWGAPCLGSKGSSELLWG--GGPQYSSSLWGPPGADDSRLIGSPTPLNTLLPGDLLSG 1883

Query: 223  ESM 225
            ES+
Sbjct: 1884 ESI 1886


>gi|348530814|ref|XP_003452905.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
            [Oreochromis niloticus]
          Length = 1795

 Score =  129 bits (324), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 81/180 (45%), Positives = 105/180 (58%), Gaps = 18/180 (10%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS+++EA KAQ +L+ C+L
Sbjct: 1622 WLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVVRYSSKDEAAKAQKSLHMCVL 1681

Query: 118  GNTTIFAEAPSDAEVQSLLAH---------LSATANNNNNNNGGTGGWARGSSALSNKDT 168
            GNTTI AE   + EV    A            AT   N    GGTG  A  S  + +   
Sbjct: 1682 GNTTILAEFAGEEEVNRFFAQGQSLGGTTSWQATPGTNQTRMGGTGSGA--SHPIGHSPH 1739

Query: 169  W-SSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRA--TPSSLNSFLPGDLLGGESM 225
            W ++  G G++  LWG     S   SLWG PP     R   +P+ +N+ LPGDLL GESM
Sbjct: 1740 WNNNNNGAGSSKLLWGGVQQYS---SLWG-PPSGEEGRVMGSPTPINTLLPGDLLSGESM 1795


>gi|345312835|ref|XP_001517138.2| PREDICTED: trinucleotide repeat-containing gene 6C protein
            [Ornithorhynchus anatinus]
          Length = 1452

 Score =  128 bits (322), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 94/241 (39%), Positives = 123/241 (51%), Gaps = 26/241 (10%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1220 SHELWKVPRNTPAPTRPPPGLTNTK---PSSSWGAGPLGWTSSYSSGSAWSTDSSGRTSS 1276

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1277 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1336

Query: 118  GNTTIFAEAPSDAEVQSLLAH----------LSATANNNNN--NNGGTGGWARGSSALSN 165
            GNTTI AE   + EV   LA            S T  N     + GG  G  RG +    
Sbjct: 1337 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNTGTNQTRLGSTGGAHGLVRGDAG--- 1393

Query: 166  KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
               W++   GG           P    SLWG P   D     +P+ LN+ LPGDLL GES
Sbjct: 1394 --HWNAPCLGGKGGGDLLWGGVPQYSSSLWGPPSAEDGRVVGSPTPLNTLLPGDLLSGES 1451

Query: 225  M 225
            +
Sbjct: 1452 I 1452


>gi|225733942|pdb|2WBR|A Chain A, The Rrm Domain In Gw182 Proteins Contributes To Mirna-
           Mediated Gene Silencing
          Length = 89

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 58/86 (67%), Positives = 70/86 (81%)

Query: 53  GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNL 112
            W  +W+LLKNLT QIDG TL+TLC+QHGPL +FH YLN  +AL KY+TREEA KAQ  L
Sbjct: 4   AWGSSWLLLKNLTAQIDGPTLRTLCMQHGPLVSFHPYLNQGIALCKYTTREEANKAQMAL 63

Query: 113 NNCILGNTTIFAEAPSDAEVQSLLAH 138
           NNC+L NTTIFAE+PS+ EVQS++ H
Sbjct: 64  NNCVLANTTIFAESPSENEVQSIMQH 89


>gi|432845644|ref|XP_004065839.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
            [Oryzias latipes]
          Length = 1968

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 133/255 (52%), Gaps = 50/255 (19%)

Query: 3    SNDLWGPPK-------PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWS 55
            S++LW  P+       P  PPPG+       PS  W          GN+ G +Q   GWS
Sbjct: 1732 SHELWKVPQGPRSTTAPSRPPPGLTNTK---PSTSW---------SGNSLGLTQ---GWS 1776

Query: 56   GT------------------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALA 97
            G+                  W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ 
Sbjct: 1777 GSYSSEGTAWSTDTSNRTSSWLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVV 1836

Query: 98   KYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAH---LSATANNNNNNNGGTG 154
            +YS+++EA KAQ +L+ C+LGNTTI AE   + EV    A    L AT  + + N G   
Sbjct: 1837 RYSSKDEAAKAQKSLHMCVLGNTTILAEFAGEEEVNRFFAQGQSLGATTTSWHANPGPNQ 1896

Query: 155  GWARGSSALSNKDTWSSGGGGGNTSQ---LWGTPSNPSSGGSLWGAPP-LDSVDRATPSS 210
                G+S   +   WSSG GGG  +    LWG     S   SLWG P   D+    +P+ 
Sbjct: 1897 NRMGGASQSHSIGQWSSGAGGGKANGGDLLWGGVPQYS---SLWGPPNGEDARVIGSPTP 1953

Query: 211  LNSFLPGDLLGGESM 225
            +N+ LPGDLL GESM
Sbjct: 1954 INTLLPGDLLSGESM 1968


>gi|441598125|ref|XP_004087437.1| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
            6A protein [Nomascus leucogenys]
          Length = 1938

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 98/255 (38%), Positives = 135/255 (52%), Gaps = 38/255 (14%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG GN+     P   W
Sbjct: 1690 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSFW 1747

Query: 55   SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
              +       W +L  L P IDGSTL+TLC+QHGPL  FHL L H  AL +YS++EE +K
Sbjct: 1748 GESISWIITNWFVLNTLLPXIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1807

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
            AQ +L+ C+LGNTTI AE  S+ E+    A   +   +       ++ +  G+   +   
Sbjct: 1808 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1867

Query: 161  SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
            S+ ++ + W+  G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N
Sbjct: 1868 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1923

Query: 213  SFLPGDLL--GGESM 225
            +FL  D L  GGESM
Sbjct: 1924 AFLSVDHLGGGGESM 1938


>gi|348520957|ref|XP_003447993.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
            [Oreochromis niloticus]
          Length = 1844

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 101/256 (39%), Positives = 135/256 (52%), Gaps = 52/256 (20%)

Query: 3    SNDLWGPPK-------PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWS 55
            S++LW  P+       P  PPPG+       PS+ W         GGN+ G SQ   GWS
Sbjct: 1608 SHELWKVPQGPRSTTAPSRPPPGLTNTK---PSSTW---------GGNSLGLSQ---GWS 1652

Query: 56   GT------------------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALA 97
            G+                  W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ 
Sbjct: 1653 GSYSSEGTTWSTDSSNRTSSWLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVV 1712

Query: 98   KYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEV-------QSLLAHLSATANNNNNNN 150
            +YS+++EA KAQ +L+ C+LGNTTI AE   + EV       QSL A+ ++   N   N 
Sbjct: 1713 RYSSKDEAAKAQKSLHMCVLGNTTILAEFAGEEEVNRFFAQGQSLGANTTSWQANPGTNQ 1772

Query: 151  GGTGGWARGSSALSNKDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPP-LDSVDRATPS 209
               GG A+ S ++    + + GG       LWG     S   SLWG P   D+    +P+
Sbjct: 1773 NRMGGAAQ-SHSIGQWSSSAGGGKASGGDLLWGGVPQYS---SLWGPPSGEDARVIGSPT 1828

Query: 210  SLNSFLPGDLLGGESM 225
             +N+ LPGDLL GESM
Sbjct: 1829 PINTLLPGDLLSGESM 1844


>gi|410917113|ref|XP_003972031.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
            [Takifugu rubripes]
          Length = 1858

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 96/242 (39%), Positives = 130/242 (53%), Gaps = 27/242 (11%)

Query: 3    SNDLWGPPK-------PRGPPPGMMGGGGKPPSNGWM-----VRP--NGGGGGGNTWGTS 48
            S++LW  P+       P  PPPG+       PS+ W      + P  NG    G TW T 
Sbjct: 1625 SHELWKVPQGPRSTTAPSRPPPGLTNSK---PSSTWSGNSLGLAPGWNGSYSSGTTWSTD 1681

Query: 49   QPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKA 108
                  + +W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS+++E+ KA
Sbjct: 1682 S--SNRTSSWLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVVRYSSKDESAKA 1739

Query: 109  QGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNN--NNNGGTGGWARGSSALSNK 166
            Q +L+ C+LGNTTI AE   + EV    A   +   N      N G+     G++   + 
Sbjct: 1740 QKSLHMCVLGNTTILAEFAGEEEVNRFFAQGQSLGANTTSWQANQGSNQNRMGAAQSHSI 1799

Query: 167  DTWSSGGGGGNTSQ--LWGTPSNPSSGGSLWGAPP-LDSVDRATPSSLNSFLPGDLLGGE 223
              WS GGGG  +    LWG     S   SLWG P   D+    +P+ +N+ LPGDLL GE
Sbjct: 1800 GQWSGGGGGKTSGGDLLWGGVPQYS---SLWGPPSGEDARVIGSPTPINTLLPGDLLSGE 1856

Query: 224  SM 225
            SM
Sbjct: 1857 SM 1858


>gi|312384469|gb|EFR29193.1| hypothetical protein AND_02090 [Anopheles darlingi]
          Length = 1745

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 68/139 (48%), Positives = 90/139 (64%), Gaps = 14/139 (10%)

Query: 55   SGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNN 114
            + TW+LL+NLT QIDGSTL+TLC+QHGPL NF  Y +HS+AL KY+TREEA KAQ  LNN
Sbjct: 1592 ASTWILLRNLTAQIDGSTLRTLCMQHGPLLNFQPYTHHSVALCKYATREEAQKAQQALNN 1651

Query: 115  CILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSALSNKDTWSSGGG 174
            C LGNTTI AE P++++VQ +L+ L              GG    S+ ++N  T ++ GG
Sbjct: 1652 CPLGNTTICAEIPTESDVQYILSQL--------------GGSMNASNGMTNGLTGAASGG 1697

Query: 175  GGNTSQLWGTPSNPSSGGS 193
            G N   +    S P + G+
Sbjct: 1698 GQNWRLVAAQQSQPPTPGA 1716


>gi|47208767|emb|CAF91958.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1579

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 95/245 (38%), Positives = 123/245 (50%), Gaps = 30/245 (12%)

Query: 3    SNDLWGPPK-------PRGPPPGMMGGGGKPPSNGWMVRPNGGGGG--------GNTWGT 47
            S++LW  P+       P  PPPG+       PS+ W     G   G        G TW T
Sbjct: 1343 SHELWKVPQGPRSSTAPSRPPPGLTNSK---PSSTWGGSSLGLAPGWTGSYSSEGTTWST 1399

Query: 48   SQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
                G  + +W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS+++EA K
Sbjct: 1400 DS--GNRTSSWLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVVRYSSKDEAAK 1457

Query: 108  AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN------NNNNNGGTGGWARGSS 161
            AQ +L+ C+LGNTTI AE   + EV    A   +   N      N   N    G A+  S
Sbjct: 1458 AQKSLHMCVLGNTTILAEFAGEEEVNRFFAQGQSLGANTTSWQANPGTNQNRMGAAQSHS 1517

Query: 162  ALSNKDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPP-LDSVDRATPSSLNSFLPGDLL 220
                      GG       LWG     S   SLWG P   D+    +P+ +N+ LPGDLL
Sbjct: 1518 IGQWGSGGGGGGKASGGDLLWGGVPQYS---SLWGPPSGEDARVIGSPTPINTLLPGDLL 1574

Query: 221  GGESM 225
             GESM
Sbjct: 1575 SGESM 1579


>gi|431908494|gb|ELK12089.1| Trinucleotide repeat-containing protein 6A protein [Pteropus alecto]
          Length = 1848

 Score =  122 bits (307), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 98/251 (39%), Positives = 130/251 (51%), Gaps = 41/251 (16%)

Query: 3    SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQ---- 51
            +++LW    PPK    P  PPPG+ G   KPP + W   P   GGG   WG++  +    
Sbjct: 1611 AHELWKVPLPPKSITAPSRPPPGLTGQ--KPPLSAWDPAPLRVGGG---WGSADARYTPG 1665

Query: 52   -------GGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREE 104
                    G    W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL +Y ++EE
Sbjct: 1666 SSWGESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLVTFHLSLPHGNALVRYGSKEE 1725

Query: 105  AIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSALS 164
             +KAQ +L+ C+LGNTTI AE  S+ E+    A   + A   +  + G+G    G    S
Sbjct: 1726 VVKAQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLAPAPSWQSLGSGQSRLGPLDCS 1785

Query: 165  N--KDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSFLP 216
            +     W+  G  G +      + LWG    P    SLWG         + PS +N+FL 
Sbjct: 1786 HPFSSHWNGAGLSGTSCGDLPGASLWG---GPHYSASLWGP-----PSSSDPSPINAFLS 1837

Query: 217  GDLL--GGESM 225
             D L  GGESM
Sbjct: 1838 VDHLGGGGESM 1848


>gi|351702889|gb|EHB05808.1| Trinucleotide repeat-containing gene 6A protein [Heterocephalus
            glaber]
          Length = 1787

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 76/180 (42%), Positives = 106/180 (58%), Gaps = 19/180 (10%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  AL  YS++EE +KAQ +L+ C+L
Sbjct: 1555 WLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLLHGNALVCYSSKEEVVKAQKSLHMCVL 1614

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGSSALSNKDTWS 170
            GNTTI AE  S+ E+    A   +   +       ++ +  G+   +   S+ ++ + W+
Sbjct: 1615 GNTTILAEFASEEEISRFFAQGQSLTPSPGWQSLGSSQSRLGSLDCSHAFSSRTDLNHWN 1674

Query: 171  SGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLNSFLPGDLLGG 222
              G  G        + LWGTP   +   SLWG PP  S  R  ++PS +N+FL  D LGG
Sbjct: 1675 GAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPINAFLSVDHLGG 1730


>gi|170052152|ref|XP_001862092.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167873117|gb|EDS36500.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1503

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 80/142 (56%), Positives = 94/142 (66%), Gaps = 6/142 (4%)

Query: 5    DLWGPP--KP-RGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLL 61
            DLWG P  KP RGPPPG+   G    +NGW     GG    N+ G      GW  +W+LL
Sbjct: 1271 DLWGTPMGKPTRGPPPGL---GANKNANGWAGGAGGGPQRSNSGGNWPGGNGWGSSWLLL 1327

Query: 62   KNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTT 121
            KNLT QIDG+TL+TLC+QHGPLQ+  LY NH LAL KYS+REEA KAQ  LNNC LG+T 
Sbjct: 1328 KNLTSQIDGATLRTLCMQHGPLQSLQLYPNHGLALCKYSSREEASKAQQALNNCPLGSTN 1387

Query: 122  IFAEAPSDAEVQSLLAHLSATA 143
            I AE PS+A+ Q+ L  L A A
Sbjct: 1388 IGAECPSEADAQTYLQQLGAPA 1409


>gi|410902161|ref|XP_003964563.1| PREDICTED: trinucleotide repeat-containing gene 6A protein-like
            [Takifugu rubripes]
          Length = 1162

 Score =  120 bits (300), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 82/184 (44%), Positives = 106/184 (57%), Gaps = 28/184 (15%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  A+  YS+++EA KAQ +L+ C+L
Sbjct: 991  WLVLKNLTPQIDGSTLRTLCMQHGPLNTFHLNLPHGNAVVCYSSKDEAAKAQKSLHMCVL 1050

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSALSNKD---------- 167
            GNTTI AE  S+ E+    A   + A         + GW    S+ S  D          
Sbjct: 1051 GNTTILAEFASEEEINRFFAQGQSLAT-------PSSGWQAVGSSQSRMDQSHHFPSRAP 1103

Query: 168  ---TWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDR-ATPSSLNSFLPGDLL--G 221
                W+S     ++S LWG  SN SS  SLWG P      R ++PS ++SFLP D L  G
Sbjct: 1104 EPNQWNS--SDLHSSSLWGG-SNYSS--SLWGTPGGTETGRMSSPSPISSFLPVDHLAGG 1158

Query: 222  GESM 225
            G+SM
Sbjct: 1159 GDSM 1162


>gi|355725522|gb|AES08584.1| trinucleotide repeat containing 6C [Mustela putorius furo]
          Length = 431

 Score =  119 bits (299), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 85/213 (39%), Positives = 115/213 (53%), Gaps = 19/213 (8%)

Query: 3   SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
           S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 216 SHELWKVPRSTAAPTRPPPGL---ANPKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 272

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 273 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 332

Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTGGWARGSS------ALSNKDTWS 170
           GNTTI AE   + EV   LA   A    ++  ++ GTG    G+S        S+   WS
Sbjct: 333 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSTGTGQTRLGASGSSHGLVRSDAGHWS 392

Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLD 201
           +    G G++  LWG    P    SLWG P  D
Sbjct: 393 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSD 423


>gi|432869446|ref|XP_004071751.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
            [Oryzias latipes]
          Length = 1798

 Score =  119 bits (298), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 97/261 (37%), Positives = 130/261 (49%), Gaps = 61/261 (23%)

Query: 3    SNDLWGPPK--------PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGW 54
            S++LW  P+        P  PPPG+       PS+ W         GGN+ G +Q   GW
Sbjct: 1561 SHELWKVPQGPRSGTAAPSRPPPGLTNTK---PSSTW---------GGNSLGLAQ---GW 1605

Query: 55   S-----------------GTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALA 97
            S                  +W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ 
Sbjct: 1606 SNSYTAGTTWSTDSSTRASSWLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVV 1665

Query: 98   KYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAH---------LSATANNNNN 148
            +YS+++E+ KAQ +L+ C+LGNTTI AE   + EV    A            A+   N +
Sbjct: 1666 RYSSKDESAKAQKSLHMCVLGNTTILAEFAGEEEVNRFFAQGQSLGGTTSWQASPGTNQS 1725

Query: 149  NNGGTGGWARGSSALSNKDTWSSGGGGGNTSQ--LWGTPSNPSSGGSLWGAPPLDSVDRA 206
              GG G        +     W+S   G ++S   LWG     S   SLWG PP     R 
Sbjct: 1726 RMGGAG----AHHPIGQSPHWNSNSNGSSSSSKLLWGGVQQYS---SLWG-PPSGEEGRV 1777

Query: 207  --TPSSLNSFLPGDLLGGESM 225
              +P+ +N+ LPGDLL GESM
Sbjct: 1778 MGSPTPINTLLPGDLLSGESM 1798


>gi|326666077|ref|XP_689365.4| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
            [Danio rerio]
          Length = 1696

 Score =  117 bits (294), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 82/187 (43%), Positives = 103/187 (55%), Gaps = 25/187 (13%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1516 WLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1575

Query: 118  GNTTIFAEAPSDAEV-------QSLLAHLSATANNNNNNNGGTGGWARGSSALSNKDTWS 170
            GNTTI AE   + EV       QSL    S  AN   N     GG   G++A      W+
Sbjct: 1576 GNTTILAEFAGEEEVNRFFAQGQSLTPTTSWQANPGTNQTRLGGG---GTAATHPIGHWN 1632

Query: 171  SGGGG-----------GNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGD 218
            S   G            +   LWG     S   SLWG P   D     +P+ +N+ LPGD
Sbjct: 1633 SSSLGGGGAGTGSGGKASNELLWGGVPQYS---SLWGPPSAEDGRVVGSPTPINTLLPGD 1689

Query: 219  LLGGESM 225
            LL GESM
Sbjct: 1690 LLSGESM 1696


>gi|426239229|ref|XP_004013528.1| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
            6C protein [Ovis aries]
          Length = 1856

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 83/228 (36%), Positives = 110/228 (48%), Gaps = 46/228 (20%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1670 SHELWKVPRNTTAPTRPPPGLTN---PKPSSAWGASPLGWTSSYSSGSAWSTDASGRTSS 1726

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1727 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1786

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSALSNKDTWSSGGGGGN 177
            GNTTI AE   + EV   LA                        AL    +W        
Sbjct: 1787 GNTTILAEFAGEEEVNRFLAQ---------------------GQALPPTSSW-------- 1817

Query: 178  TSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSFLPGDLLGGESM 225
                      PS G S       D     +P+ +N+ LPGDLL GES+
Sbjct: 1818 ---------QPSPGTSQTRLSSDDGRVIGSPTPVNTLLPGDLLSGESI 1856


>gi|405965787|gb|EKC31141.1| hypothetical protein CGI_10028774 [Crassostrea gigas]
          Length = 1616

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 70/147 (47%), Positives = 90/147 (61%), Gaps = 12/147 (8%)

Query: 3    SNDLWGPPKPRG---PPPGMMGGGGKPPSNGWM-VRPNGGGGGGNTWGTSQPQGGWSG-- 56
            SN++WG P P+    PPPG++     P S  W  V       G  +   S     W G  
Sbjct: 1396 SNEVWGVPLPKNNSRPPPGLL-----PKSGNWTGVNRQHSWAGTTSSMLSGNSAAWDGIS 1450

Query: 57   TWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCI 116
            T ++LKNLTPQIDGSTL+TLC+QHGPLQ F+L L++  AL +Y ++EEA KAQ +LN C+
Sbjct: 1451 TCLMLKNLTPQIDGSTLRTLCMQHGPLQWFYLSLHNGQALVRYHSKEEAFKAQKSLNTCV 1510

Query: 117  LGNTTIFAEAPSDAEVQSLLAHLSATA 143
            LGNTTI A   S+AE  +  A  SA A
Sbjct: 1511 LGNTTIVANFVSEAEA-TRFAEQSAMA 1536


>gi|432871522|ref|XP_004071958.1| PREDICTED: trinucleotide repeat-containing gene 6A protein-like
            [Oryzias latipes]
          Length = 1840

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 74/178 (41%), Positives = 98/178 (55%), Gaps = 17/178 (9%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L H  A+  YS+++EA KAQ +L+ C+L
Sbjct: 1670 WLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNAVVCYSSKDEATKAQKSLHMCVL 1729

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSALSNKDTWSSGGGGGN 177
            GNTTI AE  S+ E+    A   + A         T GW    S+ S  D   S     +
Sbjct: 1730 GNTTIMAEFASEEEISRFFAQGQSLAT-------PTSGWQAIGSSQSRMDQSQSFPSRAS 1782

Query: 178  TSQLWGT-------PSNPSSGGSLWGAPPLDSVDRA-TPSSLNSFLPGDLL--GGESM 225
                W +         + S   +LWG P      R  +PS ++SFLP D L  GG+S+
Sbjct: 1783 EPNQWNSGELHGSSLWSRSYSSTLWGNPSSADPGRINSPSPISSFLPVDHLTGGGDSL 1840


>gi|47211044|emb|CAF93674.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 802

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 96/276 (34%), Positives = 129/276 (46%), Gaps = 74/276 (26%)

Query: 3   SNDLWGPPK--------PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGW 54
           S+DLW  P+        P  PPPG+       P++ W         GG + G +Q   GW
Sbjct: 548 SHDLWKVPQAPRSANTAPSRPPPGLTN---TKPASTW---------GGTSLGLAQ---GW 592

Query: 55  SGT-----------------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALA 97
           S +                 W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ 
Sbjct: 593 SSSYTTGTTWSTDSSTRTSSWLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGSAVV 652

Query: 98  KYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAH---------LSATANNNNN 148
           +YS+++EA KAQ +L+ C+LGNTTI AE   + EV    A            AT   N  
Sbjct: 653 RYSSKDEAAKAQKSLHMCVLGNTTILAEFAGEEEVNRFFAQGQLLGGTTSWQATPGTNQT 712

Query: 149 NNGGTGGWARGSSALSNKDTWSSGGGGGNTSQ-----------------LWGTPSNPSSG 191
             GG    A  +  + +   W++   G N++                  LWG     S  
Sbjct: 713 RMGGASSGA--AHPIGHSSHWNNNNNGSNSNSSSNSSGGGGAAKTGGELLWGGVQQYS-- 768

Query: 192 GSLWGAPPLDSVDRA--TPSSLNSFLPGDLLGGESM 225
            SLW  PP     R   +P+ +N+ LPGDLL GESM
Sbjct: 769 -SLW-RPPSAEEGRVMGSPTPINTLLPGDLLSGESM 802


>gi|47219653|emb|CAG02698.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1835

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 81/184 (44%), Positives = 105/184 (57%), Gaps = 28/184 (15%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++LKNLTPQIDGSTLKTLC+QHGPL  FHL L H  A+  YS+++EA KAQ +L+ C+L
Sbjct: 1664 WLVLKNLTPQIDGSTLKTLCMQHGPLITFHLNLPHGNAVVCYSSKDEAAKAQKSLHMCVL 1723

Query: 118  GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSALSNKD---------- 167
            GNTTI AE  S+ E+    A   + A         + GW    S+ S  D          
Sbjct: 1724 GNTTILAEFASEEEINRFFAQGQSLAT-------PSSGWQAVGSSQSRMDQSHHFPSRAP 1776

Query: 168  ---TWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDR-ATPSSLNSFLPGDLL--G 221
                W+S     ++S LWG P+  S   SLWG P      R ++PS ++SFLP D L  G
Sbjct: 1777 EPSQWNS--SDLHSSSLWGGPNYSS---SLWGTPGGSEAGRISSPSPISSFLPVDHLTGG 1831

Query: 222  GESM 225
            G+SM
Sbjct: 1832 GDSM 1835


>gi|327289345|ref|XP_003229385.1| PREDICTED: trinucleotide repeat-containing gene 6A protein-like
            [Anolis carolinensis]
          Length = 1965

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 94/156 (60%), Gaps = 23/156 (14%)

Query: 4    NDLWG---PPK----PRGPPPGMMGGGGKPPSNGW---MVRPNGGGGGGNTWGTSQ---- 49
            ++LW    PPK    P  PPPG+ G  G  P + W   + R  GGGGGG  W  S+    
Sbjct: 1695 HELWKVPLPPKSVAAPSRPPPGLTGQKG--PLSSWENPLQRFGGGGGGGAGWSASEGRYT 1752

Query: 50   PQGGWSGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTR 102
            P   W  +        ++LKNLTPQIDGSTL+TLC+QHGPL+ FHL L H  AL +YS++
Sbjct: 1753 PGSAWGESSSGRITNCLVLKNLTPQIDGSTLRTLCMQHGPLKTFHLNLPHGNALVRYSSK 1812

Query: 103  EEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAH 138
            EE +KAQ +L+ C+LGNTTI AE  S+ E+    A 
Sbjct: 1813 EEVVKAQKSLHMCVLGNTTILAEFASEEEISRFFAQ 1848


>gi|432113374|gb|ELK35786.1| Trinucleotide repeat-containing protein 6C protein [Myotis davidii]
          Length = 1695

 Score =  113 bits (282), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 64/141 (45%), Positives = 85/141 (60%), Gaps = 8/141 (5%)

Query: 3    SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
            S++LW  P+    P  PPPG+       PS+ W   P G      +    S    G + +
Sbjct: 1516 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1572

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC+QHGPL  FHL L    A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1573 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1632

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE   + EV   LA 
Sbjct: 1633 GNTTILAEFAGEEEVNRFLAQ 1653


>gi|47207588|emb|CAF90193.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 319

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 56/104 (53%), Positives = 69/104 (66%), Gaps = 2/104 (1%)

Query: 40  GGGNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKY 99
           G G+ W      GG    W+LL NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +Y
Sbjct: 201 GPGSPWNEGVSTGG--SCWLLLSNLTPQIDGSTLRTICMQHGPLLTFHLGLTQGSALIRY 258

Query: 100 STREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATA 143
           S+++EA+KAQG L+ C+LGNTTI AE  S+ EV    AH  A  
Sbjct: 259 SSQQEAVKAQGALHMCVLGNTTILAEFVSEDEVARYFAHSQAEV 302


>gi|426225804|ref|XP_004007052.1| PREDICTED: trinucleotide repeat-containing gene 6B protein [Ovis
            aries]
          Length = 1844

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1660 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1719

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1720 GNTTILAEFATDDEVSRFLAQ 1740


>gi|326672343|ref|XP_002663990.2| PREDICTED: trinucleotide repeat-containing gene 6B protein [Danio
            rerio]
          Length = 2020

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YS+R+EA KAQ  L+ C+L
Sbjct: 1779 WLVLSNLTPQIDGSTLRTICMQHGPLLTFHLGLTQGSALIRYSSRQEAAKAQSALHMCVL 1838

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  S+ EV    AH
Sbjct: 1839 GNTTILAEFVSEEEVARYFAH 1859


>gi|73969030|ref|XP_859340.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 2
            [Canis lupus familiaris]
          Length = 1726

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1542 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1601

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1602 GNTTILAEFATDDEVSRFLAQ 1622


>gi|119580772|gb|EAW60368.1| trinucleotide repeat containing 6B, isoform CRA_b [Homo sapiens]
 gi|119580773|gb|EAW60369.1| trinucleotide repeat containing 6B, isoform CRA_b [Homo sapiens]
          Length = 1527

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1343 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1402

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1403 GNTTILAEFATDDEVSRFLAQ 1423


>gi|338721305|ref|XP_001500122.3| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 1
            [Equus caballus]
          Length = 1726

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1542 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1601

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1602 GNTTILAEFATDDEVSRFLAQ 1622


>gi|296486914|tpg|DAA29027.1| TPA: trinucleotide repeat containing 6B [Bos taurus]
          Length = 1836

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1652 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1711

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1712 GNTTILAEFATDDEVSRFLAQ 1732


>gi|440903036|gb|ELR53750.1| Trinucleotide repeat-containing 6B protein, partial [Bos grunniens
            mutus]
          Length = 1835

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1651 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1710

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1711 GNTTILAEFATDDEVSRFLAQ 1731


>gi|338721304|ref|XP_003364347.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 2
            [Equus caballus]
          Length = 1836

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1652 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1711

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1712 GNTTILAEFATDDEVSRFLAQ 1732


>gi|119580777|gb|EAW60373.1| trinucleotide repeat containing 6B, isoform CRA_e [Homo sapiens]
          Length = 1759

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1575 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1634

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1635 GNTTILAEFATDDEVSRFLAQ 1655


>gi|410349311|gb|JAA41259.1| trinucleotide repeat containing 6B [Pan troglodytes]
          Length = 1722

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1538 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1597

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1598 GNTTILAEFATDDEVSRFLAQ 1618


>gi|351699314|gb|EHB02233.1| Trinucleotide repeat-containing gene 6B protein, partial
            [Heterocephalus glaber]
          Length = 1827

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1643 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1702

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1703 GNTTILAEFATDDEVSRFLAQ 1723


>gi|300798505|ref|NP_001179584.1| trinucleotide repeat-containing gene 6B protein [Bos taurus]
          Length = 1836

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1652 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1711

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1712 GNTTILAEFATDDEVSRFLAQ 1732


>gi|431900059|gb|ELK07994.1| Trinucleotide repeat-containing protein 6B protein [Pteropus alecto]
          Length = 1885

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1701 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1760

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1761 GNTTILAEFATDDEVSRFLAQ 1781


>gi|426394558|ref|XP_004063560.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 1
            [Gorilla gorilla gorilla]
          Length = 1723

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1539 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1598

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1599 GNTTILAEFATDDEVSRFLAQ 1619


>gi|148491080|ref|NP_055903.2| trinucleotide repeat-containing gene 6B protein isoform 2 [Homo
            sapiens]
          Length = 1723

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1539 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1598

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1599 GNTTILAEFATDDEVSRFLAQ 1619


>gi|403282937|ref|XP_003932888.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 2
            [Saimiri boliviensis boliviensis]
          Length = 1833

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1649 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1708

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1709 GNTTILAEFATDDEVSRFLAQ 1729


>gi|395753472|ref|XP_002831212.2| PREDICTED: trinucleotide repeat-containing gene 6B protein [Pongo
            abelii]
          Length = 1790

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1606 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1665

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1666 GNTTILAEFATDDEVSRFLAQ 1686


>gi|348569274|ref|XP_003470423.1| PREDICTED: trinucleotide repeat-containing gene 6B protein, partial
            [Cavia porcellus]
          Length = 1811

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1627 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1686

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1687 GNTTILAEFATDDEVSRFLAQ 1707


>gi|168269678|dbj|BAG09966.1| trinucleotide repeat-containing 6B protein [synthetic construct]
          Length = 1722

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1538 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1597

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1598 GNTTILAEFATDDEVSRFLAQ 1618


>gi|397502026|ref|XP_003821672.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 1
            [Pan paniscus]
          Length = 1723

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1539 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1598

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1599 GNTTILAEFATDDEVSRFLAQ 1619


>gi|395819715|ref|XP_003783225.1| PREDICTED: trinucleotide repeat-containing gene 6B protein [Otolemur
            garnettii]
          Length = 1837

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1653 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1712

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1713 GNTTILAEFATDDEVSRFLAQ 1733


>gi|297261128|ref|XP_001101111.2| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 2
            [Macaca mulatta]
          Length = 1832

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1648 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1707

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1708 GNTTILAEFATDDEVSRFLAQ 1728


>gi|426394560|ref|XP_004063561.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 2
            [Gorilla gorilla gorilla]
          Length = 1833

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1649 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1708

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1709 GNTTILAEFATDDEVSRFLAQ 1729


>gi|241982729|ref|NP_001155973.1| trinucleotide repeat-containing gene 6B protein isoform 1 [Homo
            sapiens]
 gi|229904901|sp|Q9UPQ9.4|TNR6B_HUMAN RecName: Full=Trinucleotide repeat-containing gene 6B protein
 gi|194377566|dbj|BAG57731.1| unnamed protein product [Homo sapiens]
          Length = 1833

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1649 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1708

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1709 GNTTILAEFATDDEVSRFLAQ 1729


>gi|397502028|ref|XP_003821673.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 2
            [Pan paniscus]
          Length = 1833

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1649 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1708

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1709 GNTTILAEFATDDEVSRFLAQ 1729


>gi|410965583|ref|XP_003989326.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 1
            [Felis catus]
          Length = 1840

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1656 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1715

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1716 GNTTILAEFATDDEVSRFLAQ 1736


>gi|383416819|gb|AFH31623.1| trinucleotide repeat-containing gene 6B protein isoform 1 [Macaca
            mulatta]
          Length = 1775

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1591 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1650

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1651 GNTTILAEFATDDEVSRFLAQ 1671


>gi|345776961|ref|XP_538361.3| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 1
            [Canis lupus familiaris]
          Length = 1836

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1652 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1711

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1712 GNTTILAEFATDDEVSRFLAQ 1732


>gi|296237974|ref|XP_002763956.1| PREDICTED: trinucleotide repeat-containing gene 6B protein
            [Callithrix jacchus]
          Length = 1834

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1650 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1709

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1710 GNTTILAEFATDDEVSRFLAQ 1730


>gi|383416817|gb|AFH31622.1| trinucleotide repeat-containing gene 6B protein isoform 2 [Macaca
            mulatta]
          Length = 1722

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1538 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1597

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1598 GNTTILAEFATDDEVSRFLAQ 1618


>gi|14133235|dbj|BAA83045.2| KIAA1093 protein [Homo sapiens]
          Length = 1727

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1543 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1602

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1603 GNTTILAEFATDDEVSRFLAQ 1623


>gi|410965585|ref|XP_003989327.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 2
            [Felis catus]
          Length = 1730

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1546 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1605

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1606 GNTTILAEFATDDEVSRFLAQ 1626


>gi|355563693|gb|EHH20255.1| hypothetical protein EGK_03069 [Macaca mulatta]
 gi|355785008|gb|EHH65859.1| hypothetical protein EGM_02715 [Macaca fascicularis]
          Length = 1846

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1662 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1721

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1722 GNTTILAEFATDDEVSRFLAQ 1742


>gi|327272521|ref|XP_003221033.1| PREDICTED: trinucleotide repeat-containing gene 6B protein-like
           [Anolis carolinensis]
          Length = 1028

 Score =  110 bits (275), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 845 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 904

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  +D EV   LA 
Sbjct: 905 GNTTILAEFATDEEVSRFLAQ 925


>gi|444723822|gb|ELW64452.1| Trinucleotide repeat-containing 6B protein [Tupaia chinensis]
          Length = 2247

 Score =  110 bits (274), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 2063 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 2122

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 2123 GNTTILAEFATDDEVSRFLAQ 2143


>gi|67782330|ref|NP_001020014.1| trinucleotide repeat-containing gene 6B protein isoform 3 [Homo
           sapiens]
 gi|20306948|gb|AAH28626.1| TNRC6B protein [Homo sapiens]
 gi|119580771|gb|EAW60367.1| trinucleotide repeat containing 6B, isoform CRA_a [Homo sapiens]
 gi|119580775|gb|EAW60371.1| trinucleotide repeat containing 6B, isoform CRA_a [Homo sapiens]
          Length = 1029

 Score =  110 bits (274), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 845 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 904

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  +D EV   LA 
Sbjct: 905 GNTTILAEFATDDEVSRFLAQ 925


>gi|332231295|ref|XP_003264834.1| PREDICTED: trinucleotide repeat-containing gene 6B protein
           [Nomascus leucogenys]
          Length = 1029

 Score =  110 bits (274), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 845 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 904

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  +D EV   LA 
Sbjct: 905 GNTTILAEFATDDEVSRFLAQ 925


>gi|332859853|ref|XP_003317299.1| PREDICTED: trinucleotide repeat-containing gene 6B protein [Pan
           troglodytes]
          Length = 1028

 Score =  110 bits (274), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 844 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 903

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  +D EV   LA 
Sbjct: 904 GNTTILAEFATDDEVSRFLAQ 924


>gi|338721307|ref|XP_003364348.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform
           3 [Equus caballus]
          Length = 1032

 Score =  110 bits (274), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 848 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 907

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  +D EV   LA 
Sbjct: 908 GNTTILAEFATDDEVSRFLAQ 928


>gi|410965587|ref|XP_003989328.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform
           3 [Felis catus]
          Length = 1036

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 852 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 911

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  +D EV   LA 
Sbjct: 912 GNTTILAEFATDDEVSRFLAQ 932


>gi|449481809|ref|XP_004175955.1| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
            6B protein [Taeniopygia guttata]
          Length = 1831

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +Y+T++EA KAQ  L+ C+L
Sbjct: 1646 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYNTKQEAAKAQTALHMCVL 1705

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1706 GNTTILAEFATDEEVSRFLAQ 1726


>gi|355725504|gb|AES08578.1| trinucleotide repeat containing 6B [Mustela putorius furo]
          Length = 452

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 269 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 328

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  +D EV   LA 
Sbjct: 329 GNTTILAEFATDDEVSRFLAQ 349


>gi|449271928|gb|EMC82102.1| Trinucleotide repeat-containing gene 6B protein, partial [Columba
            livia]
          Length = 1667

 Score =  109 bits (273), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +Y+T++EA KAQ  L+ C+L
Sbjct: 1483 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYNTKQEAAKAQTALHMCVL 1542

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1543 GNTTILAEFATDEEVSRFLAQ 1563


>gi|402884320|ref|XP_003905634.1| PREDICTED: trinucleotide repeat-containing gene 6B protein-like,
           partial [Papio anubis]
          Length = 459

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 275 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 334

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  +D EV   LA 
Sbjct: 335 GNTTILAEFATDDEVSRFLAQ 355


>gi|334347952|ref|XP_003342002.1| PREDICTED: trinucleotide repeat-containing gene 6B protein-like
            [Monodelphis domestica]
          Length = 1797

 Score =  109 bits (273), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 48/79 (60%), Positives = 59/79 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1610 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1669

Query: 118  GNTTIFAEAPSDAEVQSLL 136
            GNTTI AE  +D EV   L
Sbjct: 1670 GNTTILAEFATDEEVSRFL 1688


>gi|281351171|gb|EFB26755.1| hypothetical protein PANDA_002541 [Ailuropoda melanoleuca]
          Length = 1690

 Score =  109 bits (273), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +Y+T++EA KAQ  L+ C+L
Sbjct: 1506 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYNTKQEAAKAQTALHMCVL 1565

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  +D EV   LA 
Sbjct: 1566 GNTTILAEFATDDEVSRFLAQ 1586


>gi|395538134|ref|XP_003771040.1| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
            6B protein [Sarcophilus harrisii]
          Length = 1823

 Score =  109 bits (272), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 48/79 (60%), Positives = 59/79 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1644 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1703

Query: 118  GNTTIFAEAPSDAEVQSLL 136
            GNTTI AE  +D EV   L
Sbjct: 1704 GNTTILAEFATDEEVSRFL 1722


>gi|159110982|ref|NP_796098.3| trinucleotide repeat-containing gene 6B protein isoform 2 [Mus
            musculus]
          Length = 1774

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1590 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1649

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  ++ EV   LA 
Sbjct: 1650 GNTTILAEFATEDEVSRFLAQ 1670


>gi|67782332|ref|NP_659061.2| trinucleotide repeat-containing gene 6B protein isoform 1 [Mus
            musculus]
 gi|229891742|sp|Q8BKI2.2|TNR6B_MOUSE RecName: Full=Trinucleotide repeat-containing gene 6B protein
          Length = 1810

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1626 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1685

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  ++ EV   LA 
Sbjct: 1686 GNTTILAEFATEDEVSRFLAQ 1706


>gi|148672646|gb|EDL04593.1| trinucleotide repeat containing 6b, isoform CRA_a [Mus musculus]
          Length = 1817

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1633 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1692

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  ++ EV   LA 
Sbjct: 1693 GNTTILAEFATEDEVSRFLAQ 1713


>gi|198041672|ref|NP_620200.2| trinucleotide repeat-containing gene 6B protein [Rattus norvegicus]
          Length = 1818

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1634 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1693

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  ++ EV   LA 
Sbjct: 1694 GNTTILAEFATEDEVSRFLAQ 1714


>gi|344246759|gb|EGW02863.1| Trinucleotide repeat-containing gene 6B protein [Cricetulus griseus]
          Length = 1810

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1626 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1685

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  ++ EV   LA 
Sbjct: 1686 GNTTILAEFATEDEVSRFLAQ 1706


>gi|354490746|ref|XP_003507517.1| PREDICTED: trinucleotide repeat-containing gene 6B protein
            [Cricetulus griseus]
          Length = 1913

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1729 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1788

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  ++ EV   LA 
Sbjct: 1789 GNTTILAEFATEDEVSRFLAQ 1809


>gi|344296348|ref|XP_003419871.1| PREDICTED: trinucleotide repeat-containing gene 6B protein [Loxodonta
            africana]
          Length = 1557

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 48/79 (60%), Positives = 59/79 (74%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 1373 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1432

Query: 118  GNTTIFAEAPSDAEVQSLL 136
            GNTTI AE  +D EV   L
Sbjct: 1433 GNTTILAEFATDDEVSRFL 1451


>gi|26342470|dbj|BAC34897.1| unnamed protein product [Mus musculus]
          Length = 812

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 628 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 687

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  ++ EV   LA 
Sbjct: 688 GNTTILAEFATEDEVSRFLAQ 708


>gi|149065870|gb|EDM15743.1| androgen receptor-related apoptosis-associated protein CBL27,
           isoform CRA_a [Rattus norvegicus]
          Length = 1005

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 821 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 880

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  ++ EV   LA 
Sbjct: 881 GNTTILAEFATEDEVSRFLAQ 901


>gi|74184073|dbj|BAE37059.1| unnamed protein product [Mus musculus]
          Length = 838

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 654 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 713

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  ++ EV   LA 
Sbjct: 714 GNTTILAEFATEDEVSRFLAQ 734


>gi|74183955|dbj|BAE37027.1| unnamed protein product [Mus musculus]
          Length = 731

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 547 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 606

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  ++ EV   LA 
Sbjct: 607 GNTTILAEFATEDEVSRFLAQ 627


>gi|28972612|dbj|BAC65722.1| mKIAA1093 protein [Mus musculus]
          Length = 571

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 387 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 446

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  ++ EV   LA 
Sbjct: 447 GNTTILAEFATEDEVSRFLAQ 467


>gi|38197636|gb|AAH61751.1| Tnrc6b protein [Rattus norvegicus]
          Length = 432

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 248 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 307

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  ++ EV   LA 
Sbjct: 308 GNTTILAEFATEDEVSRFLAQ 328


>gi|51873840|gb|AAH80750.1| Tnrc6b protein, partial [Mus musculus]
          Length = 312

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 128 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 187

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  ++ EV   LA 
Sbjct: 188 GNTTILAEFATEDEVSRFLAQ 208


>gi|9295520|gb|AAF86977.1|AF275151_1 androgen receptor-related apoptosis-associated protein CBL27
           [Rattus norvegicus]
 gi|109730969|gb|AAI17549.1| Tnrc6b protein [Mus musculus]
 gi|109735037|gb|AAI18055.1| Tnrc6b protein [Mus musculus]
          Length = 249

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 65  WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 124

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  ++ EV   LA 
Sbjct: 125 GNTTILAEFATEDEVSRFLAQ 145


>gi|17028428|gb|AAH17531.1| Tnrc6b protein [Mus musculus]
 gi|26341766|dbj|BAC34545.1| unnamed protein product [Mus musculus]
          Length = 249

 Score =  107 bits (267), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+ C+L
Sbjct: 65  WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 124

Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
           GNTTI AE  ++ EV   LA 
Sbjct: 125 GNTTILAEFATEDEVSRFLAQ 145


>gi|260792404|ref|XP_002591205.1| hypothetical protein BRAFLDRAFT_131100 [Branchiostoma floridae]
 gi|229276408|gb|EEN47216.1| hypothetical protein BRAFLDRAFT_131100 [Branchiostoma floridae]
          Length = 1431

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 47/79 (59%), Positives = 60/79 (75%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++LKNLTPQIDGSTL+TLC+QHGPL  FHL L+   AL  Y ++EEA KAQ +L+ C+L
Sbjct: 1203 WLILKNLTPQIDGSTLRTLCMQHGPLLTFHLNLSQGCALVCYMSKEEAAKAQKSLHTCVL 1262

Query: 118  GNTTIFAEAPSDAEVQSLL 136
            GNTTI A+  S+ E + L 
Sbjct: 1263 GNTTILADFISEDEARRLF 1281


>gi|432921828|ref|XP_004080242.1| PREDICTED: uncharacterized protein LOC101163614 [Oryzias latipes]
          Length = 1885

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 60/82 (73%)

Query: 57   TWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCI 116
             W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +Y +++EA KA+  L+ C+
Sbjct: 1336 CWLVLSNLTPQIDGSTLRTICMQHGPLLTFHLGLTQGTALIRYGSKQEASKARSALHMCV 1395

Query: 117  LGNTTIFAEAPSDAEVQSLLAH 138
            LGNTTI AE  S+ +V   +AH
Sbjct: 1396 LGNTTILAEFVSEEDVARYIAH 1417


>gi|291239873|ref|XP_002739846.1| PREDICTED: hypothetical protein, partial [Saccoglossus kowalevskii]
          Length = 1669

 Score =  102 bits (253), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%), Gaps = 1/81 (1%)

Query: 58   WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
            W++L+NLTPQIDGSTL+TLC QHGPL  FHL L+   AL +Y TR+EA KAQ  L+ C+L
Sbjct: 1470 WLVLRNLTPQIDGSTLQTLCKQHGPLHTFHLNLSQGQALIQYGTRDEAAKAQKALHMCVL 1529

Query: 118  GNTTIFAEAPSDAEVQSLLAH 138
            GNTTI AE  S +E+  +L  
Sbjct: 1530 GNTTIMAEF-SSSEMTRMLER 1549


>gi|410896158|ref|XP_003961566.1| PREDICTED: trinucleotide repeat-containing gene 6B protein-like
            [Takifugu rubripes]
          Length = 2001

 Score =  102 bits (253), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 62/147 (42%), Positives = 78/147 (53%), Gaps = 27/147 (18%)

Query: 15   PPPGMMGGGGKPPSNGWMVRPNGGGGGGN------------------TWGTSQPQGGWSG 56
            PPPG+       PS      P  GGG  N                  TW     Q     
Sbjct: 1717 PPPGLGNQKQPSPS------PWSGGGPRNSILGLGTQNQTFFLCTVSTWSDGSAQ---ES 1767

Query: 57   TWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCI 116
             W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +Y++++EA KAQ  L+ C+
Sbjct: 1768 CWLVLSNLTPQIDGSTLRTICMQHGPLLTFHLGLTQGTALIRYNSKQEAAKAQSALHMCV 1827

Query: 117  LGNTTIFAEAPSDAEVQSLLAHLSATA 143
            LGNTTI AE  S+ +V   +AH  A A
Sbjct: 1828 LGNTTILAEFVSEEDVARYIAHSQAGA 1854


>gi|47226125|emb|CAG04499.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1191

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 63/158 (39%), Positives = 84/158 (53%), Gaps = 28/158 (17%)

Query: 2    SSNDLWGPP-----KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGG-WS 55
            S N L  PP     + +  P    GGG +    GW       G G +T G++   GG   
Sbjct: 879  SQNQLSRPPPGLGSQKQPSPSPWSGGGPRFAGRGW-------GSGSSTTGSAWSDGGAQE 931

Query: 56   GTWVLLKNLTPQ---------------IDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYS 100
              W++L NLTPQ               IDGSTL+T+C+QHGPL  FHL L    AL +Y+
Sbjct: 932  SCWLVLSNLTPQVITDGVTATEAEVDAIDGSTLRTICMQHGPLLTFHLGLTQGNALIRYN 991

Query: 101  TREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAH 138
            +++EA KAQ  L+ C+LGNTTI AE  S+ +V   +AH
Sbjct: 992  SKQEAAKAQSALHMCVLGNTTILAEFVSEEDVARYIAH 1029


>gi|170052147|ref|XP_001862090.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167873115|gb|EDS36498.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1332

 Score = 89.7 bits (221), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 38/61 (62%), Positives = 51/61 (83%)

Query: 79   QHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAH 138
            +HGPL  FH+YL+H +AL KYS+R+EA KAQ  LNNC+LGNTTI AE P++++VQ++L H
Sbjct: 1198 KHGPLLAFHVYLHHGIALCKYSSRDEATKAQLALNNCMLGNTTICAEIPTESDVQNILQH 1257

Query: 139  L 139
            L
Sbjct: 1258 L 1258


>gi|148685348|gb|EDL17295.1| mCG20982, isoform CRA_f [Mus musculus]
          Length = 176

 Score = 86.7 bits (213), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 55/157 (35%), Positives = 84/157 (53%), Gaps = 16/157 (10%)

Query: 78  VQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLA 137
           +QHGPL  FHL L H  AL +YS++EE +KAQ +L+ C+LGNTTI AE  S+ E+    A
Sbjct: 1   MQHGPLITFHLNLPHGNALVRYSSKEEVVKAQKSLHMCVLGNTTILAEFASEEEISRFFA 60

Query: 138 HLSATANN-------NNNNNGGTGGWARGSSALSNKDTWSSGGGGG------NTSQLWGT 184
              +   +       ++ +  G+   +   S+ ++ + W+  G  G      + + LWGT
Sbjct: 61  QSQSLTPSPGWQSLGSSQSRLGSLDCSHSFSSRTDVNHWNGAGLSGANCGDLHGTSLWGT 120

Query: 185 PSNPSSGGSLWGAPPLDSVDRATPSSLNSFLPGDLLG 221
           P   +   SLWG P  D    ++PS +N+FL  D L 
Sbjct: 121 PHYST---SLWGPPSSDPRGISSPSPINAFLSVDHLA 154


>gi|149067988|gb|EDM17540.1| trinucleotide repeat containing 6 (predicted), isoform CRA_b
           [Rattus norvegicus]
          Length = 178

 Score = 83.6 bits (205), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 57/159 (35%), Positives = 85/159 (53%), Gaps = 19/159 (11%)

Query: 78  VQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLA 137
           +QHGPL  FHL L H  AL +YS++EE +KAQ +L+ C+LGNTTI AE  S+ E+    A
Sbjct: 1   MQHGPLITFHLNLPHGNALVRYSSKEEVVKAQKSLHMCVLGNTTILAEFASEEEISRFFA 60

Query: 138 HLSATANN-------NNNNNGGTGGWARGSSALSNKDTWSSGGGGGNT------SQLWGT 184
              +   +       ++ +  G+   +   S+ ++ + W+  G  G        + LWGT
Sbjct: 61  QSQSLTPSPGWQSLGSSQSRLGSLDCSHSFSSRTDLNHWNGAGLSGTNCGDLHGTSLWGT 120

Query: 185 PSNPSSGGSLWGAPPLDSVDR--ATPSSLNSFLPGDLLG 221
           P   +   SLWG PP  S  R  ++PS +N+FL  D L 
Sbjct: 121 PHYST---SLWG-PPSSSDPRGISSPSPINAFLSVDHLA 155


>gi|194385232|dbj|BAG64993.1| unnamed protein product [Homo sapiens]
          Length = 762

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 35/56 (62%), Positives = 44/56 (78%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLN 113
           W++L NLTPQIDGSTL+T+C+QHGPL  FHL L    AL +YST++EA KAQ  L+
Sbjct: 706 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALH 761


>gi|335287525|ref|XP_003355376.1| PREDICTED: trinucleotide repeat-containing gene 6B protein-like
           [Sus scrofa]
          Length = 165

 Score = 73.6 bits (179), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 34/61 (55%), Positives = 41/61 (67%)

Query: 78  VQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLA 137
           +QHGPL  FHL L    AL +YST++EA KAQ  L+ C+LGNTTI AE  +D EV   LA
Sbjct: 1   MQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVLGNTTILAEFATDDEVSRFLA 60

Query: 138 H 138
            
Sbjct: 61  Q 61


>gi|363727808|ref|XP_416246.3| PREDICTED: trinucleotide repeat-containing gene 6B protein [Gallus
           gallus]
          Length = 165

 Score = 70.9 bits (172), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 32/61 (52%), Positives = 40/61 (65%)

Query: 78  VQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLA 137
           +QHGPL  FHL L    AL +Y+T++EA KAQ  L+ C+LGNTTI AE  +D EV   L 
Sbjct: 1   MQHGPLLTFHLNLTQGTALIRYNTKQEAAKAQTALHMCVLGNTTILAEFATDEEVSRFLT 60

Query: 138 H 138
            
Sbjct: 61  Q 61


>gi|198433645|ref|XP_002122194.1| PREDICTED: similar to trinucleotide repeat containing 6B [Ciona
           intestinalis]
          Length = 964

 Score = 70.5 bits (171), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 75/260 (28%), Positives = 114/260 (43%), Gaps = 52/260 (20%)

Query: 13  RGPPPGMMGGGGKP-----------PSNGWMVRPNGGGGGGNTWGTSQPQGGWS------ 55
           R PPPG+ G   +P           P+N W         GGN W + Q Q          
Sbjct: 710 RPPPPGIGGSAFRPTKELAPTWENVPNNSWDQNMTRTSQGGNNWASQQQQQQPQQPQQQQ 769

Query: 56  -----GTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFH-LYLNHSLALAKYSTREEAIKAQ 109
                G+W++L N   Q+D + ++ LC+QHG + +F   Y    +AL +Y++ E+A  A+
Sbjct: 770 QQESLGSWLVLTNFNQQVDVAGVRQLCMQHGNMVSFQGHYPIEGMALVRYASPEDAANAK 829

Query: 110 GNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATA------------NNNNNNNGGTGGWA 157
             LN  + G+T + A   +D EV +    ++ATA            +   +N+G TGG+ 
Sbjct: 830 KALNMFMAGSTMLVATVATDHEVANF---VNATAGGSWGSTAGTPGSRFVSNSGSTGGFI 886

Query: 158 ----RGSSALSNKDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPP--------LDSVDR 205
               +  S   + D  S         QLWG  SNP  G      PP         + + R
Sbjct: 887 SQPPQNPSLAISSDAASVSSQQQQQQQLWGN-SNPQGGNWPSNMPPSMPWSGTSSEDMSR 945

Query: 206 ATPSSLNSFLPGDLLGGESM 225
              S L++ LP +LLGGE+M
Sbjct: 946 IM-SPLHTLLPENLLGGETM 964


>gi|195999420|ref|XP_002109578.1| hypothetical protein TRIADDRAFT_53746 [Trichoplax adhaerens]
 gi|190587702|gb|EDV27744.1| hypothetical protein TRIADDRAFT_53746 [Trichoplax adhaerens]
          Length = 1438

 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 40/69 (57%)

Query: 59   VLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILG 118
            +L+K  + Q+D + L+ LC+QHG +  F            YS+ +EA++AQ  LNNC + 
Sbjct: 1268 ILIKGFSSQVDENLLQALCLQHGRITEFVFDPRKRAVFVSYSSVDEAVRAQSRLNNCKIM 1327

Query: 119  NTTIFAEAP 127
            ++T+ A  P
Sbjct: 1328 DSTLEASFP 1336


>gi|156375504|ref|XP_001630120.1| predicted protein [Nematostella vectensis]
 gi|156217135|gb|EDO38057.1| predicted protein [Nematostella vectensis]
          Length = 1727

 Score = 57.4 bits (137), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 50/95 (52%), Gaps = 19/95 (20%)

Query: 57   TWVLLKNLTP-------------------QIDGSTLKTLCVQHGPLQNFHLYLNHSLALA 97
            TW++L+NL+P                   Q D + ++ +C Q+GPL  F L L H  +L 
Sbjct: 1499 TWLVLRNLSPRADCFPLSLRLSSTISDFFQADPTAMRAVCQQYGPLLTFTLNLRHGNSLI 1558

Query: 98   KYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEV 132
            +YS +++A  A+ NLN  ++    + A+  +D+++
Sbjct: 1559 RYSNKDQAASARNNLNGMMVKGMQLIADFATDSDI 1593


>gi|156340471|ref|XP_001620456.1| hypothetical protein NEMVEDRAFT_v1g223093 [Nematostella vectensis]
 gi|156205405|gb|EDO28356.1| predicted protein [Nematostella vectensis]
          Length = 199

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 42/77 (54%)

Query: 69  DGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPS 128
           D + ++ +C Q+GPL  F L L H  +L +YS +++A  A+ NLN  ++    + A+  +
Sbjct: 2   DPTAMRAVCQQYGPLLTFTLNLRHGNSLIRYSNKDQAASARNNLNGMMVKGMQLIADFAT 61

Query: 129 DAEVQSLLAHLSATANN 145
           D+++          +NN
Sbjct: 62  DSDIGGFFEQTPDWSNN 78


>gi|21430084|gb|AAM50720.1| GM23685p [Drosophila melanogaster]
          Length = 215

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 20/28 (71%), Positives = 25/28 (89%)

Query: 112 LNNCILGNTTIFAEAPSDAEVQSLLAHL 139
           LNNC+L NTTIFAE+PS+ EVQS++ HL
Sbjct: 3   LNNCVLANTTIFAESPSENEVQSIMQHL 30


>gi|340371985|ref|XP_003384525.1| PREDICTED: hypothetical protein LOC100636235 [Amphimedon
            queenslandica]
          Length = 2381

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/89 (26%), Positives = 52/89 (58%), Gaps = 1/89 (1%)

Query: 57   TWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLN-NC 115
            ++++L+N+TPQID ++L+ +C ++G +    +   +   L +YST+EEA  A+  L+ N 
Sbjct: 2181 SFIVLRNVTPQIDETSLREVCSEYGKVLACTINSFNESVLIRYSTKEEAALAKSGLDRNP 2240

Query: 116  ILGNTTIFAEAPSDAEVQSLLAHLSATAN 144
             +    +  +  S+A++ S     + ++N
Sbjct: 2241 SICGVYVNPQFASEADISSFSDQRTPSSN 2269


>gi|345565557|gb|EGX48506.1| hypothetical protein AOL_s00080g135 [Arthrobotrys oligospora ATCC
           24927]
          Length = 147

 Score = 43.9 bits (102), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 26/82 (31%), Positives = 40/82 (48%), Gaps = 6/82 (7%)

Query: 50  PQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHS------LALAKYSTRE 103
           P     G  VL+ N+  +     L  L  +HG +QN HL L+         AL +YST+E
Sbjct: 23  PSRSIEGWIVLVTNVHEEAGEEDLNDLFAEHGEVQNLHLNLDRRTGYVKGYALVEYSTKE 82

Query: 104 EAIKAQGNLNNCILGNTTIFAE 125
           EA  A  +++   L + T+ A+
Sbjct: 83  EAQSAIDSIDGSKLLDQTVSAD 104


>gi|384500774|gb|EIE91265.1| hypothetical protein RO3G_15976 [Rhizopus delemar RA 99-880]
          Length = 256

 Score = 42.4 bits (98), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 44/87 (50%), Gaps = 7/87 (8%)

Query: 43  NTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLY-----LNHSLALA 97
           NTW  S P+   S  ++++KN++PQ    T+K   +  G ++ F L        H +AL 
Sbjct: 3   NTWTISVPETP-SPNYIVVKNISPQSSEQTVKEFFLFCGKIKEFELKNDEEDEKHKIALV 61

Query: 98  KYSTREEAIKAQGNLNNCILGNTTIFA 124
            +  RE A K    L+N ++ ++ I A
Sbjct: 62  HFE-RESAAKTAALLSNALIDDSHIVA 87


>gi|390341101|ref|XP_786178.3| PREDICTED: uncharacterized protein LOC581062 [Strongylocentrotus
            purpuratus]
          Length = 2930

 Score = 41.2 bits (95), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 51/191 (26%), Positives = 81/191 (42%), Gaps = 47/191 (24%)

Query: 56   GTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNC 115
            G  +L+  +T  ++ + L+ +C Q G +  F         +  Y   ++A KA   +++ 
Sbjct: 2758 GNSILISGVTSDVNVTALRNICGQQGQVDQFQENRAQGSVMVAYRFPDDAAKALAIISSA 2817

Query: 116  ILGNTTIFAE--APSDA-----------------------EVQSLLAHLSATANNNNNNN 150
                  I AE  +PSDA                          S+L   S+ +    +++
Sbjct: 2818 F---PNIIAELVSPSDAFSTPASSSSSGWPQGGGSSGGSKFGNSVLPSTSSASGAGKDDS 2874

Query: 151  GGTGGWARGSSALSNKDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSS 210
            GG+            K  WS+G  G   SQLW    +P  GGS+ G  P+D  D ++ S+
Sbjct: 2875 GGS------------KQNWSAGLPGMPGSQLW----SPGPGGSM-GWSPMDG-DSSSASN 2916

Query: 211  LNSFLPGDLLG 221
              SFLPGDLLG
Sbjct: 2917 F-SFLPGDLLG 2926


>gi|301609397|ref|XP_002934255.1| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
            6B protein-like [Xenopus (Silurana) tropicalis]
          Length = 1869

 Score = 40.8 bits (94), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 38/84 (45%), Gaps = 4/84 (4%)

Query: 60   LLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGN 119
            + KNLT Q + S  KT        Q   ++L+ SL  A   +    +    +   C+LGN
Sbjct: 1696 ISKNLT-QKEKSQKKTTXKNKS--QKLRIHLD-SLVYAAMPSSHXXLLLSPSQPRCVLGN 1751

Query: 120  TTIFAEAPSDAEVQSLLAHLSATA 143
            TTI AE  +D EV   LA     A
Sbjct: 1752 TTILAEFATDEEVSRYLAQAQPPA 1775


>gi|19172018|gb|AAL85701.1|AF474982_5 Mei2-like protein [Hordeum vulgare subsp. vulgare]
          Length = 961

 Score = 40.8 bits (94), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 39/161 (24%), Positives = 60/161 (37%), Gaps = 18/161 (11%)

Query: 28  SNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFH 87
            N  +++ NGG   G T     P G      + ++N+   ++ + LK L  Q+G +Q  +
Sbjct: 205 ENNKLLKHNGGANTGQTGLNGLPYGENPSRTLFIRNINANVEDTELKLLFEQYGDIQTLY 264

Query: 88  L-YLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQ-SLLAHLSATANN 145
             Y +H L +  Y     A +A   L               S    Q  L  H S    N
Sbjct: 265 TAYKHHGLVIISYYDIRSAERAMKALQ--------------SKPFRQWKLEIHYSIPKEN 310

Query: 146 --NNNNNGGTGGWARGSSALSNKDTWSSGGGGGNTSQLWGT 184
              N+NN GT        +++N D     GG G    +  T
Sbjct: 311 LLENDNNQGTLAVINLDQSVTNDDLRHIFGGYGEIKAIHET 351


>gi|443716715|gb|ELU08106.1| hypothetical protein CAPTEDRAFT_185432 [Capitella teleta]
          Length = 1399

 Score = 38.9 bits (89), Expect = 1.5,   Method: Composition-based stats.
 Identities = 25/98 (25%), Positives = 46/98 (46%), Gaps = 6/98 (6%)

Query: 58  WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
           +V ++ L P +  S L+ L   HG +++  +      A  K+   ++A+KA+   NN  +
Sbjct: 22  YVFIEGLAPGVSISRLRILFSDHGVVEDVQVREEDHCAWIKFKQAKDALKAKKLTNNTAI 81

Query: 118 GNTTI----FAEAPSDAEVQSLLAHLSATANNNNNNNG 151
           GNT +     +E PS   V  +L+    +     +N G
Sbjct: 82  GNTRVKVVTLSEEPS--RVIRILSAFCGSLTGKGHNGG 117


>gi|159473631|ref|XP_001694937.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158276316|gb|EDP02089.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1623

 Score = 38.1 bits (87), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 23/89 (25%), Positives = 40/89 (44%), Gaps = 11/89 (12%)

Query: 36  NGGGGGGNT---WGTSQPQG--------GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQ 84
           N  GG G++      +QP+G         W    + L NL P   G+ L+ L   +GPL+
Sbjct: 266 NANGGEGSSRLLLAATQPRGQLHQSVAQSWEARHLWLGNLLPTTTGAQLERLFAPYGPLE 325

Query: 85  NFHLYLNHSLALAKYSTREEAIKAQGNLN 113
           +  ++ + + A   + T + A  A+  L 
Sbjct: 326 SVRVFADRNFAFVNFMTAQHASTAKAALE 354


>gi|449470045|ref|XP_004152729.1| PREDICTED: polyadenylate-binding protein RBP47C-like [Cucumis
           sativus]
 gi|449496017|ref|XP_004160013.1| PREDICTED: polyadenylate-binding protein RBP47C-like [Cucumis
           sativus]
          Length = 429

 Score = 37.4 bits (85), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 24/108 (22%), Positives = 47/108 (43%), Gaps = 2/108 (1%)

Query: 17  PGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTS--QPQGGWSGTWVLLKNLTPQIDGSTLK 74
           P  +G      S+G+  + +  G   N   +   Q  G ++ T + +  L P +    LK
Sbjct: 256 PMRIGAATPKKSSGYQQQYSSQGYASNGSFSHGHQSDGDFTNTTIFIGGLDPNVTDEDLK 315

Query: 75  TLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTI 122
            L  QHG + +  + +       +++ R+ A +A   LN  ++G  T+
Sbjct: 316 QLFSQHGEIVSVKIPVGKGCGFIQFANRKNAEEALQKLNGTVIGKQTV 363


>gi|66808185|ref|XP_637815.1| SAP DNA-binding domain-containing protein [Dictyostelium discoideum
           AX4]
 gi|60466244|gb|EAL64306.1| SAP DNA-binding domain-containing protein [Dictyostelium discoideum
           AX4]
          Length = 421

 Score = 36.6 bits (83), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 22/76 (28%), Positives = 39/76 (51%), Gaps = 3/76 (3%)

Query: 59  VLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCI-- 116
           +L+  L        ++TL  ++G ++N+ +    S     YST EEAIKA+ +LN  +  
Sbjct: 228 ILISKLVRPFRVDMIETLMNEYGSVKNYWMNSVKSFCFVTYSTSEEAIKARNSLNGLVWP 287

Query: 117 -LGNTTIFAEAPSDAE 131
            L  + +  E  S++E
Sbjct: 288 PLNRSKLIVEFSSESE 303


>gi|239817964|ref|YP_002946874.1| Crp/Fnr family transcriptional regulator [Variovorax paradoxus
           S110]
 gi|239804541|gb|ACS21608.1| transcriptional regulator, Crp/Fnr family [Variovorax paradoxus
           S110]
          Length = 261

 Score = 36.2 bits (82), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 19/71 (26%), Positives = 33/71 (46%)

Query: 29  NGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHL 88
            G +   N    GG+   T  P GGW G   ++K    + +   L+   V   P+++FH 
Sbjct: 75  EGLLKMSNDNADGGSVTYTGVPPGGWFGEGTVMKREPYRYNIQALRRSVVAGLPIESFHW 134

Query: 89  YLNHSLALAKY 99
            L+HS+   ++
Sbjct: 135 LLDHSIGFNRF 145


>gi|123485368|ref|XP_001324476.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121907359|gb|EAY12253.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 576

 Score = 36.2 bits (82), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 40/72 (55%)

Query: 54  WSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLN 113
           +S T +++KNL  +     L+ +    G L  F L   HS+A+ +++  ++A KA  +LN
Sbjct: 375 YSKTVLIIKNLRWETTEEELRGIFASKGTLVRFVLAPTHSVAIVEFARGDDARKAFNSLN 434

Query: 114 NCILGNTTIFAE 125
             +L +T I+ +
Sbjct: 435 YRLLHDTPIYIQ 446


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.309    0.131    0.422 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,869,147,268
Number of Sequences: 23463169
Number of extensions: 262593790
Number of successful extensions: 1603711
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 657
Number of HSP's successfully gapped in prelim test: 2058
Number of HSP's that attempted gapping in prelim test: 1557584
Number of HSP's gapped (non-prelim): 40698
length of query: 225
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 88
effective length of database: 9,144,741,214
effective search space: 804737226832
effective search space used: 804737226832
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 74 (33.1 bits)