BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy10308
(225 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|340727004|ref|XP_003401841.1| PREDICTED: hypothetical protein LOC100648841 [Bombus terrestris]
Length = 1992
Score = 211 bits (538), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 130/229 (56%), Positives = 152/229 (66%), Gaps = 16/229 (6%)
Query: 3 SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWG-TSQPQGGWSGTWV 59
+++LWG P K RGPPPG+ SNGW G ++WG S W TW+
Sbjct: 1774 TSELWGAPMSKVRGPPPGLSSKTTGNTSNGWAGF--GTVSRSSSWGFQSSTNAAWVSTWL 1831
Query: 60 LLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGN 119
LLKNLTPQIDGSTLKTLC+QHGP+Q+F LYLNH +AL KYS+R+EAIKAQG LNNC+LGN
Sbjct: 1832 LLKNLTPQIDGSTLKTLCMQHGPVQDFRLYLNHGIALTKYSSRDEAIKAQGALNNCVLGN 1891
Query: 120 TTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSAL-SNKDTWSSGGGGGNT 178
TTIFAE+P+D EV SLL LS G G R S+ DTW GG++
Sbjct: 1892 TTIFAESPADTEVHSLLQQLSHGGQQQAGATTGAGWGLRPSNKTGPPPDTW-----GGSS 1946
Query: 179 SQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGESM 225
SQLWG P PSS SLW + +DS D RATPSSLNS+LPGDLLGGESM
Sbjct: 1947 SQLWGAP--PSS-NSLWSSAGIDSNDQQRATPSSLNSYLPGDLLGGESM 1992
>gi|350414279|ref|XP_003490265.1| PREDICTED: hypothetical protein LOC100744615 [Bombus impatiens]
Length = 1991
Score = 211 bits (537), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 130/229 (56%), Positives = 152/229 (66%), Gaps = 16/229 (6%)
Query: 3 SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWG-TSQPQGGWSGTWV 59
+++LWG P K RGPPPG+ SNGW G ++WG S W TW+
Sbjct: 1773 TSELWGAPMSKVRGPPPGLSSKTTGNTSNGWAGF--GTVSRSSSWGFQSSTNAAWVSTWL 1830
Query: 60 LLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGN 119
LLKNLTPQIDGSTLKTLC+QHGP+Q+F LYLNH +AL KYS+R+EAIKAQG LNNC+LGN
Sbjct: 1831 LLKNLTPQIDGSTLKTLCMQHGPVQDFRLYLNHGIALTKYSSRDEAIKAQGALNNCVLGN 1890
Query: 120 TTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSAL-SNKDTWSSGGGGGNT 178
TTIFAE+P+D EV SLL LS G G R S+ DTW GG++
Sbjct: 1891 TTIFAESPADTEVHSLLQQLSHGGQQQAGATTGAGWGLRPSNKTGPPPDTW-----GGSS 1945
Query: 179 SQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGESM 225
SQLWG P PSS SLW + +DS D RATPSSLNS+LPGDLLGGESM
Sbjct: 1946 SQLWGAP--PSS-NSLWSSAGIDSNDQQRATPSSLNSYLPGDLLGGESM 1991
>gi|383860126|ref|XP_003705542.1| PREDICTED: protein Gawky-like [Megachile rotundata]
Length = 1832
Score = 205 bits (521), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 122/224 (54%), Positives = 145/224 (64%), Gaps = 13/224 (5%)
Query: 3 SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVL 60
+++LWG P K RGPPPG+ SNGW G + S GW TW+L
Sbjct: 1586 TSELWGAPMSKVRGPPPGLSSKATGNTSNGWAGLGTVGRSSSSWGLQSSTNAGWVSTWLL 1645
Query: 61 LKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNT 120
LKNLTPQIDGSTLKTLC+QHGP+Q+F LYLNH +AL KYS+R+EAIKAQG LNNC+LGNT
Sbjct: 1646 LKNLTPQIDGSTLKTLCMQHGPVQDFRLYLNHGIALTKYSSRDEAIKAQGALNNCVLGNT 1705
Query: 121 TIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSAL-SNKDTWSSGGGGGNTS 179
TIFAE+P+D+EV +LL LS G G R S+ DTW GG++S
Sbjct: 1706 TIFAESPADSEVHALLQQLSHGGQQQTGATTGAGWSLRPSNKTGPPPDTW-----GGSSS 1760
Query: 180 QLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLG 221
QLWG P S SLW + +DS D RATPSSLNS+LPGDLLG
Sbjct: 1761 QLWGA---PQSSNSLWSSTGIDSNDQQRATPSSLNSYLPGDLLG 1801
>gi|328783711|ref|XP_395115.4| PREDICTED: hypothetical protein LOC411646 [Apis mellifera]
Length = 1801
Score = 203 bits (516), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 126/225 (56%), Positives = 147/225 (65%), Gaps = 16/225 (7%)
Query: 3 SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWG-TSQPQGGWSGTWV 59
+++LWG P K RGPPPG+ SNGW G ++WG S W TW+
Sbjct: 1575 TSELWGAPMSKVRGPPPGLSSKATGNASNGWAGF--GTVSRSSSWGFQSSTNAAWVSTWL 1632
Query: 60 LLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGN 119
LLKNLTPQIDGSTLKTLC+QHGP+Q+F LYLNH +AL KYS+R+EAIKAQG LNNC+LGN
Sbjct: 1633 LLKNLTPQIDGSTLKTLCMQHGPVQDFRLYLNHGIALTKYSSRDEAIKAQGALNNCVLGN 1692
Query: 120 TTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSAL-SNKDTWSSGGGGGNT 178
TTIFAE+P+D EV SLL LS G G R S+ DTW GG++
Sbjct: 1693 TTIFAESPADTEVHSLLQQLSHGGQQQAGATTGAGWGLRPSNKTGPPPDTW-----GGSS 1747
Query: 179 SQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLG 221
SQLWG P PSS SLW +DS D RATPSSLNS+LPGDLLG
Sbjct: 1748 SQLWGAP--PSS-NSLWSNAGIDSNDQQRATPSSLNSYLPGDLLG 1789
>gi|380028808|ref|XP_003698078.1| PREDICTED: uncharacterized protein LOC100863913 [Apis florea]
Length = 1807
Score = 203 bits (516), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 127/229 (55%), Positives = 149/229 (65%), Gaps = 16/229 (6%)
Query: 3 SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWG-TSQPQGGWSGTWV 59
+++LWG P K RGPPPG+ SNGW G ++WG S W TW+
Sbjct: 1560 TSELWGAPMSKVRGPPPGLSSKATGNASNGWAGF--GTVSRSSSWGFQSSTNAAWVSTWL 1617
Query: 60 LLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGN 119
LLKNLTPQIDGSTLKTLC+QHGP+Q+F LYLNH +AL KYS+R+EAIKAQG LNNC+LGN
Sbjct: 1618 LLKNLTPQIDGSTLKTLCMQHGPVQDFRLYLNHGIALTKYSSRDEAIKAQGALNNCVLGN 1677
Query: 120 TTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSAL-SNKDTWSSGGGGGNT 178
TTIFAE+P+D EV SLL LS G G R S+ DTW GG++
Sbjct: 1678 TTIFAESPADTEVHSLLQQLSHGGQQQAGATTGAGWGLRPSNKTGPPPDTW-----GGSS 1732
Query: 179 SQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGESM 225
SQLWG P PSS SLW +DS D RATPSSLNS+LPGDLLG S+
Sbjct: 1733 SQLWGAP--PSS-NSLWSNAGIDSNDQQRATPSSLNSYLPGDLLGDGSV 1778
>gi|307198673|gb|EFN79509.1| Trinucleotide repeat-containing gene 6C protein [Harpegnathos
saltator]
Length = 2031
Score = 196 bits (499), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 123/230 (53%), Positives = 151/230 (65%), Gaps = 15/230 (6%)
Query: 3 SNDLWGPP--KPRGPPPGMMGGGGKPPSNGW--MVRPNGGGGGGNTWGTSQPQGGWSGTW 58
+++LWG P K RGPPPG+ G SNGW + + ++ GW TW
Sbjct: 1810 TSELWGAPMSKARGPPPGLGSKGATNTSNGWAGLGSVSRSSSSWGLQSSTVSNSGWMSTW 1869
Query: 59 VLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILG 118
+LLKNLTPQIDGSTLKTLC QHGP+Q+F LY NH +AL KYSTR+EAIKAQG LNNC+LG
Sbjct: 1870 LLLKNLTPQIDGSTLKTLCAQHGPVQDFRLYQNHGIALTKYSTRDEAIKAQGALNNCVLG 1929
Query: 119 NTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGS-SALSNKDTWSSGGGGGN 177
NTTIFAE+P+++EV ++L L +GG G R + A DTW GG+
Sbjct: 1930 NTTIFAESPAESEVHTILQQLGHGGQQQAGGSGGAGWGLRPTNKAGPPPDTW-----GGS 1984
Query: 178 TSQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGESM 225
+SQLWG P P+S SLW +D+ D RATPSSLNS+LPGDLLGGESM
Sbjct: 1985 SSQLWGVP--PTS-NSLWSNAGIDNSDQQRATPSSLNSYLPGDLLGGESM 2031
>gi|332026373|gb|EGI66502.1| Trinucleotide repeat-containing gene 6A protein [Acromyrmex
echinatior]
Length = 1888
Score = 195 bits (496), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 125/236 (52%), Positives = 154/236 (65%), Gaps = 26/236 (11%)
Query: 3 SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGT--------SQPQG 52
+++LWG P K RGPPPG+ G SNGW G G G + + +
Sbjct: 1666 TSELWGAPMSKARGPPPGLSSKGATNASNGW-----GAGLGSVSRSSSSWGLQSSTVSNS 1720
Query: 53 GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNL 112
GW TW+LLKNLTPQIDGSTLKTLC+QHGP+Q+F LY NH +AL KYS+R+EAIKAQG L
Sbjct: 1721 GWMSTWLLLKNLTPQIDGSTLKTLCMQHGPVQDFRLYQNHGIALTKYSSRDEAIKAQGAL 1780
Query: 113 NNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGS-SALSNKDTWSS 171
NNC+LGNTTIFAE+P+++EV ++L L +GG G R + A DTW
Sbjct: 1781 NNCVLGNTTIFAESPAESEVAAILQQLGHGGQQQAGGSGGAGWGLRPTNKAGPPPDTW-- 1838
Query: 172 GGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGESM 225
GG++SQLWG P P+S SLW +D+ D RATPSSLNS+LPGDLLGGESM
Sbjct: 1839 ---GGSSSQLWGAP--PTS-NSLWSNAGIDNSDQQRATPSSLNSYLPGDLLGGESM 1888
>gi|307185285|gb|EFN71385.1| Trinucleotide repeat-containing gene 6A protein [Camponotus
floridanus]
Length = 2022
Score = 192 bits (488), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 125/236 (52%), Positives = 154/236 (65%), Gaps = 26/236 (11%)
Query: 3 SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQG-------- 52
+++LWG P K RGPPPG+ G SNGW G G G + +S
Sbjct: 1800 TSELWGAPMSKARGPPPGLGSKGATNASNGW-----GAGLGTVSRSSSSWGLQSSSVSNT 1854
Query: 53 GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNL 112
GW TW+LL+NLTPQIDGSTLKTLC+QHGP+Q+F LY NH +AL KYS+R+EAIKAQG L
Sbjct: 1855 GWMSTWLLLRNLTPQIDGSTLKTLCMQHGPVQDFRLYQNHGIALTKYSSRDEAIKAQGAL 1914
Query: 113 NNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGS-SALSNKDTWSS 171
NNC+LGNTTIFAE+P ++EV ++L L ++GG G R + A DTW
Sbjct: 1915 NNCVLGNTTIFAESPGESEVHTILQQLGHGGQQQAGSSGGAGWGLRPTNKAGPPPDTW-- 1972
Query: 172 GGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGESM 225
GG++SQLWG P P+S SLW +D+ D RATPSSLNS+LPGDLLGGESM
Sbjct: 1973 ---GGSSSQLWGAP--PTS-NSLWSNAGIDNSDQQRATPSSLNSYLPGDLLGGESM 2022
>gi|322794818|gb|EFZ17765.1| hypothetical protein SINV_10484 [Solenopsis invicta]
Length = 2013
Score = 189 bits (481), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 122/234 (52%), Positives = 152/234 (64%), Gaps = 26/234 (11%)
Query: 3 SNDLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGT--------SQPQG 52
+++LWG P K RGPPPG+ G SNGW G G G + + +
Sbjct: 1771 TSELWGAPMGKARGPPPGLSTKGATNASNGW-----GAGLGSVSRSSSSWGLQSSTVSNS 1825
Query: 53 GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNL 112
GW TW+LLKNLTPQIDGSTLKTLC+QHGP+Q+F LY NH +AL KYS+R+EAIKAQG L
Sbjct: 1826 GWMSTWLLLKNLTPQIDGSTLKTLCMQHGPVQDFRLYQNHGIALTKYSSRDEAIKAQGAL 1885
Query: 113 NNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGS-SALSNKDTWSS 171
NNC+LGNTTIFAE+P+++EV ++L L +GG G R + A DTW
Sbjct: 1886 NNCVLGNTTIFAESPAESEVAAILQQLGHGGQQQAGGSGGAGWGLRPTNKAGPPPDTW-- 1943
Query: 172 GGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGE 223
GG++SQLWG P P+S SLW +D+ D RATPSSLNS+LPGDLLGG+
Sbjct: 1944 ---GGSSSQLWGAP--PTS-NSLWSNAGIDNSDQQRATPSSLNSYLPGDLLGGD 1991
>gi|189240445|ref|XP_973043.2| PREDICTED: similar to gawky CG31992-PA [Tribolium castaneum]
Length = 1014
Score = 188 bits (478), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 121/229 (52%), Positives = 148/229 (64%), Gaps = 35/229 (15%)
Query: 3 SNDLWGPPKPRGPPPGMMGGGGKPPSNGWMVRPNGGGG--GGNTWGTSQPQGGWSGTWVL 60
+++LW PK RGPPPG+ GG NGW + GGG G +WG S W+L
Sbjct: 815 TSELWAAPKSRGPPPGLSAKGGAL-VNGWSSAASWGGGQRGSGSWGGS--------PWLL 865
Query: 61 LKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNT 120
L+NLT QIDGSTL+TLC+QHGPLQ+FHLYL+ ALAKYSTREEA KAQ LNNC+LGNT
Sbjct: 866 LRNLTAQIDGSTLRTLCMQHGPLQSFHLYLHQGFALAKYSTREEATKAQTALNNCVLGNT 925
Query: 121 TIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSAL--SNKDTWSSGGGGGNT 178
TI AE PS+ + +LL +++ ++ +G W RGS+ + DTWS+G
Sbjct: 926 TILAENPSEWDANALLQQVASQQSS-------SGAW-RGSTKQPSTGSDTWSTG------ 971
Query: 179 SQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSFLPGDLLGGESM 225
W SN S SLWG+ LD+ D RATPSSLNSFLPGDLLGGESM
Sbjct: 972 ---W---SNSQSSASLWGSTTLDTTDPARATPSSLNSFLPGDLLGGESM 1014
>gi|403182875|gb|EAT40858.2| AAEL007447-PA [Aedes aegypti]
Length = 1541
Score = 169 bits (428), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 111/230 (48%), Positives = 136/230 (59%), Gaps = 29/230 (12%)
Query: 5 DLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLK 62
DLW P K R PPG+ GGK SNGW G G G G + WS TW+LLK
Sbjct: 1328 DLWDNPLGKSRVGPPGLKTAGGKLDSNGWSSHSAGSGAAGWNSGAAT----WSSTWILLK 1383
Query: 63 NLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTI 122
NL+ QIDG TL+TLC+QHGPL FHLYLNH +AL KYSTREEA KAQ LNNC+LG+TTI
Sbjct: 1384 NLSAQIDGPTLRTLCIQHGPLLAFHLYLNHGIALCKYSTREEANKAQMALNNCMLGSTTI 1443
Query: 123 FAEAPSDAEVQSLLAHLSATANNNNNNNGGTGG--WARGSSALSNK------DTWSSGGG 174
AE P++++VQ++L HL N +GG W G++A S D W S
Sbjct: 1444 CAETPTESDVQNILQHLGPPNGTNGLTGSQSGGQNWRLGAAAQSQSVRTPAADAWGSA-- 1501
Query: 175 GGNTSQLWGTPSNPSSGGSLWGAPPLD-SVDRATPSSLNSFLPGDLLGGE 223
W T +G +LWG PL+ DRATP++LNS+LP LLG +
Sbjct: 1502 -------WPT---TGAGSNLWG--PLEGPSDRATPANLNSYLPESLLGTD 1539
>gi|312384473|gb|EFR29197.1| hypothetical protein AND_02094 [Anopheles darlingi]
Length = 698
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 86/147 (58%), Positives = 101/147 (68%), Gaps = 14/147 (9%)
Query: 5 DLWG------PPKPRGPPPGMMGG------GGKPPSNGWMVRPNGGGGGGNTWGTSQPQG 52
D+W P PRGPPPG+ G GG +NGW+ RP+ G W G
Sbjct: 448 DVWAGGSSGVPKTPRGPPPGLSSGKPAGTPGGPTGTNGWIQRPSHSSAG--NWSAGGATG 505
Query: 53 GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNL 112
W TW+LLKNLT QIDGSTL+TLC+QHGPLQNFHLYLNH +AL KY TREEA KAQ L
Sbjct: 506 AWYSTWLLLKNLTAQIDGSTLRTLCMQHGPLQNFHLYLNHGIALCKYLTREEASKAQLAL 565
Query: 113 NNCILGNTTIFAEAPSDAEVQSLLAHL 139
NNC+LGNTTI AE+P+D+EVQ++L HL
Sbjct: 566 NNCVLGNTTICAESPTDSEVQAILQHL 592
>gi|270012524|gb|EFA08972.1| hypothetical protein TcasGA2_TC006679 [Tribolium castaneum]
Length = 1344
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 110/218 (50%), Positives = 135/218 (61%), Gaps = 35/218 (16%)
Query: 3 SNDLWGPPKPRGPPPGMMGGGGKPPSNGWMVRPNGGGG--GGNTWGTSQPQGGWSGTWVL 60
+++LW PK RGPPPG+ GG NGW + GGG G +WG S W+L
Sbjct: 1132 TSELWAAPKSRGPPPGLSAKGGAL-VNGWSSAASWGGGQRGSGSWGGS--------PWLL 1182
Query: 61 LKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNT 120
L+NLT QIDGSTL+TLC+QHGPLQ+FHLYL+ ALAKYSTREEA KAQ LNNC+LGNT
Sbjct: 1183 LRNLTAQIDGSTLRTLCMQHGPLQSFHLYLHQGFALAKYSTREEATKAQTALNNCVLGNT 1242
Query: 121 TIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSAL--SNKDTWSSGGGGGNT 178
TI AE PS+ + +LL +++ +G W RGS+ + DTWS+G
Sbjct: 1243 TILAENPSEWDANALLQQVASQQ-------SSSGAW-RGSTKQPSTGSDTWSTG------ 1288
Query: 179 SQLWGTPSNPSSGGSLWGAPPLDSVD--RATPSSLNSF 214
W SN S SLWG+ LD+ D RATPSSLNSF
Sbjct: 1289 ---W---SNSQSSASLWGSTTLDTTDPARATPSSLNSF 1320
>gi|157116102|ref|XP_001652769.1| hypothetical protein AaeL_AAEL007447 [Aedes aegypti]
Length = 1270
Score = 167 bits (424), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 111/230 (48%), Positives = 136/230 (59%), Gaps = 29/230 (12%)
Query: 5 DLWGPP--KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLK 62
DLW P K R PPG+ GGK SNGW G G G G + WS TW+LLK
Sbjct: 1057 DLWDNPLGKSRVGPPGLKTAGGKLDSNGWSSHSAGSGAAGWNSGAAT----WSSTWILLK 1112
Query: 63 NLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTI 122
NL+ QIDG TL+TLC+QHGPL FHLYLNH +AL KYSTREEA KAQ LNNC+LG+TTI
Sbjct: 1113 NLSAQIDGPTLRTLCIQHGPLLAFHLYLNHGIALCKYSTREEANKAQMALNNCMLGSTTI 1172
Query: 123 FAEAPSDAEVQSLLAHLSATANNNNNNNGGTGG--WARGSSALSNK------DTWSSGGG 174
AE P++++VQ++L HL N +GG W G++A S D W S
Sbjct: 1173 CAETPTESDVQNILQHLGPPNGTNGLTGSQSGGQNWRLGAAAQSQSVRTPAADAWGSA-- 1230
Query: 175 GGNTSQLWGTPSNPSSGGSLWGAPPLD-SVDRATPSSLNSFLPGDLLGGE 223
W T +G +LWG PL+ DRATP++LNS+LP LLG +
Sbjct: 1231 -------WPT---TGAGSNLWG--PLEGPSDRATPANLNSYLPESLLGTD 1268
>gi|357617904|gb|EHJ71059.1| hypothetical protein KGM_13480 [Danaus plexippus]
Length = 1088
Score = 162 bits (411), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 101/215 (46%), Positives = 117/215 (54%), Gaps = 40/215 (18%)
Query: 16 PPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGT----SQPQGGW-SGTWVLLKNLTPQIDG 70
PP GG KP + W +P G N W S+ W + TW+LL+NLT QIDG
Sbjct: 909 PPATSAGGLKP-LDVWGAKPRPAPPGLNKWPQHHVNSRAAPSWQTSTWLLLRNLTAQIDG 967
Query: 71 STLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDA 130
STLKTLCVQHGPLQNFHLYLN LALA+YSTREEA KAQ LNNC+L NTTIFAE+P+++
Sbjct: 968 STLKTLCVQHGPLQNFHLYLNQGLALARYSTREEAAKAQMALNNCVLSNTTIFAESPAES 1027
Query: 131 EVQSLLAHLSATANNNNNNNGGTGGWARGSSALSNKDTWSSGGGGGNTSQLWGTPSNPSS 190
+VQ +L HL + GW G LW
Sbjct: 1028 DVQLILQHLGSGGGGAWRGGASKDGW------------------NGAFPGLWQE------ 1063
Query: 191 GGSLWGAPPLDSVDRATPSSLNSFLPGDLLGGESM 225
RATPSSLNSFLP DLLGGES+
Sbjct: 1064 ----------QHEQRATPSSLNSFLPPDLLGGESI 1088
>gi|157116104|ref|XP_001652770.1| hypothetical protein AaeL_AAEL007449 [Aedes aegypti]
gi|108876634|gb|EAT40859.1| AAEL007449-PA [Aedes aegypti]
Length = 1501
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/240 (47%), Positives = 139/240 (57%), Gaps = 42/240 (17%)
Query: 5 DLWGPP--KP-RGPPPGMMGGGGKPPS---NGWM----VRPNGGGGGGNTWGTSQPQGGW 54
DLWG P KP RGPPPG+ G K S NGW V+ +G GG W + GW
Sbjct: 1281 DLWGAPVGKPTRGPPPGL--GANKNVSSAPNGWPGSSGVQRSGSGG---NWPS-----GW 1330
Query: 55 SGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNN 114
+W+LLKNLTPQID +TL+TLC+QHGPLQN LY NH LAL KYS+REEA KAQ LNN
Sbjct: 1331 GSSWLLLKNLTPQIDVATLRTLCMQHGPLQNLQLYANHGLALIKYSSREEANKAQQALNN 1390
Query: 115 CILGNTTIFAEAPSDAEVQSLLAHLSATANNNN--------NNNGGTGGWA---RGSSAL 163
C LG++TI AE PSD EVQ+ L L A + N++GG A R +
Sbjct: 1391 CPLGSSTIGAECPSDTEVQAYLQQLGTQAGSITSNAMVAPPNSSGGVTSVAQSWRQAPRT 1450
Query: 164 SNKDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSFLPGDLLGGE 223
DTW SG W P + GS++ AP + +R+TPS+LNSFLP LLG E
Sbjct: 1451 GGSDTWGSG---------W--PPTSTGTGSMFWAPIEGATERSTPSNLNSFLPESLLGSE 1499
>gi|170049180|ref|XP_001854407.1| gawky [Culex quinquefasciatus]
gi|167871061|gb|EDS34444.1| gawky [Culex quinquefasciatus]
Length = 1406
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 104/185 (56%), Positives = 124/185 (67%), Gaps = 18/185 (9%)
Query: 54 WSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLN 113
W TWVLLKNLT QIDGSTL+TLC+QHGP+QNFHLYLNH +AL KY +REEA KAQ LN
Sbjct: 1223 WYSTWVLLKNLTAQIDGSTLRTLCMQHGPVQNFHLYLNHGIALCKYLSREEANKAQQALN 1282
Query: 114 NCILGNTTIFAEAPSDAEVQSLLAHLSAT--ANNNNNNNGGTGGWARGSSAL------SN 165
NC+LGNTTI AE+P +EVQ++L HL ANNNN NN G GSS L +N
Sbjct: 1283 NCVLGNTTICAESPLASEVQTILQHLGIPGGANNNNINNNNNGNINVGSSGLGNNNNNNN 1342
Query: 166 KDTWSSGGGGG----NTSQLWGT--PSNPSSGGSLWGAPPLD-SVDRATPSSLNSFLPGD 218
W S G + + WG+ PS+ +G +LW PLD +R TPS+LNSFLP +
Sbjct: 1343 AQPWRSSGSQQANIRSAADTWGSGWPSS-GAGANLWT--PLDGPTERGTPSNLNSFLPEN 1399
Query: 219 LLGGE 223
LLGGE
Sbjct: 1400 LLGGE 1404
>gi|321479469|gb|EFX90425.1| hypothetical protein DAPPUDRAFT_300013 [Daphnia pulex]
Length = 1645
Score = 150 bits (378), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 113/271 (41%), Positives = 137/271 (50%), Gaps = 56/271 (20%)
Query: 3 SNDLWGPP-KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQP---------QG 52
S D W P KPRGPPPG+ G V P G + G+ P G
Sbjct: 1383 SADPWSAPNKPRGPPPGITPSAG--------VGPKTAGRDWSAAGSRSPWPGTNTTGSGG 1434
Query: 53 GWSGT------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAI 106
W+G+ W++L+NLTPQIDGSTLKTLCVQHGPL NFHLYLNH +AL +YST EEA
Sbjct: 1435 TWAGSLNGSSSWLVLRNLTPQIDGSTLKTLCVQHGPLHNFHLYLNHGVALIRYSTGEEAA 1494
Query: 107 KAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNG-----------GTGG 155
KAQ LNNC+LGNTTI+A+ ++++VQ L L A + G
Sbjct: 1495 KAQSALNNCVLGNTTIYADLANESDVQGWLQQLGMPAQQQQQQQQQQNQQQQQQAVSSSG 1554
Query: 156 W-ARGSS---------ALSNKDTWSSGGGGGNTSQLWGTPSNPSSGG-------SLWGAP 198
W RGS+ ++ G N WG+ + S S+W P
Sbjct: 1555 WGVRGSTPGAGSNSGSGNGGGVGSNASKGSANAGDNWGSGAGGPSSSPWSTGPNSVWSTP 1614
Query: 199 PLDSVDRATPS----SLNSFLPGDLLGGESM 225
LD R TPS SLNSFLPGDLLG ESM
Sbjct: 1615 NLDRDLRTTPSSLNASLNSFLPGDLLGNESM 1645
>gi|157116106|ref|XP_001652771.1| hypothetical protein AaeL_AAEL007436 [Aedes aegypti]
gi|108876635|gb|EAT40860.1| AAEL007436-PA, partial [Aedes aegypti]
Length = 1086
Score = 149 bits (377), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 83/132 (62%), Positives = 96/132 (72%), Gaps = 6/132 (4%)
Query: 12 PRGPPPGMMGGGGKPP----SNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKNLTPQ 67
PRGPPPG+ GK P SNGW RP GG T G GW TW+LLKNLT Q
Sbjct: 928 PRGPPPGL--SAGKNPGGFGSNGWNQRPGPGGNNWPTGGGGGGGPGWYSTWILLKNLTTQ 985
Query: 68 IDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAP 127
IDG TL+TLC+QHGPLQNFHLYLNH +AL KY +REEA KAQ LNNC+LGNTTI AE+P
Sbjct: 986 IDGPTLRTLCMQHGPLQNFHLYLNHGIALCKYQSREEANKAQQALNNCVLGNTTICAESP 1045
Query: 128 SDAEVQSLLAHL 139
+++EVQ++L HL
Sbjct: 1046 TESEVQTILQHL 1057
>gi|158297477|ref|XP_317704.4| AGAP007802-PA [Anopheles gambiae str. PEST]
gi|157015214|gb|EAA12443.4| AGAP007802-PA [Anopheles gambiae str. PEST]
Length = 1218
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/136 (59%), Positives = 95/136 (69%), Gaps = 8/136 (5%)
Query: 12 PRGPPPGMMGGGGKPPSNG--------WMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKN 63
PRGPPPG+ G + WM R + G GG + G G W TW+LLKN
Sbjct: 1046 PRGPPPGLSSGKVSGSGSVGGGPGSNGWMPRTSHGQGGNWSAGGGGASGSWYSTWLLLKN 1105
Query: 64 LTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIF 123
LT QIDGSTL+TLC+QHGPLQNFHLYLNH +AL KY TREEA KAQ LNNC+LGNTTI
Sbjct: 1106 LTAQIDGSTLRTLCMQHGPLQNFHLYLNHGIALCKYLTREEANKAQLALNNCVLGNTTIC 1165
Query: 124 AEAPSDAEVQSLLAHL 139
AE+P+D+EVQ++L HL
Sbjct: 1166 AESPTDSEVQTILQHL 1181
>gi|442614446|ref|NP_001014691.2| gawky, isoform J [Drosophila melanogaster]
gi|440218154|gb|AAX52511.2| gawky, isoform J [Drosophila melanogaster]
Length = 1382
Score = 144 bits (362), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 81/167 (48%), Positives = 101/167 (60%), Gaps = 36/167 (21%)
Query: 2 SSNDLWGPP----KPRGPPPGMMGGGGKPP-------------SNGWMVRPNGGG----- 39
++++LW P RGPPPG+ K +NGW+ +P GG
Sbjct: 1044 ATSELWTSPLNKSSSRGPPPGLTANSNKSANSNASTPTTITGGANGWL-QPRSGGVQTTN 1102
Query: 40 ----GGGNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLA 95
GG TWG+S W+LLKNLT QIDG TL+TLC+QHGPL +FH YLN +A
Sbjct: 1103 TNWTGGNTTWGSS---------WLLLKNLTAQIDGPTLRTLCMQHGPLVSFHPYLNQGIA 1153
Query: 96 LAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSAT 142
L KY+TREEA KAQ LNNC+L NTTIFAE+PS+ EVQS++ HL T
Sbjct: 1154 LCKYTTREEANKAQMALNNCVLANTTIFAESPSENEVQSIMQHLPQT 1200
>gi|24638679|ref|NP_726596.1| gawky, isoform A [Drosophila melanogaster]
gi|24638681|ref|NP_726597.1| gawky, isoform B [Drosophila melanogaster]
gi|24638687|ref|NP_726600.1| gawky, isoform E [Drosophila melanogaster]
gi|24638689|ref|NP_726601.1| gawky, isoform F [Drosophila melanogaster]
gi|75017682|sp|Q8SY33.1|GAWKY_DROME RecName: Full=Protein Gawky
gi|18447359|gb|AAL68245.1| LD47780p [Drosophila melanogaster]
gi|22759367|gb|AAF59323.2| gawky, isoform A [Drosophila melanogaster]
gi|22759368|gb|AAF59322.2| gawky, isoform B [Drosophila melanogaster]
gi|22759371|gb|AAN06508.1| gawky, isoform E [Drosophila melanogaster]
gi|22759372|gb|AAN06509.1| gawky, isoform F [Drosophila melanogaster]
Length = 1384
Score = 144 bits (362), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 81/167 (48%), Positives = 101/167 (60%), Gaps = 36/167 (21%)
Query: 2 SSNDLWGPP----KPRGPPPGMMGGGGKPP-------------SNGWMVRPNGGG----- 39
++++LW P RGPPPG+ K +NGW+ +P GG
Sbjct: 1046 ATSELWTSPLNKSSSRGPPPGLTANSNKSANSNASTPTTITGGANGWL-QPRSGGVQTTN 1104
Query: 40 ----GGGNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLA 95
GG TWG+S W+LLKNLT QIDG TL+TLC+QHGPL +FH YLN +A
Sbjct: 1105 TNWTGGNTTWGSS---------WLLLKNLTAQIDGPTLRTLCMQHGPLVSFHPYLNQGIA 1155
Query: 96 LAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSAT 142
L KY+TREEA KAQ LNNC+L NTTIFAE+PS+ EVQS++ HL T
Sbjct: 1156 LCKYTTREEANKAQMALNNCVLANTTIFAESPSENEVQSIMQHLPQT 1202
>gi|442614444|ref|NP_726599.2| gawky, isoform I [Drosophila melanogaster]
gi|440218153|gb|AAN06507.2| gawky, isoform I [Drosophila melanogaster]
Length = 1381
Score = 144 bits (362), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 81/167 (48%), Positives = 101/167 (60%), Gaps = 36/167 (21%)
Query: 2 SSNDLWGPP----KPRGPPPGMMGGGGKPP-------------SNGWMVRPNGGG----- 39
++++LW P RGPPPG+ K +NGW+ +P GG
Sbjct: 1043 ATSELWTSPLNKSSSRGPPPGLTANSNKSANSNASTPTTITGGANGWL-QPRSGGVQTTN 1101
Query: 40 ----GGGNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLA 95
GG TWG+S W+LLKNLT QIDG TL+TLC+QHGPL +FH YLN +A
Sbjct: 1102 TNWTGGNTTWGSS---------WLLLKNLTAQIDGPTLRTLCMQHGPLVSFHPYLNQGIA 1152
Query: 96 LAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSAT 142
L KY+TREEA KAQ LNNC+L NTTIFAE+PS+ EVQS++ HL T
Sbjct: 1153 LCKYTTREEANKAQMALNNCVLANTTIFAESPSENEVQSIMQHLPQT 1199
>gi|195354411|ref|XP_002043691.1| GM26772 [Drosophila sechellia]
gi|194128879|gb|EDW50922.1| GM26772 [Drosophila sechellia]
Length = 1385
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 80/155 (51%), Positives = 99/155 (63%), Gaps = 18/155 (11%)
Query: 2 SSNDLWGPP----KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTW-----GTSQPQ- 51
++++LW P RGPPPG+ K +N P GG N W G+ QP
Sbjct: 1047 ATSELWTSPLNKSSSRGPPPGLTANSNKS-ANCNTSTPTTITGGANGWLQPRSGSVQPTN 1105
Query: 52 ----GG---WSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREE 104
GG W +W+LLKNLT QIDG TL+TLC+QHGPL +FH YLN +AL KY+TREE
Sbjct: 1106 TNWTGGNTTWGSSWLLLKNLTAQIDGPTLRTLCMQHGPLVSFHPYLNQGIALCKYTTREE 1165
Query: 105 AIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHL 139
A KAQ LNNC+L NTTIFAE+PS+ EVQS++ HL
Sbjct: 1166 ANKAQMALNNCVLANTTIFAESPSETEVQSIMQHL 1200
>gi|195564306|ref|XP_002105762.1| GD24374 [Drosophila simulans]
gi|194201637|gb|EDX15213.1| GD24374 [Drosophila simulans]
Length = 1164
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 81/158 (51%), Positives = 100/158 (63%), Gaps = 18/158 (11%)
Query: 2 SSNDLWGPP----KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTW-----GTSQPQ- 51
++++LW P RGPPPG+ K +N P GG N W G+ QP
Sbjct: 986 ATSELWTSPLNKSSSRGPPPGLTANSNKS-ANSNTSTPTTITGGANGWLQPRSGSVQPTN 1044
Query: 52 ----GG---WSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREE 104
GG W +W+LLKNLT QIDG TL+TLC+QHGPL +FH YLN +AL KY+TREE
Sbjct: 1045 TNWTGGNTTWGSSWLLLKNLTAQIDGPTLRTLCMQHGPLVSFHPYLNQGIALCKYTTREE 1104
Query: 105 AIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSAT 142
A KAQ LNNC+L NTTIFAE+PS+ EVQS++ HL T
Sbjct: 1105 ANKAQMALNNCVLANTTIFAESPSETEVQSIMQHLPQT 1142
>gi|427794461|gb|JAA62682.1| Hypothetical protein, partial [Rhipicephalus pulchellus]
Length = 967
Score = 142 bits (359), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 103/262 (39%), Positives = 129/262 (49%), Gaps = 63/262 (24%)
Query: 5 DLWGP-PKPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKN 63
D W P PK RGPPPG+ S+GW + P GN ++++LKN
Sbjct: 728 DPWNPAPKIRGPPPGLS-------SSGWELHPVKQTSSGN------------NSFLVLKN 768
Query: 64 LTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIF 123
LTPQIDGSTLKTLC+QHGPLQ FHL+L H LALA+YST EEA KAQ L+NC+L NTT+
Sbjct: 769 LTPQIDGSTLKTLCMQHGPLQLFHLFLKHGLALAQYSTCEEASKAQSALHNCVLSNTTMV 828
Query: 124 AEAPSDAEVQSLLAHL------------------SATANNNNNNNGGTGGWARGSSALSN 165
A P++ EV L L + A N + +A +
Sbjct: 829 AYIPNEVEVAQFLQQLGNGLGQHPSSQQQQHQQQAWGAPTNAYHPPRPAQFAPSRPSKQP 888
Query: 166 KDTWSS---------GGGGGNTSQLWGTPSNPSSGGSLWGAPPL---------DSVDR-- 205
+ W++ NT+ LW S P + SLW AP + +D
Sbjct: 889 AEPWNTAAPPSVSSAAVSSSNTNHLW---SFPGAASSLWAAPQTSQAGGSSGSNQIDHDH 945
Query: 206 --ATPSSLNSFLPGDLLGGESM 225
SSLNSFLPGDLL GESM
Sbjct: 946 GGGPQSSLNSFLPGDLLNGESM 967
>gi|195172562|ref|XP_002027066.1| GL18179 [Drosophila persimilis]
gi|194112844|gb|EDW34887.1| GL18179 [Drosophila persimilis]
Length = 1226
Score = 142 bits (359), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 82/180 (45%), Positives = 108/180 (60%), Gaps = 22/180 (12%)
Query: 2 SSNDLWGPP----KPRGPPPGMMGGGGKP----------------PSNGWMVRPNGGGGG 41
S+++LW P RGPPPG+ K +NGW+ +G
Sbjct: 897 STSELWTSPLNKASSRGPPPGLTTNANKSGNGVSGVTSTSSTIAGSANGWLQTRSGVPTT 956
Query: 42 GNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYST 101
T + WS +W+LLKNLT QIDG TL+TLC+QHGPL +FHLYLN +AL KY+T
Sbjct: 957 NTT--WTGGNTSWSSSWLLLKNLTAQIDGPTLRTLCMQHGPLVSFHLYLNQGIALCKYTT 1014
Query: 102 REEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSS 161
REEA KAQ LNNC+LGNTTIFAE PS+ EVQ++L HL ++ N+ G + G + G++
Sbjct: 1015 REEASKAQMALNNCVLGNTTIFAETPSENEVQNILQHLPQVPSSTNSAIGSSVGSSVGTA 1074
>gi|427797345|gb|JAA64124.1| Putative trinucleotide repeat-containing protein, partial
[Rhipicephalus pulchellus]
Length = 804
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 103/262 (39%), Positives = 129/262 (49%), Gaps = 63/262 (24%)
Query: 5 DLWGP-PKPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKN 63
D W P PK RGPPPG+ S+GW + P GN ++++LKN
Sbjct: 565 DPWNPAPKIRGPPPGLS-------SSGWELHPVKQTSSGNN------------SFLVLKN 605
Query: 64 LTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIF 123
LTPQIDGSTLKTLC+QHGPLQ FHL+L H LALA+YST EEA KAQ L+NC+L NTT+
Sbjct: 606 LTPQIDGSTLKTLCMQHGPLQLFHLFLKHGLALAQYSTCEEASKAQSALHNCVLSNTTMV 665
Query: 124 AEAPSDAEVQSLLAHL------------------SATANNNNNNNGGTGGWARGSSALSN 165
A P++ EV L L + A N + +A +
Sbjct: 666 AYIPNEVEVAQFLQQLGNGLGQHPSSQQQQHQQQAWGAPTNAYHPPRPAQFAPSRPSKQP 725
Query: 166 KDTWSS---------GGGGGNTSQLWGTPSNPSSGGSLWGAPPL---------DSVDR-- 205
+ W++ NT+ LW S P + SLW AP + +D
Sbjct: 726 AEPWNTAAPPSVSSAAVSSSNTNHLW---SFPGAASSLWAAPQTSQAGGSSGSNQIDHDH 782
Query: 206 --ATPSSLNSFLPGDLLGGESM 225
SSLNSFLPGDLL GESM
Sbjct: 783 GGGPQSSLNSFLPGDLLNGESM 804
>gi|198462026|ref|XP_001352316.2| GA16600 [Drosophila pseudoobscura pseudoobscura]
gi|198140166|gb|EAL29242.2| GA16600 [Drosophila pseudoobscura pseudoobscura]
Length = 1396
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 82/180 (45%), Positives = 108/180 (60%), Gaps = 22/180 (12%)
Query: 2 SSNDLWGPP----KPRGPPPGMMGGGGKP----------------PSNGWMVRPNGGGGG 41
S+++LW P RGPPPG+ K +NGW+ +G
Sbjct: 1067 STSELWTSPLNKASSRGPPPGLTTNANKSGNGVSGVTSTSSTIAGSANGWLQTRSGVPTT 1126
Query: 42 GNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYST 101
T + WS +W+LLKNLT QIDG TL+TLC+QHGPL +FHLYLN +AL KY+T
Sbjct: 1127 NTT--WTGGNTSWSSSWLLLKNLTAQIDGPTLRTLCMQHGPLVSFHLYLNQGIALCKYTT 1184
Query: 102 REEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSS 161
REEA KAQ LNNC+LGNTTIFAE PS+ EVQ++L HL ++ N+ G + G + G++
Sbjct: 1185 REEASKAQMALNNCVLGNTTIFAETPSENEVQNILQHLPQVPSSTNSAIGSSVGSSVGTA 1244
>gi|194770632|ref|XP_001967395.1| GF19039 [Drosophila ananassae]
gi|190618126|gb|EDV33650.1| GF19039 [Drosophila ananassae]
Length = 1375
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 78/161 (48%), Positives = 100/161 (62%), Gaps = 28/161 (17%)
Query: 2 SSNDLWGPP----KPRGPPPGMMGGGGKPP---------------SNGWMVRPNGGGGGG 42
S+++LW P RGPPPG+ K +NGW+ GG
Sbjct: 1029 STSELWTSPLNKSSSRGPPPGLTASSNKSGNGGSTTSTSTAISGGANGWLQT-----RGG 1083
Query: 43 NTWGTSQPQGG----WSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAK 98
+ TS G WS +W+LLKNLT QIDGSTL+TLC+QHGPL +FHLYL+ +AL K
Sbjct: 1084 SVQATSTTWSGGNAPWSSSWLLLKNLTAQIDGSTLRTLCMQHGPLVSFHLYLSQGIALCK 1143
Query: 99 YSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHL 139
Y+TREEA KAQ LNNC+L NTTIFAE+P++ EVQ+++ HL
Sbjct: 1144 YATREEANKAQMALNNCVLANTTIFAESPNENEVQNIMQHL 1184
>gi|427793715|gb|JAA62309.1| Putative alpha-1 collagen type iii, partial [Rhipicephalus
pulchellus]
Length = 1160
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 103/262 (39%), Positives = 129/262 (49%), Gaps = 63/262 (24%)
Query: 5 DLWGP-PKPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKN 63
D W P PK RGPPPG+ S+GW + P GN ++++LKN
Sbjct: 921 DPWNPAPKIRGPPPGLS-------SSGWELHPVKQTSSGNN------------SFLVLKN 961
Query: 64 LTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIF 123
LTPQIDGSTLKTLC+QHGPLQ FHL+L H LALA+YST EEA KAQ L+NC+L NTT+
Sbjct: 962 LTPQIDGSTLKTLCMQHGPLQLFHLFLKHGLALAQYSTCEEASKAQSALHNCVLSNTTMV 1021
Query: 124 AEAPSDAEVQSLLAHL------------------SATANNNNNNNGGTGGWARGSSALSN 165
A P++ EV L L + A N + +A +
Sbjct: 1022 AYIPNEVEVAQFLQQLGNGLGQHPSSQQQQHQQQAWGAPTNAYHPPRPAQFAPSRPSKQP 1081
Query: 166 KDTWSS---------GGGGGNTSQLWGTPSNPSSGGSLWGAPPL---------DSVDR-- 205
+ W++ NT+ LW S P + SLW AP + +D
Sbjct: 1082 AEPWNTAAPPSVSSAAVSSSNTNHLW---SFPGAASSLWAAPQTSQAGGSSGSNQIDHDH 1138
Query: 206 --ATPSSLNSFLPGDLLGGESM 225
SSLNSFLPGDLL GESM
Sbjct: 1139 GGGPQSSLNSFLPGDLLNGESM 1160
>gi|410981860|ref|XP_003997284.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
[Felis catus]
Length = 1727
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 100/238 (42%), Positives = 131/238 (55%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1495 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1551
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1552 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1611
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWAR-----GSSALSNKDT--WS 170
GNTTI AE + EV LA A + + +G G AR GS L DT WS
Sbjct: 1612 GNTTILAEFAGEEEVNRFLAQGQAVPSTSGWQSGTGAGQARLGASGGSHGLVRSDTGHWS 1671
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
+ G G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1672 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1727
>gi|427788417|gb|JAA59660.1| Putative trinucleotide repeat-containing protein [Rhipicephalus
pulchellus]
Length = 1449
Score = 140 bits (353), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 103/262 (39%), Positives = 129/262 (49%), Gaps = 63/262 (24%)
Query: 5 DLWGP-PKPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKN 63
D W P PK RGPPPG+ S+GW + P GN ++++LKN
Sbjct: 1210 DPWNPAPKIRGPPPGLS-------SSGWELHPVKQTSSGNN------------SFLVLKN 1250
Query: 64 LTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIF 123
LTPQIDGSTLKTLC+QHGPLQ FHL+L H LALA+YST EEA KAQ L+NC+L NTT+
Sbjct: 1251 LTPQIDGSTLKTLCMQHGPLQLFHLFLKHGLALAQYSTCEEASKAQSALHNCVLSNTTMV 1310
Query: 124 AEAPSDAEVQSLLAHL------------------SATANNNNNNNGGTGGWARGSSALSN 165
A P++ EV L L + A N + +A +
Sbjct: 1311 AYIPNEVEVAQFLQQLGNGLGQHPSSQQQQHQQQAWGAPTNAYHPPRPAQFAPSRPSKQP 1370
Query: 166 KDTWSS---------GGGGGNTSQLWGTPSNPSSGGSLWGAPPL---------DSVDR-- 205
+ W++ NT+ LW S P + SLW AP + +D
Sbjct: 1371 AEPWNTAAPPSVSSAAVSSSNTNHLW---SFPGAASSLWAAPQTSQAGGSSGSNQIDHDH 1427
Query: 206 --ATPSSLNSFLPGDLLGGESM 225
SSLNSFLPGDLL GESM
Sbjct: 1428 GGGPQSSLNSFLPGDLLNGESM 1449
>gi|410981862|ref|XP_003997285.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
[Felis catus]
Length = 1691
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 100/238 (42%), Positives = 131/238 (55%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1459 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1515
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1516 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1575
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWAR-----GSSALSNKDT--WS 170
GNTTI AE + EV LA A + + +G G AR GS L DT WS
Sbjct: 1576 GNTTILAEFAGEEEVNRFLAQGQAVPSTSGWQSGTGAGQARLGASGGSHGLVRSDTGHWS 1635
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
+ G G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1636 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1691
>gi|427788411|gb|JAA59657.1| Putative trinucleotide repeat-containing protein [Rhipicephalus
pulchellus]
Length = 1471
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 103/262 (39%), Positives = 129/262 (49%), Gaps = 63/262 (24%)
Query: 5 DLWGP-PKPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKN 63
D W P PK RGPPPG+ S+GW + P GN ++++LKN
Sbjct: 1232 DPWNPAPKIRGPPPGLS-------SSGWELHPVKQTSSGNN------------SFLVLKN 1272
Query: 64 LTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIF 123
LTPQIDGSTLKTLC+QHGPLQ FHL+L H LALA+YST EEA KAQ L+NC+L NTT+
Sbjct: 1273 LTPQIDGSTLKTLCMQHGPLQLFHLFLKHGLALAQYSTCEEASKAQSALHNCVLSNTTMV 1332
Query: 124 AEAPSDAEVQSLLAHL------------------SATANNNNNNNGGTGGWARGSSALSN 165
A P++ EV L L + A N + +A +
Sbjct: 1333 AYIPNEVEVAQFLQQLGNGLGQHPSSQQQQHQQQAWGAPTNAYHPPRPAQFAPSRPSKQP 1392
Query: 166 KDTWSS---------GGGGGNTSQLWGTPSNPSSGGSLWGAPPL---------DSVDR-- 205
+ W++ NT+ LW S P + SLW AP + +D
Sbjct: 1393 AEPWNTAAPPSVSSAAVSSSNTNHLW---SFPGAASSLWAAPQTSQAGGSSGSNQIDHDH 1449
Query: 206 --ATPSSLNSFLPGDLLGGESM 225
SSLNSFLPGDLL GESM
Sbjct: 1450 GGGPQSSLNSFLPGDLLNGESM 1471
>gi|449283096|gb|EMC89799.1| Trinucleotide repeat-containing gene 6C protein [Columba livia]
Length = 1719
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 101/238 (42%), Positives = 134/238 (56%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1487 SHELWKVPRNTTAPTRPPPGLTN---TKPSSTWGASPLGWTSSYSSGSAWSTDSSGRTSS 1543
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1544 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1603
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTGGWARGSS----ALSNKDT--WS 170
GNTTI AE + EV LA A ++ +N G+G GSS AL DT W+
Sbjct: 1604 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSNSGSGQTRLGSSSSSHALVRSDTGHWN 1663
Query: 171 --SGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRA-TPSSLNSFLPGDLLGGESM 225
GG G++ LWG +P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1664 PPCLGGKGSSDLLWG--GDPQCSSSLWGPPSTDDGGVIGSPTPLNTLLPGDLLSGESI 1719
>gi|74210597|dbj|BAE23657.1| unnamed protein product [Mus musculus]
Length = 727
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 100/253 (39%), Positives = 138/253 (54%), Gaps = 35/253 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 480 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 537
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 538 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 597
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 598 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 657
Query: 161 SALSNKDTWSSGGGGG------NTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
S+ ++ + W+ G G + + LWGT P SLWG P D ++PS +N+F
Sbjct: 658 SSRTDVNHWNGAGLSGANCGDLHGTSLWGT---PHYSTSLWGPPSSDPRGISSPSPINAF 714
Query: 215 LPGDLL--GGESM 225
L D L GGESM
Sbjct: 715 LSVDHLGGGGESM 727
>gi|26336695|dbj|BAC32030.1| unnamed protein product [Mus musculus]
Length = 627
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 100/253 (39%), Positives = 138/253 (54%), Gaps = 35/253 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 380 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 437
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 438 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 497
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 498 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 557
Query: 161 SALSNKDTWSSGGGGG------NTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
S+ ++ + W+ G G + + LWGT P SLWG P D ++PS +N+F
Sbjct: 558 SSRTDVNHWNGAGLSGANCGDLHGTSLWGT---PHYSTSLWGPPSSDPRGISSPSPINAF 614
Query: 215 LPGDLL--GGESM 225
L D L GGESM
Sbjct: 615 LSVDHLGGGGESM 627
>gi|26341344|dbj|BAC34334.1| unnamed protein product [Mus musculus]
Length = 337
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 100/253 (39%), Positives = 139/253 (54%), Gaps = 35/253 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 90 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 147
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 148 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 207
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 208 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 267
Query: 161 SALSNKDTWSSGGGGG------NTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
S+ ++ + W+ G G + + LWGTP + SLWG P D ++PS +N+F
Sbjct: 268 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 324
Query: 215 LPGDLL--GGESM 225
L D L GGESM
Sbjct: 325 LSVDHLGGGGESM 337
>gi|195450735|ref|XP_002072610.1| GK13697 [Drosophila willistoni]
gi|194168695|gb|EDW83596.1| GK13697 [Drosophila willistoni]
Length = 1437
Score = 139 bits (351), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 62/89 (69%), Positives = 76/89 (85%)
Query: 54 WSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLN 113
WS +W+LLKNLT QIDG TL+TLC+QHGPL +FHLYLN +AL KY+TREE+ KAQ LN
Sbjct: 1166 WSSSWLLLKNLTAQIDGPTLRTLCMQHGPLVSFHLYLNQGIALCKYATREESNKAQMTLN 1225
Query: 114 NCILGNTTIFAEAPSDAEVQSLLAHLSAT 142
NC+LGNTTIFAE+P++AEVQ++L HL T
Sbjct: 1226 NCVLGNTTIFAESPNEAEVQNILQHLPQT 1254
>gi|344241798|gb|EGV97901.1| Trinucleotide repeat-containing gene 6C protein [Cricetulus
griseus]
Length = 802
Score = 139 bits (351), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 97/243 (39%), Positives = 129/243 (53%), Gaps = 30/243 (12%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 570 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDASGRTSS 626
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 627 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 686
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN------------NNNGGTGGWARGSSALSN 165
GNTTI AE + EV LA A ++ ++G T G R +A
Sbjct: 687 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQPSSGGSQPRLGSSGSTHGLVRSDTA--- 743
Query: 166 KDTWSSG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRA-TPSSLNSFLPGDLLGG 222
WS+ G G++ LWG P SLWG P D +P+ LN+ LPGDLL G
Sbjct: 744 --HWSTPCLSGKGSSELLWG--GVPQYSSSLWGPPSADDARVIGSPTPLNTLLPGDLLSG 799
Query: 223 ESM 225
ESM
Sbjct: 800 ESM 802
>gi|74218630|dbj|BAE25197.1| unnamed protein product [Mus musculus]
gi|74218632|dbj|BAE25198.1| unnamed protein product [Mus musculus]
Length = 500
Score = 139 bits (351), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 100/253 (39%), Positives = 138/253 (54%), Gaps = 35/253 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 253 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 310
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 311 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 370
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 371 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 430
Query: 161 SALSNKDTWSSGGGGG------NTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
S+ ++ + W+ G G + + LWGT P SLWG P D ++PS +N+F
Sbjct: 431 SSRTDVNHWNGAGLSGANCGDLHGTSLWGT---PHYSTSLWGPPSSDPRGISSPSPINAF 487
Query: 215 LPGDLL--GGESM 225
L D L GGESM
Sbjct: 488 LSVDHLGGGGESM 500
>gi|391334963|ref|XP_003741867.1| PREDICTED: uncharacterized protein LOC100898741 [Metaseiulus
occidentalis]
Length = 1067
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 98/214 (45%), Positives = 117/214 (54%), Gaps = 32/214 (14%)
Query: 39 GGGGNTWGTS-QPQGGWSGT----------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFH 87
G G WG S QP SG +++LKNLT QIDGSTLKTLC+QHGP+Q FH
Sbjct: 859 GSNGKKWGESDQPGSILSGLPGPVSPTGKGFLVLKNLTAQIDGSTLKTLCIQHGPVQLFH 918
Query: 88 LYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNN 147
L+LNH AL +Y TREEA+KA+ LNNC+L NTTI A PS+ EVQ LL + + N
Sbjct: 919 LFLNHGFALIQYMTREEALKAESALNNCVLSNTTILAYVPSEREVQQLLYLANYQSLNQG 978
Query: 148 NNN---------GGTGGWARGSSALSNKDTWSSG------GGGGNTSQLWGTPSNPSSG- 191
N +G+ L + SG G Q G PS +SG
Sbjct: 979 RPNPQQQQQQQQQQANHSQQGNPQLQGVNPQQSGPRLPPSGVLQQPQQNCGWPSAANSGA 1038
Query: 192 GSLWGAPPLDSVDRATPSSLNSFLPGDLLGGESM 225
G+LWG P D+ D A LNSFLPGDLL GESM
Sbjct: 1039 GALWGPP--DANDTA---PLNSFLPGDLLSGESM 1067
>gi|444727787|gb|ELW68265.1| Trinucleotide repeat-containing 6C protein [Tupaia chinensis]
Length = 922
Score = 139 bits (350), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 97/238 (40%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 690 SHELWKVPRNTTAPTRPPPGLTNPK---PSSTWGASPLGWTSSYSSGSAWSADTSGRTSS 746
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 747 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 806
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTGGWARGSSALSNKDTWSSG---- 172
GNTTI AE + EV LA A ++ ++GGT G+S S+ S
Sbjct: 807 GNTTILAEFAGEEEVNRFLAQGQALPTTSSWQSSGGTSQPRLGASGSSHGLVRSDAGHWN 866
Query: 173 ----GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
G GN+ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 867 APCLGAKGNSELLWG--GVPQYSSSLWGPPSADDGRVIGSPTPLNTLLPGDLLSGESI 922
>gi|195402239|ref|XP_002059714.1| GJ14351 [Drosophila virilis]
gi|194155928|gb|EDW71112.1| GJ14351 [Drosophila virilis]
Length = 1377
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/154 (51%), Positives = 99/154 (64%), Gaps = 19/154 (12%)
Query: 2 SSNDLWGPP----KPRGPPPGMM-----GGGGKPPS-------NGWMVRPNGGGGGGNTW 45
++++LW P RGPPPG+ G PS NGW+ PN NT
Sbjct: 1052 ATSELWTSPLNKSSSRGPPPGLSTNKSGGVTATTPSPTVAGNSNGWL--PNRSVPNTNTT 1109
Query: 46 GTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEA 105
T W+ +W+LLKNL QIDGSTL+TLC+QHGPL +FH YLN +AL KY+TREEA
Sbjct: 1110 WTGA-NAAWNSSWLLLKNLNAQIDGSTLRTLCMQHGPLVSFHPYLNQGIALCKYTTREEA 1168
Query: 106 IKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHL 139
KAQ LNNC+LGNTTIFAE+PS+ EVQ++L HL
Sbjct: 1169 NKAQMALNNCVLGNTTIFAESPSENEVQNILQHL 1202
>gi|355725495|gb|AES08575.1| trinucleotide repeat containing 6A [Mustela putorius furo]
Length = 486
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 100/252 (39%), Positives = 136/252 (53%), Gaps = 40/252 (15%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRPNGGGGG----------GNTW 45
+++LW PPK P PPPG+ G KPP + W P GGG G +W
Sbjct: 239 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGCSW 296
Query: 46 GTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEA 105
G S G W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE
Sbjct: 297 GESS--SGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEV 354
Query: 106 IKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWAR 158
+KAQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 355 VKAQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSH 414
Query: 159 GSSALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSS 210
S+ ++ + W+ G G + LWGT P SLWG PP S R ++PS
Sbjct: 415 SFSSRTDLNHWNGAGLSGTNCGDLHGTSLWGT---PHYSTSLWG-PPSSSDPRGISSPSP 470
Query: 211 LNSFLPGDLLGG 222
+N+FL D LGG
Sbjct: 471 INAFLSVDHLGG 482
>gi|449475873|ref|XP_002196372.2| PREDICTED: trinucleotide repeat-containing gene 6A protein
[Taeniopygia guttata]
Length = 1913
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 105/253 (41%), Positives = 141/253 (55%), Gaps = 35/253 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWS 55
+++LW PPK P PPPG+ G KPP + W GGG GN+ P W
Sbjct: 1666 AHELWKVPLPPKSITAPSRPPPGLTGQ--KPPLSTWDNSLRLGGGWGNSDARYTPGSSWG 1723
Query: 56 GT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKA 108
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +KA
Sbjct: 1724 ESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVKA 1783
Query: 109 QGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTG----GWARGSSALS 164
Q +L+ C+LGNTTI AE S+ E+ A + + + G+ G GS + S
Sbjct: 1784 QKSLHMCVLGNTTILAEFASEEEISRFFAQGQSLTPSPGWQSLGSSQSRLGSIDGSHSFS 1843
Query: 165 NKDT---WSSGGGGGNTS------QLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSF 214
N++ W+ G G +S LWG+P+ S SLWGAP D+ ++PS +N+F
Sbjct: 1844 NRNDLNHWNGAGLSGTSSGDLHGTSLWGSPNYSS---SLWGAPSSNDTRGISSPSPINAF 1900
Query: 215 LPGDLL--GGESM 225
L D L GGESM
Sbjct: 1901 LSVDHLGGGGESM 1913
>gi|263359644|gb|ACY70480.1| hypothetical protein DVIR88_6g0017 [Drosophila virilis]
Length = 1394
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/154 (51%), Positives = 99/154 (64%), Gaps = 19/154 (12%)
Query: 2 SSNDLWGPP----KPRGPPPGMM-----GGGGKPPS-------NGWMVRPNGGGGGGNTW 45
++++LW P RGPPPG+ G PS NGW+ PN NT
Sbjct: 1069 ATSELWTSPLNKSSSRGPPPGLSTNKSGGVTATTPSPTVAGNSNGWL--PNRSVPNTNTT 1126
Query: 46 GTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEA 105
T W+ +W+LLKNL QIDGSTL+TLC+QHGPL +FH YLN +AL KY+TREEA
Sbjct: 1127 WTGA-NAAWNSSWLLLKNLNAQIDGSTLRTLCMQHGPLVSFHPYLNQGIALCKYTTREEA 1185
Query: 106 IKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHL 139
KAQ LNNC+LGNTTIFAE+PS+ EVQ++L HL
Sbjct: 1186 NKAQMALNNCVLGNTTIFAESPSENEVQNILQHL 1219
>gi|444725719|gb|ELW66274.1| Trinucleotide repeat-containing 6A protein [Tupaia chinensis]
Length = 1894
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 103/257 (40%), Positives = 140/257 (54%), Gaps = 42/257 (16%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRPNGGGGG----------GNTW 45
+++LW PPK P PPPG+ G KPP + W P GGG G+ W
Sbjct: 1646 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSNW 1703
Query: 46 GTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEA 105
G S G W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE
Sbjct: 1704 GESS--SGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEV 1761
Query: 106 IKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWAR 158
+KAQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1762 VKAQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSH 1821
Query: 159 GSSALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSS 210
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS
Sbjct: 1822 SFSSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSP 1877
Query: 211 LNSFLPGDLL--GGESM 225
+N+FL D L GGESM
Sbjct: 1878 INAFLSVDHLGGGGESM 1894
>gi|148685346|gb|EDL17293.1| mCG20982, isoform CRA_d [Mus musculus]
Length = 1937
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 101/253 (39%), Positives = 139/253 (54%), Gaps = 35/253 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1690 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1747
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1748 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1807
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1808 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1867
Query: 161 SALSNKDTWSSGG-GGGNT-----SQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
S+ ++ + W+ G G N + LWGTP + SLWG P D ++PS +N+F
Sbjct: 1868 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 1924
Query: 215 LPGDLL--GGESM 225
L D L GGESM
Sbjct: 1925 LSVDHLGGGGESM 1937
>gi|117190552|ref|NP_659174.3| trinucleotide repeat-containing gene 6A protein [Mus musculus]
Length = 1896
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 101/253 (39%), Positives = 139/253 (54%), Gaps = 35/253 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1649 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1706
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1707 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1766
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1767 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1826
Query: 161 SALSNKDTWSSGG-GGGNT-----SQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
S+ ++ + W+ G G N + LWGTP + SLWG P D ++PS +N+F
Sbjct: 1827 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 1883
Query: 215 LPGDLL--GGESM 225
L D L GGESM
Sbjct: 1884 LSVDHLGGGGESM 1896
>gi|123791339|sp|Q3UHK8.1|TNR6A_MOUSE RecName: Full=Trinucleotide repeat-containing gene 6A protein
gi|74181174|dbj|BAE27849.1| unnamed protein product [Mus musculus]
Length = 1896
Score = 137 bits (346), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 101/253 (39%), Positives = 139/253 (54%), Gaps = 35/253 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1649 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1706
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1707 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1766
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1767 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1826
Query: 161 SALSNKDTWSSGG-GGGNT-----SQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
S+ ++ + W+ G G N + LWGTP + SLWG P D ++PS +N+F
Sbjct: 1827 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 1883
Query: 215 LPGDLL--GGESM 225
L D L GGESM
Sbjct: 1884 LSVDHLGGGGESM 1896
>gi|344240423|gb|EGV96526.1| Trinucleotide repeat-containing gene 6A protein [Cricetulus
griseus]
Length = 687
Score = 137 bits (346), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 138/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 439 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 496
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 497 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLQHGNALVRYSSKEEVVK 556
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + + + G+ +
Sbjct: 557 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGTSQSRLGSLDCSHSF 616
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGT P SLWG PP S R ++PS +N
Sbjct: 617 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGT---PHYSTSLWG-PPSSSDPRGISSPSPIN 672
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 673 AFLSVDHLGGGGESM 687
>gi|449278991|gb|EMC86719.1| Trinucleotide repeat-containing gene 6A protein [Columba livia]
Length = 1892
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/253 (41%), Positives = 141/253 (55%), Gaps = 35/253 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWS 55
+++LW PPK P PPPG+ G KPP + W GGG GN+ P W
Sbjct: 1645 AHELWKVPLPPKSITAPSRPPPGLTGQ--KPPLSTWDNSLRLGGGWGNSDARYTPGSSWG 1702
Query: 56 GT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKA 108
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +KA
Sbjct: 1703 ESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVKA 1762
Query: 109 QGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTG----GWARGSSALS 164
Q +L+ C+LGNTTI AE S+ E+ A + + + G+ G GS + S
Sbjct: 1763 QKSLHMCVLGNTTILAEFASEEEISRFFAQGQSLTPSPGWQSLGSSQNRLGSIDGSHSFS 1822
Query: 165 NKDT---WSSGGGGGNTS------QLWGTPSNPSSGGSLWGAP-PLDSVDRATPSSLNSF 214
N++ W+ G G +S LWG+P+ + SLWGAP D+ ++PS +N+F
Sbjct: 1823 NRNDLNHWNGAGLSGTSSGDLHGTSLWGSPNYST---SLWGAPSSSDTRGISSPSPINAF 1879
Query: 215 LPGDLL--GGESM 225
L D L GGESM
Sbjct: 1880 LSVDHLGGGGESM 1892
>gi|158297465|ref|XP_001689052.1| AGAP007808-PA [Anopheles gambiae str. PEST]
gi|157015208|gb|EDO63615.1| AGAP007808-PA [Anopheles gambiae str. PEST]
Length = 1216
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 73/132 (55%), Positives = 92/132 (69%), Gaps = 4/132 (3%)
Query: 25 KPPSNGWMVRPNGGGGGGNTWGT-SQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPL 83
K +NGW P+ GGG+TW + + WS TW++L+NLT QI+GSTL+TLC+QHGP+
Sbjct: 1067 KLDANGWNT-PSTQAGGGSTWNSGASAANTWSSTWIMLRNLTAQIEGSTLRTLCLQHGPV 1125
Query: 84 QNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATA 143
NFHLYLN +AL KY TREEA KAQ LNNC LGNTTI AE P+++E+Q +L H
Sbjct: 1126 VNFHLYLNQGIALCKYGTREEAQKAQLALNNCQLGNTTIIAEIPNESEIQYILPH--HVG 1183
Query: 144 NNNNNNNGGTGG 155
N+N NG T G
Sbjct: 1184 NSNGMTNGLTSG 1195
>gi|241646725|ref|XP_002409881.1| conserved hypothetical protein [Ixodes scapularis]
gi|215501453|gb|EEC10947.1| conserved hypothetical protein [Ixodes scapularis]
Length = 1089
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 108/280 (38%), Positives = 126/280 (45%), Gaps = 82/280 (29%)
Query: 1 MSSNDLWGPPKPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVL 60
M S+D WG PK RGPPPG+ + GW S T+++
Sbjct: 837 MPSSDPWGAPKTRGPPPGLSSSS----TQGWDQS--------------------SCTFLV 872
Query: 61 LKNLTPQ----------------------------------IDGSTLKTLCVQHGPLQNF 86
LKNLTPQ IDGSTLKTLC+QHGPLQ F
Sbjct: 873 LKNLTPQVGPSHVPFPSTLAAPLGYADCGIRGASWLAKQARIDGSTLKTLCMQHGPLQLF 932
Query: 87 HLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNN 146
HL+L H LALA+YS+REEA KAQ L+NCIL NTT+ A PS+AEV L T
Sbjct: 933 HLFLKHGLALAQYSSREEAAKAQSALHNCILSNTTMLAYIPSEAEVAQFLQLAQGTQQGP 992
Query: 147 NNNNGGTGG-------WARGSSALSNKDTWSSGGGGG-------NTSQLWGTPSNPSSGG 192
G GG + GS + + W+ S LW S P +GG
Sbjct: 993 PCWAPGGGGGGPSFHRFPYGSRPKAPEAPWNPASTAAPPTSSSSGASHLW---SFPGAGG 1049
Query: 193 SLWGAPPL-------DSVDRATPSSLNSFLPGDLLGGESM 225
LW AP D SSLNSFLPGDLL GESM
Sbjct: 1050 GLWAAPQAPQGPQGGDDHPGGQQSSLNSFLPGDLLSGESM 1089
>gi|194913515|ref|XP_001982714.1| GG16439 [Drosophila erecta]
gi|190647930|gb|EDV45233.1| GG16439 [Drosophila erecta]
Length = 1392
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 74/157 (47%), Positives = 96/157 (61%), Gaps = 19/157 (12%)
Query: 2 SSNDLWGPP----KPRGPPPGMMGGGGKPPS--NGWMVRPNGGGGGGNTWGTSQPQG--- 52
++++LW P RGPPPG+ K + N P GG N W ++ G
Sbjct: 1047 ATSELWTSPLNKSSSRGPPPGLTANSNKSGNGGNSCTSTPTTITGGANGWLQARSGGVPT 1106
Query: 53 ----------GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTR 102
W +W+LL+NLT QIDG TL+TLC+QHGPL +FH YLN +AL KY+TR
Sbjct: 1107 TNTTWTGGNTSWGSSWLLLRNLTAQIDGPTLRTLCMQHGPLVSFHPYLNQGIALCKYTTR 1166
Query: 103 EEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHL 139
EEA KAQ LNNC+L NTTIFAE+PS+ EVQ+++ HL
Sbjct: 1167 EEANKAQMALNNCVLANTTIFAESPSENEVQNIMQHL 1203
>gi|195064375|ref|XP_001996557.1| GH23931 [Drosophila grimshawi]
gi|193892103|gb|EDV90969.1| GH23931 [Drosophila grimshawi]
Length = 1432
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 80/154 (51%), Positives = 99/154 (64%), Gaps = 19/154 (12%)
Query: 2 SSNDLWGPP----KPRGPPPGM---MGGGGKPP---------SNGWMVRPNGGGGGGNTW 45
++++LW P RGPPPG+ GG P SNGW+ PN NT
Sbjct: 1077 ATSELWTSPLSKGSSRGPPPGLSTSKTGGVTAPTPSPTVAGNSNGWL--PNRSVPSTNTA 1134
Query: 46 GTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEA 105
T W+ +W+LLKNL QIDG TL+TLC+QHGPL +FH YLN +AL KY+TREEA
Sbjct: 1135 WTGT-NVSWNSSWLLLKNLNAQIDGPTLRTLCMQHGPLVSFHPYLNQGIALCKYATREEA 1193
Query: 106 IKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHL 139
KAQ LNNC+LGNTTIFAE+PS+ EVQ++L HL
Sbjct: 1194 NKAQMALNNCVLGNTTIFAESPSENEVQNILQHL 1227
>gi|395533346|ref|XP_003768721.1| PREDICTED: trinucleotide repeat-containing gene 6C protein, partial
[Sarcophilus harrisii]
Length = 928
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 97/238 (40%), Positives = 128/238 (53%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 696 SHELWKVPRNTTAPTRPPPGLTNTK---PSSTWGTSPLGWTSSYSSGSAWSTDSSGRTSS 752
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 753 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 812
Query: 118 GNTTIFAEAPSDAEVQSLLAH---LSATA--NNNNNNNGGTGGWARGSSALSNKDT--WS 170
GNTTI AE + EV LA L T+ +N N G S L D W+
Sbjct: 813 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNTGTNQTRMGSTSSSHGLVRNDAGHWN 872
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
+ G G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 873 TPCLGSKGSSDLLWG--GVPQYSSSLWGPPSTDDGRVIGSPTPLNTLLPGDLLSGESI 928
>gi|426346617|ref|XP_004040968.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
[Gorilla gorilla gorilla]
Length = 1726
Score = 136 bits (342), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 97/241 (40%), Positives = 130/241 (53%), Gaps = 25/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG----GGGGGNTWGTSQPQGGW 54
S++LW P+ P PPPG+ PS+ W P G G W T G
Sbjct: 1493 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSACWSTDT--SGR 1547
Query: 55 SGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNN 114
+ +W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+
Sbjct: 1548 TSSWLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHM 1607
Query: 115 CILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKD 167
C+LGNTTI AE + EV LA A ++ + R S+A S+
Sbjct: 1608 CVLGNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAG 1667
Query: 168 TWSSG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
W++ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES
Sbjct: 1668 HWNAPCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGES 1725
Query: 225 M 225
+
Sbjct: 1726 L 1726
>gi|26338668|dbj|BAC33005.1| unnamed protein product [Mus musculus]
Length = 340
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 128/241 (53%), Gaps = 26/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 108 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 164
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 165 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 224
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
GNTTI AE + EV LA A ++ +G T G R +A N
Sbjct: 225 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 284
Query: 166 KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
S G G++ LWG P SLWG P D+ +P+ LN+ LPGDLL GES
Sbjct: 285 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 339
Query: 225 M 225
+
Sbjct: 340 I 340
>gi|211830506|gb|AAH24324.2| TNRC6A protein [Homo sapiens]
Length = 448
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 139/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 200 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 257
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 258 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 317
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 318 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 377
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGT P SLWG PP S R ++PS +N
Sbjct: 378 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGT---PHYSTSLWG-PPSSSDPRGISSPSPIN 433
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 434 AFLSVDHLGGGGESM 448
>gi|7023252|dbj|BAA91899.1| unnamed protein product [Homo sapiens]
Length = 440
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 139/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 192 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 249
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 250 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 309
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 310 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 369
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP SLWG PP S R ++PS +N
Sbjct: 370 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHY---STSLWG-PPSSSDPRGISSPSPIN 425
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 426 AFLSVDHLGGGGESM 440
>gi|426346619|ref|XP_004040969.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
[Gorilla gorilla gorilla]
Length = 1690
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 97/241 (40%), Positives = 130/241 (53%), Gaps = 25/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG----GGGGGNTWGTSQPQGGW 54
S++LW P+ P PPPG+ PS+ W P G G W T G
Sbjct: 1457 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSACWSTDT--SGR 1511
Query: 55 SGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNN 114
+ +W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+
Sbjct: 1512 TSSWLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHM 1571
Query: 115 CILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKD 167
C+LGNTTI AE + EV LA A ++ + R S+A S+
Sbjct: 1572 CVLGNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAG 1631
Query: 168 TWSSG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
W++ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES
Sbjct: 1632 HWNAPCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGES 1689
Query: 225 M 225
+
Sbjct: 1690 L 1690
>gi|354473299|ref|XP_003498873.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
[Cricetulus griseus]
Length = 1888
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 100/238 (42%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1656 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDASGRTSS 1712
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1713 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1772
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWAR-GSS----ALSNKDT--WS 170
GNTTI AE + EV LA A ++ G R GSS L DT WS
Sbjct: 1773 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQPSSGGSQPRLGSSGSTHGLVRSDTAHWS 1832
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRA-TPSSLNSFLPGDLLGGESM 225
+ G G++ LWG P SLWG P D +P+ LN+ LPGDLL GESM
Sbjct: 1833 TPCLSGKGSSELLWG--GVPQYSSSLWGPPSADDARVIGSPTPLNTLLPGDLLSGESM 1888
>gi|363739602|ref|XP_414871.3| PREDICTED: trinucleotide repeat-containing gene 6A protein [Gallus
gallus]
Length = 1950
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 103/253 (40%), Positives = 140/253 (55%), Gaps = 35/253 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWS 55
+++LW PPK P PPPG+ G KPP + W GGG GN+ P W
Sbjct: 1703 AHELWKVPLPPKSITAPSRPPPGLTGQ--KPPLSTWDNSLRLGGGWGNSDARYTPGSSWG 1760
Query: 56 GT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKA 108
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +KA
Sbjct: 1761 ESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVKA 1820
Query: 109 QGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTG----GWARGSSALS 164
Q +L+ C+LGNTTI AE S+ E+ A + + + G+ G GS + S
Sbjct: 1821 QKSLHMCVLGNTTILAEFASEEEISRFFAQGQSLTPSPGWQSLGSSQNRLGSIDGSHSFS 1880
Query: 165 NKDT---WSSGGGGGNTS------QLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSF 214
N++ W+ G G +S LWG+P+ + SLWG P D+ ++PS +N+F
Sbjct: 1881 NRNDLNHWNGAGLSGTSSGDLHGTSLWGSPNYST---SLWGTPSSNDTRGISSPSPINAF 1937
Query: 215 LPGDLL--GGESM 225
L D L GGESM
Sbjct: 1938 LSVDHLGGGGESM 1950
>gi|281342794|gb|EFB18378.1| hypothetical protein PANDA_006877 [Ailuropoda melanoleuca]
Length = 1730
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 100/238 (42%), Positives = 132/238 (55%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1498 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1554
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1555 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1614
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTG----GWARGSSALSNKDT--WS 170
GNTTI AE + EV LA A ++ ++ GTG G A S L DT WS
Sbjct: 1615 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSTGTGQTRLGAAGSSHGLVRSDTGHWS 1674
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
+ G G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1675 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1730
>gi|301766006|ref|XP_002918420.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
[Ailuropoda melanoleuca]
Length = 1720
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 100/238 (42%), Positives = 132/238 (55%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1488 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1544
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1545 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1604
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTG----GWARGSSALSNKDT--WS 170
GNTTI AE + EV LA A ++ ++ GTG G A S L DT WS
Sbjct: 1605 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSTGTGQTRLGAAGSSHGLVRSDTGHWS 1664
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
+ G G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1665 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1720
>gi|345305142|ref|XP_001505551.2| PREDICTED: trinucleotide repeat-containing gene 6A protein
[Ornithorhynchus anatinus]
Length = 1906
Score = 135 bits (341), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 105/254 (41%), Positives = 141/254 (55%), Gaps = 36/254 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1658 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRLGGGWGNSDARYTPGSSW 1715
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1716 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1775
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTG----GWARGSSAL 163
AQ +L+ C+LGNTTI AE S+ E+ A + + + G+ G GS +
Sbjct: 1776 AQKSLHMCVLGNTTILAEFASEEEISRFFAQGQSLTPSPGWQSLGSSQNRLGSIDGSHSF 1835
Query: 164 SNKDT---WSSGGGGGNTS------QLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNS 213
SN++ W+ G G +S LWGTP+ + SLWG P D+ ++PS +N+
Sbjct: 1836 SNRNDLNHWNGAGLSGTSSGDLHGTSLWGTPNYST---SLWGTPSSNDTRGISSPSPINA 1892
Query: 214 FLPGDLL--GGESM 225
FL D L GGESM
Sbjct: 1893 FLSVDHLGGGGESM 1906
>gi|148702675|gb|EDL34622.1| mCG19297, isoform CRA_b [Mus musculus]
Length = 1630
Score = 135 bits (341), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 98/241 (40%), Positives = 132/241 (54%), Gaps = 26/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1398 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1454
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1455 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1514
Query: 118 GNTTIFAEAPSDAEVQSLLAH---LSATANNNNNN---------NGGTGGWARGSSALSN 165
GNTTI AE + EV LA L T++ +N+ +G T G R +A N
Sbjct: 1515 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSNSGGSQPRLGTSGSTHGLVRSDTAHWN 1574
Query: 166 KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
S G G++ LWG P SLWG P D+ +P+ LN+ LPGDLL GES
Sbjct: 1575 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 1629
Query: 225 M 225
+
Sbjct: 1630 I 1630
>gi|392351792|ref|XP_003751024.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
[Rattus norvegicus]
Length = 1919
Score = 135 bits (341), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 127/241 (52%), Gaps = 26/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1687 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1743
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1744 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1803
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
GNTTI AE + EV LA A ++ +G T G R +A N
Sbjct: 1804 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 1863
Query: 166 KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRA-TPSSLNSFLPGDLLGGES 224
S G G++ LWG P SLWG P D +P+ LN+ LPGDLL GES
Sbjct: 1864 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSADDTRVIGSPTPLNTLLPGDLLSGES 1918
Query: 225 M 225
+
Sbjct: 1919 I 1919
>gi|392332214|ref|XP_003752510.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
[Rattus norvegicus]
Length = 1815
Score = 135 bits (341), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 127/241 (52%), Gaps = 26/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1583 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1639
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1640 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1699
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
GNTTI AE + EV LA A ++ +G T G R +A N
Sbjct: 1700 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 1759
Query: 166 KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRA-TPSSLNSFLPGDLLGGES 224
S G G++ LWG P SLWG P D +P+ LN+ LPGDLL GES
Sbjct: 1760 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSADDTRVIGSPTPLNTLLPGDLLSGES 1814
Query: 225 M 225
+
Sbjct: 1815 I 1815
>gi|301612973|ref|XP_002935969.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
[Xenopus (Silurana) tropicalis]
Length = 1663
Score = 135 bits (341), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 97/238 (40%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMV-RPNGGGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W + + S G + +
Sbjct: 1431 SHELWKVPRNTTAPSRPPPGLTNAK---PSSAWSSNQLGWTSSYSSGSTWSTDSSGRTSS 1487
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1488 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEATKAQKSLHMCVL 1547
Query: 118 GNTTIFAEAPSDAEVQSLLAH-----LSATANNNNNNNGGTGGWARGSSALSNKDT--WS 170
GNTTI AE + EV LA +++ +N N+ G A GS L D W+
Sbjct: 1548 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNTGNSQPRLGSAGGSHTLVRSDAAHWN 1607
Query: 171 --SGGGGGNTSQLWGTPSNPSSGGSLWGAP-PLDSVDRATPSSLNSFLPGDLLGGESM 225
G GN LWG P SLWG P D+ +P+ LN+ LPGDLL GESM
Sbjct: 1608 PPCLGSKGNNDLLWG--GVPQYSSSLWGPPGSEDARIIRSPTPLNTLLPGDLLSGESM 1663
>gi|148702674|gb|EDL34621.1| mCG19297, isoform CRA_a [Mus musculus]
Length = 1580
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 98/241 (40%), Positives = 132/241 (54%), Gaps = 26/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1348 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1404
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1405 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1464
Query: 118 GNTTIFAEAPSDAEVQSLLAH---LSATANNNNNN---------NGGTGGWARGSSALSN 165
GNTTI AE + EV LA L T++ +N+ +G T G R +A N
Sbjct: 1465 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSNSGGSQPRLGTSGSTHGLVRSDTAHWN 1524
Query: 166 KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
S G G++ LWG P SLWG P D+ +P+ LN+ LPGDLL GES
Sbjct: 1525 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 1579
Query: 225 M 225
+
Sbjct: 1580 I 1580
>gi|432117597|gb|ELK37833.1| Trinucleotide repeat-containing protein 6A protein [Myotis davidii]
Length = 1886
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 103/255 (40%), Positives = 139/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1638 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1695
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1696 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1755
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ N G+ +
Sbjct: 1756 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQNRLGSLDCSHPF 1815
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1816 SSRTDLSHWNGAGLAGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGMSSPSPIN 1871
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1872 AFLSVDHLGGGGESM 1886
>gi|334323032|ref|XP_001380459.2| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
[Monodelphis domestica]
Length = 1887
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 96/238 (40%), Positives = 128/238 (53%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1655 SHELWKVPRNTTAPTRPPPGLTN---TKPSSTWGTSPLGWTSSYSSGSAWSTDSSGRTSS 1711
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1712 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1771
Query: 118 GNTTIFAEAPSDAEVQSLLAH-----LSATANNNNNNNGGTGGWARGSSALSNKDT--WS 170
GNTTI AE + EV LA +++ +N N G S L D W+
Sbjct: 1772 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNTGTNQTRMGSTNSSHGLVRNDAGHWN 1831
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
+ G G+T LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1832 TPCLGSKGSTDLLWG--GVPQYSSSLWGPPSTDDGRVIGSPTPLNTLLPGDLLSGESI 1887
>gi|211826331|gb|AAH05741.2| Tnrc6a protein [Mus musculus]
Length = 661
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 95/246 (38%), Positives = 133/246 (54%), Gaps = 33/246 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 408 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 465
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 466 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 525
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 526 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 585
Query: 161 SALSNKDTWSSGGGGG------NTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
S+ ++ + W+ G G + + LWGT P SLWG P D ++PS +N+F
Sbjct: 586 SSRTDVNHWNGAGLSGANCGDLHGTSLWGT---PHYSTSLWGPPSSDPRGISSPSPINAF 642
Query: 215 LPGDLL 220
L D L
Sbjct: 643 LSVDHL 648
>gi|431908726|gb|ELK12318.1| Trinucleotide repeat-containing protein 6C protein [Pteropus
alecto]
Length = 670
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 438 SHELWKVPRNTTAPTRPPPGLTNPK---PSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 494
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS+++EA KAQ +L+ C+L
Sbjct: 495 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKDEAAKAQKSLHMCVL 554
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWAR-----GSSALSNKDT--WS 170
GNTTI AE + EV LA A ++ + G R S L+ DT W+
Sbjct: 555 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGAGQTRLGASGSSHGLARSDTGHWN 614
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
+ G G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 615 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 670
>gi|16551820|dbj|BAB71179.1| unnamed protein product [Homo sapiens]
Length = 582
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 350 SHELWKVPRNSTAPTRPPPGLTNPK---PSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 406
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 407 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 466
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 467 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 526
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 527 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 582
>gi|363740796|ref|XP_415612.3| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 3
[Gallus gallus]
Length = 1897
Score = 135 bits (339), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 97/240 (40%), Positives = 130/240 (54%), Gaps = 25/240 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1666 SHELWKVPRNTTAPTRPPPGLTN---TKPSSTWGASPLGWTSSYSSGSAWSTDSSGRTSS 1722
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1723 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1782
Query: 118 GNTTIFAEAPSDAEVQSLLAH---LSATANNNNNN--------NGGTGGWARGSSALSNK 166
GNTTI AE + EV LA L T++ +N + G+ G RG + N
Sbjct: 1783 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNAGSTQPRLGSAGSHGLVRGDTGHWNS 1842
Query: 167 DTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
GG G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1843 PCL---GGKGSSELLWG--GVPQYSSSLWGPPSADDGRVIGSPTPLNTLLPGDLLSGESI 1897
>gi|358418930|ref|XP_614640.5| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 1
[Bos taurus]
Length = 1958
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1710 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSSWDNSPLRVGGGWGNSDARYTPGSSW 1767
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1768 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1827
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1828 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPSWQSLGSSQSRLGSLDCSHSF 1887
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1888 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1943
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1944 AFLSVDHLGGGGESM 1958
>gi|440898230|gb|ELR49767.1| Trinucleotide repeat-containing 6A protein, partial [Bos grunniens
mutus]
Length = 1928
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1680 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSSWDNSPLRVGGGWGNSDARYTPGSSW 1737
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1738 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1797
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1798 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPSWQSLGSSQSRLGSLDCSHSF 1857
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1858 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1913
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1914 AFLSVDHLGGGGESM 1928
>gi|359079715|ref|XP_002698073.2| PREDICTED: trinucleotide repeat-containing gene 6A protein [Bos
taurus]
Length = 1921
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1673 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSSWDNSPLRVGGGWGNSDARYTPGSSW 1730
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1731 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1790
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1791 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPSWQSLGSSQSRLGSLDCSHSF 1850
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1851 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1906
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1907 AFLSVDHLGGGGESM 1921
>gi|363740798|ref|XP_003642381.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
[Gallus gallus]
Length = 1719
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 97/240 (40%), Positives = 130/240 (54%), Gaps = 25/240 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1488 SHELWKVPRNTTAPTRPPPGLTN---TKPSSTWGASPLGWTSSYSSGSAWSTDSSGRTSS 1544
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1545 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1604
Query: 118 GNTTIFAEAPSDAEVQSLLAH---LSATANNNNNN--------NGGTGGWARGSSALSNK 166
GNTTI AE + EV LA L T++ +N + G+ G RG + N
Sbjct: 1605 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNAGSTQPRLGSAGSHGLVRGDTGHWNS 1664
Query: 167 DTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
GG G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1665 PCL---GGKGSSELLWG--GVPQYSSSLWGPPSADDGRVIGSPTPLNTLLPGDLLSGESI 1719
>gi|363740794|ref|XP_003642380.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
[Gallus gallus]
Length = 1683
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 97/240 (40%), Positives = 130/240 (54%), Gaps = 25/240 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1452 SHELWKVPRNTTAPTRPPPGLTN---TKPSSTWGASPLGWTSSYSSGSAWSTDSSGRTSS 1508
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1509 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1568
Query: 118 GNTTIFAEAPSDAEVQSLLAH---LSATANNNNNN--------NGGTGGWARGSSALSNK 166
GNTTI AE + EV LA L T++ +N + G+ G RG + N
Sbjct: 1569 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNAGSTQPRLGSAGSHGLVRGDTGHWNS 1628
Query: 167 DTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
GG G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1629 PCL---GGKGSSELLWG--GVPQYSSSLWGPPSADDGRVIGSPTPLNTLLPGDLLSGESI 1683
>gi|297283685|ref|XP_001098013.2| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 1
[Macaca mulatta]
Length = 1971
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1723 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1780
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1781 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1840
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1841 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1900
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1901 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1956
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1957 AFLSVDHLGGGGESM 1971
>gi|124378035|ref|NP_932139.2| trinucleotide repeat-containing gene 6C protein [Mus musculus]
Length = 1900
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 128/241 (53%), Gaps = 26/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1668 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1724
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1725 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1784
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
GNTTI AE + EV LA A ++ +G T G R +A N
Sbjct: 1785 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 1844
Query: 166 KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
S G G++ LWG P SLWG P D+ +P+ LN+ LPGDLL GES
Sbjct: 1845 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 1899
Query: 225 M 225
+
Sbjct: 1900 I 1900
>gi|74184652|dbj|BAE27937.1| unnamed protein product [Mus musculus]
Length = 1900
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 128/241 (53%), Gaps = 26/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1668 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1724
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1725 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1784
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
GNTTI AE + EV LA A ++ +G T G R +A N
Sbjct: 1785 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 1844
Query: 166 KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
S G G++ LWG P SLWG P D+ +P+ LN+ LPGDLL GES
Sbjct: 1845 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 1899
Query: 225 M 225
+
Sbjct: 1900 I 1900
>gi|355710058|gb|EHH31522.1| hypothetical protein EGK_12611 [Macaca mulatta]
Length = 1940
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1692 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1749
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1750 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1809
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1810 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1869
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1870 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1925
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1926 AFLSVDHLGGGGESM 1940
>gi|345802132|ref|XP_547086.3| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 1
[Canis lupus familiaris]
Length = 1931
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1683 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1740
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1741 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1800
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1801 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1860
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1861 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1916
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1917 AFLSVDHLGGGGESM 1931
>gi|28972790|dbj|BAC65811.1| mKIAA1582 protein [Mus musculus]
Length = 1362
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 128/241 (53%), Gaps = 26/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1130 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1186
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1187 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1246
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
GNTTI AE + EV LA A ++ +G T G R +A N
Sbjct: 1247 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 1306
Query: 166 KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
S G G++ LWG P SLWG P D+ +P+ LN+ LPGDLL GES
Sbjct: 1307 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 1361
Query: 225 M 225
+
Sbjct: 1362 I 1362
>gi|355756645|gb|EHH60253.1| hypothetical protein EGM_11578 [Macaca fascicularis]
Length = 1942
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1694 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1751
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1752 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1811
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1812 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1871
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1872 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1927
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1928 AFLSVDHLGGGGESM 1942
>gi|426254457|ref|XP_004020895.1| PREDICTED: trinucleotide repeat-containing gene 6A protein [Ovis
aries]
Length = 1706
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1458 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSSWDNSPLRVGGGWGNSDARYTPGSSW 1515
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1516 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1575
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1576 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPSWQSLGSSQSRLGSLDCSHSF 1635
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1636 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1691
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1692 AFLSVDHLGGGGESM 1706
>gi|395846174|ref|XP_003795787.1| PREDICTED: trinucleotide repeat-containing gene 6A protein [Otolemur
garnettii]
Length = 1926
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1678 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1735
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1736 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1795
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1796 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1855
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1856 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1911
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1912 AFLSVDHLGGGGESM 1926
>gi|344294499|ref|XP_003418954.1| PREDICTED: trinucleotide repeat-containing gene 6A protein [Loxodonta
africana]
Length = 1931
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1683 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPIRVGGGWGNSDARYTPGSSW 1740
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1741 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1800
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1801 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1860
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1861 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1916
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1917 AFLSVDHLGGGGESM 1931
>gi|297462713|ref|XP_580298.5| PREDICTED: trinucleotide repeat-containing gene 6C protein [Bos
taurus]
gi|297487379|ref|XP_002696206.1| PREDICTED: trinucleotide repeat-containing gene 6C protein [Bos
taurus]
gi|296476006|tpg|DAA18121.1| TPA: hypothetical protein BOS_19462 [Bos taurus]
Length = 1724
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 97/239 (40%), Positives = 129/239 (53%), Gaps = 22/239 (9%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1492 SHELWKVPRNTTAPTRPPPGLTN---PKPSSAWGASPLGWTSSYSSGSAWSTDASGRTSS 1548
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1549 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGHAVVRYSSKEEAAKAQKSLHMCVL 1608
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSAT--------ANNNNNNNGGTGGWARGSSALSNKDTW 169
GNTTI AE + EV LA A + + GT G A G S+ W
Sbjct: 1609 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQPSPGTSQTRLGTSGSAHG-LVRSDAGHW 1667
Query: 170 SSGG--GGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
++ G G G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1668 NAPGLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1724
>gi|149067987|gb|EDM17539.1| trinucleotide repeat containing 6 (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 1937
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1689 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1746
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1747 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1806
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1807 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1866
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1867 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1922
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1923 AFLSVDHLGGGGESM 1937
>gi|296219794|ref|XP_002756022.1| PREDICTED: trinucleotide repeat-containing gene 6A protein
[Callithrix jacchus]
Length = 1963
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1715 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1772
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1773 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1832
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1833 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1892
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1893 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1948
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1949 AFLSVDHLGGGGESM 1963
>gi|402907996|ref|XP_003916744.1| PREDICTED: trinucleotide repeat-containing gene 6A protein [Papio
anubis]
Length = 1926
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1678 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1735
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1736 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1795
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1796 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1855
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1856 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1911
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1912 AFLSVDHLGGGGESM 1926
>gi|403277188|ref|XP_003930258.1| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 1
[Saimiri boliviensis boliviensis]
Length = 1923
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1675 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1732
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1733 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1792
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1793 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1852
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1853 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1908
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1909 AFLSVDHLGGGGESM 1923
>gi|345804568|ref|XP_540459.3| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
[Canis lupus familiaris]
Length = 1723
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/238 (41%), Positives = 132/238 (55%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1491 SHELWKVPRNTTAPTRPPPGL---SNPKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1547
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1548 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1607
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTG----GWARGSSALSNKD--TWS 170
GNTTI AE + EV LA A ++ ++ GTG G + GS L D WS
Sbjct: 1608 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSTGTGQTRLGASGGSHGLVRSDPGHWS 1667
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
+ G G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1668 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1723
>gi|397485193|ref|XP_003813742.1| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 1
[Pan paniscus]
Length = 1940
Score = 134 bits (337), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1692 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1749
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1750 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1809
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1810 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPSWQSLGSSQSRLGSLDCSHSF 1869
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1870 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1925
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1926 AFLSVDHLGGGGESM 1940
>gi|116805348|ref|NP_055309.2| trinucleotide repeat-containing gene 6A protein [Homo sapiens]
gi|296452846|sp|Q8NDV7.2|TNR6A_HUMAN RecName: Full=Trinucleotide repeat-containing gene 6A protein;
AltName: Full=CAG repeat protein 26; AltName: Full=EMSY
interactor protein; AltName: Full=GW182 autoantigen;
Short=Protein GW1; AltName: Full=Glycine-tryptophan
protein of 182 kDa
gi|225000816|gb|AAI72409.1| Trinucleotide repeat containing 6A [synthetic construct]
gi|306921199|dbj|BAJ17679.1| trinucleotide repeat containing 6A [synthetic construct]
Length = 1962
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1714 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1771
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1772 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1831
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1832 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1891
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1892 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1947
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1948 AFLSVDHLGGGGESM 1962
>gi|395747616|ref|XP_002826294.2| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
6A protein [Pongo abelii]
Length = 1932
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1684 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1741
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1742 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1801
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1802 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1861
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1862 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1917
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1918 AFLSVDHLGGGGESM 1932
>gi|390463863|ref|XP_002748839.2| PREDICTED: trinucleotide repeat-containing gene 6C protein
[Callithrix jacchus]
Length = 1976
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 130/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1744 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1800
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1801 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1860
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A + ++ + R S+A S+ W+
Sbjct: 1861 GNTTILAEFAGEEEVNRFLAQGQALPSTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1920
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 1921 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1976
>gi|281344769|gb|EFB20353.1| hypothetical protein PANDA_019391 [Ailuropoda melanoleuca]
Length = 1905
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1657 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDSRYTPGSSW 1714
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1715 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1774
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1775 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1834
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1835 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1890
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1891 AFLSVDHLGGGGESM 1905
>gi|119576188|gb|EAW55784.1| trinucleotide repeat containing 6A, isoform CRA_c [Homo sapiens]
Length = 1935
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1687 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1744
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1745 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1804
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1805 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1864
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1865 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1920
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1921 AFLSVDHLGGGGESM 1935
>gi|345804566|ref|XP_003435198.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
[Canis lupus familiaris]
Length = 1687
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/238 (41%), Positives = 132/238 (55%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1455 SHELWKVPRNTTAPTRPPPGL---SNPKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1511
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1512 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1571
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTG----GWARGSSALSNKD--TWS 170
GNTTI AE + EV LA A ++ ++ GTG G + GS L D WS
Sbjct: 1572 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSTGTGQTRLGASGGSHGLVRSDPGHWS 1631
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
+ G G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1632 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1687
>gi|126253814|sp|Q3UHC0.2|TNR6C_MOUSE RecName: Full=Trinucleotide repeat-containing gene 6C protein
Length = 1690
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 128/241 (53%), Gaps = 26/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1458 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 1514
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1515 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1574
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
GNTTI AE + EV LA A ++ +G T G R +A N
Sbjct: 1575 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 1634
Query: 166 KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
S G G++ LWG P SLWG P D+ +P+ LN+ LPGDLL GES
Sbjct: 1635 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 1689
Query: 225 M 225
+
Sbjct: 1690 I 1690
>gi|301787707|ref|XP_002929270.1| PREDICTED: trinucleotide repeat-containing gene 6A protein-like
[Ailuropoda melanoleuca]
Length = 1939
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1691 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDSRYTPGSSW 1748
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1749 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1808
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1809 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1868
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1869 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1924
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1925 AFLSVDHLGGGGESM 1939
>gi|21740153|emb|CAD39090.1| hypothetical protein [Homo sapiens]
Length = 1064
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 832 SHELWKVPRNSTAPTRPPPGLTNPK---PSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 888
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 889 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 948
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 949 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1008
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 1009 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1064
>gi|52545832|emb|CAH56236.1| hypothetical protein [Homo sapiens]
Length = 1053
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 139/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 805 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 862
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 863 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 922
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 923 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 982
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGT P SLWG PP S R ++PS +N
Sbjct: 983 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGT---PHYSTSLWG-PPSSSDPRGISSPSPIN 1038
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1039 AFLSVDHLGGGGESM 1053
>gi|21693029|emb|CAD37348.1| EDIE protein [Homo sapiens]
Length = 1962
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1714 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1771
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1772 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1831
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1832 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1891
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1892 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1947
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1948 AFLSVDHLGGGGESM 1962
>gi|195469377|ref|XP_002099614.1| GE14506 [Drosophila yakuba]
gi|194185715|gb|EDW99326.1| GE14506 [Drosophila yakuba]
Length = 1386
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 71/142 (50%), Positives = 88/142 (61%), Gaps = 15/142 (10%)
Query: 13 RGPPPGMMGGGGKPPS--NGWMVRPNGGGGGGNTWGTSQPQG-------------GWSGT 57
RGPPPG+ K + N P GG N W + G W +
Sbjct: 1056 RGPPPGLTANSNKSGNGGNSCTSTPTTVAGGANGWLQGRSGGVQTTNTTWTGGNSSWGSS 1115
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W+LLKNLT QIDG TL+TLC+QHGPL +FH YL+ +AL KY+TREEA KAQ LNNC+L
Sbjct: 1116 WLLLKNLTAQIDGPTLRTLCMQHGPLVSFHPYLSQGIALCKYTTREEANKAQMALNNCVL 1175
Query: 118 GNTTIFAEAPSDAEVQSLLAHL 139
NTTIFAE+PS+ EVQ+++ HL
Sbjct: 1176 ANTTIFAESPSENEVQNIMQHL 1197
>gi|426381585|ref|XP_004057417.1| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 1
[Gorilla gorilla gorilla]
Length = 1935
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1687 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1744
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1745 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1804
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1805 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1864
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1865 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1920
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1921 AFLSVDHLGGGGESM 1935
>gi|148685347|gb|EDL17294.1| mCG20982, isoform CRA_e [Mus musculus]
Length = 1953
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 95/247 (38%), Positives = 134/247 (54%), Gaps = 33/247 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1690 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1747
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1748 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1807
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1808 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1867
Query: 161 SALSNKDTWSSGGGGG------NTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
S+ ++ + W+ G G + + LWGTP + SLWG P D ++PS +N+F
Sbjct: 1868 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 1924
Query: 215 LPGDLLG 221
L D L
Sbjct: 1925 LSVDHLA 1931
>gi|440892464|gb|ELR45644.1| Trinucleotide repeat-containing 6C protein, partial [Bos grunniens
mutus]
Length = 1738
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 97/239 (40%), Positives = 129/239 (53%), Gaps = 22/239 (9%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1506 SHELWKVPRNTTAPTRPPPGLTN---PKPSSAWGASPLGWTSSYSSGSAWSTDASGRTSS 1562
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1563 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1622
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSAT--------ANNNNNNNGGTGGWARGSSALSNKDTW 169
GNTTI AE + EV LA A + + GT G A G S+ W
Sbjct: 1623 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQPSPGTSQTRLGTSGSAHG-LVRSDAGHW 1681
Query: 170 SSGG--GGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
++ G G G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1682 NAPGLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1738
>gi|148685344|gb|EDL17291.1| mCG20982, isoform CRA_b [Mus musculus]
Length = 1893
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 96/247 (38%), Positives = 134/247 (54%), Gaps = 33/247 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1630 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1687
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1688 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1747
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1748 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1807
Query: 161 SALSNKDTWSSGG-GGGNT-----SQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
S+ ++ + W+ G G N + LWGTP + SLWG P D ++PS +N+F
Sbjct: 1808 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 1864
Query: 215 LPGDLLG 221
L D L
Sbjct: 1865 LSVDHLA 1871
>gi|148685343|gb|EDL17290.1| mCG20982, isoform CRA_a [Mus musculus]
Length = 1892
Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 96/247 (38%), Positives = 134/247 (54%), Gaps = 33/247 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1629 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1686
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1687 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1746
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1747 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1806
Query: 161 SALSNKDTWSSGG-GGGNT-----SQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
S+ ++ + W+ G G N + LWGTP + SLWG P D ++PS +N+F
Sbjct: 1807 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 1863
Query: 215 LPGDLLG 221
L D L
Sbjct: 1864 LSVDHLA 1870
>gi|403277190|ref|XP_003930259.1| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 2
[Saimiri boliviensis boliviensis]
Length = 1706
Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1458 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1515
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1516 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1575
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1576 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1635
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1636 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1691
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1692 AFLSVDHLGGGGESM 1706
>gi|7959181|dbj|BAA95984.1| KIAA1460 protein [Homo sapiens]
Length = 1400
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1152 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1209
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1210 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1269
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1270 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1329
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1330 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1385
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1386 AFLSVDHLGGGGESM 1400
>gi|148685345|gb|EDL17292.1| mCG20982, isoform CRA_c [Mus musculus]
Length = 1884
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 95/247 (38%), Positives = 134/247 (54%), Gaps = 33/247 (13%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1621 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1678
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1679 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1738
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1739 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1798
Query: 161 SALSNKDTWSSGGGGG------NTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSF 214
S+ ++ + W+ G G + + LWGTP + SLWG P D ++PS +N+F
Sbjct: 1799 SSRTDVNHWNGAGLSGANCGDLHGTSLWGTPHYST---SLWGPPSSDPRGISSPSPINAF 1855
Query: 215 LPGDLLG 221
L D L
Sbjct: 1856 LSVDHLA 1862
>gi|119576187|gb|EAW55783.1| trinucleotide repeat containing 6A, isoform CRA_b [Homo sapiens]
Length = 1601
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1353 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1410
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1411 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1470
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1471 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1530
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1531 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1586
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1587 AFLSVDHLGGGGESM 1601
>gi|28374385|gb|AAH45631.1| TNRC6C protein, partial [Homo sapiens]
Length = 999
Score = 133 bits (335), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 128/238 (53%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 767 SHELWKVPRNSTAPTRPPPGLTNPK---PSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 823
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ YS++EEA KAQ +L+ C+L
Sbjct: 824 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVPYSSKEEAAKAQKSLHMCVL 883
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 884 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 943
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 944 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 999
>gi|21307718|gb|AAK62026.1| GW182 autoantigen [Homo sapiens]
gi|119576190|gb|EAW55786.1| trinucleotide repeat containing 6A, isoform CRA_e [Homo sapiens]
Length = 1709
Score = 133 bits (335), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1461 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1518
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1519 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1578
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1579 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1638
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1639 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1694
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1695 AFLSVDHLGGGGESM 1709
>gi|426381587|ref|XP_004057418.1| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 2
[Gorilla gorilla gorilla]
Length = 1709
Score = 133 bits (335), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1461 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1518
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1519 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1578
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1579 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1638
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1639 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1694
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1695 AFLSVDHLGGGGESM 1709
>gi|397485195|ref|XP_003813743.1| PREDICTED: trinucleotide repeat-containing gene 6A protein isoform 2
[Pan paniscus]
Length = 1709
Score = 133 bits (335), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1461 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1518
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1519 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1578
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1579 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPSWQSLGSSQSRLGSLDCSHSF 1638
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1639 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1694
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1695 AFLSVDHLGGGGESM 1709
>gi|74137224|dbj|BAE21997.1| unnamed protein product [Mus musculus]
Length = 337
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 95/241 (39%), Positives = 127/241 (52%), Gaps = 26/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 105 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGTSPLGWTSSYSSGSAWSTDTSGRTSS 161
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+N TPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 162 WLVLRNPTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 221
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNN------------NNGGTGGWARGSSALSN 165
GNTTI AE + EV LA A ++ +G T G R +A N
Sbjct: 222 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGGSQPRLGTSGSTHGLVRSDTAHWN 281
Query: 166 KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
S G G++ LWG P SLWG P D+ +P+ LN+ LPGDLL GES
Sbjct: 282 TPCLS---GKGSSELLWG--GVPQYSSSLWGPPSAEDARVIGSPTPLNTLLPGDLLSGES 336
Query: 225 M 225
+
Sbjct: 337 I 337
>gi|410984990|ref|XP_003998808.1| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
6A protein [Felis catus]
Length = 1927
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 101/255 (39%), Positives = 140/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1679 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1736
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1737 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1796
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1797 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1856
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + + LWG P + SLWG PP S R ++PS +N
Sbjct: 1857 SSRTDLNHWNGAGLSGTSCGDLHGTSLWGAPHYST---SLWG-PPSSSDPRGMSSPSPIN 1912
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1913 AFLSVDHLGGGGESM 1927
>gi|338712781|ref|XP_001501299.3| PREDICTED: trinucleotide repeat-containing gene 6A protein [Equus
caballus]
Length = 1924
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 138/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1676 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDTRYTPGSSW 1733
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1734 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1793
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G +
Sbjct: 1794 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGALDCSHPF 1853
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGT P SLWG PP S R ++PS +N
Sbjct: 1854 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGT---PHYSASLWG-PPSSSDPRGISSPSPIN 1909
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1910 AFLSVDHLGGGGESM 1924
>gi|403280805|ref|XP_003931900.1| PREDICTED: trinucleotide repeat-containing gene 6C protein [Saimiri
boliviensis boliviensis]
Length = 2119
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1887 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1943
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1944 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 2003
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 2004 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 2063
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 2064 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 2119
>gi|332849222|ref|XP_001144739.2| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
6C protein [Pan troglodytes]
Length = 1942
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1710 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1766
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1767 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1826
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 1827 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1886
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 1887 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1942
>gi|441643594|ref|XP_004090530.1| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
6C protein [Nomascus leucogenys]
Length = 1725
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 127/238 (53%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1493 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1549
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1550 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1609
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSALSNKDTWSSG----- 172
GNTTI AE + EV LA A ++ + R S+A S+ S
Sbjct: 1610 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAASSSHGLVRSDAGHWN 1669
Query: 173 ----GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 1670 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1725
>gi|327264967|ref|XP_003217280.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
[Anolis carolinensis]
Length = 1884
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1652 SHELWKVPRNTTAPTRPPPGLTN---TKPSSTWGASPLGWTSSYSSGSAWSTDSSGRTSS 1708
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1709 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1768
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTGGWARGSSALSNKDTWSSG---- 172
GNTTI AE + EV LA ++ +N GT GS++ S+ +
Sbjct: 1769 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNTGTTQTRLGSTSSSHGMVRNEAGHWN 1828
Query: 173 ----GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
GG G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1829 APCLGGKGSSDLLWG--GVPQYSSSLWGPPSTDDGRVIGSPTPLNTLLPGDLLSGESI 1884
>gi|395749509|ref|XP_002827930.2| PREDICTED: trinucleotide repeat-containing gene 6C protein [Pongo
abelii]
Length = 1935
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1703 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1759
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1760 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1819
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 1820 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1879
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 1880 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1935
>gi|386781810|ref|NP_001247675.1| trinucleotide repeat-containing gene 6C protein [Macaca mulatta]
gi|402901227|ref|XP_003913556.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
[Papio anubis]
gi|355754415|gb|EHH58380.1| hypothetical protein EGM_08214 [Macaca fascicularis]
gi|380815096|gb|AFE79422.1| trinucleotide repeat-containing gene 6C protein isoform 1 [Macaca
mulatta]
gi|383420321|gb|AFH33374.1| trinucleotide repeat-containing gene 6C protein isoform 1 [Macaca
mulatta]
Length = 1725
Score = 132 bits (333), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1493 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDASGRTSS 1549
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1550 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1609
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 1610 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1669
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 1670 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1725
>gi|119609889|gb|EAW89483.1| trinucleotide repeat containing 6C, isoform CRA_d [Homo sapiens]
Length = 1687
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1455 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1511
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1512 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1571
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 1572 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1631
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 1632 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1687
>gi|397494949|ref|XP_003818329.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
[Pan paniscus]
Length = 1689
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1457 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1513
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1514 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1573
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 1574 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1633
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 1634 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1689
>gi|217416332|ref|NP_001136112.1| trinucleotide repeat-containing gene 6C protein isoform 1 [Homo
sapiens]
gi|119609886|gb|EAW89480.1| trinucleotide repeat containing 6C, isoform CRA_a [Homo sapiens]
Length = 1726
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1494 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1550
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1551 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1610
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 1611 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1670
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 1671 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1726
>gi|397494947|ref|XP_003818328.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
[Pan paniscus]
Length = 1725
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1493 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1549
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1550 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1609
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 1610 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1669
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 1670 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1725
>gi|355568964|gb|EHH25245.1| hypothetical protein EGK_09030 [Macaca mulatta]
Length = 1725
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1493 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDASGRTSS 1549
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1550 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1609
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 1610 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1669
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 1670 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1725
>gi|402901229|ref|XP_003913557.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
[Papio anubis]
Length = 1689
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1457 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDASGRTSS 1513
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1514 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1573
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 1574 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1633
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 1634 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1689
>gi|33413425|ref|NP_061869.2| trinucleotide repeat-containing gene 6C protein isoform 2 [Homo
sapiens]
gi|126253813|sp|Q9HCJ0.3|TNR6C_HUMAN RecName: Full=Trinucleotide repeat-containing gene 6C protein
gi|119609891|gb|EAW89485.1| trinucleotide repeat containing 6C, isoform CRA_f [Homo sapiens]
gi|162317668|gb|AAI56367.1| Trinucleotide repeat containing 6C [synthetic construct]
gi|162318186|gb|AAI57116.1| Trinucleotide repeat containing 6C [synthetic construct]
gi|168275508|dbj|BAG10474.1| trinucleotide repeat-containing 6C protein [synthetic construct]
Length = 1690
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1458 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1514
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1515 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1574
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 1575 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1634
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 1635 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1690
>gi|20521948|dbj|BAB13408.2| KIAA1582 protein [Homo sapiens]
Length = 1740
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1508 SHELWKVPRNSTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1564
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1565 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1624
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSA-------LSNKDTWS 170
GNTTI AE + EV LA A ++ + R S+A S+ W+
Sbjct: 1625 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLSAAGSSHGLVRSDAGHWN 1684
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGESM 225
+ GG G++ LWG P SLWG P DS +P+ L + LPGDLL GES+
Sbjct: 1685 APCLGGKGSSELLWG--GVPQYSSSLWGPPSADDSRVIGSPTPLTTLLPGDLLSGESL 1740
>gi|338711312|ref|XP_003362511.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
[Equus caballus]
Length = 1721
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 96/238 (40%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1489 SHELWKVPRNTAAPTRPPPGLTT---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1545
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1546 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1605
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTGGWARGSSALSNKDTWSSGG--- 173
GNTTI AE + EV LA A ++ ++ GT G+S S+ S G
Sbjct: 1606 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSTGTSQTRLGASGSSHGLVRSDAGHWN 1665
Query: 174 -----GGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
G G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1666 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1721
>gi|338711310|ref|XP_001491270.3| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
[Equus caballus]
Length = 1686
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 96/238 (40%), Positives = 129/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1454 SHELWKVPRNTAAPTRPPPGLTT---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1510
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1511 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1570
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTGGWARGSSALSNKDTWSSGG--- 173
GNTTI AE + EV LA A ++ ++ GT G+S S+ S G
Sbjct: 1571 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSTGTSQTRLGASGSSHGLVRSDAGHWN 1630
Query: 174 -----GGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
G G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1631 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSGESI 1686
>gi|344291110|ref|XP_003417279.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 1
[Loxodonta africana]
Length = 1683
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/241 (40%), Positives = 129/241 (53%), Gaps = 26/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1451 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1507
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1508 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1567
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSA--------TANNNNNNNGGTGGWARGSSALSNKDT- 168
GNTTI AE + EV LA A +++ + G G A G L DT
Sbjct: 1568 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGTSQTRLGASGSAHG---LVRSDTG 1624
Query: 169 -WSSG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGES 224
WS+ G G++ LW P SLWG P D +P+ LN+ LPGDLL GES
Sbjct: 1625 HWSAPCLGSKGSSDLLWS--GVPQYSSSLWGPPSADDGRVIGSPTPLNTLLPGDLLSGES 1682
Query: 225 M 225
M
Sbjct: 1683 M 1683
>gi|417406796|gb|JAA50040.1| Putative thyroid hormone receptor-associated protein complex subunit
[Desmodus rotundus]
Length = 1889
Score = 132 bits (331), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 110/183 (60%), Gaps = 12/183 (6%)
Query: 53 GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNL 112
G + +W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L
Sbjct: 1709 GRTSSWLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSL 1768
Query: 113 NNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWAR----GSS---ALSN 165
+ C+LGNTTI AE + EV LA A ++ +G G AR GSS S+
Sbjct: 1769 HMCVLGNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSGTGTGQARLGAAGSSHGLVRSD 1828
Query: 166 KDTWSSG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGG 222
W++ G G++ LWG P SLWG P D +P+ LN+ LPGDLL G
Sbjct: 1829 AGHWNAPCLAGKGSSDLLWG--GVPQYSSSLWGPPSSDDGRVIGSPTPLNTLLPGDLLSG 1886
Query: 223 ESM 225
ES+
Sbjct: 1887 ESL 1889
>gi|344291112|ref|XP_003417280.1| PREDICTED: trinucleotide repeat-containing gene 6C protein isoform 2
[Loxodonta africana]
Length = 1719
Score = 132 bits (331), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/241 (40%), Positives = 129/241 (53%), Gaps = 26/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1487 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1543
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1544 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1603
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSA--------TANNNNNNNGGTGGWARGSSALSNKDT- 168
GNTTI AE + EV LA A +++ + G G A G L DT
Sbjct: 1604 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSGTSQTRLGASGSAHG---LVRSDTG 1660
Query: 169 -WSSG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGES 224
WS+ G G++ LW P SLWG P D +P+ LN+ LPGDLL GES
Sbjct: 1661 HWSAPCLGSKGSSDLLWS--GVPQYSSSLWGPPSADDGRVIGSPTPLNTLLPGDLLSGES 1718
Query: 225 M 225
M
Sbjct: 1719 M 1719
>gi|312384471|gb|EFR29195.1| hypothetical protein AND_02092 [Anopheles darlingi]
Length = 378
Score = 132 bits (331), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 104/259 (40%), Positives = 126/259 (48%), Gaps = 54/259 (20%)
Query: 4 NDLWGPP------KPRGPPPGMMGGGGKP---PSNGWMVRPNGGGGGGNTWGTSQPQGGW 54
D+W P RGPPPG+ G G + G R + G+ G G
Sbjct: 133 TDVWSAPIGKLSATTRGPPPGLGGANGNKHIGSTGGVASRISANATWGSASGGGAGTAGS 192
Query: 55 SGT----WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQG 110
GT W+LL+NLT QID STL+TLC+QHGP+ +FH Y H LAL +YS+ +EA+KAQ
Sbjct: 193 WGTTGTSWLLLRNLTSQIDASTLRTLCMQHGPILSFHPYPAHGLALCRYSSSDEAMKAQQ 252
Query: 111 NLNNCILGNTTIFAEAP-SDAEVQSLLAHL-SATANNNNNNNGGTGGWARG--------- 159
LNNC LG +TI AE P S+AEVQ+ L L TA ++ GT
Sbjct: 253 ALNNCPLGASTISAECPSSEAEVQTYLQQLGGGTAITATVSSTGTASGTGSISSISSQSW 312
Query: 160 -----SSALSNKDTWSS----------GGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVD 204
++A DTW S G G NTS LW PLD D
Sbjct: 313 RLRTPTAATGGTDTWGSGWPIGRDTGDGSGSTNTSNLWA---------------PLDGGD 357
Query: 205 RATPSSLNSFLPGDLLGGE 223
R TPSSLNSFLP LLG E
Sbjct: 358 RETPSSLNSFLPESLLGSE 376
>gi|224074955|ref|XP_002194333.1| PREDICTED: trinucleotide repeat-containing gene 6C protein
[Taeniopygia guttata]
Length = 1719
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 98/238 (41%), Positives = 130/238 (54%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1487 SHELWKVPRNTTAPTRPPPGLTN---TKPSSTWGASPLGWTSSYSSGSAWSTDSSGRTSS 1543
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1544 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1603
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN--NNNGGTG---GWARGSSALSNKDT--WS 170
GNTTI AE + EV LA A ++ +N G T G + S L D W+
Sbjct: 1604 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSNTGSTPSRLGSSGSSHGLVRPDAGHWN 1663
Query: 171 --SGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
GG G++ LWG P SLWG P D +P+ LN+ LPGDLL GES+
Sbjct: 1664 PPCLGGKGSSDLLWG--GVPQYSSSLWGPPSADDGRVIGSPTPLNTLLPGDLLSGESI 1719
>gi|395826836|ref|XP_003786620.1| PREDICTED: trinucleotide repeat-containing gene 6C protein [Otolemur
garnettii]
Length = 1696
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 96/238 (40%), Positives = 127/238 (53%), Gaps = 20/238 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1464 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1520
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1521 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1580
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWAR----GSS---ALSNKDTWS 170
GNTTI AE + EV LA A ++ + R GSS S+ WS
Sbjct: 1581 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSSASSQPRLGASGSSHGLVRSDAGHWS 1640
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-VDRATPSSLNSFLPGDLLGGESM 225
+ GG G + LWG P SLWG P D +P+ L + LPGDLL GES+
Sbjct: 1641 APCLGGKGGSELLWG--GGPQYSSSLWGPPSADDGRVIGSPTPLTTLLPGDLLSGESL 1696
>gi|334333496|ref|XP_001369211.2| PREDICTED: trinucleotide repeat-containing gene 6A protein
[Monodelphis domestica]
Length = 1884
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 103/254 (40%), Positives = 138/254 (54%), Gaps = 36/254 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G K P + W P GGG GN+ P W
Sbjct: 1636 AHELWKVPLPPKNITAPSRPPPGLTGQ--KAPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1693
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1694 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1753
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTG----GWARGSSAL 163
AQ +L+ C+LGNTTI AE S+ E+ A + + G+ G S +
Sbjct: 1754 AQKSLHMCVLGNTTILAEFASEEEISRFFAQGQSLTPSPGWQPLGSSQSRLGSIDSSHSF 1813
Query: 164 SNKDT---WSSGGGGGNTS------QLWGTPSNPSSGGSLWGAP-PLDSVDRATPSSLNS 213
SN++ W+ G G +S LWGTP+ + SLWG P D+ ++PS +N+
Sbjct: 1814 SNRNDLNHWNGAGLSGTSSGDLHGTSLWGTPNYST---SLWGTPNSSDTRGISSPSPINA 1870
Query: 214 FLPGDLL--GGESM 225
FL D L GGESM
Sbjct: 1871 FLSVDHLGGGGESM 1884
>gi|157817055|ref|NP_001101019.1| trinucleotide repeat-containing gene 6A protein [Rattus norvegicus]
gi|149067989|gb|EDM17541.1| trinucleotide repeat containing 6 (predicted), isoform CRA_c [Rattus
norvegicus]
Length = 1954
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/249 (38%), Positives = 135/249 (54%), Gaps = 36/249 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1689 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1746
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1747 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1806
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1807 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1866
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1867 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1922
Query: 213 SFLPGDLLG 221
+FL D L
Sbjct: 1923 AFLSVDHLA 1931
>gi|395515519|ref|XP_003761950.1| PREDICTED: trinucleotide repeat-containing gene 6A protein
[Sarcophilus harrisii]
Length = 1978
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 103/254 (40%), Positives = 138/254 (54%), Gaps = 36/254 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G K P + W P GGG GN+ P W
Sbjct: 1730 AHELWKVPLPPKNITAPSRPPPGLTGQ--KAPLSTWDNSPLRIGGGWGNSDARYTPGSSW 1787
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1788 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1847
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTG----GWARGSSAL 163
AQ +L+ C+LGNTTI AE S+ E+ A + + G+ G S +
Sbjct: 1848 AQKSLHMCVLGNTTILAEFASEEEISRFFAQGQSLTPSPGWQPLGSSQSRLGSIDSSHSF 1907
Query: 164 SNKDT---WSSGGGGGNTS------QLWGTPSNPSSGGSLWGAP-PLDSVDRATPSSLNS 213
SN++ W+ G G +S LWGTP+ + SLWG P D+ ++PS +N+
Sbjct: 1908 SNRNDLNHWNGAGLSGTSSGDLHGTSLWGTPNYST---SLWGTPNSSDTRGISSPSPINA 1964
Query: 214 FLPGDLL--GGESM 225
FL D L GGESM
Sbjct: 1965 FLSVDHLGGGGESM 1978
>gi|301605745|ref|XP_002932511.1| PREDICTED: trinucleotide repeat-containing gene 6A protein [Xenopus
(Silurana) tropicalis]
Length = 1835
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/250 (38%), Positives = 136/250 (54%), Gaps = 32/250 (12%)
Query: 3 SNDLWGPP-------KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWS 55
+++LW P P PPPG+ G KPP + W GG G++ P WS
Sbjct: 1591 AHELWKVPLPSKNISAPSRPPPGLTGQ--KPPLSTWDTNSLRLGGWGSSDSRYTPGSTWS 1648
Query: 56 GT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKA 108
W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +KA
Sbjct: 1649 ENSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGTALVRYSSKEEVVKA 1708
Query: 109 QGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGS----SALS 164
Q +L+ C+LGNTTI AE S+ E+ A + + + G+G GS ++S
Sbjct: 1709 QKSLHMCVLGNTTILAEFASEEEISRFFAQGQSLTPSPGWQSLGSGHSRLGSLDSPHSIS 1768
Query: 165 NK---DTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSFL 215
N+ + W+S G G++ + LWGTP+ + SLWG P + ++PS + +FL
Sbjct: 1769 NRGDINHWNSPGASGSSSGDLHGTSLWGTPNYST---SLWGNPSNEGRGLSSPSPVPAFL 1825
Query: 216 PGDLLGGESM 225
D L GE M
Sbjct: 1826 SVDQLNGEPM 1835
>gi|158297475|ref|XP_001237966.2| AGAP007803-PA [Anopheles gambiae str. PEST]
gi|157015213|gb|EAU76399.2| AGAP007803-PA [Anopheles gambiae str. PEST]
Length = 1188
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 92/200 (46%), Positives = 115/200 (57%), Gaps = 17/200 (8%)
Query: 36 NGGGGGGNTWGTSQPQG-GWSG--TWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNH 92
N G GG GT Q +G GWS +W+LLKN T QID STL+TLC+QHGP+ FH Y H
Sbjct: 992 NKSGNGGTAAGTQQQRGAGWSTGTSWLLLKNFTSQIDASTLRTLCMQHGPILTFHSYPAH 1051
Query: 93 SLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAP-SDAEVQSLLAHLSATANNNNNNNG 151
LAL +Y+TREEA KAQ LNNC LG++TI AE P S++EVQ+ L L A +
Sbjct: 1052 GLALCRYATREEAAKAQQALNNCTLGSSTISAECPASESEVQTYLQQLGGAAAAASVAVS 1111
Query: 152 GTGG------WARG-SSALSNKDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDS-V 203
+ W + +S+ S DTW G G G ++ +LW PLD+
Sbjct: 1112 SSASSLTSPTWRQERTSSSSGADTW---GSGWAIGGSSGASGAGAAAANLWA--PLDAGT 1166
Query: 204 DRATPSSLNSFLPGDLLGGE 223
D TP+SLNSFLP LLG E
Sbjct: 1167 DSGTPTSLNSFLPDSLLGPE 1186
>gi|149067992|gb|EDM17544.1| trinucleotide repeat containing 6 (predicted), isoform CRA_f [Rattus
norvegicus]
Length = 1904
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/249 (38%), Positives = 135/249 (54%), Gaps = 36/249 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1639 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1696
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1697 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1756
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1757 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1816
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1817 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1872
Query: 213 SFLPGDLLG 221
+FL D L
Sbjct: 1873 AFLSVDHLA 1881
>gi|149067990|gb|EDM17542.1| trinucleotide repeat containing 6 (predicted), isoform CRA_d [Rattus
norvegicus]
Length = 1915
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/249 (38%), Positives = 135/249 (54%), Gaps = 36/249 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1650 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1707
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1708 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1767
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1768 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1827
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1828 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1883
Query: 213 SFLPGDLLG 221
+FL D L
Sbjct: 1884 AFLSVDHLA 1892
>gi|149067991|gb|EDM17543.1| trinucleotide repeat containing 6 (predicted), isoform CRA_e [Rattus
norvegicus]
Length = 1894
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/249 (38%), Positives = 135/249 (54%), Gaps = 36/249 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1629 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1686
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1687 GESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1746
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1747 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1806
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1807 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1862
Query: 213 SFLPGDLLG 221
+FL D L
Sbjct: 1863 AFLSVDHLA 1871
>gi|348584988|ref|XP_003478254.1| PREDICTED: trinucleotide repeat-containing gene 6A protein-like
[Cavia porcellus]
Length = 1924
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 101/255 (39%), Positives = 139/255 (54%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1676 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRVGGGWGNSDARYTPGSSW 1733
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ ++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1734 GESSSGRITNCLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1793
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1794 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1853
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1854 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1909
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1910 AFLSVDHLGGGGESM 1924
>gi|326668565|ref|XP_002662398.2| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
6C protein [Danio rerio]
Length = 1740
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 95/236 (40%), Positives = 125/236 (52%), Gaps = 28/236 (11%)
Query: 12 PRGPPPGMM--------GGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKN 63
P PPPG+ GG S GW N G TW + P G +W++L+N
Sbjct: 1511 PSRPPPGLTNTKPSSTWGGNSLGLSQGW----NNSYSSGGTWSSDSPNRG--SSWLVLRN 1564
Query: 64 LTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIF 123
LTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+LGNTTI
Sbjct: 1565 LTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVLGNTTIL 1624
Query: 124 AEAPSDAEVQSLLAH---LSATANNNNNNNGGTGGWARGSSALSNKDTWSSGGGGGNTSQ 180
AE S+ +V A L+ T + + G+ G+ S+ SGGGG +
Sbjct: 1625 AEFASEEDVNRFFAQGQSLTPTTSWQASPAPGSSQPRLGNPTASHPTGLWSGGGGTKSVC 1684
Query: 181 LWGTPSNPSSGGSLWG---------APP--LDSVDRATPSSLNSFLPGDLLGGESM 225
G S+ + G LWG APP D+ +P +N+ LPGDLL GESM
Sbjct: 1685 SAGNSSSGNGGDMLWGGVPQYSSLWAPPNGDDARVIGSPIPINTLLPGDLLSGESM 1740
>gi|351698083|gb|EHB01002.1| Trinucleotide repeat-containing gene 6C protein [Heterocephalus
glaber]
Length = 1984
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 95/231 (41%), Positives = 128/231 (55%), Gaps = 25/231 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1771 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1827
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1828 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1887
Query: 118 GNTTIFAEAPSDAEVQSLLAH---LSATANNNNNNNGGTGGWARGSSALSNKDTWSSGGG 174
GNTTI AE + EV LA L T++ ++ A + S++ W GGG
Sbjct: 1888 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQPSSGSSQPQAAPMACKGSSELLW--GGG 1945
Query: 175 GGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSFLPGDLLGGESM 225
+S LWG PS + G L G+P TP LN+ LPGDLL GES+
Sbjct: 1946 PQYSSSLWGPPS--TDDGRLIGSP--------TP--LNTLLPGDLLSGESI 1984
>gi|348558232|ref|XP_003464922.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
[Cavia porcellus]
Length = 1886
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 96/243 (39%), Positives = 128/243 (52%), Gaps = 30/243 (12%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1654 SHELWKVPRNTTAPTRPPPGL---ANPKPSSTWGPSPLGWTSSYSSGSAWSTDTSGRTSS 1710
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1711 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1770
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNN------------NGGTGGWARGSSALSN 165
GNTTI AE + EV LA A ++ +G T G R S+
Sbjct: 1771 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSNSGSSQSRLGASGSTHGLVR-----SD 1825
Query: 166 KDTWSSG--GGGGNTSQLWGTPSNPSSGGSLWGAP-PLDSVDRATPSSLNSFLPGDLLGG 222
W + G G++ LWG P SLWG P DS +P+ LN+ LPGDLL G
Sbjct: 1826 ATHWGAPCLGSKGSSELLWG--GGPQYSSSLWGPPGADDSRLIGSPTPLNTLLPGDLLSG 1883
Query: 223 ESM 225
ES+
Sbjct: 1884 ESI 1886
>gi|348530814|ref|XP_003452905.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
[Oreochromis niloticus]
Length = 1795
Score = 129 bits (324), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 81/180 (45%), Positives = 105/180 (58%), Gaps = 18/180 (10%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS+++EA KAQ +L+ C+L
Sbjct: 1622 WLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVVRYSSKDEAAKAQKSLHMCVL 1681
Query: 118 GNTTIFAEAPSDAEVQSLLAH---------LSATANNNNNNNGGTGGWARGSSALSNKDT 168
GNTTI AE + EV A AT N GGTG A S + +
Sbjct: 1682 GNTTILAEFAGEEEVNRFFAQGQSLGGTTSWQATPGTNQTRMGGTGSGA--SHPIGHSPH 1739
Query: 169 W-SSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRA--TPSSLNSFLPGDLLGGESM 225
W ++ G G++ LWG S SLWG PP R +P+ +N+ LPGDLL GESM
Sbjct: 1740 WNNNNNGAGSSKLLWGGVQQYS---SLWG-PPSGEEGRVMGSPTPINTLLPGDLLSGESM 1795
>gi|345312835|ref|XP_001517138.2| PREDICTED: trinucleotide repeat-containing gene 6C protein
[Ornithorhynchus anatinus]
Length = 1452
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 94/241 (39%), Positives = 123/241 (51%), Gaps = 26/241 (10%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1220 SHELWKVPRNTPAPTRPPPGLTNTK---PSSSWGAGPLGWTSSYSSGSAWSTDSSGRTSS 1276
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1277 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1336
Query: 118 GNTTIFAEAPSDAEVQSLLAH----------LSATANNNNN--NNGGTGGWARGSSALSN 165
GNTTI AE + EV LA S T N + GG G RG +
Sbjct: 1337 GNTTILAEFAGEEEVNRFLAQGQPLPPTSSWQSNTGTNQTRLGSTGGAHGLVRGDAG--- 1393
Query: 166 KDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGDLLGGES 224
W++ GG P SLWG P D +P+ LN+ LPGDLL GES
Sbjct: 1394 --HWNAPCLGGKGGGDLLWGGVPQYSSSLWGPPSAEDGRVVGSPTPLNTLLPGDLLSGES 1451
Query: 225 M 225
+
Sbjct: 1452 I 1452
>gi|225733942|pdb|2WBR|A Chain A, The Rrm Domain In Gw182 Proteins Contributes To Mirna-
Mediated Gene Silencing
Length = 89
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 58/86 (67%), Positives = 70/86 (81%)
Query: 53 GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNL 112
W +W+LLKNLT QIDG TL+TLC+QHGPL +FH YLN +AL KY+TREEA KAQ L
Sbjct: 4 AWGSSWLLLKNLTAQIDGPTLRTLCMQHGPLVSFHPYLNQGIALCKYTTREEANKAQMAL 63
Query: 113 NNCILGNTTIFAEAPSDAEVQSLLAH 138
NNC+L NTTIFAE+PS+ EVQS++ H
Sbjct: 64 NNCVLANTTIFAESPSENEVQSIMQH 89
>gi|432845644|ref|XP_004065839.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
[Oryzias latipes]
Length = 1968
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 133/255 (52%), Gaps = 50/255 (19%)
Query: 3 SNDLWGPPK-------PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWS 55
S++LW P+ P PPPG+ PS W GN+ G +Q GWS
Sbjct: 1732 SHELWKVPQGPRSTTAPSRPPPGLTNTK---PSTSW---------SGNSLGLTQ---GWS 1776
Query: 56 GT------------------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALA 97
G+ W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+
Sbjct: 1777 GSYSSEGTAWSTDTSNRTSSWLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVV 1836
Query: 98 KYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAH---LSATANNNNNNNGGTG 154
+YS+++EA KAQ +L+ C+LGNTTI AE + EV A L AT + + N G
Sbjct: 1837 RYSSKDEAAKAQKSLHMCVLGNTTILAEFAGEEEVNRFFAQGQSLGATTTSWHANPGPNQ 1896
Query: 155 GWARGSSALSNKDTWSSGGGGGNTSQ---LWGTPSNPSSGGSLWGAPP-LDSVDRATPSS 210
G+S + WSSG GGG + LWG S SLWG P D+ +P+
Sbjct: 1897 NRMGGASQSHSIGQWSSGAGGGKANGGDLLWGGVPQYS---SLWGPPNGEDARVIGSPTP 1953
Query: 211 LNSFLPGDLLGGESM 225
+N+ LPGDLL GESM
Sbjct: 1954 INTLLPGDLLSGESM 1968
>gi|441598125|ref|XP_004087437.1| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
6A protein [Nomascus leucogenys]
Length = 1938
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/255 (38%), Positives = 135/255 (52%), Gaps = 38/255 (14%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRP-NGGGGGGNTWGTSQPQGGW 54
+++LW PPK P PPPG+ G KPP + W P GGG GN+ P W
Sbjct: 1690 AHELWKVPLPPKNITAPSRPPPGLTGQ--KPPLSTWDNSPLRIGGGWGNSDARYTPGSFW 1747
Query: 55 SGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
+ W +L L P IDGSTL+TLC+QHGPL FHL L H AL +YS++EE +K
Sbjct: 1748 GESISWIITNWFVLNTLLPXIDGSTLRTLCMQHGPLITFHLNLPHGNALVRYSSKEEVVK 1807
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGS 160
AQ +L+ C+LGNTTI AE S+ E+ A + + ++ + G+ +
Sbjct: 1808 AQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLTPSPGWQSLGSSQSRLGSLDCSHSF 1867
Query: 161 SALSNKDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLN 212
S+ ++ + W+ G G + LWGTP + SLWG PP S R ++PS +N
Sbjct: 1868 SSRTDLNHWNGAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPIN 1923
Query: 213 SFLPGDLL--GGESM 225
+FL D L GGESM
Sbjct: 1924 AFLSVDHLGGGGESM 1938
>gi|348520957|ref|XP_003447993.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
[Oreochromis niloticus]
Length = 1844
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 101/256 (39%), Positives = 135/256 (52%), Gaps = 52/256 (20%)
Query: 3 SNDLWGPPK-------PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWS 55
S++LW P+ P PPPG+ PS+ W GGN+ G SQ GWS
Sbjct: 1608 SHELWKVPQGPRSTTAPSRPPPGLTNTK---PSSTW---------GGNSLGLSQ---GWS 1652
Query: 56 GT------------------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALA 97
G+ W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+
Sbjct: 1653 GSYSSEGTTWSTDSSNRTSSWLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVV 1712
Query: 98 KYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEV-------QSLLAHLSATANNNNNNN 150
+YS+++EA KAQ +L+ C+LGNTTI AE + EV QSL A+ ++ N N
Sbjct: 1713 RYSSKDEAAKAQKSLHMCVLGNTTILAEFAGEEEVNRFFAQGQSLGANTTSWQANPGTNQ 1772
Query: 151 GGTGGWARGSSALSNKDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPP-LDSVDRATPS 209
GG A+ S ++ + + GG LWG S SLWG P D+ +P+
Sbjct: 1773 NRMGGAAQ-SHSIGQWSSSAGGGKASGGDLLWGGVPQYS---SLWGPPSGEDARVIGSPT 1828
Query: 210 SLNSFLPGDLLGGESM 225
+N+ LPGDLL GESM
Sbjct: 1829 PINTLLPGDLLSGESM 1844
>gi|410917113|ref|XP_003972031.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
[Takifugu rubripes]
Length = 1858
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 96/242 (39%), Positives = 130/242 (53%), Gaps = 27/242 (11%)
Query: 3 SNDLWGPPK-------PRGPPPGMMGGGGKPPSNGWM-----VRP--NGGGGGGNTWGTS 48
S++LW P+ P PPPG+ PS+ W + P NG G TW T
Sbjct: 1625 SHELWKVPQGPRSTTAPSRPPPGLTNSK---PSSTWSGNSLGLAPGWNGSYSSGTTWSTD 1681
Query: 49 QPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKA 108
+ +W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS+++E+ KA
Sbjct: 1682 S--SNRTSSWLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVVRYSSKDESAKA 1739
Query: 109 QGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNN--NNNGGTGGWARGSSALSNK 166
Q +L+ C+LGNTTI AE + EV A + N N G+ G++ +
Sbjct: 1740 QKSLHMCVLGNTTILAEFAGEEEVNRFFAQGQSLGANTTSWQANQGSNQNRMGAAQSHSI 1799
Query: 167 DTWSSGGGGGNTSQ--LWGTPSNPSSGGSLWGAPP-LDSVDRATPSSLNSFLPGDLLGGE 223
WS GGGG + LWG S SLWG P D+ +P+ +N+ LPGDLL GE
Sbjct: 1800 GQWSGGGGGKTSGGDLLWGGVPQYS---SLWGPPSGEDARVIGSPTPINTLLPGDLLSGE 1856
Query: 224 SM 225
SM
Sbjct: 1857 SM 1858
>gi|312384469|gb|EFR29193.1| hypothetical protein AND_02090 [Anopheles darlingi]
Length = 1745
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/139 (48%), Positives = 90/139 (64%), Gaps = 14/139 (10%)
Query: 55 SGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNN 114
+ TW+LL+NLT QIDGSTL+TLC+QHGPL NF Y +HS+AL KY+TREEA KAQ LNN
Sbjct: 1592 ASTWILLRNLTAQIDGSTLRTLCMQHGPLLNFQPYTHHSVALCKYATREEAQKAQQALNN 1651
Query: 115 CILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSALSNKDTWSSGGG 174
C LGNTTI AE P++++VQ +L+ L GG S+ ++N T ++ GG
Sbjct: 1652 CPLGNTTICAEIPTESDVQYILSQL--------------GGSMNASNGMTNGLTGAASGG 1697
Query: 175 GGNTSQLWGTPSNPSSGGS 193
G N + S P + G+
Sbjct: 1698 GQNWRLVAAQQSQPPTPGA 1716
>gi|47208767|emb|CAF91958.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1579
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 95/245 (38%), Positives = 123/245 (50%), Gaps = 30/245 (12%)
Query: 3 SNDLWGPPK-------PRGPPPGMMGGGGKPPSNGWMVRPNGGGGG--------GNTWGT 47
S++LW P+ P PPPG+ PS+ W G G G TW T
Sbjct: 1343 SHELWKVPQGPRSSTAPSRPPPGLTNSK---PSSTWGGSSLGLAPGWTGSYSSEGTTWST 1399
Query: 48 SQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIK 107
G + +W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS+++EA K
Sbjct: 1400 DS--GNRTSSWLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVVRYSSKDEAAK 1457
Query: 108 AQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANN------NNNNNGGTGGWARGSS 161
AQ +L+ C+LGNTTI AE + EV A + N N N G A+ S
Sbjct: 1458 AQKSLHMCVLGNTTILAEFAGEEEVNRFFAQGQSLGANTTSWQANPGTNQNRMGAAQSHS 1517
Query: 162 ALSNKDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPP-LDSVDRATPSSLNSFLPGDLL 220
GG LWG S SLWG P D+ +P+ +N+ LPGDLL
Sbjct: 1518 IGQWGSGGGGGGKASGGDLLWGGVPQYS---SLWGPPSGEDARVIGSPTPINTLLPGDLL 1574
Query: 221 GGESM 225
GESM
Sbjct: 1575 SGESM 1579
>gi|431908494|gb|ELK12089.1| Trinucleotide repeat-containing protein 6A protein [Pteropus alecto]
Length = 1848
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 98/251 (39%), Positives = 130/251 (51%), Gaps = 41/251 (16%)
Query: 3 SNDLWG---PPK----PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQ---- 51
+++LW PPK P PPPG+ G KPP + W P GGG WG++ +
Sbjct: 1611 AHELWKVPLPPKSITAPSRPPPGLTGQ--KPPLSAWDPAPLRVGGG---WGSADARYTPG 1665
Query: 52 -------GGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREE 104
G W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL +Y ++EE
Sbjct: 1666 SSWGESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLVTFHLSLPHGNALVRYGSKEE 1725
Query: 105 AIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSALS 164
+KAQ +L+ C+LGNTTI AE S+ E+ A + A + + G+G G S
Sbjct: 1726 VVKAQKSLHMCVLGNTTILAEFASEEEISRFFAQSQSLAPAPSWQSLGSGQSRLGPLDCS 1785
Query: 165 N--KDTWSSGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSFLP 216
+ W+ G G + + LWG P SLWG + PS +N+FL
Sbjct: 1786 HPFSSHWNGAGLSGTSCGDLPGASLWG---GPHYSASLWGP-----PSSSDPSPINAFLS 1837
Query: 217 GDLL--GGESM 225
D L GGESM
Sbjct: 1838 VDHLGGGGESM 1848
>gi|351702889|gb|EHB05808.1| Trinucleotide repeat-containing gene 6A protein [Heterocephalus
glaber]
Length = 1787
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 76/180 (42%), Positives = 106/180 (58%), Gaps = 19/180 (10%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H AL YS++EE +KAQ +L+ C+L
Sbjct: 1555 WLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLLHGNALVCYSSKEEVVKAQKSLHMCVL 1614
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANN-------NNNNNGGTGGWARGSSALSNKDTWS 170
GNTTI AE S+ E+ A + + ++ + G+ + S+ ++ + W+
Sbjct: 1615 GNTTILAEFASEEEISRFFAQGQSLTPSPGWQSLGSSQSRLGSLDCSHAFSSRTDLNHWN 1674
Query: 171 SGGGGGNT------SQLWGTPSNPSSGGSLWGAPPLDSVDR--ATPSSLNSFLPGDLLGG 222
G G + LWGTP + SLWG PP S R ++PS +N+FL D LGG
Sbjct: 1675 GAGLSGTNCGDLHGTSLWGTPHYST---SLWG-PPSSSDPRGISSPSPINAFLSVDHLGG 1730
>gi|170052152|ref|XP_001862092.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167873117|gb|EDS36500.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1503
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 80/142 (56%), Positives = 94/142 (66%), Gaps = 6/142 (4%)
Query: 5 DLWGPP--KP-RGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLL 61
DLWG P KP RGPPPG+ G +NGW GG N+ G GW +W+LL
Sbjct: 1271 DLWGTPMGKPTRGPPPGL---GANKNANGWAGGAGGGPQRSNSGGNWPGGNGWGSSWLLL 1327
Query: 62 KNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTT 121
KNLT QIDG+TL+TLC+QHGPLQ+ LY NH LAL KYS+REEA KAQ LNNC LG+T
Sbjct: 1328 KNLTSQIDGATLRTLCMQHGPLQSLQLYPNHGLALCKYSSREEASKAQQALNNCPLGSTN 1387
Query: 122 IFAEAPSDAEVQSLLAHLSATA 143
I AE PS+A+ Q+ L L A A
Sbjct: 1388 IGAECPSEADAQTYLQQLGAPA 1409
>gi|410902161|ref|XP_003964563.1| PREDICTED: trinucleotide repeat-containing gene 6A protein-like
[Takifugu rubripes]
Length = 1162
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 82/184 (44%), Positives = 106/184 (57%), Gaps = 28/184 (15%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H A+ YS+++EA KAQ +L+ C+L
Sbjct: 991 WLVLKNLTPQIDGSTLRTLCMQHGPLNTFHLNLPHGNAVVCYSSKDEAAKAQKSLHMCVL 1050
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSALSNKD---------- 167
GNTTI AE S+ E+ A + A + GW S+ S D
Sbjct: 1051 GNTTILAEFASEEEINRFFAQGQSLAT-------PSSGWQAVGSSQSRMDQSHHFPSRAP 1103
Query: 168 ---TWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDR-ATPSSLNSFLPGDLL--G 221
W+S ++S LWG SN SS SLWG P R ++PS ++SFLP D L G
Sbjct: 1104 EPNQWNS--SDLHSSSLWGG-SNYSS--SLWGTPGGTETGRMSSPSPISSFLPVDHLAGG 1158
Query: 222 GESM 225
G+SM
Sbjct: 1159 GDSM 1162
>gi|355725522|gb|AES08584.1| trinucleotide repeat containing 6C [Mustela putorius furo]
Length = 431
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 85/213 (39%), Positives = 115/213 (53%), Gaps = 19/213 (8%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 216 SHELWKVPRSTAAPTRPPPGL---ANPKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 272
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 273 WLVLRNLTPQIDGSTLRTLCLQHGPLVTFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 332
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNN-NNNGGTGGWARGSS------ALSNKDTWS 170
GNTTI AE + EV LA A ++ ++ GTG G+S S+ WS
Sbjct: 333 GNTTILAEFAGEEEVNRFLAQGQALPPTSSWQSSTGTGQTRLGASGSSHGLVRSDAGHWS 392
Query: 171 SG--GGGGNTSQLWGTPSNPSSGGSLWGAPPLD 201
+ G G++ LWG P SLWG P D
Sbjct: 393 APCLAGKGSSDLLWG--GVPQYSSSLWGPPSSD 423
>gi|432869446|ref|XP_004071751.1| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
[Oryzias latipes]
Length = 1798
Score = 119 bits (298), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 97/261 (37%), Positives = 130/261 (49%), Gaps = 61/261 (23%)
Query: 3 SNDLWGPPK--------PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGW 54
S++LW P+ P PPPG+ PS+ W GGN+ G +Q GW
Sbjct: 1561 SHELWKVPQGPRSGTAAPSRPPPGLTNTK---PSSTW---------GGNSLGLAQ---GW 1605
Query: 55 S-----------------GTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALA 97
S +W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+
Sbjct: 1606 SNSYTAGTTWSTDSSTRASSWLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVV 1665
Query: 98 KYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAH---------LSATANNNNN 148
+YS+++E+ KAQ +L+ C+LGNTTI AE + EV A A+ N +
Sbjct: 1666 RYSSKDESAKAQKSLHMCVLGNTTILAEFAGEEEVNRFFAQGQSLGGTTSWQASPGTNQS 1725
Query: 149 NNGGTGGWARGSSALSNKDTWSSGGGGGNTSQ--LWGTPSNPSSGGSLWGAPPLDSVDRA 206
GG G + W+S G ++S LWG S SLWG PP R
Sbjct: 1726 RMGGAG----AHHPIGQSPHWNSNSNGSSSSSKLLWGGVQQYS---SLWG-PPSGEEGRV 1777
Query: 207 --TPSSLNSFLPGDLLGGESM 225
+P+ +N+ LPGDLL GESM
Sbjct: 1778 MGSPTPINTLLPGDLLSGESM 1798
>gi|326666077|ref|XP_689365.4| PREDICTED: trinucleotide repeat-containing gene 6C protein-like
[Danio rerio]
Length = 1696
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/187 (43%), Positives = 103/187 (55%), Gaps = 25/187 (13%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1516 WLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1575
Query: 118 GNTTIFAEAPSDAEV-------QSLLAHLSATANNNNNNNGGTGGWARGSSALSNKDTWS 170
GNTTI AE + EV QSL S AN N GG G++A W+
Sbjct: 1576 GNTTILAEFAGEEEVNRFFAQGQSLTPTTSWQANPGTNQTRLGGG---GTAATHPIGHWN 1632
Query: 171 SGGGG-----------GNTSQLWGTPSNPSSGGSLWGAPPL-DSVDRATPSSLNSFLPGD 218
S G + LWG S SLWG P D +P+ +N+ LPGD
Sbjct: 1633 SSSLGGGGAGTGSGGKASNELLWGGVPQYS---SLWGPPSAEDGRVVGSPTPINTLLPGD 1689
Query: 219 LLGGESM 225
LL GESM
Sbjct: 1690 LLSGESM 1696
>gi|426239229|ref|XP_004013528.1| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
6C protein [Ovis aries]
Length = 1856
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 83/228 (36%), Positives = 110/228 (48%), Gaps = 46/228 (20%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1670 SHELWKVPRNTTAPTRPPPGLTN---PKPSSAWGASPLGWTSSYSSGSAWSTDASGRTSS 1726
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1727 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1786
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSALSNKDTWSSGGGGGN 177
GNTTI AE + EV LA AL +W
Sbjct: 1787 GNTTILAEFAGEEEVNRFLAQ---------------------GQALPPTSSW-------- 1817
Query: 178 TSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSSLNSFLPGDLLGGESM 225
PS G S D +P+ +N+ LPGDLL GES+
Sbjct: 1818 ---------QPSPGTSQTRLSSDDGRVIGSPTPVNTLLPGDLLSGESI 1856
>gi|405965787|gb|EKC31141.1| hypothetical protein CGI_10028774 [Crassostrea gigas]
Length = 1616
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 70/147 (47%), Positives = 90/147 (61%), Gaps = 12/147 (8%)
Query: 3 SNDLWGPPKPRG---PPPGMMGGGGKPPSNGWM-VRPNGGGGGGNTWGTSQPQGGWSG-- 56
SN++WG P P+ PPPG++ P S W V G + S W G
Sbjct: 1396 SNEVWGVPLPKNNSRPPPGLL-----PKSGNWTGVNRQHSWAGTTSSMLSGNSAAWDGIS 1450
Query: 57 TWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCI 116
T ++LKNLTPQIDGSTL+TLC+QHGPLQ F+L L++ AL +Y ++EEA KAQ +LN C+
Sbjct: 1451 TCLMLKNLTPQIDGSTLRTLCMQHGPLQWFYLSLHNGQALVRYHSKEEAFKAQKSLNTCV 1510
Query: 117 LGNTTIFAEAPSDAEVQSLLAHLSATA 143
LGNTTI A S+AE + A SA A
Sbjct: 1511 LGNTTIVANFVSEAEA-TRFAEQSAMA 1536
>gi|432871522|ref|XP_004071958.1| PREDICTED: trinucleotide repeat-containing gene 6A protein-like
[Oryzias latipes]
Length = 1840
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 74/178 (41%), Positives = 98/178 (55%), Gaps = 17/178 (9%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++LKNLTPQIDGSTL+TLC+QHGPL FHL L H A+ YS+++EA KAQ +L+ C+L
Sbjct: 1670 WLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPHGNAVVCYSSKDEATKAQKSLHMCVL 1729
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSALSNKDTWSSGGGGGN 177
GNTTI AE S+ E+ A + A T GW S+ S D S +
Sbjct: 1730 GNTTIMAEFASEEEISRFFAQGQSLAT-------PTSGWQAIGSSQSRMDQSQSFPSRAS 1782
Query: 178 TSQLWGT-------PSNPSSGGSLWGAPPLDSVDRA-TPSSLNSFLPGDLL--GGESM 225
W + + S +LWG P R +PS ++SFLP D L GG+S+
Sbjct: 1783 EPNQWNSGELHGSSLWSRSYSSTLWGNPSSADPGRINSPSPISSFLPVDHLTGGGDSL 1840
>gi|47211044|emb|CAF93674.1| unnamed protein product [Tetraodon nigroviridis]
Length = 802
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 96/276 (34%), Positives = 129/276 (46%), Gaps = 74/276 (26%)
Query: 3 SNDLWGPPK--------PRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGGW 54
S+DLW P+ P PPPG+ P++ W GG + G +Q GW
Sbjct: 548 SHDLWKVPQAPRSANTAPSRPPPGLTN---TKPASTW---------GGTSLGLAQ---GW 592
Query: 55 SGT-----------------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALA 97
S + W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+
Sbjct: 593 SSSYTTGTTWSTDSSTRTSSWLVLRNLTPQIDGSTLRTLCMQHGPLITFHLNLTQGSAVV 652
Query: 98 KYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAH---------LSATANNNNN 148
+YS+++EA KAQ +L+ C+LGNTTI AE + EV A AT N
Sbjct: 653 RYSSKDEAAKAQKSLHMCVLGNTTILAEFAGEEEVNRFFAQGQLLGGTTSWQATPGTNQT 712
Query: 149 NNGGTGGWARGSSALSNKDTWSSGGGGGNTSQ-----------------LWGTPSNPSSG 191
GG A + + + W++ G N++ LWG S
Sbjct: 713 RMGGASSGA--AHPIGHSSHWNNNNNGSNSNSSSNSSGGGGAAKTGGELLWGGVQQYS-- 768
Query: 192 GSLWGAPPLDSVDRA--TPSSLNSFLPGDLLGGESM 225
SLW PP R +P+ +N+ LPGDLL GESM
Sbjct: 769 -SLW-RPPSAEEGRVMGSPTPINTLLPGDLLSGESM 802
>gi|47219653|emb|CAG02698.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1835
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 81/184 (44%), Positives = 105/184 (57%), Gaps = 28/184 (15%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++LKNLTPQIDGSTLKTLC+QHGPL FHL L H A+ YS+++EA KAQ +L+ C+L
Sbjct: 1664 WLVLKNLTPQIDGSTLKTLCMQHGPLITFHLNLPHGNAVVCYSSKDEAAKAQKSLHMCVL 1723
Query: 118 GNTTIFAEAPSDAEVQSLLAHLSATANNNNNNNGGTGGWARGSSALSNKD---------- 167
GNTTI AE S+ E+ A + A + GW S+ S D
Sbjct: 1724 GNTTILAEFASEEEINRFFAQGQSLAT-------PSSGWQAVGSSQSRMDQSHHFPSRAP 1776
Query: 168 ---TWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDR-ATPSSLNSFLPGDLL--G 221
W+S ++S LWG P+ S SLWG P R ++PS ++SFLP D L G
Sbjct: 1777 EPSQWNS--SDLHSSSLWGGPNYSS---SLWGTPGGSEAGRISSPSPISSFLPVDHLTGG 1831
Query: 222 GESM 225
G+SM
Sbjct: 1832 GDSM 1835
>gi|327289345|ref|XP_003229385.1| PREDICTED: trinucleotide repeat-containing gene 6A protein-like
[Anolis carolinensis]
Length = 1965
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 94/156 (60%), Gaps = 23/156 (14%)
Query: 4 NDLWG---PPK----PRGPPPGMMGGGGKPPSNGW---MVRPNGGGGGGNTWGTSQ---- 49
++LW PPK P PPPG+ G G P + W + R GGGGGG W S+
Sbjct: 1695 HELWKVPLPPKSVAAPSRPPPGLTGQKG--PLSSWENPLQRFGGGGGGGAGWSASEGRYT 1752
Query: 50 PQGGWSGT-------WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTR 102
P W + ++LKNLTPQIDGSTL+TLC+QHGPL+ FHL L H AL +YS++
Sbjct: 1753 PGSAWGESSSGRITNCLVLKNLTPQIDGSTLRTLCMQHGPLKTFHLNLPHGNALVRYSSK 1812
Query: 103 EEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAH 138
EE +KAQ +L+ C+LGNTTI AE S+ E+ A
Sbjct: 1813 EEVVKAQKSLHMCVLGNTTILAEFASEEEISRFFAQ 1848
>gi|432113374|gb|ELK35786.1| Trinucleotide repeat-containing protein 6C protein [Myotis davidii]
Length = 1695
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 64/141 (45%), Positives = 85/141 (60%), Gaps = 8/141 (5%)
Query: 3 SNDLWGPPK----PRGPPPGMMGGGGKPPSNGWMVRPNG-GGGGGNTWGTSQPQGGWSGT 57
S++LW P+ P PPPG+ PS+ W P G + S G + +
Sbjct: 1516 SHELWKVPRNTTAPTRPPPGLTN---PKPSSTWGASPLGWTSSYSSGSAWSTDTSGRTSS 1572
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC+QHGPL FHL L A+ +YS++EEA KAQ +L+ C+L
Sbjct: 1573 WLVLRNLTPQIDGSTLRTLCLQHGPLITFHLNLTQGNAVVRYSSKEEAAKAQKSLHMCVL 1632
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE + EV LA
Sbjct: 1633 GNTTILAEFAGEEEVNRFLAQ 1653
>gi|47207588|emb|CAF90193.1| unnamed protein product [Tetraodon nigroviridis]
Length = 319
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/104 (53%), Positives = 69/104 (66%), Gaps = 2/104 (1%)
Query: 40 GGGNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKY 99
G G+ W GG W+LL NLTPQIDGSTL+T+C+QHGPL FHL L AL +Y
Sbjct: 201 GPGSPWNEGVSTGG--SCWLLLSNLTPQIDGSTLRTICMQHGPLLTFHLGLTQGSALIRY 258
Query: 100 STREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATA 143
S+++EA+KAQG L+ C+LGNTTI AE S+ EV AH A
Sbjct: 259 SSQQEAVKAQGALHMCVLGNTTILAEFVSEDEVARYFAHSQAEV 302
>gi|426225804|ref|XP_004007052.1| PREDICTED: trinucleotide repeat-containing gene 6B protein [Ovis
aries]
Length = 1844
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1660 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1719
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1720 GNTTILAEFATDDEVSRFLAQ 1740
>gi|326672343|ref|XP_002663990.2| PREDICTED: trinucleotide repeat-containing gene 6B protein [Danio
rerio]
Length = 2020
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YS+R+EA KAQ L+ C+L
Sbjct: 1779 WLVLSNLTPQIDGSTLRTICMQHGPLLTFHLGLTQGSALIRYSSRQEAAKAQSALHMCVL 1838
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE S+ EV AH
Sbjct: 1839 GNTTILAEFVSEEEVARYFAH 1859
>gi|73969030|ref|XP_859340.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 2
[Canis lupus familiaris]
Length = 1726
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1542 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1601
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1602 GNTTILAEFATDDEVSRFLAQ 1622
>gi|119580772|gb|EAW60368.1| trinucleotide repeat containing 6B, isoform CRA_b [Homo sapiens]
gi|119580773|gb|EAW60369.1| trinucleotide repeat containing 6B, isoform CRA_b [Homo sapiens]
Length = 1527
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1343 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1402
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1403 GNTTILAEFATDDEVSRFLAQ 1423
>gi|338721305|ref|XP_001500122.3| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 1
[Equus caballus]
Length = 1726
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1542 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1601
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1602 GNTTILAEFATDDEVSRFLAQ 1622
>gi|296486914|tpg|DAA29027.1| TPA: trinucleotide repeat containing 6B [Bos taurus]
Length = 1836
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1652 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1711
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1712 GNTTILAEFATDDEVSRFLAQ 1732
>gi|440903036|gb|ELR53750.1| Trinucleotide repeat-containing 6B protein, partial [Bos grunniens
mutus]
Length = 1835
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1651 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1710
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1711 GNTTILAEFATDDEVSRFLAQ 1731
>gi|338721304|ref|XP_003364347.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 2
[Equus caballus]
Length = 1836
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1652 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1711
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1712 GNTTILAEFATDDEVSRFLAQ 1732
>gi|119580777|gb|EAW60373.1| trinucleotide repeat containing 6B, isoform CRA_e [Homo sapiens]
Length = 1759
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1575 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1634
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1635 GNTTILAEFATDDEVSRFLAQ 1655
>gi|410349311|gb|JAA41259.1| trinucleotide repeat containing 6B [Pan troglodytes]
Length = 1722
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1538 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1597
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1598 GNTTILAEFATDDEVSRFLAQ 1618
>gi|351699314|gb|EHB02233.1| Trinucleotide repeat-containing gene 6B protein, partial
[Heterocephalus glaber]
Length = 1827
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1643 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1702
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1703 GNTTILAEFATDDEVSRFLAQ 1723
>gi|300798505|ref|NP_001179584.1| trinucleotide repeat-containing gene 6B protein [Bos taurus]
Length = 1836
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1652 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1711
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1712 GNTTILAEFATDDEVSRFLAQ 1732
>gi|431900059|gb|ELK07994.1| Trinucleotide repeat-containing protein 6B protein [Pteropus alecto]
Length = 1885
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1701 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1760
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1761 GNTTILAEFATDDEVSRFLAQ 1781
>gi|426394558|ref|XP_004063560.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 1
[Gorilla gorilla gorilla]
Length = 1723
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1539 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1598
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1599 GNTTILAEFATDDEVSRFLAQ 1619
>gi|148491080|ref|NP_055903.2| trinucleotide repeat-containing gene 6B protein isoform 2 [Homo
sapiens]
Length = 1723
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1539 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1598
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1599 GNTTILAEFATDDEVSRFLAQ 1619
>gi|403282937|ref|XP_003932888.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 2
[Saimiri boliviensis boliviensis]
Length = 1833
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1649 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1708
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1709 GNTTILAEFATDDEVSRFLAQ 1729
>gi|395753472|ref|XP_002831212.2| PREDICTED: trinucleotide repeat-containing gene 6B protein [Pongo
abelii]
Length = 1790
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1606 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1665
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1666 GNTTILAEFATDDEVSRFLAQ 1686
>gi|348569274|ref|XP_003470423.1| PREDICTED: trinucleotide repeat-containing gene 6B protein, partial
[Cavia porcellus]
Length = 1811
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1627 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1686
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1687 GNTTILAEFATDDEVSRFLAQ 1707
>gi|168269678|dbj|BAG09966.1| trinucleotide repeat-containing 6B protein [synthetic construct]
Length = 1722
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1538 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1597
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1598 GNTTILAEFATDDEVSRFLAQ 1618
>gi|397502026|ref|XP_003821672.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 1
[Pan paniscus]
Length = 1723
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1539 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1598
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1599 GNTTILAEFATDDEVSRFLAQ 1619
>gi|395819715|ref|XP_003783225.1| PREDICTED: trinucleotide repeat-containing gene 6B protein [Otolemur
garnettii]
Length = 1837
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1653 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1712
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1713 GNTTILAEFATDDEVSRFLAQ 1733
>gi|297261128|ref|XP_001101111.2| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 2
[Macaca mulatta]
Length = 1832
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1648 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1707
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1708 GNTTILAEFATDDEVSRFLAQ 1728
>gi|426394560|ref|XP_004063561.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 2
[Gorilla gorilla gorilla]
Length = 1833
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1649 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1708
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1709 GNTTILAEFATDDEVSRFLAQ 1729
>gi|241982729|ref|NP_001155973.1| trinucleotide repeat-containing gene 6B protein isoform 1 [Homo
sapiens]
gi|229904901|sp|Q9UPQ9.4|TNR6B_HUMAN RecName: Full=Trinucleotide repeat-containing gene 6B protein
gi|194377566|dbj|BAG57731.1| unnamed protein product [Homo sapiens]
Length = 1833
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1649 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1708
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1709 GNTTILAEFATDDEVSRFLAQ 1729
>gi|397502028|ref|XP_003821673.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 2
[Pan paniscus]
Length = 1833
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1649 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1708
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1709 GNTTILAEFATDDEVSRFLAQ 1729
>gi|410965583|ref|XP_003989326.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 1
[Felis catus]
Length = 1840
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1656 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1715
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1716 GNTTILAEFATDDEVSRFLAQ 1736
>gi|383416819|gb|AFH31623.1| trinucleotide repeat-containing gene 6B protein isoform 1 [Macaca
mulatta]
Length = 1775
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1591 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1650
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1651 GNTTILAEFATDDEVSRFLAQ 1671
>gi|345776961|ref|XP_538361.3| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 1
[Canis lupus familiaris]
Length = 1836
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1652 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1711
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1712 GNTTILAEFATDDEVSRFLAQ 1732
>gi|296237974|ref|XP_002763956.1| PREDICTED: trinucleotide repeat-containing gene 6B protein
[Callithrix jacchus]
Length = 1834
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1650 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1709
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1710 GNTTILAEFATDDEVSRFLAQ 1730
>gi|383416817|gb|AFH31622.1| trinucleotide repeat-containing gene 6B protein isoform 2 [Macaca
mulatta]
Length = 1722
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1538 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1597
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1598 GNTTILAEFATDDEVSRFLAQ 1618
>gi|14133235|dbj|BAA83045.2| KIAA1093 protein [Homo sapiens]
Length = 1727
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1543 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1602
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1603 GNTTILAEFATDDEVSRFLAQ 1623
>gi|410965585|ref|XP_003989327.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform 2
[Felis catus]
Length = 1730
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1546 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1605
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1606 GNTTILAEFATDDEVSRFLAQ 1626
>gi|355563693|gb|EHH20255.1| hypothetical protein EGK_03069 [Macaca mulatta]
gi|355785008|gb|EHH65859.1| hypothetical protein EGM_02715 [Macaca fascicularis]
Length = 1846
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1662 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1721
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1722 GNTTILAEFATDDEVSRFLAQ 1742
>gi|327272521|ref|XP_003221033.1| PREDICTED: trinucleotide repeat-containing gene 6B protein-like
[Anolis carolinensis]
Length = 1028
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 845 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 904
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 905 GNTTILAEFATDEEVSRFLAQ 925
>gi|444723822|gb|ELW64452.1| Trinucleotide repeat-containing 6B protein [Tupaia chinensis]
Length = 2247
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 2063 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 2122
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 2123 GNTTILAEFATDDEVSRFLAQ 2143
>gi|67782330|ref|NP_001020014.1| trinucleotide repeat-containing gene 6B protein isoform 3 [Homo
sapiens]
gi|20306948|gb|AAH28626.1| TNRC6B protein [Homo sapiens]
gi|119580771|gb|EAW60367.1| trinucleotide repeat containing 6B, isoform CRA_a [Homo sapiens]
gi|119580775|gb|EAW60371.1| trinucleotide repeat containing 6B, isoform CRA_a [Homo sapiens]
Length = 1029
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 845 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 904
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 905 GNTTILAEFATDDEVSRFLAQ 925
>gi|332231295|ref|XP_003264834.1| PREDICTED: trinucleotide repeat-containing gene 6B protein
[Nomascus leucogenys]
Length = 1029
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 845 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 904
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 905 GNTTILAEFATDDEVSRFLAQ 925
>gi|332859853|ref|XP_003317299.1| PREDICTED: trinucleotide repeat-containing gene 6B protein [Pan
troglodytes]
Length = 1028
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 844 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 903
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 904 GNTTILAEFATDDEVSRFLAQ 924
>gi|338721307|ref|XP_003364348.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform
3 [Equus caballus]
Length = 1032
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 848 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 907
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 908 GNTTILAEFATDDEVSRFLAQ 928
>gi|410965587|ref|XP_003989328.1| PREDICTED: trinucleotide repeat-containing gene 6B protein isoform
3 [Felis catus]
Length = 1036
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 852 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 911
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 912 GNTTILAEFATDDEVSRFLAQ 932
>gi|449481809|ref|XP_004175955.1| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
6B protein [Taeniopygia guttata]
Length = 1831
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +Y+T++EA KAQ L+ C+L
Sbjct: 1646 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYNTKQEAAKAQTALHMCVL 1705
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1706 GNTTILAEFATDEEVSRFLAQ 1726
>gi|355725504|gb|AES08578.1| trinucleotide repeat containing 6B [Mustela putorius furo]
Length = 452
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 269 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 328
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 329 GNTTILAEFATDDEVSRFLAQ 349
>gi|449271928|gb|EMC82102.1| Trinucleotide repeat-containing gene 6B protein, partial [Columba
livia]
Length = 1667
Score = 109 bits (273), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +Y+T++EA KAQ L+ C+L
Sbjct: 1483 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYNTKQEAAKAQTALHMCVL 1542
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1543 GNTTILAEFATDEEVSRFLAQ 1563
>gi|402884320|ref|XP_003905634.1| PREDICTED: trinucleotide repeat-containing gene 6B protein-like,
partial [Papio anubis]
Length = 459
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 275 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 334
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 335 GNTTILAEFATDDEVSRFLAQ 355
>gi|334347952|ref|XP_003342002.1| PREDICTED: trinucleotide repeat-containing gene 6B protein-like
[Monodelphis domestica]
Length = 1797
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 48/79 (60%), Positives = 59/79 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1610 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1669
Query: 118 GNTTIFAEAPSDAEVQSLL 136
GNTTI AE +D EV L
Sbjct: 1670 GNTTILAEFATDEEVSRFL 1688
>gi|281351171|gb|EFB26755.1| hypothetical protein PANDA_002541 [Ailuropoda melanoleuca]
Length = 1690
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +Y+T++EA KAQ L+ C+L
Sbjct: 1506 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYNTKQEAAKAQTALHMCVL 1565
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE +D EV LA
Sbjct: 1566 GNTTILAEFATDDEVSRFLAQ 1586
>gi|395538134|ref|XP_003771040.1| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
6B protein [Sarcophilus harrisii]
Length = 1823
Score = 109 bits (272), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 48/79 (60%), Positives = 59/79 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1644 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1703
Query: 118 GNTTIFAEAPSDAEVQSLL 136
GNTTI AE +D EV L
Sbjct: 1704 GNTTILAEFATDEEVSRFL 1722
>gi|159110982|ref|NP_796098.3| trinucleotide repeat-containing gene 6B protein isoform 2 [Mus
musculus]
Length = 1774
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1590 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1649
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 1650 GNTTILAEFATEDEVSRFLAQ 1670
>gi|67782332|ref|NP_659061.2| trinucleotide repeat-containing gene 6B protein isoform 1 [Mus
musculus]
gi|229891742|sp|Q8BKI2.2|TNR6B_MOUSE RecName: Full=Trinucleotide repeat-containing gene 6B protein
Length = 1810
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1626 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1685
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 1686 GNTTILAEFATEDEVSRFLAQ 1706
>gi|148672646|gb|EDL04593.1| trinucleotide repeat containing 6b, isoform CRA_a [Mus musculus]
Length = 1817
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1633 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1692
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 1693 GNTTILAEFATEDEVSRFLAQ 1713
>gi|198041672|ref|NP_620200.2| trinucleotide repeat-containing gene 6B protein [Rattus norvegicus]
Length = 1818
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1634 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1693
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 1694 GNTTILAEFATEDEVSRFLAQ 1714
>gi|344246759|gb|EGW02863.1| Trinucleotide repeat-containing gene 6B protein [Cricetulus griseus]
Length = 1810
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1626 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1685
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 1686 GNTTILAEFATEDEVSRFLAQ 1706
>gi|354490746|ref|XP_003507517.1| PREDICTED: trinucleotide repeat-containing gene 6B protein
[Cricetulus griseus]
Length = 1913
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1729 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1788
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 1789 GNTTILAEFATEDEVSRFLAQ 1809
>gi|344296348|ref|XP_003419871.1| PREDICTED: trinucleotide repeat-containing gene 6B protein [Loxodonta
africana]
Length = 1557
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/79 (60%), Positives = 59/79 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 1373 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 1432
Query: 118 GNTTIFAEAPSDAEVQSLL 136
GNTTI AE +D EV L
Sbjct: 1433 GNTTILAEFATDDEVSRFL 1451
>gi|26342470|dbj|BAC34897.1| unnamed protein product [Mus musculus]
Length = 812
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 628 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 687
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 688 GNTTILAEFATEDEVSRFLAQ 708
>gi|149065870|gb|EDM15743.1| androgen receptor-related apoptosis-associated protein CBL27,
isoform CRA_a [Rattus norvegicus]
Length = 1005
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 821 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 880
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 881 GNTTILAEFATEDEVSRFLAQ 901
>gi|74184073|dbj|BAE37059.1| unnamed protein product [Mus musculus]
Length = 838
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 654 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 713
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 714 GNTTILAEFATEDEVSRFLAQ 734
>gi|74183955|dbj|BAE37027.1| unnamed protein product [Mus musculus]
Length = 731
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 547 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 606
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 607 GNTTILAEFATEDEVSRFLAQ 627
>gi|28972612|dbj|BAC65722.1| mKIAA1093 protein [Mus musculus]
Length = 571
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 387 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 446
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 447 GNTTILAEFATEDEVSRFLAQ 467
>gi|38197636|gb|AAH61751.1| Tnrc6b protein [Rattus norvegicus]
Length = 432
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 248 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 307
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 308 GNTTILAEFATEDEVSRFLAQ 328
>gi|51873840|gb|AAH80750.1| Tnrc6b protein, partial [Mus musculus]
Length = 312
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 128 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 187
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 188 GNTTILAEFATEDEVSRFLAQ 208
>gi|9295520|gb|AAF86977.1|AF275151_1 androgen receptor-related apoptosis-associated protein CBL27
[Rattus norvegicus]
gi|109730969|gb|AAI17549.1| Tnrc6b protein [Mus musculus]
gi|109735037|gb|AAI18055.1| Tnrc6b protein [Mus musculus]
Length = 249
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 65 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 124
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 125 GNTTILAEFATEDEVSRFLAQ 145
>gi|17028428|gb|AAH17531.1| Tnrc6b protein [Mus musculus]
gi|26341766|dbj|BAC34545.1| unnamed protein product [Mus musculus]
Length = 249
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+ C+L
Sbjct: 65 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVL 124
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE ++ EV LA
Sbjct: 125 GNTTILAEFATEDEVSRFLAQ 145
>gi|260792404|ref|XP_002591205.1| hypothetical protein BRAFLDRAFT_131100 [Branchiostoma floridae]
gi|229276408|gb|EEN47216.1| hypothetical protein BRAFLDRAFT_131100 [Branchiostoma floridae]
Length = 1431
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 47/79 (59%), Positives = 60/79 (75%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++LKNLTPQIDGSTL+TLC+QHGPL FHL L+ AL Y ++EEA KAQ +L+ C+L
Sbjct: 1203 WLILKNLTPQIDGSTLRTLCMQHGPLLTFHLNLSQGCALVCYMSKEEAAKAQKSLHTCVL 1262
Query: 118 GNTTIFAEAPSDAEVQSLL 136
GNTTI A+ S+ E + L
Sbjct: 1263 GNTTILADFISEDEARRLF 1281
>gi|432921828|ref|XP_004080242.1| PREDICTED: uncharacterized protein LOC101163614 [Oryzias latipes]
Length = 1885
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 60/82 (73%)
Query: 57 TWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCI 116
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +Y +++EA KA+ L+ C+
Sbjct: 1336 CWLVLSNLTPQIDGSTLRTICMQHGPLLTFHLGLTQGTALIRYGSKQEASKARSALHMCV 1395
Query: 117 LGNTTIFAEAPSDAEVQSLLAH 138
LGNTTI AE S+ +V +AH
Sbjct: 1396 LGNTTILAEFVSEEDVARYIAH 1417
>gi|291239873|ref|XP_002739846.1| PREDICTED: hypothetical protein, partial [Saccoglossus kowalevskii]
Length = 1669
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 60/81 (74%), Gaps = 1/81 (1%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
W++L+NLTPQIDGSTL+TLC QHGPL FHL L+ AL +Y TR+EA KAQ L+ C+L
Sbjct: 1470 WLVLRNLTPQIDGSTLQTLCKQHGPLHTFHLNLSQGQALIQYGTRDEAAKAQKALHMCVL 1529
Query: 118 GNTTIFAEAPSDAEVQSLLAH 138
GNTTI AE S +E+ +L
Sbjct: 1530 GNTTIMAEF-SSSEMTRMLER 1549
>gi|410896158|ref|XP_003961566.1| PREDICTED: trinucleotide repeat-containing gene 6B protein-like
[Takifugu rubripes]
Length = 2001
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 62/147 (42%), Positives = 78/147 (53%), Gaps = 27/147 (18%)
Query: 15 PPPGMMGGGGKPPSNGWMVRPNGGGGGGN------------------TWGTSQPQGGWSG 56
PPPG+ PS P GGG N TW Q
Sbjct: 1717 PPPGLGNQKQPSPS------PWSGGGPRNSILGLGTQNQTFFLCTVSTWSDGSAQ---ES 1767
Query: 57 TWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCI 116
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +Y++++EA KAQ L+ C+
Sbjct: 1768 CWLVLSNLTPQIDGSTLRTICMQHGPLLTFHLGLTQGTALIRYNSKQEAAKAQSALHMCV 1827
Query: 117 LGNTTIFAEAPSDAEVQSLLAHLSATA 143
LGNTTI AE S+ +V +AH A A
Sbjct: 1828 LGNTTILAEFVSEEDVARYIAHSQAGA 1854
>gi|47226125|emb|CAG04499.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1191
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/158 (39%), Positives = 84/158 (53%), Gaps = 28/158 (17%)
Query: 2 SSNDLWGPP-----KPRGPPPGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTSQPQGG-WS 55
S N L PP + + P GGG + GW G G +T G++ GG
Sbjct: 879 SQNQLSRPPPGLGSQKQPSPSPWSGGGPRFAGRGW-------GSGSSTTGSAWSDGGAQE 931
Query: 56 GTWVLLKNLTPQ---------------IDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYS 100
W++L NLTPQ IDGSTL+T+C+QHGPL FHL L AL +Y+
Sbjct: 932 SCWLVLSNLTPQVITDGVTATEAEVDAIDGSTLRTICMQHGPLLTFHLGLTQGNALIRYN 991
Query: 101 TREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAH 138
+++EA KAQ L+ C+LGNTTI AE S+ +V +AH
Sbjct: 992 SKQEAAKAQSALHMCVLGNTTILAEFVSEEDVARYIAH 1029
>gi|170052147|ref|XP_001862090.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167873115|gb|EDS36498.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1332
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 38/61 (62%), Positives = 51/61 (83%)
Query: 79 QHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLAH 138
+HGPL FH+YL+H +AL KYS+R+EA KAQ LNNC+LGNTTI AE P++++VQ++L H
Sbjct: 1198 KHGPLLAFHVYLHHGIALCKYSSRDEATKAQLALNNCMLGNTTICAEIPTESDVQNILQH 1257
Query: 139 L 139
L
Sbjct: 1258 L 1258
>gi|148685348|gb|EDL17295.1| mCG20982, isoform CRA_f [Mus musculus]
Length = 176
Score = 86.7 bits (213), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 55/157 (35%), Positives = 84/157 (53%), Gaps = 16/157 (10%)
Query: 78 VQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLA 137
+QHGPL FHL L H AL +YS++EE +KAQ +L+ C+LGNTTI AE S+ E+ A
Sbjct: 1 MQHGPLITFHLNLPHGNALVRYSSKEEVVKAQKSLHMCVLGNTTILAEFASEEEISRFFA 60
Query: 138 HLSATANN-------NNNNNGGTGGWARGSSALSNKDTWSSGGGGG------NTSQLWGT 184
+ + ++ + G+ + S+ ++ + W+ G G + + LWGT
Sbjct: 61 QSQSLTPSPGWQSLGSSQSRLGSLDCSHSFSSRTDVNHWNGAGLSGANCGDLHGTSLWGT 120
Query: 185 PSNPSSGGSLWGAPPLDSVDRATPSSLNSFLPGDLLG 221
P + SLWG P D ++PS +N+FL D L
Sbjct: 121 PHYST---SLWGPPSSDPRGISSPSPINAFLSVDHLA 154
>gi|149067988|gb|EDM17540.1| trinucleotide repeat containing 6 (predicted), isoform CRA_b
[Rattus norvegicus]
Length = 178
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 57/159 (35%), Positives = 85/159 (53%), Gaps = 19/159 (11%)
Query: 78 VQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLA 137
+QHGPL FHL L H AL +YS++EE +KAQ +L+ C+LGNTTI AE S+ E+ A
Sbjct: 1 MQHGPLITFHLNLPHGNALVRYSSKEEVVKAQKSLHMCVLGNTTILAEFASEEEISRFFA 60
Query: 138 HLSATANN-------NNNNNGGTGGWARGSSALSNKDTWSSGGGGGNT------SQLWGT 184
+ + ++ + G+ + S+ ++ + W+ G G + LWGT
Sbjct: 61 QSQSLTPSPGWQSLGSSQSRLGSLDCSHSFSSRTDLNHWNGAGLSGTNCGDLHGTSLWGT 120
Query: 185 PSNPSSGGSLWGAPPLDSVDR--ATPSSLNSFLPGDLLG 221
P + SLWG PP S R ++PS +N+FL D L
Sbjct: 121 PHYST---SLWG-PPSSSDPRGISSPSPINAFLSVDHLA 155
>gi|194385232|dbj|BAG64993.1| unnamed protein product [Homo sapiens]
Length = 762
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 35/56 (62%), Positives = 44/56 (78%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLN 113
W++L NLTPQIDGSTL+T+C+QHGPL FHL L AL +YST++EA KAQ L+
Sbjct: 706 WLVLHNLTPQIDGSTLRTICMQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALH 761
>gi|335287525|ref|XP_003355376.1| PREDICTED: trinucleotide repeat-containing gene 6B protein-like
[Sus scrofa]
Length = 165
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 34/61 (55%), Positives = 41/61 (67%)
Query: 78 VQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLA 137
+QHGPL FHL L AL +YST++EA KAQ L+ C+LGNTTI AE +D EV LA
Sbjct: 1 MQHGPLLTFHLNLTQGTALIRYSTKQEAAKAQTALHMCVLGNTTILAEFATDDEVSRFLA 60
Query: 138 H 138
Sbjct: 61 Q 61
>gi|363727808|ref|XP_416246.3| PREDICTED: trinucleotide repeat-containing gene 6B protein [Gallus
gallus]
Length = 165
Score = 70.9 bits (172), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 32/61 (52%), Positives = 40/61 (65%)
Query: 78 VQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQSLLA 137
+QHGPL FHL L AL +Y+T++EA KAQ L+ C+LGNTTI AE +D EV L
Sbjct: 1 MQHGPLLTFHLNLTQGTALIRYNTKQEAAKAQTALHMCVLGNTTILAEFATDEEVSRFLT 60
Query: 138 H 138
Sbjct: 61 Q 61
>gi|198433645|ref|XP_002122194.1| PREDICTED: similar to trinucleotide repeat containing 6B [Ciona
intestinalis]
Length = 964
Score = 70.5 bits (171), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 75/260 (28%), Positives = 114/260 (43%), Gaps = 52/260 (20%)
Query: 13 RGPPPGMMGGGGKP-----------PSNGWMVRPNGGGGGGNTWGTSQPQGGWS------ 55
R PPPG+ G +P P+N W GGN W + Q Q
Sbjct: 710 RPPPPGIGGSAFRPTKELAPTWENVPNNSWDQNMTRTSQGGNNWASQQQQQQPQQPQQQQ 769
Query: 56 -----GTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFH-LYLNHSLALAKYSTREEAIKAQ 109
G+W++L N Q+D + ++ LC+QHG + +F Y +AL +Y++ E+A A+
Sbjct: 770 QQESLGSWLVLTNFNQQVDVAGVRQLCMQHGNMVSFQGHYPIEGMALVRYASPEDAANAK 829
Query: 110 GNLNNCILGNTTIFAEAPSDAEVQSLLAHLSATA------------NNNNNNNGGTGGWA 157
LN + G+T + A +D EV + ++ATA + +N+G TGG+
Sbjct: 830 KALNMFMAGSTMLVATVATDHEVANF---VNATAGGSWGSTAGTPGSRFVSNSGSTGGFI 886
Query: 158 ----RGSSALSNKDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPP--------LDSVDR 205
+ S + D S QLWG SNP G PP + + R
Sbjct: 887 SQPPQNPSLAISSDAASVSSQQQQQQQLWGN-SNPQGGNWPSNMPPSMPWSGTSSEDMSR 945
Query: 206 ATPSSLNSFLPGDLLGGESM 225
S L++ LP +LLGGE+M
Sbjct: 946 IM-SPLHTLLPENLLGGETM 964
>gi|195999420|ref|XP_002109578.1| hypothetical protein TRIADDRAFT_53746 [Trichoplax adhaerens]
gi|190587702|gb|EDV27744.1| hypothetical protein TRIADDRAFT_53746 [Trichoplax adhaerens]
Length = 1438
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 40/69 (57%)
Query: 59 VLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILG 118
+L+K + Q+D + L+ LC+QHG + F YS+ +EA++AQ LNNC +
Sbjct: 1268 ILIKGFSSQVDENLLQALCLQHGRITEFVFDPRKRAVFVSYSSVDEAVRAQSRLNNCKIM 1327
Query: 119 NTTIFAEAP 127
++T+ A P
Sbjct: 1328 DSTLEASFP 1336
>gi|156375504|ref|XP_001630120.1| predicted protein [Nematostella vectensis]
gi|156217135|gb|EDO38057.1| predicted protein [Nematostella vectensis]
Length = 1727
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 50/95 (52%), Gaps = 19/95 (20%)
Query: 57 TWVLLKNLTP-------------------QIDGSTLKTLCVQHGPLQNFHLYLNHSLALA 97
TW++L+NL+P Q D + ++ +C Q+GPL F L L H +L
Sbjct: 1499 TWLVLRNLSPRADCFPLSLRLSSTISDFFQADPTAMRAVCQQYGPLLTFTLNLRHGNSLI 1558
Query: 98 KYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEV 132
+YS +++A A+ NLN ++ + A+ +D+++
Sbjct: 1559 RYSNKDQAASARNNLNGMMVKGMQLIADFATDSDI 1593
>gi|156340471|ref|XP_001620456.1| hypothetical protein NEMVEDRAFT_v1g223093 [Nematostella vectensis]
gi|156205405|gb|EDO28356.1| predicted protein [Nematostella vectensis]
Length = 199
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 42/77 (54%)
Query: 69 DGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPS 128
D + ++ +C Q+GPL F L L H +L +YS +++A A+ NLN ++ + A+ +
Sbjct: 2 DPTAMRAVCQQYGPLLTFTLNLRHGNSLIRYSNKDQAASARNNLNGMMVKGMQLIADFAT 61
Query: 129 DAEVQSLLAHLSATANN 145
D+++ +NN
Sbjct: 62 DSDIGGFFEQTPDWSNN 78
>gi|21430084|gb|AAM50720.1| GM23685p [Drosophila melanogaster]
Length = 215
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/28 (71%), Positives = 25/28 (89%)
Query: 112 LNNCILGNTTIFAEAPSDAEVQSLLAHL 139
LNNC+L NTTIFAE+PS+ EVQS++ HL
Sbjct: 3 LNNCVLANTTIFAESPSENEVQSIMQHL 30
>gi|340371985|ref|XP_003384525.1| PREDICTED: hypothetical protein LOC100636235 [Amphimedon
queenslandica]
Length = 2381
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/89 (26%), Positives = 52/89 (58%), Gaps = 1/89 (1%)
Query: 57 TWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLN-NC 115
++++L+N+TPQID ++L+ +C ++G + + + L +YST+EEA A+ L+ N
Sbjct: 2181 SFIVLRNVTPQIDETSLREVCSEYGKVLACTINSFNESVLIRYSTKEEAALAKSGLDRNP 2240
Query: 116 ILGNTTIFAEAPSDAEVQSLLAHLSATAN 144
+ + + S+A++ S + ++N
Sbjct: 2241 SICGVYVNPQFASEADISSFSDQRTPSSN 2269
>gi|345565557|gb|EGX48506.1| hypothetical protein AOL_s00080g135 [Arthrobotrys oligospora ATCC
24927]
Length = 147
Score = 43.9 bits (102), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 40/82 (48%), Gaps = 6/82 (7%)
Query: 50 PQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHS------LALAKYSTRE 103
P G VL+ N+ + L L +HG +QN HL L+ AL +YST+E
Sbjct: 23 PSRSIEGWIVLVTNVHEEAGEEDLNDLFAEHGEVQNLHLNLDRRTGYVKGYALVEYSTKE 82
Query: 104 EAIKAQGNLNNCILGNTTIFAE 125
EA A +++ L + T+ A+
Sbjct: 83 EAQSAIDSIDGSKLLDQTVSAD 104
>gi|384500774|gb|EIE91265.1| hypothetical protein RO3G_15976 [Rhizopus delemar RA 99-880]
Length = 256
Score = 42.4 bits (98), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 44/87 (50%), Gaps = 7/87 (8%)
Query: 43 NTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLY-----LNHSLALA 97
NTW S P+ S ++++KN++PQ T+K + G ++ F L H +AL
Sbjct: 3 NTWTISVPETP-SPNYIVVKNISPQSSEQTVKEFFLFCGKIKEFELKNDEEDEKHKIALV 61
Query: 98 KYSTREEAIKAQGNLNNCILGNTTIFA 124
+ RE A K L+N ++ ++ I A
Sbjct: 62 HFE-RESAAKTAALLSNALIDDSHIVA 87
>gi|390341101|ref|XP_786178.3| PREDICTED: uncharacterized protein LOC581062 [Strongylocentrotus
purpuratus]
Length = 2930
Score = 41.2 bits (95), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 51/191 (26%), Positives = 81/191 (42%), Gaps = 47/191 (24%)
Query: 56 GTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNC 115
G +L+ +T ++ + L+ +C Q G + F + Y ++A KA +++
Sbjct: 2758 GNSILISGVTSDVNVTALRNICGQQGQVDQFQENRAQGSVMVAYRFPDDAAKALAIISSA 2817
Query: 116 ILGNTTIFAE--APSDA-----------------------EVQSLLAHLSATANNNNNNN 150
I AE +PSDA S+L S+ + +++
Sbjct: 2818 F---PNIIAELVSPSDAFSTPASSSSSGWPQGGGSSGGSKFGNSVLPSTSSASGAGKDDS 2874
Query: 151 GGTGGWARGSSALSNKDTWSSGGGGGNTSQLWGTPSNPSSGGSLWGAPPLDSVDRATPSS 210
GG+ K WS+G G SQLW +P GGS+ G P+D D ++ S+
Sbjct: 2875 GGS------------KQNWSAGLPGMPGSQLW----SPGPGGSM-GWSPMDG-DSSSASN 2916
Query: 211 LNSFLPGDLLG 221
SFLPGDLLG
Sbjct: 2917 F-SFLPGDLLG 2926
>gi|301609397|ref|XP_002934255.1| PREDICTED: LOW QUALITY PROTEIN: trinucleotide repeat-containing gene
6B protein-like [Xenopus (Silurana) tropicalis]
Length = 1869
Score = 40.8 bits (94), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 38/84 (45%), Gaps = 4/84 (4%)
Query: 60 LLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGN 119
+ KNLT Q + S KT Q ++L+ SL A + + + C+LGN
Sbjct: 1696 ISKNLT-QKEKSQKKTTXKNKS--QKLRIHLD-SLVYAAMPSSHXXLLLSPSQPRCVLGN 1751
Query: 120 TTIFAEAPSDAEVQSLLAHLSATA 143
TTI AE +D EV LA A
Sbjct: 1752 TTILAEFATDEEVSRYLAQAQPPA 1775
>gi|19172018|gb|AAL85701.1|AF474982_5 Mei2-like protein [Hordeum vulgare subsp. vulgare]
Length = 961
Score = 40.8 bits (94), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 39/161 (24%), Positives = 60/161 (37%), Gaps = 18/161 (11%)
Query: 28 SNGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFH 87
N +++ NGG G T P G + ++N+ ++ + LK L Q+G +Q +
Sbjct: 205 ENNKLLKHNGGANTGQTGLNGLPYGENPSRTLFIRNINANVEDTELKLLFEQYGDIQTLY 264
Query: 88 L-YLNHSLALAKYSTREEAIKAQGNLNNCILGNTTIFAEAPSDAEVQ-SLLAHLSATANN 145
Y +H L + Y A +A L S Q L H S N
Sbjct: 265 TAYKHHGLVIISYYDIRSAERAMKALQ--------------SKPFRQWKLEIHYSIPKEN 310
Query: 146 --NNNNNGGTGGWARGSSALSNKDTWSSGGGGGNTSQLWGT 184
N+NN GT +++N D GG G + T
Sbjct: 311 LLENDNNQGTLAVINLDQSVTNDDLRHIFGGYGEIKAIHET 351
>gi|443716715|gb|ELU08106.1| hypothetical protein CAPTEDRAFT_185432 [Capitella teleta]
Length = 1399
Score = 38.9 bits (89), Expect = 1.5, Method: Composition-based stats.
Identities = 25/98 (25%), Positives = 46/98 (46%), Gaps = 6/98 (6%)
Query: 58 WVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCIL 117
+V ++ L P + S L+ L HG +++ + A K+ ++A+KA+ NN +
Sbjct: 22 YVFIEGLAPGVSISRLRILFSDHGVVEDVQVREEDHCAWIKFKQAKDALKAKKLTNNTAI 81
Query: 118 GNTTI----FAEAPSDAEVQSLLAHLSATANNNNNNNG 151
GNT + +E PS V +L+ + +N G
Sbjct: 82 GNTRVKVVTLSEEPS--RVIRILSAFCGSLTGKGHNGG 117
>gi|159473631|ref|XP_001694937.1| predicted protein [Chlamydomonas reinhardtii]
gi|158276316|gb|EDP02089.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1623
Score = 38.1 bits (87), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 23/89 (25%), Positives = 40/89 (44%), Gaps = 11/89 (12%)
Query: 36 NGGGGGGNT---WGTSQPQG--------GWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQ 84
N GG G++ +QP+G W + L NL P G+ L+ L +GPL+
Sbjct: 266 NANGGEGSSRLLLAATQPRGQLHQSVAQSWEARHLWLGNLLPTTTGAQLERLFAPYGPLE 325
Query: 85 NFHLYLNHSLALAKYSTREEAIKAQGNLN 113
+ ++ + + A + T + A A+ L
Sbjct: 326 SVRVFADRNFAFVNFMTAQHASTAKAALE 354
>gi|449470045|ref|XP_004152729.1| PREDICTED: polyadenylate-binding protein RBP47C-like [Cucumis
sativus]
gi|449496017|ref|XP_004160013.1| PREDICTED: polyadenylate-binding protein RBP47C-like [Cucumis
sativus]
Length = 429
Score = 37.4 bits (85), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 24/108 (22%), Positives = 47/108 (43%), Gaps = 2/108 (1%)
Query: 17 PGMMGGGGKPPSNGWMVRPNGGGGGGNTWGTS--QPQGGWSGTWVLLKNLTPQIDGSTLK 74
P +G S+G+ + + G N + Q G ++ T + + L P + LK
Sbjct: 256 PMRIGAATPKKSSGYQQQYSSQGYASNGSFSHGHQSDGDFTNTTIFIGGLDPNVTDEDLK 315
Query: 75 TLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCILGNTTI 122
L QHG + + + + +++ R+ A +A LN ++G T+
Sbjct: 316 QLFSQHGEIVSVKIPVGKGCGFIQFANRKNAEEALQKLNGTVIGKQTV 363
>gi|66808185|ref|XP_637815.1| SAP DNA-binding domain-containing protein [Dictyostelium discoideum
AX4]
gi|60466244|gb|EAL64306.1| SAP DNA-binding domain-containing protein [Dictyostelium discoideum
AX4]
Length = 421
Score = 36.6 bits (83), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 22/76 (28%), Positives = 39/76 (51%), Gaps = 3/76 (3%)
Query: 59 VLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLNNCI-- 116
+L+ L ++TL ++G ++N+ + S YST EEAIKA+ +LN +
Sbjct: 228 ILISKLVRPFRVDMIETLMNEYGSVKNYWMNSVKSFCFVTYSTSEEAIKARNSLNGLVWP 287
Query: 117 -LGNTTIFAEAPSDAE 131
L + + E S++E
Sbjct: 288 PLNRSKLIVEFSSESE 303
>gi|239817964|ref|YP_002946874.1| Crp/Fnr family transcriptional regulator [Variovorax paradoxus
S110]
gi|239804541|gb|ACS21608.1| transcriptional regulator, Crp/Fnr family [Variovorax paradoxus
S110]
Length = 261
Score = 36.2 bits (82), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 19/71 (26%), Positives = 33/71 (46%)
Query: 29 NGWMVRPNGGGGGGNTWGTSQPQGGWSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHL 88
G + N GG+ T P GGW G ++K + + L+ V P+++FH
Sbjct: 75 EGLLKMSNDNADGGSVTYTGVPPGGWFGEGTVMKREPYRYNIQALRRSVVAGLPIESFHW 134
Query: 89 YLNHSLALAKY 99
L+HS+ ++
Sbjct: 135 LLDHSIGFNRF 145
>gi|123485368|ref|XP_001324476.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121907359|gb|EAY12253.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 576
Score = 36.2 bits (82), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 40/72 (55%)
Query: 54 WSGTWVLLKNLTPQIDGSTLKTLCVQHGPLQNFHLYLNHSLALAKYSTREEAIKAQGNLN 113
+S T +++KNL + L+ + G L F L HS+A+ +++ ++A KA +LN
Sbjct: 375 YSKTVLIIKNLRWETTEEELRGIFASKGTLVRFVLAPTHSVAIVEFARGDDARKAFNSLN 434
Query: 114 NCILGNTTIFAE 125
+L +T I+ +
Sbjct: 435 YRLLHDTPIYIQ 446
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.309 0.131 0.422
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,869,147,268
Number of Sequences: 23463169
Number of extensions: 262593790
Number of successful extensions: 1603711
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 657
Number of HSP's successfully gapped in prelim test: 2058
Number of HSP's that attempted gapping in prelim test: 1557584
Number of HSP's gapped (non-prelim): 40698
length of query: 225
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 88
effective length of database: 9,144,741,214
effective search space: 804737226832
effective search space used: 804737226832
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 74 (33.1 bits)