BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 024104
(272 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|118485488|gb|ABK94599.1| unknown [Populus trichocarpa]
Length = 268
Score = 427 bits (1097), Expect = e-117, Method: Compositional matrix adjust.
Identities = 204/263 (77%), Positives = 234/263 (88%), Gaps = 1/263 (0%)
Query: 11 LFFQLLTL-QFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC 69
L QLL L QF SIISAIRKDIGF+++ C+ TVQGRYLLSDDNGYVCDA+S+DP SRCC
Sbjct: 6 LSLQLLILFQFSSIISAIRKDIGFEQSRSCRNTVQGRYLLSDDNGYVCDALSVDPPSRCC 65
Query: 70 PEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDF 129
PE G +FSCQGCNL+SQCCNSYE+CVSCC++P+RT + QVL VKIAKP+TA +Y SVFDF
Sbjct: 66 PENGLRFSCQGCNLVSQCCNSYEFCVSCCLHPSRTQEAQVLKVKIAKPSTAVSYTSVFDF 125
Query: 130 CAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDS 189
CAGRCRHNSESVVHENAY SDFHHCFS+PSN+SGA TQLE RL GI+V+IG+QGESCDS
Sbjct: 126 CAGRCRHNSESVVHENAYRSDFHHCFSLPSNSSGANYTQLEARLGGINVVIGKQGESCDS 185
Query: 190 VCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLY 249
VCKS+GQSCVLNKL++LNQC+++QKYMSCK GCLAS+G DQPAEV +DAP +LNP ACLY
Sbjct: 186 VCKSNGQSCVLNKLLVLNQCDVMQKYMSCKGGCLASIGTDQPAEVVEDAPRHLNPGACLY 245
Query: 250 TRIQSMLSCDGSHRHTRRLCPCA 272
TR QS+LSCDGS HTRRLCPCA
Sbjct: 246 TRTQSLLSCDGSLWHTRRLCPCA 268
>gi|297743774|emb|CBI36657.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 410 bits (1053), Expect = e-112, Method: Compositional matrix adjust.
Identities = 199/273 (72%), Positives = 227/273 (83%), Gaps = 1/273 (0%)
Query: 1 MPKSRLNLLFLFFQLLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAV 60
M SRL L LF +L +Q LS +SAIRKD E C+TTVQGRYL +DDNG+VCDA+
Sbjct: 1 MSNSRLILSLLFRFILIIQILSRVSAIRKDPNLLEKRICRTTVQGRYLRTDDNGHVCDAL 60
Query: 61 SMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTA 120
SMDPQS CC K E+FSC GCNL+SQCCNSYE+CVSCC+NPART KE L VK+AKP TA
Sbjct: 61 SMDPQSGCCFGKREQFSCHGCNLISQCCNSYEFCVSCCLNPARTQKELALKVKMAKPVTA 120
Query: 121 GTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQ-LEGRLSGISVI 179
G Y SVFDFC GRCRHNSESVVHENAYLSDFHHCFS+PSN++ GV + E RL+GI V+
Sbjct: 121 GAYASVFDFCTGRCRHNSESVVHENAYLSDFHHCFSLPSNSTAGGVDRNPELRLNGIHVV 180
Query: 180 IGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAP 239
+GRQGESCDSVCKS+GQSCV +KL +LNQCEI+QKYMSCK CLAS+GADQPAEV DDAP
Sbjct: 181 VGRQGESCDSVCKSNGQSCVPSKLSVLNQCEIMQKYMSCKGACLASIGADQPAEVVDDAP 240
Query: 240 SNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
+LNP ACLYTR Q++LSCDGSH+HTRRLCPCA
Sbjct: 241 RHLNPGACLYTRTQTILSCDGSHQHTRRLCPCA 273
>gi|359478124|ref|XP_002264803.2| PREDICTED: UPF0454 protein C12orf49 homolog [Vitis vinifera]
Length = 279
Score = 409 bits (1052), Expect = e-112, Method: Compositional matrix adjust.
Identities = 199/273 (72%), Positives = 227/273 (83%), Gaps = 1/273 (0%)
Query: 1 MPKSRLNLLFLFFQLLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAV 60
M SRL L LF +L +Q LS +SAIRKD E C+TTVQGRYL +DDNG+VCDA+
Sbjct: 7 MSNSRLILSLLFRFILIIQILSRVSAIRKDPNLLEKRICRTTVQGRYLRTDDNGHVCDAL 66
Query: 61 SMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTA 120
SMDPQS CC K E+FSC GCNL+SQCCNSYE+CVSCC+NPART KE L VK+AKP TA
Sbjct: 67 SMDPQSGCCFGKREQFSCHGCNLISQCCNSYEFCVSCCLNPARTQKELALKVKMAKPVTA 126
Query: 121 GTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQ-LEGRLSGISVI 179
G Y SVFDFC GRCRHNSESVVHENAYLSDFHHCFS+PSN++ GV + E RL+GI V+
Sbjct: 127 GAYASVFDFCTGRCRHNSESVVHENAYLSDFHHCFSLPSNSTAGGVDRNPELRLNGIHVV 186
Query: 180 IGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAP 239
+GRQGESCDSVCKS+GQSCV +KL +LNQCEI+QKYMSCK CLAS+GADQPAEV DDAP
Sbjct: 187 VGRQGESCDSVCKSNGQSCVPSKLSVLNQCEIMQKYMSCKGACLASIGADQPAEVVDDAP 246
Query: 240 SNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
+LNP ACLYTR Q++LSCDGSH+HTRRLCPCA
Sbjct: 247 RHLNPGACLYTRTQTILSCDGSHQHTRRLCPCA 279
>gi|224112759|ref|XP_002332721.1| predicted protein [Populus trichocarpa]
gi|222832884|gb|EEE71361.1| predicted protein [Populus trichocarpa]
Length = 245
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 191/245 (77%), Positives = 221/245 (90%)
Query: 28 RKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQC 87
RKDIGF+++ C+ TVQGRYLLSDDNGYVCDA+S+DP SRCCPE G +FSCQGCNL+SQC
Sbjct: 1 RKDIGFEQSRSCRNTVQGRYLLSDDNGYVCDALSVDPPSRCCPENGLRFSCQGCNLVSQC 60
Query: 88 CNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
CNSYE+CVSCC++P+RT + QVL VKIAKP+TA +Y SVFDFCAGRCRHNSESVVHENAY
Sbjct: 61 CNSYEFCVSCCLHPSRTQEAQVLKVKIAKPSTAVSYTSVFDFCAGRCRHNSESVVHENAY 120
Query: 148 LSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLN 207
SDFHHCFS+PSN+SGA TQLE RL GI+V+IG+QGESCDSVCKS+GQSCVLNKL++LN
Sbjct: 121 RSDFHHCFSLPSNSSGANYTQLEARLGGINVVIGKQGESCDSVCKSNGQSCVLNKLLVLN 180
Query: 208 QCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRR 267
QC+++QKYMSCK GCLAS+G DQPAEV +DAP +LNP ACLYTR QS+LSCDGS HTRR
Sbjct: 181 QCDVMQKYMSCKGGCLASIGTDQPAEVVEDAPRHLNPGACLYTRTQSLLSCDGSLWHTRR 240
Query: 268 LCPCA 272
LCPCA
Sbjct: 241 LCPCA 245
>gi|356538903|ref|XP_003537940.1| PREDICTED: uncharacterized protein LOC100782588 [Glycine max]
Length = 281
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 192/258 (74%), Positives = 221/258 (85%)
Query: 15 LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE 74
L+ F S I+AIRKD+GFQ CKTTVQGRYLLSDDNGYVC+A S++ +SRCCP++GE
Sbjct: 24 LVIFHFSSTITAIRKDVGFQIIHPCKTTVQGRYLLSDDNGYVCNAASVNSRSRCCPQRGE 83
Query: 75 KFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRC 134
KFSC GCNLLSQCCNSYEYCVSCC+NPA KEQVL +K+AKPTTA TY SVFD+CAGRC
Sbjct: 84 KFSCHGCNLLSQCCNSYEYCVSCCLNPALISKEQVLKMKVAKPTTARTYSSVFDYCAGRC 143
Query: 135 RHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSS 194
RH+SESVVHENAY+SDFHHCFS+ SN+SG T E RLSGI+V++GRQGESC+SVCKS
Sbjct: 144 RHSSESVVHENAYISDFHHCFSLSSNSSGKNSTLTEARLSGINVVVGRQGESCNSVCKSR 203
Query: 195 GQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQS 254
GQSCV NKL++LN C+IIQKYMSCK CLAS+GADQPAEV DAP +LNP +CLYT QS
Sbjct: 204 GQSCVPNKLVVLNSCDIIQKYMSCKGSCLASVGADQPAEVVYDAPVHLNPGSCLYTETQS 263
Query: 255 MLSCDGSHRHTRRLCPCA 272
+LSCDG H+HTRRLCPCA
Sbjct: 264 ILSCDGLHQHTRRLCPCA 281
>gi|297817236|ref|XP_002876501.1| hypothetical protein ARALYDRAFT_486406 [Arabidopsis lyrata subsp.
lyrata]
gi|297322339|gb|EFH52760.1| hypothetical protein ARALYDRAFT_486406 [Arabidopsis lyrata subsp.
lyrata]
Length = 272
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 190/272 (69%), Positives = 224/272 (82%)
Query: 1 MPKSRLNLLFLFFQLLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAV 60
M S +L L+ F +SAIRKDIGF E C+TTVQGRYL+SDD G VCD +
Sbjct: 1 MSTSGFSLFLAVLHLMGFLFPLRVSAIRKDIGFLEERSCRTTVQGRYLISDDEGNVCDVL 60
Query: 61 SMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTA 120
S++ ++RCCP KGE+FSC GCNLLSQCCNSYE+CVSCC+NP+RTL E+V+ VK+AKP TA
Sbjct: 61 SLESRTRCCPWKGERFSCHGCNLLSQCCNSYEFCVSCCLNPSRTLLEKVVKVKVAKPATA 120
Query: 121 GTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVII 180
GTY SVFDFCAGRCRHNSESVVHENAY S+FHHCFS+ SNASGA +TQ+E RL GI VI+
Sbjct: 121 GTYKSVFDFCAGRCRHNSESVVHENAYHSEFHHCFSLTSNASGANLTQVETRLLGIDVIV 180
Query: 181 GRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPS 240
G QG+SCD+VCKS GQ CV+NKL LLNQC+++++YM+CK CLAS GADQPAEV +DAP
Sbjct: 181 GSQGDSCDAVCKSRGQLCVMNKLSLLNQCDVMKRYMTCKGSCLASAGADQPAEVVEDAPR 240
Query: 241 NLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
+L P ACLYTR QSMLSCDGSH+HTRRLCPCA
Sbjct: 241 DLYPGACLYTRTQSMLSCDGSHQHTRRLCPCA 272
>gi|388496984|gb|AFK36558.1| unknown [Lotus japonicus]
Length = 272
Score = 389 bits (999), Expect = e-106, Method: Compositional matrix adjust.
Identities = 184/255 (72%), Positives = 214/255 (83%), Gaps = 1/255 (0%)
Query: 19 QFLSIISAIRKDIGFQET-PFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFS 77
F S I+AIRKDIG Q P CKTTVQGRY L+DDNG+VC A+S+D SRCCP+ EKFS
Sbjct: 18 HFSSPITAIRKDIGLQSIIPTCKTTVQGRYHLTDDNGFVCSALSVDSMSRCCPQSREKFS 77
Query: 78 CQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHN 137
C GCN+LSQCCNSYEYCVSCC+NPA T KEQVL +KI+KP TA TY +VFD+CA RCRH+
Sbjct: 78 CHGCNVLSQCCNSYEYCVSCCLNPALTSKEQVLKMKISKPVTARTYENVFDYCAARCRHS 137
Query: 138 SESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQS 197
SESVVHENAYLSDFHHCFS+PSN+SG VT E RL+GI+V++GRQGESC SVCK GQS
Sbjct: 138 SESVVHENAYLSDFHHCFSLPSNSSGTNVTLTEARLNGINVVVGRQGESCSSVCKLRGQS 197
Query: 198 CVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLS 257
CV NKL++LN C+IIQ+YMSCK CLAS+G DQPAEV DAP +LNP +CLYT +S+LS
Sbjct: 198 CVPNKLVVLNNCDIIQRYMSCKGSCLASVGTDQPAEVVYDAPKDLNPGSCLYTETESVLS 257
Query: 258 CDGSHRHTRRLCPCA 272
CDGSH+HT+RLCPCA
Sbjct: 258 CDGSHQHTKRLCPCA 272
>gi|255585501|ref|XP_002533442.1| conserved hypothetical protein [Ricinus communis]
gi|223526704|gb|EEF28938.1| conserved hypothetical protein [Ricinus communis]
Length = 251
Score = 382 bits (981), Expect = e-104, Method: Compositional matrix adjust.
Identities = 179/242 (73%), Positives = 206/242 (85%)
Query: 1 MPKSRLNLLFLFFQLLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAV 60
M K LN + F L LQFL +I +IR+DIGF E C+TTVQGR+LLSDDNG+VCDA+
Sbjct: 4 MSKFGLNAPCILFPLFILQFLPVIFSIRRDIGFHEMKICRTTVQGRFLLSDDNGHVCDAL 63
Query: 61 SMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTA 120
S+DPQSRCCP +GEKFSC GCNLLS CCNSYE+CVSCC+NPART +E+V+ +KIAKP TA
Sbjct: 64 SLDPQSRCCPVRGEKFSCHGCNLLSHCCNSYEFCVSCCLNPARTQEEEVMKLKIAKPATA 123
Query: 121 GTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVII 180
GTY S FDFCAGRCRHNSESVVHENAYLSDFHHCFS+PSN++GA QLE RL GI+VII
Sbjct: 124 GTYTSTFDFCAGRCRHNSESVVHENAYLSDFHHCFSLPSNSTGANYRQLEARLVGINVII 183
Query: 181 GRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPS 240
GRQGESCDS CKS+GQSCVLNKL++LNQC+I+QKYMSC+ CLASMG DQPAEV D AP
Sbjct: 184 GRQGESCDSACKSNGQSCVLNKLLVLNQCDIMQKYMSCRGACLASMGTDQPAEVVDIAPK 243
Query: 241 NL 242
+L
Sbjct: 244 HL 245
>gi|30694985|ref|NP_191498.2| uncharacterized protein [Arabidopsis thaliana]
gi|42572729|ref|NP_974460.1| uncharacterized protein [Arabidopsis thaliana]
gi|332646394|gb|AEE79915.1| uncharacterized protein [Arabidopsis thaliana]
gi|332646395|gb|AEE79916.1| uncharacterized protein [Arabidopsis thaliana]
Length = 273
Score = 379 bits (974), Expect = e-103, Method: Compositional matrix adjust.
Identities = 183/258 (70%), Positives = 221/258 (85%)
Query: 15 LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE 74
++ F +SAIRKDIGF E C+TTVQGRYL+SDD G VCDA+S++ ++RCCP KGE
Sbjct: 16 IMGFLFPLRVSAIRKDIGFLEERSCRTTVQGRYLISDDEGNVCDALSLESRTRCCPWKGE 75
Query: 75 KFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRC 134
+FSC GCN+LSQCCNSYE+CVSCC+NP++TL E+V+ VK+AKP T+GTY SVFDFCAGRC
Sbjct: 76 RFSCHGCNILSQCCNSYEFCVSCCLNPSQTLLEKVVKVKVAKPATSGTYKSVFDFCAGRC 135
Query: 135 RHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSS 194
RHNSESVVHENAY S+FHHCFS+ SNASGA +TQ+E RL GI VI+G QG+SCD+VCKS
Sbjct: 136 RHNSESVVHENAYHSEFHHCFSLTSNASGANLTQVETRLLGIDVIVGSQGDSCDAVCKSR 195
Query: 195 GQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQS 254
GQ CV+NKL LLNQC+++++YM+CK CLAS GADQPAEV +DAP +L P ACLYTR QS
Sbjct: 196 GQLCVMNKLSLLNQCDVMKRYMNCKGSCLASAGADQPAEVVEDAPRDLYPGACLYTRTQS 255
Query: 255 MLSCDGSHRHTRRLCPCA 272
+LSCDGSH+HTRRLCPCA
Sbjct: 256 LLSCDGSHQHTRRLCPCA 273
>gi|242084782|ref|XP_002442816.1| hypothetical protein SORBIDRAFT_08g003310 [Sorghum bicolor]
gi|241943509|gb|EES16654.1| hypothetical protein SORBIDRAFT_08g003310 [Sorghum bicolor]
Length = 276
Score = 354 bits (909), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 163/253 (64%), Positives = 203/253 (80%)
Query: 20 FLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQ 79
F + +S IRKDIGF E C++TVQGR+L+SDDNGYVC A+S++P S CCP G++FSCQ
Sbjct: 24 FPNPVSGIRKDIGFIEPIVCRSTVQGRHLISDDNGYVCSALSINPWSHCCPTTGDRFSCQ 83
Query: 80 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 139
GC L QCCNSYEYCVSCC+NP++ KE VL +K+AKP TAGTY +VFDFC GRCRH+S
Sbjct: 84 GCKLDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143
Query: 140 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 199
SVVHENAY SDFHHCFS+ N+SG+ + + +L GI+V++GR GESC VCK GQSCV
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSVVSKLLGINVVVGRPGESCSLVCKVRGQSCV 203
Query: 200 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 259
++L +LN+CEI+QKYM CK GC S+G DQPAEV D+AP++LNP ACLY ++ L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCD 263
Query: 260 GSHRHTRRLCPCA 272
GSH+HTRRLCPCA
Sbjct: 264 GSHQHTRRLCPCA 276
>gi|218185557|gb|EEC67984.1| hypothetical protein OsI_35752 [Oryza sativa Indica Group]
Length = 276
Score = 348 bits (893), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 164/276 (59%), Positives = 212/276 (76%), Gaps = 4/276 (1%)
Query: 1 MPKSRLNLLFLFFQLLTLQFLS----IISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYV 56
M + L LL L +++ F + +++ IRKDIG C++TVQGR+L+SDDNGYV
Sbjct: 1 MATTALPLLHLLPRIVCAAFFACLPGLVAGIRKDIGLAAPIMCRSTVQGRHLISDDNGYV 60
Query: 57 CDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAK 116
C ++S+DP SRCCP G +FSCQGC L QCC+SYEYCVSCC+NP+RT + VL +K+A+
Sbjct: 61 CSSLSIDPWSRCCPRTGSRFSCQGCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVAR 120
Query: 117 PTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGI 176
P T+GTY +VFDFC GRCRH+S SVVHENAY SDFHHCF + N+SG+ + RL GI
Sbjct: 121 PVTSGTYRNVFDFCMGRCRHSSASVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGI 180
Query: 177 SVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVAD 236
++ IGR+GESC SVC++ GQSCV ++L +LN+CEI+QKYM CK GC +S+G DQPA+V D
Sbjct: 181 NISIGRRGESCSSVCRAKGQSCVPSRLSVLNKCEILQKYMRCKSGCFSSLGPDQPAQVVD 240
Query: 237 DAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
+APSNLNP ACLY ++ L+CDGSH+HTRRLCPCA
Sbjct: 241 EAPSNLNPGACLYMQMDERLTCDGSHQHTRRLCPCA 276
>gi|222615817|gb|EEE51949.1| hypothetical protein OsJ_33587 [Oryza sativa Japonica Group]
Length = 276
Score = 347 bits (890), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 163/276 (59%), Positives = 212/276 (76%), Gaps = 4/276 (1%)
Query: 1 MPKSRLNLLFLFFQLLTLQFLS----IISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYV 56
M + L LL L +++ F + +++ IRKDIG C++TVQGR+L+SDDNGYV
Sbjct: 1 MATTALPLLHLLPRIVCAAFFACLPGLVAGIRKDIGLAAPIMCRSTVQGRHLISDDNGYV 60
Query: 57 CDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAK 116
C ++S+DP SRCCP G +FSCQGC L QCC+SYEYCVSCC+NP+RT + VL +K+A+
Sbjct: 61 CSSLSIDPWSRCCPRTGSRFSCQGCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVAR 120
Query: 117 PTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGI 176
P T+GTY +VFDFC GRCRH+S SVVHENAY SDFHHCF + N+SG+ + RL GI
Sbjct: 121 PVTSGTYRNVFDFCMGRCRHSSASVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGI 180
Query: 177 SVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVAD 236
++ IGR+GESC SVC++ GQSCV ++L +LN+CEI+QKYM CK GC +++G DQPA+V D
Sbjct: 181 NISIGRRGESCSSVCRAKGQSCVPSRLSVLNKCEILQKYMRCKSGCFSNLGPDQPAQVVD 240
Query: 237 DAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
+APSNLNP ACLY ++ L+CDGSH+HTRRLCPCA
Sbjct: 241 EAPSNLNPGACLYMQMDERLTCDGSHQHTRRLCPCA 276
>gi|357156916|ref|XP_003577619.1| PREDICTED: uncharacterized protein LOC100836694 [Brachypodium
distachyon]
Length = 275
Score = 345 bits (885), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 156/249 (62%), Positives = 196/249 (78%)
Query: 24 ISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNL 83
+SAIRK+ GF C+ TVQGR+L+SDDNGYVC ++S+DP S CCP G +FSCQGC L
Sbjct: 27 VSAIRKETGFAAPIMCRRTVQGRHLISDDNGYVCSSLSIDPWSHCCPRTGARFSCQGCKL 86
Query: 84 LSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVH 143
QCCNSYEYCVSCC++P+RT + VL +K+AKP TAGTY ++FDFC GRCRH+S SVVH
Sbjct: 87 DLQCCNSYEYCVSCCLDPSRTKERDVLKLKVAKPVTAGTYANIFDFCMGRCRHSSASVVH 146
Query: 144 ENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKL 203
ENAY+SDFHHCF + N SG+ + RL+GISVI+GRQGE C S C++ GQSCV ++L
Sbjct: 147 ENAYVSDFHHCFMVQQNLSGSTDSNYGSRLAGISVIVGRQGEPCSSTCRAKGQSCVPSRL 206
Query: 204 ILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHR 263
LN+C+I+QKYM C+ GC S+G DQPAEV D+AP++LNP ACLY + L+CDGSH+
Sbjct: 207 SELNKCQILQKYMRCRSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQTDERLTCDGSHQ 266
Query: 264 HTRRLCPCA 272
HTRR+CPCA
Sbjct: 267 HTRRVCPCA 275
>gi|449469174|ref|XP_004152296.1| PREDICTED: uncharacterized protein LOC101217783 [Cucumis sativus]
Length = 252
Score = 343 bits (881), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 170/251 (67%), Positives = 195/251 (77%), Gaps = 19/251 (7%)
Query: 22 SIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGC 81
S IS+IR +IG C++TVQGRYLLSDDNGYVCDA+S+DP SRCC KGEKF C GC
Sbjct: 21 STISSIRTNIGLAVNSSCRSTVQGRYLLSDDNGYVCDALSVDPLSRCCHGKGEKFPCHGC 80
Query: 82 NLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESV 141
NL+ QCCNSYEYCVSCC NP+RT +EQ+L +KIAKP TAG
Sbjct: 81 NLVLQCCNSYEYCVSCCQNPSRTKREQILKIKIAKPATAGK------------------- 121
Query: 142 VHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLN 201
VHENAY S FHHCFS+PS +SG TQLE RLSGI+VIIGRQGESCDSVCKS+GQSCV N
Sbjct: 122 VHENAYHSAFHHCFSLPSYSSGDNSTQLEFRLSGINVIIGRQGESCDSVCKSNGQSCVPN 181
Query: 202 KLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGS 261
K+++LN C+I+QKY+SCK GCLAS+G DQPAEVA DAP LNP ACLY+ QSM+SCDGS
Sbjct: 182 KILVLNHCDIMQKYLSCKGGCLASVGTDQPAEVAYDAPKELNPGACLYSSAQSMVSCDGS 241
Query: 262 HRHTRRLCPCA 272
H HT+RLCPCA
Sbjct: 242 HPHTKRLCPCA 252
>gi|226499180|ref|NP_001143347.1| uncharacterized protein LOC100275960 precursor [Zea mays]
gi|195618642|gb|ACG31151.1| hypothetical protein [Zea mays]
Length = 276
Score = 340 bits (872), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 161/253 (63%), Positives = 198/253 (78%)
Query: 20 FLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQ 79
F S +S IRK IGF + C++TVQGR+L+SDDNGYVC A+S++P S CCP G++FSCQ
Sbjct: 24 FPSPVSGIRKGIGFIQPIVCRSTVQGRHLISDDNGYVCSALSINPWSHCCPTTGDRFSCQ 83
Query: 80 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 139
GC L QCCNSYEYCVSCC NP++ KE VL +K+AKP TAGTY +VFDFC GRCRH+S
Sbjct: 84 GCELDLQCCNSYEYCVSCCFNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143
Query: 140 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 199
SVVHENAY SDFHHCFS+ N+SG+ + +L GI+V++GR ESC VCK GQSCV
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVGRPRESCSLVCKVRGQSCV 203
Query: 200 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 259
++L +LN+CEI+QKYM CK GC S+G DQPAEV D+AP++LNP ACLY ++ L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCD 263
Query: 260 GSHRHTRRLCPCA 272
GSH HTRRLCPCA
Sbjct: 264 GSHHHTRRLCPCA 276
>gi|219887191|gb|ACL53970.1| unknown [Zea mays]
gi|413916125|gb|AFW56057.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
gi|413916126|gb|AFW56058.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
gi|413916127|gb|AFW56059.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
Length = 276
Score = 338 bits (867), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 160/253 (63%), Positives = 198/253 (78%)
Query: 20 FLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQ 79
F S +S IRK I F + C++TVQGR+L+SDDNGYVC A+S++P S CCP G++FSCQ
Sbjct: 24 FPSPVSGIRKGIDFIQPIVCRSTVQGRHLISDDNGYVCSALSINPWSHCCPTTGDRFSCQ 83
Query: 80 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 139
GC L QCCNSYEYCVSCC+NP++ KE VL +K+AKP TAGTY +VFDFC GRCRH+S
Sbjct: 84 GCELDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143
Query: 140 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 199
SVVHENAY SDFHHCFS+ N+SG+ + +L GI+V++GR ESC VCK GQSCV
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVGRPRESCSLVCKVRGQSCV 203
Query: 200 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 259
++L +LN+CEI+QKYM CK GC S+G DQPAEV D+AP++LNP ACLY ++ L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCD 263
Query: 260 GSHRHTRRLCPCA 272
GSH HTRRLCPCA
Sbjct: 264 GSHHHTRRLCPCA 276
>gi|413916124|gb|AFW56056.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
Length = 258
Score = 330 bits (847), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 156/245 (63%), Positives = 193/245 (78%)
Query: 28 RKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQC 87
RK I F + C++TVQGR+L+SDDNGYVC A+S++P S CCP G++FSCQGC L QC
Sbjct: 14 RKGIDFIQPIVCRSTVQGRHLISDDNGYVCSALSINPWSHCCPTTGDRFSCQGCELDLQC 73
Query: 88 CNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
CNSYEYCVSCC+NP++ KE VL +K+AKP TAGTY +VFDFC GRCRH+S SVVHENAY
Sbjct: 74 CNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSASVVHENAY 133
Query: 148 LSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLN 207
SDFHHCFS+ N+SG+ + +L GI+V++GR ESC VCK GQSCV ++L +LN
Sbjct: 134 ASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVGRPRESCSLVCKVRGQSCVPSRLSVLN 193
Query: 208 QCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRR 267
+CEI+QKYM CK GC S+G DQPAEV D+AP++LNP ACLY ++ L+CDGSH HTRR
Sbjct: 194 KCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCDGSHHHTRR 253
Query: 268 LCPCA 272
LCPCA
Sbjct: 254 LCPCA 258
>gi|7801689|emb|CAB91609.1| putative protein [Arabidopsis thaliana]
Length = 273
Score = 326 bits (836), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 158/228 (69%), Positives = 194/228 (85%)
Query: 15 LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE 74
++ F +SAIRKDIGF E C+TTVQGRYL+SDD G VCDA+S++ ++RCCP KGE
Sbjct: 16 IMGFLFPLRVSAIRKDIGFLEERSCRTTVQGRYLISDDEGNVCDALSLESRTRCCPWKGE 75
Query: 75 KFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRC 134
+FSC GCN+LSQCCNSYE+CVSCC+NP++TL E+V+ VK+AKP T+GTY SVFDFCAGRC
Sbjct: 76 RFSCHGCNILSQCCNSYEFCVSCCLNPSQTLLEKVVKVKVAKPATSGTYKSVFDFCAGRC 135
Query: 135 RHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSS 194
RHNSESVVHENAY S+FHHCFS+ SNASGA +TQ+E RL GI VI+G QG+SCD+VCKS
Sbjct: 136 RHNSESVVHENAYHSEFHHCFSLTSNASGANLTQVETRLLGIDVIVGSQGDSCDAVCKSR 195
Query: 195 GQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNL 242
GQ CV+NKL LLNQC+++++YM+CK CLAS GADQPAEV +DAP +L
Sbjct: 196 GQLCVMNKLSLLNQCDVMKRYMNCKGSCLASAGADQPAEVVEDAPRDL 243
>gi|108864223|gb|ABA92588.2| expressed protein [Oryza sativa Japonica Group]
Length = 251
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 142/246 (57%), Positives = 187/246 (76%), Gaps = 4/246 (1%)
Query: 1 MPKSRLNLLFLFFQLLTLQFLS----IISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYV 56
M + L LL L +++ F + +++ IRKDIG C++TVQGR+L+SDDNGYV
Sbjct: 1 MATTALPLLHLLPRIVCAAFFACLPGLVAGIRKDIGLAAPIMCRSTVQGRHLISDDNGYV 60
Query: 57 CDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAK 116
C ++S+DP SRCCP G +FSCQGC L QCC+SYEYCVSCC+NP+RT + VL +K+A+
Sbjct: 61 CSSLSIDPWSRCCPRTGSRFSCQGCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVAR 120
Query: 117 PTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGI 176
P T+GTY +VFDFC GRCRH+S SVVHENAY SDFHHCF + N+SG+ + RL GI
Sbjct: 121 PVTSGTYRNVFDFCMGRCRHSSASVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGI 180
Query: 177 SVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVAD 236
++ IGR+GESC SVC++ GQSCV ++L +LN+CEI+QKYM CK GC +++G DQPA+V D
Sbjct: 181 NISIGRRGESCSSVCRAKGQSCVPSRLSVLNKCEILQKYMRCKSGCFSNLGPDQPAQVVD 240
Query: 237 DAPSNL 242
+APSNL
Sbjct: 241 EAPSNL 246
>gi|255637592|gb|ACU19121.1| unknown [Glycine max]
Length = 224
Score = 295 bits (756), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 138/185 (74%), Positives = 159/185 (85%)
Query: 15 LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE 74
L+ F S I+AIRKD+GFQ CKTTVQGRYLLSDDNGYVC+A S++ +SRCCP++GE
Sbjct: 24 LVIFHFSSTITAIRKDVGFQIIHPCKTTVQGRYLLSDDNGYVCNAASVNSRSRCCPQRGE 83
Query: 75 KFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRC 134
KFSC GCNLLSQCCNSYEYCVSCC+NPA KEQVL +K+AKPTTA TY SVFD+CAGRC
Sbjct: 84 KFSCHGCNLLSQCCNSYEYCVSCCLNPALISKEQVLKMKVAKPTTARTYSSVFDYCAGRC 143
Query: 135 RHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSS 194
RH+SESVVHENAY+SDFHHCFS+ SN+SG T E RLSGI+V++GRQGESC+SVCKS
Sbjct: 144 RHSSESVVHENAYISDFHHCFSLSSNSSGKNSTLTEARLSGINVVVGRQGESCNSVCKSR 203
Query: 195 GQSCV 199
GQSCV
Sbjct: 204 GQSCV 208
>gi|147779400|emb|CAN76795.1| hypothetical protein VITISV_030117 [Vitis vinifera]
Length = 258
Score = 275 bits (704), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 145/243 (59%), Positives = 166/243 (68%), Gaps = 38/243 (15%)
Query: 28 RKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQC 87
RKD E C+TTVQGRYL +DDN + +D + GCNL+SQC
Sbjct: 8 RKDPNLLEKRICRTTVQGRYLRTDDNDNPQLGLDVD----------MRVIHIGCNLISQC 57
Query: 88 CNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
CNSYE+CVSCC+NPART KE L VK+AKP TAG Y SVFDFC GRCRHNSESVVHENAY
Sbjct: 58 CNSYEFCVSCCLNPARTQKELALKVKMAKPVTAGAYASVFDFCTGRCRHNSESVVHENAY 117
Query: 148 LSDFHHCFSMPSNASGAG---------------VTQL-------------EGRLSGISVI 179
LSDFHHCFS+PSN++G V QL E RL+GI V+
Sbjct: 118 LSDFHHCFSLPSNSAGEAWSLIPNSWNGLFLKIVKQLMASMHAGGVDRNPELRLNGIHVV 177
Query: 180 IGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAP 239
+GRQGESCDSVCKS+GQSCV +KL +LNQCEI+QKYMSCK CLAS+GADQPAEV DDAP
Sbjct: 178 VGRQGESCDSVCKSNGQSCVPSKLSVLNQCEIMQKYMSCKGACLASIGADQPAEVVDDAP 237
Query: 240 SNL 242
+L
Sbjct: 238 RHL 240
>gi|168051463|ref|XP_001778174.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670496|gb|EDQ57064.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 235
Score = 265 bits (678), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 130/235 (55%), Positives = 162/235 (68%), Gaps = 1/235 (0%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCC 98
C TVQGRYLLSD GYVC+ + ++ + CCP +GE+F+CQGCN+ SQCCNSYEYCVSCC
Sbjct: 1 CNNTVQGRYLLSDSRGYVCNIMEVNSLTGCCPREGEQFACQGCNIGSQCCNSYEYCVSCC 60
Query: 99 VNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMP 158
+NP RT E VL K+A+ TAG++ S+FD+C+GRCRHNS+SVVHENAY+S+ HHCFS
Sbjct: 61 LNPVRTTLEVVLKTKLARQPTAGSWTSLFDYCSGRCRHNSQSVVHENAYVSEDHHCFSAR 120
Query: 159 SNASGAGVT-QLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMS 217
S G V E L+ + V +GR+G SCD+ C + SC KL LN C +QKY
Sbjct: 121 STPPGVQVEMDAEDELADVFVYVGREGLSCDATCSAQKLSCREEKLAALNNCAELQKYNK 180
Query: 218 CKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
C C+ S G DQPAEV AP +++P ACLY S LSC G H +RRLCPCA
Sbjct: 181 CHGLCIPSTGPDQPAEVVSTAPRHMHPGACLYNTQVSQLSCKGYHLFSRRLCPCA 235
>gi|297728217|ref|NP_001176472.1| Os11g0267100 [Oryza sativa Japonica Group]
gi|255679983|dbj|BAH95200.1| Os11g0267100 [Oryza sativa Japonica Group]
Length = 208
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 123/194 (63%), Positives = 157/194 (80%)
Query: 79 QGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNS 138
+GC L QCC+SYEYCVSCC+NP+RT + VL +K+A+P T+GTY +VFDFC GRCRH+S
Sbjct: 15 RGCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVARPVTSGTYRNVFDFCMGRCRHSS 74
Query: 139 ESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSC 198
SVVHENAY SDFHHCF + N+SG+ + RL GI++ IGR+GESC SVC++ GQSC
Sbjct: 75 ASVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGINISIGRRGESCSSVCRAKGQSC 134
Query: 199 VLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSC 258
V ++L +LN+CEI+QKYM CK GC +++G DQPA+V D+APSNLNP ACLY ++ L+C
Sbjct: 135 VPSRLSVLNKCEILQKYMRCKSGCFSNLGPDQPAQVVDEAPSNLNPGACLYMQMDERLTC 194
Query: 259 DGSHRHTRRLCPCA 272
DGSH+HTRRLCPCA
Sbjct: 195 DGSHQHTRRLCPCA 208
>gi|302788558|ref|XP_002976048.1| hypothetical protein SELMODRAFT_415982 [Selaginella moellendorffii]
gi|300156324|gb|EFJ22953.1| hypothetical protein SELMODRAFT_415982 [Selaginella moellendorffii]
Length = 263
Score = 252 bits (644), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 118/235 (50%), Positives = 163/235 (69%), Gaps = 1/235 (0%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCC 98
C TVQG +LL+D GYVC ++D CCPE+G++FSC GCNL+SQCCN++EYCVSCC
Sbjct: 29 CNNTVQGPHLLADSQGYVCSVRNIDFGRGCCPEEGQQFSCSGCNLVSQCCNTFEYCVSCC 88
Query: 99 VNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMP 158
++P T KE L+ K A+ TA + SVF++C GRCRHNS SVVHENAY S++HHCF
Sbjct: 89 LDPIHTSKEFALSSKSARQLTAANFVSVFEYCTGRCRHNSGSVVHENAYTSEYHHCFFTQ 148
Query: 159 SNASGAGVTQLE-GRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMS 217
+N S +E G + S+I GR+G+SCD C+ G+ C + L L+N+C ++QK M+
Sbjct: 149 TNLSSGRDHGVEAGENTDYSLITGRRGQSCDEACRQRGKKCKASILPLINECSVLQKSMN 208
Query: 218 CKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
CK C++ G D P+EV AP +L+P AC++ ++LSC G+HR+T+RLCPC
Sbjct: 209 CKGACISGSGPDLPSEVVSSAPRHLHPGACVFNSKNTLLSCQGAHRYTKRLCPCG 263
>gi|302769810|ref|XP_002968324.1| hypothetical protein SELMODRAFT_409585 [Selaginella moellendorffii]
gi|300163968|gb|EFJ30578.1| hypothetical protein SELMODRAFT_409585 [Selaginella moellendorffii]
Length = 263
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 118/235 (50%), Positives = 163/235 (69%), Gaps = 1/235 (0%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCC 98
C TVQG +LL+D GYVC ++D CCPE+G++FSC GCNL+SQCCN++EYCVSCC
Sbjct: 29 CNNTVQGPHLLADSQGYVCSVRNIDFGRGCCPEEGQQFSCSGCNLVSQCCNTFEYCVSCC 88
Query: 99 VNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMP 158
++P T KE L+ K A+ TA + SVF++C GRCRHNS SVVHENAY S++HHCF
Sbjct: 89 LDPIHTSKEFALSSKSARQLTAANFVSVFEYCTGRCRHNSGSVVHENAYTSEYHHCFFTQ 148
Query: 159 SNASGAGVTQLE-GRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMS 217
+N S +E G + S+I GR+G+SCD C+ G+ C + L L+N+C ++QK M+
Sbjct: 149 TNLSSGRDHGVEAGENTDYSLITGRRGQSCDEACRQRGKKCKASILPLINECSVLQKSMN 208
Query: 218 CKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
CK C++ G D P+EV AP +L+P AC++ ++LSC G+HR+T+RLCPC
Sbjct: 209 CKGACVSGSGPDLPSEVVSSAPRHLHPGACVFNSKNTLLSCQGAHRYTKRLCPCG 263
>gi|449484814|ref|XP_004156988.1| PREDICTED: uncharacterized LOC101231289 [Cucumis sativus]
Length = 154
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 117/151 (77%), Positives = 131/151 (86%)
Query: 122 TYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIG 181
TY SVFDFC GRCRHNSESVVHENAY S FHHCFS+PS +SG TQLE RLSGI+VIIG
Sbjct: 4 TYASVFDFCVGRCRHNSESVVHENAYHSAFHHCFSLPSYSSGDNSTQLEFRLSGINVIIG 63
Query: 182 RQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSN 241
RQGESCDSVCKS+GQSCV NK+++LN C+I+QKY+SCK GCLAS+G DQPAEVA DAP
Sbjct: 64 RQGESCDSVCKSNGQSCVPNKILVLNHCDIMQKYLSCKGGCLASVGTDQPAEVAYDAPKE 123
Query: 242 LNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
LNP ACLY+ QSM+SCDGSH HT+RLCPCA
Sbjct: 124 LNPGACLYSSAQSMVSCDGSHPHTKRLCPCA 154
>gi|413916128|gb|AFW56060.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
Length = 186
Score = 214 bits (544), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 101/162 (62%), Positives = 127/162 (78%)
Query: 20 FLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQ 79
F S +S IRK I F + C++TVQGR+L+SDDNGYVC A+S++P S CCP G++FSCQ
Sbjct: 24 FPSPVSGIRKGIDFIQPIVCRSTVQGRHLISDDNGYVCSALSINPWSHCCPTTGDRFSCQ 83
Query: 80 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 139
GC L QCCNSYEYCVSCC+NP++ KE VL +K+AKP TAGTY +VFDFC GRCRH+S
Sbjct: 84 GCELDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143
Query: 140 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIG 181
SVVHENAY SDFHHCFS+ N+SG+ + +L GI+V++G
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVG 185
>gi|62734222|gb|AAX96331.1| hypothetical protein LOC_Os11g16560 [Oryza sativa Japonica Group]
Length = 216
Score = 207 bits (527), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 112/246 (45%), Positives = 153/246 (62%), Gaps = 39/246 (15%)
Query: 1 MPKSRLNLLFLFFQLLTLQFLS----IISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYV 56
M + L LL L +++ F + +++ IRKDIG C++TVQGR+L+SDDNG
Sbjct: 1 MATTALPLLHLLPRIVCAAFFACLPGLVAGIRKDIGLAAPIMCRSTVQGRHLISDDNGRT 60
Query: 57 CDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAK 116
CP ++ YC+ T + VL +K+A+
Sbjct: 61 P-----------CPSH----------------FAHIYCL--------TKETDVLKLKVAR 85
Query: 117 PTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGI 176
P T+GTY +VFDFC GRCRH+S SVVHENAY SDFHHCF + N+SG+ + RL GI
Sbjct: 86 PVTSGTYRNVFDFCMGRCRHSSASVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGI 145
Query: 177 SVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVAD 236
++ IGR+GESC SVC++ GQSCV ++L +LN+CEI+QKYM CK GC +++G DQPA+V D
Sbjct: 146 NISIGRRGESCSSVCRAKGQSCVPSRLSVLNKCEILQKYMRCKSGCFSNLGPDQPAQVVD 205
Query: 237 DAPSNL 242
+APSNL
Sbjct: 206 EAPSNL 211
>gi|384245502|gb|EIE18996.1| hypothetical protein COCSUDRAFT_31544 [Coccomyxa subellipsoidea
C-169]
Length = 278
Score = 167 bits (422), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 87/235 (37%), Positives = 129/235 (54%), Gaps = 4/235 (1%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCC 98
C T+QGR+ ++D G +C S+ P + CC G+K+SC CN+ CC YE+CVSCC
Sbjct: 42 CNNTIQGRWQVADSQGMLCSRQSLRPDNGCC-SAGDKYSCATCNMHDSCCADYEHCVSCC 100
Query: 99 VNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMP 158
+P L + + G + SVF++C +CR +S+S VHENAYLS FHHCFS
Sbjct: 101 QDPQHGLPDLSTVHRAPDRPETGLWPSVFEYCRNKCRTSSKSTVHENAYLSPFHHCFSS- 159
Query: 159 SNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSC 218
S ++V++G G+SC+++C+S+G+ C L LN C +++++ +C
Sbjct: 160 SGKPTTAAPPTPPIPKSVTVVLGSAGQSCNALCESTGRRCAQQHLASLNNCNVLREHYAC 219
Query: 219 KRGCLASMG-ADQPAEVADDAPSNLNPTACLYTRIQ-SMLSCDGSHRHTRRLCPC 271
+ GC G AD P V AP PT CL + + SC+ + RLCPC
Sbjct: 220 EAGCEPDSGTADAPIYVNASAPKQQRPTICLTMDSKVNQFSCEATSASNHRLCPC 274
>gi|242070783|ref|XP_002450668.1| hypothetical protein SORBIDRAFT_05g009356 [Sorghum bicolor]
gi|241936511|gb|EES09656.1| hypothetical protein SORBIDRAFT_05g009356 [Sorghum bicolor]
Length = 111
Score = 163 bits (413), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 74/111 (66%), Positives = 90/111 (81%)
Query: 28 RKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQC 87
RKDI F E C++TVQGR+L+SDDNGYVC A+S++P S CCP G++FSCQGC L QC
Sbjct: 1 RKDICFIEPIVCRSTVQGRHLISDDNGYVCSALSINPWSHCCPTTGDRFSCQGCKLDLQC 60
Query: 88 CNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNS 138
CNSYEYCVSCC+NP++ KE VL +K+AKP T GTY +VFDFC GRCRH+S
Sbjct: 61 CNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTVGTYTNVFDFCMGRCRHSS 111
>gi|159475585|ref|XP_001695899.1| predicted membrane-anchored protein [Chlamydomonas reinhardtii]
gi|158275459|gb|EDP01236.1| predicted membrane-anchored protein [Chlamydomonas reinhardtii]
Length = 269
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 141/278 (50%), Gaps = 30/278 (10%)
Query: 12 FFQLLTLQFLSIISAI---RKDIGFQETP----FCKTTVQGRYLLSDDNGYVCDAVSMDP 64
+ L + ++ +S+I + + ++ P C+ TVQG +++DD+G VC +
Sbjct: 3 LYGALVVALMASLSSIVLGERRLFIRDLPEPKTRCRNTVQGLNMVTDDHGVVCKRSDLSY 62
Query: 65 QSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNP---ARTLKEQVLNVKIAKPTTAG 121
+ CC G + C C++ +CC+ YE CVSCC+ P A ++ +Q L + + +G
Sbjct: 63 STGCC-TSGNQHDCALCDMRDRCCSEYESCVSCCLAPQHNAVSIAKQAL--RSPRHKDSG 119
Query: 122 TYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFS-----MPSNASGAGVTQLEGRLSGI 176
+G F++C G CR +S S HENAY+S HHCFS M S+ AGV + G+
Sbjct: 120 FWGDPFEYCKGICRTHSRSTAHENAYISSRHHCFSQLGRPMLSDPLPAGV------MDGV 173
Query: 177 SVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGC--LASMGADQPAEV 234
V+ G++ +CD VC + + C + L L+ C+ +++ C+ GC +A +G P+ V
Sbjct: 174 EVVTGQRNANCDDVCAAKQKKCSADHLRWLSSCDRLREQFGCEAGCEVVAGLG---PSYV 230
Query: 235 ADDAPSNLNPTACLYTRIQ-SMLSCDGSHRHTRRLCPC 271
+AP P C + LSC LCPC
Sbjct: 231 DGNAPKPARPAMCFAQPAEGGKLSCSAREEQHLMLCPC 268
>gi|428177708|gb|EKX46586.1| hypothetical protein GUITHDRAFT_107373 [Guillardia theta CCMP2712]
Length = 233
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 22/230 (9%)
Query: 15 LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG- 73
L+ + + + + + G C+ TVQG + DD GY C +D Q+ CC K
Sbjct: 5 LILMAMMGLAAGLEPSSG--SGSVCRNTVQGPKHVVDDQGYFCSVDQVDYQTGCCDTKKV 62
Query: 74 --EKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV----- 126
++SC+ C ++CC+ YE CVSCC+NP+ +++ N +A T+
Sbjct: 63 TEPRWSCRSCEAETKCCDHYENCVSCCLNPS---NDELRNTAMAASRDKKTFADAVKRKD 119
Query: 127 -FDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGE 185
F+FC CR +S+S VH NAY+S+ C S PS S + ++V +G+QGE
Sbjct: 120 SFEFCRASCRTSSKSTVHGNAYISEKRFCLS-PSILSNT-------LPADVAVTVGKQGE 171
Query: 186 SCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVA 235
SC VC S G+ C + +N CE++QK CK C + G DQPA V+
Sbjct: 172 SCAEVCSSVGKRCDTRYITSINTCEVLQKNFPCKNTCEKNYGHDQPAYVS 221
>gi|242051320|ref|XP_002463404.1| hypothetical protein SORBIDRAFT_02g043236 [Sorghum bicolor]
gi|241926781|gb|EER99925.1| hypothetical protein SORBIDRAFT_02g043236 [Sorghum bicolor]
Length = 82
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 54/83 (65%), Positives = 66/83 (79%), Gaps = 1/83 (1%)
Query: 56 VCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIA 115
VC A+S++P S CCP ++FSC+GC L QCCNSYEYCVSCC+NP++ KE VL +K A
Sbjct: 1 VCSALSINPWSHCCPTL-DRFSCRGCKLDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKEA 59
Query: 116 KPTTAGTYGSVFDFCAGRCRHNS 138
KP TAGTY +VFDFC GRCRH+S
Sbjct: 60 KPVTAGTYTNVFDFCMGRCRHSS 82
>gi|307110379|gb|EFN58615.1| hypothetical protein CHLNCDRAFT_17323, partial [Chlorella
variabilis]
Length = 225
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 73/234 (31%), Positives = 112/234 (47%), Gaps = 17/234 (7%)
Query: 46 RYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTL 105
R+ ++DD G VC S++ + CC GE SC C +CC+ YE+CVSCC+ P
Sbjct: 1 RWYIADDQGMVCSRDSLNYATGCC-TGGELHSCTSCEAKDKCCSRYEHCVSCCLRPENKP 59
Query: 106 KEQVLNVKIA--KPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASG 163
E++ ++ KP T G + + F++C CR + S HENA++ D CFS
Sbjct: 60 GEKMQSIFRGRNKPET-GHWSTPFEYCQAVCRTTARSTQHENAFILDRRFCFSKVGFVPP 118
Query: 164 AGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQ-KYMSCKRGC 222
A E S++ G G+SCD C + +C + +N C ++ K+M C G
Sbjct: 119 APALPKEA-----SLVAGAAGQSCDVACAAKQMTCRAEWFLSINDCNSLRAKFMCC--GS 171
Query: 223 LASMGADQPAEVADDAPSNLNPTAC-----LYTRIQSMLSCDGSHRHTRRLCPC 271
A+ + P V AP + P C L ++ +C ++ RRLCPC
Sbjct: 172 CAADQTEYPGYVEGSAPKHQQPAFCAFLPPLAAQLLPSFNCSHANGSVRRLCPC 225
>gi|358345744|ref|XP_003636935.1| hypothetical protein MTR_065s0010 [Medicago truncatula]
gi|355502870|gb|AES84073.1| hypothetical protein MTR_065s0010 [Medicago truncatula]
Length = 131
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/75 (68%), Positives = 56/75 (74%)
Query: 197 SCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSML 256
S VL I L IQKYMSCK GCLAS+G DQPAEV DAP +NP +CLYT QS+L
Sbjct: 28 SYVLTVSIDLRYVGSIQKYMSCKGGCLASVGTDQPAEVVYDAPKYMNPESCLYTETQSIL 87
Query: 257 SCDGSHRHTRRLCPC 271
SCDGSH+HTRRLCPC
Sbjct: 88 SCDGSHQHTRRLCPC 102
>gi|328719433|ref|XP_003246762.1| PREDICTED: UPF0454 protein C12orf49 homolog [Acyrthosiphon pisum]
Length = 193
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 49/122 (40%), Positives = 69/122 (56%), Gaps = 8/122 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCC 98
C+ TVQGR ++DD GY+C + + CC +++ C CN CC YE+CVSCC
Sbjct: 70 CRNTVQGREYIADDKGYICQRDDLLANN-CCGTTEKRYICDTCNA-DGCCVLYEHCVSCC 127
Query: 99 VNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAYLSDFHHC 154
+NP R L E VL A T + ++ F+ C +CR +S+SV HENAYL HC
Sbjct: 128 LNPNKRKLLETVLGK--ASETFRVLFSAINNHFELCLAKCRTSSQSVQHENAYLKSNKHC 185
Query: 155 FS 156
++
Sbjct: 186 YT 187
>gi|195393082|ref|XP_002055183.1| GJ19227 [Drosophila virilis]
gi|194149693|gb|EDW65384.1| GJ19227 [Drosophila virilis]
Length = 215
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 66/118 (55%), Gaps = 9/118 (7%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
C+ +VQG LL+D+ G+VC M C E +SC CN S CC YEYCVS
Sbjct: 76 CRNSVQGSQLLADERGFVCLREQMLANGCCNVELAGISYYSCHSCNSTSHCCGVYEYCVS 135
Query: 97 CCVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAYLSD 150
CC++P R+L EQVL A+ T Y SV F+ C +CR NS SV HEN Y ++
Sbjct: 136 CCLHPNKRSLLEQVLQ---AQDTQKYIYASVVDHFELCLVKCRTNSHSVEHENKYRNE 190
>gi|195130989|ref|XP_002009933.1| GI15640 [Drosophila mojavensis]
gi|193908383|gb|EDW07250.1| GI15640 [Drosophila mojavensis]
Length = 216
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 66/118 (55%), Gaps = 9/118 (7%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
C+ +VQG+ LL+D+ G+VC M C E +SC CN S CC YEYCVS
Sbjct: 77 CRNSVQGQQLLADERGFVCLREQMLVNGCCNVELAGISYYSCYSCNTTSHCCGVYEYCVS 136
Query: 97 CCVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAYLSD 150
CC++P R+L EQVL A T Y SV F+ C +CR NS SV HEN Y ++
Sbjct: 137 CCLHPNKRSLLEQVLQ---AHNTQKYIYASVVDHFELCLVKCRTNSHSVEHENKYRNE 191
>gi|194768977|ref|XP_001966587.1| GF22253 [Drosophila ananassae]
gi|190617351|gb|EDV32875.1| GF22253 [Drosophila ananassae]
Length = 217
Score = 83.6 bits (205), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 51/134 (38%), Positives = 70/134 (52%), Gaps = 5/134 (3%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
C+ +VQGR LL+D+ G+VC + P C E +SC+ CN + CC YEYCVS
Sbjct: 78 CRNSVQGRQLLADERGFVCRREELLPSGCCNTELPGIAYYSCRSCNTTTHCCAIYEYCVS 137
Query: 97 CCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLS-DFHHC 154
CC++P + L EQVL + F+ C +CR NS SV HEN Y HC
Sbjct: 138 CCLHPEKQPLLEQVLRSANSPKYIFTKVADHFELCLVKCRTNSHSVQHENQYRDPSAKHC 197
Query: 155 FSM-PSNASGAGVT 167
+ + ++ S VT
Sbjct: 198 YGLTEAHESQKNVT 211
>gi|357603409|gb|EHJ63752.1| hypothetical protein KGM_18059 [Danaus plexippus]
Length = 198
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 57/186 (30%), Positives = 96/186 (51%), Gaps = 29/186 (15%)
Query: 3 KSRLNLLFLFFQLLTLQFLSII------SAIRKDIGFQETPF---------------CKT 41
+ RL L +F LT +S + + + +D+ + PF C+
Sbjct: 11 RRRLVLGIIFASSLTYCIVSFLKEGSNKNMVYQDVSIEHKPFVWRTLQEHNETQDTLCRN 70
Query: 42 TVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCVSCC 98
++QG+ LL DD GYVC + + ++ CC + +++SC+ C + CC YEYCVSCC
Sbjct: 71 SIQGKSLLVDDRGYVCQRMDL-AKNGCCNIEAKHTQRYSCKSCKN-NNCCVIYEYCVSCC 128
Query: 99 VNPA-RTLKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFH-HCF 155
++P+ R+L E VL+ + + + ++ C +CR +S SV+HEN+Y H HCF
Sbjct: 129 LDPSKRSLLELVLSKLSAEEKVLFRSISDDYELCLTKCRTSSHSVLHENSYKDPSHKHCF 188
Query: 156 SMPSNA 161
+N+
Sbjct: 189 GDRTNS 194
>gi|449529758|ref|XP_004171865.1| PREDICTED: uncharacterized protein LOC101231289 [Cucumis sativus]
Length = 144
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/55 (69%), Positives = 44/55 (80%)
Query: 22 SIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKF 76
S IS+IR +IG C++TVQGRYLLSDDNGYVCDA+S+DP SRCC KGEKF
Sbjct: 58 STISSIRTNIGLAVNSSCRSTVQGRYLLSDDNGYVCDALSVDPLSRCCHGKGEKF 112
>gi|195044419|ref|XP_001991819.1| GH11862 [Drosophila grimshawi]
gi|193901577|gb|EDW00444.1| GH11862 [Drosophila grimshawi]
Length = 214
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/124 (41%), Positives = 71/124 (57%), Gaps = 10/124 (8%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
C+ +VQGR LL D++G+VC M C E +SC+ CN+ S CC YE+C+S
Sbjct: 77 CRNSVQGRNLLVDEHGFVCKRDQMLVNGCCNVELPGISYYSCRSCNVTSHCCAIYEHCIS 136
Query: 97 CCVNPAR-TLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAYLS-DF 151
CC++P + +L EQVL A T Y SV F+ C +CR NS SV HEN Y + D
Sbjct: 137 CCLHPNKQSLLEQVLQ---ASDTQKYIYASVVDHFELCLVKCRTNSHSVEHENKYRNEDA 193
Query: 152 HHCF 155
+C+
Sbjct: 194 KYCY 197
>gi|308805236|ref|XP_003079930.1| unnamed protein product [Ostreococcus tauri]
gi|116058387|emb|CAL53576.1| unnamed protein product [Ostreococcus tauri]
Length = 189
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 4/127 (3%)
Query: 38 FCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSC 97
C+ + ++DD G C+ +D S CCP SC+ C+ +CC YE CVSC
Sbjct: 41 LCRVSHASVSRVADDRGRSCEWYDVDTVSGCCPTTSATHSCEKCDAAVECCERYETCVSC 100
Query: 98 CVNPAR--TLK-EQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHC 154
C+ PAR TL+ E + + G + FDFCA RCR +HEN Y HC
Sbjct: 101 CLAPARRATLETEMRTHARGRNQVVTGFFDDPFDFCANRCRTQPSVTLHENEYAYSTKHC 160
Query: 155 F-SMPSN 160
F P N
Sbjct: 161 FGDYPKN 167
>gi|405975633|gb|EKC40187.1| UPF0454 protein C12orf49-like protein [Crassostrea gigas]
Length = 185
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/124 (37%), Positives = 73/124 (58%), Gaps = 6/124 (4%)
Query: 38 FCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQG-CNLLSQ--CCNSYEYC 94
C+ +VQG+ L+ D+ GYVC+ S+D + C ++G + + + C+ S+ CC YEYC
Sbjct: 50 ICRNSVQGKSLIVDERGYVCNRTSVDARGCCHVKRGSRLTSKHRCDTCSKNACCVVYEYC 109
Query: 95 VSCCVNP-ARTLKEQVL-NVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDF- 151
V+CC+ P R L +++L N + G+ F+ C +CR +S SV HEN+Y + F
Sbjct: 110 VACCLKPDKRPLLQKILTNAREGLDIALGSVSDHFELCLAKCRTSSSSVQHENSYRNAFA 169
Query: 152 HHCF 155
HCF
Sbjct: 170 KHCF 173
>gi|198471721|ref|XP_001355702.2| GA13867 [Drosophila pseudoobscura pseudoobscura]
gi|198146047|gb|EAL32761.2| GA13867 [Drosophila pseudoobscura pseudoobscura]
Length = 216
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 50/125 (40%), Positives = 67/125 (53%), Gaps = 8/125 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC----PEKGEKFSCQGCNLLSQCCNSYEYC 94
C+ +VQGR LL+D+ G+VC P S CC P G +SC+ CN + CC YEYC
Sbjct: 77 CRNSVQGRQLLTDERGFVCLREQALP-SGCCNLEMPGIG-YYSCRTCNATTSCCRLYEYC 134
Query: 95 VSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLS-DFH 152
VSCC++P + L E+VL + F+ C +CR NS SV HEN Y
Sbjct: 135 VSCCLHPDKQPLLERVLQSAHTPKYIYASVADHFELCLVKCRTNSHSVEHENKYRDGKAK 194
Query: 153 HCFSM 157
HC+ +
Sbjct: 195 HCYGI 199
>gi|328869989|gb|EGG18364.1| hypothetical protein DFA_03858 [Dictyostelium fasciculatum]
Length = 1057
Score = 80.9 bits (198), Expect = 6e-13, Method: Composition-based stats.
Identities = 42/122 (34%), Positives = 62/122 (50%), Gaps = 4/122 (3%)
Query: 36 TPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP-EKGEKFSCQGCNLLSQCCNSYEYC 94
+ +CK TVQ + DD G++C +D S CC +++SC GC+ +CC Y YC
Sbjct: 925 STYCKYTVQSSKYVVDDQGHICLRSELD--SGCCNINNFDRYSCNGCSSKYRCCEEYTYC 982
Query: 95 VSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYL-SDFHH 153
VSCC+ + E VL + + G FD C CR +S SV+H+ Y D +
Sbjct: 983 VSCCLGARQETLEAVLVQMVEGVSPIGEPHDQFDLCQLVCRTSSHSVIHQREYKHPDTKY 1042
Query: 154 CF 155
C+
Sbjct: 1043 CY 1044
>gi|195164215|ref|XP_002022944.1| GL16447 [Drosophila persimilis]
gi|194105006|gb|EDW27049.1| GL16447 [Drosophila persimilis]
Length = 216
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 50/125 (40%), Positives = 66/125 (52%), Gaps = 8/125 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC----PEKGEKFSCQGCNLLSQCCNSYEYC 94
C+ +VQGR LL+D+ G+VC P S CC P G +SC+ CN + CC YEYC
Sbjct: 77 CRNSVQGRQLLTDERGFVCLREQALP-SGCCNLEMPGIG-YYSCRTCNATTNCCRLYEYC 134
Query: 95 VSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLS-DFH 152
VSCC++P + L E VL + F+ C +CR NS SV HEN Y
Sbjct: 135 VSCCLHPDKQPLLELVLQSAYTPKYIYASVADHFELCLVKCRTNSHSVEHENKYRDGKAK 194
Query: 153 HCFSM 157
HC+ +
Sbjct: 195 HCYGI 199
>gi|194894481|ref|XP_001978075.1| GG19394 [Drosophila erecta]
gi|190649724|gb|EDV47002.1| GG19394 [Drosophila erecta]
Length = 217
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 57/152 (37%), Positives = 79/152 (51%), Gaps = 15/152 (9%)
Query: 15 LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC----P 70
L T Q L+ + + G E C+ +VQGR L+D+ G+VC + CC P
Sbjct: 54 LWTQQLLAPEEQLNRTRGDPEA-LCRNSVQGRQWLADERGFVCRREQVLTNG-CCNLELP 111
Query: 71 EKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV--- 126
G +SC+ CN + CC YEYCVSCC++P + L E+VL +A T + SV
Sbjct: 112 GIG-YYSCRTCNTSTHCCAVYEYCVSCCLHPGQQPLLERVL---LAPNTPKYIFASVADH 167
Query: 127 FDFCAGRCRHNSESVVHENAYLSD-FHHCFSM 157
F+ C +CR NS SV HEN Y HC+ +
Sbjct: 168 FELCLVKCRTNSHSVEHENKYRDPAAKHCYGL 199
>gi|242012077|ref|XP_002426767.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212510949|gb|EEB14029.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 203
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/115 (40%), Positives = 67/115 (58%), Gaps = 11/115 (9%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ +VQG+ L+ DD GYVC + S CC E +KF+C C + CC+ YEYC+
Sbjct: 75 CRNSVQGKVLIVDDRGYVCSRSDL-LSSGCCNTGSENTKKFACDTCKD-NGCCSIYEYCI 132
Query: 96 SCCVNPARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
SCC++P K+ VL + K + + SV F+ C +CR NS+SV HEN+Y
Sbjct: 133 SCCLHPD---KKGVLQNVLGKESENVLFASVSDHFELCLAKCRTNSQSVHHENSY 184
>gi|195478878|ref|XP_002100681.1| GE16041 [Drosophila yakuba]
gi|194188205|gb|EDX01789.1| GE16041 [Drosophila yakuba]
Length = 217
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 57/152 (37%), Positives = 79/152 (51%), Gaps = 15/152 (9%)
Query: 15 LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC----P 70
L T Q L+ + + G E C+ +VQGR L+D+ G+VC + CC P
Sbjct: 54 LWTQQLLAPEEQLNRTRGDPEA-RCRNSVQGRQWLADERGFVCRREEVLTNG-CCNLELP 111
Query: 71 EKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV--- 126
G +SC+ CN + CC YEYCVSCC++P + L E+VL +A T + SV
Sbjct: 112 GIG-YYSCRTCNTSTHCCGVYEYCVSCCLHPGQQPLLERVL---LAPNTPKYIFASVTDH 167
Query: 127 FDFCAGRCRHNSESVVHENAYLSD-FHHCFSM 157
F+ C +CR NS SV HEN Y HC+ +
Sbjct: 168 FELCLVKCRTNSHSVEHENKYRDPAAKHCYGL 199
>gi|28571353|ref|NP_788909.1| CG15643 [Drosophila melanogaster]
gi|7293087|gb|AAF48472.1| CG15643 [Drosophila melanogaster]
gi|17944302|gb|AAL48044.1| RE11751p [Drosophila melanogaster]
gi|220947868|gb|ACL86477.1| CG15643-PA [synthetic construct]
gi|220957174|gb|ACL91130.1| CG15643-PA [synthetic construct]
Length = 212
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 74/149 (49%), Gaps = 9/149 (6%)
Query: 15 LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC----P 70
L T Q L+ + + G E C+ +VQGR L+D+ G+VC + CC P
Sbjct: 54 LWTQQLLAPEEQLNRTRGDPEA-RCRNSVQGRQWLADERGFVCRREEVLTNG-CCNLELP 111
Query: 71 EKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDF 129
G +SC+ CN + CC YEYCVSCC++P + L E+VL + F+
Sbjct: 112 GIG-YYSCRTCNTSTHCCGVYEYCVSCCLHPGQQPLLERVLQAPNTPKYIFASVTDHFEL 170
Query: 130 CAGRCRHNSESVVHENAYLSD-FHHCFSM 157
C +CR NS SV HEN Y HC+ +
Sbjct: 171 CLVKCRTNSHSVEHENKYRDPAAKHCYGL 199
>gi|347964633|ref|XP_316825.5| AGAP000855-PA [Anopheles gambiae str. PEST]
gi|333469436|gb|EAA12112.5| AGAP000855-PA [Anopheles gambiae str. PEST]
Length = 243
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 81/167 (48%), Gaps = 19/167 (11%)
Query: 16 LTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC------ 69
L + L +A D + C+ ++QGR LL DD GYVC + CC
Sbjct: 69 LGAEVLPAGTANETDDQPAQPASCRNSIQGRSLLVDDRGYVCPRADLLANG-CCSAGSGG 127
Query: 70 ----PEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTAGTYG 124
+ G F C+ C L ++CC YEYCV+CC+NP R + E+VL A Y
Sbjct: 128 TTANADAGRLFPCRTC-LPNRCCAVYEYCVACCLNPDKRPILEEVL--AKANGRQVALYA 184
Query: 125 SV---FDFCAGRCRHNSESVVHENAYLS-DFHHCFSMPSNASGAGVT 167
V F+ C +CR NS+SV +EN Y + + HC+ S A+ A +
Sbjct: 185 EVTDQFELCLTKCRTNSQSVQNENKYRNPEQKHCYGEMSEAAWASAS 231
>gi|156380036|ref|XP_001631761.1| predicted protein [Nematostella vectensis]
gi|156218806|gb|EDO39698.1| predicted protein [Nematostella vectensis]
Length = 135
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 73/127 (57%), Gaps = 8/127 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE---KFSCQGCNLLSQCCNSYEYCV 95
C+ +VQG L+ D+ GY+C + S CC GE +++C+ C + + CC+ YE CV
Sbjct: 2 CRNSVQGPSLIVDERGYLCSRKDLSA-SGCCHSDGETTHRYNCESCQV-NNCCSIYENCV 59
Query: 96 SCCVNPA-RTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNSESVVHENAYLSD-FH 152
SCC++P + L +VLNV P + F+ C +CR +S+SV HEN+Y + +
Sbjct: 60 SCCLDPKQKELLREVLNVWRTAPNVILKSITDQFELCLTKCRTSSKSVWHENSYKDNKYK 119
Query: 153 HCFSMPS 159
HCF + S
Sbjct: 120 HCFGLTS 126
>gi|195432368|ref|XP_002064195.1| GK19833 [Drosophila willistoni]
gi|194160280|gb|EDW75181.1| GK19833 [Drosophila willistoni]
Length = 215
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 59/112 (52%), Gaps = 3/112 (2%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE--KFSCQGCNLLSQCCNSYEYCVS 96
C+ +VQG L+D+ G+VC + P C E + +SC CN S CC YEYCVS
Sbjct: 76 CRNSVQGIQWLADELGFVCQREQILPSGCCNSELPDVSYYSCHTCNQTSMCCEIYEYCVS 135
Query: 97 CCVNP-ARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
CC++P R L E +L A+ F+ C +CR NS SV HEN Y
Sbjct: 136 CCLHPDKRPLLETILETVNAQRYIYAQVEDHFELCLVKCRTNSHSVEHENKY 187
>gi|195355329|ref|XP_002044144.1| GM22548 [Drosophila sechellia]
gi|194129433|gb|EDW51476.1| GM22548 [Drosophila sechellia]
Length = 197
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 48/125 (38%), Positives = 65/125 (52%), Gaps = 8/125 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC----PEKGEKFSCQGCNLLSQCCNSYEYC 94
C+ +VQGR L+D+ G+VC + CC P G +SC+ CN + CC YEYC
Sbjct: 63 CRNSVQGRQWLADERGFVCRREEVLTNG-CCNLELPGIG-YYSCRTCNTSTHCCGVYEYC 120
Query: 95 VSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSD-FH 152
VSCC++P + L E+VL + F+ C +CR NS SV HEN Y
Sbjct: 121 VSCCLHPGQQPLLERVLQAPNTPKYIFASVTDHFELCLVKCRTNSHSVEHENKYRDPAAK 180
Query: 153 HCFSM 157
HCF +
Sbjct: 181 HCFGL 185
>gi|391333374|ref|XP_003741090.1| PREDICTED: UPF0454 protein C12orf49 homolog [Metaseiulus
occidentalis]
Length = 193
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 67/123 (54%), Gaps = 8/123 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG-EKFSCQGCNLLSQCCNSYEYCVSC 97
C+ ++QG ++DD GYVC+ + C ++G E++ C C CC+SYE+C+SC
Sbjct: 69 CRNSLQGPQWVADDRGYVCNRTLLLSSGCCDADQGLERYPCDSC--ADGCCSSYEFCISC 126
Query: 98 CVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAYLSDF-H 152
C+ P R E+VL P + Y V FD C +CR +S+SV +EN Y+
Sbjct: 127 CLRPQQRPTLERVLGRLGRSPKSLKLYALVQTQFDLCLTKCRTSSQSVRNENTYIDPVKK 186
Query: 153 HCF 155
HCF
Sbjct: 187 HCF 189
>gi|289742927|gb|ADD20211.1| uncharacterized conserved protein [Glossina morsitans morsitans]
Length = 217
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/114 (39%), Positives = 62/114 (54%), Gaps = 4/114 (3%)
Query: 38 FCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEK--FSCQGCNLLSQCCNSYEYCV 95
C+ ++QG+ LL DD G+VC+ + C E E ++C+ CN S CC+ YE+CV
Sbjct: 78 ICRNSLQGKELLVDDRGFVCNREHILQNGCCNVESPETLYYTCETCNNSSHCCSIYEFCV 137
Query: 96 SCCVNP-ARTLKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC++P R L E L V + T F+ C +CR NS SV HEN Y
Sbjct: 138 SCCLHPEKRPLLELALQAVNGRQLTVFAQVLDQFELCLVKCRTNSHSVEHENRY 191
>gi|124486989|ref|NP_001074705.1| UPF0454 protein C12orf49 homolog precursor [Mus musculus]
gi|81897450|sp|Q8BTG6.1|CL049_MOUSE RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
gi|26355654|dbj|BAC41192.1| unnamed protein product [Mus musculus]
gi|74355387|gb|AAI04374.1| RIKEN cDNA 2410131K14 gene [Mus musculus]
gi|74355856|gb|AAI04373.1| RIKEN cDNA 2410131K14 gene [Mus musculus]
gi|148687847|gb|EDL19794.1| mCG15258 [Mus musculus]
Length = 205
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 66/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ +VQG++LL+D+ GYVC+ + CC +++ C GC L + CC +YEYCV
Sbjct: 75 CRNSVQGKHLLTDELGYVCERKDLLANG-CCDVSVPSTKQYCCDGC-LANGCCEAYEYCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P++ L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|351694757|gb|EHA97675.1| hypothetical protein GW7_10171 [Heterocephalus glaber]
Length = 273
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 66/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC L + CC +YEYCV
Sbjct: 143 CRNSIQGKHLITDELGYVCERKDLLANG-CCDVSIPSAKQYCCDGC-LANGCCGAYEYCV 200
Query: 96 SCCVNPAR-TLKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P++ L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 201 SCCLQPSKQLLLERFLNRAAVAFQNLFLAVEDHFELCLAKCRTSSQSVQHENTY 254
>gi|157818285|ref|NP_001102536.1| uncharacterized protein LOC498188 [Rattus norvegicus]
gi|149063483|gb|EDM13806.1| similar to hypothetical protein FLJ21415 (predicted) [Rattus
norvegicus]
Length = 205
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 66/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ +VQG++L++D+ GYVC+ + CC +++ C GC L + CC +YEYCV
Sbjct: 75 CRNSVQGKHLITDELGYVCERKDLLANG-CCDVSVPSTKQYCCNGC-LANGCCEAYEYCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P++ L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|194214270|ref|XP_001489615.2| PREDICTED: UPF0454 protein C12orf49 homolog, partial [Equus
caballus]
Length = 178
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 68/114 (59%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC + ++SC GC L S CC++YE+CV
Sbjct: 48 CRNSIQGKHLITDELGYVCERRDLLVNG-CCDVRVPSTRQYSCDGC-LSSGCCSAYEHCV 105
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P++ L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 106 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 159
>gi|440791825|gb|ELR13063.1| hypothetical protein ACA1_097480 [Acanthamoeba castellanii str.
Neff]
Length = 264
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 76/142 (53%), Gaps = 12/142 (8%)
Query: 30 DIGFQETPF--CKTTVQGRYLLSDDNGYVC--DAVSMDPQSRCCPEKGE----KFSCQGC 81
D G +T C + QGR ++D+ GYVC D + CC E ++SCQ C
Sbjct: 123 DAGLGKTWLRTCDNSKQGRTYVTDERGYVCRWDQMYSGAGYGCCWESDRTNISRYSCQQC 182
Query: 82 NLLSQCCNSYEYCVSCCVNPA---RTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNS 138
++ + CC YE+CVSCC+NP+ R LN++ ++ S+F+ C CR +S
Sbjct: 183 DIKTTCCLEYEFCVSCCLNPSQEQRVASWLQLNIEDSREREEMAPLSLFEQCLALCRTSS 242
Query: 139 ESVVHENAYLSDFHHCFS-MPS 159
S++ ++ ++S H C++ +PS
Sbjct: 243 ASILRQHIFISPSHFCYAGVPS 264
>gi|148224108|ref|NP_001085845.1| UPF0454 protein C12orf49 homolog precursor [Xenopus laevis]
gi|82184289|sp|Q6GNT2.1|CL049_XENLA RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
gi|49115513|gb|AAH73420.1| MGC80896 protein [Xenopus laevis]
Length = 205
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/123 (35%), Positives = 69/123 (56%), Gaps = 8/123 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ +VQG+ L++DD GY+C+ + CC + +SC+ C L + CC+ YE+CV
Sbjct: 75 CRNSVQGKLLVTDDMGYICERKELLANG-CCNINVASTKLYSCETC-LPNGCCSVYEFCV 132
Query: 96 SCCVNPARTLK-EQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDF-H 152
SCC+ P + L+ E+ LN +A F+ C +CR +S+SV HEN Y +
Sbjct: 133 SCCLQPNKQLQLERFLNKAAVAFQNLFQAVEDHFELCLAKCRTSSQSVQHENTYRNPIAK 192
Query: 153 HCF 155
HC+
Sbjct: 193 HCY 195
>gi|157167582|ref|XP_001655053.1| hypothetical protein AaeL_AAEL010931 [Aedes aegypti]
gi|157167584|ref|XP_001655054.1| hypothetical protein AaeL_AAEL010931 [Aedes aegypti]
gi|157167586|ref|XP_001655055.1| hypothetical protein AaeL_AAEL010931 [Aedes aegypti]
gi|94469020|gb|ABF18359.1| uncharacterized conserved protein [Aedes aegypti]
gi|108872807|gb|EAT37032.1| AAEL010931-PA [Aedes aegypti]
gi|108872808|gb|EAT37033.1| AAEL010931-PB [Aedes aegypti]
gi|108872809|gb|EAT37034.1| AAEL010931-PC [Aedes aegypti]
Length = 228
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/123 (38%), Positives = 63/123 (51%), Gaps = 8/123 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ +VQG+ L+ DD GYVC + + CC E + F C C S CC YEYCV
Sbjct: 75 CRNSVQGKALIVDDRGYVCTRFEL-LWTGCCNVEVESTKLFHCDTCQS-SNCCAIYEYCV 132
Query: 96 SCCVNP-ARTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNSESVVHENAYLS-DFH 152
SCC++P R L E+VL + T ++ C +CR NS SV +EN Y
Sbjct: 133 SCCMHPNKRPLLERVLAKATGRQIAVYATVNDQYELCLTKCRTNSHSVQNENKYRDPKLK 192
Query: 153 HCF 155
HC+
Sbjct: 193 HCY 195
>gi|62858887|ref|NP_001016016.1| uncharacterized protein LOC548770 [Xenopus (Silurana) tropicalis]
gi|183985985|gb|AAI66293.1| hypothetical protein LOC548770 [Xenopus (Silurana) tropicalis]
gi|213627063|gb|AAI70669.1| hypothetical protein LOC548770 [Xenopus (Silurana) tropicalis]
gi|213627065|gb|AAI70671.1| hypothetical protein LOC548770 [Xenopus (Silurana) tropicalis]
Length = 204
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 64/114 (56%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ +VQG+ L++DD GY+C+ + CC + +SC+ C L + CC+ YEYCV
Sbjct: 74 CRNSVQGKLLITDDMGYICERKELLANG-CCNINVASTKLYSCETC-LSNGCCSVYEYCV 131
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 132 SCCLQPNKQLLLERFLNRAAVAFQNLFQAVEDHFELCLAKCRTSSQSVQHENTY 185
>gi|321477285|gb|EFX88244.1| hypothetical protein DAPPUDRAFT_305537 [Daphnia pulex]
Length = 222
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 69/118 (58%), Gaps = 15/118 (12%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
C+ +VQG+ L++DD GYVC ++ P C E + +SC+ C+ + CC+ YEYC+S
Sbjct: 79 CRNSVQGKALIADDKGYVCSRQNVMPNGCCKAETDGTKHYSCETCH-NNGCCSIYEYCIS 137
Query: 97 CCVNPARTLKEQVLNVKIAKPTTAGTYGSV-------FDFCAGRCRHNSESVVHENAY 147
CC+ P K++VL + K AGT + F+ C +CR +S+SV HEN+Y
Sbjct: 138 CCLQPD---KKEVLQDVLGK--AAGTLNLLLTSVTDHFELCLTKCRTSSQSVQHENSY 190
>gi|348584144|ref|XP_003477832.1| PREDICTED: UPF0454 protein C12orf49 homolog [Cavia porcellus]
Length = 205
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 67/114 (58%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC L + CC++YEYCV
Sbjct: 75 CRNSIQGKHLITDELGYVCERKDLLANG-CCDGSVPSTKQYCCDGC-LANGCCSAYEYCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P++ L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAVAFQNLFLAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|115496644|ref|NP_001069491.1| UPF0454 protein C12orf49 homolog precursor [Bos taurus]
gi|122133614|sp|Q17QN8.1|CL049_BOVIN RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
gi|109659176|gb|AAI18255.1| Chromosome 12 open reading frame 49 ortholog [Bos taurus]
gi|296478528|tpg|DAA20643.1| TPA: hypothetical protein LOC534423 precursor [Bos taurus]
Length = 205
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 66/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC L S CC++YEYCV
Sbjct: 75 CRNSIQGKHLITDELGYVCERRDLLVNG-CCNVNVPGTKQYCCDGC-LSSGCCSAYEYCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|417397061|gb|JAA45564.1| Putative conserved secreted protein precursor [Desmodus rotundus]
Length = 205
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 74/132 (56%), Gaps = 9/132 (6%)
Query: 23 IISAIRKDIGFQETPF--CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFS 77
I+ ++ ++G P C+ ++QG++L++D+ GYVC+ + CC +++
Sbjct: 57 ILWKVQFNLGNSSRPSNQCRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSSKQYC 115
Query: 78 CQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCR 135
C GC L + CC++YE+CVSCC+ P + L E+ LN +A F+ C +CR
Sbjct: 116 CDGC-LSNGCCSAYEHCVSCCLQPGKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCR 174
Query: 136 HNSESVVHENAY 147
+S+SV HEN Y
Sbjct: 175 TSSQSVQHENTY 186
>gi|91087999|ref|XP_973740.1| PREDICTED: similar to CG15643 CG15643-PA [Tribolium castaneum]
gi|270011897|gb|EFA08345.1| hypothetical protein TcasGA2_TC005988 [Tribolium castaneum]
Length = 203
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 67/117 (57%), Gaps = 13/117 (11%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG+ L+ DD G+VC + S CC P+ ++SC C + CC YE+C+
Sbjct: 69 CRNSIQGKVLIVDDRGFVCPRHEI-LHSGCCNDTPQTSPQYSCDTCKG-NNCCAIYEFCI 126
Query: 96 SCCVNPARTLKEQVLNVKIAKPTTAGT--YGSV---FDFCAGRCRHNSESVVHENAY 147
SCC++P K+++L + K T + SV F+ C +CR NS+SV HEN+Y
Sbjct: 127 SCCLHPD---KKEMLENLLGKATEQNNVLFASVTDHFELCLAKCRTNSQSVQHENSY 180
>gi|426247320|ref|XP_004017434.1| PREDICTED: UPF0454 protein C12orf49 homolog [Ovis aries]
Length = 205
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 65/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC L S CC +YEYCV
Sbjct: 75 CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPGTKQYCCDGC-LSSGCCGAYEYCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|311270685|ref|XP_003132942.1| PREDICTED: UPF0454 protein C12orf49 homolog [Sus scrofa]
Length = 205
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 67/114 (58%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC L + CC++YEYCV
Sbjct: 75 CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPGTKQYCCDGC-LSNGCCSAYEYCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P++ L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|440912218|gb|ELR61808.1| hypothetical protein M91_20260, partial [Bos grunniens mutus]
Length = 201
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 66/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC L S CC++YEYCV
Sbjct: 71 CRNSIQGKHLITDELGYVCERRDLLVNG-CCNVNVPGTKQYCCDGC-LSSGCCSAYEYCV 128
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 129 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 182
>gi|56753599|gb|AAW25002.1| SJCHGC02079 protein [Schistosoma japonicum]
Length = 244
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/126 (34%), Positives = 70/126 (55%), Gaps = 7/126 (5%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
C+ Q LL+D+ GY+C+ +++ S CCP + ++F C C + CC+ YE+CVS
Sbjct: 118 CRHIKQSPILLADELGYLCNYLNLLKNS-CCPFQNLTQRFVCHSCKF-NHCCSVYEHCVS 175
Query: 97 CCVNPA-RTLKEQVL-NVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLS-DFHH 153
CC+NP + L +QV+ N F+FC CR +S +V+HEN Y + D +
Sbjct: 176 CCLNPTNKPLWDQVMQNANANSRRHLKLAIDAFEFCVAVCRTSSLTVLHENKYRNLDEIY 235
Query: 154 CFSMPS 159
CF + +
Sbjct: 236 CFGLET 241
>gi|57099849|ref|XP_540555.1| PREDICTED: UPF0454 protein C12orf49 homolog [Canis lupus
familiaris]
gi|355735600|gb|AES11718.1| hypothetical protein [Mustela putorius furo]
Length = 205
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GY+C+ + CC +++ C GC L CC +YEYCV
Sbjct: 75 CRNSIQGKHLITDELGYICERKDLLVNG-CCNVNVPGTKQYRCDGC-LPHGCCGAYEYCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P++ L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|301770727|ref|XP_002920782.1| PREDICTED: UPF0454 protein C12orf49 homolog [Ailuropoda
melanoleuca]
gi|281343735|gb|EFB19319.1| hypothetical protein PANDA_009565 [Ailuropoda melanoleuca]
Length = 205
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GY+C+ + CC +++ C GC L CC +YEYCV
Sbjct: 75 CRNSIQGKHLITDELGYICERKDLLVNG-CCNVNVPGTKQYRCDGC-LPHGCCGAYEYCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P++ L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|354466980|ref|XP_003495949.1| PREDICTED: UPF0454 protein C12orf49 homolog [Cricetulus griseus]
gi|344237025|gb|EGV93128.1| UPF0454 protein C12orf49-like [Cricetulus griseus]
Length = 205
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 66/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ +VQG++L++D+ GYVC+ + CC +++ C GC L + CC +YE+CV
Sbjct: 75 CRNSVQGKHLITDELGYVCERKDLLANG-CCDVTVPSTKQYCCDGC-LANGCCEAYEHCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P++ L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAMAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|403281581|ref|XP_003932260.1| PREDICTED: UPF0454 protein C12orf49 homolog [Saimiri boliviensis
boliviensis]
Length = 205
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 73/132 (55%), Gaps = 9/132 (6%)
Query: 23 IISAIRKDIGFQETPF--CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFS 77
I+ ++ ++G P C+ ++QG++L++D+ GYVC+ + CC +++
Sbjct: 57 ILWKVQFNLGNSSRPSNQCRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYC 115
Query: 78 CQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCR 135
C GC + CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR
Sbjct: 116 CDGC-WPNGCCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCR 174
Query: 136 HNSESVVHENAY 147
+S+SV HEN Y
Sbjct: 175 TSSQSVQHENTY 186
>gi|296213028|ref|XP_002753098.1| PREDICTED: UPF0454 protein C12orf49-like isoform 1 [Callithrix
jacchus]
Length = 205
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 73/132 (55%), Gaps = 9/132 (6%)
Query: 23 IISAIRKDIGFQETPF--CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFS 77
I+ ++ ++G P C+ ++QG++L++D+ GYVC+ + CC +++
Sbjct: 57 ILWKVQFNLGNSSRPSNQCRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYC 115
Query: 78 CQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCR 135
C GC + CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR
Sbjct: 116 CDGC-WPNGCCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCR 174
Query: 136 HNSESVVHENAY 147
+S+SV HEN Y
Sbjct: 175 TSSQSVQHENTY 186
>gi|344295368|ref|XP_003419384.1| PREDICTED: UPF0454 protein C12orf49 homolog [Loxodonta africana]
Length = 174
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 74/132 (56%), Gaps = 9/132 (6%)
Query: 23 IISAIRKDIGFQETPF--CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFS 77
I ++ ++G P C+ ++QG++L++D+ GYVC+ + CC E++
Sbjct: 26 IAWKVQFNLGNSSRPSNQCRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTEQYC 84
Query: 78 CQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCR 135
C GC L + CC++YEYCVSCC+ P++ L E+ L+ +A F+ C +CR
Sbjct: 85 CNGC-LSNGCCSAYEYCVSCCLQPSKQLLLERFLSRAAVAFQNLFMAVEDHFELCLAKCR 143
Query: 136 HNSESVVHENAY 147
+S+SV HEN Y
Sbjct: 144 TSSQSVQHENTY 155
>gi|444723218|gb|ELW63879.1| hypothetical protein TREES_T100018686 [Tupaia chinensis]
Length = 205
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 66/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC L S CC +YE+CV
Sbjct: 75 CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVHVPTTKQYCCDGC-LPSGCCGAYEHCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P++ L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|410976770|ref|XP_003994786.1| PREDICTED: UPF0454 protein C12orf49 homolog [Felis catus]
Length = 171
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 67/114 (58%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GY+C+ + CC +++ C GC L + CC++YEYCV
Sbjct: 41 CRNSIQGKHLITDELGYICERKDLLVNG-CCNVNVPGTKQYRCDGC-LPNGCCSAYEYCV 98
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P++ L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 99 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 152
>gi|346472337|gb|AEO36013.1| hypothetical protein [Amblyomma maculatum]
Length = 206
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 69/125 (55%), Gaps = 10/125 (8%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE--KGEKFSCQGCNLLSQCCNSYEYCVS 96
C+ +VQG+ L++DD GYVC + C E ++ C C L + CC+ YEYC+S
Sbjct: 72 CRNSVQGKVLVADDRGYVCQRSEVASNGCCIAEATSARQYCCDTC-LPNGCCSIYEYCIS 130
Query: 97 CCVNPAR-TLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAYLS-DF 151
CC++P + TL +++L A T S+ F+ C +CR +S SV HEN+Y
Sbjct: 131 CCMHPQKITLLQKILGK--ASDTFKMLVASITDHFELCLTKCRTSSNSVHHENSYRDPKA 188
Query: 152 HHCFS 156
HC++
Sbjct: 189 KHCYA 193
>gi|241792024|ref|XP_002400643.1| conserved hypothetical protein [Ixodes scapularis]
gi|215510808|gb|EEC20261.1| conserved hypothetical protein [Ixodes scapularis]
Length = 209
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 8/134 (5%)
Query: 29 KDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE--KGEKFSCQGCNLLSQ 86
KD T C+ +VQG+ L++DD GYVC + C E +++ C C +
Sbjct: 61 KDKNSSATISCRNSVQGKVLVADDRGYVCQRSEVASTGCCIVESPNTKRYCCDTCQ-PNG 119
Query: 87 CCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVH 143
CC+ YEYC+SCC+ P + Q + K A T S+ F+ C +CR +SESV H
Sbjct: 120 CCSIYEYCISCCMQPQKIALLQKIMGK-ASETFKMLVASITDHFELCLTKCRTSSESVHH 178
Query: 144 ENAYLS-DFHHCFS 156
EN+Y HC++
Sbjct: 179 ENSYRDPSAKHCYA 192
>gi|412993097|emb|CCO16630.1| predicted protein [Bathycoccus prasinos]
Length = 152
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 62/128 (48%), Gaps = 12/128 (9%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC-PEKG--EKFSCQGCNL-----LSQCCNS 90
C T QG + ++DDNGY CD + P + CC PEK +++ C C+L +CC +
Sbjct: 8 CVNTKQGLHHITDDNGYTCDWFDVFPPTGCCPPEKAIEKRYGCSACDLAKGGETEKCCGT 67
Query: 91 YEYCVSCCVNPARTLKEQVLN---VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
YE CVSCC N + E + +P T G + F C +CR VHE+ Y
Sbjct: 68 YESCVSCCQNAKVSTAETDRTREPIGRMQPQT-GYFSDAFSHCKSKCRTQPTVTVHESTY 126
Query: 148 LSDFHHCF 155
+ CF
Sbjct: 127 GYEKRFCF 134
>gi|402580999|gb|EJW74948.1| hypothetical protein WUBG_14143 [Wuchereria bancrofti]
Length = 183
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 69/132 (52%), Gaps = 8/132 (6%)
Query: 35 ETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE---KFSCQGCNLLSQCCNSY 91
E C+ T+QGR ++DD GY+C+ + S C E G +FSC C+L + CC+ Y
Sbjct: 49 EGRACRNTIQGRTSVTDDRGYICERAFL--MSNGCCEVGTQSIRFSCVWCDLYTGCCSHY 106
Query: 92 EYCVSCCVNPART--LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLS 149
E CVSCC+ P + L + +++ + + F+ C +CR +S SV EN Y S
Sbjct: 107 EQCVSCCLRPEQKNILLKMIVSTSGHRLRQILSATDQFELCMLKCRTSSNSVHSENKYKS 166
Query: 150 D-FHHCFSMPSN 160
+ +C+ N
Sbjct: 167 EKMKYCYKQEPN 178
>gi|62897527|dbj|BAD96703.1| hypothetical protein FLJ21415 variant [Homo sapiens]
Length = 205
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC + CC++YEYCV
Sbjct: 75 CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 132
Query: 96 SCCVNPARTLK-EQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQLPLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|312070660|ref|XP_003138249.1| hypothetical protein LOAG_02664 [Loa loa]
gi|307766590|gb|EFO25824.1| hypothetical protein LOAG_02664 [Loa loa]
Length = 224
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 66/131 (50%), Gaps = 6/131 (4%)
Query: 35 ETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP--EKGEKFSCQGCNLLSQCCNSYE 92
E C+ TVQGR ++DD GY C+ + CC + +FSC C+L + CC+ YE
Sbjct: 90 EGKTCRNTVQGRTSVTDDRGYTCERAFLMLNG-CCEVGTQSVRFSCAWCDLYTGCCSYYE 148
Query: 93 YCVSCCVNPAR--TLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSD 150
CVSCC+ P + L E + + + + F+ C +CR +S SV EN Y S+
Sbjct: 149 QCVSCCLRPEQKNILLEMIESTSGHRLRQILSATDQFELCMLKCRTSSNSVHSENKYKSE 208
Query: 151 -FHHCFSMPSN 160
+C+ N
Sbjct: 209 KMKYCYKQEPN 219
>gi|13376060|ref|NP_079014.1| UPF0454 protein C12orf49 precursor [Homo sapiens]
gi|109098869|ref|XP_001112172.1| PREDICTED: UPF0454 protein C12orf49-like isoform 1 [Macaca mulatta]
gi|297693075|ref|XP_002823850.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Pongo
abelii]
gi|402887789|ref|XP_003907263.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Papio
anubis]
gi|74733622|sp|Q9H741.1|CL049_HUMAN RecName: Full=UPF0454 protein C12orf49; Flags: Precursor
gi|10437508|dbj|BAB15058.1| unnamed protein product [Homo sapiens]
gi|18044293|gb|AAH19843.1| Chromosome 12 open reading frame 49 [Homo sapiens]
gi|119618494|gb|EAW98088.1| chromosome 12 open reading frame 49, isoform CRA_a [Homo sapiens]
gi|119618495|gb|EAW98089.1| chromosome 12 open reading frame 49, isoform CRA_a [Homo sapiens]
gi|380816602|gb|AFE80175.1| hypothetical protein LOC79794 precursor [Macaca mulatta]
gi|383413575|gb|AFH30001.1| hypothetical protein LOC79794 precursor [Macaca mulatta]
gi|384949484|gb|AFI38347.1| hypothetical protein LOC79794 precursor [Macaca mulatta]
Length = 205
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC + CC++YEYCV
Sbjct: 75 CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|114647164|ref|XP_001154941.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Pan
troglodytes]
gi|410209358|gb|JAA01898.1| chromosome 12 open reading frame 49 [Pan troglodytes]
gi|410255302|gb|JAA15618.1| chromosome 12 open reading frame 49 [Pan troglodytes]
gi|410289894|gb|JAA23547.1| chromosome 12 open reading frame 49 [Pan troglodytes]
gi|410335531|gb|JAA36712.1| chromosome 12 open reading frame 49 [Pan troglodytes]
Length = 205
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC + CC++YEYCV
Sbjct: 75 CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|109098867|ref|XP_001112213.1| PREDICTED: UPF0454 protein C12orf49-like isoform 2 [Macaca mulatta]
Length = 272
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC + CC++YEYCV
Sbjct: 75 CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|395833970|ref|XP_003789990.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Otolemur
garnettii]
Length = 205
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC + CC++YEYCV
Sbjct: 75 CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQLLLERFLNRAAMAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|307207910|gb|EFN85471.1| UPF0454 protein C12orf49-like protein [Harpegnathos saltator]
Length = 276
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/183 (31%), Positives = 86/183 (46%), Gaps = 26/183 (14%)
Query: 3 KSRLNLLFLFFQLLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSM 62
K+ L + L + + S+ +A + P C+ ++QG+ L+ D+ G VC +
Sbjct: 29 KAPLWQMELLERAFSENAASVDTANLNESDASTQP-CRNSIQGKALIVDERGIVCARHEV 87
Query: 63 DPQSRCCPE--------------KGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLK-- 106
P C E + E++SC+ CN CC YEYCVSCC++P + +K
Sbjct: 88 LPNGCCRIEQKQSPKNEETPIMARRERYSCKTCNPQG-CCTIYEYCVSCCLHPGKQIKGR 146
Query: 107 EQVL--NVKIAKPTTAGTYG----SVFDFCAGRCRHNSESVVHENAYLSDF-HHCFSM-P 158
+ V+ +VK K A F C CR +S SV HEN Y F HC++M P
Sbjct: 147 KDVVSGSVKAQKEEDAVRTRIRNLDRFQICLAACRTSSASVRHENTYKDPFAKHCYAMQP 206
Query: 159 SNA 161
SN+
Sbjct: 207 SNS 209
>gi|397525490|ref|XP_003832699.1| PREDICTED: UPF0454 protein C12orf49 homolog [Pan paniscus]
Length = 247
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC + CC++YEYCV
Sbjct: 117 CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 174
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 175 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 228
>gi|427782911|gb|JAA56907.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 210
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 69/125 (55%), Gaps = 10/125 (8%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
C+ +VQG+ L++DD GYVC + C E ++ C C + CC+ YEYC+S
Sbjct: 72 CRNSVQGKVLVADDRGYVCQRSEVASNGCCIAEASSTRRYCCDTCQP-NGCCSIYEYCIS 130
Query: 97 CCVNPAR-TLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAYLS-DF 151
CC++P + T+ +++L A T S+ F+ C +CR +S+SV HEN+Y
Sbjct: 131 CCMHPQKITILQKILGK--ASDTFKMLVASITDHFELCLTKCRTSSDSVHHENSYRDPKA 188
Query: 152 HHCFS 156
HC++
Sbjct: 189 KHCYA 193
>gi|225711380|gb|ACO11536.1| UPF0454 protein C12orf49 homolog precursor [Caligus rogercresseyi]
Length = 210
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 68/128 (53%), Gaps = 8/128 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
C+ T+QG L++DD G +C +M QS CC ++C CNL CC++YE CV
Sbjct: 78 CRHTIQGSLLIADDQGVLCLRKNM-LQSGCCERNVTGSTLYACSHCNLDMGCCSAYEACV 136
Query: 96 SCCVNP-ARTLKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLS-DFH 152
SCC++P R VLN + + ++ C +CR +S SV HEN Y + ++
Sbjct: 137 SCCMSPDKRAFLMSVLNEARSLSNILILSVSDQYELCLAKCRTSSRSVHHENTYKNPEYK 196
Query: 153 HCF-SMPS 159
C+ +PS
Sbjct: 197 FCYKKIPS 204
>gi|332250748|ref|XP_003274513.1| PREDICTED: UPF0454 protein C12orf49 homolog [Nomascus leucogenys]
Length = 286
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC + CC++YEYCV
Sbjct: 156 CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 213
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 214 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 267
>gi|256086857|ref|XP_002579602.1| hypothetical protein [Schistosoma mansoni]
Length = 392
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 62/112 (55%), Gaps = 5/112 (4%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
C+ T Q L+ D+ GY+C+ ++ S CCP + ++F C C + CC+ YE+CVS
Sbjct: 66 CRRTKQSPILVVDELGYLCNYFNLLTNS-CCPFQNVTQRFFCNSCKS-NHCCSIYEHCVS 123
Query: 97 CCVNPA-RTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
CC+NP ++L ++V N+ F+FC CR +S +V HEN Y
Sbjct: 124 CCLNPINKSLWDKVQNLDANYKRHLQLATDAFEFCIAVCRTSSLTVFHENRY 175
>gi|353230151|emb|CCD76322.1| hypothetical protein Smp_168260 [Schistosoma mansoni]
Length = 375
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 62/112 (55%), Gaps = 5/112 (4%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
C+ T Q L+ D+ GY+C+ ++ S CCP + ++F C C + CC+ YE+CVS
Sbjct: 66 CRRTKQSPILVVDELGYLCNYFNLLTNS-CCPFQNVTQRFFCNSCKS-NHCCSIYEHCVS 123
Query: 97 CCVNPA-RTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
CC+NP ++L ++V N+ F+FC CR +S +V HEN Y
Sbjct: 124 CCLNPINKSLWDKVQNLDANYKRHLQLATDAFEFCIAVCRTSSLTVFHENRY 175
>gi|56119050|ref|NP_001007842.1| UPF0454 protein C12orf49 homolog precursor [Gallus gallus]
gi|82081584|sp|Q5ZJE4.1|CL049_CHICK RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
gi|53133640|emb|CAG32149.1| hypothetical protein RCJMB04_18o22 [Gallus gallus]
Length = 205
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 64/114 (56%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ +VQG+ L++D+ GY+C+ + CC + +SC C L + CC+ YEYCV
Sbjct: 75 CRNSVQGKLLITDELGYICERKDLLVNG-CCNVNVPSTKLYSCDSC-LPNGCCSVYEYCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P++ L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQHLLERFLNRAAMAFQNLFMAVEDRFELCLAKCRTSSQSVQHENTY 186
>gi|355564717|gb|EHH21217.1| hypothetical protein EGK_04231, partial [Macaca mulatta]
gi|355786567|gb|EHH66750.1| hypothetical protein EGM_03800, partial [Macaca fascicularis]
Length = 169
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC + CC++YEYCV
Sbjct: 39 CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 96
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 97 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 150
>gi|224072158|ref|XP_002200066.1| PREDICTED: UPF0454 protein C12orf49 homolog [Taeniopygia guttata]
Length = 205
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 64/114 (56%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ +VQG+ L++D+ GY+C+ + CC + +SC C L + CC+ YE+CV
Sbjct: 75 CRNSVQGKLLITDELGYICERKDLLVNG-CCNVHVPSTKLYSCDSC-LPNGCCSIYEFCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P++ L E+ LN IA F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQHLLERFLNRAAIAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186
>gi|324509188|gb|ADY43866.1| Unknown [Ascaris suum]
Length = 218
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 78/156 (50%), Gaps = 20/156 (12%)
Query: 18 LQFLSIISAIRKDIGFQETP---FCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP--EK 72
+Q+L+ SA R ET C+ T QG+ ++DD GY CD + CC
Sbjct: 65 IQWLAD-SAWRSGSDINETTGETACRNTRQGKTTVTDDRGYTCDRSYLMLNG-CCEVIPL 122
Query: 73 GEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAG-------TYGS 125
+F+C GC+L S CC+ YE+CV CC+ P + KE +L + TT+G +
Sbjct: 123 SVRFNCNGCDLHSGCCSFYEHCVGCCMRPDQ--KEILLEM---IETTSGHRLRQILSATD 177
Query: 126 VFDFCAGRCRHNSESVVHENAYLSD-FHHCFSMPSN 160
F+ C +CR +S SV EN Y S+ +C+ N
Sbjct: 178 QFELCTLKCRTSSNSVHSENKYKSEALKYCYKQEPN 213
>gi|348533454|ref|XP_003454220.1| PREDICTED: UPF0454 protein C12orf49 homolog [Oreochromis niloticus]
Length = 200
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 65/116 (56%), Gaps = 11/116 (9%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG+ LL+D+ GYVC+ + CC ++ C+ C L + CC+ YEYCV
Sbjct: 70 CRNSIQGKTLLTDELGYVCERKDLLVNG-CCNVNAPSTRQYICKSC-LANGCCSIYEYCV 127
Query: 96 SCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN A + +V F+ C +CR +S+SV HEN Y
Sbjct: 128 SCCLQPDKQLLLERFLNR--AAEGFQNLFTAVEDHFELCLAKCRTSSQSVQHENTY 181
>gi|47221493|emb|CAG08155.1| unnamed protein product [Tetraodon nigroviridis]
Length = 201
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 65/116 (56%), Gaps = 11/116 (9%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG+ LL+D+ GYVC+ + CC ++ C+ C L + CC+ YEYCV
Sbjct: 71 CRNSIQGKTLLTDELGYVCERKDLLVNG-CCNVNAPSSRQYICKSC-LANGCCSIYEYCV 128
Query: 96 SCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN A + +V F+ C +CR +S+SV HEN Y
Sbjct: 129 SCCLQPDKQPLLERFLNR--AAEGFRNLFTAVEDHFELCLAKCRTSSQSVQHENTY 182
>gi|324520392|gb|ADY47626.1| Unknown [Ascaris suum]
Length = 180
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 72/151 (47%), Gaps = 10/151 (6%)
Query: 18 LQFLSIISAIRKDIGFQETP---FCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP--EK 72
+Q+L+ SA R ET C+ T QG+ ++DD GY CD + CC
Sbjct: 27 IQWLAD-SAWRSGSDINETTGETACRNTRQGKTTVTDDRGYTCDRSYLMLNG-CCEVIPL 84
Query: 73 GEKFSCQGCNLLSQCCNSYEYCVSCCVNP--ARTLKEQVLNVKIAKPTTAGTYGSVFDFC 130
+F+C GC+L S CC+ YE+CV CC+ P L E + + + F+ C
Sbjct: 85 SVRFNCNGCDLHSGCCSFYEHCVGCCMRPDQKEILLEMIETTSGHRLRQILSATDQFELC 144
Query: 131 AGRCRHNSESVVHENAYLSD-FHHCFSMPSN 160
+CR +S SV EN Y S+ +C+ N
Sbjct: 145 TLKCRTSSNSVHSENKYKSEALKYCYKQEPN 175
>gi|198423098|ref|XP_002130011.1| PREDICTED: similar to Chromosome 12 open reading frame 49 ortholog
[Ciona intestinalis]
Length = 210
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 70/141 (49%), Gaps = 20/141 (14%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG+ L+ DDNGYVC + CC +++C+ C S CC+ YE+CV
Sbjct: 79 CRNSLQGKQLIVDDNGYVCKRDYLLDNG-CCDVTMHDTGRYACKTCQT-SNCCSVYEHCV 136
Query: 96 SCCVNPA-RTLKEQVLNVKIAKPTT----AGTYGSVFDFCAGRCRHNSESVVHENAYLS- 149
SCC+ P + L E+V V+ K T + FD C +CR +S SV HEN Y
Sbjct: 137 SCCLQPKHKNLLEKV--VRGLKTNTMLIMMSSIHDQFDLCLMKCRTSSTSVRHENTYRDP 194
Query: 150 DFHHCFSMPSNASGAGVTQLE 170
+CF G G +L
Sbjct: 195 SMKYCF-------GVGAPELR 208
>gi|410923361|ref|XP_003975150.1| PREDICTED: UPF0454 protein C12orf49 homolog [Takifugu rubripes]
Length = 199
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 65/116 (56%), Gaps = 11/116 (9%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG+ LL+D+ GYVC+ + CC ++ C+ C L + CC+ YEYCV
Sbjct: 69 CRNSIQGKTLLTDELGYVCERKDLLVNG-CCNVNAPSSRQYICKSC-LANGCCSIYEYCV 126
Query: 96 SCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN A + +V F+ C +CR +S+SV HEN Y
Sbjct: 127 SCCLQPDKQPLLERFLNR--AAEGFRNLFTAVEDHFELCLAKCRTSSQSVQHENTY 180
>gi|326930051|ref|XP_003211166.1| PREDICTED: UPF0454 protein C12orf49 homolog [Meleagris gallopavo]
Length = 205
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 63/114 (55%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ +VQG+ L++D+ GY+C+ + CC + +SC C L + CC+ YEYCV
Sbjct: 75 CRNSVQGKLLITDELGYICERKDLLVNG-CCNVNVPSTKLYSCDSC-LPNGCCSVYEYCV 132
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN +A F+ C +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQHLLERFLNRAAMAFQNLFMAVEDRFELCLAKCRTSSQSVQHENTY 186
>gi|327276076|ref|XP_003222797.1| PREDICTED: LOW QUALITY PROTEIN: UPF0454 protein C12orf49 homolog
[Anolis carolinensis]
Length = 205
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 47/124 (37%), Positives = 70/124 (56%), Gaps = 13/124 (10%)
Query: 32 GFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC--PEKGEK-FSCQGCNLLSQCC 88
G Q+ C+ +VQG+ L++D+ GY+C+ + CC G K +SC+ C L + CC
Sbjct: 68 GSQQGNQCRNSVQGKLLITDELGYICERKDVLVNG-CCNVDAPGTKLYSCESC-LSNGCC 125
Query: 89 NSYEYCVSCCVNPARTLKEQVLNVKIAKPTTA--GTYGSV---FDFCAGRCRHNSESVVH 143
+ YE CVSCC+ P K+QVL + + A + SV F+ C +CR +S+SV H
Sbjct: 126 SVYEACVSCCLQPN---KQQVLENFLNRAAVAFRNLFLSVEDHFELCLAKCRTSSQSVQH 182
Query: 144 ENAY 147
EN Y
Sbjct: 183 ENTY 186
>gi|66472238|ref|NP_001018573.1| UPF0454 protein C12orf49 homolog precursor [Danio rerio]
gi|82192840|sp|Q503V3.1|CL049_DANRE RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
gi|63100598|gb|AAH95168.1| Zgc:110063 [Danio rerio]
Length = 199
Score = 67.0 bits (162), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 65/116 (56%), Gaps = 11/116 (9%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG+ LL+D+ GYVC+ + CC ++ C+ C L + CC+ YEYCV
Sbjct: 69 CRNSIQGKSLLTDELGYVCERKDLLVNG-CCNVNAPSSSQYVCKSC-LSNGCCSIYEYCV 126
Query: 96 SCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN A + +V F+ C +CR +S+SV HEN Y
Sbjct: 127 SCCLQPDKQPLLERFLNR--AAEGFQNLFTAVEDHFELCLAKCRTSSQSVQHENTY 180
>gi|348670076|gb|EGZ09898.1| hypothetical protein PHYSODRAFT_549244 [Phytophthora sojae]
Length = 367
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 70/133 (52%), Gaps = 14/133 (10%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDP-QSRCCPE------KGEKFSCQGCNL-LSQCCNS 90
C+ TVQG LL+D GYVC +D ++ CC E + ++F+C C+ CC+
Sbjct: 181 CENTVQGVRLLTDSMGYVCTRSQLDKVKAGCCNETDASLPRIKQFTCDQCDTNPPHCCDV 240
Query: 91 YEYCVSCCVNPARTL-----KEQVLNVKIAKPTTAGTYG-SVFDFCAGRCRHNSESVVHE 144
+E+CVSCC++P ++ + + P + +VF +C RCR +S SV H+
Sbjct: 241 FEHCVSCCMHPLNVRFTGIRRDYLTHADPNHPVYSDPAEITVFQYCQYRCRTSSASVQHQ 300
Query: 145 NAYLSDFHHCFSM 157
N+Y S C+ +
Sbjct: 301 NSYRSHRSFCYGI 313
>gi|432872487|ref|XP_004072113.1| PREDICTED: UPF0454 protein C12orf49 homolog [Oryzias latipes]
Length = 201
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 64/116 (55%), Gaps = 11/116 (9%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG+ LL+D+ GYVC+ + CC + C+ C L + CC+ YEYCV
Sbjct: 71 CRNSIQGKSLLTDELGYVCERKDLLVNG-CCNVNAPNSRQHICKSC-LANGCCSIYEYCV 128
Query: 96 SCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
SCC+ P + L E+ LN A + +V F+ C +CR +S+SV HEN Y
Sbjct: 129 SCCLQPDKQLLLERFLNR--AAEGFQNLFTAVEDHFELCLAKCRTSSQSVQHENTY 182
>gi|449685232|ref|XP_004210848.1| PREDICTED: UPF0454 protein C12orf49 homolog [Hydra magnipapillata]
Length = 191
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 68/129 (52%), Gaps = 10/129 (7%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE---KFSCQGCNLLSQCCNSYEYCV 95
C+ + QG YL++D+ G VC + ++ CC +++C C+ ++CC YE+CV
Sbjct: 61 CRNSKQGPYLIADEKGQVC-MIKNVLKNGCCNFHANSTVRYNCNSCHT-NRCCEVYEHCV 118
Query: 96 SCCVNPA---RTLKEQVL-NVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYL-SD 150
SCC+ P R L E++L + +A+ F+ C +CR +S SV EN Y S
Sbjct: 119 SCCLQPVPRKRRLIEKILVHAHMARNPLLREVKDTFELCLHKCRTSSASVYQENKYKDSS 178
Query: 151 FHHCFSMPS 159
F +CF S
Sbjct: 179 FKYCFGTDS 187
>gi|170033812|ref|XP_001844770.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167874847|gb|EDS38230.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 228
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 64/123 (52%), Gaps = 8/123 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ +VQG+ L+ DD G+VC + + CC E + F C C + CC YE+CV
Sbjct: 75 CRNSVQGKALIVDDRGFVCTRSEL-LWTGCCNVEVESTKLFHCDTCQS-NACCAIYEFCV 132
Query: 96 SCCVNP-ARTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNSESVVHENAYLS-DFH 152
SCC++P ++L E+VL + T ++ C +CR NS SV +EN Y
Sbjct: 133 SCCMHPEKKSLLEKVLAKASGRQIAVYATVNDQYELCLTKCRTNSHSVQNENKYRDPKSK 192
Query: 153 HCF 155
HC+
Sbjct: 193 HCY 195
>gi|170590022|ref|XP_001899772.1| hypothetical protein [Brugia malayi]
gi|158592898|gb|EDP31494.1| conserved hypothetical protein [Brugia malayi]
Length = 295
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 71/137 (51%), Gaps = 21/137 (15%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE---KFSCQGCNLLSQCCNSYEYCV 95
C+ T+QGR ++DD GY C+ + S C E G +FSC C+L + CC+ YE CV
Sbjct: 160 CRNTIQGRTSVTDDRGYTCERAFL--MSNGCCEVGTQSLRFSCVWCDLYTGCCSHYEQCV 217
Query: 96 SCCVNPARTLKEQVLNVKIAKPTTAGTYGSV------FDFCAGRCRHNS-----ESVVHE 144
SCC+ P +++ + +K+ + T+ + F+ C +CR +S ESV E
Sbjct: 218 SCCLKP----EQKNILLKMIESTSGHRLRQILSATDQFELCMLKCRTSSNVRAHESVHSE 273
Query: 145 NAYLSD-FHHCFSMPSN 160
N Y S+ +C+ N
Sbjct: 274 NKYKSEKMKYCYKQEPN 290
>gi|449265786|gb|EMC76924.1| UPF0454 protein C12orf49 like protein, partial [Columba livia]
Length = 187
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 64/114 (56%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ +VQG+ L++D+ GY+C+ + CC + +SC+ C L + CC+ YE CV
Sbjct: 57 CRNSVQGKLLITDELGYICERKDLLVNG-CCNVNVPNTKLYSCESC-LPNGCCSVYECCV 114
Query: 96 SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
SCC+ P++ L E+ LN IA F+ C +CR +S+SV HEN Y
Sbjct: 115 SCCLQPSKQHLLERFLNRAAIAFQNLFMAVEDRFELCLAKCRTSSQSVQHENTY 168
>gi|390359679|ref|XP_797277.2| PREDICTED: UPF0454 protein C12orf49 homolog [Strongylocentrotus
purpuratus]
Length = 198
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 74/147 (50%), Gaps = 13/147 (8%)
Query: 17 TLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEK- 75
T Q +S+ + + + F T C+ + QG+ + D+ YVC + C PE
Sbjct: 46 TPQQISLRTRTSEKLKFNLTS-CRNSRQGKTFVVDEKAYVCSWKDLLANGCCNPEAKSSS 104
Query: 76 -FSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKP--TTAGTYGSV---FDF 129
+C GC+ + CC YE CVSCC++P KE+VL + K T + SV F+
Sbjct: 105 LHNCNGCSK-NGCCVMYERCVSCCMHPE---KERVLQDMLGKASHTFRLLFQSVADHFEL 160
Query: 130 CAGRCRHNSESVVHENAYLS-DFHHCF 155
C +CR +S+SV HEN Y D HC+
Sbjct: 161 CLTKCRTSSQSVQHENTYRDPDAKHCY 187
>gi|383856970|ref|XP_003703979.1| PREDICTED: UPF0454 protein C12orf49 homolog [Megachile rotundata]
Length = 325
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 69/148 (46%), Gaps = 32/148 (21%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---------------EKGEKFSCQGCNL 83
C+ +VQG+ L+ D+ G VC + P CC K E++SC+ CN
Sbjct: 112 CRNSVQGKVLIVDERGIVCSRQDILPNG-CCNIDQKDPTKNEDTSTVTKSERYSCKTCNA 170
Query: 84 LSQCCNSYEYCVSCCVNPARTLK--EQVL------NVKIAKPTTAGTYG----SVFDFCA 131
CC YEYCVSCC++P + +K + VL N K K A F C
Sbjct: 171 -QGCCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKAHKDQDAVKKRLRNLDRFQVCL 229
Query: 132 GRCRHNSESVVHENAYLSDFH--HCFSM 157
CR +S SV HEN Y D H HC+++
Sbjct: 230 AACRTSSASVRHENTY-KDPHSKHCYTL 256
>gi|345483911|ref|XP_001602536.2| PREDICTED: UPF0454 protein C12orf49 homolog [Nasonia vitripennis]
Length = 329
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 69/151 (45%), Gaps = 30/151 (19%)
Query: 39 CKTTVQGRYLLSDDNGYVC-------------DAVSMDPQSRCCPE-KGEKFSCQGCNLL 84
C+ +VQG+ L+ D+ GYVC +A S P K ++SC+ C+
Sbjct: 113 CRNSVQGKLLIVDEKGYVCSRHEVLSNGCCSLEASDQRTISSSVPSSKKTRYSCKTCSP- 171
Query: 85 SQCCNSYEYCVSCCVNPARTLKEQVLNV-KIAKPTTAGTYGS-----------VFDFCAG 132
CC YEYCVSCC++P + K + + +I K G F C
Sbjct: 172 QGCCTIYEYCVSCCLDPNKVKKRKSFTIDQIRKENLKLRRGEDILKLRLRSLDRFQLCLA 231
Query: 133 RCRHNSESVVHENAYLSD-FHHCF--SMPSN 160
CR +S SV HEN Y + + HC+ +PSN
Sbjct: 232 ACRTSSASVQHENTYKNPLYKHCYILQLPSN 262
>gi|301113522|ref|XP_002998531.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262111832|gb|EEY69884.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 363
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 68/133 (51%), Gaps = 14/133 (10%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDP-QSRCCPE------KGEKFSCQGCNL-LSQCCNS 90
C+ TVQG LL+D GYVC +D + CC E + ++++C C+ CC+
Sbjct: 181 CENTVQGLKLLTDSMGYVCTRSQLDKVKPGCCNETDASLPRIKQYTCDQCDTNPPHCCDV 240
Query: 91 YEYCVSCCVNPARTLKEQVLN--VKIAKPT----TAGTYGSVFDFCAGRCRHNSESVVHE 144
+E+CVSCC++P + + A P + +VF +C RCR +S SV H+
Sbjct: 241 FEHCVSCCMHPLNVRSSGIRRDFLTHADPNHPVYSDPAEITVFQYCQYRCRTSSASVQHQ 300
Query: 145 NAYLSDFHHCFSM 157
N+Y S C+ +
Sbjct: 301 NSYRSHRSFCYGI 313
>gi|281208176|gb|EFA82354.1| cGMP-specific phosphodiesterase [Polysphondylium pallidum PN500]
Length = 283
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 67/146 (45%), Gaps = 30/146 (20%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---------------------------PE 71
C+ TVQG ++DD G+ C +D ++ CC +
Sbjct: 133 CRNTVQGTKFIADDQGFNCLRSEVD-KNGCCIVNNNNNNSNNNNSTDNNNNNKINQYNSD 191
Query: 72 KGEKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFC 130
++FSC GC+ CCN YE CVSCC+ + + E+VL + K ++ + F+ C
Sbjct: 192 IFKRFSCYGCSNKFHCCNEYELCVSCCMGSDKVPILEEVLGRMLDKGISSNYLRNQFELC 251
Query: 131 AGRCRHNSESVVHENAYLS-DFHHCF 155
CR +S S++H+ Y+ D C+
Sbjct: 252 TMTCRTSSRSLIHQREYIEPDLKFCY 277
>gi|443709414|gb|ELU04086.1| hypothetical protein CAPTEDRAFT_93600 [Capitella teleta]
Length = 211
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 64/116 (55%), Gaps = 11/116 (9%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ +VQG+ L++DD GYVC + S CC +++ C C + CC YE+CV
Sbjct: 77 CRNSVQGQVLIADDEGYVCKRKDVK-GSGCCSLSAATTKRYVCDSCT-KNGCCEVYEHCV 134
Query: 96 SCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
SCC+ P + L +L+ A + + SV F+ C +CR +S+SV HEN+Y
Sbjct: 135 SCCLLPEKQPLLRNILSR--ASDSFRHLFASVADHFELCLAKCRTSSQSVQHENSY 188
>gi|350405127|ref|XP_003487334.1| PREDICTED: UPF0454 protein C12orf49 homolog [Bombus impatiens]
Length = 329
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 68/147 (46%), Gaps = 30/147 (20%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE--------------KGEKFSCQGCNLL 84
C+ ++QG+ L+ D+ G VC + P C E K E++SC+ CN
Sbjct: 113 CRNSIQGKVLIVDERGIVCSRQEILPNGCCNVEQKDSNKNEDTSTIAKKERYSCKTCNA- 171
Query: 85 SQCCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV----------FDFCAG 132
CC YEYCVSCC++P + +K + VL + A V F C
Sbjct: 172 QGCCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKAHRDQDVVKKRLRNLDRFQVCLA 231
Query: 133 RCRHNSESVVHENAYLSDFH--HCFSM 157
CR +S SV HEN Y D H HC+++
Sbjct: 232 ACRTSSASVRHENTY-KDPHSKHCYTL 257
>gi|340712268|ref|XP_003394684.1| PREDICTED: UPF0454 protein C12orf49 homolog [Bombus terrestris]
Length = 329
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 68/147 (46%), Gaps = 30/147 (20%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE--------------KGEKFSCQGCNLL 84
C+ ++QG+ L+ D+ G VC + P C E K E++SC+ CN
Sbjct: 113 CRNSIQGKVLIVDERGVVCSRQEILPNGCCNMEQKDSNKNEDTSTIAKKERYSCKTCNA- 171
Query: 85 SQCCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV----------FDFCAG 132
CC YEYCVSCC++P + +K + VL + A V F C
Sbjct: 172 QGCCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKAHRDQDVVKKRLRNLDRFQVCLA 231
Query: 133 RCRHNSESVVHENAYLSDFH--HCFSM 157
CR +S SV HEN Y D H HC+++
Sbjct: 232 ACRTSSASVRHENTY-KDPHSKHCYTL 257
>gi|332016576|gb|EGI57457.1| UPF0454 protein C12orf49-like protein [Acromyrmex echinatior]
Length = 293
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 65/134 (48%), Gaps = 19/134 (14%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE-------------KGEKFSCQGCNLLS 85
C+ ++QGR L+ DD+G VC + C E + E++SC+ CN S
Sbjct: 110 CRNSIQGRGLIVDDHGIVCSRHEVQSNGCCAIEQKQSSKNEESFMIRRERYSCKTCN--S 167
Query: 86 Q-CCNSYEYCVSCCVNPARTLK-EQVLNVKIAK-PTTAGTYGSVFDFCAGRCRHNSESVV 142
Q CC YEYCVSCC++P + ++ + L+ AK G F C CR +S SV
Sbjct: 168 QGCCTIYEYCVSCCLHPDKQIRGRKDLSSGSAKVQKNVGRTAVHFQICLAACRTSSSSVR 227
Query: 143 HENAYLSDF-HHCF 155
+N Y HC+
Sbjct: 228 RDNTYKDPLAKHCY 241
>gi|145348106|ref|XP_001418497.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578726|gb|ABO96790.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 198
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 4/116 (3%)
Query: 43 VQGRYLLSDDNGYVCDAVSMDPQSRCCPEK--GEKFSCQGCNLLSQCCNSYEYCVSCCVN 100
V R ++ DD G C +D + CC +++C GC+ ++CC YE C +CC
Sbjct: 58 VDARRVV-DDRGRACAWYEVDSVTGCCSTTTTAARYACDGCDESARCCRDYETCAACCQG 116
Query: 101 PARTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCF 155
P ++ + + A G + F++C RCR HEN Y D +C+
Sbjct: 117 PTHDVEAAMKRHARGRNQIATGFFEDAFEYCKSRCRTQPSVTFHENTYAYDTAYCY 172
>gi|380014036|ref|XP_003691050.1| PREDICTED: UPF0454 protein C12orf49 homolog [Apis florea]
Length = 327
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 30/147 (20%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE--------------KGEKFSCQGCNLL 84
C+ ++QG+ L+ D+ G VC + P C E + E++SC+ CN
Sbjct: 112 CRNSIQGKVLIVDERGIVCSRQEILPNGCCSMEQKGPIKNEDTSTIPRRERYSCKTCNA- 170
Query: 85 SQCCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV----------FDFCAG 132
CC YEYCVSCC++P + +K + VL + + V F C
Sbjct: 171 QGCCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKSHRDQDVVKKRLRNLDRFQVCLA 230
Query: 133 RCRHNSESVVHENAYLSDFH--HCFSM 157
CR +S SV HEN Y D H HC++
Sbjct: 231 ACRTSSASVRHENTY-KDPHSKHCYTF 256
>gi|110755335|ref|XP_001121865.1| PREDICTED: UPF0454 protein C12orf49 homolog [Apis mellifera]
Length = 327
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 30/147 (20%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE--------------KGEKFSCQGCNLL 84
C+ ++QG+ L+ D+ G VC + P C E + E++SC+ CN
Sbjct: 112 CRNSIQGKVLIVDERGIVCSRQEILPNGCCSMEQKDSIKNEDTSTTPRRERYSCKTCNA- 170
Query: 85 SQCCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV----------FDFCAG 132
CC YEYCVSCC++P + +K + VL + + V F C
Sbjct: 171 QGCCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKSHRDQDVVKKRLRNLDRFQVCLA 230
Query: 133 RCRHNSESVVHENAYLSDFH--HCFSM 157
CR +S SV HEN Y D H HC++
Sbjct: 231 ACRTSSASVRHENTY-KDPHSKHCYTF 256
>gi|307185715|gb|EFN71631.1| UPF0454 protein C12orf49-like protein [Camponotus floridanus]
Length = 322
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 64/139 (46%), Gaps = 24/139 (17%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE--------------KGEKFSCQGCNLL 84
C+ ++QG+ L+ D++G VC + C E K +++SC+ CN L
Sbjct: 117 CRNSIQGKGLIVDEHGIVCSRYEVLSNGCCTMEQKQAPKNKETSSIVKRKRYSCETCNSL 176
Query: 85 SQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV-------FDFCAGRCRHN 137
CC YEYCVSCC++P + + Q+ K+ G + F C CR +
Sbjct: 177 G-CCTIYEYCVSCCLDPDKQIL-QMRGRKLGSARMQKDGGRLRIRNQDRFQICLAVCRTS 234
Query: 138 SESVVHENAYLSDF-HHCF 155
S SV HEN Y HC+
Sbjct: 235 SLSVQHENTYKDPLAKHCY 253
>gi|431914235|gb|ELK15493.1| hypothetical protein PAL_GLEAN10010741 [Pteropus alecto]
Length = 219
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 73/146 (50%), Gaps = 23/146 (15%)
Query: 23 IISAIRKDIGFQETPF--CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFS 77
I+ ++ ++G P C+ ++QG++L++D+ GYVC+ + CC +++
Sbjct: 57 ILWKVQFNLGNSSRPSNQCRNSIQGKHLITDELGYVCERKDLLANG-CCNVNVPGTKQYC 115
Query: 78 CQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCR 135
C GC L + CC +YE+CVSCC+ P + L E+ LN +A F+ C +CR
Sbjct: 116 CDGC-LSNGCCGAYEHCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCR 174
Query: 136 HNS--------------ESVVHENAY 147
+S +SV HEN Y
Sbjct: 175 TSSQVRIFDVSRKLEDGQSVQHENTY 200
>gi|255083216|ref|XP_002504594.1| predicted protein [Micromonas sp. RCC299]
gi|226519862|gb|ACO65852.1| predicted protein [Micromonas sp. RCC299]
Length = 366
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 65/157 (41%), Gaps = 40/157 (25%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNL---LSQCCNSYEYCV 95
C T V+ R+L++D +G VC +++D + CC E S + C CC++Y CV
Sbjct: 29 CATAVENRHLVADSSGGVCPPLALDAATGCCLEPKRVVSKEMCAESCSERGCCDAYAACV 88
Query: 96 SCC------------------------VNPARTLKEQVLNVK------IAKPTTAGTYGS 125
CC ++PA + + + +PT T+
Sbjct: 89 VCCHDDIARRIAGSEGHDPIELAGFTPMHPAMWRRWLITHDPNGVRKLEGEPTRGETFAG 148
Query: 126 V-------FDFCAGRCRHNSESVVHENAYLSDFHHCF 155
FD+C RCR NS +ENA+ D HHCF
Sbjct: 149 ADDHAVAPFDYCKHRCRTNSGVTKYENAFQDDRHHCF 185
>gi|334327393|ref|XP_003340895.1| PREDICTED: UPF0454 protein C12orf49 homolog [Monodelphis domestica]
Length = 174
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 54/101 (53%), Gaps = 11/101 (10%)
Query: 54 GYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPAR-TLKEQV 109
GYVC+ + CC +F C GC L + CC +YEYCVSCC+ P++ L E+
Sbjct: 59 GYVCERKDLLVNG-CCNVNAASARRFWCDGC-LANGCCGAYEYCVSCCLQPSKQLLLERF 116
Query: 110 LNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
LN A + +V F+ C +CR +S+SV HEN Y
Sbjct: 117 LNQAAA--AFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 155
>gi|452823888|gb|EME30895.1| peptidyl-prolyl cis-trans isomerase [Galdieria sulphuraria]
Length = 360
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 49/98 (50%), Gaps = 3/98 (3%)
Query: 178 VIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASM---GADQPAEV 234
+I ++ ++C C+ C +LN C + +Y C+ +A+M G+D P V
Sbjct: 248 IIPAKRKQNCYRACQLKNLVCFKKGFSILNNCPTLSEYFDCRECEIATMESAGSDMPCRV 307
Query: 235 ADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
+ APS + C+ + S+ SC+ SH T RLCPC
Sbjct: 308 SLKAPSTYSSGVCMISPNLSLTSCEASHSQTERLCPCV 345
>gi|291221905|ref|XP_002730960.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 200
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 63/114 (55%), Gaps = 7/114 (6%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCN--LLSQCCNSYEYCVS 96
C+ +VQG+ L++D+ GY+C+ + CC EK C+ L + CC+ YE+CVS
Sbjct: 69 CRNSVQGKVLIADEKGYLCERSDVLGNG-CCDITAEKTVLHCCDKCLDNGCCSIYEHCVS 127
Query: 97 CCVNPARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
CC+ P + Q + + +K + SV F+ C +CR +SESV HEN Y
Sbjct: 128 CCLQPQKQQLLQGVLGRASKAFQI-LFSSVTDHFELCLAKCRTSSESVQHENTY 180
>gi|255071235|ref|XP_002507699.1| glycosyltransferase family 13 protein [Micromonas sp. RCC299]
gi|226522974|gb|ACO68957.1| glycosyltransferase family 13 protein [Micromonas sp. RCC299]
Length = 636
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 48/97 (49%), Gaps = 4/97 (4%)
Query: 175 GISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEV 234
G++ + QG++CD+ C+ + C LN CE++ K+ C+ GC +G D P V
Sbjct: 540 GLTPVAAVQGQTCDAACRYNHMKCHAPDFWFLNSCEVLAKFFPCEAGCALVLGDDIPNYV 599
Query: 235 ADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
+ CL + Q+ +C SHR T RLC C
Sbjct: 600 VSNRMDTYQ--KCLVSERQA--TCAASHRATTRLCAC 632
>gi|260782034|ref|XP_002586098.1| hypothetical protein BRAFLDRAFT_252331 [Branchiostoma floridae]
gi|229271187|gb|EEN42109.1| hypothetical protein BRAFLDRAFT_252331 [Branchiostoma floridae]
Length = 200
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 56/113 (49%), Gaps = 7/113 (6%)
Query: 40 KTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG---EKFSCQGCNLLSQCCNSYEYCVS 96
+ + G +S G+VCD + CC +F+C C + CC YE+CVS
Sbjct: 70 QNRIHGYDFVSTTAGFVCDREDILVGG-CCDVTSLTTRRFTCASCQD-NGCCQVYEHCVS 127
Query: 97 CCVNPART--LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
CC+ P + L+ + +V A T + F+ C +CR +S+SV HEN+Y
Sbjct: 128 CCLQPEKQPLLRSLLDHVPDAFKTLFASVTDHFELCLAKCRTSSQSVQHENSY 180
>gi|358339009|dbj|GAA47150.1| UPF0454 protein C12orf49 homolog, partial [Clonorchis sinensis]
Length = 186
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 72/161 (44%), Gaps = 23/161 (14%)
Query: 20 FLSIISAIRKDIGFQETPFCKTTVQ-GRYLLSDDNGYVCDAVSMDPQSRCCPEKG----- 73
+L+ +A+ + GF C +T Q + DD G C ++ S CCP
Sbjct: 31 WLNNTNALTTNKGF-----CSSTRQSASSYVVDDTGTYCPRTAL-LNSGCCPTPVAPDHA 84
Query: 74 ------EKFSCQGCNLLSQ-CCNSYEYCVSCCVNPART--LKEQVLNV-KIAKPTTAGTY 123
+F C CN + CC YE+CVSCC+N +E +L ++
Sbjct: 85 DTAIIVRRFVCDSCNQTRENCCQIYEHCVSCCMNRQHIHLWREALLQASELPAQRHLLLA 144
Query: 124 GSVFDFCAGRCRHNSESVVHENAYLSDFH-HCFSMPSNASG 163
+F +C +CR +S+SV+HEN Y H +CF + + G
Sbjct: 145 DGLFQYCQAKCRTSSQSVLHENTYRDAEHRYCFGLYLTSPG 185
>gi|320162724|gb|EFW39623.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 319
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 16/121 (13%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
C+ TVQGR L++DD G +C S+ P C P ++ C C + CC +E C
Sbjct: 103 CRNTVQGRVLVTDDRGNMCTRTSVLPNGCCSPAANTTRQYVCDSC-TPNDCCTQFEVCTF 161
Query: 97 CCVNPARTLKEQV-LNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYL-SDFHHC 154
+ ++ + +NV+ F+ C +CR +S SVVHEN Y + + +C
Sbjct: 162 FLIKFLHRARDALFMNVQ-----------DQFEMCLAKCRTSSLSVVHENTYKDTQYKYC 210
Query: 155 F 155
F
Sbjct: 211 F 211
>gi|326432709|gb|EGD78279.1| hypothetical protein PTSG_09343 [Salpingoeca sp. ATCC 50818]
Length = 198
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 53/122 (43%), Gaps = 6/122 (4%)
Query: 28 RKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE--KFSCQGCNLLS 85
R+ + CK T L DD G VC+ P CC + + C C+
Sbjct: 70 RRHLAAASNDSCKYTTFDGLQLVDDLGRVCNVEDAQPNG-CCFSTSQIVEAVCDSCDA-G 127
Query: 86 QCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPT--TAGTYGSVFDFCAGRCRHNSESVVH 143
CC YE+CVSCC+ P + + + +P F++C +CR +S+SV H
Sbjct: 128 HCCRVYEHCVSCCLAPKHVPLRRFVFHNLTEPQRRVLSLVSDQFEYCEAKCRTSSQSVQH 187
Query: 144 EN 145
EN
Sbjct: 188 EN 189
>gi|426374279|ref|XP_004054004.1| PREDICTED: UPF0454 protein C12orf49 homolog [Gorilla gorilla
gorilla]
Length = 220
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/130 (32%), Positives = 65/130 (50%), Gaps = 24/130 (18%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
C+ ++QG++L++D+ GYVC+ + CC +++ C GC + CC++YEYCV
Sbjct: 75 CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 132
Query: 96 SCCVNPART-LKEQVLN---VKIAKPTTAGTY------GSVFDF--------CAGRCRHN 137
SCC+ P + L E+ LN V A Y FDF C+ +++
Sbjct: 133 SCCLQPNKQLLLERFLNRAAVAFQNLFMALLYPLPDFCSPPFDFMPLLMSLVCSASIQYH 192
Query: 138 SESVVHENAY 147
SV HEN Y
Sbjct: 193 -WSVQHENTY 201
>gi|432116785|gb|ELK37410.1| hypothetical protein MDA_GLEAN10014215 [Myotis davidii]
Length = 149
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%), Gaps = 3/76 (3%)
Query: 74 EKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCA 131
+++ C GC L + CC++YE+CVSCC+ P + L E+ LN +A F+ C
Sbjct: 56 KQYCCDGC-LTNGCCSAYEHCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCL 114
Query: 132 GRCRHNSESVVHENAY 147
+CR +S+SV HEN Y
Sbjct: 115 AKCRTSSQSVQHENTY 130
>gi|402887791|ref|XP_003907264.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 2 [Papio
anubis]
Length = 175
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 44/76 (57%), Gaps = 3/76 (3%)
Query: 74 EKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCA 131
+++ C GC + CC++YEYCVSCC+ P + L E+ LN +A F+ C
Sbjct: 82 KQYCCDGC-WPNGCCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCL 140
Query: 132 GRCRHNSESVVHENAY 147
+CR +S+SV HEN Y
Sbjct: 141 AKCRTSSQSVQHENTY 156
>gi|332840508|ref|XP_003314001.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 2 [Pan
troglodytes]
Length = 175
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 44/76 (57%), Gaps = 3/76 (3%)
Query: 74 EKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCA 131
+++ C GC + CC++YEYCVSCC+ P + L E+ LN +A F+ C
Sbjct: 82 KQYCCDGC-WPNGCCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCL 140
Query: 132 GRCRHNSESVVHENAY 147
+CR +S+SV HEN Y
Sbjct: 141 AKCRTSSQSVQHENTY 156
>gi|330794772|ref|XP_003285451.1| hypothetical protein DICPUDRAFT_149321 [Dictyostelium purpureum]
gi|325084626|gb|EGC38050.1| hypothetical protein DICPUDRAFT_149321 [Dictyostelium purpureum]
Length = 322
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 47/84 (55%), Gaps = 2/84 (2%)
Query: 74 EKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTY-GSVFDFCAG 132
E++SC+ C S CC++YEYC+SCC+ + Q + + A+ A Y + FD C
Sbjct: 232 ERYSCKRCLEGSDCCDTYEYCISCCMRYDKVPVVQEMLIGFAERGVAKEYLKNQFDLCTM 291
Query: 133 RCRHNSESVVHENAYLS-DFHHCF 155
CR +S S++++ Y + F C+
Sbjct: 292 ACRTSSSSLLNQREYQNPKFKFCY 315
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 23/34 (67%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK 72
C++TVQG ++DD GYVC S D ++CC ++
Sbjct: 90 CQSTVQGNLYIADDQGYVCSRSSWDSITKCCKKQ 123
>gi|395833972|ref|XP_003789991.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 2 [Otolemur
garnettii]
Length = 175
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 44/76 (57%), Gaps = 3/76 (3%)
Query: 74 EKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCA 131
+++ C GC + CC++YEYCVSCC+ P + L E+ LN +A F+ C
Sbjct: 82 KQYCCDGC-WPNGCCSAYEYCVSCCLQPNKQLLLERFLNRAAMAFQNLFMAVEDHFELCL 140
Query: 132 GRCRHNSESVVHENAY 147
+CR +S+SV HEN Y
Sbjct: 141 AKCRTSSQSVQHENTY 156
>gi|290976460|ref|XP_002670958.1| predicted protein [Naegleria gruberi]
gi|284084522|gb|EFC38214.1| predicted protein [Naegleria gruberi]
Length = 350
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 55/131 (41%), Gaps = 15/131 (11%)
Query: 144 ENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKL 203
E + +DF H S+P A L I ++G GE+CD++C S C N
Sbjct: 229 ETSAPNDFVHD-SLPCETLPASAIHSLQELQTIKAVLGALGENCDTICAKSESVCAPNYF 287
Query: 204 ILLNQCEIIQKYMS---CKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDG 260
+N+C+ ++K+ S CK+ V D + C+ +S C
Sbjct: 288 PYINRCDEMRKHYSSCECKK-----------EGVIDSRAPYFDGKYCIIGNRRSKFRCGN 336
Query: 261 SHRHTRRLCPC 271
+H RRLCPC
Sbjct: 337 AHPSERRLCPC 347
>gi|221042418|dbj|BAH12886.1| unnamed protein product [Homo sapiens]
Length = 175
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 43/76 (56%), Gaps = 3/76 (3%)
Query: 74 EKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCA 131
+++ C GC + CC++YEYC SCC+ P + L E+ LN +A F+ C
Sbjct: 82 KQYCCDGC-WPNGCCSAYEYCASCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCL 140
Query: 132 GRCRHNSESVVHENAY 147
+CR +S+SV HEN Y
Sbjct: 141 AKCRTSSQSVQHENTY 156
>gi|299117113|emb|CBN73884.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 83
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 38/78 (48%), Gaps = 1/78 (1%)
Query: 195 GQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQS 254
G C + LLN C +++ + C C S G +QP V DAP + P CL +
Sbjct: 3 GMICKESSFQLLNSCGLLRDHFPCT-SCTESNGFEQPCYVELDAPEDNLPGQCLISSNPR 61
Query: 255 MLSCDGSHRHTRRLCPCA 272
+C SH TRRLCPC
Sbjct: 62 HSTCSASHEVTRRLCPCT 79
>gi|291000376|ref|XP_002682755.1| N-acetylglucosaminylaminotransferase [Naegleria gruberi]
gi|284096383|gb|EFC50011.1| N-acetylglucosaminylaminotransferase [Naegleria gruberi]
Length = 617
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 59/124 (47%), Gaps = 12/124 (9%)
Query: 153 HCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSS-GQSCVLNKLILLNQCEI 211
HC + S L + S I+ I +QG+SC S CK C +N C+
Sbjct: 469 HCPAPISKFHKDSNWSLPKKDSIITFAISKQGQSCSSYCKQGFNMECYRFSFEEMNSCKF 528
Query: 212 IQKYMSCK-RGCLASMGADQPAEVADDAPSNLNPTACLYT---RIQSMLSCDGSHRHTRR 267
++K++ + C+ S G DQPA +++ CL+ R + + +C+ H +T+R
Sbjct: 529 LRKFVGQECTECIPSSGPDQPA-------IDVDSKQCLFNGGGRGKFISTCEAYHVNTKR 581
Query: 268 LCPC 271
+CPC
Sbjct: 582 ICPC 585
>gi|66817203|ref|XP_642480.1| hypothetical protein DDB_G0277747 [Dictyostelium discoideum AX4]
gi|60470568|gb|EAL68547.1| hypothetical protein DDB_G0277747 [Dictyostelium discoideum AX4]
Length = 312
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 69/172 (40%), Gaps = 55/172 (31%)
Query: 39 CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC----------------------------- 69
C++T+QG ++DD GYVC D + CC
Sbjct: 133 CESTIQGSTYIADDQGYVCLRSQWDSVTGCCFGGNKTSTTTNNNKHFRDDDDDDNDNDEG 192
Query: 70 -------------------PEKG-EKFSCQGC--NLLSQCCNSYEYCVSCCVNPART--L 105
E G ++F+C C +S CC+SYE+C+SCC+ + L
Sbjct: 193 NQKTTNNKNRGDNENQIENEEIGVQRFTCSRCLGGEVSDCCDSYEFCISCCMRQDKVSIL 252
Query: 106 KEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLS-DFHHCFS 156
+ ++N K + FD C+ CR +S S+V++ Y +F C+S
Sbjct: 253 HDTLINF-AEKNLDKKYLKNQFDLCSMACRTSSASLVNQREYQDPEFKFCYS 303
>gi|294955121|ref|XP_002788417.1| alpha-1,3-mannosyl-glycoprotein beta-1,
2-n-acetylglucosaminyltransferase, putative [Perkinsus
marinus ATCC 50983]
gi|239903817|gb|EER20213.1| alpha-1,3-mannosyl-glycoprotein beta-1,
2-n-acetylglucosaminyltransferase, putative [Perkinsus
marinus ATCC 50983]
Length = 1171
Score = 46.6 bits (109), Expect = 0.013, Method: Composition-based stats.
Identities = 28/89 (31%), Positives = 38/89 (42%), Gaps = 3/89 (3%)
Query: 183 QGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNL 242
+G SC C + C + L N C I+ + C+ GC +G + PA V D +
Sbjct: 513 RGVSCTHTCIHNNARCYDHLLEFANDCHIMLSHFPCEEGCGHQLGLELPAYVMDSTLHTV 572
Query: 243 NPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
CL T S SC H T R+C C
Sbjct: 573 G--QCLVTDHGSP-SCATYHISTARMCTC 598
>gi|340369555|ref|XP_003383313.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase B-like
[Amphimedon queenslandica]
Length = 662
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 44/97 (45%), Gaps = 1/97 (1%)
Query: 176 ISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCK-RGCLASMGADQPAEV 234
+ +I + G++C C + C + L+N E + ++ C L AD +
Sbjct: 553 LRTVISKPGQNCRDSCLENKMVCESSFFSLINTHEAVFNHLGCNVTHTLEEKYADYAPAI 612
Query: 235 ADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
A A + P C+ I+ +L C G ++ RRLCPC
Sbjct: 613 AVSADKPIEPGHCVIIDIRRLLGCPGYSKNFRRLCPC 649
>gi|290976364|ref|XP_002670910.1| predicted protein [Naegleria gruberi]
gi|284084474|gb|EFC38166.1| predicted protein [Naegleria gruberi]
Length = 466
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 57/116 (49%), Gaps = 18/116 (15%)
Query: 158 PSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQK-YM 216
P+N G +T+L + I+ +GE+CD+ C + +C +N+C+ ++K Y
Sbjct: 364 PTNTHG--ITELHN----LKAILCEKGENCDTCCSRNNATCDGLFFPYINRCDEMRKHYP 417
Query: 217 SCKRGCLASMGADQPAEVADDAPS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
CK C E+AD++ S N + C+ +S C S+R RRLCPC
Sbjct: 418 DCK--C--------AYELADESRSPNFDGELCVIGNRRSRFRCGNSYRTERRLCPC 463
>gi|290987267|ref|XP_002676344.1| predicted protein [Naegleria gruberi]
gi|284089946|gb|EFC43600.1| predicted protein [Naegleria gruberi]
Length = 481
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 53/123 (43%), Gaps = 9/123 (7%)
Query: 152 HHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEI 211
H F P +T E L+ + ++ +GE CD+ CK ++C ++ +N C
Sbjct: 365 HDVFEFPLKPPTLKITLSE--LNKVESVLCEEGEDCDTCCKKRAKTCDVSFFPYINHCRE 422
Query: 212 IQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDG--SHRHTRRLC 269
IQK+ K C + + + +A +N + C S SC S + RRLC
Sbjct: 423 IQKH--SKERCNCKIDDEFFSSIAPSMDTNKD---CYVGYRPSRFSCSNKKSKSNHRRLC 477
Query: 270 PCA 272
PC
Sbjct: 478 PCK 480
>gi|426221180|ref|XP_004004788.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Ovis aries]
Length = 739
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 51/104 (49%), Gaps = 17/104 (16%)
Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
LS + V G+SC VC+ + C + LN+ + + KY ++C Q
Sbjct: 633 LSALQVKFAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYEVNC-----------QS 681
Query: 232 AEVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 271
+E+A D APS +P C++ + SC G+H H RR+CPC
Sbjct: 682 SELAKDIVAPS-FDPKNKHCVFQGDLLLFSCAGAHAHHRRICPC 724
>gi|440795738|gb|ELR16854.1| GNT-I family protein [Acanthamoeba castellanii str. Neff]
Length = 566
Score = 40.4 bits (93), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 62/165 (37%), Gaps = 21/165 (12%)
Query: 117 PTTAGTYGSV------FDFCAGRCRHNSESVVH---ENAYLSDFHHCFSMPSNASGAGVT 167
P YGS+ +D R HN V+ L+D HC +P +
Sbjct: 401 PYVRENYGSISLSLALWD-TPSRGTHNHSLVLRYKKNTLLLADKRHCAYLPESLRILPHE 459
Query: 168 QLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRG-CLASM 226
QL RL +V GE C + C G C L C+ + C RG C ++
Sbjct: 460 QL--RLVTSAV-----GEDCHAPCAREGLKCHQGDFDFLRHCDALASRFPCHRGRCQLAV 512
Query: 227 GADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
G + P + P CL + + SC H+ +RLCPC
Sbjct: 513 GPEMPIFLP-HGPHQNATGRCLVS--EDFASCAAKHKGGQRLCPC 554
>gi|443721485|gb|ELU10776.1| hypothetical protein CAPTEDRAFT_200210 [Capitella teleta]
Length = 484
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 55/120 (45%), Gaps = 7/120 (5%)
Query: 155 FSMPSNASGAGVTQLEGRLSGISVII-GRQGESCDSVCKSSGQSCVLNKLILLNQ-CEII 212
F+ P A + + + GRL +I+ G G C VC + C + L LLN+ C ++
Sbjct: 368 FATPHGAQ-SPLRGVIGRLQLDYIIVTGDDGLDCSQVCAGVYKECDVQGLFLLNKGCAVL 426
Query: 213 QKY-MSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
+ +SCK + QP D P+ L+ C T SC G H ++R+CPC
Sbjct: 427 ELLALSCKVCSEVKVEGSQP--YTGDMPAMLSDGNCC-TGWPHHSSCFGKHVESKRICPC 483
>gi|298710994|emb|CBJ32301.1| hypothetical protein Esi_0327_0028 [Ectocarpus siliculosus]
Length = 550
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 36/114 (31%), Gaps = 49/114 (42%)
Query: 91 YEYCVSCCVNPARTLKEQVLNVKIA---KPTTAGTYGS---------------------- 125
YE+CVSCC NP R + + + A P GS
Sbjct: 391 YEFCVSCCQNPQRGEEREAIQASAALSGHPAYRDLLGSGNNDAANGREPPPPRQLRASAA 450
Query: 126 ------------------------VFDFCAGRCRHNSESVVHENAYLSDFHHCF 155
F CA RCR S SV HEN+Y S HCF
Sbjct: 451 AAEAGGVEKNGQQVEAEERDARELAFAHCAFRCRTYSGSVAHENSYRSPLKHCF 504
>gi|440902995|gb|ELR53712.1| Alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Bos grunniens
mutus]
Length = 739
Score = 38.5 bits (88), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 17/104 (16%)
Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
LS + V G+SC VC+ + C + LN+ + + KY ++C Q
Sbjct: 633 LSALQVKFAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYDVNC-----------QS 681
Query: 232 AEVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 271
+E+A D APS +P C++ + SC G+H RR+CPC
Sbjct: 682 SELAKDIVAPS-FDPKNKHCVFQGDLLLFSCAGAHARHRRICPC 724
>gi|300797805|ref|NP_001178271.1| alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Bos taurus]
gi|296490527|tpg|DAA32640.1| TPA: mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [Bos taurus]
Length = 739
Score = 38.5 bits (88), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 17/104 (16%)
Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
LS + V G+SC VC+ + C + LN+ + + KY ++C Q
Sbjct: 633 LSALQVKFAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYDVNC-----------QS 681
Query: 232 AEVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 271
+E+A D APS +P C++ + SC G+H RR+CPC
Sbjct: 682 SELAKDIVAPS-FDPKNKHCVFQGDLLLFSCAGAHARHRRICPC 724
>gi|296204935|ref|XP_002749539.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Callithrix
jacchus]
Length = 741
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 9/100 (9%)
Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
LS + V + G+SC VC+ S C + LN+ + + KY ++C+ LA +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYEVTCQSSELA-----KD 689
Query: 232 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
V P N C++ + SC G+H RR+CPC
Sbjct: 690 ILVPSFDPKN---KHCVFQGDLLLFSCAGAHPRHRRVCPC 726
>gi|403280116|ref|XP_003931580.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Saimiri
boliviensis boliviensis]
Length = 741
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 9/100 (9%)
Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
LS + V + G+SC VC+ S C + LN+ + + KY ++C+ LA +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYEVTCQSSELA-----KD 689
Query: 232 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
V P N C++ + SC G+H RR+CPC
Sbjct: 690 ILVPSFDPKN---KHCVFQGDLLLFSCAGAHPRHRRVCPC 726
>gi|303280994|ref|XP_003059789.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458444|gb|EEH55741.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 274
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 28/49 (57%), Gaps = 1/49 (2%)
Query: 125 SVFDFCAGRCRHNSESVVHENAYLSDFHHCF-SMPSNASGAGVTQLEGR 172
S F++CA RCR NS +EN + HHCF S+ + + GA T GR
Sbjct: 204 SSFEYCAHRCRTNSAVTKYENEFQDAKHHCFGSVYAESVGADDTLSPGR 252
>gi|62702324|gb|AAX93248.1| unknown [Homo sapiens]
Length = 231
Score = 37.7 bits (86), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 9/100 (9%)
Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
LS + V + G+SC VC+ S C + LN+ + + KY ++C+ LA +
Sbjct: 125 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYKVTCQSSELA-----KD 179
Query: 232 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
V P N + C++ + SC G+H +R+CPC
Sbjct: 180 ILVPSFDPKNKH---CVFQGDLLLFSCAGAHPRHQRVCPC 216
>gi|348520977|ref|XP_003448003.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase B-like
[Oreochromis niloticus]
Length = 805
Score = 37.7 bits (86), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 44/105 (41%), Gaps = 19/105 (18%)
Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 232
LS + +++ ++G+SC C+S+G C +N E ++ G D
Sbjct: 699 LSSLRLLVSQEGQSCVEACQSAGFICEPAHFRFINNKEALR-------------GLDVQC 745
Query: 233 EVADDAPSNLNPT------ACLYTRIQSMLSCDGSHRHTRRLCPC 271
EV D +++ P C R + SC G RRLCPC
Sbjct: 746 EVVDSEINHILPAFSVVRQECGLQREPLLFSCAGHSPKYRRLCPC 790
>gi|301770689|ref|XP_002920763.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A-like
[Ailuropoda melanoleuca]
gi|281337646|gb|EFB13230.1| hypothetical protein PANDA_009537 [Ailuropoda melanoleuca]
Length = 741
Score = 37.4 bits (85), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 9/100 (9%)
Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
LS + V + G+SC VC+ + C + LN+ + + KY ++C+ LA +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYEVTCQSSELA-----KD 689
Query: 232 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
V P N + C++ + SC G+H RR+CPC
Sbjct: 690 ILVPSFDPKNRH---CVFQGDLLLFSCAGAHPRHRRVCPC 726
>gi|291391470|ref|XP_002712459.1| PREDICTED: N-acetylglucosaminyltransferase V [Oryctolagus
cuniculus]
Length = 741
Score = 37.4 bits (85), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 9/100 (9%)
Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
LS + V + G+SC VC+ + C + LN+ + + KY +SC+ LA +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKNKDVLKYEVSCQSSELA-----KD 689
Query: 232 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
V P N C++ + SC G+H +R+CPC
Sbjct: 690 ILVPSFDPKN---KHCVFQGDLLLFSCAGAHPRHQRICPC 726
>gi|426337254|ref|XP_004032629.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Gorilla
gorilla gorilla]
Length = 741
Score = 37.4 bits (85), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 232
LS + V + G+SC VC+ S C + LN+ + + KY K C Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684
Query: 233 EVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 271
E+A D PS +P C++ + SC G+H +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQADLLLFSCAGAHPRYQRVCPC 726
>gi|428167931|gb|EKX36882.1| hypothetical protein GUITHDRAFT_116905 [Guillardia theta CCMP2712]
Length = 652
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 37/91 (40%), Gaps = 6/91 (6%)
Query: 183 QGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASM-GADQPAEVADDAPSN 241
QG SC + C+ +C +LN CE + + C G D PA +D
Sbjct: 290 QGVSCFTACQEQNLACQRIWFDVLNNCEALAVAFPSHKICSTDFYGRDLPAYRPEDLTVL 349
Query: 242 LNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
+N Y+ SC G H T+RLC C
Sbjct: 350 VNKKPRTYSS-----SCGGKHEKTKRLCGCG 375
>gi|196010315|ref|XP_002115022.1| predicted protein [Trichoplax adhaerens]
gi|190582405|gb|EDV22478.1| predicted protein [Trichoplax adhaerens]
Length = 1490
Score = 37.4 bits (85), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 3/64 (4%)
Query: 77 SCQGCNLLSQCCNSYEYCVSCCV--NPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRC 134
+C CN ++CC+ YE+CVSCC A LKE + N + + F+FC +C
Sbjct: 107 ACHTCNS-NRCCDQYEFCVSCCQYRKEALDLKEIIRNSPDGIQQLLTSAKTRFEFCLIQC 165
Query: 135 RHNS 138
R +S
Sbjct: 166 RTSS 169
>gi|428175838|gb|EKX44726.1| hypothetical protein GUITHDRAFT_109505 [Guillardia theta CCMP2712]
Length = 551
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 15/92 (16%)
Query: 186 SCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQP------AEVADDAP 239
+CDSVC+S+G+ C + + L++C+I +K + C + D P E+
Sbjct: 124 NCDSVCQSAGKVCARDWFVFLDRCDIAKKVFP-EGTCSFAFYPDMPFYDKTTKEIFVSKD 182
Query: 240 SNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
+ L+P C + R RLCPC
Sbjct: 183 TKLDPPTCDAAPLGG--------RSAERLCPC 206
>gi|344268130|ref|XP_003405916.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Loxodonta
africana]
Length = 741
Score = 37.0 bits (84), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 17/104 (16%)
Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
LS + V + G+SC VC+ S C + LN+ + + KY ++C Q
Sbjct: 635 LSALQVKLAEPGKSCKQVCQESQLICEPSFFQHLNKDKDMLKYEVTC-----------QS 683
Query: 232 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 271
+E+A D PS +P + C++ + SC G+H +R+CPC
Sbjct: 684 SELAKDILVPS-FDPKSKHCVFQGDLLLFSCAGAHPRHQRVCPC 726
>gi|297266872|ref|XP_002799448.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A-like [Macaca
mulatta]
Length = 547
Score = 37.0 bits (84), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 232
LS + V + G+SC VC+ S C + LN+ + + KY K C Q +
Sbjct: 441 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 490
Query: 233 EVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 271
E+A D PS +P C++ + SC G+H +R+CPC
Sbjct: 491 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 532
>gi|297668509|ref|XP_002812479.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Pongo abelii]
Length = 741
Score = 37.0 bits (84), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 232
LS + V + G+SC VC+ S C + LN+ + + KY K C Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684
Query: 233 EVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 271
E+A D PS +P C++ + SC G+H +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 726
>gi|291233073|ref|XP_002736478.1| PREDICTED: N-acetylglucosaminyltransferase V-like [Saccoglossus
kowalevskii]
Length = 561
Score = 37.0 bits (84), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 55/129 (42%), Gaps = 19/129 (14%)
Query: 145 NAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLI 204
NAY+ + C + P LS + V I ESC +VC +G C +
Sbjct: 435 NAYIENQDFCSANPKQWPP---------LSALKVKISNPKESCKAVCMKNGLICEPSFFK 485
Query: 205 LLNQCEIIQK--YMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSH 262
LN E+I++ Y+ CK ++ P+ + + SN+ C + + + SC G
Sbjct: 486 SLNTQEMIEREGYLECK-SVISVEDIYAPSFIKE---SNI----CHLEKQKLLFSCAGGS 537
Query: 263 RHTRRLCPC 271
R+CPC
Sbjct: 538 DTRNRVCPC 546
>gi|4505167|ref|NP_002401.1| alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Homo sapiens]
gi|114580983|ref|XP_515802.2| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A isoform 2 [Pan
troglodytes]
gi|397504546|ref|XP_003822849.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Pan paniscus]
gi|1169980|sp|Q09328.1|MGT5A_HUMAN RecName: Full=Alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A; AltName:
Full=Alpha-mannoside
beta-1,6-N-acetylglucosaminyltransferase; AltName:
Full=GlcNAc-T V; Short=GNT-V; AltName: Full=Mannoside
acetylglucosaminyltransferase 5; AltName:
Full=N-acetylglucosaminyl-transferase V
gi|4545222|gb|AAD22449.1|AF113921_1 alpha-1,3(6)-mannosylglycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [Homo sapiens]
gi|870752|dbj|BAA04570.1| N-acetylglucosaminyltransferase V [Homo sapiens]
gi|119632062|gb|EAX11657.1| mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase, isoform CRA_a
[Homo sapiens]
gi|119632063|gb|EAX11658.1| mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase, isoform CRA_a
[Homo sapiens]
gi|162317656|gb|AAI56314.1| Mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [synthetic
construct]
gi|162319448|gb|AAI57049.1| Mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [synthetic
construct]
gi|410216290|gb|JAA05364.1| mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [Pan
troglodytes]
gi|410265502|gb|JAA20717.1| mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [Pan
troglodytes]
gi|410308382|gb|JAA32791.1| mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [Pan
troglodytes]
gi|410332945|gb|JAA35419.1| mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [Pan
troglodytes]
Length = 741
Score = 37.0 bits (84), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 232
LS + V + G+SC VC+ S C + LN+ + + KY K C Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684
Query: 233 EVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 271
E+A D PS +P C++ + SC G+H +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 726
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.135 0.429
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,066,340,777
Number of Sequences: 23463169
Number of extensions: 157805885
Number of successful extensions: 407431
Number of sequences better than 100.0: 229
Number of HSP's better than 100.0 without gapping: 113
Number of HSP's successfully gapped in prelim test: 116
Number of HSP's that attempted gapping in prelim test: 406977
Number of HSP's gapped (non-prelim): 263
length of query: 272
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 132
effective length of database: 9,074,351,707
effective search space: 1197814425324
effective search space used: 1197814425324
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.5 bits)
S2: 75 (33.5 bits)