BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 028746
(204 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|118485488|gb|ABK94599.1| unknown [Populus trichocarpa]
Length = 268
Score = 324 bits (831), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 152/193 (78%), Positives = 175/193 (90%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GCNL+SQCCNSYE+CVSCC++P+RT + QVL VKIAKP+TA +Y SVFDFCAGRCRHNSE
Sbjct: 76 GCNLVSQCCNSYEFCVSCCLHPSRTQEAQVLKVKIAKPSTAVSYTSVFDFCAGRCRHNSE 135
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY SDFHHCFS+PSN+SGA TQLE RL GI+V+IG+QGESCDSVCKS+GQSCV
Sbjct: 136 SVVHENAYRSDFHHCFSLPSNSSGANYTQLEARLGGINVVIGKQGESCDSVCKSNGQSCV 195
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
LNKL++LNQC+++QKYMSCK GCLAS+G DQPAEV +DAP +LNP ACLYTR QS+LSCD
Sbjct: 196 LNKLLVLNQCDVMQKYMSCKGGCLASIGTDQPAEVVEDAPRHLNPGACLYTRTQSLLSCD 255
Query: 192 GSHRHTRRLCPCA 204
GS HTRRLCPCA
Sbjct: 256 GSLWHTRRLCPCA 268
>gi|224112759|ref|XP_002332721.1| predicted protein [Populus trichocarpa]
gi|222832884|gb|EEE71361.1| predicted protein [Populus trichocarpa]
Length = 245
Score = 323 bits (828), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 152/193 (78%), Positives = 175/193 (90%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GCNL+SQCCNSYE+CVSCC++P+RT + QVL VKIAKP+TA +Y SVFDFCAGRCRHNSE
Sbjct: 53 GCNLVSQCCNSYEFCVSCCLHPSRTQEAQVLKVKIAKPSTAVSYTSVFDFCAGRCRHNSE 112
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY SDFHHCFS+PSN+SGA TQLE RL GI+V+IG+QGESCDSVCKS+GQSCV
Sbjct: 113 SVVHENAYRSDFHHCFSLPSNSSGANYTQLEARLGGINVVIGKQGESCDSVCKSNGQSCV 172
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
LNKL++LNQC+++QKYMSCK GCLAS+G DQPAEV +DAP +LNP ACLYTR QS+LSCD
Sbjct: 173 LNKLLVLNQCDVMQKYMSCKGGCLASIGTDQPAEVVEDAPRHLNPGACLYTRTQSLLSCD 232
Query: 192 GSHRHTRRLCPCA 204
GS HTRRLCPCA
Sbjct: 233 GSLWHTRRLCPCA 245
>gi|359478124|ref|XP_002264803.2| PREDICTED: UPF0454 protein C12orf49 homolog [Vitis vinifera]
Length = 279
Score = 318 bits (814), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 151/194 (77%), Positives = 171/194 (88%), Gaps = 1/194 (0%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GCNL+SQCCNSYE+CVSCC+NPART KE L VK+AKP TAG Y SVFDFC GRCRHNSE
Sbjct: 86 GCNLISQCCNSYEFCVSCCLNPARTQKELALKVKMAKPVTAGAYASVFDFCTGRCRHNSE 145
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQ-LEGRLSGISVIIGRQGESCDSVCKSSGQSC 130
SVVHENAYLSDFHHCFS+PSN++ GV + E RL+GI V++GRQGESCDSVCKS+GQSC
Sbjct: 146 SVVHENAYLSDFHHCFSLPSNSTAGGVDRNPELRLNGIHVVVGRQGESCDSVCKSNGQSC 205
Query: 131 VLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSC 190
V +KL +LNQCEI+QKYMSCK CLAS+GADQPAEV DDAP +LNP ACLYTR Q++LSC
Sbjct: 206 VPSKLSVLNQCEIMQKYMSCKGACLASIGADQPAEVVDDAPRHLNPGACLYTRTQTILSC 265
Query: 191 DGSHRHTRRLCPCA 204
DGSH+HTRRLCPCA
Sbjct: 266 DGSHQHTRRLCPCA 279
>gi|297743774|emb|CBI36657.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 318 bits (814), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 151/194 (77%), Positives = 171/194 (88%), Gaps = 1/194 (0%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GCNL+SQCCNSYE+CVSCC+NPART KE L VK+AKP TAG Y SVFDFC GRCRHNSE
Sbjct: 80 GCNLISQCCNSYEFCVSCCLNPARTQKELALKVKMAKPVTAGAYASVFDFCTGRCRHNSE 139
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQ-LEGRLSGISVIIGRQGESCDSVCKSSGQSC 130
SVVHENAYLSDFHHCFS+PSN++ GV + E RL+GI V++GRQGESCDSVCKS+GQSC
Sbjct: 140 SVVHENAYLSDFHHCFSLPSNSTAGGVDRNPELRLNGIHVVVGRQGESCDSVCKSNGQSC 199
Query: 131 VLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSC 190
V +KL +LNQCEI+QKYMSCK CLAS+GADQPAEV DDAP +LNP ACLYTR Q++LSC
Sbjct: 200 VPSKLSVLNQCEIMQKYMSCKGACLASIGADQPAEVVDDAPRHLNPGACLYTRTQTILSC 259
Query: 191 DGSHRHTRRLCPCA 204
DGSH+HTRRLCPCA
Sbjct: 260 DGSHQHTRRLCPCA 273
>gi|356538903|ref|XP_003537940.1| PREDICTED: uncharacterized protein LOC100782588 [Glycine max]
Length = 281
Score = 312 bits (800), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 148/193 (76%), Positives = 168/193 (87%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GCNLLSQCCNSYEYCVSCC+NPA KEQVL +K+AKPTTA TY SVFD+CAGRCRH+SE
Sbjct: 89 GCNLLSQCCNSYEYCVSCCLNPALISKEQVLKMKVAKPTTARTYSSVFDYCAGRCRHSSE 148
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY+SDFHHCFS+ SN+SG T E RLSGI+V++GRQGESC+SVCKS GQSCV
Sbjct: 149 SVVHENAYISDFHHCFSLSSNSSGKNSTLTEARLSGINVVVGRQGESCNSVCKSRGQSCV 208
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
NKL++LN C+IIQKYMSCK CLAS+GADQPAEV DAP +LNP +CLYT QS+LSCD
Sbjct: 209 PNKLVVLNSCDIIQKYMSCKGSCLASVGADQPAEVVYDAPVHLNPGSCLYTETQSILSCD 268
Query: 192 GSHRHTRRLCPCA 204
G H+HTRRLCPCA
Sbjct: 269 GLHQHTRRLCPCA 281
>gi|388496984|gb|AFK36558.1| unknown [Lotus japonicus]
Length = 272
Score = 308 bits (788), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 143/193 (74%), Positives = 167/193 (86%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GCN+LSQCCNSYEYCVSCC+NPA T KEQVL +KI+KP TA TY +VFD+CA RCRH+SE
Sbjct: 80 GCNVLSQCCNSYEYCVSCCLNPALTSKEQVLKMKISKPVTARTYENVFDYCAARCRHSSE 139
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAYLSDFHHCFS+PSN+SG VT E RL+GI+V++GRQGESC SVCK GQSCV
Sbjct: 140 SVVHENAYLSDFHHCFSLPSNSSGTNVTLTEARLNGINVVVGRQGESCSSVCKLRGQSCV 199
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
NKL++LN C+IIQ+YMSCK CLAS+G DQPAEV DAP +LNP +CLYT +S+LSCD
Sbjct: 200 PNKLVVLNNCDIIQRYMSCKGSCLASVGTDQPAEVVYDAPKDLNPGSCLYTETESVLSCD 259
Query: 192 GSHRHTRRLCPCA 204
GSH+HT+RLCPCA
Sbjct: 260 GSHQHTKRLCPCA 272
>gi|297817236|ref|XP_002876501.1| hypothetical protein ARALYDRAFT_486406 [Arabidopsis lyrata subsp.
lyrata]
gi|297322339|gb|EFH52760.1| hypothetical protein ARALYDRAFT_486406 [Arabidopsis lyrata subsp.
lyrata]
Length = 272
Score = 303 bits (776), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 148/193 (76%), Positives = 171/193 (88%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GCNLLSQCCNSYE+CVSCC+NP+RTL E+V+ VK+AKP TAGTY SVFDFCAGRCRHNSE
Sbjct: 80 GCNLLSQCCNSYEFCVSCCLNPSRTLLEKVVKVKVAKPATAGTYKSVFDFCAGRCRHNSE 139
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY S+FHHCFS+ SNASGA +TQ+E RL GI VI+G QG+SCD+VCKS GQ CV
Sbjct: 140 SVVHENAYHSEFHHCFSLTSNASGANLTQVETRLLGIDVIVGSQGDSCDAVCKSRGQLCV 199
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
+NKL LLNQC+++++YM+CK CLAS GADQPAEV +DAP +L P ACLYTR QSMLSCD
Sbjct: 200 MNKLSLLNQCDVMKRYMTCKGSCLASAGADQPAEVVEDAPRDLYPGACLYTRTQSMLSCD 259
Query: 192 GSHRHTRRLCPCA 204
GSH+HTRRLCPCA
Sbjct: 260 GSHQHTRRLCPCA 272
>gi|30694985|ref|NP_191498.2| uncharacterized protein [Arabidopsis thaliana]
gi|42572729|ref|NP_974460.1| uncharacterized protein [Arabidopsis thaliana]
gi|332646394|gb|AEE79915.1| uncharacterized protein [Arabidopsis thaliana]
gi|332646395|gb|AEE79916.1| uncharacterized protein [Arabidopsis thaliana]
Length = 273
Score = 297 bits (761), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 144/193 (74%), Positives = 171/193 (88%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GCN+LSQCCNSYE+CVSCC+NP++TL E+V+ VK+AKP T+GTY SVFDFCAGRCRHNSE
Sbjct: 81 GCNILSQCCNSYEFCVSCCLNPSQTLLEKVVKVKVAKPATSGTYKSVFDFCAGRCRHNSE 140
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY S+FHHCFS+ SNASGA +TQ+E RL GI VI+G QG+SCD+VCKS GQ CV
Sbjct: 141 SVVHENAYHSEFHHCFSLTSNASGANLTQVETRLLGIDVIVGSQGDSCDAVCKSRGQLCV 200
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
+NKL LLNQC+++++YM+CK CLAS GADQPAEV +DAP +L P ACLYTR QS+LSCD
Sbjct: 201 MNKLSLLNQCDVMKRYMNCKGSCLASAGADQPAEVVEDAPRDLYPGACLYTRTQSLLSCD 260
Query: 192 GSHRHTRRLCPCA 204
GSH+HTRRLCPCA
Sbjct: 261 GSHQHTRRLCPCA 273
>gi|255585501|ref|XP_002533442.1| conserved hypothetical protein [Ricinus communis]
gi|223526704|gb|EEF28938.1| conserved hypothetical protein [Ricinus communis]
Length = 251
Score = 274 bits (701), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 128/163 (78%), Positives = 145/163 (88%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GCNLLS CCNSYE+CVSCC+NPART +E+V+ +KIAKP TAGTY S FDFCAGRCRHNSE
Sbjct: 83 GCNLLSHCCNSYEFCVSCCLNPARTQEEEVMKLKIAKPATAGTYTSTFDFCAGRCRHNSE 142
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAYLSDFHHCFS+PSN++GA QLE RL GI+VIIGRQGESCDS CKS+GQSCV
Sbjct: 143 SVVHENAYLSDFHHCFSLPSNSTGANYRQLEARLVGINVIIGRQGESCDSACKSNGQSCV 202
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNL 174
LNKL++LNQC+I+QKYMSC+ CLASMG DQPAEV D AP +L
Sbjct: 203 LNKLLVLNQCDIMQKYMSCRGACLASMGTDQPAEVVDIAPKHL 245
>gi|242084782|ref|XP_002442816.1| hypothetical protein SORBIDRAFT_08g003310 [Sorghum bicolor]
gi|241943509|gb|EES16654.1| hypothetical protein SORBIDRAFT_08g003310 [Sorghum bicolor]
Length = 276
Score = 273 bits (699), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 126/193 (65%), Positives = 155/193 (80%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GC L QCCNSYEYCVSCC+NP++ KE VL +K+AKP TAGTY +VFDFC GRCRH+S
Sbjct: 84 GCKLDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY SDFHHCFS+ N+SG+ + + +L GI+V++GR GESC VCK GQSCV
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSVVSKLLGINVVVGRPGESCSLVCKVRGQSCV 203
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
++L +LN+CEI+QKYM CK GC S+G DQPAEV D+AP++LNP ACLY ++ L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCD 263
Query: 192 GSHRHTRRLCPCA 204
GSH+HTRRLCPCA
Sbjct: 264 GSHQHTRRLCPCA 276
>gi|357156916|ref|XP_003577619.1| PREDICTED: uncharacterized protein LOC100836694 [Brachypodium
distachyon]
Length = 275
Score = 270 bits (690), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 122/193 (63%), Positives = 153/193 (79%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GC L QCCNSYEYCVSCC++P+RT + VL +K+AKP TAGTY ++FDFC GRCRH+S
Sbjct: 83 GCKLDLQCCNSYEYCVSCCLDPSRTKERDVLKLKVAKPVTAGTYANIFDFCMGRCRHSSA 142
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY+SDFHHCF + N SG+ + RL+GISVI+GRQGE C S C++ GQSCV
Sbjct: 143 SVVHENAYVSDFHHCFMVQQNLSGSTDSNYGSRLAGISVIVGRQGEPCSSTCRAKGQSCV 202
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
++L LN+C+I+QKYM C+ GC S+G DQPAEV D+AP++LNP ACLY + L+CD
Sbjct: 203 PSRLSELNKCQILQKYMRCRSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQTDERLTCD 262
Query: 192 GSHRHTRRLCPCA 204
GSH+HTRR+CPCA
Sbjct: 263 GSHQHTRRVCPCA 275
>gi|218185557|gb|EEC67984.1| hypothetical protein OsI_35752 [Oryza sativa Indica Group]
Length = 276
Score = 266 bits (679), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 124/193 (64%), Positives = 156/193 (80%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GC L QCC+SYEYCVSCC+NP+RT + VL +K+A+P T+GTY +VFDFC GRCRH+S
Sbjct: 84 GCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVARPVTSGTYRNVFDFCMGRCRHSSA 143
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY SDFHHCF + N+SG+ + RL GI++ IGR+GESC SVC++ GQSCV
Sbjct: 144 SVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGINISIGRRGESCSSVCRAKGQSCV 203
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
++L +LN+CEI+QKYM CK GC +S+G DQPA+V D+APSNLNP ACLY ++ L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFSSLGPDQPAQVVDEAPSNLNPGACLYMQMDERLTCD 263
Query: 192 GSHRHTRRLCPCA 204
GSH+HTRRLCPCA
Sbjct: 264 GSHQHTRRLCPCA 276
>gi|297728217|ref|NP_001176472.1| Os11g0267100 [Oryza sativa Japonica Group]
gi|255679983|dbj|BAH95200.1| Os11g0267100 [Oryza sativa Japonica Group]
Length = 208
Score = 265 bits (677), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 123/193 (63%), Positives = 156/193 (80%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GC L QCC+SYEYCVSCC+NP+RT + VL +K+A+P T+GTY +VFDFC GRCRH+S
Sbjct: 16 GCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVARPVTSGTYRNVFDFCMGRCRHSSA 75
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY SDFHHCF + N+SG+ + RL GI++ IGR+GESC SVC++ GQSCV
Sbjct: 76 SVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGINISIGRRGESCSSVCRAKGQSCV 135
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
++L +LN+CEI+QKYM CK GC +++G DQPA+V D+APSNLNP ACLY ++ L+CD
Sbjct: 136 PSRLSVLNKCEILQKYMRCKSGCFSNLGPDQPAQVVDEAPSNLNPGACLYMQMDERLTCD 195
Query: 192 GSHRHTRRLCPCA 204
GSH+HTRRLCPCA
Sbjct: 196 GSHQHTRRLCPCA 208
>gi|222615817|gb|EEE51949.1| hypothetical protein OsJ_33587 [Oryza sativa Japonica Group]
Length = 276
Score = 264 bits (675), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 123/193 (63%), Positives = 156/193 (80%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GC L QCC+SYEYCVSCC+NP+RT + VL +K+A+P T+GTY +VFDFC GRCRH+S
Sbjct: 84 GCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVARPVTSGTYRNVFDFCMGRCRHSSA 143
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY SDFHHCF + N+SG+ + RL GI++ IGR+GESC SVC++ GQSCV
Sbjct: 144 SVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGINISIGRRGESCSSVCRAKGQSCV 203
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
++L +LN+CEI+QKYM CK GC +++G DQPA+V D+APSNLNP ACLY ++ L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFSNLGPDQPAQVVDEAPSNLNPGACLYMQMDERLTCD 263
Query: 192 GSHRHTRRLCPCA 204
GSH+HTRRLCPCA
Sbjct: 264 GSHQHTRRLCPCA 276
>gi|449469174|ref|XP_004152296.1| PREDICTED: uncharacterized protein LOC101217783 [Cucumis sativus]
Length = 252
Score = 264 bits (674), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 131/193 (67%), Positives = 150/193 (77%), Gaps = 19/193 (9%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GCNL+ QCCNSYEYCVSCC NP+RT +EQ+L +KIAKP TAG
Sbjct: 79 GCNLVLQCCNSYEYCVSCCQNPSRTKREQILKIKIAKPATAGK----------------- 121
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
VHENAY S FHHCFS+PS +SG TQLE RLSGI+VIIGRQGESCDSVCKS+GQSCV
Sbjct: 122 --VHENAYHSAFHHCFSLPSYSSGDNSTQLEFRLSGINVIIGRQGESCDSVCKSNGQSCV 179
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
NK+++LN C+I+QKY+SCK GCLAS+G DQPAEVA DAP LNP ACLY+ QSM+SCD
Sbjct: 180 PNKILVLNHCDIMQKYLSCKGGCLASVGTDQPAEVAYDAPKELNPGACLYSSAQSMVSCD 239
Query: 192 GSHRHTRRLCPCA 204
GSH HT+RLCPCA
Sbjct: 240 GSHPHTKRLCPCA 252
>gi|219887191|gb|ACL53970.1| unknown [Zea mays]
gi|413916125|gb|AFW56057.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
gi|413916126|gb|AFW56058.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
gi|413916127|gb|AFW56059.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
Length = 276
Score = 263 bits (672), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 125/193 (64%), Positives = 152/193 (78%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GC L QCCNSYEYCVSCC+NP++ KE VL +K+AKP TAGTY +VFDFC GRCRH+S
Sbjct: 84 GCELDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY SDFHHCFS+ N+SG+ + +L GI+V++GR ESC VCK GQSCV
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVGRPRESCSLVCKVRGQSCV 203
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
++L +LN+CEI+QKYM CK GC S+G DQPAEV D+AP++LNP ACLY ++ L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCD 263
Query: 192 GSHRHTRRLCPCA 204
GSH HTRRLCPCA
Sbjct: 264 GSHHHTRRLCPCA 276
>gi|413916124|gb|AFW56056.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
Length = 258
Score = 263 bits (671), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 125/193 (64%), Positives = 152/193 (78%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GC L QCCNSYEYCVSCC+NP++ KE VL +K+AKP TAGTY +VFDFC GRCRH+S
Sbjct: 66 GCELDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 125
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY SDFHHCFS+ N+SG+ + +L GI+V++GR ESC VCK GQSCV
Sbjct: 126 SVVHENAYASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVGRPRESCSLVCKVRGQSCV 185
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
++L +LN+CEI+QKYM CK GC S+G DQPAEV D+AP++LNP ACLY ++ L+CD
Sbjct: 186 PSRLSVLNKCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCD 245
Query: 192 GSHRHTRRLCPCA 204
GSH HTRRLCPCA
Sbjct: 246 GSHHHTRRLCPCA 258
>gi|226499180|ref|NP_001143347.1| uncharacterized protein LOC100275960 precursor [Zea mays]
gi|195618642|gb|ACG31151.1| hypothetical protein [Zea mays]
Length = 276
Score = 263 bits (671), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 125/193 (64%), Positives = 151/193 (78%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GC L QCCNSYEYCVSCC NP++ KE VL +K+AKP TAGTY +VFDFC GRCRH+S
Sbjct: 84 GCELDLQCCNSYEYCVSCCFNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY SDFHHCFS+ N+SG+ + +L GI+V++GR ESC VCK GQSCV
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVGRPRESCSLVCKVRGQSCV 203
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
++L +LN+CEI+QKYM CK GC S+G DQPAEV D+AP++LNP ACLY ++ L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCD 263
Query: 192 GSHRHTRRLCPCA 204
GSH HTRRLCPCA
Sbjct: 264 GSHHHTRRLCPCA 276
>gi|147779400|emb|CAN76795.1| hypothetical protein VITISV_030117 [Vitis vinifera]
Length = 258
Score = 258 bits (659), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 131/201 (65%), Positives = 149/201 (74%), Gaps = 28/201 (13%)
Query: 2 LPRERREILLGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDF 61
L + R I +GCNL+SQCCNSYE+CVSCC+NPART KE L VK+AKP TAG Y SVFDF
Sbjct: 40 LDVDMRVIHIGCNLISQCCNSYEFCVSCCLNPARTQKELALKVKMAKPVTAGAYASVFDF 99
Query: 62 CAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAG---------------VTQL----- 101
C GRCRHNSESVVHENAYLSDFHHCFS+PSN++G V QL
Sbjct: 100 CTGRCRHNSESVVHENAYLSDFHHCFSLPSNSAGEAWSLIPNSWNGLFLKIVKQLMASMH 159
Query: 102 --------EGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRG 153
E RL+GI V++GRQGESCDSVCKS+GQSCV +KL +LNQCEI+QKYMSCK
Sbjct: 160 AGGVDRNPELRLNGIHVVVGRQGESCDSVCKSNGQSCVPSKLSVLNQCEIMQKYMSCKGA 219
Query: 154 CLASMGADQPAEVADDAPSNL 174
CLAS+GADQPAEV DDAP +L
Sbjct: 220 CLASIGADQPAEVVDDAPRHL 240
>gi|449484814|ref|XP_004156988.1| PREDICTED: uncharacterized LOC101231289 [Cucumis sativus]
Length = 154
Score = 246 bits (627), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 117/151 (77%), Positives = 131/151 (86%)
Query: 54 TYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIG 113
TY SVFDFC GRCRHNSESVVHENAY S FHHCFS+PS +SG TQLE RLSGI+VIIG
Sbjct: 4 TYASVFDFCVGRCRHNSESVVHENAYHSAFHHCFSLPSYSSGDNSTQLEFRLSGINVIIG 63
Query: 114 RQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSN 173
RQGESCDSVCKS+GQSCV NK+++LN C+I+QKY+SCK GCLAS+G DQPAEVA DAP
Sbjct: 64 RQGESCDSVCKSNGQSCVPNKILVLNHCDIMQKYLSCKGGCLASVGTDQPAEVAYDAPKE 123
Query: 174 LNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 204
LNP ACLY+ QSM+SCDGSH HT+RLCPCA
Sbjct: 124 LNPGACLYSSAQSMVSCDGSHPHTKRLCPCA 154
>gi|7801689|emb|CAB91609.1| putative protein [Arabidopsis thaliana]
Length = 273
Score = 244 bits (622), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 119/163 (73%), Positives = 144/163 (88%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GCN+LSQCCNSYE+CVSCC+NP++TL E+V+ VK+AKP T+GTY SVFDFCAGRCRHNSE
Sbjct: 81 GCNILSQCCNSYEFCVSCCLNPSQTLLEKVVKVKVAKPATSGTYKSVFDFCAGRCRHNSE 140
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY S+FHHCFS+ SNASGA +TQ+E RL GI VI+G QG+SCD+VCKS GQ CV
Sbjct: 141 SVVHENAYHSEFHHCFSLTSNASGANLTQVETRLLGIDVIVGSQGDSCDAVCKSRGQLCV 200
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNL 174
+NKL LLNQC+++++YM+CK CLAS GADQPAEV +DAP +L
Sbjct: 201 MNKLSLLNQCDVMKRYMNCKGSCLASAGADQPAEVVEDAPRDL 243
>gi|108864223|gb|ABA92588.2| expressed protein [Oryza sativa Japonica Group]
Length = 251
Score = 218 bits (555), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 102/163 (62%), Positives = 131/163 (80%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GC L QCC+SYEYCVSCC+NP+RT + VL +K+A+P T+GTY +VFDFC GRCRH+S
Sbjct: 84 GCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVARPVTSGTYRNVFDFCMGRCRHSSA 143
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY SDFHHCF + N+SG+ + RL GI++ IGR+GESC SVC++ GQSCV
Sbjct: 144 SVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGINISIGRRGESCSSVCRAKGQSCV 203
Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNL 174
++L +LN+CEI+QKYM CK GC +++G DQPA+V D+APSNL
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFSNLGPDQPAQVVDEAPSNL 246
>gi|168051463|ref|XP_001778174.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670496|gb|EDQ57064.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 235
Score = 216 bits (549), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 110/204 (53%), Positives = 135/204 (66%), Gaps = 2/204 (0%)
Query: 3 PRERREIL-LGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDF 61
PRE + GCN+ SQCCNSYEYCVSCC+NP RT E VL K+A+ TAG++ S+FD+
Sbjct: 32 PREGEQFACQGCNIGSQCCNSYEYCVSCCLNPVRTTLEVVLKTKLARQPTAGSWTSLFDY 91
Query: 62 CAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGV-TQLEGRLSGISVIIGRQGESCD 120
C+GRCRHNS+SVVHENAY+S+ HHCFS S G V E L+ + V +GR+G SCD
Sbjct: 92 CSGRCRHNSQSVVHENAYVSEDHHCFSARSTPPGVQVEMDAEDELADVFVYVGREGLSCD 151
Query: 121 SVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACL 180
+ C + SC KL LN C +QKY C C+ S G DQPAEV AP +++P ACL
Sbjct: 152 ATCSAQKLSCREEKLAALNNCAELQKYNKCHGLCIPSTGPDQPAEVVSTAPRHMHPGACL 211
Query: 181 YTRIQSMLSCDGSHRHTRRLCPCA 204
Y S LSC G H +RRLCPCA
Sbjct: 212 YNTQVSQLSCKGYHLFSRRLCPCA 235
>gi|302788558|ref|XP_002976048.1| hypothetical protein SELMODRAFT_415982 [Selaginella moellendorffii]
gi|300156324|gb|EFJ22953.1| hypothetical protein SELMODRAFT_415982 [Selaginella moellendorffii]
Length = 263
Score = 207 bits (528), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 97/194 (50%), Positives = 135/194 (69%), Gaps = 1/194 (0%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GCNL+SQCCN++EYCVSCC++P T KE L+ K A+ TA + SVF++C GRCRHNS
Sbjct: 70 GCNLVSQCCNTFEYCVSCCLDPIHTSKEFALSSKSARQLTAANFVSVFEYCTGRCRHNSG 129
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLE-GRLSGISVIIGRQGESCDSVCKSSGQSC 130
SVVHENAY S++HHCF +N S +E G + S+I GR+G+SCD C+ G+ C
Sbjct: 130 SVVHENAYTSEYHHCFFTQTNLSSGRDHGVEAGENTDYSLITGRRGQSCDEACRQRGKKC 189
Query: 131 VLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSC 190
+ L L+N+C ++QK M+CK C++ G D P+EV AP +L+P AC++ ++LSC
Sbjct: 190 KASILPLINECSVLQKSMNCKGACISGSGPDLPSEVVSSAPRHLHPGACVFNSKNTLLSC 249
Query: 191 DGSHRHTRRLCPCA 204
G+HR+T+RLCPC
Sbjct: 250 QGAHRYTKRLCPCG 263
>gi|302769810|ref|XP_002968324.1| hypothetical protein SELMODRAFT_409585 [Selaginella moellendorffii]
gi|300163968|gb|EFJ30578.1| hypothetical protein SELMODRAFT_409585 [Selaginella moellendorffii]
Length = 263
Score = 207 bits (527), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 97/194 (50%), Positives = 135/194 (69%), Gaps = 1/194 (0%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GCNL+SQCCN++EYCVSCC++P T KE L+ K A+ TA + SVF++C GRCRHNS
Sbjct: 70 GCNLVSQCCNTFEYCVSCCLDPIHTSKEFALSSKSARQLTAANFVSVFEYCTGRCRHNSG 129
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLE-GRLSGISVIIGRQGESCDSVCKSSGQSC 130
SVVHENAY S++HHCF +N S +E G + S+I GR+G+SCD C+ G+ C
Sbjct: 130 SVVHENAYTSEYHHCFFTQTNLSSGRDHGVEAGENTDYSLITGRRGQSCDEACRQRGKKC 189
Query: 131 VLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSC 190
+ L L+N+C ++QK M+CK C++ G D P+EV AP +L+P AC++ ++LSC
Sbjct: 190 KASILPLINECSVLQKSMNCKGACVSGSGPDLPSEVVSSAPRHLHPGACVFNSKNTLLSC 249
Query: 191 DGSHRHTRRLCPCA 204
G+HR+T+RLCPC
Sbjct: 250 QGAHRYTKRLCPCG 263
>gi|255637592|gb|ACU19121.1| unknown [Glycine max]
Length = 224
Score = 201 bits (510), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 94/120 (78%), Positives = 106/120 (88%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GCNLLSQCCNSYEYCVSCC+NPA KEQVL +K+AKPTTA TY SVFD+CAGRCRH+SE
Sbjct: 89 GCNLLSQCCNSYEYCVSCCLNPALISKEQVLKMKVAKPTTARTYSSVFDYCAGRCRHSSE 148
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
SVVHENAY+SDFHHCFS+ SN+SG T E RLSGI+V++GRQGESC+SVCKS GQSCV
Sbjct: 149 SVVHENAYISDFHHCFSLSSNSSGKNSTLTEARLSGINVVVGRQGESCNSVCKSRGQSCV 208
>gi|62734222|gb|AAX96331.1| hypothetical protein LOC_Os11g16560 [Oryza sativa Japonica Group]
Length = 216
Score = 180 bits (456), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 84/139 (60%), Positives = 110/139 (79%)
Query: 36 TLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASG 95
T + VL +K+A+P T+GTY +VFDFC GRCRH+S SVVHENAY SDFHHCF + N+SG
Sbjct: 73 TKETDVLKLKVARPVTSGTYRNVFDFCMGRCRHSSASVVHENAYASDFHHCFLLQQNSSG 132
Query: 96 AGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCL 155
+ + RL GI++ IGR+GESC SVC++ GQSCV ++L +LN+CEI+QKYM CK GC
Sbjct: 133 SADSGSGPRLDGINISIGRRGESCSSVCRAKGQSCVPSRLSVLNKCEILQKYMRCKSGCF 192
Query: 156 ASMGADQPAEVADDAPSNL 174
+++G DQPA+V D+APSNL
Sbjct: 193 SNLGPDQPAQVVDEAPSNL 211
>gi|413916128|gb|AFW56060.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
Length = 186
Score = 139 bits (350), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 66/102 (64%), Positives = 81/102 (79%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GC L QCCNSYEYCVSCC+NP++ KE VL +K+AKP TAGTY +VFDFC GRCRH+S
Sbjct: 84 GCELDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143
Query: 72 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIG 113
SVVHENAY SDFHHCFS+ N+SG+ + +L GI+V++G
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVG 185
>gi|384245502|gb|EIE18996.1| hypothetical protein COCSUDRAFT_31544 [Coccomyxa subellipsoidea
C-169]
Length = 278
Score = 134 bits (337), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 71/193 (36%), Positives = 104/193 (53%), Gaps = 3/193 (1%)
Query: 13 CNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSES 72
CN+ CC YE+CVSCC +P L + + G + SVF++C +CR +S+S
Sbjct: 83 CNMHDSCCADYEHCVSCCQDPQHGLPDLSTVHRAPDRPETGLWPSVFEYCRNKCRTSSKS 142
Query: 73 VVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVL 132
VHENAYLS FHHCFS S ++V++G G+SC+++C+S+G+ C
Sbjct: 143 TVHENAYLSPFHHCFSS-SGKPTTAAPPTPPIPKSVTVVLGSAGQSCNALCESTGRRCAQ 201
Query: 133 NKLILLNQCEIIQKYMSCKRGCLASMG-ADQPAEVADDAPSNLNPTACLYTRIQ-SMLSC 190
L LN C +++++ +C+ GC G AD P V AP PT CL + + SC
Sbjct: 202 QHLASLNNCNVLREHYACEAGCEPDSGTADAPIYVNASAPKQQRPTICLTMDSKVNQFSC 261
Query: 191 DGSHRHTRRLCPC 203
+ + RLCPC
Sbjct: 262 EATSASNHRLCPC 274
>gi|159475585|ref|XP_001695899.1| predicted membrane-anchored protein [Chlamydomonas reinhardtii]
gi|158275459|gb|EDP01236.1| predicted membrane-anchored protein [Chlamydomonas reinhardtii]
Length = 269
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 105/202 (51%), Gaps = 22/202 (10%)
Query: 13 CNLLSQCCNSYEYCVSCCVNP---ARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHN 69
C++ +CC+ YE CVSCC+ P A ++ +Q L + + +G +G F++C G CR +
Sbjct: 78 CDMRDRCCSEYESCVSCCLAPQHNAVSIAKQAL--RSPRHKDSGFWGDPFEYCKGICRTH 135
Query: 70 SESVVHENAYLSDFHHCFS-----MPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCK 124
S S HENAY+S HHCFS M S+ AGV + G+ V+ G++ +CD VC
Sbjct: 136 SRSTAHENAYISSRHHCFSQLGRPMLSDPLPAGV------MDGVEVVTGQRNANCDDVCA 189
Query: 125 SSGQSCVLNKLILLNQCEIIQKYMSCKRGC--LASMGADQPAEVADDAPSNLNPTACLYT 182
+ + C + L L+ C+ +++ C+ GC +A +G P+ V +AP P C
Sbjct: 190 AKQKKCSADHLRWLSSCDRLREQFGCEAGCEVVAGLG---PSYVDGNAPKPARPAMCFAQ 246
Query: 183 RIQ-SMLSCDGSHRHTRRLCPC 203
+ LSC LCPC
Sbjct: 247 PAEGGKLSCSAREEQHLMLCPC 268
>gi|358345744|ref|XP_003636935.1| hypothetical protein MTR_065s0010 [Medicago truncatula]
gi|355502870|gb|AES84073.1| hypothetical protein MTR_065s0010 [Medicago truncatula]
Length = 131
Score = 105 bits (263), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 51/75 (68%), Positives = 56/75 (74%)
Query: 129 SCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSML 188
S VL I L IQKYMSCK GCLAS+G DQPAEV DAP +NP +CLYT QS+L
Sbjct: 28 SYVLTVSIDLRYVGSIQKYMSCKGGCLASVGTDQPAEVVYDAPKYMNPESCLYTETQSIL 87
Query: 189 SCDGSHRHTRRLCPC 203
SCDGSH+HTRRLCPC
Sbjct: 88 SCDGSHQHTRRLCPC 102
>gi|428177708|gb|EKX46586.1| hypothetical protein GUITHDRAFT_107373 [Guillardia theta CCMP2712]
Length = 233
Score = 102 bits (255), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 88/169 (52%), Gaps = 17/169 (10%)
Query: 5 ERREILLGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV------ 58
E R C ++CC+ YE CVSCC+NP+ +++ N +A T+
Sbjct: 64 EPRWSCRSCEAETKCCDHYENCVSCCLNPS---NDELRNTAMAASRDKKTFADAVKRKDS 120
Query: 59 FDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGES 118
F+FC CR +S+S VH NAY+S+ C S PS S + ++V +G+QGES
Sbjct: 121 FEFCRASCRTSSKSTVHGNAYISEKRFCLS-PSILSNT-------LPADVAVTVGKQGES 172
Query: 119 CDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVA 167
C VC S G+ C + +N CE++QK CK C + G DQPA V+
Sbjct: 173 CAEVCSSVGKRCDTRYITSINTCEVLQKNFPCKNTCEKNYGHDQPAYVS 221
>gi|242070783|ref|XP_002450668.1| hypothetical protein SORBIDRAFT_05g009356 [Sorghum bicolor]
gi|241936511|gb|EES09656.1| hypothetical protein SORBIDRAFT_05g009356 [Sorghum bicolor]
Length = 111
Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 41/59 (69%), Positives = 48/59 (81%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNS 70
GC L QCCNSYEYCVSCC+NP++ KE VL +K+AKP T GTY +VFDFC GRCRH+S
Sbjct: 53 GCKLDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTVGTYTNVFDFCMGRCRHSS 111
>gi|242051320|ref|XP_002463404.1| hypothetical protein SORBIDRAFT_02g043236 [Sorghum bicolor]
gi|241926781|gb|EER99925.1| hypothetical protein SORBIDRAFT_02g043236 [Sorghum bicolor]
Length = 82
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/68 (64%), Positives = 50/68 (73%)
Query: 3 PRERREILLGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFC 62
P R GC L QCCNSYEYCVSCC+NP++ KE VL +K AKP TAGTY +VFDFC
Sbjct: 15 PTLDRFSCRGCKLDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKEAKPVTAGTYTNVFDFC 74
Query: 63 AGRCRHNS 70
GRCRH+S
Sbjct: 75 MGRCRHSS 82
>gi|307110379|gb|EFN58615.1| hypothetical protein CHLNCDRAFT_17323, partial [Chlorella
variabilis]
Length = 225
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/199 (29%), Positives = 93/199 (46%), Gaps = 16/199 (8%)
Query: 13 CNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIA--KPTTAGTYGSVFDFCAGRCRHNS 70
C +CC+ YE+CVSCC+ P E++ ++ KP T G + + F++C CR +
Sbjct: 35 CEAKDKCCSRYEHCVSCCLRPENKPGEKMQSIFRGRNKPET-GHWSTPFEYCQAVCRTTA 93
Query: 71 ESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSC 130
S HENA++ D CFS A E S++ G G+SCD C + +C
Sbjct: 94 RSTQHENAFILDRRFCFSKVGFVPPAPALPKEA-----SLVAGAAGQSCDVACAAKQMTC 148
Query: 131 VLNKLILLNQCEIIQ-KYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLY-----TRI 184
+ +N C ++ K+M C G A+ + P V AP + P C + ++
Sbjct: 149 RAEWFLSINDCNSLRAKFMCC--GSCAADQTEYPGYVEGSAPKHQQPAFCAFLPPLAAQL 206
Query: 185 QSMLSCDGSHRHTRRLCPC 203
+C ++ RRLCPC
Sbjct: 207 LPSFNCSHANGSVRRLCPC 225
>gi|255071235|ref|XP_002507699.1| glycosyltransferase family 13 protein [Micromonas sp. RCC299]
gi|226522974|gb|ACO68957.1| glycosyltransferase family 13 protein [Micromonas sp. RCC299]
Length = 636
Score = 66.2 bits (160), Expect = 6e-09, Method: Composition-based stats.
Identities = 39/125 (31%), Positives = 59/125 (47%), Gaps = 13/125 (10%)
Query: 80 LSDFHHCFSMPSNASGAGVTQLEGRLS-GISVIIGRQGESCDSVCKSSGQSCVLNKLILL 138
L+D C +PS+ L R S G++ + QG++CD+ C+ + C L
Sbjct: 520 LADERSCPYLPSH--------LRVRPSRGLTPVAAVQGQTCDAACRYNHMKCHAPDFWFL 571
Query: 139 NQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTR 198
N CE++ K+ C+ GC +G D P V + CL + Q+ +C SHR T
Sbjct: 572 NSCEVLAKFFPCEAGCALVLGDDIPNYVVSNRMDTYQ--KCLVSERQA--TCAASHRATT 627
Query: 199 RLCPC 203
RLC C
Sbjct: 628 RLCAC 632
>gi|195393082|ref|XP_002055183.1| GJ19227 [Drosophila virilis]
gi|194149693|gb|EDW65384.1| GJ19227 [Drosophila virilis]
Length = 215
Score = 63.5 bits (153), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/74 (50%), Positives = 45/74 (60%), Gaps = 7/74 (9%)
Query: 13 CNLLSQCCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRH 68
CN S CC YEYCVSCC++P R+L EQVL A+ T Y SV F+ C +CR
Sbjct: 120 CNSTSHCCGVYEYCVSCCLHPNKRSLLEQVLQ---AQDTQKYIYASVVDHFELCLVKCRT 176
Query: 69 NSESVVHENAYLSD 82
NS SV HEN Y ++
Sbjct: 177 NSHSVEHENKYRNE 190
>gi|195130989|ref|XP_002009933.1| GI15640 [Drosophila mojavensis]
gi|193908383|gb|EDW07250.1| GI15640 [Drosophila mojavensis]
Length = 216
Score = 63.2 bits (152), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 37/74 (50%), Positives = 44/74 (59%), Gaps = 7/74 (9%)
Query: 13 CNLLSQCCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRH 68
CN S CC YEYCVSCC++P R+L EQVL A T Y SV F+ C +CR
Sbjct: 121 CNTTSHCCGVYEYCVSCCLHPNKRSLLEQVLQ---AHNTQKYIYASVVDHFELCLVKCRT 177
Query: 69 NSESVVHENAYLSD 82
NS SV HEN Y ++
Sbjct: 178 NSHSVEHENKYRNE 191
>gi|308805236|ref|XP_003079930.1| unnamed protein product [Ostreococcus tauri]
gi|116058387|emb|CAL53576.1| unnamed protein product [Ostreococcus tauri]
Length = 189
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/84 (40%), Positives = 42/84 (50%), Gaps = 4/84 (4%)
Query: 13 CNLLSQCCNSYEYCVSCCVNPAR--TLK-EQVLNVKIAKPTTAGTYGSVFDFCAGRCRHN 69
C+ +CC YE CVSCC+ PAR TL+ E + + G + FDFCA RCR
Sbjct: 84 CDAAVECCERYETCVSCCLAPARRATLETEMRTHARGRNQVVTGFFDDPFDFCANRCRTQ 143
Query: 70 SESVVHENAYLSDFHHCF-SMPSN 92
+HEN Y HCF P N
Sbjct: 144 PSVTLHENEYAYSTKHCFGDYPKN 167
>gi|328719433|ref|XP_003246762.1| PREDICTED: UPF0454 protein C12orf49 homolog [Acyrthosiphon pisum]
Length = 193
Score = 60.1 bits (144), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 50/88 (56%), Gaps = 7/88 (7%)
Query: 5 ERREILLGCNLLSQCCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FD 60
E+R I CN CC YE+CVSCC+NP R L E VL A T + ++ F+
Sbjct: 103 EKRYICDTCNA-DGCCVLYEHCVSCCLNPNKRKLLETVLGK--ASETFRVLFSAINNHFE 159
Query: 61 FCAGRCRHNSESVVHENAYLSDFHHCFS 88
C +CR +S+SV HENAYL HC++
Sbjct: 160 LCLAKCRTSSQSVQHENAYLKSNKHCYT 187
>gi|195044419|ref|XP_001991819.1| GH11862 [Drosophila grimshawi]
gi|193901577|gb|EDW00444.1| GH11862 [Drosophila grimshawi]
Length = 214
Score = 59.7 bits (143), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 48/81 (59%), Gaps = 8/81 (9%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPAR-TLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCR 67
CN+ S CC YE+C+SCC++P + +L EQVL A T Y SV F+ C +CR
Sbjct: 120 SCNVTSHCCAIYEHCISCCLHPNKQSLLEQVLQ---ASDTQKYIYASVVDHFELCLVKCR 176
Query: 68 HNSESVVHENAYLS-DFHHCF 87
NS SV HEN Y + D +C+
Sbjct: 177 TNSHSVEHENKYRNEDAKYCY 197
>gi|195432368|ref|XP_002064195.1| GK19833 [Drosophila willistoni]
gi|194160280|gb|EDW75181.1| GK19833 [Drosophila willistoni]
Length = 215
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 13 CNLLSQCCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
CN S CC YEYCVSCC++P R L E +L A+ F+ C +CR NS
Sbjct: 120 CNQTSMCCEIYEYCVSCCLHPDKRPLLETILETVNAQRYIYAQVEDHFELCLVKCRTNSH 179
Query: 72 SVVHENAY 79
SV HEN Y
Sbjct: 180 SVEHENKY 187
>gi|194768977|ref|XP_001966587.1| GF22253 [Drosophila ananassae]
gi|190617351|gb|EDV32875.1| GF22253 [Drosophila ananassae]
Length = 217
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 41/80 (51%), Gaps = 2/80 (2%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNS 70
CN + CC YEYCVSCC++P + L EQVL + F+ C +CR NS
Sbjct: 121 SCNTTTHCCAIYEYCVSCCLHPEKQPLLEQVLRSANSPKYIFTKVADHFELCLVKCRTNS 180
Query: 71 ESVVHENAYLS-DFHHCFSM 89
SV HEN Y HC+ +
Sbjct: 181 HSVQHENQYRDPSAKHCYGL 200
>gi|195355329|ref|XP_002044144.1| GM22548 [Drosophila sechellia]
gi|194129433|gb|EDW51476.1| GM22548 [Drosophila sechellia]
Length = 197
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 41/79 (51%), Gaps = 2/79 (2%)
Query: 13 CNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
CN + CC YEYCVSCC++P + L E+VL + F+ C +CR NS
Sbjct: 107 CNTSTHCCGVYEYCVSCCLHPGQQPLLERVLQAPNTPKYIFASVTDHFELCLVKCRTNSH 166
Query: 72 SVVHENAYLSD-FHHCFSM 89
SV HEN Y HCF +
Sbjct: 167 SVEHENKYRDPAAKHCFGL 185
>gi|452823888|gb|EME30895.1| peptidyl-prolyl cis-trans isomerase [Galdieria sulphuraria]
Length = 360
Score = 57.8 bits (138), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 49/98 (50%), Gaps = 3/98 (3%)
Query: 110 VIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASM---GADQPAEV 166
+I ++ ++C C+ C +LN C + +Y C+ +A+M G+D P V
Sbjct: 248 IIPAKRKQNCYRACQLKNLVCFKKGFSILNNCPTLSEYFDCRECEIATMESAGSDMPCRV 307
Query: 167 ADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 204
+ APS + C+ + S+ SC+ SH T RLCPC
Sbjct: 308 SLKAPSTYSSGVCMISPNLSLTSCEASHSQTERLCPCV 345
>gi|198471721|ref|XP_001355702.2| GA13867 [Drosophila pseudoobscura pseudoobscura]
gi|198146047|gb|EAL32761.2| GA13867 [Drosophila pseudoobscura pseudoobscura]
Length = 216
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 41/79 (51%), Gaps = 2/79 (2%)
Query: 13 CNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
CN + CC YEYCVSCC++P + L E+VL + F+ C +CR NS
Sbjct: 121 CNATTSCCRLYEYCVSCCLHPDKQPLLERVLQSAHTPKYIYASVADHFELCLVKCRTNSH 180
Query: 72 SVVHENAYLS-DFHHCFSM 89
SV HEN Y HC+ +
Sbjct: 181 SVEHENKYRDGKAKHCYGI 199
>gi|28571353|ref|NP_788909.1| CG15643 [Drosophila melanogaster]
gi|7293087|gb|AAF48472.1| CG15643 [Drosophila melanogaster]
gi|17944302|gb|AAL48044.1| RE11751p [Drosophila melanogaster]
gi|220947868|gb|ACL86477.1| CG15643-PA [synthetic construct]
gi|220957174|gb|ACL91130.1| CG15643-PA [synthetic construct]
Length = 212
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 41/79 (51%), Gaps = 2/79 (2%)
Query: 13 CNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
CN + CC YEYCVSCC++P + L E+VL + F+ C +CR NS
Sbjct: 121 CNTSTHCCGVYEYCVSCCLHPGQQPLLERVLQAPNTPKYIFASVTDHFELCLVKCRTNSH 180
Query: 72 SVVHENAYLSD-FHHCFSM 89
SV HEN Y HC+ +
Sbjct: 181 SVEHENKYRDPAAKHCYGL 199
>gi|195478878|ref|XP_002100681.1| GE16041 [Drosophila yakuba]
gi|194188205|gb|EDX01789.1| GE16041 [Drosophila yakuba]
Length = 217
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 8/82 (9%)
Query: 13 CNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRH 68
CN + CC YEYCVSCC++P + L E+VL +A T + SV F+ C +CR
Sbjct: 121 CNTSTHCCGVYEYCVSCCLHPGQQPLLERVL---LAPNTPKYIFASVTDHFELCLVKCRT 177
Query: 69 NSESVVHENAYLSD-FHHCFSM 89
NS SV HEN Y HC+ +
Sbjct: 178 NSHSVEHENKYRDPAAKHCYGL 199
>gi|156380036|ref|XP_001631761.1| predicted protein [Nematostella vectensis]
gi|156218806|gb|EDO39698.1| predicted protein [Nematostella vectensis]
Length = 135
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 47/79 (59%), Gaps = 3/79 (3%)
Query: 16 LSQCCNSYEYCVSCCVNPA-RTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNSESV 73
++ CC+ YE CVSCC++P + L +VLNV P + F+ C +CR +S+SV
Sbjct: 48 VNNCCSIYENCVSCCLDPKQKELLREVLNVWRTAPNVILKSITDQFELCLTKCRTSSKSV 107
Query: 74 VHENAYLSD-FHHCFSMPS 91
HEN+Y + + HCF + S
Sbjct: 108 WHENSYKDNKYKHCFGLTS 126
>gi|195164215|ref|XP_002022944.1| GL16447 [Drosophila persimilis]
gi|194105006|gb|EDW27049.1| GL16447 [Drosophila persimilis]
Length = 216
Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 40/79 (50%), Gaps = 2/79 (2%)
Query: 13 CNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
CN + CC YEYCVSCC++P + L E VL + F+ C +CR NS
Sbjct: 121 CNATTNCCRLYEYCVSCCLHPDKQPLLELVLQSAYTPKYIYASVADHFELCLVKCRTNSH 180
Query: 72 SVVHENAYLS-DFHHCFSM 89
SV HEN Y HC+ +
Sbjct: 181 SVEHENKYRDGKAKHCYGI 199
>gi|291000376|ref|XP_002682755.1| N-acetylglucosaminylaminotransferase [Naegleria gruberi]
gi|284096383|gb|EFC50011.1| N-acetylglucosaminylaminotransferase [Naegleria gruberi]
Length = 617
Score = 57.0 bits (136), Expect = 5e-06, Method: Composition-based stats.
Identities = 46/154 (29%), Positives = 71/154 (46%), Gaps = 27/154 (17%)
Query: 56 GSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQ 115
GS F A C ++ H A +S FH SN S L + S I+ I +Q
Sbjct: 453 GSYSPFYAETC---GKAGSHCPAPISKFHK----DSNWS------LPKKDSIITFAISKQ 499
Query: 116 GESCDSVCKSS-GQSCVLNKLILLNQCEIIQKYM--SCKRGCLASMGADQPAEVADDAPS 172
G+SC S CK C +N C+ ++K++ C C+ S G DQPA
Sbjct: 500 GQSCSSYCKQGFNMECYRFSFEEMNSCKFLRKFVGQECTE-CIPSSGPDQPA-------I 551
Query: 173 NLNPTACLYT---RIQSMLSCDGSHRHTRRLCPC 203
+++ CL+ R + + +C+ H +T+R+CPC
Sbjct: 552 DVDSKQCLFNGGGRGKFISTCEAYHVNTKRICPC 585
>gi|347964633|ref|XP_316825.5| AGAP000855-PA [Anopheles gambiae str. PEST]
gi|333469436|gb|EAA12112.5| AGAP000855-PA [Anopheles gambiae str. PEST]
Length = 243
Score = 56.6 bits (135), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 50/90 (55%), Gaps = 7/90 (7%)
Query: 15 LLSQCCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNS 70
L ++CC YEYCV+CC+NP R + E+VL A Y V F+ C +CR NS
Sbjct: 144 LPNRCCAVYEYCVACCLNPDKRPILEEVL--AKANGRQVALYAEVTDQFELCLTKCRTNS 201
Query: 71 ESVVHENAYLS-DFHHCFSMPSNASGAGVT 99
+SV +EN Y + + HC+ S A+ A +
Sbjct: 202 QSVQNENKYRNPEQKHCYGEMSEAAWASAS 231
>gi|194894481|ref|XP_001978075.1| GG19394 [Drosophila erecta]
gi|190649724|gb|EDV47002.1| GG19394 [Drosophila erecta]
Length = 217
Score = 56.6 bits (135), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 8/82 (9%)
Query: 13 CNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRH 68
CN + CC YEYCVSCC++P + L E+VL +A T + SV F+ C +CR
Sbjct: 121 CNTSTHCCAVYEYCVSCCLHPGQQPLLERVL---LAPNTPKYIFASVADHFELCLVKCRT 177
Query: 69 NSESVVHENAYLSD-FHHCFSM 89
NS SV HEN Y HC+ +
Sbjct: 178 NSHSVEHENKYRDPAAKHCYGL 199
>gi|405975633|gb|EKC40187.1| UPF0454 protein C12orf49-like protein [Crassostrea gigas]
Length = 185
Score = 56.2 bits (134), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 42/72 (58%), Gaps = 3/72 (4%)
Query: 19 CCNSYEYCVSCCVNP-ARTLKEQVL-NVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC YEYCV+CC+ P R L +++L N + G+ F+ C +CR +S SV HE
Sbjct: 102 CCVVYEYCVACCLKPDKRPLLQKILTNAREGLDIALGSVSDHFELCLAKCRTSSSSVQHE 161
Query: 77 NAYLSDF-HHCF 87
N+Y + F HCF
Sbjct: 162 NSYRNAFAKHCF 173
>gi|391333374|ref|XP_003741090.1| PREDICTED: UPF0454 protein C12orf49 homolog [Metaseiulus
occidentalis]
Length = 193
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/74 (41%), Positives = 42/74 (56%), Gaps = 5/74 (6%)
Query: 19 CCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVV 74
CC+SYE+C+SCC+ P R E+VL P + Y V FD C +CR +S+SV
Sbjct: 116 CCSSYEFCISCCLRPQQRPTLERVLGRLGRSPKSLKLYALVQTQFDLCLTKCRTSSQSVR 175
Query: 75 HENAYLSDF-HHCF 87
+EN Y+ HCF
Sbjct: 176 NENTYIDPVKKHCF 189
>gi|328869989|gb|EGG18364.1| hypothetical protein DFA_03858 [Dictyostelium fasciculatum]
Length = 1057
Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats.
Identities = 27/77 (35%), Positives = 38/77 (49%), Gaps = 1/77 (1%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
GC+ +CC Y YCVSCC+ + E VL + + G FD C CR +S
Sbjct: 968 GCSSKYRCCEEYTYCVSCCLGARQETLEAVLVQMVEGVSPIGEPHDQFDLCQLVCRTSSH 1027
Query: 72 SVVHENAYL-SDFHHCF 87
SV+H+ Y D +C+
Sbjct: 1028 SVIHQREYKHPDTKYCY 1044
>gi|357603409|gb|EHJ63752.1| hypothetical protein KGM_18059 [Danaus plexippus]
Length = 198
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 17 SQCCNSYEYCVSCCVNPA-RTLKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVV 74
+ CC YEYCVSCC++P+ R+L E VL+ + + + ++ C +CR +S SV+
Sbjct: 115 NNCCVIYEYCVSCCLDPSKRSLLELVLSKLSAEEKVLFRSISDDYELCLTKCRTSSHSVL 174
Query: 75 HENAYLSDFH-HCFSMPSNA 93
HEN+Y H HCF +N+
Sbjct: 175 HENSYKDPSHKHCFGDRTNS 194
>gi|242012077|ref|XP_002426767.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212510949|gb|EEB14029.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 203
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 40/64 (62%), Gaps = 6/64 (9%)
Query: 19 CCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVH 75
CC+ YEYC+SCC++P K+ VL + K + + SV F+ C +CR NS+SV H
Sbjct: 124 CCSIYEYCISCCLHPD---KKGVLQNVLGKESENVLFASVSDHFELCLAKCRTNSQSVHH 180
Query: 76 ENAY 79
EN+Y
Sbjct: 181 ENSY 184
>gi|124486989|ref|NP_001074705.1| UPF0454 protein C12orf49 homolog precursor [Mus musculus]
gi|81897450|sp|Q8BTG6.1|CL049_MOUSE RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
gi|26355654|dbj|BAC41192.1| unnamed protein product [Mus musculus]
gi|74355387|gb|AAI04374.1| RIKEN cDNA 2410131K14 gene [Mus musculus]
gi|74355856|gb|AAI04373.1| RIKEN cDNA 2410131K14 gene [Mus musculus]
gi|148687847|gb|EDL19794.1| mCG15258 [Mus musculus]
Length = 205
Score = 52.8 bits (125), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 41/70 (58%), Gaps = 3/70 (4%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
GC L + CC +YEYCVSCC+ P++ L E+ LN +A F+ C +CR +
Sbjct: 118 GC-LANGCCEAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 176
Query: 70 SESVVHENAY 79
S+SV HEN Y
Sbjct: 177 SQSVQHENTY 186
>gi|115496644|ref|NP_001069491.1| UPF0454 protein C12orf49 homolog precursor [Bos taurus]
gi|122133614|sp|Q17QN8.1|CL049_BOVIN RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
gi|109659176|gb|AAI18255.1| Chromosome 12 open reading frame 49 ortholog [Bos taurus]
gi|296478528|tpg|DAA20643.1| TPA: hypothetical protein LOC534423 precursor [Bos taurus]
Length = 205
Score = 52.8 bits (125), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 41/70 (58%), Gaps = 3/70 (4%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
GC L S CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +
Sbjct: 118 GC-LSSGCCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 176
Query: 70 SESVVHENAY 79
S+SV HEN Y
Sbjct: 177 SQSVQHENTY 186
>gi|157818285|ref|NP_001102536.1| uncharacterized protein LOC498188 [Rattus norvegicus]
gi|149063483|gb|EDM13806.1| similar to hypothetical protein FLJ21415 (predicted) [Rattus
norvegicus]
Length = 205
Score = 52.8 bits (125), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 41/70 (58%), Gaps = 3/70 (4%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
GC L + CC +YEYCVSCC+ P++ L E+ LN +A F+ C +CR +
Sbjct: 118 GC-LANGCCEAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 176
Query: 70 SESVVHENAY 79
S+SV HEN Y
Sbjct: 177 SQSVQHENTY 186
>gi|440912218|gb|ELR61808.1| hypothetical protein M91_20260, partial [Bos grunniens mutus]
Length = 201
Score = 52.8 bits (125), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 41/70 (58%), Gaps = 3/70 (4%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
GC L S CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +
Sbjct: 114 GC-LSSGCCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 172
Query: 70 SESVVHENAY 79
S+SV HEN Y
Sbjct: 173 SQSVQHENTY 182
>gi|351694757|gb|EHA97675.1| hypothetical protein GW7_10171 [Heterocephalus glaber]
Length = 273
Score = 52.8 bits (125), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 41/70 (58%), Gaps = 3/70 (4%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
GC L + CC +YEYCVSCC+ P++ L E+ LN +A F+ C +CR +
Sbjct: 186 GC-LANGCCGAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFLAVEDHFELCLAKCRTS 244
Query: 70 SESVVHENAY 79
S+SV HEN Y
Sbjct: 245 SQSVQHENTY 254
>gi|348584144|ref|XP_003477832.1| PREDICTED: UPF0454 protein C12orf49 homolog [Cavia porcellus]
Length = 205
Score = 52.8 bits (125), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 42/70 (60%), Gaps = 3/70 (4%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
GC L + CC++YEYCVSCC+ P++ L E+ LN +A F+ C +CR +
Sbjct: 118 GC-LANGCCSAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFLAVEDHFELCLAKCRTS 176
Query: 70 SESVVHENAY 79
S+SV HEN Y
Sbjct: 177 SQSVQHENTY 186
>gi|311270685|ref|XP_003132942.1| PREDICTED: UPF0454 protein C12orf49 homolog [Sus scrofa]
Length = 205
Score = 52.8 bits (125), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 38/63 (60%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYCVSCC+ P++ L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183
Query: 77 NAY 79
N Y
Sbjct: 184 NTY 186
>gi|426247320|ref|XP_004017434.1| PREDICTED: UPF0454 protein C12orf49 homolog [Ovis aries]
Length = 205
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 40/70 (57%), Gaps = 3/70 (4%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
GC L S CC +YEYCVSCC+ P + L E+ LN +A F+ C +CR +
Sbjct: 118 GC-LSSGCCGAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 176
Query: 70 SESVVHENAY 79
S+SV HEN Y
Sbjct: 177 SQSVQHENTY 186
>gi|307207910|gb|EFN85471.1| UPF0454 protein C12orf49-like protein [Harpegnathos saltator]
Length = 276
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 46/85 (54%), Gaps = 10/85 (11%)
Query: 19 CCNSYEYCVSCCVNPARTLK--EQVL--NVKIAKPTTAGTYG----SVFDFCAGRCRHNS 70
CC YEYCVSCC++P + +K + V+ +VK K A F C CR +S
Sbjct: 125 CCTIYEYCVSCCLHPGKQIKGRKDVVSGSVKAQKEEDAVRTRIRNLDRFQICLAACRTSS 184
Query: 71 ESVVHENAYLSDF-HHCFSM-PSNA 93
SV HEN Y F HC++M PSN+
Sbjct: 185 ASVRHENTYKDPFAKHCYAMQPSNS 209
>gi|148224108|ref|NP_001085845.1| UPF0454 protein C12orf49 homolog precursor [Xenopus laevis]
gi|82184289|sp|Q6GNT2.1|CL049_XENLA RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
gi|49115513|gb|AAH73420.1| MGC80896 protein [Xenopus laevis]
Length = 205
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 3/72 (4%)
Query: 19 CCNSYEYCVSCCVNPARTLK-EQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC+ YE+CVSCC+ P + L+ E+ LN +A F+ C +CR +S+SV HE
Sbjct: 124 CCSVYEFCVSCCLQPNKQLQLERFLNKAAVAFQNLFQAVEDHFELCLAKCRTSSQSVQHE 183
Query: 77 NAYLSDF-HHCF 87
N Y + HC+
Sbjct: 184 NTYRNPIAKHCY 195
>gi|301770727|ref|XP_002920782.1| PREDICTED: UPF0454 protein C12orf49 homolog [Ailuropoda
melanoleuca]
gi|281343735|gb|EFB19319.1| hypothetical protein PANDA_009565 [Ailuropoda melanoleuca]
Length = 205
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC +YEYCVSCC+ P++ L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 124 CCGAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183
Query: 77 NAY 79
N Y
Sbjct: 184 NTY 186
>gi|57099849|ref|XP_540555.1| PREDICTED: UPF0454 protein C12orf49 homolog [Canis lupus
familiaris]
gi|355735600|gb|AES11718.1| hypothetical protein [Mustela putorius furo]
Length = 205
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC +YEYCVSCC+ P++ L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 124 CCGAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183
Query: 77 NAY 79
N Y
Sbjct: 184 NTY 186
>gi|410976770|ref|XP_003994786.1| PREDICTED: UPF0454 protein C12orf49 homolog [Felis catus]
Length = 171
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 42/70 (60%), Gaps = 3/70 (4%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
GC L + CC++YEYCVSCC+ P++ L E+ LN +A F+ C +CR +
Sbjct: 84 GC-LPNGCCSAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 142
Query: 70 SESVVHENAY 79
S+SV HEN Y
Sbjct: 143 SQSVQHENTY 152
>gi|194214270|ref|XP_001489615.2| PREDICTED: UPF0454 protein C12orf49 homolog, partial [Equus
caballus]
Length = 178
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 46/80 (57%), Gaps = 3/80 (3%)
Query: 2 LPRERREILLGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVF 59
+P R+ GC L S CC++YE+CVSCC+ P++ L E+ LN +A F
Sbjct: 81 VPSTRQYSCDGC-LSSGCCSAYEHCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHF 139
Query: 60 DFCAGRCRHNSESVVHENAY 79
+ C +CR +S+SV HEN Y
Sbjct: 140 ELCLAKCRTSSQSVQHENTY 159
>gi|334327393|ref|XP_003340895.1| PREDICTED: UPF0454 protein C12orf49 homolog [Monodelphis domestica]
Length = 174
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 7/78 (8%)
Query: 6 RREILLGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDF 61
RR GC L + CC +YEYCVSCC+ P++ L E+ LN A + +V F+
Sbjct: 81 RRFWCDGC-LANGCCGAYEYCVSCCLQPSKQLLLERFLNQAAA--AFQNLFMAVEDHFEL 137
Query: 62 CAGRCRHNSESVVHENAY 79
C +CR +S+SV HEN Y
Sbjct: 138 CLAKCRTSSQSVQHENTY 155
>gi|289742927|gb|ADD20211.1| uncharacterized conserved protein [Glossina morsitans morsitans]
Length = 217
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/69 (44%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 13 CNLLSQCCNSYEYCVSCCVNP-ARTLKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNS 70
CN S CC+ YE+CVSCC++P R L E L V + T F+ C +CR NS
Sbjct: 123 CNNSSHCCSIYEFCVSCCLHPEKRPLLELALQAVNGRQLTVFAQVLDQFELCLVKCRTNS 182
Query: 71 ESVVHENAY 79
SV HEN Y
Sbjct: 183 HSVEHENRY 191
>gi|157167582|ref|XP_001655053.1| hypothetical protein AaeL_AAEL010931 [Aedes aegypti]
gi|157167584|ref|XP_001655054.1| hypothetical protein AaeL_AAEL010931 [Aedes aegypti]
gi|157167586|ref|XP_001655055.1| hypothetical protein AaeL_AAEL010931 [Aedes aegypti]
gi|94469020|gb|ABF18359.1| uncharacterized conserved protein [Aedes aegypti]
gi|108872807|gb|EAT37032.1| AAEL010931-PA [Aedes aegypti]
gi|108872808|gb|EAT37033.1| AAEL010931-PB [Aedes aegypti]
gi|108872809|gb|EAT37034.1| AAEL010931-PC [Aedes aegypti]
Length = 228
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 17 SQCCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNSESVV 74
S CC YEYCVSCC++P R L E+VL + T ++ C +CR NS SV
Sbjct: 122 SNCCAIYEYCVSCCMHPNKRPLLERVLAKATGRQIAVYATVNDQYELCLTKCRTNSHSVQ 181
Query: 75 HENAYLS-DFHHCF 87
+EN Y HC+
Sbjct: 182 NENKYRDPKLKHCY 195
>gi|62897527|dbj|BAD96703.1| hypothetical protein FLJ21415 variant [Homo sapiens]
Length = 205
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPARTLK-EQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPNKQLPLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183
Query: 77 NAY 79
N Y
Sbjct: 184 NTY 186
>gi|403281581|ref|XP_003932260.1| PREDICTED: UPF0454 protein C12orf49 homolog [Saimiri boliviensis
boliviensis]
Length = 205
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183
Query: 77 NAY 79
N Y
Sbjct: 184 NTY 186
>gi|109098867|ref|XP_001112213.1| PREDICTED: UPF0454 protein C12orf49-like isoform 2 [Macaca mulatta]
Length = 272
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183
Query: 77 NAY 79
N Y
Sbjct: 184 NTY 186
>gi|62858887|ref|NP_001016016.1| uncharacterized protein LOC548770 [Xenopus (Silurana) tropicalis]
gi|183985985|gb|AAI66293.1| hypothetical protein LOC548770 [Xenopus (Silurana) tropicalis]
gi|213627063|gb|AAI70669.1| hypothetical protein LOC548770 [Xenopus (Silurana) tropicalis]
gi|213627065|gb|AAI70671.1| hypothetical protein LOC548770 [Xenopus (Silurana) tropicalis]
Length = 204
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 36/63 (57%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC+ YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 123 CCSVYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFQAVEDHFELCLAKCRTSSQSVQHE 182
Query: 77 NAY 79
N Y
Sbjct: 183 NTY 185
>gi|296213028|ref|XP_002753098.1| PREDICTED: UPF0454 protein C12orf49-like isoform 1 [Callithrix
jacchus]
Length = 205
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183
Query: 77 NAY 79
N Y
Sbjct: 184 NTY 186
>gi|440791825|gb|ELR13063.1| hypothetical protein ACA1_097480 [Acanthamoeba castellanii str.
Neff]
Length = 264
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 49/83 (59%), Gaps = 4/83 (4%)
Query: 13 CNLLSQCCNSYEYCVSCCVNPA---RTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHN 69
C++ + CC YE+CVSCC+NP+ R LN++ ++ S+F+ C CR +
Sbjct: 182 CDIKTTCCLEYEFCVSCCLNPSQEQRVASWLQLNIEDSREREEMAPLSLFEQCLALCRTS 241
Query: 70 SESVVHENAYLSDFHHCFS-MPS 91
S S++ ++ ++S H C++ +PS
Sbjct: 242 SASILRQHIFISPSHFCYAGVPS 264
>gi|444723218|gb|ELW63879.1| hypothetical protein TREES_T100018686 [Tupaia chinensis]
Length = 205
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 41/70 (58%), Gaps = 3/70 (4%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
GC L S CC +YE+CVSCC+ P++ L E+ LN +A F+ C +CR +
Sbjct: 118 GC-LPSGCCGAYEHCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 176
Query: 70 SESVVHENAY 79
S+SV HEN Y
Sbjct: 177 SQSVQHENTY 186
>gi|13376060|ref|NP_079014.1| UPF0454 protein C12orf49 precursor [Homo sapiens]
gi|109098869|ref|XP_001112172.1| PREDICTED: UPF0454 protein C12orf49-like isoform 1 [Macaca mulatta]
gi|297693075|ref|XP_002823850.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Pongo
abelii]
gi|402887789|ref|XP_003907263.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Papio
anubis]
gi|74733622|sp|Q9H741.1|CL049_HUMAN RecName: Full=UPF0454 protein C12orf49; Flags: Precursor
gi|10437508|dbj|BAB15058.1| unnamed protein product [Homo sapiens]
gi|18044293|gb|AAH19843.1| Chromosome 12 open reading frame 49 [Homo sapiens]
gi|119618494|gb|EAW98088.1| chromosome 12 open reading frame 49, isoform CRA_a [Homo sapiens]
gi|119618495|gb|EAW98089.1| chromosome 12 open reading frame 49, isoform CRA_a [Homo sapiens]
gi|380816602|gb|AFE80175.1| hypothetical protein LOC79794 precursor [Macaca mulatta]
gi|383413575|gb|AFH30001.1| hypothetical protein LOC79794 precursor [Macaca mulatta]
gi|384949484|gb|AFI38347.1| hypothetical protein LOC79794 precursor [Macaca mulatta]
Length = 205
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183
Query: 77 NAY 79
N Y
Sbjct: 184 NTY 186
>gi|56119050|ref|NP_001007842.1| UPF0454 protein C12orf49 homolog precursor [Gallus gallus]
gi|82081584|sp|Q5ZJE4.1|CL049_CHICK RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
gi|53133640|emb|CAG32149.1| hypothetical protein RCJMB04_18o22 [Gallus gallus]
Length = 205
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC+ YEYCVSCC+ P++ L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 124 CCSVYEYCVSCCLQPSKQHLLERFLNRAAMAFQNLFMAVEDRFELCLAKCRTSSQSVQHE 183
Query: 77 NAY 79
N Y
Sbjct: 184 NTY 186
>gi|432872487|ref|XP_004072113.1| PREDICTED: UPF0454 protein C12orf49 homolog [Oryzias latipes]
Length = 201
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 45/81 (55%), Gaps = 7/81 (8%)
Query: 3 PRERREILLGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV--- 58
P R+ I C L + CC+ YEYCVSCC+ P + L E+ LN A + +V
Sbjct: 105 PNSRQHICKSC-LANGCCSIYEYCVSCCLQPDKQLLLERFLNR--AAEGFQNLFTAVEDH 161
Query: 59 FDFCAGRCRHNSESVVHENAY 79
F+ C +CR +S+SV HEN Y
Sbjct: 162 FELCLAKCRTSSQSVQHENTY 182
>gi|114647164|ref|XP_001154941.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Pan
troglodytes]
gi|410209358|gb|JAA01898.1| chromosome 12 open reading frame 49 [Pan troglodytes]
gi|410255302|gb|JAA15618.1| chromosome 12 open reading frame 49 [Pan troglodytes]
gi|410289894|gb|JAA23547.1| chromosome 12 open reading frame 49 [Pan troglodytes]
gi|410335531|gb|JAA36712.1| chromosome 12 open reading frame 49 [Pan troglodytes]
Length = 205
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183
Query: 77 NAY 79
N Y
Sbjct: 184 NTY 186
>gi|224072158|ref|XP_002200066.1| PREDICTED: UPF0454 protein C12orf49 homolog [Taeniopygia guttata]
Length = 205
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC+ YE+CVSCC+ P++ L E+ LN IA F+ C +CR +S+SV HE
Sbjct: 124 CCSIYEFCVSCCLQPSKQHLLERFLNRAAIAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183
Query: 77 NAY 79
N Y
Sbjct: 184 NTY 186
>gi|449685232|ref|XP_004210848.1| PREDICTED: UPF0454 protein C12orf49 homolog [Hydra magnipapillata]
Length = 191
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 43/80 (53%), Gaps = 5/80 (6%)
Query: 17 SQCCNSYEYCVSCCVNPA---RTLKEQVL-NVKIAKPTTAGTYGSVFDFCAGRCRHNSES 72
++CC YE+CVSCC+ P R L E++L + +A+ F+ C +CR +S S
Sbjct: 108 NRCCEVYEHCVSCCLQPVPRKRRLIEKILVHAHMARNPLLREVKDTFELCLHKCRTSSAS 167
Query: 73 VVHENAYL-SDFHHCFSMPS 91
V EN Y S F +CF S
Sbjct: 168 VYQENKYKDSSFKYCFGTDS 187
>gi|395833970|ref|XP_003789990.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Otolemur
garnettii]
Length = 205
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPNKQLLLERFLNRAAMAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183
Query: 77 NAY 79
N Y
Sbjct: 184 NTY 186
>gi|332250748|ref|XP_003274513.1| PREDICTED: UPF0454 protein C12orf49 homolog [Nomascus leucogenys]
Length = 286
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 205 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 264
Query: 77 NAY 79
N Y
Sbjct: 265 NTY 267
>gi|260782034|ref|XP_002586098.1| hypothetical protein BRAFLDRAFT_252331 [Branchiostoma floridae]
gi|229271187|gb|EEN42109.1| hypothetical protein BRAFLDRAFT_252331 [Branchiostoma floridae]
Length = 200
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART--LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC YE+CVSCC+ P + L+ + +V A T + F+ C +CR +S+SV HE
Sbjct: 118 CCQVYEHCVSCCLQPEKQPLLRSLLDHVPDAFKTLFASVTDHFELCLAKCRTSSQSVQHE 177
Query: 77 NAY 79
N+Y
Sbjct: 178 NSY 180
>gi|397525490|ref|XP_003832699.1| PREDICTED: UPF0454 protein C12orf49 homolog [Pan paniscus]
Length = 247
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 166 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 225
Query: 77 NAY 79
N Y
Sbjct: 226 NTY 228
>gi|281208176|gb|EFA82354.1| cGMP-specific phosphodiesterase [Polysphondylium pallidum PN500]
Length = 283
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 42/78 (53%), Gaps = 2/78 (2%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNS 70
GC+ CCN YE CVSCC+ + + E+VL + K ++ + F+ C CR +S
Sbjct: 200 GCSNKFHCCNEYELCVSCCMGSDKVPILEEVLGRMLDKGISSNYLRNQFELCTMTCRTSS 259
Query: 71 ESVVHENAYLS-DFHHCF 87
S++H+ Y+ D C+
Sbjct: 260 RSLIHQREYIEPDLKFCY 277
>gi|56753599|gb|AAW25002.1| SJCHGC02079 protein [Schistosoma japonicum]
Length = 244
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 16 LSQCCNSYEYCVSCCVNPA-RTLKEQVL-NVKIAKPTTAGTYGSVFDFCAGRCRHNSESV 73
+ CC+ YE+CVSCC+NP + L +QV+ N F+FC CR +S +V
Sbjct: 163 FNHCCSVYEHCVSCCLNPTNKPLWDQVMQNANANSRRHLKLAIDAFEFCVAVCRTSSLTV 222
Query: 74 VHENAYLS-DFHHCFSMPS 91
+HEN Y + D +CF + +
Sbjct: 223 LHENKYRNLDEIYCFGLET 241
>gi|299117113|emb|CBN73884.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 83
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 38/78 (48%), Gaps = 1/78 (1%)
Query: 127 GQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQS 186
G C + LLN C +++ + C C S G +QP V DAP + P CL +
Sbjct: 3 GMICKESSFQLLNSCGLLRDHFPCT-SCTESNGFEQPCYVELDAPEDNLPGQCLISSNPR 61
Query: 187 MLSCDGSHRHTRRLCPCA 204
+C SH TRRLCPC
Sbjct: 62 HSTCSASHEVTRRLCPCT 79
>gi|355564717|gb|EHH21217.1| hypothetical protein EGK_04231, partial [Macaca mulatta]
gi|355786567|gb|EHH66750.1| hypothetical protein EGM_03800, partial [Macaca fascicularis]
Length = 169
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 88 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 147
Query: 77 NAY 79
N Y
Sbjct: 148 NTY 150
>gi|417397061|gb|JAA45564.1| Putative conserved secreted protein precursor [Desmodus rotundus]
Length = 205
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YE+CVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 124 CCSAYEHCVSCCLQPGKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183
Query: 77 NAY 79
N Y
Sbjct: 184 NTY 186
>gi|290976460|ref|XP_002670958.1| predicted protein [Naegleria gruberi]
gi|284084522|gb|EFC38214.1| predicted protein [Naegleria gruberi]
Length = 350
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 61/149 (40%), Gaps = 15/149 (10%)
Query: 58 VFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGE 117
+F F C ++ E + +DF H S+P A L I ++G GE
Sbjct: 211 MFMFFVDACPNSLFYPCDETSAPNDFVHD-SLPCETLPASAIHSLQELQTIKAVLGALGE 269
Query: 118 SCDSVCKSSGQSCVLNKLILLNQCEIIQKYMS---CKRGCLASMGADQPAEVADDAPSNL 174
+CD++C S C N +N+C+ ++K+ S CK+ V D
Sbjct: 270 NCDTICAKSESVCAPNYFPYINRCDEMRKHYSSCECKK-----------EGVIDSRAPYF 318
Query: 175 NPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
+ C+ +S C +H RRLCPC
Sbjct: 319 DGKYCIIGNRRSKFRCGNAHPSERRLCPC 347
>gi|326930051|ref|XP_003211166.1| PREDICTED: UPF0454 protein C12orf49 homolog [Meleagris gallopavo]
Length = 205
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 36/63 (57%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC+ YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 124 CCSVYEYCVSCCLQPNKQHLLERFLNRAAMAFQNLFMAVEDRFELCLAKCRTSSQSVQHE 183
Query: 77 NAY 79
N Y
Sbjct: 184 NTY 186
>gi|348533454|ref|XP_003454220.1| PREDICTED: UPF0454 protein C12orf49 homolog [Oreochromis niloticus]
Length = 200
Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 45/81 (55%), Gaps = 7/81 (8%)
Query: 3 PRERREILLGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV--- 58
P R+ I C L + CC+ YEYCVSCC+ P + L E+ LN A + +V
Sbjct: 104 PSTRQYICKSC-LANGCCSIYEYCVSCCLQPDKQLLLERFLNR--AAEGFQNLFTAVEDH 160
Query: 59 FDFCAGRCRHNSESVVHENAY 79
F+ C +CR +S+SV HEN Y
Sbjct: 161 FELCLAKCRTSSQSVQHENTY 181
>gi|145348106|ref|XP_001418497.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578726|gb|ABO96790.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 198
Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 2/83 (2%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNS 70
GC+ ++CC YE C +CC P ++ + + A G + F++C RCR
Sbjct: 96 GCDESARCCRDYETCAACCQGPTHDVEAAMKRHARGRNQIATGFFEDAFEYCKSRCRTQP 155
Query: 71 ESVVHENAYLSDFHHCF-SMPSN 92
HEN Y D +C+ P N
Sbjct: 156 SVTFHENTYAYDTAYCYGDYPVN 178
>gi|354466980|ref|XP_003495949.1| PREDICTED: UPF0454 protein C12orf49 homolog [Cricetulus griseus]
gi|344237025|gb|EGV93128.1| UPF0454 protein C12orf49-like [Cricetulus griseus]
Length = 205
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 41/70 (58%), Gaps = 3/70 (4%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
GC L + CC +YE+CVSCC+ P++ L E+ LN +A F+ C +CR +
Sbjct: 118 GC-LANGCCEAYEHCVSCCLQPSKQLLLERFLNRAAMAFQNLFMAVEDHFELCLAKCRTS 176
Query: 70 SESVVHENAY 79
S+SV HEN Y
Sbjct: 177 SQSVQHENTY 186
>gi|91087999|ref|XP_973740.1| PREDICTED: similar to CG15643 CG15643-PA [Tribolium castaneum]
gi|270011897|gb|EFA08345.1| hypothetical protein TcasGA2_TC005988 [Tribolium castaneum]
Length = 203
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 41/68 (60%), Gaps = 8/68 (11%)
Query: 17 SQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGT--YGSV---FDFCAGRCRHNSE 71
+ CC YE+C+SCC++P K+++L + K T + SV F+ C +CR NS+
Sbjct: 116 NNCCAIYEFCISCCLHPD---KKEMLENLLGKATEQNNVLFASVTDHFELCLAKCRTNSQ 172
Query: 72 SVVHENAY 79
SV HEN+Y
Sbjct: 173 SVQHENSY 180
>gi|353230151|emb|CCD76322.1| hypothetical protein Smp_168260 [Schistosoma mansoni]
Length = 375
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 17 SQCCNSYEYCVSCCVNPA-RTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVH 75
+ CC+ YE+CVSCC+NP ++L ++V N+ F+FC CR +S +V H
Sbjct: 112 NHCCSIYEHCVSCCLNPINKSLWDKVQNLDANYKRHLQLATDAFEFCIAVCRTSSLTVFH 171
Query: 76 ENAY 79
EN Y
Sbjct: 172 ENRY 175
>gi|256086857|ref|XP_002579602.1| hypothetical protein [Schistosoma mansoni]
Length = 392
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 17 SQCCNSYEYCVSCCVNPA-RTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVH 75
+ CC+ YE+CVSCC+NP ++L ++V N+ F+FC CR +S +V H
Sbjct: 112 NHCCSIYEHCVSCCLNPINKSLWDKVQNLDANYKRHLQLATDAFEFCIAVCRTSSLTVFH 171
Query: 76 ENAY 79
EN Y
Sbjct: 172 ENRY 175
>gi|340369555|ref|XP_003383313.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase B-like
[Amphimedon queenslandica]
Length = 662
Score = 50.1 bits (118), Expect = 5e-04, Method: Composition-based stats.
Identities = 26/97 (26%), Positives = 44/97 (45%), Gaps = 1/97 (1%)
Query: 108 ISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCK-RGCLASMGADQPAEV 166
+ +I + G++C C + C + L+N E + ++ C L AD +
Sbjct: 553 LRTVISKPGQNCRDSCLENKMVCESSFFSLINTHEAVFNHLGCNVTHTLEEKYADYAPAI 612
Query: 167 ADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
A A + P C+ I+ +L C G ++ RRLCPC
Sbjct: 613 AVSADKPIEPGHCVIIDIRRLLGCPGYSKNFRRLCPC 649
>gi|47221493|emb|CAG08155.1| unnamed protein product [Tetraodon nigroviridis]
Length = 201
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 45/81 (55%), Gaps = 7/81 (8%)
Query: 3 PRERREILLGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV--- 58
P R+ I C L + CC+ YEYCVSCC+ P + L E+ LN A + +V
Sbjct: 105 PSSRQYICKSC-LANGCCSIYEYCVSCCLQPDKQPLLERFLNR--AAEGFRNLFTAVEDH 161
Query: 59 FDFCAGRCRHNSESVVHENAY 79
F+ C +CR +S+SV HEN Y
Sbjct: 162 FELCLAKCRTSSQSVQHENTY 182
>gi|390359679|ref|XP_797277.2| PREDICTED: UPF0454 protein C12orf49 homolog [Strongylocentrotus
purpuratus]
Length = 198
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 49/93 (52%), Gaps = 11/93 (11%)
Query: 3 PRERREILLGCNLLSQ--CCNSYEYCVSCCVNPARTLKEQVLNVKIAKP--TTAGTYGSV 58
P + L CN S+ CC YE CVSCC++P KE+VL + K T + SV
Sbjct: 98 PEAKSSSLHNCNGCSKNGCCVMYERCVSCCMHPE---KERVLQDMLGKASHTFRLLFQSV 154
Query: 59 ---FDFCAGRCRHNSESVVHENAYLS-DFHHCF 87
F+ C +CR +S+SV HEN Y D HC+
Sbjct: 155 ADHFELCLTKCRTSSQSVQHENTYRDPDAKHCY 187
>gi|332840508|ref|XP_003314001.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 2 [Pan
troglodytes]
Length = 175
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 94 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 153
Query: 77 NAY 79
N Y
Sbjct: 154 NTY 156
>gi|410923361|ref|XP_003975150.1| PREDICTED: UPF0454 protein C12orf49 homolog [Takifugu rubripes]
Length = 199
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 45/81 (55%), Gaps = 7/81 (8%)
Query: 3 PRERREILLGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV--- 58
P R+ I C L + CC+ YEYCVSCC+ P + L E+ LN A + +V
Sbjct: 103 PSSRQYICKSC-LANGCCSIYEYCVSCCLQPDKQPLLERFLNR--AAEGFRNLFTAVEDH 159
Query: 59 FDFCAGRCRHNSESVVHENAY 79
F+ C +CR +S+SV HEN Y
Sbjct: 160 FELCLAKCRTSSQSVQHENTY 180
>gi|402887791|ref|XP_003907264.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 2 [Papio
anubis]
Length = 175
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 94 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 153
Query: 77 NAY 79
N Y
Sbjct: 154 NTY 156
>gi|344295368|ref|XP_003419384.1| PREDICTED: UPF0454 protein C12orf49 homolog [Loxodonta africana]
Length = 174
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 42/70 (60%), Gaps = 3/70 (4%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
GC L + CC++YEYCVSCC+ P++ L E+ L+ +A F+ C +CR +
Sbjct: 87 GC-LSNGCCSAYEYCVSCCLQPSKQLLLERFLSRAAVAFQNLFMAVEDHFELCLAKCRTS 145
Query: 70 SESVVHENAY 79
S+SV HEN Y
Sbjct: 146 SQSVQHENTY 155
>gi|395833972|ref|XP_003789991.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 2 [Otolemur
garnettii]
Length = 175
Score = 49.7 bits (117), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYCVSCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 94 CCSAYEYCVSCCLQPNKQLLLERFLNRAAMAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 153
Query: 77 NAY 79
N Y
Sbjct: 154 NTY 156
>gi|66472238|ref|NP_001018573.1| UPF0454 protein C12orf49 homolog precursor [Danio rerio]
gi|82192840|sp|Q503V3.1|CL049_DANRE RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
gi|63100598|gb|AAH95168.1| Zgc:110063 [Danio rerio]
Length = 199
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 38/65 (58%), Gaps = 6/65 (9%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVV 74
CC+ YEYCVSCC+ P + L E+ LN A + +V F+ C +CR +S+SV
Sbjct: 118 CCSIYEYCVSCCLQPDKQPLLERFLNR--AAEGFQNLFTAVEDHFELCLAKCRTSSQSVQ 175
Query: 75 HENAY 79
HEN Y
Sbjct: 176 HENTY 180
>gi|427782911|gb|JAA56907.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 210
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 45/75 (60%), Gaps = 7/75 (9%)
Query: 19 CCNSYEYCVSCCVNPAR-TLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVV 74
CC+ YEYC+SCC++P + T+ +++L A T S+ F+ C +CR +S+SV
Sbjct: 121 CCSIYEYCISCCMHPQKITILQKILGK--ASDTFKMLVASITDHFELCLTKCRTSSDSVH 178
Query: 75 HENAYLS-DFHHCFS 88
HEN+Y HC++
Sbjct: 179 HENSYRDPKAKHCYA 193
>gi|241792024|ref|XP_002400643.1| conserved hypothetical protein [Ixodes scapularis]
gi|215510808|gb|EEC20261.1| conserved hypothetical protein [Ixodes scapularis]
Length = 209
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 41/74 (55%), Gaps = 5/74 (6%)
Query: 19 CCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVH 75
CC+ YEYC+SCC+ P + Q + K A T S+ F+ C +CR +SESV H
Sbjct: 120 CCSIYEYCISCCMQPQKIALLQKIMGK-ASETFKMLVASITDHFELCLTKCRTSSESVHH 178
Query: 76 ENAYLS-DFHHCFS 88
EN+Y HC++
Sbjct: 179 ENSYRDPSAKHCYA 192
>gi|432116785|gb|ELK37410.1| hypothetical protein MDA_GLEAN10014215 [Myotis davidii]
Length = 149
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 41/70 (58%), Gaps = 3/70 (4%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
GC L + CC++YE+CVSCC+ P + L E+ LN +A F+ C +CR +
Sbjct: 62 GC-LTNGCCSAYEHCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 120
Query: 70 SESVVHENAY 79
S+SV HEN Y
Sbjct: 121 SQSVQHENTY 130
>gi|345483911|ref|XP_001602536.2| PREDICTED: UPF0454 protein C12orf49 homolog [Nasonia vitripennis]
Length = 329
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 42/89 (47%), Gaps = 15/89 (16%)
Query: 19 CCNSYEYCVSCCVNPARTLKEQVLNV-KIAKPTTAGTYG-----------SVFDFCAGRC 66
CC YEYCVSCC++P + K + + +I K G F C C
Sbjct: 174 CCTIYEYCVSCCLDPNKVKKRKSFTIDQIRKENLKLRRGEDILKLRLRSLDRFQLCLAAC 233
Query: 67 RHNSESVVHENAYLSD-FHHCF--SMPSN 92
R +S SV HEN Y + + HC+ +PSN
Sbjct: 234 RTSSASVQHENTYKNPLYKHCYILQLPSN 262
>gi|346472337|gb|AEO36013.1| hypothetical protein [Amblyomma maculatum]
Length = 206
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 44/75 (58%), Gaps = 7/75 (9%)
Query: 19 CCNSYEYCVSCCVNPAR-TLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVV 74
CC+ YEYC+SCC++P + TL +++L A T S+ F+ C +CR +S SV
Sbjct: 121 CCSIYEYCISCCMHPQKITLLQKILGK--ASDTFKMLVASITDHFELCLTKCRTSSNSVH 178
Query: 75 HENAYLS-DFHHCFS 88
HEN+Y HC++
Sbjct: 179 HENSYRDPKAKHCYA 193
>gi|221042418|dbj|BAH12886.1| unnamed protein product [Homo sapiens]
Length = 175
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 36/63 (57%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC++YEYC SCC+ P + L E+ LN +A F+ C +CR +S+SV HE
Sbjct: 94 CCSAYEYCASCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 153
Query: 77 NAY 79
N Y
Sbjct: 154 NTY 156
>gi|426221180|ref|XP_004004788.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Ovis aries]
Length = 739
Score = 48.1 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 33/104 (31%), Positives = 51/104 (49%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V G+SC VC+ + C + LN+ + + KY ++C Q
Sbjct: 633 LSALQVKFAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYEVNC-----------QS 681
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D APS +P C++ + SC G+H H RR+CPC
Sbjct: 682 SELAKDIVAPS-FDPKNKHCVFQGDLLLFSCAGAHAHHRRICPC 724
>gi|225711380|gb|ACO11536.1| UPF0454 protein C12orf49 homolog precursor [Caligus rogercresseyi]
Length = 210
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 45/83 (54%), Gaps = 4/83 (4%)
Query: 13 CNLLSQCCNSYEYCVSCCVNP-ARTLKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNS 70
CNL CC++YE CVSCC++P R VLN + + ++ C +CR +S
Sbjct: 122 CNLDMGCCSAYEACVSCCMSPDKRAFLMSVLNEARSLSNILILSVSDQYELCLAKCRTSS 181
Query: 71 ESVVHENAYLS-DFHHCF-SMPS 91
SV HEN Y + ++ C+ +PS
Sbjct: 182 RSVHHENTYKNPEYKFCYKKIPS 204
>gi|321477285|gb|EFX88244.1| hypothetical protein DAPPUDRAFT_305537 [Daphnia pulex]
Length = 222
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 40/68 (58%), Gaps = 12/68 (17%)
Query: 19 CCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV-------FDFCAGRCRHNSE 71
CC+ YEYC+SCC+ P K++VL + K AGT + F+ C +CR +S+
Sbjct: 128 CCSIYEYCISCCLQPD---KKEVLQDVLGK--AAGTLNLLLTSVTDHFELCLTKCRTSSQ 182
Query: 72 SVVHENAY 79
SV HEN+Y
Sbjct: 183 SVQHENSY 190
>gi|449265786|gb|EMC76924.1| UPF0454 protein C12orf49 like protein, partial [Columba livia]
Length = 187
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 36/63 (57%), Gaps = 2/63 (3%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC+ YE CVSCC+ P++ L E+ LN IA F+ C +CR +S+SV HE
Sbjct: 106 CCSVYECCVSCCLQPSKQHLLERFLNRAAIAFQNLFMAVEDRFELCLAKCRTSSQSVQHE 165
Query: 77 NAY 79
N Y
Sbjct: 166 NTY 168
>gi|358339009|dbj|GAA47150.1| UPF0454 protein C12orf49 homolog, partial [Clonorchis sinensis]
Length = 186
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 48/95 (50%), Gaps = 5/95 (5%)
Query: 6 RREILLGCNLLSQ-CCNSYEYCVSCCVNPART--LKEQVLNV-KIAKPTTAGTYGSVFDF 61
RR + CN + CC YE+CVSCC+N +E +L ++ +F +
Sbjct: 91 RRFVCDSCNQTRENCCQIYEHCVSCCMNRQHIHLWREALLQASELPAQRHLLLADGLFQY 150
Query: 62 CAGRCRHNSESVVHENAYLSDFH-HCFSMPSNASG 95
C +CR +S+SV+HEN Y H +CF + + G
Sbjct: 151 CQAKCRTSSQSVLHENTYRDAEHRYCFGLYLTSPG 185
>gi|198423098|ref|XP_002130011.1| PREDICTED: similar to Chromosome 12 open reading frame 49 ortholog
[Ciona intestinalis]
Length = 210
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 41/77 (53%), Gaps = 8/77 (10%)
Query: 17 SQCCNSYEYCVSCCVNPA-RTLKEQVLNVKIAKPTT----AGTYGSVFDFCAGRCRHNSE 71
S CC+ YE+CVSCC+ P + L E+V V+ K T + FD C +CR +S
Sbjct: 126 SNCCSVYEHCVSCCLQPKHKNLLEKV--VRGLKTNTMLIMMSSIHDQFDLCLMKCRTSST 183
Query: 72 SVVHENAYLS-DFHHCF 87
SV HEN Y +CF
Sbjct: 184 SVRHENTYRDPSMKYCF 200
>gi|327276076|ref|XP_003222797.1| PREDICTED: LOW QUALITY PROTEIN: UPF0454 protein C12orf49 homolog
[Anolis carolinensis]
Length = 205
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 38/66 (57%), Gaps = 8/66 (12%)
Query: 19 CCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTA--GTYGSV---FDFCAGRCRHNSESV 73
CC+ YE CVSCC+ P K+QVL + + A + SV F+ C +CR +S+SV
Sbjct: 124 CCSVYEACVSCCLQPN---KQQVLENFLNRAAVAFRNLFLSVEDHFELCLAKCRTSSQSV 180
Query: 74 VHENAY 79
HEN Y
Sbjct: 181 QHENTY 186
>gi|324509188|gb|ADY43866.1| Unknown [Ascaris suum]
Length = 218
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 47/89 (52%), Gaps = 13/89 (14%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAG-------TYGSVFDFCAG 64
GC+L S CC+ YE+CV CC+ P + KE +L + TT+G + F+ C
Sbjct: 130 GCDLHSGCCSFYEHCVGCCMRPDQ--KEILLEM---IETTSGHRLRQILSATDQFELCTL 184
Query: 65 RCRHNSESVVHENAYLSD-FHHCFSMPSN 92
+CR +S SV EN Y S+ +C+ N
Sbjct: 185 KCRTSSNSVHSENKYKSEALKYCYKQEPN 213
>gi|170033812|ref|XP_001844770.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167874847|gb|EDS38230.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 228
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 39/72 (54%), Gaps = 3/72 (4%)
Query: 19 CCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNSESVVHE 76
CC YE+CVSCC++P ++L E+VL + T ++ C +CR NS SV +E
Sbjct: 124 CCAIYEFCVSCCMHPEKKSLLEKVLAKASGRQIAVYATVNDQYELCLTKCRTNSHSVQNE 183
Query: 77 NAYLS-DFHHCF 87
N Y HC+
Sbjct: 184 NKYRDPKSKHCY 195
>gi|324520392|gb|ADY47626.1| Unknown [Ascaris suum]
Length = 180
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 47/89 (52%), Gaps = 13/89 (14%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAG-------TYGSVFDFCAG 64
GC+L S CC+ YE+CV CC+ P + KE +L + TT+G + F+ C
Sbjct: 92 GCDLHSGCCSFYEHCVGCCMRPDQ--KEILLEM---IETTSGHRLRQILSATDQFELCTL 146
Query: 65 RCRHNSESVVHENAYLSD-FHHCFSMPSN 92
+CR +S SV EN Y S+ +C+ N
Sbjct: 147 KCRTSSNSVHSENKYKSEALKYCYKQEPN 175
>gi|294955121|ref|XP_002788417.1| alpha-1,3-mannosyl-glycoprotein beta-1,
2-n-acetylglucosaminyltransferase, putative [Perkinsus
marinus ATCC 50983]
gi|239903817|gb|EER20213.1| alpha-1,3-mannosyl-glycoprotein beta-1,
2-n-acetylglucosaminyltransferase, putative [Perkinsus
marinus ATCC 50983]
Length = 1171
Score = 46.6 bits (109), Expect = 0.007, Method: Composition-based stats.
Identities = 28/89 (31%), Positives = 38/89 (42%), Gaps = 3/89 (3%)
Query: 115 QGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNL 174
+G SC C + C + L N C I+ + C+ GC +G + PA V D +
Sbjct: 513 RGVSCTHTCIHNNARCYDHLLEFANDCHIMLSHFPCEEGCGHQLGLELPAYVMDSTLHTV 572
Query: 175 NPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
CL T S SC H T R+C C
Sbjct: 573 G--QCLVTDHGSP-SCATYHISTARMCTC 598
>gi|330794772|ref|XP_003285451.1| hypothetical protein DICPUDRAFT_149321 [Dictyostelium purpureum]
gi|325084626|gb|EGC38050.1| hypothetical protein DICPUDRAFT_149321 [Dictyostelium purpureum]
Length = 322
Score = 45.4 bits (106), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 40/73 (54%), Gaps = 2/73 (2%)
Query: 17 SQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTY-GSVFDFCAGRCRHNSESVVH 75
S CC++YEYC+SCC+ + Q + + A+ A Y + FD C CR +S S+++
Sbjct: 243 SDCCDTYEYCISCCMRYDKVPVVQEMLIGFAERGVAKEYLKNQFDLCTMACRTSSSSLLN 302
Query: 76 ENAYLS-DFHHCF 87
+ Y + F C+
Sbjct: 303 QREYQNPKFKFCY 315
>gi|402580999|gb|EJW74948.1| hypothetical protein WUBG_14143 [Wuchereria bancrofti]
Length = 183
Score = 45.1 bits (105), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 44/85 (51%), Gaps = 3/85 (3%)
Query: 11 LGCNLLSQCCNSYEYCVSCCVNPART--LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRH 68
+ C+L + CC+ YE CVSCC+ P + L + +++ + + F+ C +CR
Sbjct: 94 VWCDLYTGCCSHYEQCVSCCLRPEQKNILLKMIVSTSGHRLRQILSATDQFELCMLKCRT 153
Query: 69 NSESVVHENAYLSD-FHHCFSMPSN 92
+S SV EN Y S+ +C+ N
Sbjct: 154 SSNSVHSENKYKSEKMKYCYKQEPN 178
>gi|383856970|ref|XP_003703979.1| PREDICTED: UPF0454 protein C12orf49 homolog [Megachile rotundata]
Length = 325
Score = 45.1 bits (105), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 42/85 (49%), Gaps = 15/85 (17%)
Query: 19 CCNSYEYCVSCCVNPARTLK--EQVL------NVKIAKPTTAGTYG----SVFDFCAGRC 66
CC YEYCVSCC++P + +K + VL N K K A F C C
Sbjct: 173 CCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKAHKDQDAVKKRLRNLDRFQVCLAAC 232
Query: 67 RHNSESVVHENAYLSDFH--HCFSM 89
R +S SV HEN Y D H HC+++
Sbjct: 233 RTSSASVRHENTY-KDPHSKHCYTL 256
>gi|348670076|gb|EGZ09898.1| hypothetical protein PHYSODRAFT_549244 [Phytophthora sojae]
Length = 367
Score = 44.7 bits (104), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 6/78 (7%)
Query: 18 QCCNSYEYCVSCCVNPARTL-----KEQVLNVKIAKPTTAGTYG-SVFDFCAGRCRHNSE 71
CC+ +E+CVSCC++P ++ + + P + +VF +C RCR +S
Sbjct: 236 HCCDVFEHCVSCCMHPLNVRFTGIRRDYLTHADPNHPVYSDPAEITVFQYCQYRCRTSSA 295
Query: 72 SVVHENAYLSDFHHCFSM 89
SV H+N+Y S C+ +
Sbjct: 296 SVQHQNSYRSHRSFCYGI 313
>gi|326432709|gb|EGD78279.1| hypothetical protein PTSG_09343 [Salpingoeca sp. ATCC 50818]
Length = 198
Score = 44.7 bits (104), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 32/62 (51%), Gaps = 2/62 (3%)
Query: 18 QCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPT--TAGTYGSVFDFCAGRCRHNSESVVH 75
CC YE+CVSCC+ P + + + +P F++C +CR +S+SV H
Sbjct: 128 HCCRVYEHCVSCCLAPKHVPLRRFVFHNLTEPQRRVLSLVSDQFEYCEAKCRTSSQSVQH 187
Query: 76 EN 77
EN
Sbjct: 188 EN 189
>gi|440902995|gb|ELR53712.1| Alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Bos grunniens
mutus]
Length = 739
Score = 44.7 bits (104), Expect = 0.022, Method: Composition-based stats.
Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V G+SC VC+ + C + LN+ + + KY ++C Q
Sbjct: 633 LSALQVKFAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYDVNC-----------QS 681
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D APS +P C++ + SC G+H RR+CPC
Sbjct: 682 SELAKDIVAPS-FDPKNKHCVFQGDLLLFSCAGAHARHRRICPC 724
>gi|300797805|ref|NP_001178271.1| alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Bos taurus]
gi|296490527|tpg|DAA32640.1| TPA: mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [Bos taurus]
Length = 739
Score = 44.7 bits (104), Expect = 0.022, Method: Composition-based stats.
Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V G+SC VC+ + C + LN+ + + KY ++C Q
Sbjct: 633 LSALQVKFAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYDVNC-----------QS 681
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D APS +P C++ + SC G+H RR+CPC
Sbjct: 682 SELAKDIVAPS-FDPKNKHCVFQGDLLLFSCAGAHARHRRICPC 724
>gi|403280116|ref|XP_003931580.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Saimiri
boliviensis boliviensis]
Length = 741
Score = 44.7 bits (104), Expect = 0.023, Method: Composition-based stats.
Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ S C + LN+ + + KY ++C Q
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYEVTC-----------QS 683
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D PS +P C++ + SC G+H RR+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHRRVCPC 726
>gi|296204935|ref|XP_002749539.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Callithrix
jacchus]
Length = 741
Score = 44.7 bits (104), Expect = 0.023, Method: Composition-based stats.
Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ S C + LN+ + + KY ++C Q
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYEVTC-----------QS 683
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D PS +P C++ + SC G+H RR+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHRRVCPC 726
>gi|312070660|ref|XP_003138249.1| hypothetical protein LOAG_02664 [Loa loa]
gi|307766590|gb|EFO25824.1| hypothetical protein LOAG_02664 [Loa loa]
Length = 224
Score = 44.3 bits (103), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 3/84 (3%)
Query: 12 GCNLLSQCCNSYEYCVSCCVNPAR--TLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHN 69
C+L + CC+ YE CVSCC+ P + L E + + + + F+ C +CR +
Sbjct: 136 WCDLYTGCCSYYEQCVSCCLRPEQKNILLEMIESTSGHRLRQILSATDQFELCMLKCRTS 195
Query: 70 SESVVHENAYLSD-FHHCFSMPSN 92
S SV EN Y S+ +C+ N
Sbjct: 196 SNSVHSENKYKSEKMKYCYKQEPN 219
>gi|301113522|ref|XP_002998531.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262111832|gb|EEY69884.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 363
Score = 44.3 bits (103), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 6/78 (7%)
Query: 18 QCCNSYEYCVSCCVNPARTL-----KEQVLNVKIAKPTTAGTYG-SVFDFCAGRCRHNSE 71
CC+ +E+CVSCC++P ++ + + P + +VF +C RCR +S
Sbjct: 236 HCCDVFEHCVSCCMHPLNVRSSGIRRDFLTHADPNHPVYSDPAEITVFQYCQYRCRTSSA 295
Query: 72 SVVHENAYLSDFHHCFSM 89
SV H+N+Y S C+ +
Sbjct: 296 SVQHQNSYRSHRSFCYGI 313
>gi|443709414|gb|ELU04086.1| hypothetical protein CAPTEDRAFT_93600 [Capitella teleta]
Length = 211
Score = 44.3 bits (103), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 38/65 (58%), Gaps = 6/65 (9%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVV 74
CC YE+CVSCC+ P + L +L+ A + + SV F+ C +CR +S+SV
Sbjct: 126 CCEVYEHCVSCCLLPEKQPLLRNILSR--ASDSFRHLFASVADHFELCLAKCRTSSQSVQ 183
Query: 75 HENAY 79
HEN+Y
Sbjct: 184 HENSY 188
>gi|307185715|gb|EFN71631.1| UPF0454 protein C12orf49-like protein [Camponotus floridanus]
Length = 322
Score = 43.9 bits (102), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 39/83 (46%), Gaps = 10/83 (12%)
Query: 13 CNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV-------FDFCAGR 65
CN L CC YEYCVSCC++P + + Q+ K+ G + F C
Sbjct: 173 CNSLG-CCTIYEYCVSCCLDPDKQIL-QMRGRKLGSARMQKDGGRLRIRNQDRFQICLAV 230
Query: 66 CRHNSESVVHENAYLSDF-HHCF 87
CR +S SV HEN Y HC+
Sbjct: 231 CRTSSLSVQHENTYKDPLAKHCY 253
>gi|66817203|ref|XP_642480.1| hypothetical protein DDB_G0277747 [Dictyostelium discoideum AX4]
gi|60470568|gb|EAL68547.1| hypothetical protein DDB_G0277747 [Dictyostelium discoideum AX4]
Length = 312
Score = 43.9 bits (102), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 42/76 (55%), Gaps = 4/76 (5%)
Query: 16 LSQCCNSYEYCVSCCVNPART--LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESV 73
+S CC+SYE+C+SCC+ + L + ++N K + FD C+ CR +S S+
Sbjct: 229 VSDCCDSYEFCISCCMRQDKVSILHDTLINF-AEKNLDKKYLKNQFDLCSMACRTSSASL 287
Query: 74 VHENAYLS-DFHHCFS 88
V++ Y +F C+S
Sbjct: 288 VNQREYQDPEFKFCYS 303
>gi|291391470|ref|XP_002712459.1| PREDICTED: N-acetylglucosaminyltransferase V [Oryctolagus
cuniculus]
Length = 741
Score = 43.5 bits (101), Expect = 0.046, Method: Composition-based stats.
Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ + C + LN+ + + KY +SC Q
Sbjct: 635 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKNKDVLKYEVSC-----------QS 683
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D PS +P C++ + SC G+H +R+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRICPC 726
>gi|380014036|ref|XP_003691050.1| PREDICTED: UPF0454 protein C12orf49 homolog [Apis florea]
Length = 327
Score = 43.5 bits (101), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 46/102 (45%), Gaps = 16/102 (15%)
Query: 2 LPRERREILLGCNLLSQCCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV- 58
+PR R CN CC YEYCVSCC++P + +K + VL + + V
Sbjct: 157 IPRRERYSCKTCNA-QGCCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKSHRDQDVV 215
Query: 59 ---------FDFCAGRCRHNSESVVHENAYLSDFH--HCFSM 89
F C CR +S SV HEN Y D H HC++
Sbjct: 216 KKRLRNLDRFQVCLAACRTSSASVRHENTY-KDPHSKHCYTF 256
>gi|301770689|ref|XP_002920763.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A-like
[Ailuropoda melanoleuca]
gi|281337646|gb|EFB13230.1| hypothetical protein PANDA_009537 [Ailuropoda melanoleuca]
Length = 741
Score = 43.5 bits (101), Expect = 0.054, Method: Composition-based stats.
Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ + C + LN+ + + KY ++C Q
Sbjct: 635 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYEVTC-----------QS 683
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D PS +P C++ + SC G+H RR+CPC
Sbjct: 684 SELAKDILVPS-FDPKNRHCVFQGDLLLFSCAGAHPRHRRVCPC 726
>gi|426337254|ref|XP_004032629.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Gorilla
gorilla gorilla]
Length = 741
Score = 43.5 bits (101), Expect = 0.056, Method: Composition-based stats.
Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + V + G+SC VC+ S C + LN+ + + KY K C Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684
Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
E+A D PS +P C++ + SC G+H +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQADLLLFSCAGAHPRYQRVCPC 726
>gi|344268130|ref|XP_003405916.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Loxodonta
africana]
Length = 741
Score = 43.5 bits (101), Expect = 0.056, Method: Composition-based stats.
Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ S C + LN+ + + KY ++C Q
Sbjct: 635 LSALQVKLAEPGKSCKQVCQESQLICEPSFFQHLNKDKDMLKYEVTC-----------QS 683
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D PS +P + C++ + SC G+H +R+CPC
Sbjct: 684 SELAKDILVPS-FDPKSKHCVFQGDLLLFSCAGAHPRHQRVCPC 726
>gi|340712268|ref|XP_003394684.1| PREDICTED: UPF0454 protein C12orf49 homolog [Bombus terrestris]
Length = 329
Score = 43.1 bits (100), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 41/85 (48%), Gaps = 15/85 (17%)
Query: 19 CCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV----------FDFCAGRC 66
CC YEYCVSCC++P + +K + VL + A V F C C
Sbjct: 174 CCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKAHRDQDVVKKRLRNLDRFQVCLAAC 233
Query: 67 RHNSESVVHENAYLSDFH--HCFSM 89
R +S SV HEN Y D H HC+++
Sbjct: 234 RTSSASVRHENTY-KDPHSKHCYTL 257
>gi|350593258|ref|XP_003133326.3| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Sus scrofa]
Length = 738
Score = 43.1 bits (100), Expect = 0.060, Method: Composition-based stats.
Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ + C + LN+ + + KY + C Q
Sbjct: 632 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKDKDMLKYEVIC-----------QS 680
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D PS +P + C++ + SC G+H RR+CPC
Sbjct: 681 SELAKDILVPS-FDPKSKHCVFQGDLLLFSCAGAHPRHRRICPC 723
>gi|350405127|ref|XP_003487334.1| PREDICTED: UPF0454 protein C12orf49 homolog [Bombus impatiens]
Length = 329
Score = 43.1 bits (100), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 41/85 (48%), Gaps = 15/85 (17%)
Query: 19 CCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV----------FDFCAGRC 66
CC YEYCVSCC++P + +K + VL + A V F C C
Sbjct: 174 CCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKAHRDQDVVKKRLRNLDRFQVCLAAC 233
Query: 67 RHNSESVVHENAYLSDFH--HCFSM 89
R +S SV HEN Y D H HC+++
Sbjct: 234 RTSSASVRHENTY-KDPHSKHCYTL 257
>gi|297668509|ref|XP_002812479.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Pongo abelii]
Length = 741
Score = 43.1 bits (100), Expect = 0.069, Method: Composition-based stats.
Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + V + G+SC VC+ S C + LN+ + + KY K C Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684
Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
E+A D PS +P C++ + SC G+H +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 726
>gi|71424327|ref|XP_812757.1| glycosyl transferase [Trypanosoma cruzi strain CL Brener]
gi|70877576|gb|EAN90906.1| glycosyl transferase, putative [Trypanosoma cruzi]
Length = 657
Score = 43.1 bits (100), Expect = 0.070, Method: Composition-based stats.
Identities = 31/114 (27%), Positives = 42/114 (36%), Gaps = 29/114 (25%)
Query: 116 GESCDSVCKSSGQSCVLNKLIL--LNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPS- 172
G SCD+VC +G C L L LN CE ++ C R MG +D P+
Sbjct: 543 GLSCDAVCAQAGLRCALEALYFTSLNDCEELRSVFGCSR---CEMGVYWRPHPGNDFPAV 599
Query: 173 --------------------NLNPTACLYTRI---QSMLSCDGSHRHTRRLCPC 203
N C Y + + + +C S+ TRR CPC
Sbjct: 600 EDVSHEMEHENKERKRRLRDNSTNLVCRYNYLHDARGVPNCTASYTTTRRYCPC 653
>gi|402892052|ref|XP_003909237.1| PREDICTED: LOW QUALITY PROTEIN: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A-like [Papio
anubis]
gi|355566030|gb|EHH22459.1| hypothetical protein EGK_05729 [Macaca mulatta]
gi|355751629|gb|EHH55884.1| hypothetical protein EGM_05175 [Macaca fascicularis]
gi|380784409|gb|AFE64080.1| alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Macaca
mulatta]
gi|383412021|gb|AFH29224.1| alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Macaca
mulatta]
gi|384943612|gb|AFI35411.1| alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Macaca
mulatta]
Length = 741
Score = 43.1 bits (100), Expect = 0.074, Method: Composition-based stats.
Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + V + G+SC VC+ S C + LN+ + + KY K C Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684
Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
E+A D PS +P C++ + SC G+H +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 726
>gi|4505167|ref|NP_002401.1| alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Homo sapiens]
gi|114580983|ref|XP_515802.2| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A isoform 2 [Pan
troglodytes]
gi|397504546|ref|XP_003822849.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Pan paniscus]
gi|1169980|sp|Q09328.1|MGT5A_HUMAN RecName: Full=Alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A; AltName:
Full=Alpha-mannoside
beta-1,6-N-acetylglucosaminyltransferase; AltName:
Full=GlcNAc-T V; Short=GNT-V; AltName: Full=Mannoside
acetylglucosaminyltransferase 5; AltName:
Full=N-acetylglucosaminyl-transferase V
gi|4545222|gb|AAD22449.1|AF113921_1 alpha-1,3(6)-mannosylglycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [Homo sapiens]
gi|870752|dbj|BAA04570.1| N-acetylglucosaminyltransferase V [Homo sapiens]
gi|119632062|gb|EAX11657.1| mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase, isoform CRA_a
[Homo sapiens]
gi|119632063|gb|EAX11658.1| mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase, isoform CRA_a
[Homo sapiens]
gi|162317656|gb|AAI56314.1| Mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [synthetic
construct]
gi|162319448|gb|AAI57049.1| Mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [synthetic
construct]
gi|410216290|gb|JAA05364.1| mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [Pan
troglodytes]
gi|410265502|gb|JAA20717.1| mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [Pan
troglodytes]
gi|410308382|gb|JAA32791.1| mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [Pan
troglodytes]
gi|410332945|gb|JAA35419.1| mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [Pan
troglodytes]
Length = 741
Score = 42.7 bits (99), Expect = 0.075, Method: Composition-based stats.
Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + V + G+SC VC+ S C + LN+ + + KY K C Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684
Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
E+A D PS +P C++ + SC G+H +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 726
>gi|332236901|ref|XP_003267637.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Nomascus
leucogenys]
Length = 741
Score = 42.7 bits (99), Expect = 0.076, Method: Composition-based stats.
Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + V + G+SC VC+ S C + LN+ + + KY K C Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684
Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
E+A D PS +P C++ + SC G+H +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 726
>gi|332016576|gb|EGI57457.1| UPF0454 protein C12orf49-like protein [Acromyrmex echinatior]
Length = 293
Score = 42.7 bits (99), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 36/72 (50%), Gaps = 3/72 (4%)
Query: 19 CCNSYEYCVSCCVNPARTLK-EQVLNVKIAK-PTTAGTYGSVFDFCAGRCRHNSESVVHE 76
CC YEYCVSCC++P + ++ + L+ AK G F C CR +S SV +
Sbjct: 170 CCTIYEYCVSCCLHPDKQIRGRKDLSSGSAKVQKNVGRTAVHFQICLAACRTSSSSVRRD 229
Query: 77 NAYLSDF-HHCF 87
N Y HC+
Sbjct: 230 NTYKDPLAKHCY 241
>gi|290976364|ref|XP_002670910.1| predicted protein [Naegleria gruberi]
gi|284084474|gb|EFC38166.1| predicted protein [Naegleria gruberi]
Length = 466
Score = 42.7 bits (99), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 57/116 (49%), Gaps = 18/116 (15%)
Query: 90 PSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQK-YM 148
P+N G +T+L + I+ +GE+CD+ C + +C +N+C+ ++K Y
Sbjct: 364 PTNTHG--ITELHN----LKAILCEKGENCDTCCSRNNATCDGLFFPYINRCDEMRKHYP 417
Query: 149 SCKRGCLASMGADQPAEVADDAPS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
CK C E+AD++ S N + C+ +S C S+R RRLCPC
Sbjct: 418 DCK--C--------AYELADESRSPNFDGELCVIGNRRSRFRCGNSYRTERRLCPC 463
>gi|110755335|ref|XP_001121865.1| PREDICTED: UPF0454 protein C12orf49 homolog [Apis mellifera]
Length = 327
Score = 42.7 bits (99), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 45/101 (44%), Gaps = 16/101 (15%)
Query: 3 PRERREILLGCNLLSQCCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV-- 58
PR R CN CC YEYCVSCC++P + +K + VL + + V
Sbjct: 158 PRRERYSCKTCNA-QGCCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKSHRDQDVVK 216
Query: 59 --------FDFCAGRCRHNSESVVHENAYLSDFH--HCFSM 89
F C CR +S SV HEN Y D H HC++
Sbjct: 217 KRLRNLDRFQVCLAACRTSSASVRHENTY-KDPHSKHCYTF 256
>gi|428167931|gb|EKX36882.1| hypothetical protein GUITHDRAFT_116905 [Guillardia theta CCMP2712]
Length = 652
Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats.
Identities = 27/91 (29%), Positives = 37/91 (40%), Gaps = 6/91 (6%)
Query: 115 QGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASM-GADQPAEVADDAPSN 173
QG SC + C+ +C +LN CE + + C G D PA +D
Sbjct: 290 QGVSCFTACQEQNLACQRIWFDVLNNCEALAVAFPSHKICSTDFYGRDLPAYRPEDLTVL 349
Query: 174 LNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 204
+N Y+ SC G H T+RLC C
Sbjct: 350 VNKKPRTYSS-----SCGGKHEKTKRLCGCG 375
Score = 38.1 bits (87), Expect = 2.0, Method: Composition-based stats.
Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 4/91 (4%)
Query: 117 ESCDSVCKSSGQ--SCVLNKLILLNQCEIIQKYMSCKRGCLASM-GADQPAEVADDAPSN 173
++C C +G+ C LLN+C+++Q CL+ G D PA +D
Sbjct: 133 QNCHEACAENGEEWQCSSPWFDLLNRCDLLQVAFPHAATCLSHFYGRDLPALSSDVGGYA 192
Query: 174 LNPTACLYTRIQSM-LSCDGSHRHTRRLCPC 203
+ ++ Q +SCD H+ RLC C
Sbjct: 193 RGDYVLVNSKPQEYDISCDAQGEHSARLCAC 223
>gi|444512939|gb|ELV10213.1| Alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Tupaia
chinensis]
Length = 702
Score = 42.0 bits (97), Expect = 0.14, Method: Composition-based stats.
Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ + C + LN+ + + KY ++C Q
Sbjct: 596 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKDKDMAKYAVTC-----------QT 644
Query: 164 AEVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D PS +P C++ + SC G+H +R+CPC
Sbjct: 645 SELAKDILVPS-FDPKDKHCVFQGDLLLFSCAGAHPRHQRICPC 687
>gi|290987267|ref|XP_002676344.1| predicted protein [Naegleria gruberi]
gi|284089946|gb|EFC43600.1| predicted protein [Naegleria gruberi]
Length = 481
Score = 42.0 bits (97), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 53/123 (43%), Gaps = 9/123 (7%)
Query: 84 HHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEI 143
H F P +T E L+ + ++ +GE CD+ CK ++C ++ +N C
Sbjct: 365 HDVFEFPLKPPTLKITLSE--LNKVESVLCEEGEDCDTCCKKRAKTCDVSFFPYINHCRE 422
Query: 144 IQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDG--SHRHTRRLC 201
IQK+ K C + + + +A +N + C S SC S + RRLC
Sbjct: 423 IQKH--SKERCNCKIDDEFFSSIAPSMDTNKD---CYVGYRPSRFSCSNKKSKSNHRRLC 477
Query: 202 PCA 204
PC
Sbjct: 478 PCK 480
>gi|432096647|gb|ELK27235.1| Alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Myotis
davidii]
Length = 741
Score = 41.6 bits (96), Expect = 0.18, Method: Composition-based stats.
Identities = 31/104 (29%), Positives = 49/104 (47%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ + C + LN+ + + KY + C Q
Sbjct: 635 LSALQVKLAGPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYEVVC-----------QS 683
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D PS +P C++ + SC G+H RR+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHRRICPC 726
>gi|348585775|ref|XP_003478646.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A-like [Cavia
porcellus]
Length = 740
Score = 41.6 bits (96), Expect = 0.18, Method: Composition-based stats.
Identities = 31/104 (29%), Positives = 49/104 (47%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ S C + LN+ + + KY + C Q
Sbjct: 634 LSALQVKLASPGQSCKQVCQESQLICEPSFFQHLNKAKDLLKYEVIC-----------QS 682
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E A D PS + P + C++ + SC G+H +R+CPC
Sbjct: 683 SERAKDILVPS-IEPKSKHCVFQGDLLLFSCAGAHPMHQRICPC 725
>gi|417404503|gb|JAA49000.1| Putative secreted protein precursor [Desmodus rotundus]
Length = 771
Score = 41.6 bits (96), Expect = 0.20, Method: Composition-based stats.
Identities = 30/103 (29%), Positives = 48/103 (46%), Gaps = 15/103 (14%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ + C + LN+ + + KY + C Q
Sbjct: 665 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKEKDMLKYEVIC-----------QS 713
Query: 164 AEVADD--APS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D PS + C++ + SC G+H RR+CPC
Sbjct: 714 SELAKDILVPSYDPKNKHCVFQGDLLLFSCAGAHPRHRRICPC 756
>gi|417404303|gb|JAA48911.1| Putative secreted protein precursor [Desmodus rotundus]
Length = 741
Score = 41.6 bits (96), Expect = 0.20, Method: Composition-based stats.
Identities = 30/103 (29%), Positives = 48/103 (46%), Gaps = 15/103 (14%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ + C + LN+ + + KY + C Q
Sbjct: 635 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKEKDMLKYEVIC-----------QS 683
Query: 164 AEVADD--APS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D PS + C++ + SC G+H RR+CPC
Sbjct: 684 SELAKDILVPSYDPKNKHCVFQGDLLLFSCAGAHPRHRRICPC 726
>gi|412993097|emb|CCO16630.1| predicted protein [Bathycoccus prasinos]
Length = 152
Score = 41.6 bits (96), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 39/91 (42%), Gaps = 9/91 (9%)
Query: 5 ERREILLGCNL-----LSQCCNSYEYCVSCCVNPARTLKEQVLN---VKIAKPTTAGTYG 56
E+R C+L +CC +YE CVSCC N + E + +P T G +
Sbjct: 45 EKRYGCSACDLAKGGETEKCCGTYESCVSCCQNAKVSTAETDRTREPIGRMQPQT-GYFS 103
Query: 57 SVFDFCAGRCRHNSESVVHENAYLSDFHHCF 87
F C +CR VHE+ Y + CF
Sbjct: 104 DAFSHCKSKCRTQPTVTVHESTYGYEKRFCF 134
>gi|387019379|gb|AFJ51807.1| Alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Crotalus
adamanteus]
Length = 744
Score = 40.8 bits (94), Expect = 0.32, Method: Composition-based stats.
Identities = 30/103 (29%), Positives = 47/103 (45%), Gaps = 15/103 (14%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V I G+SC VC+ + C + LN+ + + KY + C
Sbjct: 638 LSALQVKIAETGKSCKQVCQENQLICEPSFFQHLNKDKDLLKYNIQC-----------HT 686
Query: 164 AEVADD--APS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
E A+D PS + C++ + SC GSH + +R+CPC
Sbjct: 687 VESANDILVPSFDEKRKHCVFQGDLLLFSCAGSHLNHKRICPC 729
>gi|187608310|ref|NP_001120456.1| mannosyl (alpha-1,6-)-glycoprotein
beta-1,6-N-acetyl-glucosaminyltransferase [Xenopus
(Silurana) tropicalis]
gi|170284640|gb|AAI61239.1| LOC100145551 protein [Xenopus (Silurana) tropicalis]
Length = 742
Score = 40.8 bits (94), Expect = 0.33, Method: Composition-based stats.
Identities = 30/103 (29%), Positives = 46/103 (44%), Gaps = 15/103 (14%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V I G+SC VC+ C + LN+ + K+ + C Q
Sbjct: 636 LSALQVKIAEPGKSCKQVCQEEQLICEPSYFQHLNKDTDLAKFGVRC-----------QS 684
Query: 164 AEVADD--APS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
+E A+D PS + C+ + SC GSH + +R+CPC
Sbjct: 685 SETANDIVVPSLDEKRNHCVLQGDLLLFSCAGSHSNHKRVCPC 727
>gi|170590022|ref|XP_001899772.1| hypothetical protein [Brugia malayi]
gi|158592898|gb|EDP31494.1| conserved hypothetical protein [Brugia malayi]
Length = 295
Score = 40.8 bits (94), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 48/94 (51%), Gaps = 16/94 (17%)
Query: 11 LGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV------FDFCAG 64
+ C+L + CC+ YE CVSCC+ P +++ + +K+ + T+ + F+ C
Sbjct: 201 VWCDLYTGCCSHYEQCVSCCLKP----EQKNILLKMIESTSGHRLRQILSATDQFELCML 256
Query: 65 RCRHNS-----ESVVHENAYLSD-FHHCFSMPSN 92
+CR +S ESV EN Y S+ +C+ N
Sbjct: 257 KCRTSSNVRAHESVHSENKYKSEKMKYCYKQEPN 290
>gi|196010315|ref|XP_002115022.1| predicted protein [Trichoplax adhaerens]
gi|190582405|gb|EDV22478.1| predicted protein [Trichoplax adhaerens]
Length = 1490
Score = 40.8 bits (94), Expect = 0.34, Method: Composition-based stats.
Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 3/60 (5%)
Query: 13 CNLLSQCCNSYEYCVSCCV--NPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNS 70
CN ++CC+ YE+CVSCC A LKE + N + + F+FC +CR +S
Sbjct: 111 CNS-NRCCDQYEFCVSCCQYRKEALDLKEIIRNSPDGIQQLLTSAKTRFEFCLIQCRTSS 169
>gi|255083216|ref|XP_002504594.1| predicted protein [Micromonas sp. RCC299]
gi|226519862|gb|ACO65852.1| predicted protein [Micromonas sp. RCC299]
Length = 366
Score = 40.8 bits (94), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 19/29 (65%)
Query: 59 FDFCAGRCRHNSESVVHENAYLSDFHHCF 87
FD+C RCR NS +ENA+ D HHCF
Sbjct: 157 FDYCKHRCRTNSGVTKYENAFQDDRHHCF 185
>gi|326923090|ref|XP_003207774.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A-like [Meleagris
gallopavo]
Length = 741
Score = 40.4 bits (93), Expect = 0.38, Method: Composition-based stats.
Identities = 32/102 (31%), Positives = 46/102 (45%), Gaps = 13/102 (12%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + V I G+SC VC+ S C + LN+ + + K+ CL
Sbjct: 635 LSALQVKIAEPGKSCKQVCQESQLICEPSFFQHLNKDKALLKH---NIECLT-------V 684
Query: 165 EVADD--APS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
E A+D PS + C++ + SC GSH RR+CPC
Sbjct: 685 ESANDILVPSFDGRRKHCVFQGDLLLFSCAGSHPTHRRICPC 726
>gi|73984271|ref|XP_541015.2| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Canis lupus
familiaris]
Length = 741
Score = 40.4 bits (93), Expect = 0.43, Method: Composition-based stats.
Identities = 30/104 (28%), Positives = 49/104 (47%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + + + G+SC VC+ + C + LN+ + I KY + C Q
Sbjct: 635 LSALQMKLAEPGQSCKQVCQENQLICEPSFFQHLNKDKDILKYEVVC-----------QS 683
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D PS +P C++ + SC G+H +R+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRICPC 726
>gi|118093838|ref|XP_422131.2| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Gallus gallus]
Length = 741
Score = 40.4 bits (93), Expect = 0.47, Method: Composition-based stats.
Identities = 28/99 (28%), Positives = 44/99 (44%), Gaps = 7/99 (7%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + V + G+SC VC+ S C + LN+ + + K+ CL + A+
Sbjct: 635 LSALQVKLAEPGKSCKQVCQESQLICEPSFFQHLNKDKALLKH---NIECLTTESANDIL 691
Query: 165 EVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
+ D C++ + SC GSH RR+CPC
Sbjct: 692 VPSFDGRRK----HCVFQGDLLLFSCAGSHPTHRRICPC 726
>gi|351707581|gb|EHB10500.1| Alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Heterocephalus
glaber]
Length = 741
Score = 40.0 bits (92), Expect = 0.51, Method: Composition-based stats.
Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ S C + LN+ + + KY + C Q
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKVKDLLKYEVIC-----------QS 683
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E A D PS ++P + C++ + SC G++ +R+CPC
Sbjct: 684 SEQAKDILVPS-IDPQSKHCVFQGDLLLFSCAGANPRHQRICPC 726
>gi|410968570|ref|XP_003990775.1| PREDICTED: LOW QUALITY PROTEIN: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Felis catus]
Length = 741
Score = 40.0 bits (92), Expect = 0.56, Method: Composition-based stats.
Identities = 30/104 (28%), Positives = 49/104 (47%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ + C + LN+ + + KY + C Q
Sbjct: 635 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKDKDMLKYEVIC-----------QS 683
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D PS +P C++ + SC G+H +R+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 726
>gi|440795738|gb|ELR16854.1| GNT-I family protein [Acanthamoeba castellanii str. Neff]
Length = 566
Score = 40.0 bits (92), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 62/165 (37%), Gaps = 21/165 (12%)
Query: 49 PTTAGTYGSV------FDFCAGRCRHNSESVVHENA---YLSDFHHCFSMPSNASGAGVT 99
P YGS+ +D R HN V+ L+D HC +P +
Sbjct: 401 PYVRENYGSISLSLALWD-TPSRGTHNHSLVLRYKKNTLLLADKRHCAYLPESLRILPHE 459
Query: 100 QLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRG-CLASM 158
QL RL +V GE C + C G C L C+ + C RG C ++
Sbjct: 460 QL--RLVTSAV-----GEDCHAPCAREGLKCHQGDFDFLRHCDALASRFPCHRGRCQLAV 512
Query: 159 GADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
G + P + P CL + + SC H+ +RLCPC
Sbjct: 513 GPEMPIFLP-HGPHQNATGRCLVS--EDFASCAAKHKGGQRLCPC 554
>gi|46047861|ref|NP_660110.2| alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Mus musculus]
gi|68052853|sp|Q8R4G6.1|MGT5A_MOUSE RecName: Full=Alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A; AltName:
Full=Alpha-mannoside
beta-1,6-N-acetylglucosaminyltransferase; AltName:
Full=GlcNAc-T V; Short=GNT-V; AltName: Full=Mannoside
acetylglucosaminyltransferase 5; AltName:
Full=N-acetylglucosaminyl-transferase V
gi|18997007|gb|AAL83249.1|AF474154_1 N-acetylglucosaminyltransferase V [Mus musculus]
gi|26326405|dbj|BAC26946.1| unnamed protein product [Mus musculus]
gi|116138347|gb|AAI25534.1| Mannoside acetylglucosaminyltransferase 5 [Mus musculus]
gi|116138349|gb|AAI25536.1| Mannoside acetylglucosaminyltransferase 5 [Mus musculus]
gi|148707821|gb|EDL39768.1| mannoside acetylglucosaminyltransferase 5 [Mus musculus]
Length = 740
Score = 40.0 bits (92), Expect = 0.60, Method: Composition-based stats.
Identities = 31/103 (30%), Positives = 48/103 (46%), Gaps = 15/103 (14%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + V + G+SC VC+ S C + LN+ + + KY K C Q +
Sbjct: 634 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKEKDLLKY---KVTC-------QSS 683
Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
E+ D PS P + C++ + SC G+H +R+CPC
Sbjct: 684 ELYKDILVPS-FYPKSKHCVFQGDLLLFSCAGAHPTHQRICPC 725
>gi|410906063|ref|XP_003966511.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A-like [Takifugu
rubripes]
Length = 741
Score = 39.7 bits (91), Expect = 0.64, Method: Composition-based stats.
Identities = 32/103 (31%), Positives = 43/103 (41%), Gaps = 15/103 (14%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGAD-QP 163
LS + V + G SC VC+ C + LN K LA GAD Q
Sbjct: 635 LSALQVRLAEPGRSCKRVCQEEQLICEPSFFQHLN-----------KDKDLARFGADCQT 683
Query: 164 AEV-ADDAPSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
E AD +PT C++ + SC G+H +R+CPC
Sbjct: 684 VESSADTVVPAYSPTRQHCIFQSDLLLFSCAGAHPTLQRICPC 726
>gi|431914235|gb|ELK15493.1| hypothetical protein PAL_GLEAN10010741 [Pteropus alecto]
Length = 219
Score = 39.7 bits (91), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 36/77 (46%), Gaps = 16/77 (20%)
Query: 19 CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNS------ 70
CC +YE+CVSCC+ P + L E+ LN +A F+ C +CR +S
Sbjct: 124 CCGAYEHCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQVRIFD 183
Query: 71 --------ESVVHENAY 79
+SV HEN Y
Sbjct: 184 VSRKLEDGQSVQHENTY 200
>gi|298710994|emb|CBJ32301.1| hypothetical protein Esi_0327_0028 [Ectocarpus siliculosus]
Length = 550
Score = 39.7 bits (91), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 36/114 (31%), Gaps = 49/114 (42%)
Query: 23 YEYCVSCCVNPARTLKEQVLNVKIA---KPTTAGTYGS---------------------- 57
YE+CVSCC NP R + + + A P GS
Sbjct: 391 YEFCVSCCQNPQRGEEREAIQASAALSGHPAYRDLLGSGNNDAANGREPPPPRQLRASAA 450
Query: 58 ------------------------VFDFCAGRCRHNSESVVHENAYLSDFHHCF 87
F CA RCR S SV HEN+Y S HCF
Sbjct: 451 AAEAGGVEKNGQQVEAEERDARELAFAHCAFRCRTYSGSVAHENSYRSPLKHCF 504
>gi|395843276|ref|XP_003794419.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Otolemur
garnettii]
Length = 741
Score = 39.3 bits (90), Expect = 0.91, Method: Composition-based stats.
Identities = 30/104 (28%), Positives = 48/104 (46%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
L+ + V + G+SC VC+ S C + LN+ + + KY + C Q
Sbjct: 635 LTALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYEVIC-----------QS 683
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D PS +P C++ + SC G+H R+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQDDLLLFSCAGAHPRHWRICPC 726
>gi|443721485|gb|ELU10776.1| hypothetical protein CAPTEDRAFT_200210 [Capitella teleta]
Length = 484
Score = 39.3 bits (90), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 55/120 (45%), Gaps = 7/120 (5%)
Query: 87 FSMPSNASGAGVTQLEGRLSGISVII-GRQGESCDSVCKSSGQSCVLNKLILLNQ-CEII 144
F+ P A + + + GRL +I+ G G C VC + C + L LLN+ C ++
Sbjct: 368 FATPHGAQ-SPLRGVIGRLQLDYIIVTGDDGLDCSQVCAGVYKECDVQGLFLLNKGCAVL 426
Query: 145 QKY-MSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
+ +SCK + QP D P+ L+ C T SC G H ++R+CPC
Sbjct: 427 ELLALSCKVCSEVKVEGSQP--YTGDMPAMLSDGNCC-TGWPHHSSCFGKHVESKRICPC 483
>gi|224056071|ref|XP_002197900.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Taeniopygia
guttata]
Length = 741
Score = 39.3 bits (90), Expect = 0.94, Method: Composition-based stats.
Identities = 31/102 (30%), Positives = 46/102 (45%), Gaps = 13/102 (12%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + V I G+SC VC+ S C + LN+ + + ++ CL
Sbjct: 635 LSALQVKIAEPGKSCKQVCQESQLICEPSFFQHLNKDKALLRH---NIECLT-------M 684
Query: 165 EVADD--APS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
E A+D PS + C++ + SC GSH RR+CPC
Sbjct: 685 ESANDILVPSFDGRRKHCVFQGDLLLFSCAGSHPAHRRICPC 726
>gi|449275163|gb|EMC84106.1| Alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A, partial
[Columba livia]
Length = 742
Score = 39.3 bits (90), Expect = 0.99, Method: Composition-based stats.
Identities = 31/102 (30%), Positives = 46/102 (45%), Gaps = 13/102 (12%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + V + G+SC VC+ S C + LN+ + + K+ CL
Sbjct: 636 LSALQVKMAEPGKSCKQVCQESQLICEPSFFQHLNKDKALLKH---NIECLT-------M 685
Query: 165 EVADD--APS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
E A+D PS + C++ + SC GSH RR+CPC
Sbjct: 686 ESANDILVPSFDGRRKHCVFQGDLLLFSCAGSHPTHRRICPC 727
>gi|12831213|ref|NP_075583.1| alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Rattus
norvegicus]
gi|729609|sp|Q08834.1|MGT5A_RAT RecName: Full=Alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A; AltName:
Full=Alpha-mannoside
beta-1,6-N-acetylglucosaminyltransferase; AltName:
Full=GlcNAc-T V; Short=GNT-V; AltName: Full=Mannoside
acetylglucosaminyltransferase 5; AltName:
Full=N-acetylglucosaminyl-transferase V
gi|349091|gb|AAA41665.1| N-acetylglucosaminyltransferase V [Rattus norvegicus]
gi|161728817|dbj|BAF94223.1| alpha-1,6-mannosyl-glycoprotein
6-beta-N-acetylglucosaminyltransferase [Rattus
norvegicus]
Length = 740
Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats.
Identities = 31/103 (30%), Positives = 48/103 (46%), Gaps = 15/103 (14%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + V + G+SC VC+ S C + LN+ + + KY K C Q +
Sbjct: 634 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKEKDLLKY---KVIC-------QSS 683
Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
E+ D PS P + C++ + SC G+H +R+CPC
Sbjct: 684 ELYKDILVPS-FYPKSKHCVFQGDLLLFSCAGAHPTHQRICPC 725
>gi|161728838|dbj|BAF94243.1| alpha-1,6-mannosyl-glycoprotein
6-beta-N-acetylglucosaminyltransferase [Rattus
norvegicus]
Length = 740
Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats.
Identities = 31/103 (30%), Positives = 48/103 (46%), Gaps = 15/103 (14%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + V + G+SC VC+ S C + LN+ + + KY K C Q +
Sbjct: 634 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKEKDLLKY---KVIC-------QSS 683
Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
E+ D PS P + C++ + SC G+H +R+CPC
Sbjct: 684 ELYKDILVPS-FYPKSKHCVFQGDLLLFSCAGAHPTHQRICPC 725
>gi|149730542|ref|XP_001489147.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Equus
caballus]
Length = 741
Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats.
Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 17/104 (16%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V G+SC VC+ + C + LN+ + + KY + C Q
Sbjct: 635 LSALQVKFAEPGQSCKQVCQENQLICEPSFFQHLNKDKDMLKYEVIC-----------QS 683
Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
+E+A D PS +P C++ + SC G+H R+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHHRICPC 726
>gi|395519453|ref|XP_003763863.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Sarcophilus
harrisii]
Length = 743
Score = 38.9 bits (89), Expect = 1.2, Method: Composition-based stats.
Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 9/100 (9%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ + C + LN+ + + KY + C LA+ D
Sbjct: 637 LSALQVKLSEPGKSCKQVCQENQLICEPSFFQHLNKDKDVLKYEVVCHTTELAN---DIL 693
Query: 164 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
DD + C++ + SC G+H +R+CPC
Sbjct: 694 VPSYDDRKKH-----CVFQGDLLLFSCAGAHPKHKRICPC 728
>gi|291221905|ref|XP_002730960.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 200
Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 37/64 (57%), Gaps = 4/64 (6%)
Query: 19 CCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVH 75
CC+ YE+CVSCC+ P + Q + + +K + SV F+ C +CR +SESV H
Sbjct: 118 CCSIYEHCVSCCLQPQKQQLLQGVLGRASKAFQI-LFSSVTDHFELCLAKCRTSSESVQH 176
Query: 76 ENAY 79
EN Y
Sbjct: 177 ENTY 180
>gi|303280994|ref|XP_003059789.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458444|gb|EEH55741.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 274
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 28/49 (57%), Gaps = 1/49 (2%)
Query: 57 SVFDFCAGRCRHNSESVVHENAYLSDFHHCF-SMPSNASGAGVTQLEGR 104
S F++CA RCR NS +EN + HHCF S+ + + GA T GR
Sbjct: 204 SSFEYCAHRCRTNSAVTKYENEFQDAKHHCFGSVYAESVGADDTLSPGR 252
>gi|428175838|gb|EKX44726.1| hypothetical protein GUITHDRAFT_109505 [Guillardia theta CCMP2712]
Length = 551
Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 15/92 (16%)
Query: 118 SCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQP------AEVADDAP 171
+CDSVC+S+G+ C + + L++C+I +K + C + D P E+
Sbjct: 124 NCDSVCQSAGKVCARDWFVFLDRCDIAKKVFP-EGTCSFAFYPDMPFYDKTTKEIFVSKD 182
Query: 172 SNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
+ L+P C + R RLCPC
Sbjct: 183 TKLDPPTCDAAPLGG--------RSAERLCPC 206
>gi|156407244|ref|XP_001641454.1| predicted protein [Nematostella vectensis]
gi|156228593|gb|EDO49391.1| predicted protein [Nematostella vectensis]
Length = 746
Score = 38.1 bits (87), Expect = 2.2, Method: Composition-based stats.
Identities = 27/99 (27%), Positives = 37/99 (37%), Gaps = 5/99 (5%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
L+ + G G SC C S G C +N E + C + D P+
Sbjct: 639 LNALQARKGAMGSSCKETCHSLGLVCEPQYFPAINTKE---RMTRSGFPCNTTRVEDMPS 695
Query: 165 EVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
VA + P CL + SC + T+RLCPC
Sbjct: 696 LVAPGYRDD--PPVCLRQAQNLLFSCTANSPTTKRLCPC 732
>gi|62702324|gb|AAX93248.1| unknown [Homo sapiens]
Length = 231
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 9/100 (9%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ S C + LN+ + + KY ++C+ LA +
Sbjct: 125 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYKVTCQSSELA-----KD 179
Query: 164 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
V P N + C++ + SC G+H +R+CPC
Sbjct: 180 ILVPSFDPKNKH---CVFQGDLLLFSCAGAHPRHQRVCPC 216
>gi|113671559|ref|NP_001038776.1| alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Danio rerio]
gi|108742113|gb|AAI17665.1| Zgc:136939 [Danio rerio]
Length = 742
Score = 37.7 bits (86), Expect = 2.7, Method: Composition-based stats.
Identities = 26/100 (26%), Positives = 45/100 (45%), Gaps = 9/100 (9%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G+SC VC+ C + LN+ + + +Y + CK D
Sbjct: 636 LSSMQVRLAPAGKSCKQVCQEEQLICEPSFFQHLNKDKDLARYGVECKT-------VDSA 688
Query: 164 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
++ A S + C++ + SC G+H+ R+CPC
Sbjct: 689 NDIVVPAFSE-SSRHCVFQSDLLLFSCAGAHQTLTRICPC 727
>gi|126326120|ref|XP_001363544.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A [Monodelphis
domestica]
Length = 743
Score = 37.4 bits (85), Expect = 3.1, Method: Composition-based stats.
Identities = 27/100 (27%), Positives = 46/100 (46%), Gaps = 9/100 (9%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
L+ + V + G+SC VC+ + C + LN+ + + KY + C LA+ D
Sbjct: 637 LNALQVKLSEPGKSCKQVCQENQLICEPSFFQHLNKDKDVLKYEVICHTTELAN---DIL 693
Query: 164 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
DD + C++ + SC G+H +R+CPC
Sbjct: 694 VPSYDDKKKH-----CVFQGDLLLFSCAGAHPKHKRICPC 728
>gi|348520977|ref|XP_003448003.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase B-like
[Oreochromis niloticus]
Length = 805
Score = 37.4 bits (85), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 44/105 (41%), Gaps = 19/105 (18%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + +++ ++G+SC C+S+G C +N E ++ G D
Sbjct: 699 LSSLRLLVSQEGQSCVEACQSAGFICEPAHFRFINNKEALR-------------GLDVQC 745
Query: 165 EVADDAPSNLNPT------ACLYTRIQSMLSCDGSHRHTRRLCPC 203
EV D +++ P C R + SC G RRLCPC
Sbjct: 746 EVVDSEINHILPAFSVVRQECGLQREPLLFSCAGHSPKYRRLCPC 790
>gi|432849807|ref|XP_004066622.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A-like [Oryzias
latipes]
Length = 741
Score = 37.4 bits (85), Expect = 3.6, Method: Composition-based stats.
Identities = 25/99 (25%), Positives = 44/99 (44%), Gaps = 7/99 (7%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + V + G+SC VC+ C + LN+ + + ++ +A + A
Sbjct: 635 LSALQVKLAEPGQSCKQVCQEQQLICEPSFFQHLNKDKDLDRW------GVACQTVESSA 688
Query: 165 EVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
+ A S + C+ + SC G+H+ RR+CPC
Sbjct: 689 DTVVPAYSE-SRKHCVLQSDLLLFSCAGAHQSLRRVCPC 726
>gi|297266872|ref|XP_002799448.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A-like [Macaca
mulatta]
Length = 547
Score = 37.4 bits (85), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
LS + V + G+SC VC+ S C + LN+ + + KY K C Q +
Sbjct: 441 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 490
Query: 165 EVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 203
E+A D PS +P C++ + SC G+H +R+CPC
Sbjct: 491 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 532
>gi|426239231|ref|XP_004013529.1| PREDICTED: LOW QUALITY PROTEIN: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase B [Ovis aries]
Length = 801
Score = 37.0 bits (84), Expect = 4.6, Method: Composition-based stats.
Identities = 26/88 (29%), Positives = 32/88 (36%), Gaps = 7/88 (7%)
Query: 116 GESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLN 175
G +C C G C + LN + QK G + M PA P
Sbjct: 706 GRACTDACLDHGLICEPSFFPFLNSQDAFQKLQVPCDGIESEMNHLYPAFAQ---PGQ-- 760
Query: 176 PTACLYTRIQSMLSCDGSHRHTRRLCPC 203
C + + SC GS R RRLCPC
Sbjct: 761 --ECFLQKEPLLFSCAGSSRKCRRLCPC 786
>gi|348506712|ref|XP_003440902.1| PREDICTED: alpha-1,6-mannosylglycoprotein
6-beta-N-acetylglucosaminyltransferase A-like
[Oreochromis niloticus]
Length = 741
Score = 36.6 bits (83), Expect = 5.7, Method: Composition-based stats.
Identities = 25/100 (25%), Positives = 43/100 (43%), Gaps = 9/100 (9%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
LS + V + G SC VC+ C + LN+ + + ++ M C+ +
Sbjct: 635 LSALQVKLAEPGHSCKQVCQEEQLICEPSFFQHLNKDKDLTRFGMDCQT-------VESS 687
Query: 164 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
A+ A S C++ + SC G+H +R+CPC
Sbjct: 688 ADTVVPAYSETR-HHCIFQSDLLLFSCAGAHSTLQRICPC 726
>gi|291233073|ref|XP_002736478.1| PREDICTED: N-acetylglucosaminyltransferase V-like [Saccoglossus
kowalevskii]
Length = 561
Score = 36.6 bits (83), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 55/129 (42%), Gaps = 19/129 (14%)
Query: 77 NAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLI 136
NAY+ + C + P LS + V I ESC +VC +G C +
Sbjct: 435 NAYIENQDFCSANPKQWPP---------LSALKVKISNPKESCKAVCMKNGLICEPSFFK 485
Query: 137 LLNQCEIIQK--YMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSH 194
LN E+I++ Y+ CK ++ P+ + + SN+ C + + + SC G
Sbjct: 486 SLNTQEMIEREGYLECK-SVISVEDIYAPSFIKE---SNI----CHLEKQKLLFSCAGGS 537
Query: 195 RHTRRLCPC 203
R+CPC
Sbjct: 538 DTRNRVCPC 546
>gi|156397344|ref|XP_001637851.1| predicted protein [Nematostella vectensis]
gi|156224967|gb|EDO45788.1| predicted protein [Nematostella vectensis]
Length = 561
Score = 35.8 bits (81), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 44/101 (43%), Gaps = 11/101 (10%)
Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCK-RGCLASMGADQ 162
LS + + GR+G+SC C G C + LN + +++ M CK L ++ A
Sbjct: 455 LSEMLTVKGREGQSCKDACFQKGLLCEPSFFPQLNSVKQLKRNGMRCKSYKSLETLYA-- 512
Query: 163 PAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
PA + D +CL + SC H RLCPC
Sbjct: 513 PASNSTDG-------SCLLQEQTLLFSCVARDSHVTRLCPC 546
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.133 0.424
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,106,558,634
Number of Sequences: 23463169
Number of extensions: 116772032
Number of successful extensions: 219176
Number of sequences better than 100.0: 229
Number of HSP's better than 100.0 without gapping: 114
Number of HSP's successfully gapped in prelim test: 115
Number of HSP's that attempted gapping in prelim test: 218859
Number of HSP's gapped (non-prelim): 243
length of query: 204
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 68
effective length of database: 9,168,204,383
effective search space: 623437898044
effective search space used: 623437898044
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 73 (32.7 bits)