BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 028746
         (204 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|118485488|gb|ABK94599.1| unknown [Populus trichocarpa]
          Length = 268

 Score =  324 bits (831), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 152/193 (78%), Positives = 175/193 (90%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GCNL+SQCCNSYE+CVSCC++P+RT + QVL VKIAKP+TA +Y SVFDFCAGRCRHNSE
Sbjct: 76  GCNLVSQCCNSYEFCVSCCLHPSRTQEAQVLKVKIAKPSTAVSYTSVFDFCAGRCRHNSE 135

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY SDFHHCFS+PSN+SGA  TQLE RL GI+V+IG+QGESCDSVCKS+GQSCV
Sbjct: 136 SVVHENAYRSDFHHCFSLPSNSSGANYTQLEARLGGINVVIGKQGESCDSVCKSNGQSCV 195

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
           LNKL++LNQC+++QKYMSCK GCLAS+G DQPAEV +DAP +LNP ACLYTR QS+LSCD
Sbjct: 196 LNKLLVLNQCDVMQKYMSCKGGCLASIGTDQPAEVVEDAPRHLNPGACLYTRTQSLLSCD 255

Query: 192 GSHRHTRRLCPCA 204
           GS  HTRRLCPCA
Sbjct: 256 GSLWHTRRLCPCA 268


>gi|224112759|ref|XP_002332721.1| predicted protein [Populus trichocarpa]
 gi|222832884|gb|EEE71361.1| predicted protein [Populus trichocarpa]
          Length = 245

 Score =  323 bits (828), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 152/193 (78%), Positives = 175/193 (90%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GCNL+SQCCNSYE+CVSCC++P+RT + QVL VKIAKP+TA +Y SVFDFCAGRCRHNSE
Sbjct: 53  GCNLVSQCCNSYEFCVSCCLHPSRTQEAQVLKVKIAKPSTAVSYTSVFDFCAGRCRHNSE 112

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY SDFHHCFS+PSN+SGA  TQLE RL GI+V+IG+QGESCDSVCKS+GQSCV
Sbjct: 113 SVVHENAYRSDFHHCFSLPSNSSGANYTQLEARLGGINVVIGKQGESCDSVCKSNGQSCV 172

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
           LNKL++LNQC+++QKYMSCK GCLAS+G DQPAEV +DAP +LNP ACLYTR QS+LSCD
Sbjct: 173 LNKLLVLNQCDVMQKYMSCKGGCLASIGTDQPAEVVEDAPRHLNPGACLYTRTQSLLSCD 232

Query: 192 GSHRHTRRLCPCA 204
           GS  HTRRLCPCA
Sbjct: 233 GSLWHTRRLCPCA 245


>gi|359478124|ref|XP_002264803.2| PREDICTED: UPF0454 protein C12orf49 homolog [Vitis vinifera]
          Length = 279

 Score =  318 bits (814), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 151/194 (77%), Positives = 171/194 (88%), Gaps = 1/194 (0%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GCNL+SQCCNSYE+CVSCC+NPART KE  L VK+AKP TAG Y SVFDFC GRCRHNSE
Sbjct: 86  GCNLISQCCNSYEFCVSCCLNPARTQKELALKVKMAKPVTAGAYASVFDFCTGRCRHNSE 145

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQ-LEGRLSGISVIIGRQGESCDSVCKSSGQSC 130
           SVVHENAYLSDFHHCFS+PSN++  GV +  E RL+GI V++GRQGESCDSVCKS+GQSC
Sbjct: 146 SVVHENAYLSDFHHCFSLPSNSTAGGVDRNPELRLNGIHVVVGRQGESCDSVCKSNGQSC 205

Query: 131 VLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSC 190
           V +KL +LNQCEI+QKYMSCK  CLAS+GADQPAEV DDAP +LNP ACLYTR Q++LSC
Sbjct: 206 VPSKLSVLNQCEIMQKYMSCKGACLASIGADQPAEVVDDAPRHLNPGACLYTRTQTILSC 265

Query: 191 DGSHRHTRRLCPCA 204
           DGSH+HTRRLCPCA
Sbjct: 266 DGSHQHTRRLCPCA 279


>gi|297743774|emb|CBI36657.3| unnamed protein product [Vitis vinifera]
          Length = 273

 Score =  318 bits (814), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 151/194 (77%), Positives = 171/194 (88%), Gaps = 1/194 (0%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GCNL+SQCCNSYE+CVSCC+NPART KE  L VK+AKP TAG Y SVFDFC GRCRHNSE
Sbjct: 80  GCNLISQCCNSYEFCVSCCLNPARTQKELALKVKMAKPVTAGAYASVFDFCTGRCRHNSE 139

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQ-LEGRLSGISVIIGRQGESCDSVCKSSGQSC 130
           SVVHENAYLSDFHHCFS+PSN++  GV +  E RL+GI V++GRQGESCDSVCKS+GQSC
Sbjct: 140 SVVHENAYLSDFHHCFSLPSNSTAGGVDRNPELRLNGIHVVVGRQGESCDSVCKSNGQSC 199

Query: 131 VLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSC 190
           V +KL +LNQCEI+QKYMSCK  CLAS+GADQPAEV DDAP +LNP ACLYTR Q++LSC
Sbjct: 200 VPSKLSVLNQCEIMQKYMSCKGACLASIGADQPAEVVDDAPRHLNPGACLYTRTQTILSC 259

Query: 191 DGSHRHTRRLCPCA 204
           DGSH+HTRRLCPCA
Sbjct: 260 DGSHQHTRRLCPCA 273


>gi|356538903|ref|XP_003537940.1| PREDICTED: uncharacterized protein LOC100782588 [Glycine max]
          Length = 281

 Score =  312 bits (800), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 148/193 (76%), Positives = 168/193 (87%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GCNLLSQCCNSYEYCVSCC+NPA   KEQVL +K+AKPTTA TY SVFD+CAGRCRH+SE
Sbjct: 89  GCNLLSQCCNSYEYCVSCCLNPALISKEQVLKMKVAKPTTARTYSSVFDYCAGRCRHSSE 148

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY+SDFHHCFS+ SN+SG   T  E RLSGI+V++GRQGESC+SVCKS GQSCV
Sbjct: 149 SVVHENAYISDFHHCFSLSSNSSGKNSTLTEARLSGINVVVGRQGESCNSVCKSRGQSCV 208

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
            NKL++LN C+IIQKYMSCK  CLAS+GADQPAEV  DAP +LNP +CLYT  QS+LSCD
Sbjct: 209 PNKLVVLNSCDIIQKYMSCKGSCLASVGADQPAEVVYDAPVHLNPGSCLYTETQSILSCD 268

Query: 192 GSHRHTRRLCPCA 204
           G H+HTRRLCPCA
Sbjct: 269 GLHQHTRRLCPCA 281


>gi|388496984|gb|AFK36558.1| unknown [Lotus japonicus]
          Length = 272

 Score =  308 bits (788), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 143/193 (74%), Positives = 167/193 (86%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GCN+LSQCCNSYEYCVSCC+NPA T KEQVL +KI+KP TA TY +VFD+CA RCRH+SE
Sbjct: 80  GCNVLSQCCNSYEYCVSCCLNPALTSKEQVLKMKISKPVTARTYENVFDYCAARCRHSSE 139

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAYLSDFHHCFS+PSN+SG  VT  E RL+GI+V++GRQGESC SVCK  GQSCV
Sbjct: 140 SVVHENAYLSDFHHCFSLPSNSSGTNVTLTEARLNGINVVVGRQGESCSSVCKLRGQSCV 199

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
            NKL++LN C+IIQ+YMSCK  CLAS+G DQPAEV  DAP +LNP +CLYT  +S+LSCD
Sbjct: 200 PNKLVVLNNCDIIQRYMSCKGSCLASVGTDQPAEVVYDAPKDLNPGSCLYTETESVLSCD 259

Query: 192 GSHRHTRRLCPCA 204
           GSH+HT+RLCPCA
Sbjct: 260 GSHQHTKRLCPCA 272


>gi|297817236|ref|XP_002876501.1| hypothetical protein ARALYDRAFT_486406 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322339|gb|EFH52760.1| hypothetical protein ARALYDRAFT_486406 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 272

 Score =  303 bits (776), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 148/193 (76%), Positives = 171/193 (88%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GCNLLSQCCNSYE+CVSCC+NP+RTL E+V+ VK+AKP TAGTY SVFDFCAGRCRHNSE
Sbjct: 80  GCNLLSQCCNSYEFCVSCCLNPSRTLLEKVVKVKVAKPATAGTYKSVFDFCAGRCRHNSE 139

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY S+FHHCFS+ SNASGA +TQ+E RL GI VI+G QG+SCD+VCKS GQ CV
Sbjct: 140 SVVHENAYHSEFHHCFSLTSNASGANLTQVETRLLGIDVIVGSQGDSCDAVCKSRGQLCV 199

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
           +NKL LLNQC+++++YM+CK  CLAS GADQPAEV +DAP +L P ACLYTR QSMLSCD
Sbjct: 200 MNKLSLLNQCDVMKRYMTCKGSCLASAGADQPAEVVEDAPRDLYPGACLYTRTQSMLSCD 259

Query: 192 GSHRHTRRLCPCA 204
           GSH+HTRRLCPCA
Sbjct: 260 GSHQHTRRLCPCA 272


>gi|30694985|ref|NP_191498.2| uncharacterized protein [Arabidopsis thaliana]
 gi|42572729|ref|NP_974460.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332646394|gb|AEE79915.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332646395|gb|AEE79916.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 273

 Score =  297 bits (761), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 144/193 (74%), Positives = 171/193 (88%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GCN+LSQCCNSYE+CVSCC+NP++TL E+V+ VK+AKP T+GTY SVFDFCAGRCRHNSE
Sbjct: 81  GCNILSQCCNSYEFCVSCCLNPSQTLLEKVVKVKVAKPATSGTYKSVFDFCAGRCRHNSE 140

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY S+FHHCFS+ SNASGA +TQ+E RL GI VI+G QG+SCD+VCKS GQ CV
Sbjct: 141 SVVHENAYHSEFHHCFSLTSNASGANLTQVETRLLGIDVIVGSQGDSCDAVCKSRGQLCV 200

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
           +NKL LLNQC+++++YM+CK  CLAS GADQPAEV +DAP +L P ACLYTR QS+LSCD
Sbjct: 201 MNKLSLLNQCDVMKRYMNCKGSCLASAGADQPAEVVEDAPRDLYPGACLYTRTQSLLSCD 260

Query: 192 GSHRHTRRLCPCA 204
           GSH+HTRRLCPCA
Sbjct: 261 GSHQHTRRLCPCA 273


>gi|255585501|ref|XP_002533442.1| conserved hypothetical protein [Ricinus communis]
 gi|223526704|gb|EEF28938.1| conserved hypothetical protein [Ricinus communis]
          Length = 251

 Score =  274 bits (701), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 128/163 (78%), Positives = 145/163 (88%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GCNLLS CCNSYE+CVSCC+NPART +E+V+ +KIAKP TAGTY S FDFCAGRCRHNSE
Sbjct: 83  GCNLLSHCCNSYEFCVSCCLNPARTQEEEVMKLKIAKPATAGTYTSTFDFCAGRCRHNSE 142

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAYLSDFHHCFS+PSN++GA   QLE RL GI+VIIGRQGESCDS CKS+GQSCV
Sbjct: 143 SVVHENAYLSDFHHCFSLPSNSTGANYRQLEARLVGINVIIGRQGESCDSACKSNGQSCV 202

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNL 174
           LNKL++LNQC+I+QKYMSC+  CLASMG DQPAEV D AP +L
Sbjct: 203 LNKLLVLNQCDIMQKYMSCRGACLASMGTDQPAEVVDIAPKHL 245


>gi|242084782|ref|XP_002442816.1| hypothetical protein SORBIDRAFT_08g003310 [Sorghum bicolor]
 gi|241943509|gb|EES16654.1| hypothetical protein SORBIDRAFT_08g003310 [Sorghum bicolor]
          Length = 276

 Score =  273 bits (699), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 126/193 (65%), Positives = 155/193 (80%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GC L  QCCNSYEYCVSCC+NP++  KE VL +K+AKP TAGTY +VFDFC GRCRH+S 
Sbjct: 84  GCKLDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY SDFHHCFS+  N+SG+  + +  +L GI+V++GR GESC  VCK  GQSCV
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSVVSKLLGINVVVGRPGESCSLVCKVRGQSCV 203

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
            ++L +LN+CEI+QKYM CK GC  S+G DQPAEV D+AP++LNP ACLY ++   L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCD 263

Query: 192 GSHRHTRRLCPCA 204
           GSH+HTRRLCPCA
Sbjct: 264 GSHQHTRRLCPCA 276


>gi|357156916|ref|XP_003577619.1| PREDICTED: uncharacterized protein LOC100836694 [Brachypodium
           distachyon]
          Length = 275

 Score =  270 bits (690), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 122/193 (63%), Positives = 153/193 (79%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GC L  QCCNSYEYCVSCC++P+RT +  VL +K+AKP TAGTY ++FDFC GRCRH+S 
Sbjct: 83  GCKLDLQCCNSYEYCVSCCLDPSRTKERDVLKLKVAKPVTAGTYANIFDFCMGRCRHSSA 142

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY+SDFHHCF +  N SG+  +    RL+GISVI+GRQGE C S C++ GQSCV
Sbjct: 143 SVVHENAYVSDFHHCFMVQQNLSGSTDSNYGSRLAGISVIVGRQGEPCSSTCRAKGQSCV 202

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
            ++L  LN+C+I+QKYM C+ GC  S+G DQPAEV D+AP++LNP ACLY +    L+CD
Sbjct: 203 PSRLSELNKCQILQKYMRCRSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQTDERLTCD 262

Query: 192 GSHRHTRRLCPCA 204
           GSH+HTRR+CPCA
Sbjct: 263 GSHQHTRRVCPCA 275


>gi|218185557|gb|EEC67984.1| hypothetical protein OsI_35752 [Oryza sativa Indica Group]
          Length = 276

 Score =  266 bits (679), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 124/193 (64%), Positives = 156/193 (80%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GC L  QCC+SYEYCVSCC+NP+RT +  VL +K+A+P T+GTY +VFDFC GRCRH+S 
Sbjct: 84  GCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVARPVTSGTYRNVFDFCMGRCRHSSA 143

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY SDFHHCF +  N+SG+  +    RL GI++ IGR+GESC SVC++ GQSCV
Sbjct: 144 SVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGINISIGRRGESCSSVCRAKGQSCV 203

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
            ++L +LN+CEI+QKYM CK GC +S+G DQPA+V D+APSNLNP ACLY ++   L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFSSLGPDQPAQVVDEAPSNLNPGACLYMQMDERLTCD 263

Query: 192 GSHRHTRRLCPCA 204
           GSH+HTRRLCPCA
Sbjct: 264 GSHQHTRRLCPCA 276


>gi|297728217|ref|NP_001176472.1| Os11g0267100 [Oryza sativa Japonica Group]
 gi|255679983|dbj|BAH95200.1| Os11g0267100 [Oryza sativa Japonica Group]
          Length = 208

 Score =  265 bits (677), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 123/193 (63%), Positives = 156/193 (80%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GC L  QCC+SYEYCVSCC+NP+RT +  VL +K+A+P T+GTY +VFDFC GRCRH+S 
Sbjct: 16  GCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVARPVTSGTYRNVFDFCMGRCRHSSA 75

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY SDFHHCF +  N+SG+  +    RL GI++ IGR+GESC SVC++ GQSCV
Sbjct: 76  SVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGINISIGRRGESCSSVCRAKGQSCV 135

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
            ++L +LN+CEI+QKYM CK GC +++G DQPA+V D+APSNLNP ACLY ++   L+CD
Sbjct: 136 PSRLSVLNKCEILQKYMRCKSGCFSNLGPDQPAQVVDEAPSNLNPGACLYMQMDERLTCD 195

Query: 192 GSHRHTRRLCPCA 204
           GSH+HTRRLCPCA
Sbjct: 196 GSHQHTRRLCPCA 208


>gi|222615817|gb|EEE51949.1| hypothetical protein OsJ_33587 [Oryza sativa Japonica Group]
          Length = 276

 Score =  264 bits (675), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 123/193 (63%), Positives = 156/193 (80%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GC L  QCC+SYEYCVSCC+NP+RT +  VL +K+A+P T+GTY +VFDFC GRCRH+S 
Sbjct: 84  GCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVARPVTSGTYRNVFDFCMGRCRHSSA 143

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY SDFHHCF +  N+SG+  +    RL GI++ IGR+GESC SVC++ GQSCV
Sbjct: 144 SVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGINISIGRRGESCSSVCRAKGQSCV 203

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
            ++L +LN+CEI+QKYM CK GC +++G DQPA+V D+APSNLNP ACLY ++   L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFSNLGPDQPAQVVDEAPSNLNPGACLYMQMDERLTCD 263

Query: 192 GSHRHTRRLCPCA 204
           GSH+HTRRLCPCA
Sbjct: 264 GSHQHTRRLCPCA 276


>gi|449469174|ref|XP_004152296.1| PREDICTED: uncharacterized protein LOC101217783 [Cucumis sativus]
          Length = 252

 Score =  264 bits (674), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 131/193 (67%), Positives = 150/193 (77%), Gaps = 19/193 (9%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GCNL+ QCCNSYEYCVSCC NP+RT +EQ+L +KIAKP TAG                  
Sbjct: 79  GCNLVLQCCNSYEYCVSCCQNPSRTKREQILKIKIAKPATAGK----------------- 121

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
             VHENAY S FHHCFS+PS +SG   TQLE RLSGI+VIIGRQGESCDSVCKS+GQSCV
Sbjct: 122 --VHENAYHSAFHHCFSLPSYSSGDNSTQLEFRLSGINVIIGRQGESCDSVCKSNGQSCV 179

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
            NK+++LN C+I+QKY+SCK GCLAS+G DQPAEVA DAP  LNP ACLY+  QSM+SCD
Sbjct: 180 PNKILVLNHCDIMQKYLSCKGGCLASVGTDQPAEVAYDAPKELNPGACLYSSAQSMVSCD 239

Query: 192 GSHRHTRRLCPCA 204
           GSH HT+RLCPCA
Sbjct: 240 GSHPHTKRLCPCA 252


>gi|219887191|gb|ACL53970.1| unknown [Zea mays]
 gi|413916125|gb|AFW56057.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
 gi|413916126|gb|AFW56058.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
 gi|413916127|gb|AFW56059.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
          Length = 276

 Score =  263 bits (672), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 125/193 (64%), Positives = 152/193 (78%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GC L  QCCNSYEYCVSCC+NP++  KE VL +K+AKP TAGTY +VFDFC GRCRH+S 
Sbjct: 84  GCELDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY SDFHHCFS+  N+SG+  +    +L GI+V++GR  ESC  VCK  GQSCV
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVGRPRESCSLVCKVRGQSCV 203

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
            ++L +LN+CEI+QKYM CK GC  S+G DQPAEV D+AP++LNP ACLY ++   L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCD 263

Query: 192 GSHRHTRRLCPCA 204
           GSH HTRRLCPCA
Sbjct: 264 GSHHHTRRLCPCA 276


>gi|413916124|gb|AFW56056.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
          Length = 258

 Score =  263 bits (671), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 125/193 (64%), Positives = 152/193 (78%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GC L  QCCNSYEYCVSCC+NP++  KE VL +K+AKP TAGTY +VFDFC GRCRH+S 
Sbjct: 66  GCELDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 125

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY SDFHHCFS+  N+SG+  +    +L GI+V++GR  ESC  VCK  GQSCV
Sbjct: 126 SVVHENAYASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVGRPRESCSLVCKVRGQSCV 185

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
            ++L +LN+CEI+QKYM CK GC  S+G DQPAEV D+AP++LNP ACLY ++   L+CD
Sbjct: 186 PSRLSVLNKCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCD 245

Query: 192 GSHRHTRRLCPCA 204
           GSH HTRRLCPCA
Sbjct: 246 GSHHHTRRLCPCA 258


>gi|226499180|ref|NP_001143347.1| uncharacterized protein LOC100275960 precursor [Zea mays]
 gi|195618642|gb|ACG31151.1| hypothetical protein [Zea mays]
          Length = 276

 Score =  263 bits (671), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 125/193 (64%), Positives = 151/193 (78%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GC L  QCCNSYEYCVSCC NP++  KE VL +K+AKP TAGTY +VFDFC GRCRH+S 
Sbjct: 84  GCELDLQCCNSYEYCVSCCFNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY SDFHHCFS+  N+SG+  +    +L GI+V++GR  ESC  VCK  GQSCV
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVGRPRESCSLVCKVRGQSCV 203

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 191
            ++L +LN+CEI+QKYM CK GC  S+G DQPAEV D+AP++LNP ACLY ++   L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCD 263

Query: 192 GSHRHTRRLCPCA 204
           GSH HTRRLCPCA
Sbjct: 264 GSHHHTRRLCPCA 276


>gi|147779400|emb|CAN76795.1| hypothetical protein VITISV_030117 [Vitis vinifera]
          Length = 258

 Score =  258 bits (659), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 131/201 (65%), Positives = 149/201 (74%), Gaps = 28/201 (13%)

Query: 2   LPRERREILLGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDF 61
           L  + R I +GCNL+SQCCNSYE+CVSCC+NPART KE  L VK+AKP TAG Y SVFDF
Sbjct: 40  LDVDMRVIHIGCNLISQCCNSYEFCVSCCLNPARTQKELALKVKMAKPVTAGAYASVFDF 99

Query: 62  CAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAG---------------VTQL----- 101
           C GRCRHNSESVVHENAYLSDFHHCFS+PSN++G                 V QL     
Sbjct: 100 CTGRCRHNSESVVHENAYLSDFHHCFSLPSNSAGEAWSLIPNSWNGLFLKIVKQLMASMH 159

Query: 102 --------EGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRG 153
                   E RL+GI V++GRQGESCDSVCKS+GQSCV +KL +LNQCEI+QKYMSCK  
Sbjct: 160 AGGVDRNPELRLNGIHVVVGRQGESCDSVCKSNGQSCVPSKLSVLNQCEIMQKYMSCKGA 219

Query: 154 CLASMGADQPAEVADDAPSNL 174
           CLAS+GADQPAEV DDAP +L
Sbjct: 220 CLASIGADQPAEVVDDAPRHL 240


>gi|449484814|ref|XP_004156988.1| PREDICTED: uncharacterized LOC101231289 [Cucumis sativus]
          Length = 154

 Score =  246 bits (627), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 117/151 (77%), Positives = 131/151 (86%)

Query: 54  TYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIG 113
           TY SVFDFC GRCRHNSESVVHENAY S FHHCFS+PS +SG   TQLE RLSGI+VIIG
Sbjct: 4   TYASVFDFCVGRCRHNSESVVHENAYHSAFHHCFSLPSYSSGDNSTQLEFRLSGINVIIG 63

Query: 114 RQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSN 173
           RQGESCDSVCKS+GQSCV NK+++LN C+I+QKY+SCK GCLAS+G DQPAEVA DAP  
Sbjct: 64  RQGESCDSVCKSNGQSCVPNKILVLNHCDIMQKYLSCKGGCLASVGTDQPAEVAYDAPKE 123

Query: 174 LNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 204
           LNP ACLY+  QSM+SCDGSH HT+RLCPCA
Sbjct: 124 LNPGACLYSSAQSMVSCDGSHPHTKRLCPCA 154


>gi|7801689|emb|CAB91609.1| putative protein [Arabidopsis thaliana]
          Length = 273

 Score =  244 bits (622), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 119/163 (73%), Positives = 144/163 (88%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GCN+LSQCCNSYE+CVSCC+NP++TL E+V+ VK+AKP T+GTY SVFDFCAGRCRHNSE
Sbjct: 81  GCNILSQCCNSYEFCVSCCLNPSQTLLEKVVKVKVAKPATSGTYKSVFDFCAGRCRHNSE 140

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY S+FHHCFS+ SNASGA +TQ+E RL GI VI+G QG+SCD+VCKS GQ CV
Sbjct: 141 SVVHENAYHSEFHHCFSLTSNASGANLTQVETRLLGIDVIVGSQGDSCDAVCKSRGQLCV 200

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNL 174
           +NKL LLNQC+++++YM+CK  CLAS GADQPAEV +DAP +L
Sbjct: 201 MNKLSLLNQCDVMKRYMNCKGSCLASAGADQPAEVVEDAPRDL 243


>gi|108864223|gb|ABA92588.2| expressed protein [Oryza sativa Japonica Group]
          Length = 251

 Score =  218 bits (555), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 102/163 (62%), Positives = 131/163 (80%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GC L  QCC+SYEYCVSCC+NP+RT +  VL +K+A+P T+GTY +VFDFC GRCRH+S 
Sbjct: 84  GCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVARPVTSGTYRNVFDFCMGRCRHSSA 143

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY SDFHHCF +  N+SG+  +    RL GI++ IGR+GESC SVC++ GQSCV
Sbjct: 144 SVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGINISIGRRGESCSSVCRAKGQSCV 203

Query: 132 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNL 174
            ++L +LN+CEI+QKYM CK GC +++G DQPA+V D+APSNL
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFSNLGPDQPAQVVDEAPSNL 246


>gi|168051463|ref|XP_001778174.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670496|gb|EDQ57064.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 235

 Score =  216 bits (549), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 110/204 (53%), Positives = 135/204 (66%), Gaps = 2/204 (0%)

Query: 3   PRERREIL-LGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDF 61
           PRE  +    GCN+ SQCCNSYEYCVSCC+NP RT  E VL  K+A+  TAG++ S+FD+
Sbjct: 32  PREGEQFACQGCNIGSQCCNSYEYCVSCCLNPVRTTLEVVLKTKLARQPTAGSWTSLFDY 91

Query: 62  CAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGV-TQLEGRLSGISVIIGRQGESCD 120
           C+GRCRHNS+SVVHENAY+S+ HHCFS  S   G  V    E  L+ + V +GR+G SCD
Sbjct: 92  CSGRCRHNSQSVVHENAYVSEDHHCFSARSTPPGVQVEMDAEDELADVFVYVGREGLSCD 151

Query: 121 SVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACL 180
           + C +   SC   KL  LN C  +QKY  C   C+ S G DQPAEV   AP +++P ACL
Sbjct: 152 ATCSAQKLSCREEKLAALNNCAELQKYNKCHGLCIPSTGPDQPAEVVSTAPRHMHPGACL 211

Query: 181 YTRIQSMLSCDGSHRHTRRLCPCA 204
           Y    S LSC G H  +RRLCPCA
Sbjct: 212 YNTQVSQLSCKGYHLFSRRLCPCA 235


>gi|302788558|ref|XP_002976048.1| hypothetical protein SELMODRAFT_415982 [Selaginella moellendorffii]
 gi|300156324|gb|EFJ22953.1| hypothetical protein SELMODRAFT_415982 [Selaginella moellendorffii]
          Length = 263

 Score =  207 bits (528), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 97/194 (50%), Positives = 135/194 (69%), Gaps = 1/194 (0%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GCNL+SQCCN++EYCVSCC++P  T KE  L+ K A+  TA  + SVF++C GRCRHNS 
Sbjct: 70  GCNLVSQCCNTFEYCVSCCLDPIHTSKEFALSSKSARQLTAANFVSVFEYCTGRCRHNSG 129

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLE-GRLSGISVIIGRQGESCDSVCKSSGQSC 130
           SVVHENAY S++HHCF   +N S      +E G  +  S+I GR+G+SCD  C+  G+ C
Sbjct: 130 SVVHENAYTSEYHHCFFTQTNLSSGRDHGVEAGENTDYSLITGRRGQSCDEACRQRGKKC 189

Query: 131 VLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSC 190
             + L L+N+C ++QK M+CK  C++  G D P+EV   AP +L+P AC++    ++LSC
Sbjct: 190 KASILPLINECSVLQKSMNCKGACISGSGPDLPSEVVSSAPRHLHPGACVFNSKNTLLSC 249

Query: 191 DGSHRHTRRLCPCA 204
            G+HR+T+RLCPC 
Sbjct: 250 QGAHRYTKRLCPCG 263


>gi|302769810|ref|XP_002968324.1| hypothetical protein SELMODRAFT_409585 [Selaginella moellendorffii]
 gi|300163968|gb|EFJ30578.1| hypothetical protein SELMODRAFT_409585 [Selaginella moellendorffii]
          Length = 263

 Score =  207 bits (527), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 97/194 (50%), Positives = 135/194 (69%), Gaps = 1/194 (0%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GCNL+SQCCN++EYCVSCC++P  T KE  L+ K A+  TA  + SVF++C GRCRHNS 
Sbjct: 70  GCNLVSQCCNTFEYCVSCCLDPIHTSKEFALSSKSARQLTAANFVSVFEYCTGRCRHNSG 129

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLE-GRLSGISVIIGRQGESCDSVCKSSGQSC 130
           SVVHENAY S++HHCF   +N S      +E G  +  S+I GR+G+SCD  C+  G+ C
Sbjct: 130 SVVHENAYTSEYHHCFFTQTNLSSGRDHGVEAGENTDYSLITGRRGQSCDEACRQRGKKC 189

Query: 131 VLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSC 190
             + L L+N+C ++QK M+CK  C++  G D P+EV   AP +L+P AC++    ++LSC
Sbjct: 190 KASILPLINECSVLQKSMNCKGACVSGSGPDLPSEVVSSAPRHLHPGACVFNSKNTLLSC 249

Query: 191 DGSHRHTRRLCPCA 204
            G+HR+T+RLCPC 
Sbjct: 250 QGAHRYTKRLCPCG 263


>gi|255637592|gb|ACU19121.1| unknown [Glycine max]
          Length = 224

 Score =  201 bits (510), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 94/120 (78%), Positives = 106/120 (88%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GCNLLSQCCNSYEYCVSCC+NPA   KEQVL +K+AKPTTA TY SVFD+CAGRCRH+SE
Sbjct: 89  GCNLLSQCCNSYEYCVSCCLNPALISKEQVLKMKVAKPTTARTYSSVFDYCAGRCRHSSE 148

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 131
           SVVHENAY+SDFHHCFS+ SN+SG   T  E RLSGI+V++GRQGESC+SVCKS GQSCV
Sbjct: 149 SVVHENAYISDFHHCFSLSSNSSGKNSTLTEARLSGINVVVGRQGESCNSVCKSRGQSCV 208


>gi|62734222|gb|AAX96331.1| hypothetical protein LOC_Os11g16560 [Oryza sativa Japonica Group]
          Length = 216

 Score =  180 bits (456), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 84/139 (60%), Positives = 110/139 (79%)

Query: 36  TLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASG 95
           T +  VL +K+A+P T+GTY +VFDFC GRCRH+S SVVHENAY SDFHHCF +  N+SG
Sbjct: 73  TKETDVLKLKVARPVTSGTYRNVFDFCMGRCRHSSASVVHENAYASDFHHCFLLQQNSSG 132

Query: 96  AGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCL 155
           +  +    RL GI++ IGR+GESC SVC++ GQSCV ++L +LN+CEI+QKYM CK GC 
Sbjct: 133 SADSGSGPRLDGINISIGRRGESCSSVCRAKGQSCVPSRLSVLNKCEILQKYMRCKSGCF 192

Query: 156 ASMGADQPAEVADDAPSNL 174
           +++G DQPA+V D+APSNL
Sbjct: 193 SNLGPDQPAQVVDEAPSNL 211


>gi|413916128|gb|AFW56060.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
          Length = 186

 Score =  139 bits (350), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 66/102 (64%), Positives = 81/102 (79%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           GC L  QCCNSYEYCVSCC+NP++  KE VL +K+AKP TAGTY +VFDFC GRCRH+S 
Sbjct: 84  GCELDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143

Query: 72  SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIG 113
           SVVHENAY SDFHHCFS+  N+SG+  +    +L GI+V++G
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVG 185


>gi|384245502|gb|EIE18996.1| hypothetical protein COCSUDRAFT_31544 [Coccomyxa subellipsoidea
           C-169]
          Length = 278

 Score =  134 bits (337), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 71/193 (36%), Positives = 104/193 (53%), Gaps = 3/193 (1%)

Query: 13  CNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSES 72
           CN+   CC  YE+CVSCC +P   L +     +       G + SVF++C  +CR +S+S
Sbjct: 83  CNMHDSCCADYEHCVSCCQDPQHGLPDLSTVHRAPDRPETGLWPSVFEYCRNKCRTSSKS 142

Query: 73  VVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVL 132
            VHENAYLS FHHCFS  S                ++V++G  G+SC+++C+S+G+ C  
Sbjct: 143 TVHENAYLSPFHHCFSS-SGKPTTAAPPTPPIPKSVTVVLGSAGQSCNALCESTGRRCAQ 201

Query: 133 NKLILLNQCEIIQKYMSCKRGCLASMG-ADQPAEVADDAPSNLNPTACLYTRIQ-SMLSC 190
             L  LN C +++++ +C+ GC    G AD P  V   AP    PT CL    + +  SC
Sbjct: 202 QHLASLNNCNVLREHYACEAGCEPDSGTADAPIYVNASAPKQQRPTICLTMDSKVNQFSC 261

Query: 191 DGSHRHTRRLCPC 203
           + +     RLCPC
Sbjct: 262 EATSASNHRLCPC 274


>gi|159475585|ref|XP_001695899.1| predicted membrane-anchored protein [Chlamydomonas reinhardtii]
 gi|158275459|gb|EDP01236.1| predicted membrane-anchored protein [Chlamydomonas reinhardtii]
          Length = 269

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 68/202 (33%), Positives = 105/202 (51%), Gaps = 22/202 (10%)

Query: 13  CNLLSQCCNSYEYCVSCCVNP---ARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHN 69
           C++  +CC+ YE CVSCC+ P   A ++ +Q L  +  +   +G +G  F++C G CR +
Sbjct: 78  CDMRDRCCSEYESCVSCCLAPQHNAVSIAKQAL--RSPRHKDSGFWGDPFEYCKGICRTH 135

Query: 70  SESVVHENAYLSDFHHCFS-----MPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCK 124
           S S  HENAY+S  HHCFS     M S+   AGV      + G+ V+ G++  +CD VC 
Sbjct: 136 SRSTAHENAYISSRHHCFSQLGRPMLSDPLPAGV------MDGVEVVTGQRNANCDDVCA 189

Query: 125 SSGQSCVLNKLILLNQCEIIQKYMSCKRGC--LASMGADQPAEVADDAPSNLNPTACLYT 182
           +  + C  + L  L+ C+ +++   C+ GC  +A +G   P+ V  +AP    P  C   
Sbjct: 190 AKQKKCSADHLRWLSSCDRLREQFGCEAGCEVVAGLG---PSYVDGNAPKPARPAMCFAQ 246

Query: 183 RIQ-SMLSCDGSHRHTRRLCPC 203
             +   LSC         LCPC
Sbjct: 247 PAEGGKLSCSAREEQHLMLCPC 268


>gi|358345744|ref|XP_003636935.1| hypothetical protein MTR_065s0010 [Medicago truncatula]
 gi|355502870|gb|AES84073.1| hypothetical protein MTR_065s0010 [Medicago truncatula]
          Length = 131

 Score =  105 bits (263), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 51/75 (68%), Positives = 56/75 (74%)

Query: 129 SCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSML 188
           S VL   I L     IQKYMSCK GCLAS+G DQPAEV  DAP  +NP +CLYT  QS+L
Sbjct: 28  SYVLTVSIDLRYVGSIQKYMSCKGGCLASVGTDQPAEVVYDAPKYMNPESCLYTETQSIL 87

Query: 189 SCDGSHRHTRRLCPC 203
           SCDGSH+HTRRLCPC
Sbjct: 88  SCDGSHQHTRRLCPC 102


>gi|428177708|gb|EKX46586.1| hypothetical protein GUITHDRAFT_107373 [Guillardia theta CCMP2712]
          Length = 233

 Score =  102 bits (255), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 88/169 (52%), Gaps = 17/169 (10%)

Query: 5   ERREILLGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV------ 58
           E R     C   ++CC+ YE CVSCC+NP+    +++ N  +A      T+         
Sbjct: 64  EPRWSCRSCEAETKCCDHYENCVSCCLNPS---NDELRNTAMAASRDKKTFADAVKRKDS 120

Query: 59  FDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGES 118
           F+FC   CR +S+S VH NAY+S+   C S PS  S           + ++V +G+QGES
Sbjct: 121 FEFCRASCRTSSKSTVHGNAYISEKRFCLS-PSILSNT-------LPADVAVTVGKQGES 172

Query: 119 CDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVA 167
           C  VC S G+ C    +  +N CE++QK   CK  C  + G DQPA V+
Sbjct: 173 CAEVCSSVGKRCDTRYITSINTCEVLQKNFPCKNTCEKNYGHDQPAYVS 221


>gi|242070783|ref|XP_002450668.1| hypothetical protein SORBIDRAFT_05g009356 [Sorghum bicolor]
 gi|241936511|gb|EES09656.1| hypothetical protein SORBIDRAFT_05g009356 [Sorghum bicolor]
          Length = 111

 Score = 92.8 bits (229), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 41/59 (69%), Positives = 48/59 (81%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNS 70
           GC L  QCCNSYEYCVSCC+NP++  KE VL +K+AKP T GTY +VFDFC GRCRH+S
Sbjct: 53  GCKLDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTVGTYTNVFDFCMGRCRHSS 111


>gi|242051320|ref|XP_002463404.1| hypothetical protein SORBIDRAFT_02g043236 [Sorghum bicolor]
 gi|241926781|gb|EER99925.1| hypothetical protein SORBIDRAFT_02g043236 [Sorghum bicolor]
          Length = 82

 Score = 92.0 bits (227), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/68 (64%), Positives = 50/68 (73%)

Query: 3  PRERREILLGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFC 62
          P   R    GC L  QCCNSYEYCVSCC+NP++  KE VL +K AKP TAGTY +VFDFC
Sbjct: 15 PTLDRFSCRGCKLDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKEAKPVTAGTYTNVFDFC 74

Query: 63 AGRCRHNS 70
           GRCRH+S
Sbjct: 75 MGRCRHSS 82


>gi|307110379|gb|EFN58615.1| hypothetical protein CHLNCDRAFT_17323, partial [Chlorella
           variabilis]
          Length = 225

 Score = 92.0 bits (227), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 59/199 (29%), Positives = 93/199 (46%), Gaps = 16/199 (8%)

Query: 13  CNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIA--KPTTAGTYGSVFDFCAGRCRHNS 70
           C    +CC+ YE+CVSCC+ P     E++ ++     KP T G + + F++C   CR  +
Sbjct: 35  CEAKDKCCSRYEHCVSCCLRPENKPGEKMQSIFRGRNKPET-GHWSTPFEYCQAVCRTTA 93

Query: 71  ESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSC 130
            S  HENA++ D   CFS       A     E      S++ G  G+SCD  C +   +C
Sbjct: 94  RSTQHENAFILDRRFCFSKVGFVPPAPALPKEA-----SLVAGAAGQSCDVACAAKQMTC 148

Query: 131 VLNKLILLNQCEIIQ-KYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLY-----TRI 184
                + +N C  ++ K+M C  G  A+   + P  V   AP +  P  C +      ++
Sbjct: 149 RAEWFLSINDCNSLRAKFMCC--GSCAADQTEYPGYVEGSAPKHQQPAFCAFLPPLAAQL 206

Query: 185 QSMLSCDGSHRHTRRLCPC 203
               +C  ++   RRLCPC
Sbjct: 207 LPSFNCSHANGSVRRLCPC 225


>gi|255071235|ref|XP_002507699.1| glycosyltransferase family 13 protein [Micromonas sp. RCC299]
 gi|226522974|gb|ACO68957.1| glycosyltransferase family 13 protein [Micromonas sp. RCC299]
          Length = 636

 Score = 66.2 bits (160), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 39/125 (31%), Positives = 59/125 (47%), Gaps = 13/125 (10%)

Query: 80  LSDFHHCFSMPSNASGAGVTQLEGRLS-GISVIIGRQGESCDSVCKSSGQSCVLNKLILL 138
           L+D   C  +PS+        L  R S G++ +   QG++CD+ C+ +   C       L
Sbjct: 520 LADERSCPYLPSH--------LRVRPSRGLTPVAAVQGQTCDAACRYNHMKCHAPDFWFL 571

Query: 139 NQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTR 198
           N CE++ K+  C+ GC   +G D P  V  +         CL +  Q+  +C  SHR T 
Sbjct: 572 NSCEVLAKFFPCEAGCALVLGDDIPNYVVSNRMDTYQ--KCLVSERQA--TCAASHRATT 627

Query: 199 RLCPC 203
           RLC C
Sbjct: 628 RLCAC 632


>gi|195393082|ref|XP_002055183.1| GJ19227 [Drosophila virilis]
 gi|194149693|gb|EDW65384.1| GJ19227 [Drosophila virilis]
          Length = 215

 Score = 63.5 bits (153), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 37/74 (50%), Positives = 45/74 (60%), Gaps = 7/74 (9%)

Query: 13  CNLLSQCCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRH 68
           CN  S CC  YEYCVSCC++P  R+L EQVL    A+ T    Y SV   F+ C  +CR 
Sbjct: 120 CNSTSHCCGVYEYCVSCCLHPNKRSLLEQVLQ---AQDTQKYIYASVVDHFELCLVKCRT 176

Query: 69  NSESVVHENAYLSD 82
           NS SV HEN Y ++
Sbjct: 177 NSHSVEHENKYRNE 190


>gi|195130989|ref|XP_002009933.1| GI15640 [Drosophila mojavensis]
 gi|193908383|gb|EDW07250.1| GI15640 [Drosophila mojavensis]
          Length = 216

 Score = 63.2 bits (152), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 37/74 (50%), Positives = 44/74 (59%), Gaps = 7/74 (9%)

Query: 13  CNLLSQCCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRH 68
           CN  S CC  YEYCVSCC++P  R+L EQVL    A  T    Y SV   F+ C  +CR 
Sbjct: 121 CNTTSHCCGVYEYCVSCCLHPNKRSLLEQVLQ---AHNTQKYIYASVVDHFELCLVKCRT 177

Query: 69  NSESVVHENAYLSD 82
           NS SV HEN Y ++
Sbjct: 178 NSHSVEHENKYRNE 191


>gi|308805236|ref|XP_003079930.1| unnamed protein product [Ostreococcus tauri]
 gi|116058387|emb|CAL53576.1| unnamed protein product [Ostreococcus tauri]
          Length = 189

 Score = 61.6 bits (148), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/84 (40%), Positives = 42/84 (50%), Gaps = 4/84 (4%)

Query: 13  CNLLSQCCNSYEYCVSCCVNPAR--TLK-EQVLNVKIAKPTTAGTYGSVFDFCAGRCRHN 69
           C+   +CC  YE CVSCC+ PAR  TL+ E   + +       G +   FDFCA RCR  
Sbjct: 84  CDAAVECCERYETCVSCCLAPARRATLETEMRTHARGRNQVVTGFFDDPFDFCANRCRTQ 143

Query: 70  SESVVHENAYLSDFHHCF-SMPSN 92
               +HEN Y     HCF   P N
Sbjct: 144 PSVTLHENEYAYSTKHCFGDYPKN 167


>gi|328719433|ref|XP_003246762.1| PREDICTED: UPF0454 protein C12orf49 homolog [Acyrthosiphon pisum]
          Length = 193

 Score = 60.1 bits (144), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 50/88 (56%), Gaps = 7/88 (7%)

Query: 5   ERREILLGCNLLSQCCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FD 60
           E+R I   CN    CC  YE+CVSCC+NP  R L E VL    A  T    + ++   F+
Sbjct: 103 EKRYICDTCNA-DGCCVLYEHCVSCCLNPNKRKLLETVLGK--ASETFRVLFSAINNHFE 159

Query: 61  FCAGRCRHNSESVVHENAYLSDFHHCFS 88
            C  +CR +S+SV HENAYL    HC++
Sbjct: 160 LCLAKCRTSSQSVQHENAYLKSNKHCYT 187


>gi|195044419|ref|XP_001991819.1| GH11862 [Drosophila grimshawi]
 gi|193901577|gb|EDW00444.1| GH11862 [Drosophila grimshawi]
          Length = 214

 Score = 59.7 bits (143), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 48/81 (59%), Gaps = 8/81 (9%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPAR-TLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCR 67
            CN+ S CC  YE+C+SCC++P + +L EQVL    A  T    Y SV   F+ C  +CR
Sbjct: 120 SCNVTSHCCAIYEHCISCCLHPNKQSLLEQVLQ---ASDTQKYIYASVVDHFELCLVKCR 176

Query: 68  HNSESVVHENAYLS-DFHHCF 87
            NS SV HEN Y + D  +C+
Sbjct: 177 TNSHSVEHENKYRNEDAKYCY 197


>gi|195432368|ref|XP_002064195.1| GK19833 [Drosophila willistoni]
 gi|194160280|gb|EDW75181.1| GK19833 [Drosophila willistoni]
          Length = 215

 Score = 58.2 bits (139), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 13  CNLLSQCCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           CN  S CC  YEYCVSCC++P  R L E +L    A+          F+ C  +CR NS 
Sbjct: 120 CNQTSMCCEIYEYCVSCCLHPDKRPLLETILETVNAQRYIYAQVEDHFELCLVKCRTNSH 179

Query: 72  SVVHENAY 79
           SV HEN Y
Sbjct: 180 SVEHENKY 187


>gi|194768977|ref|XP_001966587.1| GF22253 [Drosophila ananassae]
 gi|190617351|gb|EDV32875.1| GF22253 [Drosophila ananassae]
          Length = 217

 Score = 58.2 bits (139), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 41/80 (51%), Gaps = 2/80 (2%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNS 70
            CN  + CC  YEYCVSCC++P +  L EQVL    +           F+ C  +CR NS
Sbjct: 121 SCNTTTHCCAIYEYCVSCCLHPEKQPLLEQVLRSANSPKYIFTKVADHFELCLVKCRTNS 180

Query: 71  ESVVHENAYLS-DFHHCFSM 89
            SV HEN Y      HC+ +
Sbjct: 181 HSVQHENQYRDPSAKHCYGL 200


>gi|195355329|ref|XP_002044144.1| GM22548 [Drosophila sechellia]
 gi|194129433|gb|EDW51476.1| GM22548 [Drosophila sechellia]
          Length = 197

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 41/79 (51%), Gaps = 2/79 (2%)

Query: 13  CNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           CN  + CC  YEYCVSCC++P +  L E+VL           +    F+ C  +CR NS 
Sbjct: 107 CNTSTHCCGVYEYCVSCCLHPGQQPLLERVLQAPNTPKYIFASVTDHFELCLVKCRTNSH 166

Query: 72  SVVHENAYLSD-FHHCFSM 89
           SV HEN Y      HCF +
Sbjct: 167 SVEHENKYRDPAAKHCFGL 185


>gi|452823888|gb|EME30895.1| peptidyl-prolyl cis-trans isomerase [Galdieria sulphuraria]
          Length = 360

 Score = 57.8 bits (138), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 49/98 (50%), Gaps = 3/98 (3%)

Query: 110 VIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASM---GADQPAEV 166
           +I  ++ ++C   C+     C      +LN C  + +Y  C+   +A+M   G+D P  V
Sbjct: 248 IIPAKRKQNCYRACQLKNLVCFKKGFSILNNCPTLSEYFDCRECEIATMESAGSDMPCRV 307

Query: 167 ADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 204
           +  APS  +   C+ +   S+ SC+ SH  T RLCPC 
Sbjct: 308 SLKAPSTYSSGVCMISPNLSLTSCEASHSQTERLCPCV 345


>gi|198471721|ref|XP_001355702.2| GA13867 [Drosophila pseudoobscura pseudoobscura]
 gi|198146047|gb|EAL32761.2| GA13867 [Drosophila pseudoobscura pseudoobscura]
          Length = 216

 Score = 57.4 bits (137), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 41/79 (51%), Gaps = 2/79 (2%)

Query: 13  CNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           CN  + CC  YEYCVSCC++P +  L E+VL           +    F+ C  +CR NS 
Sbjct: 121 CNATTSCCRLYEYCVSCCLHPDKQPLLERVLQSAHTPKYIYASVADHFELCLVKCRTNSH 180

Query: 72  SVVHENAYLS-DFHHCFSM 89
           SV HEN Y      HC+ +
Sbjct: 181 SVEHENKYRDGKAKHCYGI 199


>gi|28571353|ref|NP_788909.1| CG15643 [Drosophila melanogaster]
 gi|7293087|gb|AAF48472.1| CG15643 [Drosophila melanogaster]
 gi|17944302|gb|AAL48044.1| RE11751p [Drosophila melanogaster]
 gi|220947868|gb|ACL86477.1| CG15643-PA [synthetic construct]
 gi|220957174|gb|ACL91130.1| CG15643-PA [synthetic construct]
          Length = 212

 Score = 57.0 bits (136), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 41/79 (51%), Gaps = 2/79 (2%)

Query: 13  CNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           CN  + CC  YEYCVSCC++P +  L E+VL           +    F+ C  +CR NS 
Sbjct: 121 CNTSTHCCGVYEYCVSCCLHPGQQPLLERVLQAPNTPKYIFASVTDHFELCLVKCRTNSH 180

Query: 72  SVVHENAYLSD-FHHCFSM 89
           SV HEN Y      HC+ +
Sbjct: 181 SVEHENKYRDPAAKHCYGL 199


>gi|195478878|ref|XP_002100681.1| GE16041 [Drosophila yakuba]
 gi|194188205|gb|EDX01789.1| GE16041 [Drosophila yakuba]
          Length = 217

 Score = 57.0 bits (136), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 8/82 (9%)

Query: 13  CNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRH 68
           CN  + CC  YEYCVSCC++P +  L E+VL   +A  T    + SV   F+ C  +CR 
Sbjct: 121 CNTSTHCCGVYEYCVSCCLHPGQQPLLERVL---LAPNTPKYIFASVTDHFELCLVKCRT 177

Query: 69  NSESVVHENAYLSD-FHHCFSM 89
           NS SV HEN Y      HC+ +
Sbjct: 178 NSHSVEHENKYRDPAAKHCYGL 199


>gi|156380036|ref|XP_001631761.1| predicted protein [Nematostella vectensis]
 gi|156218806|gb|EDO39698.1| predicted protein [Nematostella vectensis]
          Length = 135

 Score = 57.0 bits (136), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 47/79 (59%), Gaps = 3/79 (3%)

Query: 16  LSQCCNSYEYCVSCCVNPA-RTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNSESV 73
           ++ CC+ YE CVSCC++P  + L  +VLNV    P     +    F+ C  +CR +S+SV
Sbjct: 48  VNNCCSIYENCVSCCLDPKQKELLREVLNVWRTAPNVILKSITDQFELCLTKCRTSSKSV 107

Query: 74  VHENAYLSD-FHHCFSMPS 91
            HEN+Y  + + HCF + S
Sbjct: 108 WHENSYKDNKYKHCFGLTS 126


>gi|195164215|ref|XP_002022944.1| GL16447 [Drosophila persimilis]
 gi|194105006|gb|EDW27049.1| GL16447 [Drosophila persimilis]
          Length = 216

 Score = 57.0 bits (136), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 13  CNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
           CN  + CC  YEYCVSCC++P +  L E VL           +    F+ C  +CR NS 
Sbjct: 121 CNATTNCCRLYEYCVSCCLHPDKQPLLELVLQSAYTPKYIYASVADHFELCLVKCRTNSH 180

Query: 72  SVVHENAYLS-DFHHCFSM 89
           SV HEN Y      HC+ +
Sbjct: 181 SVEHENKYRDGKAKHCYGI 199


>gi|291000376|ref|XP_002682755.1| N-acetylglucosaminylaminotransferase [Naegleria gruberi]
 gi|284096383|gb|EFC50011.1| N-acetylglucosaminylaminotransferase [Naegleria gruberi]
          Length = 617

 Score = 57.0 bits (136), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 46/154 (29%), Positives = 71/154 (46%), Gaps = 27/154 (17%)

Query: 56  GSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQ 115
           GS   F A  C    ++  H  A +S FH      SN S      L  + S I+  I +Q
Sbjct: 453 GSYSPFYAETC---GKAGSHCPAPISKFHK----DSNWS------LPKKDSIITFAISKQ 499

Query: 116 GESCDSVCKSS-GQSCVLNKLILLNQCEIIQKYM--SCKRGCLASMGADQPAEVADDAPS 172
           G+SC S CK      C       +N C+ ++K++   C   C+ S G DQPA        
Sbjct: 500 GQSCSSYCKQGFNMECYRFSFEEMNSCKFLRKFVGQECTE-CIPSSGPDQPA-------I 551

Query: 173 NLNPTACLYT---RIQSMLSCDGSHRHTRRLCPC 203
           +++   CL+    R + + +C+  H +T+R+CPC
Sbjct: 552 DVDSKQCLFNGGGRGKFISTCEAYHVNTKRICPC 585


>gi|347964633|ref|XP_316825.5| AGAP000855-PA [Anopheles gambiae str. PEST]
 gi|333469436|gb|EAA12112.5| AGAP000855-PA [Anopheles gambiae str. PEST]
          Length = 243

 Score = 56.6 bits (135), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 35/90 (38%), Positives = 50/90 (55%), Gaps = 7/90 (7%)

Query: 15  LLSQCCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNS 70
           L ++CC  YEYCV+CC+NP  R + E+VL    A       Y  V   F+ C  +CR NS
Sbjct: 144 LPNRCCAVYEYCVACCLNPDKRPILEEVL--AKANGRQVALYAEVTDQFELCLTKCRTNS 201

Query: 71  ESVVHENAYLS-DFHHCFSMPSNASGAGVT 99
           +SV +EN Y + +  HC+   S A+ A  +
Sbjct: 202 QSVQNENKYRNPEQKHCYGEMSEAAWASAS 231


>gi|194894481|ref|XP_001978075.1| GG19394 [Drosophila erecta]
 gi|190649724|gb|EDV47002.1| GG19394 [Drosophila erecta]
          Length = 217

 Score = 56.6 bits (135), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 8/82 (9%)

Query: 13  CNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRH 68
           CN  + CC  YEYCVSCC++P +  L E+VL   +A  T    + SV   F+ C  +CR 
Sbjct: 121 CNTSTHCCAVYEYCVSCCLHPGQQPLLERVL---LAPNTPKYIFASVADHFELCLVKCRT 177

Query: 69  NSESVVHENAYLSD-FHHCFSM 89
           NS SV HEN Y      HC+ +
Sbjct: 178 NSHSVEHENKYRDPAAKHCYGL 199


>gi|405975633|gb|EKC40187.1| UPF0454 protein C12orf49-like protein [Crassostrea gigas]
          Length = 185

 Score = 56.2 bits (134), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 42/72 (58%), Gaps = 3/72 (4%)

Query: 19  CCNSYEYCVSCCVNP-ARTLKEQVL-NVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC  YEYCV+CC+ P  R L +++L N +       G+    F+ C  +CR +S SV HE
Sbjct: 102 CCVVYEYCVACCLKPDKRPLLQKILTNAREGLDIALGSVSDHFELCLAKCRTSSSSVQHE 161

Query: 77  NAYLSDF-HHCF 87
           N+Y + F  HCF
Sbjct: 162 NSYRNAFAKHCF 173


>gi|391333374|ref|XP_003741090.1| PREDICTED: UPF0454 protein C12orf49 homolog [Metaseiulus
           occidentalis]
          Length = 193

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 31/74 (41%), Positives = 42/74 (56%), Gaps = 5/74 (6%)

Query: 19  CCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVV 74
           CC+SYE+C+SCC+ P  R   E+VL      P +   Y  V   FD C  +CR +S+SV 
Sbjct: 116 CCSSYEFCISCCLRPQQRPTLERVLGRLGRSPKSLKLYALVQTQFDLCLTKCRTSSQSVR 175

Query: 75  HENAYLSDF-HHCF 87
           +EN Y+     HCF
Sbjct: 176 NENTYIDPVKKHCF 189


>gi|328869989|gb|EGG18364.1| hypothetical protein DFA_03858 [Dictyostelium fasciculatum]
          Length = 1057

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 38/77 (49%), Gaps = 1/77 (1%)

Query: 12   GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 71
            GC+   +CC  Y YCVSCC+   +   E VL   +   +  G     FD C   CR +S 
Sbjct: 968  GCSSKYRCCEEYTYCVSCCLGARQETLEAVLVQMVEGVSPIGEPHDQFDLCQLVCRTSSH 1027

Query: 72   SVVHENAYL-SDFHHCF 87
            SV+H+  Y   D  +C+
Sbjct: 1028 SVIHQREYKHPDTKYCY 1044


>gi|357603409|gb|EHJ63752.1| hypothetical protein KGM_18059 [Danaus plexippus]
          Length = 198

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 48/80 (60%), Gaps = 3/80 (3%)

Query: 17  SQCCNSYEYCVSCCVNPA-RTLKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVV 74
           + CC  YEYCVSCC++P+ R+L E VL+ +   +     +    ++ C  +CR +S SV+
Sbjct: 115 NNCCVIYEYCVSCCLDPSKRSLLELVLSKLSAEEKVLFRSISDDYELCLTKCRTSSHSVL 174

Query: 75  HENAYLSDFH-HCFSMPSNA 93
           HEN+Y    H HCF   +N+
Sbjct: 175 HENSYKDPSHKHCFGDRTNS 194


>gi|242012077|ref|XP_002426767.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212510949|gb|EEB14029.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 203

 Score = 54.7 bits (130), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 40/64 (62%), Gaps = 6/64 (9%)

Query: 19  CCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVH 75
           CC+ YEYC+SCC++P    K+ VL   + K +    + SV   F+ C  +CR NS+SV H
Sbjct: 124 CCSIYEYCISCCLHPD---KKGVLQNVLGKESENVLFASVSDHFELCLAKCRTNSQSVHH 180

Query: 76  ENAY 79
           EN+Y
Sbjct: 181 ENSY 184


>gi|124486989|ref|NP_001074705.1| UPF0454 protein C12orf49 homolog precursor [Mus musculus]
 gi|81897450|sp|Q8BTG6.1|CL049_MOUSE RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
 gi|26355654|dbj|BAC41192.1| unnamed protein product [Mus musculus]
 gi|74355387|gb|AAI04374.1| RIKEN cDNA 2410131K14 gene [Mus musculus]
 gi|74355856|gb|AAI04373.1| RIKEN cDNA 2410131K14 gene [Mus musculus]
 gi|148687847|gb|EDL19794.1| mCG15258 [Mus musculus]
          Length = 205

 Score = 52.8 bits (125), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 30/70 (42%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
           GC L + CC +YEYCVSCC+ P++  L E+ LN   +A           F+ C  +CR +
Sbjct: 118 GC-LANGCCEAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 176

Query: 70  SESVVHENAY 79
           S+SV HEN Y
Sbjct: 177 SQSVQHENTY 186


>gi|115496644|ref|NP_001069491.1| UPF0454 protein C12orf49 homolog precursor [Bos taurus]
 gi|122133614|sp|Q17QN8.1|CL049_BOVIN RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
 gi|109659176|gb|AAI18255.1| Chromosome 12 open reading frame 49 ortholog [Bos taurus]
 gi|296478528|tpg|DAA20643.1| TPA: hypothetical protein LOC534423 precursor [Bos taurus]
          Length = 205

 Score = 52.8 bits (125), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 31/70 (44%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
           GC L S CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +
Sbjct: 118 GC-LSSGCCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 176

Query: 70  SESVVHENAY 79
           S+SV HEN Y
Sbjct: 177 SQSVQHENTY 186


>gi|157818285|ref|NP_001102536.1| uncharacterized protein LOC498188 [Rattus norvegicus]
 gi|149063483|gb|EDM13806.1| similar to hypothetical protein FLJ21415 (predicted) [Rattus
           norvegicus]
          Length = 205

 Score = 52.8 bits (125), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 30/70 (42%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
           GC L + CC +YEYCVSCC+ P++  L E+ LN   +A           F+ C  +CR +
Sbjct: 118 GC-LANGCCEAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 176

Query: 70  SESVVHENAY 79
           S+SV HEN Y
Sbjct: 177 SQSVQHENTY 186


>gi|440912218|gb|ELR61808.1| hypothetical protein M91_20260, partial [Bos grunniens mutus]
          Length = 201

 Score = 52.8 bits (125), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 31/70 (44%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
           GC L S CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +
Sbjct: 114 GC-LSSGCCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 172

Query: 70  SESVVHENAY 79
           S+SV HEN Y
Sbjct: 173 SQSVQHENTY 182


>gi|351694757|gb|EHA97675.1| hypothetical protein GW7_10171 [Heterocephalus glaber]
          Length = 273

 Score = 52.8 bits (125), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 30/70 (42%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
           GC L + CC +YEYCVSCC+ P++  L E+ LN   +A           F+ C  +CR +
Sbjct: 186 GC-LANGCCGAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFLAVEDHFELCLAKCRTS 244

Query: 70  SESVVHENAY 79
           S+SV HEN Y
Sbjct: 245 SQSVQHENTY 254


>gi|348584144|ref|XP_003477832.1| PREDICTED: UPF0454 protein C12orf49 homolog [Cavia porcellus]
          Length = 205

 Score = 52.8 bits (125), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 30/70 (42%), Positives = 42/70 (60%), Gaps = 3/70 (4%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
           GC L + CC++YEYCVSCC+ P++  L E+ LN   +A           F+ C  +CR +
Sbjct: 118 GC-LANGCCSAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFLAVEDHFELCLAKCRTS 176

Query: 70  SESVVHENAY 79
           S+SV HEN Y
Sbjct: 177 SQSVQHENTY 186


>gi|311270685|ref|XP_003132942.1| PREDICTED: UPF0454 protein C12orf49 homolog [Sus scrofa]
          Length = 205

 Score = 52.8 bits (125), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 38/63 (60%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYCVSCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183

Query: 77  NAY 79
           N Y
Sbjct: 184 NTY 186


>gi|426247320|ref|XP_004017434.1| PREDICTED: UPF0454 protein C12orf49 homolog [Ovis aries]
          Length = 205

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/70 (44%), Positives = 40/70 (57%), Gaps = 3/70 (4%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
           GC L S CC +YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +
Sbjct: 118 GC-LSSGCCGAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 176

Query: 70  SESVVHENAY 79
           S+SV HEN Y
Sbjct: 177 SQSVQHENTY 186


>gi|307207910|gb|EFN85471.1| UPF0454 protein C12orf49-like protein [Harpegnathos saltator]
          Length = 276

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 46/85 (54%), Gaps = 10/85 (11%)

Query: 19  CCNSYEYCVSCCVNPARTLK--EQVL--NVKIAKPTTAGTYG----SVFDFCAGRCRHNS 70
           CC  YEYCVSCC++P + +K  + V+  +VK  K   A          F  C   CR +S
Sbjct: 125 CCTIYEYCVSCCLHPGKQIKGRKDVVSGSVKAQKEEDAVRTRIRNLDRFQICLAACRTSS 184

Query: 71  ESVVHENAYLSDF-HHCFSM-PSNA 93
            SV HEN Y   F  HC++M PSN+
Sbjct: 185 ASVRHENTYKDPFAKHCYAMQPSNS 209


>gi|148224108|ref|NP_001085845.1| UPF0454 protein C12orf49 homolog precursor [Xenopus laevis]
 gi|82184289|sp|Q6GNT2.1|CL049_XENLA RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
 gi|49115513|gb|AAH73420.1| MGC80896 protein [Xenopus laevis]
          Length = 205

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 3/72 (4%)

Query: 19  CCNSYEYCVSCCVNPARTLK-EQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC+ YE+CVSCC+ P + L+ E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 124 CCSVYEFCVSCCLQPNKQLQLERFLNKAAVAFQNLFQAVEDHFELCLAKCRTSSQSVQHE 183

Query: 77  NAYLSDF-HHCF 87
           N Y +    HC+
Sbjct: 184 NTYRNPIAKHCY 195


>gi|301770727|ref|XP_002920782.1| PREDICTED: UPF0454 protein C12orf49 homolog [Ailuropoda
           melanoleuca]
 gi|281343735|gb|EFB19319.1| hypothetical protein PANDA_009565 [Ailuropoda melanoleuca]
          Length = 205

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC +YEYCVSCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 124 CCGAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183

Query: 77  NAY 79
           N Y
Sbjct: 184 NTY 186


>gi|57099849|ref|XP_540555.1| PREDICTED: UPF0454 protein C12orf49 homolog [Canis lupus
           familiaris]
 gi|355735600|gb|AES11718.1| hypothetical protein [Mustela putorius furo]
          Length = 205

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC +YEYCVSCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 124 CCGAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183

Query: 77  NAY 79
           N Y
Sbjct: 184 NTY 186


>gi|410976770|ref|XP_003994786.1| PREDICTED: UPF0454 protein C12orf49 homolog [Felis catus]
          Length = 171

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/70 (42%), Positives = 42/70 (60%), Gaps = 3/70 (4%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
           GC L + CC++YEYCVSCC+ P++  L E+ LN   +A           F+ C  +CR +
Sbjct: 84  GC-LPNGCCSAYEYCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 142

Query: 70  SESVVHENAY 79
           S+SV HEN Y
Sbjct: 143 SQSVQHENTY 152


>gi|194214270|ref|XP_001489615.2| PREDICTED: UPF0454 protein C12orf49 homolog, partial [Equus
           caballus]
          Length = 178

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 46/80 (57%), Gaps = 3/80 (3%)

Query: 2   LPRERREILLGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVF 59
           +P  R+    GC L S CC++YE+CVSCC+ P++  L E+ LN   +A           F
Sbjct: 81  VPSTRQYSCDGC-LSSGCCSAYEHCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHF 139

Query: 60  DFCAGRCRHNSESVVHENAY 79
           + C  +CR +S+SV HEN Y
Sbjct: 140 ELCLAKCRTSSQSVQHENTY 159


>gi|334327393|ref|XP_003340895.1| PREDICTED: UPF0454 protein C12orf49 homolog [Monodelphis domestica]
          Length = 174

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 7/78 (8%)

Query: 6   RREILLGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDF 61
           RR    GC L + CC +YEYCVSCC+ P++  L E+ LN   A       + +V   F+ 
Sbjct: 81  RRFWCDGC-LANGCCGAYEYCVSCCLQPSKQLLLERFLNQAAA--AFQNLFMAVEDHFEL 137

Query: 62  CAGRCRHNSESVVHENAY 79
           C  +CR +S+SV HEN Y
Sbjct: 138 CLAKCRTSSQSVQHENTY 155


>gi|289742927|gb|ADD20211.1| uncharacterized conserved protein [Glossina morsitans morsitans]
          Length = 217

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/69 (44%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 13  CNLLSQCCNSYEYCVSCCVNP-ARTLKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNS 70
           CN  S CC+ YE+CVSCC++P  R L E  L  V   + T        F+ C  +CR NS
Sbjct: 123 CNNSSHCCSIYEFCVSCCLHPEKRPLLELALQAVNGRQLTVFAQVLDQFELCLVKCRTNS 182

Query: 71  ESVVHENAY 79
            SV HEN Y
Sbjct: 183 HSVEHENRY 191


>gi|157167582|ref|XP_001655053.1| hypothetical protein AaeL_AAEL010931 [Aedes aegypti]
 gi|157167584|ref|XP_001655054.1| hypothetical protein AaeL_AAEL010931 [Aedes aegypti]
 gi|157167586|ref|XP_001655055.1| hypothetical protein AaeL_AAEL010931 [Aedes aegypti]
 gi|94469020|gb|ABF18359.1| uncharacterized conserved protein [Aedes aegypti]
 gi|108872807|gb|EAT37032.1| AAEL010931-PA [Aedes aegypti]
 gi|108872808|gb|EAT37033.1| AAEL010931-PB [Aedes aegypti]
 gi|108872809|gb|EAT37034.1| AAEL010931-PC [Aedes aegypti]
          Length = 228

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 39/74 (52%), Gaps = 3/74 (4%)

Query: 17  SQCCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNSESVV 74
           S CC  YEYCVSCC++P  R L E+VL     +      T    ++ C  +CR NS SV 
Sbjct: 122 SNCCAIYEYCVSCCMHPNKRPLLERVLAKATGRQIAVYATVNDQYELCLTKCRTNSHSVQ 181

Query: 75  HENAYLS-DFHHCF 87
           +EN Y      HC+
Sbjct: 182 NENKYRDPKLKHCY 195


>gi|62897527|dbj|BAD96703.1| hypothetical protein FLJ21415 variant [Homo sapiens]
          Length = 205

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPARTLK-EQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYCVSCC+ P + L  E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPNKQLPLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183

Query: 77  NAY 79
           N Y
Sbjct: 184 NTY 186


>gi|403281581|ref|XP_003932260.1| PREDICTED: UPF0454 protein C12orf49 homolog [Saimiri boliviensis
           boliviensis]
          Length = 205

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183

Query: 77  NAY 79
           N Y
Sbjct: 184 NTY 186


>gi|109098867|ref|XP_001112213.1| PREDICTED: UPF0454 protein C12orf49-like isoform 2 [Macaca mulatta]
          Length = 272

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183

Query: 77  NAY 79
           N Y
Sbjct: 184 NTY 186


>gi|62858887|ref|NP_001016016.1| uncharacterized protein LOC548770 [Xenopus (Silurana) tropicalis]
 gi|183985985|gb|AAI66293.1| hypothetical protein LOC548770 [Xenopus (Silurana) tropicalis]
 gi|213627063|gb|AAI70669.1| hypothetical protein LOC548770 [Xenopus (Silurana) tropicalis]
 gi|213627065|gb|AAI70671.1| hypothetical protein LOC548770 [Xenopus (Silurana) tropicalis]
          Length = 204

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 36/63 (57%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC+ YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 123 CCSVYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFQAVEDHFELCLAKCRTSSQSVQHE 182

Query: 77  NAY 79
           N Y
Sbjct: 183 NTY 185


>gi|296213028|ref|XP_002753098.1| PREDICTED: UPF0454 protein C12orf49-like isoform 1 [Callithrix
           jacchus]
          Length = 205

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183

Query: 77  NAY 79
           N Y
Sbjct: 184 NTY 186


>gi|440791825|gb|ELR13063.1| hypothetical protein ACA1_097480 [Acanthamoeba castellanii str.
           Neff]
          Length = 264

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 49/83 (59%), Gaps = 4/83 (4%)

Query: 13  CNLLSQCCNSYEYCVSCCVNPA---RTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHN 69
           C++ + CC  YE+CVSCC+NP+   R      LN++ ++        S+F+ C   CR +
Sbjct: 182 CDIKTTCCLEYEFCVSCCLNPSQEQRVASWLQLNIEDSREREEMAPLSLFEQCLALCRTS 241

Query: 70  SESVVHENAYLSDFHHCFS-MPS 91
           S S++ ++ ++S  H C++ +PS
Sbjct: 242 SASILRQHIFISPSHFCYAGVPS 264


>gi|444723218|gb|ELW63879.1| hypothetical protein TREES_T100018686 [Tupaia chinensis]
          Length = 205

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/70 (42%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
           GC L S CC +YE+CVSCC+ P++  L E+ LN   +A           F+ C  +CR +
Sbjct: 118 GC-LPSGCCGAYEHCVSCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 176

Query: 70  SESVVHENAY 79
           S+SV HEN Y
Sbjct: 177 SQSVQHENTY 186


>gi|13376060|ref|NP_079014.1| UPF0454 protein C12orf49 precursor [Homo sapiens]
 gi|109098869|ref|XP_001112172.1| PREDICTED: UPF0454 protein C12orf49-like isoform 1 [Macaca mulatta]
 gi|297693075|ref|XP_002823850.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Pongo
           abelii]
 gi|402887789|ref|XP_003907263.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Papio
           anubis]
 gi|74733622|sp|Q9H741.1|CL049_HUMAN RecName: Full=UPF0454 protein C12orf49; Flags: Precursor
 gi|10437508|dbj|BAB15058.1| unnamed protein product [Homo sapiens]
 gi|18044293|gb|AAH19843.1| Chromosome 12 open reading frame 49 [Homo sapiens]
 gi|119618494|gb|EAW98088.1| chromosome 12 open reading frame 49, isoform CRA_a [Homo sapiens]
 gi|119618495|gb|EAW98089.1| chromosome 12 open reading frame 49, isoform CRA_a [Homo sapiens]
 gi|380816602|gb|AFE80175.1| hypothetical protein LOC79794 precursor [Macaca mulatta]
 gi|383413575|gb|AFH30001.1| hypothetical protein LOC79794 precursor [Macaca mulatta]
 gi|384949484|gb|AFI38347.1| hypothetical protein LOC79794 precursor [Macaca mulatta]
          Length = 205

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183

Query: 77  NAY 79
           N Y
Sbjct: 184 NTY 186


>gi|56119050|ref|NP_001007842.1| UPF0454 protein C12orf49 homolog precursor [Gallus gallus]
 gi|82081584|sp|Q5ZJE4.1|CL049_CHICK RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
 gi|53133640|emb|CAG32149.1| hypothetical protein RCJMB04_18o22 [Gallus gallus]
          Length = 205

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC+ YEYCVSCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 124 CCSVYEYCVSCCLQPSKQHLLERFLNRAAMAFQNLFMAVEDRFELCLAKCRTSSQSVQHE 183

Query: 77  NAY 79
           N Y
Sbjct: 184 NTY 186


>gi|432872487|ref|XP_004072113.1| PREDICTED: UPF0454 protein C12orf49 homolog [Oryzias latipes]
          Length = 201

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 45/81 (55%), Gaps = 7/81 (8%)

Query: 3   PRERREILLGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV--- 58
           P  R+ I   C L + CC+ YEYCVSCC+ P +  L E+ LN   A       + +V   
Sbjct: 105 PNSRQHICKSC-LANGCCSIYEYCVSCCLQPDKQLLLERFLNR--AAEGFQNLFTAVEDH 161

Query: 59  FDFCAGRCRHNSESVVHENAY 79
           F+ C  +CR +S+SV HEN Y
Sbjct: 162 FELCLAKCRTSSQSVQHENTY 182


>gi|114647164|ref|XP_001154941.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Pan
           troglodytes]
 gi|410209358|gb|JAA01898.1| chromosome 12 open reading frame 49 [Pan troglodytes]
 gi|410255302|gb|JAA15618.1| chromosome 12 open reading frame 49 [Pan troglodytes]
 gi|410289894|gb|JAA23547.1| chromosome 12 open reading frame 49 [Pan troglodytes]
 gi|410335531|gb|JAA36712.1| chromosome 12 open reading frame 49 [Pan troglodytes]
          Length = 205

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183

Query: 77  NAY 79
           N Y
Sbjct: 184 NTY 186


>gi|224072158|ref|XP_002200066.1| PREDICTED: UPF0454 protein C12orf49 homolog [Taeniopygia guttata]
          Length = 205

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC+ YE+CVSCC+ P++  L E+ LN   IA           F+ C  +CR +S+SV HE
Sbjct: 124 CCSIYEFCVSCCLQPSKQHLLERFLNRAAIAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183

Query: 77  NAY 79
           N Y
Sbjct: 184 NTY 186


>gi|449685232|ref|XP_004210848.1| PREDICTED: UPF0454 protein C12orf49 homolog [Hydra magnipapillata]
          Length = 191

 Score = 51.2 bits (121), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 17  SQCCNSYEYCVSCCVNPA---RTLKEQVL-NVKIAKPTTAGTYGSVFDFCAGRCRHNSES 72
           ++CC  YE+CVSCC+ P    R L E++L +  +A+          F+ C  +CR +S S
Sbjct: 108 NRCCEVYEHCVSCCLQPVPRKRRLIEKILVHAHMARNPLLREVKDTFELCLHKCRTSSAS 167

Query: 73  VVHENAYL-SDFHHCFSMPS 91
           V  EN Y  S F +CF   S
Sbjct: 168 VYQENKYKDSSFKYCFGTDS 187


>gi|395833970|ref|XP_003789990.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Otolemur
           garnettii]
          Length = 205

 Score = 50.8 bits (120), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 124 CCSAYEYCVSCCLQPNKQLLLERFLNRAAMAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183

Query: 77  NAY 79
           N Y
Sbjct: 184 NTY 186


>gi|332250748|ref|XP_003274513.1| PREDICTED: UPF0454 protein C12orf49 homolog [Nomascus leucogenys]
          Length = 286

 Score = 50.8 bits (120), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 205 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 264

Query: 77  NAY 79
           N Y
Sbjct: 265 NTY 267


>gi|260782034|ref|XP_002586098.1| hypothetical protein BRAFLDRAFT_252331 [Branchiostoma floridae]
 gi|229271187|gb|EEN42109.1| hypothetical protein BRAFLDRAFT_252331 [Branchiostoma floridae]
          Length = 200

 Score = 50.8 bits (120), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART--LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC  YE+CVSCC+ P +   L+  + +V  A  T   +    F+ C  +CR +S+SV HE
Sbjct: 118 CCQVYEHCVSCCLQPEKQPLLRSLLDHVPDAFKTLFASVTDHFELCLAKCRTSSQSVQHE 177

Query: 77  NAY 79
           N+Y
Sbjct: 178 NSY 180


>gi|397525490|ref|XP_003832699.1| PREDICTED: UPF0454 protein C12orf49 homolog [Pan paniscus]
          Length = 247

 Score = 50.8 bits (120), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 166 CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 225

Query: 77  NAY 79
           N Y
Sbjct: 226 NTY 228


>gi|281208176|gb|EFA82354.1| cGMP-specific phosphodiesterase [Polysphondylium pallidum PN500]
          Length = 283

 Score = 50.4 bits (119), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 42/78 (53%), Gaps = 2/78 (2%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNS 70
           GC+    CCN YE CVSCC+   +  + E+VL   + K  ++    + F+ C   CR +S
Sbjct: 200 GCSNKFHCCNEYELCVSCCMGSDKVPILEEVLGRMLDKGISSNYLRNQFELCTMTCRTSS 259

Query: 71  ESVVHENAYLS-DFHHCF 87
            S++H+  Y+  D   C+
Sbjct: 260 RSLIHQREYIEPDLKFCY 277


>gi|56753599|gb|AAW25002.1| SJCHGC02079 protein [Schistosoma japonicum]
          Length = 244

 Score = 50.4 bits (119), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 16  LSQCCNSYEYCVSCCVNPA-RTLKEQVL-NVKIAKPTTAGTYGSVFDFCAGRCRHNSESV 73
            + CC+ YE+CVSCC+NP  + L +QV+ N               F+FC   CR +S +V
Sbjct: 163 FNHCCSVYEHCVSCCLNPTNKPLWDQVMQNANANSRRHLKLAIDAFEFCVAVCRTSSLTV 222

Query: 74  VHENAYLS-DFHHCFSMPS 91
           +HEN Y + D  +CF + +
Sbjct: 223 LHENKYRNLDEIYCFGLET 241


>gi|299117113|emb|CBN73884.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 83

 Score = 50.4 bits (119), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 38/78 (48%), Gaps = 1/78 (1%)

Query: 127 GQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQS 186
           G  C  +   LLN C +++ +  C   C  S G +QP  V  DAP +  P  CL +    
Sbjct: 3   GMICKESSFQLLNSCGLLRDHFPCT-SCTESNGFEQPCYVELDAPEDNLPGQCLISSNPR 61

Query: 187 MLSCDGSHRHTRRLCPCA 204
             +C  SH  TRRLCPC 
Sbjct: 62  HSTCSASHEVTRRLCPCT 79


>gi|355564717|gb|EHH21217.1| hypothetical protein EGK_04231, partial [Macaca mulatta]
 gi|355786567|gb|EHH66750.1| hypothetical protein EGM_03800, partial [Macaca fascicularis]
          Length = 169

 Score = 50.4 bits (119), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 88  CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 147

Query: 77  NAY 79
           N Y
Sbjct: 148 NTY 150


>gi|417397061|gb|JAA45564.1| Putative conserved secreted protein precursor [Desmodus rotundus]
          Length = 205

 Score = 50.4 bits (119), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/63 (41%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YE+CVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 124 CCSAYEHCVSCCLQPGKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 183

Query: 77  NAY 79
           N Y
Sbjct: 184 NTY 186


>gi|290976460|ref|XP_002670958.1| predicted protein [Naegleria gruberi]
 gi|284084522|gb|EFC38214.1| predicted protein [Naegleria gruberi]
          Length = 350

 Score = 50.4 bits (119), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 39/149 (26%), Positives = 61/149 (40%), Gaps = 15/149 (10%)

Query: 58  VFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGE 117
           +F F    C ++      E +  +DF H  S+P     A        L  I  ++G  GE
Sbjct: 211 MFMFFVDACPNSLFYPCDETSAPNDFVHD-SLPCETLPASAIHSLQELQTIKAVLGALGE 269

Query: 118 SCDSVCKSSGQSCVLNKLILLNQCEIIQKYMS---CKRGCLASMGADQPAEVADDAPSNL 174
           +CD++C  S   C  N    +N+C+ ++K+ S   CK+             V D      
Sbjct: 270 NCDTICAKSESVCAPNYFPYINRCDEMRKHYSSCECKK-----------EGVIDSRAPYF 318

Query: 175 NPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
           +   C+    +S   C  +H   RRLCPC
Sbjct: 319 DGKYCIIGNRRSKFRCGNAHPSERRLCPC 347


>gi|326930051|ref|XP_003211166.1| PREDICTED: UPF0454 protein C12orf49 homolog [Meleagris gallopavo]
          Length = 205

 Score = 50.4 bits (119), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 36/63 (57%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC+ YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 124 CCSVYEYCVSCCLQPNKQHLLERFLNRAAMAFQNLFMAVEDRFELCLAKCRTSSQSVQHE 183

Query: 77  NAY 79
           N Y
Sbjct: 184 NTY 186


>gi|348533454|ref|XP_003454220.1| PREDICTED: UPF0454 protein C12orf49 homolog [Oreochromis niloticus]
          Length = 200

 Score = 50.4 bits (119), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 45/81 (55%), Gaps = 7/81 (8%)

Query: 3   PRERREILLGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV--- 58
           P  R+ I   C L + CC+ YEYCVSCC+ P +  L E+ LN   A       + +V   
Sbjct: 104 PSTRQYICKSC-LANGCCSIYEYCVSCCLQPDKQLLLERFLNR--AAEGFQNLFTAVEDH 160

Query: 59  FDFCAGRCRHNSESVVHENAY 79
           F+ C  +CR +S+SV HEN Y
Sbjct: 161 FELCLAKCRTSSQSVQHENTY 181


>gi|145348106|ref|XP_001418497.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578726|gb|ABO96790.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 198

 Score = 50.4 bits (119), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 2/83 (2%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNS 70
           GC+  ++CC  YE C +CC  P   ++  +      +   A G +   F++C  RCR   
Sbjct: 96  GCDESARCCRDYETCAACCQGPTHDVEAAMKRHARGRNQIATGFFEDAFEYCKSRCRTQP 155

Query: 71  ESVVHENAYLSDFHHCF-SMPSN 92
               HEN Y  D  +C+   P N
Sbjct: 156 SVTFHENTYAYDTAYCYGDYPVN 178


>gi|354466980|ref|XP_003495949.1| PREDICTED: UPF0454 protein C12orf49 homolog [Cricetulus griseus]
 gi|344237025|gb|EGV93128.1| UPF0454 protein C12orf49-like [Cricetulus griseus]
          Length = 205

 Score = 50.1 bits (118), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
           GC L + CC +YE+CVSCC+ P++  L E+ LN   +A           F+ C  +CR +
Sbjct: 118 GC-LANGCCEAYEHCVSCCLQPSKQLLLERFLNRAAMAFQNLFMAVEDHFELCLAKCRTS 176

Query: 70  SESVVHENAY 79
           S+SV HEN Y
Sbjct: 177 SQSVQHENTY 186


>gi|91087999|ref|XP_973740.1| PREDICTED: similar to CG15643 CG15643-PA [Tribolium castaneum]
 gi|270011897|gb|EFA08345.1| hypothetical protein TcasGA2_TC005988 [Tribolium castaneum]
          Length = 203

 Score = 50.1 bits (118), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 41/68 (60%), Gaps = 8/68 (11%)

Query: 17  SQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGT--YGSV---FDFCAGRCRHNSE 71
           + CC  YE+C+SCC++P    K+++L   + K T      + SV   F+ C  +CR NS+
Sbjct: 116 NNCCAIYEFCISCCLHPD---KKEMLENLLGKATEQNNVLFASVTDHFELCLAKCRTNSQ 172

Query: 72  SVVHENAY 79
           SV HEN+Y
Sbjct: 173 SVQHENSY 180


>gi|353230151|emb|CCD76322.1| hypothetical protein Smp_168260 [Schistosoma mansoni]
          Length = 375

 Score = 50.1 bits (118), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 17  SQCCNSYEYCVSCCVNPA-RTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVH 75
           + CC+ YE+CVSCC+NP  ++L ++V N+              F+FC   CR +S +V H
Sbjct: 112 NHCCSIYEHCVSCCLNPINKSLWDKVQNLDANYKRHLQLATDAFEFCIAVCRTSSLTVFH 171

Query: 76  ENAY 79
           EN Y
Sbjct: 172 ENRY 175


>gi|256086857|ref|XP_002579602.1| hypothetical protein [Schistosoma mansoni]
          Length = 392

 Score = 50.1 bits (118), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 17  SQCCNSYEYCVSCCVNPA-RTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVH 75
           + CC+ YE+CVSCC+NP  ++L ++V N+              F+FC   CR +S +V H
Sbjct: 112 NHCCSIYEHCVSCCLNPINKSLWDKVQNLDANYKRHLQLATDAFEFCIAVCRTSSLTVFH 171

Query: 76  ENAY 79
           EN Y
Sbjct: 172 ENRY 175


>gi|340369555|ref|XP_003383313.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase B-like
           [Amphimedon queenslandica]
          Length = 662

 Score = 50.1 bits (118), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 26/97 (26%), Positives = 44/97 (45%), Gaps = 1/97 (1%)

Query: 108 ISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCK-RGCLASMGADQPAEV 166
           +  +I + G++C   C  +   C  +   L+N  E +  ++ C     L    AD    +
Sbjct: 553 LRTVISKPGQNCRDSCLENKMVCESSFFSLINTHEAVFNHLGCNVTHTLEEKYADYAPAI 612

Query: 167 ADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
           A  A   + P  C+   I+ +L C G  ++ RRLCPC
Sbjct: 613 AVSADKPIEPGHCVIIDIRRLLGCPGYSKNFRRLCPC 649


>gi|47221493|emb|CAG08155.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 201

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 45/81 (55%), Gaps = 7/81 (8%)

Query: 3   PRERREILLGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV--- 58
           P  R+ I   C L + CC+ YEYCVSCC+ P +  L E+ LN   A       + +V   
Sbjct: 105 PSSRQYICKSC-LANGCCSIYEYCVSCCLQPDKQPLLERFLNR--AAEGFRNLFTAVEDH 161

Query: 59  FDFCAGRCRHNSESVVHENAY 79
           F+ C  +CR +S+SV HEN Y
Sbjct: 162 FELCLAKCRTSSQSVQHENTY 182


>gi|390359679|ref|XP_797277.2| PREDICTED: UPF0454 protein C12orf49 homolog [Strongylocentrotus
           purpuratus]
          Length = 198

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 37/93 (39%), Positives = 49/93 (52%), Gaps = 11/93 (11%)

Query: 3   PRERREILLGCNLLSQ--CCNSYEYCVSCCVNPARTLKEQVLNVKIAKP--TTAGTYGSV 58
           P  +   L  CN  S+  CC  YE CVSCC++P    KE+VL   + K   T    + SV
Sbjct: 98  PEAKSSSLHNCNGCSKNGCCVMYERCVSCCMHPE---KERVLQDMLGKASHTFRLLFQSV 154

Query: 59  ---FDFCAGRCRHNSESVVHENAYLS-DFHHCF 87
              F+ C  +CR +S+SV HEN Y   D  HC+
Sbjct: 155 ADHFELCLTKCRTSSQSVQHENTYRDPDAKHCY 187


>gi|332840508|ref|XP_003314001.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 2 [Pan
           troglodytes]
          Length = 175

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 94  CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 153

Query: 77  NAY 79
           N Y
Sbjct: 154 NTY 156


>gi|410923361|ref|XP_003975150.1| PREDICTED: UPF0454 protein C12orf49 homolog [Takifugu rubripes]
          Length = 199

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 45/81 (55%), Gaps = 7/81 (8%)

Query: 3   PRERREILLGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV--- 58
           P  R+ I   C L + CC+ YEYCVSCC+ P +  L E+ LN   A       + +V   
Sbjct: 103 PSSRQYICKSC-LANGCCSIYEYCVSCCLQPDKQPLLERFLNR--AAEGFRNLFTAVEDH 159

Query: 59  FDFCAGRCRHNSESVVHENAY 79
           F+ C  +CR +S+SV HEN Y
Sbjct: 160 FELCLAKCRTSSQSVQHENTY 180


>gi|402887791|ref|XP_003907264.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 2 [Papio
           anubis]
          Length = 175

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 94  CCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 153

Query: 77  NAY 79
           N Y
Sbjct: 154 NTY 156


>gi|344295368|ref|XP_003419384.1| PREDICTED: UPF0454 protein C12orf49 homolog [Loxodonta africana]
          Length = 174

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 42/70 (60%), Gaps = 3/70 (4%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
           GC L + CC++YEYCVSCC+ P++  L E+ L+   +A           F+ C  +CR +
Sbjct: 87  GC-LSNGCCSAYEYCVSCCLQPSKQLLLERFLSRAAVAFQNLFMAVEDHFELCLAKCRTS 145

Query: 70  SESVVHENAY 79
           S+SV HEN Y
Sbjct: 146 SQSVQHENTY 155


>gi|395833972|ref|XP_003789991.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 2 [Otolemur
           garnettii]
          Length = 175

 Score = 49.7 bits (117), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 94  CCSAYEYCVSCCLQPNKQLLLERFLNRAAMAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 153

Query: 77  NAY 79
           N Y
Sbjct: 154 NTY 156


>gi|66472238|ref|NP_001018573.1| UPF0454 protein C12orf49 homolog precursor [Danio rerio]
 gi|82192840|sp|Q503V3.1|CL049_DANRE RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
 gi|63100598|gb|AAH95168.1| Zgc:110063 [Danio rerio]
          Length = 199

 Score = 49.3 bits (116), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 28/65 (43%), Positives = 38/65 (58%), Gaps = 6/65 (9%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVV 74
           CC+ YEYCVSCC+ P +  L E+ LN   A       + +V   F+ C  +CR +S+SV 
Sbjct: 118 CCSIYEYCVSCCLQPDKQPLLERFLNR--AAEGFQNLFTAVEDHFELCLAKCRTSSQSVQ 175

Query: 75  HENAY 79
           HEN Y
Sbjct: 176 HENTY 180


>gi|427782911|gb|JAA56907.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 210

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 45/75 (60%), Gaps = 7/75 (9%)

Query: 19  CCNSYEYCVSCCVNPAR-TLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVV 74
           CC+ YEYC+SCC++P + T+ +++L    A  T      S+   F+ C  +CR +S+SV 
Sbjct: 121 CCSIYEYCISCCMHPQKITILQKILGK--ASDTFKMLVASITDHFELCLTKCRTSSDSVH 178

Query: 75  HENAYLS-DFHHCFS 88
           HEN+Y      HC++
Sbjct: 179 HENSYRDPKAKHCYA 193


>gi|241792024|ref|XP_002400643.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215510808|gb|EEC20261.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 209

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/74 (39%), Positives = 41/74 (55%), Gaps = 5/74 (6%)

Query: 19  CCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVH 75
           CC+ YEYC+SCC+ P +    Q +  K A  T      S+   F+ C  +CR +SESV H
Sbjct: 120 CCSIYEYCISCCMQPQKIALLQKIMGK-ASETFKMLVASITDHFELCLTKCRTSSESVHH 178

Query: 76  ENAYLS-DFHHCFS 88
           EN+Y      HC++
Sbjct: 179 ENSYRDPSAKHCYA 192


>gi|432116785|gb|ELK37410.1| hypothetical protein MDA_GLEAN10014215 [Myotis davidii]
          Length = 149

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHN 69
           GC L + CC++YE+CVSCC+ P +  L E+ LN   +A           F+ C  +CR +
Sbjct: 62  GC-LTNGCCSAYEHCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTS 120

Query: 70  SESVVHENAY 79
           S+SV HEN Y
Sbjct: 121 SQSVQHENTY 130


>gi|345483911|ref|XP_001602536.2| PREDICTED: UPF0454 protein C12orf49 homolog [Nasonia vitripennis]
          Length = 329

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 42/89 (47%), Gaps = 15/89 (16%)

Query: 19  CCNSYEYCVSCCVNPARTLKEQVLNV-KIAKPTTAGTYG-----------SVFDFCAGRC 66
           CC  YEYCVSCC++P +  K +   + +I K       G             F  C   C
Sbjct: 174 CCTIYEYCVSCCLDPNKVKKRKSFTIDQIRKENLKLRRGEDILKLRLRSLDRFQLCLAAC 233

Query: 67  RHNSESVVHENAYLSD-FHHCF--SMPSN 92
           R +S SV HEN Y +  + HC+   +PSN
Sbjct: 234 RTSSASVQHENTYKNPLYKHCYILQLPSN 262


>gi|346472337|gb|AEO36013.1| hypothetical protein [Amblyomma maculatum]
          Length = 206

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 44/75 (58%), Gaps = 7/75 (9%)

Query: 19  CCNSYEYCVSCCVNPAR-TLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVV 74
           CC+ YEYC+SCC++P + TL +++L    A  T      S+   F+ C  +CR +S SV 
Sbjct: 121 CCSIYEYCISCCMHPQKITLLQKILGK--ASDTFKMLVASITDHFELCLTKCRTSSNSVH 178

Query: 75  HENAYLS-DFHHCFS 88
           HEN+Y      HC++
Sbjct: 179 HENSYRDPKAKHCYA 193


>gi|221042418|dbj|BAH12886.1| unnamed protein product [Homo sapiens]
          Length = 175

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/63 (41%), Positives = 36/63 (57%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC++YEYC SCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HE
Sbjct: 94  CCSAYEYCASCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHE 153

Query: 77  NAY 79
           N Y
Sbjct: 154 NTY 156


>gi|426221180|ref|XP_004004788.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Ovis aries]
          Length = 739

 Score = 48.1 bits (113), Expect = 0.002,   Method: Composition-based stats.
 Identities = 33/104 (31%), Positives = 51/104 (49%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V     G+SC  VC+ +   C  +    LN+ + + KY ++C           Q 
Sbjct: 633 LSALQVKFAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYEVNC-----------QS 681

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D  APS  +P    C++     + SC G+H H RR+CPC
Sbjct: 682 SELAKDIVAPS-FDPKNKHCVFQGDLLLFSCAGAHAHHRRICPC 724


>gi|225711380|gb|ACO11536.1| UPF0454 protein C12orf49 homolog precursor [Caligus rogercresseyi]
          Length = 210

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 45/83 (54%), Gaps = 4/83 (4%)

Query: 13  CNLLSQCCNSYEYCVSCCVNP-ARTLKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNS 70
           CNL   CC++YE CVSCC++P  R     VLN  +        +    ++ C  +CR +S
Sbjct: 122 CNLDMGCCSAYEACVSCCMSPDKRAFLMSVLNEARSLSNILILSVSDQYELCLAKCRTSS 181

Query: 71  ESVVHENAYLS-DFHHCF-SMPS 91
            SV HEN Y + ++  C+  +PS
Sbjct: 182 RSVHHENTYKNPEYKFCYKKIPS 204


>gi|321477285|gb|EFX88244.1| hypothetical protein DAPPUDRAFT_305537 [Daphnia pulex]
          Length = 222

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 40/68 (58%), Gaps = 12/68 (17%)

Query: 19  CCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV-------FDFCAGRCRHNSE 71
           CC+ YEYC+SCC+ P    K++VL   + K   AGT   +       F+ C  +CR +S+
Sbjct: 128 CCSIYEYCISCCLQPD---KKEVLQDVLGK--AAGTLNLLLTSVTDHFELCLTKCRTSSQ 182

Query: 72  SVVHENAY 79
           SV HEN+Y
Sbjct: 183 SVQHENSY 190


>gi|449265786|gb|EMC76924.1| UPF0454 protein C12orf49 like protein, partial [Columba livia]
          Length = 187

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 36/63 (57%), Gaps = 2/63 (3%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC+ YE CVSCC+ P++  L E+ LN   IA           F+ C  +CR +S+SV HE
Sbjct: 106 CCSVYECCVSCCLQPSKQHLLERFLNRAAIAFQNLFMAVEDRFELCLAKCRTSSQSVQHE 165

Query: 77  NAY 79
           N Y
Sbjct: 166 NTY 168


>gi|358339009|dbj|GAA47150.1| UPF0454 protein C12orf49 homolog, partial [Clonorchis sinensis]
          Length = 186

 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 48/95 (50%), Gaps = 5/95 (5%)

Query: 6   RREILLGCNLLSQ-CCNSYEYCVSCCVNPART--LKEQVLNV-KIAKPTTAGTYGSVFDF 61
           RR +   CN   + CC  YE+CVSCC+N       +E +L   ++           +F +
Sbjct: 91  RRFVCDSCNQTRENCCQIYEHCVSCCMNRQHIHLWREALLQASELPAQRHLLLADGLFQY 150

Query: 62  CAGRCRHNSESVVHENAYLSDFH-HCFSMPSNASG 95
           C  +CR +S+SV+HEN Y    H +CF +   + G
Sbjct: 151 CQAKCRTSSQSVLHENTYRDAEHRYCFGLYLTSPG 185


>gi|198423098|ref|XP_002130011.1| PREDICTED: similar to Chromosome 12 open reading frame 49 ortholog
           [Ciona intestinalis]
          Length = 210

 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 41/77 (53%), Gaps = 8/77 (10%)

Query: 17  SQCCNSYEYCVSCCVNPA-RTLKEQVLNVKIAKPTT----AGTYGSVFDFCAGRCRHNSE 71
           S CC+ YE+CVSCC+ P  + L E+V  V+  K  T      +    FD C  +CR +S 
Sbjct: 126 SNCCSVYEHCVSCCLQPKHKNLLEKV--VRGLKTNTMLIMMSSIHDQFDLCLMKCRTSST 183

Query: 72  SVVHENAYLS-DFHHCF 87
           SV HEN Y      +CF
Sbjct: 184 SVRHENTYRDPSMKYCF 200


>gi|327276076|ref|XP_003222797.1| PREDICTED: LOW QUALITY PROTEIN: UPF0454 protein C12orf49 homolog
           [Anolis carolinensis]
          Length = 205

 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/66 (42%), Positives = 38/66 (57%), Gaps = 8/66 (12%)

Query: 19  CCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTA--GTYGSV---FDFCAGRCRHNSESV 73
           CC+ YE CVSCC+ P    K+QVL   + +   A    + SV   F+ C  +CR +S+SV
Sbjct: 124 CCSVYEACVSCCLQPN---KQQVLENFLNRAAVAFRNLFLSVEDHFELCLAKCRTSSQSV 180

Query: 74  VHENAY 79
            HEN Y
Sbjct: 181 QHENTY 186


>gi|324509188|gb|ADY43866.1| Unknown [Ascaris suum]
          Length = 218

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 47/89 (52%), Gaps = 13/89 (14%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAG-------TYGSVFDFCAG 64
           GC+L S CC+ YE+CV CC+ P +  KE +L +     TT+G       +    F+ C  
Sbjct: 130 GCDLHSGCCSFYEHCVGCCMRPDQ--KEILLEM---IETTSGHRLRQILSATDQFELCTL 184

Query: 65  RCRHNSESVVHENAYLSD-FHHCFSMPSN 92
           +CR +S SV  EN Y S+   +C+    N
Sbjct: 185 KCRTSSNSVHSENKYKSEALKYCYKQEPN 213


>gi|170033812|ref|XP_001844770.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167874847|gb|EDS38230.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 228

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 39/72 (54%), Gaps = 3/72 (4%)

Query: 19  CCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNSESVVHE 76
           CC  YE+CVSCC++P  ++L E+VL     +      T    ++ C  +CR NS SV +E
Sbjct: 124 CCAIYEFCVSCCMHPEKKSLLEKVLAKASGRQIAVYATVNDQYELCLTKCRTNSHSVQNE 183

Query: 77  NAYLS-DFHHCF 87
           N Y      HC+
Sbjct: 184 NKYRDPKSKHCY 195


>gi|324520392|gb|ADY47626.1| Unknown [Ascaris suum]
          Length = 180

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 47/89 (52%), Gaps = 13/89 (14%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAG-------TYGSVFDFCAG 64
           GC+L S CC+ YE+CV CC+ P +  KE +L +     TT+G       +    F+ C  
Sbjct: 92  GCDLHSGCCSFYEHCVGCCMRPDQ--KEILLEM---IETTSGHRLRQILSATDQFELCTL 146

Query: 65  RCRHNSESVVHENAYLSD-FHHCFSMPSN 92
           +CR +S SV  EN Y S+   +C+    N
Sbjct: 147 KCRTSSNSVHSENKYKSEALKYCYKQEPN 175


>gi|294955121|ref|XP_002788417.1| alpha-1,3-mannosyl-glycoprotein beta-1,
           2-n-acetylglucosaminyltransferase, putative [Perkinsus
           marinus ATCC 50983]
 gi|239903817|gb|EER20213.1| alpha-1,3-mannosyl-glycoprotein beta-1,
           2-n-acetylglucosaminyltransferase, putative [Perkinsus
           marinus ATCC 50983]
          Length = 1171

 Score = 46.6 bits (109), Expect = 0.007,   Method: Composition-based stats.
 Identities = 28/89 (31%), Positives = 38/89 (42%), Gaps = 3/89 (3%)

Query: 115 QGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNL 174
           +G SC   C  +   C  + L   N C I+  +  C+ GC   +G + PA V D     +
Sbjct: 513 RGVSCTHTCIHNNARCYDHLLEFANDCHIMLSHFPCEEGCGHQLGLELPAYVMDSTLHTV 572

Query: 175 NPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
               CL T   S  SC   H  T R+C C
Sbjct: 573 G--QCLVTDHGSP-SCATYHISTARMCTC 598


>gi|330794772|ref|XP_003285451.1| hypothetical protein DICPUDRAFT_149321 [Dictyostelium purpureum]
 gi|325084626|gb|EGC38050.1| hypothetical protein DICPUDRAFT_149321 [Dictyostelium purpureum]
          Length = 322

 Score = 45.4 bits (106), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 40/73 (54%), Gaps = 2/73 (2%)

Query: 17  SQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTY-GSVFDFCAGRCRHNSESVVH 75
           S CC++YEYC+SCC+   +    Q + +  A+   A  Y  + FD C   CR +S S+++
Sbjct: 243 SDCCDTYEYCISCCMRYDKVPVVQEMLIGFAERGVAKEYLKNQFDLCTMACRTSSSSLLN 302

Query: 76  ENAYLS-DFHHCF 87
           +  Y +  F  C+
Sbjct: 303 QREYQNPKFKFCY 315


>gi|402580999|gb|EJW74948.1| hypothetical protein WUBG_14143 [Wuchereria bancrofti]
          Length = 183

 Score = 45.1 bits (105), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 44/85 (51%), Gaps = 3/85 (3%)

Query: 11  LGCNLLSQCCNSYEYCVSCCVNPART--LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRH 68
           + C+L + CC+ YE CVSCC+ P +   L + +++    +     +    F+ C  +CR 
Sbjct: 94  VWCDLYTGCCSHYEQCVSCCLRPEQKNILLKMIVSTSGHRLRQILSATDQFELCMLKCRT 153

Query: 69  NSESVVHENAYLSD-FHHCFSMPSN 92
           +S SV  EN Y S+   +C+    N
Sbjct: 154 SSNSVHSENKYKSEKMKYCYKQEPN 178


>gi|383856970|ref|XP_003703979.1| PREDICTED: UPF0454 protein C12orf49 homolog [Megachile rotundata]
          Length = 325

 Score = 45.1 bits (105), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 33/85 (38%), Positives = 42/85 (49%), Gaps = 15/85 (17%)

Query: 19  CCNSYEYCVSCCVNPARTLK--EQVL------NVKIAKPTTAGTYG----SVFDFCAGRC 66
           CC  YEYCVSCC++P + +K  + VL      N K  K   A          F  C   C
Sbjct: 173 CCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKAHKDQDAVKKRLRNLDRFQVCLAAC 232

Query: 67  RHNSESVVHENAYLSDFH--HCFSM 89
           R +S SV HEN Y  D H  HC+++
Sbjct: 233 RTSSASVRHENTY-KDPHSKHCYTL 256


>gi|348670076|gb|EGZ09898.1| hypothetical protein PHYSODRAFT_549244 [Phytophthora sojae]
          Length = 367

 Score = 44.7 bits (104), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 6/78 (7%)

Query: 18  QCCNSYEYCVSCCVNPARTL-----KEQVLNVKIAKPTTAGTYG-SVFDFCAGRCRHNSE 71
            CC+ +E+CVSCC++P         ++ + +     P  +     +VF +C  RCR +S 
Sbjct: 236 HCCDVFEHCVSCCMHPLNVRFTGIRRDYLTHADPNHPVYSDPAEITVFQYCQYRCRTSSA 295

Query: 72  SVVHENAYLSDFHHCFSM 89
           SV H+N+Y S    C+ +
Sbjct: 296 SVQHQNSYRSHRSFCYGI 313


>gi|326432709|gb|EGD78279.1| hypothetical protein PTSG_09343 [Salpingoeca sp. ATCC 50818]
          Length = 198

 Score = 44.7 bits (104), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 32/62 (51%), Gaps = 2/62 (3%)

Query: 18  QCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPT--TAGTYGSVFDFCAGRCRHNSESVVH 75
            CC  YE+CVSCC+ P      + +   + +P           F++C  +CR +S+SV H
Sbjct: 128 HCCRVYEHCVSCCLAPKHVPLRRFVFHNLTEPQRRVLSLVSDQFEYCEAKCRTSSQSVQH 187

Query: 76  EN 77
           EN
Sbjct: 188 EN 189


>gi|440902995|gb|ELR53712.1| Alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Bos grunniens
           mutus]
          Length = 739

 Score = 44.7 bits (104), Expect = 0.022,   Method: Composition-based stats.
 Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V     G+SC  VC+ +   C  +    LN+ + + KY ++C           Q 
Sbjct: 633 LSALQVKFAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYDVNC-----------QS 681

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D  APS  +P    C++     + SC G+H   RR+CPC
Sbjct: 682 SELAKDIVAPS-FDPKNKHCVFQGDLLLFSCAGAHARHRRICPC 724


>gi|300797805|ref|NP_001178271.1| alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Bos taurus]
 gi|296490527|tpg|DAA32640.1| TPA: mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [Bos taurus]
          Length = 739

 Score = 44.7 bits (104), Expect = 0.022,   Method: Composition-based stats.
 Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V     G+SC  VC+ +   C  +    LN+ + + KY ++C           Q 
Sbjct: 633 LSALQVKFAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYDVNC-----------QS 681

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D  APS  +P    C++     + SC G+H   RR+CPC
Sbjct: 682 SELAKDIVAPS-FDPKNKHCVFQGDLLLFSCAGAHARHRRICPC 724


>gi|403280116|ref|XP_003931580.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Saimiri
           boliviensis boliviensis]
          Length = 741

 Score = 44.7 bits (104), Expect = 0.023,   Method: Composition-based stats.
 Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY ++C           Q 
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYEVTC-----------QS 683

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D   PS  +P    C++     + SC G+H   RR+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHRRVCPC 726


>gi|296204935|ref|XP_002749539.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Callithrix
           jacchus]
          Length = 741

 Score = 44.7 bits (104), Expect = 0.023,   Method: Composition-based stats.
 Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY ++C           Q 
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYEVTC-----------QS 683

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D   PS  +P    C++     + SC G+H   RR+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHRRVCPC 726


>gi|312070660|ref|XP_003138249.1| hypothetical protein LOAG_02664 [Loa loa]
 gi|307766590|gb|EFO25824.1| hypothetical protein LOAG_02664 [Loa loa]
          Length = 224

 Score = 44.3 bits (103), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 3/84 (3%)

Query: 12  GCNLLSQCCNSYEYCVSCCVNPAR--TLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHN 69
            C+L + CC+ YE CVSCC+ P +   L E + +    +     +    F+ C  +CR +
Sbjct: 136 WCDLYTGCCSYYEQCVSCCLRPEQKNILLEMIESTSGHRLRQILSATDQFELCMLKCRTS 195

Query: 70  SESVVHENAYLSD-FHHCFSMPSN 92
           S SV  EN Y S+   +C+    N
Sbjct: 196 SNSVHSENKYKSEKMKYCYKQEPN 219


>gi|301113522|ref|XP_002998531.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262111832|gb|EEY69884.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 363

 Score = 44.3 bits (103), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 6/78 (7%)

Query: 18  QCCNSYEYCVSCCVNPARTL-----KEQVLNVKIAKPTTAGTYG-SVFDFCAGRCRHNSE 71
            CC+ +E+CVSCC++P         ++ + +     P  +     +VF +C  RCR +S 
Sbjct: 236 HCCDVFEHCVSCCMHPLNVRSSGIRRDFLTHADPNHPVYSDPAEITVFQYCQYRCRTSSA 295

Query: 72  SVVHENAYLSDFHHCFSM 89
           SV H+N+Y S    C+ +
Sbjct: 296 SVQHQNSYRSHRSFCYGI 313


>gi|443709414|gb|ELU04086.1| hypothetical protein CAPTEDRAFT_93600 [Capitella teleta]
          Length = 211

 Score = 44.3 bits (103), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 38/65 (58%), Gaps = 6/65 (9%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVV 74
           CC  YE+CVSCC+ P +  L   +L+   A  +    + SV   F+ C  +CR +S+SV 
Sbjct: 126 CCEVYEHCVSCCLLPEKQPLLRNILSR--ASDSFRHLFASVADHFELCLAKCRTSSQSVQ 183

Query: 75  HENAY 79
           HEN+Y
Sbjct: 184 HENSY 188


>gi|307185715|gb|EFN71631.1| UPF0454 protein C12orf49-like protein [Camponotus floridanus]
          Length = 322

 Score = 43.9 bits (102), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 39/83 (46%), Gaps = 10/83 (12%)

Query: 13  CNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV-------FDFCAGR 65
           CN L  CC  YEYCVSCC++P + +  Q+   K+         G +       F  C   
Sbjct: 173 CNSLG-CCTIYEYCVSCCLDPDKQIL-QMRGRKLGSARMQKDGGRLRIRNQDRFQICLAV 230

Query: 66  CRHNSESVVHENAYLSDF-HHCF 87
           CR +S SV HEN Y      HC+
Sbjct: 231 CRTSSLSVQHENTYKDPLAKHCY 253


>gi|66817203|ref|XP_642480.1| hypothetical protein DDB_G0277747 [Dictyostelium discoideum AX4]
 gi|60470568|gb|EAL68547.1| hypothetical protein DDB_G0277747 [Dictyostelium discoideum AX4]
          Length = 312

 Score = 43.9 bits (102), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 42/76 (55%), Gaps = 4/76 (5%)

Query: 16  LSQCCNSYEYCVSCCVNPART--LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESV 73
           +S CC+SYE+C+SCC+   +   L + ++N    K        + FD C+  CR +S S+
Sbjct: 229 VSDCCDSYEFCISCCMRQDKVSILHDTLINF-AEKNLDKKYLKNQFDLCSMACRTSSASL 287

Query: 74  VHENAYLS-DFHHCFS 88
           V++  Y   +F  C+S
Sbjct: 288 VNQREYQDPEFKFCYS 303


>gi|291391470|ref|XP_002712459.1| PREDICTED: N-acetylglucosaminyltransferase V [Oryctolagus
           cuniculus]
          Length = 741

 Score = 43.5 bits (101), Expect = 0.046,   Method: Composition-based stats.
 Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ +   C  +    LN+ + + KY +SC           Q 
Sbjct: 635 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKNKDVLKYEVSC-----------QS 683

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D   PS  +P    C++     + SC G+H   +R+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRICPC 726


>gi|380014036|ref|XP_003691050.1| PREDICTED: UPF0454 protein C12orf49 homolog [Apis florea]
          Length = 327

 Score = 43.5 bits (101), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 46/102 (45%), Gaps = 16/102 (15%)

Query: 2   LPRERREILLGCNLLSQCCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV- 58
           +PR  R     CN    CC  YEYCVSCC++P + +K  + VL   +     +     V 
Sbjct: 157 IPRRERYSCKTCNA-QGCCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKSHRDQDVV 215

Query: 59  ---------FDFCAGRCRHNSESVVHENAYLSDFH--HCFSM 89
                    F  C   CR +S SV HEN Y  D H  HC++ 
Sbjct: 216 KKRLRNLDRFQVCLAACRTSSASVRHENTY-KDPHSKHCYTF 256


>gi|301770689|ref|XP_002920763.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A-like
           [Ailuropoda melanoleuca]
 gi|281337646|gb|EFB13230.1| hypothetical protein PANDA_009537 [Ailuropoda melanoleuca]
          Length = 741

 Score = 43.5 bits (101), Expect = 0.054,   Method: Composition-based stats.
 Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ +   C  +    LN+ + + KY ++C           Q 
Sbjct: 635 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYEVTC-----------QS 683

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D   PS  +P    C++     + SC G+H   RR+CPC
Sbjct: 684 SELAKDILVPS-FDPKNRHCVFQGDLLLFSCAGAHPRHRRVCPC 726


>gi|426337254|ref|XP_004032629.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Gorilla
           gorilla gorilla]
          Length = 741

 Score = 43.5 bits (101), Expect = 0.056,   Method: Composition-based stats.
 Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY   K  C       Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684

Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           E+A D   PS  +P    C++     + SC G+H   +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQADLLLFSCAGAHPRYQRVCPC 726


>gi|344268130|ref|XP_003405916.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Loxodonta
           africana]
          Length = 741

 Score = 43.5 bits (101), Expect = 0.056,   Method: Composition-based stats.
 Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY ++C           Q 
Sbjct: 635 LSALQVKLAEPGKSCKQVCQESQLICEPSFFQHLNKDKDMLKYEVTC-----------QS 683

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D   PS  +P +  C++     + SC G+H   +R+CPC
Sbjct: 684 SELAKDILVPS-FDPKSKHCVFQGDLLLFSCAGAHPRHQRVCPC 726


>gi|340712268|ref|XP_003394684.1| PREDICTED: UPF0454 protein C12orf49 homolog [Bombus terrestris]
          Length = 329

 Score = 43.1 bits (100), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 41/85 (48%), Gaps = 15/85 (17%)

Query: 19  CCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV----------FDFCAGRC 66
           CC  YEYCVSCC++P + +K  + VL   +     A     V          F  C   C
Sbjct: 174 CCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKAHRDQDVVKKRLRNLDRFQVCLAAC 233

Query: 67  RHNSESVVHENAYLSDFH--HCFSM 89
           R +S SV HEN Y  D H  HC+++
Sbjct: 234 RTSSASVRHENTY-KDPHSKHCYTL 257


>gi|350593258|ref|XP_003133326.3| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Sus scrofa]
          Length = 738

 Score = 43.1 bits (100), Expect = 0.060,   Method: Composition-based stats.
 Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ +   C  +    LN+ + + KY + C           Q 
Sbjct: 632 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKDKDMLKYEVIC-----------QS 680

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D   PS  +P +  C++     + SC G+H   RR+CPC
Sbjct: 681 SELAKDILVPS-FDPKSKHCVFQGDLLLFSCAGAHPRHRRICPC 723


>gi|350405127|ref|XP_003487334.1| PREDICTED: UPF0454 protein C12orf49 homolog [Bombus impatiens]
          Length = 329

 Score = 43.1 bits (100), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 41/85 (48%), Gaps = 15/85 (17%)

Query: 19  CCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV----------FDFCAGRC 66
           CC  YEYCVSCC++P + +K  + VL   +     A     V          F  C   C
Sbjct: 174 CCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKAHRDQDVVKKRLRNLDRFQVCLAAC 233

Query: 67  RHNSESVVHENAYLSDFH--HCFSM 89
           R +S SV HEN Y  D H  HC+++
Sbjct: 234 RTSSASVRHENTY-KDPHSKHCYTL 257


>gi|297668509|ref|XP_002812479.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Pongo abelii]
          Length = 741

 Score = 43.1 bits (100), Expect = 0.069,   Method: Composition-based stats.
 Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY   K  C       Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684

Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           E+A D   PS  +P    C++     + SC G+H   +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 726


>gi|71424327|ref|XP_812757.1| glycosyl transferase [Trypanosoma cruzi strain CL Brener]
 gi|70877576|gb|EAN90906.1| glycosyl transferase, putative [Trypanosoma cruzi]
          Length = 657

 Score = 43.1 bits (100), Expect = 0.070,   Method: Composition-based stats.
 Identities = 31/114 (27%), Positives = 42/114 (36%), Gaps = 29/114 (25%)

Query: 116 GESCDSVCKSSGQSCVLNKLIL--LNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPS- 172
           G SCD+VC  +G  C L  L    LN CE ++    C R     MG        +D P+ 
Sbjct: 543 GLSCDAVCAQAGLRCALEALYFTSLNDCEELRSVFGCSR---CEMGVYWRPHPGNDFPAV 599

Query: 173 --------------------NLNPTACLYTRI---QSMLSCDGSHRHTRRLCPC 203
                               N     C Y  +   + + +C  S+  TRR CPC
Sbjct: 600 EDVSHEMEHENKERKRRLRDNSTNLVCRYNYLHDARGVPNCTASYTTTRRYCPC 653


>gi|402892052|ref|XP_003909237.1| PREDICTED: LOW QUALITY PROTEIN: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A-like [Papio
           anubis]
 gi|355566030|gb|EHH22459.1| hypothetical protein EGK_05729 [Macaca mulatta]
 gi|355751629|gb|EHH55884.1| hypothetical protein EGM_05175 [Macaca fascicularis]
 gi|380784409|gb|AFE64080.1| alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Macaca
           mulatta]
 gi|383412021|gb|AFH29224.1| alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Macaca
           mulatta]
 gi|384943612|gb|AFI35411.1| alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Macaca
           mulatta]
          Length = 741

 Score = 43.1 bits (100), Expect = 0.074,   Method: Composition-based stats.
 Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY   K  C       Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684

Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           E+A D   PS  +P    C++     + SC G+H   +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 726


>gi|4505167|ref|NP_002401.1| alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Homo sapiens]
 gi|114580983|ref|XP_515802.2| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A isoform 2 [Pan
           troglodytes]
 gi|397504546|ref|XP_003822849.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Pan paniscus]
 gi|1169980|sp|Q09328.1|MGT5A_HUMAN RecName: Full=Alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A; AltName:
           Full=Alpha-mannoside
           beta-1,6-N-acetylglucosaminyltransferase; AltName:
           Full=GlcNAc-T V; Short=GNT-V; AltName: Full=Mannoside
           acetylglucosaminyltransferase 5; AltName:
           Full=N-acetylglucosaminyl-transferase V
 gi|4545222|gb|AAD22449.1|AF113921_1 alpha-1,3(6)-mannosylglycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [Homo sapiens]
 gi|870752|dbj|BAA04570.1| N-acetylglucosaminyltransferase V [Homo sapiens]
 gi|119632062|gb|EAX11657.1| mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase, isoform CRA_a
           [Homo sapiens]
 gi|119632063|gb|EAX11658.1| mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase, isoform CRA_a
           [Homo sapiens]
 gi|162317656|gb|AAI56314.1| Mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [synthetic
           construct]
 gi|162319448|gb|AAI57049.1| Mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [synthetic
           construct]
 gi|410216290|gb|JAA05364.1| mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [Pan
           troglodytes]
 gi|410265502|gb|JAA20717.1| mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [Pan
           troglodytes]
 gi|410308382|gb|JAA32791.1| mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [Pan
           troglodytes]
 gi|410332945|gb|JAA35419.1| mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [Pan
           troglodytes]
          Length = 741

 Score = 42.7 bits (99), Expect = 0.075,   Method: Composition-based stats.
 Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY   K  C       Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684

Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           E+A D   PS  +P    C++     + SC G+H   +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 726


>gi|332236901|ref|XP_003267637.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Nomascus
           leucogenys]
          Length = 741

 Score = 42.7 bits (99), Expect = 0.076,   Method: Composition-based stats.
 Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY   K  C       Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684

Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           E+A D   PS  +P    C++     + SC G+H   +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 726


>gi|332016576|gb|EGI57457.1| UPF0454 protein C12orf49-like protein [Acromyrmex echinatior]
          Length = 293

 Score = 42.7 bits (99), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 36/72 (50%), Gaps = 3/72 (4%)

Query: 19  CCNSYEYCVSCCVNPARTLK-EQVLNVKIAK-PTTAGTYGSVFDFCAGRCRHNSESVVHE 76
           CC  YEYCVSCC++P + ++  + L+   AK     G     F  C   CR +S SV  +
Sbjct: 170 CCTIYEYCVSCCLHPDKQIRGRKDLSSGSAKVQKNVGRTAVHFQICLAACRTSSSSVRRD 229

Query: 77  NAYLSDF-HHCF 87
           N Y      HC+
Sbjct: 230 NTYKDPLAKHCY 241


>gi|290976364|ref|XP_002670910.1| predicted protein [Naegleria gruberi]
 gi|284084474|gb|EFC38166.1| predicted protein [Naegleria gruberi]
          Length = 466

 Score = 42.7 bits (99), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 57/116 (49%), Gaps = 18/116 (15%)

Query: 90  PSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQK-YM 148
           P+N  G  +T+L      +  I+  +GE+CD+ C  +  +C       +N+C+ ++K Y 
Sbjct: 364 PTNTHG--ITELHN----LKAILCEKGENCDTCCSRNNATCDGLFFPYINRCDEMRKHYP 417

Query: 149 SCKRGCLASMGADQPAEVADDAPS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
            CK  C          E+AD++ S N +   C+    +S   C  S+R  RRLCPC
Sbjct: 418 DCK--C--------AYELADESRSPNFDGELCVIGNRRSRFRCGNSYRTERRLCPC 463


>gi|110755335|ref|XP_001121865.1| PREDICTED: UPF0454 protein C12orf49 homolog [Apis mellifera]
          Length = 327

 Score = 42.7 bits (99), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 35/101 (34%), Positives = 45/101 (44%), Gaps = 16/101 (15%)

Query: 3   PRERREILLGCNLLSQCCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV-- 58
           PR  R     CN    CC  YEYCVSCC++P + +K  + VL   +     +     V  
Sbjct: 158 PRRERYSCKTCNA-QGCCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKSHRDQDVVK 216

Query: 59  --------FDFCAGRCRHNSESVVHENAYLSDFH--HCFSM 89
                   F  C   CR +S SV HEN Y  D H  HC++ 
Sbjct: 217 KRLRNLDRFQVCLAACRTSSASVRHENTY-KDPHSKHCYTF 256


>gi|428167931|gb|EKX36882.1| hypothetical protein GUITHDRAFT_116905 [Guillardia theta CCMP2712]
          Length = 652

 Score = 42.0 bits (97), Expect = 0.13,   Method: Composition-based stats.
 Identities = 27/91 (29%), Positives = 37/91 (40%), Gaps = 6/91 (6%)

Query: 115 QGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASM-GADQPAEVADDAPSN 173
           QG SC + C+    +C      +LN CE +       + C     G D PA   +D    
Sbjct: 290 QGVSCFTACQEQNLACQRIWFDVLNNCEALAVAFPSHKICSTDFYGRDLPAYRPEDLTVL 349

Query: 174 LNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 204
           +N     Y+      SC G H  T+RLC C 
Sbjct: 350 VNKKPRTYSS-----SCGGKHEKTKRLCGCG 375



 Score = 38.1 bits (87), Expect = 2.0,   Method: Composition-based stats.
 Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 4/91 (4%)

Query: 117 ESCDSVCKSSGQ--SCVLNKLILLNQCEIIQKYMSCKRGCLASM-GADQPAEVADDAPSN 173
           ++C   C  +G+   C      LLN+C+++Q        CL+   G D PA  +D     
Sbjct: 133 QNCHEACAENGEEWQCSSPWFDLLNRCDLLQVAFPHAATCLSHFYGRDLPALSSDVGGYA 192

Query: 174 LNPTACLYTRIQSM-LSCDGSHRHTRRLCPC 203
                 + ++ Q   +SCD    H+ RLC C
Sbjct: 193 RGDYVLVNSKPQEYDISCDAQGEHSARLCAC 223


>gi|444512939|gb|ELV10213.1| Alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Tupaia
           chinensis]
          Length = 702

 Score = 42.0 bits (97), Expect = 0.14,   Method: Composition-based stats.
 Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ +   C  +    LN+ + + KY ++C           Q 
Sbjct: 596 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKDKDMAKYAVTC-----------QT 644

Query: 164 AEVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D   PS  +P    C++     + SC G+H   +R+CPC
Sbjct: 645 SELAKDILVPS-FDPKDKHCVFQGDLLLFSCAGAHPRHQRICPC 687


>gi|290987267|ref|XP_002676344.1| predicted protein [Naegleria gruberi]
 gi|284089946|gb|EFC43600.1| predicted protein [Naegleria gruberi]
          Length = 481

 Score = 42.0 bits (97), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 53/123 (43%), Gaps = 9/123 (7%)

Query: 84  HHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEI 143
           H  F  P       +T  E  L+ +  ++  +GE CD+ CK   ++C ++    +N C  
Sbjct: 365 HDVFEFPLKPPTLKITLSE--LNKVESVLCEEGEDCDTCCKKRAKTCDVSFFPYINHCRE 422

Query: 144 IQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDG--SHRHTRRLC 201
           IQK+   K  C   +  +  + +A    +N +   C      S  SC    S  + RRLC
Sbjct: 423 IQKH--SKERCNCKIDDEFFSSIAPSMDTNKD---CYVGYRPSRFSCSNKKSKSNHRRLC 477

Query: 202 PCA 204
           PC 
Sbjct: 478 PCK 480


>gi|432096647|gb|ELK27235.1| Alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Myotis
           davidii]
          Length = 741

 Score = 41.6 bits (96), Expect = 0.18,   Method: Composition-based stats.
 Identities = 31/104 (29%), Positives = 49/104 (47%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ +   C  +    LN+ + + KY + C           Q 
Sbjct: 635 LSALQVKLAGPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYEVVC-----------QS 683

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D   PS  +P    C++     + SC G+H   RR+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHRRICPC 726


>gi|348585775|ref|XP_003478646.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A-like [Cavia
           porcellus]
          Length = 740

 Score = 41.6 bits (96), Expect = 0.18,   Method: Composition-based stats.
 Identities = 31/104 (29%), Positives = 49/104 (47%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY + C           Q 
Sbjct: 634 LSALQVKLASPGQSCKQVCQESQLICEPSFFQHLNKAKDLLKYEVIC-----------QS 682

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E A D   PS + P +  C++     + SC G+H   +R+CPC
Sbjct: 683 SERAKDILVPS-IEPKSKHCVFQGDLLLFSCAGAHPMHQRICPC 725


>gi|417404503|gb|JAA49000.1| Putative secreted protein precursor [Desmodus rotundus]
          Length = 771

 Score = 41.6 bits (96), Expect = 0.20,   Method: Composition-based stats.
 Identities = 30/103 (29%), Positives = 48/103 (46%), Gaps = 15/103 (14%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ +   C  +    LN+ + + KY + C           Q 
Sbjct: 665 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKEKDMLKYEVIC-----------QS 713

Query: 164 AEVADD--APS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D   PS +     C++     + SC G+H   RR+CPC
Sbjct: 714 SELAKDILVPSYDPKNKHCVFQGDLLLFSCAGAHPRHRRICPC 756


>gi|417404303|gb|JAA48911.1| Putative secreted protein precursor [Desmodus rotundus]
          Length = 741

 Score = 41.6 bits (96), Expect = 0.20,   Method: Composition-based stats.
 Identities = 30/103 (29%), Positives = 48/103 (46%), Gaps = 15/103 (14%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ +   C  +    LN+ + + KY + C           Q 
Sbjct: 635 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKEKDMLKYEVIC-----------QS 683

Query: 164 AEVADD--APS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D   PS +     C++     + SC G+H   RR+CPC
Sbjct: 684 SELAKDILVPSYDPKNKHCVFQGDLLLFSCAGAHPRHRRICPC 726


>gi|412993097|emb|CCO16630.1| predicted protein [Bathycoccus prasinos]
          Length = 152

 Score = 41.6 bits (96), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 39/91 (42%), Gaps = 9/91 (9%)

Query: 5   ERREILLGCNL-----LSQCCNSYEYCVSCCVNPARTLKEQVLN---VKIAKPTTAGTYG 56
           E+R     C+L       +CC +YE CVSCC N   +  E       +   +P T G + 
Sbjct: 45  EKRYGCSACDLAKGGETEKCCGTYESCVSCCQNAKVSTAETDRTREPIGRMQPQT-GYFS 103

Query: 57  SVFDFCAGRCRHNSESVVHENAYLSDFHHCF 87
             F  C  +CR      VHE+ Y  +   CF
Sbjct: 104 DAFSHCKSKCRTQPTVTVHESTYGYEKRFCF 134


>gi|387019379|gb|AFJ51807.1| Alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Crotalus
           adamanteus]
          Length = 744

 Score = 40.8 bits (94), Expect = 0.32,   Method: Composition-based stats.
 Identities = 30/103 (29%), Positives = 47/103 (45%), Gaps = 15/103 (14%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V I   G+SC  VC+ +   C  +    LN+ + + KY + C             
Sbjct: 638 LSALQVKIAETGKSCKQVCQENQLICEPSFFQHLNKDKDLLKYNIQC-----------HT 686

Query: 164 AEVADD--APS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
            E A+D   PS +     C++     + SC GSH + +R+CPC
Sbjct: 687 VESANDILVPSFDEKRKHCVFQGDLLLFSCAGSHLNHKRICPC 729


>gi|187608310|ref|NP_001120456.1| mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [Xenopus
           (Silurana) tropicalis]
 gi|170284640|gb|AAI61239.1| LOC100145551 protein [Xenopus (Silurana) tropicalis]
          Length = 742

 Score = 40.8 bits (94), Expect = 0.33,   Method: Composition-based stats.
 Identities = 30/103 (29%), Positives = 46/103 (44%), Gaps = 15/103 (14%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V I   G+SC  VC+     C  +    LN+   + K+ + C           Q 
Sbjct: 636 LSALQVKIAEPGKSCKQVCQEEQLICEPSYFQHLNKDTDLAKFGVRC-----------QS 684

Query: 164 AEVADD--APS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E A+D   PS +     C+      + SC GSH + +R+CPC
Sbjct: 685 SETANDIVVPSLDEKRNHCVLQGDLLLFSCAGSHSNHKRVCPC 727


>gi|170590022|ref|XP_001899772.1| hypothetical protein [Brugia malayi]
 gi|158592898|gb|EDP31494.1| conserved hypothetical protein [Brugia malayi]
          Length = 295

 Score = 40.8 bits (94), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 48/94 (51%), Gaps = 16/94 (17%)

Query: 11  LGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV------FDFCAG 64
           + C+L + CC+ YE CVSCC+ P    +++ + +K+ + T+      +      F+ C  
Sbjct: 201 VWCDLYTGCCSHYEQCVSCCLKP----EQKNILLKMIESTSGHRLRQILSATDQFELCML 256

Query: 65  RCRHNS-----ESVVHENAYLSD-FHHCFSMPSN 92
           +CR +S     ESV  EN Y S+   +C+    N
Sbjct: 257 KCRTSSNVRAHESVHSENKYKSEKMKYCYKQEPN 290


>gi|196010315|ref|XP_002115022.1| predicted protein [Trichoplax adhaerens]
 gi|190582405|gb|EDV22478.1| predicted protein [Trichoplax adhaerens]
          Length = 1490

 Score = 40.8 bits (94), Expect = 0.34,   Method: Composition-based stats.
 Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 3/60 (5%)

Query: 13  CNLLSQCCNSYEYCVSCCV--NPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNS 70
           CN  ++CC+ YE+CVSCC     A  LKE + N          +  + F+FC  +CR +S
Sbjct: 111 CNS-NRCCDQYEFCVSCCQYRKEALDLKEIIRNSPDGIQQLLTSAKTRFEFCLIQCRTSS 169


>gi|255083216|ref|XP_002504594.1| predicted protein [Micromonas sp. RCC299]
 gi|226519862|gb|ACO65852.1| predicted protein [Micromonas sp. RCC299]
          Length = 366

 Score = 40.8 bits (94), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 16/29 (55%), Positives = 19/29 (65%)

Query: 59  FDFCAGRCRHNSESVVHENAYLSDFHHCF 87
           FD+C  RCR NS    +ENA+  D HHCF
Sbjct: 157 FDYCKHRCRTNSGVTKYENAFQDDRHHCF 185


>gi|326923090|ref|XP_003207774.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A-like [Meleagris
           gallopavo]
          Length = 741

 Score = 40.4 bits (93), Expect = 0.38,   Method: Composition-based stats.
 Identities = 32/102 (31%), Positives = 46/102 (45%), Gaps = 13/102 (12%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + V I   G+SC  VC+ S   C  +    LN+ + + K+      CL         
Sbjct: 635 LSALQVKIAEPGKSCKQVCQESQLICEPSFFQHLNKDKALLKH---NIECLT-------V 684

Query: 165 EVADD--APS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
           E A+D   PS +     C++     + SC GSH   RR+CPC
Sbjct: 685 ESANDILVPSFDGRRKHCVFQGDLLLFSCAGSHPTHRRICPC 726


>gi|73984271|ref|XP_541015.2| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Canis lupus
           familiaris]
          Length = 741

 Score = 40.4 bits (93), Expect = 0.43,   Method: Composition-based stats.
 Identities = 30/104 (28%), Positives = 49/104 (47%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + + +   G+SC  VC+ +   C  +    LN+ + I KY + C           Q 
Sbjct: 635 LSALQMKLAEPGQSCKQVCQENQLICEPSFFQHLNKDKDILKYEVVC-----------QS 683

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D   PS  +P    C++     + SC G+H   +R+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRICPC 726


>gi|118093838|ref|XP_422131.2| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Gallus gallus]
          Length = 741

 Score = 40.4 bits (93), Expect = 0.47,   Method: Composition-based stats.
 Identities = 28/99 (28%), Positives = 44/99 (44%), Gaps = 7/99 (7%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + V +   G+SC  VC+ S   C  +    LN+ + + K+      CL +  A+   
Sbjct: 635 LSALQVKLAEPGKSCKQVCQESQLICEPSFFQHLNKDKALLKH---NIECLTTESANDIL 691

Query: 165 EVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
             + D         C++     + SC GSH   RR+CPC
Sbjct: 692 VPSFDGRRK----HCVFQGDLLLFSCAGSHPTHRRICPC 726


>gi|351707581|gb|EHB10500.1| Alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Heterocephalus
           glaber]
          Length = 741

 Score = 40.0 bits (92), Expect = 0.51,   Method: Composition-based stats.
 Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY + C           Q 
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKVKDLLKYEVIC-----------QS 683

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E A D   PS ++P +  C++     + SC G++   +R+CPC
Sbjct: 684 SEQAKDILVPS-IDPQSKHCVFQGDLLLFSCAGANPRHQRICPC 726


>gi|410968570|ref|XP_003990775.1| PREDICTED: LOW QUALITY PROTEIN: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Felis catus]
          Length = 741

 Score = 40.0 bits (92), Expect = 0.56,   Method: Composition-based stats.
 Identities = 30/104 (28%), Positives = 49/104 (47%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ +   C  +    LN+ + + KY + C           Q 
Sbjct: 635 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKDKDMLKYEVIC-----------QS 683

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D   PS  +P    C++     + SC G+H   +R+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 726


>gi|440795738|gb|ELR16854.1| GNT-I family protein [Acanthamoeba castellanii str. Neff]
          Length = 566

 Score = 40.0 bits (92), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 44/165 (26%), Positives = 62/165 (37%), Gaps = 21/165 (12%)

Query: 49  PTTAGTYGSV------FDFCAGRCRHNSESVVHENA---YLSDFHHCFSMPSNASGAGVT 99
           P     YGS+      +D    R  HN   V+        L+D  HC  +P +       
Sbjct: 401 PYVRENYGSISLSLALWD-TPSRGTHNHSLVLRYKKNTLLLADKRHCAYLPESLRILPHE 459

Query: 100 QLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRG-CLASM 158
           QL  RL   +V     GE C + C   G  C       L  C+ +     C RG C  ++
Sbjct: 460 QL--RLVTSAV-----GEDCHAPCAREGLKCHQGDFDFLRHCDALASRFPCHRGRCQLAV 512

Query: 159 GADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
           G + P  +    P       CL +  +   SC   H+  +RLCPC
Sbjct: 513 GPEMPIFLP-HGPHQNATGRCLVS--EDFASCAAKHKGGQRLCPC 554


>gi|46047861|ref|NP_660110.2| alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Mus musculus]
 gi|68052853|sp|Q8R4G6.1|MGT5A_MOUSE RecName: Full=Alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A; AltName:
           Full=Alpha-mannoside
           beta-1,6-N-acetylglucosaminyltransferase; AltName:
           Full=GlcNAc-T V; Short=GNT-V; AltName: Full=Mannoside
           acetylglucosaminyltransferase 5; AltName:
           Full=N-acetylglucosaminyl-transferase V
 gi|18997007|gb|AAL83249.1|AF474154_1 N-acetylglucosaminyltransferase V [Mus musculus]
 gi|26326405|dbj|BAC26946.1| unnamed protein product [Mus musculus]
 gi|116138347|gb|AAI25534.1| Mannoside acetylglucosaminyltransferase 5 [Mus musculus]
 gi|116138349|gb|AAI25536.1| Mannoside acetylglucosaminyltransferase 5 [Mus musculus]
 gi|148707821|gb|EDL39768.1| mannoside acetylglucosaminyltransferase 5 [Mus musculus]
          Length = 740

 Score = 40.0 bits (92), Expect = 0.60,   Method: Composition-based stats.
 Identities = 31/103 (30%), Positives = 48/103 (46%), Gaps = 15/103 (14%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY   K  C       Q +
Sbjct: 634 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKEKDLLKY---KVTC-------QSS 683

Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           E+  D   PS   P +  C++     + SC G+H   +R+CPC
Sbjct: 684 ELYKDILVPS-FYPKSKHCVFQGDLLLFSCAGAHPTHQRICPC 725


>gi|410906063|ref|XP_003966511.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A-like [Takifugu
           rubripes]
          Length = 741

 Score = 39.7 bits (91), Expect = 0.64,   Method: Composition-based stats.
 Identities = 32/103 (31%), Positives = 43/103 (41%), Gaps = 15/103 (14%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGAD-QP 163
           LS + V +   G SC  VC+     C  +    LN           K   LA  GAD Q 
Sbjct: 635 LSALQVRLAEPGRSCKRVCQEEQLICEPSFFQHLN-----------KDKDLARFGADCQT 683

Query: 164 AEV-ADDAPSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
            E  AD      +PT   C++     + SC G+H   +R+CPC
Sbjct: 684 VESSADTVVPAYSPTRQHCIFQSDLLLFSCAGAHPTLQRICPC 726


>gi|431914235|gb|ELK15493.1| hypothetical protein PAL_GLEAN10010741 [Pteropus alecto]
          Length = 219

 Score = 39.7 bits (91), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 36/77 (46%), Gaps = 16/77 (20%)

Query: 19  CCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNS------ 70
           CC +YE+CVSCC+ P +  L E+ LN   +A           F+ C  +CR +S      
Sbjct: 124 CCGAYEHCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQVRIFD 183

Query: 71  --------ESVVHENAY 79
                   +SV HEN Y
Sbjct: 184 VSRKLEDGQSVQHENTY 200


>gi|298710994|emb|CBJ32301.1| hypothetical protein Esi_0327_0028 [Ectocarpus siliculosus]
          Length = 550

 Score = 39.7 bits (91), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 36/114 (31%), Gaps = 49/114 (42%)

Query: 23  YEYCVSCCVNPARTLKEQVLNVKIA---KPTTAGTYGS---------------------- 57
           YE+CVSCC NP R  + + +    A    P      GS                      
Sbjct: 391 YEFCVSCCQNPQRGEEREAIQASAALSGHPAYRDLLGSGNNDAANGREPPPPRQLRASAA 450

Query: 58  ------------------------VFDFCAGRCRHNSESVVHENAYLSDFHHCF 87
                                    F  CA RCR  S SV HEN+Y S   HCF
Sbjct: 451 AAEAGGVEKNGQQVEAEERDARELAFAHCAFRCRTYSGSVAHENSYRSPLKHCF 504


>gi|395843276|ref|XP_003794419.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Otolemur
           garnettii]
          Length = 741

 Score = 39.3 bits (90), Expect = 0.91,   Method: Composition-based stats.
 Identities = 30/104 (28%), Positives = 48/104 (46%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           L+ + V +   G+SC  VC+ S   C  +    LN+ + + KY + C           Q 
Sbjct: 635 LTALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYEVIC-----------QS 683

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D   PS  +P    C++     + SC G+H    R+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQDDLLLFSCAGAHPRHWRICPC 726


>gi|443721485|gb|ELU10776.1| hypothetical protein CAPTEDRAFT_200210 [Capitella teleta]
          Length = 484

 Score = 39.3 bits (90), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 55/120 (45%), Gaps = 7/120 (5%)

Query: 87  FSMPSNASGAGVTQLEGRLSGISVII-GRQGESCDSVCKSSGQSCVLNKLILLNQ-CEII 144
           F+ P  A  + +  + GRL    +I+ G  G  C  VC    + C +  L LLN+ C ++
Sbjct: 368 FATPHGAQ-SPLRGVIGRLQLDYIIVTGDDGLDCSQVCAGVYKECDVQGLFLLNKGCAVL 426

Query: 145 QKY-MSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
           +   +SCK      +   QP     D P+ L+   C  T      SC G H  ++R+CPC
Sbjct: 427 ELLALSCKVCSEVKVEGSQP--YTGDMPAMLSDGNCC-TGWPHHSSCFGKHVESKRICPC 483


>gi|224056071|ref|XP_002197900.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Taeniopygia
           guttata]
          Length = 741

 Score = 39.3 bits (90), Expect = 0.94,   Method: Composition-based stats.
 Identities = 31/102 (30%), Positives = 46/102 (45%), Gaps = 13/102 (12%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + V I   G+SC  VC+ S   C  +    LN+ + + ++      CL         
Sbjct: 635 LSALQVKIAEPGKSCKQVCQESQLICEPSFFQHLNKDKALLRH---NIECLT-------M 684

Query: 165 EVADD--APS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
           E A+D   PS +     C++     + SC GSH   RR+CPC
Sbjct: 685 ESANDILVPSFDGRRKHCVFQGDLLLFSCAGSHPAHRRICPC 726


>gi|449275163|gb|EMC84106.1| Alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A, partial
           [Columba livia]
          Length = 742

 Score = 39.3 bits (90), Expect = 0.99,   Method: Composition-based stats.
 Identities = 31/102 (30%), Positives = 46/102 (45%), Gaps = 13/102 (12%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + V +   G+SC  VC+ S   C  +    LN+ + + K+      CL         
Sbjct: 636 LSALQVKMAEPGKSCKQVCQESQLICEPSFFQHLNKDKALLKH---NIECLT-------M 685

Query: 165 EVADD--APS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
           E A+D   PS +     C++     + SC GSH   RR+CPC
Sbjct: 686 ESANDILVPSFDGRRKHCVFQGDLLLFSCAGSHPTHRRICPC 727


>gi|12831213|ref|NP_075583.1| alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Rattus
           norvegicus]
 gi|729609|sp|Q08834.1|MGT5A_RAT RecName: Full=Alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A; AltName:
           Full=Alpha-mannoside
           beta-1,6-N-acetylglucosaminyltransferase; AltName:
           Full=GlcNAc-T V; Short=GNT-V; AltName: Full=Mannoside
           acetylglucosaminyltransferase 5; AltName:
           Full=N-acetylglucosaminyl-transferase V
 gi|349091|gb|AAA41665.1| N-acetylglucosaminyltransferase V [Rattus norvegicus]
 gi|161728817|dbj|BAF94223.1| alpha-1,6-mannosyl-glycoprotein
           6-beta-N-acetylglucosaminyltransferase [Rattus
           norvegicus]
          Length = 740

 Score = 38.9 bits (89), Expect = 1.1,   Method: Composition-based stats.
 Identities = 31/103 (30%), Positives = 48/103 (46%), Gaps = 15/103 (14%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY   K  C       Q +
Sbjct: 634 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKEKDLLKY---KVIC-------QSS 683

Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           E+  D   PS   P +  C++     + SC G+H   +R+CPC
Sbjct: 684 ELYKDILVPS-FYPKSKHCVFQGDLLLFSCAGAHPTHQRICPC 725


>gi|161728838|dbj|BAF94243.1| alpha-1,6-mannosyl-glycoprotein
           6-beta-N-acetylglucosaminyltransferase [Rattus
           norvegicus]
          Length = 740

 Score = 38.9 bits (89), Expect = 1.1,   Method: Composition-based stats.
 Identities = 31/103 (30%), Positives = 48/103 (46%), Gaps = 15/103 (14%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY   K  C       Q +
Sbjct: 634 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKEKDLLKY---KVIC-------QSS 683

Query: 165 EVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           E+  D   PS   P +  C++     + SC G+H   +R+CPC
Sbjct: 684 ELYKDILVPS-FYPKSKHCVFQGDLLLFSCAGAHPTHQRICPC 725


>gi|149730542|ref|XP_001489147.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Equus
           caballus]
          Length = 741

 Score = 38.9 bits (89), Expect = 1.1,   Method: Composition-based stats.
 Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 17/104 (16%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V     G+SC  VC+ +   C  +    LN+ + + KY + C           Q 
Sbjct: 635 LSALQVKFAEPGQSCKQVCQENQLICEPSFFQHLNKDKDMLKYEVIC-----------QS 683

Query: 164 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 203
           +E+A D   PS  +P    C++     + SC G+H    R+CPC
Sbjct: 684 SELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHHRICPC 726


>gi|395519453|ref|XP_003763863.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Sarcophilus
           harrisii]
          Length = 743

 Score = 38.9 bits (89), Expect = 1.2,   Method: Composition-based stats.
 Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 9/100 (9%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ +   C  +    LN+ + + KY + C    LA+   D  
Sbjct: 637 LSALQVKLSEPGKSCKQVCQENQLICEPSFFQHLNKDKDVLKYEVVCHTTELAN---DIL 693

Query: 164 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
               DD   +     C++     + SC G+H   +R+CPC
Sbjct: 694 VPSYDDRKKH-----CVFQGDLLLFSCAGAHPKHKRICPC 728


>gi|291221905|ref|XP_002730960.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 200

 Score = 38.9 bits (89), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 26/64 (40%), Positives = 37/64 (57%), Gaps = 4/64 (6%)

Query: 19  CCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVH 75
           CC+ YE+CVSCC+ P +    Q +  + +K      + SV   F+ C  +CR +SESV H
Sbjct: 118 CCSIYEHCVSCCLQPQKQQLLQGVLGRASKAFQI-LFSSVTDHFELCLAKCRTSSESVQH 176

Query: 76  ENAY 79
           EN Y
Sbjct: 177 ENTY 180


>gi|303280994|ref|XP_003059789.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458444|gb|EEH55741.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 274

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 57  SVFDFCAGRCRHNSESVVHENAYLSDFHHCF-SMPSNASGAGVTQLEGR 104
           S F++CA RCR NS    +EN +    HHCF S+ + + GA  T   GR
Sbjct: 204 SSFEYCAHRCRTNSAVTKYENEFQDAKHHCFGSVYAESVGADDTLSPGR 252


>gi|428175838|gb|EKX44726.1| hypothetical protein GUITHDRAFT_109505 [Guillardia theta CCMP2712]
          Length = 551

 Score = 38.1 bits (87), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 15/92 (16%)

Query: 118 SCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQP------AEVADDAP 171
           +CDSVC+S+G+ C  +  + L++C+I +K    +  C  +   D P       E+     
Sbjct: 124 NCDSVCQSAGKVCARDWFVFLDRCDIAKKVFP-EGTCSFAFYPDMPFYDKTTKEIFVSKD 182

Query: 172 SNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
           + L+P  C    +          R   RLCPC
Sbjct: 183 TKLDPPTCDAAPLGG--------RSAERLCPC 206


>gi|156407244|ref|XP_001641454.1| predicted protein [Nematostella vectensis]
 gi|156228593|gb|EDO49391.1| predicted protein [Nematostella vectensis]
          Length = 746

 Score = 38.1 bits (87), Expect = 2.2,   Method: Composition-based stats.
 Identities = 27/99 (27%), Positives = 37/99 (37%), Gaps = 5/99 (5%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           L+ +    G  G SC   C S G  C       +N  E   +       C  +   D P+
Sbjct: 639 LNALQARKGAMGSSCKETCHSLGLVCEPQYFPAINTKE---RMTRSGFPCNTTRVEDMPS 695

Query: 165 EVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
            VA     +  P  CL      + SC  +   T+RLCPC
Sbjct: 696 LVAPGYRDD--PPVCLRQAQNLLFSCTANSPTTKRLCPC 732


>gi|62702324|gb|AAX93248.1| unknown [Homo sapiens]
          Length = 231

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 9/100 (9%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY ++C+   LA     + 
Sbjct: 125 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYKVTCQSSELA-----KD 179

Query: 164 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
             V    P N +   C++     + SC G+H   +R+CPC
Sbjct: 180 ILVPSFDPKNKH---CVFQGDLLLFSCAGAHPRHQRVCPC 216


>gi|113671559|ref|NP_001038776.1| alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Danio rerio]
 gi|108742113|gb|AAI17665.1| Zgc:136939 [Danio rerio]
          Length = 742

 Score = 37.7 bits (86), Expect = 2.7,   Method: Composition-based stats.
 Identities = 26/100 (26%), Positives = 45/100 (45%), Gaps = 9/100 (9%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G+SC  VC+     C  +    LN+ + + +Y + CK         D  
Sbjct: 636 LSSMQVRLAPAGKSCKQVCQEEQLICEPSFFQHLNKDKDLARYGVECKT-------VDSA 688

Query: 164 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
            ++   A S  +   C++     + SC G+H+   R+CPC
Sbjct: 689 NDIVVPAFSE-SSRHCVFQSDLLLFSCAGAHQTLTRICPC 727


>gi|126326120|ref|XP_001363544.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Monodelphis
           domestica]
          Length = 743

 Score = 37.4 bits (85), Expect = 3.1,   Method: Composition-based stats.
 Identities = 27/100 (27%), Positives = 46/100 (46%), Gaps = 9/100 (9%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           L+ + V +   G+SC  VC+ +   C  +    LN+ + + KY + C    LA+   D  
Sbjct: 637 LNALQVKLSEPGKSCKQVCQENQLICEPSFFQHLNKDKDVLKYEVICHTTELAN---DIL 693

Query: 164 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
               DD   +     C++     + SC G+H   +R+CPC
Sbjct: 694 VPSYDDKKKH-----CVFQGDLLLFSCAGAHPKHKRICPC 728


>gi|348520977|ref|XP_003448003.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase B-like
           [Oreochromis niloticus]
          Length = 805

 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 44/105 (41%), Gaps = 19/105 (18%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + +++ ++G+SC   C+S+G  C       +N  E ++             G D   
Sbjct: 699 LSSLRLLVSQEGQSCVEACQSAGFICEPAHFRFINNKEALR-------------GLDVQC 745

Query: 165 EVADDAPSNLNPT------ACLYTRIQSMLSCDGSHRHTRRLCPC 203
           EV D   +++ P        C   R   + SC G     RRLCPC
Sbjct: 746 EVVDSEINHILPAFSVVRQECGLQREPLLFSCAGHSPKYRRLCPC 790


>gi|432849807|ref|XP_004066622.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A-like [Oryzias
           latipes]
          Length = 741

 Score = 37.4 bits (85), Expect = 3.6,   Method: Composition-based stats.
 Identities = 25/99 (25%), Positives = 44/99 (44%), Gaps = 7/99 (7%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + V +   G+SC  VC+     C  +    LN+ + + ++       +A    +  A
Sbjct: 635 LSALQVKLAEPGQSCKQVCQEQQLICEPSFFQHLNKDKDLDRW------GVACQTVESSA 688

Query: 165 EVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
           +    A S  +   C+      + SC G+H+  RR+CPC
Sbjct: 689 DTVVPAYSE-SRKHCVLQSDLLLFSCAGAHQSLRRVCPC 726


>gi|297266872|ref|XP_002799448.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A-like [Macaca
           mulatta]
          Length = 547

 Score = 37.4 bits (85), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 164
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY   K  C       Q +
Sbjct: 441 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 490

Query: 165 EVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 203
           E+A D   PS  +P    C++     + SC G+H   +R+CPC
Sbjct: 491 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 532


>gi|426239231|ref|XP_004013529.1| PREDICTED: LOW QUALITY PROTEIN: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase B [Ovis aries]
          Length = 801

 Score = 37.0 bits (84), Expect = 4.6,   Method: Composition-based stats.
 Identities = 26/88 (29%), Positives = 32/88 (36%), Gaps = 7/88 (7%)

Query: 116 GESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLN 175
           G +C   C   G  C  +    LN  +  QK      G  + M    PA      P    
Sbjct: 706 GRACTDACLDHGLICEPSFFPFLNSQDAFQKLQVPCDGIESEMNHLYPAFAQ---PGQ-- 760

Query: 176 PTACLYTRIQSMLSCDGSHRHTRRLCPC 203
              C   +   + SC GS R  RRLCPC
Sbjct: 761 --ECFLQKEPLLFSCAGSSRKCRRLCPC 786


>gi|348506712|ref|XP_003440902.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A-like
           [Oreochromis niloticus]
          Length = 741

 Score = 36.6 bits (83), Expect = 5.7,   Method: Composition-based stats.
 Identities = 25/100 (25%), Positives = 43/100 (43%), Gaps = 9/100 (9%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 163
           LS + V +   G SC  VC+     C  +    LN+ + + ++ M C+         +  
Sbjct: 635 LSALQVKLAEPGHSCKQVCQEEQLICEPSFFQHLNKDKDLTRFGMDCQT-------VESS 687

Query: 164 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
           A+    A S      C++     + SC G+H   +R+CPC
Sbjct: 688 ADTVVPAYSETR-HHCIFQSDLLLFSCAGAHSTLQRICPC 726


>gi|291233073|ref|XP_002736478.1| PREDICTED: N-acetylglucosaminyltransferase V-like [Saccoglossus
           kowalevskii]
          Length = 561

 Score = 36.6 bits (83), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 34/129 (26%), Positives = 55/129 (42%), Gaps = 19/129 (14%)

Query: 77  NAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLI 136
           NAY+ +   C + P              LS + V I    ESC +VC  +G  C  +   
Sbjct: 435 NAYIENQDFCSANPKQWPP---------LSALKVKISNPKESCKAVCMKNGLICEPSFFK 485

Query: 137 LLNQCEIIQK--YMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSH 194
            LN  E+I++  Y+ CK   ++      P+ + +   SN+    C   + + + SC G  
Sbjct: 486 SLNTQEMIEREGYLECK-SVISVEDIYAPSFIKE---SNI----CHLEKQKLLFSCAGGS 537

Query: 195 RHTRRLCPC 203
               R+CPC
Sbjct: 538 DTRNRVCPC 546


>gi|156397344|ref|XP_001637851.1| predicted protein [Nematostella vectensis]
 gi|156224967|gb|EDO45788.1| predicted protein [Nematostella vectensis]
          Length = 561

 Score = 35.8 bits (81), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 44/101 (43%), Gaps = 11/101 (10%)

Query: 105 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCK-RGCLASMGADQ 162
           LS +  + GR+G+SC   C   G  C  +    LN  + +++  M CK    L ++ A  
Sbjct: 455 LSEMLTVKGREGQSCKDACFQKGLLCEPSFFPQLNSVKQLKRNGMRCKSYKSLETLYA-- 512

Query: 163 PAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 203
           PA  + D        +CL      + SC     H  RLCPC
Sbjct: 513 PASNSTDG-------SCLLQEQTLLFSCVARDSHVTRLCPC 546


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.133    0.424 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,106,558,634
Number of Sequences: 23463169
Number of extensions: 116772032
Number of successful extensions: 219176
Number of sequences better than 100.0: 229
Number of HSP's better than 100.0 without gapping: 114
Number of HSP's successfully gapped in prelim test: 115
Number of HSP's that attempted gapping in prelim test: 218859
Number of HSP's gapped (non-prelim): 243
length of query: 204
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 68
effective length of database: 9,168,204,383
effective search space: 623437898044
effective search space used: 623437898044
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 73 (32.7 bits)